CN114189899B

CN114189899B - User equipment selection method based on random aggregation beam forming

Info

Publication number: CN114189899B
Application number: CN202111503756.4A
Authority: CN
Inventors: 刘升恒; 黄永明; 徐春梅; 傅凝宁
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2021-12-10
Filing date: 2021-12-10
Publication date: 2023-03-31
Anticipated expiration: 2041-12-10
Also published as: CN114189899A

Abstract

The invention discloses a user equipment selection method based on random aggregation beam forming, which adopts an air computing technology to improve communication efficiency when a federal learning aggregation model is adopted, and aggregation errors generated by the method have influence on final learning performance. The goal of the problem is to achieve both the goals of minimizing aggregation errors and maximizing the number of user equipments by selecting user equipments and designing an aggregate beamforming vector. Then, a user equipment selection method based on random aggregation beamforming is proposed. When the number of the user equipment is increased, compared with the original algorithm, the algorithm of the invention can obtain lower aggregation error and select more user equipment, thereby obtaining better learning performance.

Description

User equipment selection method based on random aggregation beam forming

Technical Field

The invention belongs to the field of wireless communication, and particularly relates to a user equipment selection method based on random aggregation beam forming.

Background

With the development of wireless communication and the breakthrough of artificial intelligence technology, the more and more intelligent services are undertaken by the edge of the wireless communication network. Usually, people will use traditional centralized training method to accomplish these tasks, but this method will bring high delay and privacy disclosure problems. In recent years, federal learning has attracted more and more researchers' attention as an emerging distributed learning mode. The learning mode does not need to transmit sensitive data when a global model is trained, so privacy disclosure is avoided, and transmission delay is reduced.

However, since the central node and the ue need to communicate frequently, and spectrum resources are very precious for an edge intelligent system, how to improve the communication efficiency becomes a bottleneck for the federal learning development. The traditional communication computation separation mechanism can only perform computation after decoding the signals, so that the efficiency is not high, and therefore, researchers introduce an emerging air computation technology into the framework of federal learning. The technology carries out analog modulation on signals, utilizes a waveform superposition principle, can complete calculation while transmitting, and has proved that under the same conditions, higher communication efficiency can be obtained than a digital modulation scheme based on a communication calculation separation mechanism.

However, due to the fading characteristics of the channel and the influence of noise, the over-the-air computation technique may also bring aggregation errors, and if the aggregation errors are too large, the model training may be negatively affected. On the other hand, selecting more user equipment to participate in model aggregation proves that the learning efficiency can be improved, and the model convergence is accelerated. Therefore, researchers have designed a joint optimization scheme of user equipment selection and aggregation beamforming vectors, which aims to reduce aggregation errors and select more user equipments at the same time, but this method is computationally complex and difficult to implement in practical applications. To solve the problem, a user equipment selection method based on random aggregation beamforming is provided, and the method has low complexity, can be realized in practical application and can ensure performance.

Disclosure of Invention

The invention aims to provide two user equipment selection methods based on random aggregation beamforming, which respectively aim at minimizing mean square error and maximizing the number of selected equipment users.

In order to solve the technical problems, the specific technical scheme of the invention is as follows:

a user equipment selection method based on random aggregation beam forming comprises the following steps:

step 1, constructing a federal learning system model;

step 1.1, in an edge intelligent system, K edge user devices equipped with an antenna and a central node equipped with N antennas, wherein N is less than K; the set of all user equipments is denoted as

Each user device k has its own local data set ≥>

And a global data set is combined from the local data sets>

The user devices together implement an intelligent application;

step 1.2, adopting a federal learning distributed learning framework;

step 2, training the distributed federal learning model constructed in the step 1, improving the communication efficiency by using an air computing technology, and determining an aggregation error generated by adopting the air computing technology;

step 3, constructing two optimization problems according to the aggregation error determined in the step 2, wherein one optimization problem is the problem of minimizing the mean square error under the condition that the selection number of the user equipment is fixed, and the other optimization problem is the problem of maximizing the number of the selectable user equipment under the condition that the selection number of the user equipment is low;

step 4, aiming at the two optimization problems constructed in the step 3, respectively providing a minimum mean square error algorithm based on random aggregation beam forming and a maximum user equipment selection algorithm based on random aggregation beam forming for solving;

the minimum mean square error algorithm based on the random aggregation beamforming comprises the following steps:

step A, determining the number of user equipment to be selected

A random number of samples N _m Each cycle generates an auxiliary variable tmp, tmp _max Is a variable that records the maximum tmp cycled through to the current round, and initializes tmp _max =0; from 1 to N _m Continuously circulating the following three steps B to D, namely totally executing N _m Secondly;

step B, from complex Gaussian distribution

M is obtained by sampling at random once _r Where I is an N × N dimensional identity matrix, and for m _r Perform normalization m _r ＝m _r /||m _r ||；

Step C, calculating equivalent channel power for each user equipment k

Then, all equivalent channel powers are arranged in a descending order, and the S-th value is selected and recorded as tmp;

step D, if tmp > tmp _max Let tmp _max = tmp, and then the subset of user equipments participating in the update to be selected in the round is determined as

And m = m _r ；

E, after the circulation execution is finished, obtaining the required user equipment subset participating in the update

And a beamforming vector m for the center node;

the maximized user equipment selection algorithm based on random aggregation beamforming comprises the following steps:

step a, firstly determining a threshold value for limiting MSE

Number of random samplings N _m (ii) a And initializing a selectable maximum subset of user devices to an empty set, i.e. </er>

From 1 to N _m Continuously circulating the four steps from step b to step e, namely circulating N _m Secondly;

step b, from complex Gaussian distribution

In random sampling oneThen m is obtained _r Where I is an N × N dimensional identity matrix, and for m _r Normalization is carried out, i.e. m _r ＝m _r /||m _r ||；

Step c, calculating for each user equipment k

Step d, determining the subset of the user equipment participating in the updating as

Step e, if

Then it is asserted>

Step f, obtaining the selectable maximum user equipment subset after the execution of the circulation process is finished

And an optimal beamforming vector m for the center node.

Further, the step 1.2 of adopting a federal learning distributed learning framework includes the following steps:

step 1.2.1, selecting a subset among all user equipments

To participate in the training of the round; />

Step 1.2.2, the selected user equipment updates the local model parameters for E times according to the local data set of the user equipment, and then uploads the parameters to the central node;

and step 1.2.3, the central node updates the global model according to the received parameters.

Further, in step 1.2.2, the specific update formula of the local model parameters of the kth ue in the nth round is:

wherein the gradient is accumulated

Is expressed as->

And->

μ ⁱ Is the learning rate of step i; />

Is a mean loss function, wherein>

Is in a sample>

The loss value on the jth data of (1);

let each user equipment have a local data set of the same size, then in the nth pass the global model parameters of the central node can be obtained by:

further, the specific steps of step 2 are: in the nth round, the symbol matrix transmitted by the user equipment k is defined as

And normalized by the unit variance, i.e. < >>

Wherein I represents an identity matrix;

and g _k Is abbreviated as s _k ，w _k And g _k (ii) a Representing the selected set of user devices as ≧>

The ideal signal obtained by the central node is represented as:

s _k after analog modulation, the transmission coefficient b _k Performing precoding, b _k Represents the transmission power of user equipment k, and its phase can be used to help the central node align the received signal; these signals are then superimposed in the air and multiplied by the beamforming vector of the central node; finally, amplifying by a scale factor eta; due to fading and noise in the wireless channel, the signal y actually received by the central node is represented as:

wherein h is _k Is a channel vector from the user equipment k to the central node, subject to a complex Gaussian distribution of unity power, i.e.

The channels are subject to independent equal distribution; the vector n is additive white Gaussian noise, i.e. < >>

m ^H Represents the conjugate transpose of the beamforming vector, and therefore the aggregate error produced by the over-the-air computation technique is represented as:

order to

Wherein a is _k Is the degree of deviation of the kth user equipment signal, the mean squared error is expressed as: />

The first item

Is an error caused by fading, the second term η m | | non-woven phosphor ² σ ² Is an error caused by noise;

according to equation (6), the following conditions are guaranteed to eliminate the fading related error:

finally, the mean square error is written as follows:

further, the optimization problem of minimizing the mean square error specifically includes:

making the number of user equipments selected per round fixed

Wherein->

Is the user device subset->

The number of the elements in the optimization problem is determined by the scaling factor eta and the transmission coefficient b _k Subscriber device subset->

And a beamforming vector m; since these variables are independent of the noise n, the optimization goal need only be reduced to η | | | m | | survival ² The whole optimization problem can be expressed as:

transmission coefficient b _k The design is as follows:

the scale factor η is designed as:

where P is the maximum allowed transmit power of each ue, and equation (11) is replaced by equation (9 a), since m exists in the numerator and denominator, the optimization problem is equivalent to:

s.t.||m||＝1 (12b)

further, the optimization problem of maximizing the number of user equipments specifically includes: setting a MSE threshold as

The objective of the optimization problem is to maximize the number of ues under this constraint; we redefine a ≥ per user device>

Where m | =1, the final MSE is defined as

The transmission coefficient b is set to be the same as in the equations (10) and (11) _k And the scale factor eta is taken as an optimal value, and the whole optimization problem is expressed as follows:

||m||＝1 (13c)。

the two user equipment selection methods based on the random aggregation beam forming have the following advantages:

1. the MSE minimizing algorithm based on the random aggregation beamforming adopts a random sampling mode to replace iterative computation to approach an optimal solution, so that the complexity of the algorithm is greatly reduced, the method can be realized in an actual application scene, and the performance is also ensured.

2. The algorithm for maximizing the number of the user equipment based on the random aggregation beamforming can select the equipment participating in updating as much as possible under the condition of ensuring that the MSE does not exceed a certain threshold value, so that the diversity of the equipment is increased, and the prediction capability of the model is improved.

Drawings

FIG. 1 is a schematic diagram of a Federal learning System model of the present invention;

FIG. 2 is a schematic diagram of pseudo code of the mean square error minimization algorithm based on random aggregation beamforming according to the present invention;

FIG. 3 is a pseudo code diagram of a UE number maximization algorithm based on random aggregation beamforming according to the present invention;

FIG. 4 (a) is an algorithm, DC algorithm and

comparing MSE results of the algorithm with a graph;

FIG. 4 (b) shows different random sampling times N according to the present invention _m A schematic diagram of the influence on the MSE result of the algorithm of the invention;

FIG. 5 (a) shows the algorithm, DC algorithm and DC algorithm of the present invention for different total numbers of user equipments

A comparison schematic diagram when the total number of the user equipment selected by the algorithm is 50;

FIG. 5 (b) shows the algorithm, DC algorithm and DC algorithm of the present invention for different total numbers of user equipments

A comparison schematic diagram when the total number of the user equipment selected by the algorithm is 100;

FIG. 5 (c) shows the algorithm, DC algorithm and DC algorithm of the present invention for different total numbers of user equipments

Comparison of total number of user equipment selected by algorithm to 150A schematic diagram;

FIG. 6 (a) is a schematic diagram showing the effect of mean square error on MNIST10 data set on test accuracy;

FIG. 6 (b) is a schematic diagram showing the effect of mean square error on CIFAR10 data set on test accuracy;

fig. 7 (a) is a schematic diagram of the impact of user equipment count on MNIST10 data set on test accuracy;

FIG. 7 (b) is a diagram illustrating the effect of the number of user equipments on the accuracy of the test on the CIFAR10 data set.

Detailed Description

For better understanding of the purpose, structure and function of the present invention, a method for selecting a ue based on random aggregation beamforming according to the present invention is described in further detail below with reference to the accompanying drawings.

The invention specifically comprises the following steps:

step 1, model construction

Step 1.1, the system model is as shown in fig. 1, in an edge intelligent system, there are K edge user equipments equipped with one antenna and a center node equipped with N antennas, where N is much smaller than K. The set of all user equipments is denoted as

Each user device k has its own local data set ≥>

And a global data set is combined from the local data sets>

These user devices together implement an intelligent application. In general, the goal of a machine learning task is to find a set of model parameters w ^o So that the loss function->

Minimum, wherein +>

Is a parameter of the model, wherein>

Is a D-dimensional real vector space.

Step 1.2, adopting a distributed learning framework of federal learning

Each round of federal learning can be divided into three phases: 1) Selecting a subset of all user equipments

To participate in the training of the round; 2) The selected user equipment updates local model parameters for E times according to the local data set of the user equipment, and then uploads the parameters to the central node; 3) The central node updates the global model based on the received parameters.

The specific update formula of the local model parameters of the kth ue in the nth round is as follows:

wherein the gradient is accumulated

Can be expressed as->

And->

μ ⁱ Is the learning rate of the ith step. />

Is a mean loss function, wherein &>

Is in the sample->

The loss value on the jth data of (1). Assuming that each user equipment has a local data set of the same size, then in the nth pass, the global model parameters of the central node can be obtained by:

and 2, training the distributed federal learning model constructed in the step 1, improving the communication efficiency by using an air computing technology, and determining an aggregation error generated by adopting the air computing technology.

In the nth round, the symbol matrix transmitted by user equipment k may be defined as

And assumes normalization with unit variance, i.e. < >>

Where I is the identity matrix. For convenience of presentation, be>

And g _k The d-th element of (a) can be abbreviated as s _k ，w _k And g _k . Representing a selected set of user devices as ÷ based>

The desired signal for the central node can be expressed as:

due to the use of over-the-air computing techniques, s _k After analog modulation, the transmission coefficient b _k To carry out pre-preparationCode, b _k Represents the transmission power of the user equipment k, whose phase can be used to help the central node align the received signal; these signals are then superimposed in the air and multiplied by the beamforming vector of the central node; finally, the amplification is carried out through a scaling factor eta. Due to fading and noise in the wireless channel, the signal y actually received by the central node can be expressed as:

wherein h is _k Is a channel vector from the user equipment k to the central node, assumed to be a complex gaussian distribution with unity power, i.e.

This also assumes that the channels follow independent co-distributions. The vector n being additive white Gaussian noise, i.e.

m ^H Is a conjugate transpose of the beamforming vector. Thus, the aggregate error produced by the over-the-air computation technique can be expressed as:

order to

The Mean Square Error (MSE) may be expressed as:

wherein a is _k Is the degree of deviation from the kth user equipment signal,

is to the | | e | | non-conducting phosphor ² The mathematical expectation is obtained.First item

To reduce MSE, we can eliminate errors due to fading while minimizing errors due to noise. Thus, although the resulting MSE is not optimal, if the error due to fading is dominant and the central node is equipped with multiple antennas, the resulting MSE is very close to the minimum. According to equation (6), the following conditions must be ensured for eliminating fading related errors:

finally, the MSE can be written as follows:

and 3, constructing two optimization problems according to the aggregation error determined in the step 2, wherein one optimization problem is the problem of minimizing the mean square error under the condition that the selection number of the user equipment is fixed, and the other optimization problem is the problem of maximizing the number of the selectable user equipment under the condition that the mean square error is ensured to be at a low level.

The minimization of the mean square error optimization problem specifically includes the following: to ensure the learning performance of the model, it is necessary to reduce the value of MSE as much as possible, assuming that the number of ues selected per round is fixed

Wherein->

Is the user device subset->

Number of elements in (1). The decision variables of the optimization problem then include the scaling factor η, the transmission coefficient b _k Subscriber device subset->

And a beamforming vector m. Since these variables are independent of noise n, the optimization goal need only be reduced to η | | | m | | computation ² . The whole optimization problem can be expressed as:

/>

according to the existing literature, the transmission coefficient b _k Can be directly designed as follows:

then, the scale factor η can be designed as:

s.t.||m||＝1 (12b)

the optimization problem of maximizing the number of user equipments specifically includes the following: in a real-world situation, if the value of MSE is small, the aggregate error due to the over-the-air computation can be used as a regularization tool to prevent overfitting of the model. In case it is guaranteed that the value of MSE is maintained at a low level, as many user equipments as possible may be selected, which may improve the learning efficiency. Therefore, we set a threshold for MSE to be noted

The goal of the problem is to maximize the number of user equipments under this constraint. We redefine a @ per user device>

Where m | =1, MSE _k Representing the mean square error of the kth user equipment, the final MSE being defined as ≥>

Transmission coefficient b is measured as above _k And the scale factor η takes an optimum value. The whole optimization problem can be expressed as:

||m||＝1 (13c)

random aggregation beamforming scheme:

the final learning performance of the Federal learning model established by the invention depends on the relevant hyper-parameters of machine learning, such as data quantity, data distribution, equipment computing capacity and the like, and relevant parameters under communication transmission, such as the number of user equipment participating in updating

Transmission coefficient b _k A beamforming vector m for the center node, a scaling factor η, etc. The user equipments considered in this patent assume the same data volume, data distribution and computational power, but only differences in communication transmission related parameters exist. According to the literature, we refer to the transmission coefficient b _k And the scaling factor eta is designed to an optimal value, and then the beamforming vector m and the user device selection scheme->

And (5) performing joint optimization.

For the problem of minimizing Mean Square Error (MSE), the algorithm for minimizing MSE based on random aggregation beamforming specifically includes the following steps, and the pseudo code of the algorithm is shown in fig. 2:

(1) Determining number of user equipments per round selection

A random number of samples N _m And initializing two auxiliary variables, tmp _max And =0. From 1 to N _m Continuously circulating the following three steps (2) to (4), namely executing N in total _m Next, the process is carried out.

(2) From complex Gaussian distribution

M is obtained by sampling at random once _r Where I is an N identity matrix. And to m _r Perform normalization m _r ＝m _r /||m _r ||。

(3) For each user equipment k, calculating equivalent channel power

Then all the equivalent channel powers are sorted in descending order, and the S-th value is selected and recorded as tmp.

(4) If tmp > tmp _max Let tmp _max = tmp, and then the subset of user equipments participating in the update to be selected in the round can be determined as

And m = m _r 。

(5) After the loop execution is completed, the required user equipment subset participating in the update can be obtained

And a beamforming vector m for the center node.

For the problem of maximizing the number of user equipments, the algorithm for maximizing the number of user equipments based on random aggregation beamforming specifically includes the following steps, and the pseudo code of the algorithm is shown in fig. 3:

(1) First, a threshold value limiting MSE is determined

A random number of samples N _m . And initializing the selectable maximum subset of user devices to an empty set, i.e. < >>

From 1 to N _m Continuously cycling through the four steps (2) to (5), namely, cycling N _m Next, the process is carried out.

(2) From complex Gaussian distribution

M is obtained by sampling at random once _r Where I is an N identity matrix. And to m _r Normalization is carried out, i.e. m _r ＝m _r /||m _r ||。

(3) For each user equipment k, calculating

(4) It may then be determined that a subset of the user devices participating in the update are

(5) If it is not

Then it is asserted>

(6) After the loop process is completed, the selectable maximum user equipment subset can be obtained

And an optimal beamforming vector m for the center node.

To verify the performance advantage of the algorithm, an example flow of the invention is given below.

1. Experimental parameter settings

According to the research of practical situation, we set the maximum transmission power P allowed by each ue to 0dB. In order to prove the effectiveness of the method, several most advanced algorithms are selected for comparison. For the problem of minimizing MSE, a comparison algorithm is an iterative user equipment selection scheme, and the method is to optimize a beam forming vector by using a convex difference function method, select a user equipment subset by the size of equivalent channel power, and continuously iterate until convergence; the second algorithm for comparison is called random ue selection, which also uses the convex difference function to optimize the beamforming vector, but the subset of ues is randomly selectedIn (1). For the problem of maximizing the number of the user equipment selections, a comparative algorithm is a DC algorithm, sparsity is introduced by using a convex difference representation, and then an optimal beam forming vector is obtained; the second algorithm for comparison is

Algorithm using l ₁ The norm introduces sparsity and then a beam forming vector is obtained by using an SDR method.

2. The effect of the algorithm of the present invention in reducing mean square error

To fully illustrate the effectiveness of the algorithm of the present invention, we stipulate the number of ues participating in updating per round S =10 and the number of random sampling times N of the beamforming vector under the experimental condition _m =1, the total number of user equipments in the system tests two cases of K =1000 and K =10000, and the number of antennas of the central node is gradually increased from 2 to 12. The results of the test are shown in fig. 4 (a), and it can be seen that the present algorithm is significantly superior to the random ue selection scheme. In addition, although the MSE obtained by the algorithm is slightly higher than the iterative ue selection scheme, the computational complexity of the algorithm is significantly less than that of the iterative method, and the gap between them is smaller as the K value increases.

After that we changed N _m Gradually increased from 1 to 100, and the test results are shown in fig. 4 (b). It can be seen from the figure that the larger the sampling times of the beamforming vector is, the smaller the value of MSE obtained by the algorithm is, and when N is _m The algorithm can even exceed the iterative user equipment selection scheme when the number N of the antennas of the central node is large.

3. The algorithm of the present invention increases the number of user equipments

In this experimental environment, we will

Is limited to [ -6, + 6)]Between dB, the number of antennas at the central node is 4. We consider three experimental scenarios in which the total number of user equipments K is 50, 100 and 150, respectively. Is differentThe experimental results of the algorithm under the three scenes are shown in fig. 5, and it can be seen that under the same conditions, the algorithm can be obviously better than that of the algorithm

The algorithm selects more user equipments. And with N _m And the performance of the algorithm is improved continuously due to the increase of K, and although the performance of the algorithm is comparable to that of the DC algorithm when K =50, the performance of the algorithm is obviously better than that of the DC algorithm when K = 150.

4. Learning performance

We also tested how much the increasing the number of ues can have an impact on the final machine learning task, and in this experiment we selected two most classical machine learning datasets, MNIST10 and CIFAR10, respectively, and assumed that the datasets on each ue are not subject to independent and same distribution. The MNIST10 data set for which we used a multi-layer perceptron neural network contains ten-digit black and white handwritten digital pictures from 0 to 9 digits. CIFAR10 contains color pictures of class 10 objects and is therefore more difficult to train than MNIST10, so we use the ResNet18 neural network for this data set. To conveniently represent the impact of aggregation error on learning performance, we model the impact of aggregation error as a model retransmission probability p, expressed as p =1-exp (-aMSE/σ) ² ) Wherein a is set to 1.

The total number of the user equipments in the experiment is set to K =100, 10 user equipments are selected to participate in the update in each round, and the number of the antennas of the central node is 4. MSE/sigma derived from the present algorithm, iterative and random UE selection schemes ² 0.3221,0.3410 and 1.3531, respectively. As shown in fig. 6, the present algorithm and iterative ue selection scheme have comparable training results, while the random ue selection scheme is slightly lower than the first two, whereby a reduced MSE/σ can be found ² The classification capability of the model can be improved to a certain extent.

Then we set the MSE threshold to MSE/σ ² = 2dB. In this case, the present algorithm (N) _m Case of = 1000), DC algorithm and

the algorithm yields maximum numbers of ues of 30, 17 and 3, respectively. FIG. 7 illustrates the training effect of the three methods on MNIST10 and CIFAR10, which performs slightly better than the DC algorithm because the algorithm can select slightly more user devices than the DC algorithm and ^ and/or ^ based on the user device>

The algorithm performs much less than the first two. />

It is to be understood that the present invention has been described with reference to certain embodiments, and that various changes in the features and embodiments, or equivalent substitutions may be made therein by those skilled in the art without departing from the spirit and scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed, but that the invention will include all embodiments falling within the scope of the appended claims.

Claims

1. A method for selecting user equipment based on Random Aggregation Beamforming (RABF), comprising the steps of:

step 1, constructing a federal learning system model;

Each user device k has its own local data set ≥>

And a global data set is combined from the local data sets>

The user devices together implement an intelligent application;

step 1.2, adopting a federal learning distributed learning framework;

step 3, constructing two optimization problems according to the aggregation error determined in the step 2, wherein one optimization problem is the problem of minimizing the mean square error under the condition that the selection number of the user equipment is fixed, and the other optimization problem is the problem of maximizing the number of the selectable user equipment under the condition that the mean square error is ensured to be at a low level;

step A, determining the number of user equipment to be selected

A random number of samples N _m Each cycle generates an auxiliary variable tmp, tmp _max Is a variable that records the maximum tmp cycled through to the current round, and initializes tmp _max =0; from 1 to N _m Continuously circulating the following three steps B to D, namely totally executing N _m Secondly; wherein +>

Representing the number of elements in S;

step B, from complex Gaussian distribution

Obtaining m by sampling at random _r Where I is an N × N dimensional identity matrix, and for m _r Perform normalization m _r ＝m _r /||m _r ||；m _r Slave complex gaussian distribution->

The physical meaning of the result obtained by sampling once at random is a beam forming vector generated at random;

step C, calculating equivalent channel power for each user equipment k

Then, all equivalent channel powers are arranged in a descending order, and the S-th value is selected and recorded as tmp; h is _k Is the channel vector between the kth user and the central node;

step D, if tmp>tmp _max Let tmp _max = tmp, then the subset of user equipments participating in the update to be selected for the round is determined as

And m = m _r ；

Step E, obtaining the required user equipment subset participating in updating after the circulation execution is finished

And a beamforming vector m for the center node;

step a, firstly determining a threshold value for limiting MSE

step b, from complex Gaussian distribution

M is obtained by sampling at random once _r Where I is an N × N dimensional identity matrix, and for m _r Normalization is carried out, i.e. m _r ＝m _r /||m _r ||；/>

Step c, calculating for each user equipment k

P is the upper limit value of the signal transmission power of each user equipment;

Step e, if

Then it is asserted>

m＝m _r ；

And an optimal beamforming vector m for the center node.

2. The method of claim 1, wherein the step 1.2 of employing a federated learning distributed learning framework comprises the steps of:

step 1.2.1, selecting a subset of all user equipments

To participate in the training of the round;

3. The method of claim 2, wherein the specific update formula of the local model parameters of the kth ue in step 1.2.2 in the nth round is as follows:

in which the gradient is accumulated

Is expressed as->

And/or>

μ ⁱ Is the learning rate of step i; />

Is a function of the average loss of the signal,wherein +>

Is in the sample->

The loss value on the jth data of (1); />

Representing the local model parameters of the kth user equipment in the nth round, wherein the nth round is nE because the local user equipment updates the model parameters for E times in each round; w is a ^(n-1)E Global model parameters representing the (n-1) th round; />

Representing the model parameters updated by the (i + 1) th local user equipment in the nth round; />

Representing the model parameters updated by the ith local user equipment in the nth round;

let each user equipment have a local data set of the same size, then in the nth round, the global model parameters of the central node can be obtained by:

wherein w ^nE Representing the global model parameters for the nth pass.

4. The method of claim 3, wherein the step 2 comprises the following steps: in the nth round, the symbol matrix transmitted by user equipment k is defined as

And normalized by the unit variance, i.e. < >>

Wherein I represents an identity matrix; />

And g _k Is abbreviated as s _k ，w _k And g _k (ii) a Representing a selected set of user devices as ÷ based>

The ideal signal obtained by the central node is represented as:

s _k after analog modulation, the transmission coefficient b _k Performing precoding, b _k Represents the transmission power of the user equipment k, whose phase can be used to help the central node align the received signal; these signals are then superimposed in the air and multiplied by the beamforming vector of the central node; finally, amplifying by a scale factor eta; due to fading and noise in the wireless channel, the signal y actually received by the central node is represented as:

order to

Wherein a is _k Is the degree of deviation of the kth ue signal, the mean squared error is expressed as:

the first item of

according to equation (6), the following conditions are ensured for eliminating fading related errors:

finally, the mean square error is written as follows:

5. the method of claim 4, wherein the optimization problem of minimizing mean square error specifically comprises:

making the number of user equipments selected per round fixed

Wherein->

Is the user device subset->

transmission coefficient b _k The design is as follows:

/>

the scale factor η is designed as:

s.t.||m||＝1 (12b)

6. the method of claim 5, wherein the optimization problem of maximizing the number of ues specifically comprises: setting a MSE threshold as

The objective of the optimization problem is to maximize the number of ues under this constraint; we redefine one for each user equipment

Where m | =1, the final MSE is defined as | |>

The transmission coefficient b is expressed as in the equations (10) and (11) _k And the scale factor eta is taken as an optimal value, and the whole optimization problem is expressed as follows:

||m||＝1 (13c)。