CN112637883B - Federated learning method with robustness to wireless environment change in electric power Internet of things - Google Patents

Federated learning method with robustness to wireless environment change in electric power Internet of things

Info

Publication number
CN112637883B
CN112637883B (granted patent; application number CN202011432782.8A; publication of application CN112637883A)
Authority
CN
China
Prior art keywords
model
local
neural network
global
wireless environment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011432782.8A
Other languages
Chinese (zh)
Other versions
CN112637883A (en
Inventor
付美明
刘庆扬
王祥
那辰星
王学良
李晓霞
汤志颖
汪志亮
李莉华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Information and Telecommunication Co Ltd
Beijing Smartchip Microelectronics Technology Co Ltd
China Gridcom Co Ltd
Shenzhen Zhixin Microelectronics Technology Co Ltd
Original Assignee
State Grid Information and Telecommunication Co Ltd
Beijing Smartchip Microelectronics Technology Co Ltd
China Gridcom Co Ltd
Shenzhen Zhixin Microelectronics Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Information and Telecommunication Co Ltd, Beijing Smartchip Microelectronics Technology Co Ltd, China Gridcom Co Ltd, Shenzhen Zhixin Microelectronics Technology Co Ltd filed Critical State Grid Information and Telecommunication Co Ltd
Priority to CN202011432782.8A priority Critical patent/CN112637883B/en
Publication of CN112637883A publication Critical patent/CN112637883A/en
Application granted granted Critical
Publication of CN112637883B publication Critical patent/CN112637883B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/06Testing, supervising or monitoring using simulated traffic
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention provides a federated learning method in the electric power Internet of Things that is robust to wireless environment changes, comprising the following steps: building a neural network model, initializing its weight parameters, and distributing a data set to the multi-user equipment terminals (UEs); training the neural network model; uploading the trained model parameters to a wireless access point (AP); analyzing the convergence performance of the global FL model by simulating wireless environment factors; and, at the access point AP, adopting a verification mechanism and a scheduling strategy to select the clients that execute federated learning, so as to weaken the influence of the corresponding wireless environment factors on the convergence of the global model. The invention uses a verification mechanism to check the local FL models received by the access point and adopts a scheduling strategy to select the clients executing federated learning, which weakens the influence of wireless factors, so that the federated learning framework is robust to wireless environment changes.

Description

Federated learning method with robustness to wireless environment change in the electric power Internet of Things
Technical Field
The disclosure relates to the technical field of wireless communication networks, and in particular to a federated learning method robust to wireless environment changes in the electric power Internet of Things.
Background
The electric power Internet of Things is an application of the Internet of Things in the smart grid and the result of information and communication technology reaching a certain stage of development. It effectively integrates communication infrastructure resources with electric power system infrastructure resources, improves the informatization level of the electric power system, raises the utilization efficiency of the existing infrastructure of the electric power system, and provides important technical support for power generation, transmission, transformation, distribution and consumption.
Standard machine learning methods require the training data to be concentrated on one machine or in a data center. Due to latency, bandwidth and privacy constraints, transferring user-collected data sets to a data center or cloud to implement a powerful machine learning solution is often not feasible. Moreover, while the data required for artificial intelligence typically spans multiple fields, hard-to-break barriers exist between data sources. In most industries data exists in isolated islands; because of industry competition, privacy and security concerns, and complex administrative procedures, even integrating data between different departments of the same company meets significant resistance. In practice, integrating data scattered across many places and institutions is almost impossible, or the cost required is huge.
Federated Learning (FL) decouples model training from the need to directly access the raw training data. Unlike centralized learning in a data center, FL typically operates in a wireless edge network where the communication medium is resource-constrained and unreliable. The iterative algorithms that FL runs require low-latency, high-throughput connections between the computing units, but an AP typically must link a large number of UEs over a resource-constrained spectrum, so in each round of global aggregation only a limited number of multi-user equipment terminals (UEs) can send their trained weight parameters over unreliable channels. Due to bandwidth limitations, only a portion of the UEs can be scheduled for updates in each iteration, and because of the shared nature of the wireless medium, transmissions are subject to interference and cannot be guaranteed.
Disclosure of Invention
The present disclosure addresses the above problems by providing a federated learning method in the electric power Internet of Things that is robust to wireless environment changes.
In order to solve at least one of the above technical problems, the present disclosure proposes the following technical solutions:
The invention provides a federated learning method robust to wireless environment change in the electric power Internet of Things, comprising the following steps:
step one: building a neural network model for learning, initializing the weight parameters of the neural network model, and distributing a data set to the multi-user equipment terminals (UEs);
step two: performing federated learning through the built neural network model to obtain a trained neural network model;
step three: uploading the parameters of the trained neural network model to a wireless access point (AP);
step four: analyzing the convergence performance of the global FL model at this point by simulating wireless environment factors;
step five: at the wireless access point AP, according to the observed convergence performance, adopting a verification mechanism and a scheduling strategy to select the clients that execute federated learning, so as to weaken the influence of the corresponding wireless environment factors on the convergence of the global model, finally obtaining the federated learning method robust to wireless environment change in the electric power Internet of Things.
In some embodiments, building the neural network model for learning in step one includes:
determining the type of neural network used;
determining the number of layers of the neural network used and the number of nodes in each layer;
and initializing the weight and bias parameters of the whole neural network.
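As a minimal sketch of the initialization in step one (not from the patent: the function name, the example layer sizes and the small uniform initialization range are assumptions for illustration):

```python
import random

def init_mlp_params(layer_sizes, seed=0):
    """Initialize weights and biases for a fully connected network.

    layer_sizes, e.g. [784, 200, 10], gives the node count per layer
    (input, hidden, output). Weights are drawn from a small symmetric
    uniform range and biases start at zero; the patent does not fix an
    initialization scheme, so this choice is illustrative only.
    """
    rng = random.Random(seed)
    params = []
    for fan_in, fan_out in zip(layer_sizes, layer_sizes[1:]):
        w = [[rng.uniform(-0.05, 0.05) for _ in range(fan_out)]
             for _ in range(fan_in)]
        b = [0.0] * fan_out
        params.append((w, b))
    return params
```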
In some embodiments, step two comprises:
determining the parameters of the neural network model and training it on the data set, wherein the parameters of the neural network model include the number of communication rounds, the number of clients K, the client fraction C, the number of local training epochs E, the local mini-batch size B, the test mini-batch size, the learning rate lr and the optimization method;
distributing the original data set to the K clients in either an independent identically distributed (IID) manner or a non-IID manner;
the K clients each learn on their local training data set until the total loss function reaches its minimum, then upload the local neural network parameter model to the wireless access point AP over a wireless link;
after receiving the local model parameters from all the UEs, the wireless access point AP aggregates the K local parameter models into a global average model and transmits it back to all the UEs; the UEs then begin a new round of learning from the updated local FL model, and a trained model is obtained after several communication rounds of learning.
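The round structure described in the steps above can be outlined as follows (a hypothetical sketch, not the patent's implementation; `run_round`, `local_train` and `aggregate` are invented names, and the training and aggregation logic is supplied by the caller):

```python
def run_round(global_model, client_datasets, local_train, aggregate):
    """One federated learning communication round: each client trains
    locally starting from the global model, uploads its result, and the
    AP aggregates the uploads into a new global model."""
    local_models, sizes = [], []
    for data in client_datasets:
        w_k = local_train(global_model, data)  # local learning on the UE
        local_models.append(w_k)               # upload over the wireless link
        sizes.append(len(data))                # n_k, the local data set size
    return aggregate(local_models, sizes)      # AP-side global aggregation
```

For a toy scalar "model", `run_round(0.0, [[1, 1], [3]], lambda w, d: w + sum(d) / len(d), lambda ws, ns: sum(w * n for w, n in zip(ws, ns)) / sum(ns))` performs one full train, upload, aggregate cycle.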
In some embodiments, in the step of distributing the original data set to the K clients in an IID or non-IID manner,
the IID distribution divides the original data set directly into K equal parts, whereas the non-IID distribution first sorts the original data set by label value and then distributes the sorted data to the K clients, i.e., each client's data set is non-uniformly distributed.
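The two distribution modes can be sketched as follows (an illustration assuming samples and labels are plain Python lists; the function names are invented):

```python
import random

def split_iid(samples, k, seed=0):
    """IID split: shuffle the data, then deal it into K shards."""
    rng = random.Random(seed)
    data = list(samples)
    rng.shuffle(data)
    return [data[i::k] for i in range(k)]

def split_non_iid(samples, labels, k):
    """Non-IID split: sort by label first, then cut into K contiguous
    shards, so each client sees only a narrow range of labels."""
    order = sorted(range(len(samples)), key=lambda i: labels[i])
    shard = len(samples) // k
    return [[samples[i] for i in order[j * shard:(j + 1) * shard]]
            for j in range(k)]
```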
In some embodiments, uploading the parameters of the trained neural network model to the wireless access point AP in step three includes:
in the training process of a communication round, each user equipment terminal first trains the local FL model w_k using its own training data;
each user equipment then transmits w_k to the AP over the wireless cellular link;
the AP receives the local FL models of all participating user equipment, updates the global FL model, and transmits the updated global FL model w_{t+1} to all user equipment terminals so that they can optimize their local FL models; the global FL model is obtained from the local FL models of all user equipment participating in learning and can be expressed by the following formula:

w_{t+1} = \sum_{k=1}^{K} \frac{n_k}{n} w_k

where w_{t+1} is the global FL model generated by the wireless access point AP, n_k is the size of user equipment k's local data set, n is the size of the entire data set, k indexes the k-th client, t indexes the t-th communication round, and w_k is the local FL model.
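For flat parameter vectors the aggregation formula above reduces to a weighted average; a minimal sketch (assuming each local model is a list of floats; `fedavg` is an illustrative name, not the patent's code):

```python
def fedavg(local_models, data_sizes):
    """Weighted average of local models: parameter i of the new global
    model is sum_k (n_k / n) * w_k[i], matching the formula in the
    text, with n the total data size."""
    n = sum(data_sizes)
    dim = len(local_models[0])
    return [sum(w[i] * n_k for w, n_k in zip(local_models, data_sizes)) / n
            for i in range(dim)]
```

For example, `fedavg([[1.0, 0.0], [3.0, 2.0]], [1, 3])` weights the second model three times as heavily as the first.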
In some embodiments, the convergence performance of the global FL model at this point is analyzed in step four by simulating wireless environment factors, where the wireless environment factors include bandwidth limitation, inter-cell interference and path loss:
the bandwidth limitation is simulated by limiting the number of clients participating in federated learning;
the inter-cell interference and path loss are simulated by adding errors to the local FL model parameters.
In some embodiments, the step of simulating inter-cell interference and path loss by adding errors to the local FL model parameters includes:
in each communication round, selecting K·C clients (C ∈ (0, 1)) to participate in training;
among the K·C clients, testing respectively the influence on the recognition accuracy of the final aggregated model when a·K·C, b·K·C and c·K·C of them have erroneous model parameters, where a, b, c ∈ (0, 1) and differ from one another;
the way a client's model parameters are corrupted is as follows:
let the model neural network have L layers; select M layers (M < L) whose parameters are to be corrupted, traverse all parameters of those M layers, and corrupt each parameter with probability Q, Q ∈ (0, 1), the error being applied as:
W = W + random.uniform(-1, 1)
where random.uniform(-1, 1) generates a random floating-point number between -1 and 1 and W denotes a parameter of the local FL model.
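The error model above can be sketched as follows (an illustration assuming each layer is a flat list of floats; `inject_errors` and the seeded generator are assumptions made for reproducibility):

```python
import random

def inject_errors(layers, m, q, seed=0):
    """Corrupt the first m of the L layers: every parameter of a
    selected layer is, with probability q, perturbed by
    W = W + uniform(-1, 1), as in the error model above. Returns a new
    list of layers; the input is left untouched."""
    rng = random.Random(seed)
    noisy = []
    for idx, layer in enumerate(layers):
        if idx < m:
            layer = [w + rng.uniform(-1.0, 1.0) if rng.random() < q else w
                     for w in layer]
        noisy.append(list(layer))
    return noisy
```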
In some embodiments, the scheduling policy in step five comprises:
the method comprises the steps that a wireless Access Point (AP) uniformly and randomly selects a plurality of relevant multi-user equipment terminals UEs to update parameters in each communication round, and a special sub-channel is allocated to each selected UE to send training parameters.
In some embodiments, the verification mechanism is used to check for data errors in the local FL models received at the wireless access point AP;
the wireless access point AP does not use a received local FL model that contains data errors to update the global FL model;
the wireless access point AP instead updates the global FL model directly with the remaining correct local FL models.
The beneficial effects of the present disclosure are as follows: the invention uses a verification mechanism to check the local FL models received by the access point and adopts a scheduling strategy to select the clients that execute federated learning, thereby weakening the influence of wireless factors, so that the federated learning framework is robust to wireless environment changes.
In addition, those parts of the technical solutions of the present disclosure that are not specifically described may be implemented by conventional means in the art.
Drawings
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the drawings that are required in the description of the embodiments or the prior art will be briefly described, and it is apparent that the drawings in the following description are some embodiments of the present disclosure, and other drawings may be obtained according to the drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic structural diagram of a federated learning method robust to wireless environment changes in the electric power Internet of Things according to an embodiment of the present disclosure;
fig. 2 is a flowchart of the disclosed federated learning method robust to wireless environment changes in the electric power Internet of Things.
Detailed Description
In order to make the objects, technical solutions and advantages of the present disclosure more apparent, the present disclosure will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are illustrative of some, but not all embodiments of the disclosure and are not intended to limit the disclosure. All other embodiments, which can be made by one of ordinary skill in the art without inventive effort, based on the embodiments in this disclosure are intended to be within the scope of this disclosure.
It should be noted that the terms "comprises" and "comprising," along with any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article or apparatus.
In the present application, UEs denotes the user equipment terminals, i.e., the multi-user equipment terminals; AP denotes the wireless access point; FL denotes federated learning; and a UE, i.e., a piece of user equipment, is also referred to herein as a client.
Example 1:
As shown in Figs. 1-2 of the accompanying drawings, an embodiment of the present application provides a federated learning method robust to wireless environment changes in the electric power Internet of Things, comprising the following steps:
step one: s1, building a neural network model for learning, initializing weight parameters of the neural network model, and distributing a data set to UEs at a multi-user equipment end;
step two: s2, performing federal learning through the built neural network model to obtain a trained neural network model;
step three: s3, uploading parameters of the trained neural network model to a wireless Access Point (AP), and simulating wireless environment factors to analyze convergence performance of the global FL model at the moment;
step four: s4, analyzing the convergence performance of the global FL model at the moment by simulating the wireless environment factors;
step five: s5, selecting a client for executing federal learning at the wireless access point AP by adopting a checking mechanism and a scheduling strategy according to the acquired convergence performance condition to weaken the influence of corresponding wireless environment factors on the global model convergence, and finally obtaining the federal learning method with robustness to wireless environment change in the electric power Internet of things.
As shown in fig. 1: in (a), each UE computes a single update based on its locally stored data; in (b), the access point AP aggregates the updates received from the UEs to build a new global model; in (c), the new model is sent back to the UEs, and the process repeats.
The building of the neural network model for learning in the first step comprises the following steps:
determining the type of neural network used, which can be an MLP or a CNN;
determining the number of layers of the neural network used and the number of nodes in each layer;
and initializing the weight and bias parameters of the whole neural network.
Specifically, the second step includes:
determining the parameters of the neural network model, i.e., selecting a general model and training it on the data set; specifically, determining the number of communication rounds, the number of clients K, the client fraction C, the number of local training epochs E, the local mini-batch size B, the test mini-batch size, the learning rate lr and the optimization method. A communication round refers to one exchange of training and updates between the AP and the associated UEs; the number of clients K is the number of UEs associated with the AP; and the client fraction C is the ratio of the number of UEs participating in federated learning to the number of clients;
distributing the original data set to the K clients in either an IID or a non-IID manner;
the K clients each learn on their local training data set until the total loss function reaches its minimum, then upload the local neural network parameter model to the wireless access point AP over a wireless link;
after receiving the local model parameters from all the UEs, the wireless access point AP aggregates the K local parameter models into a global average model and transmits it to all the UEs; the UEs then begin a new round of learning from the updated local FL model, and a trained model is obtained after several communication rounds of learning.
To illustrate: consider a cellular network comprising an AP and a set of K associated multi-user equipment terminals (UEs) that cooperatively execute an FL algorithm for data analysis and inference. Assume the K associated multi-user UEs are evenly distributed within the Voronoi (Thiessen) polygon of the wireless access point AP, and that in this network a fixed amount of spectrum is divided equally into N radio access channels, where N < K. A generic user equipment k is considered to hold a local data set of n_k sample points. For the MNIST data set, the data set is shuffled and 60000/K label data and 10000/K test data are assigned to each client in turn; the client fraction is C, i.e., the number of user equipment participating in federated learning is m = K x C; the number of local training epochs is local_ep, the local mini-batch size is local_bs, the learning rate is lr, and the number of communication rounds is epochs. The task is to classify handwritten digits using the MNIST data set or to classify color images using the CIFAR-10 data set.
The neural network model is of one of two types, MLP or CNN. The MLP model comprises an input layer, hidden layers and an output layer; the number of hidden layers and of their nodes is chosen as appropriate to the requirements, and the numbers of input-layer and output-layer nodes are determined by the specific data set and classification task, though for the two classification tasks considered here the output layer can have 10 nodes. The CNN model contains convolutional layers activated with the ReLU function, fully connected layers activated with the ReLU function, and one softmax output layer. The number of convolutional layers and their channels, the number of pooling layers and their channels, and the pooling method are likewise chosen as appropriate to the requirements.
Specifically, the step of minimizing the total loss function is as follows. At each AP, the goal is to learn a statistical model from the data residing on the K associated multi-user equipment terminals (UEs): the AP needs to fit a vector w \in R^d that minimizes a specific loss function over the whole data set of all multi-user equipment UEs under its service. Formally, this task can be expressed as:

\min_{w \in \mathbb{R}^d} F(w) = \sum_{k=1}^{K} \frac{n_k}{n} F_k(w), \qquad F_k(w) = \frac{1}{n_k} \sum_{i \in D_k} f(w_i)

This is the multi-client loss minimization at the AP side: the aim is to minimize the sum of the loss functions of all user equipment terminals participating in learning, and it can serve as the termination condition for local model training.

Here n_k = |D_k| denotes the size of user equipment k's local data set, n = \sum_{k=1}^{K} n_k denotes the size of the entire data set, w_t is the global FL model generated by the AP, and f(w_i) is the loss function; different loss functions can be defined for different FL algorithms. K denotes the number of UEs, i.e., the number of UEs associated with the AP.

In particular, the constraint that all local models coincide with the global model ensures that once the FL algorithm converges, all user devices and the AP share the same FL model for the learning task, i.e., the global FL model is downloaded to all clients.
In the FL algorithm, the local FL model w_k of each user k depends on the global model w_t, while the global model w_t depends on the local FL models of all users. How the local FL model w_k is updated depends on the learning algorithm; the global model w_t is updated by the weighted aggregation over the local models given in step three.
During training, each user first trains the local FL model w_k using its training data, then transmits w_k to the AP over the wireless cellular link. Once the AP receives the local FL models of all participating users, it updates the global FL model according to that aggregation and transmits the updated global FL model w_{t+1} to all users so that they can optimize their local FL models. Over time, the AP and the multi-user equipment terminals (UEs) can find their optimal FL models and use them to minimize the loss function.
Preferably, in the step of distributing the original data set to the K clients in an IID or non-IID manner,
the IID distribution divides the original data set directly into K equal parts, whereas the non-IID distribution first sorts the original data set by label value and then distributes the sorted data to the K clients, i.e., each client's data set is non-uniformly distributed.
Preferably, uploading the parameters of the trained neural network model to the wireless access point AP in step three includes:
in the training process of a communication round, each user equipment terminal first trains the local FL model w_k using its own training data;
each user equipment then transmits w_k to the AP over the wireless cellular link;
the AP receives the local FL models of all participating user equipment, updates the global FL model, and transmits the updated global FL model w_{t+1} to all user equipment terminals so that they can optimize their local FL models; the global FL model is obtained from the local FL models of all user equipment participating in learning and can be expressed by the following formula:

w_{t+1} = \sum_{k=1}^{K} \frac{n_k}{n} w_k

where w_{t+1} is the global FL model generated by the wireless access point AP, n_k is the size of user equipment k's local data set, k indexes the k-th client, t indexes the t-th communication round, and w_k is the local FL model.
Preferably, the convergence performance of the global FL model at this point is analyzed in step four by simulating wireless environment factors, where the wireless environment factors include bandwidth limitation, inter-cell interference and path loss:
the bandwidth limitation is simulated by limiting the number of clients participating in federated learning;
the inter-cell interference and path loss are simulated by adding errors to the local FL model parameters.
In particular, regarding the bandwidth limitation, the wireless access point AP should select only a subset of the multi-user equipment UEs to update parameters at any one time, to keep the communication time within an acceptable range. Here the scheduling policy plays a key role in allocating the resource-limited radio channels to suitable multi-user equipment UEs. The invention adopts a random scheduling strategy: in each communication round, the wireless access point AP uniformly at random selects several associated multi-user equipment terminals (UEs) to update parameters, and allocates a dedicated sub-channel to each selected UE for transmitting its training parameters.
The step of simulating inter-cell interference and path loss by adding errors to the local FL model parameters includes:
in each communication round, selecting K·C clients (C ∈ (0, 1)) to participate in training;
among the K·C clients, testing respectively the influence on the recognition accuracy of the final aggregated model when a·K·C, b·K·C and c·K·C of them have erroneous model parameters; specifically, the ratios are user_error_rate_list = [a, b, c], with a, b, c ∈ (0, 1) and mutually distinct.
The way a client's model parameters are corrupted is as follows:
let the model neural network have L layers; select M layers (M < L) whose parameters are to be corrupted, traverse all parameters of those M layers, and corrupt each parameter with probability Q, Q ∈ (0, 1); specifically, layer_error = M/L (M < L) and value_error_prob = Q (Q ∈ (0, 1)), the error being applied as:
W = W + random.uniform(-1, 1)
where random.uniform(-1, 1) generates a random floating-point number between -1 and 1 and W denotes a parameter of the local FL model.
Specifically, regarding inter-cell interference, path loss and channel instability, a verification mechanism is used to check for data errors in the local FL models received at the wireless access point AP. Assume each local FL model w_k is sent as a single packet in the uplink, where C(w_k) = 0 indicates that the local FL model received by the AP contains data errors; otherwise C(w_k) = 1.
In this system, whenever a received local FL model contains errors, the wireless access point AP does not use that erroneous local FL model to update the global FL model;
instead, the wireless access point AP updates the global FL model directly with the remaining correct local FL models.
When C(w_k) = 0, i.e., the local FL model parameters contain errors, the AP discards that erroneous local FL model and updates the global model with the remaining correct local FL models; when C(w_k) = 1, the AP incorporates the local FL model into the global FL model update and proceeds to the next communication round of training. Thus, the global model update formula can be given by:

w_{t+1} = ( Σ_{k=1}^{K} C(w_k) n_k w_k ) / ( Σ_{k=1}^{K} C(w_k) n_k )

This formula models the process by which the verification mechanism at the AP discards erroneous packets, i.e., a weighted global average over the local FL models whose parameters are correct. K denotes the K clients, i.e., the user equipments in the present application.
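Under the same notation, the AP-side check-and-aggregate step might look like this (a minimal sketch; the function name and list-based parameter representation are illustrative assumptions):

```python
def aggregate(local_models, data_sizes, checks):
    """AP-side global update: weighted average over the local FL models
    that passed the verification check,
    w_{t+1} = sum(C_k * n_k * w_k) / sum(C_k * n_k).

    local_models: list of parameter vectors (lists of floats)
    data_sizes:   n_k, local data set size of each client
    checks:       C_k in {0, 1}; 0 marks a packet with data errors
    """
    total = sum(c * n for c, n in zip(checks, data_sizes))
    if total == 0:
        raise ValueError("no error-free local FL model received this round")
    dim = len(local_models[0])
    global_model = [0.0] * dim
    for w, n, c in zip(local_models, data_sizes, checks):
        if c:  # erroneous packets are discarded, never retransmitted
            for j in range(dim):
                global_model[j] += n * w[j] / total
    return global_model
```

Note that a client whose packet fails the check simply contributes nothing; its weight n_k is also removed from the normalizer, exactly as in the update formula above.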
Note that communication and computation alternate constantly during the training phase. In this regard, retransmitting a corrupted packet brings little benefit, because each uplink transmission of a local update is followed by a downlink transmission of the global average, after which the multi-user equipment end UEs refresh their reference parameters and begin solving a new sub-problem with their local data. Therefore, requesting retransmission on error would add random queuing delay and retransmission delay, increasing the learning latency and further degrading the convergence performance of FL.
The principle of the invention is as follows:
the FL system in the present invention optimizes the global model by repeating the following process: (1) the multi-user equipment end UEs perform local computation on their local data to minimize a predefined empirical risk function, i.e., a loss function, and upload the trained weights to the AP; (2) the AP gathers the updates from the multi-user equipment end UEs and consults the FL unit to generate an improved global model; (3) the output of the FL model is redistributed to the multi-user equipment end UEs, which use the global model as a reference for further local training.
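The three-step loop can be sketched end to end (a toy illustration only: `local_train` here is a stand-in one-step update toward the local data mean, not the actual local empirical-risk minimization of the invention, and `check_packet` simulates the uplink packet error):

```python
import random

def local_train(global_model, local_data, lr=0.1):
    # Toy stand-in for local empirical-risk minimization:
    # one gradient step pulling the weights toward the local data mean.
    target = sum(local_data) / len(local_data)
    return [w - lr * (w - target) for w in global_model]

def check_packet(packet_error_rate):
    # C(w_k): 1 if the uplink packet arrives intact, 0 if corrupted.
    return 0 if random.random() < packet_error_rate else 1

def fl_round(global_model, datasets, packet_error_rate=0.2):
    """One communication round: (1) local training at each UE,
    (2) AP-side check and weighted aggregation, (3) redistribution."""
    locals_, sizes, checks = [], [], []
    for data in datasets:
        locals_.append(local_train(global_model, data))      # step (1)
        sizes.append(len(data))
        checks.append(check_packet(packet_error_rate))
    total = sum(c * n for c, n in zip(checks, sizes))
    if total == 0:
        return global_model        # every packet failed; keep the old model
    return [sum(c * n * w[j] for w, n, c in zip(locals_, sizes, checks)) / total
            for j in range(len(global_model))]               # steps (2)-(3)
```

Repeatedly calling `fl_round` plays the role of successive communication rounds; the returned model is what the AP would redistribute to the UEs.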
In the local FL model uploading stage, the wireless environment factors of bandwidth, inter-cell interference, path loss, and instability are simulated; these ultimately affect the globally aggregated model, which is then distributed to the multi-user equipment end UEs and in turn affects their local learning. By adopting a scheduling strategy and performing a checking mechanism at the AP, the constructed federal learning framework is robust to wireless environment changes.
As shown in the figure, according to the simulation results, the loss function value during training keeps decreasing as the number of communication rounds increases, the recognition accuracy obtained by testing the globally aggregated model parameters keeps rising, and a certain fluctuation can be seen in the results. This is a sign that federal learning is proceeding normally: through uninterrupted parameter updates between the AP and the multi-user equipment end UEs, the global FL model derived from the local FL models gradually fits the data, i.e., the federal learning framework is learning.
As shown in fig. 2, after the influence of the wireless factors is simulated, in the first few communication rounds the training loss values and the recognition accuracy obtained by testing the globally aggregated model differ very little across different packet error rates, and are even better than a generic federal learning algorithm, because the tiny perturbations act like an adjustment of the weight matrix parameters and give the model better performance. As the number of communication rounds increases, federal learning with a larger error rate gradually deteriorates or even fails; the more erroneous the parameters, the larger the curve fluctuation amplitude and the greater the impact on aggregated model performance.
Specifically, when the verification mechanism discards erroneous parameter packets and the random scheduling strategy is applied in the model, the performance of the globally aggregated model improves greatly, and both the training loss and the recognition accuracy obtained by testing the globally aggregated model converge along a steady trend.

The larger the error rate, the more clients contain errors, so more local model parameter updates are discarded by the checking mechanism, i.e., fewer local models participate in the global aggregation, while no retransmission or queuing delay is added. Therefore, the larger the error rate, the greater the impact on model convergence performance, i.e., more communication rounds are required to achieve the same recognition accuracy than with a generic federal learning algorithm. Secondly, it can be observed that the larger the error rate, the larger the fluctuation of the three curves, which is caused by the instability of the global FL model when fewer local models participate in the global aggregation. These results also suggest that the more users participate, the better the learning effect tends to be.
Thus, the present invention first constructs a federal learning method over K clients; on this basis, the resource limitation and sharing property of the wireless medium are simulated by limiting the number of clients participating in federal learning and injecting packet errors in the update-and-upload stage of the local federal learning models; then, a scheduling strategy is adopted to select the clients for learning, and erroneous parameter models are discarded at the access point through a checking mechanism, yielding a federal learning framework that is robust to wireless environment changes in the electric power Internet of things. The invention addresses the rapid deterioration of federal learning performance in wireless edge networks caused by the resource limitation of the communication medium and the sharing property of the wireless medium, including the loss of parameter updates from user equipment due to limited communication resources, and the inter-cell interference, average path loss of communication links, large-scale fading, and other unavoidable faults in parameter updating caused by the sharing property of the wireless medium.
It should be noted that: the embodiment sequence of the present disclosure is only for description, and does not represent the advantages and disadvantages of the embodiments. And the foregoing description has described specific embodiments of this specification. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims can be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments. In particular, for embodiments of the apparatus, device and storage medium, the description is relatively simple as it is substantially similar to the method embodiments, with reference to the description of portions of the method embodiments being relevant.
The foregoing description of the preferred embodiments of the invention is not intended to limit the invention to the precise form disclosed, and any such modifications, equivalents, and alternatives falling within the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (5)

1. The federal learning method with robustness to wireless environment change in the electric power Internet of things is characterized by comprising the following steps of:
step one: building a neural network model for learning, initializing weight parameters of the neural network model, and distributing a data set to UEs at a multi-user equipment end;
step two: federal learning is carried out through the built neural network model so as to obtain a trained neural network model;
step three: uploading the parameters of the trained neural network model to a wireless Access Point (AP);
step four: analyzing the convergence performance of the global FL model at the moment by simulating the wireless environment factors;
step five: at the wireless access point AP, selecting the clients that execute federal learning by means of a checking mechanism and a scheduling policy according to the obtained convergence performance, so as to weaken the influence of the corresponding wireless environment factors on the convergence of the global model, finally obtaining a federal learning method robust to wireless environment changes in the electric power Internet of things; the scheduling policy refers to allocating the resource-limited wireless channels to suitable multi-user equipment end UEs: in each communication round, the wireless access point AP uniformly and randomly selects several associated multi-user equipment end UEs for parameter updating, and allocates a dedicated sub-channel to each selected UE for transmitting the training parameters; the checking mechanism is used to check for data errors in the local FL models received at the wireless access point AP; the wireless access point AP does not update the global FL model with a received local FL model containing data errors; the wireless access point AP directly updates the global FL model with the remaining correct local FL models;
step three, uploading the parameters of the trained neural network model to the wireless access point AP includes:
in the training process of a communication round, each user equipment end first trains its local FL model w_k using its own training data;
each user equipment end then transmits w_k to the AP over a wireless cellular link;
the AP receives the local FL models of all participating user equipments, updates the global FL model, and transmits the updated global FL model w_{t+1} to all user equipment ends to optimize their local FL models; the global FL model is obtained from all the local FL models of the user equipments participating in learning, and can be represented by the following formula:

w_{t+1} = ( Σ_{k=1}^{K} n_k w_k ) / ( Σ_{k=1}^{K} n_k )

wherein w_{t+1} is the global FL model generated by the wireless access point AP, n_k represents the size of the k-th user equipment's local data set, k denotes the k-th client, t denotes the t-th communication round, and w_k denotes a local FL model;
analyzing the convergence performance of the global FL model by simulating wireless environment factors including bandwidth limitation, inter-cell interference, and path loss, wherein
the bandwidth limitation in the wireless environment factors is simulated by limiting the number of clients participating in federal learning, and the inter-cell interference and path loss in the wireless environment factors are simulated by adding errors to the local FL model parameters.
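The scheduling step of claim 1, i.e., uniform random selection of K×C clients per communication round, can be sketched as follows (function and parameter names are illustrative assumptions):

```python
import random

def schedule_clients(num_clients, fraction):
    """Uniformly at random select max(1, int(K * C)) clients for the
    current communication round, modeling the limited number of
    dedicated uplink sub-channels available at the AP."""
    m = max(1, int(fraction * num_clients))
    return sorted(random.sample(range(num_clients), m))
```

Each selected index would then be assigned a dedicated sub-channel for uploading its training parameters; unselected clients simply sit out the round.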
2. The federal learning method with robustness to wireless environment variation in the power internet of things according to claim 1, wherein building the neural network model for learning in step one comprises:
determining the type of neural network used;
determining the number of layers of the neural network used and the number of nodes in each layer;
and initializing all weight and bias parameters of the neural network.
3. The federal learning method with robustness to wireless environment changes in the electric power Internet of things according to claim 1, wherein the second step comprises:
determining the parameters of the neural network model and training with the data set based on the model, wherein the parameters of the neural network model comprise the number of communication rounds, the number of clients K, the client fraction C, the number of local training epochs E, the local mini-batch size B, the test mini-batch size, the learning rate lr, and the optimization method;
distributing the original data set to the K clients in an identically distributed (IID) or non-identically distributed (non-IID) manner;
the K clients learn by using the local training data sets until the overall loss function reaches the minimum value, and upload the local parameter model of the neural network to the wireless access point AP through a wireless link;
after receiving the local model parameters from all the UEs, the wireless access point AP aggregates the K local parameter models into a globally averaged model and transmits it to all the UEs; the UEs then start a new round of learning from the updated local FL model, and after several communication rounds a trained model is obtained.
4. The federal learning method with robustness to wireless environment changes in the electric power Internet of things according to claim 3, wherein, in the step of distributing the original data set to the K clients in an identically distributed or non-identically distributed manner,
the identical distribution directly divides the original data set into K equal parts, whereas the non-identical distribution first sorts the original data set by label and then distributes it to the K clients, i.e., the data set of each client is non-identically distributed.
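The two partition schemes of claim 4 can be sketched as follows (a minimal illustration assuming each sample is a (features, label) pair; the contiguous-shard split is one common way to realize the sorted, non-IID assignment):

```python
def split_iid(dataset, k):
    """IID split: deal samples round-robin into K near-equal parts."""
    return [dataset[i::k] for i in range(k)]

def split_non_iid(dataset, k):
    """Non-IID split: sort by label, then hand each client one
    contiguous shard, so each client sees only a few labels."""
    ordered = sorted(dataset, key=lambda sample: sample[1])  # sort by label
    shard = len(ordered) // k
    return [ordered[i * shard:(i + 1) * shard] for i in range(k)]
```

Under the non-IID split each client's shard covers only a narrow slice of the label range, which is what makes local updates diverge from one another.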
5. The federal learning method with robustness to wireless environment changes in the electric power Internet of things according to claim 1, wherein the step of simulating inter-cell interference and path loss in the wireless environment factors by adding errors to the local FL model parameters comprises:
in each communication round, K×C clients are selected from the K clients to participate in training;
among the K×C selected clients, the influence of model parameter errors at a·K·C, b·K·C, and c·K·C of the clients on the recognition accuracy of the final aggregated model is tested respectively, wherein a, b, c ∈ (0, 1) and are mutually distinct;
the error modes of client model parameter errors comprise:
setting the number of model neural network layers to L, selecting M layers (M < L) in which errors occur, traversing all parameters of the M layers, and letting each parameter go wrong with probability Q, Q ∈ (0, 1), wherein an error is applied as:

W = W + random.uniform(-1, 1)

wherein random.uniform(-1, 1) randomly generates a floating-point number between -1 and 1, and W denotes a parameter of the local FL model.
CN202011432782.8A 2020-12-09 2020-12-09 Federal learning method with robustness to wireless environment change in electric power Internet of things Active CN112637883B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011432782.8A CN112637883B (en) 2020-12-09 2020-12-09 Federal learning method with robustness to wireless environment change in electric power Internet of things


Publications (2)

Publication Number Publication Date
CN112637883A CN112637883A (en) 2021-04-09
CN112637883B true CN112637883B (en) 2023-04-28

Family

ID=75309394

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011432782.8A Active CN112637883B (en) 2020-12-09 2020-12-09 Federal learning method with robustness to wireless environment change in electric power Internet of things

Country Status (1)

Country Link
CN (1) CN112637883B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111654027B (en) * 2020-06-15 2022-04-29 江苏方天电力技术有限公司 Power distribution Internet of things intelligent decision method based on reinforcement learning
CN113762533B (en) * 2021-09-07 2023-06-06 北京航空航天大学 Federal learning method with robustness to data pollution in wireless edge network
WO2023096214A1 (en) * 2021-11-25 2023-06-01 엘지전자 주식회사 Method for performing federated learning in wireless communication system, and apparatus therefor
CN114554459B (en) * 2022-01-19 2023-01-06 苏州大学 Internet of vehicles federal learning client selection method assisted by near-end strategy optimization
CN117135693B (en) * 2023-10-27 2024-01-23 四川长虹新网科技有限责任公司 Real-time service distribution method based on federal learning under multi-AP environment
CN117692940B (en) * 2024-02-04 2024-04-26 北京大业桥科技股份有限公司 Microwave system performance detection method based on microwave link

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110493826A (en) * 2019-08-28 2019-11-22 重庆邮电大学 A kind of isomery cloud radio access network resources distribution method based on deeply study
CN111190487A (en) * 2019-12-30 2020-05-22 中国科学院计算技术研究所 Method for establishing data analysis model
CN111754000A (en) * 2020-06-24 2020-10-09 清华大学 Quality-aware edge intelligent federal learning method and system
CN111860581A (en) * 2020-06-11 2020-10-30 南京邮电大学 Federal learning training method based on model dispersion

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11481617B2 (en) * 2019-01-22 2022-10-25 Adobe Inc. Generating trained neural networks with increased robustness against adversarial attacks




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant