CN114513270B

CN114513270B - Heterogeneous wireless network spectrum resource sensing method and system based on federal learning

Info

Publication number: CN114513270B
Application number: CN202210225145.6A
Authority: CN
Inventors: 盛洁; 吴澄; 朱巧明; 李庆洋; 张瑾; 汪一鸣
Original assignee: Suzhou University
Current assignee: Suzhou University
Priority date: 2022-03-07
Filing date: 2022-03-07
Publication date: 2022-12-02
Anticipated expiration: 2042-03-07
Also published as: CN114513270A

Abstract

The invention discloses a heterogeneous wireless network spectrum resource sensing method and system based on federal learning, which comprises the following steps: s1, sending a global model to all base stations in a region to which the global model belongs; s2, selecting partial base stations, and performing random gradient descent training on a local model of the selected base stations by using the global model to obtain a final local model; s3, obtaining a parameter difference value of the final local model and the global model, and updating the global model by federal aggregation; s4, repeating the steps S1-S3 to carry out iterative training on the global model until convergence, and obtaining a final global model; and S5, predicting the current occupation situation of all available spectrum resources in the region by using the parameters of the final global model. The invention greatly reduces the communication overhead and effectively prevents the leakage and illegal tampering of data.

Description

Heterogeneous wireless network spectrum resource sensing method and system based on federal learning

Technical Field

The invention relates to the technical field of communication, in particular to a heterogeneous wireless network spectrum resource sensing method and system based on federal learning.

Background

With the development of 5G communication and the progress of high-speed railways, mobile terminals and Internet of things equipment in high-speed railway transportation will increase explosively. In order to ensure that the quality of service is guaranteed and the communication requirements of the train, the trackside equipment and the passengers are better met, further optimization of the high-speed rail wireless communication resource management strategy is required.

Currently, in the field of railway wireless communication, LTE still faces many challenges. Moreover, the existing high-speed railway spectrum sharing mechanism also has a plurality of problems to be solved. The traditional spectrum sharing mechanism is centralized, but this generates a large amount of communication overhead and brings many security risks, such as single point of failure and denial of service attack. In addition, in the railway wireless communication system, the base stations are erected around the track and are mainly distributed in a chain manner, and the base stations form a chain heterogeneous network. How to realize the integration of spectrum resources in the chained heterogeneous network is a key challenge. In addition, the security and privacy of data are receiving more and more attention, and it is also a difficult point to ensure efficient security while sharing spectrum.

In 1999, joseph Mitola first proposed the concept of cognitive radio, and on this basis, in 2006 Ryan w. The cognitive radio network is a wireless communication network capable of perceiving the surrounding network environment and carrying out analysis and decision according to the perceived content, and can be self-adapted to the external complex dynamic network environment through the cognitive cycle, so that the network performance of end-to-end communication is improved. In the field of high-speed rail wireless communication, GSM-R, LTE-R and 5G-R exist simultaneously for a long time, and the cognitive radio network is of great help to improve the expandability and compatibility of the system. The cognitive radio technology provided by Ashwin Amanna for the first time can be applied to the field of rail transit, and the high-speed rail cognitive radio gets more and more attention and development. Compared with the traditional cognitive radio network, the high-speed rail cognitive radio network has new change. In the spectrum sensing stage, not only the video resource occupation condition of an authorized user needs to be sensed, but also the external track and physical environment condition need to be sensed; in a frequency spectrum decision stage, the method needs to adapt to the parameter change condition in the high-speed rail field and meet a specific optimization target; in the spectrum switching state, the traditional switching method is replaced by channel switching, trans-regional switching and parameter self-adaptive switching; in the spectrum sharing stage, due to the particularity of the high-speed rail wireless communication field, the secondary users generally do not act independently, so that the calculation of the channel capacity is simplified. However, the high-speed rail cognitive radio still has security threat, and only in the spectrum sensing stage, the high-speed rail cognitive radio can have the functions of imitating a master user attack, a terminal-spectrum sensing data tampering attack, eavesdropping on a control channel, a learning attack, a C-SSDF attack and the like.

The existing spectrum sharing technology is centralized, needs to be supported by a third-party organization, and generates massive communication overhead under the condition that a great number of users exist, which brings huge pressure to the network. In addition, there are various security issues with this approach, including single point of failure and denial of service attacks. The scene in the high-speed rail wireless communication system is special, the base stations are distributed in a chain manner and have isomerism, and how to sense and predict frequency spectrum resources in a chain heterogeneous network is a great difficulty. Moreover, the existing high-speed rail cognitive radio technology still needs to be continuously improved, the privacy security problem cannot be thoroughly solved, and the risks of data leakage and illegal tampering can be suffered.

Disclosure of Invention

The invention aims to provide a heterogeneous wireless network spectrum resource sensing method and system based on federal learning, which can greatly reduce communication overhead and effectively prevent data leakage and illegal tampering.

In order to solve the technical problem, the invention provides a heterogeneous wireless network spectrum resource sensing method based on federal learning, which comprises the following steps:

s1, sending a global model to all base stations in a region to which the global model belongs; the method specifically comprises the following steps:

s11, J base stations are arranged in the region where the global model belongs, each base station has a local data set, N samples are arranged in each local data set, and each sample consists of (N, x) ₁ ，x ₂ ，x ₃ ，x ₄ ，x ₅ Y) composition;

where n represents the ID of a sample, x represents the characteristics of the sample, and x ₁ Indicates the area, x, to which the base station belongs ₂ Number, x, indicating the area to which the base station belongs ₃ Indicates the type of the base station, x ₄ Representing the number of the channel in the base station, selecting a time point as a time 0 point, x ₅ Then represents the time t, y represents the label of the sample, and when the value of y is 0, it represents that the channel is in the idle state at the time; when the value of y is 1, the channel is in an occupied state at the moment;

s12, when the e-th global model is iterated, the global model G is processed ^e ＝(w ^e b ^e ) Sending to all base stations in the region, wherein w ^e ＝(w ₁ ，w ₂ ，w ₃ ，w ₄ ，w ₅ ) Representing the weight taken up by each feature of the sample, b ^e Is a scalar, e represents the number of iterations performed by the global model;

s2, selecting partial base stations, and performing random gradient descent training on a local model of the selected base stations by using the global model to obtain a final local model; the method specifically comprises the following steps:

s21, randomly selecting M base stations from the I base stations in the area;

s22, the selected base station m receives the global model G ^e ＝(w ^e b ^e ) For local model

Performing local training, wherein the local training adopts a random gradient descent method to perform multiple iterative training, M is less than or equal to M, Q is less than or equal to Q, and Q represents the iteration times;

each local training process comprises the following steps:

s221, the base station m performs local model

Generating a preliminary prediction value of y for each sample thereof

S222, using sigmoid function to perform

Is compressed to [0,1 ]]In between, the expression is:

wherein p is the probability that the predicted value of y is 1, 0 represents idle, and 1 represents occupied;

the final predicted value

S223, the loss function of the base station m is:

wherein, y _n Representing the true value, p, in the sample n _n Representing predicted values for sample n

Probability of 1, N _m Representing the total number of samples in the sample set;

s224, the local gradient value of the q-th iteration of the base station m is:

wherein the content of the first and second substances,

gradient operation is adopted, after the gradient is obtained, the local model is obtained by updating

S23, obtaining a final local model of the base station m after iteration is finished

S3, obtaining a parameter difference value of the final local model and the global model, and updating the global model by federal aggregation; the method specifically comprises the following steps:

obtaining the parameter difference between the local model and the global model

Performing a global aggregation operation, updating the global model according to the following formula:

wherein λ represents a learning rate;

s4, repeating the steps S1-S3 to carry out iterative training on the global model until convergence, and obtaining a final global model; the loss function of the global model for iterative training is as follows:

the convergence goal is to minimize the loss function:

O(w，b)＝arg min F (8)

obtaining parameters (w, b) of the final global model;

and S5, repeatedly utilizing the formulas (1), (2) and (3) by utilizing the parameters (w, b) of the final global model to predict the current occupation situation of all available spectrum resources in the region.

A heterogeneous wireless network spectrum resource sensing system based on federal learning comprises a central base station, wherein a plurality of base stations are arranged in the area of the central base station, and the central base station and the base stations form a communication link to jointly provide service for users; the central base station carries out spectrum resource sensing by adopting the method for sensing the spectrum resources of the heterogeneous wireless network based on the federal learning.

As a further development of the invention, the base stations are provided with edge mobile computing servers and the central base station is provided with a federal learning server.

As a further refinement of the present invention, the users include primary users and secondary users; the main user is an authorized user of the spectrum resource, and the owned idle spectrum is transacted with the base station of the nearby affiliated area; and the secondary user is an unauthorized user of the frequency spectrum resource and seeks network access service.

The invention has the beneficial effects that: the invention takes the heterogeneous base stations along the railway as the users for federal study, and each central base station as a server: the user database stores data about channel information, and a user only needs to upload trained parameter information to a server without uploading data, so that the communication overhead is greatly reduced, and data leakage and illegal tampering are effectively prevented; the server can predict the occupation conditions of the channels of all base stations in the system where the server is located according to the global model obtained after aggregation, so that spectrum resources are reasonably distributed, and users can share the spectrum while protecting the data privacy of the users; in addition, the federal learning needs a plurality of iterations to be converged, if all the participating users in each round send the training gradient information to the server, the communication overhead is increased, the problem is solved by the user scheduling method, namely, only part of the users are selected in each iteration to carry out local training, and the communication overhead is reduced.

Drawings

FIG. 1 is a schematic representation of a federated learning process according to an embodiment of the present invention;

FIG. 2 is a diagram of a system model according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of a third simulation experiment scenario according to an embodiment of the present invention;

FIG. 4 is a diagram of the results of a third local training session according to an embodiment of the present invention;

FIG. 5 is a diagram of a triple Bandwidth learning accuracy result in accordance with an embodiment of the present invention;

FIG. 6 is a graph of the triple Bangbang learning loss results of the embodiment of the present invention.

Detailed Description

The present invention is further described below in conjunction with the following figures and specific examples so that those skilled in the art may better understand the present invention and practice it, but the examples are not intended to limit the present invention.

The invention provides a heterogeneous wireless network spectrum resource sensing method based on federal learning, which comprises the following steps:

s1, sending the global model to all base stations in the area to which the global model belongs;

s2, selecting partial base stations, and performing random gradient descent training on a local model of the selected base stations by using the global model to obtain a final local model;

s3, obtaining a parameter difference value of the final local model and the global model, and updating the global model by federal aggregation;

s4, repeating the steps S1-S3 to carry out iterative training on the global model until convergence, and obtaining a final global model;

and S5, predicting the current occupation situation of all available spectrum resources in the region by using the parameters of the final global model.

Federal learning is a promising approach to edge intelligence to protect privacy in distributed scenarios. In traditional machine learning, the server would collect all training data, and federal learning largely solved this problem. The user would first download the latest global model information and then train the data locally. In traditional federal learning, a user can train gradient information according to own data during local training, and the data is not required to be uploaded to a server, but only the gradient information is sent to the server. The server will aggregate the gradient information of multiple users to update the global model parameters and send the latest global model to the participating users. The invention takes the heterogeneous base stations along the railway as users and each central base station as a server. The user database stores data related to channel information, and a user only needs to upload trained parameter information to the server without uploading the data, so that the communication overhead is greatly reduced, and the data leakage and illegal tampering are effectively prevented. The server can predict the occupation status of the channels of all base stations in the system where the server is located according to the global model obtained after aggregation, so that spectrum resources are reasonably distributed, and users can share the spectrum while protecting the data privacy of the users. The performance of federal learning is limited due to limited available wireless spectrum resources and insufficient computing power of the users themselves. In addition, federal learning requires multiple iterations to converge, which increases communication overhead if all participating users in each round send their own training gradient information to the server. The invention solves the problem by a user scheduling method, namely only part of users are selected for local training in each iteration, which reduces the communication overhead.

Example one

Based on the above embodiment, assuming that there are J base stations in the area i and numbering the base stations, the base stations J may use a set

And (4) showing. For the

All have a local data set

Each one of which

In which is N _j Samples, each sample representing the occupancy state of a channel of the base station at a time in the past, are represented by (n, x) ₁ ，x ₂ ，x ₃ ，x ₄ ，x ₅ Y) where n represents the ID of the sample, x represents the characteristics of the sample, and x ₁ Indicates the area to which the base station belongs, x ₂ Number, x, indicating the area to which the base station belongs ₃ Indicates the type of the base station, x ₄ Representing the number of the channel in the base station, and selecting a time point as a time point 0, x ₅ Representing the time t. y represents the label of the sample, and when the value of y is 0, the channel is in an idle state at the moment; a value of y of 1 indicates that the channel is occupied at that time.

Such as

It is the data set for base station # 3 in area 2, which is a class II baseAnd (4) a station. It has a total of 160 samples from 0 to 159. Sample 0 indicates that channel 0 is in the occupied state at time 4 and sample 159 indicates that channel 9 is in the idle state at time 8.

Local model training and global model aggregation for federal learning:

in a coverage area of a certain base station group, sensing and predicting of frequency spectrum resources are achieved by using a federal learning algorithm. In the invention, the federated learning algorithm comprises a global model broadcast, a training node selection, a local model training and a global model aggregation in each iteration process.

Suppose that at the e-th algorithm iteration, the server base station (central base station) in the area first maps the global model G ^e To users in the area (including three types of common base stations, i.e., client base stations), where G ^e ＝(w ^e b ^e )，w ^e ＝(w ₁ ，w ₂ ，w ₃ ，w ₄ ，w ₅ ) Representing the weight of each feature of the sample, b ^e Is a scalar. Then, the server base station randomly selects M base stations from the J base stations in the area to serve as a training set of the spectrum resource perception model based on the federal learning, and the remaining (J-M) base stations serve as a test set of the model to be used for verifying the performance of the model. On the basis, the federal learning local model training and the global model aggregation are developed.

Local model training: when the selected base station receives the global model, local training is started, and the process adopts a random gradient descent method for training, which is also a process of multiple iterations. Each local iteration process q will comprise the following process.

User m utilizes the updated local model

Training data of which

When q =0, the ratio of q to q is set to zero,

user m will generate a preliminary prediction of y for each of its samples according to the local model

0 indicates idle and 1 indicates occupied. The specific process is as follows:

wherein, the first and the second end of the pipe are connected with each other,

the value range is not in [0,1 ]]In the method, sigmoid function is required to be used

Is compressed to [0,1 ]]In the meantime. The specific expression is as follows:

where p can be understood as the probability that the predicted value of y is 1, the final predicted value

The loss function for user m can be expressed as:

A probability of 1.

The local gradient values for user m are:

wherein the content of the first and second substances,

is a gradient operation. After obtaining the gradient, the local model is updated

After the iteration is completed, the final local model can be obtained

Will be different value

And sending the data to a server.

And (3) global model aggregation: after receiving the m pieces of parameter information, the server executes global aggregation operation, and updates the global model according to a formula (6):

where λ represents a learning rate.

In the present invention, the global loss function in the global iterative process e can be represented as:

the invention improves the training accuracy rate by the loss function reduction, and the aim is to minimize the loss function, namely:

O(w，b)＝arg min F (8)

the spectrum resource sensing algorithm of the chain heterogeneous wireless network based on the federal learning comprises the following steps:if a sub-user s is in a certain area _req To a nearby central base station c _i Application for access service, c _i It is necessary to know the spectrum information of the base stations in this area and even other areas for s _req The central base station is required to interact with a general base station in the area in real time, and the current occupation situation of all available spectrum resources in the area is sensed and predicted by utilizing the prior spectrum occupation situation. And sensing and predicting the frequency spectrum in the region by adopting a federal learning algorithm. The parameters (w, b) can be obtained based on the formula (8), and then the current occupation situation of all available spectrum resources in the region can be predicted by repeatedly using the formulas (1), (2) and (3). The whole spectrum resource sensing algorithm of the chain heterogeneous wireless network is shown in table 1, and the flow chart is shown in fig. 1.

Table 1:

example two

The embodiment provides a heterogeneous wireless network spectrum resource sensing system based on federal learning, which divides a certain section of railway link type wireless communication network into I areas, and the areas cover the section of network. Communication bodies of 4 types of parameters and spectrum resource management are defined in each area, and comprise Primary Users (PU), secondary Users (SU), base Stations (BS) and Central Base Stations (CBS). With S = { S = _i SU, which refers to an unauthorized user of spectrum resources, i.e., communication devices on the train, and mobile terminals of passengers, seeking network access services. With P = { P _i Define PU, which is an authorized user of spectrum resources, and to obtain revenue, PU will trade the free spectrum owned in hand with nearby BSs. By using

To define the jth BS of the ith zone. Each BS is equipped with an edge mobile computing (MEC) server with certain computing and communication capabilities, forming an uplink with the CBS and a downlink with the PU and SU. With C = { C _i Define CBS in the entire link. Each CBS is equipped with a federal learning server (FL) and has extremely strong computing and communication capabilities. The schematic diagram of the system model is shown in fig. 2, in which the train represents SU and the large base station at the center of the lower row represents CBS in a certain area. The other small grey base stations (with and without background grey boxes) represent three different types of common BSs, which are distributed in chains along the railway. After the PU and the BS complete the transaction, the BS manages the spectrum resources of the transaction, so the PU is not shown in the figure, but the PU is also an important component of the whole system.

The system adopts the spectrum resource sensing method of the heterogeneous wireless network based on the federal learning as the first implementation mode or embodiment to sense the spectrum resource.

EXAMPLE III

In the embodiment, a chain heterogeneous network environment is built on a python3.8 software platform to carry out simulation experiments. An experimental scenario was designed as shown in fig. 3. In the figure, one large circle indicates an area in which a straight solid line indicates a railroad link, o indicates a class I base station, x indicates a class ii base station, a-indicates a class iii base station, and ^ indicates a central server. The data sets are stored in the form of tuples in the database of each base station. The detailed parameters of the three types of base stations are shown in table 2.

Table 2:

local model convergence and spectrum sensing accuracy experiment: as shown in fig. 4, the result is obtained by local training of a class I base station in a global iteration, and it can be seen from the figure that when the number of local iterations increases, the accuracy of predicting channel occupancy continuously increases and rapidly converges, and gradually stabilizes after reaching 0.9 or more, and the cross entropy loss thereof also continuously decreases and tends to stabilize after reaching 0.41, thereby achieving convergence.

The federal learning global aggregation model convergence and perception accuracy experiment comprises: the federally learned simulation parameters for a region are shown in table 3. When one part of base stations are selected in the area for federal learning, the other part of the rest base stations are used as a test set for verifying the accuracy of federal learning prediction.

Table 3:

number of areas	1
		Number of repeated tests	10
Number of global iterations	40
		Number of class I base stations participating in federal learning	12
Number of class II base stations participating in federal learning	12
		Number of class III base stations participating in federal learning	4
Number of base stations selected per global iteration	3/6/12

The simulation results of the federal learning are shown in fig. 5 and fig. 6, in which "m =3", "m =6" and "m =12" in the legend represent the number of users selected at each global iteration of the federal learning.

Fig. 5 shows the average accuracy of the simulation results when the number of nodes participating in the training is different after a plurality of tests, and it is obvious from the figure that the larger the number of nodes is, the faster the convergence speed of the accuracy is, and the fewer the training times required for achieving the convergence are. However, the three converged are almost the same in accuracy, and eventually approach 0.98. The results were the same as expected. Note also that the more nodes that participate in the local training, the more communication overhead in the network will increase, but correspondingly, the less computation overhead will be required by the central base station.

Fig. 6 shows the average loss of simulation results when the number of nodes participating in training is different after multiple experiments. It is obvious from the figure that the larger the number of the participating nodes is, the faster the convergence speed is lost, and the less the training times are required for reaching the convergence. However, the three converged accuracies are almost the same, and eventually approach 0.1.

The above-mentioned embodiments are merely preferred embodiments for fully illustrating the present invention, and the scope of the present invention is not limited thereto. The equivalent substitution or change made by the technical personnel in the technical field on the basis of the invention is all within the protection scope of the invention. The protection scope of the invention is subject to the claims.

Claims

1. A heterogeneous wireless network spectrum resource sensing method based on federal learning is characterized in that: the method comprises the following steps:

s1, sending the global model to all base stations in the area to which the global model belongs; the method specifically comprises the following steps:

s11, J base stations are arranged in the region where the global model belongs, each base station has a local data set, N samples are arranged in each local data set, and each sample is composed of (N, x) ₁ ，x ₂ ，x ₃ ，x ₄ ，x ₅ Y) composition;

where n represents the ID of a sample, x represents the characteristics of the sample, and x ₁ Indicates the area, x, to which the base station belongs ₂ Number, x, indicating the area to which the base station belongs ₃ Indicates the type of the base station, x ₄ Representing the number of the channel in the base station, and selecting a time point as a time point 0, x ₅ Then it represents the time of dayt, y represents the label of the sample, and when the value of y is 0, the channel is in an idle state at the moment; when the value of y is 1, the channel is in an occupied state at the moment;

s12, when the e-th global model is iterated, the global model G is used ^e ＝(W ^e b ^e ) Sending to all base stations in the region, wherein w ^e ＝(w ₁ ，w ₂ ，w ₃ ，w ₄ ，w ₅ ) Representing the weight of each feature of the sample, b ^e Is a scalar, e represents the number of iterations performed by the global model;

s21, randomly selecting M base stations from the J base stations in the area;

Performing local training, wherein the local training adopts a random gradient descent method to perform multiple iterative training, M is less than or equal to M, Q is less than or equal to Q, and Q represents the number of iterations;

each local training process comprises the following steps:

s221, the base station m performs local model

Generating a preliminary prediction of y for each of its samples

S222, using sigmoid function to convert

Is compressed to [0,1 ]]In between, the expression is:

the final predicted value

S223, the loss function of the base station m is as follows:

s224, the local gradient value of the q-th iteration of the base station m is:

is gradient operation, after obtaining the gradient, the update is obtainedLocal model

obtaining the parameter difference value of the local model and the global model

wherein λ represents a learning rate;

the convergence goal is to minimize the loss function:

O(w，b)＝arg min F (8)

obtaining parameters (w, b) of a final global model;

2. A heterogeneous wireless network spectrum resource sensing system based on federal learning is characterized in that: the system comprises a central base station, wherein a plurality of base stations are arranged in the area of the central base station, and the central base station and the base stations form a communication link to jointly provide service for users; the central base station performs spectrum resource sensing by using the spectrum resource sensing method of the heterogeneous wireless network based on federal learning according to claim 1.

3. The system for sensing spectrum resources of a heterogeneous wireless network based on federated learning of claim 2, wherein: the base station is provided with an edge mobile computing server, and the central base station is provided with a federal learning server.

4. The system for sensing spectrum resources of a heterogeneous wireless network based on federated learning of claim 2, wherein: the users comprise primary users and secondary users; the main user is an authorized user of the spectrum resource, and the owned idle spectrum is transacted with the base station of the nearby affiliated area; and the secondary user is an unauthorized user of the frequency spectrum resource and seeks network access service.