CN114266406A

CN114266406A - Method for predicting traffic flow state of large-scale road network based on federal learning

Info

Publication number: CN114266406A
Application number: CN202111601256.4A
Authority: CN
Inventors: 于海洋; 梁育豪; 任毅龙; 赵亚楠; 兰征兴
Original assignee: Beihang University
Current assignee: Beihang University
Priority date: 2021-12-24
Filing date: 2021-12-24
Publication date: 2022-04-01

Abstract

The present disclosure relates to a large-scale road network traffic flow state prediction scheme based on federal learning, which is characterized by comprising: step one, constructing a directed graph; step two, establishing an initial model; step three, updating the training parameters by using a back propagation algorithm; and step four, obtaining a prediction result by using a federal average algorithm. Based on the method, a large-scale road network is decomposed into a plurality of sub-networks, a plurality of base stations in each sub-network collect traffic flow characteristics of vehicles in a certain range within a period of time, each base station serves as a participant in federal learning, receives global models respectively, trains current sub-network traffic flow prediction models by locally using own data sets, uploads the models to a server for global aggregation, and predicts future states of the road network by the server. The operation cost of the server can be effectively reduced, the training efficiency of the model can be higher, and the prediction effect is better.

Description

Method for predicting traffic flow state of large-scale road network based on federal learning

Technical Field

The invention belongs to the field of road network traffic flow prediction, federal learning and intelligent traffic systems, and particularly relates to a technology for predicting traffic flow states of a large-range road network segmentation sub-road network by using a distributed machine learning framework of federal learning.

Background

The quantity of motor vehicles in China rapidly increases year by year, a series of road network resources such as traffic jam, frequent traffic accidents and the like and the problem of contradiction between supply and demand among motor vehicles are generated, and great inconvenience and trouble are brought to travelers and traffic management departments. The intelligent traffic system is a comprehensive transportation system which effectively and comprehensively applies new-generation scientific technologies such as computer technology, data communication technology and the like to the traffic fields such as traffic transportation, service control and the like, road network traffic flow prediction is an important research direction in the intelligent traffic system, the traffic flow prediction can predict the situation of traffic network evolution in a future period of time of road conditions, accurate travel information is provided for travelers, a basis is provided for traffic managers to actively control traffic, the road network efficiency can be effectively improved, the traffic safety is guaranteed, and meanwhile, the environment is improved and energy is saved.

Federal learning is a distributed machine learning technique proposed by google in 2016. In particular, people train algorithms on multiple decentralized edge devices or servers that own local data samples. The method is obviously different from the traditional centralized machine learning technology, the traditional centralized machine learning technology uploads all local data sets to one server, and the federal learning is that the results are transmitted to the server after the data are trained locally, so that the direct disclosure of personal data is avoided from the source, and the privacy safety of users is protected. Meanwhile, the operation pressure of the central server can be relieved without concentrating the data on the server for operation, so that the model training efficiency is higher.

Conventional traffic flow prediction methods may be classified into parametric models and non-parametric models. The parametric model mainly includes an autoregressive sum moving average model (ARIMA) and a kalman filtering method, and the non-parametric model mainly includes a KNN model and a support vector machine method. With the prosperity of the machine learning method in the new era, a plurality of scholars apply the machine learning method to traffic flow prediction, and the method mainly comprises a convolutional neural network, a generation countermeasure network, a tensor neural network and the like. However, the prediction of the road network traffic flow by the current method is often directed to the road network in a small range, and the load of the server is too large in the case of a large amount of data, so that the prediction time is prolonged, and the prediction of the road network state in a large range is not facilitated.

Disclosure of Invention

The invention aims to solve the technical problem of designing a large-scale road network traffic flow state prediction scheme based on federal learning.

The invention adopts the technical scheme for solving the requirements as follows:

the method comprises the following steps: a large-scale road network is divided into several subnets.

Step two: the server constructs a global prediction model by using a gating cycle unit and a fully connected neural network, generates initial parameters, and distributes the initial global model to the base stations participating in the federal learning in each subnet.

Step three: and the base station in each subnet uses the traffic flow characteristic time sequence data acquired by the base station in a certain time period to train the global model for a plurality of rounds, and uploads the trained local model parameters to the server after the training is finished.

Step four: and the server aggregates the models by receiving the uploaded local model parameters by using a federal learning average aggregation algorithm to generate a new global model, and predicts the traffic flow state at a plurality of moments in the future by using the new global model.

Specifically, the method includes:

a large-scale road network traffic flow state prediction scheme based on federal learning comprises the following steps:

step one, constructing a directed graph

And (C) simulating the large-range road network into a directed graph G (V, E), wherein V is a point set, intersections are simulated into vertexes in the directed graph, E is an edge set, and road sections between the two intersections are simulated into directed edges. Dividing a road network into n disjoint directed sub-graphs according to actual physical characteristics of the traffic road network

Step two, establishing an initial model

For each sub-network

The number of segments obtained by dividing the time segments according to the time interval delta T is marked as T, each base station collects GPS information transmitted by vehicles, and data are gathered and expanded into a matrix

N represents the number of time series; the building gate is formed by connecting T GRUs in series to control a circulation unit h_tOutput for the t GRU unit; h is to be_TAs input of fully connected neural network, input into the network for training to obtain predicted result

Step three, updating the training parameters by using a back propagation algorithm

Updating and training parameters of the fully-connected neural network and parameters of the gating cycle unit by using a back propagation algorithm; wherein the loss function of the back propagation algorithm

Wherein v is the true value in the training set;

obtaining a trained parameter set through a plurality of times of forward propagation and backward propagation

Parameter set

Sending the data to a server;

step four, obtaining a prediction result by using a federal average algorithm

For each subnet G_qThe global model is updated using the federal averaging algorithm:

wherein, | BS_qI is the sum of the number of all base stations participating in federal learning of the current subnet, and i is a base station label; each subnetwork G_qAfter the global model of (2) is updated, W is used_qConstructing a global prediction model by the determined gating cycle units and the fully-connected neural network; inputting the stored historical traffic flow data omega into the model to obtain a prediction result

Preferably, in the second step, in the gating cycle unit, W_r，W_z，W，U_r，U_zU is a weight parameter matrix to be trained, h_tFor the t GRU output, x_tIs a column vector of matrix X; the forward propagation formula is: r is_t＝σ(W_rx_t+U_rh_t-1)；z_t＝σ(W_zx_t+U_zh_t-1)；

Wherein the function is:

output of gated cyclic unit

H is to be_TAs the input of the fully connected neural network, inputting the input into the network for training; w⁽ⁱ⁾Is the weight matrix of the i-th layer of the fully-connected neural network, b⁽ⁱ⁾Is the bias of the i-th layer, z⁽ⁱ⁾Is the output of the i-th layer, a^(i-1)Is an input to the ith layer; the objective function is defined as v ═ W^Ty + b, then the formula for the i-th layer forward propagation is: z is a radical of⁽ⁱ⁾＝W⁽ⁱ⁾a^(i-1)+b⁽ⁱ⁾；a⁽ⁱ⁾＝σ(z⁽ⁱ⁾) And a is a⁽⁰⁾＝h_T(ii) a When the number of hidden layers l is 1, the predicted result

Preferably, in said third step, the parameter W for the gated-cycle cell_r，W_z，W，U_r，U_zU is trained using back propagation, the formula is as follows:

wherein:

after a plurality of times of forward propagation and backward propagation, a group of trained parameter sets can be obtained

According to the technical scheme, the method for predicting the traffic flow state of the large-scale road network based on the federal learning comprises the steps that the large-scale road network is decomposed into a plurality of sub-networks, a plurality of base stations in each sub-network collect traffic flow characteristics of vehicles in a certain range within a period of time, each base station serves as a participant in the federal learning and is enabled to receive a global model respectively, the current sub-network traffic flow prediction model is trained by locally using a data set of the base station, then the current sub-network traffic flow prediction model is uploaded to a server to be subjected to global aggregation, and the future state of the road network is predicted by the server. The method can effectively reduce the operation cost of the server, and can also improve the training efficiency and the prediction effect of the model.

Drawings

FIG. 1 is a schematic flow chart of the present invention.

Detailed Description

The following describes in detail specific embodiments of the present invention.

The method comprises the following steps: and (C) simulating the large-range road network into a directed graph G (V, E), wherein V is a point set, intersections are simulated into vertexes in the directed graph, E is an edge set, and road sections between the two intersections are simulated into directed edges. Dividing a road network into a plurality of disjoint directed sub-graphs according to actual physical characteristics of the traffic road network

Namely:

step two: establishing an initial model:

for each sub-network

Determining a time interval delta T, determining a time period (supposing 1 hour), wherein the number of the segments obtained by dividing the time period according to the time interval delta T is marked as T, and each base station has N time sequence sequences by collecting GPS information transmitted by vehicles. The data set for each base station is spanned into a matrix

A gated cycle unit (GRU) is established. Is formed by connecting T GRUs in series, wherein W_r，W_z，W，U_r，U_zU is a weight parameter matrix to be trained, h_tFor the t GRU output, x_tIs the column vector of matrix X. The forward propagation formula is:

r_t＝σ(W_rx_t+U_rh_t-1)

z_t＝σ(W_zx_t+U_zh_t-1)

wherein the function is:

output of gated cyclic unit

H is to be_TThe input of the fully connected neural network is input into the network for training. W⁽ⁱ⁾Is the weight matrix of the i-th layer of the fully-connected neural network, b⁽ⁱ⁾Is the bias of the i-th layer, z⁽ⁱ⁾Is the output of the i-th layer, a^(i-1)Is an input to the ith layer. Let the objective function be v ═ W^Ty + b, then the formula for the i-th layer forward propagation is:

z⁽ⁱ⁾＝W⁽ⁱ⁾a^(i-1)+b⁽ⁱ⁾

a⁽ⁱ⁾＝σ(z⁽ⁱ⁾) And a is a⁽⁰⁾＝h_T

Assuming that the number of hidden layers is 1, the final prediction result is obtained

Step three: each subnet G_qMiddle base station

Extracting the average speed of the road sections divided by time in a period of time from the database, and opening the data set into a matrix

Each base station uses the data set

Freely segmenting the training set and the test set, for the training set

Using the algorithm as step 2 to forward propagate and obtain the result

v is the true value in the training set. Defining a loss function

The training parameters are then updated using a back propagation algorithm. For parameters of the fully-connected neural network, the formula for the i-th layer back propagation is as follows:

where α represents the learning rate.

Parameter W for gated cycle cell_r，W_z，W，U_r，U_zU is trained using back propagation, the formula is as follows:

wherein:

through a plurality of forward propagation and backward propagation, a group of trained parameter sets can be obtained:

and sending the parameter list to the server.

Step four: server for each subnet G_q，|BS_qAnd | is the sum of all base stations participating in federal learning of the current subnet, and a global model is updated by using a federal mean (FedAVG) algorithm:

each subnetwork G_qAfter the global model of (2) is updated, W is used_qAnd constructing a global prediction model by the determined gating cycle units and the fully-connected neural network. The server uses the stored historical traffic flow data omega to input the historical traffic flow data omega into the model to obtain a prediction result

Claims

1. A method for predicting traffic flow states of a large-scale road network based on federal learning is characterized by comprising the following steps:

step one, constructing a directed graph

Step two, establishing an initial model

For each sub-network

Wherein v is the true value in the training set;

Parameter set

Sending the data to a server;

step four, obtaining a prediction result by using a federal average algorithm

wherein, | BS_qI is the sum of the number of all base stations participating in federal learning of the current subnet, and i is a base station label; each subnetwork G_qIs updated and then used

Constructing a global prediction model by the determined gating cycle units and the fully-connected neural network; inputting the stored historical traffic flow data omega into the model to obtain a prediction result

2. The method according to claim 1, wherein in step two,

in the gated cyclic unit, W_r，W_z，W，U_r，U_zU is a weight parameter matrix to be trained, h_tFor the t GRU output, x_tIs a column vector of matrix X; the forward propagation formula is: r is_t＝σ(W_rx_t+U_rh_t-1)；z_t＝σ(W_zx_t+U_zh_t-1)；

Wherein the function is:

output of gated cyclic unit

H is to be_TAs the input of the fully connected neural network, inputting the input into the network for training; w⁽ⁱ⁾Is the weight matrix of the i-th layer of the fully-connected neural network, b⁽ⁱ⁾Is the bias of the i-th layer, z⁽ⁱ⁾Is the output of the i-th layer, a⁽ⁱ ^-1)Is an input to the ith layer; the objective function is defined as v ═ W^Ty + b, then the formula for the i-th layer forward propagation is:z⁽ⁱ⁾＝W⁽ⁱ⁾a^(i-1)+b⁽ⁱ⁾；a⁽ⁱ⁾＝σ(z⁽ⁱ⁾) And a is a⁽⁰⁾＝h_T(ii) a When the number of hidden layers l is 1, the predicted result

3. The method for predicting traffic flow status of a road network in a wide range based on federal learning according to claim 3, wherein in the third step,

wherein: