CN117133129A

CN117133129A - Traffic speed prediction method based on multi-component attention-seeking neural network

Info

Publication number: CN117133129A
Application number: CN202311394555.4A
Authority: CN
Inventors: 李超; 赵志华; 曾庆田; 段华
Original assignee: Shandong University of Science and Technology
Current assignee: Shandong University of Science and Technology
Priority date: 2023-10-26
Filing date: 2023-10-26
Publication date: 2023-11-28
Anticipated expiration: 2043-10-26
Also published as: CN117133129B

Abstract

The invention discloses a traffic speed prediction method based on a multi-component attention map neural network, which belongs to the field of intelligent traffic and comprises the following steps: defining a network structure of a traffic speed sensor, processing a traffic speed sequence at a historical moment, and then establishing a model mapping relation; constructing a multi-component attention-seeking neural network model for traffic speed prediction; training a multi-component attention-seeking neural network model to obtain a trained model; and acquiring traffic speed data of one hour before the current moment, acquiring corresponding daily period and week period information from a traffic speed sequence of a historical moment, inputting a trained multicomponent attention force diagram neural network model, and predicting the traffic speed of a future time period. According to the invention, various period information is linked with the time characteristics of the predicted time, different period fusion weights are used at different times, and the prediction accuracy of traffic speed is improved.

Description

Traffic speed prediction method based on multi-component attention-seeking neural network

Technical Field

The invention belongs to the field of intelligent traffic, and particularly relates to a traffic speed prediction method based on a multi-component attention-seeking neural network.

Background

With the continuous improvement of artificial intelligence technology and the rapid development of intelligent traffic systems, intelligent traffic has become one of the important directions for building intelligent cities, and in recent years, traffic speed prediction is focused by researchers in intelligent traffic systems and becomes an important research direction. The reliable and accurate traffic speed prediction not only can provide a basis for a passer to select a real-time and accurate driving route, but also can further balance the traffic flow of a road, thereby effectively relieving or avoiding the traffic jam problem.

Traffic speed prediction is a typical spatiotemporal sequence prediction problem, and the historical data of human driving behaviors on each road section form complex three-dimensional data. Since the driving behavior of human beings in real traffic is dynamically and periodically changed, for example, the driving behavior of monday is similar to that of monday and the driving behavior of the last monday, and the driving behavior of monday is similar to that of sunday to a lesser extent. Thus, how to efficiently model temporal, spatial correlation in spatio-temporal sequences and dynamic periodic features in historical information is critical to solving such problems.

In recent years, researchers have achieved some success in traffic prediction by modeling spatial correlation in traffic sequences through a graph neural network. However, in the past, the future time is predicted based on the traffic condition at the latest time, or a plurality of historical information are fused by using a fixed fusion weight, so that the periodicity of dynamic change in the traffic sequence is ignored.

Disclosure of Invention

In order to solve the problems, the invention provides a traffic speed prediction method based on a multi-component attention force diagram neural network, which is used for finely dividing historical data of traffic speed, modeling various historical period information, fully fusing local and global space-time characteristics by using an attention mechanism and improving the fine granularity of space-time modeling; meanwhile, various period information is connected with the time characteristics of the prediction time, different period fusion weights are used at different times, and the prediction effect of the model is improved.

The technical scheme of the invention is as follows:

a traffic speed prediction method based on a multi-component attention-seeking neural network comprises the following steps:

step 1, defining a network structure of a traffic speed sensor, processing a traffic speed sequence at a historical moment, and then establishing a model mapping relation;

step 2, constructing a multi-component attention-seeking neural network model for traffic speed prediction according to the time and space correlation of the traffic speed sequence;

step 3, training a multi-component attention seeking neural network model by using the traffic speed sequence at the historical moment processed in the step 1 to obtain a trained model;

and 4, acquiring traffic speed data of one hour before the current moment, acquiring corresponding daily period and weekly period information from a traffic speed sequence at the historical moment, inputting a trained multicomponent attention map neural network model, and predicting the traffic speed in a future time period.

Further, in step 1, modeling the traffic speed sensor network in the real traffic situation as a directed graphWherein->Representing a set of traffic speed sensor nodes, < +.>Representing traffic speedA set of connection relations between sensor nodes, < >>Adjacency matrix representing a traffic speed sensor network, < >>Representing the number of traffic speed sensor nodes; is provided with->Representing adjacency matrix->Is a special element of the traffic speed sensor node->Traffic speed sensor node->When the distance of (2) is below the threshold value,/->1, otherwise->Is 0;

processing the traffic speed sequence at the historical moment into four-dimensional time-space sequenceWherein->Slice number representing a spatio-temporal sequence, +.>Representing the number of sequences contained in each slice, corresponding to the number of traffic speed sensor nodes, namely，/>Representing the length of the time sequence in each hour, and 3 representing that each time-space sequence contains traffic speed data, time points and Zhou three characteristics;

modeling by fusing the historical period information of the hours, the days and the Wednesday with the time information of the predicted time, and defining the historical period information of the hours asThe date and time history information is defined asCycle history information is defined asThe method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Indicates the current time, ++>Indicates the number of sequences per day,/->Representing the length of the employed history sequence; />Indicating that the traffic speed sensor network is +.>Traffic speed information at the moment; time information of the predicted time is defined as +.>Wherein->Time step representing the predicted moment, +.>Indicating that the traffic speed sensor network is +.>Time information of the moment;

the three-period historical information, the time information of the predicted moment and the traffic speed sensor network are gatheredTogether as input to the model, traffic speed +.>The method comprises the steps of carrying out a first treatment on the surface of the The mapping relation between the model input and the model output is expressed as follows: />The method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Representing the mapping.

In step 2, the multi-component attention seeking neural network model comprises five parts, namely an hour period space-time feature extraction module, a day period space-time feature extraction module, a week period space-time feature extraction module, a multi-period feature fusion module and an output layer; the specific process of traffic speed prediction by the multi-component attention-seeking neural network model is as follows:

step 2.1, extracting the time-space characteristics of the hour period, the day period and the week period respectively through an hour period time-space characteristic extraction module, a day period time-space characteristic extraction module and a week period time-space characteristic extraction module;

step 2.2, combining the time-space characteristics of the hour period, the day period and the week period through a multicycle characteristic fusion module to obtain multicycle time-space characteristics;

step 2.3, setting time information of a predicted time, and inputting a component attention layer by combining multi-period space-time characteristics to obtain final space-time characteristics;

step 2.4, the final space-time characteristics pass through the output layer to make the characteristic dimension be defined byAnd scaling to 1 to obtain a final traffic speed prediction result.

Further, in step 2.1, the hour period space-time feature extraction module, the day period space-time feature extraction module and the week period space-time feature extraction module are all constructed by using space-time feature extraction modules with the same structure; the space-time feature extraction module consists of a convolution layer, three time-space layers and a jumping attention layer; the three space-time layers are respectively used for extracting space-time characteristics of different scales, the output of the first space-time layer is the input of the second space-time layer, and the output of the second space-time layer is the input of the third space-time layer; the result of the output cascade of the three time-space layers is the final required time-space characteristics; each time space layer has the same structure and comprises two time convolution layers and a picture scroll lamination layer.

Further, the process of extracting the space-time features by the space-time feature extracting module is as follows:

step 2.1.1, inputting cycle history information into oneIs subjected to two-dimensional convolution operation to obtain a characteristic dimension of +.>Is>And will->Feeding a first time blank layer;

step 2.1.2,Extracting features by two parallel time convolution layers, performing nonlinear transformation by using an activation function, and multiplying the transformed results to obtain time features ∈ ->；/>The calculation formula of (2) is as follows:

(1)；

wherein,representing the Hadamard product; />Representing a Tanh activation function; />Representing a Sigmoid activation function;and->The weights of the two time convolution layers respectively;

step 2.1.3, willInput diffusion convolution layer, in->Modeling the spatial correlation of adjacent traffic speed sensor nodes in the forward direction, the reverse direction and the global direction by using a forward diffusion matrix, a backward diffusion matrix and an adaptive diffusion matrix on each spatial step to obtain space-time characteristics ∈ ->；/>The calculation formula of (2) is as follows:

(2)；

(3)；

(4)；

(5)；

wherein,is a forward diffusion matrix->Is->Forward diffusion matrices for each spatial step; />Is a backward diffusion matrix->Is->A backward diffusion matrix of each spatial step; />Is an adaptive diffusion matrix; />Is->An adaptive diffusion matrix of individual spatial steps; />、/>And->Respectively represent +.>A parameter matrix which can be learned in each space step; />For summing the diffusion matrix by rows; />Is a normalized exponential function; />Activating a function for Randomized Leaky ReLU; />Embedding a vector for a source node; />Embedding a vector for a target node;

step 2.1.4, willAnd->Residual connection is carried out to obtain the output +.>；

Step 2.1.5, willAs input of the second time-space layer, the output of the second time-space layer is obtained according to the same process as in steps 2.1.2-2.1.4>；

Step 2.1.6, willAs input of the third time-space layer, the output of the third time-space layer is obtained according to the same process as in steps 2.1.2-2.1.4>；

Step 2.1.7, will、/>、/>Merging in the last dimension to obtain new spatio-temporal featuresWherein->Representation->、/>、/>The sum of the three sequence lengths;

step 2.1.8, willSending into a jump attention layer, calculating the correlation between the space-time characteristics of different scales to obtain the space-time characteristics +.>The method comprises the steps of carrying out a first treatment on the surface of the The jumping attention layer adopts 4 attention heads, and each attention head uses different weights;

in the hour period space-time feature extraction module, the period history information input in step 2.1.1 is the hour period history informationAfter the process of the steps 2.1.1 to 2.1.8, the hour period space-time characteristic is extracted>；

In the time-space feature extraction module of the day period, the period history information input in the step 2.1.1 is the day period history informationAfter the process of the steps 2.1.1 to 2.1.8, the day-period space-time characteristics are extracted>；

In the cycle time-space feature extraction module, the cycle history information input in step 2.1.1 is cycle history informationAfter the process of the steps 2.1.1 to 2.1.8, the weekly-periodic space-time characteristics are extracted>。

Further, the specific process of step 2.2 is as follows:

will be、/>、/>Combining in feature dimension to obtain multi-period space-time featureHere +.>，/>The number of period information is represented.

Further, the specific process of step 2.3 is as follows:

step 2.3.1, time information of the predicted time is setWherein->Feature dimensions representing time information; will->Input a +.>Is given a characteristic dimension +.>Time characteristics of (2)；

Step 2.3.2, the multi-component attention layer adopts a multi-head attention mechanism; characterization of timeMulti-period spatiotemporal feature as query>As key and value, inputting multi-component attention layer, multiple head attention mechanism will be mapped to three subspaces respectively, calculating correlation of time feature and multi-period time-space feature, obtaining final time-space feature。

Further, the specific process of step 2.4 is as follows:

first, willInputting an RReLU activation function to perform nonlinear activation on the decoded space-time characteristics; then, a +.>Is to scale up the dimension to +.>And performing nonlinear activation by using the RReLU activation function; finally, a +.>Scaling the dimension to 1 to obtain a final output result; the specific formula is as follows:

(6)；

wherein,、/>respectively representing the weights of the two convolution layers.

The invention has the beneficial technical effects that: the invention provides a multi-component attention layer, historical information is subdivided according to different periods, and the historical information of different periods is fused by using the time characteristics of the predicted time, so that different fusion weights are realized according to different predicted times. The invention provides a space-time feature extraction module which fuses space-time features with different scales, so that the problem of losing local information caused by increasing the expansion coefficient of time convolution is solved, and the space-time modeling granularity of a model is improved. The invention discloses a traffic speed prediction method based on multicycle information, which provides a transit time prediction method for fusing different cycle history information by using time characteristics of prediction moments, and solves the problem that the traditional statistical model and the existing deep learning prediction method can not capture the dynamic periodicity of a time sequence well.

Drawings

FIG. 1 is a flow chart of a traffic speed prediction method based on a multi-component attention-seeking neural network according to the present invention.

Detailed Description

The invention is described in further detail below with reference to the attached drawings and detailed description:

as shown in fig. 1, the invention provides a traffic speed prediction method based on a multi-component attention-seeking neural network, which can improve the prediction accuracy of urban traffic speed, and mainly comprises the following steps:

and step 1, defining a traffic speed sensor network structure, processing a traffic speed sequence at a historical moment, and then establishing a model mapping relation.

Modeling traffic speed sensor network in real traffic situation as a directed graphWherein->Representing a set of traffic speed sensor nodes, < +.>Representing a set of connection relationships between traffic speed sensor nodes,adjacency matrix representing a traffic speed sensor network, < >>Representing the number of traffic speed sensor nodes; is provided with->Representing adjacency matrix->Is a special element of the traffic speed sensor node->Traffic speed sensor node->When the distance of (2) is below the threshold value,/->1, otherwise->Is 0; processing traffic speed sequence at historical moment into three-dimensional space-time sequenceWherein->Slice number representing a spatio-temporal sequence, +.>Representing the number of sequences contained in each slice, corresponding to the number of traffic speed sensor nodes, i.e. +.>，/>Representing the length of the time sequence in each hour, and 3 representing that each time-space sequence contains traffic speed data, time points and Zhou three characteristics;

the invention combines the historical period information of three kinds of hours, days and weeks with the time information of the predicted time to model, and defines the historical period information of the hours asThe date and time history information is defined asCycle history information is defined asThe method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Indicates the current time, ++>Representing the number of sequences per dayQuantity (S)>Representing the length of the employed history sequence; />Indicating that the traffic speed sensor network is +.>Traffic speed information at the moment; the time information of the predicted moment is defined as +.>Wherein->Time step representing the predicted moment, +.>Indicating that the traffic speed sensor network is +.>Time information of the moment.

The three-period historical information, the time information of the predicted moment and the traffic speed sensor network are gatheredTogether as input to the model, traffic speed +.>. The mapping of model inputs and outputs can be expressed as: />The method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Representing the mapping.

Step 2, constructing a multi-component attention-seeking neural network model for traffic speed prediction according to the time and space correlation of the traffic speed sequence; the multi-component attention-seeking neural network model mainly comprises an hour period space-time feature extraction module, a day period space-time feature extraction module, a week period space-time feature extraction module, a multi-period feature fusion module and an output layer.

The specific process of traffic speed prediction by the multi-component attention-seeking neural network model is as follows:

the hour period space-time feature extraction module, the day period space-time feature extraction module and the week period space-time feature extraction module are all constructed by using space-time feature extraction modules with the same structure. The space-time feature extraction module consists of a convolution layer, three time-space layers and a jumping attention layer; the three space-time layers are respectively used for extracting space-time characteristics of different scales, the output of the first space-time layer is the input of the second space-time layer, and the output of the second space-time layer is the input of the third space-time layer; the result of the output cascade of the three time-space layers is the final required time-space characteristics; each time space layer has the same structure and comprises two time convolution layers and a picture scroll lamination layer.

The process of extracting the space-time characteristics by the space-time characteristic extracting module is as follows:

step 2.1.1, inputting period information into oneIs to change +.>Is subjected to two-dimensional convolution operation to obtain the characteristic dimension of +.>Is>And will->Feeding a first time blank layer;

step 2.1.2,Extracting features by two parallel time convolution layers, performing nonlinear transformation by using an activation function, and multiplying the transformed results to obtain time features ∈ ->. One of the temporal convolution layers is scaled between (-1, 1) using the Tanh activation function; another temporal convolution layer is scaled between (0, 1) using a Sigmoid activation function.

The calculation formula of (2) is as follows:

(1)；

wherein,representing the Hadamard product; />Representing a Tanh activation function; />Representing a Sigmoid activation function;and->The weights of the two temporal convolution layers, respectively.

Step 2.1.3, willInput diffusion convolution layer, in->Modeling the spatial correlation of adjacent traffic speed sensor nodes in the forward direction, the reverse direction and the global direction by using a forward diffusion matrix, a backward diffusion matrix and an adaptive diffusion matrix on each spatial step to obtain space-time characteristics ∈ ->；

The calculation formula of (2) is as follows:

(2)；

(3)；

(4)；

(5)；

wherein,is a forward diffusion matrix->Is->Forward diffusion matrix of individual spatial steps, calculate +.>At a value of (2) byPersonal->Multiplication is carried out to obtain; />Is a backward diffusion matrix->Is->The backward diffusion matrix of the individual spatial steps, calculating +.>When the value of (2) is->Personal->Multiplication is carried out to obtain; />Is an adaptive diffusion matrix; />Is->Adaptive diffusion matrix for individual spatial steps, computationWhen the value of (2) is->Personal->Multiplication is carried out to obtain; />、/>And->Respectively represent +.>A parameter matrix which can be learned in each space step; />For summing the diffusion matrix by rows; />Is a normalized exponential function;activating a function for Randomized Leaky ReLU; />Embedding a vector for a source node; />A vector is embedded for the target node.

Step 2.1.4, at last, willAnd->Residual connection is carried out to obtain the output +.>；

Step 2.1.5, the spatio-temporal features extracted by the first spatio-temporal layer are relatively local and will therefore beAs input of the second time-space layer, the output of the second time-space layer is obtained according to the same process as in steps 2.1.2-2.1.4>；

Step 2.1.7 due to、/>、/>Different sequence lengths belonging to different scales of spatiotemporal features, thus will +.>、/>、/>Combining in the last dimension to get a new spatio-temporal feature +.>Wherein->Representation->、/>、/>The sum of the three sequence lengths; the spatio-temporal features extracted at this time contain local and global information.

Step 2.1.8, willSend into the jumping attention layer, calculate +.>Correlation between them, a single period of spatio-temporal features ∈>The method comprises the steps of carrying out a first treatment on the surface of the The jumping attention layer adopts 4 attention heads, and each attention head uses different weights, so that the space-time characteristics of different scales are fused, and the purpose of enriching the space-time characteristic expression capability is achieved.

In the cycle time-space feature extraction module, the cycle history information input in step 2.1.1 is cycle history informationAfter the process of the steps 2.1.1 to 2.1.8, the weekly-periodic space-time characteristics are extracted>；

、/>、/>Is +.>A four-dimensional tensor;

step 2.2, combining the time-space characteristics of the hour period, the day period and the week period through a multicycle characteristic fusion module to obtain multicycle time-space characteristics; the specific process is as follows:

will be、/>、/>Combining in feature dimension to obtain multi-period space-time featureHere +.>，/>The number of the period information is represented; in the present invention, the hour period, day period and week period are used, so +.>Set to 3.

Step 2.3, setting time information of a predicted time, and inputting a component attention layer by combining multi-period space-time characteristics to obtain final space-time characteristics; the specific process is as follows:

step 2.3.1, time information of the predicted time is setWherein->Feature dimensions representing time information, such as: week, time, holiday, etc.; />A time step representing the predicted time; will->Input a +.>Is given a characteristic dimension +.>Time characteristics of->The feature dimensions herein are consistent with the feature dimensions of the extracted multicycle spatiotemporal features.

Step 2.4, the final space-time characteristics pass through the output layer to make the characteristic dimension be defined byScaling to 1 to obtain a final traffic speed prediction result; the specific process is as followsThe following steps:

first, willInputting an RReLU activation function to perform nonlinear activation on the decoded space-time characteristics; then, a +.>Is to scale up the dimension to +.>And performing nonlinear activation by using the RReLU activation function; finally, a +.>The dimension is scaled to 1 to obtain the final output result. The specific formula is as follows:

(6)；

wherein,、/>respectively representing the weights of the two convolution layers; />Representing Randomized Leaky ReLU activation functions.

The multi-period feature fusion module applies an attention mechanism to the fusion of the traffic speed historical information by considering different period influence weights under different time, calculates the fusion weights of different period historical information by the time features of the prediction moment, realizes dynamic periodic modeling, and improves the accuracy of traffic speed prediction.

And 3, training a multi-component attention seeking neural network model by using the traffic speed sequence at the historical moment processed in the step 1, setting the convolution layer number to 3, setting the attention head to 4 in the training process, adjusting the learning rate by using a cosine annealing function, and optimizing the model by using an Adamw optimizer to obtain a trained model.

In order to demonstrate the feasibility and superiority of the present invention, the following comparative experiments were performed. The specific experimental results are shown in tables 1 and 2.

Table 1 experimental comparison results on the los angeles loop speed dataset;

。

table 2 experimental comparison results on the los angeles bay area velocity dataset;

。

the experiment is a comparison experiment of five time sequence prediction models of a multi-component attention seeking neural network model MCAGCN, a long and short period memory neural network FC-LSTM, a diffusion convolution circulation neural network DCRNN, a Graph wavelet network Graph Wavenet, an attention time space Graph neural network ASTGCN and a Graph multi-attention network GMANN, which are designed and realized by using average absolute error MAE, root mean square error RMSE and average absolute percentage error MAPE as evaluation indexes on two data sets of los Angeles loop traffic speed and los Angeles bay area traffic speed. 15 minutes, 30 minutes, 60 minutes in the table represent data at 15 minutes, 30 minutes, 60 minutes, respectively, in the future, such as 3.44 MAE for the long and short term memory neural network at 15 minutes in the los Angeles loop speed dataset. As can be seen from tables 1 and 2, the multi-component attention seeking neural network model of the invention is obviously superior to other network models in MAE, RMSE, MAPE index, has the least error value of the prediction result in short-term, medium-term and long-term predictions and the best prediction effect, namely, the invention can realize the prediction of traffic speed data and has higher accuracy and applicability in short-term, medium-term and long-term scales. Therefore, the model of the invention can be used as an effective traffic speed prediction model to provide technical support for traffic speed prediction and analysis.

The invention builds the multi-component attention force diagram neural network model for traffic speed prediction based on the multi-component attention layer and the space-time attention layer, solves the problems of periodic modeling of traffic speed and fine granularity modeling of space-time information by extracting various kinds of periodicity and local and global information among sequences in the traffic speed sequence, and improves the accuracy of the traffic speed prediction model.

It should be understood that the above description is not intended to limit the invention to the particular embodiments disclosed, but to limit the invention to the particular embodiments disclosed, and that the invention is not limited to the particular embodiments disclosed, but is intended to cover modifications, adaptations, additions and alternatives falling within the spirit and scope of the invention.

Claims

1. The traffic speed prediction method based on the multi-component attention-seeking neural network is characterized by comprising the following steps of:

2. According to claim 1The traffic speed prediction method based on the multicomponent attention seeking neural network is characterized in that in the step 1, a traffic speed sensor network in a real traffic situation is modeled as a directed graphWherein->Representing a set of traffic speed sensor nodes, < +.>Representing a set of connection relations between traffic speed sensor nodes,/->Adjacency matrix representing a traffic speed sensor network, < >>Representing the number of traffic speed sensor nodes; is provided with->Representing adjacency matrix->Is a special element of the traffic speed sensor node->Traffic speed sensor node->When the distance of (2) is below the threshold value,/->1, otherwise->Is 0;

processing the traffic speed sequence at the historical moment into four-dimensional time-space sequenceWherein->Slice number representing a spatio-temporal sequence, +.>Representing the number of sequences contained in each slice, corresponding to the number of traffic speed sensor nodes, i.e. +.>，Representing the length of the time sequence in each hour, and 3 representing that each time-space sequence contains traffic speed data, time points and Zhou three characteristics;

3. The traffic speed prediction method based on the multicomponent attention seeking neural network according to claim 2, wherein in the step 2, the multicomponent attention seeking neural network model comprises five parts of an hour period space-time feature extraction module, a day period space-time feature extraction module, a week period space-time feature extraction module, a multicycle feature fusion module and an output layer; the specific process of traffic speed prediction by the multi-component attention-seeking neural network model is as follows:

4. The traffic speed prediction method based on multi-component attention-seeking neural network according to claim 3, wherein in the step 2.1, an hour period space-time feature extraction module, a day period space-time feature extraction module and a week period space-time feature extraction module are all constructed by using space-time feature extraction modules with the same structure; the space-time feature extraction module consists of a convolution layer, three time-space layers and a jumping attention layer; the three space-time layers are respectively used for extracting space-time characteristics of different scales, the output of the first space-time layer is the input of the second space-time layer, and the output of the second space-time layer is the input of the third space-time layer; the result of the output cascade of the three time-space layers is the final required time-space characteristics; each time space layer has the same structure and comprises two time convolution layers and a picture scroll lamination layer.

5. The traffic speed prediction method based on a multi-component attention-seeking neural network according to claim 4, wherein the process of extracting the spatio-temporal features by the spatio-temporal feature extraction module is as follows:

(1)；

wherein,representing the Hadamard product; />Representing a Tanh activation function; />Representing a Sigmoid activation function; />Andthe weights of the two time convolution layers respectively;

(2)；

(3)；

(4)；

(5)；

step 2.1.8, willSending into a jump attention layer, calculating the correlation between the space-time characteristics of different scales to obtain the space-time characteristics +.>The method comprises the steps of carrying out a first treatment on the surface of the The jumping attention layer uses 4 attention heads, each attentionThe heads all use different weights;

6. The traffic speed prediction method based on a multicomponent attention seeking neural network according to claim 5, wherein the specific process of step 2.2 is as follows:

will be、/>、/>Combining in feature dimension to obtain multicycle space-time feature ++>Here +.>，/>The number of period information is represented.

7. The traffic speed prediction method based on a multicomponent attention seeking neural network according to claim 6, wherein the specific process of step 2.3 is as follows:

step 2.3.1, time information of the predicted time is setWherein->Feature dimensions representing time information; will->Input a +.>Is given a characteristic dimension +.>Time characteristics of->；

Step 2.3.2, the multi-component attention layer adopts a multi-head attention mechanism; characterization of timeMulti-period spatiotemporal feature as query>As key and value, inputting multi-component attention layer, multiple head attention mechanism will map to three subspaces respectively, calculate correlation of time feature and multi-period time-space feature, obtain final time-space feature>。

8. The traffic speed prediction method based on a multicomponent attention seeking neural network according to claim 7, wherein the specific process of step 2.4 is as follows:

(6)；