CN111242394B

CN111242394B - Method and system for extracting spatial correlation characteristics

Info

Publication number: CN111242394B
Application number: CN202010336499.9A
Authority: CN
Inventors: 韦伟; 刘岭; 石晶; 刘军; 王莹
Original assignee: CRSC Research and Design Institute Group Co Ltd
Current assignee: CRSC Research and Design Institute Group Co Ltd
Priority date: 2020-04-26
Filing date: 2020-04-26
Publication date: 2021-03-23
Anticipated expiration: 2040-04-26
Also published as: CN111242394A

Abstract

The invention belongs to the technical field of data processing, and discloses a method and a system for extracting spatial correlation characteristics, wherein the method for extracting the spatial correlation characteristics comprises the following steps: step S1: selecting the spatial correlation data to obtain at least one periodic data set and a trend data set; step S2: processing at least one periodic data set based on a time sliding mechanism, and performing feature extraction and aggregation on the processed at least one periodic data set to obtain a feature map; step S3: and constructing a residual convolution unit, and extracting spatial correlation characteristics according to the trend data set and the characteristic diagram. The invention realizes accurate description of complex space dependence and provides more accurate data support for space-time prediction in the future.

Description

Method and system for extracting spatial correlation characteristics

Technical Field

The invention belongs to the field of data processing, and particularly relates to a method and a system for extracting spatial correlation characteristics.

Background

The OD data refers to the magnitude of a social or economic interactive relationship variable from a departure point (o point) to a destination point (d point) in a certain period of time, and is also called OD data. The interactive relationship between the spatial geographic units in a certain time period comprises the economic trade scale, the social communication frequency or the passenger and goods transportation volume between areas. The combination of the OD quantities between all the origin (o) and destination (d) points is called the OD matrix. Prediction of OD data is often an important basis for making socio-economic activity plans and decisions between regions. Generally, the task of OD data prediction is to predict OD data in a subsequent period on the premise of given historical OD data (and corresponding environmental information), so as to be an important basis for socio-economic activity organization and management decision.

For a certain traffic line network, there exists a complex spatial dependency relationship of OD data in space, and such spatial dependency can be further divided into origin dependency and destination dependency according to cause. The two spatial dependencies differ significantly from the spatial dependency relationship embodied in conventional spatial data. How to accurately identify the data is an important basis for accurate prediction of OD data.

The origin depends on: there is a correlation between OD amounts starting from an adjacent departure point and ending at a specific destination. For example, in a subway line network, the same change law is exhibited by the traveling amounts of passengers arriving at the same station in the center of a city, starting from adjacent stations in the vicinity of a certain residential area.

Destination dependence: there is a correlation between OD amounts starting from a specific departure point and ending at an adjacent destination point. For example, in a subway line network, the traveling volume of passengers departing from a station in a certain residential area and arriving at an adjacent station in the center of a city has a strong positive correlation.

In recent years, successful application of deep learning in various fields has stimulated research and attempts for its application in the fields of transportation and transportation. For example, some studies consider road network traffic throughout a city as a thermodynamic diagram (where each pixel value represents traffic within the corresponding region), and model the spatial dependence of the non-linearity using Convolutional Neural Networks (CNNs). In addition, some researchers have proposed using Recurrent Neural Networks (RNNs) to build non-linear time-dependent models for traffic flow prediction. In subsequent researches, CNN and RNN are organically fused, a comprehensive prediction model considering the time and space dependence relationship is provided, and the prediction accuracy is further improved. However, in many of these studies, urban road traffic flow is considered as a study target, and the spatial distribution and the dependency characteristics that require the OD data in a complicated manner are not considered enough, and it is difficult to realize accurate prediction.

Disclosure of Invention

In view of the above problem, the present invention provides a method for extracting spatial correlation features, where the method includes:

step S1: selecting the spatial correlation data to obtain a periodic data set and a trend data set;

step S2: processing the periodic data set based on a time sliding mechanism, and then performing feature extraction and aggregation on the periodic data set to obtain a feature map;

step S3: and constructing a residual convolution unit, and extracting spatial correlation characteristics according to the trend data set and the characteristic diagram.

In the above extraction method, step S1 includes:

and selecting a periodic data set and a trend data set by taking different time periods as prediction targets according to the spatial correlation data.

In the above extraction method, the cycle data set includes a weekly interval historical cycle data set and a daily interval historical cycle data set, and the trend data set is a current day trend data set.

In the above extraction method, step S2 includes:

step S21: processing the weekly interval historical period data set and the daily interval historical period data set based on a time sliding mechanism to obtain a sliding weekly interval historical period data set and sliding daily interval historical period data;

step S22: and performing feature extraction and aggregation on the sliding week interval historical period data set and the sliding day interval historical period data set through a two-dimensional point-by-point convolution layer to obtain a feature map.

In the above extraction method, step S3 includes:

step S31: extracting the characteristics of departure place dependence and destination dependence by stacking a plurality of depth-separated one-dimensional convolution layers;

step S32: and extracting a spatial correlation structure through the one-dimensional point-by-point convolutional layer, and aggregating the departure place dependence and destination dependence characteristics to output spatial correlation characteristics.

The invention also provides a system for extracting the spatial correlation characteristics, which comprises the following steps:

a data set construction unit: selecting the spatial correlation data to obtain a periodic data set and a trend data set;

a feature map obtaining unit: processing the periodic data set based on a time sliding mechanism, and then performing feature extraction and aggregation on the periodic data set to obtain a feature map;

residual convolution unit construction unit: and constructing a residual convolution unit, and extracting spatial correlation characteristics according to the trend data set and the characteristic diagram.

In the above extraction system, the data set construction unit selects the cycle data set and the trend data set for the prediction target at different time periods according to the spatial correlation data.

In the above extraction system, the cycle data set includes a weekly interval historical cycle data set and a daily interval historical cycle data set, and the trend data set is a current day trend data set.

In the above extraction system, the feature map obtaining unit includes:

a data set processing module: processing the weekly interval historical period data set and the daily interval historical period data set based on a time sliding mechanism to obtain a sliding weekly interval historical period data set and sliding daily interval historical period data;

a characteristic diagram obtaining module: and performing feature extraction and aggregation on the sliding week interval historical period data set and the sliding day interval historical period data set through a two-dimensional point-by-point convolution layer to obtain a feature map.

In the above extraction system, the residual convolution unit construction unit includes:

a dependent feature extraction module: extracting the characteristics of departure place dependence and destination dependence by stacking a plurality of depth-separated one-dimensional convolution layers;

the spatial correlation characteristic output module: and extracting a spatial correlation structure through the one-dimensional point-by-point convolutional layer, and aggregating the departure place dependence and destination dependence characteristics to output spatial correlation characteristics.

Aiming at the prior art, the invention has the following effects: aiming at the complex spatial correlation of spatial correlation data, a passenger flow residual error convolution unit is constructed, meanwhile, departure place dependence and destination dependence characteristics are identified, accurate description of the complex spatial dependence is achieved, and more accurate data support is provided for space-time prediction in the future.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.

FIG. 1 is a flow chart of the extraction method of the present invention;

FIG. 2 is a flowchart illustrating the substeps of step S2 in FIG. 1;

FIG. 3 is a flowchart illustrating the substeps of step S3 in FIG. 1;

FIG. 4 is a schematic diagram of the extraction system;

FIG. 5 is a schematic diagram of an OD sparse spatiotemporal residual network model;

FIG. 6 is a schematic diagram of a sliding time window mechanism for periodic historical feature extraction;

FIG. 7 is a schematic diagram of the structure of an OD residual convolution unit;

FIG. 8 is a schematic diagram of spatial feature extraction;

FIG. 9 is a simplified schematic diagram of a sequential processing unit;

FIG. 10 is a schematic diagram of a sequential stacked feature matrix processing procedure;

fig. 11 is a schematic diagram of a non-zero element attention mechanism considering sparsity.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As used herein, the terms "comprising," "including," "having," "containing," and the like are open-ended terms that mean including, but not limited to.

References to "a plurality" herein include "two" and "more than two".

Referring to fig. 1, fig. 1 is a flow chart of the extraction method of the present invention. As shown in fig. 1, the method for extracting spatial correlation features of the present invention includes:

step S1: and selecting the spatial correlation data to obtain at least one periodic data set and a trend data set.

Specifically, in step S1, at least one period data set and a trend data set are selected from the spatial correlation data and the prediction targets in different time periods, wherein the at least one period data set includes at least one of a week interval historical period data set, a day interval historical period data set, a month interval historical period data set and a year interval historical period data set, and the trend data set is a current day trend data set.

Step S2: and processing at least one periodic data set based on a time sliding mechanism, and performing feature extraction and aggregation on the processed at least one periodic data set to obtain a feature map.

Referring to fig. 2, fig. 2 is a flowchart illustrating a sub-step of step S2 in fig. 1. As shown in fig. 2, the step S2 includes:

step S21: processing at least one periodic data set based on a time sliding mechanism to correspondingly obtain at least one sliding periodic data set;

step S22: and performing feature extraction and aggregation on at least one sliding period data set through a two-dimensional point-by-point convolution layer to obtain a feature map.

Referring to fig. 3, fig. 3 is a flowchart illustrating a substep of step S3 in fig. 1. As shown in fig. 3, the step S3 includes:

Referring to fig. 5-10, the principle of the extraction method will be described with reference to fig. 5-10 in an embodiment. It should be noted that, in the embodiment, the extraction method of the present invention is applied to demand prediction of rail transit passenger flow, but the present invention is not limited thereto, and in other embodiments, the present invention can also be applied to analysis and prediction of other socioeconomic phenomena having origin-destination points at the same time, such as trade between countries and regions, population migration, and various types of passenger and cargo transportation.

First, the spatial correlation data is preprocessed.

Specifically, the weekly interval historical cycle data set, the daily interval historical cycle data set, and the current day trend data set are selected from the historical data using different time periods as prediction targets in this step.

Definition 1: for a railway transit network including N stations, the set of all stations is S (S = {1,2,3, …, N }), and the set of all trips in the t-th time slot of the study is P_tFor stations i (i ∈ S), j (j ∈ S), OD passenger flow volume from station i to station j in t-th time period

Can be defined as:

wherein p is_oAnd p_dRepresents the origin and destination of the p-th trip in the t-th time period, | circuitry represents the aggregate potential, which is the number of elements in the aggregate, in equation (1),

|{p_o=i∧p_d= j } | denotes the set { p_o=i∧p_dNumber of elements in = j }.

Define 2 (OD traffic matrix): in the studied railway traffic network, the OD passenger flow between all stations in the t-th time period

Forming an OD passenger flow matrix X of the rail transit line network in the t-th time period_t，X_t∈R^N×NR is a real number, i.e. X_tThe matrix is a real number matrix of dimension N × N, as shown in the following equation.

（2）

For a particular rail transit network, the OD passenger flow matrix { X over the first m-1 time periods is given_tI t =1,2,3, …, m-1}, and predicting OD traffic matrix X in the m-th time period thereof_mThe problem of predicting passenger flow of a rail transit network can be expressed as follows:

（3）

in which Ω(s) are prediction models or prediction functions, E_tThe environmental variables in the t-th time period are used for describing environmental information such as weather conditions (sunny days, cloudy days, rain, snow and fog), air temperature, wind speed, whether the weather is a holiday or not and the like. And secondly, constructing an OD passenger flow sparse space-time residual error network model according to the processed data.

Wherein, the method comprises the following steps:

1) and processing the weekly interval historical period data set and the daily interval historical period data set based on a time sliding mechanism to obtain a sliding weekly interval historical period data set and sliding daily interval historical period data.

Specifically, referring to fig. 5, fig. 5 is a schematic diagram of a sparse space-time residual error network model for OD passenger flow. As shown in FIG. 5, the predicted OD traffic matrix of the t-th time period is not all strongly correlated with the OD traffic matrix of the first t-1 time periods. For example, the OD passenger flow of 8:00 of the early peak of a certain Tuesday is only strongly correlated with 7:00-8:00 of the day, and is also strongly correlated with historical data around a plurality of working days and a plurality of 8:00 early peaks of the Tuesday, and is less dependent on the passenger flow data of other time periods. Therefore, the invention respectively selects a weekly interval historical period data set, a daily interval historical period data set and a current day trend data set from the historical data as input variables of the OD passenger flow sparse space-time residual error network model.

Wherein the trend data set S of the day_trFrom the top q of the pseudo-prediction period t_trThe OD traffic matrix for each time interval is constructed as shown below. Wherein the content of the first and second substances,

backtracking k for a quasi-prediction time period t_trThe OD traffic matrix for each time period,

r is a real number, i.e. S_trBelong to q_trX N dimensional real matrix.

（4）

Because the evolution law of the OD passenger flow in each period has certain difference, the OD passenger flow in the period t to be predicted has strong relevance with the observed value before the interval of the historical integer periods, and depends on the observed value of the surrounding period to a certain extent. For example, the OD traffic of 8:00 early peak on a certain tuesday is strongly correlated with the observed values of 8:00 early peak on the last tuesday and the adjacent time period. Therefore, a certain time sliding mechanism needs to be considered when constructing the weekly and daily interval historical period data sets. For the k-th_wInterval of every week and k_dSliding historical data set of daily intervals of individuals

And

and can be represented by formula (5).

（5）

Wherein the content of the first and second substances,

r is a real number, i.e.

Belong to (2 p)_wA real matrix of + 1) x NxN dimensions,

r is a real number, i.e.

Belong to (2 p)_dA real matrix of + 1) × nxn dimensions. q. q.s_wAnd q is_dHistorical cycles in the historical cycle data set are weekly intervals and daily intervals, respectively. p is a radical of_wAnd p_dThe time sliding window sizes are respectively. I is the number of time periods of the day, related to the length of the study period. Weekly and daily interval historical period data set S_wAnd S_dHistorical periodic data sets, possibly from slip cycle intervals

Historical periodic data set spaced from sliding days

The construction is carried out as shown in formula (6).

（6）

Wherein the content of the first and second substances,

，

。

2) and constructing a two-dimensional point-by-point convolution layer, and performing feature extraction and aggregation on the sliding week interval historical period data set and the sliding day interval historical period data set to obtain a feature map.

Referring to fig. 6, fig. 6 is a schematic diagram of a sliding time window mechanism for periodic historical feature extraction. 5-6, part a of FIG. 5, the present invention employs two-dimensional point-by-point convolutional layer 1x1_ cov2 for sliding history data set

And

carrying out feature extraction and aggregation to obtain a feature map

And

，

，

as shown in equation (7).

（7）

Wherein the content of the first and second substances,

and

for learnable parameters, f is an activation function,

the convolution operation is shown, and the ReLU function is selected to ensure the sparsity of the feature, but the invention is not limited thereto.

And thirdly, constructing an OD passenger flow residual error convolution unit, and extracting spatial features from the feature map and the current day trend data set.

The method comprises the following steps:

1) constructing a plurality of depth separation one-dimensional convolution layers, and extracting the characteristics of departure place dependence or destination dependence by stacking the depth separation one-dimensional convolution layers;

2) and constructing a one-dimensional point-by-point convolutional layer, extracting a spatial correlation structure through the one-dimensional point-by-point convolutional layer, and aggregating the two spatial dependencies to output spatial characteristics.

Please refer to fig. 7 and 8; FIG. 7 is a schematic diagram of the OD passenger flow residual convolution unit structure; fig. 8 is a schematic diagram of spatial feature extraction. As shown in fig. 7 and 8, in part b of fig. 5, the present invention proposes an OD passenger flow residual convolution unit OD _ result for extracting the characteristics of origin dependency and destination dependency at the same time, and its structure is shown in fig. 7. By stacking a plurality of depth-separated one-dimensional convolutional layers Ds _ cov1, the OD passenger flow residual convolutional unit realizes the extraction of the departure place dependence or destination dependence characteristics, meanwhile, the one-dimensional point-by-point convolutional layers 1x1_ cov1 are used for extracting the spatial correlation structure, and the two spatial dependencies are aggregated and output. The OD passenger flow residual convolution unit also introduces residual concatenation to prevent gradient vanishing from occurring. Are respectively paired

、

And

extracting spatial features to obtain features

、

And

as shown in formula (8).

（8）

FIG. 8 shows the spatial feature extraction principle based on Ds _ cov1 and 1x1_ cov 1. Taking destination dependent feature extraction as an example, rows in the OD traffic matrix are considered as channel dimensions herein. The OD traffic matrix can be converted into a 1 xn dimensional picture containing N channels, each channel representing the OD traffic structure from a particular station. Ds _ cov1 may utilize a 1 × N structure, wherein the present invention uses a 1 × 3 convolution kernel to extract the correlation between OD traffic arriving at adjacent stations from the same station, but the present invention is not limited thereto. Because Ds _ cov1 adopts different convolution kernels in different channels, the OD passenger flow residual convolution unit realizes the destination dependent feature extraction of all departure stations. By stacking multiple Ds _ cov1 layers, the OD passenger flow residual convolution unit can further identify a larger spatial range of destination dependent features. Finally, the OD passenger flow residual convolution unit utilizes 1x1_ cov1 to identify the association structure between a certain road section (channel) and other road sections, and the description capacity of the destination association feature is further improved. The extraction process of the OD passenger flow residual convolution unit on the departure place dependence characteristic is similar to that of the destination dependence characteristic, and only the input OD passenger flow matrix needs to be transposed before extraction.

And constructing a simplified time sequence processing unit to extract nonlinear time sequence correlation characteristics from the plurality of spatial characteristics to complete time-space characteristic aggregation and obtain historical time-space prediction.

The method comprises the following steps:

1) stacking the spatial features in time to obtain a current day trend spatial feature set, a week interval period spatial feature set and a day interval period spatial feature set;

2) constructing a plurality of two-dimensional point-by-point convolution layers, and extracting nonlinear time sequence correlation characteristics from a current day trend space characteristic set, a week interval period space characteristic set and a day interval period space characteristic set through the continuous two-dimensional point-by-point convolution layers;

3) obtaining a current day trend space-time prediction value, a week interval period space-time prediction value and a day interval period space-time prediction value according to the extracted time sequence correlation characteristics;

4) and obtaining a historical space-time predicted value according to the current day trend space-time predicted value, the week interval period space-time predicted value and the day interval period space-time predicted value.

Please refer to fig. 9 and 10; FIG. 9 is a simplified schematic diagram of a sequential processing unit; FIG. 10 is a schematic diagram of a process for processing a time-series stacked feature matrix. As shown in fig. 9 and 10, a conventional Recurrent Neural Network (RNN), such as a long-term memory network (LSTM), can automatically learn long-term dependencies in a time series. However, predicting the OD passenger flow matrix of the entire traffic network requires RNN to have higher hidden layer and output layer feature dimensions, which requires a huge amount of training samples and computational resources, and is often difficult to satisfy in most application scenarios. According to the OD traffic time dependency feature described above, the OD traffic of the time period to be predicted has a strong correlation only with observations of several time periods, weeks or days before. In view of the small number of time periods in the two periodic dependencies and the daily trend dependency considered in the model, the present invention proposes a simplified timing processing unit (Simp _ SeqUnit).

Fig. 9 shows the complete structure of a simplified sequential processing unit. In the simplified time sequence processing unit, a plurality of continuous two-dimensional point-by-point convolution layers 1x1_ cov2 are used for extracting nonlinear time sequence correlation characteristics from the spatial characteristics of a plurality of time segments, and 1x1_ cov2 layers with an output channel of 1 complete final space-time characteristic aggregation. The detailed principle of the simplified sequential processing unit is shown in fig. 10. Compared with the traditional RNN, when the number of considered time periods is small, the simplified time sequence processing unit can utilize fewer parameters to realize the nonlinear time characteristic extraction and prediction of all station OD passenger flows, and is more flexible and efficient on a small data set.

Specifically, section c of FIG. 5 illustrates where simplified timing is utilizedAnd the processing unit extracts the OD passenger flow time dependence characteristics. Spatial features obtained by OD passenger flow residual convolution unit

、

And

stacking in time to obtain a current day trend spatial feature set U_tr (

) Periodic spatial feature set U_w（

) And a daily interval periodic spatial feature set U_d（

) As shown in formula (9).

（9）

Using reduced sequential processing unit pairs U, respectively_tr、U_wAnd U_dExtracting time sequence characteristics to obtain a time-space predicted value Z of the current day trend_tr (

) And a week interval period space-time predicted value Z_w（

) And the space-time prediction value Z of the day interval period_d（

) As shown in formula (10).

（10）

Wherein phi is a processing function of the simplified time sequence processing unit,

、

and

to simplify the learnable parameters of the sequential processing unit. Summing the three space-time predicted values to obtain a historical space-time predicted value Z_st，

As shown in formula (11).

（11）

Then, a preliminary prediction is obtained according to the historical space-time prediction and the external environment prediction.

Specifically, the historical spatio-temporal predicted value Z is calculated under the condition of considering the influence of external environmental factors_stAnd the external environment predicted value Z_Et(derived from Et by the ordinary linear layer) are summed to obtain a preliminary prediction for the t period

As shown in formula (12).

（12）

And then, introducing a non-zero attention mechanism of sparsity to obtain final prediction according to the preliminary prediction. The method comprises the following steps:

1) obtaining an OD passenger flow matrix mean value according to the weekly interval historical period data set, the daily interval historical period data set and the current day trend data set;

2) converting the OD passenger flow matrix mean value into a non-zero element attention matrix through a non-zero activation function;

3) and filtering the preliminary predicted value through a non-zero element attention matrix to obtain a final predicted value of the OD passenger flow matrix in a certain time period.

Additionally, on the basis of current-term trend dependence, daily interval period dependence, and weekly interval period dependence considered herein, monthly interval (4 weeks) period dependence and yearly interval period dependence (52 weeks) may be further considered. In step S21 of the method proposed by the present invention, a monthly interval period and yearly interval period historical data set is added, and a processing procedure similar to the daily interval period and yearly interval period historical data set is adopted to extract corresponding spatiotemporal features, so as to be used for final OD passenger flow prediction, which may be used as another embodiment of the present invention. The calculation of the historical spatiotemporal prediction value shown in equation (11) can be updated to equation (11-1) on the basis of consideration of the monthly interval cycle dependency and the yearly interval cycle dependency, where the sums are the monthly interval cycle and yearly interval cycle spatiotemporal prediction values, respectively.

（11-1）

Referring to fig. 11 again, fig. 11 is a schematic diagram of a non-zero element attention mechanism considering sparsity. As shown in fig. 11, in order to describe the sparsity of the OD passenger flow in the spatial distribution, in section d of fig. 5, the present invention also introduces a Non-zero attention mechanism (Non-zero Activation), the principle of which is shown in fig. 11. The sparsity of the traffic matrix remains stable due to the OD over time. Thus, the OD passenger flow matrix mean of all input data sets is used herein

(

) Known as sparsely distributed features. By the action of the non-zero activation function a,

is converted to a non-zero element attention matrix. Use it to predict the initial value

Filtering to obtain the final predicted value of the OD passenger flow matrix in the t period

Thus, the sparsity of the final prediction result is ensured, as shown in formula (13).

（13）

Where denotes the element-by-element multiplication of the matrix. As used herein, the non-zero activation function Lambda is as equation (14), and its parameter Lambda can be set according to the actual data distribution. When the parameter λ is large enough, Λ (x) approaches 1 even if the value of x is minute. Therefore, the designed non-zero attention mechanism not only ensures to a certain extent

The sparse structure of the network simultaneously limits the space-time feature extraction of the OD-SparsesSTnet network to be only carried out around non-zero elements, thereby improving

The prediction accuracy of (2).

（14）。

The OD passenger flow sparse space-time residual error network model is trained by using a back propagation rule of a deep neural network and an Adam algorithm. Specifically, a training sample set is formed according to a current day trend data set, a week interval period data set and a day interval period data set, and model training is carried out according to the training sample set by using a back propagation rule of a deep neural network and an Adam algorithm. The model training objective is to minimize the loss of the square Root of Mean Square Error (RMSE), which is shown in equation (15).

（15）

However, for the predicted sparse OD passenger flow matrix, since the general prediction deviation evaluation index, average absolute percentage error (MAPE), may have a dividend of 0, the present invention provides a total absolute percentage error (GAPE) to estimate the prediction accuracy of the entire OD passenger flow matrix, as shown in the following formula:

（16）

specifically, a set of normalized historical OD passenger flow matrix observations is input: { X1, X2, …, Xm-1 }; external environment information observation set: { E1, E2, …, Em-1 }; sequence length of weekly dependence history data, daily dependence history data and current trend history data: qw, qd, qtr weekly dependency history data, daily dependency history data sliding window: pw, pd; study period length: h (unit: minutes, total number of time periods contained in one day: I = 1440/h), an OD passenger flow sparse spatio-temporal residual network model was trained.

The invention verifies the advancement of the proposed OD passenger flow sparse space-time residual error network model by introducing 7 existing models as a reference. The basic settings of the reference model in 7 are as follows:

historical Average (HA): and directly carrying out historical synchronous averaging on the input weekly interval period data set and the daily interval period data set to serve as a predicted value of the OD passenger flow matrix at the predicted time.

Autoregressive moving average (ARIMA): autoregressive moving average is a classical model of time series prediction, which is used here to predict the time series of current day trends.

Three-dimensional convolutional network (CNN 3): the three-dimensional convolution network is directly utilized to carry out space and time feature simultaneous extraction (without independent processing of departure place and destination dependence) and prediction on the current day trend time series.

General Recurrent Neural Network (RNN): and firstly, extracting the characteristics of each OD passenger flow matrix in the time sequence of the daily trend, and then predicting by using the RNN.

Long and short term memory networks (LSTM): and (3) firstly, extracting the characteristics of each OD passenger flow matrix in the daily trend time sequence, and then predicting by using the LSTM.

Deep space-time network (deep st): the space-time deep neural network prediction model aiming at the space-time data is mainly used for carrying out people stream aggregation prediction in an urban range.

Space-time residual network (ST-ResNet): aiming at a space-time residual error network model of space-time data, a plurality of CNNs and corresponding residual error connections are stacked for space characteristic identification, and the method is mainly used for urban traffic flow prediction.

TABLE 17 comparison of predicted results of the reference model with the proposed OD-SparsesSTnet

Table 1 shows the predicted comparison of 7 reference models with the proposed OD-SparsesSTnet model over the 8:00-8:15 early peak (or 8:00-8:05 early peak). Because the HA method only partially considers the historical periodic dependence of the weekly interval and the daily interval, and the spatial dependence is not considered, the RMSE and GAPE values of the prediction result are the highest, and the prediction deviation is the largest. The ARIMA, the RNN and the LSTM are time sequence models essentially, can better identify the dependence of the trend of the day but cannot identify the dependence of the OD passenger flow in a complex space, and the prediction indexes of the ARIMA, the RNN and the LSTM are only superior to those of the HA method. Particularly, when the output characteristic dimension is large, due to the fact that a large number of hidden layer parameters are needed by the RNN and the LSTM, the method is difficult to be applied to OD passenger flow matrix prediction of the whole traffic network, and large prediction deviation exists.

The CNN3 can regard the time change of the OD passenger flow matrix as a third dimension, can extract the time-of-day dependence and space dependence characteristics to a certain extent, and the prediction result is superior to three time sequence models and HA methods. As a typical spatio-temporal data processing model, the DeepST and the ST-ResNet can simultaneously extract multiple time correlation and external factor characteristics, and the internal CNN can also identify spatial dependence to a certain extent. The prediction deviation of the two models is greatly reduced relative to other reference models. However, deep ST and ST-ResNet still cannot finely describe the dependency of origin and destination specific to OD traffic, and there is no consideration for sparse distribution of data.

The proposed OD-SparsesSTnet further introduces a non-zero activation mechanism to describe the sparse distribution characteristic of OD passenger flows on the basis of considering the complex time dependence (week interval period dependence, day interval period dependence, current day trend dependence) and space dependence (departure place dependence and destination dependence) specific to the OD passenger flows of the traffic line network. The prediction deviation is obviously lower than that of the existing space model (CNN 3), time sequence model (ARIMA, RNN and LSTM) and space-time model (DeepsT and ST-ResNe), and the prediction deviation is reduced by more than 14.89%.

Referring to fig. 4, fig. 4 is a schematic structural diagram of an extraction system. As shown in fig. 4, the extraction system of the present invention comprises:

the data set construction unit 11: selecting the spatial correlation data to obtain at least one periodic data set and a trend data set;

feature map obtaining unit 12: processing at least one periodic data set based on a time sliding mechanism, and performing feature extraction and aggregation on the processed at least one periodic data set to obtain a feature map;

residual convolution unit construction unit 13: and constructing a residual convolution unit, and extracting spatial correlation characteristics according to the trend data set and the characteristic diagram.

The data set constructing unit 11 selects the cycle data set and the trend data set for the prediction target at different time periods according to the spatial correlation data.

Wherein the at least one periodic data set includes at least one of a weekly interval historical period data set, a daily interval historical period data set, a monthly interval historical period data set, a yearly interval historical period data set, and a current day trend data set, the trend data set being a current day trend data set.

Further, the feature map obtaining unit 12 includes:

the data set processing module 121: processing at least one periodic data set based on a time sliding mechanism to correspondingly obtain at least one sliding periodic data set;

the feature map obtaining module 122: and performing feature extraction and aggregation on at least one sliding period data set through a two-dimensional point-by-point convolution layer to obtain a feature map.

Still further, the residual convolution unit construction unit 13 includes:

the dependent feature extraction module 131: extracting the characteristics of departure place dependence and destination dependence by stacking a plurality of depth-separated one-dimensional convolution layers;

the spatial correlation feature output module 132: and extracting a spatial correlation structure through the one-dimensional point-by-point convolutional layer, and aggregating the departure place dependence and destination dependence characteristics to output spatial correlation characteristics.

In conclusion, for the complex spatial correlation of the spatial correlation data, a residual convolution unit is constructed and the characteristics of departure place dependence and destination dependence are identified, so that the accurate description of the complex spatial dependence is realized, and more accurate data support is provided for the future time-space prediction.

Although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A method for extracting spatial correlation features is characterized by comprising the following steps:

step S1: selecting the spatial correlation data to obtain at least one periodic data set and a current day trend data set; wherein the at least one periodic data set comprises at least one of a weekly interval historical period data set, a daily interval historical period data set, a monthly interval historical period data set, and an annual interval historical period data set;

step S2: processing at least one periodic data set based on a time sliding mechanism, and performing feature extraction and aggregation on the processed at least one periodic data set to obtain a feature map; the step S2 includes:

step S22: performing feature extraction and aggregation on at least one sliding period data set through a two-dimensional point-by-point convolution layer to obtain a feature map;

step S3: constructing a residual convolution unit, and extracting spatial correlation characteristics according to the current day trend data set and the characteristic diagram;

in step S3, the residual convolution unit uses a row in the OD passenger flow matrix as a channel dimension, and converts the OD passenger flow matrix into a 1 × N-dimensional picture including N channels, where each channel represents an OD passenger flow structure from a specific station, and step S3 includes:

2. The extraction method according to claim 1, wherein the step S1 includes:

and selecting at least one periodic data set and a current day trend data set by taking different time periods as prediction targets according to the spatial correlation data.

3. A system for extracting spatially correlated features, comprising:

a data set construction unit: selecting the spatial correlation data to obtain at least one periodic data set and a current day trend data set; wherein the at least one periodic data set comprises at least one of a weekly interval historical period data set, a daily interval historical period data set, a monthly interval historical period data set, and an annual interval historical period data set;

a feature map obtaining unit: processing at least one periodic data set based on a time sliding mechanism, and performing feature extraction and aggregation on the processed at least one periodic data set to obtain a feature map; the feature map obtaining unit includes:

a data set processing module: processing at least one periodic data set based on a time sliding mechanism to correspondingly obtain at least one sliding periodic data set;

a characteristic diagram obtaining module: performing feature extraction and aggregation on at least one sliding period data set through a two-dimensional point-by-point convolution layer to obtain a feature map;

residual convolution unit construction unit: constructing a residual convolution unit, and extracting spatial correlation characteristics according to the current day trend data set and the characteristic diagram;

the residual convolution unit construction unit takes a row in an OD passenger flow matrix as a channel dimension, converts the OD passenger flow matrix into a 1 xN dimensional picture containing N channels, and each channel represents an OD passenger flow structure starting from a specific station, and comprises:

4. The extraction system according to claim 3, wherein the data set construction unit selects at least one periodic data set and a current day tendency data set for the prediction target at different time periods based on the spatial correlation data.