CN117217445A

CN117217445A - Method and device for predicting freight demand of multiple railway stations

Info

Publication number: CN117217445A
Application number: CN202311067340.1A
Authority: CN
Inventors: 董海荣; 连文博; 吴兴堂; 马建军; 王斌; 周敏
Original assignee: Beijing Jiaotong University
Current assignee: Beijing Jiaotong University
Priority date: 2023-08-23
Filing date: 2023-08-23
Publication date: 2023-12-12

Abstract

The invention provides a method and a device for predicting the freight demands of multiple railway stations, wherein the predicting method comprises the following steps: s1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network; s2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step. The technical problem that the influence of space-time characteristics of a railway topology network on prediction precision is ignored in the traditional railway freight demand is solved.

Description

Method and device for predicting freight demand of multiple railway stations

Technical Field

The invention relates to the technical field of railway freight demand prediction, in particular to a method and a device for predicting railway multi-station freight demand.

Background

The railway freight has the advantages of large freight volume, high speed, energy saving and the like, occupies a great proportion in a transportation system, has a low 2022 year ago, has an operating mileage of 15.5 km, and has a total delivery capacity of 49.84 hundred million tons in a national railway freight year. The scientific prediction of the development trend of the freight traffic of the railway has positive practical significance for the adjustment of the railway transportation structure, the optimization of the social resource allocation and the improvement of the railway freight management level.

However, railway freight is less active than passenger transport, the freight needs to be manually negotiated and ordered, and complex processes such as transportation, transfer and classification are required to be carried out in the transportation process, so that the transportation service cannot be directly and simply purchased and automatically arrived and departed like passengers, and the transportation process is greatly influenced by unreliability, which makes the establishment of railway freight schedules extremely difficult. In addition, the railway freight requirement has complex characteristics of multiple scattered and concentrated freight requirements, multiple-class freight classification and assembly, multiple-destination batch transportation and the like, so that the railway freight requirement overall shows nonlinear variation. The current freight demand prediction mainly comprises medium-long-term prediction based on a time sequence prediction method, wherein the prediction mainly comprises coarse granularity prediction taking quarterly and year as time units, and the fine granularity prediction on the short-term freight demand of a station is lacking; meanwhile, due to the internal coupling relation between freight station sites, the freight flow transportation is influenced by the space-time characteristics of different sites, so that the fusion of the space-time characteristics in freight flow data plays an important role in improving the prediction precision of freight demand.

For this reason, there is a need for an improved method of predicting railroad freight demand.

Disclosure of Invention

First, the technical problem to be solved

In view of the above-mentioned shortcomings and disadvantages of the prior art, the present invention provides a method and apparatus for predicting a railway multi-station freight demand, which solve the technical problem that the traditional railway freight demand ignores the influence of space-time characteristics of a railway topology network on prediction accuracy.

(II) technical scheme

In order to achieve the above purpose, the main technical scheme adopted by the invention comprises the following steps:

in a first aspect, the present invention provides a method for predicting freight demand for multiple railway stations, comprising:

s1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;

s2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step;

the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into a first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic values of all stations in a space-time diagram of a future preset time step;

the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.

Optionally, the constructed railway freight network time space diagram isWherein (1)>Representing freight characteristic quantity of n stations at time t in a railway freight network topology map, epsilon representing connectivity between stations in the railway freight network topology map, and +.>Representation->Is a weighted adjacency matrix,/>Representing a real number;

the weighted adjacency matrix W is obtained according to interval distances among n stations in the railway freight network topological graph, and the expression is as follows:

wherein d _ij Is the distance between sites i and j; sigma (sigma) ² And λ is the distribution threshold and sparsity threshold of W, respectively; omega _ij Is an element in W, represented by d _ij The weight of the edge determined.

Optionally, the shipment characteristic includes shipment delivery and truck loading rate.

Optionally, between S1 and S2 further comprises:

s11, preprocessing a time sequence; the pretreatment step comprises the following steps:

a1, replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight transmission capacity in the time sequence, and replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight full rate in the time sequence;

a2, carrying out normalization processing on the independent variable of the time sequence;

s2, inputting the preprocessed time sequence and the preprocessed space matrix into a pre-trained space-time diagram convolutional neural network.

Optionally, the time-gated convolutional layer is composed of a one-dimensional causal convolutional kernel and a nonlinear gated linear unit.

Optionally, the expression of the spatial convolution layer is:

wherein Θ is a graph convolution kernel; x is the input signal of the spatial convolution layer;vector representing polynomial coefficients of chebyshev polynomial limiting Θ to Λ,/->I _n Is a unit matrix; />Is an angle matrix, wherein D _ij ＝∑ _j W _ij ；/>Meaning that x has C _i A signal of the channel; c (C) ₀ An output channel which is a space convolution layer; y is _j Having C as a spatial convolution layer _j An output of the channel;

the gating time convolution layer has the expression:

wherein,is a convolution kernel; />Is represented as having C _i The input signal of the channel, M is the length of the input sequence of the space-time convolution module; p, Q are the inputs of gates in the gated linear units, P being related to the constituent structure and dynamic variance in the time series, σ (Q) controlling the corresponding input values of the current state; the product of Hadamard of the element is indicated.

Optionally, the expression of the spatio-temporal convolution module is:

wherein,representing the input of a space-time convolution module I, M is the length of an input sequence of the space-time convolution module, n is the number of stations in a railway freight network topological graph, and C ^l An input channel of the space-time convolution module l;representing the output, K, of the spatio-temporal convolution module l _t Is the width of a one-dimensional causal convolution kernel, C ^l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) ^l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.

Optionally, the mapping, by the output layer, the convolution time sequence output by the last space-time convolution module to freight feature quantities of all stations in a space-time diagram of a preset time step in the future includes:

wherein,representing the output of the last spatio-temporal convolution module; />Is a weight vector; b is the deviation.

Optionally, the space-time convolutional neural network comprises two space-time convolutional modules and one output layer which are sequentially connected.

Optionally, before S1, the method further includes: receiving a railway freight network time space diagram for training, and generating a training sample; based on the loss function, the training samples are input into a space-time diagram convolutional neural network for model training.

In a second aspect, the present invention provides a railway multi-station freight demand prediction apparatus comprising:

the acquisition module is used for acquiring a railway freight network topological graph and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;

the prediction module is used for inputting the time sequence and the space matrix into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future;

the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a future preset time step; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.

(III) beneficial effects

The beneficial effects of the invention are as follows:

according to the railway multi-station freight demand prediction method, the railway freight network time-space diagram is constructed based on the railway freight network topological diagram, so that the time-space characteristics of the railway freight network topology can be extracted, the time-space diagram is input into the time-space diagram convolutional neural network to predict freight demands of stations in the topological diagram, the influence of the railway topological network time-space characteristics on station freight demand prediction is considered, and the prediction accuracy is higher.

Drawings

FIG. 1 is a flow chart of a method for predicting freight demand for multiple railway stations according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of a space-time convolutional neural network according to an embodiment of the present invention;

FIG. 3 is a railway freight network topology according to an embodiment of the present invention;

FIG. 4 is a adjacency matrix heatmap of a railway network topology;

FIG. 5 is a schematic diagram of a shipping volume sequence for the east well set station of FIG. 3 over a historical time step;

FIG. 6 is a schematic diagram of a truck loading rate sequence for the east well set station of FIG. 3 over a historical time step.

Detailed Description

The invention will be better explained by the following detailed description of the embodiments with reference to the drawings.

According to the railway multi-station freight demand prediction method provided by the embodiment of the invention, the railway freight network time-space diagram is constructed based on the railway freight network topological diagram, so that the time-space characteristics of the railway freight network topology can be extracted, the time-space diagram is input into the time-space diagram convolutional neural network to predict the freight demands of stations in the topological diagram, the influence of the railway topological network time-space characteristics on the station freight demand prediction is considered, and the prediction accuracy is higher.

In order that the above-described aspects may be better understood, exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present invention are shown in the drawings, it should be understood that the present invention may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

Fig. 1 is a flow chart of a method for predicting the freight demand of multiple railway stations according to the present embodiment. As shown in fig. 1, the method for predicting the freight demand of multiple railway stations according to the present embodiment includes the following steps:

step S1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network.

The railway freight network topology graph is a network graph constructed by taking stations as nodes and taking intervals as edges, as shown in fig. 3.

Preferably, the time-space diagram of the railway freight network is constructed as followsWherein (1)>Is a limited vertex set and represents freight characteristic quantity of n stations at time t in a railway freight network topological graph; epsilon represents connectivity between sites in the railway freight network topology; />Representation->Is a weighted adjacency matrix,/>Representing a real number. Thus, the structured railway freight network time space diagram focuses on the structured freight time sequence, v _t Are not independent but are connected by pairs of edges in the topology.

Further, the weighted adjacency matrix W is obtained according to the interval distance among n stations in the railway freight network topological graph, and the expression is:

wherein d _ij Is the distance between sites i and j; sigma (sigma) ² And λ is the distribution threshold and sparsity threshold of W, respectively; omega _ij Is an element in W, represented by d _ij The weight of the edge determined. Further preferably, sigma ² 10 and lambda of 0.5.

Preferably, the shipment characteristic amount includes a shipment transmission amount and a truck loading rate. I.e.Each node of (a) has two shipping characteristics: freight delivery and truck loading rate. Wherein the "freight delivery amount" corresponds to the total weight of the freight delivered by the station in a specific time, generally in tons, and the "freight rate" corresponds to the ratio of the actual weight of the loaded freight in the railway freight car delivered by the station in a specific time to the maximum freight weight that the freight car can accommodate, expressed as a percentage. In this way, in this embodiment, the freight transmission amount and the truck loading rate in the actual physical topological graph are used as node characteristics, so that space-time correlation between different nodes (stations) is sought, and further the freight transmission amount is predicted.

Further preferably, the "freight delivery amount" corresponds to the total weight of the freight delivered by the station in one day, and the "freight full rate" corresponds to the ratio of the actual weight of the loaded freight in the railway freight car delivered by the station in one day to the maximum freight weight that the freight car can accommodate. Therefore, the problem of coarse granularity of the traditional prediction space-time is solved by taking the fine granularity time sequence as the data base of prediction, and the short-term demand of the station freight can be predicted.

Step S11, preprocessing the time sequence.

Preferably, the step of pre-treating comprises:

a1, replacing a value with zero freight transmission quantity in the time sequence by using an average value of freight transmission quantity in the time sequence, and replacing a value with zero freight transmission quantity in the time sequence by using an average value of freight full rate in the time sequence. Thus, the problem of excessive zero values in the historical data is solved.

A2, carrying out normalization processing on the independent variable of the time sequence. In this way, the influence of different dimensions on the prediction accuracy can be eliminated.

Further preferably, the independent variable of the time series is Z-Score normalized.

And S2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step.

The space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs the convolved time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolved time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.

In the space-time diagram convolutional neural network provided by the embodiment, each space-time convolution module comprises a three-layer structure, namely an upper layer and a lower layer are gating time convolution layers for extracting time features, and a middle layer is a space diagram convolution layer for extracting space features, so that a sandwich structure is formed. The space layer is connected with the two time layers, so that the space state from graph convolution to time convolution can be rapidly propagated; the sandwich structure is beneficial to the neural network to fully apply bottleneck strategies, and the channels are downscaled and upscaled through the graph convolution layer, so that the scale compression and the feature compression are realized. Therefore, the space-time diagram is input into the space-time diagram convolutional neural network, useful space-time information is extracted from the space-time diagram by applying the convolutional theorem and the map theory, the freight demand of the station in the topological diagram is predicted, the influence of the space-time characteristics of the railway topological network on the freight demand prediction of the station is considered, and the prediction accuracy is higher.

Specifically, layer normalization is used in each spatio-temporal convolution module to prevent overfitting.

Preferably, in the space-time convolution module, the expression of the spatial convolution layer is:

wherein Θ is a graph convolution kernel; x is the input signal of the spatial convolution layer;vector representing polynomial coefficients of chebyshev polynomial limiting Θ to Λ,/->I _n Is a unit matrix; />Is an angle matrix, wherein D _ij ＝∑ _j W _ij ；/>Meaning that x has C _i A signal of the channel; c (C) ₀ An output channel which is a space convolution layer; y is _j Having C as a spatial convolution layer _j The output of the channel.

Specifically, the derivation process of the spatial convolution layer expression is as follows.

Graph convolution is used as a signal by introducing a concept of a graph convolution operator' based on a concept of spectrogram convolution (utilizing a graph theory)Product of the product with the convolution kernel Θ:

Θ*x＝Θ(L)x＝Θ(UΛU ^T )x＝UΘ(Λ)U ^T x (1)；

wherein the method comprises the steps ofI _n Is an identity matrix>Is an angle matrix, D _ij ＝∑ _j W _ij ；/>Is a eigenvector matrix of the normalized graph Laplace matrix L; u (U) ^T Is the transposition of U; />Is the diagonal matrix of eigenvalues of L; Θ (Λ) is the angle matrix.

The graph signal x is filtered by the kernel Θ and fourier transformed at Θ and graph U according to the definition of equation (1) ^T Multiplying x, calculating the graph convolution kernel theta by the formula (1), wherein the complexity of the method is O (n) ² ) Too time consuming, two approximation strategies are applied to overcome this problem, namely Chebyshev polynomial approximation and first order approximation.

The basic idea of Chebyshev polynomial approximation is to locate the filter and reduce the number of parameters, limiting the graph convolution kernel Θ to polynomials of Λ, such asWherein (1)>Is a vector of polynomial coefficients, K is the kernel size of the graph convolution, determining the maximum radius from the center node to the convolution. Conventionally, chebyshev polynomials T _k (x) The kernel can be approximated as a truncated expansion of the K-1 order, e.g. +.>Rescaling->λ _max Representing the maximum eigenvalue of L. Thus, applying the Chebyshev polynomial approximation strategy, equation (1) can be transformed into:

wherein,is in scaling Laplace operator +.>The K-th order chebyshev polynomial calculated below.

By recursively computing the K partial convolution with polynomial approximation, the complexity of equation (2) can be reduced. Due to scaling and normalization in neural networks, λ can be further assumed _max Approximately 2, equation (2) can be reduced to:

wherein θ ₀ 、θ ₁ Is two shared parameters of the graph convolution kernel.

To constrain the parameters and stabilize the numerical performance, θ ₀ And theta ₁ Replaced by a single parameter θ, let θ=θ ₀ ＝-θ ₁ The method comprises the steps of carrying out a first treatment on the surface of the W and D are respectively passed throughAnd->And (5) renormalizing. The graph convolution can be expressed as:

the information of the (K-1) order neighborhood from the central node is utilized to realize the graph convolution stack with the first order approximation, and the effect similar to the K local convolution level is realized. In this case, K is the number of continuous filtering operations or convolution layers in the model. Furthermore, hierarchical linear structures are parametrically economical and efficient for large scale graphs, as the approximate order is limited to 1. Is defined inThe above volume integrator "x" can be extended to multidimensional tensors, for having C _i Signal of channel->The graph convolution can be summarized as:

wherein,with C _i ×C _o Vector, C _i ×C _o Input of feature map respectivelyAnd the magnitude of the output, and,

preferably, the time-gated convolutional layer consists of a one-dimensional causal convolutional kernel and a nonlinear gated linear unit.

Preferably, in the space-time convolution module, the gating time convolution layer has the expression:

wherein,for convolution kernel, it is intended to map the input Y to a single output element +.>P, Q is divided into two halves, and the sizes are the same; />Is represented as having C _i The input signal of the channel, M is the length of the input sequence of the space-time convolution module; p, Q are the inputs of gates in the gated linear units, P being related to the constituent structure and dynamic variance in the time series, σ (Q) controlling the corresponding input values of the current state; the product of Hadamard of the element is indicated.

Preferably, the expression of the spatio-temporal convolution module is:

wherein the input and output of the space-time convolution module are 3-D tensors;representing the input of the space-time convolution module, M is the length of the input sequence of the space-time convolution module, and n is the railway freightNumber of stations in network topology, C ^l The number of input channels is the space-time convolution module l; />Representing the output, K, of the spatio-temporal convolution module l _t Is the width of a one-dimensional causal convolution kernel, C ^l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) ^l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.

As one example, the expression for the linear activation function is:

where x is the input signal.

Preferably, channel c=16 of the spatial convolution layer and channel c=64 of the gating time convolution layer. In this way, the space-time diagram convolutional neural network prediction is made more accurate.

Preferably, the space-time convolutional neural network comprises two space-time convolutional modules and one output layer which are connected in sequence. In this way, the space-time diagram convolutional neural network prediction is made more accurate.

Preferably, the mapping, by the output layer, the convolution time sequence output by the last space-time convolution module to freight feature quantities of all stations in a space-time diagram of a preset time step in the future includes:

Preferably, before step S1, the method further comprises: receiving a railway freight network time space diagram for training, and generating a training sample; based on the loss function, the training samples are input into a space-time diagram convolutional neural network for model training.

Specifically, training a time space graph convolutional neural network model according to preset iteration step number, batch size, learning rate, learner and other parameters, and manually adjusting model parameters according to loss results to enable loss values to be converged and at optimal values.

Preferably, the expression of the loss function is:

wherein W is _θ Is all trainable parameters in the model; v _t+1 Is a true value;is a predictive value of the model.

According to the railway multi-station freight demand prediction method provided by the embodiment, after the railway freight demand is severely increased, the freight flow to be increased in a specific area and a station is predicted in time, trucks are allocated in time and grouped in time, and an order of an employer is completed according to the points in time; when the railway freight demand is reduced, the idle trucks are allocated in time to participate in other transportation orders, so that the utilization rate of railway transportation resources is maximized; for railway freight dispatchers, the change condition of future freight flows can be predicted in advance, errors caused by manual work under the condition of hasty handling are avoided, the formulation of freight grouping strategies is more reasonable and intelligent, and the high intelligence of railway freight management level is improved. The auxiliary dispatching department timely senses the fluctuation of future freight demands, timely adjusts the freight train grouping plan to meet new freight demands, improves the capacity of freight organizations, maximizes the utilization rate of train resources and promotes the development of the railway system for adjusting the traffic flow to process the traffic flow from passive reaction to active decision.

To measure and evaluate the performance of the rail multi-station freight demand prediction method proposed in this embodiment, L is selected ₂ The loss function was used as the loss function of the model and the space-time diagram convolutional neural network model proposed in this example was compared with the following reference model:

(1) Attention network (Graph attention network, GAT)

Importance among nodes of the graph structure data is allocated by using an attention mechanism, and the graph structure data belongs to graph neural network representation based on airspace.

(2) Convolutional neural network of figure (Graph Convolutional Neural Network, GCN)

The graph convolution neural network is a graph neural network representation based on a spectral domain, after graph embedding representation learning is carried out on graph data, the problem of high complexity caused by direct application of a convolution idea in the traditional CNN to the graph embedding representation data is solved, and the graph convolution neural network is a further version of a chebnet model.

(3) Chebnet model

The model is an improvement to the convolution kernel using chebyshev polynomials (ChebShev Polynomial) and demonstrates that computational complexity can be reduced.

Specifically, as the index for measuring and evaluating the model performance, average absolute error (Mean Absolute Error, MAE), average absolute percent error (Mean Absolute Percentage Error, MAPE), and root mean square error (Root Mean Square Error, RMSE) are used, and the formula is as follows:

wherein y is _i Representing the actual value;representing the predicted value; n represents the number of predictions; the smaller the RMSE, MAPE and MAE values, the smaller the error between the predicted and actual values; note that y cannot be 0.

Specifically, the model performance is evaluated, when a comparison test is performed, parameters such as iteration step number, batch size, learning rate, learner and the like are required to be set uniformly, and then a model result is obtained by analyzing MAE, MAPE, RMSE and time consuming procedures and analyzing a comparison result.

The model performance will be evaluated by comparative experiments in conjunction with railway specific networks. 20 interconnected stations are selected to construct a railway freight network topology as shown in fig. 3, and a weighted adjacency matrix of the railway freight network topology is shown in fig. 4. 20 station shipment data from day-to-day records of 2021, 1-1, 12-31, 2022 were used and named "daqin dataset". The main freight characteristics considered in this data are freight volume and full rate. To ensure data integrity and interpretability, an Dongjin set station in the Dafraxinus dataset was chosen as a representative example, as shown in FIGS. 5 and 6.

It should be noted that due to the limitations of the acquisition system, the shipping data may not be recorded every day, resulting in many zero values in the raw data. In this embodiment, this problem is solved by replacing the zero value in 759 instances of the large ash dataset with the average of the two years of collected data. During the performance of the simulation experiments of the present invention, the data set of fraxinus was divided into a training set (containing 80% of the data) and a test set (containing 20% of the data). Experiments were performed on a Lennovo computer equipped with Intel (R) Core (TM) i5-6500 CPU operating at 3.20GHz with 4 CPUs and 16GB of memory. For comparison experiments, GAT, GCN and Chebnet were chosen as reference models. In order to reduce the computational complexity, the model parameters of the STGCN model and the comparison model in all simulation experiments are required to be consistent. Specifically, the freight volume of the past 12 time steps (i.e., h=12) was predicted for 1 time step in the future, and experimental analysis and evaluation were performed on the space-time convolutional neural network, GAT, GCN, and Chebnet based on the actual freight data set, and the results are shown in table 1.

Table 1 comparison of freight demand and forecast results for "daqin" actual railway network

As can be seen from table 1, MAE, RMSE, RMAPE of the space-time diagram convolutional neural network model is 0.059, 0.09 and 0.01 respectively, and simulation results show that compared with other reference models, the space-time diagram convolutional neural network model has the highest prediction precision and the lowest time consumption. The space-time diagram convolutional neural network model can be used for capturing the inherent complex space-time relationship in the railway freight demand data, and can provide more accurate information for railway freight demand prediction.

The embodiment also provides a railway multi-station freight demand prediction device, which comprises: the acquisition module is used for acquiring a railway freight network topological graph and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network; the prediction module is used for inputting the time sequence and the space matrix into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future. The space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a future preset time step; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.

It should be noted that, specific functions of each module in the railway multi-station freight demand prediction device and the railway multi-station freight demand prediction flow provided in this embodiment may refer to the detailed description of the railway multi-station freight demand prediction method provided in the foregoing embodiment, and will not be repeated herein.

In summary, the railway multi-station freight demand prediction device provided by the embodiment of the invention constructs the railway freight network time-space diagram based on the railway freight network topological diagram, can extract the time-space characteristics of the railway freight network topology, inputs the time-space diagram into the time-space diagram convolutional neural network to predict the freight demands of stations in the topological diagram, considers the influence of the time-space characteristics of the railway topological network on the prediction of the freight demands of the stations, and ensures higher prediction accuracy.

In the description of the present invention, it should be understood that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present invention, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.

In the present invention, unless explicitly specified and limited otherwise, the terms "mounted," "connected," "secured," and the like are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally formed; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium; may be a communication between two elements or an interaction between two elements. The specific meaning of the above terms in the present invention can be understood by those of ordinary skill in the art according to the specific circumstances.

In the present invention, unless expressly stated or limited otherwise, a first feature is "on" or "under" a second feature, which may be in direct contact with the first and second features, or in indirect contact with the first and second features via an intervening medium. Moreover, a first feature "above," "over" and "on" a second feature may be a first feature directly above or obliquely above the second feature, or simply indicate that the first feature is higher in level than the second feature. The first feature being "under", "below" and "beneath" the second feature may be the first feature being directly under or obliquely below the second feature, or simply indicating that the first feature is level lower than the second feature.

In the description of the present specification, the terms "one embodiment," "some embodiments," "examples," "particular examples," or "some examples," etc., refer to particular features, structures, materials, or characteristics described in connection with the embodiment or example as being included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, the different embodiments or examples described in this specification and the features of the different embodiments or examples may be combined and combined by those skilled in the art without contradiction.

While embodiments of the present invention have been shown and described above, it will be understood that the above embodiments are illustrative and not to be construed as limiting the invention, and that alterations, modifications, substitutions and variations may be made in the above embodiments by those skilled in the art within the scope of the invention.

Claims

1. A method for predicting freight demand for multiple railway stations, comprising:

2. The method for predicting freight demand for multiple stations on a railway in accordance with claim 1, wherein,

the constructed railway freight network time-space diagram is thatWherein (1)>Represents freight characteristic quantity of n stations at time t in the railway freight network topology diagram, epsilon represents connectivity among stations in the railway freight network topology diagram,representation->Is a weighted adjacency matrix,/>Representing a real number;

3. The railway multi-station freight demand prediction method according to claim 1 or 2, wherein the freight characteristic quantity includes freight delivery quantity and freight full rate.

4. A method of predicting a freight demand for a railway plural station as in claim 3, further comprising between S1 and S2:

5. The method for predicting freight demand for multiple stations on a railway in accordance with claim 2, wherein,

the time-gated convolution layer consists of a one-dimensional causal convolution kernel and a nonlinear gated linear unit.

6. The method for predicting freight demand for multiple stations on a railway in accordance with claim 2, wherein,

the expression of the spatial convolution layer is:

the gating time convolution layer has the expression:

7. The method of claim 5, wherein the expression of the space-time convolution module is:

wherein,representing the input of a space-time convolution module I, M is the length of an input sequence of the space-time convolution module, n is the number of stations in a railway freight network topological graph, and C ^l The number of input channels is the space-time convolution module l;representing the output, K, of the spatio-temporal convolution module l _t Is the width of a one-dimensional causal convolution kernel, C ^l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) ^l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.

8. The method for predicting freight demand for multiple stations on a railway as set forth in claim 6 or 7, wherein the outputting layer maps the convolution time sequence outputted by the last space-time convolution module to freight feature values of all stations in a space-time diagram of a preset time step in the future, comprising:

9. The method of claim 1, wherein the space-time convolutional neural network comprises two space-time convolutional modules and one output layer connected in sequence.

10. A railway multi-station freight demand prediction apparatus, comprising: