CN117217445A - Method and device for predicting freight demand of multiple railway stations - Google Patents

Method and device for predicting freight demand of multiple railway stations Download PDF

Info

Publication number
CN117217445A
CN117217445A CN202311067340.1A CN202311067340A CN117217445A CN 117217445 A CN117217445 A CN 117217445A CN 202311067340 A CN202311067340 A CN 202311067340A CN 117217445 A CN117217445 A CN 117217445A
Authority
CN
China
Prior art keywords
time
space
freight
diagram
railway
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311067340.1A
Other languages
Chinese (zh)
Inventor
董海荣
连文博
吴兴堂
马建军
王斌
周敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jiaotong University
Original Assignee
Beijing Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jiaotong University filed Critical Beijing Jiaotong University
Priority to CN202311067340.1A priority Critical patent/CN117217445A/en
Publication of CN117217445A publication Critical patent/CN117217445A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a method and a device for predicting the freight demands of multiple railway stations, wherein the predicting method comprises the following steps: s1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network; s2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step. The technical problem that the influence of space-time characteristics of a railway topology network on prediction precision is ignored in the traditional railway freight demand is solved.

Description

Method and device for predicting freight demand of multiple railway stations
Technical Field
The invention relates to the technical field of railway freight demand prediction, in particular to a method and a device for predicting railway multi-station freight demand.
Background
The railway freight has the advantages of large freight volume, high speed, energy saving and the like, occupies a great proportion in a transportation system, has a low 2022 year ago, has an operating mileage of 15.5 km, and has a total delivery capacity of 49.84 hundred million tons in a national railway freight year. The scientific prediction of the development trend of the freight traffic of the railway has positive practical significance for the adjustment of the railway transportation structure, the optimization of the social resource allocation and the improvement of the railway freight management level.
However, railway freight is less active than passenger transport, the freight needs to be manually negotiated and ordered, and complex processes such as transportation, transfer and classification are required to be carried out in the transportation process, so that the transportation service cannot be directly and simply purchased and automatically arrived and departed like passengers, and the transportation process is greatly influenced by unreliability, which makes the establishment of railway freight schedules extremely difficult. In addition, the railway freight requirement has complex characteristics of multiple scattered and concentrated freight requirements, multiple-class freight classification and assembly, multiple-destination batch transportation and the like, so that the railway freight requirement overall shows nonlinear variation. The current freight demand prediction mainly comprises medium-long-term prediction based on a time sequence prediction method, wherein the prediction mainly comprises coarse granularity prediction taking quarterly and year as time units, and the fine granularity prediction on the short-term freight demand of a station is lacking; meanwhile, due to the internal coupling relation between freight station sites, the freight flow transportation is influenced by the space-time characteristics of different sites, so that the fusion of the space-time characteristics in freight flow data plays an important role in improving the prediction precision of freight demand.
For this reason, there is a need for an improved method of predicting railroad freight demand.
Disclosure of Invention
First, the technical problem to be solved
In view of the above-mentioned shortcomings and disadvantages of the prior art, the present invention provides a method and apparatus for predicting a railway multi-station freight demand, which solve the technical problem that the traditional railway freight demand ignores the influence of space-time characteristics of a railway topology network on prediction accuracy.
(II) technical scheme
In order to achieve the above purpose, the main technical scheme adopted by the invention comprises the following steps:
in a first aspect, the present invention provides a method for predicting freight demand for multiple railway stations, comprising:
s1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;
s2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step;
the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into a first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic values of all stations in a space-time diagram of a future preset time step;
the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
Optionally, the constructed railway freight network time space diagram isWherein (1)>Representing freight characteristic quantity of n stations at time t in a railway freight network topology map, epsilon representing connectivity between stations in the railway freight network topology map, and +.>Representation->Is a weighted adjacency matrix,/>Representing a real number;
the weighted adjacency matrix W is obtained according to interval distances among n stations in the railway freight network topological graph, and the expression is as follows:
wherein d ij Is the distance between sites i and j; sigma (sigma) 2 And λ is the distribution threshold and sparsity threshold of W, respectively; omega ij Is an element in W, represented by d ij The weight of the edge determined.
Optionally, the shipment characteristic includes shipment delivery and truck loading rate.
Optionally, between S1 and S2 further comprises:
s11, preprocessing a time sequence; the pretreatment step comprises the following steps:
a1, replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight transmission capacity in the time sequence, and replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight full rate in the time sequence;
a2, carrying out normalization processing on the independent variable of the time sequence;
s2, inputting the preprocessed time sequence and the preprocessed space matrix into a pre-trained space-time diagram convolutional neural network.
Optionally, the time-gated convolutional layer is composed of a one-dimensional causal convolutional kernel and a nonlinear gated linear unit.
Optionally, the expression of the spatial convolution layer is:
wherein Θ is a graph convolution kernel; x is the input signal of the spatial convolution layer;vector representing polynomial coefficients of chebyshev polynomial limiting Θ to Λ,/->I n Is a unit matrix; />Is an angle matrix, wherein D ij =∑ j W ij ;/>Meaning that x has C i A signal of the channel; c (C) 0 An output channel which is a space convolution layer; y is j Having C as a spatial convolution layer j An output of the channel;
the gating time convolution layer has the expression:
wherein,is a convolution kernel; />Is represented as having C i The input signal of the channel, M is the length of the input sequence of the space-time convolution module; p, Q are the inputs of gates in the gated linear units, P being related to the constituent structure and dynamic variance in the time series, σ (Q) controlling the corresponding input values of the current state; the product of Hadamard of the element is indicated.
Optionally, the expression of the spatio-temporal convolution module is:
wherein,representing the input of a space-time convolution module I, M is the length of an input sequence of the space-time convolution module, n is the number of stations in a railway freight network topological graph, and C l An input channel of the space-time convolution module l;representing the output, K, of the spatio-temporal convolution module l t Is the width of a one-dimensional causal convolution kernel, C l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.
Optionally, the mapping, by the output layer, the convolution time sequence output by the last space-time convolution module to freight feature quantities of all stations in a space-time diagram of a preset time step in the future includes:
wherein,representing the output of the last spatio-temporal convolution module; />Is a weight vector; b is the deviation.
Optionally, the space-time convolutional neural network comprises two space-time convolutional modules and one output layer which are sequentially connected.
Optionally, before S1, the method further includes: receiving a railway freight network time space diagram for training, and generating a training sample; based on the loss function, the training samples are input into a space-time diagram convolutional neural network for model training.
In a second aspect, the present invention provides a railway multi-station freight demand prediction apparatus comprising:
the acquisition module is used for acquiring a railway freight network topological graph and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;
the prediction module is used for inputting the time sequence and the space matrix into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future;
the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a future preset time step; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
(III) beneficial effects
The beneficial effects of the invention are as follows:
according to the railway multi-station freight demand prediction method, the railway freight network time-space diagram is constructed based on the railway freight network topological diagram, so that the time-space characteristics of the railway freight network topology can be extracted, the time-space diagram is input into the time-space diagram convolutional neural network to predict freight demands of stations in the topological diagram, the influence of the railway topological network time-space characteristics on station freight demand prediction is considered, and the prediction accuracy is higher.
Drawings
FIG. 1 is a flow chart of a method for predicting freight demand for multiple railway stations according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a space-time convolutional neural network according to an embodiment of the present invention;
FIG. 3 is a railway freight network topology according to an embodiment of the present invention;
FIG. 4 is a adjacency matrix heatmap of a railway network topology;
FIG. 5 is a schematic diagram of a shipping volume sequence for the east well set station of FIG. 3 over a historical time step;
FIG. 6 is a schematic diagram of a truck loading rate sequence for the east well set station of FIG. 3 over a historical time step.
Detailed Description
The invention will be better explained by the following detailed description of the embodiments with reference to the drawings.
According to the railway multi-station freight demand prediction method provided by the embodiment of the invention, the railway freight network time-space diagram is constructed based on the railway freight network topological diagram, so that the time-space characteristics of the railway freight network topology can be extracted, the time-space diagram is input into the time-space diagram convolutional neural network to predict the freight demands of stations in the topological diagram, the influence of the railway topological network time-space characteristics on the station freight demand prediction is considered, and the prediction accuracy is higher.
In order that the above-described aspects may be better understood, exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present invention are shown in the drawings, it should be understood that the present invention may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
Fig. 1 is a flow chart of a method for predicting the freight demand of multiple railway stations according to the present embodiment. As shown in fig. 1, the method for predicting the freight demand of multiple railway stations according to the present embodiment includes the following steps:
step S1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network.
The railway freight network topology graph is a network graph constructed by taking stations as nodes and taking intervals as edges, as shown in fig. 3.
Preferably, the time-space diagram of the railway freight network is constructed as followsWherein (1)>Is a limited vertex set and represents freight characteristic quantity of n stations at time t in a railway freight network topological graph; epsilon represents connectivity between sites in the railway freight network topology; />Representation->Is a weighted adjacency matrix,/>Representing a real number. Thus, the structured railway freight network time space diagram focuses on the structured freight time sequence, v t Are not independent but are connected by pairs of edges in the topology.
Further, the weighted adjacency matrix W is obtained according to the interval distance among n stations in the railway freight network topological graph, and the expression is:
wherein d ij Is the distance between sites i and j; sigma (sigma) 2 And λ is the distribution threshold and sparsity threshold of W, respectively; omega ij Is an element in W, represented by d ij The weight of the edge determined. Further preferably, sigma 2 10 and lambda of 0.5.
Preferably, the shipment characteristic amount includes a shipment transmission amount and a truck loading rate. I.e.Each node of (a) has two shipping characteristics: freight delivery and truck loading rate. Wherein the "freight delivery amount" corresponds to the total weight of the freight delivered by the station in a specific time, generally in tons, and the "freight rate" corresponds to the ratio of the actual weight of the loaded freight in the railway freight car delivered by the station in a specific time to the maximum freight weight that the freight car can accommodate, expressed as a percentage. In this way, in this embodiment, the freight transmission amount and the truck loading rate in the actual physical topological graph are used as node characteristics, so that space-time correlation between different nodes (stations) is sought, and further the freight transmission amount is predicted.
Further preferably, the "freight delivery amount" corresponds to the total weight of the freight delivered by the station in one day, and the "freight full rate" corresponds to the ratio of the actual weight of the loaded freight in the railway freight car delivered by the station in one day to the maximum freight weight that the freight car can accommodate. Therefore, the problem of coarse granularity of the traditional prediction space-time is solved by taking the fine granularity time sequence as the data base of prediction, and the short-term demand of the station freight can be predicted.
Step S11, preprocessing the time sequence.
Preferably, the step of pre-treating comprises:
a1, replacing a value with zero freight transmission quantity in the time sequence by using an average value of freight transmission quantity in the time sequence, and replacing a value with zero freight transmission quantity in the time sequence by using an average value of freight full rate in the time sequence. Thus, the problem of excessive zero values in the historical data is solved.
A2, carrying out normalization processing on the independent variable of the time sequence. In this way, the influence of different dimensions on the prediction accuracy can be eliminated.
Further preferably, the independent variable of the time series is Z-Score normalized.
And S2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step.
The space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs the convolved time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolved time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
In the space-time diagram convolutional neural network provided by the embodiment, each space-time convolution module comprises a three-layer structure, namely an upper layer and a lower layer are gating time convolution layers for extracting time features, and a middle layer is a space diagram convolution layer for extracting space features, so that a sandwich structure is formed. The space layer is connected with the two time layers, so that the space state from graph convolution to time convolution can be rapidly propagated; the sandwich structure is beneficial to the neural network to fully apply bottleneck strategies, and the channels are downscaled and upscaled through the graph convolution layer, so that the scale compression and the feature compression are realized. Therefore, the space-time diagram is input into the space-time diagram convolutional neural network, useful space-time information is extracted from the space-time diagram by applying the convolutional theorem and the map theory, the freight demand of the station in the topological diagram is predicted, the influence of the space-time characteristics of the railway topological network on the freight demand prediction of the station is considered, and the prediction accuracy is higher.
Specifically, layer normalization is used in each spatio-temporal convolution module to prevent overfitting.
Preferably, in the space-time convolution module, the expression of the spatial convolution layer is:
wherein Θ is a graph convolution kernel; x is the input signal of the spatial convolution layer;vector representing polynomial coefficients of chebyshev polynomial limiting Θ to Λ,/->I n Is a unit matrix; />Is an angle matrix, wherein D ij =∑ j W ij ;/>Meaning that x has C i A signal of the channel; c (C) 0 An output channel which is a space convolution layer; y is j Having C as a spatial convolution layer j The output of the channel.
Specifically, the derivation process of the spatial convolution layer expression is as follows.
Graph convolution is used as a signal by introducing a concept of a graph convolution operator' based on a concept of spectrogram convolution (utilizing a graph theory)Product of the product with the convolution kernel Θ:
Θ*x=Θ(L)x=Θ(UΛU T )x=UΘ(Λ)U T x (1);
wherein the method comprises the steps ofI n Is an identity matrix>Is an angle matrix, D ij =∑ j W ij ;/>Is a eigenvector matrix of the normalized graph Laplace matrix L; u (U) T Is the transposition of U; />Is the diagonal matrix of eigenvalues of L; Θ (Λ) is the angle matrix.
The graph signal x is filtered by the kernel Θ and fourier transformed at Θ and graph U according to the definition of equation (1) T Multiplying x, calculating the graph convolution kernel theta by the formula (1), wherein the complexity of the method is O (n) 2 ) Too time consuming, two approximation strategies are applied to overcome this problem, namely Chebyshev polynomial approximation and first order approximation.
The basic idea of Chebyshev polynomial approximation is to locate the filter and reduce the number of parameters, limiting the graph convolution kernel Θ to polynomials of Λ, such asWherein (1)>Is a vector of polynomial coefficients, K is the kernel size of the graph convolution, determining the maximum radius from the center node to the convolution. Conventionally, chebyshev polynomials T k (x) The kernel can be approximated as a truncated expansion of the K-1 order, e.g. +.>Rescaling->λ max Representing the maximum eigenvalue of L. Thus, applying the Chebyshev polynomial approximation strategy, equation (1) can be transformed into:
wherein,is in scaling Laplace operator +.>The K-th order chebyshev polynomial calculated below.
By recursively computing the K partial convolution with polynomial approximation, the complexity of equation (2) can be reduced. Due to scaling and normalization in neural networks, λ can be further assumed max Approximately 2, equation (2) can be reduced to:
wherein θ 0 、θ 1 Is two shared parameters of the graph convolution kernel.
To constrain the parameters and stabilize the numerical performance, θ 0 And theta 1 Replaced by a single parameter θ, let θ=θ 0 =-θ 1 The method comprises the steps of carrying out a first treatment on the surface of the W and D are respectively passed throughAnd->And (5) renormalizing. The graph convolution can be expressed as:
the information of the (K-1) order neighborhood from the central node is utilized to realize the graph convolution stack with the first order approximation, and the effect similar to the K local convolution level is realized. In this case, K is the number of continuous filtering operations or convolution layers in the model. Furthermore, hierarchical linear structures are parametrically economical and efficient for large scale graphs, as the approximate order is limited to 1. Is defined inThe above volume integrator "x" can be extended to multidimensional tensors, for having C i Signal of channel->The graph convolution can be summarized as:
wherein,with C i ×C o Vector, C i ×C o Input of feature map respectivelyAnd the magnitude of the output, and,
preferably, the time-gated convolutional layer consists of a one-dimensional causal convolutional kernel and a nonlinear gated linear unit.
Preferably, in the space-time convolution module, the gating time convolution layer has the expression:
wherein,for convolution kernel, it is intended to map the input Y to a single output element +.>P, Q is divided into two halves, and the sizes are the same; />Is represented as having C i The input signal of the channel, M is the length of the input sequence of the space-time convolution module; p, Q are the inputs of gates in the gated linear units, P being related to the constituent structure and dynamic variance in the time series, σ (Q) controlling the corresponding input values of the current state; the product of Hadamard of the element is indicated.
Preferably, the expression of the spatio-temporal convolution module is:
wherein the input and output of the space-time convolution module are 3-D tensors;representing the input of the space-time convolution module, M is the length of the input sequence of the space-time convolution module, and n is the railway freightNumber of stations in network topology, C l The number of input channels is the space-time convolution module l; />Representing the output, K, of the spatio-temporal convolution module l t Is the width of a one-dimensional causal convolution kernel, C l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.
As one example, the expression for the linear activation function is:
where x is the input signal.
Preferably, channel c=16 of the spatial convolution layer and channel c=64 of the gating time convolution layer. In this way, the space-time diagram convolutional neural network prediction is made more accurate.
Preferably, the space-time convolutional neural network comprises two space-time convolutional modules and one output layer which are connected in sequence. In this way, the space-time diagram convolutional neural network prediction is made more accurate.
Preferably, the mapping, by the output layer, the convolution time sequence output by the last space-time convolution module to freight feature quantities of all stations in a space-time diagram of a preset time step in the future includes:
wherein,representing the output of the last spatio-temporal convolution module; />Is a weight vector; b is the deviation.
Preferably, before step S1, the method further comprises: receiving a railway freight network time space diagram for training, and generating a training sample; based on the loss function, the training samples are input into a space-time diagram convolutional neural network for model training.
Specifically, training a time space graph convolutional neural network model according to preset iteration step number, batch size, learning rate, learner and other parameters, and manually adjusting model parameters according to loss results to enable loss values to be converged and at optimal values.
Preferably, the expression of the loss function is:
wherein W is θ Is all trainable parameters in the model; v t+1 Is a true value;is a predictive value of the model.
According to the railway multi-station freight demand prediction method provided by the embodiment, after the railway freight demand is severely increased, the freight flow to be increased in a specific area and a station is predicted in time, trucks are allocated in time and grouped in time, and an order of an employer is completed according to the points in time; when the railway freight demand is reduced, the idle trucks are allocated in time to participate in other transportation orders, so that the utilization rate of railway transportation resources is maximized; for railway freight dispatchers, the change condition of future freight flows can be predicted in advance, errors caused by manual work under the condition of hasty handling are avoided, the formulation of freight grouping strategies is more reasonable and intelligent, and the high intelligence of railway freight management level is improved. The auxiliary dispatching department timely senses the fluctuation of future freight demands, timely adjusts the freight train grouping plan to meet new freight demands, improves the capacity of freight organizations, maximizes the utilization rate of train resources and promotes the development of the railway system for adjusting the traffic flow to process the traffic flow from passive reaction to active decision.
To measure and evaluate the performance of the rail multi-station freight demand prediction method proposed in this embodiment, L is selected 2 The loss function was used as the loss function of the model and the space-time diagram convolutional neural network model proposed in this example was compared with the following reference model:
(1) Attention network (Graph attention network, GAT)
Importance among nodes of the graph structure data is allocated by using an attention mechanism, and the graph structure data belongs to graph neural network representation based on airspace.
(2) Convolutional neural network of figure (Graph Convolutional Neural Network, GCN)
The graph convolution neural network is a graph neural network representation based on a spectral domain, after graph embedding representation learning is carried out on graph data, the problem of high complexity caused by direct application of a convolution idea in the traditional CNN to the graph embedding representation data is solved, and the graph convolution neural network is a further version of a chebnet model.
(3) Chebnet model
The model is an improvement to the convolution kernel using chebyshev polynomials (ChebShev Polynomial) and demonstrates that computational complexity can be reduced.
Specifically, as the index for measuring and evaluating the model performance, average absolute error (Mean Absolute Error, MAE), average absolute percent error (Mean Absolute Percentage Error, MAPE), and root mean square error (Root Mean Square Error, RMSE) are used, and the formula is as follows:
wherein y is i Representing the actual value;representing the predicted value; n represents the number of predictions; the smaller the RMSE, MAPE and MAE values, the smaller the error between the predicted and actual values; note that y cannot be 0.
Specifically, the model performance is evaluated, when a comparison test is performed, parameters such as iteration step number, batch size, learning rate, learner and the like are required to be set uniformly, and then a model result is obtained by analyzing MAE, MAPE, RMSE and time consuming procedures and analyzing a comparison result.
The model performance will be evaluated by comparative experiments in conjunction with railway specific networks. 20 interconnected stations are selected to construct a railway freight network topology as shown in fig. 3, and a weighted adjacency matrix of the railway freight network topology is shown in fig. 4. 20 station shipment data from day-to-day records of 2021, 1-1, 12-31, 2022 were used and named "daqin dataset". The main freight characteristics considered in this data are freight volume and full rate. To ensure data integrity and interpretability, an Dongjin set station in the Dafraxinus dataset was chosen as a representative example, as shown in FIGS. 5 and 6.
It should be noted that due to the limitations of the acquisition system, the shipping data may not be recorded every day, resulting in many zero values in the raw data. In this embodiment, this problem is solved by replacing the zero value in 759 instances of the large ash dataset with the average of the two years of collected data. During the performance of the simulation experiments of the present invention, the data set of fraxinus was divided into a training set (containing 80% of the data) and a test set (containing 20% of the data). Experiments were performed on a Lennovo computer equipped with Intel (R) Core (TM) i5-6500 CPU operating at 3.20GHz with 4 CPUs and 16GB of memory. For comparison experiments, GAT, GCN and Chebnet were chosen as reference models. In order to reduce the computational complexity, the model parameters of the STGCN model and the comparison model in all simulation experiments are required to be consistent. Specifically, the freight volume of the past 12 time steps (i.e., h=12) was predicted for 1 time step in the future, and experimental analysis and evaluation were performed on the space-time convolutional neural network, GAT, GCN, and Chebnet based on the actual freight data set, and the results are shown in table 1.
Table 1 comparison of freight demand and forecast results for "daqin" actual railway network
As can be seen from table 1, MAE, RMSE, RMAPE of the space-time diagram convolutional neural network model is 0.059, 0.09 and 0.01 respectively, and simulation results show that compared with other reference models, the space-time diagram convolutional neural network model has the highest prediction precision and the lowest time consumption. The space-time diagram convolutional neural network model can be used for capturing the inherent complex space-time relationship in the railway freight demand data, and can provide more accurate information for railway freight demand prediction.
The embodiment also provides a railway multi-station freight demand prediction device, which comprises: the acquisition module is used for acquiring a railway freight network topological graph and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network; the prediction module is used for inputting the time sequence and the space matrix into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future. The space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a future preset time step; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
It should be noted that, specific functions of each module in the railway multi-station freight demand prediction device and the railway multi-station freight demand prediction flow provided in this embodiment may refer to the detailed description of the railway multi-station freight demand prediction method provided in the foregoing embodiment, and will not be repeated herein.
In summary, the railway multi-station freight demand prediction device provided by the embodiment of the invention constructs the railway freight network time-space diagram based on the railway freight network topological diagram, can extract the time-space characteristics of the railway freight network topology, inputs the time-space diagram into the time-space diagram convolutional neural network to predict the freight demands of stations in the topological diagram, considers the influence of the time-space characteristics of the railway topological network on the prediction of the freight demands of the stations, and ensures higher prediction accuracy.
In the description of the present invention, it should be understood that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present invention, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.
In the present invention, unless explicitly specified and limited otherwise, the terms "mounted," "connected," "secured," and the like are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally formed; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium; may be a communication between two elements or an interaction between two elements. The specific meaning of the above terms in the present invention can be understood by those of ordinary skill in the art according to the specific circumstances.
In the present invention, unless expressly stated or limited otherwise, a first feature is "on" or "under" a second feature, which may be in direct contact with the first and second features, or in indirect contact with the first and second features via an intervening medium. Moreover, a first feature "above," "over" and "on" a second feature may be a first feature directly above or obliquely above the second feature, or simply indicate that the first feature is higher in level than the second feature. The first feature being "under", "below" and "beneath" the second feature may be the first feature being directly under or obliquely below the second feature, or simply indicating that the first feature is level lower than the second feature.
In the description of the present specification, the terms "one embodiment," "some embodiments," "examples," "particular examples," or "some examples," etc., refer to particular features, structures, materials, or characteristics described in connection with the embodiment or example as being included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, the different embodiments or examples described in this specification and the features of the different embodiments or examples may be combined and combined by those skilled in the art without contradiction.
While embodiments of the present invention have been shown and described above, it will be understood that the above embodiments are illustrative and not to be construed as limiting the invention, and that alterations, modifications, substitutions and variations may be made in the above embodiments by those skilled in the art within the scope of the invention.

Claims (10)

1. A method for predicting freight demand for multiple railway stations, comprising:
s1, acquiring a railway freight network topological graph, and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;
s2, inputting the space-time diagram into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a topological diagram of a future preset time step;
the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into a first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic values of all stations in a space-time diagram of a future preset time step;
the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
2. The method for predicting freight demand for multiple stations on a railway in accordance with claim 1, wherein,
the constructed railway freight network time-space diagram is thatWherein (1)>Represents freight characteristic quantity of n stations at time t in the railway freight network topology diagram, epsilon represents connectivity among stations in the railway freight network topology diagram,representation->Is a weighted adjacency matrix,/>Representing a real number;
the weighted adjacency matrix W is obtained according to interval distances among n stations in the railway freight network topological graph, and the expression is as follows:
wherein d ij Is the distance between sites i and j; sigma (sigma) 2 And λ is the distribution threshold and sparsity threshold of W, respectively; omega ij Is an element in W, represented by d ij The weight of the edge determined.
3. The railway multi-station freight demand prediction method according to claim 1 or 2, wherein the freight characteristic quantity includes freight delivery quantity and freight full rate.
4. A method of predicting a freight demand for a railway plural station as in claim 3, further comprising between S1 and S2:
s11, preprocessing a time sequence; the pretreatment step comprises the following steps:
a1, replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight transmission capacity in the time sequence, and replacing a value with zero freight transmission capacity in the time sequence by adopting an average value of freight full rate in the time sequence;
a2, carrying out normalization processing on the independent variable of the time sequence;
s2, inputting the preprocessed time sequence and the preprocessed space matrix into a pre-trained space-time diagram convolutional neural network.
5. The method for predicting freight demand for multiple stations on a railway in accordance with claim 2, wherein,
the time-gated convolution layer consists of a one-dimensional causal convolution kernel and a nonlinear gated linear unit.
6. The method for predicting freight demand for multiple stations on a railway in accordance with claim 2, wherein,
the expression of the spatial convolution layer is:
wherein Θ is a graph convolution kernel; x is the input signal of the spatial convolution layer;vector representing polynomial coefficients of chebyshev polynomial limiting Θ to Λ,/->I n Is a unit matrix; />Is an angle matrix, wherein D ij =∑ j W ij ;/>Meaning that x has C i A signal of the channel; c (C) 0 An output channel which is a space convolution layer; y is j Having C as a spatial convolution layer j An output of the channel;
the gating time convolution layer has the expression:
wherein,is a convolution kernel; />Is represented as having C i The input signal of the channel, M is the length of the input sequence of the space-time convolution module; p, Q are the inputs of gates in the gated linear units, P being related to the constituent structure and dynamic variance in the time series, σ (Q) controlling the corresponding input values of the current state; the product of Hadamard of the element is indicated.
7. The method of claim 5, wherein the expression of the space-time convolution module is:
wherein,representing the input of a space-time convolution module I, M is the length of an input sequence of the space-time convolution module, n is the number of stations in a railway freight network topological graph, and C l The number of input channels is the space-time convolution module l;representing the output, K, of the spatio-temporal convolution module l t Is the width of a one-dimensional causal convolution kernel, C l+1 The output channel of the space-time convolution module l is also the input channel of the real-time space-time convolution module l+1; />Is a first time convolution kernel within the space-time convolution module l; />Is a second time convolution kernel within the spatio-temporal convolution module l; theta (theta) l The method comprises the steps of (1) performing graph convolution kernel for a space-time convolution module; reLU (·) is a linear activation function.
8. The method for predicting freight demand for multiple stations on a railway as set forth in claim 6 or 7, wherein the outputting layer maps the convolution time sequence outputted by the last space-time convolution module to freight feature values of all stations in a space-time diagram of a preset time step in the future, comprising:
wherein,representing the output of the last spatio-temporal convolution module; />Is a weight vector; b is the deviation.
9. The method of claim 1, wherein the space-time convolutional neural network comprises two space-time convolutional modules and one output layer connected in sequence.
10. A railway multi-station freight demand prediction apparatus, comprising:
the acquisition module is used for acquiring a railway freight network topological graph and constructing a railway freight network time-space diagram based on the railway freight network topological graph; the space-time diagram comprises a time sequence and a space matrix, wherein the time sequence is a freight characteristic quantity sequence of all stations in the railway freight network topological diagram in a preset historical time step, and the space matrix is a matrix used for representing the space characteristics of the railway freight network;
the prediction module is used for inputting the time sequence and the space matrix into a pre-trained space-time diagram convolutional neural network to obtain freight characteristic quantities of all stations in a space-time diagram of a preset time step in the future;
the space-time diagram convolutional neural network comprises at least one space-time convolutional module and one output layer which are sequentially connected, wherein a time sequence is input into the first space-time convolutional module, a space matrix is respectively input into each space-time convolutional module, the space-time convolutional module outputs a convolutional time sequence as the input of the next space-time convolutional module connected with the space-time convolutional module, and the output layer maps the convolutional time sequence output by the last space-time convolutional module to freight characteristic quantities of all stations in a space-time diagram of a future preset time step; the space-time convolution module comprises a first gating time convolution layer, a space diagram convolution layer and a second gating time convolution layer which are sequentially connected, and the space matrix is input into the space diagram convolution layer.
CN202311067340.1A 2023-08-23 2023-08-23 Method and device for predicting freight demand of multiple railway stations Pending CN117217445A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311067340.1A CN117217445A (en) 2023-08-23 2023-08-23 Method and device for predicting freight demand of multiple railway stations

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311067340.1A CN117217445A (en) 2023-08-23 2023-08-23 Method and device for predicting freight demand of multiple railway stations

Publications (1)

Publication Number Publication Date
CN117217445A true CN117217445A (en) 2023-12-12

Family

ID=89045238

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311067340.1A Pending CN117217445A (en) 2023-08-23 2023-08-23 Method and device for predicting freight demand of multiple railway stations

Country Status (1)

Country Link
CN (1) CN117217445A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113487066A (en) * 2021-06-18 2021-10-08 山西云时代智慧城市技术发展有限公司 Long-time-sequence freight volume prediction method based on multi-attribute enhanced graph convolution-Informer model
CN114580617A (en) * 2022-03-11 2022-06-03 北京航空航天大学 National road transport capacity distribution prediction method based on space-time graph convolutional network
CN115017809A (en) * 2022-06-07 2022-09-06 上海海事大学 Ship arrival amount prediction system and method
CN115223402A (en) * 2022-06-29 2022-10-21 北京航空航天大学 Space domain sector complexity prediction method based on space-time graph convolutional network
CN115829158A (en) * 2022-12-28 2023-03-21 中铁第一勘察设计院集团有限公司 Urban rapid freight volume prediction method based on deep learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113487066A (en) * 2021-06-18 2021-10-08 山西云时代智慧城市技术发展有限公司 Long-time-sequence freight volume prediction method based on multi-attribute enhanced graph convolution-Informer model
CN114580617A (en) * 2022-03-11 2022-06-03 北京航空航天大学 National road transport capacity distribution prediction method based on space-time graph convolutional network
CN115017809A (en) * 2022-06-07 2022-09-06 上海海事大学 Ship arrival amount prediction system and method
CN115223402A (en) * 2022-06-29 2022-10-21 北京航空航天大学 Space domain sector complexity prediction method based on space-time graph convolutional network
CN115829158A (en) * 2022-12-28 2023-03-21 中铁第一勘察设计院集团有限公司 Urban rapid freight volume prediction method based on deep learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BING YU 等: "Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting", pages 2 - 4, Retrieved from the Internet <URL:《https://arxiv.org/pdf/1709.04875.pdf》> *
毛海军: "《物流系统规划与设计 第2版》", 31 January 2017, 南京:东南大学出版社, pages: 125 *

Similar Documents

Publication Publication Date Title
Wan et al. CTS-LSTM: LSTM-based neural networks for correlatedtime series prediction
Ping et al. Impact of driver behavior on fuel consumption: Classification, evaluation and prediction using machine learning
CN113077090B (en) Passenger flow prediction method, system and computer readable storage medium
Zhang et al. Network‐wide traffic speed forecasting: 3D convolutional neural network with ensemble empirical mode decomposition
CN111860951A (en) Rail transit passenger flow prediction method based on dynamic hypergraph convolutional network
Li et al. Graph CNNs for urban traffic passenger flows prediction
CN116187640B (en) Power distribution network planning method and device based on grid multi-attribute image system
Fan et al. Predicting vacant parking space availability: a long short-term memory approach
CN114943356B (en) Short-time demand integrated prediction method for airport arrival passenger to take taxi
Zhao et al. A novel hybrid model for short-term high-speed railway passenger demand forecasting
Wang et al. Deep learning of spatiotemporal patterns for urban mobility prediction using big data
CN114462623B (en) Data analysis method, system and platform based on edge calculation
CN106650959A (en) Power distribution network repair ability assessment method based on improved grey clustering
CN114066073A (en) Power grid load prediction method
CN112308298B (en) Multi-scenario performance index prediction method and system for semiconductor production line
CN111008726A (en) Class image conversion method in power load prediction
Jiang et al. A new form of deep learning in smart logistics with IoT environment
CN116824861A (en) Method and system for scheduling sharing bicycle based on multidimensional data of urban brain platform
Bao et al. Forecasting network-wide multi-step metro ridership with an attention-weighted multi-view graph to sequence learning approach
Wu et al. Forecasting metro rail transit passenger flow with multiple-attention deep neural networks and surrounding vehicle detection devices
Li et al. Short-term traffic demand prediction using graph convolutional neural networks
Zhu et al. A novel hybrid deep learning model for taxi demand forecasting based on decomposition of time series and fusion of text data
Rodríguez-Rueda et al. Origin–Destination matrix estimation and prediction from socioeconomic variables using automatic feature selection procedure-based machine learning model
Ye et al. Demand forecasting of online car‐hailing by exhaustively capturing the temporal dependency with TCN and Attention approaches
CN117217445A (en) Method and device for predicting freight demand of multiple railway stations

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination