CN114118508A - OD market aviation passenger flow prediction method based on space-time convolution network - Google Patents
OD market aviation passenger flow prediction method based on space-time convolution network Download PDFInfo
- Publication number
- CN114118508A CN114118508A CN202110878862.4A CN202110878862A CN114118508A CN 114118508 A CN114118508 A CN 114118508A CN 202110878862 A CN202110878862 A CN 202110878862A CN 114118508 A CN114118508 A CN 114118508A
- Authority
- CN
- China
- Prior art keywords
- passenger flow
- time
- space
- market
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 56
- 238000012549 training Methods 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 17
- 230000000737 periodic effect Effects 0.000 claims description 16
- 238000013528 artificial neural network Methods 0.000 claims description 11
- 239000011159 matrix material Substances 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 9
- 238000010586 diagram Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- 238000013507 mapping Methods 0.000 claims description 4
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000000354 decomposition reaction Methods 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 239000012634 fragment Substances 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 239000000126 substance Substances 0.000 claims description 3
- 230000000694 effects Effects 0.000 description 8
- 238000010801 machine learning Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 238000010276 construction Methods 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000001364 causal effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000007637 random forest analysis Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/40—Business processes related to the transportation industry
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Marketing (AREA)
- Biophysics (AREA)
- General Business, Economics & Management (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Tourism & Hospitality (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses an OD market aviation passenger flow prediction method based on a time-space convolution network, which belongs to the technical field of big data, and comprises the steps of counting, sorting and storing aviation passenger flow data from each departure airport to a destination airport in a multi-airport system in a region, building a prediction model of the OD market aviation passenger flow based on the time-space convolution network, determining the optimal hyper-parameter setting of the prediction model under different data sets, predicting the aviation passenger flow of a plurality of OD markets according to aviation passenger flow historical data, solving the technical problems of predicting the aviation passenger flow by adopting the time-space convolution network and simultaneously predicting the passenger flow of a plurality of departure airports to a plurality of OD markets of the same destination airport in the same region, firstly applying the time-space convolution network to the field of aviation passenger flow prediction and simultaneously predicting the passenger flow of a plurality of departure airports to the same destination airport in the same region, a new idea is provided for predicting the passenger flow volume of the aviation.
Description
Technical Field
The invention belongs to the technical field of big data, and particularly relates to an OD market aviation passenger flow prediction method based on a space-time convolutional network.
Background
For a long time, accurate prediction of the air passenger flow is always the primary task for overall planning of all main bodies in the air transportation industry. The airline can accordingly determine whether the OD market can make profits or make adjustments to existing capacity supplies, and the airport can timely assess the matching of current infrastructure capacity and passenger flow. However, the passenger flow is influenced by various factors in the air transportation market, and uncertainty and unpredictability exist, so how to scientifically and accurately predict the air passenger flow in the OD market is always one of the hot spots in the research of the field of civil aviation transportation.
According to the existing research conditions at home and abroad, the traffic flow prediction method in the traffic field can be divided into a parametric method and a non-parametric method based on a model construction mechanism. The model structure of the parametric method needs to be determined according to corresponding theoretical hypothesis and statistical analysis, the model parameters are estimated by using relevant historical time sequence data, and the relevant model of the method comprises the following steps: least squares regression models, autoregressive moving average models, autoregressive distribution lag models, time varying parameter models, vector autoregression, gray prediction models, and the like. In addition, from the perspective of system dynamics, a dynamic submodel of each factor influencing passenger flow volume changes can be constructed, and an aviation passenger flow volume prediction result is obtained through system simulation. Since the parameter method prediction model is based on the idea of metering economic modeling, the establishment and fitting of a concrete model capable of accurately reflecting the passenger flow volume change are attempted, but the model hardly represents the non-linear relation of causal variables and has insufficient capability of processing data generalization with severe fluctuation change.
The nonparametric method based on machine learning is also widely applied to the market passenger flow prediction, and relevant application means of the method are as follows: BP neural network, support vector regression, random forest regression, adaptive fuzzy neural network, wavelet neural network, and combined prediction model. Although the results obtained by the combined prediction model are superior to those of a single prediction model, most of the built models are shallow-layer structures and cannot accurately mine deep-layer information behind large-scale data, and the machine learning method still has limitation on processing high-dimensional nonlinear traffic data, so that further expansion of machine learning prediction is restricted.
In recent years, under the large background that traffic data is explosively increased, deep learning derived and developed from artificial neural network research provides a new way for further improving the passenger flow prediction accuracy. The traditional neural network takes a one-dimensional vector data form as input, and the convolutional neural network is different in that a multidimensional matrix data form is taken as input, and an original neuron connection mode is replaced by convolutional operation, and the multilayer convolutional operation can automatically extract correlation characteristics among data with spatial characteristics, so that errors caused by artificially constructed characteristics are reduced, and the convolutional neural network can be migrated and applied to the prediction problem of the aviation passenger flow volume data with the space-time characteristics.
In summary, the conventional methods for predicting the passenger flow volume of the aviation mostly focus on a parameter method, shallow machine learning and combined prediction. The parameter method relates to model construction, which is influenced by human subjective factors to a great extent, and meanwhile, the accuracy of parameter estimation influences the final prediction precision of the model; shallow machine learning and combined prediction may cause situations such as the prediction process falling into overfitting. In addition, the historical data of a single OD market in the time dimension are only considered in the aviation passenger flow prediction, the influence of adjacent airports with linkage correlation effects in the space position on the predicted OD market passenger flow is ignored, and the OD market passenger flow cannot be predicted from the perspective of a plurality of regional airports.
Disclosure of Invention
The invention aims to provide an OD market passenger flow prediction method based on a space-time convolution network, which solves the technical problem that the space-time convolution network is adopted to predict the passenger flow of the aviation, and the simultaneous prediction of the passenger flow of a plurality of OD markets from a plurality of departure airports to a same destination airport in the same region is realized.
In order to achieve the purpose, the invention adopts the following technical scheme:
an OD market aviation passenger flow prediction method based on a space-time convolutional network comprises the following steps:
selecting a certain regional multi-airport system through a client server, and counting, sorting and storing the aviation passenger flow data from each departure airport to a destination airport in the regional multi-airport system;
the method comprises the steps that a prediction model server obtains aviation passenger flow volume data, preprocesses the aviation passenger flow volume data, constructs an OD passenger flow grid diagram and external influence factor characteristic vectors, and builds an OD market aviation passenger flow volume prediction model based on a time-space convolution network to form a corresponding training set of the prediction model;
the OD market refers to an air passenger transportation market from an Origin Airport to a Destination Airport;
determining the optimal hyper-parameter setting of the prediction model under different data sets by adjusting the data characteristics and the network structure of the prediction model; the hyper-parameters comprise hyper-parameters of a network structure and hyper-parameters of data characteristics, the hyper-parameters of the network structure comprise the number of convolution layers, the size of convolution kernels and the number of convolution kernels, and the hyper-parameters of the data characteristics comprise the sample length selected by a trend segment and the sample length selected by a periodic segment;
and the central server simultaneously predicts the aviation passenger flow of a plurality of OD markets according to the aviation passenger flow historical data.
Preferably, a manager inputs a three-character code of a departure airport and a three-character code of a destination airport in the multi-airport system through a client server, and the monthly passenger flow corresponding to each OD market;
and the client server automatically arranges according to the time to generate passenger flow time sequence data of each OD market, and transmits the passenger flow time sequence data to the database module for storage.
Preferably, the method for preprocessing the aviation passenger flow volume data comprises wavelet threshold denoising and data normalization processing, wherein the wavelet threshold denoising adopts a soft threshold two-layer decomposition method;
preferably, the constructing of the OD passenger flow grid map and the external influence factor feature vector specifically includes:
step A1: definition O ═ { O1,o2,…,oNRepresenting a set of departure airports, wherein N is the total number of the departure airports; d represents the destination airport; by using<o,d>Represents an OD market, O belongs to O;
the prediction model server maps N departure airports to an I multiplied by J grid graph R according to the longitude and latitude and the relative geographic position distribution condition of the departure airports in the multi-airport system of each region, a grid (I, J) represents any grid in the grid graph R, I represents the row number of the grid, J represents the column number of the grid, and the grid (I, J) represents the passenger flow from the departure airport o in the system corresponding to the position to the destination airport d outside the system at the t monthBy a two-dimensional tensor Xt∈RI×JA grid graph showing the OD passenger flow from departure airport to the same destination airport outside the system during the t month, wherein,
step A2: defining two characteristics of external influence factors including month attribute and whether to contain holidays, and processing the external influence factors into a 0-1 vector, namely representing the predicted month attribute and whether to contain holidays by 0 or 1, wherein the first 12 bits correspondingly represent 1 month to 12 months, and the 13 th bit represents whether to contain holidays;
preferably, the prediction model server builds a prediction model of the OD market aviation passenger flow based on the space-time convolution network by using a functional module in a Keras neural network library of Python, and the OD market aviation passenger flow is predicted according to the known m historical observed values XM={XtL t 1, 2, …, m, forecast airline passenger flow X for several OD markets for k monthsK={Xt|t=m+1,m+2,…,m+k};
The optimal hyper-parameter setting of the prediction model is determined by changing two hyper-parameters of a network structure and data characteristics, the prediction model is trained and optimized by using an Adam optimizer, and overfitting of the model is avoided by adopting an early-stop strategy in the training process.
Preferably, the specific steps of predicting the airline passenger flow of the plurality of OD markets by the central server include:
step B1: extracting segments of the OD passenger flow grid diagram, and extracting two time segments according to different time intervals aiming at the prediction time point according to the dependency of the passenger flow on the time dimension: respectively, are trending grid graph segments XTreAnd periodic grid map fragment XPerThe specific extraction form is as follows:
wherein ltreAnd lperThe method is used as an adjustable data characteristic hyper-parameter in a prediction model, and respectively represents the sample length selected by a trend segment and the sample length selected by a periodic segment;
step B2: constructing a space-time convolution network, and respectively constructing space-time characteristics of space-time convolution network branches with the same structure for capturing OD passenger flow based on two extracted grid graph segments with different time intervals, wherein the space-time convolution network branches are based on S +1(S is more than or equal to 1) convolution layers;
taking the space-time convolution network branch of the trend part as an example, the method uses Representing a trending grid graph segment, passing through the first convolution layer C1Will (X)Tre)(0)Into a new tensor (X)Tre)(1)The conversion formula is as follows:
wherein the content of the first and second substances,andis the learning parameter of the first convolutional layer, in convolutional layer C1After the operation, continuously adding (S-1) convolution layers according to the formula, and after the S convolution layer, passing through a convolution layer C only containing one convolution kernelS+1Finally, the output result (X) of the trend part is obtainedTre)(S+1)(ii) a Similarly, the same operation is used for constructing the space-time convolution network branch of the periodic part to obtain an output result (X)Per)(S+1);
Step B3: constructing an external influence factor network, wherein external influence factors considered by a prediction model comprise month attributes and whether holidays or not, and obtaining corresponding external influence factor characteristic vectors E based on the tth prediction time pointtThe first layer can be regarded as an embedded layer, and mainly quantifies and adds the external factors into a prediction model, and the second layer maps the features obtained by the first layer into a high-dimensional tensor, the size of which should be equal to XtSo as to be fused with the output result of the space-time convolution network to obtain an output result XExt;
Step B4: fusing the obtained prediction result and giving the output result (X) of the space-time convolution network in the form of learning parametersTre)(S+1)And (X)Per)(S+1)Different weight matrixes are aggregated to obtain a weighted output result, and the calculation formula is as follows:
XCon=WTre*(XTre)(S+1)+WPer*(XPer)(S+1)
wherein denotes the hadamard product; wTreAnd WPerRespectively represent trendThe weights of the potential part and the periodic part, namely the influence degrees of the two parts of output on the final prediction result respectively, further calculate the result X through the tanh functionConAnd external influencing factor network output result XExtMapping to [ -1, 1 [ ]]Get the final predicted result XKThe calculation formula is as follows:
XK=tanh(Xcon+XExt)
to minimize the predictor matrix XKAnd true value matrixThe mean square error between them is the target training prediction model:
where θ represents all learning parameters of the model.
The invention relates to an OD market aviation passenger flow prediction method based on a time-space convolution network, which solves the technical problems that the time-space convolution network is adopted to predict the aviation passenger flow and the simultaneous prediction of a plurality of OD market passenger flows from a plurality of departure airports to a same destination airport in the same area is realized; according to the invention, the convolution kernels in the convolution network are regarded as media for capturing OD passenger flow spatial correlation, so that a plurality of convolution kernels in one convolution layer can accurately capture the complex potential characteristic relation of OD passenger flow in spatial dimension, and the prediction effect is improved; the method comprehensively considers the time dependence, the spatial correlation and the effect of external influence factors of OD passenger flow, takes RMSE as the evaluation standard of the model, takes OD passenger flow data of 16 main airports from Long-triangle to Guangzhou white cloud international airport as an example, and the prediction model has better fitting effect compared with ARIMA (autoregressive moving average model), SVR (support vector regression), Elman neural network and LSTM (long-short term memory network).
Drawings
FIG. 1 is a general flow chart of the method for predicting the air passenger flow of the OD market based on the space-time convolution network of the present invention;
FIG. 2 is a model framework for predicting the air passenger flow volume in the OD market based on the space-time convolution network;
FIG. 3 is a schematic diagram of the construction of an OD passenger flow grid in an embodiment of the invention;
FIG. 4 is a result of fitting the passenger flow in different OD markets by the model method proposed in the embodiment of the present invention with other models;
fig. 5 shows the passenger flow fitting results of the model method proposed in the embodiment of the present invention and other models at different prediction intervals.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1-5, an OD market air passenger flow prediction method based on a space-time convolutional network includes:
selecting a certain regional multi-airport system through a client server, and counting, sorting and storing the aviation passenger flow data from each departure airport to a destination airport in the regional multi-airport system;
the method comprises the steps that a prediction model server obtains aviation passenger flow volume data, preprocesses the aviation passenger flow volume data, constructs an OD passenger flow grid diagram and external influence factor characteristic vectors, and builds an OD market aviation passenger flow volume prediction model based on a time-space convolution network to form a corresponding training set of the prediction model;
the OD market refers to an air passenger transportation market from an Origin Airport to a Destination Airport;
the destination airport is the destination airport outside the multi-airport system.
Determining the optimal hyper-parameter setting of the prediction model under different data sets by adjusting the data characteristics and the network structure of the prediction model; the hyper-parameters comprise hyper-parameters of a network structure and hyper-parameters of data characteristics, the hyper-parameters of the network structure comprise the number of convolution layers, the size of convolution kernels and the number of convolution kernels, and the hyper-parameters of the data characteristics comprise the sample length selected by a trend segment and the sample length selected by a periodic segment;
the data set refers to the aviation passenger flow data (namely a plurality of OD market aviation passenger flows) from each departure airport to a destination airport in the regional multi-airport system; different regional multi-airport systems will form different data sets, but the data types are all traffic volumes.
Hyper-parameters refer to model parameters that are artificially set before the start of training the predictive model.
And the central server simultaneously predicts the aviation passenger flow of a plurality of OD markets according to the aviation passenger flow historical data.
Preferably, a manager inputs a three-character code of a departure airport and a three-character code of a destination airport in the multi-airport system through a client server, and the monthly passenger flow corresponding to each OD market;
"departure airport" refers to all or part of the airports within the selected "regional multi-airport system," such as all airports in the Long triangular region.
The 'destination airport' refers to an airport outside the selected 'regional multi-airport system', and only one airport is selected, such as the international airport of capital in Beijing.
And the client server automatically arranges according to the time to generate passenger flow time sequence data of each OD market, and transmits the passenger flow time sequence data to the database module for storage.
Preferably, the method for preprocessing the aviation passenger flow volume data comprises wavelet threshold denoising and data normalization processing, wherein the wavelet threshold denoising adopts a soft threshold two-layer decomposition method.
Preferably, the constructing of the OD passenger flow grid map and the external influence factor feature vector specifically includes:
step A1: definition O ═ { O1,o2,…,oNRepresenting a set of departure airports, wherein N is the total number of the departure airports; d represents the destination airport; by using<o,d>Represents an OD market, O belongs to O;
the prediction model server maps N departure airports to an I multiplied by J grid graph R according to the longitude and latitude and the relative geographic position distribution condition of the departure airports in the multi-airport system of each region, a grid (I, J) represents any grid in the grid graph R, I represents the row number of the grid, J represents the column number of the grid, and the grid (I, J) represents the passenger flow from the departure airport o in the system corresponding to the position to the destination airport d outside the system at the t monthBy a two-dimensional tensor Xt∈RI×JA grid graph showing the OD passenger flow from departure airport to the same destination airport outside the system during the t month, wherein,
as shown in fig. 3, each grid corresponds to a departure airport o in advance, and grid (i, j) corresponds to the passenger flow from the departure airport o to the destination airport d represented by the grid in a month; the grid map is equivalent to a flow matrix as input to the model.
In this embodiment, 16 departure airports are mapped to a 4 × 4 grid map according to the longitude and latitude and the relative geographic location distribution of the departure airports in the long-triangular region multi-airport system, and the mapping positions of the departure airports in the grid map are shown in the lower left corner of fig. 3.
Step A2: defining two characteristics of external influence factors including a month attribute and whether the external influence factors contain holidays, processing the external influence factors into a 0-1 vector, namely representing the predicted month attribute and whether the external influence factors contain holidays by 0 or 1, wherein the first 12 bits correspondingly represent 1 month to 12 months, and the 13 th bit represents whether the holidays contain holidays, and if the passenger flow of 2018 year 10 month is predicted, converting the passenger flow into [1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1 ];
preferably, the prediction model server builds a prediction model of the OD market aviation passenger flow based on the space-time convolution network by using a functional module in a Keras neural network library of Python, as shown in fig. 2, which is a frame of the prediction model;
by knowing m historical observations XM={XtL t 1, 2, …, m, forecast airline passenger flow X for several OD markets for k monthsK={Xt|t=m+1,m+2,…,m+k};
The optimal hyper-parameter setting of the prediction model is determined by changing two hyper-parameters of a network structure and data characteristics, the prediction model is trained and optimized by using an Adam optimizer, and overfitting of the model is avoided by adopting an early-stop strategy in the training process.
Early-stop strategy: 1) by dividing an original training data set into a training set and a validation set; 2) training the model on a training set and calculating the error of the model on a verification set in each epoch period; 3) stopping training when the error of the model on the verification set is worse than the last training result; 4) and using the parameters in the last iteration result as final parameters of the model.
Namely, the optimal model training effect is achieved by balancing the training period and generalization errors, and model overfitting is avoided
In this embodiment, the fixed learning rate is 0.002, the number of samples processed in each batch is 3, the number of training rounds is 100, and for example, OD passenger flow data of 16 major airports from long triangle to Guangzhou Baiyun International airport is taken as an example, the number of convolution layers in a network structure is set to be ∈ {1, 2, 3}, the size of convolution kernels is ∈ { (2, 2), (3, 3) }, the number of convolution kernels is ∈ {16, 32, 64, 128}, and the sample length selected by a trend segment and a periodic segment in data characteristics is ∈ {2, 3, 4 }; in the embodiment, data from 1 month to 2017 and 12 months in 2010 are used as training data, data from 1 month to 2018 and 12 months in 2018 are used as test data, and the prediction time interval is 1 month.
With the absolute error Δ, MAE (mean absolute error) and RMSE (root mean square error) as evaluation criteria, the air passenger flow prediction pair of different algorithm models is shown in table 1, the fitting result with other models in different OD markets is shown in fig. 4, and the fitting result with other models in different prediction intervals is shown in fig. 5:
TABLE 1
And predicting the aviation passenger flow of a plurality of OD markets in a period of time by using a prediction model for determining the super parameter setting according to the aviation passenger flow historical data.
Preferably, the specific steps of predicting the airline passenger flow of the plurality of OD markets by the central server include:
step B1: extracting segments of the OD passenger flow grid diagram, and extracting two time segments according to different time intervals aiming at the prediction time point according to the dependency of the passenger flow on the time dimension: respectively, are trending grid graph segments XTreAnd periodic grid map fragment XPerThe specific extraction form is as follows:
wherein ltreAnd lperThe method is used as an adjustable data characteristic hyper-parameter in a prediction model, and respectively represents the sample length selected by a trend segment and the sample length selected by a periodic segment;
step B2: constructing a space-time convolution network, and respectively constructing space-time characteristics of space-time convolution network branches with the same structure for capturing OD passenger flow based on two extracted grid graph segments with different time intervals, wherein the space-time convolution network branches are based on S +1(S is more than or equal to 1) convolution layers;
taking the space-time convolution network branch of the trend part as an example, the method uses Representing a trending grid graph segment, passing through the first convolution layer C1Will (X)Tre)(0)Into a new tensor (X)Tre)(1)The conversion formula is as follows:
wherein the content of the first and second substances,andis the learning parameter of the first convolutional layer, in convolutional layer C1After the operation, continuously adding (S-1) convolution layers according to the formula, and after the S convolution layer, passing through a convolution layer C only containing one convolution kernelS+1Finally, the output result (X) of the trend part is obtainedTre)(S+1)(ii) a Similarly, the same operation is used for constructing the space-time convolution network branch of the periodic part to obtain an output result (X)Per)(S+1);
Step B3: constructing an external influence factor network, wherein external influence factors considered by a prediction model comprise month attributes and whether holidays or not, and obtaining corresponding external influence factor characteristic vectors E based on the tth prediction time pointtThe first layer can be regarded as an embedded layer, and mainly quantifies and adds the external factors into a prediction model, and the second layer maps the features obtained by the first layer into a high-dimensional tensor, the size of which should be equal to XtSo as to be fused with the output result of the space-time convolution network to obtain an output result XExt;
Step B4: fusing the obtained prediction result and giving the output result (X) of the space-time convolution network in the form of learning parametersTre)(S+1)And (X)Per)(S+1)Is differentAnd the weight matrix is aggregated to obtain a weighted output result, and the calculation formula is as follows:
XCon=WTre*(XTre)(S+1)+WPer*(XPer)(S+1)
wherein denotes the hadamard product; wTreAnd WPerRespectively representing the weight of the trend part and the periodic part, namely the influence degree of the output of the two parts on the final prediction result, and further calculating the result X through a tanh functionConAnd external influencing factor network output result XExtMapping to [ -1, 1 [ ]]Get the final predicted result XKThe calculation formula is as follows:
XK=tanh(XCon+XExt)
to minimize the predictor matrix XKAnd true value matrixThe mean square error between them is the target training prediction model:
where θ represents all learning parameters of the model.
The invention relates to an OD market aviation passenger flow prediction method based on a time-space convolution network, which solves the technical problems that the time-space convolution network is adopted to predict the aviation passenger flow and the simultaneous prediction of a plurality of OD market passenger flows from a plurality of departure airports to a same destination airport in the same area is realized; according to the invention, the convolution kernels in the convolution network are regarded as media for capturing OD passenger flow spatial correlation, so that a plurality of convolution kernels in one convolution layer can accurately capture the complex potential characteristic relation of OD passenger flow in spatial dimension, and the prediction effect is improved; the method comprehensively considers the time dependence, the spatial correlation and the effect of external influence factors of OD passenger flow, takes RMSE as the evaluation standard of the model, takes OD passenger flow data of 16 main airports from Long-triangle to Guangzhou white cloud international airport as an example, and the prediction model has better fitting effect compared with ARIMA (autoregressive moving average model), SVR (support vector regression), Elman neural network and LSTM (long-short term memory network).
In the present invention, any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the embodiments of the present invention.
The logic and/or steps represented in the flowcharts or otherwise described herein, e.g., an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.
In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.
Claims (6)
1. An OD market aviation passenger flow prediction method based on a space-time convolution network is characterized by comprising the following steps: the method comprises the following steps:
selecting a certain regional multi-airport system through a client server, and counting, sorting and storing the aviation passenger flow data from each departure airport to a destination airport in the regional multi-airport system;
the method comprises the steps that a prediction model server obtains aviation passenger flow volume data, preprocesses the aviation passenger flow volume data, constructs an OD passenger flow grid diagram and external influence factor characteristic vectors, and builds a prediction model of the OD market aviation passenger flow volume based on a space-time convolution network to form a corresponding training set of the prediction model;
the OD market refers to an air passenger transportation market from an Origin Airport to a Destination Airport;
determining the optimal hyper-parameter setting of the prediction model under different data sets by adjusting the data characteristics and the network structure of the prediction model; the hyper-parameters comprise hyper-parameters of a network structure and hyper-parameters of data characteristics, the hyper-parameters of the network structure comprise the number of convolution layers, the size of convolution kernels and the number of convolution kernels, and the hyper-parameters of the data characteristics comprise the sample length selected by a trend segment and the sample length selected by a periodic segment;
and the central server simultaneously predicts the aviation passenger flow of a plurality of OD markets according to the aviation passenger flow historical data.
2. The space-time convolutional network-based OD market air passenger flow prediction method of claim 1, characterized in that: a manager inputs a three-character code of a departure airport and a three-character code of a destination airport in the multi-airport system through a client server, and the monthly passenger flow corresponding to each OD market;
and the client server automatically arranges according to the time to generate passenger flow time sequence data of each OD market, and transmits the passenger flow time sequence data to the database module for storage.
3. The space-time convolutional network-based OD market air passenger flow prediction method of claim 1, characterized in that: preprocessing the aviation passenger flow data, including wavelet threshold denoising and data normalization processing, wherein the wavelet threshold denoising adopts a soft threshold two-layer decomposition method.
4. The space-time convolutional network-based OD market air passenger flow prediction method of claim 1, characterized in that: constructing an OD passenger flow grid diagram and external influence factor characteristic vectors, which specifically comprise the following steps:
step A1: definition O ═ { O1,o2,…,oNRepresenting a set of departure airports, wherein N is the total number of the departure airports; d represents the destination airport; using < O, d > to represent an OD market, and O belongs to O;
the prediction model server maps N departure airports to an I multiplied by J grid graph R according to the longitude and latitude and the relative geographic position distribution condition of the departure airports in the multi-airport system of each region, a grid (I, J) represents any grid in the grid graph R, I represents the row number of the grid, J represents the column number of the grid, and the grid (I, J) represents the passenger flow from the departure airport o in the system corresponding to the position to the destination airport d outside the system at the t monthBy a two-dimensional tensor Xt∈RI×JA grid graph showing the OD passenger flow from departure airport to the same destination airport outside the system during the t month, wherein,
step A2: defining two characteristics of external influence factors including a month attribute and whether to contain holidays, and processing the external influence factors into a 0-1 vector, namely representing the predicted month attribute and whether to contain the holidays by 0 or 1, wherein the first 12 bits correspondingly represent 1 month to 12 months, and the 13 th bit represents whether to contain the holidays.
5. The space-time convolutional network-based OD market air passenger flow prediction method of claim 4, characterized in that: the prediction model server builds a prediction model of the OD market aviation passenger flow based on the space-time convolution network by using a functional module in a Keras neural network library of Python, and the OD market aviation passenger flow is predicted according to the known m historical observed values XM={XtL t 1, 2, …, m, forecast airline passenger flow X for several OD markets for k monthsK={Xt|t=m+1,m+2,…,m+k};
The optimal hyper-parameter setting of the prediction model is determined by changing two hyper-parameters of a network structure and data characteristics, the prediction model is trained and optimized by using an Adam optimizer, and overfitting of the model is avoided by adopting an early-stop strategy in the training process.
6. The space-time convolutional network-based OD market air passenger flow prediction method of claim 5, characterized in that: the specific steps of predicting the aviation passenger flow of a plurality of OD markets by the central server comprise:
step B1: extracting segments of the OD passenger flow grid diagram, and extracting two time segments according to different time intervals aiming at the prediction time point according to the dependency of the passenger flow on the time dimension: respectively, are trending grid graph segments XTreAnd periodic grid map fragment XPerThe specific extraction form is as follows:
wherein ltreAnd lperThe method is used as an adjustable data characteristic hyper-parameter in a prediction model, and respectively represents the sample length selected by a trend segment and the sample length selected by a periodic segment;
step B2: constructing a space-time convolution network, and respectively constructing space-time characteristics of space-time convolution network branches with the same structure for capturing OD passenger flow based on two extracted grid graph segments with different time intervals, wherein the space-time convolution network branches are based on S +1(S is more than or equal to 1) convolution layers;
taking the space-time convolution network branch of the trend part as an example, the method usesRepresenting a trending grid graph segment, passing through the first convolution layer C1Will (X)Tre)(0)Into a new tensor (X)Tre)(1)The conversion formula is as follows:
wherein the content of the first and second substances,andis the learning parameter of the first convolutional layer, in convolutional layer C1After the operation, continuously adding (S-1) convolution layers according to the formula, and after the S convolution layer, passing through a convolution layer C only containing one convolution kernelS+1Finally, the output result (X) of the trend part is obtainedTre)(S+1)(ii) a Similarly, the same operation is used for constructing the space-time convolution network branch of the periodic part to obtain an output result (X)Per)(S+1);
Step B3: constructing an external influence factor network, wherein external influence factors considered by a prediction model comprise month attributes and whether holidays or not, and obtaining corresponding external influence factor characteristic vectors E based on the tth prediction time pointtUsing a two-layer fully-connected neural network branch as the external influencing factor network, the first layer can be considered as the embedding layer, mainly the embedding layerThe external factors are quantitatively added into a prediction model, and the second layer maps the features obtained by the first layer into a high-dimensional tensor, the size of which is equal to XtSo as to be fused with the output result of the space-time convolution network to obtain an output result XExt;
Step B4: fusing the obtained prediction result and giving the output result (X) of the space-time convolution network in the form of learning parametersTre)(S+1)And (X)Per)(S+1)Different weight matrixes are aggregated to obtain a weighted output result, and the calculation formula is as follows:
XCon=WTre*(XTre)(S+1)+WPer*(XPer)(S+1);
wherein denotes the hadamard product; wTreAnd WPerRespectively representing the weight of the trend part and the periodic part, namely the influence degree of the output of the two parts on the final prediction result, and further calculating the result X through a tanh functionConAnd external influencing factor network output result XExtMapping to [ -1, 1 [ ]]Get the final predicted result XKThe calculation formula is as follows:
XK=tanh(XCon+XExt);
to minimize the predictor matrix XKAnd true value matrixThe mean square error between them is the target training prediction model:
where θ represents all learning parameters of the model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110878862.4A CN114118508A (en) | 2021-08-02 | 2021-08-02 | OD market aviation passenger flow prediction method based on space-time convolution network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110878862.4A CN114118508A (en) | 2021-08-02 | 2021-08-02 | OD market aviation passenger flow prediction method based on space-time convolution network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114118508A true CN114118508A (en) | 2022-03-01 |
Family
ID=80440730
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110878862.4A Pending CN114118508A (en) | 2021-08-02 | 2021-08-02 | OD market aviation passenger flow prediction method based on space-time convolution network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114118508A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116703011A (en) * | 2023-08-09 | 2023-09-05 | 民航机场规划设计研究总院有限公司 | Aviation passenger flow distribution prediction method and device, electronic equipment and storage medium |
-
2021
- 2021-08-02 CN CN202110878862.4A patent/CN114118508A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116703011A (en) * | 2023-08-09 | 2023-09-05 | 民航机场规划设计研究总院有限公司 | Aviation passenger flow distribution prediction method and device, electronic equipment and storage medium |
CN116703011B (en) * | 2023-08-09 | 2023-10-20 | 民航机场规划设计研究总院有限公司 | Aviation passenger flow distribution prediction method and device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7497342B2 (en) | Method and device for predicting power system thermal load | |
US11714937B2 (en) | Estimating physical parameters of a physical system based on a spatial-temporal emulator | |
Li et al. | Prediction for tourism flow based on LSTM neural network | |
Palmer et al. | Designing an artificial neural network for forecasting tourism time series | |
Pan et al. | Development of a global road safety performance function using deep neural networks | |
CN112446419A (en) | Time-space neural network radar echo extrapolation forecasting method based on attention mechanism | |
CN111047078B (en) | Traffic characteristic prediction method, system and storage medium | |
US11720727B2 (en) | Method and system for increasing the resolution of physical gridded data | |
CN107704970A (en) | A kind of Demand-side load forecasting method based on Spark | |
Bai et al. | A forecasting method of forest pests based on the rough set and PSO-BP neural network | |
CN111160622A (en) | Scenic spot passenger flow prediction method and device based on hybrid neural network model | |
Yalpır | Enhancement of parcel valuation with adaptive artificial neural network modeling | |
CN113591380A (en) | Traffic flow prediction method, medium and equipment based on graph Gaussian process | |
CN111027662A (en) | SD-LSSVR short-time traffic flow prediction method based on chaos quantum particle swarm optimization | |
CN115545334B (en) | Land utilization type prediction method and device, electronic equipment and storage medium | |
CN114723188A (en) | Water quality prediction method, device, computer equipment and storage medium | |
CN116187835A (en) | Data-driven-based method and system for estimating theoretical line loss interval of transformer area | |
Fan et al. | Multi-system fusion based on deep neural network and cloud edge computing and its application in intelligent manufacturing | |
CN113887704A (en) | Traffic information prediction method, device, equipment and storage medium | |
Robati et al. | Inflation rate modeling: adaptive neuro-fuzzy inference system approach and particle swarm optimization algorithm (ANFIS-PSO) | |
Zhang et al. | A local semi-supervised ensemble learning strategy for the data‐driven soft sensor of the power prediction in wind power generation | |
Afshari Nia et al. | Convolutional neural network-ANN-E (Tanh): a new deep learning model for predicting rainfall | |
CN114118508A (en) | OD market aviation passenger flow prediction method based on space-time convolution network | |
Prabowo et al. | Continually learning out-of-distribution spatiotemporal data for robust energy forecasting | |
CN112711912A (en) | Air quality monitoring and alarming method, system, device and medium based on cloud computing and machine learning algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |