WO2020010717A1 - Short-term traffic flow prediction method based on spatio-temporal correlation - Google Patents

Short-term traffic flow prediction method based on spatio-temporal correlation Download PDF

Info

Publication number
WO2020010717A1
WO2020010717A1 PCT/CN2018/107987 CN2018107987W WO2020010717A1 WO 2020010717 A1 WO2020010717 A1 WO 2020010717A1 CN 2018107987 W CN2018107987 W CN 2018107987W WO 2020010717 A1 WO2020010717 A1 WO 2020010717A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
traffic flow
prediction
short
spatio
Prior art date
Application number
PCT/CN2018/107987
Other languages
French (fr)
Chinese (zh)
Inventor
戚湧
熊亭
张伟斌
高盼军
Original Assignee
南京理工大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京理工大学 filed Critical 南京理工大学
Priority to AU2018432145A priority Critical patent/AU2018432145A1/en
Publication of WO2020010717A1 publication Critical patent/WO2020010717A1/en

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry

Definitions

  • the invention relates to the technical fields of machine learning methods, traffic flow prediction, and the like, and in particular, to a short-term traffic flow prediction method based on spatio-temporal correlation.
  • ITS intelligent transportation systems
  • the first type is prediction models based on mathematical statistics and calculus, such as reference 1 (traffic flow parameter prediction method based on fuzzy Kalman filter, publication number: CN102629418A), and this type of model is based on observation data Internal statistical characteristics, dynamically processing traffic flow data, and predicting future traffic flow; however, most of these models only use historical flow data to predict, and ignore other factors such as season, climate, and upstream and downstream flow, which is difficult to adapt to traffic Due to the strong randomness of the stream, the accuracy of this type of prediction method is not very high; the second type is a prediction model based on modern science and technology such as machine learning, including support vector machines, neural networks, and models based on chaos theory Etc., such as reference 2 (traffic parameter prediction method based on deep confidence network, publication number: CN106295874A), this type of model usually uses machine learning or artificial intelligence to predict
  • the third type is a combination prediction model, such as reference 3 (a traffic flow prediction method based on a combination of support vector machines and BP neural network, publication number: CN107705556A).
  • the combination model is to use multiple models together.
  • most combination models do not consider the characteristics of the traffic flow, but simply combine them randomly, which results in the model's prediction effect has not significantly improved, and even increased the complexity of the model.
  • the purpose of the present invention is to provide a short-term traffic flow prediction method based on spatio-temporal correlation, so as to improve the good analysis ability and feature mining ability of traffic flow data, and further improve the prediction accuracy of the model.
  • the technical solution to achieve the purpose of the present invention is: a short-term traffic flow prediction method based on spatio-temporal correlation, including the following steps:
  • Step 1 Select a road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain historical short-term traffic flow data of all breakpoints in the selected road segment;
  • Step 2 Determine a prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data
  • Step 3 Verify whether the historical traffic flow data of the predicted breakpoint is periodic based on the short-term traffic flow historical data of the breakpoint;
  • Step 4 Use the normalization method to perform normalization processing on the traffic flow data, and divide the normalized data set into a training data set and a test data set;
  • Step 5 Use the SARIMA model to perform a predictive analysis on the test data set to obtain an initial prediction result
  • Step 6 Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result;
  • Step 7 Compare the test data set with the final prediction data and analyze the errors.
  • the short-term traffic flow historical data of the breakpoint in step 1 refers to data collection date, time, traffic flow speed value and traffic flow value at the breakpoint.
  • step 2 Further, the prediction period described in step 2 is 5 minutes.
  • verifying whether the historical traffic flow data of the prediction breakpoint is periodic in step 3 refers to periodic verification using an autocorrelation function, and the specific process is as follows:
  • the autocorrelation coefficient r k is used to measure the degree of autocorrelation between the sequence values, and r k is the number of observations separated by k periods.
  • the degree of correlation is calculated by the following formula:
  • n represents the length of the time series, That is the average of the time series data
  • X tk represents the sequence value that is k periods away from X t .
  • step 4 is as follows:
  • x represents the traffic flow data after normalization processing
  • min represents the minimum value of the sample data
  • max represents the maximum value of the sample data
  • x represents the data to be normalized.
  • the normalized data set is divided into a training data set and a test data set as described in step 4, specifically: after normalization processing, 80% of the data in the historical traffic flow data is used as the training set, 20% of the data is used as the test set.
  • step 5 the SARIMA model is used to perform a predictive analysis on the test data set to obtain an initial prediction result, which specifically includes the following steps:
  • step (5.1) Check whether the original traffic flow data is a stable sequence: The test result is that the traffic flow data is non-stationary, and it is stabilized; the test result is that the traffic flow data is stable, and directly enter step (5.2);
  • step 6 the prediction result obtained by the SARIMA model is taken as an input feature and is brought into a random forest model to obtain the final prediction result, which specifically includes the following steps:
  • the initial prediction results obtained by the SARIMA model are used as input features reflecting the periodic pattern, and are combined with other input feature combinations into the random forest model.
  • the parameters are adjusted using the grid method to finally obtain the predicted values.
  • comparing the test data set with the final prediction data and analyzing the error described in step 7, specifically includes the following steps:
  • the error analysis is performed on the forecast data through the average percentage error MAPE and root mean square error RMSE.
  • the calculation formula is as follows:
  • n represents the number of test data selected in total
  • u i is the actual traffic volume value in the i-th period. The flow value obtained by the model for the i-th period.
  • the present invention has significant advantages: (1) it can deeply dig the characteristics of the periodic and non-linear parts of the traffic flow data; (2) analyze the traffic data from the perspective of the space-time correlation of the traffic flow, Decomposing it into periodic parts and random fluctuation parts with obvious trends can further improve the prediction accuracy of the model.
  • FIG. 1 is a layout topology diagram of a metadata exchange system of the present invention.
  • FIG. 2 is a structural diagram of a metadata exchange system of the present invention.
  • FIG. 3 is a layout diagram of functional modules of a metadata synchronization subsystem of the present invention.
  • FIG. 4 is a flowchart of metadata exchange and clipping according to the present invention.
  • FIG. 5 is a layout diagram of functional modules of the metadata and directory management subsystem of the present invention.
  • FIG. 6 is a flowchart of a metadata collection and publishing module of the present invention.
  • FIG. 7 is a diagram of a metadata directory service architecture of the present invention.
  • FIG. 8 is a structural diagram of a metadata and directory management subsystem of the present invention.
  • the short-term traffic flow prediction method based on the spatio-temporal correlation of the present invention includes the following steps:
  • Step 1 Select a road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain historical short-term traffic flow data of all breakpoints in the selected road segment;
  • the short-term traffic flow historical data of the breakpoint refers to data collection date, time, traffic flow speed value and traffic flow value at the breakpoint.
  • Step 2 Determine a prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data
  • the prediction period is 5 minutes.
  • Step 3 According to the short-term traffic flow historical data of the breakpoint, verify whether the historical traffic flow data of the predicted breakpoint is periodic.
  • the specific process is as follows:
  • the autocorrelation coefficient r k is used to measure the degree of autocorrelation between the sequence values, and r k is the number of observations separated by k periods.
  • the degree of correlation is calculated by the following formula:
  • n represents the length of the time series, That is the average of the time series data
  • X tk represents the sequence value that is k periods away from X t .
  • Step 4 Use the normalization method to perform normalization processing on the traffic flow data, and divide the normalized data set into a training data set and a test data set;
  • the normalization method is as follows:
  • x represents the traffic flow data after normalization processing
  • min represents the minimum value of the sample data
  • max represents the maximum value of the sample data
  • x represents the data to be normalized.
  • the dividing the normalized data set into a training data set and a test data set is specifically: after normalization processing, 80% of the data in the historical traffic flow data is used as the training set, and 20% of the data As a test set.
  • Step 5 Use the SARIMA model to perform a predictive analysis on the test data set to obtain the initial prediction result, which specifically includes the following steps:
  • step (5.1) Check whether the original traffic flow data is a stable sequence: The test result is that the traffic flow data is non-stationary, and it is stabilized; the test result is that the traffic flow data is stable, and it proceeds directly to step (5.2);
  • Step 6 Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result, which specifically includes the following steps:
  • the initial prediction results obtained by the SARIMA model are used as input features reflecting the periodic pattern, and are combined with other input feature combinations into the random forest model.
  • the parameters are adjusted using the grid method to finally obtain the predicted values.
  • Step 7 Compare the test data set with the final prediction data and analyze the error, which specifically includes the following steps:
  • the error analysis is performed on the forecast data through the average percentage error MAPE and root mean square error RMSE.
  • the calculation formula is as follows:
  • n represents the number of test data selected in total
  • u i is the actual traffic volume value in the i-th period. The flow value obtained by the model for the i-th period.
  • Step 1 Select the road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain the historical short-term traffic flow data of all breakpoints in the selected road segment;
  • Step 2 Determine the prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data
  • Step 3 verify whether the historical traffic flow data of the predicted breakpoint is periodic based on the short-term traffic flow historical data of the breakpoint;
  • Step four normalize the traffic flow data by using a normalization method, and divide the normalized data set into a training data set and a test data set;
  • Step 5 Use the SARIMA model to perform prediction analysis on the test data set to obtain the initial prediction result
  • Step 6 Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result.
  • Step 7 Compare the test data set with the final prediction data and analyze the errors.
  • the traffic flow data is collected through a coil, and the obtained traffic flow data is the number of vehicles passing by a specific breakpoint within a certain time interval.
  • the time interval is 5 minutes.
  • t 1,2, ... T ⁇ , where f t represents the traffic flow parameter of the specific breakpoint of the road network at time t, and the difference between time T and time T + 1
  • the value is the prediction time interval.
  • the prediction time interval used in this example is 5 minutes.
  • periodic verification is performed through an autocorrelation function. Taking the data from 6 am to 24 pm in the day with a time interval of 5 minutes as experimental data, it has been verified that the traffic flow data has a daily periodicity and is 216, which is consistent with the actual situation.
  • the periodic verification chart is shown in Figures 3 and 4.
  • x represents the traffic flow data after normalization processing
  • min represents the minimum value of the sample data
  • max represents the maximum value of the sample data
  • x represents the data to be normalized.
  • data of 25 working days are used as experimental data, of which 20 days of traffic flow data are used as training data, and 5 days of traffic flow data are used as test data.
  • the SARIMA model is a model that can describe seasonal time series. It is a variant of the autoregressive Integral Moving Average (ARIMA) model [14].
  • ARIMA Autoregressive Integral Moving Average
  • a traffic flow sequence ⁇ X t ⁇ can be fitted by the SARIMA (p, d, q) (P, D, Q) S model, where the parameter S represents the length of the set seasonal period, and the parameter d represents the conversion into a stationary sequence.
  • the required number of differences, the meaning of the parameter D is the order of the required seasonal difference;
  • the stationary time series after the difference be ⁇ Y t ⁇ , as shown in equation (2), where B represents the backward shift operator,
  • the traffic flow has the relationship shown in equation (3):
  • the parameter c represents a constant term
  • ⁇ t represents the residual term of the model, and satisfies ⁇ t ⁇ N (0, ⁇ 2 )
  • BS represents a post-season shift operator, and satisfies the following relationship:
  • ⁇ (B) 1- ⁇ 1 B- ⁇ 2 B 2 -...- ⁇ p B p , (5)
  • ⁇ (B S ) 1- ⁇ 1 B S, 1 - ⁇ 2 B S, 2- ... - ⁇ p B S, p , (6)
  • ⁇ (B) 1- ⁇ 1 B- ⁇ 2 B 2 -...- ⁇ q B q , (7)
  • ⁇ (B S ) 1- ⁇ 1 B S, 1 - ⁇ 2 B S, 2 -...- ⁇ q B S, Q , (8)
  • the data amount of the three days before the prediction time t is used as training data, and dynamic prediction is performed in the form of a sliding window, and the model is refitted every 12 times, the parameters are adjusted, and finally the vehicle in the test set is predicted for one week. Traffic data.
  • Random forest is a powerful tool for data mining and machine learning. It is an integrated learning method that combines a large number of regression trees and then obtains prediction results. It combines a large number of weak models into a strong model.
  • the prediction process of RF can be intuitively explained by evaluating the importance of the predictive factor.
  • the algorithm is robust to noise and outliers in the data, can effectively run on big traffic data, and is also good for high-dimensional data. Adaptability.
  • the initial prediction result obtained by the SARIMA model is used as a feature reflecting the periodic pattern, combined with other input features, and brought into the random forest model to obtain the final prediction result.
  • n represents the number of test data selected in total
  • u i is the actual traffic volume value in the i-th period.
  • the method of the present invention deeply explores the randomness and uncertainty of traffic flow data, fully considers the spatio-temporal correlation in traffic flow data, and decomposes the flow data into a periodic part and a random fluctuation part with a clear trend. It is analyzed to improve the prediction accuracy of traffic flow data.

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Development Economics (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Traffic Control Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A short-term traffic flow prediction method based on spatio-temporal correlation. The method comprises the following steps: selecting a road section requiring traffic prediction and break points in the road section, acquiring short-term traffic flow historical data of all of the break points in the selected road section, determining a prediction time period of the short term traffic flow prediction, and verifying whether the historical traffic flow data of the prediction break points has periodicity; after using a normalisation method to normalise the traffic flow data, dividing the data set into a training data set and a testing data set; using a SARIMA model to perform predictive analysis on the testing data set to obtain an initial prediction result; using the prediction result as an input feature, entering same into a random forest model to obtain a final prediction result; comparing the testing data with the final prediction data and analysing errors. The present method breaks down flow data into periodic parts with evident trends and random fluctuation parts for analysis, increasing the precision of traffic flow data prediction.

Description

一种基于时空相关性的短时交通流预测方法Short-term traffic flow prediction method based on spatio-temporal correlation 技术领域Technical field
本发明涉及机器学习方法和交通流预测等技术领域,具体涉及一种基于时空相关性的短时交通流预测方法。The invention relates to the technical fields of machine learning methods, traffic flow prediction, and the like, and in particular, to a short-term traffic flow prediction method based on spatio-temporal correlation.
背景技术Background technique
随着当今社会现代化进程不断加快,城市化水平不断提高,车辆的数目也随之迅速增长,现有的路网通行条件难以满足日益增长的交通通行需求。20世纪初期,智能交通系统(ITS)的概念也应运而生。在ITS中,实时且准确的短时交通流量预测有着至关重要的作用,它不仅影响人们对交通流的控制和诱导,更是系统从被动应变到主动控制的关键。With the acceleration of modernization of society and the continuous improvement of the level of urbanization, the number of vehicles has also increased rapidly. The existing road network conditions are difficult to meet the increasing demand for traffic. In the early 20th century, the concept of intelligent transportation systems (ITS) also came into being. In ITS, real-time and accurate short-term traffic flow prediction plays a vital role. It not only affects people's control and induction of traffic flow, but also the key to the system's transition from passive to active control.
随着短时交通流分析与预测工作的不断深入,研究人员依据不同的分析角度以及应用条件提出了许多模型。这些模型可以分为三类:第一类是基于数理统计以及微积分的预测模型,如文献1(基于模糊卡尔曼滤波的交通流参数预测方法,公开号:CN102629418A),该类模型通过观测数据内部的统计特征,动态地处理交通流数据,预测未来的交通流量;但该类模型大多仅利用历史流量数据来预测,而忽略了季节、气候,上下游流量的影响等其他因素,难以适应交通流随机性强的特点,故该类预测方法的准确率并不是很高;第二类是基于机器学习等现代科学技术为基础的预测模型,包括支持向量机,神经网络,基于混沌理论的模型等,如文献2(基于深信度网络的交通流参数预测方法,公开号:CN106295874A),该类模型通常采用机器学习或者人工智能的方法预测短时交通流量,缺点是往往会忽视交通流数据所固有的一些特性。第三类就是组合预测模型,如文献3(一种基于支持向量机和BP神经网络结合的交通流预测方法,公开号:CN107705556A),顾名思义,组合模型即将多个模型综合在一起使用。但大多数组合模型并没有考虑交通流特征,只是单纯地随机组合,这导致模型的预测效果并没有显著提升,甚至增加了模型的复杂程度。显然,单一预测模型难以兼顾交通流数据中的固有特征,以及季节,气候或者人为因素所造成的外部影响,因此存在难以反映交通流数据中固有的复杂特性,并且无法全面考虑外部空间关联对预测研究的影响等缺陷。With the continuous deepening of short-term traffic flow analysis and prediction, researchers have proposed many models based on different analysis angles and application conditions. These models can be divided into three categories: the first type is prediction models based on mathematical statistics and calculus, such as reference 1 (traffic flow parameter prediction method based on fuzzy Kalman filter, publication number: CN102629418A), and this type of model is based on observation data Internal statistical characteristics, dynamically processing traffic flow data, and predicting future traffic flow; however, most of these models only use historical flow data to predict, and ignore other factors such as season, climate, and upstream and downstream flow, which is difficult to adapt to traffic Due to the strong randomness of the stream, the accuracy of this type of prediction method is not very high; the second type is a prediction model based on modern science and technology such as machine learning, including support vector machines, neural networks, and models based on chaos theory Etc., such as reference 2 (traffic parameter prediction method based on deep confidence network, publication number: CN106295874A), this type of model usually uses machine learning or artificial intelligence to predict short-term traffic flow, but the disadvantage is that the traffic flow data is often ignored. Some inherent characteristics. The third type is a combination prediction model, such as reference 3 (a traffic flow prediction method based on a combination of support vector machines and BP neural network, publication number: CN107705556A). As the name suggests, the combination model is to use multiple models together. However, most combination models do not consider the characteristics of the traffic flow, but simply combine them randomly, which results in the model's prediction effect has not significantly improved, and even increased the complexity of the model. Obviously, it is difficult for a single prediction model to take into account the inherent characteristics of traffic flow data and the external effects caused by seasonal, climatic or human factors. Therefore, it is difficult to reflect the complex characteristics inherent in traffic flow data, and it is impossible to comprehensively consider the external spatial correlation to forecast. Study the impact of other flaws.
发明内容Summary of the invention
本发明的目的在于提供一种基于时空相关性的短时交通流预测方法,以提高对交通 流数据的良好的分析能力以及特征的挖掘能力,并进一步提高模型的预测精度。The purpose of the present invention is to provide a short-term traffic flow prediction method based on spatio-temporal correlation, so as to improve the good analysis ability and feature mining ability of traffic flow data, and further improve the prediction accuracy of the model.
实现本发明目的的技术解决方案为:一种基于时空相关性的短时交通流预测方法,包括以下步骤:The technical solution to achieve the purpose of the present invention is: a short-term traffic flow prediction method based on spatio-temporal correlation, including the following steps:
步骤1,选定需要进行交通流预测的路段以及该路段中的断点,获取所选路段中所有断点的短时交通流量历史数据;Step 1: Select a road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain historical short-term traffic flow data of all breakpoints in the selected road segment;
步骤2,根据获取的短时交通流历史数据,确定短时交通流预测的预测时段;Step 2: Determine a prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data;
步骤3,根据断点的短时交通流量历史数据,验证预测断点的历史交通流量数据是否具有周期性;Step 3: Verify whether the historical traffic flow data of the predicted breakpoint is periodic based on the short-term traffic flow historical data of the breakpoint;
步骤4,利用归一化方法对交通流数据进行归一化处理,将归一化后的数据集划分为训练数据集和测试数据集;Step 4. Use the normalization method to perform normalization processing on the traffic flow data, and divide the normalized data set into a training data set and a test data set;
步骤5,利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果;Step 5. Use the SARIMA model to perform a predictive analysis on the test data set to obtain an initial prediction result;
步骤6,将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果;Step 6. Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result;
步骤7,将测试数据集与最终的预测数据进行比较,并分析误差。Step 7. Compare the test data set with the final prediction data and analyze the errors.
进一步地,步骤1中所述断点的短时交通流历史数据是指数据采集日期,时间,断点处的交通流速度值以及交通流量值。Further, the short-term traffic flow historical data of the breakpoint in step 1 refers to data collection date, time, traffic flow speed value and traffic flow value at the breakpoint.
进一步地,步骤2中所述的预测时段为5分钟。Further, the prediction period described in step 2 is 5 minutes.
进一步地,步骤3中所述验证预测断点的历史交通流量数据是否具有周期性,是指利用自相关函数进行周期性验证,具体过程如下:Further, verifying whether the historical traffic flow data of the prediction breakpoint is periodic in step 3 refers to periodic verification using an autocorrelation function, and the specific process is as follows:
对于构成时间序列的每个序列值X t,X t-1,…X t-k,利用自相关系数r k度量序列值之间的自相关程度,r k即为相隔k期的观测值之间的相关程度,通过以下的公式计算: For each of the sequence values X t , X t-1 , ... X tk constituting the time series, the autocorrelation coefficient r k is used to measure the degree of autocorrelation between the sequence values, and r k is the number of observations separated by k periods. The degree of correlation is calculated by the following formula:
Figure PCTCN2018107987-appb-000001
Figure PCTCN2018107987-appb-000001
其中n代表时间序列的长度,
Figure PCTCN2018107987-appb-000002
即为时序数据的平均值,X t-k则表示与X t相距k期的序列值。
Where n represents the length of the time series,
Figure PCTCN2018107987-appb-000002
That is the average of the time series data, and X tk represents the sequence value that is k periods away from X t .
进一步地,步骤4中所述的归一化方法,具体过程如下:Further, the normalization method described in step 4 is as follows:
分别计算历史交通流数据某一个样本中的最小值min和最大值max,使用min-max 标准化方法对数据进行归一化,使得归一化之后的交通流数据结果映射到[0,1]之间,即根据交通流数据集合F={f t|t=1,2,...T}求得集合中最大值max和最小值min,对集合中的每个数据计算: Calculate the minimum min and maximum max in a sample of historical traffic flow data, and use the min-max normalization method to normalize the data so that the normalized traffic flow data results are mapped to [0,1] In other words, the maximum value max and the minimum value min in the set are obtained according to the traffic flow data set F = {f t | t = 1,2, ... T}, and each data in the set is calculated:
Figure PCTCN2018107987-appb-000003
Figure PCTCN2018107987-appb-000003
其中x’表示归一化处理后的交通流数据,min表示样本数据中的最小值,max表示样本数据最大值,x表示待归一化处理的数据。Where x 'represents the traffic flow data after normalization processing, min represents the minimum value of the sample data, max represents the maximum value of the sample data, and x represents the data to be normalized.
进一步地,步骤4中所述将归一化后的数据集划分为训练数据集和测试数据集,具体为:归一化处理后将历史交通流量数据中百分之80的数据作为训练集,百分之20的数据作为测试集。Further, the normalized data set is divided into a training data set and a test data set as described in step 4, specifically: after normalization processing, 80% of the data in the historical traffic flow data is used as the training set, 20% of the data is used as the test set.
进一步地,步骤5所述利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果,具体包括以下步骤:Further, in step 5, the SARIMA model is used to perform a predictive analysis on the test data set to obtain an initial prediction result, which specifically includes the following steps:
(5.1)检验原始交通流数据是否为平稳序列:检验结果为交通流数据是非平稳的,对其进行平稳化处理;检验结果为交通流数据是平稳的,直接进入步骤(5.2);(5.1) Check whether the original traffic flow data is a stable sequence: The test result is that the traffic flow data is non-stationary, and it is stabilized; the test result is that the traffic flow data is stable, and directly enter step (5.2);
(5.2)依据平稳化时间序列数据的ACF函数与PACF函数以及AIC最小准则,对SARIMA模型的四个参数p,q,P,Q取值;(5.2) According to the ACF function and PACF function of the stationary time series data and the AIC minimum criterion, the four parameters p, q, P, Q of the SARIMA model are valued;
(5.3)预测过程中以预测时刻t前d天的数据量当做训练数据,并采用滑动窗口的形式动态预测,且设定模型每执行n次就重新拟合,并调整参数,最终得出步骤5所述的初始预测结果。(5.3) During the prediction process, the amount of data d days before the prediction time t is used as training data, and dynamic prediction is performed in the form of a sliding window, and the model is refitted every n times, and the parameters are adjusted to finally obtain the steps The initial prediction results described in 5.
进一步地,步骤6所述将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果,具体包括以下步骤:Further, in step 6, the prediction result obtained by the SARIMA model is taken as an input feature and is brought into a random forest model to obtain the final prediction result, which specifically includes the following steps:
将SARIMA模型得到的初始预测结果作为反映周期性模式的输入特征,与其他输入特征组合共同带入随机森林模型中,采用网格法对参数进行调整,最终得出预测值。The initial prediction results obtained by the SARIMA model are used as input features reflecting the periodic pattern, and are combined with other input feature combinations into the random forest model. The parameters are adjusted using the grid method to finally obtain the predicted values.
进一步地,步骤7所述将测试数据集与最终的预测数据进行比较,并分析误差,具体包括以下步骤:Further, comparing the test data set with the final prediction data and analyzing the error described in step 7, specifically includes the following steps:
通过平均百分比误差MAPE和均方根误差RMSE来对预测数据进行误差分析,计算公式如下:The error analysis is performed on the forecast data through the average percentage error MAPE and root mean square error RMSE. The calculation formula is as follows:
Figure PCTCN2018107987-appb-000004
Figure PCTCN2018107987-appb-000004
Figure PCTCN2018107987-appb-000005
Figure PCTCN2018107987-appb-000005
其中n代表共选取测试数据的个数,u i为第i个时段实际车流量值,
Figure PCTCN2018107987-appb-000006
为模型对第i个时段预测得到的流量值。
Where n represents the number of test data selected in total, and u i is the actual traffic volume value in the i-th period.
Figure PCTCN2018107987-appb-000006
The flow value obtained by the model for the i-th period.
本发明与现有技术相比,其显著优点为:(1)能够深入挖掘对交通流数据的周期性部分、非线性部分特征;(2)从交通流的时空相关性角度分析,将流量数据分解成带有明显趋势的周期性部分和随机波动部分,可以进一步提高模型的预测精度。Compared with the prior art, the present invention has significant advantages: (1) it can deeply dig the characteristics of the periodic and non-linear parts of the traffic flow data; (2) analyze the traffic data from the perspective of the space-time correlation of the traffic flow, Decomposing it into periodic parts and random fluctuation parts with obvious trends can further improve the prediction accuracy of the model.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为本发明的元数据交换系统布局拓扑图。FIG. 1 is a layout topology diagram of a metadata exchange system of the present invention.
图2为本发明的元数据交换系统组成结构图。FIG. 2 is a structural diagram of a metadata exchange system of the present invention.
图3为本发明的元数据同步子系统功能模块布局图。FIG. 3 is a layout diagram of functional modules of a metadata synchronization subsystem of the present invention.
图4为本发明的元数据交换与裁剪流程图。FIG. 4 is a flowchart of metadata exchange and clipping according to the present invention.
图5为本发明的元数据与目录管理子系统功能模块布局图。FIG. 5 is a layout diagram of functional modules of the metadata and directory management subsystem of the present invention.
图6为本发明的元数据采集发布模块流程图。FIG. 6 is a flowchart of a metadata collection and publishing module of the present invention.
图7为本发明的元数据目录服务体系结构图。FIG. 7 is a diagram of a metadata directory service architecture of the present invention.
图8为本发明的元数据与目录管理子系统结构图。FIG. 8 is a structural diagram of a metadata and directory management subsystem of the present invention.
具体实施方式detailed description
本发明基于时空相关性的短时交通流预测方法,包括以下步骤:The short-term traffic flow prediction method based on the spatio-temporal correlation of the present invention includes the following steps:
步骤1,选定需要进行交通流预测的路段以及该路段中的断点,获取所选路段中所有断点的短时交通流量历史数据;Step 1: Select a road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain historical short-term traffic flow data of all breakpoints in the selected road segment;
所述断点的短时交通流历史数据是指数据采集日期,时间,断点处的交通流速度值以及交通流量值。The short-term traffic flow historical data of the breakpoint refers to data collection date, time, traffic flow speed value and traffic flow value at the breakpoint.
步骤2,根据获取的短时交通流历史数据,确定短时交通流预测的预测时段;Step 2: Determine a prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data;
例如,所述的预测时段为5分钟。For example, the prediction period is 5 minutes.
步骤3,根据断点的短时交通流量历史数据,验证预测断点的历史交通流量数据是否具有周期性,具体过程如下:Step 3: According to the short-term traffic flow historical data of the breakpoint, verify whether the historical traffic flow data of the predicted breakpoint is periodic. The specific process is as follows:
对于构成时间序列的每个序列值X t,X t-1,…X t-k,利用自相关系数r k度量序列值之间的自相关程度,r k即为相隔k期的观测值之间的相关程度,通过以下的公式计算: For each of the sequence values X t , X t-1 , ... X tk constituting the time series, the autocorrelation coefficient r k is used to measure the degree of autocorrelation between the sequence values, and r k is the number of observations separated by k periods. The degree of correlation is calculated by the following formula:
Figure PCTCN2018107987-appb-000007
Figure PCTCN2018107987-appb-000007
其中n代表时间序列的长度,
Figure PCTCN2018107987-appb-000008
即为时序数据的平均值,X t-k则表示与X t相距k期的序列值。
Where n represents the length of the time series,
Figure PCTCN2018107987-appb-000008
That is the average of the time series data, and X tk represents the sequence value that is k periods away from X t .
步骤4,利用归一化方法对交通流数据进行归一化处理,将归一化后的数据集划分为训练数据集和测试数据集;Step 4. Use the normalization method to perform normalization processing on the traffic flow data, and divide the normalized data set into a training data set and a test data set;
所述的归一化方法,具体过程如下:The normalization method is as follows:
分别计算历史交通流数据某一个样本中的最小值min和最大值max,使用min-max标准化方法对数据进行归一化,使得归一化之后的交通流数据结果映射到[0,1]之间,即根据交通流数据集合F={f t|t=1,2,...T}求得集合中最大值max和最小值min,对集合中的每个数据计算: Calculate the minimum min and maximum max in a sample of historical traffic flow data, and use the min-max normalization method to normalize the data so that the normalized traffic flow data results are mapped to [0,1]. In other words, the maximum value max and the minimum value min in the set are obtained according to the traffic flow data set F = {f t | t = 1,2, ... T}, and each data in the set is calculated:
Figure PCTCN2018107987-appb-000009
Figure PCTCN2018107987-appb-000009
其中x’表示归一化处理后的交通流数据,min表示样本数据中的最小值,max表示样本数据最大值,x表示待归一化处理的数据。Where x 'represents the traffic flow data after normalization processing, min represents the minimum value of the sample data, max represents the maximum value of the sample data, and x represents the data to be normalized.
所述将归一化后的数据集划分为训练数据集和测试数据集,具体为:归一化处理后将历史交通流量数据中百分之80的数据作为训练集,百分之20的数据作为测试集。The dividing the normalized data set into a training data set and a test data set is specifically: after normalization processing, 80% of the data in the historical traffic flow data is used as the training set, and 20% of the data As a test set.
步骤5,利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果,具体包括以下步骤:Step 5. Use the SARIMA model to perform a predictive analysis on the test data set to obtain the initial prediction result, which specifically includes the following steps:
(5.1)检验原始交通流数据是否为平稳序列:检验结果为交通流数据是非平稳的,对其进行平稳化处理;检验结果为交通流数据是平稳的,直接进入步骤(5.2);(5.1) Check whether the original traffic flow data is a stable sequence: The test result is that the traffic flow data is non-stationary, and it is stabilized; the test result is that the traffic flow data is stable, and it proceeds directly to step (5.2);
(5.2)依据平稳化时间序列数据的ACF函数与PACF函数以及AIC最小准则,对SARIMA模型的四个参数p,q,P,Q取值;(5.2) According to the ACF function and PACF function of the stationary time series data and the AIC minimum criterion, the four parameters p, q, P, Q of the SARIMA model are valued;
(5.3)预测过程中以预测时刻t前d天的数据量当做训练数据,并采用滑动窗口的形式动态预测,且设定模型每执行n次就重新拟合,并调整参数,最终得出步骤5所述 的初始预测结果。(5.3) During the prediction process, the amount of data d days before the prediction time t is used as training data, and dynamic prediction is performed in the form of a sliding window, and the model is refitted every n times, and the parameters are adjusted to finally obtain the steps. The initial prediction results described in 5.
步骤6,将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果,具体包括以下步骤:Step 6. Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result, which specifically includes the following steps:
将SARIMA模型得到的初始预测结果作为反映周期性模式的输入特征,与其他输入特征组合共同带入随机森林模型中,采用网格法对参数进行调整,最终得出预测值。The initial prediction results obtained by the SARIMA model are used as input features reflecting the periodic pattern, and are combined with other input feature combinations into the random forest model. The parameters are adjusted using the grid method to finally obtain the predicted values.
步骤7,将测试数据集与最终的预测数据进行比较,并分析误差,具体包括以下步骤:Step 7. Compare the test data set with the final prediction data and analyze the error, which specifically includes the following steps:
通过平均百分比误差MAPE和均方根误差RMSE来对预测数据进行误差分析,计算公式如下:The error analysis is performed on the forecast data through the average percentage error MAPE and root mean square error RMSE. The calculation formula is as follows:
Figure PCTCN2018107987-appb-000010
Figure PCTCN2018107987-appb-000010
Figure PCTCN2018107987-appb-000011
Figure PCTCN2018107987-appb-000011
其中n代表共选取测试数据的个数,u i为第i个时段实际车流量值,
Figure PCTCN2018107987-appb-000012
为模型对第i个时段预测得到的流量值。
Where n represents the number of test data selected in total, and u i is the actual traffic volume value in the i-th period.
Figure PCTCN2018107987-appb-000012
The flow value obtained by the model for the i-th period.
为了更好地理解本发明,下面结合附图和具体实施例对本发明的内容做进一步的说明。In order to better understand the present invention, the content of the present invention will be further described below with reference to the accompanying drawings and specific embodiments.
实施例1Example 1
本实施例中基于时空相关性的短时交通流预测方法,主要流程图及其结构图如图1和图2所示,包括如下步骤:The short-term traffic flow prediction method based on the spatio-temporal correlation in this embodiment, the main flowchart and its structure diagram are shown in Fig. 1 and Fig. 2, including the following steps:
步骤一,选定需要进行交通流预测的路段以及该路段中的断点,获取所选路段中所有断点的短时交通流量历史数据;Step 1: Select the road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain the historical short-term traffic flow data of all breakpoints in the selected road segment;
步骤二,根据获取的短时交通流历史数据,确定短时交通流预测的预测时段;Step 2: Determine the prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data;
步骤三,根据断点的短时交通流量历史数据,验证预测断点的历史交通流量数据是否具有周期性;Step 3: verify whether the historical traffic flow data of the predicted breakpoint is periodic based on the short-term traffic flow historical data of the breakpoint;
步骤四,利用归一化方法对交通流数据进行归一化处理,将归一化后的数据集划分为训练数据集和测试数据集;Step four: normalize the traffic flow data by using a normalization method, and divide the normalized data set into a training data set and a test data set;
步骤五,利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果;Step 5: Use the SARIMA model to perform prediction analysis on the test data set to obtain the initial prediction result;
步骤六,将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果;Step 6. Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result.
步骤七:将测试数据集与最终的预测数据进行比较,并分析误差。Step 7: Compare the test data set with the final prediction data and analyze the errors.
在本实施用例中,交通流数据通过线圈采集得到,获取得到的交通流量数据为特定断点在一定时间间隔内经过的车辆数目,在本实例中,该时间间隔为5分钟。历史观测数据集合表示为F={f t|t=1,2,...T},其中f t表示路网特定断点在t时刻的交通流参数,T时刻与T+1时刻的差值为预测时间间隔,本实例中采用的预测时间间隔为5分钟。 In the use case of this embodiment, the traffic flow data is collected through a coil, and the obtained traffic flow data is the number of vehicles passing by a specific breakpoint within a certain time interval. In this example, the time interval is 5 minutes. The historical observation data set is expressed as F = {f t | t = 1,2, ... T}, where f t represents the traffic flow parameter of the specific breakpoint of the road network at time t, and the difference between time T and time T + 1 The value is the prediction time interval. The prediction time interval used in this example is 5 minutes.
若挖掘分析交通流量的周期性模式,必须先验证数据集具有周期性,本实例通过自相关函数进行周期性验证。以一天中早六点至晚24点,时间间隔为5分钟的数据为实验数据,经验证,交通流量数据具有日周期性,且为216,与实际相符。周期性验证图如图3,4所示。If you analyze and analyze the periodic pattern of traffic flow, you must first verify that the data set is periodic. In this example, periodic verification is performed through an autocorrelation function. Taking the data from 6 am to 24 pm in the day with a time interval of 5 minutes as experimental data, it has been verified that the traffic flow data has a daily periodicity and is 216, which is consistent with the actual situation. The periodic verification chart is shown in Figures 3 and 4.
接着,分别计算历史交通流数据某一个样本中的最小值min和最大值max,使用min-max标准化方法对数据进行归一化,使得归一化之后的交通流数据结果映射到[0,1]之间,即根据交通流数据集合F={f t|t=1,2,...T}求得集合中最大值max和最小值min,对集合中的每个数据计算: Next, calculate the minimum min and maximum max in a sample of historical traffic flow data, and use the min-max normalization method to normalize the data, so that the normalized traffic flow data results are mapped to [0,1 ], That is, the maximum value max and the minimum value min in the set are obtained according to the traffic flow data set F = {f t | t = 1,2, ... T}, and each data in the set is calculated:
Figure PCTCN2018107987-appb-000013
Figure PCTCN2018107987-appb-000013
其中x’表示归一化处理后的交通流数据,min表示样本数据中的最小值,max表示样本数据最大值,x表示待归一化处理的数据。Where x 'represents the traffic flow data after normalization processing, min represents the minimum value of the sample data, max represents the maximum value of the sample data, and x represents the data to be normalized.
本实例选取25个工作日数据作为实验数据,其中20天的交通流数据作为训练数据,5天的交通流数据作为测试数据。In this example, data of 25 working days are used as experimental data, of which 20 days of traffic flow data are used as training data, and 5 days of traffic flow data are used as test data.
SARIMA模型就是一种可以描述季节性时间序列的模型,它是自回归积分移动平均(ARIMA)模型的一种变形形式[14]。The SARIMA model is a model that can describe seasonal time series. It is a variant of the Autoregressive Integral Moving Average (ARIMA) model [14].
假设一个交通流序列{X t}可由SARIMA(p,d,q)(P,D,Q)S模型拟合,其中参数S表示设定的季节周期的长度,参数d表示转换成平稳序列所需的差分次数,参数D的含义为所需季节差分的阶数;设差分后的的平稳时间序列为{Y t},如式(2)所示,其中B表示后移算子,它与交通流量有如式(3)所示关系: Assume that a traffic flow sequence {X t } can be fitted by the SARIMA (p, d, q) (P, D, Q) S model, where the parameter S represents the length of the set seasonal period, and the parameter d represents the conversion into a stationary sequence. The required number of differences, the meaning of the parameter D is the order of the required seasonal difference; let the stationary time series after the difference be {Y t }, as shown in equation (2), where B represents the backward shift operator, The traffic flow has the relationship shown in equation (3):
Y t=(1-B) d(1-B S) DX t  (2) Y t = (1-B) d (1-B S ) D X t (2)
B jX t=X t-j  (3) B j X t = X tj (3)
则SARIMA模型可表示为式(4)的形式:Then the SARIMA model can be expressed in the form of equation (4):
φ(B)Φ(B S)(1-B) d(1-B S) DY t=c+θ(B)Θ(B St  (4) φ (B) Φ (B S ) (1-B) d (1-B S ) D Y t = c + θ (B) Θ (B S ) ε t (4)
其中参数c表示常数项,ε t表示模型的残差项,且满足ε t~N(0,δ 2),BS表示季节后移算子,并满足以下关系: The parameter c represents a constant term, ε t represents the residual term of the model, and satisfies ε t ~ N (0, δ 2 ), and BS represents a post-season shift operator, and satisfies the following relationship:
φ(B)=1-φ 1B-φ 2B 2-…-φ pB p,  (5) φ (B) = 1-φ 1 B-φ 2 B 2 -...- φ p B p , (5)
φ(B S)=1-φ 1B S,12B S,2-…-φ pB S,p,  (6) φ (B S ) = 1-φ 1 B S, 12 B S, 2- … -φ p B S, p , (6)
θ(B)=1-θ 1B-θ 2B 2-…-φ qB q,  (7) θ (B) = 1-θ 1 B-θ 2 B 2 -...- φ q B q , (7)
θ(B S)=1-θ 1B S,12B S,2-…-φ qB S,Q,  (8) θ (B S ) = 1-θ 1 B S, 12 B S, 2 -...- φ q B S, Q , (8)
SARIMA(p,d,q)(P,D,Q) S模型预测的基本步骤如图5所示。本实例中首先检验原始交通流数据是否为平稳序列。检验结果为交通流数据是非平稳的,故对其进行平稳化处理,得出d取1,D取1,S为156;第二步依据处理后的平稳化时间序列的ACF函数与PACF函数以及AIC最小准则,对p,q,P,Q取值。在预测过程中以预测时刻t前三天的数据量当做训练数据,并采用滑动窗口的形式动态预测,且设定模型每执行12次就重新拟合,调整参数,最终预测测试集中一周的车流量数据。 The basic steps of SARIMA (p, d, q) (P, D, Q) S model prediction are shown in Figure 5. In this example, it is first checked whether the original traffic flow data is a stationary sequence. The test result is that the traffic flow data is non-stationary, so it is stabilized, and it is obtained that d takes 1, D takes 1, and S is 156. The second step is based on the processed ACF function and PACF function of the stabilized time series and AIC minimum criterion. Values for p, q, P, Q. In the prediction process, the data amount of the three days before the prediction time t is used as training data, and dynamic prediction is performed in the form of a sliding window, and the model is refitted every 12 times, the parameters are adjusted, and finally the vehicle in the test set is predicted for one week. Traffic data.
随机森林(Random Forest,简称RF)是数据挖掘和机器学习的强大工具,是将大量的回归树结合继而得出预测结果的集成学习方法,通过将大量弱模型组合构建成强模型。RF的预测过程可以通过评估预测因子的重要程度来直观地解释,该算法对于数据中的噪声和异常值具有鲁棒性,可有效地运行在交通大数据上,并且对于高维数据也有很好的适应性。本实例中,将SARIMA模型得到的初始预测结果作为反映周期性模式的特征,与其他输入特征组合在一起,带入随机森林模型中得到最终预测结果。并选取三个时段:早7点至晚20点(时段1),早8点至10点(时段2),下午14点至16点(时段3),对测试数据集合预测数据比较,进行误差分析。误差通过两个指标来评价:即平均百分比误差(MAPE)和均方根误差(RMSE),计算公式如下:Random forest (RF) is a powerful tool for data mining and machine learning. It is an integrated learning method that combines a large number of regression trees and then obtains prediction results. It combines a large number of weak models into a strong model. The prediction process of RF can be intuitively explained by evaluating the importance of the predictive factor. The algorithm is robust to noise and outliers in the data, can effectively run on big traffic data, and is also good for high-dimensional data. Adaptability. In this example, the initial prediction result obtained by the SARIMA model is used as a feature reflecting the periodic pattern, combined with other input features, and brought into the random forest model to obtain the final prediction result. And select three time periods: 7 am to 20 pm (period 1), 8 am to 10 pm (period 2), 14 pm to 16 pm (period 3), and compare the prediction data of the test data set with errors. analysis. The error is evaluated by two indicators: the average percentage error (MAPE) and the root mean square error (RMSE). The calculation formula is as follows:
Figure PCTCN2018107987-appb-000014
Figure PCTCN2018107987-appb-000014
Figure PCTCN2018107987-appb-000015
Figure PCTCN2018107987-appb-000015
其中n代表共选取测试数据的个数,u i为第i个时段实际车流量值,
Figure PCTCN2018107987-appb-000016
为模型第i个时段预测得到的流量值。本发明方法的预测结果与现有方法预测结果的对比图如图6、7、8所示。
Where n represents the number of test data selected in total, and u i is the actual traffic volume value in the i-th period.
Figure PCTCN2018107987-appb-000016
The predicted flow value for the i-th period of the model. The comparison between the prediction results of the method of the present invention and the prediction results of the existing methods is shown in Figures 6, 7, and 8.
综上所述,本发明方法深入挖掘交通流数据的随机性与不确定性,充分考虑交通流数据中的时空相关性,将流量数据分解成带有明显趋势的周期性部分和随机波动部分,加以分析,从而提高了交通流量数据的预测精度。In summary, the method of the present invention deeply explores the randomness and uncertainty of traffic flow data, fully considers the spatio-temporal correlation in traffic flow data, and decomposes the flow data into a periodic part and a random fluctuation part with a clear trend. It is analyzed to improve the prediction accuracy of traffic flow data.

Claims (9)

  1. 一种基于时空相关性的短时交通流预测方法,其特征在于,包括以下步骤:A short-term traffic flow prediction method based on spatio-temporal correlation, which is characterized by including the following steps:
    步骤1,选定需要进行交通流预测的路段以及该路段中的断点,获取所选路段中所有断点的短时交通流量历史数据;Step 1: Select a road segment to be predicted for traffic flow and the breakpoints in the road segment, and obtain historical short-term traffic flow data of all breakpoints in the selected road segment;
    步骤2,根据获取的短时交通流历史数据,确定短时交通流预测的预测时段;Step 2: Determine a prediction period of the short-term traffic flow prediction based on the obtained short-term traffic flow historical data;
    步骤3,根据断点的短时交通流量历史数据,验证预测断点的历史交通流量数据是否具有周期性;Step 3: Verify whether the historical traffic flow data of the predicted breakpoint is periodic based on the short-term traffic flow historical data of the breakpoint;
    步骤4,利用归一化方法对交通流数据进行归一化处理,将归一化后的数据集划分为训练数据集和测试数据集;Step 4. Use the normalization method to perform normalization processing on the traffic flow data, and divide the normalized data set into a training data set and a test data set;
    步骤5,利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果;Step 5. Use the SARIMA model to perform a predictive analysis on the test data set to obtain an initial prediction result;
    步骤6,将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果;Step 6. Take the prediction result obtained by the SARIMA model as an input feature and bring it into the random forest model to obtain the final prediction result;
    步骤7,将测试数据集与最终的预测数据进行比较,并分析误差。Step 7. Compare the test data set with the final prediction data and analyze the errors.
  2. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤1中所述断点的短时交通流历史数据是指数据采集日期,时间,断点处的交通流速度值以及交通流量值。The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that the short-term traffic flow historical data of the breakpoint in step 1 refers to data collection date, time, and traffic at the breakpoint Flow speed value and traffic flow value.
  3. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤2中所述的预测时段为5分钟。The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, wherein the prediction period described in step 2 is 5 minutes.
  4. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤3中所述验证预测断点的历史交通流量数据是否具有周期性,是指利用自相关函数进行周期性验证,具体过程如下:The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that, in step 3, verifying whether the historical traffic flow data of the prediction breakpoint has periodicity refers to using the autocorrelation function to perform the periodicity. Sexual verification, the specific process is as follows:
    对于构成时间序列的每个序列值X t,X t-1,…X t-k,利用自相关系数r k度量序列值之间的自相关程度,r k即为相隔k期的观测值之间的相关程度,通过以下的公式计算: For each of the sequence values X t , X t-1 , ... X tk constituting the time series, the autocorrelation coefficient r k is used to measure the degree of autocorrelation between the sequence values, and r k is the number of observations separated by k periods. The degree of correlation is calculated by the following formula:
    Figure PCTCN2018107987-appb-100001
    Figure PCTCN2018107987-appb-100001
    其中n代表时间序列的长度,
    Figure PCTCN2018107987-appb-100002
    即为时序数据的平均值,X t-k则表示与X t相距k期的序列值。
    Where n represents the length of the time series,
    Figure PCTCN2018107987-appb-100002
    That is the average of the time series data, and X tk represents the sequence value that is k periods away from X t .
  5. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于, 步骤4中所述的归一化方法,具体过程如下:The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, wherein the normalization method described in step 4 is as follows:
    分别计算历史交通流数据某一个样本中的最小值min和最大值max,使用min-max标准化方法对数据进行归一化,使得归一化之后的交通流数据结果映射到[0,1]之间,即根据交通流数据集合F={f t|t=1,2,...T}求得集合中最大值max和最小值min,对集合中的每个数据计算: Calculate the minimum min and maximum max in a sample of historical traffic flow data, and use the min-max normalization method to normalize the data so that the normalized traffic flow data results are mapped to In other words, the maximum value max and the minimum value min in the set are obtained according to the traffic flow data set F = {f t | t = 1,2, ... T}, and each data in the set is calculated:
    Figure PCTCN2018107987-appb-100003
    Figure PCTCN2018107987-appb-100003
    其中x’表示归一化处理后的交通流数据,min表示样本数据中的最小值,max表示样本数据最大值,x表示待归一化处理的数据。Where x 'represents the traffic flow data after normalization processing, min represents the minimum value of the sample data, max represents the maximum value of the sample data, and x represents the data to be normalized.
  6. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤4中所述将归一化后的数据集划分为训练数据集和测试数据集,具体为:归一化处理后将历史交通流量数据中百分之80的数据作为训练集,百分之20的数据作为测试集。The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that, in step 4, the normalized data set is divided into a training data set and a test data set, specifically: After the normalization process, 80% of the historical traffic data is used as the training set, and 20% of the data is used as the test set.
  7. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤5所述利用SARIMA模型对测试数据集进行预测分析,得到初始预测结果,具体包括以下步骤:The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that, in step 5, the SARIMA model is used to perform prediction analysis on the test data set to obtain an initial prediction result, which specifically includes the following steps:
    (5.1)检验原始交通流数据是否为平稳序列:检验结果为交通流数据是非平稳的,对其进行平稳化处理;检验结果为交通流数据是平稳的,直接进入步骤(5.2);(5.1) Check whether the original traffic flow data is a stable sequence: The test result is that the traffic flow data is non-stationary, and it is stabilized; the test result is that the traffic flow data is stable, and it proceeds directly to step (5.2);
    (5.2)依据平稳化时间序列数据的ACF函数与PACF函数以及AIC最小准则,对SARIMA模型的四个参数p,q,P,Q取值;(5.2) According to the ACF function and PACF function of the stationary time series data and the AIC minimum criterion, the four parameters p, q, P, Q of the SARIMA model are valued;
    (5.3)预测过程中以预测时刻t前d天的数据量当做训练数据,并采用滑动窗口的形式动态预测,且设定模型每执行n次就重新拟合,并调整参数,最终得出步骤5所述的初始预测结果。(5.3) During the prediction process, the amount of data d days before the prediction time t is used as training data, and dynamic prediction is performed in the form of a sliding window, and the model is refitted every n times, and the parameters are adjusted to finally obtain the steps The initial prediction results described in 5.
  8. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步骤6所述将SARIMA模型得到的预测结果作为一项输入特征,带入随机森林模型,得到最终的预测结果,具体包括以下步骤:The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that, in step 6, the prediction result obtained by the SARIMA model is used as an input feature to be brought into a random forest model to obtain the final prediction. As a result, it includes the following steps:
    将SARIMA模型得到的初始预测结果作为反映周期性模式的输入特征,与其他输入特征组合共同带入随机森林模型中,采用网格法对参数进行调整,最终得出预测值。The initial prediction results obtained by the SARIMA model are used as input features reflecting the periodic pattern, and are combined with other input feature combinations into the random forest model. The parameters are adjusted using the grid method to finally obtain the predicted values.
  9. 根据权利要求1所述的基于时空相关性的短时交通流预测方法,其特征在于,步 骤7所述将测试数据集与最终的预测数据进行比较,并分析误差,具体包括以下步骤:The short-term traffic flow prediction method based on spatio-temporal correlation according to claim 1, characterized in that, in step 7, the test data set is compared with the final prediction data, and the error is analyzed, which specifically includes the following steps:
    通过平均百分比误差MAPE和均方根误差RMSE来对预测数据进行误差分析,计算公式如下:The error analysis is performed on the forecast data through the average percentage error MAPE and root mean square error RMSE. The calculation formula is as follows:
    Figure PCTCN2018107987-appb-100004
    Figure PCTCN2018107987-appb-100004
    Figure PCTCN2018107987-appb-100005
    Figure PCTCN2018107987-appb-100005
    其中n代表共选取测试数据的个数,u i为第i个时段实际车流量值,
    Figure PCTCN2018107987-appb-100006
    为模型对第i个时段预测得到的流量值。
    Where n represents the number of test data selected in total, and u i is the actual traffic volume value in the i-th period.
    Figure PCTCN2018107987-appb-100006
    The flow value obtained by the model for the i-th period.
PCT/CN2018/107987 2018-07-13 2018-09-27 Short-term traffic flow prediction method based on spatio-temporal correlation WO2020010717A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2018432145A AU2018432145A1 (en) 2018-07-13 2018-09-27 Method for short-term traffic flow prediction based on spatiotemporal correlation

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810772451.5A CN108877223A (en) 2018-07-13 2018-07-13 A kind of Short-time Traffic Flow Forecasting Methods based on temporal correlation
CN201810772451.5 2018-07-13

Publications (1)

Publication Number Publication Date
WO2020010717A1 true WO2020010717A1 (en) 2020-01-16

Family

ID=64301706

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/107987 WO2020010717A1 (en) 2018-07-13 2018-09-27 Short-term traffic flow prediction method based on spatio-temporal correlation

Country Status (3)

Country Link
CN (1) CN108877223A (en)
AU (2) AU2018432145A1 (en)
WO (1) WO2020010717A1 (en)

Cited By (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111465025A (en) * 2020-03-06 2020-07-28 重庆邮电大学 Tourism city 5G network networking method based on novel capacity prediction model
CN111489013A (en) * 2020-03-11 2020-08-04 北京交通大学 Traffic station flow prediction method based on space-time multi-graph convolution network
CN111639787A (en) * 2020-04-28 2020-09-08 北京工商大学 Spatio-temporal data prediction method based on graph convolution network
CN111754775A (en) * 2020-07-03 2020-10-09 浙江大学 Traffic flow prediction method based on feature reconstruction error
CN111860621A (en) * 2020-06-30 2020-10-30 贵州民族大学 Data-driven distributed traffic flow prediction method and system
CN111951037A (en) * 2020-07-02 2020-11-17 西安丝路智慧科技有限公司 Big data-based passenger flow prediction and travel marketing method
CN112071065A (en) * 2020-09-16 2020-12-11 山东理工大学 Traffic flow prediction method based on global diffusion convolution residual error network
CN112232985A (en) * 2020-10-15 2021-01-15 国网天津市电力公司 Power distribution and utilization data monitoring method and device for ubiquitous power Internet of things
CN112651545A (en) * 2020-12-07 2021-04-13 华电郑州机械设计研究院有限公司 Industrial heat load prediction method based on characteristic decomposition
CN112669595A (en) * 2020-12-10 2021-04-16 浙江大学 Online taxi booking flow prediction method based on deep learning
CN112669599A (en) * 2020-12-14 2021-04-16 成都易书桥科技有限公司 Model based on time sequence analysis and residual error matching
CN112798949A (en) * 2020-10-22 2021-05-14 国家电网有限公司 Pumped storage unit generator temperature early warning method and system
CN113077281A (en) * 2021-03-12 2021-07-06 中山大学 Subway passenger flow distribution prediction method and device
CN113112791A (en) * 2021-03-26 2021-07-13 华南理工大学 Traffic flow prediction method based on sliding window long-and-short term memory network
CN113177657A (en) * 2021-04-20 2021-07-27 上海大学 Rail transit passenger flow prediction method and device
CN113190429A (en) * 2021-06-03 2021-07-30 河北师范大学 Server performance prediction method and device and terminal equipment
CN113762338A (en) * 2021-07-30 2021-12-07 湖南大学 Traffic flow prediction method, equipment and medium based on multi-graph attention mechanism
CN113870565A (en) * 2021-10-26 2021-12-31 邢吉平 Method and system for estimating unknown flow of urban road network based on transfer learning model
CN113946795A (en) * 2021-09-17 2022-01-18 山东大学 Ultrasonic flight time estimation method
CN113947025A (en) * 2021-10-26 2022-01-18 山东通广电子有限公司 Method and device for predicting short-term traffic flow, terminal device and storage medium
CN113988414A (en) * 2021-10-27 2022-01-28 内蒙古工业大学 Wind power output power prediction method based on P _ LSTNet and weighted Markov verification
CN114158085A (en) * 2021-11-18 2022-03-08 湖北工业大学 Mobile cellular traffic high-efficiency prediction method based on spatio-temporal aggregation graph convolution network
CN114205190A (en) * 2021-12-03 2022-03-18 中国长江三峡集团有限公司 Autonomous monitoring and coordinating method for Internet of things gateway
CN114202921A (en) * 2021-12-09 2022-03-18 洛阳师范学院 Multi-scale symbol dynamic entropy analysis method for traffic flow
CN114444813A (en) * 2022-02-18 2022-05-06 中南大学 Traffic flow prediction method based on deep learning
CN114492978A (en) * 2022-01-20 2022-05-13 哈尔滨工程大学 Time-space sequence prediction method and device based on multi-layer attention mechanism
CN114529081A (en) * 2022-02-18 2022-05-24 哈尔滨工程大学 Space-time combined traffic flow prediction method and device
CN114548459A (en) * 2022-02-25 2022-05-27 江苏明月软件技术有限公司 Ticket data regulation and control method and system and computer readable storage medium
CN114970350A (en) * 2022-05-30 2022-08-30 西安热工研究院有限公司 Short-term heat supply load combined prediction method based on ARIMA and BP neural network
CN115051925A (en) * 2021-06-18 2022-09-13 北京邮电大学 Time-space sequence prediction method based on transfer learning
CN115240425A (en) * 2022-07-26 2022-10-25 西北师范大学 Traffic prediction method based on multi-scale space-time fusion graph network
CN115547055A (en) * 2022-11-30 2022-12-30 广东省科技基础条件平台中心 Traffic signal lamp coordination control method and device, storage medium and equipment
CN116071932A (en) * 2023-03-09 2023-05-05 华东交通大学 Traffic flow prediction method, system, storage medium and terminal equipment
CN116151478A (en) * 2023-04-03 2023-05-23 江西锦路科技开发有限公司 Short-time traffic flow prediction method, device and medium for improving sparrow search algorithm
CN116153069A (en) * 2023-02-09 2023-05-23 东南大学 Traffic flow model and data fusion driven traffic state estimation method and device
CN116611552A (en) * 2023-05-12 2023-08-18 中国人民解放军92493部队试验训练总体研究所 Ship traffic flow prediction method and system based on target monitoring data
CN117116051A (en) * 2023-10-25 2023-11-24 深圳市交投科技有限公司 Intelligent traffic management system and method based on artificial intelligence
CN117273414A (en) * 2023-11-23 2023-12-22 苏州航天系统工程有限公司 System and method for analyzing and identifying big data of smart city
CN117408393A (en) * 2023-12-06 2024-01-16 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event
CN117455554A (en) * 2023-12-26 2024-01-26 数字苏州建设有限公司 LBS-based large-scale activity key traffic road traffic time-period prediction method

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109377761A (en) * 2018-12-10 2019-02-22 南京理工大学 Traffic factor network establishing method based on Markov-chain model
CN109886387B (en) * 2019-01-07 2021-02-26 北京大学 Traffic time sequence prediction method based on gating network and gradient lifting regression
CN109754126A (en) * 2019-01-30 2019-05-14 银江股份有限公司 Short-time Traffic Flow Forecasting Methods based on temporal correlation and convolutional neural networks
CN109727455A (en) * 2019-03-05 2019-05-07 湖北汇程信息技术有限公司 A kind of processing method of traffic information
CN109961180A (en) * 2019-03-15 2019-07-02 浙江工业大学 A kind of short-term traffic flow forecast method based on temporal correlation
CN110047291B (en) * 2019-05-27 2020-06-19 清华大学深圳研究生院 Short-term traffic flow prediction method considering diffusion process
CN110223517B (en) * 2019-06-20 2021-05-18 青岛科技大学 Short-term traffic flow prediction method based on space-time correlation
CN111653084A (en) * 2019-07-26 2020-09-11 银江股份有限公司 Short-term traffic flow prediction method based on space-time feature selection and Kalman filtering
CN110782658B (en) * 2019-08-16 2022-03-29 华南理工大学 Traffic prediction method based on LightGBM algorithm
CN111027662B (en) * 2019-11-08 2024-01-12 南通大学 SD-LSSVR short-time traffic flow prediction method based on chaotic quantum particle swarm optimization
CN111192453A (en) * 2019-12-30 2020-05-22 深圳市麦谷科技有限公司 Short-term traffic flow prediction method and system based on Bayesian optimization
CN113096388B (en) * 2021-03-22 2022-09-09 北京工业大学 Short-term traffic flow prediction method based on gradient lifting decision tree
CN113313937B (en) * 2021-04-14 2022-05-06 交通运输部规划研究院 Road network traffic flow dynamic simulation method based on land utilization information
CN113222645A (en) * 2021-04-23 2021-08-06 深圳市城市交通规划设计研究中心股份有限公司 Urban hot spot area peak trip demand prediction method based on multi-source data fusion
CN113516845A (en) * 2021-04-27 2021-10-19 中南大学 Road monthly traffic volume prediction method based on SARIMA-NAR combined model
CN113205685B (en) * 2021-04-30 2022-06-10 南通大学 Short-term traffic flow prediction method based on global-local residual error combination model
CN113379099B (en) * 2021-04-30 2022-06-03 广东工业大学 Machine learning and copula model-based highway traffic flow self-adaptive prediction method
CN113392880B (en) * 2021-05-27 2021-11-23 扬州大学 Traffic flow short-time prediction method based on deviation correction random forest
CN113570866B (en) * 2021-09-24 2021-12-21 成都宜泊信息科技有限公司 Parking lot management method and system, storage medium and electronic equipment
CN113870591B (en) * 2021-10-22 2023-08-01 上海应用技术大学 Traffic prediction-based signal control period dividing method, device and equipment
CN113837503B (en) * 2021-11-29 2022-03-29 新风光电子科技股份有限公司 Power grid load prediction method and system based on artificial intelligence
CN114239948B (en) * 2021-12-10 2023-07-21 浙江省交通投资集团有限公司智慧交通研究分公司 Deep traffic flow prediction method, medium and equipment based on time sequence decomposition unit
CN114881371A (en) * 2022-06-21 2022-08-09 华东师范大学 Traffic flow prediction method based on federal learning
CN116978222B (en) * 2023-07-24 2024-04-16 重庆邮电大学 Short-time traffic flow prediction method based on space-time data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104134351A (en) * 2014-08-14 2014-11-05 中国科学院自动化研究所 Short-term traffic flow predicting method
CN106971547A (en) * 2017-05-18 2017-07-21 福州大学 A kind of Short-time Traffic Flow Forecasting Methods for considering temporal correlation
CN107464419A (en) * 2017-08-28 2017-12-12 北京工业大学 A kind of Short-time Traffic Flow Forecasting Methods for considering space-time characterisation
CN107705556A (en) * 2017-09-01 2018-02-16 南京邮电大学 A kind of traffic flow forecasting method combined based on SVMs and BP neural network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104992244A (en) * 2015-07-08 2015-10-21 中国民用航空总局第二研究所 Airport freight traffic prediction analysis method based on SARIMA and RBF neural network integration combination model
CN107563540B (en) * 2017-07-25 2021-03-30 中南大学 Method for predicting short-time bus boarding passenger flow based on random forest
CN107742420A (en) * 2017-09-22 2018-02-27 北京交通大学 It is a kind of to be used for the method that road traffic flow is predicted under emergent traffic incident

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104134351A (en) * 2014-08-14 2014-11-05 中国科学院自动化研究所 Short-term traffic flow predicting method
CN106971547A (en) * 2017-05-18 2017-07-21 福州大学 A kind of Short-time Traffic Flow Forecasting Methods for considering temporal correlation
CN107464419A (en) * 2017-08-28 2017-12-12 北京工业大学 A kind of Short-time Traffic Flow Forecasting Methods for considering space-time characterisation
CN107705556A (en) * 2017-09-01 2018-02-16 南京邮电大学 A kind of traffic flow forecasting method combined based on SVMs and BP neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHEN, JUNGANG ET AL.: "Traffic Flow Forecasting Method Based on Gradient Boosting Decision Tree", CHINESE MASTER S THESES FULL-TEXT DATABASE, ENGINEERING SCIENCE & TECHNOLOGY II, no. 4, 15 April 2018 (2018-04-15), ISSN: 1674-0246 *

Cited By (62)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111465025A (en) * 2020-03-06 2020-07-28 重庆邮电大学 Tourism city 5G network networking method based on novel capacity prediction model
CN111465025B (en) * 2020-03-06 2023-04-07 重庆邮电大学 Tourism city 5G network networking method based on novel capacity prediction model
CN111489013A (en) * 2020-03-11 2020-08-04 北京交通大学 Traffic station flow prediction method based on space-time multi-graph convolution network
CN111639787A (en) * 2020-04-28 2020-09-08 北京工商大学 Spatio-temporal data prediction method based on graph convolution network
CN111639787B (en) * 2020-04-28 2024-03-15 北京工商大学 Spatio-temporal data prediction method based on graph convolution network
CN111860621B (en) * 2020-06-30 2024-05-03 贵州民族大学 Data-driven distributed traffic flow prediction method and system
CN111860621A (en) * 2020-06-30 2020-10-30 贵州民族大学 Data-driven distributed traffic flow prediction method and system
CN111951037A (en) * 2020-07-02 2020-11-17 西安丝路智慧科技有限公司 Big data-based passenger flow prediction and travel marketing method
CN111754775A (en) * 2020-07-03 2020-10-09 浙江大学 Traffic flow prediction method based on feature reconstruction error
CN112071065A (en) * 2020-09-16 2020-12-11 山东理工大学 Traffic flow prediction method based on global diffusion convolution residual error network
CN112232985A (en) * 2020-10-15 2021-01-15 国网天津市电力公司 Power distribution and utilization data monitoring method and device for ubiquitous power Internet of things
CN112232985B (en) * 2020-10-15 2023-02-28 国网天津市电力公司 Power distribution and utilization data monitoring method and device for ubiquitous power Internet of things
CN112798949A (en) * 2020-10-22 2021-05-14 国家电网有限公司 Pumped storage unit generator temperature early warning method and system
CN112651545A (en) * 2020-12-07 2021-04-13 华电郑州机械设计研究院有限公司 Industrial heat load prediction method based on characteristic decomposition
CN112669595B (en) * 2020-12-10 2022-07-01 浙江大学 Network taxi booking flow prediction method based on deep learning
CN112669595A (en) * 2020-12-10 2021-04-16 浙江大学 Online taxi booking flow prediction method based on deep learning
CN112669599A (en) * 2020-12-14 2021-04-16 成都易书桥科技有限公司 Model based on time sequence analysis and residual error matching
CN113077281A (en) * 2021-03-12 2021-07-06 中山大学 Subway passenger flow distribution prediction method and device
CN113112791A (en) * 2021-03-26 2021-07-13 华南理工大学 Traffic flow prediction method based on sliding window long-and-short term memory network
CN113177657A (en) * 2021-04-20 2021-07-27 上海大学 Rail transit passenger flow prediction method and device
CN113177657B (en) * 2021-04-20 2022-08-30 上海大学 Rail transit passenger flow prediction method and device
CN113190429B (en) * 2021-06-03 2022-07-19 河北师范大学 Server performance prediction method and device and terminal equipment
CN113190429A (en) * 2021-06-03 2021-07-30 河北师范大学 Server performance prediction method and device and terminal equipment
CN115051925A (en) * 2021-06-18 2022-09-13 北京邮电大学 Time-space sequence prediction method based on transfer learning
CN115051925B (en) * 2021-06-18 2023-07-18 北京邮电大学 Time-space sequence prediction method based on transfer learning
CN113762338A (en) * 2021-07-30 2021-12-07 湖南大学 Traffic flow prediction method, equipment and medium based on multi-graph attention mechanism
CN113762338B (en) * 2021-07-30 2023-08-25 湖南大学 Traffic flow prediction method, equipment and medium based on multiple graph attention mechanism
CN113946795A (en) * 2021-09-17 2022-01-18 山东大学 Ultrasonic flight time estimation method
CN113870565B (en) * 2021-10-26 2022-12-23 邢吉平 Method and system for estimating unknown flow of urban road network based on transfer learning model
CN113947025B (en) * 2021-10-26 2024-08-27 山东通广电子股份有限公司 Short-time traffic flow prediction method, device, terminal equipment and storage medium
CN113947025A (en) * 2021-10-26 2022-01-18 山东通广电子有限公司 Method and device for predicting short-term traffic flow, terminal device and storage medium
CN113870565A (en) * 2021-10-26 2021-12-31 邢吉平 Method and system for estimating unknown flow of urban road network based on transfer learning model
CN113988414B (en) * 2021-10-27 2024-05-28 内蒙古工业大学 Wind power output power prediction method based on P_ LSTNet and weighted Markov verification
CN113988414A (en) * 2021-10-27 2022-01-28 内蒙古工业大学 Wind power output power prediction method based on P _ LSTNet and weighted Markov verification
CN114158085A (en) * 2021-11-18 2022-03-08 湖北工业大学 Mobile cellular traffic high-efficiency prediction method based on spatio-temporal aggregation graph convolution network
CN114158085B (en) * 2021-11-18 2023-07-21 湖北工业大学 Mobile cellular flow high-efficiency prediction method based on space-time aggregation graph convolution network
CN114205190B (en) * 2021-12-03 2023-07-14 中国长江三峡集团有限公司 Autonomous monitoring coordination method for Internet of things gateway
CN114205190A (en) * 2021-12-03 2022-03-18 中国长江三峡集团有限公司 Autonomous monitoring and coordinating method for Internet of things gateway
CN114202921B (en) * 2021-12-09 2023-08-04 洛阳师范学院 Multi-scale symbol dynamic entropy analysis method for traffic flow
CN114202921A (en) * 2021-12-09 2022-03-18 洛阳师范学院 Multi-scale symbol dynamic entropy analysis method for traffic flow
CN114492978A (en) * 2022-01-20 2022-05-13 哈尔滨工程大学 Time-space sequence prediction method and device based on multi-layer attention mechanism
CN114529081A (en) * 2022-02-18 2022-05-24 哈尔滨工程大学 Space-time combined traffic flow prediction method and device
CN114444813A (en) * 2022-02-18 2022-05-06 中南大学 Traffic flow prediction method based on deep learning
CN114548459A (en) * 2022-02-25 2022-05-27 江苏明月软件技术有限公司 Ticket data regulation and control method and system and computer readable storage medium
CN114548459B (en) * 2022-02-25 2024-04-26 江苏明月软件技术股份有限公司 Ticket data regulation and control method and system and computer readable storage medium
CN114970350A (en) * 2022-05-30 2022-08-30 西安热工研究院有限公司 Short-term heat supply load combined prediction method based on ARIMA and BP neural network
CN115240425A (en) * 2022-07-26 2022-10-25 西北师范大学 Traffic prediction method based on multi-scale space-time fusion graph network
CN115547055B (en) * 2022-11-30 2023-03-28 广东省科技基础条件平台中心 Traffic signal lamp coordination control method and device, storage medium and equipment
CN115547055A (en) * 2022-11-30 2022-12-30 广东省科技基础条件平台中心 Traffic signal lamp coordination control method and device, storage medium and equipment
CN116153069A (en) * 2023-02-09 2023-05-23 东南大学 Traffic flow model and data fusion driven traffic state estimation method and device
CN116153069B (en) * 2023-02-09 2024-01-30 东南大学 Traffic flow model and data fusion driven traffic state estimation method and device
CN116071932A (en) * 2023-03-09 2023-05-05 华东交通大学 Traffic flow prediction method, system, storage medium and terminal equipment
CN116151478A (en) * 2023-04-03 2023-05-23 江西锦路科技开发有限公司 Short-time traffic flow prediction method, device and medium for improving sparrow search algorithm
CN116611552A (en) * 2023-05-12 2023-08-18 中国人民解放军92493部队试验训练总体研究所 Ship traffic flow prediction method and system based on target monitoring data
CN116611552B (en) * 2023-05-12 2024-04-26 中国人民解放军92493部队试验训练总体研究所 Ship traffic flow prediction method and system based on target monitoring data
CN117116051B (en) * 2023-10-25 2023-12-22 深圳市交投科技有限公司 Intelligent traffic management system and method based on artificial intelligence
CN117116051A (en) * 2023-10-25 2023-11-24 深圳市交投科技有限公司 Intelligent traffic management system and method based on artificial intelligence
CN117273414A (en) * 2023-11-23 2023-12-22 苏州航天系统工程有限公司 System and method for analyzing and identifying big data of smart city
CN117408393B (en) * 2023-12-06 2024-03-19 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event
CN117408393A (en) * 2023-12-06 2024-01-16 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event
CN117455554B (en) * 2023-12-26 2024-03-22 数字苏州建设有限公司 LBS-based large-scale activity key traffic road traffic time-period prediction method
CN117455554A (en) * 2023-12-26 2024-01-26 数字苏州建设有限公司 LBS-based large-scale activity key traffic road traffic time-period prediction method

Also Published As

Publication number Publication date
AU2018432145A1 (en) 2020-08-13
CN108877223A (en) 2018-11-23
AU2018102176A4 (en) 2020-09-17

Similar Documents

Publication Publication Date Title
WO2020010717A1 (en) Short-term traffic flow prediction method based on spatio-temporal correlation
Sun et al. Using Bayesian deep learning to capture uncertainty for residential net load forecasting
Wang et al. A compound framework for wind speed forecasting based on comprehensive feature selection, quantile regression incorporated into convolutional simplified long short-term memory network and residual error correction
CN107992968B (en) Electric energy meter metering error prediction method based on integrated time series analysis technology
Wu et al. Traffic incident duration prediction based on support vector regression
CN107944612B (en) Bus net load prediction method based on ARIMA and phase space reconstruction SVR
Lu et al. Short-term demand forecasting for online car-hailing using ConvLSTM networks
CN110119845A (en) A kind of application method of track traffic for passenger flow prediction
CN107292446B (en) Hybrid wind speed prediction method based on component relevance wavelet decomposition
CN115148019A (en) Early warning method and system based on holiday congestion prediction algorithm
CN109308343A (en) A kind of Forecasting of Travel Time and degree of reiability method based on Stochastic Volatility Model
Zhang et al. Speed prediction based on a traffic factor state network model
CN113033110B (en) Important area personnel emergency evacuation system and method based on traffic flow model
Bhatia et al. A simulation-based statistical method for planning modular construction manufacturing.
Zekić-Sušac et al. Cluster analysis and artificial neural networks in predicting energy efficiency of public buildings as a cost-saving approach
Fang et al. Prediction modelling of rutting depth index for asphalt pavement using de-noising method
CN114564487A (en) Meteorological raster data updating method combining forecast prediction
Fışkın et al. Forecasting domestic shipping demand of cement: comparison of SARIMAX, ANN and hybrid SARIMAX-ANN
CN117196286A (en) Building construction safety risk pressure evaluation method based on toughness city theory
Minghui Population Prediction of China Based on ARIMA-LSTM Combined Model
Tzavidis et al. Outlier robust semi-parametric small area methods for poverty estimation
CN118444576B (en) Sewage treatment monitoring method and system based on Internet of things
Falchi et al. Deep Learning and Structural Health Monitoring: A TFT-Based Approach for Anomaly Detection in Masonry Towers
Niu Analyzing and Predicting Temperature Data in Baoji: A SARIMA Model Approach
Nie et al. Employment of Long-Run Equilibrium Relationships in Multivariate Short-Term Traffic Speed Forecasting

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18925974

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2018432145

Country of ref document: AU

Date of ref document: 20180927

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18925974

Country of ref document: EP

Kind code of ref document: A1