CN112330158B

CN112330158B - Method for identifying traffic index time sequence based on autoregressive differential moving average-convolution neural network

Info

Publication number: CN112330158B
Application number: CN202011231975.7A
Authority: CN
Inventors: 张学东; 卢剑; 张健钦; 徐志洁; 王家川; 石瑞轩
Original assignee: BEIJING TRANSPORTATION INFORMATION CENTER; Beijing University of Civil Engineering and Architecture
Current assignee: BEIJING TRANSPORTATION INFORMATION CENTER; Beijing University of Civil Engineering and Architecture
Priority date: 2020-11-06
Filing date: 2020-11-06
Publication date: 2023-10-13
Anticipated expiration: 2040-11-06
Also published as: CN112330158A

Abstract

The invention relates to a method for identifying traffic index time sequence based on autoregressive differential moving average-convolution neural network, which comprises the following steps: acquiring a traffic index original data set and generating a traffic index time sequence; converting the traffic index time sequence into a stable sequence, and fitting an autoregressive differential moving average algorithm model according to a Bayesian information criterion matrix to realize traffic index prediction; generating a training traffic index time sequence and a test traffic index time sequence according to the traffic index original data set, extracting traffic index feature information according to the training traffic index time sequence, acquiring an optimal convolutional neural network model, integrating the traffic index feature information into a one-dimensional feature vector by utilizing the optimal convolutional neural network model, determining the mode category of the one-dimensional feature vector according to a Softmax classifier, and further identifying the category to which the test traffic index time sequence belongs.

Description

Method for identifying traffic index time sequence based on autoregressive differential moving average-convolution neural network

Technical Field

The invention relates to a method for identifying traffic index time sequence based on autoregressive differential moving average-convolution neural network.

Background

Traffic is the pulse of a city. With the rapid development of urban economy, traffic jam problems are also becoming more severe, and areas with more developed economy are more prominent. In order to cope with complex and changeable traffic conditions of cities and relieve urban traffic pressure, a series of traffic laws and regulations are set out by traffic management departments to restrict driving regulations, and scientific research institutions also use technologies such as the Internet of things to assist traffic management departments in monitoring road congestion. Although various industries achieve certain effects of reducing the risk of traffic jam from various aspects, the adverse effects of the traffic jam on cities cannot be eliminated at present in the face of the influence of complex environments, emergency conditions, human intervention and other factors. The traffic index is an important index for researching the running state of urban traffic, reflects the quantification result of urban congestion, and has a certain rule in the time dimension. Traffic index is generally represented by a value of 0-10, with larger values representing more congested traffic and worse traffic conditions. Therefore, if the potential characteristics of resident travel are obtained from the historical traffic indexes through the time sequence pattern recognition, the category of the resident travel is distinguished, basic data can be provided for traffic running state research and prediction, and the method has important value for city traffic jam relief.

At present, the traffic passenger flow index prediction method mainly comprises the following steps: kalman filtering recursive algorithm (Recursive algorithm of Kalman filter), gray Theory (Grey Theory), support vector machine (Support Vector Machine), deep Learning (Deep Learning) and other methods. Although iterative estimation models based on Kalman filter recursion algorithms have been widely used in passenger flow predictions, kalman filter recursion algorithms require a large number of matrix and vector operations, and suffer from an insufficient computational efficiency (PeiYan, biswasSwarnendu, fusseldonald S, et al, an elementary introduction to Kalman filtering [ J ]. Communications of the ACM,2019.Baptista M,Henriques E M P,De Medeiros I P,et al.Remaining useful life estimation in aeronautics:Combining data-driven and Kalman filtering [ J ]. Reliability Engineering & System Safety,2019,184 (APR.): 228-239.Yang D.On post-processing day-ahead NWP forecasts using Kalman filtering [ J ]. Solar Energy,2019,182 (APR.): 179-181.). The gray theoretical model achieves the aim of prediction by identifying the degree of dissimilarity of the development trend among the system factors, but has better effect of predicting the data only in a short period and not in a long period (Wang Q, jiang F. Integrating linear and nonlinear forecasting techniques based on grey theory and artificial intelligence to forecast shale gas monthly production in Pennsylvania and Texas of the United States [ J ]. Energy,2019,178.Huiming Duan,Di Wang,Xinyu Pang,Yunmei Liu,Suhua Zeng,A novel forecasting approach based on Multi-Kernel Nonlinear Multivariable Grey Model: a case report, journal of Cleaner Production (2020)). The support vector machine maps the input layers to a high dimensional space and solves the separation hyperplane with very large computational effort (Fan S, chen L.short-Term Load Forecasting Based on an Adaptive Hybrid Method [ J ]. IEEE Transactions on Power Systems,2006, 21 (1): p.392-401.Kalra A,Ahmad S.Using Oceanic-Atmospheric Oscillations for Long Lead Time Streamflow Forecasting [ J ]. Water Resources Research,2009,45 (3) [ Gwo-Fong, lin, et al typhoon flood forecasting using integrated two-stage Support Vector Machine approach [ J ]. Journal of Hydrology, 2013). The Long-term memory network (Long Short Term Memory, LSTM) deep learning model alleviates the gradient vanishing problem in the recurrent neural network model, but the LSTM model requires a linear layer to run in each sequential time step, while the layer requires a large amount of memory bandwidth computation, which takes a Long time to train (Azzouni A, pujolle G.A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction [ J ].2017.Gers F A,Schraudolph N N,Schmidhuber,J u-rgen. Learning Precise Timing with LSTM Recurrent Networks [ J ]. Journal of Machine Learning Research,2003,3 (1): 115-143.Selvin S,Vinayakumar R,Gopalakrishnan E A,et al.Stock price prediction using LSTM,RNN and CNN-sliding window model [ C ]//2017International Conference on Advances in Computing,Communications and Informatics (ICACCI) & IEEE, 2017.). The method for identifying the traffic passenger flow mode mainly comprises the following steps: distance-based pattern recognition and feature-based pattern recognition. Distance-based pattern recognition generally uses Euclidean distance (Euclidean distance) to measure the similarity of traffic and passenger flows, such as the K-Nearest Neighbor (KNN) algorithm (Macqueen J.Some Methods for Classification and Analysis of Multi Variate Observations [ C ]. Proc of Berkeley Symposium on Mathematical Statistics & Probality, 1965. Saroj, kavita. Review: study on Simple K Mean and Modified K Mean Clustering Technique [ J ] International Journal of Computer Science Engineering and Technology,2016,6 (7): 279-281.). Feature-based pattern recognition algorithms typically look for differential subsections to distinguish the categories to which traffic flows belong, e.g., the shape algorithm looks for the most representative consecutive subsequences in the data (Zhu L, lu C, sun y. Time Series Shapelet Classification Based Online Short-Term Voltage Stability Assessment [ J ]. Power Systems, IEEE Transactions on,2016,31 (2): 1430-1439.). Although both types of pattern recognition methods can obtain good classification results under specific conditions, the pattern recognition methods are influenced by multiparty traffic factors, and the traffic index time series data have certain distortion and deformation phenomena, so that the conventional distance-based and feature-based recognition methods have certain defects on the pattern recognition capability of the traffic index time series data.

The autoregressive differential moving average algorithm (Auto-regressive Integrated Moving Average algorithm, ARIMA) refers to a model that is built by regressing a dependent variable only on its hysteresis value and the present and hysteresis values of a random error term in converting a non-stationary time series to a stationary time series (Ensor, bennett K.Time Series Analysis and Its Applications [ J ]. Journal of the American Statistical Association,2002,97 (458): 656-657.). The basic idea of the algorithm is to consider the data formed by the predicted object with time as a random sequence, describe the sequence by using a certain mathematical model, and predict future values according to past values by using the model, wherein the sequence is composed of an autoregressive process, an averaging process and a differential process. ARIMA includes an Auto-regressive Model (AR), a moving average Model (Moving Average Model, MA) and an Auto-Regressive Moving Average Model (ARMA) Model, depending on the stationarity of the time series data and the part involved in the regression analysis. The ARIMA prediction process for time series data is shown in fig. 2.

Prior to time series data prediction, data preprocessing is required, including randomness and stationarity detection. Data can be classified into three types by detection results: a purely random sequence, a stationary non-random sequence, and a non-stationary non-random sequence. The pure random sequence is also called a white noise sequence, and the pure random sequence is not related to each item of the sequence, is completely disordered and randomly distributed, and has no analysis and research values; the mean and variance of the stationary non-random sequence are constants, and fitting is generally performed through a linear model, so that a sequence rule is extracted; the mean and variance of the non-stationary non-random sequence are not determined and need to be converted to a stationary sequence by differential operation. The mean and variance of the stationary sequence data do not vary excessively over a period of time, and the fitted curve of the time sequence maintains the existing morphological continuation for a short time in the future. The plateau sequence includes both a strict plateau and a generalized plateau. Strictly and smoothly refers to data distribution not changing with time; the generalized plateau is that the expected and correlation coefficients of the sequence data do not change. Strict stability is often over-absolute, and data distribution in real life is mostly generalized stability.

The pure randomness test is detected by constructing test statistics, calculating p values corresponding to the test statistics, and if the p values are larger than the significance level alpha, the p values are the pure random sequences. For a non-random sequence, the time sequence is a stationary sequence if the fluctuation range of the time sequence at a certain moment is limited, i.e. there is a mean and a variance, and the periodic variation of the autocovariance is equal to the autocorrelation coefficient. The nature of the time series is generally determined using a time chart and a unit root test method.

Convolutional neural networks are neural networks that implement image processing by back propagation (Zhou Feiyan, jin Linpen, dong jun. Review of Convolutional Neural Network [ J ]. Chinese Journal of Computers,2017 (6)). As shown in fig. 3, it includes an Input layer (Input layer), a convolution layer (convolutional layer), a pooling layer (pooling layer), a full-connection layer (fully connected layer), and an output layer (output layer).

In convolutional neural networks, the input layer is multidimensional data. The input layer of the one-dimensional convolutional neural network is generally a one-dimensional or two-dimensional array; the input layer of the two-dimensional convolutional neural network is a two-dimensional or three-dimensional array; the input layer of the three-dimensional convolution is a four-dimensional array. Because of the characteristics of convolutional neural networks, the algorithm is widely applied in the field of computer image processing, so that input layers in many researches are two-dimensional pixel points or RGB images, and output layers can output the size, coordinates, classification and the like of objects according to research contents.

The function of the convolutional layer is to traverse the input layer through an internal convolutional kernel (convolution kernel) to achieve feature extraction (Palmieri F an, buonano a. Belief propagation and learning in convolution multi-layer factor graphs [ C ]// International Workshop on Cognitive Information processing.ieee, 2014.). The algorithm and size of the convolution kernel is determined according to the size of the input layer, and the convolution kernel is moved a fixed unit length on the input layer each time, as shown in fig. 4. Equation (5) is an expression in which the input layer performs a convolution operation with the convolution kernel.

Wherein, the liquid crystal display device comprises a liquid crystal display device,the result is output after convolution operation; />Convolving for layer j; />Is a bias parameter; f (x) is an activation function (activation function).

In order to solve the linear inseparable problem, the result after convolution operation needs to be converted in a nonlinear way by using an activation function again. Common activation functions include Sigmoid function (equation 6), tanh function (equation 7), and ReLU function (equation 8) (Lecun Y, bottou l.gradient-based learning applied to document recognition [ J ]. Proceedings of the IEEE,1998,86 (11): p.2278-2324.). The calculated amount of the Sigmoid function and the Tanh function is large, and meanwhile, the situation of gradient disappearance or gradient explosion can be caused, so that information is lost, and the neural network training is not facilitated; and the ReLU function can effectively reduce the influence of gradient disappearance (gradient disappearance) and gradient explosion (gradient explosion), so that the calculation process is optimized, the parameter dependence is reduced, and the probability of over-fitting phenomenon is reduced.

After the convolution layer and the pooling layer are processed, the characteristic information of the input layer is extracted, and the input layer is classified by the full-connection layer. The fully-connected layer integrates the feature layers, so that local features integrate information in a high dimension, and output a feature vector (Szegedy, c., liu, w., jia, y., seranet, p., reed, s., anguelov, d., erhan, d., vanhoucke, v.and Rabinovich, a.,2015.Going deeper with convolutions.In Proceedings of the IEEE conference on computer vision and pattern recognition (pp.1-9)) integrating all input layer feature information. For the integrated feature vectors, classification is performed using a Softmax function (Ran pen, wang Ling, li Xin, liu pengwei.improved Softmax Classifier for Deep Convolution Neural Networks and Its Application in Face Recognition [ J ]. Journal of Shanghai University (Natural Science), 2018,24 (03): 352-366.Simone Totaro,Amir Hussain,Simone Scardapane.A Non-parametric Softmax for Improving Neural Attention in Time-series Forecasting [ J ]. Neurostarting, 2019 ]), the probability of each class occurrence is calculated by the formula (9), and the corresponding class with the largest probability value is determined as the classification result according to the probability.

Wherein a is _j A j-th value representing a vector in the fully connected layer; a, a _k Representing each bit value in the full connection layer; t represents a preset number of classification categories.

Time series data refers to a sequence of random variables that change over time, differing from ordinary data in the effect of time on the data. The time intervals in the time series may be years, months, dates, or any other time interval, depending on the time of observation. Time series data is very common in the fields of finance, transportation, etc., such as stock exchanges and traffic volumes. Assume a set of random variables x= { X ₁ ，X ₂ ，……，X _n Time is defined as t= { T } ₁ ，t ₂ ，……，t _n Then define X _t ＝{X ₁ ，X ₂ ，......，X _t T e T is a time sequence within time T.

Disclosure of Invention

In order to solve the technical problems, the invention provides a method for identifying traffic index time sequence based on an autoregressive differential moving average-convolution neural network, which comprises the following steps:

0000, acquiring a traffic index original data set, and generating a traffic index time sequence;

step 1000, converting the traffic index time sequence into a stable sequence, fitting an autoregressive differential moving average algorithm model according to a Bayesian information criterion matrix, and realizing traffic index prediction by using the autoregressive differential moving average algorithm model;

And 2000, generating a training traffic index time sequence and a test traffic index time sequence according to the traffic index original data set, extracting traffic index feature information according to the training traffic index time sequence, acquiring an optimal convolutional neural network model, integrating the traffic index feature information into a one-dimensional feature vector by using the optimal convolutional neural network model, determining the mode category of the one-dimensional feature vector according to a Softmax classifier, and further identifying the category to which the test traffic index time sequence belongs.

Further, the method for generating the traffic index time sequence in the step 0000 includes:

step 0200: and calculating the average traffic index at each moment according to the traffic index original data set.

Step 0400: and selecting a first continuous time sequence and/or a second continuous time sequence, wherein the first continuous time sequence is a continuous time sequence with the average traffic index being greater than a first threshold value, and the second continuous time sequence is a continuous time sequence with the average traffic index being greater than a second threshold value and less than the first threshold value.

Step 0600: and dividing the traffic index original data set according to the traffic index original data set, the first continuous time sequence and/or the second continuous time sequence.

Step 0800: and generating a traffic index time sequence according to the division result of the step 0600.

Further, step 1000 includes:

step 1200: and (3) carrying out stability test on the traffic index time sequence, if the traffic index time sequence is a non-stable time sequence, turning to step 1400, otherwise turning to step 1600.

Step 1400: and (3) carrying out stabilization treatment on the traffic index time sequence, and turning to step 1200.

Step 1600: and selecting a Bayesian information criterion matrix to determine values of parameters p and q in each period according to the traffic index time sequence, and constructing an ARIMA model, wherein p is the autoregressive term number, and q is the moving average term number.

Step 1800: and fitting the traffic index time sequence through the ARIMA model.

In particular, the smoothing method in step 1400 is:

let { x } _t T epsilon T is a group of time series data, B is a backward operator, namely Bx _t ＝x _t-1 RecordingCalled a difference operator, at which time,

generally there are

And (3) performing differential operation on the non-stationary time series data according to the formula (1) and the formula (2) to obtain a stationary sequence.

Further, the method of stationarity check in step 1200 is the ADF check.

In particular, the ARIMA model in step 1600 includes an autoregressive operation and an average movement process.

In step 2000, the CNN framework is formed by two groups of convolution pooling layers, where the convolution pooling layers include a convolution layer, a ReLU activation function, and a maximum pooling layer, where the convolution kernel of the convolution layer is 5*5, and the maximum pooling layer is 2×2.

The invention can accurately predict the traffic index time sequence data and the capability of identifying the mode, and can assist traffic management related departments to conduct traffic dispersion in advance so as to relieve urban traffic pressure.

Drawings

Fig. 1 is a flowchart of a first embodiment of the present invention.

FIG. 2 is a flowchart of ARIMA prediction in the present invention.

Fig. 3 is a schematic diagram of a convolutional neural network according to the present invention.

Fig. 4 is a schematic diagram of convolution operation in the present invention.

FIG. 5 (a) is a training traffic index time series of 05:00-07:15 time series data according to the present invention.

FIG. 5 (b) is a chart of time series data of 07:30-09:45 of training traffic index time series according to the present invention.

FIG. 5 (c) is a 10:00-12:15 time series of training traffic index time series data according to the present invention.

FIG. 5 (d) is a 12:30-14:45 time series of training traffic index time series data according to the present invention.

FIG. 5 (e) is 15:00-17:15 time series data of training traffic index time series in the present invention.

FIG. 5 (f) is a 17:30-19:45 time series of training traffic index time series of the present invention.

FIG. 5 (g) is a 20:00-23:00 time series of training traffic index time series data according to the present invention.

FIG. 6 (a) is an autocorrelation of a time series of 05:00-07:15 training traffic index in the present invention.

FIG. 6 (b) is an autocorrelation chart of the time series 07:30-09:45 of the training traffic index time series of the present invention.

FIG. 6 (c) is a 10:00-12:15 time series autocorrelation chart of the training traffic index time series in the present invention.

FIG. 6 (d) is a 12:30-14:45 time series autocorrelation graph of a training traffic index time series in the present invention.

FIG. 6 (e) is a 15:00-17:15 time series autocorrelation graph of a training traffic index time series in the present invention.

FIG. 6 (f) is a 17:30-19:45 time series autocorrelation graph of a training traffic index time series in the present invention.

FIG. 6 (g) is a 20:00-23:00 time series autocorrelation graph of training traffic index time series in the present invention.

FIG. 7 (a) is a graph showing the time series index difference between 05:00 and 07:15 of the time series of training traffic indexes according to the present invention.

FIG. 7 (b) is a graph showing the time series index differences between 07:30 and 09:45 of the training traffic index time series according to the present invention.

FIG. 7 (c) is a graph showing the time series index difference between 10:00 and 12:15 of the time series of training traffic indexes according to the present invention.

FIG. 7 (d) is a graph showing the time series index difference between 12:30 and 14:45 of the training traffic index time series in the present invention.

FIG. 7 (e) is a 15:00-17:15 time series index difference chart of the training traffic index time series in the present invention.

FIG. 7 (f) is a graph showing the time series index difference between 17:30 and 19:45 of the training traffic index time series in the present invention.

FIG. 7 (g) is a 20:00-23:00 time series index difference chart of the training traffic index time series in the present invention.

FIG. 8 (a) is a graph of the differential autocorrelation of the time series 05:00-07:15 of the training traffic index time series of the present invention.

FIG. 8 (b) is a differential autocorrelation chart of the time series 07:30-09:45 of the training traffic index time series of the present invention.

FIG. 8 (c) is a 10:00-12:15 time series differential autocorrelation diagram of a training traffic index time series in the present invention.

FIG. 8 (d) is a 12:30-14:45 time series differential autocorrelation graph of training traffic index time series in the present invention.

FIG. 8 (e) is a 15:00-17:15 time series differential autocorrelation graph of training traffic index time series in the present invention.

FIG. 8 (f) is a 17:30-19:45 time series differential autocorrelation graph of training traffic index time series in the present invention.

FIG. 8 (g) is a 20:00-23:00 time series differential autocorrelation graph of training traffic index time series in the present invention.

FIG. 9 (a) is a graph of differential partial autocorrelation of a time series of 05:00-07:15 training traffic index time series in the present invention.

FIG. 9 (b) is a differential partial autocorrelation chart of the time series 07:30-09:45 of the training traffic index time series in the present invention.

FIG. 9 (c) is a 10:00-12:15 time series differential partial autocorrelation chart of the training traffic index time series in the present invention.

FIG. 9 (d) is a graph of differential partial autocorrelation of the time series 12:30-14:45 of the training traffic index time series according to the present invention.

FIG. 9 (e) is a 15:00-17:15 time series differential partial autocorrelation graph of training traffic index time series in the present invention.

FIG. 9 (f) is a graph of differential partial autocorrelation of the time series 17:30-19:45 of the training traffic index time series of the present invention.

FIG. 9 (g) is a 20:00-23:00 time series differential partial autocorrelation graph of training traffic index time series in the present invention.

FIG. 10 is a graph showing the comparison of the predicted results of the present invention.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the accompanying drawings. The description herein describes specific embodiments, by way of illustration and not limitation, consistent with the principles of the present invention, which are described in sufficient detail to enable those skilled in the art to practice the invention, other embodiments may be utilized and the structure of elements may be changed and/or replaced without departing from the scope and spirit of the invention. The following detailed description is, therefore, not to be taken in a limiting sense.

As shown in fig. 1, according to a first aspect of the present invention, there is provided a method for identifying traffic index time series based on an autoregressive differential moving average-convolutional neural network, including:

and step 1000, converting the traffic index time sequence into a stable sequence, fitting an autoregressive differential moving average algorithm model according to a Bayesian information criterion matrix, and realizing traffic index prediction by using the autoregressive differential moving average algorithm model.

The traffic index original data set is { X } _ij ∈[X _min ,X _max ]|i∈[1,n ₁ ]，j∈[1,n ₂ ]，n ₁ ，n ₂ ∈N}，X _min ,X _max Respectively minimum and maximum values of traffic index, X _ij For the traffic index at the j-th moment on the i-th day, the average traffic index at the j-th moment

Step 0400: selecting a first continuous time sequence, a first threshold k ₁ ∈[l ₁ *X，X _max ]，l ₁ Is a natural number, preferably 3,is the average of all traffic indexes.

Step 0600: and dividing the traffic index original data set according to the traffic index original data set and the first continuous time sequence.

If the number of times of the first continuous time sequence is n _t Dividing the traffic index data of the ith day into n ₀ For a period of time, preferably, n ₀ Is n ₂ And n _t More preferably, the function of (c) is, representing a rounding down. For example, if the first continuous time series of traffic index data having 20 moments per day includes 5 moments, then n _t ＝5，n ₂ The daily traffic index data was divided into 4 time periods =20.

Step 0800: and generating a traffic index time sequence according to the division result of the step 0600. The specific method is that traffic index data in the same time period of each day are sequentially arranged to generate a traffic index time sequence.

For example, the traffic index raw data set comprises 10 days of data, the data of each day comprises 20 time periods, the traffic index data of each day is divided into 4 sections, then the data of the first time period from 1 st day to 10 th day are sequentially arranged to generate a traffic index time sequence of the first time period, and the traffic index time sequences of the second time period, the third time period and the fourth time period can be obtained by analogy, and the traffic index time sequences of the total 4 traffic index time sequences comprise 50 traffic index data.

The traffic index time sequence generation method has the advantages that the time with higher traffic index has higher probability and can be divided into a time period, so that the peak time period of traffic can be conveniently identified, and countermeasures can be actively carried out. In addition, through the division of time periods, the dimension is reduced for the subsequent data processing, and the operation efficiency is improved.

In particular, step 1000 further comprises:

Step 1600: and selecting a Bayesian information criterion matrix to determine values of parameters p and q in each period according to the traffic index time sequence, constructing an ARIMA model, wherein p is the autoregressive term number, and q is the moving average term number.

Step 1800: fitting to the time series by the ARIMA model.

Further, the method of stationarity check in step 1200 is the ADF check.

In particular, the smoothing method in step 1400 is:

generally there are

Step 2000, generating a training traffic index time sequence and a test traffic index time sequence according to the traffic index original data set, extracting traffic index feature information according to the training traffic index time sequence, obtaining an optimal convolutional neural network model, integrating the traffic index feature information into a one-dimensional feature vector by using the optimal convolutional neural network model, determining the mode category of the one-dimensional feature vector according to a Softmax classifier, and further identifying the category to which the test traffic index time sequence belongs.

According to a second aspect of the present invention, there is also provided a specific embodiment of a method for identifying a traffic index time series based on an autoregressive differential moving average-convolutional neural network:

the method for generating the traffic index time sequence in the step 0000 includes:

Step 0400: a first continuous time sequence and a second continuous time sequence are selected.

Wherein the second threshold k ₂ ∈(l ₂ *X，l ₁ *X)，l ₂ Is a positive real number and l ₂ 1 or more, preferably, l ₂ 1.5.

Step 0600: and dividing the traffic index original data set according to the traffic index original data set, the first continuous time sequence and the second continuous time sequence.

Step 0600 further comprises:

step 0620, if the number of times of the first continuous time sequence is n _t Dividing the traffic index data of the ith day into n ₀ For a period of time, preferably, n ₀ Is n ₂ And n _t More preferably, the function of (c) is, representing a rounding down.

Alternatively, if there are a plurality of first continuous time sequences, n _t Is the average number of instants of the first consecutive time series.

Alternatively, ifThe remainder of the third threshold k is greater than ₃ ，k ₃ Is n _t Is more preferably, ++>Then the traffic index data on the i-th day is divided into n ₀ +1 time period, redundant data as nth ₀ +1 time period data, otherwise divided into n ₀ The redundant data is divided into the last time period data.

For example, if the first continuous time series of traffic index data having 23 moments per day includes 5 moments, then n _t ＝5，n ₂ =23, dividing the daily traffic index data into 5 time periods, the first four time periods including data of 5 moments, and the fifth time period including data of 3 moments; if there are 22 times per day, n _t ＝5，n ₂ =22, dividing the daily traffic index data into 4 time periods, the 4 th time period comprising 6 momentsData.

In step 0640, the traffic index raw data set is divided into a first data subset, a second data subset …, and a Z data subset, Z being a natural number.

Preferably, the traffic index raw data set is divided into monday data subsets, monday data subset … sunday data subsets.

The benefit of using this partitioning method is that the traffic index is generally relatively fixed with the cycle of the week, so partitioning the traffic index raw data set into monday data subsets, the monday data subset … sunday data subset, facilitates finding the traffic index change law within the week. In addition, the ARIMA model is good at short-time sequence prediction, and the calculation efficiency is improved by dividing the traffic index original data set.

Optionally, in step 0660, calculating an average traffic index x of a second continuous time sequence of the first data subset and the second data subset …, where the second continuous time sequence of the second data subset is the Z data subset, obtaining the first data subset, and calculating a second continuous time sequence feature vector z=α×x+β×y, where α, β is a weighting coefficient, where α+β=1, preferably α=0.27, and β=0.73, where the number of times y of the second continuous time sequence of the second data subset …, where the average traffic index is greater than the second threshold and less than the first threshold, is the second continuous time sequence feature vector y. The first data subset and the second data subset … are divided into a first mode subset and a second mode subset … and a Y mode subset according to the Z value, Y is a natural number and is less than or equal to 7.

Further, the specific method for dividing the first data subset, the second data subset … and the Z data subset into the first mode subset and the second mode subset … and the Y mode subset according to the Z value is as follows:

Preferably, z ₁ 、z ₂ …z _Y-1 、z _Y Is z _min And z _max Wherein z is _min And z _max Respectively the minimum and maximum value of the z value. More preferably, the method further comprises the steps of,

optionally, the parameters of the second continuous time sequence that may be acquired further include peak size, valley size, presence or absence of a second peak, second peak-to-peak value, second peak duration, etc.

The meaning of further combining the first data subset and the second data subset … and the Z data subset is that the data with similar traffic index change trend are put together, so that the success rate of the subsequent neural network training is improved.

For example, the traffic index raw data set includes 10 days of data, the data includes 30 time periods each day, step 0620 divides the data into 3 time periods, step 0660 divides the data into 2 modes, then the first time period data of the first mode is sequentially arranged to generate a first time period traffic index time sequence of the first mode, and so on to obtain a first time period traffic index time sequence of the second mode, …, and a third time period traffic index time sequence of the second mode, which includes 6 traffic index time sequences, each traffic index time sequence including 50 traffic index data.

According to a third aspect of the present invention, there is also provided a specific embodiment of a method for identifying a traffic index time series based on an autoregressive differential moving average-convolutional neural network:

step 0000, obtaining a traffic index original data set, wherein the traffic index original data set is a Beijing city traffic index of 2016, 11, 2017, 11 and 11 (without data of 2018, 11, 30 days) of three months, the time interval frequency is 15 minutes, and each day is recorded as 05:00-23: 00 traffic index, 73 values per day.

TABLE 1 Beijing urban traffic index schematic chart

Table 1 shows the partial traffic index for each time slot on each date of beijing. Traffic index time series data can show obvious differences at different times and different dates.

Step 0200: and calculating the average traffic index at each moment according to the traffic index original data set. For example, the average traffic index at time 05:00 is (1.0+0.8+ … +1.1+1.4)/89= 1.1078 (four significant digits, the following).

Step 0400: a first continuous time sequence and a second continuous time sequence are selected. Calculated that the average of all traffic indexes is 1.3273, taking l ₁ =3, having k ₁ ∈[3.9819，8.5]Preferably 4, then the first continuous time sequence is 17:30-19:45 (10 instants); l (L) ₂ 1.5, k ₁ E (1.9909,3.9819), preferably 2, the second consecutive time series is 16:15-17:30 (7 instants).

In step 0620, the number of times of the first continuous time sequence is 10, 73 times are total per day, 73 divided by 10 is 7, and remainder is 3, preferablyThe daily traffic index data is divided into 7 time periods, the 7 th time period including 13 moments.

In step 0640, the traffic index raw data set is divided into a monday data subset, a monday data subset … sunday data subset.

In step 0660, an average traffic index x of a second continuous time sequence of the monday data subset and the sunday data subset … is calculated, the monday data subset is obtained, the number y of times when the average traffic index in the second continuous time sequence of the sunday data subset … is greater than the second threshold and less than the first threshold is calculated, and a second continuous time sequence feature vector z=α×x+β×y, α=0.27, β=0.73.

Taking the monday data subset as an example, the average traffic index x of 16:15-17:30 (7 times) of the monday data subset is 1.7382, the number y of times when the average traffic index is greater than 2 and less than 4 is 0, then the second continuous time sequence feature vector z of the monday data subset=0.27×1.7382+0.73×0= 0.4693, and the second continuous time sequence feature vector z of the monday data subset … is 2.9347, 2.8424, 2.9892, 4.6539, 3.8346, 5.7532 respectively.

From the z values, it can be seen that the z values for monday are the smallest, the z values for friday, are close, and the difference is significant, so the monday data is subset, the sunday data subset … sunday data subset is divided into a monday pattern subset, a mid-week pattern subset, a friday pattern subset, a Saturday pattern subset, and a sunday pattern subset.

TABLE 2 traffic index characterization

Alternatively, as shown in Table 2, the present invention divides traffic index patterns from the factors of peak size, valley size, presence or absence of a second peak, second peak-to-peak value, and second peak duration, etc., into: monday mode, mid-week mode, friday mode, saturday mode, and sunday mode.

Step 0800: the experimental data of the invention is 1 x 73-dimensional time series data, the data time span is larger, and according to the dividing method of the step 0660, the invention divides one day of data into 7 time periods: 05:00-07:15 (10 moments), 07:30-09:45 (10 moments), 10:00-12:15 (10 moments), 12:30-14:45 (10 moments), 15:00-17:15 (10 moments), 17:30-19:45 (10 moments), and 20:00-23:00 (13 moments). As shown in fig. 5 (a) -5 (g), the present invention obtains the corresponding index predicted values from the index data of 7 time periods through ARIMA algorithm, and then combines the predicted values of 7 time periods into a predicted index of one day according to time sequence. Taking the monday pattern data prediction as an example, the method comprises the following steps of using 2016, 11, 2017, 11, 6, 2017, 11, 13, etc predicting 9 days of data in total from 2017, 11, 20, 27, 5, and 12, and finally, comparing the precision with the actual value.

TABLE 3 ADF test values for time periods

Step 1200: and (3) carrying out stability test on the traffic index time series, and drawing an autocorrelation chart of each other according to each time series data predicted by the Monday mode data, as shown in fig. 6 (a) -6 (g). A root-by-root test (ADF) was performed for 7 time periods of monday mode, and the test results are shown in table 3.

Based on the time series plot, the autocorrelation plot, and the ADF detection values, 7 sets of time series data are found to be non-stationary data, and step 1400 is performed.

TABLE 4 ADF test values after differentiation

Step 1400: and (3) carrying out stabilization treatment on the traffic index time sequence, namely carrying out differential operation on seven groups of data, wherein the operation result is shown in fig. 7, and an autocorrelation chart and a partial autocorrelation chart are drawn, and as shown in fig. 8 and 9, the step 1200 is carried out.

TABLE 5 parameter p and parameter q values for each period

Step 1200: the traffic index time series was subjected to a stability test, and the test results are shown in table 4. According to the differential exponential distribution diagram, the autocorrelation diagram, the partial correlation diagram and the result of the ADF inspection table, the p value is smaller than 0.05, and it can be determined that the non-stationary time series data is converted into a stationary sequence through differential operation, and the step 1600 is shifted.

Step 1600: and selecting a Bayesian information criterion matrix to determine the values of parameters p and q in each period according to the traffic index time sequence, and constructing an ARIMA model as shown in a table 5. Where the parameters p and q are the order of the ARIMA model.

Step 1800: fitting (results retain 3 significant digits) to the time series was performed by the ARIMA model as shown in table 6.

TABLE 6 prediction results for each period

7 groups of time series prediction results are integrated to form a 1-73-dimensional traffic index prediction result of a whole day, and the prediction result is compared with an actual traffic index (the traffic index of 11 months 19 days in 2018: [1.3,1.2,1,1,1,1.1,1,1.5,2.2,3.5,5.3,6.5,7.1,7.7,7.7,7.6,7.2,6.8,5.9,6.2,5,4.5,4.1,3.7,3.3,2.9,2.7,2.2,2.1,2.1,1.9,1.8,1.7,1.7,1.7,1.8,2,2.4,2.5,2.6,2.6,2.6, 2.5,2.5,2.5,2.5,2.7,2.8,3,3.4,4,5.8,6.5,7.2,7.3,7.1,6.7,6,4.6,3.5,2.9,2.4,2.1,2.1,1.9,2,1.7,1.8,1.7,1.5,1.6,1.5,1.5 ]) as shown in fig. 10, the overall change of the prediction index is consistent with the actual situation, and the prediction result accords with the Monday mode characteristics.

The invention adopts continuous data of 11 months in 3 years, takes 2016 years 11 months, 2017 years 11 months and 2018 years 11 months 1-15 days as training sets, takes 2018 years 11 months 16-29 days as test sets, marks according to the real category, expressed by 0 and 1, for example, (1, 0) represents a monday pattern, (0, 1, 0) represents a mid-week pattern, (0, 1, 0) represents a friday mode, (0, 1, 0) represents a Saturday mode, and (0,0,0,0,1) represents a sunday mode. The daily data is supplemented with a value of 0 at the first digit of the time series data to form a 10 x 10 dimensional matrix, and labels are marked at the tail of the matrix to distinguish the daily modes, so that an input layer of the convolutional neural network is formed. For example, the traffic index of 2018, 11, and 19 may be expressed as:

The invention adopts a convolutional neural network as a deep learning model, and realizes the experimental process in TensorFlow by using a CPU version under Python language. Even-numbered convolution kernels result in loss of image boundary information and shift of position information, and thus, convolution kernels are typically present in an odd-numbered form. The size of the convolution kernel is determined by the size of the input layer, common dimensions include: 3*3, 5*5 and 7*7, an increase in the convolution kernel side length can lead to a steep increase in the computation effort. In the invention, the receptive field range of the convolution kernel with the size of 7*7 is too large, the calculated amount is large, and the difference of the calculated results is not obvious; 5*5 is larger than 3*3 in the range of the convolution kernel receptive field, so that more characteristic information can be extracted, and in conclusion, the convolution kernel with the size of 5*5 is selected. Based on the operational characteristics of the mean pooling layer and the maximum pooling layer, the mean pooling layer is more prone to preserving background information, while the maximum pooling layer is better at extracting texture characteristics of the neighborhood. The traffic index time sequence analysis method and the traffic index time sequence analysis system mainly divide the modes according to the change rule of the traffic index time sequence, and the value with the most obvious characteristic difference can effectively reflect the change rule of the traffic index, so that the background information characteristics of the traffic index time sequence analysis method and the traffic index time sequence analysis system can be ignored, and the characteristic information of the traffic index can be characterized by searching the most value in the neighborhood. The operation mode of the maximum pooling layer just meets the experimental requirement. The invention uses 2 x 2 maximum pooling layer to realize local feature compression and extraction, and the pooling layer with larger size can cause loss of layer information and reduce robustness of extracted features.

The overall framework adopted by the mode classification part of the invention consists of CNN and Softmax classifiers. The CNN framework consists of two groups of convolution layers, a ReLU activation function and a maximum pooling layer; the Softmax classifier consists of a Softmax function classification layer; the full connectivity layer connects the CNN and Softmax classifiers. Because the traffic index data sample size is smaller, in order to avoid the phenomenon of overfitting, the experiment adopts two groups of convolution kernels and a pooling layer to extract the characteristics. Firstly, performing feature extraction on a 10 x 10 dimensional matrix of an input layer by utilizing a 5*5 convolution kernel, and performing nonlinear conversion by adopting a ReLU activation function after convolution treatment, thereby solving the problems of gradient disappearance, buffer occurrence of a fitting phenomenon and the like; the results after nonlinear conversion of the activation function are processed with a maximum pooling of 2 x 2. The second group of convolution pooling processing is the same as the first, the output layer after the two processing realizes multidimensional data unidimensional through the full-connection layer, the multidimensional data unidimensional is used as an input layer of Softmax function logistic regression, the probability of each mode is output, and the mode corresponding to the maximum probability is taken as the category of the time sequence data mode of the day.

The test set data identification results are shown in table 7. The Monday pattern (11 month 19 day, 11 month 22 day, 11 month 26 day, 11 month 27 day), the Sunday pattern (11 month 20 day, 11 month 21 day, 11 month 28 day, 11 month 29 day), the Sunday pattern (11 month 16 day, 11 month 23 day), the Saturday pattern (11 month 17 day, 11 month 24 day) and the Sunday pattern (11 month 18 day, 11 month 25 day) can be distinguished by the convolution algorithm. However, the classification accuracy of the test set is up to 85.7% when the test set is mistakenly divided into a week mode from the 11 month 22 day and the 11 month 27 day.

TABLE 7 test set experimental results

And similarly, the prediction result is processed to form a 10 x 10 dimensional matrix as an input layer, and the matrix is classified by using a convolutional neural network to obtain a Monday mode which accords with the actual situation. Experimental results show that the ARIMA-CNN model constructed by the invention can be used for realizing the prediction and classification of traffic index time sequence data; and meanwhile, the classification result is used as a judgment standard, a mode threshold line corresponding to the corresponding category and the prediction result are called for judgment, and the abnormal point exceeding the threshold value is used as a key attention point. Traffic departments can conduct traffic control according to the abnormal value as the basis of auxiliary decision, and the risk of urban traffic jam is reduced.

In order to make up for the defect of the traditional traffic jam analysis capability, the invention provides an ARIMA-CNN model for realizing traffic index prediction and traffic index type classification, and searching for abnormal conditions to be generated in traffic so as to achieve the purpose of prediction and early warning. According to the invention, the traffic index data of Beijing city in 2016-2018 is taken as a research object, the traffic index data of Beijing city is checked to be a non-stable sequence by using an ADF, the non-stable sequence is converted into a stable sequence through differential operation, and the traffic index is predicted according to ARIMA fitted by a BIC matrix; and extracting the time sequence mode characteristics of the traffic index by using a CNN algorithm, so as to realize the mode identification of the predicted traffic index. The experimental result shows that the ARIMA-CNN model accurately predicts the Beijing city traffic index data and identifies the prediction result as a Monday mode which is in accordance with the actual situation. The model has the capability of accurately predicting time sequence data and identifying modes, and can assist traffic management related departments to conduct traffic dispersion in advance so as to relieve urban traffic pressure.

In addition, in future research, the influence of factors such as traffic emergency, bad weather, major events and the like on model weight can be considered, and the prediction and decision are synchronously carried out by combining the internet of things technology, so that the risk of urban traffic jam is reduced as much as possible.

Furthermore, other implementations of the invention will be apparent to those skilled in the art from consideration of the specification of the invention disclosed. Embodiments and/or aspects of embodiments may be used in the systems and methods of the present invention alone or in any combination. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.

Claims

1. A method of identifying a time series of traffic indexes based on an autoregressive differential moving average-convolutional neural network, comprising:

step 0200: calculating the average traffic index at each moment according to the traffic index original data set, wherein the traffic index original data set is that，/>，/>Respectively, minimum and maximum of traffic index, < ->Is->Day->Traffic index at each moment, then +.>Average traffic index at each moment；

Step 0400: selecting a first continuous time sequence and or a second continuous time sequence, wherein the first continuous time sequence is a continuous time sequence with the average traffic index being larger than a first threshold value, and the second continuous time sequence is a continuous time sequence with the average traffic index being larger than a second threshold value and smaller than the first threshold value; first threshold value ，/>Is natural number (i.e.)>Average over all traffic indexes; second threshold->，Is positive real number and +.>；

Step 0600: dividing the traffic index raw data set according to the traffic index raw data set and the first continuous time sequence:

if the number of time instances of the first continuous time sequence isWill be->Traffic index data of dayIs divided into->Time period(s)>Is->And->Function of->，/>Representing a downward rounding; if->The remainder of the first signal is greater than a third threshold，/>Then->Traffic index data of day is divided into +.>+1 time period, redundant data as +.>+1 time period data, otherwise split into +.>The redundant data are divided into the data of the last time period;

step 0640, dividing the traffic index original data set into a first data subset, a second data subset … and a Z data subset;

step 0660, calculating an average traffic index x of a second continuous time sequence of the first data subset, the second data subset … and the Z data subset, obtaining the first data subset, calculating a number y of times when the average traffic index is greater than a second threshold and less than the first threshold in the second continuous time sequence of the second data subset … and the Z data subset, and calculating a second continuous time sequence feature vector ，/>For the weighting factor>The method comprises the steps of carrying out a first treatment on the surface of the Dividing the first data subset, the second data subset … and the Z data subset into a first mode subset, a second mode subset … and a Y mode subset according to the Z value, wherein Y is a natural number；

；

、/> 、/>Is->And->Wherein>And->Minimum and maximum value of z value, respectively, +.>；

Step 0800: generating a traffic index time sequence according to the division result of the step 0660;

2. The method of identifying traffic index time series based on an autoregressive differential moving average-convolutional neural network of claim 1, wherein step 1000 further comprises:

step 1200: performing stability test on the traffic index time sequence, if the traffic index time sequence is a non-stable time sequence, turning to step 1400, otherwise turning to step 1600;

step 1400: performing stabilization treatment on the traffic index time sequence, and turning to step 1200;

step 1600: according to the traffic index time sequence, selecting a Bayesian information criterion matrix to determine values of parameters p and q in each period, and constructing an autoregressive differential moving average algorithm model, wherein p is the number of autoregressive terms, and q is the number of moving average terms;

step 1800: fitting the time sequence through the autoregressive differential moving average algorithm model.

3. The method for identifying traffic index time series based on autoregressive differential moving average-convolutional neural network according to claim 2, wherein the smoothing processing method in step 1400 is as follows:

for a set of time series data, B is the shift operator,>record->=1-B, there is

(1)

Then

(2)

4. A method of identifying traffic index time series based on an autoregressive differential moving average-convolutional neural network according to any one of claims 1-3, wherein the convolutional neural network framework in step 2000 is composed of two sets of convolutional pooling layers including a convolutional layer, a ReLU activation function, and a max pooling layer.

5. A method of identifying traffic index time series based on an autoregressive differential moving average-convolutional neural network according to any one of claims 1-3, wherein the convolution kernel of the convolutional layer is 55。

6. A method of identifying traffic index time series based on an autoregressive differential moving average-convolutional neural network according to any one of claims 1-3, wherein the maximum pooling layer is 22。