CN107180278A - A kind of real-time passenger flow forecasting of track traffic - Google Patents
A kind of real-time passenger flow forecasting of track traffic Download PDFInfo
- Publication number
- CN107180278A CN107180278A CN201710387638.9A CN201710387638A CN107180278A CN 107180278 A CN107180278 A CN 107180278A CN 201710387638 A CN201710387638 A CN 201710387638A CN 107180278 A CN107180278 A CN 107180278A
- Authority
- CN
- China
- Prior art keywords
- mrow
- passenger flow
- msub
- function
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000013598 vector Substances 0.000 claims abstract description 67
- 238000000034 method Methods 0.000 claims abstract description 34
- 238000007781 pre-processing Methods 0.000 claims abstract description 9
- 230000006870 function Effects 0.000 claims description 82
- 238000012549 training Methods 0.000 claims description 16
- 238000004364 calculation method Methods 0.000 claims description 9
- 238000000605 extraction Methods 0.000 claims description 8
- 238000012706 support-vector machine Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 9
- 238000012417 linear regression Methods 0.000 description 9
- 238000004422 calculation algorithm Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000011160 research Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 5
- 238000013507 mapping Methods 0.000 description 5
- 238000010801 machine learning Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000003203 everyday effect Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/40—Business processes related to the transportation industry
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Analysis (AREA)
- Tourism & Hospitality (AREA)
- Operations Research (AREA)
- General Business, Economics & Management (AREA)
- Marketing (AREA)
- Mathematical Optimization (AREA)
- Computational Mathematics (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Quality & Reliability (AREA)
- Entrepreneurship & Innovation (AREA)
- Probability & Statistics with Applications (AREA)
- Game Theory and Decision Science (AREA)
- Algebra (AREA)
- Development Economics (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention relates to a kind of real-time passenger flow forecasting of track traffic, low, the computationally intensive technical problem of predictablity rate present in prior art is mainly solved, the present invention includes by using method:N historical data is gathered as original sample from AFC system, and original sample progress is pre-processed and obtains pre-processing sample;Sample is pre-processed according in, the Passenger flow forecast model in short-term based on support vector regression is set up according to kernel function and fitting regression function, the kernel function is RBF;Anticipation function is fitted using time series vector X and with time series to the corresponding passenger flow vector Y of X amounts as input, the input is fitted anticipation function input step Passenger flow forecast model, passenger flow forecast vector Y in short-termn+1;Technical scheme, the problem is preferably resolved, available in the real-time passenger flow estimation of track traffic.
Description
Technical Field
The invention relates to the field of rail transit passenger flow prediction, in particular to a rail transit real-time passenger flow prediction method.
Background
With the rapid development of the urbanization process, the conflict between the travel demand of urban population and the urban traffic capacity is more prominent. Urban rail transit is distinguished from various traffic modes by the advantages of high speed, high capacity, environmental protection and the like, and becomes a main vehicle for solving traffic jam. The urban rail transit system is applied to construction of various cities, the urban rail transit is transformed from single-line operation to line network operation, the scale and complexity of the urban rail transit system are improved, and meanwhile challenges are provided for network management and development of the rail transit system. The rapid and accurate passenger flow prediction is not only the basis for scientifically making a driving plan, but also an important basis for adjusting an operation plan in real time, and is beneficial to traffic operation management to play a role more comprehensively and excellently. The Support Vector Machine (SVM) is used as a convex quadratic programming problem, and the obtained extreme value solution is guaranteed to be a global optimal solution. These characteristics make SVMs sufficient as an excellent data-based machine learning method. The SVM has outstanding performance in solving the problems of small samples, high-dimensional pattern recognition and the like, and can be applied and popularized to function fitting and other related machine learning researches. Different from the traditional machine learning algorithm, the SVM maps the original sample space to a high-dimensional feature space, and obtains the optimal linear classification surface in a new space. This mapping transformation, i.e. the non-linear transformation, is implemented using a suitable inner product function. The SVM successfully solves the local minimum value problem and the high-dimensional problem. And various learning algorithms such as a Bayes classifier, a radial basis function method, a multilayer perceptron network and the like can be realized by defining different inner product functions. The SVM uses a large interval factor to control the training process of machine learning, so that the SVM selects only the classification hyperplane with the maximum classification interval. The SVM algorithm has a complete theoretical basis, and has very good generalization performance in the application of some fields, so that the SVM algorithm has unsophisticated effects in the solution of classification, regression and density function estimation, and is successfully applied to the aspects of regression estimation, mode recognition and the like. Such as text classification, speech recognition, etc. SVMs were originally designed for classification problems and are widely used. In recent years, it has also shown very good performance in terms of regression problems.
The existing rail transit real-time passenger flow prediction method comprises the step that Roos J provides a dynamic Bayesian network method for predicting short-term passenger flow in a Paris railway network, and the method has a good processing effect on the condition of data insufficiency caused by system failure. The existing rail transit real-time passenger flow prediction method has the technical problems of low prediction accuracy and large calculation amount. Therefore, it is necessary to provide a rail transit real-time passenger flow prediction method with high prediction accuracy and small calculation amount.
Disclosure of Invention
The invention aims to solve the technical problems of low prediction accuracy and large calculation amount in the prior art. The rail transit real-time passenger flow prediction method has the characteristics of high prediction accuracy and small calculation amount.
In order to solve the technical problems, the technical scheme is as follows:
a rail transit real-time passenger flow prediction method, comprising:
(1) acquiring n pieces of historical data from an automatic fare collection system as original samples, and preprocessing the original samples to obtain preprocessed samples, wherein the preprocessed samples comprise time sequence vectors X and passenger flow vectors Y corresponding to the time sequence vectors X;
(2) establishing a short-time passenger flow prediction model based on a support vector regression machine according to the preprocessed samples in the step (1) and a kernel function and a fitting regression function, wherein the kernel function is a radial basis function;
(3) taking the time series vector X and the passenger flow vector Y corresponding to the time series vector X as input fitting prediction functions, inputting the input fitting prediction functions into the short-time passenger flow prediction model based on the support vector regression in the step (2), and predicting the passenger flow vector Yn+1;
Wherein X ═ t1,t2,...,tn},Y={y1,y2,...,yn}。
The working principle of the invention is as follows: the time series is a sequence of numbers ordered by time. The analysis process of the time series is to use the statistically observed time series data as a sample and build a model for predicting the occurrence of future events. The method comprises the following two steps: first, the continuity of event development is acknowledged; secondly, the randomness of the occurrence of the event is considered. The time series prediction mainly reflects three change rules mainly including periodic change, trend change and random change. The short-time passenger flow data has the change rule, and the prediction of the short-time passenger flow aims to help the rail transit operation safety operation and improve the service efficiency and the service quality. The passenger flow trend can be accurately predicted by establishing a high-precision prediction model. The passenger flow data has complexity and mutability, so that a linear prediction method is excluded, and a non-linear prediction method is selected.
Because the original samples are a large amount of data collected from the real world, and diversity, uncertainty and complexity exist between real production and actual life and scientific research, the collected original data are scattered, and the standard degree of the knowledge acquisition research conforming to the prediction algorithm is low. Therefore, the raw data must be processed and converted first before prediction is performed.
The method comprises the steps of collecting n pieces of historical data from an automatic fare collection system as original samples, preprocessing the original samples to obtain preprocessed samples, and establishing a short-time passenger flow prediction model based on a support vector regression machine according to a kernel function and a fitting regression function, wherein the kernel function is a radial basis function; and predicting future passenger flow according to a short-time passenger flow prediction model based on a support vector regression. The support vector regression model has only one type of sample points, and the optimal hyperplane sought is to minimize the total deviation value of all the sample points from the hyperplane. At this point, when the sample points are all contained between the two boundaries, finding the optimal regression hyperplane is equivalent to finding the maximum separation. The nonlinear support vector regression mainly uses a predetermined nonlinear mapping to map an input vector to a high-dimensional feature space, and performs linear regression in the high-dimensional feature space, so as to obtain the same effect as the original spatial nonlinear regression. In several kernel functions, a linear kernel function cannot process input values, the number of parameters of a high-dimensional kernel in a radial basis kernel function is less than that of a polynomial kernel function, but in the training process of a support vector regression machine, the training time required by the experiment of the polynomial kernel function is far longer than that of the radial basis kernel function, and when a Sigmoid kernel function is adopted, values in some parameters are wrong. Thus, the present invention employs a radial basis kernel function.
In the foregoing technical solution, for optimization, further, the establishing a short-time passenger flow prediction model based on a support vector regression includes:
(A) using the preprocessed samples as a given training set T { (x)1,y1),…,(xn,yn)}∈(R×y)n;
(B) According to the kernel function K (x, x'), the calculation precision is greater than 0 and the penalty function C is greater than 0, a convex quadratic programming target function is constructed and solved;
K(xi,xj)=Φ(xi)·Φ(xj);
(C) calculating a decision function according to a quadratic programming target function and a nonlinear fitting function:
the nonlinear fitting function is:
the calculated decision function is:
(D) according to a decision function, taking n pieces of historical data as a training set, performing regression prediction on an n + 1-dimensional hyperplane, and calculating to obtain the short-time passenger flow prediction model as follows:
f(x+1)=β1f(x)+β2f(x-1)+...+βnf(x-n+1);
where K (x, x') is the kernel function, xi∈Rn,yi∈ y R, i 1 …, n, f (x +1) indicates the passenger flow at time x +1, βiRepresents a weight coefficient, i ═ 1, 2.
Further, the radial basis function is a gaussian kernel function:
further, the passenger flow vector Yn+1Including short-term traffic prediction and peak prediction.
Further, the peak prediction includes an early peak prediction and a late peak prediction.
Further, the preprocessing comprises key information extraction, data arrangement and minute-level passenger flow volume entering and exiting statistics.
Further, the key information extraction comprises extracting parameters of the card swiping time, the platform and the traffic card type 3 as key information.
Further, the statistical frequency of the minute-scale passenger flow volume statistics is not less than once every 5 minutes.
Support vector regression, i.e., support vector regression machines, is generally classified into linear regression and nonlinear regression. For linear regression, the sample data is estimated using a linear regression function. For nonlinear regression, data is mapped to a high-dimensional feature space through nonlinear mapping, linear regression is carried out in the space, the nonlinear regression adds a step of increasing dimensions, and the linear regression is carried out in the high-dimensional space to replace the nonlinear regression in a low-dimensional space, so that complex dot product operation in the high-dimensional space is omitted. The support vector regression algorithm is converted into the solution of the convex quadratic programming problem, and the scale of the solution is twice of that of the SVM classification problem under the same sample size.
Support vector regression mainly includes both linear and non-linear cases.
In a linear support vector regression machine, a training set is setFinding parameters in the regression function f (x, α) ═ ω · x + b using-insensitive loss function calculationAndnamely:
(ω·xi+b)-yi≤+ξi,i=1,…,l
wherein, ξiAndis a relaxation variable, introduces Lagrange function
Wherein Lagrange multipliers satisfy
Let L pair b, w, ξ(*)Is 0 with respect to α(*)Maximum value of (a):
non-linear regression machineIn the first place, non-linear mapping is usedMapping the corresponding data to a high-dimensional feature space, and performing linear regression in the high-dimensional feature space. In the optimization process, only the characteristic space inner product operation is considered, so that the kernel function k (x, y) is used for replacingNon-linear regression can be achieved. The nonlinear regression is:
wherein,
find α(*)Value of (1), mostα with values of 0 and not 0(*)The corresponding sample is the support vector. The expression of f (x) is:
wherein, b is:
any support vector can calculate the b value.
The method provided by the invention is used for verifying the effectiveness of the prediction model of the support vector regression machine and comparing the fitting effect of the Gaussian kernel function and the RBF kernel function. The realization effect shows that the passenger flow prediction model based on the support vector regression has certain prediction capability, and the prediction conclusion is effective. Among the two kernel functions, the prediction effect of the Gaussian kernel function is obviously better than that of the RBF kernel function, and preferably, the Gaussian kernel function is selected as the optimal choice. The original sample is obtained from an Automatic Fare Collection (AFC) system in practical application, and the obtained passenger access information at any time is messy due to a slightly large experimental data volume. Therefore, an integration and key information extraction are required for a huge data set. In the selection of key information, the invention selects three kinds of key information from 7 kinds of labels, wherein the three kinds of key information comprise card swiping time, platform and traffic card type. The invention traverses all experimental data and arranges all contents of key information. The original data form is disordered, so that experimental prediction is not facilitated. Therefore, the data are arranged according to a certain rule, so that the subsequent experiment can be favorably carried out. The invention predicts the travel passenger flow of different stations, takes the station number in the station list arranged in the previous step as a classification label, and classifies the entrance and exit records of the rail transit network in a certain time. The original data of the invention contains a plurality of information, and each piece of information only records the access state of a certain passenger at a certain moment. If a prediction model is to be built for passenger flow prediction, the original data must be counted. Therefore, the number of the inbound people and the number of the outbound people of the station are counted every 5 minutes by traversing each sorted station data file.
The invention has the beneficial effects that:
the method has the advantages that the first effect is that the prediction accuracy is improved;
the second effect is that the prediction efficiency is improved;
and the third effect is that the data is preprocessed, so that the diversity, uncertainty and complexity of prediction are reduced.
Drawings
The invention is further illustrated with reference to the following figures and examples.
FIG. 1 is a schematic diagram of a data preprocessing flow.
Fig. 2, No. 0321 station arrival prediction support vector regression model.
Fig. 3, No. 0321 station outbound prediction support vector regression model.
Figure 4, No. 0212 station arrival prediction support vector regression model.
Figure 5, No. 0212 station outbound predicted support vector regression model.
The regression model of station entry prediction support vector of 0315 in fig. 6.
The model of the predictive support vector regression for station number 0315 in fig. 7.
Fig. 8, No. 0613 platform early peak prediction support vector regression model.
Fig. 9, 0210 station early peak prediction support vector regression model.
The model of model vector regression is supported by station late peak prediction of fig. 10 and 0613.
Figure 11, 0210 station late peak prediction support vector regression model.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Example 1
The embodiment provides a rail transit real-time passenger flow prediction method, which comprises the following steps:
(1) acquiring n pieces of historical data from an automatic fare collection system as original samples, and preprocessing the original samples to obtain preprocessed samples, wherein the preprocessed samples comprise time sequence vectors X and passenger flow vectors Y corresponding to the time sequence vectors X;
(2) establishing a short-time passenger flow prediction model based on a support vector regression machine according to the preprocessed samples in the step (1) and a kernel function and a fitting regression function, wherein the kernel function is a radial basis function;
(3) taking the time series vector X and the passenger flow vector Y corresponding to the time series vector X as input fitting prediction functions, inputting the input fitting prediction functions into the short-time passenger flow prediction model based on the support vector regression in the step (2), and predicting the passenger flow vector Yn+1;
Wherein X ═ t1,t2,...,tn},Y={y1,y2,...,yn}。
Specifically, the establishing of the short-time passenger flow prediction model based on the support vector regression includes:
(A) using the preprocessed samples as a given training set T { (x)1,y1),…,(xn,yn)}∈(R×y)n;
(B) According to the kernel function K (x, x'), the calculation precision is greater than 0 and the penalty function C is greater than 0, a convex quadratic programming target function is constructed and solved;
K(xi,xj)=Φ(xi)·Φ(xj);
(C) calculating a decision function according to a quadratic programming target function and a nonlinear fitting function:
the nonlinear fitting function is:
the calculated decision function is:
(D) according to a decision function, taking n pieces of historical data as a training set, performing regression prediction on an n + 1-dimensional hyperplane, and calculating to obtain the short-time passenger flow prediction model as follows:
f(x+1)=β1f(x)+β2f(x-1)+...+βnf(x-n+1);
where K (x, x') is the kernel function, xi∈Rn,yi∈ y R, i 1 …, n, f (x +1) indicates the passenger flow at time x +1, βiRepresents a weight coefficient, i ═ 1, 2.
Preferably, further, the radial basis function is a gaussian kernel function:
in particular, the passenger flow vector Yn+1Including short-term traffic prediction and peak prediction.
More specifically, the peak forecast includes an early peak forecast and a late peak forecast.
Because the original samples are a large amount of data collected from the real world, and diversity, uncertainty and complexity exist between real production and actual life and scientific research, the collected original data are scattered, and the degree of conforming to a prediction algorithm to perform knowledge acquisition research is low.
Preferably, as shown in fig. 1, the preprocessing includes key information extraction, data sorting and minute-level incoming and outgoing passenger flow statistics. Specifically, the key information extraction in this embodiment includes extracting parameters of the card swiping time, the platform and the traffic card type 3 as key information.
The original sample is obtained from an automatic ticket selling and checking system in practical application, and due to the large data volume, the obtained passenger entering and exiting information at any time is messy. Therefore, an integration and key information extraction are required for huge data sets. In the selection of the key information, three tags with more research value are selected from the 7 tags as the key information, which include the card-swiping time, the platform and the traffic card type. In the embodiment, all experimental data are traversed in the experimental process, and all contents of the key information are sorted. And (4) finishing results: table 1 is a data sample keyword time list, table 2 is a data sample keyword platform list, and table 3 is a data sample keyword card class number list.
Time of day | Time of day |
20140101 | 20140117 |
20140102 | 20140118 |
20140103 | 20140119 |
20140104 | 20140120 |
20140105 | 20140121 |
20140106 | 20140122 |
20140107 | 20140123 |
20140108 | 20140124 |
20140109 | 20140125 |
20140110 | 20140126 |
20140111 | 20140127 |
20140112 | 20140128 |
20140113 | 20140129 |
20140114 | 20140130 |
20140115 | 20140131 |
20140116 |
TABLE 1
Station number | Station number | Station number | Station number |
0102 | 0201 | 0301 | 0609 |
0103 | 0202 | 0302 | 0610 |
0104 | 0203 | 0303 | 0611 |
0105 | 0204 | 0304 | 0613 |
0106 | 0205 | 0305 | 0614 |
… | … | … | … |
0120 | 0215 | 0336 | 0623 |
0121 | 0216 | 0337 | 0625 |
0122 | 0217 | 0338 | 0626 |
0123 | 0218 | 0339 | 0628 |
TABLE 2
Card number | Card type | Card number | Card type |
00 | Common card for livable | 05 | Yijuyue Ticket |
14 | Free card for living in | 77 | Free counting card for livable living |
15 | Livable employee's card | 82 | Official ticket |
20 | Love preferential card | 88 | Employee's ticket |
03 | Card for student living in livable house | 89 | One-way commemorative ticket |
44 | Rail police on-duty card | 94 | Track order ticket |
48 | Track service card | 98 | One-way ticket |
TABLE 3
The validation data collection times shown in table 1 ranged from 1 month 1 day 2014 to 1 month 31 day 2014. As can be seen from table 2, the first two digits of the stations of the statistical data represent the names of the rail transit lines, and the last two digits represent the station numbers and increase from beginning to end. Wherein the missing station number indicates that the station on the line has not been opened. The traffic card type statistics are shown in table 3, and the card type name corresponding to the number is also correspondingly shown.
The original data form is disordered, so that experimental prediction is not facilitated. Therefore, the data are arranged according to a certain rule, so that the subsequent experiment can be favorably carried out. And planning to predict the travel passenger flow of different stations, and taking the station numbers in the station list sorted in the previous step as classification labels to classify the access records of the rail transit network in one month. Each site file contains records of the flow of passengers and passengers for that site in month 1 of 2014. The original data is more, and each piece of information only records the access state of a certain passenger at a certain moment. If a prediction model is built for passenger flow prediction, the calculation amount can be reduced by counting the original data. Thus, each consolidated station data file is traversed.
Preferably, the number of inbound people and the number of outbound people for the site are counted every 5 minutes, and the statistics are shown in Table 4.
Counting the start time | Statistical cut-off time | The number of people entering the room | Number of people going out |
20140112-070000 | 20140112-070500 | 29 | 63 |
20140112-070500 | 20140112-071000 | 18 | 67 |
20140112-071000 | 20140112-071500 | 23 | 25 |
20140112-071500 | 20140112-072000 | 18 | 86 |
20140112-072000 | 20140112-072500 | 21 | 143 |
20140112-072500 | 20140112-073000 | 24 | 112 |
20140112-073000 | 20140112-073500 | 25 | 36 |
20140112-073500 | 20140112-074000 | 43 | 142 |
20140112-074000 | 20140112-074500 | 30 | 135 |
TABLE 4
The prediction result of the embodiment is as follows:
short-time passenger flow prediction: selecting a station No. 0321, a station No. 0212 and a station No. 0315 as prediction objects, training the inbound/outbound data of the station from 15 hours to 21 hours in the period from 1 month 6 to 1 month 21 in 2014, predicting the inbound/outbound passenger flow in the same period on 22 days in 2014, and counting the training data once every 15 minutes. The experimental results are shown in FIGS. 2-7. Wherein, fig. 2 and fig. 3 are the predicted support vector regression model for station No. 0321 entering/exiting respectively; fig. 4 and 5 are schematic diagrams of predicted support vector regression models for station 0212 entering/exiting respectively; fig. 6 and 7 show the model of the predictive support vector regression for station number 0315.
Wherein the abscissa is a scale every 5 minutes during the time period, each of the projections represents the traffic situation during business hours of the day, and the last projection is the predicted data.
Peak prediction:
early peak prediction: selecting the platform No. 0613 and the platform No. 0210 as prediction objects, training the arrival data of the station from 6 hours to 30 hours to 9 hours every day from 1 month and 6 days to 1 month and 21 days in 2014, predicting the arrival passenger flow of the station in the same period of 22 days in 2014, and counting the training data once every 15 minutes. The experimental results are shown in fig. 8 and 9, where fig. 8 is the model of the early peak prediction support vector regression for station 0613 and fig. 9 is the model of the early peak prediction support vector regression for station 0210. Wherein the abscissa is a scale every 20 minutes during the time period.
And (3) late peak prediction: selecting the platform No. 0613 and the platform No. 0210 as prediction objects, training the arrival data of the station from 17 hours to 20 hours every day from 1 month 6 days to 1 month 21 days in 2014, predicting the arrival passenger flow in the same period of 22 days in 2014, and counting the training data once every 15 minutes. The experimental results are shown in fig. 10 and fig. 11, where fig. 10 is a model of model 0613 for model 0210 for model.
Wherein the abscissa is a scale every 20 minutes during the time period.
Although the illustrative embodiments of the present invention have been described above to enable those skilled in the art to understand the present invention, the present invention is not limited to the scope of the embodiments, and it is apparent to those skilled in the art that all the inventive concepts using the present invention are protected as long as they can be changed within the spirit and scope of the present invention as defined and defined by the appended claims.
Claims (8)
1. A rail transit real-time passenger flow prediction method is characterized by comprising the following steps: the method comprises the following steps:
(1) acquiring n pieces of historical data from an automatic fare collection system as original samples, and preprocessing the original samples to obtain preprocessed samples, wherein the preprocessed samples comprise time sequence vectors X and passenger flow vectors Y corresponding to the time sequence vectors X;
(2) establishing a short-time passenger flow prediction model based on a support vector regression machine according to the preprocessed samples in the step (1) and a kernel function and a fitting regression function, wherein the kernel function is a radial basis function;
(3) taking the time series vector X and the passenger flow vector Y corresponding to the time series vector X as input fitting prediction functions, inputting the input fitting prediction functions into the short-time passenger flow prediction model based on the support vector regression in the step (2), and predicting the passenger flow vector Yn+1;
Wherein X ═ t1,t2,...,tn},Y={y1,y2,...,yn}。
2. The rail transit real-time passenger flow prediction method according to claim 1, characterized in that: the establishing of the short-time passenger flow prediction model based on the support vector regression comprises the following steps:
(A) using the preprocessed samples as a given training set T { (x)1,y1),…,(xn,yn)}∈(R×y)n;
(B) According to the kernel function K (x, x'), the calculation precision is greater than 0 and the penalty function C is greater than 0, a convex quadratic programming target function is constructed and solved;
<mfenced open = "" close = ""> <mtable> <mtr> <mtd> <mrow> <mi>W</mi> <mrow> <mo>(</mo> <mrow> <mi>a</mi> <mo>,</mo> <msup> <mi>a</mi> <mo>*</mo> </msup> </mrow> <mo>)</mo> </mrow> <mo>=</mo> <mo>-</mo> <mfrac> <mn>1</mn> <mn>2</mn> </mfrac> <munderover> <mi>&Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> <mo>,</mo> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <mrow> <mo>(</mo> <mrow> <msub> <mi>a</mi> <mi>i</mi> </msub> <mo>-</mo> <msubsup> <mi>a</mi> <mi>i</mi> <mo>*</mo> </msubsup> </mrow> <mo>)</mo> </mrow> <mrow> <mo>(</mo> <mrow> <msub> <mi>a</mi> <mi>j</mi> </msub> <mo>-</mo> <msubsup> <mi>a</mi> <mi>j</mi> <mo>*</mo> </msubsup> </mrow> <mo>)</mo> </mrow> <mo>&CenterDot;</mo> <mi>K</mi> <mrow> <mo>(</mo> <mrow> <mi>x</mi> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> </mrow> <mo>)</mo> </mrow> </mrow> </mtd> </mtr> <mtr> <mtd> <mrow> <mo>+</mo> <munderover> <mi>&Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> <mo>,</mo> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <mrow> <mo>(</mo> <mrow> <msub> <mi>a</mi> <mi>i</mi> </msub> <mo>-</mo> <msubsup> <mi>a</mi> <mi>i</mi> <mo>*</mo> </msubsup> </mrow> <mo>)</mo> </mrow> <msub> <mi>y</mi> <mi>i</mi> </msub> <mo>-</mo> <munderover> <mi>&Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> <mo>,</mo> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <mrow> <mo>(</mo> <mrow> <msub> <mi>a</mi> <mi>i</mi> </msub> <mo>+</mo> <msubsup> <mi>a</mi> <mi>i</mi> <mo>*</mo> </msubsup> </mrow> <mo>)</mo> </mrow> <mi>&epsiv;</mi> </mrow> </mtd> </mtr> </mtable> </mfenced>
K(xi,xj)=Φ(xi)·Φ(xj);
(C) calculating a decision function according to a quadratic programming target function and a nonlinear fitting function:
the nonlinear fitting function is:
<mfenced open = "" close = ""> <mtable> <mtr> <mtd> <mrow> <mi>f</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>=</mo> <mi>&omega;</mi> <mo>&CenterDot;</mo> <mi>&Phi;</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>+</mo> <mi>b</mi> </mrow> </mtd> </mtr> <mtr> <mtd> <mrow> <mo>=</mo> <munderover> <mo>&Sigma;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <mrow> <mo>(</mo> <msub> <mi>a</mi> <mi>i</mi> </msub> <mo>-</mo> <msubsup> <mi>a</mi> <mi>i</mi> <mo>*</mo> </msubsup> <mo>)</mo> </mrow> <mi>K</mi> <mo>(</mo> <mrow> <mi>x</mi> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> </mrow> <mo>)</mo> <mo>+</mo> <mi>b</mi> </mrow> </mtd> </mtr> </mtable> </mfenced>
the calculated decision function is:
<mrow> <mi>f</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>=</mo> <munderover> <mo>&Sigma;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <mrow> <mo>(</mo> <msubsup> <mover> <mi>&alpha;</mi> <mo>&OverBar;</mo> </mover> <mi>i</mi> <mo>*</mo> </msubsup> <mo>-</mo> <msub> <mover> <mi>&alpha;</mi> <mo>&OverBar;</mo> </mover> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mi>K</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mo>+</mo> <mover> <mi>b</mi> <mo>&OverBar;</mo> </mover> <mo>;</mo> </mrow>
(D) according to a decision function, taking n pieces of historical data as a training set, performing regression prediction on an n + 1-dimensional hyperplane, and calculating to obtain the short-time passenger flow prediction model as follows:
f(x+1)=β1f(x)+β2f(x-1)+...+βnf(x-n+1);
where K (x, x') is the kernel function, xi∈Rn,yi∈ y R, i 1 …, n, f (x +1) indicates the passenger flow at time x +1, βiRepresents a weight coefficient, i ═ 1, 2.
3. The rail transit real-time passenger flow prediction method according to claim 2, characterized in that: the radial basis function is a gaussian kernel function:
<mrow> <mi>k</mi> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <msub> <mi>x</mi> <mi>j</mi> </msub> <mo>)</mo> </mrow> <mo>=</mo> <mi>exp</mi> <mrow> <mo>(</mo> <mo>-</mo> <mfrac> <mrow> <mo>|</mo> <mo>|</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>-</mo> <msub> <mi>x</mi> <mi>j</mi> </msub> <mo>|</mo> <msup> <mo>|</mo> <mn>2</mn> </msup> </mrow> <mrow> <mn>2</mn> <msup> <mi>&sigma;</mi> <mn>2</mn> </msup> </mrow> </mfrac> <mo>)</mo> </mrow> <mo>.</mo> </mrow>
4. the rail transit real-time passenger flow prediction method according to claim 1, characterized in that: the passenger flow vector Yn+1Including short-term traffic prediction and peak prediction.
5. The rail transit real-time passenger flow prediction method according to claim 4, characterized in that: the peak prediction includes an early peak prediction and a late peak prediction.
6. The rail transit real-time passenger flow prediction method according to claim 1, characterized in that: the preprocessing comprises key information extraction, data arrangement and minute-level passenger flow volume entering and exiting statistics.
7. The rail transit real-time passenger flow prediction method of claim 6, characterized in that: the key information extraction comprises extracting parameters in the card swiping time, the platform and the traffic card type 3 as key information.
8. The rail transit real-time passenger flow prediction method of claim 6, characterized in that: the statistical frequency of the minute-level passenger flow volume is not less than 5 minutes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710387638.9A CN107180278A (en) | 2017-05-27 | 2017-05-27 | A kind of real-time passenger flow forecasting of track traffic |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710387638.9A CN107180278A (en) | 2017-05-27 | 2017-05-27 | A kind of real-time passenger flow forecasting of track traffic |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107180278A true CN107180278A (en) | 2017-09-19 |
Family
ID=59835094
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710387638.9A Pending CN107180278A (en) | 2017-05-27 | 2017-05-27 | A kind of real-time passenger flow forecasting of track traffic |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107180278A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109754115A (en) * | 2018-12-04 | 2019-05-14 | 东软集团股份有限公司 | Method, apparatus, storage medium and the electronic equipment of data prediction |
CN109993341A (en) * | 2018-09-29 | 2019-07-09 | 上海电科智能系统股份有限公司 | A kind of passenger flow forecast method based on radial basis function neural network |
CN110119845A (en) * | 2019-05-11 | 2019-08-13 | 北京京投亿雅捷交通科技有限公司 | A kind of application method of track traffic for passenger flow prediction |
CN110134088A (en) * | 2019-05-21 | 2019-08-16 | 浙江大学 | A kind of adaptive quality forecasting procedure based on increment support vector regression |
CN111414719A (en) * | 2020-04-28 | 2020-07-14 | 中南大学 | Method and device for extracting peripheral features of subway station and estimating traffic demand |
CN113255978A (en) * | 2021-05-14 | 2021-08-13 | 上海仪电物联技术股份有限公司 | Medium-and-long-term passenger flow prediction method and system based on rail transit |
CN113781693A (en) * | 2020-06-28 | 2021-12-10 | 朱俊达 | Block chain identity information authentication system based on big data |
CN114091579A (en) * | 2021-11-03 | 2022-02-25 | 深圳技术大学 | Urban rail transit passenger flow early warning system and method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102819768A (en) * | 2011-11-07 | 2012-12-12 | 金蝶软件(中国)有限公司 | Method and system for analyzing passenger flow data |
CN103150609A (en) * | 2013-02-18 | 2013-06-12 | 健雄职业技术学院 | Modeling method for short time traffic flow predicting model |
CN103310287A (en) * | 2013-07-02 | 2013-09-18 | 北京航空航天大学 | Rail transit passenger flow predicting method for predicting passenger travel probability and based on support vector machine (SVM) |
-
2017
- 2017-05-27 CN CN201710387638.9A patent/CN107180278A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102819768A (en) * | 2011-11-07 | 2012-12-12 | 金蝶软件(中国)有限公司 | Method and system for analyzing passenger flow data |
CN103150609A (en) * | 2013-02-18 | 2013-06-12 | 健雄职业技术学院 | Modeling method for short time traffic flow predicting model |
CN103310287A (en) * | 2013-07-02 | 2013-09-18 | 北京航空航天大学 | Rail transit passenger flow predicting method for predicting passenger travel probability and based on support vector machine (SVM) |
Non-Patent Citations (1)
Title |
---|
赵钰棠等: "基于支持向量机的地铁客流量预测", 《都市快轨交通》 * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109993341A (en) * | 2018-09-29 | 2019-07-09 | 上海电科智能系统股份有限公司 | A kind of passenger flow forecast method based on radial basis function neural network |
CN109754115A (en) * | 2018-12-04 | 2019-05-14 | 东软集团股份有限公司 | Method, apparatus, storage medium and the electronic equipment of data prediction |
CN109754115B (en) * | 2018-12-04 | 2021-03-26 | 东软集团股份有限公司 | Data prediction method and device, storage medium and electronic equipment |
CN110119845A (en) * | 2019-05-11 | 2019-08-13 | 北京京投亿雅捷交通科技有限公司 | A kind of application method of track traffic for passenger flow prediction |
CN110134088A (en) * | 2019-05-21 | 2019-08-16 | 浙江大学 | A kind of adaptive quality forecasting procedure based on increment support vector regression |
CN111414719A (en) * | 2020-04-28 | 2020-07-14 | 中南大学 | Method and device for extracting peripheral features of subway station and estimating traffic demand |
CN113781693A (en) * | 2020-06-28 | 2021-12-10 | 朱俊达 | Block chain identity information authentication system based on big data |
CN113781693B (en) * | 2020-06-28 | 2023-05-23 | 江苏人加信息科技有限公司 | Block chain identity information authentication system based on big data |
CN113255978A (en) * | 2021-05-14 | 2021-08-13 | 上海仪电物联技术股份有限公司 | Medium-and-long-term passenger flow prediction method and system based on rail transit |
CN114091579A (en) * | 2021-11-03 | 2022-02-25 | 深圳技术大学 | Urban rail transit passenger flow early warning system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107180278A (en) | A kind of real-time passenger flow forecasting of track traffic | |
Zhang et al. | Deep learning architecture for short-term passenger flow forecasting in urban rail transit | |
CN112700072B (en) | Traffic condition prediction method, electronic device, and storage medium | |
Zhang et al. | Group pooling for deep tourism demand forecasting | |
Bin et al. | Bus arrival time prediction using support vector machines | |
Dai et al. | Short‐Term Origin‐Destination Based Metro Flow Prediction with Probabilistic Model Selection Approach | |
CN110599336B (en) | Financial product purchase prediction method and system | |
CN107103394A (en) | A kind of real-time passenger flow forecasting of track traffic based on neutral net | |
Schrodt et al. | Data-based computational approaches to forecasting political violence | |
CN114626585B (en) | Urban rail transit short-time passenger flow prediction method based on generation countermeasure network | |
Shi et al. | Short-term metro passenger flow forecasting using ensemble-chaos support vector regression | |
CN113380043B (en) | Bus arrival time prediction method based on deep neural network calculation | |
CN110276677A (en) | Refund prediction technique, device, equipment and storage medium based on big data platform | |
Brahimi et al. | Modelling on Car‐Sharing Serial Prediction Based on Machine Learning and Deep Learning | |
Rao et al. | Flight Ticket Prediction using Random Forest Regressor Compared with Decision Tree Regressor | |
Ruiz-Aguilar et al. | A two-stage procedure for forecasting freight inspections at Border Inspection Posts using SOMs and support vector regression | |
Utku et al. | New deep learning-based passenger flow prediction model | |
Rasaizadi et al. | Short‐Term Prediction of Traffic State for a Rural Road Applying Ensemble Learning Process | |
Hu et al. | Self-supervised pre-training for robust and generic spatial-temporal representations | |
Rodríguez-Rueda et al. | Origin–Destination matrix estimation and prediction from socioeconomic variables using automatic feature selection procedure-based machine learning model | |
Vadlamani et al. | Using machine learning to analyze and predict entry patterns of low-cost airlines: a study of Southwest Airlines | |
Darapaneni et al. | Demand and revenue forecasting through machine learning | |
Chen et al. | Macro-level accident fatality prediction using a combined model based on ARIMA and multivariable linear regression | |
Tu et al. | Analysis and prediction of differential parking behaviors | |
JP7310459B2 (en) | Information processing method and information processing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170919 |
|
RJ01 | Rejection of invention patent application after publication |