CN115269758A - Passenger-guidance-oriented road network passenger flow state deduction method and system - Google Patents

Passenger-guidance-oriented road network passenger flow state deduction method and system Download PDF

Info

Publication number
CN115269758A
CN115269758A CN202210922005.4A CN202210922005A CN115269758A CN 115269758 A CN115269758 A CN 115269758A CN 202210922005 A CN202210922005 A CN 202210922005A CN 115269758 A CN115269758 A CN 115269758A
Authority
CN
China
Prior art keywords
passenger flow
time
passenger
data
station
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210922005.4A
Other languages
Chinese (zh)
Inventor
张迪
张开婷
崔闰虎
王欣
李强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Quantutong Position Network Co ltd
Original Assignee
Quantutong Position Network Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Quantutong Position Network Co ltd filed Critical Quantutong Position Network Co ltd
Priority to CN202210922005.4A priority Critical patent/CN115269758A/en
Publication of CN115269758A publication Critical patent/CN115269758A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Probability & Statistics with Applications (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Software Systems (AREA)
  • General Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Development Economics (AREA)
  • Fuzzy Systems (AREA)
  • Computational Linguistics (AREA)
  • Marketing (AREA)
  • Remote Sensing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Game Theory and Decision Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a road network passenger flow state deduction method and system oriented to passenger guidance, wherein the method comprises the following steps: acquiring historical passenger flow data of each station of urban rail transit; based on historical passenger flow volume data and weather data of each station, relevant factors influencing passenger flow are analyzed and determined by utilizing Pearson correlation coefficients; performing clustering analysis on historical passenger flow volume data of all the sites to obtain classification results of all the sites, and analyzing the passenger flow volume distribution conditions of different types of the sites; and (3) building a short-time passenger flow prediction model, collecting real-time passenger flow data of various stations, and inputting the real-time passenger flow data of various stations into the short-time passenger flow prediction model to obtain a short-time passenger flow prediction result. The invention improves the accuracy, real-time property and adaptability to emergency.

Description

Passenger-guidance-oriented road network passenger flow state deduction method and system
Technical Field
The invention relates to the technical field of rail transit, in particular to a road network passenger flow state deduction method and system oriented to passenger guidance.
Background
The existing urban rail transit passenger flow dynamic prediction method is based on the historical passenger flow OD data, and urban rail transit passenger flow prediction is carried out by using a probability tree, a traditional machine learning model and the like. The method comprises the specific implementation steps of obtaining historical passenger flow OD data of a traffic network, wherein the historical passenger flow OD data comprises historical time and time-period passenger flow OD data corresponding to the date of the historical time, and the historical time comprises the date and the time period; and performing cluster analysis on the time-interval passenger flow OD data, and determining a plurality of categories corresponding to the time-interval passenger flow OD data. And determining the category of the predicted passenger flow OD data corresponding to the time to be predicted, and determining the predicted passenger flow OD data of the time to be predicted according to the category.
Most of the existing urban rail transit passenger flow prediction methods use time series models, the models generally have certain assumptions and high model complexity, and the methods are difficult to apply to actual production practice and have the problems of low accuracy and low efficiency.
The existing urban rail transit passenger flow prediction method mainly depends on historical passenger flow OD data, the real-time performance of the data is weak, and the performance is poor when unpredictable situations such as emergency situations, short-time changes and the like are met.
The existing urban rail transit passenger flow prediction method does not carry out scientific classification on all stations of a road network, lacks personalized time and space analysis on key stations, and loses certain precision in actual production practice.
Disclosure of Invention
In view of this, the invention provides a passenger-oriented road network passenger flow state deduction method and system, which improve the accuracy, real-time performance and adaptability to emergency situations of prediction.
The invention provides a passenger-oriented road network passenger flow state deduction method, which comprises the following steps: acquiring historical passenger flow data of each station of urban rail transit; based on historical passenger flow volume data and weather data of each station, relevant factors influencing passenger flow are analyzed and determined by using Pearson correlation coefficients; performing clustering analysis on historical passenger flow data of each station to obtain classification results of various stations, and analyzing passenger flow distribution conditions of different types of stations; and (3) building a short-time passenger flow prediction model, collecting real-time passenger flow data of various stations, and inputting the real-time passenger flow data of various stations into the short-time passenger flow prediction model to obtain a short-time passenger flow prediction result.
Further, the passenger flow volume data at least comprises the station number of the passenger entering and exiting the station, the station name, the time of entering and exiting the station and the card type.
Further, the step of determining relevant factors influencing passenger flow by using Pearson correlation coefficient analysis comprises: dividing the historical passenger flow volume data of each station into passenger flow volume data of a plurality of time periods; calculating a corresponding Pearson correlation coefficient by using the passenger flow volume data and the weather data of each time period; comparing the passenger flow volume data of each time period with the Pearson correlation coefficient corresponding to the weather data, and determining related passenger flow volume factors and related weather factors which influence the passenger flow; and selecting daily passenger flow data of part of the sites in one month and average passenger flow data of different time periods of each day in one week for analysis, and determining two factors of whether the work day and the passenger flow peak period influence the passenger flow.
Further, the calculation formula of the Pearson correlation coefficient is as follows:
Figure BDA0003777965040000021
wherein, X i Indicating the amount of traffic in the ith time slot,
Figure BDA0003777965040000022
and S X Respectively representing the mean value and the variance of the passenger flow; y is i Indicates the corresponding influence factor value of the ith time period,
Figure BDA0003777965040000023
and S Y Respectively representing the mean and variance of the corresponding influence factors; and r is Pearson correlation coefficient.
Further, the relevant factors influencing the passenger flow include temperature, humidity, visibility, precipitation, cloud cover, previous time period passenger flow/number, whether working day and whether peak time period.
Further, the step of performing cluster analysis on the historical passenger flow volume data of each site includes: acquiring the category number of each site and a site passenger flow data set; solving the maximum value and the minimum value of passenger flows of all stations; taking a K +1 equant point between the maximum value and the minimum value as an initial clustering center; dividing all the sites into the clustering categories with the closest passenger flow; accumulating and summing the difference value between the passenger flow of each vector station in the cluster and the cluster center to serve as a convergence judgment condition of the algorithm; finding the mean value of each class of passenger flow as a new clustering center, and finishing iteration when the difference of clustering structures of two consecutive times is smaller than a set threshold value; and outputting a site clustering result, wherein the site clustering result comprises a high passenger flow site, a medium passenger flow site and a low passenger flow site.
Further, the step of analyzing the passenger flow distribution conditions of different types of sites includes: the distribution rules of the historical passenger flow data of the high passenger flow station, the medium passenger flow station and the low passenger flow station in time and space are analyzed respectively, and different passenger flow peak time periods of various stations are determined.
Further, the step of building a short-time passenger flow prediction model comprises the following steps: constructing an input layer, a hidden layer and an output layer of a short-time passenger flow prediction model; dividing historical passenger flow volume data of each station into a training set and a test set according to a certain proportion, collecting real-time passenger flow volume data of each station, and adding the real-time passenger flow volume data into the test set; training the short-time passenger flow prediction model by using a training set; and verifying the trained short-time passenger flow prediction model by using a test set.
Further, the step of inputting the real-time passenger flow volume data of each station into the short-time passenger flow prediction model to obtain the short-time passenger flow prediction result comprises the following steps: preprocessing the real-time passenger flow data of each station, and dividing the preprocessed real-time passenger flow data of each station by selecting different time granularities; and carrying out predictive analysis on the real-time passenger flow data in different time to obtain the passenger flow state of each station.
In a second aspect of the present invention, a passenger-oriented guidance-oriented road network passenger flow state deduction system includes: a memory for storing a computer program; a processor for implementing the steps of the passenger-oriented guidance road network passenger flow state deduction method when executing the computer program.
According to the passenger guidance oriented road network passenger flow state deduction method and system, pearson correlation coefficients are used for analysis, and whether factors such as weather conditions, historical passenger flow, whether the factors are in peak time periods or not, whether the factors are in working days and the like are determined as short-time passenger flow influence factors of rail transit; dividing actual rail transit stations into three types of stations of high, medium and low passenger flow by using an improved K-means clustering algorithm, analyzing the distribution rule of the passenger flow of each type of stations in time and space, and determining different passenger flow peak time periods of each type of stations; the urban rail transit short-time passenger flow prediction method based on the long-time memory neural network and the gate control circulation unit is adopted to carry out prediction analysis on the passenger flow of different passenger flow type stations in different time, so that the prediction accuracy, the real-time performance and the adaptability to emergency situations are improved.
Drawings
For purposes of illustration and not limitation, the present invention will now be described in accordance with its preferred embodiments, particularly with reference to the accompanying drawings, in which:
fig. 1 is a flowchart of a method for deducing passenger traffic state of a road network oriented to passenger guidance according to an embodiment of the present invention;
FIG. 2 is a flow chart of analyzing passenger flow characteristics based on an improved K-means clustering algorithm;
FIG. 3 is a schematic diagram of high, medium and low traffic stations corresponding to peak hours;
FIG. 4 is a schematic diagram of a short-time passenger flow prediction model;
fig. 5 is a schematic structural diagram of a passenger-oriented guidance road network passenger flow state deduction system according to another embodiment of the present invention.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, a detailed description of the present invention will be given below with reference to the accompanying drawings and specific embodiments. It should be noted that the embodiments of the present invention and features of the embodiments may be combined with each other without conflict.
In the following description, numerous specific details are set forth to provide a thorough understanding of the present invention, and the described embodiments are merely a subset of the embodiments of the present invention, rather than a complete embodiment. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention.
Fig. 1 is a flowchart of a method for deducing passenger-oriented road network traffic state according to an embodiment of the present invention. Specific implementations of the present application are described in detail below with reference to the accompanying drawings.
Referring to fig. 1, the passenger-oriented guidance-oriented road network passenger flow state deduction method includes the following steps:
s100, AFC data of each station of the urban rail transit in actual operation are obtained.
AFC data is historical passenger flow data of passengers entering and leaving a station in the whole network collected by an automatic fare collection system for urban rail transit, and the passenger flow data generally comprises attributes such as passenger station number, station name, time of entering and leaving the station, card type and the like. The data has the characteristics of high precision, strong quasi-real-time performance and the like in passenger flow records, and is often applied to the research on passenger travel behaviors, passenger flow distribution characteristics, flow characteristics and the like. The quasi-real-time performance of AFC data is mainly embodied in that after a passenger punches a card in a station to go in and out, the system uploads the time of the station to go in and out, the station to go in and out and the like according to a certain time interval, so that the AFC data is used as a data basis for carrying out passenger flow OD dynamic prediction, the prediction result is more accurate, and the real-time performance of passenger flow can be more embodied.
In this embodiment, the dynamic OD data in the AFC data is mainly acquired, where the dynamic OD data includes time and time-period passenger flow OD data corresponding to the time, and the time includes date and time period.
And S200, based on AFC data and weather data, determining relevant factors influencing passenger flow by utilizing Pearson correlation coefficient analysis.
In this embodiment, the Pearson correlation coefficient calculation formula is as follows:
Figure BDA0003777965040000051
wherein, X i Indicating the amount of traffic in the ith time period,
Figure BDA0003777965040000052
and S X Respectively representing the mean value and the variance of the passenger flow; y is i Indicates the corresponding influence factor value for the ith period,
Figure BDA0003777965040000053
and S Y Respectively, mean and variance of the respective influencing factors. The Pearson correlation coefficient r has a value range of [ -1,1]A smaller absolute value indicates a lower correlation of the influencing factor with the passenger flow volume. The positive and negative values of r represent different correlations, with positive values representing positive correlations and negative values representing negative correlations.
Weather has a great influence on urban rail transit passenger flow, and particularly, some passengers who go out for shopping and play can flexibly select travel modes and time according to weather conditions. In this embodiment, passenger flow influence analysis is mainly performed on five weather conditions, namely, temperature, humidity, visibility, precipitation and cloudiness, and the obtained Pearson correlation coefficients are all within a 99% confidence interval, as shown in table 1.
TABLE 1 Pearson correlation coefficient of correlation factors affecting passenger flow
Figure BDA0003777965040000054
Note: * Indicating significant correlation at the 0.01 level (99% confidence); significance (two-tailed) refers to the differential check value
Intuitively, rail transit passenger flow is influenced by whether the rail transit passenger flow works or not and whether the rail transit passenger flow is in a rush hour or not, and the rail transit passenger flow fluctuates. And selecting daily passenger flow data of part of the stations in one month and average passenger flow data of different time periods in one week for analysis, determining whether two factors, namely working days and whether the peak time of the passenger flow is influenced on the rail passenger flow, and using the two factors as influence factors for subsequent passenger flow prediction.
According to the analysis, factors influencing the rail transit passenger flow change have diversity, and part or all of 8 factors are used as parameter indexes for short-time passenger flow analysis and prediction according to actual conditions, and specific parameters are shown in table 2.
TABLE 2 relevant factors influencing passenger flow
Figure BDA0003777965040000061
As can be seen from table 2, relevant factors affecting the passenger flow include temperature, humidity, visibility, precipitation, cloudiness, previous-period passenger flow/number, whether the day of work is on, and whether the peak period is on.
In the embodiment, for AFC data of each station of the rail transit in actual operation, a Pearson correlation coefficient is used for analysis, and weather conditions, historical passenger flow volume, whether the time period is a peak time period, whether the time period is a working day and the like are determined as short-time passenger flow influence factors of the rail transit.
And S300, analyzing the passenger flow characteristics based on the improved K-means clustering algorithm.
Based on K-means algorithm improvement, a station clustering algorithm (SK-means) for rail transit is provided for clustering station passenger flow, the passenger flow classification standard of various Stations is determined, and the distribution condition of different types of passenger flow Stations is analyzed. Based on the general rule of passenger flow distribution of rail transit stations, the SK-means algorithm improves the selection of the initial clustering center. As the passenger flow of most stations of the rail transit in the same city is uniformly distributed between the maximum value and the minimum value of the passenger flow of the stations, the K +1 equal division point between the maximum value and the minimum value of the passenger flow is selected as an initial clustering central point by the algorithm. The stations are divided into three types of high, medium and low passenger flow.
Fig. 2 is a flowchart of a station clustering algorithm for rail transit. Referring to fig. 2, the station clustering algorithm for rail transit includes the following steps:
s301, acquiring the class number K of each site and a site passenger flow data set N;
s302, solving the maximum value and the minimum value of passenger flows of N stations, wherein the maximum value and the minimum value are as follows: max, min = getMaxAndMin (N);
s303, taking a K +1 bisector between the maximum value and the minimum value as an initial clustering center, wherein the initial clustering center is centlist = initcentlist (max, min, K + 1);
s304, dividing the N sites into the clustering classes with the closest passenger flow, wherein the closest clustering class is clusterDict = minDistance (N, centerList);
s305, accumulating and summing the difference value between the passenger flow of each vector station in the cluster and the cluster center, and taking the sum as a convergence judgment condition of the algorithm; wherein nVar = getVar (clusterDict, centerList);
s306, finding the mean value of each type of passenger flow to serve as a new clustering center, and ending iteration when the difference of the clustering structures of two consecutive times is less than 0.0001.
while abs(nVar-oVar)>=0.0001;
centerList=getcenterList(clusterDict);
clusterDict=minDistance(N,centerList);
oVar=nVar;
nVar=getVar(clusterDict,centerList).
And S307, outputting the clustering result clusterDict of the sites.
An example of the partitioning result is as follows:
Figure BDA0003777965040000071
s308, analyzing the distribution rule of the passenger flow of various sites in time and space, and determining different passenger flow peak time periods of various sites.
In the rail transit operation time period, the passenger flow of each station is influenced by the time of residents going to and from work and the time of going to and from school, and the regular change of the peak in the morning and at night is presented. And the morning and evening peak hours may be different for different types of sites in different cities. The corresponding peak periods are set based on a characterization of historical AFC data.
Taking the average station passenger flow volume of the weekdays (except the working day of rest) from 9 to 12 months in 2019 of a certain city and the days of monday (except the holidays) as examples, the passenger flow time distribution characteristics of high, medium and low passenger flow stations (respectively selecting 2 stations) are analyzed, as shown in fig. 3.
Setting the peak time period of working days of various stations as 7;
setting the non-working day peak time period of the high passenger flow rate station as 14;
and setting the non-working day peak time period of the medium passenger flow station as 17.
In this embodiment, a K-means clustering algorithm (K-means clustering, K-means) is improved to divide actual rail transit stations into three types of stations, namely high, medium and low passenger flow rates, and the distribution rules of the passenger flow rates of the various stations in time and space are analyzed to determine different passenger flow peak time periods of the various stations
S400, building a short-time passenger flow prediction model.
In this embodiment, an LSTM or GRU model is used as a basic processing unit, and a seven-layer Short-term Passenger Flow prediction model structure (spf) is built, and the model structure is shown in fig. 4.
As shown in fig. 4, the SPFF model includes an input layer, a hidden layer, and an output layer. The input layer and the output layer of the SPFF model are all connected layers. The input layer meets the input requirement of neurons in the hidden layer by carrying out primary processing on AFC sample data; the output layer maps a plurality of actual results of the hidden layer into expected results of the model through a full-connection network again. The hidden layer of the SPPF model comprises seven layers of neural network layers and four layers of Dropout layers which are mutually interpenetrated, wherein the Dropout layers can temporarily remove the neurons from the network according to a set probability, so that overfitting is effectively prevented; the neurons of the neural network layer are unified as LSTM or GRU. In fig. 4, the subscripts t-1 and t represent the last time and the current time, x and h represent the input and output data vectors, respectively, C represents the state vector of the hidden layer,
Figure BDA0003777965040000081
a candidate value vector created by a tanh function (as shown in equation 2) is represented, σ represents a sigmoid function (as shown in equation 3), and f, i, and o are calculation results of a plurality of σ functions (as shown in equation 4, where W is a weight matrix and b is a bias value).
The expression of the output of the SPFF model is:
Figure BDA0003777965040000091
Figure BDA0003777965040000092
Figure BDA0003777965040000093
the calculated relationship between other intermediate quantities and output quantities in the SPFF model is as follows:
Figure BDA0003777965040000094
in the SPFF model, the hyperbolic tangent function tanh is used as the activation function (as shown in equation 2), the Adma is used as the optimizer, and the Mean Square Error (MSE) is used as the loss function. Furthermore, there are 3 other main parameters in the SPFF model as follows:
(1) Time granularity (Tg)
The time granularity Tg refers to a time interval for collecting data, and in this embodiment refers to a time interval for extracting passenger flow volume data. For short-term traffic prediction, the time granularity is typically selected to be no more than 15 minutes. The amount of information contained in different time granularities can be different, the larger the time granularity is, the more the information amount is, the more accurate the prediction is, but the larger time interval can cause the slight change of the passenger flow to be not easy to be perceived. In order to research the optimal model for short-time passenger flow prediction, different time granularity analysis can be selected to compare prediction results.
(2) Time step (Ts)
The time step Ts refers to the sequence length of a single sample, i.e. the number of data periods contained in a single sample. Different Ts will influence the length of training time, and although the iteration speed is accelerated by over-small Ts, the problem of easy falling into local optimal solution also exists; too large Ts also loses prediction accuracy due to reduced correlation of data in the same sample. There are studies that yield better predictions for short-term traffic predictions of different granularity, with Ts values of 3 or 6.
(3) Iteration time (It)
The number of iterations It is related to the model convergence speed. As the number of iterations increases, the model gradually converges and the loss value of the model gradually decreases, but when the loss value decreases to a lower value, the loss value cannot decrease again with the iteration of the model. Therefore, the excessive number of iterations cannot make the model loss value always decrease, and the excessive training time is consumed.
In order to improve the prediction effect of the SPPF model on short-time passenger flow, prediction practice is respectively carried out on high, medium and low passenger flow stations, AFC passenger flow data of each station are correspondingly divided into a training set and a test set according to the proportion of 8. The time granularity Tg is set to be not less than 5 minutes, the time step Ts is set to be three types of 1/3/6, and the iteration number It is set to be 800.
S500, collecting real-time passenger flow data of various stations, inputting the real-time passenger flow data of each station into a short-time passenger flow prediction model to predict short-time passenger flow, and obtaining a short-time passenger flow prediction result.
The short-time passenger flow prediction model is used for predicting and analyzing the passenger flow of real-time passenger flow data of various stations in different time periods, and the short-time passenger flow prediction result generated by prediction is synchronously uploaded to a corresponding module server for urban rail transit station group passenger flow analysis, so that accurate road network passenger flow state is provided for personalized travel of passengers, and accurate travel time information is provided for dynamic path planning.
The passenger guidance oriented road network passenger flow state deduction method utilizes Pearson correlation coefficients to analyze and determine whether factors such as weather conditions, historical passenger flow, whether the factors are in peak time periods, whether the factors are in working days and the like are determined as short-time passenger flow influence factors of rail transit; dividing actual rail transit stations into three types of stations of high, medium and low passenger flow by using an improved K-means clustering algorithm, analyzing the distribution rule of the passenger flow of each type of stations in time and space, and determining different passenger flow peak time periods of each type of stations; two urban rail transit short-time passenger flow prediction methods respectively based on a long-time memory neural network and a gate control circulation unit are adopted to carry out prediction analysis on the passenger flow of different passenger flow type stations in different time. Compared with the traditional road network passenger flow prediction method based on historical OD data, the accuracy, the real-time performance and the adaptability to emergency situations are improved.
In correspondence to the above method embodiment, referring to fig. 5, fig. 5 is a schematic diagram of a passenger-oriented guidance road network passenger flow state deduction system according to another embodiment of the present invention, where the system 100 may include:
a memory 101 for storing a computer program;
the processor 102, when executing the computer program stored in the memory 101, may implement the following steps:
acquiring historical passenger flow data of each station of urban rail transit; based on historical passenger flow volume data and weather data of each station, relevant factors influencing passenger flow are analyzed and determined by using Pearson correlation coefficients; performing clustering analysis on historical passenger flow data of each station to obtain classification results of various stations, and analyzing passenger flow distribution conditions of different types of stations; and (3) building a short-time passenger flow prediction model, collecting real-time passenger flow data of various stations, and inputting the real-time passenger flow data of various stations into the short-time passenger flow prediction model to obtain a short-time passenger flow prediction result.
The above-described embodiments should not be construed as limiting the scope of the invention. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A passenger-oriented guidance-oriented road network passenger flow state deduction method is characterized by comprising the following steps:
acquiring historical passenger flow data of each station of urban rail transit;
based on historical passenger flow volume data and weather data of each station, relevant factors influencing passenger flow are analyzed and determined by using Pearson correlation coefficients;
performing clustering analysis on historical passenger flow data of each station to obtain classification results of various stations, and analyzing passenger flow distribution conditions of different types of stations;
and (3) building a short-time passenger flow prediction model, acquiring real-time passenger flow data of various stations, and inputting the real-time passenger flow data of the various stations into the short-time passenger flow prediction model to obtain a short-time passenger flow prediction result.
2. The passenger-oriented guidance-oriented road network passenger flow state deduction method as claimed in claim 1, wherein the passenger flow data at least comprises passenger station number, station name, station time and card type.
3. The passenger-oriented guidance road network passenger flow state deduction method according to claim 1, wherein the step of determining relevant factors influencing the passenger flow by using Pearson correlation coefficient analysis comprises:
dividing historical passenger flow data of each station into passenger flow data of a plurality of time periods;
calculating a corresponding Pearson correlation coefficient by using the passenger flow volume data and the weather data of each time period;
comparing the passenger flow volume data of each time period with the Pearson correlation coefficient corresponding to the weather data, and determining related passenger flow volume factors and related weather factors which influence the passenger flow;
and selecting daily passenger flow data of part of the sites in one month and average passenger flow data of different time periods of each day in one week for analysis, and determining two factors of whether the work day and the passenger flow peak period influence the passenger flow.
4. The passenger-oriented guidance road network passenger flow state deduction method according to claim 3, wherein the Pearson correlation coefficient is calculated as follows:
Figure FDA0003777965030000011
wherein, X i Indicating the amount of traffic for the ith time period,
Figure FDA0003777965030000012
and S X Respectively representing the mean value and the variance of the passenger flow; y is i Indicates the corresponding influence factor value of the ith time period,
Figure FDA0003777965030000021
and S Y Respectively representing the mean and variance of the corresponding influence factors; r is Pearson correlation coefficient.
5. The passenger-oriented guidance-oriented road network passenger flow state deduction method as claimed in claim 3, wherein the relevant factors influencing the passenger flow include temperature, humidity, visibility, precipitation, cloud cover, previous period passenger flow/number, whether working day and whether peak time period.
6. The passenger-oriented guidance road network passenger flow state deduction method as claimed in claim 1, wherein the step of performing cluster analysis on the historical passenger flow data of each station comprises:
acquiring the category number of each site and a site passenger flow data set;
solving the maximum value and the minimum value of passenger flows of all stations;
taking a K +1 equant point between the maximum value and the minimum value as an initial clustering center;
dividing all the sites into the clustering categories with the closest passenger flow;
accumulating and summing the difference value between the passenger flow of each vector station in the cluster and the cluster center to serve as a convergence judgment condition of the algorithm;
finding the mean value of each class of passenger flow as a new clustering center, and finishing iteration when the difference of clustering structures of two consecutive times is smaller than a set threshold value;
and outputting a site clustering result, wherein the site clustering result comprises a high passenger flow site, a medium passenger flow site and a low passenger flow site.
7. The passenger-oriented guidance road network passenger flow state deduction method as claimed in claim 6, wherein the step of analyzing the passenger flow distribution of different types of stations comprises:
the distribution rules of the historical passenger flow data of the high passenger flow station, the medium passenger flow station and the low passenger flow station in time and space are analyzed respectively, and different passenger flow peak time periods of various stations are determined.
8. The passenger-oriented guidance road network passenger flow state deduction method according to claim 1, wherein the step of building a short-time passenger flow prediction model comprises:
constructing an input layer, a hidden layer and an output layer of a short-time passenger flow prediction model;
dividing historical passenger flow volume data of each station into a training set and a test set according to a certain proportion, collecting real-time passenger flow volume data of each station, and adding the real-time passenger flow volume data into the test set;
training a short-time passenger flow prediction model by using a training set;
and verifying the trained short-time passenger flow prediction model by using a test set.
9. The passenger-oriented guidance road network passenger flow state deduction method as claimed in claim 1, wherein the step of inputting the real-time passenger flow data of each station into the short-time passenger flow prediction model to obtain the short-time passenger flow prediction result comprises:
the real-time passenger flow data of each station is preprocessed,
selecting real-time passenger flow data of each station after different time granularity division preprocessing;
and performing predictive analysis on the real-time passenger flow data in different time to obtain the passenger flow state of each station.
10. A passenger-oriented guidance-oriented road network passenger flow state deduction system, comprising:
a memory for storing a computer program;
processor for implementing the steps of the passenger oriented guidance road network traffic state deduction method according to any of claims 1 to 9 when executing said computer program.
CN202210922005.4A 2022-08-02 2022-08-02 Passenger-guidance-oriented road network passenger flow state deduction method and system Pending CN115269758A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210922005.4A CN115269758A (en) 2022-08-02 2022-08-02 Passenger-guidance-oriented road network passenger flow state deduction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210922005.4A CN115269758A (en) 2022-08-02 2022-08-02 Passenger-guidance-oriented road network passenger flow state deduction method and system

Publications (1)

Publication Number Publication Date
CN115269758A true CN115269758A (en) 2022-11-01

Family

ID=83747535

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210922005.4A Pending CN115269758A (en) 2022-08-02 2022-08-02 Passenger-guidance-oriented road network passenger flow state deduction method and system

Country Status (1)

Country Link
CN (1) CN115269758A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116778739A (en) * 2023-06-20 2023-09-19 深圳市中车智联科技有限公司 Public transportation scheduling method and system based on demand response
CN117408393A (en) * 2023-12-06 2024-01-16 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116778739A (en) * 2023-06-20 2023-09-19 深圳市中车智联科技有限公司 Public transportation scheduling method and system based on demand response
CN117408393A (en) * 2023-12-06 2024-01-16 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event
CN117408393B (en) * 2023-12-06 2024-03-19 华中科技大学 Prediction method and system for comprehensive passenger transportation hub traffic flow under abnormal event

Similar Documents

Publication Publication Date Title
CN110570651B (en) Road network traffic situation prediction method and system based on deep learning
WO2023056696A1 (en) Urban rail transit short-term passenger flow forecasting method based on recurrent neural network
CN107610464A (en) A kind of trajectory predictions method based on Gaussian Mixture time series models
Wang Predicting tourism demand using fuzzy time series and hybrid grey theory
CN115269758A (en) Passenger-guidance-oriented road network passenger flow state deduction method and system
CN113096388B (en) Short-term traffic flow prediction method based on gradient lifting decision tree
CN107978153B (en) A kind of multimode traffic factors influencing demand analysis method based on space vector autoregression model
CN113159364A (en) Passenger flow prediction method and system for large-scale traffic station
CN110390349A (en) Bus passenger flow volume based on XGBoost model predicts modeling method
CN111860989B (en) LSTM neural network short-time traffic flow prediction method based on ant colony optimization
CN111582559B (en) Arrival time estimation method and device
CN109376906B (en) Travel time prediction method and system based on multi-dimensional trajectory and electronic equipment
CN112183605B (en) Civil aviation control sector classification method based on operation characteristics
CN116681176B (en) Traffic flow prediction method based on clustering and heterogeneous graph neural network
CN113380071B (en) Parking space level parking demand state evaluation method for large complex
CN112884222B (en) Time-period-oriented LSTM traffic flow density prediction method
CN113205223A (en) Electric quantity prediction system and prediction method thereof
CN112149922A (en) Method for predicting severity of accident in exit and entrance area of down-link of highway tunnel
Zekić-Sušac et al. Cluster analysis and artificial neural networks in predicting energy efficiency of public buildings as a cost-saving approach
CN113674524A (en) LSTM-GASVR-based multi-scale short-time traffic flow prediction modeling and prediction method and system
CN116187835A (en) Data-driven-based method and system for estimating theoretical line loss interval of transformer area
CN110490365B (en) Method for predicting network car booking order quantity based on multi-source data fusion
CN113553350B (en) Traffic flow partition model for similar evolution mode clustering and dynamic time zone division
CN117912235B (en) Planning data processing method and system for smart city
CN114117903B (en) Short-time passenger flow prediction method for rail transit based on bp neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination