CN110322064B

CN110322064B - Urban trip demand prediction method

Info

Publication number: CN110322064B
Application number: CN201910570152.8A
Authority: CN
Inventors: 杨铮; 褚菁; 王需; 赵毅
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2019-06-27
Filing date: 2019-06-27
Publication date: 2022-04-15
Anticipated expiration: 2039-06-27
Also published as: CN110322064A

Abstract

The invention provides a method for forecasting urban travel demand, which comprises the following steps: for any region in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the region in the previous time period of the current time period; inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period; the graph convolution neural network model is obtained after training based on training sample data and a predetermined travel demand prediction result. The method and the system can accurately predict the urban travel demand.

Description

Urban trip demand prediction method

Technical Field

The invention belongs to the technical field of intelligent transportation, and particularly relates to a city travel demand prediction method.

Background

With the popularization of the mobile internet and the continuous improvement of the urbanization development level, China has made a leap-type development in the shared economy field in recent years. The sharing economy integrates originally dispersed goods or services through the internet platform and provides the users with more flexible market prices, thereby realizing best-of-things and demand allocation.

At present, China has achieved outstanding achievements in the field of shared economy, particularly in the field of shared travel. However, the problem of difficulty in taxi taking is still severe, taxi taking efficiency and taxi taking response rate are reduced, and taxi taking difficulty is increased more obviously in a main people-intensive area in a city. In extreme weather such as heavy rain and high temperature, the response time of taxi taking is prolonged. In order to continuously deal with the problem of difficulty in taxi taking, the existing large data platform resources need to be fully utilized, vehicle resources need to be finely scheduled, and intelligent traffic supply is realized.

Meanwhile, with the rapid development of wireless communication and sensor technologies, wireless mobile terminal devices are spreading explosively. The huge user group provides possibility for researching refined intelligent travel: the method comprises the steps that rich user client position information can be provided through user connection base station information, and population density distribution and travel change rules of different areas can be obtained in time through a big data technology; and secondly, when the user uses the internet trip application, the used data flow can completely record the taxi taking behavior information of the user, so that almost complete trip demand information of the user can be obtained. The position distribution information of the user and the taxi taking behavior information are combined, and a user travel demand model with finer granularity and higher precision can be established.

Although methods for analyzing and predicting urban travel needs exist in the prior art, the studies all face the problems of sparse data coverage and the inability to effectively model spatial relationships between areas. Aiming at the research of data coverage sparseness and based on the data of the traditional taxi company in the early stage, the position of a user for getting on or off the taxi is mainly extracted from GPS track data, so that the taxi taking requirement of the user is analyzed. However, the amount of taxi taking demands inferred from historical GPS data may deviate from the actual taxi taking demands of the user. For the effective modeling of the spatial relationship between the areas, the prior art includes a prediction model based on autoregressive integrated moving average (ARIMA), a prediction model based on long and short term memory network (LSTM), a prediction model based on spatial autocorrelation and the like, but the above schemes either ignore the influence of spatial factors or use an approximate estimation model to model the travel demand, so that the models cannot well reflect the long-distance spatial influence caused by the movement of the user, and thus the high-precision urban travel demand prediction cannot be realized.

Disclosure of Invention

In order to overcome the problem of low prediction accuracy of the existing urban travel demand prediction method or at least partially solve the problem, embodiments of the present invention provide an urban travel demand prediction method.

According to a first aspect of the embodiments of the present invention, there is provided a method for predicting urban travel demand, including:

for any region in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the region in the previous time period of the current time period;

inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period;

the graph convolution neural network model is obtained after training based on training sample data and a predetermined travel demand prediction result.

According to a second aspect of the embodiments of the present invention, there is also provided an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor calls the program instructions to be able to execute the method for predicting urban travel demand provided in any one of the various possible implementations of the first aspect.

The embodiment of the invention provides a city travel demand prediction method, which is used for predicting travel demands by utilizing a graph convolution neural network model established based on a region space dependency relationship according to the time correlation of the travel demands of various regions and the correlation of region crowd flow and the travel demands, and fully considering the influence of the flow characteristics and the historical data of the occurrence demand characteristics of the region crowd on the region travel demands, thereby accurately predicting the city travel demands.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.

Fig. 1 is a schematic overall flow chart of a method for predicting urban travel demand according to an embodiment of the present invention;

fig. 2 is a schematic diagram of the degree of cross-correlation between the population outflow and the travel demand characteristics of each area in the urban travel demand prediction method provided by the embodiment of the present invention;

fig. 3 is a schematic structural diagram of a graph convolution neural network in the urban travel demand prediction method according to the embodiment of the present invention;

fig. 4 is a schematic view of an overall structure of an electronic device according to an embodiment of the present invention.

Detailed Description

Fig. 1 is a schematic overall flow chart of a method for predicting urban travel demand according to an embodiment of the present invention, where the method includes: s101, for any area in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the area in the previous time period of the current time period;

the city is further required to be divided into a plurality of areas before travel demand prediction is performed on the city, and the embodiment is not limited to the method for dividing the city area. The travel demand characteristics are characteristics reflecting the travel demands of the users, such as the number of the users needing to travel. The crowd flowing characteristics are characteristics representing crowd flowing, such as the population outflow volume, the population inflow volume and the like. And respectively acquiring the travel demand characteristics and the crowd flow characteristics of each region in the previous time period.

S102, inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period; the graph convolution neural network model is obtained after training based on training sample data and a predetermined travel demand prediction result.

The graph convolution neural network model is established in advance based on the spatial dependence relationship among all regions of a city, and the spatial dependence relationship among the regions is determined by the crowd flow quantity among the regions. And taking the trained graph convolution neural network model as a travel demand prediction model. And inputting the travel demand characteristics and the crowd flow characteristics of the previous time period of each region into a pre-trained graph convolution neural network model, and acquiring the output travel demand prediction result of each region of the city.

According to the embodiment, the travel demand is predicted by utilizing the graph convolution neural network model established based on the regional space dependency relationship according to the time correlation of the travel demands of each region and the correlation of regional crowd flow and the travel demands, the influence of the flow characteristics of the regional crowd and the historical data with the occurrence demand characteristics on the regional travel demands is fully considered, and therefore the urban travel demands are predicted accurately.

On the basis of the foregoing embodiment, in this embodiment, the step of acquiring the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further includes: acquiring the training sample data, and constructing a directed graph according to the training sample data; constructing a graph convolution neural network model according to the directed graph; inputting the training sample data into the graph convolution neural network model, and training the graph convolution neural network model.

Specifically, before the travel demand of each region of the city is predicted by using a trained graph convolution neural network model, training sample data is obtained, and a directed graph is constructed according to the training sample data. Directed graph D is an ordered triple (V (D), A (D), φ (D)), where φ (D) is a correlation function that makes each element in A (D), called a directed edge or arc, correspond to an ordered pair of elements in V (D), called a vertex or point. In the embodiment of the invention, the spatial dependency relationship of the areas is expressed as a directed graph, elements in the directed graph are areas, and edges connecting different elements express the population outflow volume of the areas and the people flow moving direction of the areas.

And constructing a graph convolution neural network model based on the directed graph. The training sample data comprises input quantity of the graph convolution neural network model and a travel demand prediction result corresponding to the input quantity. The input quantity is node travel demand characteristic samples and crowd flow characteristic samples of multiple time intervals in each region of the city. And inputting training sample data into the graph convolution neural network model for training, and taking the trained graph convolution neural network model as a city trip demand prediction model. During training, sample data is divided into two parts, namely training sample data and test sample data, and training is performed by adopting a cross validation method.

On the basis of the above embodiment, the training sample data in this embodiment includes a crowd flow feature sample and a travel demand feature sample; correspondingly, the step of acquiring the training sample data specifically includes: corresponding to any one of the areas, acquiring mobile network data of the area; the mobile network data of the area comprises a user client set and a flow record, wherein each base station in the area is accessed in a preset time period immediately before the current time period; dividing the preset time period into a plurality of sub time periods, acquiring the position of a user client in each sub time period according to the mobile network data generated in each sub time period, and acquiring the crowd flow characteristic sample of each area in each sub time period according to the position of the user client in each sub time period; and performing network packet capturing on internet trip software in each sub-time period, performing protocol analysis on data packet contents acquired by the network packet capturing, and acquiring trip demand characteristic samples of each area in each sub-time period according to a protocol analysis result and the user client position corresponding to the protocol analysis result.

Specifically, in order to utilize large-scale mobile network data and extract relevant data such as urban trip demands and the like so as to perform effective modeling, protocol analysis of various internet trip software is realized by describing and analyzing the urban trip demands and protocol reverse engineering based on network tracks, so that the description analysis of trip demand data is realized in the mobile network big data. For any area, mobile network data is obtained from data packets sent by user clients in the area to the base station. The mobile network data in the area includes a user client set accessed by each base station in a preset time period immediately before the current time period t and a traffic record, such as a user client ID, a base station ID, a URL (uniform Resource Locator) and a URI (uniform Resource Identifier) of the accessed content. A data packet sent to a base station by a user client connected with a cellular network is recorded based on a cellular network monitoring system, the data packet is stored in a database in a mode of multi-element group (user ID, equipment type, uplink flow, downlink flow, APP type, URL and URI), and incomplete data records in the database are roamed to local records to be cleaned. And estimating the position of the user client through the ID of the base station accessed by the user client, and acquiring the crowd flow sample characteristics of each area in each time period according to the position of the user client in each sub-time period. The method comprises the steps of carrying out network packet capturing on various internet trip software, carrying out protocol analysis on data packet contents obtained by the network packet capturing, matching URL and URI in the data packets with packet capturing analysis results, and obtaining trip demand characteristic samples of each region in each sub-time period. When training is carried out, the travel demand prediction result of the t time period is predicted by using training sample data of k time periods before the t time period. Fig. 2 shows a strong correlation between the regional travel demand and the regional population outflow, which indicates that the regional population flow characteristics can better reflect the regional potential travel demand, and meanwhile, the regional population outflow and the historical data of the travel demand characteristics are input into the convolutional neural network, and the two parts of characteristics are fused for learning, so that the trained convolutional neural network is more accurate.

On the basis of the foregoing embodiment, in this embodiment, the step of obtaining the crowd flow characteristic sample of each area in each sub-time period according to the user client position in each sub-time period specifically includes: for any one of the sub-time periods and any one of the areas, counting the number of users whose user client positions are in the area in the sub-time period before the sub-time period and whose user client positions are not in the area in the sub-time period, and taking the user data as the crowd flow characteristic sample of the area in the sub-time period.

Specifically, the travel demand characteristic of any region c in any time period t is specifically

Wherein,

the trip demand characteristics of the region c in the time period t, namely the total number of the user clients initiating the taxi taking request in the region c in the time period t, A_T(c) The user client initiating the taxi taking request in the area c for the period t, u represents the user client initiating the taxi taking request,

and initiating the number of taxi taking requests for the user client u. The sample of the flow characteristics of the population in any region c at any time t is specifically

Wherein

Samples of flow characteristics of the population at any time t representing any region c, B_T(c) Is in the t-1 time period region c, and the t time period is in the user client set of other regions. Correspondingly, the travel demand characteristic sample in the input quantity in the training sample is specifically

The crowd flow characteristic sample in the input quantity is specifically

Input quantityThe corresponding travel demand prediction result is specifically

Where d is the dependent edge of the region c vector.

On the basis of the foregoing embodiment, the step of constructing a directed graph according to the training sample data in this embodiment specifically includes: taking each region as a vertex of the directed graph; for any two of the regions, acquiring the population outflow from one of the two regions to the other of the two regions according to the training sample data; taking the direction from the one region to the other region as the direction of the edge between the vertex of the one region and the vertex of the other region; and taking the population outflow from the one area to the other area as the weight of the edge between the vertex of the one area and the vertex of the other area.

Specifically, the present embodiment creates a directed graph according to the spatial dependency relationship among the regions of the city. The directed graph includes a set of regions, a set of edges, and a weight vector for the edges. The directed graph is represented by G ═ C, E, W) G ═ C, E, W; wherein C is a set of regions, E is a set of edges, and W is a weight vector of the edges; the set of edges E includes at least one dependent edge, the dependent edge includes population outflow and an outflow moving direction of the area, and the weight of the dependent edge is the population outflow corresponding to the dependent edge. When in specific drawing, if the user moves from the area a to the area b, an edge is created between the two areas, and the direction can be noted on the edge, namely the direction of the outgoing flow movement of the area. By creating a directed graph between regions, the spatial dependency can be effectively reflected. In addition, the adjacency may be either directly adjacent or indirectly adjacent separated by several regions. If user u is located in region c during the period t-1₁In the region c during the period t₂Then, it represents c₁And c₂Storing a dependent edge e_1,2，e_1,2Weight w of_1,2Number vector defined as the class of users

Considering the existence of data noise, and the fact that more edges in the directed graph will greatly increase the complexity of the calculation, performance will be degraded. Therefore, the obtained set of edges may be filtered by using a method of setting a threshold, that is, a threshold is set, and only the spatial dependency relationship that the number of 0 in the weight vector is smaller than the threshold is retained.

The graph convolution neural network model is constructed based on a modeled urban area spatial dependence directed graph G, Fourier transformation is carried out on the directed graph G, and a Laplace matrix L of the directed graph G is defined as:

wherein D represents a diagonal matrix and D_ii＝∑_jW_ij，I_nIs an identity matrix, and L is a real symmetric positive semi-definite matrix and contains a complete set of orthogonal feature vectors

I.e., the fourier pattern of the directed graph, and the set of non-negative eigenvalues

I.e. the frequency of the directed graph. Eigenvalues of the directed graph G

May be expressed as g_θ(Λ) ═ diag (θ), where θ ∈ RⁿIs a fourier coefficient vector learned from a neural network. The convolution operation for the graph feature x is defined as

y＝g_θ(L)x＝g_θ(UΛU^T)x＝Ug_θ(Λ)U^Tx

Wherein U ═ U₁,…,u_n]∈R^n×nIs a matrix of eigenvectors, Λ ═ diag ([ λ [ ])₀,…,λ_n-1])∈R^n×nIs the eigenvalue diagonal matrix of the laplacian matrix L of the normalized graph. For larger graphs, the eigenvalue decomposition computation of L is large, and to solve this problem, a polynomial filter g is used_θ(Λ):

Wherein the parameter theta is equal to R^KIs a polynomial coefficient matrix and K is a hyperparameter. Due to d_G(i,j)>K, indicates (L)_K)_i,jIs 0, wherein d_GIs the shortest path distance. During training, a graph convolution neural network with Laplace K-order polynomial as a spectrum filter is used to realize modeling learning of spatial correlation.

Furthermore, residual connectivity is introduced in the graph convolution neural network. Residual learning, namely, in order to model the spatial dependence of a large-scale city and increase the depth of the network to realize a better prediction effect, an identical mapping layer with y being equal to x is superposed on the basis of a shallow network, so that the network is not degraded along with the increase of the depth; a residual unit containing identity mapping is defined as:

X^(l+1)＝X^(l)+F(X^(l))

wherein X^(l)And X^(l+1)Which respectively represent the input and output of the l-th layer residual unit, F is a residual method.

In the embodiment, the travel demand prediction model is established based on the graph convolution neural network and residual error learning, and the influence of time-space correlation, people flow characteristics and the like on travel is fully considered, so that the travel demand is accurately predicted.

On the basis of the foregoing embodiments, in this embodiment, the step of acquiring the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further includes: acquiring travel demand characteristics and crowd flow characteristics of the region in a daily periodic time period before the current period, and inputting the travel demand characteristics and the crowd flow characteristics in the daily periodic time period into a graph convolution neural network model; correspondingly, the step of inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model and outputting the travel demand prediction result of the area in the current time period specifically comprises the following steps: inputting the travel demand characteristics and the crowd flow characteristics into the graph convolution neural network model to obtain first characteristics output by the last convolution layer of the graph convolution neural network model; inputting the travel demand characteristics and the crowd flow characteristics in the daily periodic time period into the graph convolutional neural network model to obtain second characteristics output by the last convolutional layer; and fusing the first characteristic and the second characteristic, and acquiring the travel demand prediction result according to a fusion result.

The daily cycle time period refers to the same time period of each day in preset days before the current time period. The prediction of the travel demands of different regions of the city can be influenced by time correlation and space correlation at the same time, the travel demands of the city often show obvious periodicity, the daily periodic travel demand characteristics and the crowd flow characteristics are extracted for modeling and re-fusion, and the prediction method can be used for predicting the travel demands of different time dimensions, and is shown in fig. 3.

On the basis of the foregoing embodiments, in this embodiment, the step of acquiring the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further includes: acquiring cross-domain information characteristics of the region in a time period before the current time period; wherein the cross-domain information features include one or more of weather, holidays, and festivals; correspondingly, the step of inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model and outputting the travel demand prediction result of the area in the current time period specifically comprises the following steps: and inputting the travel demand characteristics, the crowd flow characteristics and the cross-domain information characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period.

Specifically, cross-domain data are fused for more accurately predicting the travel demand of each region in a future period of time. Multisource information data such as holidays, holidays and weather conditions have certain influence on the travel demands of different types of areas, the taxi taking demand suddenly increases in general holidays, the taxi taking demand of an office area can be obviously reduced on weekends, and the taxi taking demand is obviously increased in rainy and snowy weather. In order to integrate the cross-domain data information in deep learning, and simultaneously considering the flexibility of a model, a two-layer full-connection network is used for processing external data. The first layer can be considered an embedding layer, and the second layer converts the dimension of external data output to be the same as the dimension of the network output modeled by the spatio-temporal sequence. And then performing output fusion, putting the fusion into a new full-connection network to learn again, and outputting a final result, as shown in fig. 3.

According to the embodiment, the trip requirements of users in different regions of the city at different time periods are accurately predicted by using the large-scale mobile network data, the factors such as space, time, weather and events and the relevance among the factors according to the spatial difference and the dependency of the trip requirements among the regions in the large-scale city.

On the basis of the foregoing embodiments, in this embodiment, before the step of acquiring the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period, the method further includes: dividing the city into a plurality of parts according to the road network of the city; and clustering all parts of the city according to the population outflow of each part of the city, and taking the clustering result as the region division result of the city.

The road network is a road system formed by interconnecting and interlacing various roads in a certain area. Due to the spatial dynamics of urban population traveling, in order to carry out reasonable area division, the influence of road networks and urban pedestrian flow characteristics is fully considered, and urban irregular areas are divided. The present embodiment first performs fine-grained division on cities according to a road network. The divided areas are too small only by utilizing the road network, and in order to increase the size of the divided areas, each area contains a base station, so that the division of the areas is meaningful, and the fine-grained parts of the road network division are clustered by further utilizing the crowd flow characteristics. The crowd flowing characteristics of the region can well reflect the functionality of the region and the cityThe attractiveness of the population. The population flow characteristic of a region is the population outflow of the region, which is the number of people moving from a region to another region. For regional population outflow calculation, P is used_t(r_i) Representing the time period t, t + Δ t]Is located in the region r_iThe total number of people in the area r_iThe number of candidate people in this time period is C_t(r_i)＝P_t(r_i)\P_t+1(r_i) I.e. if a user client is not in the region for the next time period, it is considered that this time period is leaving the region r_i. However, considering that a user client leaves an area, it may leave the area after staying in the area for a long time, or it may just pass on the way and generate data information. In order to reduce the influence of the user on the analysis caused only in the way, the user client is set to count the user client into the outflow population of the area only when the user client stays in the area for more than a preset time, for example, leaves the area after 1 hour, so that the real outflow population is supposed to be selected from the candidate outflow populations.

On the basis of the above embodiment, in this embodiment, the clustering is spectral clustering; correspondingly, according to the population outflow of each part of the city, clustering all parts of the city, and taking the clustering result as the region division result of the city specifically comprises the following steps: corresponding to any two parts of the city, constructing a people stream feature sequence of one part according to the population outflow of one part of the two parts in a plurality of historical time periods, and constructing a people stream feature sequence of the other part according to the population outflow of the other part of the two parts in the plurality of historical time periods; calculating a similarity between the two portions of their respective human stream feature sequences based on minkowski distance; calculating an adjacency matrix of the spectral clustering according to the corresponding similarity of any two parts of the city; wherein, the weight values of two parts which are not adjacent in the adjacent matrix are 0, and the similarity corresponding to the two parts is in direct proportion to the weight values of the two parts; and acquiring a Laplacian matrix of the spectral cluster according to the adjacency matrix, standardizing the Laplacian matrix of the spectral cluster, clustering the standardized Laplacian matrix, and acquiring a region division result of the city.

In particular, for clustering of fine-grained portions, the similarity between the human-stream feature sequences of the portions is computed using Minkowski distances after normalizing the population outflow of the portions. The characteristic sequence of the stream of people like any part is an n-dimensional variable a (x)₁₁,x₁₂,…,x_1n) And b (x)₁₁,x₁₂,…,x_1n) The minkowski distance between is defined as:

wherein n is the number of the historical time periods, and p is a preset value, for example, p is 2. The embodiment clusters the fine-grained part by using a spectral clustering mode; firstly, a similarity matrix is used for calculating an adjacent matrix in a spectrum cluster, the two parts with small similarity have lower weight values and the parts with large similarity have high weight values, and meanwhile, the weight values exist between the adjacent parts. Since the adjacency matrix W requires an asymmetric matrix, for any r_iAnd r_jTwo-part adjacency matrix W_ij：

Wherein alpha is_ijIs the human-stream characteristic minkowski distance between two regions, which is the standard deviation of all sequences of minkowski distances. A degree matrix D and a Laplace matrix L are obtained through calculation of an adjacency matrix W, the Laplace matrix is normalized, and eigenvectors corresponding to eigenvalues of the normalized Laplace matrix are obtained. And clustering the Laplace matrix standardized by rows by using a traditional clustering mode to form a final urban area division result.

The embodiment provides an electronic device, and fig. 4 is a schematic view of an overall structure of the electronic device according to the embodiment of the present invention, where the electronic device includes: at least one processor 401, at least one memory 402, and a bus 403; wherein,

the processor 401 and the memory 402 communicate with each other via a bus 403;

the memory 402 stores program instructions executable by the processor 401, and the processor calls the program instructions to perform the methods provided by the above method embodiments, for example, the methods include: for any region in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the region in the previous time period of the current time period; inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period; the graph convolution neural network model is obtained after training based on training sample data and a predetermined travel demand prediction result.

The present embodiments provide a non-transitory computer-readable storage medium storing computer instructions that cause a computer to perform the methods provided by the above method embodiments, for example, including: for any region in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the region in the previous time period of the current time period; inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period; the graph convolution neural network model is obtained after training based on training sample data and a predetermined travel demand prediction result.

Those of ordinary skill in the art will understand that: all or part of the steps for implementing the method embodiments may be implemented by hardware related to program instructions, and the program may be stored in a computer readable storage medium, and when executed, the program performs the steps including the method embodiments; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.

The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.

Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A method for predicting urban travel demand is characterized by comprising the following steps:

for any region in a city, acquiring the travel demand characteristics and the crowd flow characteristics of the region in the previous time period of the current time period; the travel demand characteristics comprise the number of users to be traveled;

the graph convolution neural network model is obtained by training based on training sample data and a predetermined travel demand prediction result;

wherein, the step of obtaining the travel demand characteristics and the crowd flow characteristics of the area in the previous time quantum of the current time quantum further comprises the following steps:

acquiring the training sample data, and constructing a directed graph according to the training sample data;

constructing a graph convolution neural network model according to the directed graph;

inputting the training sample data into the graph convolution neural network model, and training the graph convolution neural network model;

the step of constructing the directed graph according to the training sample data specifically comprises the following steps:

taking each region as a vertex of the directed graph;

for any two of the regions, acquiring the population outflow from one of the two regions to the other of the two regions according to the training sample data;

taking the direction from the one region to the other region as the direction of the edge between the vertex of the one region and the vertex of the other region;

and taking the population outflow from the one area to the other area as the weight of the edge between the vertex of the one area and the vertex of the other area.

2. The urban travel demand forecasting method according to claim 1, wherein the training sample data comprises crowd flow feature samples and travel demand feature samples;

correspondingly, the step of acquiring the training sample data specifically includes:

corresponding to any one of the areas, acquiring mobile network data of the area; the mobile network data of the area comprises a user client set and a flow record, wherein each base station in the area is accessed in a preset time period immediately before the current time period;

dividing the preset time period into a plurality of sub time periods, acquiring the position of a user client in each sub time period according to the mobile network data generated in each sub time period, and acquiring the crowd flow characteristic sample of each area in each sub time period according to the position of the user client in each sub time period;

and performing network packet capturing on internet trip software in each sub-time period, performing protocol analysis on data packet contents acquired by the network packet capturing, and acquiring trip demand characteristic samples of each area in each sub-time period according to a protocol analysis result and the user client position corresponding to the protocol analysis result.

3. The urban travel demand prediction method according to claim 2, wherein the step of obtaining the crowd flow characteristic samples of each of the regions in each of the sub-time periods according to the user client position in each of the sub-time periods specifically comprises:

for any one of the sub-time periods and any one of the areas, counting the number of users whose user client positions are in the area in the sub-time period before the sub-time period and whose user client positions are not in the area in the sub-time period, and taking the number of users as the crowd flow characteristic sample of the area in the sub-time period.

4. The urban travel demand prediction method according to any one of claims 1 to 3, wherein the step of obtaining the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further comprises:

acquiring travel demand characteristics and crowd flow characteristics of the region in a daily periodic time period before the current time period, and inputting the travel demand characteristics and the crowd flow characteristics in the daily periodic time period into a graph convolution neural network model;

correspondingly, the step of inputting the travel demand characteristics and the crowd flow characteristics into a graph convolution neural network model and outputting the travel demand prediction result of the area in the current time period specifically comprises the following steps:

inputting the travel demand characteristics and the crowd flow characteristics into the graph convolution neural network model to obtain first characteristics output by the last convolution layer of the graph convolution neural network model;

inputting the travel demand characteristics and the crowd flow characteristics in the daily periodic time period into the graph convolutional neural network model to obtain second characteristics output by the last convolutional layer;

and fusing the first characteristic and the second characteristic, and acquiring the travel demand prediction result according to a fusion result.

5. The urban travel demand prediction method according to any one of claims 1 to 3, wherein the step of obtaining the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further comprises:

acquiring cross-domain information characteristics of the region in a time period before the current time period; wherein the cross-domain information features include one or more of weather, holidays, and festivals;

and inputting the travel demand characteristics, the crowd flow characteristics and the cross-domain information characteristics into a graph convolution neural network model, and outputting a travel demand prediction result of the region in the current time period.

6. The urban travel demand prediction method according to any one of claims 1 to 3, wherein the step of obtaining the travel demand characteristic and the crowd flow characteristic of the area in the time period before the current time period further comprises:

dividing the city into a plurality of parts according to the road network of the city;

and clustering all parts of the city according to the population outflow of each part of the city, and taking the clustering result as the region division result of the city.

7. The urban travel demand prediction method according to claim 6,

the clustering is spectral clustering;

correspondingly, according to the population outflow of each part of the city, clustering all parts of the city, and taking the clustering result as the region division result of the city specifically comprises the following steps:

corresponding to any two parts of the city, constructing a people stream feature sequence of one part according to the population outflow of one part of the two parts in a plurality of historical time periods, and constructing a people stream feature sequence of the other part according to the population outflow of the other part of the two parts in the plurality of historical time periods;

calculating a similarity between the two portions of their respective human stream feature sequences based on minkowski distance;

calculating an adjacency matrix of the spectral clustering according to the corresponding similarity of any two parts of the city; wherein, the weight values of two parts which are not adjacent in the adjacent matrix are 0, and the similarity corresponding to the two parts is in direct proportion to the weight values of the two parts;

and acquiring a Laplacian matrix of the spectral cluster according to the adjacency matrix, standardizing the Laplacian matrix of the spectral cluster, clustering the standardized Laplacian matrix, and acquiring a region division result of the city.

8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the steps of the method for predicting urban travel demand according to any one of claims 1 to 7 are implemented when the processor executes the program.