CN108399748B - Road travel time prediction method based on random forest and clustering algorithm - Google Patents

Road travel time prediction method based on random forest and clustering algorithm Download PDF

Info

Publication number
CN108399748B
CN108399748B CN201810190151.6A CN201810190151A CN108399748B CN 108399748 B CN108399748 B CN 108399748B CN 201810190151 A CN201810190151 A CN 201810190151A CN 108399748 B CN108399748 B CN 108399748B
Authority
CN
China
Prior art keywords
sample
prediction
value
random forest
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810190151.6A
Other languages
Chinese (zh)
Other versions
CN108399748A (en
Inventor
宋万超
周应华
程爱华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University of Post and Telecommunications
Original Assignee
Chongqing University of Post and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University of Post and Telecommunications filed Critical Chongqing University of Post and Telecommunications
Priority to CN201810190151.6A priority Critical patent/CN108399748B/en
Publication of CN108399748A publication Critical patent/CN108399748A/en
Application granted granted Critical
Publication of CN108399748B publication Critical patent/CN108399748B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • G08G1/0141Measuring and analyzing of parameters relative to traffic conditions for specific applications for traffic information dissemination
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • G08G1/0145Measuring and analyzing of parameters relative to traffic conditions for specific applications for active traffic flow control

Landscapes

  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a road travel time prediction method based on random forests and a clustering algorithm, which accurately predicts the travel time of each key road section in a certain time period by using a mixed prediction model of a density clustering algorithm (DBSCAN) and the Random Forests (RF) according to the time sequence rule of historical traffic data and by combining the attributes of roads, weather factors, holiday information and the states of upstream and downstream traffic flows of the roads. The prediction result can be used for prejudging the development trend of the traffic state, making a control scheme for a potentially congested road in advance, and can also be used for dynamic path induction, planning an optimal trip plan for travelers and assisting social intelligent trip. The prediction method improves the prediction precision of each tree in the random forest through density clustering, thereby improving the overall accuracy of prediction.

Description

Road travel time prediction method based on random forest and clustering algorithm
Technical Field
The invention belongs to the field of road travel time prediction, and particularly relates to a DBSCAN-RF hybrid prediction model for optimizing a random forest prediction result by using a density clustering algorithm.
Background
The travel time of the road is one of important indexes reflecting traffic states, is used as a basis for road traffic jam management and road network optimization integration, and is also an important content in intelligent traffic research. Accurate travel time prediction is an important basis of a modern traffic guidance system and an advanced traveler information system, and can provide decision support for traffic management departments and plan an optimal travel path for travelers. In the past, a time series prediction algorithm is often adopted when prediction analysis is carried out on the field, the algorithm is sensitive to data loss, only the time sequence rule of historical traffic data is considered, and other related factors are ignored; in the case of such a complex prediction scenario, it is far from being satisfied to consider only the timing law.
The method adopts an integrated prediction algorithm based on trees, namely random forests, combines the defects of a random forest algorithm, and provides a mixed prediction model combining a density clustering algorithm and the random forests to improve the accuracy of the prediction model.
Disclosure of Invention
The present invention is directed to solving the above problems of the prior art. The road travel time prediction method based on the random forest and clustering algorithm is provided, so that the algorithm has good convergence, and the prediction accuracy is improved. The technical scheme of the invention is as follows:
a road travel time prediction method based on random forests and a clustering algorithm comprises the following steps:
1) extracting a relevant feature set V influencing a prediction result from historical traffic data, weather data, a road network structure relation and road section self attributes according to the time-space characteristics of a road network, and constructing a sample data set D, wherein V is { V ═ V { (V }1,v2,v3,…,vnThe data structure of the sample data set D is: { v1,v2,v3,…,vn,y},v1As features, n is a feature dimension, and y is a prediction target;
2) carrying out data cleaning and missing value supplement on the sample data set D in the step 1);
3) and measuring the feature importance of the feature set V and the predicted target y by using a feature selection method commonly used by a random forest algorithm to obtain the correlation degree V 'between each feature in V and the predicted target, wherein V' ═ V { (V)1′,v2′,v3′,…,vn′},v1The larger the value of' the characteristic v is specified1The more relevant the predicted target, the more important this feature, the more important features are preferentially selected for constructing random forests;
4) constructing a density clustering and random forest mixed prediction model by using the sample data set D;
5) inputting sample data X related to the predicted road section into the density clustering and random forest mixed prediction model constructed in the step 4) to obtain a final prediction result.
Further, the step 3) of constructing a density clustering and random forest mixed prediction model specifically comprises the steps of:
initializing Generation of regression in random forestThe number m of trees T and the depth d of the trees; wherein T ═ { T ═ T1,t2,t3,…,tm},t1Representing the serial number of the regression tree, and m represents the generation of m regression trees;
secondly, generating m sample subsets D which are consistent with the number of the established regression tree into the sample data set D obtained in the step 2) by a Bootstrap method through a re-sampling mode with a back-put on the sample data set D, wherein the number of the m sample subsets D is equal to that of the { D }1,D2,…,Dm};
Respectively creating m regression tree models without pruning for the m sample subsets, adopting the minimum residual variance as a basis for selecting splitting attributes and splitting points, and after the regression tree is created, enabling leaf nodes of each regression tree to correspond to partial sample data;
fourthly, clustering sample data existing in leaf nodes of each regression tree in the third step by using a density clustering algorithm, forming a plurality of clusters C (C1, C2, C3, …, ck) with any shapes in the sample data set in the leaf nodes, and calculating the clustering center D of each cluster in the set C, wherein k is the number of the clustersc={d1,d2,d3,…,dkIn which d is1Representing the cluster center of the class cluster c 1.
Further, the step 5) of obtaining a final prediction result specifically includes the steps of:
firstly, traversing each regression tree in the random forest by using a sample X until a leaf node of each regression tree;
respectively calculating the distance between the sample X and the center of each class cluster in the leaf nodes by using an Euclidean distance formula, and finally returning the class cluster closest to the predicted sample;
calculating the average value of the target variable values of the samples in the most similar cluster returned in the step two to serve as the prediction result of the regression tree;
fourthly, obtaining the prediction result of each regression tree, and taking the average value as the final prediction result of the random forest:
Figure BDA0001591455280000031
wherein, yrfAs final predicted value of random forest, yt1Denotes the t-th1Prediction of regression trees.
Considering the weight of the sample attribute, calculating the distance between the sample X and each cluster center through the weighted Euclidean distance, wherein the cluster with smaller distance value is generally considered to be more similar to the predicted sample;
weighted euclidean distance formula:
Figure BDA0001591455280000032
wherein, X: prediction samples, d: cluster center of class cluster, n: sample dimension, V'i: and (4) feature correlation degree.
Further, the method includes the steps of respectively constructing m regression tree models without pruning for m sample subsets, and using the minimum residual variance as a basis for selecting splitting attributes and splitting points, and specifically includes the following steps: firstly, traversing each feature in a sample subset, and sequencing data under each feature from small to large; secondly, taking the sorted characteristic data as splitting nodes in sequence, enabling the data smaller than the splitting nodes to enter a left branch, enabling the data larger than or equal to the splitting nodes to enter a right branch, and dividing the original sample subset into two data sets; then, the S value before division and the S value of the left branch after division are respectively calculated by using a minimum residual variance formulaLValue, S of the right branchRValue, calculate S- (S)L+SR) S represents the minimum residual variance value, a characteristic value meeting the maximum difference value is selected as a splitting node, and a characteristic item where the splitting node is located is used as the current splitting characteristic; recursively performing the above partitioning operation, and checking whether a recursion termination condition is satisfied: the depth of the tree is larger than the initial value of d, if the depth of the tree is larger than the initial value of d, the growth of the tree is stopped, and if the depth of the tree is larger than the initial value of d, the regression tree is allowed to continue to grow.
Further, the minimum residual variance formula is:
Figure BDA0001591455280000033
where N is the number of samples in each sample subset, y is the prediction target,
Figure BDA0001591455280000034
is the mean of the predicted targets.
The invention has the following advantages and beneficial effects:
the invention adopts an integrated prediction algorithm based on a tree, namely random forest, combines the defects of a random forest algorithm and adopts a mixed prediction model combining a density clustering algorithm and the random forest, so that the algorithm has good convergence and the prediction accuracy is improved. The prediction model can accurately predict the travel time of each key road section in a certain time period, the result can be used for prejudging the development trend of the traffic state, a control scheme is made on the road with potential congestion in advance, and the prediction model can also be used for dynamic path induction, so that the best travel plan is planned for travelers, and social intelligent travel is assisted.
Drawings
FIG. 1 is a flow chart of the operation of a preferred embodiment of the present invention;
FIG. 2 is a feature importance metric made for a portion of the correlation factors;
FIG. 3 is a diagram of a model architecture combining density clustering and regression trees;
FIG. 4 is a travel time trend graph predicted for a particular road by the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described in detail and clearly with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention.
The technical scheme for solving the technical problems is as follows:
the specific steps of the embodiment provided by the invention comprise:
1. a historical traffic data set is obtained.
The data set mainly comprises: the system comprises a travel time data set, a road section attribute data set, a road network topology data set, a weather data set and the like, wherein the specific information is as follows:
Figure BDA0001591455280000041
2. and extracting key features having influence on the prediction result.
2.1 the present invention is directed to the prediction of time series, when predicting the travel time of a certain time period in the future, the data of the first few minutes of the time period has a large influence on the prediction result, so that a characteristic related to time needs to be constructed. Two minutes are taken as a time period, if the time period to be predicted is 2017/06/0108:04-08:06, time is taken as the current travel time, time1 is taken as the travel time of 08:02-08:04, time2 is taken as the travel time of 08:00-08:02, and the like, and the specific steps are shown in the following table. The feature vectors constructed here are time1, time2, …, and time 8.
Time period (2017/06/01) time time1 time2 time3 ··· time7 time8
08:00-08:02 10 null null null ··· null null
08:02-08:04 8.2 10 null null ··· null null
08:04-08:06 7.8 8.2 10 null ··· null null
And 2.2, extracting relevant data of the upper and lower streams of the road section as features by combining the road network topological structure.
And calculating the feature correlation degree of the feature vector and the target vector. And (3) performing feature importance measurement on the features extracted in the step (2.1) by utilizing a feature selection method commonly used in a random forest algorithm, and taking the feature importance measurement as a measure for the correlation degree between the feature vector and the target vector. The results are shown in FIG. 2.
3. And establishing a prediction model.
The prediction model is to improve a random forest by using density clustering algorithm (DBSCAN), the structures of a single regression tree and the DBSCAN model are shown in figure 3, and the specific method comprises the following steps:
1) and creating a regression tree by using the training sample set, wherein the training samples are selected to enter a left branch or a right branch according to the splitting attribute and the splitting point, and when the regression tree stops growing, leaf nodes of the training samples accumulate partial sample data. The sum of the sample data of all leaf nodes is equal to the original training sample for creating the regression tree, and when the leaf nodes accumulate more data, the abnormal sample does not exist in the training sample.
2) And (3) clustering the sample data accumulated in the leaf nodes of the regression tree in the step 1) by adopting a density clustering algorithm (DBSCAN), wherein the sample data set in the leaf nodes forms a plurality of arbitrarily-shaped clusters C (C1, C2, C3, …, ck), and k is the number of the clusters. And calculating the clustering center D of each cluster in the set Cc={d1,d2,d3,…,dkIn which d is1Representing the cluster center of the class cluster c 1.
3) When the regression tree created in the step 1) is used for prediction, firstly, the prediction sample X starts to move from the root node of the regression tree, traverses the split attributes and the split nodes, and finally falls into the corresponding leaf nodes, and a plurality of clusters are formed after the sample data in the leaf nodes at the moment is processed in the step 2).
When a new sample X is predicted, the sample X is respectively calculated to be most similar to the class cluster in the leaf node, the distance between the sample X and the center of each cluster is calculated by the weighted Euclidean distance in consideration of the weight of the sample attribute, the class cluster with the closer distance is generally considered to be more similar to the predicted sample, and finally the cluster ci with the closest distance to the predicted sample is returned.
Weighted euclidean distance formula:
Figure BDA0001591455280000061
wherein, X: prediction samples, d: cluster center of class cluster, n: sample dimension, V'i: is the calculated feature correlation.
4) Acquisition step 3)The most similar cluster ci returned in the step (c), and the mean value V of the target variable values of the samples in the cluster ci is calculatedCiBy VCiAs the final prediction result of the regression tree.
5) By the method for obtaining the prediction result, the prediction result of each tree in the random forest is obtained, and the average value of the prediction results is taken as the final prediction result of the random forest.
Fig. 4 predicts the travel time trend of a road segment in the time period of 2017, 6, month 26, 5:00 to 20: 00.
The above examples are to be construed as merely illustrative and not limitative of the remainder of the disclosure. After reading the description of the invention, the skilled person can make various changes or modifications to the invention, and these equivalent changes and modifications also fall into the scope of the invention defined by the claims.

Claims (5)

1. A road travel time prediction method based on random forests and a clustering algorithm is characterized by comprising the following steps:
1) extracting a relevant feature set V influencing a prediction result from historical traffic data, weather data, a road network structure relation and road section self attributes according to the time-space characteristics of a road network, and constructing a sample data set D, wherein V is { V ═ V { (V }1,v2,v3,…,vnThe data structure of the sample data set D is: { v1,v2,v3,…,vn,y},vnAs features, n is a feature dimension, and y is a prediction target;
2) carrying out data cleaning and missing value supplement on the sample data set D in the step 1);
3) and measuring the feature importance of the feature set V and the predicted target y by using a feature selection method commonly used by a random forest algorithm to obtain the correlation degree V 'between each feature in V and the predicted target, wherein V' ═ V { (V)1′,v2′,v3′,…,vn′},v1The larger the value of' the characteristic v is specified1The more relevant the predicted target, the more important this feature, the more important features are preferentially selected for constructionAn organic forest;
4) constructing a density clustering and random forest mixed prediction model by using the sample data set D;
5) inputting sample data X related to the predicted road section into the density clustering and random forest mixed prediction model constructed in the step 4) to obtain a final prediction result;
the step 4) of constructing the density clustering and random forest mixed prediction model specifically comprises the following steps:
firstly, initializing the number m of generating regression trees T and the depth d of the trees in a random forest; wherein T ═ { T ═ T1,t2,t3,…,tm},t1Representing the serial number of the regression tree, and m represents the generation of m regression trees;
secondly, generating m sample subsets D which are consistent with the number of the established regression tree into the sample data set D obtained in the step 2) by a Bootstrap method through a re-sampling mode with a back-put on the sample data set D, wherein the number of the m sample subsets D is equal to that of the { D }1,D2,…,Dm};
Respectively creating m regression tree models without pruning for the m sample subsets, adopting the minimum residual variance as a basis for selecting splitting attributes and splitting points, and after the regression tree is created, enabling leaf nodes of each regression tree to correspond to partial sample data;
fourthly, clustering sample data existing in leaf nodes of each regression tree in the third step by using a density clustering algorithm, forming a plurality of clusters C (C1, C2, C3, …, ck) with any shapes in the sample data set in the leaf nodes, and calculating the clustering center D of each cluster in the set C, wherein k is the number of the clustersc={d1,d2,d3,…,dkIn which d is1Representing the cluster center of the class cluster c 1.
2. The method for predicting road travel time based on random forests and clustering algorithms according to claim 1, wherein the step 5) of obtaining a final prediction result specifically comprises the steps of:
firstly, traversing each regression tree in the random forest by using a sample X until a leaf node of each regression tree;
respectively calculating the distance between the sample X and the center of each class cluster in the leaf nodes by using an Euclidean distance formula, and finally returning the class cluster closest to the predicted sample;
calculating the average value of the target variable values of the samples in the most similar cluster returned in the step two to serve as the prediction result of the regression tree;
fourthly, obtaining the prediction result of each regression tree, and taking the average value as the final prediction result of the random forest:
Figure FDA0002755259110000021
wherein, yrfAs final predicted value of random forest, yt1Denotes the t-th1Prediction of regression trees.
3. The method for predicting road travel time based on random forest and clustering algorithm as claimed in claim 2, wherein the step two is to calculate the distance between the sample X and each clustering center by weighted Euclidean distance in consideration of the weight of the sample attribute, and generally considering that the cluster with smaller distance value is more similar to the predicted sample;
weighted euclidean distance formula:
Figure FDA0002755259110000022
wherein, X: prediction samples, d: cluster center of class cluster, n: sample dimension, V'i: and (4) feature correlation degree.
4. The method for predicting road travel time based on random forests and clustering algorithms according to claim 2, wherein the m regression tree models without pruning are respectively constructed for the m sample subsets, and the minimum residual variance is used as a basis for selecting splitting attributes and splitting points, and specifically comprises the following steps: first, each feature in the sample subset is traversed, and the data under each feature is sorted according to the orderSorting from small to large; secondly, taking the sorted characteristic data as splitting nodes in sequence, enabling the data smaller than the splitting nodes to enter a left branch, enabling the data larger than or equal to the splitting nodes to enter a right branch, and dividing the original sample subset into two data sets; then, respectively calculating the S value before division and the S value of the left branch after division by using a minimum residual variance formulaLValue, S of the right branchRValue, calculate S- (S)L+SR) S represents the minimum residual variance value, a characteristic value meeting the maximum difference value is selected as a splitting node, and a characteristic item where the splitting node is located is used as the current splitting characteristic; recursively performing the above partitioning operation, and checking whether a recursion termination condition is satisfied: the depth of the tree is larger than the initial value of d, if the depth of the tree is larger than the initial value of d, the growth of the tree is stopped, and if the depth of the tree is larger than the initial value of d, the regression tree is allowed to continue to grow.
5. The method for predicting road travel time based on random forest and clustering algorithm according to claim 4, wherein the minimum residual variance formula is as follows:
Figure FDA0002755259110000031
where N is the number of samples in each sample subset, y is the prediction target,
Figure FDA0002755259110000032
is the mean of the predicted targets.
CN201810190151.6A 2018-03-08 2018-03-08 Road travel time prediction method based on random forest and clustering algorithm Active CN108399748B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810190151.6A CN108399748B (en) 2018-03-08 2018-03-08 Road travel time prediction method based on random forest and clustering algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810190151.6A CN108399748B (en) 2018-03-08 2018-03-08 Road travel time prediction method based on random forest and clustering algorithm

Publications (2)

Publication Number Publication Date
CN108399748A CN108399748A (en) 2018-08-14
CN108399748B true CN108399748B (en) 2020-12-22

Family

ID=63091710

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810190151.6A Active CN108399748B (en) 2018-03-08 2018-03-08 Road travel time prediction method based on random forest and clustering algorithm

Country Status (1)

Country Link
CN (1) CN108399748B (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109307159A (en) * 2018-08-21 2019-02-05 湖南大学 A kind of pipe network model alarm method based on water consumption optimal prediction model
CN109376906B (en) * 2018-09-21 2021-11-19 中国科学院深圳先进技术研究院 Travel time prediction method and system based on multi-dimensional trajectory and electronic equipment
CN109448366B (en) * 2018-10-18 2021-09-17 南京航空航天大学 Airspace sector crowding degree prediction method based on random forest
CN109753499A (en) * 2018-12-17 2019-05-14 云南电网有限责任公司信息中心 A kind of O&M monitoring data administering method
CN109703606B (en) * 2019-01-16 2020-12-15 北京交通大学 High-speed train intelligent driving control method based on historical operation data
CN109859475B (en) * 2019-03-14 2021-08-31 江苏中设集团股份有限公司 Intersection signal control method, device and system based on DBSCAN density clustering
CN109856307B (en) * 2019-03-27 2021-04-16 大连理工大学 Metabolic component molecular variable comprehensive screening technology
CN110288849B (en) * 2019-07-29 2020-05-29 电子科技大学 Travel route recommendation method based on hybrid traffic mode
CN110719223B (en) * 2019-09-16 2020-08-11 清华大学 Detection period cooperative self-adaptive adjustment method and system
CN111242206B (en) * 2020-01-08 2022-06-17 吉林大学 High-resolution ocean water temperature calculation method based on hierarchical clustering and random forests
CN111309852B (en) * 2020-03-16 2021-09-03 青岛百洋智能科技股份有限公司 Method, system, device and storage medium for generating visual decision tree set model
CN113269347B (en) * 2021-03-31 2023-05-30 安徽农业大学 College express network node flow prediction method based on random forest
CN112905583A (en) * 2021-04-01 2021-06-04 辽宁工程技术大学 High-dimensional big data outlier detection method
CN114461704B (en) * 2022-04-14 2022-06-24 浙江理工大学 Method and system for predicting loom availability based on loom productivity

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1134713A1 (en) * 2000-03-10 2001-09-19 ROTIS INC., (Road traffic Information System) Method and apparatus for traffic information collection
CN103761419A (en) * 2013-12-28 2014-04-30 辽宁师范大学 Density self-adaptive non-supervision clustering method based on space shortest distance average
CN105405293A (en) * 2015-12-23 2016-03-16 青岛海信网络科技股份有限公司 Short-term prediction method of road travel time and system
CN106096748A (en) * 2016-04-28 2016-11-09 武汉宝钢华中贸易有限公司 Entrucking forecast model in man-hour based on cluster analysis and decision Tree algorithms
CN107045788A (en) * 2017-06-28 2017-08-15 北京数行健科技有限公司 Traffic Forecasting Methodology and device
CN107171848A (en) * 2017-05-27 2017-09-15 华为技术有限公司 A kind of method for predicting and device
CN107180362A (en) * 2017-05-03 2017-09-19 浙江工商大学 Retail commodity sales forecasting method based on deep learning

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324932B (en) * 2013-06-07 2017-04-12 东软集团股份有限公司 Video-based vehicle detecting and tracking method and system
CN103400391B (en) * 2013-08-09 2016-08-10 北京博思廷科技有限公司 A kind of multi-object tracking method based on the random forest improved and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1134713A1 (en) * 2000-03-10 2001-09-19 ROTIS INC., (Road traffic Information System) Method and apparatus for traffic information collection
CN103761419A (en) * 2013-12-28 2014-04-30 辽宁师范大学 Density self-adaptive non-supervision clustering method based on space shortest distance average
CN105405293A (en) * 2015-12-23 2016-03-16 青岛海信网络科技股份有限公司 Short-term prediction method of road travel time and system
CN106096748A (en) * 2016-04-28 2016-11-09 武汉宝钢华中贸易有限公司 Entrucking forecast model in man-hour based on cluster analysis and decision Tree algorithms
CN107180362A (en) * 2017-05-03 2017-09-19 浙江工商大学 Retail commodity sales forecasting method based on deep learning
CN107171848A (en) * 2017-05-27 2017-09-15 华为技术有限公司 A kind of method for predicting and device
CN107045788A (en) * 2017-06-28 2017-08-15 北京数行健科技有限公司 Traffic Forecasting Methodology and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
随机森林模型在分类与回归分析中的应用;李欣海;《应用昆虫学报》;20130715;正文第1-8页 *

Also Published As

Publication number Publication date
CN108399748A (en) 2018-08-14

Similar Documents

Publication Publication Date Title
CN108399748B (en) Road travel time prediction method based on random forest and clustering algorithm
CN108846517B (en) Integration method for predicating quantile probabilistic short-term power load
CN101093559B (en) Method for constructing expert system based on knowledge discovery
CN105809672B (en) A kind of image multiple target collaboration dividing method constrained based on super-pixel and structuring
CN106022477A (en) Intelligent analysis decision system and method
CN103106279A (en) Clustering method simultaneously based on node attribute and structural relationship similarity
CN105335752A (en) Principal component analysis multivariable decision-making tree-based connection manner identification method
CN112820105A (en) Road network abnormal area processing method and system
CN111292534A (en) Traffic state estimation method based on clustering and deep sequence learning
CN109948066A (en) A kind of point of interest recommended method based on Heterogeneous Information network
CN109411093A (en) A kind of intelligent medical treatment big data analysis processing method based on cloud computing
CN111709560A (en) Method for solving vehicle path problem based on improved ant colony algorithm
CN102571431A (en) Group concept-based improved Fast-Newman clustering method applied to complex network
CN106789149B (en) Intrusion detection method adopting improved self-organizing characteristic neural network clustering algorithm
CN106326923A (en) Sign-in position data clustering method in consideration of position repetition and density peak point
Xu et al. Map matching based on conditional random fields and route preference mining for uncertain trajectories
CN116681176B (en) Traffic flow prediction method based on clustering and heterogeneous graph neural network
CN114819589A (en) Urban space high-quality utilization determination method, system, computer equipment and terminal
CN111882114A (en) Short-term traffic flow prediction model construction method and prediction method
Ding et al. Inferring road type in crowdsourced map services
CN108416395A (en) A kind of Interactive Decision-Making tree constructing method based on attribute loop
CN117078049A (en) Homeland space planning evaluation method and system
CN117034116A (en) Machine learning-based traditional village space type identification method
CN108647189B (en) Method and device for identifying user crowd attributes
CN117010373A (en) Recommendation method for category and group to which asset management data of power equipment belong

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant