CN108492560A - A kind of Road Detection device missing data complementing method and device - Google Patents
A kind of Road Detection device missing data complementing method and device Download PDFInfo
- Publication number
- CN108492560A CN108492560A CN201810293819.XA CN201810293819A CN108492560A CN 108492560 A CN108492560 A CN 108492560A CN 201810293819 A CN201810293819 A CN 201810293819A CN 108492560 A CN108492560 A CN 108492560A
- Authority
- CN
- China
- Prior art keywords
- function
- missing data
- data
- multivariate
- detector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000001514 detection method Methods 0.000 title claims abstract description 11
- 238000012549 training Methods 0.000 claims abstract description 19
- 238000012795 verification Methods 0.000 claims abstract description 19
- 230000001419 dependent effect Effects 0.000 claims abstract description 7
- 238000005457 optimization Methods 0.000 claims description 13
- 238000004364 calculation method Methods 0.000 claims description 7
- 239000000284 extract Substances 0.000 claims description 6
- 230000000295 complement effect Effects 0.000 claims description 4
- 230000007423 decrease Effects 0.000 claims description 4
- 238000013075 data extraction Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims 1
- 238000011156 evaluation Methods 0.000 abstract description 2
- 238000003556 assay Methods 0.000 abstract 1
- 238000000605 extraction Methods 0.000 abstract 1
- 230000000875 corresponding effect Effects 0.000 description 13
- 230000002596 correlated effect Effects 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 3
- 238000010200 validation analysis Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000032683 aging Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/40—Business processes related to the transportation industry
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Human Resources & Organizations (AREA)
- Tourism & Hospitality (AREA)
- Marketing (AREA)
- Theoretical Computer Science (AREA)
- General Business, Economics & Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Analytical Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Development Economics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Traffic Control Systems (AREA)
Abstract
Description
技术领域technical field
本发明涉及高速公路交通信息采集,属于高速公路交通技术领域,具体涉及一种道路检测器缺失数据补全方法及装置。The invention relates to expressway traffic information collection, belongs to the technical field of expressway traffic, and in particular relates to a method and device for complementing missing data of a road detector.
背景技术Background technique
随着世界经济的不断发展,交通运输事业,特别是公路交通运输,也得到了长足的发展。尤其是最近几十年以来,高速公路以其快速、高效、安全等特点在世界范围内得到了迅猛的发展,大大增强了地区之间的经济联系,加快了世界各国、地区的社会经济发展。With the continuous development of the world economy, the transportation industry, especially the road transportation, has also been greatly developed. Especially in recent decades, expressways have developed rapidly around the world due to their characteristics of speed, efficiency, and safety, which have greatly enhanced the economic ties between regions and accelerated the social and economic development of countries and regions in the world.
交通数据采集是高速公路运营及监管的基础,准确的道路交通数据支持交通管理者制定行之有效的管理决策。Traffic data collection is the basis of expressway operation and supervision. Accurate road traffic data supports traffic managers to make effective management decisions.
然而,由于检测设备老化、传输线路故障、检测环境不良、设备调试和使用不当等原因,使得检测器采集到的动态交通数据存在着各种各样的质量问题。如何对已有的检测器数据进行分析,进而准确高效的补全缺失数据,具有重要的工程意义。However, there are various quality problems in the dynamic traffic data collected by the detectors due to aging detection equipment, transmission line failure, poor detection environment, equipment debugging and improper use, etc. How to analyze the existing detector data, and then fill in the missing data accurately and efficiently, has important engineering significance.
发明内容Contents of the invention
发明目的:针对现有技术中存在的问题,本发明提供一种道路检测器缺失数据补全方法及装置,对缺失的路段检测器数据分类型进行补全,提供给交通管理者完整的高速公路交通数据,从而制定合理的管理策略,提升高速公路网整体运行效率。Purpose of the invention: Aiming at the problems existing in the prior art, the present invention provides a method and device for complementing the missing data of road detectors, complementing the missing data types of road section detectors, and providing traffic managers with a complete expressway traffic data, so as to formulate reasonable management strategies and improve the overall operating efficiency of the expressway network.
技术方案:为实现上述发明目的,本发明采用如下技术方案:Technical solution: In order to achieve the above-mentioned purpose of the invention, the present invention adopts the following technical solution:
一种道路检测器缺失数据补全方法,包括如下步骤:A method for complementing missing data of a road detector, comprising the steps of:
(1)对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;(1) Analyze the data collected by the road detector, extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate;
(2)明确缺失的数据类型,并找出每一条缺失的数据所在位置;(2) Specify the type of missing data, and find out the location of each missing data;
(3)从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;(3) Delete the missing data from the original data, and divide the deleted data to form a training set and a verification set;
(4)根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1,β2,β3,β4为调整参数;(4) According to the positive correlation between speed and the ratio of flow and occupancy, add adjustment parameters to establish some multivariate function forms with speed V as dependent variable and flow Q and occupancy O as independent variables for selection and fitting; the established Multivariate functional forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters;
(5)在训练集中,调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;(5) In the training set, adjust the undetermined parameters of each multivariate function, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value;
(6)在验证集中,用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;(6) In the verification set, use the minimum root mean square error as the basis for selection, calculate the fitting error of each multivariate function, and select the optimal multivariate function form;
(7)应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。(7) Using the optimal function and its corresponding optimization parameters, the independent variable corresponding to the missing data of the detector is substituted into the calculation, and the estimated data is calculated to fill the missing data value.
作为优选,所述步骤(4)中,所建立的多元函数形式还包括如下至少1种: As a preference, in the step (4), the established multivariate function form also includes at least one of the following:
作为优选,所述步骤(5)中,确定每个多元函数的最优化参数,包括:(5.1)设定多元函数模型的初始参数值;(5.2)利用最小二乘法原理建立误差函数,不断调整模型的参数值;(5.3)经过多次迭代及调整后,当误差函数值下降到某一个给定范围内,停止迭代,记录模型参数作为该多元函数的最优化参数。As preferably, in described step (5), determine the optimal parameter of each multivariate function, comprise: (5.1) set the initial parameter value of multivariate function model; (5.2) utilize least square method principle to establish error function, constantly adjust The parameter value of the model; (5.3) After multiple iterations and adjustments, when the error function value drops to a certain given range, the iteration is stopped, and the model parameters are recorded as the optimal parameters of the multivariate function.
作为优选,所述步骤(5.2)中,调整模型参数的方法为:在训练集中,假定速度为缺失数据,将流量Q、占有率O作为自变量输入,带入待拟合多元函数中,计算出估计速度V’,以误差函数作为判断依据,向使误差函数值下降最快的方向不断调整模型的参数值,使得估计速度V’与真实速度V之间的误差逐步减小。As a preference, in the step (5.2), the method of adjusting the model parameters is as follows: in the training set, assume that the speed is missing data, input the flow Q and the occupancy rate O as independent variables, bring them into the multivariate function to be fitted, and calculate Calculate the estimated velocity V', use the error function as the basis for judgment, and continuously adjust the parameter values of the model in the direction that makes the error function value decrease the fastest, so that the error between the estimated velocity V' and the real velocity V gradually decreases.
作为优选,所述步骤(6)中,选取最优多元函数的方法为:应用各多元函数的最优化参数,将验证集数据中的自变量代入多元函数,计算所缺失数据类型的估计值,进而计算验证集中每种多元函数拟合出估计值与实测值的均方根误差,对比各个多元函数,选出最小的均方根误差对应的函数形式。As a preference, in the step (6), the method for selecting the optimal multivariate function is: applying the optimization parameters of each multivariate function, substituting the independent variable in the verification set data into the multivariate function, and calculating the estimated value of the missing data type, Then calculate the root mean square error of each multivariate function in the verification set to fit the estimated value and the measured value, compare the various multivariate functions, and select the function form corresponding to the smallest root mean square error.
本发明另一方面提供的一种道路检测器缺失数据补全装置,包括:Another aspect of the present invention provides a road detector missing data complement device, comprising:
数据提取模块,用于对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;The data extraction module is used to analyze the data collected by the road detector, and extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate;
缺失定位模块,用于明确缺失的数据类型,并找出每一条缺失的数据所在位置;The missing location module is used to clarify the type of missing data and find out the location of each missing data;
预处理模块,用于从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;The preprocessing module is used to delete missing data from the original data, and divide the deleted data to form a training set and a verification set;
多元函数构建模块,用于根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1,β2,β3,β4为调整参数;The multivariate function building block is used to add adjustment parameters to establish a number of multivariate function forms with speed V as the dependent variable and flow Q and occupancy O as independent variables according to the positive correlation between the speed and the ratio of flow and occupancy. ; The established multivariate function forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters;
函数拟合模块,用于基于训练集调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;The function fitting module is used to adjust the undetermined parameters of each multivariate function based on the training set, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value;
最优函数选取模块,用于基于验证集用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;The optimal function selection module is used to calculate the fitting error of each multivariate function based on the verification set with the minimum root mean square error as the selection basis, and select the optimal multivariate function form;
以及,缺失填补模块,用于应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。And, the missing filling module is used to apply the optimal function and its corresponding optimization parameters, substitute the independent variable corresponding to the missing data of the detector for calculation, and calculate estimated data to fill the missing data value.
作为优选,所述多元函数构建模块中所建立的多元函数形式还包括如下至少1种: Preferably, the multivariate function forms established in the multivariate function building block also include at least one of the following:
有益效果:与现有技术相比,本发明的有益效果在于:Beneficial effect: compared with prior art, the beneficial effect of the present invention is:
1、本发明提供的多元函数拟合形式符合交通流理论中速度与流量和密度的比值成正相关,密度和时间占有率成正相关,速度与流量和占有率的比值成正相关的相互关系,由此可以应用已有流量、占有率数据估计缺失的速度数据。2、本发明以均方根误差作为评价标准,对每个缺失数据集选用误差最低的多元函数拟合形式,能够保证较高的补全精度。3、本发明的补全道路缺失速度数据,能够完善高速公路交通运行信息,为交通管理者制定管理策略提供依据,进而提升交通运行的安全性,提高高速公路整体运行水平。1, the multivariate function fitting form that the present invention provides meets the ratio of speed and flow and density in the traffic flow theory and is positively correlated, and density and time occupancy are positively correlated, and the ratio of speed and flow and occupancy is positively correlated with each other, thus Existing traffic and occupancy data can be used to estimate missing speed data. 2. The present invention uses the root mean square error as the evaluation standard, and selects the multivariate function fitting form with the lowest error for each missing data set, which can ensure higher completion accuracy. 3. Complementing the missing speed data of roads in the present invention can improve the traffic operation information of expressways, provide a basis for traffic managers to formulate management strategies, and then improve the safety of traffic operation and improve the overall operation level of expressways.
附图说明Description of drawings
图1为本发明道路检测器缺失数据补全流程图;Fig. 1 is the flow chart of complementing the missing data of the road detector of the present invention;
图2为本发明道路检测器采集数据实例数据部分样本截图;Fig. 2 is a partial sample screenshot of data collected by the road detector of the present invention;
图3为本发明实例缺失数据补全的多元函数误差分析结果图。Fig. 3 is a diagram of error analysis results of multivariate function for missing data completion in the example of the present invention.
具体实施方式Detailed ways
下面结合附图和具体实施例,对本发明作更进一步的说明。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
如附图1所示,本发明实施例公开的一种道路检测器缺失数据补全方法,首先对采集数据进行分析,提取其中有用信息;明确缺失数据,从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集。然后建立供选择拟合的多元函数形式,调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小。接着用均方根误差作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式。最后代入最优函数进行计算,填补缺失的数据值。详细步骤如下:As shown in Figure 1, a method for complementing missing data of a road detector disclosed in an embodiment of the present invention firstly analyzes the collected data and extracts useful information therein; defines the missing data and deletes the missing data from the original data, And the deleted data is divided to form a training set and a verification set. Then establish the multivariate function form for selection and fitting, adjust the undetermined parameters of each multivariate function, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitting value. Then, the root mean square error is used as the basis for selection, and the fitting error of each multivariate function is calculated to select the optimal form of multivariate function. Finally, the optimal function is substituted into the calculation to fill in the missing data values. The detailed steps are as follows:
(1)对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率。首先,收集路上布设道路检测器(以线圈检测器和地磁检测器为主)采集的数据,然后对收集的交通采集信息进行初步过滤和分析,收集其中的有用信息,路段检测器采集的交通有效信息主要包含交通流量、速度、道路占有率,采集间隔与检测器的类型有关,一般在30秒至300秒中间。(1) Analyze the data collected by the road detector, and extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate. First, collect the data collected by road detectors (mainly coil detectors and geomagnetic detectors) on the road, and then conduct preliminary filtering and analysis on the collected traffic collection information to collect useful information, and the traffic collected by road section detectors is effective. The information mainly includes traffic flow, speed, and road occupancy rate. The collection interval is related to the type of detector, generally between 30 seconds and 300 seconds.
(2)明确缺失的数据类型,并找出每一条缺失的数据所在位置。本步骤中,对采集的数据进行剖析,明确缺失的检测器数据的类型。图2中,可以看出其中检测器检测的某些速度值出现丢失或者数据异常现象,所以本例以车流的平均速度作为需要填补的数据类型,找到速度数据异常的位置。(2) Identify the type of missing data, and find out the location of each missing data. In this step, the collected data is analyzed to clarify the type of missing detector data. In Figure 2, it can be seen that some speed values detected by the detector are missing or have abnormal data. Therefore, in this example, the average speed of the traffic flow is used as the data type to be filled in to find the location of the abnormal speed data.
(3)从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集。本步骤中,将数据集中缺失的数据对应的数据行删除掉,将每个数据集样本中的数据大致从中间分成两段,前一段作为调整参数的训练集合,后一段作为选择最优函数的验证集合。本例中,将出现速度缺失或异常的数据行进行删除,将其一分为二,一部分作为训练集,另一部分作为验证集。(3) Delete the missing data from the original data, and divide the deleted data to form a training set and a verification set. In this step, the data rows corresponding to the missing data in the data set are deleted, and the data in each data set sample is roughly divided into two sections from the middle. Validation collection. In this example, the data rows with missing or abnormal speeds are deleted and divided into two, one part is used as a training set and the other part is used as a validation set.
(4)建立供选择拟合的多元函数形式。根据交通流理论,速度与流量和密度的比值成正比,车流密度和时间占有率成正比,因此速度与流量和占有率的比值成正相关关系。本例中,检测器已有的准确数据为流量Q和占有率O,待补充的缺失数据为速度参数V。因此,以V(速度)为因变量,Q(流量)、O(占有率)为自变量,βi为待定参数,建立7个多元函数形式作为拟合函数。这7个拟合函数分别为:(4) Establish a multivariate function form for selection and fitting. According to the traffic flow theory, the speed is proportional to the ratio of flow and density, and the traffic density is proportional to the time occupancy, so the speed is positively correlated with the ratio of flow and occupancy. In this example, the existing accurate data of the detector are the flow rate Q and the occupancy rate O, and the missing data to be supplemented is the velocity parameter V. Therefore, with V (velocity) as the dependent variable, Q (flow rate) and O (occupancy rate) as independent variables, and βi as undetermined parameters, 7 multivariate function forms are established as fitting functions. The seven fitting functions are:
第一个函数形式 first functional form
第二个函数形式 The second functional form
第三个函数形式 The third functional form
第四个函数形式 The fourth functional form
第五个函数形式 fifth functional form
第六个函数形式 sixth functional form
第七个函数形式 The seventh function form
上述7种函数形式为经大量样本检验过,匹配程度较高的函数形式,其中第四、五、六函数形式普适性最好,在实际操作中,可以选择全部或部分函数供拟合筛选。The above 7 function forms have been tested by a large number of samples and have a high degree of matching. Among them, the fourth, fifth and sixth function forms are the most universal. In actual operation, all or part of the functions can be selected for fitting and screening .
(5)在训练集中,对每个多元函数调整待定参数,确定最优化参数。本步骤中,每个多元函数拟合主要包括以下步骤:(5) In the training set, adjust the undetermined parameters for each multivariate function to determine the optimal parameters. In this step, each multivariate function fitting mainly includes the following steps:
步骤(5.1)设定初始参数值。Step (5.1) sets initial parameter values.
步骤(5.2)利用最小二乘法原理建立误差函数,不断调整模型的参数值。Step (5.2) uses the principle of the least square method to establish an error function, and continuously adjusts the parameter values of the model.
步骤(5.3)经过多次迭代及调整后,当误差函数值下降到某一个给定范围内,停止迭代,此时记录模型参数作为该多元函数的最优化参数。After multiple iterations and adjustments in step (5.3), when the value of the error function drops to a certain given range, the iteration is stopped, and the model parameters are recorded as the optimization parameters of the multivariate function.
采用上述步骤最终确定出7个拟合函数各自的最优待定参数。The optimal undetermined parameters of each of the seven fitting functions are finally determined by the above steps.
具体地,首先设定各个函数中待定参数βi的初始值,将训练集合中的每一条检测器数据的流量Q、占有率O作为自变量带入到这7个拟合函数中,得到估计速度V’,利用最小二乘原理建立误差函数作为参数是否优化的标准,向使误差函数值下降最快的方向不断调整模型的参数值,最终找到使每个训练集中估计速度V’与真实速度V平均误差最小的最优待定参数βi值。Specifically, first set the initial value of the undetermined parameter βi in each function, and bring the flow Q and occupancy O of each detector data in the training set into the seven fitting functions as independent variables to obtain the estimated speed V', using the least squares principle to establish an error function as a criterion for parameter optimization, continuously adjust the parameter values of the model in the direction that makes the error function value drop the fastest, and finally find the estimated speed V' and the real speed V in each training set The optimal undetermined parameter βi value with the smallest average error.
(6)用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式。本步骤中以均方根误差作为多元函数选择依据,均方根误差是所选数据均方误差的算术平方根,又称为标准误差。其基本公式如下,其中observedt表示实际检测值,predictedt表示计算出的估计值,N为样本总数。(6) Using the minimum root mean square error as the selection basis, calculate the fitting error of each multivariate function, and select the optimal multivariate function form. In this step, the root mean square error is used as the basis for selecting the multivariate function, and the root mean square error is the arithmetic square root of the mean square error of the selected data, also known as the standard error. The basic formula is as follows, where observed t represents the actual detected value, predicted t represents the calculated estimated value, and N is the total number of samples.
取出验证集合,将验证集中每一条检测器采集的流量Q、占有率O作为自变量输入,带入每一个拟合函数中,利用其最优待定参数计算出估计速度V’。计算估计速度V’和验证集中真实速度V的均方根误差并记录。Take out the verification set, input the flow Q and occupancy rate O collected by each detector in the verification set as independent variables, bring them into each fitting function, and calculate the estimated velocity V' by using its optimal undetermined parameters. Calculate the root mean square error between the estimated velocity V' and the true velocity V in the validation set and record it.
对比7个拟合函数计算出的估计速度V’与真实速度V的均方根误差,选出其中最小的误差对应的拟合函数作为该数据的缺失补全函数,该函数对应的最优待定参数βi为最优化参数。Compare the root mean square error between the estimated speed V' calculated by the seven fitting functions and the real speed V, and select the fitting function corresponding to the smallest error as the missing completion function of the data. The optimal function corresponding to this function is to be determined. Parameter βi is the optimization parameter.
(7)应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。本例中,用缺失速度数据对应的检测器采集到的流量Q、占有率O作为自变量输入该函数,求出估计速度值,从而填补删去的缺失速度值。由图3可知,所选数据用第四个多元函数形式计算出的均方根误差,最小误差值为8.21%,以该函数作为该数据的缺失补全函数,最终填补精度在90%以上。(7) Using the optimal function and its corresponding optimization parameters, the independent variable corresponding to the missing data of the detector is substituted into the calculation, and the estimated data is calculated to fill the missing data value. In this example, the flow Q and occupancy rate O collected by the detector corresponding to the missing speed data are used as independent variables to input the function, and the estimated speed value is obtained, so as to fill in the deleted missing speed value. It can be seen from Figure 3 that the root mean square error calculated by the fourth multivariate function form of the selected data has a minimum error value of 8.21%. Using this function as the missing completion function of the data, the final filling accuracy is above 90%.
本发明另一实施例公开的道路检测器缺失数据补全装置,包括:数据提取模块,用于对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;缺失定位模块,用于明确缺失的数据类型,并找出每一条缺失的数据所在位置;预处理模块,用于从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;Another embodiment of the present invention discloses a missing data complementing device for a road detector, comprising: a data extraction module for analyzing the data collected by the road detector, extracting the average speed of the motor vehicle passing through the detector section, the passage of each detection interval The motor vehicle flow and road occupancy rate; the missing location module is used to clarify the type of missing data, and find out the location of each missing data; the preprocessing module is used to delete the missing data from the original data, and The deleted data is divided to form a training set and a verification set;
多元函数构建模块,用于根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 函数拟合模块,用于基于训练集调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;最优函数选取模块,用于基于验证集用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;以及,缺失填补模块,用于应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。The multivariate function building block is used to add adjustment parameters to establish a number of multivariate function forms with speed V as the dependent variable and flow Q and occupancy O as independent variables according to the positive correlation between the speed and the ratio of flow and occupancy. ; The established multivariate function forms include at least: The function fitting module is used to adjust the undetermined parameters of each multivariate function based on the training set, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value; the optimal function selection module is used to use the mean square value based on the verification set The minimum root error is used as the basis for selection, and the fitting error of each multivariate function is calculated to select the optimal multivariate function form; and, the missing filling module is used to apply the optimal function and its corresponding optimization parameters to correspond to The independent variable is substituted into the calculation, and the estimated data is calculated to fill in the missing data values.
上述道路检测器缺失数据补全装置实施例可以用于执行上述道路检测器缺失数据补全方法实施例,其技术原理、所解决的技术问题及产生的技术效果相似,上述描述的道路检测器缺失数据补全的具体工作过程及有关说明,可以参考前述道路检测器缺失数据补全方法实施例中的对应过程,在此不再赘述。The above embodiment of the missing data complementing device for road detectors can be used to implement the above embodiment of the missing data complementing method for road detectors. The technical principles, technical problems solved and technical effects produced are similar. For the specific working process and relevant descriptions of the data completion, reference may be made to the corresponding process in the above-mentioned embodiment of the missing data completion method of the road detector, and details are not repeated here.
本领域技术人员可以理解,可以对实施例中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个系统中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。Those skilled in the art can understand that the modules in the embodiment can be adaptively changed and installed in one or more systems different from the embodiment. Modules or units or components in the embodiments may be combined into one module or unit or component, and furthermore may be divided into a plurality of sub-modules or sub-units or sub-assemblies.
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以作出若干改进,这些改进也应视为本发明的保护范围。The above is only a preferred embodiment of the present invention, it should be pointed out that for those of ordinary skill in the art, without departing from the principle of the present invention, some improvements can also be made, and these improvements should also be regarded as the present invention. scope of protection.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810293819.XA CN108492560A (en) | 2018-04-04 | 2018-04-04 | A kind of Road Detection device missing data complementing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810293819.XA CN108492560A (en) | 2018-04-04 | 2018-04-04 | A kind of Road Detection device missing data complementing method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108492560A true CN108492560A (en) | 2018-09-04 |
Family
ID=63317926
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810293819.XA Pending CN108492560A (en) | 2018-04-04 | 2018-04-04 | A kind of Road Detection device missing data complementing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108492560A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109584553A (en) * | 2018-11-29 | 2019-04-05 | 中电海康集团有限公司 | A kind of section degree of association missing complementing method based on space time information |
CN109710659A (en) * | 2018-12-16 | 2019-05-03 | 苏州城方信息技术有限公司 | The complementing method of detector missing data based on temporal correlation |
CN110490419A (en) * | 2019-07-19 | 2019-11-22 | 珠海市岭南大数据研究院 | Processing method, device, computer equipment and the storage medium of Bus information data |
CN113377508A (en) * | 2021-05-28 | 2021-09-10 | 张燕 | Mass data rapid transmission method |
CN113782128A (en) * | 2021-08-09 | 2021-12-10 | 中国中医科学院中医临床基础医学研究所 | A missing data fitting method, device and computer equipment |
CN118885480A (en) * | 2024-07-15 | 2024-11-01 | 江苏奕隆机电科技有限公司 | A method for generating a dynamic index table of current-flow-pressure difference for a hydraulic brake-by-wire solenoid valve |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6236932B1 (en) * | 1996-12-16 | 2001-05-22 | Mannesmann Ag | Process for completing and/or verifying data concerning the state of a road network; traffic information centre |
CN103971520A (en) * | 2014-04-17 | 2014-08-06 | 浙江大学 | Traffic flow data recovery method based on space-time correlation |
CN106781457A (en) * | 2016-11-29 | 2017-05-31 | 东南大学 | A kind of freeway traffic flow parameter correction method based on multi-source fusion data |
CN107193876A (en) * | 2017-04-21 | 2017-09-22 | 美林数据技术股份有限公司 | A kind of missing data complementing method based on arest neighbors KNN algorithms |
-
2018
- 2018-04-04 CN CN201810293819.XA patent/CN108492560A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6236932B1 (en) * | 1996-12-16 | 2001-05-22 | Mannesmann Ag | Process for completing and/or verifying data concerning the state of a road network; traffic information centre |
CN103971520A (en) * | 2014-04-17 | 2014-08-06 | 浙江大学 | Traffic flow data recovery method based on space-time correlation |
CN106781457A (en) * | 2016-11-29 | 2017-05-31 | 东南大学 | A kind of freeway traffic flow parameter correction method based on multi-source fusion data |
CN107193876A (en) * | 2017-04-21 | 2017-09-22 | 美林数据技术股份有限公司 | A kind of missing data complementing method based on arest neighbors KNN algorithms |
Non-Patent Citations (1)
Title |
---|
金盛: "环形线圈检测器交通数据预处理方法研究", 《中国优秀硕士学位论文全文数据库·工程科技Ⅱ辑》 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109584553A (en) * | 2018-11-29 | 2019-04-05 | 中电海康集团有限公司 | A kind of section degree of association missing complementing method based on space time information |
CN109710659A (en) * | 2018-12-16 | 2019-05-03 | 苏州城方信息技术有限公司 | The complementing method of detector missing data based on temporal correlation |
CN109710659B (en) * | 2018-12-16 | 2022-11-25 | 苏州城方信息技术有限公司 | Method for filling detector missing data based on space-time correlation |
CN110490419A (en) * | 2019-07-19 | 2019-11-22 | 珠海市岭南大数据研究院 | Processing method, device, computer equipment and the storage medium of Bus information data |
CN113377508A (en) * | 2021-05-28 | 2021-09-10 | 张燕 | Mass data rapid transmission method |
CN113377508B (en) * | 2021-05-28 | 2023-08-22 | 张燕 | Mass data rapid transmission method |
CN113782128A (en) * | 2021-08-09 | 2021-12-10 | 中国中医科学院中医临床基础医学研究所 | A missing data fitting method, device and computer equipment |
CN118885480A (en) * | 2024-07-15 | 2024-11-01 | 江苏奕隆机电科技有限公司 | A method for generating a dynamic index table of current-flow-pressure difference for a hydraulic brake-by-wire solenoid valve |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108492560A (en) | A kind of Road Detection device missing data complementing method and device | |
Park et al. | Application of microscopic simulation model calibration and validation procedure: Case study of coordinated actuated signal system | |
CN103150900B (en) | Traffic jam event automatic detecting method based on videos | |
CN106997669B (en) | A method for judging the causes of traffic congestion based on feature importance | |
CN104408925B (en) | Crossing evaluation of running status method based on display radar | |
CN102800197B (en) | Preprocessing method of road section dynamic traffic stream essential data of urban road | |
CN106781486B (en) | Traffic state evaluation method based on floating car data | |
CN101783074A (en) | A method and system for real-time identification of urban road traffic flow status | |
CN107038167A (en) | Big data excavating analysis system and its analysis method based on model evaluation | |
CN114783183A (en) | Monitoring method and system based on traffic situation algorithm | |
CN105931458B (en) | A kind of method of road traffic flow detection device reliability assessment | |
CN104574968B (en) | Determining method for threshold traffic state parameter | |
CN108922168B (en) | A Meso-Level Accident-prone Road Discrimination Method | |
CN106649709B (en) | Vehicle stopping behavior pattern prediction and evaluation method based on data mining | |
CN105679025B (en) | A kind of arterial street travel time estimation method based on Changeable weight mixed distribution | |
CN104700646A (en) | Online GPS data based abnormal taxi track real-time detection method | |
Ban et al. | Bottleneck identification and calibration for corridor management planning | |
CN107330217A (en) | A kind of middle sight oil consumption Forecasting Methodology based on RBFNN | |
CN110929359A (en) | Pipe network siltation risk prediction modeling method based on PNN neural network and SWMM technology | |
CN112950940B (en) | Traffic diversion method in road construction period | |
CN111062648A (en) | Method for evaluating comprehensive performance of asphalt pavement | |
CN105551250A (en) | Method for discriminating urban road intersection operation state on the basis of interval clustering | |
CN104750919B (en) | A kind of road passage capability influence factor recognition methods | |
CN107230350A (en) | A kind of urban transportation amount acquisition methods based on bayonet socket Yu mobile phone flow call bill data | |
CN116402390A (en) | Road construction area traffic influence evaluation method and system based on primitive model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180904 |