CN108492560A - A kind of Road Detection device missing data complementing method and device - Google Patents

A kind of Road Detection device missing data complementing method and device Download PDF

Info

Publication number
CN108492560A
CN108492560A CN201810293819.XA CN201810293819A CN108492560A CN 108492560 A CN108492560 A CN 108492560A CN 201810293819 A CN201810293819 A CN 201810293819A CN 108492560 A CN108492560 A CN 108492560A
Authority
CN
China
Prior art keywords
function
missing data
data
multivariate
detector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810293819.XA
Other languages
Chinese (zh)
Inventor
王昊
张辉
姚东成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN201810293819.XA priority Critical patent/CN108492560A/en
Publication of CN108492560A publication Critical patent/CN108492560A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Development Economics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a kind of Road Detection device missing data complementing methods and device, this method to include:Road Detection device gathered data is analyzed, extraction by the motor vehicle average speed of detector section, each assay intervals by motor vehicle flow and roadway occupancy;It using traffic flow theory as foundation, is fitted using the function of many variables, using speed as dependent variable, flow, occupation rate are that independent variable establishes several function of many variables forms;The function of many variables are fitted based on training set, determine that the most optimized parameter keeps actual value and the root-mean-square error of match value minimum;The evaluation index judged as fitting function quality of root-mean-square error using on verification collection, finally using optimal function and its corresponding the most optimized parameter, missing data progress completion caused by because of the equipment the problems such as.Detector missing data complementing method provided by the invention has easy to implement, easy to operate, the features such as applicability is high, can improve detector gathered data quality, so as to improve road management, promote road traffic efficiency.

Description

一种道路检测器缺失数据补全方法及装置A method and device for complementing missing data of a road detector

技术领域technical field

本发明涉及高速公路交通信息采集,属于高速公路交通技术领域,具体涉及一种道路检测器缺失数据补全方法及装置。The invention relates to expressway traffic information collection, belongs to the technical field of expressway traffic, and in particular relates to a method and device for complementing missing data of a road detector.

背景技术Background technique

随着世界经济的不断发展,交通运输事业,特别是公路交通运输,也得到了长足的发展。尤其是最近几十年以来,高速公路以其快速、高效、安全等特点在世界范围内得到了迅猛的发展,大大增强了地区之间的经济联系,加快了世界各国、地区的社会经济发展。With the continuous development of the world economy, the transportation industry, especially the road transportation, has also been greatly developed. Especially in recent decades, expressways have developed rapidly around the world due to their characteristics of speed, efficiency, and safety, which have greatly enhanced the economic ties between regions and accelerated the social and economic development of countries and regions in the world.

交通数据采集是高速公路运营及监管的基础,准确的道路交通数据支持交通管理者制定行之有效的管理决策。Traffic data collection is the basis of expressway operation and supervision. Accurate road traffic data supports traffic managers to make effective management decisions.

然而,由于检测设备老化、传输线路故障、检测环境不良、设备调试和使用不当等原因,使得检测器采集到的动态交通数据存在着各种各样的质量问题。如何对已有的检测器数据进行分析,进而准确高效的补全缺失数据,具有重要的工程意义。However, there are various quality problems in the dynamic traffic data collected by the detectors due to aging detection equipment, transmission line failure, poor detection environment, equipment debugging and improper use, etc. How to analyze the existing detector data, and then fill in the missing data accurately and efficiently, has important engineering significance.

发明内容Contents of the invention

发明目的:针对现有技术中存在的问题,本发明提供一种道路检测器缺失数据补全方法及装置,对缺失的路段检测器数据分类型进行补全,提供给交通管理者完整的高速公路交通数据,从而制定合理的管理策略,提升高速公路网整体运行效率。Purpose of the invention: Aiming at the problems existing in the prior art, the present invention provides a method and device for complementing the missing data of road detectors, complementing the missing data types of road section detectors, and providing traffic managers with a complete expressway traffic data, so as to formulate reasonable management strategies and improve the overall operating efficiency of the expressway network.

技术方案:为实现上述发明目的,本发明采用如下技术方案:Technical solution: In order to achieve the above-mentioned purpose of the invention, the present invention adopts the following technical solution:

一种道路检测器缺失数据补全方法,包括如下步骤:A method for complementing missing data of a road detector, comprising the steps of:

(1)对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;(1) Analyze the data collected by the road detector, extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate;

(2)明确缺失的数据类型,并找出每一条缺失的数据所在位置;(2) Specify the type of missing data, and find out the location of each missing data;

(3)从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;(3) Delete the missing data from the original data, and divide the deleted data to form a training set and a verification set;

(4)根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1234为调整参数;(4) According to the positive correlation between speed and the ratio of flow and occupancy, add adjustment parameters to establish some multivariate function forms with speed V as dependent variable and flow Q and occupancy O as independent variables for selection and fitting; the established Multivariate functional forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters;

(5)在训练集中,调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;(5) In the training set, adjust the undetermined parameters of each multivariate function, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value;

(6)在验证集中,用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;(6) In the verification set, use the minimum root mean square error as the basis for selection, calculate the fitting error of each multivariate function, and select the optimal multivariate function form;

(7)应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。(7) Using the optimal function and its corresponding optimization parameters, the independent variable corresponding to the missing data of the detector is substituted into the calculation, and the estimated data is calculated to fill the missing data value.

作为优选,所述步骤(4)中,所建立的多元函数形式还包括如下至少1种: As a preference, in the step (4), the established multivariate function form also includes at least one of the following:

作为优选,所述步骤(5)中,确定每个多元函数的最优化参数,包括:(5.1)设定多元函数模型的初始参数值;(5.2)利用最小二乘法原理建立误差函数,不断调整模型的参数值;(5.3)经过多次迭代及调整后,当误差函数值下降到某一个给定范围内,停止迭代,记录模型参数作为该多元函数的最优化参数。As preferably, in described step (5), determine the optimal parameter of each multivariate function, comprise: (5.1) set the initial parameter value of multivariate function model; (5.2) utilize least square method principle to establish error function, constantly adjust The parameter value of the model; (5.3) After multiple iterations and adjustments, when the error function value drops to a certain given range, the iteration is stopped, and the model parameters are recorded as the optimal parameters of the multivariate function.

作为优选,所述步骤(5.2)中,调整模型参数的方法为:在训练集中,假定速度为缺失数据,将流量Q、占有率O作为自变量输入,带入待拟合多元函数中,计算出估计速度V’,以误差函数作为判断依据,向使误差函数值下降最快的方向不断调整模型的参数值,使得估计速度V’与真实速度V之间的误差逐步减小。As a preference, in the step (5.2), the method of adjusting the model parameters is as follows: in the training set, assume that the speed is missing data, input the flow Q and the occupancy rate O as independent variables, bring them into the multivariate function to be fitted, and calculate Calculate the estimated velocity V', use the error function as the basis for judgment, and continuously adjust the parameter values of the model in the direction that makes the error function value decrease the fastest, so that the error between the estimated velocity V' and the real velocity V gradually decreases.

作为优选,所述步骤(6)中,选取最优多元函数的方法为:应用各多元函数的最优化参数,将验证集数据中的自变量代入多元函数,计算所缺失数据类型的估计值,进而计算验证集中每种多元函数拟合出估计值与实测值的均方根误差,对比各个多元函数,选出最小的均方根误差对应的函数形式。As a preference, in the step (6), the method for selecting the optimal multivariate function is: applying the optimization parameters of each multivariate function, substituting the independent variable in the verification set data into the multivariate function, and calculating the estimated value of the missing data type, Then calculate the root mean square error of each multivariate function in the verification set to fit the estimated value and the measured value, compare the various multivariate functions, and select the function form corresponding to the smallest root mean square error.

本发明另一方面提供的一种道路检测器缺失数据补全装置,包括:Another aspect of the present invention provides a road detector missing data complement device, comprising:

数据提取模块,用于对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;The data extraction module is used to analyze the data collected by the road detector, and extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate;

缺失定位模块,用于明确缺失的数据类型,并找出每一条缺失的数据所在位置;The missing location module is used to clarify the type of missing data and find out the location of each missing data;

预处理模块,用于从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;The preprocessing module is used to delete missing data from the original data, and divide the deleted data to form a training set and a verification set;

多元函数构建模块,用于根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1234为调整参数;The multivariate function building block is used to add adjustment parameters to establish a number of multivariate function forms with speed V as the dependent variable and flow Q and occupancy O as independent variables according to the positive correlation between the speed and the ratio of flow and occupancy. ; The established multivariate function forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters;

函数拟合模块,用于基于训练集调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;The function fitting module is used to adjust the undetermined parameters of each multivariate function based on the training set, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value;

最优函数选取模块,用于基于验证集用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;The optimal function selection module is used to calculate the fitting error of each multivariate function based on the verification set with the minimum root mean square error as the selection basis, and select the optimal multivariate function form;

以及,缺失填补模块,用于应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。And, the missing filling module is used to apply the optimal function and its corresponding optimization parameters, substitute the independent variable corresponding to the missing data of the detector for calculation, and calculate estimated data to fill the missing data value.

作为优选,所述多元函数构建模块中所建立的多元函数形式还包括如下至少1种: Preferably, the multivariate function forms established in the multivariate function building block also include at least one of the following:

有益效果:与现有技术相比,本发明的有益效果在于:Beneficial effect: compared with prior art, the beneficial effect of the present invention is:

1、本发明提供的多元函数拟合形式符合交通流理论中速度与流量和密度的比值成正相关,密度和时间占有率成正相关,速度与流量和占有率的比值成正相关的相互关系,由此可以应用已有流量、占有率数据估计缺失的速度数据。2、本发明以均方根误差作为评价标准,对每个缺失数据集选用误差最低的多元函数拟合形式,能够保证较高的补全精度。3、本发明的补全道路缺失速度数据,能够完善高速公路交通运行信息,为交通管理者制定管理策略提供依据,进而提升交通运行的安全性,提高高速公路整体运行水平。1, the multivariate function fitting form that the present invention provides meets the ratio of speed and flow and density in the traffic flow theory and is positively correlated, and density and time occupancy are positively correlated, and the ratio of speed and flow and occupancy is positively correlated with each other, thus Existing traffic and occupancy data can be used to estimate missing speed data. 2. The present invention uses the root mean square error as the evaluation standard, and selects the multivariate function fitting form with the lowest error for each missing data set, which can ensure higher completion accuracy. 3. Complementing the missing speed data of roads in the present invention can improve the traffic operation information of expressways, provide a basis for traffic managers to formulate management strategies, and then improve the safety of traffic operation and improve the overall operation level of expressways.

附图说明Description of drawings

图1为本发明道路检测器缺失数据补全流程图;Fig. 1 is the flow chart of complementing the missing data of the road detector of the present invention;

图2为本发明道路检测器采集数据实例数据部分样本截图;Fig. 2 is a partial sample screenshot of data collected by the road detector of the present invention;

图3为本发明实例缺失数据补全的多元函数误差分析结果图。Fig. 3 is a diagram of error analysis results of multivariate function for missing data completion in the example of the present invention.

具体实施方式Detailed ways

下面结合附图和具体实施例,对本发明作更进一步的说明。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

如附图1所示,本发明实施例公开的一种道路检测器缺失数据补全方法,首先对采集数据进行分析,提取其中有用信息;明确缺失数据,从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集。然后建立供选择拟合的多元函数形式,调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小。接着用均方根误差作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式。最后代入最优函数进行计算,填补缺失的数据值。详细步骤如下:As shown in Figure 1, a method for complementing missing data of a road detector disclosed in an embodiment of the present invention firstly analyzes the collected data and extracts useful information therein; defines the missing data and deletes the missing data from the original data, And the deleted data is divided to form a training set and a verification set. Then establish the multivariate function form for selection and fitting, adjust the undetermined parameters of each multivariate function, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitting value. Then, the root mean square error is used as the basis for selection, and the fitting error of each multivariate function is calculated to select the optimal form of multivariate function. Finally, the optimal function is substituted into the calculation to fill in the missing data values. The detailed steps are as follows:

(1)对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率。首先,收集路上布设道路检测器(以线圈检测器和地磁检测器为主)采集的数据,然后对收集的交通采集信息进行初步过滤和分析,收集其中的有用信息,路段检测器采集的交通有效信息主要包含交通流量、速度、道路占有率,采集间隔与检测器的类型有关,一般在30秒至300秒中间。(1) Analyze the data collected by the road detector, and extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate. First, collect the data collected by road detectors (mainly coil detectors and geomagnetic detectors) on the road, and then conduct preliminary filtering and analysis on the collected traffic collection information to collect useful information, and the traffic collected by road section detectors is effective. The information mainly includes traffic flow, speed, and road occupancy rate. The collection interval is related to the type of detector, generally between 30 seconds and 300 seconds.

(2)明确缺失的数据类型,并找出每一条缺失的数据所在位置。本步骤中,对采集的数据进行剖析,明确缺失的检测器数据的类型。图2中,可以看出其中检测器检测的某些速度值出现丢失或者数据异常现象,所以本例以车流的平均速度作为需要填补的数据类型,找到速度数据异常的位置。(2) Identify the type of missing data, and find out the location of each missing data. In this step, the collected data is analyzed to clarify the type of missing detector data. In Figure 2, it can be seen that some speed values detected by the detector are missing or have abnormal data. Therefore, in this example, the average speed of the traffic flow is used as the data type to be filled in to find the location of the abnormal speed data.

(3)从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集。本步骤中,将数据集中缺失的数据对应的数据行删除掉,将每个数据集样本中的数据大致从中间分成两段,前一段作为调整参数的训练集合,后一段作为选择最优函数的验证集合。本例中,将出现速度缺失或异常的数据行进行删除,将其一分为二,一部分作为训练集,另一部分作为验证集。(3) Delete the missing data from the original data, and divide the deleted data to form a training set and a verification set. In this step, the data rows corresponding to the missing data in the data set are deleted, and the data in each data set sample is roughly divided into two sections from the middle. Validation collection. In this example, the data rows with missing or abnormal speeds are deleted and divided into two, one part is used as a training set and the other part is used as a validation set.

(4)建立供选择拟合的多元函数形式。根据交通流理论,速度与流量和密度的比值成正比,车流密度和时间占有率成正比,因此速度与流量和占有率的比值成正相关关系。本例中,检测器已有的准确数据为流量Q和占有率O,待补充的缺失数据为速度参数V。因此,以V(速度)为因变量,Q(流量)、O(占有率)为自变量,βi为待定参数,建立7个多元函数形式作为拟合函数。这7个拟合函数分别为:(4) Establish a multivariate function form for selection and fitting. According to the traffic flow theory, the speed is proportional to the ratio of flow and density, and the traffic density is proportional to the time occupancy, so the speed is positively correlated with the ratio of flow and occupancy. In this example, the existing accurate data of the detector are the flow rate Q and the occupancy rate O, and the missing data to be supplemented is the velocity parameter V. Therefore, with V (velocity) as the dependent variable, Q (flow rate) and O (occupancy rate) as independent variables, and βi as undetermined parameters, 7 multivariate function forms are established as fitting functions. The seven fitting functions are:

第一个函数形式 first functional form

第二个函数形式 The second functional form

第三个函数形式 The third functional form

第四个函数形式 The fourth functional form

第五个函数形式 fifth functional form

第六个函数形式 sixth functional form

第七个函数形式 The seventh function form

上述7种函数形式为经大量样本检验过,匹配程度较高的函数形式,其中第四、五、六函数形式普适性最好,在实际操作中,可以选择全部或部分函数供拟合筛选。The above 7 function forms have been tested by a large number of samples and have a high degree of matching. Among them, the fourth, fifth and sixth function forms are the most universal. In actual operation, all or part of the functions can be selected for fitting and screening .

(5)在训练集中,对每个多元函数调整待定参数,确定最优化参数。本步骤中,每个多元函数拟合主要包括以下步骤:(5) In the training set, adjust the undetermined parameters for each multivariate function to determine the optimal parameters. In this step, each multivariate function fitting mainly includes the following steps:

步骤(5.1)设定初始参数值。Step (5.1) sets initial parameter values.

步骤(5.2)利用最小二乘法原理建立误差函数,不断调整模型的参数值。Step (5.2) uses the principle of the least square method to establish an error function, and continuously adjusts the parameter values of the model.

步骤(5.3)经过多次迭代及调整后,当误差函数值下降到某一个给定范围内,停止迭代,此时记录模型参数作为该多元函数的最优化参数。After multiple iterations and adjustments in step (5.3), when the value of the error function drops to a certain given range, the iteration is stopped, and the model parameters are recorded as the optimization parameters of the multivariate function.

采用上述步骤最终确定出7个拟合函数各自的最优待定参数。The optimal undetermined parameters of each of the seven fitting functions are finally determined by the above steps.

具体地,首先设定各个函数中待定参数βi的初始值,将训练集合中的每一条检测器数据的流量Q、占有率O作为自变量带入到这7个拟合函数中,得到估计速度V’,利用最小二乘原理建立误差函数作为参数是否优化的标准,向使误差函数值下降最快的方向不断调整模型的参数值,最终找到使每个训练集中估计速度V’与真实速度V平均误差最小的最优待定参数βi值。Specifically, first set the initial value of the undetermined parameter βi in each function, and bring the flow Q and occupancy O of each detector data in the training set into the seven fitting functions as independent variables to obtain the estimated speed V', using the least squares principle to establish an error function as a criterion for parameter optimization, continuously adjust the parameter values of the model in the direction that makes the error function value drop the fastest, and finally find the estimated speed V' and the real speed V in each training set The optimal undetermined parameter βi value with the smallest average error.

(6)用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式。本步骤中以均方根误差作为多元函数选择依据,均方根误差是所选数据均方误差的算术平方根,又称为标准误差。其基本公式如下,其中observedt表示实际检测值,predictedt表示计算出的估计值,N为样本总数。(6) Using the minimum root mean square error as the selection basis, calculate the fitting error of each multivariate function, and select the optimal multivariate function form. In this step, the root mean square error is used as the basis for selecting the multivariate function, and the root mean square error is the arithmetic square root of the mean square error of the selected data, also known as the standard error. The basic formula is as follows, where observed t represents the actual detected value, predicted t represents the calculated estimated value, and N is the total number of samples.

取出验证集合,将验证集中每一条检测器采集的流量Q、占有率O作为自变量输入,带入每一个拟合函数中,利用其最优待定参数计算出估计速度V’。计算估计速度V’和验证集中真实速度V的均方根误差并记录。Take out the verification set, input the flow Q and occupancy rate O collected by each detector in the verification set as independent variables, bring them into each fitting function, and calculate the estimated velocity V' by using its optimal undetermined parameters. Calculate the root mean square error between the estimated velocity V' and the true velocity V in the validation set and record it.

对比7个拟合函数计算出的估计速度V’与真实速度V的均方根误差,选出其中最小的误差对应的拟合函数作为该数据的缺失补全函数,该函数对应的最优待定参数βi为最优化参数。Compare the root mean square error between the estimated speed V' calculated by the seven fitting functions and the real speed V, and select the fitting function corresponding to the smallest error as the missing completion function of the data. The optimal function corresponding to this function is to be determined. Parameter βi is the optimization parameter.

(7)应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。本例中,用缺失速度数据对应的检测器采集到的流量Q、占有率O作为自变量输入该函数,求出估计速度值,从而填补删去的缺失速度值。由图3可知,所选数据用第四个多元函数形式计算出的均方根误差,最小误差值为8.21%,以该函数作为该数据的缺失补全函数,最终填补精度在90%以上。(7) Using the optimal function and its corresponding optimization parameters, the independent variable corresponding to the missing data of the detector is substituted into the calculation, and the estimated data is calculated to fill the missing data value. In this example, the flow Q and occupancy rate O collected by the detector corresponding to the missing speed data are used as independent variables to input the function, and the estimated speed value is obtained, so as to fill in the deleted missing speed value. It can be seen from Figure 3 that the root mean square error calculated by the fourth multivariate function form of the selected data has a minimum error value of 8.21%. Using this function as the missing completion function of the data, the final filling accuracy is above 90%.

本发明另一实施例公开的道路检测器缺失数据补全装置,包括:数据提取模块,用于对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;缺失定位模块,用于明确缺失的数据类型,并找出每一条缺失的数据所在位置;预处理模块,用于从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;Another embodiment of the present invention discloses a missing data complementing device for a road detector, comprising: a data extraction module for analyzing the data collected by the road detector, extracting the average speed of the motor vehicle passing through the detector section, the passage of each detection interval The motor vehicle flow and road occupancy rate; the missing location module is used to clarify the type of missing data, and find out the location of each missing data; the preprocessing module is used to delete the missing data from the original data, and The deleted data is divided to form a training set and a verification set;

多元函数构建模块,用于根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 函数拟合模块,用于基于训练集调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;最优函数选取模块,用于基于验证集用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;以及,缺失填补模块,用于应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。The multivariate function building block is used to add adjustment parameters to establish a number of multivariate function forms with speed V as the dependent variable and flow Q and occupancy O as independent variables according to the positive correlation between the speed and the ratio of flow and occupancy. ; The established multivariate function forms include at least: The function fitting module is used to adjust the undetermined parameters of each multivariate function based on the training set, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value; the optimal function selection module is used to use the mean square value based on the verification set The minimum root error is used as the basis for selection, and the fitting error of each multivariate function is calculated to select the optimal multivariate function form; and, the missing filling module is used to apply the optimal function and its corresponding optimization parameters to correspond to The independent variable is substituted into the calculation, and the estimated data is calculated to fill in the missing data values.

上述道路检测器缺失数据补全装置实施例可以用于执行上述道路检测器缺失数据补全方法实施例,其技术原理、所解决的技术问题及产生的技术效果相似,上述描述的道路检测器缺失数据补全的具体工作过程及有关说明,可以参考前述道路检测器缺失数据补全方法实施例中的对应过程,在此不再赘述。The above embodiment of the missing data complementing device for road detectors can be used to implement the above embodiment of the missing data complementing method for road detectors. The technical principles, technical problems solved and technical effects produced are similar. For the specific working process and relevant descriptions of the data completion, reference may be made to the corresponding process in the above-mentioned embodiment of the missing data completion method of the road detector, and details are not repeated here.

本领域技术人员可以理解,可以对实施例中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个系统中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。Those skilled in the art can understand that the modules in the embodiment can be adaptively changed and installed in one or more systems different from the embodiment. Modules or units or components in the embodiments may be combined into one module or unit or component, and furthermore may be divided into a plurality of sub-modules or sub-units or sub-assemblies.

以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以作出若干改进,这些改进也应视为本发明的保护范围。The above is only a preferred embodiment of the present invention, it should be pointed out that for those of ordinary skill in the art, without departing from the principle of the present invention, some improvements can also be made, and these improvements should also be regarded as the present invention. scope of protection.

Claims (7)

1.一种道路检测器缺失数据补全方法,其特征在于:该方法包括如下步骤:1. A road detector missing data complement method is characterized in that: the method comprises the steps: (1)对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;(1) Analyze the data collected by the road detector, extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate; (2)明确缺失的数据类型,并找出每一条缺失的数据所在位置;(2) Specify the type of missing data, and find out the location of each missing data; (3)从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;(3) Delete the missing data from the original data, and divide the deleted data to form a training set and a verification set; (4)根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1234为调整参数;(4) According to the positive correlation between speed and the ratio of flow and occupancy, add adjustment parameters to establish some multivariate function forms with speed V as dependent variable and flow Q and occupancy O as independent variables for selection and fitting; the established Multivariate functional forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters; (5)在训练集中,调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;(5) In the training set, adjust the undetermined parameters of each multivariate function, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value; (6)在验证集中,用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;(6) In the verification set, use the minimum root mean square error as the basis for selection, calculate the fitting error of each multivariate function, and select the optimal multivariate function form; (7)应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。(7) Using the optimal function and its corresponding optimization parameters, the independent variable corresponding to the missing data of the detector is substituted into the calculation, and the estimated data is calculated to fill the missing data value. 2.根据权利要求1所述的一种道路检测器缺失数据补全方法,其特征在于:所述步骤(4)中,所建立的多元函数形式还包括如下至少1种: 2. A kind of road detector missing data completion method according to claim 1, is characterized in that: in described step (4), the multivariate function form that establishes also comprises following at least one kind: 3.根据权利要求1所述的一种道路检测器缺失数据补全方法,其特征在于:所述步骤(5)中,确定每个多元函数的最优化参数,包括:(5.1)设定多元函数模型的初始参数值;(5.2)利用最小二乘法原理建立误差函数,不断调整模型的参数值;(5.3)经过多次迭代及调整后,当误差函数值下降到某一个给定范围内,停止迭代,记录模型参数作为该多元函数的最优化参数。3. A kind of road detector missing data complement method according to claim 1, is characterized in that: in described step (5), determine the optimization parameter of each multivariate function, comprise: (5.1) set multivariate The initial parameter value of the function model; (5.2) Use the least square method to establish the error function, and continuously adjust the parameter value of the model; (5.3) After multiple iterations and adjustments, when the error function value drops to a given range, Stop iterating and record the model parameters as the optimization parameters for this multivariate function. 4.根据权利要求3所述的一种道路检测器缺失数据补全方法,其特征在于:所述步骤(5.2)中,调整模型参数的方法为:在训练集中,假定速度为缺失数据,将流量Q、占有率O作为自变量输入,带入待拟合多元函数中,计算出估计速度V’,以误差函数作为判断依据,向使误差函数值下降最快的方向不断调整模型的参数值,使得估计速度V’与真实速度V之间的误差逐步减小。4. a kind of road detector missing data completion method according to claim 3, is characterized in that: in described step (5.2), the method for adjusting model parameter is: in training set, suppose speed is missing data, will The flow Q and occupancy O are input as independent variables and brought into the multivariate function to be fitted to calculate the estimated velocity V', and the error function is used as the basis for judgment, and the parameter values of the model are continuously adjusted in the direction that makes the error function value decrease the fastest , so that the error between the estimated velocity V' and the real velocity V gradually decreases. 5.根据权利要求1所述的一种道路检测器缺失数据补全方法,其特征在于:所述步骤(6)中,选取最优多元函数的方法为:应用各多元函数的最优化参数,将验证集数据中的自变量代入多元函数,计算所缺失数据类型的估计值,进而计算验证集中每种多元函数拟合出估计值与实测值的均方根误差,对比各个多元函数,选出最小的均方根误差对应的函数形式。5. a kind of road detector missing data supplementary method according to claim 1, is characterized in that: in described step (6), the method for selecting optimal multivariate function is: apply the optimization parameter of each multivariate function, Substitute the independent variables in the verification set data into the multivariate function, calculate the estimated value of the missing data type, and then calculate the root mean square error between the estimated value and the measured value of each multivariate function in the verification set, compare each multivariate function, and select The functional form corresponding to the smallest root mean square error. 6.一种道路检测器缺失数据补全装置,其特征在于,包括:6. A road detector missing data complement device, characterized in that, comprising: 数据提取模块,用于对道路检测器采集数据进行分析,提取经过检测器断面的机动车平均速度、每次检测间隔通过的机动车流量和道路占有率;The data extraction module is used to analyze the data collected by the road detector, and extract the average speed of motor vehicles passing through the detector section, the flow of motor vehicles passing through each detection interval, and the road occupancy rate; 缺失定位模块,用于明确缺失的数据类型,并找出每一条缺失的数据所在位置;The missing location module is used to clarify the type of missing data and find out the location of each missing data; 预处理模块,用于从原始数据中删去缺失的数据,并对删除后的数据进行分割,形成训练集与验证集;The preprocessing module is used to delete missing data from the original data, and divide the deleted data to form a training set and a verification set; 多元函数构建模块,用于根据速度与流量和占有率的比值成正相关关系,添加调整参数建立若干供选择拟合的以速度V为因变量,流量Q和占有率O为自变量的多元函数形式;所建立的多元函数形式至少包括: 其中,β1234为调整参数;The multivariate function building block is used to add adjustment parameters to establish a number of multivariate function forms with speed V as the dependent variable and flow Q and occupancy O as independent variables according to the positive correlation between the speed and the ratio of flow and occupancy. ; The established multivariate function forms include at least: Among them, β 1 , β 2 , β 3 , β 4 are adjustment parameters; 函数拟合模块,用于基于训练集调整每个多元函数待定参数,确定最优化参数使实际值与拟合值的均方根误差最小;The function fitting module is used to adjust the undetermined parameters of each multivariate function based on the training set, and determine the optimal parameters to minimize the root mean square error between the actual value and the fitted value; 最优函数选取模块,用于基于验证集用均方根误差最小作为选择依据,计算每个多元函数的拟合误差,选出最优多元函数形式;The optimal function selection module is used to calculate the fitting error of each multivariate function based on the verification set with the minimum root mean square error as the selection basis, and select the optimal multivariate function form; 以及,缺失填补模块,用于应用最优函数及其相应最优化参数,将检测器缺失数据对应的自变量代入进行计算,计算得出估计数据来填补缺失的数据值。And, the missing filling module is used to apply the optimal function and its corresponding optimization parameters, substitute the independent variable corresponding to the missing data of the detector for calculation, and calculate estimated data to fill the missing data value. 7.根据权利要求6所述的一种道路检测器缺失数据补全装置,其特征在于,所述多元函数构建模块中所建立的多元函数形式还包括如下至少1种:7. A kind of road detector missing data completion device according to claim 6, is characterized in that, the multivariate function form established in the described multivariate function construction module also includes following at least one:
CN201810293819.XA 2018-04-04 2018-04-04 A kind of Road Detection device missing data complementing method and device Pending CN108492560A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810293819.XA CN108492560A (en) 2018-04-04 2018-04-04 A kind of Road Detection device missing data complementing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810293819.XA CN108492560A (en) 2018-04-04 2018-04-04 A kind of Road Detection device missing data complementing method and device

Publications (1)

Publication Number Publication Date
CN108492560A true CN108492560A (en) 2018-09-04

Family

ID=63317926

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810293819.XA Pending CN108492560A (en) 2018-04-04 2018-04-04 A kind of Road Detection device missing data complementing method and device

Country Status (1)

Country Link
CN (1) CN108492560A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109584553A (en) * 2018-11-29 2019-04-05 中电海康集团有限公司 A kind of section degree of association missing complementing method based on space time information
CN109710659A (en) * 2018-12-16 2019-05-03 苏州城方信息技术有限公司 The complementing method of detector missing data based on temporal correlation
CN110490419A (en) * 2019-07-19 2019-11-22 珠海市岭南大数据研究院 Processing method, device, computer equipment and the storage medium of Bus information data
CN113377508A (en) * 2021-05-28 2021-09-10 张燕 Mass data rapid transmission method
CN113782128A (en) * 2021-08-09 2021-12-10 中国中医科学院中医临床基础医学研究所 A missing data fitting method, device and computer equipment
CN118885480A (en) * 2024-07-15 2024-11-01 江苏奕隆机电科技有限公司 A method for generating a dynamic index table of current-flow-pressure difference for a hydraulic brake-by-wire solenoid valve

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6236932B1 (en) * 1996-12-16 2001-05-22 Mannesmann Ag Process for completing and/or verifying data concerning the state of a road network; traffic information centre
CN103971520A (en) * 2014-04-17 2014-08-06 浙江大学 Traffic flow data recovery method based on space-time correlation
CN106781457A (en) * 2016-11-29 2017-05-31 东南大学 A kind of freeway traffic flow parameter correction method based on multi-source fusion data
CN107193876A (en) * 2017-04-21 2017-09-22 美林数据技术股份有限公司 A kind of missing data complementing method based on arest neighbors KNN algorithms

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6236932B1 (en) * 1996-12-16 2001-05-22 Mannesmann Ag Process for completing and/or verifying data concerning the state of a road network; traffic information centre
CN103971520A (en) * 2014-04-17 2014-08-06 浙江大学 Traffic flow data recovery method based on space-time correlation
CN106781457A (en) * 2016-11-29 2017-05-31 东南大学 A kind of freeway traffic flow parameter correction method based on multi-source fusion data
CN107193876A (en) * 2017-04-21 2017-09-22 美林数据技术股份有限公司 A kind of missing data complementing method based on arest neighbors KNN algorithms

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
金盛: "环形线圈检测器交通数据预处理方法研究", 《中国优秀硕士学位论文全文数据库·工程科技Ⅱ辑》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109584553A (en) * 2018-11-29 2019-04-05 中电海康集团有限公司 A kind of section degree of association missing complementing method based on space time information
CN109710659A (en) * 2018-12-16 2019-05-03 苏州城方信息技术有限公司 The complementing method of detector missing data based on temporal correlation
CN109710659B (en) * 2018-12-16 2022-11-25 苏州城方信息技术有限公司 Method for filling detector missing data based on space-time correlation
CN110490419A (en) * 2019-07-19 2019-11-22 珠海市岭南大数据研究院 Processing method, device, computer equipment and the storage medium of Bus information data
CN113377508A (en) * 2021-05-28 2021-09-10 张燕 Mass data rapid transmission method
CN113377508B (en) * 2021-05-28 2023-08-22 张燕 Mass data rapid transmission method
CN113782128A (en) * 2021-08-09 2021-12-10 中国中医科学院中医临床基础医学研究所 A missing data fitting method, device and computer equipment
CN118885480A (en) * 2024-07-15 2024-11-01 江苏奕隆机电科技有限公司 A method for generating a dynamic index table of current-flow-pressure difference for a hydraulic brake-by-wire solenoid valve

Similar Documents

Publication Publication Date Title
CN108492560A (en) A kind of Road Detection device missing data complementing method and device
Park et al. Application of microscopic simulation model calibration and validation procedure: Case study of coordinated actuated signal system
CN103150900B (en) Traffic jam event automatic detecting method based on videos
CN106997669B (en) A method for judging the causes of traffic congestion based on feature importance
CN104408925B (en) Crossing evaluation of running status method based on display radar
CN102800197B (en) Preprocessing method of road section dynamic traffic stream essential data of urban road
CN106781486B (en) Traffic state evaluation method based on floating car data
CN101783074A (en) A method and system for real-time identification of urban road traffic flow status
CN107038167A (en) Big data excavating analysis system and its analysis method based on model evaluation
CN114783183A (en) Monitoring method and system based on traffic situation algorithm
CN105931458B (en) A kind of method of road traffic flow detection device reliability assessment
CN104574968B (en) Determining method for threshold traffic state parameter
CN108922168B (en) A Meso-Level Accident-prone Road Discrimination Method
CN106649709B (en) Vehicle stopping behavior pattern prediction and evaluation method based on data mining
CN105679025B (en) A kind of arterial street travel time estimation method based on Changeable weight mixed distribution
CN104700646A (en) Online GPS data based abnormal taxi track real-time detection method
Ban et al. Bottleneck identification and calibration for corridor management planning
CN107330217A (en) A kind of middle sight oil consumption Forecasting Methodology based on RBFNN
CN110929359A (en) Pipe network siltation risk prediction modeling method based on PNN neural network and SWMM technology
CN112950940B (en) Traffic diversion method in road construction period
CN111062648A (en) Method for evaluating comprehensive performance of asphalt pavement
CN105551250A (en) Method for discriminating urban road intersection operation state on the basis of interval clustering
CN104750919B (en) A kind of road passage capability influence factor recognition methods
CN107230350A (en) A kind of urban transportation amount acquisition methods based on bayonet socket Yu mobile phone flow call bill data
CN116402390A (en) Road construction area traffic influence evaluation method and system based on primitive model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180904