WO2011060725A1 - Abnormal vehicle speed data identifying method and device thereof - Google Patents

Abnormal vehicle speed data identifying method and device thereof Download PDF

Info

Publication number
WO2011060725A1
WO2011060725A1 PCT/CN2010/078875 CN2010078875W WO2011060725A1 WO 2011060725 A1 WO2011060725 A1 WO 2011060725A1 CN 2010078875 W CN2010078875 W CN 2010078875W WO 2011060725 A1 WO2011060725 A1 WO 2011060725A1
Authority
WO
WIPO (PCT)
Prior art keywords
vehicle speed
speed data
distance
parameter
data set
Prior art date
Application number
PCT/CN2010/078875
Other languages
French (fr)
Chinese (zh)
Inventor
昝艳
付新刚
贾学力
李建军
Original Assignee
北京世纪高通科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京世纪高通科技有限公司 filed Critical 北京世纪高通科技有限公司
Publication of WO2011060725A1 publication Critical patent/WO2011060725A1/en

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/052Detecting movement of traffic to be counted or controlled with provision for determining speed or overspeed

Definitions

  • the present invention relates to the field of intelligent transportation, and in particular to a method and apparatus for identifying abnormal vehicle speed data.
  • the intelligent transportation system is currently the most effective way to solve the transportation problems, especially traffic congestion, traffic congestion, traffic accidents and traffic pollution.
  • dynamic traffic information service is one of the core research directions of intelligent transportation system. It can dynamically reflect the traffic conditions in the area in real time, guide the optimal driving route, improve the efficiency of road and vehicle use, and is important to alleviate traffic congestion. Measures. In the study of dynamic traffic information, it is a hot issue to analyze the road historical speed value and find the parameters that can reflect the cyclical trend of the road, and then extract the dynamic traffic characteristics of the road. However, due to various interferences in the process of collecting and processing the speed data, the vehicle speed data will be polluted by different degrees of noise.
  • Denoising that is, identifying and deleting abnormal vehicle speed values.
  • the main methods used in the prior art for identifying anomalous data are statistical methods, distance-based methods, and density-based clustering-based methods.
  • the inventors have found that at least the following problems exist in the prior art:
  • statistical methods to identify abnormal vehicle speed data it is generally required to know knowledge about data set parameters. , such as distribution models, distribution parameters, etc. But in most cases, the distribution may be unknown.
  • the distance-based method to identify abnormal vehicle speed data the distance between data objects needs to be calculated according to a distance function, which is a data object with a higher distance than all other objects. But it requires the user to provide the minimum acceptable distance directly. And this is hard to determine.
  • the density clustering based method can identify abnormal data from the data of unknown distribution form, but the density clustering based method has not been used in the prior art to identify abnormal vehicle speed data.
  • Embodiments of the present invention provide a method and apparatus for identifying abnormal vehicle speed data, which employs a density clustering based method to identify abnormal vehicle speed data from a target vehicle speed data set.
  • a method for identifying abnormal vehicle speed data comprising:
  • a radius parameter of the target vehicle speed data set is selected from all of the above-mentioned distances; and a vehicle speed data object corresponding to the distance of the radius parameter is identified as abnormal vehicle speed data.
  • An apparatus for identifying abnormal vehicle speed data comprising: a calculating unit, configured to calculate a distance of each vehicle speed data object in the target vehicle speed data set according to a preset parameter;
  • a selection unit configured to select a radius parameter of the target vehicle speed data set from all of the above-distances
  • a first identification unit configured to identify a vehicle speed data object corresponding to the distance greater than the radius parameter as abnormal vehicle speed data.
  • the preset parameter is a density threshold in a density clustering based method
  • the fc of each vehicle speed data object in the target vehicle speed data set may be calculated according to the preset parameter. -distance.
  • the distance of the vehicle speed data object is the maximum distance of the vehicle speed data object to its nearest vehicle speed data object.
  • the radius parameters of the target vehicle speed data set are selected from all of the above-distances.
  • the radius parameter of the target vehicle speed data set is the radius of the density clustering in the density clustering based method.
  • the vehicle speed data object corresponding to the distance corresponding to the radius parameter is identified as abnormal vehicle speed data.
  • a method based on density clustering is used to identify abnormal vehicle speed data.
  • Embodiment 1 is a flow chart showing a method for identifying abnormal vehicle speed data in Embodiment 1;
  • FIG. 2 is a structural block diagram of an apparatus for identifying abnormal vehicle speed data of Embodiment 1;
  • Embodiment 3 is a flow chart showing a method for identifying abnormal vehicle speed data in Embodiment 2;
  • Fig. 4 is a block diagram showing the configuration of an apparatus for identifying abnormal vehicle speed data of the second embodiment.
  • An embodiment of the present invention provides a method for identifying abnormal vehicle speed data. As shown in FIG. 1, the method includes the following steps:
  • the density clustering based method identifies the anomaly data from the cluster of arbitrary shapes by setting the density threshold and the domain radius, when the density clustering based method is used to identify the abnormal vehicle speed data, the density threshold and the domain radius are also set. value.
  • the preset parameter is a density threshold in a method based on density clustering, and the parameter can be set by the experience of a technician.
  • the distance of the vehicle speed data object may be defined as: a vehicle speed data object P and a vehicle speed data object.
  • the distance between e D is di S t P, o, and satisfies:
  • 1 at least one vehicle speed data object object e , such that dist(p, q) ⁇ dist(p, o);
  • a vehicle speed data object object such that dist(p, q) ⁇ dist(p, o) 102.
  • a radius parameter of the target vehicle speed data set is selected from all the above-distances.
  • Each vehicle speed data object in the target data set has a corresponding-distance, and one of all the above-distances is selected as a radius parameter of the target vehicle speed data set.
  • the radius parameter is the value of the domain radius in the density clustering based method.
  • the vehicle speed data object corresponding to the distance-distance parameter is identified as the abnormal vehicle speed data.
  • the method based on density clustering is applied to identify abnormal vehicle speed data.
  • an embodiment of the present invention further provides an identification device for abnormal vehicle speed data.
  • the device includes: a calculation unit 21, a selection unit 22, and a first identification unit 23.
  • the calculating unit 21 is configured to calculate an fc-distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter.
  • the selecting unit 22 is configured to select the radius parameter of the target vehicle speed data set from all of the above-mentioned distances.
  • the radius parameter is the value of the radius of the field in the method based on density clustering.
  • the first identification unit 23 is configured to identify the vehicle speed data object corresponding to the distance of the radius parameter as the abnormal vehicle speed data.
  • the apparatus determines a density threshold and a domain radius in the density clustering based method by the calculating unit 21 and the selecting unit 22, and the first identifying unit 23 compares each vehicle speed data in the target data set according to the density clustering based method.
  • the object-to-distance and the size of the radius parameter of the target vehicle speed data set identify abnormal vehicle speed data from the target data set. Thereby, a method based on density clustering is used to identify abnormal vehicle speed data.
  • a method for identifying abnormal vehicle speed data is described in detail by taking a historical traffic flow on a designated road link as an example. As shown in FIG. 3, the method includes the following steps:
  • the target data set may be a vehicle speed value at a specific time point of a week characteristic day on a specified road link. For example, when the target data set is taken from the vehicle speed data object of one year, there may be a loss of the vehicle speed value, and the target data set has a maximum of 52 vehicles. Speed value.
  • the week feature day refers to a set of certain flood seasons in which the traffic flow data changes in the week have certain similar characteristics. For example, the date of the week can be divided into two types of weekdays, weekdays and Sundays. It can also be divided into seven characteristic days from Monday to Sunday according to the weekly similarity of traffic flow.
  • the preset parameter is a density threshold in a method based on density clustering, and the parameter can be set by the experience of a technician.
  • the fc - distance is sorted in ascending order. Then, according to the ordinate as fc - distance, the abscissa is sorted by the number of points, such as: 1, 2, 3, etc., respectively used to represent the first vehicle speed data, the second vehicle speed data, the third vehicle speed data, etc. Draw the sorted values.
  • the distance-correspondence corresponding to the maximum distance change as the radius parameter of the target vehicle speed data set.
  • a derivative method can be used to determine the magnitude of the -distance increment change.
  • the fc - distance is used as a dependent variable, and the number of the points is used as an argument.
  • the magnitude of the change in the increment of the distance is calculated relative to the increment of the value of the point, and the corresponding distance-distance is taken as the radius parameter of the target vehicle speed data set.
  • the radius parameter is the value of the radius of the field in the method based on density clustering.
  • the vehicle speed data object corresponding to the distance of the radius parameter is identified as the abnormal vehicle speed data.
  • the abnormal vehicle speed data can be deleted.
  • the method based on density clustering not only realizes the identification of abnormal vehicle speed data from the target dataset, but also identifies the boundary vehicle speed data and the core vehicle speed data, which provides a basis for studying the vehicle speed change in the historical traffic flow.
  • the embodiment of the present invention further provides an apparatus for identifying abnormal vehicle speed data.
  • the method includes: a loading unit 41, a calculating unit 42, a selecting unit 43, a first identifying unit 44, and a second label.
  • the unit 45 and the third identification unit 46 are identified.
  • the loading unit 41 is configured to load the target data set into the cache.
  • the target data set may be a vehicle speed value at a specific time point of a week characteristic day on a specified road link.
  • the calculating unit 42 is configured to calculate a distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter.
  • the selecting unit 43 is configured to select a radius parameter of the target vehicle speed data set from all of the above-mentioned distances.
  • the selection unit 43 includes a sorting module 43A and a selection module 43B.
  • the sorting module 43A is used to sort all of the above-distances in ascending order.
  • the selection module 43B is configured to select a distance corresponding to the maximum distance change maximum as the radius parameter of the target vehicle speed data set.
  • the radius parameter is the value of the radius of the field in the method based on density clustering.
  • the density threshold and the domain radius in the density cluster based method are determined by the calculation unit 42 and the selection unit 43.
  • the first identification unit 44 is configured to be larger than the radius by comparing the distance of each vehicle speed data object in the target data set with the radius parameter of the target vehicle speed data set.
  • the vehicle speed data object corresponding to the k-distance of the parameter is identified as abnormal vehicle speed data.
  • the second identification unit 45 is configured to identify a vehicle speed data object corresponding to the distance of the radius parameter as the boundary vehicle speed data.
  • the third identification unit 46 is configured to identify a vehicle speed data object corresponding to a distance smaller than the radius parameter as core vehicle speed data. Therefore, based on the density clustering method, the abnormal vehicle speed data is identified from the target data set.
  • the embodiments of the present invention are mainly applied to the field of intelligent transportation, and the method based on density clustering is used to identify abnormal vehicle speed data.
  • the present invention can be implemented by means of software plus necessary general hardware, and of course, by hardware, but in many cases, the former is a better implementation. .
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a readable storage medium, such as a floppy disk of a computer. , hard disk or CD, etc., including a number of instructions to make a computer device (can be an individual)
  • a computer, server, or network device, etc. performs the methods described in various embodiments of the present invention.

Abstract

An abnormal vehicle speed data identifying method and device thereof are provided. The method includes: computing the distance of each vehicle speed data object in a target vehicle speed data set according to a preset parameter k; a radius parameter of the target set is selected from the distance; the vehicle object corresponding to the distance larger than the radius parameter is marked as an abnormal vehicle speed data.

Description

异常车速数据的识别方法和装置 本申请要求于 2 009 年 1 1 月 1 9 日提交中国专利局、 申请号为 2 009 1 02 3794 0. K 发明名称为 "异常车速数据的识别方法和装置" 的中国专利申请的优先权, 其全部内容通过引用结合在本申请中。 技术领域  Method and device for identifying abnormal vehicle speed data This application claims to be submitted to the Chinese Patent Office on January 19, 2010, and the application number is 2 009 1 02 3794 0. K. The invention is entitled "Identification method and device for abnormal vehicle speed data". Priority of the Chinese Patent Application, the entire contents of which is incorporated herein by reference. Technical field
本发明涉及智能交通领域, 尤其涉及一种异常车速数据的识别 方法和装置。  The present invention relates to the field of intelligent transportation, and in particular to a method and apparatus for identifying abnormal vehicle speed data.
背景技术 Background technique
智能交通系统是目前能全面有效地解决交通运输领域问题, 特 别是交通拥挤、 交通阻塞、 交通事故和交通污染等的最佳途径。 其 中, 动态交通信息服务是智能交通系统的核心研究方向之一, 它可 以动态实时反映区域内的交通路况, 指引最佳的行驶路线, 提高道 路和车辆的使用效率, 是緩解交通拥堵状况的重要措施。 在动态交 通信息的研究中, 分析道路历史车速值, 找到能够反映道路周期性 趋势的参数, 进而提取出道路的动态交通特征是个热点问题。 但是 由于在车速数据的采集和处理输出过程中存在各种干扰, 会使车速 数据受到不同程度的噪声污染, 使用被污染的数据建模和统计分析, 可能会导致错误结果, 故而需要对车速数据进行除噪, 即识别并删 除异常车速值。 现有技术中用于识别异常数据的主要方法有统计方 法、 基于距离的方法和基于密度聚类的方法。  The intelligent transportation system is currently the most effective way to solve the transportation problems, especially traffic congestion, traffic congestion, traffic accidents and traffic pollution. Among them, dynamic traffic information service is one of the core research directions of intelligent transportation system. It can dynamically reflect the traffic conditions in the area in real time, guide the optimal driving route, improve the efficiency of road and vehicle use, and is important to alleviate traffic congestion. Measures. In the study of dynamic traffic information, it is a hot issue to analyze the road historical speed value and find the parameters that can reflect the cyclical trend of the road, and then extract the dynamic traffic characteristics of the road. However, due to various interferences in the process of collecting and processing the speed data, the vehicle speed data will be polluted by different degrees of noise. Using contaminated data modeling and statistical analysis may lead to erroneous results, so the vehicle speed data is required. Denoising, that is, identifying and deleting abnormal vehicle speed values. The main methods used in the prior art for identifying anomalous data are statistical methods, distance-based methods, and density-based clustering-based methods.
然而, 在将统计方法和基于距离的方法用于识别异常车速数据 的过程中, 发明人发现现有技术中至少存在如下问题: 采用统计方 法识别异常车速数据时通常要求知道关于数据集参数的知识, 如分 布模型, 分布参数等。 但是在大多数情况下, 分布可能是未知的。 采用基于距离的方法识别异常车速数据时需要根据某个距离函数来 计算数据对象之间的距离, 异常数据是那些与所有其他对象相比有 更高距离的数据对象。 但是它需要用户直接提供最小的可接受距离, 而这个是很难确定的。 而基于密度聚类的方法可以从未知分布形式 的数据中识别出异常数据, 但是现有技术中还没有将基于密度聚类 的方法用于识别异常车速数据。 However, in the process of using statistical methods and distance-based methods for identifying abnormal vehicle speed data, the inventors have found that at least the following problems exist in the prior art: When using statistical methods to identify abnormal vehicle speed data, it is generally required to know knowledge about data set parameters. , such as distribution models, distribution parameters, etc. But in most cases, the distribution may be unknown. When using the distance-based method to identify abnormal vehicle speed data, the distance between data objects needs to be calculated according to a distance function, which is a data object with a higher distance than all other objects. But it requires the user to provide the minimum acceptable distance directly. And this is hard to determine. The density clustering based method can identify abnormal data from the data of unknown distribution form, but the density clustering based method has not been used in the prior art to identify abnormal vehicle speed data.
发明内容 Summary of the invention
本发明的实施例提供一种异常车速数据的识别方法和装置,采用基于 密度聚类的方法实现了从目标车速数据集中识别异常车速数据。  Embodiments of the present invention provide a method and apparatus for identifying abnormal vehicle speed data, which employs a density clustering based method to identify abnormal vehicle speed data from a target vehicle speed data set.
为达到上述目的, 本发明的实施例采用如下技术方案:  In order to achieve the above object, the embodiment of the present invention adopts the following technical solutions:
一种异常车速数据的识别方法, 包括:  A method for identifying abnormal vehicle speed data, comprising:
根据预设参数 计算目标车速数据集中每一个车速数据对象的 -距 离;  Calculating the distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameters;
从所有上述 -距离中选出所述目标车速数据集的半径参数; 将大于所述半径参数的 -距离对应的车速数据对象标识为异常车速 数据。  A radius parameter of the target vehicle speed data set is selected from all of the above-mentioned distances; and a vehicle speed data object corresponding to the distance of the radius parameter is identified as abnormal vehicle speed data.
一种异常车速数据的识别装置, 包括: 计算单元, 用于根据预设参数 计算目标车速数据集中每一个车速 数据对象的 距离;  An apparatus for identifying abnormal vehicle speed data, comprising: a calculating unit, configured to calculate a distance of each vehicle speed data object in the target vehicle speed data set according to a preset parameter;
选择单元, 用于从所有上述 -距离中选出所述目标车速数据集的半 径参数;  a selection unit, configured to select a radius parameter of the target vehicle speed data set from all of the above-distances;
第一标识单元, 用于将大于所述半径参数的 -距离对应的车速数据 对象标识为异常车速数据。  And a first identification unit, configured to identify a vehicle speed data object corresponding to the distance greater than the radius parameter as abnormal vehicle speed data.
由上述方案描述的本发明实施例中, 所述预设的参数 为基于密度聚 类的方法中的密度阈值, 根据所述预设的参数 可以计算出目标车速数据 集中每一个车速数据对象的 fc -距离。 所述车速数据对象的 -距离为所述 车速数据对象到其 个最邻近的车速数据对象的最大距离。 计算出所述目 标数据集中所有车速数据对象的 -距离之后, 从所有上述 -距离中选出 所述目标车速数据集的半径参数。所述目标车速数据集的半径参数即为基 于密度聚类的方法中的密度聚类的半径。通过依次比较所述目标数据集中 的每个车速数据对象的 -距离与所述目标车速数据集的半径参数的大 小, 将大于所述半径参数的 -距离对应的车速数据对象标识为异常车速 数据。 从而实现了将基于密度聚类的方法用于识别异常车速数据。 In the embodiment of the present invention described by the foregoing solution, the preset parameter is a density threshold in a density clustering based method, and the fc of each vehicle speed data object in the target vehicle speed data set may be calculated according to the preset parameter. -distance. The distance of the vehicle speed data object is the maximum distance of the vehicle speed data object to its nearest vehicle speed data object. After calculating the distance of all the vehicle speed data objects in the target data set, the radius parameters of the target vehicle speed data set are selected from all of the above-distances. The radius parameter of the target vehicle speed data set is the radius of the density clustering in the density clustering based method. By sequentially comparing the distance of each vehicle speed data object in the target data set with the radius parameter of the target vehicle speed data set Small, the vehicle speed data object corresponding to the distance corresponding to the radius parameter is identified as abnormal vehicle speed data. Thereby, a method based on density clustering is used to identify abnormal vehicle speed data.
附图说明 DRAWINGS
图 1为实施例 1异常车速数据的识别方法的流程图;  1 is a flow chart showing a method for identifying abnormal vehicle speed data in Embodiment 1;
图 2为实施例 1异常车速数据的识别装置的结构框图;  2 is a structural block diagram of an apparatus for identifying abnormal vehicle speed data of Embodiment 1;
图 3为实施例 2异常车速数据的识别方法的流程图;  3 is a flow chart showing a method for identifying abnormal vehicle speed data in Embodiment 2;
图 4为实施例 2异常车速数据的识别装置的结构框图。  Fig. 4 is a block diagram showing the configuration of an apparatus for identifying abnormal vehicle speed data of the second embodiment.
具体实施方式 detailed description
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进 行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例, 而不是全部的实施例。 基于本发明中的实施例, 本领域普通技术人员在没 有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的 范围。  The technical solutions in the embodiments of the present invention are clearly and completely described in conjunction with the drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
实施例 1 :  Example 1
本发明实施例提供一种异常车速数据的识别方法, 如图 1所示, 该方 法包括以下步骤:  An embodiment of the present invention provides a method for identifying abnormal vehicle speed data. As shown in FIG. 1, the method includes the following steps:
1 01、 根据预设参数 计算目标车速数据集中每一个车速数据对象 的 -距离。  1 01. Calculate the distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameters.
由于基于密度聚类的方法通过设置密度阈值与领域半径从任意形状 的簇中识别异常数据,因此在将基于密度聚类的方法用于识别异常车速数 据时, 也要设置密度阈值与领域半径的值。 所述预设参数 即为基于密度 聚类的方法中的密度阈值, 可以通过技术人员的经验设定参数  Since the density clustering based method identifies the anomaly data from the cluster of arbitrary shapes by setting the density threshold and the domain radius, when the density clustering based method is used to identify the abnormal vehicle speed data, the density threshold and the domain radius are also set. value. The preset parameter is a density threshold in a method based on density clustering, and the parameter can be set by the experience of a technician.
所述车速数据对象的 -距离可以定义为:车速数据对象 P与车速数据 对象。 e D之间的距离 diSt P, o、,且满足: The distance of the vehicle speed data object may be defined as: a vehicle speed data object P and a vehicle speed data object. The distance between e D is di S t P, o, and satisfies:
①至少有 个车速数据对象对象 e , 使得 dist(p, q) < dist(p, o); 1 at least one vehicle speed data object object e , such that dist(p, q) < dist(p, o);
② 并且至 多 有 个车速数据对 象对 象 , 使得 dist(p, q) < dist(p, o) 102、 从所有上述 -距离中选出所述目标车速数据集的半径参数。 所 述目标数据集中每一个车速数据对象都有对应的 -距离, 从所有上述 - 距离中选择一个 -距离作为所述目标车速数据集的半径参数。 所述半径 参数即为基于密度聚类的方法中的领域半径的值。 2 and at most there is a vehicle speed data object object, such that dist(p, q) < dist(p, o) 102. Select, from all the above-distances, a radius parameter of the target vehicle speed data set. Each vehicle speed data object in the target data set has a corresponding-distance, and one of all the above-distances is selected as a radius parameter of the target vehicle speed data set. The radius parameter is the value of the domain radius in the density clustering based method.
103、 所述密度阈值与领域半径的值确定之后, 根据基于密度聚类的 方法, 将大于所述半径参数的 -距离对应的车速数据对象标识为异常车 速数据。 从而实现了将基于密度聚类的方法应用于识别异常车速数据。  103. After determining the values of the density threshold and the domain radius, according to the density clustering based method, the vehicle speed data object corresponding to the distance-distance parameter is identified as the abnormal vehicle speed data. Thereby, the method based on density clustering is applied to identify abnormal vehicle speed data.
为了实现上述方法,本发明实施例还提供一种异常车速数据的识别装 置, 如图 2所示, 该装置包括: 计算单元 21、 选择单元 22和第一标识单 元 23。  In order to implement the above method, an embodiment of the present invention further provides an identification device for abnormal vehicle speed data. As shown in FIG. 2, the device includes: a calculation unit 21, a selection unit 22, and a first identification unit 23.
其中, 所述计算单元 21 用于根据预设参数 计算目标车速数据集 中每一个车速数据对象的 fc -距离。 所述计算单元 21计算出每一个车速数 据对象的 -距离之后, 选择单元 22用于从所有上述 -距离中选出所述目 标车速数据集的半径参数。所述半径参数即为基于密度聚类的方法中的领 域半径的值。 基于密度聚类的方法中的密度阈值和领域半径都确定之后, 第一标识单元 23用于将大于所述半径参数的 -距离对应的车速数据对象 标识为异常车速数据。该装置通过计算单元 21和选择单元 22来确定基于 密度聚类的方法中的密度阈值和领域半径, 第一标识单元 23根据基于密 度聚类的方法通过比较所述目标数据集中的每个车速数据对象的 -距离 与所述目标车速数据集的半径参数的大小,从所述目标数据集中识别出异 常车速数据。 从而实现了将基于密度聚类的方法用于识别异常车速数据。  The calculating unit 21 is configured to calculate an fc-distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter. After the calculating unit 21 calculates the distance of each vehicle speed data object, the selecting unit 22 is configured to select the radius parameter of the target vehicle speed data set from all of the above-mentioned distances. The radius parameter is the value of the radius of the field in the method based on density clustering. After the density threshold and the domain radius are determined in the density clustering based method, the first identification unit 23 is configured to identify the vehicle speed data object corresponding to the distance of the radius parameter as the abnormal vehicle speed data. The apparatus determines a density threshold and a domain radius in the density clustering based method by the calculating unit 21 and the selecting unit 22, and the first identifying unit 23 compares each vehicle speed data in the target data set according to the density clustering based method. The object-to-distance and the size of the radius parameter of the target vehicle speed data set identify abnormal vehicle speed data from the target data set. Thereby, a method based on density clustering is used to identify abnormal vehicle speed data.
实施例 2:  Example 2:
本发明实施例以指定路链上历史交通流为例来详细介绍异常车速数 据的识别方法, 如图 3所示, 该方法包括如下步骤:  In the embodiment of the present invention, a method for identifying abnormal vehicle speed data is described in detail by taking a historical traffic flow on a designated road link as an example. As shown in FIG. 3, the method includes the following steps:
301、 加载目标数据集。 所述目标数据集可以为指定路链上某个星期 特征日中特定时间点的车速值。 如: 从一年的车速数据对象中取目标数据 集时, 由于车速值可能有丟失的现象, 所述目标数据集中最多有 52个车 速值。所述星期特征日是指一周中交通流数据变化具有一定相似特征的某 些曰期的集合。 如: 一周的日期可以筒单地分为工作日和周日两类星期特 征日, 也可以按交通流的周相似性细分为周一到周日共 7个特征日等。 301. Load the target data set. The target data set may be a vehicle speed value at a specific time point of a week characteristic day on a specified road link. For example, when the target data set is taken from the vehicle speed data object of one year, there may be a loss of the vehicle speed value, and the target data set has a maximum of 52 vehicles. Speed value. The week feature day refers to a set of certain flood seasons in which the traffic flow data changes in the week have certain similar characteristics. For example, the date of the week can be divided into two types of weekdays, weekdays and Sundays. It can also be divided into seven characteristic days from Monday to Sunday according to the weekly similarity of traffic flow.
302、 根据预设参数 计算目标车速数据集中每一个车速数据对象 的 -距离。 所述预设参数 即为基于密度聚类的方法中的密度阈值, 可以 通过技术人员的经验设定参数  302. Calculate a distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter. The preset parameter is a density threshold in a method based on density clustering, and the parameter can be set by the experience of a technician.
303、 将所有上述 -距离按照递增次序进行排序。 对于上述所有车速 数据对象的 fc -距离, 以递增次序进行排序。 然后按纵坐标为 fc -距离, 横坐 标为点的个数排序, 如: 1、 2、 3等, 分别用以表示第一个车速数据、 第 二个车速数据、 第三个车速数据等, 绘制排序后的值。  303. Sort all of the above-distances in ascending order. For all of the above vehicle speed data objects, the fc - distance is sorted in ascending order. Then, according to the ordinate as fc - distance, the abscissa is sorted by the number of points, such as: 1, 2, 3, etc., respectively used to represent the first vehicle speed data, the second vehicle speed data, the third vehicle speed data, etc. Draw the sorted values.
304、 选择所述 -距离增量变化最大时对应的 -距离作为所述目标车 速数据集的半径参数。 可以采用导数的方法来确定所述 -距离增量变化 的大小。 将所述 fc -距离作为因变量, 所述点的个数作为自变量。 相对于点 的个数值的增量计算所述 -距离的增量的变化大小, 取增量变化最大时 对应的 -距离作为所述目标车速数据集的半径参数。 所述半径参数即为 基于密度聚类的方法中的领域半径的值。  304. Select the distance-correspondence corresponding to the maximum distance change as the radius parameter of the target vehicle speed data set. A derivative method can be used to determine the magnitude of the -distance increment change. The fc - distance is used as a dependent variable, and the number of the points is used as an argument. The magnitude of the change in the increment of the distance is calculated relative to the increment of the value of the point, and the corresponding distance-distance is taken as the radius parameter of the target vehicle speed data set. The radius parameter is the value of the radius of the field in the method based on density clustering.
305、 所述密度阈值与领域半径的值确定之后, 根据基于密度聚类的 方法, 将大于所述半径参数的 -距离对应的车速数据对象标识为异常车 速数据。 将异常车速数据识别出来之后, 可以删除所述异常车速数据。  305. After determining the values of the density threshold and the domain radius, according to the density clustering-based method, the vehicle speed data object corresponding to the distance of the radius parameter is identified as the abnormal vehicle speed data. After the abnormal vehicle speed data is identified, the abnormal vehicle speed data can be deleted.
306、 将等于所述半径参数的 -距离对应的车速数据对象标识为边界 车速数据。  306. Identify a vehicle speed data object corresponding to the distance of the radius parameter as boundary vehicle speed data.
307、 将小于所述半径参数的 k -距离对应的车速数据对象标识为核心 车速数据。 本方法中采用基于密度聚类的方法, 不仅实现了从目标数据集 中识别异常车速数据, 同时也识别出边界车速数据和核心车速数据, 为研 究历史交通流中的车速变化提供依据。  307. Identify a vehicle speed data object corresponding to a k-distance smaller than the radius parameter as core vehicle speed data. The method based on density clustering not only realizes the identification of abnormal vehicle speed data from the target dataset, but also identifies the boundary vehicle speed data and the core vehicle speed data, which provides a basis for studying the vehicle speed change in the historical traffic flow.
本发明实施例还提供一种异常车速数据的识别装置, 如图 4所示, 包 括: 加载单元 41、 计算单元 42、 选择单元 43、 第一标识单元 44、 第二标 识单元 45和第三标识单元 46。 The embodiment of the present invention further provides an apparatus for identifying abnormal vehicle speed data. As shown in FIG. 4, the method includes: a loading unit 41, a calculating unit 42, a selecting unit 43, a first identifying unit 44, and a second label. The unit 45 and the third identification unit 46 are identified.
其中, 所述加载单元 41 用于将目标数据集加载到緩存中。 所述目标 数据集可以为指定路链上某个星期特征日中特定时间点的车速值。 所述加 载单元 41加载目标数据集之后, 所述计算单元 42用于根据预设参数 计算目标车速数据集中每一个车速数据对象的 -距离。  The loading unit 41 is configured to load the target data set into the cache. The target data set may be a vehicle speed value at a specific time point of a week characteristic day on a specified road link. After the loading unit 41 loads the target data set, the calculating unit 42 is configured to calculate a distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter.
然后, 所述选择单元 43用于从所有上述 -距离中选出所述目标车速 数据集的半径参数。所述选择单元 43包括:排序模块 43A和选择模块 43B。 所述排序模块 43A 用于将所有上述 -距离按照递增次序进行排序。 选择 模块 43B用于选择所述 距离增量变化最大时对应的 -距离作为所述目 标车速数据集的半径参数。所述半径参数即为基于密度聚类的方法中的领 域半径的值。通过计算单元 42和选择单元 43确定了基于密度聚类的方法 中的密度阈值和领域半径。  Then, the selecting unit 43 is configured to select a radius parameter of the target vehicle speed data set from all of the above-mentioned distances. The selection unit 43 includes a sorting module 43A and a selection module 43B. The sorting module 43A is used to sort all of the above-distances in ascending order. The selection module 43B is configured to select a distance corresponding to the maximum distance change maximum as the radius parameter of the target vehicle speed data set. The radius parameter is the value of the radius of the field in the method based on density clustering. The density threshold and the domain radius in the density cluster based method are determined by the calculation unit 42 and the selection unit 43.
然后,根据基于密度聚类的方法通过比较所述目标数据集中的每个车 速数据对象的 -距离与所述目标车速数据集的半径参数的大小, 第一标 识单元 44用于将大于所述半径参数的 k -距离对应的车速数据对象标识为 异常车速数据。 第二标识单元 45用于将等于所述半径参数的 -距离对应 的车速数据对象标识为边界车速数据。 第三标识单元 46用于将小于所述 半径参数的 -距离对应的车速数据对象标识为核心车速数据。 从而基于 密度聚类的方法实现了从目标数据集中识别异常车速数据。  Then, according to the density clustering based method, the first identification unit 44 is configured to be larger than the radius by comparing the distance of each vehicle speed data object in the target data set with the radius parameter of the target vehicle speed data set. The vehicle speed data object corresponding to the k-distance of the parameter is identified as abnormal vehicle speed data. The second identification unit 45 is configured to identify a vehicle speed data object corresponding to the distance of the radius parameter as the boundary vehicle speed data. The third identification unit 46 is configured to identify a vehicle speed data object corresponding to a distance smaller than the radius parameter as core vehicle speed data. Therefore, based on the density clustering method, the abnormal vehicle speed data is identified from the target data set.
本发明实施例主要应用于智能交通领域,实现了将基于密度聚类的方 法用于识别异常车速数据。  The embodiments of the present invention are mainly applied to the field of intelligent transportation, and the method based on density clustering is used to identify abnormal vehicle speed data.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到 本发明可借助软件加必需的通用硬件的方式来实现, 当然也可以通过硬 件, 但很多情况下前者是更佳的实施方式。 基于这样的理解, 本发明的技 术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式 体现出来, 该计算机软件产品存储在可读取的存储介质中, 如计算机的软 盘, 硬盘或光盘等, 包括若干指令用以使得一台计算机设备(可以是个人 计算机, 服务器, 或者网络设备等) 执行本发明各个实施例所述的方法。 以上所述,仅为本发明的具体实施方式, 但本发明的保护范围并不局限于 此, 任何熟悉本技术领域的技术人员在本发明揭露的技术范围内, 可轻易 想到变化或替换, 都应涵盖在本发明的保护范围之内。 因此, 本发明的保 护范围应所述以权利要求的保护范围为准。 Through the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented by means of software plus necessary general hardware, and of course, by hardware, but in many cases, the former is a better implementation. . Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a readable storage medium, such as a floppy disk of a computer. , hard disk or CD, etc., including a number of instructions to make a computer device (can be an individual) A computer, server, or network device, etc.) performs the methods described in various embodiments of the present invention. The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the claims.

Claims

权 利 要 求 书 Claim
1、 一种异常车速数据的识别方法, 其特征在于, 包括:  A method for identifying abnormal vehicle speed data, comprising:
根据预设参数 计算目标车速数据集中每一个车速数据对象的 -距 离;  Calculating the distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameters;
从所有上述 -距离中选出所述目标车速数据集的半径参数;  Selecting a radius parameter of the target vehicle speed data set from all of the above-distances;
将大于所述半径参数的 -距离对应的车速数据对象标识为异常车速数 据。  The vehicle speed data object corresponding to the distance corresponding to the radius parameter is identified as abnormal vehicle speed data.
2、 根据权利要求 1所述的异常车速数据的识别方法, 其特征在于, 所 述从所有车速数据对象的 -距离中选出所述目标车速数据集的半径参数包 括:  2. The method for identifying abnormal vehicle speed data according to claim 1, wherein the radius parameter of the target vehicle speed data set selected from the distances of all vehicle speed data objects comprises:
将所有上述 -距离按照递增次序进行排序;  Sort all of the above - distances in ascending order;
选择所述 -距离增量变化最大时对应的 -距离作为所述目标车速数 据集的半径参数。  Selecting - the distance corresponding to the maximum distance change maximum is used as the radius parameter of the target vehicle speed data set.
3、 根据权利要求 1所述的异常车速数据的识别方法, 其特征在于, 还 包括:  The method for identifying abnormal vehicle speed data according to claim 1, further comprising:
将等于所述半径参数的 -距离对应的车速数据对象标识为边界车速数 据;  Identifying a vehicle speed data object corresponding to the distance parameter of the radius parameter as boundary vehicle speed data;
将小于所述半径参数的 -距离对应的车速数据对象标识为核心车速数 据。  A vehicle speed data object corresponding to the distance-less distance parameter is identified as core vehicle speed data.
4、 一种异常车速数据的识别装置, 其特征在于, 包括:  4. An apparatus for identifying abnormal vehicle speed data, comprising:
计算单元, 用于根据预设参数 计算目标车速数据集中每一个车速数 据对象的 -距离;  a calculating unit, configured to calculate a distance of each vehicle speed data object in the target vehicle speed data set according to the preset parameter;
选择单元, 用于从所有上述 -距离中选出所述目标车速数据集的半径 参数;  a selection unit, configured to select a radius parameter of the target vehicle speed data set from all the above-distances;
第一标识单元, 用于将大于所述半径参数的 -距离对应的车速数据对 象标识为异常车速数据。  And a first identification unit, configured to identify a vehicle speed data object corresponding to the distance greater than the radius parameter as abnormal vehicle speed data.
5、 根据权利要求 4所述的异常车速数据的识别装置, 其特征在于, 所 述选择单元包括: 5. The apparatus for identifying abnormal vehicle speed data according to claim 4, wherein The selection unit includes:
排序模块, 用于将所有上述 -距离按照递增次序进行排序;  a sorting module for sorting all of the above-distances in ascending order;
选择模块, 用于选择所述 距离增量变化最大时对应的 距离作为所 述目标车速数据集的半径参数。  And a selection module, configured to select a distance corresponding to the maximum distance increment change as a radius parameter of the target vehicle speed data set.
6、 根据权利要求 4所述的异常车速数据的识别装置, 其特征在于, 还 包括:  The device for identifying abnormal vehicle speed data according to claim 4, further comprising:
第二标识单元, 用于将等于所述半径参数的 -距离对应的车速数据对 象标识为边界车速数据;  a second identification unit, configured to identify a vehicle speed data object corresponding to the distance corresponding to the radius parameter as boundary vehicle speed data;
第三标识单元, 用于将小于所述半径参数的 -距离对应的车速数据对 象标识为核心车速数据。  And a third identification unit, configured to identify a vehicle speed data object corresponding to the distance smaller than the radius parameter as core vehicle speed data.
PCT/CN2010/078875 2009-11-19 2010-11-18 Abnormal vehicle speed data identifying method and device thereof WO2011060725A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200910237940.1 2009-11-19
CN2009102379401A CN101727749B (en) 2009-11-19 2009-11-19 Method and device for identifying abnormal vehicle speed data

Publications (1)

Publication Number Publication Date
WO2011060725A1 true WO2011060725A1 (en) 2011-05-26

Family

ID=42448581

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/078875 WO2011060725A1 (en) 2009-11-19 2010-11-18 Abnormal vehicle speed data identifying method and device thereof

Country Status (2)

Country Link
CN (1) CN101727749B (en)
WO (1) WO2011060725A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101727749B (en) * 2009-11-19 2012-01-04 北京世纪高通科技有限公司 Method and device for identifying abnormal vehicle speed data
CN106021852B (en) * 2016-05-06 2018-11-06 鼎泰生物科技(海南)有限公司 Blood glucose level data exception value calculating method based on density clustering algorithm and device
CN108847022B (en) * 2018-06-08 2021-04-16 浙江银江智慧交通集团有限公司 Abnormal value detection method of microwave traffic data acquisition equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1855169A (en) * 2005-04-29 2006-11-01 宋怀淳 Overspeed driving total-distance monitoring method on highway
CN101102277A (en) * 2007-06-20 2008-01-09 华为技术有限公司 Recognition method and system for service data and recognition control device
US20080243350A1 (en) * 2007-03-30 2008-10-02 Harkness Johnnie C System and method for receiving and using data associated with driving conditions and related parameters
WO2009012248A1 (en) * 2007-07-16 2009-01-22 Crucs Holdings, Llc System and method for monitoring vehicles on a roadway
CN101465053A (en) * 2007-12-21 2009-06-24 厦门雅迅网络股份有限公司 Method for real time accounting automobile idle speed through wireless communication network
CN101727749A (en) * 2009-11-19 2010-06-09 北京世纪高通科技有限公司 Method and device for identifying abnormal vehicle speed data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1855169A (en) * 2005-04-29 2006-11-01 宋怀淳 Overspeed driving total-distance monitoring method on highway
US20080243350A1 (en) * 2007-03-30 2008-10-02 Harkness Johnnie C System and method for receiving and using data associated with driving conditions and related parameters
CN101102277A (en) * 2007-06-20 2008-01-09 华为技术有限公司 Recognition method and system for service data and recognition control device
WO2009012248A1 (en) * 2007-07-16 2009-01-22 Crucs Holdings, Llc System and method for monitoring vehicles on a roadway
CN101465053A (en) * 2007-12-21 2009-06-24 厦门雅迅网络股份有限公司 Method for real time accounting automobile idle speed through wireless communication network
CN101727749A (en) * 2009-11-19 2010-06-09 北京世纪高通科技有限公司 Method and device for identifying abnormal vehicle speed data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XIANG HONGYAN, TRAFFIC INCIDENT DETECTION BASED ON THE PREDICTION DIVIATION CENTRAL SOUTH HIGHWAY ENGINEERING, vol. 32, no. 2, April 2007 (2007-04-01), pages 142 - 147 *

Also Published As

Publication number Publication date
CN101727749A (en) 2010-06-09
CN101727749B (en) 2012-01-04

Similar Documents

Publication Publication Date Title
CN106960571B (en) Method and device for determining road congestion bottleneck point, server and storage medium
Jia et al. Data driven congestion trends prediction of urban transportation
WO2019047905A1 (en) Road traffic analysis system, method and apparatus
CN108922174B (en) Dynamic classification method for paths in group of intersections around expressway entrance ramp
CN106887141B (en) Queuing theory-based continuous traffic node congestion degree prediction model, system and method
CN106651027B (en) Internet regular bus route optimization method based on social network
CN108847022B (en) Abnormal value detection method of microwave traffic data acquisition equipment
CN109635914B (en) Optimized extreme learning machine trajectory prediction method based on hybrid intelligent genetic particle swarm
US11238728B2 (en) Determining traffic congestion patterns
CN108182240B (en) Interest point increasing rate prediction model training and prediction method, device and storage medium
Sevlian et al. Travel time estimation using floating car data
Hwang et al. Apply Scikit-learn in python to analyze driver behavior based on OBD data
WO2018177421A1 (en) Method and device for predicting vehicle emergence point
CN106530685B (en) A kind of traffic data Forecasting Approach for Short-term and device
WO2011060725A1 (en) Abnormal vehicle speed data identifying method and device thereof
CN108830414B (en) Load prediction method for commercial charging area of electric automobile
CN113034940A (en) Fisher ordered clustering-based single-point signalized intersection optimization timing method
CN109344247B (en) Method and apparatus for outputting information
Xu et al. Applying finite mixture models to New York City travel times
Sun et al. Traffic congestion pattern detection using an improved mcmaster algorithm
CN113159416B (en) Calculation method for bus single card swiping get-off station and intelligent terminal
CN114564627A (en) Enterprise relation analysis method and system based on truck data and storage medium
Waury et al. Indexing trajectories for travel-time histogram retrieval
CN112967495A (en) Short-time traffic flow prediction method and system based on big data of movement track
Kim et al. Analysis on the transportation point in Cheongju City using Pagerank algorithm

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10831140

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10831140

Country of ref document: EP

Kind code of ref document: A1