CN106202335B - 一种基于云计算框架的交通大数据清洗方法 - Google Patents
一种基于云计算框架的交通大数据清洗方法 Download PDFInfo
- Publication number
- CN106202335B CN106202335B CN201610517414.0A CN201610517414A CN106202335B CN 106202335 B CN106202335 B CN 106202335B CN 201610517414 A CN201610517414 A CN 201610517414A CN 106202335 B CN106202335 B CN 106202335B
- Authority
- CN
- China
- Prior art keywords
- data
- cluster
- label
- cluster center
- traffic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000004140 cleaning Methods 0.000 title claims abstract description 33
- 230000002159 abnormal effect Effects 0.000 claims abstract description 13
- 230000008569 process Effects 0.000 claims description 18
- 230000006835 compression Effects 0.000 claims description 9
- 238000007906 compression Methods 0.000 claims description 9
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000008859 change Effects 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims 1
- 238000012545 processing Methods 0.000 description 6
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 241001417517 Scatophagidae Species 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000011056 performance test Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 238000012731 temporal analysis Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Traffic Control Systems (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610517414.0A CN106202335B (zh) | 2016-06-28 | 2016-06-28 | 一种基于云计算框架的交通大数据清洗方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610517414.0A CN106202335B (zh) | 2016-06-28 | 2016-06-28 | 一种基于云计算框架的交通大数据清洗方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106202335A CN106202335A (zh) | 2016-12-07 |
CN106202335B true CN106202335B (zh) | 2019-06-28 |
Family
ID=57464827
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610517414.0A Active CN106202335B (zh) | 2016-06-28 | 2016-06-28 | 一种基于云计算框架的交通大数据清洗方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106202335B (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107092637B (zh) * | 2017-02-16 | 2020-01-31 | 北京星选科技有限公司 | 数据处理方法及装置 |
CN107241693B (zh) * | 2017-05-08 | 2020-06-30 | 上海世脉信息科技有限公司 | 一种大数据环境下无坐标传感器位置确定方法 |
CN107067852A (zh) * | 2017-06-06 | 2017-08-18 | 陈俊竹 | 应用于公共管理教学的信息系统及其使用方法 |
CN107958027A (zh) * | 2017-11-16 | 2018-04-24 | 南京邮电大学 | 一种具有QoS保障的传感网数据获取方法 |
CN109165818B (zh) * | 2018-08-02 | 2022-02-08 | 国网湖北省电力有限公司电力科学研究院 | 一种用于电气设备风险评估的负点计算方法 |
CN109189771A (zh) * | 2018-08-17 | 2019-01-11 | 浙江捷尚视觉科技股份有限公司 | 一种基于离线和在线聚类的车型数据库清洗方法 |
CN109359679A (zh) * | 2018-10-10 | 2019-02-19 | 洪月华 | 适用于广域网的分布式交通大数据并行聚类方法 |
CN109242209B (zh) * | 2018-10-12 | 2022-03-15 | 北京交通大学 | 基于K-means聚类的铁路突发事件分级预警方法 |
CN110399685A (zh) * | 2019-07-29 | 2019-11-01 | 云南电网有限责任公司电力科学研究院 | 电容型设备缺陷等级预测方法及装置 |
CN111028004A (zh) * | 2019-11-28 | 2020-04-17 | 国网吉林省电力有限公司 | 一种基于大数据技术的市场评估分析方法 |
CN111191687B (zh) * | 2019-12-14 | 2023-02-10 | 贵州电网有限责任公司 | 基于改进K-means算法的电力通信数据聚类方法 |
CN113377753A (zh) * | 2021-06-09 | 2021-09-10 | 国网吉林省电力有限公司 | 一种蓄热式电锅炉负荷数据清洗系统 |
CN114528284A (zh) * | 2022-02-18 | 2022-05-24 | 广东电网有限责任公司 | 一种底层数据清洗方法、装置、移动终端和存储介质 |
CN119116017A (zh) * | 2024-10-11 | 2024-12-13 | 深圳信息职业技术学院 | 一种工业机器人的自动化性能测试方法及系统 |
CN119226720B (zh) * | 2024-11-27 | 2025-03-21 | 北京市政交通一卡通支付有限公司 | 一种交通智能卡数据清洗方法及系统 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103064974A (zh) * | 2013-01-10 | 2013-04-24 | 东南大学 | 基于时空分析的交通流数据清洗方法 |
CN103577602A (zh) * | 2013-11-18 | 2014-02-12 | 浪潮(北京)电子信息产业有限公司 | 一种二次聚类方法及系统 |
-
2016
- 2016-06-28 CN CN201610517414.0A patent/CN106202335B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103064974A (zh) * | 2013-01-10 | 2013-04-24 | 东南大学 | 基于时空分析的交通流数据清洗方法 |
CN103577602A (zh) * | 2013-11-18 | 2014-02-12 | 浪潮(北京)电子信息产业有限公司 | 一种二次聚类方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN106202335A (zh) | 2016-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106202335B (zh) | 一种基于云计算框架的交通大数据清洗方法 | |
Wang et al. | Fast large-scale trajectory clustering | |
Sousa et al. | Vehicle trajectory similarity: Models, methods, and applications | |
Huang et al. | Survey on vehicle map matching techniques | |
CN103838863B (zh) | 一种基于云计算平台的大数据聚类算法 | |
CN103699654B (zh) | 一种跨比例尺矢量地图水网数据同名目标匹配方法 | |
CN105117488B (zh) | 一种基于混合层次聚类的分布式存储rdf数据平衡分割方法 | |
WO2022227303A1 (zh) | 信息处理方法、装置、计算机设备及存储介质 | |
CN108765961B (zh) | 一种基于改进型限幅平均滤波的浮动车数据处理方法 | |
Kong et al. | Spatial-temporal-cost combination based taxi driving fraud detection for collaborative internet of vehicles | |
Mao et al. | Outlier detection over distributed trajectory streams | |
CN117132893B (zh) | 基于深度学习与空间数据查询的地质灾害监测方法及系统 | |
US20140370920A1 (en) | Systems and methods for generating and employing an index associating geographic locations with geographic objects | |
Cho et al. | A GPS trajectory map-matching mechanism with DTG big data on the HBase system | |
CN112988849A (zh) | 一种交通轨迹模式分布式挖掘方法 | |
CN109800231B (zh) | 一种基于Flink的实时轨迹co-movement运动模式检测方法 | |
He et al. | Multiple routes recommendation system on massive taxi trajectories | |
Qing et al. | Using feature interaction among GPS Data for road intersection detection | |
CN105354243A (zh) | 基于归并聚类的并行化频繁概率子图搜索方法 | |
Temirbekova et al. | IDENTIFICATION OF AN ALGORITHM FOR THE ANALYSIS AND STUDY OF URBAN ROAD NETWORK TRAJECTORIES. | |
Li et al. | Design and implementation of trajectory data management and analysis technology framework based on spatiotemporal grid model | |
CN118262527B (zh) | 基于大数据的交通平台数据管理方法 | |
CN110059142A (zh) | 一种高效的并行不确定性数据聚类方法 | |
Tian et al. | A distributed framework for large-scale semantic trajectory similarity join | |
Moavinis | Detection of anomalous trajectories: comparison and proposal of methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 310012 1st floor, building 1, 223 Yile Road, Hangzhou City, Zhejiang Province Patentee after: Yinjiang Technology Co.,Ltd. Address before: 310012 1st floor, building 1, 223 Yile Road, Hangzhou City, Zhejiang Province Patentee before: ENJOYOR Co.,Ltd. |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20161207 Assignee: ZHEJIANG YINJIANG ZHIHUI TRAFFIC GROUP Co.,Ltd. Assignor: Yinjiang Technology Co.,Ltd. Contract record no.: X2024980042643 Denomination of invention: A method for cleaning transportation big data based on cloud computing framework Granted publication date: 20190628 License type: Common License Record date: 20250102 |