CN109101549B - Taxi GPS track data extraction method beyond operation range - Google Patents

Taxi GPS track data extraction method beyond operation range Download PDF

Info

Publication number
CN109101549B
CN109101549B CN201810746928.2A CN201810746928A CN109101549B CN 109101549 B CN109101549 B CN 109101549B CN 201810746928 A CN201810746928 A CN 201810746928A CN 109101549 B CN109101549 B CN 109101549B
Authority
CN
China
Prior art keywords
taxi
data
gps track
range
partition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810746928.2A
Other languages
Chinese (zh)
Other versions
CN109101549A (en
Inventor
汪劲松
黄林彬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Cirrus Sci Tech Dev Co ltd
Original Assignee
Guangdong Cirrus Sci Tech Dev Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Cirrus Sci Tech Dev Co ltd filed Critical Guangdong Cirrus Sci Tech Dev Co ltd
Priority to CN201810746928.2A priority Critical patent/CN109101549B/en
Publication of CN109101549A publication Critical patent/CN109101549A/en
Application granted granted Critical
Publication of CN109101549B publication Critical patent/CN109101549B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0645Rental transactions; Leasing transactions

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Traffic Control Systems (AREA)
  • Position Fixing By Use Of Radio Waves (AREA)

Abstract

The invention discloses a method for extracting taxi GPS track data beyond an operation range, which comprises the steps of dividing a taxi operation table into a plurality of taxi operation partition tables, dividing the taxi GPS track table into a plurality of taxi GPS track partition tables, searching taxi operation data in the taxi operation partition tables, comparing the taxi operation data with electronic fence data of a selected urban area, taking taxi operation data beyond the longitude and latitude range of the electronic fence data as abnormal taxi operation data, and extracting taxi GPS track data with the same license plate number in the taxi GPS track partition tables and the abnormal taxi operation data as taxi GPS track data beyond the operation range of the selected urban area. By implementing the method, massive taxi data can be rapidly and effectively processed and stored, taxi GPS track data beyond the operation range can be obtained through automatic calculation and matching, and whether the taxi is operated beyond the range or not is judged without manual experience.

Description

Taxi GPS track data extraction method beyond operation range
Technical Field
The invention relates to the technical field of information, in particular to a method for extracting taxi GPS track data beyond an operation range.
Background
With the continuous development of transportation in China, taking a taxi for going out becomes an indispensable going-out mode for people. The number of taxis is rapidly increased, and meanwhile, the problem of over-range operation of a plurality of taxis occurs, so that how to judge whether the taxis operate in a specified area is important to the management of the taxis. The inventor finds that the following technical problems exist in the prior art in the process of implementing the invention: in the prior art, taxi operation track GPS data simply extracted by using single data nodes can only be used for taxi operation track playback, whether a taxi exceeds an operation range or not cannot be effectively judged, the operation range of the taxi needs to be judged according to manual experience, meanwhile, monthly data volume is more than a million level, and a traditional data processing mode cannot quickly and effectively process and store massive data.
Disclosure of Invention
The method for extracting the taxi GPS track data beyond the operating range can quickly and effectively process and store massive taxi data, and can automatically calculate and match the taxi GPS track data beyond the operating range, so that whether the taxi is operated beyond the range is judged, and whether the taxi is operated beyond the range is judged without manual experience.
A taxi GPS track data extraction method beyond the operation range comprises the following steps:
the taxi operation table is divided into a plurality of taxi operation partition tables through a database horizontal partition technology, and the taxi GPS track table is divided into a plurality of taxi GPS track partition tables.
And retrieving taxi operation data in the taxi operation partition table, comparing the taxi operation data with the electronic fence data of the pre-selected urban area, and taking the taxi operation data beyond the latitude and longitude range of the electronic fence data as abnormal taxi operation data.
And extracting taxi GPS track data with the same license plate number in the taxi GPS track partition table and the abnormal taxi operation data to be used as taxi GPS track data exceeding the selected urban area operation range.
Further, the taxi operation table and the taxi GPS track table are horizontally partitioned through a database horizontal partitioning technology, and are divided into a plurality of taxi operation partition tables and a plurality of taxi GPS track partition tables, and the method specifically comprises the following steps:
the taxi operation table is horizontally partitioned according to months and is divided into one table per month to obtain a plurality of taxi operation partition tables, and the taxi GPS track table is horizontally partitioned according to days and is divided into one table per day to obtain a plurality of taxi GPS track partition tables.
Further, retrieving taxi operation data in the taxi operation partition table specifically comprises:
and executing computer operation defining a timing task, automatically retrieving data in the taxi operation partition table of the current month according to a preset frequency, and sequencing according to data generation time to obtain taxi operation data.
Further, comparing the taxi operation data with the electronic fence data of the pre-selected urban area, and taking the taxi operation data exceeding the longitude and latitude range of the electronic fence data as abnormal taxi operation data, specifically:
and E, storing the electronic fence data of the selected urban area into a distributed cache queue A, and storing the taxi operation data into a distributed cache queue B.
And executing computer operation for starting a plurality of taxi effective operation range calculation services, wherein each service takes out taxi operation data from the distributed cache queue B, then takes out electronic fence data from the distributed cache queue A, compares the starting-pricing longitude and latitude and the ending-pricing longitude and latitude in the taxi operation data with the longitude and latitude of the electronic fence, and stores the taxi operation data which is not in the longitude and latitude range of the electronic fence as abnormal taxi operation data in the distributed cache queue C.
Further, extracting taxi GPS track data with the same license plate number in the taxi GPS track partition table and the abnormal taxi operation data, as taxi GPS track data exceeding the selected downtown operation range, specifically:
and executing computer operation for starting a plurality of taxi GPS track data extraction services, wherein each service takes out abnormal taxi operation data from the distributed cache queue C, and matching taxi GPS track data with the same license plate number in a taxi GPS track partition table on the corresponding date through the date of the starting time and the ending time in the abnormal taxi operation data to obtain taxi GPS track data beyond the operation range.
Further, the executed calculation operation is realized by a Hadoop technology.
The method for extracting taxi GPS track data beyond the operating range provided by the embodiment of the invention has the following beneficial effects: the taxi management data and the taxi GPS data are subjected to data partitioning and storage through a database horizontal partitioning and distributed storage technology, the problem of how to quickly and effectively process and store massive data is solved, meanwhile, the taxi management data, selected electronic fences of various urban areas and taxi management GPS track data are calculated and processed through a Hadoop technology, and taxi GPS track data which are operated in an out-of-range mode are screened out.
Drawings
Fig. 1 is a schematic flow chart of a method for extracting taxi GPS track data beyond an operating range according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, a schematic flow chart of a method for extracting taxi GPS track data beyond an operating range according to an embodiment of the present invention is shown, including:
s101: the taxi operation table is divided into a plurality of taxi operation partition tables through a database horizontal partition technology, and the taxi GPS track table is divided into a plurality of taxi GPS track partition tables, which specifically comprise:
the taxi operation table is horizontally partitioned according to months and is divided into one table per month to obtain a plurality of taxi operation partition tables, and the taxi GPS track table is horizontally partitioned according to days and is divided into one table per day to obtain a plurality of taxi GPS track partition tables.
S102: retrieving taxi operation data in the taxi operation partition table, comparing the taxi operation data with fence data of a pre-selected urban area, and taking taxi operation data exceeding the longitude and latitude range of the fence data as abnormal taxi operation data, wherein the method specifically comprises the following steps: executing computer operation defining a timing task, automatically retrieving data in the taxi operation partition table of the current month according to preset frequency, sequencing according to data generation time, obtaining taxi operation data, then storing the electronic fence data of the selected urban area into a distributed cache queue A, and storing the taxi operation data into a distributed cache queue B.
And executing computer operation for starting a plurality of taxi effective operation range calculation services, wherein each service takes out taxi operation data from the distributed cache queue B, then takes out electronic fence data from the distributed cache queue A, compares the starting-pricing longitude and latitude and the ending-pricing longitude and latitude in the taxi operation data with the longitude and latitude of the electronic fence, and stores the taxi operation data which is not in the longitude and latitude range of the electronic fence as abnormal taxi operation data in the distributed cache queue C.
S103: extracting taxi GPS track data with the same license plate number in the taxi GPS track partition table and the abnormal taxi operation data as taxi GPS track data exceeding the selected downtown operation range, specifically:
and executing computer operation for starting a plurality of taxi GPS track data extraction services, wherein each service takes out abnormal taxi operation data from the distributed cache queue C, and matching taxi GPS track data with the same license plate number in a taxi GPS track partition table on the corresponding date through the date of the starting time and the ending time in the abnormal taxi operation data to obtain taxi GPS track data beyond the operation range.
In order to better illustrate the flow of the implementation of the method of the invention, the method of the invention is described in detail below:
taking a taxi operated in the Foshan city as a selected taxi object, taking longitude and latitude data of the region range of the Foshan city as electronic fence data of the selected city area, and storing the electronic fence data in a distributed cache queue A.
By a database horizontal partitioning technology, a taxi operation data table for operating taxies in the Fushan city is partitioned according to the year and the month and is divided into a table for each month, and a taxi GPS track table is partitioned according to the day and is divided into tables for each day.
By Hadoop technology, the following operations are executed in a computer:
defining a timing task 'JOB' to automatically search the taxi operation data in the taxi operation table in the current month according to the frequency of once every 5 minutes, and taking out the latest taxi operation data according to the time sequence generated by the data and storing the latest taxi operation data in a distributed cache queue B.
And starting a plurality of taxi effective operation range calculation services, sequentially taking taxi operation data from a distributed cache queue B by each service according to the first-in first-out characteristic of the queue, simultaneously taking the Foshan city electronic fence data stored in the distributed cache queue A, comparing the longitude and latitude of the taxi in which the taxi starts to be priced and finishes to the longitude and latitude of the area range in the Foshan city electronic fence data, if the longitude and latitude of the taxi in which the taxi starts to be priced and the longitude and latitude of the taxi in which the taxi finishes to be priced do not fall within the longitude and latitude range of the Foshan city, judging that the taxi is operated out of the range, taking out the operation data of the taxi as abnormal taxi operation data, and storing the abnormal taxi operation data in a distributed cache queue C.
Starting a plurality of taxi GPS track data extraction services, taking abnormal taxi operation data from the buffer queue C by each service, searching taxi GPS track data which is the same as the license plate number in the abnormal taxi operation data in the taxi GPS track partition table on the corresponding date, and taking out and storing the taxi GPS track data as an effective evidence relation table.
And storing the taxi GPS track data which exceeds the operation range and is extracted from the calculation result in a relational database to finish the whole calculation and data matching process.
By implementing the embodiment of the invention, the taxi operation data and the taxi GPS data can be subjected to data partitioning and storage by utilizing a database horizontal partitioning and distributed storage technology, the problem of how to quickly and effectively process and store massive data is solved, meanwhile, the taxi operation data, the selected electronic fences of various urban areas and the taxi operation GPS track data are calculated and processed by utilizing a Hadoop technology, the taxi GPS track data which is operated in an out-of-range mode is obtained by calculation and matching, and whether the taxi is operated in the out-of-range mode or not is judged according to manual experience.
It should be noted that the above describes a preferred embodiment of the present invention, and it will be apparent to those skilled in the art that several modifications and adaptations can be made without departing from the principle of the present invention, and such modifications and adaptations are also considered to be within the scope of the present invention.

Claims (5)

1. A taxi GPS track data extraction method beyond the operation range is characterized by comprising the following steps:
dividing a taxi operation table into a plurality of taxi operation partition tables by a database horizontal partition technology, and dividing a taxi GPS track table into a plurality of taxi GPS track partition tables;
retrieving taxi operation data in the taxi operation partition table, comparing the taxi operation data with electronic fence data of a pre-selected urban area, and taking taxi operation data exceeding the longitude and latitude range of the electronic fence data as abnormal taxi operation data;
extracting taxi GPS track data with the same license plate number in the taxi GPS track partition table and the abnormal taxi operation data to serve as taxi GPS track data exceeding the selected urban area operation range;
the taxi operation data are compared with electronic fence data of a pre-selected urban area, taxi operation data exceeding the longitude and latitude range of the electronic fence data are used as abnormal taxi operation data, and the method specifically comprises the following steps:
the electronic fence data of the selected urban area are stored in a distributed cache queue A, and the taxi operation data are stored in a distributed cache queue B;
and executing computer operation for starting a plurality of taxi effective operation range calculation services, wherein each service takes out taxi operation data from the distributed cache queue B, then takes out the data of the electronic fence from the distributed cache queue A, compares the starting-pricing longitude and latitude and the ending-pricing longitude and latitude in the taxi operation data with the longitude and latitude of the electronic fence, and stores the taxi operation data, as abnormal taxi operation data, in the distributed cache queue C, wherein the starting-pricing longitude and latitude and the ending-pricing longitude are not in the longitude and latitude range of the electronic fence.
2. The method for extracting taxi GPS track data beyond the operating range according to claim 1, wherein the taxi operation table is divided into a plurality of taxi operation partition tables by a database horizontal partition technique, and the taxi GPS track table is divided into a plurality of taxi GPS track partition tables, specifically:
the taxi operation table is horizontally partitioned according to months and is divided into one table per month to obtain a plurality of taxi operation partition tables, and the taxi GPS track table is horizontally partitioned according to days and is divided into one table per day to obtain a plurality of taxi GPS track partition tables.
3. The method for extracting taxi GPS track data beyond the operating range according to claim 2, wherein the retrieving of the taxi operating data in the taxi operating partition table specifically includes:
and executing computer operation for defining a timing task, automatically retrieving data in the taxi operation partition table of the current month according to a preset frequency, and sequencing according to data generation time to obtain taxi operation data.
4. The method for extracting taxi GPS track data beyond the operating range according to claim 3, wherein the taxi GPS track data of which the license plate number is the same as that in the abnormal taxi operating data is extracted as taxi GPS track data beyond the selected downtown operating range, and specifically:
and executing computer operation for starting a plurality of taxi GPS track data extraction services, wherein each service takes out the abnormal taxi operation data from the distributed cache queue C, and matching taxi GPS track data with the same license plate number in a taxi GPS track partition table on the corresponding date through the date of the starting time and the ending time in the abnormal taxi operation data to obtain taxi GPS track data beyond the operation range.
5. The method for extracting taxi GPS track data beyond the operating range according to any one of claims 3-4, wherein the computer operation is realized by Hadoop technology.
CN201810746928.2A 2018-07-09 2018-07-09 Taxi GPS track data extraction method beyond operation range Active CN109101549B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810746928.2A CN109101549B (en) 2018-07-09 2018-07-09 Taxi GPS track data extraction method beyond operation range

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810746928.2A CN109101549B (en) 2018-07-09 2018-07-09 Taxi GPS track data extraction method beyond operation range

Publications (2)

Publication Number Publication Date
CN109101549A CN109101549A (en) 2018-12-28
CN109101549B true CN109101549B (en) 2021-12-28

Family

ID=64845931

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810746928.2A Active CN109101549B (en) 2018-07-09 2018-07-09 Taxi GPS track data extraction method beyond operation range

Country Status (1)

Country Link
CN (1) CN109101549B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101315731A (en) * 2008-07-17 2008-12-03 上海途锐信息技术有限公司 System for implementing vehicle monitoring scheduling by GPS intelligent vehicle mounted terminal
CN101398973A (en) * 2007-09-30 2009-04-01 厦门雅迅网络股份有限公司 Automobile checking method and device by GPS technology
CN104700646A (en) * 2015-03-31 2015-06-10 南京大学 Online GPS data based abnormal taxi track real-time detection method
CN105045788A (en) * 2014-05-02 2015-11-11 国民大学校产学协力团 Method of processing and analysing vehicle driving big data and system thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI269046B (en) * 2004-06-29 2006-12-21 Lite On Automotive Corp A GPS having a vehicle condition real-time reporting function

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101398973A (en) * 2007-09-30 2009-04-01 厦门雅迅网络股份有限公司 Automobile checking method and device by GPS technology
CN101315731A (en) * 2008-07-17 2008-12-03 上海途锐信息技术有限公司 System for implementing vehicle monitoring scheduling by GPS intelligent vehicle mounted terminal
CN105045788A (en) * 2014-05-02 2015-11-11 国民大学校产学协力团 Method of processing and analysing vehicle driving big data and system thereof
CN104700646A (en) * 2015-03-31 2015-06-10 南京大学 Online GPS data based abnormal taxi track real-time detection method

Also Published As

Publication number Publication date
CN109101549A (en) 2018-12-28

Similar Documents

Publication Publication Date Title
US10520326B2 (en) Driving route matching method and apparatus, and storage medium
CN107610469B (en) Day-dimension area traffic index prediction method considering multi-factor influence
CN109916413B (en) Road matching method, system, device and storage medium based on grid division
CN109151168B (en) Switching method and device of riding codes, mobile terminal and readable storage medium
US20160377443A1 (en) Method and apparatus for determining a location of a point of interest
CN107665289B (en) Operator data processing method and system
CN104142954B (en) A kind of tables of data based on frequency subregion compares update method and device
CN110766280A (en) Vehicle scheduling method and generation method and device of target order prediction model
CN111177222A (en) Model testing method and device, computing equipment and storage medium
CN111311908A (en) Method and device for identifying and processing repeated traffic information
CN108986448A (en) A kind of traffic facility management method and terminal device
CN108806254B (en) Method and device for identifying urban traffic corridor and computer readable storage medium
CN109218985A (en) A kind of shipping work abnormal behavior detection method and system based on Pattern similarity
WO2020119269A1 (en) User position determination method and apparatus, device, and computer readable storage medium
CN111008255A (en) Method and device for identifying violation location, electronic equipment and storage medium
CN112488419A (en) Passenger flow distribution prediction method, device, equipment and storage medium based on OD analysis
CN109101549B (en) Taxi GPS track data extraction method beyond operation range
CA3010482A1 (en) System and method for airport terminal area trajectory data clustering for selecting efficient terminal area procedures
CN116611678B (en) Data processing method, device, computer equipment and storage medium
CN116433053B (en) Data processing method, device, computer equipment and storage medium
CN112862283A (en) Hierarchical analysis-based urban management dispatching method and system
Cho et al. GIS-based analysis on vulnerability of ambulance response coverage to traffic condition: a case study of Seoul
CN111243266B (en) Vehicle information determination method and device and electronic equipment
CN106649341B (en) Data processing method and device
CN114358404A (en) Flight data processing method, device electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant