CN103514233A - Trajectory data checking method and device - Google Patents

Trajectory data checking method and device Download PDF

Info

Publication number
CN103514233A
CN103514233A CN201210227146.0A CN201210227146A CN103514233A CN 103514233 A CN103514233 A CN 103514233A CN 201210227146 A CN201210227146 A CN 201210227146A CN 103514233 A CN103514233 A CN 103514233A
Authority
CN
China
Prior art keywords
location point
sequence
interval
verified
intervals
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201210227146.0A
Other languages
Chinese (zh)
Other versions
CN103514233B (en
Inventor
张传明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210227146.0A priority Critical patent/CN103514233B/en
Publication of CN103514233A publication Critical patent/CN103514233A/en
Application granted granted Critical
Publication of CN103514233B publication Critical patent/CN103514233B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/006Identification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a trajectory data checking method and device. The trajectory data checking method includes the steps that a position point sequence Sa and a position point sequence Sb to be checked are acquired, wherein the Sa and the Sb are collected through equipment; the Sa and the Sb are sampled according to the same intervals to acquire a sequence Sa' and a sequence Sb'; the position region where the Sa' is located and the position region where the Sb' is located are divided into a plurality of equilong sections; statistics is conducted on the number of position points of the Sa' in each section and the number of position points of the Sb' in each section respectively, and whether the position point sequence Sb to be checked is wrong or not is judged according to the difference between the number of the position points of the Sa' in each section and the number of the position points of the Sb' in each section. Due to the utilization of the technical scheme, automatic checking of manually-input data is achieved, human cost consumption is effectively reduced, and checking efficiency is improved. Meanwhile, complex mathematic calculation is not needed in the whole scheme, and therefore the trajectory data checking method and device are beneficial to being implemented and popularized.

Description

A kind of track data check method and device
Technical field
The present invention relates to electronic map technique field, particularly relate to a kind of track data check method and device.
Background technology
Electronic chart, also claims numerical map, is to utilize computer technology, with the map of digital form storage and inquiry.Compare with traditional paper map, the making of electronic chart, management, reading and use can realize integrated, are convenient to modify in real time; Utilize the data query ability of computing machine, electronic chart can be realized more quick and diversified inquiry mode, is user-friendly to.In addition, electronic chart can carry out factor combination, the splicing of arbitrary form easily to the content of general map, form new map, and the information of the types such as image, sound and word can be synthesized together, and realizes multiple bandwagon effect.
The inquiry of public traffic information is an important application branch of electronic chart.At present, a lot of electronic chart products all provide Bus information query function, and for example, user inputs starting point and destination, can inquire easily the transit riding mode that comes and goes between two places, estimated time etc.For electronic chart product, the track of public bus network can be identified on the relevant position of map exactly, be to provide the prerequisite of high-quality Query Result.Current conventional track data acquisition mode, that for example, collecting device with positioning function (GPS) is installed on public transit vehicle, during vehicle operating, the collection position information line item of going forward side by side periodically, after whole process of Vehicle Driving Cycle, the positional information unification of record is gathered and is identified on map, then the location point of dispersion is connected into track.
In the implementation process of said method, because collecting device positioning function itself exists certain error, therefore the data point collecting generally cannot be linked to be rational trajectory shape automatically, as shown in Figure 1, arrow place, left side, because down time is longer, cause at same place multi collect, and the data coordinates collecting is not quite identical again; Right side arrow place, the data coordinates collecting deviating road is far, and this point obviously should not appear in normal vehicle driving trace.For fear of similar problem, occur, currently used method is: first the coordinate data of collection is identified on map, then with sign data as a reference, by artificial mode, revise, depict the rational track of shape, finally by revised track data typing map system.
Because artificial typing mode has certain randomness, therefore may there is typing Problem-Error, for fear of mistake, be necessary the data of manual entry to verify, current pattern verification is by artificial mode, the data of the data of equipment collection and manual entry to be compared, not only need to consume a large amount of human costs, and verify inefficiency.
Summary of the invention
For solving the problems of the technologies described above, the embodiment of the present invention provides a kind of track data check method and device, to realize the automatic inspection of track data, reduces human cost consumption, improves and verifies efficiency, and technical scheme is as follows:
The embodiment of the present invention provides a kind of track data check method, and the method comprises:
The location point sequence Sa of equipment collection and location point sequence Sb to be verified;
To Sa and Sb, use identical interval to sample, obtain sequence Sa ' and sequence Sb ';
Sa ' and the residing band of position of Sb ' are divided into some isometric intervals;
Add up respectively Sa ' and Sb ' in each interval location point number, according to Sa ' and Sb ', in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified.
According to a kind of embodiment of the present invention, described sampling interval is greater than the Acquisition Error of collecting device.
According to a kind of embodiment of the present invention, described Sa ' and the residing band of position of Sb ' are divided into some isometric intervals, comprising:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And/or
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals.
According to a kind of embodiment of the present invention, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
Location point number difference condition according to Sa ' and Sb ' in each abscissa zone, judges whether mistake of location point sequence Sb to be verified;
And/or
Location point number difference condition according to Sa ' and Sb ' in each ordinate interval, judges whether mistake of location point sequence Sb to be verified.
According to a kind of embodiment of the present invention, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
If count in the position of Sa ' and Sb ' arbitrary intervals, difference is greater than default first threshold, judges Sb mistake.
According to a kind of embodiment of the present invention, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
If Sa ' and Sb ' count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judges Sb mistake.
The embodiment of the present invention also provides a kind of track data check device, it is characterized in that, this device comprises:
Data input cell, for the location point sequence Sa of equipment collection and location point sequence Sb to be verified;
Sampling unit, for using identical interval to sample to Sa and Sb, obtains sequence Sa ' and sequence Sb ';
Interval division unit, for being divided into some isometric intervals by Sa ' and the residing band of position of Sb ';
Judging unit, for adding up respectively Sa ' and Sb ' in each interval location point number, in each interval location point number difference condition, judges whether mistake of location point sequence Sb to be verified according to Sa ' and Sb '.
According to a kind of embodiment of the present invention, described sampling interval is greater than the Acquisition Error of collecting device.
According to a kind of embodiment of the present invention, described interval division unit, specifically for:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And/or
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals.
According to a kind of embodiment of the present invention, described judging unit, specifically for:
Location point number difference condition according to Sa ' and Sb ' in each abscissa zone, judges whether mistake of location point sequence Sb to be verified;
And/or
Location point number difference condition according to Sa ' and Sb ' in each ordinate interval, judges whether mistake of location point sequence Sb to be verified.
According to a kind of embodiment of the present invention, described judging unit, specifically for:
In the situation that count in the position of Sa ' and Sb ' arbitrary intervals, difference is greater than default first threshold, judgement Sb mistake.
According to a kind of embodiment of the present invention, described judging unit, specifically for:
At Sa ' and Sb ', in the situation that count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judgement Sb mistake.
The technical scheme that the embodiment of the present invention provides, first carries out equal interval sampling to the data of the data of equipment collection and manual entry, and the scale of two groups of data is reached unanimity; Further, by the difference of counting of two groups of sequences after relatively sampling, judge that whether the data of manual entry are wrong.The present invention program has realized the automatic inspection to manual entry data, has effectively reduced human cost consumption, has improved verification efficiency.Meanwhile, overall plan does not need to carry out complicated mathematical operation, realizes cost lower, is more conducive to implement and promote.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, to the accompanying drawing of required use in embodiment or description of the Prior Art be briefly described below, apparently, the accompanying drawing the following describes is only some embodiment that record in the present invention, for those of ordinary skills, can also obtain according to these accompanying drawings other accompanying drawing.
Fig. 1 is that positional information gathers schematic diagram;
Fig. 2 is a kind of schematic flow sheet of embodiment of the present invention track data check method;
Fig. 3 is the schematic flow sheet that the embodiment of the present invention is sampled to sequence;
Fig. 4 is the sequence of the embodiment of the present invention contrast histogram that distributes of counting;
Fig. 5 is a kind of structural representation of embodiment of the present invention track data check device.
Embodiment
Because collecting device positioning function itself is deposited error, the location point therefore automatically collecting is difficult to automatically be connected to become route track, need to revise and typing in artificial mode.And for example, for many-sided reason (careless, use inferior materials and turn out substandard goods etc.), the data correctness of manual entry is difficult to guarantee, therefore the data of manual entry is again verified and also become a kind of objective demand.From surface, the content of verification is exactly that the track of automatic collection and artificial corrected track are compared, and sees between the two, whether there is obvious irrational deviation.This work at present need to be by manually completing, and it is very low to verify efficiency.
For realizing the automatic inspection to track data, a kind of thinking of solution is, utilize computing machine to compare the similarity of the locus of two geometric locuses and form, according to comparative result, judge whether logging data exists mistake, but this method need to be carried out complicated mathematical operation, realize high cost, for data check, such scheme is difficult to practical application.
The actual demand of verifying for track data, the embodiment of the present invention provides a kind of track data check method, and the method comprises following basic step:
The location point sequence Sa of equipment collection and location point sequence Sb to be verified;
To Sa and Sb, use identical interval to sample, obtain sequence Sa ' and sequence Sb ';
Sa ' and the residing band of position of Sb ' are divided into some isometric intervals;
Add up respectively Sa ' and Sb ' in each interval location point number, according to Sa ' and Sb ', in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified.
The check method that the embodiment of the present invention provides, first carries out equal interval sampling to the data of the data of equipment collection and manual entry, makes reaching unanimity in scale of two groups of data; Further, by the difference of counting of two groups of sequences after relatively sampling, judge that whether the data of manual entry are wrong.The present invention program has realized the automatic inspection to manual entry data, has effectively reduced human cost consumption, has improved verification efficiency.Meanwhile, overall plan does not need to carry out complicated mathematical operation, is conducive to implement and promote.
In order to make those skilled in the art understand better the technical scheme in the present invention, below in conjunction with the accompanying drawing in the embodiment of the present invention, technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, rather than whole embodiment.Embodiment based in the present invention, the every other embodiment that those of ordinary skills obtain, should belong to the scope of protection of the invention.
Shown in Fig. 2, the process flow diagram of a kind of track data check method providing for the embodiment of the present invention, the method comprises the following steps:
S101, the location point sequence Sa of equipment collection and location point sequence Sb to be verified;
First, the location point sequence that for obtaining, equipment gathers automatically, collecting device is arranged on vehicle, during vehicle operating, periodically gathers the current positional information line item of going forward side by side.Collection period can arrange according to the actual requirements, in order to collect enough data, generally can choose less collection period, for example, gather a point every 1 second.
In the present embodiment, suppose that the location point sequence that equipment collects is automatically:
Sa={(Xa1,Ya1)、(Xa2,Ya2)、(Xa3,Ya3)、……(Xan,Yan)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point, and n is the sum of location point in Sa.
Because collecting device exists error, may cause Sa cannot form the rational circuit track of shape, therefore need operating personnel that the location point in Sa is identified on map one by one, then with sign data as a reference, utilize multiple correction strategy, depict the rational circuit track of shape, wherein correction strategy can comprise the obvious irrational point of removal, according to trend of road, the position locus of points be carried out to matching etc., concrete correction strategy and the present invention program are irrelevant, are therefore not described in detail.For revised track, also need to above track, choose some unique points, generate typing sequence to carry out record.Wherein the Selection Strategy of unique point can be the point of choosing corner, or uniformly-spaced get a little etc., for concrete unique point Selection Strategy, also do not need in the present embodiment to describe in detail, but generally speaking, typing sequence Qu Dian interval can be much larger than the location point interval of equipment acquisition sequence.
In the present embodiment, suppose to after track correct and the sequence of getting be a little:
Sb={(Xb1,Yb1)、(Xb2,Yb2)、(Xb3,Yb3)、……(Xbm,Ybm)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point, and m is the sum of location point in Sb, generally speaking, compare with Sa, and the location point number m of Sb can be much smaller than n.
Because the randomness of manual entry is larger, therefore there is the objective demand that Sb is verified, the object of the present embodiment, contrasts Sa and Sb by certain means exactly, thereby realize automatic decision Sb, whether has mistake.
S102, is used identical interval to sample to Sa and Sb, obtains sequence Sa ' and sequence Sb ';
Embodiment of the present invention scheme adopts the method for the contrast of counting, and realizes the verification to track data.But for initial sequence, the location point of Sa is intensive, counting of Sb is much relatively sparse, counts and differs greatly, and cannot directly contrast.Therefore in this step, use the method for sampling that the scale of two groups of data is reached unanimity, so that follow-up further comparison.
A selected sampling interval d1, resamples respectively to Sa and Sb, and for any one group of sequence, concrete sampling rule is as follows:
S102a, the current dot information of initialization;
S102b, calculates the distance of the available point of the upper record of current distance, and judges whether the distance of the upper available point of current distance is greater than in d1, if so, and execution step S102c, otherwise execution step S102d.
The distance here adopts Euclidean distance formula to calculate, and the choosing of d1, general recommendations is greater than the Acquisition Error of collecting device, for example, be greater than 50m.Certainly choosing of d1 can arrange flexibly according to actual conditions, and the present invention does not need this to limit.
S102c, is recorded as available point by current point, jumps to next sequence of points, carries out S102a;
S102d, jumps to next sequence of points, carries out S102a;
Utilize above-mentioned flow process, travel through respectively each point in Sa and Sb, the sequence Sa ' after being sampled and Sb ':
Sa’={(Xa1’,Ya1’)、(Xa2’,Ya2’)、……(Xai’,Yai’)}
Sb’={(Xb1’,Yb1’)、(Xb2’,Yb2’)、……(Xbj’,Ybj’)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point in sequence, and i is the sum of the middle location point of Sa ', and j is the sum of location point in Sb '.Under normal circumstances, the total length of two tracks is approximately equalised, and sampling interval is also consistent, if therefore logging data does not exist gross error, the numerical value of i and j also should approach so.A kind of extreme situation is: the difference of counting of Sa ' and Sb ' is obvious, for example, differ several times, and this illustrates that the two corresponding course length also falls far short, and in this case, can directly judge that the data Sb of manual entry is wrong.Yet in actual applications, more general a kind of situation is counting of Sa ' and Sb ' approximate (even likely identical), cannot directly judge logging data mistake in this case, for this situation, in the subsequent step of the present embodiment, adopt the mode that sequence demarcation interval is contrasted to realize data check.
S103, is divided into some isometric intervals by Sa ' and the residing band of position of Sb ';
Ensuing problem, be converted into and judge two problems whether sequence Sa ' is similar with Sb ', remove above outside said extreme case, in most of situation, after sampling, Sa ' and counting of Sb ' are all similar, even if typing is correct, owing to there being the artificial environment of revising, also may cause counting of Sa ' and Sb ' not quite identical, therefore the location point of Sa ' and Sb ' is judged to be inappropriate between two, therefore the mode that in embodiments of the present invention, adopts by stages relatively to count realizes verification.
A kind of the most direct method is, chooses a burst length d2, and the track of Sa ' and Sb ' is divided into respectively to the segmentation that some length is d2.Therefore yet under many circumstances, the track of Sa ' and Sb ' all more or less can exist bending, directly measures length and implements relative complex, in a kind of preferred implementation of the present invention, adopt the scheme that horizontal ordinate and ordinate are processed respectively, that is:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
Certainly, in actual applications, can only to horizontal ordinate, process, or only ordinate be processed, or transverse and longitudinal coordinate is all processed, the present invention does not need this to limit.
On the basis of such scheme, specifically can adopt multiple interval division strategy, for example:
A) fixed range burst length d2;
No matter how long the actual coordinate span of sequence has, all it is divided with the burst length of fixing, and default d2=500m for example, so:
The sequence that is 4km for coordinate span, is divided into 8 intervals;
The sequence that is 5km for coordinate span, is divided into 10 intervals;
The sequence that is 5.5km for coordinate span, is divided into 11 intervals;
......
B) the interval quantity of fixed partition;
No matter how long the actual coordinate span of sequence has, and is all divided into the interval of fixed qty, and for example default division numbers is 10, so:
The sequence that is 4km for coordinate span, burst length d2 is 400m;
The sequence that is 5km for coordinate span, burst length d2 is 500m;
The sequence that is 5.5km for coordinate span, burst length d2 is 550m;
......
Above two kinds of modes, all applicable respectively for horizontal ordinate and ordinate, wherein, interval division can be usingd the end points of Sa ' or Sb ' as starting point, also can directly utilize the coordinate-system of map self.But it should be noted that, because being compares Sa ' and Sb ', therefore no matter adopt what kind of interval division strategy, must guarantee that Sa ' is consistent with the residing coordinate-system of Sb ', and the burst length d2 that division Sa ' adopts with Sb ' is also consistent.
In addition, for d2 numerical value, choose, do not have strict regulation, but it will be understood by those skilled in the art that, the effect of relatively counting in order to realize by stages, the point that should comprise some in each interval, so the value of d2 generally should reach the several times of sampled distance d1, for example, stipulate that the value of d2 is between 5*d1 and 15*d1.
S104, adds up respectively Sa ' and Sb ' in each interval location point number, according to Sa ' and Sb ', in each interval location point number difference condition, judges whether mistake of location point sequence Sb to be verified.
According to the explanation of preceding step, owing to there being the artificial link of revising track, therefore for Sa ' and Sb ', allow to exist to a certain degree location point sum difference, if but difference is excessive, can directly judge that the data Sb of manual entry is wrong.
On the other hand, this is for Sa ' and Sb ', if typing is correct, it should be also similar counting of each interval, distributing, based on this rule, in this step, add up respectively Sa ' and Sb ' in each interval location point number, then according to counting difference condition, judge whether to exist typing mistake.Concrete determination strategy can comprise:
If a) position of Sa ' and Sb ' the arbitrary intervals difference of counting is greater than default first threshold, judge Sb mistake;
That is to say, if discovery Sa ' and Sb ' are excessive in the difference of counting in certain interval, think and may have typing mistake.
B) if Sa ' and Sb ' count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judges Sb mistake.
That is to say, allow Sa ' and Sb ' in single interval, to have certain difference of counting, if but the difference of counting in a plurality of intervals is accumulated to a certain degree, and think and may have typing mistake.
Be understandable that, in the process of Cumulative sum, the difference of counting in each interval should be with absolute value representation.Based on similar thought, can also utilize variance/standard deviation to judge, be not described in detail here.
In actual applications, above-mentioned strategy can be distinguished independent use, also can be combined with, and the present invention does not need this to limit.
In addition, due in explanation above, taked the mode that horizontal ordinate and ordinate are processed respectively, therefore in this step, correspondingly also can based on horizontal ordinate or ordinate, judge independently, or horizontal ordinate and ordinate are all judged, to realize more comprehensively, verify.
Certainly, be understandable that, if what adopt when interval division is the mode of directly geometric locus being carried out to segmentation, correspondingly tackles in this step counting in each curve segmentation and compare.
Below in conjunction with a simple embodiment, the solution of the present invention is schematically illustrated:
Suppose that sequence Sa ' and Sb ' after sampling are as follows respectively:
Sa ', totally 100 points, coordinate is as follows respectively:
A1(1,99)
A2(2,98)
A3(2.5,97)
A4(2.8,96.5)
A5(3,99)
......
A100(80,25)
Sb ', totally 95 points, coordinate is as follows respectively:
B1(1.4,99.2)
B2(2,99)
B3(2.6,97.5)
B4(3.1,97)
B5(3.2,95.2)
......
B1(79,24.5)
Interval division is carried out in DuiSa’ locus below, take horizontal ordinate as example, and the horizontal ordinate span of Sa ' and Sb ' is [1,80], if adopt the own coordinate system of map system, and according to length 10 demarcation intervals, the horizontal ordinate span of Sa ' can be divided into 8 intervals;
[0,10)、[10,20)、[20,30)、……[70,80)
Add up respectively Sa ' and Sb ' and drop on above-mentioned 8 interval Nei positions and count, shown in as if statistics result table 1:
Abscissa zone Sa ' counts Sb ' counts The difference of counting
[0,10) 15 15 0
[10,20) 8 7 1
[20,30) 10 10 0
[30,40) 13 11 2
[40,50) 14 14 0
[50,60) 16 15 1
[60,70) 14 13 1
[70,80) 10 10 0
Table 1
As shown in Figure 4, the data in form can be represented with represented as histograms, be conducive to find out more intuitively that Sa ' and Sb ' are in the distribution situation of counting in individual interval.Wherein in single interval, the maximum difference of counting is 2, and total difference of counting in all intervals is 5.Suppose default standard be single interval count poor be no more than 5, total difference of counting in all intervals is no more than 10, so from horizontal ordinate aspect, can judge that data typing is qualified.Further can adopt similar mode, by ordinate, judge.For arbitrary group of data, if having any one to count, horizontal ordinate and ordinate exceed standard, can judge that this group data typing is defective.
The final purpose that track data is verified, just in order to detect the data that may have typing mistake the data from a large amount of, do not need to realize automatic error-correcting, do not need to carry out data analysis or prediction yet, embodiment of the present invention scheme is according to the regularity of distribution of track data, the mode of relatively counting by by stages has realized the automatic inspection to manual entry data, has effectively reduced human cost consumption, has improved verification efficiency.Meanwhile, overall plan does not need to carry out complicated mathematical operation, is more conducive to implement and promote.
Corresponding to embodiment of the method above, the present invention also provides a kind of track data check device, shown in Figure 5, and this device can comprise data input cell 110, sampling unit 120, interval division unit 130, judging unit 140, specific as follows:
Data input cell 110, for the location point sequence Sa of equipment collection and location point sequence Sb to be verified;
First, the location point sequence that for obtaining, equipment gathers automatically, collecting device is arranged on vehicle, during vehicle operating, periodically gathers the current positional information line item of going forward side by side.Collection period can arrange according to the actual requirements, in order to collect enough data, generally can choose less collection period, for example, gather a point every 1 second.
In the present embodiment, suppose that the location point sequence that equipment collects is automatically:
Sa={(Xa1,Ya1)、(Xa2,Ya2)、(Xa3,Ya3)、……(Xan,Yan)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point, and n is the sum of location point in Sa.
Because collecting device exists error, may cause Sa cannot form the rational circuit track of shape, therefore need operating personnel that the location point in Sa is identified on map one by one, then with sign data as a reference, utilize multiple correction strategy, depict the rational circuit track of shape, wherein correction strategy can comprise the obvious irrational point of removal, according to trend of road, the position locus of points be carried out to matching etc., concrete correction strategy and the present invention program are irrelevant, are therefore not described in detail.For revised track, also need to above track, choose some unique points, generate typing sequence to carry out record.Wherein the Selection Strategy of unique point can be the point of choosing corner, or uniformly-spaced get a little etc., for concrete unique point Selection Strategy, also do not need in the present embodiment to describe in detail, but generally speaking, typing sequence Qu Dian interval can be much larger than the location point interval of equipment acquisition sequence.
In the present embodiment, suppose to after track correct and the sequence of getting be a little:
Sb={(Xb1,Yb1)、(Xb2,Yb2)、(Xb3,Yb3)、……(Xbm,Ybm)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point, and m is the sum of location point in Sb, generally speaking, compare with Sa, and the location point number m of Sb can be much smaller than n.
Because the randomness of manual entry is larger, therefore there is the objective demand that Sb is verified, the object of the present embodiment, contrasts Sa and Sb by certain means exactly, thereby realize automatic decision Sb, whether has mistake.
Sampling unit 120, for using identical interval to sample to Sa and Sb, obtains sequence Sa ' and sequence Sb ';
Embodiment of the present invention scheme adopts the method for the contrast of counting, and realizes the verification to track data.But for initial sequence, the location point of Sa is intensive, counting of Sb is much relatively sparse, counts and differs greatly, and cannot directly contrast.Therefore in this step, use the method for sampling that the scale of two groups of data is reached unanimity, so that follow-up further comparison.
Sequence after sampling is respectively Sa ' and Sb ':
Sa’={(Xa1’,Ya1’)、(Xa2’,Ya2’)、……(Xai’,Yai’)}
Sb’={(Xb1’,Yb1’)、(Xb2’,Yb2’)、……(Xbj’,Ybj’)}
Wherein X, Y represent respectively horizontal ordinate and the ordinate of location point in sequence, and i is the sum of the middle location point of Sa ', and j is the sum of location point in Sb '.Under normal circumstances, the total length of two tracks is approximately equalised, and sampling interval is also consistent, if therefore logging data does not exist gross error, the numerical value of i and j also should approach so.A kind of extreme situation is: the difference of counting of Sa ' and Sb ' is obvious, for example, differ several times, and this illustrates that the two corresponding course length also falls far short, and in this case, can directly judge that the data Sb of manual entry is wrong.Yet in actual applications, more general a kind of situation is counting of Sa ' and Sb ' approximate (even likely identical), cannot directly judge logging data mistake in this case, for this situation, in the follow-up function unit of the present embodiment, adopt the mode that sequence demarcation interval is contrasted to realize data check.
Interval division unit 130, for being divided into some isometric intervals by Sa ' and the residing band of position of Sb ';
Ensuing problem, be converted into and judge two problems whether sequence Sa ' is similar with Sb ', remove above outside said extreme case, in most of situation, after sampling, Sa ' and counting of Sb ' are all similar, even if typing is correct, owing to there being the artificial environment of revising, also may cause counting of Sa ' and Sb ' not quite identical, therefore the location point of Sa ' and Sb ' is judged to be inappropriate between two, therefore the mode that in embodiments of the present invention, adopts by stages relatively to count realizes verification.
A kind of the most direct method is, chooses a burst length d2, and the track of Sa ' and Sb ' is divided into respectively to the segmentation that some length is d2.Therefore yet under many circumstances, the track of Sa ' and Sb ' all more or less can exist bending, directly measures length and implements relative complex, in a kind of preferred implementation of the present invention, adopt the scheme that horizontal ordinate and ordinate are processed respectively, that is:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
Certainly, in actual applications, can only to horizontal ordinate, process, or only ordinate be processed, or transverse and longitudinal coordinate is all processed, the present invention does not need this to limit.
On the basis of such scheme, specifically can adopt multiple interval division strategy, for example:
A) fixed range burst length d2;
No matter how long the actual coordinate span of sequence has, all it is divided with the burst length of fixing, and default d2=500m for example, so:
The sequence that is 4km for coordinate span, is divided into 8 intervals;
The sequence that is 5km for coordinate span, is divided into 10 intervals;
The sequence that is 5.5km for coordinate span, is divided into 11 intervals;
......
B) the interval quantity of fixed partition;
No matter how long the actual coordinate span of sequence has, and is all divided into the interval of fixed qty, and for example default division numbers is 10, so:
The sequence that is 4km for coordinate span, burst length d2 is 400m;
The sequence that is 5km for coordinate span, burst length d2 is 500m;
The sequence that is 5.5km for coordinate span, burst length d2 is 550m;
......
Above two kinds of modes, all applicable respectively for horizontal ordinate and ordinate, wherein, interval division can be usingd the end points of Sa ' or Sb ' as starting point, also can directly utilize the coordinate-system of map self.But it should be noted that, because being compares Sa ' and Sb ', therefore no matter adopt what kind of interval division strategy, must guarantee that Sa ' is consistent with the residing coordinate-system of Sb ', and the burst length d2 that division Sa ' adopts with Sb ' is also consistent.
In addition, for d2 numerical value, choose, do not have strict regulation, but it will be understood by those skilled in the art that, the effect of relatively counting in order to realize by stages, the point that should comprise some in each interval, so the value of d2 generally should reach the several times of sampled distance d1, for example, stipulate that the value of d2 is between 5*d1 and 15*d1.
Judging unit 140, for adding up respectively Sa ' and Sb ' in each interval location point number, in each interval location point number difference condition, judges whether mistake of location point sequence Sb to be verified according to Sa ' and Sb '.
According to explanation above, owing to there being the artificial link of revising track, therefore for Sa ' and Sb ', allow to exist to a certain degree location point sum difference, if but difference is excessive, can directly judge that the data Sb of manual entry is wrong.
On the other hand, this is for Sa ' and Sb ', if typing is correct, it should be also similar counting of each interval, distributing, based on this rule, in judging unit 140, add up respectively Sa ' and Sb ' in each interval location point number, then according to counting difference condition, judge whether to exist typing mistake.Concrete determination strategy can comprise:
If a) position of Sa ' and Sb ' the arbitrary intervals difference of counting is greater than default first threshold, judge Sb mistake;
That is to say, if discovery Sa ' and Sb ' are excessive in the difference of counting in certain interval, think and may have typing mistake.
B) if Sa ' and Sb ' count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judges Sb mistake.
That is to say, allow Sa ' and Sb ' in single interval, to have certain difference of counting, if but the difference of counting in a plurality of intervals is accumulated to a certain degree, and think and may have typing mistake.
Be understandable that, in the process of Cumulative sum, the difference of counting in each interval should be with absolute value representation.Based on similar thought, can also utilize variance/standard deviation to judge, be not described in detail here.
In actual applications, above-mentioned strategy can be distinguished independent use, also can be combined with, and the present invention does not need this to limit.
In addition, due in explanation above, taked the mode that horizontal ordinate and ordinate are processed respectively, therefore in judging unit 140, correspondingly also can based on horizontal ordinate or ordinate, judge independently, or horizontal ordinate and ordinate are all judged, to realize more comprehensively, verify.
While for convenience of description, describing above device, with function, being divided into various unit describes respectively.Certainly, implement when of the present invention can Ba Ge unit function in same or a plurality of software and/or hardware, realize.
As seen through the above description of the embodiments, those skilled in the art can be well understood to the mode that the present invention can add essential general hardware platform by software and realizes.Understanding based on such, the part that technical scheme of the present invention contributes to prior art in essence in other words can embody with the form of software product, this computer software product can be stored in storage medium, as ROM/RAM, magnetic disc, CD etc., comprise that some instructions are with so that a computer equipment (can be personal computer, server, or the network equipment etc.) carry out the method described in some part of each embodiment of the present invention or embodiment.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, between each embodiment identical similar part mutually referring to, each embodiment stresses is the difference with other embodiment.Especially, for device embodiment, because it is substantially similar in appearance to embodiment of the method, so describe fairly simplely, relevant part is referring to the part explanation of embodiment of the method.Apparatus and system embodiment described above is only schematic, the wherein said unit as separating component explanation can or can not be also physically to separate, the parts that show as unit can be or can not be also physical locations, can be positioned at a place, or also can be distributed in a plurality of network element.Can select according to the actual needs some or all of module wherein to realize the object of the present embodiment scheme.Those of ordinary skills, in the situation that not paying creative work, are appreciated that and implement.
The above is only the specific embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (12)

1. a track data check method, is characterized in that, the method comprises:
The location point sequence Sa of equipment collection and location point sequence Sb to be verified;
To Sa and Sb, use identical interval to sample, obtain sequence Sa ' and sequence Sb ';
Sa ' and the residing band of position of Sb ' are divided into some isometric intervals;
Add up respectively Sa ' and Sb ' in each interval location point number, according to Sa ' and Sb ', in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified.
2. method according to claim 1, is characterized in that, described sampling interval is greater than the Acquisition Error of collecting device.
3. method according to claim 1, is characterized in that, described Sa ' and the residing band of position of Sb ' is divided into some isometric intervals, comprising:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And/or
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals.
4. method according to claim 3, is characterized in that, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
Location point number difference condition according to Sa ' and Sb ' in each abscissa zone, judges whether mistake of location point sequence Sb to be verified;
And/or
Location point number difference condition according to Sa ' and Sb ' in each ordinate interval, judges whether mistake of location point sequence Sb to be verified.
5. method according to claim 1, is characterized in that, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
If count in the position of Sa ' and Sb ' arbitrary intervals, difference is greater than default first threshold, judges Sb mistake.
6. method according to claim 1, is characterized in that, described according to Sa ' and Sb ' in each interval location point number difference condition, judge whether mistake of location point sequence Sb to be verified, comprising:
If Sa ' and Sb ' count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judges Sb mistake.
7. a track data check device, is characterized in that, this device comprises:
Data input cell, for the location point sequence Sa of equipment collection and location point sequence Sb to be verified;
Sampling unit, for using identical interval to sample to Sa and Sb, obtains sequence Sa ' and sequence Sb ';
Interval division unit, for being divided into some isometric intervals by Sa ' and the residing band of position of Sb ';
Judging unit, for adding up respectively Sa ' and Sb ' in each interval location point number, in each interval location point number difference condition, judges whether mistake of location point sequence Sb to be verified according to Sa ' and Sb '.
8. device according to claim 7, is characterized in that, described sampling interval is greater than the Acquisition Error of collecting device.
9. device according to claim 7, is characterized in that, described interval division unit, specifically for:
The horizontal ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals;
And/or
The ordinate span of Sa ' and the residing band of position of Sb ' is divided into some isometric intervals.
10. device according to claim 9, is characterized in that, described judging unit, specifically for:
Location point number difference condition according to Sa ' and Sb ' in each abscissa zone, judges whether mistake of location point sequence Sb to be verified;
And/or
Location point number difference condition according to Sa ' and Sb ' in each ordinate interval, judges whether mistake of location point sequence Sb to be verified.
11. devices according to claim 7, is characterized in that, described judging unit, specifically for:
In the situation that count in the position of Sa ' and Sb ' arbitrary intervals, difference is greater than default first threshold, judgement Sb mistake.
12. devices according to claim 7, is characterized in that, described judging unit, specifically for:
At Sa ' and Sb ', in the situation that count in the position in a plurality of intervals, difference summation is greater than default Second Threshold, judgement Sb mistake.
CN201210227146.0A 2012-06-29 2012-06-29 A kind of track data check method and device Active CN103514233B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210227146.0A CN103514233B (en) 2012-06-29 2012-06-29 A kind of track data check method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210227146.0A CN103514233B (en) 2012-06-29 2012-06-29 A kind of track data check method and device

Publications (2)

Publication Number Publication Date
CN103514233A true CN103514233A (en) 2014-01-15
CN103514233B CN103514233B (en) 2018-06-12

Family

ID=49896964

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210227146.0A Active CN103514233B (en) 2012-06-29 2012-06-29 A kind of track data check method and device

Country Status (1)

Country Link
CN (1) CN103514233B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326398A (en) * 2016-08-19 2017-01-11 东软集团股份有限公司 Data consistency comparison method and device
CN108151745A (en) * 2017-12-25 2018-06-12 千寻位置网络有限公司 NMEA tracks difference automatically analyze and identification method
CN111127891A (en) * 2019-12-27 2020-05-08 中国交通通信信息中心 Road network checking method based on floating car track big data
CN111949688A (en) * 2019-05-16 2020-11-17 广州汽车集团股份有限公司 Method, client and server for sampling vehicle track data
CN112562315A (en) * 2020-11-02 2021-03-26 鹏城实验室 Method, terminal and storage medium for acquiring traffic flow information
CN117273811A (en) * 2023-11-21 2023-12-22 国网浙江省电力有限公司 Enterprise power consumption prediction data processing method based on power consumption data

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090099717A1 (en) * 2006-03-14 2009-04-16 Hidenori Yabushita Trajectory tracking control system and method for mobile unit
CN102194332A (en) * 2011-03-24 2011-09-21 中国船舶重工集团公司第七○九研究所 Self-adaptation flight path data correlation method
CN102375151A (en) * 2010-08-13 2012-03-14 国基电子(上海)有限公司 Electronic device and GPS error detection method thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090099717A1 (en) * 2006-03-14 2009-04-16 Hidenori Yabushita Trajectory tracking control system and method for mobile unit
CN102375151A (en) * 2010-08-13 2012-03-14 国基电子(上海)有限公司 Electronic device and GPS error detection method thereof
CN102194332A (en) * 2011-03-24 2011-09-21 中国船舶重工集团公司第七○九研究所 Self-adaptation flight path data correlation method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
雷小锋 等: "基于时空邻域的多粒度轨迹相似性查询", 《计算机研究与发展》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326398A (en) * 2016-08-19 2017-01-11 东软集团股份有限公司 Data consistency comparison method and device
CN108151745A (en) * 2017-12-25 2018-06-12 千寻位置网络有限公司 NMEA tracks difference automatically analyze and identification method
CN111949688A (en) * 2019-05-16 2020-11-17 广州汽车集团股份有限公司 Method, client and server for sampling vehicle track data
CN111127891A (en) * 2019-12-27 2020-05-08 中国交通通信信息中心 Road network checking method based on floating car track big data
CN112562315A (en) * 2020-11-02 2021-03-26 鹏城实验室 Method, terminal and storage medium for acquiring traffic flow information
CN117273811A (en) * 2023-11-21 2023-12-22 国网浙江省电力有限公司 Enterprise power consumption prediction data processing method based on power consumption data
CN117273811B (en) * 2023-11-21 2024-02-09 国网浙江省电力有限公司 Enterprise power consumption prediction data processing method based on power consumption data

Also Published As

Publication number Publication date
CN103514233B (en) 2018-06-12

Similar Documents

Publication Publication Date Title
CN103514233A (en) Trajectory data checking method and device
US9285242B2 (en) Determining a connectivity model in smart grids
Yap et al. Where shall we sync? Clustering passenger flows to identify urban public transport hubs and their key synchronization priorities
CN109143291B (en) Vehicle-mounted GPS track space index accurate matching method
CN106909372B (en) Method and system for calculating purchase path of mobile terminal user
CN103473612A (en) Site selection and transportation optimization method for super-large scale logistics distribution
CN103903474B (en) A kind of fleet's induced travel method based on K-means cluster
CN103020790A (en) Method and device for pre-sorting order
CN113077103B (en) Transportation network planning method and device
CN105989145B (en) Automatic updating method, device and system for bus line
WO2021072128A1 (en) Systems and methods for big data analytics
CN108957397A (en) The localization method and device of crusing robot
CN109772753B (en) Power battery sorting system and method
EP3135556A1 (en) Traffic system optimization device
CN104751369A (en) Improved line loss analysis method for K-means clustering
CN103617517A (en) Cost calculation and quotation model system
CN103473458B (en) Method for comparatively analyzing similarities of fold lines
CN101667219A (en) Optimization method of printed-circuit board test path
CN115207909B (en) Method, device, equipment and storage medium for identifying topology of platform area
CN108509576B (en) Electronic map-based filtering optimization method for wrong point positions of patrol track
WO2016206191A1 (en) Data processing method and device
CN110138892A (en) Determine the method and device of equipment regional information
CN106681967A (en) Meter changing operation method based on data backtracking mechanism
CN114550466B (en) Parking space state detection method and device and electronic equipment
CN107391728A (en) Data digging method and data mining device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant