Invention content
In view of this, the purpose of the present invention is to provide a kind of vehicle on highway speed based on mobile phone signaling big data
Computational methods are higher by the shorter calculating speed precision of distance apart from road grid to improve the precision of car speed estimation
The characteristics of, setting one is with apart from relevant best weight value function, being fused in traditional speed calculation formula, greatly improving
Computational accuracy.
In order to achieve the above objectives, the present invention provides following technical solution:
A kind of vehicle on highway speed calculation method based on mobile phone signaling big data, this method step are:
S1:The step of vehicle on highway speed calculates is carried out according to based on mobile phone signaling data, carries out mobile phone signaling number
According to Collecting and dealing;Then map match is carried out, establishes geographical grid system;Then highway user identification is carried out, is obtained
To for calculating the highway user signaling data track set H_D of expressway road grid average speed;
S2:After the highway user signaling data track set H_D for obtaining T to the T+t periods, order traversal highway
Subscriber signaling data track set H_D extracts the track sets of each user;
S3:Traversed the tracing point in the track sets of each user two-by-two to each user's sequence, tracing point is
Base sites include timestamp and geographical location information in the signaling data of base station, in user trajectory sequence per track two-by-two
Computing unit of the point as a calculating user velocity;
S4:According to the road grid that geographical grid system divides, corresponding speed container is set for each road grid, is used
In the weighting velocity amplitude that each computing unit of preservation obtains;
S5:According to the computing unit that traversal obtains, it is defined as tracing point TracenWith tracing point Tracem, calculate each single
The path distance of member and distance time;
S6:According to the thought of physical motion, by obtaining feature in the angle of statistical significance:Comprising road grid
The path distance of two tracing point computing units is shorter, then bigger to the contribution margin of the road grid average speed;According to this feature,
Set a Gauss weight function with path distance inverse correlation
S7:The weighting velocity amplitude of two tracing point computing units is calculated, by the path distance between two tracing points that are calculated
Dn,mWith distance time Tn,mRatio multiplied by with Gauss weight functionObtain the weighting velocity amplitude of the computing unitIt and will
The velocity amplitudeIt is put into the corresponding road grid velocity container between two tracing points,
S8:Step S2~S7 is repeated, until in the highway user signaling data track set H_D of T to T+t periods
User trajectory sequence all traverses completion;
S9:The weighting sets of speeds in each road grid velocity container is finally traversed, to adding in each road grid
Power sets of speeds carries out and operation obtains the average speed value V of the road gridk, 1≤k≤N,
Wherein l is expressed as l-th of subscriber signaling in k-th of section, NkRepresent the number of users in k-th of section,Represent the
N section is to the distance in m-th of section, 1≤k≤Nk,Represent n-th of section to the time in m-th of section, 1≤l≤Nk;
Further, the step S5 is specially:
S501:The tracing point of sequence traverse user track sets two-by-two, with per tracing point Trace two-by-twonAnd TracemAs
One computing unit then according to the matched geographical grid system in base station in step S1 and road grid, obtains two tracks
Point TracenAnd TracemBetween road grid subsequence { gn[an,bn],…,gm[am,bm], wherein [an,bn] represent grid position
Serial number is put, then two tracing point TracenAnd TracemDistance Dn,mIt is calculated by equation below:
The number of road grids of the wherein M between two tracing points, L are the length of road grid;
S502:Calculate tracing point TracenAnd TracemBetween journey time, if tracing point TracenTimestamp be
TimeStampn, tracing point TracemTimestamp be TimeStampm, then the journey time between two tracing points is Tn,m;
Tn,m=TimeStampn-TimeStampm。
The beneficial effects of the present invention are:The present invention is higher by the shorter calculating speed precision of distance apart from road grid
The characteristics of, setting one is with apart from relevant best weight value function, being fused in traditional speed calculation formula, greatly improving
Computational accuracy, to better meet the demand of ITS systems.
Specific embodiment
Below in conjunction with attached drawing, the preferred embodiment of the present invention is described in detail.
As shown in Figure 1, the present invention provides a kind of vehicle on highway side speed calculations based on mobile phone signaling big data
Method, step are:
Step 1, the rectangular grid for being defined to L according to length and width according to highway route carry out dividing the processing of grid section,
And a direction of setting high-speed highway is its positive direction, it can thus be appreciated that the grid route sequence of highway is G={ g1,g2,
g3,…,gn, g thereiniFor a road grid, whole raster combineds is a highway route.
Step 2 obtains the base station information data that expressway nearby monitors region, then establishes base station and super expressway grid
Match information table B_G, the match information table of base station and road includes base station location area number LAI, base station cell number CI, base
The longitude LNG to stand, the latitude LAC of base station, place grid number GD, matched road grid number GS.Wherein base station and road
The matching step of grid is as follows:
Step 2.1, according to the base station information table near expressway, can obtain base station grid sequence according to the grid processing of step 1
It is classified as B={ b1,b2,b3,…,bn}。
Step 2.2 obtains base station b according to the coverage area of base stationiRoad grid { the g of coveringn,…,gm, then basis
Euclidean distance formula DbigiCalculation base station biThe each road grid g included to itiDistance, be expressed as Dbg={ dn,…,
dm}。
Step 2.3 takes DbgThe minimum grid of middle distance is set to matched road grid G Si, step 2.1~2.3 are repeated, most
The match information table B_G of base station road is understood afterwards.
Step 3, all user mobile phone signaling track datas in the highw ay m onitoring region of acquisition T to T+t times, user
Mobile phone signaling data mainly includes unique ID, signaling data position area number LAI, base station cell number CI and timestamp
TimeStamp fields.Then the data predictions such as dirty data filtering, ping-pong are carried out to signaling data;Finally according to signaling
The timestamp sequencing of data is combined arrangement to the signaling track data of each user, obtains the user of monitoring time section
Mobile phone signaling data track set U_D.
Step 4 carries out height according to the match information table B_G of user mobile phone signaling data track set U_D and base station road
The matching judgement of fast highway user according to longest one of user mobile phone signaling data track section of effectively going on a journey, judges user
Whether be running on expressway user, and generate for calculate highway each road grid velocity highway
Subscriber signaling data track set H_D.Specific steps are as follows:
Step 4.1, the user trajectory of extraction mobile phone signaling data track set U_D carry out the segmentation of effectively trip section, take
The longest sequence of trip section sequence, which is used as, judges sequence.
Step 4.2 obtains longest effective trip section track sets Trace={ bn,…,bm(wherein, biRepresent base station
Grid serial number, sequencing arrangement temporally), according to N number of base station of Trace track sets, count b in Trace sequencesi
The base station number N being present in the match information table B_G of base station road1, calculate Trace tracks and the match information of base station road
Similarity λ=N of table B_G1/ N sets a threshold value M, as λ >=M, is then determined as height to determine whether for highway user
Fast highway user simultaneously jumps to step 4.3;Otherwise it is determined as non-freeway user.
Step 4.3, according to the grid serial number of track sets in Trace { n ..., m } be incremented by and successively decrease determine public affairs at a high speed
The forward travel and backward going of road user is then forward travel if it is being incremented by, otherwise backward going.By judging, finally
Obtain highway user signaling data track set H_D.
Step 5, the average speed that highway grid section is carried out according to highway user signaling data track set H_D
Degree calculates.The track sets of each user are extracted from the set H_D of highway user signaling data track, are then traversed two-by-two
Tracing point in the track sets of each user, tracing point is base sites, include in the signaling data of base station timestamp and
Geographical location information, as the computing unit of a calculating speed, is then calculated using in user trajectory sequence per tracing point two-by-two
Each path distance of unit and distance time, with the ratio of path distance and distance time multiplied by anti-with path distance with one
Relevant Gauss weight function obtains the velocity amplitude of a Weighted distance, and is put into corresponding road grid velocity container,
Finally the velocity amplitude in road grid velocity container is carried out and operation obtains the average speed value of the road grid.It is specific
Step is as follows:
Step 5.1 sets corresponding speed container for each road grid, for preserve it is that each computing unit obtains plus
Weigh velocity amplitude.
User trajectory sequence in step 5.2, order traversal extraction highway user signaling data track set H_D, so
The tracing point of sequence traverse user track sets two-by-two afterwards, with per tracing point Trace two-by-twonAnd TracemIt is calculated as one single
Member then according to the matched geographical grid system in base station in step 1 and road grid, obtains two tracing point TracenWith
TracemBetween road grid subsequence { gn[an,bn],…,gm[am,bm], wherein [an,bn] represent grid positions serial number, then
Two tracing point TracenAnd TracemDistance Dn,mIt can be calculated by equation below:
The number of road grids of the wherein M between two tracing points, L are the length of road grid.
Step 5.3 calculates tracing point TracenAnd TracemBetween journey time, if tracing point TracenTimestamp
For TimeStampn, tracing point TracemTimestamp be TimeStampm, then the journey time between two tracing points is Tn,m。
Tn,m=TimeStampn-TimeStampm
Step 5.4, the thought according to physical motion, by being understood in the angle of statistical significance comprising road grid
The path distance of two tracing point computing units is shorter, then bigger to the contribution margin of the road grid average speed.According to this feature,
Set a Gauss weight function with path distance inverse correlation
Step 5.5, the weighting velocity amplitude for calculating two tracing point computing units, by the distance between two tracing points that are calculated
Distance Dn,mWith distance time Tn,mRatio multiplied by with Gauss weight functionObtain the weighting velocity amplitude of the computing unit
And by the velocity amplitudeIt is put into the corresponding road grid velocity container between two tracing points.
Step 5.6 repeats step 5.2~5.5, until the highway user signaling data track collection of T to T+t periods
It closes user trajectory sequence in H_D and all traverses completion.
Step 5.7 finally traverses weighting sets of speeds in each road grid velocity container, in each road grid
Speed collection carry out and operation obtain the average speed value V of the road gridk(1≤k≤N)。
Wherein l is expressed as l-th of subscriber signaling in k-th of section, NkRepresent the number of users in k-th of section,Represent the
N section is to the distance (1≤k≤N in m-th of sectionk),Represent n-th of section to m-th of section time (1≤l≤
Nk)。
Fig. 2 is the circuit mapping graph of highway geography grid system;Fig. 3 includes a section grid not for expressway
With the user trajectory figure of path length.
Finally illustrate, preferred embodiment above is merely illustrative of the technical solution of the present invention and unrestricted, although logical
It crosses above preferred embodiment the present invention is described in detail, however, those skilled in the art should understand that, can be
Various changes are made to it in form and in details, without departing from claims of the present invention limited range.