CN109409499B

CN109409499B - Track recovery method based on deep learning and Kalman filtering correction

Info

Publication number: CN109409499B
Application number: CN201811101999.3A
Authority: CN
Inventors: 王静远; 吴宁
Original assignee: Beihang University
Current assignee: Beihang University
Priority date: 2018-09-20
Filing date: 2018-09-20
Publication date: 2022-03-15
Anticipated expiration: 2038-09-20
Also published as: CN109409499A

Abstract

The invention discloses a track recovery method based on deep learning and Kalman filtering correction, which comprises the following specific steps of S1 discretization of track points; s2 modeling a recurrent neural network and a track; s3 track recovery; s4, obtaining an attention model based on a sequence-to-sequence model by utilizing a space-time attention mechanism; s5, Kalman filtering and a recurrent neural network are combined, Kalman filtering optimization mean square error is introduced, Kalman filtering and the attention model are trained in a coordinated mode, and a final model is obtained. The invention provides a track recovery method based on deep learning and Kalman filtering correction, which models a transfer rule between track points through a recurrent neural network, helps track recovery by using an attention mechanism in the deep learning, and finally introduces Kalman filtering to model the movement of an object in time and space, thereby reducing the unexplainable property and the error of a deep learning model, having stronger interpretability and reducing the error of track recovery.

Description

Track recovery method based on deep learning and Kalman filtering correction

Technical Field

The invention relates to the field of track data mining, in particular to a track recovery method based on deep learning and Kalman filtering correction.

Background

The biggest difference between current solutions to the problem of track recovery compared to past track recovery solutions is that existing solutions are data driven. By mining from a large amount of historical data, analysis can be performed to restore the trajectory from a low sampling rate to a high sampling rate with great accuracy.

However, the existing solutions still have insufficient utilization of data, most solutions only simply count the data and then search out the optimal trajectory through a complex heuristic search scheme, and none of the solutions can capture the complex transfer rule between trajectory points.

On the other hand, with the rise of deep learning, although a deep learning model can capture complex rules in data and can be fused with multivariate information for help, the behavior of the deep learning model is difficult to explain, and the deep learning model cannot display modeling spatio-temporal information, which also causes problems when modeling tracks.

Therefore, it is an urgent need to solve the problem of the art to provide a trajectory recovery method based on kalman filter correction, which can capture deep information of trajectory data and has better interpretability.

Disclosure of Invention

In view of the above, the invention provides a track recovery method based on deep learning and kalman filtering correction, which models a transfer rule between track points through a recurrent neural network, helps to perform track recovery by using an attention mechanism in the deep learning, and finally introduces kalman filtering to model the movement of an object in time and space, thereby reducing the unexplainable property and the error of a deep learning model, having stronger interpretability, and reducing the error of the track recovery.

In order to achieve the above purpose, the invention provides the following technical scheme:

a track recovery method based on deep learning and Kalman filtering correction comprises the following specific steps:

discretization of the trace points of S1: dividing a geographical area into segments of length l_aA disjoint grid of; the set of the grids is recorded as J, the J stores the coordinates of the central points of all the grids according to rows, and the central coordinates are recorded as grid id;

s2 modeling the recurrent neural network and the track: modeling the discretized track points by utilizing a cyclic neural network to obtain an incomplete track point sequence, wherein the incomplete track point sequence does not comprise missing track points;

s3 trajectory recovery: the incomplete track point sequence is coded into a context vector by using a track recovery model from the sequence to the sequence, and then a decoder predicts the missing track points through the context vector;

s4 attention mechanism in space-time: the sequence-to-sequence trajectory recovery model focuses attention on the incomplete trajectory point sequence by utilizing spatio-temporal information, all points in the incomplete trajectory point sequence are used, but the weight of points considered to be important by an algorithm is larger, and the weight of unimportant points is smaller. By focusing the attention of the model on partial key points in the incomplete track point sequence, an attention model based on the sequence-to-sequence model can be obtained;

s5 introduces kalman filtering to modify the attention model: and combining Kalman filtering and a cyclic neural network, introducing Kalman filtering optimization mean square error, and cooperatively training Kalman filtering and the attention model to obtain a final model.

Through the technical scheme, the invention has the technical effects that: introducing attention in the track recovery task by attention mechanism, the sequence-to-sequence model can learn to focus attention on a particular portion of an incomplete track rather than relying on the hybrid output of the encoder. Further, based on the initial attention mechanism from the sequence to the sequence model, we propose a spatio-temporal attention mechanism to capture spatio-temporal variations in long-term trajectories; combining Kalman filtering and a recurrent neural network to better model a track, and introducing priori knowledge to help to evaluate the real state of a track point; in addition, the concise form of Kalman filtering can be realized by matrix operation, so that the method can be conveniently combined with a neural network, and because the Kalman filtering can be written into the form of a matrix, the neural network versicolor filtering which is efficient and supports batch processing can be obtained.

Preferably, in the above trajectory recovery method based on deep learning and kalman filter correction, the trajectory points in S1 are mapped onto a grid, and the trajectory points are represented by a grid id; the input of the recurrent neural network is a grid id, and the output of the recurrent neural network is a predicted grid id to which the position belongs; trace point p_iFrom a moving object and recorded in the form of a tuple (x, y, s, id); p is a radical of_iX is longitude, p_iY is the latitude, p_iS is the time stamp of this point, p_iId is the id of the location to which this point belongs.

Through the technical scheme, the invention has the technical effects that: the space-time efficiency of the algorithm is improved by mapping the limited individuals in the infinite space into the limited space.

Preferably, in the above trajectory recovery method based on deep learning and kalman filtering correction, the recurrent neural network in S2 includes but is not limited to: a long-time memory model and a threshold control unit model.

Through the technical scheme, the invention has the technical effects that: the two models solve the problem of gradient disappearance existing in a common recurrent neural network through a gate mechanism, so that the two models can be used for modeling a longer track sequence and capturing a longer-term dependency relationship.

Preferably, in the above-mentioned trajectory recovery method based on deep learning and kalman filter correction, in S3, the incomplete trajectory point sequence t is^incAs input, the corresponding complete trajectory t is restored based on the input^comThe method comprises the following specific steps: the encoder calculates a representation s from the input sequence, the decoder generates a word at each moment, the incomplete track point sequence is encoded into a context vector based on a track recovery model from sequence to sequence, the decoder predicts the missing points with the help of the context vector, and the conditional probability distribution is as follows:

wherein, t^comIs a track with the length n and is ordered according to time and keeps a constant sampling interval epsilon;

t^com＝b₁→b₂→...→b_n；

t^incis a complete trajectory t of size m^comA subset of (a);

t^inc＝a_j1→a_j2→...→a_jm；

s31 models an encoder of known sequence: the grid id is used as the input of an encoder, the input is converted into a low-dimensional vector by an embedded layer, and the low-dimensional vector is sent into a two-way long-time and short-time memory model; the bidirectional long-time and short-time memory model comprises a forward LSTM and a backward LSTM;

forward LSTM reads in sequence the input vector sequence to calculate a forward hidden state sequence

The incoming vector sequence is read into the LSTM in reverse order,and calculating a backward hidden state sequence

At j (h)_kOutput of time h_jkIs composed of

And

sum of, is recorded as

S32 decoder for reconstructing missing tracks: when the decoder generates the next point, if the point is a known track point, the known track point is selected to be copied as the output of the neural network, and the copying mechanism is formally written as:

suppose j_k≤i＜j_k+1Adding end point limitation to the prediction stage to model local track information and decode missing points

The rewrite is:

hidden state h_iComprises the following steps: h is_i＝LSTM(b_i-1,e_i,h_i-1)；(4)

e_iThe method is characterized in that the method is an end point limiting vector, and the end point limiting vector comprising a local track starting point and an end point is introduced; the endpoint limit vector at time i is denoted as e_i，

g_eIs limited in the forward direction

And backward limitation

Capturing endpoint limit information as an input; the function consists of an embedded layer and a multilayer perceptron; all parameters of the embedding layer are shared; for the training of the sequence-to-sequence recovery model, the following is written:

where D represents the training set.

Through the technical scheme, the invention has the technical effects that:

preferably, in the above trajectory recovery method based on deep learning and kalman filter correction, in S4, attention is paid to the introduction of the mechanism, and formula (4) is written as:

h_j＝LSTM(b_i-1,e_i,h_i-1,c_i) (7)；

context information c_iExpression (c):

wherein, c_iWeighting the hidden state output by the encoder i at the moment; s_iIncluding the state of the encoder at time i;

the weight of each hidden state is calculated via softmax:

wherein u is_ij＝v^Ttanh(W_hh_i+W_ss_j) (10); wherein v is a meterCalculating a dimensionality reduction vector used by the inner product;

W_his a decoder output vector transform matrix; w_sIs the encoder output vector transform matrix;

u_ik＝v^Ttanh(W_hh_i+W_ss_k)，k＝1，2，…，m；

since fixed time intervals are considered, the temporal distance can be obtained by the relative position of the sequences;

in addition, the higher the similarity degree of points with similar positions; the temporal distance between the ith point to be predicted and the kth input trajectory point d

Is defined as:

distance in space

Is defined as

Converting the time distance and the space distance into vectors by using a time distance embedding matrix T and a space distance embedding matrix G, and respectively recording the vectors as

And

finally, the spatiotemporal attention weight is calculated as:

wherein, W_tdIs a time-distance vector transformation matrix; w_sdIs a spatial distance vector transformation matrix.

Through the technical scheme, the invention has the technical effects that: by means of vector addition, spatio-temporal information, position representation information or high-level semantic information is fused into the weight of the attention mechanism, so that the effect of the original attention mechanism on a track recovery task is greatly improved. From another perspective, attention weights add a temporal and spatial regularization term, which helps to alleviate overfitting problems caused by attention mechanisms.

Preferably, in the above trajectory recovery method based on deep learning and kalman filtering correction, the step S5 of introducing kalman filtering to correct the trajectory recovery model includes:

general description of the S51 kalman filter: kalman filtering will bring noisy measurement coordinates z_iAnd the corresponding covariance matrix R_iCalculating posterior state estimates as revenue

Here, the

And

the velocities in the x and y directions at time i are estimated; the output of the kalman filter at time i is recorded as:

wherein H_iIs a measurement matrix describing the relationship between the posterior state and the measurement state, and subscripts (i, i) represent the posterior state at time i; hidden state of Kalman filtering at moment i { x_(i,i),C_(i,i)By x_(i,i)And the corresponding covariance matrix C_(i,i)Composition is carried out;

s52 measured covariance estimated at time i by soft max vector calculated by formula (3) and formula (4)

And estimated location

Written as follows:

wherein m is_i＝J^TI_i，

In particular when

When generated by a copy mechanism,

will be set to a constant matrix close to 0; j is a matrix of n x 2 in shape, n being the number of grids; j. the design is a square_kIs the coordinates of the kth grid; i is_kCalculating the kth dimension of the softmax vector for the neural network at the ith moment;

time-based backpropagation on the S53 Kalman Filter, loss function of Kalman Filter:

wherein o is_iIs the output of Kalman filtering at time i, b_iIs the actual position at time i;

for the process covariance at time t, the gradient at time t + n is derived as:

s54 final loss function expression:

L＝L₁+L₂ (21)；

wherein L is₁Training of a recovery model representing sequence-to-sequence; l is₂Representing the mean square error used to optimize kalman filtering.

Through the technical scheme, the invention has the technical effects that: and combining Kalman filtering and a cyclic neural network, introducing Kalman filtering optimization mean square error, and cooperatively training Kalman filtering and the attention model to obtain a final model.

Preferably, in the above trajectory recovery method based on deep learning and kalman filter correction, the specific step of calculating the posterior state estimation by using the S51 kalman filter includes:

s511 predicts: the prediction process uses the a posteriori state estimate of the previous time instance { x }_(i-1,i-1),C_(i-1,i-1)To produce an a priori state estimate at time i x_(i,i-1),C_(i,i-1)}；

Wherein x is_(i,i-1)＝Φ_i-1x_(i-1,i-1)(22)；

C_(i,i-1)＝Φ_i-1x_(i-1,i-1)Φ_i-1 ^T+Q_i-1(ii) a The subscript (i, i-1) is used to indicate the prior state at time i based on the posterior state at time i-1; Φ is the state transition matrix of kalman filtering;

s512, updating: the update process incorporates a priori state estimates { x }_(i,i-1),C_(i,i-1)And measurement state z_i,R_iGet the posterior state estimate { x }_(i,i),C_(i,i)}；

x_(i,i)＝x_(i,i-1)+K_i(z_i-H_ix_(i,i-1)) (24)；

C_(i,i)＝(I-K_iH_i)C_(i,i-1) (25)；

R_iIs a constant matrix; q_iIs the process covariance matrix at time i.

Through the technical scheme, the invention has the technical effects that: the covariance describes the confidence in the internal motion model of the kalman filter, and the lower the covariance, the more biased the prediction results towards the a priori state estimation of the kalman filter.

According to the technical scheme, compared with the prior art, the invention discloses a track recovery method based on deep learning and Kalman filtering correction, a transfer rule between track points is modeled through a recurrent neural network, the track recovery is assisted by an attention mechanism in the deep learning, and finally Kalman filtering is introduced to model the movement of an object in time and space, so that the unexplainable property and the error of a deep learning model are reduced, the interpretability is stronger, and the error of the track recovery is reduced.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.

FIG. 1 is a block diagram of an overall model of the present invention;

FIG. 2 is a block diagram of a sequence-to-sequence model of the present invention;

FIG. 3 is a drawing of an attention model of the present invention;

FIG. 4 is a model of the Kalman filtering and recurrent neural network combination of the present invention;

FIG. 5 is a graph illustrating the effect of embedding vector magnitude according to the present invention;

FIG. 6 is a graph illustrating the effect of the grid length of the present invention;

FIG. 7 is a diagram of a trace restore thumbnail of the present invention;

FIG. 8 is a graph of an attention weight visualization of the present invention;

fig. 9 is a visualization diagram of the internal state of the kalman filter on a divided grid space.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The embodiment of the invention discloses a track recovery method based on deep learning and Kalman filtering correction, wherein a transfer rule between track points is modeled through a cyclic neural network, the track recovery is assisted by an attention mechanism in the deep learning, and finally Kalman filtering is introduced to model the movement of an object in time and space, so that the unexplainable property and the error of a deep learning model are reduced, the higher interpretable property is realized, and the error of the track recovery is reduced.

And (3) construction of a data set:

to measure the trace recovery performance of our model, three real trace datasets were used for evaluation, from beijing, shenzhen and polo graphs (spanish second major city), respectively. The taxi data set from beijing is sampled once per minute, but the data set sampling frequency of shenzhen is not uniform. Since the sampling interval of the Shenzhen dataset is below five minutes, the trace is preprocessed into a five minute time interval. The public trajectory data set from the bohr diagram is provided by a trajectory prediction competition on kaggle (a well known data mining competition platform). The sampling interval of the raw data was 15 seconds, which we preprocessed into one minute intervals. The three cities have completely different road network structures and traffic conditions, and detailed statistical characteristics of the data set are summarized in detail in table 1.

In the preprocessing, we first need to divide the target region into disjoint grids and map the coordinates to grid numbers. In addition to this, it is necessary to ensure that the time intervals in one data set are consistent. Different mesh lengths are selected for different data sets. The length of grids of the Beijing and Boerhen graph data sets is 100m, and the length of the grids of the Shenzhen data set is 200 m; firstly, grid division, then trajectory discretization, next, immobile point removal, ultralong trajectory cutting, and finally, training data, verification data and test data division.

To accomplish these steps, a number of data visualization mechanisms need to be used, using a LEAFLET to construct the map, and using an online map api to access the map data and then visualize portions of the trajectory data. Meanwhile, regions corresponding to the data set are selected, and regions with sparse track points are deleted by the user, so that the calculation amount of the neural network is reduced; and finally, selecting a proper grid length for each data set. The too small grid length can cause the too large vocabulary to be processed by the neural network, and the too large vocabulary can cause the slow running speed of the neural network, greatly improve the occupied video memory and be very difficult to converge in a proper time. Too large a grid length leads to inaccurate prediction of the model, and an excessively large grid inherently brings a high error although the kalman filter can make corrections.

TABLE 1

Data set	Beijing	Shenzhen (Shenzhen medicine)	Boer diagram
				Data coverage duration	One month	Two weeks	One year
Number of taxis	18298	17998	442
				Number of tracks	313560	149230	284100
Number of tracing points	31356000	1492300	8523000
				Number of positions	15870	19306	6351
Sampling interval	One minute	Five minutes	One minute

And (3) setting a task: for each data set, the data set is divided into three parts, a training set, a verification set and a test set. The training set is used to train the model, and the hyperparameters, such as the size of embedding, or the length of the trellis, are selected by the validation set. The final results were evaluated on a test set. In all of our experiments, seventy percent of the traces were used as the training set, ten percent of the traces were used as the validation set, and the remaining twenty percent of the traces were used as the test set. For each incomplete track, it is generated by random sampling of the corresponding complete track. The sampling rate is defined as the ratio of the length of the incomplete track to the length of the complete track. To study the ratio change of the DRKF model under different environments, the change of the model effect at thirty percent, fifty percent and seventy percent sampling rate of the DRKF was tested. The constant sampling strategy can cause the deviation of results, so that each batch is sampled again in the training, verifying and testing processes. All evaluations on the test set were repeated three times.

A method of comparison is needed that compares our model with two traditional models and two deep learning models on three data sets. These methods were evaluated by four evaluation indexes. A generic sequence-to-sequence model and an attention model based on the sequence-to-sequence model are selected as the baseline model. The baseline model has a good coverage for the correlation model.

And (3) the RICK builds a routing table for the uncertain track, and is used for responding to the online inquiry of the user by searching the top-k track on the graph.

MPR: the most popular tracks can be found from a transition network based on breadth-first searching.

Seq2Seq: it uses the encoder to compress an incomplete track into a vector from which its decoder recovers a complete track.

Bahdanauthention, based on a sequence-to-sequence model, will focus attention on important points in incomplete tracks.

And setting parameters, namely all models have some parameters to be adjusted, and optimizing each model respectively according to the reported optimal parameters or through a verification set. The invention uses a two-layer LSTM with an embedding size of 512 dimensions.

Results and analysis:

the performance of all the methods has been written in table 2, and the following phenomena can be observed:

(1) first, our proposed DRKF model is stronger than other models in both three datasets and four evaluation indices. The neural network architecture of the present invention evolves from a sequence-to-sequence model and attention mechanism. Experimental results show that the traditional sequence-to-sequence model and attention mechanism are not so efficient at the trajectory recovery task, which our DRKF fits well.

(2) Second, RICK, a conventional a-algorithm based model, performed very good results in experiments, whereas a-algorithm required a long time to search for an optimal trajectory and it was very difficult to parallelize. Moreover, MPR does not work well for any situation, as it is difficult to adapt to situations with thousands of positions. Too high a calculation amount may result in MPR not giving a proper result in a proper time. The data set includes an area of hundreds of square kilometers, containing tens of thousands of locations. The temporal complexity of MPR is really so high that it is necessary to reduce the number of bins and increase the area of the bins to obtain the result in a time that can be tolerated, resulting in a very fast degradation of performance.

(3) Finally, when the data set and sampling rate are the same, the euclidean distance will always be greater than the NDTW distance, since the phenomenon of time drift is very common between the predicted trajectory and the actual trajectory. The ratio of the euclidean distance to the NDTW distance should demonstrate that the time-shift phenomenon is significant in both deep learning models. When the sampling rate is seventy percent, the ratio of euclidean distance to NDTW distance of the DRKF is smaller than the sequence-to-sequence model and the bahdana u attention mechanism, which means that the DRKF can partially mitigate this phenomenon with the help of kalman filtering. In addition, when the sampling rate increases, the error rate of the DRKF decreases faster than the MPR and RICK, which indicates that the DRKF can better utilize the information provided in the incomplete tracks.

TABLE 2

Analysis of the depth trajectory recovery model:

the space-time information is very important context information in the track recovery task, and a space-time attention mechanism and a Bahdana attention mechanism are compared, so that the difference of the attention mechanisms is avoided; the performance differences of the bidirectional recurrent neural network and the unidirectional recurrent neural network as encoders were also compared. Consider the variation of the effect of each model at five sampling rates, 30%, 40%, 50%, 60% and 70%, under different sampling rates. As shown in table 3, the bahdana attention mechanism may help the subsequence-to-sequence model to some extent, but this help is very limited and it is found that both temporal and spatial information can improve performance. The best performance of the spatio-temporal attention mechanism proposed in this application means that the present invention does find an effective way to introduce spatio-temporal information into the attention mechanism. The use of bi-directional LSTM does improve performance; in addition, the effect of the model is greatly improved along with the rise of the sampling rate, and the error of the two-way LSTM model is reduced fastest when the sampling rate rises, so that the characteristics can be more effectively extracted from the input sequence, the track recovery task is completed, and finally the STA + BiLSTM item is adopted in the deep learning part of the model.

TABLE 3

Analysis of Kalman Filter:

the percentage of error reduction is used as a measure. In table 4, comparing the difference in performance between the static method and the dynamic method, the effect of the dynamic covariance is better because the confidence of each predicted position can be evaluated by changing the covariance size. If the estimated covariance is very large, the confidence of the neural network prediction is not high, and therefore some predicted low confidence points may be modified by kalman filtering. Obviously, kalman filtering becomes more efficient as the sampling rate increases, because more samples can help kalman filtering to give a more accurate a priori state estimate, thus effectively reducing noise from the neural network. .

TABLE 4 impact of different covariance evaluation methods

And (3) optimizing parameters:

the effect of the hyper-parameters on performance was studied on the data set of Beijing. In all baseline models, the bahdana u attention mechanism performs very well and has similar superparameters as the DRKF. Therefore, bahdana u attention was made as the only reference baseline model. The Embedding dimension is a very important hyper-parameter of the recurrent neural network, and directly influences the capacity of the model. In fig. 5, the hidden state size is set to 128, 256, 384, 512, 640. Through experiments on the validation set, it was found that the Embedding dimension of 512 can provide the best performance for the two models. After the Embedding dimension is adjusted, the Embedding dimension is fixed to 512, and then the influence of different grid lengths is compared. In fig. 6, the grid lengths are set to 80m, 100m, 120m, 140m, 160 m. With a grid of 100m length giving the best results. When the grid length is reduced to 80m, the number of positions is too large, the training of the model is difficult, and therefore the prediction error starts to rise.

As shown in fig. 1, from left to right, the extraction of spatio-temporal information, the encoder composed of bi-directional LSTM, the calculation of spatio-temporal attention mechanism, and the decoder of subsequence to sequence model are sequentially performed. The legend on the right indicates the names of several neural network elements, including bi-directional LSTM, Kalman filtering, and uni-directional LSTM. As shown in fig. 2, a block diagram of the sequence-to-sequence model illustrates how sequence recovery occurs through end-point restriction and copy mechanisms. As shown in fig. 3, trajectory recovery is performed using temporal and spatial information by an attention mechanism. As shown in fig. 4, the output of the neural network is used instead of the output of the kalman filter, and a co-training algorithm is proposed. As shown in fig. 5, which shows how a complete track is restored and the change of attention weight in the restoration process, fig. 7 is a thumbnail of track restoration drawn by using a leaf, and shows the general effect of track restoration. As shown in fig. 8, the attention weight matrix is drawn by using matplotlib. The darker the color, the greater the weight of the representation, that is, the greater the input to output relationship, with the horizontal axis being the restored point and the vertical axis being the input point. An example of trace recovery is shown, where thirty sampled points are used to recover the original one hundred points. In fig. 7, the purple icon is the sampled point and the red line is the corresponding recovered trace. Fig. 8 illustrates the change in attention weight during recovery between a recovered point and an incomplete trace point. The upper matrix of FIG. 8 illustrates that it is difficult for the conventional attention mechanism to estimate the weights of all sampled points, and the lower matrix of FIG. 8 illustrates that our spatiotemporal attention mechanism can compute a more reasonable attention weight with the help of spatiotemporal information. As shown in fig. 9, the correction process of the kalman filter is demonstrated, and the two-dimensional gaussian distribution represents the prior, posterior, and measurement states of the kalman filter. The purple dots in the figure are the real dots that need to be restored. In the prediction stage, Kalman filtering makes an a priori state estimation, the result of the estimation is very close to the target position, and the prediction of the neural network deviates a lot from the target. However, after the update phase, the result of the correction is very close to the target. It can also be seen that the kalman filter is essentially a linear prediction, and based on the internal states estimated at times i-2 and i-1, the kalman filter makes a linear extrapolation, taking into account time and vehicle speed, and therefore having a considerable reference value, whereas the uncertainty of the neural network on its own prediction ultimately leads to the model decision to be more accurate than the kalman filter internally predicts. As can be seen from the figure, the Gaussian distribution predicted by Kalman filtering is obviously brighter than that predicted by a neural network, and the prior state estimation of Kalman filtering is more reliable.

The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. A track recovery method based on deep learning and Kalman filtering correction is characterized by comprising the following specific steps:

discretization of the trace points of S1: dividing a geographical area into segments of length l_aA disjoint grid of; the set of the grids is recorded as J, the J stores the coordinates of the central points of all the grids according to rows, and the coordinates of the central points are recorded as grid id;

s4 attention mechanism in space-time: the sequence-to-sequence track recovery model focuses attention on the incomplete track point sequence by utilizing spatio-temporal information to obtain an attention model based on the sequence-to-sequence model;

2. The trajectory recovery method based on deep learning and kalman filtering correction according to claim 1, wherein the trajectory points in S1 are mapped onto a grid, and the trajectory points are represented by a grid id; the input of the recurrent neural network is a grid id, and the output of the recurrent neural network is a predicted grid id to which the position belongs; trace point p_iFrom a moving object and recorded in the form of a tuple (x, y, s, id); p is a radical of_iX is longitude, p_iY is the latitude, p_iS is the time stamp of this point, p_iId is the id of the location to which this point belongs.

3. The trajectory recovery method based on deep learning and kalman filtering modification according to claim 1, wherein the recurrent neural network in S2 includes but is not limited to: a long-time memory model and a threshold control unit model.

4. The method for trajectory recovery based on deep learning and Kalman filtering correction as claimed in claim 1, wherein in S3, the incomplete trajectory point sequence t^incAs input, baseRestoring the corresponding complete trajectory t from the input^comThe method comprises the following specific steps: the encoder calculates a representation s from the input sequence, the decoder generates a word at each moment, the incomplete track point sequence is encoded into a context vector based on a track recovery model from sequence to sequence, the decoder predicts the missing points with the help of the context vector, and the conditional probability distribution is as follows:

wherein, t^comIs a track with the length n and is ordered according to time and keeps a constant sampling interval epsilon; t is t^com＝b₁→b₂→…→b_n；

t^incIs a complete trajectory t of size m^comA subset of (a);

t^inc＝a_j1→a_j2→…→a_jm；

Backward LSTM reads in the input vector sequence in reverse order and calculates a backward hidden state sequence

At j (h)_kOutput of time of day

Is composed of

And

sum of, is recorded as

The rewrite is:

g_eIs limited in the forward direction

And backward limitation

where D represents the training set.

5. The trajectory recovery method based on deep learning and kalman filtering modification according to claim 1, wherein in S4, attention is paid to the introduction of a mechanism, and formula (4) is written as:

h_j＝LSTM(b_i-1,e_i,h_i-1,c_i) (7)；

context information c_iExpression (c):

wherein, c_iWeighting the hidden state output by the encoder i at the moment; s_iIncluding the state of the encoder at time i; the weight of each hidden state is calculated via softmax:

wherein u is_ij＝v^Ttanh(W_hh_i+W_ss_j) (10); wherein v is a dimensionality reduction vector used to compute the inner product; w_hIs a decoder output vector transform matrix; w_sIs the encoder output vector transform matrix; u. of_ik＝v^Ttanh(W_hh_i+W_ss_k)，k＝1，2，…，m；tan h (·) is an arctangent function;

Is defined as:

distance in space

Is defined as

And

finally, the spatiotemporal attention weight is calculated as:

6. The method for trajectory recovery based on deep learning and kalman filter correction according to claim 1, wherein the step S5 of introducing kalman filter to correct the trajectory recovery model includes: one of S51 Kalman filteringDescription of the general terms: kalman filtering will bring noisy measurement coordinates z_iAnd the corresponding covariance matrix R_iCalculating posterior state estimates as revenue

Here, the

And

the velocities in the x and y directions at time i are estimated; the output of the kalman filter at time i is recorded as: o_i＝H_ix_(i,i)(15)；

s52 measured covariance estimated at time i by using softmax vector calculated by equations (3) and (4)

And estimated location

Written as follows:

wherein m is_i＝J^TI_i，

When in use

When generated by a copy mechanism,

for the process covariance at time t, the gradient at time t + n is derived as:

s54 final loss function expression:

L＝L₁+L₂ (21)；

7. The trajectory recovery method based on deep learning and Kalman filtering correction according to claim 6, wherein the S51 Kalman filtering calculation posterior state estimation specific step comprises:

Wherein x is_(i,i-1)＝Φ_i-1x_(i-1,i-1) (22)；

x_(i,i)＝x_(i,i-1)+K_i(z_i-H_ix_(i,i-1)) (24)；

C_(i,i)＝(I-K_iH_i)C_(i,i-1) (25)；

R_iIs a constant matrix; q_iIs the process covariance matrix at time i.