CN110826698A

CN110826698A - Method for embedding and representing crowd moving mode through context-dependent graph

Info

Publication number: CN110826698A
Application number: CN201911066089.0A
Authority: CN
Inventors: 刘芳; 钟婷; 周帆
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2019-11-04
Filing date: 2019-11-04
Publication date: 2020-02-21

Abstract

The invention provides a novel method for representing the movement of people through a context-dependent graph embedding model. The method comprises the steps of firstly generating a fully-connected context correlation graph according to historical tracks of a user, firstly realizing the application of a word embedding technology to a graph model, learning to fuse the sign-in point vector representation of context semantic information, further obtaining the vector representation of each track through a recurrent neural network, and then introducing a reinforcement learning method to find out a generator of the current track. In the whole training process of the model, the method synchronously trains by using the tracks with the labels and the tracks without the labels, and meanwhile, the method also updates the model parameters by using a strategy gradient method in the counterstudy, thereby further enhancing the performance of the model.

Description

Method for embedding and representing crowd moving mode through context-dependent graph

Technical Field

The invention belongs to the field of Neural networks in machine learning, and relates to a deep learning-based method, which is characterized in that a word embedding technology is applied to a graph model for the first time, the sign-in point vector representation of context semantic information is learnt to be fused, the vector representation of each track is obtained through a Recurrent Neural Network (RNN), and then the implicit movement mode of a user is extracted from track data with a label and without the label by utilizing reinforcement learning, so that the user generating the current track is found out.

Background

In a Social networking service (lbs n) application Based on geographic Location information, such as personalized Location recommendation, travel planning, and crowd movement prediction, learning a crowd movement law is an important and complex task, and the task has a very wide application scenario, for example: and finding out the user generating the current track according to the learned crowd mobility rule, and further finding out the friendship of the user, thereby realizing the prediction of the corresponding friendship of the current track. In terms of practical application, the invention can be used to help find extreme organizations in society, such as: given a set of anonymous tracks, identifying a set of criminals/terrorists, or their partnerships, using only the track information; it can also be used for friendship reconstruction, i.e. reconstructing social circles from the respective trajectories.

In recent years, deeply learned models such as Recurrent Neural Networks (RNNs) often utilize low-dimensional embedding techniques to model trajectory information, such as embedded representations of learned check-in points or embedded representations of trajectories. RNN-based methods basically capture semantic information from those continuous variable-length sequences of check-in points, and each RNN stores information from previous units that affects subsequent units. Gradient descent algorithms, such as SGD, Adam, and AMSGRAD, are then typically used to optimize the learnable parameters for a particular task in the RNN model.

However, existing work still faces the following challenges in effectively modeling user movement patterns: (1) there is a lack of a corpus for learning the check-in point embedded representation. The difference between spatio-temporal data processing and word embedding in the natural language processing field is that a rich corpus can be used to train dense representations of different words in the text processing process of natural language, and semantic information contained in each word representation is different for different natural language processing tasks. However, for social network data based on geographic location, semantic information contained in the social network data is obviously insufficient, because most users usually only access a small number of interest points, and because of irregularity of movement tracks and sparseness of check-in points, it is difficult to represent a certain check-in point by using the existing interest points. (2) The restrictions of the context information are ignored. In a given trajectory, different check-in points have their respective geographic location information and check-in time information, which means that each check-in point is limited by the context space affected by the user's movement patterns or check-in preferences, such as: the user's next check-in point may be some point around the current check-in point, and the user prefers to access points of interest that are closer to himself. As can be seen, geographic constraints play an important role in the embedded representation of check-in points and the embedded representation of tracks. (3) Labeled training data is lacking. In many real application scenarios, it is difficult to acquire a large amount of labeled data training models, but this is also the key in the learning of user movement patterns.

Based on the problems, the invention provides a new method for representing crowd movement through context-dependent graph embedding, and the method introduces a reinforced learning idea in the learning process, fully utilizes the geographical position information of the check-in point and the track information without a label, thereby effectively relieving the problems of data sparseness, limited tagged data and the like.

Disclosure of Invention

The invention aims to learn the expression of the movement pattern of a crowd through a new context-dependent graph embedding model, and further introduce a reinforcement learning method to find out a generator of a current track. In the whole training process of the model, the method synchronously trains the tracks with the labels and the tracks without the labels, and simultaneously updates the model parameters by a strategy gradient method in the counterstudy, thereby further enhancing the performance of the model.

Based on the above inventive concept, the present invention designs a method for representing crowd movement patterns by context-dependent graph embedding, which comprises the following steps:

s1, preprocessing data: cutting the original track according to a specific time interval and dividing a training set D_LAnd test set D_U；

S2, synthesizing a context correlation graph: according to D_LAnd D_URespectively constructing check-in point space diagrams G by historical tracks of users_spatialAnd access sequence diagram G_visitThen, a fully connected context correlation graph G is synthesized by the two graphs_total；

S3, for the model defined in S31, S32, S33, perform back propagation until the model converges:

s31, context dependent graph embedding: for D_LAnd D_UIn each check-in point l_τBased on the context correlation graph G_totalGenerating a low-dimensional distributed representation C (l) of check-in points_τ) Then, introducing nonlinear transformation to cooperate with the distribution representation of the training check-in point, and further obtaining the embedded representation C' (l) of the initialized check-in point_τ). Then, the invention makes the time vector u (l) corresponding to the check-in point_τ) Embedded representation C' (l) with initialization check-in point_τ) Splicing to obtain the final representation C of the check-in point_τ；

S32, track coding: based on the final representation of the check-in point obtained in the previous step, the track can be coded by using a recurrent neural network, and the current track is represented by a result X after the last hidden state normalization of the RNN;

s33, pre-training trajectory classifier, discriminator: based on the track embedding representation obtained in the previous step, the marked track (positive sample) { (T) is utilized^l,U^l)∈D_LPre-training trajectory classifier p_θWherein T is^lBeing sub-tracks of users, U^lLabels for track correspondences, D_LRepresenting a set of positive samples. Next, the classifier is used to classify the unlabeled trajectory (negative examples)

Predict its corresponding label

The discriminator F is then pre-trained by minimizing the cross entropy of the classifier on positive and negative samples_ε；

S4, updating the model parameters by using a small batch training method, and repeating the steps until the model traverses all data: from positive samples D_LIn the collection of B group data pairs (labeled tracks, corresponding labels), from the negative examples D_UData of group B (unlabeled trajectories) were also acquired and passed through a classifier p_θPredicting the corresponding labels of the unmarked tracks; using a discriminator F_εJudging the credibility of the label predicted by the classifier, and updating the behavior value function a (-) of the discriminator along with the credibility; calculating a gradient of the desired reward

Updating parameters { theta, omega, phi } in the model through the strategy gradient, wherein theta is a parameter of the track classifier, omega is a parameter for embedding the sign-in point, and phi is a parameter for embedding the track;

s5, giving an unmarked trackUsing the above-described trained classifier p_θAnd discriminator F_εPredicting the probability distribution vector corresponding to the track

Each dimension represents the probability value that the user corresponding to the dimension is the prediction result of the time, and the user corresponding to the dimension with the maximum probability value is selected as the final prediction result.

In the above method for representing the movement pattern of the crowd through the context-dependent graph embedding, the step S1 aims to cut the track and divide the training set D_LAnd test set D_U. Utensil for cleaning buttockThe method comprises the following steps:

and S11, dividing a complete track into continuous sub-tracks according to a fixed time interval (6 hours), wherein each sub-track is a check-in point sequence divided based on the time interval. If the current trajectory is known by user u_kIf it is generated, the trace is recorded asWhere N represents the number of users, each l_τ(τ∈[1,n]) Representative time stamp t_τA point of attendance, and each l_τAll have geographic coordinates corresponding thereto

Representing the warp and weft values, respectively. For tracks that do not know who produced, i.e. unmarked tracks, they are noted as

S12, in order to further mine the moving mode of the user, the invention adds a time factor. Specifically, one day is divided into 24 discrete time stamps and the 24 density-based time representations are sampled in a Gaussian distribution, i.e., at t_τNodes that are time-checked-in and their corresponding time vectors may be represented as

Wherein d is^*A dimension representing a time vector;

s13, dividing training set and testing set, selecting 50% T of each user sub-track^lAnd its corresponding tag U^lAs a training set { (T)^l,U^l)∈D_LThe remaining 50% was used as the test set

S14, obtaining a vector representation of the known label: for a known label of the training set data, the present invention will use 0-1 vector encoding to represent the label;

s15, sorting the traces according to their lengths so that the trace lengths of the same batch are as similar as possible. This operation may reduce unnecessary padding during processing of the trace.

In the above method for representing the movement pattern of people by embedding the context-dependent graph, the step S2 is to synthesize a fully connected context-dependent graph G_total. The method specifically comprises the following steps:

s21, constructing a check-in point space map, and determining whether two points in the check-in point set should be connected according to whether the distance between the two points is greater than a given threshold value, i.e. constructing a non-directional space map G_spatial{V,E_sWhere V is the set of check-in points sampled from the check-in records of the training set and test set, E_sRepresenting a collection of edges between nodes, but if a given threshold is too small, some distant check-in points may become outliers, causing the graph to not communicate. In order to avoid the appearance of isolated points, the invention introduces three virtual nodes, namely a starting point sp, a terminal point ep and a completion node cp, and further connects all the nodes in V with the three nodes one by one;

s22, constructing an access sequence diagram, and further constructing a directed diagram G in order to integrate the access intention and the serialized moving mode of the user_visit{V,A,E_visitIn which E_visitRepresenting conversion of places visited by the user, A being G_visitSet of neutron trajectories, for example: if it is

Then any check-in point v it contains_i∈V；

S23, synthesizing a fully connected context correlation graph G_total{ V, E }, first with G_spatialInitialization G_totalThen E is_visitThe edges in the graph E are added into the graph E one by one, and an undirected graph G for learning the sign-in point representation is obtained_totalThe map takes into account both the check-in point geographical location information and the sequence pattern of the check-in.

In the method for representing the movement pattern of the crowd through the context-dependent graph embedding, the step S31 aims to obtain the final representation C of the check-in point through the context-dependent graph embedding_τ. The method specifically comprises the following steps:

s311, in the context correlation graph G_totalContext Con (l) for getting check-in point by random walk_τ)＝l_τ-a:l_τ+aWherein l is_τ-a:l_τ+aRepresents a check-in point l_τNearby continuous 2a +1 check-in point sequences, wherein a is the size of a fixed sliding window, and tau is the index position of the current check-in point in the track;

s312, obtaining a check-in point l by using a continuous bag-of-words model_τLow dimensional distribution of (a) represents C (l)_τ)∈R^dD is the dimension of the low-dimensional vector space;

s313, introducing nonlinear transformation to cooperatively train the representation of each check-in point, and further obtaining an embedded representation C' (l) of the initialized check-in point_τ)＝Relu(W'C(l_τ) + b'), wherein C (l)_τ) For signing in point l_τThe vector of (1) represents, Relu is a specific activation function, and ω ═ { W ', b' } is a parameter which can be learned in the process of embedding and representing the check-in point;

s314, embedding the initialization of each check-in point into a representation C' (l)_τ) And time information u (l)_τ) Final representation C of check-in points obtained by splicing_τFormally represented as C_τ＝[C'(l_τ),u(l_τ)]。

In the method for embedding and representing the crowd movement pattern by the context-dependent graph, the step S32 aims to encode the trajectory by using RNN, and further obtain the final representation X of the current trajectory. The method specifically comprises the following steps:

s321, for each (sub) track T ═ l₁,l₂,...,l_nIs given by_τ-1And h_τ(τ e {1, 2.. eta., n }) represents the previous step and the current hidden state of the trace, respectively, with an initial oneState h₀Is a vector of all 0 s. The input gate, forgetting gate and output gate of RNN can be obtained by the following calculation: i.e. i_τ＝σ(W_iC_τ+U_ih_τ-1+V_ic_τ-1+b_i)，f_τ＝σ(W_fC_τ+U_fh_τ-1+V_fc_τ-1+b_f)，o_τ＝σ(W_oC_τ+U_oh_τ-1+V_oc_τ-1+b_o) Wherein b is_*Is a deviation vector, σ is a logistic activation function, C_τRepresents a check-in point l_τIs finally expressed, matrix W_*，U_*，V_*Are different door parameters. Memory cell c_τThe update rule of (1) is: c. C_τ＝f_τc_τ-1+i_τtanh(W_cC_τ+U_ch_τ-1+b_c) Where tanh (. cndot.) is a hyperbolic tangent function, initial state c₀Is also an all-0 vector, hidden state h_tThe update rule is as follows: h is_τ＝tanh(c_τ)o_τ；

S322, the invention selects the result after the last hidden state normalization of the current track as the final representation of the track:

wherein

For the last hidden state h_nD' is a hidden state h_nOf the dimension, [,]representing a stitching operation between vector elements.

In the above method for representing the movement pattern of the crowd through the context-dependent graph embedding, the step S33 is to utilize the tagging track (positive sample) { (T)^l,U^l)∈D_LTraining trajectory classifier p_θAnd further using the classifier as an unlabeled trajectory (negative example)

Predict its corresponding label

The discriminator F is then pre-trained by minimizing the cross entropy of the classifier on positive and negative samples_ε. The method specifically comprises the following steps:

s331, using the marked track (positive sample) { (T)^l,U^l)∈D_LPre-training trajectory classifier p_θWith training objectives to maximize the desired reward R_θ＝∑_u∈UΡ_θ(u | X) a (X, u), wherein, p_θ(u | X) represents the probability that the predicted tag for track X is u, and a (-) is a behavioral cost function for selecting u as the current predicted tag, which is used to measure or evaluate the authenticity of the predicted tag, where a (X, u) ═ 1 if X is data sampled from the labeled track, and otherwise, a (X, u) ═ F_ε(X, u), order

Reward-based impact classifier as an unmarked trace, W "and b" are learning parameters, o ═ Relu (X)^TW_ru), Relu is the activation function, W_rIs a learned parameter matrix;

s332, using classifier p_θAs an unmarked track (negative sample)

Predict its corresponding label

S333, pre-training the discriminator F by minimizing the cross entropy of the classifier on positive and negative samples_εI.e. by

In the above method for representing the crowd moving pattern by embedding the context-dependent graph, the step S4 is to continuously update the model parameters { θ, ω, Φ } by using a small-batch training method, and repeat the step until the model traverses all data. The method specifically comprises the following steps:

s41, dividing the small batch data: from training data { (T)^l,U^l)∈D_LB groups of real labeling tracks and labels corresponding to the real labeling tracks are collected, and test data are obtainedB sets of unlabeled trajectory data and the labels predicted for it by the model are also collected

B is the sample size of each small batch;

s42, updating the behavior merit function of the discriminator, i.e. passing through the discriminator F_ε(X, u) compute the classifier predicted label Pp_θ(u | X) is the probability of a true tag;

s43, calculating a gradient of the desired reward: let the vector of the policy parameter be θ, the update of the policy parameter can be expressed as:

wherein the content of the first and second substances,

i.e. U is a real tag U^lAnd predicted labels

The training target of the classifier can be rewritten asIn the training process, the invention uses a small batch training method to update the parameter theta, and the approximate gradient of the parameter theta can be obtained by the following formula:

wherein B is the batch size, u^l，X^l，

Respectively, a true tag, a tagged track, a predicted tag, an untagged track. Therefore, the update rule of the parameter θ is:

where β is the learning rate, θ_newAnd theta_oldRespectively representing the updated parameter and the current parameter;

s44, in the present invention, in order to better encode the check-in point and the track, the parameter ω embedded to represent the check-in point and the parameter φ embedded to represent the track are updated simultaneously, so the parameter updating formula can be rewritten as

Where γ is { θ, ω, Φ }.

Compared with the prior art, the invention has the following beneficial effects:

1. the invention tries to learn the embedded representation of the user check-in point by using a model based on a graph for the first time, and the method provides a new visual angle for a task of learning a mobile mode by using geographical position information and unmarked mobile data;

2. the invention provides a method which integrates context semantic information and can dynamically update the sign-in point embedded representation and the track embedded representation, the method not only provides more accurate and more comprehensive mobile mode representation, but also effectively relieves the problems of data sparseness and limited tagged data;

3. the invention provides a method for extracting an implicit moving mode from tagged and untagged track data by using reinforcement learning, which can approximate real user track distribution, designs a discriminator by using the thought of counterstudy to judge the authenticity of the generated data and further enhances the model performance.

Drawings

FIG. 1 is an overall framework diagram of a method of representing crowd movement patterns through context-dependent graph embedding.

Interpretation of terms

LBSN is an abbreviation for Location-based Social Network, representing a "Location-based Social Network". The LBSN not only contains the relationship between people in the traditional social network, but also records the information of the user sign-in time, the geographic position and the like.

RNN is an abbreviation for current Neural Network, denoted "Recurrent Neural Network", a class of Neural networks used for processing sequence data.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

Examples

The method for representing crowd movement patterns through context-dependent graph embedding provided by the present implementation can be used for such data sets containing user check-in records. Real world LBSN data set as shown in Table 1, in Gowalla: (B)http:// snap.stanford.edu/data/loc-gowalla.htmlAcquisition) was performed as an example.

Table 1: information related to the experimental data of the invention

| u | represents the number of users;

|D_L|/|D_Ul respectively represents the number of sub-tracks used for training and testing;

l represents the number of different check-in points in the dataset;

representing the average length of the track before being divided;

T_rrepresenting a range of track lengths in the data set;

new York represents data in the data set Foursquare with a New York area;

tokyo represents data in the data set Foursquare with an area of Tokyo.

In this embodiment, the present invention represents people by context-dependent graph embedding, as shown in FIG. 1The method of the group moving mode includes steps S1 to S5. Step S1, aiming at the Gowalla data set, firstly dividing the track to obtain a (sub) track set and a check-in point set, then connecting each sub track related by each user with the corresponding user, and selecting 50% of the marked tracks of the total sub tracks of each user as a training set D_LThe remaining 50% unmarked sub-tracks D_UIs used for testing; step S2, respectively constructing check-in point space graph G according to the historical track of the user_spatialAnd access sequence diagram G_visitThen, a fully connected context correlation graph G is synthesized by the two graphs_total(ii) a Step S3, for the model defined in S31, S32, S33, performs back propagation until the model converges: s31, for D_LAnd D_UIn each check-in point l_τBased on the context correlation graph G_totalGenerating a low-dimensional distributed representation C (l) of check-in points_τ) Then, introducing nonlinear transformation to cooperate with the distribution representation of the training check-in point to obtain the embedded representation C' (l) of the initialized check-in point_τ) Then, the present invention provides the time vector u (l) corresponding to the check-in point_τ) Embedded representation C' (l) with initialization check-in point_τ) Splicing to obtain the final representation C of the check-in point_τ(ii) a S32, based on the embedded expression of the check-in point obtained in the previous step, the track can be coded by using a recurrent neural network, wherein the current track is expressed by the result X after the last hidden state normalization of the RNN; s33, based on the trace embedding representation obtained in the previous step, using the marked trace (positive sample) { (T)^l,U^l)∈D_LPretraining a trajectory classifier, initializing the trajectory classifier p_θAnd further using the classifier as an unlabeled trajectory (negative example)Predict its corresponding label

The discriminator F is then pre-trained by minimizing the cross entropy of the classifier on positive and negative samples_ε(ii) a Step S4, updating the model parameters using a small batch training method, and repeating the step until the model traverses all data: from the tagging track D_LIn the collection of B group data pairs (labeled tracks, corresponding labels), from unlabeled track D_UData of group B (unlabeled trajectories) were also acquired and passed through a classifier p_θPredicting the corresponding labels of the unmarked tracks; using a discriminator F_εJudging the credibility of the label predicted by the classifier, and updating the behavior value function a (-) of the discriminator along with the credibility; calculating a gradient of the desired reward

Updating parameters { theta, omega, phi } in the model through the strategy gradient; step S5, an unmarked track is given

Using the above-described trained classifier p_θAnd discriminator F_εPredicting the probability distribution vector corresponding to the track

As can be seen from the experimental results in table 2, the prediction effect of the method for representing the movement pattern of the crowd through the context-dependent graph embedding of the present invention is completely superior to that of the conventional method. Therefore, the graph-based method which is provided by the invention and integrates the context semantic information and can dynamically update the check-in point embedded representation and the track embedded representation actually improves the prediction performance of the model.

ACC @ K represents the accuracy, namely the proportion of the number of predicted correct tracks to the number of all tracks;

macro-P represents the accuracy rate, namely the proportion of the number of tracks which are predicted correctly to the number of tracks which are predicted;

macro-R represents the recall rate, namely the proportion of the number of predicted correct tracks to the number of real tracks;

macro-F1 represents the harmonic mean of the exact value and recall;

TULER-LSTM, TULER-GRU, Bi-TULER: the present invention replicates three TULER methods based on LSTM, GRU, and bi-directional LSTM units, respectively, TULER being the best method to perform on the user-trajectory link problem in the near future.

HTULER-L, HTULER-G, HTULER-B: three htule methods, based on multi-layer LSTM, GRU, and bi-directional LSTM cells, respectively, are reproduced for purposes of the present invention.

TULVAE is a method to solve the user-trajectory linkage problem in a semi-supervised manner by variational inference. TLUTE-L, TLUTE-G: TLUTE was originally a method to measure the similarity between a user's embedded representation and the embedded representation of a track, but the task of the present invention is to link a track to the corresponding user, so the task can be considered as a classification task. Here, for the convenience of comparison with the method proposed by the present invention, the present invention removes the module of similarity measure in the original TLUTE, and TLUTE-L and TLUTE-G are simplified version TLUTE methods based on LSTM and GRU units, respectively.

Table 2: effects of GraphTUL on track-user Link prediction on four real datasets

It will be appreciated by those of ordinary skill in the art that the embodiments described herein are intended to assist the reader in understanding the principles of the invention and are to be construed as being without limitation to such specifically recited embodiments and examples. Those skilled in the art can make various other specific changes and combinations based on the teachings of the present invention without departing from the spirit of the invention, and these changes and combinations are within the scope of the invention.

Claims

1. A method for representing movement patterns of a population by context-dependent graph embedding, comprising the steps of:

S32, track coding: and coding the track by utilizing a recurrent neural network based on the final representation of the check-in point obtained in the previous step. In this case, the present invention uses the result X after the last hidden state normalization of RNN to represent the current trace;

s33, pre-training trajectory classifier, discriminator: based on the trace-embedding representation obtained in the previous step, using the scaled trace, i.e., { (T) positive samples^l,U^l)∈D_LPre-training trajectory classifier p_θWherein T is^lBeing sub-tracks of users, U^lLabels for track correspondences, D_LRepresenting a set of positive samples. Subsequently, the classifier is used to classify the unlabeled trajectory, i.e. the negative examples

Predict its corresponding label

S4, updating the model parameters by using a small batch training method, and repeating the steps until the model traverses all data: from positive samples D_LIn the collection of group B data pairs, from negative samples D_UB group data are also acquired and passed through a classifier p_θPredicting the corresponding labels of the unmarked tracks; using a discriminator F_εJudging the credibility of the label predicted by the classifier, and updating the behavior value function a (-) of the discriminator along with the credibility; calculating a gradient of the desired reward

s5, giving an unmarked track

2. The method of claim 1, wherein the step S1 comprises the following sub-steps:

s11, dividing a complete track into continuous sub-tracks according to a fixed time interval, such as 6 hours, where each sub-track is a sequence of check-in points divided based on the time interval. If the current trajectory is known by user u_kIf it is generated, the trace is recorded as

Where N represents the number of users, each l_τ(τ∈[1,n]) Representative time stamp t_τA point of attendance, and each l_τAll have geographic coordinates corresponding thereto

S12, in order to further mine the moving mode of the user, the invention adds a time factor. Specifically, one day is divided into 24 discrete time stamps and the 24 density-based time representations are sampled in a Gaussian distribution, i.e., at t_τNodes that are time-checked-in and their corresponding time vectors may be represented asWherein d is^*A dimension representing a time vector;

3. The method of claim 1, wherein the step S2 comprises the following sub-steps:

Then any check-in point v it contains_i∈V；

S23, synthesizing a fully connected context correlation graph G_total{ V, E }, first with G_spatialInitializationG_totalThen E is_visitThe edges in the graph E are added into the graph E one by one, and an undirected graph G for learning the sign-in point representation is obtained_totalThe map takes into account both the check-in point geographical location information and the sequence pattern of the check-in.

4. The method of claim 1, wherein the step S31 comprises the following sub-steps:

5. The method of claim 1, wherein the step S32 comprises the following sub-steps:

s321, for each sub-track T ═ { l ═ l₁,l₂,...,l_nIs given by_τ-1And h_τ(τ e {1, 2.. eta., n }) represents the previous step and the current hidden state of the trace, respectively, with an initial state h₀Is a vector of all 0 s. The input gate, forgetting gate and output gate of RNN can be obtained by the following calculation: i.e. i_τ＝σ(W_iC_τ+U_ih_τ-1+V_ic_τ-1+b_i)，f_τ＝σ(W_fC_τ+U_fh_τ-1+V_fc_τ-1+b_f)，o_τ＝σ(W_oC_τ+U_oh_τ-1+V_oc_τ-1+b_o) Wherein b is_*Is a deviation vector, σ is a logistic activation function, C_τRepresents a check-in point l_τIs finally expressed, matrix W_*，U_*，V_*Are different door parameters. Memory cell c_τThe update rule of (1) is: c. C_τ＝f_τc_τ-1+i_τtanh(W_cC_τ+U_ch_τ-1+b_c) Where tanh (. cndot.) is a hyperbolic tangent function, initial state c₀Is also an all-0 vector, hidden state h_tThe update rule is as follows: h is_τ＝tanh(c_τ)o_τ；

wherein

6. The method of claim 1, wherein the step S33 comprises the following sub-steps:

s331, utilizing and addingTrace, i.e. positive sample { (T)^l,U^l)∈D_LPre-training trajectory classifier p_θWith training objectives to maximize the desired reward R_θ＝∑_u∈UΡ_θ(u | X) a (X, u), wherein, p_θ(u | X) represents the probability that the predicted tag for track X is u, and a (-) is a behavioral cost function for selecting u as the current predicted tag, which is used to measure or evaluate the authenticity of the predicted tag, where a (X, u) ═ 1 if X is data sampled from the labeled track, and otherwise, a (X, u) ═ F_ε(X, u), order

s332, using classifier p_θFor unmarked tracks, i.e. negative examples

Predict its corresponding label

7. The method of claim 1, wherein the step S4 comprises the following sub-steps:

s41, dividing the small batch data: from training data { (T)^l,U^l)∈D_LB groups of real labeling tracks and labels corresponding to the real labeling tracks are collected, and test data are obtained

B sets of unlabeled trajectory data and the labels predicted for it by the model are also collectedB is the sample size of each small batch;

wherein the content of the first and second substances,

i.e. U is a real tag U^lAnd predicted labels

The training target of the classifier can be rewritten as

In the training process, the invention uses a small batch training method to update the parameter theta, and the approximate gradient of the parameter theta can be obtained by the following formula:

wherein B is the batch size, u^l，X^l，

Where γ is { θ, ω, Φ }.