CN111223126B

CN111223126B - Cross-view-angle trajectory model construction method based on transfer learning

Info

Publication number: CN111223126B
Application number: CN202010010171.8A
Authority: CN
Inventors: 刘龙; 丁婕; 徐小平
Original assignee: Xian University of Technology
Current assignee: Xian University of Technology
Priority date: 2020-01-06
Filing date: 2020-01-06
Publication date: 2023-03-31
Anticipated expiration: 2040-01-06
Also published as: CN111223126A

Abstract

The invention discloses a cross-view trajectory model construction method based on transfer learning, which comprises the following steps of 1, constructing a target domain target trajectory characteristic value sequence set; classifying the characteristic value sequence of the known label according to the label; step 2, constructing a source domain target track characteristic value sequence set; classifying the characteristic value sequence according to the label; step 3, training the HMM model by adopting the characteristic value sequence in the step 2; step 4, constructing a mapping model between the source domain and the target domain features according to the characteristic value sequence set in the step 1 and the characteristic value sequence set in the step 2, and obtaining the target domain observation probability according to the model; and 5, calibrating the target domain transfer probability according to the characteristic value sequence set in the step 1 and the training model parameters in the step 4 to obtain the target domain hidden Markov model. The method solves the problem that the target track model is not suitable and has low accuracy due to different visual angles in the prior art.

Description

Cross-view-angle trajectory model construction method based on transfer learning

Technical Field

The invention belongs to the technical field of monitoring video processing, and particularly relates to a cross-view trajectory model construction method based on transfer learning.

Background

Motion information reflecting temporal changes in video content is essential to portray semantic content in video. The target motion track describes motion information of a plurality of semantic contents, so that track modeling analysis has important significance for a plurality of applications including video monitoring, object behavior analysis, video retrieval and the like. In the existing monitoring system, a plurality of visual angle cameras are often cooperated to play a role. For example, under the same scene, the multi-view cameras jointly work and cooperate; the multi-view camera can also provide effective information for the occurrence of tracks with the same semantics in different scenes. Learning a new model for each perspective is not practical, the training cost of obtaining multiple labeled samples for each behavior from each perspective is high, and the model is not convenient for wide popularization.

Traditional machine learning algorithms (support vector machines, decision trees, random forests, dynamic bayesian networks, support vector machines, etc.) are often used to classify trajectory-based behaviors. Most of the existing track analysis methods have the defects of high false alarm rate, overfitting, neglecting some useful characteristics of behaviors, incapability of covering various anomalies due to the particularity of the method and the unavailability of data, and the like. In recent years, the most excellent neural network model has strong data processing capacity to classify and identify data, and a good classification and identification effect is obtained, but the neural network model has weak modeling capacity on a track with a time sequence relation, and a large amount of sample data is required for training to achieve accurate convergence.

Disclosure of Invention

The invention aims to provide a cross-view-angle trajectory model construction method based on transfer learning, and solves the problem that a target trajectory model is not applicable and has low accuracy due to different view angles in the prior art.

The invention adopts the technical scheme that a cross-view trajectory model construction method based on transfer learning is implemented according to the following steps:

step 1, constructing a target track characteristic value sequence set of a target domain; classifying the characteristic value sequence of the known label according to the label to obtain B _n A class characteristic value sequence set; wherein the target domain consists of a sequence of eigenvalues of which x tags are known and a sequence of eigenvalues of which y tags are unknown, and y > x;

step 2, constructing a source domain target track characteristic value sequence set by adopting the construction method in the step 1; classifying the characteristic value sequence according to the label to obtain C _n A class characteristic value sequence set; wherein, the characteristic value sequence labels of the source domain are known;

step 3, training the HMM model by adopting the characteristic value sequence in the step 2 to obtain C _n HMM models for each trajectory category;

step 4, constructing a mapping model between the source domain and the target domain features according to the characteristic value sequence set in the step 1 and the characteristic value sequence set in the step 2, and obtaining the target domain observation probability according to the model;

and 5, calibrating the target domain transition probability according to the characteristic value sequence set in the step 1 and the training model parameters in the step 4 to obtain the target domain hidden Markov model.

The invention is also characterized in that:

the step 1 is implemented according to the following steps:

step 1.1, tracking a target in a video frame sequence to obtain a target track coordinate sequence

Selecting a first frame target area in a frame sequence of a video as a tracking template, and extracting target color characteristics; tracking the target frame by adopting a particle filter tracking frame to obtain a track coordinate sequence; tracking track coordinate sequence in time interval of delta t =0.3s

Uniformly sampling; wherein (x) _t ,y _t ) Is the target position coordinate at time t;

step 1.2, denoising the target track coordinate sequence in the step 1.1

Filtering noise points of the track coordinate sequence obtained in the step 1.1 by using an average filter with the size of a sliding window being 5; the mean filtering formula is as follows:

step 1.3, extracting the angle characteristics of the target track coordinate sequence in the step 1.2

The following formula is adopted to extract the angle characteristics:

in the formula (x) _t ,y _t ) Is the target position coordinate at time t;

step 1.4, discretizing the angle characteristics extracted in step 1.3 to obtain a characteristic value sequence

According to the obtained angle

Obtaining a characteristic value O by discretizing a 24-direction chain code _t Further, a characteristic value sequence O is obtained _T ＝O ₁ O ₂ …O _t …；

Step 1.5, classifying the characteristic value sequences according to the labels to obtain B _n And (5) collecting the class characteristic value sequences.

In step 1.1, the specific process of extracting the target color features is as follows:

assume that the center position of the target region is (x) ₀ ,y ₀ ) Then the width and height of the target region are w ₀ And h ₀ At a certain point p in the target area _i ＝(x _i ,y _i ) The target feature may be represented as:

in the formula, k is a normalization coefficient; a. n respectively represents the pixel number and the scale of the target area; u. of _i Representing each feature subspace; delta is a dirac function; k (r) =1-r ² Is a weight function;

assuming the particle state as

Observed value is Z ^k Establishing a candidate model q = { q ] of the region where the particle is located _i } _i＝1,…N And measuring the similarity of the particle region and the target region by adopting a Bhattacharyya coefficient:

state X at time t _t The observation equation of (a) is:

in step 1.1, the particle filter tracking process is specifically as follows:

(1) Particle initialization

When t =0, particle initialization is performed to randomly generate particle subsets

Setting a weight value, wherein the weight value is 1/N;

(2) Predicting; predicting the state of each particle according to the prediction process of the system

Predicted current position during prediction

The position from the previous instant is a linear gaussian relationship, the so-called equation of motion:

in the formula u _k Is an external input, ω _k Is a gaussian error;

(3) Updating; updating the weight of the particle according to the observed value

Normalized weight

(4) Resampling; copying a part of particles with high weight and removing a part of particles with low weight

According to respective normalized weight

Size copy/discard samples->

Obtaining N approximate obeys>

Distributed sample->

Make->

i＝1,…,N；

(5) Outputting; estimating current state using particles and weights

The output being a set of particles

And estimating the current state by using the particle state and the weight value so as to obtain a target coordinate at the current moment:

(6) And (4) tracking the rest video frames by adopting the methods from (2) to (4) to obtain a track coordinate sequence.

In step 1.4, the discretization of the 24-direction chain code is specifically as follows:

dividing an angle area, namely 360 degrees into 24 intervals on average, marking the 24 intervals with 1-24, wherein one number corresponds to one angle interval; angle of rotation

In which angle interval, it is recorded as the number corresponding to the interval.

The specific process of the step 3 is as follows:

step 3.1, randomly initializing an HMM model lambda = (A, B, pi) to obtain an initial HMM model; wherein A is the transition state probability, B is the observation state probability, and π is the initial state probability distribution;

step 3.2, calculating M characteristic value sequences O in certain category of tracks _S Probability of occurrence P (O) under this model _S Multiplication by multiplication of I | λ)

Wherein, I is a hidden state sequence;

step 3.3 maximization using Baum-Welch algorithm

Step 3.4, to the initial HMM model λ _S ＝(A _S ,B _S ,π _S ) Reestimating until the iteration of the model parameters is not improved any more, and obtaining the optimal HMM model of the sequence

Step 3.5, training the rest track categories by adopting the methods from step 3.1 to step 3.4 to obtain the source domain C _n HMM model for individual trajectory classes

For initial HMM model λ _S ＝(A _S ,B _S ,π _S ) The re-estimation process is specifically as follows:

(1) Defining forward variables

α _t (i)＝P(O ₁ ，O ₂ ，…O _t ，I/λ) 1≤t≤T (11)

In the formula, a _ij ,b _j Matrix parameters of A and B are respectively;

(2) Defining a backward variable

β _t (i)＝P(O _t-1 ，O _t-2 ，…O _T ，I/λ) 1≤t≤T-1 (13)

In the formula, a _ij ,b _j Matrix parameters of A and B are respectively;

(3) For alpha _t (i) To perform treatment

Initialization

Recursion:

(4) For beta is _t (i) To perform treatment

Initialization

Recursive method

(5) Recalculation

In the formula (I), the compound is shown in the specification,

matrix parameters of pi, A, B, respectively.

Step 4 is specifically implemented according to the following steps:

step 4.1, constructing a mapping model between the source domain and the target domain according to the characteristic value sequence set in the step 1 and the characteristic value sequence set in the step 2, wherein the mapping relation is as follows:

in the formula, w and b are coefficients of a characteristic mapping fitting curve equation; o is _S Is a source domain coded sample;

is the mapped target domain coded data;

the objective function is:

in the formula, O _T Is the true target domain encoded data;

step 4.2, the optimal HMM model in the step 3 is obtained

Is based on the observation state probability->

Assigning the initial value B of the probability of the observation state of the target domain according to the mapping relation of the step 4.1 _T 。

Step 5 is specifically implemented according to the following steps:

step 5.1, model parameters in the step 4.3

As a corresponding target domain model λ _T Is greater than or equal to>

π _T ；

Step 5.2, according to the model

Making a plurality of groups of simulation data;

step 5.3, calculating the similarity of the simulation data in the step 5.2 and the target domain same track category characteristic value sequence in the step 1;

step 5.4, calculating a target domain transfer summary A by adopting an optimization algorithm by taking the similarity height as a target function _T (ii) a The calculation formula is as follows:

in the formula, g (-) is a model

Simulating to generate a mean value of the data;

step 5.5, calibrating the target domain transition probability by adopting a constraint optimization algorithm to obtain a target domain hidden Markov model

Solving the optimal delta A by adopting an interior point method, and calculating a target domain model

Simulation data and O _T If the similarity is larger than or equal to the similarity threshold, the delta A obtained in the previous step is used as an initial value to enter the iteration of the interior point method again until the value is smaller than the similarity threshold; namely the target domain hidden Markov model>

Wherein the constraint is that the constraint is a transition probability matrix->

And->

Is greater than 0 and the sum of each row of elements is 1.

The specific process of step 5.2 is as follows:

given an HMM model λ = (a, B, pi), the observation sequence O = O ₁ O ₂ …O _k Can be produced by the following steps:

(1) According to the initial state probability distribution pi = pi _i Selecting an initial state Q ₁ ＝i；

(2) Let t =1;

(3) Output probability distribution b from state i _jk Output O _t ＝k；

(4) Output probability distribution b from state i _jk Output O _t ＝k；

(5) If t = t +1, if t < k, repeating (3) and (4), otherwise ending;

in step 5.3, the measurement of the similarity is determined by the euclidean distance, and the euclidean distance calculation formula is as follows:

in the formula (I), the compound is shown in the specification,

O _T respectively are models>

The simulated data set mean value and the labeled characteristic value sequence set mean value in the step 1 belong to the same track category;

the similarity calculation formula is as follows:

the invention has the beneficial effects that:

the invention relates to a cross-view track model construction method based on transfer learning, which comprises the steps of constructing a target track characteristic data set, training a hidden Markov model under a source domain view, establishing a source domain characteristic and target domain characteristic mapping model to optimize transfer observation probability parameters, and optimizing a target domain transfer probability based on a small number of target domain labeled samples; by adopting the model constructed by the invention, the behavior state of the target track can be judged under a specific visual angle; the method solves the problems of poor recognition effect and low robustness in the prior art during cross-view model migration under the condition of less labeled data in the target field, and the model constructed by the method has good performance for recognizing the target track of the track sample under different views.

Drawings

FIG. 1 is a flow chart of a cross-perspective trajectory model construction method based on transfer learning according to the present invention;

FIG. 2 is a 24-direction chain code diagram in the cross-view trajectory model construction method based on transfer learning according to the present invention;

FIG. 3 is a source domain feature and target domain feature mapping fitting curve in step 4 of the cross-view trajectory model construction method based on transfer learning.

Detailed Description

The invention is described in detail below with reference to the drawings and the detailed description.

As shown in fig. 1, the invention relates to a cross-view trajectory model construction method based on transfer learning, which is implemented specifically according to the following steps:

step 1, constructing a target track characteristic value sequence set of a target domain; classifying the characteristic value sequence of the known label according to the label to obtain B _n A class characteristic value sequence set; wherein the target domain consists of a sequence of eigenvalues of which x labels are known and a sequence of eigenvalues of which y labels are unknown, and y > x;

the step 1 is implemented according to the following steps:

step 1.1, tracking the target in the video frame sequence to obtain the target track coordinate sequence

Selecting a first frame target area in a video frame sequence as a tracking template, and extracting target color characteristics; tracking the target frame by adopting a particle filter tracking frame to obtain a track coordinate sequence; tracking track coordinate sequence in time interval of delta t =0.3s

the specific process of extracting the target color features is as follows:

in the formula, k is a normalization coefficient; a. n respectively represents the number of pixels and the size of the target area; u. of _i Representing each feature subspace; delta is a dirac function; k (r) =1-r ² Is a weight function;

assuming the particle state as

state X at time t _t The observation equation of (a) is:

the particle filter tracking process is concretely as follows:

(1) Particle initialization

Setting a weight value, wherein the weight value is 1/N;

Predicted current position during prediction

in the formula u _k Is an external input, ω _k Is a gaussian error;

Normalized weight

According to respective normalized weight

Size copy/discard sample->

Deriving N approximate obeys>

Distributed sample->

Make->

(5) Outputting; estimating current state using particles and weights

The output being a set of particles

And estimating the current state by using the particle state and the weight value, thereby obtaining the target coordinate at the current moment:

(6) Tracking the rest video frames by adopting the methods (2) to (4) to obtain a track coordinate sequence;

step 1.2, denoising the target track coordinate sequence in the step 1.1

Filtering noise points of the track coordinate sequence obtained in the step 1.1 by using an average filter with a sliding window size of 5; the mean filtering formula is as follows:

The following formula is adopted to extract the angle characteristics:

wherein (x) _t ,y _t ) Is the target position coordinate at the time t;

According to the obtained angle

The discretization of the 24-direction chain code is specifically as follows (as shown in fig. 2):

In which angle interval, the angle interval is marked as the number corresponding to the interval;

step 1.5, sequence of characteristic values according to labelsLine classification to obtain B _n And (5) collecting the class characteristic value sequence.

the specific process of the step 3 is as follows:

step 3.2, calculating M characteristic value sequences O in certain category of tracks _S Probability of occurrence P (O) under this model _S Multiplication of I | λ)

Wherein, I is a hidden state sequence;

step 3.3 maximization using Baum-Welch algorithm

(1) Defining forward variables

α _t (i)＝P(O ₁ ，O ₂ ，…O _t ，I/λ) 1≤t≤T (11)

In the formula, a _ij ,b _j Matrix parameters of A and B are respectively;

(2) Defining a backward variable

β _t (i)＝P(O _t-1 ，O _t-2 ，…O _T ，I/λ) 1≤t≤T-1 (13)

/>

In the formula, a _ij ,b _j Matrix parameters of A and B are respectively;

(3) For alpha _t (i) To perform treatment

Initialization

Recursion:

(4) For beta is _t (i) To perform treatment

Initialization

Recursive method

(5) Recalculation

/>

In the formula (I), the compound is shown in the specification,

matrix parameters of pi, A and B respectively;

step 3.5, training the rest track categories by adopting the methods of the step 3.1 to the step 3.4 to obtain a source domain C _n HMM model for individual trajectory classes

as shown in fig. 3, step 4 is specifically implemented according to the following steps:

in the formula, w and b are coefficients of a characteristic mapping fitting curve equation; o is _S Is a source domain encoded sample;

is the mapped target domain coded data;

the objective function is:

in the formula, O _T Is the true target domain encoded data;

step 4.2, the optimal HMM model in the step 3 is used

In (b) is determined by the observation state probability>

Assigning the initial value B of the probability of the observation state of the target domain according to the mapping relation of the step 4.1 _T ；

Step 5 is specifically implemented according to the following steps:

step 5.1, model parameters in the step 4.3

As a corresponding target domain model λ _T Is greater than or equal to>

π _T ；

Step 5.2, according to the model

Making a plurality of groups of simulation data;

the specific process of the step 5.2 is as follows:

(2) Let t =1;

(3) Output probability distribution b from state i _jk Output O _t ＝k；

(4) Output probability distribution b from state i _jk Output O _t ＝k；

(5) If t = t +1, if t < k, repeating (3) and (4), otherwise ending;

in the formula (I), the compound is shown in the specification,

O _T are respectively the model->

the similarity calculation formula is as follows:

step 5.4, the similarity height is taken as a target function, and an optimization algorithm is adopted to calculate a target domain transfer summary A _T (ii) a The calculation formula is as follows:

in the formula, g (-) is a model

Simulating to generate a mean value of the data;

Simulation data and O _T If the similarity is larger than or equal to the similarity threshold, the delta A obtained in the previous step is used as an initial value to enter the iteration of the interior point method again until the delta A is smaller than the similarity threshold; namely the target domain hidden Markov model>

And->

Is greater than 0 and the sum of each row of elements is 1.

The invention relates to a cross-view track model construction method based on transfer learning, which comprises the steps of constructing a target track characteristic data set, training a hidden Markov model under a source domain view, establishing a source domain characteristic and target domain characteristic mapping model to optimize transfer observation probability parameters, and optimizing a target domain transfer probability based on a small number of target domain labeled samples; by adopting the model constructed by the invention, the behavior state of the target track can be judged under a specific visual angle; the method solves the problems of poor recognition effect and low robustness in cross-view model migration in the prior art under the condition of less labeled data in the target field, and the model constructed by the method has good performance in target track recognition of track samples under different views.

Claims

1. A cross-view trajectory model construction method based on transfer learning is characterized by being implemented according to the following steps:

the step 1 is specifically implemented according to the following steps:

Selecting a first frame target area in a video frame sequence as a tracking template, and extracting target color characteristics; tracking the target frame by adopting a particle filter tracking frame to obtain a track coordinate sequence; tracking track coordinate sequence according to time interval of delta t =0.3s

Uniformly sampling; wherein (x) _t ,y _t ) Is the target position coordinate at the time t;

in the step 1.1, the specific process of extracting the target color features is as follows:

in the formula, k is a normalization coefficient; a. n respectively represents the pixel number and the scale of the target area; u. u _i Representing each feature subspace; delta is a dirac function; k (r) =1-r ² Is a weight function;

assuming the particle state as

state X at time t _t The observation equation of (a) is:

in step 1.1, the particle filter tracking process specifically includes:

(1) Particle initialization

Setting a weight value, wherein the weight value is 1/N;

Predicted current position during prediction

in the formula u _k Is an external input, ω _k Is a gaussian error;

Normalized weight

According to respective normalized weight

Size copy/discard sample->

Obtaining N approximate obeys>

Distributed sample->

Make->

(5) Outputting; estimating current state using particles and weights

The output being a set of particles

step 1.2, denoising the target track coordinate sequence in the step 1.1

The following formula is adopted to extract the angle characteristics:

in the formula (x) _t ,y _t ) Is the target position coordinate at the time t;

According to the obtained angle

In the step 1.4, the discretization of the 24-direction chain code is specifically as follows:

step 1.5, classifying the characteristic value sequences according to the labels to obtain B _n A class characteristic value sequence set;

2. The cross-perspective trajectory model building method based on transfer learning according to claim 1, wherein the specific process in step 3 is as follows:

step 3.2Calculating M characteristic value sequences O in a certain category of tracks _S Probability of occurrence P (O) under this model _S Multiplication of I | λ)

Wherein, I is a hidden state sequence;

step 3.3 maximization using Baum-Welch algorithm

3. The method for constructing a cross-perspective trajectory model based on transfer learning of claim 2, wherein λ is an initial HMM model _S ＝(A _S ，B _S ，π _S ) The re-estimation process is specifically as follows:

(1) Defining forward variables

α _t (i)＝P(O ₁ ，O ₂ ，…O _t ，I/λ)1≤t≤T (11)

In the formula, a _ij ，b _j Matrix parameters of A and B are respectively;

(2) Defining a backward variable

β _t (i)＝P(O _t-1 ，O _t-2 ，…O _T ，I/λ)1≤t≤T-1 (13)

In the formula, a _ij ，b _j Matrix parameters of A and B are respectively;

(3) For alpha _t (i) To perform treatment

Initialization

Recursion:

(4) For beta is _t (i) To perform treatment

Initialization

Recursive

(5) Recalculating

In the formula (I), the compound is shown in the specification,

matrix parameters of pi, A and B respectively.

4. The method for constructing a cross-perspective trajectory model based on transfer learning according to claim 2, wherein the step 4 is specifically implemented according to the following steps:

/>

is the mapped target domain coded data;

the objective function is:

in the formula, O _T Is the true target domain encoded data;

step 4.2, the optimal HMM model in the step 3 is obtained

Is based on the observation state probability->

5. The method for constructing a cross-perspective trajectory model based on migration learning according to claim 4, wherein the step 5 is specifically implemented according to the following steps:

step 5.1, model parameters in the step 4.3

As a corresponding target domain model λ _T Is greater than or equal to>

π _T ；

Step 5.2, according to the model

Making a plurality of groups of simulation data;

in the formula, g (-) is a model

Simulating to generate a mean value of the data;

Simulation data and O _T If the similarity is larger than or equal to the similarity threshold, the delta A obtained by the last step of optimization is used as an initial value to enter the iteration of the interior point method again until the value is smaller than the similarity threshold; namely the target domain hidden Markov model>

And->

Is greater than 0 and the sum of each row of elements is 1.

6. The method for constructing the cross-perspective trajectory model based on the transfer learning of claim 4, wherein the step 5.2 comprises the following specific processes:

given the HMM model λ = (a, B, pi),then observe sequence O = O ₁ O ₂ …O _k Can be produced by the following steps:

(2) Let t =1;

(3) Output probability distribution b from state i _jk Output O _t ＝k；

(4) Output probability distribution b from state i _jk Output O _t ＝k；

(5) If t = t +1, if t < k, repeating (3) and (4), otherwise ending;

in the step 5.3, the measurement of the similarity is determined by the euclidean distance, and the euclidean distance is calculated according to the following formula:

/>

in the formula (I), the compound is shown in the specification,

O _T are respectively the model->

the similarity calculation formula is as follows:

/>