CN114648557A - Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle - Google Patents
Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle
- Publication number
- CN114648557A (application CN202210311724.2A)
- Authority
- CN
- China
- Prior art keywords
- track
- view
- similarity
- visual angle
- ground
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/246—Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30241—Trajectory
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a multi-target cooperative tracking method based on a high-altitude view and a ground view, comprising the following steps: step s1, dividing the two synchronized videos of the air-ground collaborative views into V short video segments of equal length; step s2, generating, for each video segment, short trajectories of the targets under the different views from the target detection results; step s3, calculating similarity scores between the different short trajectories; step s4, performing trajectory association with a joint optimization function according to the similarity scores to generate short cross-view trajectories; step s5, repeating steps s2 to s4 and connecting the short cross-view trajectories to obtain long cross-view trajectories as the final tracking result. The method achieves higher matching accuracy than existing methods and good cross-domain performance.
Description
Technical Field
The invention belongs to the field of target tracking, relates to target tracking in videos shot from two views by a wearable camera and an unmanned aerial vehicle, and particularly relates to a multi-target cooperative tracking method based on a high-altitude view and a ground view.
Background
The background art related to the present invention is as follows:
(1) Pedestrian target detector (see document [1]): as an important component of data preprocessing, pedestrian target detectors are widely applied in fields such as target tracking, pedestrian re-identification, and action recognition. Commonly used pedestrian detectors fall into two main categories: methods based on background modeling and methods based on statistical learning. The former must adapt to environmental changes and is sensitive to picture jitter caused by camera shake. The latter is the method most commonly used for pedestrian detection at present, and builds a pedestrian detection classifier from a large number of samples. The role of deep learning in pedestrian detection is now significant; in the present invention, a YOLO detector is used to provide the corresponding target detection boxes.
(2) Air-ground collaborative mobile camera network: advances in mobile camera technology provide new perspectives for video surveillance. A drone can provide an overhead view of a set of targets on the ground, while wearable cameras can provide a ground view of the same targets. The invention proposes a new air-ground collaborative mobile camera network: the top view provides global information about the targets and the ground view provides their local details, so the two views complement each other well and offer better coverage and flexibility for outdoor monitoring. Such a complementary network can be applied effectively to tasks such as cooperative tracking and personal/group activity recognition.
(3) Cross-view multi-target association based on spatial distribution: because appearance and motion information cannot be exploited directly across views, associating the top view with the ground view is very difficult. Cross-view association based on spatial distribution achieves association between targets by means of the spatial position distribution shared by the high-altitude and ground views. In this way, data from different views can be analyzed collaboratively and applied to tasks such as target tracking, pedestrian re-identification, and action recognition.
Disclosure of Invention
The invention aims to provide a multi-target tracking technique that combines global and local information for synchronized videos captured under the air-ground collaborative views of the same scene, and realizes cooperative tracking of targets under these views by means of wearable cameras and unmanned aerial vehicle equipment, using a relatively simple and effective strategy. The method can simultaneously track targets in the ground and high-altitude views without limiting the number of targets in the views, and puts the targets in the two views into one-to-one correspondence.
The invention is implemented by adopting the following technical scheme:
A multi-target cooperative tracking method based on a high-altitude view and a ground view comprises the following steps:
step s1, dividing the two synchronized videos of the air-ground collaborative views into V short video segments of equal length;
step s2, generating, for each video segment, short trajectories of the targets under the different views from the target detection results;
step s3, calculating similarity scores between the different short trajectories;
step s4, performing trajectory association with a joint optimization function according to the similarity score C to generate short cross-view trajectories;
step s5, repeating steps s2 to s4 and connecting the short cross-view trajectories to obtain long cross-view trajectories as the final tracking result; wherein the short cross-view trajectory generation process is as follows:
the similarity measure between any cross-spatiotemporal trajectory segments is obtained through the similarity score C;
according to the similarity measure, the association is solved with the following formula:

maximize Σ_ij c_ij · a_ij

where c_ij represents the similarity between track segments i and j, obtained by the calculation in step s3, and a_ij is a 0-1 binary variable indicating whether the two track segments belong to the same target.
Further, the similarity score between two trajectory segments at the same time under different views in step s3 is obtained using the cross-view trajectory similarity:
201. Using spatial distribution consistency
a matching result between the high-altitude and ground views is obtained with a method based on spatial distribution:
if detection box B_m and detection box B_q are matched as the same person, then S(B_m, B_q) = 1; otherwise S(B_m, B_q) = 0; the similarity between high-altitude-view track B_m and ground-view track B_q is expressed as follows:
where F is the number of overlapping frames, B_m is the detection result under the top view, B_q is the detection result under the ground view, |B_m| is the number of detections in track B_m, and |B_q| is the number of detections in track B_q;
202. Using appearance similarity
inter-track similarity is measured with a twin (Siamese) network;
the average images of the two trajectory segments under the air-ground collaborative views are computed and fed into the network;
features are extracted;
combining the similarity scores, the edge weight is calculated as the following linear combination:
Further, the similarity score between two trajectory segments at different times under the same view in step s3 is generated using cross-time trajectory consistency:
301. Appearance trajectory consistency is obtained by using a color histogram to measure the appearance similarity of single-view targets:
the target color histogram of the single-view trajectory is calculated;
the median Ψ(B) of the color histograms is taken as the appearance descriptor of track B;
302. Motion continuity is obtained by predicting motion consistency with a constant-velocity motion model:
the forward deviation error δ_p and backward deviation error δ_n are calculated with the motion model;
the difference between the tracks is measured by δ = α(δ_p + δ_n);
combining the similarity scores, the edge weight is calculated as the following linear combination:

Advantageous Effects
By using a wearable camera and unmanned aerial vehicle equipment and choosing a simple and effective strategy, the invention independently designs a joint air-ground collaborative multi-target tracking method that effectively alleviates the occlusion problem in multi-pedestrian motion scenes, with accurate results and an efficient algorithm. In particular, the method has the following advantages:
a. high tracking precision
Video shot under the air-ground collaborative views yields both the global motion trajectory and the local appearance details of each target, greatly alleviating target loss caused by occlusion and therefore achieving higher tracking accuracy. A visualization of the cooperative tracking results is shown in FIG. 2.
b. The algorithm has low complexity and high speed
Over a large number of experiments, the method runs at 4.24 fps for target tracking, faster than most existing tracking methods. Time consumption is greatly reduced while accuracy is maintained.
c. Strong migration ability
The invention does not require training on large amounts of data and does not depend on specific data characteristics, so it has strong generality.
Drawings
FIG. 1: air-ground cooperative visual angle multi-target tracking problem flow chart
FIG. 2: space-ground cooperative view angle multi-target tracking result schematic diagram
Detailed Description
The invention is described in detail below with reference to the attached drawing figures:
The invention provides a multi-target cooperative tracking method based on a high-altitude view and a ground view. First, the synchronized videos are divided into segments and short per-view trajectories are generated from the detection results. Then, the trajectories of adjacent video segments in the multiple views are used to establish spatiotemporal data associations, yielding short cross-view trajectories. Finally, the short trajectories are connected to obtain the long cross-view trajectories as the final tracking result. The specific process is shown in FIG. 1:
Step s1, dividing the two synchronized videos of the air-ground collaborative views into V short video segments of equal length; wherein the synchronized video segmentation is as follows:
given a video from the high-altitude view and multiple videos from the ground view, the videos are segmented synchronously into video segments of the same length.
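Under the stated assumption that each decoded video is available as an indexable sequence of frames (the helper name and the plain lists standing in for frames are illustrative, not part of the patent), the synchronized segmentation can be sketched as:

```python
def split_into_segments(frames, num_segments):
    """Split a frame sequence into num_segments equal-length segments.

    A trailing remainder shorter than one segment is dropped so that
    both views yield the same number of time-aligned segments.
    """
    seg_len = len(frames) // num_segments
    return [frames[i * seg_len:(i + 1) * seg_len] for i in range(num_segments)]

# Cutting the two synchronized videos with the same V keeps segment k
# of the aerial video time-aligned with segment k of the ground video.
aerial_frames = list(range(100))  # stand-ins for decoded frames
ground_frames = list(range(100))
V = 5
aerial_segments = split_into_segments(aerial_frames, V)
ground_segments = split_into_segments(ground_frames, V)
```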
Step s2, generating, for each video segment, short trajectories of the targets under the different views from the target detection results; step s3, calculating similarity scores between the different short trajectories;
the similarity score between two trajectory segments at the same time under different views in step s3 is obtained using the cross-view trajectory similarity:
201. Using spatial distribution consistency
a matching result between the high-altitude and ground views is obtained with a method based on spatial distribution:
if detection box B_m and detection box B_q are matched as the same person, then S(B_m, B_q) = 1; otherwise S(B_m, B_q) = 0; the similarity between high-altitude-view track B_m and ground-view track B_q is expressed as follows:
where F is the number of overlapping frames, B_m is the detection result under the top view, B_q is the detection result under the ground view, |B_m| is the number of detections in track B_m, and |B_q| is the number of detections in track B_q;
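The per-frame matching score S and its aggregation over the F overlapping frames can be sketched as below. Since the aggregation formula itself is not reproduced in this text, averaging per-frame agreements over the overlap is one plausible reading, and the function name and identity labels are illustrative assumptions:

```python
def cross_view_track_similarity(matches_m, matches_q):
    """Spatial-distribution similarity between an aerial-view track B_m
    and a ground-view track B_q.

    `matches_m` / `matches_q` map frame index -> identity assigned by
    the spatial-distribution matcher in that frame.  S = 1 on a frame
    where both tracks are matched to the same person, else 0; the
    track-level score averages S over the F overlapping frames.
    """
    overlap = sorted(set(matches_m) & set(matches_q))
    if not overlap:  # no overlapping frames: no evidence of a match
        return 0.0
    agree = sum(1 for f in overlap if matches_m[f] == matches_q[f])
    return agree / len(overlap)

# The two tracks agree on 3 of their 4 overlapping frames.
sim = cross_view_track_similarity(
    {0: "p1", 1: "p1", 2: "p1", 3: "p2"},
    {0: "p1", 1: "p1", 2: "p1", 3: "p1", 4: "p1"},
)
```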
202. Using appearance similarity
inter-track similarity is measured with a twin (Siamese) network;
the average images of the two trajectory segments under the air-ground collaborative views are computed and fed into the network;
features are extracted;
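A minimal sketch of the average-image comparison follows. The channel-mean `embed` function is only a stand-in for one shared-weight branch of the twin network (a real implementation would run a CNN), and the cosine score is an assumption about how the extracted features are compared:

```python
import numpy as np

def average_image(patches):
    """Average the cropped target patches of one track (resized alike)."""
    return np.mean(np.stack(patches, axis=0), axis=0)

def embed(image):
    """Stand-in for one shared-weight branch of the twin (Siamese)
    network: here just the per-channel mean color, used only so the
    sketch is runnable without a trained model."""
    return image.reshape(-1, image.shape[-1]).mean(axis=0)

def appearance_similarity(patches_a, patches_b):
    """Cosine similarity between twin-branch features of the two
    tracks' average images."""
    fa = embed(average_image(patches_a))
    fb = embed(average_image(patches_b))
    return float(fa @ fb / (np.linalg.norm(fa) * np.linalg.norm(fb) + 1e-12))

# Two tracks with identical average appearance score close to 1.0.
red = [np.full((8, 8, 3), [200.0, 30.0, 30.0]) for _ in range(4)]
sim_same = appearance_similarity(red, red)
```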
Combining the similarity scores, the edge weight is calculated as the following linear combination:
The similarity score between two trajectory segments at different times under the same view in step s3 is generated using cross-time trajectory consistency:
301. Appearance trajectory consistency is obtained by using a color histogram to measure the appearance similarity of single-view targets:
the target color histogram of the single-view trajectory is calculated;
the median Ψ(B) of the color histograms is taken as the appearance descriptor of track B;
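The appearance descriptor Ψ(B) can be sketched as below; the bin count, per-channel layout, and normalization are assumptions for illustration, as the patent does not specify them:

```python
import numpy as np

def color_histogram(patch, bins=8):
    """Per-channel color histogram of one target crop, L1-normalized."""
    hists = [np.histogram(patch[..., c], bins=bins, range=(0, 256))[0]
             for c in range(3)]
    h = np.concatenate(hists).astype(float)
    return h / (h.sum() + 1e-12)

def track_descriptor(patches):
    """Appearance descriptor Psi(B): the element-wise median of the
    per-frame color histograms of track B, which is robust to a few
    occluded or blurred frames."""
    return np.median(np.stack([color_histogram(p) for p in patches]), axis=0)

# Five random 16x16 RGB crops stand in for one track's detections.
patches = [np.random.default_rng(s).integers(0, 256, (16, 16, 3))
           for s in range(5)]
psi = track_descriptor(patches)
```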
302. Motion continuity is obtained by predicting motion consistency with a constant-velocity motion model:
the forward deviation error δ_p and backward deviation error δ_n are calculated with the motion model;
the difference between the tracks is measured by δ = α(δ_p + δ_n);
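The forward/backward deviation errors of the constant-velocity model can be sketched as follows; the one-frame gap between the two tracklets, the value of α, and the function name are assumptions for illustration:

```python
import numpy as np

def forward_backward_errors(track_a, track_b, alpha=0.5):
    """Constant-velocity consistency between two single-view tracklets,
    where track_a ends just before track_b begins.

    delta_p extrapolates track_a's last velocity forward to track_b's
    first point; delta_n extrapolates track_b's first velocity backward
    to track_a's last point.  Returns delta = alpha * (delta_p + delta_n).
    """
    a, b = np.asarray(track_a, float), np.asarray(track_b, float)
    gap = 1  # frames between a's last and b's first point (assumed)
    v_a = a[-1] - a[-2]  # last velocity of track a
    v_b = b[1] - b[0]    # first velocity of track b
    delta_p = np.linalg.norm(a[-1] + gap * v_a - b[0])  # forward error
    delta_n = np.linalg.norm(b[0] - gap * v_b - a[-1])  # backward error
    return alpha * (delta_p + delta_n)

# A target moving at constant velocity gives a (near-)zero difference.
track_a = [(0, 0), (1, 0), (2, 0)]
track_b = [(3, 0), (4, 0), (5, 0)]
delta = forward_backward_errors(track_a, track_b)
```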
Combining the similarity scores, the edge weight is calculated as the following linear combination:
Step s4, performing trajectory association with a joint optimization function according to the similarity score C to generate short cross-view trajectories;
step s5, repeating steps s2 to s4 and connecting the short cross-view trajectories to obtain long cross-view trajectories as the final tracking result; wherein the short cross-view trajectory generation process is as follows:
the similarity measure between any cross-spatiotemporal trajectory segments is obtained through the similarity score C;
according to the similarity measure, the association is solved with the following formula:

maximize Σ_ij c_ij · a_ij

where c_ij represents the similarity between track segments i and j, and a_ij is a 0-1 binary variable indicating whether the two track segments belong to the same target.
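As a concrete illustration of the joint optimization, the sketch below maximizes Σ c_ij·a_ij under a one-to-one constraint. The patent's exact objective and constraint set are not reproduced in this text, so the square score matrix, the brute-force solver, and the numbers in C are assumptions (the Hungarian algorithm or an ILP solver would be used at scale):

```python
from itertools import permutations

import numpy as np

def best_assignment(C):
    """Maximize sum_ij c_ij * a_ij over one-to-one assignments with
    a_ij in {0, 1}, by exhaustive search over permutations (fine for
    small tracklet sets)."""
    n = C.shape[0]
    best_score, best_perm = float("-inf"), None
    for perm in permutations(range(n)):
        score = sum(C[i, j] for i, j in enumerate(perm))
        if score > best_score:
            best_score, best_perm = score, perm
    A = np.zeros_like(C, dtype=int)
    for i, j in enumerate(best_perm):
        A[i, j] = 1  # a_ij = 1: tracklets i and j are the same target
    return A, best_score

# Illustrative similarity scores c_ij between three aerial tracklets
# (rows) and three ground tracklets (columns).
C = np.array([
    [0.9, 0.2, 0.1],
    [0.1, 0.8, 0.3],
    [0.2, 0.1, 0.7],
])
A, total = best_assignment(C)
```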
Claims (3)
1. A multi-target cooperative tracking method based on a high-altitude view and a ground view, characterized by comprising the following steps:
step s1, dividing the two synchronized videos of the air-ground collaborative views into V short video segments of equal length;
step s2, generating, for each video segment, short trajectories of the targets under the different views from the target detection results;
step s3, calculating similarity scores between the different short trajectories;
step s4, performing trajectory association with a joint optimization function according to the similarity score C to generate short cross-view trajectories;
step s5, repeating steps s2 to s4 and connecting the short cross-view trajectories to obtain long cross-view trajectories as the final tracking result; wherein the short cross-view trajectory generation process is as follows:
the similarity measure between any cross-spatiotemporal trajectory segments is obtained through the similarity score C;
according to the similarity measure, the association is solved with the following formula:

maximize Σ_ij c_ij · a_ij

where c_ij represents the similarity between track segments i and j, obtained by the calculation in step s3, and a_ij is a 0-1 binary variable indicating whether the two track segments belong to the same target.
2. The multi-target cooperative tracking method based on the high-altitude view and the ground view as claimed in claim 1, characterized in that the similarity score between two trajectory segments at the same time under different views in step s3 is obtained using the cross-view trajectory similarity:
201. Using spatial distribution consistency
a matching result between the high-altitude and ground views is obtained with a method based on spatial distribution:
if detection box B_m and detection box B_q are matched as the same person, then S(B_m, B_q) = 1; otherwise S(B_m, B_q) = 0;
the similarity between high-altitude-view track B_m and ground-view track B_q is expressed as follows:
where F is the number of overlapping frames, B_m is the detection result under the top view, B_q is the detection result under the ground view, |B_m| is the number of detections in track B_m, and |B_q| is the number of detections in track B_q;
202. Using appearance similarity
inter-track similarity is measured with a twin (Siamese) network;
the average images of the two trajectory segments under the air-ground collaborative views are computed and fed into the network;
features are extracted.
3. The multi-target cooperative tracking method based on the high-altitude view and the ground view as claimed in claim 1, characterized in that the similarity score between two trajectory segments at different times under the same view in step s3 is generated using cross-time trajectory consistency:
301. Appearance trajectory consistency is obtained by using a color histogram to measure the appearance similarity of single-view targets:
the target color histogram of the single-view trajectory is calculated;
the median Ψ(B) of the color histograms is taken as the appearance descriptor of track B;
302. Motion continuity is obtained by predicting motion consistency with a constant-velocity motion model:
the forward deviation error δ_p and backward deviation error δ_n are calculated with the motion model;
the difference between the tracks is measured by δ = α(δ_p + δ_n).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210311724.2A CN114648557A (en) | 2022-03-28 | 2022-03-28 | Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210311724.2A CN114648557A (en) | 2022-03-28 | 2022-03-28 | Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114648557A true CN114648557A (en) | 2022-06-21 |
Family
ID=81994694
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210311724.2A Pending CN114648557A (en) | 2022-03-28 | 2022-03-28 | Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114648557A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115619832A (en) * | 2022-12-20 | 2023-01-17 | 浙江莲荷科技有限公司 | Multi-camera collaborative multi-target track confirmation method, system and related device |
CN116843721A (en) * | 2023-08-28 | 2023-10-03 | 天津大学 | Video multi-target detection association and track generation method and device and electronic equipment |
CN116843721B (en) * | 2023-08-28 | 2024-01-19 | 天津大学 | Video multi-target detection association and track generation method and device and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113034548B (en) | Multi-target tracking method and system suitable for embedded terminal | |
Minaeian et al. | Effective and efficient detection of moving targets from a UAV’s camera | |
CN109064484B (en) | Crowd movement behavior identification method based on fusion of subgroup component division and momentum characteristics | |
Cao et al. | Vehicle detection and motion analysis in low-altitude airborne video under urban environment | |
Rout | A survey on object detection and tracking algorithms | |
CN105528794A (en) | Moving object detection method based on Gaussian mixture model and superpixel segmentation | |
CN114648557A (en) | Multi-target cooperative tracking method based on high-altitude visual angle and ground visual angle | |
CN112581503B (en) | Multi-target detection and tracking method under multiple visual angles | |
Ke et al. | Motion-vector clustering for traffic speed detection from UAV video | |
Moutakki et al. | Real-time system based on feature extraction for vehicle detection and classification | |
Najiya et al. | UAV video processing for traffic surveillence with enhanced vehicle detection | |
CN107315994B (en) | Clustering method based on Spectral Clustering space trajectory | |
CN105160649A (en) | Multi-target tracking method and system based on kernel function unsupervised clustering | |
CN102034267A (en) | Three-dimensional reconstruction method of target based on attention | |
CN104517095A (en) | Head division method based on depth image | |
Saif et al. | Crowd density estimation from autonomous drones using deep learning: challenges and applications | |
Li et al. | Vehicle object detection based on rgb-camera and radar sensor fusion | |
Wu et al. | Collective density clustering for coherent motion detection | |
CN113190711A (en) | Video dynamic object trajectory space-time retrieval method and system in geographic scene | |
CN111950524A (en) | Orchard local sparse mapping method and system based on binocular vision and RTK | |
CN102123234B (en) | Unmanned airplane reconnaissance video grading motion compensation method | |
CN116862832A (en) | Three-dimensional live-action model-based operator positioning method | |
Zhou et al. | Fast road detection and tracking in aerial videos | |
Cao et al. | Tracking vehicles as groups in airborne videos | |
Brown et al. | Feature-aided multiple target tracking in the image plane |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||