CN114926500A

CN114926500A - Twin network target tracking method and system based on sorting

Info

Publication number: CN114926500A
Application number: CN202210549797.5A
Authority: CN
Inventors: 凌强; 汤峰
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2022-05-20
Filing date: 2022-05-20
Publication date: 2022-08-19

Abstract

The invention relates to a twin network target tracking method and a twin network target tracking system based on sequencing, wherein the method comprises the following steps: s1: constructing a classification and sequencing loss function, and training classification branches in the twin RPN target tracking network; s2: constructing a sequencing loss function based on IoU, and aligning classification branches and regression branches in the twin RPN target tracking network; s3: and combining the classification sorting loss function, the sorting loss function based on IoU and the original loss function in the RPN network to construct a total loss function and guide the training of the twin RPN target tracking network. According to the method provided by the invention, the target tracking precision of the existing twin RPN network can be effectively improved by constructing the sorting loss and the sorting loss function based on IoU.

Description

Twin network target tracking method and system based on sorting

Technical Field

The invention relates to the field of pattern recognition and computer vision, in particular to a twin network target tracking method and system based on sequencing.

Background

Recently, target tracking algorithms SiamRPN and SiamRPN + + based on twin networks have attracted great attention. Current twin network based trackers mainly describe visual tracking as two independent subtasks, classification and regression. When learning to classify the sub-network, these methods process each sample separately, for example, a certain sample label is a positive sample (1) or a negative sample (0), and the network outputs a classification score of 1 or 0 for the sample as much as possible under the guidance of the classification loss function, but the relationship between the positive and negative samples is not considered. This can result in the twin network tracing having difficulty in discriminating difficult negative examples (objects similar to the tracked object). In the training phase, the classification sub-network is responsible for classifying the training samples, i.e. the simple negative samples containing a large amount of semantic-free information, while some difficult negative samples, which are extremely rare, are easily swamped by a large amount of simple negative samples in the training phase. Although most non-target samples (falling on the background area) can be identified as the background by the classifier when tested, as long as an interfering target has a high foreground classification score, it can interfere with the tracker, and once its score exceeds the classification score of the true target in a certain frame, the tracker will bias towards the interfering target, resulting in tracking failure, which frequently occurs in the past twin network trackers.

Furthermore, there is a mismatch problem between classification and regression, since the classification and regression tasks are handled independently. In particular, the classification loss function causes the model to distinguish between foreground and background without taking regression branches into account. The purpose of the regression branch is to regress the bounding box of the target for all positive samples, regardless of the classification of the samples. Thus, samples with higher target box regression accuracy may have relatively lower target classification scores, while samples with higher target classification scores may yield lower regression accuracy. Therefore, how to improve the target tracking accuracy of the twin RPN network becomes a problem to be solved urgently.

Disclosure of Invention

In order to solve the technical problem, the invention provides a twin network target tracking method and system based on sorting.

The technical solution of the invention is as follows: a twin network target tracking method based on sorting comprises the following steps:

step S1: constructing a classification and sequencing loss function, and training classification branches in the twin RPN target tracking network;

step S2: constructing an IoU-based ordering loss function, and aligning classification branches and regression branches in the twin RPN target tracking network;

step S3: and combining the classification sorting loss function, the sorting loss function based on IoU and the original loss function in the RPN network to construct a total loss function and guide the training of the twin RPN target tracking network.

Compared with the prior art, the invention has the following advantages:

1. the invention discloses a twin network target tracking method based on sorting, which utilizes a sorting and sorting loss function to restrict positive sample sorting scores to be larger than difficult negative sample scores, so that the difficult negative samples can be classified as foreground targets, and the tracking failure caused by mistakenly selecting negative samples by a tracker is avoided.

2. The classification and regression branches in the twin RPN network are connected by using the IoU-based sequencing loss function, so that the classification precision and the regression prediction precision can be reflected by the classification branch prediction score.

Drawings

FIG. 1 is a flowchart of a twin network target tracking method based on sorting according to an embodiment of the present invention;

FIG. 2 is a system architecture diagram of a twin RPN target tracking network in an embodiment of the present invention;

fig. 3 is a block diagram of a twin network target tracking system based on sorting according to an embodiment of the present invention.

Detailed Description

The invention provides a twin network target tracking method based on sorting, which can effectively improve the target tracking precision of the existing twin network by constructing sorting loss and a sorting loss function based on IoU.

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings.

Example one

As shown in fig. 1, a twin network target tracking method based on sorting provided by an embodiment of the present invention includes the following steps:

step S2: constructing a sequencing loss function based on IoU, and aligning classification branches and regression branches in the twin RPN target tracking network;

As shown in fig. 2, a system architecture of a twin RPN target tracking network is shown, which has two inputs, one is a template image containing a first frame tracking target, and the other is a search image containing a subsequent frame tracking target and a background, and a backbone network is used for extracting features of two paths of images. Inputting the two characteristic graphs into an RPN module, firstly fusing the two characteristics, and respectively inputting the fused characteristics into a classification branch for predicting the category of each sample (candidate frame), namely whether the sample belongs to a foreground target or a background; and a regression branch for predicting the bounding box of the tracked target.

In one embodiment, step S1: constructing a classification and sequencing loss function, and training classification branches in a twin RPN target tracking network, wherein the method specifically comprises the following steps:

step S11: respectively calculating the foreground classification score mean value of positive and negative samples output by the classification branch in the twin RPN according to the following formula (1):

wherein, A _pos Is a positive sample set, A _neg Is a difficult negative sample set; negative sample j _- A weight coefficient of

exp () is an exponential function; positive sample j ₊ Weight w of _j+ Is composed of

Wherein N is _pos Is a positive sample set A _pos The number of the middle samples; p is a radical of _j+ And p _j- Respectively positive samples j predicted by the branch of the classification ₊ And difficult negative example j _- (ii) a foreground classification score of;

according to the embodiment of the invention, all negative samples are sorted according to the foreground classification scores output by the classification branches in the twin RPN network, the negative samples with the foreground classification scores lower than 0.5 are filtered, and the remaining negative samples form a difficult negative sample set A _neg ；

Step S12: p obtained according to step S11 ₊ And P _- And constructing a classification and sorting loss function as shown in formula (2):

wherein exp () and log () are an exponential function and a logarithmic function, respectively; beta is a parameter for controlling the size of the loss value, and alpha is a parameter for controlling the sorting distance, in the embodiment of the invention, the value of beta can be 4, and the value of alpha is 0.5. Equation (2) can constrain the mean value P of the foreground classification scores of the positive samples ₊ Greater than the mean value P of the classification of the foreground of the difficult negative sample _- In particular, if P _- Ratio P ₊ If the value is large, then L _{rank_cls} The loss value will be large; on the contrary, if P _- Ratio P ₊ If the value is small, then L _{rank_cls} The loss value will be smaller. Therefore, the neural network is in reverse propagation for L _{rank_cls} The value is as small as possible, and P is constrained _- As much as possible of P ₊ Therefore, the classification score of the difficult negative sample is effectively reduced, and the purpose of suppressing the difficult background is achieved.

The invention converts the classification problem in the twin RPN network into a sorting problem, and restricts the foreground classification score of the positive sample to be larger than the foreground classification score of the difficult negative sample. Compared with the existing classification problem, the sequencing mode provided by the invention serves as a loose constraint term, and the difficult negative sample foreground classification mean value P in the formula (2) _- May be large, fall on the foreground object class, but only require a guaranteeProve their foreground classification score mean P _- Lower than positive sample foreground classification score mean P ₊ Therefore, the tracker can be prevented from selecting negative samples by mistake, and the tracking failure can be avoided.

In one embodiment, the step S2: constructing an IoU-based sequencing loss function, and aligning classification branches and regression branches in the twin RPN target tracking network, wherein the method specifically comprises the following steps:

step S21: for positive samples i ₊ ,j ₊ ∈A _pos In other words, the following constraint is agreed, as shown in equation (3):

wherein the content of the first and second substances,

and

are respectively positive samples i ₊ And j ₊ (ii) a foreground classification score of;

and

are respectively positive samples i ₊ And j ₊ The regression score obtained by the regression branch prediction is represented by iou (intersection over union);

according to the formula (3), the current sample i ₊ Regression score of (2)

Greater than positive sample j ₊ Regression score of

Then the sample i can be constrained ₊ Foreground classification score of

Greater than positive sample j ₊ Foreground classification score of

Similarly, when the sample i is positive ₊ Foreground classification score of

Greater than positive sample j ₊ Foreground classification score of

Then the positive sample i may be constrained ₊ Regression score of

Greater than positive sample j ₊ Regression score of

Through the constraint condition of the formula (3), the score of the classification branch not only reflects the foreground classification precision of the target, but also reflects the regression precision of the target;

step S22: construction of IoU-based ordering loss function L _rank-iou As shown in the following formula (4):

wherein xp () is an exponential function; gamma is a parameter for controlling the magnitude of the loss value, and in the embodiment of the invention, the value of gamma can be 3.

The first term of equation (4) reflects the first constraint in equation (3), i.e., when

Is greater than

When the temperature of the water is higher than the set temperature,

is greater than

The smaller the value of the first term of equation (4); similarly, the second term of equation (4) reflects the second constraint in equation (3), i.e., when

Is greater than

When the temperature of the water is higher than the set temperature,

is greater than

The smaller the value of the second term of equation (4) is. Therefore, under the action of the formula (4), the higher the regression accuracy, the higher the sample classification score is, and the classification score of the classification branch can reflect the accuracy of target frame prediction to a certain extent, so that the formula (4) can connect the classification branch with the regression branch, and the classification branch can reflect the classification accuracy and the regression accuracy at the same time.

The invention constructs the sequencing loss function based on IoU, and the loss function can connect the classification branch and the regression branch on the premise of not adding additional branches, so that the classification branch can reflect the classification precision and the regression precision (namely the target frame prediction precision) at the same time.

In one embodiment, the step S3: combining the classification sorting loss function, the sorting loss function based on IoU and the original loss function in the RPN network to construct a total loss function and guide the training of a twin RPN target tracking network, specifically comprising:

sorting order loss function L _{rank_cls} IoU-based ordering loss function L _{rank_iou} And the loss function L existing in the RPN network _RPN Adding to construct the total loss function L _total Expressed by the following formula (5):

L _total ＝L _RPN +L _rank-cls +L _rank-iou (5)

through L _total The training of the twin RPN target tracking network can be guided, and the training is used for optimizing and updating network parameters.

Example two

As shown in fig. 3, an embodiment of the present invention provides a twin network target tracking system based on sorting, which includes the following modules:

a classification loss function building module 41, configured to build a classification loss function and train classification branches in the twin RPN target tracking network;

an IoU-based ranking loss function building module 42 configured to build a IoU-based ranking loss function to align classification branches and regression branches in the twin RPN target tracking network;

and a total loss function constructing module 43, configured to combine the sorted ranking loss function, the ranking loss function based on IoU, and the original loss function in the RPN network, to construct a total loss function, and guide training of the twin RPN target tracking network.

The above examples are provided for the purpose of describing the present invention only and are not intended to limit the scope of the present invention. The scope of the invention is defined by the appended claims. Various equivalent substitutions and modifications can be made without departing from the spirit and principles of the invention, and are intended to be within the scope of the invention.

Claims

1. A twin network target tracking method based on sequencing is characterized by comprising the following steps:

step S1: constructing a classification sequencing loss function, and training classification branches in the twin RPN target tracking network;

2. The twin network target tracking method based on sorting as claimed in claim 1, wherein the step S1: constructing a classification and sequencing loss function, and training classification branches in a twin RPN target tracking network, wherein the method specifically comprises the following steps:

step S11: respectively calculating the foreground classification score mean values of positive and negative samples output by the classification branches in the twin RPN according to the following formula (1):

wherein A is _pos Is a positive sample set, A _neg Is a difficult negative sample set; negative example j _- A weight coefficient of

exp () is an exponential function; positive sample j ₊ Weight of (2)

Is composed of

Wherein N is _pos Is a positive sample set A _pos The number of the middle samples; p is a radical of _j+ And p _j- Respectively positive samples j predicted by the classification branch ₊ And difficult negative example j _- (ii) a foreground classification score of;

wherein exp () and log () are exponential function and logarithmic function, respectively; beta is a parameter for controlling the size of the loss value, and alpha is a parameter for controlling the sorting distance.

3. The twin network target tracking method based on sorting as claimed in claim 1, wherein the step S2: constructing an IoU-based ordering loss function, and aligning classification branches and regression branches in the twin RPN target tracking network, wherein the method specifically comprises the following steps:

wherein the content of the first and second substances,

and

and

are respectively positive samples i ₊ And j ₊ (ii) a regression score derived from the regression branch prediction;

where γ is a parameter controlling the magnitude of the loss value, and exp () is an exponential function.

4. The twin network target tracking method based on sorting as claimed in claim 1, wherein the step S3: combining the classification ranking loss function, the IoU-based ranking loss function and the original loss function in the RPN network to construct a total loss function and guide the training of the twin RPN target tracking network, specifically comprising:

sorting the classification order loss function L _{rank_cls} The IoU-based ordering loss function L _{rank_iou} And loss function L existing in RPN network _RPN Adding to construct the total loss function L _total Expressed by the following formula (5):

L _total ＝L _RPN +L _rank-cls +L _rank-iou (5)

5. a twin network target tracking system based on sequencing is characterized by comprising the following modules:

a classification loss function building module for building a classification loss function and training classification branches in the twin RPN target tracking network;

an IoU-based ranking loss function building module for building a IoU-based ranking loss function to align classification branches and regression branches in the twin RPN target tracking network;

and a total loss function constructing module, configured to combine the classification sorting loss function, the sorting loss function based on IoU, and an original loss function in the RPN network to construct a total loss function, and guide training of the twin RPN target tracking network.