CN108256394B - Target tracking method based on contour gradient - Google Patents

Target tracking method based on contour gradient

Info

Publication number
CN108256394B
CN108256394B
Authority
CN
China
Prior art keywords: target, image, scale, gradient, angle
Prior art date
Legal status
Active
Application number
CN201611239192.7A
Other languages
Chinese (zh)
Other versions
CN108256394A (en)
Inventor
李波
左春婷
蔡宇
黄艳金
Current Assignee
China Forestry Star Beijing Technology Information Co ltd
Original Assignee
China Forestry Star Beijing Technology Information Co ltd
Priority date
Filing date
Publication date
Application filed by China Forestry Star Beijing Technology Information Co ltd
Priority to CN201611239192.7A
Publication of CN108256394A
Application granted
Publication of CN108256394B


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 - Scenes; Scene-specific elements
    • G06V20/40 - Scenes; Scene-specific elements in video content
    • G06V20/46 - Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Abstract

The invention discloses a target tracking method based on contour gradient. The method segments a candidate target from the initial frame of an input video or image sequence and extracts a reference image; extracts the contour of the reference image as a standard feature template and applies scale and angle transformations to obtain a multi-scale, multi-angle template sequence; inputs the next frame of the video or image sequence and extracts the gradient features of the target image; scans the feature image of the target image with the multi-scale, multi-angle template sequence using a sliding window of step length 1 and computes their matching similarity; and, according to the matching position, scale factor, angle factor and related information, cuts out the sub-region where the target is located in the target image to serve as the updated reference image for the next detection, repeating until all frames of the video or image sequence have been detected. The aim of the invention is to provide a contour gradient-based target tracking method that adapts to scale and/or angle changes of the target and improves the accuracy of target tracking.

Description

Target tracking method based on contour gradient
Technical Field
The invention belongs to the technical field of computer image processing, and particularly discloses a contour gradient-based target tracking method.
Background
In security monitoring tasks, automatic target tracking requires determining the position of a target of interest in a video or continuous image sequence and associating that position across the frames of the sequence.
Research on target tracking technology mainly covers two aspects. First, detecting, tracking and identifying the moving target in the captured video sequence and extracting the required information, such as the target's trajectory and related motion parameters (speed, acceleration, position, etc.). Second, using the acquired motion parameters to predict and estimate the target's state so as to assist decision making. Accurate extraction of the moving target's features is therefore a precondition for improving the accuracy of tracking, identification and classification; tracking accuracy in turn affects the accuracy and difficulty of higher-level decisions.
The conventional target tracking solution can be described as follows:
(1) Determine a template, generally called a reference image, on the acquired image sequence; the reference image records the target to be tracked.
(2) Take each pixel of the reference image as a feature point to form an original feature point set; or, to improve efficiency, sample pixels uniformly from the reference image at equal intervals (equal-interval sampling) to form the original feature point set.
(3) Compute a new feature point set from the original feature point set and its neighborhoods, and determine a matching region on the image to be matched according to the new feature point set.
(4) Calculate gray-scale or texture information between the matching region and the reference template, and obtain a matching-coefficient matrix between them by iteratively minimizing the error; the region corresponding to the maximum matching coefficient is the tracked target. A sketch of this step is given below.
(5) Repeat steps (3)-(4) on the acquired image sequence; continuous target tracking is finally realized through template matching between the image frames.
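For illustration only, a minimal NumPy sketch of step (4) using normalized cross-correlation as the matching coefficient (one common choice of gray-scale matching measure; the patent text does not fix a particular formula, and the function name below is hypothetical):

```python
import numpy as np

def ncc_match(image, template):
    """Exhaustively slide `template` over `image` and return the top-left
    corner of the window with the highest normalized cross-correlation."""
    ih, iw = image.shape
    th, tw = template.shape
    t = template - template.mean()
    t_norm = np.sqrt((t ** 2).sum())
    best_score, best_pos = -1.0, (0, 0)
    for y in range(ih - th + 1):
        for x in range(iw - tw + 1):
            w = image[y:y + th, x:x + tw]
            w = w - w.mean()
            denom = np.sqrt((w ** 2).sum()) * t_norm
            if denom > 0:
                score = float((w * t).sum()) / denom
                if score > best_score:
                    best_score, best_pos = score, (x, y)
    return best_pos, best_score
```

Note that every window position requires a full pass over all template pixels; this is exactly the computational burden that the invention's edge-point formulation later removes.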
The traditional target tracking method has the following defects:
(1) Once the reference image is obtained, it is used as a fixed standard template and never updated. In an actual tracking system, however, when the camera or the target moves, or the target changes scale or rotates, part of the reference image may move out of the camera's field of view, so that a portion of it no longer appears in subsequent frames; continuing to use the initial reference image can then cause tracking failure and loss of the target. A tracking method with strong anti-interference capability therefore has real application value.
(2) The feature points of the reference image are highly random, usually carry little image information, and cannot represent the image features well; their low reliability and stability leave the tracking algorithm without good robustness.
(3) The feature-matching computation is hard to run in real time, which limits the response speed of target tracking.
Disclosure of Invention
The purpose of the invention is to provide a target tracking method based on contour gradient that adapts to scale and/or angle changes of the target and improves the accuracy of target tracking. First, the contour of the specified target and its gradient vectors are extracted with the Sobel operator as a standard-scale template, and a multi-scale, multi-angle template sequence is obtained by sampling in scale and angle. Second, the best matching information, comprising the position, the scale factor and the angle factor, is obtained by template matching. Finally, the best matching region is cut out of the target image according to this information, and its contour and gradient vectors serve as the updated standard-scale template.
The purpose of the invention is realized by the following technical scheme:
A target tracking method based on contour gradient comprises the following steps:
Step 1: Segment a candidate target from the initial frame of the input video or image sequence, and extract a reference image containing the candidate target.
Step 2: Extract the contour of the target reference image to obtain the coordinate sequence p_i = (x_i, y_i)^T of each pixel of the reference image and the corresponding horizontal/vertical gradient sequence d_i = (t_i, u_i)^T, which together form the standard feature template.
If the size of the target reference image is m × n and the number of edge points detected under the preset threshold condition T is L, then i = 1, 2, 3, ..., m × n, where only L points are Sobel edge points; the gradients corresponding to the other, non-edge points are defined as (0, 0).
The Sobel edge detection process convolves the Sobel operator with all pixels of the reference image and their neighborhoods, then determines the edge points according to the preset threshold condition T.
Through Sobel edge detection, each point p_i = (x_i, y_i)^T and its corresponding gradient direction d_i = (t_i, u_i)^T are obtained.
Step 3: Apply scale and angle transformations to the standard feature template to obtain the multi-scale, multi-angle template sequence (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T, enhancing the robustness of matching.
Step 4: Input the next frame of the video or image sequence and extract the gradient features g_r = (v_r, w_r)^T of the target image.
Perform Sobel edge detection on the target image of size M × N with the same method and preset threshold condition as in step 2 to obtain the gradient features g_r = (v_r, w_r)^T of the target image, where r = 1, 2, 3, ..., M × N and L' is the number of edge points detected in the target image; the gradient has a nonzero value if and only if the pixel is an edge point, and is (0, 0) otherwise.
Step 5: Scan the feature image of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, ..., (P_iM', d_i)^T obtained in step 3, using a sliding window with a fixed step length; compute the matching similarity of the two, compare the similarities obtained under all templates and all window positions, and take the window position corresponding to the maximum similarity as the best matching position, which is the final target tracking result for the current frame.
Step 6: According to the best matching position, scale factor and angle factor, cut out the sub-region where the target is located in the target image, and use it as the updated reference image for the next detection.
Step 7: Repeat steps 2 through 6 until all frames of the video or image sequence have been detected.
Further, the method of applying appropriate scale and angle transformations to the standard feature template in step 3 comprises:
a) For the standard-scale template sequence (p_i, d_i)^T, enlarge/reduce the abscissa x_i of each point p_i = (x_i, y_i)^T by a factor s_x and the ordinate y_i by a factor s_y.
b) Processing with different transformation factors s_x, s_y yields several multi-scale feature template sequences (P_i1, d_i)^T, (P_i2, d_i)^T, (P_i3, d_i)^T, ..., (P_ik, d_i)^T, where k is the number of different scale transformations performed.
c) Apply appropriate angle transformations to all the multi-scale feature templates obtained in step b): rotate each multi-scale feature template sequence (P_ij, d_i)^T (j = 1, 2, 3, ..., k) by an angle θ, taking rightward rotation as the positive direction. Each point P_ij = (X_ij, Y_ij)^T of the template sequence (P_ij, d_i)^T is rotated clockwise by θ about (0, 0) to obtain the point P_ij' = (X_ij', Y_ij')^T, expressed mathematically as:

$$\begin{pmatrix} X_{ij}' \\ Y_{ij}' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} X_{ij} \\ Y_{ij} \end{pmatrix}$$

d) This yields the M groups of multi-scale, multi-angle template sequences (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T.
Further, the scale transformation factors s_x, s_y range from 0.9 to 1.1, and the angle θ ranges from -30° to 30°.
Further, the method of calculating the similarity in step 5 comprises:
1) For any point (x, y) on the target image, denote its gradient direction as g_{x,y} = (v_{x,y}, w_{x,y})^T.
2) When the feature template window of the reference image is matched against a region of the same size in the target image under detection, the matching similarity s of the two is defined as the normalized sum of the cosines of the angles between the gradient vectors at corresponding coordinate positions in the feature template matrix and the feature matrix of the target image.
3) Since the gradient vector of a non-edge point is (0, 0), the similarity simplifies to the normalized sum of the cosines of the angles between the gradient vectors of corresponding edge points at the same coordinate positions:

$$s(x, y) = \frac{1}{L} \sum_{i=1}^{L} \frac{\left| t_i \, v_{x+x_i,\,y+y_i} + u_i \, w_{x+x_i,\,y+y_i} \right|}{\sqrt{t_i^2 + u_i^2}\,\sqrt{v_{x+x_i,\,y+y_i}^{2} + w_{x+x_i,\,y+y_i}^{2}}}$$

In the formula, (x, y) is the coordinate of the upper-left corner of the position of the sliding window in the target image; the window position may be represented directly by this point or converted to the coordinates of the window's center. The similarity s ranges from 0 to 1.
4) Scan the feature matrix of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T in turn, using a sliding window with step length 1; compare the similarities obtained under all templates and all window positions; the window position corresponding to the maximum similarity is the best matching position and the final target tracking result for the current frame.
Further, the method of fast similarity computation in step 5 uses the Fast Fourier Transform (FFT) to carry out the convolution operation in the frequency domain, as follows:
(1) For any multi-scale, multi-angle feature template (p_ik, d_ik)^T, define two gradient direction component matrices T_x, T_y of size m × n, where m and n are determined by the maximum and minimum horizontal and vertical coordinates of p_ik. With p_ik = (x_ik, y_ik)^T and d_ik = (t_ik, u_ik)^T, assign values to T_x and T_y according to the coordinate values of p_ik: the element of T_x in column x_ik, row y_ik is assigned t_ik, and the element of T_y in column x_ik, row y_ik is assigned u_ik.
(2) In the same way as (1), assign values to obtain the two gradient direction component matrices O_x, O_y of the target image.
(3) The convolution of T_x and O_x is realized through the FFT:

$$T_x \star O_x = \mathcal{F}^{-1}\!\left(\mathcal{F}(O_x) \odot \overline{\mathcal{F}(T_x)}\right)$$

where $\mathcal{F}$ denotes the two-dimensional FFT, $\mathcal{F}^{-1}$ its inverse, $\odot$ element-wise multiplication, and the overline complex conjugation; that is,

$$\hat{T}_x = \mathcal{F}(T_x), \quad \hat{T}_y = \mathcal{F}(T_y), \quad \hat{O}_x = \mathcal{F}(O_x), \quad \hat{O}_y = \mathcal{F}(O_y)$$

Calculation then gives:

$$c_x = \mathcal{F}^{-1}\!\left(\hat{O}_x \odot \overline{\hat{T}_x}\right)$$

and similarly:

$$c_y = \mathcal{F}^{-1}\!\left(\hat{O}_y \odot \overline{\hat{T}_y}\right)$$

(4) According to the equivalence between convolution and the Fast Fourier Transform (FFT), the calculation results are processed as follows to obtain the similarity matrix:

$$S = \frac{1}{L}\left(c_x + c_y\right)$$

i.e. the set of matching similarities s.
The invention has the following beneficial effects:
First, the invention extracts the contour gradient of the target as the feature and applies a gradient threshold, yielding a reasonable and effective feature template based on edge-point gradient vectors and effectively realizing target tracking.
Second, the invention applies appropriate scale and rotation transformations to the feature template to obtain a multi-scale, multi-angle feature template, avoiding target loss caused by motion of the image acquisition equipment (such as a camera) or of the target, or by changes in the target's scale or rotation.
Third, the method segments the target in time and updates the template in real time, avoiding the tracking failures caused by the fixed template of the traditional approach.
Fourth, compared with the prior art, the method uses edge-point gradient vectors as the feature template, and non-edge points do not participate in the feature-matching computation, greatly reducing the amount of calculation.
Fifth, the method also provides a fast computation scheme for feature matching that reduces the amount of calculation by orders of magnitude and adapts well to the real-time requirements of security monitoring systems.
Drawings
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings, in which:
FIG. 1 is a flow chart of an implementation of a contour gradient-based target tracking method according to the present invention;
FIG. 2 is a diagram illustrating the segmentation of candidate targets from an initial frame according to the present invention;
FIG. 3 is a schematic diagram of the sobel operator of the present invention;
FIG. 4 is a diagram illustrating a neighborhood of a pixel in an image according to the present invention;
FIG. 5 is a schematic diagram of a standard feature template of the present invention.
Detailed Description
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. It should be understood that the preferred embodiments are illustrative of the invention only and are not limiting upon the scope of the invention.
As shown in Figs. 1-5, the first embodiment provides a target tracking method based on contour gradient comprising three stages: establishing a feature template, matching the feature template against the target image, and updating the feature template.
step1, segmenting candidate targets from an initial frame of an input video or image sequence, and extracting a reference image containing the candidate targets. This step is illustrated in figure 2. According to the requirements of practical application, the step is taken as the first step of the whole process, and can be realized by manual labeling or by implementing a complete segmentation algorithm on the target.
Step 2: Extract the contour of the target reference image to obtain the coordinate sequence p_i = (x_i, y_i)^T of each pixel of the reference image and the corresponding horizontal/vertical gradient sequence d_i = (t_i, u_i)^T as the standard feature template.
Note that if the size of the target reference image is m × n and the number of edge points detected under the preset threshold condition T is L, then i = 1, 2, 3, ..., m × n, where only L points are Sobel edge points; the gradients of the remaining non-edge points are defined as (0, 0).
The Sobel edge detection process convolves all pixels of the reference image and their neighborhoods with the Sobel operators (G_x and G_y, shown in Fig. 3), then determines the edge points according to the preset threshold condition T.
Let the pixel values of a pixel (x, y) in the reference image and its neighborhood be distributed as shown in Fig. 4.
Through Sobel edge detection, each point p_i = (x_i, y_i)^T and its corresponding gradient direction d_i = (t_i, u_i)^T are obtained. With f(·,·) denoting the pixel value, the gradient components given by the standard 3 × 3 Sobel kernels are:

$$t_i = \left[f(x+1, y-1) + 2f(x+1, y) + f(x+1, y+1)\right] - \left[f(x-1, y-1) + 2f(x-1, y) + f(x-1, y+1)\right]$$

$$u_i = \left[f(x-1, y+1) + 2f(x, y+1) + f(x+1, y+1)\right] - \left[f(x-1, y-1) + 2f(x, y-1) + f(x+1, y-1)\right]$$

This yields the prototype of the feature template: the set/sequence constructed from the gradient features d_i^T of the reference image, referred to here as the standard feature template. It has the same size as the reference image, m × n; an example of the distribution of gradient feature values is shown in Fig. 5.
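A minimal NumPy/SciPy sketch of this step follows. The threshold condition T is modeled here, as an assumption, as a simple test on gradient magnitude, and the function names are illustrative:

```python
import numpy as np
from scipy.signal import correlate2d

# 3x3 Sobel kernels Gx and Gy, as depicted in Fig. 3
GX = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
GY = np.array([[-1, -2, -1], [0, 0, 0], [1, 2, 1]], dtype=float)

def sobel_gradients(img, threshold):
    """Return per-pixel gradient components (t, u); non-edge pixels are
    zeroed out, matching the convention that their gradient is (0, 0)."""
    img = img.astype(float)
    t = correlate2d(img, GX, mode='same', boundary='symm')
    u = correlate2d(img, GY, mode='same', boundary='symm')
    edge = np.hypot(t, u) > threshold   # preset threshold condition T
    return t * edge, u * edge

def extract_template(ref, threshold):
    """Step 2: collect the L edge points p_i = (x_i, y_i) of the reference
    image together with their gradients d_i = (t_i, u_i)."""
    t, u = sobel_gradients(ref, threshold)
    ys, xs = np.nonzero((t != 0) | (u != 0))
    points = np.stack([xs, ys], axis=1).astype(float)
    grads = np.stack([t[ys, xs], u[ys, xs]], axis=1)
    return points, grads
```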
Step 3: Apply scale and angle transformations to the standard feature template to obtain the multi-scale, multi-angle template sequence (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T, enhancing the robustness of matching.
First apply an appropriate scale transformation to the standard feature template: for the standard-scale template sequence (p_i, d_i)^T, enlarge (or reduce) the abscissa x_i of each point p_i = (x_i, y_i)^T by a factor s_x and the ordinate y_i by a factor s_y, with the scale transformation factors s_x, s_y in the range 0.9 to 1.1. Writing the scaled result as P_i = (X_i, Y_i)^T, the process is expressed mathematically as:

$$\begin{pmatrix} X_i \\ Y_i \end{pmatrix} = \begin{pmatrix} s_x & 0 \\ 0 & s_y \end{pmatrix} \begin{pmatrix} x_i \\ y_i \end{pmatrix}$$

Processing with different transformation factors s_x, s_y yields several multi-scale feature template sequences (P_i1, d_i)^T, (P_i2, d_i)^T, (P_i3, d_i)^T, ..., (P_ik, d_i)^T, where k is the number of different scale transformations performed.
Next, apply appropriate angle transformations to all the multi-scale feature templates obtained. Taking the upper-left corner of the reference image as the center, rotate each multi-scale feature template sequence (P_ij, d_i)^T (j = 1, 2, 3, ..., k) by an angle θ, with rightward rotation taken as the positive direction and θ in the range -30° to 30°. Each point P_ij = (X_ij, Y_ij)^T of the template sequence (P_ij, d_i)^T is rotated by θ about (0, 0) to obtain the point P_ij' = (X_ij', Y_ij')^T, expressed mathematically as:

$$\begin{pmatrix} X_{ij}' \\ Y_{ij}' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} X_{ij} \\ Y_{ij} \end{pmatrix}$$

The method is not limited to using the upper-left corner of the reference image as the rotation center; choosing another fixed point, such as the center of the reference image, is in essence the same as this embodiment.
Through the above processing, the M groups of multi-scale, multi-angle template sequences (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T are finally obtained.
The multi-scale, multi-angle feature templates obtained by this process solve the problem of target loss caused by motion of the image acquisition equipment (such as a camera) or of the target, or by changes in the target's scale or angle; they offer good robustness, and the transformation process is simple and fast.
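A sketch of step 3 in the same vein is given below. The scale and angle sampling grids are illustrative choices within the stated 0.9-1.1 and -30° to 30° ranges, s_x = s_y is assumed for brevity, and the patent's notation, which keeps d_i unchanged under rotation, is followed here:

```python
import numpy as np

def generate_templates(points, grads,
                       scales=(0.9, 1.0, 1.1),
                       angles_deg=range(-30, 31, 10)):
    """Step 3: build the multi-scale, multi-angle sequence (P_ij', d_i)^T
    from the edge points and gradients of the standard feature template."""
    templates = []
    for s in scales:
        scaled = points * s                      # (s_x * x_i, s_y * y_i)
        for a in angles_deg:
            th = np.deg2rad(a)
            # clockwise rotation by theta about the origin (upper-left corner)
            rot = np.array([[np.cos(th),  np.sin(th)],
                            [-np.sin(th), np.cos(th)]])
            pts = scaled @ rot.T
            pts -= pts.min(axis=0)               # shift back so x, y >= 0
            templates.append((pts, grads, s, a))
    return templates
```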
Step 4: Input the next frame of the video or image sequence and extract the gradient features g_r = (v_r, w_r)^T of the target image.
Perform Sobel edge detection on the target image of size M × N with the same method and preset threshold condition as in step 2 to obtain the gradient features g_r = (v_r, w_r)^T of the target image, where r = 1, 2, 3, ..., M × N and L' is the number of edge points detected in the target image; the gradient has a nonzero value if and only if the pixel is an edge point, and is (0, 0) otherwise.
The concrete implementation of this step is as described in step 2 and is not repeated here.
Step 5: Scan the feature image of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T obtained in step 3, using a sliding window with step length 1, and compute the matching similarity of the two. The position where the similarity reaches its maximum is the best matching position, and the multi-scale, multi-angle template sequence used for matching also gives the scale factor and angle factor corresponding to the best match. The concrete matching algorithm is as follows.
For any point (x, y) on the target image, denote its gradient direction as g_{x,y} = (v_{x,y}, w_{x,y})^T. When the feature template window of the reference image is matched against a region of the same size in the target image under detection, the matching similarity s of the two is defined as the normalized sum of the cosines of the angles between the gradient vectors at corresponding coordinate positions in the feature template matrix and the feature matrix of the target image.
In fact, since the gradient vectors of non-edge points are (0, 0), the similarity simplifies to the normalized sum of the cosines of the angles between the gradient vectors of corresponding edge points at the same coordinate positions. Expressed mathematically:

$$s(x, y) = \frac{1}{L} \sum_{i=1}^{L} \frac{\left| t_i \, v_{x+x_i,\,y+y_i} + u_i \, w_{x+x_i,\,y+y_i} \right|}{\sqrt{t_i^2 + u_i^2}\,\sqrt{v_{x+x_i,\,y+y_i}^{2} + w_{x+x_i,\,y+y_i}^{2}}}$$

In the formula, (x, y) is the coordinate of the upper-left corner of the position of the sliding window in the target image; the window position may be represented directly by this point or converted to the coordinates of the window's center. The similarity s ranges from 0 to 1.
Scan the feature matrix of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, ..., (P_iM', d_i)^T in turn, using a sliding window with step length 1; compare the similarities obtained under all templates and all window positions; the window position corresponding to the maximum similarity is the best matching position and the final target tracking result for the current frame.
It can be seen that the contour gradient vectors extracted by the Sobel operator are used as features for matching, so only the edge points enter the computation; compared with a traditional feature template, this greatly reduces the amount of calculation and improves the real-time performance of the algorithm.
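The matching scan can be sketched directly, without optimization, as below. The absolute value in the cosine keeps s within the stated 0-1 range, and the small eps guarding against division by zero is an implementation assumption:

```python
import numpy as np

def match(grad_x, grad_y, templates, eps=1e-12):
    """Step 5: sliding-window scan with step length 1 over all templates;
    returns the maximum similarity and its (x, y, scale, angle)."""
    H, W = grad_x.shape
    best_sim, best_info = -1.0, (0, 0, 1.0, 0)
    for pts, grads, s, a in templates:
        ip = np.round(pts).astype(int)
        xs, ys = ip[:, 0], ip[:, 1]
        h, w = ys.max() + 1, xs.max() + 1
        norm_d = np.hypot(grads[:, 0], grads[:, 1]) + eps
        for y in range(H - h + 1):
            for x in range(W - w + 1):
                v = grad_x[y + ys, x + xs]          # target gradients under
                wv = grad_y[y + ys, x + xs]         # the template's edge points
                norm_g = np.hypot(v, wv) + eps
                cos = np.abs(grads[:, 0] * v + grads[:, 1] * wv) / (norm_d * norm_g)
                sim = float(cos.mean())             # (1/L) * sum of cosines
                if sim > best_sim:
                    best_sim, best_info = sim, (x, y, s, a)
    return best_sim, best_info
```

Non-edge target pixels contribute essentially zero to the sum, so the score is high only where the template's contour lies over a matching contour in the target image.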
Step 6: According to the best matching position, scale factor, angle factor and related information, cut out the sub-region where the target is located in the target image, and use it as the updated reference image for the next detection.
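Step 6 might be realized, in simplified form, as below; the angle factor is ignored in this sketch, and a fuller version would rotate the crop back by -θ before re-extracting the template:

```python
def crop_best_region(frame, x, y, w, h, s):
    """Step 6 (simplified): cut out the matched sub-region, scaled by the
    matched scale factor s, to serve as the next reference image."""
    sw, sh = int(round(w * s)), int(round(h * s))
    x0, y0 = max(0, x), max(0, y)
    return frame[y0:y0 + sh, x0:x0 + sw].copy()
```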
Step 7: Repeat steps 2 through 6 until all frames of the video or image sequence have been detected.
Example two: in the first embodiment, the similarity calculation involves a convolution operation of the feature template of the reference image and the gradient feature value of the feature matrix of the target image within a certain window. Although based on the definition that the feature gradient corresponds to a gradient value of (0,0) at a non-edge point, the calculation process has a greatly reduced amount of calculation compared with the conventional feature matching method. However, the convolution calculation is mechanical and tedious, and still consumes a lot of time, so that the matching similarity calculation is difficult to achieve in real time.
Therefore, a fast calculation method is provided herein, which uses Fast Fourier Transform (FFT) to perform convolution operation to convert the similarity into the frequency domain, and the specific method is as follows:
step1. for any multi-scale multi-angle feature template (p)ik,dik)TDefining two gradient direction component matrices Tx、Tythe size of the matrix is m × n, and the values of m and n are respectively represented by pikIs determined by the maximum and minimum values of the horizontal and vertical coordinates of (a). p is a radical ofik=(xik,yik)T,dik=(tik,uik)TAccording to pikCoordinate value of (1) is respectively for Tx、TyAssigning a value of TxMiddle (x)ikColumn yikThe line element is assigned a value of tikWill TyMiddle (x)ikColumn yikThe row element is assigned a value of uik
Step2, in the same way, assigning values to obtain two gradient direction component matrixes O of the target imagex、OyThe assignment is similar.
Step3. consider TxAnd OxThe convolution operation is realized through FFT to obtain
Figure GDA0002473963200000111
Wherein the content of the first and second substances,
Figure GDA0002473963200000112
Figure GDA0002473963200000113
Figure GDA0002473963200000114
Figure GDA0002473963200000115
and calculating to obtain:
Figure GDA0002473963200000116
similarly, calculating to obtain:
Figure GDA0002473963200000117
step4, according to the equivalent relation between convolution and Fast Fourier Transform (FFT), the similarity matrix obtained by processing the calculation result as follows:
Figure GDA0002473963200000118
i.e. the set of matching similarities s.
According to the position of the maximum similarity in the similarity matrix, the best matching position of the feature template in the target image is obtained, realizing the target tracking result.
By adopting this fast computation scheme, the amount of calculation for feature matching is reduced by orders of magnitude, achieving the aim of real-time computation.
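A sketch of the frequency-domain computation follows. One assumption is worth flagging loudly: for the resulting matrix to equal the cosine similarity s, the gradient components written into T_x/T_y and O_x/O_y must first be normalized to unit length at edge points; the patent assigns raw t_ik, u_ik and folds the normalization into the final processing, so the pre-normalization here is an interpretation rather than the patent's literal procedure:

```python
import numpy as np

def fft_correlate(img_comp, tmpl_comp):
    """Correlate one image gradient component with the matching template
    component via the FFT convolution theorem (conjugation = correlation)."""
    shape = img_comp.shape
    F_img = np.fft.rfft2(img_comp)
    F_tmp = np.fft.rfft2(tmpl_comp, s=shape)   # zero-pad template to image size
    return np.fft.irfft2(F_img * np.conj(F_tmp), s=shape)

def fft_similarity(Ox, Oy, Tx, Ty, L):
    """Steps 3-4 of the fast method: S = (Tx*Ox + Ty*Oy) / L, one similarity
    value per window position. Assumes unit-normalized gradient components;
    valid window positions occupy the top-left (H-h+1) x (W-w+1) block of S."""
    return (fft_correlate(Ox, Tx) + fft_correlate(Oy, Ty)) / L
```

The position of the maximum entry of S within the valid block then gives the best matching position, exactly as in the sliding-window formulation, but at the cost of a few FFTs instead of one inner product per window.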
Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention, not to limit it. Although the invention has been described in detail with reference to the preferred embodiments, those skilled in the art will understand that modifications or equivalent substitutions may be made to the technical solutions of the invention without departing from their spirit and scope, and all such modifications should be covered by the claims of the present invention.

Claims (5)

1. A target tracking method based on contour gradient, characterized in that it comprises the following steps:
Step 1: segmenting a candidate target from the initial frame of the input video or image sequence, and extracting a reference image containing the candidate target;
Step 2: extracting the contour of the target reference image to obtain the coordinate sequence p_i = (x_i, y_i)^T of each pixel of the reference image and the corresponding horizontal/vertical gradient sequence d_i = (t_i, u_i)^T as the standard feature template;
wherein, if the size of the target reference image is m × n and the number of edge points detected under the preset threshold condition T is L, then i = 1, 2, 3, ..., m × n, where only L points are Sobel edge points and the gradients corresponding to the other, non-edge points are defined as (0, 0);
the Sobel edge detection process convolves the Sobel operator with all pixels of the reference image and their neighborhoods and then determines the edge points according to the preset threshold condition T;
through Sobel edge detection, each point p_i = (x_i, y_i)^T and its corresponding gradient direction d_i = (t_i, u_i)^T are obtained;
Step 3: applying scale and angle transformations to the standard feature template to obtain the multi-scale, multi-angle template sequence (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T, so as to enhance the robustness of matching;
Step 4: inputting the next frame of the video or image sequence and extracting the gradient features g_r = (v_r, w_r)^T of the target image;
performing Sobel edge detection on the target image of size M × N with the same method and preset threshold condition as in step 2 to obtain the gradient features g_r = (v_r, w_r)^T of the target image, where r = 1, 2, 3, ..., M × N and L' is the number of edge points detected in the target image; the gradient has a value if and only if the pixel is an edge point, and is (0, 0) otherwise;
Step 5: scanning the feature image of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T obtained in step 3, using a sliding window whose step length is a fixed value; calculating the matching similarity of the two, comparing the similarities obtained under all templates and all window positions, and taking the window position corresponding to the maximum similarity as the best matching position, which is the final target tracking result for the current frame;
Step 6: according to the best matching position, scale factor and angle factor, cutting out the sub-region where the target is located in the target image and updating it as the reference image for the next detection;
Step 7: repeating steps 2 through 6 until all frames of the video or image sequence have been detected.
2. The contour gradient-based target tracking method according to claim 1, wherein the method of applying appropriate scale and angle transformations to the standard feature template in step 3 comprises:
a) for the standard-scale template sequence (p_i, d_i)^T, enlarging/reducing the abscissa x_i of each point p_i = (x_i, y_i)^T by a factor s_x and the ordinate y_i by a factor s_y;
b) processing with different transformation factors s_x, s_y to obtain several multi-scale feature template sequences (P_i1, d_i)^T, (P_i2, d_i)^T, (P_i3, d_i)^T, ..., (P_ik, d_i)^T, where k is the number of different scale transformations performed;
c) applying appropriate angle transformations to all the multi-scale feature templates obtained in step b): rotating each multi-scale feature template sequence (P_ij, d_i)^T (j = 1, 2, 3, ..., k) by an angle θ, with rightward rotation taken as the positive direction; each point P_ij = (X_ij, Y_ij)^T of the template sequence (P_ij, d_i)^T is rotated clockwise by θ about (0, 0) to obtain the point P_ij' = (X_ij', Y_ij')^T, expressed mathematically as:

$$\begin{pmatrix} X_{ij}' \\ Y_{ij}' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} X_{ij} \\ Y_{ij} \end{pmatrix}$$

d) obtaining the M groups of multi-scale, multi-angle template sequences (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T.
3. The contour gradient-based target tracking method according to claim 2, wherein the scale transformation factors s_x, s_y range from 0.9 to 1.1, and the angle θ ranges from -30° to 30°.
4. The contour gradient-based target tracking method according to claim 1, wherein the method of calculating the similarity in step 5 comprises:
1) for any point (x, y) on the target image, denoting its gradient direction as g_{x,y} = (v_{x,y}, w_{x,y})^T;
2) when the feature template window of the reference image is matched against a region of the same size in the target image under detection, defining the matching similarity s of the two as the normalized sum of the cosines of the angles between the gradient vectors at corresponding coordinate positions in the feature template matrix and the feature matrix of the target image;
3) since the gradient vector of a non-edge point is (0, 0), the similarity simplifies to the normalized sum of the cosines of the angles between the gradient vectors of corresponding edge points at the same coordinate positions:

$$s(x, y) = \frac{1}{L} \sum_{i=1}^{L} \frac{\left| t_i \, v_{x+x_i,\,y+y_i} + u_i \, w_{x+x_i,\,y+y_i} \right|}{\sqrt{t_i^2 + u_i^2}\,\sqrt{v_{x+x_i,\,y+y_i}^{2} + w_{x+x_i,\,y+y_i}^{2}}}$$

in the formula, (x, y) is the coordinate of the upper-left corner of the position of the sliding window in the target image, the window position being represented directly by this point or converted to the coordinates of the window's center, and the similarity s ranges from 0 to 1;
4) scanning the feature matrix of the target image with each of the M groups of multi-scale, multi-angle templates (P_i1', d_i)^T, (P_i2', d_i)^T, (P_i3', d_i)^T, ..., (P_iM', d_i)^T in turn, using a sliding window with step length 1; comparing the similarities obtained under all templates and all window positions; the window position corresponding to the maximum similarity is the best matching position and the final target tracking result for the current frame.
5. The contour gradient-based target tracking method according to claim 1, wherein the method of fast similarity computation in step 5, which uses the Fast Fourier Transform (FFT) to carry out the convolution operation in the frequency domain, comprises:
(1) for any multi-scale, multi-angle feature template (p_ik, d_ik)^T, defining two gradient direction component matrices T_x, T_y of size m × n, where m and n are determined by the maximum and minimum horizontal and vertical coordinates of p_ik; with p_ik = (x_ik, y_ik)^T and d_ik = (t_ik, u_ik)^T, assigning values to T_x and T_y according to the coordinate values of p_ik: the element of T_x in column x_ik, row y_ik is assigned t_ik, and the element of T_y in column x_ik, row y_ik is assigned u_ik;
(2) in the same way as (1), assigning values to obtain the two gradient direction component matrices O_x, O_y of the target image;
(3) realizing the convolution of T_x and O_x through the FFT:

$$T_x \star O_x = \mathcal{F}^{-1}\!\left(\mathcal{F}(O_x) \odot \overline{\mathcal{F}(T_x)}\right)$$

wherein $\mathcal{F}$ denotes the two-dimensional FFT, $\mathcal{F}^{-1}$ its inverse, $\odot$ element-wise multiplication, and the overline complex conjugation, i.e.

$$\hat{T}_x = \mathcal{F}(T_x), \quad \hat{T}_y = \mathcal{F}(T_y), \quad \hat{O}_x = \mathcal{F}(O_x), \quad \hat{O}_y = \mathcal{F}(O_y)$$

and calculating to obtain:

$$c_x = \mathcal{F}^{-1}\!\left(\hat{O}_x \odot \overline{\hat{T}_x}\right)$$

similarly, calculating to obtain:

$$c_y = \mathcal{F}^{-1}\!\left(\hat{O}_y \odot \overline{\hat{T}_y}\right)$$

(4) according to the equivalence between convolution and the Fast Fourier Transform (FFT), processing the calculation results as follows to obtain the similarity matrix:

$$S = \frac{1}{L}\left(c_x + c_y\right)$$

i.e. the set of matching similarities s.
CN201611239192.7A 2016-12-28 2016-12-28 Target tracking method based on contour gradient Active CN108256394B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611239192.7A CN108256394B (en) 2016-12-28 2016-12-28 Target tracking method based on contour gradient

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611239192.7A CN108256394B (en) 2016-12-28 2016-12-28 Target tracking method based on contour gradient

Publications (2)

Publication Number Publication Date
CN108256394A CN108256394A (en) 2018-07-06
CN108256394B true CN108256394B (en) 2020-09-25

Family

ID=62720243

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611239192.7A Active CN108256394B (en) 2016-12-28 2016-12-28 Target tracking method based on contour gradient

Country Status (1)

Country Link
CN (1) CN108256394B (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109584250B (en) * 2018-11-28 2022-09-20 哈工大机器人(合肥)国际创新研究院 Robust method for automatically dividing and marking visual region
CN111292350B (en) * 2018-12-10 2024-03-01 北京京东乾石科技有限公司 Optimization algorithm, system, electronic device and storage medium for target orientation
CN111369599B (en) * 2018-12-25 2024-04-16 阿里巴巴集团控股有限公司 Image matching method, device, apparatus and storage medium
CN110070557A (en) * 2019-04-07 2019-07-30 西北工业大学 A kind of target identification and localization method based on edge feature detection
CN111951211A (en) * 2019-05-17 2020-11-17 株式会社理光 Target detection method and device and computer readable storage medium
CN110210565B (en) * 2019-06-05 2021-04-30 中科新松有限公司 Normalized cross-correlation image template matching realization method
CN112435211B (en) * 2020-09-03 2022-04-26 北京航空航天大学 Method for describing and matching dense contour feature points in endoscope image sequence
CN112184785B (en) * 2020-09-30 2023-03-24 西安电子科技大学 Multi-mode remote sensing image registration method based on MCD measurement and VTM
CN112014409B (en) * 2020-10-25 2021-02-05 西安邮电大学 Method and system for detecting defects of semiconductor etching lead frame die
CN113091759B (en) * 2021-03-11 2023-02-28 安克创新科技股份有限公司 Pose processing and map building method and device
CN113112516A (en) * 2021-04-01 2021-07-13 广东拓斯达科技股份有限公司 Image edge feature library construction method and device, computer equipment and storage medium
CN113139988B (en) * 2021-05-17 2023-02-14 中国科学院光电技术研究所 Image processing method for efficiently and accurately estimating target scale change
CN113538340A (en) * 2021-06-24 2021-10-22 武汉中科医疗科技工业技术研究院有限公司 Target contour detection method and device, computer equipment and storage medium
CN113450378B (en) * 2021-06-28 2022-06-03 河北工业大学 Method for judging contact group difference plane height data matching degree
CN113689397A (en) * 2021-08-23 2021-11-23 湖南视比特机器人有限公司 Workpiece circular hole feature detection method and workpiece circular hole feature detection device
CN114663316B (en) * 2022-05-17 2022-11-04 深圳市普渡科技有限公司 Method for determining edgewise path, mobile device and computer storage medium
CN115223240B (en) * 2022-07-05 2023-07-07 北京甲板智慧科技有限公司 Motion real-time counting method and system based on dynamic time warping algorithm
CN115082472B (en) * 2022-08-22 2022-11-29 江苏东跃模具科技有限公司 Quality detection method and system for hub mold casting molding product
CN115131587A (en) * 2022-08-30 2022-09-30 常州铭赛机器人科技股份有限公司 Template matching method of gradient vector features based on edge contour

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103218616A (en) * 2013-05-05 2013-07-24 西安电子科技大学 Image outline characteristic extraction method based on Gauss-Hermite special moment
CN105678806A (en) * 2016-01-07 2016-06-15 中国农业大学 Fisher discrimination-based automatic tracking method for tracking behavior trace of live pig

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103218616A (en) * 2013-05-05 2013-07-24 西安电子科技大学 Image outline characteristic extraction method based on Gauss-Hermite special moment
CN105678806A (en) * 2016-01-07 2016-06-15 中国农业大学 Fisher discrimination-based automatic tracking method for tracking behavior trace of live pig

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Contour-based object tracking with gradient-based contour attraction field; Petr Dokladal et al.; 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing; 2004-08-30; pp. III-17 to III-20 *
Active contour model based on multi-scale images (基于多尺度图像的主动轮廓线模型); Huang Haiyun et al.; Journal of Computer Research and Development (《计算机研究与发展》); 2000-10; vol. 37, no. 10, pp. 1240-1245 *
A new algorithm for moving object extraction and tracking in object-based video coding (目标基视频编码中的运动目标提取与跟踪新算法); Zhu Zhongjie et al.; Acta Electronica Sinica (《电子学报》); 2003-09; vol. 31, no. 9, pp. 1426-1428 *

Also Published As

Publication number Publication date
CN108256394A (en) 2018-07-06

Similar Documents

Publication Publication Date Title
CN108256394B (en) Target tracking method based on contour gradient
CN109784333B (en) Three-dimensional target detection method and system based on point cloud weighted channel characteristics
Le et al. Deep homography estimation for dynamic scenes
CN106875381B (en) Mobile phone shell defect detection method based on deep learning
CN109903313B (en) Real-time pose tracking method based on target three-dimensional model
CN108229475B (en) Vehicle tracking method, system, computer device and readable storage medium
CN110349207B (en) Visual positioning method in complex environment
CN106683119B (en) Moving vehicle detection method based on aerial video image
CN111028292B (en) Sub-pixel level image matching navigation positioning method
CN111046856B (en) Parallel pose tracking and map creating method based on dynamic and static feature extraction
Liu et al. Tracking objects using shape context matching
CN111860439A (en) Unmanned aerial vehicle inspection image defect detection method, system and equipment
WO2018049704A1 (en) Vehicle detection, tracking and localization based on enhanced anti-perspective transformation
CN110110618B (en) SAR target detection method based on PCA and global contrast
CN110751619A (en) Insulator defect detection method
CN109410248B (en) Flotation froth motion characteristic extraction method based on r-K algorithm
CN108961262B (en) Bar code positioning method in complex scene
CN109558877B (en) KCF-based offshore target tracking algorithm
CN113989308A (en) Polygonal target segmentation method based on Hough transform and template matching
CN115984186A (en) Fine product image anomaly detection method based on multi-resolution knowledge extraction
CN113689365B (en) Target tracking and positioning method based on Azure Kinect
Zhou et al. Target tracking based on foreground probability
Dai et al. An Improved ORB Feature Extraction Algorithm Based on Enhanced Image and Truncated Adaptive Threshold
Walha et al. Moving object detection system in aerial video surveillance
Zhou et al. Speeded-up robust features based moving object detection on shaky video

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant