CN109993776A - A kind of correlation filtering method for tracking target and system based on multistage template - Google Patents

A kind of correlation filtering method for tracking target and system based on multistage template Download PDF

Info

Publication number
CN109993776A
CN109993776A CN201910270129.7A CN201910270129A CN109993776A CN 109993776 A CN109993776 A CN 109993776A CN 201910270129 A CN201910270129 A CN 201910270129A CN 109993776 A CN109993776 A CN 109993776A
Authority
CN
China
Prior art keywords
response
template
translation
max
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910270129.7A
Other languages
Chinese (zh)
Other versions
CN109993776B (en
Inventor
严军荣
钟鲁超
姚英彪
姜显扬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dianzi University
Hangzhou Electronic Science and Technology University
Original Assignee
Hangzhou Electronic Science and Technology University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Electronic Science and Technology University filed Critical Hangzhou Electronic Science and Technology University
Priority to CN201910270129.7A priority Critical patent/CN109993776B/en
Publication of CN109993776A publication Critical patent/CN109993776A/en
Application granted granted Critical
Publication of CN109993776B publication Critical patent/CN109993776B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/262Analysis of motion using transform domain methods, e.g. Fourier domain methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20048Transform domain processing
    • G06T2207/20056Discrete and fast Fourier transform, [DFT, FFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a kind of correlation filtering method for tracking target and system based on multistage template.Its method includes the size for calculating multistage translation filter template, determines multistage translation filter template, determines multistage translation filter output response output, judge to translate filter template whether meet peak value of response requirement, in multistage translation filter template the suitable response output of selection, according to translating filter prediction target's center in the position of present frame.Method and system of the invention, which are solved, quickly to be moved in face of target and when the mixed and disorderly situation of background, the ineffective technical problem of tracker.

Description

A kind of correlation filtering method for tracking target and system based on multistage template
Technical field
The invention belongs to computation vision target tracking domain, in particular to a kind of correlation filtering target based on multistage template Tracking and system.
Background technique
Target following is the basic research of computer vision field, in sides such as precision control, automatic Pilot and robots Face has a wide range of applications.Nearest lot of domestic and foreign researcher achieves many significant achievements in succession in this direction, But in such methods application process, there is various challenges, including partial occlusion, quickly movement, motion blur, background are miscellaneous Disorderly, situations such as illumination change, this kind of challenge causes tracker that can not accurately track target or even can track failure.To overcome view The challenge content being likely to occur in frequency sequence, the tracking tool for designing a kind of robustness acquire a certain degree of difficulty.
Correlation filtering tracking is the MOSSE proposed by Bolme earliest, and this method is that the research of subsequent related fields is established Basis is determined.Henriques is by introducing circulating sampling and utilizing Fourier's diagonalization property of Cyclic Moment speed-raising training and inspection Survey process proposes CSK.Later, Henriques is further improved CSK, introduces kernel function and extends multichannel direction gradient feature (HOG) feature extraction KCF.Danelljan M uses multichannel color characteristic Color Names (CN) on the basis of CSK, The colouring information for efficiently using target, proposes CN.
Although these methods have been achieved with all well and good effect in target tracking domain, all only wrapped in their frame Containing a filter template, which immobilizes, and the detection range of target must be consistent with the size of target, This causes target search range limited.When there is the too fast equal peculair motions situation of movement velocity in target, the correlation of single template Filtering method tends not to well adapt to, and leads to tracking drift even tracking failure.A kind of target that faces is needed quickly to move Guarantee the technical solution of tracker effect when situation mixed and disorderly with background, for this purpose, proposing a kind of double-template dimension self-adaption correlation filter Wave method for real time tracking and system.
Summary of the invention
When the technical problem to be solved by the present invention is to quickly move situation mixed and disorderly with background in face of target, tracker effect Bad problem proposes a kind of correlation filtering method for tracking target and system based on multistage template.
The x-y coordinate system for indicating image pixel positions, target's center position (x are established in advancen,yn) indicate, wherein N indicates frame number.Target's center position (the x of video sequence first frame1,y1) setting in advance, target size (high, width) is in advance Setting, the velocity and acceleration of prior estimating target motion.
Correlation filtering method for tracking target based on multistage template of the invention, comprising the following steps:
Calculate the size of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, ) and the estimation peak value v of target speed widthmaxThe full-size for calculating translation filter, with variable sizemax= (size_highmax,size_widthmax) indicate, according to the estimated acceleration a of target movement, target size and translation filter Full-size calculates the rank m of translation filter template, calculates filter templates at different levels according to translation filter template rank m Size is denoted as window_sz_i, wherein 1≤i≤m.
It is described
The rank of the translation filter template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance.
The window_sz_1=Tmin(high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.
Determine multistage translation filter template: heart position (x in the targetn,yn), the ruler of filter template is translated according to m grades Very little window_sz_i determines Gaussian label yf_i, interception image block patch_for_train_i_n, and wherein n is indicated Frame number;Image block characteristics are extracted respectively, and addition Cosine Window obtains translation features sample xf_for_train_i_n, high using translation This type label and translation feature samples obtain m various sizes of translation filter templates, are indicated with α _ i;
The translation filter templateWherein, α indicates α _ i,It indicates in inverse Fu Leaf transformation, ()*Indicate conjugation,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is nuclear matrix K Generation sample Fourier transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.
It determines the output response output of multistage translation filter: enabling n=n+1, video sequence n-th frame is read, in the (n-1)th frame Target's center position (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_for_ Det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, utilizes translation Response output matrix response_i and peak value of response max_response_i is calculated in template α _ i.
It is described Indicate inverse Fourier transform,Indicate Fourier Transformation, ⊙ representing matrix element point multiplication operation symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.
Judge translate filter template whether meet peak value of response requirement: judge peak value of response max_response_i (i's Initial value is 1) whether to be greater than output response threshold value R set in advance, if so, determining i-stage translation translation filter response Peak value meet demand enables response output matrix response=response_i, peak value of response max_response=max_ Response_i is entered step: otherwise prediction target's center determines that i-stage translates filter response peak in the position of present frame Value is unsatisfactory for demand, and i=i+1 is entered step: the suitable response output of selection in multistage translation filter template.
The suitable response output of selection in multistage translation filter template: compare the translation filter template of two ranks The peak value of response being calculated selects (i-1)-th grade of translation filter if max_response_i-1 > max_response_i The response of template exports, even response output matrix response=response_i-1, peak value of response max_response= max_response_i-1;Otherwise return step: judge to translate whether filter template meets peak value of response requirement.
According to translation filter prediction target's center in the position of present frame: exporting peak value according to the response of translation filter Position of the max_response in response output matrix response, position (x of the prediction target's center in current n-th framen, yn).Return step: multistage translation filter template is determined.
Correlation filtering Target Tracking System based on multistage template of the invention, comprising:
Video sequence;
Computer;
And
One or more programs, wherein one or more of programs are stored in the memory of computer, and by It is configured to be executed by the processor of the computer, described program includes:
Calculate the size module of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, ) and the estimation peak value v of target speed widthmaxThe full-size for calculating translation filter, with variable sizemax= (size_highmax,size_widthmax) indicate, according to the estimated acceleration a of target movement, target size and translation filter Full-size calculates the rank m of translation filter template, calculates filter templates at different levels according to translation filter template rank m Size is denoted as window_sz_i, wherein 1≤i≤m.
It is described
The rank of the translation filter template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance.
The window_sz_1=Tmin(high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.
Determine multistage translation filter template module: heart position (x in the targetn,yn), according to m grades of translation filter templates Size window_sz_i, determine Gaussian label yf_i, interception image block patch_for_train_i_n, wherein n Indicate frame number;Extract image block characteristics respectively, addition Cosine Window obtains translation features sample xf_for_train_i_n, using flat It moves Gaussian label and translation feature samples obtains m various sizes of translation filter templates, indicated with α _ i;
The translation filter templateWherein, α indicates α _ i,It indicates in inverse Fu Leaf transformation, ()*Indicate conjugation,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is nuclear matrix K Generation sample Fourier transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.
It determines the output response output module of multistage translation filter: enabling n=n+1, read video sequence n-th frame, the N-1 frame target's center position (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_ For_det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, utilizes Response output matrix response_i and peak value of response max_response_i is calculated in translation template α _ i.
It is described Indicate inverse Fourier transform,Indicate Fourier Transformation, ⊙ representing matrix element point multiplication operation symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.
Judge to translate whether filter template meets peak value of response requirement module: judging peak value of response max_response_i Whether (initial value of i is 1) is greater than output response threshold value R set in advance, if so, determining i-stage translation translation filter Peak value of response meet demand enables response output matrix response=response_i, peak value of response max_response= Otherwise max_response_i determines i-stage translation filter response into prediction target's center in the position module of present frame Peak value is unsatisfactory for demand, i=i+1, into the suitable response output module of selection in multistage translation filter template.
The suitable response output module of selection in multistage translation filter template: compare the translation filter of two ranks The peak value of response that formwork calculation obtains selects (i-1)-th grade of translation filter if max_response_i-1 > max_response_i The response of wave device template exports, even response output matrix response=response_i-1, peak value of response max_ Response=max_response_i-1;Otherwise it returns and judges whether translation filter template meets peak value of response and want modulus Block.
According to translation filter prediction target's center present frame position module: according to translation filter response export Position of the peak value max_response in response output matrix response, predicts target's center in the position of current n-th frame (xn,yn).It returns and determines multistage translation filter template module.
Present invention has the advantage that
(1) when facing fast-moving target, suitable filter template quantity and filter template size are chosen, even if Target is moved with maximum speed, and detection range still may include complete target;
(2) using multistage translation filter template predicted motion target position, make target detection process searches variable range, It is adapted to the changeable movement velocity of target;
(3) in final future position, the testing result of multiple filter templates is compared, until response exports peak value It is more other than upper level low, can to avoid filter template it is excessive caused by include excessive mixed and disorderly background and response in detection range Export influence of the fluctuation generated to final predicted position;
Detailed description of the invention
Fig. 1 is the correlation filtering method for tracking target flow chart based on multistage template of the embodiment of the present invention;
Fig. 2 is the correlation filtering Target Tracking System structural schematic diagram based on multistage template of the embodiment of the present invention.
Specific embodiment
It elaborates below to the preferred embodiment of the present invention.
The x-y coordinate system for indicating image pixel positions, target's center position (x are established in advancen,yn) indicate, wherein N indicates frame number.Target's center position (the x of video sequence first frame1,y1) setting in advance, target size (high, width) is in advance Setting, the velocity and acceleration of prior estimating target motion.In the present embodiment, in image pixel positions coordinate system, image upper left The position of angle pixel is (1,1), provides target's center position (x in first frame image1,y1)=(48,65), target size is 20 pixels × 20 pixels, i.e. high=20, width=20, the velocity peak values v of prior estimating target motionmax=2.5pix/ms, Acceleration a=0.025pix/ms2
Correlation filtering method for tracking target based on multistage template of the invention, comprising the following steps:
Calculate the size of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, ) and the estimation peak value v of target speed widthmaxThe full-size for calculating translation filter, with variable sizemax= (size_highmax,size_widthmax) indicate, according to the estimated acceleration a of target movement, target size and translation filter Full-size calculates the rank m of translation filter template, calculates filter templates at different levels according to translation filter template rank m Size is denoted as window_sz_i, wherein 1≤i≤m.
It is described
The rank of the translation filter template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance.
The window_sz_1=Tmin(high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.In the present embodiment, video frame rate 50fps is then calculated Step-size factor γ=16 set in advance, minimum step T set in advancemin= 1.2, then calculate the rank of translation filter templateAnd it is full Foot Then m=5, root The size of filter templates at different levels, respectively window_sz_1=T are calculated according to translation filter template rank mmin·(high, Width)=(24,24), window_sz_2=(Tmin+ (i-1) γ a) (high, width)=(1.2+16 × 0.025) × 20=(32,32), and so on, window_sz_3=(40,40), window_sz_4=(48,48), Window_sz_5=(56,56).
Determine multistage translation filter template: heart position (x in the targetn,yn), the ruler of filter template is translated according to m grades Very little window_sz_i determines Gaussian label yf_i, interception image block patch_for_train_i_n, and wherein n is indicated Frame number;Image block characteristics are extracted respectively, and addition Cosine Window obtains translation features sample xf_for_train_i_n, high using translation This type label and translation feature samples obtain m various sizes of translation filter templates, are indicated with α _ i;
The translation filter templateWherein, α indicates α _ i,It indicates in inverse Fu Leaf transformation, ()*Indicate conjugation,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is nuclear matrix K Generation sample Fourier transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.This In embodiment, according to template size size, Gaussian label yf_i is determined, it is 1 that tag hub, which is maximized, and surrounding numerical value is gradually Reduce, edge 0, numerical value is at Gaussian Profile.Heart position (48,65) in the target, according to search box size, (i.e. template size is big Bear) interception image block patch_for_train_i_1, extracts image block characteristics respectively, then adds Cosine Window and obtain feature Sample xf_for_train_i_1, size be respectively (24,24), (32,32), (40,40), (48,48), (56,56), here Cosine Window is equivalent to a weight matrix, assigns focus target region bigger weight, smaller closer to edge weights.Final root According to ridge regression training pattern, using feature samples and translation Gaussian label be calculated 5 ranks filter template α _ 1, α _ 2, α _ 3, α _ 4 and α _ 5.
It determines the output response output of multistage translation filter: enabling n=n+1, video sequence n-th frame is read, in the (n-1)th frame Target's center position (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_for_ Det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, utilizes translation Response output matrix response_i and peak value of response max_response_i is calculated in template α _ i.
It is described Indicate inverse Fourier transform,Indicate Fourier Transformation, ⊙ representing matrix element point multiplication operation symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.This reality It applies in example, n=1+1=2, in the 1st frame target's center position (48,65), according to translation filter template size window_sz_i Interception image block patch_for_det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_ to be detected Response output matrix response_i and peak value of response max_ is calculated using translation template α _ i in for_det_i_n Response_i, wherein max_response_1=0.7, max_response_2=0.7, max_response_3=0.6, Max_response_4=0.6, max_response_5=0.5.
Judge translate filter template whether meet peak value of response requirement: judge peak value of response max_response_i (i's Initial value is 1) whether to be greater than output response threshold value R set in advance, if so, determining i-stage translation translation filter response Peak value meet demand enables response output matrix response=response_i, peak value of response max_response=max_ Response_i is entered step: otherwise prediction target's center determines that i-stage translates filter response peak in the position of present frame Value is unsatisfactory for demand, and i=i+1 is entered step: the suitable response output of selection in multistage translation filter template.This implementation In example, output response threshold value R=0.8 set in advance judge max_response_1=0.7 < 0.8 in the present embodiment, judgement 1st grade of translation filter peak value of response is unsatisfactory for demand, and i=i+1=2 is entered step: selecting in multistage translation filter template Select suitable response output.
The suitable response output of selection in multistage translation filter template: compare the translation filter template of two ranks The peak value of response being calculated selects (i-1)-th grade of translation filter if max_response_i-1 > max_response_i The response of template exports, even response output matrix response=response_i-1, peak value of response max_response= max_response_i-1;Otherwise return step: judge to translate whether filter template meets peak value of response requirement.The present embodiment In, then return step: max_response_1=max_response_2 judges to translate whether filter template meets response peak Value requires.
In step: judge to translate whether filter template meets in peak value of response requirement, max_response_2=0.7 < 0.8, determine that the 2nd grade of translation filter peak value of response is unsatisfactory for demand, i=i+1=3 is entered step: translating filter in multistage Suitable response output is selected in template.
In step: in multistage translation filter template in the suitable response output of selection, max_response_2=0.7 > max_response_3=0.6 then selects the response output of the 2nd grade of translation filter template, even response output matrix Response=response_2, peak value of response max_response=max_response_2.
According to translation filter prediction target's center in the position of present frame: exporting peak value according to the response of translation filter Position of the max_response in response output matrix response, position (x of the prediction target's center in current n-th framen, yn).Return step: multistage translation filter template is determined.In the present embodiment, peak value is exported according to the response of translation filter Position of the max_response in response output matrix response, position (x of the prediction target's center in current 2nd frame2, y2)=(50,67), return step: determine multistage translation filter template.
Update translation filter after, read video sequence next frame, executed according to above-mentioned steps, until video last Frame.
The correlation filtering method for tracking target flow chart based on multistage template of the present embodiment, as shown in Figure 1.
The correlation filtering Target Tracking System based on multistage template of the present embodiment, comprising:
Video sequence;
Computer;
And
One or more programs, wherein one or more of programs are stored in the memory of computer, and by It is configured to be executed by the processor of the computer, described program includes:
Calculate the size module of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, ) and the estimation peak value v of target speed widthmaxThe full-size for calculating translation filter, with variable sizemax= (size_highmax,size_widthmax) indicate, according to the estimated acceleration a of target movement, target size and translation filter Full-size calculates the rank m of translation filter template, calculates filter templates at different levels according to translation filter template rank m Size is denoted as window_sz_i, wherein 1≤i≤m.
It is described
The rank of the translation filter template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance.
The window_sz_1=Tmin(high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.In the present embodiment, video frame rate 50fps is then calculated Step-size factor γ=16 set in advance, minimum step T set in advancemin= 1.2, then calculate the rank of translation filter templateAnd it is full Foot Then m=5, root The size of filter templates at different levels, respectively window_sz_1=T are calculated according to translation filter template rank mmin·(high, Width)=(24,24), window_sz_2=(Tmin+ (i-1) γ a) (high, width)=(1.2+16 × 0.025) × 20=(32,32), and so on, window_sz_3=(40,40), window_sz_4=(48,48), Window_sz_5=(56,56).
Determine multistage translation filter template module: heart position (x in the targetn,yn), according to m grades of translation filter templates Size window_sz_i, determine Gaussian label yf_i, interception image block patch_for_train_i_n, wherein n Indicate frame number;Extract image block characteristics respectively, addition Cosine Window obtains translation features sample xf_for_train_i_n, using flat It moves Gaussian label and translation feature samples obtains m various sizes of translation filter templates, indicated with α _ i;
The translation filter templateWherein, α indicates α _ i,It indicates in inverse Fu Leaf transformation, ()*Indicate conjugation,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is nuclear matrix K Generation sample Fourier transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.This In embodiment, according to template size size, Gaussian label yf_i is determined, it is 1 that tag hub, which is maximized, and surrounding numerical value is gradually Reduce, edge 0, numerical value is at Gaussian Profile.Heart position (48,65) in the target, according to search box size, (i.e. template size is big Bear) interception image block patch_for_train_i_1, extracts image block characteristics respectively, then adds Cosine Window and obtain feature Sample xf_for_train_i_1, size be respectively (24,24), (32,32), (40,40), (48,48), (56,56), here Cosine Window is equivalent to a weight matrix, assigns focus target region bigger weight, smaller closer to edge weights.Final root According to ridge regression training pattern, using feature samples and translation Gaussian label be calculated 5 ranks filter template α _ 1, α _ 2, α _ 3, α _ 4 and α _ 5.
It determines the output response output module of multistage translation filter: enabling n=n+1, read video sequence n-th frame, the N-1 frame target's center position (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_ For_det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, utilizes Response output matrix response_i and peak value of response max_response_i is calculated in translation template α _ i.
It is described Indicate inverse Fourier transform,Indicate Fourier Transformation, ⊙ representing matrix element point multiplication operation symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.This reality It applies in example, n=1+1=2, in the 1st frame target's center position (48,65), according to translation filter template size window_sz_i Interception image block patch_for_det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_ to be detected Response output matrix response_i and peak value of response max_ is calculated using translation template α _ i in for_det_i_n Response_i, wherein max_response_1=0.7, max_response_2=0.7, max_response_3=0.6, Max_response_4=0.6, max_response_5=0.5.
Judge to translate whether filter template meets peak value of response requirement module: judging peak value of response max_response_i Whether (initial value of i is 1) is greater than output response threshold value R set in advance, if so, determining i-stage translation translation filter Peak value of response meet demand enables response output matrix response=response_i, peak value of response max_response= Otherwise max_response_i determines i-stage translation filter response into prediction target's center in the position module of present frame Peak value is unsatisfactory for demand, i=i+1, into the suitable response output module of selection in multistage translation filter template.This implementation In example, output response threshold value R=0.8 set in advance judge max_response_1=0.7 < 0.8 in the present embodiment, judgement 1st grade of translation filter peak value of response is unsatisfactory for demand, and i=i+1=2 selects to close into multistage translation filter template Suitable response output module.
The suitable response output module of selection in multistage translation filter template: compare the translation filter of two ranks The peak value of response that formwork calculation obtains selects (i-1)-th grade of translation filter if max_response_i-1 > max_response_i The response of wave device template exports, even response output matrix response=response_i-1, peak value of response max_ Response=max_response_i-1;Otherwise it returns and judges whether translation filter template meets peak value of response and want modulus Block.In the present embodiment, max_response_1=max_response_2 is then returned and is judged to translate whether filter template meets Peak value of response requires module.
Judging to translate whether filter template meets in peak value of response requirement module, max_response_2=0.7 < 0.8, determine that the 2nd grade of translation filter peak value of response is unsatisfactory for demand, i=i+1=3 translates filter template into multistage The middle suitable response output module of selection.
It is being selected in suitable response output module in multistage translation filter template, max_response_2=0.7 > Max_response_3=0.6 then selects the response output of the 2nd grade of translation filter template, even response output matrix Response=response_2, peak value of response max_response=max_response_2.
According to translation filter prediction target's center present frame position module: according to translation filter response export Position of the peak value max_response in response output matrix response, predicts target's center in the position of current n-th frame (xn,yn).It returns and determines multistage translation filter template module.In the present embodiment, peak value is exported according to the response of translation filter Position of the max_response in response output matrix response, position (x of the prediction target's center in current 2nd frame2, y2)=(50,67), it returns and determines multistage translation filter template module.
Update translation filter after, read video sequence next frame, executed according to above-mentioned steps, until video last Frame.
The correlation filtering Target Tracking System structural schematic diagram based on multistage template of the present embodiment, as shown in Figure 2.
Certainly, those of ordinary skill in the art is it should be appreciated that above embodiments are intended merely to illustrate this hair It is bright, and be not intended as limitation of the invention, as long as within the scope of the invention, all to the variations of above embodiments, modification Protection scope of the present invention will be fallen into.

Claims (10)

1. a kind of correlation filtering method for tracking target based on multistage template, it is characterised in that the following steps are included:
Calculate the size of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, width) and The estimation peak value v of target speedmaxThe full-size for calculating translation filter, with variable sizemax=(size_highmax, size_widthmax) indicate, it is calculated according to the estimated acceleration a of target movement, target size and translation filter full-size The rank m for translating filter template calculates the size of filter templates at different levels according to translation filter template rank m, is denoted as Window_sz_i, wherein 1≤i≤m;
Determine multistage translation filter template: heart position (x in the targetn,yn), the size of filter template is translated according to m grades Window_sz_i determines Gaussian label yf_i, interception image block patch_for_train_i_n, and wherein n indicates frame Number;Image block characteristics are extracted respectively, and addition Cosine Window obtains translation features sample xf_for_train_i_n, utilizes translation Gauss Type label and translation feature samples obtain m various sizes of translation filter templates, are indicated with α _ i;
It determines the output response output of multistage translation filter: enabling n=n+1, video sequence n-th frame is read, in the (n-1)th frame target Center (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_for_det_i_ N extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, using translation template α _ Response output matrix response_i and peak value of response max_response_i is calculated in i;
Judge to translate whether filter template meets peak value of response requirement: judging whether peak value of response max_response_i is greater than Output response threshold value R set in advance, wherein the initial value of i is 1, if so, determining i-stage translation translation filter response peak It is worth meet demand, enables response output matrix response=response_i, peak value of response max_response=max_ Response_i is entered step: otherwise prediction target's center determines that i-stage translates filter response peak in the position of present frame Value is unsatisfactory for demand, and i=i+1 is entered step: the suitable response output of selection in multistage translation filter template;
The suitable response output of selection in multistage translation filter template: the translation filter template for comparing two ranks calculates Obtained peak value of response selects (i-1)-th grade of translation filter template if max_response_i-1 > max_response_i Response output, even response output matrix response=response_i-1, peak value of response max_response=max_ response_i-1;Otherwise return step: judge to translate whether filter template meets peak value of response requirement;
According to translation filter prediction target's center in the position of present frame: exporting peak value max_ according to the response of translation filter Position of the response in response output matrix response, position (x of the prediction target's center in current n-th framen,yn);It returns It returns step: determining multistage translation filter template.
2. the correlation filtering method for tracking target according to claim 1 based on multistage template, which is characterized in that described
3. the correlation filtering method for tracking target according to claim 1 based on multistage template, which is characterized in that described flat The rank of shift filter template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance;The window_sz_1=Tmin· (high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.
4. the correlation filtering method for tracking target according to claim 1 based on multistage template, which is characterized in that described flat Shift filter templateWherein, α indicates α _ i,Indicate inverse Fourier transform, ()*Table Show conjugation,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is Fu of the generation sample of nuclear matrix K In leaf transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.
5. the correlation filtering method for tracking target according to claim 1 based on multistage template, which is characterized in that described Indicate inverse Fourier transform,Indicate Fourier transformation, ⊙ indicates square Array element vegetarian refreshments multiplication symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.
6. a kind of correlation filtering Target Tracking System based on multistage template, characterized by comprising:
Video sequence;
Computer
And
One or more programs wherein one or more of programs are stored in the memory of computer, and are configured At the processor execution by the computer, described program includes:
Calculate the size module of multistage translation filter template: reading the 1st frame of video sequence, according to target size (high, ) and the estimation peak value v of target speed widthmaxThe full-size for calculating translation filter, with variable sizemax= (size_highmax,size_widthmax) indicate, according to the estimated acceleration a of target movement, target size and translation filter Full-size calculates the rank m of translation filter template, calculates filter templates at different levels according to translation filter template rank m Size is denoted as window_sz_i, wherein 1≤i≤m;
Determine multistage translation filter template module: heart position (x in the targetn,yn), the ruler of filter template is translated according to m grades Very little window_sz_i determines Gaussian label yf_i, interception image block patch_for_train_i_n, and wherein n is indicated Frame number;Image block characteristics are extracted respectively, and addition Cosine Window obtains translation features sample xf_for_train_i_n, high using translation This type label and translation feature samples obtain m various sizes of translation filter templates, are indicated with α _ i;
It determines the output response output module of multistage translation filter: enabling n=n+1, video sequence n-th frame is read, in the (n-1)th frame Target's center position (xn-1,yn-1), according to translation filter template size window_sz_i interception image block patch_for_ Det_i_n extracts characteristics of image and adds Cosine Window and obtain translation features sample zf_for_det_i_n to be detected, utilizes translation Response output matrix response_i and peak value of response max_response_i is calculated in template α _ i;
Judge to translate whether filter template meets peak value of response requirement: judging whether peak value of response max_response_i is greater than Output response threshold value R set in advance, wherein the initial value of i is 1, if so, determining i-stage translation translation filter response peak It is worth meet demand, enables response output matrix response=response_i, peak value of response max_response=max_ Response_i is entered step: otherwise prediction target's center determines that i-stage translates filter response peak in the position of present frame Value is unsatisfactory for demand, i=i+1, into the suitable response output module of selection in multistage translation filter template;
The suitable response output module of selection in multistage translation filter template: compare the translation filter template of two ranks The peak value of response being calculated selects (i-1)-th grade of translation filter if max_response_i-1 > max_response_i The response of template exports, even response output matrix response=response_i-1, peak value of response max_response= max_response_i-1;Otherwise it returns and judges to translate whether filter template meets peak value of response requirement module;
According to translation filter prediction target's center present frame position module: according to translation filter response export peak value Position of the max_response in response output matrix response, position (x of the prediction target's center in current n-th framen, yn), it returns and determines multistage translation filter template module.
7. the correlation filtering target following according to claim 6 based on multistage template, which is characterized in that described
8. the correlation filtering target following according to claim 6 based on multistage template, which is characterized in that the translation filter The rank of wave device template Wherein γ is step-size factor set in advance, TminIt is minimum step set in advance;The window_sz_1=Tmin· (high, width), wherein TminIt is minimum step set in advance, 2≤i≤m, wherein γ is step-size factor set in advance.
9. the correlation filtering target following according to claim 6 based on multistage template, which is characterized in that the translation filter Wave device templateWherein, α indicates α _ i,Indicate inverse Fourier transform, ()*It indicates altogether Yoke,Indicating the Fourier transformation of Gaussian label, λ is regularization parameter,It is the Fourier of the generation sample of nuclear matrix K Transformation, nuclear matrix K is a circular matrix, the generation sample of matrix the first behavior nuclear matrix.
10. the correlation filtering target following according to claim 6 based on multistage template, which is characterized in that described Indicate inverse Fourier transform,Indicate Fourier transformation, ⊙ indicates square Array element vegetarian refreshments multiplication symbol, kxzIndicate the generator matrix of the nuclear matrix of sample x and sample z to be detected.
CN201910270129.7A 2019-04-04 2019-04-04 Related filtering target tracking method and system based on multi-level template Active CN109993776B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910270129.7A CN109993776B (en) 2019-04-04 2019-04-04 Related filtering target tracking method and system based on multi-level template

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910270129.7A CN109993776B (en) 2019-04-04 2019-04-04 Related filtering target tracking method and system based on multi-level template

Publications (2)

Publication Number Publication Date
CN109993776A true CN109993776A (en) 2019-07-09
CN109993776B CN109993776B (en) 2020-12-11

Family

ID=67132366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910270129.7A Active CN109993776B (en) 2019-04-04 2019-04-04 Related filtering target tracking method and system based on multi-level template

Country Status (1)

Country Link
CN (1) CN109993776B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110490899A (en) * 2019-07-11 2019-11-22 东南大学 A kind of real-time detection method of the deformable construction machinery of combining target tracking

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846851A (en) * 2018-04-25 2018-11-20 河北工业职业技术学院 Motion target tracking method and terminal device
CN109087333A (en) * 2018-06-14 2018-12-25 中国科学院福建物质结构研究所 Target scale estimation method and its device based on correlation filter tracking algorithm
CN109543615A (en) * 2018-11-23 2019-03-29 长沙理工大学 A kind of double learning model method for tracking target based on multi-stage characteristics

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846851A (en) * 2018-04-25 2018-11-20 河北工业职业技术学院 Motion target tracking method and terminal device
CN109087333A (en) * 2018-06-14 2018-12-25 中国科学院福建物质结构研究所 Target scale estimation method and its device based on correlation filter tracking algorithm
CN109543615A (en) * 2018-11-23 2019-03-29 长沙理工大学 A kind of double learning model method for tracking target based on multi-stage characteristics

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CONGMEI ZHAO: "An algorithm of adaptive scale object tracking in occlusion", 《 AIP CONFERENCE PROCEEDINGS》 *
韩光星,李崇荣: "网络视频监控中运动目标跟踪方法改进", 《通信学报》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110490899A (en) * 2019-07-11 2019-11-22 东南大学 A kind of real-time detection method of the deformable construction machinery of combining target tracking

Also Published As

Publication number Publication date
CN109993776B (en) 2020-12-11

Similar Documents

Publication Publication Date Title
CN108734723B (en) Relevant filtering target tracking method based on adaptive weight joint learning
CN109087337B (en) Long-time target tracking method and system based on hierarchical convolution characteristics
Zhang et al. Learning scale-adaptive tight correlation filter for object tracking
CN112949408B (en) Real-time identification method and system for target fish passing through fish channel
CN112183537B (en) Model training method and device, and text region detection method and device
Zhang et al. Sparse learning-based correlation filter for robust tracking
CN111429481B (en) Target tracking method, device and terminal based on adaptive expression
CN111914878A (en) Feature point tracking training and tracking method and device, electronic equipment and storage medium
CN116665095B (en) Method and system for detecting motion ship, storage medium and electronic equipment
CN112990298A (en) Key point detection model training method, key point detection method and device
Zhao et al. Effective local and global search for fast long-term tracking
CN109993776A (en) A kind of correlation filtering method for tracking target and system based on multistage template
CN107566826B (en) The test method and device of grating image processor
CN111507135A (en) Face detection method and device, computer equipment and storage medium
KR20220103370A (en) Method and apparatus for detecting out-of-distribution using noise filter
Kuipers et al. Hard occlusions in visual object tracking
Jiang et al. MultiBSP: multi-branch and multi-scale perception object tracking framework based on siamese CNN
JP2018120402A (en) Event detecting device, event detecting method, and program
CN108154522A (en) Target tracking system
CN116433722A (en) Target tracking method, electronic device, storage medium, and program product
Huang et al. Visual attention learning and antiocclusion-based correlation filter for visual object tracking
CN113658217B (en) Self-adaptive target tracking method, device and storage medium
CN113706580B (en) Target tracking method, system, equipment and medium based on relevant filtering tracker
CN112614158B (en) Sampling frame self-adaptive multi-feature fusion online target tracking method
CN109993777A (en) A kind of method for tracking target and system based on double-template adaptive threshold

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant