CN106203495B - Target tracking method based on sparse discriminant learning - Google Patents

Target tracking method based on sparse discriminant learning Download PDF

Info

Publication number
CN106203495B
CN106203495B CN201610519851.6A CN201610519851A CN106203495B CN 106203495 B CN106203495 B CN 106203495B CN 201610519851 A CN201610519851 A CN 201610519851A CN 106203495 B CN106203495 B CN 106203495B
Authority
CN
China
Prior art keywords
target
dictionary
sparse
template
tracking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610519851.6A
Other languages
Chinese (zh)
Other versions
CN106203495A (en
Inventor
詹瑾
肖政宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Polytechnic Normal University
Original Assignee
Guangdong Polytechnic Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Polytechnic Normal University filed Critical Guangdong Polytechnic Normal University
Priority to CN201610519851.6A priority Critical patent/CN106203495B/en
Publication of CN106203495A publication Critical patent/CN106203495A/en
Application granted granted Critical
Publication of CN106203495B publication Critical patent/CN106203495B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/755Deformable models or variational models, e.g. snakes or active contours
    • G06V10/7557Deformable models or variational models, e.g. snakes or active contours based on appearance, e.g. active appearance models [AAM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a target tracking method based on sparse discriminant learning, which comprises the following steps: sampling a target and a surrounding background of the target for a first frame, and constructing a target appearance model; extracting two-dimensional image features of the target appearance model and carrying out normalization processing to obtain an initial dictionary; a supervised discriminative dictionary learning method is introduced, classification error terms are added on the basis of error term reconstruction, and a discriminative dictionary is trained; solving a minimum reconstruction error term under the constraint of sparsity, and updating a dictionary and sparse codes by adopting an iterative exchange optimization strategy; and measuring the similarity between the samples by adopting the Euclidean distance, and taking the sample with the highest similarity as a tracking target. According to the embodiment of the invention, when the appearance model of the target is established, the surrounding background which has spatial correlation with the target is added as a clue template, and the change of the posture of the target is processed more robustly.

Description

Target tracking method based on sparse discriminant learning
Technical Field
The invention relates to the technical field of information, in particular to a target tracking method based on sparse discriminant learning
Background
Target tracking is one of the important basic problems in the field of computer vision research, and has very wide application in monitoring, motion estimation, human-computer interaction and the like. Many tracking algorithms appearing in recent years can better track target objects in certain scenes, such as particle filtering, Boosting algorithm and L1Tracking algorithms, etc. However, since a video is a time sequence image sequence in a complex scene, the complex scene includes illumination change, occlusion, motion deformation, background clutter, target scale change, and the like, constructing a self-adaptive target expression model to obtain a robust tracking algorithm is a research hotspot in the current tracking field and is also a difficult problem.
In recent tracking algorithms, some of the tracking problems are formulated as optimal template matching or maximum likelihood region estimation problems by establishing a target appearance model, these methods are called tracking algorithms based on a generation model, and how to construct an accurate appearance model to adapt to the change of the target appearance is the core of the algorithms. Some of the methods consider tracking as a binary classification problem, and consider tracking as separating foreground objects from background, and this method adopts many classifier algorithms, called discriminant model-based tracking algorithms. Such as a naive bayes classifier, boosting-based classifier, support vector machine, P-N learning classifier, etc. The accuracy of the classifier algorithm performs better in the case of large training examples, so online updating is usually used to obtain more training examples.
The technology for reconstructing signals by sparse representation has prominent expression in the applications of face recognition, image classification, image de-noising and deblurring and the like, and the tracking algorithm based on sparse representation also achieves certain progress. The algorithms use the first frame of target information to create a template set comprising target and noise, each sample can be represented as a sparse linear combination of a number of template column vectors, and the sparsity is solved by solving for L1And solving by regularizing a least square problem, and taking the candidate with the minimum reconstruction error as a tracking target. Due to the introduction of the noise template, the algorithm is more accurate and robust when the occlusion is processed, but when the target is deformed frequently, the base vectors of the template set cannot represent the deformed target. In addition, the template set is actually a dictionary without a learning process, so that the obtained sparse code cannot ensure the maximum sparsity, and the sample with the minimum reconstruction error is not necessarily the best candidate, so that the drift error is easy to accumulate.
Disclosure of Invention
The invention provides a target tracking method based on sparse discriminant learning, wherein when a target appearance model is established, a surrounding background which has spatial correlation with a target is added as a clue template so as to approximate the change of the target posture.
The invention provides a target tracking method based on sparse discriminant learning, which comprises the following steps:
constructing a target appearance model;
converting a two-dimensional image block into a one-dimensional vector for processing through feature extraction on the constructed target appearance model, and performing normalization processing on the one-dimensional vector;
a discrimination dictionary learning method based on supervision is introduced into the current tracking, and classification error items are added on the basis of reconstruction error items;
solving a minimum reconstruction error term under the constraint of sparsity, and updating a dictionary and sparse codes by adopting an iterative exchange optimization strategy;
and measuring the similarity between vectors by adopting Euclidean distance, and taking the sample with the highest similarity as a candidate tracking target.
The constructing of the target appearance model comprises:
and representing a target appearance model by using a dictionary, converting the tracking problem into an optimal sparse approximation problem, and modeling the target appearance as a template dictionary containing a target and noise.
The supervised discriminative dictionary learning method is introduced into the current tracking, and the addition of the classification error term on the basis of the reconstruction error term specifically comprises the following steps:
solving an objective function based on a dictionary and sparse coding, wherein the objective function is as follows:
Figure GDA0002251070550000021
wherein the first term is a reconstruction error term, Y ═ Y1,y2...yN]∈Rn×NIs a training sample, D is a set of overcomplete dictionary bases to learn, X ═ X1,x2...xN]∈Rk×NIs a sparse coding coefficient; the second term is the classification error term, which is expressed herein using the loss function of the linear classifier f (x) ═ Wx + b, W is the classifier parameter, hiIs yiLabeling the category of (1); ε is the threshold to control sparsity.
The solving of the minimum reconstruction error term under the sparsity constraint is specifically as follows:
converting the constraint condition into an L1 norm of sparse coding as a regular term to ensure sparsity, and converting into solving an unconstrained optimization problem:
Figure GDA0002251070550000031
wherein G ═ G1,g2...gN]∈R2×NIs a category label set with Y as foreground or background when Y isiIs the target sampling time gi=[1,0]TOtherwise gi=[0,1]Tα is a parameter for controlling the weight of class information, and λ is a regular parameter for controlling the sparsity of reconstruction.
The updating of the dictionary and the sparse coding by adopting the iterative exchange optimization strategy specifically comprises the following steps:
order to
Figure GDA0002251070550000032
The equation is converted into a common approximation problem of the minimum reconstruction error under sparsity constraint:
Figure GDA0002251070550000033
updating D and X by adopting an iterative alternation optimization strategy:
the method comprises the following steps: initializing Y, G, D, Y ', D';
step two: fixing D', adopting OMP to solve optimization model, updating sparse code X
Step three: fixing X, solving a least square problem with constraint, and updating a dictionary D';
step four: if the maximum iteration times are reached, the iteration is terminated; or the function absolute value difference after two adjacent iterations converges on a threshold value, and the iteration is terminated; otherwise, returning to the step two.
The method for measuring the similarity between vectors by using the euclidean distance specifically includes the following steps of:
Figure GDA0002251070550000034
where R (i) is the reconstruction error of the ith sample, diIs the dictionary column vector corresponding to the ith sample sparse coding maximum nonnegative value, d0Is the dictionary column vector corresponding to the initial target.
In the present invention, the spatial correlation of the target with the surroundings is taken into account when modeling the appearance of the target. Since the periphery of the object contains a part of the object information and the background information, when the object is deformed in a period of time, the object can be approximately represented. By using a supervised discriminative dictionary learning method, an over-complete dictionary which gives consideration to both reconstruction and discriminative performance is solved through iterative optimization, and the obtained sparse codes are very sparse and have discriminative performance. And on a candidate target searching algorithm, constructing similarity according to the distribution of the maximum non-negative values of the sample sparse codes and the minimum reconstruction errors to determine candidate targets. When the appearance model of the target is established, a surrounding background which has spatial correlation with the target is added as a clue template so as to approximate the change of the target posture. Through the dictionary of the discrimination sparse learning, the obtained sparse code has higher sparsity and discrimination. And obtaining the estimated candidate tracking target by utilizing the distribution of the dictionary basis vectors corresponding to the maximum non-negative value in the sparse coding and the minimum reconstruction error of the sample. Experimental results show that the method can better track the target under the conditions of processing shielding, illumination change, posture change, background disorder and the like.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flowchart of a target tracking method based on sparse discriminant learning according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a target tracking method based on sparse discriminant learning in the embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention considers the spatial correlation of the target and the surrounding when the appearance of the target is modeled. Since the periphery of the object contains a part of the object information and the background information, when the object is deformed in a period of time, the object can be approximately represented. In addition, by using a supervised discriminative dictionary learning method, an over-complete dictionary which has both reconstruction and discriminative performance is solved through iterative optimization, and the obtained sparse code is very sparse and has discriminative performance. And on a candidate target searching algorithm, constructing similarity according to the distribution of the maximum non-negative values of the sample sparse codes and the minimum reconstruction errors to determine candidate targets.
Fig. 1 shows a flowchart of a target tracking method based on discriminant sparse template matching in an embodiment of the present invention, including the following steps:
s101, constructing a target appearance model;
in sparse representation theory, a signal y ∈ RnIt can be optimally represented by m basis vectors of a dictionary D:
Figure GDA0002251070550000051
where Λ represents the selected radix vector index set and di represents the column vectors in dictionary D. If all basis vectors in the dictionary form a linear correlation set and the dictionary is redundant, there will be an infinite number of representations of zero error for the target signal, but the sparsest of them is found, i.e. the following optimization problem is solved:
min||x||0s.t.y=Dx
wherein D represents a matrix formed by taking dictionary atoms (basis vectors) as columns, and 0 norm | | · | | purple0Representing the number of non-zero elements in the vector. If the dictionary D is fixed, then this is a least squares problem; if the optimal dictionary base is to be selected, then this is a combination problem, which is an NP-hard problem. The existing ceramic huchenxuan et al prove that L is under the condition that D meets the UUP principle0The norm may use L1Norm substitution, so the problem becomes a convex optimization problem. Considering the minimum reconstruction error approximation of the target signal under the sparsity constraint, converting the minimum reconstruction error approximation into the following optimization problem:
Figure GDA0002251070550000052
a dictionary is used for representing an appearance model of a target, the tracking problem is converted into an optimal sparse approximation problem, and how to learn an effective dictionary to process various target changes is very important. Usually the appearance of an object is modeled as a template dictionary containing objects and noise, but this does not approximate an object that has been deformed. The background information around the object does not change much in a short time when the object is deformed, that is, the spatial information of the object and the background is maintained, and thus can be used to approximate the object. Thus here the template dictionary is extended into three parts: a target template T, a cue template B, and a noise template, wherein the cue template is sampled around the target. The tracking problem is finding the sample in the subsequent frame that best matches the target template or the cue template because the basis vectors of the template dictionary of the same class exist in a linear subspace, and when there are enough basis vectors, the template dictionary can represent two cases: first, when the target is not deformed, the candidate target sample y1 of the subsequent frame exists approximately in the subspace spanned by the column vectors of T1Tx + e, the non-target sample y2 exists approximately in the sub-space spanned by the column vector of B, y2X + e, e is used to denote occlusion and noise, and is a small noise figure. Second, when the target is partially deformed, both the candidate target sample y1 and the non-target sample y2 are approximately represented byB column vector, but the target candidate sample has a very small noise coefficient, while the non-target sample has a very large noise coefficient, y ≈ Bx + e. Combining the above two equations, a sampled signal can be expressed as:
Figure GDA0002251070550000061
wherein T ═ T1,t2...tp]∈Rn×pIs the target template, B ═ B1,b2...bq]∈Rn×qIs a clue template, I is an element of Rn ×nIs a noise template, is an n-order unit diagonal matrix. n is the characteristic dimension of a signal.
D=[d1,d2...dk]∈Rn×k(k ═ p + q + n) is an initial template dictionary, used for dictionary learning. x is the corresponding sparse coding of y under dictionary D.
S102, converting a two-dimensional image block into a one-dimensional vector for processing through feature extraction on the constructed target appearance model, and performing normalization processing on the one-dimensional vector;
in this embodiment, the template is sampled as image blocks of equal size, two-dimensional image blocks are converted into one-dimensional vectors through feature extraction for processing, and the one-dimensional vectors are normalized. After normalization, the energy of the one-dimensional vectors is equal, and the vectors can be regarded as vectors with the starting point in different directions of the origin. The target template T is sampled around the target frame given in the first frame, let l0(c) Representing the position of the center point of the target, the sampling center point of the target template falls on0(c) As a circle center, within a range of radius r0, { T | | | l0(t)-l0(c)||<r0}. The thread template falls at the center point by l0(c) Randomly sampling in a circular range as a center, wherein B is { B | r1<||l0(b)-l0(c)||<r2Are and r0<r1<r2
S103, introducing a discrimination dictionary learning method based on supervision in the current tracking, and adding classification error terms on the basis of error term reconstruction;
the dictionary learning aims at finding a new feature space with strong expression capacity, and the learned dictionary has very important influence on the performance of sparse coding. In terms of tracking problems, although a dictionary in a non-learning process or a non-supervised dictionary learning can achieve certain tracking accuracy, in order to obtain a better effect, classification information of an object and a background needs to be combined, and the method is called a supervised discriminative dictionary learning method, wherein discriminative performance is represented as: the dictionary base atoms selected by the optimal sparse coding must be linearly independent. A supervised discriminative dictionary learning method is applied to the tracking problem, and classification error terms are added on the basis of reconstruction error terms. Therefore, solving the dictionary and sparse coding objective function is:
Figure GDA0002251070550000071
where the first term is the reconstruction error term, Y ═ Y1,y2...yN]∈Rn×NIs a training sample and D is a set of overcomplete dictionary bases to be learned. X ═ X1,x2...xN]∈Rk×NAre sparse coding coefficients. The second term is the classification error term, which is expressed herein using the loss function of the linear classifier f (x) ═ Wx + b, W is the classifier parameter, hiIs yiThe category label of (1). ε is the threshold to control sparsity.
S104, solving a minimum reconstruction error term under sparsity constraint, and updating a dictionary and sparse codes by adopting an iterative exchange optimization strategy;
converting the constraint condition of equation 2 into an L1 norm of sparse coding as a regular term to ensure sparsity, namely converting into solving an unconstrained optimization problem:
Figure GDA0002251070550000072
wherein G ═ G1,g2...gN]∈R2×NIs Y is frontClass label set of scene or background when yiIs the target sampling time gi=[1,0]TOtherwise gi=[0,1]Tα is a parameter that controls the class information weight and λ is a regularization parameter that controls the sparsity of the reconstruction.
Equation (3) is appropriately transformed for solution. Order to
Figure GDA0002251070550000073
This transforms equation 3 into the common least reconstruction error approximation under sparsity constraints:
Figure GDA0002251070550000074
updating D and X by adopting an iterative alternation optimization strategy:
(1) initializing Y, G, D, Y ', D';
(2) fixing D', solving an optimization model by adopting OMP, and updating the sparse code X;
(3) fixing X, solving a least square problem with constraint, and updating a dictionary D';
(4) and if the maximum iteration times are reached, terminating the iteration. Or the absolute value difference of the function after two adjacent iterations converges on the threshold value, and the iteration is terminated. Otherwise, return to (2).
And S105, measuring the similarity between vectors by adopting the Euclidean distance, and taking the sample with the highest similarity as a candidate tracking target.
Typically, the change of a video object between previous and subsequent frames is correlated, so that in the current frame, the samples obtained by sampling around the center point of the object in the previous frame are the estimated object regions. After the discriminant dictionary is learned in the previous section, sparse coding and reconstruction errors of the sampling samples can be solved. The row number of the maximum non-negative value in the sparse coding corresponds to the column basis vector of the dictionary, namely the template vector which can be most approximate to the sample. The vectors discussed later herein are all referred to as dictionary basis vectors.
Fig. 2 shows a schematic diagram of a target tracking method based on discriminant sparse template matching in an embodiment of the present invention, where, of all samples, the sample with the smallest reconstruction error is not necessarily the best tracking target. Since the basis vectors corresponding to the target template exist in a linear subspace, the best candidate target should also be in this subspace and closest to the initial target vector. Here, euclidean distance is used to measure the similarity between vectors, and the sample with the highest similarity is the candidate tracking target. The number of samples meeting the condition is often large, and then the sample with the minimum reconstruction error is selected as the tracking target sample, that is, the candidate tracking target is the sample with the minimum reconstruction error under the sparse coding distribution constraint. The best tracking target sample obtained by the method is as follows:
Figure GDA0002251070550000081
where R (i) is the reconstruction error of the ith sample, diIs the dictionary column vector corresponding to the ith sample sparse coding maximum nonnegative value, d0Is the dictionary column vector corresponding to the initial target.
In summary, the spatial correlation of the target with the surroundings is taken into account when modeling the appearance of the target. Since the periphery of the object contains a part of the object information and the background information, when the object is deformed in a period of time, the object can be approximately represented. By using a supervised discriminative dictionary learning method, an over-complete dictionary which gives consideration to both reconstruction and discriminative performance is solved through iterative optimization, and the obtained sparse codes are very sparse and have discriminative performance. And on a candidate target searching algorithm, constructing similarity according to the distribution of the maximum non-negative values of the sample sparse codes and the minimum reconstruction errors to determine candidate targets. When the appearance model of the target is established, a surrounding background which has spatial correlation with the target is added as a clue template so as to approximate the change of the target posture. Through the dictionary of the discrimination sparse learning, the obtained sparse code has higher sparsity and discrimination. And obtaining the estimated candidate tracking target by utilizing the distribution of the dictionary basis vectors corresponding to the maximum non-negative value in the sparse coding and the minimum reconstruction error of the sample. Experimental results show that the method can better track the target under the conditions of processing shielding, illumination change, posture change, background disorder and the like.
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by instructions associated with hardware via a program, which may be stored in a computer-readable storage medium, and the storage medium may include: a Read Only Memory (ROM), a Random Access Memory (RAM), a magnetic or optical disk, or the like.
The target tracking method based on sparse discriminant learning provided by the embodiment of the present invention is described in detail above, and a specific example is applied in the present document to explain the principle and the implementation of the present invention, and the description of the above embodiment is only used to help understanding the method and the core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (5)

1. A target tracking method based on sparse discriminant learning is characterized by comprising the following steps:
constructing a target appearance model;
converting a two-dimensional image block into a one-dimensional vector for processing through feature extraction on the constructed target appearance model, and performing normalization processing on the one-dimensional vector;
a discrimination dictionary learning method based on supervision is introduced into the current tracking, and classification error items are added on the basis of reconstruction error items;
solving a minimum reconstruction error term under the constraint of sparsity, and updating a dictionary and sparse codes by adopting an iterative exchange optimization strategy;
measuring the similarity between vectors by adopting Euclidean distance, and taking a sample with the highest similarity as a candidate tracking target;
the method for learning the discrimination dictionary based on supervision is introduced into the current tracking, and the method for adding the classification error items on the basis of the reconstruction error items specifically comprises the following steps:
solving an objective function based on a dictionary and sparse coding, wherein the objective function is as follows:
Figure FDA0002251070540000011
wherein the first term is a reconstruction error term, Y ═ Y1,y2...yN]∈Rn×NIs a training sample, D is a set of overcomplete dictionary bases to learn, X ═ X1,x2...xN]∈Rk×NIs a sparse coding coefficient; the second term is a classification error term, expressed as a loss function of the linear classifier f (x) ═ Wx + b, W is the classifier parameter, hiIs yiLabeling the category of (1); ε is the threshold controlling sparsity; l { } is the classification loss function;
the method comprises the following steps of converting a two-dimensional image block into a one-dimensional vector for processing through feature extraction on a constructed target appearance model, and performing normalization processing on the one-dimensional vector, wherein the normalization processing comprises the following steps:
sampling a template into image blocks with equal size, converting two-dimensional image blocks into one-dimensional vectors through feature extraction, processing, and normalizing the one-dimensional vectors; after normalization processing, the energy of the one-dimensional vectors is equal and can be regarded as the vectors of the starting point in different directions of the origin;
the target template T is sampled around the target frame given in the first frame, let l0(c) Representing the position of the center point of the target, the sampling center point of the target template falls on0(c) As a circle center and has a radius r0In the range of (1), T ═ T | | | l0(t)-l0(c)||<r0};l0(T) represents the sampling center point position of each sample in the target template T; t represents each sample in the set T;
the thread template falls at the center point by l0(c) Randomly sampling in a circular range as a center, wherein a clue template B is { B | r ═1<||l0(b)-l0(c)||<r2Are and r0<r1<r2(ii) a The annular radius is r1And r2And r is1<r2;l0(b) Representing the location of the sampling center point for each sample in the cue template.
2. The sparse discriminant learning-based target tracking method of claim 1, wherein the constructing the target appearance model comprises:
and representing a target appearance model by using a dictionary, converting the tracking problem into an optimal sparse approximation problem, wherein the target appearance model is a template dictionary comprising a target template T, a clue template B and a noise template.
3. The sparse discriminant learning-based target tracking method of claim 2, wherein the solving of the minimum reconstruction error term under the sparsity constraint is specifically:
converting constraints to L to be sparsely encoded1The norm is used as a regular term to ensure sparsity and is converted into a solution unconstrained optimization problem:
Figure FDA0002251070540000021
wherein G ═ G1,g2...gN]∈R2×NIs a category label set with Y as foreground or background when Y isiIs the target sampling time gi=[1,0]TOtherwise gi=[0,1]Tα is a parameter for controlling the weight of class information, and λ is a regular parameter for controlling the sparsity of reconstruction.
4. The sparse discriminant learning-based target tracking method of claim 3, wherein the updating of the dictionary and the sparse coding by the iterative exchange optimization strategy specifically comprises:
order to
Figure FDA0002251070540000022
The equation is converted into a common approximation problem of the minimum reconstruction error under sparsity constraint:
Figure FDA0002251070540000023
updating D and X by adopting an iterative alternation optimization strategy:
the method comprises the following steps: initializing Y, G, D, Y ', D';
step two: fixing D', solving an optimization model by adopting OMP, and updating the sparse code X;
step three: fixing X, solving a least square problem with constraint, and updating a dictionary D';
step four: if the maximum iteration times are reached, the iteration is terminated; or the function absolute value difference after two adjacent iterations converges on a threshold value, and the iteration is terminated; otherwise, returning to the step two.
5. The target tracking method based on sparse discriminant learning as claimed in claim 4, wherein the euclidean distance is used to measure the similarity between vectors, and the sample with the highest similarity is specifically used as the candidate tracking target:
Figure FDA0002251070540000031
where R (i) is the reconstruction error of the ith sample, diIs the dictionary column vector corresponding to the ith sample sparse coding maximum nonnegative value, d0Is the dictionary column vector corresponding to the initial target.
CN201610519851.6A 2016-07-01 2016-07-01 Target tracking method based on sparse discriminant learning Active CN106203495B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610519851.6A CN106203495B (en) 2016-07-01 2016-07-01 Target tracking method based on sparse discriminant learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610519851.6A CN106203495B (en) 2016-07-01 2016-07-01 Target tracking method based on sparse discriminant learning

Publications (2)

Publication Number Publication Date
CN106203495A CN106203495A (en) 2016-12-07
CN106203495B true CN106203495B (en) 2020-03-17

Family

ID=57464691

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610519851.6A Active CN106203495B (en) 2016-07-01 2016-07-01 Target tracking method based on sparse discriminant learning

Country Status (1)

Country Link
CN (1) CN106203495B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106815599B (en) * 2016-12-16 2020-12-22 合肥工业大学 Universal discriminant sparse coding dictionary learning method in image classification
CN107025444A (en) * 2017-04-08 2017-08-08 华南理工大学 Piecemeal collaboration represents that embedded nuclear sparse expression blocks face identification method and device
CN107844739B (en) * 2017-07-27 2020-09-04 电子科技大学 Robust target tracking method based on self-adaptive simultaneous sparse representation
CN108280451B (en) * 2018-01-19 2020-12-29 北京市商汤科技开发有限公司 Semantic segmentation and network training method and device, equipment and medium
CN110580488B (en) * 2018-06-08 2022-04-01 中南大学 Multi-working-condition industrial monitoring method, device, equipment and medium based on dictionary learning
CN110895705B (en) * 2018-09-13 2024-05-14 富士通株式会社 Abnormal sample detection device, training device and training method thereof
CN109509180B (en) * 2018-10-24 2021-12-10 东华大学 Metal button flaw detection method based on machine vision
CN109584270B (en) * 2018-11-13 2023-05-30 大连大学 Visual tracking method based on discriminant dictionary learning
CN109523587A (en) * 2018-11-20 2019-03-26 广东技术师范学院 The method for tracking target and system learnt based on multiple features and self-adapting dictionary
CN109685830B (en) * 2018-12-20 2021-06-15 浙江大华技术股份有限公司 Target tracking method, device and equipment and computer storage medium
CN109714602B (en) * 2018-12-29 2022-11-01 武汉大学 Unmanned aerial vehicle video compression method based on background template and sparse coding
CN109712138B (en) * 2018-12-29 2020-09-08 苏州大学 Image segmentation method based on appearance dictionary learning and shape sparse representation
CN110111338B (en) * 2019-04-24 2023-03-31 广东技术师范大学 Visual tracking method based on superpixel space-time saliency segmentation
CN111862167B (en) * 2020-07-21 2022-05-10 厦门大学 Rapid robust target tracking method based on sparse compact correlation filter
CN113985733B (en) * 2021-10-26 2023-11-17 云南电网有限责任公司电力科学研究院 Power distribution network fault identification method based on self-adaptive probability learning

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104091151A (en) * 2014-06-30 2014-10-08 南京信息工程大学 Vehicle identification method based on Gabor feature extraction and sparse representation
CN104361609A (en) * 2014-11-18 2015-02-18 电子科技大学 Target tracking method based on sparse representation

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7369682B2 (en) * 2004-07-09 2008-05-06 Honda Motor Co., Ltd. Adaptive discriminative generative model and application to visual tracking

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104091151A (en) * 2014-06-30 2014-10-08 南京信息工程大学 Vehicle identification method based on Gabor feature extraction and sparse representation
CN104361609A (en) * 2014-11-18 2015-02-18 电子科技大学 Target tracking method based on sparse representation

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Robust tracking via discriminative sparse feature selection;Jin Zhan 等;《The Visual Computer: International Journal of Computer Graphics archive》;20150531;第31卷(第5期);第575-580页 *
Robust Visual Tracking via Rank-Constrained Sparse Learning;Behzad Bozorgtabar 等;《2014 International Conference on Digital Image Computing: Techniques and Applications》;20141127;第1-7页 *
加权局部特征结合判别式字典的目标跟踪;王飞 等;《中国图像图形学报》;20140930;第19卷(第9期);第1316-1323页 *

Also Published As

Publication number Publication date
CN106203495A (en) 2016-12-07

Similar Documents

Publication Publication Date Title
CN106203495B (en) Target tracking method based on sparse discriminant learning
Patel et al. Kernel sparse subspace clustering
KR102221118B1 (en) Method for extracting feature of image to recognize object
Kishore et al. Indian classical dance action identification and classification with convolutional neural networks
Hong et al. Image-based three-dimensional human pose recovery by multiview locality-sensitive sparse retrieval
Patel et al. Latent space sparse and low-rank subspace clustering
KR102010378B1 (en) Device and method to extract feature of image including object
CN104820696B (en) A kind of large-scale image search method based on multi-tag least square hash algorithm
JP5591178B2 (en) Method for classifying objects in test images
Zhang et al. Robust embedded deep k-means clustering
CN110889865A (en) Video target tracking method based on local weighted sparse feature selection
Elmadany et al. Multiview emotion recognition via multi-set locality preserving canonical correlation analysis
Liu et al. Spatial neighborhood-constrained linear coding for visual object tracking
Idan et al. A new separable moments based on Tchebichef-Krawtchouk polynomials
Shao et al. Action recognition using correlogram of body poses and spectral regression
Lee et al. Reinforced adaboost learning for object detection with local pattern representations
CN111753583A (en) Identification method and device
Yao A compressed deep convolutional neural networks for face recognition
Stutz Neural codes for image retrieval
Zhang et al. Weighted score-level feature fusion based on Dempster–Shafer evidence theory for action recognition
CN115169436A (en) Data dimension reduction method based on fuzzy local discriminant analysis
Zhu et al. Boosted cross-domain dictionary learning for visual categorization
Balachandran et al. Machine learning based video segmentation of moving scene by motion index using IO detector and shot segmentation
CN113936196A (en) Robust data dimension reduction method based on low-rank Laplace graph learning
Miao et al. Manifold regularized slow feature analysis for dynamic texture recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CB02 Change of applicant information

Address after: 293 No. 510000 Guangdong city of Guangzhou province Tianhe District Zhongshan Shipai Road

Applicant after: Guangdong Normal University of Technology

Address before: 293 No. 510665 Guangdong city of Guangzhou province Tianhe District Zhongshan Shipai Road

Applicant before: Guangdong Technical Normal College

CB02 Change of applicant information