CN110490206A - A kind of quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis - Google Patents

A kind of quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis Download PDF

Info

Publication number
CN110490206A
CN110490206A CN201910771452.2A CN201910771452A CN110490206A CN 110490206 A CN110490206 A CN 110490206A CN 201910771452 A CN201910771452 A CN 201910771452A CN 110490206 A CN110490206 A CN 110490206A
Authority
CN
China
Prior art keywords
matrix
super
pixel
low
follows
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910771452.2A
Other languages
Chinese (zh)
Other versions
CN110490206B (en
Inventor
刘明明
刘兵
仇文宁
付红
戚海永
李姗姗
孙伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Institute of Architectural Technology
Original Assignee
Jiangsu Institute of Architectural Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Institute of Architectural Technology filed Critical Jiangsu Institute of Architectural Technology
Priority to CN201910771452.2A priority Critical patent/CN110490206B/en
Publication of CN110490206A publication Critical patent/CN110490206A/en
Application granted granted Critical
Publication of CN110490206B publication Critical patent/CN110490206B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention proposes a kind of decomposition of low-rank matrix double factor and structural sparse matrix decomposition combined optimization models, and are applied to conspicuousness target detection.Algorithm is not decomposed merely with low-rank matrix double factor and alternating direction method (ADM) reduces time overhead, and is introduced into the sparse regularization of layering and portrays spatial relationship in sparse matrix between element.In addition, mentioned algorithm can Seamless integration- high level priori knowledge instruct matrix decomposition process.The experimental results showed that proposing that the detection performance of model and algorithm is better than the unsupervised conspicuousness algorithm of target detection of current main-stream, and there is lower time complexity.

Description

A kind of quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis
Technical field
The quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis that the present invention relates to a kind of, belongs to conspicuousness Target detection technique field.
Background technique
Conspicuousness target detection is a research hotspot of computer vision field in recent years.It is answered in field of image processing With extensive, such as Target detection and identification, image classification and retrieval, Target Segmentation etc..Conspicuousness target detection is intended to from background It positions and divides most significant foreground target, and export so-called " notable figure ", wherein the saliency value of each pixel indicates the picture Element belongs to the probability of well-marked target.
Many conspicuousness target detection models have been proposed in the past few decades.According to whether being known using priori Know, these models are generally divided into two classes: from bottom to top with top-down model.Bottom-to-top method utilizes such as color, The low-level image feature on intensity, gradient, edge and boundary constructs the notable figure corresponding to each feature space.Finally, combination These notable figures generate the target significant image based on predetermined convergence strategy.However, in fact, when there is image background in a jumble simultaneously And when having similar appearance with conspicuousness target, bottom-to-top method is difficult to detect accurate conspicuousness target.It is another Aspect, top-to-bottom method utilize the priori knowledge of human perception, such as context, semantic and position, to instruct notable figure Calculating.But the diversity of real world target limits the scalability of these methods.
Recently, some researchers combine top-down advanced priori with low-level image feature from bottom to top, And it is introduced into unified conspicuousness target detection frame.Wherein representative method is that low-rank matrix is restored to (LR) theory Applied to conspicuousness target detection.These existing conspicuousness target detection models based on LR usually assume that image can be divided Solution is two parts: the image section (such as background area) of high redundancy and sparse signal portion (such as foreground target area Domain).Therefore, the background of image is usually located in lower-dimensional subspace, can approximate representation be low-rank eigenmatrix, and marking area can It is considered as sparse noise or error, is represented by sparse matrix.Therefore, low-rank matrix, which restores (LR) model, to be naturally introduced into It is detected to well-marked target, for separating foreground target region and background area.Although these methods can calculate given image Notable figure simultaneously detects well-marked target, they generally assume that each element in sparse matrix is independent, and has ignored it Between potential relationship and structure, cause the notable figure generated to occur dissipating or imperfect phenomenon.Model does not still solve High computation complexity problem caused by singular value decomposition (SVD) is answered to limit the model in extensive eigenmatrix Expandability in.
Summary of the invention
To solve the above-mentioned problems, this paper presents a kind of new structured matrix low-rank representation models, and are applied to figure As conspicuousness target detection.By the inspiration of SMD method, the sparse induced norm based on index tree is introduced in model, is passed through Sparse matrix is constrained to characterize the structured relations between element.Meanwhile by converting primal problem on small Scale Matrixes Matrix nuclear norm minimization problem, reduce the time complexity of model.
The present invention be solve its technical problem adopt the following technical scheme that it is a kind of based on the quick of low-rank matrix dualistic analysis Conspicuousness algorithm of target detection, this method is decomposed using low-rank matrix double factor and alternating direction algorithm, dilute in combination with being layered It dredges regularization method and promotes conspicuousness target detection performance, specific detecting step is as follows:
Step 1: eigenmatrix generates;
Step 2: index tree constructs;
Step 3: advanced priori knowledge is integrated;
Step 4: structuring low-rank matrix is decomposed;
Step 5: notable figure generates.
Further, detailed process is as follows for the first step:
A, RGB color is extracted from original image and Gabor filter generates low-level features, indicated with D=53 dimensional vector;
B, input picture is excessively cut by using simple linear iteration cluster (SLIC) algorithm by non-overlap super-pixel collection P ={ P1, P2..., Pn, each super-pixel block PiBy D dimensional feature vector xiIt indicates;
C, by combining the corresponding feature vector of all super-pixel block, the eigenmatrix of original image is obtained
Further, detailed process is as follows for the second step:
Based on super-pixel collection P, structural information is encoded by being layered segmentation, in the same node for constraining index tree T Super-pixel generate identical saliency value;
In this step, neighbour's matrix of neighbouring super pixels pair is calculated using formula (5) first;Then, it is adjacent to merge space Super-pixel and layering sequence of partitions is obtained by control threshold;Finally, by the way that the super-pixel serial number after segmentation is distributed to Node in corresponding index tree ultimately generates hierarchical index tree construction;
Wherein, the index tree T is divided into three layers, whereinIndicate i-th layer of j-th of node.Based on predefined rope Draw tree, the sparse norm of weighting structuresization is defined as follows:
Wherein,Indicate corresponding nodeWeight,(| | indicate set element number) it is S Corresponding nodeSubmatrix, | | | |Indicate lNorm, wherein groupSaliency value byThe maximum of middle super-pixel is significant Value determines that the pixel in that is, same group is induced to generate similar saliency value;
Neighbour's matrix of neighbouring super pixels pair is calculated using formula (5), gives one group of super-pixel P={ P1, P2..., PN, closely Adjacent matrixIs defined as:
Wherein Ω indicates the set of neighbouring super pixels pair, Laplce's regular termsIs defined as:
Wherein siIt is the i-th column of S,Indicate Laplacian Matrix, i.e. Lg=D-W (D is degree matrix), Tr The mark of () representing matrix, Laplace regularization can be efficiently separated well-marked target and background by the column in smooth S.
Further, detailed process is as follows for the third step:
A, using position, color and contour connection priori, and them is merged and generates priori figure;
B, using formula (4), weight is generated by priori figureFor constraining sparse matrix S;
Wherein, detailed process is as follows by the step a: 1. by calculating distance of the pixel away from picture centre based on Gaussian Profile To obtain location-prior;Warm colour obtains color priori, such as red and yellow;2. being connected to the different figures of image boundary by calculating Boundary connection priori is introduced as the probability in region;3. being based on above three priori, they are multiplied to generate advanced priori figure;
Detailed process is as follows by the step b: for each super-pixel block Pi∈ P, corresponding advanced priori value are expressed as hi, [0,1] is normalized to indicate PiBelong to the probability of well-marked target, advanced priori can pass through different weightsIt is seamless It is integrated into structural sparse norm, as follows:
If a node of index tree is endowed lesser weightThen corresponding saliency value will be tended to larger.
Further, detailed process is as follows for the 4th step:
A, eigenmatrix X can be decomposed into structuring three matrixes, i.e. M, R and S, indicated are as follows:
X=MR+S=L+S;
B, optimal M is multiplied with R, calculates optimal low-rank matrix L;
Wherein, L=MR.
Calculating process is as follows:
Augmented Lagrangian Functions are constructed firstAre as follows:
Then alternative optimization M, R and S obtains optimal solution.
That is M*=QR (Z (Rk)T)(11)
Wherein QR () indicates QR decomposition operator.
With closed solutionWherein SVTμ(Z)=Udiag (max (σ-μ, 0)) VTIndicate odd Different value threshold operator, Indicate the singular value of Z, i.e. Z= Udiag(σ)VT
Wherein, λ=α/(2 μk),Indicate i-th layer of index tree of j-th of node weight,
Further, detailed process is as follows for the 5th step: the significant of each super-pixel is estimated using function Sal () Value:
Sal(Pi)=| | si||1 (26)
Wherein siIt is the i-th column of structured matrix S, and | | si||1Represent some super-pixel PiSaliency value.Therefore, Sal(Pi) indicate a possibility that i belongs to well-marked target.
Beneficial effects of the present invention are as follows:
1, the calculating cost for executing SVD is reduced by minimizing to small-scale matrix nuclear norm;
2, the advantages of inheriting structural sparse matrix decomposition;
3, the matrix low rank decomposition model proposed has optimal solution identical with primal problem;
4, the efficiency of saliency target detection is improved;
5, the present invention peomotes the specific neck such as image reorientation, image automatic cutting, compression of images and target identification The application in domain.
Detailed description of the invention
Fig. 1 is the index of the corresponding segmentation result of layering segmentation (b) based on super-pixel construction index tree (a) input picture Tree;
Fig. 2 is conspicuousness target detection frame;
Fig. 3 is each algorithm PR curve comparison (a) MSRA10K. (b) DUTOMRON. (c) iCoSeg. on five data sets (d)SOD.(e)ECSSD;
Fig. 4 is each algorithm F-measure curve comparison (a) MSRA10K. (b) DUTOMRON. (c) on five data sets iCoSeg.(d)SOD.(e)ECSSD;
Fig. 5 is that each algorithm generates notable figure visualization comparison;
Fig. 6 is to propose that algorithm and SMD algorithm generate notable figure visualization and compare.
Specific embodiment
Below in conjunction with attached drawing 1-6 and table, by specific embodiment, the present invention is further illustrated.
1, the structured matrix low-rank representation modelling based on dualistic analysis is as follows:
Assuming that input picture χ is divided into n non-overlap super-pixel P={ P1, P2..., Pn}.From each piece of PiMiddle extraction One D dimensional feature vectorThen x can be expressed as by n feature vectorBenefit With matrix low-rank representation model, eigenmatrix F can be decomposed into low-rank matrix L (i.e. background area) and sparse matrix S is (i.e. aobvious Write target).
It can be r by order to reduce the time complexity of nuclear norm minimization problem*Low-rank matrix be decomposed into two Compared with the product of minor matrix, i.e. L=MR,MTM=I, wherein r < < min (m, n) representing matrix L order The upper limit.According to | | L | |*=| | MR | |*=| | R | |*, therefore the nuclear norm by minimizing a more minor matrix R, propose one The new structured matrix low-rank representation model of kind is as follows:
S.t.F=L+S, L=MR, MTM=I,
Wherein | | R | |*The nuclear norm of R is represented,Structural sparse regular terms is indicated, for characterizing the sky of element in S Between relationship,Indicate Laplace regularization item, it is therefore an objective to preferably separate sparse matrix with low-rank matrix, α, β are just Balance parameters.
Unstructured sparse induced norm, such as l1Norm, l2Norm and l2,1Norm, to the progress of S sparse constraint by column, suddenly The spatial coherence and characteristic similarity between matrix element are omited.Structural sparse norm based on index tree can be more accurate Ground characterizes sparse matrix, generates the consistent notable figure of structure.It is actually the layering of image by the index tree that hierarchy nodes form Segmentation, wherein each node includes the index of one group of super-pixel, as shown in Figure 1, index tree T is divided into three layers, whereinIt indicates I-th layer of j-th of node.Based on predefined index tree, the sparse norm of weighting structuresization is defined as follows:
Wherein,Indicate corresponding nodeWeight,(| | indicate set element number) it is S Corresponding nodeSubmatrix, | | | |Indicate lNorm, wherein groupSaliency value byThe maximum of middle super-pixel is significant Value determines that the pixel in that is, same group is induced to generate similar saliency value.
Other than introducing structural sparse induced norm, the Laplace regularization assumed based on local invariant is also normal For promoting the performance of conspicuousness target detection.Give one group of super-pixel P={ P1, P2..., PN, neighbour's matrix Is defined as:
Wherein Ω indicates the set of neighbouring super pixels pair.Laplce's regular termsIs defined as:
Wherein siIt is the i-th column of S,Indicate Laplacian Matrix, i.e. Lg=D-W (D is degree matrix), Tr The mark of () representing matrix.Laplace regularization can be efficiently separated well-marked target and background by the column in smooth S.
Then, optimization problem (3) is indicated again are as follows:
S.t.F=L+S, L=MR, MTM=I.
Eliminate the variables L in optimization problem (7) first to accelerate the convergence of model, then introducing auxiliary variable H makes target Function is separable.Problem (7) is converted into following optimization problem:
S.t.F=MR+S, MTM=I, S=E.
Problem (8) can be solved by alternating direction method (ADM), Augmented Lagrangian FunctionsAre as follows:
WhereinWithIt is Lagrange multiplier, μ > 0 is the punishment parameter for violating linear restriction. ADM method is in each iteration respectively to M, R, E, S, Y1And Y2Alternative optimization, specific algorithm are described as follows:
(1) M is updatedk+1
Fixed remaining variables optimize M, and optimization subproblem is expressed as follows:
WhereinAbove formula is the least square problem for meeting orthogonality constraint, according to document [24], Mk+1With following closed solution:
Mk+1=QR (Z (Rk)T)(11)
Wherein QR () indicates QR decomposition operator.Therefore, Mk+1Actually spaceOrthogonal basis.
(2) R is updatedk+1
Solve the optimization subproblem of variable R is defined as:
The problem single order optimal condition is as follows:
WhereinIndicate Non-smooth surface convex function | | | |*Subdifferential, be expressed as
Wherein, Rk+1=U ∑ VTIndicate Rk+1SVD decompose.In view of constraint condition (Mk+1)TMk+1=I,
Optimal condition (13) conversion are as follows:
Optimal condition (15) is the single order optimal condition of following problems:
The problem can be solved by following lemma.
Lemma 1 is for any given matrixIts order is r, and μ > 0, then problem:
With closed solutionWherein SVTμ(Z)=Udiag (max (σ-μ, 0)) VTIndicate unusual It is worth threshold operator,Indicate the singular value of Z, i.e. Z=Udiag (σ)VT
According to lemma 1, the nuclear norm minimization problem in problem (16) can be solved by SVT operator:
Wherein,
(3) E is updatedk+1And Sk+1
Fixed Mk+1, Sk, Rk+1,WithValue, obtain following optimization subproblem:
To the E derivation in above formula, can obtain:
Update Ek+1Afterwards, S is updated in the case where its dependent variable is fixedk+1, obtaining following structural sparse indicates optimization Problem:
Wherein, λ=α/(2 μk),Indicate i-th layer of index tree of j-th of node weight,
Problem (20) can be solved by layering proximal end operator, as described in algorithm 2.The expression of index tree structural sparse is asked Solution:
2, the quick conspicuousness object detection method based on low-rank matrix dualistic analysis
The structuring low-rank matrix decomposition model of proposition is applied to conspicuousness target detection, this method utilizes low-rank matrix Double factor decomposes and alternating direction algorithm, promotes conspicuousness target detection performance in combination with sparse regularization method is layered.Inspection Structuring low-rank matrix decomposition method from bottom to top is not used only to reduce time overhead in method of determining and calculating, and combines from top to bottom Advanced priori knowledge promotes detection performance.Fig. 2 gives the conspicuousness target detection frame based on low-rank matrix dualistic analysis, life Entitled BSMD (Bi-factorization based Structured Matrix Decomposition).
Figure it is seen that the frame proposed includes five steps: eigenmatrix generates, index tree construction, advanced elder generation Knowledge Aggregation is tested, structuring low-rank matrix is decomposed and notable figure generates.
Step 1: eigenmatrix generates.Relatively for justice, we extract RGB color and Gabor filter from original image Wave device generates low-level features, is indicated with D=53 dimensional vector.It then, will by using simple linear iteration cluster (SLIC) algorithm Input picture is excessively cut into non-overlap super-pixel collection P={ P1, P2..., Pn}.Each super-pixel block PiBy D dimensional feature vector xiTable Show.Finally, obtaining the eigenmatrix of original image by combining the corresponding feature vector of all super-pixel block
Step 2: index tree construction.Based on super-pixel collection P, structural information is encoded by being layered segmentation, constrains T's Super-pixel in same node generates identical saliency value.In this step, neighbouring super pixels pair are calculated using formula (5) first Neighbour's matrix.Then, merge the adjacent super-pixel in space and pass through control thresholdTo obtain layering sequence of partitions.Finally, logical It crosses and the segmentation result of each granularity is distributed into the node in index tree to construct the hierarchical index tree construction of input picture.
Wherein, the index tree T is divided into three layers, whereinIndicate i-th layer of j-th of node.Based on predefined rope Draw tree, the sparse norm of weighting structuresization is defined as follows:
Wherein,Indicate corresponding nodeWeight,(| | indicate set element number) it is S Corresponding nodeSubmatrix, | | | |Indicate lNorm, wherein groupSaliency value byThe maximum of middle super-pixel is significant Value determines that the pixel in that is, same group is induced to generate similar saliency value;
Neighbour's matrix of neighbouring super pixels pair is calculated using formula (5), gives one group of super-pixel P={ P1, P2..., PN, closely Adjacent matrixIs defined as:
Wherein Ω indicates the set of neighbouring super pixels pair, Laplce's regular termsIs defined as:
Wherein siIt is the i-th column of S,Indicate Laplacian Matrix, i.e. Lg=D-W (D is degree matrix), Tr The mark of () representing matrix, Laplace regularization can be efficiently separated well-marked target and background by the column in smooth S.
Step 3: advanced priori knowledge is integrated.In this step, position, color and contour connection priori are used first, and It merges them and generates priori figure.Then, using formula (4), weight w_j^i is generated by priori figure, for constraining sparse matrix S;. Specifically, since the object near picture centre is more attractive to people, by calculating picture based on Gaussian Profile Distance of the element away from picture centre obtains location-prior.Warm colour, such as red and yellow are more attractive to human eye.By the side WLRR Method inspires, and is connected to the probability in the different images region of image boundary by calculating and is connected to priori to introduce boundary.Based on above-mentioned They are multiplied to generate advanced priori figure (referring to fig. 2) by three priori.For each super-pixel block Pi∈ P, it is corresponding advanced Priori value is expressed as hi, [0,1] is normalized to indicate PiBelong to the probability of well-marked target.Finally, advanced priori can lead to Cross different weightsIt is integrating seamlessly into structural sparse norm, as follows:
If a node of index tree is endowed lesser weightThen corresponding saliency value will be tended to larger.Together The super-pixel saliency value having the same for the structural sparse norm limitation Same Vertices that up-to-date style (4) defines.Therefore, by will be high Grade priori knowledge is integrated, can more effectively and accurately protrude well-marked target.
Step 4: structuring low-rank matrix is decomposed.Based on low-rank matrix decomposition model is proposed, eigenmatrix X can be with structuring It is decomposed into three matrixes, i.e. M, R and S.As shown in Fig. 2, the process of matrix decomposition is guided by advanced priori and index tree T.Therefore, By the way that optimal M is multiplied with R, optimal low-rank matrix L can be effectively calculated.Since structure is sparse and Laplace regularization It is united and applied in the frame of proposition, input feature vector matrix X can effectively be decomposed into low-rank matrix L and structure sparse matrix S.
A, eigenmatrix X can be decomposed into structuring three matrixes, i.e. M, R and S, indicated are as follows:
X=MR+S=L+S;
B, optimal M is multiplied with R, calculates optimal low-rank matrix L;
Wherein, L=MR.
Calculating process is as follows:
Augmented Lagrangian Functions are constructed firstAre as follows:
Then alternative optimization M, R and S obtains optimal solution.
That is M*=QR (Z (Rk)T)(11)
Wherein QR () indicates QR decomposition operator.
With closed solutionWherein SVTμ(Z)=Udiag (max (σ-μ, 0)) VTIndicate odd Different value threshold operator, Indicate the singular value of Z, i.e. Z= Udiag(σ)VT
Wherein, λ=α/(2 μk),Indicate i-th layer of index tree of j-th of node weight,
Step 5: notable figure generates.The saliency value of each super-pixel is estimated using function Sal ():
Sal(Pi)=| | si||1 (26)
Wherein siIt is the i-th column of structured matrix S, and | | si||1Represent some super-pixel PiSaliency value.Therefore, Sal(Pi) indicate a possibility that i belongs to well-marked target.
In order to assess the performance of BSMD, five algorithms best with performance are compared first in five benchmark datasets Compared with as shown in table 2 to table 6, best 3 results are highlighted with runic.
From table 2 into table 6 as can be seen that in most cases, the performance of BSMD, SMD and DRFI are better than other algorithms. On DUT-OMRON, DRFI is better than BSMD and SMD.But DRFI be it is a kind of need a large amount of training samples have measure of supervision. In contrast, BSMD and SMD is unsupervised model, does not need to be trained in advance.Therefore, they are more more flexible than DRFI.It is based on The performance of the BSMD of rand estination and the performance of SMD are almost the same.We are discussed below the BSMD performance under the conditions of fixed order.
1) test result of monocular logo image
Firstly, using the performance of single goal data set MSRA10K and DUT-OMRON test algorithms of different.PR and F- Measure curve is as shown in Figure 3 and Figure 4.Table 2 gives MAE, WF, the comparison of AUC and OR index.On MSRA10K, BSMD and SMD is better than other algorithms on WF, OR and MAE index, and DRFI and WLRR obtain first and second AUC index respectively.From PR curve in Fig. 3 A can be seen that BSMD and SMD and behave oneself best in all methods.According to F-measure curve (Fig. 4 A), It can be observed that BSMD and SMD have relatively good performance in very large range, and DRFI and WLRR are unstable, only in small model Preferable result is obtained in enclosing.
Each algorithm performance comparison on 2 MSRA10K data set of table
From table 3, it can be seen that all methods showed on DUT-OMRON it is bad, this is because DUT-OMRON data set Complexity and diversity.SMD and BSMD obtains first and second OR index result respectively.DRFI performance is best, this be because Multistage notable figure, which is merged, using supervision message for DRFI improves its robustness.The PR curve of the upper algorithms of different of DUT-OMRON is as schemed Shown in 3B, it can be seen that the precision of BSMD and SMD is more competitive under high recall rate.In addition, Fig. 4 B shows BSMD and SMD Better performance is obtained in the larger context, and especially when threshold value is higher, they are substantially better than WLRR and DRFI.
Each algorithm performance comparison on 3 DUT-OMRON data set of table
2) test result of multi-Target Image
The performance of algorithms of different, PR and F-measure curve on multi-Target Image iCoSeg and SOD further described below As shown in Fig. 3 C, 3D, 4C and 4D, MAE, WF, AUC and OR index result is set forth in table 4 and table 5.
On iCoSeg (table 4), SMD and BSMD come first and second respectively on MAE, WF and OR index, WLRR Achieve best AUC value.But as shown in Figure 3 C, the precision of WLRR is significantly lower than BSMD and SMD under low recall rate.It is special Not, Fig. 3 C and Fig. 4 C show that PR the and F-measure curve of BSMD and SMD is generally better than other methods.In addition, they F-measure value is insensitive to the selection of threshold value, keeps preferable performance in the larger context.
Each algorithm performance comparison on 4 iCoSeg data set of table
On SOD (as shown in table 5), the WF index result of BSMD is best, and OR index is number two.PR curve in Fig. 3 D The performance of display SMD and BSMD is slightly below DRFI, but is better than other algorithms.In the F-measure curve of Fig. 4 D, BSMD and SMD is better than WLRR within the scope of lower threshold, and they are performed better than than DRFI under higher thresholds range.
Each algorithm performance comparison on 5 SOD data set of table
3) test result of complex scene image
The method proposed is tested on complex scene data set ECSSD, the results are shown in Table 6.BSMD and SMD It is better than other algorithms on MAE, WF and OR index.The AUC value of BSMD is number two, but very small with the gap of optimum (0.005).It can be seen that BSMD from the PR curve in Fig. 3 E and SMD be substantially better than other methods.In Fig. 4 E, BSMD and SMD It obtains preferably interior on a large scale as a result, this demonstrates BSMD and SMD when handling the image of complex scene with very strong Robustness.
Each algorithm performance comparison on 6 ECSSD data set of table
4) notable figure visualization comparison
In order to compare the notable figure of distinct methods generation, the notable figure exported under different situations is selected to carry out visualization pair Than as shown in Figure 5.In monocular logo image, BSMD and SMD can more accurately detect well-marked target, and have less Divergent Phenomenon.This is because by structural sparse regularization and index tree method, they consider sparse matrix different lines it Between correlation, thus utmostly eliminate target diverging and incomplete phenomenon.In multi-Target Image, DSR, RBD, Some background areas are mistakenly divided into well-marked target by DRFI and WLRR, and DSR and SMD are lost part well-marked target. In contrast, the BSMD method proposed is successfully extracted multiple conspicuousness targets.For the image with complex scene, BSMD and SMD successfully detected conspicuousness target, and other unsupervised approaches can not detect conspicuousness target.For having The image of similar foreground and background, BSMD and SMD can preferably separate conspicuousness target with image background.
5) compared with SMD
From the discussion above it may be concluded that BSMD has the performance closely similar with SMD.Fixed order item is tested below BSMD performance under part, and be compared with SMD.In order to guarantee to compare with SMD justice, the quantity of super-pixel is remained set to 200.Fig. 4 gives the corresponding notable figure generated under different order values.As can be seen that with the increase of order r, the performance of BSMD becomes The performance of Xiang Yuyu SMD is consistent.When r is greater than 10, the performance of BSMD is become stable and almost identical as the performance of SMD.This It is the statistical data because according in [23], the order of the corresponding eigenmatrix of 90% image data set is less than 10.According to theorem 1, if using the suitable order upper bound, the optimal solution of model is identical as original optimization problem.Therefore, experimental result and theory analysis It is completely the same.
In table 2 to 6, by estimating that the suitable upper bound of order realizes the performance similar with SMD, these experiment synthesis show The BSMD method proposed is reasonable and feasible in practical applications.
Table 7 lists the runing time of BSMD and SMD on two larger data collections DUT-OMRON and MSRA10K.From table In as can be seen that with super-pixel quantity increase, BSMD runs faster, and the value of N is bigger, and BSMD is more effective.Therefore, BSMD Scalability be better than SMD, while it have the test index almost the same with SMD.
Table 7 proposes algorithm and SMD runing time comparison (minute)
The present invention proposes one kind by the way that the decomposition of low-rank matrix double factor is sparse with structure and manifold regularization combines The matrix low rank decomposition model of rapid structure, the model are solved using alternating direction method (ADM), with current low-rank matrix point Solution model is compared, and the method for proposition has lower time complexity.
The model proposed is applied to conspicuousness target detection, respectively to single goal, more in five benchmark datasets Target and complex scene image carry out experimental verification.The experimental results showed that the algorithm of proposition is better than existing in six kinds of performance indicators There is unsupervised conspicuousness algorithm of target detection.

Claims (6)

1. a kind of quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis, it is characterised in that:
This method is decomposed using low-rank matrix double factor and alternating direction algorithm, is promoted in combination with sparse regularization method is layered Conspicuousness target detection performance, specific detecting step are as follows:
Step 1: eigenmatrix generates;
Step 2: index tree constructs;
Step 3: advanced priori knowledge is integrated;
Step 4: structuring low-rank matrix is decomposed;
Step 5: notable figure generates.
2. the quick conspicuousness algorithm of target detection according to claim 1 based on low-rank matrix dualistic analysis, feature It is:
Detailed process is as follows for the first step:
A, RGB color is extracted from original image and Gabor filter generates low-level features, indicated with D=53 dimensional vector;
B, input picture is excessively cut into non-overlap super-pixel collection P={ P by using simple linear Iterative Clustering1, P2..., Pn, each super-pixel block PiBy D dimensional feature vector xiIt indicates;
C, by combining the corresponding feature vector of all super-pixel block, the eigenmatrix of original image is obtained
3. the quick conspicuousness algorithm of target detection according to claim 1 based on low-rank matrix dualistic analysis, feature It is:
Detailed process is as follows for the second step:
Based on super-pixel collection P, structural information is encoded by being layered segmentation, is constrained super in the same node of index tree T Pixel generates identical saliency value;
In this step, neighbour's matrix of neighbouring super pixels pair is calculated using formula (5) first;Then, merging space is adjacent surpasses Pixel simultaneously obtains layering sequence of partitions by control threshold;Finally, by the way that the super-pixel serial number after segmentation is distributed to correspondence Index tree in node, ultimately generate hierarchical index tree construction;
Wherein, in step a, index tree T is divided into d layers, whereinIndicate i-th layer of j-th of node.Based on predefined index Tree, the sparse norm of weighting structuresization are defined as follows:
Wherein,Indicate corresponding nodeWeight,(| | indicate set element number) it is S corresponding NodeSubmatrix, | | | |Indicate lNorm, wherein groupSaliency value byThe maximum saliency value of middle super-pixel is true Fixed, i.e., the pixel in same group is induced to generate similar saliency value;
Step b calculates neighbour's matrix of neighbouring super pixels pair using formula (5), gives one group of super-pixel P={ P1, P2..., PN, Neighbour's matrixIs defined as:
Wherein Ω indicates the set of neighbouring super pixels pair, Laplce's regular termsIs defined as:
Wherein siIt is the i-th column of S,Indicate Laplacian Matrix, i.e. Lg=D-W (D is degree matrix), Tr () is indicated The mark of matrix, Laplace regularization can be efficiently separated well-marked target and background by the column in smooth S.
4. the quick conspicuousness algorithm of target detection according to claim 1 based on low-rank matrix dualistic analysis, feature Be: detailed process is as follows for the third step:
A, using position, color and contour connection priori, and them is merged and generates priori figure;
B, using formula (4), weight is generated by priori figureFor constraining sparse matrix S;
Wherein, detailed process is as follows by the step a: 1. being obtained by calculating distance of the pixel away from picture centre based on Gaussian Profile Obtain location-prior;Warm colour obtains color priori, such as red and yellow;2. being connected to the different images area of image boundary by calculating The probability in domain is connected to priori to introduce boundary;3. being based on above three priori, they are multiplied to generate advanced priori figure;
Detailed process is as follows by the step b: for each super-pixel block Pi∈ P, corresponding advanced priori value are expressed as hi, by its [0,1] is normalized to indicate PiBelong to the probability of well-marked target, advanced priori can pass through different weightsSeamless integration- It is as follows into structural sparse norm:
If a node of index tree is endowed lesser weightThen corresponding saliency value will be tended to larger.
5. the quick conspicuousness algorithm of target detection according to claim 1 based on low-rank matrix dualistic analysis, feature Be: detailed process is as follows for the 4th step:
A, eigenmatrix X can be decomposed into structuring three matrixes, i.e. M, R and S, indicated are as follows:
X=MR+S=L+S;
B, optimal M is multiplied with R, calculates optimal low-rank matrix L;
Wherein, L=MR;
Calculating process is as follows:
Augmented Lagrangian Functions are constructed firstAre as follows:
Then alternative optimization M, R and S obtains optimal solution.
(1)
That is M*=QR (Z (Rk)T) (11)
Wherein QR () indicates QR decomposition operator;
(2)
With closed solutionWherein SVTμ(Z)=Udiag (max (σ-μ, 0)) VTIndicate singular value threshold It is worth operator, Indicate the singular value of Z, i.e. Z=Udiag (σ) VT
(3)
Wherein, λ=α/(2 μk),Indicate i-th layer of index tree of j-th of node weight,
6. the quick conspicuousness algorithm of target detection according to claim 1 based on low-rank matrix dualistic analysis, feature Be: detailed process is as follows for the 5th step: the saliency value of each super-pixel is estimated using function Sal ():
Sal(Pi)=| | si||1 (26)
Wherein siIt is the i-th column of structured matrix S, and | | si||1Represent some super-pixel PiSaliency value.Therefore, Sal (Pi) Indicate a possibility that i belongs to well-marked target.
CN201910771452.2A 2019-08-20 2019-08-20 Rapid saliency target detection algorithm based on low-rank matrix binary decomposition Active CN110490206B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910771452.2A CN110490206B (en) 2019-08-20 2019-08-20 Rapid saliency target detection algorithm based on low-rank matrix binary decomposition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910771452.2A CN110490206B (en) 2019-08-20 2019-08-20 Rapid saliency target detection algorithm based on low-rank matrix binary decomposition

Publications (2)

Publication Number Publication Date
CN110490206A true CN110490206A (en) 2019-11-22
CN110490206B CN110490206B (en) 2023-12-26

Family

ID=68552401

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910771452.2A Active CN110490206B (en) 2019-08-20 2019-08-20 Rapid saliency target detection algorithm based on low-rank matrix binary decomposition

Country Status (1)

Country Link
CN (1) CN110490206B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111191617A (en) * 2020-01-02 2020-05-22 武汉大学 Remote sensing scene classification method based on hierarchical structure

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105574534A (en) * 2015-12-17 2016-05-11 西安电子科技大学 Significant object detection method based on sparse subspace clustering and low-order expression
CN106778814A (en) * 2016-11-24 2017-05-31 郑州航空工业管理学院 A kind of method of the removal SAR image spot based on projection spectral clustering
CN109101978A (en) * 2018-07-06 2018-12-28 中国地质大学(武汉) Conspicuousness object detection method and system based on weighting low-rank matrix Restoration model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105574534A (en) * 2015-12-17 2016-05-11 西安电子科技大学 Significant object detection method based on sparse subspace clustering and low-order expression
CN106778814A (en) * 2016-11-24 2017-05-31 郑州航空工业管理学院 A kind of method of the removal SAR image spot based on projection spectral clustering
CN109101978A (en) * 2018-07-06 2018-12-28 中国地质大学(武汉) Conspicuousness object detection method and system based on weighting low-rank matrix Restoration model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HOUWEN PENG: ""Salient Object Detection via Low-Rank and Structured Sparse Matrix Decomposition"", 《PROCEEDINGS OF THE TWENTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》 *
HOUWEN PENG等: ""Salient Object Detection via Structured Matrix Decomposition"", 《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111191617A (en) * 2020-01-02 2020-05-22 武汉大学 Remote sensing scene classification method based on hierarchical structure
CN111191617B (en) * 2020-01-02 2022-02-01 武汉大学 Remote sensing scene classification method based on hierarchical structure

Also Published As

Publication number Publication date
CN110490206B (en) 2023-12-26

Similar Documents

Publication Publication Date Title
US20230386186A1 (en) Method and Apparatus for Employing Specialist Belief Propagation Networks
Gould et al. Decomposing a scene into geometric and semantically consistent regions
Donahue et al. Decaf: A deep convolutional activation feature for generic visual recognition
Lim et al. Sketch tokens: A learned mid-level representation for contour and object detection
Schwartz et al. Visual material traits: Recognizing per-pixel material context
Yu et al. Multi-attribute adaptive aggregation transformer for vehicle re-identification
Esmaeili et al. Fast-at: Fast automatic thumbnail generation using deep neural networks
CN106548149A (en) The recognition methods of the micro- facial expression image sequence of face in monitor video sequence
Asokan et al. Machine learning based image processing techniques for satellite image analysis-a survey
Zou et al. Chronological classification of ancient paintings using appearance and shape features
CN112232164A (en) Video classification method and device
CN113963032A (en) Twin network structure target tracking method fusing target re-identification
Tang et al. Deep saliency quality assessment network with joint metric
Rodrigues et al. Evaluating cluster detection algorithms and feature extraction techniques in automatic classification of fish species
Yuan et al. Few-shot scene classification with multi-attention deepemd network in remote sensing
Xiao et al. Pedestrian object detection with fusion of visual attention mechanism and semantic computation
Luo et al. View and clothing invariant gait recognition via 3D human semantic folding
CN110503113A (en) A kind of saliency object detection method restored based on low-rank matrix
Liu et al. Soft SVM and its application in video-object extraction
Zhang et al. Gathering effective information for real-time material recognition
CN110503049B (en) Satellite video vehicle number estimation method based on generation countermeasure network
CN110490206A (en) A kind of quick conspicuousness algorithm of target detection based on low-rank matrix dualistic analysis
Simon et al. DeepLumina: A method based on deep features and luminance information for color texture classification
Li et al. Spatial and temporal information fusion for human action recognition via Center Boundary Balancing Multimodal Classifier
Cho Content-based structural recognition for flower image classification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant