CN111666967B - Image classification method based on incoherence combined dictionary learning - Google Patents
- Publication number
- CN111666967B (application CN202010316589.1A)
- Authority
- CN
- China
- Prior art keywords
- dictionary
- class
- shared
- training
- updating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/55—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/28—Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Image Analysis (AREA)
Abstract
An image classification method based on incoherent joint dictionary learning trains a class dictionary for each class of images and a shared dictionary for all images. The low-rank property of the shared dictionary is enforced to keep it from absorbing class-specific characteristics, and a coherence constraint term is added between the low-rank shared dictionary and the class dictionaries to prevent shared features from appearing in the class dictionaries. The method increases the discriminative power of the trained dictionary and improves its sparse representation capability, thereby improving the accuracy of image classification.
Description
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to an image classification method based on incoherence combined dictionary learning.
Background
In recent years, sparse representation has achieved great success in image processing tasks such as image classification, image denoising, and compressed sensing; it represents a signal as a linear combination of a few atoms from a redundant dictionary. In the sparse representation process, the trained dictionary largely determines the quality of the sparse representation.
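To make the sparse representation idea concrete, the following minimal NumPy sketch codes a signal over a redundant dictionary with iterative soft-thresholding (ISTA); the dictionary, signal, and all parameter values here are illustrative assumptions, not part of the patented method:

```python
import numpy as np

def soft_threshold(v, t):
    # Element-wise soft-thresholding: the proximal operator of the l1 norm.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def sparse_code_ista(y, D, lam=0.05, n_iter=200):
    # Solve min_x 0.5*||y - D x||^2 + lam*||x||_1 by ISTA.
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ x - y)           # gradient of the smooth term
        x = soft_threshold(x - grad / L, lam / L)
    return x

rng = np.random.default_rng(0)
D = rng.standard_normal((20, 50))
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms (redundant: 50 > 20)
x_true = np.zeros(50)
x_true[[3, 17, 41]] = [1.0, -2.0, 1.5]     # signal built from only 3 atoms
y = D @ x_true
x = sparse_code_ista(y, D)
```

The recovered `x` is sparse (soft-thresholding produces exact zeros) and reconstructs `y` closely, illustrating "a linear combination of a few atoms".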
Currently, researchers have proposed various dictionary training methods to improve sparse representation capability. The simplest approach uses all training samples directly as the dictionary, as in Sparse Representation Classification (SRC); but when the training set is large, the algorithm's complexity becomes too high and the dictionary is highly redundant. K-SVD is a classical dictionary training algorithm, but it performs poorly in image classification because the class characteristics of the training samples are not used during dictionary training. As improvements, class-oriented dictionary training algorithms such as D-KSVD, DLSI, and FDDL were proposed. For example, D-KSVD introduces a linear classifier so that coding coefficients of samples from the same class become more similar while those of different classes become more distinct; DLSI introduces a coherence constraint among class dictionaries to improve their discriminative power; and FDDL adds a Fisher discriminant term to increase the discriminative power of the coding coefficients. Furthermore, experimental studies have found that although dictionaries of different classes have their own unique features, they also share some features, so researchers proposed joint class-and-shared dictionary training methods such as LRSDL and COPAR. LRSDL keeps the shared dictionary from absorbing class-dictionary characteristics by enforcing its low-rank property, thereby improving the dictionary's discriminative power. However, during joint dictionary training, shared features may still appear in the class dictionaries, which reduces the accuracy of image classification.
Disclosure of Invention
To overcome the defects of the prior art, the invention provides an image classification method based on an incoherent joint dictionary, which prevents shared features from appearing in the class dictionaries so as to improve the discriminative power of the dictionary, optimize the sparse representation of images, and thereby improve the accuracy of image classification.
The technical scheme of the invention is as follows:
an image classification method based on an incoherent joint dictionary is characterized in that a class dictionary is trained for each class of images and a shared dictionary is trained for all images; the low-rank property of the shared dictionary is enforced to keep the shared dictionary from absorbing class-dictionary characteristics, and a coherence constraint term is added between the low-rank shared dictionary and the class dictionaries to prevent shared features from appearing in the class dictionaries. The model is as follows:
wherein the training sample matrix Y = [Y_1, ..., Y_C] contains C classes, with Y_c denoting the class-c training samples; D = [D_1, ..., D_C, D_{C+1}] is the training dictionary, D_c denotes the class-c dictionary, D_{C+1} denotes the shared dictionary, and [D_1, ..., D_C] is the class dictionary; X is the coding coefficient matrix, X_c denotes the coding coefficients of Y_c over the full dictionary D, X_c^c denotes the coding coefficients of Y_c over D_c, and a further symbol denotes X_c with the shared-dictionary coefficients removed; given a matrix A and a natural number n, μ(A, n) denotes the matrix of n identical columns, each equal to the mean vector of all columns of A (the objective function and several symbols appear only as images in the original publication).
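The exact objective appears only as an image in the original publication. Assuming the coherence constraint between the shared dictionary and the class dictionaries takes the common Frobenius form ||D_{C+1}^T D_c||_F^2 (a DLSI-style penalty, used here purely as an assumption), the term and its gradient with respect to the shared dictionary can be sketched as:

```python
import numpy as np

def coherence_penalty(D_shared, class_dicts):
    # Sum of squared Frobenius inner products ||D_shared^T D_c||_F^2 over classes.
    # A smaller value means shared and class atoms are closer to orthogonal.
    return sum(np.linalg.norm(D_shared.T @ Dc, "fro") ** 2 for Dc in class_dicts)

def coherence_grad_shared(D_shared, class_dicts):
    # Gradient of the penalty with respect to the shared dictionary:
    # d/dD_shared ||D_shared^T D_c||_F^2 = 2 * D_c @ D_c^T @ D_shared.
    G = np.zeros_like(D_shared)
    for Dc in class_dicts:
        G += 2.0 * Dc @ (Dc.T @ D_shared)
    return G

rng = np.random.default_rng(1)
D_shared = rng.standard_normal((64, 3))              # small shared dictionary
class_dicts = [rng.standard_normal((64, 10)) for _ in range(5)]
p = coherence_penalty(D_shared, class_dicts)
```

Driving this penalty toward zero during training discourages shared features from leaking into the class dictionaries, which is the stated goal of the constraint.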
Further, updating the variables in the dictionary model by an alternating iterative solution includes: a stochastic gradient descent method updates the class dictionaries D_c, the alternating direction method of multipliers iteratively updates the shared dictionary D_{C+1}, and a fast iterative soft-thresholding algorithm updates the coding coefficient matrix X, as follows:
1) Updating the class dictionary D_c: considering the low complexity and fast running time of stochastic gradient descent (SGD), SGD is used for the update,
where α_1 is the gradient descent step size;
2) Updating the shared dictionary D_{C+1}:
Let X = [X_1; ...; X_c; ...; X_C; X_{C+1}], where X_c and X_{C+1} are the coding coefficients of the training samples under the class-c dictionary and the shared dictionary, respectively. When D_1, ..., D_C and X are fixed, the shared dictionary D_{C+1} is updated using:
I = S(X_{C+1})^T; J = X_{C+1}(X_{C+1})^T
D_{C+1} is updated with the alternating direction method of multipliers by iterating formulas (6)–(8) until convergence, where the nuclear norm minimization subproblem is solved with a singular value thresholding algorithm (formulas (6) and (7) appear only as images in the original publication):
U = U + D_{C+1} − V (8)
where θ is the singular value shrinkage operator, E denotes the identity matrix, and α_2 is the gradient descent step size; D_{C+1} in formula (6) is updated by stochastic gradient descent.
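The singular value thresholding step referenced above can be sketched generically as follows; this is the standard proximal operator of the nuclear norm, with an illustrative threshold τ rather than the patent's specific parameter:

```python
import numpy as np

def svt(M, tau):
    # Singular value thresholding: the proximal operator of tau*||.||_*
    # (nuclear norm). Each singular value is shrunk by tau and values that
    # fall below tau are discarded, yielding a low-rank approximation of M.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)
    return U @ np.diag(s_shrunk) @ Vt

rng = np.random.default_rng(2)
M = rng.standard_normal((30, 8))   # stand-in for the shared dictionary
V = svt(M, tau=3.0)
```

Applying this operator inside the ADMM iteration is what keeps the shared dictionary low-rank and thus unable to absorb class-specific features.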
3) Updating coding coefficient matrix X
When the dictionary D is fixed, a fast iterative soft-thresholding algorithm is used to update X; the derivative of the smooth term s(X) is required in each iterative update (the definition of s(X) appears only as an image in the original publication);
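A generic version of the fast iterative soft-thresholding update can be sketched as follows; since the patent's smooth term s(X) is shown only as an image, the plain reconstruction error 0.5·||Y − DX||_F² is used here as a stand-in assumption for s(X):

```python
import numpy as np

def fista(Y, D, lam=0.2, n_iter=100):
    # FISTA: accelerated proximal gradient for
    #   min_X 0.5*||Y - D X||_F^2 + lam*||X||_1
    L = np.linalg.norm(D, 2) ** 2            # Lipschitz constant of grad s(X)
    X = np.zeros((D.shape[1], Y.shape[1]))
    Z, t = X.copy(), 1.0                     # momentum point and step counter
    for _ in range(n_iter):
        grad = D.T @ (D @ Z - Y)             # derivative of the smooth part s
        W = Z - grad / L
        X_new = np.sign(W) * np.maximum(np.abs(W) - lam / L, 0.0)  # soft-threshold
        t_new = (1 + np.sqrt(1 + 4 * t * t)) / 2
        Z = X_new + ((t - 1) / t_new) * (X_new - X)                # momentum step
        X, t = X_new, t_new
    return X

rng = np.random.default_rng(3)
D = rng.standard_normal((50, 120))
D /= np.linalg.norm(D, axis=0)
Y = rng.standard_normal((50, 4))
X = fista(Y, D)
```

In the patent's setting, `grad` would instead be the derivative of the full discriminative s(X), but the proximal/momentum structure is unchanged.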
The beneficial effects of the invention are as follows: a new dictionary learning method is provided for image classification in which the coherence constraint term between the low-rank shared dictionary and the class dictionaries prevents shared features from appearing in the class dictionaries; this improves the discriminative power of the trained dictionary and optimizes the sparse representation of images, thereby increasing the accuracy of image classification.
Drawings
FIG. 1 is a dictionary sparse representation model in accordance with the present invention.
Fig. 2 is a flow chart of the classification of experimental images according to the present invention.
FIG. 3 shows the experimental datasets described in the present invention, wherein (a) shows the AR gender dataset and (b) shows images from the COIL-100 dataset.
FIG. 4 is a graph showing the effect of shared dictionary size on the overall classification accuracy of the AR gender data in the present invention.
FIG. 5 is a graph showing the effect of shared dictionary size on overall classification accuracy of COIL-100 data in the present invention.
Detailed Description
The invention will be further described with reference to specific examples, but the scope of the invention is not limited thereto.
Referring to fig. 1 to 5, in an image classification method based on incoherence combined dictionary learning, since different classes of images contain shared features, the invention trains one class dictionary for each class of images and one shared dictionary for all images. In addition, the low-rank property of the shared dictionary is enforced to keep the shared dictionary from absorbing class-dictionary characteristics, and a coherence constraint term is added between the low-rank shared dictionary and the class dictionaries to prevent shared features from appearing in the class dictionaries.
Wherein the training sample matrix Y = [Y_1, ..., Y_C] contains C classes, with Y_c denoting the class-c training samples; D = [D_1, ..., D_C, D_{C+1}] is the training dictionary, D_c denotes the class-c dictionary, D_{C+1} denotes the shared dictionary, and [D_1, ..., D_C] is the class dictionary; X is the coding coefficient matrix, X_c denotes the coding coefficients of Y_c over the full dictionary D, X_c^c denotes the coding coefficients of Y_c over D_c, and a further symbol denotes X_c with the shared-dictionary coefficients removed; given a matrix A and a natural number n, μ(A, n) denotes the matrix of n identical columns, each equal to the mean vector of all columns of A (the objective function and several symbols appear only as images in the original publication).
To minimize the dictionary objective function, the class dictionaries D_c, the shared dictionary D_{C+1}, and the sparse coding matrix X are preferably updated alternately using an alternating iterative solution method:
1) Updating the class dictionary D_c: considering the low complexity and fast running time of stochastic gradient descent (SGD), SGD is used for the update,
where α_1 is the gradient descent step size;
2) Updating the shared dictionary D_{C+1}:
Let X = [X_1; ...; X_c; ...; X_C; X_{C+1}], where X_c and X_{C+1} are the coding coefficients of the training samples under the class-c dictionary and the shared dictionary, respectively. When D_1, ..., D_C and X are fixed, the shared dictionary D_{C+1} is updated using:
I = S(X_{C+1})^T; J = X_{C+1}(X_{C+1})^T
D_{C+1} is updated with the alternating direction method of multipliers by iterating formulas (6)–(8) until convergence, where the nuclear norm minimization subproblem is solved with a singular value thresholding algorithm (formulas (6) and (7) appear only as images in the original publication):
U = U + D_{C+1} − V (8)
where θ is the singular value shrinkage operator, E denotes the identity matrix, and α_2 is the gradient descent step size; D_{C+1} in formula (6) is updated by stochastic gradient descent.
3) Updating coding coefficient matrix X
When the dictionary D is fixed, a fast iterative soft-thresholding algorithm is used to update X; the derivative of the smooth term s(X) is required in each iterative update (the definition of s(X) appears only as an image in the original publication);
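Putting the three updates together, a highly simplified alternating training loop can be sketched as follows; it keeps only the skeleton (sparse coding, a dictionary gradient step, and singular value thresholding of the shared block) and omits the patent's discriminative and coherence terms, so all parameters and the objective here are illustrative assumptions:

```python
import numpy as np

def soft(v, t):
    # Element-wise soft-thresholding (proximal operator of the l1 norm).
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def train_joint_dictionary(Y, n_class_atoms, n_shared_atoms, lam=0.1, tau=0.5,
                           lr=0.01, n_outer=20, n_inner=50):
    # Simplified alternating scheme in the spirit of the patent's solver:
    # 3) sparse coding by iterative soft-thresholding,
    # 1) a gradient step on the dictionary, and
    # 2) singular value thresholding on the shared block to keep it low-rank.
    d = Y.shape[0]
    rng = np.random.default_rng(0)
    D = rng.standard_normal((d, n_class_atoms + n_shared_atoms))
    D /= np.linalg.norm(D, axis=0)
    X = np.zeros((D.shape[1], Y.shape[1]))
    for _ in range(n_outer):
        # update coding coefficients X (dictionary fixed)
        L = np.linalg.norm(D, 2) ** 2
        for _ in range(n_inner):
            X = soft(X - D.T @ (D @ X - Y) / L, lam / L)
        # gradient step on the dictionary, then renormalize atoms
        D -= lr * (D @ X - Y) @ X.T
        D /= np.maximum(np.linalg.norm(D, axis=0), 1e-8)
        # enforce low rank on the shared block via singular value thresholding
        B = D[:, n_class_atoms:]
        U, s, Vt = np.linalg.svd(B, full_matrices=False)
        D[:, n_class_atoms:] = U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt
    return D, X

rng = np.random.default_rng(5)
Y = rng.standard_normal((30, 40))       # illustrative training data
D, X = train_joint_dictionary(Y, n_class_atoms=20, n_shared_atoms=5)
```

The real model additionally couples the class dictionaries to class labels and penalizes coherence between the shared and class blocks, but the alternation order matches the three numbered steps above.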
Classification was performed on two image datasets: the AR gender database and the COIL-100 database. The experimental procedure is divided into five steps: first, all datasets are preprocessed to reduce interference with the experiments; second, features are extracted from each image to capture useful information and reduce the feature dimension; next, the discriminative dictionary is trained with the proposed dictionary learning method; then, each image is sparsely coded over the trained dictionary; finally, each image is classified according to the reconstruction error and the dictionary coding term, yielding the Overall Classification Accuracy (OCA).
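The final classification step, assigning a test image to the class whose dictionary reconstructs it best, can be sketched as follows; this simplified version uses per-class least-squares reconstruction instead of the patent's sparse coding and dictionary coding term, and all names and sizes are illustrative:

```python
import numpy as np

def classify_by_reconstruction(y, class_dicts):
    # Assign y to the class whose dictionary gives the smallest
    # least-squares reconstruction error ||y - D_c x_c||_2.
    errors = []
    for Dc in class_dicts:
        x, *_ = np.linalg.lstsq(Dc, y, rcond=None)  # best coefficients for this class
        errors.append(np.linalg.norm(y - Dc @ x))
    return int(np.argmin(errors))

rng = np.random.default_rng(4)
class_dicts = [rng.standard_normal((40, 5)) for _ in range(3)]
# A test signal generated from class 1's dictionary should be assigned to class 1.
y = class_dicts[1] @ rng.standard_normal(5)
pred = classify_by_reconstruction(y, class_dicts)
```

Replacing `lstsq` with a sparse coder (and adding the coding-term score) recovers the shape of the decision rule described in the experiments.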
AR gender experiment: the AR gender face dataset images are divided into two categories, each containing 700 pictures. The dimension of each image after feature extraction is 300, the size of each class dictionary in the invention is 300, and the size of the shared dictionary is set to 3; the influence of the shared dictionary size on classification accuracy is shown in figure 4. The classification accuracy obtained by the various dictionary learning methods is shown in Table 1.
| Dictionary learning method | Classification accuracy (%) |
|---|---|
| SRC | 91 |
| COPAR | 93.56 |
| LRSDL | 92.86 |
| DLSI | 93.86 |
| FDDL | 91.71 |
| Proposed method | 94.56 |

TABLE 1
As shown in Table 1, the classification accuracy obtained with the dictionary training method of the invention on the AR gender dataset is 94.56%, an improvement of 0.7% over the suboptimal algorithm DLSI, and of 3.56%, 1%, 1.7%, and 2.85% over the SRC, COPAR, LRSDL, and FDDL dictionary learning methods, respectively.
COIL-100 experiment: the COIL-100 image dataset contains 100 classes, each containing 72 pictures, and each image has dimension 324 after feature extraction. In the invention, the size of each class dictionary is 45 and the size of the shared dictionary is 3; the influence of the shared dictionary size on the overall classification accuracy is shown in figure 5.
The overall classification accuracy obtained by the various dictionary learning methods is shown in Table 2.
| Dictionary learning method | Overall classification accuracy (%) |
|---|---|
| SRC | 89.61 |
| COPAR | 90.29 |
| LRSDL | 91.76 |
| DLSI | 92.94 |
| FDDL | 88.82 |
| Proposed method | 93.53 |

TABLE 2
As can be seen from Table 2, the overall classification accuracy obtained with the dictionary training method of the invention on the COIL-100 dataset is 93.53%, an improvement of 0.59% over the suboptimal algorithm DLSI, and of 3.92%, 3.24%, 1.77%, and 4.71% over the SRC, COPAR, LRSDL, and FDDL dictionary learning methods, respectively.
The foregoing illustrates the principles of the present invention with reference to the accompanying drawings; the invention is not limited to the specific embodiments shown.
What is not described in detail in this specification is prior art known to those skilled in the art.
Claims (1)
1. An image classification method based on incoherence joint dictionary learning is characterized by comprising the following steps:
firstly, preprocessing all data sets;
secondly, extracting the characteristics of each image;
furthermore, training a discriminant dictionary by adopting a dictionary learning method;
then, sparsely coding the image on the trained dictionary;
finally, classifying the images according to the reconstruction errors and dictionary coding items;
the process of training the discriminant dictionary by adopting the dictionary learning method is as follows:
training a class dictionary for each class of images, training a shared dictionary for all images, ensuring the low-rank property of the shared dictionary, and adding a coherence constraint term between the low-rank shared dictionary and the class dictionary, wherein the model is as follows:
wherein the training sample matrix Y = [Y_1, ..., Y_C] contains C classes, with Y_c denoting the class-c training samples; D = [D_1, ..., D_C, D_{C+1}] is the training dictionary, D_c denotes the class-c dictionary, D_{C+1} denotes the shared dictionary, and [D_1, ..., D_C] is the class dictionary; X is the coding coefficient matrix, X_c denotes the coding coefficients of Y_c over the full dictionary D, X_c^c denotes the coding coefficients of Y_c over D_c, and a further symbol denotes X_c with the shared-dictionary coefficients removed; given a matrix A and a natural number n, μ(A, n) denotes the matrix of n identical columns, each equal to the mean vector of all columns of A (the objective function and several symbols appear only as images in the original publication).
Updating the variables in the dictionary model by an alternating iterative solution includes: updating the class dictionaries with a stochastic gradient descent method, iteratively updating the shared dictionary with the alternating direction method of multipliers, and updating the coding coefficient matrix with a fast iterative soft-thresholding algorithm, as follows:
1) Updating the class dictionary D_c by stochastic gradient descent (the update formula appears only as an image in the original publication), where α_1 is the gradient descent step size;
2) Updating the shared dictionary D_{C+1}:
Let X = [X_1; ...; X_c; ...; X_C; X_{C+1}], where X_c and X_{C+1} are the coding coefficients of the training samples under the class-c dictionary and the shared dictionary, respectively; when D_1, ..., D_C and X are fixed, the shared dictionary D_{C+1} is updated:
D_{C+1} is updated with the alternating direction method of multipliers by iterating formulas (6)–(8) until convergence, where the nuclear norm minimization subproblem is solved with a singular value thresholding algorithm:
V = θ(D_{C+1} + U) (7)
U = U + D_{C+1} − V (8)
where θ is the singular value shrinkage operator, E denotes the identity matrix, and α_2 is the gradient descent step size; D_{C+1} in formula (6) is updated by stochastic gradient descent;
3) Updating coding coefficient matrix X
When the dictionary D is fixed, a fast iterative soft-thresholding algorithm is used to update X; the derivative of the smooth term s(X) is required in each iterative update (the definition of s(X) appears only as an image in the original publication).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010316589.1A CN111666967B (en) | 2020-04-21 | 2020-04-21 | Image classification method based on incoherence combined dictionary learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010316589.1A CN111666967B (en) | 2020-04-21 | 2020-04-21 | Image classification method based on incoherence combined dictionary learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111666967A CN111666967A (en) | 2020-09-15 |
CN111666967B true CN111666967B (en) | 2023-06-13 |
Family
ID=72382643
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010316589.1A Active CN111666967B (en) | 2020-04-21 | 2020-04-21 | Image classification method based on incoherence combined dictionary learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111666967B (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106778807A (en) * | 2016-11-22 | 2017-05-31 | 天津大学 | The fine granularity image classification method of dictionary pair is relied on based on public dictionary pair and class |
CN106815876A (en) * | 2016-12-30 | 2017-06-09 | 清华大学 | Image sparse characterizes the combined optimization training method of many dictionary learnings |
US9865036B1 (en) * | 2015-02-05 | 2018-01-09 | Pixelworks, Inc. | Image super resolution via spare representation of multi-class sequential and joint dictionaries |
CN108573263A (en) * | 2018-05-10 | 2018-09-25 | 西安理工大学 | A kind of dictionary learning method of co-ordinative construction rarefaction representation and low-dimensional insertion |
CN108985177A (en) * | 2018-06-21 | 2018-12-11 | 南京师范大学 | A kind of facial image classification method of the quick low-rank dictionary learning of combination sparse constraint |
CN109409201A (en) * | 2018-09-05 | 2019-03-01 | 昆明理工大学 | A kind of pedestrian's recognition methods again based on shared and peculiar dictionary to combination learning |
CN110705343A (en) * | 2019-08-20 | 2020-01-17 | 西南科技大学 | Face recognition method and system for structure-incoherent projection dictionary pair learning |
Non-Patent Citations (2)
Title |
---|
Liangyue Li, Sheng Li, Yun Fu, "Discriminative Dictionary Learning with Low-Rank Regularization for Face Recognition," FG 2013 (entire document) * |
Tiep Huu Vu and Vishal Monga, "Fast Low-Rank Shared Dictionary Learning for Image Classification," TIP 2017 (main text, sections 1–3) * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||