CN108268872B - Robust nonnegative matrix factorization method based on incremental learning - Google Patents
- Publication number: CN108268872B
- Application number: CN201810166689.3A
- Authority
- CN
- China
- Prior art keywords
- matrix
- sample
- training
- feature extraction
- projection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS; G06—COMPUTING; G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING; G06V10/40—Extraction of image or video features
- G—PHYSICS; G06—COMPUTING; G06F—ELECTRIC DIGITAL DATA PROCESSING; G06F18/00—Pattern recognition; G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G—PHYSICS; G06—COMPUTING; G06F—ELECTRIC DIGITAL DATA PROCESSING; G06F18/00—Pattern recognition; G06F18/2411—Classification techniques based on the proximity to a decision surface, e.g. support vector machines
Abstract
The invention relates to the field of image recognition, in particular to a robust non-negative matrix factorization method based on incremental learning. Building on the traditional robust non-negative matrix factorization method, the invention provides a general robust non-negative matrix factorization method with incremental properties (IRNMF) and applies it to feature extraction for image recognition. As an incremental non-negative matrix factorization method, IRNMF retains the incremental property, so that image recognition gains the capability of self-updating, repeated training is avoided, and recognition efficiency is improved, while the time cost is only 53.7% of that of traditional INMF feature extraction. At the same time, as a feature extraction method, IRNMF produces more stable feature extraction results than traditional methods such as INMF and NMF.
Description
Technical Field
The invention relates to the field of image recognition, in particular to a robust nonnegative matrix factorization method based on incremental learning.
Background
In the task of image recognition, reducing the redundant parts of high-dimensional image data through feature extraction is an important step for improving recognition accuracy and reducing recognition time. Among conventional feature extraction methods, Principal Component Analysis (PCA) can reduce the data dimensionality and find an effective low-dimensional representation of the image data, but in reducing redundancy it inevitably discards part of the discriminative information that matters for image recognition, which lowers recognition accuracy. Linear Discriminant Analysis (LDA) can find effective decision directions for high-dimensional image data, but it easily runs into the "small sample problem": when the number of training samples is smaller than the dimensionality of the image feature space, the trained model overfits and loses generalization capability. In contrast, Non-negative Matrix Factorization (NMF) reduces the dimensionality of high-dimensional image data under non-negativity constraints and gives a purely additive, parts-based description of the decomposed data, which conforms more closely to human psychological and physiological perception of the real world and makes the method widely applicable. However, although traditional NMF achieves effective dimensionality reduction, its feature extraction results are very sensitive to abnormal image samples caused by noise in the data, so the feature extraction model is unstable. Robust Non-negative Matrix Factorization (RNMF) elegantly solves this problem by imposing a new norm constraint on top of NMF, making it an extremely effective feature extraction approach.
However, as high-dimensional image data resources grow ever richer, the training samples of a feature extraction model also increase rapidly. In image recognition, the training time of the feature extraction model is directly related to the number of training samples. In traditional feature extraction model training, newly added samples are simply appended to the original sample set and the whole set is trained again, which means repeated training of old samples, increased computational cost and reduced recognition efficiency, while ever more storage space must be spent on keeping the existing training samples.
One way to solve this problem is to give the feature extraction model the ability of self-updating and growing learning, completing feature extraction by incremental learning. In recent years, many incremental feature extraction methods have been proposed in fields such as video surveillance and face recognition, for example Incremental Principal Component Analysis (IPCA), Incremental Linear Discriminant Analysis (ILDA) and Incremental Non-negative Matrix Factorization (INMF). Although these methods give the original feature extraction models incremental learning capability, they remain limited by the inherent defects of their base methods. Among existing traditional feature extraction methods, RNMF is one of the most robust; if incremental learning is applied to RNMF, training time can be greatly reduced and the storage space for training samples shrunk while keeping the method stable and preserving as much discriminative information of the high-dimensional image data as possible. This would certainly be a better incremental feature extraction method.
Disclosure of Invention
Aiming at the above problems and defects, the invention provides a Robust Non-negative Matrix Factorization method based on Incremental learning (IRNMF), which overcomes the need of traditional feature extraction methods to relearn repeatedly as training samples grow, so that the feature extraction method gains the capabilities of self-updating and growing learning, extracts as much discriminative information of the high-dimensional image data as possible, and keeps the model stable.
The invention is realized by the following steps; the feature extraction algorithm is shown in FIG. 1.

Step 1, perform RNMF initialization on the existing image sample training data to obtain an initial projection matrix W, a coding matrix H and a diagonal element matrix D.

In RNMF, a sample matrix V ∈ R^(m×n), in which each column represents a training sample with m pixel points and there are n training samples in total, is decomposed into a basis matrix W ∈ R^(m×r) and a coding matrix H ∈ R^(r×n), and a diagonal element matrix D ∈ R^(n×n) is obtained, namely:

V = WH

D = diag(D_11, D_22, ..., D_nn)

where W ≥ 0, H ≥ 0, r represents the dimensionality after reduction, and D_ii is the i-th diagonal element of the diagonal element matrix, i.e.:

D_ii = 1 / ||v_i − W h_i||_2

The objective function of RNMF is defined as:

min_{W≥0, H≥0} F(W, H) = ||V − WH||_{2,1} = Σ_{j=1}^{n} ||v_j − W h_j||_2

where ||·||_{2,1} represents the newly defined 2-1 norm. Applying the gradient descent method yields the RNMF iteration rules:

W_{iα} ← W_{iα} · (V D H^T)_{iα} / (W H D H^T)_{iα}

H_{αj} ← H_{αj} · (W^T V D)_{αj} / (W^T W H D)_{αj}

where i = 1, ..., m, α = 1, ..., r, j = 1, ..., n.

Iterating these formulas until convergence yields each element W_{iα} of the initial projection matrix and each element H_{αj} of the coding matrix, from which the diagonal element matrix D is calculated.
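The RNMF initialization above can be sketched in NumPy as follows. This is a minimal illustration of L2,1-norm robust NMF with multiplicative updates; the function name `rnmf`, the random non-negative initialization, and the fixed iteration count are illustrative assumptions, not prescribed by the patent.

```python
import numpy as np

def rnmf(V, r, n_iter=200, eps=1e-9):
    """Sketch of L2,1-norm robust NMF initialization.

    V : (m, n) non-negative sample matrix, one training image per column.
    Returns projection matrix W (m, r), coding matrix H (r, n) and the
    diagonal weight matrix D (n, n) with D_ii = 1 / ||v_i - W h_i||_2.
    """
    m, n = V.shape
    rng = np.random.default_rng(0)
    W = rng.random((m, r))
    H = rng.random((r, n))
    for _ in range(n_iter):
        # Per-sample weights: columns with a large residual get a small
        # weight, suppressing the influence of abnormal (noisy) samples.
        resid = np.linalg.norm(V - W @ H, axis=0)
        D = np.diag(1.0 / (resid + eps))
        # Multiplicative updates derived from the gradient of ||V - WH||_{2,1}
        H *= (W.T @ V @ D) / (W.T @ W @ H @ D + eps)
        W *= (V @ D @ H.T) / (W @ H @ D @ H.T + eps)
    resid = np.linalg.norm(V - W @ H, axis=0)
    D = np.diag(1.0 / (resid + eps))
    return W, H, D
```

Because every factor in the update ratios is non-negative, W and H stay non-negative throughout the iteration without explicit clipping.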
Step 2, after the robust non-negative matrix factorization initialization of the partial training samples is completed, the initial projection matrix W, coding matrix H and diagonal matrix D are obtained. When a new training sample joins the model training, the IRNMF algorithm uses the new sample information to complete the global update of the projection matrix W and the local updates of the coding matrix H and the diagonal matrix D, thereby realizing incremental learning.

Let the number of initial training samples be k; the cost function is:

F_k = Σ_{i=1}^{k} ||v_i − W_k h_i||_2

When a training sample v_{k+1} is newly added, i.e. when the number of training samples is k+1, the cost function is:

F_{k+1} = Σ_{i=1}^{k+1} ||v_i − W_{k+1} h_i||_2

Therefore:

F_{k+1} ≈ F_k + f_{k+1},  f_{k+1} = ||v_{k+1} − W_{k+1} h_{k+1}||_2

where F_{k+1} represents the cost function of k+1 training samples, W_{k+1} the projection matrix of k+1 training samples, H_{k+1} the coding matrix of k+1 training samples, h_{k+1} (the (k+1)-th column of H) the newly added sample in the coding matrix, v_{k+1} (the (k+1)-th column of V) the newly added training sample, and f_{k+1} is the cost function of the incremental portion.

In the course of incremental learning, the independent variables of the cost function F_{k+1} are the projection matrix W_{k+1}, the newly added coding-matrix sample h_{k+1}, and the newly added diagonal element d_{k+1}. First, the gradient descent method is adopted to solve for h_{k+1}; the iteration rule is:

(h_{k+1})_α ← (h_{k+1})_α − μ_α · (W^T W h_{k+1} − W^T v_{k+1})_α

The step size μ_α is chosen as:

μ_α = (h_{k+1})_α / (W^T W h_{k+1})_α

Initializing the diagonal element d_{k+1} = D_kk, the multiplicative iteration rule for h_{k+1} is obtained:

(h_{k+1})_α ← (h_{k+1})_α · (W^T v_{k+1})_α / (W^T W h_{k+1})_α

Subsequently, the last diagonal element d_{k+1} of the diagonal matrix is updated:

d_{k+1} = 1 / ||v_{k+1} − W h_{k+1}||_2

Let D_{k+1,k+1} = d_{k+1}, thereby completing the local update of the diagonal matrix D_{k+1} for the single sample.

Finally, the gradient descent method is adopted to obtain the iteration rule for each element (W_{k+1})_{iα} of the new projection matrix:

(W_{k+1})_{iα} ← (W_{k+1})_{iα} − μ_{iα} · (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T) − (V_k D_k H_k^T + d_{k+1} v_{k+1} h_{k+1}^T))_{iα}

where the step size selected for the gradient descent of each element (W_{k+1})_{iα} is:

μ_{iα} = (W_{k+1})_{iα} / (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T))_{iα}

which gives the multiplicative iteration rule for the new projection matrix W_{k+1}:

(W_{k+1})_{iα} ← (W_{k+1})_{iα} · (V_k D_k H_k^T + d_{k+1} v_{k+1} h_{k+1}^T)_{iα} / (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T))_{iα}

Iterating until convergence yields the new projection matrix W_{k+1}, completing the update of W_{k+1} for the single sample.
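The single-sample incremental update above can be sketched as follows. This is a minimal NumPy illustration, assuming the history terms V_k D_k H_k^T and H_k D_k H_k^T are carried as accumulated matrices `A` and `B` so that old samples need not be stored; the function and variable names, the initialization of h_{k+1}, and the fixed iteration counts are assumptions, not specified by the patent.

```python
import numpy as np

def irnmf_update(W, A, B, v_new, n_iter=100, eps=1e-9):
    """Sketch of one IRNMF incremental step for a single new sample.

    W     : (m, r) current projection matrix.
    A, B  : accumulated history terms, A ~ V_k D_k H_k^T (m, r) and
            B ~ H_k D_k H_k^T (r, r).
    v_new : (m,) newly added training sample v_{k+1}.
    """
    m, r = W.shape
    h = np.full(r, v_new.mean() + eps)            # init coding vector h_{k+1}
    for _ in range(n_iter):                       # local update of h_{k+1}
        h *= (W.T @ v_new) / (W.T @ W @ h + eps)
    d = 1.0 / (np.linalg.norm(v_new - W @ h) + eps)   # new diagonal element
    # Fold the weighted new sample into the history, then update W globally.
    A_new = A + d * np.outer(v_new, h)
    B_new = B + d * np.outer(h, h)
    W = W.copy()
    for _ in range(n_iter):
        W *= A_new / (W @ B_new + eps)
    return W, h, d, A_new, B_new
```

Note that the cost of one call depends only on m and r, not on the number of previously seen samples, which is what makes the scheme incremental.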
Step 3, after the update of the projection matrix W, the training samples and the samples to be recognized are projected into the feature space.

First, all training samples are re-projected:

V'_train = (W^T W)^(-1) W^T V_train

where V'_train ∈ R^(r×n) is the projection of the training sample matrix V_train ∈ R^(m×n) into the feature space W.

Then, the sample to be recognized is projected:

h'_test = (W^T W)^(-1) W^T h_test

where h'_test ∈ R^r is the projection of the sample vector h_test ∈ R^m to be recognized into the feature space W.
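The feature-space projection above is a least-squares projection and can be written directly in NumPy. The sketch solves the normal equations with `np.linalg.solve` rather than forming (W^T W)^(-1) explicitly — a standard numerical-stability choice that is an implementation detail, not prescribed by the patent.

```python
import numpy as np

def project(W, X):
    """Project samples into the feature space spanned by W:
    X' = (W^T W)^{-1} W^T X.

    W : (m, r) projection matrix.
    X : (m, n) training matrix or (m,) single sample vector.
    Returns the (r, n) or (r,) coordinates in the feature space.
    """
    # Solving W^T W x' = W^T x avoids an explicit matrix inverse.
    return np.linalg.solve(W.T @ W, W.T @ X)
```

For a sample that already lies in the column space of W, the projection recovers its coefficients exactly.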
Based on the traditional robust non-negative matrix factorization method, the invention provides a general incremental learning method: robust non-negative matrix factorization based on incremental learning, applied to image recognition. As an effective incremental feature extraction method, IRNMF not only greatly reduces training time and the storage space of training samples, but also keeps the feature extraction method stable while ensuring that as much discriminative information as possible is obtained from the high-dimensional image data, so that the image recognition model can meet higher performance requirements.

In conclusion, compared with existing feature extraction methods, the method has the capability of incremental online learning, needs no repeated training, and greatly improves recognition efficiency; after feature extraction of the image information, the stability of the feature extraction model is ensured while retaining as much effective discriminative information as possible; and on the basis of improved recognition efficiency, the recognition rate of the incremental method is higher than that of the traditional methods while the training time is greatly reduced.
Drawings
FIG. 1 is a flow chart of the feature extraction method of the present invention
FIG. 2 is a MSTAR target slice read image presentation
FIG. 3 shows recognition rate statistics of three methods for SAR three-class target recognition incremental learning task
FIG. 4 is a time cost comparison of three methods for SAR three-class target recognition incremental learning task
Detailed Description
The invention is further explained by taking the three-class MSTAR target image recognition task as an example, simulating a practical incremental learning application.

The samples used in the experiment are MSTAR three-class target slices in 64×64 RAW-format data; the training samples are targets at a 17° pitch angle and the test samples are targets at a 15° pitch angle. Table 1 shows the MSTAR three-class target distribution. An example of a target slice read as an image is shown in FIG. 2.
TABLE 1 MSTAR three classes target distribution
Because IRNMF has incremental learning capability, the training samples are divided into an initial set and newly added samples, and the newly added samples are split into several batches to simulate batch acquisition in practical applications. The test samples are unlabeled and do not participate in training. Each time a batch of new samples is obtained, the per-sample feature extraction time of the new training samples is recorded, and with all other comparison conditions identical, the effect of the feature extraction is reflected by the recognition accuracy on the test samples.

The experiment sets the number of initial training samples to 100, with 50 new training samples added per batch (a final batch of fewer than 50 is still trained as one incremental group), acquired over 12 batches. For three methods — Non-negative Matrix Factorization (NMF), Incremental Non-negative Matrix Factorization (INMF) and Incremental Robust Non-negative Matrix Factorization (IRNMF) — the recognition accuracy after each sample acquisition and the time consumed by each feature extraction are recorded (RNMF itself is excluded from the comparison because its per-extraction time grows with the number of training samples). NMF is a traditional method that requires retraining. The recognition accuracy of the three methods as the number of samples grows is plotted in FIG. 3. As can be seen from the figure, the recognition accuracy of IRNMF is higher than that of the traditional NMF and of INMF, and the final recognition accuracy reaches 96.2637%. In addition, during the incremental process the learning effect of IRNMF increases steadily with the number of training samples, while the learning effects of INMF and NMF fluctuate to different degrees.

The time cost of the three feature extraction methods in each training round as the number of samples grows is shown in FIG. 4. The training time of the non-incremental NMF method increases linearly with the number of samples. INMF greatly reduces the time cost of feature extraction by avoiding repeated training, and IRNMF further reduces the feature extraction time of a single training sample from 0.0255 s to 0.0137 s on the basis of INMF, i.e. only 53.7% of the INMF feature extraction time cost. The feature extraction task in image recognition is thus completed faster and more stably.
Claims (2)
1. A robust nonnegative matrix factorization method based on incremental learning is characterized by comprising the following steps:
step 1, performing RNMF initialization on the existing image sample training data to obtain an initial projection matrix W, a coding matrix H and a diagonal element matrix D; the method specifically comprises the following steps:
in RNMF, a sample matrix V ∈ R^(m×n), in which each column represents a training sample with m pixel points and there are n training samples in total, is decomposed into a basis matrix W ∈ R^(m×r) and a coding matrix H ∈ R^(r×n), and a diagonal element matrix D ∈ R^(n×n) is obtained, namely:

V = WH

D = diag(D_11, ..., D_ii, ..., D_nn)

where W ≥ 0, H ≥ 0, r represents the dimensionality after reduction, and D_ii is the i-th diagonal element of the diagonal element matrix, i.e.:

D_ii = 1 / ||v_i − W h_i||_2

the objective function of RNMF is defined as:

min_{W≥0, H≥0} F(W, H) = ||V − WH||_{2,1} = Σ_{j=1}^{n} ||v_j − W h_j||_2

where ||·||_{2,1} represents the newly defined 2-1 norm; the gradient descent method yields the RNMF iteration rules:

W_{iα} ← W_{iα} · (V D H^T)_{iα} / (W H D H^T)_{iα}

H_{αj} ← H_{αj} · (W^T V D)_{αj} / (W^T W H D)_{αj}

where i = 1, ..., m, α = 1, ..., r, j = 1, ..., n;

iterating these formulas until convergence yields each element W_{iα} of the initial projection matrix and each element H_{αj} of the coding matrix, from which the diagonal element matrix D is calculated;
step 2, when a new training sample is added to the model training, the incremental-learning robust non-negative matrix factorization algorithm is computed from the new sample information, completing the local update of the coding matrix H, the local update of the diagonal matrix D and the global update of the projection matrix W, thereby realizing incremental learning; specifically:

when a training sample v_{k+1} is newly added, i.e. when the number of training samples is k+1, the cost function is:

F_{k+1} = Σ_{i=1}^{k+1} ||v_i − W_{k+1} h_i||_2

therefore:

F_{k+1} ≈ F_k + f_{k+1},  f_{k+1} = ||v_{k+1} − W_{k+1} h_{k+1}||_2

where F_k = Σ_{i=1}^{k} ||v_i − W_k h_i||_2 is the cost function of the initial k training samples, F_{k+1} represents the cost function of k+1 training samples, W_{k+1} the projection matrix of k+1 training samples, H_{k+1} the coding matrix of k+1 training samples, h_{k+1} (the (k+1)-th column of H) the newly added sample in the coding matrix, v_{k+1} (the (k+1)-th column of V) the newly added training sample, and f_{k+1} is the cost function of the incremental portion;

adopting the gradient descent method, the newly added coding-matrix sample h_{k+1} is solved first; the iteration rule for each element (h_{k+1})_α is:

(h_{k+1})_α ← (h_{k+1})_α − μ_α · (W^T W h_{k+1} − W^T v_{k+1})_α

where the step size μ_α is chosen as:

μ_α = (h_{k+1})_α / (W^T W h_{k+1})_α

which gives the multiplicative rule (h_{k+1})_α ← (h_{k+1})_α · (W^T v_{k+1})_α / (W^T W h_{k+1})_α; iterating until convergence yields the new coding matrix H_{k+1} = [H_k, h_{k+1}], completing the local update of H for the single sample;

then, the last diagonal element d_{k+1} of the diagonal matrix is updated:

d_{k+1} = 1 / ||v_{k+1} − W h_{k+1}||_2

let D_{k+1,k+1} = d_{k+1}, thereby completing the local update of the diagonal matrix D_{k+1} for the single sample;

the iteration rule for each element (W_{k+1})_{iα} of the new projection matrix is:

(W_{k+1})_{iα} ← (W_{k+1})_{iα} − μ_{iα} · (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T) − (V_k D_k H_k^T + d_{k+1} v_{k+1} h_{k+1}^T))_{iα}

where the step size selected for the gradient descent of each element (W_{k+1})_{iα} is:

μ_{iα} = (W_{k+1})_{iα} / (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T))_{iα}

which gives the multiplicative rule:

(W_{k+1})_{iα} ← (W_{k+1})_{iα} · (V_k D_k H_k^T + d_{k+1} v_{k+1} h_{k+1}^T)_{iα} / (W_{k+1}(H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T))_{iα}

iterating until convergence yields the new projection matrix W_{k+1}, completing the update of W for the single sample;
step 3, after updating of the projection matrix W, projecting the training sample and the sample to be identified in the feature space;
all training samples are re-projected:

V'_train = (W^T W)^(-1) W^T V_train

where V'_train ∈ R^(r×n) is the projection of the training sample matrix V_train ∈ R^(m×n) into the feature space W;

the sample to be recognized is projected:

h'_test = (W^T W)^(-1) W^T h_test

where h'_test ∈ R^r is the projection of the sample vector h_test ∈ R^m to be recognized into the feature space W;
step 4, performing classification and recognition after feature extraction: training on the training-sample features V'_train, and classifying and recognizing the features h'_test of the sample to be recognized.
2. The robust nonnegative matrix factorization method based on incremental learning as claimed in claim 1, wherein in step 2, after each new-sample update, in addition to saving the current iteration results h_{k+1}, d_{k+1} and W_{k+1}, the history information required for the next update is also stored, namely the accumulated terms:

A_{k+1} = V_k D_k H_k^T + d_{k+1} v_{k+1} h_{k+1}^T

B_{k+1} = H_k D_k H_k^T + d_{k+1} h_{k+1} h_{k+1}^T
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810166689.3A CN108268872B (en) | 2018-02-28 | 2018-02-28 | Robust nonnegative matrix factorization method based on incremental learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108268872A CN108268872A (en) | 2018-07-10 |
CN108268872B true CN108268872B (en) | 2021-06-08 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109918616B (en) * | 2019-01-23 | 2020-01-31 | 中国人民解放军32801部队 | visual media processing method based on semantic index precision enhancement |
CN110334761B (en) * | 2019-07-03 | 2021-05-04 | 北京林业大学 | Supervised image identification method based on orthogonality constraint increment non-negative matrix factorization |
CN110569879B (en) * | 2019-08-09 | 2024-03-15 | 平安科技(深圳)有限公司 | Tongue image extraction method, tongue image extraction device and computer readable storage medium |
CN110673206B (en) * | 2019-08-26 | 2020-12-29 | 吉林大学 | Satellite magnetic field data earthquake abnormity detection method based on non-negative matrix factorization |
CN113285758B (en) * | 2021-05-18 | 2022-06-14 | 成都信息工程大学 | Optical fiber nonlinear equalization method based on IPCA-DNN algorithm |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103413117A (en) * | 2013-07-17 | 2013-11-27 | 浙江工业大学 | Incremental learning and face recognition method based on locality preserving nonnegative matrix factorization ( LPNMF) |
CN106597439A (en) * | 2016-12-12 | 2017-04-26 | 电子科技大学 | Synthetic aperture radar target identification method based on incremental learning |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9626566B2 (en) * | 2014-03-19 | 2017-04-18 | Neurala, Inc. | Methods and apparatus for autonomous robotic control |
Non-Patent Citations (3)
Title |
---|
Incremental robust nonnegative matrix factorization for object tracking; Fanghui Liu et al.; International Conference on Neural Information Processing; Sep. 2016; pp. 611-619 *
The feasibility analysis of applying NMF in SAR target recognition; Zongjie Cao et al.; 2015 IEEE International Conference on Digital Signal Processing (DSP); Sep. 2015; pp. 1376-1385 *
Two-dimensional projection non-negative matrix factorization algorithm and its application in face recognition; Weitao Fang et al.; Acta Automatica Sinica; Dec. 2012; vol. 38, no. 9; pp. 1503-1512 *
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||