CN107451565B - Semi-supervised small sample deep learning image mode classification and identification method - Google Patents

Semi-supervised small sample deep learning image mode classification and identification method

Info

Publication number
CN107451565B
CN107451565B CN201710647312.5A CN201710647312A
Authority
CN
China
Prior art keywords
layer
sample
representing
deep learning
semi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710647312.5A
Other languages
Chinese (zh)
Other versions
CN107451565A (en)
Inventor
周喜川
刘念
唐枋
胡盛东
林智
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University
Original Assignee
Chongqing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University filed Critical Chongqing University
Priority to CN201710647312.5A priority Critical patent/CN107451565B/en
Publication of CN107451565A publication Critical patent/CN107451565A/en
Application granted granted Critical
Publication of CN107451565B publication Critical patent/CN107451565B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 20/00 - Scenes; Scene-specific elements
    • G06V 20/10 - Terrestrial scenes
    • G06V 20/13 - Satellite images
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 - Pattern recognition
    • G06F 18/20 - Analysing
    • G06F 18/21 - Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F 18/214 - Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/08 - Learning methods
    • G06N 3/084 - Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Biology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a semi-supervised small-sample deep learning image pattern classification and identification method, belonging to the field of image recognition. The method comprises the following steps: S1: preprocessing the image samples; S2: inputting the preprocessed data into a trained network, which extracts features through 3D convolutional layers to obtain feature maps; S3: following each convolutional layer with a pooling layer that reduces the size of the feature maps and thereby the number of parameters in the network; S4: connecting the features extracted by the stacked convolutional and pooling layers to a fully connected layer, which extracts and rearranges the features to be classified and introduces a local neighbor-preserving regularization operation; S5: inputting the samples to be tested to obtain the classification accuracy. The invention exploits the positional correlation among a large number of collected unlabeled samples, improving the applicability and accuracy of the algorithm on small sample sets.

Description

Semi-supervised small sample deep learning image mode classification and identification method
Technical Field
The invention belongs to the field of image recognition, and relates to a semi-supervised small sample deep learning image pattern classification and recognition method.
Background
When enough labeled samples are available, deep learning methods can adaptively extract image features by building hierarchical models and thereby significantly improve the accuracy of pattern classification and recognition. Deep convolutional neural networks (CNNs) have reached 95%-99% accuracy on some remote sensing image datasets. An article published by Yushi Chen in 2016 in TGRS proposed a feature extraction model based on a 3D convolutional neural network (3D-CNN) with classification accuracy above 98%. However, because the model has a large number of parameters, it requires a large number of labeled samples for training.
Supervised CNN-based deep learning models (which require labeled training samples) have large parameter counts and need many labeled samples during training to reach high classification accuracy. In some fields, however, samples are difficult to collect and label analysis is costly, so pattern classification and recognition methods based on deep learning face a shortage of labeled samples. At present, hyperspectral image processing based on deep learning generally needs 10^3-10^4 training labels per class to achieve optimal performance, which far exceeds the number of labeled samples that can be provided in practical applications such as mineral identification.
Disclosure of Invention
In view of the above, the present invention provides a semi-supervised small sample deep learning image pattern classification and identification method. It combines an unsupervised pattern recognition approach (which needs no labeled training samples) with a 3D-CNN and exploits the large amount of unlabeled sample data that can be obtained, so as to reduce the dependence of the deep learning method on labeled samples and improve the accuracy of deep-learning-based pattern classification and recognition.
To achieve the above purpose, the invention provides the following technical solution:
a semi-supervised small sample deep learning image pattern classification and identification method comprises the following steps:
S1: preprocessing the image samples;
S2: inputting the preprocessed data into a trained network, which extracts features through 3D convolutional layers to obtain feature maps;
S3: following each convolutional layer with a pooling layer that reduces the size of the feature maps and thereby the number of parameters in the network;
S4: connecting the features extracted by the stacked convolutional and pooling layers to a fully connected layer, which extracts and rearranges the features to be classified; this layer introduces a local neighbor-preserving regularization operation that reduces the feature difference between adjacent samples and thereby mitigates the loss of classification accuracy caused by the shortage of labeled samples;
S5: inputting the samples to be tested to obtain the classification accuracy.
Further, S1 specifically comprises: selecting a number A of target pixels and their corresponding labels as training samples, and forming, for each target pixel, a neighborhood matrix from the A pixels around it (including the target pixel itself); the labels and neighborhood matrices of all target pixels are used as the input data of the 3D convolutional neural network (3D-CNN).
Further, the specific algorithm of S2 is:
v_{ij}^{xyz} = g\Big( b_{ij} + \sum_{m} \sum_{p=0}^{P_i-1} \sum_{q=0}^{Q_i-1} \sum_{r=0}^{R_i-1} w_{ijm}^{pqr} \, v_{(i-1)m}^{(x+p)(y+q)(z+r)} \Big)

wherein v_{ij}^{xyz} represents the output value of the neuron at position (x, y, z) on the jth feature map of the ith layer; m indexes the feature maps of layer i-1 connected to the jth feature map; P_i, Q_i and R_i respectively represent the height, width and depth of the ith-layer convolution kernel; w_{ijm}^{pqr} is the kernel value at position (p, q, r) connected to the mth feature map; b_{ij} is the bias of the jth feature map; and g(·) is the activation function.
Further, the specific algorithm of S3 is:
\alpha_{i,j,m} = \max_{(x,y,z) \in u(n \times n \times n)} v_{ij}^{xyz}

where u(n × n × n) denotes a three-dimensional window acting on the convolutional-layer output features, and α_{i,j,m} represents the maximum of the feature points within that window.
Further, the specific algorithm of S4 is:
s_{ij} = \exp\!\big( -\| g^{(i)} - g^{(j)} \|^2 / \sigma^2 \big)

R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2

h_f^{(i)} = \mathrm{sig}\big( W_f v_f^{(i)} + b_f \big)

wherein s_{ij} is the similarity weight between the ith and jth training samples, computed from the distance between their coordinates g^{(i)} and g^{(j)} (σ is a width parameter); h_f^{(i)} and h_f^{(j)} respectively represent the features extracted for the ith and jth training samples; R(W_f) represents the regularization term; when training samples i and j are adjacent, s_{ij} is larger, the difference between W_f v_f^{(i)} and W_f v_f^{(j)} becomes smaller, and the features h_f^{(i)} and h_f^{(j)} are likewise close.
Further, R(W_f) satisfies:
R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2 = \mathrm{tr}\big( W_f V_f P V_f^{T} W_f^{T} \big)

wherein V_f = [v_f^{(1)}, ..., v_f^{(L)}, ..., v_f^{(N)}] is the matrix formed by the input vectors of the fully connected layer, D is the diagonal matrix with D_{ii} = \sum_j s_{ij}, and P = D - S;
the derivative of the regularization term is:

\frac{\partial R(W_f)}{\partial W_f} = 2 \, W_f V_f P V_f^{T}
the invention has the beneficial effects that: the invention provides a deep learning model of small label samples on the basis of the existing 3DCNN algorithm, and improves the applicability and accuracy of the algorithm under a small sample set by utilizing the position correlation among a large number of collected label-free samples.
Drawings
To make the objects, technical solutions and beneficial effects of the invention clearer, the following drawings are provided for explanation:
FIG. 1 is a block diagram of the present invention;
FIG. 2 is a schematic diagram of the present invention.
Detailed Description
Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
Correlation among nearby samples is widespread in image pattern recognition applications. For example, in scene recognition, remote sensing object recognition and medical image recognition, the labels of nearby pixel samples tend to be correlated. A pattern classification and recognition method based on a locally adjacent convolutional neural network is therefore proposed; it exploits the positional correlation among a large number of collected unlabeled samples to improve the applicability and accuracy of the algorithm on small sample sets. As shown in fig. 1, the specific steps are as follows:
1. data pre-processing
A certain number of target pixels and their corresponding labels are selected as training samples. For each target pixel, a neighborhood matrix (including the target pixel itself) is formed from the same number of surrounding pixels. The labels and neighborhood matrices of all target pixels are used as the input data of the 3D-CNN.
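As an illustration of this preprocessing step, the following minimal numpy sketch cuts a spatial neighborhood cube around each labeled target pixel. The function name, the reflect padding and the window size of 5 are assumptions made for the example; the patent only specifies that a fixed number of surrounding pixels (including the target pixel) forms the neighborhood matrix.

```python
import numpy as np

def extract_neighborhood_cubes(image, coords, window=5):
    """Cut a (window x window x bands) neighborhood cube around each target pixel.

    image  : ndarray of shape (H, W, B), e.g. a hyperspectral scene
    coords : list of (row, col) target-pixel positions
    window : odd spatial size of the neighborhood (assumed value)
    """
    half = window // 2
    # Pad spatially so border pixels also get a full neighborhood.
    padded = np.pad(image, ((half, half), (half, half), (0, 0)), mode="reflect")
    cubes = []
    for r, c in coords:
        patch = padded[r:r + window, c:c + window, :]
        cubes.append(patch)
    # Shape (N, window, window, B): the 3D input blocks fed to the 3D-CNN.
    return np.stack(cubes)

# Example: 3 labeled target pixels from a 100x100 scene with 30 spectral bands.
scene = np.random.rand(100, 100, 30)
targets = [(10, 12), (50, 60), (99, 0)]
labels = np.array([0, 1, 1])        # labels accompany the cubes as training targets
cubes = extract_neighborhood_cubes(scene, targets)
print(cubes.shape)  # (3, 5, 5, 30)
```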
2. Feature extraction
(1) Convolutional layer(s)
The preprocessed data are fed into the trained network, and features are extracted through 3D convolutional layers. Each layer contains a different number of 3D convolution kernels, which extract features from the input data and generate the corresponding feature maps.
The specific algorithm is as follows:
v_{ij}^{xyz} = g\Big( b_{ij} + \sum_{m} \sum_{p=0}^{P_i-1} \sum_{q=0}^{Q_i-1} \sum_{r=0}^{R_i-1} w_{ijm}^{pqr} \, v_{(i-1)m}^{(x+p)(y+q)(z+r)} \Big)

where v_{ij}^{xyz} represents the output value of the neuron at position (x, y, z) on the jth feature map of the ith layer. m indexes the feature maps of layer i-1 connected to the jth feature map. P_i and Q_i respectively represent the height and width of the ith-layer convolution kernel, and R_i its depth. w_{ijm}^{pqr} is the kernel value at position (p, q, r) connected to the mth feature map. b_{ij} is the bias of the jth feature map. g(·) is the activation function.
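The formula above can be read directly as nested sums. The sketch below is a literal (unoptimized) numpy implementation of it; the sigmoid activation, kernel sizes and numbers of feature maps are chosen arbitrarily for illustration rather than taken from the patent.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def conv3d_layer(prev_maps, kernels, biases, g=sigmoid):
    """Literal form of v_ij^{xyz} = g(b_ij + sum_m sum_pqr w_ijm^{pqr} v_(i-1)m^{(x+p)(y+q)(z+r)}).

    prev_maps : (M, X, Y, Z)      feature maps of layer i-1
    kernels   : (J, M, P, Q, R)   one P x Q x R kernel per (output map j, input map m)
    biases    : (J,)              one bias per output feature map j
    returns   : (J, X-P+1, Y-Q+1, Z-R+1), the layer-i feature maps ("valid" convolution)
    """
    M, X, Y, Z = prev_maps.shape
    J, _, P, Q, R = kernels.shape
    out = np.zeros((J, X - P + 1, Y - Q + 1, Z - R + 1))
    for j in range(J):
        for x in range(out.shape[1]):
            for y in range(out.shape[2]):
                for z in range(out.shape[3]):
                    patch = prev_maps[:, x:x + P, y:y + Q, z:z + R]  # all connected maps m
                    out[j, x, y, z] = g(biases[j] + np.sum(kernels[j] * patch))
    return out

# Toy usage: 2 input maps of size 8x8x8, 4 output maps with 3x3x3 kernels.
rng = np.random.default_rng(0)
v_prev = rng.standard_normal((2, 8, 8, 8))
w = rng.standard_normal((4, 2, 3, 3, 3)) * 0.1
b = np.zeros(4)
v_next = conv3d_layer(v_prev, w, b)
print(v_next.shape)  # (4, 6, 6, 6)
```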
(2) Pooling layer (multiple)
Each convolutional layer is followed by a pooling layer, which reduces the size of the feature maps and thereby the number of parameters in the network. The most common pooling operation, max pooling, is computed as follows:
\alpha_{i,j,m} = \max_{(x,y,z) \in u(n \times n \times n)} v_{ij}^{xyz}

where u(n × n × n) denotes a three-dimensional window acting on the convolutional-layer output features, and α_{i,j,m} represents the maximum of the feature points within that window.
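A minimal sketch of the max-pooling operation described here, assuming non-overlapping windows and map dimensions divisible by the window size n (the patent does not fix these details):

```python
import numpy as np

def max_pool3d(feature_map, n=2):
    """Max over non-overlapping n x n x n windows of a single 3D feature map.

    feature_map : (X, Y, Z) with X, Y, Z divisible by n (assumed for simplicity)
    returns     : (X//n, Y//n, Z//n), each entry is the maximum of its window
    """
    X, Y, Z = feature_map.shape
    blocks = feature_map.reshape(X // n, n, Y // n, n, Z // n, n)
    return blocks.max(axis=(1, 3, 5))

pooled = max_pool3d(np.random.rand(6, 6, 6), n=2)
print(pooled.shape)  # (3, 3, 3)
```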
(3) Fully connected layer with local neighbor preserving regularization
The features extracted by the stacked convolutional and pooling layers are fed into a fully connected layer, which extracts and rearranges the features to be classified. This layer introduces a local neighbor-preserving regularization operation that reduces the feature difference between adjacent samples, thereby mitigating the loss of classification accuracy caused by the shortage of labeled samples.
The specific algorithm is as follows:
s_{ij} = \exp\!\big( -\| g^{(i)} - g^{(j)} \|^2 / \sigma^2 \big)

R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2

h_f^{(i)} = \mathrm{sig}\big( W_f v_f^{(i)} + b_f \big)

where s_{ij} is the similarity weight between the ith and jth training samples, computed from the distance between their coordinates g^{(i)} and g^{(j)} (σ is a width parameter), and h_f^{(i)} and h_f^{(j)} respectively represent the features extracted for the ith and jth training samples.
When training samples i and j are adjacent, s_{ij} is larger, the difference between W_f v_f^{(i)} and W_f v_f^{(j)} becomes smaller, and the features h_f^{(i)} and h_f^{(j)} are likewise close.
The regularization term in the above equation is expressed in matrix form as follows:
R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2 = \mathrm{tr}\big( W_f V_f P V_f^{T} W_f^{T} \big)

where V_f = [v_f^{(1)}, ..., v_f^{(L)}, ..., v_f^{(N)}] is the matrix formed by the input vectors of the fully connected layer, D is the diagonal matrix with D_{ii} = \sum_j s_{ij}, and P = D - S.
Taking the derivative of the regularization term gives:

\frac{\partial R(W_f)}{\partial W_f} = 2 \, W_f V_f P V_f^{T}
thus, the regularization term may be reduced using a standard gradient descent method.
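The following numpy sketch illustrates the regularization term as reconstructed above: it builds the similarity weights from sample coordinates (the heat-kernel form and the value of σ are assumptions), checks numerically that the pairwise sum equals the trace form tr(W_f V_f P V_f^T W_f^T), and evaluates the gradient 2 W_f V_f P V_f^T used in gradient descent.

```python
import numpy as np

rng = np.random.default_rng(1)
N, d_in, d_out = 6, 10, 4            # training samples, FC input dim, FC output dim
G = rng.random((N, 2)) * 10          # spatial coordinates g^(i) of the samples
Vf = rng.standard_normal((d_in, N))  # columns are the FC-layer input vectors v_f^(i)
Wf = rng.standard_normal((d_out, d_in)) * 0.1
sigma = 2.0

# Similarity weights s_ij from the coordinate distances (heat kernel, assumed form).
D2 = ((G[:, None, :] - G[None, :, :]) ** 2).sum(-1)
S = np.exp(-D2 / sigma ** 2)
np.fill_diagonal(S, 0.0)

# Pairwise form: R(Wf) = 1/2 * sum_ij s_ij ||Wf v_i - Wf v_j||^2
Y = Wf @ Vf                                       # projected features, one column per sample
diff = Y[:, :, None] - Y[:, None, :]              # (d_out, N, N)
R_pairwise = 0.5 * np.sum(S * (diff ** 2).sum(0))

# Matrix form: R(Wf) = tr(Wf Vf P Vf^T Wf^T), with D_ii = sum_j s_ij and P = D - S
Dm = np.diag(S.sum(axis=1))
P = Dm - S
R_matrix = np.trace(Wf @ Vf @ P @ Vf.T @ Wf.T)
print(np.isclose(R_pairwise, R_matrix))           # True: the two forms agree

# Gradient used in back-propagation: dR/dWf = 2 Wf Vf P Vf^T
grad_Wf = 2 * Wf @ Vf @ P @ Vf.T
print(grad_Wf.shape)                              # (4, 10)
```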
3. A classification layer
A sample to be tested is input, and the network outputs the predicted label type of each sample, from which the classification accuracy is obtained.
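A trivial sketch of this final step: a softmax classifier (an assumption; the patent only specifies a classification layer) predicts a label type for each test sample, and the classification accuracy is the fraction of correct predictions.

```python
import numpy as np

def softmax(logits):
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def classify_and_score(fc_features, W_cls, b_cls, true_labels):
    """Predict a label type for each test sample and report the classification accuracy."""
    probs = softmax(fc_features @ W_cls + b_cls)
    predicted = probs.argmax(axis=1)
    accuracy = (predicted == true_labels).mean()
    return predicted, accuracy

# Toy usage: 5 test samples, 3 classes, 8-dimensional fully connected features.
rng = np.random.default_rng(2)
feats = rng.standard_normal((5, 8))
W_cls, b_cls = rng.standard_normal((8, 3)), np.zeros(3)
preds, acc = classify_and_score(feats, W_cls, b_cls, np.array([0, 1, 2, 1, 0]))
print(preds, acc)
```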
As shown in FIG. 2, assume that A_1, A_2 and B_1, B_2 are associated with two different labels; because the correlation of adjacent samples is taken into account, the output features extracted for classification also preserve this correlation.
Finally, it is noted that the above-mentioned preferred embodiments illustrate rather than limit the invention, and that, although the invention has been described in detail with reference to the above-mentioned preferred embodiments, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the scope of the invention as defined by the appended claims.

Claims (4)

1. A semi-supervised small sample deep learning image pattern classification and identification method, characterized by comprising the following steps:
S1: preprocessing the image samples;
S2: inputting the preprocessed data into a trained network, which extracts features through 3D convolutional layers to obtain feature maps;
S3: following each convolutional layer with a pooling layer that reduces the size of the feature maps and thereby the number of parameters in the network;
S4: connecting the features extracted by the stacked convolutional and pooling layers to a fully connected layer, which extracts and rearranges the features to be classified; the fully connected layer introduces a local neighbor-preserving regularization operation that reduces the feature difference between adjacent samples and thereby mitigates the loss of classification accuracy caused by the shortage of labeled samples;
S5: inputting the samples to be tested to obtain the classification accuracy;
wherein S1 specifically comprises: selecting a number A of target pixels and their corresponding labels as training samples, and forming, for each target pixel, a neighborhood matrix from the A pixels around it (including the target pixel itself); the labels and neighborhood matrices of all target pixels are used as the input data of the 3D convolutional neural network (3D-CNN);
the specific algorithm of S2 is as follows:
v_{ij}^{xyz} = g\Big( b_{ij} + \sum_{m} \sum_{p=0}^{P_i-1} \sum_{q=0}^{Q_i-1} \sum_{r=0}^{R_i-1} w_{ijm}^{pqr} \, v_{(i-1)m}^{(x+p)(y+q)(z+r)} \Big)

wherein v_{ij}^{xyz} represents the output value of the neuron at position (x, y, z) on the jth feature map of the ith layer, m represents the index of the feature maps of layer i-1 connected to the jth feature map, P_i, Q_i and R_i respectively represent the height, width and depth of the ith-layer convolution kernel, w_{ijm}^{pqr} is the kernel value at position (p, q, r) connected to the mth feature map, b_{ij} is the bias of the jth feature map, and g(·) is the activation function.
2. The semi-supervised small-sample deep learning image pattern classification and identification method as claimed in claim 1, wherein: the specific algorithm of S3 is as follows:
\alpha_{i,j,m} = \max_{(x,y,z) \in u(n \times n \times n)} v_{ij}^{xyz}

wherein u(n × n × n) denotes a three-dimensional window acting on the convolutional-layer output features, and α_{i,j,m} represents the maximum of the feature points within that window.
3. The semi-supervised small-sample deep learning image pattern classification and identification method as claimed in claim 1, wherein: the specific algorithm of S4 is as follows:
s_{ij} = \exp\!\big( -\| g^{(i)} - g^{(j)} \|^2 / \sigma^2 \big)

R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2

h_f^{(i)} = \mathrm{sig}\big( W_f v_f^{(i)} + b_f \big)

wherein s_{ij} is the similarity weight between the ith and jth training samples, computed from the distance between their coordinates g^{(i)} and g^{(j)}, h_f^{(i)} represents the extracted features of the ith training sample, and R(W_f) represents the regularization term; when training samples i and j are adjacent, s_{ij} is larger and the difference between W_f v_f^{(i)} and W_f v_f^{(j)} is smaller.
4. The semi-supervised small-sample deep learning image pattern classification and identification method as claimed in claim 3, wherein: the R(W_f) satisfies:
R(W_f) = \frac{1}{2} \sum_{i,j} s_{ij} \, \| W_f v_f^{(i)} - W_f v_f^{(j)} \|^2 = \mathrm{tr}\big( W_f V_f P V_f^{T} W_f^{T} \big)

wherein V_f = [v_f^{(1)}, ..., v_f^{(L)}, ..., v_f^{(N)}] is the matrix formed by the input vectors of the fully connected layer, D is the diagonal matrix with D_{ii} = \sum_j s_{ij}, and P = D - S;
the derivative of the regularization term is:

\frac{\partial R(W_f)}{\partial W_f} = 2 \, W_f V_f P V_f^{T}
CN201710647312.5A 2017-08-01 2017-08-01 Semi-supervised small sample deep learning image mode classification and identification method Active CN107451565B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710647312.5A CN107451565B (en) 2017-08-01 2017-08-01 Semi-supervised small sample deep learning image mode classification and identification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710647312.5A CN107451565B (en) 2017-08-01 2017-08-01 Semi-supervised small sample deep learning image mode classification and identification method

Publications (2)

Publication Number Publication Date
CN107451565A CN107451565A (en) 2017-12-08
CN107451565B true CN107451565B (en) 2020-12-11

Family

ID=60490637

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710647312.5A Active CN107451565B (en) 2017-08-01 2017-08-01 Semi-supervised small sample deep learning image mode classification and identification method

Country Status (1)

Country Link
CN (1) CN107451565B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108009284A (en) * 2017-12-22 2018-05-08 重庆邮电大学 Using the Law Text sorting technique of semi-supervised convolutional neural networks
CN108171200B (en) * 2018-01-12 2022-05-17 西安电子科技大学 SAR image classification method based on SAR image statistical distribution and DBN
CN108537119B (en) * 2018-03-06 2020-07-10 北京大学 Small sample video identification method
CN109657697B (en) * 2018-11-16 2023-01-06 中山大学 Classification optimization method based on semi-supervised learning and fine-grained feature learning
CN109685135B (en) * 2018-12-21 2022-03-25 电子科技大学 Few-sample image classification method based on improved metric learning
CN110245714B (en) * 2019-06-20 2021-08-20 厦门美图之家科技有限公司 Image recognition method and device and electronic equipment
CN111024912A (en) * 2019-12-27 2020-04-17 重庆国环绿源科技有限公司 Ship wastewater pretreatment type detection device
CN111259366B (en) * 2020-01-22 2021-06-18 支付宝(杭州)信息技术有限公司 Verification code recognizer training method and device based on self-supervision learning
CN111353583B (en) * 2020-02-20 2023-04-07 南京工程学院 Deep learning network based on group convolution characteristic topological space and training method thereof
CN111639714B (en) * 2020-06-01 2021-07-23 贝壳找房(北京)科技有限公司 Method, device and equipment for determining attributes of users

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104809426B (en) * 2014-01-27 2019-04-05 日本电气株式会社 Training method, target identification method and the device of convolutional neural networks
CN106204587B (en) * 2016-05-27 2019-01-08 浙江德尚韵兴图像科技有限公司 Multiple organ dividing method based on depth convolutional neural networks and region-competitive model

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102346847A (en) * 2011-09-26 2012-02-08 青岛海信网络科技股份有限公司 License plate character recognizing method of support vector machine
CN106778494A (en) * 2016-11-21 2017-05-31 河海大学 A kind of target in hyperspectral remotely sensed image feature extracting method based on SIFT LPP

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Locality Preserving Projections; Xiaofei He, Partha Niyogi; Advances in Neural Information Processing Systems 16; 2005-12-31; Section 3.1 *
Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition; Li, Shan, W. Deng, and J. P. Du; 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE; 2017-07-01; Section 2.2 paragraph 2, Section 2.3 paragraph 1, Section 4 paragraphs 2-3, Section 6 paragraph 2, Table 2 *
Wanjiang Xu; Can Luo; Aiming Ji. Coupled locality preserving projections for cross-view gait recognition. Neurocomputing. 2017. *
基于QR分解的正则化邻域保持嵌入算法 (Regularized neighborhood preserving embedding algorithm based on QR decomposition); 翟冬灵, 王正群, 徐春林; 《计算机应用》 (Journal of Computer Applications); 2016-06-10; Vol. 36, No. 6; Section 1.2 paragraph 1 *

Also Published As

Publication number Publication date
CN107451565A (en) 2017-12-08

Similar Documents

Publication Publication Date Title
CN107451565B (en) Semi-supervised small sample deep learning image mode classification and identification method
CN110443143B (en) Multi-branch convolutional neural network fused remote sensing image scene classification method
CN108182441B (en) Parallel multichannel convolutional neural network, construction method and image feature extraction method
CN106023065B (en) A kind of tensor type high spectrum image spectral-spatial dimension reduction method based on depth convolutional neural networks
CN109002755B (en) Age estimation model construction method and estimation method based on face image
CN110097060B (en) Open set identification method for trunk image
CN105160310A (en) 3D (three-dimensional) convolutional neural network based human body behavior recognition method
CN107767416B (en) Method for identifying pedestrian orientation in low-resolution image
Liu et al. Remote sensing image change detection based on information transmission and attention mechanism
CN108985252B (en) Improved image classification method of pulse depth neural network
CN109241995B (en) Image identification method based on improved ArcFace loss function
CN112308152B (en) Hyperspectral image ground object classification method based on spectrum segmentation and homogeneous region detection
CN104700100A (en) Feature extraction method for high spatial resolution remote sensing big data
CN111898621A (en) Outline shape recognition method
CN111178177A (en) Cucumber disease identification method based on convolutional neural network
CN113011386B (en) Expression recognition method and system based on equally divided characteristic graphs
CN112101467A (en) Hyperspectral image classification method based on deep learning
Su et al. LodgeNet: Improved rice lodging recognition using semantic segmentation of UAV high-resolution remote sensing images
CN109472733A (en) Image latent writing analysis method based on convolutional neural networks
CN111160389A (en) Lithology identification method based on fusion of VGG
CN107766810B (en) Cloud and shadow detection method
CN111274905A (en) AlexNet and SVM combined satellite remote sensing image land use change detection method
Barman et al. Smartphone assist deep neural network to detect the citrus diseases in agri-informatics
CN109886281A (en) One kind is transfinited learning machine color image recognition method based on quaternary number
CN109635726B (en) Landslide identification method based on combination of symmetric deep network and multi-scale pooling

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant