CN110443318B

CN110443318B - Deep neural network method based on principal component analysis and cluster analysis

Info

Publication number: CN110443318B
Application number: CN201910734831.4A
Authority: CN
Inventors: 金勇�
Original assignee: Wuhan Firehome Putian Information Technology Co ltd
Current assignee: Wuhan Firehome Putian Information Technology Co ltd
Priority date: 2019-08-09
Filing date: 2019-08-09
Publication date: 2023-12-08
Anticipated expiration: 2039-08-09
Also published as: CN110443318A

Abstract

The invention relates to a deep neural network method based on principal component analysis and cluster analysis, which specifically comprises the following steps: s1: dividing the label data into training data and test data, wherein the training data is used for training and learning of a model; s2: performing feature dimension reduction on all initial data features of training data by using PCA, and extracting new main components; s3: K-Means cluster analysis is carried out on all training samples according to the principal components extracted by PCA; s4: and taking an upper layer data result as input, and combining the label obtained by clustering to form a single-layer neural network for training to obtain the network weight parameter. According to the deep neural network method based on principal component analysis and cluster analysis, the statistical feature learning method is combined with the neural network, the training mode of the traditional multi-layer neural network is optimized in the training process, and a better test effect is obtained for learning of a common deep neural network.

Description

Deep neural network method based on principal component analysis and cluster analysis

Technical Field

The invention relates to the technical field of machine learning and artificial intelligence, in particular to a deep neural network method based on principal component analysis and cluster analysis.

Background

In recent years, artificial intelligence technology has attracted attention in both industry and academia, and machine learning methods play a central role in the field of artificial intelligence, and have been rapidly developed in many fields such as biometric sequences, natural language processing, computer vision, image recognition, financial market analysis, and the like. Among them, deep learning is remarkable in many fields. The deep belief network (Deep Belief Network, DBN) is used as a classical deep learning method, and has higher research significance in the aspects of feature extraction and classification learning.

The main idea of the Deep Belief Network (DBN) is divided into two stages of learning, wherein the first stage is to extract features by using an unsupervised pre-training (RBM) limited boltzmann machine (Restricted Boltzmann Machine), and the second stage is to combine the features obtained in the first stage and the labels corresponding to the data, and fine-tune model parameters by adopting a traditional Back propagation algorithm (BP). The first stage is similar to a feature abstraction process, for which the Cluster analysis and principal component analysis methods are both statistical-based feature extraction methods, and on the other hand, the Cluster analysis has a relatively low clustering efficiency for a large number of input features, so that in summary, the development of a method for embedding principal component analysis (Principle Component Analysis, PCA) and Cluster analysis (Cluster analysis) into a deep neural network model to perform unsupervised pre-training is urgent, and then the same BP algorithm is performed in the second stage to improve the training learning effect of the deep neural network.

Disclosure of Invention

In order to overcome the defects in the prior art, the invention provides a deep neural network method based on principal component analysis and cluster analysis, which solves the problem of poor learning effect of the conventional common deep neural network.

The invention is realized by the following technical scheme:

a deep neural network method based on principal component analysis and cluster analysis specifically comprises the following steps:

s1: dividing an image set: dividing the label data into training data and test data, wherein the training data is used for training and learning of the model, and the test set is used for testing the comprehensive effect of the model;

s2: PCA feature dimension reduction: performing feature dimension reduction on all initial data features of training data by using PCA, and extracting new main components;

s3: K-Means cluster analysis: K-Means cluster analysis is carried out on all training samples according to the principal components extracted by PCA, the number of clusters is the number of hidden layers of the network structure, and the category of the clustering result is used as a hidden label corresponding to each training sample;

s4: taking the upper layer data result as input, and combining the label obtained by clustering to form a single-layer neural network for training to obtain a network weight parameter;

s5: the single-layer neural network weight obtained in the step S4 is used as the initial weight of the whole neural network, and then the weight is optimized by using a back propagation algorithm;

s6: the training model parameters described above are used for application prediction of the model.

Further, the PCA feature dimension reduction in S2 is to substantially reduce the initial feature dimension on the premise of retaining most of the information of the initial input.

Compared with the prior art, the invention has the beneficial effects that:

according to the deep neural network method based on principal component analysis and cluster analysis, the statistical characteristic learning method and the neural network are combined, the training mode of the traditional multi-layer neural network is optimized in the training process, a better test effect is obtained for learning of a common deep neural network, and meanwhile, the positive effect is achieved for training and learning of the deep neural network.

Drawings

Fig. 1 is a flowchart of an implementation of a deep neural network method based on principal component analysis and cluster analysis according to an embodiment of the present invention.

Detailed Description

The following examples are presented to specifically illustrate certain embodiments of the invention and should not be construed as limiting the scope of the invention. Modifications to the disclosure of the invention may be made in the materials, methods, and reaction conditions, all of which are intended to fall within the spirit and scope of the invention.

As shown in fig. 1, a deep neural network method based on principal component analysis and cluster analysis specifically includes the following steps:

s2: PCA feature dimension reduction: performing feature dimension reduction on all initial data features of training data by using PCA, and extracting new main components; for example, 784 dimension, wherein more than 95% of the principal component information is reserved through PCA analysis, and 60 dimension characteristics are obtained and used as variables of the next clustering analysis;

s3: K-Means cluster analysis: K-Means cluster analysis is carried out on all training samples according to principal components extracted by PCA, the number of clusters is the number of hidden layers of a network structure, the number of hidden layers is greater than or equal to 1, and the category of a clustering result is used as a hidden label corresponding to each training sample; for example, the network structure is 784-500-300-300-10, and the number of results of the first cluster is 500;

The experimental procedure used four data sets (image classification) for training tests, the profile of which is given in table 1 below:

table 1 training data basic information

The experimental effect of the deep neural network method (PCA-KMeas-NN) based on principal component analysis and cluster analysis is better than that of a common deep neural network (DNN, which is trained only by BP algorithm), the error rate on the corresponding test data set is lower (the error rate is shown in the following table 2), and the PCA-KMeas-NN is better than DNN in terms of the error rate performance of four data sets in the test analysis.

TABLE 2 test results (test set error Rate)

Data set	PCA-KMeans-NN	DNN
			MNIST	1.70％	1.98％
smallMNIST	3.50％	4.10％
			USPS	2.46％	2.85％
COIL20	0.68％	1.56％

Therefore, the method of the invention realizes the combined application of the statistical feature learning method and the neural network, optimizes the training mode of the traditional multi-layer neural network from the training process, and obtains better test effect aiming at the learning of the common deep neural network. The method provides a new learning method in the aspect of training the pre-training characteristics of the deep neural network, and has positive effects on training and learning of the deep neural network.

The foregoing description is only illustrative of the present invention and is not intended to limit the scope of the invention, and all equivalent structures or equivalent processes or direct or indirect application in other related technical fields are included in the scope of the present invention.

Claims

1. The deep neural network method based on principal component analysis and cluster analysis is characterized by comprising the following steps of:

2. The method of deep neural network based on principal component analysis and cluster analysis according to claim 1, wherein the PCA feature dimension reduction in S2 is a substantial reduction of the initial feature dimension while retaining most of the initial input information.