CN111916204A - Brain disease data evaluation method based on self-adaptive sparse deep neural network - Google Patents
- Publication number
- CN111916204A (application number CN202010653663.9A)
- Authority
- CN
- China
- Prior art keywords
- data
- neural network
- sparse
- deep neural
- algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Abstract
The invention discloses a brain disease data evaluation method based on an adaptive sparse deep neural network, comprising the following steps: 1) collecting documented brain disease data; 2) summarizing each individual together with its corresponding data features and their change values into one unit of data, then constructing a data matrix from the unit data of all individuals; 3) dividing the data matrix obtained in step 2) into a training set and a test set; 4) establishing a deep neural network model based on a sparse enhancement BP algorithm; 5) training the model established in step 4) with the training and test sets obtained in step 3) to obtain a trained deep neural network model based on the sparse enhancement BP algorithm; 6) evaluating the brain disease data to be evaluated with the trained model. The method can effectively solve the overfitting problem that existing deep learning models face when processing high-dimensional small-sample data.
Description
Technical Field
The invention belongs to the field of data processing, and relates to a brain disease data evaluation method based on a self-adaptive sparse deep neural network.
Background
High-dimensional small-sample data are common in the biomedical field, e.g. genomic data, medical image data and protein data. Such data have a small sample size but a huge number of sample features, which poses challenges for data processing and analysis. When the ratio of sample size to sample features is small, classical machine learning algorithms tend to fail, because high-dimensional data may contain irrelevant and redundant features. Deep learning has proved to be one of the strongest tools in big data analysis, but traditional deep learning algorithms remain of limited use in bioinformatics, mainly because when the sample size is far smaller than the number of features, models often fall into overfitting, making data processing inaccurate. It is therefore necessary to find a deep network model with strong self-sparsifying learning capability for such high-dimensional small-sample data, so as to solve the existing data processing problems.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a brain disease data evaluation method based on an adaptive sparse deep neural network, which can effectively solve the overfitting problem of existing deep learning models in processing high-dimensional small-sample data.
In order to achieve the above purpose, the brain disease data evaluation method based on the adaptive sparse deep neural network of the present invention comprises the following steps:
1) collecting documented brain disease data;
2) in the acquired brain disease data, summarizing each individual together with its corresponding data features and their change values into one unit of data, and then constructing a data matrix from the unit data of all individuals;
3) dividing the data matrix obtained in the step 2) into a training set and a test set;
4) establishing a deep neural network model based on a sparse enhancement BP algorithm;
5) training the deep neural network model based on the sparse enhancement BP algorithm established in the step 4) by using the training set and the test set obtained in the step 3) to obtain a trained deep neural network model based on the sparse enhancement BP algorithm;
6) evaluating the brain disease data to be evaluated with the trained deep neural network model based on the sparse enhancement BP algorithm.
The deep neural network model based on the sparse enhancement BP algorithm comprises an input layer, five hidden layers and an output layer, wherein the five hidden layers are formed by stacking five restricted Boltzmann machines.
Sparse penalty terms in log-sum form are added, after the loss function of the sparse enhancement BP algorithm, to both the responses and the connections of the hidden-layer neurons.
The objective function of the sparse enhancement BP algorithm is as follows:

J_LS(W) = J(W) + τ1 · Σ_i log(1 + |h_i| / ε1) + τ2 · Σ_{i,j} log(1 + |W_ij| / ε2)

wherein J(W) is the objective function of the conventional BP algorithm, h_i are the responses of the hidden-layer neurons, W_ij are the connection weights, τ1 and τ2 respectively represent the penalty coefficients of the log-sum response penalty term and the log-sum connection penalty term, and ε1 and ε2 are given constants.
The invention has the following beneficial effects:
the brain disease data evaluation method based on the self-adaptive sparse deep neural network comprises the steps that during specific operation, brain disease data is evaluated based on a deep neural network model of a sparse enhancement BP algorithm, wherein a hidden layer in the network model is formed by stacking five limited Boltzmann machines, meanwhile, a target function of the sparse enhancement BP algorithm is modified, and sparse penalty terms are added to response and connection of neurons of the hidden layer after a loss function of the sparse enhancement BP algorithm, so that the complexity of the model is reduced, and the problem of overfitting is solved.
Drawings
FIG. 1 is a diagram of an adaptive sparse deep neural network model architecture;
FIG. 2 is a diagram of the embedded restricted Boltzmann machine (RBM) model;
FIG. 3 is a flow chart of the application of the entire MRI data;
FIG. 4 is a graph of the results of comparing the two networks with the best sparsity to LSES-DNN;
FIG. 5 is a graph showing the response values of each of LSES-DNN, DP-DNN, DPC-DNN, Pr-DNN and SVD-DNN;
FIG. 6 is a graph of the classification errors of 5 deep learning methods on the training and test sets.
Detailed Description
The invention is described in further detail below with reference to the accompanying drawings:
Referring to FIG. 1, the brain disease data evaluation method based on the adaptive sparse deep neural network according to the present invention comprises the following steps:
1) collecting the documented brain disease data (high-dimensional, small-sample) and storing it in a computer;
2) in the acquired brain disease data, summarizing each individual together with its corresponding data features and their change values into one unit of data, and then constructing a data matrix from the unit data of all individuals, the data matrix having sample size N and sample features p; such data usually have high-dimensional small-sample characteristics, i.e. N ≪ p;
3) dividing the data matrix obtained in step 2) into a training set and a test set, and then loading the training and test sets into the memory used by the deep neural network model based on the sparse enhancement BP algorithm;
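As a concrete illustration of steps 2)–3), the sketch below builds a data matrix and randomly splits it into training and test sets. This is only a minimal sketch: the 70/30 ratio follows the comparative experiment later in this document, while the function name, random seed and use of NumPy are illustrative assumptions, not part of the patent.

```python
import numpy as np

def split_data(X, y, train_ratio=0.7, seed=0):
    """Randomly partition a data matrix X (N x p) and labels y into
    training and test sets. The 70/30 ratio mirrors the split used
    in the comparative experiment; ratio and seed are illustrative."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    idx = rng.permutation(n)          # shuffle sample indices
    cut = int(n * train_ratio)        # boundary between train and test
    train, test = idx[:cut], idx[cut:]
    return X[train], y[train], X[test], y[test]
```

For an N ≪ p data matrix the same call works unchanged; only the shapes differ.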
the traditional deep learning algorithm comprises a Pre-training process (Pre-training) and a Fine-tuning process (Fine-tuning), the sparse enhancement BP algorithm provided by the invention is mainly realized in the Fine-tuning process, the data are optimized and calculated through a deep neural network model based on the sparse enhancement BP algorithm, and the specific optimization process is as follows:
a) Setting up the basic deep learning framework: for the data in the training set, a deep neural network model comprising an input layer, five hidden layers and an output layer is established according to the data features. The number of input-layer nodes equals the number of sample features p of the training set; the number of output-layer nodes equals the number of label classes per sample (uniformly denoted label below); and each hidden layer comprises a number of nodes in mapping correspondence with the input values of the previous layer.
b) Establishing a data model for each node of each network layer using a mathematical equation, and presetting the relevant parameter values in the equation.
In the deep network, 5 restricted Boltzmann machines (RBM) are stacked, i.e. the number of hidden layers is 5, with 2000, 1000, 800, 500 and 200 neurons respectively. The number of input-layer nodes is the training-set sample feature count, denoted p; the number of output-layer neurons is the feature dimension of the medical diagnosis data, denoted M_n. With this setting, the deep neural network model comprises 7 neural layers: an input layer, 5 hidden layers and an output layer.
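A minimal sketch of the layer geometry just described (input p → 2000 → 1000 → 800 → 500 → 200 → M_n). The patent obtains the hidden-layer weights by pre-training the stacked RBMs; the random Gaussian initialization here is only a stand-in so the shapes can be checked, and all names are illustrative assumptions.

```python
import numpy as np

def init_dnn(p, m_out, hidden=(2000, 1000, 800, 500, 200), seed=0):
    """Allocate weights/biases for the 7-layer network described above:
    input (p) -> five hidden layers -> output (m_out).
    In the patent the hidden weights would come from stacked-RBM
    pre-training; random initialization here is a placeholder."""
    rng = np.random.default_rng(seed)
    sizes = (p,) + tuple(hidden) + (m_out,)
    params = []
    for a, b in zip(sizes[:-1], sizes[1:]):
        params.append({"W": rng.normal(0.0, 0.01, size=(a, b)),
                       "b": np.zeros(b)})
    return params
```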
c) In the deep neural network model, a Gaussian–Bernoulli RBM (GBRBM) is used as the first-layer RBM and Bernoulli–Bernoulli RBMs (BBRBM) as the other layers. For the GBRBM and BBRBM respectively, the learning rates of the network are 0.001 and 0.1, the penalty parameters λ1 are 0.03 and 0.0002, and the penalty coefficients of the re-weighted-L1 norm are 0.001, 0.001 and 0.1 respectively;
d) Initializing the parameter values A_i of each network layer, including the connection weights W_i and the corresponding biases b_i; learning the network structure from the training-set data at the input layer; comparing the output value of each node in the output layer with the corresponding medical diagnosis data label; and repeatedly correcting the parameter values of each neural layer in turn, until the parameter values are obtained for which the output values of the output-layer nodes have the highest similarity to the brain disease data features.
e) Inputting the test-set data into the trained deep neural network model, judging whether the output-layer result equals the original label of the test data, and computing the prediction accuracy of the deep neural network model on brain disease data from the number of test samples it misclassifies.
It should be noted that the mathematical equation is parametric; it may be a linear model or a neuron model, such as a sigmoid activation function or a convolution operation model. The mathematical model is set in the following manner:
y = g(X) = f_n(f_{n-1}(f_{n-2}( ... f_1(X) ... )))
wherein y is the medical-diagnosis-data feature in the output layer, with dimension M_n; X is the training-set data, with feature dimension p; and f_1 to f_n are the operation equations set for each layer. For example, from the input layer to the first hidden layer, training data of dimension p are converted into an output of dimension 2000, and so on according to the number of neurons in each layer. Each layer model f_i has a matching parameter set A_i, comprising the connection weights W_i and biases b_i of that layer.
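The composition y = g(X) = f_n(...f_1(X)...) can be sketched as a loop over per-layer affine maps followed by an activation. The sigmoid activation is an assumption consistent with the sigmoid example mentioned above, not something the patent fixes; names are illustrative.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, params):
    """Compose the per-layer maps f_1 ... f_n from the equation above:
    each f_i here is an affine transform followed by a sigmoid
    (the activation choice is illustrative; the patent only requires
    a parametric per-layer model with parameters A_i = (W_i, b_i))."""
    h = X
    for layer in params:
        h = sigmoid(h @ layer["W"] + layer["b"])
    return h
```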
To overcome the overfitting problem, constraints are added in the fine-tuning stage, after the loss function of the original BP algorithm, to the responses and connections of the hidden neurons so as to reduce the complexity of the model, since the log-sum function gives the best approximation to the common sparse norm L0. Sparse penalty terms in log-sum form are therefore added to the loss function for both responses and connections, so that the neurons and network connections in the network model become sparse. The specific objective function of the BP algorithm is:
J_LS(W) = J(W) + τ1 · Σ_i log(1 + |h_i| / ε1) + τ2 · Σ_{i,j} log(1 + |W_ij| / ε2)

wherein J(W) is the objective function of the conventional BP algorithm, h_i are the responses of the hidden-layer neurons, W_ij are the connection weights, τ1 and τ2 respectively represent the penalty coefficients of the log-sum response penalty term and the log-sum connection penalty term, and ε1 and ε2 are given constants.
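A small numerical sketch of the log-sum penalties, reconstructed from the verbal description above (penalty coefficients τ1/τ2 and given constants ε1/ε2 applied to hidden responses and connection weights). The function names and exact summation layout are assumptions, not the patent's own notation.

```python
import numpy as np

def log_sum_penalty(v, eps):
    """Log-sum sparsity term: sum_i log(1 + |v_i| / eps).
    As eps shrinks this tracks the L0 count of non-zeros up to scale,
    which is why the text calls log-sum the best L0 surrogate."""
    v = np.asarray(v, dtype=float)
    return np.sum(np.log(1.0 + np.abs(v) / eps))

def sparse_objective(j_w, responses, weights, tau1, tau2, eps1, eps2):
    """Sparsity-enhanced objective as described verbally in the text:
    the conventional BP objective J(W) plus log-sum penalties on the
    hidden-neuron responses and on the connection weights."""
    return (j_w
            + tau1 * log_sum_penalty(responses, eps1)
            + tau2 * log_sum_penalty(weights, eps2))
```

With zero responses and zero weights both penalties vanish and the objective reduces to J(W), as expected for a fully sparse network.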
The architecture diagram of the adaptive sparse deep neural network model is shown in fig. 1, and the embedded restricted boltzmann RBM model is shown in fig. 2.
Comparative experiment
As an important medical imaging modality, MRI has been widely used for early detection, diagnosis and treatment of disease. Such biomedical data generally have high-dimensional small-sample characteristics, i.e. a large number of features and a low sample count, and an overfitting problem may arise when a deep learning method is applied to such data to identify corresponding brain-structure and brain-function abnormalities. To verify that the method can effectively avoid overfitting and improve identification capability, we use the Philadelphia Neurodevelopmental Cohort (PNC) data, from a large-scale collaborative study of brain behaviour by the University of Pennsylvania and the Children's Hospital of Philadelphia. The data comprise fMRI data of 878 adolescents aged 8-22, pre-processed with the standard SPM12 pipeline (http://www.fil.ion.ucl.ac.uk/spm/).
The present invention verifies, using the adaptive sparse deep network, that brain development differences are a function of age, and selects a subset of the complete data set according to age (in months): subjects above 216 months belong to the first class and subjects below 144 months to the second class. The Pearson correlation coefficients between the regional average blood-oxygen-level-dependent (BOLD) signals of 264 regions of interest (ROIs) of the brain are first computed. The dimension of the input data is therefore 34716 and the total sample size is 397. The whole MRI data application flow is shown in FIG. 3, and the performance of the network is verified in terms of both network sparsity and classification accuracy:
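The feature construction just described (pairwise Pearson correlation of 264 ROI-averaged BOLD time series, keeping only the upper triangle) can be sketched as follows; note that 264·263/2 = 34716 matches the stated input dimension. The function name and array layout are assumptions.

```python
import numpy as np

def fc_features(bold):
    """Build a functional-connectivity feature vector from ROI-averaged
    BOLD time series (shape: timepoints x n_rois): Pearson correlation
    between every ROI pair, strictly upper triangle. For 264 ROIs this
    yields 264*263/2 = 34716 features, matching the text."""
    corr = np.corrcoef(bold, rowvar=False)   # n_rois x n_rois matrix
    iu = np.triu_indices_from(corr, k=1)     # strictly upper triangle
    return corr[iu]
```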
sparsity of network connections
The sparsity of the common deep networks DNN, sDNN, DP-DNN, DPC-DNN, Pr-DNN and SVD-DNN is compared with that of the invention (LSES-DNN), and the two networks with the best sparsity are then further compared against LSES-DNN; the specific results are shown in FIG. 4 and Table 1. FIG. 4 shows the per-layer non-zero ratio of the connection weights for the three deep networks LSES-DNN, DP-DNN and DPC-DNN; the length of each petal represents the non-zero ratio of the corresponding network at the current layer, and it is evident that LSES-DNN is sparser than the other deep networks.
TABLE 1
Response sparsity of hidden neurons
FIG. 5 shows the differences in response values among LSES-DNN, DP-DNN, DPC-DNN, Pr-DNN and SVD-DNN, where the horizontal axis represents the activation probability of a given layer and the vertical axis the proportion of activation probabilities falling in different intervals. The hidden-layer neuron activation ratio of LSES-DNN is significantly smaller than that of the other deep networks.
Accuracy of classification
The invention compares LSES-DNN against 17 conventional and commonly used classifiers. For each classifier, 70% of the original data is used as the training set and the remaining 30% as the test set, and the whole process is repeated 30 times; all deep-network classifiers are trained with the same network parameters as LSES-DNN. The average classification accuracy on the test set is shown in Table 2. In addition, FIG. 6 shows the classification errors of 5 deep learning methods on the training and test sets. For the training data, the error of LSES-DNN decreases rapidly and converges around epoch 50, where the error is 1.8%; the error curves of the other methods also converge, but take longer. For the test data, the error of LSES-DNN decreases to 16.33% around epoch 30 and finally to 12.93% at epoch 75. The errors of sDNN and DNN are both 51.52%; their error curves converge to 32.58% at epoch 30, and to 17.57% and 18.01% at epoch 90. LSES-DNN thus performs best on the test data, with the fastest convergence and the best age-identification performance.
TABLE 2
Through the comparisons of network-connection sparsity, hidden-neuron response sparsity and classification accuracy, LSES-DNN exhibits both topological sparsity and hidden-neuron response sparsity, and its sparse learning capability is the best among the existing networks. For data with high-dimensional small-sample characteristics, such as MRI data sets for brain disease diagnosis and normal development, the deep neural network LSES-DNN based on the sparse enhancement BP algorithm has good diagnosis and identification performance: it can effectively extract the hierarchical sparse features of the data and learn the true distribution of the brain disease data, laying a foundation for subsequent data-driven diagnosis, treatment and prediction and further improving the level of treatment.
Claims (4)
1. A brain disease data evaluation method based on an adaptive sparse deep neural network is characterized by comprising the following steps:
1) collecting documented brain disease data;
2) in the acquired brain disease data, summarizing each individual together with its corresponding data features and their change values into one unit of data, and then constructing a data matrix from the unit data of all individuals;
3) dividing the data matrix obtained in the step 2) into a training set and a test set;
4) establishing a deep neural network model based on a sparse enhancement BP algorithm;
5) training the deep neural network model based on the sparse enhancement BP algorithm established in the step 4) by using the training set and the test set obtained in the step 3) to obtain a trained deep neural network model based on the sparse enhancement BP algorithm;
6) evaluating the brain disease data to be evaluated with the trained deep neural network model based on the sparse enhancement BP algorithm.
2. The brain disease data evaluation method based on the adaptive sparse deep neural network of claim 1, wherein the deep neural network model based on the sparse enhancement BP algorithm comprises an input layer, five hidden layers and an output layer, wherein the five hidden layers are formed by stacking five restricted Boltzmann machines.
4. The brain disease data evaluation method based on the adaptive sparse deep neural network of claim 1, wherein the objective function of the sparse enhancement BP algorithm is:
J_LS(W) = J(W) + τ1 · Σ_i log(1 + |h_i| / ε1) + τ2 · Σ_{i,j} log(1 + |W_ij| / ε2)

wherein τ1 and τ2 respectively represent the penalty coefficients of the log-sum response penalty term and the log-sum connection penalty term, and ε1 and ε2 are given constants.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010653663.9A CN111916204A (en) | 2020-07-08 | 2020-07-08 | Brain disease data evaluation method based on self-adaptive sparse deep neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111916204A true CN111916204A (en) | 2020-11-10 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160174902A1 (en) * | 2013-10-17 | 2016-06-23 | Siemens Aktiengesellschaft | Method and System for Anatomical Object Detection Using Marginal Space Deep Neural Networks |
US20170213000A1 (en) * | 2016-01-25 | 2017-07-27 | Shenzhen University | Metabolic mass spectrometry screening method for diseases based on deep learning and the system thereof |
CN109192298A (en) * | 2018-07-27 | 2019-01-11 | 南京航空航天大学 | Deep brain medical diagnosis on disease algorithm based on brain network |
CN109214416A (en) * | 2018-07-23 | 2019-01-15 | 华南理工大学 | A kind of multidimensional information fusion Diagnosis Method of Transformer Faults based on deep learning |
Non-Patent Citations (2)
Title |
---|
CHEN QIAO et al., "Log-sum enhanced sparse deep neural network", Neurocomputing * |
TANG Baoping et al., "Planetary gearbox fault diagnosis based on multi-resonance-component fusion CNN", Journal of Vibration, Measurement & Diagnosis * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113450921A (en) * | 2021-06-24 | 2021-09-28 | 西安交通大学 | Brain development data analysis method, system, equipment and storage medium |
CN113505528A (en) * | 2021-06-24 | 2021-10-15 | 西安交通大学 | Method, system, device and storage medium for identifying brain neural development time-varying functional connection difference |
CN113505528B (en) * | 2021-06-24 | 2024-04-05 | 西安交通大学 | Method, system, device and storage medium for identifying brain nerve development time-varying function connection difference |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |