CN117338313A

CN117338313A - Multi-dimensional characteristic electroencephalogram signal identification method based on stacking integration technology

Info

Publication number: CN117338313A
Application number: CN202311196950.1A
Authority: CN
Inventors: 许进; 余紫微; 陈耀
Original assignee: Wuhan Textile University
Current assignee: Wuhan Textile University
Priority date: 2023-09-15
Filing date: 2023-09-15
Publication date: 2024-01-05
Anticipated expiration: 2043-09-15
Also published as: CN117338313B

Abstract

The invention discloses a multi-dimensional characteristic electroencephalogram signal identification method based on a stacking integration technology, which comprises the following steps: 1) acquiring two different electroencephalogram signals and preprocessing the signal data; 2) performing multidimensional feature extraction on the preprocessed electroencephalogram data and constructing a feature matrix to obtain an original feature matrix; 3) performing dimension reduction on the original feature matrix using a principal component analysis algorithm to obtain a final feature matrix; 4) based on the electroencephalogram data preprocessed in step 1) and a stacking integrated learning algorithm, constructing a multi-dimensional characteristic electroencephalogram signal identification model that takes the final feature matrix from step 3) as input; 5) preprocessing the electroencephalogram signal to be identified, extracting its features, and inputting them into the trained model to obtain an identification result. The invention extracts multidimensional features of the electroencephalogram signals and uses a stacking integration technology to improve the accuracy with which the electroencephalogram signals are identified.

Description

Multi-dimensional characteristic electroencephalogram signal identification method based on stacking integration technology

Technical Field

The invention relates to the technical field of electroencephalogram signal processing, in particular to a multi-dimensional characteristic electroencephalogram signal identification method based on a stacking integration technology.

Background

As the pace of society accelerates, people's mental stress is gradually increasing, and mental health has become one of the important problems facing contemporary society. According to data published by the World Health Organization, the number of people with depression worldwide reached 322 million in 2022, and the prevalence of neuropsychiatric disorders such as autism is also rising continuously, which has a serious impact on society.

Most psychological diseases are complicated in etiology, high in treatment difficulty and insufficient in pertinence, so that the detection of the psychological diseases is particularly important. The existing psychological disease detection method mainly depends on subjective judgment of doctors and self-feedback of patients, and has the problems of low diagnosis accuracy, long time consumption, high cost and the like.

With the progress of science and technology and the appearance of deep learning algorithms, a non-invasive electroencephalogram signal recognition technology plays an important role in the detection and treatment of psychological diseases such as depression. However, the existing electroencephalogram signal identification technology has a plurality of problems, such as low identification accuracy, insufficient extraction of characteristics of the electroencephalogram signal, and incapability of effectively processing nonlinearity, high-dimensional characteristics and the like of the electroencephalogram signal, so that identification of psychological diseases such as depression and the like is affected.

Disclosure of Invention

The invention mainly aims to provide a multi-dimensional characteristic electroencephalogram signal identification method based on a stacking integration technology, which is used for extracting multi-dimensional characteristics of electroencephalogram signals and improving the identification degree of the extracted electroencephalogram signals.

The technical scheme adopted by the invention is as follows:

a multi-dimensional characteristic electroencephalogram signal identification method based on a stacking integration technology comprises the following steps:

1) Acquiring two different electroencephalogram signals of a healthy subject and a non-healthy subject so as to facilitate the identification of subsequent electroencephalogram signals, and carrying out data preprocessing on the electroencephalogram signals;

2) Performing multidimensional feature extraction on the preprocessed electroencephalogram data, and constructing a feature matrix to obtain an original feature matrix;

3) Performing dimension reduction processing on the original feature matrix by using a principal component analysis algorithm to obtain a final feature matrix;

4) Based on the electroencephalogram data preprocessed in step 1) and a stacking integrated learning algorithm, constructing a multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology, with the final feature matrix from step 3) as input; training the model to obtain a trained recognition model;

5) Performing data preprocessing and multi-dimensional feature extraction on the electroencephalogram signal to be identified, and inputting the result into the multi-dimensional characteristic electroencephalogram signal identification model trained in step 4) to obtain the identification result.

In a further scheme, the step of preprocessing the data of the electroencephalogram signal in the step 1) mainly comprises denoising and normalization so as to eliminate noise components in the signal, improve the signal quality and obtain the subsequently available electroencephalogram data.

In a further scheme, the step 1) of preprocessing the data of the electroencephalogram signal comprises the following steps:

11) Denoising the original electroencephalogram signal to eliminate noise components in the signal, specifically: first, filtering the signal with a 0.5-40 Hz band-pass filter; second, removing artifacts such as blinks and eye movements from the data by independent component analysis;

12) Normalizing the denoised signal so that its amplitude range is scaled to between 0 and 1, specifically: the signal is preprocessed using Min-Max normalization as follows:

$$z_j' = \frac{z_j - z_{min}}{z_{max} - z_{min}}$$

wherein $z_j$ is the j-th element of sample z, $z_{max}$ is the maximum value in the sample data, and $z_{min}$ is the minimum value in the sample data;
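By way of non-limiting illustration, the band-pass filtering of step 11) and the Min-Max normalization of step 12) might be sketched as follows. The Butterworth filter type, the filter order, the helper names `bandpass` and `min_max`, and the synthetic test signal are assumptions made for the example; the text only specifies the 0.5-40 Hz pass band and the 0 to 1 output range. The independent component analysis step is omitted from the sketch.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt


def bandpass(eeg, fs, low=0.5, high=40.0, order=4):
    """Zero-phase Butterworth band-pass along the last (time) axis.

    The Butterworth design and the order are assumptions; only the
    0.5-40 Hz band comes from the description.
    """
    sos = butter(order, [low, high], btype="bandpass", fs=fs, output="sos")
    return sosfiltfilt(sos, eeg, axis=-1)


def min_max(z):
    """Scale one sample to [0, 1] via (z - z_min) / (z_max - z_min)."""
    z = np.asarray(z, dtype=float)
    z_min, z_max = z.min(), z.max()
    if z_max == z_min:  # constant sample: avoid division by zero
        return np.zeros_like(z)
    return (z - z_min) / (z_max - z_min)


# Synthetic single-channel signal: a 10 Hz rhythm plus 100 Hz interference.
rng = np.random.default_rng(0)
fs = 500  # Hz, assumed sampling rate for the example
t = np.arange(0, 4.0, 1.0 / fs)
raw = (np.sin(2 * np.pi * 10 * t)
       + 0.5 * np.sin(2 * np.pi * 100 * t)
       + 0.1 * rng.standard_normal(t.size))

filtered = bandpass(raw, fs)      # step 11), filtering part only
normalized = min_max(filtered)    # step 12)
```

In this sketch the 100 Hz component lies outside the pass band and is strongly attenuated, while the normalized output spans exactly 0 to 1.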

In step 2), feature extraction is performed on the preprocessed electroencephalogram data in the time-frequency domain and the spatial domain to obtain multidimensional features, from which the original feature matrix $X = \{x_1, x_2, \ldots, x_m\}$ is constructed.

In a further scheme, in the step 2), feature extraction is performed on the preprocessed electroencephalogram data in a time-frequency domain and a space domain to obtain multidimensional features, so that the step of constructing an original feature matrix is as follows:

21) Performing feature extraction on the preprocessed electroencephalogram signal in the time-frequency domain based on discrete wavelet transformation to obtain time-frequency domain features;

22) Performing feature extraction on the preprocessed electroencephalogram signal in the spatial domain based on the common spatial pattern (CSP) method to obtain spatial-domain features; CSP is a feature extraction algorithm for two-class tasks that minimizes the variance of one class while maximizing the variance of the other, thereby obtaining the most discriminative feature vectors;

23) Combining the extracted time-frequency domain features and spatial-domain features into a combined feature matrix, thereby obtaining the original feature matrix.
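The CSP principle of step 22) can be illustrated with a minimal sketch, assuming the common formulation as a generalized eigenvalue problem on the class covariance matrices. The function names, the trace normalization, the log-variance features and the synthetic four-channel data are assumptions for the example and are not specified in the text.

```python
import numpy as np
from scipy.linalg import eigh


def mean_cov(trials):
    """Average trace-normalized spatial covariance.

    trials has shape (n_trials, n_channels, n_samples).
    """
    covs = [x @ x.T / np.trace(x @ x.T) for x in trials]
    return np.mean(covs, axis=0)


def fit_csp(trials_a, trials_b, n_pairs=1):
    """Return spatial filters as rows, shape (2 * n_pairs, n_channels)."""
    ca, cb = mean_cov(trials_a), mean_cov(trials_b)
    # Solve ca w = lambda (ca + cb) w; eigenvalues come back ascending.
    # Small lambda: little class-A variance (class B dominates).
    # Large lambda: class-A variance dominates.
    vals, vecs = eigh(ca, ca + cb)
    idx = np.r_[np.arange(n_pairs), np.arange(len(vals) - n_pairs, len(vals))]
    return vecs[:, idx].T


def csp_features(w, trial):
    """Log of the normalized variance of each spatially filtered signal."""
    var = (w @ trial).var(axis=1)
    return np.log(var / var.sum())


# Synthetic data: class A is strong on channel 0, class B on channel 3.
rng = np.random.default_rng(0)
scale_a = np.array([3.0, 1.0, 1.0, 1.0])[:, None]
scale_b = np.array([1.0, 1.0, 1.0, 3.0])[:, None]
trials_a = rng.standard_normal((20, 4, 256)) * scale_a
trials_b = rng.standard_normal((20, 4, 256)) * scale_b

w = fit_csp(trials_a, trials_b)
feat_a = csp_features(w, trials_a[0])
feat_b = csp_features(w, trials_b[0])
```

With one filter pair, the first row of `w` favors class B and the second favors class A, so the two classes yield feature vectors with opposite ordering.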

In a further scheme, in step 21), the electroencephalogram signal is decomposed by discrete wavelet transformation into an approximation component and detail components, and the energy of the detail components is used as the time-frequency domain feature data.

In a further scheme, in the step 3), the method for performing dimension reduction processing on the original feature matrix by using a Principal Component Analysis (PCA) algorithm to obtain a final feature matrix comprises the following steps:

the method comprises the steps of performing dimension reduction processing on an original feature matrix by using a principal component analysis algorithm, reducing high-dimensional data into a low-dimensional space, reducing redundant information of the data, improving the processing efficiency of the data and the precision of a model, and obtaining a final feature matrix, wherein the method comprises the following specific steps of:

31) Centering the original feature matrix $X = \{x_1, x_2, \ldots, x_m\}$, where m is the number of feature vectors and $x_m$ is the m-th feature vector, to obtain the centered matrix Y;

32) Calculating the covariance matrix D of the centered matrix Y:

$$D = \frac{1}{m} Y Y^T$$

wherein m is the number of feature vectors, Y is the centered matrix, and $Y^T$ is the transpose of Y;

33) Calculating the eigenvalues and eigenvectors of the covariance matrix D through singular value decomposition;

34) Sorting the eigenvalues obtained in step 33) from largest to smallest and selecting the k largest; then taking the k corresponding eigenvectors as column vectors to form the eigenvector matrix H, wherein k is determined from the cumulative contribution rate;

35) Obtaining from the matrix H the final feature matrix F = HX after PCA dimension reduction, wherein X is the original feature matrix. The final feature matrix retains as much of the useful information as possible while reducing redundant information in the data, which improves data processing efficiency and makes it a better input for the classifier in the next step.
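Steps 31) to 35) might be sketched as follows. The sketch assumes that the feature vectors are the columns of X, that the cumulative contribution threshold is 0.95 (the text gives no value), and that the projection is applied as the transpose of H to the centered matrix so that the dimensions agree; these choices, the function name `pca_reduce` and the synthetic data are assumptions rather than details taken from the description.

```python
import numpy as np


def pca_reduce(X, threshold=0.95):
    """PCA on a (n_features, m) matrix whose columns are feature vectors."""
    m = X.shape[1]
    Y = X - X.mean(axis=1, keepdims=True)   # 31) centering
    D = (Y @ Y.T) / m                       # 32) covariance matrix
    # 33) D is symmetric positive semi-definite, so its SVD yields the
    # eigenvectors (columns of U) and eigenvalues (s), sorted descending.
    U, s, _ = np.linalg.svd(D)
    # 34) smallest k whose cumulative contribution rate reaches the threshold
    ratio = np.cumsum(s) / s.sum()
    k = int(np.searchsorted(ratio, threshold) + 1)
    H = U[:, :k]
    # 35) projection onto the k leading eigenvectors (assumed form H^T Y)
    F = H.T @ Y
    return F, H, k


# Synthetic data: 6 observed features driven by 2 latent factors.
rng = np.random.default_rng(0)
latent = rng.standard_normal((2, 200))
mixing = rng.standard_normal((6, 2))
X = mixing @ latent + 0.01 * rng.standard_normal((6, 200))

F, H, k = pca_reduce(X)
```

Because the synthetic data have only two underlying factors, at most two components are needed to reach the assumed threshold, and the six-dimensional features are reduced accordingly.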

In step 4), the electroencephalogram signal recognition model is built using a stacking integration algorithm: a convolutional neural network and a long short-term memory network are selected as the base models of the first layer, and a logistic regression classifier is selected as the meta-model of the second layer. The recognition model thus has two layers. The first layer consists of the two base models; each base model is trained on a training set, and the trained base model then classifies the data and outputs predicted labels. The second layer is the meta-model, which takes the output of the first layer as its input and makes the final prediction. Combined with 5-fold cross-validation, this yields the multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology. The model is then trained to obtain a trained recognition model, which can effectively improve the recognition accuracy of electroencephalogram signals.

Further, the specific steps of training the multi-dimensional characteristic electroencephalogram signal recognition model to obtain a trained recognition model are as follows:

41) Dividing the electroencephalogram data preprocessed in step 1) into a training set $D_{train}$ and a test set $D_{test}$, then dividing the training set $D_{train}$ equally into five subsets; for each training run, one subset is selected as the validation set $D_m$ (m = 1, 2, 3, 4, 5) and the remaining four form a new training set;

42) Training a base model of the first layer on the different new training sets; for the same base model, the five different training sets produce five models with different parameters;

43) Using each trained base model to predict the corresponding validation set $D_m$, obtaining the prediction result $M_i$ (i = 1, 2, 3, 4, 5); then using the trained base model to predict the test set $D_{test}$, obtaining the prediction result $N_i$ (i = 1, 2, 3, 4, 5);

44) Repeating steps 42) and 43) for every base model, performing training and prediction with the new training sets and validation sets respectively, so that each base model yields a set $M_n$ of validation-set predictions; the validation-set prediction sets of all base models are denoted M. For each base model, the test-set predictions $N_i$ are combined by weighted averaging to give $N_n$; the test-set prediction sets of all base models are denoted N;

45) Using the validation-set prediction set M of all base models obtained in step 44) as the training set of the second-layer meta-model to obtain a trained meta-model, and using the test-set prediction set N of all base models as the test set of the second-layer meta-model, thereby obtaining the final multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology.
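The procedure of steps 41) to 45) can be sketched with scikit-learn as follows. To keep the example small and self-contained, a decision tree and a k-nearest-neighbours classifier stand in for the convolutional neural network and the long short-term memory network; this substitution, the use of class-1 probabilities rather than hard labels, the equal-weight mean over folds (the text does not state the weights), and the synthetic data are all assumptions made for the example.

```python
import numpy as np
from sklearn.base import clone
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier


def stack_layer_one(base_models, X_train, y_train, X_test, n_splits=5, seed=0):
    """Build the meta-model inputs M (validation folds) and N (test set)."""
    kfold = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    M = np.zeros((len(X_train), len(base_models)))
    N = np.zeros((len(X_test), len(base_models)))
    for n, model in enumerate(base_models):
        test_preds = []
        for fit_idx, val_idx in kfold.split(X_train):
            # 42) train a fresh copy on four of the five subsets
            fold_model = clone(model).fit(X_train[fit_idx], y_train[fit_idx])
            # 43) predict the held-out validation subset and the test set
            M[val_idx, n] = fold_model.predict_proba(X_train[val_idx])[:, 1]
            test_preds.append(fold_model.predict_proba(X_test)[:, 1])
        # 44) combine the five test predictions (equal weights assumed)
        N[:, n] = np.mean(test_preds, axis=0)
    return M, N


# Synthetic two-class data standing in for the final feature matrix.
rng = np.random.default_rng(0)
X = rng.standard_normal((300, 8))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Stand-in base models (NOT the CNN and LSTM of the description).
base_models = [DecisionTreeClassifier(max_depth=3, random_state=0),
               KNeighborsClassifier(n_neighbors=5)]
M, N = stack_layer_one(base_models, X_train, y_train, X_test)

# 45) the logistic regression meta-model is trained on M and tested on N
meta = LogisticRegression().fit(M, y_train)
accuracy = meta.score(N, y_test)
```

Each training sample is predicted only by a fold model that never saw it, so the meta-model is trained on out-of-fold predictions rather than on predictions the base models have already memorized.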

In a further scheme, a first layer base model of the multi-dimensional characteristic electroencephalogram signal identification model based on a stacking integration technology and parameters thereof are selected as follows:

a. Convolutional neural network (CNN): the model uses 2 convolutional layers, 1 Dropout layer, 1 max pooling layer and 1 fully connected layer; the kernel size of the first convolutional layer is 64×5 and that of the second convolutional layer is 128×3; ReLU is used as the activation function after each convolutional layer; the max pooling layer reduces the input size, the memory usage and the number of parameters, thereby reducing the amount of computation; the Dropout layer prevents the neural network from overfitting; finally, a Softmax function produces the class prediction output of the classification problem;

b. Long short-term memory network (LSTM): the hidden layer size of the LSTM unit is set to 64; the LSTM structure consists of a memory cell for storing information and three gates, namely an input gate, an output gate and a forget gate, which control the input and output of data; the LSTM uses four operations, namely sigmoid, tanh, multiplication and addition, which make the weights easier to update during model training; finally, a Softmax activation function in the fully connected layer enables the network to perform binary classification.
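For illustration only, a forward pass through a network with the layer types listed in item a. might look like the NumPy sketch below. It reads "64×5" and "128×3" as 64 filters of length 5 and 128 filters of length 3, places the pooling layer after the second convolution, and uses a single-channel input of length 32 with random weights; all of these are assumptions, since the text does not fix the layer order, the input shape or the initialization.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def conv1d(x, weight, bias):
    """Valid 1-D convolution (cross-correlation, as in deep-learning usage).

    x: (c_in, length); weight: (c_out, c_in, k); bias: (c_out,).
    """
    windows = sliding_window_view(x, weight.shape[-1], axis=-1)  # (c_in, L, k)
    return np.einsum("ilk,oik->ol", windows, weight) + bias[:, None]


def relu(x):
    return np.maximum(x, 0.0)


def max_pool1d(x, size=2):
    """Non-overlapping max pooling along the last axis."""
    length = x.shape[-1] // size * size
    return x[:, :length].reshape(x.shape[0], -1, size).max(axis=-1)


def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()


def cnn_forward(x, p):
    h = relu(conv1d(x, p["w1"], p["b1"]))   # 64 filters, kernel length 5
    h = relu(conv1d(h, p["w2"], p["b2"]))   # 128 filters, kernel length 3
    h = max_pool1d(h)                       # halves the sequence length
    # Dropout only acts during training; at inference it is the identity.
    return softmax(p["w_fc"] @ h.ravel() + p["b_fc"])


rng = np.random.default_rng(0)
length = 32                                   # assumed input length
pooled = (length - 5 + 1 - 3 + 1) // 2        # sequence length after pooling
params = {
    "w1": rng.standard_normal((64, 1, 5)) * 0.1, "b1": np.zeros(64),
    "w2": rng.standard_normal((128, 64, 3)) * 0.1, "b2": np.zeros(128),
    "w_fc": rng.standard_normal((2, 128 * pooled)) * 0.01,
    "b_fc": np.zeros(2),
}
x = rng.standard_normal((1, length))          # one feature vector, one channel
probs = cnn_forward(x, params)                # probabilities for classes 0 and 1
```

The two Softmax outputs correspond to the two classes of the identification task; training the weights is outside the scope of the sketch.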

The invention also provides a multi-dimensional characteristic electroencephalogram signal identification system based on the stacking integration technology, which adopts the multi-dimensional characteristic electroencephalogram signal identification method based on the stacking integration technology described above and comprises:

the electroencephalogram signal acquisition module is used for acquiring electroencephalogram signals;

the preprocessing module is used for preprocessing the acquired electroencephalogram signals;

the multidimensional feature extraction module is used for carrying out multidimensional feature extraction on the preprocessed electroencephalogram data, constructing a feature matrix and obtaining an original feature matrix;

the dimension reduction processing module is used for carrying out dimension reduction processing on the original feature matrix by utilizing a principal component analysis algorithm to obtain a final feature matrix;

the stacking integrated learning module is used for constructing, based on the preprocessed electroencephalogram data and a stacking integrated learning algorithm, a multi-dimensional characteristic electroencephalogram signal identification model that takes the final feature matrix as input; training the model to obtain a trained recognition model; and performing data preprocessing and multi-dimensional feature extraction on the electroencephalogram signal to be identified and inputting the result into the trained multi-dimensional characteristic electroencephalogram signal identification model to obtain the identification result.

The invention has the beneficial effects that:

according to the invention, the time-frequency domain and the spatial domain characteristics in the electroencephalogram signals are extracted, compared with the extraction of single electroencephalogram characteristics, the multidimensional characteristics keep the information contained in the electroencephalogram signals as completely as possible, the recognition precision and the classification performance of a model can be effectively improved, and meanwhile, the feature matrix is subjected to dimension reduction processing, so that the processing efficiency of data is improved;

the identification model of the invention combines the advantages of multiple base learners, which effectively improves the generalization capability of the model, compensates for the shortcomings of a single model and improves the classification performance; the model uses cross-validation, which helps prevent overfitting and further improves the generalization capability and accuracy of the model;

the invention applies deep learning to the brain electrical signals, and identifies and classifies the brain electrical signals through an artificial intelligent algorithm, thereby further improving the accuracy of identification and achieving the expected effect of assisting in diagnosing psychological diseases such as depression and the like;

the invention extracts the multidimensional characteristics of the brain electrical signals and keeps the information in the brain electrical signals as completely as possible;

a multi-dimensional characteristic electroencephalogram signal identification model is established from the electroencephalogram signals of healthy subjects and non-healthy subjects and is trained to improve judgment accuracy;

the invention adopts the stacking integration technology to combine the advantages of a plurality of models and improve the generalization capability and the accuracy of the models, thereby achieving the effect of assisting in diagnosing psychological diseases.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the following description will briefly explain the drawings used in the embodiments or the description of the prior art, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of a multi-dimensional characteristic EEG signal recognition method based on a stacked integration technology;

FIG. 2 is a diagram showing specific positions of electrodes during electroencephalogram signal acquisition according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of training and predicting a first tier model of a stacked integration strategy.

Detailed Description

The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.

According to the invention, the multi-dimensional characteristic electroencephalogram recognition model based on the stacking integration technology is trained by collecting the electroencephalogram signals of a plurality of subjects (the electroencephalogram signals of healthy subjects and unhealthy subjects), so that a trained recognition model is obtained. Preprocessing the electroencephalogram signals to be identified through denoising, normalization and the like, extracting multidimensional features to obtain an original feature matrix, and performing dimension reduction processing on the matrix by using a principal component analysis algorithm to obtain a final feature matrix. The final feature matrix is input into a trained recognition model, a prediction result 0 or 1 (wherein 0 represents health and 1 represents non-health) is output, whether the subject has psychological diseases or not is judged in an auxiliary mode according to the prediction result given by the model, and the recognition degree of the electroencephalogram signals can be effectively improved.

The invention uses a stacked integrated learning algorithm which improves the accuracy of prediction by integrating the prediction results of a plurality of base learners. In the invention, multidimensional feature extraction is used, a principal component analysis algorithm is utilized to carry out dimension reduction processing on the feature matrix, and a convolutional neural network and a long-term and short-term memory network are used for model training so as to realize accurate identification of the electroencephalogram signals.

Example 1

Referring to fig. 1, a flow chart of a multi-dimensional characteristic electroencephalogram signal identification method based on a stacked integration technology comprises the following steps: acquiring two different electroencephalogram signals of a healthy subject and a non-healthy subject and preprocessing data of the electroencephalogram signals; then carrying out multidimensional feature extraction on the preprocessed electroencephalogram data to obtain an original feature matrix; performing dimension reduction treatment on the original feature matrix to obtain a final feature matrix; constructing a multi-dimensional characteristic electroencephalogram signal identification model based on a stacking integration technology by taking the final characteristic matrix as input; training the model to obtain a trained recognition model; the electroencephalogram signal to be identified is subjected to data preprocessing and multidimensional feature extraction and then is input into a trained model, a prediction result 0 or 1 (wherein 0 represents health and 1 represents non-health) is output, whether the subject has psychological diseases or not is judged in an auxiliary mode according to the prediction result given by the model, and the identification degree of the electroencephalogram signal can be effectively improved. The method comprises the following specific steps:

S1: Acquiring two different electroencephalogram signals, from healthy subjects and non-healthy subjects, and preprocessing the signal data. The preprocessing mainly comprises denoising and normalization, which eliminate noise components in the signal, improve the signal quality and yield electroencephalogram data usable in the subsequent steps.

S2: Performing multidimensional feature extraction on the preprocessed data and constructing a feature matrix to obtain the original feature matrix. Features are extracted from the preprocessed electroencephalogram data in the time-frequency domain and the spatial domain to obtain multidimensional features, from which the feature matrix is constructed.

S3: Performing dimension reduction on the original feature matrix using a principal component analysis algorithm to obtain the final feature matrix.

S4: Based on the obtained electroencephalogram data and a stacking integrated learning algorithm, constructing a multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology, with the final feature matrix from step S3 as input; training the model to obtain a trained recognition model.

S5: Performing data preprocessing and multi-dimensional feature extraction on the electroencephalogram signal to be identified, inputting the result into the multi-dimensional characteristic electroencephalogram signal identification model trained in step S4, and outputting a prediction result of 0 or 1 (where 0 represents healthy and 1 represents non-healthy).

The specific method of the step 1 is as follows:

S11, acquiring two types of electroencephalogram signal data, from healthy subjects and non-healthy subjects. First, electroencephalogram signals are acquired from the subject using electroencephalogram acquisition equipment. To obtain the best signal quality, the following parameters are set: data are acquired using 64 channels, one of which is set as the reference electrode, and the sampling rate is set to 500 Hz. The specific electrode positions are shown in FIG. 2.

S12, denoising the original electroencephalogram signal to eliminate noise components in the signal. Firstly, carrying out filtering treatment by using band-pass filtering of 0.5-40 Hz; and secondly, removing the artifacts such as blinks, eyeball movements and the like in the data by using an independent component analysis method.

S13, normalizing the denoised signal so that its amplitude range is scaled to between 0 and 1. The data are preprocessed using Min-Max normalization as follows:

$$z_j' = \frac{z_j - z_{min}}{z_{max} - z_{min}}$$

wherein $z_j$ is the j-th element of sample z, $z_{max}$ is the maximum value in the sample data, and $z_{min}$ is the minimum value in the sample data.

The specific method of the step 2 is as follows:

S21, performing feature extraction on the preprocessed electroencephalogram signal in the time-frequency domain based on discrete wavelet transformation to obtain time-frequency domain features; the electroencephalogram signal is decomposed by discrete wavelet transformation into an approximation component and detail components, and the energy of the detail components is used as the time-frequency domain feature data.

Specifically, the discrete wavelet transform is defined as follows:

$$\mathrm{DWT}(a, n) = \frac{1}{\sqrt{2^{a}}} \int_{-\infty}^{\infty} f(t)\, \psi^{*}\!\left(\frac{t - n\,2^{a}}{2^{a}}\right) dt$$

In this formula, $\psi$ is the wavelet basis function, a and n represent the frequency resolution and the time shift respectively, f(t) is the preprocessed electroencephalogram signal, and t is the time index. The wavelet function selected by the invention is db4. The signal is decomposed using the Mallat algorithm:

$$x[e] = A_L[e] + \sum_{i=1}^{L} D_i[e]$$

In this formula, x[e] is the discrete output signal, e is the time index, L is the number of decomposition levels, $A_L$ is the low-pass approximation component, and $D_i$ is the detail component of each level.
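The multi-level Mallat decomposition and the detail-energy features of step S21 can be illustrated with a small NumPy sketch. To stay self-contained, the sketch substitutes the two-tap Haar wavelet for the db4 wavelet named in the text; this substitution, the function names, the number of levels and the synthetic signal are assumptions made for the example.

```python
import numpy as np


def haar_step(x):
    """One Mallat level with the Haar wavelet (swapped in for db4).

    Returns the low-pass approximation and high-pass detail coefficients,
    each half the length of x.
    """
    even, odd = x[0::2], x[1::2]
    approx = (even + odd) / np.sqrt(2.0)
    detail = (even - odd) / np.sqrt(2.0)
    return approx, detail


def detail_energies(signal, levels):
    """Energies of the detail components D_1 ... D_L, plus the final A_L.

    The signal length must be divisible by 2 ** levels.
    """
    approx = np.asarray(signal, dtype=float)
    energies = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        energies.append(float(np.sum(detail ** 2)))
    return energies, approx


rng = np.random.default_rng(0)
t = np.arange(512) / 500.0                       # 512 samples at 500 Hz
signal = np.sin(2 * np.pi * 10 * t) + 0.2 * rng.standard_normal(t.size)

energies, approx_L = detail_energies(signal, levels=4)
total = sum(energies) + float(np.sum(approx_L ** 2))
```

Because the Haar transform is orthonormal, the detail energies and the energy of the final approximation add up to the energy of the input signal. With the PyWavelets package, a db4 decomposition could be obtained from `pywt.wavedec(signal, "db4", level=4)` instead, at the cost of an extra dependency.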

S22, performing feature extraction on the preprocessed electroencephalogram signal in the spatial domain based on the common spatial pattern (CSP) method to obtain spatial-domain features; CSP is a feature extraction algorithm for two-class tasks that maximizes the variance of one class while minimizing the variance of the other, thereby obtaining the most discriminative feature vectors.

S23, combining the extracted time-frequency domain features and the spatial domain features to construct a combined feature matrix, and obtaining an original feature matrix.

The specific method of the step 3 is as follows:

S31, centering the original feature matrix $X = \{x_1, x_2, \ldots, x_m\}$, where m is the number of feature vectors and $x_m$ is the m-th feature vector, to obtain the centered matrix Y;

S32, calculating the covariance matrix D of the centered matrix Y;

S33, calculating the eigenvalues and eigenvectors of the covariance matrix D through singular value decomposition;

S34, sorting the obtained eigenvalues from largest to smallest and selecting the k largest; then taking the k corresponding eigenvectors as column vectors to form the eigenvector matrix H, wherein k is determined from the cumulative contribution rate;

S35, obtaining from the matrix H the final feature matrix F = HX after dimension reduction, wherein X is the original feature matrix. The final feature matrix retains as much of the useful information as possible while reducing redundant information in the data, which improves data processing efficiency and makes it a better input for the classifier in the next step.

The specific method of the step 4 is as follows:

The electroencephalogram signal identification model is constructed using a stacking integration algorithm: a convolutional neural network and a long short-term memory network are selected as the base models of the first layer, and a logistic regression classifier is selected as the meta-model of the second layer. The recognition model thus has two layers. The first layer consists of the two base models; each base model is trained on a training set, and the trained base model then classifies the data and outputs predicted labels. The second layer is the meta-model, which takes the output of the first layer as its input and makes the final prediction. Combined with 5-fold cross-validation, this yields the multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology, which is then trained to obtain a trained recognition model. The training and prediction steps of the first-layer base models are shown in FIG. 3.

Training the multidimensional characteristic electroencephalogram signal recognition model to obtain a trained recognition model, wherein the specific steps are as follows:

S41, dividing the electroencephalogram data into a training set $D_{train}$ and a test set $D_{test}$, then dividing the training set $D_{train}$ equally into five subsets; for each training run, one subset is selected as the validation set $D_m$ (m = 1, 2, 3, 4, 5) and the remaining four form a new training set;

S42, training a base model of the first layer on the different new training sets. For the same base model, the five different training sets produce five models with different parameters;

S43, using each trained base model to predict the corresponding validation set $D_m$, obtaining the prediction result $M_i$ (i = 1, 2, 3, 4, 5); then using the trained base model to predict the test set $D_{test}$, obtaining the prediction result $N_i$ (i = 1, 2, 3, 4, 5);

S44, repeating steps S42 and S43 for every base model, performing training and prediction with the new training sets and validation sets respectively, so that each base model yields a set $M_n$ of validation-set predictions; the validation-set prediction sets of all base models are denoted M. For each base model, the test-set predictions $N_i$ are combined by weighted averaging to give $N_n$; the test-set prediction sets of all base models are denoted N;

S45, using the validation-set prediction set M of all base models obtained in step S44 as the training set of the second-layer meta-model to obtain a trained meta-model, and using the test-set prediction set N of all base models as the test set of the second-layer meta-model, thereby obtaining the final multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology.

The first-layer base models of the multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology and their parameters are selected as follows:

a. Convolutional neural network (CNN): the model uses 2 convolutional layers, 1 Dropout layer, 1 max pooling layer and 1 fully connected layer. The kernel size of the first convolutional layer is 64×5, and that of the second convolutional layer is 128×3. ReLU is used as the activation function after each convolutional layer. The max pooling layer reduces the input size, the memory usage and the number of parameters, thereby reducing the amount of computation. The Dropout layer is used to prevent overfitting of the neural network, and finally the Softmax function is used as the class prediction output for the classification problem.

b. Long short-term memory network (LSTM): the hidden layer size in the LSTM unit is set to 64. The LSTM structure consists of one memory cell for storing information and three gates (input gate, output gate and forget gate), which control the input and output of data. Four different operations are used in the LSTM, namely sigmoid, tanh, multiplication and addition, which make it easier to update the weights during model training. Finally, in the fully connected layer, the Softmax activation function enables the neural network to perform binary classification.
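To make the gate structure of item b concrete, the following minimal Python sketch runs one LSTM unit over a short sequence using the standard LSTM update equations, followed by a fully connected layer with Softmax over two classes. Only the hidden size of 64 is taken from the text; the input size, sequence length, random weights and all names are assumptions chosen for illustration, and the sketch performs a forward pass only (no training).

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(W, b, v):
    """Compute W v + b for a weight matrix W (list of rows) and bias vector b."""
    return [sum(w * x for w, x in zip(row, v)) + bias for row, bias in zip(W, b)]


def lstm_step(x, h, c, params):
    """One time step: three sigmoid gates, a tanh candidate, then multiply and add."""
    v = h + x  # concatenation of the previous hidden state and the current input
    i = [sigmoid(z) for z in affine(*params["input"], v)]        # input gate
    f = [sigmoid(z) for z in affine(*params["forget"], v)]       # forget gate
    o = [sigmoid(z) for z in affine(*params["output"], v)]       # output gate
    g = [math.tanh(z) for z in affine(*params["candidate"], v)]  # candidate values
    c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]  # memory cell update
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]           # new hidden state
    return h_new, c_new


def softmax(z):
    m = max(z)  # subtract the maximum for numerical stability
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]


def init_layer(rows, cols, rng):
    """Small random weights and zero biases (illustrative initialisation only)."""
    W = [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return W, [0.0] * rows


rng = random.Random(0)
hidden_size = 64  # hidden layer size stated in the text
input_size = 8    # assumption: number of features per time step
params = {name: init_layer(hidden_size, hidden_size + input_size, rng)
          for name in ("input", "forget", "output", "candidate")}
W_fc, b_fc = init_layer(2, hidden_size, rng)  # fully connected layer, two classes

# Assumption: a random sequence of 10 time steps standing in for an EEG feature sequence.
sequence = [[rng.gauss(0.0, 1.0) for _ in range(input_size)] for _ in range(10)]
h, c = [0.0] * hidden_size, [0.0] * hidden_size
for x in sequence:
    h, c = lstm_step(x, h, c, params)

probs = softmax(affine(W_fc, b_fc, h))  # class probabilities for the two classes
print(probs)
```

The forget gate scales the previous memory cell content and the input gate scales the new candidate values, so the cell update uses only the multiplication and addition operations named above.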

Example 2

A multi-dimensional characteristic electroencephalogram signal identification system based on a stacking integration technology adopts the multi-dimensional characteristic electroencephalogram signal identification method based on the stacking integration technology of Example 1, and comprises the following steps:

It will be understood that modifications and variations will be apparent to those skilled in the art from the foregoing description, and it is intended that all such modifications and variations be included within the scope of the following claims.

Claims

1. The multi-dimensional characteristic electroencephalogram signal identification method based on the stacking integration technology is characterized by comprising the following steps of:

1) Acquiring brain electrical signals of a healthy subject and a non-healthy subject, and preprocessing data of the two different brain electrical signals;

2. The multi-dimensional characteristic electroencephalogram signal identification method based on stacked integration technology according to claim 1, wherein the method comprises the following steps of: the step of preprocessing the data of the electroencephalogram signals in the step 1) mainly comprises denoising and normalization; the method comprises the following steps:

11) Denoising the original electroencephalogram signal to eliminate noise components in the signal, specifically: first, filtering is performed; second, blink and eye movement artifacts in the data are removed using independent component analysis;

3. The multi-dimensional characteristic electroencephalogram signal identification method based on stacked integration technology according to claim 1, wherein the method comprises the following steps of: in the step 2), the preprocessed electroencephalogram data is subjected to feature extraction on a time frequency domain and a space domain to obtain multidimensional features, so that an original feature matrix X = {x_1, x_2, …, x_m} is constructed.

4. The multi-dimensional characteristic electroencephalogram identification method based on stacking integration technology according to claim 3, wherein the method comprises the following steps of: in the step 2), the preprocessed electroencephalogram data is subjected to feature extraction on a time frequency domain and a space domain to obtain multidimensional features, so that the step of constructing an original feature matrix is as follows:

22) Performing feature extraction on the preprocessed electroencephalogram signals in the spatial domain based on the common spatial pattern (CSP) method to obtain spatial domain features;

5. The multi-dimensional characteristic electroencephalogram signal identification method based on stacking integration technology according to claim 4, wherein the method comprises the following steps of: in step 21), the electroencephalogram signal is decomposed based on the discrete wavelet transform to obtain an approximation component and a detail component, and the energy information of the detail component is used as the time-frequency domain feature data.

6. The multi-dimensional characteristic electroencephalogram signal identification method based on stacked integration technology according to claim 1, wherein the method comprises the following steps of: in the step 3), the original feature matrix is subjected to dimension reduction processing by using a principal component analysis algorithm, and the method for obtaining the final feature matrix comprises the following steps:

34) The eigenvalues obtained in step 33) are sorted in descending order, and the k largest eigenvalues are selected; the k corresponding eigenvectors are then taken as column vectors to form an eigenvector matrix H, wherein k is determined from the cumulative contribution rate;

35) According to the matrix H, the final feature matrix after dimension reduction is obtained as F = HX, wherein X is the original feature matrix.

7. The multi-dimensional characteristic electroencephalogram signal identification method based on stacked integration technology according to claim 1, wherein the method comprises the following steps of: in the step 4), an electroencephalogram signal identification model is built by using a stacking integration algorithm, two algorithms, namely a convolutional neural network and a long short-term memory network, are selected as the base models of the first layer, and a logistic regression classifier is selected as the meta model of the second layer; the identification model is divided into two layers, wherein the first layer consists of the two base models, each base model is trained using a training set, and the trained base model is then used to classify and predict data and output a prediction label; the second layer is the meta model, which takes the output result of the first layer as its input for prediction, and the multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology is finally obtained in combination with a 5-fold cross-validation method; and the multi-dimensional characteristic electroencephalogram signal identification model is trained to obtain a trained identification model.

8. The multi-dimensional characteristic electroencephalogram identification method based on stacked integration technology according to claim 7, wherein the method comprises the following steps:

45) Using the verification set prediction result set M of all the base models obtained in step 44) as the training set of the second-layer meta model, and training it to obtain a trained meta model; the test set prediction result set N of all the base models is then used as the test set of the second-layer meta model, thereby obtaining the final multi-dimensional characteristic electroencephalogram signal identification model based on the stacking integration technology.

9. The multi-dimensional characteristic electroencephalogram signal identification method based on stacked integration technology according to claim 7 or 8, wherein the method comprises the following steps of:

the base model of the first layer and its parameters are chosen as follows:

a. convolutional neural network: the model uses 2 convolutional layers, 1 Dropout layer, 1 max pooling layer and 1 fully connected layer; wherein the kernel size of the first convolutional layer is 64×5, and the kernel size of the second convolutional layer is 128×3; ReLU is used as the activation function after each convolutional layer; the max pooling layer reduces the input size, the memory usage and the number of parameters, thereby reducing the amount of computation; the Dropout layer is used to prevent overfitting of the neural network, and finally the Softmax function is used as the class prediction output for the classification problem;

b. long short-term memory network (LSTM): the hidden layer size in the LSTM unit is set to 64, and the LSTM structure consists of one memory cell for storing information and three gates, namely an input gate, an output gate and a forget gate; the three gates control the input and output of data; four different operations are used in the LSTM, namely sigmoid, tanh, multiplication and addition, for updating the weights during model training; finally, in the fully connected layer, the Softmax activation function enables the neural network to perform binary classification.

10. A multi-dimensional characteristic electroencephalogram signal identification system based on a stacking integration technology, characterized in that: the identification system adopts the multi-dimensional characteristic electroencephalogram signal identification method based on the stacking integration technology as claimed in any one of claims 1-9, and comprises the following steps: