CN115017960A

CN115017960A - Electroencephalogram signal classification method based on space-time combined MLP network and application

Info

Publication number: CN115017960A
Application number: CN202210849649.5A
Authority: CN
Inventors: 李畅; 邵成浩; 宋仁成; 刘羽; 成娟; 陈勋
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2022-07-19
Filing date: 2022-07-19
Publication date: 2022-09-06
Anticipated expiration: 2042-07-19
Also published as: CN115017960B

Abstract

The invention discloses an electroencephalogram classification method based on a space-time combined MLP network and application thereof, and the method comprises the following steps: 1, preprocessing original electroencephalogram data, including selection of data to be classified, sliding window slicing, data up-sampling and selection of data input shapes; 2, establishing a deep learning model of the multilayer perceptron network; 3, in a training stage, inputting data and continuously optimizing model parameters through cross entropy loss to obtain a final classification model for classifying the electroencephalogram signals to be tested; and 4, calibrating the prediction result sequence of the model by using a moving average filtering algorithm. The invention applies the time-space information of the electroencephalogram data to the multilayer sensor network, and can obviously improve the classification accuracy of the electroencephalogram signals, thereby increasing the application value of the electroencephalogram signals in the fields of medical treatment and the like.

Description

Electroencephalogram signal classification method based on space-time combined MLP network and application

Technical Field

The invention relates to the field of electroencephalogram signal classification, in particular to a method for predicting and classifying electroencephalogram signals by combining an MLP (multi-level label layer) network of time and space information of multi-channel electroencephalogram signals.

Background

Electroencephalography (EEG) is a physiological technique for recording physiological electrical signals of the brain. The recognition and prediction of physiological and psychological states from patterns of neural activity observed in the scalp and intracranial electroencephalograms are widely used in the field of brain-computer interfaces such as emotion recognition, motor imagery, medical health, and the like. Conventional machine learning methods, such as auto-regressive coefficients and Lyapunov indices, which are manually extracted, have been used with some success in a tightly controlled experimental environment. However, these manually extracted features often require researchers to have extensive expertise and to perform a large number of experimental attempts. In addition, in real electroencephalogram recording influenced by various artifacts, manually extracted features often only cover part of electroencephalogram information, so that the system has poor robustness.

The deep learning algorithm is stimulated to be widely applied to electroencephalogram classification and prediction due to the excellent generalization ability and the strong ability of automatically learning high-efficiency characteristics. At present, most deep learning methods for classification of electroencephalogram signals carry out feature preprocessing, such as short-time Fourier transform, public space mode and the like. These preprocessing operations on the original brain wave, while resulting in more "clean" data, may also lose some important information. Models that use feature pre-processing and directly use raw brain signals in recent years typically have more complex architectures and larger kernels, resulting in greater memory resource consumption and computational power requirements.

At present, most deep learning algorithms for classification of electroencephalogram signals are generally used as feature classifiers. Researchers extract time domain features, frequency domain features or time-frequency domain features from electroencephalogram signals through existing professional knowledge, and then use a deep learning algorithm to perform classification tasks. Although this method also achieves good classification performance, it requires deep mathematical knowledge for extracting features, and it neglects the strong data driving capability of deep learning algorithms. The simultaneously extracted features, although being a better data representation to some extent, also lose much of the spatial correlation information and temporal correlation information present in the original multi-channel electroencephalogram data. There are also a few deep learning algorithms that use an end-to-end architecture, but none of them fully utilize the spatial correlation information and temporal correlation information present in multi-channel brain electrical signals. Due to the limitation of various conditions, the total data amount of the electroencephalogram signals is seriously insufficient, so that the development of the classification method of the electroencephalogram signals is greatly limited. And because the electroencephalogram data of different types have the serious data imbalance problem, the development of the classification method is greatly limited.

Disclosure of Invention

The invention aims to overcome the defects and provides the electroencephalogram signal classification method based on the space-time joint MLP network and the application thereof, so that the spatial correlation information and the time correlation information of multi-channel electroencephalograms can be extracted from the original electroencephalogram signals and applied to the multilayer perceptron network, the electroencephalogram signal classification accuracy can be obviously improved, and the application value of the electroencephalogram signals in the fields of medical treatment and the like is increased.

In order to achieve the purpose, the invention adopts the following technical scheme:

the invention relates to an electroencephalogram classification method based on a space-time combined MLP network, which is characterized by comprising the following steps:

step 1, acquiring an electroencephalogram signal data set with labeled category information, performing sliding slicing processing on the electroencephalogram signal of each category, reconstructing the input shape of the sliced electroencephalogram signal, obtaining N segments of electroencephalogram signal samples with total duration T, and recording the N segments of electroencephalogram signal samples as a training sample set X{X ₁ ,X ₂ ,...,X _i ,...,X _N And denote the label set of the training sample X as Y ═ Y ₁ ,Y ₂ ,...,Y _i ,...,Y _N }; wherein, X _i ∈R ^C×1×L Representing the ith section of electroencephalogram signal sample after input shape reconstruction, C representing the number of channels of the electroencephalogram signal sample, and L representing the length of the electroencephalogram signal sample; y is _i As the ith segment of EEG signal sample X _i A corresponding label;

step 2, establishing a space-time combination-based MLP network, comprising: the system comprises a denoising weighting module, a space-time joint MLP module and a classification module;

step 2.1, the denoising weighting module comprises: a denoising layer, a weighting layer and a dimensionality reduction layer;

the de-noising layer comprises a manually set matrix filter with randomly initialized element values, and firstly, the de-noising layer converts a training sample set X (X) through fast Fourier transform ₁ ,X ₂ ,...,X _i ,...,X _N Converting the time domain into the frequency domain, multiplying the training sample set transformed into the frequency domain by a learnable matrix filter to obtain a denoised training sample set, and transforming the denoised training sample set into the time domain through inverse fast Fourier transform to obtain a time domain denoised electroencephalogram sample sequence

Wherein the content of the first and second substances,

representing the denoised i-th segment of time domain electroencephalogram signal sample;

the brain electrical signal sequence

Converting from three dimensions to two dimensions to obtain a two-dimensional de-noised electroencephalogram sample sequence

Wherein the content of the first and second substances,

representing a two-dimensional ith segment of electroencephalogram signal sample;

the weighting layer comprises a channel weight matrix which is preset manually and can learn diagonal element values. The weighting layer firstly carries out two-dimensional electroencephalogram sample sequence

Multiplying the weighted sequence by the channel weight matrix to obtain a weighted sequence

Wherein the content of the first and second substances,

representing the ith segment of electroencephalogram signal sample after channel weighting;

the dimensionality reduction layer comprises a group of 1 xk convolution kernels and weights the channel to the electroencephalogram sample sequence

The redundant information is removed in the time dimension (length dimension) to obtain the redundancy-removed electroencephalogram sample sequence

Wherein the content of the first and second substances,

representing the ith section of electroencephalogram sample after redundant information is removed;

step 2.2, the space-time combined MLP module comprises: an inter-channel MLP layer and an intra-channel MLP layer;

the inter-channel MLP layer sequentially comprises: a layer norm layer, a transformation full-link layer, a GELU nonlinear activation function and a restoration full-link layer;

layer pair redundancy removal electroencephalogram sample sequence

After normalization processing, the spatial correlation electroencephalogram samples are obtained after processing of the transformation full-connection layer, the GELU activation function and the restoration full-connection layer in sequenceSequence of

Wherein the content of the first and second substances,

representing the ith segment of electroencephalogram signal sample which is extracted, integrated and associated with the channel space;

the intra-channel MLP layer and the inter-channel MLP layer have the same structure, and are used for spatially correlated electroencephalogram sample sequences

After normalization processing, the time information electroencephalogram sample sequence is obtained after processing of the transformation full-connection layer, the GELU activation function and the restoration full-connection layer in sequence

Wherein the content of the first and second substances,

representing the ith section of electroencephalogram signal sample extracted through time information in the channel;

step 2.3, the classification module comprises: an averaging pooling layer, a full link layer and a Softmax layer;

time information electroencephalogram sample sequence

After the average pooling layer and the full-connection layer are sequentially processed, the score of each electroencephalogram signal sample corresponding to each category is obtained, finally, the score of each electroencephalogram signal sample corresponding to each category is converted into the probability value of each category through the Softmax layer, and the maximum probability value is selected as the prediction classification result of each electroencephalogram signal sample;

step 3, model training:

based on the training sample set X and the label set Y thereof, adopting cross entropy as a loss function, training the space-time combined MLP network by using an ADAM optimizer, and calculating the gradient of the loss function to update network parameters until the maximum iteration times or the loss function convergence is reached, thereby obtaining a trained electroencephalogram signal classification model;

step 4, calibrating the prediction result sequence of the model by using a moving average filtering algorithm:

taking the ith segment of electroencephalogram sample X _i And its following M-1 segment of electroencephalogram samples { X _i+1 ,X _i+2 ,...,X _i+M-1 The mean value of probability values of each category corresponding to each electroencephalogram signal sample in the data is correspondingly used as the X of the ith sample _i Each category probability value of (1).

The electronic device comprises a memory and a processor, and is characterized in that the memory is used for storing programs for supporting the processor to execute the electroencephalogram classification method, and the processor is configured to execute the programs stored in the memory.

The invention relates to a computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when executed by a processor, performs the steps of the electroencephalogram classification method.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention provides a new end-to-end electroencephalogram signal classification method based on an MLP network, which designs a denoising weighting module before electroencephalogram signal classification, wherein the denoising weighting module comprises three independent functional layers: the denoising layer is used for reducing various artifact noises in the original electroencephalogram signal; the weighting layer automatically learns channels more favorable for making correct inference from the multi-channel electroencephalogram, and strengthens the function of the channels favorable for correct classification by giving larger weight; the dimensionality reduction layer is used for eliminating redundant information in the electroencephalogram signals, so that the robustness of the electroencephalogram signal classification result is improved.

2. Aiming at the characteristics of multi-channel electroencephalogram signals, the invention designs a space-time combined MLP module which comprises two independent functional layers: the inter-channel MLP layer is used for extracting and integrating spatial correlation existing among different channel electroencephalogram signals; the intra-channel MLP layer is used for extracting time information of each channel of electroencephalogram data, and time and space associated information of electroencephalogram signals are extracted and integrated, so that the generalization performance of the model is remarkably improved.

Drawings

FIG. 1 is a schematic diagram of a network architecture according to the present invention;

FIG. 2 is a schematic diagram of a denoising weighting module according to the present invention;

FIG. 3 is a schematic diagram of a spatiotemporal interaction MLP module according to the present invention;

FIG. 4 is a schematic diagram of the input and output of the de-noising layer of the present invention (data from the CHB-MIT data set subject 5).

Detailed Description

In the embodiment, the electroencephalogram signal classification method based on the space-time interaction MLP network firstly reduces noise in the electroencephalogram signals through a denoising weighting module, strengthens the effect of the electroencephalogram signals of important channels, and rejects redundant information in the electroencephalogram signals; extracting spatial correlation and time information of the integrated multi-channel electroencephalogram through a space-time interactive MLP module; the data sequence is converted into a prediction result through a classification module, as shown in fig. 1, specifically, the method comprises the following steps:

step 1, acquiring an electroencephalogram signal data set with labeled category information, performing data selection on the electroencephalogram signals to obtain electroencephalogram signals of C channels, slicing the electroencephalogram signals of the C channels through a sliding window, reconstructing the input shape of the sliced electroencephalogram signals, and accordingly obtaining N sections of electroencephalogram signal samples with the total duration of T, and recording as a training sample set X ═ { X ═ X ₁ ,X ₂ ,...,X _i ,...,X _N And set Y of labels for training sample X to { Y ═ Y } ₁ ,Y ₂ ,...,Y _i ,...,Y _N }; wherein X _i ∈R ^C ^×1×L Representing the ith section of electroencephalogram signal sample after input shape reconstruction, C representing the number of channels of the electroencephalogram signal, and L representing the length of the electroencephalogram signal sample; y is _i Is the ith segment of EEG signal sample X _i In the embodiment, the number of channels of the electroencephalogram signal is 22, the length of the sliding window is 30s, and the sampling rate of the electroencephalogram signal per second is 256 Hz; the method uses two publicationsAn electroencephalogram data set: a CHB-MIT dataset and a Kaggle dataset;

step 2, establishing a space-time joint based MLP network, the structure of which is shown in FIG. 1 and comprises: the system comprises a denoising weighting module, a space-time joint MLP module and a classification module;

as shown in fig. 2, the denoising weighting module includes: a denoising layer, a weighting layer and a dimensionality reduction layer.

The denoising layer comprises a fast Fourier transform, an inverse fast Fourier transform and a learnable matrix filter; the weighting layer comprises a learnable channel weight matrix; the dimensionality reduction layer comprises a set of convolution kernels;

as shown in fig. 3, the spatiotemporal union MLP module includes: inter-channel MLP layer and intra-channel MLP layer.

The inter-channel MLP layer and the intra-channel MLP layer have the same structure and comprise a layer norm layer, a transformation full-connection layer, a GELU nonlinear activation function and a restoration full-connection layer, and the difference is that the inter-channel MLP layer acts on the space dimension of electroencephalogram data, and the intra-channel MLP layer acts on the time dimension of electroencephalogram signals;

the classification module comprises: an averaging pooling layer, a fully connected layer and a Softmax layer.

Step 2.1, initializing the model parameters:

before training, randomly initializing all weight parameters in the network;

step 2.2, the reconstructed slice sample X _i ∈R ^C×1×L Inputting the data into a network, and obtaining the data after the data is processed by a denoising weighting module

Wherein, C _r And L _r Respectively representing the number and the length of channels of the processed electroencephalogram samples;

in this embodiment, the denoising weighting module is implemented as follows:

firstly, brain electrical data X is changed into X ₁ ,X ₂ ,...,X _i ,...,X _N Converting the frequency domain EEG signal sequence into the frequency domain through Fast Fourier Transform (FFT)

Wherein the content of the first and second substances,

representing the ith segment of EEG signal sample of frequency domain, multiplying the EEG data of frequency domain by a manually preset matrix filter W with learnable element value ₁ Then obtaining a frequency domain denoising sequence

Wherein the content of the first and second substances,

the frequency domain denoised electroencephalogram signal sample of the ith segment is represented, the matrix filter can effectively reduce high-frequency noise artifacts in the electroencephalogram signal, as shown in fig. 4, the output of a denoising layer is smoother compared with the input, and high-frequency components are effectively removed.

Converting the denoised frequency domain data into a time domain through Inverse Fast Fourier Transform (IFFT) to obtain a time domain denoising sequence

Wherein

Representing an ith time domain denoised electroencephalogram signal sample; then three-dimensionally

Is removed to obtain two-dimensional

And then has a two-dimensional denoising sequence

Inputting the two-dimensional noise reduction sequence into a weighting layer, multiplying the two-dimensional noise reduction sequence by a channel weight diagonal matrix which is preset manually and can learn diagonal element values (each diagonal element value corresponds to a channel weight value of a channel)To obtain a channel weighting sequence

Wherein the content of the first and second substances,

the weighting layer gives greater weight to electroencephalogram channels containing important information to strengthen the role of the electroencephalogram channels in electroencephalogram classification, a part of channel weight values learned by the weighting layer are shown in table 3, channels with the learned channel weight values being more than or equal to 0.6 are defined as important channels, and the channels have 6 important channels for the 21 st subject in the CHB-MIT data set and have more important significance for model deduction.

Table 3 shows the weight values of the weighting layers of the present invention (data derived from CHB-MIT data set subject 21).

channel	LOOCV-1	LOOCV-2	LOOCV-3	LOOCV-4
					u’FP1-F7’	0.233	0.230	0.225	0.213
u’F7-T7’	0.212	0.210	0.206	0.202
					u’T7-P7’	0.202	0.200	0.198	0.191
u’P7-O1’	0.600	0.594	0.594	0.591
					u’FP1-F3’	0.519	0.512	0.507	0.501
u’F3-C3’	0.543	0.536	0.533	0.533
					u’C3-P3’	0.899	0.899	0.895	0.894
u’P3-O1’	0.215	0.216	0.219	0.220
					u’FP2-F4’	0.294	0.290	0.289	0.286
u’F4-C4’	0.302	0.300	0.300	0.297
					u’C4-P4’	0.658	0.657	0.655	0.653
u’P4-O2’	0.122	0.136	0.142	0.146
					u’FP2-F8’	0.429	0.418	0.413	0.402
u’F8-T8’	0.207	0.206	0.201	0.198
					u’T8-P8’	0534	0.533	0.533	0.531
u’P8-O2’	0.407	0.420	0.426	0.432
					u’FZ-CZ’	0.788	0.797	0.806	0.818
u’CZ-PZ’	0.433	0.433	0.434	0.435
					u’P7-T7’	0.948	0.944	0.937	0.929
u’T7-FT9’	0.317	0.318	0.316	0.315
					u’FT9-FT10’	0.811	0.811	0.804	0.803
u’FT10-T8’	0.508	0.509	0.502	0.492

Finally, the electroencephalogram signal sample after channel weighting is input to a dimensionality reduction layer, a large amount of redundant information in electroencephalogram data is eliminated by using one-dimensional convolution, the robustness of the model is improved, and a redundancy removal sequence is obtained

Wherein

Representing the electroencephalogram signal sample with the i-th segment removed with redundant information, and calculating as shown in formula (1):

in the formula (1), w ₁ ,w ₂ ,…,w _n Representing a learnable channel weight, Conv1D represents a one-dimensional convolution.

Step 2.3, will remove redundant sequence

Input into a spatio-temporal joint MLP module to extract integersThe time-space combination characteristic is implemented as follows: firstly inputting a redundancy removing sequence into an inter-channel MLP layer, firstly standardizing the redundancy removing sequence through a layer standardization (LN) operation, then expanding the number of channels of the redundancy removing sequence through a conversion full-connection layer, then carrying out nonlinear mapping on data by using a nonlinear activation function GELU (global warming) to improve the nonlinear fitting capability of a network on electroencephalogram signals, and finally restoring the expanded number of channels to the number of channels before expansion through a restoration full-connection layer, and realizing the extraction and integration of spatial correlation among channels of the multichannel electroencephalogram signals through the above operations to obtain a spatial correlation sequence

Wherein

Representing the ith section of electroencephalogram signal sample which is extracted and integrated with the spatial correlation between channels;

the intra-channel MLP layer and the inter-channel MLP layer have similar internal structures, except that the inter-channel MLP layer operates on the channel dimension (spatial dimension) of the input data, and the intra-channel MLP layer operates on the length dimension (temporal dimension) of the input data. Through similar operation in the MLP layer between the channels, the MLP layer in the channels extracts time information in each channel of the multi-channel electroencephalogram signals to obtain a time information sequence

Wherein

Representing the ith section of electroencephalogram signal sample after time information extraction, and calculating by the space-time joint MLP module as shown in formula (2):

in the formula (2), W ₂ ,W ₃ ,W ₄ ,W ₅ Indicating full connectivity, LN indicates layer normalization, and σ indicates the GELU nonlinear activation function.

Step 2.4,

After being processed by the classification module, the ith section of electroencephalogram sample X is output _i So that the final classification result can be obtained; the specific process is as follows:

extracting time information electroencephalogram sample sequence of spatial correlation and time information

Inputting the data into a classification module, and obtaining a time dimension reduction sequence after the data is processed by an average pooling layer

Wherein the content of the first and second substances,

representing the I-th time dimension (length dimension) reduced EEG sample, and then reducing the time dimension sequence through the full connection layer

After space dimension reduction, a category score sequence is obtained

Wherein the content of the first and second substances,

representing each category score of the ith electroencephalogram sample, and finally outputting the ith electroencephalogram sample X through a Softmax layer _i Selecting the class with the highest probability as a final classification result, wherein the calculation process is shown as formula (3):

in the formula (3), f _{averagepooling} (. represents average pooling f) _softmax (. for) Softmax

Step 3, model training:

in the model training stage, cross entropy is used as a loss function, an ADAM optimizer is used for optimizing the network, and the gradient of the loss function is calculated and used for updating the network weight parameters until the maximum iteration times are reached; in the example, the number of samples trained each time is set to be 16, the initial learning rate of the ADAM optimizer is set to be 0.001, and a model with the lowest verification loss in a test set is selected as an optimal model;

step 4, calibrating the model:

in practical situations, another electroencephalogram signal does not appear in the continuous electroencephalogram signals. However, the condition that another isolated electroencephalogram signal appears in the continuous electroencephalogram signals is easy to appear in the electroencephalogram signal classification task. To solve this problem, a moving average filtering algorithm is used to calibrate the sequence of predictions for the model. Taking the ith segment of electroencephalogram sample X _i And the following M-1 segment of electroencephalogram sample { X _i+1 ,X _i+2 ,...,X _i+M-1 Taking the mean value of the probabilities of each category as X of the ith sample _i The probability of each class in (c) is calculated as shown in equation (4).

In the formula (4), p _i+k Indicates the class probabilities for the i + k sample,

indicating the calculated class probabilities for the ith sample.

The application example of the space-time interactive MLP network is electroencephalogram type prediction based on electroencephalogram signals, experiments are respectively carried out on two public data sets of CHB-MIT and Kaggle, a representative CNN model and a traditional MLP model are used as comparison models, and the performance of the proposed network is verified. In this example, four widely used evaluation indexes were used to measure model performance. The sensitivity (Sn) is the ratio of the number of times of correctly predicting the electroencephalogram and the total number of times of appearing the electroencephalogram, which are obtained by adopting a k-threshold method (continuous k certain predictions mean that the electroencephalogram is detected); the false alarm rate (FPR) is defined as the number of times of wrong prediction per hour; AUC is an important index for accurately balancing the model prediction performance, the AUC value of the random classifier is 0.5, and the AUC value of the perfect classifier can reach 1; the p value can be used for measuring whether the model is superior to a stochastic classifier, and when the p value (p-value) is less than 0.05, the model is obviously superior to the stochastic classifier under the significance level of 0.05. Tables 1 and 2 show the behavior of the spatio-temporal interactive MLP network of the present invention and the two comparative models on the CHB-MIT data set and the Kaggle data set, respectively. Compared with the traditional CNN model, the space-time combined MLP network provided by the invention realizes a better prediction result by using approximately equal parameters and a shallower network structure, and compared with the traditional MLP network, the space-time combined MLP network provided by the invention realizes a better prediction result by using less than 1/4 parameters, so that the validity of the space-time interactive MLP network provided by the invention is fully shown, the space relation and time information are fully extracted, the classification prediction performance is improved, and very excellent performances are obtained on both a CHB-MIT data set and a Kaggle data set, which shows that the space-time interactive MLP network provided by the invention has better identification capability and stronger generalization performance on electroencephalogram categories of a subject.

TABLE 1 average Performance of contrast model on CHB-MIT database for electroencephalogram classification

TABLE 2 average performance of contrast models on Kaggle database for classification of EEG signals

In conclusion, the method can effectively reduce high-frequency noise artifacts in the original electroencephalogram signals, strengthen the effect of important channels on making correct inference, effectively remove redundant information in the electroencephalogram signals, and improve the identification robustness of the model on the original electroencephalogram data; meanwhile, abundant time information and spatial correlation in the electroencephalogram signals can be fully utilized, the accuracy of the prediction result of the model is greatly improved, a remarkable prediction result is obtained in the prediction of the two-classification electroencephalogram types on the data sets CHB-MIT and Kaggle, and meanwhile, the false alarm rate of the electroencephalogram type prediction is effectively reduced; and because the MLP network has a simple structure, the spatio-temporal joint MLP network provided by the invention is easier to train and has less time for training and testing compared with a CNN network.

Claims

1. An electroencephalogram classification method based on a space-time combined MLP network is characterized by comprising the following steps:

step 1, acquiring an electroencephalogram signal data set with labeled category information, performing sliding slicing processing on the electroencephalogram signal of each category, reconstructing the input shape of the sliced electroencephalogram signal, obtaining N segments of electroencephalogram signal samples with total time length T, and recording as a training sample set X ═ { X ═ X ₁ ,X ₂ ,...,X _i ,...,X _N And recording the label set of the training sample X as Y ═ Y ₁ ,Y ₂ ,...,Y _i ,...,Y _N }; wherein, X _i ∈R ^C×1×L Representing the ith section of electroencephalogram signal sample after input shape reconstruction, C representing the number of channels of the electroencephalogram signal sample, and L representing the length of the electroencephalogram signal sample; y is _i As the ith segment of EEG signal sample X _i A corresponding label;

the de-noising layer comprises a manually set matrix filter with randomly initialized element values, and firstly, the de-noising layer converts a training sample set X (X) through fast Fourier transform ₁ ,X ₂ ,...,X _i ,...,X _N Converting the time domain to the frequency domain, multiplying the training sample set transformed to the frequency domain by a learnable matrix filter to obtain a denoised training sample set, and then performing inverse fast Fourier transform on the denoised training sample setConverting the training sample set into a time domain to obtain a time domain denoising electroencephalogram sample sequence

Wherein, X _i ^d ∈R ^C×1×L Representing a denoised i-th segment of time domain electroencephalogram signal sample;

the brain electrical signal sequence

Wherein the content of the first and second substances,

the weighting layer comprises a channel weight matrix which is manually preset and can be learned by diagonal element values; the weighting layer firstly carries out two-dimensional electroencephalogram sample sequence

Wherein the content of the first and second substances,

Wherein the content of the first and second substances,

representing the ith section of electroencephalogram sample after removing redundant information;

layer pair redundancy removal electroencephalogram sample sequence

After normalization processing, the spatial correlation electroencephalogram sample sequence is obtained after processing of the transformation full-connection layer, the GELU activation function and the restoration full-connection layer in sequence

Wherein the content of the first and second substances,

the intra-channel MLP layer and the inter-channel MLP layer have the same structure, and are used for spatially correlating electroencephalogram sample sequences

Wherein the content of the first and second substances,

time information electroencephalogram sample sequence

step 3, model training:

taking the ith section of electroencephalogram sample X _i And its following M-1 segment of electroencephalogram samples { X _i+1 ,X _i+2 ,...,X _i+M-1 The mean value of probability values of each category corresponding to each electroencephalogram signal sample in the data is correspondingly used as the X of the ith sample _i Each category probability value of (1).

2. An electronic device comprising a memory and a processor, wherein the memory is configured to store a program that enables the processor to perform the electroencephalogram classification method of claim 1, and the processor is configured to execute the program stored in the memory.

3. A computer-readable storage medium, having a computer program stored thereon, wherein the computer program, when executed by a processor, performs the steps of the electroencephalogram classification method of claim 1.