CN113486794B - Motor imagery electroencephalogram signal classification method based on hybrid model - Google Patents

Motor imagery electroencephalogram signal classification method based on hybrid model Download PDF

Info

Publication number
CN113486794B
CN113486794B CN202110763496.8A CN202110763496A CN113486794B CN 113486794 B CN113486794 B CN 113486794B CN 202110763496 A CN202110763496 A CN 202110763496A CN 113486794 B CN113486794 B CN 113486794B
Authority
CN
China
Prior art keywords
electroencephalogram
matrix
data
motor imagery
cluster
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110763496.8A
Other languages
Chinese (zh)
Other versions
CN113486794A (en
Inventor
付荣荣
向艺凡
王世伟
李威帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yanshan University
Original Assignee
Yanshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yanshan University filed Critical Yanshan University
Priority to CN202110763496.8A priority Critical patent/CN113486794B/en
Publication of CN113486794A publication Critical patent/CN113486794A/en
Application granted granted Critical
Publication of CN113486794B publication Critical patent/CN113486794B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction

Abstract

The invention provides a motor imagery electroencephalogram signal classification method based on a hybrid model, which comprises the following steps: s1, data extraction: collecting two groups of data from a database as an electroencephalogram database; s2, data reconstruction: the electroencephalogram data in the electroencephalogram database are normalized and stored in a four-dimensional tensor structure; s3, feature optimization: optimizing the electroencephalogram data by using an ODV-CSSD algorithm to obtain optimal electroencephalogram characteristics; and S4, forming interpretable clusters by using the prior information, and establishing decision boundaries to obtain final class attribution results. The invention provides a new motor imagery electroencephalogram signal classification method aiming at the problem that a common space mode is too sensitive to abnormal data to cause easy overfitting, provides a new feature optimization algorithm ODV-CSSD and researches.

Description

Motor imagery electroencephalogram signal classification method based on hybrid model
Technical Field
The invention relates to the field of electroencephalogram signal classification, in particular to a motor imagery electroencephalogram signal classification method based on a mixed model.
Background
The human brain is composed of neurons connected to each other, and an Electroencephalogram (EEG), which is an electrical signal generated by the activity between neurons, records the change of the electrical signal during the brain activity, and is an overall reflection of the electrophysiological activity of brain neurons on the surface of the cerebral cortex or scalp. Brain-Computer Interface (BCI) technology converts such electrical signals into control commands, thereby providing a communication path between the Brain and external devices (e.g., BCI wheelchairs, prosthetics, robotic arms).
The core of BCI technology is to regulate the self-adaptive control relation between human brain and machine, i.e. to find out proper signal processing and information converting control algorithm and to convert the brain electric signal into one kind of operation system signal capable of being identified accurately by computer. The original electroencephalogram signals inevitably contain electro-ocular interference, myoelectric interference and power frequency interference, and the noise can greatly increase the computation amount, complexity and precision of electroencephalogram signal processing, so that in a BCI system, effective noise elimination and extraction of electroencephalogram features with high separability are very important for accurate decoding of brain activities.
At present, a great deal of research at home and abroad is dedicated to extraction and classification of electroencephalogram features in a Spatial domain, a Spatial filter is designed to improve the quality of electroencephalogram signals and extract features, a Common Spatial Pattern (CSP) is a relatively representative algorithm, a certain class of variance is maximized and another class of variance is minimized by finding a group of Spatial filters, and the Spatial filters are widely applied to electroencephalogram signal processing.
Disclosure of Invention
In order to solve the defects of the prior art, the invention provides a motor imagery electroencephalogram signal classification method based on a hybrid model, which classifies the motor imagery electroencephalogram signals by adopting a hybrid discrimination model, improves the quality of the electroencephalogram signals, and simultaneously helps solve the problem of overfitting easily appearing in a small sample training set by selecting the optimal number of electroencephalogram features, thereby improving the classification effect.
In order to achieve the purpose, the invention discloses the following technical scheme:
specifically, the invention provides a motor imagery electroencephalogram signal classification method based on a hybrid model, which comprises the following steps:
s1, data extraction: collecting two groups of data from a database as an electroencephalogram database;
s2, data reconstruction: the electroencephalogram data in the electroencephalogram database are normalized and stored in a four-dimensional tensor structure;
s3, feature optimization: the method for optimizing the electroencephalogram data by utilizing the ODV-CSSD algorithm to obtain the optimal electroencephalogram characteristics specifically comprises the following substeps:
s31, setting XiRepresenting multi-channel brain electrical signals under a motor imagery task, wherein i is 1,2, wherein 1 represents left hand motor imagery, 2 represents right hand motor imagery, and a normalized spatial covariance matrix RiComprises the following steps:
Figure BDA0003150828890000021
wherein, XTIs the transpose of matrix X, trace (,) is the sum of diagonal elements of the matrix;
s32, averaging covariance matrixes of a plurality of electroencephalogram data in the same task mode to obtain an average spatial covariance matrix:
Figure BDA0003150828890000022
Figure BDA0003150828890000023
wherein N represents the number of trials for each class;
s33, obtaining the eigenvalue and the eigenvector according to the following formulas:
Figure BDA0003150828890000024
wherein W is a characteristic vector matrix, and Λ is a diagonal matrix of characteristic values, passing through the W matrix;
s34, filtering the electroencephalogram signal by using the following formula:
ZC×T=WC×CRC×T
wherein C represents the channel number of the EEG signal, T represents the sampling number in one test, and ZC×TFor filteringA wave matrix;
s35 Slave filter matrix ZC×TExtracting a feature vector Fp:
Figure BDA0003150828890000025
wherein, the matrix ZpFront m rows and back m rows containing Z;
s35, setting the following optimal discrimination criteria, and searching the optimal discrimination vector:
Figure BDA0003150828890000026
wherein A ═ cW1+(1-c)W2C is 0-1, d is the L-dimensional column vector of the projection data, dTIs a transpose of d, WiThe calculation formula of the in-class dispersion of the ith class is as follows:
Figure BDA0003150828890000027
wherein Δ is the difference between the estimated mean values, and the calculation formula is: Δ ═ μ12In which μiThe mean value for class i is calculated according to the following formula:
Figure BDA0003150828890000031
wherein the content of the first and second substances,
Figure BDA0003150828890000032
it represents the jth sample vector of class i, NiThe number of samples in the ith class;
wherein, the first optimal projection direction is:
d1=α1A-1Δ
wherein alpha is1Is to make
Figure BDA0003150828890000033
And is
Figure BDA0003150828890000034
The second optimal projection direction is obtained by maximizing R, and the constraint that the second optimal projection direction needs to be satisfied is d1And d2Are orthogonal, i.e.
Figure BDA0003150828890000035
Solving for the second optimal projection direction is:
Figure BDA0003150828890000036
wherein constant b1Comprises the following steps:
Figure BDA0003150828890000037
and S4, forming interpretable clusters by using the prior information, and establishing decision boundaries to obtain final class attribution results.
Preferably, the step S2 specifically includes performing slicing processing on the acquired EEG data, and storing the processed EEG data in a structure of a four-dimensional tensor, where four dimensions of the four-dimensional tensor are the sampling number, the channel number, the event trigger number, and the category.
Preferably, in step S4, forming interpretable clusters by using the priori information through a DRMM algorithm, and establishing decision boundaries to obtain final class attribution results.
Preferably, the implementation process of the DRMM algorithm in step S4 includes the following sub-steps:
s41, X represents a feature generated by a rule, and y includes a cluster structure. The features are normalized so that each feature has a zero mean and a unit variance. Let k be the number of classes, using ZnTo indicate class attribution of the nth sample in k-dimensional data, e.g. Z nk1 denotes the firstThe n data belong to the kth cluster. Using vectors
Figure BDA0003150828890000038
Represents a decision boundary of the cluster k in d-dimension, wherein
Figure BDA0003150828890000039
The lower boundary is represented by the lower boundary,
Figure BDA00031508288900000310
the upper boundary is represented by the upper boundary,
Figure BDA00031508288900000311
to represent
Figure BDA00031508288900000312
The transposing of (1). Assuming that all samples belonging to the kth cluster are within the corresponding decision rectangle, i.e. x for all samples belonging to the clusterndAll satisfy
Figure BDA00031508288900000313
Suppose that all decision boundaries obey the prior distribution p (t)kd)
K=2
Wherein the content of the first and second substances,
Figure BDA0003150828890000041
and
Figure BDA0003150828890000042
the position of a decision boundary of the prior rule is represented, if the prior rule does not exist, the position is
Figure BDA0003150828890000043
αtAnd betatIs a positive parameter controlling the balance between these links;
s42, determining matrix
Figure BDA0003150828890000044
Defining a parameter Zn to represent the cluster index:
Figure BDA0003150828890000045
since the present invention computes the feature to two dimensions, D ═ 2. f (t) is an exponential function satisfying t ≧ 0:
Figure BDA0003150828890000046
the result of the calculation is a logical AND operator, so if xndWithin the decision rectangle, then Z nk1, otherwise ZnkThe irreparability of 0, f (t) makes it difficult to optimize the decision rectangle, so g (t) is used instead of f (t), and g (t) is defined as follows:
Figure BDA0003150828890000047
wherein a is a positive parameter for defining a soft steepness function, such that a is greater in value to indicate that g (t) is closer to f (t), g (t) is substituted for f (t), and a new variable γ is definednk
Figure BDA0003150828890000048
Due to 0<g(t)<1, then t has 0 for all actual values<γnk<1, for the kth cluster, if sample xndWithin the rectangular decision boundary, then γ nk1, or else γ nk0. From this can be seen γnkCan become a soft clustering index;
s43, suppose ynObeying mixed Gaussian distributions
Figure BDA0003150828890000049
Wherein, mukSum ΣkIs the mean vector and covariance matrix of the kth cluster;
the joint probability of DRMM is:
Figure BDA00031508288900000410
wherein the content of the first and second substances,
Figure BDA0003150828890000051
representing all of the decision boundaries of the rectangle,
Figure BDA0003150828890000052
all parameters are represented;
given the parameters { T, Θ } and the observed variables { Φ, X, Y }, the posterior probabilities of the latent variables Z are calculated:
Figure BDA0003150828890000053
where const denotes a constant that is not a function of Z, and the expectation of the combined logarithm is calculated given the posterior distribution Z as follows:
Figure BDA00031508288900000510
wherein the content of the first and second substances,
Figure BDA0003150828890000054
is that
Figure BDA0003150828890000055
The average vector of (a); d is the extracted feature dimension; k is a clustering number; pinkIs a k-dimensional vector representing the expected values of the posterior distribution.
Finding the optimal value of T, Θ to maximize W (T, Θ) using the following formula:
Figure BDA0003150828890000056
computing
Figure BDA0003150828890000057
And for each cluster K e 1,2, …, K ∈ { μ }k,∑k};
Figure BDA0003150828890000058
The optimum value of (c) is calculated as follows:
Figure BDA0003150828890000059
k,∑kis calculated as follows:
Figure BDA0003150828890000061
Figure BDA0003150828890000062
compared with the prior art, the invention has the following beneficial effects:
the invention provides a novel motor imagery electroencephalogram signal classification method aiming at the problem that a common space mode is too sensitive to abnormal data and is easy to overfit, feature selection and optimization are carried out through a novel provided algorithm ODV-CSSD, and feature optimization is realized through the algorithm from two angles: firstly, optimization is realized by finding a group of optimal feature vectors and projecting all the optimal feature vectors to construct an optimal feature space, and the influence generated by abnormal data can be reduced by the features found by the optimal feature space; and secondly, the optimal feature number is searched and decision is made, and the feature is prevented from containing redundant information, so that optimization is realized. The invention realizes the extraction and classification of the left and right hand motor imagery electroencephalogram signals and effectively improves the identification accuracy.
Drawings
FIG. 1 is a schematic flow diagram of the present invention;
FIG. 2 is a schematic of a reconstructed data structure;
FIG. 3 is a schematic of the clustering results for the extracted two-dimensional optimal features;
FIG. 4 is a ROC curve for a two-dimensional optimal feature;
fig. 5 is an overall flow chart of the present invention.
Detailed Description
Exemplary embodiments, features and aspects of the present invention will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
The core of the invention is to provide a new spatial filtering method to process electroencephalogram signals, a mixed discrimination model is adopted to classify the motor imagery electroencephalogram signals, the quality of the electroencephalogram signals is improved, and simultaneously, the optimal electroencephalogram feature number is selected to help solve the problem of overfitting easily appearing in a small sample training set, so that the classification effect is improved.
The invention provides a motor imagery electroencephalogram signal classification method based on a mixed model, which comprises the following steps of:
s1, establishing an electroencephalogram database, and collecting electroencephalogram data by 59 EEG electrodes densely distributed in a brain sensorimotor region or directly calling the existing electroencephalogram data to serve as the electroencephalogram database.
S2, data reconstruction: the electroencephalogram data are normalized and stored in a four-dimensional tensor structure, the data structure is shown in fig. 2, wherein the four dimensions are as follows: sampling number, channel number, event triggering times and category.
S3, feature optimization: and obtaining the optimal electroencephalogram characteristics by using an ODV-CSSD algorithm.
S4, forming interpretable clusters by using the prior information, establishing decision boundaries, and obtaining a final category attribution result, as shown in FIG. 3.
In an embodiment of the present invention, step S1 is specifically executed by constructing the input data of the present invention from data of two different sources: subjects a, B, C, D are MI-EEG data from the german team in the fourth brain-computer interface competition; subjects E, F, G were tested in the same paradigm as brain-computer interface competition and were acquired with 59 EEG electrodes densely distributed in the sensorimotor region.
In an embodiment of the present invention, step S2 specifically includes performing slicing processing on the acquired EEG data, and storing the processed EEG data in a four-dimensional tensor structure, where the four dimensions of the tensor are: number of samples, number of channels (number of electrodes), number of event triggers, and type.
In one embodiment of the invention, in step S3, obtaining the optimal electroencephalogram features is achieved by using an ODV-CSSD algorithm.
In one embodiment of the invention, the specific steps of calculating the optimal electroencephalogram characteristic by adopting the ODV-CSSD method are as follows:
let Xi (i ═ 1,2, where 1 represents the left-hand motor imagery and 2 represents the right-hand motor imagery) denote the multichannel brain electrical signals under the motor imagery task. The normalized spatial covariance matrix Ri is:
Figure BDA0003150828890000071
where XT is the transpose of matrix X and trace (,) is the sum of the diagonal elements of the matrix.
Carrying out mean value processing on covariance matrixes of a plurality of experimental data in the same task mode to obtain an average spatial covariance matrix:
Figure BDA0003150828890000072
Figure BDA0003150828890000073
where N represents the number of trials per class.
The eigenvalues and eigenvectors are obtained according to the following equations:
Figure BDA0003150828890000074
wherein W is an eigenvector matrix, and Λ is a diagonal matrix of eigenvalues. Filtering the electroencephalogram signal through the W matrix using:
ZC×T=WC×CRC×T
wherein C represents the channel number of the electroencephalogram signal, and T represents the sampling number in one test.
From the filter matrix ZC×TExtracting a feature vector Fp (the dimension does not exceed the number of channels):
Figure BDA0003150828890000081
wherein the matrix Zp comprises the first m rows and the last m rows of Z.
To find the optimal discrimination vector, the following optimal discrimination criteria are set:
Figure BDA0003150828890000082
wherein A ═ cW1+(1-c)W2C is more than or equal to 0 and less than or equal to 1. d is an L-dimensional column vector of the projection data. dT is the transpose of d. Wi is the in-class dispersion of the ith class, and the calculation formula is as follows:
Figure BDA0003150828890000083
wherein Δ is the difference between the estimated mean values, and the calculation formula is: Δ ═ μ12. Wherein muiThe mean value for class i is calculated according to the following formula:
Figure BDA0003150828890000084
wherein the content of the first and second substances,
Figure BDA0003150828890000085
which represents the jth sample vector of the ith class. Ni is the number of samples of the ith class. It is worth noting that: the generalized ratio R is independent of the size of the vector d, i.e., R (d) ═ R (α d).
The first optimal projection direction is:
d1=α1A-1Δ
wherein α 1 is
Figure BDA0003150828890000086
And is
Figure BDA0003150828890000087
The second best projection direction is obtained by maximizing R, and the constraint that the second best projection direction is orthogonal to d2, i.e. d1 is orthogonal to d2
Figure BDA0003150828890000088
Solving for the second optimal projection direction is:
Figure BDA0003150828890000089
where the constant b1 is:
Figure BDA00031508288900000810
the CSP is used for extracting features from the original electroencephalogram signals, and the optimal electroencephalogram features can be obtained after the optimal discrimination vector projection. Meanwhile, the number of the optimal electroencephalogram features can be controlled to achieve the optimal dimensionality.
In one embodiment of the present invention, in step S4, interpretable clusters are formed using a priori information through a DRMM algorithm.
In an embodiment of the present invention, the DRMM algorithm is implemented as follows:
let X denote via rule generationThe feature y comprises a cluster structure. The features are normalized so that each feature has a zero mean and a unit variance. Let k be the number of classes, and Zn is used to represent class attribution of nth sample in k-dimensional data, such as Z nk1 indicates that the nth data belongs to the kth cluster.
Using vectors
Figure BDA0003150828890000091
Representing the decision boundary of cluster k in d-dimension. It is assumed that all samples belonging to the kth cluster are within the corresponding decision rectangle, i.e. all samples belonging to the cluster satisfy
Figure BDA0003150828890000092
Suppose that all decision boundaries obey the prior distribution p (t)kd):
Figure BDA0003150828890000093
Wherein the content of the first and second substances,
Figure BDA0003150828890000094
and
Figure BDA0003150828890000095
indicating where the decision boundary of the prior rule is located. If there is no prior rule, then
Figure BDA0003150828890000096
α t and β t are positive parameters that control the balance between these links.
In the decision matrix
Figure BDA0003150828890000097
Defining a parameter Zn to represent the cluster index:
Figure BDA0003150828890000098
wherein the content of the first and second substances,
Figure BDA0003150828890000099
wherein, f (t) is an exponential function satisfying t ≧ 0, and the calculation result can be regarded as a logical AND operator. Thus, if xn is inside the decision rectangle, Znk is 1, otherwise Znk is 0. The irreparability of f (t) makes it difficult to optimize the decision rectangle, so g (t) is substituted for f (t), which is defined as follows:
Figure BDA00031508288900000910
where a is a positive parameter used to define a soft "steepness" function, so that a greater value of a indicates that g (t) is closer to f (t). Replacing f (t) with g (t), and defining a new variable γ nk:
Figure BDA00031508288900000911
since 0< g (t) <1, then 0< γ nk <1 for all actual values of t. For the kth cluster, if sample xn is within the rectangular decision boundary, γ nk ≈ 1, otherwise γ nk ═ 0. Therefore, the gamma nk can be a soft clustering index.
If γ nk is used as a soft clustering index, the following two conditions must be satisfied:
the xn is positioned in a judgment rectangle of the kth cluster, namely gamma nk is approximately equal to 1;
xn is located outside all other decision rectangles, i.e. γ nk ≈ 0 for all j ≠ k.
Suppose yn follows a mixed Gaussian distribution:
Figure BDA0003150828890000101
where μ k and Σ k are the mean vector and covariance matrix of the kth cluster.
The joint probability of DRMM is:
Figure BDA0003150828890000102
wherein the content of the first and second substances,
Figure BDA0003150828890000103
representing all of the decision boundaries of the rectangle,
Figure BDA0003150828890000104
all parameters are indicated.
Calculating posterior probability of latent variable Z given parameters { T, Θ } and observed variables { Φ, X, Y }, and calculating posterior probability of latent variable Z
Figure BDA0003150828890000105
Where const denotes a constant that is not a function of Z. With the posterior distribution Z known, the expectation of the combined logarithm is calculated by:
Figure BDA0003150828890000111
the optimal T, Θ value is found using the following formula to maximize W (T, Θ)
Figure BDA0003150828890000112
Computing
Figure BDA0003150828890000113
And for each cluster K e 1,2, …, K ∈ { μ }k,∑k}。
Figure BDA0003150828890000114
The optimum value of (c) is calculated as follows:
Figure BDA0003150828890000115
k,∑kis calculated as follows:
Figure BDA0003150828890000116
Figure BDA0003150828890000117
the result of clustering according to the extracted two-dimensional optimal electroencephalogram features is shown in fig. 3. The DRMM algorithm, after learning the optimal features, gives a predicted class label, and at the same time, finds a rectangular decision rule for each cluster. The rectangular decision rule can visually interpret the differences between each cluster. Receiver operating characteristic curves (ROCs) are plotted for the predicted labels after optimal feature learning according to the DRMM and corresponding AUC values are calculated, as shown in fig. 4. The experimental objects A and B are brain-computer interface competition data, the experimental objects F and G are self-collected data, and the AUC values of the other three experimental objects except the experimental object G are all larger than 0.96, so that the experimental object A and the experimental object B have better performance.
In order to embody the performance of the algorithm of the invention, the DRMM clustering result is compared with the Fuzzy C Mean (FCM) and K-means clustering results, as shown in Table 1, wherein the bold numbers are the optimal identification accuracy of each experimental object. As can be seen from the table, the identification accuracy of the DRMM is greater than or equal to 0.91, and the DRMM has better performance than FCM and K-means for all test objects, so that the effectiveness of the feature optimization algorithm is verified.
Figure BDA0003150828890000121
The overall flow chart of the present invention is shown in fig. 5.
Finally, it should be noted that: the above-mentioned embodiments are only used for illustrating the technical solution of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (4)

1. A motor imagery electroencephalogram signal classification method based on a hybrid model is characterized in that: which comprises the following steps:
s1, data extraction: collecting two groups of data from a database as an electroencephalogram database;
s2, data reconstruction: the electroencephalogram data in the electroencephalogram database are normalized and stored in a four-dimensional tensor structure;
s3, feature optimization: the method for optimizing the electroencephalogram data by utilizing the ODV-CSSD algorithm to obtain the optimal electroencephalogram characteristics specifically comprises the following substeps:
s31, setting XiRepresenting multi-channel electroencephalogram signals under motor imagery tasks stored in a database, wherein i is 1 and 2, wherein 1 represents left-hand motor imagery, 2 represents right-hand motor imagery, and a normalized spatial covariance matrix RiComprises the following steps:
Figure FDA0003520201250000011
wherein, XiIs an electroencephalogram signal, Xi TIs a transposed matrix of matrix X, trace () is the sum of diagonal elements of the matrix;
s32, averaging covariance matrixes of a plurality of electroencephalogram data in the same task mode to obtain an average spatial covariance matrix:
Figure FDA0003520201250000012
Figure FDA0003520201250000013
wherein the content of the first and second substances,
Figure FDA0003520201250000014
is an average spatial covariance matrix under the left-hand motor imagery,
Figure FDA0003520201250000015
the mean space covariance matrix under the right-hand motor imagery is obtained, and N is the test times of each class; r1nAnd R2nRespectively is a normalized spatial covariance matrix under the nth test left-hand and right-hand motor imagery tasks;
s33, obtaining the eigenvalue and the eigenvector according to the following formulas:
Figure FDA0003520201250000016
wherein W is an eigenvector matrix, Λ is a diagonal matrix of eigenvalues, and W is obtained through the W matrixTA transposed matrix that is W;
s34, filtering the electroencephalogram signal by using the following formula:
ZC×T=WC×CRC×T
wherein C represents the number of channels when the electroencephalogram signal is obtained, T represents the number of samples in one test, and ZC×TIs a filter matrix;
s35 selecting a filter matrix ZC×TThe front m rows and the rear m rows ofpM is the logarithm of the spatial filter and p is the matrix ZpFrom the matrix Z, i.e. p is 2mpExtracted feature vector Fp
Figure FDA0003520201250000017
Wherein, the matrix ZpComprising ZC×TM is the logarithm of the spatial filter, i.e. m pairs of spatial filters are selected, i is used for summation, representing the calculation var (Z)1),…,var(Z2m) And performing summation calculation on the two;
s36, setting the following optimal discriminant criterion R (d) and searching the optimal discriminant vector:
Figure FDA0003520201250000021
wherein A ═ cW1+(1-c)W2C is 0-1, d is the column vector of the projection data, dTIs a transpose of d, W1And W2The dispersion in class is 1 class and 2 class respectively, and the calculation formulas are respectively:
Figure FDA0003520201250000022
Figure FDA0003520201250000023
wherein c is W1And W2Since the contribution levels of the two types of tests are consistent, c is 0.5, Δ is the difference between the estimated average values, and the calculation formula is as follows: Δ ═ μ12In which μiThe mean value for class i is calculated according to the following formula:
Figure FDA0003520201250000024
wherein x isij=(xij1,xij2..........xijL)TWhich represents the jth sample vector of class i, L being the number of vector elements, NiThe number of samples in the ith class;
first optimal projection direction d1Calculated as follows:
d1=α1A-1Δ
wherein alpha is1Is to make
Figure FDA0003520201250000025
And is
Figure FDA0003520201250000026
Obtaining a second optimal projection direction by maximizing R (d), wherein the constraint that the second optimal projection direction needs to be satisfied is d1And d2Are orthogonal, i.e.
Figure FDA0003520201250000027
Solving for the second optimal projection direction is:
d2=α2{A-1-b1[A-1]2
wherein alpha is2Has the same action as alpha1,b1Comprises the following steps:
Figure FDA0003520201250000028
and S4, forming interpretable clusters by using the prior information, and establishing decision boundaries to obtain final class attribution results.
2. The hybrid model-based motor imagery electroencephalogram signal classification method of claim 1, wherein: the step S2 is specifically to slice the acquired EEG data, store the processed EEG data in a structure of a four-dimensional tensor, where four dimensions of the four-dimensional tensor are the sampling number, the channel number, the event triggering number and the category.
3. The hybrid model-based motor imagery electroencephalogram signal classification method of claim 1, wherein: in step S4, forming interpretable clusters by using the priori information through a DRMM algorithm, and establishing decision boundaries to obtain final class attribution results.
4. The hybrid model-based motor imagery electroencephalogram signal classification method of claim 3, wherein: the implementation process of the DRMM algorithm in step S4 includes the following sub-steps:
s41, let X denote the feature generated by the rule, ynIncluding a clustering structure, normalizing the features such that each feature has a zero mean and a unit variance, assuming k as a serial number of the cluster and Z as the serial number of the clusternkTo indicate whether the nth sample belongs to the kth cluster or not, using a vector
Figure FDA0003520201250000031
Represents a decision boundary of the cluster k in d-dimension, wherein
Figure FDA0003520201250000032
The lower boundary is represented by the lower boundary,
Figure FDA0003520201250000033
the upper boundary is represented by the upper boundary,
Figure FDA0003520201250000034
to represent
Figure FDA0003520201250000035
Assuming that all samples belonging to the kth cluster are within the corresponding decision rectangle, i.e. for all samples x belonging to the clusterndAll satisfy
Figure FDA0003520201250000036
Suppose that all decision boundaries obey the prior distribution p (t)kd):
Figure FDA0003520201250000037
Wherein the content of the first and second substances,
Figure FDA0003520201250000038
and
Figure FDA0003520201250000039
the position of a decision boundary of the prior rule is represented, if the prior rule does not exist, the position is
Figure FDA00035202012500000310
αtAnd betatAre positive parameters for controlling the balance among the links;
s42, determining matrix
Figure FDA00035202012500000311
Define a parameter ZnkTo represent the cluster index:
Figure FDA00035202012500000312
wherein D is 2, f (t) is an exponential function satisfying t ≧ 0:
Figure FDA00035202012500000313
the result of the calculation is a logical AND operator, so if xndWithin the decision rectangle, then Znk1, otherwise ZnkThe irreparability of 0, f (t) makes it difficult to optimize the decision rectangle, so g (t) is used instead of f (t), and g (t) is defined as follows:
Figure FDA0003520201250000041
wherein a is a positive parameter for defining a soft steepness function, such that a is greater in value to indicate that g (t) is closer to f (t), g (t) is substituted for f (t), and a new variable γ is definednk
Figure FDA0003520201250000042
Due to 0<g(t)<1, then t has 0 for all real time values<γnk<1, for the kth cluster, if sample xndWithin the rectangular decision boundary, then γnk1, or else γnkWhen the measured value is 0, gamma is observednkCan be used as a soft clustering index;
s43, suppose ynObeying a mixed Gaussian distribution:
Figure FDA0003520201250000043
wherein, yn|znIs the conditional probability, μkSum ΣkIs the mean vector and covariance matrix of the kth cluster;
the joint probability of DRMM is:
Figure FDA0003520201250000044
wherein the content of the first and second substances,
Figure FDA0003520201250000045
representing all of the decision boundaries of the rectangle,
Figure FDA0003520201250000046
representing all parameters, random variables
Figure FDA0003520201250000047
If the feature is within the decision rectangle then phin1, otherwise phin0, Z is a latent variable, { X, Y } is an observed variable, N is the number of samples;
given the parameters { T, Θ } and the observed variables { Φ, X, Y }, the posterior probabilities of the latent variables Z are calculated:
Figure FDA0003520201250000048
wherein x isnFor the nth feature, const represents a constant that is not a function of Z, and the expected value of the joint logarithm W (T, Θ) is calculated by the following equation given the posterior distribution Z:
Figure FDA0003520201250000051
wherein the content of the first and second substances,
Figure FDA0003520201250000052
is that
Figure FDA0003520201250000053
The average vector of (a); d is the extracted feature dimension; k is a clustering number; pinkIs the expected value of the posterior distribution;
finding the best estimate of T, Θ using the following formula
Figure FDA0003520201250000054
Thereby maximizing W (T, Θ):
Figure FDA0003520201250000055
computing
Figure FDA0003520201250000056
And for each cluster K e 1,2, …, K ∈ { μ }k,∑k};
Figure FDA0003520201250000057
The optimum value of (c) is calculated as follows:
Figure FDA0003520201250000058
k,∑kis calculated as follows:
Figure FDA0003520201250000059
Figure FDA00035202012500000510
CN202110763496.8A 2021-07-06 2021-07-06 Motor imagery electroencephalogram signal classification method based on hybrid model Active CN113486794B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110763496.8A CN113486794B (en) 2021-07-06 2021-07-06 Motor imagery electroencephalogram signal classification method based on hybrid model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110763496.8A CN113486794B (en) 2021-07-06 2021-07-06 Motor imagery electroencephalogram signal classification method based on hybrid model

Publications (2)

Publication Number Publication Date
CN113486794A CN113486794A (en) 2021-10-08
CN113486794B true CN113486794B (en) 2022-04-15

Family

ID=77940594

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110763496.8A Active CN113486794B (en) 2021-07-06 2021-07-06 Motor imagery electroencephalogram signal classification method based on hybrid model

Country Status (1)

Country Link
CN (1) CN113486794B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113688952B (en) * 2021-10-25 2022-01-25 中国科学院自动化研究所 Brain-computer interface decoding acceleration method and system based on self-adaptive electroencephalogram channel selection

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259741A (en) * 2020-01-09 2020-06-09 燕山大学 Electroencephalogram signal classification method and system
CN111582082A (en) * 2020-04-24 2020-08-25 燕山大学 Two-classification motor imagery electroencephalogram signal identification method based on interpretable clustering model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109480834A (en) * 2018-12-20 2019-03-19 浙江大学 A kind of Method of EEG signals classification based on quick multiple dimension empirical mode decomposition

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259741A (en) * 2020-01-09 2020-06-09 燕山大学 Electroencephalogram signal classification method and system
CN111582082A (en) * 2020-04-24 2020-08-25 燕山大学 Two-classification motor imagery electroencephalogram signal identification method based on interpretable clustering model

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Recognizing single-trial motor imagery EEG based on interpretable clustering method;Rongrong Fu 等;《Biomedical SignalProcessingandControl》;20200906;全文 *
Spatial Filtering for EEG-Based Regression Problems in Brain-Computer Interface (BCI);Dongrui Wu 等;《arXiv》;20170209;全文 *
基于 Fisher 准则的单次运动想象脑电信号意图识别研究;付荣荣 等;《生物医学工程学杂志》;20181031;第35卷(第5期);全文 *

Also Published As

Publication number Publication date
CN113486794A (en) 2021-10-08

Similar Documents

Publication Publication Date Title
CN111967506A (en) Electroencephalogram signal classification method for optimizing BP neural network by artificial bee colony
Yin et al. Physiological-signal-based mental workload estimation via transfer dynamical autoencoders in a deep learning framework
Preetha et al. Retracted Article: Early diagnose breast cancer with PCA-LDA based FER and neuro-fuzzy classification system
CN110522412B (en) Method for classifying electroencephalogram signals based on multi-scale brain function network
Lee et al. EEG artifact removal by Bayesian deep learning & ICA
CN111582082B (en) Two-classification motor imagery electroencephalogram signal identification method based on interpretable clustering model
CN113486794B (en) Motor imagery electroencephalogram signal classification method based on hybrid model
CN112884063B (en) P300 signal detection and identification method based on multi-element space-time convolution neural network
CN114492513A (en) Electroencephalogram emotion recognition method for adaptation to immunity domain based on attention mechanism in cross-user scene
CN114781441B (en) EEG motor imagery classification method and multi-space convolution neural network model
Deepthi et al. An intelligent Alzheimer’s disease prediction using convolutional neural network (CNN)
CN114595725B (en) Electroencephalogram signal classification method based on addition network and supervised contrast learning
CN115363531A (en) Epilepsy detection system based on bimodal electroencephalogram signal information bottleneck
Das et al. An effect-size based channel selection algorithm for mental task classification in brain computer interface
Shen et al. Multi-source signal alignment and efficient multi-dimensional feature classification in the application of EEG-based subject-independent drowsiness detection
Shooshtari et al. Selection of optimal EEG channels for classification of signals correlated with alcohol abusers
Rejer et al. Classic genetic algorithm vs. genetic algorithm with aggressive mutation for feature selection for a brain-computer interface
Dai et al. Study of cardiac arrhythmia classification based on convolutional neural network
CN114861738B (en) Electroencephalogram tracing and dipole selection-based motor imagery classification method
Singh et al. Motor imagery classification based on subject to subject transfer in Riemannian manifold
CN112438741B (en) Driving state detection method and system based on electroencephalogram feature transfer learning
CN116226693A (en) Gaussian mixture model nuclear power operation condition division method based on density peak clustering
Liu et al. Application of convolutional neural network in automatic classification of arrhythmia
Chen et al. Multi-subject unsupervised transfer with weighted subspace alignment for common spatial patterns
Balachander et al. An empirical comparison of dimensionality reduction techniques for pattern classification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant