CN110390272B - EEG signal feature dimension reduction method based on weighted principal component analysis - Google Patents

EEG signal feature dimension reduction method based on weighted principal component analysis Download PDF

Info

Publication number
CN110390272B
CN110390272B CN201910582226.XA CN201910582226A CN110390272B CN 110390272 B CN110390272 B CN 110390272B CN 201910582226 A CN201910582226 A CN 201910582226A CN 110390272 B CN110390272 B CN 110390272B
Authority
CN
China
Prior art keywords
eeg signal
matrix
dimension
eeg
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910582226.XA
Other languages
Chinese (zh)
Other versions
CN110390272A (en
Inventor
董娜
李英杰
常建芳
高忠科
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201910582226.XA priority Critical patent/CN110390272B/en
Publication of CN110390272A publication Critical patent/CN110390272A/en
Application granted granted Critical
Publication of CN110390272B publication Critical patent/CN110390272B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/16Devices for psychotechnics; Testing reaction times ; Devices for evaluating the psychological state
    • A61B5/18Devices for psychotechnics; Testing reaction times ; Devices for evaluating the psychological state for vehicle drivers or machine operators
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/369Electroencephalography [EEG]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7264Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
    • A61B5/7267Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction

Abstract

An EEG signal feature dimension reduction method based on weighted principal component analysis comprises the following steps: extracting samples of m EEG signals of fatigue driving, and dividing the samples into a training set and a testing set for training to obtain a classification total accuracy A and n different classification accuracy; respectively making differences between the total accuracy A and different classification accuracy to obtain n differences; normalizing the n differences to obtain n weights; constructing weight diagonal matrixes for n weights; writing samples of m EEG signals into an m x n dimensional matrix; multiplying the m-by-n dimensional matrix with the diagonal matrix of the weight to obtain weighted EEG signal characteristic data; calculating and decomposing a covariance matrix of the vector, and obtaining a characteristic value of the covariance matrix and a unitized characteristic vector corresponding to the characteristic value; selecting unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices to be combined to form a mapping matrix; thereby obtaining the EEG signal characteristic data with reduced dimension. The invention effectively improves the classification recognition precision and reduces the training time of the recognition model.

Description

EEG signal feature dimension reduction method based on weighted principal component analysis
Technical Field
The invention relates to an EEG signal characteristic dimension reduction method. In particular to an EEG signal characteristic dimension reduction method based on weighted principal component analysis.
Background
In recent decades, the number of automobiles in China has drastically increased, and traffic accidents have increased [1]. According to the related reports, china becomes one of the countries with frequent traffic accidents. There are many factors that cause traffic accidents, and among them, driver fatigue driving is a main cause of traffic accidents. The driver is distracted and thinking is reduced in fatigue, so that the driver has a slow response and reduced control power of the vehicle, and the possibility of traffic accidents is increased [2]. Therefore, it is important to accurately and rapidly detect the driving fatigue state of the driver. The driving fatigue detection based on the physiological information is an objective means [3] which can detect and identify the fatigue state of the driver according to the physiological index change. The study shows that the fatigue state of the driver can be effectively evaluated and detected through the electrophysiological information such as body temperature, blood pressure, electrocardiogram (EEG), electromyogram (EEG) and the like. The analysis of the electroencephalogram signal is the most commonly used method for distinguishing.
The brain electricity reflects the electrophysiological signals of the brain nervous system, and the fatigue state of a driver can be detected and analyzed through the brain electricity signals. Since the characteristics of the electroencephalogram signals are usually high-dimensional characteristic signals, a certain difficulty is brought to processing of the electroencephalogram signals, and in order to solve the problem, the high-dimensional characteristic signals need to be subjected to dimension reduction processing. Principal component analysis (principal component analysis, PCA) is a commonly used dimension reduction method, which can resolve the main influencing factors from a plurality of things, reveal the nature of the things and simplify the complex problem.
Liu J et al propose a mixed dimensional feature reduction scheme [4] for emotion classification problems, extracting 14 different features from EEG, minimum redundant maximum correlation (mRMR) for maximizing the correlation between features and classification variables and minimizing the correlation between features, further reducing the generated features by PCA to extract the principal components. Bousseta R and the like extract EEG features [5] through two methods of Continuous Wavelet Transform (CWT) and Empirical Mode Decomposition (EMD), perform feature dimension reduction by using PCA, and perform left and right hand motor imagery classification through a linear and Radial Basis Function (RBF) kernel function SVM classifier. Sun Ying and the like propose a fusion algorithm [6] based on nonlinear global features and power spectrum entropy of PCA, extract nonlinear geometric features of EEG as new emotion electroencephalogram features, combine the nonlinear attribute features of the EEG such as the power spectrum entropy and Hurst index and the like, perform dimension reduction and feature fusion through PCA technology, and perform emotion recognition by using SVM as a classifier. Neshov NN et al propose an algorithm to identify five psychological tasks using 6-channel EEG data [7], the main idea being to divide the original EEG signal into several frames and calculate its spectrum, apply the second derivative of Gaussian to extract features, and use PCA to reduce feature dimensions. Xin L I and the like extract 14 channels of electroencephalogram data [8] of which 8 positive and negative emotions represent brain regions, reconstruct delta, theta, alpha, beta four rhythm waves based on wavelet decomposition, fuse wavelet characteristics, approximate entropy and electroencephalogram characteristics of Hurst indexes by using Principal Component Analysis (PCA), and reduce characteristic dimensions. Zarei R et al propose a feature extraction method [9] combining PCA and cross covariance technique (CCOV), extracting discrimination information from mental states based on EEG signals in brain-computer interface technique application, applying a correlation-based variable selection method, and performing an optimal first search on the extracted features to identify an optimal feature set for characterizing mental state signal distribution.
In the application of PCA, the predecessor treats each dimension characteristic on average based on the dimension reduction of the data variance. However, the different features play a different role in the recognition process [10], and therefore each dimension feature cannot be treated equally. From the above analysis, the PCA dimension reduction method is still further improved and updated.
Reference to the literature
[1] Wang Fuwang and Wang Hong brain-electric characteristic analysis of fatigue state of coach driver [ J ]. Instrument and instrument report, 2013,34 (5): 1146-1152.
[2] Yu Xulei, li Xiangze real-time fatigue driving detection system based on portable brain data [ J ]. Modern information technology, 2018, v.2 (04): 45-47.
[3] She Jianfang, liu Jiang, li Xueying. Optimization study of fatigue driving detection recognition model based on random forest [ J ]. Automobile Utility technology, 2018, no.268 (13): 46-50.
[4]Liu J,Meng H,Li M,et al.Emotion detection from EEG recordings based on supervised and unsupervised dimension reduction[J].Concurrency and Computation:Practice and Experience,2018:e4446.
[5]Bousseta R,Tayeb S,Ouakouak I E,et al.EEG efficient classification of imagined right and left hand movement using RBF kernel SVM and the joint CWT_PCA[J].2018.
[6]Ying S,Jianghe M A,Xueying Z.EEG emotion recognition based on nonlinear global features and spectral feature[J].Computer Engineering and Applications,2018.
[7]Neshov NN,Manolova AH,Draganov IR,et al.Classification of Mental Tasks from EEG Signals Using Spectral Analysis,PCA and SVM[J].Cybernetics and Information Technologies,2018.
[8] Li Xin, cai Erjuan, tian Yanxiu et al, 32-39+50, an improved electroencephalogram feature extraction algorithm and its use in emotion recognition [ J ]. J.biomedical engineering journal, 2017 (04).
[9]Zarei R,He J,Siuly S,et al.A PCA Aided Cross-Covariance Scheme for Discriminative Feature Extraction from EEG Signals[J].Computer methods and programs in biomedicine,2017,146:47.
[10] Wang Yongxin, zhang Huaxiang, wang Shuang principal component analysis algorithms based on attribute weighting [ J ]. University of Jinan university journal (Nature science edition), 2015,29 (6): 438-443.
Disclosure of Invention
The invention aims to solve the technical problem of providing an EEG signal feature dimension reduction method based on weighted principal component analysis, which can improve the fatigue driving state classification accuracy.
The technical scheme adopted by the invention is as follows: an EEG signal feature dimension reduction method based on weighted principal component analysis, comprising the steps of:
1) Extracting samples of m EEG signals of fatigue driving by using an AR model, wherein each sample contains n-dimensional characteristic data;
2) Dividing m samples of EEG signals into a training set and a testing set, respectively training the training set and the testing set by using SVM, and classifying the testing set to obtain a classification total accuracy A;
3) Removing first dimension characteristic data in the n-dimension characteristic data of each EEG signal sample to obtain each EEG signalN-1 dimension characteristic data of the number sample, dividing the m samples of EEG signals into a training set and a testing set, respectively training the training set and the testing set by using SVM, and classifying the testing set to obtain a classification accuracy A 1 The method comprises the steps of carrying out a first treatment on the surface of the Removing second dimension data in n-dimension characteristic data of each EEG signal sample, dividing the m EEG signal samples into a training set and a test set, training and classifying the rest n-1 dimension data by using SVM to obtain classification accuracy A 2 And analogically, finally obtaining the accuracy A of n classifications 1 ,A 2 ,…A i ,…,A n
4) The classification total accuracy A obtained in the step 2) is respectively compared with the classification accuracy A obtained in the step 3) 1 ,A 2 ,…A i ,…,A n Performing difference to obtain n difference values; when the difference is negative, the classification accuracy A of the difference is made i The corresponding removed ith dimension features have negative influence on classification, and when the difference value is positive, the classification accuracy A is poor i The corresponding removed ith dimension feature has positive influence on classification;
5) Respectively carrying out normalization processing on the n differences to obtain n weights;
6) For n weights w i i=1, 2, …, n, constructing a weight diagonal matrix W n*n
7) Writing samples of m EEG signals into an m X n dimensional matrix X m*n
Wherein x is 1 ...x m Samples of m EEG signals;
8) Matrix X in m X n dimensions m*n Diagonal matrix W with weight W n*n Multiplying to obtain weighted EEG signal characteristic data Z m*n
9) Computing weighted EEG signal feature data Z m*n Covariance matrix C';
10 Decomposing the covariance matrix C' to obtain a characteristic value of the covariance matrix and a unitized characteristic vector corresponding to the characteristic value;
11 Selecting unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices according to a given accumulated contribution rate, and combining to form a mapping matrix P';
12 Using the mapping matrix P ', obtaining reduced-dimension EEG signal characteristic data Y' by the following formula:
Y′=Z m*n P′ (9)
wherein Z is m*n Is weighted EEG signal characteristic data.
The normalization processing in the step 5) adopts the following formula:
wherein d is data before normalization, max is the largest value of n differences, min is the smallest value of n differences, and w is a weight.
The covariance matrix in step 9) is calculated by the following formula:
wherein, C' is covariance matrix; n represents the dimension of each sample feature; z is Z m*n Representing weighted EEG signal characteristic data.
The decomposition covariance matrix C' of step 10) is obtained by using the following formula
λ i u i =C′u i ,i=1,2,3,…,n (6)
Wherein lambda is i As the ith eigenvalue of the covariance matrix, u i Lambda is lambda i Corresponding unitized eigenvectors are arranged in descending order for the obtained n eigenvalues to obtain lambda 12 ,…,λ n
Step 11) includes selecting eigenvalues of the first k covariance matrices, and calculating cumulative contribution rates of eigenvalues of the ith covariance matrix as follows:
according to the accumulated contribution rate, unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices are selected and combined to form a mapping matrix P ''
P′=[u 1 ,u 2 ,…,u k ] (8)
Wherein i is less than or equal to k, alpha i To accumulate contribution rate lambda j As the j-th eigenvalue of covariance matrix, u 1 ,u 2 ,…,u k The eigenvectors are unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices.
The EEG signal feature dimension reduction method based on weighted principal component analysis has the beneficial effects that:
first, principal component analysis is based on data variance dimensionality reduction, with each dimension of the features being treated on average, however the role played by different features in the recognition process is different. The invention reduces the dimension based on the attribute of the characteristic data, strengthens some characteristics which are critical to the identification, and weakens some characteristics which have little relation with the identification.
Secondly, the invention can effectively improve the classification recognition precision and reduce the training time of the recognition model by reducing the dimension of the electroencephalogram characteristic data.
Drawings
FIG. 1 is a flow chart of a method of feature dimensionality reduction in EEG signals based on weighted principal component analysis in accordance with the present invention;
FIG. 2a is a graph of experimental results for subject 1;
FIG. 2b is a graph of experimental results for subject 2;
FIG. 2c is a graph of experimental results for subject 3;
FIG. 2d is a graph of experimental results for subject 4;
fig. 2e is a graph of experimental results for subject 5.
Detailed Description
A method for reducing the feature dimensions of EEG signals based on weighted principal component analysis according to the invention will now be described in more detail with reference to the examples and drawings.
According to the EEG signal feature dimension reduction method based on the weighted principal component analysis, one feature is sequentially removed, the influence of different features on fatigue state classification performance is counted, then the accuracy reduction value normalization processing of the different features is used as the weight of the feature, and finally the weighted principal component analysis method is established to reduce the dimension of the feature.
The invention discloses an EEG signal characteristic dimension reduction method based on weighted principal component analysis, which comprises the following steps:
1) Extracting samples of m EEG signals of fatigue driving by using an AR model, wherein each sample contains n-dimensional characteristic data;
2) Dividing m samples of EEG signals into a training set and a testing set, respectively training the training set and the testing set by using SVM, and classifying the testing set to obtain a classification total accuracy A;
3) Removing first dimension characteristic data in n dimension characteristic data of each EEG signal sample to obtain n-1 dimension characteristic data of each EEG signal sample, dividing the m EEG signal samples into a training set and a test set, respectively training the training set and the test set by using SVM, classifying the test set, and obtaining classification accuracy A 1 The method comprises the steps of carrying out a first treatment on the surface of the Removing second dimension data in n-dimension characteristic data of each EEG signal sample, dividing the m EEG signal samples into a training set and a test set, training and classifying the rest n-1 dimension data by using SVM to obtain classification accuracy A 2 And analogically, finally obtaining the accuracy A of n classifications 1 ,A 2 …A i ,…,A n
4) The classification total accuracy A obtained in the step 2) is respectively compared with the classification accuracy A obtained in the step 3) 1 ,A 2 …A i ,…,A n Performing difference to obtain n difference values; when the difference is negative, the classification accuracy A of the difference is made i The corresponding removed ith dimension features have negative influence on classification, and when the difference value is positive, the classification accuracy A is poor i The corresponding removed ith dimension feature has positive influence on classification;
5) Respectively carrying out normalization processing on the n differences to obtain n weights; the normalization process adopts the following formula:
wherein d is data before normalization, max is the largest value of n differences, min is the smallest value of n differences, and w is a weight.
6) For n weights w i i=1, 2, …, n, constructing a weight diagonal matrix W n*n
7) Writing samples of m EEG signals into an m X n dimensional matrix X m*n
Wherein x is 1 ...x m Samples of m EEG signals;
8) Matrix X in m X n dimensions m*n Diagonal matrix W with weight W n*n Multiplying to obtain weighted EEG signal characteristic data Z m*n
9) Computing weighted EEG signal feature data Z m*n Covariance matrix C'; the covariance matrix is calculated by adopting the following formula:
wherein, C' is covariance matrix; n represents the dimension of each sample feature; z is Z m*n Representing weighted EEG signal characteristic data.
10 Decomposing the covariance matrix C' to obtain a characteristic value of the covariance matrix and a unitized characteristic vector corresponding to the characteristic value; the decomposition covariance matrix C' is prepared by adopting the following formula
λ i u i =C′u i ,i=1,2,3,…,n (6)
Wherein lambda is i As the ith eigenvalue of the covariance matrix, u i Lambda is lambda i Corresponding unitized eigenvectors are arranged in descending order for the obtained n eigenvalues to obtain lambda 12 ,…,λ n
11 Selecting unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices according to a given accumulated contribution rate, and combining to form a mapping matrix P'; the method comprises the steps of selecting eigenvalues of the first k covariance matrixes, and calculating the cumulative contribution rate of the eigenvalues of the ith covariance matrix as follows:
according to the accumulated contribution rate, unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices are selected and combined to form a mapping matrix P ''
P′=[u 1 ,u 2 ,…,u k ] (8)
Wherein i is less than or equal to k, alpha i To accumulate contribution rate lambda j As the j-th eigenvalue of covariance matrix, u 1 ,u 2 ,…,u k The eigenvectors are unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices.
12 Using the mapping matrix P ', obtaining reduced-dimension EEG signal characteristic data Y' by the following formula:
Y′=Z m*n P′ (9)
wherein Z is m*n Is weighted EEG signal characteristic data.
EEG signal acquisition in the examples of the present invention was performed in simulated driving experiments in which 5 right-handed college students aged 19 years to 26 years (3 men and 2 women; average age: 22.73) voluntarily participated in the experiment, and none of them had psychosis-related diseases. Two days prior to the experiment, subjects were required to avoid taking anti-fatigue beverages or drugs that cause somnolence. At the same time, they need to remain at a reasonable rest, with sleep times exceeding 7 hours per night. Since all subjects did not contact the driving simulator, they had to practice driving until proficiency was achieved before the experiment.
Experiments used PGFD001 driving simulators and were equipped with pedals, steering wheel and clutches. In the virtual driver 3DInstructor2, ordinary automobile pharton 2.0l is used, defaulting to automatic gear shifting. The experimental road is a highway with almost no curve. In addition, a webcam 360D618, a projector and a stereo speaker are added to improve the perceived ability.
In order to improve the fatigue of the subjects, the experimental time was 14:00-15:30, there are studies showing that this is a period of time that is prone to fatigue, with a trial time of about 90 minutes. The subject's whole scalp EEG signal is collected in an isolated and quiet room. In addition, we also monitored the facial status of the subject through the front-facing camera to verify the extent of fatigue. Before the start of the experiment there was a 10 minute scene setting and approximately 20 minutes driving exercise. After the start of the experiment, the subject will continue to drive until his mild fatigue is reported, which typically lasts about 30 minutes as a state of alertness. Then, after a continuous driving transition for 10 minutes, the subject underwent driving for another 30 minutes, which was used as a fatigue state. After this stage, each subject was asked to report some questions and post-experimental comments. The recording time for each subject was approximately 90 minutes, which varies slightly from individual to individual. As the experiment progresses, the fatigue of the subject becomes more and more intense. As important physiological measures of fatigue status, their subjective evaluation and actual behavior show more and more fatigue.
Data acquisition and data preprocessing:
as a measure of driver fatigue status, the EEG recording device was equipped with an ESI Neuroscan system, with 40 electrodes and arranged according to the International Standard 10/20 System, with a sampling frequency of 200Hz. Prior to acquisition, the skin impedance of the EEG electrode was adjusted to below 5kΩ by injecting a conductive gel. During the experiment, all subjects limited as much unnecessary physical movements as possible, maintained constant speed as much as possible and avoided car collisions. Of these 40 electrodes, two are defined as reference electrodes in addition to four electrodes as internal structures, and four (placed across the horizontal and vertical directions) are used to monitor eye movements. The original EEG signal is subject to high frequency noise and low frequency noise based on eye power and therefore requires pre-processing using the EEGLAB kit. A 30-channel EEG signal is obtained after eliminating the interference of noise and ocular artifacts.
The data collected in the first 10min of the alert state are selected as non-fatigue data, and the data collected in the last 10min of the fatigue state are selected as fatigue data. The data segmentation was performed using a sliding window with a fixed length, the window length being 1 second, without overlap. After the data segmentation was completed, 1200 sets of samples were obtained for each subject.
The embodiment of the invention applies an AR model to extract characteristics of electroencephalogram data, and the AR modeling principle is as follows:
where x (t) represents EEG data at time t, P represents AR model order, e (t) is a white noise sequence, and a (k) represents AR model coefficients.
AR modeling requires selection of model orders. The orders are 3, 4 and 5, respectively. The size of the AR features is equal to the AR order multiplied by 30 EEG channel units, so the AR 3 order gets 90 feature units, the AR 4 order gets 120 feature units, and the AR 5 order gets 150 feature units.
For comparison, the embodiment of the invention also adopts the power spectrum density as a characteristic extraction mode. Electroencephalogram includes four bands: delta (1-3 Hz), theta (4-7 Hz), alpha (8-13 Hz) and beta (14-30 Hz). The total power of each EEG activity band is calculated by adopting a trapezoid numerical integration method as a characteristic value, and 4 units of characteristic values can be obtained, so that 120 characteristic units are obtained in total for 30 EEG channels.
The EEG signal feature dimension reduction method based on weighted principal component analysis is verified through a classifier and a classification index:
the support vector machine is an efficient supervised two-classifier and plays an important role in realizing data classification of small samples, nonlinearities and high-dimensional modes. The invention selects SVM as classifier and Radial Basis Function (RBF) as kernel function.
For evaluating the classification result, the following three evaluation indexes are used:
accuracy rate:
sensitivity:
specificity:
TP is the number of correctly identified positive samples, namely the number of correctly identified samples for fatigue driving; TN is the number of samples that correctly identifies the negative sample, i.e., identifies normal driving as normal driving; FN is the number of samples for which no positive samples are identified, namely the number of samples for which fatigue driving is identified as normal driving; FP is the number of samples for which no negative samples are identified, i.e., the number of samples for which normal driving is identified as fatigue driving. The accuracy reflects the proportion of the samples with correct classification to the total samples, the sensitivity reflects the classification accuracy of the positive samples, and the specificity reflects the classification accuracy of the negative samples.
Matlab experiment settings:
in the 30-channel electroencephalogram data set of 5 subjects, 1200 sets of sample data were obtained for each subject. The invention adopts an AR model to extract the characteristics of the data set, the orders are 3 order, 4 order and 5 order respectively, and simultaneously adopts the power spectrum density as the characteristic extraction mode.
In the embodiment of the invention, WPCA is adopted to reduce the dimension of the characteristic data, the cumulative contribution rate is 0.95, and SVM is selected as a classifier. To verify the reliability of the experimental results, a 10-fold cross-validation method was used: the dataset was divided into 10 parts, 1 part as test set and the remaining 9 parts as training set. The cross-validation was repeated 10 times so that each could be tested as a test set and finally the average of 10 test data was taken as the final result.

Claims (2)

1. An EEG signal feature dimension reduction method based on weighted principal component analysis, comprising the steps of:
1) Extracting samples of m EEG signals of fatigue driving by using an AR model, wherein each sample contains n-dimensional characteristic data;
2) Dividing m samples of EEG signals into a training set and a testing set, respectively training the training set and the testing set by using SVM, and classifying the testing set to obtain a classification total accuracy A;
3) Removing first dimension characteristic data in n dimension characteristic data of each EEG signal sample to obtain n-1 dimension characteristic data of each EEG signal sample, dividing the m EEG signal samples into a training set and a test set, respectively training the training set and the test set by using SVM, classifying the test set, and obtaining classification accuracy A 1 The method comprises the steps of carrying out a first treatment on the surface of the Removing second dimension data in n-dimension characteristic data of each EEG signal sample, dividing the m EEG signal samples into a training set and a test set, training and classifying the rest n-1 dimension data by using SVM to obtain classification accuracy A 2 And analogically, finally obtaining the accuracy A of n classifications 1 ,A 2 ,…A i ,…,A n
4) The classification total accuracy A obtained in the step 2) is respectively compared with the classification accuracy A obtained in the step 3) 1 ,A 2 ,…A i ,…,A n Performing difference to obtain n difference values; when the difference is negative, the classification accuracy A of the difference is made i The corresponding removed ith dimension features have negative influence on classification, and when the difference value is positive, the classification accuracy A is poor i The corresponding removed ith dimension feature has positive influence on classification;
5) Respectively carrying out normalization processing on the n differences to obtain n weights;
6) For n weights w i i=1, 2, …, n, constructing a weight diagonal matrix W n*n
7) Writing samples of m EEG signals into an m X n dimensional matrix X m*n
Wherein x is 1 ...x m Samples of m EEG signals;
8) Matrix X in m X n dimensions m*n Diagonal matrix W with weight W n*n Multiplying to obtain weighted EEG signal characteristic data Z m*n
9) Computing weighted EEG signal feature data Z m*n Covariance matrix C'; the covariance matrix is calculated by adopting the following formula:
wherein, C' is covariance matrix; n represents the dimension of each sample feature; z is Z m*n Representing weighted EEG signal characteristic data;
10 Decomposing the covariance matrix C' to obtain a characteristic value of the covariance matrix and a unitized characteristic vector corresponding to the characteristic value; the decomposition covariance matrix C' is prepared by adopting the following formula:
λ i u i =C′u i ,i=1,2,3,…,n (6)
wherein lambda is i As the ith eigenvalue of the covariance matrix, u i Lambda is lambda i Corresponding unitized eigenvectors are arranged in descending order for the obtained n eigenvalues to obtain lambda 12 ,…,λ n
11 Selecting unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices according to a given accumulated contribution rate, and combining to form a mapping matrix P'; the method comprises the steps of selecting eigenvalues of the first k covariance matrixes, and calculating the cumulative contribution rate of the eigenvalues of the ith covariance matrix as follows:
according to the accumulated contribution rate, unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices are selected and combined to form a mapping matrix P ''
P′=[u 1 ,u 2 ,…,u k ] (8)
Wherein i is less than or equal to k, alpha i To accumulate contribution rate lambda j As the j-th eigenvalue of covariance matrix, u 1 ,u 2 ,…,u k Unitized eigenvectors corresponding to eigenvalues of the first k covariance matrices;
12 Using the mapping matrix P ', obtaining reduced-dimension EEG signal characteristic data Y' by the following formula:
Y′=Z m*n P′ (9)
wherein Z is m*n Is weighted EEG signal characteristic data.
2. The method of claim 1, wherein the normalization in step 5) is performed using the following formula:
wherein d is data before normalization, max is the largest value of n differences, min is the smallest value of n differences, and w is a weight.
CN201910582226.XA 2019-06-30 2019-06-30 EEG signal feature dimension reduction method based on weighted principal component analysis Active CN110390272B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910582226.XA CN110390272B (en) 2019-06-30 2019-06-30 EEG signal feature dimension reduction method based on weighted principal component analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910582226.XA CN110390272B (en) 2019-06-30 2019-06-30 EEG signal feature dimension reduction method based on weighted principal component analysis

Publications (2)

Publication Number Publication Date
CN110390272A CN110390272A (en) 2019-10-29
CN110390272B true CN110390272B (en) 2023-07-18

Family

ID=68285867

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910582226.XA Active CN110390272B (en) 2019-06-30 2019-06-30 EEG signal feature dimension reduction method based on weighted principal component analysis

Country Status (1)

Country Link
CN (1) CN110390272B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444659A (en) * 2020-03-26 2020-07-24 武汉工程大学 Centrifugal pump fault diagnosis method, system and medium based on improved particle filtering
CN112016504B (en) * 2020-09-06 2022-06-03 天津大学 Fatigue driving electroencephalogram signal regression analysis method based on ensemble learning
CN112883895B (en) * 2021-03-08 2022-03-25 山东大学 Illegal electromagnetic signal detection method based on self-adaptive weighted PCA and realization system thereof
CN113220120B (en) * 2021-04-27 2022-08-30 武汉理工大学 Self-adaptive motor imagery brain-computer interface training method fusing subjective and objective evaluation
CN113642600B (en) * 2021-06-29 2024-02-02 桂林电子科技大学 Driving behavior feature extraction method based on mRMR algorithm and principal component analysis

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915445A (en) * 2012-09-17 2013-02-06 杭州电子科技大学 Method for classifying hyperspectral remote sensing images of improved neural network
CN103584840A (en) * 2013-11-25 2014-02-19 天津大学 Automatic sleep stage method based on electroencephalogram, heart rate variability and coherence between electroencephalogram and heart rate variability
CN106108894A (en) * 2016-07-18 2016-11-16 天津大学 A kind of emotion electroencephalogramrecognition recognition method improving Emotion identification model time robustness
CN106952052A (en) * 2017-04-06 2017-07-14 东北林业大学 Based on hybrid weight core principle component analysis enterprise supplier evaluation method
CN107025468A (en) * 2017-05-18 2017-08-08 重庆大学 Highway congestion recognition methods based on PCA GA SVM algorithms
CN108596268A (en) * 2018-05-03 2018-09-28 湖南大学 A kind of data classification method
CN109034087A (en) * 2018-08-06 2018-12-18 河海大学 A kind of hybrid machine learning signal classification method based on PCA dimensionality reduction

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103876734B (en) * 2014-03-24 2015-09-02 北京工业大学 A kind of EEG signals feature selection approach based on decision tree

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915445A (en) * 2012-09-17 2013-02-06 杭州电子科技大学 Method for classifying hyperspectral remote sensing images of improved neural network
CN103584840A (en) * 2013-11-25 2014-02-19 天津大学 Automatic sleep stage method based on electroencephalogram, heart rate variability and coherence between electroencephalogram and heart rate variability
CN106108894A (en) * 2016-07-18 2016-11-16 天津大学 A kind of emotion electroencephalogramrecognition recognition method improving Emotion identification model time robustness
CN106952052A (en) * 2017-04-06 2017-07-14 东北林业大学 Based on hybrid weight core principle component analysis enterprise supplier evaluation method
CN107025468A (en) * 2017-05-18 2017-08-08 重庆大学 Highway congestion recognition methods based on PCA GA SVM algorithms
CN108596268A (en) * 2018-05-03 2018-09-28 湖南大学 A kind of data classification method
CN109034087A (en) * 2018-08-06 2018-12-18 河海大学 A kind of hybrid machine learning signal classification method based on PCA dimensionality reduction

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于属性加权的主成分分析算法;王永欣等;《济南大学学报(自然科学版)》;20151130;第1-6页 *
机器学习与数据挖掘特征选择与特征降维;清华文库;《https://www.doc88.com/p-0886177639657.html》;20130522;第1-20页 *

Also Published As

Publication number Publication date
CN110390272A (en) 2019-10-29

Similar Documents

Publication Publication Date Title
CN110390272B (en) EEG signal feature dimension reduction method based on weighted principal component analysis
CN109157231B (en) Portable multichannel depression tendency evaluation system based on emotional stimulation task
Khare et al. Time–frequency representation and convolutional neural network-based emotion recognition
Maheshwari et al. Automated accurate emotion recognition system using rhythm-specific deep convolutional neural network technique with multi-channel EEG signals
Khare et al. PDCNNet: An automatic framework for the detection of Parkinson’s disease using EEG signals
Jindal et al. An adaptive deep learning approach for PPG-based identification
Wang et al. Driving fatigue classification based on fusion entropy analysis combining EOG and EEG
US9149719B2 (en) Device and method for generating a representation of a subject's attention level
US8862581B2 (en) Method and system for concentration detection
Krishna et al. An efficient mixture model approach in brain-machine interface systems for extracting the psychological status of mentally impaired persons using EEG signals
CN111714118B (en) Brain cognition model fusion method based on ensemble learning
CN113729707A (en) FECNN-LSTM-based emotion recognition method based on multi-mode fusion of eye movement and PPG
Alturki et al. Common spatial pattern technique with EEG signals for diagnosis of autism and epilepsy disorders
Sengupta et al. A multimodal system for assessing alertness levels due to cognitive loading
CN114052735A (en) Electroencephalogram emotion recognition method and system based on depth field self-adaption
Dong et al. A WPCA-based method for detecting fatigue driving from EEG-based internet of vehicles system
Ahmadi et al. Automated detection of driver fatigue from electroencephalography through wavelet-based connectivity
CN111920420A (en) Patient behavior multi-modal analysis and prediction system based on statistical learning
Ramos-Aguilar et al. Analysis of EEG signal processing techniques based on spectrograms
Mahmoudi et al. Multi-class EEG classification of motor imagery signal by finding optimal time segments and features using SNR-based mutual information
Pan et al. Recognition of human inner emotion based on two-stage FCA-ReliefF feature optimization
Bayram et al. Analysis of epileptic iEEG data by applying convolutional neural networks to low-frequency scalograms
Murugappan et al. Subtractive fuzzy classifier based driver drowsiness levels classification using EEG
Reaves et al. Assessment and application of EEG: A literature review
Baghdadi et al. Effectiveness of dominance for anxiety vs anger detection

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant