CN103077318A - Classifying method based on sparse measurement - Google Patents

Classifying method based on sparse measurement Download PDF

Info

Publication number
CN103077318A
CN103077318A CN2013100176108A CN201310017610A CN103077318A CN 103077318 A CN103077318 A CN 103077318A CN 2013100176108 A CN2013100176108 A CN 2013100176108A CN 201310017610 A CN201310017610 A CN 201310017610A CN 103077318 A CN103077318 A CN 103077318A
Authority
CN
China
Prior art keywords
vector
matrix
sparse
iteration
objective function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013100176108A
Other languages
Chinese (zh)
Inventor
徐鹏
李沛洋
张锐
田春阳
郭兰锦
尧德中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN2013100176108A priority Critical patent/CN103077318A/en
Publication of CN103077318A publication Critical patent/CN103077318A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a classifying method based on sparse measurement. The classifying method specifically comprises the following steps of: culturing a distribution information matrix among classes and a distribution information matrix among classes; decomposing the obtained matrixes; converting a Fisher criterion to an L1 module structure; and estimating a projection vector for enabling an objective function to obtain the maximum value. As the existing LDA (Linear Discriminant Analysis) based on L2 mode measure function generates unfavorable amplification to the noises such as Outliers and the like, the classifying method disclosed by the invention can overcome the problem that the existing linear discriminant analysis based on L2 mode is affected by the Outliers and noise isolated points by adopting a measure function based on L1 mode during a discriminant analysis constructing process, so that stable classifying and identifying effect is obtained, and the stability of a BCI (Brain Computer Interface) system is improved to certain extent.

Description

A kind of sorting technique based on sparse tolerance
Technical field
The invention belongs to the biomedical information technical field, be specifically related to the method for classifying modes in the field of brain-computer interfaces.
Background technology
Brain-computer interface (Brain Computer Interface, BCI) be to utilize to realize human brain and extraneous passage (the Wolpaw JR that directly exchanges and control between computing machine or other external electronic device, Birbaumer N, McFarland DJ, Pfurtscheller G, Vaughan TM (2002) Brain-computer interfaces for communication and control.Clin Neurophysiol113,767-791).BCI research relates to numerous subjects, such as: Neuscience, input, signal processing, pattern-recognition, control theory etc., the cross development of these subjects have promoted the propelling of BCI research.The basic theory of BCI and clinical application research have been included into the category of brain science and neuroengineering, also by international a lot of authoritative institutions think 21 century brain and Neuroscience Research in forefront and one of focus.
A BCI system comprises signals collecting, data processing and three parts of Peripheral Equipment ﹠ Interface.At Data processing, the pattern classification of EEG signal is the key that can the BCI system translate brain information.The method for classifying modes that is used for brain-computer interface is a lot, wherein, linear discriminant analysis method (Linear Discriminant Analysis, LDA) is because simple structural principle and lower computation complexity and be widely used in the BCI pattern-recognition.
The basic thought of linear discriminant analysis is the sample in the higher dimensional space to be projected to a low dimensional feature space and so that the sample after the projection has dispersion in dispersion between maximum class and the minimum class in new space, the feature space that satisfies condition is makes (1) formula reach the maximum subspace that proper vector w opened:
J ( w ) = w t S B w w t S W w - - - ( 1 )
S B = Σ i = 1 c n i ( μ i - μ ) ( μ i - μ ) T - - - ( 2 )
S W = Σ i = 1 c Σ x k ∈ classi ( μ i - x k ) ( μ i - x k ) T - - - ( 3 )
μ i = 1 n i Σ x ∈ classi x - - - ( 4 )
μ = 1 m Σ i = 1 m x i - - - ( 5 )
Wherein, m is the population sample number, and c is total classification number, n iThe number of samples that represents i class sample, and have
Figure BDA00002747188900016
x kK sample in the expression training set, classi represents that the classification of sample is i, μ iBe the sample average of i class sample, μ is the population sample average; W is the column vector of a m * 1, w TBe its transposition, and S BAnd S wBe respectively scatter matrix and within class scatter matrix between class, and both are the matrix of m * m.
By certain linear change and introduce Lagrange multiplier λ, then formula (1) can be expressed as:
L(w,λ)=w TS Bw-λ(w TS Ww-1) (6)
Formula (6) two ends ask local derviation to get to w simultaneously:
∂ L ( w , λ ) ∂ w = S B w - λS W w - - - ( 7 )
Making it be 0 has:
S Bw=λS Ww (8)
The generalized eigenvector w of formula (8) is the projection vector of obtaining by LDA.
Because the essence of LDA is based on the exemplary number of L2 and defines, yet in EEG research, the EEG signals sample dimension that collects from eeg amplifier is usually much larger than number of samples, and has very strong randomness, particularly in the brain-computer interface field, this class problem shows more outstandingly.Although the solution that produces based on the L2 modeling method has good flatness, but do not possess sparse property, and be subject to easily in the experimentation because the exceptional value (Outliers) that electrode is loosening, nictation and misoperation produce and the impact of isolated noise point, even sometimes so that signal matrix becomes unusual, thereby may produce the mistake of projection w is estimated, finally affected classifying quality; And these interference can increase the reading difficulty of EEG signals and hinder the subsequent analysis research of EEG signals.
Summary of the invention
The objective of the invention is to propose a kind of sorting technique based on sparse tolerance in order to solve existing the problems referred to above that exist based on the linear discriminant analysis method of L2 mould
The technical solution used in the present invention is: a kind of sorting technique based on sparse tolerance specifically comprises the steps:
S1. according to scatter matrix S between EEG signals training sample difference compute classes BWith within class scatter matrix S W
Scatter matrix S between the class that S2. obtains according to step S1 BWith within class scatter matrix S WConstruct corresponding φ BWith Wherein, φ BWith
Figure BDA00002747188900023
Satisfy respectively
Figure BDA00002747188900024
Figure BDA00002747188900025
T is the transpose of a matrix computing;
S3. the Fisher criterion is converted into the L1 mode configuration, namely has:
Figure BDA00002747188900026
This structure mapping is converted into following objective function to log space:
Figure BDA00002747188900027
Wherein, w is projection vector, estimates so that objective function is obtained peaked projection vector w.
Further, said method comprises that also step S4. projects to the space that vectorial w opens with test sample book and classifies.
The invention has the beneficial effects as follows: existing LDA based on L2 mould measure function can produce disadvantageous amplification to noises such as Outliers, method of the present invention has adopted the measure function based on the L1 mould when the structure discriminatory analysis, can overcome the problem that existing linear discriminant analysis based on the L2 mould is subject to Outliers and the impact of noise isolated point, obtain sane Classification and Identification effect, improve to a certain extent the BCI Systems balanth; Experimental results show that method that the present invention proposes can be at short time Convergence, thereby satisfy the brain-computer interface on-line system to the requirement of real-time.
Description of drawings
The schematic flow sheet of Fig. 1 the inventive method.
Gaussian data emulation synoptic diagram in Fig. 2 embodiment of the invention.
Colorectal cancer gene recognition effect comparison diagram in Fig. 3 embodiment of the invention.
Embodiment
The present invention is described further below in conjunction with the drawings and specific embodiments.
Because S B, S WBe real symmetric matrix, consisted of by the product of two matrixes respectively at the transposed matrix of self in essence, so formula (1) also can be expressed as:
Figure BDA00002747188900031
S B, S WCan find corresponding φ by certain matrix decomposition theorem BWith
Figure BDA00002747188900032
Therefore method of the present invention can be described below:
S1. according to scatter matrix S between EEG signals training sample difference compute classes BWith within class scatter matrix S W
In order to construct scatter matrix and within class scatter matrix between corresponding class, can in training sample, calculate respectively in advance the sample average μ of each class iWith population sample average μ, calculate respectively scatter matrix S between class according to formula (2) and formula (3) BAnd within class scatter matrix S W
Scatter matrix S between the class that S2. obtains according to step S1 BWith within class scatter matrix S WConstruct corresponding φ BWith Wherein, φ BWith Satisfy respectively
Figure BDA00002747188900035
Figure BDA00002747188900036
T is the transpose of a matrix computing;
Here can adopt svd to construct corresponding φ BWith
Figure BDA00002747188900037
Formula is as follows:
SVD ( S B ) = U B Σ B V B = U B Σ B U B T = U B Σ B Σ B U B T = φ B φ B T - - - ( 10 )
Figure BDA00002747188900039
Wherein, because S BAnd S WBe real symmetric matrix, therefore carry out later its left singular matrix U of svd and right singular matrix V and only have a transposition relation, its size is m * m; ∑ B=diag (σ B1, σ B2..., σ Bm) and ∑ W=diag (σ W1, σ W2..., σ Wm) be respectively S BAnd S WThe diagonal matrix that consists of of singular value, and σ is arranged B1〉=σ B2〉=... 〉=σ Bm=0 and σ W1〉=σ W2〉=... 〉=σ Wm=0;
Figure BDA00002747188900041
Be the matrix of m * m.
S3. the Fisher criterion is converted into the L1 mode configuration, namely has:
Figure BDA00002747188900043
This structure mapping is converted into following objective function to log space: Wherein, w is projection vector, estimates so that objective function is obtained peaked projection vector w.
Said process is for the process of training projection vector w, as a kind of preferred version, comprises also that here test process step S4. projects to the space that vectorial w opens with test sample book and classifies, and specifically can adopt the arest neighbors mode to estimate the pattern of test sample book.
Here in step S3, estimate so that objective function to obtain the detailed process of peaked projection vector w as follows:
S31. to objective function J *(w) both sides while derived function goes out corresponding gradient:
Figure BDA00002747188900045
Wherein,
P i = 1 w T &phi; B ( : , i ) &GreaterEqual; 0 - 1 w T &phi; B ( : , i ) < 0 ,
φ B(:, i),
Figure BDA00002747188900048
Difference representing matrix φ BWith I column vector and j column vector, m represents to consist of matrix φ BWith
Figure BDA000027471889000410
The column vector number;
S32. utilize gradient descent method that object vector w is estimated, the detailed process of estimation is as follows:
Renewal process is expressed as:
w(t+1)=w(t)-δ·w(t) T·dw
Wherein, t=[0,1,, n] and the expression iterations, w (t) is the vector of a m * 1, represent the estimated result that estimated result w (t+1) expression that the t time iteration obtains obtains for the t+1 time, it is the linear combination of w (t), vectorial w (0) the ∈ R of primary iteration mCan be set to a non-zero vector arbitrarily; δ is the iteration step distance, here can select so that J *(w (t+1)) reaches a value of maximum as the δ of this iteration, thereby guarantees the fast convergence of algorithm; When satisfying || J *(w (t+1))-J *(w (t)) || during<α, iteration stopping, α is predefined threshold value.In the present embodiment, α≤1e-005.
For feasibility and the effect of verifying this programme, with method of the present invention (L1_LDA) and traditional LDA contrast, three groups of comparative experiments have been done.First group is adopted the Gaussian data of the different averages of two classes and standard deviation and introduces outlier as emulation experiment, second group of data that adopt international brain-computer interface match, the effect of checking this programme in brain-computer interface; The 3rd group of Colorectal Carcinoma data that adopt the Princeton University, the expansion possibility of checking this programme.
First group of experiment:
It is [3 that data A derives from mean vector; 3], the standard deviation vector is [0.5; 0.5] the Gaussian distribution matrix; The mean vector of data B is specifically as shown in table 1:
Table 1
Figure BDA00002747188900051
The standard deviation vector is [0.5; 0.5] Gaussian distribution matrix (data B on diagonal gradually near data A), Outlier is [7.25 data from mean vector; 7.25], the standard deviation vector is [0.5; 0.5] the Gaussian distribution matrix, the number of Outliers is 10, data distribution in the experiment and projection are as shown in Figure 2, wherein, (a) the later scatter plot of data of Outliers is introduced in expression, and ' * ' and ' o ' represents respectively the data from different sample sets, can find out and introduce after the Outliers, skew has occured in part data, becomes exceptional value; (b) the traditional LDA method of expression can be found out because be subject to the impact of Outliers the classification projection of test set sample, a part of data communication device is crossed the LDA projection vector and has been mapped to from the position of cluster centre away from, has increased the probability of mistake minute; (c) the LDA method of expression process L1 modular constraint is to the classification projection of test sample book, can find out since the L1 mould to the inhibiting effect of Outliers so that two class data are concentrated after through the projection vector mapping has been distributed near the cluster centre of two class data, improved the stability of classifying quality.
Classification accuracy is as shown in table 2, and wherein runic represents that effect is better, and " * " expression has significant difference (p<0.05).
Table 2
Figure BDA00002747188900052
Figure BDA00002747188900061
Second group of experiment:
The international brain-computer interface tested numbering of the experimental data A(aa al av aw ay that competes for the third time); Specifically as shown in table 3:
Table 3
Figure BDA00002747188900062
The 3rd group of experiment:
Description of test: data from the rectal cancer group organization data of Princeton University, wherein comprised 62 Colorectal Tissues that difference is tested, each tissue has 2000 gene expressions, wherein has 40 organization table to reveal the positive, and 22 organization table reveal feminine gender.Adopt 5 times of cross validations, test 50 times average classification accuracy as shown in Figure 3.
Above experimental result proves that all method of the present invention is feasible, and in some cases, it is particularly evident that method of the present invention is compared traditional LDA effect.On the whole, no matter be emulation experiment or real experimental data, its result verification validity and the feasibility of method of the present invention, for improving the EEG research Systems balanth, extracting more effectively, cerebral function information has great theory and practice meaning.Method of the present invention not only has preferably effect in the brain-computer interface field, and proof also has preferably effect for the pattern-recognition in other field by experiment.
Those of ordinary skill in the art will appreciate that embodiment described here is in order to help reader understanding's principle of the present invention, should to be understood to that protection scope of the present invention is not limited to such special statement and embodiment.Those of ordinary skill in the art can make various other various concrete distortion and combinations that do not break away from essence of the present invention according to these technology enlightenments disclosed by the invention, and these distortion and combination are still in protection scope of the present invention.

Claims (5)

1. the sorting technique based on sparse tolerance specifically comprises the steps:
S1. according to scatter matrix S between EEG signals training sample difference compute classes BWith within class scatter matrix S W
Scatter matrix S between the class that S2. obtains according to step S1 BWith within class scatter matrix S WConstruct corresponding φ BWith
Figure FDA00002747188800011
Wherein, φ BWith
Figure FDA00002747188800012
Satisfy respectively
Figure FDA00002747188800013
Figure FDA00002747188800014
S3. the Fisher criterion is converted into the L1 mode configuration, namely has:
Figure FDA00002747188800015
Described L1 mode configuration is mapped to log space is converted into following objective function:
Figure FDA00002747188800016
Wherein, w is projection vector, estimates so that objective function is obtained peaked projection vector w.
2. the sorting technique based on sparse tolerance according to claim 1 is characterized in that, comprises that also step S4. projects to the space that vectorial w opens with test sample book and classifies.
3. the sorting technique based on sparse tolerance according to claim 1 and 2 is characterized in that, step S2 specifically adopts svd structure φ BWith
4. the sorting technique based on sparse tolerance according to claim 1 and 2 is characterized in that, estimate among the step S3 so that objective function to obtain the detailed process of peaked projection vector w as follows:
S31. to objective function J *(w) both sides while derived function goes out corresponding gradient:
Figure FDA00002747188800018
Wherein,
P i = 1 w T &phi; B ( : , i ) &GreaterEqual; 0 - 1 w T &phi; B ( : , i ) < 0 ,
φ B(:, i), Difference representing matrix φ BWith
Figure FDA000027471888000112
I column vector and j column vector, m represents to consist of matrix φ BWith
Figure FDA000027471888000113
The column vector number;
S32. utilize gradient descent method that object vector w is estimated.
5. the sorting technique based on sparse tolerance according to claim 4 is characterized in that, step S32 utilizes gradient descent method as follows to the detailed process that object vector w estimates:
Renewal process is expressed as:
w(t+1)=w(t)-δ·w(t) T·dw
Wherein, t represents iterations, and w (t) is the vector of a m * 1, represents the estimated result that the t time iteration obtains, the estimated result that w (t+1) expression obtains for the t+1 time, vectorial w (0) the ∈ R of primary iteration mBe set to a non-zero vector arbitrarily; δ is the iteration step distance, selects so that J *(w (t+1)) reaches a value of maximum as the δ of this iteration; When satisfying || J *(w (t+1))-J *(w (t)) || during<α, iteration stopping, α is predefined threshold value.
CN2013100176108A 2013-01-17 2013-01-17 Classifying method based on sparse measurement Pending CN103077318A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013100176108A CN103077318A (en) 2013-01-17 2013-01-17 Classifying method based on sparse measurement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013100176108A CN103077318A (en) 2013-01-17 2013-01-17 Classifying method based on sparse measurement

Publications (1)

Publication Number Publication Date
CN103077318A true CN103077318A (en) 2013-05-01

Family

ID=48153847

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013100176108A Pending CN103077318A (en) 2013-01-17 2013-01-17 Classifying method based on sparse measurement

Country Status (1)

Country Link
CN (1) CN103077318A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104463234A (en) * 2015-01-04 2015-03-25 深圳信息职业技术学院 Face recognition method
CN106599907A (en) * 2016-11-29 2017-04-26 北京航空航天大学 Multi-feature fusion-based dynamic scene classification method and apparatus
CN113938198A (en) * 2021-11-08 2022-01-14 杭州电子科技大学 Optical fiber transmission system, method and module for simplifying nonlinear equalizer based on LDA
CN115081542A (en) * 2022-07-22 2022-09-20 深圳大学 Subspace clustering method, terminal device and computer-readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102629156A (en) * 2012-03-06 2012-08-08 上海大学 Method for achieving motor imagery brain computer interface based on Matlab and digital signal processor (DSP)
CN102647942A (en) * 2009-10-09 2012-08-22 纽罗斯凯公司 Sensory-evoked potential (SEP) classification/detection in the time domain

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102647942A (en) * 2009-10-09 2012-08-22 纽罗斯凯公司 Sensory-evoked potential (SEP) classification/detection in the time domain
CN102629156A (en) * 2012-03-06 2012-08-08 上海大学 Method for achieving motor imagery brain computer interface based on Matlab and digital signal processor (DSP)

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HAIXIAN WANG等: "L1-Norm-Based Common Spatial Patterns", 《IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING》 *
李洁: "多模态脑电信号分析及脑机接口应用", 《中国博士学位论文全文数据库 信息科技辑》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104463234A (en) * 2015-01-04 2015-03-25 深圳信息职业技术学院 Face recognition method
CN104463234B (en) * 2015-01-04 2017-10-27 深圳信息职业技术学院 Face identification method
CN106599907A (en) * 2016-11-29 2017-04-26 北京航空航天大学 Multi-feature fusion-based dynamic scene classification method and apparatus
CN106599907B (en) * 2016-11-29 2019-11-29 北京航空航天大学 The dynamic scene classification method and device of multiple features fusion
CN113938198A (en) * 2021-11-08 2022-01-14 杭州电子科技大学 Optical fiber transmission system, method and module for simplifying nonlinear equalizer based on LDA
CN113938198B (en) * 2021-11-08 2024-04-02 杭州电子科技大学 Optical fiber transmission system, LDA-based method and module for simplifying nonlinear equalizer
CN115081542A (en) * 2022-07-22 2022-09-20 深圳大学 Subspace clustering method, terminal device and computer-readable storage medium
CN115081542B (en) * 2022-07-22 2023-07-07 深圳大学 Subspace clustering method, terminal equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
EP3703054B1 (en) Target voice detection method and apparatus
CN102142082B (en) Virtual sample based kernel discrimination method for face recognition
Li et al. Study on the BeiHang keystroke dynamics database
CN102664010B (en) Robust speaker distinguishing method based on multifactor frequency displacement invariant feature
CN103077318A (en) Classifying method based on sparse measurement
Kanagasundaram et al. Weighted LDA techniques for i-vector based speaker verification
CN105701470A (en) Analog circuit fault characteristic extraction method based on optimal wavelet packet decomposition
CN107273825A (en) Personal identification method is merged based on the physiological signal for improving canonical correlation analysis
CN103941244A (en) Radar target one-dimensional range profile local optimal sub-space recognition method
Abdulhai et al. Spatio-temporal inductance-pattern recognition for vehicle re-identification
CN106548136A (en) A kind of wireless channel scene classification method
Wang et al. Hydraulic system fault diagnosis method based on a multi‐feature fusion support vector machine
CN105335689A (en) Character recognition method and apparatus
CN105931646A (en) Speaker identification method base on simple direct tolerance learning algorithm
Mahapatra et al. Power system disturbance classification with online event-driven neuromorphic computing
Laiadi et al. Multi-view deep features for robust facial kinship verification
CN107403618A (en) Based on the audio event sorting technique and computer equipment for stacking base rarefaction representation
CN104318224A (en) Face recognition method and monitoring equipment
CN103295007B (en) A kind of Feature Dimension Reduction optimization method for Chinese Character Recognition
CN104166981B (en) Human body movement learning method based on multigraph expression
CN102237089A (en) Method for reducing error identification rate of text irrelevant speaker identification system
Gao et al. Unbiased risk estimator to multi-labeled complementary label learning
Sheng et al. Centroid-aware local discriminative metric learning in speaker verification
Zhao et al. A novel signal representation in SEI: Manifold
KR101094763B1 (en) Apparatus and method for extracting feature vector for user authentication

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130501