CN106096649A - Sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis - Google Patents

Sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis Download PDF

Info

Publication number
CN106096649A
CN106096649A CN201610404407.XA CN201610404407A CN106096649A CN 106096649 A CN106096649 A CN 106096649A CN 201610404407 A CN201610404407 A CN 201610404407A CN 106096649 A CN106096649 A CN 106096649A
Authority
CN
China
Prior art keywords
sample
phi
tea
discriminant analysis
kernel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610404407.XA
Other languages
Chinese (zh)
Other versions
CN106096649B (en
Inventor
支瑞聪
张德政
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology Beijing USTB
Original Assignee
University of Science and Technology Beijing USTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology Beijing USTB filed Critical University of Science and Technology Beijing USTB
Priority to CN201610404407.XA priority Critical patent/CN106096649B/en
Publication of CN106096649A publication Critical patent/CN106096649A/en
Application granted granted Critical
Publication of CN106096649B publication Critical patent/CN106096649B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The present invention provides a kind of sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis, and method includes: utilize electronic tongues to detect Tea Samples, obtains sensor response clock signal;Principal component residual sum mahalanobis distance method is used to be analyzed exceptional sample and reject according to described response clock signal;The parameter of core linear discriminant analysis method is optimized, with Longjing tea quality grade correct recognition rata for according to the parameter selecting core linear discriminant analysis method;Use core linear discriminant analysis method to carry out Nonlinear feature extraction to sensor response signal, obtain the flavor characteristics of Tea Samples;By the flavor characteristics input grader of Tea Samples, carry out tea leaf quality grade judgement.Abnormality value removing is carried out to Tea Samples, utilizes the core linear discriminant analysis method after Optimal Parameters can preferably characterize the nonlinear characteristic of different brackets Tea Samples, promote the signal difference opposite sex in high-dimensional feature space for the sample after Nonlinear Mapping.

Description

Kernel linearity discriminant analysis-based taste sense signal difference feature extraction method
Technical Field
The invention relates to the technical field of tea detection, in particular to a taste sense induction signal difference characteristic extraction method based on nuclear linear discriminant analysis.
Background
In recent years, tea quality testing has been a difficult task because tea contains many ingredients and their effects on tea quality are very different. West lake Longjing tea is a typical representative of Chinese green tea. Some vendors fry other green tea into flat shape to serve as the dragon well tea or use the dragon well of other production places of Zhejiang to serve as the west lake dragon well, which disturbs the market of the dragon well tea and damages the benefits of consumers, therefore, the method has important significance for scientific detection and evaluation of the quality of the west lake dragon well tea.
Sensory evaluation is an important method for evaluating the quality of tea leaves for a long time, but the method needs rich tea science knowledge and evaluation experience, and the sensory organ sensitivity of professional tea tasters is easily changed by the interference of external factors. Many analytical tools are therefore used to analyse tea leaf chemicals, such as high performance liquid chromatography, gas chromatography and the like. But the traditional linear feature extraction method cannot effectively explore the intrinsic regularity existing in the nonlinear data.
Disclosure of Invention
The invention aims to provide a taste sense induction signal difference characteristic extraction method based on nuclear linear discriminant analysis, which can effectively extract difference characteristics of tea.
In order to solve the above technical problems, an embodiment of the present invention provides a taste sensation signal difference feature extraction method based on kernel linear discriminant analysis, including:
detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
Preferably, the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
Preferably, the detecting the tea sample by using the electronic tongue comprises the following steps:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
Preferably, the analyzing and rejecting abnormal samples by using principal component residual error and mahalanobis distance method according to the response time sequence signal includes:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from large to small, and sorting the eigenvectors corresponding to the eigenvalues according to the sequence from large to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2
And judging the sample points which are away from the same type sample points and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
Preferably, the optimizing the parameters of the kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method based on the correct recognition rate of the quality grade of the tea leaves, includes:
taking a Gaussian kernel function as a nonlinear conversion function of a kernel linear discriminant analysis method, and calculating the linear discriminant analysis of the Gaussian kernel function k (x, y) as exp (- | | x-y | |)2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
Preferably, the gaussian kernel function is:
k ( x , y ) = exp ( - | | x - y | | 2 2 σ 2 )
preferably, the performing nonlinear feature extraction on the sensor response signal by using a kernel linear discriminant analysis method to obtain the flavor features of the tea sample includes:
by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
Preferably, said transformation is by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
S b Φ = Σ i = 1 L P i ( m Φ , i - m Φ ) ( m Φ , i - m Φ ) T
S w Φ = Σ i = 1 L Σ x k ∈ c i ( Φ ( x k ) - m Φ , i ) ( Φ ( x k ) - m Φ , i ) T
wherein m isΦAnd mΦ,iRespectively representing the mean values of all training samples in the high-dimensional feature space and the mean value of the ith class of training samples;
the Fisher criterion function in the high-dimensional feature space is:
J f ( W ) = | W T S b Φ W W T S w Φ W |
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT
C = d i a g ( n 1 , n 2 , ... , n L ) ∈ R L × L , G = d i a g ( 1 n 1 1 n 1 × 1 , ... , 1 n L 1 n L × 1 ) ,
W = d i a g ( I n 1 - 1 n 1 1 n 1 × n 1 , ... , I n L - 1 n L 1 n L × n L ) ∈ R N × N
Preferably, the performing nonlinear feature extraction on the sensor response signal to obtain the taste features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The Fisher criterion function maximization is converted into a problem of solving generalized eigenvalues, and the eigenvalues of KBK α -lambda KWK α and corresponding eigenvectors α - α are solved12,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]12,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
Preferably, the inputting of the taste characteristics of the tea sample into the classifier to perform the quality grade determination of tea comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
d ( x ~ i , x i ) = Σ k = 1 d ( x i k ′ - x i k ) 2
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
The technical scheme of the invention has the following beneficial effects:
according to the scheme, abnormal values of tea samples can be eliminated, nonlinear characteristics of the tea samples in different grades can be better represented by using the kernel linear discriminant analysis method after parameters are optimized, and the signal difference of the samples subjected to nonlinear mapping in a high-dimensional characteristic space is improved.
Drawings
FIG. 1 is a flowchart of a taste sensation signal difference feature extraction method based on kernel linear discriminant analysis according to an embodiment of the present invention;
FIG. 2 is an electronic tongue response map of a tea sample according to an embodiment of the invention;
FIGS. 3a-3d are graphs of the principal component residue-Mahalanobis distance distribution for an embodiment of the present invention;
FIG. 4 shows the correct recognition rate and parameter σ of tea samples according to the embodiment of the present invention2Selecting a relation graph;
FIGS. 5a-5d are comparative results of the KLDA and linear dimensionality reduction method of the present invention with respect to tea sample differentiation;
fig. 6a and 6b are the results of comparing the correct recognition rate curves of KLDA and linear dimensionality reduction methods of embodiments of the present invention as a function of dimensionality reduction.
Detailed Description
In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, the method for extracting difference features of taste sensation signal based on kernel linear discriminant analysis according to the embodiment of the present invention includes:
step 101: detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
step 102: analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
step 103: optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
step 104: performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
step 105: inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
The gustation sensing signal difference characteristic extraction method based on the kernel linear discriminant analysis can remove abnormal values of tea samples, the kernel linear discriminant analysis method after parameters are optimized can better represent nonlinear characteristics of the tea samples in different grades, and the signal difference of samples subjected to nonlinear mapping in a high-dimensional characteristic space is improved.
Preferably, the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
Preferably, the detecting the tea sample by using the electronic tongue comprises the following steps:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
The invention can adopt ASTREE electronic tongue system of French Alpha MOS company to detect the Longjing tea sample, and is specially designed for gustatory analysis technology.
Before data acquisition, the electronic tongue system can be subjected to steps of self-checking, activation, training, calibration, diagnosis and the like, so that the acquired data is ensured to have reliability and stability. After the collection, seven electronic tongue response fingerprints are obtained from each tea sample, as shown in figure 2. The horizontal axis represents measurement time, and the vertical axis represents the acquired induced voltage value. The point on the curve represents the change of the potential difference with time when the tea soup flavor substance passes through the sensor channel. In the measurement process, each detection time is 120s, a set of data is acquired every 0.5s by the electronic tongue, and each sample is detected to finally obtain 7 time sequence signals which change along with time, such as response signals of the electronic tongue sensor shown in the attached figure 2. Thus, for each sample test, the data obtained is a matrix of 7 × 240 dimensions. The stable value of the sensor response at 120s can be selected as a characteristic point for the subsequent establishment of the tea quality model.
Preferably, the analyzing and rejecting abnormal samples by using principal component residual error and mahalanobis distance method according to the response time sequence signal includes:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from big to small, the characteristic value corresponds toThe eigenvectors are sorted in the order from big to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2
And judging the sample points which are far away from the same type sample point and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
Wherein, the principal component analysis and mahalanobis distance value calculation are performed on the fine sample set, the special sample set, the first sample set and the second sample set, respectively, as shown in fig. 3a-3 d. The points marked in the graph are abnormal sample points. And performing subsequent data processing on the tea sample data from which the abnormal sample points are removed.
Preferably, the optimizing the parameters of the kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method based on the positive recognition rate of the quality grade of the tea leaves, includes:
linear discrimination using gaussian kernel function as kernelThe nonlinear conversion function of the analysis method is used for solving the problem that the Gaussian kernel function k (x, y) is exp (- | | x-y | | | survival rate2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
Preferably, the gaussian kernel function is:
k ( x , y ) = exp ( - | | x - y | | 2 2 σ 2 )
the parameter selection is an important factor influencing the algorithm discrimination effect, the selection of proper parameters can enhance the effectiveness of the algorithm, while the selection of improper parameters can greatly weaken the function of the algorithm and even make the algorithm effective. For the kernel linear discriminant analysis algorithm (KLDA), the construction of the kernel function is the core of the algorithm. The high-dimensional map has no explicit form and needs to be computed by means of a kernel function. All (Φ (x) · Φ (y)) are replaced by a kernel function k (x, y). The choice of kernel functions determines the transformation function Φ and the feature space F.
The invention can adopt the most widely applied Gaussian radial basis kernel function, and the kernel function needs to be corresponding to the parameter sigma2And (4) carrying out optimization selection, and determining the optimal value of the parameter through a group of experiments. Respectively take sigma20.5,5,50,500,5000,50000, the parameters take six specific values, and the characteristic dimension is increased continuously in a certain step lengthIs large. So as to obtain the change of the correct recognition rate in the process of continuously increasing the feature dimension. The results of the correct recognition rate for different parameters as the features increase tertiary are shown in fig. 3.
As can be seen from FIG. 4, when σ is2When the number of samples is 0.5,5, and 500, the correct recognition rate of the sample is low. Sigma2When the number is 50, the correct recognition rate of the sample is the highest. Sigma2The correct recognition rate for the sample is similar when 5000,50000. Therefore, the invention selects 50 as the parameter σ in the KLDA algorithm2The value of (a).
Preferably, the performing nonlinear feature extraction on the sensor response signal by using a kernel linear discriminant analysis method to obtain the flavor features of the tea sample includes:
by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
Preferably, said transformation is by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
S b Φ = Σ i = 1 L P i ( m Φ , i - m Φ ) ( m Φ , i - m Φ ) T
S w Φ = Σ i = 1 L Σ x k ∈ c i ( Φ ( x k ) - m Φ , i ) ( Φ ( x k ) - m Φ , i ) T
wherein m isΦAnd mΦ,iRespectively representing the mean values of all training samples in the high-dimensional feature space and the mean value of the ith class of training samples;
the Fisher criterion function in the high-dimensional feature space is:
J f ( W ) = | W T S b Φ W W T S w Φ W |
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT
C = d i a g ( n 1 , n 2 , ... , n L ) ∈ R L × L , G = d i a g ( 1 n 1 1 n 1 × 1 , ... , 1 n L 1 n L × 1 ) ,
W = d i a g ( I n 1 - 1 n 1 1 n 1 × n 1 , ... , I n L - 1 n L 1 n L × n L ) ∈ R N × N
Preferably, the performing nonlinear feature extraction on the sensor response signal to obtain the taste features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
Optimizing the Fisher criterion functionThe maximization is converted into the problem of solving the generalized eigenvalue, and the eigenvalue of KBK α ═ λ KWK α and the corresponding eigenvector α ═ α12,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]12,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
Preferably, the inputting of the taste characteristics of the tea sample into the classifier to perform the quality grade determination of tea comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
d ( x ~ i , x i ) = Σ k = 1 d ( x i k ′ - x i k ) 2
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
According to the taste sensation induction signal difference characteristic extraction method based on the kernel linear discriminant analysis, the discrimination capability of the kernel linear discriminant analysis method and the discrimination capability of the traditional linear dimensionality reduction method on tea samples are compared, KLDA, PCA, LDA and LPP are respectively adopted to reduce dimensionality of data of an electronic tongue intelligent sensory instrument, and the characteristic dimensionality is selected to be 2. The distribution diagram of the tea sample points after the dimensionality reduction is shown in figure 5.
As can be seen from fig. 5a-5d, for the linear dimensionality reduction method, whether unsupervised (PCA) or supervised (LDA, LPP) algorithms, the different grades of tea samples are severely aliased in the two-dimensional dimensionality reduction space. Although the LDA and LPP algorithms are supervised methods, the reduced-dimension samples are still severely aliased in the two-dimensional space. Experimental results show that the KLDA algorithm achieves the best sample separation, sample points of the same class are grouped together, while sample points of different classes are correctly separated. The algorithm utilizes the category information of tea samples to optimize a discriminant function, and the kernel-based feature extraction algorithm can mine the nonlinear features of the tea sample data, so that samples which cannot be correctly classified in an original data space are correctly classified in a high-dimensional space after being subjected to high-dimensional mapping.
The invention also analyzes and compares the quality grade classification results of different grades of Longjing tea by using the KLDA algorithm. After the abnormal value is eliminated, the number of the samples for tea grade discrimination is 212. In order to verify the adaptability and generalization of the kernel principal component analysis method, the judgment range of the algorithm is widened. In the experiment, 20 (case 1) samples and 30 (case 2) samples are randomly selected from tea samples of each grade for training, and the rest samples are tested. FIGS. 6a-6b show the comparison of the KLDA algorithm and the PCA, LDA, LPP algorithms on the correct classification recognition rates (as a function of the dimensionality reduction) for different grades of tea samples under two experimental conditions. It can be seen that the correct recognition rate of the PCA algorithm is overall lower than LDA and LPP. Both LDA and LPP algorithms adopt a supervision type calculation method, and as can be seen from the figure, when the feature dimension is lower, the correct recognition rate of LPP is better than that of LDA, and the difference between the two gradually decreases with the increase of the feature dimension. In general, the accurate recognition rate of the KLDA algorithm is higher than that of other linear dimension reduction methods. As can be seen from the variation trend of the graph, the highest correct recognition rate of the algorithm usually does not occur in the situation that the feature dimension is the largest
In this case, the recognition rate curve tends to increase in the early stage and generally tends to decrease or level in the latter half. Therefore, it can be understood that a small number of features can effectively represent original sample information, and the mapping of high-dimensional samples in a low-dimensional space can extract sample effective information and eliminate noise.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A taste sense signal difference feature extraction method based on nuclear linear discriminant analysis is characterized by comprising the following steps of:
detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
2. The method of claim 1, wherein the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
3. The method for extracting difference characteristics of taste sensation signals based on kernel linear discriminant analysis according to any one of claims 1 or 2, wherein the detecting of the tea sample by using the electronic tongue comprises:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
4. The method for extracting difference features of taste sensation signals based on kernel linear discriminant analysis according to claim 1, wherein the analyzing and removing abnormal samples according to the response time series signals by using principal component residuals and mahalanobis distance method comprises:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from large to small, and sorting the eigenvectors corresponding to the eigenvalues according to the sequence from large to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2
And judging the sample points which are far away from the same type sample point and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
5. The method of any one of claims 1 or 4, wherein the optimizing the parameters of the kernel linear discriminant analysis method to select the parameters of the kernel linear discriminant analysis method based on the correct recognition rate of the quality grade of the tea comprises:
taking a Gaussian kernel function as a nonlinear conversion function of a kernel linear discriminant analysis method, and calculating the linear discriminant analysis of the Gaussian kernel function k (x, y) as exp (- | | x-y | |)2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
6. The method of claim 5, wherein the Gaussian kernel function is:
k ( x , y ) = exp ( - | | x - y | | 2 2 σ 2 )
7. the method for extracting difference characteristics of taste sensation signals based on kernel linear discriminant analysis according to claim 5, wherein the non-linear feature extraction of sensor response signals by using kernel linear discriminant analysis method to obtain flavor characteristics of tea samples comprises:
by a non-linear transformation Φ:mapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
8. The method of claim 7, wherein said difference features of taste sensation signals are extracted by a nonlinear transformation Φ:mapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
S b Φ = Σ i = 1 L P i ( m Φ , i - m Φ ) ( m Φ , i - m Φ ) T
S w Φ = Σ i = 1 L Σ x k ∈ c i ( Φ ( x k ) - m Φ , i ) ( Φ ( x k ) - m Φ , i ) T
wherein m isΦAnd mΦ,iRepresenting the mean and class i training samples of all training samples in the high-dimensional feature space, respectivelyMean value;
the Fisher criterion function in the high-dimensional feature space is:
J f ( W ) = | W T S b Φ W W T S w Φ W |
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT
C=diag(n1,n2,…,nL)∈RL×L
W = d i a g ( I n 1 - 1 n 1 1 n 1 × n 1 , ... , I n L - 1 n L 1 n L × n L ) ∈ R N × N
9. The method of claim 8, wherein the performing nonlinear feature extraction on the sensor response signals to obtain the flavor features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The Fisher criterion function maximization is converted into a problem of solving generalized eigenvalues, and the eigenvalues of KBK α -lambda KWK α and corresponding eigenvectors α - α are solved12,…,αN]TAnd is from large according to the characteristic valueSorting to a small order;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]12,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
Φ ( x i ) T v k = Φ ( x i ) T Σ j = 1 N α j k Φ ( x j ) = Σ j = 1 N α j k Φ ( x i ) T Φ ( x j ) = Σ j = 1 N α j k K i j
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
10. The method for extracting difference features of taste sensation signals based on kernel linear discriminant analysis according to claim 1, wherein the step of inputting the taste features of the tea sample into a classifier to perform tea quality grade determination comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
d ( x ~ i , x i ) = Σ k = 1 d ( x i k ′ - x i k ) 2
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
CN201610404407.XA 2016-06-08 2016-06-08 Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis Active CN106096649B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610404407.XA CN106096649B (en) 2016-06-08 2016-06-08 Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610404407.XA CN106096649B (en) 2016-06-08 2016-06-08 Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis

Publications (2)

Publication Number Publication Date
CN106096649A true CN106096649A (en) 2016-11-09
CN106096649B CN106096649B (en) 2019-08-06

Family

ID=57227570

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610404407.XA Active CN106096649B (en) 2016-06-08 2016-06-08 Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis

Country Status (1)

Country Link
CN (1) CN106096649B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326915A (en) * 2016-08-10 2017-01-11 北京理工大学 Improved-Fisher-based chemical process fault diagnosis method
CN107220670A (en) * 2017-05-27 2017-09-29 重庆大学 Supervised Artifical Taste system features extracting method is had based on wavelet transform
CN110135372A (en) * 2019-05-20 2019-08-16 闽江学院 Action identification method based on linear judgement and SVM under VR artistic medium interactive environment
CN111144522A (en) * 2019-12-16 2020-05-12 浙江大学 Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference
CN111476702A (en) * 2020-04-07 2020-07-31 兰州交通大学 Image steganography detection method and system based on nonlinear mixed kernel feature mapping
CN113190797A (en) * 2021-04-18 2021-07-30 宁波大学科学技术学院 PTA device gross error discrimination method based on online rolling discrimination feature analysis

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103487537A (en) * 2013-07-30 2014-01-01 中国标准化研究院 Detection method for producing areas of Xihulongjing tea based on genetic algorithm optimization
CN103487558A (en) * 2013-07-30 2014-01-01 中国标准化研究院 Detection method for abnormal samples in mode identification and analysis of tea quality through intelligent sensory signals
CN104036298A (en) * 2013-09-23 2014-09-10 苏州工业职业技术学院 High-spectrum remote sensing image end-member classification method based on Fisher self-adaptive learning
CN104504407A (en) * 2014-12-17 2015-04-08 西南大学 Electronic nose feature selection optimization method on basis of multiple Fisher kernel discriminant analysis
CN105181761A (en) * 2015-08-26 2015-12-23 安徽农业大学 Method for rapidly identifying irradiation absorbed dose of tea by using electronic nose

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103487537A (en) * 2013-07-30 2014-01-01 中国标准化研究院 Detection method for producing areas of Xihulongjing tea based on genetic algorithm optimization
CN103487558A (en) * 2013-07-30 2014-01-01 中国标准化研究院 Detection method for abnormal samples in mode identification and analysis of tea quality through intelligent sensory signals
CN104036298A (en) * 2013-09-23 2014-09-10 苏州工业职业技术学院 High-spectrum remote sensing image end-member classification method based on Fisher self-adaptive learning
CN104504407A (en) * 2014-12-17 2015-04-08 西南大学 Electronic nose feature selection optimization method on basis of multiple Fisher kernel discriminant analysis
CN105181761A (en) * 2015-08-26 2015-12-23 安徽农业大学 Method for rapidly identifying irradiation absorbed dose of tea by using electronic nose

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
RUICONG ZHI等: "New dimensionality reduction model (manifold learning) coupled with electronic tongue for green tea grade identification", 《EUROPEAN FOOD RESEARCH AND TECHNOLOGY》 *
周昌亮: "基于LDA和KDA的人脸识别算法研究", 《中国优秀硕士学位论文库,信息科技辑》 *
支瑞聪: "基于谱图理论的人脸表情识别算法研究", 《中国博士学位论文全文数据库 信息科技辑》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326915A (en) * 2016-08-10 2017-01-11 北京理工大学 Improved-Fisher-based chemical process fault diagnosis method
CN106326915B (en) * 2016-08-10 2019-08-02 北京理工大学 A kind of Fault Diagnosis for Chemical Process method based on improvement core Fisher
CN107220670A (en) * 2017-05-27 2017-09-29 重庆大学 Supervised Artifical Taste system features extracting method is had based on wavelet transform
CN107220670B (en) * 2017-05-27 2020-07-14 重庆大学 Method for extracting characteristics of supervised artificial taste system based on discrete wavelet transform
CN110135372A (en) * 2019-05-20 2019-08-16 闽江学院 Action identification method based on linear judgement and SVM under VR artistic medium interactive environment
CN111144522A (en) * 2019-12-16 2020-05-12 浙江大学 Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference
CN111144522B (en) * 2019-12-16 2021-01-08 浙江大学 Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference
CN111476702A (en) * 2020-04-07 2020-07-31 兰州交通大学 Image steganography detection method and system based on nonlinear mixed kernel feature mapping
CN113190797A (en) * 2021-04-18 2021-07-30 宁波大学科学技术学院 PTA device gross error discrimination method based on online rolling discrimination feature analysis
CN113190797B (en) * 2021-04-18 2024-07-16 南京医工交叉创新研究院有限公司 PTA device rough difference discriminating method based on online rolling discriminating characteristic analysis

Also Published As

Publication number Publication date
CN106096649B (en) 2019-08-06

Similar Documents

Publication Publication Date Title
CN106096649B (en) Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis
CN107818298B (en) General Raman spectrum feature extraction method for machine learning substance identification algorithm
CN109142317B (en) Raman spectrum substance identification method based on random forest model
CN103412003B (en) Gas detection method based on self-adaption of semi-supervised domain
CN109002859B (en) Sensor array feature selection and array optimization method based on principal component analysis
CN106951914B (en) Method for identifying vinegar variety by electronic nose for optimizing fuzzy identification vector extraction
CN110378374B (en) Tea near infrared spectrum classification method for extracting fuzzy identification information
CN104374739A (en) Identification method for authenticity of varieties of seeds on basis of near-infrared quantitative analysis
CN105954412A (en) Sensor array optimization method for Carya cathayensis freshness detection
Tripathy et al. Electronic nose for black tea quality evaluation using kernel based clustering approach
Liao et al. Recognition of partial discharge patterns
CN103942526B (en) Linear feature extraction method for discrete data point set
KR102013392B1 (en) Gas detection method using SVM classifier
CN109886296A (en) A kind of authentication information extracts the local tea variety classification method of formula noise cluster
CN105913856A (en) Audio tampering detection method and system based on amplitude co-occurrence vector characteristics
CN110987856B (en) Cosmetic quality rapid identification method based on formula system and fingerprint spectrum
US6496742B1 (en) Classifying apparatus designed in particular for odor recognition
CN106018515B (en) A kind of electronic tongues signal characteristic extracting methods based on manifold learning
Lazaro et al. Chemometric data analysis for black tea fermentation using principal component analysis
JP5802916B2 (en) Sensory data identification device and program
CN111275100B (en) Image feature identification method based on training set sample low-rank screening
Dusio et al. Fingerprint sample quality assessment via ridge line count using laplacian of gaussian edge finding
Nasution et al. A Low Cost Electronic Nose System for Classification of Gayo Arabica Coffee Roasting Levels Using Stepwise Linear Discriminant and K-Nearest Neighbor.
Gupta et al. Survey on Tea Discriminator
Lelono et al. Quality Classification of Chili Sauce Using Electronic Nose with Principal Component Analysis

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant