CN106096649A - Sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis - Google Patents
Sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis Download PDFInfo
- Publication number
- CN106096649A CN106096649A CN201610404407.XA CN201610404407A CN106096649A CN 106096649 A CN106096649 A CN 106096649A CN 201610404407 A CN201610404407 A CN 201610404407A CN 106096649 A CN106096649 A CN 106096649A
- Authority
- CN
- China
- Prior art keywords
- sample
- phi
- tea
- discriminant analysis
- kernel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000004458 analytical method Methods 0.000 title claims abstract description 48
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000014860 sensory perception of taste Effects 0.000 title abstract description 3
- 230000004044 response Effects 0.000 claims abstract description 55
- 238000000605 extraction Methods 0.000 claims abstract description 23
- 238000013507 mapping Methods 0.000 claims abstract description 11
- 239000000796 flavoring agent Substances 0.000 claims abstract description 10
- 235000019634 flavors Nutrition 0.000 claims abstract description 10
- 230000006870 function Effects 0.000 claims description 46
- 239000011159 matrix material Substances 0.000 claims description 28
- 238000012549 training Methods 0.000 claims description 26
- 239000013598 vector Substances 0.000 claims description 21
- 230000002159 abnormal effect Effects 0.000 claims description 14
- 238000012360 testing method Methods 0.000 claims description 10
- 235000019613 sensory perceptions of taste Nutrition 0.000 claims description 9
- 230000035923 taste sensation Effects 0.000 claims description 9
- 239000006185 dispersion Substances 0.000 claims description 6
- 238000005406 washing Methods 0.000 claims description 6
- 230000009466 transformation Effects 0.000 claims description 5
- 238000005457 optimization Methods 0.000 claims description 4
- 235000014347 soups Nutrition 0.000 claims description 4
- 229910021607 Silver chloride Inorganic materials 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000004140 cleaning Methods 0.000 claims description 3
- 230000008030 elimination Effects 0.000 claims description 3
- 238000003379 elimination reaction Methods 0.000 claims description 3
- HKZLPVFGJNLROG-UHFFFAOYSA-M silver monochloride Chemical compound [Cl-].[Ag+] HKZLPVFGJNLROG-UHFFFAOYSA-M 0.000 claims description 3
- 241001122767 Theaceae Species 0.000 claims 14
- 244000269722 Thea sinensis Species 0.000 abstract description 66
- 244000287680 Garcinia dulcis Species 0.000 abstract 1
- 230000005856 abnormality Effects 0.000 abstract 1
- 210000002105 tongue Anatomy 0.000 abstract 1
- 235000013616 tea Nutrition 0.000 description 62
- 230000009467 reduction Effects 0.000 description 9
- 230000008859 change Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000009569 green tea Nutrition 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000001953 sensory effect Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 230000001339 gustatory effect Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000000513 principal component analysis Methods 0.000 description 1
- 238000012847 principal component analysis method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000012372 quality testing Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 210000000697 sensory organ Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Investigating Or Analysing Materials By Optical Means (AREA)
Abstract
The present invention provides a kind of sense of taste induced signal otherness feature extracting method based on core linear discriminant analysis, and method includes: utilize electronic tongues to detect Tea Samples, obtains sensor response clock signal;Principal component residual sum mahalanobis distance method is used to be analyzed exceptional sample and reject according to described response clock signal;The parameter of core linear discriminant analysis method is optimized, with Longjing tea quality grade correct recognition rata for according to the parameter selecting core linear discriminant analysis method;Use core linear discriminant analysis method to carry out Nonlinear feature extraction to sensor response signal, obtain the flavor characteristics of Tea Samples;By the flavor characteristics input grader of Tea Samples, carry out tea leaf quality grade judgement.Abnormality value removing is carried out to Tea Samples, utilizes the core linear discriminant analysis method after Optimal Parameters can preferably characterize the nonlinear characteristic of different brackets Tea Samples, promote the signal difference opposite sex in high-dimensional feature space for the sample after Nonlinear Mapping.
Description
Technical Field
The invention relates to the technical field of tea detection, in particular to a taste sense induction signal difference characteristic extraction method based on nuclear linear discriminant analysis.
Background
In recent years, tea quality testing has been a difficult task because tea contains many ingredients and their effects on tea quality are very different. West lake Longjing tea is a typical representative of Chinese green tea. Some vendors fry other green tea into flat shape to serve as the dragon well tea or use the dragon well of other production places of Zhejiang to serve as the west lake dragon well, which disturbs the market of the dragon well tea and damages the benefits of consumers, therefore, the method has important significance for scientific detection and evaluation of the quality of the west lake dragon well tea.
Sensory evaluation is an important method for evaluating the quality of tea leaves for a long time, but the method needs rich tea science knowledge and evaluation experience, and the sensory organ sensitivity of professional tea tasters is easily changed by the interference of external factors. Many analytical tools are therefore used to analyse tea leaf chemicals, such as high performance liquid chromatography, gas chromatography and the like. But the traditional linear feature extraction method cannot effectively explore the intrinsic regularity existing in the nonlinear data.
Disclosure of Invention
The invention aims to provide a taste sense induction signal difference characteristic extraction method based on nuclear linear discriminant analysis, which can effectively extract difference characteristics of tea.
In order to solve the above technical problems, an embodiment of the present invention provides a taste sensation signal difference feature extraction method based on kernel linear discriminant analysis, including:
detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
Preferably, the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
Preferably, the detecting the tea sample by using the electronic tongue comprises the following steps:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
Preferably, the analyzing and rejecting abnormal samples by using principal component residual error and mahalanobis distance method according to the response time sequence signal includes:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,;
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from large to small, and sorting the eigenvectors corresponding to the eigenvalues according to the sequence from large to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2;
And judging the sample points which are away from the same type sample points and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
Preferably, the optimizing the parameters of the kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method based on the correct recognition rate of the quality grade of the tea leaves, includes:
taking a Gaussian kernel function as a nonlinear conversion function of a kernel linear discriminant analysis method, and calculating the linear discriminant analysis of the Gaussian kernel function k (x, y) as exp (- | | x-y | |)2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
Preferably, the gaussian kernel function is:
preferably, the performing nonlinear feature extraction on the sensor response signal by using a kernel linear discriminant analysis method to obtain the flavor features of the tea sample includes:
by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
Preferably, said transformation is by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
wherein m isΦAnd mΦ,iRespectively representing the mean values of all training samples in the high-dimensional feature space and the mean value of the ith class of training samples;
the Fisher criterion function in the high-dimensional feature space is:
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT,
Preferably, the performing nonlinear feature extraction on the sensor response signal to obtain the taste features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The Fisher criterion function maximization is converted into a problem of solving generalized eigenvalues, and the eigenvalues of KBK α -lambda KWK α and corresponding eigenvectors α - α are solved1,α2,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]1,α2,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
Preferably, the inputting of the taste characteristics of the tea sample into the classifier to perform the quality grade determination of tea comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
The technical scheme of the invention has the following beneficial effects:
according to the scheme, abnormal values of tea samples can be eliminated, nonlinear characteristics of the tea samples in different grades can be better represented by using the kernel linear discriminant analysis method after parameters are optimized, and the signal difference of the samples subjected to nonlinear mapping in a high-dimensional characteristic space is improved.
Drawings
FIG. 1 is a flowchart of a taste sensation signal difference feature extraction method based on kernel linear discriminant analysis according to an embodiment of the present invention;
FIG. 2 is an electronic tongue response map of a tea sample according to an embodiment of the invention;
FIGS. 3a-3d are graphs of the principal component residue-Mahalanobis distance distribution for an embodiment of the present invention;
FIG. 4 shows the correct recognition rate and parameter σ of tea samples according to the embodiment of the present invention2Selecting a relation graph;
FIGS. 5a-5d are comparative results of the KLDA and linear dimensionality reduction method of the present invention with respect to tea sample differentiation;
fig. 6a and 6b are the results of comparing the correct recognition rate curves of KLDA and linear dimensionality reduction methods of embodiments of the present invention as a function of dimensionality reduction.
Detailed Description
In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, the method for extracting difference features of taste sensation signal based on kernel linear discriminant analysis according to the embodiment of the present invention includes:
step 101: detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
step 102: analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
step 103: optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
step 104: performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
step 105: inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
The gustation sensing signal difference characteristic extraction method based on the kernel linear discriminant analysis can remove abnormal values of tea samples, the kernel linear discriminant analysis method after parameters are optimized can better represent nonlinear characteristics of the tea samples in different grades, and the signal difference of samples subjected to nonlinear mapping in a high-dimensional characteristic space is improved.
Preferably, the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
Preferably, the detecting the tea sample by using the electronic tongue comprises the following steps:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
The invention can adopt ASTREE electronic tongue system of French Alpha MOS company to detect the Longjing tea sample, and is specially designed for gustatory analysis technology.
Before data acquisition, the electronic tongue system can be subjected to steps of self-checking, activation, training, calibration, diagnosis and the like, so that the acquired data is ensured to have reliability and stability. After the collection, seven electronic tongue response fingerprints are obtained from each tea sample, as shown in figure 2. The horizontal axis represents measurement time, and the vertical axis represents the acquired induced voltage value. The point on the curve represents the change of the potential difference with time when the tea soup flavor substance passes through the sensor channel. In the measurement process, each detection time is 120s, a set of data is acquired every 0.5s by the electronic tongue, and each sample is detected to finally obtain 7 time sequence signals which change along with time, such as response signals of the electronic tongue sensor shown in the attached figure 2. Thus, for each sample test, the data obtained is a matrix of 7 × 240 dimensions. The stable value of the sensor response at 120s can be selected as a characteristic point for the subsequent establishment of the tea quality model.
Preferably, the analyzing and rejecting abnormal samples by using principal component residual error and mahalanobis distance method according to the response time sequence signal includes:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from big to small, the characteristic value corresponds toThe eigenvectors are sorted in the order from big to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2;
And judging the sample points which are far away from the same type sample point and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
Wherein, the principal component analysis and mahalanobis distance value calculation are performed on the fine sample set, the special sample set, the first sample set and the second sample set, respectively, as shown in fig. 3a-3 d. The points marked in the graph are abnormal sample points. And performing subsequent data processing on the tea sample data from which the abnormal sample points are removed.
Preferably, the optimizing the parameters of the kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method based on the positive recognition rate of the quality grade of the tea leaves, includes:
linear discrimination using gaussian kernel function as kernelThe nonlinear conversion function of the analysis method is used for solving the problem that the Gaussian kernel function k (x, y) is exp (- | | x-y | | | survival rate2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
Preferably, the gaussian kernel function is:
the parameter selection is an important factor influencing the algorithm discrimination effect, the selection of proper parameters can enhance the effectiveness of the algorithm, while the selection of improper parameters can greatly weaken the function of the algorithm and even make the algorithm effective. For the kernel linear discriminant analysis algorithm (KLDA), the construction of the kernel function is the core of the algorithm. The high-dimensional map has no explicit form and needs to be computed by means of a kernel function. All (Φ (x) · Φ (y)) are replaced by a kernel function k (x, y). The choice of kernel functions determines the transformation function Φ and the feature space F.
The invention can adopt the most widely applied Gaussian radial basis kernel function, and the kernel function needs to be corresponding to the parameter sigma2And (4) carrying out optimization selection, and determining the optimal value of the parameter through a group of experiments. Respectively take sigma20.5,5,50,500,5000,50000, the parameters take six specific values, and the characteristic dimension is increased continuously in a certain step lengthIs large. So as to obtain the change of the correct recognition rate in the process of continuously increasing the feature dimension. The results of the correct recognition rate for different parameters as the features increase tertiary are shown in fig. 3.
As can be seen from FIG. 4, when σ is2When the number of samples is 0.5,5, and 500, the correct recognition rate of the sample is low. Sigma2When the number is 50, the correct recognition rate of the sample is the highest. Sigma2The correct recognition rate for the sample is similar when 5000,50000. Therefore, the invention selects 50 as the parameter σ in the KLDA algorithm2The value of (a).
Preferably, the performing nonlinear feature extraction on the sensor response signal by using a kernel linear discriminant analysis method to obtain the flavor features of the tea sample includes:
by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
Preferably, said transformation is by a non-linear transformationMapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
wherein m isΦAnd mΦ,iRespectively representing the mean values of all training samples in the high-dimensional feature space and the mean value of the ith class of training samples;
the Fisher criterion function in the high-dimensional feature space is:
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT,
Preferably, the performing nonlinear feature extraction on the sensor response signal to obtain the taste features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
Optimizing the Fisher criterion functionThe maximization is converted into the problem of solving the generalized eigenvalue, and the eigenvalue of KBK α ═ λ KWK α and the corresponding eigenvector α ═ α1,α2,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]1,α2,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
Preferably, the inputting of the taste characteristics of the tea sample into the classifier to perform the quality grade determination of tea comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
According to the taste sensation induction signal difference characteristic extraction method based on the kernel linear discriminant analysis, the discrimination capability of the kernel linear discriminant analysis method and the discrimination capability of the traditional linear dimensionality reduction method on tea samples are compared, KLDA, PCA, LDA and LPP are respectively adopted to reduce dimensionality of data of an electronic tongue intelligent sensory instrument, and the characteristic dimensionality is selected to be 2. The distribution diagram of the tea sample points after the dimensionality reduction is shown in figure 5.
As can be seen from fig. 5a-5d, for the linear dimensionality reduction method, whether unsupervised (PCA) or supervised (LDA, LPP) algorithms, the different grades of tea samples are severely aliased in the two-dimensional dimensionality reduction space. Although the LDA and LPP algorithms are supervised methods, the reduced-dimension samples are still severely aliased in the two-dimensional space. Experimental results show that the KLDA algorithm achieves the best sample separation, sample points of the same class are grouped together, while sample points of different classes are correctly separated. The algorithm utilizes the category information of tea samples to optimize a discriminant function, and the kernel-based feature extraction algorithm can mine the nonlinear features of the tea sample data, so that samples which cannot be correctly classified in an original data space are correctly classified in a high-dimensional space after being subjected to high-dimensional mapping.
The invention also analyzes and compares the quality grade classification results of different grades of Longjing tea by using the KLDA algorithm. After the abnormal value is eliminated, the number of the samples for tea grade discrimination is 212. In order to verify the adaptability and generalization of the kernel principal component analysis method, the judgment range of the algorithm is widened. In the experiment, 20 (case 1) samples and 30 (case 2) samples are randomly selected from tea samples of each grade for training, and the rest samples are tested. FIGS. 6a-6b show the comparison of the KLDA algorithm and the PCA, LDA, LPP algorithms on the correct classification recognition rates (as a function of the dimensionality reduction) for different grades of tea samples under two experimental conditions. It can be seen that the correct recognition rate of the PCA algorithm is overall lower than LDA and LPP. Both LDA and LPP algorithms adopt a supervision type calculation method, and as can be seen from the figure, when the feature dimension is lower, the correct recognition rate of LPP is better than that of LDA, and the difference between the two gradually decreases with the increase of the feature dimension. In general, the accurate recognition rate of the KLDA algorithm is higher than that of other linear dimension reduction methods. As can be seen from the variation trend of the graph, the highest correct recognition rate of the algorithm usually does not occur in the situation that the feature dimension is the largest
In this case, the recognition rate curve tends to increase in the early stage and generally tends to decrease or level in the latter half. Therefore, it can be understood that a small number of features can effectively represent original sample information, and the mapping of high-dimensional samples in a low-dimensional space can extract sample effective information and eliminate noise.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.
Claims (10)
1. A taste sense signal difference feature extraction method based on nuclear linear discriminant analysis is characterized by comprising the following steps of:
detecting a tea sample by using an electronic tongue to obtain a sensor response time sequence signal;
analyzing and eliminating abnormal samples by adopting a principal component residual error and Mahalanobis distance method according to the response time sequence signal;
optimizing parameters of a kernel linear discriminant analysis method, and selecting the parameters of the kernel linear discriminant analysis method according to the quality grade correct recognition rate of the Longjing tea;
performing nonlinear feature extraction on the sensor response signals by adopting a kernel linear discriminant analysis method to obtain the flavor features of the tea samples;
inputting the taste characteristics of the tea sample into a classifier, and judging the quality grade of the tea.
2. The method of claim 1, wherein the sensor response timing signal comprises: at least one of a ZA sensor response time sequence signal, a BB sensor response time sequence signal, a JE sensor response time sequence signal, a GA sensor response time sequence signal, an HA sensor response time sequence signal, a JB sensor response time sequence signal, a CA sensor response time sequence signal and an Ag/AgCl reference electrode sensor response time sequence signal.
3. The method for extracting difference characteristics of taste sensation signals based on kernel linear discriminant analysis according to any one of claims 1 or 2, wherein the detecting of the tea sample by using the electronic tongue comprises:
placing the sample and the cleaning solution on an autosampler of the electronic tongue in sequence;
the collection of each sample was repeated, and each collection was performed according to the procedure of "tea soup sample → washing solution 1 → washing solution 2".
4. The method for extracting difference features of taste sensation signals based on kernel linear discriminant analysis according to claim 1, wherein the analyzing and removing abnormal samples according to the response time series signals by using principal component residuals and mahalanobis distance method comprises:
for data set X ═ X1,x2,…,xN]∈Rm×NThe centralization is carried out, and the device is,
calculating a covariance matrix of the centralized data:
calculating eigenvalues and eigenvectors of the covariance matrix: cv ═ λ v;
the eigenvalue lambda of the covariance matrixiSorting according to the sequence from large to small, and sorting the eigenvectors corresponding to the eigenvalues according to the sequence from large to small;
by usingProjecting the data sample onto a feature vector obtained in Cv ═ λ v;
by usingCalculating the estimated value of the sample, wherein the principal component residual is the difference between the true value and the estimated value of the sample, i.e.
Wherein,v is a characteristic vector corresponding to the characteristic value;
the mahalanobis distance between sample points is: dij=[(xi-xj)T[Cov(X)]-1(xi-xj)]1/2;
And judging the sample points which are far away from the same type sample point and are distributed in the whole manner as abnormal sample elimination according to the principal component residual value and the Mahalanobis distance between the sample points and the same type sample mean value.
5. The method of any one of claims 1 or 4, wherein the optimizing the parameters of the kernel linear discriminant analysis method to select the parameters of the kernel linear discriminant analysis method based on the correct recognition rate of the quality grade of the tea comprises:
taking a Gaussian kernel function as a nonlinear conversion function of a kernel linear discriminant analysis method, and calculating the linear discriminant analysis of the Gaussian kernel function k (x, y) as exp (- | | x-y | |)2/2σ2) Parameter σ in2Carrying out optimization selection;
and selecting parameter values according to the correct recognition rate determined by the quality grade of the tea during parameter selection.
6. The method of claim 5, wherein the Gaussian kernel function is:
7. the method for extracting difference characteristics of taste sensation signals based on kernel linear discriminant analysis according to claim 5, wherein the non-linear feature extraction of sensor response signals by using kernel linear discriminant analysis method to obtain flavor characteristics of tea samples comprises:
by a non-linear transformation Φ:mapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN);
In a high-dimensional feature space, converting the problem of maximization of the Fisher criterion function into a problem of solving a feature value and a feature vector of a feature equation;
and carrying out nonlinear characteristic extraction on the sensor response signals to obtain the taste characteristics of the tea sample.
8. The method of claim 7, wherein said difference features of taste sensation signals are extracted by a nonlinear transformation Φ:mapping the input data to high-dimensional feature space, and obtaining a data point phi (x) after nonlinear transformation1),Φ(x2),…,Φ(xN) The method comprises the following steps:
inter-class dispersion matrix of training samples in high-dimensional feature spaceAnd intra-class dispersion matrixComprises the following steps:
wherein m isΦAnd mΦ,iRepresenting the mean and class i training samples of all training samples in the high-dimensional feature space, respectivelyMean value;
the Fisher criterion function in the high-dimensional feature space is:
in the high-dimensional feature space, the problem of maximizing the Fisher criterion function is converted into a problem of solving the feature value and the feature vector of the feature equation, and the method comprises the following steps:
define the kernel matrix K ═ K of N × Nij]Then the above formula becomes
KBKα=λKWKα
Wherein, Kij=k(xi,xj)=Φ(xi)TΦ(xj),B=GCGT,
C=diag(n1,n2,…,nL)∈RL×L,
9. The method of claim 8, wherein the performing nonlinear feature extraction on the sensor response signals to obtain the flavor features of the tea sample comprises:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The Fisher criterion function maximization is converted into a problem of solving generalized eigenvalues, and the eigenvalues of KBK α -lambda KWK α and corresponding eigenvectors α - α are solved1,α2,…,αN]TAnd is from large according to the characteristic valueSorting to a small order;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating a kernel matrix K-K of the training sample set according to the determined kernel function and the optimized kernel function parametersij]In which K isij=k(xi,xj)=Φ(xi)TΦ(xj);
The eigenvalues of KBK α λ KWK α and the corresponding eigenvectors α [ α ]1,α2,…,αN]TSorting according to the sequence of the characteristic values from large to small;
will train sample phi (x)i) The most sampled nonlinear feature projected onto the kth feature vector:
calculating test samplesAnd a kernel matrix K' between the training set samples, projecting the test samples onto the feature vectors
10. The method for extracting difference features of taste sensation signals based on kernel linear discriminant analysis according to claim 1, wherein the step of inputting the taste features of the tea sample into a classifier to perform tea quality grade determination comprises:
for the sample to be testedAnd training image sample xiCalculating the similarity between the image sample to be tested and the training image sample
If it isSample xiIf it belongs to class k, the test sample is testedIs decided as class k.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610404407.XA CN106096649B (en) | 2016-06-08 | 2016-06-08 | Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610404407.XA CN106096649B (en) | 2016-06-08 | 2016-06-08 | Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106096649A true CN106096649A (en) | 2016-11-09 |
CN106096649B CN106096649B (en) | 2019-08-06 |
Family
ID=57227570
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610404407.XA Active CN106096649B (en) | 2016-06-08 | 2016-06-08 | Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106096649B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106326915A (en) * | 2016-08-10 | 2017-01-11 | 北京理工大学 | Improved-Fisher-based chemical process fault diagnosis method |
CN107220670A (en) * | 2017-05-27 | 2017-09-29 | 重庆大学 | Supervised Artifical Taste system features extracting method is had based on wavelet transform |
CN110135372A (en) * | 2019-05-20 | 2019-08-16 | 闽江学院 | Action identification method based on linear judgement and SVM under VR artistic medium interactive environment |
CN111144522A (en) * | 2019-12-16 | 2020-05-12 | 浙江大学 | Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference |
CN111476702A (en) * | 2020-04-07 | 2020-07-31 | 兰州交通大学 | Image steganography detection method and system based on nonlinear mixed kernel feature mapping |
CN113190797A (en) * | 2021-04-18 | 2021-07-30 | 宁波大学科学技术学院 | PTA device gross error discrimination method based on online rolling discrimination feature analysis |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103487537A (en) * | 2013-07-30 | 2014-01-01 | 中国标准化研究院 | Detection method for producing areas of Xihulongjing tea based on genetic algorithm optimization |
CN103487558A (en) * | 2013-07-30 | 2014-01-01 | 中国标准化研究院 | Detection method for abnormal samples in mode identification and analysis of tea quality through intelligent sensory signals |
CN104036298A (en) * | 2013-09-23 | 2014-09-10 | 苏州工业职业技术学院 | High-spectrum remote sensing image end-member classification method based on Fisher self-adaptive learning |
CN104504407A (en) * | 2014-12-17 | 2015-04-08 | 西南大学 | Electronic nose feature selection optimization method on basis of multiple Fisher kernel discriminant analysis |
CN105181761A (en) * | 2015-08-26 | 2015-12-23 | 安徽农业大学 | Method for rapidly identifying irradiation absorbed dose of tea by using electronic nose |
-
2016
- 2016-06-08 CN CN201610404407.XA patent/CN106096649B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103487537A (en) * | 2013-07-30 | 2014-01-01 | 中国标准化研究院 | Detection method for producing areas of Xihulongjing tea based on genetic algorithm optimization |
CN103487558A (en) * | 2013-07-30 | 2014-01-01 | 中国标准化研究院 | Detection method for abnormal samples in mode identification and analysis of tea quality through intelligent sensory signals |
CN104036298A (en) * | 2013-09-23 | 2014-09-10 | 苏州工业职业技术学院 | High-spectrum remote sensing image end-member classification method based on Fisher self-adaptive learning |
CN104504407A (en) * | 2014-12-17 | 2015-04-08 | 西南大学 | Electronic nose feature selection optimization method on basis of multiple Fisher kernel discriminant analysis |
CN105181761A (en) * | 2015-08-26 | 2015-12-23 | 安徽农业大学 | Method for rapidly identifying irradiation absorbed dose of tea by using electronic nose |
Non-Patent Citations (3)
Title |
---|
RUICONG ZHI等: "New dimensionality reduction model (manifold learning) coupled with electronic tongue for green tea grade identification", 《EUROPEAN FOOD RESEARCH AND TECHNOLOGY》 * |
周昌亮: "基于LDA和KDA的人脸识别算法研究", 《中国优秀硕士学位论文库,信息科技辑》 * |
支瑞聪: "基于谱图理论的人脸表情识别算法研究", 《中国博士学位论文全文数据库 信息科技辑》 * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106326915A (en) * | 2016-08-10 | 2017-01-11 | 北京理工大学 | Improved-Fisher-based chemical process fault diagnosis method |
CN106326915B (en) * | 2016-08-10 | 2019-08-02 | 北京理工大学 | A kind of Fault Diagnosis for Chemical Process method based on improvement core Fisher |
CN107220670A (en) * | 2017-05-27 | 2017-09-29 | 重庆大学 | Supervised Artifical Taste system features extracting method is had based on wavelet transform |
CN107220670B (en) * | 2017-05-27 | 2020-07-14 | 重庆大学 | Method for extracting characteristics of supervised artificial taste system based on discrete wavelet transform |
CN110135372A (en) * | 2019-05-20 | 2019-08-16 | 闽江学院 | Action identification method based on linear judgement and SVM under VR artistic medium interactive environment |
CN111144522A (en) * | 2019-12-16 | 2020-05-12 | 浙江大学 | Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference |
CN111144522B (en) * | 2019-12-16 | 2021-01-08 | 浙江大学 | Power grid NFC equipment fingerprint authentication method based on hardware intrinsic difference |
CN111476702A (en) * | 2020-04-07 | 2020-07-31 | 兰州交通大学 | Image steganography detection method and system based on nonlinear mixed kernel feature mapping |
CN113190797A (en) * | 2021-04-18 | 2021-07-30 | 宁波大学科学技术学院 | PTA device gross error discrimination method based on online rolling discrimination feature analysis |
CN113190797B (en) * | 2021-04-18 | 2024-07-16 | 南京医工交叉创新研究院有限公司 | PTA device rough difference discriminating method based on online rolling discriminating characteristic analysis |
Also Published As
Publication number | Publication date |
---|---|
CN106096649B (en) | 2019-08-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106096649B (en) | Sense of taste inductive signal otherness feature extracting method based on core linear discriminant analysis | |
CN107818298B (en) | General Raman spectrum feature extraction method for machine learning substance identification algorithm | |
CN109142317B (en) | Raman spectrum substance identification method based on random forest model | |
CN103412003B (en) | Gas detection method based on self-adaption of semi-supervised domain | |
CN109002859B (en) | Sensor array feature selection and array optimization method based on principal component analysis | |
CN106951914B (en) | Method for identifying vinegar variety by electronic nose for optimizing fuzzy identification vector extraction | |
CN110378374B (en) | Tea near infrared spectrum classification method for extracting fuzzy identification information | |
CN104374739A (en) | Identification method for authenticity of varieties of seeds on basis of near-infrared quantitative analysis | |
CN105954412A (en) | Sensor array optimization method for Carya cathayensis freshness detection | |
Tripathy et al. | Electronic nose for black tea quality evaluation using kernel based clustering approach | |
Liao et al. | Recognition of partial discharge patterns | |
CN103942526B (en) | Linear feature extraction method for discrete data point set | |
KR102013392B1 (en) | Gas detection method using SVM classifier | |
CN109886296A (en) | A kind of authentication information extracts the local tea variety classification method of formula noise cluster | |
CN105913856A (en) | Audio tampering detection method and system based on amplitude co-occurrence vector characteristics | |
CN110987856B (en) | Cosmetic quality rapid identification method based on formula system and fingerprint spectrum | |
US6496742B1 (en) | Classifying apparatus designed in particular for odor recognition | |
CN106018515B (en) | A kind of electronic tongues signal characteristic extracting methods based on manifold learning | |
Lazaro et al. | Chemometric data analysis for black tea fermentation using principal component analysis | |
JP5802916B2 (en) | Sensory data identification device and program | |
CN111275100B (en) | Image feature identification method based on training set sample low-rank screening | |
Dusio et al. | Fingerprint sample quality assessment via ridge line count using laplacian of gaussian edge finding | |
Nasution et al. | A Low Cost Electronic Nose System for Classification of Gayo Arabica Coffee Roasting Levels Using Stepwise Linear Discriminant and K-Nearest Neighbor. | |
Gupta et al. | Survey on Tea Discriminator | |
Lelono et al. | Quality Classification of Chili Sauce Using Electronic Nose with Principal Component Analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |