CN103198331B - Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA - Google Patents
Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA Download PDFInfo
- Publication number
- CN103198331B CN103198331B CN201310098307.5A CN201310098307A CN103198331B CN 103198331 B CN103198331 B CN 103198331B CN 201310098307 A CN201310098307 A CN 201310098307A CN 103198331 B CN103198331 B CN 103198331B
- Authority
- CN
- China
- Prior art keywords
- sample
- pca
- rightarrow
- matrix
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
The invention discloses a multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA. The multiple spectrogram characteristic amalgamation and recognition method based on analysis of the PCA-LDA comprises the following steps: utilizing a PCA-LDA method to extract spectrogram characteristics of all samples; amalgamating the extracted spectrogram characteristics of all the samples to a multiple spectrogram characteristics amalgamation matrix; establishing a multiple spectrogram amalgamation SVM classifier; and utilizing multiple spectrogram amalgamation SVM classifier samples to recognize. The multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA can quickly conduct accurate analytical recognition to the integral characteristic of complex mixture.
Description
Technical field
The present invention relates to a kind of many spectrograms Feature Fusion recognition methodss based on PCA_LDA analysis, particularly a kind of quick
Complex mixture global feature spectrum analysis method.
Background technology
Pattern recognition is widely used in respectively with reference to analytical technologies such as spectrum, chromatograph, ion mobility spectrometry, mass spectrum, nuclear magnetic resonance, NMR
On based food quality control.Raman spectrum is a kind of scattering spectrum.Raman spectrum analyses method is to be drawn based on India scientist C.V.
Graceful found Raman scattering effect, is analyzed to the scattering spectrum different from incident light frequency to obtain molecular vibration, to turn
Dynamic aspect information, and it is applied to a kind of analysis method of molecular structure research.Being mainly characterized by of Raman spectrum be quick, simple,
Sample is reproducible without pre-treatment, signal.Additionally due to the Raman scattering of water is very faint, therefore Raman spectrum can carry out water
Material measurement detection in solution, greatly expands and carries out studying detection range.
Ion mobility spectrometry is that a kind of new gas phase occurring in early 1970s separates and detection technique.It is with ion
The difference of drift time is qualitative come the separation to carry out ion, by the concept similar to chromatographic retention, is originally referred to as
Gas ions chromatograph.Ion mobility spectrometry is a kind of efficient separate analytical technique, have been widely used petrochemical industry, environmental analyses,
The fields such as food analyses.Mass spectrum is a kind of analysis side being analyzed by the mensure of the mass-to-charge ratio to sample ion
Method.Because mass spectral analyses have, sensitivity is high, and amount of samples is few, and analyze speed is fast, separates and identifies the advantages of simultaneously carry out, because
This, mass-spectrometric technique is also widely used in chemical, environmental analyses, the field such as medicine analysis.
With the quick change of fake and forged technology, all kinds of in current China public product particularly Safety of Food Quality
Risk is increasingly increased with suffering, increasingly urgent to the demand of the science of Product quality and safety information, quick obtaining technology and equipment.
In recent years, a series of food safety malignant events of interior particularly domestic generation at the international level, show to take with a certain item or
A few eigenvalues of person come also more and more significant the drawbacks of judging product quality, carry out qualitative, quantitative using Global Information to product
Judge becoming the focus of research.
For complicated chemical combination objects system, individual spectrogram is difficult to comprehensively reflect the Chemical Characteristics of product, needs
It is combined many spectrogram information, will reflect that the different semiochemical various spectrogram information of product merge, comprehensive characterization product
Chemical composition characteristic so that various information carries out effectively complementary, strengthen the trust degree of data, improve precision of prediction, can
By property and robustness, this judges there is important Research Significance to the Global Information of food.Although the grinding of many spectrograms information fusion
Study carefully and achieve certain achievement, in color-analytical tool such as matter combination, GC-FTIR, have some simple data fusion, but
Mass spectrum, infrared spectrum are depended on to material entirety qualitative/quantitative judgement aspect.Do not realize spectrogram information inside combined system
Merge, be more not carried out the information fusion between different combined systems.
Content of the invention
The technical problem to be solved is to overcome the deficiencies in the prior art, provides one kind can be quickly to multiple
Hybrid compound global feature is accurately analyzed the many spectrograms Feature Fusion recognition methodss based on PCA_LDA analysis of identification.
The present invention specifically employs the following technical solutions solution above-mentioned technical problem:The present invention devises one kind and is based on PCA_
Many spectrograms Feature Fusion recognition methodss of LDA analysis, comprise the following specific steps that:
Step(1):N sample of collection, sets n sample and belongs to c kind classification, and each sample is chosen with m different spectrum
Figure, using PCA_LDA method extract all samples chromatogram characteristic, and by extract all samples chromatogram characteristic be fused to many
Spectrum fusion feature matrix;
Step(2):Using step(1)In multispectral fusion feature matrix set up multispectral fusion SVM classifier;
Step(3):Using step(2)The multispectral fusion SVM classifier of middle foundation is to step(1)The sample of middle collection is carried out
Identification.
A kind of optimization method as the present invention:Described step(1)Including specifically processing as follows:
Step(11):The training sample set data of i-th sample of labelling isThe m dimension data collection vector of i-th sample
ForThen n × m raw data matrix of all samples isWherein, i=1,2,
3 ... n, liIt is defined asClass label;
Step(12):Using step(11)In raw data matrix X and equation below calculate class in mean dispersion error matrix Sw:
Wherein, ngRepresent the number of samples of g class,It is defined as the mean vector of the sample of g class, g=1,2,3 ... c,It is defined as the vector of p-th sample in g class sample, T is defined as matrix transpose, p=1,2,3 ... ng;
Step(13):Using step(11)In raw data matrix X and equation below calculate class between mean dispersion error matrix Sb:
Wherein,It is defined as the mean vector of all samples,It is defined as the mean vector of g class sample;
Step(14), using step(12)And step(13)In Sw、SbAnd formulaCalculate eigenvalue λ1,
λ2,…,λc-1And the characteristic vector corresponding to eigenvalueAnd obtain projection matrix
Step(15):To step(14)In projection matrix WLDACarry out PCA dimension-reduction treatment, obtain PCA projection matrix WPCA,
Again by PCA projection matrix WPCAProject in q dimension space, obtain q dimension space eigenmatrix XPCA, wherein, XPCA=X·WPCA;
Step(16):By WLDA、WPCAAnd XPCAComplete final feature extraction using equation below:
Then after the individual not projection of cospectral graph of m, eigenmatrix is respectivelyWherein,It is defined as the spy after projecting
Levy matrix;
Step(17):Using formulaAfter the different collection of illustrative plates projections of i-th sample
Eigenmatrix carries out the multispectral fusion feature vector that fusion treatment obtains i-th sample, and all of sample is carried out same
Fusion treatment, obtains the multispectral fusion feature matrix of all samplesWherein,Eigenmatrix vector after definition projection, is θf
It is defined as the combination coefficient of f kind collection of illustrative plates, f=1,2,3 ... m.
A kind of optimization method as the present invention:Described step(2)Including specifically processing as follows:
Step(21):By step(17)In the multispectral fusion feature matrix that obtainsSubstitute into such as minor function:
Step(22):Using step(21)The supporting vector obtainingTry to achieve the multispectral fusion of z-th sample with equation below
Bigoted amount b of characteristic vector* z:
Step(23):Step(22)Bigoted amount b obtaining* zAnd supporting vectorSubstitute into as in minor function:
The present invention compared with prior art has the advantage that:
Dissimilar spectral data can be carried out multidimensional characteristic fusion by the present invention, can be efficiently applied to divide based on spectrogram
Analysis complex mixture Classification and Identification.
Brief description
Fig. 1 is many spectrograms Feature fusion flow chart in the present invention.
Specific embodiment
The present invention is described in further detail below in conjunction with the accompanying drawings:
The present invention devises a kind of many spectrograms Feature Fusion recognition methodss based on PCA_LDA analysis, including concrete as follows
Step:
Step(1):N sample of collection, sets n sample and belongs to c kind classification, and each sample is chosen with m different spectrum
Figure, using PCA_LDA method extract all samples chromatogram characteristic, and by extract all samples chromatogram characteristic be fused to many
Spectrum fusion feature matrix;
Step(2):Using step(1)In multispectral fusion feature matrix set up multispectral fusion SVM classifier;
Step(3):Using step(2)The multispectral fusion SVM classifier of middle foundation is to step(1)The sample of middle collection is carried out
Identification.
As shown in figure 1, a kind of optimization method as the present invention:Described step(1)Including specifically processing as follows:
Step(11):The training sample set data of i-th sample of labelling isThe m dimension data collection of i-th sample to
Measure and beThen n × m raw data matrix of all samples isWherein, i=1,
2,3 ... n, liIt is defined asClass label;
Step(12):Using step(11)In raw data matrix X and equation below calculate class in mean dispersion error matrix Sw:
Wherein, ngRepresent the number of samples of g class,It is defined as the mean vector of the sample of g class, g=1,2,3 ... c,It is defined as the vector of p-th sample in g class sample, T is defined as matrix transpose, p=1,2,3 ... ng;
Step(13):Using step(11)In raw data matrix X and equation below calculate class between mean dispersion error matrix Sb:
Wherein,It is defined as the mean vector of all samples,It is defined as the mean vector of g class sample;
Step(14), using step(12)And step(13)In Sw、SbAnd formulaCalculate eigenvalue λ1,
λ2,…,λc-1And the characteristic vector corresponding to eigenvalueAnd obtain projection matrix
Step(15):To step(14)In projection matrix WLDACarry out PCA dimension-reduction treatment, obtain PCA projection matrix WPCA,
Again by PCA projection matrix WPCAProject in q dimension space, obtain q dimension space eigenmatrix XPCA, wherein, XPCA=X·WPCA;
Step(16):By WLDA、WPCAAnd XPCAComplete final feature extraction using equation below:
Then after the individual not projection of cospectral graph of m, eigenmatrix is respectivelyWherein,It is defined as the spy after projecting
Levy matrix;
Step(17):Using formulaAfter the different collection of illustrative plates projections of i-th sample
Eigenmatrix carries out the multispectral fusion feature vector that fusion treatment obtains i-th sample, and all of sample is carried out same
Fusion treatment, obtains the multispectral fusion feature matrix of all samplesWherein,Eigenmatrix vector after definition projection, is θf
It is defined as the combination coefficient of f kind collection of illustrative plates, f=1,2,3 ... m.
A kind of optimization method as the present invention:Described step(2)Including specifically processing as follows:
Step(21):By step(17)In the multispectral fusion feature matrix that obtainsSubstitute into such as minor function:
Step(22):Using step(21)The supporting vector obtainingTry to achieve the multispectral fusion of z-th sample with equation below
Bigoted amount b of characteristic vector* z:
Step(23):Step(22)Bigoted amount b obtaining* zAnd supporting vectorSubstitute into as in minor function:
In a particular embodiment, our experiment respectively the indigo plant wine sample to 50 skies and 50 seas indigo plant wine sample collection Raman
Spectrogram, mass spectrum and ion transfer spectrogram.
Raman spectrogram collection experiment is as follows:
Draw wine sample with capillary tube, be added to injection port, carry out spectrogram collection under conditions of HONGGUANG 632.8nm, collection
During indoor holding dark state, in order to avoid the impact to experiment for the visible ray.
Mass spectrum collection experiment is as follows:
It is analyzed using the direct Head-space sampling of atmospheric pressure capillary tube under room temperature.Major parameter condition:1)Ionization mode:Single
Photon ionizes;2)Ionized region stable gas pressure is in about 15.50Pa;3)Accelerating region voltage is 2650V, and 250 micron capillary column directly enter
Sample, no heating measures;4)Every TOF full Spectral Signal cumulative time is 15s;5)Head space volume about 10ml.
Test sample collection assumes extraordinary separation property, the blue sample in the indigo plant sample in sky and sea in the feature space after fusion
This is assembled in three dimensions in two different regions in feature space, illustrates that PCA_LDA feature proposed by the present invention carries
Take with many spectrograms Feature Fusion Algorithm to extract different brackets Chinese liquor internal characteristicses be effective.
Claims (1)
1. a kind of many spectrograms Feature Fusion recognition methodss based on PCA_LDA analysis are it is characterised in that include specifically walking as follows
Suddenly:
Step (1):N sample of collection, sets n sample and belongs to c kind classification, and each sample is chosen with m not cospectral graph, profit
Extract the chromatogram characteristic of all samples with PCA_LDA method, and by extract the chromatogram characteristic of all samples is fused to multispectral melting
Close eigenmatrix;
Described step (1) includes specifically processing as follows:
Step (11):The training sample set data of i-th sample of labelling isThe m dimension data collection vector of i-th sample isThen n × m raw data matrix of all samples isWherein, i=1,2,
3 ... n, liIt is defined asClass label;
Step (12):Calculate mean dispersion error matrix S in class using the raw data matrix X in step (11) and equation beloww:
Wherein, ngRepresent the number of samples of g class,It is defined as the mean vector of the sample of g class, g=1,2,3 ... c,Fixed
Justice is defined as matrix transpose, p=1,2,3 ... n for the vector of p-th sample in g class sample, Tg;
Step (13):Calculate mean dispersion error matrix S between class using the raw data matrix X in step (11) and equation belowb:
Wherein,It is defined as the mean vector of all samples,It is defined as the mean vector of the sample of g class;
Step (14):Using the S in step (12) and step (13)w、SbAnd formulaCalculate eigenvalue λ1, λ2..., λc-1
And the characteristic vector corresponding to eigenvalueAnd obtain projection matrix
Step (15):To the projection matrix W in step (14)LDACarry out PCA dimension-reduction treatment, obtain PCA projection matrix WPCA, then will
PCA projection matrix WPCAProject in q dimension space, obtain q dimension space eigenmatrix XPCA, wherein, XPCA=X WPCA;
Step (16):By WLDA、WPCAAnd XPCAComplete final feature extraction using equation below:
Then after the individual not projection of cospectral graph of m, eigenmatrix is respectivelyWherein,It is defined as the feature square after projecting
Battle array;
Step (17):Using formulaBy feature after the different collection of illustrative plates projections of i-th sample
Matrix carries out the multispectral fusion feature vector that fusion treatment obtains i-th sample, and carries out same fusion to all of sample
Process, obtain the multispectral fusion feature matrix of all samplesWherein,Eigenmatrix vector after definition projection, is θfDefinition
For the combination coefficient of f kind collection of illustrative plates, f=1,2,3 ... m;
Step (2):Set up multispectral fusion SVM classifier using the multispectral fusion feature matrix in step (1);
Described step (2) includes specifically processing as follows:
Step (21):The multispectral fusion feature matrix that will obtain in step (17)Substitute into such as minor function:
Try to achieveWhen support
VectorWherein, z=1,2,3 ... n, j=1,2,3 ... n,It is defined as the optimization of the multispectral fusion feature vector of z-th sample
Function;azAnd ajIt is all Lagrange multiplier, wherein z, j corresponds to multiplier subscript, and value is from 1 to n;lzRepresent the classification mark of z sample
Note, value from 1 to c, ljRepresent the category label of j sample, value is from 1 to c;
Step (22):The supporting vector being obtained using step (21)Try to achieve z-th sample multispectral fusion feature arrow with equation below
Bigoted amount b of amount* z:
Step (23):Bigoted amount b that step (22) obtains* zAnd supporting vectorSubstitute into as in minor function:
Wherein, k () is defined as the kernel operation of support vector machine,I.e.
Discriminant function for SVM classifier;
Step (3):Using the multispectral fusion SVM classifier set up in step (2), the sample of collection in step (1) is known
Not.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310098307.5A CN103198331B (en) | 2013-03-25 | 2013-03-25 | Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310098307.5A CN103198331B (en) | 2013-03-25 | 2013-03-25 | Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103198331A CN103198331A (en) | 2013-07-10 |
CN103198331B true CN103198331B (en) | 2017-02-08 |
Family
ID=48720862
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310098307.5A Active CN103198331B (en) | 2013-03-25 | 2013-03-25 | Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103198331B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103440292B (en) * | 2013-08-16 | 2016-12-28 | 新浪网技术(中国)有限公司 | Multimedia information retrieval method and system based on bit vectors |
CN109974835B (en) * | 2018-12-29 | 2021-06-04 | 无锡联河光子技术有限公司 | Vibration detection identification and space-time positioning method and system based on optical fiber signal characteristics |
CN113138248B (en) * | 2021-04-25 | 2022-08-02 | 贵州中烟工业有限责任公司 | Characteristic spectrum extraction method and detection method for feed liquid preparation quality stability |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20040042500A (en) * | 2002-11-14 | 2004-05-20 | 엘지전자 주식회사 | Face detection based on pca-lda |
CN101093539A (en) * | 2007-07-27 | 2007-12-26 | 哈尔滨工程大学 | Matching identification method by extracting characters of vein from finger |
CN102479320A (en) * | 2010-11-25 | 2012-05-30 | 康佳集团股份有限公司 | Face recognition method and device as well as mobile terminal |
CN102982322A (en) * | 2012-12-07 | 2013-03-20 | 大连大学 | Face recognition method based on PCA (principal component analysis) image reconstruction and LDA (linear discriminant analysis) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100543707B1 (en) * | 2003-12-04 | 2006-01-20 | 삼성전자주식회사 | Face recognition method and apparatus using PCA learning per subgroup |
-
2013
- 2013-03-25 CN CN201310098307.5A patent/CN103198331B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20040042500A (en) * | 2002-11-14 | 2004-05-20 | 엘지전자 주식회사 | Face detection based on pca-lda |
CN101093539A (en) * | 2007-07-27 | 2007-12-26 | 哈尔滨工程大学 | Matching identification method by extracting characters of vein from finger |
CN102479320A (en) * | 2010-11-25 | 2012-05-30 | 康佳集团股份有限公司 | Face recognition method and device as well as mobile terminal |
CN102982322A (en) * | 2012-12-07 | 2013-03-20 | 大连大学 | Face recognition method based on PCA (principal component analysis) image reconstruction and LDA (linear discriminant analysis) |
Non-Patent Citations (1)
Title |
---|
《基于最优特征子集的葡萄品种识别》;张航;《中国优秀硕士学位论文全文数据库信息科技辑》;20130215;I138-1361 * |
Also Published As
Publication number | Publication date |
---|---|
CN103198331A (en) | 2013-07-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhou et al. | Strategies for large-scale targeted metabolomics quantification by liquid chromatography-mass spectrometry | |
CN104297355B (en) | Simulative-target metabonomics analytic method based on combination of liquid chromatography and mass spectrum | |
Ridgeway et al. | Trends in trapped ion mobility–Mass spectrometry instrumentation | |
Dwivedi et al. | Metabolic profiling by ion mobility mass spectrometry (IMMS) | |
Vaclavik et al. | The use of high performance liquid chromatography–quadrupole time-of-flight mass spectrometry coupled to advanced data mining and chemometric tools for discrimination and classification of red wines according to their variety | |
Tuli et al. | LC–MS based detection of differential protein expression | |
Pan et al. | Untargeted metabolomic analysis of Chinese red wines for geographical origin traceability by UPLC-QTOF-MS coupled with chemometrics | |
Hu et al. | Characterization of volatile components in four vegetable oils by headspace two-dimensional comprehensive chromatography time-of-flight mass spectrometry | |
CN104297406B (en) | A kind of wide spectrum identifies the method for beta-receptor stimulant medicine | |
Li et al. | Simultaneous determination of eight illegal dyes in chili products by liquid chromatography–tandem mass spectrometry | |
Chen et al. | Application of dispersive solid-phase extraction and ultra-fast liquid chromatography–tandem quadrupole mass spectrometry in food additive residue analysis of red wine | |
CN108072690B (en) | Ion mobility spectrometry and ion trap mass spectrometry combined device and analysis method | |
Agin et al. | Metabolomics–an overview. From basic principles to potential biomarkers (part 1) | |
CN103293141A (en) | A liquor vintage recognition method based on a fusion technology of ion mobility spectrometry/ mass spectrometry/ Raman spectroscopy | |
CN103198331B (en) | Multiple spectrogram characteristic amalgamation and recognition method based on analysis of PCA-LDA | |
Hao et al. | Metandem: an online software tool for mass spectrometry-based isobaric labeling metabolomics | |
CN105891316A (en) | Method for simultaneously screening and analyzing forbidden and restricted dye in textiles | |
CN104364641A (en) | Quantification of an analyte in serum and other biological matrices | |
CN103675147B (en) | Method for rapidly determining caffeine in drink | |
WO2016168391A1 (en) | Structural elucidation of isotopically labeled analytes | |
Shvartsburg et al. | Protein analyses using differential ion mobility microchips with mass spectrometry | |
CN106198790B (en) | A kind of horse and mule common characteristic polypeptide and its application | |
Munjal et al. | Analytical techniques used in metabolomics: A review | |
J Brown et al. | Field asymmetric waveform ion mobility spectrometry analysis of proteins and peptides: a review | |
CN110483357A (en) | A kind of isolation and purification method of tangerine peel major carotenoids |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 210038 Jiangsu city of Nanjing Province Economic and Technological Development Zone No. 12 Chong Xing Ke Lu base room 103 Applicant after: Jiangsu Yipu Technology Co., Ltd. Address before: Baixia Road, Baixia District of Nanjing city of Jiangsu Province, No. 273 210000 Applicant before: Jiangsu Yipu Technology Co., Ltd. |
|
COR | Change of bibliographic data | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |