CN102072767A - Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device - Google Patents

Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device Download PDF

Info

Publication number
CN102072767A
CN102072767A CN 201010601351 CN201010601351A CN102072767A CN 102072767 A CN102072767 A CN 102072767A CN 201010601351 CN201010601351 CN 201010601351 CN 201010601351 A CN201010601351 A CN 201010601351A CN 102072767 A CN102072767 A CN 102072767A
Authority
CN
China
Prior art keywords
wavelength
infrared spectrum
spectrum
regression
quantitative analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010601351
Other languages
Chinese (zh)
Inventor
郝勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
East China Jiaotong University
Original Assignee
East China Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by East China Jiaotong University filed Critical East China Jiaotong University
Priority to CN 201010601351 priority Critical patent/CN102072767A/en
Publication of CN102072767A publication Critical patent/CN102072767A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to a wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and a wavelength similarity consensus regression-based infrared spectrum quantitative analysis device. The device comprises a spectrometer, a preprocessor, a wavelength filter and a partial least-squares regression analyzer which are connected through a data signal line; and the wavelength of the near-infrared spectrum ranges from 780 to 50,000nm. By performing cluster analysis on spectra along the wavelength direction, a spectrum is divided into different information blocks, multiple models are constructed, and the difficulty for a single model method to extract information of a spectrum signal is overcome; and by singly determining factor number for different submodels, effective information is fully extracted, and the prediction accuracy and robustness of an infrared spectrum analysis model are improved.

Description

Return the IR spectrum quantitative analysis method and apparatus based on wavelength similarity common recognition
Technical field
The present invention relates to a kind of IR spectrum quantitative analysis method and apparatus, particularly return the IR spectrum quantitative analysis method and apparatus based on wavelength similarity common recognition.
Background technology
Owing to both contained useful information in the spectral signal that spectrometer obtains, other various stochastic errors (background and noise) are also superposeing simultaneously.Therefore, when utilizing partial least-square regression method to carry out quantitative test, be difficult to obtain the higher model of precision of prediction.At this problem of precision of prediction that improves the partial least squares regression model, number of research projects is carried out, mainly comprise preprocessing procedures and variable STUDY ON SCREENING at present, these methods have been successfully used to the elimination of spectral background correction, noise removing, no information variable.Yet when utilizing said method to carry out information extraction, regular meeting face useful information extract insufficient or the information extracted in comprise noisy problem.When utilizing multivariate calibration methods to carry out the quantitative test of infrared spectrum; often face the problem (as regular parameter and nuclear parameter selection etc. in selection of factor number and the least square support vector regression in the partial least squares regression) that parameter is selected; if spectral signal is adopted unified parameter, usually can cause the information extraction of Partial Variable insufficient or introduce the phenomenon of noise information.
Cluster analysis belongs to no supervised recognition method, is usually used in the division of sample ownership.If sample belongs to same class, show that sample has similarity preferably in such.When clustering method was used for the attribution analysis of spectrum similar argument, the spectral variables in the same class included similar spectral information; And the spectral variables between class has been represented different composition informations.
Based on the analysis of above-mentioned two aspects, this paper has invented a kind of IR spectrum quantitative analysis method and apparatus that returns based on similar wavelength cluster common recognition.In this invention, directly utilize the several similar spectral variables zone after the cluster analysis to set up a plurality of recurrence submodels, last antithetical phrase predicted results is carried out integrated, can select different regression parameters at different spectral information classes like this, extract the useful information under each spectral variables class more flexibly effectively.Therefore be the infrared spectrum analysis that a kind of extremely has application prospect based on similar wavelength cluster common recognition homing method.
Summary of the invention
In the application Infrared Spectrum Technology component to be measured is carried out in the express-analysis process, in order to solve the inaccurate problem of withdrawing spectral information.The invention provides a kind of IR spectrum quantitative analysis method and apparatus that returns based on wavelength similarity cluster common recognition.This method and apparatus makes full use of the characteristics of cluster analysis " things of a kind come together, people of a mind fall into the same group ", and the message block that being divided into of the similar spectral variables of infrared spectrum is different has realized making full use of and reasonable distribution of infrared spectrum information.
The device of realizing technique scheme comprises: grow up to be a useful person through spectrometer, pretreater, cluster analysis device, partial least squares regression mode set that data signal line links to each other.
Infrared and near infrared spectrum during described infrared spectrum comprises, promptly wavelength coverage is: 780 nm-50000 nm.
Described pretreater takes centralization and vector normalization that the original signal of spectrometer collection is handled.
Described cluster analysis device, the treatment step that specifically comprises is as follows: at first calculate the Euclidean distance between spectral variables, its computing formula is:
Figure 980184DEST_PATH_IMAGE001
Wherein Be
Figure 127448DEST_PATH_IMAGE003
Wavelength points and Distance between the wavelength points,
Figure 154627DEST_PATH_IMAGE005
With
Figure 884686DEST_PATH_IMAGE006
Be respectively
Figure 458625DEST_PATH_IMAGE003
Wavelength points and
Figure 451988DEST_PATH_IMAGE004
Wavelength points is at spectrum
Figure 737476DEST_PATH_IMAGE007
The absorbance at place.The distance of calculating between newly-built class and other wavelength points or the class adopts the mean distance method.Secondly, carry out cluster analysis and divide different spectral variables message block (block) into according to the distance between the spectral variables of aforementioned calculation.
Described partial least squares regression mode set is grown up to be a useful person, and its concrete operations are as follows: analyze each the spectral variables piece after handling through the cluster analysis device is set up corresponding partial least squares regression submodel.In each submodel, number of principal components can be selected different numerical value.Utilize the submodel of having set up respectively test set sample index to be predicted, predicted the outcome accordingly respectively.Then predicting the outcome of each submodel is weighted, obtains the predicted value of final sample index.By comparing under the different decomposition yardstick, the predicted root mean square error of model (RMSEP) value is determined class number and the model parameter and the preservation of suitable cluster analysis,, be used for the forecast analysis of follow-up fresh sample infrared spectrum.
Because the present invention adopts above technical scheme, obtains following effect:
By spectrum is carried out cluster analysis along wavelength direction, make a spectrum be divided into different message block, realized the structure of multi-model, overcome the difficulty of single model method to the spectral signal information extraction; By different submodels are determined factor number separately, realized the abundant extraction of effective information, improved the precision of prediction and the robustness of Infrared spectroscopy model.
Description of drawings
The IR spectrum quantitative analysis method and apparatus synoptic diagram that Fig. 1 wavelength similarity cluster common recognition returns;
Fig. 2 near infrared light spectrogram;
Fig. 3 wavelength similarity cluster common recognition homing method synoptic diagram;
Fig. 4 cluster analysis divided block number is to the influence of wavelength similarity cluster common recognition homing method precision of prediction
Fig. 5 wavelength similarity cluster synoptic diagram;
The IR spectrum quantitative analysis model predication value that Fig. 6 wavelength similarity cluster common recognition returns and the correlogram of reference value.
Embodiment
Embodiment describes in conjunction with following embodiment.Near infrared spectrum with pears is an example, the pol index of pears inside is carried out the structure of wavelength similarity cluster common recognition regression model.
Fig. 1 is the IR spectrum quantitative analysis method and apparatus synoptic diagram that wavelength similarity cluster common recognition returns, and Fig. 2 is the near infrared light spectrogram, and spectral range is 750 ~ 1800 nm, and every spectrum comprises 1051 data points.All samples is divided into calibration set and test set according to the ratio of 2:1.
All samples spectrum is carried out cluster analysis, in the compute classes and the distance between class adopt Euclidean distance method and average Furthest Neighbor respectively.Figure 3 shows that wavelength similarity cluster common recognition homing method synoptic diagram, as we know from the figure, the principle of this method is to select wavelength points at random, and the wavelength points that comprises in the same spectral information piece after the cluster is not necessarily adjacent.
Figure 4 shows that of the influence (wherein spectral information piece get 1 o'clock, corresponding be the predicted root mean square error of overall optical spectrum model) of cluster analysis divided block number to wavelength similarity cluster common recognition homing method precision of prediction.As we know from the figure, along with the increase of clustering information piece, predicted root mean square error reduces earlier afterwards to increase, and different clustering information pieces are counted predicting the outcome of drag and had notable difference.
Figure 5 shows that wavelength similarity cluster synoptic diagram.From figure can, spectrum is 3 message block along the wavelength direction cluster analysis.Respectively 3 message block are set up the partial least squares regression model, corresponding number of principal components is respectively 3,7,3, and the number of principal components of full spectrum is 4, as seen adopts wavelength similarity cluster common recognition homing method to be more conducive to extract at comprising useful information in the spectrum.
Fig. 6 is the correlogram of infrared spectrum wavelength similarity cluster common recognition model predication value and reference value.As we know from the figure, obtained quantitative result preferably.
Effect of the present invention is: more help to extract the useful information of the lower infrared spectrum in different wave length point zone by adopting wavelength similarity cluster common recognition homing method, obtain more accurate, stable predicting the outcome. Therefore, wavelength similarity cluster common recognition homing method is expected to become the spectroscopic analysis methods that a kind of extremely has application prospect.

Claims (1)

1. return the IR spectrum quantitative analysis method and apparatus based on wavelength similarity common recognition, it is characterized in that: comprise through data signal line continuous spectrometer, pretreater, cluster analysis device, partial least squares regression mode set and growing up to be a useful person;
Infrared and near infrared spectrum during described infrared spectrum comprises, promptly wavelength coverage is: 780 nm-50000 nm;
Described pretreater takes centralization and vector normalization that the original signal of spectrometer collection is handled.
CN 201010601351 2010-12-23 2010-12-23 Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device Pending CN102072767A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010601351 CN102072767A (en) 2010-12-23 2010-12-23 Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010601351 CN102072767A (en) 2010-12-23 2010-12-23 Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device

Publications (1)

Publication Number Publication Date
CN102072767A true CN102072767A (en) 2011-05-25

Family

ID=44031402

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010601351 Pending CN102072767A (en) 2010-12-23 2010-12-23 Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device

Country Status (1)

Country Link
CN (1) CN102072767A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103018195A (en) * 2012-12-07 2013-04-03 西安近代化学研究所 Method for determination of PCTFE content in PBX explosive by near infrared spectrum
CN104833653A (en) * 2015-04-15 2015-08-12 北京理工大学 Method for rapidly analyzing content of hexogen in mixed explosive
CN104865219A (en) * 2015-05-06 2015-08-26 江西出入境检验检疫局检验检疫综合技术中心 Method for identifying cotton and hemp fibers rapidly
CN104865218A (en) * 2015-05-06 2015-08-26 江西出入境检验检疫局检验检疫综合技术中心 Method for measuring cotton and hemp blend fiber content
CN106841092A (en) * 2017-03-01 2017-06-13 广西民族大学 A kind of Sugarcane Disease infrared identification method
CN107831135A (en) * 2017-10-23 2018-03-23 大连理工大学 It is a kind of to establish two-dimentional qualitative analysis model using near infrared spectroscopy to differentiate the method in the fresh extra large stichopus japonicus place of production
CN105842183B (en) * 2016-03-17 2018-10-02 东北大学 A kind of infrared spectrum modeling method based on common recognition selection technique
CN109063767A (en) * 2018-07-31 2018-12-21 温州大学 A kind of near infrared spectrum modeling method known together based on sample and variable
CN111257277A (en) * 2018-11-30 2020-06-09 湖南中烟工业有限责任公司 Tobacco leaf similarity judgment method based on near infrared spectrum technology

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1696660A (en) * 2005-04-05 2005-11-16 中国药品生物制品检定所 Method and equipment for identifying medication by using near infrared spectral analysis
CN1831515A (en) * 2006-04-03 2006-09-13 浙江大学 Method for nondistructive discriminating crop seed variety using visible light and near-infrared spectrum technology
CN1900697A (en) * 2006-07-27 2007-01-24 河南科技大学 Near infrared spectrum quick detecting technique for E.coli
WO2009087614A2 (en) * 2008-01-08 2009-07-16 Opgal Optronic Industries Ltd. System and method for gas leakage detection
CN101915744A (en) * 2010-07-05 2010-12-15 北京航空航天大学 Near infrared spectrum nondestructive testing method and device for material component content

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1696660A (en) * 2005-04-05 2005-11-16 中国药品生物制品检定所 Method and equipment for identifying medication by using near infrared spectral analysis
CN1831515A (en) * 2006-04-03 2006-09-13 浙江大学 Method for nondistructive discriminating crop seed variety using visible light and near-infrared spectrum technology
CN1900697A (en) * 2006-07-27 2007-01-24 河南科技大学 Near infrared spectrum quick detecting technique for E.coli
WO2009087614A2 (en) * 2008-01-08 2009-07-16 Opgal Optronic Industries Ltd. System and method for gas leakage detection
CN101915744A (en) * 2010-07-05 2010-12-15 北京航空航天大学 Near infrared spectrum nondestructive testing method and device for material component content

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《中国博士学位论文全文数据库 工程科技I辑》 20100715 郝勇 近红外光谱微量分析方法研究 , *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103018195A (en) * 2012-12-07 2013-04-03 西安近代化学研究所 Method for determination of PCTFE content in PBX explosive by near infrared spectrum
CN104833653A (en) * 2015-04-15 2015-08-12 北京理工大学 Method for rapidly analyzing content of hexogen in mixed explosive
CN104865219A (en) * 2015-05-06 2015-08-26 江西出入境检验检疫局检验检疫综合技术中心 Method for identifying cotton and hemp fibers rapidly
CN104865218A (en) * 2015-05-06 2015-08-26 江西出入境检验检疫局检验检疫综合技术中心 Method for measuring cotton and hemp blend fiber content
CN104865219B (en) * 2015-05-06 2017-06-23 江西出入境检验检疫局检验检疫综合技术中心 The method of quick discriminating cotton fibriia
CN104865218B (en) * 2015-05-06 2017-06-23 江西出入境检验检疫局检验检疫综合技术中心 The quick method for determining cotton ramie blended spinning fiber content
CN105842183B (en) * 2016-03-17 2018-10-02 东北大学 A kind of infrared spectrum modeling method based on common recognition selection technique
CN106841092A (en) * 2017-03-01 2017-06-13 广西民族大学 A kind of Sugarcane Disease infrared identification method
CN107831135A (en) * 2017-10-23 2018-03-23 大连理工大学 It is a kind of to establish two-dimentional qualitative analysis model using near infrared spectroscopy to differentiate the method in the fresh extra large stichopus japonicus place of production
CN109063767A (en) * 2018-07-31 2018-12-21 温州大学 A kind of near infrared spectrum modeling method known together based on sample and variable
CN111257277A (en) * 2018-11-30 2020-06-09 湖南中烟工业有限责任公司 Tobacco leaf similarity judgment method based on near infrared spectrum technology
CN111257277B (en) * 2018-11-30 2023-02-17 湖南中烟工业有限责任公司 Tobacco leaf similarity judgment method based on near infrared spectrum technology

Similar Documents

Publication Publication Date Title
CN102072767A (en) Wavelength similarity consensus regression-based infrared spectrum quantitative analysis method and device
CN104062263B (en) The near-infrared universal model detection method of light physical property close fruit quality index
CN105486655B (en) The soil organism rapid detection method of model is intelligently identified based on infrared spectroscopy
CN108181263B (en) Tobacco leaf position feature extraction and discrimination method based on near infrared spectrum
CN110531054B (en) Soil organic carbon prediction uncertainty estimation method based on Bootstrap sampling
CN107179310B (en) Raman spectrum characteristic peak recognition methods based on robust noise variance evaluation
CN103528990A (en) Method for establishing multiple models of near infrared spectrums
CN104990895B (en) A kind of near infrared spectrum signal standards normal state bearing calibration based on regional area
WO2017045296A1 (en) Online near-infrared sample size determining method
CN109060771B (en) Consensus model construction method based on different characteristic sets of spectrum
CN104215591A (en) Damage-free visible-near infrared light spectrum detecting method
CN104063710A (en) Method for removing abnormal spectrum in actual measurement spectrum curve based on support vector machine model
CN113008805A (en) Radix angelicae decoction piece quality prediction method based on hyperspectral imaging depth analysis
CN114216877B (en) Automatic detection and reconstruction method and system for spectral peak in tea near infrared spectral analysis
CN112525869A (en) Sectional type detection method for pesticide residues
CN102937575B (en) Watermelon sugar degree rapid modeling method based on secondary spectrum recombination
CN102128805A (en) Method and device for near infrared spectrum wavelength selection and quick quantitative analysis of fruit
CN110779875B (en) Method for detecting moisture content of winter wheat ear based on hyperspectral technology
CN102135496A (en) Infrared spectrum quantitative analysis method and infrared spectrum quantitative analysis device based on multi-scale regression
CN109283153B (en) Method for establishing quantitative analysis model of soy sauce
CN117095771B (en) High-precision spectrum measurement data optimization processing method
CN102353648A (en) Method for detecting fertilizer components
CN110231328B (en) Raman spectrum quantitative analysis method based on half-peak height distance method
CN116858822A (en) Quantitative analysis method for sulfadiazine in water based on machine learning and Raman spectrum
Wang et al. Monitoring model for predicting maize grain moisture at the filling stage using NIRS and a small sample size

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20110525