CN105203498A - Near infrared spectrum variable selection method based on LASSO - Google Patents

Near infrared spectrum variable selection method based on LASSO Download PDF

Info

Publication number
CN105203498A
CN105203498A CN201510581659.5A CN201510581659A CN105203498A CN 105203498 A CN105203498 A CN 105203498A CN 201510581659 A CN201510581659 A CN 201510581659A CN 105203498 A CN105203498 A CN 105203498A
Authority
CN
China
Prior art keywords
lasso
gamma
near infrared
infrared spectrum
beta
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510581659.5A
Other languages
Chinese (zh)
Inventor
卞希慧
颜鼎荷
李淑娟
谭小耀
李翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Polytechnic University
Original Assignee
Tianjin Polytechnic University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Polytechnic University filed Critical Tianjin Polytechnic University
Priority to CN201510581659.5A priority Critical patent/CN105203498A/en
Publication of CN105203498A publication Critical patent/CN105203498A/en
Pending legal-status Critical Current

Links

Landscapes

  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention provides a near infrared spectrum variable selection method based on LASSO. The method comprises the following concrete processes of collecting a near infrared spectrum of a sample, and using a conventional method for measuring a concentration vector of a tested ingredient; dividing a data set into a training set and a prediction set by adopting a certain grouping mode; determining the constraint value t of an LASSO method through crossed verification; using a minimum angle regression algorithm for calculating the regression coefficient beta; remaining the position of a wavelength point of the beta being not zero; building a partial least squares regression model between a training set spectrum and the concentration vector by utilizing the training set spectrum corresponding to the remained wavelength, and predicting the concentration of a tested ingredient of a prediction set sample. The method has the advantages that an effective wavelength can be extracted; a quantitative analysis model is simplified; the prediction precision of the model is improved. Compared with an existing variable selection method, the method has the advantages that the speed is high; the repeating performance can be realized; the higher prediction precision can be reached by using fewer variables. The near infrared spectrum variable selection method is applicable to the variable selection of complicated sample near infrared spectrums.

Description

A kind of near infrared spectrum Variable Selection based on LASSO
Technical field
This method invention belongs to the nondestructive analysis technical field in analytical chemistry field, is specifically related to a kind of near infrared spectrum Variable Selection based on LASSO.
Background technology
Near-infrared spectral analysis technology is the technology of high speed development in analytical chemistry field, and it has, and analysis efficiency is high, detection speed fast, without the need to advantages such as sample pretreatments, be widely used in the industry such as food, oil.Modling model between the content or classification of near infrared spectrum and measured matter, can realize the direct qualitative and quantitative analysis of complex material.In near infrared spectrum modeling, a very important problem is exactly there is redundancy wavelength in spectrum.General near infrared spectrum (NIR) comprises hundreds of wavelength variable point, and the character of some of them wavelength and research is incoherent, these uncorrelated wavelength points, can affect model quality, causes its predictive ability to decline.Therefore variables choice is the important content of spectrum modeling analysis always.
Variable Selection conventional in spectral data analysis mainly comprises the method for method based on intelligent optimization algorithm and Corpus--based Method.The former mainly contains simulated annealing (simulatedannealing, SA, see SwierengaH, deGrootPJ, deWeijerAP, DerksenMWJ, BuydensLMC, ImprovementofPLSmodeltransferabilitybyrobustwavelengthse lection, ChemomIntellLabSyst, 1998, 41, 237-248), genetic algorithm (geneticalgorithm, GA, see LeardiR, GonzalezAL, GeneticalgorithmsappliedtofeatureselectioninPLSregressio n:howandwhentousethem, ChemomIntellLabSyst, 1998, 41, 195-207), tabu search (Tabusearch, TS, see HagemanJA, StreppelM, WehrensR, WavelengthselectionwithTabuSearch, JChemometrics, 2003, 17, 427-437), ant group algorithm (antcolonyoptimization, ACO, see ShamsipurM, Zare-ShahabadiV, HemmateenejadB, AkhondM, Antcolonyoptimization:apowerfultoolforwavelengthselectio n, JChemometrics, 2006, 20, 146-157), particle cluster algorithm (particleswarmoptimization, PSO, see XuL, JiangJH, WuHL, ShenGL, YuRQ, Variable-weightedPLS, ChemomIntellLabSyst, 2007, 85, 140-143) etc., these optimized methods exist needs a large amount of parameters, search time is longer and be easily absorbed in the defects such as local optimum.The latter mainly contains without information variable removing method (UninformativeVariableElimination, UVE, see CentnerV, MassartDL, deNoordOE, JongS, VandeginsteBM, SternaC, Eliminationofuninformativevariablesformultivariatecalibr ation.AnalChem, 1996, 68, 3851-3858), Monte Carlo combines without information variable removing method (MonteCarloUninformativeVariableElimination, MCUVE, see CaiWS, LiYK, ShaoXG, Avariableselectionmethodbasedonuninformativevariableelim inationformultivariatecalibrationofnear-infraredspectra, ChemomIntellLabSyst, 2008, 90, 188-194), based on the Variable Selection method (RandomizationTest of randomized test, RT, see XuH, LiuZC, CaiWS, ShaoXG, Awavelengthselectionmethodbasedonrandomizationtestfornea r-infraredspectralanalysis.ChemomIntellLabSyst, 2009, 97, 189-193) etc.UVE method have employed leave one cross validation to obtain variable stability value, the repetitious computing of this process need, but also need introduce with original spectrum comprise the equal random noise variable of variables number, so when data set number is larger, the method counting yield is low, consuming time longer.MCUVE algorithm and RT method all introduce repeatedly modeling technique, the multiple models produced often more than single model can effectively extract from the different aspect of data and different aspects and express the complex relationship between independent variable and dependent variable, are conducive to more rationally, reliably choice variable.But due to the Stochastic choice of each modeling sample, make the operation result of these two kinds of methods there is certain instability, and also more time-consuming when data volume is larger.Therefore, be necessary to further develop novel Variable Selection fast, improve stability and the precision of prediction of model.
Summary of the invention
The object of the invention is for above-mentioned existing problems, a kind of quick, stable Variable Selection is provided.The method, under the absolute value sum of a regression coefficient is less than the condition of a constant, makes residual sum of squares (RSS) minimize, thus more strictly makes some regression coefficient vanishing, and corresponding variable is deleted, realizes variables choice.
Concrete steps are as follows:
(1) m sample to be tested is collected.Setting spectrum parameter, the near infrared spectrum of collecting sample, obtains the spectrum matrix X of sample.Measure the tested component concentration of sample by conventional method, obtain concentration vector y.Adopt certain packet mode that data are divided into training set and forecast set, wherein training set sample is used for Modling model Optimal Parameters, and forecast set sample is used for the predictive ability of testing model.
(2) cross validation is adopted to determine the binding occurrence t of LASSO.T controls the degree of compression, and t is less, and the degree of compression is stronger, and due to this restrictive condition, end product can make some component of regression coefficient β become 0, reaches the object of variables choice.
(3) utilize minimum angle regression algorithm to solve the regression coefficient β of LASSO, preserve the wavelength points position that regression coefficient is not 0.
β ^ = arg m i n β ∈ R p { ( y - X β ) T ( y - X β ) } s . t . Σ t = 1 p | β t | ≤ t
Minimum angle regression algorithm process is as follows:
1. Renewal model is selected in variables set (activeset), calculates related coefficient absolute value
y ^ 0 = 0 ; c ^ k j = x j T ( y - y ^ k - 1 ) ; C ^ k = m a x { | c ^ k j | }
Upgrade activesetA (k),
A ( k ) = A ( k - 1 ) + { j ^ } ; A ( 0 ) = φ ; j ^ = arg m i n j ∉ A ( k - 1 ) { | c ^ k j | }
2. minimum angular direction (u is determined k)
Make X k=(... s jx j) j ∈ A (k)
Wherein s j = s i g n { c ^ k j } , ω k = A k ( X k T X ) - 1 1 k , A k = ( 1 k T ( X k T X ) - 1 1 k ) - 0.5
1 kbe important be 1 vector, its length equals | A|.Calculate minimum angular direction: u k=X kω k3. material calculation
When j ∉ A ( k ) , Order a k j = x j T u k
If | A|=d, then algorithm stops.
Otherwise γ ^ k = min j ∉ A ( k ) + { C ^ k - c ^ k j / ( A k - a k j ) , ( C ^ k + c ^ k j ) / ( A k + a k j ) }
4. predicated response
γ ~ = m i n γ j > 0 , j ∈ A ( k ) { γ j } , Wherein γ j = - β ^ j / ( s j ω k j ) ; γ ~ 1 = ∞
If &gamma; ~ k < &gamma; ^ k , Then y ^ k = y ^ k - 1 + &gamma; ~ k u k
As .j ∈ A, &beta; ^ j &LeftArrow; &beta; ^ j + &gamma; ~ &omega; k j s j , Otherwise &beta; ^ = 0
A ( k + 1 ) = A ( k ) - { j ~ } , Wherein j ~ = arg m i n j { &gamma; j }
c ^ k + 1 , j = x j T ( y - y ^ k ) , And C ^ k + 1 = m a x j { | c ^ k + 1 , j | } , Return and perform step 1..
Otherwise y ^ k = y ^ k - 1 + &gamma; ^ k u k
As j ∈ A, &beta; ^ j &LeftArrow; &beta; ^ j + &gamma; ^ k &omega; k j s j , Otherwise &beta; ^ j = 0 Return and perform step 1..
(4) according to the wavelength points position retained, only retain the corresponding wavelength row of training set spectrum matrix, obtain new spectrum matrix, and set up partial least squares regression (PLS) model with training set sample measured component concentration vector.Wherein PLS model because of subnumber by Monte Carlo Cross-Validation in conjunction with F inspection determine.Utilize this model, measure the levels of the tested composition of forecast set sample.
Compared with existing Variable Selection, the present invention has that travelling speed is fast, choice variable has repeatable advantage, and can reach with less variable number and better predict the outcome.
Accompanying drawing explanation
Fig. 1: the near infrared light spectrogram of tobacco sample
Fig. 2: tobacco near infrared spectrum data training set carries out residual sum of squares (RSS) (SSR) mean value of 1000 cross validations and the variance variation diagram along with normalized binding occurrence t, and wherein vertical line represents t value corresponding to optimization model
Fig. 3: the regression coefficient β that after tobacco near infrared spectrum data training set carries out LASSO variables choice, all variablees are corresponding
Fig. 4: UVE, MCUVE, RT, LASSO tetra-kinds of Variable Selections retain the distribution plan of variable
Fig. 5: the near infrared light spectrogram of sesame oil and soybean oil, Rice oil triple blend sample
Fig. 6: the spectroscopic data training set of sesame oil and soybean oil, Rice oil triple blend sample carries out residual sum of squares (RSS) (SSR) mean value of 1000 cross validations and the variance variation diagram along with normalized binding occurrence t, and wherein vertical line represents t value corresponding to optimization model
Fig. 7: the regression coefficient β that after sesame oil carries out LASSO variables choice with soybean oil, Rice oil triple blend sample light modal data training set, all variablees are corresponding
Fig. 8: UVE, MCUVE, RT, LASSO tetra-kinds of Variable Selections retain the distribution plan of variable
Embodiment
For better understanding the present invention, below in conjunction with embodiment the present invention done and describe in detail further, but the scope of protection of present invention being not limited to the scope that embodiment represents.
Embodiment 1:
The present embodiment is applied to near-infrared spectrum analysis, measures the content of reducing sugar value in tobacco sample.Concrete step is as follows:
(1) gather the near infrared spectrum data of tobacco leaf sample, use BrukerVector22/N near infrared spectrometer (German Bruker optical instrument company) to test 269 tobacco sheet samples in different tobacco leaf producing region.NIR light spectrum wave-number range is 4000 ~ 9000cm -1, sampling interval is 4 wave numbers, totally 1296 wavelength points, and the near infrared light spectrogram of sample as shown in Figure 1.In tobacco sample, reducing sugar (ReducingSugar) content adopts AAIII type Continuous Flow Analysis instrument (German BranLuebbe company) to measure according to standard method.Before modeling, tobacco leaf sample is divided into two parts at random, comprises training set and forecast set sample, wherein training set sample is used for Modling model, forecast set sample and is used for the predictive ability of testing model.
(2) cross validation is adopted to determine the binding occurrence t of LASSO.T controls the degree of compression, and t is less, and the degree of compression is stronger, and this restrictive condition makes some component of vectorial β become 0, thus reaches the object of variables choice.The present embodiment training set carry out residual sum of squares (RSS) (SSR) mean value of 1000 cross validations and variance along with normalized binding occurrence t change as shown in Figure 2, wherein vertical line represents t value corresponding to optimization model, is 0.103.
(3) the regression coefficient β of LASSO is solved.Utilize minimum angle regression algorithm to solve the regression coefficient β of LASSO, preserve the wavelength points position that regression coefficient is not 0.
&beta; ^ = arg m i n &beta; &Element; R p { ( y - X &beta; ) T ( y - X &beta; ) } s . t . &Sigma; t = 1 p | &beta; t | &le; t
This embodiment regression coefficient β value that after carrying out LASSO variables choice, all variablees are corresponding as shown in Figure 3.
(4) according to the wavelength points position retained, only retain the corresponding wavelength row of training set spectrum matrix, obtain new spectrum matrix.Spectrum matrix and training set sample measured component concentration vector set up partial least squares regression (PLS) model, wherein PLS model because of subnumber by Monte Carlo Cross-Validation in conjunction with F inspection determine.Utilize this model, measure the levels of the tested composition of forecast set sample.This embodiment determine because of subnumber be 8.
UVE, MCUVE, RT, LASSO tetra-kinds of Variable Selections retain the distribution plan of variable as shown in Figure 4.As can be seen from Figure 4, on the one hand, LASSO is roughly the same with the variable range of other three kinds of method choice, which illustrates the rationality of LASSO method choice variable.On the other hand, the variable number that LASSO selects is more less than other three kinds of Variable Selections, and this embodies the superiority of the method.
In order to compare the effect of four kinds of variables choice further, table 1 gives tobacco near infrared data and does not adopt variables choice and set up the modeling effect of PLS model after adopting variables choice.From data in table, LASSO choice variable only 27 is nearly 1/10th of other three kinds of Variable Selections.Computing time 11.89, although slower than the PLS not carrying out variables choice, obviously faster than other Variable Selection.The RMSEP value that LASSO-PLS modeling obtains is minimum, and R value is maximum, illustrates that the method more can improve the precision of prediction of model.Therefore, LASSO-PLS choice variable number compared with other modeling method is few, and computing time is shorter, and precision of prediction is higher.
The results contrast of the different modeling method of table 1 tobacco near infrared data
Embodiment 2:
The present embodiment is applied to near-infrared spectrum analysis, measures the near infrared spectrum data of sesame oil and soybean oil, Rice oil triple blend.Concrete step is as follows:
(1) the NIR light modal data of sesame oil and soybean oil, Rice oil triple blend sample is gathered, use near infrared spectrometer (TJ270-60, Tianjin Tuopu Equipment Co., Ltd.) carry out near infrared spectrum data measurement, wavelength coverage is 800 ~ 2500nm, sampling interval is 1nm, totally 1701 wavelength points.The near infrared light spectrogram of sample as shown in Figure 5.Sample configures by a certain percentage (soybean oil quality 0.05 ~ 2.5, interval 0.05; Rice oil concentration 0.05 ~ 2.5, interval 0.05).Before modeling, sample is divided into two parts at random, comprises training set and forecast set sample, wherein training set sample is used for Modling model, forecast set sample and is used for the predictive ability of testing model.
(2) cross validation is adopted to determine the binding occurrence t of LASSO.T controls the degree of compression, and t is less, and the degree of compression is stronger, and this restrictive condition makes some component of vectorial β become 0, thus reaches the object of variables choice.This embodiment training set carry out residual sum of squares (RSS) (SSR) mean value of 1000 cross validations and variance along with normalized binding occurrence t variation diagram as shown in Figure 6, wherein vertical line represents t value corresponding to optimization model is 0.254.
(3) the regression coefficient β of LASSO is solved.Utilize minimum angle regression algorithm to solve the regression coefficient β of LASSO, preserve the wavelength points position that regression coefficient is not 0.
&beta; ^ = arg m i n &beta; &Element; R p { ( y - X &beta; ) T ( y - X &beta; ) } s . t . &Sigma; t = 1 p | &beta; t | &le; t
The regression coefficient β value that after this embodiment training set carries out LASSO variables choice, all variablees are corresponding as shown in Figure 7.
(4) according to the wavelength points position retained, only retain the corresponding wavelength row of training set spectrum matrix, obtain new spectrum matrix.Spectrum matrix and training set sample measured component concentration vector set up partial least squares regression (PLS) model, wherein PLS model because of subnumber by Monte Carlo Cross-Validation in conjunction with F inspection determine.Utilize this model, measure the levels of the tested composition of forecast set sample.This embodiment determine because of subnumber be 8.
UVE, MCUVE, RT, LASSO tetra-kinds of Variable Selections retain the distribution plan of variable as shown in Figure 8.As can be seen from Figure 8, LASSO is roughly the same with the variable range of other three kinds of method choice, which illustrates the rationality of LASSO method choice variable.On the other hand, the variable number that LASSO selects is more less than other three kinds of Variable Selections, and this embodies the superiority of the method.
In order to compare the effect of four kinds of variables choice further, table 2 gives sesame oil and soybean oil, Rice oil triple blend near infrared spectrum data do not adopt variables choice and set up the modeling effect of PLS model after adopting variables choice.From data in table, LASSO choice variable only 11, far less than the variable that other Variable Selections are selected.2.48 seconds computing times, obviously faster than other Variable Selection.The RMSEP value that LASSO-PLS modeling obtains is minimum, and R value is maximum.Therefore, LASSO-PLS choice variable number compared with other modeling method is few, and computing time is shorter, and precision of prediction is higher.
The results contrast of the different modeling method of table 2 vegetable oil NIR data

Claims (4)

1., based on a near infrared spectrum Variable Selection of LASSO, it is characterized in that comprising following steps:
1) gather the near infrared spectrum data of measured object sample, measure the measured component concentration content of sample in training set by conventional method, adopt certain packet mode that data are divided into training set and forecast set;
2) the binding occurrence t. of LASSO is determined;
3) minimum angle regression algorithm is utilized to solve the regression coefficient β of LASSO;
4) by training set spectrum Matrix Regression factor beta be not 0 wavelength row set up partial least squares regression (PLS) model with concentration vector, utilize this model, the content of prediction unknown sample composition.
2. a kind of near infrared spectrum Variable Selection based on LASSO according to claim 1, is characterized in that: the described detailed process utilizing minimum angle regression algorithm to solve the regression coefficient β of LASSO is:
1. Renewal model is selected in variables set (activeset), calculates related coefficient absolute value
y ^ 0 = 0 ; c ^ k j = x j I ( y - y ^ k - 1 ) ; C ^ k = max { | c ^ k j | }
Upgrade activesetA (k)
A ( k ) = A ( k - 1 ) + { j ^ } ; A(0)=φ j ^ = arg min j &NotElement; A ( k - 1 ) { | c ^ k j | }
2. minimum angular direction (u is determined k)
Make X k=(... s jx j...) j ∈ A (k)
Wherein s j = si g n { c ^ k j } , &omega; k = A k ( X k I X ) - 1 1 k , A k = ( 1 k T ( X k T X ) - 1 1 k ) - 0.5
1 kbe important be 1 vector, its length equals | A|
Calculate minimum angular direction: u k=X kω k
3. material calculation
When j &NotElement; A ( k ) , Order a k j = x j T u k
If | A|=d, then algorithm stops
Otherwise &gamma; ^ k = min j &NotElement; A ( k ) + { C ^ k - c ^ k j / ( A k - a k j ) , ( C ^ k + c ^ k j ) / ( A k + a k j ) }
4. predicated response
&gamma; ~ = m i n &gamma; j > 0 , j &Element; A ( k ) { &gamma; j } , Wherein &gamma; j = - &beta; ^ j / ( s j &omega; k j ) ; &gamma; ~ l = &infin;
If &gamma; ~ k < &gamma; ^ k , Then y ^ k = y ^ k - 1 + &gamma; ^ k u k
As j ∈ A, &beta; ^ j &LeftArrow; &beta; ^ j + &gamma; ~ &omega; kj s j , Otherwise &beta; ^ = 0
A ( k + 1 ) = A ( k ) - { j ~ } , Wherein j ~ = argmin j { &gamma; j }
c ^ k + 1 j = x j I ( y - y ^ k ) , And C ^ k + 1 = max j { | c ^ k + 1 j | } , Return and perform step 1.
Otherwise y ^ k = y ^ k - 1 + &gamma; ^ k u k
As j ∈ A, otherwise return and perform step 1..
3. a kind of near infrared spectrum Variable Selection based on LASSO according to claim 1, it is characterized in that: the defining method of the binding occurrence t. of described LASSO is cross validation, t controls the degree of compression, t is less, the degree of compression is stronger, this restrictive condition makes some component of vectorial β become 0, thus reaches the object of variables choice.
4. a kind of near infrared spectrum Variable Selection based on LASSO according to claim 1, is characterized in that: described PLS model because of subnumber defining method be that Monte Carlo Cross-Validation is checked in conjunction with F.
CN201510581659.5A 2015-09-11 2015-09-11 Near infrared spectrum variable selection method based on LASSO Pending CN105203498A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510581659.5A CN105203498A (en) 2015-09-11 2015-09-11 Near infrared spectrum variable selection method based on LASSO

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510581659.5A CN105203498A (en) 2015-09-11 2015-09-11 Near infrared spectrum variable selection method based on LASSO

Publications (1)

Publication Number Publication Date
CN105203498A true CN105203498A (en) 2015-12-30

Family

ID=54951293

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510581659.5A Pending CN105203498A (en) 2015-09-11 2015-09-11 Near infrared spectrum variable selection method based on LASSO

Country Status (1)

Country Link
CN (1) CN105203498A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529008A (en) * 2016-11-01 2017-03-22 天津工业大学 Double-integration partial least square modeling method based on Monte Carlo and LASSO
CN106950193A (en) * 2017-05-24 2017-07-14 长春理工大学 Based on the near infrared spectrum Variable Selection that cluster analysis is combined from weight variable
CN107356556A (en) * 2017-07-10 2017-11-17 天津工业大学 A kind of double integrated modelling approach of Near-Infrared Spectra for Quantitative Analysis
CN108763673A (en) * 2018-05-16 2018-11-06 广东省生态环境技术研究所 The Driving forces of land use change screening technique and device returned based on LASSO
CN108827905A (en) * 2018-04-08 2018-11-16 江南大学 A kind of near-infrared model online updating method based on local weighted Lasso
CN109193635A (en) * 2018-09-29 2019-01-11 清华大学 A kind of power distribution network topological structure method for reconstructing based on adaptive sparse homing method
CN109459408A (en) * 2017-09-06 2019-03-12 盐城工学院 A kind of Near-Infrared Quantitative Analysis method based on sparse regression LAR algorithm
CN110657890A (en) * 2018-06-29 2020-01-07 唯亚威通讯技术有限公司 Cross-validation based calibration of spectral models
WO2023123329A1 (en) * 2021-12-29 2023-07-06 安徽大学 Method and system for extracting net signal in near-infrared spectrum
CN117033993A (en) * 2022-04-29 2023-11-10 华东交通大学 Method for selecting optimal training set based on minimum angle ordering

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002010726A2 (en) * 2000-08-01 2002-02-07 Sensys Medical, Inc. Combinative multivariate calibration that enhances prediction ability through removal of over-modeled regions
CN102305772A (en) * 2011-07-29 2012-01-04 江苏大学 Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method
CN104502306A (en) * 2014-12-09 2015-04-08 西北师范大学 Near infrared spectrum wavelength selecting method based on variable significance

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002010726A2 (en) * 2000-08-01 2002-02-07 Sensys Medical, Inc. Combinative multivariate calibration that enhances prediction ability through removal of over-modeled regions
CN102305772A (en) * 2011-07-29 2012-01-04 江苏大学 Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method
CN104502306A (en) * 2014-12-09 2015-04-08 西北师范大学 Near infrared spectrum wavelength selecting method based on variable significance

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
WANG CHANGYUE ET AL.: "Rapid compositional analysis of sawdust using sparse method and near infrared spectroscopy", 《26TH CHINESE CONTROL AND DECISION CONFERENCE》 *
柯郑林: "Lasso及其相关方法在多元线性回归模型中的应用", 《中国优秀硕士学位论文全文数据库 基础科学辑》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529008B (en) * 2016-11-01 2019-11-26 天津工业大学 A kind of double integrated offset minimum binary modeling methods based on Monte Carlo and LASSO
CN106529008A (en) * 2016-11-01 2017-03-22 天津工业大学 Double-integration partial least square modeling method based on Monte Carlo and LASSO
CN106950193A (en) * 2017-05-24 2017-07-14 长春理工大学 Based on the near infrared spectrum Variable Selection that cluster analysis is combined from weight variable
CN106950193B (en) * 2017-05-24 2019-04-26 长春理工大学 Based on the near infrared spectrum Variable Selection from weight variable combination cluster analysis
CN107356556A (en) * 2017-07-10 2017-11-17 天津工业大学 A kind of double integrated modelling approach of Near-Infrared Spectra for Quantitative Analysis
CN109459408A (en) * 2017-09-06 2019-03-12 盐城工学院 A kind of Near-Infrared Quantitative Analysis method based on sparse regression LAR algorithm
CN108827905A (en) * 2018-04-08 2018-11-16 江南大学 A kind of near-infrared model online updating method based on local weighted Lasso
CN108827905B (en) * 2018-04-08 2020-07-24 江南大学 near-infrared model online updating method based on local weighting L asso
CN108763673A (en) * 2018-05-16 2018-11-06 广东省生态环境技术研究所 The Driving forces of land use change screening technique and device returned based on LASSO
CN108763673B (en) * 2018-05-16 2021-11-23 广东省科学院生态环境与土壤研究所 Land use change driving force screening method and device based on LASSO regression
CN110657890A (en) * 2018-06-29 2020-01-07 唯亚威通讯技术有限公司 Cross-validation based calibration of spectral models
CN110657890B (en) * 2018-06-29 2022-07-05 唯亚威通讯技术有限公司 Cross-validation based calibration of spectral models
US11719628B2 (en) 2018-06-29 2023-08-08 Viavi Solutions Inc. Cross-validation based calibration of a spectroscopic model
CN109193635A (en) * 2018-09-29 2019-01-11 清华大学 A kind of power distribution network topological structure method for reconstructing based on adaptive sparse homing method
CN109193635B (en) * 2018-09-29 2020-09-11 清华大学 Power distribution network topological structure reconstruction method based on self-adaptive sparse regression method
WO2023123329A1 (en) * 2021-12-29 2023-07-06 安徽大学 Method and system for extracting net signal in near-infrared spectrum
CN117033993A (en) * 2022-04-29 2023-11-10 华东交通大学 Method for selecting optimal training set based on minimum angle ordering

Similar Documents

Publication Publication Date Title
CN105203498A (en) Near infrared spectrum variable selection method based on LASSO
CN104089911B (en) Spectral model transmission method based on one-variable linear regression
CN104062257B (en) A kind of based on the method for general flavone content near infrared ray solution
CN104990894B (en) A kind of gasoline property detection method based on weighting absorbance and similar sample
CN101825567A (en) Screening method for near infrared spectrum wavelength and Raman spectrum wavelength
CN106529008B (en) A kind of double integrated offset minimum binary modeling methods based on Monte Carlo and LASSO
CN101413885A (en) Near-infrared spectrum method for rapidly quantifying honey quality
CN101520412A (en) Near infrared spectrum analyzing method based on isolated component analysis and genetic neural network
CN109324013A (en) A method of it is quickly analyzed using Gaussian process regression model building oil property near-infrared
CN105486655A (en) Rapid detection method for organic matters in soil based on infrared spectroscopic intelligent identification model
CN107958267A (en) A kind of oil property Forecasting Methodology represented based on linear
CN103344572A (en) Method for evaluating blending homogeneity of cut rolled stems and regenerated cut tobaccos in cigarettes
CN102937575B (en) Watermelon sugar degree rapid modeling method based on secondary spectrum recombination
CN108956583A (en) Characteristic spectral line automatic selecting method for laser induced breakdown spectroscopy analysis
CN102128805A (en) Method and device for near infrared spectrum wavelength selection and quick quantitative analysis of fruit
CN107860743A (en) Utilize the method and its application of the model of reflective near infrared fibre-optical probe structure fast prediction oil property
CN108693139A (en) The near infrared prediction model method for building up of electronics tobacco tar physical and chemical index and application
CN106525755A (en) Oil-sand pH value testing method based on near infrared spectroscopy technology
CN105738311A (en) Apple sweetness non-damage quick detection method based on near-infrared spectrum technology
CN103398971A (en) Chemometrics method for determining cetane number of diesel oil
CN106126879B (en) A kind of soil near-infrared spectrum analysis prediction technique based on rarefaction representation technology
CN105223140A (en) The method for quickly identifying of homology material
CN108663334B (en) Method for searching spectral characteristic wavelength of soil nutrient based on multi-classifier fusion
CN105954228A (en) Method for measuring content of sodium metal in oil sand based on near infrared spectrum
CN103105369A (en) Quantitative liquid analysis method by spectrum baseline correction

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20151230