WO2020232959A1 - Near infrared spectral feature extraction method and system based on functional principal component analysis - Google Patents

Near infrared spectral feature extraction method and system based on functional principal component analysis Download PDF

Info

Publication number
WO2020232959A1
WO2020232959A1 PCT/CN2019/111602 CN2019111602W WO2020232959A1 WO 2020232959 A1 WO2020232959 A1 WO 2020232959A1 CN 2019111602 W CN2019111602 W CN 2019111602W WO 2020232959 A1 WO2020232959 A1 WO 2020232959A1
Authority
WO
WIPO (PCT)
Prior art keywords
infrared spectrum
function
principal component
formula
band
Prior art date
Application number
PCT/CN2019/111602
Other languages
French (fr)
Chinese (zh)
Inventor
潘天红
李浩然
陈山
邹小波
Original Assignee
安徽大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 安徽大学 filed Critical 安徽大学
Publication of WO2020232959A1 publication Critical patent/WO2020232959A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/3563Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light for analysing solids; Preparation of samples therefor
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/359Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light using near infrared light
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/3563Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light for analysing solids; Preparation of samples therefor
    • G01N2021/3572Preparation of samples, e.g. salt matrices

Definitions

  • the invention relates to the technical field of near-infrared spectroscopy non-destructive analysis, and more specifically to a near-infrared spectroscopy feature extraction method and system based on functional principal component analysis.
  • near-infrared light has good transmission characteristics in conventional optical fibers, and its instruments are simple, fast analysis, non-destructive and small sample preparation, it is almost suitable for all kinds of samples (liquid, viscous, coating, powder and Solid) analysis, multi-component multi-channel simultaneous determination, etc., have been widely used in many fields including agriculture and animal husbandry, food, chemical industry, petrochemical, pharmaceutical, tobacco, etc., providing a great opportunity for scientific research, teaching and production process control. Broad use space.
  • X hydrogen-containing group
  • Different groups such as methyl, methylene, benzene ring, etc.
  • Material quality parameters (such as component content) are also related to their composition and structure information.
  • the application of chemometric methods to correlate the two can determine the qualitative or The quantitative relationship is: the calibration model. After the calibration model is established, as long as the near-infrared spectrum of the unknown sample is measured, the quality parameters of the sample can be predicted based on the calibration model.
  • the near-infrared spectroscopy data has the characteristics of high dimensionality and band overlap, which brings a certain degree of difficulty and challenge to extracting the key principal component information of the sample.
  • How to realize the feature mapping relationship from high-dimensional space to low-dimensional space so as to facilitate the extraction of the key principal component information of sample spectral data is a technical problem to be solved urgently.
  • PCA principal component analysis
  • LDA linear discriminant analysis
  • GA genetics Algorithm
  • UVE Uniformative Variable Elimination
  • iPLS Interval Partial Least Squares
  • SPA Successive Projections Algorithm
  • the dimensionality reduction algorithm commonly used in the prior art only starts from the spectral data itself, that is, the discrete points of the spectral data, to realize feature mapping from a high-dimensional space to a low-dimensional space.
  • the internal structure of spectral data presents a "functional type", which is continuous.
  • the dimensionality reduction algorithm in the prior art will result in a lot of potential feature information, such as derivative, order and other feature information, which cannot be mined.
  • the technical problem to be solved by the present invention is to provide a near-infrared spectrum feature extraction method and system based on functional principal component analysis to solve the problem that the prior art dimensionality reduction algorithm in the background art cannot obtain feature information such as derivative and order.
  • the present invention provides the following technical solutions:
  • a feature extraction method of near infrared spectroscopy based on functional principal component analysis including the following steps:
  • the feature mapping from high dimensionality to low latitude is completed, and at the same time, further information such as the order and derivative of the intrinsic function of the spectral data can be mined.
  • the step S1 includes:
  • the spectrometer adopts the NIRQuest512 near-infrared spectrometer produced by Ocean Optics in the United States. It is equipped with HL-2000 series halogen lamp light source with a wavelength range of 360nm-2000nm.
  • the resolution of the spectrometer is 3cm -1 , the integration time is 45s, and the scanning wavelength range is 900 -1700nm, built-in Hamamatsu indium gallium arsenide (InGaAs) array detector with 512 pixels, high stability, 32 scans;
  • the collected samples to be tested are several dry slices of wild matsutake, Agaricus blazei, old man's head, and Pleurotus eryngii, and spectral sampling is performed on the dry slices.
  • the step S3 includes:
  • ⁇ k (t) is the k-th B-spline basis function of the near-infrared spectrum band, k is less than or equal to m, m represents the number of B-spline basis functions, C is the coefficient matrix, and X(t) is the near-infrared spectrum
  • t is the band of the near-infrared spectrum
  • represents the summation function
  • PEN 2 (X) represents the rough penalty
  • DDX(t) represents the second derivative of the function X(t);
  • x is the observation data of the j-th near-infrared spectrum
  • x j is the observation data of the j-th near-infrared spectrum
  • j is a positive integer less than m
  • c) represents the sum of squared residuals to minimize
  • t j represents the j-th near-infrared spectrum band
  • k ⁇ K ⁇ j ⁇ k (t j ) represents the B-spline basis function of the j-th near-infrared spectrum band
  • the coefficient matrix C is estimated, and the rough penalty is used to smooth the function, which effectively avoids the phenomenon of over-fitting.
  • the step S4 includes:
  • the centralization formula is as follows:
  • i is the sample number
  • n is the total number of samples
  • X i (t) is the function of the i-th NIR spectral band t
  • c represents centralization
  • st represents the condition function
  • the step S5 includes:
  • the covariance calculation formula is as follows:
  • V(s, t) represents the covariance of two different bands of s and t.
  • the step S6 includes:
  • ⁇ j (t) is the pivot weight function of the j-th band t
  • ⁇ j (s) represents the pivot weight function of the j-th band s
  • j is a positive integer
  • ⁇ j is the eigenvalue
  • st is the condition Function
  • the step S7 includes:
  • M represents the number of pivots
  • the threshold is set to 90%.
  • the step S8 includes:
  • f j is a function The j-th pivot.
  • the present invention also provides a system adopting the near-infrared spectrum feature extraction method based on functional principal component analysis according to any one of the above solutions, including:
  • Acquisition module used to collect near-infrared spectrum data of various samples
  • a preprocessing module for preprocessing the near-infrared spectrum data using standard normal transformation
  • the acquisition module is used to acquire the spline function of the processed near-infrared spectrum data
  • Centralized processing module used to centralize the spline function
  • the covariance module is used to calculate the covariance of the centered spline function between different band functions
  • the eigenvalue module is used to calculate the jth eigenvalue of the covariance
  • the contribution degree module is used to calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum band;
  • the principal component score module is used to use the characteristic values of the near-infrared spectrum bands to calculate the functional principal component scores in the equations of different bands.
  • the present invention treats near-infrared spectroscopy data as a continuous function and utilizes full-band information. While accurately extracting band features with effective information, the feature values are obtained through the covariance of different band functions, and through the covariance features Calculate the contribution degree of the value to obtain the characteristic value of the near-infrared spectrum band, thereby obtaining the function-shaped principal component scores in the equations of different bands, and realize further mining of the order and derivative (rate of change, slope, curvature, etc.) information of the intrinsic function of the spectrum data;
  • the present invention obtains the function-shaped principal component scores in the equations of different bands, and at the same time obtains the rate of change, slope, curvature and other information, thereby enhancing the robustness of the calibration model and improving the predictive ability of the calibration model.
  • Infrared spectrum data provides a new feature extraction method, which has high practical value.
  • FIG. 1 is a schematic flowchart of a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention.
  • FIG. 2 is a schematic diagram of B-spline basis functions in a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention
  • FIG. 3 is a schematic diagram of the functional description of the near-infrared spectra of edible fungi in a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention
  • FIG. 4 is a load distribution diagram of functional principal component analysis of edible fungi in a near-infrared spectral feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention
  • FIG. 5 is a diagram of two principal component analysis results of four kinds of edible fungi spectral data in a near-infrared spectral feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention.
  • a feature extraction method of near-infrared spectroscopy based on functional principal component analysis uses near-infrared spectroscopy analysis to distinguish different types of edible fungi, and obtain orders and derivatives (rate of change, slope, curvature, etc.) Information;
  • the present invention is not limited to screening the types of edible fungi. The steps are as follows:
  • the nutrient ingredients include: protein, fat and a variety of amino acids; the spectrometer adopts the NIRQuest512 near-infrared spectrometer produced by Ocean Optics in the United States, with wavelength
  • the range of HL-2000 series halogen tungsten light source is 360nm-2000nm, the resolution of the spectrometer is 3cm -1 , the integration time is 45s, the scanning wavelength range is: 900-1700nm, built-in Hamamatsu with 512 pixels and high stability Indium gallium arsenide (InGaAs) array detector with 32 scans;
  • the collected samples to be tested are wild matsutake, Agaricus blazei, old man's head, and Pleurotus eryngii, a total of 166 dry slices, and spectrum sampling is performed on 166 dry slices;
  • Fig. 2 is a schematic diagram of the B-spline basis function in a near-infrared spectrum feature extraction method based on functional principal component analysis according to an embodiment of the present invention.
  • the B-spline function is a seventh-order spline function. 21 basis functions, using seven B-spline functions, and using formulas to describe the near-infrared spectrum data of each sample as a function, the formula is as follows:
  • ⁇ k (t) is the k-th B-spline basis function of the near-infrared spectral band
  • k is less than or equal to m
  • m represents the number of corresponding B-spline basis functions
  • C is the coefficient matrix
  • X(t) is the near
  • t is the band of the near-infrared spectrum
  • represents the summation function, so that the near-infrared spectrum data can use a set of selected basis functions ⁇ k (t), using the basis function ⁇ k (t ) Linear combination description;
  • PEN 2 (X) represents the rough penalty
  • DDX(t) represents the second derivative of the function X(t);
  • x j is the observation data of the j-th near-infrared spectrum
  • c) represents the sum of squared residuals of the minimization
  • t j represents the band of the j-th near-infrared spectrum
  • k ⁇ K ⁇ j ⁇ k (t j ) is expressed as the B-spline basis function of the j-th near-infrared spectral band
  • Equation (4) can be estimated by the least square method, but the least square method estimation is susceptible to noise and over-fitting. Therefore, the rough penalty is used to smooth the function, that is, the integral of the second derivative square is used to control the function The smoothness of the curve, then, the residual sum of squares and the roughness penalty are combined to estimate the coefficient matrix C, where ⁇ is the smoothness coefficient, the larger the ⁇ , the closer the fitting is to the straight line, and the fitting is given For some points in space, find a continuous surface with known form and unknown parameters to approximate these points as much as possible;
  • FIG. 3 is a schematic diagram of functional description of the near-infrared spectrum of edible fungi in a method for extracting near-infrared spectrum features based on functional principal component analysis according to an embodiment of the present invention.
  • the difference between near-infrared spectroscopy data requires centralized processing of the functions of near-infrared spectroscopy data.
  • the formula for centralized processing is as follows:
  • i is the sample number
  • n is the total number of samples
  • X i (t) is the function of the i-th NIR spectral band t
  • c represents centralization
  • st represents the condition function
  • V(s,t) represents the covariance between two different band functions, Represents the function of the near-infrared spectrum band s of the i-th sample after the centralized processing;
  • ⁇ j (t) is the j-th pivot weight function of band t
  • ⁇ j (s) is the j-th pivot weight function of band s
  • j is a positive integer
  • ⁇ j is the eigenvalue
  • st is the condition Function
  • FIG. 4 is a functional principal component analysis based on the embodiment of the present invention.
  • the threshold in this embodiment is set to 90%
  • FIG. 5 is a diagram of two principal component analysis results of four kinds of edible fungi spectral data in a near-infrared spectrum feature extraction method based on functional principal component analysis provided by an embodiment of the present invention.
  • the derivative information includes the rate of change, slope, curve, etc.
  • the four edible fungi are wild Matsutake, Ji For matsutake, old man's head, and Pleurotus eryngii, the abscissa in Figure 5 represents the first pivot and the ordinate represents the second pivot.
  • a near-infrared spectrum feature extraction system based on functional principal component analysis including:
  • Acquisition module used to collect near-infrared spectrum data of various samples
  • a preprocessing module for preprocessing the near-infrared spectrum data using standard normal transformation
  • the acquisition module is used to acquire the spline function of the processed near-infrared spectrum data
  • Centralized processing module used to centralize the spline function
  • the covariance module is used to calculate the covariance of the centered spline function between different band functions
  • the eigenvalue module is used to calculate the jth eigenvalue of the covariance
  • the contribution degree module is used to calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum band;
  • the principal component score module is used to use the characteristic values of the near-infrared spectrum bands to calculate the functional principal component scores in the equations of different bands.
  • installed should be interpreted broadly. For example, they may be fixedly connected, detachably connected, or integrally connected. Connection; it can be a mechanical connection or an electrical connection; it can be directly connected, or indirectly connected through an intermediate medium, and it can be the internal communication between two components.
  • connected installed
  • connected can be a mechanical connection or an electrical connection
  • it can be directly connected, or indirectly connected through an intermediate medium, and it can be the internal communication between two components.
  • specific meaning of the above-mentioned terms in the present invention can be understood in specific situations.

Landscapes

  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

A near infrared spectral feature extraction method based on functional principal component analysis, and an extraction system thereof. The extraction method comprises steps of: S1, acquiring near infrared spectrum data in multiple samples; S2, preprocessing the near infrared spectrum data using standard normal transformation; S3, obtaining a spline function of the processed near infrared spectrum data; S4, centralizing the spline function; S5, calculating the covariance of the centralized spline function in different waveband functions; S6, calculating the j-th feature value of the covariance; S7, calculating cumulative contribution; and S8, calculating scores of functional principal components in equations of different wavebands.

Description

基于函数性主元分析的近红外光谱特征提取方法和系统Near-infrared spectrum feature extraction method and system based on functional principal component analysis 技术领域Technical field
本发明涉及近红外光谱无损分析技术领域,更具体涉及基于函数形主元分析的近红外光谱特征提取方法和系统。The invention relates to the technical field of near-infrared spectroscopy non-destructive analysis, and more specifically to a near-infrared spectroscopy feature extraction method and system based on functional principal component analysis.
背景技术Background technique
由于近红外光在常规光纤中有良好的传输特性,且其仪器较简单、分析速度快、非破坏性和样品制备量小、几乎适合各类样品(液体、粘稠体、涂层、粉末和固体)分析、多组分多通道同时测定等特点,已广泛应用于包括农牧、食品、化工、石化、制药、烟草等在内的诸多领域,为科研、教学以及生产过程控制提供了一个十分广阔的使用空间。Because near-infrared light has good transmission characteristics in conventional optical fibers, and its instruments are simple, fast analysis, non-destructive and small sample preparation, it is almost suitable for all kinds of samples (liquid, viscous, coating, powder and Solid) analysis, multi-component multi-channel simultaneous determination, etc., have been widely used in many fields including agriculture and animal husbandry, food, chemical industry, petrochemical, pharmaceutical, tobacco, etc., providing a great opportunity for scientific research, teaching and production process control. Broad use space.
近红外光谱主要是由于分子振动的非谐振性使分子振动从基态向高能级跃迁时产生的,记录的主要是含氢基团X-H(X=C、N、O)振动的倍频和合频吸收。不同团(如:甲基、亚甲基,苯环等)或同一基团在不同化学环境中的近红外吸收波长与强度都有明显差别,近红外光谱具有丰富的结构和组成信息,非常适合用于碳氢有机物质的组成与性质测量,而物质质量参数(如成分含量)也与其组成和结构信息相关,应用化学计量学方法对两者进行关联,就可以确定这两者间的定性或定量关系,即:定标模型,建立定标模型后,只要测出未知样品的近红外光谱,根据定标模型就可以预测样本的质量参数。Near-infrared spectroscopy is mainly due to the non-resonance of molecular vibration when the molecular vibration transitions from the ground state to the higher energy level, and the recording is mainly the frequency double and combined frequency absorption of the hydrogen-containing group XH (X=C, N, O) vibration . Different groups (such as methyl, methylene, benzene ring, etc.) or the same group have obvious differences in the near-infrared absorption wavelength and intensity in different chemical environments. Near-infrared spectroscopy has rich structure and composition information, which is very suitable It is used to measure the composition and properties of hydrocarbon organic substances. Material quality parameters (such as component content) are also related to their composition and structure information. The application of chemometric methods to correlate the two can determine the qualitative or The quantitative relationship is: the calibration model. After the calibration model is established, as long as the near-infrared spectrum of the unknown sample is measured, the quality parameters of the sample can be predicted based on the calibration model.
然而,由于近红外光谱数据具有高维、谱带重叠等特征,给提取样品的关键主元信息带来了一定程度的困难和挑战。如何实现高维空间到低维空间的特征映射关系,从而方便提取样品光谱数据的关键主元信息是亟待解决的技术问题。近年来,为了解决高维光谱数据降维问题,国内外相继出现大量的降维算法,如:主成分分析(Principal Component Analysis,即PCA)、线性识别分析(linear discriminant analysis,即LDA)、遗传算法(Genetic  Algorithm,即GA)、无信息变量消除法(Uniformative Variable Elimination,即UVE)、间隔偏最小二乘法(interval Partial Least Squares,即iPLS)、连续投影算法(Successive Projections Algorithm,即SPA)等。上述方法各有特性,但也存在各自的不足,如主成分分析是基于线性统计方法建立的,在解决非线性相关及校正样本分布不均匀的问题时,其结果往往不可靠;遗传算法采用随机进化的方法,其选择、交叉和变异算子往往根据经验,调参过程比较繁琐,此外,其适应度函数选取也非常重要,不同的适应度函数,其结果将会有较大不同。However, the near-infrared spectroscopy data has the characteristics of high dimensionality and band overlap, which brings a certain degree of difficulty and challenge to extracting the key principal component information of the sample. How to realize the feature mapping relationship from high-dimensional space to low-dimensional space so as to facilitate the extraction of the key principal component information of sample spectral data is a technical problem to be solved urgently. In recent years, in order to solve the problem of high-dimensional spectral data dimensionality reduction, a large number of dimensionality reduction algorithms have emerged at home and abroad, such as: principal component analysis (PCA), linear discriminant analysis (LDA), genetics Algorithm (Genetic Algorithm, GA), Uniformative Variable Elimination (UVE), Interval Partial Least Squares (iPLS), Successive Projections Algorithm (SPA), etc. . The above methods have their own characteristics, but they also have their own shortcomings. For example, the principal component analysis is based on the linear statistical method. When solving the problems of nonlinear correlation and correcting uneven sample distribution, the results are often unreliable; genetic algorithms use random In the evolutionary method, the selection, crossover and mutation operators are often based on experience, and the parameter tuning process is relatively cumbersome. In addition, the selection of its fitness function is also very important. Different fitness functions will have quite different results.
但是,现有技术常用的降维算法仅从光谱数据的本身出发,即光谱数据的离散点出发,实现从高维空间到低维空间的特征映射。实际上,光谱数据的内在结构呈现“函数型”,该“函数型”是连续性的。而现有技术中的降维算法会导致很多潜在特征信息无法被挖掘出来,如:导数、阶次等特征信息。However, the dimensionality reduction algorithm commonly used in the prior art only starts from the spectral data itself, that is, the discrete points of the spectral data, to realize feature mapping from a high-dimensional space to a low-dimensional space. In fact, the internal structure of spectral data presents a "functional type", which is continuous. However, the dimensionality reduction algorithm in the prior art will result in a lot of potential feature information, such as derivative, order and other feature information, which cannot be mined.
发明内容Summary of the invention
本发明所要解决的技术问题在于提供基于函数性主元分析的近红外光谱特征提取方法和系统,以解决上述背景技术中现有技术的降维算法无法获取导数、阶次等特征信息的问题。The technical problem to be solved by the present invention is to provide a near-infrared spectrum feature extraction method and system based on functional principal component analysis to solve the problem that the prior art dimensionality reduction algorithm in the background art cannot obtain feature information such as derivative and order.
为解决上述技术问题,本发明提供如下技术方案:In order to solve the above technical problems, the present invention provides the following technical solutions:
一种基于函数性主元分析的近红外光谱特征提取方法,包括如下步骤:A feature extraction method of near infrared spectroscopy based on functional principal component analysis, including the following steps:
S1、采集多种样本中近红外光谱的数据;S1. Collect near-infrared spectroscopy data in various samples;
S2、采用标准正态变换对所述近红外光谱的数据进行预处理;S2. Preprocessing the near-infrared spectroscopy data by using standard normal transformation;
S3、获取处理后近红外光谱数据的样条函数;S3. Obtain the spline function of the processed near-infrared spectrum data;
S4、对所述样条函数进行中心化处理;S4. Perform centralization processing on the spline function;
S5、计算中心化处理后的样条函数在不同波段函数之间的协方差;S5. Calculate the covariance of the centered spline function between different waveband functions;
S6、计算协方差的第第j个特征值;S6. Calculate the j-th eigenvalue of the covariance;
S7、通过特征值,计算累计贡献度,贡献度超过阈值的主元作为近红外光谱的特征值;S7. Calculate the cumulative contribution degree through the characteristic value, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum;
S8、依据特征值,计算不同波段的方程中函数形主元得分。S8. Calculate the scores of the functional principal components in the equations of different bands according to the eigenvalues.
通过本方法在完成高维降低到低纬的特征映射,同时能够实现进一步的挖掘光谱数据内在函数的阶次,导数等信息。Through this method, the feature mapping from high dimensionality to low latitude is completed, and at the same time, further information such as the order and derivative of the intrinsic function of the spectral data can be mined.
作为本发明进一步的方案:所述步骤S1包括:As a further solution of the present invention: the step S1 includes:
采集待测样品的近红外光谱数据,并通过理化试验测定主要营养成分的含量;营养成分包括:蛋白质、脂肪和多种氨基酸。Collect near-infrared spectroscopy data of the sample to be tested, and determine the content of the main nutrients through physical and chemical tests; the nutrients include: protein, fat and a variety of amino acids.
光谱仪采用美国Ocean Optics公司生产的NIRQuest512型近红外光谱仪,配置波长范围为360nm-2000nm的HL-2000系列卤钨灯光源,光谱仪分辨率为3cm -1,积分时间为45s,扫描波长范围为:900-1700nm,内置具有512个像素点、稳定性高的滨松铟镓砷化物(InGaAs)阵列探测器,扫描次数为32次; The spectrometer adopts the NIRQuest512 near-infrared spectrometer produced by Ocean Optics in the United States. It is equipped with HL-2000 series halogen lamp light source with a wavelength range of 360nm-2000nm. The resolution of the spectrometer is 3cm -1 , the integration time is 45s, and the scanning wavelength range is 900 -1700nm, built-in Hamamatsu indium gallium arsenide (InGaAs) array detector with 512 pixels, high stability, 32 scans;
采集的待测样品分别为野生松茸、姬松茸、老人头、杏鲍菇的若干切片干样,对所述切片干样进行光谱采样。The collected samples to be tested are several dry slices of wild matsutake, Agaricus blazei, old man's head, and Pleurotus eryngii, and spectral sampling is performed on the dry slices.
作为本发明进一步的方案:所述步骤S3包括:As a further solution of the present invention: the step S3 includes:
S31、利用公式,获取各个样品的近红外光谱数据的B样条函数,所述公式如下:S31. Obtain the B-spline function of the near-infrared spectrum data of each sample by using a formula, the formula is as follows:
Figure PCTCN2019111602-appb-000001
Figure PCTCN2019111602-appb-000001
其中,φ k(t)为近红外光谱波段的第k个B样条基函数,k小于等于m,m表示B样条基函数的数量,C为系数矩阵,X(t)为近红外光谱数据的函数形式,t为近红外光谱的波段,∑表示求和函数; Among them, φ k (t) is the k-th B-spline basis function of the near-infrared spectrum band, k is less than or equal to m, m represents the number of B-spline basis functions, C is the coefficient matrix, and X(t) is the near-infrared spectrum The function form of the data, t is the band of the near-infrared spectrum, and ∑ represents the summation function;
S32、利用公式对X(t)函数进行光滑处理,所述公式如下:S32. Use a formula to smooth the X(t) function. The formula is as follows:
PEN 2(X)=∫[DDX(t)] 2dt   (2) PEN 2 (X)=∫[DDX(t)] 2 dt (2)
其中,PEN 2(X)表示粗糙惩罚,DDX(t)表示函数X(t)的二阶导数; Among them, PEN 2 (X) represents the rough penalty, and DDX(t) represents the second derivative of the function X(t);
S33、利用公式计算近红外光谱数据函数的系数矩阵C;所述公式如下:S33. Calculate the coefficient matrix C of the near-infrared spectrum data function using a formula; the formula is as follows:
PENSSE λ=SMSSE(x|c)+γPEN 2(X);  (3) PENSSE λ =SMSSE(x|c)+γPEN 2 (X); (3)
Figure PCTCN2019111602-appb-000002
Figure PCTCN2019111602-appb-000002
其中,x为第j个近红外光谱的观测数据,x j为第j个近红外光谱的观测数据,j为小于m的正整数,SMESS(x|c)表示极小化残差平方和;t j表示第j个近红外光谱的波段,k≤K≤j,φ k(t j)表示为第j个近红外光谱波段的B样条基函数; Among them, x is the observation data of the j-th near-infrared spectrum, x j is the observation data of the j-th near-infrared spectrum, j is a positive integer less than m, and SMESS(x|c) represents the sum of squared residuals to minimize; t j represents the j-th near-infrared spectrum band, k≤K≤j, φ k (t j ) represents the B-spline basis function of the j-th near-infrared spectrum band;
通过该方法实现对系数矩阵C的估计,同时利用粗糙惩罚对函数进行光滑处理,有效避免了过拟合现象。Through this method, the coefficient matrix C is estimated, and the rough penalty is used to smooth the function, which effectively avoids the phenomenon of over-fitting.
作为本发明进一步的方案:As a further solution of the present invention:
所述步骤S4包括:The step S4 includes:
利用公式对样本数据进行中心化处理,中心化公式如下:Use formulas to centralize the sample data. The centralization formula is as follows:
Figure PCTCN2019111602-appb-000003
Figure PCTCN2019111602-appb-000003
式中,i为样本序号,n为样本总量,
Figure PCTCN2019111602-appb-000004
为n个样本近红外光谱波段的函数均值,X i(t)为第i个样本的近红外光谱波段t的函数,
Figure PCTCN2019111602-appb-000005
为中心化处理之后的第i个样本的近红外光谱波段t的函数,c表示中心化,s.t.表示条件函数;
In the formula, i is the sample number, n is the total number of samples,
Figure PCTCN2019111602-appb-000004
Is the mean value of the NIR spectral band function of n samples, X i (t) is the function of the i-th NIR spectral band t,
Figure PCTCN2019111602-appb-000005
Is a function of the near-infrared spectrum band t of the i-th sample after the centralization process, c represents centralization, and st represents the condition function;
通过中心化处理,从而消除各个样本的近红外光谱数据间差异,提高了本方法的准确性。Through centralized processing, the difference between the near-infrared spectrum data of each sample is eliminated, and the accuracy of the method is improved.
作为本发明进一步的方案:所述步骤S5包括:As a further solution of the present invention: the step S5 includes:
利用公式计算协方差,协方差计算公式如下:Use the formula to calculate the covariance, the covariance calculation formula is as follows:
Figure PCTCN2019111602-appb-000006
Figure PCTCN2019111602-appb-000006
任意选取取一个与t不同的波段,记作为s;V(s,t)表示s、t两个不同波段的协方差。Choose a band different from t arbitrarily and record it as s; V(s, t) represents the covariance of two different bands of s and t.
作为本发明进一步的方案:所述步骤S6包括:As a further solution of the present invention: the step S6 includes:
利用公式计算协方差的第j个特征值;特征值的计算公式如下:Use the formula to calculate the j-th eigenvalue of the covariance; the calculation formula of the eigenvalue is as follows:
Figure PCTCN2019111602-appb-000007
Figure PCTCN2019111602-appb-000007
其中,ξ j(t)为第j个波段t的主元权重函数,ξ j(s)表示第j个波段s的主元权重函数,j为正整数,ρ j为特征值,s.t.表示条件函数,由式(7)可知,主成分函数ξ 1(t),ξ 2(t),…,ξ j(t)之间互不相关。 Where ξ j (t) is the pivot weight function of the j-th band t, ξ j (s) represents the pivot weight function of the j-th band s, j is a positive integer, ρ j is the eigenvalue, and st is the condition Function, from equation (7) we can see that the principal component functions ξ 1 (t), ξ 2 (t),..., ξ j (t) are not related to each other.
作为本发明进一步的方案:所述步骤S7包括:As a further solution of the present invention: the step S7 includes:
计算累计贡献度
Figure PCTCN2019111602-appb-000008
选取贡献度超过阈值的M个主元作为近红外光谱波段的特征值,构建定量模型,完成对待测样品的定性/定量分析;
Calculate cumulative contribution
Figure PCTCN2019111602-appb-000008
Select the M principal elements whose contribution degree exceeds the threshold value as the characteristic values of the near-infrared spectrum band, construct a quantitative model, and complete the qualitative/quantitative analysis of the sample to be tested;
其中,M表示主元个数,Among them, M represents the number of pivots,
所述阈值设定为90%。The threshold is set to 90%.
作为本发明进一步的方案:所述步骤S8包括:As a further solution of the present invention: the step S8 includes:
利用公式计算中心化处理后的不同近红外光谱波段方程中函数形主元得 分,所述函数形主元得分的公式如下:Use formulas to calculate the function-shaped principal component scores in the different near-infrared spectrum band equations after the centralization process, and the formula for the function-shaped principal component scores is as follows:
Figure PCTCN2019111602-appb-000009
Figure PCTCN2019111602-appb-000009
其中,f j为函数
Figure PCTCN2019111602-appb-000010
的第j个主元。
Where f j is a function
Figure PCTCN2019111602-appb-000010
The j-th pivot.
本发明还提供一种采用上述任一方案所述的基于函数性主元分析的近红外光谱特征提取方法的系统,包括:The present invention also provides a system adopting the near-infrared spectrum feature extraction method based on functional principal component analysis according to any one of the above solutions, including:
采集模块,用于采集多种样本中近红外光谱的数据;Acquisition module, used to collect near-infrared spectrum data of various samples;
预处理模块,用于采用标准正态变换对所述近红外光谱的数据进行预处理;A preprocessing module for preprocessing the near-infrared spectrum data using standard normal transformation;
获取模块,用于获取处理后的近红外光谱数据的样条函数;The acquisition module is used to acquire the spline function of the processed near-infrared spectrum data;
中心化处理模块,用于对所述样条函数进行中心化处理;Centralized processing module, used to centralize the spline function;
协方差模块,用于计算中心化处理后的样条函数在不同波段函数之间的协方差;The covariance module is used to calculate the covariance of the centered spline function between different band functions;
特征值模块,用于计算协方差的第j个特征值;The eigenvalue module is used to calculate the jth eigenvalue of the covariance;
贡献度模块,用于通过协方差的特征值,计算累计贡献度,贡献度超过阈值的主元作为近红外光谱波段的特征值;The contribution degree module is used to calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum band;
主元得分模块,用于利用近红外光谱波段的特征值,计算不同波段的方程中函数形主元得分。The principal component score module is used to use the characteristic values of the near-infrared spectrum bands to calculate the functional principal component scores in the equations of different bands.
与现有技术相比,本发明的有益效果是:Compared with the prior art, the beneficial effects of the present invention are:
1、本发明通过将近红外光谱数据看成连续型的函数,利用了全波段信息,在准确的提取含有有效信息的波段特征的同时,通过不同波段函数协方差得到特征值,并通过协方差特征值计算贡献度得到近红外光谱波段的特征值,从而得到不同波段的方程中函数形主元得分,实现进一步挖掘光谱数据内在函数的阶次与导数(变化率,斜率,曲率等)等信息;1. The present invention treats near-infrared spectroscopy data as a continuous function and utilizes full-band information. While accurately extracting band features with effective information, the feature values are obtained through the covariance of different band functions, and through the covariance features Calculate the contribution degree of the value to obtain the characteristic value of the near-infrared spectrum band, thereby obtaining the function-shaped principal component scores in the equations of different bands, and realize further mining of the order and derivative (rate of change, slope, curvature, etc.) information of the intrinsic function of the spectrum data;
2、本发明得到了不同波段的方程中函数形主元得分,同时得到变化率, 斜率,曲率等信息,从而增强了定标模型的稳健性,又改善了定标模型的预测能力,为近红外光谱数据提供了一种新的特征提取方法,具有很高的实用价值。2. The present invention obtains the function-shaped principal component scores in the equations of different bands, and at the same time obtains the rate of change, slope, curvature and other information, thereby enhancing the robustness of the calibration model and improving the predictive ability of the calibration model. Infrared spectrum data provides a new feature extraction method, which has high practical value.
附图说明Description of the drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例。In order to explain the technical solutions in the embodiments of the present invention more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only of the present invention. Some examples.
图1为本发明实施例1提供的一种基于函数性主元分析的近红外光谱特征提取方法的流程示意图。FIG. 1 is a schematic flowchart of a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention.
图2为本发明实施例1提供的一种基于函数性主元分析的近红外光谱特征提取方法中B-样条基函数的示意图;2 is a schematic diagram of B-spline basis functions in a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention;
图3为本发明实施例1提供的一种基于函数性主元分析的近红外光谱特征提取方法中食用菌的近红外光谱的函数性描述示意图;3 is a schematic diagram of the functional description of the near-infrared spectra of edible fungi in a near-infrared spectrum feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention;
图4为本发明实施例1提供的一种基于函数性主元分析的近红外光谱特征提取方法中食用菌的函数性主元分析的载荷分布图;4 is a load distribution diagram of functional principal component analysis of edible fungi in a near-infrared spectral feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention;
图5为本发明实施例1提供的一种基于函数性主元分析的近红外光谱特征提取方法中四种食用菌光谱数据的两个主元分析结果图。FIG. 5 is a diagram of two principal component analysis results of four kinds of edible fungi spectral data in a near-infrared spectral feature extraction method based on functional principal component analysis according to Embodiment 1 of the present invention.
具体实施方式Detailed ways
为了使本发明所要解决的技术问题、技术方案及有益效果更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the technical problems, technical solutions and beneficial effects to be solved by the present invention clearer, the following further describes the present invention in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention.
一种基于函数性主元分析的近红外光谱特征提取方法,本实施例运用近红外光谱分析,实现对不同食用菌类别进行甄别,并获取阶次与导数(变化率,斜率,曲率等)等信息;当然,本发明并不仅仅限于对食用菌类别进行甄别,步骤如下:A feature extraction method of near-infrared spectroscopy based on functional principal component analysis. This embodiment uses near-infrared spectroscopy analysis to distinguish different types of edible fungi, and obtain orders and derivatives (rate of change, slope, curvature, etc.) Information; Of course, the present invention is not limited to screening the types of edible fungi. The steps are as follows:
S1、对若干样本的近红外光谱数据;S1. Near-infrared spectrum data of several samples;
采集待测样品的近红外光谱数据,并通过理化试验测定主要营养成分的含量,营养成分包括:蛋白质、脂肪和多种氨基酸等;光谱仪采用美国Ocean Optics公司生产的NIRQuest512型近红外光谱仪,配置波长范围为360nm-2000nm的HL-2000系列卤钨灯光源,光谱仪分辨率为3cm -1,积分时间为45s,扫描波长范围为:900-1700nm,内置具有512个像素点、稳定性高的滨松铟镓砷化物(InGaAs)阵列探测器,扫描次数为32次; Collect the near-infrared spectroscopy data of the sample to be tested, and determine the content of the main nutrients through physical and chemical tests. The nutrient ingredients include: protein, fat and a variety of amino acids; the spectrometer adopts the NIRQuest512 near-infrared spectrometer produced by Ocean Optics in the United States, with wavelength The range of HL-2000 series halogen tungsten light source is 360nm-2000nm, the resolution of the spectrometer is 3cm -1 , the integration time is 45s, the scanning wavelength range is: 900-1700nm, built-in Hamamatsu with 512 pixels and high stability Indium gallium arsenide (InGaAs) array detector with 32 scans;
其中,采集待测样品为野生松茸、姬松茸、老人头、杏鲍菇共166个切片干样,对166个切片干样进行光谱采样;Among them, the collected samples to be tested are wild matsutake, Agaricus blazei, old man's head, and Pleurotus eryngii, a total of 166 dry slices, and spectrum sampling is performed on 166 dry slices;
S2、对若干样本的近红外光谱数据进行预处理;S2, preprocessing the near-infrared spectrum data of several samples;
对采集到的光谱数据使用标准正态变换进行预处理,以消除噪声与基带漂移的影响;Preprocess the collected spectrum data using standard normal transformation to eliminate the influence of noise and baseband drift;
S3、对预处理后的近红外光谱数据进行函数形描述;包括:S3. Perform functional description of the preprocessed near-infrared spectrum data; including:
S31、如图2,图2为本发明实施例提供的一种基于函数性主元分析的近红外光谱特征提取方法中B样条基函数的示意图,B样条函数为七次样条函数,21个基函数,利用七次B样条函数,利用公式对对每一样品的近红外光谱数据进行函数形描述,所述公式如下:S31. As shown in Fig. 2, Fig. 2 is a schematic diagram of the B-spline basis function in a near-infrared spectrum feature extraction method based on functional principal component analysis according to an embodiment of the present invention. The B-spline function is a seventh-order spline function. 21 basis functions, using seven B-spline functions, and using formulas to describe the near-infrared spectrum data of each sample as a function, the formula is as follows:
Figure PCTCN2019111602-appb-000011
Figure PCTCN2019111602-appb-000011
式中,φ k(t)为近红外光谱波段的第k个B样条基函数,k小于等于m,m代表相应的B样条基函数数量,C为系数矩阵,X(t)为近红外光谱数据的函数形式,t为近红外光谱的波段,∑表示求和函数,这样,近红外光谱数据就可以用一组选定的基函数φ k(t),采用基函数φ k(t)的线性组合形式描述; In the formula, φ k (t) is the k-th B-spline basis function of the near-infrared spectral band, k is less than or equal to m, and m represents the number of corresponding B-spline basis functions, C is the coefficient matrix, and X(t) is the near The function form of the infrared spectrum data, t is the band of the near-infrared spectrum, ∑ represents the summation function, so that the near-infrared spectrum data can use a set of selected basis functions φ k (t), using the basis function φ k (t ) Linear combination description;
S32、利用公式对X(t)函数进行光滑处理,数学上,样条函数光滑度可 以用函数的二阶导数描述,本步骤中采用DDX(t)表示函数X(t)的二阶导数,粗糙惩罚公式如下:S32. Use the formula to smooth the X(t) function. Mathematically, the smoothness of the spline function can be described by the second derivative of the function. In this step, DDX(t) is used to represent the second derivative of the function X(t). The rough penalty formula is as follows:
PEN 2(X)=∫[DDX(t)] 2dt   (2) PEN 2 (X)=∫[DDX(t)] 2 dt (2)
其中,PEN 2(X)表示粗糙惩罚,DDX(t)表示函数X(t)的二阶导数; Among them, PEN 2 (X) represents the rough penalty, and DDX(t) represents the second derivative of the function X(t);
S33、利用公式计算近红外光谱数据函数的系数矩阵C;所述公式如下:S33. Calculate the coefficient matrix C of the near-infrared spectrum data function using a formula; the formula is as follows:
PENSSE λ=SMSSE(x|c)+γPEN 2(X)  (3) PENSSE λ =SMSSE(x|c)+γPEN 2 (X) (3)
Figure PCTCN2019111602-appb-000012
Figure PCTCN2019111602-appb-000012
其中,x j为第j个近红外光谱的观测数据,SMESS(x|c)表示极小化残差平方和;t j表示第j个近红外光谱的波段,k≤K≤j,φ k(t j)表示为第j个近红外光谱波段的B样条基函数; Among them, x j is the observation data of the j-th near-infrared spectrum, SMESS(x|c) represents the sum of squared residuals of the minimization; t j represents the band of the j-th near-infrared spectrum, k≤K≤j, φ k (t j ) is expressed as the B-spline basis function of the j-th near-infrared spectral band;
S33、式(4)可由最小二乘法估计,但是最小二乘法估计易受到噪声的影响,产生过拟合现象,因此利用粗糙惩罚对函数进行光滑处理,即利用二阶导数平方的积分来控制函数曲线的光滑程度,然后,将残差平方和与粗糙惩罚结合在一起,实现对系数矩阵C的估计,式中γ为光滑系数,γ越大,拟合越接近直线,拟合是给定了空间中的一些点,找到一个已知形式未知参数的连续曲面来最大限度地逼近这些点;S33. Equation (4) can be estimated by the least square method, but the least square method estimation is susceptible to noise and over-fitting. Therefore, the rough penalty is used to smooth the function, that is, the integral of the second derivative square is used to control the function The smoothness of the curve, then, the residual sum of squares and the roughness penalty are combined to estimate the coefficient matrix C, where γ is the smoothness coefficient, the larger the γ, the closer the fitting is to the straight line, and the fitting is given For some points in space, find a continuous surface with known form and unknown parameters to approximate these points as much as possible;
S4、如图3所示,图3为本发明实施例提供的一种基于函数性主元分析的近红外光谱特征提取方法中食用菌的近红外光谱的函数性描述示意图,为了消除各个样本的近红外光谱数据间差异,需要对近红外光谱数据的函数进行中心化处理,中心化处理的公式如下:S4. As shown in FIG. 3, FIG. 3 is a schematic diagram of functional description of the near-infrared spectrum of edible fungi in a method for extracting near-infrared spectrum features based on functional principal component analysis according to an embodiment of the present invention. The difference between near-infrared spectroscopy data requires centralized processing of the functions of near-infrared spectroscopy data. The formula for centralized processing is as follows:
Figure PCTCN2019111602-appb-000013
Figure PCTCN2019111602-appb-000013
式中,i为样本序号,n为样本总量,
Figure PCTCN2019111602-appb-000014
为n个样本近红外光谱波段的函数均值,X i(t)为第i个样本的近红外光谱波段t的函数,
Figure PCTCN2019111602-appb-000015
为中心化处理之后的第i个样本的近红外光谱波段t的函数,c表示中心化,s.t.表示条件函数;
In the formula, i is the sample number, n is the total number of samples,
Figure PCTCN2019111602-appb-000014
Is the mean value of the NIR spectral band function of n samples, X i (t) is the function of the i-th NIR spectral band t,
Figure PCTCN2019111602-appb-000015
Is a function of the near-infrared spectrum band t of the i-th sample after the centralization process, c represents centralization, and st represents the condition function;
S5、求所有样本中不同的波段函数之间的协方差,任意选取一个与t不同的波段,记作为s,如下式:S5. Find the covariance between the different band functions in all samples, choose a band different from t arbitrarily, and record it as s, as follows:
Figure PCTCN2019111602-appb-000016
Figure PCTCN2019111602-appb-000016
V(s,t)表示两个不同波段函数之间的协方差,
Figure PCTCN2019111602-appb-000017
表示中心化处理后的第i个样本的近红外光谱波段s的函数;
V(s,t) represents the covariance between two different band functions,
Figure PCTCN2019111602-appb-000017
Represents the function of the near-infrared spectrum band s of the i-th sample after the centralized processing;
S6、计算协方差的第j个特征值,计算公式如下:S6. Calculate the j-th eigenvalue of the covariance, and the calculation formula is as follows:
Figure PCTCN2019111602-appb-000018
Figure PCTCN2019111602-appb-000018
其中,ξ j(t)为波段t的第j个主元权重函数,ξ j(s)表示波段s的第j个主元权重函数,j为正整数,ρ j为特征值,s.t.表示条件函数,由式(7)可知,主成分函数ξ 1(t),ξ 2(t),…,ξ j(t)之间互不相关; Where ξ j (t) is the j-th pivot weight function of band t, ξ j (s) is the j-th pivot weight function of band s, j is a positive integer, ρ j is the eigenvalue, and st is the condition Function, from equation (7) we can see that the principal component functions ξ 1 (t), ξ 2 (t),..., ξ j (t) are not correlated with each other;
S7、由主元权重函数所对应的特征值ρ j,计算累计贡献度
Figure PCTCN2019111602-appb-000019
选取贡献度超过阈值的M个主元作为近红外光谱波段的特征值,M表示主元个数;如图4所示,图4为本发明实施例提供的一种基于函数性主元分析的近红外光谱特征提取方法中食用菌的函数性主元分析的载荷分布图,本实施例中,M=2,即2个主元就能够显示所有的贡献度,再构建定量模型,实现待测样品的定性/定量分析;
S7. Calculate the cumulative contribution from the eigenvalue ρ j corresponding to the pivot weight function
Figure PCTCN2019111602-appb-000019
The M principal components whose contribution degree exceeds the threshold are selected as the characteristic values of the near-infrared spectrum band, and M represents the number of principal components; as shown in FIG. 4, FIG. 4 is a functional principal component analysis based on the embodiment of the present invention. The load distribution diagram of the functional principal component analysis of edible fungi in the near-infrared spectroscopy feature extraction method. In this embodiment, M=2, that is, 2 principal components can display all the contributions, and then construct a quantitative model to realize the test Qualitative/quantitative analysis of samples;
其中,本实施例的所述阈值设定为90%;Wherein, the threshold in this embodiment is set to 90%;
S8、依据特征值,计算不同近红外光谱波段对应中心化处理后的方程中函数形主元得分,实现对光谱数据的分析;函数形主元得分公式如下:S8. According to the characteristic value, calculate the function-shaped principal component scores in the equations after the centralization processing corresponding to different near-infrared spectral bands to realize the analysis of the spectral data; the function-shaped principal component score formula is as follows:
Figure PCTCN2019111602-appb-000020
Figure PCTCN2019111602-appb-000020
其中,f j为函数
Figure PCTCN2019111602-appb-000021
的第j个主元,由式(7)的计算过程可知,各权重函数的方差满足var(f 1)>var(f 2)>…>var(f j),var代表随机变量方差,c表示中心化;
Where f j is a function
Figure PCTCN2019111602-appb-000021
According to the calculation process of formula (7), the variance of each weight function satisfies var(f 1 )>var(f 2 )>…>var(f j ), var represents the variance of random variables, c Represents centralization;
如图5所示,图5为本发明实施例提供的一种基于函数性主元分析的近红外光谱特征提取方法中四种食用菌光谱数据的两个主元分析结果图,本实施实例中,利用2个主元即可实现四个食用菌的分类,并获取光谱数据内在函数的阶次与导数等信息,导数信息包括变化率、斜率、曲线等,四个食用菌为野生松茸、姬松茸、老人头、杏鲍菇,图5中横坐标表示第一主元,纵坐标表示第二主元。As shown in FIG. 5, FIG. 5 is a diagram of two principal component analysis results of four kinds of edible fungi spectral data in a near-infrared spectrum feature extraction method based on functional principal component analysis provided by an embodiment of the present invention. , Use 2 principal elements to classify the four edible fungi, and obtain the order and derivative of the intrinsic function of the spectral data. The derivative information includes the rate of change, slope, curve, etc. The four edible fungi are wild Matsutake, Ji For matsutake, old man's head, and Pleurotus eryngii, the abscissa in Figure 5 represents the first pivot and the ordinate represents the second pivot.
实施例2Example 2
一种基于函数性主元分析的近红外光谱特征提取系统,包括:A near-infrared spectrum feature extraction system based on functional principal component analysis, including:
采集模块,用于采集多种样本中近红外光谱的数据;Acquisition module, used to collect near-infrared spectrum data of various samples;
预处理模块,用于采用标准正态变换对所述近红外光谱的数据进行预处理;A preprocessing module for preprocessing the near-infrared spectrum data using standard normal transformation;
获取模块,用于获取处理后的近红外光谱数据的样条函数;The acquisition module is used to acquire the spline function of the processed near-infrared spectrum data;
中心化处理模块,用于对所述样条函数进行中心化处理;Centralized processing module, used to centralize the spline function;
协方差模块,用于计算中心化处理后的样条函数在不同波段函数之间的协方差;The covariance module is used to calculate the covariance of the centered spline function between different band functions;
特征值模块,用于计算协方差的第j个特征值;The eigenvalue module is used to calculate the jth eigenvalue of the covariance;
贡献度模块,用于通过协方差的特征值,计算累计贡献度,贡献度超过阈值的主元作为近红外光谱波段的特征值;The contribution degree module is used to calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum band;
主元得分模块,用于利用近红外光谱波段的特征值,计算不同波段的方程中函数形主元得分。在本发明的描述中,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明中的具体含义。The principal component score module is used to use the characteristic values of the near-infrared spectrum bands to calculate the functional principal component scores in the equations of different bands. In the description of the present invention, unless otherwise clearly specified and limited, the terms "installed", "connected", and "connected" should be interpreted broadly. For example, they may be fixedly connected, detachably connected, or integrally connected. Connection; it can be a mechanical connection or an electrical connection; it can be directly connected, or indirectly connected through an intermediate medium, and it can be the internal communication between two components. For those of ordinary skill in the art, the specific meaning of the above-mentioned terms in the present invention can be understood in specific situations.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacement and improvement made within the spirit and principle of the present invention shall be included in the protection of the present invention. Within range.

Claims (10)

  1. 一种基于函数性主元分析的近红外光谱特征提取方法,其特征在于,步骤如下:A feature extraction method for near infrared spectroscopy based on functional principal component analysis, which is characterized in that the steps are as follows:
    S1、采集多种样本中近红外光谱的数据;S1. Collect near-infrared spectroscopy data in various samples;
    S2、采用标准正态变换对所述近红外光谱的数据进行预处理;S2. Preprocessing the near-infrared spectroscopy data by using standard normal transformation;
    S3、获取处理后的近红外光谱数据的样条函数;S3. Obtain the spline function of the processed near-infrared spectrum data;
    S4、对所述样条函数进行中心化处理;S4. Perform centralization processing on the spline function;
    S5、计算中心化处理后的样条函数在不同波段函数之间的协方差;S5. Calculate the covariance of the centered spline function between different waveband functions;
    S6、计算协方差的第j个特征值;S6. Calculate the j-th eigenvalue of the covariance;
    S7、通过协方差的特征值,计算累计贡献度,贡献度超过阈值的主元作为近红外光谱波段的特征值;S7. Calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near infrared spectrum band;
    S8、利用近红外光谱波段的特征值,计算不同波段的方程中函数形主元得分。S8. Using the characteristic values of the near-infrared spectrum bands, calculate the functional principal component scores in the equations of different bands.
  2. 根据权利要求1所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S1中,采集待测样品的近红外光谱数据,并通过理化试验测定营养成分的含量;营养成分包括:蛋白质、脂肪和多种氨基酸。The method for extracting near-infrared spectroscopy features based on functional principal component analysis according to claim 1, characterized in that, in the step S1, near-infrared spectroscopy data of the sample to be tested is collected, and the content of nutrients is determined through physical and chemical tests ; Nutritional ingredients include: protein, fat and a variety of amino acids.
  3. 根据权利要求2所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,采集的待测样品分别为若干个野生松茸、姬松茸、老人头、杏鲍菇的切片干样,并对所述切片干样进行光谱采样。The near-infrared spectroscopy feature extraction method based on functional principal component analysis according to claim 2, characterized in that the collected samples to be tested are respectively several dried slices of wild matsutake, Agaricus blazei, old man's head, and Pleurotus eryngii, And perform spectral sampling on the dry slice sample.
  4. 根据权利要求1所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S3包括:The near-infrared spectrum feature extraction method based on functional principal component analysis according to claim 1, wherein the step S3 comprises:
    S31、利用公式,获取各个样品的近红外光谱数据的B样条函数,所述公式如下:S31. Obtain the B-spline function of the near-infrared spectrum data of each sample by using a formula, the formula is as follows:
    Figure PCTCN2019111602-appb-100001
    Figure PCTCN2019111602-appb-100001
    其中,φ k(t)为近红外光谱波段的第k个B样条基函数,k小于等于m,m表 示B样条基函数的数量,C为系数矩阵,X(t)为近红外光谱数据的函数形式,t为近红外光谱的波段,∑表示求和函数; Among them, φ k (t) is the k-th B-spline basis function of the near-infrared spectrum band, k is less than or equal to m, m represents the number of B-spline basis functions, C is the coefficient matrix, and X(t) is the near-infrared spectrum The function form of the data, t is the band of the near-infrared spectrum, and ∑ represents the summation function;
    S32、利用公式对X(t)函数进行光滑处理,所述公式如下:S32. Use a formula to smooth the X(t) function. The formula is as follows:
    PEN 2(X)=∫[DDX(t)] 2dt     (2) PEN 2 (X)=∫[DDX(t)] 2 dt (2)
    其中,PEN 2(X)表示粗糙惩罚,DDX(t)表示函数X(t)的二阶导数; Among them, PEN 2 (X) represents the rough penalty, and DDX(t) represents the second derivative of the function X(t);
    S33、利用公式计算近红外光谱数据函数的系数矩阵C;所述公式如下:S33. Calculate the coefficient matrix C of the near-infrared spectrum data function using a formula; the formula is as follows:
    PENSSE λ=SMSSE(x|c)+γPEN 2(X);    (3) PENSSE λ =SMSSE(x|c)+γPEN 2 (X); (3)
    其中,PENSSE λ表示残差平方和与粗糙惩罚之和,γ为光滑系数; Among them, PENSSE λ represents the sum of the residual sum of squares and the rough penalty, and γ is the smoothing coefficient;
    Figure PCTCN2019111602-appb-100002
    Figure PCTCN2019111602-appb-100002
    其中,x为近红外光谱的观测数据,x j为第j个近红外光谱的观测数据,j为小于m的正整数,SMESS(x|c)表示极小化残差平方和;t j表示第j个近红外光谱的波段,k≤K≤j,φ k(t j)表示为第j个近红外光谱波段的B样条基函数。 Among them, x is the observation data of the near-infrared spectrum, x j is the observation data of the j-th near-infrared spectrum, j is a positive integer less than m, SMESS(x|c) represents the sum of squares of the minimized residuals; t j represents The j-th near-infrared spectrum band, k≤K≤j, φ k (t j ) is expressed as the B-spline basis function of the j-th near-infrared spectrum band.
  5. 根据权利要求1所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S4包括:The near-infrared spectrum feature extraction method based on functional principal component analysis according to claim 1, wherein said step S4 comprises:
    利用公式对样本数据进行中心化处理,所述公式如下:Use the formula to centralize the sample data, the formula is as follows:
    Figure PCTCN2019111602-appb-100003
    Figure PCTCN2019111602-appb-100003
    式中,i为样本序号,n为样本总量,
    Figure PCTCN2019111602-appb-100004
    为n个样本近红外光谱波段的函数均值,X i(t)为第i个样本的近红外光谱波段t的函数,
    Figure PCTCN2019111602-appb-100005
    为中心化处理之后的第i个样本的近红外光谱波段t的函数,c表示中心化,s.t.表示条件函数。
    In the formula, i is the sample number, n is the total number of samples,
    Figure PCTCN2019111602-appb-100004
    Is the mean value of the NIR spectral band function of n samples, X i (t) is the function of the i-th NIR spectral band t,
    Figure PCTCN2019111602-appb-100005
    It is a function of the near-infrared spectrum band t of the i-th sample after the centering process, c means centering, and st means conditional function.
  6. 据权利要求5所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S5包括:The near-infrared spectrum feature extraction method based on functional principal component analysis according to claim 5, wherein said step S5 comprises:
    所述协方差的计算公式如下:The calculation formula of the covariance is as follows:
    Figure PCTCN2019111602-appb-100006
    Figure PCTCN2019111602-appb-100006
    其中,s代表与t不同的近红外光谱的波段;X i c(s)表示中心化处理后的第i个样本的近红外光谱波段s的函数,V(s,t)表示s,t两个波段的协方差。 Among them, s represents the band of the near-infrared spectrum that is different from t; X i c (s) represents the function of the near-infrared spectrum band s of the i-th sample after the centering process, and V(s, t) represents s, t two The covariance of two bands.
  7. 根据权利要求6所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S6包括:The near-infrared spectrum feature extraction method based on functional principal component analysis according to claim 6, wherein the step S6 comprises:
    利用公式计算协方差的第j个特征值;所述公式如下:Use the formula to calculate the j-th eigenvalue of the covariance; the formula is as follows:
    Figure PCTCN2019111602-appb-100007
    Figure PCTCN2019111602-appb-100007
    其中,ξ j(t)为第j个波段t的主元权重函数,ξ j(s)表示第j波段s的主元权重函数,j为正整数,ρ j为特征值,s.t.表示条件函数。 Among them, ξ j (t) is the pivot weight function of the j-th band t, ξ j (s) represents the pivot weight function of the j-th band s, j is a positive integer, ρ j is the eigenvalue, and st is the condition function .
  8. 根据权利要求7所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,所述步骤S7包括:The near-infrared spectroscopy feature extraction method based on functional principal component analysis according to claim 7, wherein said step S7 comprises:
    累计贡献度的公式为
    Figure PCTCN2019111602-appb-100008
    选取贡献度超过阈值的M个主元作为近红外光谱波段的特征值,构建定量模型,完成对待测样品的定性或定量分析;
    The formula for cumulative contribution is
    Figure PCTCN2019111602-appb-100008
    Select the M principal elements whose contribution degree exceeds the threshold value as the characteristic value of the near-infrared spectrum band, construct a quantitative model, and complete the qualitative or quantitative analysis of the sample to be tested;
    其中,M表示主元个数。Among them, M represents the number of pivots.
  9. 根据权利要求7所述的基于函数性主元分析的近红外光谱特征提取方法,其特征在于,The near-infrared spectrum feature extraction method based on functional principal component analysis according to claim 7, characterized in that,
    所述步骤S8包括:The step S8 includes:
    利用公式计算中心化处理后的不同近红外光谱波段方程中函数形主元得分,所述公式如下:Use the formula to calculate the function-shaped principal component scores in the different near-infrared spectrum band equations after the centralization process, the formula is as follows:
    Figure PCTCN2019111602-appb-100009
    Figure PCTCN2019111602-appb-100009
    其中,f j为函数
    Figure PCTCN2019111602-appb-100010
    的第j个主元。
    Where f j is a function
    Figure PCTCN2019111602-appb-100010
    The j-th pivot.
  10. 一种采用权利要求1-9任一项所述的基于函数性主元分析的近红外光谱特征提取方法的提取系统,其特征在于,包括:An extraction system using the near-infrared spectral feature extraction method based on functional principal component analysis according to any one of claims 1-9, characterized in that it comprises:
    采集模块,用于采集多种样本中近红外光谱的数据;Acquisition module, used to collect near-infrared spectrum data of various samples;
    预处理模块,用于采用标准正态变换对所述近红外光谱的数据进行预处理;A preprocessing module for preprocessing the near-infrared spectrum data using standard normal transformation;
    获取模块,用于获取处理后的近红外光谱数据的样条函数;The acquisition module is used to acquire the spline function of the processed near-infrared spectrum data;
    中心化处理模块,用于对所述样条函数进行中心化处理;Centralized processing module, used to centralize the spline function;
    协方差模块,用于计算中心化处理后的样条函数在不同波段函数之间的协方差;The covariance module is used to calculate the covariance of the centered spline function between different band functions;
    特征值模块,用于计算协方差的第j个特征值;The eigenvalue module is used to calculate the jth eigenvalue of the covariance;
    贡献度模块,用于通过协方差的特征值,计算累计贡献度,贡献度超过阈值的主元作为近红外光谱波段的特征值;The contribution degree module is used to calculate the cumulative contribution degree through the characteristic value of the covariance, and the principal element whose contribution degree exceeds the threshold is used as the characteristic value of the near-infrared spectrum band;
    主元得分模块,用于利用近红外光谱波段的特征值,计算不同波段的方程中函数形主元得分。The principal component score module is used to use the characteristic values of the near-infrared spectrum bands to calculate the functional principal component scores in the equations of different bands.
PCT/CN2019/111602 2019-05-22 2019-10-17 Near infrared spectral feature extraction method and system based on functional principal component analysis WO2020232959A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910427265.2A CN110006844A (en) 2019-05-22 2019-05-22 Near infrared spectrum feature extracting method and system based on functionality pivot analysis
CN201910427265.2 2019-05-22

Publications (1)

Publication Number Publication Date
WO2020232959A1 true WO2020232959A1 (en) 2020-11-26

Family

ID=67177642

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/111602 WO2020232959A1 (en) 2019-05-22 2019-10-17 Near infrared spectral feature extraction method and system based on functional principal component analysis

Country Status (2)

Country Link
CN (1) CN110006844A (en)
WO (1) WO2020232959A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115963074A (en) * 2023-02-23 2023-04-14 中国人民解放军国防科技大学 Rapid detection method and system for spore and hypha ratio of microbial material
CN116881705A (en) * 2023-09-07 2023-10-13 佳木斯大学 Near infrared spectrum data processing system of calyx seu fructus physalis
CN117291445A (en) * 2023-11-27 2023-12-26 国网安徽省电力有限公司电力科学研究院 Multi-target prediction method based on state transition under comprehensive energy system
CN117473207A (en) * 2023-12-28 2024-01-30 深圳市新景环境技术有限公司 Paint spraying waste gas treatment equipment and treatment method thereof
CN117589741A (en) * 2024-01-18 2024-02-23 天津博霆光电技术有限公司 Indocyanine green intelligent detection method based on optical characteristics

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110006844A (en) * 2019-05-22 2019-07-12 安徽大学 Near infrared spectrum feature extracting method and system based on functionality pivot analysis
CN112837816B (en) * 2021-02-09 2022-11-29 清华大学 Physiological state prediction method, computer device, and storage medium
CN114298107A (en) * 2021-12-29 2022-04-08 安徽大学 Near infrared spectrum net signal extraction method and system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101738373A (en) * 2008-11-24 2010-06-16 中国农业大学 Method for distinguishing varieties of crop seeds
CN101915744A (en) * 2010-07-05 2010-12-15 北京航空航天大学 Near infrared spectrum nondestructive testing method and device for material component content
CN102519903A (en) * 2011-11-22 2012-06-27 山东理工大学 Method for measuring whiteness value of Agaricus bisporus by using near infrared spectrum
WO2012142076A1 (en) * 2011-04-12 2012-10-18 The General Hospital Corporation System and method for monitoring glucose or other compositions in an individual
CN105139412A (en) * 2015-09-25 2015-12-09 深圳大学 Hyperspectral image corner detection method and system
CN108780730A (en) * 2016-03-07 2018-11-09 英国质谱公司 Spectrum analysis
CN109409350A (en) * 2018-10-23 2019-03-01 桂林理工大学 A kind of Wavelength selecting method based on PCA modeling reaction type load weighting
CN110006844A (en) * 2019-05-22 2019-07-12 安徽大学 Near infrared spectrum feature extracting method and system based on functionality pivot analysis

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103985000B (en) * 2014-06-05 2017-04-26 武汉大学 Medium-and-long term typical daily load curve prediction method based on function type nonparametric regression
CN104778337B (en) * 2015-04-30 2017-03-22 北京航空航天大学 Method for predicting remaining service life of lithium battery based on FPCA (functional principal component analysis) and Bayesian updating
CN106645014B (en) * 2016-09-23 2019-04-30 上海理工大学 Substance identification based on tera-hertz spectra

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101738373A (en) * 2008-11-24 2010-06-16 中国农业大学 Method for distinguishing varieties of crop seeds
CN101915744A (en) * 2010-07-05 2010-12-15 北京航空航天大学 Near infrared spectrum nondestructive testing method and device for material component content
WO2012142076A1 (en) * 2011-04-12 2012-10-18 The General Hospital Corporation System and method for monitoring glucose or other compositions in an individual
CN102519903A (en) * 2011-11-22 2012-06-27 山东理工大学 Method for measuring whiteness value of Agaricus bisporus by using near infrared spectrum
CN105139412A (en) * 2015-09-25 2015-12-09 深圳大学 Hyperspectral image corner detection method and system
CN108780730A (en) * 2016-03-07 2018-11-09 英国质谱公司 Spectrum analysis
CN109409350A (en) * 2018-10-23 2019-03-01 桂林理工大学 A kind of Wavelength selecting method based on PCA modeling reaction type load weighting
CN110006844A (en) * 2019-05-22 2019-07-12 安徽大学 Near infrared spectrum feature extracting method and system based on functionality pivot analysis

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHEN QUANSHENG; ZHAO JIEWEN EDITED BY CHEN QUANSHENG: "Tea Quality and Safety Detection and Analysis Method", 31 March 2001, CHINA LIGHT INDUSTRY PRESS, CN, ISBN: 978-7-5019-7971-4, article CHEN QUANSHENG; ZHAO JIEWEN EDITED BY CHEN QUANSHENG: "Chapter 3, Near-infrared spectral analysis", pages: 101 - 103, XP009524358 *
GONG, HUILI: "Feature extraction and similarity measure on tobacco near infrared spectra", CHINA DOCTORAL DISSERTATIONS FULL-TEXT DATABASE, BASIC SCIENCES, no. 02, 15 February 2015 (2015-02-15), pages 1 - 115, XP055754936 *
LI, YUQIANG ET AL.: "NIR spectral feature selection using lasso method and its application in the classification analysis", SPECTROSCOPY AND SPECTRAL ANALYSIS, vol. 39, no. 12, 31 December 2019 (2019-12-31), pages 3809 - 3815, XP055754956 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115963074A (en) * 2023-02-23 2023-04-14 中国人民解放军国防科技大学 Rapid detection method and system for spore and hypha ratio of microbial material
CN115963074B (en) * 2023-02-23 2023-06-02 中国人民解放军国防科技大学 Method and system for rapidly detecting spore hypha ratio of microbial material
CN116881705A (en) * 2023-09-07 2023-10-13 佳木斯大学 Near infrared spectrum data processing system of calyx seu fructus physalis
CN116881705B (en) * 2023-09-07 2023-11-21 佳木斯大学 Near infrared spectrum data processing system of calyx seu fructus physalis
CN117291445A (en) * 2023-11-27 2023-12-26 国网安徽省电力有限公司电力科学研究院 Multi-target prediction method based on state transition under comprehensive energy system
CN117291445B (en) * 2023-11-27 2024-02-13 国网安徽省电力有限公司电力科学研究院 Multi-target prediction method based on state transition under comprehensive energy system
CN117473207A (en) * 2023-12-28 2024-01-30 深圳市新景环境技术有限公司 Paint spraying waste gas treatment equipment and treatment method thereof
CN117473207B (en) * 2023-12-28 2024-03-29 深圳市新景环境技术有限公司 Paint spraying waste gas treatment equipment and treatment method thereof
CN117589741A (en) * 2024-01-18 2024-02-23 天津博霆光电技术有限公司 Indocyanine green intelligent detection method based on optical characteristics
CN117589741B (en) * 2024-01-18 2024-04-05 天津博霆光电技术有限公司 Indocyanine green intelligent detection method based on optical characteristics

Also Published As

Publication number Publication date
CN110006844A (en) 2019-07-12

Similar Documents

Publication Publication Date Title
WO2020232959A1 (en) Near infrared spectral feature extraction method and system based on functional principal component analysis
Saeys et al. Multivariate calibration of spectroscopic sensors for postharvest quality evaluation: A review
Kimuli et al. Utilisation of visible/near-infrared hyperspectral images to classify aflatoxin B1 contaminated maize kernels
Mishra et al. Partial least square regression versus domain invariant partial least square regression with application to near-infrared spectroscopy of fresh fruit
Song et al. Differentiation of organic and non-organic apples using near infrared reflectance spectroscopy—A pattern recognition approach
Singh et al. Nondestructive identification of barley seeds variety using near‐infrared hyperspectral imaging coupled with convolutional neural network
Shahin et al. Quantification of mildew damage in soft red winter wheat based on spectral characteristics of bulk samples: a comparison of visible-near-infrared imaging and near-infrared spectroscopy
Torres et al. Developing universal models for the prediction of physical quality in citrus fruits analysed on-tree using portable NIRS sensors
Munawar et al. Near infrared spectroscopy as a fast and non-destructive technique for total acidity prediction of intact mango: Comparison among regression approaches
Beghi et al. Rapid evaluation of grape phytosanitary status directly at the check point station entering the winery by using visible/near infrared spectroscopy
CN109470648A (en) A kind of single grain crop unsound grain quick nondestructive determination method
Fadock et al. Visible-near infrared reflectance spectroscopy for nondestructive analysis of red wine grapes
Yu et al. Rapid and visual measurement of fat content in peanuts by using the hyperspectral imaging technique with chemometrics
Puttipipatkajorn et al. Development of calibration models for rapid determination of moisture content in rubber sheets using portable near-infrared spectrometers
Pourdarbani et al. Using metaheuristic algorithms to improve the estimation of acidity in Fuji apples using NIR spectroscopy
Liu et al. Identification of heat damage in imported soybeans based on hyperspectral imaging technology
Feng et al. Nondestructive determination of soluble solids content and pH in red bayberry (Myrica rubra) based on color space
Courand et al. Evaluation of a robust regression method (RoBoost-PLSR) to predict biochemical variables for agronomic applications: Case study of grape berry maturity monitoring
Wachendorf et al. Prediction of the clover content of red clover‐and white clover‐grass mixtures by near‐infrared reflectance spectroscopy
Hao et al. Combined hyperspectral imaging technology with 2D convolutional neural network for near geographical origins identification of wolfberry
Shen et al. Discrimination of blended Chinese rice wine ages based on near-infrared spectroscopy
CN116578851A (en) Method for predicting effective boron content of hyperspectral soil
CN106442400B (en) A kind of method that near infrared spectrum quickly determines soil type fresh tea leaves
Xu et al. Comparative study of different wavelength selection methods in the transfer of crop kernel qualitive near-infrared models
Chen et al. Hyperspectral reflectance imaging for detecting typical defects of durum kernel surface

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19929260

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19929260

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19929260

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 30/05/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19929260

Country of ref document: EP

Kind code of ref document: A1