CN101408501A - Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method - Google Patents

Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method Download PDF

Info

Publication number
CN101408501A
CN101408501A CNA2008100514936A CN200810051493A CN101408501A CN 101408501 A CN101408501 A CN 101408501A CN A2008100514936 A CNA2008100514936 A CN A2008100514936A CN 200810051493 A CN200810051493 A CN 200810051493A CN 101408501 A CN101408501 A CN 101408501A
Authority
CN
China
Prior art keywords
dna base
partial
near infrared
spectrum
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008100514936A
Other languages
Chinese (zh)
Inventor
田坚
金丽虹
赵丽辉
申炳俊
王一郎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changchun University of Science and Technology
Original Assignee
Changchun University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changchun University of Science and Technology filed Critical Changchun University of Science and Technology
Priority to CNA2008100514936A priority Critical patent/CN101408501A/en
Publication of CN101408501A publication Critical patent/CN101408501A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention relates to a method for quantitative detection of DNA bases by adopting a near infrared spectrum-partial least squares method, belonging to the technical field of near infrared spectrum detection. The present high performance liquid chromatography and the test equipment thereof are expensive, the cost is higher, the operation is more complex, the testing time is longer, and the variability of the test data is larger. The invention adopts the near infrared spectrum-partial least squares method to detect the contents of four kinds of bases of DNA, and the method comprises the following steps: step 1, the near infrared diffuse reflection spectrum of the DNA base mixture is detected; and step 2, the detected near infrared diffuse reflection spectrum is analyzed by a partial least squares method. A DNA base quantitative analysis model is established by adopting the partial least squares method, and firstly, the spectral pretreatment is carried out on the detected near infrared diffuse reflection spectrum by adopting a first derivative spectrum method; then the optimum main component is selected by adopting an interactive detection method, and the predicted square sum of residuals is taken as the evaluation criterion for verification; and then all the samples for modeling are predicted by using the quantitative analysis model, and the difference between the predicted value and the known reference value of each sample is calculated. The invention is suitable for the detection of the DNA base.

Description

Adopt the method for near infrared spectrum-partial least square method detection by quantitative DNA base
Technical field
(Partial Least Squares, PLS) the combine method of detection by quantitative DNA base belongs to the Near Infrared Spectroscopy Detection Technology field to the present invention relates to a kind of near infrared spectrum (NIR) method and partial least square method.
Background technology
Adenine (A), guanine (G), cytimidine (C) and thymine (T) are the important component parts of DNA, and more to its separation and analytical approach, that generally adopt at present is high performance liquid chromatography (HPLC).This method is according to different separation and the analyses that realize the DNA base with the partition factor between the stationary phase of the moving phase (leacheate) of sample fraction in chromatographic column.After sample entered in the chromatographic column along with moving phase, component was with regard to therein two alternate repeated multiple times (10 of carrying out 3~10 6) distribute i.e. adsorption-desorption-emit.Because stationary phase is to the adsorptive power difference of various components, i.e. preservation effect difference, therefore, the travelling speed of each component in chromatographic column is just different, and be through behind certain column length, just separated from one another, order is left chromatographic column and is entered detecting device, the ion flow signal that produces is depicted the chromatographic peak of each component, thereby is obtained each components contents after amplifying on register.High performance liquid chromatography has that speed is fast, efficient is high, the characteristics of highly sensitive, operation automation.
Near-infrared spectrum wavelength is distributed in 1100~2500nm, and near infrared spectroscopy is the method for complicated chemical composition in the working sample.Its spectral characteristic is stable, have sample pre-treatments simple and direct, need not chemical reagent, environmental protection, simple to operate, detection speed fast, good stability and can be implemented in advantage such as line analysis.In the method, the generation of near infrared spectrum mainly is because the anharmonicity of molecular vibration.In near infrared spectral range, frequency multiplication and the sum of fundamental frequencies of measuring hydrogeneous radicals X-H (X=C, N, O, S etc.) vibration absorb.Because frequency multiplication and sum of fundamental frequencies transition probability are low, and organic substance is the absorption of frequency multiplication and sum of fundamental frequencies in the near infrared spectrum district, so a little less than the extinction coefficient, bands of a spectrum are overlapping serious.Therefore, the useful information that extracts near infrared spectrum belongs to weak information and multiple information, finishes qualitative and quantitative analysis by means of chemometrics method such as partial least square method usually.
Summary of the invention
Its checkout equipment costliness of high performance liquid chromatography of the prior art, expense is higher, and operation is complicated, test consuming time longlyer, and test figure makes a variation bigger.And have not yet to see combine with the partial least square method report of detection by quantitative DNA base of near infrared spectroscopy.The method that the objective of the invention is to combine with partial least square method with near infrared spectroscopy replaces high performance liquid chromatography, easy detection by quantitative DNA base fast,, for this reason, we have invented a kind of method that adopts near infrared spectrum-partial least square method detection by quantitative DNA base.
The present invention's method is characterised in that, adopts near infrared spectrum-partial least square method to detect the content of four kinds of bases of DNA, and its step is as follows:
1, detects the near-infrared diffuse reflection spectrum of DNA base mixture
Sample is carried out near infrared scanning, and the scanning wavelength scope is 1100~2500nm;
2, analyze detected near-infrared diffuse reflection spectrum with partial least square method
(1) adopts partial least square method to set up DNA base Quantitative Analysis Model, that is: at first adopt first-derivative spectroscopy measured infrared diffuse light stalking row spectrum pre-service; Then, adopt the cross-verification method to select best number of principal components, verify as evaluation criterion with prediction residual quadratic sum (PRESS);
(2) with described Quantitative Analysis Model each sample of participating in modeling is predicted, obtained the poor of the predicted value of each sample and known reference value.
Its technique effect of the present invention is, sets up DNA base Quantitative Analysis Model, thereby does not need sample is carried out chemical treatment, and its precision of prediction can satisfy the requirement of the quantitative test that detects micro-example, and detection speed is fast, simple to operate.Measured infrared diffuse reflectance spectroscopy is overlapping serious, adopts common spectroscopic analysis methods to be difficult to carry out quantitative test, and adopts the offset minimum binary rule can realize analyzing.The selection of best number of principal components is directly connected to the actual prediction ability of DNA base Quantitative Analysis Model, number of principal components is very few, just can not fully reflect sample spectra information, number of principal components is crossed at most can also mix calculating with the information of some noises, reduce the predictive ability of model, and adopt the cross-verification method, as evaluation criterion, determined required best number of principal components with prediction residual quadratic sum (PRESS).
Description of drawings
Fig. 1 is the pre-service original spectrum of four kinds of DNA base mixtures.Fig. 2 is the pre-processed spectrum first order derivative curve map of four kinds of DNA base mixtures, and this figure double as is a Figure of abstract.Fig. 3 is the graph of a relation of adenine number of principal components and PLS Quantitative Analysis Model PRESS value.Fig. 4 is the graph of a relation of thymine number of principal components and PLS Quantitative Analysis Model PRESS value.Fig. 5 is the graph of a relation of guanine number of principal components and PLS Quantitative Analysis Model PRESS value.Fig. 6 is the graph of a relation of cytimidine number of principal components and PLS Quantitative Analysis Model PRESS value.Fig. 7 is adenine normal concentration and prediction concentrations corresponding relation calibration set curve map.Fig. 8 is adenine normal concentration and forecast concentration corresponding relation forecast set curve map.Fig. 9 is thymine normal concentration and prediction concentrations corresponding relation calibration set curve map.Figure 10 is thymine normal concentration and prediction concentrations corresponding relation forecast set curve map.Figure 11 is guanine normal concentration and prediction concentrations corresponding relation calibration set curve map.Figure 12 is guanine normal concentration and prediction concentrations corresponding relation forecast set curve map.Figure 13 is cytimidine normal concentration and prediction concentrations corresponding relation calibration set curve map.Figure 14 is cytimidine normal concentration and prediction concentrations corresponding relation forecast set curve map.
Embodiment
The preparation of four kinds of DNA base mixture samples, to be divided into two groups at random by 35 four kinds DNA base mixture samples of variable concentrations proportioning, one group is calibration set (the calibration set), comprise 30 four kinds DNA base mixture samples, another group comprises 5 four kinds DNA base mixture samples for forecast set (the prediction set).Draw the normal concentration value of 35 samples according to recipe calculation.
Adopt near infrared spectrum-partial least square method to detect the content of four kinds of DNA bases, its step is as follows:
1, detects the near-infrared diffuse reflection spectrum of DNA base mixture
In the sample cell of integrating sphere, adopt Tianjin, island UV-3100 type UV, visible light near infrared spectrometer that sample is carried out near infrared scanning, spectral bandwidth 12nm base mixture sample compressing tablet, medium sweep, the scanning wavelength scope is 1100~2500nm, and each sample scanning 3 times is averaged.The near-infrared diffuse reflection spectrum of four kinds of measured DNA base mixtures is seen Fig. 1, shown in Figure 2.
2, analyze detected near-infrared diffuse reflection spectrum with partial least square method
(1) adopts partial least square method to set up DNA base Quantitative Analysis Model, that is: at first adopt first-derivative spectroscopy that measured infrared diffuse reflectance spectroscopy is carried out the spectrum pre-service; Then, adopt the cross-verification method to select best number of principal components, verify as evaluation criterion with prediction residual quadratic sum (PRESS);
(2) with described Quantitative Analysis Model each sample of participating in modeling is predicted, obtained the poor of the predicted value of each sample and known reference value.
The mathematic(al) representation of PRESS is:
PRESS = Σ i = 1 n Σ j = 1 d ( r p , ij - r ij ) 2
In the formula: n is a sample number in the calibration set; D sets up the number of principal components that model uses; r P, ijIt is the predicted value of sample; r IjIt is the reference value of sample.
Determining of best number of principal components: when adopting partial least square method to set up DNA base Quantitative Analysis Model, the selection of number of principal components is directly connected to the actual prediction ability of DNA base Quantitative Analysis Model, number of principal components is very few, just abundant response sample spectral information; Number of principal components is crossed at most can also mix calculating with the information of some noises, reduces the predictive ability of model.When the PRESS value is more little, illustrate that the predictive ability of model is strong more, selected number of principal components is best.When the pairing four kinds of base number of principal components of selected spectrum were 4, model had minimum PRESS value; When number of principal components continued to increase, the PRESS value presented ascendant trend, sees shown in Fig. 3~6, thereby causes the model prediction ability drop, and therefore, the best number of principal components of sample is 4.
The method reliability is determined: after best number of principal components is determined, best DNA base Quantitative Analysis Model with the foundation of sample first derivative spectrum data, calibration set and forecast set sample concentration with four kinds of bases of this model prediction, the near infrared spectrum prediction concentrations of the calibration set of four kinds of bases and forecast set and the linear relationship of normal concentration are seen shown in Fig. 7~14, try to achieve during parameter that predicted value and standard value use and relative standard deviation (RSD) be listed in the table below, hence one can see that, and this method is reliable.
Figure A20081005149300052

Claims (4)

1, a kind of method that adopts near infrared spectrum-partial least square method detection by quantitative DNA base is characterized in that, adopts near infrared spectrum-partial least square method to detect the content of four kinds of bases of DNA, and its step is as follows:
The first step, the near-infrared diffuse reflection spectrum of detection DNA base mixture carries out near infrared scanning to sample, and the scanning wavelength scope is 1100~2500nm;
In second step, analyze detected near-infrared diffuse reflection spectrum with partial least square method:
(1) adopts partial least square method to set up DNA base Quantitative Analysis Model, that is: at first adopt first-derivative spectroscopy that measured infrared diffuse reflectance spectroscopy is carried out the spectrum pre-service; Then, adopt the cross-verification method to select best number of principal components, verify as evaluation criterion with prediction residual quadratic sum (PRESS);
(2) with described Quantitative Analysis Model each sample of participating in modeling is predicted, obtained the poor of the predicted value of each sample and known reference value.
2, the method for detection by quantitative DNA base according to claim 1, it is characterized in that, the process for preparation of four kinds of DNA base mixture samples is, to be divided into two groups at random by 35 four kinds DNA base mixture samples of variable concentrations proportioning, one group is calibration set, comprise 30 four kinds DNA base mixture samples, another group comprises 5 four kinds DNA base mixture samples for forecast set; Draw the normal concentration value of 35 samples according to recipe calculation.
3, the method for detection by quantitative DNA base according to claim 1 is characterized in that, PRESS meets the following formula requirement:
PRESS = Σ i = 1 n Σ j = 1 d ( r p , ij - r ij ) 2 ,
In the formula: n is a sample number in the calibration set; D sets up the number of principal components that model uses; r P, ijIt is the predicted value of sample; r IjIt is the reference value of sample.
4, the method for detection by quantitative DNA base according to claim 1 is characterized in that, best number of principal components is 4.
CNA2008100514936A 2008-11-28 2008-11-28 Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method Pending CN101408501A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008100514936A CN101408501A (en) 2008-11-28 2008-11-28 Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008100514936A CN101408501A (en) 2008-11-28 2008-11-28 Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method

Publications (1)

Publication Number Publication Date
CN101408501A true CN101408501A (en) 2009-04-15

Family

ID=40571601

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008100514936A Pending CN101408501A (en) 2008-11-28 2008-11-28 Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method

Country Status (1)

Country Link
CN (1) CN101408501A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102353647A (en) * 2011-10-11 2012-02-15 北京工业大学 Infrared spectrum rapid detection method for DNA (Deoxyribonucleic Acid) alkylating damage
CN102622534A (en) * 2012-04-11 2012-08-01 哈尔滨工程大学 Correction method of deoxyribonucleic acid high-pass sequencing data for gene expression detection
CN102798606A (en) * 2012-08-08 2012-11-28 福建中烟工业有限责任公司 Method for rapidly measuring preparation proportion of cigarette balsam to cigarette material liquid
CN103454128A (en) * 2013-08-24 2013-12-18 安徽农业大学 Preparation method of tea sample for near infrared spectrum detection
CN103760202A (en) * 2014-01-28 2014-04-30 东北师范大学 Electrochemical sensor for simultaneous detection of DNA bases
CN105352913A (en) * 2015-11-25 2016-02-24 浙江百山祖生物科技有限公司 Method for detecting polysaccharide content of ganoderma lucidum extract through near-infrared spectroscopy
CN105803070A (en) * 2016-02-05 2016-07-27 中国农业大学 Method for measuring relative content of puccinia striiformis DNA (deoxyribose nucleic acid) in wheat leaves
CN108387548A (en) * 2018-05-24 2018-08-10 东北农业大学 A method of sweetener is quickly detected based on infrared spectrum technology

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102353647A (en) * 2011-10-11 2012-02-15 北京工业大学 Infrared spectrum rapid detection method for DNA (Deoxyribonucleic Acid) alkylating damage
CN102622534A (en) * 2012-04-11 2012-08-01 哈尔滨工程大学 Correction method of deoxyribonucleic acid high-pass sequencing data for gene expression detection
CN102622534B (en) * 2012-04-11 2015-09-30 哈尔滨工程大学 A kind of DNA high pass sequencing data bearing calibration detected for gene expression
CN102798606A (en) * 2012-08-08 2012-11-28 福建中烟工业有限责任公司 Method for rapidly measuring preparation proportion of cigarette balsam to cigarette material liquid
CN102798606B (en) * 2012-08-08 2015-09-30 福建中烟工业有限责任公司 A kind of quick detection cigarette method of fragrant liquid material liquid configuration proportion
CN103454128A (en) * 2013-08-24 2013-12-18 安徽农业大学 Preparation method of tea sample for near infrared spectrum detection
CN103760202A (en) * 2014-01-28 2014-04-30 东北师范大学 Electrochemical sensor for simultaneous detection of DNA bases
CN105352913A (en) * 2015-11-25 2016-02-24 浙江百山祖生物科技有限公司 Method for detecting polysaccharide content of ganoderma lucidum extract through near-infrared spectroscopy
CN105352913B (en) * 2015-11-25 2018-06-12 浙江百山祖生物科技有限公司 A kind of method of near infrared spectrum detection Ganodenna Lucidum P.E polyoses content
CN105803070A (en) * 2016-02-05 2016-07-27 中国农业大学 Method for measuring relative content of puccinia striiformis DNA (deoxyribose nucleic acid) in wheat leaves
CN105803070B (en) * 2016-02-05 2019-05-03 中国农业大学 Stripe Rust DNA relative amount measurement method in a kind of wheat leaf blade
CN108387548A (en) * 2018-05-24 2018-08-10 东北农业大学 A method of sweetener is quickly detected based on infrared spectrum technology

Similar Documents

Publication Publication Date Title
CN101408501A (en) Method for quantitatively detecting DNA base by using near-infrared spectrum-partial least squares method
Esslinger et al. Potential and limitations of non-targeted fingerprinting for authentication of food in official control
EP1861691B1 (en) Method to reduce background noise in a spectrum
US7251037B2 (en) Method to reduce background noise in a spectrum
Milman Chemical identification and its quality assurance
CN102484030B (en) Functional check in mass spectral analysis and deviation compensation
CN106018600B (en) A kind of metabolism group method for distinguishing false positive mass spectrum peak-to-peak signal and quantitative correction mass spectrum peak area
Chapman et al. Spectroscopic approaches for rapid beer and wine analysis
CN103383352A (en) Near infrared transmitted spectrum detection method of naringin and/or neohesperidin
CN1982874A (en) Near-infrared diffuse reflection spectral method for fastly inspecting drop effective ingredient content
CN101349638A (en) Spectrum rapid nondestructive testing method for vitamin C content of fruits and vegetables
Wulandari et al. Determination of total flavonoid content in medicinal plant leaves powder using infrared spectroscopy and chemometrics
CN100443883C (en) Method for detecting hydrgenated tail-oil paraffin composition using near-infrared spectrum
da Costa Fulgêncio et al. Combining portable NIR spectroscopy and multivariate calibration for the determination of ethanol in fermented alcoholic beverages by a multi-product model
CN102042967A (en) Glucose aqueous solution quick identification method based on near infrared spectrum technology
Liu et al. Rapid identification of artificial fragrant rice based on volatile organic compounds: From PTR-MS to FTIR
Li et al. A novel method to realize multicomponent infrared spectroscopy gas logging based on PSO-split peak fitting-SVM
CN107727602B (en) Method for quantitatively analyzing content of sucralose by combining mid-infrared spectrum with vector included angle
WO2009091961A1 (en) Apparatus system and method for mass analysis of a sample
CN201072405Y (en) Spectrum rapid nondestructive testing device for vitamin C content of fruits and vegetables
CN101140225B (en) Method for detecting lead in scenting agent with AOTF near-infrared spectrometer
Wang et al. Research Article Quantitative Analysis of Multiple Components in Wine Fermentation using Raman Spectroscopy
Kotsanopoulos et al. Methods and techniques for verifying authenticity and detecting adulteration
CN100451615C (en) Method for detecting hydrogenated tail-oil cyclanes and arene composition using near infrared spectrum
CN104792711A (en) Synthetic pigment detection method and synthetic pigment detection system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090415