CN112666110A - Spectral identification model and method for milk and goat milk - Google Patents

Spectral identification model and method for milk and goat milk Download PDF

Info

Publication number
CN112666110A
CN112666110A CN202011403046.XA CN202011403046A CN112666110A CN 112666110 A CN112666110 A CN 112666110A CN 202011403046 A CN202011403046 A CN 202011403046A CN 112666110 A CN112666110 A CN 112666110A
Authority
CN
China
Prior art keywords
milk
model
goat
cow
training set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011403046.XA
Other languages
Chinese (zh)
Inventor
张淑君
刘实
南良康
王海童
杜雯娟
胡修忠
万平民
程蕾
刘晓华
易建明
苏俊东
艾哈迈德.萨贝克
张依
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong Agricultural University
Original Assignee
Huazhong Agricultural University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong Agricultural University filed Critical Huazhong Agricultural University
Publication of CN112666110A publication Critical patent/CN112666110A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/3577Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light for analysing liquids, e.g. polluted water
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/02Food
    • G01N33/04Dairy products
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2201/00Features of devices classified in G01N21/00
    • G01N2201/12Circuits of general importance; Signal processing
    • G01N2201/129Using chemometrical methods
    • G01N2201/1293Using chemometrical methods resolving multicomponent spectra

Landscapes

  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Immunology (AREA)
  • Food Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Pathology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Medicinal Chemistry (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention belongs to the field of milk product analysis, and particularly relates to a spectral identification model and a spectral identification method for milk and goat milk. Related to mid-infrared spectroscopy. The method comprises the following steps: 1) taking fresh goat milk and cow milk samples; 2) scanning the sample in the middle red spectrum range to obtain middle infrared spectrum data; 3) preprocessing the original mid-infrared spectrum to remove abnormal values; 4) dividing the preprocessed data set into a training set and a test set according to a layered sampling principle; 5) carrying out dimension reduction processing on the training set to improve the model training speed; 6) on a training set, a model for distinguishing mare milk and milk is established through 10-fold cross validation by using algorithms such as nearest neighbor and the like, and the model is evaluated and screened by using accuracy, specificity, sensitivity and AUC; 7) and predicting the generalization performance of the optimal model.

Description

Spectral identification model and method for milk and goat milk
Technical Field
The invention belongs to the technical field of milk product analysis, and particularly relates to a spectrum identification model and a method for identifying milk and goat milk. The invention relates to the field of analysis for identifying milk product components by infrared spectroscopy.
Background
The goat milk contains milk fat and milk protein which are higher than those of cow milk, and contains a plurality of essential amino acids; the content of medium-chain fatty acid is twice of that of cow milk, and is more easily digested and absorbed by the attack of human body lipase, and the volume of its fat globules is only one third of that of cow milk, so that the goat milk can be digested more quickly than cow milk[2](ii) a The content of Epidermal Growth Factor (EGF) is removedThe content of breast milk and the like and a few of milks is the highest, which is 8 times as much as that of milk, and the milk has an auxiliary treatment effect on allergic skin. Protein in goat milk meeting compositional standards is suitable as a protein source for infant and follow-on formulas[5]
At present, methods for measuring the components of goat and milk include High Performance Liquid Chromatography (HPLC), Gas Chromatography (GC), Coomassie brilliant blue-ultraviolet detection, and Near Infrared (NIR) spectroscopy[3]And mid-infrared (MIR) spectroscopy, and the like. Martin et al developed a Polymerase Chain Reaction (PCR) method, designed specific primers for 12S RNA mitochondrial gene, to achieve the discrimination of goat milk and cow milk with a sensitivity of 0.1%[4](ii) a The method uses a guide ring isothermal amplification technology to detect the milk in goat milk, such as littin, and the like, and the sensitivity of the method can reach 1 percent, and the concentration is about 0.1ng/mL[1]. However, there is no method for rapidly and accurately identifying goat milk and cow milk in batches.
Mid-infrared spectral analysis is a modern technology which is rapidly developed in recent years, is free from damage and pollution and can simultaneously analyze multiple components, is widely applied to quality detection of agricultural products and food, but related research and literature reports of mid-infrared spectral identification of goat milk and cow milk do not exist.
Disclosure of Invention
The invention aims to solve the technical problem of providing a fast batch identification method of goat milk and cow milk based on Fourier transform mid-infrared spectrum, which has the characteristics of high identification speed, high precision, low cost, simple operation, batch detection and strong practicability.
The technical scheme of the invention is as follows:
the spectral discrimination model and method of milk and goat milk comprises the following steps:
(1) selecting a milk sample: samples of fresh goat milk and fresh cow milk were collected separately.
(2) Collecting mid-infrared spectrum by pouring goat milk and cow milk sample into cylindrical sampling tubes with diameter of 3.5cm and height of 9cm respectively to ensure liquid level height of more than 6cm, and placing in 42 deg.C water bathWater bath for 15-20min, extending the solid fiber probe into the liquid, sucking sample, and detecting by using a mid-infrared spectrometer at 4000--1Scanning goat milk and cow milk samples in a wave number range, and outputting light transmittance corresponding to the samples through a computer connected with the goat milk and cow milk samples to obtain a sample spectrogram;
(3) preprocessing the collected original mid-infrared spectrum data, comprising the following steps:
converting the spectral data from transmittance (T) to absorbance (a):
A=log10(1/T);
removing the water absorption area;
detecting abnormal spectrum: removing abnormal values by using an LOF (loss of tolerance) abnormality detection algorithm;
(4) dividing the data set into a training set and a testing set;
(5) and (3) reducing the dimensionality of the main components of the training set: selecting the main component number when the accumulative variance interpretation rate is more than 99.9%;
(6) establishing and screening a model: the method comprises the following steps of taking the mid-infrared spectrum of a milk sample of a training set as an input value, taking the categories of goat milk and cow milk as output values, using a nearest neighbor (KNN) algorithm, a BP neural network algorithm, a Random Forest (RF) algorithm and a Support Vector Machine (SVM) algorithm, constructing a model on the training set through 10-fold cross validation, and evaluating and screening the model according to the principle that indexes such as accuracy, specificity, sensitivity and AUC are high;
(7) estimating the generalization performance of the optimal model: predicting samples in the test set by using the model, and evaluating the performance of the model on the test set by using the corresponding evaluation index; the behavior of the model on the test set is refined using a confusion matrix.
(8) The preprocessing of the mid-infrared spectrum data, the model construction and verification and the output of the confusion matrix are realized by Python 3.8.3.
Drawings
FIG. 1: and (5) detecting the intermediate infrared spectrogram of the processed goat milk.
FIG. 2: and (5) detecting the intermediate infrared spectrogram of the treated milk of the cow after the abnormality detection.
FIG. 3: and (4) comparing infrared spectra of the goat milk and the cow milk after anomaly detection treatment.
FIG. 4: and (4) carrying out TSNE visualization on the preprocessed data.
FIG. 5: ROC curve of model on test set.
FIG. 6: a confusion matrix map of the model on the test set.
Detailed Description
The invention is further illustrated by the following figures and examples.
Examples the experimental procedures, in which specific conditions are not specified, were carried out according to conventional methods and conditions, or according to conditions recommended by the manufacturers.
Example 1
Instruments and equipment:
MilkoScan manufactured by FOSS CorpTM7RM milk ingredient detector (according to the product instruction).
The operation steps are as follows:
1) selecting a milk sample:
73 samples of fresh goat milk and 80 samples of fresh cow's milk were collected, respectively.
2) And (3) collecting the mid-infrared spectrum:
respectively pouring goat milk and cow milk samples into cylindrical sampling tubes with diameter of 3.5cm and height of 9cm to ensure liquid level height greater than 6cm, placing in water bath at 42 deg.C for 15-20min, extending solid fiber probe into liquid, and detecting by absorbing sample with MilkoScan from FOSS companyTMThe 7RM milk component detector is at 4000-400cm-1And scanning goat milk and cow milk samples in the wave number range, and outputting light transmittance corresponding to the samples through a computer connected with the goat milk and cow milk samples to obtain a sample spectrogram.
3) Data preprocessing:
converting the spectral data from transmittance (T) to absorbance (A), and calculating the formula as follows:
A=log10(1/T)
removing the water absorption area;
detecting abnormal spectrum: the 4 outliers were removed using the LOF anomaly detection algorithm.
4) Dividing the data set:
and a data set is divided into a training set and a testing set by adopting a hierarchical sampling method, wherein the mid-infrared spectrum data of 119 samples in the training set is used for establishing a qualitative judgment model, and the mid-infrared spectrum data of 30 samples in the testing set is used for evaluating the prediction effect of the qualitative judgment model.
5) And (3) reducing the dimensionality of the main components of the training set:
and selecting the number of the principal components with the accumulated variance interpretation rate of more than 99.9 percent to obtain 16 principal components.
6) Establishing and screening a model:
constructing a model on a training set through 10-fold cross validation by using a nearest neighbor (KNN) algorithm, a BP neural network algorithm, a Random Forest (RF) algorithm and a Support Vector Machine (SVM) algorithm and taking the mid-infrared spectrum of a milk sample of the training set as an input value and the categories of goat milk and cow milk as output values; the accuracy, specificity, sensitivity and AUC of the model constructed by the 4 algorithms on the training set are all 1, which shows that the 4 algorithms have good modeling effect on the identification of goat milk and cow milk, and the 4 models can accurately identify the goat milk and the cow milk. Generally, SVM algorithms perform well in a variety of classification problems. The SVM algorithm is based on nonlinear mapping and is a novel small sample learning method; in SVM classification decision, a few support vectors play a decisive role, which avoids the possibility of occurrence of 'dimension disaster' to a certain extent and enables the trained model to have better stability. Therefore, a model established by using an SVM algorithm is selected for the identification of goat milk and cow milk.
7) And (3) model verification:
predicting and verifying 30 concentrated samples by using a qualitative model established by an SVM algorithm, wherein the result is expressed by indexes such as accuracy, specificity, sensitivity, AUC and the like; the predicted value of the obtained mid-infrared spectrum is consistent with the actual value. The accuracy, specificity, sensitivity and AUC of the model in the test set are all 1, which shows that the classification of the goat milk and the milk by the SVM algorithm has high learning ability, and the established model can accurately distinguish the goat milk and the milk. The confusion matrix is used for representing the performance of the model in the test set by using FIG. 6, the classification result is compared with the actual value, and the situation of no error classification can be seen in the graph, which shows that the model can accurately classify the goat milk and the cow milk with high precision.
TABLE 1 Performance of the model on the training and test sets
Figure BDA0002813073520000041
Primary references
[1] Detection of milk component [ J ] in goat milk by loop-mediated isothermal amplification technology, such as Liting, et al, biotechnological communication, 2018,29(06):836-839
[2] Schuiping et al, the way of improving the nutritional components and the nutritional value of goat milk [ J ]. Heilongjiang animal veterinarians, 2017(09): 118-;
[3] yangtui, research on near infrared spectra of milk from different dairy animals and composition characteristics of fatty acid and milk protein [ D ]. Chinese academy of agricultural sciences, 2013.
[4]Irene Martin,Teresa Garcia,Violeta Fajardo,et,al.Species-specific PCR for the identification of ruminant species in feedstuffs[J].meat science,2007,75(1):120-127.
[5]Turck D.Cow's milk and goat's milk.[J].World Rev Nutr Diet,2013,108:56-62。

Claims (6)

1. The spectral identification model and method of milk and goat milk are characterized by comprising the following steps:
1) selecting milk samples
Respectively collecting fresh goat milk and fresh milk;
2) mid infrared spectral collection
Adopting a Fourier transform mid-infrared spectrometer at 4000--1Scanning goat milk and cow milk in a wave number range, and outputting light transmittance corresponding to a sample through a connected computer to obtain a sample spectrogram;
3) data pre-processing
Converting the light transmittance into absorbance, removing the water absorption area, and removing an abnormal value;
4) partitioning a data set
Dividing a data set into a training set and a testing set;
5) principal component dimensionality reduction of training set
Carrying out PCA dimension reduction processing on the training set to improve the training speed of the model, wherein the number of the principal components is determined by that the variance cumulative interpretation rate is more than 99.9%;
6) model building and screening
The method comprises the following steps of taking the mid-infrared spectrum of a milk sample of a training set as an input value, taking the category of goat milk and cow milk as an output value, comparing and using a nearest neighbor (KNN) algorithm, a BP neural network algorithm, a Random Forest (RF) algorithm and a Support Vector Machine (SVM) algorithm, building a model on the training set through 10-fold cross validation, and evaluating and screening the model by using indexes such as accuracy, specificity, sensitivity and AUC;
7) prediction of optimal model generalization performance
And predicting samples in the test set by using the model, evaluating the performance of the model on the test set by using the corresponding evaluation index, and refining the performance of the model on the test set by using a confusion matrix.
2. The spectral discrimination model of cow's milk and goat's milk according to claim 1, wherein step 2) scans the sample with a milk composition detector.
3. The model for spectral discrimination of cow's milk and goat's milk according to claim 1, wherein when the spectra are collected in step 2), goat's milk and cow's milk samples are respectively poured into cylindrical sampling tubes with a diameter of 3.5cm and a height of 9cm to ensure that the liquid level is higher than 6cm, and then the goat's milk and cow's milk samples are put into a water bath at 42 ℃ for 15-20min, and then a solid optical fiber probe is extended into the liquid for sample suction detection.
4. The spectral discrimination model of cow's milk and goat's milk according to claim 1, wherein the data preprocessing method of step 3) is:
1) converting the spectral data from transmittance (T) to absorbance (A) by the formula:
A=log10(1/T)
2) removing the water absorption region;
3) abnormal spectrum detection: outliers are removed using the LOF anomaly detection algorithm.
5. The spectral discrimination model of cow's milk and goat's milk according to claim 1, wherein the preprocessed data set in step 4) is divided into training set and testing set according to the hierarchical sampling principle, and the training set and the testing set respectively account for 80% and 20% of the data set.
6. The model for spectral discrimination of cow's milk and goat's milk according to claim 1, wherein the model is constructed on a training set by 10-fold cross-validation to evaluate and screen the model with accuracy, specificity, sensitivity and AUC indexes.
CN202011403046.XA 2020-10-01 2020-12-02 Spectral identification model and method for milk and goat milk Pending CN112666110A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2020110655679 2020-10-01
CN202011065567 2020-10-01

Publications (1)

Publication Number Publication Date
CN112666110A true CN112666110A (en) 2021-04-16

Family

ID=75400867

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011403046.XA Pending CN112666110A (en) 2020-10-01 2020-12-02 Spectral identification model and method for milk and goat milk

Country Status (2)

Country Link
CN (1) CN112666110A (en)
NL (1) NL2029012B1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL2029012A (en) * 2020-10-01 2022-06-01 Univ Huazhong Agricultural Method for quickly identification of cow milk and goat milk

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030088881A1 (en) * 1998-11-19 2003-05-08 Cottingham Ian Robert Stabilisation of milk from transgenic animals
EP1374693A2 (en) * 2002-06-06 2004-01-02 Ajinomoto Co., Inc. Method for preventing decrease of breast milk amount in mammals
WO2009020864A1 (en) * 2007-08-08 2009-02-12 Jon Baker Controlled environment animal husbandry facility
CN101929951A (en) * 2009-06-19 2010-12-29 西北农林科技大学 Method for distinguishing milk doped with ewe's milk by near infrared spectrum
CN103543123A (en) * 2013-10-08 2014-01-29 江南大学 Infrared spectrum recognition method for adulterated milk
CN106163268A (en) * 2014-01-02 2016-11-23 全技术公司 For estimating feed efficiency and the system and method for carbon footprint of milcher
CN107024450A (en) * 2017-03-27 2017-08-08 云南小宝科技有限公司 A kind of method for differentiating different brands and hop count milk powder based on near-infrared spectrum technique

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112525850A (en) * 2020-10-01 2021-03-19 华中农业大学 Spectral fingerprint identification method for milk, mare, camel, goat and buffalo milk
CN112666110A (en) * 2020-10-01 2021-04-16 华中农业大学 Spectral identification model and method for milk and goat milk

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030088881A1 (en) * 1998-11-19 2003-05-08 Cottingham Ian Robert Stabilisation of milk from transgenic animals
EP1374693A2 (en) * 2002-06-06 2004-01-02 Ajinomoto Co., Inc. Method for preventing decrease of breast milk amount in mammals
WO2009020864A1 (en) * 2007-08-08 2009-02-12 Jon Baker Controlled environment animal husbandry facility
CN101929951A (en) * 2009-06-19 2010-12-29 西北农林科技大学 Method for distinguishing milk doped with ewe's milk by near infrared spectrum
CN103543123A (en) * 2013-10-08 2014-01-29 江南大学 Infrared spectrum recognition method for adulterated milk
CN106163268A (en) * 2014-01-02 2016-11-23 全技术公司 For estimating feed efficiency and the system and method for carbon footprint of milcher
CN107024450A (en) * 2017-03-27 2017-08-08 云南小宝科技有限公司 A kind of method for differentiating different brands and hop count milk powder based on near-infrared spectrum technique

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
N. NICOLAOU ET AL.: "Fourier transform infrared spectroscopy and multivariate analysis for the detection and quantification of different milk species", 《J. DAIRY SCI.》 *
OLGUN CIRAK ET AL.,: "Rapid detection of adulteration of milks from different species using Fourier Transform Infrared Spectroscopy (FTIR)", 《JOURNAL OF DAIRY RESEARCH》 *
牟少敏 等: "《模式识别与机器学习技术》", 30 June 2019, 冶金工业出版社 *
黄亚东: "蒙古马奶常规营养成分检测与NIR快速检测模型建立", 《中国优秀博硕士学位论文全文数据库(硕士)工程科技Ⅰ辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL2029012A (en) * 2020-10-01 2022-06-01 Univ Huazhong Agricultural Method for quickly identification of cow milk and goat milk

Also Published As

Publication number Publication date
NL2029012B1 (en) 2022-09-08
NL2029012A (en) 2022-06-01

Similar Documents

Publication Publication Date Title
CN112525850A (en) Spectral fingerprint identification method for milk, mare, camel, goat and buffalo milk
CN112666111A (en) Method for quickly identifying milk and mare milk
Wu et al. Practicability investigation of using near-infrared hyperspectral imaging to detect rice kernels infected with rice false smut in different conditions
CN112666112A (en) Batch discrimination model and method for camel milk and mare milk
US20220390374A1 (en) Method for extracting raman characteristic peaks employing improved principal component analysis
CN104215591A (en) Damage-free visible-near infrared light spectrum detecting method
Zhihui et al. Nondestructive detection of infertile hatching eggs based on spectral and imaging information
CN112666114A (en) Method for identifying buffalo milk and mare milk by using spectrum
Yin et al. Non-destructive detection of foreign contaminants in toast bread with near infrared spectroscopy and computer vision techniques
CN113310930A (en) Spectral identification method of high-temperature sterilized milk, pasteurized milk and pasteurized milk mixed with high-temperature sterilized milk
Riefolo et al. Assessment of the hyperspectral data analysis as a tool to diagnose Xylella fastidiosa in the asymptomatic leaves of olive plants
CN112666110A (en) Spectral identification model and method for milk and goat milk
Najjar et al. Visible/near-infrared (VIS/NIR) spectroscopy technique to detect gray mold disease in the early stages of tomato fruit: VIS/NIR spectroscopy for detecting gray mold in tomato
CN113324940A (en) Spectrum grading method for super-high-quality milk, high-protein special milk, high-milk-fat special milk and common milk
CN106338488A (en) Method for fast undamaged determination of transgenic soybean milk powder
CN113310934A (en) Method for quickly identifying milk cow milk mixed in camel milk and mixing proportion thereof
CN112213281A (en) Comprehensive evaluation method for rapidly determining freshness of freshwater fish based on transmission near infrared spectrum
US20230089466A1 (en) Establishment of Identification and Screening Method of Cows with A2 Beta-Casein Genotype of Producing A2 Milk and Applications Thereof
Steiner et al. Label free molecular sexing of monomorphic birds using infrared spectroscopic imaging
CN110609011A (en) Near-infrared hyperspectral detection method and system for starch content of single-kernel corn seeds
Shinzawa et al. New method for spectral data classification: Two-way moving window principal component analysis
CN112666113A (en) Method for quickly identifying goat milk and mare milk
Wang et al. Detection of Insect-Damaged Maize Seed Using Hyperspectral Imaging and Hybrid 1D-CNN-BiLSTM Model
Yang et al. Identification of Peanut Kernels Infected with Multiple Aspergillus flavus Fungi Using Line-Scan Raman Hyperspectral Imaging
Ravindran et al. A review on non-destructive techniques for evaluating quality of fruits

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210416