CN112834451A - Sample identification method and device based on infrared spectrum and storage medium - Google Patents

Sample identification method and device based on infrared spectrum and storage medium Download PDF

Info

Publication number
CN112834451A
CN112834451A CN202110036604.1A CN202110036604A CN112834451A CN 112834451 A CN112834451 A CN 112834451A CN 202110036604 A CN202110036604 A CN 202110036604A CN 112834451 A CN112834451 A CN 112834451A
Authority
CN
China
Prior art keywords
spectrum
standard
test
data
infrared
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110036604.1A
Other languages
Chinese (zh)
Other versions
CN112834451B (en
Inventor
李�灿
夏国强
李浩文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Micro Optical Instruments Technology Co ltd
Original Assignee
Shenzhen Micro Optical Instruments Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Micro Optical Instruments Technology Co ltd filed Critical Shenzhen Micro Optical Instruments Technology Co ltd
Priority to CN202110036604.1A priority Critical patent/CN112834451B/en
Publication of CN112834451A publication Critical patent/CN112834451A/en
Application granted granted Critical
Publication of CN112834451B publication Critical patent/CN112834451B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Artificial Intelligence (AREA)
  • Analytical Chemistry (AREA)
  • Pathology (AREA)
  • Immunology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention provides a sample identification method, equipment and a storage medium based on infrared spectroscopy, wherein the sample identification method comprises the steps of establishing a standard spectrum library of substances on the basis of infrared spectroscopy of a series of standard substances, wherein each piece of spectrum data in the standard spectrum library is called as a standard spectrum; constructing a series of test samples, carrying out infrared spectrum scanning on the test samples, and extracting each piece of spectrum data of the test samples as a test spectrum; calculating a weighted correlation coefficient of the test spectrum and the standard spectrum, setting a similar interval according to the weighted correlation coefficient, and establishing an infrared spectrum identification model; performing infrared spectrum scanning on a sample to be detected to obtain spectral data of the sample to be detected; and inputting the spectral data into an infrared spectrum recognition model to recognize the sample to be detected. The sample identification method disclosed by the invention adopts an infrared spectrum technology, and compared with instruments such as a mass spectrum and the like, the sample identification method does not use a chemical reagent, is green and environment-friendly, applies a weighted correlation coefficient in an identification model, and has high identification accuracy and high detection efficiency.

Description

Sample identification method and device based on infrared spectrum and storage medium
Technical Field
The invention relates to the technical field of infrared spectrum identification, in particular to a sample identification method, sample identification equipment and a storage medium based on infrared spectrum.
Background
The infrared spectrum is mainly divided into absorption spectrum, emission spectrum and scattering spectrum, and is referred to herein as infrared absorption spectrum. The infrared absorption spectrum is derived from a vibrational spectrum caused by vibrational rotation and energy level transition of molecules. The infrared spectrum is generally divided into 3 regions: far infrared region (about 400-10 cm)-1) Middle infrared region (about 4000-400 cm)-1) Near infrared region (14000-4000 cm)-1)。
Although the prior art can carry out matching identification on substances according to the characteristic peak positions of infrared spectra of different substances, the method needs to search peaks by a machine and involves extraction of peak shape characteristics and the like, the process is complex, the subjectivity is strong, and the method is not suitable for online detection, so that the development of a sample identification method suitable for the infrared spectra is urgently needed.
Accordingly, the prior art is yet to be improved and developed.
Disclosure of Invention
The invention mainly aims to solve the technical problems of complex identification process and low efficiency of the existing method for identifying substances by utilizing infrared spectrum.
The invention provides a sample identification method based on infrared spectrum, which comprises the following steps:
establishing a standard spectrum library of substances based on infrared spectra of a series of standard substances, wherein each piece of spectral data in the standard spectrum library is called as a standard spectrum;
constructing a series of test samples, carrying out infrared spectrum scanning on the test samples, and extracting each piece of spectrum data of the test samples to be used as a test spectrum;
calculating the weighted correlation coefficient of the test spectrum and the standard spectrum, setting a similar interval according to the weighted correlation coefficient, and establishing an infrared spectrum identification model;
performing infrared spectrum scanning on a sample to be detected to obtain spectral data of the sample to be detected;
and inputting the spectral data into the infrared spectrum recognition model to recognize the sample to be detected.
Optionally, in the first aspect of the inventionIn a first embodiment of the aspect, in the standard spectrum library, an average value of m data points corresponding to p pieces of spectral data of the same substance measured in different batches is used as the standard spectrum s of the substanceoriginal
Figure BDA0002894589020000021
Optionally, in a second implementation manner of the first aspect of the present invention, before the calculating the weighted correlation coefficient between the test spectrum and the standard spectrum, the calculating includes:
filtering the test spectrum and the standard spectrum, and calculating to obtain mathematical expectation and variance of the standard spectrum data after filtering and mathematical expectation and variance of the test spectrum data after filtering;
and standardizing the standard spectrum data after the filtering processing and the test spectrum data after the filtering processing.
Optionally, in a third embodiment of the first aspect of the present invention, the processed standard spectral data is normalized
Figure BDA0002894589020000031
Wherein s issecondFor the filtered standard spectral data, EsFor mathematical expectation of filtered standard spectral data, σs 2E is the variance of the standard spectral data after filtering processing and is the base number of a natural logarithm function;
standardized test spectral data
Figure BDA0002894589020000032
Wherein, tsecondFor the filtered test spectral data, EtFor mathematical expectation of filtered standard spectral data, σt 2Is the variance of the filtered standard spectral data.
Optionally, in a fourth embodiment of the first aspect of the present invention, the calculating the weighted correlation coefficient between the test spectrum and the standard spectrum uses the following formula:
weighted correlation coefficient
Figure BDA0002894589020000033
Wherein s isiIs the ith data point, t, in the normalized standard spectrum dataiTo normalize the ith data point in the processed test spectrum data,
Figure BDA0002894589020000034
ωiis the weight of the ith data point,
Figure BDA0002894589020000035
and k is a custom parameter.
Optionally, in a fifth embodiment of the first aspect of the present invention, k is 0.01 or 0.5.
Optionally, in a sixth embodiment of the first aspect of the present invention, the test sample comprises a sample of the same substance in a standard library of spectra and a sample of a substance in a non-standard library of spectra.
Optionally, in a seventh embodiment of the first aspect of the present invention, the sample to be tested includes drugs and explosives.
The present invention provides in a second aspect an infrared spectroscopy-based sample identification apparatus, the sample identification apparatus comprising: a memory having instructions stored therein and at least one processor, the memory and the at least one processor interconnected by a line;
the at least one processor invokes the instructions in the memory to cause the sample identification device to perform the infrared spectroscopy-based sample identification method as in any one of the above.
A third aspect of the invention provides a computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements a method for infrared spectroscopy-based sample identification as defined in any one of the above.
Has the advantages that: the invention provides a sample identification method, equipment and a storage medium based on infrared spectroscopy, wherein the sample identification method comprises the steps of establishing a standard spectrum library of substances on the basis of infrared spectroscopy of a series of standard substances, wherein each piece of spectrum data in the standard spectrum library is called as a standard spectrum; constructing a series of test samples, carrying out infrared spectrum scanning on the test samples, and extracting each piece of spectrum data of the test samples as a test spectrum; calculating a weighted correlation coefficient of the test spectrum and the standard spectrum, setting a similar interval according to the weighted correlation coefficient, and establishing an infrared spectrum identification model; performing infrared spectrum scanning on a sample to be detected to obtain spectral data of the sample to be detected; and inputting the spectral data into an infrared spectrum recognition model to recognize the sample to be detected. The sample identification method disclosed by the invention adopts an infrared spectrum technology, and compared with instruments such as a mass spectrum and the like, the sample identification method does not use a chemical reagent, is green and environment-friendly, applies a weighted correlation coefficient in an identification model, and has high identification accuracy and high detection efficiency.
Drawings
FIG. 1 is a flow chart of one embodiment of a method for infrared spectroscopy-based sample identification according to the present invention;
FIG. 2 is an infrared scanning raw spectrum of a substance in a standard spectrum library;
FIG. 3 is a material distribution diagram of a standard spectrum to which the highest correlation coefficient between a test spectrum and the standard spectrum belongs;
fig. 4 is a schematic diagram of an embodiment of an infrared spectrum-based sample identification apparatus according to the present invention.
Detailed Description
The embodiment of the invention provides a sample identification method and device based on infrared spectroscopy and a storage medium.
The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," or "having," and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For ease of understanding, a detailed flow chart of an embodiment of the present invention is described below, with reference to fig. 1, in which a first aspect of the present invention is a method for sample identification based on infrared spectroscopy, the method for sample identification based on infrared spectroscopy comprising:
s100, establishing a standard spectrum library of substances on the basis of infrared spectrums of a series of standard substances, wherein each piece of spectrum data in the standard spectrum library is called as a standard spectrum;
s200, constructing a series of test samples, carrying out infrared spectrum scanning on the test samples, and extracting each piece of spectrum data of the test samples to be used as a test spectrum;
s300, calculating a weighted correlation coefficient of the test spectrum and the standard spectrum, setting a similar interval according to the weighted correlation coefficient, and establishing an infrared spectrum identification model;
s400, performing infrared spectrum scanning on a sample to be detected to obtain spectral data of the sample to be detected;
s500, inputting the spectral data into the infrared spectrum recognition model to recognize the sample to be detected.
The sample identification method based on the infrared spectrum is characterized in that an infrared spectrum identification model is firstly established, the infrared spectrum of a sample is identified by the infrared spectrum identification model, the model is based on the infrared spectrum of a series of standard substances, a standard spectrum library of the substances is established, each piece of spectrum data in the standard spectrum library is called as a standard spectrum, and then an identification matching model is established through a weighted correlation coefficient. According to the invention, the similarity of the infrared spectrum is measured through the weighted correlation coefficient, the weighted correlation coefficient can effectively use the spectral characteristics for similarity calculation, so that the reliability of the result is improved, the test spectrum of the test sample is highly similar to but different from the standard spectrum of the substances in the standard spectrum library, the common characteristics are reflected, and the individual difference is reflected. Therefore, a similar interval of the spectrum similarity is determined, products in the interval are similar substances, and otherwise, the products are non-similar substances or abnormal samples.
In an alternative embodiment of the first aspect of the present invention, in the standard spectrum library, the average value of m data points corresponding to p pieces of spectral data of the same substance measured in different batches is used as the standard spectrum s of the substanceoriginal
Figure BDA0002894589020000061
In an alternative embodiment of the first aspect of the present invention, the calculating the weighted correlation coefficients of the test spectrum and the standard spectrum comprises:
filtering the test spectrum and the standard spectrum, and calculating to obtain mathematical expectation and variance of the standard spectrum data after filtering and mathematical expectation and variance of the test spectrum data after filtering;
and standardizing the standard spectrum data after the filtering processing and the test spectrum data after the filtering processing.
In this embodiment, both the test spectrum and the standard spectrum need to be preprocessed, where the preprocessing includes filtering the spectrum data, and further, normalization processing is needed to obtain the finally needed standard spectrum data and test spectrum data.
In an alternative embodiment of the first aspect of the present invention, the processed standard spectral data is normalized
Figure BDA0002894589020000071
Wherein s issecondFor the filtered standard spectral data, EsFor mathematical expectation of filtered standard spectral data, σs 2After filtering processingE is the base of the natural logarithm function;
standardized test spectral data
Figure BDA0002894589020000072
Wherein, tsecondFor the filtered test spectral data, EtFor mathematical expectation of filtered standard spectral data, σt 2Is the variance of the filtered standard spectral data.
In an alternative embodiment of the first aspect of the present invention, the calculating the weighted correlation coefficient between the test spectrum and the standard spectrum uses the following formula:
weighted correlation coefficient
Figure BDA0002894589020000073
Wherein s isiIs the ith data point, t, in the normalized standard spectrum dataiTo normalize the ith data point in the processed test spectrum data,
Figure BDA0002894589020000074
ωiis the weight of the ith data point,
Figure BDA0002894589020000075
and k is a custom parameter.
In an alternative embodiment of the first aspect of the present invention, k is 0.01 or 0.5. In the embodiment, the weight formula includes a parameter k that needs to be specified by the user, which is the only parameter to be considered by the algorithm model of the weighted correlation coefficient method of the present invention, when k is 0.5, most data is used for the algorithm model, and when k is 0.01, only few local points participate in the algorithm model.
In an alternative embodiment of the first aspect of the present invention, the test sample comprises a sample of the same substance in a standard library of spectra and a sample of a substance in a non-standard library of spectra. In this embodiment, the test sample is used for determining the weighted correlation coefficient, the weighted correlation coefficient of the test spectrum of the sample of the same substance in the standard spectrum library and the standard spectrum of the substance is the highest, and the highest weighted correlation coefficient of other substances in the standard spectrum library and the substance in the standard spectrum library is lower, so that a threshold value of the weighted correlation coefficient can be determined, wherein the threshold value is higher than the threshold value and represents the substance belonging to the standard spectrum library, and the threshold value is lower than the threshold value and belongs to the substance out of the standard spectrum library.
In an alternative embodiment of the first aspect of the present invention, the sample to be tested includes drugs and explosives. More specifically, the sample to be tested comprises cocaine, heroin, methamphetamine, morphine, tai' an, cannabis and nitroglycerin.
The first aspect of the invention is a specific example of a sample identification method based on infrared spectroscopy, as follows:
step 1, infrared spectrum signal acquisition
The infrared spectrum signal comprises an infrared spectrum in a certain waveband range, specifically an infrared absorbance curve or other spectral parameters derived from formula deformation, and is acquired by a Fourier infrared spectrometer produced by BRUKER company (Germany).
Step 2, establishing a standard spectrum library
Taking the average value of m data points corresponding to p spectral data of the same substance measured in different batches as the standard spectrum s of the substanceoriginal
Figure BDA0002894589020000091
Step 3, spectrum pretreatment
The spectrum comprises a standard spectrum soriginalAnd a test spectrum toriginal(ii) a The preprocessing comprises the steps of filtering the spectral data, further performing standardization processing, and finally obtaining standard spectral data s and test spectral data t.
The spectrum pretreatment comprises the following steps:
3.1 spectral filtering treatment, wherein S-G smoothing filtering is mainly used;
3.2 finding the spectral data s obtained after filteringsecondOr tsecondMathematical expectation of (E)sOr EtSum variance σs 2Or σt 2
Figure BDA0002894589020000092
Figure BDA0002894589020000101
The superscript T denotes the transpose matrix,
Figure BDA0002894589020000102
Figure BDA0002894589020000103
3.3 normalization
Figure BDA0002894589020000104
Figure BDA0002894589020000105
Step 4, establishing a weighted correlation coefficient algorithm model
Specifically, calculating a weighted correlation coefficient r of the test spectrum t and a standard spectrum s, wherein s is standard spectrum data finally obtained in the step 3, and s in s isiIs the ith data point in the standard spectral data. t in tiIs the ith data point in the test spectrum data.
Figure BDA0002894589020000106
ωiIs as followsWeights of i data points.
Figure BDA0002894589020000111
Figure BDA0002894589020000112
In the above expression of the weighted correlation coefficient r, the weight ωiThe corresponding expression is as follows:
Figure BDA0002894589020000113
the above formula contains a parameter k that needs to be specified by the user, which is the only parameter that needs to be considered in the weighted correlation coefficient algorithm model, and k is 0.5 in this embodiment.
Step 5, weighted correlation coefficient calculation and similar interval determination
Testing the test spectrum of each substance in the standard library for multiple times, and obtaining a correlation coefficient between the test spectrum and each standard spectrum in the standard library; and testing the test spectrum of the non-existing substance in the standard library for multiple times, obtaining the correlation coefficient between the test spectrum and the standard spectrum of each substance in the standard library, and counting the distribution of the correlation coefficient of each test spectrum, thereby setting a reasonable similar interval for each standard spectrum substance in the standard library.
In this embodiment, referring to fig. 2, fig. 2 is an infrared scanning raw spectrum of 10 substances in a standard spectrum library, referring to fig. 3, and sample labels 0, 1, … 9 in the ordinate of fig. 3 represent substance distributions of standard spectra to which the highest correlation coefficients between a test spectrum and 10 standard spectra in the standard spectrum library belong. The test spectra of 10 substances in the standard spectrum library are tested, 20 (200 in total) test spectra of the substances in the standard spectrum library and 200 test spectra of substances outside the standard spectrum library are tested, and the 10 different substances in the standard spectrum library and the substances outside the standard spectrum library are marked by 11 marks with different shapes. And (3) respectively calculating correlation coefficients of all the test spectra t and the standard spectra of 10 substances in the standard spectrum library, and obtaining the highest correlation coefficient between each test spectrum and 10 standard spectra, the distribution of the highest correlation coefficient and the substance distribution of the corresponding standard spectrum, as shown in fig. 3. It can be easily seen that the correlation coefficient between the test spectrum of the sample of the same substance in the standard spectrum library and the standard spectrum of the substance is the highest and is higher than 0.85. The highest correlation coefficient of other substances except the standard spectrum library and 10 substances in the standard spectrum library is lower than 0.85. The threshold value of substantially each substance may be set to approximately 0.85. The similarity interval is [ 0.85, 1 ]. Of course, the threshold value can be fine-tuned specifically according to the correlation coefficient distribution of the substance. And the highest correlation coefficient between all the test spectra and 10 standard spectra forms a highest correlation coefficient matrix, wherein the expression is r, in the embodiment, n is 10, h is 400, and if rjv is in the correlation coefficient similar interval of the jth substance, the substance to which the test spectra belong is judged as the jth substance. Otherwise, judging the sample to belong to other substances outside the standard spectrum library.
Figure BDA0002894589020000121
Fig. 4 is a schematic structural diagram of an infrared spectrum-based sample identification device according to an embodiment of the present invention, which may have relatively large differences due to different configurations or performances, and may include one or more processors 10 (CPUs) (e.g., one or more processors) and a memory 20, and one or more storage media 30 (e.g., one or more mass storage devices) for storing applications or data. The memory and storage medium may be, among other things, transient or persistent storage. The program stored on the storage medium may include one or more modules (not shown), each of which may include a series of instruction operations in the sample identification device. Still further, the processor may be configured to communicate with a storage medium to execute a series of instruction operations in the storage medium on the answer sheet segmentation apparatus.
The infrared spectroscopy-based sample identification apparatus may also include one or more power supplies 40, one or more wired or wireless network interfaces 50, one or more input-output interfaces 60, and/or one or more operating systems, such as Windows Server, Mac OS X, Unix, Linux, FreeBSD, etc. Those skilled in the art will appreciate that the infrared spectroscopy-based sample identification apparatus configuration shown in fig. 4 does not constitute a limitation of the sample identification apparatus and may include more or fewer components than those shown, or some components in combination, or a different arrangement of components.
The present invention also provides a computer readable storage medium, which may be a non-volatile computer readable storage medium, and which may also be a volatile computer readable storage medium, having stored therein instructions, which, when executed on a computer, cause the computer to perform the steps of the infrared spectroscopy-based sample identification method.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses, and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A sample identification method based on infrared spectroscopy is characterized by comprising the following steps:
establishing a standard spectrum library of substances based on infrared spectra of a series of standard substances, wherein each piece of spectral data in the standard spectrum library is called as a standard spectrum;
constructing a series of test samples, carrying out infrared spectrum scanning on the test samples, and extracting each piece of spectrum data of the test samples to be used as a test spectrum;
calculating the weighted correlation coefficient of the test spectrum and the standard spectrum, setting a similar interval according to the weighted correlation coefficient, and establishing an infrared spectrum identification model;
performing infrared spectrum scanning on a sample to be detected to obtain spectral data of the sample to be detected;
and inputting the spectral data into the infrared spectrum recognition model to recognize the sample to be detected.
2. The method for identifying a sample according to claim 1, wherein the standard spectrum S of a substance is the average value of m data points corresponding to p spectral data of the same substance measured in different batches in the standard spectrum libraryoriginal
Figure FDA0002894589010000011
3. The method of claim 2, wherein said calculating a weighted correlation coefficient between said test spectrum and said standard spectrum comprises:
filtering the test spectrum and the standard spectrum, and calculating to obtain mathematical expectation and variance of the standard spectrum data after filtering and mathematical expectation and variance of the test spectrum data after filtering;
and standardizing the standard spectrum data after the filtering processing and the test spectrum data after the filtering processing.
4. The method of claim 3, wherein the normalized standard spectral data is obtained by normalizing the infrared spectrum-based sample identification
Figure FDA0002894589010000021
Wherein s issecondFor the filtered standard spectral data, EsFor mathematical expectation of filtered standard spectral data, σs 2E is the variance of the standard spectral data after filtering processing and is the base number of a natural logarithm function;
standardized test spectral data
Figure FDA0002894589010000022
Wherein, tsecondFor the filtered test spectral data, EtFor mathematical expectation of filtered standard spectral data, σt 2Is the variance of the filtered standard spectral data.
5. The method for infrared spectrum based sample identification as claimed in claim 4, wherein the calculation of weighted correlation coefficients for the test spectrum and the standard spectrum uses the following formula:
weighted correlation coefficient
Figure FDA0002894589010000023
Wherein s isiIs the ith data point, t, in the normalized standard spectrum dataiTo normalize the ith data point in the processed test spectrum data,
Figure FDA0002894589010000024
ωiis the weight of the ith data point,
Figure FDA0002894589010000025
and k is a custom parameter.
6. The method of claim 5, wherein k is 0.01 or 0.5.
7. The method for infrared spectrum based sample identification as claimed in claim 1, wherein the test samples comprise samples of the same species in a standard spectral library and samples of species in a non-standard spectral library.
8. The method for identifying a sample according to claim 1, wherein the sample to be tested comprises drugs and explosives.
9. An infrared spectroscopy-based sample identification apparatus, the sample identification apparatus comprising: a memory having instructions stored therein and at least one processor, the memory and the at least one processor interconnected by a line;
the at least one processor invokes the instructions in the memory to cause the sample identification device to perform the infrared spectroscopy-based sample identification method of any one of claims 1-8.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out a method for infrared spectroscopy-based sample identification as claimed in any one of claims 1 to 8.
CN202110036604.1A 2021-01-12 2021-01-12 Sample identification method and device based on infrared spectrum and storage medium Active CN112834451B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110036604.1A CN112834451B (en) 2021-01-12 2021-01-12 Sample identification method and device based on infrared spectrum and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110036604.1A CN112834451B (en) 2021-01-12 2021-01-12 Sample identification method and device based on infrared spectrum and storage medium

Publications (2)

Publication Number Publication Date
CN112834451A true CN112834451A (en) 2021-05-25
CN112834451B CN112834451B (en) 2023-04-18

Family

ID=75929592

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110036604.1A Active CN112834451B (en) 2021-01-12 2021-01-12 Sample identification method and device based on infrared spectrum and storage medium

Country Status (1)

Country Link
CN (1) CN112834451B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114414478A (en) * 2021-12-21 2022-04-29 北京智创芯源科技有限公司 Sample testing device and testing method for cadmium zinc telluride sample

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101949824A (en) * 2009-06-30 2011-01-19 武汉矽感科技有限公司 Method and system device for detecting commodity according to spectral information of substance
CN102445712A (en) * 2011-11-22 2012-05-09 成都理工大学 Character window weighting related spectrum matching method facing rocks and minerals
CN103760126A (en) * 2013-12-16 2014-04-30 张雪峰 Infrared spectroscopic analysis and identification method for polypide of Cordyceps sinensis
CN105717066A (en) * 2016-01-29 2016-06-29 广西中烟工业有限责任公司 Near-infrared spectrum recognition model based on weighting association coefficients
WO2018121122A1 (en) * 2016-12-29 2018-07-05 同方威视技术股份有限公司 Raman spectroscopy detection method for checking goods, and electronic device
CN108362662A (en) * 2018-02-12 2018-08-03 山东大学 Near infrared spectrum similarity calculating method, device and substance qualitative analytic systems

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101949824A (en) * 2009-06-30 2011-01-19 武汉矽感科技有限公司 Method and system device for detecting commodity according to spectral information of substance
CN102445712A (en) * 2011-11-22 2012-05-09 成都理工大学 Character window weighting related spectrum matching method facing rocks and minerals
CN103760126A (en) * 2013-12-16 2014-04-30 张雪峰 Infrared spectroscopic analysis and identification method for polypide of Cordyceps sinensis
CN105717066A (en) * 2016-01-29 2016-06-29 广西中烟工业有限责任公司 Near-infrared spectrum recognition model based on weighting association coefficients
WO2018121122A1 (en) * 2016-12-29 2018-07-05 同方威视技术股份有限公司 Raman spectroscopy detection method for checking goods, and electronic device
CN108362662A (en) * 2018-02-12 2018-08-03 山东大学 Near infrared spectrum similarity calculating method, device and substance qualitative analytic systems

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
P. R. GRIFFITHS ET AL: "Self-weighted correlation coefficients and their application to measure spectral similarity", 《APPLIED SPECTROSCOPY》 *
徐永群等: "红外指纹图谱库与阵列相关系数法快速鉴别中药材", 《光谱实验室》 *
褚小立 等: "分子光谱自动检索算法、策略与应用进展", 《分析化学》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114414478A (en) * 2021-12-21 2022-04-29 北京智创芯源科技有限公司 Sample testing device and testing method for cadmium zinc telluride sample
CN114414478B (en) * 2021-12-21 2022-09-02 北京智创芯源科技有限公司 Sample testing device and testing method for cadmium zinc telluride sample

Also Published As

Publication number Publication date
CN112834451B (en) 2023-04-18

Similar Documents

Publication Publication Date Title
CN108362662B (en) Near infrared spectrum similarity calculation method and device and substance qualitative analysis system
CN109142317B (en) Raman spectrum substance identification method based on random forest model
CN109376574B (en) CNN-based (probabilistic neural network-based) HRRP (high-resolution Radar) target identification method for radar capable of refusing judgment
WO2018121122A1 (en) Raman spectroscopy detection method for checking goods, and electronic device
CN107092829B (en) Malicious code detection method based on image matching
CN110650058B (en) Network traffic analysis method, device, storage medium and equipment
CN117132778B (en) Spectrum measurement correction calculation method and system
CN104020128A (en) Method for rapidly identifying propolis source
CN112834451B (en) Sample identification method and device based on infrared spectrum and storage medium
CN113008805A (en) Radix angelicae decoction piece quality prediction method based on hyperspectral imaging depth analysis
CN107192686B (en) Method for identifying possible fuzzy clustering tea varieties by fuzzy covariance matrix
Yin et al. Non-destructive detection of foreign contaminants in toast bread with near infrared spectroscopy and computer vision techniques
CN114611582A (en) Method and system for analyzing substance concentration based on near infrared spectrum technology
CN111426657B (en) Identification comparison method of three-dimensional fluorescence spectrogram of soluble organic matter
CN113252641B (en) Substance identification method based on residual analysis under Raman spectrum
CN111650130A (en) Prediction method and prediction system for magnesium content of litchi leaves
CN110749575A (en) Traditional Chinese medicine polysaccharide two-dimensional infrared spectrum identification prediction model and construction method and application thereof
CN108241846B (en) Method for identifying Raman spectrogram
CN115420707A (en) Sewage near infrared spectrum chemical oxygen demand assessment method and system
CN115620818A (en) Protein mass spectrum peptide fragment verification method based on natural language processing
CN113791062A (en) Method for judging fixed substance type based on Raman spectrum
CN114971259A (en) Method for analyzing quality consistency of formula product by using near infrared spectrum
CN115420726A (en) Method for rapidly identifying target object by using reconstructed SERS spectrum
CN114689526A (en) Banana wilt nondestructive detection method, device and detection equipment
Wu et al. Significance test in operational ROC analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant