US20140032127A1 - Spectroscopic finger-printing of raw materials - Google Patents
Spectroscopic finger-printing of raw materials Download PDFInfo
- Publication number
- US20140032127A1 US20140032127A1 US13/886,869 US201313886869A US2014032127A1 US 20140032127 A1 US20140032127 A1 US 20140032127A1 US 201313886869 A US201313886869 A US 201313886869A US 2014032127 A1 US2014032127 A1 US 2014032127A1
- Authority
- US
- United States
- Prior art keywords
- spectra
- cultivation
- lots
- different
- component
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000002994 raw material Substances 0.000 title description 26
- 238000007639 printing Methods 0.000 title description 2
- 238000000034 method Methods 0.000 claims abstract description 44
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 17
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 17
- 210000004962 mammalian cell Anatomy 0.000 claims abstract description 7
- 238000001228 spectrum Methods 0.000 claims description 114
- 238000000513 principal component analysis Methods 0.000 claims description 49
- 238000004497 NIR spectroscopy Methods 0.000 claims description 48
- 238000004611 spectroscopical analysis Methods 0.000 claims description 24
- 238000004458 analytical method Methods 0.000 claims description 13
- 238000001506 fluorescence spectroscopy Methods 0.000 claims description 8
- 239000006228 supernatant Substances 0.000 claims description 6
- 238000003306 harvesting Methods 0.000 claims description 5
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 2
- 229940127121 immunoconjugate Drugs 0.000 claims description 2
- 238000012306 spectroscopic technique Methods 0.000 claims description 2
- 239000003531 protein hydrolysate Substances 0.000 description 62
- 239000000047 product Substances 0.000 description 48
- 239000002609 medium Substances 0.000 description 46
- 108010073771 Soybean Proteins Proteins 0.000 description 45
- 229940001941 soy protein Drugs 0.000 description 45
- 239000000306 component Substances 0.000 description 36
- 241000209094 Oryza Species 0.000 description 24
- 235000007164 Oryza sativa Nutrition 0.000 description 24
- 235000009566 rice Nutrition 0.000 description 24
- 230000005284 excitation Effects 0.000 description 17
- 239000000203 mixture Substances 0.000 description 17
- 238000000855 fermentation Methods 0.000 description 16
- 230000004151 fermentation Effects 0.000 description 16
- 238000002284 excitation--emission spectrum Methods 0.000 description 15
- 238000002189 fluorescence spectrum Methods 0.000 description 15
- 239000012092 media component Substances 0.000 description 13
- 230000003595 spectral effect Effects 0.000 description 10
- 238000010200 validation analysis Methods 0.000 description 9
- 238000009826 distribution Methods 0.000 description 7
- 239000012526 feed medium Substances 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 238000010521 absorption reaction Methods 0.000 description 6
- 238000009499 grossing Methods 0.000 description 6
- 238000004476 mid-IR spectroscopy Methods 0.000 description 6
- 238000007781 pre-processing Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 238000012937 correction Methods 0.000 description 5
- 238000002790 cross-validation Methods 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 238000001069 Raman spectroscopy Methods 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 229960000074 biopharmaceutical Drugs 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 230000005855 radiation Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 238000012569 chemometric method Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000000295 emission spectrum Methods 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000002329 infrared spectrum Methods 0.000 description 2
- 238000011177 media preparation Methods 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 230000000704 physical effect Effects 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 238000005033 Fourier transform infrared spectroscopy Methods 0.000 description 1
- 229910000673 Indium arsenide Inorganic materials 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- BPQQTUXANYXVAA-UHFFFAOYSA-N Orthosilicate Chemical compound [O-][Si]([O-])([O-])[O-] BPQQTUXANYXVAA-UHFFFAOYSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 238000005452 bending Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000000695 excitation spectrum Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- RPQDHPTXJYYUPQ-UHFFFAOYSA-N indium arsenide Chemical compound [In]#[As] RPQDHPTXJYYUPQ-UHFFFAOYSA-N 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 239000012533 medium component Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000000491 multivariate analysis Methods 0.000 description 1
- 239000002086 nanomaterial Substances 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000002861 polymer material Substances 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 239000010453 quartz Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 235000002639 sodium chloride Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/75—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated
- G01N21/77—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated by observing the effect on a chemical indicator
- G01N21/78—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated by observing the effect on a chemical indicator producing a change of colour
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12M—APPARATUS FOR ENZYMOLOGY OR MICROBIOLOGY; APPARATUS FOR CULTURING MICROORGANISMS FOR PRODUCING BIOMASS, FOR GROWING CELLS OR FOR OBTAINING FERMENTATION OR METABOLIC PRODUCTS, i.e. BIOREACTORS OR FERMENTERS
- C12M1/00—Apparatus for enzymology or microbiology
- C12M1/34—Measuring or testing with condition measuring or sensing means, e.g. colony counters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12M—APPARATUS FOR ENZYMOLOGY OR MICROBIOLOGY; APPARATUS FOR CULTURING MICROORGANISMS FOR PRODUCING BIOMASS, FOR GROWING CELLS OR FOR OBTAINING FERMENTATION OR METABOLIC PRODUCTS, i.e. BIOREACTORS OR FERMENTERS
- C12M3/00—Tissue, human, animal or plant cell, or virus culture apparatus
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/17—Systems in which incident light is modified in accordance with the properties of the material investigated
- G01N21/25—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
- G01N21/31—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
- G01N21/35—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/17—Systems in which incident light is modified in accordance with the properties of the material investigated
- G01N21/25—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
- G01N21/31—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
- G01N21/35—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
- G01N21/3577—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light for analysing liquids, e.g. polluted water
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/62—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
- G01N21/63—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
- G01N21/64—Fluorescence; Phosphorescence
- G01N21/6486—Measuring fluorescence of biological material, e.g. DNA, RNA, cells
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N24/00—Investigating or analyzing materials by the use of nuclear magnetic resonance, electron paramagnetic resonance or other spin effects
- G01N24/08—Investigating or analyzing materials by the use of nuclear magnetic resonance, electron paramagnetic resonance or other spin effects by using nuclear magnetic resonance
- G01N24/087—Structure determination of a chemical compound, e.g. of a biomolecule such as a protein
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/62—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
- G01N21/63—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
- G01N21/64—Fluorescence; Phosphorescence
- G01N2021/6417—Spectrofluorimetric devices
- G01N2021/6423—Spectral mapping, video display
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/17—Systems in which incident light is modified in accordance with the properties of the material investigated
- G01N21/25—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
- G01N21/31—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
- G01N21/35—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
- G01N21/359—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light using near infrared light
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2201/00—Features of devices classified in G01N21/00
- G01N2201/12—Circuits of general importance; Signal processing
- G01N2201/129—Using chemometrical methods
- G01N2201/1293—Using chemometrical methods resolving multicomponent spectra
Definitions
- culture media are complex mixtures of among other things inorganic salts, sugars, amino acid, vitamins, organic acids and buffers.
- complex, not chemically defined raw materials like protein hydrolyzates of plant or bacterial origin are used to promote cell growth and protein production.
- raw materials are supplied as powder mixtures and then dissolved in water to form the cultivation medium.
- a significant lot-to-lot variability can be observed, leading to large variations in the yield of recombinantly produced therapeutic proteins.
- Rapid spectroscopic ‘finger-printing’ techniques like Near-, Mid-Infrared, Raman, or 2D-Fluorescence spectroscopies, are relatively inexpensive and are well suited to analyze complex mixtures. These methods generate very large amounts of high dimensional data that can only be handled by chemometric methods like principal component analysis (PCA) or partial least squares (PLS) modeling.
- PCA principal component analysis
- PLS partial least squares
- PCA principal component analysis
- PLS partial least squares
- One aspect as reported herein is a method for the selection of cultivation media component batches or lots to be used in the cultivation of a mammalian cell expressing a protein of interest wherein at least two different components are employed in the cultivation, using for such selection fused spectral data of two different spectroscopic techniques.
- the first and second spectroscopic method are selected from NIR spectroscopy, MIR spectroscopy, and 2D-fluorescence spectroscopy.
- the processing of the spectra comprises the removing of the water absorption regions and the applying of a multiplicative scatter correction, and/or the filtering comprises a Savitzky-Golay filtering.
- the identifying patterns in the spectra is by principal component analysis.
- the principal component analysis is an unfolded principal component analysis.
- the unfolding preserves the information of the first mode (sample).
- the Savitzky-Golay smoothing is with a window of 19 points and a 2 nd order polynomial.
- the data is mean-centered, and the optimal number of principal components is chosen using the leave-one-out cross validation method.
- the processing comprises the exclusion of the regions of scattering and the interpolation of the removed points.
- the final spectra are made up by the emission wavelength range of 290 nm to 594 nm and the excitation wavelength range of 230 nm to 575 nm.
- the identifying of a relation between spectra fused and compressed with PCA scores, with cultivation yield at harvest is by partial least square analysis.
- the NIR spectra are collected over the wavenumber region of 4,784 cm ⁇ 1 to 8,936 cm ⁇ 1 .
- the spectral dimensionality is reduced from 1,039 wavenumbers to 3 principal components.
- the protein of interest is an antibody, or an antibody fragment, or an antibody conjugate.
- FIG. 1 Distribution of the different tested soy protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra.
- FIG. 2 NIR spectra of different soy protein hydrolyzate lots.
- FIG. 3 Distribution of the different tested rice protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra.
- FIG. 4 Distribution of the different tested chemically defined basic medium lots on a 2-dimensional space built through PCA based on the original NIR spectra.
- FIG. 5 PCA analysis based on pre-processed spectra of soy protein hydrolyzates lots.
- FIG. 6 PCA analysis based on pre-processed spectra of rice protein hydrolyzates lots.
- FIG. 7 PCA analysis based on pre-processed spectra of chemically defined basic medium lots.
- FIG. 8 Fluorescence EEM landscape of a soy protein hydrolyzate lot samples.
- FIG. 9 Processed fluorescence EEM landscape of a soy protein hydrolyzate lot samples.
- FIG. 10 Unfolded fluorescence landscapes into a row of emission spectra.
- FIG. 11 Excerpt of unfolded spectra for three different lots of soy protein hydrolyzate.
- FIG. 12 Score plot of PC1 ⁇ PC2 of a PCA for soy protein hydrolyzates of the unfolded EEM landscape.
- FIG. 13 Score plot of PC1 ⁇ PC2 of a PCA for rice protein hydrolyzates of the unfolded EEM landscape.
- FIG. 14 Score plot of PC1 ⁇ PC2 of a PCA for chemically defined basic medium of the unfolded EEM landscape.
- FIG. 15 Measured vs. cross-validation predicted plot.
- FIG. 16 PLS model correlating NIR spectra of different lots of the chemically defined basic medium and product yield.
- FIG. 17 PLS model correlating NIR spectra of different lots of the soy protein hydrolyzate and the chemically defined basic medium and product yield.
- FIG. 18 PLS model correlating fluorescence spectra of different lots of the soy protein hydrolyzate and NIR spectra of different lots of the chemically defined basic medium and product yield.
- FIG. 19 PLS model correlating fluorescence spectra of different lots of the soy protein hydrolyzate and MIR spectra of different lots of the chemically defined basic medium and product yield.
- FIG. 20 PLS model correlating NIR spectra of different lots of the soy protein hydrolyzate and fluorescence spectra of different lots of the chemically defined basic medium and product yield.
- FIG. 21 NIR absorption radiations of overtone and combination bands of covalent bonds organic molecules.
- cultivations can be performed with the same lots of soy protein hydrolyzate and rice protein hydrolyzate in the fermentation initial media formulation and feed media. Three series of experiments were performed (Tables 4, 5 and 6).
- the first series comprised six cultivations having soy protein hydrolyzate lot 3 (as in Table 3) and rice protein hydrolyzate lot 2 (as in Table 2) in the fermentation and feed media. Cultivations were grouped according to the chemically defined basic medium lot used. Performance of different chemically defined basic medium lots was evaluated based on the product yield. There is a slight difference between the two groups in both the average ICD and average product yield. With lower ICD a lower product formation can be obtained. Thus, the chemically defined basic medium lots have little or no effect on product yield.
- the second series involved six cultivations employing soy protein hydrolyzate lot 1 (as in Table 2) in the fermentation initial media formulation and feed media. Experiments were grouped according to the chemically defined basic medium lot used. No significant ICD differences were present. Thus, the differences on product yield are due to differences in the chemically defined basic medium lots used.
- the third series involved five cultivations having soy protein hydrolyzate lot 2 in the fermentation initial media formulation and feed media. Experiments were grouped according to the chemically defined basic medium lot used. There is a difference between the two groups in both the ICD used and the product concentration obtained.
- NIR, MIR, and 2D-fluorescence spectra can be acquired of all lots of the three different cultivation media components. Thereafter spectra analysis can be performed with established chemometric methods. A novel way of analyzing the spectral information obtained with these different sources is reported herein and can be used for predictive modeling purposes.
- NIR spectra of the lots of the raw materials were obtained as triplicates in different time periods. For powder and heterogeneous coarse samples NIR spectra vary among replicates. Such outlying replicates can be eliminated based on their relative location in the PCA scores plot space (Euclidean distance).
- NIR spectra of 18 lots of soy protein hydrolyzate, 12 lots of rice protein hydrolyzate, and 14 lots of chemically defined basic medium were selected out of all provided measurements. NIR spectra were collected between 4,784 cm ⁇ 1 and 8,936 cm ⁇ 1 . This spectral region does not contain noisy regions. The observed strong baseline shifts are due to light scattering associated with different raw-material lots having differences in mean particle size distributions (granularity). The analysis of raw spectra without baseline correction allows to focus on variations mainly caused by physical effects. PCA analysis of raw spectra was performed for each raw material separately.
- FIG. 1 shows the distribution of the different tested soy protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 94% of the NIR spectra variance.
- the spectral dimensionality was reduced from 1,039 wavenumbers to 3 significant principal components. Lots giving high product yield cannot be discriminated based on this analysis from those giving low product yield.
- granularity as seen by different NIR spectra baselines, FIG. 2
- humidity content as Karl Fischer measurements
- FIG. 3 shows how the tested rice protein hydrolyzate lots distribute on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 92% of the NIR spectra variance.
- soy protein hydrolyzate lots giving high product yield cannot be discriminated based on this analysis alone from lots giving low product yield. Again, granularity and humidity of the samples change from lot to lot affecting clustering.
- FIG. 4 shows the distribution of lots of the chemically defined basic medium on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 98% of the NIR spectra variance.
- soy and rice protein hydrolyzates lots giving high product yield cannot be discriminated from those giving low product yield based on this analysis alone.
- the three analyzed cultivation media components show significant lot-to-lot variability in granularity and humidity content, as can be seen by the NIR spectra obtained. NIR is very sensitive to both these factors. Additionally both these factors dominate over smaller but still significant chemical composition differences that might be present. Prior to PCA analysis physical information has to be removed by spectra pre-processing.
- MSC multiplicative scatter correction
- fluorescence excitation-emission spectra acquired of different water soluble fermentation raw-materials can be analyzed.
- a three-way data array, with excitation wavelengths along the x-axis, emission wavelengths along the y-axis, and intensity along the z-axis can be established.
- FIG. 8 a fluorescence EEM landscape of a soy protein hydrolyzate lot samples is shown.
- 2D-Fluorescence spectra of 19 lots of soy protein hydrolyzate, of 12 lots of rice protein hydrolyzate, and of 14 lots of chemically defined basic medium were obtained.
- the spectra were obtained using excitation wavelengths from 200 nm to 600 nm, with intervals of 5 nm, and emission wavelengths also from 200 nm to 600 nm, with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths.
- a three-way array for each of the raw materials can be generated from the individual matrices.
- a typical EEM spectrum can be influenced by Rayleigh and Raman scattering effects, which affect the information content of the fluorescence landscape. To overcome the Rayleigh effect several strategies and techniques can be used:
- This region (200 nm to 225 nm) was excluded from the spectra, as well the non-informative emission wavelengths (200 nm to 315 nm and 596 nm to 600 nm) and excitation wavelengths (580 nm to 600 nm).
- the resulting spectrum is shown in FIG. 9 .
- the final soy protein hydrolyzate spectra are made up by the emission wavelength range of 320 nm to 594 nm and the excitation wavelength range of 230 nm to 575 nm, resulting in an array of 19 ⁇ 138 ⁇ 70 elements.
- the same procedure can be followed for the rice protein hydrolyzates and the chemically defined basic medium datasets.
- the final rice protein hydrolyzate spectra are comprised of the emission and excitation wavelength range of 290 nm to 594 nm and 230 nm to 550 nm, respectively, resulting in an array of 12 ⁇ 153 ⁇ 65 elements.
- the final chemically defined basic medium spectra comprises the emission wavelength range of 290 nm to 594 nm and the excitation wavelength range of 230 nm to 550 nm, resulting in an array of 14 ⁇ 162 ⁇ 60 elements.
- the soy protein hydrolyzate comprises 2 or 3 fluorophores
- the rice protein hydrolyzate comprises 3 fluorophores
- the chemically defined basic medium comprises more than 4 fluorophores.
- a PCA of the unfolded fluorescence data array can be carried out for each component raw material.
- the unfolding procedure can be applied in any of the three modes of a three-way array.
- the unfolding preserving information of the first mode can be employed. In this way, the fluorescence landscapes can be unfolded into a row of emission spectra one after the other ( FIG. 10 ).
- the dimensions of the soy protein hydrolyzate array are 19 ⁇ 138 ⁇ 70 (lot ⁇ emission wavelength ⁇ excitation wavelength). After the unfolding strategy, a two-way matrix of size 19 ⁇ 9,960 can be obtained.
- FIG. 11 shows a small part of the resulting spectra for three different lots of soy protein hydrolyzate. Noise in the extreme excitation wavelengths can be seen.
- FIG. 12 shows the score plot of PC1 ⁇ PC2 of a PCA covering 96% of variance found on the whole unfolded EEM landscape.
- FIG. 13 shows the score plot of PC1 ⁇ PC2 of a PCA using three principal components covering more than 98% of the variance in the unfolded EEM spectra.
- FIG. 14 shows the score plot of PC1 ⁇ PC2 of a PCA using two principal components covering more than 92% of the total variance in the unfolded EEM spectra.
- a PLS model can be developed for predicting the product yield at the end of the process based on NIR and/or fluorescence spectra obtained for different lots of each media component and/or their combinations.
- the PLS algorithm is given an X block (pre-processed spectra, with or without variable selection) and a Y block (product parameter) and correlates both by finding the variation in X responsible for changes in Y (i.e. maximizing the covariance between both blocks).
- a basic set can be defined wherein most of the different lots of raw materials can be included. Out of replicate batches having same the lot combinations, the one giving the highest product yield was selected for the calibration dataset (Table 7).
- NIR spectra can be pre-processed as described before to remove the influence of physical effects originating from different particle size distributions. As no replicate spectra were used, the leave-one-out cross-validation method was used as internal validation strategy.
- the obtained model was made up of only two LVs but a non-significant R 2 of 0.139 was obtained.
- the measured vs. cross-validation predicted plot is presented in FIG. 15 .
- a PLS model correlating NIR spectra of different lots of the chemically defined basic medium and product yield can be built using the calibration dataset as presented in Table 8.
- the obtained model was made up of only two LVs but again a non significant R 2 of 0.04 was obtained ( FIG. 16 ).
- a combination strategy can be used between same spectroscopic/different media components and also between different spectroscopic/different media components.
- Model accuracy and long term robustness is reflected in a high R 2 with both calibration and validation errors being low, with a small difference between RMSECV and RMSEP ( FIG. 17 ).
- product yield can be correlated to spectroscopic data from different compounds of a cultivation medium obtained with a combination of spectroscopic information of same nature (NIR) for the two (most important) process raw-materials or media components.
- NIR spectroscopic information of same nature
- Each spectrum has 944 wavenumbers and the entire calibration dataset included in the model is represented by 18,880 variables (10 samples ⁇ 2 raw materials ⁇ 944 wavenumbers after variable selection).
- PCA analysis based on the spectra that were first compressed by converting the contained information into a few non-correlated variables was performed.
- the therewith obtained model was simpler and contained only 2 latent variables (LV) and an R 2 of 0.81 was obtained.
- the NIR spectra of the soy protein hydrolyzate and fluorescence spectra of the chemically defined basic medium were joined together and the resulting model was evaluated.
- the calibration and validation datasets used for building the model were the same as before (see Table 10).
- the obtained model has 3 latent variables and a very similar R 2 value (0.87) ( FIG. 20 ) and RMSECV and RMSEP values (124 mg/l and 60 mg/l, respectively).
- the method as reported herein is directed to the combination of spectra of different nature (fluorescence spectra and IR spectra), which intrinsically have different dimensions (two (2D) and one (1D), respectively), and that requires the operations of first compressing each spectrum to principal component analysis scores and second producing linear combinations of each spectrum scores.
- the spectra of different nature are combined by means of a dimensional reduction and a linear combination of those reduced transformed variables (PCA scores obtained by compressing each spectrum).
- spectra of different dimensions and nature are used to capture in a mixture of two different fermentation raw materials the components responsible for fermentation performance of said raw materials and to make predictions of fermentation yields for a specific combination of lots.
- the cells were cultivated in shake flasks in a temperature, humidity and carbon dioxide controlled environment.
- media were prepared with these lots and cells were inoculated in shake flasks containing these media.
- a certain volume of feed medium was added daily to the shake flask culture in order to prolong cell growth and achieve higher product concentrations.
- NIR Near Infrared Spectroscopy
- NIR NIR emerges in 1960s into the analytical world, with the work of Karl Norris of the US Department of Agriculture (Siesler et al, 2002). In the electromagnetic spectrum, the NIR region is located in between Mid-Infrared and Visible. In a range of wavenumber 4,000-14,000 cm ⁇ 1 (respectively wavelength 700-2,500 nm), the absorption radiation of overtone and combination bands of covalent bonds such as N—H, O—H and C—H of organic molecules ( FIG. 21 ).
- NIR spectra were collected using flat bottom scintillation vials in a Bruker MPA FT-NIR system, equipped with a tungsten-halogen source and an InAs detector. Each spectrum was recorded in the wavenumber range of 4,999 to 9,003 cm ⁇ 1 , in an average of 32 scans and a spectral resolution of 8 cm ⁇ 1 .
- Mid Infrared Spectra were obtained using quartz cuvettes in an Avatar 370 FT-IR, Thermo Fischer, Diamant ATR. Each spectrum was recorded in the wavenumber range of 4,000 to 400 cm ⁇ 1 .
- Fluorescence spectroscopy uses irradiation at a certain wavelength to excite molecules, which will then emit radiation of a different wavelength. This technique is often used for studying the structure and function of macromolecules, especially protein interactions. Tentative assignment of fluorescence characteristics of chromophores found in proteins and nucleic acids is presented in the following Table.
- 2D-fluorescence spectra of cell culture raw materials were obtained using excitation wavelengths from 200 nm to 600 nm, with intervals of 5 nm, and emission wavelengths also from 200 nm to 600 nm, but with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths.
- Emission-excitation fluorescence spectra were measured using a Varian Cary Eclipse Spectrometer, over an excitation wavelength range from 200 nm to 600 nm with intervals of 5 nm, and emission wavelength range also from 200 nm to 600 nm, but with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths.
- Data was collected using the software Cary Eclipse Bio, Package 1.1.
- Multivariate data analysis was performed using PCA (Principal Component Analysis) and PLS (Partial Least Squares). These techniques are based on the reduction of dimensionality present in the data, allowing the retrieval of relevant information hidden in the massive amount of data. It is made transforming the original measured variables into new variables called principal components.
- the PCA analysis was used to find patterns in the spectra. With the aim to relate these patterns with a particular parameter, PLS analysis was carried out to build a mathematical model able to predict the values of this parameter in future samples using only the spectral information.
- the major problems are related to the Raman and Rayleigh scattering, which are caused by deviations of the light that are not related to the fluorescence properties of the sample. Since the wavelength regions affected by scattering are known, the intensities measured in such particular regions can be removed replacing it by interpolated points.
- the three-way emission-excitation spectra were unfolded with the purpose of have a matrix suitable to the PLS and PCA analysis.
- a Parafac based three way analysis was also done for calibration purposes. (Bahram, M., et al., J. Chemometrics, 20 (2006) 99-105).
- the unfolding approach consists in concatenating two of these three dimensions, keeping the other fixed. In this case, the emission and excitation axis were concatenated, maintaining the information of the samples.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- High Energy & Nuclear Physics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Sustainable Development (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Hematology (AREA)
- General Engineering & Computer Science (AREA)
- Urology & Nephrology (AREA)
- Plasma & Fusion (AREA)
- Crystallography & Structural Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Food Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)
- Investigating Or Analysing Materials By Optical Means (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
- This application is a continuation of International Application No. PCT/EP2011/069267 having an international filing date of Nov. 3, 2011, the entire contents of which are incorporated herein by reference, and which claims benefit under 35 U.S.C. §119 to European Patent Application No. 10190193.2 filed Nov. 5, 2010.
- Herein is reported a method for the evaluation of cultivation material components with respect to product yield already upon receipt thereof and prior to and without the need to perform a test cultivation.
- The market for recombinant biopharmaceutical products has been growing constantly since the early 1980s, when recombinant DNA technology made it possible to express recombinant proteins in different types of microorganisms like bacteria, yeast or mammalian cells. Since then, these protein products have been used in a wide array of diagnostic and pharmaceutical applications.
- As the demand for recombinant proteins rises, the need for highly effective and robust production processes is imminent. One of the most important influencing factors for robust and reproducible production processes is the composition of the starting materials, such as culture media. Most culture media are complex mixtures of among other things inorganic salts, sugars, amino acid, vitamins, organic acids and buffers. In many cases, complex, not chemically defined raw materials like protein hydrolyzates of plant or bacterial origin are used to promote cell growth and protein production.
- Commonly, raw materials are supplied as powder mixtures and then dissolved in water to form the cultivation medium. In many cases, for not chemically defined protein hydrolyzates and also for chemically defined basal media mixtures, a significant lot-to-lot variability can be observed, leading to large variations in the yield of recombinantly produced therapeutic proteins.
- Rapid spectroscopic ‘finger-printing’ techniques like Near-, Mid-Infrared, Raman, or 2D-Fluorescence spectroscopies, are relatively inexpensive and are well suited to analyze complex mixtures. These methods generate very large amounts of high dimensional data that can only be handled by chemometric methods like principal component analysis (PCA) or partial least squares (PLS) modeling. The combination of complex spectroscopic methods and chemometrics is commonly used in identity testing for raw materials or as a tool for the classification of raw materials.
- The use of principal component analysis (PCA) and partial least squares (PLS) for processing and modeling complex data have been reported by Næs, T., et al., (Næs, T., et al., NIR Publications, (2002)). In WO 2009/086083 a method for hierarchically organizing data using PLS is reported. An analyzer and method for determining the relative importance of fractions of biological mixtures is reported in WO 2008/146059. In WO 2009/061326 the evaluation of chromatographic materials is reported.
- In US 2009/0306932 a rapid classification method for multivariate data arrays is reported. Analysing spectral data for the selection of a calibration model is reported in
EP 2 128 599. In U.S. Pat. No. 5,498,875 a signal processing for chemical analysis of samples is reported. A method for classifying scientific materials such as silicate materials, polymer materials and/or nanomaterials is reported in US 2008/0177481. In US 2010/0129857 methods for the isolation and identification of microorganisms are reported. - It has been found that the performance of production processes for recombinant proteins can be predicted based on the combination of NIR and 2D-fluorescence spectra of media components, such as protein hydrolyzates and/or chemically defined media preparations which are used as components of a complex cultivation medium.
- One aspect as reported herein is a method for the selection of cultivation media component batches or lots to be used in the cultivation of a mammalian cell expressing a protein of interest wherein at least two different components are employed in the cultivation, using for such selection fused spectral data of two different spectroscopic techniques.
- In one embodiment the method for the selection of cultivation component lots to be used in the cultivation of a mammalian cell expressing a protein of interest wherein at least two different cultivation components are employed in the cultivation comprises the following steps:
-
- a) providing spectra of different lots of a first component obtained with a first spectroscopic method, spectra of different lots of a second component obtained with a second spectroscopic method that is different from the first spectroscopic method, and the cultivation supernatant yield of the protein of interest obtained in a cultivation using combinations of these different lots of the first and the second component,
- b) identifying a relation of fused spectra after computing spectra PCA scores with the yield of the cultivation,
- c) providing a spectrum of a further lot of the first component obtained with the first spectroscopic method and/or a spectrum of a further lot of the second component obtained with the second spectroscopic method, and
- d) selecting the combination of the provided first component and the provided second component if the predicted cultivation supernatant yield based on the relation of fused spectra after computing spectra PCA scores identified in b) is within +/−10% of the mean yield provided in a).
- In one embodiment the method for the selection of cultivation component lots to be used in the cultivation of a mammalian cell expressing a protein of interest wherein at least two different cultivation components are employed in the cultivation comprises the following steps:
-
- a) providing spectra of different lots of a first component obtained with a first spectroscopic method, spectra of different lots of a second component obtained with a second spectroscopic method that is different from the first spectroscopic method, and the cultivation supernatant yield of the protein of interest obtained in a cultivation using combinations of these different lots of the first and the second component,
- b) processing the spectra, filtering the spectra, smoothing the spectra, and transforming the spectra to their first derivative,
- c) identifying patterns in the spectra,
- d) identifying a relation of the patterns identified in c) with the yield of the cultivation,
- e) providing a spectrum of a further lot of the first component obtained with the first spectroscopic method and/or a spectrum of a further lot of the second component obtained with the second spectroscopic method,
- f) processing the spectra, filtering the spectra, smoothing the spectra, and transforming the spectra to their first derivative,
- g) selecting the combination of the provided first component and the provided second component if the predicted cultivation supernatant yield based on the relation identified in d) is within +/−10% of the mean yield provided in a).
- In one embodiment the first and second spectroscopic method are selected from NIR spectroscopy, MIR spectroscopy, and 2D-fluorescence spectroscopy.
- In one embodiment the processing of the spectra comprises the removing of the water absorption regions and the applying of a multiplicative scatter correction, and/or the filtering comprises a Savitzky-Golay filtering.
- In one embodiment the identifying patterns in the spectra is by principal component analysis. In one embodiment the principal component analysis is an unfolded principal component analysis. In one embodiment the unfolding preserves the information of the first mode (sample). In one embodiment the Savitzky-Golay smoothing is with a window of 19 points and a 2nd order polynomial. In one embodiment the data is mean-centered, and the optimal number of principal components is chosen using the leave-one-out cross validation method.
- In one embodiment the processing comprises the exclusion of the regions of scattering and the interpolation of the removed points. In one embodiment the final spectra are made up by the emission wavelength range of 290 nm to 594 nm and the excitation wavelength range of 230 nm to 575 nm.
- In one embodiment the identifying of a relation between spectra fused and compressed with PCA scores, with cultivation yield at harvest is by partial least square analysis.
- In one embodiment the NIR spectra are collected over the wavenumber region of 4,784 cm−1 to 8,936 cm−1.
- In one embodiment the spectral dimensionality is reduced from 1,039 wavenumbers to 3 principal components.
- In one embodiment the protein of interest is an antibody, or an antibody fragment, or an antibody conjugate.
-
FIG. 1 Distribution of the different tested soy protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra. -
FIG. 2 NIR spectra of different soy protein hydrolyzate lots. -
FIG. 3 Distribution of the different tested rice protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra. -
FIG. 4 Distribution of the different tested chemically defined basic medium lots on a 2-dimensional space built through PCA based on the original NIR spectra. -
FIG. 5 PCA analysis based on pre-processed spectra of soy protein hydrolyzates lots. -
FIG. 6 PCA analysis based on pre-processed spectra of rice protein hydrolyzates lots. -
FIG. 7 PCA analysis based on pre-processed spectra of chemically defined basic medium lots. -
FIG. 8 Fluorescence EEM landscape of a soy protein hydrolyzate lot samples. -
FIG. 9 Processed fluorescence EEM landscape of a soy protein hydrolyzate lot samples. -
FIG. 10 Unfolded fluorescence landscapes into a row of emission spectra. -
FIG. 11 Excerpt of unfolded spectra for three different lots of soy protein hydrolyzate. -
FIG. 12 Score plot of PC1×PC2 of a PCA for soy protein hydrolyzates of the unfolded EEM landscape. -
FIG. 13 Score plot of PC1×PC2 of a PCA for rice protein hydrolyzates of the unfolded EEM landscape. -
FIG. 14 Score plot of PC1×PC2 of a PCA for chemically defined basic medium of the unfolded EEM landscape. -
FIG. 15 Measured vs. cross-validation predicted plot. -
FIG. 16 PLS model correlating NIR spectra of different lots of the chemically defined basic medium and product yield. -
FIG. 17 PLS model correlating NIR spectra of different lots of the soy protein hydrolyzate and the chemically defined basic medium and product yield. -
FIG. 18 PLS model correlating fluorescence spectra of different lots of the soy protein hydrolyzate and NIR spectra of different lots of the chemically defined basic medium and product yield. -
FIG. 19 PLS model correlating fluorescence spectra of different lots of the soy protein hydrolyzate and MIR spectra of different lots of the chemically defined basic medium and product yield. -
FIG. 20 PLS model correlating NIR spectra of different lots of the soy protein hydrolyzate and fluorescence spectra of different lots of the chemically defined basic medium and product yield. -
FIG. 21 NIR absorption radiations of overtone and combination bands of covalent bonds organic molecules. - It has been found that the performance of production processes for recombinant proteins can be predicted based on the combined information contained in NIR and 2D-fluorescence spectra of media components, such as protein hydrolyzates and/or chemically defined media preparations which are used as components of a complex cultivation medium.
- Herein is reported a method in which spectra from two different (orthogonal) spectroscopy techniques—after processing to make them additive via variable reduction to principal component analysis (PCA) scores—obtained on two media components used in the fermentation of recombinant biopharmaceuticals are combined and models of such transformed spectra (inputs) are used to predict the yields at harvest (output) of biopharmaceutical product's cultivations based on mixtures of studied media components with lot-to-lot variability in terms of different fermentation performance.
- By using different (orthogonal) spectroscopies in combination with PCA methods (to ensure their additivity) and producing process models of the effect of such cultivation media mixtures on yields at harvest of the main fermentation a predictive capability is established that allows selecting media lots of each raw material and/or formulating mixtures that best serve the process goals.
- Different lots of individual components forming a complete cultivation medium vary slightly in their detailed composition but are still within the specification given by the manufacturer. In some cases, it is possible to trace this variability to single ingredients, but most commonly the lot-to-lot variability cannot be detected by analytical means. For the evaluation of the influence of different individual component lots on product yield a comparable cultivation of the same mammalian cell line can be repeatedly performed.
- Herein are reported 56 cultivations in which nine different lots of a soy protein hydrolyzate, two mixtures of two different soy protein hydrolyzate lots, five lots of a rice protein hydrolyzate, and six lots of a chemically defined basic medium powder were employed in the fermentation and feed medium, respectively.
- To assess the influence of different soy protein hydrolyzate lots with respect to product yield comparable cultivations were performed in which the same lots of a chemically defined basic medium and a rice protein hydrolyzate were used in fermentation and feed media. The results can be grouped according to the different soy protein hydrolyzate lots employed. The performance of different lots was evaluated based on the product yield at similar average inoculation cell density (ICD) values (Table 1).
-
TABLE 1 chemically soy protein defined rice protein product hydrolyzate basic medium hydrolyzate at 330 h batch lot No. lot No. lot No. ICD [mg/l] D45KD11 1 1 1 5.7 1319 D45KD12 5.3 1234 D45KD13 5.6 1305 D45KD22 2 5.3 1023 D45KD23 5.1 1070 D45KD31 3 4.8 1008 D45KD32 4.9 991 D45KD33 5.3 978 - The results obtained for a second set of cultivations are listed in Table 2.
-
TABLE 2 chemically soy protein defined rice protein product hydrolyzate basic medium hydrolyzate at 330 h batch lot No. lot No. lot No. ICD [mg/l] D52KD11 1 2 2 6.1 1434 D52KD12 5.0 1411 D52KD13 5.6 1459 D52KD21 4 5.0 1213 D52KD22 5.3 1243 D52KD23 5.4 1163 D55KD11 5 5.0 1409 D55KD12 5.4 1426 D55KD13 5.7 1430 D55KD21 2 6.8 1263 D55KD22 6.8 1256 D55KD23 6.8 1278 D55KD31 6 6.1 1269 D55KD32 6.1 1262 D55KD33 5.8 1265 - It can be seen that different lots of the individual components result in different product yields. In this series of cultivations also different average ICD values were used. Although having low ICD values,
cultivations using lot 1 and lot 5 gave significantly higher product yields than the ones having higher ICD values (lot 3 and lot 6). Thus, different soy protein hydrolyzate lots results in different production performance. - Analogously the influence of rice protein hydrolyzate on process performance can be evaluated (Table 3).
-
TABLE 3 chemically soy protein defined rice protein product hydrolyzate basic medium hydrolyzate at 330 h batch lot No. lot No. lot No. ICD [mg/l] D61KD11 3 3 2 5.9 1132 D61KD12 6.0 1085 D61KD13 5.3 1101 D61KD21 3 6.1 1062 D61KD22 6.1 1056 D61KD23 5.6 1043 - Six cultivations were performed and can be grouped according to the different lots of rice protein hydrolyzate used in each of them. Performance of the different rice protein hydrolyzate lots can be evaluated based on the mean product yield. Both groups, i.e. rice protein hydrolyzate lots, have similar ICD values.
- To assess the influence of the chemically defined basic medium on the product yield, cultivations can be performed with the same lots of soy protein hydrolyzate and rice protein hydrolyzate in the fermentation initial media formulation and feed media. Three series of experiments were performed (Tables 4, 5 and 6).
- The first series comprised six cultivations having soy protein hydrolyzate lot 3 (as in Table 3) and rice protein hydrolyzate lot 2 (as in Table 2) in the fermentation and feed media. Cultivations were grouped according to the chemically defined basic medium lot used. Performance of different chemically defined basic medium lots was evaluated based on the product yield. There is a slight difference between the two groups in both the average ICD and average product yield. With lower ICD a lower product formation can be obtained. Thus, the chemically defined basic medium lots have little or no effect on product yield.
-
TABLE 4 chemically soy protein defined rice protein product hydrolyzate basic medium hydrolyzate at 330 h batch lot No. lot No. lot No. ICD [mg/l] D55KD21 3 2 2 6.8 1263 D55KD22 6.8 1256 D55KD23 6.8 1278 D61KD11 3 5.9 1132 D61KD12 6.0 1085 D61KD13 5.3 1101 - The second series involved six cultivations employing soy protein hydrolyzate lot 1 (as in Table 2) in the fermentation initial media formulation and feed media. Experiments were grouped according to the chemically defined basic medium lot used. No significant ICD differences were present. Thus, the differences on product yield are due to differences in the chemically defined basic medium lots used.
-
TABLE 5 soy protein chemically defined product hydrolyzate basic medium lot at 330 h batch lot No. No. ICD [mg/l] D45KD11 1 1 5.7 1319 D45KD12 5.3 1234 D45KD13 5.6 1205 D52KD11 2 6.1 1434 D52KD12 5.0 1411 D52KD13 5.6 1459 - The third series involved five cultivations having soy
protein hydrolyzate lot 2 in the fermentation initial media formulation and feed media. Experiments were grouped according to the chemically defined basic medium lot used. There is a difference between the two groups in both the ICD used and the product concentration obtained. -
TABLE 6 soy protein chemically defined product hydrolyzate basic medium lot at 330 h batch lot No. No. ICD [mg/l] D45KD22 2 1 5.3 1023 D45KD23 5.1 1070 D73KD11 4 4.9 1062 D73KD12 4.3 1112 D73KD13 4.4 1121 - From the above it can be seen that there exists a need for raw-material lot characterization and a need to provide a method in which the obtained data can be used to predict which raw-material lots produce higher yields of product without the need to perform fermentation experiments.
- NIR, MIR, and 2D-fluorescence spectra can be acquired of all lots of the three different cultivation media components. Thereafter spectra analysis can be performed with established chemometric methods. A novel way of analyzing the spectral information obtained with these different sources is reported herein and can be used for predictive modeling purposes.
- NIR spectra of the lots of the raw materials were obtained as triplicates in different time periods. For powder and heterogeneous coarse samples NIR spectra vary among replicates. Such outlying replicates can be eliminated based on their relative location in the PCA scores plot space (Euclidean distance).
- NIR spectra of 18 lots of soy protein hydrolyzate, 12 lots of rice protein hydrolyzate, and 14 lots of chemically defined basic medium were selected out of all provided measurements. NIR spectra were collected between 4,784 cm−1 and 8,936 cm−1. This spectral region does not contain noisy regions. The observed strong baseline shifts are due to light scattering associated with different raw-material lots having differences in mean particle size distributions (granularity). The analysis of raw spectra without baseline correction allows to focus on variations mainly caused by physical effects. PCA analysis of raw spectra was performed for each raw material separately.
-
FIG. 1 shows the distribution of the different tested soy protein hydrolyzate lots on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 94% of the NIR spectra variance. The spectral dimensionality was reduced from 1,039 wavenumbers to 3 significant principal components. Lots giving high product yield cannot be discriminated based on this analysis from those giving low product yield. In addition granularity (as seen by different NIR spectra baselines,FIG. 2 ) and humidity content (as Karl Fischer measurements) of the samples are also different making a clustering of the lots according to any single property very difficult. -
FIG. 3 shows how the tested rice protein hydrolyzate lots distribute on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 92% of the NIR spectra variance. As for the soy protein hydrolyzate, lots giving high product yield cannot be discriminated based on this analysis alone from lots giving low product yield. Again, granularity and humidity of the samples change from lot to lot affecting clustering. -
FIG. 4 shows the distribution of lots of the chemically defined basic medium on a 2-dimensional space built through PCA based on the original NIR spectra, capturing 98% of the NIR spectra variance. As for the soy and rice protein hydrolyzates lots giving high product yield cannot be discriminated from those giving low product yield based on this analysis alone. - The three analyzed cultivation media components show significant lot-to-lot variability in granularity and humidity content, as can be seen by the NIR spectra obtained. NIR is very sensitive to both these factors. Additionally both these factors dominate over smaller but still significant chemical composition differences that might be present. Prior to PCA analysis physical information has to be removed by spectra pre-processing.
- Water absorbs very strongly in the NIR region especially in the range of from 6,900 cm−1 to 7,150 cm−1 and of from 5,160 cm−1 to 5,270 cm−1. These absorption regions are caused by the first overtone of the O—H stretching band and the combination of the O—H stretching and the O—H bending bands, respectively. Water absorption regions can be removed. Moreover, the baseline shift can be eliminated by applying multiplicative scatter correction (MSC). In order to enhance the variance between samples, the Savitzky-Golay filtering and smoothing method can be applied, and spectra can be transformed to their first derivative (window of 25 points).
- The PCA analysis was performed on previously pre-processed spectra of soy protein hydrolyzates (
FIG. 5 ). Almost all very good to good performing lots in terms of process yield group at the left-hand side of the PCA plot (negative PC1 score values). Conversely,lot 4, which appears to perform poorly, occupies the space on the right-hand side of the plot. - The PCA analysis was performed on previously pre-processed spectra of rice protein hydrolyzates (
FIG. 6 ). Lots giving very similar yields cluster together, thus, showing that PCA of pre-processed spectra is adequate and that there is already some lot-to-lot variability that can be traced to chemical composition of this component raw-material, which is unrelated to granularity or moisture level. - The PCA analysis of the chemically defined basic mediums' pre-processed spectra (
FIG. 7 ) shows that in general all very good to good performing lots group at the left-hand side of the PCA plot (negative score values of PC1). Conversely,lot 3, which appears to perform poorly, occupies the space on the right-hand side of the plot. Those results are comparable with the results obtained for the protein hydrolyzate lots. - Besides NIR spectra, fluorescence excitation-emission spectra (EEM) acquired of different water soluble fermentation raw-materials can be analyzed. A three-way data array, with excitation wavelengths along the x-axis, emission wavelengths along the y-axis, and intensity along the z-axis can be established. In
FIG. 8 a fluorescence EEM landscape of a soy protein hydrolyzate lot samples is shown. - 2D-Fluorescence spectra of 19 lots of soy protein hydrolyzate, of 12 lots of rice protein hydrolyzate, and of 14 lots of chemically defined basic medium were obtained. The spectra were obtained using excitation wavelengths from 200 nm to 600 nm, with intervals of 5 nm, and emission wavelengths also from 200 nm to 600 nm, with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths.
- In order to allow a prediction of cultivation yield based on the analysis of the raw material a three-way array for each of the raw materials can be generated from the individual matrices.
- A typical EEM spectrum can be influenced by Rayleigh and Raman scattering effects, which affect the information content of the fluorescence landscape. To overcome the Rayleigh effect several strategies and techniques can be used:
-
- zeroing the emission wavelengths smaller than the excitation ones;
- inserting missing values in the region of scattering;
- excluding the region of scattering and interpolating the removed points; or
- subtracting the background spectra.
- It has been found that excluding the region of scattering and the interpolation of the removed points is most suited in the method as reported herein. The Matlab© algorithm EEMscat can be employed therefore. This algorithm can be downloaded free from world-wide-web site: httt://www.models.kvl.dk/source/EEM_correction/. With this proceeding the scattering can be removed completely. The spectrum also shows pronounced noise along the entire emission axis in the first excitation wavelength. This region (200 nm to 225 nm) was excluded from the spectra, as well the non-informative emission wavelengths (200 nm to 315 nm and 596 nm to 600 nm) and excitation wavelengths (580 nm to 600 nm). The resulting spectrum is shown in
FIG. 9 . - The final soy protein hydrolyzate spectra are made up by the emission wavelength range of 320 nm to 594 nm and the excitation wavelength range of 230 nm to 575 nm, resulting in an array of 19×138×70 elements. The same procedure can be followed for the rice protein hydrolyzates and the chemically defined basic medium datasets. Thus, the final rice protein hydrolyzate spectra are comprised of the emission and excitation wavelength range of 290 nm to 594 nm and 230 nm to 550 nm, respectively, resulting in an array of 12×153×65 elements. The final chemically defined basic medium spectra comprises the emission wavelength range of 290 nm to 594 nm and the excitation wavelength range of 230 nm to 550 nm, resulting in an array of 14×162×60 elements.
- In conclusion, a pre-processing of the EEM spectra can be performed for each raw material data set to enhance signal to noise ratio. The differences between each raw material can thus be clearly seen: the soy protein hydrolyzate comprises 2 or 3 fluorophores, the rice protein hydrolyzate comprises 3 fluorophores and the chemically defined basic medium comprises more than 4 fluorophores.
- In order to obtain an overview of raw material lot-to-lot variability, a PCA of the unfolded fluorescence data array can be carried out for each component raw material. The unfolding procedure can be applied in any of the three modes of a three-way array. In order to enhance the lot-to-lot differences the unfolding preserving information of the first mode (samples) can be employed. In this way, the fluorescence landscapes can be unfolded into a row of emission spectra one after the other (
FIG. 10 ). - The dimensions of the soy protein hydrolyzate array are 19×138×70 (lot×emission wavelength×excitation wavelength). After the unfolding strategy, a two-way matrix of size 19×9,960 can be obtained.
FIG. 11 shows a small part of the resulting spectra for three different lots of soy protein hydrolyzate. Noise in the extreme excitation wavelengths can be seen. - To overcome these deviations, several strategies can be used. It has been found that the Savitzky-Golay smoothing using a window of 19 points and 2nd order polynomial to remove noise is best suited, and the Multiplicative Scatter Correction (MSC) is best suited to eliminate the baseline drift.
- Unfolded-PCA was applied to the soy protein hydrolyzate pre-processed matrix. The data was mean-centered, and the optimal number of principal components was chosen using the leave-one-out cross validation method.
FIG. 12 shows the score plot of PC1×PC2 of a PCA covering 96% of variance found on the whole unfolded EEM landscape. - After unfolding the resulting rice protein hydrolyzate matrix had the size 12×9,945. The same pre-processing used for soy protein hydrolyzate was applied.
FIG. 13 shows the score plot of PC1×PC2 of a PCA using three principal components covering more than 98% of the variance in the unfolded EEM spectra. - The size of unfolded chemically defined basic medium matrix was 14×9,600. The same EEM spectra pre-processing procedure as applied to the other two media components was used.
FIG. 14 shows the score plot of PC1×PC2 of a PCA using two principal components covering more than 92% of the total variance in the unfolded EEM spectra. As before with NIR spectra for the same media components it was found that lots giving higher yields are separated from lots giving lower yields in the PCA score plots of EEM unfolded spectra. - A PLS model can be developed for predicting the product yield at the end of the process based on NIR and/or fluorescence spectra obtained for different lots of each media component and/or their combinations. The PLS algorithm is given an X block (pre-processed spectra, with or without variable selection) and a Y block (product parameter) and correlates both by finding the variation in X responsible for changes in Y (i.e. maximizing the covariance between both blocks). A basic set can be defined wherein most of the different lots of raw materials can be included. Out of replicate batches having same the lot combinations, the one giving the highest product yield was selected for the calibration dataset (Table 7).
-
TABLE 7 soy protein hydrolyzate F/ZF product at 330 h batch lot No. [mg/l] D52KD13 1 1458 D52KD22 4 1232 D55KD13 5 1430 D55KD23 3 1257 D55KD31 6 1263 D73KD13 2 1120 D73KD33 7 1044 D79KD22 8 1162 - NIR spectra can be pre-processed as described before to remove the influence of physical effects originating from different particle size distributions. As no replicate spectra were used, the leave-one-out cross-validation method was used as internal validation strategy.
- The obtained model was made up of only two LVs but a non-significant R2 of 0.139 was obtained. The measured vs. cross-validation predicted plot is presented in
FIG. 15 . - A PLS model correlating NIR spectra of different lots of the chemically defined basic medium and product yield can be built using the calibration dataset as presented in Table 8.
-
TABLE 8 chemically defined basic medium F/ZF product at 330 h batch lot No. [mg/l] D45KD11 1 1314 D52KD13 2 1458 D61KD12 3 1134 D73KD21 4 1147 D79KD22 5 1162 - The obtained model was made up of only two LVs but again a non significant R2 of 0.04 was obtained (
FIG. 16 ). - Considering not only one medium component, but the two most relevant ones influencing yield, and also taking into account that different chemical information is captured by each different spectroscopic method used, a combination strategy can be used between same spectroscopic/different media components and also between different spectroscopic/different media components.
- The criteria used for selecting calibration and validation batches were based in getting the widest range possible during calibration (Table 9).
-
TABLE 9 chemically soy protein defined basic product at hydrolyzate F/ZF medium 330 h batch lot No. F/ZF lot [mg/l] calibration D45KD11 1 1 1314 D45KD31 3 1 999 D52KD13 1 2 1458 D52KD22 4 2 1232 D55KD13 5 2 1430 D55KD31 6 2 1263 D61KD12 3 3 1134 D73KD13 2 4 1120 D73KD33 7 4 1044 D79KD22 8 5 1162 validation D45KD23 2 1 1061 D55KD23 3 2 1257 D73KD21 8 4 1147 - External validation was done with one third of the data set. Calibration and validation data (NIR spectra) were pre-processed in the same manner as described before. The obtained prediction model is based on 3 LVs and the obtained R2 reached a significant value of 0.88.
- Model accuracy and long term robustness is reflected in a high R2 with both calibration and validation errors being low, with a small difference between RMSECV and RMSEP (
FIG. 17 ). In the above case, the prediction error was low (RMSEP=36 mg/l) and did not differ much from the RMSECV (126 mg/l). - Thus, it has been found that product yield can be correlated to spectroscopic data from different compounds of a cultivation medium obtained with a combination of spectroscopic information of same nature (NIR) for the two (most important) process raw-materials or media components. Each spectrum has 944 wavenumbers and the entire calibration dataset included in the model is represented by 18,880 variables (10 samples×2 raw materials×944 wavenumbers after variable selection). In order to reduce the required workload a PCA analysis based on the spectra that were first compressed by converting the contained information into a few non-correlated variables was performed. The therewith obtained model was simpler and contained only 2 latent variables (LV) and an R2 of 0.81 was obtained.
- Different spectroscopic methods capture complementary chemical information. Using two different types of spectroscopic information improved the predictive quality of the model. Therefore, fluorescence spectra of soy protein hydrolyzate and NIR spectra of the chemically defined basic medium were used (Table 10).
-
TABLE 10 chemically soy protein defined basic product at hydrolyzate F/ZF medium 330 h batch lot No. F/ZF lot [mg/l] calibration D45KD11 1 1 1314 D45KD31 3 1 999 D52KD13 1 2 1458 D52KD22 4 2 1232 D55KD13 5 2 1430 D55KD31 6 2 1263 D61KD12 3 3 1134 D73KD13 2 4 1120 D73KD33 7 4 1044 D79KD22 8 5 1162 validation D45KD23 2 1 1061 D55KD23 3 2 1257 D73KD21 8 4 1147 - Fluorescence spectra and NIR spectrawere compressed to a few principal components after pre-processing as described before. The obtained model has only 3 latent variables and an R2 of 0.90 was obtained (
FIG. 18 ). This model has better performance when compared to previous models and is more robust since it not only has higher R2 value, but also has lower RMSECV and RMSEP values (ca. 90 mg/l) with a very small difference between them. - A further test was made using MIR instead of NIR for the chemically defined basic medium. Calibration and validation datasets used were the same as presented before (see Table 10). Fluorescence and MIR spectra were pre-processed as described before. The obtained model has 3 latent variables, an R2 of 0.88, and low RMSECV and RMSEP values with no difference between them (ca. 100 mg/l both), thus showing no significant difference to the one obtained with the NIR data for the chemically defined basic medium (
FIG. 19 ). - The NIR spectra of the soy protein hydrolyzate and fluorescence spectra of the chemically defined basic medium were joined together and the resulting model was evaluated. The calibration and validation datasets used for building the model were the same as before (see Table 10). The obtained model has 3 latent variables and a very similar R2 value (0.87) (
FIG. 20 ) and RMSECV and RMSEP values (124 mg/l and 60 mg/l, respectively). - With an analytical variance for the reference analytics of product at around 60 mg/l (5% of 1200 mg/l the average product concentration) most models developed showed a prediction accuracy very close to the experimental limit.
- In conclusion, to achieve a prediction of product yield at 330 h, spectral information of both soy protein hydrolyzate and chemically defined basic medium must be used. The use of fluorescence spectroscopy data for the chemically defined basic medium gives slightly lower (but even though very comparable) prediction errors, than models based on NIR spectroscopic data for the chemically defined basic medium and 2D-Fluorescence spectroscopic data for the soy protein hydrolyzate.
- The method as reported herein is directed to the combination of spectra of different nature (fluorescence spectra and IR spectra), which intrinsically have different dimensions (two (2D) and one (1D), respectively), and that requires the operations of first compressing each spectrum to principal component analysis scores and second producing linear combinations of each spectrum scores. The spectra of different nature are combined by means of a dimensional reduction and a linear combination of those reduced transformed variables (PCA scores obtained by compressing each spectrum).
- Thus, in the method as reported herein spectra of different dimensions and nature are used to capture in a mixture of two different fermentation raw materials the components responsible for fermentation performance of said raw materials and to make predictions of fermentation yields for a specific combination of lots.
- With the method as reported herein it is possible to predict based on the spectra of two different raw materials to be used in a
fermentation process performance 10 to 14 days in advance by determining the conditions at harvest of the fermentation. - The following examples and figures are provided to aid the understanding of the present invention, the true scope of which is set forth in the appended claims. It is understood that modifications can be made in the procedures set forth without departing from the spirit of the invention.
- The cells were cultivated in shake flasks in a temperature, humidity and carbon dioxide controlled environment. In order to compare different lots, media were prepared with these lots and cells were inoculated in shake flasks containing these media. A certain volume of feed medium was added daily to the shake flask culture in order to prolong cell growth and achieve higher product concentrations.
- NIR emerges in 1960s into the analytical world, with the work of Karl Norris of the US Department of Agriculture (Siesler et al, 2002). In the electromagnetic spectrum, the NIR region is located in between Mid-Infrared and Visible. In a range of wavenumber 4,000-14,000 cm−1 (respectively wavelength 700-2,500 nm), the absorption radiation of overtone and combination bands of covalent bonds such as N—H, O—H and C—H of organic molecules (
FIG. 21 ). - NIR spectra were collected using flat bottom scintillation vials in a Bruker MPA FT-NIR system, equipped with a tungsten-halogen source and an InAs detector. Each spectrum was recorded in the wavenumber range of 4,999 to 9,003 cm−1, in an average of 32 scans and a spectral resolution of 8 cm−1.
- Mid Infrared Spectra were obtained using quartz cuvettes in an Avatar 370 FT-IR, Thermo Fischer, Diamant ATR. Each spectrum was recorded in the wavenumber range of 4,000 to 400 cm−1.
- Fluorescence spectroscopy uses irradiation at a certain wavelength to excite molecules, which will then emit radiation of a different wavelength. This technique is often used for studying the structure and function of macromolecules, especially protein interactions. Tentative assignment of fluorescence characteristics of chromophores found in proteins and nucleic acids is presented in the following Table.
-
Absorption Fluorescence Substance Imax (nm) max (10−3) Imax (nm) fF tryptophan 280 5.60 348 0.20 tyrosine 274 1.40 393 0.14 phenylalanine 257 0.20 282 0.04 adenine 260 13.40 321 2.60 × 10−4 guanine 275 8.10 329 2.60 × 10−4 cytosine 267 6.10 313 0.80 × 10−4 uracil 260 9.50 308 0.40 × 10−4 NADH 340 6.20 470 0.02 - 2D-fluorescence spectra of cell culture raw materials were obtained using excitation wavelengths from 200 nm to 600 nm, with intervals of 5 nm, and emission wavelengths also from 200 nm to 600 nm, but with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths. Emission-excitation fluorescence spectra were measured using a Varian Cary Eclipse Spectrometer, over an excitation wavelength range from 200 nm to 600 nm with intervals of 5 nm, and emission wavelength range also from 200 nm to 600 nm, but with intervals of 2 nm, giving a total of 81 excitation and 201 emission wavelengths. Data was collected using the software Cary Eclipse Bio, Package 1.1.
- Spectra pre-processing and chemometrics calculations were performed in Matlab 7.2 (MathWorks, U.S.A.) using PLS toolbox 5.5 (Eigenvector, U.S.A.) and Simca P+ 12.01 (Umetrics, Sweden). Rayleigh and Raman scatterings were removed using the EEMscat algorithm (Bahram et al, 2006).
- Multivariate data analysis was performed using PCA (Principal Component Analysis) and PLS (Partial Least Squares). These techniques are based on the reduction of dimensionality present in the data, allowing the retrieval of relevant information hidden in the massive amount of data. It is made transforming the original measured variables into new variables called principal components. The PCA analysis was used to find patterns in the spectra. With the aim to relate these patterns with a particular parameter, PLS analysis was carried out to build a mathematical model able to predict the values of this parameter in future samples using only the spectral information.
- In order to build reliable models, the quality of analytical measurements has fundamental importance. Since noise and unwanted information are intrinsic to the measurements, it is necessary to pre-treat the obtained spectra.
- One of the most common techniques to deal with these problems in the NIR spectra is the Savitzky-Golay smoothing filter (Savitzky, A. and Golay, M. J. E., Anal. Chem., 36 (1964) 1627-1639), and it is commonly used in conjunction with derivatives, which has the advantage of reduce baseline shifts and enhance the significant properties of the spectrum.
- For fluorescence spectra, the major problems are related to the Raman and Rayleigh scattering, which are caused by deviations of the light that are not related to the fluorescence properties of the sample. Since the wavelength regions affected by scattering are known, the intensities measured in such particular regions can be removed replacing it by interpolated points.
- The three-way emission-excitation spectra were unfolded with the purpose of have a matrix suitable to the PLS and PCA analysis. A Parafac based three way analysis was also done for calibration purposes. (Bahram, M., et al., J. Chemometrics, 20 (2006) 99-105). The unfolding approach consists in concatenating two of these three dimensions, keeping the other fixed. In this case, the emission and excitation axis were concatenated, maintaining the information of the samples.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/712,378 US10816477B2 (en) | 2010-11-05 | 2017-09-22 | Infrared and fluorescence spectroscopic finger-printing of raw materials for use in the cultivation of a mammalian cell expressing a protein of interest |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10190193 | 2010-11-05 | ||
EP10190193.2 | 2010-11-05 | ||
PCT/EP2011/069267 WO2012059520A1 (en) | 2010-11-05 | 2011-11-03 | Spectroscopic finger-printing of raw materials |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2011/069267 Continuation WO2012059520A1 (en) | 2010-11-05 | 2011-11-03 | Spectroscopic finger-printing of raw materials |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/712,378 Continuation US10816477B2 (en) | 2010-11-05 | 2017-09-22 | Infrared and fluorescence spectroscopic finger-printing of raw materials for use in the cultivation of a mammalian cell expressing a protein of interest |
Publications (1)
Publication Number | Publication Date |
---|---|
US20140032127A1 true US20140032127A1 (en) | 2014-01-30 |
Family
ID=43734824
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/886,869 Abandoned US20140032127A1 (en) | 2010-11-05 | 2013-05-03 | Spectroscopic finger-printing of raw materials |
US15/712,378 Active 2032-06-20 US10816477B2 (en) | 2010-11-05 | 2017-09-22 | Infrared and fluorescence spectroscopic finger-printing of raw materials for use in the cultivation of a mammalian cell expressing a protein of interest |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/712,378 Active 2032-06-20 US10816477B2 (en) | 2010-11-05 | 2017-09-22 | Infrared and fluorescence spectroscopic finger-printing of raw materials for use in the cultivation of a mammalian cell expressing a protein of interest |
Country Status (12)
Country | Link |
---|---|
US (2) | US20140032127A1 (en) |
EP (1) | EP2635892B1 (en) |
JP (1) | JP5683713B2 (en) |
KR (1) | KR101507252B1 (en) |
CN (1) | CN103201616B (en) |
BR (1) | BR112013010993B1 (en) |
CA (1) | CA2815612C (en) |
ES (1) | ES2506390T3 (en) |
HK (1) | HK1187110A1 (en) |
MX (1) | MX341795B (en) |
RU (1) | RU2593005C2 (en) |
WO (1) | WO2012059520A1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160345302A1 (en) * | 2014-01-22 | 2016-11-24 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and Apparatus for Extending Signaling in a Wireless Communication Network |
US20170177835A1 (en) * | 2013-12-27 | 2017-06-22 | Hoffmann-La Roche Inc. | Method and system for preparing synthetic multicomponent biotechnological and chemical process samples |
CN107941745A (en) * | 2017-11-16 | 2018-04-20 | 赣州市检验检疫科学技术研究院 | Method based near infrared spectrum differential staining orange |
CN108120696A (en) * | 2017-12-18 | 2018-06-05 | 福建中医药大学 | A kind of discrimination method of the big roundleaf roxburgh anoectochilus terminal bud of different planting |
CN108132224A (en) * | 2017-12-18 | 2018-06-08 | 福建中医药大学 | A kind of discrimination method of the roxburgh anoectochilus terminal bud of different planting |
CN108132223A (en) * | 2017-12-18 | 2018-06-08 | 福建中医药大学 | A kind of discrimination method of the red rosy clouds roxburgh anoectochilus terminal bud of different planting |
CN108152245A (en) * | 2017-12-18 | 2018-06-12 | 福建中医药大学 | A kind of discrimination method of roxburgh anoectochilus terminal bud and its mixed adulterant |
CN108169166A (en) * | 2017-12-18 | 2018-06-15 | 福建中医药大学 | A kind of discrimination method of the sharp leaf roxburgh anoectochilus terminal bud of different planting |
CN108169167A (en) * | 2017-12-18 | 2018-06-15 | 福建中医药大学 | A kind of discrimination method of the anoectochilus formosanus of different planting |
CN108780473A (en) * | 2016-01-21 | 2018-11-09 | 蛋白质动态解决方案有限责任公司 | Method and system for spectral data analysis |
WO2021105640A1 (en) * | 2019-11-29 | 2021-06-03 | Universite Du Mans | Method for the quick identification of microorganisms by analysis of excitation-emission matrices |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102998294B (en) * | 2012-12-20 | 2014-10-22 | 中国环境科学研究院 | Three-dimensional spectroscopic data correction method |
CN103499552A (en) * | 2013-10-23 | 2014-01-08 | 天津工业大学 | Fast and intelligent waste plastic sorting method |
US10527551B2 (en) * | 2013-12-30 | 2020-01-07 | Baxalta Incorporated | Method of predicting a performance characteristic of a plant or yeast hydrolysate and its use |
CN105424634A (en) * | 2015-10-29 | 2016-03-23 | 中国计量学院 | Water quality COD detector based on optical fiber coupling ultraviolet light source and prediction model optimization system of water quality COD detector |
GB201806752D0 (en) * | 2018-04-25 | 2018-06-06 | Ge Healthcare Bioprocess R&D Ab | Method in bioprocess system |
JP7190103B2 (en) * | 2018-09-03 | 2022-12-15 | 株式会社サタケ | How to distinguish rice production area |
KR20210022319A (en) | 2019-08-20 | 2021-03-03 | 삼성전자주식회사 | Apparatus and method for estimating bio-information |
WO2021049044A1 (en) * | 2019-09-13 | 2021-03-18 | エピストラ株式会社 | Medium manufacturing method, medium manufacturing parameter determination method, medium, and program |
JP6977977B1 (en) * | 2020-06-24 | 2021-12-08 | エピストラ株式会社 | Medium abnormality detection device and abnormality detection method |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5498875A (en) * | 1994-08-17 | 1996-03-12 | Beckman Instruments, Inc. | Signal processing for chemical analysis of samples |
US7715002B2 (en) | 2007-01-23 | 2010-05-11 | Bionorica Ag | Method for classifying scientific materials such as silicate materials, polymer materials and/or nanomaterials |
WO2008146056A1 (en) | 2007-05-30 | 2008-12-04 | Ruder Boskovic Institute | A method for determining importance of fractions of biological mixtures separated by a chromatographic method for discrimination of cell or tissue physiological conditions |
WO2009061326A1 (en) | 2007-11-09 | 2009-05-14 | Wyeth | Evaluation of chromatographic materials |
TW200943092A (en) | 2007-12-21 | 2009-10-16 | Mks Instr Inc | Hierarchically organizing data using a partial least squares analysis (PLS-trees) |
EP2128599A1 (en) * | 2008-05-28 | 2009-12-02 | Université de Liège | Analysing spectral data for the selection of a calibration model |
US7983874B2 (en) * | 2008-06-10 | 2011-07-19 | National University Of Ireland, Galway | Similarity index: a rapid classification method for multivariate data arrays |
CN102272602B (en) | 2008-10-31 | 2014-03-12 | 生物梅里埃公司 | Methods for isolation and identification of microorganisms |
EP2361377B1 (en) * | 2008-10-31 | 2018-01-31 | Biomerieux, Inc | Method for identification of microorganisms using raman spectroscopy |
CN101846617A (en) * | 2009-12-29 | 2010-09-29 | 中国科学院地球化学研究所 | Sterile detection method of cane sugar content in culture media based on spectrum analysis |
-
2011
- 2011-11-03 RU RU2013123903/15A patent/RU2593005C2/en active
- 2011-11-03 CN CN201180053140.XA patent/CN103201616B/en active Active
- 2011-11-03 KR KR1020137011378A patent/KR101507252B1/en active IP Right Grant
- 2011-11-03 JP JP2013537127A patent/JP5683713B2/en active Active
- 2011-11-03 ES ES11778872.9T patent/ES2506390T3/en active Active
- 2011-11-03 WO PCT/EP2011/069267 patent/WO2012059520A1/en active Application Filing
- 2011-11-03 EP EP11778872.9A patent/EP2635892B1/en active Active
- 2011-11-03 CA CA2815612A patent/CA2815612C/en not_active Expired - Fee Related
- 2011-11-03 MX MX2013004882A patent/MX341795B/en active IP Right Grant
- 2011-11-03 BR BR112013010993-9A patent/BR112013010993B1/en not_active IP Right Cessation
-
2013
- 2013-05-03 US US13/886,869 patent/US20140032127A1/en not_active Abandoned
-
2014
- 2014-01-06 HK HK14100093.0A patent/HK1187110A1/en unknown
-
2017
- 2017-09-22 US US15/712,378 patent/US10816477B2/en active Active
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170177835A1 (en) * | 2013-12-27 | 2017-06-22 | Hoffmann-La Roche Inc. | Method and system for preparing synthetic multicomponent biotechnological and chemical process samples |
US20160345302A1 (en) * | 2014-01-22 | 2016-11-24 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and Apparatus for Extending Signaling in a Wireless Communication Network |
CN108780473A (en) * | 2016-01-21 | 2018-11-09 | 蛋白质动态解决方案有限责任公司 | Method and system for spectral data analysis |
US11626188B2 (en) | 2016-01-21 | 2023-04-11 | Protein Dynamic Solutions, Inc. | Method and system for spectral data analysis |
CN107941745A (en) * | 2017-11-16 | 2018-04-20 | 赣州市检验检疫科学技术研究院 | Method based near infrared spectrum differential staining orange |
CN108120696A (en) * | 2017-12-18 | 2018-06-05 | 福建中医药大学 | A kind of discrimination method of the big roundleaf roxburgh anoectochilus terminal bud of different planting |
CN108152245A (en) * | 2017-12-18 | 2018-06-12 | 福建中医药大学 | A kind of discrimination method of roxburgh anoectochilus terminal bud and its mixed adulterant |
CN108169166A (en) * | 2017-12-18 | 2018-06-15 | 福建中医药大学 | A kind of discrimination method of the sharp leaf roxburgh anoectochilus terminal bud of different planting |
CN108169167A (en) * | 2017-12-18 | 2018-06-15 | 福建中医药大学 | A kind of discrimination method of the anoectochilus formosanus of different planting |
CN108132223A (en) * | 2017-12-18 | 2018-06-08 | 福建中医药大学 | A kind of discrimination method of the red rosy clouds roxburgh anoectochilus terminal bud of different planting |
CN108132224A (en) * | 2017-12-18 | 2018-06-08 | 福建中医药大学 | A kind of discrimination method of the roxburgh anoectochilus terminal bud of different planting |
WO2021105640A1 (en) * | 2019-11-29 | 2021-06-03 | Universite Du Mans | Method for the quick identification of microorganisms by analysis of excitation-emission matrices |
FR3103900A1 (en) * | 2019-11-29 | 2021-06-04 | Universite Du Mans | Method for rapid identification of microorganisms by excitation-emission matrix analysis |
Also Published As
Publication number | Publication date |
---|---|
BR112013010993B1 (en) | 2020-02-18 |
WO2012059520A1 (en) | 2012-05-10 |
MX2013004882A (en) | 2013-07-02 |
KR101507252B1 (en) | 2015-03-30 |
BR112013010993A2 (en) | 2016-08-23 |
US20180202938A1 (en) | 2018-07-19 |
EP2635892B1 (en) | 2014-08-27 |
RU2593005C2 (en) | 2016-07-27 |
MX341795B (en) | 2016-09-02 |
ES2506390T3 (en) | 2014-10-13 |
US10816477B2 (en) | 2020-10-27 |
EP2635892A1 (en) | 2013-09-11 |
HK1187110A1 (en) | 2014-03-28 |
CA2815612A1 (en) | 2012-05-10 |
JP5683713B2 (en) | 2015-03-11 |
CN103201616B (en) | 2015-11-25 |
KR20130079571A (en) | 2013-07-10 |
RU2013123903A (en) | 2014-12-10 |
CA2815612C (en) | 2019-01-08 |
JP2013544353A (en) | 2013-12-12 |
CN103201616A (en) | 2013-07-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10816477B2 (en) | Infrared and fluorescence spectroscopic finger-printing of raw materials for use in the cultivation of a mammalian cell expressing a protein of interest | |
Rowland‐Jones et al. | Comparison of spectroscopy technologies for improved monitoring of cell culture processes in miniature bioreactors | |
Jose et al. | Predicting Mab product yields from cultivation media components, using near‐infrared and 2D‐fluorescence spectroscopies | |
Zeng et al. | Quantitative visualization of photosynthetic pigments in tea leaves based on Raman spectroscopy and calibration model transfer | |
Xiaobo et al. | Genetic algorithm interval partial least squares regression combined successive projections algorithm for variable selection in near-infrared quantitative analysis of pigment in cucumber leaves | |
Guo et al. | Extended multiplicative signal correction based model transfer for Raman spectroscopy in biological applications | |
Wei et al. | Application of terahertz spectrum and interval partial least squares method in the identification of genetically modified soybeans | |
Guo et al. | Vis-NIR wavelength selection for non-destructive discriminant analysis of breed screening of transgenic sugarcane | |
Hakemeyer et al. | Near‐infrared and two‐dimensional fluorescence spectroscopy monitoring of monoclonal antibody fermentation media quality: aged media decreases cell growth | |
Shao et al. | Identification of pesticide varieties by detecting characteristics of Chlorella pyrenoidosa using Visible/Near infrared hyperspectral imaging and Raman microspectroscopy technology | |
Mishra et al. | Improved prediction of potassium and nitrogen in dried bell pepper leaves with visible and near-infrared spectroscopy utilising wavelength selection techniques | |
Lin et al. | Rice freshness identification based on visible near-infrared spectroscopy and colorimetric sensor array | |
Hu et al. | A non-destructive terahertz spectroscopy-based method for transgenic rice seed discrimination via sparse representation | |
CN111751347A (en) | Barley leaf pigment imaging method under powdery mildew stress based on Raman spectrum | |
Song et al. | Effect of controlled humidity and tissue hydration on colon cancer diagnostic via FTIR spectroscopic imaging | |
Li et al. | A feasibility study on quantitative analysis of low concentration methanol by FT-NIR spectroscopy and aquaphotomics | |
Wang et al. | Intelligent detection of hard seeds of snap bean based on hyperspectral imaging | |
Georgiev et al. | RamanSPy: An open-source Python package for integrative Raman spectroscopy data analysis | |
Sun et al. | Data mean and ratio of absorbance to concentration methods: A novel optimization strategy for near infrared spectroscopy modeling | |
Li et al. | Screening soy hydrolysates for the production of a recombinant therapeutic protein in commercial cell line by combined approach of near-infrared spectroscopy and chemometrics | |
Lu et al. | Prediction performance optimization of different resolution and spectral band ranges for characterizing coco-peat substrate available nitrogen | |
Sampaio et al. | Comparative analysis of different transformed Saccharomyces cerevisiae strains based on high-throughput Fourier transform infrared spectroscopy | |
Liu et al. | Determination of Protein Content of Wheat Using Partial Least Squares Regression Based on Near-Infrared Spectroscopy Preprocessing | |
Cozzolino et al. | Instrumental techniques and methods: Their role in plant omics | |
Huang | Calibration Transfer Methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: 4TUNE ENGINEERING LTD., PORTUGAL Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CARDOSO-MENEZES, JOSE;JOSE, GLEDSON EMIDIO;REEL/FRAME:041155/0137 Effective date: 20130405 Owner name: F. HOFFMANN-LA ROCHE AG, SWITZERLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ROCHE DIAGNOSTICS GMBH;REEL/FRAME:041155/0313 Effective date: 20161121 Owner name: ROCHE DIAGNOSTICS GMBH, GERMANY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:HAKEMEYER, CHRISTIAN;STRAUSS, ULRIKE;WERZ, SILKE;REEL/FRAME:041155/0211 Effective date: 20130418 Owner name: F. HOFFMANN-LA ROCHE AG, SWITZERLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:4TUNE ENGINEERING LTD.;REEL/FRAME:041155/0284 Effective date: 20130406 Owner name: HOFFMANN-LA ROCHE INC., NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:F. HOFFMANN-LA ROCHE AG;REEL/FRAME:041155/0445 Effective date: 20161128 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |