CN114502719A - Texturizing Lactococcus lactis with a unique eps gene cluster - Google Patents

Publication number
CN114502719A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080067888.4A
Other languages
English (en)
Inventor
薇拉·库欣娜·波尔森
贡纳尔·欧尔戈德
E·G·穆哈丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Section Hansen Co ltd
Original Assignee
Section Hansen Co ltd

Classifications

    • C12N1/20 Bacteria; Culture media therefor
    • C12N1/205 Bacterial isolates
    • C12N1/04 Preserving or maintaining viable microorganisms
    • C12N15/52 Genes encoding for enzymes or proenzymes
    • C12N9/1048 Glycosyltransferases (2.4)
    • C12N9/1241 Nucleotidyltransferases (2.7.7)
    • C12N9/14 Hydrolases (3)
    • A23C9/1234 Fermented milk preparations; Treatment using microorganisms or enzymes using only microorganisms of the genus lactobacteriaceae; Yoghurt characterised by using a Lactobacillus sp. other than Lactobacillus Bulgaricus, including Bifidobacterium sp.
    • A23C9/1236 Fermented milk preparations; Treatment using microorganisms or enzymes using only microorganisms of the genus lactobacteriaceae; Yoghurt using Leuconostoc, Pediococcus or Streptococcus sp. other than Streptococcus Thermophilus; Artificial sour buttermilk in general
    • A23C11/106 Addition of, or treatment with, microorganisms (under A23C11/103 Milk substitutes containing only proteins from pulses, oilseeds or nuts, e.g. nut milk)
    • A23L11/50 Fermented pulses or legumes; Fermentation of pulses or legumes based on the addition of microorganisms
    • A23L11/60 Drinks from legumes, e.g. lupine drinks
    • A23C2220/202 Genetic engineering of microorganisms used in dairy technology
    • A23C2220/206 Slime forming bacteria; Exopolysaccharide or thickener producing bacteria, ropy cultures, so-called filant strains
    • A23V2400/157 Lactis (under A23V2400/11 Lactobacillus)
    • A23V2400/231 Lactis (under A23V2400/21 Streptococcus, lactococcus)
    • C12R2001/225 Lactobacillus
    • C12R2001/46 Streptococcus; Enterococcus; Lactococcus

Abstract

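The present invention provides novel Lactococcus lactis lactic acid bacteria strains with improved texturizing properties, and methods of producing food products using the strains.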

Description

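Texturizing Lactococcus lactis with a unique EPS gene cluster
Technical Field
The present invention relates to novel Lactococcus lactis lactic acid bacteria (LAB) strains with improved texturizing properties. The invention also relates to methods of preparing food products using the strains and to food products comprising the strains.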
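Background Art
Lactic acid bacteria (LAB) are widely used by the food industry for food fermentation. Converting fresh milk into fermented milk with LAB is a way to extend the shelf life of milk and to provide taste and texture.
Important characteristics of strains used for milk fermentation therefore include rapid acidification, stable (no or low) post-acidification, long shelf life and good texture. Good texture usually means high mouth thickness and viscosity (measured as high shear stress using a rheometer) and high gel firmness.
Some LAB strains contribute markedly to improved texture, which is linked to their ability to produce exopolysaccharides (EPS, also called extracellular polysaccharides). EPS can be capsular (remaining attached to the cell in the form of a capsule) or secreted into the medium. EPS consist either of a single type of sugar (homo-exopolysaccharides) or of repeating units made up of different sugars (hetero-exopolysaccharides). EPS-producing LAB are of interest because EPS act as natural viscosifiers and texture enhancers of fermented foods. Moreover, EPS from food-grade LAB with defined rheological properties have the potential to be developed and exploited as food additives. EPS are well known to improve the rheological properties of LAB-fermented products by affecting viscosity, syneresis, firmness and sensory properties. The primary structural features (monosaccharide type and configuration, glycosidic linkages, non-sugar modifications, charge), the conformation and molecular weight, the amount of polysaccharide, and the interactions of the polysaccharide with other components of the system are all factors that contribute to and influence the technological functionality displayed (Zeidan et al., 2017).
Fermented milk can be produced by mesophilic LAB, for example Lactococcus sp., giving for example sour milk, or by thermophilic LAB, for example Streptococcus thermophilus and Lactobacillus delbrueckii subsp. bulgaricus, giving yoghurt. Dairy products made with mesophilic starters such as Lactococcus lactis, for example fresh cheese, buttermilk, sour milk and sour cream, are popular with consumers. In addition, the market for dairy alternative products is growing, and plant bases fermented with Lactococcus lactis can play a role there. Consumers with lactose intolerance or milk allergy, as well as consumers concerned about milk hormones and cholesterol, animal welfare and the environmental impact of animal-based foods, contribute to the growing demand. Furthermore, plant-based diets are said to be healthier than meat-based diets (Tangyu et al., 2019).
Several texturizing Lactococcus lactis strains have been reported, for example NIZO B40, SMQ-461, Ropy352, JFR1, Lli3 and Lll8 (for a review, see Poulsen et al., 2019). The EPS structure of NIZO B40 has been elucidated, and the genes responsible for EPS biosynthesis have been functionally characterized (Kleerebezem et al., 2002).
Pan and Mei (2010) characterized the EPS produced by a Lactococcus lactis subsp. lactis strain isolated from Chinese pickled cabbage, but it is not known whether this strain can acidify milk and contribute to its texture. No eps genes of the strain were reported (Pan and Mei, 2010). Suzuki et al. (2013) reported the sequences of the highly conserved epsD gene and the strain-specific epsE gene of five Lactococcus strains, two of them from Lactococcus lactis subsp. lactis biovar diacetylactis and two from Lactococcus lactis subsp. cremoris. However, there is no information on the complete eps gene clusters, nor on whether the EPS produced by these strains can enhance milk texture.
WO 2017/108679 relates to the novel strain Lactococcus lactis subsp. lactis DSM 29291, which, according to TADM and rheometer measurements, had the highest shear stress among eight different Lactococcus lactis subsp. lactis strains tested (see Example 1 and Figure 1 of WO 2017/108679).
Because mesophilic cultures are used for fermented dairy products and texture is an important parameter, there is a need for additional texturizing mesophilic strains, in particular improved texturizing mesophilic strains, for example texturizing Lactococcus lactis strains.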
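Summary of the Invention
In a first aspect, the present invention relates to a Lactococcus lactis lactic acid bacterium strain comprising an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises the following nucleotide sequences ((a), (b) and (c), as the case may be) as defined in any one of (i) to (x):
(i)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:11 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:17 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:9 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:13 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:15 (referred to herein as GT3);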
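(ii)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO:199 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by nucleotides 26029-27444 of the complementary strand of SEQ ID NO:199 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO:199 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO:199 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO:199 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO:199 (referred to herein as GT4); and
(c5): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 18528-19508 of SEQ ID NO:199 (referred to herein as GT5);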
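(iii)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:39 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:45 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:37 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:41 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:43 (referred to herein as GT3);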
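(iv)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:163 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:169 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:161 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:165 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:167 (referred to herein as GT3); and
(c4): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:181 (referred to herein as GT4);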
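(v)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO:224 (referred to herein as wzy); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO:224 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO:224 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO:224 (referred to herein as GT3); and
(c4): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 18527-19507 of SEQ ID NO:224 (referred to herein as GT4);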
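(vi)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:67 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:73 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:65 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:69 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:71 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:85 (referred to herein as GT4);
(c5): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:87 (referred to herein as GT5); and
(c6): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:89 (referred to herein as GT6);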
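(vii)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by nucleotides 5833-6927 of SEQ ID NO:244 (referred to herein as wzy); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 4617-5123 of SEQ ID NO:244 (referred to herein as GT1); and
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 5120-5827 of SEQ ID NO:244 (referred to herein as GT2);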
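(viii)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:123 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:129 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:121 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:125 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:127 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:143 (referred to herein as GT4);
(c5): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:145 (referred to herein as GT5); and
(c6): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:147 (referred to herein as GT6);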
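(ix)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by nucleotides 11201-12349 of the complementary strand of SEQ ID NO:257 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by nucleotides 15538-16953 of the complementary strand of SEQ ID NO:257 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 9726-10673 of the complementary strand of SEQ ID NO:257 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 12336-13421 of the complementary strand of SEQ ID NO:257 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 13418-14260 of the complementary strand of SEQ ID NO:257 (referred to herein as GT3);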
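(x)(a): a nucleotide sequence encoding a polypeptide that has polymerase activity and at least 95% identity to the amino acid sequence encoded by nucleotides 10707-11846 of the complementary strand of SEQ ID NO:274 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide that has polysaccharide transporter activity and at least 95% identity to the amino acid sequence encoded by nucleotides 15037-16476 of the complementary strand of SEQ ID NO:274 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides with glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 9232-10179 of the complementary strand of SEQ ID NO:274 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 11833-12918 of the complementary strand of SEQ ID NO:274 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide with at least 95% identity to the amino acid sequence encoded by nucleotides 12915-13757 of the complementary strand of SEQ ID NO:274 (referred to herein as GT3).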
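Each of items (i) to (x) relies on an "at least 95% identity" test between amino acid sequences. This section does not state how identity is to be computed, so the following Python sketch is only an illustration under stated assumptions: the two sequences are already aligned (gaps written as "-"), and identity is taken as identical residues divided by the number of alignment columns. The function names and the choice of denominator are hypothetical and not taken from the source.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pairwise alignment given as two gapped strings.

    Assumption (not from the source): identity = matching residue columns
    divided by all alignment columns, ignoring columns that are gaps in both.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True if the aligned pair reaches the identity threshold (default 95%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions (for example dividing by the length of the shorter sequence) give different percentages, so the definition given elsewhere in the application would take precedence over this sketch.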
The invention also provides the strain Lactococcus lactis subsp. lactis DSM 33192.
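In a second aspect, the present invention relates to a composition comprising at least one texturizing Lactococcus lactis lactic acid bacterium strain of the invention as described above, preferably in combination with one or more other lactic acid bacteria strains, wherein the one or more other lactic acid bacteria strains are capable of:
i) producing fermented milk with a target pH of about 4.55 in about 15 h or less, preferably in about 12 h or less, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature (30°C), inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and held at the inoculation temperature until the target pH of about 4.55 is reached. The "time to reach pH 4.55" of a given lactic acid bacteria strain can thus be calculated;
ii) producing fermented milk with a shear stress of 40 Pa or higher, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and held at the inoculation temperature until pH 4.55 is reached (the time to reach pH 4.55). The fermented milk is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred and measured as described in the present application at a shear rate of 300 s⁻¹, the inoculation temperature being 30°C.
In one embodiment, the target pH may be, for example, pH 4-5, preferably pH 4.3-4.7, more preferably pH 4.4-4.6, even more preferably pH 4.45, pH 4.50 or pH 4.55.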
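The "time to reach pH 4.55" in criterion i) can be read off a logged acidification curve. The Python sketch below assumes, purely for illustration, that pH was logged at discrete time points and that linear interpolation between neighbouring readings is acceptable; the function names and the idea of reporting the two criteria separately are assumptions, not part of the described protocol.

```python
def time_to_ph(times_h, ph_values, target=4.55):
    """Return the first time (in hours) at which the pH curve reaches `target`.

    Linear interpolation between logged readings is an assumption made here.
    Returns None if the target pH is never reached.
    """
    if len(times_h) != len(ph_values) or not times_h:
        raise ValueError("need equally long, non-empty time and pH series")
    if ph_values[0] <= target:
        return times_h[0]
    for i in range(1, len(times_h)):
        t0, t1 = times_h[i - 1], times_h[i]
        p0, p1 = ph_values[i - 1], ph_values[i]
        if p1 <= target < p0:
            # Interpolate within the interval where the curve crosses the target.
            return t0 + (p0 - target) * (t1 - t0) / (p0 - p1)
    return None


def check_criteria(time_h, shear_stress_pa, max_time_h=15.0, min_shear_pa=40.0):
    """Evaluate criteria i) and ii) separately; how they combine is left open."""
    return {
        "reaches_target_ph_in_time": time_h is not None and time_h <= max_time_h,
        "shear_stress_sufficient": shear_stress_pa >= min_shear_pa,
    }
```

For example, `time_to_ph([0, 4, 8, 12], [6.6, 6.0, 5.0, 4.1])` interpolates within the 8-12 h interval, and the result can be passed to `check_criteria` together with a shear stress measured at 300 s⁻¹.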
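Preferably, the composition of the second aspect of the invention comprises a texturizing Lactococcus lactis lactic acid bacterium strain of the invention in combination with:
(i) the lactic acid bacterium strain Lactococcus lactis subsp. cremoris DSM 25485 or a mutant or variant thereof, and/or
(ii) the lactic acid bacterium strain Lactococcus lactis subsp. lactis DSM 33192 or a mutant or variant thereof, and/or
(iii) the lactic acid bacterium strain Lactococcus lactis DSM 33133 or a mutant or variant thereof.
As described in detail below, the composition of the invention may comprise additional components, for example cryoprotectants, lyoprotectants, antioxidants, nutrients, bulking agents, flavorings or mixtures thereof.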
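In a third aspect, the present invention relates to the use of the lactic acid bacteria strains (i) to (x) and/or the composition of the invention for increasing the viscosity of a fermented milk product. The third aspect also relates to the use of Lactococcus lactis subsp. cremoris strain DSM 25485 and/or Lactococcus lactis subsp. lactis strain DSM 33192 for increasing the viscosity of a fermented milk product (measured, as described herein, as shear stress at a shear rate of 300 s⁻¹). The fermented milk product may be a mammalian-based fermented milk product (i.e. the fermented milk base is of mammalian origin) or a plant-based fermented milk product (i.e. the fermented milk base is derived from plants, for example soy milk).
In a fourth aspect, the present invention relates to a method of producing a food product, comprising at least one stage in which at least one lactic acid bacterium strain as defined in the first aspect of the invention and/or the composition as defined in the second aspect of the invention is used. Finally, the invention relates to a food product comprising at least one lactic acid bacterium strain as defined in the first aspect of the invention and/or the composition as defined in the second aspect of the invention and/or strain DSM 33192. As described in detail below, the food product may comprise additional components, for example thickeners or stabilizers or mixtures thereof.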
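Brief Description of the Drawings
Figure 1. Comparative analysis of eps gene clusters. A series of Lactococcus lactis eps gene clusters available on the NCBI website, together with the gene clusters that are the subject of this patent application, were compared at the gene level to assess the degree of similarity between different eps gene clusters. The analysis was based on a fasta file containing the protein sequences encoded in the eps gene clusters of 43 Lactococcus lactis strains. A subset of proteins (for example transposases) was excluded so that the comparison rested only on functionally relevant proteins. The proteins were clustered with cd-hit (http://weizhongli-lab.org/cd-hit/) at an identity cut-off of 0.9, which yielded 270 cd-hit groups from 712 protein sequences. A feature vector (length 270) was generated for each strain from the presence or absence of a protein in each cd-hit group. These feature vectors were used to calculate pairwise Jaccard similarities between all strains, and the pairwise similarities were used to perform agglomerative hierarchical clustering. The "single" linkage method was used because it gives a conservative clustering for assessing the novelty/uniqueness of the eps gene clusters that are the subject of this patent application. With the "single" linkage method, the height at which one cluster joins another can be interpreted as the shortest distance (i.e. the highest similarity) between any pair of strains from the two clusters.
Figure 2. Overview of the Lactococcus lactis eps clusters of the invention. The ORFs are annotated according to their demonstrated or predicted function, based on a BLAST analysis against the refseq protein database run with default parameters on the NCBI web page. GT: glycosyltransferase; IS: transposase; hypot: hypothetical protein.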
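The Figure 1 analysis (presence/absence feature vectors, pairwise Jaccard similarity, single-linkage agglomerative clustering) can be sketched in plain Python as follows. This is a minimal illustration on toy data, not the applicants' actual pipeline: the strain names, the four-element vectors and all function names are hypothetical, and the merge height is taken as Jaccard distance (1 minus similarity), which is an assumption.

```python
from itertools import combinations


def jaccard_similarity(a, b):
    """Jaccard similarity of two binary presence/absence vectors."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 1.0


def single_linkage(vectors):
    """Agglomerative clustering with single linkage.

    `vectors` maps a strain name to its binary feature vector.
    Returns a list of merges (members_a, members_b, height), where height is
    the smallest Jaccard distance between any pair of strains in the two clusters.
    """
    dist = {
        frozenset((p, q)): 1.0 - jaccard_similarity(vectors[p], vectors[q])
        for p, q in combinations(vectors, 2)
    }
    clusters = [frozenset([name]) for name in vectors]
    merges = []
    while len(clusters) > 1:
        best = None
        for c1, c2 in combinations(clusters, 2):
            # Single linkage: cluster distance is the minimum pairwise distance.
            d = min(dist[frozenset((p, q))] for p in c1 for q in c2)
            if best is None or d < best[0]:
                best = (d, c1, c2)
        d, c1, c2 = best
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
        merges.append((sorted(c1), sorted(c2), d))
    return merges


# Toy input: three hypothetical strains scored against four cd-hit groups.
toy = {"A": [1, 1, 0, 0], "B": [1, 1, 1, 0], "C": [0, 0, 0, 1]}
for left, right, height in single_linkage(toy):
    print(left, right, round(height, 3))
```

On real data of the size described (43 strains, vectors of length 270), a library routine such as SciPy's `scipy.cluster.hierarchy.linkage` with `method="single"` could replace the explicit loop; whether the applicants used it is not stated in the source.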
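Brief Description of the Sequence Listing
DSM 33134
SEQ ID NO:91 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33134;
SEQ ID NO:1 sets forth the open reading frame (ORF) of the epsR gene of DSM 33134;
SEQ ID NO:2 sets forth the amino acid sequence encoded by SEQ ID NO:1;
SEQ ID NO:3 sets forth the ORF of the epsX gene of DSM 33134;
SEQ ID NO:4 sets forth the amino acid sequence encoded by SEQ ID NO:3;
SEQ ID NO:5 sets forth the ORF of the epsB gene of DSM 33134;
SEQ ID NO:6 sets forth the amino acid sequence encoded by SEQ ID NO:5;
SEQ ID NO:7 sets forth the ORF of the epsD gene of DSM 33134;
SEQ ID NO:8 sets forth the amino acid sequence encoded by SEQ ID NO:7;
SEQ ID NO:9 sets forth the ORF encoding the putative GT1 protein of DSM 33134;
SEQ ID NO:10 sets forth the amino acid sequence encoded by SEQ ID NO:9;
SEQ ID NO:11 sets forth the ORF of the putative wzy gene of DSM 33134;
SEQ ID NO:12 sets forth the amino acid sequence encoded by SEQ ID NO:11;
SEQ ID NO:13 sets forth the ORF encoding the putative GT2 protein of DSM 33134;
SEQ ID NO:14 sets forth the amino acid sequence encoded by SEQ ID NO:13;
SEQ ID NO:15 sets forth the ORF encoding the putative GT3 protein of DSM 33134;
SEQ ID NO:16 sets forth the amino acid sequence encoded by SEQ ID NO:15;
SEQ ID NO:17 sets forth the ORF of the putative wzx gene of DSM 33134;
SEQ ID NO:18 sets forth the amino acid sequence encoded by SEQ ID NO:17;
SEQ ID NO:19 sets forth the ORF of the epsL gene of DSM 33134;
SEQ ID NO:20 sets forth the amino acid sequence encoded by SEQ ID NO:19;
SEQ ID NO:21 sets forth the ORF encoding the putative LytR family transcriptional regulator protein of DSM 33134;
SEQ ID NO:22 sets forth the amino acid sequence encoded by SEQ ID NO:21;
SEQ ID NO:23 sets forth the ORF encoding the putative nucleotide sugar dehydrogenase protein of DSM 33134;
SEQ ID NO:24 sets forth the amino acid sequence encoded by SEQ ID NO:23;
SEQ ID NO:25 sets forth the ORF of the epsC gene of DSM 33134;
SEQ ID NO:26 sets forth the amino acid sequence encoded by SEQ ID NO:25;
SEQ ID NO:27 sets forth the ORF of the epsE gene of DSM 33134;
SEQ ID NO:28 sets forth the amino acid sequence encoded by SEQ ID NO:27;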
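DSM 33136
SEQ ID NO:92 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33136;
SEQ ID NO:29 sets forth the open reading frame (ORF) of the epsR gene of DSM 33136;
SEQ ID NO:30 sets forth the amino acid sequence encoded by SEQ ID NO:29;
SEQ ID NO:31 sets forth the ORF of the epsX gene of DSM 33136;
SEQ ID NO:32 sets forth the amino acid sequence encoded by SEQ ID NO:31;
SEQ ID NO:33 sets forth the ORF of the epsB gene of DSM 33136;
SEQ ID NO:34 sets forth the amino acid sequence encoded by SEQ ID NO:33;
SEQ ID NO:35 sets forth the ORF of the epsD gene of DSM 33136;
SEQ ID NO:36 sets forth the amino acid sequence encoded by SEQ ID NO:35;
SEQ ID NO:37 sets forth the ORF encoding the putative GT1 protein of DSM 33136;
SEQ ID NO:38 sets forth the amino acid sequence encoded by SEQ ID NO:37;
SEQ ID NO:39 sets forth the ORF of the putative wzy gene of DSM 33136;
SEQ ID NO:40 sets forth the amino acid sequence encoded by SEQ ID NO:39;
SEQ ID NO:41 sets forth the ORF encoding the putative GT2 protein of DSM 33136;
SEQ ID NO:42 sets forth the amino acid sequence encoded by SEQ ID NO:41;
SEQ ID NO:43 sets forth the ORF encoding the putative GT3 protein of DSM 33136;
SEQ ID NO:44 sets forth the amino acid sequence encoded by SEQ ID NO:43;
SEQ ID NO:45 sets forth the ORF of the putative wzx gene of DSM 33136;
SEQ ID NO:46 sets forth the amino acid sequence encoded by SEQ ID NO:45;
SEQ ID NO:47 sets forth the ORF of the epsL gene of DSM 33136;
SEQ ID NO:48 sets forth the amino acid sequence encoded by SEQ ID NO:47;
SEQ ID NO:49 sets forth the ORF encoding the putative LytR family transcriptional regulator protein of DSM 33136;
SEQ ID NO:50 sets forth the amino acid sequence encoded by SEQ ID NO:49;
SEQ ID NO:51 sets forth the ORF encoding the putative polysaccharide pyruvyltransferase family protein of DSM 33136;
SEQ ID NO:52 sets forth the amino acid sequence encoded by SEQ ID NO:51;
SEQ ID NO:53 sets forth the ORF of the epsC gene of DSM 33136;
SEQ ID NO:54 sets forth the amino acid sequence encoded by SEQ ID NO:53;
SEQ ID NO:55 sets forth the ORF of the epsE gene of DSM 33136;
SEQ ID NO:56 sets forth the amino acid sequence encoded by SEQ ID NO:55;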
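DSM 33139
SEQ ID NO:221 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33139;
SEQ ID NO:57 sets forth the open reading frame (ORF) of the epsR gene of DSM 33139;
SEQ ID NO:58 sets forth the amino acid sequence encoded by SEQ ID NO:57;
SEQ ID NO:59 sets forth the ORF of the epsX gene of DSM 33139;
SEQ ID NO:60 sets forth the amino acid sequence encoded by SEQ ID NO:59;
SEQ ID NO:61 sets forth the ORF of the epsB gene of DSM 33139;
SEQ ID NO:62 sets forth the amino acid sequence encoded by SEQ ID NO:61;
SEQ ID NO:63 sets forth the ORF of the epsD gene of DSM 33139;
SEQ ID NO:64 sets forth the amino acid sequence encoded by SEQ ID NO:63;
SEQ ID NO:65 sets forth the ORF encoding the putative GT1 protein of DSM 33139;
SEQ ID NO:66 sets forth the amino acid sequence encoded by SEQ ID NO:65;
SEQ ID NO:67 sets forth the ORF of the putative wzy gene of DSM 33139;
SEQ ID NO:68 sets forth the amino acid sequence encoded by SEQ ID NO:67;
SEQ ID NO:69 sets forth the ORF encoding the putative GT2 protein of DSM 33139;
SEQ ID NO:70 sets forth the amino acid sequence encoded by SEQ ID NO:69;
SEQ ID NO:71 sets forth the ORF encoding the putative GT3 protein of DSM 33139;
SEQ ID NO:72 sets forth the amino acid sequence encoded by SEQ ID NO:71;
SEQ ID NO:73 sets forth the ORF of the putative wzx gene of DSM 33139;
SEQ ID NO:74 sets forth the amino acid sequence encoded by SEQ ID NO:73;
SEQ ID NO:75 sets forth the ORF of the epsL gene of DSM 33139;
SEQ ID NO:76 sets forth the amino acid sequence encoded by SEQ ID NO:75;
SEQ ID NO:77 sets forth the ORF encoding the putative LytR family transcriptional regulator protein of DSM 33139;
SEQ ID NO:78 sets forth the amino acid sequence encoded by SEQ ID NO:77;
SEQ ID NO:79 sets forth the ORF encoding the putative nucleotide sugar dehydrogenase protein of DSM 33139;
SEQ ID NO:80 sets forth the amino acid sequence encoded by SEQ ID NO:79;
SEQ ID NO:81 sets forth the ORF of the epsC gene of DSM 33139;
SEQ ID NO:82 sets forth the amino acid sequence encoded by SEQ ID NO:81;
SEQ ID NO:83 sets forth the ORF of the epsE gene of DSM 33139;
SEQ ID NO:84 sets forth the amino acid sequence encoded by SEQ ID NO:83;
SEQ ID NO:85 sets forth the ORF encoding the putative GT4 protein of DSM 33139;
SEQ ID NO:86 sets forth the amino acid sequence encoded by SEQ ID NO:85;
SEQ ID NO:87 sets forth the ORF encoding the putative GT5 protein of DSM 33139;
SEQ ID NO:88 sets forth the amino acid sequence encoded by SEQ ID NO:87;
SEQ ID NO:89 sets forth the ORF encoding the putative GT6 protein of DSM 33139;
SEQ ID NO:90 sets forth the amino acid sequence encoded by SEQ ID NO:89;
SEQ ID NO:93 sets forth the ORF encoding the putative NAD-dependent epimerase/dehydratase family protein 1 of DSM 33139;
SEQ ID NO:94 sets forth the amino acid sequence encoded by SEQ ID NO:93;
SEQ ID NO:95 sets forth the ORF encoding the putative glucose-1-phosphate thymidylyltransferase RfbA protein of DSM 33139;
SEQ ID NO:96 sets forth the amino acid sequence encoded by SEQ ID NO:95;
SEQ ID NO:97 sets forth the ORF encoding the putative dTDP-glucose 4,6-dehydratase protein of DSM 33139;
SEQ ID NO:98 sets forth the amino acid sequence encoded by SEQ ID NO:97;
SEQ ID NO:99 sets forth the ORF encoding the putative dTDP-4-dehydrorhamnose 3,5-epimerase protein of DSM 33139;
SEQ ID NO:100 sets forth the amino acid sequence encoded by SEQ ID NO:99;
SEQ ID NO:101 sets forth the ORF encoding the putative NAD-dependent epimerase/dehydratase family protein 2 of DSM 33139;
SEQ ID NO:102 sets forth the amino acid sequence encoded by SEQ ID NO:101;
SEQ ID NO:103 sets forth the ORF encoding the putative dTDP-4-dehydrorhamnose reductase protein of DSM 33139;
SEQ ID NO:104 sets forth the amino acid sequence encoded by SEQ ID NO:103;
SEQ ID NO:105 sets forth the ORF encoding the putative nucleotidyltransferase protein of DSM 33139;
SEQ ID NO:106 sets forth the amino acid sequence encoded by SEQ ID NO:105;
SEQ ID NO:107 sets forth the ORF encoding the putative acyltransferase 2 protein of DSM 33139;
SEQ ID NO:108 sets forth the amino acid sequence encoded by SEQ ID NO:107;
SEQ ID NO:109 sets forth the ORF encoding the putative DUF1972 domain-containing protein of DSM 33139;
SEQ ID NO:110 sets forth the amino acid sequence encoded by SEQ ID NO:109;
SEQ ID NO:111 sets forth the ORF encoding the putative acyltransferase 1 protein of DSM 33139;
SEQ ID NO:112 sets forth the amino acid sequence encoded by SEQ ID NO:111;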
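DSM 33141
SEQ ID NO:222 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33141;
SEQ ID NO:113 sets forth the open reading frame (ORF) of the epsR gene of DSM 33141;
SEQ ID NO:114 sets forth the amino acid sequence encoded by SEQ ID NO:113;
SEQ ID NO:115 sets forth the ORF of the epsX gene of DSM 33141;
SEQ ID NO:116 sets forth the amino acid sequence encoded by SEQ ID NO:115;
SEQ ID NO:117 sets forth the ORF of the epsB gene of DSM 33141;
SEQ ID NO:118 sets forth the amino acid sequence encoded by SEQ ID NO:117;
SEQ ID NO:119 sets forth the ORF of the epsD gene of DSM 33141;
SEQ ID NO:120 sets forth the amino acid sequence encoded by SEQ ID NO:119;
SEQ ID NO:121 sets forth the ORF encoding the putative GT1 protein of DSM 33141;
SEQ ID NO:122 sets forth the amino acid sequence encoded by SEQ ID NO:121;
SEQ ID NO:123 sets forth the ORF of the putative wzy gene of DSM 33141;
SEQ ID NO:124 sets forth the amino acid sequence encoded by SEQ ID NO:123;
SEQ ID NO:125 sets forth the ORF encoding the putative GT2 protein of DSM 33141;
SEQ ID NO:126 sets forth the amino acid sequence encoded by SEQ ID NO:125;
SEQ ID NO:127 sets forth the ORF encoding the putative GT3 protein of DSM 33141;
SEQ ID NO:128 sets forth the amino acid sequence encoded by SEQ ID NO:127;
SEQ ID NO:129 sets forth the ORF of the putative wzx gene of DSM 33141;
SEQ ID NO:130 sets forth the amino acid sequence encoded by SEQ ID NO:129;
SEQ ID NO:131 sets forth the ORF of the epsL gene of DSM 33141;
SEQ ID NO:132 sets forth the amino acid sequence encoded by SEQ ID NO:131;
SEQ ID NO:133 sets forth the ORF encoding the putative LytR family transcriptional regulator protein of DSM 33141;
SEQ ID NO:134 sets forth the amino acid sequence encoded by SEQ ID NO:133;
SEQ ID NO:135 sets forth the ORF encoding the putative nucleotide sugar dehydrogenase protein of DSM 33141;
SEQ ID NO:136 sets forth the amino acid sequence encoded by SEQ ID NO:135;
SEQ ID NO:137 sets forth the ORF of the epsC gene of DSM 33141;
SEQ ID NO:138 sets forth the amino acid sequence encoded by SEQ ID NO:137;
SEQ ID NO:139 sets forth the ORF of the epsE1 gene of DSM 33141;
SEQ ID NO:140 sets forth the amino acid sequence encoded by SEQ ID NO:139;
SEQ ID NO:141 sets forth the ORF of the epsE2 gene of DSM 33141;
SEQ ID NO:142 sets forth the amino acid sequence encoded by SEQ ID NO:141;
SEQ ID NO:143 sets forth the ORF encoding the putative GT4 protein of DSM 33141;
SEQ ID NO:144 sets forth the amino acid sequence encoded by SEQ ID NO:143;
SEQ ID NO:145 sets forth the ORF encoding the putative GT5 protein of DSM 33141;
SEQ ID NO:146 sets forth the amino acid sequence encoded by SEQ ID NO:145;
SEQ ID NO:147 sets forth the ORF encoding the putative GT6 protein of DSM 33141;
SEQ ID NO:148 sets forth the amino acid sequence encoded by SEQ ID NO:147;
SEQ ID NO:149 sets forth the ORF encoding a putative acyltransferase protein of DSM 33141;
SEQ ID NO:150 sets forth the amino acid sequence encoded by SEQ ID NO:149;
SEQ ID NO:151 sets forth the ORF encoding a putative acyltransferase protein of DSM 33141;
SEQ ID NO:152 sets forth the amino acid sequence encoded by SEQ ID NO:151;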
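DSM 33137
SEQ ID NO:223 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33137;
SEQ ID NO:153 sets forth the open reading frame (ORF) of the epsR gene of DSM 33137;
SEQ ID NO:154 sets forth the amino acid sequence encoded by SEQ ID NO:153;
SEQ ID NO:155 sets forth the ORF of the epsX gene of DSM 33137;
SEQ ID NO:156 sets forth the amino acid sequence encoded by SEQ ID NO:155;
SEQ ID NO:157 sets forth the ORF of the epsB gene of DSM 33137;
SEQ ID NO:158 sets forth the amino acid sequence encoded by SEQ ID NO:157;
SEQ ID NO:159 sets forth the ORF of the epsD gene of DSM 33137;
SEQ ID NO:160 sets forth the amino acid sequence encoded by SEQ ID NO:159;
SEQ ID NO:161 sets forth the ORF encoding the putative GT1 protein of DSM 33137;
SEQ ID NO:162 sets forth the amino acid sequence encoded by SEQ ID NO:161;
SEQ ID NO:163 sets forth the ORF of the putative wzy gene of DSM 33137;
SEQ ID NO:164 sets forth the amino acid sequence encoded by SEQ ID NO:163;
SEQ ID NO:165 sets forth the ORF encoding the putative GT2 protein of DSM 33137;
SEQ ID NO:166 sets forth the amino acid sequence encoded by SEQ ID NO:165;
SEQ ID NO:167 sets forth the ORF encoding the putative GT3 protein of DSM 33137;
SEQ ID NO:168 sets forth the amino acid sequence encoded by SEQ ID NO:167;
SEQ ID NO:169 sets forth the ORF of the putative wzx gene of DSM 33137;
SEQ ID NO:170 sets forth the amino acid sequence encoded by SEQ ID NO:169;
SEQ ID NO:171 sets forth the ORF of the epsL gene of DSM 33137;
SEQ ID NO:172 sets forth the amino acid sequence encoded by SEQ ID NO:171;
SEQ ID NO:173 sets forth the ORF encoding the putative LytR family transcriptional regulator protein of DSM 33137;
SEQ ID NO:174 sets forth the amino acid sequence encoded by SEQ ID NO:173;
SEQ ID NO:175 sets forth the ORF encoding the putative Core-2/I-Branching protein of DSM 33137;
SEQ ID NO:176 sets forth the amino acid sequence encoded by SEQ ID NO:175;
SEQ ID NO:177 sets forth the ORF of the epsC gene of DSM 33137;
SEQ ID NO:178 sets forth the amino acid sequence encoded by SEQ ID NO:177;
SEQ ID NO:179 sets forth the ORF of the epsE gene of DSM 33137;
SEQ ID NO:180 sets forth the amino acid sequence encoded by SEQ ID NO:179;
SEQ ID NO:181 sets forth the ORF encoding the putative GT4 protein of DSM 33137;
SEQ ID NO:182 sets forth the amino acid sequence encoded by SEQ ID NO:181;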
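DSM 33192
SEQ ID NO:183 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33192;
SEQ ID NO:184 sets forth the amino acid sequence encoded by the epsR gene of DSM 33192 (nucleotides 1-318 of SEQ ID NO:183);
SEQ ID NO:185 sets forth the amino acid sequence encoded by the epsX gene of DSM 33192 (nucleotides 407-826 of SEQ ID NO:183);
SEQ ID NO:186 sets forth the amino acid sequence encoded by the epsC gene of DSM 33192 (nucleotides 993-1772 of SEQ ID NO:183);
SEQ ID NO:187 sets forth the amino acid sequence encoded by the epsD gene of DSM 33192 (nucleotides 1782-2477 of SEQ ID NO:183);
SEQ ID NO:188 sets forth the amino acid sequence encoded by the epsB gene of DSM 33192 (nucleotides 2532-3296 of SEQ ID NO:183);
SEQ ID NO:189 sets forth the amino acid sequence encoded by the epsE gene of DSM 33192 (nucleotides 3318-3998 of SEQ ID NO:183);
SEQ ID NO:190 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33192, encoded by nucleotides 4008-4478 of SEQ ID NO:183;
SEQ ID NO:191 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33192, encoded by nucleotides 4478-4960 of SEQ ID NO:183;
SEQ ID NO:192 sets forth the amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33192, encoded by nucleotides 5015-5965 of SEQ ID NO:183;
SEQ ID NO:193 sets forth the amino acid sequence of the putative glycosyltransferase (GT4) of DSM 33192, encoded by nucleotides 6026-6955 of SEQ ID NO:183;
SEQ ID NO:194 sets forth the amino acid sequence encoded by the wzy gene of DSM 33192 (nucleotides 6955-8145 of SEQ ID NO:183);
SEQ ID NO:195 sets forth the amino acid sequence of the glycerophosphotransferase family protein of DSM 33192, encoded by nucleotides 8132-9322 of SEQ ID NO:183;
SEQ ID NO:196 sets forth the amino acid sequence encoded by the wzx gene of DSM 33192 (nucleotides 9309-10727 of SEQ ID NO:183);
SEQ ID NO:197 sets forth the amino acid sequence encoded by the epsL gene of DSM 33192 (nucleotides 10825-11724 of SEQ ID NO:183);
SEQ ID NO:198 sets forth the amino acid sequence of the LytR protein of DSM 33192, encoded by nucleotides 11749-12651 of SEQ ID NO:183;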
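DSM 33135
SEQ ID NO:199 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33135;
SEQ ID NO:200 sets forth the amino acid sequence encoded by the epsR gene of DSM 33135 (nucleotides 1-318 of SEQ ID NO:199);
SEQ ID NO:201 sets forth the amino acid sequence encoded by the epsX gene of DSM 33135 (nucleotides 352-1119 of SEQ ID NO:199);
SEQ ID NO:202 sets forth the amino acid sequence encoded by the epsC gene of DSM 33135 (nucleotides 1159-1938 of SEQ ID NO:199);
SEQ ID NO:203 sets forth the amino acid sequence encoded by the epsD gene of DSM 33135 (nucleotides 1948-2640 of SEQ ID NO:199);
SEQ ID NO:204 sets forth the amino acid sequence encoded by the epsB gene of DSM 33135 (nucleotides 2698-3462 of SEQ ID NO:199);
SEQ ID NO:205 sets forth the amino acid sequence encoded by the epsE gene of DSM 33135 (nucleotides 3484-4170 of SEQ ID NO:199);
SEQ ID NO:206 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33135, encoded by nucleotides 4174-4824 of SEQ ID NO:199;
SEQ ID NO:207 sets forth the amino acid sequence of the putative dTDP-glucose 4,6-dehydratase of DSM 33135, encoded by nucleotides 4784-5695 of SEQ ID NO:199;
SEQ ID NO:208 sets forth the amino acid sequence of the putative dTDP-4-dehydrorhamnose reductase of DSM 33135, encoded by nucleotides 5717-6631 of SEQ ID NO:199;
SEQ ID NO:209 sets forth the amino acid sequence of the putative dTDP-4-dehydrorhamnose 3,5-epimerase of DSM 33135, encoded by nucleotides 6586-7257 of SEQ ID NO:199;
SEQ ID NO:210 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33135, encoded by nucleotides 7276-8508 of SEQ ID NO:199;
SEQ ID NO:211 sets forth the amino acid sequence of the putative DUF1919 protein of DSM 33135, encoded by nucleotides 8515-9144 of SEQ ID NO:199;
SEQ ID NO:212 sets forth the amino acid sequence of the putative UDP-galactopyranose mutase protein of DSM 33135, encoded by nucleotides 9159-10274 of SEQ ID NO:199;
SEQ ID NO:213 sets forth the amino acid sequence of the putative DUF4422 protein of DSM 33135, encoded by nucleotides 10271-11029 of SEQ ID NO:199;
SEQ ID NO:214 sets forth the amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33135, encoded by nucleotides 11042-12391 of SEQ ID NO:199;
SEQ ID NO:215 sets forth the amino acid sequence of the putative glycosyltransferase (GT4) of DSM 33135, encoded by nucleotides 13008-13934 of SEQ ID NO:199;
SEQ ID NO:216 sets forth the amino acid sequence encoded by the wzy gene of DSM 33135 (nucleotides 13939-15042 of SEQ ID NO:199);
SEQ ID NO:217 sets forth the amino acid sequence of the putative glycosyltransferase (GT5) of DSM 33135, encoded by nucleotides 18528-19508 of SEQ ID NO:199;
SEQ ID NO:218 sets forth the amino acid sequence of the lytR protein of DSM 33135, encoded by nucleotides 20389-21291 of SEQ ID NO:199;
SEQ ID NO:219 sets forth the amino acid sequence encoded by the epsL gene of DSM 33135 (nucleotides 24053-24751 of the complementary strand of SEQ ID NO:199);
SEQ ID NO:220 sets forth the amino acid sequence encoded by the wzx gene of DSM 33135 (nucleotides 26029-27444 of the complementary strand of SEQ ID NO:199);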
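Several entries locate a gene by nucleotide coordinates within a cluster sequence, some on the complementary strand. The Python sketch below shows one way such a region could be pulled out of a cluster sequence. It assumes 1-based inclusive coordinates numbered along the listed strand, and that a complementary-strand gene is read as the reverse complement of that region; both conventions are assumptions for illustration, as are the function names.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_orf(cluster_seq: str, start: int, end: int,
                complement: bool = False) -> str:
    """Extract nucleotides start..end (1-based, inclusive) from a cluster.

    With complement=True the region is returned as its reverse complement,
    an assumed reading of "complementary strand" coordinates.
    """
    if not (1 <= start <= end <= len(cluster_seq)):
        raise ValueError("coordinates fall outside the cluster sequence")
    region = cluster_seq[start - 1:end]
    return reverse_complement(region) if complement else region


def codon_count(start: int, end: int) -> int:
    """Number of codons spanned by start..end; raises if not a multiple of 3."""
    length = end - start + 1
    if length % 3:
        raise ValueError("coordinate range is not a whole number of codons")
    return length // 3
```

As a consistency check, `codon_count(13939, 15042)` for the wzy range and `codon_count(26029, 27444)` for the wzx range of SEQ ID NO:199 both succeed, i.e. the listed ranges span whole numbers of codons (a terminal stop codon, if included in the range, is counted as a codon).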
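DSM 33138
SEQ ID NO:224 sets forth the complete sequence of the eps gene cluster of Lactococcus lactis strain DSM 33138;
SEQ ID NO:225 sets forth the amino acid sequence encoded by the epsR gene of DSM 33138 (nucleotides 1-318 of SEQ ID NO:224);
SEQ ID NO:226 sets forth the amino acid sequence encoded by the epsX gene of DSM 33138 (nucleotides 352-1119 of SEQ ID NO:224);
SEQ ID NO:227 sets forth the amino acid sequence encoded by the epsC gene of DSM 33138 (nucleotides 1159-1938 of SEQ ID NO:224);
SEQ ID NO:228 sets forth the amino acid sequence encoded by the epsD gene of DSM 33138 (nucleotides 1948-2640 of SEQ ID NO:224);
SEQ ID NO:229 sets forth the amino acid sequence encoded by the epsB gene of DSM 33138 (nucleotides 2698-3462 of SEQ ID NO:224);
SEQ ID NO:230 sets forth the amino acid sequence encoded by the epsE gene of DSM 33138 (nucleotides 3484-4170 of SEQ ID NO:224);
SEQ ID NO:231 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33138, encoded by nucleotides 4174-4824 of SEQ ID NO:224;
SEQ ID NO:232 sets forth the amino acid sequence of the putative dTDP-glucose 4,6-dehydratase of DSM 33138, encoded by nucleotides 4784-5695 of SEQ ID NO:224;
SEQ ID NO:233 sets forth the amino acid sequence of the putative dTDP-4-dehydrorhamnose reductase of DSM 33138, encoded by nucleotides 5717-6631 of SEQ ID NO:224;
SEQ ID NO:234 sets forth the amino acid sequence of the putative dTDP-4-dehydrorhamnose 3,5-epimerase of DSM 33138, encoded by nucleotides 6586-7257 of SEQ ID NO:224;
SEQ ID NO:235 sets forth the amino acid sequence of the putative DUF1972 protein of DSM 33138, encoded by nucleotides 7276-8508 of SEQ ID NO:224;
SEQ ID NO:236 sets forth the amino acid sequence of the putative DUF1919 protein of DSM 33138, encoded by nucleotides 8515-9144 of SEQ ID NO:224;
SEQ ID NO:237 sets forth the amino acid sequence of the putative UDP-galactopyranose mutase protein of DSM 33138, encoded by nucleotides 9159-10274 of SEQ ID NO:224;
SEQ ID NO:238 sets forth the amino acid sequence of the putative DUF4422 protein of DSM 33138, encoded by nucleotides 10271-11029 of SEQ ID NO:224;
SEQ ID NO:239 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33138, encoded by nucleotides 11042-12391 of SEQ ID NO:224;
SEQ ID NO:240 sets forth the amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33138, encoded by nucleotides 13008-13934 of SEQ ID NO:224;
SEQ ID NO:241 sets forth the amino acid sequence encoded by the wzy gene of DSM 33138 (nucleotides 13939-15042 of SEQ ID NO:224);
SEQ ID NO:242 sets forth the amino acid sequence of the putative glycosyltransferase (GT4) of DSM 33138, encoded by nucleotides 18527-19507 of SEQ ID NO:224;
SEQ ID NO:243 sets forth the amino acid sequence of the lytR protein of DSM 33138, encoded by nucleotides 20388-21290 of SEQ ID NO:224;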
DSM 33140
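SEQ ID NO:244 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33140, complete sequence;
SEQ ID NO:245 sets forth the amino acid sequence encoded by the epsR gene of DSM 33140 (nucleotides 1-318 of SEQ ID NO:244);
SEQ ID NO:246 sets forth the amino acid sequence encoded by the epsX gene of DSM 33140 (nucleotides 352-1119 of SEQ ID NO:244);
SEQ ID NO:247 sets forth the amino acid sequence encoded by the epsC gene of DSM 33140 (nucleotides 1159-1938 of SEQ ID NO:244);
SEQ ID NO:248 sets forth the amino acid sequence encoded by the epsD gene of DSM 33140 (nucleotides 1948-2643 of SEQ ID NO:244);
SEQ ID NO:249 sets forth the amino acid sequence encoded by the epsB gene of DSM 33140 (nucleotides 2698-3462 of SEQ ID NO:244);
SEQ ID NO:250 sets forth the amino acid sequence encoded by the epsE gene of DSM 33140 (nucleotides 3484-4164 of SEQ ID NO:244);
SEQ ID NO:251 sets forth the amino acid sequence of the putative UDP-N-acetylglucosamine-LPS N-acetylglucosamine transferase protein of DSM 33140, encoded by nucleotides 4168-4617 of SEQ ID NO:244;
SEQ ID NO:252 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33140, encoded by nucleotides 4617-5123 of SEQ ID NO:244;
SEQ ID NO:253 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33140, encoded by nucleotides 5120-5827 of SEQ ID NO:244;
SEQ ID NO:254 sets forth the amino acid sequence encoded by the wzy gene of DSM 33140 (nucleotides 5833-6927 of SEQ ID NO:244);
SEQ ID NO:255 sets forth the amino acid sequence encoded by the epsL gene of DSM 33140 (nucleotides 8438-9340 of SEQ ID NO:244);
SEQ ID NO:256 sets forth the amino acid sequence of the lytR protein of DSM 33140, encoded by nucleotides 9365-10267 of the complementary strand of SEQ ID NO:244;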
DSM 33142
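SEQ ID NO:257 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33142, complete sequence;
SEQ ID NO:258 sets forth the amino acid sequence encoded by the epsR gene of DSM 33142 (nucleotides 1-318 of SEQ ID NO:257);
SEQ ID NO:259 sets forth the amino acid sequence encoded by the epsX gene of DSM 33142 (nucleotides 407-1120 of SEQ ID NO:257);
SEQ ID NO:260 sets forth the amino acid sequence encoded by the epsC gene of DSM 33142 (nucleotides 1160-1939 of SEQ ID NO:257);
SEQ ID NO:261 sets forth the amino acid sequence encoded by the epsD gene of DSM 33142 (nucleotides 1949-2644 of SEQ ID NO:257);
SEQ ID NO:262 sets forth the amino acid sequence encoded by the epsB gene of DSM 33142 (nucleotides 2699-3463 of SEQ ID NO:257);
SEQ ID NO:263 sets forth the amino acid sequence encoded by the epsE1 gene of DSM 33142 (nucleotides 3485-4084 of SEQ ID NO:257);
SEQ ID NO:264 sets forth the amino acid sequence encoded by the epsE2 gene of DSM 33142 (nucleotides 4085-4840 of SEQ ID NO:257);
SEQ ID NO:265 sets forth the amino acid sequence of the lytR protein of DSM 33142, encoded by nucleotides 5876-6778 of SEQ ID NO:257;
SEQ ID NO:266 sets forth the amino acid sequence encoded by the epsL gene of DSM 33142 (nucleotides 6803-7717 of the complementary strand of SEQ ID NO:257);
SEQ ID NO:267 sets forth the amino acid sequence of the putative nucleotide sugar dehydrogenase protein of DSM 33142, encoded by nucleotides 7727-8173 of the complementary strand of SEQ ID NO:257;
SEQ ID NO:268 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33142, encoded by nucleotides 9726-10673 of the complementary strand of SEQ ID NO:257;
SEQ ID NO:269 sets forth the amino acid sequence of the putative acyltransferase of DSM 33142, encoded by nucleotides 10657-11211 of the complementary strand of SEQ ID NO:257;
SEQ ID NO:270 sets forth the amino acid sequence encoded by the wzy gene of DSM 33142 (nucleotides 11201-12349 of the complementary strand of SEQ ID NO:257);
SEQ ID NO:271 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33142, encoded by nucleotides 12336-13421 of the complementary strand of SEQ ID NO:257;
SEQ ID NO:272 sets forth the amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33142, encoded by nucleotides 13418-14260 of the complementary strand of SEQ ID NO:257;
SEQ ID NO:273 sets forth the amino acid sequence encoded by the wzx gene of DSM 33142 (nucleotides 15538-16953 of the complementary strand of SEQ ID NO:257);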
DSM 33183
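SEQ ID NO:274 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33183, complete sequence;
SEQ ID NO:275 sets forth the amino acid sequence encoded by the epsR gene of DSM 33183 (nucleotides 1-318 of SEQ ID NO:274);
SEQ ID NO:276 sets forth the amino acid sequence encoded by the epsC gene of DSM 33183 (nucleotides 681-1460 of SEQ ID NO:274);
SEQ ID NO:277 sets forth the amino acid sequence encoded by the epsD gene of DSM 33183 (nucleotides 1470-2165 of SEQ ID NO:274);
SEQ ID NO:278 sets forth the amino acid sequence encoded by the epsB gene of DSM 33183 (nucleotides 2220-3005 of SEQ ID NO:274);
SEQ ID NO:279 sets forth the amino acid sequence encoded by the epsE1 gene of DSM 33183 (nucleotides 2992-3591 of SEQ ID NO:274);
SEQ ID NO:280 sets forth the amino acid sequence encoded by the epsE2 gene of DSM 33183 (nucleotides 3592-4347 of SEQ ID NO:274);
SEQ ID NO:281 sets forth the amino acid sequence of the lytR protein of DSM 33183, encoded by nucleotides 5383-6285 of SEQ ID NO:274;
SEQ ID NO:282 sets forth the amino acid sequence encoded by the epsL gene of DSM 33183 (nucleotides 6310-7224 of the complementary strand of SEQ ID NO:274);
SEQ ID NO:283 sets forth the amino acid sequence of the putative nucleotide sugar dehydrogenase protein of DSM 33183, encoded by nucleotides 7234-7680 of the complementary strand of SEQ ID NO:274;
SEQ ID NO:284 sets forth the amino acid sequence of the putative glycosyltransferase (GT1) of DSM 33183, encoded by nucleotides 9232-10179 of the complementary strand of SEQ ID NO:274;
SEQ ID NO:285 sets forth the amino acid sequence of the putative acyltransferase of DSM 33183, encoded by nucleotides 10163-10717 of the complementary strand of SEQ ID NO:274;
SEQ ID NO:286 sets forth the amino acid sequence encoded by the wzy gene of DSM 33183 (nucleotides 10707-11846 of the complementary strand of SEQ ID NO:274);
SEQ ID NO:287 sets forth the amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33183, encoded by nucleotides 11833-12918 of the complementary strand of SEQ ID NO:274;
SEQ ID NO:288 sets forth the amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33183, encoded by nucleotides 12915-13757 of the complementary strand of SEQ ID NO:274;
SEQ ID NO:289 sets forth the amino acid sequence encoded by the wzx gene of DSM 33183 (nucleotides 15037-16476 of the complementary strand of SEQ ID NO:274);
SEQ ID NO:290 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33133, complete sequence.
SEQ ID NO:291 sets forth the eps gene cluster of Lactococcus lactis strains DSM 33204, 33205, 33220, 33221, 33218, 33219, 33224, 33197, 33196, 33195, 33194, 33226, 33223, 33193 and 33192, complete sequence.
SEQ ID NO:292 sets forth the eps gene cluster of Lactococcus lactis strains DSM 33200, 33201, 33202 and 33203, complete sequence.
SEQ ID NO:293 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33222, complete sequence.
SEQ ID NO:294 sets forth the eps gene cluster of Lactococcus lactis strain DSM 33225, complete sequence.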
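The sequence listing above locates each gene by 1-based, inclusive nucleotide coordinates on a cluster sequence, sometimes on the complementary strand. As an illustrative sketch only (the function names `extract_cds` and `cds_length` are hypothetical and not part of the specification), such a coordinate pair could be resolved against a cluster sequence as follows:

```python
# Hypothetical helpers; the coordinate convention (1-based, inclusive) is an
# assumption based on how the listing quotes ranges such as "1-318".
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def extract_cds(cluster: str, start: int, end: int, complement: bool = False) -> str:
    """Return the sub-sequence for a coordinate range.

    If complement is True, the reverse complement of the range is returned,
    i.e. the coding strand of a gene annotated on the complementary strand.
    """
    if not 1 <= start <= end <= len(cluster):
        raise ValueError("coordinates fall outside the cluster sequence")
    segment = cluster[start - 1:end]
    if complement:
        segment = segment.translate(COMPLEMENT)[::-1]
    return segment


# The wzy range quoted for DSM 33135 spans a whole number of codons.
assert cds_length(13939, 15042) % 3 == 0
```

A quick length check of this kind is a cheap way to spot a mistyped coordinate, since a coding region should normally span a multiple of three nucleotides.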
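Detailed Description
Definitions
All definitions of the relevant terms herein are in accordance with what would be understood by the skilled person in relation to the relevant technical context herein.
In the context of the present invention, the expression "lactic acid bacteria" ("LAB"), in any of its embodiments, denotes food-grade bacteria that produce lactic acid as the major metabolic end product of carbohydrate fermentation. These bacteria are related by their common metabolic and physiological characteristics and are usually Gram-positive, low-GC, acid-tolerant, non-sporulating, non-respiring, rod-shaped bacilli or cocci. During the fermentation stage, the consumption of carbohydrates by these bacteria causes the formation of lactic acid, which lowers the pH and leads to the formation of a protein coagulum. These bacteria are thus responsible for the acidification of milk and for the texture of the dairy product. The industrially most useful lactic acid bacteria are found within the order "Lactobacillales", which includes Lactococcus spp., Streptococcus spp., Lactobacillus spp., Leuconostoc spp., Pediococcus spp. and Propionibacterium spp. These bacteria are frequently used as food cultures, alone or in combination with other lactic acid bacteria.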
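In the present description and claims, a "texturizing strain" means a strain which, under the conditions described below and exemplified in Example 1 herein, preferably produces a fermented mammalian milk having a shear stress of preferably more than 40 Pa, measured at a shear rate of 300 s⁻¹. A Lactococcus lactis strain may be defined as strongly texturizing in that it produces a fermented milk having a shear stress of more than 50 Pa, measured at a shear rate of 300 s⁻¹ under the same conditions. The conditions are as follows: 200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, cooled to the inoculation temperature of 30°C, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and held at the inoculation temperature until pH 4.55 is reached. The fermented milk is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹.
In addition, in the present description and claims, a "texturizing strain" means a strain which, under the conditions described below and exemplified in Example 2 herein, preferably produces a fermented plant-based milk having a shear stress of 24 Pa or higher, preferably 30 Pa or higher, or even more preferably 42 Pa or higher, measured at a shear rate of 300 s⁻¹. A Lactococcus lactis strain may be defined as strongly texturizing in that it produces a fermented milk having a shear stress of 30 Pa or higher, measured at a shear rate of 300 s⁻¹ under the same conditions. As described in Example 2, 1% by volume of an overnight microbial culture (obtained by inoculating the microbial culture at 30°C in M17 broth supplemented with 2% glucose) is inoculated into soy milk containing glucose, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose. Incubation is carried out at 30°C on a 200 ml scale until the target pH is reached, for example pH 4-5, preferably pH 4.3-4.7, more preferably pH 4.4-4.6, even more preferably pH 4.45, pH 4.50 or pH 4.55. The fermented product is then cooled to 4°C and stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹.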
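The thresholds in the two definitions above can be summarised in a small lookup. The following is a minimal sketch, assuming the shear stress has already been measured at 300 s⁻¹ under the stated conditions; the function name and the labels "mammalian" and "plant" are illustrative choices, not terms defined in the specification.

```python
def classify_texturizing(shear_stress_pa: float, milk_base: str) -> str:
    """Classify a strain from the shear stress (Pa) of its fermented milk.

    Thresholds follow the definitions in the text:
      mammalian milk:   > 40 Pa texturizing, > 50 Pa strongly texturizing
      plant-based milk: >= 24 Pa texturizing, >= 30 Pa strongly texturizing
    """
    if milk_base == "mammalian":
        if shear_stress_pa > 50:
            return "strongly texturizing"
        if shear_stress_pa > 40:
            return "texturizing"
    elif milk_base == "plant":
        if shear_stress_pa >= 30:
            return "strongly texturizing"
        if shear_stress_pa >= 24:
            return "texturizing"
    else:
        raise ValueError("milk_base must be 'mammalian' or 'plant'")
    return "not texturizing"
```

Note that the mammalian-milk thresholds are strict ("more than"), whereas the plant-based thresholds are inclusive ("or higher"), which the comparisons above preserve.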
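The texturizing lactic acid bacterium strain of the invention may be an isolated strain, for example isolated from a naturally occurring source, or may be a non-naturally occurring strain, for example one obtained recombinantly. A recombinant strain differs from a naturally occurring strain at least in the presence of the nucleic acid construct used to transform or transfect the mother strain.
The term "sequence identity" relates to the relatedness between two nucleotide sequences or between two amino acid sequences. For the purposes of the present invention, the degree of sequence identity between two nucleotide sequences or two amino acid sequences is determined using the multiple sequence alignment tool Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/; Sievers, F. et al., 2011, "Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega", Mol. Syst. Biol., 7:539) with standard parameters.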
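For orientation, a percent identity can be derived from two already aligned sequences as sketched below. This is an illustration under stated assumptions, not the Clustal Omega implementation: here identity is counted over the alignment columns in which both sequences carry a residue, and the exact denominator used by a given tool may differ. The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption: columns containing a gap in either row are ignored, and
    identity is the share of the remaining columns with identical residues.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Five gap-free columns, four of them identical.
assert percent_identity("MKT-LV", "MKSALV") == 80.0
```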
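In the context of the present invention, the terms "strain derived from", "derived strain" and "mutant" should be understood as a strain derived from a strain of the invention by means of, for example, genetic engineering, radiation and/or chemical treatment, and/or selection, adaptation, screening and the like. Preferably, the derived strain is a functionally equivalent mutant, for example a strain that has substantially the same or improved properties as the mother strain with respect to texturizing capability. Such a derived strain is part of the present invention. In particular, the term "derived strain" or "mutant" refers to a strain obtained by subjecting a strain of the invention to any conventionally used mutagenization treatment, including treatment with a chemical mutagen such as ethyl methanesulfonate (EMS) or N-methyl-N′-nitro-N-nitrosoguanidine (NTG), or UV light, or to a spontaneously occurring mutant. A mutant may have been subjected to several mutagenization treatments (a single treatment should be understood as one mutagenization step followed by a screening/selection step), but it is presently preferred that no more than 20, no more than 10, or no more than 5 treatments are carried out. In a presently preferred derived strain, less than 1%, less than 0.1%, less than 0.01%, less than 0.001% or even less than 0.0001% of the nucleotides in the bacterial genome have been changed (for example by substitution, insertion, deletion or a combination thereof) compared to the mother strain.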
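The term "thermophilic" herein refers to microorganisms that grow best at temperatures above 35°C. The industrially most useful thermophilic bacteria include Streptococcus spp. and Lactobacillus spp. The term "thermophilic fermentation" herein refers to fermentation at a temperature above about 35°C, for example between about 35°C and about 45°C. The term "thermophilic fermented milk product" refers to a fermented milk product prepared by thermophilic fermentation with a thermophilic starter culture, and includes fermented milk products such as set yoghurt, stirred yoghurt and drinking yoghurt, e.g. Yakult. In addition, the term "thermophilic fermented milk product" refers to a fermented milk product prepared by thermophilic fermentation with a thermophilic starter culture in a plant-based milk substrate, for example soy milk or soy milk supplemented with a sugar such as fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose.
The term "mesophilic" herein refers to microorganisms that grow best at moderate temperatures (15°C-35°C). The industrially most useful mesophilic bacteria include Lactococcus spp. and Leuconostoc spp. The term "mesophilic fermentation" herein refers to fermentation at a temperature between about 22°C and about 35°C. The term "mesophilic food product" refers to a food product prepared by mesophilic fermentation with a mesophilic starter culture. The term "mesophilic fermented milk product" refers to a fermented milk product prepared by mesophilic fermentation with a mesophilic starter culture, and includes fermented milk products such as buttermilk, sour milk, cultured milk, smetana, sour cream, kefir and fresh cheese, for example quark, tvarog and cream cheese. In addition, the term "mesophilic fermented milk product" refers to a fermented milk product prepared by mesophilic fermentation with a mesophilic starter culture in a plant-based milk substrate, for example soy milk or soy milk supplemented with a sugar such as fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose.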
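The term "mesophilic starter culture" herein refers to any starter culture that contains at least one mesophilic bacterial strain. Mesophilic starter cultures, for example a combination of Lactococcus lactis subsp. lactis strains and Lactococcus lactis subsp. cremoris strains, are used to produce fermented milk products such as fresh cheese, buttermilk, sour milk and sour cream.
The terms "fermented milk" and "dairy" are used interchangeably herein. In the context of the present invention, the expression "fermented milk product", in any of its embodiments, refers to a food or feed product whose preparation involves fermentation of a milk substrate with lactic acid bacteria. "Fermented milk product" as used herein includes, but is not limited to, products such as the thermophilic fermented milk products and mesophilic fermented milk products defined above. In addition, as stated above, "fermented milk product" as used herein includes products prepared by fermentation of a plant-based milk substrate, such as soy milk or soy milk supplemented with a sugar, for example fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose. The "fermented milk product" of the invention thus encompasses fermented mammalian milk products (i.e., the milk substrate is of mammalian origin) and fermented plant milk products (i.e., the milk substrate is of plant origin, for example a soy milk substrate).
In the context of the present application, the term "milk" is used broadly in its ordinary meaning to denote a liquid produced by the mammary glands of animals (for example cows, sheep, goats, buffaloes, camels and the like) or derived from plants. The term "milk base" or "milk substrate" may be any milk material that can be subjected to fermentation according to the invention. Useful milk bases thus include, but are not limited to, solutions/suspensions of any milk or milk-like product comprising protein, such as whole or low-fat milk, skimmed milk, buttermilk, reconstituted milk powder, condensed milk, milk powder, whey, whey permeate, lactose, mother liquor from the crystallization of lactose, whey protein concentrate, cream or plant-based milk. Obviously, the milk base may originate from any mammal, for example substantially pure mammalian milk or reconstituted milk powder. Plant sources of milk include, but are not limited to, milk extracted from soybeans. Preferably, the plant-based milk is soy milk, which may preferably be supplemented with a sugar such as fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose.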
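Prior to fermentation, the milk substrate may be homogenized and pasteurized according to methods known in the art. "Homogenizing", as used in the context of the present invention in any of its embodiments, means intensive mixing to obtain a soluble suspension or emulsion. If homogenization is performed prior to fermentation, it may be performed so as to break up the milk fat into smaller sizes so that it no longer separates from the milk. This may be accomplished by forcing the milk at high pressure through small orifices.
"Pasteurizing", as used in the context of the present invention in any of its embodiments, means treating the milk base to reduce or eliminate the presence of live organisms such as microorganisms. Preferably, pasteurization is attained by maintaining a specified temperature for a specified period of time. The specified temperature is usually attained by heating. The temperature and duration may be selected so as to kill or inactivate certain bacteria, such as harmful bacteria. A rapid cooling step may follow. For example, the milk base may be heat-treated at 92°C for 3 min (minutes), cooled to 38°C, and then inoculated as described in step i of the method of the invention.
As used herein, the term "about" (or "approximately") means the indicated value ±1% of its value, or the indicated value ±2% of its value, or the indicated value ±5% of its value, or the indicated value ±10% of its value, or the indicated value ±20% of its value, or the indicated value ±30% of its value; preferably, the term "about" means exactly the indicated value (±0%).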
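The tiered meaning of "about" amounts to a relative tolerance around the indicated value. As a purely illustrative sketch (the function name and the default tolerance are assumptions chosen for the example, not values prescribed by the text):

```python
def within_about(value: float, indicated: float, tolerance_pct: float = 10.0) -> bool:
    """True if value lies within indicated ± tolerance_pct % of indicated."""
    return abs(value - indicated) <= abs(indicated) * tolerance_pct / 100.0


# pH 4.60 is within ±2% of an indicated pH of 4.55, but pH 4.70 is not within ±1%.
assert within_about(4.60, 4.55, tolerance_pct=2.0)
assert not within_about(4.70, 4.55, tolerance_pct=1.0)
```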
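Throughout the description and claims, the word "comprise" and variations of the word (for example "comprising", "having", "including", "containing") are generally not limiting and thus do not exclude other features, which may be, for example, technical features, additives, components or steps. However, whenever the word "comprise" is used herein, this also covers a particular embodiment in which the word is understood as limiting; in that particular embodiment the word "comprise" has the meaning of the term "consisting of".
The use of the terms "a", "an" and "the" and similar referents in the context of describing the invention (especially in the context of the following claims) is to be construed as covering both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. Recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or clearly contradicted by context. The use of any and all examples, or exemplary language (for example "such as"), provided herein is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.
Texture is an important quality factor for fermented milk products such as yoghurt, and consumer acceptance is often closely linked to texture properties. The texture of fermented milk depends on the bacteria used for fermentation and on the process parameters. Polysaccharide-producing bacteria can positively influence product characteristics such as texture and sensory properties. Sensory texture attributes often correlate with instrumental texture results; for example, shear stress is related to viscosity and perceived mouth thickness (Poulsen et al., 2019). In the context of the present invention, the rheological properties (texture) of a fermented milk product, such as viscosity, may be measured as a function of the shear stress of the fermented milk product, as described below.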
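In connection with the present invention, shear stress may be measured by the following method: once the pH of the fermented milk (for example mammalian or plant-based milk) has reached about pH 4.55, the fermented milk product is brought to 4°C and gently stirred by hand with a stick fitted with a perforated disc until the sample is homogeneous. The rheological properties of the sample are assessed on a rheometer (Anton Paar Physica Rheometer with ASC, Automatic Sample Changer, Anton Paar GmbH, Austria) using a bob-cup system. The rheometer is set to a constant temperature of 13°C during the measurement. The settings are as follows:
- Holding time (to rebuild to a somewhat original structure)
  - 5 minutes without any physical stress (oscillation or rotation) applied to the sample.
- Oscillation step (to measure the elastic and viscous moduli, G′ and G″ respectively, from which the complex modulus G* is calculated)
  - Constant strain = 0.3%, frequency (f) = [0.5...8] Hz
  - 6 measuring points over 60 s (one every 10 s)
- Rotation step (to measure shear stress at 300 1/s)
  - Two steps were designed:
    1) shear rate = [0.3-300] 1/s and 2) shear rate = [275-0.3] 1/s.
Each step contains 21 measuring points over 210 s (one every 10 s). The shear stress at 300 1/s (300 s⁻¹) is selected for further analysis, since it correlates with mouth thickness when swallowing a fermented milk product.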
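The protocol above mentions that the complex modulus G* is calculated from G′ and G″ and that each rotation step records 21 points at 10 s intervals. A minimal sketch of both calculations follows; the standard relation |G*| = sqrt(G′² + G″²) is used, the function names are hypothetical, and the spacing of the shear-rate values between the end points is deliberately not modelled because the text does not state it.

```python
import math


def complex_modulus(g_storage: float, g_loss: float) -> float:
    """Magnitude of the complex modulus |G*| from G' (elastic) and G'' (viscous)."""
    return math.hypot(g_storage, g_loss)


def measurement_times(n_points: int = 21, interval_s: float = 10.0) -> list[float]:
    """Time stamps (s) of the measuring points within one rotation step."""
    return [interval_s * (i + 1) for i in range(n_points)]


times = measurement_times()
assert len(times) == 21 and times[-1] == 210.0  # 21 points over 210 s
```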
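Preferably, shear stress may be measured by the following method: shear stress data are obtained by inoculating the same microbial culture in semi-skimmed milk (1.5% fat); the milk is heated at 90°C for 20 min and cooled to the inoculation temperature (30°C) before inoculation with 1% of an overnight microbial culture. Incubation is carried out at 30°C on a 200 ml scale for 8-22 h until the pH is about 4.55, followed by cooling to 4°C and storage at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days. After storage, the fermented milk is gently stirred with a stick fitted with a perforated disc until the sample is homogeneous. The shear stress of the sample is assessed on a rheometer (Anton Paar Physica Rheometer with ASC, Automatic Sample Changer, Anton Paar GmbH, Austria) using the following settings:
- Waiting time (to rebuild to a somewhat original structure)
  - 5 minutes without oscillation or rotation
- Rotation (to measure shear stress at 300 s⁻¹ etc.)
  - γ̇ = [0.2707-300] s⁻¹ and γ̇ = [275-0.2707] s⁻¹
21 measuring points over 210 s (one every 10 s) going up to 300 s⁻¹, and 21 measuring points over 210 s (one every 10 s) going down to 0.2707 s⁻¹. For data analysis, the shear stress at a shear rate of 300 s⁻¹ is selected.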
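Alternatively, shear stress is measured by the following method: 1% by volume of an overnight microbial culture (obtained by inoculating the microbial culture at 30°C in M17 broth supplemented with 2% glucose) is inoculated into soy milk containing glucose, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose. Incubation is carried out at 30°C on a 200 ml scale until the target pH is reached, followed by cooling to 4°C and storage at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days. The target pH may be, for example, pH 4-5, preferably pH 4.3-4.7, more preferably pH 4.4-4.6, even more preferably pH 4.45, pH 4.50 or pH 4.55. After storage, the fermented milk is gently stirred with a stick fitted with a perforated disc until the sample is homogeneous. The shear stress is assessed on a rheometer (Anton Paar Physica Rheometer with ASC, Automatic Sample Changer, Anton Paar GmbH, Austria) using the following settings:
- Waiting time (to rebuild to a somewhat original structure)
  - 5 minutes without oscillation or rotation
- Rotation (to measure shear stress at 300 s⁻¹ etc.)
  - γ̇ = [0.2707-300] s⁻¹ and γ̇ = [275-0.2707] s⁻¹
21 measuring points over 210 s (one every 10 s) going up to 300 s⁻¹, and 21 measuring points over 210 s (one every 10 s) going down to 0.2707 s⁻¹. For data analysis, the shear stress at a shear rate of 300 s⁻¹ may be selected.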
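Lactococcus lactis subsp. lactis lactic acid bacterium (LAB) strains
It is an object of the present invention to provide texturizing LAB strains suitable for the preparation of food products. In particular, it is an object of the present invention to provide texturizing Lactococcus lactis strains suitable for the preparation of mesophilic food products. As described herein, this object has been solved by Lactococcus lactis strains comprising novel eps gene clusters. As discussed in the Examples (see, for example, Tables 1 and 2 and Examples 1 and 2), the disclosed Lactococcus lactis strains DSM 33134, DSM 33135, DSM 33136, DSM 33137, DSM 33138, DSM 33139, DSM 33140, DSM 33141, DSM 33142, DSM 33183, DSM 25485 and DSM 33192 have excellent texturizing properties.
The inventors analyzed the eps gene clusters of the above strains and identified novel gene sequences that are believed to be involved in the production of exopolysaccharide (EPS) and thereby in the excellent texturizing properties of the above Lactococcus lactis strains used for fermenting milk.
Without being bound by theory, there is no substantial reason to believe that another Lactococcus lactis strain (i.e., one different from the specific strains of the invention) comprising eps gene cluster genes/sequences similar to the novel, characteristic eps gene cluster genes/sequences of the strains of the invention discussed herein would not also have improved texturizing properties. In LAB, the Wzy-dependent pathway is the pathway of choice for the synthesis of heteropolymeric EPS.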
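The genetic loci for polysaccharide biosynthesis via the Wzy-dependent mechanism are similar in all bacteria and have been well studied in Streptococcus pneumoniae. Notably, S. pneumoniae produces only capsular polysaccharide (commonly abbreviated CPS), whereas LAB can produce both CPS and EPS (EPS denoting "exopolysaccharide" secreted into the medium/milk). The same gene cluster is responsible for the production of CPS and EPS. Genetic analysis of the CPS loci from 90 pneumococcal serotypes demonstrated a striking feature of polysaccharide operons: many highly divergent forms exist for each of the key enzyme classes. Thus, 40 homology groups of polysaccharide polymerases, 13 groups of flippases and a great variety of glycosyltransferases were found. The presence of multiple non-homologous or highly divergent forms of these enzymes, together with the often different G+C content of the regions encoding them, supports the view that these genes have been imported on multiple occasions from different and unknown sources. Many eps gene clusters have undergone rearrangements mediated by insertion sequence (IS) elements and have received genes from other organisms through horizontal gene transfer. The presence of IS elements flanking or within the operon is a typical feature of eps operon organization. The excess of glycosyltransferases observed in polysaccharide production loci provides an opportunity for the continual generation, through gene shuffling, of new strains producing unique EPS. Because EPS show enormous diversity in their monosaccharide building blocks, anomeric configuration, conformation and stereochemistry, the resulting diversity of EPS structures is remarkable: for example, two glucose residues can be joined together in 30 different ways. According to the Carbohydrate-Active enZymes (CAZy) database (cazy.org), glycosyltransferases are currently divided into 107 families (June 2019, http://www.cazy.org/GlycosylTransferases.html), which helps to predict their mode of action. However, this does not mean that all enzymes of one family recognize the same donor and acceptor, since polyspecificity is common among glycosyltransferase families; caution is therefore warranted against over-interpreting predictions based purely on this classification.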
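In LAB, the genes encoding Wzy-dependent exopolysaccharide biosynthesis proteins are usually organized in clusters with an operon structure. In Streptococcus thermophilus they are usually located on the chromosome, whereas in Lactococcus lactis and Lactobacillus they may be located on plasmids or on the chromosome. In general, eps gene clusters are highly diverse, and their nucleotide sequences are among the most variable sequences in LAB genomes. The modular gene organization within eps gene clusters is nevertheless conserved (Zeidan et al., 2017). Following the nomenclature of Zeidan et al. (2017) and Poulsen et al. (2019), the conserved genes at the beginning of the eps gene cluster, which are involved in the regulation and assembly machinery of polysaccharide biosynthesis, are designated epsRXCDB; the conserved genes at the end of the eps gene cluster are designated epsL and lytR; the polymerase is designated wzy and the flippase is designated wzx. The genes of the variable part include the polymerase wzy, the polysaccharide transporter wzx (also known as flippase), and glycosyltransferases (GT) or other polymer-modifying enzymes. A common characteristic of texturizing strains is that they all contain the genes required for polysaccharide production, for example epsCDBE-wzy-wzx and GT (Zeidan et al., 2017).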
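The modular organization described above (conserved genes at the start, a variable middle, conserved genes at the end) can be pictured with a small lookup. This is an illustrative sketch only: the gene labels are those used in this paragraph and in the sequence listing, while the function name and the "unclassified" fallback are assumptions made for the example.

```python
CONSERVED_START = ("epsR", "epsX", "epsC", "epsD", "epsB")
CONSERVED_END = ("epsL", "lytR")


def module_of(gene: str) -> str:
    """Assign a gene label to a module of the eps cluster, per the paragraph above."""
    if gene in CONSERVED_START:
        return "conserved start"
    if gene in CONSERVED_END:
        return "conserved end"
    if gene in ("wzy", "wzx") or gene.startswith("GT"):
        return "variable"
    return "unclassified"  # anything this paragraph does not assign to a module


# Gene labels in the order listed for DSM 33140 in the sequence listing.
cluster = ["epsR", "epsX", "epsC", "epsD", "epsB", "epsE", "GT1", "GT2", "wzy", "epsL", "lytR"]
variable_genes = [g for g in cluster if module_of(g) == "variable"]
assert variable_genes == ["GT1", "GT2", "wzy"]
```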
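No putative function has yet been assigned to epsX and epsL. epsL of NIZO B40 could be disrupted by single crossover using an internal gene fragment, or overproduced, without any effect on EPS production (van Kranenburg, 1999). However, if the epsL from the eps cluster is not functional, a second copy of epsL may take over.
EpsR is believed to be responsible for the regulation of EPS biosynthesis, so that certain mutations affect EPS production. EpsCDB and ATP are believed to form a stable complex that acts as a tyrosine kinase-phosphatase system, which may control EPS synthesis through phosphorylation of EpsE. EpsE is the glycosylphosphotransferase that catalyzes the first step in the assembly of the EPS repeat unit and defines the type of sugar added to the lipid carrier to form the EPS. All three genes responsible for tyrosine phosphorylation are essential for complete encapsulation of pneumococci, with cpsC (corresponding to epsC in Lactococcus lactis) being a major virulence factor that is critical through its role in the regulation of CPS biosynthesis (Whittall et al., 2015). In Lactococcus lactis, epsC and epsD were found to be essential for EPS production, whereas epsB is not strictly required, since the effect of its deletion is a reduction in the amount of EPS produced (Nierop Groot and Kleerebezem, 2007). The gene epsE, encoding the priming phosphoglucosyltransferase, which does not catalyze a glycosidic linkage but is involved in attaching the first sugar of the repeat unit to the lipid carrier, has been shown to be essential for polysaccharide biosynthesis in Lactococcus lactis, since its disruption abolished EPS production (Dabour and LaPointe 2005, van Kranenburg et al., 1997).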
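The subsequent genes of the eps cluster, which usually encode glycosyltransferases, the polymerase and the transporter, lie in the variable part of the cluster and indeed usually show a low degree of similarity to genes that have already been characterized, which makes it difficult to predict their putative functions. Comparison of the polysaccharide synthesis operons from 90 pneumococcal serotypes, in which polysaccharide biosynthesis has been thoroughly studied, revealed that the central genes responsible for the synthesis and polymerization of the repeat unit are highly variable and often non-homologous between serotypes (Bentley et al., 2006). Wzy-dependent CPS biosynthesis in S. pneumoniae resembles peptidoglycan synthesis, whereby the repeat unit is built on the inner face of the cytoplasmic membrane, transported to the outer face of the membrane by the Wzx transporter (also known as flippase), and polymerized by the Wzy polymerase. The polysaccharide polymerase Wzy links the individual repeat units to form lipid-linked CPS. In S. pneumoniae, 40 homology groups of polysaccharide polymerases were found. The initial sugar of the repeating oligosaccharide unit is also the donor sugar in the polymerization of the repeat unit, and the specificity of the Wzy polymerase determines the type of linkage. The prediction of the initial sugar and of the subsequent repeat-unit polymerization linkage correlates well with the polymerase homology groups. In S. pneumoniae, 32 polymerase homology groups are associated with WchA, 5 with WciI, 4 with WcjG and 1 with WcjH. These associations are mostly exclusive, with only five polymerase homology groups being associated with two initial transferases, which indicates the high specificity of the initial transferases (Bentley et al., 2006).
Thus, in general, the eps gene clusters of Lactococcus strains show high similarity within the conserved regions (for example epsRXCDBE, epsL and lytR), whereas the remainder of the eps genes, including wzy, wzx, the GT genes and other genes modifying the oligosaccharide repeat unit (if present), are usually more variable with respect to sequence and to the genes present. Without being bound by theory, it is presently believed that differences in the EPS biosynthesis-related genes of the variable region (in particular wzy, wzx and the GT genes, if present) may account for the different EPS structures produced by different LAB strains, which may affect the differences in texturizing capability between LAB strains.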
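As stated above, a first aspect of the invention relates to a Lactococcus lactis lactic acid bacterium (LAB) strain comprising an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises, as the case may be, at least one, preferably two, more preferably three, even more preferably all of the nucleotide sequences selected from nucleotide sequences (a)-(m) as defined in any one of (i) to (x) below.
In a preferred embodiment, the eps gene cluster comprises all of the nucleotide sequences (a)-(c) (i.e., (a) to (c3)/(c4)/(c6), as the case may be) as defined in any one of (i) to (x) below.
In a most preferred embodiment, the eps gene cluster comprises, as the case may be, the nucleotide sequences (a)-(m) as defined in any one of (i) to (x) (for example, (a), (b), (c1), (c2), (c3) and (d) for a LAB as defined in (i); (a), (b), (c1), (c2), (c3), (c4), (c5), (d), (e), (f), (g), (h) and (i) for a LAB as defined in (ii); (a), (b), (c1), (c2), (c3) and (d) for a LAB as defined in (iii); (a), (b), (c1), (c2), (c3), (c4) and (d) for a LAB as defined in (iv); (a), (c1), (c2), (c3), (c4), (d), (e), (f), (g), (h), (i) and (j) for a LAB as defined in (v); (a), (b), (c1), (c2), (c3), (c4), (c5), (c6), (d), (e), (f), (g), (h), (i), (j), (k), (l) and (m) for a LAB as defined in (vi); (a), (c1), (c2) and (d) for a LAB as defined in (vii); (a), (b), (c1), (c2), (c3), (c4), (c5), (c6), (d), (e) and (f) for a LAB as defined in (viii); (a), (b), (c1), (c2), (c3), (d) and (e) for a LAB as defined in (ix); and (a), (b), (c1), (c2), (c3), (d) and (e) for a LAB as defined in (x)):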
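(i)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:11 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:17 (referred to herein as wzx);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:9 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:13 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:15 (referred to herein as GT3);
(d): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:23 (referred to herein as putative nucleotide sugar dehydrogenase protein);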
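(ii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO:199 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 26029-27444 of the complementary strand of SEQ ID NO:199 (referred to herein as wzx);
(c): at least one, preferably two, more preferably three, even more preferably four, most preferably five nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO:199 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO:199 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO:199 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO:199 (referred to herein as GT4);
(c5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 18528-19508 of SEQ ID NO:199 (referred to herein as GT5);
(d): a nucleotide sequence encoding a polypeptide having dTDP-glucose 4,6-dehydratase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4784-5695 of SEQ ID NO:199 (referred to herein as dTDP-glucose 4,6-dehydratase);
(e): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose reductase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 5717-6631 of SEQ ID NO:199 (referred to herein as dTDP-4-dehydrorhamnose reductase);
(f): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose 3,5-epimerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 6586-7257 of SEQ ID NO:199 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(g): a nucleotide sequence encoding the polypeptide DUF1919 and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 8515-9144 of SEQ ID NO:199 (referred to herein as DUF1919);
(h): a nucleotide sequence encoding the polypeptide DUF4422 and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 10271-11029 of SEQ ID NO:199 (referred to herein as DUF4422); and
(i): a nucleotide sequence encoding a polypeptide having UDP-galactopyranose mutase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 9159-10274 of SEQ ID NO:199 (referred to herein as UDP-galactopyranose mutase);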
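(iii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:39 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:45 (referred to herein as wzx);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by SEQ ID NO:37 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:41 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:43 (referred to herein as GT3); and
(d): a nucleotide sequence encoding a polypeptide having polysaccharide pyruvyl transferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:51 (referred to herein as polysaccharide pyruvyl transferase family protein);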
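(iv)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:163 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:169 (referred to herein as wzx);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:161 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:165 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:167 (referred to herein as GT3); and
(c4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:181 (referred to herein as GT4);
(d): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:175 (referred to herein as core-2/I-branching protein);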
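(v)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO:224 (referred to herein as wzy);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO:224 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO:224 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO:224 (referred to herein as GT3); and
(c4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 18527-19507 of SEQ ID NO:224 (referred to herein as GT4);
(d): a nucleotide sequence encoding the polypeptide DUF1972 and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO:224 (referred to herein as DUF1972);
(e): a nucleotide sequence encoding the polypeptide DUF4422 and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 10271-11029 of SEQ ID NO:224 (referred to herein as DUF4422);
(f): a nucleotide sequence encoding the polypeptide DUF1919 and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 8515-9144 of SEQ ID NO:224 (referred to herein as DUF1919);
(g): a nucleotide sequence encoding a polypeptide having UDP-galactopyranose mutase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 9159-10274 of SEQ ID NO:224 (referred to herein as UDP-galactopyranose mutase);
(h): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose 3,5-epimerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 6586-7257 of SEQ ID NO:224 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(i): a nucleotide sequence encoding a polypeptide having dTDP-glucose 4,6-dehydratase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4784-5695 of SEQ ID NO:224 (referred to herein as dTDP-glucose 4,6-dehydratase); and
(j): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose reductase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 5717-6631 of SEQ ID NO:224 (referred to herein as dTDP-4-dehydrorhamnose reductase);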
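(vi)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 95% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:67 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:73 (referred to herein as wzx);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:65 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:69 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:71 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:85 (referred to herein as GT4);
(c5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:87 (referred to herein as GT5); and
(c6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:89 (referred to herein as GT6);
(d): a nucleotide sequence encoding a polypeptide having epimerase/dehydratase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:93 (referred to herein as NAD-dependent epimerase/dehydratase 1);
(e): a nucleotide sequence encoding a polypeptide having dehydrogenase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:79 (referred to herein as nucleotide sugar dehydrogenase);
(f): a nucleotide sequence encoding a polypeptide having thymidylyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:95 (referred to herein as rfbA, glucose-1-phosphate thymidylyltransferase);
(g): a nucleotide sequence encoding a polypeptide having dehydratase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:97 (referred to herein as dTDP-glucose 4,6-dehydratase);
(h): a nucleotide sequence encoding a polypeptide having epimerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:99 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(i): a nucleotide sequence encoding a polypeptide having epimerase/dehydratase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:101 (referred to herein as NAD-dependent epimerase/dehydratase family protein 2);
(j): a nucleotide sequence encoding a polypeptide having acyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:111 (referred to herein as acyltransferase 1);
(k): a nucleotide sequence encoding a polypeptide having acyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:107 (referred to herein as acyltransferase 2);
(l): a nucleotide sequence encoding a polypeptide having reductase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:103 (referred to herein as dTDP-4-dehydrorhamnose reductase); and
(m): a nucleotide sequence encoding a polypeptide having nucleotidyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:105 (referred to herein as nucleotidyltransferase);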
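(vii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 5833-6927 of SEQ ID NO:244 (referred to herein as wzy);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4617-5123 of SEQ ID NO:244 (referred to herein as GT1); and
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 5120-5827 of SEQ ID NO:244 (referred to herein as GT2); and
(d): a nucleotide sequence encoding a polypeptide having UDP-N-acetylglucosamine-LPS N-acetylglucosamine transferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 4168-4617 of SEQ ID NO:244 (referred to herein as UDP-N-acetylglucosamine-LPS N-acetylglucosamine transferase);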
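(viii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:123 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:129 (referred to herein as wzx);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:121 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:125 (referred to herein as GT2);
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:127 (referred to herein as GT3);
(c4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:143 (referred to herein as GT4);
(c5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:145 (referred to herein as GT5); and
(c6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:147 (referred to herein as GT6);
(d): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:149 (referred to herein as acetyltransferase);
(e): a nucleotide sequence encoding a polypeptide having dehydrogenase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:135 (referred to herein as nucleotide sugar dehydrogenase);
(f): a nucleotide sequence encoding a polypeptide having acyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:151 (referred to herein as acyltransferase);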
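(ix)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 11201-12349 of the complementary strand of SEQ ID NO:257 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 15538-16953 of the complementary strand of SEQ ID NO:257 (referred to herein as wzx);
(c): nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 9726-10673 of the complementary strand of SEQ ID NO:257 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 12336-13421 of the complementary strand of SEQ ID NO:257 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 13418-14260 of the complementary strand of SEQ ID NO:257 (referred to herein as GT3);
(d): a nucleotide sequence encoding a polypeptide having nucleotide sugar dehydrogenase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 7727-8173 of the complementary strand of SEQ ID NO:257 (referred to herein as nucleotide sugar dehydrogenase); and
(e): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 10657-11211 of the complementary strand of SEQ ID NO:257 (referred to herein as acetyltransferase);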
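(x)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 10707-11846 of the complementary strand of SEQ ID NO:274 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 15037-16476 of the complementary strand of SEQ ID NO:274 (referred to herein as wzx);
(c): nucleotide sequences encoding a polypeptide having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 9232-10179 of the complementary strand of SEQ ID NO:274 (referred to herein as GT1);
(c2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 11833-12918 of the complementary strand of SEQ ID NO:274 (referred to herein as GT2); and
(c3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 12915-13757 of the complementary strand of SEQ ID NO:274 (referred to herein as GT3);
(d): a nucleotide sequence encoding a polypeptide having nucleotide sugar dehydrogenase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 7234-7680 of the complementary strand of SEQ ID NO:274 (referred to herein as nucleotide sugar dehydrogenase); and
(e): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 10163-10717 of the complementary strand of SEQ ID NO:274 (referred to herein as acetyltransferase).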
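Each item in groups (i) to (x) pairs a reference sequence with a ladder of identity thresholds (70%, 85%, 95%, 98%, 100%). The sketch below shows one way a set of measured identities could be compared against that ladder. It is illustrative only: the identity values are made-up inputs rather than data from the specification, the function names are hypothetical, and enzymatic activity, which the items also require, is not modelled.

```python
# Identity ladder recited for most items: at least 70%, preferably 85%, ... 100%.
TIERS = (70.0, 85.0, 95.0, 98.0, 100.0)


def highest_tier(identity_pct: float):
    """Return the highest threshold met by an identity value, or None if below 70%."""
    met = [tier for tier in TIERS if identity_pct >= tier]
    return met[-1] if met else None


def count_meeting(identities: dict, minimum: float = 70.0) -> int:
    """Number of reference sequences matched at or above the given minimum identity."""
    return sum(1 for pct in identities.values() if pct >= minimum)


# Hypothetical identities of a candidate cluster against the group (i) references.
candidate = {"wzy": 99.1, "wzx": 96.4, "GT1": 88.0, "GT2": 71.5, "GT3": 64.0}
assert highest_tier(candidate["wzx"]) == 95.0
assert count_meeting(candidate) == 4        # GT3 falls below the 70% floor
assert count_meeting(candidate, 95.0) == 2  # only wzy and wzx reach 95%
```

In these terms, "at least one" versus "all" of the listed sequences corresponds to comparing the count against 1 or against the number of items in the group.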
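The above sequences are all present in the variable part of the eps gene cluster of the strains of the invention.
Thus, in a preferred embodiment, the Lactococcus lactis lactic acid bacterium (LAB) strain of the invention comprises an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises all of the nucleotide sequences (a)-(c) (if present) as defined in any one of (i) to (x) above.
In a most preferred embodiment, the Lactococcus lactis lactic acid bacterium (LAB) strain of the invention comprises an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises, as the case may be, all of the nucleotide sequences (a)-(m) as defined in any one of (i) to (x) above.
Preferably, the Lactococcus lactis lactic acid bacterium (LAB) strain of the invention comprises an active eps gene cluster capable of producing exopolysaccharide (EPS), preferably wherein the eps gene cluster comprises, as the case may be, all of the nucleotide sequences (a)-(c) as defined in any one of (i) to (x) above, even more preferably wherein the eps gene cluster comprises, as the case may be, all of the nucleotide sequences (a)-(m) as defined in any one of (i) to (x) above, and wherein the eps gene cluster further comprises at least one, preferably all, of the following nucleotide sequences: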
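(i)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:1 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:3 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:5 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:7 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:19 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:21 (referred to herein as LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:25 (referred to herein as epsC);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:27 (referred to herein as epsE);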
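(ii)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1-318 of SEQ ID NO:199 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 352-1119 of SEQ ID NO:199 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 2698-3462 of SEQ ID NO:199 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1948-2640 of SEQ ID NO:199 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 24053-24751 of the complementary strand of SEQ ID NO:199 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 20389-21291 of SEQ ID NO:199 (referred to herein as LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1159-1938 of SEQ ID NO:199 (referred to herein as epsC);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 3484-4170 of SEQ ID NO:199 (referred to herein as epsE);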
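(iii)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:29 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:31 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:33 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:35 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:47 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:49 (referred to herein as putative LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:53 (referred to herein as epsC);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:55 (referred to herein as epsE);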
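(iv)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:153 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:155 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:157 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:159 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:171 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:173 (referred to herein as putative LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:177 (referred to herein as epsC); and
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:179 (referred to herein as epsE);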
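(v)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1-318 of SEQ ID NO:224 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 352-1119 of SEQ ID NO:224 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 2698-3462 of SEQ ID NO:224 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1948-2640 of SEQ ID NO:224 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 20388-21290 of SEQ ID NO:224 (referred to herein as LytR family transcriptional regulator protein);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1159-1938 of SEQ ID NO:224 (referred to herein as epsC);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 3484-4170 of SEQ ID NO:224 (referred to herein as epsE);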
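(vi)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:57 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:59 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:61 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:63 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:75 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:77 (referred to herein as putative LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:81 (referred to herein as epsC);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:83 (referred to herein as epsE);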
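(vii)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1-318 of SEQ ID NO:244 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 352-1119 of SEQ ID NO:244 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 2698-3462 of SEQ ID NO:244 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1948-2643 of SEQ ID NO:244 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 9365-10267 of the complementary strand of SEQ ID NO:244 (referred to herein as LytR family transcriptional regulator protein);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 1159-1938 of SEQ ID NO:244 (referred to herein as epsC);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 3484-4164 of SEQ ID NO:244 (referred to herein as epsE);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by nucleotides 8438-9340 of SEQ ID NO:244 (referred to herein as epsL);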
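(viii)(1): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:113 (referred to herein as epsR);
(2): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:115 (referred to herein as epsX);
(3): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:117 (referred to herein as epsB);
(4): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:119 (referred to herein as epsD);
(5): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:131 (referred to herein as epsL);
(6): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:133 (referred to herein as putative LytR family transcriptional regulator protein);
(7): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:137 (referred to herein as epsC);
(8): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:139 (referred to herein as epsE1);
(9): a nucleotide sequence encoding a polypeptide having at least 70%, preferably at least 85%, more preferably at least 95%, even more preferably at least 98%, most preferably 100% identity to the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO:141 (referred to herein as epsE2);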
(ix)(1):核苷酸序列,其与SEQ ID NO:257的核苷酸1-318(本文称为epsR)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(2):核苷酸序列,其与SEQ ID NO:257的核苷酸407-1120(本文称为epsX)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(3):核苷酸序列,其与SEQ ID NO:257的核苷酸2699-3463(本文称为epsB)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(4):核苷酸序列,其与SEQ ID NO:257的核苷酸1949-2644(本文称为epsD)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(5):核苷酸序列,其与SEQ ID NO:257的核苷酸5876-6778编码的氨基酸序列(本文称为LytR家族转录调节蛋白)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(6):核苷酸序列,其与SEQ ID NO:257的核苷酸1160-1939(本文称为epsC)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(7):核苷酸序列,其与SEQ ID NO:257的核苷酸3485-4084(本文称为epsE1)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(8):核苷酸序列,其与SEQ ID NO:257的核苷酸4085-4840(本文称为epsE2)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(9):核苷酸序列,其与SEQ ID NO:257互补链的核苷酸6803-7717(本文称为epsL)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(x)(1):核苷酸序列,其与SEQ ID NO:274的核苷酸1-318(本文称为epsR)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(2):核苷酸序列,其与SEQ ID NO:274的核苷酸2220-3005(本文称为epsB)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(3):核苷酸序列,其与SEQ ID NO:274的核苷酸1470-2165(本文称为epsD)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(4):核苷酸序列,其与SEQ ID NO:274的核苷酸5383-6285编码的氨基酸序列(本文称为LytR家族转录调节蛋白)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(5):核苷酸序列,其与SEQ ID NO:274的核苷酸681-1460(本文称为epsC)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(6):核苷酸序列,其与SEQ ID NO:274的核苷酸2992-3591(本文称为epsE1)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(7):核苷酸序列,其与SEQ ID NO:274的核苷酸3592-4347(本文称为epsE2)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(8):核苷酸序列,其与SEQ ID NO:274互补链的核苷酸6310-7224(本文称为epsL)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
上述序列均存在于本发明菌株的eps基因簇的保守部分中。
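上文各项均以氨基酸序列同一性的分级阈值(至少70%、85%、95%、98%、100%)来限定。下面是一个最小化的示意草图,说明如何对一对已比对的氨基酸序列计算同一性并归入相应级别。需要强调的是:正文并未规定比对算法或分母的取法;这里假设输入已经过全局比对、空位以“-”表示、分母取比对总长度,函数名 `percent_identity` 和 `highest_tier` 均为示例用的假设名称。

```python
# 示意草图(非专利所述方法):计算两条已比对氨基酸序列的同一性百分比。
# 假设:两条序列已全局比对、长度相同,空位用 "-" 表示;
# 分母取比对总长度(含空位列),这只是常见约定之一。

TIERS = (100.0, 98.0, 95.0, 85.0, 70.0)  # 正文列出的分级阈值(%)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """返回同一性百分比;空位列不计为匹配。"""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_tier(identity: float):
    """返回 identity 所满足的最高阈值;低于70%时返回 None。"""
    for tier in TIERS:
        if identity >= tier:
            return tier
    return None
```

例如,10个比对位点中有9个相同的两条序列同一性为90%,满足“至少85%”但不满足“至少95%”。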
甚至更优选,本发明的乳酸乳球菌乳酸菌包含能够产生胞外多糖(EPS)的活性eps基因簇,其中所述eps基因簇具有下述序列:
(i)SEQ ID NO:91;
(ii)SEQ ID NO:199;
(iii)SEQ ID NO:92;
(iv)SEQ ID NO:223;
(v)SEQ ID NO:224;
(vi)SEQ ID NO:221;
(vii)SEQ ID NO:244;
(viii)SEQ ID NO:222;
(ix)SEQ ID NO:257;
(x)SEQ ID NO:274。
最优选,本发明的乳酸乳球菌乳酸菌是下述质构化菌株:
(i):DSM 33134
(ii):DSM 33135
(iii):DSM 33136
(iv):DSM 33137
(v):DSM 33138
(vi):DSM 33139
(vii):DSM 33140
(viii):DSM 33141
(ix):DSM 33142
(x):DSM 33183。
DSM 33134的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:wzy(SEQ ID NO.:11)、wzx(SEQ ID NO.:17)、编码GT1的基因(SEQ ID NO.:9)、编码GT2的基因(SEQ ID NO.:13)、编码GT3的基因(SEQ ID NO.:15)和编码核苷酸糖脱氢酶蛋白的基因(SEQ ID NO.:23)。
保守部分:epsR(SEQ ID NO.:1)、epsX(SEQ ID NO.:3)、epsB(SEQ ID NO.:5)、epsD(SEQ ID NO.:7)、epsL(SEQ ID NO.:19)、编码LytR蛋白的基因(SEQ ID NO.:21)、epsC(SEQ ID NO.:25)和epsE(SEQ ID NO.:27)。
DSM 33136的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:wzy(SEQ ID NO.:39)、wzx(SEQ ID NO.:45)、编码GT1的基因(SEQ ID NO.:37)、编码GT2的基因(SEQ ID NO.:41)、编码GT3的基因(SEQ ID NO.:43)和编码多糖丙酮酰转移酶家族蛋白的基因(SEQ ID NO.:51)。
保守部分:epsR(SEQ ID NO.:29)、epsX(SEQ ID NO.:31)、epsB(SEQ ID NO.:33)、epsD(SEQ ID NO.:35)、epsL(SEQ ID NO.:47)、编码LytR蛋白的基因(SEQ ID NO.:49)、epsC(SEQ ID NO.:25)和epsE(SEQ ID NO.:27)。
DSM 33139的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO.:65)、编码NAD依赖性差向异构酶/脱水酶蛋白1的基因(SEQ ID NO.:93)、编码核苷酸糖脱氢酶蛋白的基因(SEQ ID NO.:79)、编码葡萄糖-1-磷酸胸苷基转移酶RfbA蛋白的基因(SEQ ID NO.:95)、编码dTDP-葡萄糖4,6-脱水酶蛋白的基因(SEQ ID NO.:97)、编码dTDP-4-脱氢鼠李糖3,5-差向异构酶蛋白的基因(SEQ ID NO.:99)、编码NAD依赖性差向异构酶/脱水酶蛋白2的基因(SEQ ID NO.:101)、编码GT2的基因(SEQ ID NO.:69)、编码GT3的基因(SEQ ID NO.:71)、编码GT4的基因(SEQ ID NO.:85)、编码GT5的基因(SEQ ID NO.:87)、编码GT6的基因(SEQ ID NO.:89)、wzy(SEQ ID NO.:67)、编码酰基转移酶蛋白1的基因(SEQ ID NO.:111)、wzx(SEQ ID NO.:73)、编码酰基转移酶蛋白2的基因(SEQ ID NO.:107)、编码dTDP-4-脱氢鼠李糖还原酶的基因(SEQ ID NO.:103)和编码核苷酸转移酶的基因(SEQ ID NO.:105)。
保守部分:epsR(SEQ ID NO.:57)、epsX(SEQ ID NO.:59)、epsB(SEQ ID NO.:61)、epsD(SEQ ID NO.:63)、epsL(SEQ ID NO.:75)、编码LytR蛋白的基因(SEQ ID NO.:77)、epsC(SEQ ID NO.:81)和epsE(SEQ ID NO.:83)。
DSM 33141的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO.:121)、编码核苷酸糖脱氢酶蛋白的基因(SEQ ID NO.:135)、编码GT2的基因(SEQ ID NO.:125)、编码GT3的基因(SEQ ID NO.:127)、编码GT4的基因(SEQ ID NO.:143)、编码GT5的基因(SEQ ID NO.:145)、编码GT6的基因(SEQ ID NO.:147)、wzy(SEQ ID NO.:123)、编码乙酰转移酶蛋白的基因(SEQ ID NO.:149)、wzx(SEQ ID NO.:129)和编码酰基转移酶蛋白的基因(SEQ ID NO.:151)。
保守部分:epsR(SEQ ID NO.:113)、epsX(SEQ ID NO.:115)、epsB(SEQ ID NO.:117)、epsD(SEQ ID NO.:119)、epsL(SEQ ID NO.:131)、编码LytR蛋白的基因(SEQ ID NO.:133)、epsC(SEQ ID NO.:137)、epsE1(SEQ ID NO.:139)以及epsE2(SEQ ID NO.:141)。
DSM 33137的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO.:161)、编码GT2的基因(SEQ ID NO.:165)、编码GT3的基因(SEQ ID NO.:167)、编码GT4的基因(SEQ ID NO.:181)、wzy(SEQ ID NO.:163)、wzx(SEQ ID NO.:169)、编码核心-2/I-分支蛋白的基因(SEQ ID NO.:175)。
保守部分:epsR(SEQ ID NO.:153)、epsX(SEQ ID NO.:155)、epsB(SEQ ID NO.:157)、epsD(SEQ ID NO.:159)、epsL(SEQ ID NO.:171)、编码LytR蛋白的基因(SEQ ID NO.:173)、epsC(SEQ ID NO.:177)和epsE(SEQ ID NO.:179)。
DSM 33135的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO:199的核苷酸4174-4824)、编码GT2的基因(SEQ ID NO:199的核苷酸7276-8508)、编码GT3的基因(SEQ ID NO:199的核苷酸11042-12391)、编码GT4的基因(SEQ ID NO:199的核苷酸13008-13934)、编码GT5的基因(SEQ ID NO:199的核苷酸18528-19508)、wzy(SEQ ID NO:199的核苷酸13939-15042)、wzx(SEQ ID NO:199互补链的核苷酸26029-27444)、编码dTDP-葡萄糖4,6-脱水酶蛋白的基因(SEQ ID NO:199的核苷酸4784-5695)、编码dTDP-4-脱氢鼠李糖还原酶蛋白的基因(SEQ ID NO:199的核苷酸5717-6631)、编码dTDP-4-脱氢鼠李糖3,5-差向异构酶蛋白的基因(SEQ ID NO:199的核苷酸6586-7257)、编码DUF1919蛋白的基因(SEQ ID NO:199的核苷酸8515-9144)、编码UDP-吡喃半乳糖变位酶蛋白的基因(SEQ ID NO:199的核苷酸9159-10274)和编码DUF4422蛋白的基因(SEQ ID NO:199的核苷酸10271-11029)。
保守部分:epsR(SEQ ID NO:199的核苷酸1-318)、epsX(SEQ ID NO:199的核苷酸352-1119)、epsB(SEQ ID NO:199的核苷酸2698-3462)、epsD(SEQ ID NO:199的核苷酸1948-2640)、epsL(SEQ ID NO:199互补链的核苷酸24053-24751)、编码LytR蛋白的基因(SEQ ID NO:199的核苷酸20389-21291)、epsC(SEQ ID NO:199的核苷酸1159-1938)以及epsE(SEQ ID NO:199的核苷酸3484-4170)。
DSM 33138的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO:224的核苷酸4174-4824)、编码GT2的基因(SEQ ID NO:224的核苷酸11042-12391)、编码GT3的基因(SEQ ID NO:224的核苷酸13008-13934)、编码GT4的基因(SEQ ID NO:224的核苷酸18527-19507)、wzy(SEQ ID NO:224的核苷酸13939-15042)、编码DUF1972蛋白的基因(SEQ ID NO:224的核苷酸7276-8508)、编码dTDP-4-脱氢鼠李糖还原酶蛋白的基因(SEQ ID NO:199的核苷酸5717-6631)、编码DUF4422蛋白的基因(SEQ ID NO:199的核苷酸6586-7257)、编码DUF1919蛋白的基因(SEQ ID NO:199的核苷酸8515-9144)、编码UDP-吡喃半乳糖变位酶蛋白的基因(SEQ ID NO:224的核苷酸9159-10274)、编码DUF4422蛋白的基因(SEQ ID NO:224的核苷酸10271-11029)、编码DUF1919蛋白的基因(SEQ ID NO:224的核苷酸8515-9144)、编码dTDP-4-脱氢鼠李糖3,5-差向异构酶蛋白的基因(SEQ ID NO:224的核苷酸6586-7257)、编码dTDP-葡萄糖4,6-脱水酶蛋白的基因(SEQ ID NO:224的核苷酸4784-5695)以及编码dTDP-4-脱氢鼠李糖还原酶蛋白的基因(SEQ ID NO:224的核苷酸5717-6631)。
保守部分:epsR(SEQ ID NO:224的核苷酸1-318)、epsX(SEQ ID NO:224的核苷酸352-1119)、epsB(SEQ ID NO:224的核苷酸2698-3462)、epsD(SEQ ID NO:224的核苷酸1948-2640)、编码LytR蛋白的基因(SEQ ID NO:224的核苷酸20388-21290)、epsC(SEQ ID NO:224的核苷酸1159-1938)以及epsE(SEQ ID NO:224的核苷酸3484-4170)。
DSM 33140的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO:244的核苷酸4617-5123)、编码GT2的基因(SEQ ID NO:244的核苷酸5120-5827)、wzy(SEQ ID NO:244的核苷酸5833-6927)、编码dTDP-4-脱氢鼠李糖还原酶蛋白的基因(SEQ ID NO:199的核苷酸5717-6631)、编码UDP-N-乙酰葡糖胺-LPS N-乙酰葡糖胺转移酶蛋白的基因(SEQ ID NO:244的核苷酸4168-4617)。
保守部分:epsR(SEQ ID NO:244的核苷酸1-318)、epsX(SEQ ID NO:244的核苷酸352-1119)、epsB(SEQ ID NO:244的核苷酸2698-3462)、epsD(SEQ ID NO:244的核苷酸1948-2643)、epsL(SEQ ID NO:244的核苷酸8438-9340)、编码LytR蛋白的基因(SEQ ID NO:244互补链的核苷酸9365-10267)、epsC(SEQ ID NO:244的核苷酸1159-1938)以及epsE(SEQ ID NO:244的核苷酸3484-4164)。
DSM 33142的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO:257互补链的核苷酸9726-10673)、编码GT2的基因(SEQ ID NO:257互补链的核苷酸12336-13421)、编码GT3的基因(SEQ ID NO:257互补链的核苷酸13418-14260)、wzy(SEQ ID NO:257互补链的核苷酸11201-12349)、wzx(SEQ ID NO:257互补链的核苷酸15538-16953)、编码核苷酸糖脱氢酶蛋白的基因(SEQ ID NO:257互补链的核苷酸7727-8173)以及编码乙酰转移酶蛋白的基因(SEQ ID NO:257互补链的核苷酸10657-11211)。
保守部分:epsR(SEQ ID NO:257的核苷酸1-318)、epsX(SEQ ID NO:257的核苷酸407-1120)、epsB(SEQ ID NO:257的核苷酸2699-3463)、epsD(SEQ ID NO:257的核苷酸1949-2644)、epsL(SEQ ID NO:257互补链的核苷酸6803-7717)、编码LytR蛋白的基因(SEQ ID NO:257的核苷酸5876-6778)、epsC(SEQ ID NO:257的核苷酸1160-1939)、epsE1(SEQ ID NO:257的核苷酸3485-4084)以及epsE2(SEQ ID NO:257的核苷酸4085-4840)。
DSM 33183的eps基因簇在可变部分和保守部分中包含下述基因:
可变部分:编码GT1的基因(SEQ ID NO:274互补链的核苷酸9232-10179)、编码GT2的基因(SEQ ID NO:274互补链的核苷酸11833-12918)、编码GT3的基因(SEQ ID NO:274互补链的核苷酸12915-13757)、wzy(SEQ ID NO:274互补链的核苷酸10707-11846)、wzx(SEQ ID NO:274互补链的核苷酸15037-16476)、编码核苷酸糖脱氢酶蛋白的基因(SEQ ID NO:274互补链的核苷酸7234-7680)以及编码乙酰转移酶蛋白的基因(SEQ ID NO:274互补链的核苷酸10163-10717)。
保守部分:epsR(SEQ ID NO:274的核苷酸1-318)、epsB(SEQ ID NO:274的核苷酸2220-3005)、epsD(SEQ ID NO:274的核苷酸1470-2165)、epsL(SEQ ID NO:274互补链的核苷酸6310-7224)、编码LytR蛋白的基因(SEQ ID NO:274的核苷酸5383-6285)、epsC(SEQ ID NO:274的核苷酸681-1460)、epsE1(SEQ ID NO:274的核苷酸2992-3591)以及epsE2(SEQ ID NO:274的核苷酸3592-4347)。
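上述各基因以“SEQ ID NO:N的核苷酸a-b”或“SEQ ID NO:N互补链的核苷酸a-b”的形式给出坐标。下面的草图演示如何按这种坐标从一条基因簇序列中取出对应区段。这里有两点假设,正文并未明确:坐标为从1开始计数的闭区间;“互补链”指坐标仍按给定链编号,而基因沿互补链读取(即取该区段的反向互补序列)。函数名均为示例用的假设名称,示例序列为虚构的短序列。

```python
# 示意草图:按 1 起始的闭区间坐标从基因簇序列中提取区段。
# 假设:"互补链的核苷酸 a-b" 表示取正向链 a..b 区段的反向互补序列。

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """返回 DNA 序列的反向互补序列。"""
    return seq.translate(_COMPLEMENT)[::-1]


def region_length(start: int, end: int) -> int:
    """闭区间 start..end 所含的核苷酸数。"""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def extract_region(cluster_seq: str, start: int, end: int,
                   complement_strand: bool = False) -> str:
    """提取 start..end(1 起始,含两端);互补链时返回反向互补序列。"""
    if end > len(cluster_seq):
        raise ValueError("end exceeds sequence length")
    region_length(start, end)  # 复用坐标合法性检查
    region = cluster_seq[start - 1:end]
    return reverse_complement(region) if complement_strand else region
```

按此约定,正文中的epsR区段“核苷酸1-318”长318个核苷酸,恰为3的整数倍(106个密码子位置,含终止密码子与否取决于注释约定)。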
术语“胞外多糖(EPS)”是众所周知的,并且技术人员可以常规地确定感兴趣的乳酸菌是否产生EPS。如本领域技术人员已知和理解的,产生EPS的感兴趣的乳酸菌会包含活性eps基因簇。
如本领域技术人员所知,如上所述,活性eps基因簇包含参与EPS生物合成的调控和调节的基因和参与寡糖重复单元的生物合成和输出的基因,包括糖基转移酶(GT)、聚合酶和转运蛋白。简而言之并且如技术人员所理解的,由于第一方面的乳酸菌菌株能够产生和输出胞外多糖(EPS),那么它们将包含活性eps基因簇。Zeidan等人(Zeidan et al.,2017)综述了LAB产生EPS,并提供了LAB中eps基因簇结构的详细信息。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(i)限定的核苷酸序列(a)、(b)和(c),甚至更优选所有核苷酸序列(a)至(d),并且优选至少一条、优选所有选自(i)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33134或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(ii)限定的核苷酸序列(a)至(c5),甚至更优选所有核苷酸序列(a)至(i),并且优选至少一条、优选所有选自(ii)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33135或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(iii)限定的核苷酸序列(a)至(d),优选所有核苷酸序列(a)至(c3),甚至更优选所有核苷酸序列(a)至(d),并且优选至少一条、优选所有选自(iii)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33136或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(iv)限定的核苷酸序列(a)-(d),优选所有核苷酸序列(a)至(c4),甚至更优选所有核苷酸序列(a)至(d),并且优选至少一条、优选所有选自(iv)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33137或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(v)限定的核苷酸序列(a)至(c4),优选所有核苷酸序列(a)至(j),并且优选至少一条、优选所有选自(v)限定的核苷酸序列(1)-(7)的核苷酸序列是菌株DSM 33138或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(vi)限定的核苷酸序列(a)-(m),优选所有核苷酸序列(a)至(c6),甚至更优选所有核苷酸序列(a)至(m),并且优选至少一条、优选所有选自(vi)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33139或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(vii)限定的核苷酸序列(a)至(c2),甚至更优选所有核苷酸序列(a)至(d),并且优选至少一条、优选所有选自(vii)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33140或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(viii)限定的核苷酸序列(a)-(f),优选所有核苷酸序列(a)至(c6),甚至更优选所有核苷酸序列(a)至(f),并且优选至少一条、优选所有选自(viii)限定的核苷酸序列(1)-(9)的核苷酸序列是菌株DSM 33141或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(x)限定的核苷酸序列(a)至(e),更优选所有核苷酸序列(a)至(c3),甚至更优选所有核苷酸序列(a)至(e),并且优选至少一条、优选所有选自(ix)限定的核苷酸序列(1)-(9)的核苷酸序列是菌株DSM 33142或其突变体或变体。
优选,乳酸乳球菌乳酸菌(LAB)菌株包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇包含至少一条、优选两条、更优选三条核苷酸序列,所述核苷酸序列选自(xi)限定的核苷酸序列(a)至(e),甚至更优选所有核苷酸序列(a)至(c3),甚至更优选所有核苷酸序列(a)至(e),并且优选至少一条、优选所有选自(x)限定的核苷酸序列(1)-(8)的核苷酸序列是菌株DSM 33183或其突变体或变体。
如本文的工作实施例所讨论的(参见例如表1)——本文公开的新颖乳酸乳球菌菌株具有优异的质构化特性。此外,如实施例2以及表2和表3所示,本文公开的新颖乳酸乳球菌菌株在植物基乳中,特别是在补充有葡萄糖的豆乳中也具有优异的质构化特性,所述葡萄糖例如0.5-5%的葡萄糖,优选0.5-2%的葡萄糖,更优选约2%的葡萄糖。
优选,本文所述的质构化乳酸菌菌株(i)至(x)是生成剪切应力大于40Pa,例如为约41Pa、42Pa、43Pa、44Pa、45Pa、46Pa或更高的发酵乳的LAB菌株,优选LAB菌株生成剪切应力为41Pa或更高,例如为约41Pa、48Pa、52Pa、53Pa、55Pa、56Pa、60Pa、64Pa、65Pa或67Pa的发酵乳,优选在存在优选选自DSM 25485、DSM 33192和/或DSM 33133的协同酸化(co-acidifier)菌株或辅助菌株的情况下,甚至更优选在存在DSM 25485的情况下,剪切应力是在下述条件下以300s-1的剪切速率测量的:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并在300s-1的剪切速率下测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
在不受理论限制的情况下,当按如下测量时,并非所有菌株都能在15h或更短的时间内酸化乳,即在15h或更短的时间内达到目标pH-例如pH 4.55:将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到达到目标pH。目标pH可以是例如pH 4-5,优选pH4.3-4.7,更优选pH 4.4-4.6,甚至更优选pH 4.45、pH 4.50或pH 4.55。接种温度为30℃。
这些菌株可称为“慢酸化”菌株。例如,如表1所示,以下菌株可以视为“慢酸化”菌株:DSM 33134、DSM 33135、DSM 33136、DSM 33138、DSM 33139、DSM 33141和DSM 33183。
对于发酵乳的生产,目前优选乳发酵(酸化)尽可能快地进行,例如,以避免任何潜在污染微生物的生长。因此,优选组合使用慢酸化菌株与在本发明的上下文中称为“协同酸化”菌株或“辅助”菌株的其他乳酸菌菌株。协同酸化菌株或辅助菌株会有助于“慢酸化”菌株在更短的时间内酸化乳。在不受理论限制的情况下,目前认为协同酸化菌株或辅助菌株尤其会比“慢酸化”菌株更快地代谢乳中存在的蛋白质(酪蛋白),因此“慢酸化”菌株会具有更多可利用的氮源供其生长,从而促进其生长。乳酸菌需要外源性氨基酸或肽,它们是由乳蛋白例如酪蛋白的蛋白水解提供,酪蛋白是乳中最丰富的蛋白和氨基酸的主要来源(Savijoki,K.,et al.,Appl Microbiol Biotechnol(2006)71:394-406)。
慢酸化菌株通常与低蛋白水解活性有关。蛋白水解是将蛋白质分解成更小的多肽或氨基酸。细胞壁蛋白酶(Prt)水解乳蛋白,例如酪蛋白,提供氮源,从而使乳适合菌株快速生长。prt活性以外的其他因素,例如碳代谢、ldh和codY活性,也可以发挥作用。仅具有高prt活性来快速酸化乳是不够的。肽的摄取和进一步降解对于乳酸化率也很重要。此外,EPS产生是高能耗过程(Zeidan et al.,2017)。质构化乳酸乳球菌菌株在酸化乳方面通常慢于非质构化菌株(Poulsen et al.,2019)。
能够在约15h或更短的时间内酸化乳的菌株,可以称为“快速酸化”菌株,即当按如下测量时,能够在15h或更短的时间内达到目标pH的菌株:将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到达到目标pH。接种温度为30℃。目标pH可以是例如pH 4-5,优选pH 4.3-4.7,更优选pH 4.4-4.6,甚至更优选pH 4.45、pH 4.50或pH 4.55。
这些菌株可以单独使用或与其他菌株组合使用,用于生成发酵乳。例如,以下菌株可以视为“快速酸化”菌株:DSM 33137、DSM 33140和DSM 33142。
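“慢酸化”与“快速酸化”的划分取决于在上述条件下能否于15h或更短的时间内达到目标pH。下面的草图演示如何从一组pH-时间读数估算“达到目标pH的时间”并据此分类。假设读数按时间升序排列,并在相邻两点之间做线性插值;插值方式以及函数名 `time_to_ph`、`classify_acidifier` 都是示例用的假设,正文并未规定数据处理方法。

```python
# 示意草图:由 pH-时间读数估算达到目标 pH 的时间,并按 15 h 界限分类。
# 假设:times_h 升序排列;相邻读数之间按线性插值(正文未作规定)。

def time_to_ph(times_h, ph_values, target_ph=4.55):
    """返回首次达到 target_ph 的时间(小时);始终未达到则返回 None。"""
    if len(times_h) != len(ph_values) or not times_h:
        raise ValueError("need equally long, non-empty series")
    if ph_values[0] <= target_ph:
        return float(times_h[0])
    for i in range(1, len(times_h)):
        p0, p1 = ph_values[i - 1], ph_values[i]
        if p1 <= target_ph:
            t0, t1 = times_h[i - 1], times_h[i]
            # 此处 p0 > target_ph >= p1,因此分母 p0 - p1 为正
            return t0 + (p0 - target_ph) * (t1 - t0) / (p0 - p1)
    return None


def classify_acidifier(time_h, limit_h=15.0):
    """15 h 或更短为 "fast",否则(含未达到目标 pH)为 "slow"。"""
    if time_h is not None and time_h <= limit_h:
        return "fast"
    return "slow"
```

用法示例:对一组虚构读数 `time_to_ph([0, 8, 16], [6.5, 5.0, 4.0])`,穿越点落在8h与16h之间,插值结果约为11.6h,归类为 "fast"。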
本发明的协同酸化菌株或辅助菌株可以是能够进行i)和ii)的任何乳酸菌菌株:
i)在下述条件下测量时,在15h或更短的时间内,优选在12h或更短的时间内生成pH为约4.55的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度(30℃),并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到达到约4.55的pH。因此,可以计算某种乳酸菌菌株的“达到pH 4.55的时间”;
ii)在下述条件下测量时,生成在300s-1的剪切速率下测量的40Pa或更高的剪切应力的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55(达到pH 4.55的时间),然后在4℃下储存5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。
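条件i)和ii)合起来构成一个简单的双重判据:达到pH 4.55的时间不超过15h(优选不超过12h),且剪切应力不低于40Pa。下面的草图把这一判据写成函数;函数名和返回的标签是示例用的假设,输入数值需按正文所述条件实测得到。

```python
# 示意草图:按条件 i)(酸化时间)和 ii)(剪切应力)判断候选辅助菌株。
# 返回标签为示例约定:"preferred"(<=12 h)、"qualifies"(<=15 h)、"no"。

def helper_strain_status(time_to_ph_455_h, shear_stress_pa,
                         max_time_h=15.0, preferred_time_h=12.0,
                         min_shear_pa=40.0):
    """time_to_ph_455_h 为 None 表示未能达到 pH 4.55。"""
    if time_to_ph_455_h is None or shear_stress_pa < min_shear_pa:
        return "no"
    if time_to_ph_455_h <= preferred_time_h:
        return "preferred"
    if time_to_ph_455_h <= max_time_h:
        return "qualifies"
    return "no"
```

例如,一株在11h达到pH 4.55且剪切应力为56Pa的假想菌株返回 "preferred";同样的剪切应力但耗时17h则返回 "no"。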
在优选的实施方案中,协同酸化菌株或辅助菌株是包含能够产生胞外多糖(EPS)的活性eps基因簇的乳酸菌菌株乳酸乳球菌,其中eps基因簇包含(xi)限定的核苷酸序列(a)、(b)和(c)((a)至(c4)):
(xi)(a):核苷酸序列,其编码具有聚合酶活性并且与SEQ ID NO:183的核苷酸6955-8145(本文称为wzy)编码的氨基酸序列具有至少95%同一性的多肽;
(b):核苷酸序列,其编码具有多糖转运蛋白活性并且与SEQ ID NO:183的核苷酸9309-10727(本文称为wzx)编码的氨基酸序列具有至少95%同一性的多肽;和
(c):编码具有糖基转移酶(GT)活性的多肽的核苷酸序列,其包含:
(c1):核苷酸序列,其与SEQ ID NO:183的核苷酸4008-4478编码的氨基酸序列(本文称为GT1)具有至少95%的同一性;
(c2):核苷酸序列,其与SEQ ID NO:183的核苷酸4478-4960编码的氨基酸序列(本文称为GT2)具有至少95%的同一性;
(c3):核苷酸序列,其与SEQ ID NO:183的核苷酸5015-5965编码的氨基酸序列(本文称为GT3)具有至少95%的同一性;和
(c4):核苷酸序列,其与SEQ ID NO:183的核苷酸6026-6955编码的氨基酸序列(本文称为GT4)具有至少95%的同一性。
在进一步优选的实施方案中,乳酸菌菌株乳酸乳球菌包含能够产生胞外多糖(EPS)的活性eps基因簇,其中eps基因簇如(xii)所限定:
(xii)SEQ ID NO.:290。
技术人员应当能够找到适用于本发明的其他协同酸化菌株或辅助菌株。例如,合适的协同酸化菌株或辅助菌株可以是包含能够产生胞外多糖(EPS)的活性eps基因簇的乳酸菌菌株乳酸乳球菌,其中eps基因簇如SEQ ID NO.:291-294中所限定,其中协同酸化菌株或辅助菌株能够:(i)在约15h或更短的时间内,优选在约12h或更短的时间内生成pH为约4.55的发酵乳,如上所述测量,并且能够(ii)生成以300s-1的剪切速率测量时剪切应力为40Pa或更高的发酵乳,如上所述所测量。
例如,在本发明的上下文中,以下菌株也可以用作协同酸化菌株或辅助菌株:菌株DSM 33193、DSM 33133、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33203、DSM 33204、DSM 33205、DSM 33218、DSM 33219、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33140、DSM 33142和/或DSM 33137,优选菌株DSM 33193、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33205、DSM 33218、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225和/或DSM 33137。
因此,本发明还提供了以下菌株中任一菌株作为协同酸化菌株或辅助菌株的用途:DSM 33193、DSM 33133、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33203、DSM 33204、DSM 33205、DSM 33218、DSM 33219、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33140、DSM 33142和/或DSM 33137,优选下述菌株中任一菌株作为协同酸化菌株或辅助菌株的用途:DSM 33193、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33205、DSM 33218、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225和/或DSM 33137。
此外,本文所述的质构化乳酸菌菌株(i)至(x)是,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约27Pa、28Pa、29Pa、30Pa、32Pa、35Pa、37Pa、42Pa、47Pa、54Pa、59Pa或更高的发酵乳的LAB菌株:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH为约4.55(或更高,参见表3,如果菌株在较高pH,例如pH4.55、4.48、4.71、4.64、4.68、4.58、4.4、4.56、4.58或4.86下停止酸化),然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(i),其优选为乳酸乳球菌菌株DSM 33134或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于40Pa,优选大于45Pa,例如为约46Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(i),其优选为乳酸乳球菌菌株DSM 33134或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((i)的LAB∶LAB菌株DSM 25485),在下述条件下生成在300s-1的剪切速率下剪切应力大于45Pa,优选大于50Pa,例如为约53Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(i),其优选为乳酸乳球菌菌株DSM 33134或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于24Pa,优选大于30Pa,例如为约37Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.56,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(ii),其优选为乳酸乳球菌菌株DSM 33135或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于40Pa,例如为约43Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(ii),其优选为乳酸乳球菌菌株DSM 33135或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((ii)的LAB∶LAB菌株DSM 25485),在下述条件下生成在300s-1的剪切速率下剪切应力大于45Pa、优选大于50Pa、更优选大于60Pa,例如为约65Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并在300s-1的剪切速率下测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(ii),其优选为乳酸乳球菌菌株DSM 33135或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约30Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.68,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(iii),其优选为乳酸乳球菌菌株DSM 33136或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于40Pa,例如为约41Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(iii),其优选为乳酸乳球菌菌株DSM 33136或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((iii)的LAB∶LAB菌株DSM 25485),在下述条件下生成在300s-1的剪切速率下剪切应力大于45Pa、优选大于50Pa、更优选大于55Pa,例如为约60Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(iii),其优选为乳酸乳球菌菌株DSM 33136或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约29Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.64,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(iv),其优选为乳酸乳球菌菌株DSM 33137或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于40Pa,例如为约45Pa,优选大于45Pa,例如为约48Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(iv),其优选为乳酸乳球菌菌株DSM 33137或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((iv)的LAB∶LAB菌株DSM 25485),在下述条件下,生成在300s-1的剪切速率下剪切应力大于45Pa,例如为约48Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(iv),其优选为乳酸乳球菌菌株DSM 33137或其突变体或变体,在下述条件下生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约30Pa,优选大于30Pa,例如为约35Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.40,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(v),其优选为乳酸乳球菌菌株DSM 33138或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于45Pa、优选大于50Pa,例如为约55Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(v),其优选为乳酸乳球菌菌株DSM 33138或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((v)的LAB∶LAB菌株DSM 25485),在下述条件下生成在300s-1的剪切速率下剪切应力大于45Pa、优选大于50Pa、更优选大于60Pa、甚至更优选大于65Pa,例如为约67Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(v),其优选为乳酸乳球菌菌株DSM 33138或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约27Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.71,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(vi),其优选为乳酸乳球菌菌株DSM 33139或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((vi)的LAB∶LAB菌株DSM 25485),在下述条件下,生成在300s-1的剪切速率下剪切应力大于40Pa、优选大于45Pa,更优选大于50Pa,例如为约52Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(vi),其优选为乳酸乳球菌菌株DSM 33139或其突变体或变体,在下述条件下,生产在300s-1的剪切速率下剪切应力大于24Pa、优选大于30Pa、更优选大于35Pa,例如为约42Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.58,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(vii),其优选为乳酸乳球菌菌株DSM 33140或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于40Pa、优选大于45Pa,例如为约46Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
质构化乳酸菌菌株(vii),其优选为乳酸乳球菌菌株DSM 33140或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((vii)的LAB∶LAB菌株DSM 25485),在下述条件下,生成在300s-1的剪切速率下剪切应力大于45Pa、优选大于50Pa、更优选大于55Pa、甚至更优选大于60Pa,例如为约64Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(vii),其优选为乳酸乳球菌菌株DSM 33140或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约28Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(viii),其优选为乳酸乳球菌菌株DSM 33141或其突变体或变体,在存在协同酸化菌株DSM 25485的情况下,优选以约9∶1的比例((viii)的LAB∶LAB菌株DSM 25485),在下述条件下,生成在300s-1的剪切速率下剪切应力大于40Pa,例如为约41Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(viii),其优选为乳酸乳球菌菌株DSM 33141或其突变体或变体,在下述条件下,生成在300s-1的剪切速率下剪切应力大于24Pa、优选大于30Pa,例如为约32Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.58,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(ix),其优选为乳酸乳球菌菌株DSM 33142或其突变体或变体,在下述条件下,生成在300s-1的剪切速率下剪切应力大于45Pa、优选大于50Pa、更优选大于55Pa,例如为约56Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(ix),其优选为乳酸乳球菌菌株DSM 33142或其突变体或变体,在下述条件下,生成在300s-1的剪切速率下剪切应力大于24Pa、优选大于30Pa、更优选大于40Pa,例如为约42Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(x),其优选为乳酸乳球菌菌株DSM 33183或其突变体或变体,在下述条件下,生成在300s-1的剪切速率下剪切应力大于60Pa、优选大于65Pa、更优选大于70Pa,例如为约72Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
例如,质构化乳酸菌菌株(x),其优选为乳酸乳球菌菌株DSM 33183或其突变体或变体,在下述条件下,生成在300s-1的剪切速率下剪切应力大于24Pa、优选大于30Pa、更优选大于50Pa,例如为约59Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.86,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
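上文逐株给出了半脱脂乳中单独发酵以及与DSM 25485共同发酵时的剪切应力示例值。下面的草图把这些“约X Pa”的数值整理成字典,并计算加入协同酸化菌株后的增量。数值均取自上文各段的“例如为约…Pa”;DSM 33139和DSM 33141在上文只给出了共同发酵的数值,DSM 33142和DSM 33183只给出了单独发酵的数值,因此无法计算增量。变量名和函数名为示例用的假设名称。

```python
# 示意草图:汇总上文给出的半脱脂乳剪切应力示例值(Pa,300 s^-1)。
# 数据取自正文各段的 "例如为约…Pa";缺失的组合不作推测。

ALONE_PA = {
    "DSM 33134": 46, "DSM 33135": 43, "DSM 33136": 41, "DSM 33137": 48,
    "DSM 33138": 55, "DSM 33140": 46, "DSM 33142": 56, "DSM 33183": 72,
}

WITH_DSM_25485_PA = {
    "DSM 33134": 53, "DSM 33135": 65, "DSM 33136": 60, "DSM 33137": 48,
    "DSM 33138": 67, "DSM 33139": 52, "DSM 33140": 64, "DSM 33141": 41,
}


def gain_with_helper(strain: str):
    """返回共同发酵相对单独发酵的剪切应力增量;任一数值缺失则返回 None。"""
    if strain in ALONE_PA and strain in WITH_DSM_25485_PA:
        return WITH_DSM_25485_PA[strain] - ALONE_PA[strain]
    return None


def strains_ranked_by_gain():
    """按增量从大到小排列两类数值齐全的菌株。"""
    gains = {s: gain_with_helper(s) for s in ALONE_PA
             if gain_with_helper(s) is not None}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)
```

按这些示例值,增量最大的是DSM 33135(43Pa升至65Pa),而DSM 33137在两种情况下均约为48Pa。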
本发明还提供了选自以下菌株的乳酸乳球菌菌株:
(i)-乳酸乳球菌菌株DSM 33134和衍生自DSM 33134的菌株,其中衍生菌株的特征在于具有与DSM 33134至少相同的质构化能力;
(ii)-乳酸乳球菌菌株DSM 33135和衍生自DSM 33135的菌株,其中衍生菌株的特征在于具有与DSM 33135至少相同的质构化能力;
(iii)-乳酸乳球菌菌株DSM 33136和衍生自DSM 33136的菌株,其中衍生菌株的特征在于具有与DSM 33136至少相同的质构化能力;
(iv)-乳酸乳球菌菌株DSM 33137和衍生自DSM 33137的菌株,其中衍生菌株的特征在于具有与DSM 33137至少相同的质构化能力;
(v)-乳酸乳球菌菌株DSM 33138和衍生自DSM 33138的菌株,其中衍生菌株的特征在于具有与DSM 33138至少相同的质构化能力;
(vi)-乳酸乳球菌菌株DSM 33139和衍生自DSM 33139的菌株,其中衍生菌株的特征在于具有与DSM 33139至少相同的质构化能力;
(vii)-乳酸乳球菌菌株DSM 33140和衍生自DSM 33140的菌株,其中衍生菌株的特征在于具有与DSM 33140至少相同的质构化能力;
(viii)-乳酸乳球菌DSM 33141和衍生自DSM 33141的菌株,其中衍生菌株的特征在于具有与DSM 33141至少相同的质构化能力;
(ix)-乳酸乳球菌菌株DSM 33142和衍生自DSM 33142的菌株,其中衍生菌株的特征在于具有与DSM 33142至少相同的质构化能力;
(x)-乳酸乳球菌菌株DSM 33183和衍生自DSM 33183的菌株,其中衍生菌株的特征在于具有与DSM 33183至少相同的质构化能力。
此外,本发明提供乳酸乳球菌菌株DSM 33192和衍生自DSM 33192的菌株,其中衍生菌株的特征在于具有与DSM 33192至少相同的质构化能力。DSM 33192的eps基因簇在可变部分和保守部分中包含以下基因:
可变部分:wzy(SEQ ID NO.:194)、wzx(SEQ ID NO.:196)、编码GT1的基因(SEQ ID NO.:190)、编码GT2的基因(SEQ ID NO.:191)、编码GT3的基因(SEQ ID NO.:192)、编码GT4的基因(SEQ ID NO.:193)和编码甘油磷酸转移酶家族蛋白的基因(SEQ ID NO.:195)。
保守部分:epsR(SEQ ID NO.:184)、epsX(SEQ ID NO.:185)、epsB(SEQ ID NO.:188)、epsD(SEQ ID NO.:187)、epsL(SEQ ID NO.:197)、编码LytR蛋白的基因(SEQ ID NO.:198)、epsC(SEQ ID NO.:186)和epsE(SEQ ID NO.:189)。
菌株DSM 33192或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于70Pa、优选大于80Pa、更优选大于85Pa、甚至更优选大于90Pa,例如为约94Pa的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
菌株DSM 33192或其突变体或变体,在下述条件下,生成以300s-1的剪切速率测量时剪切应力大于24Pa、优选大于30Pa、更优选大于40Pa、甚至更优选大于45Pa,例如为约47Pa的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
包含本发明的LAB的组合物
在第二方面,本发明提供了包含一种或多种本发明第一方面所述的本发明的质构化乳酸乳球菌菌株的组合物。
因此,本发明的组合物包含上文在本发明第一方面的上下文中描述的LAB(i)至(x)中的至少一种。
特别是,本发明提供了包含一种或多种本发明第一方面所述的本发明的质构化乳酸乳球菌菌株和本发明第一方面限定的协同酸化菌株或辅助菌株的组合物。在优选的实施方案中,本发明的组合物包含一种或多种本发明第一方面所述的本发明的质构化乳酸乳球菌菌株和本发明第一方面限定的协同酸化菌株或辅助菌株,两者的比例为约9∶1(本发明的LAB菌株:协同酸化菌株或辅助菌株)。
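作为一个简单的算例,下面的草图按约9∶1的比例拆分接种物。需要注意两点假设:正文没有说明该比例是按体积、细胞数还是其他量计算,这里仅以过夜培养物的体积为例;总接种量2ml和乳量200ml取自第一方面所述的测量条件。函数名为示例用的假设名称。

```python
# 示意草图:按 9:1 拆分接种物体积,并计算接种率。
# 假设:比例按过夜培养物体积计(正文未指明计量基准)。

def split_inoculum(total_ml=2.0, texturizing_parts=9, helper_parts=1):
    """返回 (质构化菌株体积, 辅助菌株体积),单位 ml。"""
    if texturizing_parts < 0 or helper_parts < 0:
        raise ValueError("parts must be non-negative")
    total_parts = texturizing_parts + helper_parts
    if total_parts == 0:
        raise ValueError("at least one part is required")
    return (total_ml * texturizing_parts / total_parts,
            total_ml * helper_parts / total_parts)


def inoculation_rate_percent(inoculum_ml=2.0, milk_ml=200.0):
    """接种物体积相对于乳体积的百分比(v/v)。"""
    return 100.0 * inoculum_ml / milk_ml
```

在这些假设下,2ml接种物拆分为约1.8ml质构化菌株培养物和约0.2ml辅助菌株培养物,相对于200ml乳的接种率约为1%(v/v)。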
优选,本发明的组合物包含至少一种本发明第一方面的乳酸乳球菌乳酸菌菌株和一种或多种其他乳酸菌菌株,其中一种或多种其他乳酸菌菌株能够:
i)在下述条件下测量时,在约15h或更短的时间内,优选在约12h或更短的时间内生成pH为约4.55的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度(30℃),并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到达到约4.55的pH。因此,可以计算某种乳酸菌菌株的“达到pH 4.55的时间”;
ii)在下述条件下测量时,生成以300s-1的剪切速率测量时剪切应力为40Pa或更高的发酵乳:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55(达到pH 4.55的时间),然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。
更优选,本发明的组合物包含至少一种根据本发明第一方面的乳酸乳球菌乳酸菌菌株(i)至(x)与(a)至少一种包含能够产生胞外多糖(EPS)的活性eps基因簇的乳酸菌菌株乳酸乳球菌的组合,其中eps基因簇包含(xi)限定的核苷酸序列(a)、(b)和(c)(a至c4),或与(b)包含能够产生胞外多糖(EPS)的活性eps基因簇的乳酸菌菌株乳酸乳球菌的组合,其中eps基因簇如(xii)中所限定:
(xi)(a):核苷酸序列,其编码具有聚合酶活性并且与SEQ ID NO:183的核苷酸6955-8145(本文称为wzy)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%同一性的多肽;
(b):核苷酸序列,其编码具有多糖转运蛋白活性并且与SEQ ID NO:183的核苷酸9309-10727(本文称为wzx)编码的氨基酸序列具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%同一性的多肽;和
(c):编码具有糖基转移酶(GT)活性的多肽的核苷酸序列,其包含:
(c1):核苷酸序列,其与SEQ ID NO:183的核苷酸4008-4478编码的氨基酸序列(本文称为GT1)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(c2):核苷酸序列,其与SEQ ID NO:183的核苷酸4478-4960编码的氨基酸序列(本文称为GT2)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(c3):核苷酸序列,其与SEQ ID NO:183的核苷酸5015-5965编码的氨基酸序列(本文称为GT3)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;和
(c4):核苷酸序列,其与SEQ ID NO:183的核苷酸6026-6955编码的氨基酸序列(本文称为GT4)具有至少70%、优选至少85%、更优选至少95%、甚至更优选至少98%、最优选100%的同一性;
(xii)SEQ ID NO.:290。
在另一优选的实施方案中,本发明的组合物包含至少一种本发明第一方面的乳酸乳球菌乳酸菌菌株和一种或多种包含能够产生胞外多糖(EPS)的活性eps基因簇的乳酸菌菌株,其中eps基因簇如SEQ ID NO.:291-294所限定,其中其他菌株能够(i),如上文限定测量时,在约15h或更短的时间内,优选在约12h或更短的时间内,生成pH为约4.55的发酵乳,并且能够(ii)如上文所述测量时,生成以300s-1的剪切速率测量时剪切应力为40Pa或更高的发酵乳。
在另一优选的实施方案中,本发明的组合物包含至少一种根据本发明第一方面的乳酸乳球菌乳酸菌菌株(i)至(x),和一种或多种选自以下的乳酸菌菌株:DSM 33193、DSM 33133、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33203、DSM 33204、DSM 33205、DSM 33218、DSM 33219、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33140、DSM 33142、DSM 33137、DSM 33192和/或DSM 25485,优选选自菌株DSM 33193、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33205、DSM 33218、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33137、DSM 33192和/或DSM 25485。
更优选,本发明的组合物包含在本发明第一方面的上下文中如上描述的LAB(i)至(x)中的至少一种,优选一种,和
(i)LAB菌株乳酸乳球菌乳脂亚种DSM 25485或其突变体或变体;和/或
(ii)乳酸菌菌株乳酸乳球菌乳酸亚种DSM 33192或其突变体或变体;和/或
(iii)乳酸菌菌株乳酸乳球菌DSM 33133或其突变体或变体。
例如,本发明的组合物包含菌株DSM 33134和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33135和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33136和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33137和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33138和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33139和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33140和菌株DSM 25485。例如,本发明的组合物包含菌株DSM 33141和菌株DSM 25485。
例如,本发明的组合物可以包含菌株DSM 33134、DSM 33135、DSM 33136、DSM 33137、DSM 33138、DSM 33139、DSM 33140、DSM 33142和/或DSM 33141中的一种或多种和一种或多种本发明第一方面限定的协同酸化菌株或辅助菌株,优选一种或多种以下菌株:DSM 25485、DSM 33192和/或DSM 33133。
在另一实施方案中,本发明的组合物可以包含菌株DSM 33134和菌株DSM 24649。在另一个实施方案中,本发明的组合物可以包含菌株DSM 33139和菌株DSM 24649。
优选,本发明的组合物在其任何实施方案中都包含至少1×10⁶ CFU(菌落形成单位)/ml总LAB菌株。优选,该组合物可包含至少1×10⁸ CFU/ml的至少一种、优选一种本发明的乳酸菌菌株。
在另一个实施方案中,本发明的组合物还可以包含至少一种本发明第一方面的乳酸乳球菌乳酸菌菌株和酵母提取物,优选酵母提取物的量为0.2%。酵母提取物可以从技术人员可获得的任何来源获得,例如从Procelys获得(例如,酵母提取物545MG,批号0005115910,批次AD 18 A05030)。例如,本发明的组合物可以包含菌株DSM 33134、DSM 33137、DSM 33139和/或DSM 33140中的一种或多种和酵母提取物,如上所述,优选酵母提取物的量为0.2%。从表1可以看出,酵母提取物的存在可以缩短如上所述测量的达到pH 4.55的时间,并可以使发酵乳具有更高的剪切应力。
如上文在本发明第一方面的上下文中所述,本发明的LAB,单独或与协同酸化菌株或辅助菌株组合,优选与乳酸乳球菌菌株DSM 25485组合,能够生成具有高剪切应力的发酵乳。因此,本发明的组合物能够产生与针对本发明的LAB(如本发明第一方面的上下文所述,单独或在存在协同酸化菌株或辅助菌株的情况下)所述至少相同的剪切应力。
乳酸菌,包括乳球菌属物种的细菌,通常作为用于批量起子繁殖的冷冻(F-DVS)或冻干(FD-DVS)培养物或作为所谓的“直投式”(Direct Vat Set)(DVS)培养物供应给乳制品工业,用于直接接种到发酵容器或发酵槽中以产生乳制品,例如发酵乳制品。这种乳酸菌培养物通常称为“起子培养物”或“起子”。因此,本发明的组合物可以是冷冻的或冻干的。此外,本发明的组合物可以液体形式提供。因此,在一个实施方案中,组合物是冷冻的、干燥的、冻干的或液体形式。
本发明的组合物还可以包含冷冻保护剂、冻干保护剂、抗氧化剂、营养素、填充剂、调味剂或其混合物。该组合物优选包含冷冻保护剂、冻干保护剂、抗氧化剂和/或营养素中的一种或多种,更优选冷冻保护剂、冻干保护剂和/或抗氧化剂,最优选冷冻保护剂或冻干保护剂,或两者。诸如冷冻保护剂和冻干保护剂等保护剂的使用是本领域技术人员已知的。合适的冷冻保护剂或冻干保护剂包括单糖、二糖、三糖和多糖(例如葡萄糖、甘露糖、木糖、乳糖、蔗糖、海藻糖、棉子糖、麦芽糖糊精、淀粉和阿拉伯树胶(阿拉伯胶)等)、多元醇(例如赤藓糖醇、甘油、肌醇、甘露醇、山梨糖醇、苏糖醇、木糖醇等)、氨基酸(例如脯氨酸、谷氨酸)、复合物(例如脱脂乳、蛋白胨、明胶、酵母提取物)和无机化合物(如三聚磷酸钠)。
在一个实施方案中,根据本发明的组合物可以包含一种或多种选自由以下组成的组的冷冻保护剂:肌苷-5′-单磷酸(IMP)、腺苷-5′-单磷酸(AMP)、鸟苷-5′-单磷酸(GMP)、尿苷-5′-单磷酸(UMP)、胞苷-5′-单磷酸(CMP)、腺嘌呤、鸟嘌呤、尿嘧啶、胞嘧啶、腺苷、鸟苷、尿苷、胞苷、次黄嘌呤、黄嘌呤、次黄嘌呤、乳清苷、胸苷、肌苷和任何此类化合物的衍生物。合适的抗氧化剂包括抗坏血酸、柠檬酸及其盐、没食子酸盐、半胱氨酸、山梨糖醇、甘露糖醇、麦芽糖。合适的营养素包括糖、氨基酸、脂肪酸、矿物质、微量元素、维生素(例如维生素B族、维生素C)。该组合物可以任选地包含其他物质,包括填充剂(例如乳糖、麦芽糖糊精)和/或调味剂。
在本发明的一个实施方案中,冷冻保护剂是除了冷冻保护性之外还具有加强作用的试剂或试剂混合物。
表述“加强作用”用于描述,冷冻保护剂在将解冻或重构的培养物接种到待发酵或转化的培养基中时赋予该培养物增加的代谢活性(加强作用)的情况。活力和代谢活性不是同义词。商业冷冻或冻干培养物可能会保留其活力,尽管它们可能已经失去了很大一部分代谢活性,例如,即使保存较短的时间,培养物也可能失去其产酸(酸化)活性。因此,必须通过不同的测定来评估活力和加强作用。活力是通过活力测定来评估的,例如确定菌落形成单位,而加强作用是通过量化解冻或重构培养物相对于培养物活力的相关代谢活性来评估的。术语“代谢活性”是指培养物的除氧活性,其产酸活性,即产生例如乳酸、乙酸、甲酸和/或丙酸,或其产生代谢物的活性,例如产生诸如乙醛等芳香化合物(α-乙酰乳酸、乙偶姻、二乙酰和2,3-丁二醇(丁二醇))。
在一个实施方案中,以材料的%w/w测量时,本发明的组合物含有或包含0.2%至20%的冷冻保护剂或试剂混合物。然而,按重量以冷冻材料的%w/w测量,优选按重量计以0.2%至15%、0.2%至10%、0.5%至7%和1%至6%的量添加冷冻保护剂或试剂混合物,包括2%至5%的冷冻保护剂或试剂混合物。在优选的实施方案中,按重量以材料的%w/w测量,培养物包含约3%的冷冻保护剂或试剂混合物。冷冻保护剂约3%的量对应于100mM范围的浓度。应当认识到,对于本发明实施方案的每个方面,范围可以是所述范围的增量。
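正文称约3%(w/w)的冷冻保护剂对应于100mM范围的浓度。下面的草图给出这一换算的算术关系:浓度(mol/L)=(质量分数×密度)÷摩尔质量。这里假设材料密度约为1000 g/L(正文未给出),并以一个假想的摩尔质量300 g/mol为例;函数名同样是示例用的假设名称。

```python
# 示意草图:%w/w 与 mM 之间的换算。
# 假设:材料密度约 1000 g/L;示例摩尔质量 300 g/mol 为假想值。

def percent_ww_to_millimolar(percent_ww, molar_mass_g_per_mol,
                             density_g_per_l=1000.0):
    """由质量分数(%w/w)和摩尔质量估算浓度(mM)。"""
    if molar_mass_g_per_mol <= 0:
        raise ValueError("molar mass must be positive")
    grams_per_litre = percent_ww / 100.0 * density_g_per_l
    return grams_per_litre / molar_mass_g_per_mol * 1000.0


def implied_molar_mass(percent_ww, millimolar, density_g_per_l=1000.0):
    """由质量分数和浓度(mM)反推所隐含的摩尔质量(g/mol)。"""
    if millimolar <= 0:
        raise ValueError("concentration must be positive")
    grams_per_litre = percent_ww / 100.0 * density_g_per_l
    return grams_per_litre / (millimolar / 1000.0)
```

在上述密度假设下,3%(w/w)约合30 g/L;若要对应约100mM,所隐含的摩尔质量约为300 g/mol,这与核苷酸单磷酸类化合物的量级大致相符。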
在一个实施方案中,本发明的组合物可以包含增稠剂和/或稳定剂,例如果胶(例如HM果胶、LM果胶)、明胶、CMC、大豆纤维(Soya Bean Fiber)/大豆聚合物(Soya Bean Polymer)、淀粉、改性淀粉、角叉菜胶、藻酸盐和瓜尔豆胶。
在微生物产生在酸化乳制品中引起高/黏稠质构的多糖(例如EPS)的一个实施方案中,酸化乳制品以基本上不含或完全不含任何添加的增稠剂和/或稳定剂的方式生产,所述增稠剂和/或稳定剂例如果胶(例如HM果胶、LM果胶)、明胶、CMC、大豆纤维/大豆聚合物、淀粉、改性淀粉、角叉菜胶、藻酸盐和瓜尔豆胶。“基本上不含”应当理解为,该制品包含0%至20%(w/w)(例如0%至10%、0%至5%或0%至2%或0%至1%)的增稠剂和/或稳定剂。
LAB菌株增加发酵乳制品黏度的用途
在第三方面,本发明提供了第一方面所述本发明的LAB和/或第二方面所述本发明的组合物用于增加发酵乳制品黏度的用途。因此,在第三方面,本发明提供了用于增加发酵乳制品黏度(即用于改善质构)的方法,其中该方法包括使用第一方面所述本发明的LAB和/或使用第二方面所述本发明组合物。
如上所述,第一方面所述本发明的LAB菌株(i)至(x)和第二方面所述本发明的组合物,在下述条件下,优选在存在优选选自DSM 25485、DSM 33192和/或DSM 33133的协同酸化菌株或辅助菌株、甚至更优选菌株DSM 25485的情况下,能够生成以300s-1的剪切速率测量时剪切应力大于40Pa的发酵乳,例如为约41Pa、42Pa、43Pa、44Pa、45Pa、46Pa或更高,优选本发明的LAB菌株/组合物生成剪切应力为54Pa或更高的发酵乳,例如为约48Pa、52Pa、53Pa、60Pa、64Pa、65Pa、66Pa、67Pa、70Pa或72Pa:
将200ml半脱脂乳(1.5%脂肪)加热至90℃,持续20min,然后冷却至接种温度,并接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH 4.55,然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例1所示的方法测量剪切应力。
如上所述,第一方面所述本发明的LAB菌株(i)至(x)和第二方面所述本发明的组合物,在下述条件下,能够生成以300s-1的剪切速率测量时剪切应力大于24Pa,例如为约27Pa、28Pa、29Pa、30Pa、32Pa、35Pa、37Pa、42Pa、47Pa、54Pa、59Pa或更大的发酵乳:
将200ml补充有2%葡萄糖的豆乳(如实施例2所述)接种2ml乳酸菌菌株的过夜培养物,并保持在接种温度下,直到pH为约4.55(例如pH4.55、4.48、4.71、4.64、4.68、4.58、4.4、4.56、4.58或4.86),然后在4℃下储存,直到测量剪切应力,通常储存1-7天,例如5天,然后轻轻搅拌并以300s-1的剪切速率测量剪切应力,其中接种温度为30℃。使用实施例2所示的方法测量剪切应力。
对于用本发明的具体LAB菌株单独或在存在酸化LAB菌株的情况下发酵的乳的具体剪切应力,我们参考本发明的第一方面。
如在本发明第一方面的上下文中所讨论的,本发明的一些LAB菌株能够在15h或更短的时间内酸化哺乳动物乳,如本发明第一方面所述测量(“快速酸化”菌株)。因此,这些菌株可以优选地单独使用或与其他菌株组合使用,用于生成发酵乳,特别是用于生成黏度增加的发酵乳。
此外,有一些本发明的LAB菌株不能如本发明第一方面所述测量的在15h或更短的时间内酸化哺乳动物乳。它们可称为“慢酸化”菌株(例如,DSM 33134、33135、DSM 33136、33138、33139、33141和/或33183)。可以在本发明第一方面限定的协同酸化菌株或辅助菌株存在的情况下有利地使用这些菌株。特别是,可以在菌株DSM 25485和/或菌株DSM 33192和/或菌株DSM 33133存在的情况下有利地使用这些菌株。如上所述,优选,一种或多种本发明第一方面所述本发明的质构化乳酸乳球菌菌株和本发明第一方面限定的协同酸化菌株或辅助菌株,以约9∶1的比例(本发明的LAB菌株:协同酸化菌株或辅助菌株)组合使用。
如表1所示,当用本发明的LAB菌株之一和协同酸化菌株DSM 25485发酵乳时,乳的剪切应力值增加和/或“达到pH 4.55的时间”减少。在不受理论限制的情况下,据信,如上所述,DSM 25485的蛋白水解性质允许和/或促进本发明的LAB的生长。此外,据信由DSM 25485产生的EPS和由本发明的菌株产生的EPS的组合,导致所观察到的作为剪切应力测量的发酵乳的黏度提高,如上所述。
在不受理论限制的情况下,据信当将乳与上文所述本发明的LAB(i)至(x)之一和菌株DSM 33192一起孵育时,也会获得用本发明LAB之一和协同酸化菌株DSM 25485发酵的乳的剪切应力增加的效果(参见表1)。菌株DSM 33192也是蛋白水解菌株,并且产生结构与菌株DSM 25485产生的EPS结构相似的EPS。
此外,在不受理论限制的情况下,当乳与上文所述的本发明LAB(i)至(x)之一和一种或多种以下菌株一起孵育时,据信也会获得用本发明的LAB之一和协同酸化菌株DSM 25485发酵的乳的剪切应力增加的效果(参见表1):DSM 33193、DSM 33133、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33203、DSM 33204、DSM 33205、DSM 33218、DSM 33219、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33140、DSM 33142、DSM 33137、DSM 33192和/或DSM 25485,优选一种或多种下述菌株:DSM 33193、DSM 33196、DSM 33197、DSM 33200、DSM 33201、DSM 33205、DSM 33218、DSM 33220、DSM 33221、DSM 33222、DSM 33224、DSM 33225、DSM 33137、DSM 33192和/或DSM 25485。这些菌株能够:
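i) generating fermented milk with a pH of about 4.55 in about 15 h or less, preferably in about 12 h or less, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, cooled to the inoculation temperature (30°C), inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and kept at the inoculation temperature until a pH of about 4.55 is reached. The "time to reach pH 4.55" of a given lactic acid bacteria strain can thus be calculated; and
ii) generating fermented milk with a shear stress of 40 Pa or higher measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and kept at the inoculation temperature until pH 4.55 is reached (the time to reach pH 4.55). It is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹. The inoculation temperature is 30°C.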
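The two criteria above can be expressed as a small calculation. The following is a minimal sketch, not the method used in the patent: it assumes the pH is logged at discrete time points and estimates the "time to reach pH 4.55" by linear interpolation between the two readings that bracket the target. The function names and the example readings are hypothetical.

```python
def time_to_ph(times_h, ph_values, target=4.55):
    """Estimate when a pH curve first reaches `target` (linear interpolation).

    Returns None if the target pH is never reached in the recorded data.
    """
    if len(times_h) != len(ph_values):
        raise ValueError("times_h and ph_values must have the same length")
    if not ph_values:
        return None
    if ph_values[0] <= target:
        return times_h[0]
    for i in range(1, len(times_h)):
        p0, p1 = ph_values[i - 1], ph_values[i]
        if p0 > target >= p1:
            t0, t1 = times_h[i - 1], times_h[i]
            return t0 + (p0 - target) * (t1 - t0) / (p0 - p1)
    return None


def meets_criteria(time_h, shear_stress_pa, max_time_h=15.0, min_shear_pa=40.0):
    """Check criteria i) (pH 4.55 within 15 h) and ii) (shear stress >= 40 Pa)."""
    return time_h is not None and time_h <= max_time_h and shear_stress_pa >= min_shear_pa


if __name__ == "__main__":
    # Hypothetical pH readings, for illustration only.
    t = time_to_ph([0, 4, 8, 12], [6.6, 5.5, 4.7, 4.4])
    print(t, meets_criteria(t, 56.0))
```

The thresholds are exposed as parameters so that the preferred 12 h limit from criterion i) can be checked by passing `max_time_h=12.0`.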
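In a specific embodiment of the third aspect, the invention provides the use of Lactococcus lactis subsp. cremoris strain DSM 25485 for increasing the viscosity of a fermented milk product.
The inventors have found that strain DSM 25485 generates fermented milk with a shear stress greater than 45 Pa, preferably greater than 50 Pa, more preferably greater than 55 Pa, for example 56 Pa, measured at a shear rate of 300 s⁻¹ under the following conditions (see Table 1):
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and kept at the inoculation temperature until pH 4.55 is reached. It is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹. The inoculation temperature is 30°C. Shear stress is measured using the method shown in Example 1.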
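Furthermore, the inventors have surprisingly found that strain DSM 25485 generates fermented milk with a shear stress greater than 24 Pa, preferably greater than 30 Pa, more preferably greater than 50 Pa, for example 54 Pa, measured at a shear rate of 300 s⁻¹ under the following conditions (see Table 2):
200 ml of soy milk supplemented with 2% glucose (as described in Example 2) is inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain and kept at the inoculation temperature until pH 4.55 is reached. It is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹. The inoculation temperature is 30°C. Shear stress is measured using the method shown in Example 2.
In this embodiment, strain DSM 25485 can advantageously be used alone and/or in combination with one or more of the LAB (i) to (x) of the invention according to the first aspect. Preferably, strain DSM 25485 is used in combination with one or more LAB of the invention according to the first aspect at a ratio of about 9:1 (LAB strain of the invention : strain DSM 25485).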
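In another specific embodiment of the third aspect, the invention provides the use of Lactococcus lactis subsp. lactis DSM 33192 for increasing the viscosity of a fermented milk product. The inventors have surprisingly found that strain DSM 33192 generates fermented milk with a shear stress greater than 40 Pa, preferably greater than 50 Pa, more preferably greater than 80 Pa, even more preferably greater than 90 Pa, for example 94 Pa, measured at a shear rate of 300 s⁻¹ under the following conditions (see Table 1):
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain, and kept at the inoculation temperature until pH 4.55 is reached. It is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹. The inoculation temperature is 30°C. Shear stress is measured using the method shown in Example 1.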
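Furthermore, the inventors have surprisingly found that strain DSM 33192 generates fermented milk with a shear stress greater than 24 Pa, preferably greater than 30 Pa, more preferably greater than 40 Pa, even more preferably greater than 45 Pa, for example 47 Pa, measured at a shear rate of 300 s⁻¹ under the following conditions (see Table 2):
200 ml of soy milk supplemented with 2% glucose (as described in Example 2) is inoculated with 2 ml of an overnight culture of the lactic acid bacteria strain and kept at the inoculation temperature until pH 4.55 is reached. It is then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, after which it is gently stirred and the shear stress is measured at a shear rate of 300 s⁻¹. The inoculation temperature is 30°C. Shear stress is measured using the method shown in Example 2.
In this embodiment, strain DSM 33192 can advantageously be used alone and/or in combination with one or more of the LAB (i) to (x) of the invention according to the first aspect. Preferably, strain DSM 33192 is used in combination with one or more LAB of the invention according to the first aspect at a ratio of about 9:1 (LAB strain of the invention : strain DSM 33192).
In a specific embodiment of the third aspect, the invention provides the use of Lactococcus lactis subsp. cremoris strain DSM 25485 and/or Lactococcus lactis subsp. lactis strain DSM 33192 as a co-acidifying or helper strain, preferably in combination with other texturing LAB strains such as strains (i) to (x) as defined in the first aspect of the invention, for increasing the viscosity of a dairy product.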
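Method for producing a food product, and food product
In a fourth aspect, the invention relates to a method for producing a food product, the method comprising at least one stage in which at least one lactic acid bacteria strain (i) to (x) as defined in the first aspect of the invention and/or a composition as defined in the second aspect of the invention is used. The food product is produced by methods known to the person skilled in the art.
In another embodiment, the invention relates to a method for producing a food product, the method comprising at least one stage in which the lactic acid bacteria strain Lactococcus lactis subsp. cremoris DSM 25485, or a mutant or variant thereof, is used.
In another embodiment, the invention relates to a method for producing a food product, the method comprising at least one stage in which the lactic acid bacteria strain Lactococcus lactis subsp. lactis DSM 33192, or a mutant or variant thereof, is used.
In the context of the invention, "fermentation" in any of its embodiments means the conversion of carbohydrates into alcohols or acids through the action of microorganisms (LAB). Fermentation processes for producing food products such as dairy products are well known, and the person skilled in the art will know how to select suitable process conditions, such as temperature, oxygen, amount of microorganisms and process time. Clearly, in any of its embodiments, the fermentation conditions are selected so as to support the achievement of the invention, for example to obtain a food product, preferably a food product with improved texture compared with a food product produced by a method that does not involve the use of at least one LAB according to the first aspect of the invention or of a composition according to the second aspect of the invention.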
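In a preferred embodiment, the method of the invention in any of its embodiments comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of total LAB strains. The milk substrate may be a mammalian-based milk substrate or a plant-based milk substrate, such as soy milk, preferably supplemented with, for example, 0.5-5%, preferably 0.5-2%, more preferably 2% glucose.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33134 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33135 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33136 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33138 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33139 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strain DSM 33137. For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33137 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strain DSM 33140. For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33140 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strain DSM 33142.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strains DSM 33141 and DSM 25485.
For example, the method of the invention comprises fermenting a milk substrate with a composition comprising at least 1×10⁶ CFU/ml, preferably at least 1×10⁸ CFU/ml, of strain DSM 33183.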
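In another preferred embodiment, the method in any of its embodiments comprises fermenting a milk substrate with the composition according to the second aspect of the invention.
Preferably, the food product is a dairy product, and the method in any of its embodiments comprises fermenting a milk substrate (also referred to as "milk base" in the context of the invention) with at least one LAB strain of the invention and/or with a composition of the invention (first and second aspects, respectively) and/or with a co-acidifying or helper strain as defined above, preferably with strain DSM 25485 and/or with strain DSM 33192.
Preferably, the food product is a dairy product, and the method in any of its embodiments comprises fermenting a plant-based milk substrate (also referred to as "plant-based milk base" in the context of the invention), for example soy milk, preferably soy milk supplemented with sugar, with at least one LAB strain of the invention and/or with a composition of the invention (first and second aspects, respectively). The sugar may be, for example, fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose.
The food product of the invention may advantageously further comprise a thickener and/or a stabilizer, for example pectin (e.g., HM pectin, LM pectin), gelatin, CMC, soy fiber/soy polymer, starch, modified starch, carrageenan, alginate and guar gum.
In specific embodiments, the food product is a dairy product, a meat product, a vegetable product, a fruit product or a cereal product. In a preferred embodiment, the food product is a dairy product. In another preferred embodiment, the food product is a plant-based food product, for example fermented soy milk.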
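As used herein, the term "dairy product" refers to a food product produced from milk. As described above, in the context of the present application, the term "milk" is used broadly in its ordinary sense to denote the liquid produced by the mammary glands of animals (e.g., cows, sheep, goats, buffaloes, camels, etc.) or by plants. In a preferred embodiment, the milk is cow's milk. According to the invention, the milk may have been processed, and the term "milk" includes whole milk, skim milk, fat-free milk, low-fat milk, full fat milk, lactose-reduced milk and concentrated milk. Fat-free milk is a fat-free or skimmed milk product. Low-fat milk is typically defined as milk containing from about 1% to about 2% fat. Full fat milk typically contains 2% fat or more. The term "milk" is intended to cover milk from different mammalian and plant sources. Mammalian sources of milk include, but are not limited to, cows, sheep, goats, buffaloes, camels, llamas, mares and deer. Plant sources of milk include, but are not limited to, milk extracted from soybeans. In a specific embodiment, the milk is cow's milk. In another specific embodiment, the milk is a plant-based milk, preferably soy milk, which may preferably be supplemented with a sugar, for example fructose, sucrose, high-fructose corn syrup (HFCS), honey, glucose, invert sugar, maltose, galactose, lactose or any combination thereof. The sugar concentration may be 0.5% to 5%, 0.5% to 2%, 0.5%, 1%, 1.5% or 2%, for example 0.5-5% glucose, preferably 0.5-2% glucose, more preferably about 2% glucose.
Preferred dairy products of the invention are fermented milk products and cheese. In a specific embodiment, the dairy product is a mesophilic dairy product.
In a specific embodiment of the invention, the fermented milk product is selected from the group consisting of: buttermilk, sour milk, cultured milk, smetana, sour cream, thick cream, cultured cream, ymer, fermented whey, kefir, Yakult and fresh cheese, such as quark, tvarog and cream cheese. In particular, the fermented milk product is selected from quark, sour cream and kefir. In a preferred embodiment of the invention, the fermented milk product comprises a further food product selected from the group consisting of: fruit beverages, cereal products, fermented cereal products, chemically acidified cereal products, soy milk products, fermented soy milk products and any mixture thereof. In another preferred embodiment, the fermented milk product is a plant-based fermented milk product, for example fermented soy milk (e.g., plantgurt from "Alpro").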
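The fermented milk product typically contains protein at a level of 1.0% to 12.0% by weight, preferably 2.0% to 10.0% by weight. In a specific embodiment, sour cream contains protein at a level of 1.0% to 5.0% by weight, preferably 2.0% to 4.0% by weight. In a specific embodiment, quark contains protein at a level of 4.0% to 12.0% by weight, preferably 5.0% to 10.0% by weight.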
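Preferably, the texture of the food product is improved (improved viscosity, measured as shear stress at 300 s⁻¹ as described herein and, for example, in Example 1) compared with a food product produced by a comparable method that does not involve the use of at least one LAB according to the first aspect of the invention, and/or does not involve the use of a composition according to the second aspect of the invention (in any of its embodiments), and/or does not involve the use of strains DSM 25485 and/or DSM 33192.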
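The invention also relates to a food product, preferably a dairy product, comprising at least one LAB strain according to the first aspect of the invention.
Unless otherwise stated herein or clearly contradicted by context, any combination of the above elements, aspects and embodiments in all possible variations thereof is encompassed by the invention. Embodiments of the invention are described below by way of examples only.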
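Examples
Example 1. High-throughput screening of texturing strains and measurement of milk gel texture
Lactococcus lactis is used to produce a variety of fermented dairy products, including cheese and mesophilic fermented milks such as buttermilk and sour cream. Polysaccharide-producing strains are of great interest for these applications, because polysaccharides released into the medium can improve the textural properties of buttermilk and sour cream, while capsular polysaccharides can improve water-holding capacity and thereby increase, for example, cheese yield.
When fermented with lactic acid bacteria typically belonging to, for example, Streptococcus thermophilus, Lactobacillus spp. and Lactococcus lactis, milk (a liquid) is typically converted into a milk gel (a soft solid). Rheometers or texture analyzers are commonly used to assess the rheological properties of fermented milk gels, such as shear stress. When the texture of a milk gel is evaluated by a sensory panel, shear stress measurements correlate with perceived mouth thickness. High mouth thickness is considered an important quality factor for fermented milk gels such as yogurt, and consumer acceptance is often closely related to textural properties such as mouth thickness, which is a function of shear stress.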
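As described in Poulsen et al., 2019, texturing strains were screened using a Hamilton Robotics MicroLab Star liquid handling unit equipped with a pressure sensor inside the air displacement barrel of each pipette. The pressure sensors of the liquid handler are located in the headspace of each pipetting channel. Pressure data from each sensor were collected by the TADM (Total Aspiration Dispense Monitoring) software of the Hamilton Robotics MicroLab Star liquid handler (Hamilton Robotics) and used to assess the relative shear stress of milk gel samples.
As described above, the TADM tool of the Hamilton liquid handling robot was used to screen, at 2-ml scale, the texturing properties of Lactococcus lactis strains from the high-throughput screening strain library. Pressure-versus-time data (TADM) were obtained from 2 ml samples prepared in 96-well microtiter plates in which, unless otherwise stated, B-milk was incubated with the different strains (1% inoculum) at 30°C for 20 h and then stored at 4°C for 1 day. The Hamilton liquid handling unit was used to measure the pressure during aspiration, and the area above the pressure curve obtained during aspiration was used to compare the texturing ability of the strains.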
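The "area above the pressure curve" comparison can be illustrated with a short numerical sketch. It is not the TADM software's own calculation: it assumes the aspiration pressure is recorded relative to ambient (so values are negative during aspiration), takes a baseline of 0 as the upper bound of the area, and integrates with the trapezoidal rule. The function name, units and example traces are hypothetical.

```python
def area_above_curve(times, pressures, baseline=0.0):
    """Trapezoidal area between `baseline` and a pressure-versus-time curve.

    Assumption: pressures lie at or below `baseline` during aspiration, so a
    thicker sample (stronger underpressure) gives a larger area.
    """
    if len(times) != len(pressures):
        raise ValueError("times and pressures must have the same length")
    area = 0.0
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        h0 = baseline - pressures[i - 1]
        h1 = baseline - pressures[i]
        area += 0.5 * (h0 + h1) * dt
    return area


if __name__ == "__main__":
    # Hypothetical aspiration traces (time in ms, pressure in arbitrary units).
    t = [0, 10, 20, 30]
    thin_sample = [0, -100, -100, 0]
    thick_sample = [0, -200, -200, 0]
    print(area_above_curve(t, thin_sample), area_above_curve(t, thick_sample))
```

Because the screen only ranks strains relative to one another, the choice of baseline and units matters less than applying the same choice to every sample.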
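Shear stress data were obtained by inoculating the same microbial cultures in semi-fat milk (1.5% fat); the milk was heated at 90°C for 20 min and cooled to the inoculation temperature, and then inoculated with 1% by volume of an overnight microbial culture. Incubation was carried out at 30°C on a 200 ml scale for 8-22 h until the pH was about 4.55, followed by cooling to 4°C and storage until the shear stress was measured, typically 1-7 days at 4°C, for example 5 days. After storage, the fermented milk was gently stirred with a rod fitted with a perforated disc until the sample was homogeneous. The shear stress of the samples was assessed on a rheometer (Anton Paar Physica Rheometer with ASC (automatic sample changer), Anton Paar GmbH, Austria) using the following settings: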
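- Wait time (to rebuild to somewhat original structure)
- 5 minutes without oscillation or rotation
- Rotation (to measure shear stress at 300 s⁻¹ etc.)
- γ̇ = [0.2707-300] s⁻¹ and γ̇ = [275-0.2707] s⁻¹ (γ̇ denotes the shear rate)
21 measurement points over 210 s (one every 10 s) ramping up to 300 s⁻¹, and 21 measurement points over 210 s (one every 10 s) ramping down to 0.2707 s⁻¹. For data analysis, the shear stress at a shear rate of 300 s⁻¹ was selected.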
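The data-analysis step, selecting the shear stress at 300 s⁻¹ from the up-ramp, can be sketched as follows. This is an illustration under assumptions: the rheometer export is taken to be a list of (shear rate, shear stress) pairs, the spacing of the 21 points is not assumed, and the point whose shear rate is closest to the target is used. The function name and the example values are hypothetical.

```python
def shear_stress_at(points, target_rate=300.0):
    """Return the shear stress (Pa) at the measurement point whose shear rate
    (s^-1) is closest to `target_rate`.

    `points` is a sequence of (shear_rate, shear_stress) pairs from one ramp.
    """
    if not points:
        raise ValueError("no measurement points given")
    _, stress = min(points, key=lambda p: abs(p[0] - target_rate))
    return stress


if __name__ == "__main__":
    # Hypothetical up-ramp excerpt; real ramps have 21 points (one every 10 s).
    up_ramp = [(0.2707, 2.1), (150.0, 31.5), (300.0, 47.2)]
    print(shear_stress_at(up_ramp))
```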
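The good texturing ability of the 11 strains DSM 33134, 33135, DSM 33136, 33137, 33138, 33139, 33140, 33141, 33142, 33183 and 33192 was confirmed using the rheometer as described above. In addition, the good texturing ability of strain DSM 25485 was also confirmed using the rheometer as described above. The results are shown in Table 1.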
Figure BDA0003566890230000971
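The following texturing strains were classified as "slow acidifiers" because, when grown alone (Table 1) and measured as described above, they needed more than 15 h to reach pH 4.55: DSM 33134, 33135, DSM 33136, 33138, 33139 and 33141. Yeast extract (YE, 0.2%) or a co-acidifying strain (DSM 24649 or DSM 25485) was added to the milk substrate to help the slow-acidifying strains grow in milk. The yeast extract was obtained from Procelys (yeast extract [trade name present only as image BDA0003566890230000981] 545MG, batch number 0005115910, lot AD 18A05030).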
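Yeast extract can increase the acidification rate, but when co-incubated with some of the strains under study, for example DSM 33135, DSM 33136 and 33138, it had a negative effect on texture development. In the presence of yeast extract, DSM 33138 lost one third of its texture (expressed as shear stress).
The purpose of adding a co-acidifying strain is to help the texturing strains acidify milk faster, i.e., to reach a pH of about 4.55 in 15 h or less, without adversely affecting their texture, as is often seen in the presence of yeast extract. In addition, different polysaccharides produced by two or more different strains growing together may have a synergistic effect on texture. For example, as described above, DSM 33135 grown in the presence of 10% DSM 25485 showed both a higher acidification rate and improved texture: the shear stress of the combination of DSM 33135 and DSM 25485 was markedly higher than that of DSM 25485 or DSM 33135 alone. Likewise, the shear stress of the combination of DSM 33140 and DSM 25485 was markedly higher than that of DSM 25485 or DSM 33140 alone. DSM 33137 behaved similarly in the presence of DSM 25485 (Table 1). Since DSM 33134, 33135, DSM 33136, 33138, 33139 and 33141 are slow acidifiers when fermenting milk alone (requiring more than 15 h to reach a pH of 4.55, as described above), it is advantageous to test them in the presence of a co-acidifying strain, which helps to reduce the acidification time without adversely affecting the resulting viscosity, measured as shear stress, as shown in Table 1.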
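Example 2. Rheological measurement of soy milk texture
Rheological measurements were performed on 14 Lactococcus lactis strains in soy milk supplemented with 2% glucose.
The strains tested in this example were as follows: DSM 24649 (non-texturing), DSM 33134, DSM 33135, DSM 33136, DSM 33137, DSM 33138, DSM 33139, DSM 33140, DSM 33141, DSM 33142, DSM 33183, DSM 33192 and DSM 25485.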
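The milk base used was soy milk supplemented with 2% glucose. The soy milk, organic and unsweetened, was obtained from Naturli' Foods and had the following composition per 100 ml:
Fat: 2.1 g
- of which saturated fat: 0.4 g
Carbohydrates: 0.1 g
- of which sugars: 0.1 g
Fiber: 0.6 g
Protein: 3.7 g
Salt: 0.04 g.
The milk was already sterile and was not pretreated before use. It was supplemented with 2% glucose.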
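A 1% volume of an overnight microbial culture (obtained by inoculating the microbial culture in M17 broth supplemented with 2% glucose at 30°C) was inoculated into soy milk containing 2% glucose. Incubation was carried out at 30°C on a 200 ml scale until the pH was about 4.55 (for the specific pH reached by each culture, see Table 3), followed by cooling to 4°C and storage until the shear stress was measured, typically for 1-7 days, for example 5 days. After storage, the fermented milk was gently stirred with a rod fitted with a perforated disc until the sample was homogeneous. The shear stress of the samples was assessed on a rheometer (Anton Paar Physica Rheometer with ASC (automatic sample changer), Anton Paar GmbH, Austria) using the following settings: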
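- Wait time (to rebuild to somewhat original structure)
- 5 minutes without oscillation or rotation
- Rotation (to measure shear stress at 300 s⁻¹ etc.)
- γ̇ = [0.2707-300] s⁻¹ and γ̇ = [275-0.2707] s⁻¹ (γ̇ denotes the shear rate)
21 measurement points over 210 s (one every 10 s) ramping up to 300 s⁻¹, and 21 measurement points over 210 s (one every 10 s) ramping down to 0.2707 s⁻¹. For data analysis, the shear stress at a shear rate of 300 s⁻¹ was selected.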
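The shear stress results (Pa) are shown in Table 2 below. "Alpro" refers to "Alpro naturell mild&creamy plantgurt", a commercially available fermented soy milk from "Alpro" (https://www.alpro.com/se/produkter/vaxtbaserad-yoghurt-variant/mild-creamy/mild-creamy-naturell/), with the following composition per 100 ml: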
Figure BDA0003566890230001001
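It contains the following ingredients: water, hulled SOYBEANS (7.9%), sugar, tricalcium citrate, stabilizer (pectin), acidity regulators (sodium citrate, citric acid), sea salt, antioxidants (tocopherol-rich extract, ascorbyl esters of fatty acids), vitamins (B12, D2), yogurt cultures (S. thermophilus, L. bulgaricus). Notably, Alpro contains pectin, which increases the texture of fermented milk. The base milk used in this example (soy milk supplemented with 2% glucose), however, does not contain pectin.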
Figure BDA0003566890230001011
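As can be seen from Table 2, all of the selected texturing strains gave a higher shear stress than the negative control (DSM 24649) when fermenting soy milk supplemented with 2% glucose.
Finally, the pH reached by each strain and the time (in h) needed to reach that pH are shown in Table 3.
Table 3. Time to reach pH for the selected texturing strains (1% inoculum) incubated in soy milk supplemented with 2% glucose, as described above.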
Figure BDA0003566890230001021
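Example 3. Genome sequencing of Lactococcus strains
The genomes of the strains were sequenced in-house at Chr. Hansen as described by Agersoe et al. (2018). Briefly, total DNA was purified and used to prepare 250 bp paired-end libraries for genome sequencing on the Illumina MiSeq system. Sequence reads were quality-trimmed (Phred score <25) and assembled into contigs using the de novo assembly algorithm in CLC Genomics Workbench version 10.1.1 (CLC bio, Qiagen Bioinformatics). The resulting genome assemblies were filtered by removing contigs with a coverage of <15X and/or <20% of the median coverage of the assembly. The consensus sequences of the remaining contigs were exported in FASTA format, referred to as draft genome sequences, and used for subsequent sequence analyses.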
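The contig filtering rule lends itself to a short sketch. The code below is not the CLC Genomics Workbench procedure; it only illustrates the stated thresholds, assuming that each contig carries a single mean coverage value, that the median is taken over all contigs unweighted by length, and that a contig is removed if it fails either threshold. The function name and the example contigs are hypothetical.

```python
from statistics import median


def filter_contigs(contigs, min_coverage=15.0, min_fraction_of_median=0.20):
    """Keep contigs with coverage >= 15X and >= 20% of the median coverage.

    `contigs` is a list of dicts with at least a "coverage" key.
    Assumption: the median is computed over contigs, not weighted by length.
    """
    if not contigs:
        return []
    med = median(c["coverage"] for c in contigs)
    return [
        c for c in contigs
        if c["coverage"] >= min_coverage
        and c["coverage"] >= min_fraction_of_median * med
    ]


if __name__ == "__main__":
    # Hypothetical assembly summary, for illustration only.
    assembly = [
        {"name": "c1", "coverage": 120.0},
        {"name": "c2", "coverage": 100.0},
        {"name": "c3", "coverage": 110.0},
        {"name": "c4", "coverage": 12.0},  # below 15X
        {"name": "c5", "coverage": 18.0},  # below 20% of the median (100)
    ]
    print([c["name"] for c in filter_contigs(assembly)])
```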
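Example 4. Characterization of the eps gene clusters of Lactococcus strains
Since the enhancement of texture is associated with polysaccharide production, the eps gene clusters were mined. Mobile genetic elements (IS elements, also referred to as transposases) flanking or located within the operon are consistently present in the structure of the eps gene clusters, but they are not involved in polysaccharide biosynthesis. Exopolysaccharide biosynthesis in Lactococcus lactis proceeds via the Wzy-dependent pathway. Here we use the nomenclature proposed by Zeidan et al. (2017). The conserved genes at the beginning of the eps gene cluster are named epsRXCDB, the conserved genes at the end are named epsL and lytR, the polymerase is named wzy and the flippase is named wzx.
The genes at the 5′ end of the eps gene cluster, epsRXCDB, which are involved in the regulation of polysaccharide biosynthesis and in the assembly machinery, and epsL and lytR at the 3′ end, show the highest level of conservation. The genes in the variable part, including the polymerase wzy, the flippase wzx and glycosyltransferases (GT) or other polymer-modifying enzymes, show little similarity between strains. What the texturing strains have in common is that they all contain the genes needed to produce polysaccharides, for example epsCDBE-wzy-wzx and GTs (Zeidan et al. 2017).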
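The eps gene cluster of DSM 33134 is 18254 bp long and contains 20 open reading frames (ORFs), corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 12 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include three glycosyltransferases (GT1, GT2, GT3), a nucleotide sugar dehydrogenase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. The three GTs, together with the putative nucleotide sugar dehydrogenase, are potentially involved in the sequential assembly of the repeating unit, although their specific functions, and hence their order of action, have not been established. Five IS elements are also part of the eps gene cluster of DSM 33134.
The eps gene cluster of DSM 33135 is 27444 bp long and contains 31 ORFs, corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 19 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include five glycosyltransferases (GT1, GT2, GT3, GT4, GT5), dTDP-glucose 4,6-dehydratase, dTDP-4-dehydrorhamnose reductase, dTDP-4-dehydrorhamnose 3,5-epimerase, DUF1919, DUF4422, UDP-galactopyranose mutase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Nine IS elements are also part of the eps gene cluster of DSM 33135. VanZ family proteins are commonly present in the eps gene clusters of Streptococcus thermophilus but not in Lactococcus lactis.
The eps gene cluster of DSM 33136 is 18365 bp long and contains 18 ORFs, corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 10 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include three glycosyltransferases (GT1, GT2, GT3), a polysaccharide pyruvyl transferase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. The three GTs, together with the putative polysaccharide pyruvyl transferase, are potentially involved in the sequential assembly of the repeating unit, although their specific functions, and hence their order of action, have not been established. Two IS elements are also part of the eps gene cluster of DSM 33136. VanZ family proteins are commonly present in the eps gene clusters of Streptococcus thermophilus but not in Lactococcus lactis.
The eps gene cluster of DSM 33137 is 20584 bp long and contains 21 ORFs, corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 13 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include four glycosyltransferases (GT1, GT2, GT3, GT4), a Cre-2/I-branching protein, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Five IS elements are also part of the eps gene cluster of DSM 33137. VanZ family proteins are commonly present in the eps gene clusters of Streptococcus thermophilus but not in Lactococcus lactis.
The eps gene cluster of DSM 33138 is 21315 bp long and contains 23 ORFs, corresponding to 7 conserved genes (epsRXCDBE, lytR) and 16 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include four glycosyltransferases (GT1, GT2, GT3, GT4), DUF1972, DUF4422, DUF1919, UDP-galactopyranose mutase, dTDP-4-dehydrorhamnose 3,5-epimerase, dTDP-glucose 4,6-dehydratase, dTDP-4-dehydrorhamnose reductase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Three IS elements are also part of the eps gene cluster of DSM 33138. VanZ family proteins are commonly present in the eps gene clusters of Streptococcus thermophilus but not in Lactococcus lactis.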
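The eps gene cluster of DSM 33139 is 27175 bp long and contains 29 ORFs, corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 21 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include six glycosyltransferases (GT1, GT2, GT3, GT4, GT5, GT6), two NAD-dependent epimerases, a nucleotide sugar dehydrogenase, RfbA, dTDP-glucose 4,6-dehydratase, dTDP-4-dehydrorhamnose 3,5-epimerase, two acyltransferases, dTDP-4-dehydrorhamnose reductase, a nucleotidyltransferase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. A VanZ family protein is also part of the eps gene cluster of DSM 33139; it is commonly present in the eps gene clusters of Streptococcus thermophilus but not in Lactococcus lactis.
The eps gene cluster of DSM 33140 is 18226 bp long and contains 19 ORFs, corresponding to 8 conserved genes (epsRXCDBEL, lytR) and 11 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include three glycosyltransferases (GT1, GT2, GT3), UDP-N-acetylglucosamine--LPS N-acetylglucosamine transferase, capsule biosynthesis protein CapC, the polymerase wzy, a second polymerase-like sequence wzy1 (which may be too short to be functional) and the flippase (polysaccharide transporter) wzx. Three IS elements are also part of the eps gene cluster of DSM 33140.
The eps gene cluster of DSM 33141 is 24364 bp long and contains 25 ORFs, corresponding to 9 conserved genes (epsRXCDBE1E2L, lytR) and 16 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include six glycosyltransferases (GT1, GT2, GT3, GT4, GT5, GT6), an acetyltransferase, a nucleotide sugar dehydrogenase, an acyltransferase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Two IS elements are also part of the eps gene cluster of DSM 33141.
The eps gene cluster of DSM 33142 is 16953 bp long and contains 19 ORFs, corresponding to 9 conserved genes (epsRXCDBE1E2L, lytR) and 10 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include three glycosyltransferases (GT1, GT2, GT3), a nucleotide sugar dehydrogenase, an acetyltransferase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Three IS elements are also part of the eps gene cluster of DSM 33142.
The eps gene cluster of DSM 33183 is 16476 bp long and contains 19 ORFs, corresponding to 8 conserved genes (epsRCDBE1E2L, lytR) and 10 genes of the variable part. Genes of the variable part that may be involved in polysaccharide biosynthesis include three glycosyltransferases (GT1, GT2, GT3), a nucleotide sugar dehydrogenase, an acetyltransferase, the polymerase wzy and the flippase (polysaccharide transporter) wzx. Three IS elements are also part of the eps gene cluster of DSM 33183.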
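BLAST analysis was used to annotate the open reading frames (ORFs) of strains from the CHCC culture collection. Comparative analysis of the eps gene clusters was performed using the cd-hit tool (http://weizhongli-lab.org/cd-hit/). The % identity of sequences was calculated using the percent identity matrix of Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/).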
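To make the notion of "% identity" concrete, the sketch below computes a pairwise percent identity from two already aligned sequences. It is a simplified illustration and is not claimed to reproduce Clustal Omega's percent identity matrix: it assumes gaps are written as "-" and that only columns where both sequences have a residue are counted. The function name and the example alignment are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # assumption: gapped columns are excluded from the count
        compared += 1
        if x.upper() == y.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Hypothetical aligned protein fragments, for illustration only.
    print(round(percent_identity("MDDLFYHR", "MDELFY-R"), 1))
```

Other conventions (for example dividing by the full alignment length or by the shorter sequence) give different values, so the denominator should always be reported together with the identity figure.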
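The novel texturing strains have eps gene clusters that are not similar to eps gene clusters described in the literature or available on the NCBI website. Moreover, these eps gene clusters are unique compared with those of known texturing strains from the Chr. Hansen culture collection (Figure 1).
Deposits and expert solution
The applicant requests that, until the date on which the patent is granted, a sample of the deposited microorganisms stated below may only be made available to an expert. In particular, the applicant requests that the availability of the deposited microorganisms referred to in Rule 33 EPC shall be effected only by the issue of a sample to an independent expert nominated by the requester (Rule 32(1) EPC).
Table 4: Deposits made by the applicant CHR. HANSEN A/S under the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure at a depositary institution having acquired the status of international depositary authority: Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Inhoffenstr. 7B, 38124 Braunschweig, Germany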
Figure BDA0003566890230001061
Figure BDA0003566890230001071
References
1. Zeidan et al., 2017, Polysaccharide production by lactic acid bacteria: from genes to industrial applications, FEMS Microbiol Rev., 41:168-200.
2. Bentley et al., 2006, Genetic analysis of the capsular biosynthetic locus from all 90 pneumococcal serotypes, PLoS Genet., 2:e31.
3. Dabour and LaPointe, 2005, Identification and Molecular Characterization of the Chromosomal Exopolysaccharide Biosynthesis Gene Cluster from Lactococcus lactis subsp. cremoris SMQ-461, Appl Environ Microbiol., 71:7414-7425.
4. Kleerebezem M. et al., 2002, Metabolic engineering of Lactococcus lactis: the impact of genomics and metabolic modelling, Journal of Biotechnology, 98:199-213.
5. Nierop Groot M.N., Kleerebezem M., 2007, Mutational analysis of the Lactococcus lactis NIZO B40 exopolysaccharide (EPS) gene cluster: EPS biosynthesis correlates with unphosphorylated EpsB, J Appl Microbiol., 103:2645-2656.
6. Pan D., Mei X., 2010, Antioxidant activity of an exopolysaccharide purified from Lactococcus lactis subsp. lactis 12, Carbohydrate Polymers, 80:908-914.
7. Poulsen et al., 2019, High-throughput screening for texturing Lactococcus strains, FEMS Microbiol Lett., 366(2).
8. Suzuki C. et al., 2013, Novel exopolysaccharides produced by Lactococcus lactis subsp. lactis, and the diversity of epsE genes in the exopolysaccharide biosynthesis gene clusters, Biosci Biotechnol Biochem., 77:2013-2018.
9. Tangyu et al., 2019, Fermentation of plant-based milk alternatives for improved flavour and nutritional value, Applied Microbiology and Biotechnology, 103:9263-9275.
10. van Kranenburg R. et al., 1999, Functional analysis of glycosyltransferase genes from Lactococcus lactis and other gram-positive cocci: complementation, expression, and diversity, J Bacteriol., 181:6347-6353.
11. Whittall J.J. et al., 2015, Topology of Streptococcus pneumoniae CpsC, a polysaccharide co-polymerase and BY-kinase adaptor protein, J Bacteriol., 197:120-127.
12. Agersoe Y. et al., 2018, Antimicrobial susceptibility testing and tentative epidemiological cutoff values for five Bacillus species relevant for use as animal feed additives or for plant protection, Appl Environ Microbiol., 84(19).
Sequence listing
<110> Chr. Hansen A/S
<120> Texturing Lactococcus lactis with a unique eps gene cluster
<130> P6639
<150> EP19193295
<151> 2019-08-23
<150> EP19193299
<151> 2019-08-23
<150> EP19193303
<151> 2019-08-23
<150> EP19193305
<151> 2019-08-23
<150> EP19193307
<151> 2019-08-23
<150> EP19193308
<151> 2019-08-23
<150> EP19193310
<151> 2019-08-23
<150> EP19193312
<151> 2019-08-23
<150> EP19193313
<151> 2019-08-23
<150> EP19193315
<151> 2019-08-23
<150> EP19193316
<151> 2019-08-23
<160> 294
<170> BiSSAP 1.3.6
<210> 1
<211> 318
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of epsR of DSM 33134
<400> 1
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataa 318
<210> 2
<211> 105
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsR
<400> 2
Met Asp Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 3
<211> 768
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsX gene of DSM 33134
<400> 3
atgatgaaaa aaggaatttt tgtaattact atagtgatat ctatagcatt tataattgga 60
ggtttttata gttataattc taggataaat aatctttcaa aagctgataa aggaaaagaa 120
gttgtaaaaa atagcagtga aaaaaatcag atagacctta cctataaaaa gtattataaa 180
aatttaccaa aatcagttca aaataaaata gatgatattt catccaaaaa taaagaagtt 240
actttaactt gtatttggca atctgattca gttatttctg aacaatttca acaaaactta 300
caaaaatatt atggaaataa gttttggaac atcaaaaata tcacttacaa tggcgaaact 360
agtgaacaat tattggctga aaaagttgaa aaccaagtat tagccactaa tcctgatgtt 420
gttttatatg aagctccact ttttaatgat aaccaaaaca ttgaagcaac agcctcactg 480
actagtaatg agcaacttat aacaaatttg gctagtgcag gagcggaggt aatagttcaa 540
ccctctccac cgatctatgg tggtgttgtg taccccgtac aagaagaaca atttaaacaa 600
tctttatcta caaagtatcc ctatatagac tactgggcta gttacccaga caaaaattct 660
gatgaaatga aggggctgtt ttctgatgat ggagtatata gaacattaaa tgcttcgggg 720
aataaggttt ggctagatta tattactaaa tattttacag caaactaa 768
<210> 4
<211> 255
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsX
<400> 4
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Phe Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Leu
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Ala Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 5
<211> 765
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsB gene of DSM 33134
<400> 5
atgattgata ttcattgcca tattttaccg gggatagatg atggagctaa aacttctgga 60
gatactctga caatgctgaa atcagcaatt gatgaaggga taacaaccat cactgccact 120
cctcatcata atcctcaatt taataatgaa tcaccgctta ttttgaagaa agttaaggaa 180
gttcaaaata tcattgacga gcatcaatta ccaattgaag ttttacccgg acaagaggtg 240
agaatatatg gtgatttatt aaaagaattt tctgaaggaa agttactgac agcagcgggc 300
acttcaagtt atatattgat tgaatttcca tcaaatcatg tgccagctta tgctaaagaa 360
cttttttata atattcaatt ggagggactt caacctattt tggtccaccc tgagcgtaat 420
agtggaatca ttgagaaccc tgatatatta tttgatttta ttgaacaagg agtactaagt 480
cagataacag cttcaagtgt cactggtcat tttggtaaaa aaatacaaaa gctgtcattt 540
aaaatgatag aaaaccatct gacgcatttt gttgcatcag atgcgcataa tgtgacgtca 600
cgtgcattta agatgaggga agcatttgaa atgattgaag atagttatgg ttctggtgta 660
tcacgaatgt ttcaaaataa tgcagagtca gtgattttaa acgaaagttt ttatcaagaa 720
aaaccaacaa agatcaaaac aaagaaattt ttaggattat tttaa 765
<210> 6
<211> 254
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsB
<400> 6
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Arg Glu Ala
195 200 205
Phe Glu Met Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 7
<211> 696
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsD gene of DSM 33134
<400> 7
atggctaaaa ataaaagaag catagacaat aatcgttata ttattaccag tgtcaatcct 60
caatcaccta tttccgaaca atatcgtacg attcgtacga ccattgattt taaaatggcg 120
gatcaaggga ttaaaagttt tctagtaaca tcttcagaag cagctgaagg taaatcaaac 180
gagagtgcta atctagctgt tgcttttgca caacaaggta aaaaagtact tttaattgat 240
ggcgatcttc gtaaaccgac tgttaacatt acttttaaag tacaaaatag agtaggatta 300
accaatattt taatgcatca atcttcgatt gaagatgcca tacaagggac aagactttct 360
gaaaatctta caataattac ctctggtcca attccaccta atccatcgga attattagca 420
tctagtgcaa tgaagaagtt gattgactct gtgtccgatt cctttgatgt tgttttgatt 480
gatactccac ctctctatgc agttactgat gctcaaattt tgagtgttta tgtaggagga 540
gtggttcttg ttgtacgtgc ctatgaaaca aaaaaagaga gtttagcaaa aacaaaaaaa 600
atactggaac aagttaatgc aaatatatta ggagttgttt tgcatggggt agactcttct 660
gagtcaccgt cgtattacta ctacggagta gagtaa 696
<210> 8
<211> 231
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsD
<400> 8
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Glu Gly Lys Ser Asn Glu Ser Ala Asn
50 55 60
Leu Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Lys Leu Ile Asp Ser Val Ser Asp Ser Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Tyr Ala Val Thr Asp Ala Gln Ile Leu Ser Val
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 9
<211> 780
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF encoding the putative GT1 protein of DSM 33134
<400> 9
gtgaagttta tgaaaaaaga aaaaatttca attattacac ctgtatataa ttgtgaaaaa 60
cttattgaaa aaaccattga atgtgttttg aatcaaacat ataaaaattg ggagtggcta 120
cttgttgatg attgttcacc agacaactct gccataataa taaaaaagta tgctaaaaat 180
gataatagaa ttaaatattt taaattaagt gaaaatagtg gtgctgccgt ttctagaaat 240
aaagcattgg cagaatctac tggtagattt gtagcttatt tggatgcgga tgatttatgg 300
aaaaatgata aattggagaa acaagtaaaa tttatgttag aaaatcaata ttcatttact 360
tgtacggact atgaaaaaat tacggaaaca ggtaatagtt taaataaaat tattaaaata 420
ccaaaaaaag tagattataa tttcttttta agaaatacaa taattcaaac tgttggagtg 480
atggtagata caaaattaac agggaaagaa ttattgaaga tgcctaatat tagacgaaga 540
caagatgctg caacatggtg tcaactttta aaaaatggac acgattgtta tgagtgtcca 600
gagaatttgt cttattatag agtagtaaca aattctttat caagtaataa atttaaagca 660
ataaaaatga attggtactg gtatagaaag atagaaaaat tacctttatg gaaagcatgt 720
tattgcttta ttggatatgc ttttaatggt gtaagaaaaa gaatatatat aaaaaggtaa 780
<210> 10
<211> 259
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_GT1
<400> 10
Val Lys Phe Met Lys Lys Glu Lys Ile Ser Ile Ile Thr Pro Val Tyr
1 5 10 15
Asn Cys Glu Lys Leu Ile Glu Lys Thr Ile Glu Cys Val Leu Asn Gln
20 25 30
Thr Tyr Lys Asn Trp Glu Trp Leu Leu Val Asp Asp Cys Ser Pro Asp
35 40 45
Asn Ser Ala Ile Ile Ile Lys Lys Tyr Ala Lys Asn Asp Asn Arg Ile
50 55 60
Lys Tyr Phe Lys Leu Ser Glu Asn Ser Gly Ala Ala Val Ser Arg Asn
65 70 75 80
Lys Ala Leu Ala Glu Ser Thr Gly Arg Phe Val Ala Tyr Leu Asp Ala
85 90 95
Asp Asp Leu Trp Lys Asn Asp Lys Leu Glu Lys Gln Val Lys Phe Met
100 105 110
Leu Glu Asn Gln Tyr Ser Phe Thr Cys Thr Asp Tyr Glu Lys Ile Thr
115 120 125
Glu Thr Gly Asn Ser Leu Asn Lys Ile Ile Lys Ile Pro Lys Lys Val
130 135 140
Asp Tyr Asn Phe Phe Leu Arg Asn Thr Ile Ile Gln Thr Val Gly Val
145 150 155 160
Met Val Asp Thr Lys Leu Thr Gly Lys Glu Leu Leu Lys Met Pro Asn
165 170 175
Ile Arg Arg Arg Gln Asp Ala Ala Thr Trp Cys Gln Leu Leu Lys Asn
180 185 190
Gly His Asp Cys Tyr Glu Cys Pro Glu Asn Leu Ser Tyr Tyr Arg Val
195 200 205
Val Thr Asn Ser Leu Ser Ser Asn Lys Phe Lys Ala Ile Lys Met Asn
210 215 220
Trp Tyr Trp Tyr Arg Lys Ile Glu Lys Leu Pro Leu Trp Lys Ala Cys
225 230 235 240
Tyr Cys Phe Ile Gly Tyr Ala Phe Asn Gly Val Arg Lys Arg Ile Tyr
245 250 255
Ile Lys Arg
<210> 11
<211> 1287
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the putative wzy gene of DSM 33134
<400> 11
ttgaaaggaa ttaataaaat gataaaaaga aatcaattaa atttattttt gcagtatata 60
cttgcttttc taattatttt agaaactaga agtgtatatt ctcgttgtgt tgttggtcat 120
atagatgaaa taataatagg tggtataata ttaacaataa ttttaattat tttagttaac 180
atgaattata aattaaaaac taaatcattg ttttttttac ttttttatta tctatatatg 240
tttatatttt taattataaa tgttgatgga tataatgtaa actttgttat aatatttatg 300
attttatttc cacttgtatt tttaatgctc aatttatatg atagtaaaga aataaaaaat 360
ctgtttaaag catatgtaaa tattatggtt atcatttctg caatatcctt atttttttac 420
atattcggat ccttaactgg tatggtttct actaatatta tacaagaaat aaattggggt 480
ggagattatg gaggtaataa aataataaat ggatattttg gattgcattt taatacacaa 540
acaactgtaa tatttggtga tgccatttta agaaatacaa gtatatttgt tgaaggaccg 600
atgtttgcat tacatctctt atttgcaatg gcattatcac tatttatgaa taaaaaatta 660
ataaataaat attcaataat atttggcttt tcaatattat catcgctatc tataactgct 720
atattatttt atatgttttt attgttttac aagtatacat tttataataa aagcaaaacg 780
aagattattt tgttgccaat attgtttttc atatttttaa tcataggaac tacttttttt 840
aatgataagc aatcaacaaa ttcctataat ataagaaatg acgattatac agcaagtttt 900
agagtattta atgattatcc aatttttgga agtggatttg ctaataataa catagtaatt 960
aaatatatgt caacatttag attgtataat acaggtttag ctaactcatt tgtagtctta 1020
ttagtccaag gaggattata tttagtaatt ttttatttac tgccagttat tttgaacagt 1080
ttaaatttaa ttaaatccaa aaataaaaat ttgtttttga tagaaattat gcttttacaa 1140
ctttatttat tttttatgaa tgcttatcaa tatacatcat tgatgataat ctttctagca 1200
tttgattatt atattttatt attttataac aaaaataatg atttaaaaaa tttgcaactt 1260
aaggaggaaa catatgaaaa aatatga 1287
<210> 12
<211> 428
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_wzy
<400> 12
Leu Lys Gly Ile Asn Lys Met Ile Lys Arg Asn Gln Leu Asn Leu Phe
1 5 10 15
Leu Gln Tyr Ile Leu Ala Phe Leu Ile Ile Leu Glu Thr Arg Ser Val
20 25 30
Tyr Ser Arg Cys Val Val Gly His Ile Asp Glu Ile Ile Ile Gly Gly
35 40 45
Ile Ile Leu Thr Ile Ile Leu Ile Ile Leu Val Asn Met Asn Tyr Lys
50 55 60
Leu Lys Thr Lys Ser Leu Phe Phe Leu Leu Phe Tyr Tyr Leu Tyr Met
65 70 75 80
Phe Ile Phe Leu Ile Ile Asn Val Asp Gly Tyr Asn Val Asn Phe Val
85 90 95
Ile Ile Phe Met Ile Leu Phe Pro Leu Val Phe Leu Met Leu Asn Leu
100 105 110
Tyr Asp Ser Lys Glu Ile Lys Asn Leu Phe Lys Ala Tyr Val Asn Ile
115 120 125
Met Val Ile Ile Ser Ala Ile Ser Leu Phe Phe Tyr Ile Phe Gly Ser
130 135 140
Leu Thr Gly Met Val Ser Thr Asn Ile Ile Gln Glu Ile Asn Trp Gly
145 150 155 160
Gly Asp Tyr Gly Gly Asn Lys Ile Ile Asn Gly Tyr Phe Gly Leu His
165 170 175
Phe Asn Thr Gln Thr Thr Val Ile Phe Gly Asp Ala Ile Leu Arg Asn
180 185 190
Thr Ser Ile Phe Val Glu Gly Pro Met Phe Ala Leu His Leu Leu Phe
195 200 205
Ala Met Ala Leu Ser Leu Phe Met Asn Lys Lys Leu Ile Asn Lys Tyr
210 215 220
Ser Ile Ile Phe Gly Phe Ser Ile Leu Ser Ser Leu Ser Ile Thr Ala
225 230 235 240
Ile Leu Phe Tyr Met Phe Leu Leu Phe Tyr Lys Tyr Thr Phe Tyr Asn
245 250 255
Lys Ser Lys Thr Lys Ile Ile Leu Leu Pro Ile Leu Phe Phe Ile Phe
260 265 270
Leu Ile Ile Gly Thr Thr Phe Phe Asn Asp Lys Gln Ser Thr Asn Ser
275 280 285
Tyr Asn Ile Arg Asn Asp Asp Tyr Thr Ala Ser Phe Arg Val Phe Asn
290 295 300
Asp Tyr Pro Ile Phe Gly Ser Gly Phe Ala Asn Asn Asn Ile Val Ile
305 310 315 320
Lys Tyr Met Ser Thr Phe Arg Leu Tyr Asn Thr Gly Leu Ala Asn Ser
325 330 335
Phe Val Val Leu Leu Val Gln Gly Gly Leu Tyr Leu Val Ile Phe Tyr
340 345 350
Leu Leu Pro Val Ile Leu Asn Ser Leu Asn Leu Ile Lys Ser Lys Asn
355 360 365
Lys Asn Leu Phe Leu Ile Glu Ile Met Leu Leu Gln Leu Tyr Leu Phe
370 375 380
Phe Met Asn Ala Tyr Gln Tyr Thr Ser Leu Met Ile Ile Phe Leu Ala
385 390 395 400
Phe Asp Tyr Tyr Ile Leu Leu Phe Tyr Asn Lys Asn Asn Asp Leu Lys
405 410 415
Asn Leu Gln Leu Lys Glu Glu Thr Tyr Glu Lys Ile
420 425
<210> 13
<211> 1053
<212> DNA
<213> 乳酸乳球菌乳酸亚种
<220>
<223> DSM 33134的编码推定的GT2蛋白的ORF
<400> 13
gtgcaaagga atgataaaag aatgaaaaaa attttatata tagctacaac tgccgatagt 60
agaaatagat tagatggtga aacaattaaa tgtagattat taagagaata tctaagagga 120
atagaaaatg ttgaacttat ttctgtagat actgataatt ggaaaaagca taaattaaaa 180
ttagtatttt taataatata taattttatt ttttgtaatt ctatcgttgt ttcatctgcg 240
gataaaggtg ctaatattgt cttagatttt tttagaaaaa ttaatactaa aaaaaatatt 300
tattattttg taattggtgg tacattatat aaaaatataa aagaaaaaaa ttggaatatt 360
gaaacatata aaagattaaa acatatttac gttgaggcaa atcaactgaa attagatttg 420
aactctttaa atattactaa tgttgatatc ttaaataatt ttagaaaagt aaataaattt 480
gaaaataaat ataaaaagag taaagaaata aaatttgttt actttggaag agttataaga 540
gaaaaaggtg tagaggaagc aataaaaatg attaacaggc ttaatgctga aaatattata 600
tgtacatttg atatatatgg gcaatgtaaa gatgaatatt tgcaacaaat acaagaaaag 660
tttaatgaaa acataagatt tcatggtgaa ataaaaccaa atggtaaaaa agaatatgaa 720
atattatcac aatatgatgt ttttctgttc ccaactgagt acccaggaga atgccttcca 780
ggagctttga ttgattgcta tatttctgga cttgcagtaa ttgcttcaaa ttggaaatat 840
gcaaaggaat atattttaga taatgaaaat ggaaaaatct ttgaatacaa agactataat 900
gatatgtata aaaaaacaaa agaaatggta gctgaaaatg ttattcaaaa atataaatta 960
aaatcagtag aattatcaaa aaaatataat atggatgtat tattaaatga ctttaaaaaa 1020
gaaataatgg aggaaaaaaa tgaaactttt taa 1053
<210> 14
<211> 350
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_GT2
<400> 14
Val Gln Arg Asn Asp Lys Arg Met Lys Lys Ile Leu Tyr Ile Ala Thr
1 5 10 15
Thr Ala Asp Ser Arg Asn Arg Leu Asp Gly Glu Thr Ile Lys Cys Arg
20 25 30
Leu Leu Arg Glu Tyr Leu Arg Gly Ile Glu Asn Val Glu Leu Ile Ser
35 40 45
Val Asp Thr Asp Asn Trp Lys Lys His Lys Leu Lys Leu Val Phe Leu
50 55 60
Ile Ile Tyr Asn Phe Ile Phe Cys Asn Ser Ile Val Val Ser Ser Ala
65 70 75 80
Asp Lys Gly Ala Asn Ile Val Leu Asp Phe Phe Arg Lys Ile Asn Thr
85 90 95
Lys Lys Asn Ile Tyr Tyr Phe Val Ile Gly Gly Thr Leu Tyr Lys Asn
100 105 110
Ile Lys Glu Lys Asn Trp Asn Ile Glu Thr Tyr Lys Arg Leu Lys His
115 120 125
Ile Tyr Val Glu Ala Asn Gln Leu Lys Leu Asp Leu Asn Ser Leu Asn
130 135 140
Ile Thr Asn Val Asp Ile Leu Asn Asn Phe Arg Lys Val Asn Lys Phe
145 150 155 160
Glu Asn Lys Tyr Lys Lys Ser Lys Glu Ile Lys Phe Val Tyr Phe Gly
165 170 175
Arg Val Ile Arg Glu Lys Gly Val Glu Glu Ala Ile Lys Met Ile Asn
180 185 190
Arg Leu Asn Ala Glu Asn Ile Ile Cys Thr Phe Asp Ile Tyr Gly Gln
195 200 205
Cys Lys Asp Glu Tyr Leu Gln Gln Ile Gln Glu Lys Phe Asn Glu Asn
210 215 220
Ile Arg Phe His Gly Glu Ile Lys Pro Asn Gly Lys Lys Glu Tyr Glu
225 230 235 240
Ile Leu Ser Gln Tyr Asp Val Phe Leu Phe Pro Thr Glu Tyr Pro Gly
245 250 255
Glu Cys Leu Pro Gly Ala Leu Ile Asp Cys Tyr Ile Ser Gly Leu Ala
260 265 270
Val Ile Ala Ser Asn Trp Lys Tyr Ala Lys Glu Tyr Ile Leu Asp Asn
275 280 285
Glu Asn Gly Lys Ile Phe Glu Tyr Lys Asp Tyr Asn Asp Met Tyr Lys
290 295 300
Lys Thr Lys Glu Met Val Ala Glu Asn Val Ile Gln Lys Tyr Lys Leu
305 310 315 320
Lys Ser Val Glu Leu Ser Lys Lys Tyr Asn Met Asp Val Leu Leu Asn
325 330 335
Asp Phe Lys Lys Glu Ile Met Glu Glu Lys Asn Glu Thr Phe
340 345 350
<210> 15
<211> 918
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF encoding the putative GT3 protein of DSM 33134
<400> 15
ttgaaaaata taggaatagt tctcgtaaca tataatagat tggagaaatt aaaaattgca 60
ttatcatgtt acgaaaaaca aaaaacaaaa attgatacta tgataattgt taataattgc 120
agtacagatg ggacatttga atttttagaa gaatatagta aaaggaagtt aaaatataaa 180
attgtaattt taaatatgcc aaaaaatctt ggtggtgcag gaggtttctt tgaaggaatg 240
aaatgtgcaa tgaaagaaga tttagagtgg gtttatattt cagatgatga tgcttatcca 300
aatgacaata ctatatatga gcttgaaaaa atttactcaa aattacaaaa taaagatgaa 360
attgttgcat tgtgcagtgt tgttgaaaat aaaaatggtt tagattatgg gcatagatta 420
agaataatga aaaatctctt ttttgtgaag tggaaaccag tagataggag tgaatataat 480
aaagattatt ttaatgttga cattctatcg tatgtaggat cacttattaa tgttaatgca 540
ttatattgtg caggtttaga cagaaaagat ttctttatat atcatgatga tcaagaacat 600
tcattgagat taggcaaaaa tggaaaaata ttaacttgca ctaaaagtgt aatacatcat 660
gataccgaag taaaaaaata caaggaatta ttttggggaa attactatga tactcgaaat 720
aggcttttaa tgataaaata taattttcct ttaagatatt tctatataag atactattta 780
ggatatataa gggattgttt attatgcaaa aacaaaataa aaaaagaaat gttaaaagtt 840
gcatatatgg atgctaaaaa taataaatta gggttgaatt caatatataa acctggttgg 900
gtagctaaaa ataaataa 918
<210> 16
<211> 305
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_GT3
<400> 16
Leu Lys Asn Ile Gly Ile Val Leu Val Thr Tyr Asn Arg Leu Glu Lys
1 5 10 15
Leu Lys Ile Ala Leu Ser Cys Tyr Glu Lys Gln Lys Thr Lys Ile Asp
20 25 30
Thr Met Ile Ile Val Asn Asn Cys Ser Thr Asp Gly Thr Phe Glu Phe
35 40 45
Leu Glu Glu Tyr Ser Lys Arg Lys Leu Lys Tyr Lys Ile Val Ile Leu
50 55 60
Asn Met Pro Lys Asn Leu Gly Gly Ala Gly Gly Phe Phe Glu Gly Met
65 70 75 80
Lys Cys Ala Met Lys Glu Asp Leu Glu Trp Val Tyr Ile Ser Asp Asp
85 90 95
Asp Ala Tyr Pro Asn Asp Asn Thr Ile Tyr Glu Leu Glu Lys Ile Tyr
100 105 110
Ser Lys Leu Gln Asn Lys Asp Glu Ile Val Ala Leu Cys Ser Val Val
115 120 125
Glu Asn Lys Asn Gly Leu Asp Tyr Gly His Arg Leu Arg Ile Met Lys
130 135 140
Asn Leu Phe Phe Val Lys Trp Lys Pro Val Asp Arg Ser Glu Tyr Asn
145 150 155 160
Lys Asp Tyr Phe Asn Val Asp Ile Leu Ser Tyr Val Gly Ser Leu Ile
165 170 175
Asn Val Asn Ala Leu Tyr Cys Ala Gly Leu Asp Arg Lys Asp Phe Phe
180 185 190
Ile Tyr His Asp Asp Gln Glu His Ser Leu Arg Leu Gly Lys Asn Gly
195 200 205
Lys Ile Leu Thr Cys Thr Lys Ser Val Ile His His Asp Thr Glu Val
210 215 220
Lys Lys Tyr Lys Glu Leu Phe Trp Gly Asn Tyr Tyr Asp Thr Arg Asn
225 230 235 240
Arg Leu Leu Met Ile Lys Tyr Asn Phe Pro Leu Arg Tyr Phe Tyr Ile
245 250 255
Arg Tyr Tyr Leu Gly Tyr Ile Arg Asp Cys Leu Leu Cys Lys Asn Lys
260 265 270
Ile Lys Lys Glu Met Leu Lys Val Ala Tyr Met Asp Ala Lys Asn Asn
275 280 285
Lys Leu Gly Leu Asn Ser Ile Tyr Lys Pro Gly Trp Val Ala Lys Asn
290 295 300
Lys
305
<210> 17
<211> 1263
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the putative wzx gene of DSM 33134
<400> 17
ttgctcacac atgagtatca gaatatctat atttggcaaa gtttactgat tttggcaagt 60
ctttttgata tttcatggta ctttatggga cgagaaaagt tcaaagtcac tgtaaccaga 120
aatttcatca taaaaatttt aaccgttatt tctatttttg tttttgtaag aaatcataat 180
gatttaccaa tctatgttgc aatcatgggg attggaagtt tactaggaag cttatcttta 240
tggccctatc tcagaaatga aattaataag ccaaatctaa gatacctcaa cttaaaaaaa 300
catttacatt acacagtcat cttatttatc ccaacaatcg ctacccagat ttatctcata 360
gcaaataaat ccatgattgg acttatggat tctgtcactc atgccggatt ttaccaacaa 420
gcagacacaa taataaagat ggcattatcc gtaattggaa ccataggtgt cgtcatgtta 480
cctcgcattg caagtatgca ctcagaagga aacatgaaag taatcagagc atcgatcgta 540
aaaacattta atatcgcaac agggatttca tttggtatct tttttggaat tctagggatt 600
gcactacact ttgcaccatt cttttttgga aaatcctttg agatggtcgg agtgattatg 660
atgctagaag cccccatcat tatctttatt ccaatgagta atgtatttgg tattcaatat 720
ctccttccac taaatagaat gagagctttt accttatcag taacctttgg tgcattatta 780
aatatcataa taaattttgc tttgataccg ttacttggcg tgatcggagc aacggtagca 840
actgtggtat ctgaatttgc agttacagct taccaatatt tatcaatcag aaaagagttt 900
tcattcagtg atttatttgg tggactttgg aagtatttta tctcaggctc attgatgttt 960
gtcgtagttt tttggatgaa tcaatcattt aaaatgacaa taattcagct gatactccaa 1020
attatcttag gtgtactcat ctatactctt tctaatatct tattaaagac acagctatgg 1080
cttatggcct cagaactttt aggaaaaatg aaaaatcggg tatcaagaaa tcatatacgt 1140
atagatcaaa aacaagaaac tctcgaacat ccattagata caattaaagc ttcgtttgat 1200
caatttgcta tcctctttca agaaatcgat gaaaaaaaat attatctcat aagaactttt 1260
tga 1263
<210> 18
<211> 420
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_wzx
<400> 18
Leu Leu Thr His Glu Tyr Gln Asn Ile Tyr Ile Trp Gln Ser Leu Leu
1 5 10 15
Ile Leu Ala Ser Leu Phe Asp Ile Ser Trp Tyr Phe Met Gly Arg Glu
20 25 30
Lys Phe Lys Val Thr Val Thr Arg Asn Phe Ile Ile Lys Ile Leu Thr
35 40 45
Val Ile Ser Ile Phe Val Phe Val Arg Asn His Asn Asp Leu Pro Ile
50 55 60
Tyr Val Ala Ile Met Gly Ile Gly Ser Leu Leu Gly Ser Leu Ser Leu
65 70 75 80
Trp Pro Tyr Leu Arg Asn Glu Ile Asn Lys Pro Asn Leu Arg Tyr Leu
85 90 95
Asn Leu Lys Lys His Leu His Tyr Thr Val Ile Leu Phe Ile Pro Thr
100 105 110
Ile Ala Thr Gln Ile Tyr Leu Ile Ala Asn Lys Ser Met Ile Gly Leu
115 120 125
Met Asp Ser Val Thr His Ala Gly Phe Tyr Gln Gln Ala Asp Thr Ile
130 135 140
Ile Lys Met Ala Leu Ser Val Ile Gly Thr Ile Gly Val Val Met Leu
145 150 155 160
Pro Arg Ile Ala Ser Met His Ser Glu Gly Asn Met Lys Val Ile Arg
165 170 175
Ala Ser Ile Val Lys Thr Phe Asn Ile Ala Thr Gly Ile Ser Phe Gly
180 185 190
Ile Phe Phe Gly Ile Leu Gly Ile Ala Leu His Phe Ala Pro Phe Phe
195 200 205
Phe Gly Lys Ser Phe Glu Met Val Gly Val Ile Met Met Leu Glu Ala
210 215 220
Pro Ile Ile Ile Phe Ile Pro Met Ser Asn Val Phe Gly Ile Gln Tyr
225 230 235 240
Leu Leu Pro Leu Asn Arg Met Arg Ala Phe Thr Leu Ser Val Thr Phe
245 250 255
Gly Ala Leu Leu Asn Ile Ile Ile Asn Phe Ala Leu Ile Pro Leu Leu
260 265 270
Gly Val Ile Gly Ala Thr Val Ala Thr Val Val Ser Glu Phe Ala Val
275 280 285
Thr Ala Tyr Gln Tyr Leu Ser Ile Arg Lys Glu Phe Ser Phe Ser Asp
290 295 300
Leu Phe Gly Gly Leu Trp Lys Tyr Phe Ile Ser Gly Ser Leu Met Phe
305 310 315 320
Val Val Val Phe Trp Met Asn Gln Ser Phe Lys Met Thr Ile Ile Gln
325 330 335
Leu Ile Leu Gln Ile Ile Leu Gly Val Leu Ile Tyr Thr Leu Ser Asn
340 345 350
Ile Leu Leu Lys Thr Gln Leu Trp Leu Met Ala Ser Glu Leu Leu Gly
355 360 365
Lys Met Lys Asn Arg Val Ser Arg Asn His Ile Arg Ile Asp Gln Lys
370 375 380
Gln Glu Thr Leu Glu His Pro Leu Asp Thr Ile Lys Ala Ser Phe Asp
385 390 395 400
Gln Phe Ala Ile Leu Phe Gln Glu Ile Asp Glu Lys Lys Tyr Tyr Leu
405 410 415
Ile Arg Thr Phe
420
<210> 19
<211> 969
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsL gene of DSM 33134
<400> 19
atggttcaga aaagagctgg gcgaataact ttatcaagta aaaaaacaag gaatagcaag 60
aaaggaaaag acatggagca gaaaaagaaa aagaatattt ggctgataat tgtacctatc 120
ttaataataa tttcccttat aggagcaggg gcttatgcct taatagattc acttattcct 180
actgatcata cgaaaacaaa cagttcggat caaccgacca aaacttcggt ttctaatggt 240
tatatagagc aaaaaggtga agaagctgct gtgggtagta tagcacttgt agatgatgct 300
ggtgtatcgg aatgggttaa ggttccctcg aaggcaaatc tagataaatt tactgattta 360
tctacgaata atatcactat ttatcgaatt aacaatccgg aagtcttaaa aacagttacc 420
aatcgtacgg atcaacggat gaaaatgtca gaagttatag ctaagtatca taatgctttg 480
attatgaatg cttccgcttt tgatatgcag acaggacaag tagctggatt tcaaattaat 540
aatggaaagt tgattcaaga ctggagtcca ggtacaacga ctcaatatgc ttttgttgtt 600
aacaaagatg gttcgtgcaa aatatatgat tcaagtacac ctgcttcaac tattattaaa 660
aacggagggc aacaagccta tgattttggt actgcaatta tccgtgatgg taaaattcaa 720
ccaagtgatg gctcagtaga ttggaagatc catattttta ttgcgaatga taaagataat 780
aatctctatg ctattttgag tgatacaaat gcaggttatg ataatataat gaagtcagtg 840
tcaaatttga agctccaaaa tatgttatta cttgatagtg gtggctcaag tcaactatct 900
gtcaatggta aaacgattgt tgctagtcaa gacgatcgag ccgtaccgga ttatattgtg 960
atgaaataa 969
<210> 20
<211> 322
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsL
<400> 20
Met Val Gln Lys Arg Ala Gly Arg Ile Thr Leu Ser Ser Lys Lys Thr
1 5 10 15
Arg Asn Ser Lys Lys Gly Lys Asp Met Glu Gln Lys Lys Lys Lys Asn
20 25 30
Ile Trp Leu Ile Ile Val Pro Ile Leu Ile Ile Ile Ser Leu Ile Gly
35 40 45
Ala Gly Ala Tyr Ala Leu Ile Asp Ser Leu Ile Pro Thr Asp His Thr
50 55 60
Lys Thr Asn Ser Ser Asp Gln Pro Thr Lys Thr Ser Val Ser Asn Gly
65 70 75 80
Tyr Ile Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu
85 90 95
Val Asp Asp Ala Gly Val Ser Glu Trp Val Lys Val Pro Ser Lys Ala
100 105 110
Asn Leu Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr
115 120 125
Arg Ile Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp
130 135 140
Gln Arg Met Lys Met Ser Glu Val Ile Ala Lys Tyr His Asn Ala Leu
145 150 155 160
Ile Met Asn Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Ala Gly
165 170 175
Phe Gln Ile Asn Asn Gly Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr
180 185 190
Thr Thr Gln Tyr Ala Phe Val Val Asn Lys Asp Gly Ser Cys Lys Ile
195 200 205
Tyr Asp Ser Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Gln
210 215 220
Gln Ala Tyr Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln
225 230 235 240
Pro Ser Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn
245 250 255
Asp Lys Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly
260 265 270
Tyr Asp Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met
275 280 285
Leu Leu Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys
290 295 300
Thr Ile Val Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val
305 310 315 320
Met Lys
<210> 21
<211> 903
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF encoding the putative LytR family transcriptional regulator protein of DSM 33134
<400> 21
atgaatcaaa aaaagaggcg tcattatcgt aagaaaaaac acacagtact aaaagttatt 60
tcaattattt ttgtattagt aattatcgct gttgcttcta tagcctacgt agcttataga 120
aatgttgaat caaccttttc aacatcatat gaaaatttcc ctaaaacaac aagtatcgac 180
ttaaaaaagt ctaaaacatt caccacactt atcattgcaa ctggtaaaaa taattctaaa 240
aatacagctt atgctactgt tttagcttca acgaatgtaa agacaaatca aactactttc 300
atgaacttcc cagtttttgc gacaatgcct aatcaaaaaa caatcactga agtttacaat 360
acgaatggag atgatggaat tttccagatg gttaaagacc tattgaatgt gtccattaac 420
aaagtaattc agattgatgt taataaaatg ggatcacttg tacaggccac tggtggaatc 480
accatgcaaa atccaaaggc attcaatgct gaaggttatg agtttaaaca aggaactgtt 540
aatttacaaa ctgctgatca agtccaagcc tatatgacac aaattgacga tactgatttg 600
gatgcttcaa tcacccggat tcaaaatgtc tcaatggaac tctacggaaa tattcaaaaa 660
attgctcata tgaaaaaact tgaaagtttc aattactatc gagaaattct ctatgctttt 720
tcaaacactg ttaaaaccaa tataagtttc aatgatgcta aaacgatcgt tatgagctat 780
aatacggctc taaagaatac cagcaagctc aatctacata caacagatga aaatggagct 840
aaggtcgttt ctcaaacaga attagactca gtcaaaaccc tttttgaaaa atctctaaaa 900
taa 903
<210> 22
<211> 300
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_lytR
<400> 22
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Val Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Thr Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 23
<211> 1191
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF encoding the putative nucleotide sugar dehydrogenase protein of DSM 33134
<400> 23
atgtttaatg aaagaaggat cataatgaaa atagcagtag caggaacagg ttatgtaggt 60
ttatctctag ccacattact aagccaaaaa aatgaggtag ttgcacttga tgtaatacca 120
gaaaaagtag agaagataaa taatagaata agtccaattc aagatgaata catagaaaaa 180
tatttcaaag aaaaagaact taatttaaaa gcaactttag attataaaga agcatttgag 240
aatgcagaat ttattataat aagtacgcca acaaattatg attcagaaaa aaattatttt 300
gacacatcat ctgttgaaga tataattcag aaagtaaaaa gtatgaatat agatacaaca 360
atggttgtta aatcaactat tcctgttgga tttataaagg caatgaaaga aagatatcaa 420
atagacaata taatgtttag tcctgaattt ttaagagaag gaaaagcttt atatgataat 480
ttatatccat caagaataat agtgggagaa aaatcagata gagcagaaaa atttgctaat 540
cttttaaaag aaaattgttt aaaagaagat gtagtagtta aatatatgga ttctactgag 600
gctgaggcag taaaattatt tgcaaataca tatttagcac ttagagttgc atattttaat 660
gaattagata catatgctga attaaaaggt ttaaatacaa aagatattat agatggagta 720
tgtttagatc ctcgtattgg aaatcattat aacaacccta gttttggctt tggggggtat 780
tgtcttccaa aggactcgaa gcaattaaag gcaaattata aagatgttcc agagaatatt 840
atcagtgcaa tagttgaatc taatagaact agaaaagacc atattgccga tatgatctca 900
aaaagaaacc caaaagtagt tggaatatat agattaacaa tgaaatctgg gtcagataat 960
tttagagcta gtgcaattca aggagttatg aaaagaatca aagctaaagg aattgaagtt 1020
gttgtttatg agccaacttt aaaagaagat aatttcttta atagtaaagt aatcaaagat 1080
atagatgaat ttaaaaagat atcagatgtc attatagtaa atagacttga tgaaaatgta 1140
tctagtgtaa aagataaagt ttatacaaga gatctattcg ctagagatta a 1191
<210> 24
<211> 396
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_nucleotide_sugar_dehydrogenase
<400> 24
Met Phe Asn Glu Arg Arg Ile Ile Met Lys Ile Ala Val Ala Gly Thr
1 5 10 15
Gly Tyr Val Gly Leu Ser Leu Ala Thr Leu Leu Ser Gln Lys Asn Glu
20 25 30
Val Val Ala Leu Asp Val Ile Pro Glu Lys Val Glu Lys Ile Asn Asn
35 40 45
Arg Ile Ser Pro Ile Gln Asp Glu Tyr Ile Glu Lys Tyr Phe Lys Glu
50 55 60
Lys Glu Leu Asn Leu Lys Ala Thr Leu Asp Tyr Lys Glu Ala Phe Glu
65 70 75 80
Asn Ala Glu Phe Ile Ile Ile Ser Thr Pro Thr Asn Tyr Asp Ser Glu
85 90 95
Lys Asn Tyr Phe Asp Thr Ser Ser Val Glu Asp Ile Ile Gln Lys Val
100 105 110
Lys Ser Met Asn Ile Asp Thr Thr Met Val Val Lys Ser Thr Ile Pro
115 120 125
Val Gly Phe Ile Lys Ala Met Lys Glu Arg Tyr Gln Ile Asp Asn Ile
130 135 140
Met Phe Ser Pro Glu Phe Leu Arg Glu Gly Lys Ala Leu Tyr Asp Asn
145 150 155 160
Leu Tyr Pro Ser Arg Ile Ile Val Gly Glu Lys Ser Asp Arg Ala Glu
165 170 175
Lys Phe Ala Asn Leu Leu Lys Glu Asn Cys Leu Lys Glu Asp Val Val
180 185 190
Val Lys Tyr Met Asp Ser Thr Glu Ala Glu Ala Val Lys Leu Phe Ala
195 200 205
Asn Thr Tyr Leu Ala Leu Arg Val Ala Tyr Phe Asn Glu Leu Asp Thr
210 215 220
Tyr Ala Glu Leu Lys Gly Leu Asn Thr Lys Asp Ile Ile Asp Gly Val
225 230 235 240
Cys Leu Asp Pro Arg Ile Gly Asn His Tyr Asn Asn Pro Ser Phe Gly
245 250 255
Phe Gly Gly Tyr Cys Leu Pro Lys Asp Ser Lys Gln Leu Lys Ala Asn
260 265 270
Tyr Lys Asp Val Pro Glu Asn Ile Ile Ser Ala Ile Val Glu Ser Asn
275 280 285
Arg Thr Arg Lys Asp His Ile Ala Asp Met Ile Ser Lys Arg Asn Pro
290 295 300
Lys Val Val Gly Ile Tyr Arg Leu Thr Met Lys Ser Gly Ser Asp Asn
305 310 315 320
Phe Arg Ala Ser Ala Ile Gln Gly Val Met Lys Arg Ile Lys Ala Lys
325 330 335
Gly Ile Glu Val Val Val Tyr Glu Pro Thr Leu Lys Glu Asp Asn Phe
340 345 350
Phe Asn Ser Lys Val Ile Lys Asp Ile Asp Glu Phe Lys Lys Ile Ser
355 360 365
Asp Val Ile Ile Val Asn Arg Leu Asp Glu Asn Val Ser Ser Val Lys
370 375 380
Asp Lys Val Tyr Thr Arg Asp Leu Phe Ala Arg Asp
385 390 395
<210> 25
<211> 780
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsC gene of DSM 33134
<400> 25
atgcaggaaa cacaggaaca aacgattgat ttaagaggga tttttaaaat tattcgcaaa 60
aggttaggtt taatattatt tagtgcttta atagtcacaa tattagggag catctacaca 120
ttttttatag cctccccagt ttacacagcc tcaactcaac ttgtcgttaa actaccaaat 180
ttggataatt cagcagccta cgctggacaa gtgaccggga atattcaaat ggcgaacaca 240
attaaccaag ttattgttag tccagtcatt ttagataaag ttcaaagtaa tttaaatcta 300
tctgatgact ctttccaaaa acaagttaca gcagcaaatc aaacaaattc acaagtcatt 360
acgcttactg ttaaatattc taatccttac gttgctcaaa agattgcaga cgagactgct 420
aaaatattta gttcagatgc agcaaaacta ttgaatgtta ctaacgttaa tattctatcc 480
aaagcaaaag ctcaaacaac acccattagt cctaaaccta aattgtattt agcaatatct 540
gttatagccg gattagtttt aggtttagcc attgctttat tgaaggaatt gtttgataac 600
aaaattaata aagaagaaga tattgaagct ctgggactca cggttcttgg tgtaacaacc 660
tgtgctcaaa tgagtgattt taataataat acgaataaaa atggcacgca atcgggaact 720
aagtcaagtc cgcctagcga ccatgaagta aatagatcat caaaaaggaa taaaagatag 780
<210> 26
<211> 259
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsC
<400> 26
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Leu Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Val Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Cys Ala Gln Met
210 215 220
Ser Asp Phe Asn Asn Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 27
<211> 687
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the epsE gene of DSM 33134
<400> 27
atgaaagttt ttgaggatgc cgcatcacct gaatcggaag agcataagtt agtagaatta 60
aaaaaatttt cttatagaga gctaattata aaaagagcaa ttgatatcct aggaggatta 120
gcaggttcag ttttatttct tattgcggct gcattacttt atctccctta caaaatgagc 180
tcaaaaaagg atcaagggcc aatgttctat aaacaaaaac gctatggaaa aaacgggaaa 240
attttttata ttttgaaatt taggacaatg atagttaatg ctgagcagta tttagagcta 300
catccagaag ttaaagccgc ctatcatgcc aatggcaata aactagaaaa tgacccccgt 360
gtgacgaaga ttggttcatt tattagacaa cactcaattg atgaattacc acaatttatc 420
aatgttctta aaggggatat ggcattggtt ggcccaagac caattttgct ttttgaagcg 480
aaagaatatg gggagcgcct ctcttactta ctcatgtgta aacctggaat tactggttat 540
tggacaacac atggtcgaag taaagttttt tttcctcaac gagcagattt agaactctat 600
tacctccagt accatagcac caaaaacgat atcaagcttc tagtactcac aattgtacaa 660
agtattaacg gatcggacgc atattaa 687
<210> 28
<211> 228
<212> PRT
<213> Lactococcus lactis subsp. lactis
<220>
<223> 33134_epsE
<400> 28
Met Lys Val Phe Glu Asp Ala Ala Ser Pro Glu Ser Glu Glu His Lys
1 5 10 15
Leu Val Glu Leu Lys Lys Phe Ser Tyr Arg Glu Leu Ile Ile Lys Arg
20 25 30
Ala Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Leu Pro Tyr Lys Met Ser Ser Lys Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Val Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Glu Arg Leu Ser Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Phe Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Val Leu Thr Ile Val Gln Ser Ile Asn Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 29
<211> 324
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsR gene of DSM 33136
<400> 29
atgtttatga atgatttatt ttaccatcgt ctaaaggaac tagttgaagc aagtggtaaa 60
tctgcaaatc aaatagaaag ggaattgggt taccctagaa attctttgaa taattataag 120
ttgggaggag aaccctctgg gacaagatta ataggactat cagagtattt taatgtgtct 180
ccaaaatatc tgatgggtat aattgatgag cctaatgaca gttctgcaat taatcttttt 240
aaaactctaa ctcaagaaga gaaaaaagaa atgtttataa tttgtcaaaa atggcttttt 300
ttagaatatc aaatagagtt ataa 324
<210> 30
<211> 107
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsR
<400> 30
Met Phe Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu
1 5 10 15
Ala Ser Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro
20 25 30
Arg Asn Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr
35 40 45
Arg Leu Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu
50 55 60
Met Gly Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe
65 70 75 80
Lys Thr Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln
85 90 95
Lys Trp Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 31
<211> 768
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsX gene of DSM 33136
<400> 31
atgatgaaaa aaggaatttt tgtaattact atagtgatat ctatagcatt gataattgga 60
ggtttttata gttataattc taggataagt aatctttcaa aagctgataa aggaaaagaa 120
gttgtaaaaa atagcagtga aaaaaatcag atagacctta cctataaaaa gtattataaa 180
aatttaccaa aatcagttca aaataaaata gatgatattt catccaaaaa taaagaagtt 240
actttaactt gtatttggca atctgattca gttatttctg aacaatttca acaaaactta 300
caaaaatatt atggaaataa gttttggaac atcaaaaata tcacttacaa tggcgaaact 360
agtgaacaat tattggctga aaaagttgaa aaccaagtat tagccactaa tcctgatgtt 420
gttttatatg aagctccact ttttaatgat aaccaaaaca ttgaagcaac agcctcactg 480
actagtaatg agcaacttat aacaaatttg gctagtgcag gagcggaggt aatagttcaa 540
ccctctccac cgatctatgg tggtgttgtg taccccgtac aagaagaaca atttaaacaa 600
tctttatcta caaagtatcc ctatatagac tactgggcta gttacccaga caaaaattct 660
gatgaaatga agggactgtt ttctgatgat ggagtatata gaacattaaa tgcttcgggg 720
aataaggttt ggctagatta tattactaaa tattttacag caaactaa 768
<210> 32
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsX
<400> 32
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Leu Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Ser Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Leu
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Ala Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 33
<211> 765
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsB gene of DSM 33136
<400> 33
atgattgata ttcactgcca tattttaccg gggatagatg atggagctaa aacttatgaa 60
gatactttga aaatgctgaa atcagcaatt gatgaaggga taacaactat cactgcgact 120
cctcatcata atcctcaatt taagaatgaa tcaccgctta ttttgaaaaa agttaaggaa 180
gttcaaaata tcattgacga acatcaatta ccaattgaag ttttacccgg acaagaggtg 240
agaatatatg gtgatttatt aaaagaattt tctgaaggaa agttactgac agcagcgggc 300
acttcaagtt atatattgat tgaatttcca tcaaatcatg tgccagctta tgctaaagaa 360
cttttttata atattcaatt ggagggattt caacctattt tggtccaccc tgagcgtaat 420
agtgcaatca ttgagaaccc tgatctatta tttgatttta ttgaacaagg agtactaagt 480
cagataactg cttcaagtgt cactggtcat tttggtaaaa aaatacaaaa gctgtcattt 540
aaaatgatag aaaaccatct gacgcatttt gttgcatcag atgcgcataa tgtgacgtca 600
cgtgcattta agatgaagga agcgtttgaa attattgaag atagttatgg ttctggtgta 660
tcactaatgt ttcaaaataa tgcagagtca gtgattttaa acgaaagttt ttatcaagaa 720
aaaccaacaa agatcaaaac aaagaaattt ttaggattat tttaa 765
<210> 34
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsB
<400> 34
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Tyr Glu Asp Thr Leu Lys Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Lys
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Phe Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Ala Ile Ile
130 135 140
Glu Asn Pro Asp Leu Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Leu Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 35
<211> 696
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsD gene of DSM 33136
<400> 35
atggctaaaa ataaaagaag catagacaat aatcgttata ttattaccag tgtcaatcct 60
caatcaccta tttccgaaca atatcgtacg attcgtacga ccattgattt taaaatggcg 120
gatcaaggga ttaaaagttt tctagtaaca tcttcagaag cagctgcagg taaatcaacc 180
gagagtgcta atctagctgt tgcttttgca caacaaggta aaaaagtact tttaattgat 240
ggcgatcttc gtaaaccgac tgttaacatt acttttaaag tacaaaatag agtaggatta 300
accaatattt taatgcatca atcttcgatt gaagatgcca tacaagggac aagactttct 360
gaaaatctta caataattac ctctggtcca attccaccta atccatcgga attattagca 420
tctagtgcaa tgaagaattt gattgactct gtgtccgatt cctttgatgt tgttttgatt 480
gatactccac ctctctctgc agttactgat gctcaaattt tgagtattta tgtaggagga 540
gtggttcttg ttgtacgtgc ctatgaaaca aaaaaagaga gtttagcaaa aacaaaaaaa 600
atactggaac aagttaatgt aaatatatta ggagttgttt tgcatggggt agactcttct 660
gactcaccgt cgtattacta ctacggagta gagtaa 696
<210> 36
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsD
<400> 36
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Thr Glu Ser Ala Asn
50 55 60
Leu Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Ser Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ile
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Val Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Asp Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 37
<211> 1122
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT1 protein of DSM 33136
<400> 37
atgaaaaaaa agaaattatt actaataagt caaagcggaa gaggtggagt aaggaggcat 60
ttgtgtgatc ttatgcttaa cctcgattat gaaattttcg aggtatgggt tgcttacaat 120
gatgatgcta ttgatgatat atttagacaa acaatagagc aattatcagg aaaaattact 180
cctatactaa taaataatct tgtcagggaa ttaaatttaa aggaggatat aaaagcatat 240
ttaaaattaa gcaaactaat aaagaaagtc aagccggata ttgtacattg tcacagttct 300
aaagctggtg ttattggtcg tttagctgcc aaaagacgag gtgttaaaaa aatattttat 360
acgccacatg cttattcgtt tttggcacct gaatttagtg gaaagaaaaa gtttcttttt 420
gttcaaattg aaaagttttt aagccgattt tcgacaactc agacattttg tgtgtcaata 480
ggggaaatgc aagctgctct tgaagtaaat ctagataaaa ccgataagtt tcaggtaatt 540
tataatggtt tgccagaaat tgatttacca agcaaagaaa cgattcgggc gcaattagga 600
ctggaaaaga cagtagttgt tataggcaat aacgcaagaa tgtcggaaca gaaaaatcct 660
atgtttttta tggaaattgc ccaaaaaatg attagacaaa acgcaaattg gcattttgtg 720
tgggcaggtg atggtcagct tatgccactt tttcaatcat ttattaagca aaatggacta 780
gagaaaaata ttcatttgct tggggagcgt cctgatagtg aaacagttgt gacagcctat 840
gacatcttct tgacgacttc ccaatatgaa ggtttacctt atgcaccaat tgaagcgatg 900
cgagctggtg tcccgattct tgcgacaaat gttgttggca atagtgagct tgtgatagag 960
ggaaaaaatg gttatttgat cgacttagag tggtcaaaat ctgtcgaaga aaaattatat 1020
aaggcagcga aaatggatgc acaaatgatt aaagcagatt ttaggcaaag gtttgcgatt 1080
gatcagatgt taaagcaaat tgaaacaatt tatttagctt ga 1122
<210> 38
<211> 373
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_GT1
<400> 38
Met Lys Lys Lys Lys Leu Leu Leu Ile Ser Gln Ser Gly Arg Gly Gly
1 5 10 15
Val Arg Arg His Leu Cys Asp Leu Met Leu Asn Leu Asp Tyr Glu Ile
20 25 30
Phe Glu Val Trp Val Ala Tyr Asn Asp Asp Ala Ile Asp Asp Ile Phe
35 40 45
Arg Gln Thr Ile Glu Gln Leu Ser Gly Lys Ile Thr Pro Ile Leu Ile
50 55 60
Asn Asn Leu Val Arg Glu Leu Asn Leu Lys Glu Asp Ile Lys Ala Tyr
65 70 75 80
Leu Lys Leu Ser Lys Leu Ile Lys Lys Val Lys Pro Asp Ile Val His
85 90 95
Cys His Ser Ser Lys Ala Gly Val Ile Gly Arg Leu Ala Ala Lys Arg
100 105 110
Arg Gly Val Lys Lys Ile Phe Tyr Thr Pro His Ala Tyr Ser Phe Leu
115 120 125
Ala Pro Glu Phe Ser Gly Lys Lys Lys Phe Leu Phe Val Gln Ile Glu
130 135 140
Lys Phe Leu Ser Arg Phe Ser Thr Thr Gln Thr Phe Cys Val Ser Ile
145 150 155 160
Gly Glu Met Gln Ala Ala Leu Glu Val Asn Leu Asp Lys Thr Asp Lys
165 170 175
Phe Gln Val Ile Tyr Asn Gly Leu Pro Glu Ile Asp Leu Pro Ser Lys
180 185 190
Glu Thr Ile Arg Ala Gln Leu Gly Leu Glu Lys Thr Val Val Val Ile
195 200 205
Gly Asn Asn Ala Arg Met Ser Glu Gln Lys Asn Pro Met Phe Phe Met
210 215 220
Glu Ile Ala Gln Lys Met Ile Arg Gln Asn Ala Asn Trp His Phe Val
225 230 235 240
Trp Ala Gly Asp Gly Gln Leu Met Pro Leu Phe Gln Ser Phe Ile Lys
245 250 255
Gln Asn Gly Leu Glu Lys Asn Ile His Leu Leu Gly Glu Arg Pro Asp
260 265 270
Ser Glu Thr Val Val Thr Ala Tyr Asp Ile Phe Leu Thr Thr Ser Gln
275 280 285
Tyr Glu Gly Leu Pro Tyr Ala Pro Ile Glu Ala Met Arg Ala Gly Val
290 295 300
Pro Ile Leu Ala Thr Asn Val Val Gly Asn Ser Glu Leu Val Ile Glu
305 310 315 320
Gly Lys Asn Gly Tyr Leu Ile Asp Leu Glu Trp Ser Lys Ser Val Glu
325 330 335
Glu Lys Leu Tyr Lys Ala Ala Lys Met Asp Ala Gln Met Ile Lys Ala
340 345 350
Asp Phe Arg Gln Arg Phe Ala Ile Asp Gln Met Leu Lys Gln Ile Glu
355 360 365
Thr Ile Tyr Leu Ala
370
<210> 39
<211> 1083
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzy gene of DSM 33136
<400> 39
atggcaattt attttttact tttcccgatg atcgcaatga tttatttaat gacattgctc 60
ttacgacaaa aagcacaaat ccaaaaaacg attttttgtg ttcttacgtt tggtacacta 120
ggctttattt cagcaagtcg tgcttcaagt gttgggacag atgttacgct atacgaaaat 180
atttttaaat ctataaatta cgggataagt gctgaaaata atttgggata tgtcatctat 240
aacaagttga ttggtagtgt atttggctat acgggacatg aaatcacagc tgctaattct 300
gttttgatta cgatacttat tggttttttt atttggaaag tagcggaaca ttattttgtt 360
gcgacgtttt tatacattag cttgttttat tatgctacaa gttttaatat ttcaagacaa 420
tttattgcca tggggcttgt attggtagca atttcttttg ctttagataa aaaggttatg 480
ccttggttta tcttgacagt tttggctacc ttatttcatg cgacagcaat cgtttctttt 540
cctgtctatt ggcttacaaa agtacattgg gatgtgaaaa agacattagg tatttttcca 600
attacgattt ttgcaagttt tatttttgat gctattttaa acattttttt acgttttttc 660
ccacattatg agatgtatat tactggaaca caatttaata ttgcagatca ggggcaggga 720
cgtgtggttt tggtcaaaat atttatcttg ctcattttgt ttactttaat cttgttttat 780
aaaaaaagct atgctttgat ttctgaatgt catcaaagtt tgatagcttt gacaaccgtt 840
ggattaagta tcggtattgt attttataat aatattttac tcaatagaat agaaatgttt 900
tattcaattt taagcatcgt atttattcca attgctatag attactttag tttgaaattt 960
aaagaaaaag atactgtgcg acaaatgctg acgataggta ttttgttaat tacacttgtg 1020
ccttactata tacaggttag cggtaattat tcaggaatat tgccatatac gatgaataaa 1080
tag 1083
<210> 40
<211> 360
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_wzy
<400> 40
Met Ala Ile Tyr Phe Leu Leu Phe Pro Met Ile Ala Met Ile Tyr Leu
1 5 10 15
Met Thr Leu Leu Leu Arg Gln Lys Ala Gln Ile Gln Lys Thr Ile Phe
20 25 30
Cys Val Leu Thr Phe Gly Thr Leu Gly Phe Ile Ser Ala Ser Arg Ala
35 40 45
Ser Ser Val Gly Thr Asp Val Thr Leu Tyr Glu Asn Ile Phe Lys Ser
50 55 60
Ile Asn Tyr Gly Ile Ser Ala Glu Asn Asn Leu Gly Tyr Val Ile Tyr
65 70 75 80
Asn Lys Leu Ile Gly Ser Val Phe Gly Tyr Thr Gly His Glu Ile Thr
85 90 95
Ala Ala Asn Ser Val Leu Ile Thr Ile Leu Ile Gly Phe Phe Ile Trp
100 105 110
Lys Val Ala Glu His Tyr Phe Val Ala Thr Phe Leu Tyr Ile Ser Leu
115 120 125
Phe Tyr Tyr Ala Thr Ser Phe Asn Ile Ser Arg Gln Phe Ile Ala Met
130 135 140
Gly Leu Val Leu Val Ala Ile Ser Phe Ala Leu Asp Lys Lys Val Met
145 150 155 160
Pro Trp Phe Ile Leu Thr Val Leu Ala Thr Leu Phe His Ala Thr Ala
165 170 175
Ile Val Ser Phe Pro Val Tyr Trp Leu Thr Lys Val His Trp Asp Val
180 185 190
Lys Lys Thr Leu Gly Ile Phe Pro Ile Thr Ile Phe Ala Ser Phe Ile
195 200 205
Phe Asp Ala Ile Leu Asn Ile Phe Leu Arg Phe Phe Pro His Tyr Glu
210 215 220
Met Tyr Ile Thr Gly Thr Gln Phe Asn Ile Ala Asp Gln Gly Gln Gly
225 230 235 240
Arg Val Val Leu Val Lys Ile Phe Ile Leu Leu Ile Leu Phe Thr Leu
245 250 255
Ile Leu Phe Tyr Lys Lys Ser Tyr Ala Leu Ile Ser Glu Cys His Gln
260 265 270
Ser Leu Ile Ala Leu Thr Thr Val Gly Leu Ser Ile Gly Ile Val Phe
275 280 285
Tyr Asn Asn Ile Leu Leu Asn Arg Ile Glu Met Phe Tyr Ser Ile Leu
290 295 300
Ser Ile Val Phe Ile Pro Ile Ala Ile Asp Tyr Phe Ser Leu Lys Phe
305 310 315 320
Lys Glu Lys Asp Thr Val Arg Gln Met Leu Thr Ile Gly Ile Leu Leu
325 330 335
Ile Thr Leu Val Pro Tyr Tyr Ile Gln Val Ser Gly Asn Tyr Ser Gly
340 345 350
Ile Leu Pro Tyr Thr Met Asn Lys
355 360
<210> 41
<211> 972
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT2 protein of DSM 33136
<400> 41
atgtatatac ataatctaat ttccgtcatt attccagtat ataatgtaga aaaatattta 60
gaaaagtgtt tgaagtctgt tcaaaatcag agttatgcgc attttgaagt gatcttaatc 120
aacgatggtt caacggattc ttctttaaaa atttgtgagg catttatcaa aaaagataag 180
cgcttttctg ttttaacaaa agaaaatggt ggactttctt cggctcgaaa tttgggttta 240
aaaaaaatca ggggaaaata tgtgacattt gtggatagtg acgattatct atcagagcat 300
tatcttaaac attttgtgag tggtatagag agtgaaaaga gtatcgtttg ttcaaaattt 360
cttcttgttg atgaaaatgg tgtttttctt tctaaaagac agagaattca agaaaaaaaa 420
cttatttttt ctaaagaaga aggcataaaa gaaattttat tacaaaataa aatggatcac 480
tcagcttggg gaaaattata tccgatatct ttttttgaaa atatcacttt tccagatgga 540
aaattgtttg aagatatggg aacgacctat aagttattgg ctttagctaa cgaagttgta 600
tttttagatg aatatgatta ttattatctt caacaaccca atagcattat gaacagttca 660
tttaatttaa aaaaattaga tattatagat atgtcaaagg aaatgattaa agatatcgtt 720
aatacctgcc ctcaacttgt gaattatgct aaaaatagag catttagtgc agaggcaggt 780
atctttttag atgtgccaaa tactaaagcg tttgaatcgg cgcaaaagct gctttggaaa 840
gaagtaagag aaaatagata tgcaccattt ttgataaaag gggctagact taagaataag 900
ttaggtgcta ttttgtcgtt tttgggtagg agattttttt tgaaactcgg gaaacagttg 960
gtaggtaaat aa 972
<210> 42
<211> 323
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_GT2
<400> 42
Met Tyr Ile His Asn Leu Ile Ser Val Ile Ile Pro Val Tyr Asn Val
1 5 10 15
Glu Lys Tyr Leu Glu Lys Cys Leu Lys Ser Val Gln Asn Gln Ser Tyr
20 25 30
Ala His Phe Glu Val Ile Leu Ile Asn Asp Gly Ser Thr Asp Ser Ser
35 40 45
Leu Lys Ile Cys Glu Ala Phe Ile Lys Lys Asp Lys Arg Phe Ser Val
50 55 60
Leu Thr Lys Glu Asn Gly Gly Leu Ser Ser Ala Arg Asn Leu Gly Leu
65 70 75 80
Lys Lys Ile Arg Gly Lys Tyr Val Thr Phe Val Asp Ser Asp Asp Tyr
85 90 95
Leu Ser Glu His Tyr Leu Lys His Phe Val Ser Gly Ile Glu Ser Glu
100 105 110
Lys Ser Ile Val Cys Ser Lys Phe Leu Leu Val Asp Glu Asn Gly Val
115 120 125
Phe Leu Ser Lys Arg Gln Arg Ile Gln Glu Lys Lys Leu Ile Phe Ser
130 135 140
Lys Glu Glu Gly Ile Lys Glu Ile Leu Leu Gln Asn Lys Met Asp His
145 150 155 160
Ser Ala Trp Gly Lys Leu Tyr Pro Ile Ser Phe Phe Glu Asn Ile Thr
165 170 175
Phe Pro Asp Gly Lys Leu Phe Glu Asp Met Gly Thr Thr Tyr Lys Leu
180 185 190
Leu Ala Leu Ala Asn Glu Val Val Phe Leu Asp Glu Tyr Asp Tyr Tyr
195 200 205
Tyr Leu Gln Gln Pro Asn Ser Ile Met Asn Ser Ser Phe Asn Leu Lys
210 215 220
Lys Leu Asp Ile Ile Asp Met Ser Lys Glu Met Ile Lys Asp Ile Val
225 230 235 240
Asn Thr Cys Pro Gln Leu Val Asn Tyr Ala Lys Asn Arg Ala Phe Ser
245 250 255
Ala Glu Ala Gly Ile Phe Leu Asp Val Pro Asn Thr Lys Ala Phe Glu
260 265 270
Ser Ala Gln Lys Leu Leu Trp Lys Glu Val Arg Glu Asn Arg Tyr Ala
275 280 285
Pro Phe Leu Ile Lys Gly Ala Arg Leu Lys Asn Lys Leu Gly Ala Ile
290 295 300
Leu Ser Phe Leu Gly Arg Arg Phe Phe Leu Lys Leu Gly Lys Gln Leu
305 310 315 320
Val Gly Lys
<210> 43
<211> 990
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT3 protein of DSM 33136
<400> 43
ttggaggaaa taaattatat gatgaagata tcagttattg ttcctgttta taatagtgaa 60
aacacaattg aaaagtgttt aatttcattg caaaagcaaa catataaaaa tttagaaatt 120
attgtaataa atgatggttc agtggataca acagaagata aaatagtaag aattatagag 180
aatgataaaa gatttattta ttttaaaact gccaatcaag gacaatctga agcgaggagt 240
tttggattga gtaaagcaac tggagagtta ataggattcg ttgattcaga tgattttata 300
gattatgata tgtatgaaat attagaaaaa aatatgagag atacaaaatc agatatttca 360
attataagat cgataatatc atttccaaat ggattcgaaa taatacctag ctgtcaaaat 420
acttttttta tcaaaacagg taatgaaatg atatttgagt atattggtgg ttttcatttt 480
ggggttgccc tatgggataa actttataaa agaagtttat ttgatggatt aaaattagat 540
acttctttta atttaatgga agatgcttta atgggtaact atgtgtttaa taaagcgaaa 600
aaaattgttt atacaggaaa agcaaagtat cattatttgc agagaaaaaa cagtacagcc 660
agaaaaaatt tggaagatag tgatttaaag gcaataggtg ttgtcttggg tatgaaagag 720
ctttatactg atagcattga gttagataaa gcatttcaac gcagatttgc acaaacaata 780
ctggaattac tcagcaaaaa tcccacaaaa gaacaacgaa aacaaataga gagagctttg 840
tttgaaacaa tagaattaga taaattgggt tatttaaaaa agggtgataa gttgctaata 900
agattgatct attataaatt tccgtcatct cccttaattc aagccaaaaa aatgattgga 960
cgaacagtaa gaaaatttaa gaagatatag 990
<210> 44
<211> 329
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_GT3
<400> 44
Leu Glu Glu Ile Asn Tyr Met Met Lys Ile Ser Val Ile Val Pro Val
1 5 10 15
Tyr Asn Ser Glu Asn Thr Ile Glu Lys Cys Leu Ile Ser Leu Gln Lys
20 25 30
Gln Thr Tyr Lys Asn Leu Glu Ile Ile Val Ile Asn Asp Gly Ser Val
35 40 45
Asp Thr Thr Glu Asp Lys Ile Val Arg Ile Ile Glu Asn Asp Lys Arg
50 55 60
Phe Ile Tyr Phe Lys Thr Ala Asn Gln Gly Gln Ser Glu Ala Arg Ser
65 70 75 80
Phe Gly Leu Ser Lys Ala Thr Gly Glu Leu Ile Gly Phe Val Asp Ser
85 90 95
Asp Asp Phe Ile Asp Tyr Asp Met Tyr Glu Ile Leu Glu Lys Asn Met
100 105 110
Arg Asp Thr Lys Ser Asp Ile Ser Ile Ile Arg Ser Ile Ile Ser Phe
115 120 125
Pro Asn Gly Phe Glu Ile Ile Pro Ser Cys Gln Asn Thr Phe Phe Ile
130 135 140
Lys Thr Gly Asn Glu Met Ile Phe Glu Tyr Ile Gly Gly Phe His Phe
145 150 155 160
Gly Val Ala Leu Trp Asp Lys Leu Tyr Lys Arg Ser Leu Phe Asp Gly
165 170 175
Leu Lys Leu Asp Thr Ser Phe Asn Leu Met Glu Asp Ala Leu Met Gly
180 185 190
Asn Tyr Val Phe Asn Lys Ala Lys Lys Ile Val Tyr Thr Gly Lys Ala
195 200 205
Lys Tyr His Tyr Leu Gln Arg Lys Asn Ser Thr Ala Arg Lys Asn Leu
210 215 220
Glu Asp Ser Asp Leu Lys Ala Ile Gly Val Val Leu Gly Met Lys Glu
225 230 235 240
Leu Tyr Thr Asp Ser Ile Glu Leu Asp Lys Ala Phe Gln Arg Arg Phe
245 250 255
Ala Gln Thr Ile Leu Glu Leu Leu Ser Lys Asn Pro Thr Lys Glu Gln
260 265 270
Arg Lys Gln Ile Glu Arg Ala Leu Phe Glu Thr Ile Glu Leu Asp Lys
275 280 285
Leu Gly Tyr Leu Lys Lys Gly Asp Lys Leu Leu Ile Arg Leu Ile Tyr
290 295 300
Tyr Lys Phe Pro Ser Ser Pro Leu Ile Gln Ala Lys Lys Met Ile Gly
305 310 315 320
Arg Thr Val Arg Lys Phe Lys Lys Ile
325
<210> 45
<211> 1398
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzx gene of DSM 33136
<400> 45
atgaataaat acaaaaaact gctatccaac tcactcgttt tcacaatagg aaatttgggt 60
agcaaactgt tagtcttttt actcgtacca ctctacactt atgcgatgac accgcaagag 120
tatggtatgg cagacttgta ccaaacaaca gccagtctac ttttgccatt gattacgatg 180
aatgtgtttg atgcaacttt acgttttgcc atggaaaagt caatgacaaa agagagcgtc 240
ttaacaaatt ctcttgtagt atggtgtttt agcgctgtgt tctcttgttt cggcatgatt 300
tttgtctatg cactgaactt gagtaataaa tggtacttag ccctactttt tcttatcatc 360
ttattccaag gtgggcaaag catactaagt caatatgcga gaggcattgg gaaatcgaaa 420
ttatttgcag ctggcggagt tattttaacc tttttgacag gtgctttaaa tatcttcttc 480
ttggtatttt tacatgctgg aattacgggc tacctcatgt ccctagtttt agcgaattta 540
gggacaatcc ttttttttgc ggggacactt tccatttggc aggcaatcaa ttttaaagta 600
atcgataagg aaatgatttg gcaaatgctc tattatgcct tacctttaat tcctaatgcc 660
atcatgtggt ggtcactgaa cgcttctaat cgctatttcg ttttattctt tttaggagca 720
ggtgctaatg gccttttggc ggtcgctacc aaaatcccaa gtattatttc aatttttaat 780
acgattttta cacaggcatg gcaaatttca gccatagaag aatataattc tcatcaaaaa 840
tcaaaatatt attcggatgt ttttcactac ttagcaactt ttctattgtt agggacatca 900
gcttttatga ttgtgcttaa accagttgtc gaaaaagtcg tttcaagtga ctatgcaagt 960
tcatggcaat atgtcccctt ctttatgttg gcgatgttat tttcctcatt ttctggattt 1020
tttgggacca attatattgc ggccaaacaa acaaaaggcg tatttatgac atctatctat 1080
ggtgccattg tttgtgtcct actccaagtg gtgctgctac ccaccattgg cttgaatggt 1140
gctggattag cttcaatgct aggattcttg acaacatttt tattgcgtgt taaagatacg 1200
caaaaatttg tggcgattca gattaaatgg cgaattttta tcagtaattt attgatcgtt 1260
ttggcacaaa ttttatgttt gttttatcta ccgagtgaat ttttgtattt tggtcttgcc 1320
cttttgtttt gtggcatgtt agttgttaat cagcgtacaa ttttatacat tatcatggcg 1380
ctaaaaaata aaaaataa 1398
<210> 46
<211> 465
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_wzx
<400> 46
Met Asn Lys Tyr Lys Lys Leu Leu Ser Asn Ser Leu Val Phe Thr Ile
1 5 10 15
Gly Asn Leu Gly Ser Lys Leu Leu Val Phe Leu Leu Val Pro Leu Tyr
20 25 30
Thr Tyr Ala Met Thr Pro Gln Glu Tyr Gly Met Ala Asp Leu Tyr Gln
35 40 45
Thr Thr Ala Ser Leu Leu Leu Pro Leu Ile Thr Met Asn Val Phe Asp
50 55 60
Ala Thr Leu Arg Phe Ala Met Glu Lys Ser Met Thr Lys Glu Ser Val
65 70 75 80
Leu Thr Asn Ser Leu Val Val Trp Cys Phe Ser Ala Val Phe Ser Cys
85 90 95
Phe Gly Met Ile Phe Val Tyr Ala Leu Asn Leu Ser Asn Lys Trp Tyr
100 105 110
Leu Ala Leu Leu Phe Leu Ile Ile Leu Phe Gln Gly Gly Gln Ser Ile
115 120 125
Leu Ser Gln Tyr Ala Arg Gly Ile Gly Lys Ser Lys Leu Phe Ala Ala
130 135 140
Gly Gly Val Ile Leu Thr Phe Leu Thr Gly Ala Leu Asn Ile Phe Phe
145 150 155 160
Leu Val Phe Leu His Ala Gly Ile Thr Gly Tyr Leu Met Ser Leu Val
165 170 175
Leu Ala Asn Leu Gly Thr Ile Leu Phe Phe Ala Gly Thr Leu Ser Ile
180 185 190
Trp Gln Ala Ile Asn Phe Lys Val Ile Asp Lys Glu Met Ile Trp Gln
195 200 205
Met Leu Tyr Tyr Ala Leu Pro Leu Ile Pro Asn Ala Ile Met Trp Trp
210 215 220
Ser Leu Asn Ala Ser Asn Arg Tyr Phe Val Leu Phe Phe Leu Gly Ala
225 230 235 240
Gly Ala Asn Gly Leu Leu Ala Val Ala Thr Lys Ile Pro Ser Ile Ile
245 250 255
Ser Ile Phe Asn Thr Ile Phe Thr Gln Ala Trp Gln Ile Ser Ala Ile
260 265 270
Glu Glu Tyr Asn Ser His Gln Lys Ser Lys Tyr Tyr Ser Asp Val Phe
275 280 285
His Tyr Leu Ala Thr Phe Leu Leu Leu Gly Thr Ser Ala Phe Met Ile
290 295 300
Val Leu Lys Pro Val Val Glu Lys Val Val Ser Ser Asp Tyr Ala Ser
305 310 315 320
Ser Trp Gln Tyr Val Pro Phe Phe Met Leu Ala Met Leu Phe Ser Ser
325 330 335
Phe Ser Gly Phe Phe Gly Thr Asn Tyr Ile Ala Ala Lys Gln Thr Lys
340 345 350
Gly Val Phe Met Thr Ser Ile Tyr Gly Ala Ile Val Cys Val Leu Leu
355 360 365
Gln Val Val Leu Leu Pro Thr Ile Gly Leu Asn Gly Ala Gly Leu Ala
370 375 380
Ser Met Leu Gly Phe Leu Thr Thr Phe Leu Leu Arg Val Lys Asp Thr
385 390 395 400
Gln Lys Phe Val Ala Ile Gln Ile Lys Trp Arg Ile Phe Ile Ser Asn
405 410 415
Leu Leu Ile Val Leu Ala Gln Ile Leu Cys Leu Phe Tyr Leu Pro Ser
420 425 430
Glu Phe Leu Tyr Phe Gly Leu Ala Leu Leu Phe Cys Gly Met Leu Val
435 440 445
Val Asn Gln Arg Thr Ile Leu Tyr Ile Ile Met Ala Leu Lys Asn Lys
450 455 460
Lys
465
<210> 47
<211> 918
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsL gene of DSM 33136
<400> 47
ttggaggaaa aattggaacg aaaaaaaaag aaaaaaaaga atatttgggt tataattata 60
cctatcttaa tttttattac ccttatagga gcaggggctt atgccttaag aaattcactt 120
attcctactg atcatacgaa aacaaatagt tcggatcaac cgcccaaaac ttcggcttcc 180
aacggttatg tagaacaaaa aggcgaagaa gctgccgtag gtagtatagc acttgtagat 240
gatgctggtg taccagagtg ggttaaagtt ccctcaaagg taaatctaga taaatttact 300
gatttatcta cgaataatat cactatttat cgaattaaca atccggaagt cttaaaaaca 360
gttaccaatc gtacagatca acggatgaaa atgtcagaag ttatagctaa gtatcctaat 420
gctttgatta tgaatgcttc cgcatttgat atgcagacag gacaagtagt tggatttcaa 480
attaataatg gaaagttgat tcaagactgg agcccaggta caacgactca atatgctttt 540
gttattaaca aagatggttc gtgcaaaatt tatgattcaa gtacacctgc ttcaactatt 600
attaaaaacg gagggcaaca agcctatgat ttttatggta ctgcaattat ccgtgatggt 660
aaaattcaac caagtgatgg ctcagtagat tggaagatcc atatttttat tgcgaatgat 720
aaagataata atctctatgc tattttgagt gatacaaatg caggttatgg taatataatg 780
aagtcagtgt caaatttgaa gctccaaaat atgttattac ttgatagtgg cggctcaagt 840
caactatctg tcaatggtaa aacgattgct gctagtcaag acgatcgagc cgtaccggat 900
tatattgtga tgaaataa 918
<210> 48
<211> 305
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsL
<400> 48
Leu Glu Glu Lys Leu Glu Arg Lys Lys Lys Lys Lys Lys Asn Ile Trp
1 5 10 15
Val Ile Ile Ile Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly
20 25 30
Ala Tyr Ala Leu Arg Asn Ser Leu Ile Pro Thr Asp His Thr Lys Thr
35 40 45
Asn Ser Ser Asp Gln Pro Pro Lys Thr Ser Ala Ser Asn Gly Tyr Val
50 55 60
Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp
65 70 75 80
Asp Ala Gly Val Pro Glu Trp Val Lys Val Pro Ser Lys Val Asn Leu
85 90 95
Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile
100 105 110
Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg
115 120 125
Met Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Ala Leu Ile Met
130 135 140
Asn Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Val Gly Phe Gln
145 150 155 160
Ile Asn Asn Gly Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Thr
165 170 175
Gln Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp
180 185 190
Ser Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Gln Gln Ala
195 200 205
Tyr Asp Phe Tyr Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro
210 215 220
Ser Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp
225 230 235 240
Lys Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr
245 250 255
Gly Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu
260 265 270
Leu Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr
275 280 285
Ile Ala Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met
290 295 300
Lys
305
<210> 49
<211> 903
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative LytR family transcriptional regulator protein of DSM 33136
<400> 49
atgaatcaaa aaaagaggca tcattatcgt aagaaaaaac acacagtact aaaagttatt 60
tcaattattt ttgtattagt aattatcgct gttgcttcta tagcctacgc cgcttataga 120
aatgttgaat caacattttc aacatcatat gaaaatttcc ctaaaacaac aagtatcgac 180
ttaaaaaagt ctaaaacatt caccacactt atcattgcaa ctggtaaaaa taattctaaa 240
aatacagctt atgctactgt tttagcttca acgaatgtaa agacaaatca aactactttc 300
atgaacttcc cagtttttgc gacaatgcct aatcaaaaaa caatcactga agtttacaat 360
acgaatggag atgatggaat tttccagatg gttaaagacc tattgaatat gtccattaat 420
aaagtaattc agattgatgt taataaaatg ggatcacttg tacaggccac tggtggaatc 480
accatgcaaa atccaaaggc attcaatgct gaaggttatg agtttaaaca aggaactgtt 540
aatttacaaa ctgctgatca agtccaagcc tatatgacac aaattgacga tactgatttg 600
gatgcttcaa tcactcgaat tcaaaatgtc tcaatggaac tctacggaaa tattcaaaaa 660
attgctcata tgaaaaaact tgaaagtttc aattactatc gagaaattct ctatgctttt 720
tcaaacactg ttaaaaccaa tataagtttc aatgatgcta aaacgatcgt tatgagctac 780
aataaggctc taaagaatac cagcaagctc aatctacata caacagatga aaatggagct 840
aaggtcgttt ctcaaacaga attagactca gtcaaaaccc tttttgaaaa atctctaaaa 900
taa 903
<210> 50
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_lytR
<400> 50
Met Asn Gln Lys Lys Arg His His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Met Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 51
<211> 1161
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative polysaccharide pyruvyl transferase family protein of DSM 33136
<400> 51
atgaaaaatt atgtaaatgg ttggtggaaa accaatttag gagatgatct tttcttacat 60
ataatatgtg aacgttatag aaatcaaagt ttttttataa cttgtgaaaa agaggagatg 120
actgtttttc aacatttgaa taacttaaaa attattgaag agaataaaag ttctgtcgtt 180
attaaattgt tttcgaagtt tttaagagta atgtattttt ggataccatt aagtgttttg 240
aaagaatgga tagaattatt ctttttaaga aaaagaggaa tatctgataa aaatgtagtg 300
gttattgaaa ttgggggctc catttttatg atgccaaaaa gaaaagatat ctcaatgaca 360
gaggggtatt ttttacgtaa tattgaatta aaaaactttc ctaactatta tgtagtagga 420
agtaattttg gtccatttta ttttcaagag caagtagata aatataaaga actattttct 480
aaaatgcagg atgtatgttt cagagataca tattcaaaaa aattatttcc aaatctagat 540
acagttagaa gcgcgacaga tgttgtcatg agtttaagga tagaggatta tcaacaaatc 600
ccagagaaaa aacaaattat aatttctgtg attgatgtat tatctaaaga agatacagga 660
ttagagggta aacatcattt tgcaaataaa tatgaaaaat ttattttatc tgtaacagaa 720
gattatgtaa agaaaggtta taaagtagta ttgttttctt tttgtgattt ccagaatgat 780
catttatttt ctcagaaaat tttcaatcag ttaaataaga caataagatc taatgttgaa 840
cttttttctc ataaacaaat aaataaatca ctctctaaga ttgctgaaag tgaaaaaatc 900
atagcgacca gatttcacgc aatgatttta ggatggttat ttcaaaagcc tacttttgta 960
atctcctaca gtcaaaagac aactcaagtg attgaaaata gctttaataa gcaaacattt 1020
gttgactata ataaggtaga aaagttgaat cttaataata tggatgatta ttttgttaaa 1080
attgatgatt tgaccaaaga atatttaatt aatgatgcgc aaaatcaatt tagagggtta 1140
gactcattat tgcaatcata a 1161
<210> 52
<211> 386
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_polysaccharide_pyruvyl_transferase
<400> 52
Met Lys Asn Tyr Val Asn Gly Trp Trp Lys Thr Asn Leu Gly Asp Asp
1 5 10 15
Leu Phe Leu His Ile Ile Cys Glu Arg Tyr Arg Asn Gln Ser Phe Phe
20 25 30
Ile Thr Cys Glu Lys Glu Glu Met Thr Val Phe Gln His Leu Asn Asn
35 40 45
Leu Lys Ile Ile Glu Glu Asn Lys Ser Ser Val Val Ile Lys Leu Phe
50 55 60
Ser Lys Phe Leu Arg Val Met Tyr Phe Trp Ile Pro Leu Ser Val Leu
65 70 75 80
Lys Glu Trp Ile Glu Leu Phe Phe Leu Arg Lys Arg Gly Ile Ser Asp
85 90 95
Lys Asn Val Val Val Ile Glu Ile Gly Gly Ser Ile Phe Met Met Pro
100 105 110
Lys Arg Lys Asp Ile Ser Met Thr Glu Gly Tyr Phe Leu Arg Asn Ile
115 120 125
Glu Leu Lys Asn Phe Pro Asn Tyr Tyr Val Val Gly Ser Asn Phe Gly
130 135 140
Pro Phe Tyr Phe Gln Glu Gln Val Asp Lys Tyr Lys Glu Leu Phe Ser
145 150 155 160
Lys Met Gln Asp Val Cys Phe Arg Asp Thr Tyr Ser Lys Lys Leu Phe
165 170 175
Pro Asn Leu Asp Thr Val Arg Ser Ala Thr Asp Val Val Met Ser Leu
180 185 190
Arg Ile Glu Asp Tyr Gln Gln Ile Pro Glu Lys Lys Gln Ile Ile Ile
195 200 205
Ser Val Ile Asp Val Leu Ser Lys Glu Asp Thr Gly Leu Glu Gly Lys
210 215 220
His His Phe Ala Asn Lys Tyr Glu Lys Phe Ile Leu Ser Val Thr Glu
225 230 235 240
Asp Tyr Val Lys Lys Gly Tyr Lys Val Val Leu Phe Ser Phe Cys Asp
245 250 255
Phe Gln Asn Asp His Leu Phe Ser Gln Lys Ile Phe Asn Gln Leu Asn
260 265 270
Lys Thr Ile Arg Ser Asn Val Glu Leu Phe Ser His Lys Gln Ile Asn
275 280 285
Lys Ser Leu Ser Lys Ile Ala Glu Ser Glu Lys Ile Ile Ala Thr Arg
290 295 300
Phe His Ala Met Ile Leu Gly Trp Leu Phe Gln Lys Pro Thr Phe Val
305 310 315 320
Ile Ser Tyr Ser Gln Lys Thr Thr Gln Val Ile Glu Asn Ser Phe Asn
325 330 335
Lys Gln Thr Phe Val Asp Tyr Asn Lys Val Glu Lys Leu Asn Leu Asn
340 345 350
Asn Met Asp Asp Tyr Phe Val Lys Ile Asp Asp Leu Thr Lys Glu Tyr
355 360 365
Leu Ile Asn Asp Ala Gln Asn Gln Phe Arg Gly Leu Asp Ser Leu Leu
370 375 380
Gln Ser
385
<210> 53
<211> 780
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsC gene of DSM 33136
<400> 53
atgcaggaaa cacaggaaca aacgattgat ttaagaggga tttttaaaat tattcgcaaa 60
aggttaggtt taatattatt tagtgcttta atagtcacaa tattagggag catctacaca 120
ttttttatag cctccccagt ttacacagcc tcaactcaac ttgtcgttaa actaccaaat 180
tcggataatt tagcagccta cgctggacaa gtaaccggga atattcaaat ggcgaacaca 240
attaaccaag ttattgttag tccagtcatt ttagataaag ttcaaagtaa tttaaatcta 300
tctgatgatt ctttccaaaa acaagttaca gcagcaaatc aaacaaattc acaagtcatt 360
acgcttactg ttaaatattc taatccttac gttgctcaaa agattgcaga cgagactgct 420
aaaatattta gttcagatgc agcaaaacta ttgaatatta ctaacgttaa tattctatcc 480
aaagcaaaag ctcaaacaac acccattagt cctaaaccta aattgtattt agcaatatct 540
gttatagccg gattagtttt aggtttagcc attgctttat tgaaggaatt gtttgataac 600
aaaattaata aagaagaaga tattgaagct ctgggactca cggttcttgg tgtaacaacc 660
tatgctcaaa tgagtgattt taataataat acgaataaaa atggcacgca atcgggaact 720
aagtcaagtc cgcctagcga ccatgaagta aatagatcat caaaaaggaa taaaagatag 780
<210> 54
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsC
<400> 54
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Leu
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Val Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Ile Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Asn Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 55
<211> 687
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsE gene of DSM 33136
<400> 55
atggaagttt ttgaagatac ctcatcacct gaaccgaaag aagaaaagtt agtagaacta 60
aaaaaatttt ctcacagaga aatatttatt aaaagaggaa ttgacatttt agggggatta 120
gtgggttcaa ttttgtttct tattgcggct gcattgcttt atgtccctta caaaatgagc 180
tcggaaaaag atcaagggcc aatgttctat aaacaaaaac ggtatggaaa aaacggtaaa 240
attttttata ttttaaaatt tagaacaatg attcttaatg ctgagcagta tctagagcta 300
cattcagaag ttaaagccgc ctatcatgcc aatggtaata aactagaaaa tgatccacgg 360
gtaacgaaga ttggttcatt tattagacaa tactcagttg atgaattacc acaatttatc 420
aatgtcctta aaggagatat ggcattagtc ggtccaaggc caattcaaca gtttgaagcg 480
aaagaatttg gggagcgcct cccttattta ctgatatgta aacctggaat tactggttat 540
tggacaacac atggtcgcag taaagctcct tttcctcaac gagcagattt agaactctat 600
tatctccaat atcacagcac caagaatgat atcaagcttc ttatgcttac aattgcacaa 660
attattcacg gatcggacgc atattaa 687
<210> 56
<211> 228
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33136_epsE
<400> 56
Met Glu Val Phe Glu Asp Thr Ser Ser Pro Glu Pro Lys Glu Glu Lys
1 5 10 15
Leu Val Glu Leu Lys Lys Phe Ser His Arg Glu Ile Phe Ile Lys Arg
20 25 30
Gly Ile Asp Ile Leu Gly Gly Leu Val Gly Ser Ile Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu His Ser Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln Tyr Ser Val Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Gln Gln Phe Glu Ala
145 150 155 160
Lys Glu Phe Gly Glu Arg Leu Pro Tyr Leu Leu Ile Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Ala Pro Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Met Leu Thr Ile Ala Gln Ile Ile His Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 57
<211> 318
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsR gene of DSM 33139
<400> 57
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataa 318
<210> 58
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsR
<400> 58
Met Asp Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 59
<211> 768
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsX gene of DSM 33139
<400> 59
atgatgaaaa aaggaatttt tgtaattact atagtgatat ctatagcatt tataattgta 60
ggtttttata gttataattc taggataaat aatctttcaa aagctgataa aggaaaagaa 120
gttgtaaaaa atagcagtga aaaaaatcag atagacctta cctataaaaa gtattataaa 180
aatttaccaa aatcagttca aaataaaata gatgatattt catccaaaaa taaagaagtt 240
actttaactt gtatttggca atctgattca gttatttctg aacaatttca acaaaactta 300
caaaaatatt atggaaataa gttttggaac atcaaaaata tcacttacaa tggcgaaact 360
agtgaacaat tattggctga aaaagttgaa aaccaagtat tagccactaa tcctgatgtt 420
gttttatatg aagctccact ttttaatgat aaccaaaaca ttgaagcaac agcctcactg 480
actagtaatg agcaacttat aacaaatttg gctagtgcag gagcggaggt aatagttcaa 540
ccctctccac cgatctatgg tggtgttgtg taccccgtac aagaagaaca atttaaacaa 600
tctttatcta caaagtatcc ctatatagac tactgggcta gttacccaga caaaaattct 660
gatgaaatga aggggctgtt ttctgatgat ggagtatata gaacattaaa tgcttcgggg 720
aataaggttt ggctagatta tattactaaa tattttacag caaactaa 768
<210> 60
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsX
<400> 60
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Phe Ile Ile Val Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Leu
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Ala Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 61
<211> 765
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsB gene of DSM 33139
<400> 61
atgattgata ttcattgcca tattttaccg gggatagatg atggagctaa aacttctgga 60
gatactctga caatgctgaa atcagcaatt gatgaaggga taacaactat cactgcgact 120
cctcatcata atcctcaatt taataatgaa tcaccgctta ttttgaaaaa agttaaggaa 180
gttcaaaata tcattgacga acatcaatta ccaattgaag ttttacccgg acaagaggtg 240
agaatatatg gtgatttatt aaaagaattt tctgaaggaa agttactgac agcagcgggc 300
acttcaagtt atatattgat tgaatttcca tcaaatcatg tgccagctta tgctaaagaa 360
cttttttata atattcaatt ggagggactt caacctattt tggttcaccc tgaacgtaat 420
agtggaatca ttgagaaccc agatatatta tttgattttg ttgaacaagg agtactaagt 480
cagataacag cttcgagtgt cactggtcat tttggtaaaa aaatacaaaa gctgtcattt 540
aaaatgatag aaaaccatct gacgcatttt gttgcatcag atgcgcataa tgttacgtca 600
cgtgcattta agatgaagga agcttttgaa atgattgaag atagttatgg ttctgatgta 660
tcacgaatgt ttcaaaataa tgcagagtca gtgattttaa acgaaagttt ttatcaagaa 720
aaaccaacaa agatcaaaac aaagaaattt ttaggattat tttaa 765
<210> 62
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsB
<400> 62
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Val Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Met Ile Glu Asp Ser Tyr Gly Ser Asp Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 63
<211> 696
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsD gene of DSM 33139
<400> 63
atggctaaaa ataaaagaag catagacaat aatcgttata ttattaccag tgtcaatcct 60
caatcaccta tttccgaaca atatcgtacg attcgtacga ccattgattt taaaatggcg 120
gatcaaggga ttaaaagttt tctagtaaca tcttcagaag cagctgcagg taaatcaaac 180
gagagtgcta atctagctgt tgcttttgca caacaaggta aaaaagtact tttaattgat 240
ggcgatcttc gtaaaccgac tgttaacatt acttttaaag tacaaaatag agtaggatta 300
accaatattt taatgcatca atcttcgatt gaagatgcca tacaagggac aagactttct 360
gaaaatctta caataattac ctctggtcca attccaccta atccatcgga attattagca 420
tctagtgcaa tgaagaattt gattgactct gtgtccgatt cctttgatat tgttttgatt 480
gatactccac ctctctatgc agttactgat gctcaaattt tgagtgtata tgtaggagga 540
gtggttcttg ttgtacgtgc ctatgaaaca aaaaaagaga gtttagcaaa aacaaaaaaa 600
atactggaac aagttaatgc aaatatatta ggagttgttt tgcatggggt agactcttct 660
gagtcaccgt cgtattacta ctacggagta gagtaa 696
<210> 64
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsD
<400> 64
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Asn Glu Ser Ala Asn
50 55 60
Leu Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Ser Phe Asp Ile Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Tyr Ala Val Thr Asp Ala Gln Ile Leu Ser Val
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 65
<211> 1125
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT1 protein of DSM 33139
<400> 65
atggaagaaa aaatcagact tgcggttttc ggacagaaac gtctttctcg tgaaggcgga 60
atagagattg ttgtaaaaga gttatgcacc cgaatggcac agaagggttg tgatgtaact 120
tgctacaata gagcaggcca tcatgtgagt ggtgcagagt atgacaaaac aattgaatat 180
gatggtatcc gtcaaaaggt tgttccgact attgagaaga agggacttgc ggcggtaagc 240
tcctcctttt tcgcagcact ttgtagtgca tttggaagat acgatgtggt gcatatccat 300
gcggaaggtc ctgccttttt ctgttggata ccaaaacttt ttggcaagcg tgtaatcagc 360
actattcacg gtttggactg ggcccgcgaa aaatggaaat ttggcgttgg atctaaattt 420
atccggcagg gtgaaaaaaa tgccgtgaaa tatgcagatg aaatcattgt tctaagcaaa 480
ggcgttcaga aatatttcat ggagacctac ggaagggaga cacattttat ccctaatagt 540
gtcaatcggc cagaggttcg ggaggcaaag ctgatcacgg atcattttgg actggaaaag 600
gattcctaca tactgttcct cggtcgtctg gtgccggaga aggggattcg atatctggtt 660
gaggcattca agaatgtcaa gacagataaa aaactggtca tcgcaggtgg ctctagtgat 720
acggattcct ttatggagga attgaaagaa ctggcgaagg gtgacgatcg gattctcttt 780
actgggtttg tgcagggagc aatgctggat gaactgtaca gcaacgctta catctacacg 840
ctgccgtccg atctggaagg aatgccatta agtctgctgg aggcgatgag ctacggaaat 900
tgctgtctgg tatccgatat tccagaatgt gcagaggttg tggaagataa ggcattgatt 960
ttcaaaaagt cagatgtaga ggacttgcga gaaaaattgc aagatgcctg tgaccatcca 1020
gaaatggtta taagaatgaa gaatcaggca gctgacttta tctgcgagaa atacaactgg 1080
gataaagttg taaaggaaac gatgaaactg tacaggagaa aataa 1125
<210> 66
<211> 374
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT1
<400> 66
Met Glu Glu Lys Ile Arg Leu Ala Val Phe Gly Gln Lys Arg Leu Ser
1 5 10 15
Arg Glu Gly Gly Ile Glu Ile Val Val Lys Glu Leu Cys Thr Arg Met
20 25 30
Ala Gln Lys Gly Cys Asp Val Thr Cys Tyr Asn Arg Ala Gly His His
35 40 45
Val Ser Gly Ala Glu Tyr Asp Lys Thr Ile Glu Tyr Asp Gly Ile Arg
50 55 60
Gln Lys Val Val Pro Thr Ile Glu Lys Lys Gly Leu Ala Ala Val Ser
65 70 75 80
Ser Ser Phe Phe Ala Ala Leu Cys Ser Ala Phe Gly Arg Tyr Asp Val
85 90 95
Val His Ile His Ala Glu Gly Pro Ala Phe Phe Cys Trp Ile Pro Lys
100 105 110
Leu Phe Gly Lys Arg Val Ile Ser Thr Ile His Gly Leu Asp Trp Ala
115 120 125
Arg Glu Lys Trp Lys Phe Gly Val Gly Ser Lys Phe Ile Arg Gln Gly
130 135 140
Glu Lys Asn Ala Val Lys Tyr Ala Asp Glu Ile Ile Val Leu Ser Lys
145 150 155 160
Gly Val Gln Lys Tyr Phe Met Glu Thr Tyr Gly Arg Glu Thr His Phe
165 170 175
Ile Pro Asn Ser Val Asn Arg Pro Glu Val Arg Glu Ala Lys Leu Ile
180 185 190
Thr Asp His Phe Gly Leu Glu Lys Asp Ser Tyr Ile Leu Phe Leu Gly
195 200 205
Arg Leu Val Pro Glu Lys Gly Ile Arg Tyr Leu Val Glu Ala Phe Lys
210 215 220
Asn Val Lys Thr Asp Lys Lys Leu Val Ile Ala Gly Gly Ser Ser Asp
225 230 235 240
Thr Asp Ser Phe Met Glu Glu Leu Lys Glu Leu Ala Lys Gly Asp Asp
245 250 255
Arg Ile Leu Phe Thr Gly Phe Val Gln Gly Ala Met Leu Asp Glu Leu
260 265 270
Tyr Ser Asn Ala Tyr Ile Tyr Thr Leu Pro Ser Asp Leu Glu Gly Met
275 280 285
Pro Leu Ser Leu Leu Glu Ala Met Ser Tyr Gly Asn Cys Cys Leu Val
290 295 300
Ser Asp Ile Pro Glu Cys Ala Glu Val Val Glu Asp Lys Ala Leu Ile
305 310 315 320
Phe Lys Lys Ser Asp Val Glu Asp Leu Arg Glu Lys Leu Gln Asp Ala
325 330 335
Cys Asp His Pro Glu Met Val Ile Arg Met Lys Asn Gln Ala Ala Asp
340 345 350
Phe Ile Cys Glu Lys Tyr Asn Trp Asp Lys Val Val Lys Glu Thr Met
355 360 365
Lys Leu Tyr Arg Arg Lys
370
<210> 67
<211> 1296
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzy gene of DSM 33139
<400> 67
atgcacttat atgttagtaa aaaagctata ttacagtatt taatgctgta tgtcatgttg 60
attttttgtc aaacacatgt gtatagatta tatattagat caaatttgac gatacatgtt 120
ggtttagtaa tattattttt aataattggt gtagtcaatt ttaggaaaaa aacaaaacga 180
ccttttttga tgtgtgcttt tttgttagca atggttatag gggttcgttt tattaatggt 240
ggtgtaggaa ttgatttttg ggctgaaatg gccgcaaaaa tactaattac atatattgct 300
atattgattg atcctgaaca gtttttaacc agatttgtaa agattataac gttttttgcg 360
gcaattagta ttgtgggatg gctgcaacag attgctggtt tgaatattat gcaaaaaatc 420
ggcatggtaa ataacgattt ttacacaaca gtaacatggg ataaaggtta tgttgaagaa 480
actcagcgta agatttatgg gttgttgttt tatgtgacga cagatgtaga aattaaacgt 540
aatatgagta ttttcacgga gcctggtata tatcaaatgg tattgaatgc tgccattttt 600
gtagtagcgt tttgtaataa actcattgaa ttaaatccta aagaaattaa aaaatatttt 660
ttaattctaa caattgcatt aattacaacg cagtcgacat ctggatattt tggatatgcg 720
gtaatagttc ttggtattct tttgacacga agtgcagaca caaggacaat taagagctat 780
atttatatta ttttagtggt tggatttgta gtgctattgg gagattattc cgtaagagga 840
aatgatagtt tgatttacag ggcattgtta tcgaaagtgt tttctagtca gggagacttt 900
tcattttccg cttcgacggg agtttatcgt tatgaaatga ttgggatggc attgttggct 960
atggcaatga atccatttgg tatgggctat gaagcttggg caaaattgta tcgtttgaac 1020
tcattcgctg atgccggtgg atatccgttt attattggag cggtaattgg tattgtacca 1080
ctttttgtat cattatggtg gatatttagt ccgcttaagt atatgaaaaa taaatgggta 1140
gaaattgtag tatttttatt tttatatttt aatacagcta tggcacaaac aagtgcattt 1200
tatcctgcaa taattttttt gcctgtattt cttgatatta tgagacagaa cattactgat 1260
ctggagttgg agaaagagta tgcttatgaa ctataa 1296
<210> 68
<211> 431
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_wzy
<400> 68
Met His Leu Tyr Val Ser Lys Lys Ala Ile Leu Gln Tyr Leu Met Leu
1 5 10 15
Tyr Val Met Leu Ile Phe Cys Gln Thr His Val Tyr Arg Leu Tyr Ile
20 25 30
Arg Ser Asn Leu Thr Ile His Val Gly Leu Val Ile Leu Phe Leu Ile
35 40 45
Ile Gly Val Val Asn Phe Arg Lys Lys Thr Lys Arg Pro Phe Leu Met
50 55 60
Cys Ala Phe Leu Leu Ala Met Val Ile Gly Val Arg Phe Ile Asn Gly
65 70 75 80
Gly Val Gly Ile Asp Phe Trp Ala Glu Met Ala Ala Lys Ile Leu Ile
85 90 95
Thr Tyr Ile Ala Ile Leu Ile Asp Pro Glu Gln Phe Leu Thr Arg Phe
100 105 110
Val Lys Ile Ile Thr Phe Phe Ala Ala Ile Ser Ile Val Gly Trp Leu
115 120 125
Gln Gln Ile Ala Gly Leu Asn Ile Met Gln Lys Ile Gly Met Val Asn
130 135 140
Asn Asp Phe Tyr Thr Thr Val Thr Trp Asp Lys Gly Tyr Val Glu Glu
145 150 155 160
Thr Gln Arg Lys Ile Tyr Gly Leu Leu Phe Tyr Val Thr Thr Asp Val
165 170 175
Glu Ile Lys Arg Asn Met Ser Ile Phe Thr Glu Pro Gly Ile Tyr Gln
180 185 190
Met Val Leu Asn Ala Ala Ile Phe Val Val Ala Phe Cys Asn Lys Leu
195 200 205
Ile Glu Leu Asn Pro Lys Glu Ile Lys Lys Tyr Phe Leu Ile Leu Thr
210 215 220
Ile Ala Leu Ile Thr Thr Gln Ser Thr Ser Gly Tyr Phe Gly Tyr Ala
225 230 235 240
Val Ile Val Leu Gly Ile Leu Leu Thr Arg Ser Ala Asp Thr Arg Thr
245 250 255
Ile Lys Ser Tyr Ile Tyr Ile Ile Leu Val Val Gly Phe Val Val Leu
260 265 270
Leu Gly Asp Tyr Ser Val Arg Gly Asn Asp Ser Leu Ile Tyr Arg Ala
275 280 285
Leu Leu Ser Lys Val Phe Ser Ser Gln Gly Asp Phe Ser Phe Ser Ala
290 295 300
Ser Thr Gly Val Tyr Arg Tyr Glu Met Ile Gly Met Ala Leu Leu Ala
305 310 315 320
Met Ala Met Asn Pro Phe Gly Met Gly Tyr Glu Ala Trp Ala Lys Leu
325 330 335
Tyr Arg Leu Asn Ser Phe Ala Asp Ala Gly Gly Tyr Pro Phe Ile Ile
340 345 350
Gly Ala Val Ile Gly Ile Val Pro Leu Phe Val Ser Leu Trp Trp Ile
355 360 365
Phe Ser Pro Leu Lys Tyr Met Lys Asn Lys Trp Val Glu Ile Val Val
370 375 380
Phe Leu Phe Leu Tyr Phe Asn Thr Ala Met Ala Gln Thr Ser Ala Phe
385 390 395 400
Tyr Pro Ala Ile Ile Phe Leu Pro Val Phe Leu Asp Ile Met Arg Gln
405 410 415
Asn Ile Thr Asp Leu Glu Leu Glu Lys Glu Tyr Ala Tyr Glu Leu
420 425 430
<210> 69
<211> 1050
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT2 protein of DSM 33139
<400> 69
atgaaaatac tatttcacat aagttcactt tttggtggtg gagctgaaag agttatgtcc 60
tatttgatta atcacaattg tgaactaaat aatgaagttt atattgtggt ttgttatgag 120
aaggaaggtg agtattatat ttcacctaaa gcaaaaaaaa ttgtaatagg aacacatagc 180
attgtttctc aatctctaga attaagaaag acaataaaaa gaattaaacc agatatatgt 240
gtaagtttta tgcaaggagg aaatatcaga ctttctctag cttgcatggg tttaaaacaa 300
aaatatattc tgtcagtaag aaatgatcct aaaaaagaat atccaaatgc gattatgcaa 360
aaattggtgc ggtattggtt tgatgcagct gacggagttg tgtttcaaac agaagatgca 420
aaaaagtttt tctgcagtag cgtgcagaat aaatcgagta taatatacaa tcctgtatca 480
aaccagtttt ttatagagaa tgtgtcaaaa gatactactg gaatagtagc gttcggtcgt 540
ctggttgatc agaaaaactt tgcgatgtta ataaaagcat atgccgtgat agcagataga 600
atcgatgatg atttatatat atacggcgaa ggccccttag agaaaaggtt atatgaaatc 660
attaattcaa ccggacttgc aagtcggatt cacttaatgg gccggactaa taatgtgcca 720
gaagtactta aaacagcaaa agtgtatgct ttaagctcgg attttgaagg tatgcctaat 780
gcattgctgg aagctgtatg tatgcttgtg ccggtagttt ctactgattg cccttgcggt 840
gggccaaaag agatttgtaa caatgcgtgt ggtctattga gcccagttgg ggatgtagaa 900
tcttttgcaa ataatcttta caaagtttcc cacagcgaac aattaagaga acaattagtc 960
aaaaaatgta ttgaacgaag agctgcgttt tcaaatgatt ctatactgaa aaagtgggat 1020
gcattttttg ataaagtgtg tggatgctaa 1050
<210> 70
<211> 349
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT2
<400> 70
Met Lys Ile Leu Phe His Ile Ser Ser Leu Phe Gly Gly Gly Ala Glu
1 5 10 15
Arg Val Met Ser Tyr Leu Ile Asn His Asn Cys Glu Leu Asn Asn Glu
20 25 30
Val Tyr Ile Val Val Cys Tyr Glu Lys Glu Gly Glu Tyr Tyr Ile Ser
35 40 45
Pro Lys Ala Lys Lys Ile Val Ile Gly Thr His Ser Ile Val Ser Gln
50 55 60
Ser Leu Glu Leu Arg Lys Thr Ile Lys Arg Ile Lys Pro Asp Ile Cys
65 70 75 80
Val Ser Phe Met Gln Gly Gly Asn Ile Arg Leu Ser Leu Ala Cys Met
85 90 95
Gly Leu Lys Gln Lys Tyr Ile Leu Ser Val Arg Asn Asp Pro Lys Lys
100 105 110
Glu Tyr Pro Asn Ala Ile Met Gln Lys Leu Val Arg Tyr Trp Phe Asp
115 120 125
Ala Ala Asp Gly Val Val Phe Gln Thr Glu Asp Ala Lys Lys Phe Phe
130 135 140
Cys Ser Ser Val Gln Asn Lys Ser Ser Ile Ile Tyr Asn Pro Val Ser
145 150 155 160
Asn Gln Phe Phe Ile Glu Asn Val Ser Lys Asp Thr Thr Gly Ile Val
165 170 175
Ala Phe Gly Arg Leu Val Asp Gln Lys Asn Phe Ala Met Leu Ile Lys
180 185 190
Ala Tyr Ala Val Ile Ala Asp Arg Ile Asp Asp Asp Leu Tyr Ile Tyr
195 200 205
Gly Glu Gly Pro Leu Glu Lys Arg Leu Tyr Glu Ile Ile Asn Ser Thr
210 215 220
Gly Leu Ala Ser Arg Ile His Leu Met Gly Arg Thr Asn Asn Val Pro
225 230 235 240
Glu Val Leu Lys Thr Ala Lys Val Tyr Ala Leu Ser Ser Asp Phe Glu
245 250 255
Gly Met Pro Asn Ala Leu Leu Glu Ala Val Cys Met Leu Val Pro Val
260 265 270
Val Ser Thr Asp Cys Pro Cys Gly Gly Pro Lys Glu Ile Cys Asn Asn
275 280 285
Ala Cys Gly Leu Leu Ser Pro Val Gly Asp Val Glu Ser Phe Ala Asn
290 295 300
Asn Leu Tyr Lys Val Ser His Ser Glu Gln Leu Arg Glu Gln Leu Val
305 310 315 320
Lys Lys Cys Ile Glu Arg Arg Ala Ala Phe Ser Asn Asp Ser Ile Leu
325 330 335
Lys Lys Trp Asp Ala Phe Phe Asp Lys Val Cys Gly Cys
340 345
<210> 71
<211> 1179
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT3 protein of DSM 33139
<400> 71
atggagggaa aaatgagtaa caaaacgaaa atgttattta ttatgaattc tcttaatttt 60
ggaggcgcag aaaaagcact tgtcaatttg tttgatacta tgaattacga tgtctatgaa 120
attgatttgc ttttgctttc agatgaagga aagttgctta gtatagtaaa caataaggtg 180
aatataatac acccagatct aattacgcgt aacctttatg gagaacattg taactttata 240
tttaaatttc ctaagattat atttactggt agccaatatt tgattacaag gaacagatcc 300
tatgtaaatc aattgaggtg gaagaagttt tataaaaaaa taattccgca gttacaaact 360
aaatacgatg tggctgtttc ctttcttcag ggagatccgt tgtattattt agttgataag 420
gtattcgcaa aaaaaaagat tgcttgggtg cataatgact atcgtatgac ccagtgcaat 480
agtatatttg atttgaaata ttttgaacaa gtcaatcaag ttgtaacaat ttcaaatatt 540
tgtctagata tacttaaaga aatttttcca tcggtaccat caatgttttt gcctaacatt 600
gttaattcca catcaatcaa ttcatatgca gaaggacaac caaatgaata tgatggagtt 660
aaatcaaaaa aattacttac gattggaagg ttaaaccctc aaaaaggata tgattttctt 720
ttagaaattg cagcatattt gaaagaaata aaatatgatt ttaagtggta cattatcgga 780
gagggagaac ttaaggaaca actattagcg gagtggagag aaaaaaagct agaagattgt 840
gtctatttta ttggtacaag agaaaatcca tatccatata ttaagcatgc agatgtagta 900
gtacaaacaa gtcgttacga aggaaaatct atagtgttgg atgaggccaa aattctcaat 960
aaacttattg tttgtaccaa ttatgatact gtgaaagatc agttaattga tggaaaagag 1020
gggattatct cttcttttga agttaaagag tttgcggaaa gtatcatagg attattagcg 1080
gatgataaca aaatgaatga gattgtcaat tacttatcat cgcatgaata tggaaatgag 1140
aatatgatca aattatatga tcagttattt caggtataa 1179
<210> 72
<211> 392
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT3
<400> 72
Met Glu Gly Lys Met Ser Asn Lys Thr Lys Met Leu Phe Ile Met Asn
1 5 10 15
Ser Leu Asn Phe Gly Gly Ala Glu Lys Ala Leu Val Asn Leu Phe Asp
20 25 30
Thr Met Asn Tyr Asp Val Tyr Glu Ile Asp Leu Leu Leu Leu Ser Asp
35 40 45
Glu Gly Lys Leu Leu Ser Ile Val Asn Asn Lys Val Asn Ile Ile His
50 55 60
Pro Asp Leu Ile Thr Arg Asn Leu Tyr Gly Glu His Cys Asn Phe Ile
65 70 75 80
Phe Lys Phe Pro Lys Ile Ile Phe Thr Gly Ser Gln Tyr Leu Ile Thr
85 90 95
Arg Asn Arg Ser Tyr Val Asn Gln Leu Arg Trp Lys Lys Phe Tyr Lys
100 105 110
Lys Ile Ile Pro Gln Leu Gln Thr Lys Tyr Asp Val Ala Val Ser Phe
115 120 125
Leu Gln Gly Asp Pro Leu Tyr Tyr Leu Val Asp Lys Val Phe Ala Lys
130 135 140
Lys Lys Ile Ala Trp Val His Asn Asp Tyr Arg Met Thr Gln Cys Asn
145 150 155 160
Ser Ile Phe Asp Leu Lys Tyr Phe Glu Gln Val Asn Gln Val Val Thr
165 170 175
Ile Ser Asn Ile Cys Leu Asp Ile Leu Lys Glu Ile Phe Pro Ser Val
180 185 190
Pro Ser Met Phe Leu Pro Asn Ile Val Asn Ser Thr Ser Ile Asn Ser
195 200 205
Tyr Ala Glu Gly Gln Pro Asn Glu Tyr Asp Gly Val Lys Ser Lys Lys
210 215 220
Leu Leu Thr Ile Gly Arg Leu Asn Pro Gln Lys Gly Tyr Asp Phe Leu
225 230 235 240
Leu Glu Ile Ala Ala Tyr Leu Lys Glu Ile Lys Tyr Asp Phe Lys Trp
245 250 255
Tyr Ile Ile Gly Glu Gly Glu Leu Lys Glu Gln Leu Leu Ala Glu Trp
260 265 270
Arg Glu Lys Lys Leu Glu Asp Cys Val Tyr Phe Ile Gly Thr Arg Glu
275 280 285
Asn Pro Tyr Pro Tyr Ile Lys His Ala Asp Val Val Val Gln Thr Ser
290 295 300
Arg Tyr Glu Gly Lys Ser Ile Val Leu Asp Glu Ala Lys Ile Leu Asn
305 310 315 320
Lys Leu Ile Val Cys Thr Asn Tyr Asp Thr Val Lys Asp Gln Leu Ile
325 330 335
Asp Gly Lys Glu Gly Ile Ile Ser Ser Phe Glu Val Lys Glu Phe Ala
340 345 350
Glu Ser Ile Ile Gly Leu Leu Ala Asp Asp Asn Lys Met Asn Glu Ile
355 360 365
Val Asn Tyr Leu Ser Ser His Glu Tyr Gly Asn Glu Asn Met Ile Lys
370 375 380
Leu Tyr Asp Gln Leu Phe Gln Val
385 390
<210> 73
<211> 1446
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzx gene of DSM 33139
<400> 73
ttggagaata aaatggtaaa gaaaaaacta caaaatattc ctttgggagt aaaatcagca 60
gtggtatata ccatggcgtc tgttttttca agaggattat caatgattac agttccaatt 120
ttcacgagaa taatgtctac aagtgaaatt ggaatggtta atctatataa ttcctggtat 180
ttattattga atgtaattgc aactctatca ttaacatcag gtggatttca ggtagcaatg 240
aaagactttg agggggaaag ggatcaatat caatcgtctg ttttgacatt aacgtcaatg 300
atggccattt ttctaggctg tatttatttt ctcataccta atagttggaa taggattaca 360
gggctacctt ctgctttgat gattttaatg cttgtagtgt ttttctttgc acaggcgcag 420
gatttttggc tattgagaca aagatatgaa tataaatata aattagctgg tgcattgaca 480
atggggtcag ctttagcatc aacagtattg tctgttatag ttgttttaag cttaaataaa 540
gcaaattcag atcaaattgt agtggggcgc ttatatgcaa ctaacattgt atctatagct 600
atttcagcaa ttttatggat taaattatat gtaaaaggaa aaacgatagt aaatataaag 660
tactggaaat actctttgaa attgagtgtg cctcttattg gttatgcttt tgcggcacag 720
attttaagtg tttcagatcg aatgatgata agcaaaatgg ttggaaatga tgcagttgga 780
atatatagta ctttatatac tgtaagttca atttcactgt tagtttggac tgcaattaat 840
tcatcgttta taccgtattt atatcaaaat atagagaaaa aaggaaatag aataaaagaa 900
ttatcattag ctttaatggg ttcgtatgct attatagctg ttatgcttac ttttcttgca 960
ccagaaatag ttaaaatatt agctactaag gaatattatg aagcaattta tattatgcca 1020
cctattgcag caggtgtgtt tttgacttct gtgtcaaata tgtattctaa tttgctcata 1080
taccataaga aaactaatta tattatgtat tcttcgatta tcgcagctac tgtaaatctt 1140
atactcaact atatatgtat aaatgcattt ggctatatgg cagcagcata tactacactg 1200
atagcataca tagttttggc gggaacacaa gcaatgtttg cgagaaagat tcgctttaaa 1260
gagactgggg aaaaatctgt atataatgat aatgcggtat ttgttatggc gatattaacg 1320
ataatagtag ccttattcgg cttggtattg tatcgctata cttggttaag atatataatc 1380
atttgtactg gaatgattgc aggaataaaa atcgcgttca tgactttgaa aagaattaaa 1440
agttaa 1446
<210> 74
<211> 481
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_wzx
<400> 74
Leu Glu Asn Lys Met Val Lys Lys Lys Leu Gln Asn Ile Pro Leu Gly
1 5 10 15
Val Lys Ser Ala Val Val Tyr Thr Met Ala Ser Val Phe Ser Arg Gly
20 25 30
Leu Ser Met Ile Thr Val Pro Ile Phe Thr Arg Ile Met Ser Thr Ser
35 40 45
Glu Ile Gly Met Val Asn Leu Tyr Asn Ser Trp Tyr Leu Leu Leu Asn
50 55 60
Val Ile Ala Thr Leu Ser Leu Thr Ser Gly Gly Phe Gln Val Ala Met
65 70 75 80
Lys Asp Phe Glu Gly Glu Arg Asp Gln Tyr Gln Ser Ser Val Leu Thr
85 90 95
Leu Thr Ser Met Met Ala Ile Phe Leu Gly Cys Ile Tyr Phe Leu Ile
100 105 110
Pro Asn Ser Trp Asn Arg Ile Thr Gly Leu Pro Ser Ala Leu Met Ile
115 120 125
Leu Met Leu Val Val Phe Phe Phe Ala Gln Ala Gln Asp Phe Trp Leu
130 135 140
Leu Arg Gln Arg Tyr Glu Tyr Lys Tyr Lys Leu Ala Gly Ala Leu Thr
145 150 155 160
Met Gly Ser Ala Leu Ala Ser Thr Val Leu Ser Val Ile Val Val Leu
165 170 175
Ser Leu Asn Lys Ala Asn Ser Asp Gln Ile Val Val Gly Arg Leu Tyr
180 185 190
Ala Thr Asn Ile Val Ser Ile Ala Ile Ser Ala Ile Leu Trp Ile Lys
195 200 205
Leu Tyr Val Lys Gly Lys Thr Ile Val Asn Ile Lys Tyr Trp Lys Tyr
210 215 220
Ser Leu Lys Leu Ser Val Pro Leu Ile Gly Tyr Ala Phe Ala Ala Gln
225 230 235 240
Ile Leu Ser Val Ser Asp Arg Met Met Ile Ser Lys Met Val Gly Asn
245 250 255
Asp Ala Val Gly Ile Tyr Ser Thr Leu Tyr Thr Val Ser Ser Ile Ser
260 265 270
Leu Leu Val Trp Thr Ala Ile Asn Ser Ser Phe Ile Pro Tyr Leu Tyr
275 280 285
Gln Asn Ile Glu Lys Lys Gly Asn Arg Ile Lys Glu Leu Ser Leu Ala
290 295 300
Leu Met Gly Ser Tyr Ala Ile Ile Ala Val Met Leu Thr Phe Leu Ala
305 310 315 320
Pro Glu Ile Val Lys Ile Leu Ala Thr Lys Glu Tyr Tyr Glu Ala Ile
325 330 335
Tyr Ile Met Pro Pro Ile Ala Ala Gly Val Phe Leu Thr Ser Val Ser
340 345 350
Asn Met Tyr Ser Asn Leu Leu Ile Tyr His Lys Lys Thr Asn Tyr Ile
355 360 365
Met Tyr Ser Ser Ile Ile Ala Ala Thr Val Asn Leu Ile Leu Asn Tyr
370 375 380
Ile Cys Ile Asn Ala Phe Gly Tyr Met Ala Ala Ala Tyr Thr Thr Leu
385 390 395 400
Ile Ala Tyr Ile Val Leu Ala Gly Thr Gln Ala Met Phe Ala Arg Lys
405 410 415
Ile Arg Phe Lys Glu Thr Gly Glu Lys Ser Val Tyr Asn Asp Asn Ala
420 425 430
Val Phe Val Met Ala Ile Leu Thr Ile Ile Val Ala Leu Phe Gly Leu
435 440 445
Val Leu Tyr Arg Tyr Thr Trp Leu Arg Tyr Ile Ile Ile Cys Thr Gly
450 455 460
Met Ile Ala Gly Ile Lys Ile Ala Phe Met Thr Leu Lys Arg Ile Lys
465 470 475 480
Ser
<210> 75
<211> 912
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsL gene of DSM 33139
<400> 75
ttggagggaa acttggaacg caaaaagaaa aaaaagaata tttgggtgat aattatatct 60
atcttaattt ttattgccct tataggagca ggggcttatt ccttaagaaa tttacttatt 120
cctactaatc atccgaggac aaacagttcg gatcaaccta aaaaaacttc ggtctctaac 180
ggttatgtag agcaaaaagg tgaagaagct gccgtaggta gtacagcact tgtagatgat 240
actggtatac cagaatgggt taaagttccc tcaaaggtaa atctagataa atttactgat 300
ttatctacga ataatatcac tatttatcga attaacaatc cggaagtctt aaaaacagtt 360
accaatcgta cagatcaacg gatgaaaatg tcagaagtta tagctaagta tcctaatgct 420
ttgattatga atgcttccgc atttgatatg cagacaggac aagtagctgg atttcaaatt 480
aataatgaaa agttgattca agactggagt ccaggtacaa cgactcaata tgcttttgtt 540
attaataaag atggttcgtg caaaatttat gattcaagta cacctgcttc aactattatt 600
aaaaacggag ggcaacaagc ctatgatttt ggtactgcga ttatccgtga tggtaaaatt 660
caaccaagtg atggctcagt agattggaag attcatattt ttattgcgaa tgataaagat 720
aataatctct atgctatttt gagtgataca aatgcaggtt atgataatat aatgaaatca 780
gtgtcaaatt tgaagctcca aaatatgtta ttgcttgata gtggtggctc aagtcaacta 840
tctgtcaatg gtaaaacgat tgttgctagt caagatgatc gagccgtacc ggattatatt 900
gtgatgaaat aa 912
<210> 76
<211> 303
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsL
<400> 76
Leu Glu Gly Asn Leu Glu Arg Lys Lys Lys Lys Lys Asn Ile Trp Val
1 5 10 15
Ile Ile Ile Ser Ile Leu Ile Phe Ile Ala Leu Ile Gly Ala Gly Ala
20 25 30
Tyr Ser Leu Arg Asn Leu Leu Ile Pro Thr Asn His Pro Arg Thr Asn
35 40 45
Ser Ser Asp Gln Pro Lys Lys Thr Ser Val Ser Asn Gly Tyr Val Glu
50 55 60
Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Thr Ala Leu Val Asp Asp
65 70 75 80
Thr Gly Ile Pro Glu Trp Val Lys Val Pro Ser Lys Val Asn Leu Asp
85 90 95
Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile Asn
100 105 110
Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg Met
115 120 125
Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Ala Leu Ile Met Asn
130 135 140
Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln Ile
145 150 155 160
Asn Asn Glu Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Thr Gln
165 170 175
Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp Ser
180 185 190
Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Gln Gln Ala Tyr
195 200 205
Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser Asp
210 215 220
Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys Asp
225 230 235 240
Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp Asn
245 250 255
Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu Leu
260 265 270
Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile Val
275 280 285
Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 77
<211> 903
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative LytR family transcriptional regulator protein of DSM 33139
<400> 77
atgactcaaa aaaagaggcg ttattatcgt aagaaaaaac acacagtact aaaagttgtt 60
tcaattattt ttgcatttgt aattatcgct attgcttcta tagcctacgc agcttataga 120
aatgttgaat caacattttc aacatcatat gaaaatttcc ctaaaacaac aagtatcgac 180
ttaaaaaagt ctaaaacatt caccacactt atcattgcaa ctggtaaaaa taattctaaa 240
aatacagctt atgctactgt tttagcttca acgaatgtaa agacaaatca aactactttc 300
atgaacttcc cagtttttgc gacaatgcct aatcaaaaaa caatcactga agtttacaat 360
acgaatggag atgatggaat tttccagatg gttaaagacc tattgaatgt gtccattaac 420
aaagtaattc agattgatgt taataaaatg ggatcacttg tacaggccac tggtggaatc 480
accatgcaaa atccaaaggc attcaatgct gaaggttatg agtttaaaca aggaactgtt 540
aatttacaaa ctgctgatca agtccaagcc tatatgacac aaattgacga tactgatttg 600
gatgcttcaa tcactcggat tcaaaatgtc tcaatggaac tctacggaaa tattcaaaaa 660
atggctcata tgaaaaaact tgaaagtttc aattactatc gagaaattct ctatgctttt 720
tcaaacactg ttaaaaccaa tataagtttc gatgatgcta aaactatcgt tatgagctac 780
aataaggctc taaagaatac cagcaagctc aatctacata caacagatga aaatggagct 840
aaagtagttt ctcaaacaga attagactca gtcaaaactc tttttgaaaa atctctaaaa 900
taa 903
<210> 78
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_lytR
<400> 78
Met Thr Gln Lys Lys Arg Arg Tyr Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Val Ser Ile Ile Phe Ala Phe Val Ile Ile Ala Ile Ala
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Met Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asp Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 79
<211> 1251
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative nucleotide sugar dehydrogenase protein of DSM 33139
<400> 79
atgagagagt ttaaagattt aaagattgct gttgccggaa cgggatatgt tggtctttct 60
attgctacgc tgttatctca gcaccacaag gtgactgctg tggatatcat tcctgagaaa 120
gttgaactta tcaataataa gaaatctccg attcaggatg aatatattga aaagtatctg 180
gcagagaaag agctggatct gactgcgact ctggatgcta aggaagcata cagtgatgct 240
gattttgtag tgatcgcagc tcctacaaat tacgatagca agaagaactt ttttgacacg 300
agtgcggtag aagccgtcat taaactggtc atcgagtaca acccggaagc tatcatggtt 360
atcaagagca ctattccggt tggttataca gcaagcgttc gtgagaagtt ccactgtgac 420
aatattatct ttagcccgga gtttttgcgc gagagcaaag ctctgtatga taacctttat 480
ccttcccgta tcattgtcgg tacggatgtt gacaatgttc gactggtaaa ggcggcacac 540
acttttgcag agctcctgca ggaaggtgct attaaggaaa atatcgatac tctgtttatg 600
ggctttaccg aggcagaggc agttaagcta ttcgctaaca cttatttggc actgcgtgtc 660
agctacttca atgaactgga cacttacgca gagatgaagg gactgaacac tcagcagatc 720
attaatggtg tttgcctcga ccctcgtatc ggcactcatt ataacaatcc tagctttggt 780
tacggcggat actgcctgcc gaaagacacc aagcagctgc tggcaaatta tgcagatgtg 840
ccggaaaacc tgattgaggc tatcgttgaa agtaatagaa caagaaaaga cttcatcgct 900
gaccgtgttc tggagattgc aggtgcttat gaagcaaatg acagctggga tgagagcaaa 960
gaaaaagaag ttgttgtcgg cgtttaccgt ctaacaatga agagtaacag cgataacttc 1020
cgccagagtt ccattcaggg tgttatgaag cgtattaagg ccaagggtgc aacagtcatc 1080
atctatgagc cgaccctgaa ggacggcgat actttctttg gtagtcgagt tgttaataat 1140
ttagagaagt tcaaaaaaca gagccaggca attattgcga accgttacga caagagcttg 1200
gatgatgtga aggataaggt ttatacacgc gatatttttc aacgggacta a 1251
<210> 80
<211> 416
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_nucleotide_sugar_dehydrogenase
<400> 80
Met Arg Glu Phe Lys Asp Leu Lys Ile Ala Val Ala Gly Thr Gly Tyr
1 5 10 15
Val Gly Leu Ser Ile Ala Thr Leu Leu Ser Gln His His Lys Val Thr
20 25 30
Ala Val Asp Ile Ile Pro Glu Lys Val Glu Leu Ile Asn Asn Lys Lys
35 40 45
Ser Pro Ile Gln Asp Glu Tyr Ile Glu Lys Tyr Leu Ala Glu Lys Glu
50 55 60
Leu Asp Leu Thr Ala Thr Leu Asp Ala Lys Glu Ala Tyr Ser Asp Ala
65 70 75 80
Asp Phe Val Val Ile Ala Ala Pro Thr Asn Tyr Asp Ser Lys Lys Asn
85 90 95
Phe Phe Asp Thr Ser Ala Val Glu Ala Val Ile Lys Leu Val Ile Glu
100 105 110
Tyr Asn Pro Glu Ala Ile Met Val Ile Lys Ser Thr Ile Pro Val Gly
115 120 125
Tyr Thr Ala Ser Val Arg Glu Lys Phe His Cys Asp Asn Ile Ile Phe
130 135 140
Ser Pro Glu Phe Leu Arg Glu Ser Lys Ala Leu Tyr Asp Asn Leu Tyr
145 150 155 160
Pro Ser Arg Ile Ile Val Gly Thr Asp Val Asp Asn Val Arg Leu Val
165 170 175
Lys Ala Ala His Thr Phe Ala Glu Leu Leu Gln Glu Gly Ala Ile Lys
180 185 190
Glu Asn Ile Asp Thr Leu Phe Met Gly Phe Thr Glu Ala Glu Ala Val
195 200 205
Lys Leu Phe Ala Asn Thr Tyr Leu Ala Leu Arg Val Ser Tyr Phe Asn
210 215 220
Glu Leu Asp Thr Tyr Ala Glu Met Lys Gly Leu Asn Thr Gln Gln Ile
225 230 235 240
Ile Asn Gly Val Cys Leu Asp Pro Arg Ile Gly Thr His Tyr Asn Asn
245 250 255
Pro Ser Phe Gly Tyr Gly Gly Tyr Cys Leu Pro Lys Asp Thr Lys Gln
260 265 270
Leu Leu Ala Asn Tyr Ala Asp Val Pro Glu Asn Leu Ile Glu Ala Ile
275 280 285
Val Glu Ser Asn Arg Thr Arg Lys Asp Phe Ile Ala Asp Arg Val Leu
290 295 300
Glu Ile Ala Gly Ala Tyr Glu Ala Asn Asp Ser Trp Asp Glu Ser Lys
305 310 315 320
Glu Lys Glu Val Val Val Gly Val Tyr Arg Leu Thr Met Lys Ser Asn
325 330 335
Ser Asp Asn Phe Arg Gln Ser Ser Ile Gln Gly Val Met Lys Arg Ile
340 345 350
Lys Ala Lys Gly Ala Thr Val Ile Ile Tyr Glu Pro Thr Leu Lys Asp
355 360 365
Gly Asp Thr Phe Phe Gly Ser Arg Val Val Asn Asn Leu Glu Lys Phe
370 375 380
Lys Lys Gln Ser Gln Ala Ile Ile Ala Asn Arg Tyr Asp Lys Ser Leu
385 390 395 400
Asp Asp Val Lys Asp Lys Val Tyr Thr Arg Asp Ile Phe Gln Arg Asp
405 410 415
<210> 81
<211> 780
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsC gene of DSM 33139
<400> 81
atgcaggaaa cacaggaaca aacgattgat ttaagaggga tttttaaaat tattcgcaaa 60
aggttaggtt taatattatt tagtgcttta atagtcacaa tattagggag catctacaca 120
ttttttatag cctccccagt ttacacagcc tcaactcaac ttgtcgttaa actaccaaat 180
ttggataatt cagcagccta cgctggacaa gtgaccggga atattcaaat ggcgaacaca 240
attaaccaag ttattgttag tccagtcatt ttagataaag ttcaaagtaa tttaaatcta 300
tctgatgact ctttccaaaa acaagttaca gcagcaaatc aaacaaattc acaagtcatt 360
acgcttactg ttaaatattc taatccttac gttgctcaaa agattgcaga cgagactgct 420
aaaattttta gttcagaagc agcaaaacta ttgaatgtta ctaacgttaa tattctatcc 480
aaagcaaaag ctcaaacaac acccattagt cctaaaccta aattgtattt agcaatatct 540
gttatagccg gattagtttt aggtttagcc attgctttat tgaaggaatt gtttgataac 600
aaaattaata aagaagaaga tattgaagct ctgggactca cggttcttgg tgtaacaacc 660
tatgctcaaa tgagtgattt taataataat acgaataaaa atggcacgca atcgggaact 720
aagtcaagtc cgcctagcga ccatgaagta aatagatcat caaaaaggaa taaaagatag 780
<210> 82
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsC
<400> 82
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Leu Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Val Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Glu Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Asn Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 83
<211> 687
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsE gene of DSM 33139
<400> 83
atggaatttt ttgaggatgc ctcatcacct gaatcggaag agcctaagtt agtagaatta 60
aaaaattttt cttatagaga gctaattata aaaagagcaa ttgatatcct aggaggatta 120
gcaggttcag ttttatttct tatcgcggct gcattgcttt atgtccctta caaaatgagc 180
tcaaaaaaag atcaagggcc aatgttctat aaacaaaaac gctatggtaa aaatggtaaa 240
attttttata ttttgaaatt tagaacaatg attcttaatg ccgagcagta tctagaactt 300
aatccagatg ttaaagctgc ttaccatgcc aacggcaata agctagaaaa cgatccacgg 360
gtaacgaaga ttggctcatt tataagacga cactcaattg atgaactgcc acaatttatc 420
aatgttctta aaggggatat ggcattagtt ggtccaagac caattttgct ttttgaagcg 480
aaagaatatg ggaaacgcct cgcttactta ctcatgtgca aaccaggaat cactggttat 540
tggacgacac atggtcgaag taaagttctt tttcctcaac gagcagattt agaactctat 600
tatctccagt accatagcac caaaaatgat atcaagcttc tagtactcac aattgcacaa 660
agtattcacg gatcggacgc ttactaa 687
<210> 84
<211> 228
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_epsE
<400> 84
Met Glu Phe Phe Glu Asp Ala Ser Ser Pro Glu Ser Glu Glu Pro Lys
1 5 10 15
Leu Val Glu Leu Lys Asn Phe Ser Tyr Arg Glu Leu Ile Ile Lys Arg
20 25 30
Ala Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Lys Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu Asn Pro Asp Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Arg His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Lys Arg Leu Ala Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Leu Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Val Leu Thr Ile Ala Gln Ser Ile His Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 85
<211> 312
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT4 protein of DSM 33139
<400> 85
atggaaaagg aattaatatc tattattaca cctacatata acagagaaaa gacactcatt 60
cgagtttacg attctttatg taaacagagt tataaatgca ttgaatggat tgttgtagat 120
gatggctcta gagacaaaac taaagaattg atttcgtcat tgataaatca gaaaaataag 180
ccttttccta tcaaatatgt ataccaaaaa aattctggga aacacgtggc agtaaataaa 240
ggcttagaaa ttgcaggggg gggtacgttg gaatcttgga ctccgatgat gccttgtttg 300
atgatgcttt ag 312
<210> 86
<211> 103
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT4
<400> 86
Met Glu Lys Glu Leu Ile Ser Ile Ile Thr Pro Thr Tyr Asn Arg Glu
1 5 10 15
Lys Thr Leu Ile Arg Val Tyr Asp Ser Leu Cys Lys Gln Ser Tyr Lys
20 25 30
Cys Ile Glu Trp Ile Val Val Asp Asp Gly Ser Arg Asp Lys Thr Lys
35 40 45
Glu Leu Ile Ser Ser Leu Ile Asn Gln Lys Asn Lys Pro Phe Pro Ile
50 55 60
Lys Tyr Val Tyr Gln Lys Asn Ser Gly Lys His Val Ala Val Asn Lys
65 70 75 80
Gly Leu Glu Ile Ala Gly Gly Gly Thr Leu Glu Ser Trp Thr Pro Met
85 90 95
Met Pro Cys Leu Met Met Leu
100
<210> 87
<211> 630
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF encoding the putative GT5 protein of DSM 33139
<400> 87
ttggactccg atgatgcctt gtttgatgat gctttagaga cattaatggg gtactggaat 60
gcaatgacac cccaggagaa aaaagaattt aaatcagtta caggacgcgt ggcaaatgca 120
gagacaggag aattgattgg ccctaaaaat aagcttaaac tgattgattg ctcttctctt 180
gaagccagat ttgtgagaaa aatgggatac gaaaagtggg gattatcaag aacggaagtt 240
atgcgcgaat ttcaaagtcc aaatatagaa ggactccatt tttatcctga gaatattaca 300
tatgacgcaa ttggtagaaa atataaagag cgttttgttg aggatgtggt tagaaaatat 360
tatctaaatt catcggattc aattataaag aataaaaagg gccgtagtaa agaaaattat 420
tatttatggc tacataatat taatgatgta ttcgattatt ttttatacaa tcctaaaatc 480
tttctcaaat catttgtcgg gctggcaaga gatggggttc tttcaggaag aagtatacca 540
tatatcataa atcagataaa taccttgccg aaacgagtct tgtttgttct atttatgccc 600
gtaggaatgc ttcttgctta taaaagataa 630
<210> 88
<211> 209
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT5
<400> 88
Leu Asp Ser Asp Asp Ala Leu Phe Asp Asp Ala Leu Glu Thr Leu Met
1 5 10 15
Gly Tyr Trp Asn Ala Met Thr Pro Gln Glu Lys Lys Glu Phe Lys Ser
20 25 30
Val Thr Gly Arg Val Ala Asn Ala Glu Thr Gly Glu Leu Ile Gly Pro
35 40 45
Lys Asn Lys Leu Lys Leu Ile Asp Cys Ser Ser Leu Glu Ala Arg Phe
50 55 60
Val Arg Lys Met Gly Tyr Glu Lys Trp Gly Leu Ser Arg Thr Glu Val
65 70 75 80
Met Arg Glu Phe Gln Ser Pro Asn Ile Glu Gly Leu His Phe Tyr Pro
85 90 95
Glu Asn Ile Thr Tyr Asp Ala Ile Gly Arg Lys Tyr Lys Glu Arg Phe
100 105 110
Val Glu Asp Val Val Arg Lys Tyr Tyr Leu Asn Ser Ser Asp Ser Ile
115 120 125
Ile Lys Asn Lys Lys Gly Arg Ser Lys Glu Asn Tyr Tyr Leu Trp Leu
130 135 140
His Asn Ile Asn Asp Val Phe Asp Tyr Phe Leu Tyr Asn Pro Lys Ile
145 150 155 160
Phe Leu Lys Ser Phe Val Gly Leu Ala Arg Asp Gly Val Leu Ser Gly
165 170 175
Arg Ser Ile Pro Tyr Ile Ile Asn Gln Ile Asn Thr Leu Pro Lys Arg
180 185 190
Val Leu Phe Val Leu Phe Met Pro Val Gly Met Leu Leu Ala Tyr Lys
195 200 205
Arg
<210> 89
<211> 1098
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding a putative GT6 protein
<400> 89
gtgagcaaat atgtcaaatt agaattaaat tacccagtag agcatgggag aaaggaaaaa 60
tccaatcatt tcctaaatgt gattggatta tatgagaatt taatgaaaaa gaacactata 120
ttagttgatt ttgatgttcc tgatgattgg gaatataaac gtggcattga ggatgtaaca 180
ggagaaaaat gggaactgtg gaaatgtatc acacatcggc ttcaaagttc taagattaag 240
gttctgttgc gctatattat aacattcttg ttcgcgttta aagtgttttt gcacagaaaa 300
aaatataaaa gaataattgc atggcaacag ttttatgggt tggctttagc ttttttttgt 360
aagttgttta atgtgaagga ttatccggaa atatatataa tgacatttat atataaaaat 420
aataaaagta gagtgtttag taaatttgtg aaatatgcgg tagattctag atacataaag 480
aaattgatgg taatgtcgga tggcgaaaaa cagttttatt ctaaagaact gaagttagat 540
gagtctttgt tctattgcac tagagttggc gttaaagatg aaactaattc tattaagcaa 600
aatattacgg aaaaatacta tttagcagta ggtagaagca atagagatta taaatttttg 660
agagatgcgt ggaaaaatga atatggaaag ttaataatag tcaatgattc atataaagag 720
ccggaaaaag atggtattgt atgtttaaaa aaatgttatg gtcgagacta tttacaaatg 780
gttgctaatt gttatgctga ggttatacct ttgaatgata aaaatatatc atctggagct 840
ttgagctttt tgcaagcaat gatgttttct aagccggtta ttgtaacaaa taatatgaca 900
gtaagggatt atattaaaag tggatataat ggggttatca ttgagaacac ttcagaagaa 960
ttggaaggtg caattaacca attagagaat ccgataatat atcaagaaat tgcatcaaac 1020
gctcgaaaag aatatgaaga gaaatatagt gaattaattt tgggcaagga tattggtagt 1080
atgattgttc atcgataa 1098
<210> 90
<211> 365
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_GT6
<400> 90
Val Ser Lys Tyr Val Lys Leu Glu Leu Asn Tyr Pro Val Glu His Gly
1 5 10 15
Arg Lys Glu Lys Ser Asn His Phe Leu Asn Val Ile Gly Leu Tyr Glu
20 25 30
Asn Leu Met Lys Lys Asn Thr Ile Leu Val Asp Phe Asp Val Pro Asp
35 40 45
Asp Trp Glu Tyr Lys Arg Gly Ile Glu Asp Val Thr Gly Glu Lys Trp
50 55 60
Glu Leu Trp Lys Cys Ile Thr His Arg Leu Gln Ser Ser Lys Ile Lys
65 70 75 80
Val Leu Leu Arg Tyr Ile Ile Thr Phe Leu Phe Ala Phe Lys Val Phe
85 90 95
Leu His Arg Lys Lys Tyr Lys Arg Ile Ile Ala Trp Gln Gln Phe Tyr
100 105 110
Gly Leu Ala Leu Ala Phe Phe Cys Lys Leu Phe Asn Val Lys Asp Tyr
115 120 125
Pro Glu Ile Tyr Ile Met Thr Phe Ile Tyr Lys Asn Asn Lys Ser Arg
130 135 140
Val Phe Ser Lys Phe Val Lys Tyr Ala Val Asp Ser Arg Tyr Ile Lys
145 150 155 160
Lys Leu Met Val Met Ser Asp Gly Glu Lys Gln Phe Tyr Ser Lys Glu
165 170 175
Leu Lys Leu Asp Glu Ser Leu Phe Tyr Cys Thr Arg Val Gly Val Lys
180 185 190
Asp Glu Thr Asn Ser Ile Lys Gln Asn Ile Thr Glu Lys Tyr Tyr Leu
195 200 205
Ala Val Gly Arg Ser Asn Arg Asp Tyr Lys Phe Leu Arg Asp Ala Trp
210 215 220
Lys Asn Glu Tyr Gly Lys Leu Ile Ile Val Asn Asp Ser Tyr Lys Glu
225 230 235 240
Pro Glu Lys Asp Gly Ile Val Cys Leu Lys Lys Cys Tyr Gly Arg Asp
245 250 255
Tyr Leu Gln Met Val Ala Asn Cys Tyr Ala Glu Val Ile Pro Leu Asn
260 265 270
Asp Lys Asn Ile Ser Ser Gly Ala Leu Ser Phe Leu Gln Ala Met Met
275 280 285
Phe Ser Lys Pro Val Ile Val Thr Asn Asn Met Thr Val Arg Asp Tyr
290 295 300
Ile Lys Ser Gly Tyr Asn Gly Val Ile Ile Glu Asn Thr Ser Glu Glu
305 310 315 320
Leu Glu Gly Ala Ile Asn Gln Leu Glu Asn Pro Ile Ile Tyr Gln Glu
325 330 335
Ile Ala Ser Asn Ala Arg Lys Glu Tyr Glu Glu Lys Tyr Ser Glu Leu
340 345 350
Ile Leu Gly Lys Asp Ile Gly Ser Met Ile Val His Arg
355 360 365
<210> 91
<211> 18254
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33134 eps gene cluster, complete sequence
<400> 91
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat ttataattgg aggtttttat 420
agttataatt ctaggataaa taatctttca aaagctgata aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa agtattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatccaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcact gactagtaat 840
gagcaactta taacaaattt ggctagtgca ggagcggagg taatagttca accctctcca 900
ccgatctatg gtggtgttgt gtaccccgta caagaagaac aatttaaaca atctttatct 960
acaaagtatc cctatataga ctactgggct agttacccag acaaaaattc tgatgaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgcttcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaagaaat gcaggaaaca caggaacaaa cgattgattt aagagggatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattt ggataattca gcagcctacg ctggacaagt gaccgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc tgatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acaaattcac aagtcattac gcttactgtt aaatattcta atccttacgt tgctcaaaag 1560
attgcagacg agactgctaa aatatttagt tcagatgcag caaaactatt gaatgttact 1620
aacgttaata ttctatccaa agcaaaagct caaacaacac ccattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ttagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct gggactcacg 1800
gttcttggtg taacaacctg tgctcaaatg agtgatttta ataataatac gaataaaaat 1860
ggcacgcaat cgggaactaa gtcaagtccg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ccgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caagggatta aaagttttct agtaacatct 2100
tcagaagcag ctgaaggtaa atcaaacgag agtgctaatc tagctgttgc ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacctaatc catcggaatt attagcatct agtgcaatga agaagttgat tgactctgtg 2400
tccgattcct ttgatgttgt tttgattgat actccacctc tctatgcagt tactgatgct 2460
caaattttga gtgtttatgt aggaggagtg gttcttgttg tacgtgccta tgaaacaaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtagag 2640
taattggaat aaatgttaat caaataaaag acagaaattt gtagaagagg ggagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caaccatcac tgccactcct 2820
catcataatc ctcaatttaa taatgaatca ccgcttattt tgaagaaagt taaggaagtt 2880
caaaatatca ttgacgagca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaaggaaagt tactgacagc agcgggcact 3000
tcaagttata tattgattga atttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggacttcaa cctattttgg tccaccctga gcgtaatagt 3120
ggaatcattg agaaccctga tatattattt gattttattg aacaaggagt actaagtcag 3180
ataacagctt caagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt gacgtcacgt 3300
gcatttaaga tgagggaagc atttgaaatg attgaagata gttatggttc tggtgtatca 3360
cgaatgtttc aaaataatgc agagtcagtg attttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa gaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatgaaag tttttgagga tgccgcatca cctgaatcgg aagagcataa gttagtagaa 3540
ttaaaaaaat tttcttatag agagctaatt ataaaaagag caattgatat cctaggagga 3600
ttagcaggtt cagttttatt tcttattgcg gctgcattac tttatctccc ttacaaaatg 3660
agctcaaaaa aggatcaagg gccaatgttc tataaacaaa aacgctatgg aaaaaacggg 3720
aaaatttttt atattttgaa atttaggaca atgatagtta atgctgagca gtatttagag 3780
ctacatccag aagttaaagc cgcctatcat gccaatggca ataaactaga aaatgacccc 3840
cgtgtgacga agattggttc atttattaga caacactcaa ttgatgaatt accacaattt 3900
atcaatgttc ttaaagggga tatggcattg gttggcccaa gaccaatttt gctttttgaa 3960
gcgaaagaat atggggagcg cctctcttac ttactcatgt gtaaacctgg aattactggt 4020
tattggacaa cacatggtcg aagtaaagtt ttttttcctc aacgagcaga tttagaactc 4080
tattacctcc agtaccatag caccaaaaac gatatcaagc ttctagtact cacaattgta 4140
caaagtatta acggatcgga cgcatattaa aaaatgaaaa tagcattagt aggttccagc 4200
ggtggccatt tgacacacct gtatttgtta aaaagtttgg taaacatgtt attgcaggat 4260
ttaaacctac agcatatgca aaattgagca aaaaaaatgg ctgaaaaagt tgaaacttta 4320
gtcctttcag tttttaatat tttcgagttt ctttcagatg aacattatat ttgaaagaaa 4380
aaacaaattg atttaaaaat atagtgtaaa ttatacaata gtagttagtt tgagtattta 4440
atcattgcat tgtatactgc aatgattttt taaggagaag tgaagtttat gaaaaaagaa 4500
aaaatttcaa ttattacacc tgtatataat tgtgaaaaac ttattgaaaa aaccattgaa 4560
tgtgttttga atcaaacata taaaaattgg gagtggctac ttgttgatga ttgttcacca 4620
gacaactctg ccataataat aaaaaagtat gctaaaaatg ataatagaat taaatatttt 4680
aaattaagtg aaaatagtgg tgctgccgtt tctagaaata aagcattggc agaatctact 4740
ggtagatttg tagcttattt ggatgcggat gatttatgga aaaatgataa attggagaaa 4800
caagtaaaat ttatgttaga aaatcaatat tcatttactt gtacggacta tgaaaaaatt 4860
acggaaacag gtaatagttt aaataaaatt attaaaatac caaaaaaagt agattataat 4920
ttctttttaa gaaatacaat aattcaaact gttggagtga tggtagatac aaaattaaca 4980
gggaaagaat tattgaagat gcctaatatt agacgaagac aagatgctgc aacatggtgt 5040
caacttttaa aaaatggaca cgattgttat gagtgtccag agaatttgtc ttattataga 5100
gtagtaacaa attctttatc aagtaataaa tttaaagcaa taaaaatgaa ttggtactgg 5160
tatagaaaga tagaaaaatt acctttatgg aaagcatgtt attgctttat tggatatgct 5220
tttaatggtg taagaaaaag aatatatata aaaaggtaag tgaaaagtat ttgacttttt 5280
catacttgtt tgtaccttaa acaaacgcaa cgaagtgcaa aggaatgata aaagaatgaa 5340
aaaaatttta tatatagcta caactgccga tagtagaaat agattagatg gtgaaacaat 5400
taaatgtaga ttattaagag aatatctaag aggaatagaa aatgttgaac ttatttctgt 5460
agatactgat aattggaaaa agcataaatt aaaattagta tttttaataa tatataattt 5520
tattttttgt aattctatcg ttgtttcatc tgcggataaa ggtgctaata ttgtcttaga 5580
tttttttaga aaaattaata ctaaaaaaaa tatttattat tttgtaattg gtggtacatt 5640
atataaaaat ataaaagaaa aaaattggaa tattgaaaca tataaaagat taaaacatat 5700
ttacgttgag gcaaatcaac tgaaattaga tttgaactct ttaaatatta ctaatgttga 5760
tatcttaaat aattttagaa aagtaaataa atttgaaaat aaatataaaa agagtaaaga 5820
aataaaattt gtttactttg gaagagttat aagagaaaaa ggtgtagagg aagcaataaa 5880
aatgattaac aggcttaatg ctgaaaatat tatatgtaca tttgatatat atgggcaatg 5940
taaagatgaa tatttgcaac aaatacaaga aaagtttaat gaaaacataa gatttcatgg 6000
tgaaataaaa ccaaatggta aaaaagaata tgaaatatta tcacaatatg atgtttttct 6060
gttcccaact gagtacccag gagaatgcct tccaggagct ttgattgatt gctatatttc 6120
tggacttgca gtaattgctt caaattggaa atatgcaaag gaatatattt tagataatga 6180
aaatggaaaa atctttgaat acaaagacta taatgatatg tataaaaaaa caaaagaaat 6240
ggtagctgaa aatgttattc aaaaatataa attaaaatca gtagaattat caaaaaaata 6300
taatatggat gtattattaa atgactttaa aaaagaaata atggaggaaa aaaatgaaac 6360
tttttaaact tttaaagaga tggtttaata tggtaacagg taaaagttct ttccatgtaa 6420
aacaaggaga aggaaagtat ttttcaaaat atgaaattaa aggatattac aatgatttga 6480
caaacaaggt ttcgaataag actgtaatag attcaaatgg aattccaata aatacaacaa 6540
ttgcaaattt agaaacgtat tttccaatat ccatatttca atatggctta ggattatacg 6600
atttatatat agaaacagga gaaaaagatt atttaaataa atttttaaga attgctgaat 6660
gggctataga aaatataagt gatactggaa tgtgggattg catgggtaaa ttaaatgatt 6720
ctgcacattt gacacaatca tcaatgtgtc aaagcgaagg agtatctgtc cttttgagag 6780
catataaaga aacaaaaaat gaaaaatatt atcaaaatgc gaagttggct attgatttta 6840
tgttgaaaaa agtagaagat ggtggaacag cattatacat taatgatgaa ataatatttc 6900
aagaatatgt ttccaaatat aatttatcag ttttaaatgg ttggattttt tcaatatttg 6960
gattgtatga ttttacatta gttaataaag aaaaaaagta tattgacatt ttaaatgata 7020
caatagaaac aatgtgttta gaattaaaaa aatatgatag aaaattttgg tctaattatg 7080
atttaatgca tacaatagct agtcctgcct atcatgattt acatattatg caattaagaa 7140
ttctctataa attatttaat aaaaaagaat ttatgattta tgctgataaa tgggaaaaat 7200
atcaaaaaaa taaaatatat agggcattgg caatgattat aaaattaaaa cagaaaattt 7260
taaaaaataa atattatgat ataaatacaa gtttagtaaa gtgaggagtt taaattgaaa 7320
aatataggaa tagttctcgt aacatataat agattggaga aattaaaaat tgcattatca 7380
tgttacgaaa aacaaaaaac aaaaattgat actatgataa ttgttaataa ttgcagtaca 7440
gatgggacat ttgaattttt agaagaatat agtaaaagga agttaaaata taaaattgta 7500
attttaaata tgccaaaaaa tcttggtggt gcaggaggtt tctttgaagg aatgaaatgt 7560
gcaatgaaag aagatttaga gtgggtttat atttcagatg atgatgctta tccaaatgac 7620
aatactatat atgagcttga aaaaatttac tcaaaattac aaaataaaga tgaaattgtt 7680
gcattgtgca gtgttgttga aaataaaaat ggtttagatt atgggcatag attaagaata 7740
atgaaaaatc tcttttttgt gaagtggaaa ccagtagata ggagtgaata taataaagat 7800
tattttaatg ttgacattct atcgtatgta ggatcactta ttaatgttaa tgcattatat 7860
tgtgcaggtt tagacagaaa agatttcttt atatatcatg atgatcaaga acattcattg 7920
agattaggca aaaatggaaa aatattaact tgcactaaaa gtgtaataca tcatgatacc 7980
gaagtaaaaa aatacaagga attattttgg ggaaattact atgatactcg aaataggctt 8040
ttaatgataa aatataattt tcctttaaga tatttctata taagatacta tttaggatat 8100
ataagggatt gtttattatg caaaaacaaa ataaaaaaag aaatgttaaa agttgcatat 8160
atggatgcta aaaataataa attagggttg aattcaatat ataaacctgg ttgggtagct 8220
aaaaataaat aatgtattct ttttgaaagg aattaataaa atgataaaaa gaaatcaatt 8280
aaatttattt ttgcagtata tacttgcttt tctaattatt ttagaaacta gaagtgtata 8340
ttctcgttgt gttgttggtc atatagatga aataataata ggtggtataa tattaacaat 8400
aattttaatt attttagtta acatgaatta taaattaaaa actaaatcat tgtttttttt 8460
acttttttat tatctatata tgtttatatt tttaattata aatgttgatg gatataatgt 8520
aaactttgtt ataatattta tgattttatt tccacttgta tttttaatgc tcaatttata 8580
tgatagtaaa gaaataaaaa atctgtttaa agcatatgta aatattatgg ttatcatttc 8640
tgcaatatcc ttattttttt acatattcgg atccttaact ggtatggttt ctactaatat 8700
tatacaagaa ataaattggg gtggagatta tggaggtaat aaaataataa atggatattt 8760
tggattgcat tttaatacac aaacaactgt aatatttggt gatgccattt taagaaatac 8820
aagtatattt gttgaaggac cgatgtttgc attacatctc ttatttgcaa tggcattatc 8880
actatttatg aataaaaaat taataaataa atattcaata atatttggct tttcaatatt 8940
atcatcgcta tctataactg ctatattatt ttatatgttt ttattgtttt acaagtatac 9000
attttataat aaaagcaaaa cgaagattat tttgttgcca atattgtttt tcatattttt 9060
aatcatagga actacttttt ttaatgataa gcaatcaaca aattcctata atataagaaa 9120
tgacgattat acagcaagtt ttagagtatt taatgattat ccaatttttg gaagtggatt 9180
tgctaataat aacatagtaa ttaaatatat gtcaacattt agattgtata atacaggttt 9240
agctaactca tttgtagtct tattagtcca aggaggatta tatttagtaa ttttttattt 9300
actgccagtt attttgaaca gtttaaattt aattaaatcc aaaaataaaa atttgttttt 9360
gatagaaatt atgcttttac aactttattt attttttatg aatgcttatc aatatacatc 9420
attgatgata atctttctag catttgatta ttatatttta ttattttata acaaaaataa 9480
tgatttaaaa aatttgcaac ttaaggagga aacatatgaa aaaatatgat tatttaatag 9540
tgggctctgg attatttgga gcaacatttg caaatcttgc taaaaaagaa ggaaaaaaag 9600
ttctagttat agagaaaaga acaaatatag caggaaatat atatacagag gaaatagagg 9660
gaatacaagt acataaatat ggagctcata tatttcatac agattataaa gacgtatggg 9720
aatatgttaa ttcatttgta gaatttaata gatatacacc cgaattgcta gttgattatt 9780
tagccatgac ttgatacccg atagaatatc ttaaagtctc tggttccagt gatttagctg 9840
attttaacag taaagaatac gctaaaagta tcatctctaa tttcaattga aaaccttgag 9900
gcgaacgact tttacaacgc tcagctccta gatttgtcaa aaaagagaaa actcgctcaa 9960
tcacttttct acgttttgaa aaattaggga aaaggatttt cttttgcttc atgttcttcc 10020
tgacaggtgt aattagatca attcctttta attccagtct atcgtgcagt gactgaccta 10080
gatatcccat atctccaagg actgttggtg tcccaaattg actcaacact ccctcggtca 10140
ttgaactatc tgccattgaa gcaggagtaa ttgtgtagtc tatgacatag cctgattcac 10200
tgactaaagc atgacattta catccataga agtactgtcc ctttgtagca ttgtagccaa 10260
catttgcata atctccaaga actttgcttc tgaaattacg aatcggctga cacaaaggaa 10320
tggggaagct gtcaataatg gatacactca ttccttcaac ctctttaaag acgagtgctt 10380
ggcgaatgac ttggatactc ggtaagaggg cattacaacg gcggacaaag cgagaatatt 10440
ctaggaaatt aggaaataaa ctttgagcca attggtgctt agctttaagc gtttcactaa 10500
aatgcagtac gccccatagg taacaagcga taactaagca atctgatgtt gcgagatgga 10560
cgttctttcg gttttgaacc tcaaggggaa cactcgtttg ataaagcgtc tcaatggttg 10620
tcagtaaata aacaaaacct tttggaagtg tgctattata agtcatataa gtcatgcgct 10680
ttctattgct tagtggttta agattaggat agcacgactt atttattttc caatgaatta 10740
actagcaatt cgggtatatt atctattttt gtacaagtag gaatttcagg aattacttat 10800
tttgtaatgt taattatatt aaaagataaa atgaaatttg aaggaataga aattataaaa 10860
aataaattat aaaaaagaaa aattaatgtt taatgaaaga aggatcataa tgaaaatagc 10920
agtagcagga acaggttatg taggtttatc tctagccaca ttactaagcc aaaaaaatga 10980
ggtagttgca cttgatgtaa taccagaaaa agtagagaag ataaataata gaataagtcc 11040
aattcaagat gaatacatag aaaaatattt caaagaaaaa gaacttaatt taaaagcaac 11100
tttagattat aaagaagcat ttgagaatgc agaatttatt ataataagta cgccaacaaa 11160
ttatgattca gaaaaaaatt attttgacac atcatctgtt gaagatataa ttcagaaagt 11220
aaaaagtatg aatatagata caacaatggt tgttaaatca actattcctg ttggatttat 11280
aaaggcaatg aaagaaagat atcaaataga caatataatg tttagtcctg aatttttaag 11340
agaaggaaaa gctttatatg ataatttata tccatcaaga ataatagtgg gagaaaaatc 11400
agatagagca gaaaaatttg ctaatctttt aaaagaaaat tgtttaaaag aagatgtagt 11460
agttaaatat atggattcta ctgaggctga ggcagtaaaa ttatttgcaa atacatattt 11520
agcacttaga gttgcatatt ttaatgaatt agatacatat gctgaattaa aaggtttaaa 11580
tacaaaagat attatagatg gagtatgttt agatcctcgt attggaaatc attataacaa 11640
ccctagtttt ggctttgggg ggtattgtct tccaaaggac tcgaagcaat taaaggcaaa 11700
ttataaagat gttccagaga atattatcag tgcaatagtt gaatctaata gaactagaaa 11760
agaccatatt gccgatatga tctcaaaaag aaacccaaaa gtagttggaa tatatagatt 11820
aacaatgaaa tctgggtcag ataattttag agctagtgca attcaaggag ttatgaaaag 11880
aatcaaagct aaaggaattg aagttgttgt ttatgagcca actttaaaag aagataattt 11940
ctttaatagt aaagtaatca aagatataga tgaatttaaa aagatatcag atgtcattat 12000
agtaaataga cttgatgaaa atgtatctag tgtaaaagat aaagtttata caagagatct 12060
attcgctaga gattaaaaaa ataggaggaa atcctaatga acaaaaaatt aataataagt 12120
tggattttgg ttattctatg ggcaggattt ggactaatgt caaatattta gacacaaaat 12180
gagctatatt tttggaaagt caacgaaaaa ctagacacgg agttaagaga atttaagaat 12240
tatatttttc aaagtctttt ggagacttgt aatcaagacc tgaatgcatc ctttttgtat 12300
tgtaataggt ctcaatgtat ttaaatattt cttgagtagc ctcagctctt gtctcaaaat 12360
gagcatcatt aataagctcc ctctttagcg tcttataaaa agactccatc attgcattgt 12420
catagggatt tcctttacga ctcatgctag attgagcacc gacttgacga agagtagatt 12480
gataacgaga gcttgtatat tgactccctt gatcagtatg gacaatcaag ccaggctgag 12540
gatgttcttt cccacaagct tgtaagaagc aatccctcac cagtttatct tgcatccgtg 12600
aagacattga ccagcctaca atcttacgtg aaaaaacgtc gatattcacg gctaagtata 12660
aggtgccttc tttggtaggg atataggtca tgtctcccag ccatacttta ttaggagctg 12720
ttgctttaaa gatctgatta attaaattgg gtcttgaaag cgaagctcct tttctgttgt 12780
aatgtttata tttataacgg cttcccttgg cataaagtcc catcaagtgc atcagtttcc 12840
caacacgttt cgtgttggtc ataataccag tattatgaag taccttggta attctaaccg 12900
caccatagcg tcccttatgc tcatgaaaga cagcttttat cttctctgag agaatttctc 12960
tctccacttg ttgttttgaa ggacgacgat gcatgtattc atagaaacct gagcgagaaa 13020
ccttaagaac ttttactgca tgcttaattt ttatcttccc atgatgtttc aagagaaatt 13080
caaaacgttt tacttgcttc gcttcaagaa gacccggaac ttttttagaa gttcaagttc 13140
ctccttaaga taacgatttt ctttctctaa caatttaatc ttatgttggg catcagctag 13200
ggctgtccca ttgcctggaa aagcactttc tccatattct tcaacttctt gaacccagcg 13260
ataaagacta ttggcatgaa cctcaagctc ttggctgact tctttaacag agtaaccctc 13320
ttcaagaatg agttttactg cagaattttt aaattgttta tcgaattttc ttcttgccat 13380
aataaacctt tccacgattt ctcttaactt tgtgtctagt ttattataac ctttccattt 13440
tcttaaaata attttcttct ttctgatttg gagttaaacc atttttggcg gaatgaggat 13500
tataattgtt ataaaattgt tcgatgtatt caaagcaaga gagttgaact tcttgaatgg 13560
agtgataagt tctgcgatta atttctcgtt gtttaagata cttgaaaaag acctcagtga 13620
cggcattatc ataaggatat ccaggcttgg agtaagaagc aagcaattga tgctcatcta 13680
ataactttct aaaggaagtt gatttaaatt ggcttccttg atccgaatga aaaataattg 13740
gttccttagg tttcctctta tttatagcta tttctaaggt gtcacaggcg agcttggcat 13800
caatcctatc acttactttc cacgcaatac atttccttga gtagaggtca agaatagcac 13860
agagataaac atgtcgctta ggtcctatag agatataagt gaaatctgtt gtccaaactt 13920
gatttgggga gttcgggtta aattcttgtt taagcagatt atcagaagaa aatacaggag 13980
atttatttga tttaaaacga ggtttgatgg ttgacatttt agggagtgtc atagacttga 14040
gaagtcttaa gatacggcct tcagaaatat taacgccata atcacgcaga agaatgattt 14100
taaaagctct cgttccaatt cttttcttgg ctttcatata aatctcaagg agtaattttc 14160
tcaagcgttg attttctact tcacgcttcg aaggcctctt gtttatgaag ttatagtagg 14220
tggaacgatt gacatgtaaa acacgacaga gcataactgt cgtgtgttca aatcggagtc 14280
tatagatggc tttcaatctt acttggagtt ttgcatgaat atggcactcg ctttttttaa 14340
gatcagattt tcttcctcta gttgggcatt ccttttttgt aattcttgaa tctgtttagc 14400
agtcaacacc gtattatctt caagacgcac ttgagagtac tgcttaatcc attttgcaag 14460
tgcagaagaa gataccccat aatctttaca gagttcagtt tgtgttttac cagtttgata 14520
gagattaacg agcgattgtt tgaaatcctc gtcgtaacgt ttaaaacctg acataaaagt 14580
cctttcattt ttgtgtccta atagacagat tataacacac aattttctgt ccacttttat 14640
agtatagctc caagatcgct tatcatcaaa ttgataaata taaaagaagt caaatatttt 14700
ggggaatcaa ttttttaagt tggattacag caaccttctc ctttattgca ttttttttct 14760
ttatcttgct cacacatgag tatcagaata tctatatttg gcaaagttta ctgattttgg 14820
caagtctttt tgatatttca tggtacttta tgggacgaga aaagttcaaa gtcactgtaa 14880
ccagaaattt catcataaaa attttaaccg ttatttctat ttttgttttt gtaagaaatc 14940
ataatgattt accaatctat gttgcaatca tggggattgg aagtttacta ggaagcttat 15000
ctttatggcc ctatctcaga aatgaaatta ataagccaaa tctaagatac ctcaacttaa 15060
aaaaacattt acattacaca gtcatcttat ttatcccaac aatcgctacc cagatttatc 15120
tcatagcaaa taaatccatg attggactta tggattctgt cactcatgcc ggattttacc 15180
aacaagcaga cacaataata aagatggcat tatccgtaat tggaaccata ggtgtcgtca 15240
tgttacctcg cattgcaagt atgcactcag aaggaaacat gaaagtaatc agagcatcga 15300
tcgtaaaaac atttaatatc gcaacaggga tttcatttgg tatctttttt ggaattctag 15360
ggattgcact acactttgca ccattctttt ttggaaaatc ctttgagatg gtcggagtga 15420
ttatgatgct agaagccccc atcattatct ttattccaat gagtaatgta tttggtattc 15480
aatatctcct tccactaaat agaatgagag cttttacctt atcagtaacc tttggtgcat 15540
tattaaatat cataataaat tttgctttga taccgttact tggcgtgatc ggagcaacgg 15600
tagcaactgt ggtatctgaa tttgcagtta cagcttacca atatttatca atcagaaaag 15660
agttttcatt cagtgattta tttggtggac tttggaagta ttttatctca ggctcattga 15720
tgtttgtcgt agttttttgg atgaatcaat catttaaaat gacaataatt cagctgatac 15780
tccaaattat cttaggtgta ctcatctata ctctttctaa tatcttatta aagacacagc 15840
tatggcttat ggcctcagaa cttttaggaa aaatgaaaaa tcgggtatca agaaatcata 15900
tacgtataga tcaaaaacaa gaaactctcg aacatccatt agatacaatt aaagcttcgt 15960
ttgatcaatt tgctatcctc tttcaagaaa tcgatgaaaa aaaatattat ctcataagaa 16020
ctttttgaca aatcttaata attttgacaa tacactgaaa aatgtaacat ttaatgatga 16080
tttaaaaaaa atgatataat aagattatca gatttcatcg ctgagcttag tataattatg 16140
agcaaaaaaa gagattatct taaagttcaa gatcaagagc acctctatca atttgctcaa 16200
ggtttaaata tccttgcatc aaaaatggag aaaattgcac aagaagaata ttcaccaaaa 16260
gagctaaaag aatggttcag aaaagagctg ggcgaataac tttatcaagt aaaaaaacaa 16320
ggaatagcaa gaaaggaaaa gacatggagc agaaaaagaa aaagaatatt tggctgataa 16380
ttgtacctat cttaataata atttccctta taggagcagg ggcttatgcc ttaatagatt 16440
cacttattcc tactgatcat acgaaaacaa acagttcgga tcaaccgacc aaaacttcgg 16500
tttctaatgg ttatatagag caaaaaggtg aagaagctgc tgtgggtagt atagcacttg 16560
tagatgatgc tggtgtatcg gaatgggtta aggttccctc gaaggcaaat ctagataaat 16620
ttactgattt atctacgaat aatatcacta tttatcgaat taacaatccg gaagtcttaa 16680
aaacagttac caatcgtacg gatcaacgga tgaaaatgtc agaagttata gctaagtatc 16740
ataatgcttt gattatgaat gcttccgctt ttgatatgca gacaggacaa gtagctggat 16800
ttcaaattaa taatggaaag ttgattcaag actggagtcc aggtacaacg actcaatatg 16860
cttttgttgt taacaaagat ggttcgtgca aaatatatga ttcaagtaca cctgcttcaa 16920
ctattattaa aaacggaggg caacaagcct atgattttgg tactgcaatt atccgtgatg 16980
gtaaaattca accaagtgat ggctcagtag attggaagat ccatattttt attgcgaatg 17040
ataaagataa taatctctat gctattttga gtgatacaaa tgcaggttat gataatataa 17100
tgaagtcagt gtcaaatttg aagctccaaa atatgttatt acttgatagt ggtggctcaa 17160
gtcaactatc tgtcaatggt aaaacgattg ttgctagtca agacgatcga gccgtaccgg 17220
attatattgt gatgaaataa aaataaaaga acctcttggt tcttttattt tagagatttt 17280
tcaaaaaggg ttttgactga gtctaattct gtttgagaaa cgaccttagc tccattttca 17340
tctgttgtat gtagattgag cttgctggta ttctttagag ccgtattata gctcataacg 17400
atcgttttag catcattgaa acttatattg gttttaacag tgtttgaaaa agcatagaga 17460
atttctcgat agtaattgaa actttcaagt tttttcatat gagcaatttt ttgaatattt 17520
ccgtagagtt ccattgagac attttgaatc cgggtgattg aagcatccaa atcagtatcg 17580
tcaatttgtg tcatataggc ttggacttga tcagcagttt gtaaattaac agttccttgt 17640
ttaaactcat aaccttcagc attgaatgcc tttggatttt gcatggtgat tccaccagtg 17700
gcctgtacaa gtgatcccat tttattaaca tcaatctgaa ttactttgtt aatggacaca 17760
ttcaataggt ctttaaccat ctggaaaatt ccatcatctc cattcgtatt gtaaacttca 17820
gtgattgttt tttgattagg cattgtcgca aaaactggga agttcatgaa agtagtttga 17880
tttgtcttta cattcgttga agctaaaaca gtagcataag ctgtattttt agaattattt 17940
ttaccagttg caatgataag tgtggtgaat gttttagact tttttaagtc gatacttgtt 18000
gttttaggga aattttcata tgatgttgaa aaggttgatt caacatttct ataagctacg 18060
taggctatag aagcaacagc gataattact aatacaaaaa taattgaaat aacttttagt 18120
actgtgtgtt ttttcttacg ataatgacgc ctcttttttt gattcatggt atctccatat 18180
acatattata taccttaaat tataccatat ttaatgatgc tatacttaaa tcttagagtc 18240
actattgtat aatt 18254
<210> 92
<211> 18365
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33136 eps gene cluster, complete sequence
<400> 92
atgtttatga atgatttatt ttaccatcgt ctaaaggaac tagttgaagc aagtggtaaa 60
tctgcaaatc aaatagaaag ggaattgggt taccctagaa attctttgaa taattataag 120
ttgggaggag aaccctctgg gacaagatta ataggactat cagagtattt taatgtgtct 180
ccaaaatatc tgatgggtat aattgatgag cctaatgaca gttctgcaat taatcttttt 240
aaaactctaa ctcaagaaga gaaaaaagaa atgtttataa tttgtcaaaa atggcttttt 300
ttagaatatc aaatagagtt ataacaataa taaatttagg gagtttttct tattgatatg 360
atgaaaaaag gaatttttgt aattactata gtgatatcta tagcattgat aattggaggt 420
ttttatagtt ataattctag gataagtaat ctttcaaaag ctgataaagg aaaagaagtt 480
gtaaaaaata gcagtgaaaa aaatcagata gaccttacct ataaaaagta ttataaaaat 540
ttaccaaaat cagttcaaaa taaaatagat gatatttcat ccaaaaataa agaagttact 600
ttaacttgta tttggcaatc tgattcagtt atttctgaac aatttcaaca aaacttacaa 660
aaatattatg gaaataagtt ttggaacatc aaaaatatca cttacaatgg cgaaactagt 720
gaacaattat tggctgaaaa agttgaaaac caagtattag ccactaatcc tgatgttgtt 780
ttatatgaag ctccactttt taatgataac caaaacattg aagcaacagc ctcactgact 840
agtaatgagc aacttataac aaatttggct agtgcaggag cggaggtaat agttcaaccc 900
tctccaccga tctatggtgg tgttgtgtac cccgtacaag aagaacaatt taaacaatct 960
ttatctacaa agtatcccta tatagactac tgggctagtt acccagacaa aaattctgat 1020
gaaatgaagg gactgttttc tgatgatgga gtatatagaa cattaaatgc ttcggggaat 1080
aaggtttggc tagattatat tactaaatat tttacagcaa actaattaag ttataaataa 1140
caattattaa atattggaga agaaatgcag gaaacacagg aacaaacgat tgatttaaga 1200
gggattttta aaattattcg caaaaggtta ggtttaatat tatttagtgc tttaatagtc 1260
acaatattag ggagcatcta cacatttttt atagcctccc cagtttacac agcctcaact 1320
caacttgtcg ttaaactacc aaattcggat aatttagcag cctacgctgg acaagtaacc 1380
gggaatattc aaatggcgaa cacaattaac caagttattg ttagtccagt cattttagat 1440
aaagttcaaa gtaatttaaa tctatctgat gattctttcc aaaaacaagt tacagcagca 1500
aatcaaacaa attcacaagt cattacgctt actgttaaat attctaatcc ttacgttgct 1560
caaaagattg cagacgagac tgctaaaata tttagttcag atgcagcaaa actattgaat 1620
attactaacg ttaatattct atccaaagca aaagctcaaa caacacccat tagtcctaaa 1680
cctaaattgt atttagcaat atctgttata gccggattag ttttaggttt agccattgct 1740
ttattgaagg aattgtttga taacaaaatt aataaagaag aagatattga agctctggga 1800
ctcacggttc ttggtgtaac aacctatgct caaatgagtg attttaataa taatacgaat 1860
aaaaatggca cgcaatcggg aactaagtca agtccgccta gcgaccatga agtaaataga 1920
tcatcaaaaa ggaataaaag ataggagttc aggatggcta aaaataaaag aagcatagac 1980
aataatcgtt atattattac cagtgtcaat cctcaatcac ctatttccga acaatatcgt 2040
acgattcgta cgaccattga ttttaaaatg gcggatcaag ggattaaaag ttttctagta 2100
acatcttcag aagcagctgc aggtaaatca accgagagtg ctaatctagc tgttgctttt 2160
gcacaacaag gtaaaaaagt acttttaatt gatggcgatc ttcgtaaacc gactgttaac 2220
attactttta aagtacaaaa tagagtagga ttaaccaata ttttaatgca tcaatcttcg 2280
attgaagatg ccatacaagg gacaagactt tctgaaaatc ttacaataat tacctctggt 2340
ccaattccac ctaatccatc ggaattatta gcatctagtg caatgaagaa tttgattgac 2400
tctgtgtccg attcctttga tgttgttttg attgatactc cacctctctc tgcagttact 2460
gatgctcaaa ttttgagtat ttatgtagga ggagtggttc ttgttgtacg tgcctatgaa 2520
acaaaaaaag agagtttagc aaaaacaaaa aaaatactgg aacaagttaa tgtaaatata 2580
ttaggagttg ttttgcatgg ggtagactct tctgactcac cgtcgtatta ctactacgga 2640
gtagagtaat tggaatgaat tttaatcaaa taaaagacag aaatttgtag gagaggagag 2700
cagatgattg atattcactg ccatatttta ccggggatag atgatggagc taaaacttat 2760
gaagatactt tgaaaatgct gaaatcagca attgatgaag ggataacaac tatcactgcg 2820
actcctcatc ataatcctca atttaagaat gaatcaccgc ttattttgaa aaaagttaag 2880
gaagttcaaa atatcattga cgaacatcaa ttaccaattg aagttttacc cggacaagag 2940
gtgagaatat atggtgattt attaaaagaa ttttctgaag gaaagttact gacagcagcg 3000
ggcacttcaa gttatatatt gattgaattt ccatcaaatc atgtgccagc ttatgctaaa 3060
gaactttttt ataatattca attggaggga tttcaaccta ttttggtcca ccctgagcgt 3120
aatagtgcaa tcattgagaa ccctgatcta ttatttgatt ttattgaaca aggagtacta 3180
agtcagataa ctgcttcaag tgtcactggt cattttggta aaaaaataca aaagctgtca 3240
tttaaaatga tagaaaacca tctgacgcat tttgttgcat cagatgcgca taatgtgacg 3300
tcacgtgcat ttaagatgaa ggaagcgttt gaaattattg aagatagtta tggttctggt 3360
gtatcactaa tgtttcaaaa taatgcagag tcagtgattt taaacgaaag tttttatcaa 3420
gaaaaaccaa caaagatcaa aacaaagaaa tttttaggat tattttaaaa ggagtaaaag 3480
gagtaaaagg agtaaataat ggaagttttt gaagatacct catcacctga accgaaagaa 3540
gaaaagttag tagaactaaa aaaattttct cacagagaaa tatttattaa aagaggaatt 3600
gacattttag ggggattagt gggttcaatt ttgtttctta ttgcggctgc attgctttat 3660
gtcccttaca aaatgagctc ggaaaaagat caagggccaa tgttctataa acaaaaacgg 3720
tatggaaaaa acggtaaaat tttttatatt ttaaaattta gaacaatgat tcttaatgct 3780
gagcagtatc tagagctaca ttcagaagtt aaagccgcct atcatgccaa tggtaataaa 3840
ctagaaaatg atccacgggt aacgaagatt ggttcattta ttagacaata ctcagttgat 3900
gaattaccac aatttatcaa tgtccttaaa ggagatatgg cattagtcgg tccaaggcca 3960
attcaacagt ttgaagcgaa agaatttggg gagcgcctcc cttatttact gatatgtaaa 4020
cctggaatta ctggttattg gacaacacat ggtcgcagta aagctccttt tcctcaacga 4080
gcagatttag aactctatta tctccaatat cacagcacca agaatgatat caagcttctt 4140
atgcttacaa ttgcacaaat tattcacgga tcggacgcat attaaaaaac aatgaaaaaa 4200
aagaaattat tactaataag tcaaagcgga agaggtggag taaggaggca tttgtgtgat 4260
cttatgctta acctcgatta tgaaattttc gaggtatggg ttgcttacaa tgatgatgct 4320
attgatgata tatttagaca aacaatagag caattatcag gaaaaattac tcctatacta 4380
ataaataatc ttgtcaggga attaaattta aaggaggata taaaagcata tttaaaatta 4440
agcaaactaa taaagaaagt caagccggat attgtacatt gtcacagttc taaagctggt 4500
gttattggtc gtttagctgc caaaagacga ggtgttaaaa aaatatttta tacgccacat 4560
gcttattcgt ttttggcacc tgaatttagt ggaaagaaaa agtttctttt tgttcaaatt 4620
gaaaagtttt taagccgatt ttcgacaact cagacatttt gtgtgtcaat aggggaaatg 4680
caagctgctc ttgaagtaaa tctagataaa accgataagt ttcaggtaat ttataatggt 4740
ttgccagaaa ttgatttacc aagcaaagaa acgattcggg cgcaattagg actggaaaag 4800
acagtagttg ttataggcaa taacgcaaga atgtcggaac agaaaaatcc tatgtttttt 4860
atggaaattg cccaaaaaat gattagacaa aacgcaaatt ggcattttgt gtgggcaggt 4920
gatggtcagc ttatgccact ttttcaatca tttattaagc aaaatggact agagaaaaat 4980
attcatttgc ttggggagcg tcctgatagt gaaacagttg tgacagccta tgacatcttc 5040
ttgacgactt cccaatatga aggtttacct tatgcaccaa ttgaagcgat gcgagctggt 5100
gtcccgattc ttgcgacaaa tgttgttggc aatagtgagc ttgtgataga gggaaaaaat 5160
ggttatttga tcgacttaga gtggtcaaaa tctgtcgaag aaaaattata taaggcagcg 5220
aaaatggatg cacaaatgat taaagcagat tttaggcaaa ggtttgcgat tgatcagatg 5280
ttaaagcaaa ttgaaacaat ttatttagct tgaatgaaga aagtaaaaag aatggataag 5340
tgtaaatgta tatacataat ctaatttccg tcattattcc agtatataat gtagaaaaat 5400
atttagaaaa gtgtttgaag tctgttcaaa atcagagtta tgcgcatttt gaagtgatct 5460
taatcaacga tggttcaacg gattcttctt taaaaatttg tgaggcattt atcaaaaaag 5520
ataagcgctt ttctgtttta acaaaagaaa atggtggact ttcttcggct cgaaatttgg 5580
gtttaaaaaa aatcagggga aaatatgtga catttgtgga tagtgacgat tatctatcag 5640
agcattatct taaacatttt gtgagtggta tagagagtga aaagagtatc gtttgttcaa 5700
aatttcttct tgttgatgaa aatggtgttt ttctttctaa aagacagaga attcaagaaa 5760
aaaaacttat tttttctaaa gaagaaggca taaaagaaat tttattacaa aataaaatgg 5820
atcactcagc ttggggaaaa ttatatccga tatctttttt tgaaaatatc acttttccag 5880
atggaaaatt gtttgaagat atgggaacga cctataagtt attggcttta gctaacgaag 5940
ttgtattttt agatgaatat gattattatt atcttcaaca acccaatagc attatgaaca 6000
gttcatttaa tttaaaaaaa ttagatatta tagatatgtc aaaggaaatg attaaagata 6060
tcgttaatac ctgccctcaa cttgtgaatt atgctaaaaa tagagcattt agtgcagagg 6120
caggtatctt tttagatgtg ccaaatacta aagcgtttga atcggcgcaa aagctgcttt 6180
ggaaagaagt aagagaaaat agatatgcac catttttgat aaaaggggct agacttaaga 6240
ataagttagg tgctattttg tcgtttttgg gtaggagatt ttttttgaaa ctcgggaaac 6300
agttggtagg taaataagga ggcaacagat ggcaatttat tttttacttt tcccgatgat 6360
cgcaatgatt tatttaatga cattgctctt acgacaaaaa gcacaaatcc aaaaaacgat 6420
tttttgtgtt cttacgtttg gtacactagg ctttatttca gcaagtcgtg cttcaagtgt 6480
tgggacagat gttacgctat acgaaaatat ttttaaatct ataaattacg ggataagtgc 6540
tgaaaataat ttgggatatg tcatctataa caagttgatt ggtagtgtat ttggctatac 6600
gggacatgaa atcacagctg ctaattctgt tttgattacg atacttattg gtttttttat 6660
ttggaaagta gcggaacatt attttgttgc gacgttttta tacattagct tgttttatta 6720
tgctacaagt tttaatattt caagacaatt tattgccatg gggcttgtat tggtagcaat 6780
ttcttttgct ttagataaaa aggttatgcc ttggtttatc ttgacagttt tggctacctt 6840
atttcatgcg acagcaatcg tttcttttcc tgtctattgg cttacaaaag tacattggga 6900
tgtgaaaaag acattaggta tttttccaat tacgattttt gcaagtttta tttttgatgc 6960
tattttaaac atttttttac gttttttccc acattatgag atgtatatta ctggaacaca 7020
atttaatatt gcagatcagg ggcagggacg tgtggttttg gtcaaaatat ttatcttgct 7080
cattttgttt actttaatct tgttttataa aaaaagctat gctttgattt ctgaatgtca 7140
tcaaagtttg atagctttga caaccgttgg attaagtatc ggtattgtat tttataataa 7200
tattttactc aatagaatag aaatgtttta ttcaatttta agcatcgtat ttattccaat 7260
tgctatagat tactttagtt tgaaatttaa agaaaaagat actgtgcgac aaatgctgac 7320
gataggtatt ttgttaatta cacttgtgcc ttactatata caggttagcg gtaattattc 7380
aggaatattg ccatatacga tgaataaata gaaatatgtg ttaaagaaca taggagaaaa 7440
aatgaattat tctttttgta caacattctt ttttggaaaa aataaaaacg gatataaaaa 7500
tatgagtagt aactcattag agcaacatga tgagacatat ttgaaaactt ttgttacgtt 7560
atttgtcagt ctgaaaagat atagctttga aaaaaagctt tttgtaaatg atttagatag 7620
gttaaagagg gtaagagagg gaaaatatta ttcaatttta gtagaagaat tagaggtacg 7680
agtatacgaa gtatcctcta agtttgttga tgaaagtaaa gagtgggcgg gctcaatgtt 7740
catctttgat gtgttatctt ttatttattt aaataagcaa cagttttcag ttgataaatg 7800
gtttttcgta gatagtgatg ttgttttctt tgattatatg gaagatactc ttaaattact 7860
aaatcaatat gatcttgcta gctatactca atggcaagag tttagaatgt taaatctttg 7920
gacagaggat tttcatggtg cagatttttc aaaactagat tcctctatta cacctttagg 7980
tggagaattt ttgatgcttg atactgctag gattgaaata tttttaagta agttcaaatc 8040
attttattgt cagtttcaag atgtgatgca tacagaagaa aattactaca gtttgattgt 8100
agattctctt gttaaagaag gttataaaaa ttatgtagta aatccttttt tcaaaagatc 8160
attagcctta aatagagctt ttacggataa atactgttgg ggtgttcatt ttccaggaca 8220
aaaaaattat aaattgaagt atttatatag tgcagtcaag aaaaataatt ataatatgga 8280
catttctaaa tctaaaaaaa taatgggttt gaatacacat tggaatatat atgatttttc 8340
tttaatcatt aaaacaatga ggaatttaaa aaataaaatt gtaaaacagt aaactataat 8400
atctaaacta ttactgactt aaaatctaaa gaaaaaattg tatagattca cattattacc 8460
tgtactttgc ctattatata caggttagtg gaagttattc aggtatattg ctgtatatta 8520
tttagccatg acaatatgag aactgtgaag ataaagtatg aaaaagatag aacttattgt 8580
aaaagatatg gagatggaag ggtttcagaa agttacgtca gttgtaacca gtaccctatc 8640
ttcaaaattt gaagtttcta ttttatctat ggcatagacg gattcttttt ttgaactatc 8700
tgtaccatta gaaacaatca agtttactga acgaaaattg ccttgggtat ttaggaaaat 8760
acttactaaa tttaggtatg aggttgtgtt aacagataaa ttacatggta tgattttttc 8820
atatatcaca ggcacaccat gtattgtttt ggctaatgat aatcataaaa ttgaagaaac 8880
atacaaacat tggttgaata atgcgaatta tattcgtttt attgaaaaaa cgactgttga 8940
aaatatttta gatgcaataa atgaattgaa gcaaatcaat atattgaggc aatcaagagt 9000
ttagaaatca gagagtaatc tgttggagga aataaattat atgatgaaga tatcagttat 9060
tgttcctgtt tataatagtg aaaacacaat tgaaaagtgt ttaatttcat tgcaaaagca 9120
aacatataaa aatttagaaa ttattgtaat aaatgatggt tcagtggata caacagaaga 9180
taaaatagta agaattatag agaatgataa aagatttatt tattttaaaa ctgccaatca 9240
aggacaatct gaagcgagga gttttggatt gagtaaagca actggagagt taataggatt 9300
cgttgattca gatgatttta tagattatga tatgtatgaa atattagaaa aaaatatgag 9360
agatacaaaa tcagatattt caattataag atcgataata tcatttccaa atggattcga 9420
aataatacct agctgtcaaa atactttttt tatcaaaaca ggtaatgaaa tgatatttga 9480
gtatattggt ggttttcatt ttggggttgc cctatgggat aaactttata aaagaagttt 9540
atttgatgga ttaaaattag atacttcttt taatttaatg gaagatgctt taatgggtaa 9600
ctatgtgttt aataaagcga aaaaaattgt ttatacagga aaagcaaagt atcattattt 9660
gcagagaaaa aacagtacag ccagaaaaaa tttggaagat agtgatttaa aggcaatagg 9720
tgttgtcttg ggtatgaaag agctttatac tgatagcatt gagttagata aagcatttca 9780
acgcagattt gcacaaacaa tactggaatt actcagcaaa aatcccacaa aagaacaacg 9840
aaaacaaata gagagagctt tgtttgaaac aatagaatta gataaattgg gttatttaaa 9900
aaagggtgat aagttgctaa taagattgat ctattataaa tttccgtcat ctcccttaat 9960
tcaagccaaa aaaatgattg gacgaacagt aagaaaattt aagaagatat agaataataa 10020
agcaaattga gatgatgaat cttaattttt caatcaataa ctcaattgtt taggtgagat 10080
tgaaaactaa aacaagaagg aaaaggaaaa ggaaaagaaa aagaattgaa aatagagaat 10140
aatgaaaagg atggaagggg tgaaagataa ataaatataa aaattattgt ccaatgcaag 10200
tttattattg ccgtttatag gcatgaatat ttttgatgct actttgagat ttgctatgga 10260
taaatcagtt accaaagaaa aagttctgag taattcttct atggtctgat gtttgaggat 10320
tttattttca ctaaggagta tgtagcatcc atagaaagta gtgcatttgt caaaaaaact 10380
ggttactttt atgatataag cgatatgaat tcagcaacta aaaaatatag agaaaataaa 10440
ttagacgagc agcttatttt agaaaaagga atcgaatatt tcttgtaaaa aatatgatat 10500
tccattgaaa aaaattgtag atatacgaaa agtaaatatg tgtttagatt cattattaaa 10560
tttagtaaaa aggatattga agagctaaaa aacagagtag attcggtatt gaaagaagtc 10620
agctttaagt tatcattaaa atatcaaata aaaatatttt ttattcataa tttttggttg 10680
attaagatga tatataaata aaaagtaaca gtaacggagg gaaaatgaaa aattatgtaa 10740
atggttggtg gaaaaccaat ttaggagatg atcttttctt acatataata tgtgaacgtt 10800
atagaaatca aagttttttt ataacttgtg aaaaagagga gatgactgtt tttcaacatt 10860
tgaataactt aaaaattatt gaagagaata aaagttctgt cgttattaaa ttgttttcga 10920
agtttttaag agtaatgtat ttttggatac cattaagtgt tttgaaagaa tggatagaat 10980
tattcttttt aagaaaaaga ggaatatctg ataaaaatgt agtggttatt gaaattgggg 11040
gctccatttt tatgatgcca aaaagaaaag atatctcaat gacagagggg tattttttac 11100
gtaatattga attaaaaaac tttcctaact attatgtagt aggaagtaat tttggtccat 11160
tttattttca agagcaagta gataaatata aagaactatt ttctaaaatg caggatgtat 11220
gtttcagaga tacatattca aaaaaattat ttccaaatct agatacagtt agaagcgcga 11280
cagatgttgt catgagttta aggatagagg attatcaaca aatcccagag aaaaaacaaa 11340
ttataatttc tgtgattgat gtattatcta aagaagatac aggattagag ggtaaacatc 11400
attttgcaaa taaatatgaa aaatttattt tatctgtaac agaagattat gtaaagaaag 11460
gttataaagt agtattgttt tctttttgtg atttccagaa tgatcattta ttttctcaga 11520
aaattttcaa tcagttaaat aagacaataa gatctaatgt tgaacttttt tctcataaac 11580
aaataaataa atcactctct aagattgctg aaagtgaaaa aatcatagcg accagatttc 11640
acgcaatgat tttaggatgg ttatttcaaa agcctacttt tgtaatctcc tacagtcaaa 11700
agacaactca agtgattgaa aatagcttta ataagcaaac atttgttgac tataataagg 11760
tagaaaagtt gaatcttaat aatatggatg attattttgt taaaattgat gatttgacca 11820
aagaatattt aattaatgat gcgcaaaatc aatttagagg gttagactca ttattgcaat 11880
cataatcttt aaatcaaaga agagagaaaa tgaataaata caaaaaactg ctatccaact 11940
cactcgtttt cacaatagga aatttgggta gcaaactgtt agtcttttta ctcgtaccac 12000
tctacactta tgcgatgaca ccgcaagagt atggtatggc agacttgtac caaacaacag 12060
ccagtctact tttgccattg attacgatga atgtgtttga tgcaacttta cgttttgcca 12120
tggaaaagtc aatgacaaaa gagagcgtct taacaaattc tcttgtagta tggtgtttta 12180
gcgctgtgtt ctcttgtttc ggcatgattt ttgtctatgc actgaacttg agtaataaat 12240
ggtacttagc cctacttttt cttatcatct tattccaagg tgggcaaagc atactaagtc 12300
aatatgcgag aggcattggg aaatcgaaat tatttgcagc tggcggagtt attttaacct 12360
ttttgacagg tgctttaaat atcttcttct tggtattttt acatgctgga attacgggct 12420
acctcatgtc cctagtttta gcgaatttag ggacaatcct tttttttgcg gggacacttt 12480
ccatttggca ggcaatcaat tttaaagtaa tcgataagga aatgatttgg caaatgctct 12540
attatgcctt acctttaatt cctaatgcca tcatgtggtg gtcactgaac gcttctaatc 12600
gctatttcgt tttattcttt ttaggagcag gtgctaatgg ccttttggcg gtcgctacca 12660
aaatcccaag tattatttca atttttaata cgatttttac acaggcatgg caaatttcag 12720
ccatagaaga atataattct catcaaaaat caaaatatta ttcggatgtt tttcactact 12780
tagcaacttt tctattgtta gggacatcag cttttatgat tgtgcttaaa ccagttgtcg 12840
aaaaagtcgt ttcaagtgac tatgcaagtt catggcaata tgtccccttc tttatgttgg 12900
cgatgttatt ttcctcattt tctggatttt ttgggaccaa ttatattgcg gccaaacaaa 12960
caaaaggcgt atttatgaca tctatctatg gtgccattgt ttgtgtccta ctccaagtgg 13020
tgctgctacc caccattggc ttgaatggtg ctggattagc ttcaatgcta ggattcttga 13080
caacattttt attgcgtgtt aaagatacgc aaaaatttgt ggcgattcag attaaatggc 13140
gaatttttat cagtaattta ttgatcgttt tggcacaaat tttatgtttg ttttatctac 13200
cgagtgaatt tttgtatttt ggtcttgccc ttttgttttg tggcatgtta gttgttaatc 13260
agcgtacaat tttatacatt atcatggcgc taaaaaataa aaaataagat atttgggatg 13320
aaatcctcat ataaatagac agtagatgta tctcgatacc attcgagttg catctactgt 13380
ctatttttag tatgctttta ggttagctca actcaaccgc ctcttaatct cccaacaaca 13440
ataaaaccca atcaaacacc ccaaaaaatt caagagaata tcactgatgg caaatgtgcc 13500
cagatagaaa acaaactgaa tggtttcaat tcctaaaagt gtgaccaaac tgacaatgac 13560
aaactgtttg aaatcagtat tgataccata aaagccacct aaaggaataa agtagagaat 13620
atttaacacc gcctcctgga tcgttctggt atccgttttg ataaaatcaa aagggttcag 13680
tgatatcgct tgaaaatccg atgttttagt aaagagcacc atgaatagca gtaatgcata 13740
cacactgaag gccagataga gataaataac tgaaaatcgt ttgaggtgat actggatgcc 13800
aaacaaccag acgatcagcg ttaataagag tattaaagtt aatgcggtat ggtcaaaatg 13860
gtcaatgacc ttaagcagat ttggatagcg tgtgagaacg ggcatgatca gccaagttat 13920
cgtcgcataa ctcaggctaa atgtgaccaa taaactgctg aggtagatca tatattttcg 13980
caactgtttc taactccttt tcttgatgag attaacccta ttttaacata ttttaaaact 14040
gtcatgtttt tatgaattta aaataaaggg cacctctaat aactaccaat taattcacct 14100
ctaaatgaaa ttagaaaaaa acacagaaaa ctaagcatag tctactgtgt ttttctttat 14160
gttacagacg ccacttgact gtttattctt attttagaca gattacctcc atgttttgag 14220
atttaaacgt ttaacttgtt cgtacctcaa taagttataa gtcagatttg taaggtcagt 14280
atttaaaaca gctcgatcga atccaataga acgtaaacta gaaccatgca ttgaattttc 14340
aacaaagcca aatacatgtt caatacggac acgtattttc gaaatgattt tattaaacat 14400
tttatcgtct gccttaagag atttagaacg tgtgttttta agacaggtaa aaagttcagc 14460
acctttaggt gttgcttgat tttgataagc tgagtctgct aaggtgatct catcgggatc 14520
aacaagaacg cctattactt gagaatcatg cacatttgca ggcattgttt gataattctt 14580
tacgaatttt gatttggtat ctatagcaat atgatttttg tagccatagt gtctctcatt 14640
acctttaatt gtccagcgag cagctgtatc cttctgtgct cttttgttct tcgtccaatt 14700
cactggtacc ctatttgctt taatcaattc attttcgtct tttggattac gttgtttagg 14760
cgcctctatg aatgttgaat caacaatctg tcctttatga gcgatcatcc cttgtgattc 14820
aagtttttct tgaaaggcag aaaaaagcca gttccctcta ttagattttg atagctgatt 14880
tctaaagttc cagatagtct ttgcgtctgg taccttgtca tcaatcttaa ggaatcttct 14940
aaacgaaatc cggtcaatca tttgatattc cattgcatca tcagatagat tgtataagcg 15000
ttgtaaaatt agaattttca acattaaaac aaggtcatag ggcggtctac caccatgaga 15060
tttgtttttt aaatcatact taaaaatccg gttaagagtt gggcgaaaac actcgaaatc 15120
aaccactttt tctagccgct caagagggtc tccttttaaa cttaattttt caagatagtc 15180
gctatctcca aataaattca tcttcaatac ctcgttagct tatttttaat ctattatacc 15240
tcttttcata cgtttttaga ggtgccctta aacttatatc tttatttata cgtaaaattt 15300
cttgcagttt aaagtttgaa acccatacct tcacttgcat taaaagattt tatactatat 15360
aatatattaa ttagatacaa gtatatctac tgtgtgatgt taagccaagt ccttagagtc 15420
aagagtgtat aattttaata tattttttaa gctatcactt tttcaaatta attgggggta 15480
agatatccta aactttgatg aattcgtctt gataaaacgt ttcaatgtac caaaaaatac 15540
tctgataggc tgcttcaaat tttttgcagt agttcctcag tcattcgctt acccaaattc 15600
caagcaatga tttttttagc ataacgattt ataatagttg agagataaca ccaccctccg 15660
aagagtggtc ttgtaaggaa tatcggttga ccaaacctta tttttttctg tcgggtaagt 15720
atgtatgata ttctttcggt taacaggatc atatatgaat atcctggttt aaattttttt 15780
atgaccacag aatatagttg aagttgcttc attaactttt gtaccagttt taaactgttt 15840
ttttccatgt ttaagcaaga ggtgagaatc ttaggcactc tatagatgac ttgttcttcc 15900
ttaactttgg ccggtgtctt tttaattaca tttcatctac ttttgagaca tagatttcat 15960
taaacttgga gtagaggtat actctttaga aagctgagtg atgaatcgtt ctgagtggta 16020
ggactcgacc agggtttctt taaattcttt tgaatatcgt ttttgcatga ctttttactt 16080
tgtctaaatt atacaattcc cagtctaaga tgtaagaata atatcagact acgcaatgtt 16140
ttttatattt tgtatataat ttagtgatca ctaggatatc actaaattac tctaacttca 16200
ttatattacc tagaccaaat gtgttatagc gaaataagta aaaattatag ttattacttt 16260
tgatattatt ccagaagaag tgaattaaat ttagttcagt agtagataga aaattagagt 16320
ttttttgaca aattgagatt tgaatttgct tgcaatgaga gttgcagtta ttgcgcttaa 16380
ggaaatcggt tgttgtaaca gcaactaatt atgatgatgt aaggacatat tttaacatac 16440
gttttgtacg tgatttacat acattcgaaa agtttagatt tgatcgtgac taaacaacaa 16500
ttttggagga aaaattggaa cgaaaaaaaa agaaaaaaaa gaatatttgg gttataatta 16560
tacctatctt aatttttatt acccttatag gagcaggggc ttatgcctta agaaattcac 16620
ttattcctac tgatcatacg aaaacaaata gttcggatca accgcccaaa acttcggctt 16680
ccaacggtta tgtagaacaa aaaggcgaag aagctgccgt aggtagtata gcacttgtag 16740
atgatgctgg tgtaccagag tgggttaaag ttccctcaaa ggtaaatcta gataaattta 16800
ctgatttatc tacgaataat atcactattt atcgaattaa caatccggaa gtcttaaaaa 16860
cagttaccaa tcgtacagat caacggatga aaatgtcaga agttatagct aagtatccta 16920
atgctttgat tatgaatgct tccgcatttg atatgcagac aggacaagta gttggatttc 16980
aaattaataa tggaaagttg attcaagact ggagcccagg tacaacgact caatatgctt 17040
ttgttattaa caaagatggt tcgtgcaaaa tttatgattc aagtacacct gcttcaacta 17100
ttattaaaaa cggagggcaa caagcctatg atttttatgg tactgcaatt atccgtgatg 17160
gtaaaattca accaagtgat ggctcagtag attggaagat ccatattttt attgcgaatg 17220
ataaagataa taatctctat gctattttga gtgatacaaa tgcaggttat ggtaatataa 17280
tgaagtcagt gtcaaatttg aagctccaaa atatgttatt acttgatagt ggcggctcaa 17340
gtcaactatc tgtcaatggt aaaacgattg ctgctagtca agacgatcga gccgtaccgg 17400
attatattgt gatgaaataa aaataaaaga acctcttggt tcttttattt tagagatttt 17460
tcaaaaaggg ttttgactga gtctaattct gtttgagaaa cgaccttagc tccattttca 17520
tctgttgtat gtagattgag cttgctggta ttctttagag ccttattgta gctcataacg 17580
atcgttttag catcattgaa acttatattg gttttaacag tgtttgaaaa agcatagaga 17640
atttctcgat agtaattgaa actttcaagt tttttcatat gagcaatttt ttgaatattt 17700
ccgtagagtt ccattgagac attttgaatt cgagtgattg aagcatccaa atcagtatcg 17760
tcaatttgtg tcatataggc ttggacttga tcagcagttt gtaaattaac agttccttgt 17820
ttaaactcat aaccttcagc attgaatgcc tttggatttt gcatggtgat tccaccagtg 17880
gcctgtacaa gtgatcccat tttattaaca tcaatctgaa ttactttatt aatggacata 17940
ttcaataggt ctttaaccat ctggaaaatt ccatcatctc cattcgtatt gtaaacttca 18000
gtgattgttt tttgattagg cattgtcgca aaaactggga agttcatgaa agtagtttga 18060
tttgtcttta cattcgttga agctaaaaca gtagcataag ctgtattttt agaattattt 18120
ttaccagttg caatgataag tgtggtgaat gttttagact tttttaagtc gatacttgtt 18180
gttttaggga aattttcata tgatgttgaa aatgttgatt caacatttct ataagcggcg 18240
taggctatag aagcaacagc gataattact aatacaaaaa taattgaaat aacttttagt 18300
actgtgtgtt ttttcttacg ataatgatgc ctcttttttt gattcatatg ctctagaatt 18360
taaaa 18365
<210> 93
<211> 1050
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative NAD-dependent epimerase/dehydratase family protein 1
<400> 93
atgagtatag taactttaga taaaaaaacg attttagtca caggtgcagc cggttttata 60
ggctccaacc ttgtgaaacg aatctatcag gaggctcctt ctgctacggt catcggcatc 120
gacaatatga atgcctacta tgatgtggca ctgaaagaat tccgcctgaa cgagctggcc 180
aagtatccca cattcacttt tgtaaaaggc aacatcgctg ataaggcact gatcaccgag 240
ctgttcgaga agtacaagcc gtctgtggtc gtcaaccttg cagcacaggc tggtgtgcgc 300
tactccatca ccaacccaga tgcttatgtg gaatctaact tggtcggctt ctttaatatc 360
ctcgaagcct gccgtcattg tgagagtctg gagcatttgg tttatgcttc ttcctcctcc 420
gtctatggtt ctaataaaaa ggttccatac agcacggatg acaaggttga caatccggtt 480
tccctttatg cagcaaccaa gaaatctaat gagttgatgg cacacgcata ctccaagctc 540
tacaacattc cttccactgg cctgagattc tttacggtgt atggccctgc aggtcgcccg 600
gacatggctt acttcggatt caccaacaag ctggtgaagg gcgaaaccat caaaatcttc 660
aactatggca actgcaagcg tgattttact tatgtggatg acatcgttga gggcgttgtt 720
cgtgtgatga agaaagcacc agacaagaag aatggtgaag atggtcttcc gattccgccg 780
tatgcagttt acaacatcgg caatcagaat ccggagaacc tgctggactt tgtgcagatt 840
ctgagcgagg agcttgttcg tgcaaaggtg ctgccggaag attacgattt cgaggctcat 900
aaagagctgg tcccgatgca gccgggtgat gtgcctgtga cctatgcaga tacgagtgca 960
ctggagcgcg acttcgggta caagccgagc acaagtctgc ggactggatt aagaaagttc 1020
gctgagtggt acgctgagtt ttataaataa 1050
<210> 94
<211> 349
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_NAD-dependent epimerase/dehydratase family protein 1
<400> 94
Met Ser Ile Val Thr Leu Asp Lys Lys Thr Ile Leu Val Thr Gly Ala
1 5 10 15
Ala Gly Phe Ile Gly Ser Asn Leu Val Lys Arg Ile Tyr Gln Glu Ala
20 25 30
Pro Ser Ala Thr Val Ile Gly Ile Asp Asn Met Asn Ala Tyr Tyr Asp
35 40 45
Val Ala Leu Lys Glu Phe Arg Leu Asn Glu Leu Ala Lys Tyr Pro Thr
50 55 60
Phe Thr Phe Val Lys Gly Asn Ile Ala Asp Lys Ala Leu Ile Thr Glu
65 70 75 80
Leu Phe Glu Lys Tyr Lys Pro Ser Val Val Val Asn Leu Ala Ala Gln
85 90 95
Ala Gly Val Arg Tyr Ser Ile Thr Asn Pro Asp Ala Tyr Val Glu Ser
100 105 110
Asn Leu Val Gly Phe Phe Asn Ile Leu Glu Ala Cys Arg His Cys Glu
115 120 125
Ser Leu Glu His Leu Val Tyr Ala Ser Ser Ser Ser Val Tyr Gly Ser
130 135 140
Asn Lys Lys Val Pro Tyr Ser Thr Asp Asp Lys Val Asp Asn Pro Val
145 150 155 160
Ser Leu Tyr Ala Ala Thr Lys Lys Ser Asn Glu Leu Met Ala His Ala
165 170 175
Tyr Ser Lys Leu Tyr Asn Ile Pro Ser Thr Gly Leu Arg Phe Phe Thr
180 185 190
Val Tyr Gly Pro Ala Gly Arg Pro Asp Met Ala Tyr Phe Gly Phe Thr
195 200 205
Asn Lys Leu Val Lys Gly Glu Thr Ile Lys Ile Phe Asn Tyr Gly Asn
210 215 220
Cys Lys Arg Asp Phe Thr Tyr Val Asp Asp Ile Val Glu Gly Val Val
225 230 235 240
Arg Val Met Lys Lys Ala Pro Asp Lys Lys Asn Gly Glu Asp Gly Leu
245 250 255
Pro Ile Pro Pro Tyr Ala Val Tyr Asn Ile Gly Asn Gln Asn Pro Glu
260 265 270
Asn Leu Leu Asp Phe Val Gln Ile Leu Ser Glu Glu Leu Val Arg Ala
275 280 285
Lys Val Leu Pro Glu Asp Tyr Asp Phe Glu Ala His Lys Glu Leu Val
290 295 300
Pro Met Gln Pro Gly Asp Val Pro Val Thr Tyr Ala Asp Thr Ser Ala
305 310 315 320
Leu Glu Arg Asp Phe Gly Tyr Lys Pro Ser Thr Ser Leu Arg Thr Gly
325 330 335
Leu Arg Lys Phe Ala Glu Trp Tyr Ala Glu Phe Tyr Lys
340 345
<210> 95
<211> 897
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative glucose-1-phosphate thymidylyltransferase RfbA protein
<400> 95
atgaaaggcg ttattttagc cggaggttca ggcacacgtc tttacccatt aacgaaagta 60
acaagtaaac agcttttgcc aatctacgac aaaccgatga tttattatcc tatgtctgtt 120
ctgatgaacg ctggcatccg cgatattttg attatttcca caccgcagga tacacctcgc 180
tttgagaatc tgctgggtga tggacaccag tttggtgtga atctgaccta tgcggttcag 240
ctgtctccgg atggactggc acaggccttt atcattggtg ccgactttat cggtgccgat 300
tctgtagcta tggtgctggg cgataacatc tttgcgggcc acggattgaa gaagagatta 360
aacgcagcag tggaaaaggc agaaaacggc aagggtgcaa cggtgtttgg ctactatgtg 420
gacgatccag agcgttttgg tatcgttgag ttcgataaga acggtaaggc catttctatc 480
gaggagaagc cagaacatcc gaagagcaat tactgtgtca ccggtttgta cttctatgat 540
aaccgtgtgg tcgagtttgc taagaacctg aagccgtccg ctcgtggtga attagaaatt 600
accgatttga accgtattta tctggaagat ggtactctga atgtagaatt gctgggtcag 660
ggcttcactt ggctggacac tggaacacac gagagccttg ttgatgctac caactttgta 720
aagaccgtgg aacagcatca gcatcgtaag attgcctgtc tggaggaaat cgcatatctg 780
aatggctgga ttagcaagga tgagctgatg gaggtctatg aggttatgaa gaagaaccag 840
tatggacagt atctgaagga tgtcatggac ggcaagtatc aggagcattt gtattaa 897
<210> 96
<211> 298
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_rfbA
<400> 96
Met Lys Gly Val Ile Leu Ala Gly Gly Ser Gly Thr Arg Leu Tyr Pro
1 5 10 15
Leu Thr Lys Val Thr Ser Lys Gln Leu Leu Pro Ile Tyr Asp Lys Pro
20 25 30
Met Ile Tyr Tyr Pro Met Ser Val Leu Met Asn Ala Gly Ile Arg Asp
35 40 45
Ile Leu Ile Ile Ser Thr Pro Gln Asp Thr Pro Arg Phe Glu Asn Leu
50 55 60
Leu Gly Asp Gly His Gln Phe Gly Val Asn Leu Thr Tyr Ala Val Gln
65 70 75 80
Leu Ser Pro Asp Gly Leu Ala Gln Ala Phe Ile Ile Gly Ala Asp Phe
85 90 95
Ile Gly Ala Asp Ser Val Ala Met Val Leu Gly Asp Asn Ile Phe Ala
100 105 110
Gly His Gly Leu Lys Lys Arg Leu Asn Ala Ala Val Glu Lys Ala Glu
115 120 125
Asn Gly Lys Gly Ala Thr Val Phe Gly Tyr Tyr Val Asp Asp Pro Glu
130 135 140
Arg Phe Gly Ile Val Glu Phe Asp Lys Asn Gly Lys Ala Ile Ser Ile
145 150 155 160
Glu Glu Lys Pro Glu His Pro Lys Ser Asn Tyr Cys Val Thr Gly Leu
165 170 175
Tyr Phe Tyr Asp Asn Arg Val Val Glu Phe Ala Lys Asn Leu Lys Pro
180 185 190
Ser Ala Arg Gly Glu Leu Glu Ile Thr Asp Leu Asn Arg Ile Tyr Leu
195 200 205
Glu Asp Gly Thr Leu Asn Val Glu Leu Leu Gly Gln Gly Phe Thr Trp
210 215 220
Leu Asp Thr Gly Thr His Glu Ser Leu Val Asp Ala Thr Asn Phe Val
225 230 235 240
Lys Thr Val Glu Gln His Gln His Arg Lys Ile Ala Cys Leu Glu Glu
245 250 255
Ile Ala Tyr Leu Asn Gly Trp Ile Ser Lys Asp Glu Leu Met Glu Val
260 265 270
Tyr Glu Val Met Lys Lys Asn Gln Tyr Gly Gln Tyr Leu Lys Asp Val
275 280 285
Met Asp Gly Lys Tyr Gln Glu His Leu Tyr
290 295
<210> 97
<211> 1020
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative dTDP-glucose 4,6-dehydratase protein
<400> 97
atgaatatta ttgttaccgg cggtgcgggt tttattggta gtaactttgt gttccacatg 60
ctgaagaagt atccgggtta tcgaatcatc tgtttggaca agctgaccta tgcaggcaat 120
ctgtccacac tggcccctgt tatggataac ccgaatttcc gcttcgtgaa ggctgatatc 180
tgtgaccgcg aagcagtgaa taaactgttt gaagaagaac atccggacat catggtcaac 240
tttgcggcag agtctcatgt tgaccgttct atcgaagatc ccggcatctt ccttcagact 300
aacatcatcg gtaccagtgt gctgatggat gcttgccgca agtacggcat ccagcgttac 360
catcaggttt ctactgatga agtttacggt gacctgcctc tggatcgtcc tgacctgttc 420
ttcaccgagg agactccgat ccataccagc tctccgtata gcagctccaa agctgctgct 480
gacctgctgg ttctggctta ccaccgtacc tacggcctgc ctgtgaccat ttcccgttgt 540
tccaacaact atggaccgta tcacttccct gagaagctga ttccgctgat gatcgctaat 600
gctctggctg acaagccact gcctgtttac ggcgagggtc tgaacgtccg tgactggctg 660
tatgtggaag atcactgcaa ggccattgat ctgattatcc acaagggtcg tgttggtgaa 720
gtctacaacg tcggcggtca caacgagaag cagaatattg agatcgtgaa gattatctgc 780
aaggagctgg gcaagccgga aagcttgatc actcatgttg gtgatcgcaa gggtcacgat 840
atgcgttatg ctattgatcc gaccaagatc cacaatgagc tgggctggtt gccggagacc 900
aagtttgagg acggcattaa aaagaccatc cagtggtatc tcgataatcg tgagtggtgg 960
gagaccatca tcagcggtga gtatcagaac tattatgaga aaatgtacag caaccgctaa 1020
<210> 98
<211> 339
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_dTDP-glucose_4,6-dehydratase
<400> 98
Met Asn Ile Ile Val Thr Gly Gly Ala Gly Phe Ile Gly Ser Asn Phe
1 5 10 15
Val Phe His Met Leu Lys Lys Tyr Pro Gly Tyr Arg Ile Ile Cys Leu
20 25 30
Asp Lys Leu Thr Tyr Ala Gly Asn Leu Ser Thr Leu Ala Pro Val Met
35 40 45
Asp Asn Pro Asn Phe Arg Phe Val Lys Ala Asp Ile Cys Asp Arg Glu
50 55 60
Ala Val Asn Lys Leu Phe Glu Glu Glu His Pro Asp Ile Met Val Asn
65 70 75 80
Phe Ala Ala Glu Ser His Val Asp Arg Ser Ile Glu Asp Pro Gly Ile
85 90 95
Phe Leu Gln Thr Asn Ile Ile Gly Thr Ser Val Leu Met Asp Ala Cys
100 105 110
Arg Lys Tyr Gly Ile Gln Arg Tyr His Gln Val Ser Thr Asp Glu Val
115 120 125
Tyr Gly Asp Leu Pro Leu Asp Arg Pro Asp Leu Phe Phe Thr Glu Glu
130 135 140
Thr Pro Ile His Thr Ser Ser Pro Tyr Ser Ser Ser Lys Ala Ala Ala
145 150 155 160
Asp Leu Leu Val Leu Ala Tyr His Arg Thr Tyr Gly Leu Pro Val Thr
165 170 175
Ile Ser Arg Cys Ser Asn Asn Tyr Gly Pro Tyr His Phe Pro Glu Lys
180 185 190
Leu Ile Pro Leu Met Ile Ala Asn Ala Leu Ala Asp Lys Pro Leu Pro
195 200 205
Val Tyr Gly Glu Gly Leu Asn Val Arg Asp Trp Leu Tyr Val Glu Asp
210 215 220
His Cys Lys Ala Ile Asp Leu Ile Ile His Lys Gly Arg Val Gly Glu
225 230 235 240
Val Tyr Asn Val Gly Gly His Asn Glu Lys Gln Asn Ile Glu Ile Val
245 250 255
Lys Ile Ile Cys Lys Glu Leu Gly Lys Pro Glu Ser Leu Ile Thr His
260 265 270
Val Gly Asp Arg Lys Gly His Asp Met Arg Tyr Ala Ile Asp Pro Thr
275 280 285
Lys Ile His Asn Glu Leu Gly Trp Leu Pro Glu Thr Lys Phe Glu Asp
290 295 300
Gly Ile Lys Lys Thr Ile Gln Trp Tyr Leu Asp Asn Arg Glu Trp Trp
305 310 315 320
Glu Thr Ile Ile Ser Gly Glu Tyr Gln Asn Tyr Tyr Glu Lys Met Tyr
325 330 335
Ser Asn Arg
<210> 99
<211> 609
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative dTDP-4-dehydrorhamnose 3,5-epimerase protein
<400> 99
atgggacaga ttaaagttga aaaaaacgta ggcggtatag agggactttg tgttattgaa 60
ccggttgttc atggtgactc ccgtggttat ttcgtggaga cttacaacga gaatgatatg 120
aaggaagctg gtataggtat tcactttgtg caggacaatc aatctatgtc cacaaaaggg 180
gtgttgcggg gattacactt tcagaagcaa tatccgcagt gcaaattggt gcgtgttgtg 240
aacggtacgg tgtttgatgt cgcagttgat ttgagaagta attccgaaac ttatggtaaa 300
tggtatggtg ttgttttgtc cgccgagaac aagaaacagt tccttattcc ggagggcttt 360
gcacacggct tcttagttct gagcaatgaa gcagaattct gttataaggt caatgatttt 420
tatcatccga atgatgaggg cggaatggct tggaatgacc ctgaggttgg aattgaatgg 480
ccacaactga agggcgaata caagggcaat gcaagtgcag aaggatatac gctggaagac 540
ggtacagcgc tgaacctgag cgataaggac cagaagtggc tcggactgaa ggatactttt 600
aagttctga 609
<210> 100
<211> 202
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_dTDP-4-dehydrorhamnose_3,5-epimerase
<400> 100
Met Gly Gln Ile Lys Val Glu Lys Asn Val Gly Gly Ile Glu Gly Leu
1 5 10 15
Cys Val Ile Glu Pro Val Val His Gly Asp Ser Arg Gly Tyr Phe Val
20 25 30
Glu Thr Tyr Asn Glu Asn Asp Met Lys Glu Ala Gly Ile Gly Ile His
35 40 45
Phe Val Gln Asp Asn Gln Ser Met Ser Thr Lys Gly Val Leu Arg Gly
50 55 60
Leu His Phe Gln Lys Gln Tyr Pro Gln Cys Lys Leu Val Arg Val Val
65 70 75 80
Asn Gly Thr Val Phe Asp Val Ala Val Asp Leu Arg Ser Asn Ser Glu
85 90 95
Thr Tyr Gly Lys Trp Tyr Gly Val Val Leu Ser Ala Glu Asn Lys Lys
100 105 110
Gln Phe Leu Ile Pro Glu Gly Phe Ala His Gly Phe Leu Val Leu Ser
115 120 125
Asn Glu Ala Glu Phe Cys Tyr Lys Val Asn Asp Phe Tyr His Pro Asn
130 135 140
Asp Glu Gly Gly Met Ala Trp Asn Asp Pro Glu Val Gly Ile Glu Trp
145 150 155 160
Pro Gln Leu Lys Gly Glu Tyr Lys Gly Asn Ala Ser Ala Glu Gly Tyr
165 170 175
Thr Leu Glu Asp Gly Thr Ala Leu Asn Leu Ser Asp Lys Asp Gln Lys
180 185 190
Trp Leu Gly Leu Lys Asp Thr Phe Lys Phe
195 200
<210> 101
<211> 828
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative NAD-dependent epimerase/dehydratase family protein 2
<400> 101
atgaaaattt tagtaactgg tgcaaacggc tatctggggc agggtattgt aaaggagctt 60
ctagataacg ggcataatgt tgtggctgcg gattttaaga ccacgtatgt tgatgaccga 120
gcagaaaaga tagattgtga cttgttctcg gtagaagaac cctatacata ctttggtaaa 180
ccggatgctc ttcttcattt ggcatggagg gatggatttg ttcattactc ggaaaaccat 240
attgcggatt taccaaaaca ttatcatttc ttgaagcaaa tggttgaagc taatatcttc 300
aagattagtg ttatgggaac gatgcacgaa attggtttct ttgaaggtag tattaatgaa 360
aatactcctt gccatccgat gagtctctat ggcatcggca aagatgctct gcgcaactgt 420
gtggcgatga tgactaatgg taaacacaca aaatggcaat ggttacgtgg ctattacatc 480
gtcggacatt ctgagtttgg atgttctatt ttttcaaaaa ttaaggcagc agaaaaagag 540
ggtaaaacag aatttccgtt taccatgggt caaaatcagt tcgattttat agattatgaa 600
gatttctgta aacaggttgc tgcagctgtt ggtcaggatg aaattaatgg aattatcaat 660
atctgttctg gtaaaccgga aaagttggct gatcgtgtag aaaggtttat taaggaaaat 720
ggatacggta ttaaattgaa atatggagca tttcctgatc gtccatacga ttcaaaggcc 780
gtttggggag ataataataa gataagaaaa attatgcaga ataattga 828
<210> 102
<211> 275
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_NAD-dependent epimerase/dehydratase family protein 2
<400> 102
Met Lys Ile Leu Val Thr Gly Ala Asn Gly Tyr Leu Gly Gln Gly Ile
1 5 10 15
Val Lys Glu Leu Leu Asp Asn Gly His Asn Val Val Ala Ala Asp Phe
20 25 30
Lys Thr Thr Tyr Val Asp Asp Arg Ala Glu Lys Ile Asp Cys Asp Leu
35 40 45
Phe Ser Val Glu Glu Pro Tyr Thr Tyr Phe Gly Lys Pro Asp Ala Leu
50 55 60
Leu His Leu Ala Trp Arg Asp Gly Phe Val His Tyr Ser Glu Asn His
65 70 75 80
Ile Ala Asp Leu Pro Lys His Tyr His Phe Leu Lys Gln Met Val Glu
85 90 95
Ala Asn Ile Phe Lys Ile Ser Val Met Gly Thr Met His Glu Ile Gly
100 105 110
Phe Phe Glu Gly Ser Ile Asn Glu Asn Thr Pro Cys His Pro Met Ser
115 120 125
Leu Tyr Gly Ile Gly Lys Asp Ala Leu Arg Asn Cys Val Ala Met Met
130 135 140
Thr Asn Gly Lys His Thr Lys Trp Gln Trp Leu Arg Gly Tyr Tyr Ile
145 150 155 160
Val Gly His Ser Glu Phe Gly Cys Ser Ile Phe Ser Lys Ile Lys Ala
165 170 175
Ala Glu Lys Glu Gly Lys Thr Glu Phe Pro Phe Thr Met Gly Gln Asn
180 185 190
Gln Phe Asp Phe Ile Asp Tyr Glu Asp Phe Cys Lys Gln Val Ala Ala
195 200 205
Ala Val Gly Gln Asp Glu Ile Asn Gly Ile Ile Asn Ile Cys Ser Gly
210 215 220
Lys Pro Glu Lys Leu Ala Asp Arg Val Glu Arg Phe Ile Lys Glu Asn
225 230 235 240
Gly Tyr Gly Ile Lys Leu Lys Tyr Gly Ala Phe Pro Asp Arg Pro Tyr
245 250 255
Asp Ser Lys Ala Val Trp Gly Asp Asn Asn Lys Ile Arg Lys Ile Met
260 265 270
Gln Asn Asn
275
<210> 103
<211> 912
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding putative dTDP-4-dehydrorhamnose reductase protein
<400> 103
atgaagttct ttgtaacagg tgttggtggc caacttggcc atgatgtgat gaatgagctg 60
ctgaagcgtg gccatgaagg tgttggttct gatattcagg aaaactacag cggggtggca 120
gatgactctg cagtaacaaa agcaccttat gtggctctgg atattactga taagaatgcc 180
gttgaaaaag taattacaga agtaaatccg gatgccgtaa tccactgcgc agcatggact 240
gctgttgata tggctgagga tgatgataaa gtagcgaaag tccgtgcaat caatacaggc 300
ggtactcaga acattgcgga tgtctgcaag aaattggact gtaagatgac ctatatcagc 360
acggattatg tgtttgatgg tcagggtaca gagccctggc agccggactg caaggattac 420
aagccactga atgtgtatgg tcagacgaag ctggaaggtg aactggctgt cagtcaaacg 480
ctggagaaat attttatcgt tcgtatagca tgggtgtttg gtctgaatgg taagaacttt 540
attaagacca tgctgaatgt cggcaagacg cacgatactg tccgcgtggt taatgatcag 600
attggcacac cgacctatac atatgatttg gctcgactgc tcgttgatat gaatgaaacc 660
aagaaatacg gctattacca tgcgaccaac gagggcggtt atatcagttg gtatgatttc 720
acgaaagaaa tttatcgtca ggctggttat aagacggaag tcctgccggt gaccacggcg 780
gagtatggct tgagcaaggc agctcgtccg ttcaacagcc gtctggataa gagcaagctg 840
gtggaggctg gattcactcc gcttccaaca tggcaggatg cactgagccg ttatttgaaa 900
gaaatcgagt ag 912
<210> 104
<211> 303
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_dTDP-4-dehydrorhamnose_reductase
<400> 104
Met Lys Phe Phe Val Thr Gly Val Gly Gly Gln Leu Gly His Asp Val
1 5 10 15
Met Asn Glu Leu Leu Lys Arg Gly His Glu Gly Val Gly Ser Asp Ile
20 25 30
Gln Glu Asn Tyr Ser Gly Val Ala Asp Asp Ser Ala Val Thr Lys Ala
35 40 45
Pro Tyr Val Ala Leu Asp Ile Thr Asp Lys Asn Ala Val Glu Lys Val
50 55 60
Ile Thr Glu Val Asn Pro Asp Ala Val Ile His Cys Ala Ala Trp Thr
65 70 75 80
Ala Val Asp Met Ala Glu Asp Asp Asp Lys Val Ala Lys Val Arg Ala
85 90 95
Ile Asn Thr Gly Gly Thr Gln Asn Ile Ala Asp Val Cys Lys Lys Leu
100 105 110
Asp Cys Lys Met Thr Tyr Ile Ser Thr Asp Tyr Val Phe Asp Gly Gln
115 120 125
Gly Thr Glu Pro Trp Gln Pro Asp Cys Lys Asp Tyr Lys Pro Leu Asn
130 135 140
Val Tyr Gly Gln Thr Lys Leu Glu Gly Glu Leu Ala Val Ser Gln Thr
145 150 155 160
Leu Glu Lys Tyr Phe Ile Val Arg Ile Ala Trp Val Phe Gly Leu Asn
165 170 175
Gly Lys Asn Phe Ile Lys Thr Met Leu Asn Val Gly Lys Thr His Asp
180 185 190
Thr Val Arg Val Val Asn Asp Gln Ile Gly Thr Pro Thr Tyr Thr Tyr
195 200 205
Asp Leu Ala Arg Leu Leu Val Asp Met Asn Glu Thr Lys Lys Tyr Gly
210 215 220
Tyr Tyr His Ala Thr Asn Glu Gly Gly Tyr Ile Ser Trp Tyr Asp Phe
225 230 235 240
Thr Lys Glu Ile Tyr Arg Gln Ala Gly Tyr Lys Thr Glu Val Leu Pro
245 250 255
Val Thr Thr Ala Glu Tyr Gly Leu Ser Lys Ala Ala Arg Pro Phe Asn
260 265 270
Ser Arg Leu Asp Lys Ser Lys Leu Val Glu Ala Gly Phe Thr Pro Leu
275 280 285
Pro Thr Trp Gln Asp Ala Leu Ser Arg Tyr Leu Lys Glu Ile Glu
290 295 300
<210> 105
<211> 903
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding a putative nucleotidyltransferase protein
<400> 105
atgaaaacaa cattacttgt catggctgct ggtatcggta gccgcttcga aacgggaatt 60
aagcagttgg agccggtgga tgcttctaat catattatca tggattactc gattcatgat 120
gcaatcgagg ctggcttcaa tcatgtggta tttattatcc gtaaggatat tgagaaagag 180
ttcaaagagg tcatcggtga tcgcattgcc tctatttgct cttctcacaa tataactgtg 240
gactacgctt tccaggacat taacgatatt ccgggaactt taccggaagg ccgtacaaaa 300
ccgtggggaa ccggacaggc cgtgcttgct gctaaaaatg tgattgatac cccgtttatt 360
gtcattaatg ctgatgatta ctatggcaag gaaggcttta aggctgtcca tgagtatctg 420
gtaaatggcg gaaagtcctg tatggctggc tttgtgctga agaatacgct gtctgataac 480
ggtggtgtaa ctcgtggtat ctgcaagatg gatgagcagt acaatctgac tgaggttgtt 540
gagacaaaga atattgtgaa gaccgcaact ggagcagaag cagacggaaa agtgattgat 600
gttgattctc tggtatctat gaatatgtgg ggattaactc ctgatttttt ggacatgctg 660
gagaaaggtt tcaaagaatt tttcgagaaa gaagttccgg gcaatccttt gaaagctgag 720
tatctgatcc caatcctcat cggtgaactg ctggagcagg gcaagatgtc tgtgaaggtt 780
ctgaaaacga acgatacctg gtatggtatg acctatcatg aggatgtcgc agttgtaaag 840
gacagcttca aaaaaatgct agaaaacggc gtgtacaagg ctgacttgtt cagagatctc 900
taa 903
<210> 106
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_nucleotidyltransferase
<400> 106
Met Lys Thr Thr Leu Leu Val Met Ala Ala Gly Ile Gly Ser Arg Phe
1 5 10 15
Glu Thr Gly Ile Lys Gln Leu Glu Pro Val Asp Ala Ser Asn His Ile
20 25 30
Ile Met Asp Tyr Ser Ile His Asp Ala Ile Glu Ala Gly Phe Asn His
35 40 45
Val Val Phe Ile Ile Arg Lys Asp Ile Glu Lys Glu Phe Lys Glu Val
50 55 60
Ile Gly Asp Arg Ile Ala Ser Ile Cys Ser Ser His Asn Ile Thr Val
65 70 75 80
Asp Tyr Ala Phe Gln Asp Ile Asn Asp Ile Pro Gly Thr Leu Pro Glu
85 90 95
Gly Arg Thr Lys Pro Trp Gly Thr Gly Gln Ala Val Leu Ala Ala Lys
100 105 110
Asn Val Ile Asp Thr Pro Phe Ile Val Ile Asn Ala Asp Asp Tyr Tyr
115 120 125
Gly Lys Glu Gly Phe Lys Ala Val His Glu Tyr Leu Val Asn Gly Gly
130 135 140
Lys Ser Cys Met Ala Gly Phe Val Leu Lys Asn Thr Leu Ser Asp Asn
145 150 155 160
Gly Gly Val Thr Arg Gly Ile Cys Lys Met Asp Glu Gln Tyr Asn Leu
165 170 175
Thr Glu Val Val Glu Thr Lys Asn Ile Val Lys Thr Ala Thr Gly Ala
180 185 190
Glu Ala Asp Gly Lys Val Ile Asp Val Asp Ser Leu Val Ser Met Asn
195 200 205
Met Trp Gly Leu Thr Pro Asp Phe Leu Asp Met Leu Glu Lys Gly Phe
210 215 220
Lys Glu Phe Phe Glu Lys Glu Val Pro Gly Asn Pro Leu Lys Ala Glu
225 230 235 240
Tyr Leu Ile Pro Ile Leu Ile Gly Glu Leu Leu Glu Gln Gly Lys Met
245 250 255
Ser Val Lys Val Leu Lys Thr Asn Asp Thr Trp Tyr Gly Met Thr Tyr
260 265 270
His Glu Asp Val Ala Val Val Lys Asp Ser Phe Lys Lys Met Leu Glu
275 280 285
Asn Gly Val Tyr Lys Ala Asp Leu Phe Arg Asp Leu
290 295 300
<210> 107
<211> 345
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding a putative acyltransferase 2 protein
<400> 107
atgttttcac gacttgcttg tattaacgag atttctttag gaaataatgt acttacagga 60
ccacatattt ttatttgcga ttataatcat gcatatgaag atataaatag acctatttct 120
ttacaaggaa atattggaaa cgataataag gttattatag atgatgactg ttggattgaa 180
acaaatgttg taatctgtgg caatgttcat attggaaaac atacggtaat tggggcaaat 240
gcttttgtca ataaggacat tcctagctat tgcgttgcgg ttggaaatcc agcaaaggtt 300
gtcaaaaaat ataattttga gacaggtgca tgggagaatg tataa 345
<210> 108
<211> 114
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_acyltransferase2
<400> 108
Met Phe Ser Arg Leu Ala Cys Ile Asn Glu Ile Ser Leu Gly Asn Asn
1 5 10 15
Val Leu Thr Gly Pro His Ile Phe Ile Cys Asp Tyr Asn His Ala Tyr
20 25 30
Glu Asp Ile Asn Arg Pro Ile Ser Leu Gln Gly Asn Ile Gly Asn Asp
35 40 45
Asn Lys Val Ile Ile Asp Asp Asp Cys Trp Ile Glu Thr Asn Val Val
50 55 60
Ile Cys Gly Asn Val His Ile Gly Lys His Thr Val Ile Gly Ala Asn
65 70 75 80
Ala Phe Val Asn Lys Asp Ile Pro Ser Tyr Cys Val Ala Val Gly Asn
85 90 95
Pro Ala Lys Val Val Lys Lys Tyr Asn Phe Glu Thr Gly Ala Trp Glu
100 105 110
Asn Val
<210> 109
<211> 321
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding a putative DUF1972 domain-containing protein
<400> 109
ttggttcaaa atggaattcc agcaaagtat ggtggttttg agacttttgt agaaaaacta 60
acggcacatc aaagtaataa aaaccttaag tatcatgttg cttgtttatc gaatggtata 120
caagaaaatt ttaatcataa tgatgcagac tgttttaata tttcaaagaa aaatattgga 180
ccagcaaacg ccatttatta tgatttggca gctttaaaac actcacttaa agaaattgaa 240
gaaaaaaatt atatgggtgc aattatttat attttacttg ccgcattggt ccgtttattg 300
gtcactataa aaagcaaatg a 321
<210> 110
<211> 106
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_DUF1972_domain-containing_protein
<400> 110
Leu Val Gln Asn Gly Ile Pro Ala Lys Tyr Gly Gly Phe Glu Thr Phe
1 5 10 15
Val Glu Lys Leu Thr Ala His Gln Ser Asn Lys Asn Leu Lys Tyr His
20 25 30
Val Ala Cys Leu Ser Asn Gly Ile Gln Glu Asn Phe Asn His Asn Asp
35 40 45
Ala Asp Cys Phe Asn Ile Ser Lys Lys Asn Ile Gly Pro Ala Asn Ala
50 55 60
Ile Tyr Tyr Asp Leu Ala Ala Leu Lys His Ser Leu Lys Glu Ile Glu
65 70 75 80
Glu Lys Asn Tyr Met Gly Ala Ile Ile Tyr Ile Leu Leu Ala Ala Leu
85 90 95
Val Arg Leu Leu Val Thr Ile Lys Ser Lys
100 105
<210> 111
<211> 393
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33139 encoding a putative acyltransferase 1 protein
<400> 111
atgtttggaa taaaatcaag aatcagaaaa tgtaaagata ttataaattt taaattgaaa 60
aataaggggg tagtagttgg aaaaaatatt atattaagaa accctcagta tatttcatgt 120
ggtaaaaatg tagttattgg agatgaaagt aaattattat gttgggatag ttatggagaa 180
gaacaatata gtaatttacc agaaatccaa ataggcgata attttcatgc gacacgaaat 240
tttacaattc aatgcgctca aaaagtggtt attggaagag atgtattggt agcttcgaat 300
gtctttatta tagattataa tcatggatta aacccattaa ccaagtcgta tcttgaaaac 360
ccactaatac gggggggggg tacttgtaga tga 393
<210> 112
<211> 130
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33139_acyltransferase1
<400> 112
Met Phe Gly Ile Lys Ser Arg Ile Arg Lys Cys Lys Asp Ile Ile Asn
1 5 10 15
Phe Lys Leu Lys Asn Lys Gly Val Val Val Gly Lys Asn Ile Ile Leu
20 25 30
Arg Asn Pro Gln Tyr Ile Ser Cys Gly Lys Asn Val Val Ile Gly Asp
35 40 45
Glu Ser Lys Leu Leu Cys Trp Asp Ser Tyr Gly Glu Glu Gln Tyr Ser
50 55 60
Asn Leu Pro Glu Ile Gln Ile Gly Asp Asn Phe His Ala Thr Arg Asn
65 70 75 80
Phe Thr Ile Gln Cys Ala Gln Lys Val Val Ile Gly Arg Asp Val Leu
85 90 95
Val Ala Ser Asn Val Phe Ile Ile Asp Tyr Asn His Gly Leu Asn Pro
100 105 110
Leu Thr Lys Ser Tyr Leu Glu Asn Pro Leu Ile Arg Gly Gly Gly Thr
115 120 125
Cys Arg
130
<210> 113
<211> 318
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsR gene of DSM 33141
<400> 113
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataa 318
<210> 114
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsR
<400> 114
Met Asp Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 115
<211> 768
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsX gene of DSM 33141
<400> 115
atgatgaaaa aaggaatttt tgtaattact atagtgatat ctatagcagt gataattgga 60
ggtttttata gttataattc taggataaat aatctttcaa aagctgataa aggaaaagaa 120
gttgtaaaaa atagcagtga aaaaaatcag atagacctta cctataaaaa gtattataaa 180
aatttaccaa aatcagttca aaataaaata gatgatattt catcaaaaaa taaagaagtt 240
actttaactt gtatttggca atctgattca gttatttctg aacaatttca acaaaactta 300
caaaaatatt atggaaataa gttttggaac atcaaaaata tcacttacaa tggcgaaaca 360
agtgaacaat tattggctga aaaagttgaa aaccaagtat tagccactaa tcctgatgtt 420
gttttatatg aagctccact ttttaatgat aaccaaaaca ttgaagcaac agcctcacgg 480
actagtaatg agcaacttat aacaaatttg gctagtacag gagcagaggt gatagttcaa 540
ccctctccac cgatttatgg tggtgttgta taccccgtac aagaagaaca atttaaacaa 600
tctttatcta caaaatatcc ctatatagac tactgggcta gttacccaga caaaaattct 660
gatgaaatga aggggctgtt ttctgatgat ggagtatata gaacattaaa tgattcgggg 720
aataaggttt ggctagatta tattactaaa tattttacag caaactaa 768
<210> 116
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsX
<400> 116
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Val Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Arg
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Thr Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Asp Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 117
<211> 765
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsB gene of DSM 33141
<400> 117
atgattgata ttcattgcca tattttaccg gggatagatg atggagctaa aacttctgaa 60
gatactttga aaatgctgaa atcagcaatt gatgaaggga taacaaccat cactgccact 120
cctcatcata atcctcaatt taataatgag tcacccctta ttttaaaaaa agttaaggaa 180
gttcaaaata tcattgacga gcaccaatta ccaattgaag ttttgcctgg acaagaggtt 240
agaatatgtg gtgatttatt aaaagaattt tctgaaggaa agttactgac agcagcgggc 300
acttcaagtt atatattgat tgaatttcca tcaaatcatg tgccagctta tgctaaagaa 360
cttttttata atattcaatt ggagggactt caacctattt tggtccaccc tgagcgtaat 420
agtggaatca ttgagaaccc tgatatatta tttgatttta ttgaacaagg agtactaagt 480
cagataacag cttcaagtat cactggtcat tttggtaaaa aaatacagaa gttatcattt 540
aaaatgatag aaaaccatct tacgcatctt gttgcatcag atgcgcataa tgtgacgtca 600
cgagcattta agatgaagga agcatttgaa attattgaag atagttatgg ttctggtgta 660
tcacgaatgt ttcaaaataa tgcagagtca gtgatcttaa acgaaagttt ttatcaagaa 720
aaaccaacaa agatcaaaac aaagaaattt ttaggattat tttaa 765
<210> 118
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsB
<400> 118
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Glu Asp Thr Leu Lys Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Cys Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Ile Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Leu Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 119
<211> 696
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsD gene of DSM 33141
<400> 119
atggctaaaa ataaaagaag catagacaat aatcgttata ttatcaccag tgtcaatccc 60
caatcaccta tttccgaaca atatcgtacg attcgtacga ccattgattt taaaatggcg 120
gatcaaggaa ttaaaagttt tctagtaaca tcttcagaag cagctgcagg taaatcaacc 180
gtaagtgcta atatagctgt tgcttttgca caacaaggta aaaaagtact tttaattgat 240
ggcgatcttc gtaaaccgac tgctaacatt acttttaaag tacaaaatag agtaggatta 300
accaatattt taatgcatca atcttcgatt gaagatgcca tacaagggac aagactttct 360
gaaaatctta aaataattac ctctggtcca attccaccta atccatcgga attattagca 420
tctagtgcaa tgaagaattt gattgactct gtgtccgatt tctatgatgt tattttgatt 480
gatactacac ctctctttgc agttactgat gctcaaattt tgagtattta tgcaggagga 540
gtggttcttg ttgtacgtgc caatgaaaca aaaaaagaga gtttagcaaa aacaaaaaaa 600
atactggaac aacttaatgc aaatatatta ggagttgttt tgcatgggct agactcttct 660
gactcaccgt cgtattccta ctacggagta gagtaa 696
<210> 120
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsD
<400> 120
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Thr Val Ser Ala Asn
50 55 60
Ile Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Ala Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Lys Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Phe Tyr Asp Val Ile Leu Ile
145 150 155 160
Asp Thr Thr Pro Leu Phe Ala Val Thr Asp Ala Gln Ile Leu Ser Ile
165 170 175
Tyr Ala Gly Gly Val Val Leu Val Val Arg Ala Asn Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Leu Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Leu Asp Ser Ser Asp Ser Pro Ser
210 215 220
Tyr Ser Tyr Tyr Gly Val Glu
225 230
<210> 121
<211> 1164
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT1 protein
<400> 121
ttgcacttaa aagaggaagt tatgaataaa aggaaaaata ataattctcc acttaaaatt 60
gcagttcttg gacataagac aattccgagt cgccaaggtg gaattgaaat tgtagttgaa 120
gaattaacag ttcgtatggc aaaactggga cataaaataa cagtatataa ccgtagcgga 180
catcatgtaa gtggtaaaga atttgatgga aagaagctaa aagaatataa aggaatccga 240
atgaaatatg taccgacaat agataaaaaa ggattagcag cgatgagtgc ctcttttttc 300
gcagcggtgg tagcggcgtt tggaaaatat gatgtggtac attttcatgc agaagggccg 360
tgtgcaatgt tatggttgcc aaaattattc ggaaagcgat gtatagctac ggtgcatgga 420
ttagatcatc agagagcaaa atggggtaaa ttagccagta catatattat gttgggagaa 480
aaatgtgcgg ttagatttgc tgacgaaatt attgtgttaa gtgaaggggt tcagaaatat 540
ttccttgata catatgggag agaaacccgt tttataccta atggtgtaaa cagaccaatt 600
atccgaaatg ctgaaattat taagaataaa tttggtttag aaaaggatag ctacatcctc 660
tttttgggta gattagttcc agaaaaaggg cttagatatt tgatagaagc ctttagacag 720
gtagatactg agaaaaaaat ggttattgct ggtggaagtt cagatacaga tgaatttaca 780
aaagaattaa aagaactggc aaaagatgat tcaagaatta tttttactgg atttgttcag 840
ggaaaagagt tagatgaact ttatagtaat gcttatgtat atactttgcc gagtgatctg 900
gaaggaatgc ctcttagttt attagaggca atgagttacg gaaattgctg cttggtgtct 960
gacatagatg aatgtgcatc tgttgtagaa gataaagcat ttatttttaa gaaaagtgat 1020
gtggcagatt tgcaaagcaa attgcagaaa gcttgtgatg ataaagaaca agtacaaaaa 1080
tataaagatg aagctgcaga ttatatttgc caaaaatata attgggatga tgttgtagaa 1140
aaaacactag aattgtatca ataa 1164
<210> 122
<211> 387
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT1
<400> 122
Leu His Leu Lys Glu Glu Val Met Asn Lys Arg Lys Asn Asn Asn Ser
1 5 10 15
Pro Leu Lys Ile Ala Val Leu Gly His Lys Thr Ile Pro Ser Arg Gln
20 25 30
Gly Gly Ile Glu Ile Val Val Glu Glu Leu Thr Val Arg Met Ala Lys
35 40 45
Leu Gly His Lys Ile Thr Val Tyr Asn Arg Ser Gly His His Val Ser
50 55 60
Gly Lys Glu Phe Asp Gly Lys Lys Leu Lys Glu Tyr Lys Gly Ile Arg
65 70 75 80
Met Lys Tyr Val Pro Thr Ile Asp Lys Lys Gly Leu Ala Ala Met Ser
85 90 95
Ala Ser Phe Phe Ala Ala Val Val Ala Ala Phe Gly Lys Tyr Asp Val
100 105 110
Val His Phe His Ala Glu Gly Pro Cys Ala Met Leu Trp Leu Pro Lys
115 120 125
Leu Phe Gly Lys Arg Cys Ile Ala Thr Val His Gly Leu Asp His Gln
130 135 140
Arg Ala Lys Trp Gly Lys Leu Ala Ser Thr Tyr Ile Met Leu Gly Glu
145 150 155 160
Lys Cys Ala Val Arg Phe Ala Asp Glu Ile Ile Val Leu Ser Glu Gly
165 170 175
Val Gln Lys Tyr Phe Leu Asp Thr Tyr Gly Arg Glu Thr Arg Phe Ile
180 185 190
Pro Asn Gly Val Asn Arg Pro Ile Ile Arg Asn Ala Glu Ile Ile Lys
195 200 205
Asn Lys Phe Gly Leu Glu Lys Asp Ser Tyr Ile Leu Phe Leu Gly Arg
210 215 220
Leu Val Pro Glu Lys Gly Leu Arg Tyr Leu Ile Glu Ala Phe Arg Gln
225 230 235 240
Val Asp Thr Glu Lys Lys Met Val Ile Ala Gly Gly Ser Ser Asp Thr
245 250 255
Asp Glu Phe Thr Lys Glu Leu Lys Glu Leu Ala Lys Asp Asp Ser Arg
260 265 270
Ile Ile Phe Thr Gly Phe Val Gln Gly Lys Glu Leu Asp Glu Leu Tyr
275 280 285
Ser Asn Ala Tyr Val Tyr Thr Leu Pro Ser Asp Leu Glu Gly Met Pro
290 295 300
Leu Ser Leu Leu Glu Ala Met Ser Tyr Gly Asn Cys Cys Leu Val Ser
305 310 315 320
Asp Ile Asp Glu Cys Ala Ser Val Val Glu Asp Lys Ala Phe Ile Phe
325 330 335
Lys Lys Ser Asp Val Ala Asp Leu Gln Ser Lys Leu Gln Lys Ala Cys
340 345 350
Asp Asp Lys Glu Gln Val Gln Lys Tyr Lys Asp Glu Ala Ala Asp Tyr
355 360 365
Ile Cys Gln Lys Tyr Asn Trp Asp Asp Val Val Glu Lys Thr Leu Glu
370 375 380
Leu Tyr Gln
385
<210> 123
<211> 1257
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzy gene of DSM 33141
<400> 123
atgataataa agaagaaaga taaatattat attatgctat ttttattatt tgttgttgat 60
ttaaattttt taaatttaat agatactgca acatttaata tcggaggaat ttattataca 120
gacattgtat ttctattaaa tatttctgta tttttatatc agattataaa ggataaattt 180
caaattgcaa aaaatataag tattatctat gtattgtgtg taattatatt aatggtattg 240
tcagcttgtg caggtcacat aacatataat caatcaatag tggctggaat agttgcacag 300
cgagaatggg tgtcatggat gttattgatt tatccattat ctaggtggat tcaattacaa 360
aaactttctg ttgaaggaat aaaaaaatgt ataattaatt tatgcaatat atatgcgttt 420
atttgtataa tacaatattt attatatgat gtagttcagt ttacatatac tatggttaac 480
aatagatatg gaagtgttag attatatttt tatacgattt ttttctgttt tgctataggt 540
atagttattg atgacttgat tagtggatca aaacgtagtt tgaagaattc tacaatgcaa 600
tggataaaat tgctcgctta tttatttata attgtattta ttacaaaggg aagaatgcaa 660
acaatttctt tattatgtgc tattatagtt tgctcattaa ttagaagaga tatgcggata 720
gaaaaaaaaa taatactctg tttattaata gttgtgctga catatgtttt tatgaactct 780
acaatgggac aggatatatt gcatgccatt atgggtacaa gtgaaaatga tactttatct 840
gttagagatt caggaagggt atattattta ggattatata cacagtcatg gaaaagaata 900
ctttttggat gtggatttgc aaattctaat aatagttatg ctataacgat acttaatcct 960
ttgtggcagg aatatggaag tgcgagatac tatttggaag acgtaggaat tttatcacca 1020
ttaataaaat atggattggt aggaatagta ttttggattg gggttgtaat taaaaatata 1080
tcactttcat ataaaattta taataaatct ggagaaatgg tatatttaca gtttttattt 1140
atggatttaa ttgcttgtgc gacactggta cctacaatgt ttaatacaac aatccttttt 1200
ccgcttatta caatattaat attatttaga gcaaaggaat tgcagataat aagataa 1257
<210> 124
<211> 418
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_wzy
<400> 124
Met Ile Ile Lys Lys Lys Asp Lys Tyr Tyr Ile Met Leu Phe Leu Leu
1 5 10 15
Phe Val Val Asp Leu Asn Phe Leu Asn Leu Ile Asp Thr Ala Thr Phe
20 25 30
Asn Ile Gly Gly Ile Tyr Tyr Thr Asp Ile Val Phe Leu Leu Asn Ile
35 40 45
Ser Val Phe Leu Tyr Gln Ile Ile Lys Asp Lys Phe Gln Ile Ala Lys
50 55 60
Asn Ile Ser Ile Ile Tyr Val Leu Cys Val Ile Ile Leu Met Val Leu
65 70 75 80
Ser Ala Cys Ala Gly His Ile Thr Tyr Asn Gln Ser Ile Val Ala Gly
85 90 95
Ile Val Ala Gln Arg Glu Trp Val Ser Trp Met Leu Leu Ile Tyr Pro
100 105 110
Leu Ser Arg Trp Ile Gln Leu Gln Lys Leu Ser Val Glu Gly Ile Lys
115 120 125
Lys Cys Ile Ile Asn Leu Cys Asn Ile Tyr Ala Phe Ile Cys Ile Ile
130 135 140
Gln Tyr Leu Leu Tyr Asp Val Val Gln Phe Thr Tyr Thr Met Val Asn
145 150 155 160
Asn Arg Tyr Gly Ser Val Arg Leu Tyr Phe Tyr Thr Ile Phe Phe Cys
165 170 175
Phe Ala Ile Gly Ile Val Ile Asp Asp Leu Ile Ser Gly Ser Lys Arg
180 185 190
Ser Leu Lys Asn Ser Thr Met Gln Trp Ile Lys Leu Leu Ala Tyr Leu
195 200 205
Phe Ile Ile Val Phe Ile Thr Lys Gly Arg Met Gln Thr Ile Ser Leu
210 215 220
Leu Cys Ala Ile Ile Val Cys Ser Leu Ile Arg Arg Asp Met Arg Ile
225 230 235 240
Glu Lys Lys Ile Ile Leu Cys Leu Leu Ile Val Val Leu Thr Tyr Val
245 250 255
Phe Met Asn Ser Thr Met Gly Gln Asp Ile Leu His Ala Ile Met Gly
260 265 270
Thr Ser Glu Asn Asp Thr Leu Ser Val Arg Asp Ser Gly Arg Val Tyr
275 280 285
Tyr Leu Gly Leu Tyr Thr Gln Ser Trp Lys Arg Ile Leu Phe Gly Cys
290 295 300
Gly Phe Ala Asn Ser Asn Asn Ser Tyr Ala Ile Thr Ile Leu Asn Pro
305 310 315 320
Leu Trp Gln Glu Tyr Gly Ser Ala Arg Tyr Tyr Leu Glu Asp Val Gly
325 330 335
Ile Leu Ser Pro Leu Ile Lys Tyr Gly Leu Val Gly Ile Val Phe Trp
340 345 350
Ile Gly Val Val Ile Lys Asn Ile Ser Leu Ser Tyr Lys Ile Tyr Asn
355 360 365
Lys Ser Gly Glu Met Val Tyr Leu Gln Phe Leu Phe Met Asp Leu Ile
370 375 380
Ala Cys Ala Thr Leu Val Pro Thr Met Phe Asn Thr Thr Ile Leu Phe
385 390 395 400
Pro Leu Ile Thr Ile Leu Ile Leu Phe Arg Ala Lys Glu Leu Gln Ile
405 410 415
Ile Arg
<210> 125
<211> 990
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT2 protein
<400> 125
atggaggtta ttaataagga acaccctctt attagtgtta ttgttcctat atataaagtc 60
gaaaaatatc tgggtaaatg tattgagagt attattgccc aggaatattc aaatatcgaa 120
attattttag tggatgatgg atctccagat aattgtggta aaatatgtga tgattttgct 180
actaaagatg ctcggattaa agttatccat aaagaaaatg gaggactttc ttcagcaaga 240
aatgctggaa ttgatattgc aacaggtgag tatttgggct ttgtagatag tgacgattcg 300
attgagccgt ttatgtataa aaaattgatt tcttcaatta tagaaaacaa aaccaaatta 360
gcagtatgtg cggtaaatta tgtatttgaa aatggtaaga ttcttacgaa atctaattta 420
ggtgagaatt gtacatttga tttttatcaa gcaatgatag aaatgaattc tcatagaatt 480
tttgatatgg gtgcatggag caagttatat catagagatt tattttttga tttgcgcttt 540
ccggaaggga aattaagtga agattattat attatgtata aaatctttga ccgagcgcaa 600
aaaattagtt atgtctcaac accgtgttat aattatttac aacgacagaa tagcattaca 660
cataatgtta gaattaatca cgatcatgaa tatgctgcaa aggaacaaat ggaatattta 720
gaaaagaaat atccagaatt aaaggtcttg ggacatacag cttatgcttc agcggcatta 780
actgtatatg actcatatat taaaaatgcc gttagctgtc ctcaaaagga tataaagcat 840
tttaaaagtg tagttcggga aaataggcag tatattaaga atgcagattt tttgtcaaaa 900
agcaaaaaag ttcaatttca attattttca attagtacag ctatgtacaa tattgtattt 960
aaggtgtata gaaaaattaa gagggtttag 990
<210> 126
<211> 329
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT2
<400> 126
Met Glu Val Ile Asn Lys Glu His Pro Leu Ile Ser Val Ile Val Pro
1 5 10 15
Ile Tyr Lys Val Glu Lys Tyr Leu Gly Lys Cys Ile Glu Ser Ile Ile
20 25 30
Ala Gln Glu Tyr Ser Asn Ile Glu Ile Ile Leu Val Asp Asp Gly Ser
35 40 45
Pro Asp Asn Cys Gly Lys Ile Cys Asp Asp Phe Ala Thr Lys Asp Ala
50 55 60
Arg Ile Lys Val Ile His Lys Glu Asn Gly Gly Leu Ser Ser Ala Arg
65 70 75 80
Asn Ala Gly Ile Asp Ile Ala Thr Gly Glu Tyr Leu Gly Phe Val Asp
85 90 95
Ser Asp Asp Ser Ile Glu Pro Phe Met Tyr Lys Lys Leu Ile Ser Ser
100 105 110
Ile Ile Glu Asn Lys Thr Lys Leu Ala Val Cys Ala Val Asn Tyr Val
115 120 125
Phe Glu Asn Gly Lys Ile Leu Thr Lys Ser Asn Leu Gly Glu Asn Cys
130 135 140
Thr Phe Asp Phe Tyr Gln Ala Met Ile Glu Met Asn Ser His Arg Ile
145 150 155 160
Phe Asp Met Gly Ala Trp Ser Lys Leu Tyr His Arg Asp Leu Phe Phe
165 170 175
Asp Leu Arg Phe Pro Glu Gly Lys Leu Ser Glu Asp Tyr Tyr Ile Met
180 185 190
Tyr Lys Ile Phe Asp Arg Ala Gln Lys Ile Ser Tyr Val Ser Thr Pro
195 200 205
Cys Tyr Asn Tyr Leu Gln Arg Gln Asn Ser Ile Thr His Asn Val Arg
210 215 220
Ile Asn His Asp His Glu Tyr Ala Ala Lys Glu Gln Met Glu Tyr Leu
225 230 235 240
Glu Lys Lys Tyr Pro Glu Leu Lys Val Leu Gly His Thr Ala Tyr Ala
245 250 255
Ser Ala Ala Leu Thr Val Tyr Asp Ser Tyr Ile Lys Asn Ala Val Ser
260 265 270
Cys Pro Gln Lys Asp Ile Lys His Phe Lys Ser Val Val Arg Glu Asn
275 280 285
Arg Gln Tyr Ile Lys Asn Ala Asp Phe Leu Ser Lys Ser Lys Lys Val
290 295 300
Gln Phe Gln Leu Phe Ser Ile Ser Thr Ala Met Tyr Asn Ile Val Phe
305 310 315 320
Lys Val Tyr Arg Lys Ile Lys Arg Val
325
<210> 127
<211> 1083
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT3 protein
<400> 127
atgcagattt tttgtcaaaa agcaaaaaag ttcaatttca attattttca attagtacag 60
ctatgtacaa tattgtattt aaggtgtata gaaaaattaa gagggtttag tatggataaa 120
ttagtgagcg ttattattcc agtgtacaat acaggaaata ttataaaaaa gtgtattaag 180
agtatattga atagtgatta tagtaatata gaaataataa ttattgatga tggatccgat 240
aaagaaactg tagatatatg taacaactta gaaaaagaag aaaaaattca tgtgattcat 300
caagaaaatg ctggtgtaag tagcgctcgc aataaaggaa tatatcacgc tcaaggggag 360
tttattactt ttgtagatgc agatgacaca atagattcaa acctaattag tgttcttgta 420
aatagttgca ttgaaaaaaa tgccgatatg gcaatttgcg gttatagaga atggtatgat 480
gataagcatt gtacagaatt ccggtgcaca gattcaatta cgatattaaa agaaaaggaa 540
attttaaaag actttttttc tacaaataat attgcatgga atgtgtgggg aaaaatatat 600
aagaaaagtt tggttggcga tacaagattc atagtaggta aaaggactgg cgaagacatg 660
tattttgtat atgaaatctt aaaaaaagct catacattgg taatgaataa taaggcattg 720
tacaattatg aaaaacaaga taattctgca atggcagatt caaattgtat gaaatttttt 780
gatacttatg aattagttaa taaagtattt gaagatgaag cattagataa cgagctgaaa 840
aatgcgcaat taaattttta tataaaaagt gagttgtggt tttttcgttt cataaatgca 900
aaagataagg ataatgagaa taaaagtgaa ataaagaaag caagaaaaaa attcttggac 960
aatattaaga aaaaagaagc aagatgttct ggaagaacaa aaatagaatt aattttattg 1020
cgatattttt atcctgtttt tagagtgatc tctcttatat ggggggcaaa aaaggggatt 1080
tag 1083
<210> 128
<211> 360
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT3
<400> 128
Met Gln Ile Phe Cys Gln Lys Ala Lys Lys Phe Asn Phe Asn Tyr Phe
1 5 10 15
Gln Leu Val Gln Leu Cys Thr Ile Leu Tyr Leu Arg Cys Ile Glu Lys
20 25 30
Leu Arg Gly Phe Ser Met Asp Lys Leu Val Ser Val Ile Ile Pro Val
35 40 45
Tyr Asn Thr Gly Asn Ile Ile Lys Lys Cys Ile Lys Ser Ile Leu Asn
50 55 60
Ser Asp Tyr Ser Asn Ile Glu Ile Ile Ile Ile Asp Asp Gly Ser Asp
65 70 75 80
Lys Glu Thr Val Asp Ile Cys Asn Asn Leu Glu Lys Glu Glu Lys Ile
85 90 95
His Val Ile His Gln Glu Asn Ala Gly Val Ser Ser Ala Arg Asn Lys
100 105 110
Gly Ile Tyr His Ala Gln Gly Glu Phe Ile Thr Phe Val Asp Ala Asp
115 120 125
Asp Thr Ile Asp Ser Asn Leu Ile Ser Val Leu Val Asn Ser Cys Ile
130 135 140
Glu Lys Asn Ala Asp Met Ala Ile Cys Gly Tyr Arg Glu Trp Tyr Asp
145 150 155 160
Asp Lys His Cys Thr Glu Phe Arg Cys Thr Asp Ser Ile Thr Ile Leu
165 170 175
Lys Glu Lys Glu Ile Leu Lys Asp Phe Phe Ser Thr Asn Asn Ile Ala
180 185 190
Trp Asn Val Trp Gly Lys Ile Tyr Lys Lys Ser Leu Val Gly Asp Thr
195 200 205
Arg Phe Ile Val Gly Lys Arg Thr Gly Glu Asp Met Tyr Phe Val Tyr
210 215 220
Glu Ile Leu Lys Lys Ala His Thr Leu Val Met Asn Asn Lys Ala Leu
225 230 235 240
Tyr Asn Tyr Glu Lys Gln Asp Asn Ser Ala Met Ala Asp Ser Asn Cys
245 250 255
Met Lys Phe Phe Asp Thr Tyr Glu Leu Val Asn Lys Val Phe Glu Asp
260 265 270
Glu Ala Leu Asp Asn Glu Leu Lys Asn Ala Gln Leu Asn Phe Tyr Ile
275 280 285
Lys Ser Glu Leu Trp Phe Phe Arg Phe Ile Asn Ala Lys Asp Lys Asp
290 295 300
Asn Glu Asn Lys Ser Glu Ile Lys Lys Ala Arg Lys Lys Phe Leu Asp
305 310 315 320
Asn Ile Lys Lys Lys Glu Ala Arg Cys Ser Gly Arg Thr Lys Ile Glu
325 330 335
Leu Ile Leu Leu Arg Tyr Phe Tyr Pro Val Phe Arg Val Ile Ser Leu
340 345 350
Ile Trp Gly Ala Lys Lys Gly Ile
355 360
<210> 129
<211> 1629
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzx gene of DSM 33141
<400> 129
atgatgatta tgttgaatat ggacagaatt atcgagaaaa ggtatggaat cagattcaag 60
atttttcaaa cacaatttct aatatttttg aggtggataa tgaaactcga tagaacaaac 120
aatgccttta gaaatatgaa atttggtatg gttaacaaaa ttgtaacttt atttttgcca 180
tttatcataa gaactgtatt aattaaaaca ataggtatgg aatatgcagg actaaattca 240
ttattttcgt caattttgca ggtgcttaat ttgtcggaat tgggatttag tagtgcggta 300
gtatatagta tgtatcgcac gattgcagaa aatgatgatc gacaagttgg tgcgctacta 360
ttattttata aaaaagtata tcggattatt ggtttagtag ttttatcagt tggattgatt 420
ataatgccat ttttaccgaa attagtagaa ggtggttatc caccagatat aaatgtttat 480
attttgtatc tgatttatct attagatacg gttgtgagtt attttttatt tgcttataaa 540
agtgccatat taaatgctca tcaaagagta gatattgtaa gtaatatttt aacaattact 600
caaggtgcaa tgaatttggt acagattata atgctaatag cattcaaaaa ttattactgc 660
tatattatat ggatgccttt atttacaatc ttaaataata ttatgacagc ctactgtgta 720
aataagctat ttccacaata tcattgtgaa ggaaagattt ctagagatca attgtctgat 780
atgaaatata agataagtgg tcttatggtg aataaattgt gtttaacaac aagaaatact 840
ttagatagtg tatttatatc tgcatttatg ggacttacag tcagtgctat ttacggaaac 900
tattattata tattgaatgc cgttataggc ttgatttcaa ttgtttcaag tgcgatgctt 960
gccggggttg gaaatagtat cgaaacagaa agtgtagaaa aaaattataa tgatttgaaa 1020
aaattcaatt ttttatatat gtggcttagc ggatggtgtg ctatctgcat gttatgtttg 1080
acacaaccgt ttatgacaat ttggatggga aaagacaatc tattcccatt tggtgttgtt 1140
gttcttattt gcatatattt ttatgtgcta aaaatgggag atatgcgagg cttatattca 1200
gatgcagctg gattgtggtg gcagaataga tatagagcga taagtgaatc tatattaaac 1260
cttgtattaa attatgtttt agtgcaggta tggggaattt atggaattat tattgctaca 1320
ttaatatcat tattttttat taattttttg ggtggaagtg gaattgtttt taaacattac 1380
tttaaaaatg gtaaatttat agaatttttg aaatatcatt ttttctatat gcttataacc 1440
gtaataaatg cttccatatg tgtgtttcta acaaattttg ttaaatatga aggaattatt 1500
ggattggggc ttagagcaat aatatgtgta attatcccta atgtaatata tgcattggta 1560
tatttgaata tatcagaatt aaggcaacag tcaaaatggg tattaagtaa gttaaaggta 1620
aggagatga 1629
<210> 130
<211> 542
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_wzx
<400> 130
Met Met Ile Met Leu Asn Met Asp Arg Ile Ile Glu Lys Arg Tyr Gly
1 5 10 15
Ile Arg Phe Lys Ile Phe Gln Thr Gln Phe Leu Ile Phe Leu Arg Trp
20 25 30
Ile Met Lys Leu Asp Arg Thr Asn Asn Ala Phe Arg Asn Met Lys Phe
35 40 45
Gly Met Val Asn Lys Ile Val Thr Leu Phe Leu Pro Phe Ile Ile Arg
50 55 60
Thr Val Leu Ile Lys Thr Ile Gly Met Glu Tyr Ala Gly Leu Asn Ser
65 70 75 80
Leu Phe Ser Ser Ile Leu Gln Val Leu Asn Leu Ser Glu Leu Gly Phe
85 90 95
Ser Ser Ala Val Val Tyr Ser Met Tyr Arg Thr Ile Ala Glu Asn Asp
100 105 110
Asp Arg Gln Val Gly Ala Leu Leu Leu Phe Tyr Lys Lys Val Tyr Arg
115 120 125
Ile Ile Gly Leu Val Val Leu Ser Val Gly Leu Ile Ile Met Pro Phe
130 135 140
Leu Pro Lys Leu Val Glu Gly Gly Tyr Pro Pro Asp Ile Asn Val Tyr
145 150 155 160
Ile Leu Tyr Leu Ile Tyr Leu Leu Asp Thr Val Val Ser Tyr Phe Leu
165 170 175
Phe Ala Tyr Lys Ser Ala Ile Leu Asn Ala His Gln Arg Val Asp Ile
180 185 190
Val Ser Asn Ile Leu Thr Ile Thr Gln Gly Ala Met Asn Leu Val Gln
195 200 205
Ile Ile Met Leu Ile Ala Phe Lys Asn Tyr Tyr Cys Tyr Ile Ile Trp
210 215 220
Met Pro Leu Phe Thr Ile Leu Asn Asn Ile Met Thr Ala Tyr Cys Val
225 230 235 240
Asn Lys Leu Phe Pro Gln Tyr His Cys Glu Gly Lys Ile Ser Arg Asp
245 250 255
Gln Leu Ser Asp Met Lys Tyr Lys Ile Ser Gly Leu Met Val Asn Lys
260 265 270
Leu Cys Leu Thr Thr Arg Asn Thr Leu Asp Ser Val Phe Ile Ser Ala
275 280 285
Phe Met Gly Leu Thr Val Ser Ala Ile Tyr Gly Asn Tyr Tyr Tyr Ile
290 295 300
Leu Asn Ala Val Ile Gly Leu Ile Ser Ile Val Ser Ser Ala Met Leu
305 310 315 320
Ala Gly Val Gly Asn Ser Ile Glu Thr Glu Ser Val Glu Lys Asn Tyr
325 330 335
Asn Asp Leu Lys Lys Phe Asn Phe Leu Tyr Met Trp Leu Ser Gly Trp
340 345 350
Cys Ala Ile Cys Met Leu Cys Leu Thr Gln Pro Phe Met Thr Ile Trp
355 360 365
Met Gly Lys Asp Asn Leu Phe Pro Phe Gly Val Val Val Leu Ile Cys
370 375 380
Ile Tyr Phe Tyr Val Leu Lys Met Gly Asp Met Arg Gly Leu Tyr Ser
385 390 395 400
Asp Ala Ala Gly Leu Trp Trp Gln Asn Arg Tyr Arg Ala Ile Ser Glu
405 410 415
Ser Ile Leu Asn Leu Val Leu Asn Tyr Val Leu Val Gln Val Trp Gly
420 425 430
Ile Tyr Gly Ile Ile Ile Ala Thr Leu Ile Ser Leu Phe Phe Ile Asn
435 440 445
Phe Leu Gly Gly Ser Gly Ile Val Phe Lys His Tyr Phe Lys Asn Gly
450 455 460
Lys Phe Ile Glu Phe Leu Lys Tyr His Phe Phe Tyr Met Leu Ile Thr
465 470 475 480
Val Ile Asn Ala Ser Ile Cys Val Phe Leu Thr Asn Phe Val Lys Tyr
485 490 495
Glu Gly Ile Ile Gly Leu Gly Leu Arg Ala Ile Ile Cys Val Ile Ile
500 505 510
Pro Asn Val Ile Tyr Ala Leu Val Tyr Leu Asn Ile Ser Glu Leu Arg
515 520 525
Gln Gln Ser Lys Trp Val Leu Ser Lys Leu Lys Val Arg Arg
530 535 540
<210> 131
<211> 915
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsL gene of DSM 33141
<400> 131
ttggaggaaa aattggaacg aaaaaaaaag aaaaaaaaga atatttgggt tataattata 60
cctatcttaa tttttattac ccttatagga gcaggggctt atgccttaag aaattcactt 120
attcctactg atcatacgaa aacaaatagt tcggatcaac cgaccaaaac ttcggcctct 180
aacggttatg tagagcaaaa aggggaagaa gctgctgtgg gtagtatagc acttgtagat 240
gatgctggtg taccagaatg ggttaaagtt ccctcaaggg taaatctaga taaatttact 300
gatttatcca cgaataatat cactatttat cgaattaaca atccggaagt cttaaaaaca 360
gttaccaatc gtacagatca acggatgaaa atgtcagaag ttatagctaa gtatcctaat 420
gctttgatta tgaatgcttc cgcatttgat atgcagacag gacaagtagc tggatttcaa 480
attaataatg gaaagttgat tcaagactgg agcccaggta caacgactca atatgctttt 540
gttattaaca aagatggttc gtgcaaaatt tatgattcaa gtacacctgc ttcaactatt 600
attaaaaacg gaggacaaca agcctatgat tttggtactg caattatccg tgatggtaaa 660
attcaaccaa gtgatggctc agtagattgg aagatccata tttttattgc gaatgataaa 720
gataataatc tctatgctat cttgagtgat acaaatgcag gctatgataa tataatgaaa 780
tcagtgtcaa atttgaagct ccaaaatatg ttattacttg atagtggtgg ctcaagtcaa 840
ctatctgtca atggtaaaac gattgttgct agtcaagatg atcgagccgt accggattat 900
attgtaatga aataa 915
<210> 132
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsL
<400> 132
Leu Glu Glu Lys Leu Glu Arg Lys Lys Lys Lys Lys Lys Asn Ile Trp
1 5 10 15
Val Ile Ile Ile Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly
20 25 30
Ala Tyr Ala Leu Arg Asn Ser Leu Ile Pro Thr Asp His Thr Lys Thr
35 40 45
Asn Ser Ser Asp Gln Pro Thr Lys Thr Ser Ala Ser Asn Gly Tyr Val
50 55 60
Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp
65 70 75 80
Asp Ala Gly Val Pro Glu Trp Val Lys Val Pro Ser Arg Val Asn Leu
85 90 95
Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile
100 105 110
Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg
115 120 125
Met Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Ala Leu Ile Met
130 135 140
Asn Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln
145 150 155 160
Ile Asn Asn Gly Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Thr
165 170 175
Gln Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp
180 185 190
Ser Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Gln Gln Ala
195 200 205
Tyr Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser
210 215 220
Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys
225 230 235 240
Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp
245 250 255
Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu
260 265 270
Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile
275 280 285
Val Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 133
<211> 924
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative LytR family transcriptional regulator protein
<400> 133
atgtatatat atggagatac catgaatcaa aaaaagaggc gtcattatcg taagaaaaaa 60
cacacagtac taaaagttat ttcaattatt tttgtattag taattattgc tgttgcttct 120
atagcctacg ccgcttatag aaatgttgaa tcaacatttt caacatcata tgaaaatttc 180
cctaaaacaa caagtatcga cttaaaaaag tctaaaacat tcaccacact tatcattgca 240
actggtaaaa ataattctaa aaatacagct tatgctactg ttttagcttc aacgaatgta 300
aagacaaatc aaactacttt catgaatttc ccagtttttg cgacaatgcc taatcaaaaa 360
acaatcactg aagtttacaa tacgaatgga gatgatggaa ttttccagat agttaaagac 420
ctattgaatg tgtccattaa caaagtaatt cagattgatg ttaataaaat gggatcactt 480
gtacaggcca ctggtggaat caccatgcaa aatccaaagg cattcaatgc tgaaggttat 540
gagtttaaac aaggaactgt taatttacaa actgctgatc aagtccaagc ctatatgaca 600
caaattgacg atactgattt ggatgcttca atcactcgaa ttcaaaatgt ctcaatggaa 660
ctctacggaa atattcaaaa aattgctcat atgaaaaaac ttgaaagttt caattactat 720
cgagaaattc tctatgcttt ttcaaacact gttaaaacca atataagttt caatgatgct 780
aaaacgatcg ttttgagcta caataaggct ctaaagaata ccagcaagct caatctacat 840
acaacagatg aaaatggagc taaggtcgtt tctcaaacag aattagactc agtcaaaacc 900
ctttttgaaa aatctctaaa ataa 924
<210> 134
<211> 307
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_lytR
<400> 134
Met Tyr Ile Tyr Gly Asp Thr Met Asn Gln Lys Lys Arg Arg His Tyr
1 5 10 15
Arg Lys Lys Lys His Thr Val Leu Lys Val Ile Ser Ile Ile Phe Val
20 25 30
Leu Val Ile Ile Ala Val Ala Ser Ile Ala Tyr Ala Ala Tyr Arg Asn
35 40 45
Val Glu Ser Thr Phe Ser Thr Ser Tyr Glu Asn Phe Pro Lys Thr Thr
50 55 60
Ser Ile Asp Leu Lys Lys Ser Lys Thr Phe Thr Thr Leu Ile Ile Ala
65 70 75 80
Thr Gly Lys Asn Asn Ser Lys Asn Thr Ala Tyr Ala Thr Val Leu Ala
85 90 95
Ser Thr Asn Val Lys Thr Asn Gln Thr Thr Phe Met Asn Phe Pro Val
100 105 110
Phe Ala Thr Met Pro Asn Gln Lys Thr Ile Thr Glu Val Tyr Asn Thr
115 120 125
Asn Gly Asp Asp Gly Ile Phe Gln Ile Val Lys Asp Leu Leu Asn Val
130 135 140
Ser Ile Asn Lys Val Ile Gln Ile Asp Val Asn Lys Met Gly Ser Leu
145 150 155 160
Val Gln Ala Thr Gly Gly Ile Thr Met Gln Asn Pro Lys Ala Phe Asn
165 170 175
Ala Glu Gly Tyr Glu Phe Lys Gln Gly Thr Val Asn Leu Gln Thr Ala
180 185 190
Asp Gln Val Gln Ala Tyr Met Thr Gln Ile Asp Asp Thr Asp Leu Asp
195 200 205
Ala Ser Ile Thr Arg Ile Gln Asn Val Ser Met Glu Leu Tyr Gly Asn
210 215 220
Ile Gln Lys Ile Ala His Met Lys Lys Leu Glu Ser Phe Asn Tyr Tyr
225 230 235 240
Arg Glu Ile Leu Tyr Ala Phe Ser Asn Thr Val Lys Thr Asn Ile Ser
245 250 255
Phe Asn Asp Ala Lys Thr Ile Val Leu Ser Tyr Asn Lys Ala Leu Lys
260 265 270
Asn Thr Ser Lys Leu Asn Leu His Thr Thr Asp Glu Asn Gly Ala Lys
275 280 285
Val Val Ser Gln Thr Glu Leu Asp Ser Val Lys Thr Leu Phe Glu Lys
290 295 300
Ser Leu Lys
305
<210> 135
<211> 1296
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative nucleotide sugar dehydrogenase protein
<400> 135
ttgaaaatta agatagtata tttggtaaag agaagaggaa taaatatgag tcaagagtat 60
aaaatagctg ttgctggaac tggatatgtt ggattatcaa tcgctacact tttatcacaa 120
aatcatcggg tggttgctgt agacattgta aaagagaagg ttgatttgat caataaaaga 180
aagagtccaa ttcaagatga atatattgaa aaatatttgt cagagaaaaa gttaaattta 240
gaagctacat tagatgcaga atatgcctac aaggatgcag attttgttgt aattgcagca 300
cctacaaatt atgatagcaa aacacaatat tttgatactt cagcagttga agcagttata 360
aacatggtaa ctaaactaaa ttctaatgcg gtaatggtta ttaaatctac aattccagtt 420
ggatatacaa gagaaattcg taaaaaaaca gggaataata atataatgtt tagtccagag 480
tttcttagag aaagtaaggc attatatgat aatttgtatc catcgcgtat tattgtggga 540
actgatttac acgatgagaa attaattgaa gctgcccata tctttgcaga attattgaaa 600
gaaggggcta ttaaggaaaa tattgacaca ctttttatgg gatttacgga agcagaagca 660
gttaaattgt ttgcaaatac atatttagct ttgcgagtag catattttaa tgaattggat 720
acatatgcag aaagtaaaaa tttgaataca aaacaaatta tagaaggtgt ttgcttagat 780
ccacgtattg gtacacatta taataatcca tctttcggat atggtggata ctgtttacct 840
aaagatacaa aacagttatt ggctaattat gccgatgttc cagaaaatct aattgaagcg 900
attgtggaaa gcaatcatac acgtaaagat tttattgcta gtcaagtttt aaaaattgca 960
ggatattaca attatgaaga tgatatagag tatgataaaa atcaagaaaa aaaagttgta 1020
ataggtgttt atagattaac aatgaaatca aattctgata attttcgcca gtctagcata 1080
caaggtataa tgaagcgaat taaagcaaaa ggtgcagaag ttgttatttt tgagccaaca 1140
ttagaaaatg gaagtacttt ctttggctca aaaattgtaa ataatttaaa acaatttaaa 1200
gaacaaagtc aagcgattat agctaataga tatgattctt gtctggacga tgtaaaagaa 1260
aaagtatata ccagagattt gtttagaaga gattaa 1296
<210> 136
<211> 431
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_nucleotide_sugar_dehydrogenase
<400> 136
Leu Lys Ile Lys Ile Val Tyr Leu Val Lys Arg Arg Gly Ile Asn Met
1 5 10 15
Ser Gln Glu Tyr Lys Ile Ala Val Ala Gly Thr Gly Tyr Val Gly Leu
20 25 30
Ser Ile Ala Thr Leu Leu Ser Gln Asn His Arg Val Val Ala Val Asp
35 40 45
Ile Val Lys Glu Lys Val Asp Leu Ile Asn Lys Arg Lys Ser Pro Ile
50 55 60
Gln Asp Glu Tyr Ile Glu Lys Tyr Leu Ser Glu Lys Lys Leu Asn Leu
65 70 75 80
Glu Ala Thr Leu Asp Ala Glu Tyr Ala Tyr Lys Asp Ala Asp Phe Val
85 90 95
Val Ile Ala Ala Pro Thr Asn Tyr Asp Ser Lys Thr Gln Tyr Phe Asp
100 105 110
Thr Ser Ala Val Glu Ala Val Ile Asn Met Val Thr Lys Leu Asn Ser
115 120 125
Asn Ala Val Met Val Ile Lys Ser Thr Ile Pro Val Gly Tyr Thr Arg
130 135 140
Glu Ile Arg Lys Lys Thr Gly Asn Asn Asn Ile Met Phe Ser Pro Glu
145 150 155 160
Phe Leu Arg Glu Ser Lys Ala Leu Tyr Asp Asn Leu Tyr Pro Ser Arg
165 170 175
Ile Ile Val Gly Thr Asp Leu His Asp Glu Lys Leu Ile Glu Ala Ala
180 185 190
His Ile Phe Ala Glu Leu Leu Lys Glu Gly Ala Ile Lys Glu Asn Ile
195 200 205
Asp Thr Leu Phe Met Gly Phe Thr Glu Ala Glu Ala Val Lys Leu Phe
210 215 220
Ala Asn Thr Tyr Leu Ala Leu Arg Val Ala Tyr Phe Asn Glu Leu Asp
225 230 235 240
Thr Tyr Ala Glu Ser Lys Asn Leu Asn Thr Lys Gln Ile Ile Glu Gly
245 250 255
Val Cys Leu Asp Pro Arg Ile Gly Thr His Tyr Asn Asn Pro Ser Phe
260 265 270
Gly Tyr Gly Gly Tyr Cys Leu Pro Lys Asp Thr Lys Gln Leu Leu Ala
275 280 285
Asn Tyr Ala Asp Val Pro Glu Asn Leu Ile Glu Ala Ile Val Glu Ser
290 295 300
Asn His Thr Arg Lys Asp Phe Ile Ala Ser Gln Val Leu Lys Ile Ala
305 310 315 320
Gly Tyr Tyr Asn Tyr Glu Asp Asp Ile Glu Tyr Asp Lys Asn Gln Glu
325 330 335
Lys Lys Val Val Ile Gly Val Tyr Arg Leu Thr Met Lys Ser Asn Ser
340 345 350
Asp Asn Phe Arg Gln Ser Ser Ile Gln Gly Ile Met Lys Arg Ile Lys
355 360 365
Ala Lys Gly Ala Glu Val Val Ile Phe Glu Pro Thr Leu Glu Asn Gly
370 375 380
Ser Thr Phe Phe Gly Ser Lys Ile Val Asn Asn Leu Lys Gln Phe Lys
385 390 395 400
Glu Gln Ser Gln Ala Ile Ile Ala Asn Arg Tyr Asp Ser Cys Leu Asp
405 410 415
Asp Val Lys Glu Lys Val Tyr Thr Arg Asp Leu Phe Arg Arg Asp
420 425 430
<210> 137
<211> 780
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsC gene of DSM 33141
<400> 137
atgcaggaaa cacaggaaca aacgattgat ttaagaggga tttttaaaat tattcgcaaa 60
aggttaggtt taatattatt tagtgcttta atagtcacaa tattagggag catctacaca 120
ttttttatag cctccccagt ttacacagcc tcaactcaac ttgtcgttaa actaccaaat 180
tcggataatt cagcagccta cgctggacaa gtgaccggga atattcagat ggcgaacaca 240
attaaccaag ttattgttag tccagtcatt ttagataaag ttcaaagtaa tttaaatcta 300
tctgatgact ctttccaaaa acaagttaca gcagcaaatc aaacaaattc acaagttatt 360
acgcttactg ttaaatattc taatccttac attgcacaaa agattgcaga cgagactgct 420
aaaattttta gttcagacgc agcaaaacta ttgaatgtta ctaacgttaa tattctatcc 480
aaagcaaaag ttcaaacaac acccattagt cctaaaccta aattgtattt agcgatatct 540
gttatagccg gactagtttt aggtttagcc attgctttat tgaaggaatc gtttgataac 600
aaaattaata aagaagaaga tattgaagct ctggggctaa cggttcttgg tgtaacaacc 660
tatgctcaaa tgagtgattt taataagaat acaaataaaa atggcacgca atcgggaact 720
aagtcaagtc cgcctagcga ccatgaagta aatagatcat caaaaaggaa taaaagatag 780
<210> 138
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsC
<400> 138
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Ile Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Val Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Ser Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 139
<211> 459
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsE1 gene of DSM 33141
<400> 139
atggaagttt ttgaggatgc ctcagcacct gaatcggaag aacacaaatt agtagtatta 60
aaaaattttt cttatggaga gctaattata aaaagagcaa ttgatatcct aggaggatta 120
gcgggttcag ttttatttct tattgcggct gcattgcttt atgtccctta caaaatgagc 180
tcaaaaaaag atcaagggcc aatgttctat aaacaaaaac ggtatggaaa aaatggtaaa 240
attttttata ttttgaaatt tagaacaatg attcttaatg ctgagcagta tttagagcta 300
catccagaag ttaaagccgc ctatcatgcc aatggcaata aactagaaaa tgatccgcgt 360
gtgacgaaga ttggttcatt tattagacaa cactcaattg atgaattacc acaatttatc 420
aatgtcctta aaggagtcgg cggtataggg cacctctaa 459
<210> 140
<211> 152
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsE1
<400> 140
Met Glu Val Phe Glu Asp Ala Ser Ala Pro Glu Ser Glu Glu His Lys
1 5 10 15
Leu Val Val Leu Lys Asn Phe Ser Tyr Gly Glu Leu Ile Ile Lys Arg
20 25 30
Ala Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Lys Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Val Gly Gly Ile Gly His Leu
145 150
<210> 141
<211> 1476
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsE2 gene of DSM 33141
<400> 141
atggaagggg aaagaaaaaa atctatgtat agaaaagatt ctgaaggatg gttaaagcac 60
gcagacttta tagtcttaga tatgatctgt ttgcaattag cgtatgttct ggcatatgca 120
attagcggat atggatttaa tccatatgaa acaattattt atcgcaatat ggcagttttc 180
cttgaattgg cagatctggt tatgattttt gcatatggca ccatgaaaag cgtgttaaag 240
agaggatact atcgtgattt tgctgttacg ttaaatcatg cgattatggt aggtgcctta 300
gcggttttat atttattcct gcttcaggaa gggcaggact tttcaagatt aacattgatg 360
ctgaccataa taatttattt agtaatgacg tatattgtca gagaactttg gaaaaaactt 420
ctgcgaaaac agatgaagga tggcggagaa cgtaaactac tgattgtaac atcagaagat 480
gtggctgaac aagtagtgtt aagtatgcag gaaaataatt atgccagatt ttcattggct 540
ggtgtagctg taattgatgc ggactggact ggaagagaaa ttcatggagt tccggtagtt 600
gccaacgaag agactgcagc aatgtatgta tgtcaggaat ggattgacga agttctggtt 660
gttgtttcag aagttcttcc gtatccggca gagttaattg agcagttatc agagacagga 720
gtaaccattc atcttaatct tgcaaagatt acaagtgtgc caggaaaaaa acaatttgtg 780
gaaaaagttg gtaattacac agttcttacg acaagtatta attatgcatc aaccagacag 840
ttaatgttaa aacgattgat ggatattgcg ggtggattag ttggatgtat ttttaccgga 900
atcatttgta tttttgtcgg accggcaatt tatattgcat caccgggacc aattttcttt 960
gctcaggaac gagtaggaaa gaatggaaag aaatttaaaa tgtacaagtt ccgcagtatg 1020
tatatggatg cagaagagcg taaggcagag cttatgaaag ataacaaact tggagatgga 1080
aagatgttta aactggactt tgatcctcgt gttatcggaa ataagatact tccagatggg 1140
acacataaga caggaatcag tgattttatc agacgaacaa gtttagatga atttccgcaa 1200
ttctttaacg tattacgagg cgatatgtcg attataggta ctcgcccacc cttgatttca 1260
gaaacgaatc tgtatgagct tcaccatcgt gcaaggctgg caattaagcc gggaatcact 1320
ggcatgtggc aggtaagtgg acgaagtgat attactgatt ttgaagaagt agttcgtctt 1380
gataaagagt atatcacaaa ctggaacatt gggctagata taaaaatatt atttaaaacg 1440
atattggtag tctttaagaa agatggatca atgtaa 1476
<210> 142
<211> 491
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_epsE2
<400> 142
Met Glu Gly Glu Arg Lys Lys Ser Met Tyr Arg Lys Asp Ser Glu Gly
1 5 10 15
Trp Leu Lys His Ala Asp Phe Ile Val Leu Asp Met Ile Cys Leu Gln
20 25 30
Leu Ala Tyr Val Leu Ala Tyr Ala Ile Ser Gly Tyr Gly Phe Asn Pro
35 40 45
Tyr Glu Thr Ile Ile Tyr Arg Asn Met Ala Val Phe Leu Glu Leu Ala
50 55 60
Asp Leu Val Met Ile Phe Ala Tyr Gly Thr Met Lys Ser Val Leu Lys
65 70 75 80
Arg Gly Tyr Tyr Arg Asp Phe Ala Val Thr Leu Asn His Ala Ile Met
85 90 95
Val Gly Ala Leu Ala Val Leu Tyr Leu Phe Leu Leu Gln Glu Gly Gln
100 105 110
Asp Phe Ser Arg Leu Thr Leu Met Leu Thr Ile Ile Ile Tyr Leu Val
115 120 125
Met Thr Tyr Ile Val Arg Glu Leu Trp Lys Lys Leu Leu Arg Lys Gln
130 135 140
Met Lys Asp Gly Gly Glu Arg Lys Leu Leu Ile Val Thr Ser Glu Asp
145 150 155 160
Val Ala Glu Gln Val Val Leu Ser Met Gln Glu Asn Asn Tyr Ala Arg
165 170 175
Phe Ser Leu Ala Gly Val Ala Val Ile Asp Ala Asp Trp Thr Gly Arg
180 185 190
Glu Ile His Gly Val Pro Val Val Ala Asn Glu Glu Thr Ala Ala Met
195 200 205
Tyr Val Cys Gln Glu Trp Ile Asp Glu Val Leu Val Val Val Ser Glu
210 215 220
Val Leu Pro Tyr Pro Ala Glu Leu Ile Glu Gln Leu Ser Glu Thr Gly
225 230 235 240
Val Thr Ile His Leu Asn Leu Ala Lys Ile Thr Ser Val Pro Gly Lys
245 250 255
Lys Gln Phe Val Glu Lys Val Gly Asn Tyr Thr Val Leu Thr Thr Ser
260 265 270
Ile Asn Tyr Ala Ser Thr Arg Gln Leu Met Leu Lys Arg Leu Met Asp
275 280 285
Ile Ala Gly Gly Leu Val Gly Cys Ile Phe Thr Gly Ile Ile Cys Ile
290 295 300
Phe Val Gly Pro Ala Ile Tyr Ile Ala Ser Pro Gly Pro Ile Phe Phe
305 310 315 320
Ala Gln Glu Arg Val Gly Lys Asn Gly Lys Lys Phe Lys Met Tyr Lys
325 330 335
Phe Arg Ser Met Tyr Met Asp Ala Glu Glu Arg Lys Ala Glu Leu Met
340 345 350
Lys Asp Asn Lys Leu Gly Asp Gly Lys Met Phe Lys Leu Asp Phe Asp
355 360 365
Pro Arg Val Ile Gly Asn Lys Ile Leu Pro Asp Gly Thr His Lys Thr
370 375 380
Gly Ile Ser Asp Phe Ile Arg Arg Thr Ser Leu Asp Glu Phe Pro Gln
385 390 395 400
Phe Phe Asn Val Leu Arg Gly Asp Met Ser Ile Ile Gly Thr Arg Pro
405 410 415
Pro Leu Ile Ser Glu Thr Asn Leu Tyr Glu Leu His His Arg Ala Arg
420 425 430
Leu Ala Ile Lys Pro Gly Ile Thr Gly Met Trp Gln Val Ser Gly Arg
435 440 445
Ser Asp Ile Thr Asp Phe Glu Glu Val Val Arg Leu Asp Lys Glu Tyr
450 455 460
Ile Thr Asn Trp Asn Ile Gly Leu Asp Ile Lys Ile Leu Phe Lys Thr
465 470 475 480
Ile Leu Val Val Phe Lys Lys Asp Gly Ser Met
485 490
<210> 143
<211> 768
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT4 protein
<400> 143
atgaaaattt tattgcttat aaattggaaa ataaaatatt gtgaccatat accagaggat 60
ttgcaacctt cagattatga ttgtcctcag gaggtatatt ggttttttaa atactttaaa 120
gataaaccag aggtagatgt tgttgatatt agtgcaccaa aatttattga aaagatagaa 180
aataaagttc ggtttcattt ttttcaaact tttaaaatat tatttaaatt gaataaatat 240
gatttgattt ttgtttgtgg atctaatagt gcaatgctgt tgtgtgcatt aaaaagagtg 300
tttcacataa aaacacctcc aatattggat gttgatataa gttcatttca tcaagcatat 360
acttcaggat tgattcatag attatcacag ttttctagta gggcttttga ttatatggta 420
tatcatacta gttcacaata tgattattat atggaatatt ttccatggtt aaaagataag 480
tgcaaatttg ttccatttgg tgttgattat aattattgga aattaaaaac gtatgaagat 540
atacctgaaa aggatcaata tattgtttgt gtaggatata gaaaaagaga ttggaataca 600
ttactaaaag cgtttgataa aatagatatt ccagaaaaat tatatcttat tggaaatcca 660
gatattaaat gtgataatcc aaaagtaaaa gtgcttcctt ttatcccagt tgcagcagat 720
ggctttggga gttccggtat tagcagcaga tgtttcagct atacgtga 768
<210> 144
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT4
<400> 144
Met Lys Ile Leu Leu Leu Ile Asn Trp Lys Ile Lys Tyr Cys Asp His
1 5 10 15
Ile Pro Glu Asp Leu Gln Pro Ser Asp Tyr Asp Cys Pro Gln Glu Val
20 25 30
Tyr Trp Phe Phe Lys Tyr Phe Lys Asp Lys Pro Glu Val Asp Val Val
35 40 45
Asp Ile Ser Ala Pro Lys Phe Ile Glu Lys Ile Glu Asn Lys Val Arg
50 55 60
Phe His Phe Phe Gln Thr Phe Lys Ile Leu Phe Lys Leu Asn Lys Tyr
65 70 75 80
Asp Leu Ile Phe Val Cys Gly Ser Asn Ser Ala Met Leu Leu Cys Ala
85 90 95
Leu Lys Arg Val Phe His Ile Lys Thr Pro Pro Ile Leu Asp Val Asp
100 105 110
Ile Ser Ser Phe His Gln Ala Tyr Thr Ser Gly Leu Ile His Arg Leu
115 120 125
Ser Gln Phe Ser Ser Arg Ala Phe Asp Tyr Met Val Tyr His Thr Ser
130 135 140
Ser Gln Tyr Asp Tyr Tyr Met Glu Tyr Phe Pro Trp Leu Lys Asp Lys
145 150 155 160
Cys Lys Phe Val Pro Phe Gly Val Asp Tyr Asn Tyr Trp Lys Leu Lys
165 170 175
Thr Tyr Glu Asp Ile Pro Glu Lys Asp Gln Tyr Ile Val Cys Val Gly
180 185 190
Tyr Arg Lys Arg Asp Trp Asn Thr Leu Leu Lys Ala Phe Asp Lys Ile
195 200 205
Asp Ile Pro Glu Lys Leu Tyr Leu Ile Gly Asn Pro Asp Ile Lys Cys
210 215 220
Asp Asn Pro Lys Val Lys Val Leu Pro Phe Ile Pro Val Ala Ala Asp
225 230 235 240
Gly Phe Gly Ser Ser Gly Ile Ser Ser Arg Cys Phe Ser Tyr Thr
245 250 255
<210> 145
<211> 855
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT5 protein
<400> 145
atgaagtgtg taatgagcat actaaactat aatgacagca ggcgagcctt agaattagca 60
gaacgatgtg tggaatttga ttcaatagaa agaataatta ttgttgacaa taaaagtacg 120
gatgattctg taacatttct aacagagaga gtaggatcaa atatagaatt agtagttgca 180
tctgaaaata ggggttttgc tgctggaaat aatataggag caaaatatgc acaagaaaaa 240
tataatcctg aatatatact atttgcaaat acagatacga ttttctcaga aacggaagta 300
aatgcatgtt tgagtaaatt gaaagcaaaa gcagatttgg gattgatatc gatgaggata 360
aaagatataa agggaaatga agaaaaatca gcatggcatt ttaagtcttt cttagactat 420
acattattta atatttggat atatagacat attacataca aaaaaggcgt gtataaaagt 480
ttttccaatg attttcaata tgttgatatt gttagaggaa gttttatgtt atttaaaatg 540
aaagcactta tagaagctaa cttttttgat gaaaatacgt tcttatatta tgaagaggaa 600
attattgcat atcgtttacg taaacatgga tataaagttg gattattgac aaattatttc 660
tatatacaca atcatattgc aagtggtaca ggaaatatat ggtttataaa aaaacactta 720
gatgcttcat tgagatgggt tttgattaat tactataata ttggaaatgt aaaaataaga 780
atatttgatt ttgcaactaa aatttgtagt tgtgagactt ttttaataga aaaattgaaa 840
cggagaggaa aatga 855
<210> 146
<211> 284
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT5
<400> 146
Met Lys Cys Val Met Ser Ile Leu Asn Tyr Asn Asp Ser Arg Arg Ala
1 5 10 15
Leu Glu Leu Ala Glu Arg Cys Val Glu Phe Asp Ser Ile Glu Arg Ile
20 25 30
Ile Ile Val Asp Asn Lys Ser Thr Asp Asp Ser Val Thr Phe Leu Thr
35 40 45
Glu Arg Val Gly Ser Asn Ile Glu Leu Val Val Ala Ser Glu Asn Arg
50 55 60
Gly Phe Ala Ala Gly Asn Asn Ile Gly Ala Lys Tyr Ala Gln Glu Lys
65 70 75 80
Tyr Asn Pro Glu Tyr Ile Leu Phe Ala Asn Thr Asp Thr Ile Phe Ser
85 90 95
Glu Thr Glu Val Asn Ala Cys Leu Ser Lys Leu Lys Ala Lys Ala Asp
100 105 110
Leu Gly Leu Ile Ser Met Arg Ile Lys Asp Ile Lys Gly Asn Glu Glu
115 120 125
Lys Ser Ala Trp His Phe Lys Ser Phe Leu Asp Tyr Thr Leu Phe Asn
130 135 140
Ile Trp Ile Tyr Arg His Ile Thr Tyr Lys Lys Gly Val Tyr Lys Ser
145 150 155 160
Phe Ser Asn Asp Phe Gln Tyr Val Asp Ile Val Arg Gly Ser Phe Met
165 170 175
Leu Phe Lys Met Lys Ala Leu Ile Glu Ala Asn Phe Phe Asp Glu Asn
180 185 190
Thr Phe Leu Tyr Tyr Glu Glu Glu Ile Ile Ala Tyr Arg Leu Arg Lys
195 200 205
His Gly Tyr Lys Val Gly Leu Leu Thr Asn Tyr Phe Tyr Ile His Asn
210 215 220
His Ile Ala Ser Gly Thr Gly Asn Ile Trp Phe Ile Lys Lys His Leu
225 230 235 240
Asp Ala Ser Leu Arg Trp Val Leu Ile Asn Tyr Tyr Asn Ile Gly Asn
245 250 255
Val Lys Ile Arg Ile Phe Asp Phe Ala Thr Lys Ile Cys Ser Cys Glu
260 265 270
Thr Phe Leu Ile Glu Lys Leu Lys Arg Arg Gly Lys
275 280
<210> 147
<211> 1107
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative GT6 protein
<400> 147
atgattttga tgaaagttgg ttttatatca aactctgatg tttatgataa acgagcgtgg 60
agtgggacaa taaattttct ttatgaaaca ttaaataaag aatatgatat gtatccgatt 120
gtgatagaac ataaaataat tcaaaaaatg tcacgtataa ttacaaaggg aaaacgtaag 180
tataccttat tagatagctt tttttataaa ttagatatta acagaaaaat aaaaaaagct 240
caaaaaaaag gcataaaagt gttttttgct ccggcagctt caacaatatt aggggttgca 300
aagattccta tagattgcaa agtagtttat ttgagcgatg caacatatca ttgtatgtta 360
aattattatt actttaatga aagcaaagga gatataaaaa attataatag agtagagcag 420
aagtcattgt gtagagcaga caaagtcata ttttcaagtg aatgggctaa aaacgatgca 480
ataatatatt atggagtgga ttcaaataag atacatgtct taccatttgg agctaattta 540
gaagataaat atagcggaca tgatatggga gatatagtga aaatcctatt tgttggagta 600
gagtgggaaa gaaaaggggc agaattggct attgaatgcg taaagaattt gaatagaaga 660
aactataaaa agcggtttga attgacgatt atcggattag aaaaaccaga aaaatatctg 720
gctgatgaca gtattcattt tgtagggaga ttaaataaaa ataatcggga tgaattaaat 780
tgcatgatta aatattatca acagagtgat atttttcttt tacctactaa agccgaatgt 840
tctgctattg tgtttagtga ggccgctatg tatggattgc cagtgtttac tcataatacg 900
ggaggtgtta tgacttatgt agaagatggc aaaacaggca gaggattaaa gttaggatca 960
aaagcagaag acttcgctga tgcaattctg aagatgttaa atgaagacaa atataaagaa 1020
tggtcaataa acgcaagaaa aaaatatgag aaagaattga attggaactg ttgggttgag 1080
aagtgtaaag agttaattga aaattaa 1107
<210> 148
<211> 368
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_GT6
<400> 148
Met Ile Leu Met Lys Val Gly Phe Ile Ser Asn Ser Asp Val Tyr Asp
1 5 10 15
Lys Arg Ala Trp Ser Gly Thr Ile Asn Phe Leu Tyr Glu Thr Leu Asn
20 25 30
Lys Glu Tyr Asp Met Tyr Pro Ile Val Ile Glu His Lys Ile Ile Gln
35 40 45
Lys Met Ser Arg Ile Ile Thr Lys Gly Lys Arg Lys Tyr Thr Leu Leu
50 55 60
Asp Ser Phe Phe Tyr Lys Leu Asp Ile Asn Arg Lys Ile Lys Lys Ala
65 70 75 80
Gln Lys Lys Gly Ile Lys Val Phe Phe Ala Pro Ala Ala Ser Thr Ile
85 90 95
Leu Gly Val Ala Lys Ile Pro Ile Asp Cys Lys Val Val Tyr Leu Ser
100 105 110
Asp Ala Thr Tyr His Cys Met Leu Asn Tyr Tyr Tyr Phe Asn Glu Ser
115 120 125
Lys Gly Asp Ile Lys Asn Tyr Asn Arg Val Glu Gln Lys Ser Leu Cys
130 135 140
Arg Ala Asp Lys Val Ile Phe Ser Ser Glu Trp Ala Lys Asn Asp Ala
145 150 155 160
Ile Ile Tyr Tyr Gly Val Asp Ser Asn Lys Ile His Val Leu Pro Phe
165 170 175
Gly Ala Asn Leu Glu Asp Lys Tyr Ser Gly His Asp Met Gly Asp Ile
180 185 190
Val Lys Ile Leu Phe Val Gly Val Glu Trp Glu Arg Lys Gly Ala Glu
195 200 205
Leu Ala Ile Glu Cys Val Lys Asn Leu Asn Arg Arg Asn Tyr Lys Lys
210 215 220
Arg Phe Glu Leu Thr Ile Ile Gly Leu Glu Lys Pro Glu Lys Tyr Leu
225 230 235 240
Ala Asp Asp Ser Ile His Phe Val Gly Arg Leu Asn Lys Asn Asn Arg
245 250 255
Asp Glu Leu Asn Cys Met Ile Lys Tyr Tyr Gln Gln Ser Asp Ile Phe
260 265 270
Leu Leu Pro Thr Lys Ala Glu Cys Ser Ala Ile Val Phe Ser Glu Ala
275 280 285
Ala Met Tyr Gly Leu Pro Val Phe Thr His Asn Thr Gly Gly Val Met
290 295 300
Thr Tyr Val Glu Asp Gly Lys Thr Gly Arg Gly Leu Lys Leu Gly Ser
305 310 315 320
Lys Ala Glu Asp Phe Ala Asp Ala Ile Leu Lys Met Leu Asn Glu Asp
325 330 335
Lys Tyr Lys Glu Trp Ser Ile Asn Ala Arg Lys Lys Tyr Glu Lys Glu
340 345 350
Leu Asn Trp Asn Cys Trp Val Glu Lys Cys Lys Glu Leu Ile Glu Asn
355 360 365
<210> 149
<211> 600
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative acetyltransferase protein
<400> 149
atggggggca aaaaagggga tttagaacag aatcatgagg acactttgac atcatataaa 60
caaatgaaag agatgttaaa aagagaaaaa aaatattatc caaatacatg gtttgacaat 120
ataagttgta atcaacgagt atacaactgg cgattcatga aattattacg cagatgcgag 180
ctttatagat ataaggcgga tcattctaag aatcctgtat ggaaaggatt atatttgatt 240
aatcgtacga aaaagaaccg gttaggtgtt tggattggag tagagatacc agagaatgtg 300
ttttcagaag gattaattat tcatcatagt ggaaatattg tagttaatgg aagtagcaaa 360
gtagggaaaa attgtcaact tcatggagat aattgtatag gaaattcagg aaaagaaaat 420
gaattaaaaa aatgcccaca gattggagat aatgtagaaa taggagttgg agcgaaagta 480
ttgggtggca ttactattgc aaataatgta aaaattggag ccaatgctgt agttacaaaa 540
tcattttatg aagagggcat tactcttgtt ggaatacctg cacataagct agaaaggtaa 600
<210> 150
<211> 199
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_acetyltransferase
<400> 150
Met Gly Gly Lys Lys Gly Asp Leu Glu Gln Asn His Glu Asp Thr Leu
1 5 10 15
Thr Ser Tyr Lys Gln Met Lys Glu Met Leu Lys Arg Glu Lys Lys Tyr
20 25 30
Tyr Pro Asn Thr Trp Phe Asp Asn Ile Ser Cys Asn Gln Arg Val Tyr
35 40 45
Asn Trp Arg Phe Met Lys Leu Leu Arg Arg Cys Glu Leu Tyr Arg Tyr
50 55 60
Lys Ala Asp His Ser Lys Asn Pro Val Trp Lys Gly Leu Tyr Leu Ile
65 70 75 80
Asn Arg Thr Lys Lys Asn Arg Leu Gly Val Trp Ile Gly Val Glu Ile
85 90 95
Pro Glu Asn Val Phe Ser Glu Gly Leu Ile Ile His His Ser Gly Asn
100 105 110
Ile Val Val Asn Gly Ser Ser Lys Val Gly Lys Asn Cys Gln Leu His
115 120 125
Gly Asp Asn Cys Ile Gly Asn Ser Gly Lys Glu Asn Glu Leu Lys Lys
130 135 140
Cys Pro Gln Ile Gly Asp Asn Val Glu Ile Gly Val Gly Ala Lys Val
145 150 155 160
Leu Gly Gly Ile Thr Ile Ala Asn Asn Val Lys Ile Gly Ala Asn Ala
165 170 175
Val Val Thr Lys Ser Phe Tyr Glu Glu Gly Ile Thr Leu Val Gly Ile
180 185 190
Pro Ala His Lys Leu Glu Arg
195
<210> 151
<211> 402
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33141 encoding a putative acyltransferase protein
<400> 151
atgttaaaat tacagctgga gtcaggttta ttacacttga ctccagccgt ttttggaatg 60
gctggttatc cagtacatga cgactgttta tattatatgg ataaaattat tattggagat 120
aatgtaatga ttggtgcaga cagtatagta atgcctggtg ttaaaatagg aaataatgtt 180
attatagctg ctggaagtgt tgtaacaaag gaaattcctg atggagtagt agcggggggg 240
ggccctgcta aggtaatagg aggatttgat gaattagcta aaaagcgata tgaacaatgt 300
aatcatagac catttctagc cgctcaagag ggtctccttt taaacttaat ttttcaagat 360
agtcgctatc tccaaataaa ttcatcttca atacctcgtt ag 402
<210> 152
<211> 133
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33141_acyltransferase
<400> 152
Met Leu Lys Leu Gln Leu Glu Ser Gly Leu Leu His Leu Thr Pro Ala
1 5 10 15
Val Phe Gly Met Ala Gly Tyr Pro Val His Asp Asp Cys Leu Tyr Tyr
20 25 30
Met Asp Lys Ile Ile Ile Gly Asp Asn Val Met Ile Gly Ala Asp Ser
35 40 45
Ile Val Met Pro Gly Val Lys Ile Gly Asn Asn Val Ile Ile Ala Ala
50 55 60
Gly Ser Val Val Thr Lys Glu Ile Pro Asp Gly Val Val Ala Gly Gly
65 70 75 80
Gly Pro Ala Lys Val Ile Gly Gly Phe Asp Glu Leu Ala Lys Lys Arg
85 90 95
Tyr Glu Gln Cys Asn His Arg Pro Phe Leu Ala Ala Gln Glu Gly Leu
100 105 110
Leu Leu Asn Leu Ile Phe Gln Asp Ser Arg Tyr Leu Gln Ile Asn Ser
115 120 125
Ser Ser Ile Pro Arg
130
<210> 153
<211> 324
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsR gene of DSM 33137
<400> 153
atgtttatga atgatttatt ttaccatcgt ctaaaggaac tagttgaagc aagtggtaaa 60
tctgcaaatc aaatagaaag ggaattgggt taccctagaa attctttgaa taattataag 120
ttgggaggag aaccctctgg gacaagatta ataggactat cagagtattt taatgtgtct 180
ccaaaatatc tgatgggtat aattgatgag cctaatgaca gttctgcaat taatcttttt 240
aaaactctaa ctcaagaaga gaaaaaagaa atgtttataa tttgtcaaaa atggcttttt 300
ttagaatatc aaatagagtt ataa 324
<210> 154
<211> 107
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsR
<400> 154
Met Phe Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu
1 5 10 15
Ala Ser Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro
20 25 30
Arg Asn Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr
35 40 45
Arg Leu Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu
50 55 60
Met Gly Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe
65 70 75 80
Lys Thr Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln
85 90 95
Lys Trp Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 155
<211> 768
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsX gene of DSM 33137
<400> 155
atgatgaaaa aaggaatttt tgtaattact atagtgatat ctatagcatt tataattgga 60
ggtttttata gttataattc taggataagt aatctttcaa aagctgataa aggaaaagaa 120
gttgtaaaaa atagcagtga aaaaaatcag atagacctta cctataaaaa gtattataaa 180
aatttaccaa aatcagttca aaataaaata gatgatattt catccaaaaa taaagaagtt 240
actttaactt gtatttggca atctgattca gttatttctg aacaatttca acaaaactta 300
caaaaatatt atggaaataa gttttggaac atcaaaaata tcacttacaa tggcgaaact 360
agtgaacaat tattggctga aaaagttgaa aaccaagtat tagccactaa tcctgatgtt 420
gttttatatg aagctccact ttttaatgat aaccaaaaca ttgaagcaac agcctcactg 480
actagtaatg agcaacttat aacaaatttg gctagtgcag gagcggaggt aatagttcaa 540
ccctctccac cgatctatgg tggtgttgtg taccccgtac aagaagaaca atttaaacaa 600
tctttatcta caaagtatcc ctatatagac tactgggcta gttacccaga caaaaattct 660
gatgaaatga agggactgtt ttctgatgat ggagtatata gaacattaaa tgcttcgggg 720
aataaggttt ggctagatta tattactaaa tattttacag caaactaa 768
<210> 156
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsX
<400> 156
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Phe Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Ser Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Leu
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Ala Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 157
<211> 765
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsB gene of DSM 33137
<400> 157
atgattgata ttcattgcca tattttaccg gggatagatg atggagctaa aacttctgga 60
gatactctga caatgctgaa atcagcaatt gatgaaggga taacaactat cactgcaact 120
cctcatcata atcctcaatt taataatgaa tcaccgctta ttttgaaaaa agttaaggaa 180
gttcaaaata tcattgacga acatcaatta ccaattgaag ttttacccgg acaagaggtg 240
agaatatatg gtgatttatt aaaagaattt tctgaaggga agttactgac agcagcgggc 300
acttcaagtt atatattgat tgaatttcca tcaaatcatg tgccagctta tgctaaagaa 360
cttttttata atattcaatt ggagggactt caacctattt tggtccaccc tgagcgtaat 420
agtggaatca ttgagaaccc ggatatatta tttgatttta ttgaacaagg agtactaagt 480
cagataacag cttcgagtgt cactggtcat tttggtaaaa aaatacaaaa gctgtcattt 540
aaaatgatag aaaaccatct gacgcatttt gttgcatcag atgcgcataa tgtgacgtca 600
cgtgcattta agatgaagga agcatttgaa atgattgaag atagttatgg ttctggtgta 660
tcacgaatgt ttcaaaataa tgcagagtca gtgattttaa acgaaagttt ttatcaagaa 720
aaaccaacaa agatcaaaac aaagaaattt ttaggattat tttaa 765
<210> 158
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsB
<400> 158
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Met Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 159
<211> 696
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsD gene of DSM 33137
<400> 159
atggctaaaa ataaaagaag catagacaac aatcgttata ttattaccag tgtcaatcct 60
caatcaccta tttccgaaca atatcgtacg attcgtacga ccattgattt taaaatggcg 120
gatcaaggga ttaaaagttt tctagtaaca tcttcagaag cagctgcagg taaatcatac 180
gagagtgcta atctagctgt tgcttttgca caacaaggta aaaaagtact tttaattgat 240
ggcgatcttc gtaaaccgac tgttaacatt acttttaaag tacaaaatag agtaggatta 300
accaatattt taatgcatca atcttcgatt gaagatgcca tacaagggac aagactttct 360
gaaaatctta caataattac ctctggtcca attccaccta atccatcgga attattagca 420
tctagtgcaa tgaagaattt gattgactct gtgtccgatt cctttgatgt tgttttgatt 480
gatgctccac ctctctatgc agttactgat gctcaaattt tgagtgttta tgtaggagga 540
gtggttcttg ttgtacgtgc ctatgaaaca aaaaaagaga gtttagcaaa agcaaaaaaa 600
atactggaac aagttaatgc aaatatatta ggagttgttt tgcatggggt agactcttct 660
gagtcaccgt cgtattacta ctacggagta gagtaa 696
<210> 160
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsD
<400> 160
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Tyr Glu Ser Ala Asn
50 55 60
Leu Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Ser Phe Asp Val Val Leu Ile
145 150 155 160
Asp Ala Pro Pro Leu Tyr Ala Val Thr Asp Ala Gln Ile Leu Ser Val
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Ala Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 161
<211> 1155
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative GT1 protein
<400> 161
atgaaggaaa aacatattta cattattggt tcaaaaggaa ttccagcaaa gtatggtggt 60
tttgagactt ttgtagaaga actaacagca catcaaagta ataaaaacct taagtatcat 120
gttgcttgtt tatcaaatga catacaatca aattttattc ataatggtgc cgactgtttt 180
aatattccaa agaaaaatat tggaccagca aatgccattt attatgattt ggcagcttta 240
aagtactcac ttaaagaaat tgaagaaaaa aattataagg gtgcaattat ttatatttta 300
gcttgccgca ttggtccgtt tattggttac tataaaaagc aaatgaaaaa attaggaatt 360
actttgatgg taaatcctga tggacatgag tggttgcgtg caaaatggag tgtacctgta 420
aaaaaatatt ggaaaatttc ggaacaatac atggttaaaa atgctgactt attgatctgt 480
gatagtaaga atattgagac ctatattcaa gaatcttatg cgaaatataa tacaaaaaca 540
acttatattg cctatggcgc agatttagct ccaagtcttt tgaaggataa tgacgaaaaa 600
ttagtaaatt ggtatcaaga aaagggtttg gaatctaatg gatattatct tgttgtaggt 660
cggtttgttc ctgaaaataa ttatgaaata atgattaggg aattcatgaa gtctgataca 720
aaaaaagatt ttgttttaat aacaaatgta gaacaaaata aattttatga tcaacttaaa 780
cagacaactg gatttgataa agataaacga ataaaatttg tgggaactgt atacgataaa 840
gagttaatca aaaaaatcag agaaaatgct ttcggatatt tccatggtca cgaagttgga 900
ggaacaaacc caagtttaat tgaagcattg gcgtcatcta aattaaatct cctgcttgat 960
gtaggattta ataaagaagt aggtgaagat gctgcactct attggaataa agaaaagcaa 1020
aatcttgcac aattaattaa acatgtagaa gaaaccgatt attctcacat ggaaagtaaa 1080
gcaaaagaaa gagttcaaca ttatttttct tggaattata ttgttggaga atatgagaaa 1140
gtatttacga aataa 1155
<210> 162
<211> 384
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_GT1
<400> 162
Met Lys Glu Lys His Ile Tyr Ile Ile Gly Ser Lys Gly Ile Pro Ala
1 5 10 15
Lys Tyr Gly Gly Phe Glu Thr Phe Val Glu Glu Leu Thr Ala His Gln
20 25 30
Ser Asn Lys Asn Leu Lys Tyr His Val Ala Cys Leu Ser Asn Asp Ile
35 40 45
Gln Ser Asn Phe Ile His Asn Gly Ala Asp Cys Phe Asn Ile Pro Lys
50 55 60
Lys Asn Ile Gly Pro Ala Asn Ala Ile Tyr Tyr Asp Leu Ala Ala Leu
65 70 75 80
Lys Tyr Ser Leu Lys Glu Ile Glu Glu Lys Asn Tyr Lys Gly Ala Ile
85 90 95
Ile Tyr Ile Leu Ala Cys Arg Ile Gly Pro Phe Ile Gly Tyr Tyr Lys
100 105 110
Lys Gln Met Lys Lys Leu Gly Ile Thr Leu Met Val Asn Pro Asp Gly
115 120 125
His Glu Trp Leu Arg Ala Lys Trp Ser Val Pro Val Lys Lys Tyr Trp
130 135 140
Lys Ile Ser Glu Gln Tyr Met Val Lys Asn Ala Asp Leu Leu Ile Cys
145 150 155 160
Asp Ser Lys Asn Ile Glu Thr Tyr Ile Gln Glu Ser Tyr Ala Lys Tyr
165 170 175
Asn Thr Lys Thr Thr Tyr Ile Ala Tyr Gly Ala Asp Leu Ala Pro Ser
180 185 190
Leu Leu Lys Asp Asn Asp Glu Lys Leu Val Asn Trp Tyr Gln Glu Lys
195 200 205
Gly Leu Glu Ser Asn Gly Tyr Tyr Leu Val Val Gly Arg Phe Val Pro
210 215 220
Glu Asn Asn Tyr Glu Ile Met Ile Arg Glu Phe Met Lys Ser Asp Thr
225 230 235 240
Lys Lys Asp Phe Val Leu Ile Thr Asn Val Glu Gln Asn Lys Phe Tyr
245 250 255
Asp Gln Leu Lys Gln Thr Thr Gly Phe Asp Lys Asp Lys Arg Ile Lys
260 265 270
Phe Val Gly Thr Val Tyr Asp Lys Glu Leu Ile Lys Lys Ile Arg Glu
275 280 285
Asn Ala Phe Gly Tyr Phe His Gly His Glu Val Gly Gly Thr Asn Pro
290 295 300
Ser Leu Ile Glu Ala Leu Ala Ser Ser Lys Leu Asn Leu Leu Leu Asp
305 310 315 320
Val Gly Phe Asn Lys Glu Val Gly Glu Asp Ala Ala Leu Tyr Trp Asn
325 330 335
Lys Glu Lys Gln Asn Leu Ala Gln Leu Ile Lys His Val Glu Glu Thr
340 345 350
Asp Tyr Ser His Met Glu Ser Lys Ala Lys Glu Arg Val Gln His Tyr
355 360 365
Phe Ser Trp Asn Tyr Ile Val Gly Glu Tyr Glu Lys Val Phe Thr Lys
370 375 380
<210> 163
<211> 1212
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the putative wzy gene of DSM 33137
<400> 163
gtgagtataa caaagataaa aaataatata ggaattttta tttttacaat tttagtcatt 60
tcaacatttg ctactggggg aattctgtca gaattagatg aaattgctgc ggtatgctca 120
tttctaattt tgattttttc ttttttttgt tcttgtttga agaaaaagat atccaagata 180
tttttaatta ttttattatt tattatattt ggattgattt caaacataat aagtgggata 240
gataggagtg tttttgatgt acttatagat attgtgactg ttgggaagcc attttggatt 300
tttttagcta tgattcaact cgtatctcca gatacttttg gctggttacg gagaaaattt 360
tctcttataa taaaaatatt tattttgatg ttatatattt ttctctttct caataccctg 420
catattgttc aaatggggaa tacattttta ttttctcaaa ttccgaattt tagtttcact 480
tttggctttc cagtgccctt tgctattgtt ttatactgtt gtataggatt tttacttaaa 540
aatgatataa acatagtgag acaaccatgg attatttctc taatattctt gataatcttc 600
actggtaaaa tgcaatctta tatttttgta gtcattttct taggcttttt gtctataaga 660
acatacagag agaaaaactt aaaaataagt agattaattt tgataggctt tataggcgtt 720
ttaatttcac taccaaaatt agtgaactac tttgcaacaa catcttattc tccgagaaag 780
ttattgatga ctgatggatt tggtttcgct ttaaaatatt tcccatttgg aacagggttt 840
gcaacgtttg gttctgctat ggctagtaaa gactattcat cattatatta tcaattaggt 900
tatagtaatt tttatgggat gcaacctggt ggtggtttag gttccttttt gaatgacaat 960
tggggggctt cattaattgg acaatttggc tttttaggaa caattttatt tagtattata 1020
attgttagaa tatacttttt aatgattgaa tattggggga acgaaaaaat tggtttatat 1080
ttaatatcag gttttactgg attaatttct ataattatag gatcaagttt ttttacaggt 1140
gcttcaggag cattattaat ggcaactttt ggaataatag tttcttatag aaaaaatgag 1200
atattgccat aa 1212
<210> 164
<211> 403
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_wzy
<400> 164
Val Ser Ile Thr Lys Ile Lys Asn Asn Ile Gly Ile Phe Ile Phe Thr
1 5 10 15
Ile Leu Val Ile Ser Thr Phe Ala Thr Gly Gly Ile Leu Ser Glu Leu
20 25 30
Asp Glu Ile Ala Ala Val Cys Ser Phe Leu Ile Leu Ile Phe Ser Phe
35 40 45
Phe Cys Ser Cys Leu Lys Lys Lys Ile Ser Lys Ile Phe Leu Ile Ile
50 55 60
Leu Leu Phe Ile Ile Phe Gly Leu Ile Ser Asn Ile Ile Ser Gly Ile
65 70 75 80
Asp Arg Ser Val Phe Asp Val Leu Ile Asp Ile Val Thr Val Gly Lys
85 90 95
Pro Phe Trp Ile Phe Leu Ala Met Ile Gln Leu Val Ser Pro Asp Thr
100 105 110
Phe Gly Trp Leu Arg Arg Lys Phe Ser Leu Ile Ile Lys Ile Phe Ile
115 120 125
Leu Met Leu Tyr Ile Phe Leu Phe Leu Asn Thr Leu His Ile Val Gln
130 135 140
Met Gly Asn Thr Phe Leu Phe Ser Gln Ile Pro Asn Phe Ser Phe Thr
145 150 155 160
Phe Gly Phe Pro Val Pro Phe Ala Ile Val Leu Tyr Cys Cys Ile Gly
165 170 175
Phe Leu Leu Lys Asn Asp Ile Asn Ile Val Arg Gln Pro Trp Ile Ile
180 185 190
Ser Leu Ile Phe Leu Ile Ile Phe Thr Gly Lys Met Gln Ser Tyr Ile
195 200 205
Phe Val Val Ile Phe Leu Gly Phe Leu Ser Ile Arg Thr Tyr Arg Glu
210 215 220
Lys Asn Leu Lys Ile Ser Arg Leu Ile Leu Ile Gly Phe Ile Gly Val
225 230 235 240
Leu Ile Ser Leu Pro Lys Leu Val Asn Tyr Phe Ala Thr Thr Ser Tyr
245 250 255
Ser Pro Arg Lys Leu Leu Met Thr Asp Gly Phe Gly Phe Ala Leu Lys
260 265 270
Tyr Phe Pro Phe Gly Thr Gly Phe Ala Thr Phe Gly Ser Ala Met Ala
275 280 285
Ser Lys Asp Tyr Ser Ser Leu Tyr Tyr Gln Leu Gly Tyr Ser Asn Phe
290 295 300
Tyr Gly Met Gln Pro Gly Gly Gly Leu Gly Ser Phe Leu Asn Asp Asn
305 310 315 320
Trp Gly Ala Ser Leu Ile Gly Gln Phe Gly Phe Leu Gly Thr Ile Leu
325 330 335
Phe Ser Ile Ile Ile Val Arg Ile Tyr Phe Leu Met Ile Glu Tyr Trp
340 345 350
Gly Asn Glu Lys Ile Gly Leu Tyr Leu Ile Ser Gly Phe Thr Gly Leu
355 360 365
Ile Ser Ile Ile Ile Gly Ser Ser Phe Phe Thr Gly Ala Ser Gly Ala
370 375 380
Leu Leu Met Ala Thr Phe Gly Ile Ile Val Ser Tyr Arg Lys Asn Glu
385 390 395 400
Ile Leu Pro
<210> 165
<211> 849
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative GT2 protein
<400> 165
atgattttct actgtgtagt tttatataat aaaaaaattg acgaagcaat tactataaaa 60
aatctatttg agtgtaattt agataacaga aaaattgtag tatttgataa tagcgataaa 120
ttagatttta gagaatataa ttcaaaatat tataacgaaa aaatatattg tttactacaa 180
tcttccgata aaaatgttgg cctatcagca gcgtataata gaatattaga aaaattatgc 240
agagagtttg atatagaaaa cgaaaagcaa tatgtttgtt ggttagatga tgacacagac 300
atatctcctg aatttttaat taaacaagaa aaagccataa atgagaatta tgatattatt 360
gttcctaaaa ttataggaca agatggaatt gtctattctc caaatgaagc aggaaaaata 420
aaaaataatt tagttttaaa taattcaaat aaaaaaattt caaagttaaa attcaatgct 480
attaatagtt gtttaacagt aaaaacttca atttactcga attttaaatt cgatgaacat 540
ttatttttag atcaggtcga tcagctattt tttgacaatt tgagaaagag aaggttttct 600
tataagatag ttgatgtaac aattgaacag agtttttctc aaagaggagc agcaattgga 660
gaaagctata taaatagatt tcgaattaga gtcaaagata taatgcaata tggaagatta 720
tctcctaata ataatatttt ttattcatat ctgaaaaata ttttactagg atttaacttt 780
ttcagaaaaa cctggaaatt gtcttatatt aagataggta taatttcaat ttggagctat 840
aagaaatga 849
<210> 166
<211> 282
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_GT2
<400> 166
Met Ile Phe Tyr Cys Val Val Leu Tyr Asn Lys Lys Ile Asp Glu Ala
1 5 10 15
Ile Thr Ile Lys Asn Leu Phe Glu Cys Asn Leu Asp Asn Arg Lys Ile
20 25 30
Val Val Phe Asp Asn Ser Asp Lys Leu Asp Phe Arg Glu Tyr Asn Ser
35 40 45
Lys Tyr Tyr Asn Glu Lys Ile Tyr Cys Leu Leu Gln Ser Ser Asp Lys
50 55 60
Asn Val Gly Leu Ser Ala Ala Tyr Asn Arg Ile Leu Glu Lys Leu Cys
65 70 75 80
Arg Glu Phe Asp Ile Glu Asn Glu Lys Gln Tyr Val Cys Trp Leu Asp
85 90 95
Asp Asp Thr Asp Ile Ser Pro Glu Phe Leu Ile Lys Gln Glu Lys Ala
100 105 110
Ile Asn Glu Asn Tyr Asp Ile Ile Val Pro Lys Ile Ile Gly Gln Asp
115 120 125
Gly Ile Val Tyr Ser Pro Asn Glu Ala Gly Lys Ile Lys Asn Asn Leu
130 135 140
Val Leu Asn Asn Ser Asn Lys Lys Ile Ser Lys Leu Lys Phe Asn Ala
145 150 155 160
Ile Asn Ser Cys Leu Thr Val Lys Thr Ser Ile Tyr Ser Asn Phe Lys
165 170 175
Phe Asp Glu His Leu Phe Leu Asp Gln Val Asp Gln Leu Phe Phe Asp
180 185 190
Asn Leu Arg Lys Arg Arg Phe Ser Tyr Lys Ile Val Asp Val Thr Ile
195 200 205
Glu Gln Ser Phe Ser Gln Arg Gly Ala Ala Ile Gly Glu Ser Tyr Ile
210 215 220
Asn Arg Phe Arg Ile Arg Val Lys Asp Ile Met Gln Tyr Gly Arg Leu
225 230 235 240
Ser Pro Asn Asn Asn Ile Phe Tyr Ser Tyr Leu Lys Asn Ile Leu Leu
245 250 255
Gly Phe Asn Phe Phe Arg Lys Thr Trp Lys Leu Ser Tyr Ile Lys Ile
260 265 270
Gly Ile Ile Ser Ile Trp Ser Tyr Lys Lys
275 280
<210> 167
<211> 1113
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative GT3 protein
<400> 167
atgaaaatat tattttgtca aacccaattt aaaatggggg gacaacaaaa agtactgctt 60
tctattgcta aagaattaaa taagaagcat gaagttacag tctattatga aaatcataat 120
ttttttgatt ttgaagattt aaatattata aaagcaaaga ggagttttca ggttcttaat 180
ttctttttgg caatcttgat atgcagcttc accttaaaat ttgataaaaa aacaattatc 240
gacacatggc atctatacaa cgctagggat tcattgcgta aagaaaagta tgatattgta 300
gtcttattaa atccttatgt cttgtttgta gatgaattta gaaaatttat agacactcaa 360
aaaattattt gttggacaca taatttattt gaagattata tgtttaatag atttaaaaca 420
gaacaaagta aattgaagct aagtatgtct catgcagata aaattatttc actagaaaag 480
tataccgcat caaaatggcg tgaaataaat aaaaacactg tagttattca taatcccttg 540
acaattaaaa atgaatctgg atacaacaaa aaaaattcaa aaaaaatagg aatggttact 600
aggattgata ttaatcaaaa aggtttggat attctagtta aaatagccaa attgttgaac 660
ccttctacac aggtttttat tgctggttca ggtacaaaga gtgaagaaat aaaattttct 720
aatttattaa ttgaaaataa attagaaaaa cagataatct tactaggaag cttaaaaggc 780
gaagagttag tggaatttta tagtagtttg gccttattgt tggtaacaag caggaatgaa 840
gggtttcctt tagtagtagc agaagcaatg agttttggaa ctcacattat aggttttgat 900
attccttcaa tgcgagaagt tacagcggga ggacaatttg ggacattaat tccttttgac 960
aatactgaac tatttgctaa aaatattgag gatttacaaa ataggttctt gtccaaagat 1020
tttgatttaa agtctgaaga actagtaaga tatgcaggga aattgagggt gaataaaata 1080
ataaaagaat gggaagaagc tcttactaaa taa 1113
<210> 168
<211> 370
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_GT3
<400> 168
Met Lys Ile Leu Phe Cys Gln Thr Gln Phe Lys Met Gly Gly Gln Gln
1 5 10 15
Lys Val Leu Leu Ser Ile Ala Lys Glu Leu Asn Lys Lys His Glu Val
20 25 30
Thr Val Tyr Tyr Glu Asn His Asn Phe Phe Asp Phe Glu Asp Leu Asn
35 40 45
Ile Ile Lys Ala Lys Arg Ser Phe Gln Val Leu Asn Phe Phe Leu Ala
50 55 60
Ile Leu Ile Cys Ser Phe Thr Leu Lys Phe Asp Lys Lys Thr Ile Ile
65 70 75 80
Asp Thr Trp His Leu Tyr Asn Ala Arg Asp Ser Leu Arg Lys Glu Lys
85 90 95
Tyr Asp Ile Val Val Leu Leu Asn Pro Tyr Val Leu Phe Val Asp Glu
100 105 110
Phe Arg Lys Phe Ile Asp Thr Gln Lys Ile Ile Cys Trp Thr His Asn
115 120 125
Leu Phe Glu Asp Tyr Met Phe Asn Arg Phe Lys Thr Glu Gln Ser Lys
130 135 140
Leu Lys Leu Ser Met Ser His Ala Asp Lys Ile Ile Ser Leu Glu Lys
145 150 155 160
Tyr Thr Ala Ser Lys Trp Arg Glu Ile Asn Lys Asn Thr Val Val Ile
165 170 175
His Asn Pro Leu Thr Ile Lys Asn Glu Ser Gly Tyr Asn Lys Lys Asn
180 185 190
Ser Lys Lys Ile Gly Met Val Thr Arg Ile Asp Ile Asn Gln Lys Gly
195 200 205
Leu Asp Ile Leu Val Lys Ile Ala Lys Leu Leu Asn Pro Ser Thr Gln
210 215 220
Val Phe Ile Ala Gly Ser Gly Thr Lys Ser Glu Glu Ile Lys Phe Ser
225 230 235 240
Asn Leu Leu Ile Glu Asn Lys Leu Glu Lys Gln Ile Ile Leu Leu Gly
245 250 255
Ser Leu Lys Gly Glu Glu Leu Val Glu Phe Tyr Ser Ser Leu Ala Leu
260 265 270
Leu Leu Val Thr Ser Arg Asn Glu Gly Phe Pro Leu Val Val Ala Glu
275 280 285
Ala Met Ser Phe Gly Thr His Ile Ile Gly Phe Asp Ile Pro Ser Met
290 295 300
Arg Glu Val Thr Ala Gly Gly Gln Phe Gly Thr Leu Ile Pro Phe Asp
305 310 315 320
Asn Thr Glu Leu Phe Ala Lys Asn Ile Glu Asp Leu Gln Asn Arg Phe
325 330 335
Leu Ser Lys Asp Phe Asp Leu Lys Ser Glu Glu Leu Val Arg Tyr Ala
340 345 350
Gly Lys Leu Arg Val Asn Lys Ile Ile Lys Glu Trp Glu Glu Ala Leu
355 360 365
Thr Lys
370
<210> 169
<211> 1416
<212> DNA
<213> Lactococcus lactis subsp. lactis
<220>
<223> ORF of the putative wzx gene of DSM 33137
<400> 169
atgaataaat acaaaaaact actatccaac tcactcgttt tcacaatagg aaatttgggt 60
agcaaactgt tagtcttttt actcgtacca ctctacactt atgcgatgac accgcaagag 120
tatggtatgg cagacttgta tcaaacaaca gccaatctac ttttgccact aattacaatg 180
aatgtatttg atgcaacttt acgttttgcc atggaaaagt caatgacaaa agagagagtg 240
ttaacaaatt ctcttgtagt atggtgtttt agcgctgtgt tctcttgttt gggcgctttt 300
attatctatg cgttgaactt gagtaataaa tggtatttat ctttactttt aaccatcatc 360
ttattccaag gtgggcaaag catactaagt caatatgcga gaggcattgg aaaatcgaaa 420
ttatttgcag ctggtggagt tattttaacc tttttgacag gcgctttaaa tattcttttt 480
ttggtatatt taccgcttgg gattacgggc tatttaatgt ccctggtttt agcgaatgta 540
ggtacgattc tattttttgc tggcacactt tccatttgga aggaaattag ttttaaagta 600
attgataaaa aactgatttg gcaaatgctc tattatgcct tacctttgat tcctaatgcc 660
atcatgtggt ggttactgaa cgcatctaat cgctatttcg ttttattctt tttaggagca 720
ggtgctaatg gtcttttggc ggtcgctacc aaaattccaa gtattatttc catttttaat 780
acgattttta cacaggcgtg gcaaatttca gccatagaag aatatgattc tcatcaaaaa 840
tcaaaatatt attcggatgt ttttcactac ttagcaactt ttctattgtt agggacatca 900
gcttttatga ttgtgcttaa accaattgtc gaaaaagtcg tttcaagtga ctatgcaagt 960
tcatggcaat atgttccttt ctttatgctg gcgatgctat tttcctcgtt ttctggattt 1020
tttgggacta attatattgc ggctaaacaa acaaaaggcg tatttatgac atctatctat 1080
ggtgccattg tttgtgtctt attccaagtg gttctgctac ccaccatcgg cttgaatggc 1140
gcaggtttat cagccttgct tggattttta acaacgtttt tattgcgtgt caaagatacg 1200
caaaaatttg tggcgattca gattaagtgg cggattttta tcagtaattt attgatcgtt 1260
ttggcgcaaa ttttatgttt gttttatcta ccgagtgaat ttttgtattt tgggcttgcc 1320
ctattatttt gtggcatgtt agtggttaat cagcgtacaa ttttatacat tatcatggtg 1380
ctaaaaaata agacatttgg aatgaaatcc tcataa 1416
<210> 170
<211> 471
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_wzx
<400> 170
Met Asn Lys Tyr Lys Lys Leu Leu Ser Asn Ser Leu Val Phe Thr Ile
1 5 10 15
Gly Asn Leu Gly Ser Lys Leu Leu Val Phe Leu Leu Val Pro Leu Tyr
20 25 30
Thr Tyr Ala Met Thr Pro Gln Glu Tyr Gly Met Ala Asp Leu Tyr Gln
35 40 45
Thr Thr Ala Asn Leu Leu Leu Pro Leu Ile Thr Met Asn Val Phe Asp
50 55 60
Ala Thr Leu Arg Phe Ala Met Glu Lys Ser Met Thr Lys Glu Arg Val
65 70 75 80
Leu Thr Asn Ser Leu Val Val Trp Cys Phe Ser Ala Val Phe Ser Cys
85 90 95
Leu Gly Ala Phe Ile Ile Tyr Ala Leu Asn Leu Ser Asn Lys Trp Tyr
100 105 110
Leu Ser Leu Leu Leu Thr Ile Ile Leu Phe Gln Gly Gly Gln Ser Ile
115 120 125
Leu Ser Gln Tyr Ala Arg Gly Ile Gly Lys Ser Lys Leu Phe Ala Ala
130 135 140
Gly Gly Val Ile Leu Thr Phe Leu Thr Gly Ala Leu Asn Ile Leu Phe
145 150 155 160
Leu Val Tyr Leu Pro Leu Gly Ile Thr Gly Tyr Leu Met Ser Leu Val
165 170 175
Leu Ala Asn Val Gly Thr Ile Leu Phe Phe Ala Gly Thr Leu Ser Ile
180 185 190
Trp Lys Glu Ile Ser Phe Lys Val Ile Asp Lys Lys Leu Ile Trp Gln
195 200 205
Met Leu Tyr Tyr Ala Leu Pro Leu Ile Pro Asn Ala Ile Met Trp Trp
210 215 220
Leu Leu Asn Ala Ser Asn Arg Tyr Phe Val Leu Phe Phe Leu Gly Ala
225 230 235 240
Gly Ala Asn Gly Leu Leu Ala Val Ala Thr Lys Ile Pro Ser Ile Ile
245 250 255
Ser Ile Phe Asn Thr Ile Phe Thr Gln Ala Trp Gln Ile Ser Ala Ile
260 265 270
Glu Glu Tyr Asp Ser His Gln Lys Ser Lys Tyr Tyr Ser Asp Val Phe
275 280 285
His Tyr Leu Ala Thr Phe Leu Leu Leu Gly Thr Ser Ala Phe Met Ile
290 295 300
Val Leu Lys Pro Ile Val Glu Lys Val Val Ser Ser Asp Tyr Ala Ser
305 310 315 320
Ser Trp Gln Tyr Val Pro Phe Phe Met Leu Ala Met Leu Phe Ser Ser
325 330 335
Phe Ser Gly Phe Phe Gly Thr Asn Tyr Ile Ala Ala Lys Gln Thr Lys
340 345 350
Gly Val Phe Met Thr Ser Ile Tyr Gly Ala Ile Val Cys Val Leu Phe
355 360 365
Gln Val Val Leu Leu Pro Thr Ile Gly Leu Asn Gly Ala Gly Leu Ser
370 375 380
Ala Leu Leu Gly Phe Leu Thr Thr Phe Leu Leu Arg Val Lys Asp Thr
385 390 395 400
Gln Lys Phe Val Ala Ile Gln Ile Lys Trp Arg Ile Phe Ile Ser Asn
405 410 415
Leu Leu Ile Val Leu Ala Gln Ile Leu Cys Leu Phe Tyr Leu Pro Ser
420 425 430
Glu Phe Leu Tyr Phe Gly Leu Ala Leu Leu Phe Cys Gly Met Leu Val
435 440 445
Val Asn Gln Arg Thr Ile Leu Tyr Ile Ile Met Val Leu Lys Asn Lys
450 455 460
Thr Phe Gly Met Lys Ser Ser
465 470
<210> 171
<211> 915
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsL gene of DSM 33137
<400> 171
ttggaggaaa aattggaacg aaaaaaaaag aaaaaaaaga atatttgggt tataattata 60
cctatcttaa tttttattac ccttatagga gcaggggctt atgccttaag aaattcactt 120
attcctactg atcatacgaa aacaaatagt tcggatcaac cgaccaaaac ttcggcctct 180
aacggttatg tagaacaaaa aggtgaagaa gctgctgtgg gtagtatagc acttgtagat 240
gatgctggta ctccagaatg gatcaaagtt ccctcaaagg taaatctaga taaatttact 300
gatttatcta cgaataatat cactatttat cgaattaaca atccggaagt cttaaaaaca 360
gttaccaatc gtacggatca acggatgaaa atgtcagaag ttatagctaa gtatcctaat 420
tctttgatta tgaatgcttc cgcctttaat atgcagacag gtcaagtgac cggttttcaa 480
attaataatg gaaaattaat tcaagactgg agtccaggta caacggttca atatgccttt 540
gttattaaca aagatggttc atgcaaaatt tatgattcaa gtacaccagc tgtaaccatt 600
attcaaaatg gggggcagca gtcttatgat tttggtactg cgattatccg tgatggtaaa 660
attcaaccaa gtgatggctc tgtagattgg aagattcata tttttattgc gaatgataaa 720
gataataatc tctatgctat tttgagtgat acaaatgcag gttatgataa tataatgaaa 780
tcagtatcaa atttgaaact ccaaaatatg ttattacttg atagtggtgg ctcaagtcaa 840
ctatctgtca atggtaaaac gattgttgct agtcaagatg atcgagccgt accggattat 900
attgtgatga aataa 915
<210> 172
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsL
<400> 172
Leu Glu Glu Lys Leu Glu Arg Lys Lys Lys Lys Lys Lys Asn Ile Trp
1 5 10 15
Val Ile Ile Ile Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly
20 25 30
Ala Tyr Ala Leu Arg Asn Ser Leu Ile Pro Thr Asp His Thr Lys Thr
35 40 45
Asn Ser Ser Asp Gln Pro Thr Lys Thr Ser Ala Ser Asn Gly Tyr Val
50 55 60
Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp
65 70 75 80
Asp Ala Gly Thr Pro Glu Trp Ile Lys Val Pro Ser Lys Val Asn Leu
85 90 95
Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile
100 105 110
Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg
115 120 125
Met Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Ser Leu Ile Met
130 135 140
Asn Ala Ser Ala Phe Asn Met Gln Thr Gly Gln Val Thr Gly Phe Gln
145 150 155 160
Ile Asn Asn Gly Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Val
165 170 175
Gln Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp
180 185 190
Ser Ser Thr Pro Ala Val Thr Ile Ile Gln Asn Gly Gly Gln Gln Ser
195 200 205
Tyr Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser
210 215 220
Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys
225 230 235 240
Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp
245 250 255
Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu
260 265 270
Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile
275 280 285
Val Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 173
<211> 903
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative LytR family transcriptional regulator protein
<400> 173
atgaatcaaa aaaagaggcg tcattatcgt aagaaaaaac acacagtact aaaagttatt 60
tcaattattt ttgtattagt aattatcgct gttgcttcta tagcctacgc cgcttataga 120
aatgttgaat caacattttc aacatcatat gaaaatttcc ctaaaacaac aagtatcgac 180
ttaaaaaagt ctaaaacatt caccacactt atcattgcaa ctggtaaaaa taattctaaa 240
aatacagctt atgctactgt tttagcttca acgaatgtaa agacaaatca aactactttc 300
atgaacttcc cggtttttgc aacaatgcct aatcaaaaaa caatcactga agtttacaat 360
acgaatggag atgatggaat tttccagatg gttaaagacc tattgaattt gtccattaac 420
aaagtaattc agatcgatgt taataaaatg ggatcacttg tacaggccac tggtggaatc 480
accatgcaaa atccaaaggc attcaatgct gaaggttatg agtttaaaca aggaactgtt 540
aatttacaaa ctgctgatca agtccaagcc tatatgacac aaattgacga tactgatttg 600
gatgcttcaa tcactcggat tcaaaatgtc tcaatggaac tctacggaaa tattcaaaaa 660
attgctcata tgaaaaaact tgaaagtttc aattactatc gagaaattct ctatgctttt 720
tcaaacactg ttaaaaccaa tataagtttc aatgatgcta aaacgatcgt tatgagctac 780
aataaggctc taaagaatac cagcaagctc aatctacata caacagatga aaatggagct 840
aaggtagttt ctcaaacaga attagactca gtcaaaaccc tttttgaaaa atctctaaaa 900
taa 903
<210> 174
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_lytR
<400> 174
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Leu Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 175
<211> 951
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative core-2/I-branching protein
<400> 175
atgaaggata gaaagaaaca agcaattttg atactagctc acagaaatac tctcgctcta 60
aaatcaacaa tagagctttt ggattcccaa tactttgatt tctttcttca tatagataaa 120
aaaagtagaa ttcaagattt ttttgattta aaaaaaatta caaaatcctc cactattcat 180
ttttcagaaa gaaaaaatgt acattgggga ggtttttcta tggtagaagc aatgtttgcg 240
ctattagaat gtgcacgtga tacaggagaa tactcttatt ttcatttttt atcaggaaat 300
gatatgccaa tcaaagataa tgaaatagta tttaattttt ttgaaaatag ctatcctcaa 360
aattttattg atattctaga ttttgaaaat gtcaataaaa cttcatattt ctacgaaacc 420
tctgagatga tagaggagag agtgaagtac tactatcctc atatggatat tctaaacaga 480
aaaggaaaaa ttttcatagg gaaaaaacta atttatctac aaaaattgtt gaaagttgat 540
cgcttgaaaa atagagagat agaaattttc aagggtcatc aatggtgtag tttgacaaat 600
caatttgtag atattttatt ggataaagag gaaagaagag taggtaagtc ttatttttca 660
tctagtttaa taccagacga atgttatttt caaacgtatg ctatgataaa aaaagttgaa 720
atttatcaac agaaaaatat gtcagcacgc ttaattgatt ggacgagagg taagccatat 780
atttggcgac aggatgattt ttttgaaatt atgaatgata aagattcaat gttttctagg 840
aagtttgatg aaaatgtaga tcgtaaaata attgaagaaa tttatataaa aataagagga 900
agaagtactg atgaagcaaa taaaatcaaa gataagagat ttacaaaata a 951
<210> 176
<211> 316
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_core-2/I-branching protein
<400> 176
Met Lys Asp Arg Lys Lys Gln Ala Ile Leu Ile Leu Ala His Arg Asn
1 5 10 15
Thr Leu Ala Leu Lys Ser Thr Ile Glu Leu Leu Asp Ser Gln Tyr Phe
20 25 30
Asp Phe Phe Leu His Ile Asp Lys Lys Ser Arg Ile Gln Asp Phe Phe
35 40 45
Asp Leu Lys Lys Ile Thr Lys Ser Ser Thr Ile His Phe Ser Glu Arg
50 55 60
Lys Asn Val His Trp Gly Gly Phe Ser Met Val Glu Ala Met Phe Ala
65 70 75 80
Leu Leu Glu Cys Ala Arg Asp Thr Gly Glu Tyr Ser Tyr Phe His Phe
85 90 95
Leu Ser Gly Asn Asp Met Pro Ile Lys Asp Asn Glu Ile Val Phe Asn
100 105 110
Phe Phe Glu Asn Ser Tyr Pro Gln Asn Phe Ile Asp Ile Leu Asp Phe
115 120 125
Glu Asn Val Asn Lys Thr Ser Tyr Phe Tyr Glu Thr Ser Glu Met Ile
130 135 140
Glu Glu Arg Val Lys Tyr Tyr Tyr Pro His Met Asp Ile Leu Asn Arg
145 150 155 160
Lys Gly Lys Ile Phe Ile Gly Lys Lys Leu Ile Tyr Leu Gln Lys Leu
165 170 175
Leu Lys Val Asp Arg Leu Lys Asn Arg Glu Ile Glu Ile Phe Lys Gly
180 185 190
His Gln Trp Cys Ser Leu Thr Asn Gln Phe Val Asp Ile Leu Leu Asp
195 200 205
Lys Glu Glu Arg Arg Val Gly Lys Ser Tyr Phe Ser Ser Ser Leu Ile
210 215 220
Pro Asp Glu Cys Tyr Phe Gln Thr Tyr Ala Met Ile Lys Lys Val Glu
225 230 235 240
Ile Tyr Gln Gln Lys Asn Met Ser Ala Arg Leu Ile Asp Trp Thr Arg
245 250 255
Gly Lys Pro Tyr Ile Trp Arg Gln Asp Asp Phe Phe Glu Ile Met Asn
260 265 270
Asp Lys Asp Ser Met Phe Ser Arg Lys Phe Asp Glu Asn Val Asp Arg
275 280 285
Lys Ile Ile Glu Glu Ile Tyr Ile Lys Ile Arg Gly Arg Ser Thr Asp
290 295 300
Glu Ala Asn Lys Ile Lys Asp Lys Arg Phe Thr Lys
305 310 315
<210> 177
<211> 780
<212> DNA
<213> 乳酸乳球菌
<220>
<223> DSM 33137的epsC基因的ORF
<400> 177
atgcaggaaa cacaggaaca aacgattgat ttaagaggga tttttaaaat tattcgcaaa 60
aggttaggtt taatattatt tagtgcttta atagtcacaa tattagggag catctacaca 120
ttttttatag cctccccagt ttacacagcc tcaactcaac ttgtcgttaa actaccaaat 180
tcggataatt cagcagccta cgctggacaa gtgaccggga atattcaaat ggcgaacaca 240
attaaccaag ttattgttag tccagtcatt ttagataaag ttcaaagtaa tttaaatcta 300
tctgatgatt ctttccaaaa acaagttaca gcagcaaatc aaacaaattc acaagttatt 360
acgcttactg ttaaatattc taatccttac attgcacaaa agattgcaga cgagactgct 420
aaaatattta gttcagacgc agcgaaacta ttgaatgtta ctaacgttaa tattctatcc 480
aaagcaaaag ctcaaacaac acccattagt cctaaaccta aattgtattt agcaatatct 540
gttatagccg gattagtttt aggtttagcc attgctttat tgaaggaatt gtttgataac 600
aaaattaata aagaagaaga tattgaagct ctgggactca cggttcttgg tgtaacaacc 660
tatgctcaaa tgagtgattt taataagaat acgaataaaa atggcacgca atcgggaact 720
aagtcaagtc cgcctagcga ccatgaagta aatagatcat caaaaaggaa taaaagatag 780
<210> 178
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsC
<400> 178
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Ile Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 179
<211> 687
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of the epsE gene of DSM 33137
<400> 179
atggaagttt ttgagggtga atcatcacct gaatcggaag aacacaaatt agtagtatta 60
aaaaaatttt cttatggaga gctgattata aaaagagcaa ttgatatcct aggaggatta 120
gcgggttcag ttttatttct tatcgcggct gcattgcttt atgtccctta caaaatgagc 180
tcggaaaaag atcaagggcc aatgttctat aaacaaaaac ggtatggaaa aaacggtaaa 240
attttttata ttttgaaatt tagaacaatg ataattaatg ctgatcagta tttagagcta 300
catccagaag ttaaagctgc ctatcacgcc aatggcaata aactagaaag tgatccccgt 360
gtaacgaaga ttggttcatt tattagacaa cactcaattg atgaattacc acaatttatc 420
aatgtcctta aaggggatat ggcattagtt ggtccaagac caattttact ttttgaagcg 480
aaagaatatg gggagcgcct cccttattta ctgatatgta aacctgggat tactggttat 540
tggacaacac atggtagaag caaagttctt tttcctcaac gagcagattt agaactctat 600
tatctccaat atcacagcac caagaatgac atcaagcttc ttatgcttac aattgcacaa 660
agtattcacg gatcggacgc ttactaa 687
<210> 180
<211> 228
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_epsE
<400> 180
Met Glu Val Phe Glu Gly Glu Ser Ser Pro Glu Ser Glu Glu His Lys
1 5 10 15
Leu Val Val Leu Lys Lys Phe Ser Tyr Gly Glu Leu Ile Ile Lys Arg
20 25 30
Ala Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Ile Asn Ala Asp Gln
85 90 95
Tyr Leu Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Ser Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Glu Arg Leu Pro Tyr Leu Leu Ile Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Leu Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Met Leu Thr Ile Ala Gln Ser Ile His Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 181
<211> 1044
<212> DNA
<213> Lactococcus lactis
<220>
<223> ORF of DSM 33137 encoding the putative GT4 protein
<400> 181
atgaagcaaa taaaatcaaa gataagagat ttacaaaata attttaccta tgtttttggg 60
aagaaaactt ttcttggaag gggagaagcg attatcatag atgaacctga gcatggaaat 120
ttgggagatc aagcaattgc ttttgcagaa aatcaatttt tagtaaatca tgtatcagta 180
cgagatgtag aacatcttat agaaagcaaa actatttcag aaataaaatc tatgaaaaaa 240
aatattggaa aaaaagaatt agtttttttt catgggggag gaaatttcgg gacactttat 300
ctaaagtatg agcgcattag aagagtggca gtatcaaagc ttccctttaa taaaatgatt 360
ctatttcctc agtcaatttc atttgaagat agtaggtttg gtcagaagca gctgaataaa 420
agtaaaaaaa tatacagtca aaatacaaat tttattttga ctgcaagaga accaaaatct 480
tatggtttaa tgaagaaatg ttttccagat aacaaagtaa tcttgacacc ggatatcgtg 540
ctctcattaa atttaacaga acagtataga ggaaataata ggaatggtat cataacaatg 600
ctcagggaag atatcgaaca aaagcttaat aaaactcaat ttgaaaaaat tatcaaagag 660
ctgacagata aatttgaagt caccatttct gatacgcata ttgggaaaga aaaggatagt 720
ggtataactt atgaaaatcg tcaacactat cttgagataa agtgggatga aattgcgcag 780
catgaggtcg tcttaactga tagattacat ggtatgattt tttcatatat cacaggcaca 840
ccatgtgttg ttttggctaa taataatcat aaaattgaag aaacatacaa acattggttg 900
aatgaagtga actatattcg ttttattgaa aatccgactg ttgaaaatat tttagatgca 960
atcagtgact taaagcaaat cgaacctcac tatattgatt tatctgataa atttcaacca 1020
ctaattgatg cgataaaagg gtaa 1044
<210> 182
<211> 347
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33137_GT4
<400> 182
Met Lys Gln Ile Lys Ser Lys Ile Arg Asp Leu Gln Asn Asn Phe Thr
1 5 10 15
Tyr Val Phe Gly Lys Lys Thr Phe Leu Gly Arg Gly Glu Ala Ile Ile
20 25 30
Ile Asp Glu Pro Glu His Gly Asn Leu Gly Asp Gln Ala Ile Ala Phe
35 40 45
Ala Glu Asn Gln Phe Leu Val Asn His Val Ser Val Arg Asp Val Glu
50 55 60
His Leu Ile Glu Ser Lys Thr Ile Ser Glu Ile Lys Ser Met Lys Lys
65 70 75 80
Asn Ile Gly Lys Lys Glu Leu Val Phe Phe His Gly Gly Gly Asn Phe
85 90 95
Gly Thr Leu Tyr Leu Lys Tyr Glu Arg Ile Arg Arg Val Ala Val Ser
100 105 110
Lys Leu Pro Phe Asn Lys Met Ile Leu Phe Pro Gln Ser Ile Ser Phe
115 120 125
Glu Asp Ser Arg Phe Gly Gln Lys Gln Leu Asn Lys Ser Lys Lys Ile
130 135 140
Tyr Ser Gln Asn Thr Asn Phe Ile Leu Thr Ala Arg Glu Pro Lys Ser
145 150 155 160
Tyr Gly Leu Met Lys Lys Cys Phe Pro Asp Asn Lys Val Ile Leu Thr
165 170 175
Pro Asp Ile Val Leu Ser Leu Asn Leu Thr Glu Gln Tyr Arg Gly Asn
180 185 190
Asn Arg Asn Gly Ile Ile Thr Met Leu Arg Glu Asp Ile Glu Gln Lys
195 200 205
Leu Asn Lys Thr Gln Phe Glu Lys Ile Ile Lys Glu Leu Thr Asp Lys
210 215 220
Phe Glu Val Thr Ile Ser Asp Thr His Ile Gly Lys Glu Lys Asp Ser
225 230 235 240
Gly Ile Thr Tyr Glu Asn Arg Gln His Tyr Leu Glu Ile Lys Trp Asp
245 250 255
Glu Ile Ala Gln His Glu Val Val Leu Thr Asp Arg Leu His Gly Met
260 265 270
Ile Phe Ser Tyr Ile Thr Gly Thr Pro Cys Val Val Leu Ala Asn Asn
275 280 285
Asn His Lys Ile Glu Glu Thr Tyr Lys His Trp Leu Asn Glu Val Asn
290 295 300
Tyr Ile Arg Phe Ile Glu Asn Pro Thr Val Glu Asn Ile Leu Asp Ala
305 310 315 320
Ile Ser Asp Leu Lys Gln Ile Glu Pro His Tyr Ile Asp Leu Ser Asp
325 330 335
Lys Phe Gln Pro Leu Ile Asp Ala Ile Lys Gly
340 345
<210> 183
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> eps gene cluster of Lactococcus lactis strains DSM 33192 and DSM 25485, complete sequence
<400> 183
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctggag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttagataa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttatgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacaac cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatattcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagga attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggag gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
<210> 184
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the epsR gene of DSM 33192
<400> 184
Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ser Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 185
<211> 139
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the EpsX gene of DSM 33192
<400> 185
Met Glu Val Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu Ser Lys
1 5 10 15
Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys Asn Gln
20 25 30
Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys Ser Val
35 40 45
Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val Thr Leu
50 55 60
Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe Gln Gln
65 70 75 80
Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys Asn Ile
85 90 95
Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys Val Gln
100 105 110
Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu Ala Pro
115 120 125
Leu Phe Asn Asp Asn Gln Tyr Arg Leu Leu Gly
130 135
<210> 186
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the EpsC gene of DSM 33192
<400> 186
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Glu Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Met Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Ile Ala Lys Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Ser Tyr Asp Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 187
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the EpsD gene of DSM 33192
<400> 187
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn His Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Thr Asp Glu Gly Lys Thr Thr Val Ser Ala Asn
50 55 60
Ile Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Phe Phe Asp Val Val Leu Ile
145 150 155 160
Asp Ile Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ser
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Lys Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Asp Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 188
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the EpsB gene of DSM 33192
<400> 188
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Lys Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Lys Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Arg Met Leu
210 215 220
Gln Asn Asn Ala Asp Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Glu Pro Ile Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 189
<211> 226
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the EpsE gene of DSM 33192
<400> 189
Met Glu Val Phe Glu Ala Ser Ser Glu Leu Glu Glu Pro Lys Leu Val
1 5 10 15
Glu Leu Lys Lys Phe Ser Arg Arg Glu Ile Ile Ile Lys Arg Gly Ile
20 25 30
Asp Ile Leu Gly Gly Leu Ala Gly Ser Gly Leu Phe Leu Ile Ala Ala
35 40 45
Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Lys Lys Asp Gln Gly
50 55 60
Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys Ile Phe
65 70 75 80
Tyr Ile Leu Lys Phe Arg Thr Met Ile Ile Asn Ala Glu Gln Tyr Leu
85 90 95
Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly Asn Lys
100 105 110
Leu Glu Ser Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile Arg Gln
115 120 125
His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys Gly Asp
130 135 140
Met Ser Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala Lys Glu
145 150 155 160
Tyr Gly Glu Arg Leu Ser Tyr Leu Leu Ile Cys Lys Pro Gly Ile Thr
165 170 175
Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Leu Phe Pro Gln Arg
180 185 190
Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys Asn Asp
195 200 205
Ile Lys Leu Ile Met Leu Thr Ile Lys Gln Ile Leu His Gly Ser Asp
210 215 220
Ala Tyr
225
<210> 190
<211> 156
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the putative glycosyltransferase GT1 of DSM 33192
<400> 190
Met Lys Lys Lys Thr Thr Lys Ile Cys Met Ile Ser Ser Ser Gly Gly
1 5 10 15
His Leu Lys Glu Leu Asn Glu Leu Ile Glu Ile Ser Glu Gln Tyr Glu
20 25 30
Thr Phe Gln Ile Thr Glu Lys Asp Lys Phe Ser Asn Ile Lys Ile Gly
35 40 45
Thr Arg Gln Tyr Tyr Val Asn Lys Ile Asp Arg Asp Glu Lys Asn Phe
50 55 60
Leu Phe His Phe Phe Ile Leu Phe Leu Lys Ile Phe Gln Ile Phe Ala
65 70 75 80
Val Glu Lys Pro Lys Val Ile Val Thr Thr Gly Ala Leu Val Ala Tyr
85 90 95
Pro Ala Cys Leu Ile Gly Lys Leu Met Arg Ala Lys Val Ile Phe Ile
100 105 110
Glu Ser Tyr Ala Arg Thr Glu Thr Leu Ser Leu Thr Gly Lys Leu Val
115 120 125
Tyr Arg Leu Ser Asp Leu Phe Ile Val Gln Trp Pro Asp Leu Ser Lys
130 135 140
Lys Tyr Ser Lys Ala Lys Tyr Tyr Gly Glu Leu Phe
145 150 155
<210> 191
<211> 160
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the putative glycosyltransferase (GT2) of DSM 33192
<400> 191
Met Ile Leu Ile Ile Leu Gly Thr Gln Lys Phe Gln Phe Asn Arg Leu
1 5 10 15
Ile Lys Lys Val Asp Lys Leu Ile Glu Asp Asp Gln Ile Lys Asp Ser
20 25 30
Val Ile Ala Gln Ile Gly Tyr Ser Asn Tyr Lys Pro Ile Asn Tyr Lys
35 40 45
Phe Ser Asp Phe Phe Asp Gln Ser Glu Phe Asp Ser Leu Ile Asn Lys
50 55 60
Ser Asp Ile Ile Ile Thr His Gly Gly Val Gly Gly Ile Val Ser Ser
65 70 75 80
Leu Lys Lys Asn Lys Lys Ile Ile Val Val Pro Arg Leu Lys Lys Tyr
85 90 95
Arg Glu His Ile Asp Asp His Gln Leu Glu Ile Ala Arg Ala Phe Gln
100 105 110
Arg Lys Asn Leu Val Ile Leu Asn Glu Asn Leu Asn Glu Leu Cys Asn
115 120 125
Asp Ile Ser Lys Ile Glu Ser Phe Glu Pro Ile His Tyr Val Lys Asp
130 135 140
Asn Lys Lys Ile Ile Cys Glu Ile Lys Lys Phe Ile Ser Lys Val Lys
145 150 155 160
<210> 192
<211> 316
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the putative glycosyltransferase (GT3) of DSM 33192
<400> 192
Met Ile Lys Leu Ser Ile Ile Ile Pro Ile Tyr Asn Val Glu Lys Tyr
1 5 10 15
Leu Ser Lys Cys Leu Asn Ser Ile Leu Glu Gln Thr Tyr Lys Glu Ile
20 25 30
Glu Ile Ile Leu Val Asn Asp Gly Ser Thr Asp Asn Ser Lys Asp Ile
35 40 45
Ala Val Ser Tyr Cys Glu Arg Phe Pro Asn Val Phe Lys Tyr Phe Glu
50 55 60
Lys Asp Asn Gly Gly Leu Ser Ser Ala Arg Asn Phe Gly Leu Glu Lys
65 70 75 80
Ile Ser Gly Asp Phe Val Gly Phe Leu Asp Ser Asp Asp Tyr Ile Asp
85 90 95
Asn Asp Leu Tyr Glu Ile Met Ile Asn Ser Leu Asp Ser Ser Ile Lys
100 105 110
Ile Val Glu Cys Asp Phe Ile Trp Glu Tyr Glu Asn Gly Lys Ser Val
115 120 125
Leu Asp Lys Thr Ser Glu Tyr Asn Ser Ile Lys Asp Leu Met Val Asn
130 135 140
Gly Arg Val Val Ala Trp Asn Lys Ile Tyr Asn Val Glu Trp Leu Glu
145 150 155 160
Lys Ile Asn Ile Lys Phe Lys Glu Gly Leu Leu Tyr Glu Asp Leu Asn
165 170 175
Phe Phe Phe Lys Ile Val Pro His Leu Thr Ser Ile Ser Glu Val Ser
180 185 190
Thr Val Lys Asn Ser Phe Val His Tyr Val Gln His Lys Gly Thr Ile
195 200 205
Thr Ser Asp Asn Ser Leu Asn Ile Leu Asp Ile Ile Lys Ser Tyr Glu
210 215 220
Asp Val Phe His Tyr Tyr Asn Glu Lys Gln Ile Asn Asp Leu Tyr Phe
225 230 235 240
Asp Glu Leu Glu Tyr Lys Phe Ser Arg Asn Leu Met Gly Ala Phe Leu
245 250 255
Lys Arg Ala Ile Lys Ile Lys Asp Lys Arg Gln Arg Lys Ile Ile Leu
260 265 270
Asp Glu Phe Trp Asn Asn Val Leu Ser Tyr Tyr Pro Asn Trp Lys Lys
275 280 285
Asn Lys Tyr Ile Lys Lys Leu Ser Lys Gln Asn Ile Leu Leu Phe Phe
290 295 300
Ile Asn Lys Tyr Thr Tyr Lys Leu Phe Tyr Leu Leu
305 310 315
<210> 193
<211> 309
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the putative glycosyltransferase (GT4) of DSM 33192
<400> 193
Met Ile Tyr Val Glu Ile Arg Gly Asn Leu Gly Asn Gln Leu Phe Ile
1 5 10 15
Tyr Ala Thr Ala Lys Lys Ile Gln Lys Leu Thr Gly Gln Lys Ile Gln
20 25 30
Leu Asn Thr Thr Thr Leu Asn Lys Tyr Phe Pro Asn Tyr Lys Phe Gly
35 40 45
Leu Ser Glu Phe Ile Met Glu Asp Pro Asp Cys Phe Ile Glu Ser Tyr
50 55 60
Lys Lys Leu Pro Trp Phe Thr Asn Glu Tyr Leu Leu Pro Ile Lys Ile
65 70 75 80
Phe Lys Lys Ile Leu Asn Lys Thr Pro Lys Ile Asn Lys Ile Leu Ser
85 90 95
Asp Phe Phe Phe Lys Ala Phe Glu Lys Lys Gly Tyr Phe Ile Trp Arg
100 105 110
Gly Glu Thr Phe Lys Lys Phe Ser Leu Gly Asn His Lys Asn Tyr Tyr
115 120 125
Leu Ser Gly Phe Trp Gln Ser Glu Glu Tyr Phe Tyr Asp Ile Arg Asp
130 135 140
Glu Leu Leu Glu Ile Ile Thr Pro Ile Asn Ser Ile Arg Glu Cys Asn
145 150 155 160
Phe Glu Leu Leu Asn Leu Ile Arg Asn Ser Glu Ser Ile Cys Val Ser
165 170 175
Ile Arg Arg Gly Asp Tyr Val Asp Asn Pro Lys Ile Ser Ala Ile Tyr
180 185 190
Asn Val Cys Asp Ile Asn Tyr Phe Ile Glu Ser Val Asn Glu Ile Lys
195 200 205
Lys Asn Val Val Asn Val Lys Val Ile Cys Phe Ser Asp Asp Val Glu
210 215 220
Trp Val Lys Lys Asn Ile Lys Phe Asp Cys Glu Thr His Tyr Glu Thr
225 230 235 240
Tyr Gly Asn Ser Leu Ser Glu Lys Val Gln Leu Met Ser Ser Cys Lys
245 250 255
His Phe Val Leu Ser Asn Ser Ser Phe Ser Trp Trp Thr Glu Phe Leu
260 265 270
Ser Ile Arg Gly Gly Ile Thr Ile Ala Pro Lys Asn Trp Tyr Ala Asp
275 280 285
Glu Arg Glu Ala Asp Ile Tyr Arg Lys Asn Trp Ile Tyr Leu Glu Asp
290 295 300
Lys Thr Glu Glu Glu
305
<210> 194
<211> 396
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the wzy gene of DSM 33192
<400> 194
Met Gly Phe Leu Phe Leu Thr Ile Ile Leu Ile Leu Trp Gly Tyr Ser
1 5 10 15
Phe Thr Asn Ile Lys Ile Ser Pro Phe Ser Ile Leu Phe Met Ser Leu
20 25 30
Gly Ile Phe Tyr Ser Gln Phe Thr Ser Ile Asn Ile Asp Leu Ile Ile
35 40 45
Lys Val Leu Phe Leu Ile Thr Ser Ile Ile Tyr Leu Ile Lys Asp Lys
50 55 60
Tyr Ser Lys Lys Tyr Val Phe Ser Leu Leu Leu Ile Ala Val Leu Ile
65 70 75 80
Leu Ile Glu Ser Thr Ser Pro Ser Lys Phe Asn Gln Tyr Tyr Gly Phe
85 90 95
Ile Asp Ala Leu Thr Ser Phe Ala Thr Phe Ser Thr Gly Ile Leu Leu
100 105 110
Phe Ser Ile Lys Phe Ser Leu Gln Glu Arg Arg Ser Ile Leu Lys Ser
115 120 125
Ile Ser Tyr Leu Pro Ile Phe Ser Val Leu Ile Gly Ile Pro Leu Thr
130 135 140
Phe Gly Gly Phe Ile Ser Met Thr Ala Arg Gly Gly Ile Ala Leu Ser
145 150 155 160
Gly Ala Ala Leu Glu Thr Asn Leu Ser Phe Phe Ser Val Leu Ser Leu
165 170 175
Val Ser Leu Asp Ile Leu Tyr Gln Asp Thr Arg Ser Asn Lys Tyr Gln
180 185 190
Ile Leu Lys Ile Ile Asn Phe Ile Leu Leu Cys Cys Thr Leu Thr Arg
195 200 205
Gly Gly Ile Ile Ser Gly Ile Ile Ile Ile Leu Pro Ser Leu Leu Phe
210 215 220
Leu Leu Lys Lys Gly Phe Lys Gly Val Arg Gln Phe Ile Phe Leu Ile
225 230 235 240
Ile Thr Ile Phe Gly Ser Ile Tyr Pro Leu Ile Leu Leu Trp Lys Ser
245 250 255
Ile Ser Glu Arg Thr Phe Ser Ala Asp Gly Ile Asn Thr Ser Gly Arg
260 265 270
Tyr Thr Ala Trp Asp Tyr Ile Val Asn Leu Thr Thr Asn Lys Ser Gln
275 280 285
Gly Met Gly Leu Gly Ser Leu Lys Thr Leu Thr Glu Asp Ile Asn Leu
290 295 300
Arg Ala Phe Thr Ala Ala His Asn Thr Tyr Ile Gln Phe Tyr Tyr Glu
305 310 315 320
Thr Gly Tyr Leu Gly Val Thr Leu Leu Ser Ile Leu Phe Ile Leu Ile
325 330 335
Leu Ile Ile Ile Leu Lys Leu Thr Asn Tyr Arg Lys Lys Ile Ile Tyr
340 345 350
Leu Thr Phe Ile Ser Phe Leu Val Tyr Ser Tyr Thr Asp Asn Cys Ile
355 360 365
Val Asn Asn Arg Tyr Trp Tyr Leu Phe Met Phe Ile Ile Gly Cys Phe
370 375 380
Lys Tyr Phe Asp Arg Lys Glu Glu Asn Ala Leu Leu
385 390 395
<210> 195
<211> 396
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the glycerophosphotransferase family protein of DSM 33192
<400> 195
Met Arg Tyr Phe Lys Ile Leu Phe Glu Ile Ile Gln Leu Leu Val Ala
1 5 10 15
Ser Ile Leu Cys Arg Leu Tyr Lys Asn Pro Asn Asp Ile Trp Leu Ile
20 25 30
Asn Glu Lys Pro Asp Glu Ala Arg Asp Asn Gly Tyr Ala Phe Tyr Gln
35 40 45
Tyr Leu Arg Lys Asn Phe Pro Asp Ile Lys Val Tyr Tyr Val Ile Ser
50 55 60
Lys Glu Ser Thr Asp Ile Tyr Lys Phe Asp Asn Glu Thr Asn Ile Val
65 70 75 80
Phe Tyr Lys Ser Phe Leu His Phe Ile Leu Tyr Ile Lys Ser Lys Val
85 90 95
Leu Ile Ser Ser Gln Thr Leu Pro Tyr Pro Ser Ser Arg Lys Leu Cys
100 105 110
Glu Ala Leu Met Tyr Leu Asn Leu Asn Lys Pro Lys Arg Ile Trp Leu
115 120 125
Gln His Gly Val Thr Lys Asp Lys Leu Pro Tyr Glu Asn Met Ala Arg
130 135 140
Glu Ile Phe Lys Tyr Asp Leu Ile Thr Cys Val Ser Leu Lys Glu Ala
145 150 155 160
Asn Phe Ile Met Lys Glu Tyr Gly Tyr Asn Glu Asp Gln Val Lys Ala
165 170 175
Leu Gly Phe Ala Arg Tyr Asp Asn Leu Pro Ile Gly Asn Asn Asn Thr
180 185 190
Phe Asp Ile Leu Ile Met Pro Thr Phe Arg Lys Gly Tyr Glu Ile Lys
195 200 205
Asn Phe Ser Leu Pro Thr Asp Ser Glu Thr Lys His Phe Glu Glu Ser
210 215 220
Val Phe Phe Lys Thr Tyr Val Asp Leu Leu Asn Ser Glu Glu Leu Asp
225 230 235 240
Glu Tyr Leu Glu Lys Ser Gly Lys Lys Ala Ile Phe Tyr Leu His Tyr
245 250 255
Ala Phe Gln Pro Tyr Ala Lys Ser Phe Ser Lys Arg Leu Met Ser Ser
260 265 270
Asn Val Ile Ile Ala Glu Arg Thr Glu Tyr Asp Val Gln Lys Leu Leu
275 280 285
Ile Asn Cys Glu Leu Leu Ile Thr Asp Tyr Ser Ser Val Phe Phe Asp
290 295 300
Phe Ser Tyr Met Lys Lys Pro Glu Ile Phe Phe His Phe Asp Glu Lys
305 310 315 320
Glu Tyr Arg Ser Asn His Tyr Arg Glu Gly Tyr Phe Asp Tyr Lys Thr
325 330 335
Asp Gly Phe Gly Pro Val Val Asn Ser Lys Glu Glu Leu Leu Thr Glu
340 345 350
Ile Lys Glu Phe Ile Asp Asn Pro Ser Leu Leu Met Glu Phe Asn Lys
355 360 365
Arg Ala Asn Asn Phe Phe Lys Tyr Thr Asp Asn Asn Asn Cys Gln Arg
370 375 380
Ile Leu Lys Glu Ile Trp Arg Ile Asn Glu Thr Asn
385 390 395
<210> 196
<211> 472
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the wzx gene of DSM 33192
<400> 196
Met Lys Leu Ile Lys Asn Tyr Leu Met Thr Ser Ser Tyr Gln Leu Leu
1 5 10 15
Ile Ile Ile Leu Pro Ile Ile Thr Thr Pro Tyr Ile Ser Arg Val Leu
20 25 30
Ser Pro Glu Gly Ile Gly Leu Tyr Ser Tyr Thr Tyr Thr Ile Thr Gln
35 40 45
Tyr Phe Val Leu Phe Ala Thr Leu Gly Thr Val Thr Tyr Gly Ser Arg
50 55 60
Glu Ile Ala Tyr Tyr Gln Ser Asn Lys Gln Lys Arg Ser Glu Ile Phe
65 70 75 80
Trp Gly Ile Thr Phe Leu Ser Trp Ala Thr Gly Ala Ile Ser Leu Leu
85 90 95
Ile Phe Tyr Ile Phe Ile Phe Phe Asn Gly Lys Tyr Ser Val Leu Phe
100 105 110
Phe Trp Gln Ser Phe Leu Ile Phe Gly Val Ile Phe Asp Ile Asn Trp
115 120 125
Tyr Phe Thr Gly Met Glu Lys Phe Lys Val Ile Ile Ser Arg Asn Phe
130 135 140
Cys Ile Lys Ile Ile Ser Leu Leu Cys Ile Phe Val Phe Val Lys Ser
145 150 155 160
Glu Lys Asp Leu Ser Leu Tyr Ile Val Ile Leu Gly Leu Ser Asn Ile
165 170 175
Ile Gly Asn Ile Leu Val Trp Pro Tyr Leu Arg Lys Glu Val Tyr Lys
180 185 190
Pro Asn Phe Ser Lys Leu Ser Phe Lys Lys His Leu Gly Ser Thr Trp
195 200 205
Ile Phe Phe Leu Pro Gln Thr Ser Val Thr Leu Asn Ser Leu Ile Asn
210 215 220
Gln Asn Met Ile Ala Tyr Phe Asp Ser Ile Thr Ser Leu Gly Tyr Phe
225 230 235 240
Thr Gln Thr Asn Lys Phe Thr Val Ile Ala Ile Ser Ile Val Ile Ser
245 250 255
Ile Gly Thr Val Met Leu Pro Arg Met Ser Asn Leu Val Ala Arg Lys
260 265 270
Glu Tyr Ser Lys Phe Thr Asp Tyr Val Thr Lys Ser Ile Asn Ile Ser
275 280 285
Ser Gly Ile Ser Ile Ala Ile Met Phe Gly Leu Met Ala Ile Ala Pro
290 295 300
Lys Phe Thr Thr Phe Phe Leu Gly Ala Gln Tyr Lys Phe Val Ile His
305 310 315 320
Leu Leu Val Leu Ser Ser Pro Ile Val Val Leu Val Thr Trp Ser Asn
325 330 335
Val Leu Gly Gln Gln Tyr Leu Ile Pro Leu Asn Arg Met Lys Ile Phe
340 345 350
Thr Lys Ser Leu Ile Cys Gly Asn Leu Val Asn Val Ser Leu Asn Leu
355 360 365
Ile Leu Leu Pro Lys Met Gly Val Glu Ile Ser Ile Ile Asn Gln Leu
370 375 380
Ile Asn Glu Ile Ile Ile Val Gly Ile Gln Phe Ile Ser Val Arg Lys
385 390 395 400
Glu Leu Lys Ile Asn Ile Ile Leu Gly Asp Leu Ile Lys Tyr Phe Phe
405 410 415
Ala Gly Ile Ile Met Phe Ile Ala Val Leu Tyr Leu Asn Leu Gln Leu
420 425 430
Pro Met Thr Ile Phe Thr Leu Leu Ile Glu Ile Gly Ile Gly Val Leu
435 440 445
Ile Tyr Ser Met Leu Val Ile Ser Leu Lys Thr Gly Leu Tyr Lys Glu
450 455 460
Leu Lys Lys Ile Ile Lys Ile Arg
465 470
<210> 197
<211> 299
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence encoded by the epsL gene of DSM 33192
<400> 197
Met Glu Arg Lys Lys Lys Lys Lys Lys Ile Tyr Ile Ile Ile Leu Ile
1 5 10 15
Leu Leu Met Phe Ile Thr Ile Val Cys Phe Gly Gly Tyr Ala Thr Arg
20 25 30
Glu Leu Ile Thr Pro Thr Glu Lys Thr Ile Pro Asn Val Ser Asp Gln
35 40 45
Pro Lys Lys Thr Ser Ala Ser Asn Gly Tyr Val Glu Gln Lys Gly Glu
50 55 60
Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp Asp Ala Gly Val Pro
65 70 75 80
Glu Trp Val Lys Val Pro Ser Lys Val Asn Leu Asp Lys Phe Thr Asp
85 90 95
Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile Asn Asn Pro Glu Val
100 105 110
Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg Met Lys Met Ser Glu
115 120 125
Val Ile Ala Lys Tyr Pro Asn Ala Leu Ile Met Asn Ala Ser Ala Phe
130 135 140
Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln Ile Asn Asn Gly Lys
145 150 155 160
Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Thr Gln Tyr Ala Phe Val
165 170 175
Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp Ser Ser Thr Pro Ala
180 185 190
Leu Thr Ile Ile Lys Asn Gly Gly Gln Gln Ala Tyr Asp Phe Gly Thr
195 200 205
Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser Asp Gly Ser Val Asp
210 215 220
Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys Asp Asn Asn Leu Tyr
225 230 235 240
Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp Asn Ile Ile Lys Ser
245 250 255
Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu Leu Asp Ser Gly Gly
260 265 270
Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile Val Ala Ser Gln Asp
275 280 285
Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295
<210> 198
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> Amino acid sequence of the LytR protein of DSM 33192
<400> 198
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys Tyr Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ser Val Ala
20 25 30
Ser Ile Ala Tyr Val Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ala
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Ser Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Ala Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Lys Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Ser Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 199
<211> 27444
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33135 eps gene cluster, complete sequence
<400> 199
atgaatgatt tattttacca tcggctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttat 420
agttataatt ctaggataaa taatctttca aaagctgaca aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa attattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatcaaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcacg gactagtaat 840
gagcaactta taaaaaattt ggctagtaca ggagcagagg tgatagttca accctctcca 900
ccgatttatg gtggtgttgt ataccccgta caagaagaac aatttaaaca atctttatct 960
acaaaatatc cctatataga ctactgggct agttacccag acaaaaattc tgataaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgattcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaaaaaat gcaggaaaca caggaacaga cgattgattt aagagagatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattc ggatagttca gcagcctacg ctggacaagt gagcgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc ggatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acagattcac aagtcattac gcttactgtt aaatattcta atccttacat ggctcaaaag 1560
attgcagacg agactgctaa aatatttagt tcagatgcag caaaactatt gaatgttact 1620
aacgttaata ttctatctaa agcaaaagct caaacaacac caattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ctagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct ggggctaacg 1800
gttcttggtg taacaactta tgctcaaatg agtgatttta ataagaatac gaataaaaat 1860
ggtacgcaat cgggaactaa gtcaagtctg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ctgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caaggaatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcaaccgcg agtgctaatc tagctgttgg ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacccaacc cgtctgaatt attagcatct agtgcaatga aagacttgct tgactctgtg 2400
tccgatttct ttgatgttgt tttgattgat actccacctc tctctgcagt tactgatgct 2460
caaattttga gtagttatgt aggaggagtg gttcttgttg tacgtgccta tgaaacgaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtatag 2640
tagttggaat aaactttaat caaataaaag acagaaattt gtaggatagg agagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caaccatcac tgccactcct 2820
catcataatc ctcaatttaa taatgagtca ccccttattt taaaaaaagt taaggaagtt 2880
caaaatatta ttgacgaaca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaaggaaagt tacttacagc agcaggcact 3000
tcaagttata tattgattga gtttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggtcttcaa cctattttgg tccaccctga gcgtaatagt 3120
ggaatcattg agaacccgga tatattattt gattttattg aacaaggagt actaagtcag 3180
ataacagctt cgagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt tacgtcacgt 3300
gcatttaaga tgaaggaggc atttgaaatg attgaagaaa gttgtggttc tgatgtatca 3360
caaatatttc aaaataacgc agggtcagtg attttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa aaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatggaag tttttgagga tgtctcatca cctgaaccgg aagagcataa gttagtagaa 3540
ttaaaaaaat tttctcatag agagataatt ataaaaagag ggattgatat tttaggggga 3600
ttagcgggtt cagttttatt tcttattgcg gctgcattgc tttatgtccc ttacaaaatg 3660
agctcggaaa aagatcaagg gccaatgttc tataaacaaa aacggtatgg aaaaaacggt 3720
aaaatttttt atattttgaa atttagaaca atgattctta atgctgagca gtatttagag 3780
ctacatccag aagttaaagc cgcctatcat gccaatggca ataaactaga aaatgacccc 3840
cgtgtgacga agattggttc atttattaga caacactcaa ttgatgaatt accacaattt 3900
atcaatgtcc ttaaagggga tatggcatta gttggcccaa gaccaatttt actttttgaa 3960
gcgaaagaat atggggagcg cctctcttat ttactcatgt gtaaacctgg gattactggt 4020
tattggacaa cacatggtag aagcaaagtt ctttttcctc aacgagcaga tttagaactc 4080
tattatctcc aatatcacag caccaagaac gatatcaagc ttcttatgct tacaattaca 4140
caaactattc acggatcgga cgcttactaa aaaatgaagg aaaaacatat ttacattatt 4200
ggttcaaaag gaattccagc aaagtatggt ggttttgaga cttttgtaga agaactaaca 4260
gcacatcaga gtaataaaaa ccttaagtat catgttgctt gtttatcaaa tgacatacaa 4320
tcaaatttta ttcataatgg tgccgactgt tttaatattc caaagaaaaa tattggacca 4380
gcaaatgcca tttattatga tttggcagct ttaaagtact cacttaaaga aattgaagaa 4440
aaaaattata tgggtgcaat tatttatatt ttagcttgcc gcattggtcc gtttattggt 4500
cactataaaa agcaaatgaa aaaattagga attactttga tggtaaatcc tgatggggag 4560
tgtgaaataa tatggacaac cagaaaaagc ctgaattcat acgggtttgc gcggcttgac 4620
cttttcacct caacttgttt ctgcctgtca tggtgcgttg cggccggttt cataggttct 4680
aaccttgtga aacgaattta tcaggaggct ccttctgcta cggttatcgg catcgacaac 4740
atgaatgcct actatgatgt ggcactgaaa gagttccgcc tgaatgagct ggccaagtat 4800
cccacattta ccttttatgg ataatccgaa cttccgcttc gtgaaggctg atatctgtga 4860
ccgcgaagct gtgaataaat tgtttgaaga agaacatccg gacatcgtgg tgaactttgc 4920
ggcagagtct catgttgacc gttctattga agatcctggc atcttcctcc agaccaacat 4980
cattggtacc agtgttctaa tggatgcttg ccgtaagtat ggcattcggc gttatcatca 5040
ggtttctacc ggtgaggttt acggtgacct gccgctggat cgtcctgacc tgttcttcac 5100
tgaggagact ccgatccata ccagctctcc gtatagcagt tccaaggctg ctgctgacct 5160
gctggttctg gcttaccacc gcacctacgg gctgcctgtg accatttccc gttgctccaa 5220
caactatgga ccgtatcact tcccggagaa gctgattccg ctgatgatcg ccaatgcact 5280
ggctgacaag ccgctgcctg tttacggcga aggtctgaat gtccgtgact ggctgtatgt 5340
tgaagatcac tgcaaggcca tcgatctgat tatccacaag ggtcgtgtgg gcgaagtcta 5400
caatgtcggc ggtcataatg aaaagcagaa catcgagatc gtaaagatca tctgcaagga 5460
gctgggtaag ccggagagcc tgattaccca tgttggcgac cgtaagggtc atgatatgcg 5520
ttatgccatc gatccgacca aaatccacaa tgagctgggc tggctgccgg agaccaagtt 5580
tgaggacggc attaagaaga ccatccaatg gtatctcgat aatcgtgagt ggtgggagac 5640
catcatcagc ggtgagtatc agaactacta cgagaaaatg tacagcaacc gctaagaagc 5700
cagaggagga aagaatatga agttttttgt aactggcgtt ggcggtcagc tgggtcatga 5760
tgtgatgaac gagctgctga agcgcggcca tgagggtgta ggctctgata ttcaggaaaa 5820
ttacagtggt gtggcagacg gctccgcagt aacaaaagca ccgtattttg ccttggatat 5880
tacgaacaag gatgcagttg agaaagtcat tacggaagta aatccggacg cagtgatcca 5940
ctgtgcagca tggacggctg tggatatggc agaggatgac gataaagtgg cgaaagttcg 6000
tgccatcaat gcgggcggca cacggaatat tgcggatgtc tgcaagaagc tgaattgcaa 6060
gttgacctat atcagcacgg actatgtgtt tgatggtcag ggcacagagc cctggcagcc 6120
ggattgcaag gattataagc ctttgaatgt atacggtcag acaaagctgg aaggtgagtt 6180
ggcagtcagc cagacgctgg agaagtattt cattgtccgc attgcttggg tgtttggctt 6240
gaatggcaag aactttatta agaccatgct gaatgttggt aagacacacg acactgttcg 6300
tgtggtcaat gaccagatcg gcacaccgac caatacatat gatttggctc gactgctcgt 6360
cgatatgaat gaaaccgaga agtacggcta ttatcatgca accaacgagg gcagctacat 6420
cagctggttc gatttcacga aagaaattta tcgtcaggct ggatataaga cagaagtcct 6480
gccggtgacc acggcagagt acggtctgag caaggccgct cgtccgttca acagccgtct 6540
ggataagagc aagctggtgg aagctggctt tactccgctt ccgacttggc aggatgcact 6600
gagccgttat ctgaaagaaa tcgagcagtg atagaggaga tagagaaaat gggacagatt 6660
aaggttgata aaaatgtagg cggcatcgag ggactttgtg tgattgagcc tgctgtgcac 6720
ggtgatgccc gtggctattt tatggaaacc tacaacgaaa aagatatgaa gaaagctggt 6780
atcgacattc attttgtaca ggataatcag tccatgtcca tgaagggcgt gctgcgcggt 6840
ttgcatttcc agaagcagta tccgcagtgc aagctggtac gcgccgtgcg cggaactgtg 6900
tttgatgtcg ctgttgatct tagaagtaat tctgagacct atggcaagtg gtatggtgtg 6960
accctgtctg ccgagaataa gaagcagttc ctcattccgg agggatttgc acacggattc 7020
cttgttctga gcgatgaagc agagttctgc tataaggtta atgacttctg gcatccgaat 7080
gatgagggtg gtatggcttg gaacgacccg gagattggca ttgagtggcc tggagtacag 7140
ggtgagtaca agggtagcgc gagtgcggaa ggctatgagt tagaggatgg cactgcgttg 7200
aatctgagtg ataaagacca gaaatggctg gcactgaagg atacttttaa attttgagaa 7260
cacaggggta gaagaatgca gcgggaaaac gaagtacaac acgtatttct ggtaggtgct 7320
aaaagcctag gagcatatgg tggttatgaa acctttgtat ataagctgac agaacaccat 7380
cagaataaga aaaatattaa atatcatgtg gcgtgtaaag ctaatggtga cggctgcatg 7440
gacgaaacaa aagtggatgg cgtaaaggga atcaatcaac atgagtttga attccacaat 7500
gcgcactgtt ttaaaattga tattcctcag attggtgcag cacaggcaat ttactatgat 7560
gttgcggcat tgaatgcttg ctgtaagtac ataaaggaac ataaaatcaa acatccaata 7620
gtttacataa tggcttgccg cattggaccg tttgcaggtc atttttatca ggaaatccat 7680
aagcttggtg gtacggtcta tttgaatccg gatggtcatg aatggatgag agccaagtgg 7740
tcggctccga ttcgtaaata ctggaagatt tctgagcgga tgatggtcaa atactgtgac 7800
cttgcaatct gcgattctgt gaatatcgaa aagtatatcc acgagtgtta cgacgggaaa 7860
ggaatcaaag gcagaaatcc taagaccaca ttcattgctt atggtgcaga tttgacactc 7920
agcaagctgg ctgatgacga tgaaaagctg gtgaactggt ataaggaaaa aggactggcg 7980
aagaagggct attaccttgt cgttggacgt tttgtaccgg agaactcttt tgaagtgatg 8040
attcgcgagt tcatgaagag cggaagcaaa aaagattttg cgcttatcac aaatgtgaac 8100
gataaatttc tgaatgaatt ggaagagaaa cttcatttca agagtgacaa gagaatcaag 8160
ttcgttggta cagtgtatga ccaagaactc ttgaagaaga ttcgagagaa tgcttacgca 8220
tatttccatg gacacacagt tggaggtacg aatccatcat tgattgaggc acttggcagc 8280
acggatttga acctgctggt tgatgttggt tttaataaag aggtcgccga agattgcgct 8340
ttgtattgga gccgcgaacc gggcagtctt gcaagattga ttgatcgtgc agataagatg 8400
agtaccgaag aaatcgcgga aatgggtcga aaagccaaga agcgtgtagc tgaagagtat 8460
acatgggata aaatctgtgg acagtatgaa gaagtgttca caaagtgaga gcaaatgagg 8520
gatcagacag tgaaaatctc gaaatattat agaacattct taagaagaaa actaaatgcg 8580
gagaatcgca aacgtctaaa aaacaaaaac tttacggtgc tatgtaataa ctgtgtgggg 8640
ggggtgatcc ttcacgagtt aggtgaacgc tttaattcgc caacggtcaa tttgtttttt 8700
aaagcggaag attatctcaa atttttggag aacttagatt attacttaaa acaggctctt 8760
gtagaagttg gaagcgagaa gaactaccct gttgcaaaac tggatgatat aacaatatat 8820
ttcatgcatt attcatcgtt tgatgaggca aaaataactt ggcaaaaacg agtggcaaga 8880
attaacaaaa acaatttgta tgtaattttt gttcaacaaa gcggttgtac agagcaggtc 8940
ttggaggcat ttgacaagct tccctataaa cataagctgg cacttactgc aaagccaatg 9000
ccggagataa aatgctctta ttgtattcat ggtacagcgc aaccgaatgg agaagtaatg 9060
gatttgtgca agtatgaggg aaagtttact ggcaaacgct ggattgatga atatgattat 9120
gtgggatttt taaataagaa atgatgtgag gaattgatat gtatgattat ttggtggtag 9180
gctctggtct ttacggagca atatttgcgc atgaagcaaa agcgcatgga aaatctgtgc 9240
tagttgtgga taagcgtccg aacattgggg gcaatgtcta caccgagaac attgagggca 9300
tcaacgtcca caagtacggt gcacatattt tccataccaa caacaaaaag gtttggaatt 9360
acatcacgca gtttgccgag ttcaaccgct ttacaaattc tccggttgct aattataagg 9420
gtgaactgta ttcgttgcct ttcaatatgt ataccttcaa caagatgtgg ggcgttgtga 9480
caccggagga agccgctgca aaaattgagg agcagcgcaa ggaaatcact ggcgagccga 9540
aaaatctgga ggagcatgcc atctctctcg tgggccgcga catctatgag aagctcatca 9600
aaggttacac cgagaagcag tgggggcgtg actgcaagga tctgcctgcc ttcatcatta 9660
agcgtcttcc ggttcgtctg acttttgata ataactattt caatgcgttg taccagggta 9720
tccccattgg cggttacacc aagatgattg ccaacttgct ggacggcatc gaggtgcgcc 9780
tgaacatcga ctatctggaa aacaaggttg agctggatgc gctggctggc aaggtggttt 9840
acaccggtcc catcgatgcc tactttgact ataagctggg tacgttggag taccgttctg 9900
tccgctttga gaatgagttg ctcgacaagc cgagctccca gggcaacgct gcggtcaact 9960
atacagaccg cgagacgccg tggactcgta tcattgagca caagtggttt gagtttggta 10020
gggacgagaa cggcaatgat ttgcccaaaa ccattatcag ccgagagtat agcagtgagt 10080
ggaagccggg tgatgagccg tattatccgg tcaatgatgc taagaacagc ttactgtatt 10140
ctgagtataa gaaactggca gatgctgaaa gtaaagtgat tttcggtggt cgtttgggtg 10200
agtacaagta ttacgatatg gatcagatta ttgccgctgt attggagaga tgcgaaaggg 10260
aatttgatgt atgaatggaa aaataatcgt tgttacacat aaagagtata aaatgccatg 10320
tgatacagtg taccttccag tatgcgttgg cgttggcaga gatgctttaa ggaataagta 10380
tcaggctgac gatgaaggtg aaaacatctc tgataagaat attctttatt gcgaactgac 10440
agcactgtat tgggcttgga aaaacttgaa ctgtgattat atcggactgg ctcattatcg 10500
tagatatcta actgagtcaa agagaagtaa aaacatagag gatgctctat ctcagcatag 10560
aattgaagaa ctcttgatgg actatgacat aattgtacct agagagaaaa ggtattctca 10620
aacaatagcc gaccattata ttaactgtat taaaagcaga aaagatgcac acaaaattca 10680
tttacaatta cttcgtgatt cgattcttga ggtagctcca gagtatattg cagaatacga 10740
taagaccatg aatgggcata gtgcacatat gcttaatatg tttgtgatga aaaagcaaaa 10800
tctggacaat tactgcgagt ggctatttaa gattttattt gttttagaaa aaaaaatata 10860
tgaccatgat gtctactatg atcgtataat gggcgcattt agtgagttcc tattggatgt 10920
atggattaga acaaataaaa agacgtatat agaggttgag ttaatcgaaa ctgaaagaga 10980
ctattggggt aagattaaat gggctctgaa aagaaagctg tttgaataag gagataatat 11040
aatgcgaatc ctgcattact cactagggtt tcccccttat cgaaggggag gcctgacaaa 11100
atattgtttg gatcttatgg tagcacagga aatgcagggt aatgtggtag ctatgtgctg 11160
gcctggtgaa atcggaatta tcaagaagaa aaaagttgca ataaaaaaaa gaaaaaaata 11220
cagcatagga aagagtaaga ttgaaaatta tgaaatacaa gggattttac ctgttcctct 11280
tttagaggga ataaaaaatc cggatctatt tactgaaaaa aagaaccaag aaatttggaa 11340
actattttta aagaactgga gacctgatgt tatccacttt catacattaa tgggcttgcc 11400
gctagaatat gttgaaacgg caagaaagct tggaataaaa acattattta cgacgcatga 11460
ttattttgga ttgtgtccaa ggacgacttt ggtgcgtcaa aatggcgaaa tttgtgatgg 11520
ctgcacaccg gaattgtgcg cggaatgctg tgagaatgct atcagttata gaaaattgaa 11580
aattctacaa tcttcagtgt atagggttct gaaagattta gtaattgtaa aaaagctcag 11640
aaaaaaacat tggaatgaat cgaaaaatga ttctgcacaa catcaggctt ctgttcagaa 11700
cgcacaacga gcagaagaat atgttgagct acgaaaatat tatataaaac tactaaaaag 11760
ttttaatatc atccatttta atagtagcaa tacaagagat gtgtatttaa aagcggccaa 11820
ggaggtgtta aacaacgagg tcgtttctat atcacatgaa atgataaaag ataacaaaaa 11880
gaaaaaaagg aaacacgaga tattgcacct ttcttatttg ggaccggata catataataa 11940
aggatactat gtattaaaag aaacactgaa tcagttgcat aaagaaggat ataagtttca 12000
gttaaatatt tattttgagg atgcttcgga gccctttatc gtttcacatg cgccgtatca 12060
atactcagaa ttaggcaagg tgatggatga tgcagattgc gttatattac ctagtttggg 12120
gaatgaaaca tttggcttta cagtcttaga agctctaagt tatggagttc ctgttattgt 12180
tagtagtcgt gttggtgcaa aggatattgt tgaagagggt aaaaacggat ttgttgttga 12240
aggtgatgta gactctttaa aaacaaagct gacaagtgtg ttgaatcaac ctgaaatatt 12300
ggaagatatg aataactata ttgttgcaaa tacacacatt aaaacaatga cagaacattc 12360
taaagaaatc aaggacctgt atcaaaagtg acttgtatat aaaaatggag aagattatgg 12420
atttagataa aattagatgg aattcagagg tcagtcatcc aaattttata gctcaatata 12480
gatttgtaaa agcaaaaaga ttatgtgagt attgggctga caaaaataag ctgctgtact 12540
tatttgctcg gatgcgatat gaacattata aagtaaaata taatacagat attcctgccc 12600
gttgtaaaat cggggggggt caaaatcagg catctcggag gtattgtatt taacccgggt 12660
gttgaaattg ggaaaaatgt tgattgctta aatggcgttt tgcttgggca aatcgatttc 12720
ggtgccaaag caggtgtgcc aagaattggt gataatgtgt ttttaggaac caattctatt 12780
gttgtcggta agattcaaat aggaaatgat gtgttaattg ctcctggtgc atatgtgaac 12840
tttgatgttc cggatcactc tatagtaata ggaaatcctg gaaagattat tgcaaaacaa 12900
aatgcgacaa gaggatatat agcatcacct gttgaagatt aaatgagcgg ataataaata 12960
gcgaggagtt atgagtccgc acgttttctg caaagggaga acaaattgtg caagataaag 13020
taagtattat tgttcctgta tataaagttg agagggaact agatcgctgt gttcaaagtc 13080
tgattaaaca gacttataaa aatttggaaa taattcttgt ggatgatggt agtcctgatc 13140
aatgtcctga attatgcgaa aattatgctg agatagataa gagagttaag gtcattcata 13200
aagagaacgg cggattatca gatgctcgta atgcgggatt gaaacaagca acaggcaagt 13260
atattctgta tgttgattct gatgattata ttgatttgga tgcctgcgaa agatttataa 13320
aggcggcggg taatcaaaaa atagatattg ttgttggaaa tgcaattatg gaaaaaccag 13380
atggtaaaga aatgatgata cactcagcga caccatctgg aatcacctat actgccaaac 13440
agtttattat gagtgctgtt aaagcatatc agtggtatgc ccctgcatgg cttaatatgt 13500
atagaaggga ctttcttctt gataatcagt tatacttcaa aaaaggaatt tactttgagg 13560
atgttcaaat gctgccacgt gtttttttgg ccgcaaaaaa aatcacatgc atatatggaa 13620
cattttatca ttatattatt cgagaaaatt caattatgac gtctcagaaa gacgagaaaa 13680
agaaaaacga ttcaattcaa aatatgaaag agtggaaaga gcagtttgat cttgtagatg 13740
atgtggcctt gaaaaaatgc ctatatggaa tgcttgtgaa aatgtatata cacgaatgta 13800
ggcagtatgg gattacgact aaagcaattg aaggaatgga tgatagattt atattgggga 13860
actgtctcaa ttataaagaa agattaaagg ctactatgtg gttgtgcttt ccaaggctac 13920
tgataaaaca gtgaaggagt gcatatgagt gtttacatat ttttatgggt tgctgtagtt 13980
gtatttggct ttatcgcaag tagaagtaat tataaagcaa aatattttgt gctcttttct 14040
tttttcctga tgacaattgt tttaggacta agaggggcta cagttggcga agatacaaaa 14100
atgtatctta atattgcaga aagagtaact aatatatcat ggaaagaagt gttttctagc 14160
tttccaacga gtcagtggag atatatttca tatggtggct taagtggatt tagtgagcag 14220
acggaaacag tttatttggc ctattgcaaa ttgataatgc ttatatttca taatgcacag 14280
gcagttcttc ttataacagc tgcaattacg aatgctctat ttgcaaagtt tattttagat 14340
aacataacag tcaaacaaga tgccatactg gctgtttata tttacatgtg cgatgcgatg 14400
tttatgaatt cgtttaatac aatgcgacaa attttggcaa tatccattgc agtgcaatcc 14460
atagaattaa taaaaaaaga aaagtataag aaagcaatag catgtgttct attagccgca 14520
tgcttccatc aatctgcaat agtttttttt gttgccgatt tattctattt attaaagaag 14580
aaaaaagaaa gatatattta tctccttgtg actctttgtg cattgccagt tttaatccct 14640
gtggctatta aagtggtcag tatcttctcc agtaaatatg caagttattt gtcagttagt 14700
ttttggggag cacaactacg aggaacactc ttgctctgga taattattgc aattgttctg 14760
tttattatga tacgtgctaa ccaatcggat aatatagatt ggtggctaat ctatatggca 14820
acaatttaca ttggtgtaga gcttgttgga atgcagttga cggttatatc tagggtggca 14880
atgtacttta gaattttcct tgtattgctc tttccgattg ctcaaaaata tttcactaaa 14940
aaaagtggac agttttataa aattggcgtg gttatgctaa tgactgtatc gttctttagt 15000
tatgcgagtt cgcctgatcg tttgtacact ttttgctttt aatgatcaaa gagggagggc 15060
agaggaatgc ctattgcttc ggtgataata cctacatata aaggaagtag tgccttaaat 15120
agggcaattg atagtgtgtt gtgtcaatca tataaggaaa ttgaaaaaat tgtagttgat 15180
gataatggtt ctgttgcaaa gttttaaatc tactatcaaa taaggtagaa taatagaaaa 15240
agatagcagg aggaatgacg atgaatcatt ttaaaggaga gcaatttcag caggatgtga 15300
ttattgtagc cgtgggctac tatcttcgtt ataaccttag ctatcgtgaa gttcaagaaa 15360
tcttatatga tcgtggcatt aacgtttctc atacgacgat ttatcgttgg gtgcaagaat 15420
atggcaaact actctatcaa atttggaaaa agaaaaataa aaaatccttt tattcatgga 15480
aaatggatga aaacgtacat caaaattaaa ggaaaatggc attatttgta tcgagccatc 15540
gatgcagatg gtttaacctt ggatatttgg ttacgtaaaa aacgggacac acaagcagcc 15600
tatgcttttc ttaagcggtt agtgaagcag tttgatgaac cgaaggttgt agtcacagat 15660
aaagccccct ctattaaaag tgcctttaag aaactaaaag aatacggctt ttatcaaggg 15720
acagaacatc gtaccattaa atacctgaat aatttgattg aacaagacca tcgtccagta 15780
aagagacgca ataaattcta tcgaagttta cgcactgcct cacccacgat taaaggcatg 15840
gaagccattc gaggattata taagaaaacc cgaaaagaag gcactctctt cgggttttcg 15900
gtctgtactg aaatcaaggt attattggga atcccagctt aaatcataga taccgtaagg 15960
gattttattc tttatttaaa actttgcaac agaaccaaga tttgtataaa aaataaaaat 16020
atggaggtac aatcatggga aaaattccta tgagtaaggt agacaaggag tctgttttaa 16080
acatgcttat cagcaatgaa aacagcgtga agattccgca agcggtggaa gtggttgatt 16140
atcaaacagg cgagttcgac agaggaggcg aaaaaggtct gatgtattgg gctaatttat 16200
gcgttgttga tgttgaagag ttagaacttc tgaaaagtgt tggtttggaa gaaaatgcta 16260
ttctgattaa gctgaaagta tctgattata ataatgaaaa tcttgaggtt ttaaaaggta 16320
aagtgctaga tactaaatct atggaaatag tgtttgtaga gaaaaaaagt aaagtaggta 16380
acgaaattac aggcttggca tttaaaacaa gttttagaga tttaagagga gtctgtttta 16440
aacatgctta tcagcaatca aaacagcgtg aagattccgc aagcggtgga agtggttgat 16500
tatcaaacag gcgagttcga cagaggaggc gaaaaaggtc tgatgtattg ggctaattta 16560
tgcgttgttg atgttgaaga gttagaactt ctgaaaagtc tatattttgg ggggcactac 16620
agaaatggaa cgctcttcgg cttttcggtg tctactgaaa tcaaggtatt aatgggaata 16680
acagcctaag atatttggag ttcacagagg gcgcatttga ttttcaaact tcgcaataga 16740
accaaatagg gttaatctca tcaagaaaag gagttagaaa cagttgcgaa aatatatgat 16800
ctacctcagc agtttattgg tcacatttat cctgagttat gcgacgatta attggctgat 16860
tatgcccgtt ctcactcgct atcaaagcct ggctaggttg attaaccact ttgactatac 16920
cgcattaact ttaatactct tattaacgct gattatctgg ttgtttggca tccagtatca 16980
cctcaaacat ttttcagtta tttatctcta tcttgctttc agtgtgtatt tattactgtt 17040
attcatggtg ctttttacta aaacaacgga ttttcaggcg atatcactga atccttttga 17100
ctttataaaa gcggatacca gaacgattca agaggcagtg ctaaatatta tctacttcat 17160
tcctttaggt ggcctttact gtatcaatac tgatttcaaa cagtttgtca ttgtcagttt 17220
ggtcacactt ttaggaattg aaaccattca atttatcttt tatttgggca catttgccat 17280
tagtgatatt atcttgaatt ttttgggttg tttgattggg tattattgtt gttgggagat 17340
taaaaggcgg ttgagttgag ctaacccaaa agtagcataa aaaggttctg ttgcaaagtt 17400
ttaaatctac tatcaaataa ggtagaataa tagaaaaaga tagcaggagg aatgacgatg 17460
aatcatttta aaggaaagca atttcagcag gatgtgatta ttgtagccgt gggctactat 17520
cttcgttata accttagcta tcgtgaagtt caagaaatct tatatgatcg tggcattaac 17580
gtttctcata cgacgattta tcgttgggtg caagaatatg gcaaactact ctatcaaatt 17640
tggaaaaaga aaaataaaaa atccttttat tcatggaaaa tggatgaaac gtacatcaaa 17700
attaaaggaa aatggcatta tttgtatcga gccatcgatg cagatggttt aaccttggat 17760
atttggttac gtaaaaaacg ggacacacaa gcagcctatg cttttcttaa gcggttagtg 17820
aagcagtttg atgaaccgaa ggttgtagtc acagataaag ccccctctat taaaagttcc 17880
tttaagaaac taaaagaata cggcttttat caagggacag aacatcgtac cattaaatac 17940
ctgaataatt tgattgaaca agaccatcgt ccagtaaaga gacgcaataa attctatcga 18000
agtttacgca ctgcctcacc cacgattaaa ggcatggaag ccattcgagg attatataag 18060
aaaacccgaa aagaaggcac tctcttcggg ttttcggtct gtactgaaat caaggtatta 18120
ttgggaatcc cagcttaaat catagatacc gtaagggatt ttattcttta tttaaaactt 18180
tgcaacagaa ccgagataat aatcacgtct ttttggaatt gtttgccctt aaaatgattc 18240
atctgttgtc ctcgcattct tttttattac attttacaat aaatcgggtg ttatggggaa 18300
ctttgcaaca gaacctattt gaatttagtc cagttctaac tatctttttt ttcaaattta 18360
agctaaaata gatttttgga aaactttgca acagaaccct tagttttctg tgtttttttc 18420
taatttcatt tagaggtgaa ttaattggta gttattagag gtgccctata aaataactta 18480
gagctttgtg ggagctaccc actaatacta atataaggag atagacaatg gatttaaaag 18540
atttaataag tgttattgtt ccgatatacg gcgttgaaga atatttaaat aaatgtatcg 18600
actctattat caatcaaaca tataaaaatc tagaaattat tttggttgat gatggtagtc 18660
cagataaatg tccagatata tgcgatacat tcgaaaaaaa agatgagaga ataaaggtaa 18720
tccataaaaa gaatggtgga ttatctgatg cgagaaatgc cggtattgat acagcacatg 18780
gagactattt cgtttttgtt gatagcgatg attggattga aaacacaatg gtagagcatt 18840
tgctcttcgc atgtaaaaaa tataatgttg aaatggcaac ttgtgctaga tatattacag 18900
atggtcattc aactagagca gtcgcattta atggtccagc aggagcatat tcagctgaag 18960
aagcattgaa tgaaatactc ttaggaaagt cgatggatgt tgctgcttgg gataaaattt 19020
atgctcgtaa tctatttgaa gaaatacggt ttccggttgg tgaaaataat gaagacattg 19080
cagttttcta taaactagta gacttggctg gcagagtagc acataccggt acaacggaat 19140
atttttatcg gagtcgtccg ggcagtatta caaaattgaa atatagtaca gatgccagaa 19200
aaatcatcga gaaaaatctg aattcaatag aaaaatttct tgataaaaag tatccaagct 19260
gtttgccaag tttttatcgt tataaaacaa tgaacattta tgcattgttg aataagtata 19320
ttaaatgcga aggaacaaag aaaacacaag aatttgagca tctgatgaac gagttccgaa 19380
aaaataagag ctatttcttt aatgatgatc agaccccatc aaaagaaaag aagattgcca 19440
taatgattct tttgcatctt tacaatccgt atttacttgt aaaagaaaag attacgggtt 19500
ataagtgacg aagggagaaa ttgagaaaat gattggttct gttgcaaagt tttaaataaa 19560
gaataaaatc ccttacggta tctatgattt aagctgggat tcccaataat accttgattt 19620
cagtacagac cgaaaacccg aagagagtgc cttcttttcg ggttttctta tataatcctc 19680
gaatggcttc catgccttta atcgtgggtg aggcagtgcg taaacttcga tagaatttat 19740
tgcgtctctt tactggacga tggtcttgtt caatcaaatt attcaggtat ttaatggtac 19800
gatgttctgt cccttgataa aagccgtatt cttttagttt cttaaaggca cttttaatag 19860
agggggcttt atctgtgact acaaccttcg gttcatcaaa ctgcttcact aaccgcttaa 19920
gaaaagcata ggctgcttgt gtgtcccgtt ttttacgtaa ccaaatatcc aaggttaaac 19980
catctgcatc gatggctcga tacaaataat gccattttcc tttaattttg atgtacgttt 20040
catccatttt ccatgaataa aaggattttt tatttttctt tttccaaatt tgatagagta 20100
gtttgccata ttcttgcacc caacgataaa tcatcgtatg agaaacgtta atgccacgat 20160
catataagat ttcttgaact tcacgatagc taaggttata acgaagatag tagcccacgg 20220
ctacaataat cacatcctgc tgaaattgct ttcctttaaa atgattcatc gtcattcctc 20280
ctgctatctt tttctattat tctaccttat ttgatagtag atttaaaact ttgcaacaga 20340
accaaatatg gtataattta aggtatataa tatatatatg gagataccat gaatcaaaaa 20400
aagaggcgtc attatcgtaa gaaaaaacac acagtactaa aagttatttc aattattttt 20460
gtattagtaa ttatcgctgt tgtttctata gcctacgccg cttatagaaa tgttgaatca 20520
acattttcca catcatatga aaatttccct aaaacaacaa gtattgactt aaaaaagtct 20580
aaaacattca ccacacttat cattgcaact ggtaaaaata attctaaaaa tacagcttat 20640
gctactgttt tagcttcaac gaatgtaaag acaaatcaaa ctactttcat gaacttccca 20700
gtttttgcga caatgcctaa tcaaaaaaca atcactgaag tttacaatac gaatggagat 20760
gatggaattt tccagatggt taaagaccta ttgaatgtgt ccattaacaa agtagttcag 20820
atcgatgtta ataaaatggg atcacttgta caggccactg gtggaatcac catgcaaaat 20880
ccaaaggcgt tcaatgctga aggttatgag tttaaacaag gaactgttaa tttacaaact 20940
gctgatcaag tccaagccta tatgacacaa attgacgata ctgatttgga tgcttcaatc 21000
acccggattc aaaatgtctc aatggaactc tacggaaata ttcaaaaaat tgctcatatg 21060
aaaaaacttg aaagtttcaa ttactatcga gaaattctct atgctttttc aaacactgtt 21120
aaaaccaata taagtttcaa tgatgctaaa acgatcgtta tgagctacaa taaggctcta 21180
aagaatacca gcaagctcaa tctacataca acagatgaaa atggagctaa ggtcgtttct 21240
caaacagaat tagactcagt caaaaccctt tttgaaaaat ctctaaaata aaagaaccat 21300
gaggtgtagt gacccccaaa atatagactt ttcaaatcta tatttcggag gtcttttctt 21360
atggtcaaat attccataga attaaaacaa cgtgttattc aagattattt atctggcaaa 21420
ggaggctcta cctatcttgc taaaatgcac aacgtaggtt catctagtca agttagacgc 21480
tggattcgca attatcgagc agagggactt cctacagctc actccaaagt caataaaaat 21540
tattctatgg aattgaaaga aaatgcggta caatgttatt taacaactga tttaacttat 21600
gaagctgtgg caagaaaatt cgaaattact aattttacct tacttgcaag ctgggttaat 21660
catttcaaac tctatggaga agtcccaata agcaaaaaga gaggacgacg taaaaaacta 21720
gaaagcattg catcgtctat gactcagaac tcaaatgatt ctcaacgaat taaagaactt 21780
gaacaagaat tacgttatgc gcaaattgag gtagcttatt taaaatgact tcggacattg 21840
gagaaaaatg ctctaatgaa caaaaatcaa gactcatcta cagtctccgt aaaaccttca 21900
aactcaaaga aatcttgaaa gttacaggat tccctaaagc cacttattat tattgggtca 21960
actgttttga acgggtaaac aaagatgagc ttatcgaaaa agaaatgctt aaaatacgtc 22020
aagaacatgc caatgcaggt taccgtccat tgagtgaatt acttaagcaa cgtggctatc 22080
acgtcaacca caagaaggtc cagcgcttaa tgaagaagct agggcttcgt gtaacgtctt 22140
attggcacaa atcacgtaaa tataattctt ataaaggaaa agtagggacg gtagctaaaa 22200
acaaattgca cagacgattc agaacttcca ttcctcatca aaaaatcaca acagatacga 22260
ctgaattcaa atattatgaa gaagggattc agaaaaaatg ttatctcaat ccttacattg 22320
atttatttaa tagcgaggtg atcagctatc atatctctaa acacacccct cttatcaatc 22380
aattgagact gctctaaata aagccgtagc tgtgacatct gattgtcctt atcgacgtac 22440
tttccactca gatcaaggat ggggatacca aatgagagac tatgtttcta aattaagatc 22500
tcatcgaatt tttcagtcta tgagccgtaa aggaaactgt catgataatt cagtcatgga 22560
gaatttcttt gggctactta aacaagaaat ttattatgga cgtatcttct cgtctttcga 22620
agaacttgag caggttattg taagctggat taggtactat aatacaaaac gaatcaaaca 22680
aaaattgaac tggatgagtc caattcaatt tcgcttaaat taccaaaaca attaaaaaaa 22740
tccaagtatt ttggaaaggt tataataaac tagacacaaa gttaagagaa atcgtggaaa 22800
ggtttattat ggcaagaaga aaattcgata aacaatttaa aaattctgca gtaaaactca 22860
ttcttgaaga gggttactct gttaaagaag tcagccaaga gcttgaggtt catgccaata 22920
gtctttatcg ctgggttcaa gaagttgaag aatatggaga aagtgctttt ccaggcaatg 22980
ggacagccct agctgatgcc caacataaga ttaaattgtt agagaaagaa aatcgttatc 23040
ttcaggagga acttgaactt ctaaaaaagt tccgggtctt cttgaagcga agcaagtaaa 23100
acgttttgaa tttctcttga aacatcatgg gaagataaaa attaagcatg cagtaaaagt 23160
tcttaaggtt tctcgctcag gtttctatga atacatgcat cgtcgtcctt caaaacaaca 23220
agtggagaga gaaattctct cagagaagat aaaagctgtc tttcatgagc ataagggacg 23280
ctatggtgcg gttagaatta ccaaggtact tcataatact ggtattatga ccaacacgaa 23340
acgtgttgga aaactgatgc acttgatggg actttatgcc aagggaagcc gttataaata 23400
taaacattac aacagaaaag gagcttcgct ttcaagaccc aatttaatta atcagatctt 23460
taaagcaaca gctcctaata aagtatggct gggagacatg acctatatcc ctaccaaaga 23520
aggcacctta tacttagccg tgaatatcga cgttttttca cgtaagattg taggctggtc 23580
aatgtcttca cggatgcaag ataaactggt gagggattgc ttcttacaag cttgtgggaa 23640
agaacatcct cagcctggct tgattgtcca tactgatcaa gggagtcaat atacaagctc 23700
tcgttatcaa tctactcttc gtcaagtcgg tgctcaatct agcatgagtc gtaaaggaaa 23760
tccctatgac aatgcaatga tggagtcttt ttataagacg ctaaagagag agcttattaa 23820
tgatgctcat tttgagacaa gagctgaggc tactcaagaa atatttaaat acattgagac 23880
ctattacaat acaaaaagga tgcattcagg tcttgattac aagtcgccaa aagactttga 23940
aaaatataat tcttaaattc tcttaactcc gtgtctagtt tttcgttgac tttccatttc 24000
tacttggata aaaagtctaa cttttggggt gcacatcaag gttcttttat ttttatttca 24060
tcacaatata atccggtacg gctcgatcat cttgactagc aacaatcgtt ttaccattga 24120
cagatagttg acttgagcca ccactatcaa gtaataacat attttggagc ttcaaatttg 24180
atactgattt tattatatta tcataacctg catttgtatc actcaaaata gcatagagat 24240
tattatcttt atcattcgca ataaaaatat gaatcttcca atctactgag ccatcacttg 24300
gttgaatttt accatcacgg ataattgcag taccaaaatc ataggcttgt tgccctccgt 24360
ttttaataat agttgaagca gatgtacttg aatcataaat tttgcacgaa ccatctttgt 24420
taataacaaa agcatattga gttgttgtac ctggactcca gtcttgaatc aactttccat 24480
tattaatttg aaatccagct acttgtcctg tctgcatatc aaaagcggaa gcattcataa 24540
tcaaagcatt aggatactta gctataactt ctgacatttt catccgttga tctgtacgat 24600
tggtaactgt ttttaagact ttcggattgt taattcgata aatagtgata ttattcgtag 24660
ataaatcagt aaatttatct agatttgcct tcgagggaac cttaatccat tctggtatac 24720
caacatcatc tacaagtgct atactaccca cagcagcttc ttcccctttt ttctctacat 24780
aaccattaga ggccgaattt ttggtcggtt gatccgaact atttgttttc gtatgatcag 24840
taagaataag taaatctctt aaggcataag cccctgctcc tataagggta ataataatta 24900
agataggtat aattatagtc caaatatttt ttttttttgt tttttttacg taccacgata 24960
tcctccatta ataaatatac ttagaatggt tctgttgcaa agtttcaata ctgagtacaa 25020
aagtcccatt ctgatacttt gactatgctg gtatacccat caaaacttta atttcagtag 25080
atgacgaaaa gccgaagagc gttccatttc ttcggttctt tttgtatatg cctctcaatg 25140
cttctatgcc cttaatcgta gttgatgcag tacgaagact ttgataaagt ttatttcgac 25200
gtttaacagg tctatgatct tgctcaatta agttattgag atacttgact gtccggtgct 25260
cagtttcgct gtatagcccc tgcttttgta acttcctaaa agcactagca attgatggtg 25320
ctttatctgt cacgataact ttcggttccc caaactgctt gtatagtcgt ttaaagaaag 25380
cataagcaga ttgagtattt ctctttcgtc ttagccaaat atccagtgtc atgccttctg 25440
agtcgattgc acgatagaga taatgccact tccctttaat tttaatataa gtttcgtcca 25500
ttttccacga ataaaaggac tgtctatttt tctttttcca gatttgatag atcagcttac 25560
catattcttg aacccaacgg taagttgttg tgtgagaaac gttaatgcca cgatcatata 25620
acatctcttg aacatcacga taactcaaat tatagcgaag ataataccca acagagataa 25680
taatcacgtc ttgttggaat tgtttgccct taaaatgatt catctgttgt cctcgcattc 25740
ttttttatta cattttacaa taaatcaggt gttatgggga actttgcaac agaagcataa 25800
atcgtagtat gacaaacatt tattccacga tcatataaca attcctgaac ttcacgatag 25860
cttagattgt aacgcaggta gtaaccaaca gcgacaataa tgacgtcttg ttggaagtgt 25920
ttgcctttaa aatgattcat cactctgtcc tctctgtctt ttttctcaat tttacactaa 25980
aatagatttt ttggaaaact ttgcaacaga acccctaaat ccatacctct atttatgcat 26040
aataaacttt agcttattaa ttatttctgc ctttaaaaat attaataaaa caacataaat 26100
aattatgccc acagtaattt ccaacagaat gaatatccaa gatgtcggtg ttaacaaact 26160
aattttaaag acaattagaa acatcactaa tcctgcaatt aaatacttag ataaatccga 26220
aaacagtgta tgcaaattaa gctgtttatg aattataaaa agttgataca cagttacaga 26280
catttcagaa attacagttg caattgatgc accaacagta cctagatata taatcagtgg 26340
aatatttaac attaaattga ctatcgctcc aatgatcact gacactgtat atgacttatt 26400
ttgattagtt ggtaaaagat attgagcacc tattgcgttg ctccaagcta taaaaataat 26460
tgcgattgac tcgatcatta acacaggaat aacatcacta aattgagatg taaaaaaaag 26520
tggcacgaat ttaggagtaa tagctatcag accaaacatc ataggaatcg aaattgccga 26580
cacaaaagaa aaacctgcgt acatgtattc tttaatttta ctatactctc tatgtgcaaa 26640
ggcatttgca acacgtggca acatgacagt acctgttgca gtagcaatag ccaaaaccag 26700
tttaactatt ttatcagact gatcaaaaaa gccggagctc gtgacagaat ccaatgaacc 26760
taacattgtt ttattcaaaa cccaataaat ttggacagca atttgtggga taaacataac 26820
taaagattgc tttaaatgct ttattggcct taattcacga tagttaacct ttacgagata 26880
tctgtgtaaa cttgggaaaa aagttaaatt accaattaat gtagataaaa ctgttatcaa 26940
tatatatata ttcaaatcat tgtaagattt gacaaatagg aaaacactga atagagcaag 27000
taacttaact ataaaatttc ttaatacagt tactttaaaa ttttcaattc ccataaaaaa 27060
ccaagtgata tcaaatgcag ctgcaactat agcaatggat tgagacaaat agtatgcatg 27120
atactgacca ttaatggtta aaaaagcaac gaacaaaaaa tatgctaaac atattgtaaa 27180
tagtcttaaa ataaatattt cataaaagac tttagacatt ttgagctgat tatccctaac 27240
aaaggcaatc tgacgattcc catacaaacc gactcctata ctaccaaata aaacaaaata 27300
ctgaacaata gaattggtat atgagttaat tccaatacct gaagggccca aaattcttga 27360
caaataagga atggtaagta atggcacaat tattataaag acctgatata ttgcattata 27420
aagataattt tttgcaattt gcat 27444
<210> 200
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_EpsR
<400> 200
Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 201
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsX
<400> 201
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Leu Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Asn Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Arg
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Lys Asn Leu Ala Ser Thr Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Lys Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Asp Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 202
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsC
<400> 202
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Glu Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Ser Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Ser Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asp Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Met Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Leu Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 203
<211> 230
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsD
<400> 203
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Thr Ala Ser Ala Asn
50 55 60
Leu Ala Val Gly Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asp Leu Leu Asp Ser Val Ser Asp Phe Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ser
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val
225 230
<210> 204
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsB
<400> 204
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Met Ile Glu Glu Ser Cys Gly Ser Asp Val Ser Gln Ile Phe
210 215 220
Gln Asn Asn Ala Gly Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 205
<211> 228
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsE
<400> 205
Met Glu Val Phe Glu Asp Val Ser Ser Pro Glu Pro Glu Glu His Lys
1 5 10 15
Leu Val Glu Leu Lys Lys Phe Ser His Arg Glu Ile Ile Ile Lys Arg
20 25 30
Gly Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Glu Arg Leu Ser Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Leu Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Met Leu Thr Ile Thr Gln Thr Ile His Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 206
<211> 216
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_GT1
<400> 206
Met Lys Glu Lys His Ile Tyr Ile Ile Gly Ser Lys Gly Ile Pro Ala
1 5 10 15
Lys Tyr Gly Gly Phe Glu Thr Phe Val Glu Glu Leu Thr Ala His Gln
20 25 30
Ser Asn Lys Asn Leu Lys Tyr His Val Ala Cys Leu Ser Asn Asp Ile
35 40 45
Gln Ser Asn Phe Ile His Asn Gly Ala Asp Cys Phe Asn Ile Pro Lys
50 55 60
Lys Asn Ile Gly Pro Ala Asn Ala Ile Tyr Tyr Asp Leu Ala Ala Leu
65 70 75 80
Lys Tyr Ser Leu Lys Glu Ile Glu Glu Lys Asn Tyr Met Gly Ala Ile
85 90 95
Ile Tyr Ile Leu Ala Cys Arg Ile Gly Pro Phe Ile Gly His Tyr Lys
100 105 110
Lys Gln Met Lys Lys Leu Gly Ile Thr Leu Met Val Asn Pro Asp Gly
115 120 125
Glu Cys Glu Ile Ile Trp Thr Thr Arg Lys Ser Leu Asn Ser Tyr Gly
130 135 140
Phe Ala Arg Leu Asp Leu Phe Thr Ser Thr Cys Phe Cys Leu Ser Trp
145 150 155 160
Cys Val Ala Ala Gly Phe Ile Gly Ser Asn Leu Val Lys Arg Ile Tyr
165 170 175
Gln Glu Ala Pro Ser Ala Thr Val Ile Gly Ile Asp Asn Met Asn Ala
180 185 190
Tyr Tyr Asp Val Ala Leu Lys Glu Phe Arg Leu Asn Glu Leu Ala Lys
195 200 205
Tyr Pro Thr Phe Thr Phe Tyr Gly
210 215
<210> 207
<211> 303
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_dTDP-glucose_4,6-dehydratase
<400> 207
Met Ser Trp Pro Ser Ile Pro His Leu Pro Phe Met Asp Asn Pro Asn
1 5 10 15
Phe Arg Phe Val Lys Ala Asp Ile Cys Asp Arg Glu Ala Val Asn Lys
20 25 30
Leu Phe Glu Glu Glu His Pro Asp Ile Val Val Asn Phe Ala Ala Glu
35 40 45
Ser His Val Asp Arg Ser Ile Glu Asp Pro Gly Ile Phe Leu Gln Thr
50 55 60
Asn Ile Ile Gly Thr Ser Val Leu Met Asp Ala Cys Arg Lys Tyr Gly
65 70 75 80
Ile Arg Arg Tyr His Gln Val Ser Thr Gly Glu Val Tyr Gly Asp Leu
85 90 95
Pro Leu Asp Arg Pro Asp Leu Phe Phe Thr Glu Glu Thr Pro Ile His
100 105 110
Thr Ser Ser Pro Tyr Ser Ser Ser Lys Ala Ala Ala Asp Leu Leu Val
115 120 125
Leu Ala Tyr His Arg Thr Tyr Gly Leu Pro Val Thr Ile Ser Arg Cys
130 135 140
Ser Asn Asn Tyr Gly Pro Tyr His Phe Pro Glu Lys Leu Ile Pro Leu
145 150 155 160
Met Ile Ala Asn Ala Leu Ala Asp Lys Pro Leu Pro Val Tyr Gly Glu
165 170 175
Gly Leu Asn Val Arg Asp Trp Leu Tyr Val Glu Asp His Cys Lys Ala
180 185 190
Ile Asp Leu Ile Ile His Lys Gly Arg Val Gly Glu Val Tyr Asn Val
195 200 205
Gly Gly His Asn Glu Lys Gln Asn Ile Glu Ile Val Lys Ile Ile Cys
210 215 220
Lys Glu Leu Gly Lys Pro Glu Ser Leu Ile Thr His Val Gly Asp Arg
225 230 235 240
Lys Gly His Asp Met Arg Tyr Ala Ile Asp Pro Thr Lys Ile His Asn
245 250 255
Glu Leu Gly Trp Leu Pro Glu Thr Lys Phe Glu Asp Gly Ile Lys Lys
260 265 270
Thr Ile Gln Trp Tyr Leu Asp Asn Arg Glu Trp Trp Glu Thr Ile Ile
275 280 285
Ser Gly Glu Tyr Gln Asn Tyr Tyr Glu Lys Met Tyr Ser Asn Arg
290 295 300
<210> 208
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_dTDP-4-dehydrorhamnose_reductase
<400> 208
Met Lys Phe Phe Val Thr Gly Val Gly Gly Gln Leu Gly His Asp Val
1 5 10 15
Met Asn Glu Leu Leu Lys Arg Gly His Glu Gly Val Gly Ser Asp Ile
20 25 30
Gln Glu Asn Tyr Ser Gly Val Ala Asp Gly Ser Ala Val Thr Lys Ala
35 40 45
Pro Tyr Phe Ala Leu Asp Ile Thr Asn Lys Asp Ala Val Glu Lys Val
50 55 60
Ile Thr Glu Val Asn Pro Asp Ala Val Ile His Cys Ala Ala Trp Thr
65 70 75 80
Ala Val Asp Met Ala Glu Asp Asp Asp Lys Val Ala Lys Val Arg Ala
85 90 95
Ile Asn Ala Gly Gly Thr Arg Asn Ile Ala Asp Val Cys Lys Lys Leu
100 105 110
Asn Cys Lys Leu Thr Tyr Ile Ser Thr Asp Tyr Val Phe Asp Gly Gln
115 120 125
Gly Thr Glu Pro Trp Gln Pro Asp Cys Lys Asp Tyr Lys Pro Leu Asn
130 135 140
Val Tyr Gly Gln Thr Lys Leu Glu Gly Glu Leu Ala Val Ser Gln Thr
145 150 155 160
Leu Glu Lys Tyr Phe Ile Val Arg Ile Ala Trp Val Phe Gly Leu Asn
165 170 175
Gly Lys Asn Phe Ile Lys Thr Met Leu Asn Val Gly Lys Thr His Asp
180 185 190
Thr Val Arg Val Val Asn Asp Gln Ile Gly Thr Pro Thr Asn Thr Tyr
195 200 205
Asp Leu Ala Arg Leu Leu Val Asp Met Asn Glu Thr Glu Lys Tyr Gly
210 215 220
Tyr Tyr His Ala Thr Asn Glu Gly Ser Tyr Ile Ser Trp Phe Asp Phe
225 230 235 240
Thr Lys Glu Ile Tyr Arg Gln Ala Gly Tyr Lys Thr Glu Val Leu Pro
245 250 255
Val Thr Thr Ala Glu Tyr Gly Leu Ser Lys Ala Ala Arg Pro Phe Asn
260 265 270
Ser Arg Leu Asp Lys Ser Lys Leu Val Glu Ala Gly Phe Thr Pro Leu
275 280 285
Pro Thr Trp Gln Asp Ala Leu Ser Arg Tyr Leu Lys Glu Ile Glu Gln
290 295 300
<210> 209
<211> 223
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_dTDP-4-dehydrorhamnose_3,5-epimerase
<400> 209
Leu Ala Gly Cys Thr Glu Pro Leu Ser Glu Arg Asn Arg Ala Val Ile
1 5 10 15
Glu Glu Ile Glu Lys Met Gly Gln Ile Lys Val Asp Lys Asn Val Gly
20 25 30
Gly Ile Glu Gly Leu Cys Val Ile Glu Pro Ala Val His Gly Asp Ala
35 40 45
Arg Gly Tyr Phe Met Glu Thr Tyr Asn Glu Lys Asp Met Lys Lys Ala
50 55 60
Gly Ile Asp Ile His Phe Val Gln Asp Asn Gln Ser Met Ser Met Lys
65 70 75 80
Gly Val Leu Arg Gly Leu His Phe Gln Lys Gln Tyr Pro Gln Cys Lys
85 90 95
Leu Val Arg Ala Val Arg Gly Thr Val Phe Asp Val Ala Val Asp Leu
100 105 110
Arg Ser Asn Ser Glu Thr Tyr Gly Lys Trp Tyr Gly Val Thr Leu Ser
115 120 125
Ala Glu Asn Lys Lys Gln Phe Leu Ile Pro Glu Gly Phe Ala His Gly
130 135 140
Phe Leu Val Leu Ser Asp Glu Ala Glu Phe Cys Tyr Lys Val Asn Asp
145 150 155 160
Phe Trp His Pro Asn Asp Glu Gly Gly Met Ala Trp Asn Asp Pro Glu
165 170 175
Ile Gly Ile Glu Trp Pro Gly Val Gln Gly Glu Tyr Lys Gly Ser Ala
180 185 190
Ser Ala Glu Gly Tyr Glu Leu Glu Asp Gly Thr Ala Leu Asn Leu Ser
195 200 205
Asp Lys Asp Gln Lys Trp Leu Ala Leu Lys Asp Thr Phe Lys Phe
210 215 220
<210> 210
<211> 410
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_GT2
<400> 210
Met Gln Arg Glu Asn Glu Val Gln His Val Phe Leu Val Gly Ala Lys
1 5 10 15
Ser Leu Gly Ala Tyr Gly Gly Tyr Glu Thr Phe Val Tyr Lys Leu Thr
20 25 30
Glu His His Gln Asn Lys Lys Asn Ile Lys Tyr His Val Ala Cys Lys
35 40 45
Ala Asn Gly Asp Gly Cys Met Asp Glu Thr Lys Val Asp Gly Val Lys
50 55 60
Gly Ile Asn Gln His Glu Phe Glu Phe His Asn Ala His Cys Phe Lys
65 70 75 80
Ile Asp Ile Pro Gln Ile Gly Ala Ala Gln Ala Ile Tyr Tyr Asp Val
85 90 95
Ala Ala Leu Asn Ala Cys Cys Lys Tyr Ile Lys Glu His Lys Ile Lys
100 105 110
His Pro Ile Val Tyr Ile Met Ala Cys Arg Ile Gly Pro Phe Ala Gly
115 120 125
His Phe Tyr Gln Glu Ile His Lys Leu Gly Gly Thr Val Tyr Leu Asn
130 135 140
Pro Asp Gly His Glu Trp Met Arg Ala Lys Trp Ser Ala Pro Ile Arg
145 150 155 160
Lys Tyr Trp Lys Ile Ser Glu Arg Met Met Val Lys Tyr Cys Asp Leu
165 170 175
Ala Ile Cys Asp Ser Val Asn Ile Glu Lys Tyr Ile His Glu Cys Tyr
180 185 190
Asp Gly Lys Gly Ile Lys Gly Arg Asn Pro Lys Thr Thr Phe Ile Ala
195 200 205
Tyr Gly Ala Asp Leu Thr Leu Ser Lys Leu Ala Asp Asp Asp Glu Lys
210 215 220
Leu Val Asn Trp Tyr Lys Glu Lys Gly Leu Ala Lys Lys Gly Tyr Tyr
225 230 235 240
Leu Val Val Gly Arg Phe Val Pro Glu Asn Ser Phe Glu Val Met Ile
245 250 255
Arg Glu Phe Met Lys Ser Gly Ser Lys Lys Asp Phe Ala Leu Ile Thr
260 265 270
Asn Val Asn Asp Lys Phe Leu Asn Glu Leu Glu Glu Lys Leu His Phe
275 280 285
Lys Ser Asp Lys Arg Ile Lys Phe Val Gly Thr Val Tyr Asp Gln Glu
290 295 300
Leu Leu Lys Lys Ile Arg Glu Asn Ala Tyr Ala Tyr Phe His Gly His
305 310 315 320
Thr Val Gly Gly Thr Asn Pro Ser Leu Ile Glu Ala Leu Gly Ser Thr
325 330 335
Asp Leu Asn Leu Leu Val Asp Val Gly Phe Asn Lys Glu Val Ala Glu
340 345 350
Asp Cys Ala Leu Tyr Trp Ser Arg Glu Pro Gly Ser Leu Ala Arg Leu
355 360 365
Ile Asp Arg Ala Asp Lys Met Ser Thr Glu Glu Ile Ala Glu Met Gly
370 375 380
Arg Lys Ala Lys Lys Arg Val Ala Glu Glu Tyr Thr Trp Asp Lys Ile
385 390 395 400
Cys Gly Gln Tyr Glu Glu Val Phe Thr Lys
405 410
<210> 211
<211> 209
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_DUF1919
<400> 211
Met Arg Asp Gln Thr Val Lys Ile Ser Lys Tyr Tyr Arg Thr Phe Leu
1 5 10 15
Arg Arg Lys Leu Asn Ala Glu Asn Arg Lys Arg Leu Lys Asn Lys Asn
20 25 30
Phe Thr Val Leu Cys Asn Asn Cys Val Gly Gly Val Ile Leu His Glu
35 40 45
Leu Gly Glu Arg Phe Asn Ser Pro Thr Val Asn Leu Phe Phe Lys Ala
50 55 60
Glu Asp Tyr Leu Lys Phe Leu Glu Asn Leu Asp Tyr Tyr Leu Lys Gln
65 70 75 80
Ala Leu Val Glu Val Gly Ser Glu Lys Asn Tyr Pro Val Ala Lys Leu
85 90 95
Asp Asp Ile Thr Ile Tyr Phe Met His Tyr Ser Ser Phe Asp Glu Ala
100 105 110
Lys Ile Thr Trp Gln Lys Arg Val Ala Arg Ile Asn Lys Asn Asn Leu
115 120 125
Tyr Val Ile Phe Val Gln Gln Ser Gly Cys Thr Glu Gln Val Leu Glu
130 135 140
Ala Phe Asp Lys Leu Pro Tyr Lys His Lys Leu Ala Leu Thr Ala Lys
145 150 155 160
Pro Met Pro Glu Ile Lys Cys Ser Tyr Cys Ile His Gly Thr Ala Gln
165 170 175
Pro Asn Gly Glu Val Met Asp Leu Cys Lys Tyr Glu Gly Lys Phe Thr
180 185 190
Gly Lys Arg Trp Ile Asp Glu Tyr Asp Tyr Val Gly Phe Leu Asn Lys
195 200 205
Lys
<210> 212
<211> 371
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_UDP-galactopyranose_mutase
<400> 212
Met Tyr Asp Tyr Leu Val Val Gly Ser Gly Leu Tyr Gly Ala Ile Phe
1 5 10 15
Ala His Glu Ala Lys Ala His Gly Lys Ser Val Leu Val Val Asp Lys
20 25 30
Arg Pro Asn Ile Gly Gly Asn Val Tyr Thr Glu Asn Ile Glu Gly Ile
35 40 45
Asn Val His Lys Tyr Gly Ala His Ile Phe His Thr Asn Asn Lys Lys
50 55 60
Val Trp Asn Tyr Ile Thr Gln Phe Ala Glu Phe Asn Arg Phe Thr Asn
65 70 75 80
Ser Pro Val Ala Asn Tyr Lys Gly Glu Leu Tyr Ser Leu Pro Phe Asn
85 90 95
Met Tyr Thr Phe Asn Lys Met Trp Gly Val Val Thr Pro Glu Glu Ala
100 105 110
Ala Ala Lys Ile Glu Glu Gln Arg Lys Glu Ile Thr Gly Glu Pro Lys
115 120 125
Asn Leu Glu Glu His Ala Ile Ser Leu Val Gly Arg Asp Ile Tyr Glu
130 135 140
Lys Leu Ile Lys Gly Tyr Thr Glu Lys Gln Trp Gly Arg Asp Cys Lys
145 150 155 160
Asp Leu Pro Ala Phe Ile Ile Lys Arg Leu Pro Val Arg Leu Thr Phe
165 170 175
Asp Asn Asn Tyr Phe Asn Ala Leu Tyr Gln Gly Ile Pro Ile Gly Gly
180 185 190
Tyr Thr Lys Met Ile Ala Asn Leu Leu Asp Gly Ile Glu Val Arg Leu
195 200 205
Asn Ile Asp Tyr Leu Glu Asn Lys Val Glu Leu Asp Ala Leu Ala Gly
210 215 220
Lys Val Val Tyr Thr Gly Pro Ile Asp Ala Tyr Phe Asp Tyr Lys Leu
225 230 235 240
Gly Thr Leu Glu Tyr Arg Ser Val Arg Phe Glu Asn Glu Leu Leu Asp
245 250 255
Lys Pro Ser Ser Gln Gly Asn Ala Ala Val Asn Tyr Thr Asp Arg Glu
260 265 270
Thr Pro Trp Thr Arg Ile Ile Glu His Lys Trp Phe Glu Phe Gly Arg
275 280 285
Asp Glu Asn Gly Asn Asp Leu Pro Lys Thr Ile Ile Ser Arg Glu Tyr
290 295 300
Ser Ser Glu Trp Lys Pro Gly Asp Glu Pro Tyr Tyr Pro Val Asn Asp
305 310 315 320
Ala Lys Asn Ser Leu Leu Tyr Ser Glu Tyr Lys Lys Leu Ala Asp Ala
325 330 335
Glu Ser Lys Val Ile Phe Gly Gly Arg Leu Gly Glu Tyr Lys Tyr Tyr
340 345 350
Asp Met Asp Gln Ile Ile Ala Ala Val Leu Glu Arg Cys Glu Arg Glu
355 360 365
Phe Asp Val
370
<210> 213
<211> 252
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_DUF4422
<400> 213
Met Asn Gly Lys Ile Ile Val Val Thr His Lys Glu Tyr Lys Met Pro
1 5 10 15
Cys Asp Thr Val Tyr Leu Pro Val Cys Val Gly Val Gly Arg Asp Ala
20 25 30
Leu Arg Asn Lys Tyr Gln Ala Asp Asp Glu Gly Glu Asn Ile Ser Asp
35 40 45
Lys Asn Ile Leu Tyr Cys Glu Leu Thr Ala Leu Tyr Trp Ala Trp Lys
50 55 60
Asn Leu Asn Cys Asp Tyr Ile Gly Leu Ala His Tyr Arg Arg Tyr Leu
65 70 75 80
Thr Glu Ser Lys Arg Ser Lys Asn Ile Glu Asp Ala Leu Ser Gln His
85 90 95
Arg Ile Glu Glu Leu Leu Met Asp Tyr Asp Ile Ile Val Pro Arg Glu
100 105 110
Lys Arg Tyr Ser Gln Thr Ile Ala Asp His Tyr Ile Asn Cys Ile Lys
115 120 125
Ser Arg Lys Asp Ala His Lys Ile His Leu Gln Leu Leu Arg Asp Ser
130 135 140
Ile Leu Glu Val Ala Pro Glu Tyr Ile Ala Glu Tyr Asp Lys Thr Met
145 150 155 160
Asn Gly His Ser Ala His Met Leu Asn Met Phe Val Met Lys Lys Gln
165 170 175
Asn Leu Asp Asn Tyr Cys Glu Trp Leu Phe Lys Ile Leu Phe Val Leu
180 185 190
Glu Lys Lys Ile Tyr Asp His Asp Val Tyr Tyr Asp Arg Ile Met Gly
195 200 205
Ala Phe Ser Glu Phe Leu Leu Asp Val Trp Ile Arg Thr Asn Lys Lys
210 215 220
Thr Tyr Ile Glu Val Glu Leu Ile Glu Thr Glu Arg Asp Tyr Trp Gly
225 230 235 240
Lys Ile Lys Trp Ala Leu Lys Arg Lys Leu Phe Glu
245 250
<210> 214
<211> 449
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_GT3
<400> 214
Met Arg Ile Leu His Tyr Ser Leu Gly Phe Pro Pro Tyr Arg Arg Gly
1 5 10 15
Gly Leu Thr Lys Tyr Cys Leu Asp Leu Met Val Ala Gln Glu Met Gln
20 25 30
Gly Asn Val Val Ala Met Cys Trp Pro Gly Glu Ile Gly Ile Ile Lys
35 40 45
Lys Lys Lys Val Ala Ile Lys Lys Arg Lys Lys Tyr Ser Ile Gly Lys
50 55 60
Ser Lys Ile Glu Asn Tyr Glu Ile Gln Gly Ile Leu Pro Val Pro Leu
65 70 75 80
Leu Glu Gly Ile Lys Asn Pro Asp Leu Phe Thr Glu Lys Lys Asn Gln
85 90 95
Glu Ile Trp Lys Leu Phe Leu Lys Asn Trp Arg Pro Asp Val Ile His
100 105 110
Phe His Thr Leu Met Gly Leu Pro Leu Glu Tyr Val Glu Thr Ala Arg
115 120 125
Lys Leu Gly Ile Lys Thr Leu Phe Thr Thr His Asp Tyr Phe Gly Leu
130 135 140
Cys Pro Arg Thr Thr Leu Val Arg Gln Asn Gly Glu Ile Cys Asp Gly
145 150 155 160
Cys Thr Pro Glu Leu Cys Ala Glu Cys Cys Glu Asn Ala Ile Ser Tyr
165 170 175
Arg Lys Leu Lys Ile Leu Gln Ser Ser Val Tyr Arg Val Leu Lys Asp
180 185 190
Leu Val Ile Val Lys Lys Leu Arg Lys Lys His Trp Asn Glu Ser Lys
195 200 205
Asn Asp Ser Ala Gln His Gln Ala Ser Val Gln Asn Ala Gln Arg Ala
210 215 220
Glu Glu Tyr Val Glu Leu Arg Lys Tyr Tyr Ile Lys Leu Leu Lys Ser
225 230 235 240
Phe Asn Ile Ile His Phe Asn Ser Ser Asn Thr Arg Asp Val Tyr Leu
245 250 255
Lys Ala Ala Lys Glu Val Leu Asn Asn Glu Val Val Ser Ile Ser His
260 265 270
Glu Met Ile Lys Asp Asn Lys Lys Lys Lys Arg Lys His Glu Ile Leu
275 280 285
His Leu Ser Tyr Leu Gly Pro Asp Thr Tyr Asn Lys Gly Tyr Tyr Val
290 295 300
Leu Lys Glu Thr Leu Asn Gln Leu His Lys Glu Gly Tyr Lys Phe Gln
305 310 315 320
Leu Asn Ile Tyr Phe Glu Asp Ala Ser Glu Pro Phe Ile Val Ser His
325 330 335
Ala Pro Tyr Gln Tyr Ser Glu Leu Gly Lys Val Met Asp Asp Ala Asp
340 345 350
Cys Val Ile Leu Pro Ser Leu Gly Asn Glu Thr Phe Gly Phe Thr Val
355 360 365
Leu Glu Ala Leu Ser Tyr Gly Val Pro Val Ile Val Ser Ser Arg Val
370 375 380
Gly Ala Lys Asp Ile Val Glu Glu Gly Lys Asn Gly Phe Val Val Glu
385 390 395 400
Gly Asp Val Asp Ser Leu Lys Thr Lys Leu Thr Ser Val Leu Asn Gln
405 410 415
Pro Glu Ile Leu Glu Asp Met Asn Asn Tyr Ile Val Ala Asn Thr His
420 425 430
Ile Lys Thr Met Thr Glu His Ser Lys Glu Ile Lys Asp Leu Tyr Gln
435 440 445
Lys
<210> 215
<211> 308
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_GT4
<400> 215
Val Gln Asp Lys Val Ser Ile Ile Val Pro Val Tyr Lys Val Glu Arg
1 5 10 15
Glu Leu Asp Arg Cys Val Gln Ser Leu Ile Lys Gln Thr Tyr Lys Asn
20 25 30
Leu Glu Ile Ile Leu Val Asp Asp Gly Ser Pro Asp Gln Cys Pro Glu
35 40 45
Leu Cys Glu Asn Tyr Ala Glu Ile Asp Lys Arg Val Lys Val Ile His
50 55 60
Lys Glu Asn Gly Gly Leu Ser Asp Ala Arg Asn Ala Gly Leu Lys Gln
65 70 75 80
Ala Thr Gly Lys Tyr Ile Leu Tyr Val Asp Ser Asp Asp Tyr Ile Asp
85 90 95
Leu Asp Ala Cys Glu Arg Phe Ile Lys Ala Ala Gly Asn Gln Lys Ile
100 105 110
Asp Ile Val Val Gly Asn Ala Ile Met Glu Lys Pro Asp Gly Lys Glu
115 120 125
Met Met Ile His Ser Ala Thr Pro Ser Gly Ile Thr Tyr Thr Ala Lys
130 135 140
Gln Phe Ile Met Ser Ala Val Lys Ala Tyr Gln Trp Tyr Ala Pro Ala
145 150 155 160
Trp Leu Asn Met Tyr Arg Arg Asp Phe Leu Leu Asp Asn Gln Leu Tyr
165 170 175
Phe Lys Lys Gly Ile Tyr Phe Glu Asp Val Gln Met Leu Pro Arg Val
180 185 190
Phe Leu Ala Ala Lys Lys Ile Thr Cys Ile Tyr Gly Thr Phe Tyr His
195 200 205
Tyr Ile Ile Arg Glu Asn Ser Ile Met Thr Ser Gln Lys Asp Glu Lys
210 215 220
Lys Lys Asn Asp Ser Ile Gln Asn Met Lys Glu Trp Lys Glu Gln Phe
225 230 235 240
Asp Leu Val Asp Asp Val Ala Leu Lys Lys Cys Leu Tyr Gly Met Leu
245 250 255
Val Lys Met Tyr Ile His Glu Cys Arg Gln Tyr Gly Ile Thr Thr Lys
260 265 270
Ala Ile Glu Gly Met Asp Asp Arg Phe Ile Leu Gly Asn Cys Leu Asn
275 280 285
Tyr Lys Glu Arg Leu Lys Ala Thr Met Trp Leu Cys Phe Pro Arg Leu
290 295 300
Leu Ile Lys Gln
305
<210> 216
<211> 367
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_wzy
<400> 216
Val His Met Ser Val Tyr Ile Phe Leu Trp Val Ala Val Val Val Phe
1 5 10 15
Gly Phe Ile Ala Ser Arg Ser Asn Tyr Lys Ala Lys Tyr Phe Val Leu
20 25 30
Phe Ser Phe Phe Leu Met Thr Ile Val Leu Gly Leu Arg Gly Ala Thr
35 40 45
Val Gly Glu Asp Thr Lys Met Tyr Leu Asn Ile Ala Glu Arg Val Thr
50 55 60
Asn Ile Ser Trp Lys Glu Val Phe Ser Ser Phe Pro Thr Ser Gln Trp
65 70 75 80
Arg Tyr Ile Ser Tyr Gly Gly Leu Ser Gly Phe Ser Glu Gln Thr Glu
85 90 95
Thr Val Tyr Leu Ala Tyr Cys Lys Leu Ile Met Leu Ile Phe His Asn
100 105 110
Ala Gln Ala Val Leu Leu Ile Thr Ala Ala Ile Thr Asn Ala Leu Phe
115 120 125
Ala Lys Phe Ile Leu Asp Asn Ile Thr Val Lys Gln Asp Ala Ile Leu
130 135 140
Ala Val Tyr Ile Tyr Met Cys Asp Ala Met Phe Met Asn Ser Phe Asn
145 150 155 160
Thr Met Arg Gln Ile Leu Ala Ile Ser Ile Ala Val Gln Ser Ile Glu
165 170 175
Leu Ile Lys Lys Glu Lys Tyr Lys Lys Ala Ile Ala Cys Val Leu Leu
180 185 190
Ala Ala Cys Phe His Gln Ser Ala Ile Val Phe Phe Val Ala Asp Leu
195 200 205
Phe Tyr Leu Leu Lys Lys Lys Lys Glu Arg Tyr Ile Tyr Leu Leu Val
210 215 220
Thr Leu Cys Ala Leu Pro Val Leu Ile Pro Val Ala Ile Lys Val Val
225 230 235 240
Ser Ile Phe Ser Ser Lys Tyr Ala Ser Tyr Leu Ser Val Ser Phe Trp
245 250 255
Gly Ala Gln Leu Arg Gly Thr Leu Leu Leu Trp Ile Ile Ile Ala Ile
260 265 270
Val Leu Phe Ile Met Ile Arg Ala Asn Gln Ser Asp Asn Ile Asp Trp
275 280 285
Trp Leu Ile Tyr Met Ala Thr Ile Tyr Ile Gly Val Glu Leu Val Gly
290 295 300
Met Gln Leu Thr Val Ile Ser Arg Val Ala Met Tyr Phe Arg Ile Phe
305 310 315 320
Leu Val Leu Leu Phe Pro Ile Ala Gln Lys Tyr Phe Thr Lys Lys Ser
325 330 335
Gly Gln Phe Tyr Lys Ile Gly Val Val Met Leu Met Thr Val Ser Phe
340 345 350
Phe Ser Tyr Ala Ser Ser Pro Asp Arg Leu Tyr Thr Phe Cys Phe
355 360 365
<210> 217
<211> 326
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_GT5
<400> 217
Met Asp Leu Lys Asp Leu Ile Ser Val Ile Val Pro Ile Tyr Gly Val
1 5 10 15
Glu Glu Tyr Leu Asn Lys Cys Ile Asp Ser Ile Ile Asn Gln Thr Tyr
20 25 30
Lys Asn Leu Glu Ile Ile Leu Val Asp Asp Gly Ser Pro Asp Lys Cys
35 40 45
Pro Asp Ile Cys Asp Thr Phe Glu Lys Lys Asp Glu Arg Ile Lys Val
50 55 60
Ile His Lys Lys Asn Gly Gly Leu Ser Asp Ala Arg Asn Ala Gly Ile
65 70 75 80
Asp Thr Ala His Gly Asp Tyr Phe Val Phe Val Asp Ser Asp Asp Trp
85 90 95
Ile Glu Asn Thr Met Val Glu His Leu Leu Phe Ala Cys Lys Lys Tyr
100 105 110
Asn Val Glu Met Ala Thr Cys Ala Arg Tyr Ile Thr Asp Gly His Ser
115 120 125
Thr Arg Ala Val Ala Phe Asn Gly Pro Ala Gly Ala Tyr Ser Ala Glu
130 135 140
Glu Ala Leu Asn Glu Ile Leu Leu Gly Lys Ser Met Asp Val Ala Ala
145 150 155 160
Trp Asp Lys Ile Tyr Ala Arg Asn Leu Phe Glu Glu Ile Arg Phe Pro
165 170 175
Val Gly Glu Asn Asn Glu Asp Ile Ala Val Phe Tyr Lys Leu Val Asp
180 185 190
Leu Ala Gly Arg Val Ala His Thr Gly Thr Thr Glu Tyr Phe Tyr Arg
195 200 205
Ser Arg Pro Gly Ser Ile Thr Lys Leu Lys Tyr Ser Thr Asp Ala Arg
210 215 220
Lys Ile Ile Glu Lys Asn Leu Asn Ser Ile Glu Lys Phe Leu Asp Lys
225 230 235 240
Lys Tyr Pro Ser Cys Leu Pro Ser Phe Tyr Arg Tyr Lys Thr Met Asn
245 250 255
Ile Tyr Ala Leu Leu Asn Lys Tyr Ile Lys Cys Glu Gly Thr Lys Lys
260 265 270
Thr Gln Glu Phe Glu His Leu Met Asn Glu Phe Arg Lys Asn Lys Ser
275 280 285
Tyr Phe Phe Asn Asp Asp Gln Thr Pro Ser Lys Glu Lys Lys Ile Ala
290 295 300
Ile Met Ile Leu Leu His Leu Tyr Asn Pro Tyr Leu Leu Val Lys Glu
305 310 315 320
Lys Ile Thr Gly Tyr Lys
325
<210> 218
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_lytR
<400> 218
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Val
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Val Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 219
<211> 232
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_epsL
<400> 219
Val Gly Ser Ile Ala Leu Val Asp Asp Val Gly Ile Pro Glu Trp Ile
1 5 10 15
Lys Val Pro Ser Lys Ala Asn Leu Asp Lys Phe Thr Asp Leu Ser Thr
20 25 30
Asn Asn Ile Thr Ile Tyr Arg Ile Asn Asn Pro Lys Val Leu Lys Thr
35 40 45
Val Thr Asn Arg Thr Asp Gln Arg Met Lys Met Ser Glu Val Ile Ala
50 55 60
Lys Tyr Pro Asn Ala Leu Ile Met Asn Ala Ser Ala Phe Asp Met Gln
65 70 75 80
Thr Gly Gln Val Ala Gly Phe Gln Ile Asn Asn Gly Lys Leu Ile Gln
85 90 95
Asp Trp Ser Pro Gly Thr Thr Thr Gln Tyr Ala Phe Val Ile Asn Lys
100 105 110
Asp Gly Ser Cys Lys Ile Tyr Asp Ser Ser Thr Ser Ala Ser Thr Ile
115 120 125
Ile Lys Asn Gly Gly Gln Gln Ala Tyr Asp Phe Gly Thr Ala Ile Ile
130 135 140
Arg Asp Gly Lys Ile Gln Pro Ser Asp Gly Ser Val Asp Trp Lys Ile
145 150 155 160
His Ile Phe Ile Ala Asn Asp Lys Asp Asn Asn Leu Tyr Ala Ile Leu
165 170 175
Ser Asp Thr Asn Ala Gly Tyr Asp Asn Ile Ile Lys Ser Val Ser Asn
180 185 190
Leu Lys Leu Gln Asn Met Leu Leu Leu Asp Ser Gly Gly Ser Ser Gln
195 200 205
Leu Ser Val Asn Gly Lys Thr Ile Val Ala Ser Gln Asp Asp Arg Ala
210 215 220
Val Pro Asp Tyr Ile Val Met Lys
225 230
<210> 220
<211> 471
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33135_wzx
<400> 220
Met Gln Ile Ala Lys Asn Tyr Leu Tyr Asn Ala Ile Tyr Gln Val Phe
1 5 10 15
Ile Ile Ile Val Pro Leu Leu Thr Ile Pro Tyr Leu Ser Arg Ile Leu
20 25 30
Gly Pro Ser Gly Ile Gly Ile Asn Ser Tyr Thr Asn Ser Ile Val Gln
35 40 45
Tyr Phe Val Leu Phe Gly Ser Ile Gly Val Gly Leu Tyr Gly Asn Arg
50 55 60
Gln Ile Ala Phe Val Arg Asp Asn Gln Leu Lys Met Ser Lys Val Phe
65 70 75 80
Tyr Glu Ile Phe Ile Leu Arg Leu Phe Thr Ile Cys Leu Ala Tyr Phe
85 90 95
Leu Phe Val Ala Phe Leu Thr Ile Asn Gly Gln Tyr His Ala Tyr Tyr
100 105 110
Leu Ser Gln Ser Ile Ala Ile Val Ala Ala Ala Phe Asp Ile Thr Trp
115 120 125
Phe Phe Met Gly Ile Glu Asn Phe Lys Val Thr Val Leu Arg Asn Phe
130 135 140
Ile Val Lys Leu Leu Ala Leu Phe Ser Val Phe Leu Phe Val Lys Ser
145 150 155 160
Tyr Asn Asp Leu Asn Ile Tyr Ile Leu Ile Thr Val Leu Ser Thr Leu
165 170 175
Ile Gly Asn Leu Thr Phe Phe Pro Ser Leu His Arg Tyr Leu Val Lys
180 185 190
Val Asn Tyr Arg Glu Leu Arg Pro Ile Lys His Leu Lys Gln Ser Leu
195 200 205
Val Met Phe Ile Pro Gln Ile Ala Val Gln Ile Tyr Trp Val Leu Asn
210 215 220
Lys Thr Met Leu Gly Ser Leu Asp Ser Val Thr Ser Ser Gly Phe Phe
225 230 235 240
Asp Gln Ser Asp Lys Ile Val Lys Leu Val Leu Ala Ile Ala Thr Ala
245 250 255
Thr Gly Thr Val Met Leu Pro Arg Val Ala Asn Ala Phe Ala His Arg
260 265 270
Glu Tyr Ser Lys Ile Lys Glu Tyr Met Tyr Ala Gly Phe Ser Phe Val
275 280 285
Ser Ala Ile Ser Ile Pro Met Met Phe Gly Leu Ile Ala Ile Thr Pro
290 295 300
Lys Phe Val Pro Leu Phe Phe Thr Ser Gln Phe Ser Asp Val Ile Pro
305 310 315 320
Val Leu Met Ile Glu Ser Ile Ala Ile Ile Phe Ile Ala Trp Ser Asn
325 330 335
Ala Ile Gly Ala Gln Tyr Leu Leu Pro Thr Asn Gln Asn Lys Ser Tyr
340 345 350
Thr Val Ser Val Ile Ile Gly Ala Ile Val Asn Leu Met Leu Asn Ile
355 360 365
Pro Leu Ile Ile Tyr Leu Gly Thr Val Gly Ala Ser Ile Ala Thr Val
370 375 380
Ile Ser Glu Met Ser Val Thr Val Tyr Gln Leu Phe Ile Ile His Lys
385 390 395 400
Gln Leu Asn Leu His Thr Leu Phe Ser Asp Leu Ser Lys Tyr Leu Ile
405 410 415
Ala Gly Leu Val Met Phe Leu Ile Val Phe Lys Ile Ser Leu Leu Thr
420 425 430
Pro Thr Ser Trp Ile Phe Ile Leu Leu Glu Ile Thr Val Gly Ile Ile
435 440 445
Ile Tyr Val Val Leu Leu Ile Phe Leu Lys Ala Glu Ile Ile Asn Lys
450 455 460
Leu Lys Phe Ile Met His Lys
465 470
<210> 221
<211> 27175
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33139 eps gene cluster, complete sequence
<400> 221
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat ttataattgt aggtttttat 420
agttataatt ctaggataaa taatctttca aaagctgata aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa agtattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatccaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcact gactagtaat 840
gagcaactta taacaaattt ggctagtgca ggagcggagg taatagttca accctctcca 900
ccgatctatg gtggtgttgt gtaccccgta caagaagaac aatttaaaca atctttatct 960
acaaagtatc cctatataga ctactgggct agttacccag acaaaaattc tgatgaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgcttcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaagaaat gcaggaaaca caggaacaaa cgattgattt aagagggatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattt ggataattca gcagcctacg ctggacaagt gaccgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc tgatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acaaattcac aagtcattac gcttactgtt aaatattcta atccttacgt tgctcaaaag 1560
attgcagacg agactgctaa aatttttagt tcagaagcag caaaactatt gaatgttact 1620
aacgttaata ttctatccaa agcaaaagct caaacaacac ccattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ttagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct gggactcacg 1800
gttcttggtg taacaaccta tgctcaaatg agtgatttta ataataatac gaataaaaat 1860
ggcacgcaat cgggaactaa gtcaagtccg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ccgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caagggatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcaaacgag agtgctaatc tagctgttgc ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacctaatc catcggaatt attagcatct agtgcaatga agaatttgat tgactctgtg 2400
tccgattcct ttgatattgt tttgattgat actccacctc tctatgcagt tactgatgct 2460
caaattttga gtgtatatgt aggaggagtg gttcttgttg tacgtgccta tgaaacaaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtagag 2640
taattggaat aaattttaat caaataaaag acagaaattt gtagaagagg ggagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caactatcac tgcgactcct 2820
catcataatc ctcaatttaa taatgaatca ccgcttattt tgaaaaaagt taaggaagtt 2880
caaaatatca ttgacgaaca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaaggaaagt tactgacagc agcgggcact 3000
tcaagttata tattgattga atttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggacttcaa cctattttgg ttcaccctga acgtaatagt 3120
ggaatcattg agaacccaga tatattattt gattttgttg aacaaggagt actaagtcag 3180
ataacagctt cgagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt tacgtcacgt 3300
gcatttaaga tgaaggaagc ttttgaaatg attgaagata gttatggttc tgatgtatca 3360
cgaatgtttc aaaataatgc agagtcagtg attttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa gaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatggaat tttttgagga tgcctcatca cctgaatcgg aagagcctaa gttagtagaa 3540
ttaaaaaatt tttcttatag agagctaatt ataaaaagag caattgatat cctaggagga 3600
ttagcaggtt cagttttatt tcttatcgcg gctgcattgc tttatgtccc ttacaaaatg 3660
agctcaaaaa aagatcaagg gccaatgttc tataaacaaa aacgctatgg taaaaatggt 3720
aaaatttttt atattttgaa atttagaaca atgattctta atgccgagca gtatctagaa 3780
cttaatccag atgttaaagc tgcttaccat gccaacggca ataagctaga aaacgatcca 3840
cgggtaacga agattggctc atttataaga cgacactcaa ttgatgaact gccacaattt 3900
atcaatgttc ttaaagggga tatggcatta gttggtccaa gaccaatttt gctttttgaa 3960
gcgaaagaat atgggaaacg cctcgcttac ttactcatgt gcaaaccagg aatcactggt 4020
tattggacga cacatggtcg aagtaaagtt ctttttcctc aacgagcaga tttagaactc 4080
tattatctcc agtaccatag caccaaaaat gatatcaagc ttctagtact cacaattgca 4140
caaagtattc acggatcgga cgcttactaa aaaatgaagg aaaaacatat ttacattatt 4200
ggttcaaaat ggaattccag caaagtatgg tggttttgag acttttgtag aaaaactaac 4260
ggcacatcaa agtaataaaa accttaagta tcatgttgct tgtttatcga atggtataca 4320
agaaaatttt aatcataatg atgcagactg ttttaatatt tcaaagaaaa atattggacc 4380
agcaaacgcc atttattatg atttggcagc tttaaaacac tcacttaaag aaattgaaga 4440
aaaaaattat atgggtgcaa ttatttatat tttacttgcc gcattggtcc gtttattggt 4500
cactataaaa agcaaatgaa aaaattagga attactttga tggtaaatcc tgatggggag 4560
tgtgaaataa tatggacaac cagaaaaaag cctgaattca tgcgggttta cgcgacttga 4620
ccttttcacc tcaacttgtt tctgcttgtc atggtgcgct agggctagcg tgtttttagg 4680
ggttaatggc ttctggttgc gctctatact taacccacca cagcgaaagc tgatagagca 4740
gaaaacagaa tgaggaggta cgaggacaag cctgcgtata gattgttgct gcaatctgta 4800
tacaggcttg ttgttgtgat atgaagtagg aaaggtgaat agtgaaagac gagtatgaca 4860
atttaaggaa aagaactgat gggattgtgt ataaacctag gaaaagaata gaaactatgg 4920
aatgacggtt ggagagaatt aatggaagaa aaaatcagac ttgcggtttt cggacagaaa 4980
cgtctttctc gtgaaggcgg aatagagatt gttgtaaaag agttatgcac ccgaatggca 5040
cagaagggtt gtgatgtaac ttgctacaat agagcaggcc atcatgtgag tggtgcagag 5100
tatgacaaaa caattgaata tgatggtatc cgtcaaaagg ttgttccgac tattgagaag 5160
aagggacttg cggcggtaag ctcctccttt ttcgcagcac tttgtagtgc atttggaaga 5220
tacgatgtgg tgcatatcca tgcggaaggt cctgcctttt tctgttggat accaaaactt 5280
tttggcaagc gtgtaatcag cactattcac ggtttggact gggcccgcga aaaatggaaa 5340
tttggcgttg gatctaaatt tatccggcag ggtgaaaaaa atgccgtgaa atatgcagat 5400
gaaatcattg ttctaagcaa aggcgttcag aaatatttca tggagaccta cggaagggag 5460
acacatttta tccctaatag tgtcaatcgg ccagaggttc gggaggcaaa gctgatcacg 5520
gatcattttg gactggaaaa ggattcctac atactgttcc tcggtcgtct ggtgccggag 5580
aaggggattc gatatctggt tgaggcattc aagaatgtca agacagataa aaaactggtc 5640
atcgcaggtg gctctagtga tacggattcc tttatggagg aattgaaaga actggcgaag 5700
ggtgacgatc ggattctctt tactgggttt gtgcagggag caatgctgga tgaactgtac 5760
agcaacgctt acatctacac gctgccgtcc gatctggaag gaatgccatt aagtctgctg 5820
gaggcgatga gctacggaaa ttgctgtctg gtatccgata ttccagaatg tgcagaggtt 5880
gtggaagata aggcattgat tttcaaaaag tcagatgtag aggacttgcg agaaaaattg 5940
caagatgcct gtgaccatcc agaaatggtt ataagaatga agaatcaggc agctgacttt 6000
atctgcgaga aatacaactg ggataaagtt gtaaaggaaa cgatgaaact gtacaggaga 6060
aaataataga tgagagtatt gataataaat aatctccttt aaccgaacgc aggtagtgag 6120
atgtacatag tgaaagcgcc gtaaaaatag cgtatttacg gcacttcttg agagacgggt 6180
agagctgcga gagcagtttt tatctcgcta cctaggttca gtaacagcag atgattttat 6240
cataagaact ttcagtatca ctacctgcgt ttgtatggag cactctggtt tatgaataac 6300
acattggaag tgcttcatct gaacagaggt agtgacactt gatatatacg cttcaaaaat 6360
aaggagacga atgcaatgag tatagtaact ttagataaaa aaacgatttt agtcacaggt 6420
gcagccggtt ttataggctc caaccttgtg aaacgaatct atcaggaggc tccttctgct 6480
acggtcatcg gcatcgacaa tatgaatgcc tactatgatg tggcactgaa agaattccgc 6540
ctgaacgagc tggccaagta tcccacattc acttttgtaa aaggcaacat cgctgataag 6600
gcactgatca ccgagctgtt cgagaagtac aagccgtctg tggtcgtcaa ccttgcagca 6660
caggctggtg tgcgctactc catcaccaac ccagatgctt atgtggaatc taacttggtc 6720
ggcttcttta atatcctcga agcctgccgt cattgtgaga gtctggagca tttggtttat 6780
gcttcttcct cctccgtcta tggttctaat aaaaaggttc catacagcac ggatgacaag 6840
gttgacaatc cggtttccct ttatgcagca accaagaaat ctaatgagtt gatggcacac 6900
gcatactcca agctctacaa cattccttcc actggcctga gattctttac ggtgtatggc 6960
cctgcaggtc gcccggacat ggcttacttc ggattcacca acaagctggt gaagggcgaa 7020
accatcaaaa tcttcaacta tggcaactgc aagcgtgatt ttacttatgt ggatgacatc 7080
gttgagggcg ttgttcgtgt gatgaagaaa gcaccagaca agaagaatgg tgaagatggt 7140
cttccgattc cgccgtatgc agtttacaac atcggcaatc agaatccgga gaacctgctg 7200
gactttgtgc agattctgag cgaggagctt gttcgtgcaa aggtgctgcc ggaagattac 7260
gatttcgagg ctcataaaga gctggtcccg atgcagccgg gtgatgtgcc tgtgacctat 7320
gcagatacga gtgcactgga gcgcgacttc gggtacaagc cgagcacaag tctgcggact 7380
ggattaagaa agttcgctga gtggtacgct gagttttata aataaaagaa ttagagggat 7440
agagaaatga gagagtttaa agatttaaag attgctgttg ccggaacggg atatgttggt 7500
ctttctattg ctacgctgtt atctcagcac cacaaggtga ctgctgtgga tatcattcct 7560
gagaaagttg aacttatcaa taataagaaa tctccgattc aggatgaata tattgaaaag 7620
tatctggcag agaaagagct ggatctgact gcgactctgg atgctaagga agcatacagt 7680
gatgctgatt ttgtagtgat cgcagctcct acaaattacg atagcaagaa gaactttttt 7740
gacacgagtg cggtagaagc cgtcattaaa ctggtcatcg agtacaaccc ggaagctatc 7800
atggttatca agagcactat tccggttggt tatacagcaa gcgttcgtga gaagttccac 7860
tgtgacaata ttatctttag cccggagttt ttgcgcgaga gcaaagctct gtatgataac 7920
ctttatcctt cccgtatcat tgtcggtacg gatgttgaca atgttcgact ggtaaaggcg 7980
gcacacactt ttgcagagct cctgcaggaa ggtgctatta aggaaaatat cgatactctg 8040
tttatgggct ttaccgaggc agaggcagtt aagctattcg ctaacactta tttggcactg 8100
cgtgtcagct acttcaatga actggacact tacgcagaga tgaagggact gaacactcag 8160
cagatcatta atggtgtttg cctcgaccct cgtatcggca ctcattataa caatcctagc 8220
tttggttacg gcggatactg cctgccgaaa gacaccaagc agctgctggc aaattatgca 8280
gatgtgccgg aaaacctgat tgaggctatc gttgaaagta atagaacaag aaaagacttc 8340
atcgctgacc gtgttctgga gattgcaggt gcttatgaag caaatgacag ctgggatgag 8400
agcaaagaaa aagaagttgt tgtcggcgtt taccgtctaa caatgaagag taacagcgat 8460
aacttccgcc agagttccat tcagggtgtt atgaagcgta ttaaggccaa gggtgcaaca 8520
gtcatcatct atgagccgac cctgaaggac ggcgatactt tctttggtag tcgagttgtt 8580
aataatttag agaagttcaa aaaacagagc caggcaatta ttgcgaaccg ttacgacaag 8640
agcttggatg atgtgaagga taaggtttat acacgcgata tttttcaacg ggactaatgg 8700
attaaaagga gattcattat gaaaggcgtt attttagccg gaggttcagg cacacgtctt 8760
tacccattaa cgaaagtaac aagtaaacag cttttgccaa tctacgacaa accgatgatt 8820
tattatccta tgtctgttct gatgaacgct ggcatccgcg atattttgat tatttccaca 8880
ccgcaggata cacctcgctt tgagaatctg ctgggtgatg gacaccagtt tggtgtgaat 8940
ctgacctatg cggttcagct gtctccggat ggactggcac aggcctttat cattggtgcc 9000
gactttatcg gtgccgattc tgtagctatg gtgctgggcg ataacatctt tgcgggccac 9060
ggattgaaga agagattaaa cgcagcagtg gaaaaggcag aaaacggcaa gggtgcaacg 9120
gtgtttggct actatgtgga cgatccagag cgttttggta tcgttgagtt cgataagaac 9180
ggtaaggcca tttctatcga ggagaagcca gaacatccga agagcaatta ctgtgtcacc 9240
ggtttgtact tctatgataa ccgtgtggtc gagtttgcta agaacctgaa gccgtccgct 9300
cgtggtgaat tagaaattac cgatttgaac cgtatttatc tggaagatgg tactctgaat 9360
gtagaattgc tgggtcaggg cttcacttgg ctggacactg gaacacacga gagccttgtt 9420
gatgctacca actttgtaaa gaccgtggaa cagcatcagc atcgtaagat tgcctgtctg 9480
gaggaaatcg catatctgaa tggctggatt agcaaggatg agctgatgga ggtctatgag 9540
gttatgaaga agaaccagta tggacagtat ctgaaggatg tcatggacgg caagtatcag 9600
gagcatttgt attaataaaa gaagaaccaa aagataagtg tataggagaa tagatcatga 9660
atattattgt taccggcggt gcgggtttta ttggtagtaa ctttgtgttc cacatgctga 9720
agaagtatcc gggttatcga atcatctgtt tggacaagct gacctatgca ggcaatctgt 9780
ccacactggc ccctgttatg gataacccga atttccgctt cgtgaaggct gatatctgtg 9840
accgcgaagc agtgaataaa ctgtttgaag aagaacatcc ggacatcatg gtcaactttg 9900
cggcagagtc tcatgttgac cgttctatcg aagatcccgg catcttcctt cagactaaca 9960
tcatcggtac cagtgtgctg atggatgctt gccgcaagta cggcatccag cgttaccatc 10020
aggtttctac tgatgaagtt tacggtgacc tgcctctgga tcgtcctgac ctgttcttca 10080
ccgaggagac tccgatccat accagctctc cgtatagcag ctccaaagct gctgctgacc 10140
tgctggttct ggcttaccac cgtacctacg gcctgcctgt gaccatttcc cgttgttcca 10200
acaactatgg accgtatcac ttccctgaga agctgattcc gctgatgatc gctaatgctc 10260
tggctgacaa gccactgcct gtttacggcg agggtctgaa cgtccgtgac tggctgtatg 10320
tggaagatca ctgcaaggcc attgatctga ttatccacaa gggtcgtgtt ggtgaagtct 10380
acaacgtcgg cggtcacaac gagaagcaga atattgagat cgtgaagatt atctgcaagg 10440
agctgggcaa gccggaaagc ttgatcactc atgttggtga tcgcaagggt cacgatatgc 10500
gttatgctat tgatccgacc aagatccaca atgagctggg ctggttgccg gagaccaagt 10560
ttgaggacgg cattaaaaag accatccagt ggtatctcga taatcgtgag tggtgggaga 10620
ccatcatcag cggtgagtat cagaactatt atgagaaaat gtacagcaac cgctaagaaa 10680
ccacaagaaa aggataagaa gttttttata acaggcgtta gcggctagtt tagtcatgat 10740
atgatgaatg atctgctgag acaatcgtga aggtgttgac tattgtatat aggaaaatta 10800
cagtagtgtg tttgattaat ttgcgataat gaaatatatt ctgctttcga tatggtagat 10860
tgcacttggc agttatctaa aaatgagcag tgaaggagat gtagaaaatg ggacagatta 10920
aagttgaaaa aaacgtaggc ggtatagagg gactttgtgt tattgaaccg gttgttcatg 10980
gtgactcccg tggttatttc gtggagactt acaacgagaa tgatatgaag gaagctggta 11040
taggtattca ctttgtgcag gacaatcaat ctatgtccac aaaaggggtg ttgcggggat 11100
tacactttca gaagcaatat ccgcagtgca aattggtgcg tgttgtgaac ggtacggtgt 11160
ttgatgtcgc agttgatttg agaagtaatt ccgaaactta tggtaaatgg tatggtgttg 11220
ttttgtccgc cgagaacaag aaacagttcc ttattccgga gggctttgca cacggcttct 11280
tagttctgag caatgaagca gaattctgtt ataaggtcaa tgatttttat catccgaatg 11340
atgagggcgg aatggcttgg aatgaccctg aggttggaat tgaatggcca caactgaagg 11400
gcgaatacaa gggcaatgca agtgcagaag gatatacgct ggaagacggt acagcgctga 11460
acctgagcga taaggaccag aagtggctcg gactgaagga tacttttaag ttctgaaagg 11520
agaaataaat atgaaaattt tagtaactgg tgcaaacggc tatctggggc agggtattgt 11580
aaaggagctt ctagataacg ggcataatgt tgtggctgcg gattttaaga ccacgtatgt 11640
tgatgaccga gcagaaaaga tagattgtga cttgttctcg gtagaagaac cctatacata 11700
ctttggtaaa ccggatgctc ttcttcattt ggcatggagg gatggatttg ttcattactc 11760
ggaaaaccat attgcggatt taccaaaaca ttatcatttc ttgaagcaaa tggttgaagc 11820
taatatcttc aagattagtg ttatgggaac gatgcacgaa attggtttct ttgaaggtag 11880
tattaatgaa aatactcctt gccatccgat gagtctctat ggcatcggca aagatgctct 11940
gcgcaactgt gtggcgatga tgactaatgg taaacacaca aaatggcaat ggttacgtgg 12000
ctattacatc gtcggacatt ctgagtttgg atgttctatt ttttcaaaaa ttaaggcagc 12060
agaaaaagag ggtaaaacag aatttccgtt taccatgggt caaaatcagt tcgattttat 12120
agattatgaa gatttctgta aacaggttgc tgcagctgtt ggtcaggatg aaattaatgg 12180
aattatcaat atctgttctg gtaaaccgga aaagttggct gatcgtgtag aaaggtttat 12240
taaggaaaat ggatacggta ttaaattgaa atatggagca tttcctgatc gtccatacga 12300
ttcaaaggcc gtttggggag ataataataa gataagaaaa attatgcaga ataattgatt 12360
taaagagtga atgaagacga attatattta atagtgcata agttgcaatt tgtaagtctg 12420
tttgttgaaa aatacataga agaccatgaa taactaggaa tagtttgtgc gatttggaaa 12480
cgtactaaac taggaaaata tatttattta gcctttgttt actaaaagaa tatgtattta 12540
ggttatacta ggacatattg tataagagga ggagagaatg aaaatactat ttcacataag 12600
ttcacttttt ggtggtggag ctgaaagagt tatgtcctat ttgattaatc acaattgtga 12660
actaaataat gaagtttata ttgtggtttg ttatgagaag gaaggtgagt attatatttc 12720
acctaaagca aaaaaaattg taataggaac acatagcatt gtttctcaat ctctagaatt 12780
aagaaagaca ataaaaagaa ttaaaccaga tatatgtgta agttttatgc aaggaggaaa 12840
tatcagactt tctctagctt gcatgggttt aaaacaaaaa tatattctgt cagtaagaaa 12900
tgatcctaaa aaagaatatc caaatgcgat tatgcaaaaa ttggtgcggt attggtttga 12960
tgcagctgac ggagttgtgt ttcaaacaga agatgcaaaa aagtttttct gcagtagcgt 13020
gcagaataaa tcgagtataa tatacaatcc tgtatcaaac cagtttttta tagagaatgt 13080
gtcaaaagat actactggaa tagtagcgtt cggtcgtctg gttgatcaga aaaactttgc 13140
gatgttaata aaagcatatg ccgtgatagc agatagaatc gatgatgatt tatatatata 13200
cggcgaaggc cccttagaga aaaggttata tgaaatcatt aattcaaccg gacttgcaag 13260
tcggattcac ttaatgggcc ggactaataa tgtgccagaa gtacttaaaa cagcaaaagt 13320
gtatgcttta agctcggatt ttgaaggtat gcctaatgca ttgctggaag ctgtatgtat 13380
gcttgtgccg gtagtttcta ctgattgccc ttgcggtggg ccaaaagaga tttgtaacaa 13440
tgcgtgtggt ctattgagcc cagttgggga tgtagaatct tttgcaaata atctttacaa 13500
agtttcccac agcgaacaat taagagaaca attagtcaaa aaatgtattg aacgaagagc 13560
tgcgttttca aatgattcta tactgaaaaa gtgggatgca ttttttgata aagtgtgtgg 13620
atgctaaatg gagggaaaaa tgagtaacaa aacgaaaatg ttatttatta tgaattctct 13680
taattttgga ggcgcagaaa aagcacttgt caatttgttt gatactatga attacgatgt 13740
ctatgaaatt gatttgcttt tgctttcaga tgaaggaaag ttgcttagta tagtaaacaa 13800
taaggtgaat ataatacacc cagatctaat tacgcgtaac ctttatggag aacattgtaa 13860
ctttatattt aaatttccta agattatatt tactggtagc caatatttga ttacaaggaa 13920
cagatcctat gtaaatcaat tgaggtggaa gaagttttat aaaaaaataa ttccgcagtt 13980
acaaactaaa tacgatgtgg ctgtttcctt tcttcaggga gatccgttgt attatttagt 14040
tgataaggta ttcgcaaaaa aaaagattgc ttgggtgcat aatgactatc gtatgaccca 14100
gtgcaatagt atatttgatt tgaaatattt tgaacaagtc aatcaagttg taacaatttc 14160
aaatatttgt ctagatatac ttaaagaaat ttttccatcg gtaccatcaa tgtttttgcc 14220
taacattgtt aattccacat caatcaattc atatgcagaa ggacaaccaa atgaatatga 14280
tggagttaaa tcaaaaaaat tacttacgat tggaaggtta aaccctcaaa aaggatatga 14340
ttttctttta gaaattgcag catatttgaa agaaataaaa tatgatttta agtggtacat 14400
tatcggagag ggagaactta aggaacaact attagcggag tggagagaaa aaaagctaga 14460
agattgtgtc tattttattg gtacaagaga aaatccatat ccatatatta agcatgcaga 14520
tgtagtagta caaacaagtc gttacgaagg aaaatctata gtgttggatg aggccaaaat 14580
tctcaataaa cttattgttt gtaccaatta tgatactgtg aaagatcagt taattgatgg 14640
aaaagagggg attatctctt cttttgaagt taaagagttt gcggaaagta tcataggatt 14700
attagcggat gataacaaaa tgaatgagat tgtcaattac ttatcatcgc atgaatatgg 14760
aaatgagaat atgatcaaat tatatgatca gttatttcag gtataaaaat aatactggat 14820
tggagaacaa tggaaaagga attaatatct attattacac ctacatataa cagagaaaag 14880
acactcattc gagtttacga ttctttatgt aaacagagtt ataaatgcat tgaatggatt 14940
gttgtagatg atggctctag agacaaaact aaagaattga tttcgtcatt gataaatcag 15000
aaaaataagc cttttcctat caaatatgta taccaaaaaa attctgggaa acacgtggca 15060
gtaaataaag gcttagaaat tgcagggggg ggtacgttgg aatcttggac tccgatgatg 15120
ccttgtttga tgatgcttta gagacattaa tggggtactg gaatgcaatg acaccccagg 15180
agaaaaaaga atttaaatca gttacaggac gcgtggcaaa tgcagagaca ggagaattga 15240
ttggccctaa aaataagctt aaactgattg attgctcttc tcttgaagcc agatttgtga 15300
gaaaaatggg atacgaaaag tggggattat caagaacgga agttatgcgc gaatttcaaa 15360
gtccaaatat agaaggactc catttttatc ctgagaatat tacatatgac gcaattggta 15420
gaaaatataa agagcgtttt gttgaggatg tggttagaaa atattatcta aattcatcgg 15480
attcaattat aaagaataaa aagggccgta gtaaagaaaa ttattattta tggctacata 15540
atattaatga tgtattcgat tattttttat acaatcctaa aatctttctc aaatcatttg 15600
tcgggctggc aagagatggg gttctttcag gaagaagtat accatatatc ataaatcaga 15660
taaatacctt gccgaaacga gtcttgtttg ttctatttat gcccgtagga atgcttcttg 15720
cttataaaag ataataattg aattgatatg agatgagatg agtcaccttg gaacagtcaa 15780
agccgtaatg aaatagatac atatccacag tgcgtgagca aatatgtcaa attagaatta 15840
aattacccag tagagcatgg gagaaaggaa aaatccaatc atttcctaaa tgtgattgga 15900
ttatatgaga atttaatgaa aaagaacact atattagttg attttgatgt tcctgatgat 15960
tgggaatata aacgtggcat tgaggatgta acaggagaaa aatgggaact gtggaaatgt 16020
atcacacatc ggcttcaaag ttctaagatt aaggttctgt tgcgctatat tataacattc 16080
ttgttcgcgt ttaaagtgtt tttgcacaga aaaaaatata aaagaataat tgcatggcaa 16140
cagttttatg ggttggcttt agcttttttt tgtaagttgt ttaatgtgaa ggattatccg 16200
gaaatatata taatgacatt tatatataaa aataataaaa gtagagtgtt tagtaaattt 16260
gtgaaatatg cggtagattc tagatacata aagaaattga tggtaatgtc ggatggcgaa 16320
aaacagtttt attctaaaga actgaagtta gatgagtctt tgttctattg cactagagtt 16380
ggcgttaaag atgaaactaa ttctattaag caaaatatta cggaaaaata ctatttagca 16440
gtaggtagaa gcaatagaga ttataaattt ttgagagatg cgtggaaaaa tgaatatgga 16500
aagttaataa tagtcaatga ttcatataaa gagccggaaa aagatggtat tgtatgttta 16560
aaaaaatgtt atggtcgaga ctatttacaa atggttgcta attgttatgc tgaggttata 16620
cctttgaatg ataaaaatat atcatctgga gctttgagct ttttgcaagc aatgatgttt 16680
tctaagccgg ttattgtaac aaataatatg acagtaaggg attatattaa aagtggatat 16740
aatggggtta tcattgagaa cacttcagaa gaattggaag gtgcaattaa ccaattagag 16800
aatccgataa tatatcaaga aattgcatca aacgctcgaa aagaatatga agagaaatat 16860
agtgaattaa ttttgggcaa ggatattggt agtatgattg ttcatcgata atggttgcgt 16920
ggagatgttt tttgattcaa ggatcaatcg tttttaatcg atcatactag gaggaggata 16980
agtataatgc acttatatgt tagtaaaaaa gctatattac agtatttaat gctgtatgtc 17040
atgttgattt tttgtcaaac acatgtgtat agattatata ttagatcaaa tttgacgata 17100
catgttggtt tagtaatatt atttttaata attggtgtag tcaattttag gaaaaaaaca 17160
aaacgacctt ttttgatgtg tgcttttttg ttagcaatgg ttataggggt tcgttttatt 17220
aatggtggtg taggaattga tttttgggct gaaatggccg caaaaatact aattacatat 17280
attgctatat tgattgatcc tgaacagttt ttaaccagat ttgtaaagat tataacgttt 17340
tttgcggcaa ttagtattgt gggatggctg caacagattg ctggtttgaa tattatgcaa 17400
aaaatcggca tggtaaataa cgatttttac acaacagtaa catgggataa aggttatgtt 17460
gaagaaactc agcgtaagat ttatgggttg ttgttttatg tgacgacaga tgtagaaatt 17520
aaacgtaata tgagtatttt cacggagcct ggtatatatc aaatggtatt gaatgctgcc 17580
atttttgtag tagcgttttg taataaactc attgaattaa atcctaaaga aattaaaaaa 17640
tattttttaa ttctaacaat tgcattaatt acaacgcagt cgacatctgg atattttgga 17700
tatgcggtaa tagttcttgg tattcttttg acacgaagtg cagacacaag gacaattaag 17760
agctatattt atattatttt agtggttgga tttgtagtgc tattgggaga ttattccgta 17820
agaggaaatg atagtttgat ttacagggca ttgttatcga aagtgttttc tagtcaggga 17880
gacttttcat tttccgcttc gacgggagtt tatcgttatg aaatgattgg gatggcattg 17940
ttggctatgg caatgaatcc atttggtatg ggctatgaag cttgggcaaa attgtatcgt 18000
ttgaactcat tcgctgatgc cggtggatat ccgtttatta ttggagcggt aattggtatt 18060
gtaccacttt ttgtatcatt atggtggata tttagtccgc ttaagtatat gaaaaataaa 18120
tgggtagaaa ttgtagtatt tttattttta tattttaata cagctatggc acaaacaagt 18180
gcattttatc ctgcaataat ttttttgcct gtatttcttg atattatgag acagaacatt 18240
actgatctgg agttggagaa agagtatgct tatgaactat aaatacgtaa ttgctggtgc 18300
agataatgat ttttatgata ttgcatttga tgatgttaga aataaggata aaaacattac 18360
ttatcttaga acggccatgg attttagtaa tccggtgata agatttttga gaaatctcca 18420
ttttagtgct cgagctaata cgattattaa tttaccaggt aagtttatat ggaatagatt 18480
tacttttgaa aatccttata aaaatacaga taaaatatgc tttgttgttt ttggagcaaa 18540
ctacgctaga tatgtagagc taggagtttt cgaatcatta cgaaagagat atccgggatg 18600
caaaatagtt tgttattttc aagatttagc aagtaaatgt ggttattcgt atccagaaaa 18660
attgaaagac cattttgatt taatcttgtc atttgatcaa aaggattgcg atatgtacgg 18720
atggatatat tatccgttag tatactcaaa agttcatata gaagataata atgatattcc 18780
tgaaagtgat gtgtattttg taggtaaggc aaaagataga ctatctgaaa ttatttcgtt 18840
ttttgaaaag tgtgatgctg caggcttgaa atgtgatttt catattgtcg gtgtgccaaa 18900
agagaatcag gttctcaaaa ataaaatctc ttattgtgga caaatgtcat atgaagaaaa 18960
tttacagaga ataagaaaaa ctcggtgtat gttagaaatt atgcaacagg gggggcatgg 19020
atatacattg aggtattgtg aagcaattgc catgggaaaa aaattggcta ctaataatcc 19080
cgaaatagaa aaagcacctt tttataatga aaagttcatt tctatattta gaaacgttga 19140
agaatttgat cctcaatttg ttcttaatgg tgatcgagat gtggactata attatcttcc 19200
tgaattgtct cctttgaagc tgattgaatt tattgacgct agaatataaa attttgtgaa 19260
gaactgctat gaaaatcaac cataaataga aaaagctttc catttcatat cggtgatata 19320
gaggatggat ttaatagagc cccattaggt tagctgttta taaatcattt ttattggtat 19380
agtatatgaa tttcatttaa aacatagcta taagcagatg aaagtgtgat gatagctttc 19440
caattggcat attagtggta ataagatgtg tataaaaatg ataaagacac attaataatt 19500
gtggatagct gtagataagg aaaattaatg tttggaataa aatcaagaat cagaaaatgt 19560
aaagatatta taaattttaa attgaaaaat aagggggtag tagttggaaa aaatattata 19620
ttaagaaacc ctcagtatat ttcatgtggt aaaaatgtag ttattggaga tgaaagtaaa 19680
ttattatgtt gggatagtta tggagaagaa caatatagta atttaccaga aatccaaata 19740
ggcgataatt ttcatgcgac acgaaatttt acaattcaat gcgctcaaaa agtggttatt 19800
ggaagagatg tattggtagc ttcgaatgtc tttattatag attataatca tggattaaac 19860
ccattaacca agtcgtatct tgaaaaccca ctaatacggg gggggggtac ttgtagatga 19920
tggtgtttgg attggaaata atgtgattgt tcttcctaat gttcatattg gaaaaaaatc 19980
aattatagga gcaggttctg ttgttacaaa ggatatacca gaatactgta ttgcagtggg 20040
aaatcctgca aaagtaataa aaaaatttga tataaaagaa aaaaagtgga aactggtatt 20100
atgaattgga gaataaaatg gtaaagaaaa aactacaaaa tattcctttg ggagtaaaat 20160
cagcagtggt atataccatg gcgtctgttt tttcaagagg attatcaatg attacagttc 20220
caattttcac gagaataatg tctacaagtg aaattggaat ggttaatcta tataattcct 20280
ggtatttatt attgaatgta attgcaactc tatcattaac atcaggtgga tttcaggtag 20340
caatgaaaga ctttgagggg gaaagggatc aatatcaatc gtctgttttg acattaacgt 20400
caatgatggc catttttcta ggctgtattt attttctcat acctaatagt tggaatagga 20460
ttacagggct accttctgct ttgatgattt taatgcttgt agtgtttttc tttgcacagg 20520
cgcaggattt ttggctattg agacaaagat atgaatataa atataaatta gctggtgcat 20580
tgacaatggg gtcagcttta gcatcaacag tattgtctgt tatagttgtt ttaagcttaa 20640
ataaagcaaa ttcagatcaa attgtagtgg ggcgcttata tgcaactaac attgtatcta 20700
tagctatttc agcaatttta tggattaaat tatatgtaaa aggaaaaacg atagtaaata 20760
taaagtactg gaaatactct ttgaaattga gtgtgcctct tattggttat gcttttgcgg 20820
cacagatttt aagtgtttca gatcgaatga tgataagcaa aatggttgga aatgatgcag 20880
ttggaatata tagtacttta tatactgtaa gttcaatttc actgttagtt tggactgcaa 20940
ttaattcatc gtttataccg tatttatatc aaaatataga gaaaaaagga aatagaataa 21000
aagaattatc attagcttta atgggttcgt atgctattat agctgttatg cttacttttc 21060
ttgcaccaga aatagttaaa atattagcta ctaaggaata ttatgaagca atttatatta 21120
tgccacctat tgcagcaggt gtgtttttga cttctgtgtc aaatatgtat tctaatttgc 21180
tcatatacca taagaaaact aattatatta tgtattcttc gattatcgca gctactgtaa 21240
atcttatact caactatata tgtataaatg catttggcta tatggcagca gcatatacta 21300
cactgatagc atacatagtt ttggcgggaa cacaagcaat gtttgcgaga aagattcgct 21360
ttaaagagac tggggaaaaa tctgtatata atgataatgc ggtatttgtt atggcgatat 21420
taacgataat agtagcctta ttcggcttgg tattgtatcg ctatacttgg ttaagatata 21480
taatcatttg tactggaatg attgcaggaa taaaaatcgc gttcatgact ttgaaaagaa 21540
ttaaaagtta actaggaatt aggggagacc tgctatgaat aaaatagcta taaaaaagct 21600
aatgtcacca ttatataaaa taaaatacag acttcattct aatggatggg tttatattgg 21660
aaatcacact aaaattgtaa atcctaagca tctccatttg gaggggggaa tcaaattgct 21720
ccatattgtt taatatgccc acatggagat gcatttatta aattagggca aggtgtaaat 21780
atcggtatgt tttcacgact tgcttgtatt aacgagattt ctttaggaaa taatgtactt 21840
acaggaccac atatttttat ttgcgattat aatcatgcat atgaagatat aaatagacct 21900
atttctttac aaggaaatat tggaaacgat aataaggtta ttatagatga tgactgttgg 21960
attgaaacaa atgttgtaat ctgtggcaat gttcatattg gaaaacatac ggtaattggg 22020
gcaaatgctt ttgtcaataa ggacattcct agctattgcg ttgcggttgg aaatccagca 22080
aaggttgtca aaaaatataa ttttgagaca ggtgcatggg agaatgtata agtaatgaaa 22140
agtattataa atagcgttca gcaggaaaac ggcagataat tgtctaaaaa gaaaaattgt 22200
ttccatggca taatgataac tgacaaatga cgctaggctt tgcctagaag tatactatgt 22260
caggctatta tagtatgatt tgcagtacct acaaattgga caagtgtttt ggagtccact 22320
ttcgtaaata gacctcgttg gtgactgcaa agaattcaat atgcgttacg acatcgaatc 22380
tgatcaagat ttataatgaa tcatactgtt tcctgaagac taaattgaaa atggtattaa 22440
gaaaattatt cagtgatgcc ttgataactt tggatggtgg gataccatta ttagtcgtga 22500
gtgatatgaa ctaccatgag aaaatgcata gcactaataa cggattttag gaggaatcga 22560
tatgaagttc tttgtaacag gtgttggtgg ccaacttggc catgatgtga tgaatgagct 22620
gctgaagcgt ggccatgaag gtgttggttc tgatattcag gaaaactaca gcggggtggc 22680
agatgactct gcagtaacaa aagcacctta tgtggctctg gatattactg ataagaatgc 22740
cgttgaaaaa gtaattacag aagtaaatcc ggatgccgta atccactgcg cagcatggac 22800
tgctgttgat atggctgagg atgatgataa agtagcgaaa gtccgtgcaa tcaatacagg 22860
cggtactcag aacattgcgg atgtctgcaa gaaattggac tgtaagatga cctatatcag 22920
cacggattat gtgtttgatg gtcagggtac agagccctgg cagccggact gcaaggatta 22980
caagccactg aatgtgtatg gtcagacgaa gctggaaggt gaactggctg tcagtcaaac 23040
gctggagaaa tattttatcg ttcgtatagc atgggtgttt ggtctgaatg gtaagaactt 23100
tattaagacc atgctgaatg tcggcaagac gcacgatact gtccgcgtgg ttaatgatca 23160
gattggcaca ccgacctata catatgattt ggctcgactg ctcgttgata tgaatgaaac 23220
caagaaatac ggctattacc atgcgaccaa cgagggcggt tatatcagtt ggtatgattt 23280
cacgaaagaa atttatcgtc aggctggtta taagacggaa gtcctgccgg tgaccacggc 23340
ggagtatggc ttgagcaagg cagctcgtcc gttcaacagc cgtctggata agagcaagct 23400
ggtggaggct ggattcactc cgcttccaac atggcaggat gcactgagcc gttatttgaa 23460
agaaatcgag tagaatgatg agatatactt tgatagatct tacaaagcga aacttaaaag 23520
acaaggacta gaagtgtctg gaggtaaaaa tactttttga tttagataat aaatgaggag 23580
aaaaagcgat gaaaacaaca ttacttgtca tggctgctgg tatcggtagc cgcttcgaaa 23640
cgggaattaa gcagttggag ccggtggatg cttctaatca tattatcatg gattactcga 23700
ttcatgatgc aatcgaggct ggcttcaatc atgtggtatt tattatccgt aaggatattg 23760
agaaagagtt caaagaggtc atcggtgatc gcattgcctc tatttgctct tctcacaata 23820
taactgtgga ctacgctttc caggacatta acgatattcc gggaacttta ccggaaggcc 23880
gtacaaaacc gtggggaacc ggacaggccg tgcttgctgc taaaaatgtg attgataccc 23940
cgtttattgt cattaatgct gatgattact atggcaagga aggctttaag gctgtccatg 24000
agtatctggt aaatggcgga aagtcctgta tggctggctt tgtgctgaag aatacgctgt 24060
ctgataacgg tggtgtaact cgtggtatct gcaagatgga tgagcagtac aatctgactg 24120
aggttgttga gacaaagaat attgtgaaga ccgcaactgg agcagaagca gacggaaaag 24180
tgattgatgt tgattctctg gtatctatga atatgtgggg attaactcct gattttttgg 24240
acatgctgga gaaaggtttc aaagaatttt tcgagaaaga agttccgggc aatcctttga 24300
aagctgagta tctgatccca atcctcatcg gtgaactgct ggagcagggc aagatgtctg 24360
tgaaggttct gaaaacgaac gatacctggt atggtatgac ctatcatgag gatgtcgcag 24420
ttgtaaagga cagcttcaaa aaaatgctag aaaacggcgt gtacaaggct gacttgttca 24480
gagatctcta aacaatgatg aaagaaacta ttgatttact cgggaagatt ctcacaaaca 24540
tcctgactgc gctctatgag ccgtttggct tttcgcttct tctttccttt ttggccatgt 24600
ttttctattt gtatgcgcat gagccaatac atgctggtaa aggttggaaa aatgctatag 24660
taacgtggta tcagaagttc aaagagagca tgttcttccg aaagttgttt ttattgactt 24720
tcgtgatttc gatgatcttg ttcagaacgc tgttgaattg gaacctgtgg atgaatccgt 24780
tatccaaggt catgggtggc tgggttatct gggagataga aaatggcgaa cagaagctga 24840
ctaccgagtg catcgagaat gtgatcatga tggtgccgtt ttcagcagta gtggcgtgga 24900
cgttcggaga gaagattgga aacagctgga agaaaatact gtggtatagc ggaaagatgg 24960
catttatctt ttctataagc attgagttgc tacagcttct actacggcta ggaacattcc 25020
aactatcaga tatcttttat aatacggtag gtggagtgct cggtggactg atatactacg 25080
gagttatgaa ggcaagaaaa catctgtaaa aaggaatgga attgtgatat actgggagca 25140
gaagctccct acgctgctcg aaactttgta cacctctccg tggtgagcag cagtcaacac 25200
agtaaaaggt attgtggcta agaatgcgaa gtaggagatc ctttagtttg ttatgattgt 25260
ttcaaatcat tacattgagg aacttaaaag caaaatttac acgaaagatc tatttgatcg 25320
tgactaaaca acaattttgg agggaaactt ggaacgcaaa aagaaaaaaa agaatatttg 25380
ggtgataatt atatctatct taatttttat tgcccttata ggagcagggg cttattcctt 25440
aagaaattta cttattccta ctaatcatcc gaggacaaac agttcggatc aacctaaaaa 25500
aacttcggtc tctaacggtt atgtagagca aaaaggtgaa gaagctgccg taggtagtac 25560
agcacttgta gatgatactg gtataccaga atgggttaaa gttccctcaa aggtaaatct 25620
agataaattt actgatttat ctacgaataa tatcactatt tatcgaatta acaatccgga 25680
agtcttaaaa acagttacca atcgtacaga tcaacggatg aaaatgtcag aagttatagc 25740
taagtatcct aatgctttga ttatgaatgc ttccgcattt gatatgcaga caggacaagt 25800
agctggattt caaattaata atgaaaagtt gattcaagac tggagtccag gtacaacgac 25860
tcaatatgct tttgttatta ataaagatgg ttcgtgcaaa atttatgatt caagtacacc 25920
tgcttcaact attattaaaa acggagggca acaagcctat gattttggta ctgcgattat 25980
ccgtgatggt aaaattcaac caagtgatgg ctcagtagat tggaagattc atatttttat 26040
tgcgaatgat aaagataata atctctatgc tattttgagt gatacaaatg caggttatga 26100
taatataatg aaatcagtgt caaatttgaa gctccaaaat atgttattgc ttgatagtgg 26160
tggctcaagt caactatctg tcaatggtaa aacgattgtt gctagtcaag atgatcgagc 26220
cgtaccggat tatattgtga tgaaataaaa ataaaagaac ctcttggttc ttttatttta 26280
gagatttttc aaaaagagtt ttgactgagt ctaattctgt ttgagaaact actttagctc 26340
cattttcatc tgttgtatgt agattgagct tgctggtatt ctttagagcc ttattgtagc 26400
tcataacgat agttttagca tcatcgaaac ttatattggt tttaacagtg tttgaaaaag 26460
catagagaat ttctcgatag taattgaaac tttcaagttt tttcatatga gccatttttt 26520
gaatatttcc gtagagttcc attgagacat tttgaatccg agtgattgaa gcatccaaat 26580
cagtatcgtc aatttgtgtc atataggctt ggacttgatc agcagtttgt aaattaacag 26640
ttccttgttt aaactcataa ccttcagcat tgaatgcctt tggattttgc atggtgattc 26700
caccagtggc ctgtacaagt gatcccattt tattaacatc aatctgaatt actttgttaa 26760
tggacacatt caataggtct ttaaccatct ggaaaattcc atcatctcca ttcgtattgt 26820
aaacttcagt gattgttttt tgattaggca ttgtcgcaaa aactgggaag ttcatgaaag 26880
tagtttgatt tgtctttaca ttcgttgaag ctaaaacagt agcataagct gtatttttag 26940
aattattttt accagttgca atgataagtg tggtgaatgt tttagacttt tttaagtcga 27000
tacttgttgt tttagggaaa ttttcatatg atgttgaaaa tgttgattca acatttctat 27060
aagctgcgta ggctatagaa gcaatagcga taattacaaa tgcaaaaata attgaaacaa 27120
cttttagtac tgtgtgtttt ttcttacgat aataacgcct ctttttttga gtcat 27175
<210> 222
<211> 24364
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33141 eps gene cluster, complete sequence
<400> 222
atggatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcag tgataattgg aggtttttat 420
agttataatt ctaggataaa taatctttca aaagctgata aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa agtattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatcaaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac aagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcacg gactagtaat 840
gagcaactta taacaaattt ggctagtaca ggagcagagg tgatagttca accctctcca 900
ccgatttatg gtggtgttgt ataccccgta caagaagaac aatttaaaca atctttatct 960
acaaaatatc cctatataga ctactgggct agttacccag acaaaaattc tgatgaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgattcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaagaaat gcaggaaaca caggaacaaa cgattgattt aagagggatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattc ggataattca gcagcctacg ctggacaagt gaccgggaat 1380
attcagatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc tgatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acaaattcac aagttattac gcttactgtt aaatattcta atccttacat tgcacaaaag 1560
attgcagacg agactgctaa aatttttagt tcagacgcag caaaactatt gaatgttact 1620
aacgttaata ttctatccaa agcaaaagtt caaacaacac ccattagtcc taaacctaaa 1680
ttgtatttag cgatatctgt tatagccgga ctagttttag gtttagccat tgctttattg 1740
aaggaatcgt ttgataacaa aattaataaa gaagaagata ttgaagctct ggggctaacg 1800
gttcttggtg taacaaccta tgctcaaatg agtgatttta ataagaatac aaataaaaat 1860
ggcacgcaat cgggaactaa gtcaagtccg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta tcaccagtgt caatccccaa tcacctattt ccgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caaggaatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcaaccgta agtgctaata tagctgttgc ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgc taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttaaaa taattacctc tggtccaatt 2340
ccacctaatc catcggaatt attagcatct agtgcaatga agaatttgat tgactctgtg 2400
tccgatttct atgatgttat tttgattgat actacacctc tctttgcagt tactgatgct 2460
caaattttga gtatttatgc aggaggagtg gttcttgttg tacgtgccaa tgaaacaaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaac ttaatgcaaa tatattagga 2580
gttgttttgc atgggctaga ctcttctgac tcaccgtcgt attcctacta cggagtagag 2640
taattggaat gaattttaat caaataaaag acagaaattt gtagaagagg agagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctgaagat 2760
actttgaaaa tgctgaaatc agcaattgat gaagggataa caaccatcac tgccactcct 2820
catcataatc ctcaatttaa taatgagtca ccccttattt taaaaaaagt taaggaagtt 2880
caaaatatca ttgacgagca ccaattacca attgaagttt tgcctggaca agaggttaga 2940
atatgtggtg atttattaaa agaattttct gaaggaaagt tactgacagc agcgggcact 3000
tcaagttata tattgattga atttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggacttcaa cctattttgg tccaccctga gcgtaatagt 3120
ggaatcattg agaaccctga tatattattt gattttattg aacaaggagt actaagtcag 3180
ataacagctt caagtatcac tggtcatttt ggtaaaaaaa tacagaagtt atcatttaaa 3240
atgatagaaa accatcttac gcatcttgtt gcatcagatg cgcataatgt gacgtcacga 3300
gcatttaaga tgaaggaagc atttgaaatt attgaagata gttatggttc tggtgtatca 3360
cgaatgtttc aaaataatgc agagtcagtg atcttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa gaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatggaag tttttgagga tgcctcagca cctgaatcgg aagaacacaa attagtagta 3540
ttaaaaaatt tttcttatgg agagctaatt ataaaaagag caattgatat cctaggagga 3600
ttagcgggtt cagttttatt tcttattgcg gctgcattgc tttatgtccc ttacaaaatg 3660
agctcaaaaa aagatcaagg gccaatgttc tataaacaaa aacggtatgg aaaaaatggt 3720
aaaatttttt atattttgaa atttagaaca atgattctta atgctgagca gtatttagag 3780
ctacatccag aagttaaagc cgcctatcat gccaatggca ataaactaga aaatgatccg 3840
cgtgtgacga agattggttc atttattaga caacactcaa ttgatgaatt accacaattt 3900
atcaatgtcc ttaaaggagt cggcggtata gggcacctct aaaaacgtat gaaaagaggt 3960
ataatagatt aaaaataagc taacgaggta ttgaagatga atttatttgg agatagcgac 4020
tatcttgaaa aattaagttt aaaaggagac cctcttgagc ggctagaaaa agtggttgat 4080
ttcgagtgtt ttcgcccaac tcttaaccgg atttttaagt atgatttaaa aaacaaatct 4140
catggtggta gaccgcccta tgaccttgtt ttaatgttga aaattctaat tttacaacgc 4200
ttatacaatc tatctgatga tgcaatggaa tatcaaatga ttgaccggat ttcgtttaga 4260
agattcctta agattgatga caaggtacca gacgcaaaga ctatctggaa ctttagaaat 4320
cagctatcaa aatctaatag aggaaactcg cttttttctg cctttcaaga aaaacttgaa 4380
tcacaaggga tgatcgctca taaaggacag attgttgatg caacattcat agaggcgcct 4440
aaacaacgta atccaaaaga cgaaaatgaa ttgattaaag caaatagggt accagtgaat 4500
tggacgaaga acaaaagagc acagaaggat acagctgctc gctggacaat taaaggtaat 4560
gagagacact atggctacaa aaatcatatt gctatagata ccaaatcaaa attcgtaaag 4620
aattatcaaa caacgcctgc aaatgtgcat gattctcaag taatgggagt tcttgttgat 4680
cccgatgaga tcaccttagc agactcagct tatcaaaatc aagcaacacc taaagatgct 4740
gaacttttta cctgtcttaa aaacacatgt tctaaatatc ttaaggcaga cgataaaatg 4800
tttaataaaa tcatttcgaa aagacgtgtc cgtattgaac atgtatttgg ctttcttgaa 4860
aattcaatgc atggttctag tttacgttct attggatttg atcgagctgt tttaaatact 4920
gaccttacaa atctgactta taacttattg aggtacgaac aagttaaacg tttaaatctc 4980
aaaacatgaa ggtaatctgt ctaaaataag aataaacatt caagtggcgt ttgtaacata 5040
aagaaaaacg cagtagacta tgcttagttt tctgtgtttt tttctaattt cattttgagg 5100
tgaattaatt ggtagttatt agaggtgccc tataatagag ccaaccgaga aaaagcttga 5160
ttttacaggg ctttcgttgg cttagctagg ttccttagta tgattcttat gatttttcat 5220
taaaggaaac caatcgcatt tccagcggga atatatataa taaagtggtt tcctggatgc 5280
ttatttgacc caatcataaa taagtaaatg agaatcataa ataataggag gtaatggctg 5340
agcctgcgga tgtccctcga aagacgttca caggttatct gtcgttggag aaaaatataa 5400
gaaataaatg agaaaaaaga gataagaaaa tagccctcga ttttgtcggg ggggggtaca 5460
gtaagggaga tatgaaatgc taggaatatc aaaacgaaag tgtacgatgg ataccccagt 5520
acatattaat gggaatgaaa agaaaaagaa aactgaatat cgaaattctt atatcgcata 5580
tctaaatatt caaaatgtga ttctcatgtg ctgcttactt gtgttcgggg tatggctatg 5640
gacggaatta catatctcaa caaatcttaa catattctgg attaatgctc ttcttggtgt 5700
agcaataatc ctatctagtt ttaaatcact caaatttgaa ggcaatttct acaattgtgg 5760
ggacgattgt agattaacag atagattaca aaaaaatgga aggggaaaga aaaaaatcta 5820
tgtatagaaa agattctgaa ggatggttaa agcacgcaga ctttatagtc ttagatatga 5880
tctgtttgca attagcgtat gttctggcat atgcaattag cggatatgga tttaatccat 5940
atgaaacaat tatttatcgc aatatggcag ttttccttga attggcagat ctggttatga 6000
tttttgcata tggcaccatg aaaagcgtgt taaagagagg atactatcgt gattttgctg 6060
ttacgttaaa tcatgcgatt atggtaggtg ccttagcggt tttatattta ttcctgcttc 6120
aggaagggca ggacttttca agattaacat tgatgctgac cataataatt tatttagtaa 6180
tgacgtatat tgtcagagaa ctttggaaaa aacttctgcg aaaacagatg aaggatggcg 6240
gagaacgtaa actactgatt gtaacatcag aagatgtggc tgaacaagta gtgttaagta 6300
tgcaggaaaa taattatgcc agattttcat tggctggtgt agctgtaatt gatgcggact 6360
ggactggaag agaaattcat ggagttccgg tagttgccaa cgaagagact gcagcaatgt 6420
atgtatgtca ggaatggatt gacgaagttc tggttgttgt ttcagaagtt cttccgtatc 6480
cggcagagtt aattgagcag ttatcagaga caggagtaac cattcatctt aatcttgcaa 6540
agattacaag tgtgccagga aaaaaacaat ttgtggaaaa agttggtaat tacacagttc 6600
ttacgacaag tattaattat gcatcaacca gacagttaat gttaaaacga ttgatggata 6660
ttgcgggtgg attagttgga tgtattttta ccggaatcat ttgtattttt gtcggaccgg 6720
caatttatat tgcatcaccg ggaccaattt tctttgctca ggaacgagta ggaaagaatg 6780
gaaagaaatt taaaatgtac aagttccgca gtatgtatat ggatgcagaa gagcgtaagg 6840
cagagcttat gaaagataac aaacttggag atggaaagat gtttaaactg gactttgatc 6900
ctcgtgttat cggaaataag atacttccag atgggacaca taagacagga atcagtgatt 6960
ttatcagacg aacaagttta gatgaatttc cgcaattctt taacgtatta cgaggcgata 7020
tgtcgattat aggtactcgc ccacccttga tttcagaaac gaatctgtat gagcttcacc 7080
atcgtgcaag gctggcaatt aagccgggaa tcactggcat gtggcaggta agtggacgaa 7140
gtgatattac tgattttgaa gaagtagttc gtcttgataa agagtatatc acaaactgga 7200
acattgggct agatataaaa atattattta aaacgatatt ggtagtcttt aagaaagatg 7260
gatcaatgta aagcctacaa gcagcttatt ttatagagag aataagctgc aaaggataaa 7320
tataaagatt gcacttaaaa gaggaagtta tgaataaaag gaaaaataat aattctccac 7380
ttaaaattgc agttcttgga cataagacaa ttccgagtcg ccaaggtgga attgaaattg 7440
tagttgaaga attaacagtt cgtatggcaa aactgggaca taaaataaca gtatataacc 7500
gtagcggaca tcatgtaagt ggtaaagaat ttgatggaaa gaagctaaaa gaatataaag 7560
gaatccgaat gaaatatgta ccgacaatag ataaaaaagg attagcagcg atgagtgcct 7620
cttttttcgc agcggtggta gcggcgtttg gaaaatatga tgtggtacat tttcatgcag 7680
aagggccgtg tgcaatgtta tggttgccaa aattattcgg aaagcgatgt atagctacgg 7740
tgcatggatt agatcatcag agagcaaaat ggggtaaatt agccagtaca tatattatgt 7800
tgggagaaaa atgtgcggtt agatttgctg acgaaattat tgtgttaagt gaaggggttc 7860
agaaatattt ccttgataca tatgggagag aaacccgttt tatacctaat ggtgtaaaca 7920
gaccaattat ccgaaatgct gaaattatta agaataaatt tggtttagaa aaggatagct 7980
acatcctctt tttgggtaga ttagttccag aaaaagggct tagatatttg atagaagcct 8040
ttagacaggt agatactgag aaaaaaatgg ttattgctgg tggaagttca gatacagatg 8100
aatttacaaa agaattaaaa gaactggcaa aagatgattc aagaattatt tttactggat 8160
ttgttcaggg aaaagagtta gatgaacttt atagtaatgc ttatgtatat actttgccga 8220
gtgatctgga aggaatgcct cttagtttat tagaggcaat gagttacgga aattgctgct 8280
tggtgtctga catagatgaa tgtgcatctg ttgtagaaga taaagcattt atttttaaga 8340
aaagtgatgt ggcagatttg caaagcaaat tgcagaaagc ttgtgatgat aaagaacaag 8400
tacaaaaata taaagatgaa gctgcagatt atatttgcca aaaatataat tgggatgatg 8460
ttgtagaaaa aacactagaa ttgtatcaat aattaaggag gatgacatga aagggattat 8520
tttagcaggt ggatcaggga caagattata tccagattgg actaatgtca aatatttaga 8580
cacaaaatga gctatatttt cttaaaataa ttttcttctt tctgatttgg agttaaacca 8640
ttgttggcgg aatgaggatt ataattgtta taaaattgtt cgatgtattc aaagcaagag 8700
agttgaactt cttgaatgga gtgataagtt ctgcgattaa tttctcgttg tttaagatac 8760
ttgaaaaaga cctcagtgac ggcattatca taaggatatc caggcttgga gtaagaagca 8820
agcaattgat gctcatctaa taactttcta aaggaagctg atttaaattg gcttccttga 8880
tccgaatgaa aaataattgg ttccttaggt ttcctcttat ttatagctat ttctaaggtg 8940
tcacaggcga gcttggcatc aatcctatca cttactttcc acgcaataca tttccttgag 9000
tagaggtcaa gaatagcaca gagataaaca tgtcgcttag gtcctataga gatataagtg 9060
aaatctgttg tccaaacttg atttggggag ttcgggttaa attcttgttt aagcagatta 9120
tcagaagaaa atacaggaga tttatttgat ttaaaacgag gtttgatggt tgacatttta 9180
gggagtgtca tagacttgag aagtcttaag atacggcctt cagaaatatt aacgccataa 9240
tcacgcagaa gaatgatttt aaaggctctc gttccaattc ttttcttggc tttcatataa 9300
atctcaagga ggtgttttct caagcgttga ttttctactt cacgcttcga aggcctcttg 9360
tttatgaagt tatagtaggt ggaacgattg acatgtaaaa cacgacagag cataactgtc 9420
gtgtgttcaa atcggagtct atagatggct ttcaatctta cttggagttt tgcatgaata 9480
tggcactcgc tttttttaag atcagatttt cttcctctag ttgggcattc cttttttgta 9540
attcttgaat ctgtttagca gtcaacaccg tattatcttc aagacgcact tgagagtact 9600
gcttaatcca ttttgcaagt gcagaagaag ataccccata atctttacag agttcagttt 9660
gtgttttacc agtttgatag agattaacga gcgattgttt gaaatcctcg tcgtaacgtt 9720
taaaacctga cataaaagtc ctttcatttt tgtgtcttaa tagatagatt ataacacaca 9780
attttctgtc cacttttata gtatagctcc aatggggaga ttcaattaaa tgttggacat 9840
atagaaaatg tgttgtttta tacagttaca tctattgtgg gttggatttt aatagcaaca 9900
gtatcacaaa tttttgtaat aggagttagt ttaccattga taattgagtg gttatataca 9960
agggctaaag ctacggtaat aaagcagact tgatagtgca aatagaagga gaaaaaaatg 10020
gaggttatta ataaggaaca ccctcttatt agtgttattg ttcctatata taaagtcgaa 10080
aaatatctgg gtaaatgtat tgagagtatt attgcccagg aatattcaaa tatcgaaatt 10140
attttagtgg atgatggatc tccagataat tgtggtaaaa tatgtgatga ttttgctact 10200
aaagatgctc ggattaaagt tatccataaa gaaaatggag gactttcttc agcaagaaat 10260
gctggaattg atattgcaac aggtgagtat ttgggctttg tagatagtga cgattcgatt 10320
gagccgttta tgtataaaaa attgatttct tcaattatag aaaacaaaac caaattagca 10380
gtatgtgcgg taaattatgt atttgaaaat ggtaagattc ttacgaaatc taatttaggt 10440
gagaattgta catttgattt ttatcaagca atgatagaaa tgaattctca tagaattttt 10500
gatatgggtg catggagcaa gttatatcat agagatttat tttttgattt gcgctttccg 10560
gaagggaaat taagtgaaga ttattatatt atgtataaaa tctttgaccg agcgcaaaaa 10620
attagttatg tctcaacacc gtgttataat tatttacaac gacagaatag cattacacat 10680
aatgttagaa ttaatcacga tcatgaatat gctgcaaagg aacaaatgga atatttagaa 10740
aagaaatatc cagaattaaa ggtcttggga catacagctt atgcttcagc ggcattaact 10800
gtatatgact catatattaa aaatgccgtt agctgtcctc aaaaggatat aaagcatttt 10860
aaaagtgtag ttcgggaaaa taggcagtat attaagaatg cagatttttt gtcaaaaagc 10920
aaaaaagttc aatttcaatt attttcaatt agtacagcta tgtacaatat tgtatttaag 10980
gtgtatagaa aaattaagag ggtttagtat ggataaatta gtgagcgtta ttattccagt 11040
gtacaataca ggaaatatta taaaaaagtg tattaagagt atattgaata gtgattatag 11100
taatatagaa ataataatta ttgatgatgg atccgataaa gaaactgtag atatatgtaa 11160
caacttagaa aaagaagaaa aaattcatgt gattcatcaa gaaaatgctg gtgtaagtag 11220
cgctcgcaat aaaggaatat atcacgctca aggggagttt attacttttg tagatgcaga 11280
tgacacaata gattcaaacc taattagtgt tcttgtaaat agttgcattg aaaaaaatgc 11340
cgatatggca atttgcggtt atagagaatg gtatgatgat aagcattgta cagaattccg 11400
gtgcacagat tcaattacga tattaaaaga aaaggaaatt ttaaaagact ttttttctac 11460
aaataatatt gcatggaatg tgtggggaaa aatatataag aaaagtttgg ttggcgatac 11520
aagattcata gtaggtaaaa ggactggcga agacatgtat tttgtatatg aaatcttaaa 11580
aaaagctcat acattggtaa tgaataataa ggcattgtac aattatgaaa aacaagataa 11640
ttctgcaatg gcagattcaa attgtatgaa attttttgat acttatgaat tagttaataa 11700
agtatttgaa gatgaagcat tagataacga gctgaaaaat gcgcaattaa atttttatat 11760
aaaaagtgag ttgtggtttt ttcgtttcat aaatgcaaaa gataaggata atgagaataa 11820
aagtgaaata aagaaagcaa gaaaaaaatt cttggacaat attaagaaaa aagaagcaag 11880
atgttctgga agaacaaaaa tagaattaat tttattgcga tatttttatc ctgtttttag 11940
agtgatctct cttatatggg gggcaaaaaa ggggatttag aacagaatca tgaggacact 12000
ttgacatcat ataaacaaat gaaagagatg ttaaaaagag aaaaaaaata ttatccaaat 12060
acatggtttg acaatataag ttgtaatcaa cgagtataca actggcgatt catgaaatta 12120
ttacgcagat gcgagcttta tagatataag gcggatcatt ctaagaatcc tgtatggaaa 12180
ggattatatt tgattaatcg tacgaaaaag aaccggttag gtgtttggat tggagtagag 12240
ataccagaga atgtgttttc agaaggatta attattcatc atagtggaaa tattgtagtt 12300
aatggaagta gcaaagtagg gaaaaattgt caacttcatg gagataattg tataggaaat 12360
tcaggaaaag aaaatgaatt aaaaaaatgc ccacagattg gagataatgt agaaatagga 12420
gttggagcga aagtattggg tggcattact attgcaaata atgtaaaaat tggagccaat 12480
gctgtagtta caaaatcatt ttatgaagag ggcattactc ttgttggaat acctgcacat 12540
aagctagaaa ggtaaataca aatgaaaatt ttattgctta taaattggaa aataaaatat 12600
tgtgaccata taccagagga tttgcaacct tcagattatg attgtcctca ggaggtatat 12660
tggtttttta aatactttaa agataaacca gaggtagatg ttgttgatat tagtgcacca 12720
aaatttattg aaaagataga aaataaagtt cggtttcatt tttttcaaac ttttaaaata 12780
ttatttaaat tgaataaata tgatttgatt tttgtttgtg gatctaatag tgcaatgctg 12840
ttgtgtgcat taaaaagagt gtttcacata aaaacacctc caatattgga tgttgatata 12900
agttcatttc atcaagcata tacttcagga ttgattcata gattatcaca gttttctagt 12960
agggcttttg attatatggt atatcatact agttcacaat atgattatta tatggaatat 13020
tttccatggt taaaagataa gtgcaaattt gttccatttg gtgttgatta taattattgg 13080
aaattaaaaa cgtatgaaga tatacctgaa aaggatcaat atattgtttg tgtaggatat 13140
agaaaaagag attggaatac attactaaaa gcgtttgata aaatagatat tccagaaaaa 13200
ttatatctta ttggaaatcc agatattaaa tgtgataatc caaaagtaaa agtgcttcct 13260
tttatcccag ttgcagcaga tggctttggg agttccggta ttagcagcag atgtttcagc 13320
tatacgtgat tatgtgaatt gtagtgaagg ggttctgcct tatatagcat ataatgtaga 13380
agatttagct tctaaattag agaaaatgtc aaaaaaatca aaagaagatt tagatataat 13440
gggatataaa aataaattag cggtacagac tgttttgagt gaaaaagaaa tggcaattca 13500
gtttgaaaaa atatgcaatc agttattcta aataattaat atgaatatta aaagggtgcg 13560
ttaattataa aatatgctaa ttataagtcg ggataaaaag aggtagatat gataataaag 13620
aagaaagata aatattatat tatgctattt ttattatttg ttgttgattt aaatttttta 13680
aatttaatag atactgcaac atttaatatc ggaggaattt attatacaga cattgtattt 13740
ctattaaata tttctgtatt tttatatcag attataaagg ataaatttca aattgcaaaa 13800
aatataagta ttatctatgt attgtgtgta attatattaa tggtattgtc agcttgtgca 13860
ggtcacataa catataatca atcaatagtg gctggaatag ttgcacagcg agaatgggtg 13920
tcatggatgt tattgattta tccattatct aggtggattc aattacaaaa actttctgtt 13980
gaaggaataa aaaaatgtat aattaattta tgcaatatat atgcgtttat ttgtataata 14040
caatatttat tatatgatgt agttcagttt acatatacta tggttaacaa tagatatgga 14100
agtgttagat tatattttta tacgattttt ttctgttttg ctataggtat agttattgat 14160
gacttgatta gtggatcaaa acgtagtttg aagaattcta caatgcaatg gataaaattg 14220
ctcgcttatt tatttataat tgtatttatt acaaagggaa gaatgcaaac aatttcttta 14280
ttatgtgcta ttatagtttg ctcattaatt agaagagata tgcggataga aaaaaaaata 14340
atactctgtt tattaatagt tgtgctgaca tatgttttta tgaactctac aatgggacag 14400
gatatattgc atgccattat gggtacaagt gaaaatgata ctttatctgt tagagattca 14460
ggaagggtat attatttagg attatataca cagtcatgga aaagaatact ttttggatgt 14520
ggatttgcaa attctaataa tagttatgct ataacgatac ttaatccttt gtggcaggaa 14580
tatggaagtg cgagatacta tttggaagac gtaggaattt tatcaccatt aataaaatat 14640
ggattggtag gaatagtatt ttggattggg gttgtaatta aaaatatatc actttcatat 14700
aaaatttata ataaatctgg agaaatggta tatttacagt ttttatttat ggatttaatt 14760
gcttgtgcga cactggtacc tacaatgttt aatacaacaa tcctttttcc gcttattaca 14820
atattaatat tatttagagc aaaggaattg cagataataa gataagtaag gagatgaaaa 14880
tatatgaagt gtgtaatgag catactaaac tataatgaca gcaggcgagc cttagaatta 14940
gcagaacgat gtgtggaatt tgattcaata gaaagaataa ttattgttga caataaaagt 15000
acggatgatt ctgtaacatt tctaacagag agagtaggat caaatataga attagtagtt 15060
gcatctgaaa ataggggttt tgctgctgga aataatatag gagcaaaata tgcacaagaa 15120
aaatataatc ctgaatatat actatttgca aatacagata cgattttctc agaaacggaa 15180
gtaaatgcat gtttgagtaa attgaaagca aaagcagatt tgggattgat atcgatgagg 15240
ataaaagata taaagggaaa tgaagaaaaa tcagcatggc attttaagtc tttcttagac 15300
tatacattat ttaatatttg gatatataga catattacat acaaaaaagg cgtgtataaa 15360
agtttttcca atgattttca atatgttgat attgttagag gaagttttat gttatttaaa 15420
atgaaagcac ttatagaagc taactttttt gatgaaaata cgttcttata ttatgaagag 15480
gaaattattg catatcgttt acgtaaacat ggatataaag ttggattatt gacaaattat 15540
ttctatatac acaatcatat tgcaagtggt acaggaaata tatggtttat aaaaaaacac 15600
ttagatgctt cattgagatg ggttttgatt aattactata atattggaaa tgtaaaaata 15660
agaatatttg attttgcaac taaaatttgt agttgtgaga cttttttaat agaaaaattg 15720
aaacggagag gaaaatgatt aagttaggta agaattctaa agtatatata gttagtccat 15780
atcacaatac gggaggtcca aaaagtttac atcagttggc taataacttg atagaaaaag 15840
gtattgatgt atacatagtt tattactgga atgggatatt tacaggggag aaagaaatat 15900
tattttcttt ttgtaaagca aaacttgctg attgtatttg cgatatggaa gaaaatattt 15960
taatagtttc tgaatctcag tctgaagttt taaatcatta taataaaata acaaaatgca 16020
tttggtggtt gtctttagat ttctatttga ctagtagttt aaagggaggt gttcaaaaag 16080
caattcagga aaaaggatta ccaagtttta tgcagcctat aatgttttta aaatttatta 16140
tgaatgatcc taaatgtttg aagaacttaa aaaatttgga tgataaaaaa ctacaaaata 16200
tttatcatat gtataattgt gaatatgaaa aagaatattt aattaaacat gatgtgccag 16260
aaagtaaaat gtcatactta tgtggacctt tagaaaaaca gtattacaaa attgattatg 16320
agactataag gaaagaaaaa caaaatatgg ttgtgtacaa tccagctaaa atggatatgg 16380
attttttgga aaaagttaaa gatgagttat actttttaaa taagaatatt aagtttgtgg 16440
ccattgaaaa gatgagtagg gaacaggttt atcagatttt aaaaagggca aaagtatatg 16500
tggattttgg ctttttccct ggtccagaaa gaatgccaag agaggcagtt gctttatact 16560
gtaacattat aacatcaaca aaaggaagtg ctgagaatga cattgatgta gcgattccaa 16620
gaaaatttaa atttaatatt aaggaaagag aatctgttca aaaagtagtt gaaatgatag 16680
agaaaatgat tactagttat gatgattatg ttgaatatgg acagaattat cgagaaaagg 16740
tatggaatca gattcaagat ttttcaaaca caatttctaa tatttttgag gtggataatg 16800
aaactcgata gaacaaacaa tgcctttaga aatatgaaat ttggtatggt taacaaaatt 16860
gtaactttat ttttgccatt tatcataaga actgtattaa ttaaaacaat aggtatggaa 16920
tatgcaggac taaattcatt attttcgtca attttgcagg tgcttaattt gtcggaattg 16980
ggatttagta gtgcggtagt atatagtatg tatcgcacga ttgcagaaaa tgatgatcga 17040
caagttggtg cgctactatt attttataaa aaagtatatc ggattattgg tttagtagtt 17100
ttatcagttg gattgattat aatgccattt ttaccgaaat tagtagaagg tggttatcca 17160
ccagatataa atgtttatat tttgtatctg atttatctat tagatacggt tgtgagttat 17220
tttttatttg cttataaaag tgccatatta aatgctcatc aaagagtaga tattgtaagt 17280
aatattttaa caattactca aggtgcaatg aatttggtac agattataat gctaatagca 17340
ttcaaaaatt attactgcta tattatatgg atgcctttat ttacaatctt aaataatatt 17400
atgacagcct actgtgtaaa taagctattt ccacaatatc attgtgaagg aaagatttct 17460
agagatcaat tgtctgatat gaaatataag ataagtggtc ttatggtgaa taaattgtgt 17520
ttaacaacaa gaaatacttt agatagtgta tttatatctg catttatggg acttacagtc 17580
agtgctattt acggaaacta ttattatata ttgaatgccg ttataggctt gatttcaatt 17640
gtttcaagtg cgatgcttgc cggggttgga aatagtatcg aaacagaaag tgtagaaaaa 17700
aattataatg atttgaaaaa attcaatttt ttatatatgt ggcttagcgg atggtgtgct 17760
atctgcatgt tatgtttgac acaaccgttt atgacaattt ggatgggaaa agacaatcta 17820
ttcccatttg gtgttgttgt tcttatttgc atatattttt atgtgctaaa aatgggagat 17880
atgcgaggct tatattcaga tgcagctgga ttgtggtggc agaatagata tagagcgata 17940
agtgaatcta tattaaacct tgtattaaat tatgttttag tgcaggtatg gggaatttat 18000
ggaattatta ttgctacatt aatatcatta ttttttatta attttttggg tggaagtgga 18060
attgttttta aacattactt taaaaatggt aaatttatag aatttttgaa atatcatttt 18120
ttctatatgc ttataaccgt aataaatgct tccatatgtg tgtttctaac aaattttgtt 18180
aaatatgaag gaattattgg attggggctt agagcaataa tatgtgtaat tatccctaat 18240
gtaatatatg cattggtata tttgaatata tcagaattaa ggcaacagtc aaaatgggta 18300
ttaagtaagt taaaggtaag gagatgattt tgatgaaagt tggttttata tcaaactctg 18360
atgtttatga taaacgagcg tggagtggga caataaattt tctttatgaa acattaaata 18420
aagaatatga tatgtatccg attgtgatag aacataaaat aattcaaaaa atgtcacgta 18480
taattacaaa gggaaaacgt aagtatacct tattagatag ctttttttat aaattagata 18540
ttaacagaaa aataaaaaaa gctcaaaaaa aaggcataaa agtgtttttt gctccggcag 18600
cttcaacaat attaggggtt gcaaagattc ctatagattg caaagtagtt tatttgagcg 18660
atgcaacata tcattgtatg ttaaattatt attactttaa tgaaagcaaa ggagatataa 18720
aaaattataa tagagtagag cagaagtcat tgtgtagagc agacaaagtc atattttcaa 18780
gtgaatgggc taaaaacgat gcaataatat attatggagt ggattcaaat aagatacatg 18840
tcttaccatt tggagctaat ttagaagata aatatagcgg acatgatatg ggagatatag 18900
tgaaaatcct atttgttgga gtagagtggg aaagaaaagg ggcagaattg gctattgaat 18960
gcgtaaagaa tttgaataga agaaactata aaaagcggtt tgaattgacg attatcggat 19020
tagaaaaacc agaaaaatat ctggctgatg acagtattca ttttgtaggg agattaaata 19080
aaaataatcg ggatgaatta aattgcatga ttaaatatta tcaacagagt gatatttttc 19140
ttttacctac taaagccgaa tgttctgcta ttgtgtttag tgaggccgct atgtatggat 19200
tgccagtgtt tactcataat acgggaggtg ttatgactta tgtagaagat ggcaaaacag 19260
gcagaggatt aaagttagga tcaaaagcag aagacttcgc tgatgcaatt ctgaagatgt 19320
taaatgaaga caaatataaa gaatggtcaa taaacgcaag aaaaaaatat gagaaagaat 19380
tgaattggaa ctgttgggtt gagaagtgta aagagttaat tgaaaattaa gatagtatat 19440
ttggtaaaga gaagaggaat aaatatgagt caagagtata aaatagctgt tgctggaact 19500
ggatatgttg gattatcaat cgctacactt ttatcacaaa atcatcgggt ggttgctgta 19560
gacattgtaa aagagaaggt tgatttgatc aataaaagaa agagtccaat tcaagatgaa 19620
tatattgaaa aatatttgtc agagaaaaag ttaaatttag aagctacatt agatgcagaa 19680
tatgcctaca aggatgcaga ttttgttgta attgcagcac ctacaaatta tgatagcaaa 19740
acacaatatt ttgatacttc agcagttgaa gcagttataa acatggtaac taaactaaat 19800
tctaatgcgg taatggttat taaatctaca attccagttg gatatacaag agaaattcgt 19860
aaaaaaacag ggaataataa tataatgttt agtccagagt ttcttagaga aagtaaggca 19920
ttatatgata atttgtatcc atcgcgtatt attgtgggaa ctgatttaca cgatgagaaa 19980
ttaattgaag ctgcccatat ctttgcagaa ttattgaaag aaggggctat taaggaaaat 20040
attgacacac tttttatggg atttacggaa gcagaagcag ttaaattgtt tgcaaataca 20100
tatttagctt tgcgagtagc atattttaat gaattggata catatgcaga aagtaaaaat 20160
ttgaatacaa aacaaattat agaaggtgtt tgcttagatc cacgtattgg tacacattat 20220
aataatccat ctttcggata tggtggatac tgtttaccta aagatacaaa acagttattg 20280
gctaattatg ccgatgttcc agaaaatcta attgaagcga ttgtggaaag caatcataca 20340
cgtaaagatt ttattgctag tcaagtttta aaaattgcag gatattacaa ttatgaagat 20400
gatatagagt atgataaaaa tcaagaaaaa aaagttgtaa taggtgttta tagattaaca 20460
atgaaatcaa attctgataa ttttcgccag tctagcatac aaggtataat gaagcgaatt 20520
aaagcaaaag gtgcagaagt tgttattttt gagccaacat tagaaaatgg aagtactttc 20580
tttggctcaa aaattgtaaa taatttaaaa caatttaaag aacaaagtca agcgattata 20640
gctaatagat atgattcttg tctggacgat gtaaaagaaa aagtatatac cagagatttg 20700
tttagaagag attaaatttt gcaaatggta aagggggaaa tgatatttgg ctaagcagtt 20760
atttaaaaaa acaaaattat taatggtgcg tttgcttttt acaagataca gagctggttt 20820
tttaaaaaaa agtaatattt ttcactcatt tggagatggt gttttgtggc agccatatac 20880
aattccaagt gagccttatt taatgtcgat agggaataat gttaaaatta cagctggagt 20940
caggtttatt acacttgact ccagccgttt ttggaatggc tggttatcca gtacatgacg 21000
actgtttata ttatatggat aaaattatta ttggagataa tgtaatgatt ggtgcagaca 21060
gtatagtaat gcctggtgtt aaaataggaa ataatgttat tatagctgct ggaagtgttg 21120
taacaaagga aattcctgat ggagtagtag cggggggggg ccctgctaag gtaataggag 21180
gatttgatga attagctaaa aagcgatatg aacaatgtaa tcatagacca tttctagccg 21240
ctcaagaggg tctcctttta aacttaattt ttcaagatag tcgctatctc caaataaatt 21300
catcttcaat acctcgttag cttattttta atctattata cctcttttca tacgttttta 21360
gaggtgccct taaacttata tctttattta tactgacgtt aagtcaagtt tttagagtaa 21420
caaacgtaga attttaatct atttatttta agcacttacc ttttcaaatt gattaggtgt 21480
aagataccct aaactttgat gaattcgtct tgataaaacg tttcaatgta ccaaaaaata 21540
ctatgatagg ctgcttcaaa ttttttgcag tagttcctca gtcattcgct tacccaaatt 21600
ccaagcaatg attttttagc ataacaattt ataatagttg agagataaca ccaccctccg 21660
aagagtggtc ttgtaaggaa tatcggttga ccaaacctta tttttttctg tcgggtaagt 21720
atgtatgata ttctttcggt taacaggatc atatatgaat atcctggttt aaattttttt 21780
atgaccacag aatatagttg aagttgcttc attaactttt gtaccagttt taaactgttt 21840
ttttccatgt ttaagcaaga ggtgagaatc ttaggcactc tatagatgac ttgttcttcc 21900
ttaactttgg ccggtgtctt tttaattaca tttcatctac ttttgagaca tagatttcat 21960
taaacttgga gtagaggtca tactctttag aaagctgagt gatgaatcgt tctgagtggt 22020
aggactcgac cagggtttct ttaaattctt ttgaatatcg tttttgcatg actttttact 22080
ttgtctaaat tatacaattc ccagtctaag atgtaagaat aatatcagac tacgcaatat 22140
tttttatatt ttgtatataa tttagtgatc actaggatat cactaaatta ctctaacttc 22200
attatattac ctagaccaaa tgtgttatag cgaaataagt aaaaattata gttattactt 22260
ttgatattat tccagaagaa gtgaattaaa tttagttcag tagtagatag aaaattagag 22320
tttttttggc aaattgagat ttgaatttgc ttgcaatgag agctgcagtt attgcgctta 22380
agcaaatcgg ttgttgtaac agcaactaat tatgatgatg caaggacata ttttaacata 22440
cgttttgtac gtgatttaca tacattcgaa aagtttagat ttgatcgtga ctaaacaaca 22500
attttggagg aaaaattgga acgaaaaaaa aagaaaaaaa agaatatttg ggttataatt 22560
atacctatct taatttttat tacccttata ggagcagggg cttatgcctt aagaaattca 22620
cttattccta ctgatcatac gaaaacaaat agttcggatc aaccgaccaa aacttcggcc 22680
tctaacggtt atgtagagca aaaaggggaa gaagctgctg tgggtagtat agcacttgta 22740
gatgatgctg gtgtaccaga atgggttaaa gttccctcaa gggtaaatct agataaattt 22800
actgatttat ccacgaataa tatcactatt tatcgaatta acaatccgga agtcttaaaa 22860
acagttacca atcgtacaga tcaacggatg aaaatgtcag aagttatagc taagtatcct 22920
aatgctttga ttatgaatgc ttccgcattt gatatgcaga caggacaagt agctggattt 22980
caaattaata atggaaagtt gattcaagac tggagcccag gtacaacgac tcaatatgct 23040
tttgttatta acaaagatgg ttcgtgcaaa atttatgatt caagtacacc tgcttcaact 23100
attattaaaa acggaggaca acaagcctat gattttggta ctgcaattat ccgtgatggt 23160
aaaattcaac caagtgatgg ctcagtagat tggaagatcc atatttttat tgcgaatgat 23220
aaagataata atctctatgc tatcttgagt gatacaaatg caggctatga taatataatg 23280
aaatcagtgt caaatttgaa gctccaaaat atgttattac ttgatagtgg tggctcaagt 23340
caactatctg tcaatggtaa aacgattgtt gctagtcaag atgatcgagc cgtaccggat 23400
tatattgtaa tgaaataaaa aaaagaacct cttggttctt ttattttaga gatttttcaa 23460
aaagggtttt gactgagtct aattctgttt gagaaacgac cttagctcca ttttcatctg 23520
ttgtatgtag attgagcttg ctggtattct ttagagcctt attgtagctc aaaacgatcg 23580
ttttagcatc attgaaactt atattggttt taacagtgtt tgaaaaagca tagagaattt 23640
ctcgatagta attgaaactt tcaagttttt tcatatgagc aattttttga atatttccgt 23700
agagttccat tgagacattt tgaattcgag tgattgaagc atccaaatca gtatcgtcaa 23760
tttgtgtcat ataggcttgg acttgatcag cagtttgtaa attaacagtt ccttgtttaa 23820
actcataacc ttcagcattg aatgcctttg gattttgcat ggtgattcca ccagtggcct 23880
gtacaagtga tcccatttta ttaacatcaa tctgaattac tttgttaatg gacacattca 23940
ataggtcttt aactatctgg aaaattccat catctccatt cgtattgtaa acttcagtga 24000
ttgttttttg attaggcatt gtcgcaaaaa ctgggaaatt catgaaagta gtttgatttg 24060
tctttacatt cgttgaagct aaaacagtag cataagctgt atttttagaa ttatttttac 24120
cagttgcaat gataagtgtg gtgaatgttt tagacttttt taagtcgata cttgttgttt 24180
tagggaaatt ttcatatgat gttgaaaatg ttgattcaac atttctataa gcggcgtagg 24240
ctatagaagc aacagcaata attactaata caaaaataat tgaaataact tttagtactg 24300
tgtgtttttt cttacgataa tgacgcctct ttttttgatt catggtatct ccatatatat 24360
acat 24364
<210> 223
<211> 20584
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33137 eps gene cluster, complete sequence
<400> 223
atgaatgatt tattttacca tcgtctaaag gaactagttg aagcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattga tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat ttataattgg aggtttttat 420
agttataatt ctaggataag taatctttca aaagctgata aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa agtattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatccaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcact gactagtaat 840
gagcaactta taacaaattt ggctagtgca ggagcggagg taatagttca accctctcca 900
ccgatctatg gtggtgttgt gtaccccgta caagaagaac aatttaaaca atctttatct 960
acaaagtatc cctatataga ctactgggct agttacccag acaaaaattc tgatgaaatg 1020
aagggactgt tttctgatga tggagtatat agaacattaa atgcttcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaagaaat gcaggaaaca caggaacaaa cgattgattt aagagggatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattc ggataattca gcagcctacg ctggacaagt gaccgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc tgatgattct ttccaaaaac aagttacagc agcaaatcaa 1500
acaaattcac aagttattac gcttactgtt aaatattcta atccttacat tgcacaaaag 1560
attgcagacg agactgctaa aatatttagt tcagacgcag cgaaactatt gaatgttact 1620
aacgttaata ttctatccaa agcaaaagct caaacaacac ccattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ttagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct gggactcacg 1800
gttcttggtg taacaaccta tgctcaaatg agtgatttta ataagaatac gaataaaaat 1860
ggcacgcaat cgggaactaa gtcaagtccg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaacaat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ccgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caagggatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcatacgag agtgctaatc tagctgttgc ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacctaatc catcggaatt attagcatct agtgcaatga agaatttgat tgactctgtg 2400
tccgattcct ttgatgttgt tttgattgat gctccacctc tctatgcagt tactgatgct 2460
caaattttga gtgtttatgt aggaggagtg gttcttgttg tacgtgccta tgaaacaaaa 2520
aaagagagtt tagcaaaagc aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtagag 2640
taattggaat aaattttaat caaataaaag tcagaaattt gtagaagagg ggagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caactatcac tgcaactcct 2820
catcataatc ctcaatttaa taatgaatca ccgcttattt tgaaaaaagt taaggaagtt 2880
caaaatatca ttgacgaaca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaagggaagt tactgacagc agcgggcact 3000
tcaagttata tattgattga atttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggacttcaa cctattttgg tccaccctga gcgtaatagt 3120
ggaatcattg agaacccgga tatattattt gattttattg aacaaggagt actaagtcag 3180
ataacagctt cgagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt gacgtcacgt 3300
gcatttaaga tgaaggaagc atttgaaatg attgaagata gttatggttc tggtgtatca 3360
cgaatgtttc aaaataatgc agagtcagtg attttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa gaaattttta ggattatttt aaaaggatta aagggagtaa 3480
ataatggaag tttttgaggg tgaatcatca cctgaatcgg aagaacacaa attagtagta 3540
ttaaaaaaat tttcttatgg agagctgatt ataaaaagag caattgatat cctaggagga 3600
ttagcgggtt cagttttatt tcttatcgcg gctgcattgc tttatgtccc ttacaaaatg 3660
agctcggaaa aagatcaagg gccaatgttc tataaacaaa aacggtatgg aaaaaacggt 3720
aaaatttttt atattttgaa atttagaaca atgataatta atgctgatca gtatttagag 3780
ctacatccag aagttaaagc tgcctatcac gccaatggca ataaactaga aagtgatccc 3840
cgtgtaacga agattggttc atttattaga caacactcaa ttgatgaatt accacaattt 3900
atcaatgtcc ttaaagggga tatggcatta gttggtccaa gaccaatttt actttttgaa 3960
gcgaaagaat atggggagcg cctcccttat ttactgatat gtaaacctgg gattactggt 4020
tattggacaa cacatggtag aagcaaagtt ctttttcctc aacgagcaga tttagaactc 4080
tattatctcc aatatcacag caccaagaat gacatcaagc ttcttatgct tacaattgca 4140
caaagtattc acggatcgga cgcttactaa aaaatgaagg aaaaacatat ttacattatt 4200
ggttcaaaag gaattccagc aaagtatggt ggttttgaga cttttgtaga agaactaaca 4260
gcacatcaaa gtaataaaaa ccttaagtat catgttgctt gtttatcaaa tgacatacaa 4320
tcaaatttta ttcataatgg tgccgactgt tttaatattc caaagaaaaa tattggacca 4380
gcaaatgcca tttattatga tttggcagct ttaaagtact cacttaaaga aattgaagaa 4440
aaaaattata agggtgcaat tatttatatt ttagcttgcc gcattggtcc gtttattggt 4500
tactataaaa agcaaatgaa aaaattagga attactttga tggtaaatcc tgatggacat 4560
gagtggttgc gtgcaaaatg gagtgtacct gtaaaaaaat attggaaaat ttcggaacaa 4620
tacatggtta aaaatgctga cttattgatc tgtgatagta agaatattga gacctatatt 4680
caagaatctt atgcgaaata taatacaaaa acaacttata ttgcctatgg cgcagattta 4740
gctccaagtc ttttgaagga taatgacgaa aaattagtaa attggtatca agaaaagggt 4800
ttggaatcta atggatatta tcttgttgta ggtcggtttg ttcctgaaaa taattatgaa 4860
ataatgatta gggaattcat gaagtctgat acaaaaaaag attttgtttt aataacaaat 4920
gtagaacaaa ataaatttta tgatcaactt aaacagacaa ctggatttga taaagataaa 4980
cgaataaaat ttgtgggaac tgtatacgat aaagagttaa tcaaaaaaat cagagaaaat 5040
gctttcggat atttccatgg tcacgaagtt ggaggaacaa acccaagttt aattgaagca 5100
ttggcgtcat ctaaattaaa tctcctgctt gatgtaggat ttaataaaga agtaggtgaa 5160
gatgctgcac tctattggaa taaagaaaag caaaatcttg cacaattaat taaacatgta 5220
gaagaaaccg attattctca catggaaagt aaagcaaaag aaagagttca acattatttt 5280
tcttggaatt atattgttgg agaatatgag aaagtattta cgaaataatg attttctact 5340
gtgtagtttt atataataaa aaaattgacg aagcaattac tataaaaaat ctatttgagt 5400
gtaatttaga taacagaaaa attgtagtat ttgataatag cgataaatta gattttagag 5460
aatataattc aaaatattat aacgaaaaaa tatattgttt actacaatct tccgataaaa 5520
atgttggcct atcagcagcg tataatagaa tattagaaaa attatgcaga gagtttgata 5580
tagaaaacga aaagcaatat gtttgttggt tagatgatga cacagacata tctcctgaat 5640
ttttaattaa acaagaaaaa gccataaatg agaattatga tattattgtt cctaaaatta 5700
taggacaaga tggaattgtc tattctccaa atgaagcagg aaaaataaaa aataatttag 5760
ttttaaataa ttcaaataaa aaaatttcaa agttaaaatt caatgctatt aatagttgtt 5820
taacagtaaa aacttcaatt tactcgaatt ttaaattcga tgaacattta tttttagatc 5880
aggtcgatca gctatttttt gacaatttga gaaagagaag gttttcttat aagatagttg 5940
atgtaacaat tgaacagagt ttttctcaaa gaggagcagc aattggagaa agctatataa 6000
atagatttcg aattagagtc aaagatataa tgcaatatgg aagattatct cctaataata 6060
atatttttta ttcatatctg aaaaatattt tactaggatt taactttttc agaaaaacct 6120
ggaaattgtc ttatattaag ataggtataa tttcaatttg gagctataag aaatgaaaat 6180
attattttgt caaacccaat ttaaaatggg gggacaacaa aaagtactgc tttctattgc 6240
taaagaatta aataagaagc atgaagttac agtctattat gaaaatcata atttttttga 6300
ttttgaagat ttaaatatta taaaagcaaa gaggagtttt caggttctta atttcttttt 6360
ggcaatcttg atatgcagct tcaccttaaa atttgataaa aaaacaatta tcgacacatg 6420
gcatctatac aacgctaggg attcattgcg taaagaaaag tatgatattg tagtcttatt 6480
aaatccttat gtcttgtttg tagatgaatt tagaaaattt atagacactc aaaaaattat 6540
ttgttggaca cataatttat ttgaagatta tatgtttaat agatttaaaa cagaacaaag 6600
taaattgaag ctaagtatgt ctcatgcaga taaaattatt tcactagaaa agtataccgc 6660
atcaaaatgg cgtgaaataa ataaaaacac tgtagttatt cataatccct tgacaattaa 6720
aaatgaatct ggatacaaca aaaaaaattc aaaaaaaata ggaatggtta ctaggattga 6780
tattaatcaa aaaggtttgg atattctagt taaaatagcc aaattgttga acccttctac 6840
acaggttttt attgctggtt caggtacaaa gagtgaagaa ataaaatttt ctaatttatt 6900
aattgaaaat aaattagaaa aacagataat cttactagga agcttaaaag gcgaagagtt 6960
agtggaattt tatagtagtt tggccttatt gttggtaaca agcaggaatg aagggtttcc 7020
tttagtagta gcagaagcaa tgagttttgg aactcacatt ataggttttg atattccttc 7080
aatgcgagaa gttacagcgg gaggacaatt tgggacatta attccttttg acaatactga 7140
actatttgct aaaaatattg aggatttaca aaataggttc ttgtccaaag attttgattt 7200
aaagtctgaa gaactagtaa gatatgcagg gaaattgagg gtgaataaaa taataaaaga 7260
atgggaagaa gctcttacta aataaatgag taaaaaatta gggagaaagt gtgagtataa 7320
caaagataaa aaataatata ggaattttta tttttacaat tttagtcatt tcaacatttg 7380
ctactggggg aattctgtca gaattagatg aaattgctgc ggtatgctca tttctaattt 7440
tgattttttc ttttttttgt tcttgtttga agaaaaagat atccaagata tttttaatta 7500
ttttattatt tattatattt ggattgattt caaacataat aagtgggata gataggagtg 7560
tttttgatgt acttatagat attgtgactg ttgggaagcc attttggatt tttttagcta 7620
tgattcaact cgtatctcca gatacttttg gctggttacg gagaaaattt tctcttataa 7680
taaaaatatt tattttgatg ttatatattt ttctctttct caataccctg catattgttc 7740
aaatggggaa tacattttta ttttctcaaa ttccgaattt tagtttcact tttggctttc 7800
cagtgccctt tgctattgtt ttatactgtt gtataggatt tttacttaaa aatgatataa 7860
acatagtgag acaaccatgg attatttctc taatattctt gataatcttc actggtaaaa 7920
tgcaatctta tatttttgta gtcattttct taggcttttt gtctataaga acatacagag 7980
agaaaaactt aaaaataagt agattaattt tgataggctt tataggcgtt ttaatttcac 8040
taccaaaatt agtgaactac tttgcaacaa catcttattc tccgagaaag ttattgatga 8100
ctgatggatt tggtttcgct ttaaaatatt tcccatttgg aacagggttt gcaacgtttg 8160
gttctgctat ggctagtaaa gactattcat cattatatta tcaattaggt tatagtaatt 8220
tttatgggat gcaacctggt ggtggtttag gttccttttt gaatgacaat tggggggctt 8280
cattaattgg acaatttggc tttttaggaa caattttatt tagtattata attgttagaa 8340
tatacttttt aatgattgaa tattggggga acgaaaaaat tggtttatat ttaatatcag 8400
gttttactgg attaatttct ataattatag gatcaagttt ttttacaggt gcttcaggag 8460
cattattaat ggcaactttt ggaataatag tttcttatag aaaaaatgag atattgccat 8520
aaatttaata ttggtattga cgttaagtca agtttttaga gtaacaaacg tagaatttta 8580
atctatttat tttaagcact taccttttca aattgattag gtgtaagata ccctaaactt 8640
tgatggattc gatttgaatt ataaaaggct tcgatgtacc agaaaatact ctgataggct 8700
tcttcaaagt tcttatattt aaattgatac acccactctc tttttaaatg tccatgccaa 8760
gattcaagac tggcattatg ataagggtat ccccttcgac tgaaagagtg agtcatccca 8820
taatacttaa gcaactcttc atactctaga ctcgtatact ggcttccttg gtcagaatga 8880
agaataacag cttctggata gtcttgtgat ttaatggcct tatttaaagt tctttgcact 8940
aattctacag tcattcgctt gcccaaatcc caagcaatga cttttttagt ataacgatcc 9000
ataatggttg agagataagc ccatccttgt tgagtaggaa tataagtaat gtcggttgac 9060
caaaccttat ttttctttgt aggttcagtc tgtatgagat tttttcgatt gatgtgatca 9120
cttagtgagt atccaggctt aaatttctta atgactacag acttgagttg aagttgcttc 9180
attagcttct gtaccagttt taacccgact ttttcccctt gtttaagtag aagatgatga 9240
attttaggag caccatagat ttctcggtta gcattgaaga gttgagaaat tttgagtgac 9300
aggtattgtc tccttaattg agttttagat ggttgtcggt taatccgttg ataataactt 9360
gattcaggaa catcaaggag ttgacagctt agtctgacat tgagtgctaa agtttgtatg 9420
gtttgagcca tatccgcagc actcacttct ttttctcggc gaatatggtc aatacttttt 9480
ttaagatgtc tcgttcttcc ttaactttag ccagttgtct ttttaattct agaaaatcag 9540
ctttagagac ggagctttca ttagatttag agtagaggtc tatccattta taaattgttg 9600
caggggccac gtcgtattct ttagacagct gggtgacgga ttgaccagaa tgatagaagg 9660
cgataagggt ttctttaaat tcttttgagt agcgtttttt catgtttttg tcctttgtct 9720
aaattataca atagtgactc taagatttaa ggataacatc acttttcttt tgaccttaat 9780
tttaagattt ttaaacggag tgaaaaaagg cgaagcctat atatatattt atcttatata 9840
ttttaatctt ttgttctttt gtgtcaaaaa aagtcattgt tttcaagggt ttacagaata 9900
atgtactgac aaaaaattaa aaaagtcagt gaattatggt ataataaaag catgaagaaa 9960
aattctttgg acggaatttc ttcatgccaa ctacaaaaca cgcaaattag cgtatcttta 10020
tgtctattat accaaaaaat gaacaaaatc agaagcaggt gcaaatcttg aatgaacttg 10080
aaaaacgcaa agtagtagag aataatgcat taattaccag tgttgcgaaa atgtaaaaaa 10140
cagctttaaa aatgtttgag ttagcggtat cttgcatttt tccccacatt atgagatgta 10200
tatcactggg acacaattta atattgcaga tcaggggcag ggacgtgtgg ttttggtcaa 10260
agtatttatc ttgctcattt tgcttacttt attcttgttt tataaaaaaa gctatgcttt 10320
gatttctgaa cgtcatcaaa gtttgatagc tttgacaacc attggattaa gtatcggtat 10380
tgtattttat aataatattt tactcaatag aatagaaatg ttttattcaa ttttaagcat 10440
agtatttatt ccaattgcta tagattacat tagtttgaaa tttaaagaaa aagatactgt 10500
gcgacaaatg ctgacgatag gtattttttt taattacact tgtgccttac tatatacagg 10560
ttagcggtaa ttattcagga atattacctt atgttattca accataaaaa tataatttaa 10620
agaggaaata atgaaggata gaaagaaaca agcaattttg atactagctc acagaaatac 10680
tctcgctcta aaatcaacaa tagagctttt ggattcccaa tactttgatt tctttcttca 10740
tatagataaa aaaagtagaa ttcaagattt ttttgattta aaaaaaatta caaaatcctc 10800
cactattcat ttttcagaaa gaaaaaatgt acattgggga ggtttttcta tggtagaagc 10860
aatgtttgcg ctattagaat gtgcacgtga tacaggagaa tactcttatt ttcatttttt 10920
atcaggaaat gatatgccaa tcaaagataa tgaaatagta tttaattttt ttgaaaatag 10980
ctatcctcaa aattttattg atattctaga ttttgaaaat gtcaataaaa cttcatattt 11040
ctacgaaacc tctgagatga tagaggagag agtgaagtac tactatcctc atatggatat 11100
tctaaacaga aaaggaaaaa ttttcatagg gaaaaaacta atttatctac aaaaattgtt 11160
gaaagttgat cgcttgaaaa atagagagat agaaattttc aagggtcatc aatggtgtag 11220
tttgacaaat caatttgtag atattttatt ggataaagag gaaagaagag taggtaagtc 11280
ttatttttca tctagtttaa taccagacga atgttatttt caaacgtatg ctatgataaa 11340
aaaagttgaa atttatcaac agaaaaatat gtcagcacgc ttaattgatt ggacgagagg 11400
taagccatat atttggcgac aggatgattt ttttgaaatt atgaatgata aagattcaat 11460
gttttctagg aagtttgatg aaaatgtaga tcgtaaaata attgaagaaa tttatataaa 11520
aataagagga agaagtactg atgaagcaaa taaaatcaaa gataagagat ttacaaaata 11580
attttaccta tgtttttggg aagaaaactt ttcttggaag gggagaagcg attatcatag 11640
atgaacctga gcatggaaat ttgggagatc aagcaattgc ttttgcagaa aatcaatttt 11700
tagtaaatca tgtatcagta cgagatgtag aacatcttat agaaagcaaa actatttcag 11760
aaataaaatc tatgaaaaaa aatattggaa aaaaagaatt agtttttttt catgggggag 11820
gaaatttcgg gacactttat ctaaagtatg agcgcattag aagagtggca gtatcaaagc 11880
ttccctttaa taaaatgatt ctatttcctc agtcaatttc atttgaagat agtaggtttg 11940
gtcagaagca gctgaataaa agtaaaaaaa tatacagtca aaatacaaat tttattttga 12000
ctgcaagaga accaaaatct tatggtttaa tgaagaaatg ttttccagat aacaaagtaa 12060
tcttgacacc ggatatcgtg ctctcattaa atttaacaga acagtataga ggaaataata 12120
ggaatggtat cataacaatg ctcagggaag atatcgaaca aaagcttaat aaaactcaat 12180
ttgaaaaaat tatcaaagag ctgacagata aatttgaagt caccatttct gatacgcata 12240
ttgggaaaga aaaggatagt ggtataactt atgaaaatcg tcaacactat cttgagataa 12300
agtgggatga aattgcgcag catgaggtcg tcttaactga tagattacat ggtatgattt 12360
tttcatatat cacaggcaca ccatgtgttg ttttggctaa taataatcat aaaattgaag 12420
aaacatacaa acattggttg aatgaagtga actatattcg ttttattgaa aatccgactg 12480
ttgaaaatat tttagatgca atcagtgact taaagcaaat cgaacctcac tatattgatt 12540
tatctgataa atttcaacca ctaattgatg cgataaaagg gtaaagggtt aatgaataaa 12600
tacaaaaaac tactatccaa ctcactcgtt ttcacaatag gaaatttggg tagcaaactg 12660
ttagtctttt tactcgtacc actctacact tatgcgatga caccgcaaga gtatggtatg 12720
gcagacttgt atcaaacaac agccaatcta cttttgccac taattacaat gaatgtattt 12780
gatgcaactt tacgttttgc catggaaaag tcaatgacaa aagagagagt gttaacaaat 12840
tctcttgtag tatggtgttt tagcgctgtg ttctcttgtt tgggcgcttt tattatctat 12900
gcgttgaact tgagtaataa atggtattta tctttacttt taaccatcat cttattccaa 12960
ggtgggcaaa gcatactaag tcaatatgcg agaggcattg gaaaatcgaa attatttgca 13020
gctggtggag ttattttaac ctttttgaca ggcgctttaa atattctttt tttggtatat 13080
ttaccgcttg ggattacggg ctatttaatg tccctggttt tagcgaatgt aggtacgatt 13140
ctattttttg ctggcacact ttccatttgg aaggaaatta gttttaaagt aattgataaa 13200
aaactgattt ggcaaatgct ctattatgcc ttacctttga ttcctaatgc catcatgtgg 13260
tggttactga acgcatctaa tcgctatttc gttttattct ttttaggagc aggtgctaat 13320
ggtcttttgg cggtcgctac caaaattcca agtattattt ccatttttaa tacgattttt 13380
acacaggcgt ggcaaatttc agccatagaa gaatatgatt ctcatcaaaa atcaaaatat 13440
tattcggatg tttttcacta cttagcaact tttctattgt tagggacatc agcttttatg 13500
attgtgctta aaccaattgt cgaaaaagtc gtttcaagtg actatgcaag ttcatggcaa 13560
tatgttcctt tctttatgct ggcgatgcta ttttcctcgt tttctggatt ttttgggact 13620
aattatattg cggctaaaca aacaaaaggc gtatttatga catctatcta tggtgccatt 13680
gtttgtgtct tattccaagt ggttctgcta cccaccatcg gcttgaatgg cgcaggttta 13740
tcagccttgc ttggattttt aacaacgttt ttattgcgtg tcaaagatac gcaaaaattt 13800
gtggcgattc agattaagtg gcggattttt atcagtaatt tattgatcgt tttggcgcaa 13860
attttatgtt tgttttatct accgagtgaa tttttgtatt ttgggcttgc cctattattt 13920
tgtggcatgt tagtggttaa tcagcgtaca attttataca ttatcatggt gctaaaaaat 13980
aagacatttg gaatgaaatc ctcataaaaa tagacaggag atatatctcg atacacctcc 14040
tgtctatttt tatgctactc ttgggttagc tcaactcaac cgccttttaa tctcccaaca 14100
acaataatac ccaatcaaac aacccaaaaa attcaagata atatcactaa tggcaaatgt 14160
gcccaaataa aagataaatt gaatggtttc aattcctaaa agtgtgacca aactgacaat 14220
gacaaactgt ttgaaatcag tattgataca gtaaaggcca cctaaaggaa tgaagtagat 14280
aatatttagc actgcctctt gaatcgttct ggtatccgct tttataaagt caaaaggatt 14340
cagtgatatc gcctgaaaat ccgttgtttt agtaaaaagc accatgaata acagtaataa 14400
atacacactg aaagcaagat agagataaat aactgaaaaa tgtttgaggt gatactggat 14460
gccaaacaac cagataatca gcgttaataa gagtattaaa gttaatgcgg tatagtcaaa 14520
gtggttaatc aacctagcca ggctttgata gcgagtgaga acgggcataa tcagccaagt 14580
aatcgtcgca taactcagga taaatgtgac caataaactg ctgaggtaga tcatatattt 14640
tcgcaactgt ttctaactcc ttttcttgat gagattaacc ctattttaac atattttaaa 14700
actgtcatgt ttttatgaat ttaaaataaa gggcacctct aataactacc aattaattca 14760
cctctaaatg aaattagaaa aaaacacaga aaactaagca tagtctactg tgtttttctt 14820
tatgttacag acgccacttg actgttaatt cttattttag acagattacc tccatgtttt 14880
gagatttaaa cgtttaactt gttcgtacct caataagtta taagtcagat ttgtaaggtc 14940
agtatttaaa acagctcgat cgaatccaat agaacgtaaa ctagaaccat gcattgaatt 15000
ttcaacaaag gcaaatacat gttcaatacg gacacgtatt ttcgaaatga ttttattaaa 15060
cattttatcg tctgccttaa gagatttaga acgtgtgttt ttaagacagg taaaaagttc 15120
agcaccttta ggtgttgctt gattttgata agctgagtct gctaaggtga tctcatcggg 15180
atcaacaaga acgcctatta cttgagaatc atgcacattt gcaggcattg tttgataatt 15240
ctttacgaat tttgatttgg tatctatagc aatatgattt ttgtagccat agtgtctctc 15300
attaccttta attgtccagc gagcagctgt atcctcctgt gctcttttgt tcttcgtcca 15360
attcactggt accctatttg ctttaatcaa ttcattttcg tcttttggat tacgttgttt 15420
aggcgcctct atgaatgttg catcaacaat ctgtccttta tgagcgatca tcccttgtga 15480
ttcaagtttt tcttgaaagg cagaaaaaag ccagttccct ctattagatt ttgatagctg 15540
atttctaaag ttccagatag tctttgcgtc tggtaccttg tcatcaatct taaggaatct 15600
tctaaacgaa atccggtcaa tcatttgata ttccattgca tcatcagata gattgtataa 15660
gcgttgtaaa attagaattt tcaacattaa aacaaggtca tagggcggtc taccaccatg 15720
agatttgttt tttaaatcat acttaaaaat ccggttaaga gttgggcgaa aacactcgaa 15780
atcaaccact ttttctagcc gctcaagagg gtctcctttt aaacttaatt tttcaagata 15840
gtcgctatct ccaaataaat tcatcttcaa tacctcgtta gcttattttt aatctattat 15900
acctcttttc atacgttttt agaggtgccc ttaaacttat atctttattt atacgtaaaa 15960
tttcttgcag tttaaagttt gaaacccata ccttcacttg cattaaaaga ttttatatta 16020
tataatatat taattaaata caagtatatc tactgtgtga tgttaagcca agtccttaga 16080
gtcaagagtg tataatttta atatattttt taagctatca ctttttcaaa ttaattgggg 16140
gtaagatatc ctaaactttg atgaattcgt cttgataaaa cgtttcaatg taccaaaaaa 16200
tactctgata ggctgcttca aattttttgc agtagttcct cagtcattcg cttacccaaa 16260
ttccaagcaa tgattttttt agcataacga tttataatag ttgagagata acaccaccct 16320
ccgaagagtg gtcttgtaag gaatatcggt tgaccaaacc ttattttttt ctgtcgggta 16380
agtatgtatg atattctttc ggttaacagg atcatatatg gatatcctgg tttaaatttt 16440
tttatgacca cagaatatag ttgaagttgc ttcattaact tttgtaccag ttttaaactg 16500
tttttttcca tgtttaagca agaggtgaga atcttaggca ctctatagat gacttgttct 16560
tccttaactt tggccggtgt ctttttaatt acatttcatc tacttttgag acatagattt 16620
cattaaactt ggagtagagg tcatactctt tagaaagctg agtgatgaat cgttctgagt 16680
ggtaggactc gaccagggtt tctttaaatt cttttgaata tcgtttttgc atgacttttt 16740
actttgtcta aattatacaa ttcccagtct aagatgtaag aataatatca gactacgcaa 16800
tgttttttat attttgtata taatttagtg atcactagga tatcactaaa ttactctaac 16860
ttcattatat tacctagacc aaatgtgtta tagcgaaata agtaaaaatt atagttatta 16920
cttttgatat tattccagaa gaagtgaatt aaatttagtt cagtagcaga tagaaaatta 16980
gagttttttt ggcaaattga gatttgaatt tgcttgcaat gagagctgca gttattgcgc 17040
ttaaggaaat cggttgttgt aacagcaact aattatgatg atgtaaggac atattttaac 17100
atacgttttg tacgtgattt acatacattc gaaaagttta gatttgatcg tgactaaaca 17160
acaattttgg aggaaaaatt ggaacgaaaa aaaaagaaaa aaaagaatat ttgggttata 17220
attataccta tcttaatttt tattaccctt ataggagcag gggcttatgc cttaagaaat 17280
tcacttattc ctactgatca tacgaaaaca aatagttcgg atcaaccgac caaaacttcg 17340
gcctctaacg gttatgtaga acaaaaaggt gaagaagctg ctgtgggtag tatagcactt 17400
gtagatgatg ctggtactcc agaatggatc aaagttccct caaaggtaaa tctagataaa 17460
tttactgatt tatctacgaa taatatcact atttatcgaa ttaacaatcc ggaagtctta 17520
aaaacagtta ccaatcgtac ggatcaacgg atgaaaatgt cagaagttat agctaagtat 17580
cctaattctt tgattatgaa tgcttccgcc tttaatatgc agacaggtca agtgaccggt 17640
tttcaaatta ataatggaaa attaattcaa gactggagtc caggtacaac ggttcaatat 17700
gcctttgtta ttaacaaaga tggttcatgc aaaatttatg attcaagtac accagctgta 17760
accattattc aaaatggggg gcagcagtct tatgattttg gtactgcgat tatccgtgat 17820
ggtaaaattc aaccaagtga tggctctgta gattggaaga ttcatatttt tattgcgaat 17880
gataaagata ataatctcta tgctattttg agtgatacaa atgcaggtta tgataatata 17940
atgaaatcag tatcaaattt gaaactccaa aatatgttat tacttgatag tggtggctca 18000
agtcaactat ctgtcaatgg taaaacgatt gttgctagtc aagatgatcg agccgtaccg 18060
gattatattg tgatgaaata aaaataaaag aacctcttgg ttctttcaga ctgaagacaa 18120
actccaaaaa taattgcagt ttgtcttttt tctatgtaaa ataaaggtat taagttattg 18180
attgagaggt ttaggcatgt ttcgtaaaca aacgatgatt ggtcgtagtc aaatgggctt 18240
ctattctctt gaagatttag tccctcaaga acatcttcta agacatattg atcaatatgt 18300
tgattttgaa tttatctacg acttagttag agataaatac agtcttgaac atggtagacc 18360
aagccttgat cccgtcatgc tcattaaact ccctattatt caatatcttt ttggtattca 18420
aagtatgcgc caaacgatta aagaaatcaa agtgaacgtg gcttattgct ggtttcttgg 18480
tcttgatttc caagatgaag tgccacattt cacgaccttt ggtaaaaatt atagtcggcg 18540
tttcaaagac acggacttat ttgaacagat tttctaccgt attttggatc aagtctttga 18600
ggcgaaatta gttgaaccga gattggttta tcttgacggg acacatatta agcacacgct 18660
aatcgtcatc aatttgttaa tcaagaagtt gcggttgaag ccttgattta tcaagaagca 18720
ttagagaaag agactaacga ggtgcgggta aaggctgaaa aaaagccctt caaaaaatca 18780
gaagaaaaag acgtcaaaaa cactaaagta tccacgacag attgtgatag tggttggttt 18840
cataaaggtg aacataagga ggtctttgct tattcagcac aggttgcctg cgatactcac 18900
agttgggtgc tgggttacac agcagattgt ggtaatcttc acgatagtcg gacttttttt 18960
gatttgtacg ctaaattaat cgaaaaattt ccagaaatag atacagttgt cgcagacgca 19020
gggtataaaa caccagccat tgctaaaaag ttgatcgatg atggtcgaac tggcctattt 19080
ccttacacgc gaccacgtgg gaagaaagaa ctttttcgaa aacgagattt tatttatgac 19140
tatgtgaatc atacttacac atgcccaaat ggtaaacagt tgatttacaa aacgacccga 19200
cgtgatgggt accaagaata tcgaactact agagcaaact gtgctacttg tccattttta 19260
gcacagtgta caagctcaca aaacaaacag aagacagttt tcagacatat ttggcagctt 19320
tatctagata aaatcgagca aaatcgttta acgacttggg gtaagataac ttataagcga 19380
cgtaaagaaa cgattgaacg tttgtttggg actgccaagg aacatcataa tttgagatat 19440
acacatgaaa atggccgaga taaaatgcac atgaagcttg gtcttacttt tgcgtgcctg 19500
aacatgaaga aattagctaa aattatggcg aatagagata ggggaaaggc ggggattttc 19560
aatttttgga tgattttcaa ggtgataggt tatcaaaaag cacaaatcgc ttaagatttg 19620
tgcttttgtc tacagtctga aagaacctct tggttctttt attttagaga tttttcaaaa 19680
agggttttga ctgagtctaa ttctgtttga gaaactacct tagctccatt ttcatctgtt 19740
gtatgtagat tgagcttgct ggtattcttt agagccttat tgtagctcat aacgatcgtt 19800
ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata gagaatttct 19860
cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat atttccgtag 19920
agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt atcgtcaatt 19980
tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc ttgtttaaac 20040
tcataacctt cagcattgaa tgcctttgga ttttgcatgg tgattccacc agtggcctgt 20100
acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga caaattcaat 20160
aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac ttcagtgatt 20220
gttttttgat taggcattgt tgcaaaaacc gggaagttca tgaaagtagt ttgatttgtc 20280
tttacattcg ttgaagctaa aacagtagca taagctgtat ttttagaatt atttttacca 20340
gttgcaatga taagtgtggt gaatgtttta gactttttta agtcgatact tgttgtttta 20400
gggaaatttt catatgatgt tgaaaatgtt gattcaacat ttctataagc ggcgtaggct 20460
atagaagcaa cagcgataat tactaataca aaaataattg aaataacttt tagtactgtg 20520
tgttttttct tacgataatg acgcctcttt ttttgattca tggtatctcc agatatatat 20580
tata 20584
<210> 224
<211> 21315
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33138 eps gene cluster, complete sequence
<400> 224
atgaatgatt tattttacca tcggctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttat 420
agttataatt ctaggataaa taatctttca aaagctgaca aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa attattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatcaaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcacg gactagtaat 840
gagcaactta taaaaaattt ggctagtaca ggagcagagg tgatagttca accctctcca 900
ccgatttatg gtggtgttgt ataccccgta caagaagaac aatttaaaca atctttatct 960
acaaaatatc cctatataga ctactgggct agttacccag acaaaaattc tgataaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgattcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaaaaaat gcaggaaaca caggaacaga cgattgattt aagagagatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattc ggatagttca gcagcctacg ctggacaagt gagcgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagtc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc ggatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acagattcac aagtcattac gcttactgtt aaatattcta atccttacat ggctcaaaag 1560
attgcagacg agactgctaa aatatttagt tcagatgcag caaaactatt gaatgttact 1620
aacgttaata ttctatctaa agcaaaagct caaacaacac caattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ctagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct ggggctaacg 1800
gttcttggtg taacaactta tgctcaaatg agtgatttta ataagaatac gaataaaaat 1860
ggtacgcaat cgggaactaa gtcaagtctg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ctgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caaggaatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcaaccgcg agtgctaatc tagctgttgg ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacccaacc cgtctgaatt attagcatct agtgcaatga aagacttgct tgactctgtg 2400
tccgatttct ttgatgttgt tttgattgat actccacctc tctctgcagt tactgatgct 2460
caaattttga gtagttatgt aggaggagtg gttcttgttg tacgtgccta tgaaacgaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtatag 2640
tagttggaat aaactttaat caaataaaag acagaaattt gtaggatagg agagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caaccatcac tgccactcct 2820
catcataatc ctcaatttaa taatgagtca ccccttattt taaaaaaagt taaggaagtt 2880
caaaatatta ttgacgaaca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaaggaaagt tacttacagc agcaggcact 3000
tcaagttata tattgattga gtttccatca aatcatgtgc cagcttatgc taaagaactt 3060
ttttataata ttcaattgga gggtcttcaa cctattttgg tccaccctga gcgtaatagt 3120
ggaatcattg agaacccgga tatattattt gattttattg aacaaggagt actaagtcag 3180
ataacagctt cgagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt tacgtcacgt 3300
gcatttaaga tgaaggaggc atttgaaatg attgaagaaa gttgtggttc tgatgtatca 3360
caaatatttc aaaataacgc agggtcagtg attttaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa aaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatggaag tttttgagga tgtctcatca cctgaaccgg aagagcataa gttagtagaa 3540
ttaaaaaaat tttctcatag agagataatt ataaaaagag ggattgatat tttaggggga 3600
ttagcgggtt cagttttatt tcttattgcg gctgcattgc tttatgtccc ttacaaaatg 3660
agctcggaaa aagatcaagg gccaatgttc tataaacaaa aacggtatgg aaaaaacggt 3720
aaaatttttt atattttgaa atttagaaca atgattctta atgctgagca gtatttagag 3780
ctacatccag aagttaaagc cgcctatcat gccaatggca ataaactaga aaatgacccc 3840
cgtgtgacga agattggttc atttattaga caacactcaa ttgatgaatt accacaattt 3900
atcaatgtcc ttaaagggga tatggcatta gttggcccaa gaccaatttt actttttgaa 3960
gcgaaagaat atggggagcg cctctcttat ttactcatgt gtaaacctgg gattactggt 4020
tattggacaa cacatggtag aagcaaagtt ctttttcctc aacgagcaga tttagaactc 4080
tattatctcc aatatcacag caccaagaac gatatcaagc ttcttatgct tacaattaca 4140
caaactattc acggatcgga cgcttactaa aaaatgaagg aaaaacatat ttacattatt 4200
ggttcaaaag gaattccagc aaagtatggt ggttttgaga cttttgtaga agaactaaca 4260
gcacatcaga gtaataaaaa ccttaagtat catgttgctt gtttatcaaa tgacatacaa 4320
tcaaatttta ttcataatgg tgccgactgt tttaatattc caaagaaaaa tattggacca 4380
gcaaatgcca tttattatga tttggcagct ttaaagtact cacttaaaga aattgaagaa 4440
aaaaattata tgggtgcaat tatttatatt ttagcttgcc gcattggtcc gtttattggt 4500
cactataaaa agcaaatgaa aaaattagga attactttga tggtaaatcc tgatggggag 4560
tgtgaaataa tatggacaac cagaaaaagc ctgaattcat acgggtttgc gcggcttgac 4620
cttttcacct caacttgttt ctgcctgtca tggtgcgttg cggccggttt cataggttct 4680
aaccttgtga aacgaattta tcaggaggct ccttctgcta cggttatcgg catcgacaac 4740
atgaatgcct actatgatgt ggcactgaaa gagttccgcc tgaatgagct ggccaagtat 4800
cccacattta ccttttatgg ataatccgaa cttccgcttc gtgaaggctg atatctgtga 4860
ccgcgaagct gtgaataaat tgtttgaaga agaacatccg gacatcgtgg tgaactttgc 4920
ggcagagtct catgttgacc gttctattga agatcctggc atcttcctcc agaccaacat 4980
cattggtacc agtgttctaa tggatgcttg ccgtaagtat ggcattcggc gttatcatca 5040
ggtttctacc ggtgaggttt acggtgacct gccgctggat cgtcctgacc tgttcttcac 5100
tgaggagact ccgatccata ccagctctcc gtatagcagt tccaaggctg ctgctgacct 5160
gctggttctg gcttaccacc gcacctacgg gctgcctgtg accatttccc gttgctccaa 5220
caactatgga ccgtatcact tcccggagaa gctgattccg ctgatgatcg ccaatgcact 5280
ggctgacaag ccgctgcctg tttacggcga aggtctgaat gtccgtgact ggctgtatgt 5340
tgaagatcac tgcaaggcca tcgatctgat tatccacaag ggtcgtgtgg gcgaagtcta 5400
caatgtcggc ggtcataatg aaaagcagaa catcgagatc gtaaagatca tctgcaagga 5460
gctgggtaag ccggagagcc tgattaccca tgttggcgac cgtaagggtc atgatatgcg 5520
ttatgccatc gatccgacca aaatccacaa tgagctgggc tggctgccgg agaccaagtt 5580
tgaggacggc attaagaaga ccatccaatg gtatctcgat aatcgtgagt ggtgggagac 5640
catcatcagc ggtgagtatc agaactacta cgagaaaatg tacagcaacc gctaagaagc 5700
cagaggagga aagaatatga agttttttgt aactggcgtt ggcggtcagc tgggtcatga 5760
tgtgatgaac gagctgctga agcgcggcca tgagggtgta ggctctgata ttcaggaaaa 5820
ttacagtggt gtggcagacg gctccgcagt aacaaaagca ccgtattttg ccttggatat 5880
tacgaacaag gatgcagttg agaaagtcat tacggaagta aatccggacg cagtgatcca 5940
ctgtgcagca tggacggctg tggatatggc agaggatgac gataaagtgg cgaaagttcg 6000
tgccatcaat gcgggcggca cacggaatat tgcggatgtc tgcaagaagc tgaattgcaa 6060
gttgacctat atcagcacgg actatgtgtt tgatggtcag ggcacagagc cctggcagcc 6120
ggattgcaag gattataagc ctttgaatgt atacggtcag acaaagctgg aaggtgagtt 6180
ggcagtcagc cagacgctgg agaagtattt cattgtccgc attgcttggg tgtttggctt 6240
gaatggcaag aactttatta agaccatgct gaatgttggt aagacacacg acactgttcg 6300
tgtggtcaat gaccagatcg gcacaccgac caatacatat gatttggctc gactgctcgt 6360
cgatatgaat gaaaccgaga agtacggcta ttatcatgca accaacgagg gcagctacat 6420
cagctggttc gatttcacga aagaaattta tcgtcaggct ggatataaga cagaagtcct 6480
gccggtgacc acggcagagt acggtctgag caaggccgct cgtccgttca acagccgtct 6540
ggataagagc aagctggtgg aagctggctt tactccgctt ccgacttggc aggatgcact 6600
gagccgttat ctgaaagaaa tcgagcagtg atagaggaga tagagaaaat gggacagatt 6660
aaggttgata aaaatgtagg cggcatcgag ggactttgtg tgattgagcc tgctgtgcac 6720
ggtgatgccc gtggctattt tatggaaacc tacaacgaaa aagatatgaa gaaagctggt 6780
atcgacattc attttgtaca ggataatcag tccatgtcca tgaagggcgt gctgcgcggt 6840
ttgcatttcc agaagcagta tccgcagtgc aagctggtac gcgccgtgcg cggaactgtg 6900
tttgatgtcg ctgttgatct tagaagtaat tctgagacct atggcaagtg gtatggtgtg 6960
accctgtctg ccgagaataa gaagcagttc ctcattccgg agggatttgc acacggattc 7020
cttgttctga gcgatgaagc agagttctgc tataaggtta atgacttctg gcatccgaat 7080
gatgagggtg gtatggcttg gaacgacccg gagattggca ttgagtggcc tggagtacag 7140
ggtgagtaca agggtagcgc gagtgcggaa ggctatgagt tagaggatgg cactgcgttg 7200
aatctgagtg ataaagacca gaaatggctg gcactgaagg atacttttaa attttgagaa 7260
cacaggggta gaagaatgca gcgggaaaac gaagtacaac acgtatttct ggtaggtgct 7320
aaaagcctag gagcatatgg tggttatgaa acctttgtat ataagctgac agaacaccat 7380
cagaataaga aaaatattaa atatcatgtg gcgtgtaaag ctaatggtga cggctgcatg 7440
gacgaaacaa aagtggatgg cgtaaaggga atcaatcaac atgagtttga attccacaat 7500
gcgcactgtt ttaaaattga tattcctcag attggtgcag cacaggcaat ttactatgat 7560
gttgcggcat tgaatgcttg ctgtaagtac ataaaggaac ataaaatcaa acatccaata 7620
gtttacataa tggcttgccg cattggaccg tttgcaggtc atttttatca ggaaatccat 7680
aagcttggtg gtacggtcta tttgaatccg gatggtcatg aatggatgag agccaagtgg 7740
tcggctccga ttcgtaaata ctggaagatt tctgagcgga tgatggtcaa atactgtgac 7800
cttgcaatct gcgattctgt gaatatcgaa aagtatatcc acgagtgtta cgacgggaaa 7860
ggaatcaaag gcagaaatcc taagaccaca ttcattgctt atggtgcaga tttgacactc 7920
agcaagctgg ctgatgacga tgaaaagctg gtgaactggt ataaggaaaa aggactggcg 7980
aagaagggct attaccttgt cgttggacgt tttgtaccgg agaactcttt tgaagtgatg 8040
attcgcgagt tcatgaagag cggaagcaaa aaagattttg cgcttatcac aaatgtgaac 8100
gataaatttc tgaatgaatt ggaagagaaa cttcatttca agagtgacaa gagaatcaag 8160
ttcgttggta cagtgtatga ccaagaactc ttgaagaaga ttcgagagaa tgcttacgca 8220
tatttccatg gacacacagt tggaggtacg aatccatcat tgattgaggc acttggcagc 8280
acggatttga acctgctggt tgatgttggt tttaataaag aggtcgccga agattgcgct 8340
ttgtattgga gccgcgaacc gggcagtctt gcaagattga ttgatcgtgc agataagatg 8400
agtaccgaag aaatcgcgga aatgggtcga aaagccaaga agcgtgtagc tgaagagtat 8460
acatgggata aaatctgtgg acagtatgaa gaagtgttca caaagtgaga gcaaatgagg 8520
gatcagacag tgaaaatctc gaaatattat agaacattct taagaagaaa actaaatgcg 8580
gagaatcgca aacgtctaaa aaacaaaaac tttacggtgc tatgtaataa ctgtgtgggg 8640
ggggtgatcc ttcacgagtt aggtgaacgc tttaattcgc caacggtcaa tttgtttttt 8700
aaagcggaag attatctcaa atttttggag aacttagatt attacttaaa acaggctctt 8760
gtagaagttg gaagcgagaa gaactaccct gttgcaaaac tggatgatat aacaatatat 8820
ttcatgcatt attcatcgtt tgatgaggca aaaataactt ggcaaaaacg agtggcaaga 8880
attaacaaaa acaatttgta tgtaattttt gttcaacaaa gcggttgtac agagcaggtc 8940
ttggaggcat ttgacaagct tccctataaa cataagctgg cacttactgc aaagccaatg 9000
ccggagataa aatgctctta ttgtattcat ggtacagcgc aaccgaatgg agaagtaatg 9060
gatttgtgca agtatgaggg aaagtttact ggcaaacgct ggattgatga atatgattat 9120
gtgggatttt taaataagaa atgatgtgag gaattgatat gtatgattat ttggtggtag 9180
gctctggtct ttacggagca atatttgcgc atgaagcaaa agcgcatgga aaatctgtgc 9240
tagttgtgga taagcgtccg aacattgggg gcaatgtcta caccgagaac attgagggca 9300
tcaacgtcca caagtacggt gcacatattt tccataccaa caacaaaaag gtttggaatt 9360
acatcacgca gtttgccgag ttcaaccgct ttacaaattc tccggttgct aattataagg 9420
gtgaactgta ttcgttgcct ttcaatatgt ataccttcaa caagatgtgg ggcgttgtga 9480
caccggagga agccgctgca aaaattgagg agcagcgcaa ggaaatcact ggcgagccga 9540
aaaatctgga ggagcatgcc atctctctcg tgggccgcga catctatgag aagctcatca 9600
aaggttacac cgagaagcag tgggggcgtg actgcaagga tctgcctgcc ttcatcatta 9660
agcgtcttcc ggttcgtctg acttttgata ataactattt caatgcgttg taccagggta 9720
tccccattgg cggttacacc aagatgattg ccaacttgct ggacggcatc gaggtgcgcc 9780
tgaacatcga ctatctggaa aacaaggttg agctggatgc gctggctggc aaggtggttt 9840
acaccggtcc catcgatgcc tactttgact ataagctggg tacgttggag taccgttctg 9900
tccgctttga gaatgagttg ctcgacaagc cgagctccca gggcaacgct gcggtcaact 9960
atacagaccg cgagacgccg tggactcgta tcattgagca caagtggttt gagtttggta 10020
gggacgagaa cggcaatgat ttgcccaaaa ccattatcag ccgagagtat agcagtgagt 10080
ggaagccggg tgatgagccg tattatccgg tcaatgatgc taagaacagc ttactgtatt 10140
ctgagtataa gaaactggca gatgctgaaa gtaaagtgat tttcggtggt cgtttgggtg 10200
agtacaagta ttacgatatg gatcagatta ttgccgctgt attggagaga tgcgaaaggg 10260
aatttgatgt atgaatggaa aaataatcgt tgttacacat aaagagtata aaatgccatg 10320
tgatacagtg taccttccag tatgcgttgg cgttggcaga gatgctttaa ggaataagta 10380
tcaggctgac gatgaaggtg aaaacatctc tgataagaat attctttatt gcgaactgac 10440
agcactgtat tgggcttgga aaaacttgaa ctgtgattat atcggactgg ctcattatcg 10500
tagatatcta actgagtcaa agagaagtaa aaacatagag gatgctctat ctcagcatag 10560
aattgaagaa ctcttgatgg actatgacat aattgtacct agagagaaaa ggtattctca 10620
aacaatagcc gaccattata ttaactgtat taaaagcaga aaagatgcac acaaaattca 10680
tttacaatta cttcgtgatt cgattcttga ggtagctcca gagtatattg cagaatacga 10740
taagaccatg aatgggcata gtgcacatat gcttaatatg tttgtgatga aaaagcaaaa 10800
tctggacaat tactgcgagt ggctatttaa gattttattt gttttagaaa aaaaaatata 10860
tgaccatgat gtctactatg atcgtataat gggcgcattt agtgagttcc tattggatgt 10920
atggattaga acaaataaaa agacgtatat agaggttgag ttaatcgaaa ctgaaagaga 10980
ctattggggt aagattaaat gggctctgaa aagaaagctg tttgaataag gagataatat 11040
aatgcgaatc ctgcattact cactagggtt tcccccttat cgaaggggag gcctgacaaa 11100
atattgtttg gatcttatgg tagcacagga aatgcagggt aatgtggtag ctatgtgctg 11160
gcctggtgaa atcggaatta tcaagaagaa aaaagttgca ataaaaaaaa gaaaaaaata 11220
cagcatagga aagagtaaga ttgaaaatta tgaaatacaa gggattttac ctgttcctct 11280
tttagaggga ataaaaaatc cggatctatt tactgaaaaa aagaaccaag aaatttggaa 11340
actattttta aagaactgga gacctgatgt tatccacttt catacattaa tgggcttgcc 11400
gctagaatat gttgaaacgg caagaaagct tggaataaaa acattattta cgacgcatga 11460
ttattttgga ttgtgtccaa ggacgacttt ggtgcgtcaa aatggcgaaa tttgtgatgg 11520
ctgcacaccg gaattgtgcg cggaatgctg tgagaatgct atcagttata gaaaattgaa 11580
aattctacaa tcttcagtgt atagggttct gaaagattta gtaattgtaa aaaagctcag 11640
aaaaaaacat tggaatgaat cgaaaaatga ttctgcacaa catcaggctt ctgttcagaa 11700
cgcacaacga gcagaagaat atgttgagct acgaaaatat tatataaaac tactaaaaag 11760
ttttaatatc atccatttta atagtagcaa tacaagagat gtgtatttaa aagcggccaa 11820
ggaggtgtta aacaacgagg tcgtttctat atcacatgaa atgataaaag ataacaaaaa 11880
gaaaaaaagg aaacacgaga tattgcacct ttcttatttg ggaccggata catataataa 11940
aggatactat gtattaaaag aaacactgaa tcagttgcat aaagaaggat ataagtttca 12000
gttaaatatt tattttgagg atgcttcgga gccctttatc gtttcacatg cgccgtatca 12060
atactcagaa ttaggcaagg tgatggatga tgcagattgc gttatattac ctagtttggg 12120
gaatgaaaca tttggcttta cagtcttaga agctctaagt tatggagttc ctgttattgt 12180
tagtagtcgt gttggtgcaa aggatattgt tgaagagggt aaaaacggat ttgttgttga 12240
aggtgatgta gactctttaa aaacaaagct gacaagtgtg ttgaatcaac ctgaaatatt 12300
ggaagatatg aataactata ttgttgcaaa tacacacatt aaaacaatga cagaacattc 12360
taaagaaatc aaggacctgt atcaaaagtg acttgtatat aaaaatggag aagattatgg 12420
atttagataa aattagatgg aattcagagg tcagtcatcc aaattttata gctcaatata 12480
gatttgtaaa agcaaaaaga ttatgtgagt attgggctga caaaaataag ctgctgtact 12540
tatttgctcg gatgcgatat gaacattata aagtaaaata taatacagat attcctgccc 12600
gttgtaaaat cggggggggt caaaatcagg catctcggag gtattgtatt taacccgggt 12660
gttgaaattg ggaaaaatgt tgattgctta aatggcgttt tgcttgggca aatcgatttc 12720
ggtgccaaag caggtgtgcc aagaattggt gataatgtgt ttttaggaac caattctatt 12780
gttgtcggta agattcaaat aggaaatgat gtgttaattg ctcctggtgc atatgtgaac 12840
tttgatgttc cggatcactc tatagtaata ggaaatcctg gaaagattat tgcaaaacaa 12900
aatgcgacaa gaggatatat agcatcacct gttgaagatt aaatgagcgg ataataaata 12960
gcgaggagtt atgagtccgc acgttttctg caaagggaga acaaattgtg caagataaag 13020
taagtattat tgttcctgta tataaagttg agagggaact agatcgctgt gttcaaagtc 13080
tgattaaaca gacttataaa aatttggaaa taattcttgt ggatgatggt agtcctgatc 13140
aatgtcctga attatgcgaa aattatgctg agatagataa gagagttaag gtcattcata 13200
aagagaacgg cggattatca gatgctcgta atgcgggatt gaaacaagca acaggcaagt 13260
atattctgta tgttgattct gatgattata ttgatttgga tgcctgcgaa agatttataa 13320
aggcggcggg taatcaaaaa atagatattg ttgttggaaa tgcaattatg gaaaaaccag 13380
atggtaaaga aatgatgata cactcagcga caccatctgg aatcacctat actgccaaac 13440
agtttattat gagtgctgtt aaagcatatc agtggtatgc ccctgcatgg cttaatatgt 13500
atagaaggga ctttcttctt gataatcagt tatacttcaa aaaaggaatt tactttgagg 13560
atgttcaaat gctgccacgt gtttttttgg ccgcaaaaaa aatcacatgc atatatggaa 13620
cattttatca ttatattatt cgagaaaatt caattatgac gtctcagaaa gacgagaaaa 13680
agaaaaacga ttcaattcaa aatatgaaag agtggaaaga gcagtttgat cttgtagatg 13740
atgtggcctt gaaaaaatgc ctatatggaa tgcttgtgaa aatgtatata cacgaatgta 13800
ggcagtatgg gattacgact aaagcaattg aaggaatgga tgatagattt atattgggga 13860
actgtctcaa ttataaagaa agattaaagg ctactatgtg gttgtgcttt ccaaggctac 13920
tgataaaaca gtgaaggagt gcatatgagt gtttacatat ttttatgggt tgctgtagtt 13980
gtatttggct ttatcgcaag tagaagtaat tataaagcaa aatattttgt gctcttttct 14040
tttttcctga tgacaattgt tttaggacta agaggggcta cagttggcga agatacaaaa 14100
atgtatctta atattgcaga aagagtaact aatatatcat ggaaagaagt gttttctagc 14160
tttccaacga gtcagtggag atatatttca tatggtggct taagtggatt tagtgagcag 14220
acggaaacag tttatttggc ctattgcaaa ttgataatgc ttatatttca taatgcacag 14280
gcagttcttc ttataacagc tgcaattacg aatgctctat ttgcaaagtt tattttagat 14340
aacataacag tcaaacaaga tgccatactg gctgtttata tttacatgtg cgatgcgatg 14400
tttatgaatt cgtttaatac aatgcgacaa attttggcaa tatccattgc agtgcaatcc 14460
atagaattaa taaaaaaaga aaagtataag aaagcaatag catgtgttct attagccgca 14520
tgcttccatc aatctgcaat agtttttttt gttgccgatt tattctattt attaaagaag 14580
aaaaaagaaa gatatattta tctccttgtg actctttgtg cattgccagt tttaatccct 14640
gtggctatta aagtggtcag tatcttctcc agtaaatatg caagttattt gtcagttagt 14700
ttttggggag cacaactacg aggaacactc ttgctctgga taattattgc aattgttctg 14760
tttattatga tacgtgctaa ccaatcggat aatatagatt ggtggctaat ctatatggca 14820
acaatttaca ttggtgtaga gcttgttgga atgcagttga cggttatatc tagggtggca 14880
atgtacttta gaattttcct tgtattgctc tttccgattg ctcaaaaata tttcactaaa 14940
aaaagtggac agttttataa aattggcgtg gttatgctaa tgactgtatc gttctttagt 15000
tatgcgagtt cgcctgatcg tttgtacact ttttgctttt aatgatcaaa gagggagggc 15060
agaggaatgc ctattgcttc ggtgataata cctacatata aaggaagtag tgccttaaat 15120
agggcaattg atagtgtgtt gtgtcaatca tataaggaaa ttgaaaaaat tgtagttgat 15180
gataatggtt ctgttgcaaa gttttaaatc tactatcaaa taaggtagaa taatagaaaa 15240
agatagcagg aggaatgacg atgaatcatt ttaaaggaga gcaatttcag caggatgtga 15300
ttattgtagc cgtgggctac tatcttcgtt ataaccttag ctatcgtgaa gttcaagaaa 15360
tcttatatga tcgtggcatt aacgtttctc atacgacgat ttatcgttgg gtgcaagaat 15420
atggcaaact actctatcaa atttggaaaa agaaaaataa aaaatccttt tattcatgga 15480
aaatggatga aacgtacatc aaaattaaag gaaaatggca ttatttgtat cgagccatcg 15540
atgcagatgg tttaaccttg gatatttggt tacgtaaaaa acgggacaca caagcagcct 15600
atgcttttct taagcggtta gtgaagcagt ttgatgaacc gaaggttgta gtcacagata 15660
aagccccctc tattaaaagt gcctttaaga aactaaaaga atacggcttt tatcaaggga 15720
cagaacatcg taccattaaa tacctgaata atttgattga acaagaccat cgtccagtaa 15780
agagacgcaa taaattctat cgaagtttac gcactgcctc acccacgatt aaaggcatgg 15840
aagccattcg aggattatat aagaaaaccc gaaaagaagg cactctcttc gggttttcgg 15900
tctgtactga aatcaaggta ttattgggaa tcccagctta aatcatagat accgtaaggg 15960
attttattct ttatttaaaa ctttgcaaca gaaccaagat ttgtataaaa aataaaaata 16020
tggaggtaca atcatgggaa aaattcctat gagtaaggta gacaaggagt ctgttttaaa 16080
catgcttatc agcaatgaaa acagcgtgaa gattccgcaa gcggtggaag tggttgatta 16140
tcaaacaggc gagttcgaca gaggaggcga aaaaggtctg atgtattggg ctaatttatg 16200
cgttgttgat gttgaagagt tagaacttct gaaaagtgtt ggtttggaag aaaatgctat 16260
tctgattaag ctgaaagtat ctgattataa taatgaaaat cttgaggttt taaaaggtaa 16320
agtgctagat actaaatcta tggaaatagt gtttgtagag aaaaaaagta aagtaggtaa 16380
cgaaattaca ggcttggcat ttaaaacaag ttttagagat ttaagaggag tctgttttaa 16440
acatgcttat cagcaatcaa aacagcgtga agattccgca agcggtggaa gtggttgatt 16500
atcaaacagg cgagttcgac agaggaggcg aaaaaggtct gatgtattgg gctaatttat 16560
gcgttgttga tgttgaagag ttagaacttc tgaaaagtct atattttggg gggcactaca 16620
gaaatggaac gctcttcggc ttttcggtgt ctactgaaat caaggtatta atgggaataa 16680
cagcctaaga tatttggagt tcacagaggg cgcatttgat tttcaaactt cgcaatagaa 16740
ccaaataggg ttaatctcat caagaaaagg agttagaaac agttgcgaaa atatatgatc 16800
tacctcagca gtttattggt cacatttatc ctgagttatg cgacgattaa ttggctgatt 16860
atgcccgttc tcactcgcta tcaaagcctg gctaggttga ttaaccactt tgactatacc 16920
gcattaactt taatactctt attaacgctg attatctggt tgtttggcat ccagtatcac 16980
ctcaaacatt tttcagttat ttatctctat cttgctttca gtgtgtattt attactgtta 17040
ttcatggtgc tttttactaa aacaacggat tttcaggcga tatcactgaa tccttttgac 17100
tttataaaag cggataccag aacgattcaa gaggcagtgc taaatattat ctacttcatt 17160
cctttaggtg gcctttactg tatcaatact gatttcaaac agtttgtcat tgtcagtttg 17220
gtcacacttt taggaattga aaccattcaa tttatctttt atttgggcac atttgccatt 17280
agtgatatta tcttgaattt tttgggttgt ttgattgggt attattgttg ttgggagatt 17340
aaaaggcggt tgagttgagc taacccaaaa gtagcataaa aaggttctgt tgcaaagttt 17400
taaatctact atcaaataag gtagaataat agaaaaagat agcaggagga atgacgatga 17460
atcattttaa aggaaagcaa tttcagcagg atgtgattat tgtagccgtg ggctactatc 17520
ttcgttataa ccttagctat cgtgaagttc aagaaatctt atatgatcgt ggcattaacg 17580
tttctcatac gacgatttat cgttgggtgc aagaatatgg caaactactc tatcaaattt 17640
ggaaaaagaa aaataaaaaa tccttttatt catggaaaat ggatgaaacg tacatcaaaa 17700
ttaaaggaaa atggcattat ttgtatcgag ccatcgatgc agatggttta accttggata 17760
tttggttacg taaaaaacgg gacacacaag cagcctatgc ttttcttaag cggttagtga 17820
agcagtttga tgaaccgaag gttgtagtca cagataaagc cccctctatt aaaagttcct 17880
ttaagaaact aaaagaatac ggcttttatc aagggacaga acatcgtacc attaaatacc 17940
tgaataattt gattgaacaa gaccatcgtc cagtaaagag acgcaataaa ttctatcgaa 18000
gtttacgcac tgcctcaccc acgattaaag gcatggaagc cattcgagga ttatataaga 18060
aaacccgaaa agaaggcact ctcttcgggt tttcggtctg tactgaaatc aaggtattat 18120
tgggaatccc agcttaaatc atagataccg taagggattt tattctttat ttaaaacttt 18180
gcaacagaac cgagataata atcacgtctt tttggaattg tttgccctta aaatgattca 18240
tctgttgtcc tcgcattctt ttttattaca ttttacaata aatcgggtgt tatggggaac 18300
tttgcaacag aacctatttg aatttagtcc agttctaact atcttttttt tcaaatttaa 18360
gctaaaatag atttttggaa aactttgcaa cagaaccctt agttttctgt gtttttttct 18420
aatttcattt agaggtgaat taattggtag ttattagagg tgccctataa aataacttag 18480
agctttgtgg gagctaccca ctaatactaa tataaggaga tagacaatgg atttaaaaga 18540
tttaataagt gttattgttc cgatatacgg cgttgaagaa tatttaaata aatgtatcga 18600
ctctattatc aatcaaacat ataaaaatct agaaattatt ttggttgatg atggtagtcc 18660
agataaatgt ccagatatat gcgatacatt cgaaaaaaaa gatgagagaa taaaggtaat 18720
ccataaaaag aatggtggat tatctgatgc gagaaatgcc ggtattgata cagcacatgg 18780
agactatttc gtttttgttg atagcgatga ttggattgaa aacacaatgg tagagcattt 18840
gctcttcgca tgtaaaaaat ataatgttga aatggcaact tgtgctagat atattacaga 18900
tggtcattca actagagcag tcgcatttaa tggtccagca ggagcatatt cagctgaaga 18960
agcattgaat gaaatactct taggaaagtc gatggatgtt gctgcttggg ataaaattta 19020
tgctcgtaat ctatttgaag aaatacggtt tccggttggt gaaaataatg aagacattgc 19080
agttttctat aaactagtag acttggctgg cagagtagca cataccggta caacggaata 19140
tttttatcgg agtcgtccgg gcagtattac aaaattgaaa tatagtacag atgccagaaa 19200
aatcatcgag aaaaatctga attcaataga aaaatttctt gataaaaagt atccaagctg 19260
tttgccaagt ttttatcgtt ataaaacaat gaacatttat gcattgttga ataagtatat 19320
taaatgcgaa ggaacaaaga aaacacaaga atttgagcat ctgatgaacg agttccgaaa 19380
aaataagagc tatttcttta atgatgatca gaccccatca aaagaaaaga agattgccat 19440
aatgattctt ttgcatcttt acaatccgta tttacttgta aaagaaaaga ttacgggtta 19500
taagtgacga agggagaaat tgagaaaatg attggttctg ttgcaaagtt ttaaataaag 19560
aataaaatcc cttacggtat ctatgattta agctgggatt cccaataata ccttgatttc 19620
agtacagacc gaaaacccga agagagtgcc ttcttttcgg gttttcttat ataatcctcg 19680
aatggcttcc atgcctttaa tcgtgggtga ggcagtgcgt aaacttcgat agaatttatt 19740
gcgtctcttt actggacgat ggtcttgttc aatcaaatta ttcaggtatt taatggtacg 19800
atgttctgtc ccttgataaa agccgtattc ttttagtttc ttaaaggcac ttttaataga 19860
gggggcttta tctgtgacta caaccttcgg ttcatcaaac tgcttcacta accgcttaag 19920
aaaagcatag gctgcttgtg tgtcccgttt tttacgtaac caaatatcca aggttaaacc 19980
atctgcatcg atggctcgat acaaataatg ccattttcct ttaattttga tgtacgtttc 20040
atccattttc catgaataaa aggatttttt atttttcttt ttccaaattt gatagagtag 20100
tttgccatat tcttgcaccc aacgataaat catcgtatga gaaacgttaa tgccacgatc 20160
atataagatt tcttgaactt cacgatagct aaggttataa cgaagatagt agcccacggc 20220
tacaataatc acatcctgct gaaattgctt tcctttaaaa tgattcatcg tcattcctcc 20280
tgctatcttt ttctattatt ctaccttatt tgatagtaga tttaaaactt tgcaacagaa 20340
ccaaatatgg tataatttaa ggtatataat atatatatgg agataccatg aatcaaaaaa 20400
agaggcgtca ttatcgtaag aaaaaacaca cagtactaaa agttatttca attatttttg 20460
tattagtaat tatcgctgtt gtttctatag cctacgccgc ttatagaaat gttgaatcaa 20520
cattttccac atcatatgaa aatttcccta aaacaacaag tattgactta aaaaagtcta 20580
aaacattcac cacacttatc attgcaactg gtaaaaataa ttctaaaaat acagcttatg 20640
ctactgtttt agcttcaacg aatgtaaaga caaatcaaac tactttcatg aacttcccag 20700
tttttgcgac aatgcctaat caaaaaacaa tcactgaagt ttacaatacg aatggagatg 20760
atggaatttt ccagatggtt aaagacctat tgaatgtgtc cattaacaaa gtagttcaga 20820
tcgatgttaa taaaatggga tcacttgtac aggccactgg tggaatcacc atgcaaaatc 20880
caaaggcgtt caatgctgaa ggttatgagt ttaaacaagg aactgttaat ttacaaactg 20940
ctgatcaagt ccaagcctat atgacacaaa ttgacgatac tgatttggat gcttcaatca 21000
cccggattca aaatgtctca atggaactct acggaaatat tcaaaaaatt gctcatatga 21060
aaaaacttga aagtttcaat tactatcgag aaattctcta tgctttttca aacactgtta 21120
aaaccaatat aagtttcaat gatgctaaaa cgatcgttat gagctacaat aaggctctaa 21180
agaataccag caagctcaat ctacatacaa cagatgaaaa tggagctaag gtcgtttctc 21240
aaacagaatt agactcagtc aaaacccttt ttgaaaaatc tctaaaataa tcgcaattat 21300
cgagcagagg gactt 21315
<210> 225
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsR
<400> 225
Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 226
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsX
<400> 226
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Leu Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Asn Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Arg
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Lys Asn Leu Ala Ser Thr Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Lys Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Asp Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 227
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsC
<400> 227
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Glu Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Ser Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Ser Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asp Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Met Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Leu Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 228
<211> 230
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsD
<400> 228
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Thr Ala Ser Ala Asn
50 55 60
Leu Ala Val Gly Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asp Leu Leu Asp Ser Val Ser Asp Phe Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ser
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val
225 230
<210> 229
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsB
<400> 229
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Met Ile Glu Glu Ser Cys Gly Ser Asp Val Ser Gln Ile Phe
210 215 220
Gln Asn Asn Ala Gly Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 230
<211> 228
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_epsE
<400> 230
Met Glu Val Phe Glu Asp Val Ser Ser Pro Glu Pro Glu Glu His Lys
1 5 10 15
Leu Val Glu Leu Lys Lys Phe Ser His Arg Glu Ile Ile Ile Lys Arg
20 25 30
Gly Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln
85 90 95
Tyr Leu Glu Leu His Pro Glu Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Gln His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Glu Arg Leu Ser Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Leu Phe Pro
180 185 190
Gln Arg Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys
195 200 205
Asn Asp Ile Lys Leu Leu Met Leu Thr Ile Thr Gln Thr Ile His Gly
210 215 220
Ser Asp Ala Tyr
225
<210> 231
<211> 216
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_GT1
<400> 231
Met Lys Glu Lys His Ile Tyr Ile Ile Gly Ser Lys Gly Ile Pro Ala
1 5 10 15
Lys Tyr Gly Gly Phe Glu Thr Phe Val Glu Glu Leu Thr Ala His Gln
20 25 30
Ser Asn Lys Asn Leu Lys Tyr His Val Ala Cys Leu Ser Asn Asp Ile
35 40 45
Gln Ser Asn Phe Ile His Asn Gly Ala Asp Cys Phe Asn Ile Pro Lys
50 55 60
Lys Asn Ile Gly Pro Ala Asn Ala Ile Tyr Tyr Asp Leu Ala Ala Leu
65 70 75 80
Lys Tyr Ser Leu Lys Glu Ile Glu Glu Lys Asn Tyr Met Gly Ala Ile
85 90 95
Ile Tyr Ile Leu Ala Cys Arg Ile Gly Pro Phe Ile Gly His Tyr Lys
100 105 110
Lys Gln Met Lys Lys Leu Gly Ile Thr Leu Met Val Asn Pro Asp Gly
115 120 125
Glu Cys Glu Ile Ile Trp Thr Thr Arg Lys Ser Leu Asn Ser Tyr Gly
130 135 140
Phe Ala Arg Leu Asp Leu Phe Thr Ser Thr Cys Phe Cys Leu Ser Trp
145 150 155 160
Cys Val Ala Ala Gly Phe Ile Gly Ser Asn Leu Val Lys Arg Ile Tyr
165 170 175
Gln Glu Ala Pro Ser Ala Thr Val Ile Gly Ile Asp Asn Met Asn Ala
180 185 190
Tyr Tyr Asp Val Ala Leu Lys Glu Phe Arg Leu Asn Glu Leu Ala Lys
195 200 205
Tyr Pro Thr Phe Thr Phe Tyr Gly
210 215
<210> 232
<211> 303
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_dTDP-glucose_4,6-dehydratase
<400> 232
Met Ser Trp Pro Ser Ile Pro His Leu Pro Phe Met Asp Asn Pro Asn
1 5 10 15
Phe Arg Phe Val Lys Ala Asp Ile Cys Asp Arg Glu Ala Val Asn Lys
20 25 30
Leu Phe Glu Glu Glu His Pro Asp Ile Val Val Asn Phe Ala Ala Glu
35 40 45
Ser His Val Asp Arg Ser Ile Glu Asp Pro Gly Ile Phe Leu Gln Thr
50 55 60
Asn Ile Ile Gly Thr Ser Val Leu Met Asp Ala Cys Arg Lys Tyr Gly
65 70 75 80
Ile Arg Arg Tyr His Gln Val Ser Thr Gly Glu Val Tyr Gly Asp Leu
85 90 95
Pro Leu Asp Arg Pro Asp Leu Phe Phe Thr Glu Glu Thr Pro Ile His
100 105 110
Thr Ser Ser Pro Tyr Ser Ser Ser Lys Ala Ala Ala Asp Leu Leu Val
115 120 125
Leu Ala Tyr His Arg Thr Tyr Gly Leu Pro Val Thr Ile Ser Arg Cys
130 135 140
Ser Asn Asn Tyr Gly Pro Tyr His Phe Pro Glu Lys Leu Ile Pro Leu
145 150 155 160
Met Ile Ala Asn Ala Leu Ala Asp Lys Pro Leu Pro Val Tyr Gly Glu
165 170 175
Gly Leu Asn Val Arg Asp Trp Leu Tyr Val Glu Asp His Cys Lys Ala
180 185 190
Ile Asp Leu Ile Ile His Lys Gly Arg Val Gly Glu Val Tyr Asn Val
195 200 205
Gly Gly His Asn Glu Lys Gln Asn Ile Glu Ile Val Lys Ile Ile Cys
210 215 220
Lys Glu Leu Gly Lys Pro Glu Ser Leu Ile Thr His Val Gly Asp Arg
225 230 235 240
Lys Gly His Asp Met Arg Tyr Ala Ile Asp Pro Thr Lys Ile His Asn
245 250 255
Glu Leu Gly Trp Leu Pro Glu Thr Lys Phe Glu Asp Gly Ile Lys Lys
260 265 270
Thr Ile Gln Trp Tyr Leu Asp Asn Arg Glu Trp Trp Glu Thr Ile Ile
275 280 285
Ser Gly Glu Tyr Gln Asn Tyr Tyr Glu Lys Met Tyr Ser Asn Arg
290 295 300
<210> 233
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_dTDP-4-dehydrorhamnose_reductase
<400> 233
Met Lys Phe Phe Val Thr Gly Val Gly Gly Gln Leu Gly His Asp Val
1 5 10 15
Met Asn Glu Leu Leu Lys Arg Gly His Glu Gly Val Gly Ser Asp Ile
20 25 30
Gln Glu Asn Tyr Ser Gly Val Ala Asp Gly Ser Ala Val Thr Lys Ala
35 40 45
Pro Tyr Phe Ala Leu Asp Ile Thr Asn Lys Asp Ala Val Glu Lys Val
50 55 60
Ile Thr Glu Val Asn Pro Asp Ala Val Ile His Cys Ala Ala Trp Thr
65 70 75 80
Ala Val Asp Met Ala Glu Asp Asp Asp Lys Val Ala Lys Val Arg Ala
85 90 95
Ile Asn Ala Gly Gly Thr Arg Asn Ile Ala Asp Val Cys Lys Lys Leu
100 105 110
Asn Cys Lys Leu Thr Tyr Ile Ser Thr Asp Tyr Val Phe Asp Gly Gln
115 120 125
Gly Thr Glu Pro Trp Gln Pro Asp Cys Lys Asp Tyr Lys Pro Leu Asn
130 135 140
Val Tyr Gly Gln Thr Lys Leu Glu Gly Glu Leu Ala Val Ser Gln Thr
145 150 155 160
Leu Glu Lys Tyr Phe Ile Val Arg Ile Ala Trp Val Phe Gly Leu Asn
165 170 175
Gly Lys Asn Phe Ile Lys Thr Met Leu Asn Val Gly Lys Thr His Asp
180 185 190
Thr Val Arg Val Val Asn Asp Gln Ile Gly Thr Pro Thr Asn Thr Tyr
195 200 205
Asp Leu Ala Arg Leu Leu Val Asp Met Asn Glu Thr Glu Lys Tyr Gly
210 215 220
Tyr Tyr His Ala Thr Asn Glu Gly Ser Tyr Ile Ser Trp Phe Asp Phe
225 230 235 240
Thr Lys Glu Ile Tyr Arg Gln Ala Gly Tyr Lys Thr Glu Val Leu Pro
245 250 255
Val Thr Thr Ala Glu Tyr Gly Leu Ser Lys Ala Ala Arg Pro Phe Asn
260 265 270
Ser Arg Leu Asp Lys Ser Lys Leu Val Glu Ala Gly Phe Thr Pro Leu
275 280 285
Pro Thr Trp Gln Asp Ala Leu Ser Arg Tyr Leu Lys Glu Ile Glu Gln
290 295 300
<210> 234
<211> 223
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_dTDP-4-dehydrorhamnose_3,5-epimerase
<400> 234
Leu Ala Gly Cys Thr Glu Pro Leu Ser Glu Arg Asn Arg Ala Val Ile
1 5 10 15
Glu Glu Ile Glu Lys Met Gly Gln Ile Lys Val Asp Lys Asn Val Gly
20 25 30
Gly Ile Glu Gly Leu Cys Val Ile Glu Pro Ala Val His Gly Asp Ala
35 40 45
Arg Gly Tyr Phe Met Glu Thr Tyr Asn Glu Lys Asp Met Lys Lys Ala
50 55 60
Gly Ile Asp Ile His Phe Val Gln Asp Asn Gln Ser Met Ser Met Lys
65 70 75 80
Gly Val Leu Arg Gly Leu His Phe Gln Lys Gln Tyr Pro Gln Cys Lys
85 90 95
Leu Val Arg Ala Val Arg Gly Thr Val Phe Asp Val Ala Val Asp Leu
100 105 110
Arg Ser Asn Ser Glu Thr Tyr Gly Lys Trp Tyr Gly Val Thr Leu Ser
115 120 125
Ala Glu Asn Lys Lys Gln Phe Leu Ile Pro Glu Gly Phe Ala His Gly
130 135 140
Phe Leu Val Leu Ser Asp Glu Ala Glu Phe Cys Tyr Lys Val Asn Asp
145 150 155 160
Phe Trp His Pro Asn Asp Glu Gly Gly Met Ala Trp Asn Asp Pro Glu
165 170 175
Ile Gly Ile Glu Trp Pro Gly Val Gln Gly Glu Tyr Lys Gly Ser Ala
180 185 190
Ser Ala Glu Gly Tyr Glu Leu Glu Asp Gly Thr Ala Leu Asn Leu Ser
195 200 205
Asp Lys Asp Gln Lys Trp Leu Ala Leu Lys Asp Thr Phe Lys Phe
210 215 220
<210> 235
<211> 410
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_DUF1972
<400> 235
Met Gln Arg Glu Asn Glu Val Gln His Val Phe Leu Val Gly Ala Lys
1 5 10 15
Ser Leu Gly Ala Tyr Gly Gly Tyr Glu Thr Phe Val Tyr Lys Leu Thr
20 25 30
Glu His His Gln Asn Lys Lys Asn Ile Lys Tyr His Val Ala Cys Lys
35 40 45
Ala Asn Gly Asp Gly Cys Met Asp Glu Thr Lys Val Asp Gly Val Lys
50 55 60
Gly Ile Asn Gln His Glu Phe Glu Phe His Asn Ala His Cys Phe Lys
65 70 75 80
Ile Asp Ile Pro Gln Ile Gly Ala Ala Gln Ala Ile Tyr Tyr Asp Val
85 90 95
Ala Ala Leu Asn Ala Cys Cys Lys Tyr Ile Lys Glu His Lys Ile Lys
100 105 110
His Pro Ile Val Tyr Ile Met Ala Cys Arg Ile Gly Pro Phe Ala Gly
115 120 125
His Phe Tyr Gln Glu Ile His Lys Leu Gly Gly Thr Val Tyr Leu Asn
130 135 140
Pro Asp Gly His Glu Trp Met Arg Ala Lys Trp Ser Ala Pro Ile Arg
145 150 155 160
Lys Tyr Trp Lys Ile Ser Glu Arg Met Met Val Lys Tyr Cys Asp Leu
165 170 175
Ala Ile Cys Asp Ser Val Asn Ile Glu Lys Tyr Ile His Glu Cys Tyr
180 185 190
Asp Gly Lys Gly Ile Lys Gly Arg Asn Pro Lys Thr Thr Phe Ile Ala
195 200 205
Tyr Gly Ala Asp Leu Thr Leu Ser Lys Leu Ala Asp Asp Asp Glu Lys
210 215 220
Leu Val Asn Trp Tyr Lys Glu Lys Gly Leu Ala Lys Lys Gly Tyr Tyr
225 230 235 240
Leu Val Val Gly Arg Phe Val Pro Glu Asn Ser Phe Glu Val Met Ile
245 250 255
Arg Glu Phe Met Lys Ser Gly Ser Lys Lys Asp Phe Ala Leu Ile Thr
260 265 270
Asn Val Asn Asp Lys Phe Leu Asn Glu Leu Glu Glu Lys Leu His Phe
275 280 285
Lys Ser Asp Lys Arg Ile Lys Phe Val Gly Thr Val Tyr Asp Gln Glu
290 295 300
Leu Leu Lys Lys Ile Arg Glu Asn Ala Tyr Ala Tyr Phe His Gly His
305 310 315 320
Thr Val Gly Gly Thr Asn Pro Ser Leu Ile Glu Ala Leu Gly Ser Thr
325 330 335
Asp Leu Asn Leu Leu Val Asp Val Gly Phe Asn Lys Glu Val Ala Glu
340 345 350
Asp Cys Ala Leu Tyr Trp Ser Arg Glu Pro Gly Ser Leu Ala Arg Leu
355 360 365
Ile Asp Arg Ala Asp Lys Met Ser Thr Glu Glu Ile Ala Glu Met Gly
370 375 380
Arg Lys Ala Lys Lys Arg Val Ala Glu Glu Tyr Thr Trp Asp Lys Ile
385 390 395 400
Cys Gly Gln Tyr Glu Glu Val Phe Thr Lys
405 410
<210> 236
<211> 209
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_DUF1919
<400> 236
Met Arg Asp Gln Thr Val Lys Ile Ser Lys Tyr Tyr Arg Thr Phe Leu
1 5 10 15
Arg Arg Lys Leu Asn Ala Glu Asn Arg Lys Arg Leu Lys Asn Lys Asn
20 25 30
Phe Thr Val Leu Cys Asn Asn Cys Val Gly Gly Val Ile Leu His Glu
35 40 45
Leu Gly Glu Arg Phe Asn Ser Pro Thr Val Asn Leu Phe Phe Lys Ala
50 55 60
Glu Asp Tyr Leu Lys Phe Leu Glu Asn Leu Asp Tyr Tyr Leu Lys Gln
65 70 75 80
Ala Leu Val Glu Val Gly Ser Glu Lys Asn Tyr Pro Val Ala Lys Leu
85 90 95
Asp Asp Ile Thr Ile Tyr Phe Met His Tyr Ser Ser Phe Asp Glu Ala
100 105 110
Lys Ile Thr Trp Gln Lys Arg Val Ala Arg Ile Asn Lys Asn Asn Leu
115 120 125
Tyr Val Ile Phe Val Gln Gln Ser Gly Cys Thr Glu Gln Val Leu Glu
130 135 140
Ala Phe Asp Lys Leu Pro Tyr Lys His Lys Leu Ala Leu Thr Ala Lys
145 150 155 160
Pro Met Pro Glu Ile Lys Cys Ser Tyr Cys Ile His Gly Thr Ala Gln
165 170 175
Pro Asn Gly Glu Val Met Asp Leu Cys Lys Tyr Glu Gly Lys Phe Thr
180 185 190
Gly Lys Arg Trp Ile Asp Glu Tyr Asp Tyr Val Gly Phe Leu Asn Lys
195 200 205
Lys
<210> 237
<211> 371
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_UDP-galactopyranose_mutase
<400> 237
Met Tyr Asp Tyr Leu Val Val Gly Ser Gly Leu Tyr Gly Ala Ile Phe
1 5 10 15
Ala His Glu Ala Lys Ala His Gly Lys Ser Val Leu Val Val Asp Lys
20 25 30
Arg Pro Asn Ile Gly Gly Asn Val Tyr Thr Glu Asn Ile Glu Gly Ile
35 40 45
Asn Val His Lys Tyr Gly Ala His Ile Phe His Thr Asn Asn Lys Lys
50 55 60
Val Trp Asn Tyr Ile Thr Gln Phe Ala Glu Phe Asn Arg Phe Thr Asn
65 70 75 80
Ser Pro Val Ala Asn Tyr Lys Gly Glu Leu Tyr Ser Leu Pro Phe Asn
85 90 95
Met Tyr Thr Phe Asn Lys Met Trp Gly Val Val Thr Pro Glu Glu Ala
100 105 110
Ala Ala Lys Ile Glu Glu Gln Arg Lys Glu Ile Thr Gly Glu Pro Lys
115 120 125
Asn Leu Glu Glu His Ala Ile Ser Leu Val Gly Arg Asp Ile Tyr Glu
130 135 140
Lys Leu Ile Lys Gly Tyr Thr Glu Lys Gln Trp Gly Arg Asp Cys Lys
145 150 155 160
Asp Leu Pro Ala Phe Ile Ile Lys Arg Leu Pro Val Arg Leu Thr Phe
165 170 175
Asp Asn Asn Tyr Phe Asn Ala Leu Tyr Gln Gly Ile Pro Ile Gly Gly
180 185 190
Tyr Thr Lys Met Ile Ala Asn Leu Leu Asp Gly Ile Glu Val Arg Leu
195 200 205
Asn Ile Asp Tyr Leu Glu Asn Lys Val Glu Leu Asp Ala Leu Ala Gly
210 215 220
Lys Val Val Tyr Thr Gly Pro Ile Asp Ala Tyr Phe Asp Tyr Lys Leu
225 230 235 240
Gly Thr Leu Glu Tyr Arg Ser Val Arg Phe Glu Asn Glu Leu Leu Asp
245 250 255
Lys Pro Ser Ser Gln Gly Asn Ala Ala Val Asn Tyr Thr Asp Arg Glu
260 265 270
Thr Pro Trp Thr Arg Ile Ile Glu His Lys Trp Phe Glu Phe Gly Arg
275 280 285
Asp Glu Asn Gly Asn Asp Leu Pro Lys Thr Ile Ile Ser Arg Glu Tyr
290 295 300
Ser Ser Glu Trp Lys Pro Gly Asp Glu Pro Tyr Tyr Pro Val Asn Asp
305 310 315 320
Ala Lys Asn Ser Leu Leu Tyr Ser Glu Tyr Lys Lys Leu Ala Asp Ala
325 330 335
Glu Ser Lys Val Ile Phe Gly Gly Arg Leu Gly Glu Tyr Lys Tyr Tyr
340 345 350
Asp Met Asp Gln Ile Ile Ala Ala Val Leu Glu Arg Cys Glu Arg Glu
355 360 365
Phe Asp Val
370
<210> 238
<211> 252
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_DUF4422
<400> 238
Met Asn Gly Lys Ile Ile Val Val Thr His Lys Glu Tyr Lys Met Pro
1 5 10 15
Cys Asp Thr Val Tyr Leu Pro Val Cys Val Gly Val Gly Arg Asp Ala
20 25 30
Leu Arg Asn Lys Tyr Gln Ala Asp Asp Glu Gly Glu Asn Ile Ser Asp
35 40 45
Lys Asn Ile Leu Tyr Cys Glu Leu Thr Ala Leu Tyr Trp Ala Trp Lys
50 55 60
Asn Leu Asn Cys Asp Tyr Ile Gly Leu Ala His Tyr Arg Arg Tyr Leu
65 70 75 80
Thr Glu Ser Lys Arg Ser Lys Asn Ile Glu Asp Ala Leu Ser Gln His
85 90 95
Arg Ile Glu Glu Leu Leu Met Asp Tyr Asp Ile Ile Val Pro Arg Glu
100 105 110
Lys Arg Tyr Ser Gln Thr Ile Ala Asp His Tyr Ile Asn Cys Ile Lys
115 120 125
Ser Arg Lys Asp Ala His Lys Ile His Leu Gln Leu Leu Arg Asp Ser
130 135 140
Ile Leu Glu Val Ala Pro Glu Tyr Ile Ala Glu Tyr Asp Lys Thr Met
145 150 155 160
Asn Gly His Ser Ala His Met Leu Asn Met Phe Val Met Lys Lys Gln
165 170 175
Asn Leu Asp Asn Tyr Cys Glu Trp Leu Phe Lys Ile Leu Phe Val Leu
180 185 190
Glu Lys Lys Ile Tyr Asp His Asp Val Tyr Tyr Asp Arg Ile Met Gly
195 200 205
Ala Phe Ser Glu Phe Leu Leu Asp Val Trp Ile Arg Thr Asn Lys Lys
210 215 220
Thr Tyr Ile Glu Val Glu Leu Ile Glu Thr Glu Arg Asp Tyr Trp Gly
225 230 235 240
Lys Ile Lys Trp Ala Leu Lys Arg Lys Leu Phe Glu
245 250
<210> 239
<211> 449
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_GT2
<400> 239
Met Arg Ile Leu His Tyr Ser Leu Gly Phe Pro Pro Tyr Arg Arg Gly
1 5 10 15
Gly Leu Thr Lys Tyr Cys Leu Asp Leu Met Val Ala Gln Glu Met Gln
20 25 30
Gly Asn Val Val Ala Met Cys Trp Pro Gly Glu Ile Gly Ile Ile Lys
35 40 45
Lys Lys Lys Val Ala Ile Lys Lys Arg Lys Lys Tyr Ser Ile Gly Lys
50 55 60
Ser Lys Ile Glu Asn Tyr Glu Ile Gln Gly Ile Leu Pro Val Pro Leu
65 70 75 80
Leu Glu Gly Ile Lys Asn Pro Asp Leu Phe Thr Glu Lys Lys Asn Gln
85 90 95
Glu Ile Trp Lys Leu Phe Leu Lys Asn Trp Arg Pro Asp Val Ile His
100 105 110
Phe His Thr Leu Met Gly Leu Pro Leu Glu Tyr Val Glu Thr Ala Arg
115 120 125
Lys Leu Gly Ile Lys Thr Leu Phe Thr Thr His Asp Tyr Phe Gly Leu
130 135 140
Cys Pro Arg Thr Thr Leu Val Arg Gln Asn Gly Glu Ile Cys Asp Gly
145 150 155 160
Cys Thr Pro Glu Leu Cys Ala Glu Cys Cys Glu Asn Ala Ile Ser Tyr
165 170 175
Arg Lys Leu Lys Ile Leu Gln Ser Ser Val Tyr Arg Val Leu Lys Asp
180 185 190
Leu Val Ile Val Lys Lys Leu Arg Lys Lys His Trp Asn Glu Ser Lys
195 200 205
Asn Asp Ser Ala Gln His Gln Ala Ser Val Gln Asn Ala Gln Arg Ala
210 215 220
Glu Glu Tyr Val Glu Leu Arg Lys Tyr Tyr Ile Lys Leu Leu Lys Ser
225 230 235 240
Phe Asn Ile Ile His Phe Asn Ser Ser Asn Thr Arg Asp Val Tyr Leu
245 250 255
Lys Ala Ala Lys Glu Val Leu Asn Asn Glu Val Val Ser Ile Ser His
260 265 270
Glu Met Ile Lys Asp Asn Lys Lys Lys Lys Arg Lys His Glu Ile Leu
275 280 285
His Leu Ser Tyr Leu Gly Pro Asp Thr Tyr Asn Lys Gly Tyr Tyr Val
290 295 300
Leu Lys Glu Thr Leu Asn Gln Leu His Lys Glu Gly Tyr Lys Phe Gln
305 310 315 320
Leu Asn Ile Tyr Phe Glu Asp Ala Ser Glu Pro Phe Ile Val Ser His
325 330 335
Ala Pro Tyr Gln Tyr Ser Glu Leu Gly Lys Val Met Asp Asp Ala Asp
340 345 350
Cys Val Ile Leu Pro Ser Leu Gly Asn Glu Thr Phe Gly Phe Thr Val
355 360 365
Leu Glu Ala Leu Ser Tyr Gly Val Pro Val Ile Val Ser Ser Arg Val
370 375 380
Gly Ala Lys Asp Ile Val Glu Glu Gly Lys Asn Gly Phe Val Val Glu
385 390 395 400
Gly Asp Val Asp Ser Leu Lys Thr Lys Leu Thr Ser Val Leu Asn Gln
405 410 415
Pro Glu Ile Leu Glu Asp Met Asn Asn Tyr Ile Val Ala Asn Thr His
420 425 430
Ile Lys Thr Met Thr Glu His Ser Lys Glu Ile Lys Asp Leu Tyr Gln
435 440 445
Lys
<210> 240
<211> 308
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_GT3
<400> 240
Val Gln Asp Lys Val Ser Ile Ile Val Pro Val Tyr Lys Val Glu Arg
1 5 10 15
Glu Leu Asp Arg Cys Val Gln Ser Leu Ile Lys Gln Thr Tyr Lys Asn
20 25 30
Leu Glu Ile Ile Leu Val Asp Asp Gly Ser Pro Asp Gln Cys Pro Glu
35 40 45
Leu Cys Glu Asn Tyr Ala Glu Ile Asp Lys Arg Val Lys Val Ile His
50 55 60
Lys Glu Asn Gly Gly Leu Ser Asp Ala Arg Asn Ala Gly Leu Lys Gln
65 70 75 80
Ala Thr Gly Lys Tyr Ile Leu Tyr Val Asp Ser Asp Asp Tyr Ile Asp
85 90 95
Leu Asp Ala Cys Glu Arg Phe Ile Lys Ala Ala Gly Asn Gln Lys Ile
100 105 110
Asp Ile Val Val Gly Asn Ala Ile Met Glu Lys Pro Asp Gly Lys Glu
115 120 125
Met Met Ile His Ser Ala Thr Pro Ser Gly Ile Thr Tyr Thr Ala Lys
130 135 140
Gln Phe Ile Met Ser Ala Val Lys Ala Tyr Gln Trp Tyr Ala Pro Ala
145 150 155 160
Trp Leu Asn Met Tyr Arg Arg Asp Phe Leu Leu Asp Asn Gln Leu Tyr
165 170 175
Phe Lys Lys Gly Ile Tyr Phe Glu Asp Val Gln Met Leu Pro Arg Val
180 185 190
Phe Leu Ala Ala Lys Lys Ile Thr Cys Ile Tyr Gly Thr Phe Tyr His
195 200 205
Tyr Ile Ile Arg Glu Asn Ser Ile Met Thr Ser Gln Lys Asp Glu Lys
210 215 220
Lys Lys Asn Asp Ser Ile Gln Asn Met Lys Glu Trp Lys Glu Gln Phe
225 230 235 240
Asp Leu Val Asp Asp Val Ala Leu Lys Lys Cys Leu Tyr Gly Met Leu
245 250 255
Val Lys Met Tyr Ile His Glu Cys Arg Gln Tyr Gly Ile Thr Thr Lys
260 265 270
Ala Ile Glu Gly Met Asp Asp Arg Phe Ile Leu Gly Asn Cys Leu Asn
275 280 285
Tyr Lys Glu Arg Leu Lys Ala Thr Met Trp Leu Cys Phe Pro Arg Leu
290 295 300
Leu Ile Lys Gln
305
<210> 241
<211> 367
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_wzy
<400> 241
Val His Met Ser Val Tyr Ile Phe Leu Trp Val Ala Val Val Val Phe
1 5 10 15
Gly Phe Ile Ala Ser Arg Ser Asn Tyr Lys Ala Lys Tyr Phe Val Leu
20 25 30
Phe Ser Phe Phe Leu Met Thr Ile Val Leu Gly Leu Arg Gly Ala Thr
35 40 45
Val Gly Glu Asp Thr Lys Met Tyr Leu Asn Ile Ala Glu Arg Val Thr
50 55 60
Asn Ile Ser Trp Lys Glu Val Phe Ser Ser Phe Pro Thr Ser Gln Trp
65 70 75 80
Arg Tyr Ile Ser Tyr Gly Gly Leu Ser Gly Phe Ser Glu Gln Thr Glu
85 90 95
Thr Val Tyr Leu Ala Tyr Cys Lys Leu Ile Met Leu Ile Phe His Asn
100 105 110
Ala Gln Ala Val Leu Leu Ile Thr Ala Ala Ile Thr Asn Ala Leu Phe
115 120 125
Ala Lys Phe Ile Leu Asp Asn Ile Thr Val Lys Gln Asp Ala Ile Leu
130 135 140
Ala Val Tyr Ile Tyr Met Cys Asp Ala Met Phe Met Asn Ser Phe Asn
145 150 155 160
Thr Met Arg Gln Ile Leu Ala Ile Ser Ile Ala Val Gln Ser Ile Glu
165 170 175
Leu Ile Lys Lys Glu Lys Tyr Lys Lys Ala Ile Ala Cys Val Leu Leu
180 185 190
Ala Ala Cys Phe His Gln Ser Ala Ile Val Phe Phe Val Ala Asp Leu
195 200 205
Phe Tyr Leu Leu Lys Lys Lys Lys Glu Arg Tyr Ile Tyr Leu Leu Val
210 215 220
Thr Leu Cys Ala Leu Pro Val Leu Ile Pro Val Ala Ile Lys Val Val
225 230 235 240
Ser Ile Phe Ser Ser Lys Tyr Ala Ser Tyr Leu Ser Val Ser Phe Trp
245 250 255
Gly Ala Gln Leu Arg Gly Thr Leu Leu Leu Trp Ile Ile Ile Ala Ile
260 265 270
Val Leu Phe Ile Met Ile Arg Ala Asn Gln Ser Asp Asn Ile Asp Trp
275 280 285
Trp Leu Ile Tyr Met Ala Thr Ile Tyr Ile Gly Val Glu Leu Val Gly
290 295 300
Met Gln Leu Thr Val Ile Ser Arg Val Ala Met Tyr Phe Arg Ile Phe
305 310 315 320
Leu Val Leu Leu Phe Pro Ile Ala Gln Lys Tyr Phe Thr Lys Lys Ser
325 330 335
Gly Gln Phe Tyr Lys Ile Gly Val Val Met Leu Met Thr Val Ser Phe
340 345 350
Phe Ser Tyr Ala Ser Ser Pro Asp Arg Leu Tyr Thr Phe Cys Phe
355 360 365
<210> 242
<211> 326
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_GT4
<400> 242
Met Asp Leu Lys Asp Leu Ile Ser Val Ile Val Pro Ile Tyr Gly Val
1 5 10 15
Glu Glu Tyr Leu Asn Lys Cys Ile Asp Ser Ile Ile Asn Gln Thr Tyr
20 25 30
Lys Asn Leu Glu Ile Ile Leu Val Asp Asp Gly Ser Pro Asp Lys Cys
35 40 45
Pro Asp Ile Cys Asp Thr Phe Glu Lys Lys Asp Glu Arg Ile Lys Val
50 55 60
Ile His Lys Lys Asn Gly Gly Leu Ser Asp Ala Arg Asn Ala Gly Ile
65 70 75 80
Asp Thr Ala His Gly Asp Tyr Phe Val Phe Val Asp Ser Asp Asp Trp
85 90 95
Ile Glu Asn Thr Met Val Glu His Leu Leu Phe Ala Cys Lys Lys Tyr
100 105 110
Asn Val Glu Met Ala Thr Cys Ala Arg Tyr Ile Thr Asp Gly His Ser
115 120 125
Thr Arg Ala Val Ala Phe Asn Gly Pro Ala Gly Ala Tyr Ser Ala Glu
130 135 140
Glu Ala Leu Asn Glu Ile Leu Leu Gly Lys Ser Met Asp Val Ala Ala
145 150 155 160
Trp Asp Lys Ile Tyr Ala Arg Asn Leu Phe Glu Glu Ile Arg Phe Pro
165 170 175
Val Gly Glu Asn Asn Glu Asp Ile Ala Val Phe Tyr Lys Leu Val Asp
180 185 190
Leu Ala Gly Arg Val Ala His Thr Gly Thr Thr Glu Tyr Phe Tyr Arg
195 200 205
Ser Arg Pro Gly Ser Ile Thr Lys Leu Lys Tyr Ser Thr Asp Ala Arg
210 215 220
Lys Ile Ile Glu Lys Asn Leu Asn Ser Ile Glu Lys Phe Leu Asp Lys
225 230 235 240
Lys Tyr Pro Ser Cys Leu Pro Ser Phe Tyr Arg Tyr Lys Thr Met Asn
245 250 255
Ile Tyr Ala Leu Leu Asn Lys Tyr Ile Lys Cys Glu Gly Thr Lys Lys
260 265 270
Thr Gln Glu Phe Glu His Leu Met Asn Glu Phe Arg Lys Asn Lys Ser
275 280 285
Tyr Phe Phe Asn Asp Asp Gln Thr Pro Ser Lys Glu Lys Lys Ile Ala
290 295 300
Ile Met Ile Leu Leu His Leu Tyr Asn Pro Tyr Leu Leu Val Lys Glu
305 310 315 320
Lys Ile Thr Gly Tyr Lys
325
<210> 243
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33138_lytR
<400> 243
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Val
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Val Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 244
<211> 10297
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33140 eps gene cluster, complete sequence
<400> 244
atgaatgatt tattttacca tcggctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat ttataattgg aggtttttat 420
agttataatt ctaggataag taatctttca aaagctgata aaggaaaaga agttgtaaaa 480
aatagcagtg aaaaaaatca gatagacctt acctataaaa agtattataa aaatttacca 540
aaatcagttc aaaataaaat agatgatatt tcatccaaaa ataaagaagt tactttaact 600
tgtatttggc aatctgattc agttatttct gaacaatttc aacaaaactt acaaaaatat 660
tatggaaata agttttggaa catcaaaaat atcacttaca atggcgaaac tagtgaacaa 720
ttattggctg aaaaagttga aaaccaagta ttagccacta atcctgatgt tgttttatat 780
gaagctccac tttttaatga taaccaaaac attgaagcaa cagcctcact gactagtaat 840
gagcaactta taacaaattt ggctagtgca ggagcggagg taatagttca accctctcca 900
ccgatctatg gtggtgttgt gtaccccgta caagaagaac aatttaaaca atctttatct 960
acaaagtatc cctatataga ctactgggct agttacccag acaaaaattc tgatgaaatg 1020
aaggggctgt tttctgatga tggagtatat agaacattaa atgcttcggg gaataaggtt 1080
tggctagatt atattactaa atattttaca gcaaactaat taagttataa ataacaatta 1140
ttaaatattg gagaagaaat gcaggaaaca caggaacaaa cgattgattt aagagggatt 1200
tttaaaatta ttcgcaaaag gttaggttta atattattta gtgctttaat agtcacaata 1260
ttagggagca tctacacatt ttttatagcc tccccagttt acacagcctc aactcaactt 1320
gtcgttaaac taccaaattc ggataattca gcagcctacg ctggacaagt gaccgggaat 1380
attcaaatgg cgaacacaat taaccaagtt attgttagcc cagtcatttt agataaagtt 1440
caaagtaatt taaatctatc tgatgactct ttccaaaaac aagttacagc agcaaatcaa 1500
acaaattcac aagtcattac gcttactgtt aaatattcta atccttacgt tgctcaaaag 1560
attgcagacg agactgctaa aatttttagt tcagaagcag caaaactatt gaatgttact 1620
aacgttaata ttctatccaa agcaaaagct caaacaacac ccattagtcc taaacctaaa 1680
ttgtatttag caatatctgt tatagccgga ttagttttag gtttagccat tgctttattg 1740
aaggaattgt ttgataacaa aattaataaa gaagaagata ttgaagctct gggactcacg 1800
gttcttggtg taacaaccta tgctcaaatg agtgatttta ataataatac gaataaaaat 1860
ggcacgcaat cgggaactaa gtcaagtccg cctagcgacc atgaagtaaa tagatcatca 1920
aaaaggaata aaagatagga gttcaggatg gctaaaaata aaagaagcat agacaataat 1980
cgttatatta ttaccagtgt caatcctcaa tcacctattt ccgaacaata tcgtacgatt 2040
cgtacgacca ttgattttaa aatggcggat caagggatta aaagttttct agtaacatct 2100
tcagaagcag ctgcaggtaa atcaaacgag agtgctaatc tagctgttgc ttttgcacaa 2160
caaggtaaaa aagtactttt aattgatggc gatcttcgta aaccgactgt taacattact 2220
tttaaagtac aaaatagagt aggattaacc aatattttaa tgcatcaatc ttcgattgaa 2280
gatgccatac aagggacaag actttctgaa aatcttacaa taattacctc tggtccaatt 2340
ccacctaatc catcggaatt attagcatct agtgcaatga agaatttgat tgactctgtg 2400
tccgattcct ttgatgttgt tttgattgat actccacctc tctatgcagt tactgatgct 2460
caaattttga gtgtttatgt aggaggagtg gttcttgttg tacgtgccta tgaaacaaaa 2520
aaagagagtt tagcaaaaac aaaaaaaata ctggaacaag ttaatgcaaa tatattagga 2580
gttgttttgc atggggtaga ctcttctgag tcaccgtcgt attactacta cggagtagag 2640
taattggaat aaattttaat caaataaaag acagaaattt gtagaagagg ggagcaaatg 2700
attgatattc attgccatat tttaccgggg atagatgatg gagctaaaac ttctggagat 2760
actctgacaa tgctgaaatc agcaattgat gaagggataa caactatcac tgcaactcct 2820
catcataatc ctcaatttaa taatgaatca ccgcttattt tgaaaaaagt taaggaagtt 2880
caaaatatca ttgacgaaca tcaattacca attgaagttt tacccggaca agaggtgaga 2940
atatatggtg atttattaaa agaattttct gaaggaaagt tactgacagc agcgggcact 3000
tcaagttata tattgattga atttccatca aatcatgtgc cagcttatgc aaaagaactt 3060
ttttataact gccatctgca gggaattcag cctattttgg tccaccctga acgtaatagt 3120
ggaatcattg agaacccaga tatattattt gattttgttg gacaaggagt acttagtcag 3180
ataacagctt cgagtgtcac tggtcatttt ggtaaaaaaa tacaaaagct gtcatttaaa 3240
atgatagaaa accatctgac gcattttgtt gcatcagatg cgcataatgt gacgtcacgt 3300
gcatttaaga tgaaggaagc atttgaaatt attgaagata gttatggttc tggtgtatca 3360
cgaatgtttc aaaataatgc agagtcagtg atttcaaacg aaagttttta tcaagaaaaa 3420
ccaacaaaga tcaaaacaaa gaaattttta ggattatttt aaagggatta aatggagtaa 3480
ataatggaag tttttgaggc atcatctgaa ctggaagagc ataagttagt agaattaaaa 3540
aaattttctc gcagagagat aattataaaa agagggattg atattttagg gggattagcg 3600
ggttcaggtt tatttcttat cgcggctgca ttgctttata tcccttacaa aatgagctca 3660
aaaaaggatc aagggccaat gttctataaa caaaaacgct atggtaaaaa tggtaaaatt 3720
ttttatattt tgaaatttag aacaatgatt cttaatgccg agcagtatct agaacttaat 3780
ccagatgtta aagctgctta ccatgccaac ggcaataagc tagaaaacga tccacgggta 3840
acgaagattg gctcatttat aagacgacac tcaattgatg aactgccaca atttatcaat 3900
gttcttaaag gggatatggc attggttggc ccaagaccaa ttttgctttt tgaagcgaaa 3960
gaatatgggg agcgcctctc ttacttactc atgtgtaaac ctggaattac tggttattgg 4020
acaacacatg gtcgaagtaa agtttttttt cctcaacgag cagatttaga actctattat 4080
ctccagtacc atagcaccaa aaacgatatc aagcttctag tactcacaat tgtacaaagt 4140
attaacggat cggacgcata ttaaaaaatg aaaatagcat tagtaggttc cagcggtggc 4200
catttgacac acctgtattt gttaaaaaag ttttgggaaa acgaagatag attttgggtc 4260
acatttgata aaacagatgc aaaatctata ttgaaagaag aaagatttta tccttgttat 4320
tatcccacaa atagaaatgt aaaaaacacg ataaaaaata ccattcttgc atttaaaata 4380
cttagaaaag aaaaaccaga tttgattatt tcgagtggtg ctgcggtagc cgttcctttt 4440
ttttggttag gtaaactatt cggtgcaaag acagtctata ttgaaatatt tgaccggatc 4500
gataaaccaa ccttaacagg aaaattagtt tatccagtta ctgataagtt tatagttcaa 4560
tgggaagagt taaaaaaagt ttaccctaaa gcaattaatt taggaggaat tttctaatga 4620
tttttgtaac tgttggaact cacgaacaac catttaatcg actcattcaa aaaattgatg 4680
aacttgtacg cgatggtgaa atcgaagacg atgtattcat gcaaattggg tactcaactt 4740
atgaacccaa atatactaaa tgggaaaagg ttattggata tgagactatg gaaagatgta 4800
tgaatgaagc gagtacgatt attactcatg gcggaccatc tacctatatg caagtattac 4860
aactaggtaa aattccgata gttgttccac ggcaaatgaa atttgatgag catataaatg 4920
atcatcaact ttgggtaagt aaacaggttg tgaaaaaggg atactcattg attttgtgcg 4980
aagatgttga agacattctc gaaaatatta ttagttccaa aatttcagat accttacaaa 5040
aaaatgtaaa tcacaacact gaatttataa aattattcag tgctgaaatt taccagctat 5100
ttataaaaag tgagaagata tgataccaaa agtaatacac tattgctggt tcggagggaa 5160
acctttacca gaatctgcgc taaaatgtat tgaaagttgg agaaggtttt gtccagatta 5220
tgaaataaaa caatggtctg agaaaaacta tgatgtaaat aaaattcaat atattaagga 5280
agcatatcaa gaaaaaaaat ttgcttttgt aacagatgtt gctcggctcg atataatttg 5340
gaatgaaggc ggtatatatc ttgacacgga tgtagagctt ataaaatctc ttgatgaatt 5400
gctgtataat agtttatatt taggaatgga aagagctggt agagtaaata cgggtttagg 5460
gtttggagct gaagtaaatc atccaattgt gagagctaat ttagaattat atactaatat 5520
tcctttttca ggcaatgata atataacttg tgtgacctat acgacgaatc ttttgaaaaa 5580
atatggtcta aaaaacaaca atgaaattca acatatagat aacgcaataa ttttacctac 5640
tgaatattta tgtcctctaa gttttgaaac aaatcgatta aaaataacgg aaaatactta 5700
ctccatccat cactatgata tgagttggaa agataagaga gataaatttt taagacttaa 5760
aatacaactt agaaaatggg taggggatga tttttatgaa aaagttatta aaagaattgg 5820
aaaataatta tcatgaataa aataaccatg acaagagaga tgagagtggt tgccttatgt 5880
gtcgtaattt tagaatattt aaataataca ggattaattg cgtcttcagt atactctttt 5940
agcatggcga gtacaatcct cttatcctat atcttattct gtaaaaaaag aaaaggattt 6000
tctttaaagg agattattgt actactaatt tcctttattt ttgtagtttt aaatcgtaat 6060
gctagtaatt ttagtttagg gttaatgtgg atactctatt ttatgttaag taagtcggaa 6120
atagatttaa aaaaagtgat gaaaacattt tttgttacct ctagtgtttg ttttattttg 6180
acaatagtac tttatttaat aatgtctctt aataaaagct ctgatatgat gatgtggcgt 6240
ggagatgctt ttataaatcg tatgagttta ggatttatcc aaccgaattt tgcaatgatg 6300
agctttttag gtgtagcgat agcgttatta tatttgagta ctgaaagaca aagaataact 6360
ataattttta ttgccattgt aacttttatt atattttact ttactcaatc aagaacttca 6420
ggatatatct tattttttat tttgagtatt ttatttgtta gtagtaaaaa aactaaaaag 6480
caagtttcaa attttgaaaa aaggagcgtt acagttttac cactacttct tttaataatc 6540
tcttattcgt cgttaaagtt acctattaat caatacctca ataacttgct ttctggtcgt 6600
ctgtcgcttt atcaagagat ttattctaca tttggtatac atttgatagg gaataatgat 6660
gttaaaaata caatgttaga cacagcatat cttcaaagtt tgctagcaaa aggaattttg 6720
tttacattgt ttttatttgt aactttcttt ttcatatttt ttcttaagag aaaaacacaa 6780
actaggttgc aaagtttagt aattatgatg tattttttaa ttgcatttac agaaacatca 6840
ttttttaggt ttgtaatttt atttccagta ttgatggtaa taatggatca gaaagaggct 6900
aataaagtaa tagaaaaggt ggcatagtga gtattaataa aacagagatt gaggaataca 6960
aagtagccgt tatagttcct gtttacaatg tagaggagta tataagagag tgcatcaaat 7020
ctattcaagc tcaaacatat tctaatactg aaattattgt tattaattga tgttatcctt 7080
aaatcttaga gtcactattg tataatttag acaaaggaca aaaacatgca aaaacgctac 7140
tcaaaagaat ttaaagaaac ccttatcgcc ttctatcatt ctggtcaatc cgtcacccag 7200
ctgtctaaag aatacgacgt ggcccctgca acaatttata aatggataga cctctactct 7260
aaatctaatg aaagctccgt ctctaaagct gattttctag aattaaaaag acaactggct 7320
aaagttaagg aagaacgaga catcttaaaa aaagtattga ccatattcgc cgagaaaaag 7380
aagtgagtgc tgcggatatg gctctaacca tacaaacttt agcactcaat gtcagactaa 7440
gctgtcaact ccttgatgtt cctgaatcaa gttattatga acagattaac cgacatccat 7500
ctaaaactca attaaggaga caatacctgt cactcaaaat ttctcaactc ttcaatgcta 7560
accgaggaat ctatggtgct cctagaattc atcatcttct acttaaacaa ggggaaaaag 7620
tcgggttaaa actggtacag aagctaatga agcaacttca actcaagtct gtcgtcctta 7680
aaaaatttaa gcctggatac tccctaagtg atggtattaa cagaaagaac tttatacaaa 7740
atgagcctaa aaagataaat aaggtttggt caaccgacat tacttatatt cctactcaac 7800
aaggatgggc ttatctctca accattgtgg atcgttatac taaaaaagtc attgcttggg 7860
atttgggcaa gcgaatgact gtagaattag tgcaaagaac tttaaataag gccattaaat 7920
cacaagacta tccagaagct gttattcttc attctgacca aggaagccag tatacgagtc 7980
tagagtatga agagttgctt aagtattatg ggatgactca ctctttcagt cgaaggggat 8040
acccttatca taataccagt cttgaatctt ggcatggaca tttaaaaaga gagtgggtgt 8100
atcaatttaa atataagaac tttgaagaag cctatcagag tattttctgg tacatcgaag 8160
ccttttataa ttcaaaacga atccatcaaa gtttagggta tcttacacct aatcaatttg 8220
aaaaggtaag tgcttaaaat aaatagatta aaattctacg tttgttactc taaaaacttg 8280
acttaacgtc ataaaaaaat ttgctaaaaa tttggagcaa ggtattataa aaattttaat 8340
taaaattttt atatttaaaa aactaaagaa tactaaaagt aaaccttata cgagagattt 8400
atttggtcgt gactaaacaa caattctgga ggaaaaattg gaacgaaaaa aaaagagtaa 8460
aaagagtatc ggggtgataa ttatacctat cttaattttt attaccctta taggagcagg 8520
ggcttatgcc ttacgagatt cacttattcc tactgaacat acgaaaacaa atagttcgga 8580
tcaaccgacc aaaacttcgg tttctaacgg ttacgtggag caaaaaggtg aagaagctgc 8640
tgtgggtagt atagcacttg tagatgacgc tggagtacca gaatgggtta aagttccctc 8700
aaaggtaaat ctagataaat ttactgattt atctacggat aatatcacta tttatcgaat 8760
taacaatccg gaagtcttaa aaacagttac cgatcgtacg gatcaacgga tgaaaatgtc 8820
agaagttata gctaagtatc ctaatacttt gattatgaat gcttccgctt ttgatatgca 8880
gacaggacaa gtagctggat ttcaaattaa taatggaaag ttgattcaag actggagccc 8940
aggtacaacg actcaatatg cttttgttat taacaaagat ggttcgtgca aaatttatga 9000
ttcaagtaca actgcttcaa ctattattaa aaacggaggg caacaagcct atgattttgg 9060
tactgcaatt atccgtgatg gtaaaattca accaagtgat ggctcagtag attggaagat 9120
ccatattttt attgcgaatg ataaagataa taatctctat gctattttga gtgatacaaa 9180
tgcaggttat gataatataa tgaaatcagt gtcaaatttg aagctccaaa atatgttatt 9240
acttgatagt ggtggttcaa gtcaactatc cgtcaatggt aaaacgattg ttgctagtca 9300
agatgatcga gccgtaccgg attatattgt gatgaaataa aaataaaaga acctcttggt 9360
tcttttattt tagagattta tcaaaaaggg ttttgactga gtctaattct gtttgagaaa 9420
cgaccttagc tccattttca tctgttgtat gtagattgag cttgctggta ttctttagag 9480
ccttattgta gctcataacg atcgttttag catcattgaa acttatattg gttttaacag 9540
tgtttgaaaa agcatagaga atctctcgat agtaattgaa actttcaagt tttttaatat 9600
gagcaatttt ttgaatattt ccgtagagtt ccattgagac attttgaatc cgagtgattg 9660
aagcatccaa atcagtatcg tcaatttgtg tcatataggc ttggacttga tcagcagttt 9720
gtaaattaac agttccttgt ttaaactcat aaccttcagc attgaatgcc tttggatttt 9780
gcatggtgat tcccccagtg gcctgtacaa gtgatcccat tttattaaca tcgatctgaa 9840
ttactttgtt aatggacaca ttcaataggt ctttaaccat ctggaaaatt ccatcatctc 9900
cattcgtatt gtaaacttca gtgattgttt tttgattagg cattgtcgca aaaactggga 9960
agttcatgaa agtagtttga tttgtcttta cattcgttga agctaaaaca gtagcataag 10020
ctgtattttt agaattattt ttaccagttg caatgataag tgtggtgaat gttttagact 10080
tttttaagtc gatacttgtt gttttaggga aattttcata tgatgttgaa aaggttgatt 10140
caacatttct ataagctacg taggctatag aagcaacagc gataattact aatacaaaaa 10200
taattgaaat aacttttagt actgtgtatt ttttcttacg ataatgacgc ctcttttttt 10260
gattcatggt atctccatat acatattata tacctta 10297
<210> 245
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsR
<400> 245
Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 246
<211> 255
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsX
<400> 246
Met Met Lys Lys Gly Ile Phe Val Ile Thr Ile Val Ile Ser Ile Ala
1 5 10 15
Phe Ile Ile Gly Gly Phe Tyr Ser Tyr Asn Ser Arg Ile Ser Asn Leu
20 25 30
Ser Lys Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys
35 40 45
Asn Gln Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys
50 55 60
Ser Val Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val
65 70 75 80
Thr Leu Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe
85 90 95
Gln Gln Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys
100 105 110
Asn Ile Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys
115 120 125
Val Glu Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu
130 135 140
Ala Pro Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ala Ser Leu
145 150 155 160
Thr Ser Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Ala Gly Ala Glu
165 170 175
Val Ile Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro
180 185 190
Val Gln Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr
195 200 205
Ile Asp Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys
210 215 220
Gly Leu Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly
225 230 235 240
Asn Lys Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
245 250 255
<210> 247
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsC
<400> 247
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asn Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Val Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Glu Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Asn Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 248
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsD
<400> 248
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Asn Glu Ser Ala Asn
50 55 60
Leu Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Ser Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Tyr Ala Val Thr Asp Ala Gln Ile Leu Ser Val
165 170 175
Tyr Val Gly Gly Val Val Leu Val Val Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Ile Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asp Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 249
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsB
<400> 249
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Cys His Leu Gln
115 120 125
Gly Ile Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Val Gly Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Gly Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Ser Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 250
<211> 226
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsE
<400> 250
Met Glu Val Phe Glu Ala Ser Ser Glu Leu Glu Glu His Lys Leu Val
1 5 10 15
Glu Leu Lys Lys Phe Ser Arg Arg Glu Ile Ile Ile Lys Arg Gly Ile
20 25 30
Asp Ile Leu Gly Gly Leu Ala Gly Ser Gly Leu Phe Leu Ile Ala Ala
35 40 45
Ala Leu Leu Tyr Ile Pro Tyr Lys Met Ser Ser Lys Lys Asp Gln Gly
50 55 60
Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys Ile Phe
65 70 75 80
Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Ala Glu Gln Tyr Leu
85 90 95
Glu Leu Asn Pro Asp Val Lys Ala Ala Tyr His Ala Asn Gly Asn Lys
100 105 110
Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile Arg Arg
115 120 125
His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys Gly Asp
130 135 140
Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala Lys Glu
145 150 155 160
Tyr Gly Glu Arg Leu Ser Tyr Leu Leu Met Cys Lys Pro Gly Ile Thr
165 170 175
Gly Tyr Trp Thr Thr His Gly Arg Ser Lys Val Phe Phe Pro Gln Arg
180 185 190
Ala Asp Leu Glu Leu Tyr Tyr Leu Gln Tyr His Ser Thr Lys Asn Asp
195 200 205
Ile Lys Leu Leu Val Leu Thr Ile Val Gln Ser Ile Asn Gly Ser Asp
210 215 220
Ala Tyr
225
<210> 251
<211> 149
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_UDP-N-acetylglucosamine-LPS_N-acetylglucosamine_transferase
<400> 251
Met Lys Ile Ala Leu Val Gly Ser Ser Gly Gly His Leu Thr His Leu
1 5 10 15
Tyr Leu Leu Lys Lys Phe Trp Glu Asn Glu Asp Arg Phe Trp Val Thr
20 25 30
Phe Asp Lys Thr Asp Ala Lys Ser Ile Leu Lys Glu Glu Arg Phe Tyr
35 40 45
Pro Cys Tyr Tyr Pro Thr Asn Arg Asn Val Lys Asn Thr Ile Lys Asn
50 55 60
Thr Ile Leu Ala Phe Lys Ile Leu Arg Lys Glu Lys Pro Asp Leu Ile
65 70 75 80
Ile Ser Ser Gly Ala Ala Val Ala Val Pro Phe Phe Trp Leu Gly Lys
85 90 95
Leu Phe Gly Ala Lys Thr Val Tyr Ile Glu Ile Phe Asp Arg Ile Asp
100 105 110
Lys Pro Thr Leu Thr Gly Lys Leu Val Tyr Pro Val Thr Asp Lys Phe
115 120 125
Ile Val Gln Trp Glu Glu Leu Lys Lys Val Tyr Pro Lys Ala Ile Asn
130 135 140
Leu Gly Gly Ile Phe
145
<210> 252
<211> 168
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_GT1
<400> 252
Met Ile Phe Val Thr Val Gly Thr His Glu Gln Pro Phe Asn Arg Leu
1 5 10 15
Ile Gln Lys Ile Asp Glu Leu Val Arg Asp Gly Glu Ile Glu Asp Asp
20 25 30
Val Phe Met Gln Ile Gly Tyr Ser Thr Tyr Glu Pro Lys Tyr Thr Lys
35 40 45
Trp Glu Lys Val Ile Gly Tyr Glu Thr Met Glu Arg Cys Met Asn Glu
50 55 60
Ala Ser Thr Ile Ile Thr His Gly Gly Pro Ser Thr Tyr Met Gln Val
65 70 75 80
Leu Gln Leu Gly Lys Ile Pro Ile Val Val Pro Arg Gln Met Lys Phe
85 90 95
Asp Glu His Ile Asn Asp His Gln Leu Trp Val Ser Lys Gln Val Val
100 105 110
Lys Lys Gly Tyr Ser Leu Ile Leu Cys Glu Asp Val Glu Asp Ile Leu
115 120 125
Glu Asn Ile Ile Ser Ser Lys Ile Ser Asp Thr Leu Gln Lys Asn Val
130 135 140
Asn His Asn Thr Glu Phe Ile Lys Leu Phe Ser Ala Glu Ile Tyr Gln
145 150 155 160
Leu Phe Ile Lys Ser Glu Lys Ile
165
<210> 253
<211> 235
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_GT2
<400> 253
Met Ile Pro Lys Val Ile His Tyr Cys Trp Phe Gly Gly Lys Pro Leu
1 5 10 15
Pro Glu Ser Ala Leu Lys Cys Ile Glu Ser Trp Arg Arg Phe Cys Pro
20 25 30
Asp Tyr Glu Ile Lys Gln Trp Ser Glu Lys Asn Tyr Asp Val Asn Lys
35 40 45
Ile Gln Tyr Ile Lys Glu Ala Tyr Gln Glu Lys Lys Phe Ala Phe Val
50 55 60
Thr Asp Val Ala Arg Leu Asp Ile Ile Trp Asn Glu Gly Gly Ile Tyr
65 70 75 80
Leu Asp Thr Asp Val Glu Leu Ile Lys Ser Leu Asp Glu Leu Leu Tyr
85 90 95
Asn Ser Leu Tyr Leu Gly Met Glu Arg Ala Gly Arg Val Asn Thr Gly
100 105 110
Leu Gly Phe Gly Ala Glu Val Asn His Pro Ile Val Arg Ala Asn Leu
115 120 125
Glu Leu Tyr Thr Asn Ile Pro Phe Ser Gly Asn Asp Asn Ile Thr Cys
130 135 140
Val Thr Tyr Thr Thr Asn Leu Leu Lys Lys Tyr Gly Leu Lys Asn Asn
145 150 155 160
Asn Glu Ile Gln His Ile Asp Asn Ala Ile Ile Leu Pro Thr Glu Tyr
165 170 175
Leu Cys Pro Leu Ser Phe Glu Thr Asn Arg Leu Lys Ile Thr Glu Asn
180 185 190
Thr Tyr Ser Ile His His Tyr Asp Met Ser Trp Lys Asp Lys Arg Asp
195 200 205
Lys Phe Leu Arg Leu Lys Ile Gln Leu Arg Lys Trp Val Gly Asp Asp
210 215 220
Phe Tyr Glu Lys Val Ile Lys Arg Ile Gly Lys
225 230 235
<210> 254
<211> 364
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_wzy
<400> 254
Met Asn Lys Ile Thr Met Thr Arg Glu Met Arg Val Val Ala Leu Cys
1 5 10 15
Val Val Ile Leu Glu Tyr Leu Asn Asn Thr Gly Leu Ile Ala Ser Ser
20 25 30
Val Tyr Ser Phe Ser Met Ala Ser Thr Ile Leu Leu Ser Tyr Ile Leu
35 40 45
Phe Cys Lys Lys Arg Lys Gly Phe Ser Leu Lys Glu Ile Ile Val Leu
50 55 60
Leu Ile Ser Phe Ile Phe Val Val Leu Asn Arg Asn Ala Ser Asn Phe
65 70 75 80
Ser Leu Gly Leu Met Trp Ile Leu Tyr Phe Met Leu Ser Lys Ser Glu
85 90 95
Ile Asp Leu Lys Lys Val Met Lys Thr Phe Phe Val Thr Ser Ser Val
100 105 110
Cys Phe Ile Leu Thr Ile Val Leu Tyr Leu Ile Met Ser Leu Asn Lys
115 120 125
Ser Ser Asp Met Met Met Trp Arg Gly Asp Ala Phe Ile Asn Arg Met
130 135 140
Ser Leu Gly Phe Ile Gln Pro Asn Phe Ala Met Met Ser Phe Leu Gly
145 150 155 160
Val Ala Ile Ala Leu Leu Tyr Leu Ser Thr Glu Arg Gln Arg Ile Thr
165 170 175
Ile Ile Phe Ile Ala Ile Val Thr Phe Ile Ile Phe Tyr Phe Thr Gln
180 185 190
Ser Arg Thr Ser Gly Tyr Ile Leu Phe Phe Ile Leu Ser Ile Leu Phe
195 200 205
Val Ser Ser Lys Lys Thr Lys Lys Gln Val Ser Asn Phe Glu Lys Arg
210 215 220
Ser Val Thr Val Leu Pro Leu Leu Leu Leu Ile Ile Ser Tyr Ser Ser
225 230 235 240
Leu Lys Leu Pro Ile Asn Gln Tyr Leu Asn Asn Leu Leu Ser Gly Arg
245 250 255
Leu Ser Leu Tyr Gln Glu Ile Tyr Ser Thr Phe Gly Ile His Leu Ile
260 265 270
Gly Asn Asn Asp Val Lys Asn Thr Met Leu Asp Thr Ala Tyr Leu Gln
275 280 285
Ser Leu Leu Ala Lys Gly Ile Leu Phe Thr Leu Phe Leu Phe Val Thr
290 295 300
Phe Phe Phe Ile Phe Phe Leu Lys Arg Lys Thr Gln Thr Arg Leu Gln
305 310 315 320
Ser Leu Val Ile Met Met Tyr Phe Leu Ile Ala Phe Thr Glu Thr Ser
325 330 335
Phe Phe Arg Phe Val Ile Leu Phe Pro Val Leu Met Val Ile Met Asp
340 345 350
Gln Lys Glu Ala Asn Lys Val Ile Glu Lys Val Ala
355 360
<210> 255
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_epsL
<400> 255
Leu Glu Arg Lys Lys Lys Ser Lys Lys Ser Ile Gly Val Ile Ile Ile
1 5 10 15
Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly Ala Tyr Ala Leu
20 25 30
Arg Asp Ser Leu Ile Pro Thr Glu His Thr Lys Thr Asn Ser Ser Asp
35 40 45
Gln Pro Thr Lys Thr Ser Val Ser Asn Gly Tyr Val Glu Gln Lys Gly
50 55 60
Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp Asp Ala Gly Val
65 70 75 80
Pro Glu Trp Val Lys Val Pro Ser Lys Val Asn Leu Asp Lys Phe Thr
85 90 95
Asp Leu Ser Thr Asp Asn Ile Thr Ile Tyr Arg Ile Asn Asn Pro Glu
100 105 110
Val Leu Lys Thr Val Thr Asp Arg Thr Asp Gln Arg Met Lys Met Ser
115 120 125
Glu Val Ile Ala Lys Tyr Pro Asn Thr Leu Ile Met Asn Ala Ser Ala
130 135 140
Phe Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln Ile Asn Asn Gly
145 150 155 160
Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Thr Thr Gln Tyr Ala Phe
165 170 175
Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp Ser Ser Thr Thr
180 185 190
Ala Ser Thr Ile Ile Lys Asn Gly Gly Gln Gln Ala Tyr Asp Phe Gly
195 200 205
Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser Asp Gly Ser Val
210 215 220
Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys Asp Asn Asn Leu
225 230 235 240
Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp Asn Ile Met Lys
245 250 255
Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu Leu Asp Ser Gly
260 265 270
Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile Val Ala Ser Gln
275 280 285
Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 256
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33140_lytR
<400> 256
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys Tyr Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Val Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Ile Gln
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Thr Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Ile
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Thr Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Asp Lys Ser Leu Lys
290 295 300
<210> 257
<211> 16953
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33142 eps gene cluster, complete sequence
<400> 257
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaaaa cattgaagca acatcctcat ggactagtaa 840
tgagcaactt ataacaaatt tggctagtac aggagcagag gtgatagttc aaccctctcc 900
accgatttat ggtggtgttg tgtaccccgt acaagaagaa cagtttaaac aatctttatc 960
tacaaagtat ccctatatag actactgggc tagttaccca gacaaaaatt ctgatgaaat 1020
gaaggggctg ttttctgatg atggagtata tagaacatta aatgcttcgg ggaataaggt 1080
ttggctagat tatattacta aatattttac agcaaactaa ttaagttata aataacaatt 1140
attaaatatt ggagaagaaa tgcaggaaaa acaggaacag acgattgatt taagagggat 1200
ttttaaaatt attcgcaaaa ggttaggttt aatattattt agtgctttaa tagtcacaat 1260
attagggagc atctacacat tttttatagc ctccccagtt tacacagcct caactcaact 1320
tgtcgttaaa ctaccaaatt cggataattc agcagcctac gctggacaag tgaccgggaa 1380
tattcaaatg gcgaacacaa ttaaccaagt tattgttagt ccagtcattt tagataaagt 1440
tcaaagtaat ttaaatctat ctgatgactc tttcaaaaaa caagttacag cagcaaatca 1500
aacagattca caagttatta cgcttactgt taaatattct aatccttaca ttgcacaaaa 1560
gattgcagac gagactgcta aaatttttag ttcagatgca gcaaaactat tgaatgttac 1620
taacgttaat attctatcca aagcaaaagc tcaaacaaca ccaattagtc ctaaacctaa 1680
attgtattta gcgatatctg ttatagccgg actagtttta ggtttagcca ttgctttatt 1740
gaaggaattg tttgataaca aaattaataa agaagaagat attgaagctc tggggctcac 1800
ggttcttggt gtaacaacct atgctcaaat gagtgatttt aataagaata caaataaaaa 1860
tggcacgcaa tcgggaacta agtcaagtcc gcctagcgac catgaagtaa atagatcatc 1920
aaaaaggaat aaaagatagg agttcaggat ggctaaaaat aaaagaagca tagacaacaa 1980
tcgttatatt attaccagtg tcaatcctca atcacctatt tctgaacaat atcgtacgat 2040
tcgtacgacc attgatttta aaatggcgga tcaagggatt aaaagttttc tagtaacatc 2100
ttcagaagca gctgcaggta aatcaaccga gagtgctaat atagctgttg cttttgcaca 2160
acaaggtaaa aaagtacttt taattgatgg tgatcttcgt aaaccgactg ttaacattac 2220
ttttaaagta caaaatagag tagggttaac caatatttta atgcatcaat cttcgattga 2280
agattccata caagggacaa gactttctga aaatcttaca ataattacct ctggtccaat 2340
tccacctaat ccatcggaat tattagcatc tagtgcaatg aagaatttga ttgactctgt 2400
gtccgatttc tttgatgttg ttttgattga tactccacct ctctctgcag ttactgatgc 2460
tcaaattttg agtagttatg taggaggagt ggttcttgtt gcacgtgcct atgaaacaaa 2520
aaaagagagt ttagcaaaaa caaaaaaaat gctggaacaa gttaatgcaa atatattagg 2580
agttattttg catggggtag actcttctga ctcaccgtcg tattactact acggagtaga 2640
gtaattggaa taaattttaa tcaaataaaa gacagaaatt tgtagaagag gagagcaaat 2700
gattgatatt cattgccata ttttaccggg tatagatgat ggagctaaaa cttctggaga 2760
tactttgaca atgctgaaat cagcaattga tgaagggata acaaccatca ctgccactcc 2820
tcatcataat cctcaattta ataatgaatc accgcttatt ttgaagaaag ttaaggaagt 2880
tcaaaatatc attgacgagc atcaattacc aattgaagtt ttaccaggac aagaggtgag 2940
aatatatggt gatttattaa aagaattttc tgaaggaaag ttactgacag cagcgggcac 3000
ttcaagttat atattgattg aatttccatc aaatcatgtg ccagcttatg ctaaagaact 3060
tttttataat attcaattgg agggacttca acctattttg gtccaccctg agcgtaatag 3120
cggaatcatt gagaaccctg atatattatt tgattttatt gaacaaggag tactaagtca 3180
gataacagct tcaagtgtca ctggtcattt tggtaaaaaa atacaaaagc tgtcatttaa 3240
aatgatagaa aaccatctta cgcattttgt tgcatcagat gcgcataatg tgacgtcacg 3300
tgcatttaag atgaaggaag cgtttgaaat tattgaagat agttatggtt ctgatgtatc 3360
acgaatgttt caaaataatg cagagtcagt gattttaaac gaaagttctc atcaagaaaa 3420
accaacaaag atcaaaacaa aaaaattttt aggattattt taaagggatt aaatggagta 3480
aataatggaa gtttttgagg atggctcatc acctgaaccg gaagagcata agttagtaga 3540
attaaaaaaa ttttctcaca gagagataat tataaaaaga gggattgata ttttaggggg 3600
attagtgggt tcaggtttat ttcttattgc ggctgcattg ctctatgtcc cttacaaaat 3660
gagctcggaa aaagatcaag ggccaatgtt ctataaacaa aaacgctatg ggaagaatgg 3720
taaaattttt tatattttga aatttagaac aatgattctt aataccgagc agtatctaga 3780
acttaatccg gatgttaaag ctgcttacca tgccaacggc aataagctag aaaacgatcc 3840
acgggtaacg aagattggat catttataag acgacactca attgatgaac tgccacaatt 3900
tatcaatgtt cttaaagggg atatggcatt agttggtcca agaccaattc tgctttttga 3960
agcgaaagaa tatgggaaac gcctccctta cttactcatg tgcaaaccag gaatcactgg 4020
ttattggcaa gttcctctat tttcagataa tggagtactt caatcagcaa agggggtggc 4080
atgaatggct ccaatagatt cagaagttat agacactaaa gataattcta aagttttaaa 4140
tgatgttgtt gcgagtgaat ctgttatagt accagttaac aacaaagtaa aaaaatcaaa 4200
agcaattttg cataaaattg aaagagcttc atattttagc attaaaagaa tcttcgacat 4260
aatttgctca ttgcttggca ttatagcatt aattccagta gcaatagtaa ctaaaatatg 4320
ttacatagca acaggagata aaaaatcaat attttataaa caaaagagaa tcggaaaaaa 4380
tggtaaaccg atatatatat ataaatttag aagtatggta tggaatgcag atgaagtgtt 4440
aaaagaactt ttaaaagacc ctaagtataa aaaagaatgg gacttaaatc aaaaatttga 4500
aaatgatccg agaataacaa aaatgggaaa tattttaaga aaaacatcat tagatgaatt 4560
gccacaattc atcaatgtaa ttaaaggtga tatgtctatg ataggacctc gacctttagt 4620
tgaaggagag cttaatgctc ataaaggaaa tcatgcaata tatgaaagtg ttcgcccagg 4680
cctctcgggg tggtgggccg cgaacggaag gtcagctact acttatgaaa gaagacttga 4740
attagaatat ttctattgta aaaattgtaa tttaatatta gatattaagt gtgtattctt 4800
aacaatagca gtagtgttat ttaaaacagg agcaaagtag tatcaaacta taatgaaccc 4860
gaattgctag ttgattattt agccatgact tgatacccga tagaatatct taaagtctct 4920
ggttccagtg atttagctga ttttaacagt aaagaatacg ctaaaagtat catctctaat 4980
ttcaattgaa aaacttgagg cgaacgactt ttacaacgct cagctcctag atttgtcaaa 5040
aaagagaaaa ctcgctcaat cacttttcta cgttttgaaa aattagggaa aaggattttc 5100
ttttgcttca tgttcttcct gacaggtgtc attagatcaa ttccttttaa ttccagccta 5160
tcatgcagtg actgacctaa atatcccata tctccaagga ctgttggtgt cccaaattga 5220
ctcaacactt cctcggtcat tgaactatca gccattgaag caggagtaat tgtgtagtct 5280
atgacatagc ctgattcact gactaaagca tgacatttac atccatagaa gtactgtccc 5340
tttgtagcat tgtagccaac atttgcataa tctccaagac ctttgcttct gaaattacga 5400
ataggctgac acaaaggaat ggggaagctg tcaataatgg atacactcat tccttcaacc 5460
tctttaaaga cgagtgcttg gcgaatgact tggatactcg gtaagagggc attacaacgg 5520
cggacaaagc gagaatattc taggaaatta ggaaataaac tttgagcaag ttggtgctta 5580
gctttaagcg tttcactaaa atgcagtacg ccccataggt aacaagcgat aactaagcaa 5640
tctgatgttg cgagatggac gttctttcgg ttttgaacct caaggggaac actcgtttga 5700
taaagcgtct caatggttgt cagtaaataa acaaaaactt ttcgaagtgt gctattataa 5760
gtcatataag tcgtgcgctt tctaatgctt agtagtttaa gattaggata gcacgactta 5820
tttattttcc aatgaaatta actagcaatt cgggtaatat atatatggag gtaccatgaa 5880
tcaaaaaaag aggcgtcatt atcgtaagaa aaaacacaca gtactaaaag ttatttcaat 5940
tatttttgta ttagtaatta tcgctgttgc ttctatagcc tacgccgctt atagaaatgt 6000
tgaatcaaca ttttcaacat cgtatgaaaa tttccctaaa acaacaagta tcgacttaaa 6060
aaagtctaaa acattcacca cacttatcat tgcaactggt aaaaataatt ctaaaaatac 6120
agcttatgct actgttttag cttcaacgaa tgtaaagaca aatcaaacta ctttcatgaa 6180
cttcccagtt tttgcgacaa tgcctaatca aaaaacaatc actgaagttt acaatacgaa 6240
tggagatgat ggaattttcc agatggttaa agacctattg aatgtgtcca ttaacaaagt 6300
aattcatatt gatgttaata aaatgggatc acttgtacag gccactggtg gaatcatcat 6360
gcaaaataca aaggcattca atgctgaagg ttatgagttt aaacaaggaa ctgttaattt 6420
acaaactgct gatcaagtcc aagcctatat gacacaaatt gacgatactg atttggatgc 6480
ttcaatcacc cggattcaaa atgtctcaat ggaactctac ggaaatattc aaaaaattgc 6540
tcatatgaaa aaacttgaaa gtttcaatta ctatcgagaa attatctatg ctttttcaaa 6600
cactgttaaa accaatataa gtttcaatga tgctaaaaag atcgttatga gctacaataa 6660
ggctctaaag aataccagca agctcaatct acatacaaca gatgaaaatg gagctaaggt 6720
cgtttctcaa acagaattag actcagtcaa aacccttttt gaaaaatctc taaaataaaa 6780
gaaccaagag gttcttttat ttttatttca tcacaatata atccggtacg gctcgatcgt 6840
cttgactagc aacaatagtt ttaccattga cagatagttg acttgagccg ccactatcaa 6900
gtaataacat attttggagc ttcaaatttg acactgattt cattatatta tcataacctg 6960
catttgtatc actcaaaata gcatagagat tattatcttt atcattcgca ataaaaatat 7020
ggatcttcca gtctactgag ccatcacttg gctgaatctt accatcacgg ataattgcag 7080
taccaaaatc ataggcctgt tcccccccat ttttaataat agttgaagca ggtgtacttg 7140
aatcatagat tttgcatgaa ccatctttgt taataacaaa ggcatattga accattgtac 7200
ctggactcca gtcttgaatc aactttgcat tgttgatttg gaagccagct acttgtcctg 7260
tctgcatatc aaatgccgag gcattcataa tcaaggtatt agggtattta gctataactt 7320
cggacatttt cattcgttga tctgtacgat tggtaactgt ttttaggact tctggattat 7380
taattcgata aatagtgata ttattcgtag ataaatcagt aaatttatct agatttacct 7440
ctgagggaac cttaacccat tttggtacac caacatcatc tacaagtgct atactaccca 7500
cagcagcttc ttcacctttt tgctccacgt aaccgttaga agccgaagtt ttggtcggtt 7560
gatccgaact atttgttttc gtatgttcag taggaataag tgaatctctt aaggcataag 7620
cccctgctcc tataagggta ataaaaatta agataggtat aattatcacc cagatattat 7680
ttttcgtctt tttttttcgt tccaatttct cctccaaaat tgttgtttag tcacgaccaa 7740
ataaatctct cgtgtaaact ttggttttaa catcctcaag ttctttagta taacgatttg 7800
aaacaataac atcagaaatc attttaaact catctagatt atgaactacc cgagaattgt 7860
agaactgatc atctgtgagt gtcggctcat agacaactac ttcaattccc ttgcctttaa 7920
ttcgtttcat aattccttga attgaacttg atctaaaatt atcagaattt gacttcattg 7980
tcaagcggta gatgcccacc actttgggat accgtttaat gatcatatct gcgatatgat 8040
cttttctcgt cctatttgat tcaacaacag cctcaattaa tttctcagaa acttggtcat 8100
agttagccaa aagctgtttg gtatctttcg gtaagcaacc tgtcgcttct attgtgtgta 8160
atattcttaa caaaaaccaa aagaaaacct gtatttatgc aggttttaag taacttttgt 8220
ttttcttaac atattctagt aaattttgat aaatatcttg atatttaaaa acaatctcga 8280
ctcttttatc ttcaaaaata tatatgcatt ctataaattt aaaaagtata tctctatcaa 8340
ttttcttaaa taattctcta ttgttaaaaa gttttattaa atcaacttga taatttggaa 8400
tattatctaa ttcttccaat tccttatcaa ttattttaat tgtttctcta atttctaaaa 8460
tttcattatt atatgaactt gaataatttt tataatcttc gaaagttatt atttcttctt 8520
tccaatctgt atataaagat tgttttgatg ttatccttaa atcttagagt cactattgta 8580
taatttagac aaaggacaaa aacatgaaaa aatgctactc aaaagaattt aaagaaaccc 8640
ttatcgcctt ctatcattct ggtcaatccg tcacccagct gtctaaagaa tacgacgtgg 8700
cccctgcaac aatttataaa tggatagacc tctactctaa atctaatgaa agctccgtct 8760
ctaaagctga ttttctagaa ttaaaaagac aactggctaa agttaaggaa gaacgagaca 8820
tcttaaaaaa agtattgacc atattcgccg agaaaaagaa gtgagtgctg cggatatggc 8880
tcaaaccata caaactttag cactcaatgt cagactaagc tgtcaactcc ttgatattcc 8940
tgaatcaagt tattatgaac ggattaaccg acatccatct aaaactcaat taaggagaca 9000
atacctgtca ctcaaaattt ctcaactctt caatgctaac cgagaaatct atggtgttcc 9060
taaaattcat catcttctac ttaaacaagg ggaaaaagtc gggttaaaac tggtacagaa 9120
gctaatgaag caacttcaac tcaagtctgt agtcattaag aaatttaagc ctggatactc 9180
actaagtgat cacatcaatc gaaaaaatct catacagact gaacctacaa agaaaaataa 9240
ggtttggtca accgacatta cttatattcc tactcaacaa ggatgggctt atctctcaac 9300
cattatggat cgttatacta aaaaagtcat tgcttgggat ttgggcaagc gaatgactgt 9360
agaattagtg caaagaactt taaataaggc cattaaatca caagactatc cagaagctgt 9420
tattcttcat tctgaccaag gaagccagta tacgagtcta gagtatgaag agttgcttaa 9480
gtattatggg atgactcact ctttcagtcg aaggggatac ccttatcata atgccagtct 9540
tgaatcttgg catggacatt taaaaagaga gtgggtgtat caatttaaat ataagaactt 9600
tgaagaagcc tatcagagta ttttctggta catcgaaggc ttttataatt caaaacgaat 9660
ccatcaaagt ttagggtatc ttacacctaa tcaatttgaa aaggtaagtg cttaaaataa 9720
atagattaaa attctccgtt tgttacttta aaaacttgac ttaacgtcat ttcataaagt 9780
tttctataaa tttcttttaa agaatgttta aagttaaaaa caagtaaaac agtataataa 9840
taatacttta ttcctttatt ttccttgtac tttgatctgg tatttttcat tccccaataa 9900
aaatcgtaac ttcttaaatt ttttttgaat ttatttgagt ctaagtattc tttaatcatt 9960
gtttctgtta cttctcctaa tgggttattt ttataatatt ttcttaaata ctttgcaatc 10020
aattcttgct ttcctgcatt ttttcttgat atactttgtg gactaattct atacttcaat 10080
aatatatctt gaacatttcc gatagttata ccatttccaa ctgccctcaa taaaaaatca 10140
taatcttcac aagaaaaaac attattatat ccatttagtt tttcataaac attctttcta 10200
acaaaataag taggatgtga tacacaattt tttataaaaa gtagtttttt tacattctct 10260
gcttttcttg gaaaaatgat atctttgaaa tctacattat caataaaaca tgtaacattt 10320
gatccacaca tgtcatagcc tgtttctttt aaaaattcat attgtttttc aaaacgtatc 10380
ggaagagcta tatcatctgc atccatccta gctatatatt ctccagttgc attttttaat 10440
gctttattca aacttttagg aagacctata tttttttcat taatgataat tttcattcta 10500
tcatctttat atgattttaa aaagtcaatt ctccactttt catctggatt atccacaact 10560
ataataagtt ctaaattaga ataagtttga ttaagtatag attcaattga tgatttaagt 10620
tcattttcac tttctttata tattgacatt ataacactaa ctttttcttt cattttctat 10680
ctcctttgct ggtacaccaa ctaatactcc acttttaaag tttttattta ctacagcatt 10740
cgcgccaact tttatatcat cgcctaaaac aacatcacct ataattactg ccccaaagcc 10800
tacatcacaa ttatttccta ttattggtgc atatccgtct aatcctttat ttcctataca 10860
attattacca tgtagtttaa ggttcttccc acatttagcc aaaccactaa ctacaatttc 10920
tccagaatga taaattatta gcccttcatc aattgtatta atacttgcga aaatacccat 10980
tcttttacct atacaatttt ttctgtaaat atggtatgca tataatggaa aaaagatttt 11040
atttttttga ttataataat attcggcttt tcttaaatgc ttttgaaatc tccatatatc 11100
tatcttttca tttcctgaca aaactttttt tatataatct ttcttatttt caatatacgc 11160
ttttttttca agattcaaaa catgcattaa tttttttcta tcacttatca tctttaattt 11220
ctcctaactt tttataatta tttcctatta aaaaaattat aaaaataaat attggataat 11280
aatatgttat ttctgtcatc atcattatta aaaagctaaa aataattgca acatttaatg 11340
aatttatttt tttattatcg attttatttg gaaatccttg tttaaaaatc atcataaaca 11400
atgttaaagt tccaaatata ccacactcac acatagattg taaaatagta ttatgtgctt 11460
gccataatcc attatgatag ctaataatat ttccgtcatt aacatatcca aatccaaata 11520
aaaaataatt ttttgccaca taaattgcat tttcccatat tatagttcta cccgtcaacg 11580
taatatcttt gtttaataat tcaaaaaggc cactaaatag atttaacaaa tttccattta 11640
ttaaattaat ggacaaagca attaaaatta aaaatccaaa acaataaact aaacttagtt 11700
tctttttatt tctcaataat ctcaataata aataaaacaa aatagttaac acaataacaa 11760
caatacctgt ggcaacccat ttagataata tatttataat tgctaataca tacgatgtaa 11820
ttagtttttt tcgatctttt ttattctttg aagaatactc gttgtaataa attaaagaaa 11880
tgtacataag agcgattgaa aattccgtaa aacgagttct tattcctaaa aaataaatac 11940
catactgtgg aaatattcca ttaggataaa taatcattaa taaaatatta attagaacat 12000
aactaaaaat cattttgtat acaattctaa atgcatcctt atcgcaaata taatttttta 12060
ttaataaata caagccaaca aatacaaccg attgatagcc ccatttaaca atttcaccat 12120
tatgattcaa tgttggtata aatatcagaa gtctccataa tagtaaaagg aaaatggtaa 12180
tatcaaattt aacatttttt ttaaatgacc ttaataataa aattaacgta cataataagg 12240
ccattccaat actcatataa ttaagcacat tgatttttac aaaaaaatta ggctgaataa 12300
atgcaaaaaa acaaagtgca atgaaaattt tttctttagt tttcatcatg agccccctta 12360
gaatatattt tttctattac atttattaat ttatctgtat cataatctaa tagtttttct 12420
gaataacgtt cttctctttt catattaaaa ttattgaaaa attgtatcca ttcgtcagcg 12480
ttgtcaatat ccaagaattt aacattgtta tttaatttta catctttagt tattttgtct 12540
gaaaaagcac atggacaacc attcgcttga gcttcaactg ctgtaagagg aaatccttca 12600
aatatagaag gcaaaacaaa gcaatccata gcactataat aatattcaga tttaacattt 12660
tcaagcatta taactttttc atttatattt atttcattta ttttagatac tatttcttct 12720
tttaattctc caaaacctat tatcattaat ttagcattac tattctttat ttttgcaaaa 12780
cattcaatca gaaaagactg gtttttttgc acatttaatc ttcctacatt gccaaaaatt 12840
atatcttttt ctccaatatt cagttctttc cttatttgtt ttcgattttc agcagaatat 12900
gaaaattttt ttaaatcaat tccatttttt actactgtaa actttctatt tccataagca 12960
aattctccag cttcttctga acatgccaag aattcatttg catatttcga agcattaaga 13020
atcataaaac gttttattac tccttttaaa gatttttctg ttttgctatt atgtgaatga 13080
tttattaaaa ttttgcagcc attttgtttg gcatatttta aggttagtct tgatgtactt 13140
aacatgtgtg aatgaactac atcaaaatga ttttttttaa aaaaaccttt taaaaatttt 13200
atatatttaa aaatattttt atactctcta aatgaacatt tatgtattgt acagccaatt 13260
ttctgtattt cttcatcaaa ataataattt ccagtataat gatataaaaa atgaatttca 13320
tatttttgca tatctaaatt tcttataata ttcataatta aattttcaag accaccctgt 13380
tgcatatttg gaacaatttc gagtatcttt attttattca tttttttctt ccccttctta 13440
gcatttttct ataaacaaaa tctctcataa aattaggcat ataacataca attgcgtgtg 13500
gtacaattga tttcaaatat tcaaatttag taaaataacc attttctaat tgttctttct 13560
taaatttctt aatgctttta aaatatttgt gtccaccacg tcttttataa aaatcttcac 13620
caattctcat atatacatat atatcttgaa tattataaca tttacaacca tttctaatca 13680
ttcttagcca catatcataa tcttcgcaca agtaatattc tctataattt ccagcattta 13740
caacttcact tttcttaaac ataacagatg gatgtctaaa aggattccta ctttttgaaa 13800
atttaataat atcctcattc gtttctggta atataacatt acaaacaaca ttatctattg 13860
aatcaataaa ttcagataca cttgttccga ccattccaat atctgaatat ttctcaaata 13920
tttcaaattc cttttcaatt ctttttggca tagaatagtc atctgagtcc attctggcaa 13980
tatactcatt tgagcattct tcaactccct ttttcaatgc aggacctaag ccaacatttt 14040
tttctatagc aacaacttta aactcttttg gatattttgt tttatatttt tcaacaacgt 14100
catatagttc cttggttagt ggaccatcct ccactaaaac gaactcattt ggtttaatag 14160
tttgttcaaa catgcttttt atactttcat caaaccatgt aggattttct ttatgataaa 14220
cagacattaa aacgctatac tttttttcca caagattcac ttctccttaa aaaatcattt 14280
cactatacaa tgcaatgata aatactcaaa ctgactacta ttgtataatc tatactatat 14340
ttttataacc cgaattgcta gttaatttca ttggaaaata aataagtcgt gctatcctaa 14400
tcttaaacct aagcattaga aagcgcacga cttatatgac ttataatagc acacttccaa 14460
aagtttttgt ttatttactg acaaccattg agacgcttta tcaaacgagt gttccccttg 14520
aggttcaaaa ccgaaagaac gtccatctcg caacatcagg ttgcttagtt atcgcttgtt 14580
acctatgggg cgtactgcat tttagtgaaa cacttaaagc taagcaccaa ttggctcaaa 14640
gtttatttcc taatttccta gaatattctc gctttgtccg ccgttgtaat gccctcttac 14700
cgagtatcca agtcattcgc caagcactcg tctttaaaga ggttgaagga atgagtgtat 14760
ccattattga cagcttcccc attcctttgt gtcagcctat tcgtaatttc agaagcaaag 14820
ttcttggaga ttatgcaaat gttggctaca atgctacaaa gggacagtac ttttatggat 14880
gtaaatgtca tgctttagtc agtgaatcag gctatgtcat agactacaca attactcctg 14940
cttcaatggc tgatagttca atgaccgagg aagtgttgag tcaatttggg acaccaacag 15000
tccttggaga tatgggatat ctaggtcagt cactgcatga taggctggaa ttaaaaggaa 15060
ttgatctaat gacacctgtc aggaagaaca tgaagcaaaa gaaaatcctt ttccctaatt 15120
tttcaaaacg tagaaaagtg attgagcgag ttttctcttt tttgacaaat ctaggagctg 15180
agcgttgtaa aagtcgttcg cctcaagttt ttcaattgaa attagagatg atacttttag 15240
cgtattcttt actgttaaaa tcagctaaat cactggaacc agagacttta agatattcta 15300
tcgggtatca agtcatggct aaataatcaa ctagcaattc gggttttgaa tagtaagcag 15360
taaataactg actgtattgc ttctaaccga ttaagtggca cggtttagaa ccgaaaagac 15420
atattgtcct gtgctacaat caagtcgacc aaaacaaatt gaaggtgagc gtcaaatatg 15480
cccaatacca ttgactcacc gttattttca ataaggcagg tacctaaatc catacctcta 15540
tttatgcata ataaacttta gcttattaat tatttctgcc tttaaaaata ttaataaaac 15600
aacataaata attatgccca cagtaatttc caacagaatg aatatccaag atgtcggtgt 15660
taacaaacta attttaaaga caattagaaa catcactaat cctgcaatta aatacttaga 15720
taaatccgaa aacagtgtat gcaaattaag ctgtttatga attataaaaa gttgatacac 15780
agttacagac atttcagaaa ttacagttgc aattgatgca ccaacagtac ctagatatat 15840
aatcagtgga atatttaaca ttaaattgac tatcgatcca atgatcaccg acactgtata 15900
tgacttattt tgattagttg gtaaaagata ttgagtacct attgcgttgc tccaagctat 15960
aaaaataatt gcgattgact cgatcattaa cacaggaata acatcactaa attgagatgt 16020
aaaaaaaagt ggcacgaatt taggagtaat agctatcaga ccaaacatca taggaatcga 16080
aattgccgac acaaaagaaa aacctgcgta catgtattcc ttaattttac tatactctct 16140
atgtgcaaag gcatttgcaa cacgtggcaa catgacagta cctgttgcag tagcaatagc 16200
caaaaccagt ttaactattt tatcagactg atcaaaaaag ccggagctcg tgacagaatc 16260
caatgaacct aacattgttt tattcaaaac ccaataaatt tggacagcaa tttgtgggat 16320
aaacataact aaagattgct ttaaatgctt tattggcctt aattcacgat agttaacctt 16380
tacgagatat ctgtgtaaac ttgggaaaaa agttaaatta ccaattaatg tagataaaac 16440
tgttatcaat atatatatat tcaaatcatt gtaagatttg acaaatagga aaatactgaa 16500
tagagcaagt aacttaacta taaaatttct taatacagtt actttaaaat tttcaattcc 16560
cataaaaaac caagagatat caaatgcagc tgcaactata gcaatggatt gagacaaata 16620
gtatgcatga tactgaccat taatgattaa aaaagcaacg aacaaaaaat atgctaaaca 16680
tattgtaaat agtcttaaaa taaatatttc ataaaagact ttagacattt tgacctgatt 16740
atccctaaca aaggcaatct gacgattccc atacaaaccg actcctatac taccaaataa 16800
aacaaaatac tgaacaatag aattggtata tgagttaatt ccaatacctg aagggcccaa 16860
aattcttgac aaataaggaa tggtaagtaa tggcacaatt attacaaaga cctgatatat 16920
tgcattataa agataatttt ttgcgatttg cat 16953
<210> 258
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsR
<400> 258
Met Asn Asp Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 259
<211> 237
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsX
<400> 259
Leu Glu Val Phe Tyr Ser Tyr Asn Ser Arg Ile Asn Asn Leu Ser Lys
1 5 10 15
Ala Asp Lys Gly Lys Glu Val Val Lys Asn Ser Ser Glu Lys Asn Gln
20 25 30
Ile Asp Leu Thr Tyr Lys Lys Tyr Tyr Lys Asn Leu Pro Lys Ser Val
35 40 45
Gln Asn Lys Ile Asp Asp Ile Ser Ser Lys Asn Lys Glu Val Thr Leu
50 55 60
Thr Cys Ile Trp Gln Ser Asp Ser Val Ile Ser Glu Gln Phe Gln Gln
65 70 75 80
Asn Leu Gln Lys Tyr Tyr Gly Asn Lys Phe Trp Asn Ile Lys Asn Ile
85 90 95
Thr Tyr Asn Gly Glu Thr Ser Glu Gln Leu Leu Ala Glu Lys Val Gln
100 105 110
Asn Gln Val Leu Ala Thr Asn Pro Asp Val Val Leu Tyr Glu Ala Pro
115 120 125
Leu Phe Asn Asp Asn Gln Asn Ile Glu Ala Thr Ser Ser Trp Thr Ser
130 135 140
Asn Glu Gln Leu Ile Thr Asn Leu Ala Ser Thr Gly Ala Glu Val Ile
145 150 155 160
Val Gln Pro Ser Pro Pro Ile Tyr Gly Gly Val Val Tyr Pro Val Gln
165 170 175
Glu Glu Gln Phe Lys Gln Ser Leu Ser Thr Lys Tyr Pro Tyr Ile Asp
180 185 190
Tyr Trp Ala Ser Tyr Pro Asp Lys Asn Ser Asp Glu Met Lys Gly Leu
195 200 205
Phe Ser Asp Asp Gly Val Tyr Arg Thr Leu Asn Ala Ser Gly Asn Lys
210 215 220
Val Trp Leu Asp Tyr Ile Thr Lys Tyr Phe Thr Ala Asn
225 230 235
<210> 260
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsC
<400> 260
Met Gln Glu Lys Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Pro Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Ala Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Lys Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asp Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Ile Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Ala Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Ala Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 261
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsD
<400> 261
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn Arg Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Ile Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Thr Ser Ser Glu Ala Ala Ala Gly Lys Ser Thr Glu Ser Ala Asn
50 55 60
Ile Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ser Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Phe Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ser
165 170 175
Tyr Val Gly Gly Val Val Leu Val Ala Arg Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Met Leu Glu Gln Val Asn Ala Asn
195 200 205
Ile Leu Gly Val Ile Leu His Gly Val Asp Ser Ser Asp Ser Pro Ser
210 215 220
Tyr Tyr Tyr Tyr Gly Val Glu
225 230
<210> 262
<211> 254
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsB
<400> 262
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Asp Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Ser His Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Phe
245 250
<210> 263
<211> 199
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsE1
<400> 263
Met Glu Val Phe Glu Asp Gly Ser Ser Pro Glu Pro Glu Glu His Lys
1 5 10 15
Leu Val Glu Leu Lys Lys Phe Ser His Arg Glu Ile Ile Ile Lys Arg
20 25 30
Gly Ile Asp Ile Leu Gly Gly Leu Val Gly Ser Gly Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Thr Glu Gln
85 90 95
Tyr Leu Glu Leu Asn Pro Asp Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Arg His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Lys Arg Leu Pro Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Gln Val Pro Leu Phe Ser Asp Asn Gly Val Leu
180 185 190
Gln Ser Ala Lys Gly Val Ala
195
<210> 264
<211> 251
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsE2
<400> 264
Met Ala Pro Ile Asp Ser Glu Val Ile Asp Thr Lys Asp Asn Ser Lys
1 5 10 15
Val Leu Asn Asp Val Val Ala Ser Glu Ser Val Ile Val Pro Val Asn
20 25 30
Asn Lys Val Lys Lys Ser Lys Ala Ile Leu His Lys Ile Glu Arg Ala
35 40 45
Ser Tyr Phe Ser Ile Lys Arg Ile Phe Asp Ile Ile Cys Ser Leu Leu
50 55 60
Gly Ile Ile Ala Leu Ile Pro Val Ala Ile Val Thr Lys Ile Cys Tyr
65 70 75 80
Ile Ala Thr Gly Asp Lys Lys Ser Ile Phe Tyr Lys Gln Lys Arg Ile
85 90 95
Gly Lys Asn Gly Lys Pro Ile Tyr Ile Tyr Lys Phe Arg Ser Met Val
100 105 110
Trp Asn Ala Asp Glu Val Leu Lys Glu Leu Leu Lys Asp Pro Lys Tyr
115 120 125
Lys Lys Glu Trp Asp Leu Asn Gln Lys Phe Glu Asn Asp Pro Arg Ile
130 135 140
Thr Lys Met Gly Asn Ile Leu Arg Lys Thr Ser Leu Asp Glu Leu Pro
145 150 155 160
Gln Phe Ile Asn Val Ile Lys Gly Asp Met Ser Met Ile Gly Pro Arg
165 170 175
Pro Leu Val Glu Gly Glu Leu Asn Ala His Lys Gly Asn His Ala Ile
180 185 190
Tyr Glu Ser Val Arg Pro Gly Leu Ser Gly Trp Trp Ala Ala Asn Gly
195 200 205
Arg Ser Ala Thr Thr Tyr Glu Arg Arg Leu Glu Leu Glu Tyr Phe Tyr
210 215 220
Cys Lys Asn Cys Asn Leu Ile Leu Asp Ile Lys Cys Val Phe Leu Thr
225 230 235 240
Ile Ala Val Val Leu Phe Lys Thr Gly Ala Lys
245 250
<210> 265
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_lytR
<400> 265
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Ile His
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Ile Met Gln Asn Thr Lys Ala Phe Asn Ala Glu Gly Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Ile Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Lys Ile
245 250 255
Val Met Ser Tyr Asn Lys Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 266
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_epsL
<400> 266
Leu Glu Glu Lys Leu Glu Arg Lys Lys Lys Thr Lys Asn Asn Ile Trp
1 5 10 15
Val Ile Ile Ile Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly
20 25 30
Ala Tyr Ala Leu Arg Asp Ser Leu Ile Pro Thr Glu His Thr Lys Thr
35 40 45
Asn Ser Ser Asp Gln Pro Thr Lys Thr Ser Ala Ser Asn Gly Tyr Val
50 55 60
Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp
65 70 75 80
Asp Val Gly Val Pro Lys Trp Val Lys Val Pro Ser Glu Val Asn Leu
85 90 95
Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile
100 105 110
Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg
115 120 125
Met Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Thr Leu Ile Met
130 135 140
Asn Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln
145 150 155 160
Ile Asn Asn Ala Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Met Val
165 170 175
Gln Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp
180 185 190
Ser Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Glu Gln Ala
195 200 205
Tyr Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser
210 215 220
Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys
225 230 235 240
Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp
245 250 255
Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu
260 265 270
Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile
275 280 285
Val Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 267
<211> 148
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_nucleotide_sugar_dehydrogenase
<400> 267
Leu Leu Arg Ile Leu His Thr Ile Glu Ala Thr Gly Cys Leu Pro Lys
1 5 10 15
Asp Thr Lys Gln Leu Leu Ala Asn Tyr Asp Gln Val Ser Glu Lys Leu
20 25 30
Ile Glu Ala Val Val Glu Ser Asn Arg Thr Arg Lys Asp His Ile Ala
35 40 45
Asp Met Ile Ile Lys Arg Tyr Pro Lys Val Val Gly Ile Tyr Arg Leu
50 55 60
Thr Met Lys Ser Asn Ser Asp Asn Phe Arg Ser Ser Ser Ile Gln Gly
65 70 75 80
Ile Met Lys Arg Ile Lys Gly Lys Gly Ile Glu Val Val Val Tyr Glu
85 90 95
Pro Thr Leu Thr Asp Asp Gln Phe Tyr Asn Ser Arg Val Val His Asn
100 105 110
Leu Asp Glu Phe Lys Met Ile Ser Asp Val Ile Val Ser Asn Arg Tyr
115 120 125
Thr Lys Glu Leu Glu Asp Val Lys Thr Lys Val Tyr Thr Arg Asp Leu
130 135 140
Phe Gly Arg Asp
145
<210> 268
<211> 315
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_GT1
<400> 268
Met Lys Glu Lys Val Ser Val Ile Met Ser Ile Tyr Lys Glu Ser Glu
1 5 10 15
Asn Glu Leu Lys Ser Ser Ile Glu Ser Ile Leu Asn Gln Thr Tyr Ser
20 25 30
Asn Leu Glu Leu Ile Ile Val Val Asp Asn Pro Asp Glu Lys Trp Arg
35 40 45
Ile Asp Phe Leu Lys Ser Tyr Lys Asp Asp Arg Met Lys Ile Ile Ile
50 55 60
Asn Glu Lys Asn Ile Gly Leu Pro Lys Ser Leu Asn Lys Ala Leu Lys
65 70 75 80
Asn Ala Thr Gly Glu Tyr Ile Ala Arg Met Asp Ala Asp Asp Ile Ala
85 90 95
Leu Pro Ile Arg Phe Glu Lys Gln Tyr Glu Phe Leu Lys Glu Thr Gly
100 105 110
Tyr Asp Met Cys Gly Ser Asn Val Thr Cys Phe Ile Asp Asn Val Asp
115 120 125
Phe Lys Asp Ile Ile Phe Pro Arg Lys Ala Glu Asn Val Lys Lys Leu
130 135 140
Leu Phe Ile Lys Asn Cys Val Ser His Pro Thr Tyr Phe Val Arg Lys
145 150 155 160
Asn Val Tyr Glu Lys Leu Asn Gly Tyr Asn Asn Val Phe Ser Cys Glu
165 170 175
Asp Tyr Asp Phe Leu Leu Arg Ala Val Gly Asn Gly Ile Thr Ile Gly
180 185 190
Asn Val Gln Asp Ile Leu Leu Lys Tyr Arg Ile Ser Pro Gln Ser Ile
195 200 205
Ser Arg Lys Asn Ala Gly Lys Gln Glu Leu Ile Ala Lys Tyr Leu Arg
210 215 220
Lys Tyr Tyr Lys Asn Asn Pro Leu Gly Glu Val Thr Glu Thr Met Ile
225 230 235 240
Lys Glu Tyr Leu Asp Ser Asn Lys Phe Lys Lys Asn Leu Arg Ser Tyr
245 250 255
Asp Phe Tyr Trp Gly Met Lys Asn Thr Arg Ser Lys Tyr Lys Glu Asn
260 265 270
Lys Gly Ile Lys Tyr Tyr Tyr Tyr Thr Val Leu Leu Val Phe Asn Phe
275 280 285
Lys His Ser Leu Lys Glu Ile Tyr Arg Lys Leu Tyr Glu Met Thr Leu
290 295 300
Ser Gln Val Phe Lys Val Thr Asn Gly Glu Phe
305 310 315
<210> 269
<211> 184
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_acetyltransferase
<400> 269
Met Ile Ser Asp Arg Lys Lys Leu Met His Val Leu Asn Leu Glu Lys
1 5 10 15
Lys Ala Tyr Ile Glu Asn Lys Lys Asp Tyr Ile Lys Lys Val Leu Ser
20 25 30
Gly Asn Glu Lys Ile Asp Ile Trp Arg Phe Gln Lys His Leu Arg Lys
35 40 45
Ala Glu Tyr Tyr Tyr Asn Gln Lys Asn Lys Ile Phe Phe Pro Leu Tyr
50 55 60
Ala Tyr His Ile Tyr Arg Lys Asn Cys Ile Gly Lys Arg Met Gly Ile
65 70 75 80
Phe Ala Ser Ile Asn Thr Ile Asp Glu Gly Leu Ile Ile Tyr His Ser
85 90 95
Gly Glu Ile Val Val Ser Gly Leu Ala Lys Cys Gly Lys Asn Leu Lys
100 105 110
Leu His Gly Asn Asn Cys Ile Gly Asn Lys Gly Leu Asp Gly Tyr Ala
115 120 125
Pro Ile Ile Gly Asn Asn Cys Asp Val Gly Phe Gly Ala Val Ile Ile
130 135 140
Gly Asp Val Val Leu Gly Asp Asp Ile Lys Val Gly Ala Asn Ala Val
145 150 155 160
Val Asn Lys Asn Phe Lys Ser Gly Val Leu Val Gly Val Pro Ala Lys
165 170 175
Glu Ile Glu Asn Glu Arg Lys Ser
180
<210> 270
<211> 382
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_wzy
<400> 270
Met Met Lys Thr Lys Glu Lys Ile Phe Ile Ala Leu Cys Phe Phe Ala
1 5 10 15
Phe Ile Gln Pro Asn Phe Phe Val Lys Ile Asn Val Leu Asn Tyr Met
20 25 30
Ser Ile Gly Met Ala Leu Leu Cys Thr Leu Ile Leu Leu Leu Arg Ser
35 40 45
Phe Lys Lys Asn Val Lys Phe Asp Ile Thr Ile Phe Leu Leu Leu Leu
50 55 60
Trp Arg Leu Leu Ile Phe Ile Pro Thr Leu Asn His Asn Gly Glu Ile
65 70 75 80
Val Lys Trp Gly Tyr Gln Ser Val Val Phe Val Gly Leu Tyr Leu Leu
85 90 95
Ile Lys Asn Tyr Ile Cys Asp Lys Asp Ala Phe Arg Ile Val Tyr Lys
100 105 110
Met Ile Phe Ser Tyr Val Leu Ile Asn Ile Leu Leu Met Ile Ile Tyr
115 120 125
Pro Asn Gly Ile Phe Pro Gln Tyr Gly Ile Tyr Phe Leu Gly Ile Arg
130 135 140
Thr Arg Phe Thr Glu Phe Ser Ile Ala Leu Met Tyr Ile Ser Leu Ile
145 150 155 160
Tyr Tyr Asn Glu Tyr Ser Ser Lys Asn Lys Lys Asp Arg Lys Lys Leu
165 170 175
Ile Thr Ser Tyr Val Leu Ala Ile Ile Asn Ile Leu Ser Lys Trp Val
180 185 190
Ala Thr Gly Ile Val Val Ile Val Leu Thr Ile Leu Phe Tyr Leu Leu
195 200 205
Leu Arg Leu Leu Arg Asn Lys Lys Lys Leu Ser Leu Val Tyr Cys Phe
210 215 220
Gly Phe Leu Ile Leu Ile Ala Leu Ser Ile Asn Leu Ile Asn Gly Asn
225 230 235 240
Leu Leu Asn Leu Phe Ser Gly Leu Phe Glu Leu Leu Asn Lys Asp Ile
245 250 255
Thr Leu Thr Gly Arg Thr Ile Ile Trp Glu Asn Ala Ile Tyr Val Ala
260 265 270
Lys Asn Tyr Phe Leu Phe Gly Phe Gly Tyr Val Asn Asp Gly Asn Ile
275 280 285
Ile Ser Tyr His Asn Gly Leu Trp Gln Ala His Asn Thr Ile Leu Gln
290 295 300
Ser Met Cys Glu Cys Gly Ile Phe Gly Thr Leu Thr Leu Phe Met Met
305 310 315 320
Ile Phe Lys Gln Gly Phe Pro Asn Lys Ile Asp Asn Lys Lys Ile Asn
325 330 335
Ser Leu Asn Val Ala Ile Ile Phe Ser Phe Leu Ile Met Met Met Thr
340 345 350
Glu Ile Thr Tyr Tyr Tyr Pro Ile Phe Ile Phe Ile Ile Phe Leu Ile
355 360 365
Gly Asn Asn Tyr Lys Lys Leu Gly Glu Ile Lys Asp Asp Lys
370 375 380
<210> 271
<211> 361
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_GT2
<400> 271
Met Asn Lys Ile Lys Ile Leu Glu Ile Val Pro Asn Met Gln Gln Gly
1 5 10 15
Gly Leu Glu Asn Leu Ile Met Asn Ile Ile Arg Asn Leu Asp Met Gln
20 25 30
Lys Tyr Glu Ile His Phe Leu Tyr His Tyr Thr Gly Asn Tyr Tyr Phe
35 40 45
Asp Glu Glu Ile Gln Lys Ile Gly Cys Thr Ile His Lys Cys Ser Phe
50 55 60
Arg Glu Tyr Lys Asn Ile Phe Lys Tyr Ile Lys Phe Leu Lys Gly Phe
65 70 75 80
Phe Lys Lys Asn His Phe Asp Val Val His Ser His Met Leu Ser Thr
85 90 95
Ser Arg Leu Thr Leu Lys Tyr Ala Lys Gln Asn Gly Cys Lys Ile Leu
100 105 110
Ile Asn His Ser His Asn Ser Lys Thr Glu Lys Ser Leu Lys Gly Val
115 120 125
Ile Lys Arg Phe Met Ile Leu Asn Ala Ser Lys Tyr Ala Asn Glu Phe
130 135 140
Leu Ala Cys Ser Glu Glu Ala Gly Glu Phe Ala Tyr Gly Asn Arg Lys
145 150 155 160
Phe Thr Val Val Lys Asn Gly Ile Asp Leu Lys Lys Phe Ser Tyr Ser
165 170 175
Ala Glu Asn Arg Lys Gln Ile Arg Lys Glu Leu Asn Ile Gly Glu Lys
180 185 190
Asp Ile Ile Phe Gly Asn Val Gly Arg Leu Asn Val Gln Lys Asn Gln
195 200 205
Ser Phe Leu Ile Glu Cys Phe Ala Lys Ile Lys Asn Ser Asn Ala Lys
210 215 220
Leu Met Ile Ile Gly Phe Gly Glu Leu Lys Glu Glu Ile Val Ser Lys
225 230 235 240
Ile Asn Glu Ile Asn Ile Asn Glu Lys Val Ile Met Leu Glu Asn Val
245 250 255
Lys Ser Glu Tyr Tyr Tyr Ser Ala Met Asp Cys Phe Val Leu Pro Ser
260 265 270
Ile Phe Glu Gly Phe Pro Leu Thr Ala Val Glu Ala Gln Ala Asn Gly
275 280 285
Cys Pro Cys Ala Phe Ser Asp Lys Ile Thr Lys Asp Val Lys Leu Asn
290 295 300
Asn Asn Val Lys Phe Leu Asp Ile Asp Asn Ala Asp Glu Trp Ile Gln
305 310 315 320
Phe Phe Asn Asn Phe Asn Met Lys Arg Glu Glu Arg Tyr Ser Glu Lys
325 330 335
Leu Leu Asp Tyr Asp Thr Asp Lys Leu Ile Asn Val Ile Glu Lys Ile
340 345 350
Tyr Ser Lys Gly Ala His Asp Glu Asn
355 360
<210> 272
<211> 280
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_GT3
<400> 272
Val Asn Leu Val Glu Lys Lys Tyr Ser Val Leu Met Ser Val Tyr His
1 5 10 15
Lys Glu Asn Pro Thr Trp Phe Asp Glu Ser Ile Lys Ser Met Phe Glu
20 25 30
Gln Thr Ile Lys Pro Asn Glu Phe Val Leu Val Glu Asp Gly Pro Leu
35 40 45
Thr Lys Glu Leu Tyr Asp Val Val Glu Lys Tyr Lys Thr Lys Tyr Pro
50 55 60
Lys Glu Phe Lys Val Val Ala Ile Glu Lys Asn Val Gly Leu Gly Pro
65 70 75 80
Ala Leu Lys Lys Gly Val Glu Glu Cys Ser Asn Glu Tyr Ile Ala Arg
85 90 95
Met Asp Ser Asp Asp Tyr Ser Met Pro Lys Arg Ile Glu Lys Glu Phe
100 105 110
Glu Ile Phe Glu Lys Tyr Ser Asp Ile Gly Met Val Gly Thr Ser Val
115 120 125
Ser Glu Phe Ile Asp Ser Ile Asp Asn Val Val Cys Asn Val Ile Leu
130 135 140
Pro Glu Thr Asn Glu Asp Ile Ile Lys Phe Ser Lys Ser Arg Asn Pro
145 150 155 160
Phe Arg His Pro Ser Val Met Phe Lys Lys Ser Glu Val Val Asn Ala
165 170 175
Gly Asn Tyr Arg Glu Tyr Tyr Leu Cys Glu Asp Tyr Asp Met Trp Leu
180 185 190
Arg Met Ile Arg Asn Gly Cys Lys Cys Tyr Asn Ile Gln Asp Ile Tyr
195 200 205
Val Tyr Met Arg Ile Gly Glu Asp Phe Tyr Lys Arg Arg Gly Gly His
210 215 220
Lys Tyr Phe Lys Ser Ile Lys Lys Phe Lys Lys Glu Gln Leu Glu Asn
225 230 235 240
Gly Tyr Phe Thr Lys Phe Glu Tyr Leu Lys Ser Ile Val Pro His Ala
245 250 255
Ile Val Cys Tyr Met Pro Asn Phe Met Arg Asp Phe Val Tyr Arg Lys
260 265 270
Met Leu Arg Arg Gly Arg Lys Lys
275 280
<210> 273
<211> 471
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33142_wzx
<400> 273
Met Gln Ile Ala Lys Asn Tyr Leu Tyr Asn Ala Ile Tyr Gln Val Phe
1 5 10 15
Val Ile Ile Val Pro Leu Leu Thr Ile Pro Tyr Leu Ser Arg Ile Leu
20 25 30
Gly Pro Ser Gly Ile Gly Ile Asn Ser Tyr Thr Asn Ser Ile Val Gln
35 40 45
Tyr Phe Val Leu Phe Gly Ser Ile Gly Val Gly Leu Tyr Gly Asn Arg
50 55 60
Gln Ile Ala Phe Val Arg Asp Asn Gln Val Lys Met Ser Lys Val Phe
65 70 75 80
Tyr Glu Ile Phe Ile Leu Arg Leu Phe Thr Ile Cys Leu Ala Tyr Phe
85 90 95
Leu Phe Val Ala Phe Leu Ile Ile Asn Gly Gln Tyr His Ala Tyr Tyr
100 105 110
Leu Ser Gln Ser Ile Ala Ile Val Ala Ala Ala Phe Asp Ile Ser Trp
115 120 125
Phe Phe Met Gly Ile Glu Asn Phe Lys Val Thr Val Leu Arg Asn Phe
130 135 140
Ile Val Lys Leu Leu Ala Leu Phe Ser Ile Phe Leu Phe Val Lys Ser
145 150 155 160
Tyr Asn Asp Leu Asn Ile Tyr Ile Leu Ile Thr Val Leu Ser Thr Leu
165 170 175
Ile Gly Asn Leu Thr Phe Phe Pro Ser Leu His Arg Tyr Leu Val Lys
180 185 190
Val Asn Tyr Arg Glu Leu Arg Pro Ile Lys His Leu Lys Gln Ser Leu
195 200 205
Val Met Phe Ile Pro Gln Ile Ala Val Gln Ile Tyr Trp Val Leu Asn
210 215 220
Lys Thr Met Leu Gly Ser Leu Asp Ser Val Thr Ser Ser Gly Phe Phe
225 230 235 240
Asp Gln Ser Asp Lys Ile Val Lys Leu Val Leu Ala Ile Ala Thr Ala
245 250 255
Thr Gly Thr Val Met Leu Pro Arg Val Ala Asn Ala Phe Ala His Arg
260 265 270
Glu Tyr Ser Lys Ile Lys Glu Tyr Met Tyr Ala Gly Phe Ser Phe Val
275 280 285
Ser Ala Ile Ser Ile Pro Met Met Phe Gly Leu Ile Ala Ile Thr Pro
290 295 300
Lys Phe Val Pro Leu Phe Phe Thr Ser Gln Phe Ser Asp Val Ile Pro
305 310 315 320
Val Leu Met Ile Glu Ser Ile Ala Ile Ile Phe Ile Ala Trp Ser Asn
325 330 335
Ala Ile Gly Thr Gln Tyr Leu Leu Pro Thr Asn Gln Asn Lys Ser Tyr
340 345 350
Thr Val Ser Val Ile Ile Gly Ser Ile Val Asn Leu Met Leu Asn Ile
355 360 365
Pro Leu Ile Ile Tyr Leu Gly Thr Val Gly Ala Ser Ile Ala Thr Val
370 375 380
Ile Ser Glu Met Ser Val Thr Val Tyr Gln Leu Phe Ile Ile His Lys
385 390 395 400
Gln Leu Asn Leu His Thr Leu Phe Ser Asp Leu Ser Lys Tyr Leu Ile
405 410 415
Ala Gly Leu Val Met Phe Leu Ile Val Phe Lys Ile Ser Leu Leu Thr
420 425 430
Pro Thr Ser Trp Ile Phe Ile Leu Leu Glu Ile Thr Val Gly Ile Ile
435 440 445
Ile Tyr Val Val Leu Leu Ile Phe Leu Lys Ala Glu Ile Ile Asn Lys
450 455 460
Leu Lys Phe Ile Met His Lys
465 470
<210> 274
<211> 16476
<212> DNA
<213> Lactococcus lactis
<220>
<223> DSM 33183 eps gene cluster, complete sequence
<400> 274
atgaataatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataattga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt tttcggtagt gtaaaataag 360
ttttggaaca tcaaaaatat cacctacaat ggcgaaacaa gtgaacaatt attggctgaa 420
aaagttcaaa atcaagtatt ggcgactaac cctgatgttg ttttatatga agctccactt 480
tttaatgata accaatatag actactgggc tagttaccca gacaaaaatt ctgatgaaat 540
gaagggggct gttttctgat gatggagtat atagaacatt aaatgcttcg gggaataagg 600
tttggctaga ttatattact aaatatttta cagcaaacta attaagttat aaataacaat 660
tattaaatat tggagaagaa atgcaggaaa cacaggaaca gacgattgat ttaagaggga 720
tttttaaaat tattcgcaaa aggttaggtt taatattatt tagtgcttta atagtcacaa 780
tattagggag catctacaca ttttttatag cctccccagt ttacacagcc tcaactcaac 840
ttgtcgttaa actatcaaat tcggataatt cagcagccta cgctggacaa gtgaccggga 900
atattcaaat ggtgaacaca attaaccaag ttattgttag tccagtcatt ttagataaag 960
ttcaaagtaa tttaaatcta tctgatgact ctttccaaaa acaagttaca gcagcaaatc 1020
aaacagattc acaagttatt acgcttactg ttaaatattc taatccttac attgcacaaa 1080
agattgcaga cgagactgct aaaatattta gttcagacgc agcgaaacta ttgaatgtta 1140
ctaacgttaa tattctatcc aaagcaaaag ctcaaacaac accaattagt cctaaaccta 1200
aattgtattt agcgatatct gttatagtcg gactagtttt aggtttagcc attgctttat 1260
tgaaggaatt gtttgataac aaaattaata aagaagaaga tattgaagct ctggggctca 1320
cggttcttgg tgtaacaacc tatgatcaaa tgagtgattt taataagaat acaaataaaa 1380
atggcacgca atcgggaact aagtcaagtc cgcctagcga ccatgaagta aatagatcat 1440
caaaaaggaa taaaagatag gagttcagga tggctaaaaa taaaagaagc atagacaaca 1500
atcattatat tattaccagt gtcaatcctc aatcacctat ttccgaacaa tatcgtacga 1560
ttcgtacgac ccttgatttt aaaatggcgg atcaaggaat taaaagtttt ctagtagcat 1620
cttcagaagc agctgtaggt aaatcaaccg taagtgctaa tatagctgtt gcttttgcac 1680
aacaaggtaa aaaagtactt ttaattgatg gcgatcttcg taaaccgact gttaacatta 1740
cttttaaagt acaaaataga gtaggattaa ccaatatttt aatgcatcaa tcttcgattg 1800
aagatgccat acaagggaca agactttctg aaaatcttac aataattacc tctggtccaa 1860
ttccacctaa tccatcggaa ttattagcat ctagtgcaat gaagaatttg attgactctg 1920
tgtccgattt atttgatgtt gttttgattg atactccacc tctctctgca gttactgatg 1980
ctcaaatttt gagtagttat gtaggaggag cagttattgt tgtacatgcc tatgaaacaa 2040
aaaaagagag tttagcaaaa acaaaaaaaa tgcttgaaca agttaataca aatattttag 2100
gggttgtttt gcatggggta aactcttctg agtcaccatc gtattactac cacggagtag 2160
agtaattgga ataaacttga atcaaataaa agacagaaat ttgtagaaga ggagagcaaa 2220
tgattgatat tcattgccat attttaccgg gtatagatga tggagctaaa acttctggag 2280
atactttgac aatgctgaaa tcagcaattg atgaagggat aacaaccatc actgccactc 2340
ctcatcataa tcctcaattt aataatgaat caccgcttat tttgaagaaa gttaaggaag 2400
ttcaaaatat cattgacgag catcaattac caattgaagt tttaccagga caagaggtga 2460
gaatatatgg tgatttatta aaagaatttt ctgaaggaaa gttactgaca gcagcgggca 2520
cttcaagtta tatattgatt gaatttccat caaatcatgt gccagcttat gctaaagaac 2580
ttttttataa tattcaattg gagggacttc aacctatttt ggtccaccct gagcgtaata 2640
gcggaatcat tgagaaccct gatatattat ttgattttat tgaacaagga gtactaagtc 2700
agataacagc ttcaagtgtc actggtcatt ttggtaaaaa aatacaaaag ctgtcattta 2760
aaatgataga aaaccatctt acgcattttg ttgcatcaga tgcgcataat gtgacgtcac 2820
gtgcatttaa gatgaaggaa gcgtttgaaa ttattgaaga tagttatggt tctgatgtat 2880
cacgaatgtt tcaaaataat gcagagtcag tgattttaaa cgaaagtttt tatcaagaaa 2940
aaccaacaaa gatcaaaaca aagaaatttt taggattaaa tggagtaaat aatggaattt 3000
tttgaggatg cctcatcacc tgaatcggaa gagcctaagt tagtagaatt aaaaaatttt 3060
tcttatagag agctaattat aaaaagagca attgatatcc taggaggatt agcaggttca 3120
gttttatttc ttattgcggc tgcattgctc tatgtccctt acaaaatgag ctcggaaaaa 3180
gatcaagggc caatgttcta taaacaaaaa cgctatggga agaatggtaa aattttttat 3240
attttgaaat ttagaacaat gattcttaat accgagcagt atctagaact taatccggat 3300
gttaaagctg cttaccatgc caacggcaat aagctagaaa acgatccacg ggtaacgaag 3360
attggatcat ttataagacg acactcaatt gatgaactgc cacaatttat caatgttctt 3420
aaaggggata tggcattagt tggtccaaga ccaattctgc tttttgaagc gaaagaatat 3480
gggaaacgcc tcccttactt actcatgtgc aaaccaggaa tcactggtta ttggcaagtt 3540
cctctatttt cagataatgg agtacttcaa tcagcaaagg gggtggcatg aatggctaca 3600
atagattcag aagatataga cactaaagat aattctaaag ttttaaatga tgttgttgcg 3660
agtgaatctg ttatagtacc agttaacaac aaagtaaaaa aatcaaaagc aattttgcat 3720
aaaattgaaa gagcttcata ttttagcatt aaaagaatct tcgacataat ttgctcattg 3780
cttggcatta tagcattaat tccagtagca atagtaacta aaatatgtta catagcaaca 3840
ggagataaaa aatcaatatt ttataaacaa aagagaatcg gaaaaaatgg taaaccgata 3900
tatatatata aatttagaag tatggtatgg aatgcagatg aagtgttaaa agaactttta 3960
aaagacccta agtataaaaa agaatgggac ttaaatcaaa aatttgaaaa tgatccgaga 4020
ataacaaaaa tgggaaatat tttaagaaaa acatcattag atgaattgcc acaattcatc 4080
aatgtaatta aaggtgatat gtctatgata ggacctcgac ctttagttga aggagagctt 4140
gatgctcata aaggaaatca tgcaatatat gaaagtgttc gcccaggcat ctcggggtgg 4200
tgggccgcga acggaaggtc agctactact tatgaaagaa gacttgaatt agaatatttc 4260
tattgtaaaa attgtaattt aatattagat attaagtgtg tattcttaac aatagcagta 4320
gtgttattta aaacaggagc aaagtagtat caaactataa tgaacccgaa ttgctagttg 4380
attatttagc catgacttga tacccgatag aatatcttaa agtctctggt tccagtgatt 4440
tagctgattt taacagtaaa gaatacgcta aaagtatcat ctctaatttc aattgaaaac 4500
cttgaggcga acgactttta caacgctcag ctcctagatt tgtcaaaaaa gagaaaactc 4560
gctcaatcac ttttctacgt tttgaaaaat tagggaaaag gattttcttt tgcttcatgt 4620
tcttcctgac aggtgtcatt agatcaattc cttttaattc cagcctatca tgcagtgact 4680
gacctaaata tcccatatct ccaaggactg ttggtgtccc aaattgactc aacacttcct 4740
cggtcattga actatcagcc attgaagcag gagtaattgt gtagtctatg acatagcctg 4800
attcactgac taaagcatga catttacatc catagaagta ctgtcccttt gtagcattgt 4860
agccaacatt tgcataatct ccaagacctt tgcttctgaa attacgaata ggctgacaca 4920
aaggaatggg gaagctgtca ataatggata cactcattcc ttcaacctct ttaaagacga 4980
gtgcttggcg aatgacttgg atactcggta agagggcatt acaacggcgg acaaagcgag 5040
aatattctag gaaattagga aataaacttt gagcaagttg gtgcttagct ttaagcgttt 5100
cactaaaatg cagtacgccc cagaggtaac aagcgataac taagcaatct gatgttgcga 5160
gatggacgtt ctttcggttt tgaacctcaa ggggaaccct cgtttgataa agcgtctcaa 5220
tggttgtcag taaataaaca aaaacttttg gaagtgtgct attataagtc atataagtcg 5280
tgagctttct aatgcttagt gctttaagat taggatagca cgacttattt attttccaat 5340
gaaattaact agcaattcgg gtaatatata tatggaggta ccatgaatca aaaaaagagg 5400
cgtcattatc gtaagaaaaa acacacagta ctaaaagtta tttcaattat ttttgtatta 5460
gtaattatcg ctgttgcttc tatagcctac gccgcttata gaaatgttga atcaacattt 5520
tcaacatcgt atgaaaattt ccctaaaaca acaagtatcg acttaaaaaa gtctaaaaca 5580
ttcaccacac ttatcattgc aactggtaaa aataattcta aaaatacagc ttatgctact 5640
gttttagctt caacgaatgt aaagacaaat caaactactt tcatgaactt cccagttttt 5700
gcgacaatgc ctaatcaaaa aacaatcact gaagtttaca atacgaatgg agatgatgga 5760
attttccaga tggttaaaga cctattgaat gtgtccatta acaaagtaat tcatattgat 5820
gttaataaaa tgggatcact tgtacaggcc actggtggaa tcatcatgca aaatccaaag 5880
gcattcaatg ctgaagatta tgagtttaaa caaggaactg ttaatttaca aactgctgat 5940
caagtccaag cctatatgac acaaattgac gatactgatt tggatgcttc aatcacccgg 6000
attcaaaatg tctcaatgga actctacgga aatattcaaa aaattgctca tatgaaaaaa 6060
cttgaaagtt tcaattacta tcgagaaatt ctctatgctt tttcaaacac tgttaaaacc 6120
aatataagtt tcaatgatgc taaaaagatc gttatgagct acaatgaggc tctaaagaat 6180
accagcaagc tcaatctaca tacaacagat gaaaatggag ctaaggtcgt ttctcaaaca 6240
gaattagact cagtcaaaac cctttttgaa aaatctctaa aataaaagaa ccaagaggtt 6300
cttttatttt tatttcatca caatataatc cggtacggct cgatcgtctt gactagcaac 6360
aatagtttta ccattgacag atagttgact tgagccgcca ctatcaagta ataacatatt 6420
ttggagcttc aaatttgaca ctgatttcat tatattatca taacctgcat ttgtatcact 6480
caaaatagca tagagattat tatctttatc attcgcaata aaaatatgga tcttccagtc 6540
tactgagcca tcacttggct gaatcttacc atcacggata attgcagtac caaaatcata 6600
ggcctgttcc cccccatttt taataatagt tgaagcaggt gtacttgaat catagatttt 6660
gcatgaacca tctttgttaa taacaaaggc atattgaacc attgtacctg gactccagtc 6720
ttgaatcaac tttgcattgt tgatttggaa gccagctact tgtcctgtct gcatatcaaa 6780
tgccgaggca ttcataatca aggtattagg gtatttagct ataacttcgg acattttcat 6840
tcgttgatct gtacgattgg taactgtttt taggacttct ggattattaa ttcgataaat 6900
agtgatatta ttcgtagata aatcagtaaa tttatctaga tttacctctg agggaacctt 6960
aacccatttt ggtacaccaa catcatctac aagtgctata ctacccacag cagcttcttc 7020
acctttttgc tccacgtaac cgttagaagc cgaagttttg gtcggttgat ccgaactatt 7080
tgttttcgta tgttcagtag gaataagtga atctcttaag gcataagccc ctgctcctat 7140
aagggtaata aaaattaaga taggtataat tatcacccag atattatttt tcgtcttttt 7200
ttttcgttcc aatttctcct ccaaaattgt tgtttagtca cgaccaaata aatctctcgt 7260
gtaaactttg gttttaacat cctcaagttc tttagtataa cgatttgaaa caataacatc 7320
agaaatcatt ttaaactcat ctagattatg aactacccga gaattgtaga actgatcatc 7380
tgtgagtgtc ggctcataga caactacttc aattcccttg cctttaattc gtttcataat 7440
tccttgaatt gaacttgatc taaaattatc agaatttgac ttcattgtca agcggtagat 7500
gcccaccact ttgggatacc gtttaatgat catatctgcg atatgatctt ttctcgtcct 7560
atttgattca acaacagcct caattaattt ctcagaaact tggtcatagt tagccaaaag 7620
ctgtttggta tctttcggta agcaacctgt cgcttctatt gtgtgtaata ttcttaacaa 7680
aaaccaaaag aaaacctgta tttatgcagg ttttaagtaa cttttttttt cttaacatat 7740
tctagtaaat tttgataaat atcttgatat ttaaaaacaa tctcgactct tttatcttca 7800
aaaatatata tgcattctat aaatttaaaa agtatatctc tatcaatttt cttaaataat 7860
tctctattgt taaaaagttt tattaaatca acttgataat ttggaatatt atctaattct 7920
tccaattcct tatcaattat tttaattgtt tctctaattt ctaaaatttc attattatat 7980
gaacttgaat aatttttata atcttcgaaa gttattattt cttctttcca atctgtatat 8040
aaagattgtt ttgatgttat ccttaaatct tagagtcact attgtataat ttagacaaag 8100
gacaaaaaca tgaaaaaatg ctactcaaaa gaatttaaag aaacccttat cgccttctat 8160
cattctggtc aatccgtcac ccagctgtct aaagaatacg acgtggcccc tgcaacaatt 8220
tataaatgga tagacctcta ctctaaatct aatgaaagct ccgtctctaa agctgatttt 8280
ctagaattaa aaagacaact ggctaaagtt aaggaagaac gagacatctt aaaaaaagta 8340
ttgaccatat tcgccgagaa aaagaagtga gtgctgcgga tatggctcaa accatacaaa 8400
ctttagcact caatgtcaga ctaagctgtc aactccttga tattcctgaa tcaagttatt 8460
atgaacggat taaccgacat ccatctaaaa ctcaattaag gagacaatac ctgtcactca 8520
aaatttctca actcttcaat gctaaccgag aaatctatgg tgttcctaaa attcatcatc 8580
ttctacttaa acaaggggaa aaagtcgggt taaaactggt acagaagcta atgaagcaac 8640
ttcaactcaa gtctgtagtc attaagaaat ttaagcctgg atactcacta agtgatcaca 8700
tcaatcgaaa aaatctcata cagactgaac ctacaaagaa aaataaggtt tggtcaaccg 8760
acattactta tattcctact caacaaggat gggcttatct ctcaaccatt atggatcgtt 8820
atactaaaaa agtcattgct tgggatttgg gcaagcgaat gactgtagaa ttagtgcaaa 8880
gaactttaaa taaggccatt aaatcacaag actatccaga agctgttatt cttcattctg 8940
accaaggaag ccagtatacg agtctagagt atgaagagtt gcttaagtat tatgggatga 9000
ctcactcttt cagtcgaagg ggataccctt atcataatgc cagtcttgaa tcttggcatg 9060
gacatttaaa aagagagtgg gtgtatcaat ttaaatataa gaactttgaa gaagcctatc 9120
agagtatttt ctggtacatc gaaggctttt ataattcaaa acgaatccat caaagtttag 9180
ggtatcttac acctaatcaa tttgaaaagg taagtgctta aaataaatag attaaaattc 9240
tccgtttgtt actttaaaaa cttgacttaa cgtcatttca taaagttttc tataaatttc 9300
ttttaaagaa tgtttaaagt taaaaacaag taaaacagta taataataat actttattcc 9360
tttattttcc ttgtactttg atctggtatt tttcattccc caataaaaat cgtaacttct 9420
taaatttttt ttgaatttat ttgagtctaa gtattcttta atcattgttt ctgttacttc 9480
tcctaatggg ttatttttat aatattttct taaatacttt gcaatcaatt cttgctttcc 9540
tgcatttttt cttgatatac tttgtggact aattctatac ttcaataata tatcttgaac 9600
atttccgata gttataccat ttccaactgc cctcaataaa aaatcataat cttcacaaga 9660
aaaaacatta ttatatccat ttagtttttc ataaacattc tttctaacaa aataagtagg 9720
atgtgataca caatttttta taaaaagtag tttttttaca ttctctgctt ttcttggaaa 9780
aatgatatct ttgaaatcta cattatcaat aaaacatgta acatttgatc cacacatgtc 9840
atagcctgtt tcttttaaaa attcatattg tttttcaaaa cgtatcggaa gagctatatc 9900
atctgcatcc atcctagcta tatattctcc agttgcattt tttaatgctt tattcaaact 9960
tttaggaaga cctatatttt tttcattaat gataattttc attctatcat ctttatatga 10020
ttttaaaaag tcaattctcc acttttcatc tggattatcc acaactataa taagttctaa 10080
attagaataa gtttgattaa gtatagattc aattgatgat ttaagttcat tttcactttc 10140
tttatatatt gacattataa cactaacttt ttctttcatt ttctatctcc tttgctggta 10200
caccaactaa tactccactt ttaaagtttt tatttactac agcattcgcg ccaactttta 10260
tatcatcgcc taaaacaaca tcacctataa ttactgcccc aaagcctaca tcacaattat 10320
ttcctattat tggtgcatat ccgtctaatc ctttatttcc tatacaatta ttaccatgta 10380
gtttaaggtt cttcccacat ttagccaaac cactaactac aatttctcca gaatgataaa 10440
ttattagccc ttcatcaatt gtattaatac ttgcgaaaat acccattctt ttacctatac 10500
aattttttct gtaaatatgg tatgcatata atggaaaaaa gattttattt ttttgattat 10560
aataatattc ggcttttctt aaatgctttt gaaatctcca tatatctatc ttttcatttc 10620
ctgacaaaac tttttttata taatctttct tattttcaat atacgctttt ttttcaagat 10680
tcaaaacatg cattaatttt tttctatcac ttatcatctt taatttctcc taacttttta 10740
taattatttc ctattaaaaa aattataaaa ataaatattg gataataata tgttatttct 10800
gtcatcatca ttattaaaaa gctaaaaata attgcaacat ttaatgaatt tattttttta 10860
ttatcgattt tatttggaaa tccttgttta aaaatcatca taaacaatgt taaagttcca 10920
aatataccag actcacacat agattgtaaa atagtattat gtgcttgcca taatccatta 10980
tgatagctaa taatatttcc gtcattaaca tatccaaatc caaataaaaa ataatttttt 11040
gccacataaa ttgcattttc ccatattata gttctacccg tcaacgtaat atctttgttt 11100
aataattcaa aaaggccact aaatagattt aacaaatttc catttattaa attaatggac 11160
aaagcaatta aaattaaaaa tccaaaacaa taaactaaac ttagtttctt tttatttctc 11220
aataataaat aaaacaaaat agttaacaca ataacaacaa tacctgtggc aacccattta 11280
gataatatat ttataattgc taatacatac gatgtaatta gtttttttcg atctttttta 11340
ttctttgaag aatactcgtt gtaataaatt aaagaaatgt acataagagc gattgaaaat 11400
tccgtaaaac gagttcttat tcctaaaaaa taaataccat actgtggaaa tattccatta 11460
ggataaataa tcattaataa aatattaatt agaacataac taaaaatcat tttgtataca 11520
attctaaatg catccttatc gcaaatataa ttttttatta ataaatacaa gccaacaaat 11580
acaaccgatt gatagcccca tttaacaatt tcaccattat gattcaatgt tggtataaat 11640
atcagaagtc tccataatag taaaaggaaa atggtaatat caaatttaac atttttttta 11700
aatgacctta ataataaaat taacgtacat aataaggcca ttccaatact catataatta 11760
agcacattga tttttacaaa aaaattaggc tgaataaatg caaaaaaaca aagtgcaatg 11820
aaaatttttt ctttagtttt catcatgagc ccccttagaa tatatttttt ctattacatt 11880
tattaattta tctgtatcat aatctaatag tttttctgaa taacgttctt ctcttttcat 11940
attaaaatta ttgaaaaatt gtatccattc gtcagcgttg tcaatatcca agaatttaac 12000
attgttattt aattttacat ctttagttat tttgtctgaa aaagcacatg gacaaccatt 12060
cgcttgagct tcaactgctg taagaggaaa tccttcaaat atagaaggca aaacaaagca 12120
atccatagca ctataataat attcagattt aacattttca agcattataa ctttttcatt 12180
tatatttatt tcatttattt tagatactat ttcttctttt aattctccaa aacctattat 12240
cattaattta gcattactat tctttatttt tgcaaaacat tcaatcagaa aagactggtt 12300
tttttgcaca tttaatcttc ctacattgcc aaaaattata tctttttctc caatattcag 12360
ttctttcctt atttgttttc gattttcagc agaatatgaa aattttttta aatcaattcc 12420
attttttact actgtaaact ttctatttcc ataagcaaat tctccagctt cttctgaaca 12480
tgccaagaat tcatttgcat atttcgaagc attaagaatc ataaaacgtt ttattactcc 12540
ttttaaagat ttttctgttt tgctattatg tgaatgattt attaaaattt tgcagccatt 12600
ttgtttggca tattttaagg ttagtcttga tgtacttaac atgtgtgaat gaactacatc 12660
aaaatgattt tttttaaaaa aaccttttaa aaattttata tatttaaaaa tatttttata 12720
ctctctaaat gaacatttat gtattgtaca gccaattttc tgtatttctt catcaaaata 12780
ataatttcca gtataatgat ataaaaaatg aatttcatat ttttgcatat ctaaatttct 12840
tataatattc ataattaaat tttcaagacc accctgttgc atatttggaa caatttcgag 12900
tatctttatt ttattcattt ttttcttccc cttcttagca tttttctata aacaaaatct 12960
ctcataaaat taggcatata acatacaatt gcgtgtggta caattgattt caaatattca 13020
aatttagtaa aataaccatt ttctaattgt tctttcttaa atttcttaat gcttttaaaa 13080
tatttgtgtc caccacgtct tttataaaaa tcttcaccaa ttctcatata tacatatata 13140
tcttgaatat tataatattt acaaccattt ctaatcattc ttagccacat atcataatct 13200
tcgcacaagt aatattctct ataatttcca gcttttacaa cttcactttt cttaaacata 13260
acagatggat gtctaaaagg attcctactt tttgaaaatt taataatatc ctcattcgtt 13320
tctggtaata taacattaca aacaacatta tctattgaat caataaattc agatatactt 13380
gttccgacca ttccaatatc tgaatatttc tcaaatattt caaattcctt ttcaattctt 13440
tttggcatag aatagtcatc tgagtccatt ctggcaatat actcatttga gcattcttca 13500
actccctttt tcaatgcagg acctaagcca acattttttt ctatagcaac aactttaaac 13560
tcttttggat attttgtttt atatttttca acaacgtcat atagttcctt ggttagtgga 13620
ccatcctcca ctaaaacgaa ctcatttggt ttaatagttt gttcaaacat gctttttata 13680
ctttcatcaa accatgtagg attttcttta tgataaacag acattaaaac gctatacttt 13740
ttttccacaa gattcacttc tccttaaaaa atcatttcac tatacaatgc aatgataaat 13800
actcaaactg actactattg tataatctat actatatttt tataacccga attgctagtt 13860
aatttcattg gaaaataaat aagtcgtgct atcctaatct taaaccacta agcattagaa 13920
agcgcacgac ttatatgact tataatagca cacttccaaa agtttttgtt tatttactga 13980
caaccattga gacgctttat caaacgaggg ttccccttga ggttcaaaac cgaaagaacg 14040
tccatctcgc aacatcagat tgcttagtta tcgcttgtta cctatggggc gtactgcatt 14100
ttagtgaaac gcttaaagct aagcaccaac ttgctcaaag tttatttcct aatttcctag 14160
aatattctcg ctttgtccgc cgttgtaatg ccctcttacc gagtatccaa gtcattcgcc 14220
aagcactcgt ctttaaagag gttgaaggaa tgagtgtatc cattattgac agcttcccca 14280
ttcctttgtg tcagcctatt cgtaatttca gaagcaaagg tcttggagat tatgcaaatg 14340
ttggctacaa tgctacaaag ggacagtact tctatggatg taaatgtcat gctttagtca 14400
gtgaatcagg ctatgtcata gactacacaa ttactcctgc ttcaatggct gatagttcaa 14460
tgaccgagga agtgttgagt caatttggga caccaacagt ccttggagat atgggatatt 14520
taggtcagtc actgcatgat aggctggaat taaaaggaat tgatctaatg acacctgtca 14580
ggaagaacat gaagcaaaag aaaatccttt tccctaattt ttcaaaacgt agaaaagtga 14640
ttgagcgagt tttctctttt ttgacaaatc taggagctga gcgttgtaaa agtcgttcgc 14700
ctcaaggttt tcaattgaaa ttagagatga tacttttagc gtattcttta ctgttaaaat 14760
cagctaaatc actggaacca gagactttaa gatattctat cgggtatcaa gtcatggcta 14820
aataatcaac tagaaattcg ggttttgaat agtaagcagt aaataactga ctgtattgct 14880
tctaaccgat taagtggcac ggtttagaac cgaaaagaca tattgtcctg tgctacaatc 14940
aagtcgacca aaacaaattg aaggtgagcg tcaaatatgc ccaataccat tgactcaccg 15000
ttattttcaa taaggcaggt acctaaatcc atacctctat ttatgcataa taaactttag 15060
cttattaatt atttctgcct ttaaaaatat taataaaaca acataaataa ttatgcccac 15120
agtaatttcc aacagaatga atatccaaga tgtcggtgtt aacaaactaa ttttaaagac 15180
aattagaaac atcactaatc ctgcaattaa atacttagat aaatccgaaa acagtgtatg 15240
caaattaagc tgtttatgaa ttataaaaag ttgatacaca gttacagaca tttcagaaat 15300
tacagttgca attgatgcac caacagtacc tagatatata atcagtggaa tatttaacat 15360
taaattgact atcgatccaa tgatcaccga cactgtatat gacttatttt gattagttgg 15420
taaaagatat tgagtaccta ttgcgttgct ccaagctata aaaataattg cgattgactc 15480
gatcattaac acaggaataa catcactaaa ttgagatgta aaaaaaagtg gcacgaattt 15540
aggagtaata gctatcagac caaacatcat aggaatcgaa attgccgaca caaaagaaaa 15600
acctgcgtac atgtattcct taattttact atactctcta tgtgcaaagg catttgcaac 15660
acgtggcaac atgacagtac ctgttgcagt agcaatagcc aaaaccagtt taactatttt 15720
atcagactga tcaaaaaagc cggagctcgt gacagaatcc aatgaaccta acattgtttt 15780
attcaaaacc caataaattt ggacagcaat ttgtgggata aacataacta aagattgctt 15840
taaatgcttt attggcctta attcacgata gttaaccttt acgagatatc tgtgtaaact 15900
tgggaaaaaa gttaaattac caattaatgt agataaaact gttatcaata tatatatatt 15960
caaatcattg taagatttga caaataggaa aatactgaat agagcaagta acttaactat 16020
aaaatttctt aatacagtta ctttaaaatt ttcaattccc ataaaaaacc aagagatatc 16080
aaatgcagct gcaactatag caatggattg agacaaatag tatgcatgat actgaccatt 16140
aatgattaaa aaagcaacga acaaaaaata tgctaaacat attgtaaata gtcttaaaat 16200
aaatatttca taaaagactt tagacatttt gacctgatta tccctaacaa aggcaatctg 16260
acgattccca tacaaaccga ctcctatact accaaataaa acaaaatact gaacaataga 16320
attggtatat gagttaattc caatacctga agggcccaaa attcttgaca aataaggaat 16380
ggtaagtaat ggcacaatta ttataaagac ctgatatatt gcattataaa gataattttt 16440
tgcgatttgc attaataacc ctcccgaatt aaacaa 16476
<210> 275
<211> 105
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsR
<400> 275
Met Asn Asn Leu Phe Tyr His Arg Leu Lys Glu Leu Val Glu Ser Ser
1 5 10 15
Gly Lys Ser Ala Asn Gln Ile Glu Arg Glu Leu Gly Tyr Pro Arg Asn
20 25 30
Ser Leu Asn Asn Tyr Lys Leu Gly Gly Glu Pro Ser Gly Thr Arg Leu
35 40 45
Ile Gly Leu Ser Glu Tyr Phe Asn Val Ser Pro Lys Tyr Leu Met Gly
50 55 60
Ile Ile Asp Glu Pro Asn Asp Ser Ser Ala Ile Asn Leu Phe Lys Thr
65 70 75 80
Leu Thr Gln Glu Glu Lys Lys Glu Met Phe Ile Ile Cys Gln Lys Trp
85 90 95
Leu Phe Leu Glu Tyr Gln Ile Glu Leu
100 105
<210> 276
<211> 259
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsC
<400> 276
Met Gln Glu Thr Gln Glu Gln Thr Ile Asp Leu Arg Gly Ile Phe Lys
1 5 10 15
Ile Ile Arg Lys Arg Leu Gly Leu Ile Leu Phe Ser Ala Leu Ile Val
20 25 30
Thr Ile Leu Gly Ser Ile Tyr Thr Phe Phe Ile Ala Ser Pro Val Tyr
35 40 45
Thr Ala Ser Thr Gln Leu Val Val Lys Leu Ser Asn Ser Asp Asn Ser
50 55 60
Ala Ala Tyr Ala Gly Gln Val Thr Gly Asn Ile Gln Met Val Asn Thr
65 70 75 80
Ile Asn Gln Val Ile Val Ser Pro Val Ile Leu Asp Lys Val Gln Ser
85 90 95
Asn Leu Asn Leu Ser Asp Asp Ser Phe Gln Lys Gln Val Thr Ala Ala
100 105 110
Asn Gln Thr Asp Ser Gln Val Ile Thr Leu Thr Val Lys Tyr Ser Asn
115 120 125
Pro Tyr Ile Ala Gln Lys Ile Ala Asp Glu Thr Ala Lys Ile Phe Ser
130 135 140
Ser Asp Ala Ala Lys Leu Leu Asn Val Thr Asn Val Asn Ile Leu Ser
145 150 155 160
Lys Ala Lys Ala Gln Thr Thr Pro Ile Ser Pro Lys Pro Lys Leu Tyr
165 170 175
Leu Ala Ile Ser Val Ile Val Gly Leu Val Leu Gly Leu Ala Ile Ala
180 185 190
Leu Leu Lys Glu Leu Phe Asp Asn Lys Ile Asn Lys Glu Glu Asp Ile
195 200 205
Glu Ala Leu Gly Leu Thr Val Leu Gly Val Thr Thr Tyr Asp Gln Met
210 215 220
Ser Asp Phe Asn Lys Asn Thr Asn Lys Asn Gly Thr Gln Ser Gly Thr
225 230 235 240
Lys Ser Ser Pro Pro Ser Asp His Glu Val Asn Arg Ser Ser Lys Arg
245 250 255
Asn Lys Arg
<210> 277
<211> 231
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsD
<400> 277
Met Ala Lys Asn Lys Arg Ser Ile Asp Asn Asn His Tyr Ile Ile Thr
1 5 10 15
Ser Val Asn Pro Gln Ser Pro Ile Ser Glu Gln Tyr Arg Thr Ile Arg
20 25 30
Thr Thr Leu Asp Phe Lys Met Ala Asp Gln Gly Ile Lys Ser Phe Leu
35 40 45
Val Ala Ser Ser Glu Ala Ala Val Gly Lys Ser Thr Val Ser Ala Asn
50 55 60
Ile Ala Val Ala Phe Ala Gln Gln Gly Lys Lys Val Leu Leu Ile Asp
65 70 75 80
Gly Asp Leu Arg Lys Pro Thr Val Asn Ile Thr Phe Lys Val Gln Asn
85 90 95
Arg Val Gly Leu Thr Asn Ile Leu Met His Gln Ser Ser Ile Glu Asp
100 105 110
Ala Ile Gln Gly Thr Arg Leu Ser Glu Asn Leu Thr Ile Ile Thr Ser
115 120 125
Gly Pro Ile Pro Pro Asn Pro Ser Glu Leu Leu Ala Ser Ser Ala Met
130 135 140
Lys Asn Leu Ile Asp Ser Val Ser Asp Leu Phe Asp Val Val Leu Ile
145 150 155 160
Asp Thr Pro Pro Leu Ser Ala Val Thr Asp Ala Gln Ile Leu Ser Ser
165 170 175
Tyr Val Gly Gly Ala Val Ile Val Val His Ala Tyr Glu Thr Lys Lys
180 185 190
Glu Ser Leu Ala Lys Thr Lys Lys Met Leu Glu Gln Val Asn Thr Asn
195 200 205
Ile Leu Gly Val Val Leu His Gly Val Asn Ser Ser Glu Ser Pro Ser
210 215 220
Tyr Tyr Tyr His Gly Val Glu
225 230
<210> 278
<211> 261
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsB
<400> 278
Met Ile Asp Ile His Cys His Ile Leu Pro Gly Ile Asp Asp Gly Ala
1 5 10 15
Lys Thr Ser Gly Asp Thr Leu Thr Met Leu Lys Ser Ala Ile Asp Glu
20 25 30
Gly Ile Thr Thr Ile Thr Ala Thr Pro His His Asn Pro Gln Phe Asn
35 40 45
Asn Glu Ser Pro Leu Ile Leu Lys Lys Val Lys Glu Val Gln Asn Ile
50 55 60
Ile Asp Glu His Gln Leu Pro Ile Glu Val Leu Pro Gly Gln Glu Val
65 70 75 80
Arg Ile Tyr Gly Asp Leu Leu Lys Glu Phe Ser Glu Gly Lys Leu Leu
85 90 95
Thr Ala Ala Gly Thr Ser Ser Tyr Ile Leu Ile Glu Phe Pro Ser Asn
100 105 110
His Val Pro Ala Tyr Ala Lys Glu Leu Phe Tyr Asn Ile Gln Leu Glu
115 120 125
Gly Leu Gln Pro Ile Leu Val His Pro Glu Arg Asn Ser Gly Ile Ile
130 135 140
Glu Asn Pro Asp Ile Leu Phe Asp Phe Ile Glu Gln Gly Val Leu Ser
145 150 155 160
Gln Ile Thr Ala Ser Ser Val Thr Gly His Phe Gly Lys Lys Ile Gln
165 170 175
Lys Leu Ser Phe Lys Met Ile Glu Asn His Leu Thr His Phe Val Ala
180 185 190
Ser Asp Ala His Asn Val Thr Ser Arg Ala Phe Lys Met Lys Glu Ala
195 200 205
Phe Glu Ile Ile Glu Asp Ser Tyr Gly Ser Asp Val Ser Arg Met Phe
210 215 220
Gln Asn Asn Ala Glu Ser Val Ile Leu Asn Glu Ser Phe Tyr Gln Glu
225 230 235 240
Lys Pro Thr Lys Ile Lys Thr Lys Lys Phe Leu Gly Leu Asn Gly Val
245 250 255
Asn Asn Gly Ile Phe
260
<210> 279
<211> 199
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsE1
<400> 279
Met Glu Phe Phe Glu Asp Ala Ser Ser Pro Glu Ser Glu Glu Pro Lys
1 5 10 15
Leu Val Glu Leu Lys Asn Phe Ser Tyr Arg Glu Leu Ile Ile Lys Arg
20 25 30
Ala Ile Asp Ile Leu Gly Gly Leu Ala Gly Ser Val Leu Phe Leu Ile
35 40 45
Ala Ala Ala Leu Leu Tyr Val Pro Tyr Lys Met Ser Ser Glu Lys Asp
50 55 60
Gln Gly Pro Met Phe Tyr Lys Gln Lys Arg Tyr Gly Lys Asn Gly Lys
65 70 75 80
Ile Phe Tyr Ile Leu Lys Phe Arg Thr Met Ile Leu Asn Thr Glu Gln
85 90 95
Tyr Leu Glu Leu Asn Pro Asp Val Lys Ala Ala Tyr His Ala Asn Gly
100 105 110
Asn Lys Leu Glu Asn Asp Pro Arg Val Thr Lys Ile Gly Ser Phe Ile
115 120 125
Arg Arg His Ser Ile Asp Glu Leu Pro Gln Phe Ile Asn Val Leu Lys
130 135 140
Gly Asp Met Ala Leu Val Gly Pro Arg Pro Ile Leu Leu Phe Glu Ala
145 150 155 160
Lys Glu Tyr Gly Lys Arg Leu Pro Tyr Leu Leu Met Cys Lys Pro Gly
165 170 175
Ile Thr Gly Tyr Trp Gln Val Pro Leu Phe Ser Asp Asn Gly Val Leu
180 185 190
Gln Ser Ala Lys Gly Val Ala
195
<210> 280
<211> 251
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsE2
<400> 280
Met Ala Thr Ile Asp Ser Glu Asp Ile Asp Thr Lys Asp Asn Ser Lys
1 5 10 15
Val Leu Asn Asp Val Val Ala Ser Glu Ser Val Ile Val Pro Val Asn
20 25 30
Asn Lys Val Lys Lys Ser Lys Ala Ile Leu His Lys Ile Glu Arg Ala
35 40 45
Ser Tyr Phe Ser Ile Lys Arg Ile Phe Asp Ile Ile Cys Ser Leu Leu
50 55 60
Gly Ile Ile Ala Leu Ile Pro Val Ala Ile Val Thr Lys Ile Cys Tyr
65 70 75 80
Ile Ala Thr Gly Asp Lys Lys Ser Ile Phe Tyr Lys Gln Lys Arg Ile
85 90 95
Gly Lys Asn Gly Lys Pro Ile Tyr Ile Tyr Lys Phe Arg Ser Met Val
100 105 110
Trp Asn Ala Asp Glu Val Leu Lys Glu Leu Leu Lys Asp Pro Lys Tyr
115 120 125
Lys Lys Glu Trp Asp Leu Asn Gln Lys Phe Glu Asn Asp Pro Arg Ile
130 135 140
Thr Lys Met Gly Asn Ile Leu Arg Lys Thr Ser Leu Asp Glu Leu Pro
145 150 155 160
Gln Phe Ile Asn Val Ile Lys Gly Asp Met Ser Met Ile Gly Pro Arg
165 170 175
Pro Leu Val Glu Gly Glu Leu Asp Ala His Lys Gly Asn His Ala Ile
180 185 190
Tyr Glu Ser Val Arg Pro Gly Ile Ser Gly Trp Trp Ala Ala Asn Gly
195 200 205
Arg Ser Ala Thr Thr Tyr Glu Arg Arg Leu Glu Leu Glu Tyr Phe Tyr
210 215 220
Cys Lys Asn Cys Asn Leu Ile Leu Asp Ile Lys Cys Val Phe Leu Thr
225 230 235 240
Ile Ala Val Val Leu Phe Lys Thr Gly Ala Lys
245 250
<210> 281
<211> 300
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_lytR
<400> 281
Met Asn Gln Lys Lys Arg Arg His Tyr Arg Lys Lys Lys His Thr Val
1 5 10 15
Leu Lys Val Ile Ser Ile Ile Phe Val Leu Val Ile Ile Ala Val Ala
20 25 30
Ser Ile Ala Tyr Ala Ala Tyr Arg Asn Val Glu Ser Thr Phe Ser Thr
35 40 45
Ser Tyr Glu Asn Phe Pro Lys Thr Thr Ser Ile Asp Leu Lys Lys Ser
50 55 60
Lys Thr Phe Thr Thr Leu Ile Ile Ala Thr Gly Lys Asn Asn Ser Lys
65 70 75 80
Asn Thr Ala Tyr Ala Thr Val Leu Ala Ser Thr Asn Val Lys Thr Asn
85 90 95
Gln Thr Thr Phe Met Asn Phe Pro Val Phe Ala Thr Met Pro Asn Gln
100 105 110
Lys Thr Ile Thr Glu Val Tyr Asn Thr Asn Gly Asp Asp Gly Ile Phe
115 120 125
Gln Met Val Lys Asp Leu Leu Asn Val Ser Ile Asn Lys Val Ile His
130 135 140
Ile Asp Val Asn Lys Met Gly Ser Leu Val Gln Ala Thr Gly Gly Ile
145 150 155 160
Ile Met Gln Asn Pro Lys Ala Phe Asn Ala Glu Asp Tyr Glu Phe Lys
165 170 175
Gln Gly Thr Val Asn Leu Gln Thr Ala Asp Gln Val Gln Ala Tyr Met
180 185 190
Thr Gln Ile Asp Asp Thr Asp Leu Asp Ala Ser Ile Thr Arg Ile Gln
195 200 205
Asn Val Ser Met Glu Leu Tyr Gly Asn Ile Gln Lys Ile Ala His Met
210 215 220
Lys Lys Leu Glu Ser Phe Asn Tyr Tyr Arg Glu Ile Leu Tyr Ala Phe
225 230 235 240
Ser Asn Thr Val Lys Thr Asn Ile Ser Phe Asn Asp Ala Lys Lys Ile
245 250 255
Val Met Ser Tyr Asn Glu Ala Leu Lys Asn Thr Ser Lys Leu Asn Leu
260 265 270
His Thr Thr Asp Glu Asn Gly Ala Lys Val Val Ser Gln Thr Glu Leu
275 280 285
Asp Ser Val Lys Thr Leu Phe Glu Lys Ser Leu Lys
290 295 300
<210> 282
<211> 304
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_epsL
<400> 282
Leu Glu Glu Lys Leu Glu Arg Lys Lys Lys Thr Lys Asn Asn Ile Trp
1 5 10 15
Val Ile Ile Ile Pro Ile Leu Ile Phe Ile Thr Leu Ile Gly Ala Gly
20 25 30
Ala Tyr Ala Leu Arg Asp Ser Leu Ile Pro Thr Glu His Thr Lys Thr
35 40 45
Asn Ser Ser Asp Gln Pro Thr Lys Thr Ser Ala Ser Asn Gly Tyr Val
50 55 60
Glu Gln Lys Gly Glu Glu Ala Ala Val Gly Ser Ile Ala Leu Val Asp
65 70 75 80
Asp Val Gly Val Pro Lys Trp Val Lys Val Pro Ser Glu Val Asn Leu
85 90 95
Asp Lys Phe Thr Asp Leu Ser Thr Asn Asn Ile Thr Ile Tyr Arg Ile
100 105 110
Asn Asn Pro Glu Val Leu Lys Thr Val Thr Asn Arg Thr Asp Gln Arg
115 120 125
Met Lys Met Ser Glu Val Ile Ala Lys Tyr Pro Asn Thr Leu Ile Met
130 135 140
Asn Ala Ser Ala Phe Asp Met Gln Thr Gly Gln Val Ala Gly Phe Gln
145 150 155 160
Ile Asn Asn Ala Lys Leu Ile Gln Asp Trp Ser Pro Gly Thr Met Val
165 170 175
Gln Tyr Ala Phe Val Ile Asn Lys Asp Gly Ser Cys Lys Ile Tyr Asp
180 185 190
Ser Ser Thr Pro Ala Ser Thr Ile Ile Lys Asn Gly Gly Glu Gln Ala
195 200 205
Tyr Asp Phe Gly Thr Ala Ile Ile Arg Asp Gly Lys Ile Gln Pro Ser
210 215 220
Asp Gly Ser Val Asp Trp Lys Ile His Ile Phe Ile Ala Asn Asp Lys
225 230 235 240
Asp Asn Asn Leu Tyr Ala Ile Leu Ser Asp Thr Asn Ala Gly Tyr Asp
245 250 255
Asn Ile Met Lys Ser Val Ser Asn Leu Lys Leu Gln Asn Met Leu Leu
260 265 270
Leu Asp Ser Gly Gly Ser Ser Gln Leu Ser Val Asn Gly Lys Thr Ile
275 280 285
Val Ala Ser Gln Asp Asp Arg Ala Val Pro Asp Tyr Ile Val Met Lys
290 295 300
<210> 283
<211> 148
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_nucleotide_sugar_dehydrogenase
<400> 283
Leu Leu Arg Ile Leu His Thr Ile Glu Ala Thr Gly Cys Leu Pro Lys
1 5 10 15
Asp Thr Lys Gln Leu Leu Ala Asn Tyr Asp Gln Val Ser Glu Lys Leu
20 25 30
Ile Glu Ala Val Val Glu Ser Asn Arg Thr Arg Lys Asp His Ile Ala
35 40 45
Asp Met Ile Ile Lys Arg Tyr Pro Lys Val Val Gly Ile Tyr Arg Leu
50 55 60
Thr Met Lys Ser Asn Ser Asp Asn Phe Arg Ser Ser Ser Ile Gln Gly
65 70 75 80
Ile Met Lys Arg Ile Lys Gly Lys Gly Ile Glu Val Val Val Tyr Glu
85 90 95
Pro Thr Leu Thr Asp Asp Gln Phe Tyr Asn Ser Arg Val Val His Asn
100 105 110
Leu Asp Glu Phe Lys Met Ile Ser Asp Val Ile Val Ser Asn Arg Tyr
115 120 125
Thr Lys Glu Leu Glu Asp Val Lys Thr Lys Val Tyr Thr Arg Asp Leu
130 135 140
Phe Gly Arg Asp
145
<210> 284
<211> 315
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_GT1
<400> 284
Met Lys Glu Lys Val Ser Val Ile Met Ser Ile Tyr Lys Glu Ser Glu
1 5 10 15
Asn Glu Leu Lys Ser Ser Ile Glu Ser Ile Leu Asn Gln Thr Tyr Ser
20 25 30
Asn Leu Glu Leu Ile Ile Val Val Asp Asn Pro Asp Glu Lys Trp Arg
35 40 45
Ile Asp Phe Leu Lys Ser Tyr Lys Asp Asp Arg Met Lys Ile Ile Ile
50 55 60
Asn Glu Lys Asn Ile Gly Leu Pro Lys Ser Leu Asn Lys Ala Leu Lys
65 70 75 80
Asn Ala Thr Gly Glu Tyr Ile Ala Arg Met Asp Ala Asp Asp Ile Ala
85 90 95
Leu Pro Ile Arg Phe Glu Lys Gln Tyr Glu Phe Leu Lys Glu Thr Gly
100 105 110
Tyr Asp Met Cys Gly Ser Asn Val Thr Cys Phe Ile Asp Asn Val Asp
115 120 125
Phe Lys Asp Ile Ile Phe Pro Arg Lys Ala Glu Asn Val Lys Lys Leu
130 135 140
Leu Phe Ile Lys Asn Cys Val Ser His Pro Thr Tyr Phe Val Arg Lys
145 150 155 160
Asn Val Tyr Glu Lys Leu Asn Gly Tyr Asn Asn Val Phe Ser Cys Glu
165 170 175
Asp Tyr Asp Phe Leu Leu Arg Ala Val Gly Asn Gly Ile Thr Ile Gly
180 185 190
Asn Val Gln Asp Ile Leu Leu Lys Tyr Arg Ile Ser Pro Gln Ser Ile
195 200 205
Ser Arg Lys Asn Ala Gly Lys Gln Glu Leu Ile Ala Lys Tyr Leu Arg
210 215 220
Lys Tyr Tyr Lys Asn Asn Pro Leu Gly Glu Val Thr Glu Thr Met Ile
225 230 235 240
Lys Glu Tyr Leu Asp Ser Asn Lys Phe Lys Lys Asn Leu Arg Ser Tyr
245 250 255
Asp Phe Tyr Trp Gly Met Lys Asn Thr Arg Ser Lys Tyr Lys Glu Asn
260 265 270
Lys Gly Ile Lys Tyr Tyr Tyr Tyr Thr Val Leu Leu Val Phe Asn Phe
275 280 285
Lys His Ser Leu Lys Glu Ile Tyr Arg Lys Leu Tyr Glu Met Thr Leu
290 295 300
Ser Gln Val Phe Lys Val Thr Asn Gly Glu Phe
305 310 315
<210> 285
<211> 184
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_acetyltransferase
<400> 285
Met Ile Ser Asp Arg Lys Lys Leu Met His Val Leu Asn Leu Glu Lys
1 5 10 15
Lys Ala Tyr Ile Glu Asn Lys Lys Asp Tyr Ile Lys Lys Val Leu Ser
20 25 30
Gly Asn Glu Lys Ile Asp Ile Trp Arg Phe Gln Lys His Leu Arg Lys
35 40 45
Ala Glu Tyr Tyr Tyr Asn Gln Lys Asn Lys Ile Phe Phe Pro Leu Tyr
50 55 60
Ala Tyr His Ile Tyr Arg Lys Asn Cys Ile Gly Lys Arg Met Gly Ile
65 70 75 80
Phe Ala Ser Ile Asn Thr Ile Asp Glu Gly Leu Ile Ile Tyr His Ser
85 90 95
Gly Glu Ile Val Val Ser Gly Leu Ala Lys Cys Gly Lys Asn Leu Lys
100 105 110
Leu His Gly Asn Asn Cys Ile Gly Asn Lys Gly Leu Asp Gly Tyr Ala
115 120 125
Pro Ile Ile Gly Asn Asn Cys Asp Val Gly Phe Gly Ala Val Ile Ile
130 135 140
Gly Asp Val Val Leu Gly Asp Asp Ile Lys Val Gly Ala Asn Ala Val
145 150 155 160
Val Asn Lys Asn Phe Lys Ser Gly Val Leu Val Gly Val Pro Ala Lys
165 170 175
Glu Ile Glu Asn Glu Arg Lys Ser
180
<210> 286
<211> 379
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_wzy
<400> 286
Met Met Lys Thr Lys Glu Lys Ile Phe Ile Ala Leu Cys Phe Phe Ala
1 5 10 15
Phe Ile Gln Pro Asn Phe Phe Val Lys Ile Asn Val Leu Asn Tyr Met
20 25 30
Ser Ile Gly Met Ala Leu Leu Cys Thr Leu Ile Leu Leu Leu Arg Ser
35 40 45
Phe Lys Lys Asn Val Lys Phe Asp Ile Thr Ile Phe Leu Leu Leu Leu
50 55 60
Trp Arg Leu Leu Ile Phe Ile Pro Thr Leu Asn His Asn Gly Glu Ile
65 70 75 80
Val Lys Trp Gly Tyr Gln Ser Val Val Phe Val Gly Leu Tyr Leu Leu
85 90 95
Ile Lys Asn Tyr Ile Cys Asp Lys Asp Ala Phe Arg Ile Val Tyr Lys
100 105 110
Met Ile Phe Ser Tyr Val Leu Ile Asn Ile Leu Leu Met Ile Ile Tyr
115 120 125
Pro Asn Gly Ile Phe Pro Gln Tyr Gly Ile Tyr Phe Leu Gly Ile Arg
130 135 140
Thr Arg Phe Thr Glu Phe Ser Ile Ala Leu Met Tyr Ile Ser Leu Ile
145 150 155 160
Tyr Tyr Asn Glu Tyr Ser Ser Lys Asn Lys Lys Asp Arg Lys Lys Leu
165 170 175
Ile Thr Ser Tyr Val Leu Ala Ile Ile Asn Ile Leu Ser Lys Trp Val
180 185 190
Ala Thr Gly Ile Val Val Ile Val Leu Thr Ile Leu Phe Tyr Leu Leu
195 200 205
Leu Arg Asn Lys Lys Lys Leu Ser Leu Val Tyr Cys Phe Gly Phe Leu
210 215 220
Ile Leu Ile Ala Leu Ser Ile Asn Leu Ile Asn Gly Asn Leu Leu Asn
225 230 235 240
Leu Phe Ser Gly Leu Phe Glu Leu Leu Asn Lys Asp Ile Thr Leu Thr
245 250 255
Gly Arg Thr Ile Ile Trp Glu Asn Ala Ile Tyr Val Ala Lys Asn Tyr
260 265 270
Phe Leu Phe Gly Phe Gly Tyr Val Asn Asp Gly Asn Ile Ile Ser Tyr
275 280 285
His Asn Gly Leu Trp Gln Ala His Asn Thr Ile Leu Gln Ser Met Cys
290 295 300
Glu Ser Gly Ile Phe Gly Thr Leu Thr Leu Phe Met Met Ile Phe Lys
305 310 315 320
Gln Gly Phe Pro Asn Lys Ile Asp Asn Lys Lys Ile Asn Ser Leu Asn
325 330 335
Val Ala Ile Ile Phe Ser Phe Leu Ile Met Met Met Thr Glu Ile Thr
340 345 350
Tyr Tyr Tyr Pro Ile Phe Ile Phe Ile Ile Phe Leu Ile Gly Asn Asn
355 360 365
Tyr Lys Lys Leu Gly Glu Ile Lys Asp Asp Lys
370 375
<210> 287
<211> 361
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_GT2
<400> 287
Met Asn Lys Ile Lys Ile Leu Glu Ile Val Pro Asn Met Gln Gln Gly
1 5 10 15
Gly Leu Glu Asn Leu Ile Met Asn Ile Ile Arg Asn Leu Asp Met Gln
20 25 30
Lys Tyr Glu Ile His Phe Leu Tyr His Tyr Thr Gly Asn Tyr Tyr Phe
35 40 45
Asp Glu Glu Ile Gln Lys Ile Gly Cys Thr Ile His Lys Cys Ser Phe
50 55 60
Arg Glu Tyr Lys Asn Ile Phe Lys Tyr Ile Lys Phe Leu Lys Gly Phe
65 70 75 80
Phe Lys Lys Asn His Phe Asp Val Val His Ser His Met Leu Ser Thr
85 90 95
Ser Arg Leu Thr Leu Lys Tyr Ala Lys Gln Asn Gly Cys Lys Ile Leu
100 105 110
Ile Asn His Ser His Asn Ser Lys Thr Glu Lys Ser Leu Lys Gly Val
115 120 125
Ile Lys Arg Phe Met Ile Leu Asn Ala Ser Lys Tyr Ala Asn Glu Phe
130 135 140
Leu Ala Cys Ser Glu Glu Ala Gly Glu Phe Ala Tyr Gly Asn Arg Lys
145 150 155 160
Phe Thr Val Val Lys Asn Gly Ile Asp Leu Lys Lys Phe Ser Tyr Ser
165 170 175
Ala Glu Asn Arg Lys Gln Ile Arg Lys Glu Leu Asn Ile Gly Glu Lys
180 185 190
Asp Ile Ile Phe Gly Asn Val Gly Arg Leu Asn Val Gln Lys Asn Gln
195 200 205
Ser Phe Leu Ile Glu Cys Phe Ala Lys Ile Lys Asn Ser Asn Ala Lys
210 215 220
Leu Met Ile Ile Gly Phe Gly Glu Leu Lys Glu Glu Ile Val Ser Lys
225 230 235 240
Ile Asn Glu Ile Asn Ile Asn Glu Lys Val Ile Met Leu Glu Asn Val
245 250 255
Lys Ser Glu Tyr Tyr Tyr Ser Ala Met Asp Cys Phe Val Leu Pro Ser
260 265 270
Ile Phe Glu Gly Phe Pro Leu Thr Ala Val Glu Ala Gln Ala Asn Gly
275 280 285
Cys Pro Cys Ala Phe Ser Asp Lys Ile Thr Lys Asp Val Lys Leu Asn
290 295 300
Asn Asn Val Lys Phe Leu Asp Ile Asp Asn Ala Asp Glu Trp Ile Gln
305 310 315 320
Phe Phe Asn Asn Phe Asn Met Lys Arg Glu Glu Arg Tyr Ser Glu Lys
325 330 335
Leu Leu Asp Tyr Asp Thr Asp Lys Leu Ile Asn Val Ile Glu Lys Ile
340 345 350
Tyr Ser Lys Gly Ala His Asp Glu Asn
355 360
<210> 288
<211> 280
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_GT3
<400> 288
Val Asn Leu Val Glu Lys Lys Tyr Ser Val Leu Met Ser Val Tyr His
1 5 10 15
Lys Glu Asn Pro Thr Trp Phe Asp Glu Ser Ile Lys Ser Met Phe Glu
20 25 30
Gln Thr Ile Lys Pro Asn Glu Phe Val Leu Val Glu Asp Gly Pro Leu
35 40 45
Thr Lys Glu Leu Tyr Asp Val Val Glu Lys Tyr Lys Thr Lys Tyr Pro
50 55 60
Lys Glu Phe Lys Val Val Ala Ile Glu Lys Asn Val Gly Leu Gly Pro
65 70 75 80
Ala Leu Lys Lys Gly Val Glu Glu Cys Ser Asn Glu Tyr Ile Ala Arg
85 90 95
Met Asp Ser Asp Asp Tyr Ser Met Pro Lys Arg Ile Glu Lys Glu Phe
100 105 110
Glu Ile Phe Glu Lys Tyr Ser Asp Ile Gly Met Val Gly Thr Ser Ile
115 120 125
Ser Glu Phe Ile Asp Ser Ile Asp Asn Val Val Cys Asn Val Ile Leu
130 135 140
Pro Glu Thr Asn Glu Asp Ile Ile Lys Phe Ser Lys Ser Arg Asn Pro
145 150 155 160
Phe Arg His Pro Ser Val Met Phe Lys Lys Ser Glu Val Val Lys Ala
165 170 175
Gly Asn Tyr Arg Glu Tyr Tyr Leu Cys Glu Asp Tyr Asp Met Trp Leu
180 185 190
Arg Met Ile Arg Asn Gly Cys Lys Tyr Tyr Asn Ile Gln Asp Ile Tyr
195 200 205
Val Tyr Met Arg Ile Gly Glu Asp Phe Tyr Lys Arg Arg Gly Gly His
210 215 220
Lys Tyr Phe Lys Ser Ile Lys Lys Phe Lys Lys Glu Gln Leu Glu Asn
225 230 235 240
Gly Tyr Phe Thr Lys Phe Glu Tyr Leu Lys Ser Ile Val Pro His Ala
245 250 255
Ile Val Cys Tyr Met Pro Asn Phe Met Arg Asp Phe Val Tyr Arg Lys
260 265 270
Met Leu Arg Arg Gly Arg Lys Lys
275 280
<210> 289
<211> 479
<212> PRT
<213> Lactococcus lactis
<220>
<223> 33183_wzx
<400> 289
Leu Phe Asn Ser Gly Gly Leu Leu Met Gln Ile Ala Lys Asn Tyr Leu
1 5 10 15
Tyr Asn Ala Ile Tyr Gln Val Phe Ile Ile Ile Val Pro Leu Leu Thr
20 25 30
Ile Pro Tyr Leu Ser Arg Ile Leu Gly Pro Ser Gly Ile Gly Ile Asn
35 40 45
Ser Tyr Thr Asn Ser Ile Val Gln Tyr Phe Val Leu Phe Gly Ser Ile
50 55 60
Gly Val Gly Leu Tyr Gly Asn Arg Gln Ile Ala Phe Val Arg Asp Asn
65 70 75 80
Gln Val Lys Met Ser Lys Val Phe Tyr Glu Ile Phe Ile Leu Arg Leu
85 90 95
Phe Thr Ile Cys Leu Ala Tyr Phe Leu Phe Val Ala Phe Leu Ile Ile
100 105 110
Asn Gly Gln Tyr His Ala Tyr Tyr Leu Ser Gln Ser Ile Ala Ile Val
115 120 125
Ala Ala Ala Phe Asp Ile Ser Trp Phe Phe Met Gly Ile Glu Asn Phe
130 135 140
Lys Val Thr Val Leu Arg Asn Phe Ile Val Lys Leu Leu Ala Leu Phe
145 150 155 160
Ser Ile Phe Leu Phe Val Lys Ser Tyr Asn Asp Leu Asn Ile Tyr Ile
165 170 175
Leu Ile Thr Val Leu Ser Thr Leu Ile Gly Asn Leu Thr Phe Phe Pro
180 185 190
Ser Leu His Arg Tyr Leu Val Lys Val Asn Tyr Arg Glu Leu Arg Pro
195 200 205
Ile Lys His Leu Lys Gln Ser Leu Val Met Phe Ile Pro Gln Ile Ala
210 215 220
Val Gln Ile Tyr Trp Val Leu Asn Lys Thr Met Leu Gly Ser Leu Asp
225 230 235 240
Ser Val Thr Ser Ser Gly Phe Phe Asp Gln Ser Asp Lys Ile Val Lys
245 250 255
Leu Val Leu Ala Ile Ala Thr Ala Thr Gly Thr Val Met Leu Pro Arg
260 265 270
Val Ala Asn Ala Phe Ala His Arg Glu Tyr Ser Lys Ile Lys Glu Tyr
275 280 285
Met Tyr Ala Gly Phe Ser Phe Val Ser Ala Ile Ser Ile Pro Met Met
290 295 300
Phe Gly Leu Ile Ala Ile Thr Pro Lys Phe Val Pro Leu Phe Phe Thr
305 310 315 320
Ser Gln Phe Ser Asp Val Ile Pro Val Leu Met Ile Glu Ser Ile Ala
325 330 335
Ile Ile Phe Ile Ala Trp Ser Asn Ala Ile Gly Thr Gln Tyr Leu Leu
340 345 350
Pro Thr Asn Gln Asn Lys Ser Tyr Thr Val Ser Val Ile Ile Gly Ser
355 360 365
Ile Val Asn Leu Met Leu Asn Ile Pro Leu Ile Ile Tyr Leu Gly Thr
370 375 380
Val Gly Ala Ser Ile Ala Thr Val Ile Ser Glu Met Ser Val Thr Val
385 390 395 400
Tyr Gln Leu Phe Ile Ile His Lys Gln Leu Asn Leu His Thr Leu Phe
405 410 415
Ser Asp Leu Ser Lys Tyr Leu Ile Ala Gly Leu Val Met Phe Leu Ile
420 425 430
Val Phe Lys Ile Ser Leu Leu Thr Pro Thr Ser Trp Ile Phe Ile Leu
435 440 445
Leu Glu Ile Thr Val Gly Ile Ile Ile Tyr Val Val Leu Leu Ile Phe
450 455 460
Leu Lys Ala Glu Ile Ile Asn Lys Leu Lys Phe Ile Met His Lys
465 470 475
<210> 290
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> Eps gene cluster of DSM 33133
<400> 290
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctggag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttaggtaa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttacgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacagc cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatactcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagga attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggcg gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
<210> 291
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> Eps gene cluster of 33204, 33205, 33220, 33221, 33218, 33219,
33224, 33197, 33196, 33195, 33194, 33226, 33223, 33193 and 33192
<400> 291
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctggag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttagataa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttatgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacaac cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatattcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagga attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggag gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
<210> 292
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> Eps gene cluster of 33200, 33201, 33202 and 33203
<400> 292
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctgtag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttagataa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttatgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacaac cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatattcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagga attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggag gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
<210> 293
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> Eps gene cluster of 33222
<400> 293
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga ctatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctggag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttagataa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttatgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacaac cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatattcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagta attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggag gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
<210> 294
<211> 12651
<212> DNA
<213> Lactococcus lactis
<220>
<223> Eps gene cluster of 33225
<400> 294
atgaatgatt tattttacca tcgtctaaag gaactagttg aatcaagtgg taaatctgca 60
aatcaaatag aaagggaatt gggttaccct agaaattctt tgaataatta taagttggga 120
ggagaaccct ctgggacaag attaatagga caatcagagt attttaatgt gtctccaaaa 180
tatctgatgg gtataagtga tgagcctaat gacagttctg caattaatct ttttaaaact 240
ctaactcaag aagagaaaaa agaaatgttt ataatttgtc aaaaatggct ttttttagaa 300
tatcaaatag agttataaca ataataaatt tagggagttt ttcttattaa tatgatgaaa 360
aaaggaattt ttgtaattac tatagtgata tctatagcat tgataattgg aggtttttta 420
tagttataat tctaggataa ataatctttc aaaagctgat aaaggaaaag aagttgtaaa 480
aaatagcagt gaaaaaaatc agatagacct tacctataaa aagtattata aaaatttacc 540
aaaatcagtt caaaataaaa tagatgatat ttcatccaaa aataaagaag ttactttaac 600
ttgtatttgg caatctgatt cagttatttc tgaacaattt caacaaaact tacaaaaata 660
ttatggaaat aagttttgga acatcaaaaa tatcacctac aatggcgaaa caagtgaaca 720
attattggct gaaaaagttc aaaatcaagt attggcgact aaccctgatg ttgttttata 780
tgaagctcca ctttttaatg ataaccaata tagactactg ggctagttac ccagacaaaa 840
attctgatga aatgaagggg ctgttttctg atgatggagt atatagaaca ttaaatgctt 900
cggggaataa ggtttggcta gattatatta ctaaatattt tacagcaaac taattaagtt 960
ataaataaca attattaaat attggagaag aaatgcagga aacacaggaa cagacgattg 1020
atttaagagg gatttttaaa attattcgca aaaggttagg tttaatatta tttagtgctt 1080
taatagtcac aatattaggg agcatctaca cattttttat agcctcccca gtttacacag 1140
cctcaactca acttgtcgtt aaactaccaa attcggataa ttcagcagcc tacgctggag 1200
aagtgaccgg gaatattcaa atggcgaaca caattaacca agttattgtt agtccagtca 1260
ttttagataa agttcaaagt aatttaaatc tatctgatga ctctttccaa aaacaagtta 1320
cagcagcaaa tcaaacaaat tcacaagtta ttatgcttac tgttaaatat tctaatcctt 1380
acattgcaaa aaagattgca gacgagactg ctaaaatttt tagttcagat gcagcaaaac 1440
tattgaatgt tactaacgtt aatattctat ccaaagcaaa agctcaaaca acaccaatta 1500
gtcctaaacc taaattgtat ttagcgatat ctgttatagc cggactagtt ttaggtttag 1560
ccattgcttt attgaaggaa ttgtttgata acaaaattaa taaagaagaa gatattgaag 1620
ctctggggct cacggttctt ggtgtaacaa gctatgatca aatgagtgat tttaataaga 1680
atacaaataa aaatggcacg caatcgggaa ctaagtcaag tccgcctagc gaccatgaag 1740
taaatagatc atcaaaaagg aataaaagat aggagttcag gatggctaaa aataaaagaa 1800
gcatagacaa taatcattat attattacca gtgtcaatcc tcaatcacct atttccgaac 1860
aatatcgtac gattcgtacg accattgatt ttaaaatggc ggatcaagga attaaaagtt 1920
ttctagtaac atcttcagaa acagatgaag gtaaaacaac cgtaagtgct aatatagctg 1980
ttgcttttgc acaacaaggt aaaaaagtac ttttaattga tggcgatctt cgtaaaccga 2040
ctgttaacat tacttttaaa gtacaaaata gagtaggatt aaccaatatt ttaatgcatc 2100
aatcttcgat tgaagatgcc atacaaggga caagactttc tgaaaatctt acaataatta 2160
cctctggtcc aattccacct aatccatcgg aattattagc atctagtgca atgaagaatt 2220
tgattgactc tgtgtccgat ttctttgatg ttgttttgat tgatattcca cctctctctg 2280
cagttactga tgctcaaatt ttgagtagtt atgtaggagg agtggttctt gttgtacgtg 2340
cctatgaaac aaaaaaagag agtttagcaa aaacaaaaaa aaagctggaa caagttaatg 2400
caaatatatt aggagttgtt ttgcatgggg tagactcttc tgactcaccg tcgtattact 2460
actacggagt agagtaattg gaataaattt taatcaaata aaagacagaa atttgtagaa 2520
gaggagagca aatgattgat attcattgcc atattttacc gggtatagat gatggagcta 2580
aaacttctgg agatactttg acaatgctga aatcagcaat tgatgaaggg ataacaacca 2640
tcaccgctac tcctcatcat aatcctcaat ttaataatga atcaccactt attttaaaaa 2700
aagttaagga agttcaaaat atcattgacg agcatcaatt accaattgaa gttttgcctg 2760
gacaagaggt tagaatatat ggtgatttat taaaagaatt ttctgaagga aagttactga 2820
aagcagcggg cacttcaagt tatatattga ttgaatttcc atcaaatcat gtgccagctt 2880
atgctaaaga acttttttat aatattaaat tggagggcct tcaacctatt ttggtccacc 2940
ctgagcgtaa tagtggaatc attgagaacc ctgatatatt atttgatttt attgaacaag 3000
gagtactaag tcagataaca gcttcaagtg tcactggtca ttttggtaaa aaaatacaaa 3060
agctatcatt taaaatgata gaaaaccatc ttacgcattt tgttgcatca gatgcgcata 3120
atgtgacgtc acgtgcattt aagatgaagg aagcatttga aattattgaa gatagttatg 3180
gttctggtgt atcacgaatg ttacaaaata atgcagactc ggtgattttg aacgaaagtt 3240
tttatcaaga agaaccaata aaaattaaaa caaagaaatt tttgggatta ttttaaaagg 3300
attaaatgga gtaaataatg gaagtttttg aggcatcatc tgaactggaa gagcctaagt 3360
tagtagaatt aaaaaaattt tctcgcagag agataattat aaaaagaggg attgatattt 3420
tagggggatt agcgggttca ggtttatttc ttatcgcggc tgcattgctt tatgtccctt 3480
acaaaatgag ctcaaaaaaa gatcaagggc caatgttcta taaacaaaaa cggtatggaa 3540
aaaatggtaa aattttttat attttgaaat ttagaacaat gataattaat gctgagcagt 3600
atttagagct acatccagaa gttaaagccg cctatcatgc caatggcaat aaactagaaa 3660
gtgatccccg tgtaacgaag attgggtcat ttattagaca acactcaatt gatgaattac 3720
cacaatttat caatgtcctt aaaggagata tgtcattagt tggtccaaga ccaattttgc 3780
tttttgaagc gaaagaatat ggggagcgcc tctcttactt actgatatgc aaacctggaa 3840
ttactggtta ttggacaaca catggtcgaa gtaaagttct ttttcctcaa cgagcagatt 3900
tagagctcta ttatctccag taccatagta caaaaaatga tataaaactt attatgctta 3960
caataaaaca aattctacat ggatcggatg cttattaaag taacattatg aaaaaaaaaa 4020
caactaaaat ttgcatgatt tcttcttctg ggggtcattt aaaagagctt aatgaattga 4080
tagagatttc agagcagtat gaaacgtttc aaattactga aaaagataaa ttttctaata 4140
tcaagattgg aactaggcaa tactatgtga ataaaattga tagagatgaa aaaaattttt 4200
tatttcattt ttttattctt tttttgaaaa tatttcaaat atttgctgta gagaagccta 4260
aagttatagt aaccactggt gccttagtag cttatccagc atgtctaata ggaaaattaa 4320
tgagagctaa agttattttt atagagtctt atgctcgaac agaaacatta tcattaacag 4380
gaaaattagt ttataggtta tctgatttat ttattgttca atggccagat ctttcaaaaa 4440
aatattctaa agctaaatac tatggggaat tattctgatg atattaataa tattagggac 4500
tcaaaaattt caattcaacc gacttataaa aaaagttgat aaattaatag aagatgatca 4560
aatcaaagat tctgtaatag ctcaaatcgg atattctaat tacaaaccta taaattataa 4620
attttcagat ttttttgatc aatcggaatt tgattcatta ataaataaat cagatataat 4680
aataactcat ggaggagtag gtgggatagt ttcttcctta aaaaagaata aaaaaatcat 4740
agtagttccg cgtttaaaga aatacagaga acatattgat gatcatcaat tagagatagc 4800
aagggcgttt caaagaaaaa atctagttat tttaaacgag aatctaaatg aactatgtaa 4860
tgatatatct aaaattgaat cattcgagcc aatacactat gtcaaagata ataaaaaaat 4920
tatatgtgaa ataaaaaaat ttatatcgaa agttaaatga tatttttata caaaattatc 4980
ttatgatgag aaaggacttt ttaaaagata aaaaatgata aaattgagca ttataattcc 5040
aatttataac gtggaaaaat atttaagtaa atgtttaaat tctattttag aacaaactta 5100
taaagaaata gaaataatat tagtaaatga tggtagtact gataactcaa aagatatagc 5160
tgtaagctat tgtgaaagat ttcctaatgt ttttaaatat tttgagaaag ataacggagg 5220
cctctcttca gccagaaatt ttggacttga aaaaatttct ggtgattttg taggcttctt 5280
agactcagat gactatatag ataacgattt atatgaaatt atgattaatt cattggatag 5340
ttcaataaaa attgtggaat gtgattttat atgggaatac gaaaatggaa aaagtgtcct 5400
tgataaaaca tctgaatata attctatcaa agacttaatg gttaacggta gagttgttgc 5460
ttggaataaa atatataatg ttgaatggtt agaaaaaata aacataaagt ttaaagaagg 5520
tctattgtat gaagatttaa attttttctt caaaattgtt cctcacttga ctagtatttc 5580
agaagtatca acagttaaaa atagttttgt tcactatgtc cagcataaag gtacaataac 5640
ttcagataat tctcttaata tcttggatat cataaaatct tacgaagatg tctttcatta 5700
ttataacgaa aaacagatta atgatttata ttttgatgag ctagaatata aattttctag 5760
gaacttaatg ggggcatttt taaaaagagc aattaagatt aaagataaaa gacaacgtaa 5820
aataatttta gatgaatttt ggaataatgt tttatcttac tatccgaatt ggaaaaaaaa 5880
taaatatata aaaaaactat caaaacagaa tatactttta ttttttatta ataaatatac 5940
atataaatta ttttatttat tataaaaaaa atttaatatt agagtatttg tattagttgc 6000
aatgaaaata tcgaaagtag aataaatgat ttatgtagaa ataaggggaa acttaggtaa 6060
tcaattattt atctatgcca ccgcaaaaaa aattcaaaag ttaaccggac aaaaaattca 6120
attaaataca acaactttaa ataaatactt tccaaattac aagtttggcc tttcagaatt 6180
tataatggag gatcctgatt gttttattga atcctataaa aaattaccct ggttcacaaa 6240
cgagtatctc ttacctatta aaatttttaa aaaaatattg aataaaacac ccaaaattaa 6300
taaaatcctt tcagattttt ttttcaaagc ttttgaaaaa aaaggatatt ttatttggcg 6360
aggagagact tttaaaaagt tttctttagg aaatcataaa aattactatt tatcaggttt 6420
ttggcaatcg gaagaatatt tttatgatat aagggatgaa ttattagaaa tcatcactcc 6480
tataaattca ataagagagt gtaactttga acttctcaat ttaataagga attcagaatc 6540
aatttgtgtt tcaatacgcc gaggagatta tgtagataat cctaaaatat cagctattta 6600
taacgtatgt gatataaatt attttataga atctgtaaat gaaataaaga aaaatgttgt 6660
gaatgttaaa gttatctgtt tttcagatga tgttgaatgg gtcaaaaaaa atataaaatt 6720
cgactgtgaa acacattatg aaacttatgg taattcttta tctgaaaaag ttcaacttat 6780
gtcttcttgt aaacattttg ttttatctaa tagttctttt agttggtgga cagaattttt 6840
atctatacga ggtgggatta ctatagcccc caaaaattgg tatgcagatg aacgtgaagc 6900
tgatatctat agaaaaaatt ggatttactt agaagataag acagaggaag agtaatggga 6960
tttctatttt taactataat acttattttg tgggggtata gttttaccaa tataaaaata 7020
agccctttta gtattttatt catgagttta gggatctttt actctcaatt tacttcaata 7080
aatattgact taataataaa agtacttttt ttgataactt ccataattta tcttattaaa 7140
gataaatatt caaaaaaata cgttttttct ttattattaa ttgcagtatt aattttaatt 7200
gagtcaacta gtccctctaa atttaatcaa tattatggtt ttattgatgc tttgacatca 7260
tttgcaacct tctcaacagg catacttcta ttttccataa aatttagttt acaagaacgc 7320
agaagtattt taaaatcaat ttcatatttg ccaatctttt cagtgttaat tggaatccct 7380
ctaacttttg gtggttttat atctatgaca gctagaggag gaattgccct ttcaggagca 7440
gctttagaaa caaatttatc ttttttttca gttctaagcc ttgtttcatt agatatttta 7500
tatcaggaca ctcgttctaa taaatatcaa attttaaaaa ttattaactt tatattgcta 7560
tgttgtactt taacacgagg cggtattatt tctggaatta tcattatttt accaagttta 7620
ctatttcttt taaaaaaagg atttaaagga gtaagacaat ttattttttt gattattact 7680
atttttggga gtatttatcc gcttatttta ttgtggaaaa gtattagtga gaggactttc 7740
agtgcagatg gtattaatac ttcaggtcga tatacggcct gggactatat tgtgaatttg 7800
acaacaaaca aatctcaggg aatgggattg ggaagtttaa agacattaac tgaggatatt 7860
aatttacgtg cctttactgc tgctcataat acatatattc aattttatta tgaaactggt 7920
tatttgggag taacactatt atctatttta tttattttaa tattaataat aatcctaaaa 7980
ttgactaatt atagaaaaaa aatcatttac ttaacattca tttcattttt agtatatagt 8040
tatacagata attgtattgt taataataga tactggtatt tgtttatgtt tattatagga 8100
tgttttaaat attttgacag aaaggaagaa aatgcgctac tttaaaatat tatttgagat 8160
tattcaacta ttggtagcta gtattttatg tagattatat aaaaatccaa atgatatctg 8220
gctaataaat gaaaaacctg atgaagctag agataatggt tatgcttttt atcaatattt 8280
aagaaagaat ttccccgata ttaaagttta ttatgtaatc agtaaagagt ctactgatat 8340
ttataagttt gataatgaaa ctaacattgt attttataag agttttttac attttatttt 8400
atatatcaaa tctaaagttt taattagttc tcaaacattg ccctatccat ctagcagaaa 8460
attatgtgaa gcgctaatgt accttaattt gaataaacca aagaggattt ggttacaaca 8520
tggagttact aaagataaac tcccatatga gaatatggca agggaaattt ttaagtatga 8580
tttaataacc tgtgtttcat taaaagaggc taattttata atgaaagaat atggatataa 8640
tgaagatcag gtgaaggctc ttggatttgc aagatatgat aatttgccaa ttggaaataa 8700
taacacattt gatatactta taatgcctac tttccgtaag ggttacgaga ttaaaaattt 8760
tagtctccca acagatagtg aaactaaaca ttttgaggaa agtgtattct ttaaaacata 8820
tgttgattta ttgaattctg aagagctaga cgagtattta gaaaagtctg gtaaaaaagc 8880
aattttttat ttacactatg cttttcaacc atatgcaaaa tctttttcta aacgactaat 8940
gtcttcaaat gttatcattg ctgaaagaac agaatatgat gttcaaaaac tattaattaa 9000
ttgtgaattg ctaattacag attattcaag cgtttttttt gatttttcat atatgaaaaa 9060
acctgaaata tttttccatt ttgatgagaa agaatataga agtaatcatt atagggaggg 9120
atattttgat tataaaacag atggatttgg tccagtagtt aattctaaag aagaattact 9180
aactgaaatc aaagagttta ttgataaccc atctctgtta atggaattta ataagcgagc 9240
taataatttc ttcaaatata ctgataacaa taattgccaa cgtattttaa aagaaatttg 9300
gagaattaat gaaactaatt aagaattatt taatgacaag ctcttatcaa ttgttaatta 9360
ttatcttacc aataataaca acaccatata tatcgagagt acttagtcca gaagggattg 9420
gactttattc atatacttat actattacac agtatttcgt attatttgct actcttggta 9480
ctgttacgta tggtagcaga gagatagcat attatcagtc aaataaacaa aagagaagtg 9540
aaattttttg gggaattacc ttccttagct gggctactgg tgctatatca cttttaatat 9600
tttatatatt tatttttttt aatggcaaat atagtgtttt atttttttgg caaagctttt 9660
tgatttttgg agttattttt gatattaatt ggtatttcac aggaatggaa aagttcaagg 9720
ttattatttc acgtaacttt tgtataaaaa ttattagttt attgtgtatt tttgtctttg 9780
taaaatctga gaaagattta agtttatata tagttatact aggattgagc aatataatag 9840
gtaatatatt agtttggcca tatttgagaa aagaggttta taaacctaat ttttctaagt 9900
tatcattcaa aaaacatttg ggaagtacat ggatattttt tttgccacaa acttctgtta 9960
ctttaaactc attaataaac caaaatatga ttgcatattt tgactcaata acaagcttag 10020
gatactttac acaaacaaat aagtttactg tgattgcgat ttcaatagtt atttcaattg 10080
ggactgttat gttgcctaga atgtccaatt tagttgcgcg caaagagtat tcaaagttta 10140
cagactatgt tactaagagt attaatataa gctcaggaat ttctatagca ataatgtttg 10200
gtttaatggc tatagcacct aagtttacaa cttttttttt aggagctcaa tataaatttg 10260
ttattcattt gctagtttta tcatcaccga tagtggtttt agtaacctgg agtaatgttc 10320
ttggtcaaca atatttaata cctttaaata ggatgaaaat atttacaaaa tctctaattt 10380
gtggaaactt agtaaatgtt tctctaaact tgattttgtt acccaaaatg ggagtagaaa 10440
tttcaataat aaatcagtta attaatgaaa ttattattgt aggtattcaa tttatatcag 10500
ttagaaaaga gttaaaaata aatataatat taggagatct aataaaatat ttttttgcgg 10560
gtataattat gtttattgcc gttttatatc tgaatttaca attaccgatg actatcttca 10620
cactacttat agagattggt attggagttc ttatatattc aatgctagtt atttctctta 10680
aaactggatt atataaagaa ttgaaaaaga ttattaaaat tcgttagctt aaaatctatc 10740
acctttcatt tgagtagtaa gaaatacaaa gctttattat aaaatttatc atttttaaga 10800
ctatcataaa agaagaagga tgacatggaa cgaaaaaaga agaaaaaaaa aatatatata 10860
attattctaa tattattaat gtttatcact attgtttgtt ttgggggata tgctacacga 10920
gagttaatta ctcccactga aaaaacaata ccaaatgtct cggatcaacc taaaaaaact 10980
tcggcctcta acggttatgt agagcaaaaa ggggaagaag ctgctgtggg tagtatagca 11040
cttgtagacg atgctggtgt accagaatgg gttaaagttc cctcaaaggt aaatctagat 11100
aaatttactg atttatctac gaataatatc actatttatc gaattaataa tccggaagtc 11160
ttaaaaacag ttaccaatcg tacagatcaa cggatgaaaa tgtcagaagt tatagctaag 11220
tatcctaatg ctttgattat gaatgcttcc gcttttgata tgcagacagg acaagtagct 11280
ggatttcaaa ttaataatgg aaagttgatt caagactgga gtccaggtac aacgactcaa 11340
tatgcttttg ttattaacaa agatggttcg tgcaaaattt atgattcaag tacacctgct 11400
ttaactatta ttaaaaatgg agggcaacaa gcctatgatt ttggtactgc gattatccgt 11460
gatggtaaaa ttcaaccaag tgatggctca gtagattgga agattcatat ttttattgcg 11520
aatgataaag ataataatct ctatgctatt ttgagtgata caaatgcagg ttatgataat 11580
ataataaaat cagtatcaaa tttgaagctc caaaatatgt tattacttga tagtggtggc 11640
tcaagtcaac tatctgtcaa tggtaaaacg attgttgcta gtcaagatga tcgagccgta 11700
ccggattata ttgtgatgaa ataaaaataa aagaacctca tggttctttt attttagaga 11760
tttttcaaaa agggttttga ctgagtctaa ttctgtttga gaaacgacct tagctccatt 11820
ttcatctgtt gtatgtagat tgagcttgct ggtattcttt agagccttac tgtagctcat 11880
aacgatcgtt ttagcatcat tgaaacttat attggtttta acagtgtttg aaaaagcata 11940
gagaatttct cgatagtaat tgaaactttc aagttttttc atatgagcaa ttttttgaat 12000
atttccgtag agttccattg agacattttg aatccgagtg attgaagcat ccaaatcagt 12060
atcgtcaatt tgtgtcatat aggcttggac ttgatcagca gtttgtaaat taacagttcc 12120
ttgtttaaac tcataacctt cagccttgaa tgcctttgga ttttgcatgg tgattccacc 12180
agtggcctgt acaagtgatc ccattttatt aacatcgatc tgaattactt tgttaatgga 12240
cgcattcaat aggtctttaa ccatctggaa aattccatca tctccattcg tattgtaaac 12300
ttcagtgatt gttttttgat taggcattgt cgcaaaaact gggaagttca tgaaagtagt 12360
ttgatttgtc tttacattcg ttgaagctaa aacagtagca taagctgaat ttttagaatt 12420
atttttacca gttgcaatga taagtgtggt gaatgtttta gcttttttta agtcaatact 12480
tgttgtttta gggaaatttt catatgatgt tgaaaaggtt gattcaacat ttctataagc 12540
tacgtaggct atagaagcaa cagaaataat tactaataca aaaataattg aaataacttt 12600
tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651
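The listing above prints residues in blocks of ten, with a running residue count at the end of each line (the final line ends at 12651). A minimal sketch of how such lines could be joined back into one sequence while checking the running counts is shown below; the function name `parse_listing_lines` and the `start` parameter are hypothetical and not part of the patent.

```python
def parse_listing_lines(lines, start=0):
    """Join sequence-listing lines and verify each trailing running count.

    `start` is the number of residues that precede the first line given
    (an illustrative parameter, so a fragment of a listing can be checked).
    """
    residues = []
    total = start
    for line in lines:
        # The last whitespace-separated token is the running count;
        # everything before it is sequence data in blocks of ten.
        *blocks, count = line.split()
        chunk = "".join(blocks)
        total += len(chunk)
        if total != int(count):
            raise ValueError(f"expected running count {total}, found {count}")
        residues.append(chunk)
    return "".join(residues)


# Final line of the listing above; 12600 residues precede it.
tail = parse_listing_lines(
    ["tagtactgtg tattttttct tacgataatg acgcctcttt ttttgattca t 12651"],
    start=12600,
)
```

A mismatch between the counted residues and the printed number raises an error, which is a cheap way to detect lines lost or truncated during text extraction.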
Form PCT/RO/134 (26 page images, not reproduced here)

Claims (19)

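1. A Lactococcus lactis lactic acid bacterium strain comprising an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises, as the case may be, the following nucleotide sequences (a) to (c) as defined in any one of (i) to (x):
(i)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 11 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 17 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 9 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 13 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 15 (referred to herein as GT3);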
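(ii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO: 199 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 26029-27444 of the complementary strand of SEQ ID NO: 199 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO: 199 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO: 199 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO: 199 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO: 199 (referred to herein as GT4); and
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 18528-19508 of SEQ ID NO: 199 (referred to herein as GT5);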
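(iii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 39 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 45 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 37 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 41 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 43 (referred to herein as GT3);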
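(iv)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 163 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 169 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 161 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 165 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 167 (referred to herein as GT3); and
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 181 (referred to herein as GT4);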
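(v)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO: 224 (referred to herein as wzy); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO: 224 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO: 224 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO: 224 (referred to herein as GT3); and
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 18527-19507 of SEQ ID NO: 224 (referred to herein as GT4);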
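(vi)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 67 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 73 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 65 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 69 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 71 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 85 (referred to herein as GT4);
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 87 (referred to herein as GT5); and
(c6): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 89 (referred to herein as GT6);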
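(vii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 5833-6927 of SEQ ID NO: 244 (referred to herein as wzy); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4617-5123 of SEQ ID NO: 244 (referred to herein as GT1); and
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 5120-5827 of SEQ ID NO: 244 (referred to herein as GT2);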
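(viii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 123 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 129 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 121 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 125 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 127 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 143 (referred to herein as GT4);
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 145 (referred to herein as GT5); and
(c6): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 147 (referred to herein as GT6);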
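(ix)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 11201-12349 of the complementary strand of SEQ ID NO: 257 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 15538-16953 of the complementary strand of SEQ ID NO: 257 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 9726-10673 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 12336-13421 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13418-14260 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT3);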
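(x)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 10707-11846 of the complementary strand of SEQ ID NO: 274 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 15037-16476 of the complementary strand of SEQ ID NO: 274 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 9232-10179 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11833-12918 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 12915-13757 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT3).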
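Claim 1 repeatedly sets a threshold of at least 95% identity at the amino acid level. The claims in this section do not say how identity is computed (alignment program, gap treatment, denominator), so the following is only an illustrative sketch. It assumes the two sequences have already been aligned with `-` as the gap character, and it counts identical columns over the alignment length; the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumption: identity = identical columns / aligned columns, where
    columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True if the identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions (for example dividing by the length of the shorter sequence) can give different values near the threshold, so the definition used in the patent description should take precedence over this sketch.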
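2. The Lactococcus lactis lactic acid bacterium strain according to claim 1, wherein the eps gene cluster comprises, as the case may be, nucleotide sequences (a) to (m) as defined in any one of (i) to (x):
(i)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 11 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 17 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 9 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 13 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 15 (referred to herein as GT3);
(d): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 23 (referred to herein as putative nucleotide sugar dehydrogenase protein);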
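(ii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO: 199 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 26029-27444 of the complementary strand of SEQ ID NO: 199 (referred to herein as wzx);
(c): at least one, preferably two, more preferably three, even more preferably four, most preferably five nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO: 199 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO: 199 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO: 199 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO: 199 (referred to herein as GT4);
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 18528-19508 of SEQ ID NO: 199 (referred to herein as GT5);
(d): a nucleotide sequence encoding a polypeptide having dTDP-glucose 4,6-dehydratase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 4784-5695 of SEQ ID NO: 199 (referred to herein as dTDP-glucose 4,6-dehydratase);
(e): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose reductase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 5717-6631 of SEQ ID NO: 199 (referred to herein as dTDP-4-dehydrorhamnose reductase);
(f): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose 3,5-epimerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 6586-7257 of SEQ ID NO: 199 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(g): a nucleotide sequence encoding the polypeptide DUF1919 and having at least 95% identity with the amino acid sequence encoded by nucleotides 8515-9144 of SEQ ID NO: 199 (referred to herein as DUF1919);
(h): a nucleotide sequence encoding the polypeptide DUF4422 and having at least 95% identity with the amino acid sequence encoded by nucleotides 10271-11029 of SEQ ID NO: 199 (referred to herein as DUF4422); and
(i): a nucleotide sequence encoding a polypeptide having UDP-galactopyranose mutase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 9159-10274 of SEQ ID NO: 199 (referred to herein as UDP-galactopyranose mutase);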
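(iii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 39 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 45 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 37 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 41 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 43 (referred to herein as GT3); and
(d): a nucleotide sequence having polysaccharide pyruvyl transferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 51 (referred to herein as polysaccharide pyruvyl transferase family protein);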
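(iv)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 163 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 169 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 161 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 165 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 167 (referred to herein as GT3); and
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 181 (referred to herein as GT4); and
(d): a nucleotide sequence encoding a polypeptide having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 175 (referred to herein as core-2/I-branching protein);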
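(v)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 13939-15042 of SEQ ID NO: 224 (referred to herein as wzy);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4174-4824 of SEQ ID NO: 224 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11042-12391 of SEQ ID NO: 224 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13008-13934 of SEQ ID NO: 224 (referred to herein as GT3); and
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 18527-19507 of SEQ ID NO: 224 (referred to herein as GT4);
(d): a nucleotide sequence encoding the polypeptide DUF1972 and having at least 95% identity with the amino acid sequence encoded by nucleotides 7276-8508 of SEQ ID NO: 224 (referred to herein as DUF1972);
(e): a nucleotide sequence encoding the polypeptide DUF4422 and having at least 95% identity with the amino acid sequence encoded by nucleotides 10271-11029 of SEQ ID NO: 224 (referred to herein as DUF4422);
(f): a nucleotide sequence encoding the polypeptide DUF1919 and having at least 95% identity with the amino acid sequence encoded by nucleotides 8515-9144 of SEQ ID NO: 224 (referred to herein as DUF1919);
(g): a nucleotide sequence encoding a polypeptide having UDP-galactopyranose mutase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 9159-10274 of SEQ ID NO: 224 (referred to herein as UDP-galactopyranose mutase);
(h): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose 3,5-epimerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 6586-7257 of SEQ ID NO: 224 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(i): a nucleotide sequence encoding a polypeptide having dTDP-glucose 4,6-dehydratase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 4784-5695 of SEQ ID NO: 224 (referred to herein as dTDP-glucose 4,6-dehydratase); and
(j): a nucleotide sequence encoding a polypeptide having dTDP-4-dehydrorhamnose reductase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 5717-6631 of SEQ ID NO: 224 (referred to herein as dTDP-4-dehydrorhamnose reductase);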
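(vi)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 67 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 73 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 65 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 69 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 71 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 85 (referred to herein as GT4);
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 87 (referred to herein as GT5); and
(c6): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 89 (referred to herein as GT6);
(d): a nucleotide sequence having epimerase/dehydratase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 93 (referred to herein as NAD-dependent epimerase/dehydratase 1);
(e): a nucleotide sequence having dehydrogenase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 79 (referred to herein as nucleotide sugar dehydrogenase);
(f): a nucleotide sequence having thymidylyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 95 (referred to herein as rfbA, glucose-1-phosphate thymidylyltransferase);
(g): a nucleotide sequence having dehydratase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 97 (referred to herein as dTDP-glucose 4,6-dehydratase);
(h): a nucleotide sequence having epimerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 99 (referred to herein as dTDP-4-dehydrorhamnose 3,5-epimerase);
(i): a nucleotide sequence having epimerase/dehydratase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 101 (referred to herein as NAD-dependent epimerase/dehydratase family protein 2);
(j): a nucleotide sequence having acyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 111 (referred to herein as acyltransferase 1);
(k): a nucleotide sequence having acyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 107 (referred to herein as acyltransferase 2);
(l): a nucleotide sequence having reductase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 103 (referred to herein as dTDP-4-dehydrorhamnose reductase); and
(m): a nucleotide sequence having nucleotidyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 105 (referred to herein as nucleotidyltransferase);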
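(vii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 5833-6927 of SEQ ID NO: 244 (referred to herein as wzy);
(c): at least one, preferably two, most preferably three nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, selected from:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4617-5123 of SEQ ID NO: 244 (referred to herein as GT1); and
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 5120-5827 of SEQ ID NO: 244 (referred to herein as GT2); and
(d): a nucleotide sequence encoding a polypeptide having UDP-N-acetylglucosamine-LPS N-acetylglucosamine transferase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 4168-4617 of SEQ ID NO: 244 (referred to herein as UDP-N-acetylglucosamine-LPS N-acetylglucosamine transferase);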
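(viii)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 123 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 129 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 121 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 125 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 127 (referred to herein as GT3);
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 143 (referred to herein as GT4);
(c5): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 145 (referred to herein as GT5); and
(c6): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 147 (referred to herein as GT6);
(d): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 149 (referred to herein as acetyltransferase);
(e): a nucleotide sequence encoding a polypeptide having dehydrogenase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 135 (referred to herein as nucleotide sugar dehydrogenase); and
(f): a nucleotide sequence encoding a polypeptide having acyltransferase activity and at least 95% identity with the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 151 (referred to herein as acyltransferase);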
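(ix)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 11201-12349 of the complementary strand of SEQ ID NO: 257 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 15538-16953 of the complementary strand of SEQ ID NO: 257 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 9726-10673 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 12336-13421 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 13418-14260 of the complementary strand of SEQ ID NO: 257 (referred to herein as GT3);
(d): a nucleotide sequence encoding a polypeptide having nucleotide sugar dehydrogenase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 7727-8173 of the complementary strand of SEQ ID NO: 257 (referred to herein as nucleotide sugar dehydrogenase); and
(e): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 10657-11211 of the complementary strand of SEQ ID NO: 257 (referred to herein as acetyltransferase);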
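(x)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 10707-11846 of the complementary strand of SEQ ID NO: 274 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 15037-16476 of the complementary strand of SEQ ID NO: 274 (referred to herein as wzx);
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 9232-10179 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 11833-12918 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT2); and
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 12915-13757 of the complementary strand of SEQ ID NO: 274 (referred to herein as GT3);
(d): a nucleotide sequence encoding a polypeptide having nucleotide sugar dehydrogenase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 7234-7680 of the complementary strand of SEQ ID NO: 274 (referred to herein as nucleotide sugar dehydrogenase); and
(e): a nucleotide sequence encoding a polypeptide having acetyltransferase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 10163-10717 of the complementary strand of SEQ ID NO: 274 (referred to herein as acetyltransferase).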
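Several items in claims 1 and 2 identify a gene by a nucleotide range within a longer sequence, sometimes on the complementary strand. The sketch below shows one way such a range could be pulled out of a sequence string. It rests on two assumptions that this section does not spell out: coordinates are 1-based and inclusive, and "complementary strand" means the reverse complement of the stated interval. The helper names are hypothetical.

```python
# Complement table for unambiguous DNA letters (lower and upper case).
_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def region_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based, inclusive range."""
    return end - start + 1


def extract_region(sequence: str, start: int, end: int,
                   complementary_strand: bool = False) -> str:
    """Return nucleotides start..end (1-based, inclusive).

    With complementary_strand=True the interval is returned as its
    reverse complement (assumed reading of "complementary strand").
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates outside the sequence")
    region = sequence[start - 1:end]
    if complementary_strand:
        region = region.translate(_COMPLEMENT)[::-1]
    return region
```

Under this convention, nucleotides 13939-15042 cover `region_length(13939, 15042)` = 1104 positions, a multiple of three, as expected for a protein-coding interval.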
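3. The texturing lactic acid bacterium strain according to item (iv), item (vii), item (ix) or item (x) of claim 1 or 2, wherein the texturing lactic acid bacterium strain is a strain that produces a fermented milk having a shear stress of 45 Pa or higher, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.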
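The measurement conditions of claim 3 can be collected into a small record, which makes the numbers easy to reuse and check. The sketch below is illustrative only: the class and function names are hypothetical, the field values are the ones stated in the claim, and the shear stress value itself is assumed to come from a rheometer reading supplied by the user.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MilkAssay:
    """Conditions stated in claim 3 (field names are illustrative)."""
    milk_ml: float = 200.0          # semi-skimmed milk, 1.5% fat
    inoculum_ml: float = 2.0        # overnight culture
    heat_temp_c: float = 90.0
    heat_time_min: float = 20.0
    inoculation_temp_c: float = 30.0
    end_ph: float = 4.55
    storage_temp_c: float = 4.0
    shear_rate_per_s: float = 300.0

    def inoculation_percent(self) -> float:
        """Inoculum volume as a percentage of the milk volume."""
        return 100.0 * self.inoculum_ml / self.milk_ml


def is_texturing(shear_stress_pa: float, threshold_pa: float = 45.0) -> bool:
    """True if the measured shear stress meets the claimed threshold."""
    return shear_stress_pa >= threshold_pa


assay = MilkAssay()
inoculation = assay.inoculation_percent()
```

With the stated volumes, 2 ml of culture in 200 ml of milk corresponds to an inoculation level of 1% (v/v). Other claims use the same protocol with different thresholds (for example 40 Pa, or more than 24 Pa in soy milk), which is why `threshold_pa` is a parameter.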
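4. The texturing lactic acid bacterium strain according to claim 3, wherein the strain is selected from the group consisting of: DSM 33137, DSM 33140, DSM 33142 or DSM 33183, and strains derived from DSM 33137, DSM 33140, DSM 33142 or DSM 33183, wherein the derived strain is characterized by having at least the same texturing capability as DSM 33137, DSM 33140, DSM 33142 or DSM 33183, respectively, and wherein the texturing capability means producing a fermented milk having a shear stress of 45 Pa or higher, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.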
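5. The texturing lactic acid bacterium strain according to any one of the preceding claims, wherein the texturing lactic acid bacterium strain is a strain that, in the presence of the lactic acid bacterium strain Lactococcus lactis subsp. cremoris DSM 25485 or a mutant or variant thereof, and/or in the presence of the lactic acid bacterium strain Lactococcus lactis subsp. lactis DSM 33192 or a mutant or variant thereof, at a ratio of about 9:1 (texturing lactic acid bacterium strain : strain DSM 25485 and/or DSM 33192), produces a fermented milk having a shear stress of 40 Pa or higher, measured at a shear rate of 300 s⁻¹, wherein the shear stress is measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.
6. The texturing lactic acid bacterium strain of item (i), (ii), (iii), (v), (vi) and/or (viii) of claim 1 or 2, wherein the strain is selected from strain DSM 33134 or DSM 33135 or DSM 33136 or DSM 33138 or DSM 33139 or DSM 33141, and strains derived from DSM 33134, DSM 33135, DSM 33136, DSM 33138 or DSM 33139 or DSM 33141, wherein the derived strain is characterized by having at least the same texturing capability as DSM 33134, DSM 33135, DSM 33136, DSM 33138 or DSM 33139 or DSM 33141, respectively, and wherein the texturing capability means producing, in the presence of the lactic acid bacterium strain Lactococcus lactis subsp. cremoris DSM 25485 or a mutant or variant thereof, and/or in the presence of the lactic acid bacterium strain Lactococcus lactis subsp. lactis DSM 33192 or a mutant or variant thereof, at a ratio of about 9:1 (texturing lactic acid bacterium strain : strain DSM 25485 and/or DSM 33192), a fermented milk having a shear stress of 40 Pa or higher, measured at a shear rate of 300 s⁻¹, wherein the shear stress is measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.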
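Claims 5 and 6 combine the texturing strain with a second strain at a ratio of about 9:1. The claims do not say on what basis the ratio is taken. Purely as an illustration, the sketch below assumes it is applied by volume to the 2 ml of overnight culture used for inoculation; the function name is hypothetical.

```python
def split_inoculum(total_ml: float, ratio=(9, 1)):
    """Split a total inoculum volume according to a ratio.

    Assumption: the ratio is applied by culture volume, which the
    claims do not state explicitly.
    """
    if total_ml <= 0 or any(share <= 0 for share in ratio):
        raise ValueError("volume and ratio shares must be positive")
    parts = sum(ratio)
    return tuple(total_ml * share / parts for share in ratio)


# Texturing strain first, DSM 25485 and/or DSM 33192 second.
texturing_ml, helper_ml = split_inoculum(2.0)
```

Under that assumption a 2 ml inoculum would consist of roughly 1.8 ml of the texturing strain culture and 0.2 ml of the second culture. If the ratio is instead meant by cell count, the volumes would have to be adjusted for culture density.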
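7. The texturing lactic acid bacterium strain according to claim 1 or 2, wherein the texturing lactic acid bacterium strain is a strain that produces a fermented milk having a shear stress greater than 24 Pa, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of soy milk supplemented with 2% glucose, as described in Example 2, is inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain and kept at the inoculation temperature until pH 4.56, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹ as described in Example 2, wherein the inoculation temperature is 30°C.
8. The texturing lactic acid bacterium strain according to claim 7, wherein the texturing lactic acid bacterium strain is a strain selected from the following strains: DSM 33138, DSM 33140, DSM 33136, DSM 33135, DSM 33141, DSM 33137, DSM 33134, DSM 33139, DSM 33142, DSM 33192, DSM 25485 or DSM 33183, and strains derived from DSM 33138, DSM 33140, DSM 33136, DSM 33135, DSM 33141, DSM 33137, DSM 33134, DSM 33139, DSM 33142, DSM 33192, DSM 25485 or DSM 33183, wherein the derived strain is characterized by having at least the same texturing capability as DSM 33138, DSM 33140, DSM 33136, DSM 33135, DSM 33141, DSM 33137, DSM 33134, DSM 33139, DSM 33142, DSM 33192, DSM 25485 or DSM 33183, respectively, wherein the texturing capability means producing a fermented milk having a shear stress greater than 24 Pa, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of soy milk supplemented with 2% glucose, as described in Example 2, is inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain and kept at the inoculation temperature until pH 4.56, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹ as described in Example 2, wherein the inoculation temperature is 30°C.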
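9. A composition comprising the Lactococcus lactis lactic acid bacterium strain of any one of claims 1-6 and one or more other lactic acid bacterium strains, wherein the one or more other lactic acid bacterium strains are capable of:
i) producing a fermented milk with a pH of about 4.55 in 15 h or less, preferably in 12 h or less, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature (30°C), inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until a pH of about 4.55 is reached;
ii) producing a fermented milk having a shear stress of 40 Pa or higher, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55 (the time to reach pH 4.55), then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.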
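10. The composition according to claim 9, wherein the composition comprises the Lactococcus lactis lactic acid bacterium strain of any one of claims 1-6 and (a) at least one lactic acid bacterium strain comprising an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster comprises the nucleotide sequences defined in (a), (b) and (c) of (xi), or (b) at least one Lactococcus lactis lactic acid bacterium strain comprising an active eps gene cluster capable of producing exopolysaccharide (EPS), wherein the eps gene cluster is as defined in (xii):
(xi)(a): a nucleotide sequence encoding a polypeptide having polymerase activity and at least 95% identity with the amino acid sequence encoded by nucleotides 6955-8145 of SEQ ID NO: 183 (referred to herein as wzy);
(b): a nucleotide sequence encoding a polypeptide having polysaccharide transporter activity and at least 95% identity with the amino acid sequence encoded by nucleotides 9309-10727 of SEQ ID NO: 183 (referred to herein as wzx); and
(c): nucleotide sequences encoding polypeptides having glycosyltransferase (GT) activity, comprising:
(c1): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4008-4478 of SEQ ID NO: 183 (referred to herein as GT1);
(c2): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 4478-4960 of SEQ ID NO: 183 (referred to herein as GT2);
(c3): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 5015-5965 of SEQ ID NO: 183 (referred to herein as GT3); and
(c4): a nucleotide sequence having at least 95% identity with the amino acid sequence encoded by nucleotides 6026-6955 of SEQ ID NO: 183 (referred to herein as GT4);
(xii) SEQ ID NO: 290.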
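11. The composition according to any one of claims 9-10, wherein the composition comprises the Lactococcus lactis lactic acid bacterium strain of any one of claims 1-6 and:
(i) the LAB strain Lactococcus lactis subsp. cremoris DSM 25485 or a mutant or variant thereof;
(ii) the lactic acid bacterium strain Lactococcus lactis subsp. lactis DSM 33192 or a mutant or variant thereof; and/or
(iii) the lactic acid bacterium strain Lactococcus lactis DSM 33133 or a mutant or variant thereof.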
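12. Use of a lactic acid bacterium strain as defined in any one of claims 1-8 or a composition as defined in any one of claims 9-11 for increasing the viscosity of a fermented milk product.
13. A method for producing a food product, comprising at least one stage in which at least one lactic acid bacterium strain as defined in any one of claims 1-8 or a composition as defined in any one of claims 9-11 is used.
14. The method according to claim 13, wherein the food product is a dairy product and the method comprises fermenting a milk substrate with at least one lactic acid bacterium strain as defined in any one of claims 1-8 and/or a composition as defined in any one of claims 9-11.
15. A food product comprising at least one lactic acid bacterium strain as defined in any one of claims 1-8 or a composition as defined in any one of claims 9-11.
16. Use of Lactococcus lactis subsp. cremoris strain DSM 25485 and/or Lactococcus lactis subsp. lactis strain DSM 33192 for increasing the viscosity of a fermented milk product, such as a mammalian milk-based fermented milk product or a plant milk-based fermented milk product.
17. The use according to claim 16, wherein the fermented milk product has a shear stress of 50 Pa or higher, preferably 55 Pa or higher, measured at a shear rate of 300 s⁻¹, when measured under the following conditions:
200 ml of semi-skimmed milk (1.5% fat) is heated to 90°C for 20 min, then cooled to the inoculation temperature, inoculated with 2 ml of an overnight culture of the lactic acid bacterium strain, and kept at the inoculation temperature until pH 4.55, then stored at 4°C until the shear stress is measured, typically for 1-7 days, for example 5 days, then gently stirred, and the shear stress is measured at a shear rate of 300 s⁻¹, wherein the inoculation temperature is 30°C.
18. The use according to any one of claims 16-17, wherein (i) Lactococcus lactis subsp. cremoris strain DSM 25485 and/or (ii) Lactococcus lactis subsp. lactis strain DSM 33192 is used in combination with at least one Lactococcus lactis lactic acid bacterium strain as defined in any one of claims 1-6.
19. The strain Lactococcus lactis subsp. lactis DSM 33192.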
CN202080067888.4A 2019-08-23 2020-08-21 Texturing Lactococcus lactis with unique eps gene clusters Pending CN114502719A (zh)

Applications Claiming Priority (23)

Application Number Priority Date Filing Date Title
EP19193305 2019-08-23
EP19193295 2019-08-23
EP19193316.7 2019-08-23
EP19193312 2019-08-23
EP19193303 2019-08-23
EP19193307 2019-08-23
EP19193310.0 2019-08-23
EP19193299 2019-08-23
EP19193315 2019-08-23
EP19193312.6 2019-08-23
EP19193308.4 2019-08-23
EP19193308 2019-08-23
EP19193315.9 2019-08-23
EP19193299.5 2019-08-23
EP19193305.0 2019-08-23
EP19193295.3 2019-08-23
EP19193310 2019-08-23
EP19193313 2019-08-23
EP19193316 2019-08-23
EP19193313.4 2019-08-23
EP19193303.5 2019-08-23
EP19193307.6 2019-08-23
PCT/EP2020/073522 WO2021037738A1 (en) 2019-08-23 2020-08-21 Texturing l. lactis with unique eps gene clusters

Publications (1)

Publication Number Publication Date
CN114502719A true CN114502719A (zh) 2022-05-13

Family

ID=72088142

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080067888.4A Pending CN114502719A (zh) 2019-08-23 2020-08-21 Texturing Lactococcus lactis with unique eps gene clusters

Country Status (4)

Country Link
US (1) US20220403323A1 (zh)
EP (1) EP4017867A1 (zh)
CN (1) CN114502719A (zh)
WO (1) WO2021037738A1 (zh)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013093049A2 (en) * 2011-12-23 2013-06-27 Chr. Hansen A/S Method for making cheese
CN108779432A (zh) * 2015-12-22 2018-11-09 Chr. Hansen A/S Novel eps gene cluster of texturing lactic acid bacteria

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
POULSEN, V.K ET AL: "Lactococcus lactis strain LII5 Eps gene cluster,complete sequence", GENBANK: MH678627, 14 January 2019 (2019-01-14), pages 1 - 4 *

Also Published As

Publication number Publication date
US20220403323A1 (en) 2022-12-22
WO2021037738A1 (en) 2021-03-04
EP4017867A1 (en) 2022-06-29

Similar Documents

Publication Publication Date Title
DK3209381T3 (en) Compositions comprising bacterial strains
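CN108138122B (zh) Immune modulation
KR101914245B1 (ko) Compositions containing bacterial strains
AU2021201338B2 (en) Complete genome sequence of the methanogen Methanobrevibacter ruminantium
CN100390283C (zh) Genomic sequence of Chlamydia pneumoniae and polypeptides and fragments thereof, and uses thereof, in particular for the diagnosis, prevention and treatment of infection
AU2016357553A1 (en) Compositions comprising bacterial strains
KR20180012846A (ko) Compositions containing bacterial strains
AU2015327511B2 (en) Biomarkers for rheumatoid arthritis and usage thereof
JPH09322781A (ja) Staphylococcus aureus polynucleotides and sequences
KR102191537B1 (ko) Selection of lactic acid bacteria that prevent bone loss in mammals and use thereof
CN107208068A (zh) Novel Shiga toxin-producing F18-type Escherichia coli bacteriophage Esc-COP-1 and use thereof for inhibiting proliferation of Shiga toxin-producing F18-type Escherichia coli
AU2022256122A1 (en) Novel proteins from anaerobic fungi and uses thereof
CN107208067A (zh) Novel enteroinvasive Escherichia coli bacteriophage Esc-COP-4 and use thereof for inhibiting proliferation of enteroinvasive Escherichia coli
KR102064765B1 (ko) Novel bacteriophage inhibiting the proliferation of pathogenic Escherichia coli and use thereof
JPH09252787A (ja) Nucleotide sequence of the Mycoplasma genitalium genome or fragments thereof and use thereof
KR20200019882A (ko) Compositions comprising bacterial strains
CN112243377A (zh) Bacteriophage for the treatment and prevention of bacteria-associated cancers
AU2016295176A1 (en) Genetic testing for predicting resistance of gram-negative Proteus against antimicrobial agents
CN109517069A (zh) High-efficiency protein expression system for expressing Bt insecticidal proteins
KR20140140698A (ko) Bacteriophage of Escherichia coli and use thereof
KR20220024508A (ko) Biologically contained bacteria and uses thereof
KR20220041204A (ko) Marinomonas ef1 and Rhodococcus ef1 for the production of secondary metabolites
KR101993123B1 (ko) Novel pathogenic Escherichia coli-specific bacteriophage ECO5 and antibacterial composition comprising the same
KR102411380B1 (ko) Bacillus subtilis strain with excellent surfactin and enzyme production capacity and use thereof
KR20200003039A (ko) Targeted gene disruption methods and immunogenic compositions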

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination