CN1973043A - RSF1010-derived Mob-deficient plasmid containing no antibiotic resistance gene, bacterium comprising the plasmid, and method for producing useful metabolites
- Publication number
- CN1973043A CNA2005800212254A CN200580021225A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/70—Vectors or expression systems specially adapted for E. coli
Abstract
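The present invention provides a Mob- plasmid that has the RSF1010 replicon and comprises genes encoding the Rep proteins, the plasmid having been modified to inactivate the genes related to mobilization ability. The invention also describes a bacterium that comprises the plasmid and has the ability to produce a useful metabolite, the bacterium lacking active thymidylate synthase encoded by the thyA gene and thymidine kinase encoded by the tdk gene, as well as a method for producing useful metabolites, such as native or recombinant proteins, enzymes, L-amino acids, nucleosides and nucleotides, organic acids, and vitamins, using the bacterium.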
Description
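Technical Field
The present invention relates to mutant vectors and their use. More specifically, it relates to a broad-host-range RSF1010-derived Mob- plasmid that contains no antibiotic resistance genes. The invention also relates to a bacterium comprising the plasmid and to a method for producing useful metabolites using the bacterium.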
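Background Art
RSF1010 is a well-known IncQ-group plasmid that is mobilizable but not self-transmissible, and it has a remarkable capacity to replicate in a broad range of bacterial hosts, including most gram-negative bacteria (Frey, J. and Bagdasarian, M., The molecular biology of IncQ plasmids. In: Thomas, C.M. (Ed.), Promiscuous Plasmids of Gram Negative Bacteria. Academic Press, London, 1989, p. 79-94). The nucleotide sequence of the RSF1010 plasmid is known (Scholz, P. et al., Gene, 75(2), 271-288 (1989); GenBank accession number M28829, gi:152577), and the functional organization of the plasmid has been studied very thoroughly. RSF1010 contains a unique origin of vegetative DNA replication, oriV (De Graaf, J. et al., J. Bacteriol., 134, 1117-1122 (1978); Haring, V. and Scherzinger, E., Replication Proteins of the IncQ plasmid RSF1010. In: Thomas, C.M. (Ed.), Promiscuous Plasmids of Gram Negative Bacteria. Academic Press, London, 1989, p. 95-124), together with repA, repB, repB′ and repC, the genes essential for plasmid replication (Scherzinger, E. et al., Proc. Natl. Acad. Sci. USA, 81, 654-658 (1984); Scherzinger, E. et al., Nucleic Acids Res., 19, 1203-1211 (1991); Scholz, P. et al., Replication determinants of the broad-host-range plasmid RSF1010. In: Helinski, D.R. et al. (Eds.), Plasmids in Bacteria, Plenum Press, New York, 1984, p. 243-259). RSF1010 also contains the relaxation complex site and origin of conjugative DNA transfer, oriT; the genes mobA (which includes the repB gene in an alternative frame), mobB and mobC (the mob locus), which encode trans-active proteins involved in plasmid mobilization (Nordheim, A. et al., J. Bacteriol., 144, 923-932 (1980); Derbyshire, K.M. et al., Mol. Gen. Genet., 206, 161-168 (1987)); and the sulfonamide resistance (SulR) and streptomycin resistance (StrR) genes (the sul and str genes, respectively) (Scholz, P. et al., Gene, 75(2), 271-288 (1989)).
The promoters that drive expression of the plasmid proteins were located on the RSF1010 physical map by electron microscopy (Bagdasarian, J. Frey, and K. Timmis, Gene 16, 237-247 (1981)) and were confirmed once the complete plasmid sequence became available (Scholz, P. et al., Gene, 75(2), 271-288 (1989)).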
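Initiation of RSF1010 replication requires three plasmid-encoded proteins, RepA, RepB and RepC, encoded by the repA, repB and repC genes, respectively. RepC recognizes the origin of replication (within the repeated sequences) and positively regulates the initiation of replication; RepA has helicase activity; RepB and RepB* (two proteins encoded in the same frame but initiated at different codons) have RSF1010-specific primase activity in vitro. Replication of RSF1010 depends on the host's DNA polymerase III and gyrase. RSF1010 can be mobilized from one gram-negative bacterium to another by the tra functions of plasmids of the incompatibility groups IncI-α, IncM, IncX and, most notably, IncP (Derbyshire, K.M. et al., Mol. Gen. Genet., 206, 161-168 (1987)).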
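In Escherichia coli, RSF1010 is present at 12 copies per cell (Bagdasarian, M.M. et al., Regulation of the rep operon expression of the broad-host range plasmid RSF1010. In: Novick, R. and Levy, S. (Eds.), Evolution and Environmental Spread of Antibiotic Resistance Genes. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1986, p. 209-223). The structural organization of the oriT region, located between the mobC and mobB genes, is quite complex. However, this region is known to be required for the initiation of transfer and also to contain promoters essential for plasmid replication. Elimination of different genes involved in plasmid mobilization has been shown to alter plasmid properties in unpredictable ways. For example, deletion of the mobC gene, which encodes a regulatory protein, leads to a significant increase in plasmid copy number (Frey, J. et al., Gene, 113, 101-106 (1992)). This may be why no RSF1010 variant lacking all the known sequences essential for mobilization has been described to date.
No studies on the stability of RSF1010 and its derivatives have been described so far. Moreover, although the sequence of RSF1010 is known, no plasmid stability determinant has yet been identified, either by functional analysis or by molecular analysis.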
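Biosafety requirements place considerable biological constraints on recombinant strains. The Biosafety Level 1 (BL1) system described in the "Guidelines for research involving recombinant DNA molecules" published by the NIH on May 7, 1987 corresponds to some of these constraints. For example, it is essential that the plasmid cannot be transmitted to other organisms if the recombinant microorganism is accidentally released into the natural environment. Similar provisions are described in European directives, for example Council Directive 90/220/EEC of April 23, 1990 on the deliberate release into the environment of genetically modified organisms, and Council Directive 98/81/EC of October 26, 1998 amending Directive 90/219/EEC on the contained use of genetically modified micro-organisms.
Vectors for gram-negative bacteria that comprise an origin of replication functional in gram-negative bacteria and the par region of plasmid RP4, and that lack mobilization functions, have been disclosed (U.S. Patent 5,670,343). The vectors of that invention cannot be mobilized from one gram-negative bacterium to another. They therefore form class 1 host-vector systems with these bacteria and comply with industrial regulations. Such a system, in both Escherichia coli and Pseudomonas putida, assumes the use of a non-conjugative and non-mobilizable plasmid. This highly advantageous property of the vectors was obtained specifically by deleting the region containing the mob locus. These novel cloning and/or expression vectors, which have a broad host range in gram-negative bacteria, can be used for the production of recombinant proteins or metabolites by host cells containing them.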
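To date, the genetic engineering of microorganisms has relied almost entirely on antibiotic resistance genes, both for genetically marking recipient cells and for identifying and maintaining the plasmids used as vectors in genetic engineering protocols. If a strain carries an antibiotic resistance gene, release of the genetically modified organism (GMO) into the general environment, or its use in agriculture, food processing or the health care industry, is likely to be prohibited by regulatory agencies. There is therefore a clear need for marker genes that can replace antibiotic resistance genes and whose presence is of so little consequence that GMOs carrying the replacement markers would likely be cleared by regulatory agencies.
The thymidylate synthase (TS) gene has previously been described as suitable for replacing antibiotic resistance genes as a selective marker (European patent application EP0406003A1). Specifically, the thymidylate synthase gene from Streptococcus lactis, a bacterium commonly used in cheese manufacture (and therefore recognized as a safe microorganism), was found to be a suitable candidate marker gene that can substitute for antibiotic resistance genes, particularly as a "food grade" marker gene. Thymidylate synthase (5,10-methylenetetrahydrofolate:dUMP C-methyltransferase; EC 2.1.1.45) plays a key role in DNA synthesis: it catalyzes the reductive methylation of dUMP to dTMP, with concomitant conversion of the cofactor 5,10-methylenetetrahydrofolate to 7,8-dihydrofolate. This activity is an essential step in the de novo biosynthesis of DNA. Cells that have lost TS activity through a mutation in the TS gene cannot make DNA and cannot survive unless they are supplied with thymine or thymidine, which is converted to dTMP by an alternative pathway. Strains lacking thymidylate synthase activity (TS-) are easily distinguished from normal TS+ strains: in a chemically defined growth medium that supports growth of TS+ strains, TS- cells die unless the medium is supplemented with thymine or thymidine. Moreover, in a medium or environment without sufficient thymine or thymidine, a cloning vector plasmid carrying the S. lactis TS gene is stably maintained in TS- cells, because loss of the plasmid results in cell death.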
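Summary of the Invention
An object of the present invention is to provide a broad-host-range Mob- vector derived from the RSF1010 plasmid and free of antibiotic resistance genes; to provide a bacterium that contains the vector and lacks thymidylate synthase and thymidine kinase activities, thereby giving a very stable vector-host system; and to provide a method for producing useful metabolites using the bacterium.
This object was achieved by constructing an RSF1010-derived plasmid that contains neither the genes related to mobilization ability nor antibiotic resistance genes. In addition, a thymidylate synthase gene was introduced into the constructed plasmid as a selective marker, and the plasmid was used to transform a bacterium lacking active thymidylate synthase and thymidine kinase genes. As a result, the thymidylate synthase gene on the plasmid serves not only as a selective marker but also as a factor that stabilizes the plasmid in the bacterium. The present invention has thus been completed.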
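It is an object of the present invention to provide an RSF1010-derived Mob- plasmid, wherein the plasmid is selected from the group consisting of SEQ ID NO: 24, SEQ ID NO: 27 and SEQ ID NO: 48, and variants of SEQ ID NO: 24, SEQ ID NO: 27 and SEQ ID NO: 48 that are at least 95% homologous thereto, and wherein the plasmid has been modified to inactivate one or more genes related to mobilization ability.
It is a further object of the present invention to provide the plasmid as described above, wherein the plasmid has been modified to inactivate the antibiotic resistance genes.
It is a further object of the present invention to provide the plasmid as described above, wherein the plasmid has been modified to increase its copy number.
It is a further object of the present invention to provide the plasmid as described above, which comprises the PlacUV5 promoter and the origin of replication from RSF1010 lacking the mob locus.
It is a further object of the present invention to provide the plasmid as described above, which additionally comprises a thymidylate synthase gene.
It is a further object of the present invention to provide the plasmid as described above, which additionally comprises a gene of interest.
It is a further object of the present invention to provide a bacterium comprising the plasmid described above.
It is a further object of the present invention to provide the bacterium as described above, wherein the bacterium is a gram-negative bacterium.
It is a further object of the present invention to provide the bacterium as described above, wherein the bacterium lacks active thymidylate synthase and active thymidine kinase.
It is a further object of the present invention to provide the bacterium as described above, wherein the bacterium has the ability to produce a useful metabolite.
It is a further object of the present invention to provide the bacterium as described above, wherein the useful metabolite is selected from the group consisting of native or recombinant proteins, enzymes, L-amino acids, nucleosides, nucleotides, organic acids and vitamins.
It is a further object of the present invention to provide a method for producing a useful metabolite, comprising
(a) cultivating the bacterium described above in a culture medium, and
(b) collecting the useful metabolite from the culture medium.
It is a further object of the present invention to provide the method as described above, wherein the useful metabolite is selected from the group consisting of native or recombinant proteins, enzymes, L-amino acids, nucleosides, nucleotides, organic acids and vitamins.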
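Brief Description of the Drawings
Figure 1 shows the structure of the RSF1010 plasmid.
Figure 2 shows the structure of the pBluescript::lacIrepB plasmid.
Figure 3 shows the structure of the RSF1010mob- plasmid.
Figure 4 shows the sequences of the wild-type and improved thyA promoter regions. The -35 and -10 regions are underlined. Substitutions in the -10, -14 and -15 regions are shown in bold.
Figure 5 shows the structure of the RSF1010-MT plasmid.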
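Description of the Preferred Embodiments
The RSF1010-derived Mob- plasmid of the present invention includes plasmids constructed from the RSF1010 plasmid in which the genes related to mobilization ability have been inactivated.
As used herein, the phrase "RSF1010-derived Mob- plasmid" means the RSF1010 plasmid defined below and shown in SEQ ID NO: 1, and variants thereof, in which the genes related to mobilization ability have been inactivated. Examples of RSF1010-derived Mob- plasmids are provided in Figures 3 and 5, and their DNA sequences are disclosed in SEQ ID NOs: 24, 27 and 48.
As used herein, a "derivative" of a plasmid means another plasmid composed of a part of the plasmid of the present invention and/or another DNA sequence. "A part of the plasmid" means a part containing the regions essential for autonomous replication of the plasmid, such as the origin of replication (ori) and the genes essential for replication (rep), so that replication is maintained in the bacterium.
Genes related to mobilization include, but are not limited to, mobA, mobB, mobC and oriT. The locations of the genes on plasmid RSF1010 are shown in Table 1.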
Table 1
Gene | Protein | Position in SEQ ID NO: 1 | SEQ ID NO: |
strA | Sm resistance protein A | 63-866 | 2 |
strB | Sm resistance protein B | 866-1702 | 4 |
oriV | Origin of replication | 2347-2771 | |
mobC | Mobilization protein C | complement(2767-3051) | 6 |
mobA | Mobilization protein A | 3250-5379 | 8 |
mobB | Mobilization protein B | 3998-4411 | 10 |
repB | Replication protein B | 4408-5379 | 12 |
orfE | Unknown protein E | 5440-5652 | 14 |
orfF | Repressor protein F | 5654-5860 | 16 |
repA | Replication protein A | 5890-6729 | 18 |
repC | Replication protein C | 6716-7567 | 20 |
sul | Su resistance protein | 7875-8663 | 22 |
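The nucleotide sequence of the RSF1010 plasmid is known (Scholz, P. et al., Gene, 75(2), 271-288 (1989); GenBank accession number M28829, gi:152577) and is shown in SEQ ID NO: 1. RSF1010 contains a unique origin of vegetative DNA replication, oriV; the genes repA, repB, repB′ and repC, which encode the essential replication proteins; the relaxation complex site and origin of conjugative DNA transfer, oriT; the genes mobA, mobB and mobC, which encode trans-active proteins involved in plasmid mobilization; and the sulfonamide and streptomycin resistance (StrR) genes (the sul gene and the strA and strB genes, respectively).
The RSF1010 plasmid contains genes encoding the Rep proteins, which have the amino acid sequences shown in SEQ ID NOs: 13, 19 and 21.
The rep genes are the repA, repB and repC genes from RSF1010 or homologs thereof. The repA, repB and repC genes include genes encoding proteins having the amino acid sequences of SEQ ID NOs: 13, 19 and 21. A rep gene homolog may be a gene encoding a protein that has replication ability and is 70% or more, preferably 80% or more, more preferably 90% or more, still more preferably 95% or more, and particularly preferably 98% or more homologous to the entire amino acid sequence of SEQ ID NO: 13, 19 or 21. Homology of amino acid sequences and DNA sequences can be determined using the BLAST algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA, 90, 5873 (1993)) and FASTA (Methods Enzymol., 183, 63 (1990)). The programs BLASTN and BLASTX were developed on the basis of the BLAST algorithm (see http://www.ncbi.nlm.nih.gov).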
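As a rough illustration of how the homology thresholds above might be applied, the following minimal sketch computes percent identity over two sequences that have already been aligned and reports which of the stated thresholds are met. It is not BLAST or FASTA; the function names and the toy sequences are hypothetical, and a real comparison against SEQ ID NOs: 13, 19 and 21 would use an alignment program.

```python
# Percent levels named in the text for rep gene homologs.
THRESHOLDS = (70, 80, 90, 95, 98)

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns with identical residues.

    Both inputs must already be aligned to the same length; a gap
    character '-' in either sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)

def thresholds_met(identity: float) -> list:
    """Return the thresholds from the text that the identity value reaches."""
    return [t for t in THRESHOLDS if identity >= t]

# Toy ten-residue sequences (hypothetical, for illustration only).
toy_reference = "MKNDRTLQAI"
toy_variant = "MKNDRTLQSI"
identity = percent_identity(toy_reference, toy_variant)
print(identity, thresholds_met(identity))
```

Identity is only one of the quantities an alignment program reports; the text leaves the exact scoring to BLAST or FASTA.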
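In addition, the rep genes of the present invention are not limited to the wild-type genes and may be mutant or artificially modified genes encoding variants of the amino acid sequences of SEQ ID NOs: 13, 19 and 21. The encoded protein may include substitutions, deletions or insertions of one or several amino acid residues at one or more positions, provided that the function of the encoded Rep protein, namely replication ability, is retained. Although the number of "several" amino acid residues referred to here differs depending on the position in the three-dimensional structure and the type of amino acid residue, it may be 2 to 20, preferably 2 to 10, and more preferably 2 to 5. Amino acid substitutions are preferably conservative substitutions, including substitution of ser or thr for ala; gln, his or lys for arg; glu, gln, lys, his or asp for asn; asn, glu or gln for asp; ser or ala for cys; asn, glu, lys, his, asp or arg for gln; gly, asn, gln, lys or asp for glu; pro for gly; asn, lys, gln, arg or tyr for his; leu, met, val or phe for ile; ile, met, val or phe for leu; asn, glu, gln, his or arg for lys; ile, leu, val or phe for met; trp, tyr, met, ile or leu for phe; thr or ala for ser; ser or ala for thr; phe or tyr for trp; his, phe or trp for tyr; and met, ile or leu for val. The substitutions, deletions or insertions of one or several nucleotides described above also include naturally occurring mutations arising from individual differences and from differences between the species of microorganisms (mutants or variants) that harbor the rep genes.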
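The conservative substitutions listed above can be written down as a lookup table. The sketch below is only an illustration of that list: the dictionary maps each original residue to the replacements the text names, and the helper name `is_conservative` is hypothetical.

```python
# Original residue -> replacements named as conservative in the text
# ("ser or thr for ala" means Ala may be replaced by Ser or Thr).
CONSERVATIVE = {
    "ala": {"ser", "thr"},
    "arg": {"gln", "his", "lys"},
    "asn": {"glu", "gln", "lys", "his", "asp"},
    "asp": {"asn", "glu", "gln"},
    "cys": {"ser", "ala"},
    "gln": {"asn", "glu", "lys", "his", "asp", "arg"},
    "glu": {"gly", "asn", "gln", "lys", "asp"},
    "gly": {"pro"},
    "his": {"asn", "lys", "gln", "arg", "tyr"},
    "ile": {"leu", "met", "val", "phe"},
    "leu": {"ile", "met", "val", "phe"},
    "lys": {"asn", "glu", "gln", "his", "arg"},
    "met": {"ile", "leu", "val", "phe"},
    "phe": {"trp", "tyr", "met", "ile", "leu"},
    "ser": {"thr", "ala"},
    "thr": {"ser", "ala"},
    "trp": {"phe", "tyr"},
    "tyr": {"his", "phe", "trp"},
    "val": {"met", "ile", "leu"},
}

def is_conservative(original: str, replacement: str) -> bool:
    """True if the text lists `replacement` as a conservative substitute for `original`."""
    return replacement.lower() in CONSERVATIVE.get(original.lower(), set())

print(is_conservative("ala", "ser"))  # listed in the text
print(is_conservative("gly", "ala"))  # not listed in the text
```

The list is directional as written: pro is named as a replacement for gly, but no replacement is named for pro itself, so the table has 19 keys.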
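Such genes can be obtained by modifying the nucleotide sequences shown in SEQ ID NOs: 12, 18 and 20, for example by site-specific mutagenesis, so that one or more substitutions, deletions or insertions are introduced at specific sites of the protein encoded by the gene.
Such genes can also be obtained by conventional mutagenesis treatments such as those mentioned below. Examples include treating a gene having the nucleotide sequence shown in SEQ ID NO: 12, 18 or 20 with hydroxylamine in vitro, and treating a microorganism, such as an Escherichia bacterium harboring RSF1010, with ultraviolet irradiation or with a mutagen commonly used for mutation treatments, such as N-methyl-N'-nitro-N-nitrosoguanidine (NTG) or EMS (ethyl methanesulfonate).
The rep genes also include DNA that hybridizes under stringent conditions with the nucleotide sequences of SEQ ID NOs: 12, 18 and 20, or with probes prepared from these sequences, and that encodes a protein having replication ability. "Stringent conditions" as used herein are conditions under which so-called specific hybrids are formed and non-specific hybrids are not. It is difficult to express these conditions clearly in numerical terms. However, examples of stringent conditions include conditions under which DNAs that are highly homologous to each other, for example DNAs with not less than 50% homology, hybridize with each other, while DNAs with less than 50% homology do not; and conditions under which DNAs hybridize at a salt concentration corresponding to the ordinary washing conditions of Southern hybridization, that is, washing once or preferably two to three times in 1×SSC, 0.1% SDS at 60°C, preferably 0.1×SSC, 0.1% SDS at 60°C, and more preferably 0.1×SSC, 0.1% SDS at 68°C.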
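The DNA encoding the mobilization proteins, and the other genes used in the present invention, can be obtained by procedures similar to those described above for the Rep proteins.
As used herein, the phrase "inactivating one or more genes related to mobilization ability" means abolishing the ability of the plasmid to be transferred from one cell to another. The genes related to mobilization ability include mobA, mobB and mobC. Examples of methods for inactivating the genes include mutating or deleting a part of a gene selected from mobA, mobB and mobC. Examples of methods for mutating or deleting a gene include modifying expression regulatory sequences such as the promoter and the Shine-Dalgarno (SD) sequence; introducing a missense, nonsense or frameshift mutation into the open reading frame; and deleting a part of the gene (J. Biol. Chem., 1997, 272(13): 8611-7) or the entire region encoding the mobilization protein. The mutated gene can be introduced into the microorganism by homologous recombination or by using a transposon or an IS element, so that the wild-type gene on the chromosome is replaced by the mutated gene. Homologous recombination techniques include methods using linear DNA, a temperature-sensitive plasmid, or a non-replicable plasmid. These methods are described in Proc. Natl. Acad. Sci. USA, 2000 Jun 6; 97(12): 6640-5, U.S. Patent 6,303,383, JP 05-007491 A and elsewhere.
The mobA gene contains the mobB gene in an alternative frame, and the 3' end of mobA encodes the RepB protein, which is essential for plasmid replication. In addition, the start codon of the repB gene overlaps the stop codon of the mobB gene, which suggests translational coupling of these genes.
The oriT region of the plasmid lies between the mobC and mobB genes and is required for the initiation of transfer. This region is also known to contain the promoter required for expression of the repB gene. It is therefore necessary to introduce one or more other promoters that can drive expression of repB.
Partial deletion of the plasmid can be carried out by conventional methods for constructing recombinant plasmids, such as digestion with restriction enzymes followed by ligation of the remaining part of the plasmid, recombination, integration and so on.
A specific embodiment of the present invention is an RSF1010-derived plasmid from which the mobA, mobB and mobC genes have been deleted. On the original RSF1010 plasmid (SEQ ID NO: 1), the mobA gene extends from nucleotide 3250 to nucleotide 5379, the mobB gene from nucleotide 3998 to nucleotide 4411, and the mobC gene from nucleotide 3051 to nucleotide 2767. The coding regions of repB and mobA overlap. It is therefore preferable to delete mobA, for example nucleotides 3250-5379, without deleting repB. The sequences of the RSF1010-derived Mob- plasmids, that is, RSF1010 derivatives from which the mob locus has been removed, are shown in SEQ ID NOs: 24, 27 and 48 of the Sequence Listing.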
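The overlaps described above can be checked with simple arithmetic on the Table 1 coordinates. The sketch below is only an illustration under that assumption: it works from the stated positions on SEQ ID NO: 1 rather than from the sequence itself, and the helper names are hypothetical.

```python
# 1-based inclusive coordinates on SEQ ID NO: 1, as given in Table 1.
GENES = {
    "mobC": (2767, 3051),  # encoded on the complementary strand
    "mobA": (3250, 5379),
    "mobB": (3998, 4411),
    "repB": (4408, 5379),
}

def length(gene: str) -> int:
    """Length of the gene in nucleotides."""
    start, end = GENES[gene]
    return end - start + 1

def frame(gene: str) -> int:
    """Reading frame (0, 1 or 2) of a forward-strand gene, from its start position."""
    return (GENES[gene][0] - 1) % 3

def overlap(a: str, b: str):
    """Shared coordinate range of two genes, or None if they do not overlap."""
    start = max(GENES[a][0], GENES[b][0])
    end = min(GENES[a][1], GENES[b][1])
    return (start, end) if start <= end else None

for name in GENES:
    print(name, length(name), "nt")
print("mobA/mobB/repB frames:", frame("mobA"), frame("mobB"), frame("repB"))
print("mobB-repB overlap:", overlap("mobB", "repB"))
print("mobA-repB overlap:", overlap("mobA", "repB"))
```

On these coordinates mobB starts in a different frame from mobA, repB shares the mobA frame and its end point, and mobB and repB share four nucleotides, which is consistent with a start codon overlapping a stop codon.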
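Another embodiment of the present invention is an RSF1010-derived Mob- plasmid that contains no antibiotic resistance marker. The original RSF1010 plasmid contains the streptomycin resistance genes (strA and strB) and the sulfonamide resistance gene (sul). On the RSF1010 plasmid (SEQ ID NO: 1), the strA gene extends from nucleotide 63 to 866, the strB gene from nucleotide 866 to 1702, and the sul gene from nucleotide 7875 to 8663. An RSF1010-derived Mob- plasmid that contains no antibiotic resistance marker is shown in SEQ ID NO: 27 and Figure 5.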
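Coordinates such as these can be read directly off a sequence listing in which each line ends with the position of its last base. The following sketch, with hypothetical helper names, indexes two lines copied from the SEQ ID NO: 1 listing in this document and extracts the codons at the stated strA and strB boundaries.

```python
# Two lines copied from the SEQ ID NO: 1 listing; the trailing number is
# the position of the last base on the line.
LISTING_LINES = [
    "cattgaatcg aactaatatt ttttttggtg aatcgcattc tgactggttg cctgtcagag 120",
    "gattggaccc tctgacttgg ggttgatgtt catgccgcct gtttttcctg ctcattggca 900",
]

def index_listing(lines):
    """Map each 1-based sequence position to its base."""
    bases = {}
    for line in lines:
        *groups, end = line.split()
        seq = "".join(groups)
        start = int(end) - len(seq) + 1
        for offset, base in enumerate(seq):
            bases[start + offset] = base
    return bases

def fetch(bases, start, end):
    """Return the bases from start to end, inclusive."""
    return "".join(bases[pos] for pos in range(start, end + 1))

bases = index_listing(LISTING_LINES)
print("strA first codon (63-65):", fetch(bases, 63, 65))
print("strA last codon (864-866):", fetch(bases, 864, 866))
print("strB first codon (866-868):", fetch(bases, 866, 868))
```

Only the two lines needed for the boundary positions are included here; indexing the full listing works the same way.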
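Another embodiment of the present invention is an RSF1010-derived Mob- plasmid that has been modified to inactivate the antibiotic resistance genes. The antibiotic resistance genes concerned in the present invention are the sulfonamide and streptomycin resistance (StrR) genes (the sul gene and the strA and strB genes, respectively).
Another embodiment of the present invention is an RSF1010-derived Mob- plasmid that has been modified to increase the plasmid copy number. A strong promoter or an inducible promoter can be used to express the repB gene, so that the plasmid copy number is increased. Examples of such strong promoters include the lac promoter, trp promoter, trc promoter, tac promoter, the PR and PL promoters of lambda phage, the tet promoter, amyE promoter, spac promoter and so on. Among these, the PlacUV5 and lac promoters are preferred, the PlacUV5 promoter in particular. RSF1010-derived mob- plasmids containing the PlacUV5 promoter are described in SEQ ID NOs: 24, 27 and 48.
To regulate copy number conditionally, the PlacUV5 promoter can be combined with a lacI gene placed under the control of the PlacUV5 promoter. The PlacUV5 promoter is inducible by addition of IPTG, and the lacI gene product represses expression from the PlacUV5 promoter (J. Mol. Biol., 1982 Nov 5; 161(3): 417-38). Therefore, to increase copy number, IPTG can be added or the lacI gene can be deleted. The copy number of such a plasmid is expected to increase two-, three- or four-fold compared with RSF1010. The RSF1010mob-lacI- plasmid containing the PlacUV5 promoter is described in SEQ ID NO: 48. To decrease the plasmid copy number, the lacI gene is preferably modified so that it is overexpressed.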
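The nucleotide sequence of the PlacUV5 promoter is disclosed under GenBank accession number Y00412 (nucleotides 7-100). The sequence of lacI is disclosed under GenBank accession number NP_414879. The nucleotide sequence of the PlacUV5 promoter used in the present invention is also described in SEQ ID NO: 24 (nucleotides 2824-2912). The PlacUV5 promoter can be obtained by chemical synthesis based on the nucleotide sequence of SEQ ID NO: 24, or prepared from the pET expression system (Novagen). The nucleotide sequence of lacI is also described in SEQ ID NO: 25. lacI can be obtained by PCR based on the sequence of SEQ ID NO: 25 or GenBank accession number NP_414879, using chromosomal DNA of E. coli K-12 (MG1655) as the template. The RSF1010mob- plasmid containing the PlacUV5 promoter and lacI is shown in SEQ ID NO: 24 and Figure 3.
Another embodiment of the present invention is an RSF1010-derived Mob- plasmid that additionally contains a thymidylate synthase gene (thyA gene, SEQ ID NO: 44) as a selective marker. Thymidylate synthase catalyzes the formation of thymidine-5'-monophosphate (dTMP) from 2'-deoxyuridine-5'-monophosphate (dUMP), consuming 5,10-methylenetetrahydrofolate and releasing 7,8-dihydrofolate. The thyA gene encoding thymidylate synthase of E. coli has been elucidated (nucleotides 2962383 to 2963177 in the sequence of GenBank accession number NC_000913.1, gi:16130731). The thyA gene is located between the ppdA and lgt genes on the chromosome of E. coli strain K12. The gene can therefore be obtained by PCR (polymerase chain reaction; see White, T.J. et al., Trends Genet., 5, 185 (1989)) using primers based on the reported nucleotide sequence. The RSF1010 derivative that lacks the mob locus and all antibiotic resistance genes and contains the thymidylate synthase gene as a selective marker carries the thyA gene at nucleotides 196 to 990 of SEQ ID NO: 27; the thyA sequences are shown in SEQ ID NOs: 44 and 45 of the Sequence Listing.
An RSF1010-derived Mob- plasmid that additionally contains the thymidylate synthase gene (thyA) as a selective marker can be used as a vector. A vector is a DNA molecule into which another DNA fragment of suitable size can be integrated without loss of the vector's capacity for self-replication; vectors introduce foreign DNA into host cells, where it can be reproduced in large quantities. A plasmid containing the thyA gene is described in SEQ ID NO: 27 (RSF1010mob-MT) and Figure 5.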
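Accordingly, another embodiment of the present invention is an RSF1010-derived Mob- plasmid that contains the thymidylate synthase gene (thyA) as a selective marker and additionally contains a gene of interest. The term "gene of interest" means a gene involved in, or affecting, the biosynthetic pathway of a useful metabolite. These may be genes involved in native or recombinant proteins, genes involved in the biosynthesis of L-amino acids, nucleosides, nucleotides, organic acids and vitamins, or genes encoding regulatory proteins. The term "useful metabolite" includes native or recombinant proteins, enzymes, L-amino acids, nucleosides and nucleotides, organic acids, and vitamins. L-amino acids include L-alanine, L-arginine, L-asparagine, L-aspartic acid, L-cysteine, L-glutamic acid, L-glutamine, L-glycine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-methionine, L-phenylalanine, L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine, L-valine and L-homoserine, and preferably include aromatic L-amino acids such as L-tryptophan, L-phenylalanine and L-tyrosine. Nucleosides include purine and pyrimidine nucleosides such as adenosine, cytidine, inosine, guanosine, thymidine, uridine and xanthosine. Nucleotides include phosphorylated nucleosides, preferably 5'-phosphorylated nucleosides, such as 2'-deoxyadenosine-5'-monophosphate (dAMP), 2'-deoxycytidine-5'-monophosphate (dCMP), 2'-deoxyguanosine-5'-monophosphate (dGMP), thymidine-5'-monophosphate (dTMP), adenosine-5'-monophosphate (AMP), cytidine-5'-monophosphate (CMP), guanosine-5'-monophosphate (GMP), inosine-5'-monophosphate (IMP), uridine-5'-monophosphate (UMP) and xanthosine-5'-monophosphate (XMP). Organic acids include succinic acid, fumaric acid, malic acid and ketogluconic acid. Vitamins include pantothenic acid.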
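The plasmids of the present invention, in particular those shown in SEQ ID NOs: 24, 27 and 48, may include variants of these sequences, provided that the variant plasmids remain able to function in bacteria as the plasmids did before the variation was introduced. The function of a plasmid, as used herein, means its ability, when transformed into a bacterium, to replicate itself, to express the gene of interest, and to express the genes necessary for plasmid replication. Considerable variation, and even deletion, may occur in regions of the plasmid that are not important for its function and replication, for example the regions from nucleotide 7219 to 8335 and from nucleotide 1 to 2347 of the RSF1010-derived mob- plasmid (SEQ ID NO: 24), and the regions from nucleotide 1004 to 1649 and/or from 6557 to 6864 of the RSF1010-MT plasmid (SEQ ID NO: 27). These regions can typically contain one or several selective markers. In addition, the coding part of the lacI gene, which is involved in regulating plasmid replication (nucleotides 2252 to 3379 for the RSFmob- plasmid (SEQ ID NO: 24), and nucleotides 2914 to 4041 for the RSF1010-MT plasmid (SEQ ID NO: 27)), may also be modified or deleted (see Example 2), provided that the modification or deletion does not create a stop codon or a frameshift within the lacI gene. Further variation may be substitution, deletion or insertion of nucleotides in other regions of SEQ ID NOs: 24, 27 and 48, provided that the plasmid functions and replicates as it did before the variation was introduced. Preferably, variants are at least 80% homologous to the sequences of SEQ ID NOs: 24, 27 and 48, more preferably at least 90%, still more preferably at least 95%, and most preferably at least 97% homologous. Homology can be determined by conventional, well-known techniques such as BLAST and is measured over the full length of SEQ ID NOs: 24, 27 and 48. For example, the BLAST 2.0 software can be used to calculate three parameters, score, identity and similarity, to estimate the homology between two amino acid sequences. The similarity value obtained in the calculation is taken into account to estimate the percent homology. BLAST (Basic Local Alignment Search Tool) is the heuristic search algorithm employed by the programs blastp, blastn, blastx, megablast, tblastn and tblastx; these programs ascribe significance to their findings using the statistical methods of Karlin, Samuel and Stephen F. Altschul ("Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes", Proc. Natl. Acad. Sci. USA, 87: 2264-68 (1990); "Applications and statistics for multiple high-scoring segments in molecular sequences", Proc. Natl. Acad. Sci. USA, 90: 5873-7 (1993)).
Methods for preparation of chromosomal DNA, hybridization, PCR, preparation of plasmid DNA, digestion and ligation of DNA, transformation, selection of oligonucleotides as primers and the like are conventional methods well known to those skilled in the art. They are described, for example, in Sambrook, J. and Russell, D., "Molecular Cloning: A Laboratory Manual, Third Edition", Cold Spring Harbor Laboratory Press (2001).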
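The bacterium of the present invention includes a bacterium containing the plasmid of the present invention, preferably a gram-negative bacterium. The bacterium preferably has the ability to produce a useful metabolite. The bacterium of the present invention also includes a bacterium as described above that lacks active native thymidylate synthase and thymidine kinase. It may, however, have active thymidylate synthase expressed from the plasmid of the present invention that it harbors.
The term "bacterium having the ability to produce a useful metabolite" means a bacterium that is able to cause accumulation of the metabolite in the bacterial cells or, preferably, in the culture medium when it is cultivated in the medium. The ability to produce the metabolite can be imparted or enhanced by breeding. The term also refers to a bacterium that is able to produce the metabolite and cause it to accumulate in the culture medium in an amount larger than a wild-type or parental strain, and preferably to a microorganism that is able to produce the target metabolite and cause it to accumulate in the medium in an amount of not less than 0.5 g/L, more preferably not less than 1.0 g/L.
The term "gram-negative bacterium" refers to a bacterium classified as gram-negative according to the classification known to those skilled in microbiology. See, for example, "Bergey's Manual of Determinative Bacteriology, Ninth Edition" (Bergey, John G. Holt (Editor), Noel R. Krieg, Peter H.A. Sneath, D. Bergy; publisher: Lippincott, Williams & Wilkins). Gram-negative bacteria include, for example, bacteria of the following families: Acetobacteriaceae, Alcaligenaceae, Bacteroidaceae, Chromatiaceae, Enterobacteriaceae, Legionellaceae, Neisseriaceae, Nitrobacteriaceae, Pseudomonadaceae, Rhizobiaceae, Rickettsiaceae, Spirochaetaceae, Vibrionaceae and so on.
The family Enterobacteriaceae includes, for example, bacteria belonging to the genera Enterobacter, Erwinia, Escherichia, Klebsiella, Providencia, Salmonella, Serratia, Shigella and so on.
The phrase "lacking active thymidylate synthase and thymidine kinase" means that the native genes encoding these enzymes have been modified in such a way that the modified genes encode completely inactive proteins. It is also possible that the modified genes cannot be expressed because of deletion of a part of the gene, a shift of the reading frame, or modification of regions adjacent to the gene, including sequences that control operon expression such as promoters, enhancers and attenuators.
Cells that have lost thymidylate synthase activity are known to be unable to make DNA and unable to survive unless supplied with thymine or thymidine, which are converted to dTMP by alternative pathways. Additional inactivation of thymidine kinase yields a bacterium that cannot utilize thymine or thymidine present in the medium. As a result, the thymidylate synthase gene on the plasmid of the present invention serves not only as a selective marker but also as a factor that stabilizes the plasmid in the bacterium.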
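The selection logic described above can be summarized as a small truth table. The sketch below is a deliberately simplified boolean model and not part of the patent: it treats dTMP supply as available either de novo (any active thymidylate synthase gene, chromosomal or plasmid-borne) or by salvage (active thymidine kinase plus thymidine in the medium). The function and parameter names are hypothetical.

```python
from itertools import product

def can_make_dtmp(chromosomal_thyA: bool, plasmid_thyA: bool,
                  tdk: bool, thymidine_in_medium: bool) -> bool:
    """Simplified model: True if the cell has some route to dTMP."""
    de_novo = chromosomal_thyA or plasmid_thyA   # thymidylate synthase route
    salvage = tdk and thymidine_in_medium        # thymidine kinase route
    return de_novo or salvage

# Enumerate the thyA- host on thymidine-containing (rich) medium.
for plasmid, tdk in product((True, False), repeat=2):
    alive = can_make_dtmp(False, plasmid, tdk, thymidine_in_medium=True)
    print(f"thyA- host, plasmid thyA={plasmid}, tdk={tdk}: survives={alive}")
```

In this model a thyA- tdk+ cell that loses the plasmid still survives when thymidine is available, whereas a thyA- tdk- cell does not, which is the sense in which the plasmid-borne thyA gene acts as a stabilizing factor.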
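Thymidine kinase catalyzes the ATP-dependent phosphorylation of thymidine to give thymidine-5'-monophosphate (dTMP). The tdk gene encoding thymidine kinase of E. coli has been elucidated (nucleotides 1292750 to 1293367 of GenBank accession number NC_000913.1, gi:16129199). The tdk gene is located between the hns gene and the ychG ORF on the chromosome of E. coli strain K12. The nucleotide sequence of the tdk gene and the amino acid sequence encoded by it are shown in SEQ ID NOs: 46 and 47, respectively.
Gene inactivation can be accomplished by conventional methods, such as mutagenesis by ultraviolet (UV) irradiation or treatment with nitrosoguanidine (N-methyl-N'-nitro-N-nitrosoguanidine), site-directed mutagenesis, or gene disruption by homologous recombination and/or insertion-deletion mutagenesis (Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA, 97(12): 6640-45 (2000)), also called "Red-driven integration".
Specifically, after inactivation of the thyA gene of the host strain, the mutant host is transformed with a modified RSF1010 plasmid that lacks the mob locus and all antibiotic resistance genes and contains the thymidylate synthase gene (SEQ ID NO: 44), and transformants are then selected on medium without thymidine. The tdk gene is inactivated afterwards. A gene can be inactivated by replacing the target gene with an antibiotic resistance gene flanked by sequences suitable for subsequent excision of the resistance gene. Examples of excision systems include the system using FRT sites and Flp recombinase, combined with phage λ Red recombination (Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA, 97(12): 6640-45 (2000)); the system using the attL and attR sites of phage λ and the products of the int and xis genes (Peredelchuk, M.Y. and Bennett, G.N., Gene, 187, 231-238 (1997)); the system using the loxP sites of phage P1 and Cre recombinase (Guo, F. et al., Nature, 389, 40-46); the similar systems described by Campbell, A.M. (J. Bacteriol., 174, 23, 7495-7499 (1992)); and the like.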
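The bacterium of the present invention can be obtained by introducing the plasmid of the present invention into a bacterium that inherently has the ability to produce a useful metabolite and lacks active thymidylate synthase and thymidine kinase. Alternatively, it can be obtained by imparting the ability to produce a useful metabolite to a bacterium that already lacks active thymidylate synthase and thymidine kinase and harbors the plasmid.
The method of the present invention includes a method for producing a useful metabolite comprising cultivating the bacterium of the present invention in a culture medium, allowing the metabolite to accumulate in the medium, and collecting the metabolite from the medium.
In the present invention, cultivation, collection and purification of the target metabolite from the medium and the like can be performed in a manner similar to conventional fermentation methods in which a target metabolite is produced using a microorganism. The culture medium may be synthetic or natural, provided that it contains a carbon source, a nitrogen source, minerals and, if necessary, appropriate amounts of the nutrients the microorganism requires for growth. Carbon sources include various carbohydrates such as glucose and sucrose, and various organic acids. Depending on the mode of assimilation of the chosen microorganism, alcohols including ethanol and glycerol can be used. As the nitrogen source, ammonia and various ammonium salts such as ammonium sulfate, other nitrogen compounds such as amines, and natural nitrogen sources such as peptone, soybean hydrolysate and digested fermentative microorganisms can be used. As minerals, potassium monophosphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, calcium chloride and the like can be used. If necessary, other nutrients can be added to the medium, for example to compensate for auxotrophy.
After cultivation, solids such as cells can be removed from the liquid medium by centrifugation or membrane filtration, and the target metabolite can then be collected and purified by conventional methods such as ion exchange, affinity chromatography, concentration, crystallization and other methods suitable for the particular metabolite.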
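Examples
The present invention is explained more specifically below with reference to the following non-limiting examples.
Example 1. Construction of the RSF1010mob- plasmid
The RSF1010 Mob- plasmid was constructed by "Red-driven integration" (Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-45) of a DNA fragment containing the autoregulated element PlacUV5-lacI, marked with the chloramphenicol resistance gene (cat gene), into the RSF1010 plasmid in place of the mob locus.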
First, a DNA fragment carrying the structural part of the lacI gene under the control of the PlacUV5 promoter was amplified by PCR using primers P1 (SEQ ID NO: 29) and P2 (SEQ ID NO: 30) and the plasmid pMW-PlacUV5-lacI-118 (Skorokhodova, A.Y. et al., Biotechnologiya (rus), No. 5, (2004)) as the template.
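Primer P1 is identical to a region of the pMW-PlacUV5-lacI-118 plasmid located upstream of the XbaI restriction site on the plasmid. Primer P2 contains a BamHI restriction site introduced at its 5' end. A fragment of the repB gene (SEQ ID: 13) from plasmid RSF1010 was amplified by PCR using primers P3 (SEQ ID NO: 31) and P4 (SEQ ID NO: 32). On plasmid RSF1010, the start codon of the repB gene overlaps the stop codon of the mobB gene (Figure 1). The SD sequence of the repB gene is located 4 base pairs upstream of its start codon. To provide translation of the RepB protein in the absence of the adjacent mobB gene, the translation initiation region of repB was modified by adding 4 nucleotides to primer P3. In addition, primer P3 contains a BamHI restriction site at its 5' end, and primer P4 contains a KpnI restriction site at its 5' end. The two PCR products were purified by agarose gel electrophoresis, treated with BamHI, ligated, and used as the template for PCR with primers P1 and P4. The resulting DNA fragment was treated with XbaI and KpnI and cloned into the pBluescript II SK(+) vector (Stratagene) that had been treated with the same enzymes. The resulting plasmid was named pBluescript::lacIrepB.
Next, a DNA fragment containing the chloramphenicol resistance gene (cat gene) and the PlacUV5 promoter was constructed. The cat gene was amplified from plasmid pACYC184 (Takara Bio) using primers P5 (SEQ ID NO: 33) and P6 (SEQ ID NO: 34). Primer P5 contains a BglII restriction site at its 5' end, which is needed for subsequent excision of the cat gene after selection of the mob- plasmid. Primer P6 contains a SacI restriction site at its 5' end. The PlacUV5 promoter was amplified from the pMW-PlacUV5-lacI-118 plasmid using primers P7 (SEQ ID NO: 35) and P8 (SEQ ID NO: 36). Primer P7 contains a SacI restriction site at its 5' end, and primer P8 contains an XbaI restriction site at its 5' end. The fragments were purified by agarose gel electrophoresis, treated with SacI, ligated, and used as the template for PCR with primers P5 and P8. The product was then treated with XbaI and ligated with the pBluescript::lacIrepB plasmid that had been treated with the same enzyme. The resulting linear product was used as the template for PCR with primers P4 (SEQ ID NO: 32) and P9 (SEQ ID NO: 37). Primer P9 contains 38 nucleotides of the RSF1010 region located between oriV and the 3' end of the mobC gene, a BglII restriction site, and 17 nucleotides complementary to the 5' end of the cat gene.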
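The primers above each carry a restriction site at the 5' end. As a minimal sketch of that design pattern, the code below prepends a recognition sequence to an annealing region. The recognition sequences are the standard ones for these enzymes; the annealing sequence, the two-base 5' clamp and the function names are hypothetical and are not the primers of SEQ ID NOs: 29-37.

```python
# Standard recognition sequences of the enzymes used in Example 1.
SITES = {
    "BamHI": "GGATCC",
    "KpnI": "GGTACC",
    "XbaI": "TCTAGA",
    "BglII": "AGATCT",
    "SacI": "GAGCTC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def is_palindromic(site: str) -> bool:
    """True if the site reads the same on both strands."""
    return site == reverse_complement(site)

def add_site(annealing: str, enzyme: str, clamp: str = "GC") -> str:
    """Build a primer: 5' clamp (an assumption), restriction site, annealing region."""
    return clamp + SITES[enzyme] + annealing

# Hypothetical annealing region, for illustration only.
primer = add_site("ATGAAGAACGACAGG", "BamHI")
print(primer)
print({name: is_palindromic(site) for name, site in SITES.items()})
```

Because each site is palindromic, the same recognition sequence is present on both strands of the PCR product once the primer has been incorporated.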
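The PCR product obtained, which contains the 3' end of the repB gene, the lacI gene under the control of the PlacUV5 promoter, the cat gene, and 38 nucleotides of the RSF1010 region between oriV and the 3' end of the mobC gene, was integrated into the RSF1010 plasmid by "Red-driven integration" (Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-45) in place of the mob locus of the plasmid. In accordance with this method, plasmid pKD46 was used as the helper plasmid. E. coli strain BW25113 containing the recombinant plasmid pKD46 can be obtained from the E. coli Genetic Stock Center, Yale University, New Haven, USA, under accession number CGSC7630.
The RSF1010 plasmid was introduced together with the above DNA fragment into strain MG1655(pKD46) by electroporation. Strain MG1655 (ATCC No. 47076) is available from the American Type Culture Collection (ATCC; P.O. Box 1549, Manassas, VA 20108, USA).
For electroporation, 100-200 ng of the PCR-amplified DNA fragment and 100 ng of RSF1010 plasmid were used. Electroporation was performed with a BioRad electroporator (No. 165-2098, ver. 2-89, USA) at a pulse time of 4-5 msec and a field strength of 12.5 kV/cm. Immediately after electroporation, 1 ml of SOC medium was added to the cell suspension. The cells were incubated at 37°C for 2 hours, plated on LB agar containing 30 μg/ml chloramphenicol, and grown overnight at 37°C.
The RSFmob-cat plasmid isolated as the result of homologous recombination was treated with BglII and XbaI to remove the cat gene and then ligated with a PCR fragment containing the PlacUV5 promoter that had been treated with the same enzymes. The PCR fragment containing the PlacUV5 promoter was obtained using primers P1 (SEQ ID NO: 29) and P8 (SEQ ID NO: 36). The sequence of the RSF1010 derivative with the deleted mob locus (RSF1010mob-, 8338 bp) is shown in SEQ ID NO: 24 of the Sequence Listing and in Figure 3.
To assess the stability of the resulting plasmid, a plasmid-carrying culture was passaged seven times under non-selective conditions; no streptomycin-sensitive (SmS) clone was found among 100 independent clones. Thus, after seven passages without selection, the stability of the RSF1010mob- plasmid was not lower than 99%.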
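One way to read the "not lower than 99%" figure is as the resolution limit of the assay: when none of n tested clones has lost the plasmid, the loss frequency is below 1 in n. The sketch below encodes that reading. It is an interpretation, not a statement of how the authors computed the value, and the function name is hypothetical.

```python
def stability_lower_bound(clones_tested: int, clones_lost: int) -> float:
    """Fraction of clones retaining the plasmid.

    When no loss is observed, report the resolution limit 1 - 1/n
    instead of claiming exactly 100%.
    """
    if clones_tested <= 0 or not 0 <= clones_lost <= clones_tested:
        raise ValueError("invalid clone counts")
    if clones_lost == 0:
        return 1 - 1 / clones_tested
    return 1 - clones_lost / clones_tested

# 100 clones tested, no streptomycin-sensitive clone found.
print(f"{stability_lower_bound(100, 0):.1%}")
```

Testing more clones tightens the bound; with 200 clones and no loss it would be 99.5%.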
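The mobilization efficiency of the RSF1010mob- plasmid was compared with that of the parental plasmid RSF1010. For this purpose, donor strains were constructed on the basis of E. coli strain C600 (r+m+) (Funakoshi) containing the resident plasmid RP1-2 (Tcr). This plasmid provides the genes of the tra operon needed for conjugative transfer. Plasmids RSF1010 and RSF1010mob- were introduced into the C600(RP1-2) strain by transformation, using Strr as the selective marker. Three control donor strains were constructed by transformation with plasmids pAYC32 (Chistoserdov, A.Y. and Tsygankov, Y.D., Plasmid, 16, 161-167 (1986)), pBR322 and pUC19. All the donor strains were used in conjugation experiments with the recipient strain LE392 met- RifR (Promega) to determine mobilization efficiency. The results of these experiments are given in Table 2.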
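Table 2
Plasmid in donor carrying RP1-2 | Mobilization efficiency (a) |
RSF1010 (b) | 2.5×10⁻⁵ |
RSF1010mob- | <10⁻⁸ |
pAYC32 (c) | 3×10⁻⁴ |
pBR322 (c) | 4×10⁻⁶ |
pUC19 (c) | <10⁻⁸ |
(a) Number of transconjugants per donor after overnight mating of the C600 donor with LE392 as the recipient.
(b) Transfer of RSF1010 and its derivatives was determined on LB-rifampicin-streptomycin medium.
(c) Transfer of these plasmids was determined on LB-rifampicin-ampicillin medium.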
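The efficiencies in Table 2 are ratios of transconjugants to donors. As a small worked example, the sketch below computes such a ratio and the minimum fold reduction implied by the Table 2 values for RSF1010 and RSF1010mob-. The colony counts passed to `transfer_efficiency` are hypothetical; only the two efficiency values come from the table.

```python
def transfer_efficiency(transconjugants: float, donors: float) -> float:
    """Transconjugants obtained per donor cell."""
    if donors <= 0:
        raise ValueError("donor count must be positive")
    return transconjugants / donors

def min_fold_reduction(parent_efficiency: float, detection_limit: float) -> float:
    """Lower bound on the fold reduction when the derivative is below detection."""
    return parent_efficiency / detection_limit

# Hypothetical counts that would give the RSF1010 value in Table 2.
print(transfer_efficiency(2.5e3, 1e8))

# Table 2 values: RSF1010 = 2.5e-5, RSF1010mob- < 1e-8.
print(round(min_fold_reduction(2.5e-5, 1e-8)))
```

Because the RSF1010mob- value is an upper limit, the fold reduction is a lower bound: mobilization is reduced at least 2500-fold relative to RSF1010.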
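The results in Table 2 show that plasmid RSF1010mob- has completely lost its mobilization ability. In this respect it resembles the control plasmid pUC19, which contains no mob genes.
Example 2. Construction of a mob- derivative of the RSF1010 plasmid with increased copy number
According to our data, RSF1010mob- has the same copy number as RSF1010 in stationary phase. In logarithmic phase, the copy number of the derivative is about two-fold lower than that of the RSF1010 plasmid. Addition of IPTG to the medium in logarithmic phase increases the copy number of the RSF1010mob- plasmid, because the repB gene, which is involved in replication of RSF-like plasmids, is under the transcriptional control of the PlacUV5-lacI autoregulated element. It was therefore proposed that eliminating the lacI gene from the RSF1010mob- plasmid would increase its copy number. The corresponding derivative of RSF1010mob- lacking lacI was obtained by excising the lacI gene from the RSF1010mob- plasmid with XbaI and BamHI. The sticky ends of the resulting DNA fragment were then blunted, and the RSF1010mob-,lacI- plasmid was obtained by ligation. The DNA sequence of the RSF1010mob-,lacI- plasmid is shown in SEQ ID NO: 48.
To estimate the copy number of the RSF1010 derivatives, the three plasmids were each introduced into E. coli strain MG1655. Plasmid DNA was isolated with the GenElute Plasmid Miniprep Kit (Sigma, USA) from equal amounts of cells grown overnight in LB medium without IPTG, and treated with EcoRV and RNase A. After staining of the agarose gel with ethidium bromide, plasmid copy number was estimated by scanning the electrophoretic bands corresponding to the large EcoRV fragment of each plasmid with the "Sorbfil" program. Three independent transformants of each type were used in the experiment. The relative copy numbers of the RSF1010 derivatives are shown in Table 3. The copy number of the RSF1010 plasmid was taken as 1.0.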
Table 3. Relative copy number of mob- derivatives of the RSF1010 plasmid.
Plasmid | Relative copy number |
RSF1010 | 1.0±0.3 |
RSF1010mob- | 0.9±0.1 |
RSF1010mob-,lacI- | 2.6±0.3 |
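As a worked example on the Table 3 values, the sketch below computes the fold change in copy number caused by removing lacI, that is, RSF1010mob-,lacI- relative to RSF1010mob-. The error propagation assumes that the two ± values are independent standard deviations, which the text does not state; the function name is hypothetical.

```python
from math import sqrt

def ratio_with_error(a: float, sd_a: float, b: float, sd_b: float):
    """Ratio a/b with its uncertainty, assuming independent errors.

    The relative errors are combined in quadrature.
    """
    ratio = a / b
    rel_err = sqrt((sd_a / a) ** 2 + (sd_b / b) ** 2)
    return ratio, ratio * rel_err

# Table 3: RSF1010mob-,lacI- = 2.6 +/- 0.3; RSF1010mob- = 0.9 +/- 0.1.
fold, err = ratio_with_error(2.6, 0.3, 0.9, 0.1)
print(f"{fold:.1f} +/- {err:.1f}")
```

Relative to the parental RSF1010 plasmid, whose copy number is defined as 1.0, the lacI- derivative is simply 2.6-fold higher.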
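Example 3. Construction of an RSF1010 Mob- plasmid lacking any antibiotic resistance gene and containing the thyA gene as a selective marker.
First, two strains were constructed on the basis of the wild-type E. coli strain MG1655: one with a deleted thyA gene and the other with a deleted tdk gene. The target genes were inactivated by the so-called "Red-driven integration" method (Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-45), integrating a fragment containing an antibiotic resistance marker into each strain. The chloramphenicol resistance gene from plasmid pACYC184 was used to disrupt the thyA gene (Cmr), and the kanamycin resistance gene from plasmid pACYC177 was used to disrupt the tdk gene (Kmr). Both mutants carrying the antibiotic resistance markers can be used as donors for P1 transduction of the ΔthyA and Δtdk deletions into other E. coli strains.
The strain with the thyA deletion was used in the present invention to screen for functionally active copies of the thyA gene cloned on different plasmids.
The second stage involved cloning a functionally active copy of the thyA gene. On the chromosome of E. coli strain MG1655, the thyA gene is located in the distal part of a proposed operon structure, lgt-thyA. A promoter proximal to this operon may exist. Although the thyA gene has two annotated promoters (PthyA1 and PthyA2) immediately upstream of its start codon, their sequences differ from the canonical promoter sequence, so their respective potential efficiencies remain a matter of discussion.
According to the physical map of the E. coli chromosome, the thyA structural gene consists of 795 bp, and the corresponding protein, thymidylate synthase, contains 264 amino acids. The nucleotide sequence of the thyA gene and the amino acid sequence encoded by it are shown in SEQ ID NOs: 44 and 45, respectively.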
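The gene and protein sizes quoted in this document are linked by simple arithmetic: a coding sequence of N base pairs holds N/3 codons, one of which is the stop codon. The sketch below checks the thyA figures (795 bp, 264 amino acids) and the chromosomal coordinates quoted for thyA; the same arithmetic applies to the tdk figures (618 bp, 205 amino acids). The function names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive coordinate range."""
    return end - start + 1

def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a coding sequence that includes its stop codon."""
    if cds_bp % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return cds_bp // 3 - 1

# thyA: nucleotides 2962383 to 2963177 of NC_000913.1.
thyA_bp = span_length(2962383, 2963177)
print(thyA_bp, protein_length(thyA_bp))

# tdk: nucleotides 1292750 to 1293367 of NC_000913.1.
tdk_bp = span_length(1292750, 1293367)
print(tdk_bp, protein_length(tdk_bp))
```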
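The structural part of the thyA gene, together with both native promoters, was amplified by PCR using primers ThyA1 (SEQ ID NO: 38) and ThyA2 (SEQ ID NO: 39) and chromosomal DNA from E. coli TG1 cells (Amersham Pharmacia Biotech). These primers contain EcoRI and HindIII restriction sites, respectively, for cloning the thyA gene into the vectors pUC18 (Takara Bio), pUC19 (Takara Bio) and pET22(+) (Promega). After PCR amplification, a 994 bp DNA fragment containing the thyA gene was isolated and cloned into the EcoRI and HindIII sites of plasmids pUC18, pUC19 and pET22(+). After transformation into E. coli strain TG1, AmpR clones were isolated and checked for the presence of the cloned thyA gene by control PCR with primers ThyA1 and ThyA2. Several clones containing the desired DNA fragment were selected for isolation of the recombinant plasmids and for testing the functional activity of the cloned thyA gene.
To test the functional activity of the thyA gene cloned on plasmids pUC18, pUC19 and pET22(+), all the plasmids were introduced by transformation into recipient cells of strain MG1655(ΔthyA::Cmr). Fifty independent AmpR clones from each transformation were selected and tested for their ability to complement the thyA mutation. The presence of the cloned thyA gene in clones containing plasmids pUC18thyA, pUC19thyA and pET22(+)thyA was confirmed by control PCR with the ThyA1 and ThyA2 primers. All the transformants tested, except those derived from plasmid pET22, were able to grow on minimal medium without thymidine. These data indicate that the cloned thyA gene can be expressed from its own promoter only when it is cloned on the multicopy plasmids pUC18 and pUC19.
It should be noted that the EcoRI-HindIII fragment containing the thyA gene is in opposite orientations in the pUC18 and pUC19 plasmids. In pUC18, transcription of thyA is in the same direction as transcription of the plasmid lacZ gene, that is, thyA can be transcribed from the lacZ promoter, whereas in pUC19 transcription of thyA can be directed only from its own promoter. Comparison of thyA expression from the pUC18 and pUC19 clones therefore allows the efficiency of the thyA promoter to be estimated. Clones of strain MG1655(ΔthyA::Cmr) containing the pUC18thyA plasmid were found to grow better on minimal medium than clones containing the pUC19thyA plasmid. These data indicate that the level of thyA transcription from its own promoter is lower than in the construct with the upstream lacZ promoter.
On the other hand, the recipient strain MG1655(ΔthyA::Cmr) carrying the pET22(+) vector with the cloned thyA gene grew only very slowly on minimal medium without thymidine. This indicates that expression of thyA from its own promoter on plasmid pET22(+) is insufficient to support growth of the recipient strain MG1655(ΔthyA::Cmr). The copy number of pET22 is known to be lower than that of the pUC18(19) plasmids. Since our final vector, RSF1010, is also not a very high copy number plasmid, it was decided to improve the expression of the thyA gene cloned in the pET22(+) vector. For this purpose, several additional mutations were introduced into the -10 region of the thyA promoter by site-directed PCR mutagenesis.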
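Two primers were designed: ThyA4 (SEQ ID NO: 40) and ThyA5 (SEQ ID NO: 41). Both primers contain substitutions in the -10 region of the promoter and a TG motif at positions -15 and -14, which should improve the efficiency of transcription from the thyA promoter. Two separate PCR amplifications were carried out with the primer pairs ThyA1-ThyA5 and ThyA2-ThyA4, using the pET-22-thyA plasmid as the template, to isolate two thyA gene fragments. The PCR products were then annealed together, and the mixture was used as the template for PCR with the ThyA1 and ThyA2 primers to isolate the full-length thyA gene with the improved -10 region. After PCR amplification, the modified 994 bp fragment was digested with EcoRI and HindIII and cloned into the vectors pUC18, pUC19 and pET22 that had been treated with the same enzymes. After transformation into E. coli strain TG1, AmpR clones were isolated and checked for the presence of the cloned thyA gene by control PCR with the ThyA1 and ThyA2 primers. Several clones containing the desired DNA fragment were selected for isolation of the recombinant plasmids, sequencing, and testing the functional activity of the cloned thyA gene.
First, the modified thyA gene (hereinafter the thyA* gene) was sequenced, and the presence of the introduced mutations in the promoter region was confirmed. The new promoter contains a perfect Pribnow box, TATAAT, and a TG motif at positions -15 and -14 (Figure 4; nucleotides 87-95 in SEQ ID: 27).
To test whether the improved thyA promoter provides a level of thyA* expression sufficient for growth of a thyA auxotroph, plasmids pUC18, pUC19 and pET22(+) carrying the thyA* gene under the control of the modified promoter were transformed into recipient cells of strain MG1655(ΔthyA::Cmr). Fifty independent AmpR clones from each transformation were selected and tested for their ability to complement the thyA mutation. The presence of the cloned thyA* gene in clones containing plasmids pUC18thyA*, pUC19thyA* and pET22thyA* was confirmed by control PCR with the ThyA1 and ThyA2 primers. All the transformants tested, including those containing the pET22 plasmid, were able to grow on minimal medium without thymidine. These data indicate that thymidylate synthase activity under the control of the improved thyA* promoter is sufficient for growth of a thyA auxotroph.
One further modification of the thyA* gene was needed to remove a PstI restriction site from the structural part of the gene, because PstI was to be used for excising the SulR gene (SEQ ID: 22) from the RSF1010mob- plasmid. The structural modification of the functionally active gene was carried out by the same PCR technique for site-specific mutagenesis that was used above to modify the promoter region. Two independent PCR amplifications were carried out with the primer pairs ThyA1-ThyA16 and ThyA17-ThyA2, using the pET-22-thyA* plasmid as the template, to isolate two thyA gene fragments. Primers ThyA16 (SEQ ID NO: 42) and ThyA17 (SEQ ID NO: 43) introduce a synonymous codon that eliminates the PstI restriction site from the structural part of the thyA gene. The PCR products were then annealed together, and the mixture was used as the template for PCR with the ThyA1 and ThyA2 primers to isolate the full-length thyA* gene with no PstI restriction site in its structural part. After PCR amplification, the modified 994 bp fragment was digested with EcoRI and HindIII and cloned into the vectors pUC18, pUC19 and pET22 that had been treated with the same enzymes. After transformation into E. coli strain TG1, AmpR clones were isolated and checked for the presence of the cloned thyA* gene by control PCR with the ThyA1 and ThyA2 primers. Several clones containing the desired DNA fragment were selected for isolation of the recombinant plasmids, sequencing, and testing the functional activity of the cloned thyA* gene.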
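Removing a restriction site with a synonymous codon, as described above, can be illustrated in a few lines. The sketch below searches a coding sequence for the PstI recognition sequence (CTGCAG) and swaps one overlapping codon for a synonym so that the site disappears while the encoded protein is unchanged. The toy coding sequence and the function names are hypothetical; the actual change made by primers ThyA16 and ThyA17 is not reproduced here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

PSTI = "CTGCAG"

def translate(cds: str) -> str:
    """Translate an in-frame, upper-case coding sequence."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

def remove_site(cds: str, site: str = PSTI) -> str:
    """Remove a restriction site with a single synonymous codon change.

    Only one codon is changed, so this sketch raises an error if one
    change cannot clear every occurrence of the site.
    """
    pos = cds.find(site)
    if pos < 0:
        return cds
    first = pos // 3
    last = (pos + len(site) - 1) // 3
    for idx in range(first, last + 1):
        codon = cds[idx * 3: idx * 3 + 3]
        for alt, aa in CODON_TABLE.items():
            if alt != codon and aa == CODON_TABLE[codon]:
                candidate = cds[:idx * 3] + alt + cds[idx * 3 + 3:]
                if site not in candidate:
                    return candidate
    raise ValueError("no single synonymous change removes the site")

toy_cds = "ATGCTGCAGTAA"  # hypothetical: Met-Leu-Gln-stop, contains CTGCAG
fixed = remove_site(toy_cds)
print(fixed, translate(fixed))
```

In practice the choice among synonymous codons would also take host codon usage into account, which this sketch ignores.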
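Example 4. Replacement of the antibiotic resistance markers (StrR and SulR) of RSF1010mob- with the thyA* gene.
One of the isolated clones was used to isolate plasmid pET22(+) containing the modified thyA* gene. The plasmid DNA was digested with EcoRI and NotI for subcloning into the corresponding sites of the RSF1010mob- plasmid. The ligation mixture was transformed into the recipient strain MG1655(ΔthyA::Cmr), and ThyA+ transformants were isolated on minimal glucose medium without thymidine. The presence of the thyA* gene in the recombinant RSF1010 plasmids of the ThyA+ transformants was checked by PCR with primers ThyA1 and ThyA2, which flank the thyA* gene. All the ThyA+ transformants tested proved to be sensitive to streptomycin. These data indicate that in the newly isolated vector RSF1010mob-thyA* (without the PstI site), the StrR determinant encoded by the strA and strB genes (SEQ ID NOs: 2 and 4) has been replaced by the thyA* gene. For the next step, plasmid RSF1010mob-thyA* was digested with PstI and self-ligated to delete the SulR marker encoded by the sul gene (SEQ ID NO: 22). As a result, a new RSF1010mob-thyA* plasmid was isolated in which both antibiotic resistance markers (StrR and SulR) are replaced by the thyA* selective marker. The sequence of this RSF1010 derivative, which lacks the mob locus and all antibiotic resistance genes and contains the thymidylate synthase gene (thyA* gene) as a selective marker, is shown in SEQ ID NO: 27 of the Sequence Listing and in Figure 5. The new plasmid was named RSF1010-MT.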
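Example 5. Stability of the RSF1010-MT plasmid in a thyA-, tdk- recipient.
Strain MG1655(ΔthyA::Cmr) was transformed with the RSF1010-MT plasmid, and ThyA+ transformants were isolated on minimal medium without thymidine. According to the physical map of the E. coli chromosome, the tdk structural gene consists of 618 bp, and the corresponding protein, thymidine kinase, contains 205 amino acids (SEQ ID NOs: 46 and 47).
We then carried out P1 transduction using a phage P1 stock grown on the MG1655 strain carrying the tdk::KmR insertion in the chromosome (MG1655(Δtdk::Kmr)) and the recipient strain MG1655(ΔthyA::Cmr) harboring the RSF1010-MT plasmid. Kanamycin-resistant colonies were obtained, and the presence of the tdk::KmR insertion in the chromosome was checked by PCR.
After isolation of the MG1655(ΔthyA::Cmr, Δtdk::Kmr)/RSF1010-MT strain, the stability of the RSF1010-MT plasmid during propagation under non-selective conditions was studied. For this purpose, cells of MG1655(ΔthyA::Cmr, Δtdk::Kmr)/RSF1010-MT were cultivated in LB broth in test tubes at 37°C for 72 hours. Samples of the culture were then plated on LB plates, and single colonies that appeared after 24 h were replica-plated onto minimal medium without thymidine (200 colonies per culture). All 200 colonies derived from the MG1655(ΔthyA::Cmr, Δtdk::Kmr) strain containing the RSF1010-MT plasmid were able to grow on minimal medium without thymidine; that is, all the recombinants tested showed stable maintenance of the vector.
These data indicate that the thyA* gene cloned on the RSF1010mob- plasmid, used as a selective marker in place of antibiotic resistance markers, provides stable maintenance of the plasmid.
Although the invention has been described in detail with reference to its preferred embodiments, it will be apparent to those skilled in the art that various changes can be made, and equivalents employed, without departing from the scope of the invention.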
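Industrial Applicability
The present invention provides a broad-host-range RSF1010-derived Mob- plasmid that contains no antibiotic resistance genes. The RSF1010-derived Mob- plasmid of the present invention is useful for producing useful metabolites using bacteria.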
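Sequence Listing
<110>Ajinomoto Co., Inc.
<120>RSF1010-derived MOB-deficient plasmid containing no antibiotic resistance gene, bacterium comprising the plasmid, and method for producing useful metabolites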
<130>C261-C5063
<150>RU2004119027
<151>2004-06-24
<160>48
<170>PatentIn version 3.1
<210>1
<211>8684
<212>DNA
<213>Escherichia coli
<220>
<221>gene
<222>(63)..(866)
<223>strA
<220>
<221>gene
<222>(866)..(1702)
<223>strB
<220>
<221>gene
<222>(2767)..(3051)
<223>mobC
<220>
<221>gene
<222>(3250)..(5379)
<223>mobA
<220>
<221>gene
<222>(3998)..(4411)
<223>mobB
<220>
<221>gene
<222>(4408)..(5379)
<223>repB
<220>
<221>gene
<222>(5440)..(5652)
<223>orfE
<220>
<221>gene
<222>(5654)..(5860)
<223>orfF
<220>
<221>gene
<222>(5890)..(6729)
<223>repA
<220>
<221>gene
<222>(6716)..(7567)
<223>repC
<220>
<221>gene
<222>(7875)..(8663)
<223>sul
<400>1
aactgcacat tcgggatatt tctctatatt cgcgcttcat cagaaaactg aaggaacctc 60
cattgaatcg aactaatatt ttttttggtg aatcgcattc tgactggttg cctgtcagag 120
gcggagaatc tggtgatttt gtttttcgac gtggtgacgg gcatgccttc gcgaaaatcg 180
cacctgcttc ccgccgcggt gagctcgctg gagagcgtga ccgcctcatt tggctcaaag 240
gtcgaggtgt ggcttgcccc gaggtcatca actggcagga ggaacaggag ggtgcatgct 300
tggtgataac ggcaattccg ggagtaccgg cggctgatct gtctggagcg gatttgctca 360
aagcgtggcc gtcaatgggg cagcaacttg gcgctgttca cagcctatcg gttgatcaat 420
gtccgtttga gcgcaggctg tcgcgaatgt tcggacgcgc cgttgatgtg gtgtcccgca 480
atgccgtcaa tcccgacttc ttaccggacg aggacaagag tacgccgctg cacgatcttt 540
tggctcgtgt cgaacgagag ctaccggtgc ggctcgacca agagcgcacc gatatggttg 600
tttgccatgg tgatccctgc atgccgaact tcatggtgga ccctaaaact cttcaatgca 660
cgggtctgat cgaccttggg cggctcggaa cagcagatcg ctatgccgat ttggcactca 720
tgattgctaa cgccgaagag aactgggcag cgccagatga agcagagcgc gccttcgctg 780
tcctattcaa tgtattgggg atcgaagccc ccgaccgcga acgccttgcc ttctatctgc 840
gattggaccc tctgacttgg ggttgatgtt catgccgcct gtttttcctg ctcattggca 900
cgtttcgcaa cctgttctca ttgcggacac cttttccagc ctcgtttgga aagtttcatt 960
gccagacggg actcctgcaa tcgtcaaggg attgaaacct atagaagaca ttgctgatga 1020
actgcgcggg gccgactatc tggtatggcg caatgggagg ggagcagtcc ggttgctcgg 1080
tcgtgagaac aatctgatgt tgctcgaata tgccggggag cgaatgctct ctcacatcgt 1140
tgccgagcac ggcgactacc aggcgaccga aattgcagcg gaactaatgg cgaagctgta 1200
tgccgcatct gaggaacccc tgccttctgc ccttctcccg atccgggatc gctttgcagc 1260
tttgtttcag cgggcgcgcg atgatcaaaa cgcaggttgt caaactgact acgtccacgc 1320
ggcgattata gccgatcaaa tgatgagcaa tgcctcggaa ctgcgtgggc tacatggcga 1380
tctgcatcat gaaaacatca tgttctccag tcgcggctgg ctggtgatag atcccgtcgg 1440
tctggtcggt gaagtgggct ttggcgccgc caatatgttc tacgatccgg ctgacagaga 1500
cgacctttgt ctcgatccta gacgcattgc acagatggcg gacgcattct ctcgtgcgct 1560
ggacgtcgat ccgcgtcgcc tgctcgacca ggcgtacgct tatgggtgcc tttccgcagc 1620
ttggaacgcg gatggagaag aggagcaacg cgatctagct atcgcggccg cgatcaagca 1680
ggtgcgacag acgtcatact agatatcaag cgacttctcc tatcccctgg gaacacatca 1740
atctcaccgg agaatatcgc tggccaaagc cttagcgtag gattccgccc cttcccgcaa 1800
acgaccccaa acaggaaacg cagctgaaac gggaagctca acacccactg acgcatgggt 1860
tgttcaggca gtacttcatc aaccagcaag gcggcacttt cggccatccg ccgcgcccca 1920
cagctcgggc agaaaccgcg acgcttacag ctgaaagcga ccaggtgctc ggcgtggcaa 1980
gactcgcagc gaacccgtag aaagccatgc tccagccgcc cgcattggag aaattcttca 2040
aattcccgtt gcacatagcc cggcaattcc tttccctgct ctgccataag cgcagcgaat 2100
gccgggtaat actcgtcaac gatctgatag agaagggttt gctcgggtcg gtggctctgg 2160
taacgaccag tatcccgatc ccggctggcc gtcctggccg ccacatgagg catgttccgc 2220
gtccttgcaa tactgtgttt acatacagtc tatcgcttag cggaaagttc ttttaccctc 2280
agccgaaatg cctgccgttg ctagacattg ccagccagtg cccgtcactc ccgtactaac 2340
tgtcacgaac ccctgcaata actgtcacgc ccccctgcaa taactgtcac gaacccctgc 2400
aataactgtc acgcccccaa acctgcaaac ccagcagggg cgggggctgg cggggtgttg 2460
gaaaaatcca tccatgatta tctaagaata atccactagg cgcggttatc agcgcccttg 2520
tggggcgctg ctgcccttgc ccaatatgcc cggccagagg ccggatagct ggtctattcg 2580
ctgcgctagg ctacacaccg ccccaccgct gcgcggcagg gggaaaggcg ggcaaagccc 2640
gctaaacccc acaccaaacc ccgcagaaat acgctggagc gcttttagcc gctttagcgg 2700
cctttccccc tacccgaagg gtgggggcgc gtgtgcagcc ccgcagggcc tgtctcggtc 2760
gatcattcag cccggctcat ccttctggcg tggcggcaga ccgaacaagg cgcggtcgtg 2820
gtcgcgttca aggtacgcat ccattgccgc catgagccga tcctccggcc actcgctgct 2880
gttcaccttg gccaaaatca tggcccccac cagcaccttg cgccttgttt cgttcttgcg 2940
ctcttgctgc tgttcccttg cccgcacccg ctgaatttcg gcattgattc gcgctcgttg 3000
ttcttcgagc ttggccagcc gatccgccgc cttgttgctc cccttaacca tcttgacacc 3060
ccattgttaa tgtgctgtct cgtaggctat catggaggca cagcggcggc aatcccgacc 3120
ctactttgta ggggagggcg cacttaccgg tttctcttcg agaaactggc ctaacggcca 3180
cccttcgggc ggtgcgctct ccgagggcca ttgcatggag ccgaaaagca aaagcaacag 3240
cgaggcagca tggcgattta tcaccttacg gcgaaaaccg gcagcaggtc gggcggccaa 3300
tcggccaggg ccaaggccga ctacatccag cgcgaaggca agtatgcccg cgacatggat 3360
gaagtcttgc acgccgaatc cgggcacatg ccggagttcg tcgagcggcc cgccgactac 3420
tgggatgctg ccgacctgta tgaacgcgcc aatgggcggc tgttcaagga ggtcgaattt 3480
gccctgccgg tcgagctgac cctcgaccag cagaaggcgc tggcgtccga gttcgcccag 3540
cacctgaccg gtgccgagcg cctgccgtat acgctggcca tccatgccgg tggcggcgag 3600
aacccgcact gccacctgat gatctccgag cggatcaatg acggcatcga gcggcccgcc 3660
gctcagtggt tcaagcggta caacggcaag accccggaga agggcggggc acagaagacc 3720
gaagcgctca agcccaaggc atggcttgag cagacccgcg aggcatgggc cgaccatgcc 3780
aaccgggcat tagagcgggc tggccacgac gcccgcattg accacagaac acttgaggcg 3840
cagggcatcg agcgcctgcc cggtgttcac ctggggccga acgtggtgga gatggaaggc 3900
cggggcatcc gcaccgaccg ggcagacgtg gccctgaaca tcgacaccgc caacgcccag 3960
atcatcgact tacaggaata ccgggaggca atagaccatg aacgcaatcg acagagtgaa 4020
gaaatccaga ggcatcaacg agttagcgga gcagatcgaa ccgctggccc agagcatggc 4080
gacactggcc gacgaagccc ggcaggtcat gagccagacc cagcaggcca gcgaggcgca 4140
ggcggcggag tggctgaaag cccagcgcca gacaggggcg gcatgggtgg agctggccaa 4200
agagttgcgg gaggtagccg ccgaggtgag cagcgccgcg cagagcgccc ggagcgcgtc 4260
gcgggggtgg cactggaagc tatggctaac cgtgatgctg gcttccatga tgcctacggt 4320
ggtgctgctg atcgcatcgt tgctcttgct cgacctgacg ccactgacaa ccgaggacgg 4380
ctcgatctgg ctgcgcttgg tggcccgatg aagaacgaca ggactttgca ggccataggc 4440
cgacagctca aggccatggg ctgtgagcgc ttcgatatcg gcgtcaggga cgccaccacc 4500
ggccagatga tgaaccggga atggtcagcc gccgaagtgc tccagaacac gccatggctc 4560
aagcggatga atgcccaggg caatgacgtg tatatcaggc ccgccgagca ggagcggcat 4620
ggtctggtgc tggtggacga cctcagcgag tttgacctgg atgacatgaa agccgagggc 4680
cgggagcctg ccctggtagt ggaaaccagc ccgaagaact atcaggcatg ggtcaaggtg 4740
gccgacgccg caggcggtga acttcggggg cagattgccc ggacgctggc cagcgagtac 4800
gacgccgacc cggccagcgc cgacagccgc cactatggcc gcttggcggg cttcaccaac 4860
cgcaaggaca agcacaccac ccgcgccggt tatcagccgt gggtgctgct gcgtgaatcc 4920
aagggcaaga ccgccaccgc tggcccggcg ctggtgcagc aggctggcca gcagatcgag 4980
caggcccagc ggcagcagga gaaggcccgc aggctggcca gcctcgaact gcccgagcgg 5040
cagcttagcc gccaccggcg cacggcgctg gacgagtacc gcagcgagat ggccgggctg 5100
gtcaagcgct tcggtgatga cctcagcaag tgcgacttta tcgccgcgca gaagctggcc 5160
agccggggcc gcagtgccga ggaaatcggc aaggccatgg ccgaggccag cccagcgctg 5220
gcagagcgca agcccggcca cgaagcggat tacatcgagc gcaccgtcag caaggtcatg 5280
ggtctgccca gcgtccagct tgcgcgggcc gagctggcac gggcaccggc accccgccag 5340
cgaggcatgg acaggggcgg gccagatttc agcatgtagt gcttgcgttg gtactcacgc 5400
ctgttatact atgagtactc acgcacagaa gggggtttta tggaatacga aaaaagcgct 5460
tcagggtcgg tctacctgat caaaagtgac aagggctatt ggttgcccgg tggctttggt 5520
tatacgtcaa acaaggccga ggctggccgc ttttcagtcg ctgatatggc cagccttaac 5580
cttgacggct gcaccttgtc cttgttccgc gaagacaagc ctttcggccc cggcaagttt 5640
ctcggtgact gatatgaaag accaaaagga caagcagacc ggcgacctgc tggccagccc 5700
tgacgctgta cgccaagcgc gatatgccga gcgcatgaag gccaaaggga tgcgtcagcg 5760
caagttctgg ctgaccgacg acgaatacga ggcgctgcgc gagtgcctgg aagaactcag 5820
agcggcgcag ggcgggggta gtgaccccgc cagcgcctaa ccaccaactg cctgcaaagg 5880
aggcaatcaa tggctaccca taagcctatc aatattctgg aggcgttcgc agcagcgccg 5940
ccaccgctgg actacgtttt gcccaacatg gtggccggta cggtcggggc gctggtgtcg 6000
cccggtggtg ccggtaaatc catgctggcc ctgcaactgg ccgcacagat tgcaggcggg 6060
ccggatctgc tggaggtggg cgaactgccc accggcccgg tgatctacct gcccgccgaa 6120
gacccgccca ccgccattca tcaccgcctg cacgcccttg gggcgcacct cagcgccgag 6180
gaacggcaag ccgtggctga cggcctgctg atccagccgc tgatcggcag cctgcccaac 6240
atcatggccc cggagtggtt cgacggcctc aagcgcgccg ccgagggccg ccgcctgatg 6300
gtgctggaca cgctgcgccg gttccacatc gaggaagaaa acgccagcgg ccccatggcc 6360
caggtcatcg gtcgcatgga ggccatcgcc gccgataccg ggtgctctat cgtgttcctg 6420
caccatgcca gcaagggcgc ggccatgatg ggcgcaggcg accagcagca ggccagccgg 6480
ggcagctcgg tactggtcga taacatccgc tggcagtcct acctgtcgag catgaccagc 6540
gccgaggccg aggaatgggg tgtggacgac gaccagcgcc ggttcttcgt ccgcttcggt 6600
gtgagcaagg ccaactatgg cgcaccgttc gctgatcggt ggttcaggcg gcatgacggc 6660
ggggtgctca agcccgccgt gctggagagg cagcgcaaga gcaagggggt gccccgtggt 6720
gaagcctaag aacaagcaca gcctcagcca cgtccggcac gacccggcgc actgtctggc 6780
ccccggcctg ttccgtgccc tcaagcgggg cgagcgcaag cgcagcaagc tggacgtgac 6840
gtatgactac ggcgacggca agcggatcga gttcagcggc ccggagccgc tgggcgctga 6900
tgatctgcgc atcctgcaag ggctggtggc catggctggg cctaatggcc tagtgcttgg 6960
cccggaaccc aagaccgaag gcggacggca gctccggctg ttcctggaac ccaagtggga 7020
ggccgtcacc gctgaatgcc atgtggtcaa aggtagctat cgggcgctgg caaaggaaat 7080
cggggcagag gtcgatagtg gtggggcgct caagcacata caggactgca tcgagcgcct 7140
ttggaaggta tccatcatcg cccagaatgg ccgcaagcgg caggggtttc ggctgctgtc 7200
ggagtacgcc agcgacgagg cggacgggcg cctgtacgtg gccctgaacc ccttgatcgc 7260
gcaggccgtc atgggtggcg gccagcatgt gcgcatcagc atggacgagg tgcgggcgct 7320
ggacagcgaa accgcccgcc tgctgcacca gcggctgtgt ggctggatcg accccggcaa 7380
aaccggcaag gcttccatag ataccttgtg cggctatgtc tggccgtcag aggccagtgg 7440
ttcgaccatg cgcaagcgcc gccagcgggt gcgcgaggcg ttgccggagc tggtcgcgct 7500
gggctggacg gtaaccgagt tcgcggcggg caagtacgac atcacccggc ccaaggcggc 7560
aggctgaccc cccccactct attgtaaaca agacattttt atcttttata ttcaatggct 7620
tattttcctg ctaattggta ataccatgaa aaataccatg ctcagaaaag gcttaacaat 7680
attttgaaaa attgcctact gagcgctgcc gcacagctcc ataggccgct ttcctggctt 7740
tgcttccaga tgtatgctct tctgctcctg cagctaatgg atcaccgcaa acaggttact 7800
cgcctgggga ttccctttcg acccgagcat ccgtatgata ctcatgctcg attattatta 7860
ttatagaagc ccccatgaat aaatcgctca tcattttcgg catcgtcaac ataacctcgg 7920
acagtttctc cgatggaggc cggtatctgg cgccagacgc agccattgcg caggcgcgta 7980
agctgatggc cgagggggca gatgtgatcg acctggtccg gcatccagca atcccgacgc 8040
cgcgcctgtt tcgtccgaca cagaaatcgc gcgtatgcgc cggtgctgga cgcgctcagg 8100
cagatggcat tcccgtctcg ctcgacagtt atcaacccgc gacgcaagcc tatgccttgt 8160
cgcgtggtgt ggcctatctc aatgatattc gcggttttcc agacgctgcg ttctatccgc 8220
aattggcgaa atcatctgcc aaactcgtcg ttatgcattc ggtgcaagac gggcaggcag 8280
atcggcgcga ggcacccgct ggcgacatca tggatcacat tgcggcgttc tttgacgcgc 8340
gcatcgcggc gctgacgggt gccggtatca aacgcaaccg ccttgtcctt gatcccggca 8400
tggggttttt tctgggggct gctcccgaaa cctcgctctc ggtgctggcg cggttcgatg 8460
aattgcggct gcgcttcgat ttgccggtgc ttctgtctgt ttcgcgcaaa tcctttctgc 8520
gcgcgctcac aggccgtggt ccgggggtgt cggggccgcg acactcgctg cagagcttgc 8580
cgccgccgca ggtggagctg acttcatccg cacacacgag ccgcgcccct tgcgcgacgg 8640
gctggcggta ttggcggcgc tgaaagaaac cgcaagaatt cgtt 8684
<210>2
<211>804
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(804)
<223>strA
<400>2
ttg aat cga act aat att ttt ttt ggt gaa tcg cat tct gac tgg ttg 48
Leu Asn Arg Thr Asn Ile Phe Phe Gly Glu Ser His Ser Asp Trp Leu
1 5 10 15
cct gtc aga ggc gga gaa tct ggt gat ttt gtt ttt cga cgt ggt gac 96
Pro Val Arg Gly Gly Glu Ser Gly Asp Phe Val Phe Arg Arg Gly Asp
20 25 30
ggg cat gcc ttc gcg aaa atc gca cct gct tcc cgc cgc ggt gag ctc 144
Gly His Ala Phe Ala Lys Ile Ala Pro Ala Ser Arg Arg Gly Glu Leu
35 40 45
gct gga gag cgt gac cgc ctc att tgg ctc aaa ggt cga ggt gtg gct 192
Ala Gly Glu Arg Asp Arg Leu Ile Trp Leu Lys Gly Arg Gly Val Ala
50 55 60
tgc ccc gag gtc atc aac tgg cag gag gaa cag gag ggt gca tgc ttg 240
Cys Pro Glu Val Ile Asn Trp Gln Glu Glu Gln Glu Gly Ala Cys Leu
65 70 75 80
gtg ata acg gca att ccg gga gta ccg gcg gct gat ctg tct gga gcg 288
Val Ile Thr Ala Ile Pro Gly Val Pro Ala Ala Asp Leu Ser Gly Ala
85 90 95
gat ttg ctc aaa gcg tgg ccg tca atg ggg cag caa ctt ggc gct gtt 336
Asp Leu Leu Lys Ala Trp Pro Ser Met Gly Gln Gln Leu Gly Ala Val
100 105 110
cac agc cta tcg gtt gat caa tgt ccg ttt gag cgc agg ctg tcg cga 384
His Ser Leu Ser Val Asp Gln Cys Pro Phe Glu Arg Arg Leu Ser Arg
115 120 125
atg ttc gga cgc gcc gtt gat gtg gtg tcc cgc aat gcc gtc aat ccc 432
Met Phe Gly Arg Ala Val Asp Val Val Ser Arg Asn Ala Val Asn Pro
130 135 140
gac ttc tta ccg gac gag gac aag agt acg ccg ctg cac gat ctt ttg 480
Asp Phe Leu Pro Asp Glu Asp Lys Ser Thr Pro Leu His Asp Leu Leu
145 150 155 160
gct cgt gtc gaa cga gag cta ccg gtg cgg ctc gac caa gag cgc acc 528
Ala Arg Val Glu Arg Glu Leu Pro Val Arg Leu Asp Gln Glu Arg Thr
165 170 175
gat atg gtt gtt tgc cat ggt gat ccc tgc atg ccg aac ttc atg gtg 576
Asp Met Val Val Cys His Gly Asp Pro Cys Met Pro Asn Phe Met Val
180 185 190
gac cct aaa act ctt caa tgc acg ggt ctg atc gac ctt ggg cgg ctc 624
Asp Pro Lys Thr Leu Gln Cys Thr Gly Leu Ile Asp Leu Gly Arg Leu
195 200 205
gga aca gca gat cgc tat gcc gat ttg gca ctc atg att gct aac gcc 672
Gly Thr Ala Asp Arg Tyr Ala Asp Leu Ala Leu Met Ile Ala Asn Ala
210 215 220
gaa gag aac tgg gca gcg cca gat gaa gca gag cgc gcc ttc gct gtc 720
Glu Glu Asn Trp Ala Ala Pro Asp Glu Ala Glu Arg Ala Phe Ala Val
225 230 235 240
cta ttc aat gta ttg ggg atc gaa gcc ccc gac cgc gaa cgc ctt gcc 768
Leu Phe Asn Val Leu Gly Ile Glu Ala Pro Asp Arg Glu Arg Leu Ala
245 250 255
ttc tat ctg cga ttg gac cct ctg act tgg ggt tga 804
Phe Tyr Leu Arg Leu Asp Pro Leu Thr Trp Gly
260 265
<210>3
<211>267
<212>PRT
<213>Escherichia coli
<400>3
Leu Asn Arg Thr Asn Ile Phe Phe Gly Glu Ser His Ser Asp Trp Leu
1 5 10 15
Pro Val Arg Gly Gly Glu Ser Gly Asp Phe Val Phe Arg Arg Gly Asp
20 25 30
Gly His Ala Phe Ala Lys Ile Ala Pro Ala Ser Arg Arg Gly Glu Leu
35 40 45
Ala Gly Glu Arg Asp Arg Leu Ile Trp Leu Lys Gly Arg Gly Val Ala
50 55 60
Cys Pro Glu Val Ile Asn Trp Gln Glu Glu Gln Glu Gly Ala Cys Leu
65 70 75 80
Val Ile Thr Ala Ile Pro Gly Val Pro Ala Ala Asp Leu Ser Gly Ala
85 90 95
Asp Leu Leu Lys Ala Trp Pro Ser Met Gly Gln Gln Leu Gly Ala Val
100 105 110
His Ser Leu Ser Val Asp Gln Cys Pro Phe Glu Arg Arg Leu Ser Arg
115 120 125
Met Phe Gly Arg Ala Val Asp Val Val Ser Arg Asn Ala Val Asn Pro
130 135 140
Asp Phe Leu Pro Asp Glu Asp Lys Ser Thr Pro Leu His Asp Leu Leu
145 150 155 160
Ala Arg Val Glu Arg Glu Leu Pro Val Arg Leu Asp Gln Glu Arg Thr
165 170 175
Asp Met Val Val Cys His Gly Asp Pro Cys Met Pro Asn Phe Met Val
180 185 190
Asp Pro Lys Thr Leu Gln Cys Thr Gly Leu Ile Asp Leu Gly Arg Leu
195 200 205
Gly Thr Ala Asp Arg Tyr Ala Asp Leu Ala Leu Met Ile Ala Asn Ala
210 215 220
Glu Glu Asn Trp Ala Ala Pro Asp Glu Ala Glu Arg Ala Phe Ala Val
225 230 235 240
Leu Phe Asn Val Leu Gly Ile Glu Ala Pro Asp Arg Glu Arg Leu Ala
245 250 255
Phe Tyr Leu Arg Leu Asp Pro Leu Thr Trp Gly
260 265
<210>4
<211>837
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(837)
<223>strB
<400>4
atg ttc atg ccg cct gtt ttt cct gct cat tgg cac gtt tcg caa cct 48
Met Phe Met Pro Pro Val Phe Pro Ala His Trp His Val Ser Gln Pro
1 5 10 15
gtt ctc att gcg gac acc ttt tcc agc ctc gtt tgg aaa gtt tca ttg 96
Val Leu Ile Ala Asp Thr Phe Ser Ser Leu Val Trp Lys Val Ser Leu
20 25 30
cca gac ggg act cct gca atc gtc aag gga ttg aaa cct ata gaa gac 144
Pro Asp Gly Thr Pro Ala Ile Val Lys Gly Leu Lys Pro Ile Glu Asp
35 40 45
att gct gat gaa ctg cgc ggg gcc gac tat ctg gta tgg cgc aat ggg 192
Ile Ala Asp Glu Leu Arg Gly Ala Asp Tyr Leu Val Trp Arg Asn Gly
50 55 60
agg gga gca gtc cgg ttg ctc ggt cgt gag aac aat ctg atg ttg ctc 240
Arg Gly Ala Val Arg Leu Leu Gly Arg Glu Asn Asn Leu Met Leu Leu
65 70 75 80
gaa tat gcc ggg gag cga atg ctc tct cac atc gtt gcc gag cac ggc 288
Glu Tyr Ala Gly Glu Arg Met Leu Ser His Ile Val Ala Glu His Gly
85 90 95
gac tac cag gcg acc gaa att gca gcg gaa cta atg gcg aag ctg tat 336
Asp Tyr Gln Ala Thr Glu Ile Ala Ala Glu Leu Met Ala Lys Leu Tyr
100 105 110
gcc gca tct gag gaa ccc ctg cct tct gcc ctt ctc ccg atc cgg gat 384
Ala Ala Ser Glu Glu Pro Leu Pro Ser Ala Leu Leu Pro Ile Arg Asp
115 120 125
cgc ttt gca gct ttg ttt cag cgg gcg cgc gat gat caa aac gca ggt 432
Arg Phe Ala Ala Leu Phe Gln Arg Ala Arg Asp Asp Gln Asn Ala Gly
130 135 140
tgt caa act gac tac gtc cac gcg gcg att ata gcc gat caa atg atg 480
Cys Gln Thr Asp Tyr Val His Ala Ala Ile Ile Ala Asp Gln Met Met
145 150 155 160
agc aat gcc tcg gaa ctg cgt ggg cta cat ggc gat ctg cat cat gaa 528
Ser Asn Ala Ser Glu Leu Arg Gly Leu His Gly Asp Leu His His Glu
165 170 175
aac atc atg ttc tcc agt cgc ggc tgg ctg gtg ata gat ccc gtc ggt 576
Asn Ile Met Phe Ser Ser Arg Gly Trp Leu Val Ile Asp Pro Val Gly
180 185 190
ctg gtc ggt gaa gtg ggc ttt ggc gcc gcc aat atg ttc tac gat ccg 624
Leu Val Gly Glu Val Gly Phe Gly Ala Ala Asn Met Phe Tyr Asp Pro
195 200 205
gct gac aga gac gac ctt tgt ctc gat cct aga cgc att gca cag atg 672
Ala Asp Arg Asp Asp Leu Cys Leu Asp Pro Arg Arg Ile Ala Gln Met
210 215 220
gcg gac gca ttc tct cgt gcg ctg gac gtc gat ccg cgt cgc ctg ctc 720
Ala Asp Ala Phe Ser Arg Ala Leu Asp Val Asp Pro Arg Arg Leu Leu
225 230 235 240
gac cag gcg tac gct tat ggg tgc ctt tcc gca gct tgg aac gcg gat 768
Asp Gln Ala Tyr Ala Tyr Gly Cys Leu Ser Ala Ala Trp Asn Ala Asp
245 250 255
gga gaa gag gag caa cgc gat cta gct atc gcg gcc gcg atc aag cag 816
Gly Glu Glu Glu Gln Arg Asp Leu Ala Ile Ala Ala Ala Ile Lys Gln
260 265 270
gtg cga cag acg tca tac tag 837
Val Arg Gln Thr Ser Tyr
275
<210>5
<211>278
<212>PRT
<213>Escherichia coli
<400>5
Met Phe Met Pro Pro Val Phe Pro Ala His Trp His Val Ser Gln Pro
1 5 10 15
Val Leu Ile Ala Asp Thr Phe Ser Ser Leu Val Trp Lys Val Ser Leu
20 25 30
Pro Asp Gly Thr Pro Ala Ile Val Lys Gly Leu Lys Pro Ile Glu Asp
35 40 45
Ile Ala Asp Glu Leu Arg Gly Ala Asp Tyr Leu Val Trp Arg Asn Gly
50 55 60
Arg Gly Ala Val Arg Leu Leu Gly Arg Glu Asn Asn Leu Met Leu Leu
65 70 75 80
Glu Tyr Ala Gly Glu Arg Met Leu Ser His Ile Val Ala Glu His Gly
85 90 95
Asp Tyr Gln Ala Thr Glu Ile Ala Ala Glu Leu Met Ala Lys Leu Tyr
100 105 110
Ala Ala Ser Glu Glu Pro Leu Pro Ser Ala Leu Leu Pro Ile Arg Asp
115 120 125
Arg Phe Ala Ala Leu Phe Gln Arg Ala Arg Asp Asp Gln Asn Ala Gly
130 135 140
Cys Gln Thr Asp Tyr Val His Ala Ala Ile Ile Ala Asp Gln Met Met
145 150 155 160
Ser Asn Ala Ser Glu Leu Arg Gly Leu His Gly Asp Leu His His Glu
165 170 175
Asn Ile Met Phe Ser Ser Arg Gly Trp Leu Val Ile Asp Pro Val Gly
180 185 190
Leu Val Gly Glu Val Gly Phe Gly Ala Ala Asn Met Phe Tyr Asp Pro
195 200 205
Ala Asp Arg Asp Asp Leu Cys Leu Asp Pro Arg Arg Ile Ala Gln Met
210 215 220
Ala Asp Ala Phe Ser Arg Ala Leu Asp Val Asp Pro Arg Arg Leu Leu
225 230 235 240
Asp Gln Ala Tyr Ala Tyr Gly Cys Leu Ser Ala Ala Trp Asn Ala Asp
245 250 255
Gly Glu Glu Glu Gln Arg Asp Leu Ala Ile Ala Ala Ala Ile Lys Gln
260 265 270
Val Arg Gln Thr Ser Tyr
275
<210>6
<211>285
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(285)
<223>mobC
<400>6
atg gtt aag ggg agc aac aag gcg gcg gat cgg ctg gcc aag ctc gaa 48
Met Val Lys Gly Ser Asn Lys Ala Ala Asp Arg Leu Ala Lys Leu Glu
1 5 10 15
gaa caa cga gcg cga atc aat gcc gaa att cag cgg gtg cgg gca agg 96
Glu Gln Arg Ala Arg Ile Asn Ala Glu Ile Gln Arg Val Arg Ala Arg
20 25 30
gaa cag cag caa gag cgc aag aac gaa aca agg cgc aag gtg ctg gtg 144
Glu Gln Gln Gln Glu Arg Lys Asn Glu Thr Arg Arg Lys Val Leu Val
35 40 45
ggg gcc atg att ttg gcc aag gtg aac agc agc gag tgg ccg gag gat 192
Gly Ala Met Ile Leu Ala Lys Val Asn Ser Ser Glu Trp Pro Glu Asp
50 55 60
cgg ctc atg gcg gca atg gat gcg tac ctt gaa cgc gac cac gac cgc 240
Arg Leu Met Ala Ala Met Asp Ala Tyr Leu Glu Arg Asp His Asp Arg
65 70 75 80
gcc ttg ttc ggt ctg ccg cca cgc cag aag gat gag ccg ggc tga 285
Ala Leu Phe Gly Leu Pro Pro Arg Gln Lys Asp Glu Pro Gly
85 90
<210>7
<211>94
<212>PRT
<213>Escherichia coli
<400>7
Met Val Lys Gly Ser Asn Lys Ala Ala Asp Arg Leu Ala Lys Leu Glu
1 5 10 15
Glu Gln Arg Ala Arg Ile Asn Ala Glu Ile Gln Arg Val Arg Ala Arg
20 25 30
Glu Gln Gln Gln Glu Arg Lys Asn Glu Thr Arg Arg Lys Val Leu Val
35 40 45
Gly Ala Met Ile Leu Ala Lys Val Asn Ser Ser Glu Trp Pro Glu Asp
50 55 60
Arg Leu Met Ala Ala Met Asp Ala Tyr Leu Glu Arg Asp His Asp Arg
65 70 75 80
Ala Leu Phe Gly Leu Pro Pro Arg Gln Lys Asp Glu Pro Gly
85 90
<210>8
<211>2130
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(2130)
<223>mobA
<400>8
atg gcg att tat cac ctt acg gcg aaa acc ggc agc agg tcg ggc ggc 48
Met Ala Ile Tyr His Leu Thr Ala Lys Thr Gly Ser Arg Ser Gly Gly
1 5 10 15
caa tcg gcc agg gcc aag gcc gac tac atc cag cgc gaa ggc aag tat 96
Gln Ser Ala Arg Ala Lys Ala Asp Tyr Ile Gln Arg Glu Gly Lys Tyr
20 25 30
gcc cgc gac atg gat gaa gtc ttg cac gcc gaa tcc ggg cac atg ccg 144
Ala Arg Asp Met Asp Glu Val Leu His Ala Glu Ser Gly His Met Pro
35 40 45
gag ttc gtc gag cgg ccc gcc gac tac tgg gat gct gcc gac ctg tat 192
Glu Phe Val Glu Arg Pro Ala Asp Tyr Trp Asp Ala Ala Asp Leu Tyr
50 55 60
gaa cgc gcc aat ggg cgg ctg ttc aag gag gtc gaa ttt gcc ctg ccg 240
Glu Arg Ala Asn Gly Arg Leu Phe Lys Glu Val Glu Phe Ala Leu Pro
65 70 75 80
gtc gag ctg acc ctc gac cag cag aag gcg ctg gcg tcc gag ttc gcc 288
Val Glu Leu Thr Leu Asp Gln Gln Lys Ala Leu Ala Ser Glu Phe Ala
85 90 95
cag cac ctg acc ggt gcc gag cgc ctg ccg tat acg ctg gcc atc cat 336
Gln His Leu Thr Gly Ala Glu Arg Leu Pro Tyr Thr Leu Ala Ile His
100 105 110
gcc ggt ggc ggc gag aac ccg cac tgc cac ctg atg atc tcc gag cgg 384
Ala Gly Gly Gly Glu Asn Pro His Cys His Leu Met Ile Ser Glu Arg
115 120 125
atc aat gac ggc atc gag cgg ccc gcc gct cag tgg ttc aag cgg tac 432
Ile Asn Asp Gly Ile Glu Arg Pro Ala Ala Gln Trp Phe Lys Arg Tyr
130 135 140
aac ggc aag acc ccg gag aag ggc ggg gca cag aag acc gaa gcg ctc 480
Asn Gly Lys Thr Pro Glu Lys Gly Gly Ala Gln Lys Thr Glu Ala Leu
145 150 155 160
aag ccc aag gca tgg ctt gag cag acc cgc gag gca tgg gcc gac cat 528
Lys Pro Lys Ala Trp Leu Glu Gln Thr Arg Glu Ala Trp Ala Asp His
165 170 175
gcc aac cgg gca tta gag cgg gct ggc cac gac gcc cgc att gac cac 576
Ala Asn Arg Ala Leu Glu Arg Ala Gly His Asp Ala Arg Ile Asp His
180 185 190
aga aca ctt gag gcg cag ggc atc gag cgc ctg ccc ggt gtt cac ctg 624
Arg Thr Leu Glu Ala Gln Gly Ile Glu Arg Leu Pro Gly Val His Leu
195 200 205
ggg ccg aac gtg gtg gag atg gaa ggc cgg ggc atc cgc acc gac cgg 672
Gly Pro Asn Val Val Glu Met Glu Gly Arg Gly Ile Arg Thr Asp Arg
210 215 220
gca gac gtg gcc ctg aac atc gac acc gcc aac gcc cag atc atc gac 720
Ala Asp Val Ala Leu Asn Ile Asp Thr Ala Asn Ala Gln Ile Ile Asp
225 230 235 240
tta cag gaa tac cgg gag gca ata gac cat gaa cgc aat cga cag agt 768
Leu Gln Glu Tyr Arg Glu Ala Ile Asp His Glu Arg Asn Arg Gln Ser
245 250 255
gaa gaa atc cag agg cat caa cga gtt agc gga gca gat cga acc gct 816
Glu Glu Ile Gln Arg His Gln Arg Val Ser Gly Ala Asp Arg Thr Ala
260 265 270
ggc cca gag cat ggc gac act ggc cga cga agc ccg gca ggt cat gag 864
Gly Pro Glu His Gly Asp Thr Gly Arg Arg Ser Pro Ala Gly His Glu
275 280 285
cca gac cca gca ggc cag cga ggc gca ggc ggc gga gtg gct gaa agc 912
Pro Asp Pro Ala Gly Gln Arg Gly Ala Gly Gly Gly Val Ala Glu Ser
290 295 300
cca gcg cca gac agg ggc ggc atg ggt gga gct ggc caa aga gtt gcg 960
Pro Ala Pro Asp Arg Gly Gly Met Gly Gly Ala Gly Gln Arg Val Ala
305 310 315 320
gga ggt agc cgc cga ggt gag cag cgc cgc gca gag cgc ccg gag cgc 1008
Gly Gly Ser Arg Arg Gly Glu Gln Arg Arg Ala Glu Arg Pro Glu Arg
325 330 335
gtc gcg ggg gtg gca ctg gaa gct atg gct aac cgt gat gct ggc ttc 1056
Val Ala Gly Val Ala Leu Glu Ala Met Ala Asn Arg Asp Ala Gly Phe
340 345 350
cat gat gcc tac ggt ggt gct gct gat cgc atc gtt gct ctt gct cga 1104
His Asp Ala Tyr Gly Gly Ala Ala Asp Arg Ile Val Ala Leu Ala Arg
355 360 365
cct gac gcc act gac aac cga gga cgg ctc gat ctg gct gcg ctt ggt 1152
Pro Asp Ala Thr Asp Asn Arg Gly Arg Leu Asp Leu Ala Ala Leu Gly
370 375 380
ggc ccg atg aag aac gac agg act ttg cag gcc ata ggc cga cag ctc 1200
Gly Pro Met Lys Asn Asp Arg Thr Leu Gln Ala Ile Gly Arg Gln Leu
385 390 395 400
aag gcc atg ggc tgt gag cgc ttc gat atc ggc gtc agg gac gcc acc 1248
Lys Ala Met Gly Cys Glu Arg Phe Asp Ile Gly Val Arg Asp Ala Thr
405 410 415
acc ggc cag atg atg aac cgg gaa tgg tca gcc gcc gaa gtg ctc cag 1296
Thr Gly Gln Met Met Asn Arg Glu Trp Ser Ala Ala Glu Val Leu Gln
420 425 430
aac acg cca tgg ctc aag cgg atg aat gcc cag ggc aat gac gtg tat 1344
Asn Thr Pro Trp Leu Lys Arg Met Asn Ala Gln Gly Asn Asp Val Tyr
435 440 445
atc agg ccc gcc gag cag gag cgg cat ggt ctg gtg ctg gtg gac gac 1392
Ile Arg Pro Ala Glu Gln Glu Arg His Gly Leu Val Leu Val Asp Asp
450 455 460
ctc agc gag ttt gac ctg gat gac atg aaa gcc gag ggc cgg gag cct 1440
Leu Ser Glu Phe Asp Leu Asp Asp Met Lys Ala Glu Gly Arg Glu Pro
465 470 475 480
gcc ctg gta gtg gaa acc agc ccg aag aac tat cag gca tgg gtc aag 1488
Ala Leu Val Val Glu Thr Ser Pro Lys Asn Tyr Gln Ala Trp Val Lys
485 490 495
gtg gcc gac gcc gca ggc ggt gaa ctt cgg ggg cag att gcc cgg acg 1536
Val Ala Asp Ala Ala Gly Gly Glu Leu Arg Gly Gln Ile Ala Arg Thr
500 505 510
ctg gcc agc gag tac gac gcc gac ccg gcc agc gcc gac agc cgc cac 1584
Leu Ala Ser Glu Tyr Asp Ala Asp Pro Ala Ser Ala Asp Ser Arg His
515 520 525
tat ggc cgc ttg gcg ggc ttc acc aac cgc aag gac aag cac acc acc 1632
Tyr Gly Arg Leu Ala Gly Phe Thr Asn Arg Lys Asp Lys His Thr Thr
530 535 540
cgc gcc ggt tat cag ccg tgg gtg ctg ctg cgt gaa tcc aag ggc aag 1680
Arg Ala Gly Tyr Gln Pro Trp Val Leu Leu Arg Glu Ser Lys Gly Lys
545 550 555 560
acc gcc acc gct ggc ccg gcg ctg gtg cag cag gct ggc cag cag atc 1728
Thr Ala Thr Ala Gly Pro Ala Leu Val Gln Gln Ala Gly Gln Gln Ile
565 570 575
gag cag gcc cag cgg cag cag gag aag gcc cgc agg ctg gcc agc ctc 1776
Glu Gln Ala Gln Arg Gln Gln Glu Lys Ala Arg Arg Leu Ala Ser Leu
580 585 590
gaa ctg ccc gag cgg cag ctt agc cgc cac cgg cgc acg gcg ctg gac 1824
Glu Leu Pro Glu Arg Gln Leu Ser Arg His Arg Arg Thr Ala Leu Asp
595 600 605
gag tac cgc agc gag atg gcc ggg ctg gtc aag cgc ttc ggt gat gac 1872
Glu Tyr Arg Ser Glu Met Ala Gly Leu Val Lys Arg Phe Gly Asp Asp
610 615 620
ctc agc aag tgc gac ttt atc gcc gcg cag aag ctg gcc agc cgg ggc 1920
Leu Ser Lys Cys Asp Phe Ile Ala Ala Gln Lys Leu Ala Ser Arg Gly
625 630 635 640
cgc agt gcc gag gaa atc ggc aag gcc atg gcc gag gcc agc cca gcg 1968
Arg Ser Ala Glu Glu Ile Gly Lys Ala Met Ala Glu Ala Ser Pro Ala
645 650 655
ctg gca gag cgc aag ccc ggc cac gaa gcg gat tac atc gag cgc acc 2016
Leu Ala Glu Arg Lys Pro Gly His Glu Ala Asp Tyr Ile Glu Arg Thr
660 665 670
gtc agc aag gtc atg ggt ctg ccc agc gtc cag ctt gcg cgg gcc gag 2064
Val Ser Lys Val Met Gly Leu Pro Ser Val Gln Leu Ala Arg Ala Glu
675 680 685
ctg gca cgg gca ccg gca ccc cgc cag cga ggc atg gac agg ggc ggg 2112
Leu Ala Arg Ala Pro Ala Pro Arg Gln Arg Gly Met Asp Arg Gly Gly
690 695 700
cca gat ttc agc atg tag 2130
Pro Asp Phe Ser Met
705
<210>9
<211>709
<212>PRT
<213>Escherichia coli
<400>9
Met Ala Ile Tyr His Leu Thr Ala Lys Thr Gly Ser Arg Ser Gly Gly
1 5 10 15
Gln Ser Ala Arg Ala Lys Ala Asp Tyr Ile Gln Arg Glu Gly Lys Tyr
20 25 30
Ala Arg Asp Met Asp Glu Val Leu His Ala Glu Ser Gly His Met Pro
35 40 45
Glu Phe Val Glu Arg Pro Ala Asp Tyr Trp Asp Ala Ala Asp Leu Tyr
50 55 60
Glu Arg Ala Asn Gly Arg Leu Phe Lys Glu Val Glu Phe Ala Leu Pro
65 70 75 80
Val Glu Leu Thr Leu Asp Gln Gln Lys Ala Leu Ala Ser Glu Phe Ala
85 90 95
Gln His Leu Thr Gly Ala Glu Arg Leu Pro Tyr Thr Leu Ala Ile His
100 105 110
Ala Gly Gly Gly Glu Asn Pro His Cys His Leu Met Ile Ser Glu Arg
115 120 125
Ile Asn Asp Gly Ile Glu Arg Pro Ala Ala Gln Trp Phe Lys Arg Tyr
130 135 140
Asn Gly Lys Thr Pro Glu Lys Gly Gly Ala Gln Lys Thr Glu Ala Leu
145 150 155 160
Lys Pro Lys Ala Trp Leu Glu Gln Thr Arg Glu Ala Trp Ala Asp His
165 170 175
Ala Asn Arg Ala Leu Glu Arg Ala Gly His Asp Ala Arg Ile Asp His
180 185 190
Arg Thr Leu Glu Ala Gln Gly Ile Glu Arg Leu Pro Gly Val His Leu
195 200 205
Gly Pro Asn Val Val Glu Met Glu Gly Arg Gly Ile Arg Thr Asp Arg
210 215 220
Ala Asp Val Ala Leu Asn Ile Asp Thr Ala Asn Ala Gln Ile Ile Asp
225 230 235 240
Leu Gln Glu Tyr Arg Glu Ala Ile Asp His Glu Arg Asn Arg Gln Ser
245 250 255
Glu Glu Ile Gln Arg His Gln Arg Val Ser Gly Ala Asp Arg Thr Ala
260 265 270
Gly Pro Glu His Gly Asp Thr Gly Arg Arg Ser Pro Ala Gly His Glu
275 280 285
Pro Asp Pro Ala Gly Gln Arg Gly Ala Gly Gly Gly Val Ala Glu Ser
290 295 300
Pro Ala Pro Asp Arg Gly Gly Met Gly Gly Ala Gly Gln Arg Val Ala
305 310 315 320
Gly Gly Ser Arg Arg Gly Glu Gln Arg Arg Ala Glu Arg Pro Glu Arg
325 330 335
Val Ala Gly Val Ala Leu Glu Ala Met Ala Asn Arg Asp Ala Gly Phe
340 345 350
His Asp Ala Tyr Gly Gly Ala Ala Asp Arg Ile Val Ala Leu Ala Arg
355 360 365
Pro Asp Ala Thr Asp Asn Arg Gly Arg Leu Asp Leu Ala Ala Leu Gly
370 375 380
Gly Pro Met Lys Asn Asp Arg Thr Leu Gln Ala Ile Gly Arg Gln Leu
385 390 395 400
Lys Ala Met Gly Cys Glu Arg Phe Asp Ile Gly Val Arg Asp Ala Thr
405 410 415
Thr Gly Gln Met Met Asn Arg Glu Trp Ser Ala Ala Glu Val Leu Gln
420 425 430
Asn Thr Pro Trp Leu Lys Arg Met Asn Ala Gln Gly Asn Asp Val Tyr
435 440 445
Ile Arg Pro Ala Glu Gln Glu Arg His Gly Leu Val Leu Val Asp Asp
450 455 460
Leu Ser Glu Phe Asp Leu Asp Asp Met Lys Ala Glu Gly Arg Glu Pro
465 470 475 480
Ala Leu Val Val Glu Thr Ser Pro Lys Asn Tyr Gln Ala Trp Val Lys
485 490 495
Val Ala Asp Ala Ala Gly Gly Glu Leu Arg Gly Gln Ile Ala Arg Thr
500 505 510
Leu Ala Ser Glu Tyr Asp Ala Asp Pro Ala Ser Ala Asp Ser Arg His
515 520 525
Tyr Gly Arg Leu Ala Gly Phe Thr Asn Arg Lys Asp Lys His Thr Thr
530 535 540
Arg Ala Gly Tyr Gln Pro Trp Val Leu Leu Arg Glu Ser Lys Gly Lys
545 550 555 560
Thr Ala Thr Ala Gly Pro Ala Leu Val Gln Gln Ala Gly Gln Gln Ile
565 570 575
Glu Gln Ala Gln Arg Gln Gln Glu Lys Ala Arg Arg Leu Ala Ser Leu
580 585 590
Glu Leu Pro Glu Arg Gln Leu Ser Arg His Arg Arg Thr Ala Leu Asp
595 600 605
Glu Tyr Arg Ser Glu Met Ala Gly Leu Val Lys Arg Phe Gly Asp Asp
610 615 620
Leu Ser Lys Cys Asp Phe Ile Ala Ala Gln Lys Leu Ala Ser Arg Gly
625 630 635 640
Arg Ser Ala Glu Glu Ile Gly Lys Ala Met Ala Glu Ala Ser Pro Ala
645 650 655
Leu Ala Glu Arg Lys Pro Gly His Glu Ala Asp Tyr Ile Glu Arg Thr
660 665 670
Val Ser Lys Val Met Gly Leu Pro Ser Val Gln Leu Ala Arg Ala Glu
675 680 685
Leu Ala Arg Ala Pro Ala Pro Arg Gln Arg Gly Met Asp Arg Gly Gly
690 695 700
Pro Asp Phe Ser Met
705
<210>10
<211>414
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(414)
<223>mobB
<400>10
atg aac gca atc gac aga gtg aag aaa tcc aga ggc atc aac gag tta 48
Met Asn Ala Ile Asp Arg Val Lys Lys Ser Arg Gly Ile Asn Glu Leu
1 5 10 15
gcg gag cag atc gaa ccg ctg gcc cag agc atg gcg aca ctg gcc gac 96
Ala Glu Gln Ile Glu Pro Leu Ala Gln Ser Met Ala Thr Leu Ala Asp
20 25 30
gaa gcc cgg cag gtc atg agc cag acc cag cag gcc agc gag gcg cag 144
Glu Ala Arg Gln Val Met Ser Gln Thr Gln Gln Ala Ser Glu Ala Gln
35 40 45
gcg gcg gag tgg ctg aaa gcc cag cgc cag aca ggg gcg gca tgg gtg 192
Ala Ala Glu Trp Leu Lys Ala Gln Arg Gln Thr Gly Ala Ala Trp Val
50 55 60
gag ctg gcc aaa gag ttg cgg gag gta gcc gcc gag gtg agc agc gcc 240
Glu Leu Ala Lys Glu Leu Arg Glu Val Ala Ala Glu Val Ser Ser Ala
65 70 75 80
gcg cag agc gcc cgg agc gcg tcg cgg ggg tgg cac tgg aag cta tgg 288
Ala Gln Ser Ala Arg Ser Ala Ser Arg Gly Trp His Trp Lys Leu Trp
85 90 95
cta acc gtg atg ctg gct tcc atg atg cct acg gtg gtg ctg ctg atc 336
Leu Thr Val Met Leu Ala Ser Met Met Pro Thr Val Val Leu Leu Ile
100 105 110
gca tcg ttg ctc ttg ctc gac ctg acg cca ctg aca acc gag gac ggc 384
Ala Ser Leu Leu Leu Leu Asp Leu Thr Pro Leu Thr Thr Glu Asp Gly
115 120 125
tcg atc tgg ctg cgc ttg gtg gcc cga tga 414
Ser Ile Trp Leu Arg Leu Val Ala Arg
130 135
<210>11
<211>137
<212>PRT
<213>Escherichia coli
<400>11
Met Asn Ala Ile Asp Arg Val Lys Lys Ser Arg Gly Ile Asn Glu Leu
1 5 10 15
Ala Glu Gln Ile Glu Pro Leu Ala Gln Ser Met Ala Thr Leu Ala Asp
20 25 30
Glu Ala Arg Gln Val Met Ser Gln Thr Gln Gln Ala Ser Glu Ala Gln
35 40 45
Ala Ala Glu Trp Leu Lys Ala Gln Arg Gln Thr Gly Ala Ala Trp Val
50 55 60
Glu Leu Ala Lys Glu Leu Arg Glu Val Ala Ala Glu Val Ser Ser Ala
65 70 75 80
Ala Gln Ser Ala Arg Ser Ala Ser Arg Gly Trp His Trp Lys Leu Trp
85 90 95
Leu Thr Val Met Leu Ala Ser Met Met Pro Thr Val Val Leu Leu Ile
100 105 110
Ala Ser Leu Leu Leu Leu Asp Leu Thr Pro Leu Thr Thr Glu Asp Gly
115 120 125
Ser Ile Trp Leu Arg Leu Val Ala Arg
130 135
<210>12
<211>972
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(972)
<223>repB
<400>12
atg aag aac gac agg act ttg cag gcc ata ggc cga cag ctc aag gcc 48
Met Lys Asn Asp Arg Thr Leu Gln Ala Ile Gly Arg Gln Leu Lys Ala
1 5 10 15
atg ggc tgt gag cgc ttc gat atc ggc gtc agg gac gcc acc acc ggc 96
Met Gly Cys Glu Arg Phe Asp Ile Gly Val Arg Asp Ala Thr Thr Gly
20 25 30
cag atg atg aac cgg gaa tgg tca gcc gcc gaa gtg ctc cag aac acg 144
Gln Met Met Asn Arg Glu Trp Ser Ala Ala Glu Val Leu Gln Asn Thr
35 40 45
cca tgg ctc aag cgg atg aat gcc cag ggc aat gac gtg tat atc agg 192
Pro Trp Leu Lys Arg Met Asn Ala Gln Gly Asn Asp Val Tyr Ile Arg
50 55 60
ccc gcc gag cag gag cgg cat ggt ctg gtg ctg gtg gac gac ctc agc 240
Pro Ala Glu Gln Glu Arg His Gly Leu Val Leu Val Asp Asp Leu Ser
65 70 75 80
gag ttt gac ctg gat gac atg aaa gcc gag ggc cgg gag cct gcc ctg 288
Glu Phe Asp Leu Asp Asp Met Lys Ala Glu Gly Arg Glu Pro Ala Leu
85 90 95
gta gtg gaa acc agc ccg aag aac tat cag gca tgg gtc aag gtg gcc 336
Val Val Glu Thr Ser Pro Lys Asn Tyr Gln Ala Trp Val Lys Val Ala
100 105 110
gac gcc gca ggc ggt gaa ctt cgg ggg cag att gcc cgg acg ctg gcc 384
Asp Ala Ala Gly Gly Glu Leu Arg Gly Gln Ile Ala Arg Thr Leu Ala
115 120 125
agc gag tac gac gcc gac ccg gcc agc gcc gac agc cgc cac tat ggc 432
Ser Glu Tyr Asp Ala Asp Pro Ala Ser Ala Asp Ser Arg His Tyr Gly
130 135 140
cgc ttg gcg ggc ttc acc aac cgc aag gac aag cac acc acc cgc gcc 480
Arg Leu Ala Gly Phe Thr Asn Arg Lys Asp Lys His Thr Thr Arg Ala
145 150 155 160
ggt tat cag ccg tgg gtg ctg ctg cgt gaa tcc aag ggc aag acc gcc 528
Gly Tyr Gln Pro Trp Val Leu Leu Arg Glu Ser Lys Gly Lys Thr Ala
165 170 175
acc gct ggc ccg gcg ctg gtg cag cag gct ggc cag cag atc gag cag 576
Thr Ala Gly Pro Ala Leu Val Gln Gln Ala Gly Gln Gln Ile Glu Gln
180 185 190
gcc cag cgg cag cag gag aag gcc cgc agg ctg gcc agc ctc gaa ctg 624
Ala Gln Arg Gln Gln Glu Lys Ala Arg Arg Leu Ala Ser Leu Glu Leu
195 200 205
ccc gag cgg cag ctt agc cgc cac cgg cgc acg gcg ctg gac gag tac 672
Pro Glu Arg Gln Leu Ser Arg His Arg Arg Thr Ala Leu Asp Glu Tyr
210 215 220
cgc agc gag atg gcc ggg ctg gtc aag cgc ttc ggt gat gac ctc agc 720
Arg Ser Glu Met Ala Gly Leu Val Lys Arg Phe Gly Asp Asp Leu Ser
225 230 235 240
aag tgc gac ttt atc gcc gcg cag aag ctg gcc agc cgg ggc cgc agt 768
Lys Cys Asp Phe Ile Ala Ala Gln Lys Leu Ala Ser Arg Gly Arg Ser
245 250 255
gcc gag gaa atc ggc aag gcc atg gcc gag gcc agc cca gcg ctg gca 816
Ala Glu Glu Ile Gly Lys Ala Met Ala Glu Ala Ser Pro Ala Leu Ala
260 265 270
gag cgc aag ccc ggc cac gaa gcg gat tac atc gag cgc acc gtc agc 864
Glu Arg Lys Pro Gly His Glu Ala Asp Tyr Ile Glu Arg Thr Val Ser
275 280 285
aag gtc atg ggt ctg ccc agc gtc cag ctt gcg cgg gcc gag ctg gca 912
Lys Val Met Gly Leu Pro Ser Val Gln Leu Ala Arg Ala Glu Leu Ala
290 295 300
cgg gca ccg gca ccc cgc cag cga ggc atg gac agg ggc ggg cca gat 960
Arg Ala Pro Ala Pro Arg Gln Arg Gly Met Asp Arg Gly Gly Pro Asp
305 310 315 320
ttc agc atg tag 972
Phe Ser Met
<210>13
<211>323
<212>PRT
<213>Escherichia coli
<400>13
Met Lys Asn Asp Arg Thr Leu Gln Ala Ile Gly Arg Gln Leu Lys Ala
1 5 10 15
Met Gly Cys Glu Arg Phe Asp Ile Gly Val Arg Asp Ala Thr Thr Gly
20 25 30
Gln Met Met Asn Arg Glu Trp Ser Ala Ala Glu Val Leu Gln Asn Thr
35 40 45
Pro Trp Leu Lys Arg Met Asn Ala Gln Gly Asn Asp Val Tyr Ile Arg
50 55 60
Pro Ala Glu Gln Glu Arg His Gly Leu Val Leu Val Asp Asp Leu Ser
65 70 75 80
Glu Phe Asp Leu Asp Asp Met Lys Ala Glu Gly Arg Glu Pro Ala Leu
85 90 95
Val Val Glu Thr Ser Pro Lys Asn Tyr Gln Ala Trp Val Lys Val Ala
100 105 110
Asp Ala Ala Gly Gly Glu Leu Arg Gly Gln Ile Ala Arg Thr Leu Ala
115 120 125
Ser Glu Tyr Asp Ala Asp Pro Ala Ser Ala Asp Ser Arg His Tyr Gly
130 135 140
Arg Leu Ala Gly Phe Thr Asn Arg Lys Asp Lys His Thr Thr Arg Ala
145 150 155 160
Gly Tyr Gln Pro Trp Val Leu Leu Arg Glu Ser Lys Gly Lys Thr Ala
165 170 175
Thr Ala Gly Pro Ala Leu Val Gln Gln Ala Gly Gln Gln Ile Glu Gln
180 185 190
Ala Gln Arg Gln Gln Glu Lys Ala Arg Arg Leu Ala Ser Leu Glu Leu
195 200 205
Pro Glu Arg Gln Leu Ser Arg His Arg Arg Thr Ala Leu Asp Glu Tyr
210 215 220
Arg Ser Glu Met Ala Gly Leu Val Lys Arg Phe Gly Asp Asp Leu Ser
225 230 235 240
Lys Cys Asp Phe Ile Ala Ala Gln Lys Leu Ala Ser Arg Gly Arg Ser
245 250 255
Ala Glu Glu Ile Gly Lys Ala Met Ala Glu Ala Ser Pro Ala Leu Ala
260 265 270
Glu Arg Lys Pro Gly His Glu Ala Asp Tyr Ile Glu Arg Thr Val Ser
275 280 285
Lys Val Met Gly Leu Pro Ser Val Gln Leu Ala Arg Ala Glu Leu Ala
290 295 300
Arg Ala Pro Ala Pro Arg Gln Arg Gly Met Asp Arg Gly Gly Pro Asp
305 310 315 320
Phe Ser Met
<210>14
<211>213
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(213)
<223>orfE
<400>14
atg gaa tac gaa aaa agc gct tca ggg tcg gtc tac ctg atc aaa agt 48
Met Glu Tyr Glu Lys Ser Ala Ser Gly Ser Val Tyr Leu Ile Lys Ser
1 5 10 15
gac aag ggc tat tgg ttg ccc ggt ggc ttt ggt tat acg tca aac aag 96
Asp Lys Gly Tyr Trp Leu Pro Gly Gly Phe Gly Tyr Thr Ser Asn Lys
20 25 30
gcc gag gct ggc cgc ttt tca gtc gct gat atg gcc agc ctt aac ctt 144
Ala Glu Ala Gly Arg Phe Ser Val Ala Asp Met Ala Ser Leu Asn Leu
35 40 45
gac ggc tgc acc ttg tcc ttg ttc cgc gaa gac aag cct ttc ggc ccc 192
Asp Gly Cys Thr Leu Ser Leu Phe Arg Glu Asp Lys Pro Phe Gly Pro
50 55 60
ggc aag ttt ctc ggt gac tga 213
Gly Lys Phe Leu Gly Asp
65 70
<210>15
<211>70
<212>PRT
<213>Escherichia coli
<400>15
Met Glu Tyr Glu Lys Ser Ala Ser Gly Ser Val Tyr Leu Ile Lys Ser
1 5 10 15
Asp Lys Gly Tyr Trp Leu Pro Gly Gly Phe Gly Tyr Thr Ser Asn Lys
20 25 30
Ala Glu Ala Gly Arg Phe Ser Val Ala Asp Met Ala Ser Leu Asn Leu
35 40 45
Asp Gly Cys Thr Leu Ser Leu Phe Arg Glu Asp Lys Pro Phe Gly Pro
50 55 60
Gly Lys Phe Leu Gly Asp
65 70
<210>16
<211>207
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(207)
<223>orfF
<400>16
atg aaa gac caa aag gac aag cag acc ggc gac ctg ctg gcc agc cct 48
Met Lys Asp Gln Lys Asp Lys Gln Thr Gly Asp Leu Leu Ala Ser Pro
1 5 10 15
gac gct gta cgc caa gcg cga tat gcc gag cgc atg aag gcc aaa ggg 96
Asp Ala Val Arg Gln Ala Arg Tyr Ala Glu Arg Met Lys Ala Lys Gly
20 25 30
atg cgt cag cgc aag ttc tgg ctg acc gac gac gaa tac gag gcg ctg 144
Met Arg Gln Arg Lys Phe Trp Leu Thr Asp Asp Glu Tyr Glu Ala Leu
35 40 45
cgc gag tgc ctg gaa gaa ctc aga gcg gcg cag ggc ggg ggt agt gac 192
Arg Glu Cys Leu Glu Glu Leu Arg Ala Ala Gln Gly Gly Gly Ser Asp
50 55 60
ccc gcc agc gcc taa 207
Pro Ala Ser Ala
65
<210>17
<211>68
<212>PRT
<213>Escherichia coli
<400>17
Met Lys Asp Gln Lys Asp Lys Gln Thr Gly Asp Leu Leu Ala Ser Pro
1 5 10 15
Asp Ala Val Arg Gln Ala Arg Tyr Ala Glu Arg Met Lys Ala Lys Gly
20 25 30
Met Arg Gln Arg Lys Phe Trp Leu Thr Asp Asp Glu Tyr Glu Ala Leu
35 40 45
Arg Glu Cys Leu Glu Glu Leu Arg Ala Ala Gln Gly Gly Gly Ser Asp
50 55 60
Pro Ala Ser Ala
65
<210>18
<211>840
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(840)
<223>repA
<400>18
atg gct acc cat aag cct atc aat att ctg gag gcg ttc gca gca gcg 48
Met Ala Thr His Lys Pro Ile Asn Ile Leu Glu Ala Phe Ala Ala Ala
1 5 10 15
ccg cca ccg ctg gac tac gtt ttg ccc aac atg gtg gcc ggt acg gtc 96
Pro Pro Pro Leu Asp Tyr Val Leu Pro Asn Met Val Ala Gly Thr Val
20 25 30
ggg gcg ctg gtg tcg ccc ggt ggt gcc ggt aaa tcc atg ctg gcc ctg 144
Gly Ala Leu Val Ser Pro Gly Gly Ala Gly Lys Ser Met Leu Ala Leu
35 40 45
caa ctg gcc gca cag att gca ggc ggg ccg gat ctg ctg gag gtg ggc 192
Gln Leu Ala Ala Gln Ile Ala Gly Gly Pro Asp Leu Leu Glu Val Gly
50 55 60
gaa ctg ccc acc ggc ccg gtg atc tac ctg ccc gcc gaa gac ccg ccc 240
Glu Leu Pro Thr Gly Pro Val Ile Tyr Leu Pro Ala Glu Asp Pro Pro
65 70 75 80
acc gcc att cat cac cgc ctg cac gcc ctt ggg gcg cac ctc agc gcc 288
Thr Ala Ile His His Arg Leu His Ala Leu Gly Ala His Leu Ser Ala
85 90 95
gag gaa cgg caa gcc gtg gct gac ggc ctg ctg atc cag ccg ctg atc 336
Glu Glu Arg Gln Ala Val Ala Asp Gly Leu Leu Ile Gln Pro Leu Ile
100 105 110
ggc agc ctg ccc aac atc atg gcc ccg gag tgg ttc gac ggc ctc aag 384
Gly Ser Leu Pro Asn Ile Met Ala Pro Glu Trp Phe Asp Gly Leu Lys
115 120 125
cgc gcc gcc gag ggc cgc cgc ctg atg gtg ctg gac acg ctg cgc cgg 432
Arg Ala Ala Glu Gly Arg Arg Leu Met Val Leu Asp Thr Leu Arg Arg
130 135 140
ttc cac atc gag gaa gaa aac gcc agc ggc ccc atg gcc cag gtc atc 480
Phe His Ile Glu Glu Glu Asn Ala Ser Gly Pro Met Ala Gln Val Ile
145 150 155 160
ggt cgc atg gag gcc atc gcc gcc gat acc ggg tgc tct atc gtg ttc 528
Gly Arg Met Glu Ala Ile Ala Ala Asp Thr Gly Cys Ser Ile Val Phe
165 170 175
ctg cac cat gcc agc aag ggc gcg gcc atg atg ggc gca ggc gac cag 576
Leu His His Ala Ser Lys Gly Ala Ala Met Met Gly Ala Gly Asp Gln
180 185 190
cag cag gcc agc cgg ggc agc tcg gta ctg gtc gat aac atc cgc tgg 624
Gln Gln Ala Ser Arg Gly Ser Ser Val Leu Val Asp Asn Ile Arg Trp
195 200 205
cag tcc tac ctg tcg agc atg acc agc gcc gag gcc gag gaa tgg ggt 672
Gln Ser Tyr Leu Ser Ser Met Thr Ser Ala Glu Ala Glu Glu Trp Gly
210 215 220
gtg gac gac gac cag cgc cgg ttc ttc gtc cgc ttc ggt gtg agc aag 720
Val Asp Asp Asp Gln Arg Arg Phe Phe Val Arg Phe Gly Val Ser Lys
225 230 235 240
gcc aac tat ggc gca ccg ttc gct gat cgg tgg ttc agg cgg cat gac 768
Ala Asn Tyr Gly Ala Pro Phe Ala Asp Arg Trp Phe Arg Arg His Asp
245 250 255
ggc ggg gtg ctc aag ccc gcc gtg ctg gag agg cag cgc aag agc aag 816
Gly Gly Val Leu Lys Pro Ala Val Leu Glu Arg Gln Arg Lys Ser Lys
260 265 270
ggg gtg ccc cgt ggt gaa gcc taa 840
Gly Val Pro Arg Gly Glu Ala
275
<210>19
<211>279
<212>PRT
<213>Escherichia coli
<400>19
Met Ala Thr His Lys Pro Ile Asn Ile Leu Glu Ala Phe Ala Ala Ala
1 5 10 15
Pro Pro Pro Leu Asp Tyr Val Leu Pro Asn Met Val Ala Gly Thr Val
20 25 30
Gly Ala Leu Val Ser Pro Gly Gly Ala Gly Lys Ser Met Leu Ala Leu
35 40 45
Gln Leu Ala Ala Gln Ile Ala Gly Gly Pro Asp Leu Leu Glu Val Gly
50 55 60
Glu Leu Pro Thr Gly Pro Val Ile Tyr Leu Pro Ala Glu Asp Pro Pro
65 70 75 80
Thr Ala Ile His His Arg Leu His Ala Leu Gly Ala His Leu Ser Ala
85 90 95
Glu Glu Arg Gln Ala Val Ala Asp Gly Leu Leu Ile Gln Pro Leu Ile
100 105 110
Gly Ser Leu Pro Asn Ile Met Ala Pro Glu Trp Phe Asp Gly Leu Lys
115 120 125
Arg Ala Ala Glu Gly Arg Arg Leu Met Val Leu Asp Thr Leu Arg Arg
130 135 140
Phe His Ile Glu Glu Glu Asn Ala Ser Gly Pro Met Ala Gln Val Ile
145 150 155 160
Gly Arg Met Glu Ala Ile Ala Ala Asp Thr Gly Cys Ser Ile Val Phe
165 170 175
Leu His His Ala Ser Lys Gly Ala Ala Met Met Gly Ala Gly Asp Gln
180 185 190
Gln Gln Ala Ser Arg Gly Ser Ser Val Leu Val Asp Asn Ile Arg Trp
195 200 205
Gln Ser Tyr Leu Ser Ser Met Thr Ser Ala Glu Ala Glu Glu Trp Gly
210 215 220
Val Asp Asp Asp Gln Arg Arg Phe Phe Val Arg Phe Gly Val Ser Lys
225 230 235 240
Ala Asn Tyr Gly Ala Pro Phe Ala Asp Arg Trp Phe Arg Arg His Asp
245 250 255
Gly Gly Val Leu Lys Pro Ala Val Leu Glu Arg Gln Arg Lys Ser Lys
260 265 270
Gly Val Pro Arg Gly Glu Ala
275
<210>20
<211>852
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(852)
<223>repC
<400>20
gtg gtg aag cct aag aac aag cac agc ctc agc cac gtc cgg cac gac 48
Val Val Lys Pro Lys Asn Lys His Ser Leu Ser His Val Arg His Asp
1 5 10 15
ccg gcg cac tgt ctg gcc ccc ggc ctg ttc cgt gcc ctc aag cgg ggc 96
Pro Ala His Cys Leu Ala Pro Gly Leu Phe Arg Ala Leu Lys Arg Gly
20 25 30
gag cgc aag cgc agc aag ctg gac gtg acg tat gac tac ggc gac ggc 144
Glu Arg Lys Arg Ser Lys Leu Asp Val Thr Tyr Asp Tyr Gly Asp Gly
35 40 45
aag cgg atc gag ttc agc ggc ccg gag ccg ctg ggc gct gat gat ctg 192
Lys Arg Ile Glu Phe Ser Gly Pro Glu Pro Leu Gly Ala Asp Asp Leu
50 55 60
cgc atc ctg caa ggg ctg gtg gcc atg gct ggg cct aat ggc cta gtg 240
Arg Ile Leu Gln Gly Leu Val Ala Met Ala Gly Pro Asn Gly Leu Val
65 70 75 80
ctt ggc ccg gaa ccc aag acc gaa ggc gga cgg cag ctc cgg ctg ttc 288
Leu Gly Pro Glu Pro Lys Thr Glu Gly Gly Arg Gln Leu Arg Leu Phe
85 90 95
ctg gaa ccc aag tgg gag gcc gtc acc gct gaa tgc cat gtg gtc aaa 336
Leu Glu Pro Lys Trp Glu Ala Val Thr Ala Glu Cys His Val Val Lys
100 105 110
ggt agc tat cgg gcg ctg gca aag gaa atc ggg gca gag gtc gat agt 384
Gly Ser Tyr Arg Ala Leu Ala Lys Glu Ile Gly Ala Glu Val Asp Ser
115 120 125
ggt ggg gcg ctc aag cac ata cag gac tgc atc gag cgc ctt tgg aag 432
Gly Gly Ala Leu Lys His Ile Gln Asp Cys Ile Glu Arg Leu Trp Lys
130 135 140
gta tcc atc atc gcc cag aat ggc cgc aag cgg cag ggg ttt cgg ctg 480
Val Ser Ile Ile Ala Gln Asn Gly Arg Lys Arg Gln Gly Phe Arg Leu
145 150 155 160
ctg tcg gag tac gcc agc gac gag gcg gac ggg cgc ctg tac gtg gcc 528
Leu Ser Glu Tyr Ala Ser Asp Glu Ala Asp Gly Arg Leu Tyr Val Ala
165 170 175
ctg aac ccc ttg atc gcg cag gcc gtc atg ggt ggc ggc cag cat gtg 576
Leu Asn Pro Leu Ile Ala Gln Ala Val Met Gly Gly Gly Gln His Val
180 185 190
cgc atc agc atg gac gag gtg cgg gcg ctg gac agc gaa acc gcc cgc 624
Arg Ile Ser Met Asp Glu Val Arg Ala Leu Asp Ser Glu Thr Ala Arg
195 200 205
ctg ctg cac cag cgg ctg tgt ggc tgg atc gac ccc ggc aaa acc ggc 672
Leu Leu His Gln Arg Leu Cys Gly Trp Ile Asp Pro Gly Lys Thr Gly
210 215 220
aag gct tcc ata gat acc ttg tgc ggc tat gtc tgg ccg tca gag gcc 720
Lys Ala Ser Ile Asp Thr Leu Cys Gly Tyr Val Trp Pro Ser Glu Ala
225 230 235 240
agt ggt tcg acc atg cgc aag cgc cgc cag cgg gtg cgc gag gcg ttg 768
Ser Gly Ser Thr Met Arg Lys Arg Arg Gln Arg Val Arg Glu Ala Leu
245 250 255
ccg gag ctg gtc gcg ctg ggc tgg acg gta acc gag ttc gcg gcg ggc 816
Pro Glu Leu Val Ala Leu Gly Trp Thr Val Thr Glu Phe Ala Ala Gly
260 265 270
aag tac gac atc acc cgg ccc aag gcg gca ggc tga 852
Lys Tyr Asp Ile Thr Arg Pro Lys Ala Ala Gly
275 280
<210>21
<211>283
<212>PRT
<213>Escherichia coli
<400>21
Val Val Lys Pro Lys Asn Lys His Ser Leu Ser His Val Arg His Asp
1 5 10 15
Pro Ala His Cys Leu Ala Pro Gly Leu Phe Arg Ala Leu Lys Arg Gly
20 25 30
Glu Arg Lys Arg Ser Lys Leu Asp Val Thr Tyr Asp Tyr Gly Asp Gly
35 40 45
Lys Arg Ile Glu Phe Ser Gly Pro Glu Pro Leu Gly Ala Asp Asp Leu
50 55 60
Arg Ile Leu Gln Gly Leu Val Ala Met Ala Gly Pro Asn Gly Leu Val
65 70 75 80
Leu Gly Pro Glu Pro Lys Thr Glu Gly Gly Arg Gln Leu Arg Leu Phe
85 90 95
Leu Glu Pro Lys Trp Glu Ala Val Thr Ala Glu Cys His Val Val Lys
100 105 110
Gly Ser Tyr Arg Ala Leu Ala Lys Glu Ile Gly Ala Glu Val Asp Ser
115 120 125
Gly Gly Ala Leu Lys His Ile Gln Asp Cys Ile Glu Arg Leu Trp Lys
130 135 140
Val Ser Ile Ile Ala Gln Asn Gly Arg Lys Arg Gln Gly Phe Arg Leu
145 150 155 160
Leu Ser Glu Tyr Ala Ser Asp Glu Ala Asp Gly Arg Leu Tyr Val Ala
165 170 175
Leu Asn Pro Leu Ile Ala Gln Ala Val Met Gly Gly Gly Gln His Val
180 185 190
Arg Ile Ser Met Asp Glu Val Arg Ala Leu Asp Ser Glu Thr Ala Arg
195 200 205
Leu Leu His Gln Arg Leu Cys Gly Trp Ile Asp Pro Gly Lys Thr Gly
210 215 220
Lys Ala Ser Ile Asp Thr Leu Cys Gly Tyr Val Trp Pro Ser Glu Ala
225 230 235 240
Ser Gly Ser Thr Met Arg Lys Arg Arg Gln Arg Val Arg Glu Ala Leu
245 250 255
Pro Glu Leu Val Ala Leu Gly Trp Thr Val Thr Glu Phe Ala Ala Gly
260 265 270
Lys Tyr Asp Ile Thr Arg Pro Lys Ala Ala Gly
275 280
<210>22
<211>789
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(789)
<223>sul
<400>22
atg aat aaa tcg ctc atc att ttc ggc atc gtc aac ata acc tcg gac 48
Met Asn Lys Ser Leu Ile Ile Phe Gly Ile Val Asn Ile Thr Ser Asp
1 5 10 15
agt ttc tcc gat gga ggc cgg tat ctg gcg cca gac gca gcc att gcg 96
Ser Phe Ser Asp Gly Gly Arg Tyr Leu Ala Pro Asp Ala Ala Ile Ala
20 25 30
cag gcg cgt aag ctg atg gcc gag ggg gca gat gtg atc gac ctg gtc 144
Gln Ala Arg Lys Leu Met Ala Glu Gly Ala Asp Val Ile Asp Leu Val
35 40 45
cgg cat cca gca atc ccg acg ccg cgc ctg ttt cgt ccg aca cag aaa 192
Arg His Pro Ala Ile Pro Thr Pro Arg Leu Phe Arg Pro Thr Gln Lys
50 55 60
tcg cgc gta tgc gcc ggt gct gga cgc gct cag gca gat ggc att ccc 240
Ser Arg Val Cys Ala Gly Ala Gly Arg Ala Gln Ala Asp Gly Ile Pro
65 70 75 80
gtc tcg ctc gac agt tat caa ccc gcg acg caa gcc tat gcc ttg tcg 288
Val Ser Leu Asp Ser Tyr Gln Pro Ala Thr Gln Ala Tyr Ala Leu Ser
85 90 95
cgt ggt gtg gcc tat ctc aat gat att cgc ggt ttt cca gac gct gcg 336
Arg Gly Val Ala Tyr Leu Asn Asp Ile Arg Gly Phe Pro Asp Ala Ala
100 105 110
ttc tat ccg caa ttg gcg aaa tca tct gcc aaa ctc gtc gtt atg cat 384
Phe Tyr Pro Gln Leu Ala Lys Ser Ser Ala Lys Leu Val Val Met His
115 120 125
tcg gtg caa gac ggg cag gca gat cgg cgc gag gca ccc gct ggc gac 432
Ser Val Gln Asp Gly Gln Ala Asp Arg Arg Glu Ala Pro Ala Gly Asp
130 135 140
atc atg gat cac att gcg gcg ttc ttt gac gcg cgc atc gcg gcg ctg 480
Ile Met Asp His Ile Ala Ala Phe Phe Asp Ala Arg Ile Ala Ala Leu
145 150 155 160
acg ggt gcc ggt atc aaa cgc aac cgc ctt gtc ctt gat ccc ggc atg 528
Thr Gly Ala Gly Ile Lys Arg Asn Arg Leu Val Leu Asp Pro Gly Met
165 170 175
ggg ttt ttt ctg ggg gct gct ccc gaa acc tcg ctc tcg gtg ctg gcg 576
Gly Phe Phe Leu Gly Ala Ala Pro Glu Thr Ser Leu Ser Val Leu Ala
180 185 190
cgg ttc gat gaa ttg cgg ctg cgc ttc gat ttg ccg gtg ctt ctg tct 624
Arg Phe Asp Glu Leu Arg Leu Arg Phe Asp Leu Pro Val Leu Leu Ser
195 200 205
gtt tcg cgc aaa tcc ttt ctg cgc gcg ctc aca ggc cgt ggt ccg ggg 672
Val Ser Arg Lys Ser Phe Leu Arg Ala Leu Thr Gly Arg Gly Pro Gly
210 215 220
gtg tcg ggg ccg cga cac tcg ctg cag agc ttg ccg ccg ccg cag gtg 720
Val Ser Gly Pro Arg His Ser Leu Gln Ser Leu Pro Pro Pro Gln Val
225 230 235 240
gag ctg act tca tcc gca cac acg agc cgc gcc cct tgc gcg acg ggc 768
Glu Leu Thr Ser Ser Ala His Thr Ser Arg Ala Pro Cys Ala Thr Gly
245 250 255
tgg cgg tat tgg cgg cgc tga 789
Trp Arg Tyr Trp Arg Arg
260
<210>23
<211>262
<212>PRT
<213>Escherichia coli
<400>23
Met Asn Lys Ser Leu Ile Ile Phe Gly Ile Val Asn Ile Thr Ser Asp
1 5 10 15
Ser Phe Ser Asp Gly Gly Arg Tyr Leu Ala Pro Asp Ala Ala Ile Ala
20 25 30
Gln Ala Arg Lys Leu Met Ala Glu Gly Ala Asp Val Ile Asp Leu Val
35 40 45
Arg His Pro Ala Ile Pro Thr Pro Arg Leu Phe Arg Pro Thr Gln Lys
50 55 60
Ser Arg Val Cys Ala Gly Ala Gly Arg Ala Gln Ala Asp Gly Ile Pro
65 70 75 80
Val Ser Leu Asp Ser Tyr Gln Pro Ala Thr Gln Ala Tyr Ala Leu Ser
85 90 95
Arg Gly Val Ala Tyr Leu Asn Asp Ile Arg Gly Phe Pro Asp Ala Ala
100 105 110
Phe Tyr Pro Gln Leu Ala Lys Ser Ser Ala Lys Leu Val Val Met His
115 120 125
Ser Val Gln Asp Gly Gln Ala Asp Arg Arg Glu Ala Pro Ala Gly Asp
130 135 140
Ile Met Asp His Ile Ala Ala Phe Phe Asp Ala Arg Ile Ala Ala Leu
145 150 155 160
Thr Gly Ala Gly Ile Lys Arg Asn Arg Leu Val Leu Asp Pro Gly Met
165 170 175
Gly Phe Phe Leu Gly Ala Ala Pro Glu Thr Ser Leu Ser Val Leu Ala
180 185 190
Arg Phe Asp Glu Leu Arg Leu Arg Phe Asp Leu Pro Val Leu Leu Ser
195 200 205
Val Ser Arg Lys Ser Phe Leu Arg Ala Leu Thr Gly Arg Gly Pro Gly
210 215 220
Val Ser Gly Pro Arg His Ser Leu Gln Ser Leu Pro Pro Pro Gln Val
225 230 235 240
Glu Leu Thr Ser Ser Ala His Thr Ser Arg Ala Pro Cys Ala Thr Gly
245 250 255
Trp Arg Tyr Trp Arg Arg
260
<210>24
<211>8335
<212>DNA
<213>Escherichia coli
<220>
<221>gene
<222>(63)..(866)
<223>strA
<220>
<221>gene
<222>(866)..(1702)
<223>strB
<220>
<221>promoter
<222>(2824)..(2912)
<223>PlacUV5 promoter
<220>
<221>gene
<222>(2957)..(4039)
<223>lacI
<220>
<221>gene
<222>(4059)..(5033)
<223>repB
<220>
<221>gene
<222>(5091)..(5303)
<223>orfE
<220>
<221>gene
<222>(5305)..(5511)
<223>orfF
<220>
<221>gene
<222>(5541)..(6380)
<223>repA
<220>
<221>gene
<222>(6367)..(7218)
<223>repC
<220>
<221>gene
<222>(7526)..(8314)
<223>sul
<400>24
aactgcacat tcgggatatt tctctatatt cgcgcttcat cagaaaactg aaggaacctc 60
cattgaatcg aactaatatt ttttttggtg aatcgcattc tgactggttg cctgtcagag 120
gcggagaatc tggtgatttt gtttttcgac gtggtgacgg gcatgccttc gcgaaaatcg 180
cacctgcttc ccgccgcggt gagctcgctg gagagcgtga ccgcctcatt tggctcaaag 240
gtcgaggtgt ggcttgcccc gaggtcatca actggcagga ggaacaggag ggtgcatgct 300
tggtgataac ggcaattccg ggagtaccgg cggctgatct gtctggagcg gatttgctca 360
aagcgtggcc gtcaatgggg cagcaacttg gcgctgttca cagcctatcg gttgatcaat 420
gtccgtttga gcgcaggctg tcgcgaatgt tcggacgcgc cgttgatgtg gtgtcccgca 480
atgccgtcaa tcccgacttc ttaccggacg aggacaagag tacgccgctg cacgatcttt 540
tggctcgtgt cgaacgagag ctaccggtgc ggctcgacca agagcgcacc gatatggttg 600
tttgccatgg tgatccctgc atgccgaact tcatggtgga ccctaaaact cttcaatgca 660
cgggtctgat cgaccttggg cggctcggaa cagcagatcg ctatgccgat ttggcactca 720
tgattgctaa cgccgaagag aactgggcag cgccagatga agcagagcgc gccttcgctg 780
tcctattcaa tgtattgggg atcgaagccc ccgaccgcga acgccttgcc ttctatctgc 840
gattggaccc tctgacttgg ggttgatgtt catgccgcct gtttttcctg ctcattggca 900
cgtttcgcaa cctgttctca ttgcggacac cttttccagc ctcgtttgga aagtttcatt 960
gccagacggg actcctgcaa tcgtcaaggg attgaaacct atagaagaca ttgctgatga 1020
actgcgcggg gccgactatc tggtatggcg caatgggagg ggagcagtcc ggttgctcgg 1080
tcgtgagaac aatctgatgt tgctcgaata tgccggggag cgaatgctct ctcacatcgt 1140
tgccgagcac ggcgactacc aggcgaccga aattgcagcg gaactaatgg cgaagctgta 1200
tgccgcatct gaggaacccc tgccttctgc ccttctcccg atccgggatc gctttgcagc 1260
tttgtttcag cgggcgcgcg atgatcaaaa cgcaggttgt caaactgact acgtccacgc 1320
ggcgattata gccgatcaaa tgatgagcaa tgcctcggaa ctgcgtgggc tacatggcga 1380
tctgcatcat gaaaacatca tgttctccag tcgcggctgg ctggtgatag atcccgtcgg 1440
tctggtcggt gaagtgggct ttggcgccgc caatatgttc tacgatccgg ctgacagaga 1500
cgacctttgt ctcgatccta gacgcattgc acagatggcg gacgcattct ctcgtgcgct 1560
ggacgtcgat ccgcgtcgcc tgctcgacca ggcgtacgct tatgggtgcc tttccgcagc 1620
ttggaacgcg gatggagaag aggagcaacg cgatctagct atcgcggccg cgatcaagca 1680
ggtgcgacag acgtcatact agatatcaag cgacttctcc tatcccctgg gaacacatca 1740
atctcaccgg agaatatcgc tggccaaagc cttagcgtag gattccgccc cttcccgcaa 1800
acgaccccaa acaggaaacg cagctgaaac gggaagctca acacccactg acgcatgggt 1860
tgttcaggca gtacttcatc aaccagcaag gcggcacttt cggccatccg ccgcgcccca 1920
cagctcgggc agaaaccgcg acgcttacag ctgaaagcga ccaggtgctc ggcgtggcaa 1980
gactcgcagc gaacccgtag aaagccatgc tccagccgcc cgcattggag aaattcttca 2040
aattcccgtt gcacatagcc cggcaattcc tttccctgct ctgccataag cgcagcgaat 2100
gccgggtaat actcgtcaac gatctgatag agaagggttt gctcgggtcg gtggctctgg 2160
taacgaccag tatcccgatc ccggctggcc gtcctggccg ccacatgagg catgttccgc 2220
gtccttgcaa tactgtgttt acatacagtc tatcgcttag cggaaagttc ttttaccctc 2280
agccgaaatg cctgccgttg ctagacattg ccagccagtg cccgtcactc ccgtactaac 2340
tgtcacgaac ccctgcaata actgtcacgc ccccctgcaa taactgtcac gaacccctgc 2400
aataactgtc acgcccccaa acctgcaaac ccagcagggg cgggggctgg cggggtgttg 2460
gaaaaatcca tccatgatta tctaagaata atccactagg cgcggttatc agcgcccttg 2520
tggggcgctg ctgcccttgc ccaatatgcc cggccagagg ccggatagct ggtctattcg 2580
ctgcgctagg ctacacaccg ccccaccgct gcgcggcagg gggaaaggcg ggcaaagccc 2640
gctaaacccc acaccaaacc ccgcagaaat acgctggagc gcttttagcc gctttagcgg 2700
cctttccccc tacccgaagg gtgggggcgc gtgtgcagcc ccgcagggcc tgtctcggtc 2760
gatcattcag cccggctcat agatctgcgg gcagtgagcg caacgcaatt aatgtgagtt 2820
agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt ataatgtgtg 2880
gaattgtgag cggataacaa tttcacacag gatctagaaa taattttgtt taactttaag 2940
aaggagatat acatatgtga aaccagtaac gttatacgat gtcgcagagt atgccggtgt 3000
ctcttatcag accgtttccc gcgtggtgaa ccaggccagc cacgtttctg cgaaaacgcg 3060
ggaaaaagtg gaagcggcga tggcggagct gaattacatt cccaaccgcg tggcacaaca 3120
actggcgggc aaaccgtcga agcctgtaaa gcggcggtgc acaatcttct cgcgcaacgc 3180
gtcagtgggc tgatagtcgt tgctgattgg cgttgccacc tccagtctgg ccctgcacgc 3240
gccgtcgcaa attgtcgcgg cgattaaatc tcgcgccgat caactgggtg ccagcgtggt 3300
ggtgtcgatg gtagaacgaa gcggcattaa ctatccgctg gatgaccagg atgccattgc 3360
tgtggaagct gcctgcacta atgttccggc gttatttctt gatgtctctg accagacacc 3420
catcaacagt attattttct cccatgaaga cggtacgcga ctgggcgtgg agcatctggt 3480
cgcattgggt caccagcaaa tcgcgctgtt agcgggccca ttaagttctg tctcggcgcg 3540
tctgcgtctg gctggctggc ataaatatct cactcgcaat caaattcagc cgatagcgga 3600
acgggaaggc gactggagtg ccatgtccgg ttttcaacaa accatgcaaa tgctgaatga 3660
gggcatcgtt cccactgcga tgctggttgc caacgatcag atggcgctgg gcgcaatgcg 3720
cgccattacc gagtccgggc tgcgcgttgg tgcggatatc tcggtagtgg gatacgacga 3780
taccgaagac agctcatgtt atatcccgcc gttaaccacc atcaaacagg attttcgcct 3840
gctggggcaa accagcgtgg accgcttgct gcaactctct cagggccagg cggtgaaggg 3900
caatcagctg ttgcccgtct cactggtgaa aagaaaaacc accctggcgc ccaatacgca 3960
aaccgcctct ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg 4020
actggaaagc gggcagtgag gatccggggg gtggcccgat gaagaacgac aggactttgc 4080
aggccatagg ccgacagctc aaggccatgg gctgtgagcg cttcgatatc ggcgtcaggg 4140
acgccaccac cggccagatg atgaaccggg aatggtcagc cgccgaagtg ctccagaaca 4200
cgccatggct caagcggatg aatgcccagg gcaatgacgt gtatatcagg cccgccgagc 4260
aggagcggca tggtctggtg ctggtggacg acctcagcga gtttgacctg gatgacatga 4320
aagccgaggg ccgggagcct gccctggtag tggaaaccag cccgaagaac tatcaggcat 4380
gggtcaaggt ggccgacgcc gcaggcggtg aacttcgggg gcagattgcc cggacgctgg 4440
ccagcgagta cgacgccgac ccggccagcg ccgacagccg ccactatggc cgcttggcgg 4500
gcttcaccaa ccgcaaggac aagcacacca cccgcgccgg ttatcagccg tgggtgctgc 4560
tgcgtgaatc caagggcaag accgccaccg ctggcccggc gctggtgcag caggctggcc 4620
agcagatcga gcaggcccag cggcagcagg agaaggcccg caggctggcc agcctcgaac 4680
tgcccgagcg gcagcttagc cgccaccggc gcacggcgct ggacgagtac cgcagcgaga 4740
tggccgggct ggtcaagcgc ttcggtgatg acctcagcaa gtgcgacttt atcgccgcgc 4800
agaagctggc cagccggggc cgcagtgccg aggaaatcgg caaggccatg gccgaggcca 4860
gcccagcgct ggcagagcgc aagcccggcc acgaagcgga ttacatcgag cgcaccgtca 4920
gcaaggtcat gggtctgccc agcgtccagc ttgcgcgggc cgagctggca cgggcaccgg 4980
caccccgcca gcgaggcatg gacaggggcg ggccagattt cagcatgtag tgcttgcgtt 5040
ggtactcacg cctgttatac tatgagtact cacgcacaga agggggtttt atggaatacg 5100
aaaaaagcgc ttcagggtcg gtctacctga tcaaaagtga caagggctat tggttgcccg 5160
gtggctttgg ttatacgtca aacaaggccg aggctggccg cttttcagtc gctgatatgg 5220
ccagccttaa ccttgacggc tgcaccttgt ccttgttccg cgaagacaag cctttcggcc 5280
ccggcaagtt tctcggtgac tgatatgaaa gaccaaaagg acaagcagac cggcgacctg 5340
ctggccagcc ctgacgctgt acgccaagcg cgatatgccg agcgcatgaa ggccaaaggg 5400
atgcgtcagc gcaagttctg gctgaccgac gacgaatacg aggcgctgcg cgagtgcctg 5460
gaagaactca gagcggcgca gggcgggggt agtgaccccg ccagcgccta accaccaact 5520
gcctgcaaag gaggcaatca atggctaccc ataagcctat caatattctg gaggcgttcg 5580
cagcagcgcc gccaccgctg gactacgttt tgcccaacat ggtggccggt acggtcgggg 5640
cgctggtgtc gcccggtggt gccggtaaat ccatgctggc cctgcaactg gccgcacaga 5700
ttgcaggcgg gccggatctg ctggaggtgg gcgaactgcc caccggcccg gtgatctacc 5760
tgcccgccga agacccgccc accgccattc atcaccgcct gcacgccctt ggggcgcacc 5820
tcagcgccga ggaacggcaa gccgtggctg acggcctgct gatccagccg ctgatcggca 5880
gcctgcccaa catcatggcc ccggagtggt tcgacggcct caagcgcgcc gccgagggcc 5940
gccgcctgat ggtgctggac acgctgcgcc ggttccacat cgaggaagaa aacgccagcg 6000
gccccatggc ccaggtcatc ggtcgcatgg aggccatcgc cgccgatacc gggtgctcta 6060
tcgtgttcct gcaccatgcc agcaagggcg cggccatgat gggcgcaggc gaccagcagc 6120
aggccagccg gggcagctcg gtactggtcg ataacatccg ctggcagtcc tacctgtcga 6180
gcatgaccag cgccgaggcc gaggaatggg gtgtggacga cgaccagcgc cggttcttcg 6240
tccgcttcgg tgtgagcaag gccaactatg gcgcaccgtt cgctgatcgg tggttcaggc 6300
ggcatgacgg cggggtgctc aagcccgccg tgctggagag gcagcgcaag agcaaggggg 6360
tgccccgtgg tgaagcctaa gaacaagcac agcctcagcc acgtccggca cgacccggcg 6420
cactgtctgg cccccggcct gttccgtgcc ctcaagcggg gcgagcgcaa gcgcagcaag 6480
ctggacgtga cgtatgacta cggcgacggc aagcggatcg agttcagcgg cccggagccg 6540
ctgggcgctg atgatctgcg catcctgcaa gggctggtgg ccatggctgg gcctaatggc 6600
ctagtgcttg gcccggaacc caagaccgaa ggcggacggc agctccggct gttcctggaa 6660
cccaagtggg aggccgtcac cgctgaatgc catgtggtca aaggtagcta tcgggcgctg 6720
gcaaaggaaa tcggggcaga ggtcgatagt ggtggggcgc tcaagcacat acaggactgc 6780
atcgagcgcc tttggaaggt atccatcatc gcccagaatg gccgcaagcg gcaggggttt 6840
cggctgctgt cggagtacgc cagcgacgag gcggacgggc gcctgtacgt ggccctgaac 6900
cccttgatcg cgcaggccgt catgggtggc ggccagcatg tgcgcatcag catggacgag 6960
gtgcgggcgc tggacagcga aaccgcccgc ctgctgcacc agcggctgtg tggctggatc 7020
gaccccggca aaaccggcaa ggcttccata gataccttgt gcggctatgt ctggccgtca 7080
gaggccagtg gttcgaccat gcgcaagcgc cgccagcggg tgcgcgaggc gttgccggag 7140
ctggtcgcgc tgggctggac ggtaaccgag ttcgcggcgg gcaagtacga catcacccgg 7200
cccaaggcgg caggctgacc ccccccactc tattgtaaac aagacatttt tatcttttat 7260
attcaatggc ttattttcct gctaattggt aataccatga aaaataccat gctcagaaaa 7320
ggcttaacaa tattttgaaa aattgcctac tgagcgctgc cgcacagctc cataggccgc 7380
tttcctggct ttgcttccag atgtatgctc ttctgctcct gcagctaatg gatcaccgca 7440
aacaggttac tcgcctgggg attccctttc gacccgagca tccgtatgat actcatgctc 7500
gattattatt attatagaag cccccatgaa taaatcgctc atcattttcg gcatcgtcaa 7560
cataacctcg gacagtttct ccgatggagg ccggtatctg gcgccagacg cagccattgc 7620
gcaggcgcgt aagctgatgg ccgagggggc agatgtgatc gacctggtcc ggcatccagc 7680
aatcccgacg ccgcgcctgt ttcgtccgac acagaaatcg cgcgtatgcg ccggtgctgg 7740
acgcgctcag gcagatggca ttcccgtctc gctcgacagt tatcaacccg cgacgcaagc 7800
ctatgccttg tcgcgtggtg tggcctatct caatgatatt cgcggttttc cagacgctgc 7860
gttctatccg caattggcga aatcatctgc caaactcgtc gttatgcatt cggtgcaaga 7920
cgggcaggca gatcggcgcg aggcacccgc tggcgacatc atggatcaca ttgcggcgtt 7980
ctttgacgcg cgcatcgcgg cgctgacggg tgccggtatc aaacgcaacc gccttgtcct 8040
tgatcccggc atggggtttt ttctgggggc tgctcccgaa acctcgctct cggtgctggc 8100
gcggttcgat gaattgcggc tgcgcttcga tttgccggtg cttctgtctg tttcgcgcaa 8160
atcctttctg cgcgcgctca caggccgtgg tccgggggtg tcggggccgc gacactcgct 8220
gcagagcttg ccgccgccgc aggtggagct gacttcatcc gcacacacga gccgcgcccc 8280
ttgcgcgacg ggctggcggt attggcggcg ctgaaagaaa ccgcaagaat tcgtt 8335
<210>25
<211>1083
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(1083)
<223>lacI
<400>25
gtg aaa cca gta acg tta tac gat gtc gca gag tat gcc ggt gtc tct 48
Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala Gly Val Ser
1 5 10 15
tat cag acc gtt tcc cgc gtg gtg aac cag gcc agc cac gtt tct gcg 96
Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His Val Ser Ala
20 25 30
aaa acg cgg gaa aaa gtg gaa gcg gcg atg gcg gag ctg aat tac att 144
Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu Asn Tyr Ile
35 40 45
ccc aac cgc gtg gca caa caa ctg gcg ggc aaa cag tcg ttg ctg att 192
Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser Leu Leu Ile
50 55 60
ggc gtt gcc acc tcc agt ctg gcc ctg cac gcg ccg tcg caa att gtc 240
Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser Gln Ile Val
65 70 75 80
gcg gcg att aaa tct cgc gcc gat caa ctg ggt gcc agc gtg gtg gtg 288
Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser Val Val Val
85 90 95
tcg atg gta gaa cga agc ggc gtc gaa gcc tgt aaa gcg gcg gtg cac 336
Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala Ala Val His
100 105 110
aat ctt ctc gcg caa cgc gtc agt ggg ctg atc att aac tat ccg ctg 384
Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn Tyr Pro Leu
115 120 125
gat gac cag gat gcc att gct gtg gaa gct gcc tgc act aat gtt ccg 432
Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr Asn Val Pro
130 135 140
gcg tta ttt ctt gat gtc tct gac cag aca ccc atc aac agt att att 480
Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn Ser Ile Ile
145 150 155 160
ttc tcc cat gaa gac ggt acg cga ctg ggc gtg gag cat ctg gtc gca 528
Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His Leu Val Ala
165 170 175
ttg ggt cac cag caa atc gcg ctg tta gcg ggc cca tta agt tct gtc 576
Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu Ser Ser Val
180 185 190
tcg gcg cgt ctg cgt ctg gct ggc tgg cat aaa tat ctc act cgc aat 624
Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg Asn
195 200 205
caa att cag ccg ata gcg gaa cgg gaa ggc gac tgg agt gcc atg tcc 672
Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser Ala Met Ser
210 215 220
ggt ttt caa caa acc atg caa atg ctg aat gag ggc atc gtt ccc act 720
Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile Val Pro Thr
225 230 235 240
gcg atg ctg gtt gcc aac gat cag atg gcg ctg ggc gca atg cgc gcc 768
Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala Met Arg Ala
245 250 255
att acc gag tcc ggg ctg cgc gtt ggt gcg gat atc tcg gta gtg gga 816
Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser Val Val Gly
260 265 270
tac gac gat acc gaa gac agc tca tgt tat atc ccg ccg tta acc acc 864
Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro Leu Thr Thr
275 280 285
atc aaa cag gat ttt cgc ctg ctg ggg caa acc agc gtg gac cgc ttg 912
Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val Asp Arg Leu
290 295 300
ctg caa ctc tct cag ggc cag gcg gtg aag ggc aat cag ctg ttg ccc 960
Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln Leu Leu Pro
305 310 315 320
gtc tca ctg gtg aaa aga aaa acc acc ctg gcg ccc aat acg caa acc 1008
Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gln Thr
325 330 335
gcc tct ccc cgc gcg ttg gcc gat tca tta atg cag ctg gca cga cag 1056
Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu Ala Arg Gln
340 345 350
gtt tcc cga ctg gaa agc ggg cag tga 1083
Val Ser Arg Leu Glu Ser Gly Gln
355 360
<210>26
<211>360
<212>PRT
<213>Escherichia coli
<400>26
Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala Gly Val Ser
1 5 10 15
Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His Val Ser Ala
20 25 30
Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu Asn Tyr Ile
35 40 45
Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser Leu Leu Ile
50 55 60
Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser Gln Ile Val
65 70 75 80
Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser Val Val Val
85 90 95
Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala Ala Val His
100 105 110
Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn Tyr Pro Leu
115 120 125
Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr Asn Val Pro
130 135 140
Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn Ser Ile Ile
145 150 155 160
Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His Leu Val Ala
165 170 175
Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu Ser Ser Val
180 185 190
Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg Asn
195 200 205
Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser Ala Met Ser
210 215 220
Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile Val Pro Thr
225 230 235 240
Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala Met Arg Ala
245 250 255
Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser Val Val Gly
260 265 270
Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro Leu Thr Thr
275 280 285
Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val Asp Arg Leu
290 295 300
Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln Leu Leu Pro
305 310 315 320
Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gln Thr
325 330 335
Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu Ala Arg Gln
340 345 350
Val Ser Arg Leu Glu Ser Gly Gln
355 360
<210>27
<211>6864
<212>DNA
<213>Escherichia coli
<220>
<221>gene
<222>(196)..(990)
<223>thyA
<220>
<221>CDS
<222>(196)..(990)
<223>thyA
<220>
<221>promoter
<222>(2162)..(2250)
<223>PlacUV5
<220>
<221>gene
<222>(2295)..(3377)
<223>lacI
<220>
<221>gene
<222>(3397)..(4368)
<223>repB
<220>
<221>gene
<222>(4429)..(4651)
<223>orfE
<220>
<221>gene
<222>(4643)..(4849)
<223>orfF
<220>
<221>gene
<222>(4879)..(5718)
<223>repA
<220>
<221>gene
<222>(5705)..(6556)
<223>repC
<400>27
gaattcctga ttggttacgg cgcgtttcgc atcattgttg agtttttccg ccagcccgac 60
gcgcagttta ccggtgcctg ggtgcatgat ataatcatgg ggcaaattct ttccatcccg 120
atgattgtcg cgggtgtgat catgatggtc tgggcatatc gtcgcagccc acagcaacac 180
gtttcctgag gaacc atg aaa cag tat tta gaa ctg atg caa aaa gtg ctc 231
Met Lys Gln Tyr Leu Glu Leu Met Gln Lys Val Leu
1 5 10
gac gaa ggc aca cag aaa aac gac cgt acc gga acc gga acg ctt tcc 279
Asp Glu Gly Thr Gln Lys Asn Asp Arg Thr Gly Thr Gly Thr Leu Ser
15 20 25
att ttt ggt cat cag atg cgt ttt aac ctg caa gat gga ttc ccg ctg 327
Ile Phe Gly His Gln Met Arg Phe Asn Leu Gln Asp Gly Phe Pro Leu
30 35 40
gtg aca act aaa cgt tgc cac ctg cgt tcc atc atc cat gaa ctg ctg 375
Val Thr Thr Lys Arg Cys His Leu Arg Ser Ile Ile His Glu Leu Leu
45 50 55 60
tgg ttt ctc cag ggc gac act aac att gct tat cta cac gaa aac aat 423
Trp Phe Leu Gln Gly Asp Thr Asn Ile Ala Tyr Leu His Glu Asn Asn
65 70 75
gtc acc atc tgg gac gaa tgg gcc gat gaa aac ggc gac ctc ggg cca 471
Val Thr Ile Trp Asp Glu Trp Ala Asp Glu Asn Gly Asp Leu Gly Pro
80 85 90
gtg tat ggt aaa cag tgg cgc gcc tgg cca acg cca gat ggt cgt cat 519
Val Tyr Gly Lys Gln Trp Arg Ala Trp Pro Thr Pro Asp Gly Arg His
95 100 105
att gac cag atc act acg gta ctg aac cag ctg aaa aac gac ccg gat 567
Ile Asp Gln Ile Thr Thr Val Leu Asn Gln Leu Lys Asn Asp Pro Asp
110 115 120
tcg cgc cgc att att gtt tca gcg tgg aac gta ggc gaa ctg gat aaa 615
Ser Arg Arg Ile Ile Val Ser Ala Trp Asn Val Gly Glu Leu Asp Lys
125 130 135 140
atg gcg ctg gca ccg tgc cat gca ttc ttc cag ttc tat gtg gca gac 663
Met Ala Leu Ala Pro Cys His Ala Phe Phe Gln Phe Tyr Val Ala Asp
145 150 155
ggc aaa ctc tct tgc cag ctt tat cag cgc tcc tgt gac gtc ttc ctc 711
Gly Lys Leu Ser Cys Gln Leu Tyr Gln Arg Ser Cys Asp Val Phe Leu
160 165 170
ggc ctg ccg ttc aac att gcc agc tac gcg tta ttg gtg cat atg atg 759
Gly Leu Pro Phe Asn Ile Ala Ser Tyr Ala Leu Leu Val His Met Met
175 180 185
gcg cag cag tgc gat ctg gaa gtg ggt gat ttt gtc tgg acc ggt ggc 807
Ala Gln Gln Cys Asp Leu Glu Val Gly Asp Phe Val Trp Thr Gly Gly
190 195 200
gac acg cat ctg tac agc aac cat atg gat caa act cat ctg caa tta 855
Asp Thr His Leu Tyr Ser Asn His Met Asp Gln Thr His Leu Gln Leu
205 210 215 220
agc cgc gaa ccg cgt ccg ctg ccg aag ttg att atc aaa cgt aaa ccc 903
Ser Arg Glu Pro Arg Pro Leu Pro Lys Leu Ile Ile Lys Arg Lys Pro
225 230 235
gaa tcc atc ttc gac tac cgt ttc gaa gac ttt gag att gaa ggc tac 951
Glu Ser Ile Phe Asp Tyr Arg Phe Glu Asp Phe Glu Ile Glu Gly Tyr
240 245 250
gat ccg cat ccg ggc att aaa gcg ccg gtg gct atc taa ttacgaagct 1000
Asp Pro His Pro Gly Ile Lys Ala Pro Val Ala Ile
255 260
tgcggccgcg atcaagcagg tgcgacagac gtcatactag atatcaagcg acttctccta 1060
tcccctggga acacatcaat ctcaccggag aatatcgctg gccaaagcct tagcgtagga 1120
ttccgcccct tcccgcaaac gaccccaaac aggaaacgca gctgaaacgg gaagctcaac 1180
acccactgac gcatgggttg ttcaggcagt acttcatcaa ccagcaaggc ggcactttcg 1240
gccatccgcc gcgccccaca gctcgggcag aaaccgcgac gcttacagct gaaagcgacc 1300
aggtgctcgg cgtggcaaga ctcgcagcga acccgtagaa agccatgctc cagccgcccg 1360
cattggagaa attcttcaaa ttcccgttgc acatagcccg gcaattcctt tccctgctct 1420
gccataagcg cagcgaatgc cgggtaatac tcgtcaacga tctgatagag aagggtttgc 1480
tcgggtcggt ggctctggta acgaccagta tcccgatccc ggctggccgt cctggccgcc 1540
acatgaggca tgttccgcgt ccttgcaata ctgtgtttac atacagtcta tcgcttagcg 1600
gaaagttctt ttaccctcag ccgaaatgcc tgccgttgct agacattgcc agccagtgcc 1660
cgtcactccc gtactaactg tcacgaaccc ctgcaataac tgtcacgccc ccctgcaata 1720
actgtcacga acccctgcaa taactgtcac gcccccaaac ctgcaaaccc agcaggggcg 1780
ggggctggcg gggtgttgga aaaatccatc catgattatc taagaataat ccactaggcg 1840
cggttatcag cgcccttgtg gggcgctgct gcccttgccc aatatgcccg gccagaggcc 1900
ggatagctgg tctattcgct gcgctaggct acacaccgcc ccaccgctgc gcggcagggg 1960
gaaaggcggg caaagcccgc taaaccccac accaaacccc gcagaaatac gctggagcgc 2020
ttttagccgc tttagcggcc tttcccccta cccgaagggt gggggcgcgt gtgcagcccc 2080
gcagggcctg tctcggtcga tcattcagcc cggctcatag atctgcgggc agtgagcgca 2140
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2200
cggctcgtat aatgtgtgga attgtgagcg gataacaatt tcacacagga tctagaaata 2260
attttgttta actttaagaa ggagatatac atatgtgaaa ccagtaacgt tatacgatgt 2320
cgcagagtat gccggtgtct cttatcagac cgtttcccgc gtggtgaacc aggccagcca 2380
cgtttctgcg aaaacgcggg aaaaagtgga agcggcgatg gcggagctga attacattcc 2440
caaccgcgtg gcacaacaac tggcgggcaa accgtcgaag cctgtaaagc ggcggtgcac 2500
aatcttctcg cgcaacgcgt cagtgggctg atagtcgttg ctgattggcg ttgccacctc 2560
cagtctggcc ctgcacgcgc cgtcgcaaat tgtcgcggcg attaaatctc gcgccgatca 2620
actgggtgcc agcgtggtgg tgtcgatggt agaacgaagc ggcattaact atccgctgga 2680
tgaccaggat gccattgctg tggaagctgc ctgcactaat gttccggcgt tatttcttga 2740
tgtctctgac cagacaccca tcaacagtat tattttctcc catgaagacg gtacgcgact 2800
gggcgtggag catctggtcg cattgggtca ccagcaaatc gcgctgttag cgggcccatt 2860
aagttctgtc tcggcgcgtc tgcgtctggc tggctggcat aaatatctca ctcgcaatca 2920
aattcagccg atagcggaac gggaaggcga ctggagtgcc atgtccggtt ttcaacaaac 2980
catgcaaatg ctgaatgagg gcatcgttcc cactgcgatg ctggttgcca acgatcagat 3040
ggcgctgggc gcaatgcgcg ccattaccga gtccgggctg cgcgttggtg cggatatctc 3100
ggtagtggga tacgacgata ccgaagacag ctcatgttat atcccgccgt taaccaccat 3160
caaacaggat tttcgcctgc tggggcaaac cagcgtggac cgcttgctgc aactctctca 3220
gggccaggcg gtgaagggca atcagctgtt gcccgtctca ctggtgaaaa gaaaaaccac 3280
cctggcgccc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct 3340
ggcacgacag gtttcccgac tggaaagcgg gcagtgagga tccggggggt ggcccgatga 3400
agaacgacag gactttgcag gccataggcc gacagctcaa ggccatgggc tgtgagcgct 3460
tcgatatcgg cgtcagggac gccaccaccg gccagatgat gaaccgggaa tggtcagccg 3520
ccgaagtgct ccagaacacg ccatggctca agcggatgaa tgcccagggc aatgacgtgt 3580
atatcaggcc cgccgagcag gagcggcatg gtctggtgct ggtggacgac ctcagcgagt 3640
ttgacctgga tgacatgaaa gccgagggcc gggagcctgc cctggtagtg gaaaccagcc 3700
cgaagaacta tcaggcatgg gtcaaggtgg ccgacgccgc aggcggtgaa cttcgggggc 3760
agattgcccg gacgctggcc agcgagtacg acgccgaccc ggccagcgcc gacagccgcc 3820
actatggccg cttggcgggc ttcaccaacc gcaaggacaa gcacaccacc cgcgccggtt 3880
atcagccgtg ggtgctgctg cgtgaatcca agggcaagac cgccaccgct ggcccggcgc 3940
tggtgcagca ggctggccag cagatcgagc aggcccagcg gcagcaggag aaggcccgca 4000
ggctggccag cctcgaactg cccgagcggc agcttagccg ccaccggcgc acggcgctgg 4060
acgagtaccg cagcgagatg gccgggctgg tcaagcgctt cggtgatgac ctcagcaagt 4120
gcgactttat cgccgcgcag aagctggcca gccggggccg cagtgccgag gaaatcggca 4180
aggccatggc cgaggccagc ccagcgctgg cagagcgcaa gcccggccac gaagcggatt 4240
acatcgagcg caccgtcagc aaggtcatgg gtctgcccag cgtccagctt gcgcgggccg 4300
agctggcacg ggcaccggca ccccgccagc gaggcatgga caggggcggg ccagatttca 4360
gcatgtagtg cttgcgttgg tactcacgcc tgttatacta tgagtactca cgcacagaag 4420
ggggttttat ggaatacgaa aaaagcgctt cagggtcggt ctacctgatc aaaagtgaca 4480
agggctattg gttgcccggt ggctttggtt atacgtcaaa caaggccgag gctggccgct 4540
tttcagtcgc tgatatggcc agccttaacc ttgacggctg caccttgtcc ttgttccgcg 4600
aagacaagcc tttcggcccc ggcaagtttc tcggtgactg atatgaaaga ccaaaaggac 4660
aagcagaccg gcgacctgct ggccagccct gacgctgtac gccaagcgcg atatgccgag 4720
cgcatgaagg ccaaagggat gcgtcagcgc aagttctggc tgaccgacga cgaatacgag 4780
gcgctgcgcg agtgcctgga agaactcaga gcggcgcagg gcgggggtag tgaccccgcc 4840
agcgcctaac caccaactgc ctgcaaagga ggcaatcaat ggctacccat aagcctatca 4900
atattctgga ggcgttcgca gcagcgccgc caccgctgga ctacgttttg cccaacatgg 4960
tggccggtac ggtcggggcg ctggtgtcgc ccggtggtgc cggtaaatcc atgctggccc 5020
tgcaactggc cgcacagatt gcaggcgggc cggatctgct ggaggtgggc gaactgccca 5080
ccggcccggt gatctacctg cccgccgaag acccgcccac cgccattcat caccgcctgc 5140
acgcccttgg ggcgcacctc agcgccgagg aacggcaagc cgtggctgac ggcctgctga 5200
tccagccgct gatcggcagc ctgcccaaca tcatggcccc ggagtggttc gacggcctca 5260
agcgcgccgc cgagggccgc cgcctgatgg tgctggacac gctgcgccgg ttccacatcg 5320
aggaagaaaa cgccagcggc cccatggccc aggtcatcgg tcgcatggag gccatcgccg 5380
ccgataccgg gtgctctatc gtgttcctgc accatgccag caagggcgcg gccatgatgg 5440
gcgcaggcga ccagcagcag gccagccggg gcagctcggt actggtcgat aacatccgct 5500
ggcagtccta cctgtcgagc atgaccagcg ccgaggccga ggaatggggt gtggacgacg 5560
accagcgccg gttcttcgtc cgcttcggtg tgagcaaggc caactatggc gcaccgttcg 5620
ctgatcggtg gttcaggcgg catgacggcg gggtgctcaa gcccgccgtg ctggagaggc 5680
agcgcaagag caagggggtg ccccgtggtg aagcctaaga acaagcacag cctcagccac 5740
gtccggcacg acccggcgca ctgtctggcc cccggcctgt tccgtgccct caagcggggc 5800
gagcgcaagc gcagcaagct ggacgtgacg tatgactacg gcgacggcaa gcggatcgag 5860
ttcagcggcc cggagccgct gggcgctgat gatctgcgca tcctgcaagg gctggtggcc 5920
atggctgggc ctaatggcct agtgcttggc ccggaaccca agaccgaagg cggacggcag 5980
ctccggctgt tcctggaacc caagtgggag gccgtcaccg ctgaatgcca tgtggtcaaa 6040
ggtagctatc gggcgctggc aaaggaaatc ggggcagagg tcgatagtgg tggggcgctc 6100
aagcacatac aggactgcat cgagcgcctt tggaaggtat ccatcatcgc ccagaatggc 6160
cgcaagcggc aggggtttcg gctgctgtcg gagtacgcca gcgacgaggc ggacgggcgc 6220
ctgtacgtgg ccctgaaccc cttgatcgcg caggccgtca tgggtggcgg ccagcatgtg 6280
cgcatcagca tggacgaggt gcgggcgctg gacagcgaaa ccgcccgcct gctgcaccag 6340
cggctgtgtg gctggatcga ccccggcaaa accggcaagg cttccataga taccttgtgc 6400
ggctatgtct ggccgtcaga ggccagtggt tcgaccatgc gcaagcgccg ccagcgggtg 6460
cgcgaggcgt tgccggagct ggtcgcgctg ggctggacgg taaccgagtt cgcggcgggc 6520
aagtacgaca tcacccggcc caaggcggca ggctgacccc ccccactcta ttgtaaacaa 6580
gacattttta tcttttatat tcaatggctt attttcctgc taattggtaa taccatgaaa 6640
aataccatgc tcagaaaagg cttaacaata ttttgaaaaa ttgcctactg agcgctgccg 6700
cacagctcca taggccgctt tcctggcttt gcttccagat gtatgctctt ctgctcctgc 6760
agagcttgcc gccgccgcag gtggagctga cttcatccgc acacacgagc cgcgcccctt 6820
gcgcgacggg ctggcggtat tggcggcgct gaaagaaacc gcaa 6864
<210>28
<211>264
<212>PRT
<213>Escherichia coli
<400>28
Met Lys Gln Tyr Leu Glu Leu Met Gln Lys Val Leu Asp Glu Gly Thr
1 5 10 15
Gln Lys Asn Asp Arg Thr Gly Thr Gly Thr Leu Ser Ile Phe Gly His
20 25 30
Gln Met Arg Phe Asn Leu Gln Asp Gly Phe Pro Leu Val Thr Thr Lys
35 40 45
Arg Cys His Leu Arg Ser Ile Ile His Glu Leu Leu Trp Phe Leu Gln
50 55 60
Gly Asp Thr Asn Ile Ala Tyr Leu His Glu Asn Asn Val Thr Ile Trp
65 70 75 80
Asp Glu Trp Ala Asp Glu Asn Gly Asp Leu Gly Pro Val Tyr Gly Lys
85 90 95
Gln Trp Arg Ala Trp Pro Thr Pro Asp Gly Arg His Ile Asp Gln Ile
100 105 110
Thr Thr Val Leu Asn Gln Leu Lys Asn Asp Pro Asp Ser Arg Arg Ile
115 120 125
Ile Val Ser Ala Trp Asn Val Gly Glu Leu Asp Lys Met Ala Leu Ala
130 135 140
Pro Cys His Ala Phe Phe Gln Phe Tyr Val Ala Asp Gly Lys Leu Ser
145 150 155 160
Cys Gln Leu Tyr Gln Arg Ser Cys Asp Val Phe Leu Gly Leu Pro Phe
165 170 175
Asn Ile Ala Ser Tyr Ala Leu Leu Val His Met Met Ala Gln Gln Cys
180 185 190
Asp Leu Glu Val Gly Asp Phe Val Trp Thr Gly Gly Asp Thr His Leu
195 200 205
Tyr Ser Asn His Met Asp Gln Thr His Leu Gln Leu Ser Arg Glu Pro
210 215 220
Arg Pro Leu Pro Lys Leu Ile Ile Lys Arg Lys Pro Glu Ser Ile Phe
225 230 235 240
Asp Tyr Arg Phe Glu Asp Phe Glu Ile Glu Gly Tyr Asp Pro His Pro
245 250 255
Gly Ile Lys Ala Pro Val Ala Ile
260
<210>29
<211>37
<212>DNA
<213>Artificial
<220>
<223>P1
<400>29
cctttggtac cagatctgcg ggcagtgagc gcaacgc 37
<210>30
<211>36
<212>DNA
<213>Artificial
<220>
<223>P2
<400>30
aattgggatc cgctcactgc ccgctttcca gtcggg 36
<210>31
<211>38
<212>DNA
<213>Artificial
<220>
<223>P3
<400>31
cgcttggatc cggggggtgg cccgatgaag aacgacag 38
<210>32
<211>33
<212>DNA
<213>Artificial
<220>
<223>P4
<400>32
ctcttggtac cgcctgatat acacgtcatt gcc 33
<210>33
<211>33
<212>DNA
<213>Artificial
<220>
<223>P5
<400>33
tagcgagatc tctgatgtcc ggcggtgctt ttg 33
<210>34
<211>32
<212>DNA
<213>Artificial
<220>
<223>P6
<400>34
aaaaagagct cttacgcccc gccctgccac tc 32
<210>35
<211>31
<212>DNA
<213>Artificial
<220>
<223>P7
<400>35
cctttgagct cgcgggcagt gagcgcaacg c 31
<210>36
<211>34
<212>DNA
<213>Artificial
<220>
<223>P8
<400>36
ctgtttctag atcctgtgtg aaattgttat ccgc 34
<210>37
<211>61
<212>DNA
<213>Artificial
<220>
<223>P9
<400>37
gcagggcctg tctcggtcga tcattcagcc cggctcatag atctctgatg tccggcggtg 60
c 61
<210>38
<211>26
<212>DNA
<213>Artificial
<220>
<223>ThyA1
<400>38
cggaattcct gattggttac ggcgcg 26
<210>39
<211>29
<212>DNA
<213>Artificial
<220>
<223>ThyA2
<400>39
cccaagcttc gtaattagat agccaccgg 29
<210>40
<211>33
<212>DNA
<213>Artificial
<220>
<223>ThyA4
<400>40
ggtgcctggg tgcatgatat aatcatgggg caa 33
<210>41
<211>28
<212>DNA
<213>Artificial
<220>
<223>ThyA5
<400>41
gccccatgat aatatcatgc acccaggc 28
<210>42
<211>28
<212>DNA
<213>Artificial
<220>
<223>ThyA16
<400>42
ctgtggtttc tccagggcga cactaaca 28
<210>43
<211>28
<212>DNA
<213>Artificial
<220>
<223>ThyA17
<400>43
tgttagtgtc gccctggaga aaccacag 28
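Primers ThyA16 (SEQ ID NO: 42) and ThyA17 (SEQ ID NO: 43) appear to be a complementary pair, as is typical for site-directed mutagenesis primers. The following minimal sketch checks this directly; the helper name `reverse_complement` is hypothetical and not part of the listing, and only the two primer strings are taken from it.

```python
# Hypothetical helper (not from the patent): reverse complement of a DNA string.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written with a, c, g, t."""
    cleaned = "".join(seq.split()).lower()  # drop the 10-base grouping spaces
    return cleaned.translate(COMPLEMENT)[::-1]


# Primer sequences copied from SEQ ID NO: 42 and SEQ ID NO: 43.
THYA16 = "ctgtggtttc tccagggcga cactaaca"
THYA17 = "tgttagtgtc gccctggaga aaccacag"

if __name__ == "__main__":
    is_pair = reverse_complement(THYA16) == "".join(THYA17.split())
    print("ThyA16 and ThyA17 are reverse complements:", is_pair)
```

The same check can be applied to any other primer pair in the listing to see whether the two oligonucleotides anneal to the same site on opposite strands.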
<210>44
<211>795
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(795)
<223>thyA
<400>44
atg aaa cag tat tta gaa ctg atg caa aaa gtg ctc gac gaa ggc aca 48
Met Lys Gln Tyr Leu Glu Leu Met Gln Lys Val Leu Asp Glu Gly Thr
1 5 10 15
cag aaa aac gac cgt acc gga acc gga acg ctt tcc att ttt ggt cat 96
Gln Lys Asn Asp Arg Thr Gly Thr Gly Thr Leu Ser Ile Phe Gly His
20 25 30
cag atg cgt ttt aac ctg caa gat gga ttc ccg ctg gtg aca act aaa 144
Gln Met Arg Phe Asn Leu Gln Asp Gly Phe Pro Leu Val Thr Thr Lys
35 40 45
cgt tgc cac ctg cgt tcc atc atc cat gaa ctg ctg tgg ttt ctg cag 192
Arg Cys His Leu Arg Ser Ile Ile His Glu Leu Leu Trp Phe Leu Gln
50 55 60
ggc gac act aac att gct tat cta cac gaa aac aat gtc acc atc tgg 240
Gly Asp Thr Asn Ile Ala Tyr Leu His Glu Asn Asn Val Thr Ile Trp
65 70 75 80
gac gaa tgg gcc gat gaa aac ggc gac ctc ggg cca gtg tat ggt aaa 288
Asp Glu Trp Ala Asp Glu Asn Gly Asp Leu Gly Pro Val Tyr Gly Lys
85 90 95
cag tgg cgc gcc tgg cca acg cca gat ggt cgt cat att gac cag atc 336
Gln Trp Arg Ala Trp Pro Thr Pro Asp Gly Arg His Ile Asp Gln Ile
100 105 110
act acg gta ctg aac cag ctg aaa aac gac ccg gat tcg cgc cgc att 384
Thr Thr Val Leu Asn Gln Leu Lys Asn Asp Pro Asp Ser Arg Arg Ile
115 120 125
att gtt tca gcg tgg aac gta ggc gaa ctg gat aaa atg gcg ctg gca 432
Ile Val Ser Ala Trp Asn Val Gly Glu Leu Asp Lys Met Ala Leu Ala
130 135 140
ccg tgc cat gca ttc ttc cag ttc tat gtg gca gac ggc aaa ctc tct 480
Pro Cys His Ala Phe Phe Gln Phe Tyr Val Ala Asp Gly Lys Leu Ser
145 150 155 160
tgc cag ctt tat cag cgc tcc tgt gac gtc ttc ctc ggc ctg ccg ttc 528
Cys Gln Leu Tyr Gln Arg Ser Cys Asp Val Phe Leu Gly Leu Pro Phe
165 170 175
aac att gcc agc tac gcg tta ttg gtg cat atg atg gcg cag cag tgc 576
Asn Ile Ala Ser Tyr Ala Leu Leu Val His Met Met Ala Gln Gln Cys
180 185 190
gat ctg gaa gtg ggt gat ttt gtc tgg acc ggt ggc gac acg cat ctg 624
Asp Leu Glu Val Gly Asp Phe Val Trp Thr Gly Gly Asp Thr His Leu
195 200 205
tac agc aac cat atg gat caa act cat ctg caa tta agc cgc gaa ccg 672
Tyr Ser Asn His Met Asp Gln Thr His Leu Gln Leu Ser Arg Glu Pro
210 215 220
cgt ccg ctg ccg aag ttg att atc aaa cgt aaa ccc gaa tcc atc ttc 720
Arg Pro Leu Pro Lys Leu Ile Ile Lys Arg Lys Pro Glu Ser Ile Phe
225 230 235 240
gac tac cgt ttc gaa gac ttt gag att gaa ggc tac gat ccg cat ccg 768
Asp Tyr Arg Phe Glu Asp Phe Glu Ile Glu Gly Tyr Asp Pro His Pro
245 250 255
ggc att aaa gcg ccg gtg gct atc taa 795
Gly Ile Lys Ala Pro Val Ala Ile
260
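The CDS feature of SEQ ID NO: 44 spans positions 1 to 795, that is 265 codons: 264 amino acids plus the terminal taa stop codon, which agrees with the 264-residue protein listed for thyA. The sketch below shows one way to re-derive the protein line from the nucleotide line using the standard genetic code. The function name `translate` and the table construction are illustrative assumptions, not something the listing prescribes; only the first sequence row is copied from it.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence to one-letter amino acids, stopping at a stop codon."""
    seq = "".join(cds.split()).lower()  # remove the codon-separating spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First row of SEQ ID NO: 44 (nucleotides 1 to 48).
FIRST_ROW = "atg aaa cag tat tta gaa ctg atg caa aaa gtg ctc gac gaa ggc aca"

if __name__ == "__main__":
    print(translate(FIRST_ROW))
```

Running the full 795-nucleotide sequence through the same function and comparing the result with the protein listing is a quick way to spot transcription errors in either line.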
<210>45
<211>264
<212>PRT
<213>Escherichia coli
<400>45
Met Lys Gln Tyr Leu Glu Leu Met Gln Lys Val Leu Asp Glu Gly Thr
1 5 10 15
Gln Lys Asn Asp Arg Thr Gly Thr Gly Thr Leu Ser Ile Phe Gly His
20 25 30
Gln Met Arg Phe Asn Leu Gln Asp Gly Phe Pro Leu Val Thr Thr Lys
35 40 45
Arg Cys His Leu Arg Ser Ile Ile His Glu Leu Leu Trp Phe Leu Gln
50 55 60
Gly Asp Thr Asn Ile Ala Tyr Leu His Glu Asn Asn Val Thr Ile Trp
65 70 75 80
Asp Glu Trp Ala Asp Glu Asn Gly Asp Leu Gly Pro Val Tyr Gly Lys
85 90 95
Gln Trp Arg Ala Trp Pro Thr Pro Asp Gly Arg His Ile Asp Gln Ile
100 105 110
Thr Thr Val Leu Asn Gln Leu Lys Asn Asp Pro Asp Ser Arg Arg Ile
115 120 125
Ile Val Ser Ala Trp Asn Val Gly Glu Leu Asp Lys Met Ala Leu Ala
130 135 140
Pro Cys His Ala Phe Phe Gln Phe Tyr Val Ala Asp Gly Lys Leu Ser
145 150 155 160
Cys Gln Leu Tyr Gln Arg Ser Cys Asp Val Phe Leu Gly Leu Pro Phe
165 170 175
Asn Ile Ala Ser Tyr Ala Leu Leu Val His Met Met Ala Gln Gln Cys
180 185 190
Asp Leu Glu Val Gly Asp Phe Val Trp Thr Gly Gly Asp Thr His Leu
195 200 205
Tyr Ser Asn His Met Asp Gln Thr His Leu Gln Leu Ser Arg Glu Pro
210 215 220
Arg Pro Leu Pro Lys Leu Ile Ile Lys Arg Lys Pro Glu Ser Ile Phe
225 230 235 240
Asp Tyr Arg Phe Glu Asp Phe Glu Ile Glu Gly Tyr Asp Pro His Pro
245 250 255
Gly Ile Lys Ala Pro Val Ala Ile
260
<210>46
<211>618
<212>DNA
<213>Escherichia coli
<220>
<221>CDS
<222>(1)..(618)
<223>tdk
<400>46
atg gca cag cta tat ttc tac tat tcc gca atg aat gcg ggt aag tct 48
Met Ala Gln Leu Tyr Phe Tyr Tyr Ser Ala Met Asn Ala Gly Lys Ser
1 5 10 15
aca gca ttg ttg caa tct tca tac aat tac cag gaa cgc ggc atg cgc 96
Thr Ala Leu Leu Gln Ser Ser Tyr Asn Tyr Gln Glu Arg Gly Met Arg
20 25 30
act gtc gta tat acg gca gaa att gat gat cgc ttt ggt gcc ggg aaa 144
Thr Val Val Tyr Thr Ala Glu Ile Asp Asp Arg Phe Gly Ala Gly Lys
35 40 45
gtc agt tcg cgt ata ggt ttg tca tcg cct gca aaa tta ttt aac caa 192
Val Ser Ser Arg Ile Gly Leu Ser Ser Pro Ala Lys Leu Phe Asn Gln
50 55 60
aat tca tca tta ttt gat gag att cgt gcg gaa cat gaa cag cag gca 240
Asn Ser Ser Leu Phe Asp Glu Ile Arg Ala Glu His Glu Gln Gln Ala
65 70 75 80
att cat tgc gta ctg gtt gat gaa tgc cag ttt tta acc aga caa caa 288
Ile His Cys Val Leu Val Asp Glu Cys Gln Phe Leu Thr Arg Gln Gln
85 90 95
gta tat gaa tta tcg gag gtt gtc gat caa ctc gat ata ccc gta ctt 336
Val Tyr Glu Leu Ser Glu Val Val Asp Gln Leu Asp Ile Pro Val Leu
100 105 110
tgt tat ggt tta cgt acc gat ttt cga ggt gaa tta ttt att ggc agc 384
Cys Tyr Gly Leu Arg Thr Asp Phe Arg Gly Glu Leu Phe Ile Gly Ser
115 120 125
caa tac tta ctg gca tgg tcc gac aaa ctg gtt gaa tta aaa acc atc 432
Gln Tyr Leu Leu Ala Trp Ser Asp Lys Leu Val Glu Leu Lys Thr Ile
130 135 140
tgt ttt tgt ggc cgt aaa gca agc atg gtg ctg cgt ctt gat caa gca 480
Cys Phe Cys Gly Arg Lys Ala Ser Met Val Leu Arg Leu Asp Gln Ala
145 150 155 160
ggc aga cct tat aac gaa ggt gag cag gtg gta att ggt ggt aat gaa 528
Gly Arg Pro Tyr Asn Glu Gly Glu Gln Val Val Ile Gly Gly Asn Glu
165 170 175
cga tac gtt tct gta tgc cgt aaa cac tat aaa gag gcg tta caa gtc 576
Arg Tyr Val Ser Val Cys Arg Lys His Tyr Lys Glu Ala Leu Gln Val
180 185 190
gac tca tta acg gct att cag gaa agg cat cgc cac gat taa 618
Asp Ser Leu Thr Ala Ile Gln Glu Arg His Arg His Asp
195 200 205
<210>47
<211>205
<212>PRT
<213>Escherichia coli
<400>47
Met Ala Gln Leu Tyr Phe Tyr Tyr Ser Ala Met Asn Ala Gly Lys Ser
1 5 10 15
Thr Ala Leu Leu Gln Ser Ser Tyr Asn Tyr Gln Glu Arg Gly Met Arg
20 25 30
Thr Val Val Tyr Thr Ala Glu Ile Asp Asp Arg Phe Gly Ala Gly Lys
35 40 45
Val Ser Ser Arg Ile Gly Leu Ser Ser Pro Ala Lys Leu Phe Asn Gln
50 55 60
Asn Ser Ser Leu Phe Asp Glu Ile Arg Ala Glu His Glu Gln Gln Ala
65 70 75 80
Ile His Cys Val Leu Val Asp Glu Cys Gln Phe Leu Thr Arg Gln Gln
85 90 95
Val Tyr Glu Leu Ser Glu Val Val Asp Gln Leu Asp Ile Pro Val Leu
100 105 110
Cys Tyr Gly Leu Arg Thr Asp Phe Arg Gly Glu Leu Phe Ile Gly Ser
115 120 125
Gln Tyr Leu Leu Ala Trp Ser Asp Lys Leu Val Glu Leu Lys Thr Ile
130 135 140
Cys Phe Cys Gly Arg Lys Ala Ser Met Val Leu Arg Leu Asp Gln Ala
145 150 155 160
Gly Arg Pro Tyr Asn Glu Gly Glu Gln Val Val Ile Gly Gly Asn Glu
165 170 175
Arg Tyr Val Ser Val Cys Arg Lys His Tyr Lys Glu Ala Leu Gln Val
180 185 190
Asp Ser Leu Thr Ala Ile Gln Glu Arg His Arg His Asp
195 200 205
<210>48
<211>7965
<212>DNA
<213>Escherichia coli
<400>48
aactgcacat tcgggatatt tctctatatt cgcgcttcat cagaaaactg aaggaacctc 60
cattgaatcg aactaatatt ttttttggtg aatcgcattc tgactggttg cctgtcagag 120
gcggagaatc tggtgatttt gtttttcgac gtggtgacgg gcatgccttc gcgaaaatcg 180
cacctgcttc ccgccgcggt gagctcgctg gagagcgtga ccgcctcatt tggctcaaag 240
gtcgaggtgt ggcttgcccc gaggtcatca actggcagga ggaacaggag ggtgcatgct 300
tggtgataac ggcaattccg ggagtaccgg cggctgatct gtctggagcg gatttgctca 360
aagcgtggcc gtcaatgggg cagcaacttg gcgctgttca cagcctatcg gttgatcaat 420
gtccgtttga gcgcaggctg tcgcgaatgt tcggacgcgc cgttgatgtg gtgtcccgca 480
atgccgtcaa tcccgacttc ttaccggacg aggacaagag tacgccgctg cacgatcttt 540
tggctcgtgt cgaacgagag ctaccggtgc ggctcgacca agagcgcacc gatatggttg 600
tttgccatgg tgatccctgc atgccgaact tcatggtgga ccctaaaact cttcaatgca 660
cgggtctgat cgaccttggg cggctcggaa cagcagatcg ctatgccgat ttggcactca 720
tgattgctaa cgccgaagag aactgggcag cgccagatga agcagagcgc gccttcgctg 780
tcctattcaa tgtattgggg atcgaagccc ccgaccgcga acgccttgcc ttctatctgc 840
gattggaccc tctgacttgg ggttgatgtt catgccgcct gtttttcctg ctcattggca 900
cgtttcgcaa cctgttctca ttgcggacac cttttccagc ctcgtttgga aagtttcatt 960
gccagacggg actcctgcaa tcgtcaaggg attgaaacct atagaagaca ttgctgatga 1020
actgcgcggg gccgactatc tggtatggcg caatgggagg ggagcagtcc ggttgctcgg 1080
tcgtgagaac aatctgatgt tgctcgaata tgccggggag cgaatgctct ctcacatcgt 1140
tgccgagcac ggcgactacc aggcgaccga aattgcagcg gaactaatgg cgaagctgta 1200
tgccgcatct gaggaacccc tgccttctgc ccttctcccg atccgggatc gctttgcagc 1260
tttgtttcag cgggcgcgcg atgatcaaaa cgcaggttgt caaactgact acgtccacgc 1320
ggcgattata gccgatcaaa tgatgagcaa tgcctcggaa ctgcgtgggc tacatggcga 1380
tctgcatcat gaaaacatca tgttctccag tcgcggctgg ctggtgatag atcccgtcgg 1440
tctggtcggt gaagtgggct ttggcgccgc caatatgttc tacgatccgg ctgacagaga 1500
cgacctttgt ctcgatccta gacgcattgc acagatggcg gacgcattct ctcgtgcgct 1560
ggacgtcgat ccgcgtcgcc tgctcgacca ggcgtacgct tatgggtgcc tttccgcagc 1620
ttggaacgcg gatggagaag aggagcaacg cgatctagct atcgcggccg cgatcaagca 1680
ggtgcgacag acgtcatact agatatcaag cgacttctcc tatcccctgg gaacacatca 1740
atctcaccgg agaatatcgc tggccaaagc cttagcgtag gattccgccc cttcccgcaa 1800
acgaccccaa acaggaaacg cagctgaaac gggaagctca acacccactg acgcatgggt 1860
tgttcaggca gtacttcatc aaccagcaag gcggcacttt cggccatccg ccgcgcccca 1920
cagctcgggc agaaaccgcg acgcttacag ctgaaagcga ccaggtgctc ggcgtggcaa 1980
gactcgcagc gaacccgtag aaagccatgc tccagccgcc cgcattggag aaattcttca 2040
aattcccgtt gcacatagcc cggcaattcc tttccctgct ctgccataag cgcagcgaat 2100
gccgggtaat actcgtcaac gatctgatag agaagggttt gctcgggtcg gtggctctgg 2160
taacgaccag tatcccgatc ccggctggcc gtcctggccg ccacatgagg catgttccgc 2220
gtccttgcaa tactgtgttt acatacagtc tatcgcttag cggaaagttc ttttaccctc 2280
agccgaaatg cctgccgttg ctagacattg ccagccagtg cccgtcactc ccgtactaac 2340
tgtcacgaac ccctgcaata actgtcacgc ccccctgcaa taactgtcac gaacccctgc 2400
aataactgtc acgcccccaa acctgcaaac ccagcagggg cgggggctgg cggggtgttg 2460
gaaaaatcca tccatgatta tctaagaata atccactagg cgcggttatc agcgcccttg 2520
tggggcgctg ctgcccttgc ccaatatgcc cggccagagg ccggatagct ggtctattcg 2580
ctgcgctagg ctacacaccg ccccaccgct gcgcggcagg gggaaaggcg ggcaaagccc 2640
gctaaacccc acaccaaacc ccgcagaaat acgctggagc gcttttagcc gctttagcgg 2700
cctttccccc tacccgaagg gtgggggcgc gtgtgcagcc ccgcagggcc tgtctcggtc 2760
gatcattcag cccggctcat agatctgcgg gcagtgagcg caacgcaatt aatgtgagtt 2820
agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt ataatgtgtg 2880
gaattgtgag cggataacaa tttcacacag gatctagaaa taattttgtt taactttaag 2940
aaggagatat acatatgtga aaccagtaac gttatacgat gtcgcagagt atgccggtgt 3000
ctcttatcag accgtttccc gcgtggtgaa ccaggccagc cacgtttctg cgaaaacgcg 3060
ggaaaaagtg gaagcggcga tggcggagct gaattacatt cccaaccgcg tggcacaaca 3120
actggcgggc aaaccgtcga agcctgtaaa gcggcggtgc acaatcttct cgcgcaacgc 3180
gtcagtgggc tgatagtcgt tgctgattgg cgttgccacc tccagtctgg ccctgcacgc 3240
gccgtcgcaa attgtcgcgg cgattaaatc tcgcgccgat caactgggtg ccagcgtggt 3300
ggtgtcgatg gtagaacgaa gcggcattaa ctatccgctg gatgaccagg atgccattgc 3360
tgtggaagct gcctgcacta atgttccggc gttatttctt gatgtctctg accagacacc 3420
catcaacagt attattttct cccatgaaga cggtacgcga ctgggcgtgg agcatctggt 3480
cgcattgggt caccagcaaa tcgcgctgtt agcgggccca ttaagttctg tctcggcgcg 3540
tctgcgtctg gctggctggc ataaatatct cactcgcaat caaattcagc cgatagcgga 3600
acgggaaggc gactggagtg ccatgtccgg ttttcaacaa accatgcaaa tgctgaatga 3660
gggcatcgtt cccactgcga tgctggttgc caacgatcag atggcgctgg gcgcaatgcg 3720
cgccattacc gagtccgggc tgcgcgttgg tgcggatatc ggcgtcaggg acgccaccac 3780
cggccagatg atgaaccggg aatggtcagc cgccgaagtg ctccagaaca cgccatggct 3840
caagcggatg aatgcccagg gcaatgacgt gtatatcagg cccgccgagc aggagcggca 3900
tggtctggtg ctggtggacg acctcagcga gtttgacctg gatgacatga aagccgaggg 3960
ccgggagcct gccctggtag tggaaaccag cccgaagaac tatcaggcat gggtcaaggt 4020
ggccgacgcc gcaggcggtg aacttcgggg gcagattgcc cggacgctgg ccagcgagta 4080
cgacgccgac ccggccagcg ccgacagccg ccactatggc cgcttggcgg gcttcaccaa 4140
ccgcaaggac aagcacacca cccgcgccgg ttatcagccg tgggtgctgc tgcgtgaatc 4200
caagggcaag accgccaccg ctggcccggc gctggtgcag caggctggcc agcagatcga 4260
gcaggcccag cggcagcagg agaaggcccg caggctggcc agcctcgaac tgcccgagcg 4320
gcagcttagc cgccaccggc gcacggcgct ggacgagtac cgcagcgaga tggccgggct 4380
ggtcaagcgc ttcggtgatg acctcagcaa gtgcgacttt atcgccgcgc agaagctggc 4440
cagccggggc cgcagtgccg aggaaatcgg caaggccatg gccgaggcca gcccagcgct 4500
ggcagagcgc aagcccggcc acgaagcgga ttacatcgag cgcaccgtca gcaaggtcat 4560
gggtctgccc agcgtccagc ttgcgcgggc cgagctggca cgggcaccgg caccccgcca 4620
gcgaggcatg gacaggggcg ggccagattt cagcatgtag tgcttgcgtt ggtactcacg 4680
cctgttatac tatgagtact cacgcacaga agggggtttt atggaatacg aaaaaagcgc 4740
ttcagggtcg gtctacctga tcaaaagtga caagggctat tggttgcccg gtggctttgg 4800
ttatacgtca aacaaggccg aggctggccg cttttcagtc gctgatatgg ccagccttaa 4860
ccttgacggc tgcaccttgt ccttgttccg cgaagacaag cctttcggcc ccggcaagtt 4920
tctcggtgac tgatatgaaa gaccaaaagg acaagcagac cggcgacctg ctggccagcc 4980
ctgacgctgt acgccaagcg cgatatgccg agcgcatgaa ggccaaaggg atgcgtcagc 5040
gcaagttctg gctgaccgac gacgaatacg aggcgctgcg cgagtgcctg gaagaactca 5100
gagcggcgca gggcgggggt agtgaccccg ccagcgccta accaccaact gcctgcaaag 5160
gaggcaatca atggctaccc ataagcctat caatattctg gaggcgttcg cagcagcgcc 5220
gccaccgctg gactacgttt tgcccaacat ggtggccggt acggtcgggg cgctggtgtc 5280
gcccggtggt gccggtaaat ccatgctggc cctgcaactg gccgcacaga ttgcaggcgg 5340
gccggatctg ctggaggtgg gcgaactgcc caccggcccg gtgatctacc tgcccgccga 5400
agacccgccc accgccattc atcaccgcct gcacgccctt ggggcgcacc tcagcgccga 5460
ggaacggcaa gccgtggctg acggcctgct gatccagccg ctgatcggca gcctgcccaa 5520
catcatggcc ccggagtggt tcgacggcct caagcgcgcc gccgagggcc gccgcctgat 5580
ggtgctggac acgctgcgcc ggttccacat cgaggaagaa aacgccagcg gccccatggc 5640
ccaggtcatc ggtcgcatgg aggccatcgc cgccgatacc gggtgctcta tcgtgttcct 5700
gcaccatgcc agcaagggcg cggccatgat gggcgcaggc gaccagcagc aggccagccg 5760
gggcagctcg gtactggtcg ataacatccg ctggcagtcc tacctgtcga gcatgaccag 5820
cgccgaggcc gaggaatggg gtgtggacga cgaccagcgc cggttcttcg tccgcttcgg 5880
tgtgagcaag gccaactatg gcgcaccgtt cgctgatcgg tggttcaggc ggcatgacgg 5940
cggggtgctc aagcccgccg tgctggagag gcagcgcaag agcaaggggg tgccccgtgg 6000
tgaagcctaa gaacaagcac agcctcagcc acgtccggca cgacccggcg cactgtctgg 6060
cccccggcct gttccgtgcc ctcaagcggg gcgagcgcaa gcgcagcaag ctggacgtga 6120
cgtatgacta cggcgacggc aagcggatcg agttcagcgg cccggagccg ctgggcgctg 6180
atgatctgcg catcctgcaa gggctggtgg ccatggctgg gcctaatggc ctagtgcttg 6240
gcccggaacc caagaccgaa ggcggacggc agctccggct gttcctggaa cccaagtggg 6300
aggccgtcac cgctgaatgc catgtggtca aaggtagcta tcgggcgctg gcaaaggaaa 6360
tcggggcaga ggtcgatagt ggtggggcgc tcaagcacat acaggactgc atcgagcgcc 6420
tttggaaggt atccatcatc gcccagaatg gccgcaagcg gcaggggttt cggctgctgt 6480
cggagtacgc cagcgacgag gcggacgggc gcctgtacgt ggccctgaac cccttgatcg 6540
cgcaggccgt catgggtggc ggccagcatg tgcgcatcag catggacgag gtgcgggcgc 6600
tggacagcga aaccgcccgc ctgctgcacc agcggctgtg tggctggatc gaccccggca 6660
aaaccggcaa ggcttccata gataccttgt gcggctatgt ctggccgtca gaggccagtg 6720
gttcgaccat gcgcaagcgc cgccagcggg tgcgcgaggc gttgccggag ctggtcgcgc 6780
tgggctggac ggtaaccgag ttcgcggcgg gcaagtacga catcacccgg cccaaggcgg 6840
caggctgacc ccccccactc tattgtaaac aagacatttt tatcttttat attcaatggc 6900
ttattttcct gctaattggt aataccatga aaaataccat gctcagaaaa ggcttaacaa 6960
tattttgaaa aattgcctac tgagcgctgc cgcacagctc cataggccgc tttcctggct 7020
ttgcttccag atgtatgctc ttctgctcct gcagctaatg gatcaccgca aacaggttac 7080
tcgcctgggg attccctttc gacccgagca tccgtatgat actcatgctc gattattatt 7140
attatagaag cccccatgaa taaatcgctc atcattttcg gcatcgtcaa cataacctcg 7200
gacagtttct ccgatggagg ccggtatctg gcgccagacg cagccattgc gcaggcgcgt 7260
aagctgatgg ccgagggggc agatgtgatc gacctggtcc ggcatccagc aatcccgacg 7320
ccgcgcctgt ttcgtccgac acagaaatcg cgcgtatgcg ccggtgctgg acgcgctcag 7380
gcagatggca ttcccgtctc gctcgacagt tatcaacccg cgacgcaagc ctatgccttg 7440
tcgcgtggtg tggcctatct caatgatatt cgcggttttc cagacgctgc gttctatccg 7500
caattggcga aatcatctgc caaactcgtc gttatgcatt cggtgcaaga cgggcaggca 7560
gatcggcgcg aggcacccgc tggcgacatc atggatcaca ttgcggcgtt ctttgacgcg 7620
cgcatcgcgg cgctgacggg tgccggtatc aaacgcaacc gccttgtcct tgatcccggc 7680
atggggtttt ttctgggggc tgctcccgaa acctcgctct cggtgctggc gcggttcgat 7740
gaattgcggc tgcgcttcga tttgccggtg cttctgtctg tttcgcgcaa atcctttctg 7800
cgcgcgctca caggccgtgg tccgggggtg tcggggccgc gacactcgct gcagagcttg 7860
ccgccgccgc aggtggagct gacttcatcc gcacacacga gccgcgcccc ttgcgcgacg 7920
ggctggcggt attggcggcg ctgaaagaaa ccgcaagaat tcgtt 7965
Claims (13)
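1. An RSF1010-derived Mob- plasmid, wherein said plasmid is selected from the group consisting of SEQ ID NO: 24, SEQ ID NO: 27, and SEQ ID NO: 48, and variants of SEQ ID NO: 24, SEQ ID NO: 27, and SEQ ID NO: 48 that are at least 95% homologous to SEQ ID NO: 24, SEQ ID NO: 27, and SEQ ID NO: 48, and wherein said plasmid has been modified to inactivate one or more genes involved in transfer ability.
2. The plasmid according to claim 1, wherein said plasmid has been modified to inactivate an antibiotic resistance gene.
3. The plasmid according to claim 1, wherein said plasmid has been modified to increase the copy number of said plasmid.
4. The plasmid according to claim 1, which comprises the PlacUV5 promoter and an origin of replication from RSF1010 lacking the mob locus.
5. The plasmid according to claim 1, which further comprises a thymidylate synthase gene.
6. The plasmid according to claim 1, which further comprises a gene of interest.
7. A bacterium comprising the plasmid of claim 1.
8. The bacterium according to claim 7, wherein said bacterium is a Gram-negative bacterium.
9. The bacterium according to claim 8, wherein said bacterium lacks active thymidylate synthase and lacks active thymidine kinase.
10. The bacterium according to claim 9, wherein said bacterium has the ability to produce a useful metabolite.
11. The bacterium according to claim 10, wherein said useful metabolite is selected from the group consisting of native or recombinant proteins, enzymes, L-amino acids, nucleosides, nucleotides, organic acids, and vitamins.
12. A method for producing a useful metabolite, comprising (a) culturing the bacterium according to claim 10 in a culture medium and (b) collecting said useful metabolite from said culture medium.
13. The method according to claim 12, wherein said useful metabolite is selected from the group consisting of native or recombinant proteins, enzymes, L-amino acids, nucleosides, nucleotides, organic acids, and vitamins.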
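Claim 1 covers variants that are at least 95% homologous to the listed plasmid sequences. The claims do not state how homology is to be computed, so the following is only an illustrative sketch that assumes the two sequences have already been aligned to equal length (gaps written as "-") and that homology is read as simple percent identity over the alignment. The function names and the gap handling are assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns where both sequences carry the same base.

    Assumes the inputs are pre-aligned and of equal length; columns that
    contain a gap ("-") in either sequence count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.lower(), aligned_b.lower())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """Return True if the percent identity reaches the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy 20-base example with one mismatch, i.e. 95% identity.
    reference = "acgtacgtacgtacgtacgt"
    variant = "acgtacgtacgtacgtacga"
    print(percent_identity(reference, variant), meets_threshold(reference, variant))
```

In practice the alignment itself would come from a dedicated alignment tool, and the identity definition (for example, how gaps and alignment length are counted) would need to follow whatever convention the patent description specifies.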
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
RU2004119027 | 2004-06-24 | ||
RU2004119027/13A RU2306338C2 (ru) | 2004-06-24 | 2004-06-24 | RSF1010-derived Mob- plasmid containing no antibiotic resistance genes, bacterium containing said plasmid, and method for producing useful metabolites |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1973043A (zh) | 2007-05-30 |
Family
ID=34971535
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
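CNA2005800212254A Pending CN1973043A (zh) | 2004-06-24 | 2005-06-24 | RSF1010-derived Mob-deficient plasmid containing no antibiotic resistance genes, bacterium containing the plasmid, and method for producing useful metabolites |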
Country Status (7)
Country | Link |
---|---|
US (1) | US20060014257A1 (zh) |
EP (1) | EP1761632A1 (zh) |
JP (1) | JP2008503202A (zh) |
CN (1) | CN1973043A (zh) |
BR (1) | BRPI0512143A (zh) |
RU (1) | RU2306338C2 (zh) |
WO (1) | WO2006001514A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
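CN111471685A (zh) * | 2020-05-21 | 2020-07-31 | Institute of Hydrobiology, Chinese Academy of Sciences | Stentor RNA interference expression vector, construction method, and application thereof |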
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2229513C2 (ru) * | 2001-11-23 | 2004-05-27 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" | Method for producing L-amino acids, Escherichia coli strain producing an L-amino acid (variants) |
GB0207021D0 (en) * | 2002-03-25 | 2002-05-08 | Univ Warwick | Anti-bacterial agents |
RU2004124226A (ru) | 2004-08-10 | 2006-01-27 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" (ZAO AGRI) (RU) | Use of phosphoketolase for producing useful metabolites |
US7915018B2 (en) | 2004-10-22 | 2011-03-29 | Ajinomoto Co., Inc. | Method for producing L-amino acids using bacteria of the Enterobacteriaceae family |
ATE417119T1 (de) | 2005-02-18 | 2008-12-15 | Ajinomoto Kk | Method for producing a non-aromatic L-amino acid using a bacterium of the Enterobacteriaceae family with attenuated expression of the csrA gene |
WO2006088235A1 (en) | 2005-02-18 | 2006-08-24 | Ajinomoto Co., Inc. | A method for producing an l-amino acid using a bacterium of the enterobacteriaceae family |
EP1907529A1 (en) * | 2005-07-25 | 2008-04-09 | Ajinomoto Co., Inc. | A METHOD FOR PRODUCING AN L-AMINO ACID USING A BACTERIUM OF THE ENTEROBACTERIACEAE FAMILY WITH ATTENUATED EXPRESSION OF THE cpxR GENE |
WO2007018310A1 (en) * | 2005-08-09 | 2007-02-15 | Ajinomoto Co., Inc. | A METHOD FOR PRODUCING AN L-AMINO ACID USING A BACTERIUM OF THE ENTEROBACTERIACEAE FAMILY WITH ATTENUATED EXPRESSION OF THE ybiV GENE |
EP2004803A2 (en) | 2006-03-23 | 2008-12-24 | Ajinomoto Co., Inc. | A method for producing an l-amino acid using bacterium of theenterobacteriaceae family with attenuated expression of a gene coding for small rna |
WO2007119890A1 (en) | 2006-04-18 | 2007-10-25 | Ajinomoto Co., Inc. | A METHOD FOR PRODUCING AN L-AMINO ACID USING A BACTERIUM OF THE ENTEROBACTERIACEAE FAMILY WITH ATTENUATED EXPRESSION OF THE sfmACDFH-fimZ CLUSTER OR THE fimZ GENE |
RU2006129690A (ru) | 2006-08-16 | 2008-02-27 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" (ZAO AGRI) (RU) | Method for producing an L-amino acid using a bacterium of the Enterobacteriaceae family in which expression of the ydiN gene, the ydiB gene, or a combination thereof is attenuated |
US8318481B2 (en) * | 2007-12-07 | 2012-11-27 | Pfenex Inc. | High copy number self-replicating plasmids in pseudomonas |
RU2008105793A (ru) * | 2008-02-19 | 2009-08-27 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" (ZAO AGRI) (RU) | Method for constructing operons containing translationally coupled genes, bacterium containing such an operon, method for producing a useful metabolite, and method for monitoring gene expression |
CN115109791B (zh) * | 2022-06-22 | 2023-09-01 | South China Agricultural University | Broad-host-range functional gene delivery vector based on an IncQ-type plasmid, construction method, and application |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4912046A (en) * | 1983-06-27 | 1990-03-27 | Genentech, Inc. | Portable inducible control system |
IE892131A1 (en) * | 1989-06-30 | 1991-01-02 | Univ Cork | Marker genes for genetic manipulation |
US5670343A (en) * | 1990-04-24 | 1997-09-23 | Rhone Poulenc Biochimie | Cloning and/or expression vectors, preparation method and their use |
US6916646B1 (en) * | 1997-06-23 | 2005-07-12 | Genencor International, Inc. | Enterobacteriaceae fermentation strains |
RU2212447C2 (ru) * | 2000-04-26 | 2003-09-20 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" | Escherichia coli strain producing an amino acid (variants) and method for producing amino acids (variants) |
RU2229513C2 (ru) * | 2001-11-23 | 2004-05-27 | Closed Joint-Stock Company "Ajinomoto-Genetika Research Institute" | Method for producing L-amino acids, Escherichia coli strain producing an L-amino acid (variants) |
BRPI0407555A (pt) * | 2003-02-21 | 2006-02-14 | Kaneka Corp | Novel vector |
US20050191684A1 (en) * | 2004-02-25 | 2005-09-01 | Zimenkov Danila V. | Method for producing L-amino acids |
2004
- 2004-06-24 RU RU2004119027/13A patent/RU2306338C2/ru active
2005
- 2005-06-24 BR BRPI0512143-4A patent/BRPI0512143A/pt not active IP Right Cessation
- 2005-06-24 WO PCT/JP2005/012159 patent/WO2006001514A1/en not active Application Discontinuation
- 2005-06-24 US US11/165,067 patent/US20060014257A1/en not active Abandoned
- 2005-06-24 CN CNA2005800212254A patent/CN1973043A/zh active Pending
- 2005-06-24 JP JP2006554377A patent/JP2008503202A/ja not active Ceased
- 2005-06-24 EP EP05755743A patent/EP1761632A1/en not active Ceased
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111471685A (zh) * | 2020-05-21 | 2020-07-31 | 中国科学院水生生物研究所 | Stentor RNA interference expression vector, construction method and application thereof |
Also Published As
Publication number | Publication date |
---|---|
JP2008503202A (ja) | 2008-02-07 |
US20060014257A1 (en) | 2006-01-19 |
RU2306338C2 (ru) | 2007-09-20 |
WO2006001514A1 (en) | 2006-01-05 |
RU2004119027A (ru) | 2006-01-10 |
EP1761632A1 (en) | 2007-03-14 |
BRPI0512143A (pt) | 2008-02-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
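CN1973043A (zh) | RSF1010-derived Mob-deficient plasmid containing no antibiotic resistance gene, bacterium comprising the plasmid and method for producing useful metabolites | |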
Xu et al. | Identification of genes negatively regulated by Fis: Fis and RpoS comodulate growth-phase-dependent gene expression in Escherichia coli | |
Tsui et al. | Transcription of the mutL repair, miaA tRNA modification, hfq pleiotropic regulator, and hflA region protease genes of Escherichia coli K-12 from clustered Esigma32-specific promoters during heat shock | |
Shyamala et al. | Tandem chromosomal duplications: role of REP sequences in the recombination event at the join‐point. | |
Walter et al. | Genetic characterization of the pdu operon: use of 1, 2-propanediol in Salmonella typhimurium | |
Neidle et al. | 5-Aminolevulinic acid availability and control of spectral complex formation in HemA and HemT mutants of Rhodobacter sphaeroides | |
Frías et al. | Activation of the Anabaena nir operon promoter requires both NtcA (CAP family) and NtcB (LysR family) transcription factors | |
Petruschka et al. | The cyo operon of Pseudomonas putida is involved in carbon catabolite repression of phenol degradation | |
Thompson et al. | The control region of the F sex factor DNA transfer cistrons: physical mapping by deletion analysis | |
CN110741091A (zh) | 增加nadph的生物合成途径的基因组工程化 | |
Kim et al. | Single-base genome editing in Corynebacterium glutamicum with the help of negative selection by target-mismatched CRISPR/Cpf1 | |
PL148391B1 (en) | Method of obtaining genetically modified microorganisms by introduction of gdh gene | |
WO2021224152A1 (en) | Improving expression in fermentation processes | |
US20020128457A1 (en) | Vectors, cells and processes for pyrimidine deoxyribonucleosides production | |
Chistoserdova et al. | Cloning, mutagenesis, and physiological effect of a hydroxypyruvate reductase gene from Methylobacterium extorquens AM1 | |
Da Costa et al. | Mutations that render the promoter of the histidine operon of Salmonella typhimurium insensitive to nutrient-rich medium repression and amino acid downshift | |
Srinivas et al. | Escherichia coli vectors having stringently repressible replication origins allow a streamlining of Crispr/Cas9 gene editing | |
Schweizer et al. | Cloning and nucleotide sequence of the glpD gene encoding sn-glycerol-3-phosphate dehydrogenase of Pseudomonas aeruginosa | |
US8927254B2 (en) | Pyrococcus furiosus strains and methods of using same | |
US20070184520A1 (en) | Reduction of spontaneous mutation rates in cells | |
Vartak et al. | A functional leuABCD operon is required for leucine synthesis by the tyrosine-repressible transaminase in Escherichia coli K-12 | |
Maseda et al. | Development of expression vectors for Thermus thermophilus | |
JP2000325094A (ja) | 制限酵素及びその遺伝子 | |
US20150072898A1 (en) | Broad Host Range Expression Vector for Diverse Prokaryotes | |
Mata-Gilsinger et al. | Physical mapping of the exuT and uxaC operators by use of exu plasmids and generation of deletion mutants in vitro |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
AD01 | Patent right deemed abandoned | Effective date of abandoning: 20070530 |
C20 | Patent right or utility model deemed to be abandoned or is abandoned | |