CN109486803A - It is engineered phenylalanine lyase polypeptide - Google Patents

It is engineered phenylalanine lyase polypeptide Download PDF

Info

Publication number
CN109486803A
CN109486803A CN201811343013.3A CN201811343013A CN109486803A CN 109486803 A CN109486803 A CN 109486803A CN 201811343013 A CN201811343013 A CN 201811343013A CN 109486803 A CN109486803 A CN 109486803A
Authority
CN
China
Prior art keywords
pal
polypeptide
amino acid
seq
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811343013.3A
Other languages
Chinese (zh)
Other versions
CN109486803B (en
Inventor
吉伽特·W·哈思曼
尼古拉斯·J·阿加德
本杰明·米杰茨
乔纳森·弗罗姆
张希云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Codexis Inc
Original Assignee
Codexis Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Codexis Inc filed Critical Codexis Inc
Publication of CN109486803A publication Critical patent/CN109486803A/en
Application granted granted Critical
Publication of CN109486803B publication Critical patent/CN109486803B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P43/00Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y403/00Carbon-nitrogen lyases (4.3)
    • C12Y403/01Ammonia-lyases (4.3.1)
    • C12Y403/01005Phenylalanine ammonia-lyase (4.3.1.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y403/00Carbon-nitrogen lyases (4.3)
    • C12Y403/01Ammonia-lyases (4.3.1)
    • C12Y403/01024Phenylalanine ammonia-lyase (4.3.1.24)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Animal Behavior & Ethology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Neurosurgery (AREA)
  • Obesity (AREA)
  • Hematology (AREA)
  • Diabetes (AREA)
  • Neurology (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Peptides Or Proteins (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Immunology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)

Abstract

This application involves engineering phenylalanine lyase polypeptides.Particularly, this application provides engineering phenylalanine lyase (phenylalanine ammonia-lyase, PAL) polypeptides and combinations thereof, and the polynucleotides of coding engineering phenylalanine lyase (PAL) polypeptide.In some embodiments, engineering PLA polypeptide is optimized to the sensibility and the increased tolerance to acid pH level to proteolysis for providing the catalytic activity of enhancing and reducing.In some embodiments, engineering PLA polypeptide is deimmunized (deimmunized).The application further relates to be used to treat the purposes with industrial purpose comprising the composition of engineering PAL polypeptide.

Description

It is engineered phenylalanine lyase polypeptide
It is on April 17th, 2014 that the application, which is the applying date, application No. is 201480035201.3, entitled " engineering The divisional application of the application of change phenylalanine lyase polypeptide ".
Cross reference to related applications
The U.S. Provisional Patent Application Serial number 61/813,586 and 2013 year submitted this application claims on April 18th, 2013 The priority for the U.S. Provisional Patent Application Serial number 61/897,932 submitted October 31, both by reference with its entirety It is incorporated into for whole purposes.
Reference to " sequence table ", table or computer program list attachment submitted as ASCII text file
Be written file CX7-131WO2_ST25.TXT in, date created be on April 15th, 2014,127,412 bytes, The sequence table of machine format IBM-PC, MS-Windows operating system is incorporated herein by reference.
Invention field
The present invention provides engineering phenylalanine lyase (PAL) polypeptides and combinations thereof, and coding engineering benzene The polynucleotides of alanine aminonialyase (PAL) polypeptide.In some embodiments, engineering PAL polypeptide is optimized to provide increasing Strong catalytic activity, and the sensibility and the increased tolerance to acid pH level to proteolysis reduced.Some In embodiment, engineering PAL polypeptide is deimmunized.The invention further relates to the compositions comprising being engineered PAL polypeptide to be used for The purposes for the treatment of and industrial purpose.
Background of invention
Phenylalanine lyase (PAL) is fragrance together with histidine aminonialyase (HAL) and tyrosine ammonia lyase (TAL) The member of amino acid-cleaving enzymes family, race (EC 4.3.1.23-1.25 and 4.3.1.3).More specifically, having the active enzyme of PAL The nonoxidative deamination base of (EC 4.3.1.23-1.25 and previous class be EC4.3.1.5) catalysis L-phenylalanine is (E)-meat Cinnamic acid.PAL is nonmammalian enzyme, is distributed widely in plant and is accredited also in fungi and a limited number of bacterium Out.PAL enzyme is used as the human cytokines for the treatment of metabolic disorder phenylketonuria (PKU).PKU is autosome generation Thank to genetic disorder, wherein the synthesis or recycling of liver enzyme phenylalanine hydroxylase (PAH) or participation co-factor tetrahydrobiopterin One or more of enzymes, the nonfunctional due to the mutation in one of corresponding gene.The shortage of this function leads to blood flow In high-caliber phenylalanine.Phenylalanine is converted into propiophenone hydrochlorate (phenyl ketone) and other derivatives.In people, if PKU is not and early treatment, the part of high-caliber phenylalanine and its decomposition product can cause significant medical problem, and the medicine is asked Topic includes dysnoesia, microcephalus and epileptic attack.Many researchs all concentrate on being used to PAL by enzymes extraction treat PKU (Ambrus et al., Science 201:837-839 [1978];Bourget et al., Appl.Biochem.Biotechnol., 10:57-59[1984];With Sarkissian et al., Proc.Natl.Acad.Sci.USA 96:2339-2344 [1999]).
A kind of method for removing phenylalanine in blood flow is the recombination PAL using injectable and passes through pegylation PAL variant (PEG-PAL).Pegylation have been demonstrated improve enzyme half-life period and reduce subject's antigen-reactive (referring to For example, WO 2008/153776, WO 2011/097335 and U.S. Patent number 7,531,341).Have in PEG-PAL composition PAL variant has described as the dotted nostoc of wild type (Nostoc punctiforme) (NpPAL);Anabaena variabilis (Anabaena variabilis) (AvPAL) and circle rhodosporidium toruloides (Rhodosporidium toruloides) (RtPAL) Variant.Particularly, it has been described that the variant of wild type AvPAL, wherein the cysteine at position 64,318,503 and 565 Residue Serine is (see, for example, U.S. Patent number: 7,790,433;7,560,263;With 7,537,923).
Alternative solution right and wrong of the PAL application as the method for the plasma concentration for reducing the L-phenylalanine in PKU subject Invasive preparation, such as oral preparation (Sarkissian et al., Proc.Natl.Acad.Sci.USA 96:2339-2344 [1999]).The key advantages of oral delivery PAL are the exposures of reduction of the enzyme to immune system, use injectable to minimize The immune response that PEG-PAL is observed.However, the major limitation of the oral preparation about PAL is enzymatic activity in stomach and enteric cavity Loss.In order to effective and practical, PAL must be resistant to the degradation of acid pH and protease, and the protease is such as usually degraded Albumen food is trypsase, chymotrypsin, carboxypeptidase and the pepsin of oligopeptides and amino acid.It pervious is ground some In studying carefully (Sarkissian, ibid), partially due to passing through the enzyme of protease degradation and partially due to relatively low at pH 7.0 Specific activity needs a large amount of enzyme to realize the active effects that PAL is administered orally.A variety of methods have been developed to inhibit to digest PAL degradation (Kim et al., Molec.Therap., 10:220-224 [2004] afterwards;With Shah et al., Int.J.Pharmaceut,356:61-68[2008])。
A kind of method for improving effect of the PAL under gastral harsh conditions, which is to provide, is resistant to intrinsic harsh conditions It is engineered PAL polypeptide.The surface lysines of the direct mutagenesis and AvPAL of Kang et al. use chymotrypsin cleavage sites Pegylation is to reduce proteolytic inactivation (referring to Kang et al., Mol.Gen.Metabol., 99:4-9 [2010]).? In these researchs, ten cracking sites of specific mutations, and these gained mutant in addition to two (F18A and R94G) with Outer all losses are more than 50% original enzymatic activity.There is no mutant to show increased activity, and F18A mutant Show the slight increase (Kang et al., ibid) of trypsin resistance.With the other research of PAL, although effectively, usually Without result in the enzyme of longer survival.Therefore, the oral administration of previously described PAL mutant and its derivative not will lead to effectively Treat PKU.
Although the several formulations of PAL achieve progress, still to the PAL polypeptide for oral administration with improved characteristic There are needs.These improved characteristics include but is not limited to longer half-life period, increased catalytic activity, improved to alimentary canal The stability of conditional and the aggregation of reduction.
Other than treatment use, PAL enzyme can also be used in commercial synthesis L-phenylalanine and other L- phenylpropyl alcohol ammonia replaced Acid derivative.Then, these derivatives can be used as prodrug (Gloge et al., 6:3386-3390 [2000];Bartsch etc. People, Prot.Eng.Des.Sel., 23:929-933 [2010];And Turner, Curr.Opin.Chem.Biol., 234-240 [2011])。
PAL enzyme can also be used in agriculture application.Phenylpropanoid Glycosides (phenylpropanoid) of the PAL in plant, fungi and bacterium Play an important role in (such as flavonoids and lignin) biosynthesis, and may be used as defence relevant enzyme (Bate et al., Proc.Natl.Acad.Sci.USA 91:7608-7612[1994]).By using the PAL with the active recombinant polypeptide of PAL Active adjusting can potentially result in effective herbicide.
Summary of the invention
The present invention provides engineering phenylalanine lyase (PAL) polypeptides and combinations thereof, and coding engineering benzene The polynucleotides of alanine aminonialyase (PAL) polypeptide.In some embodiments, engineering PAL polypeptide is optimized to provide increasing Strong catalytic activity, and the sensibility and the increased tolerance to acid pH level to proteolysis reduced.Some In embodiment, engineering PAL polypeptide is deimmunized.The invention further relates to the compositions comprising being engineered PAL polypeptide to be used for The purposes for the treatment of and industrial purpose.In some embodiments, the present invention relates to the engineering phenylpropyl alcohol ammonia with improved characteristic Acidolysis adnosine deaminase (PAL) polypeptide and its bioactive fragment and analog, the improved characteristic are such as increased to acid pH Tolerance and/or the sensibility to proteolysis of reduction.
The present invention relates to change when having under substantially the same conditions with wild type PAL enzyme or with reference to compared with PAL polypeptide when Into characteristic engineering PAL polypeptide and its bioactive fragment and analog.The invention further relates to therapeutic and/or industrial The method of engineering PAL polypeptide and its bioactive fragment and analog is used in composition and is used for using such composition The method for the treatment of and/or industrial purpose.
In the first aspect, the present invention provides engineering phenylalanine lyase (PAL) polypeptides, wherein when in basic phase When measuring under conditions of, engineering phenylalanine lyase (PAL) polypeptide has compared with reference sequences selected from following Group improved characteristic: i) enhance catalytic activity, ii) reduce the sensibility to proteolysis, iii) it is increased to acid The tolerance of property pH, iv) reduction aggregation or i), ii), iii) or any combination iv).In some specific embodiments In, there are two types of improved characteristics for engineering PAL polypeptide tool.In other specific embodiments, improved characteristic is pair reduced The sensibility of proteolysis, and in other specific embodiments again, improved characteristic is the increased tolerance to acid pH Property.
In the second aspect, engineering PAL polypeptide includes albumen, and the albumen includes and SEQ ID NO:4 or its function The amino acid sequence identity of segment at least 85%, when the polypeptide optimal comparison of He Dangyu SEQ ID NO:4 correspond to it is following Amino acid residue difference at the position of position: X39;X91;X158;X180;X195;X243;X245;X256;X257;X270; X290;X307;X308;X326;X349;X364;X394;X399;X400;X404;X407;X443;X453;X459;X460; X463;X474;X522;X524;And X528.
In first some specific embodiment with the second aspect, most preferably compare when with the polypeptide of SEQ ID NO:4 Clock synchronization, engineering PAL polypeptide include at least one amino corresponded at one or more amino acid residue positions below Sour residue difference: A39;A91;Y158;S180;K195;T243;I245;A256;L257;N270;N290;H307;E308; I326;L349;L364;A394;S399;N400;P404;L407;F443;N453;Y459;T460;T463;N474;K522; T524;And P528.In some specific embodiments, engineering PAL polypeptide includes and the amino acid sequence comprising SEQ ID NO:4 At least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least 9 of the reference polypeptide of column A, at least ten, at least 15 and at least 20 amino acid residue differences.
In other specific embodiments of first and the second aspect, engineering PAL polypeptide includes and SEQ ID The amino acid sequence of NO:4 at least 90% (at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% and 99%) is same One property.In other specific embodiment again, engineering PAL polypeptide include with SEQ ID NO:4 at least 90% (at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% and amino acid sequence identity 99%), and include following It is one or more in displacement: A39V;A91V;Y158H;S180A;K195E;T243I/L;I245L;A256G;L257W/A; N270K;N290G;H307G/Q/M;E308Q;I326F;L349M;L364Q;A394V;S399N;N400K;P404A;L407V; F443H;N453G;Y459F;T460G;T463N;N474Q;K522Y/F/N;T524S;And P528L.
In other specific embodiments, engineering PAL polypeptide is originated from wild type Anabaena variabilis PAL (such as ATCC29413;NCBI albumen reference sequences YP_324488.1;SEQ ID NO:4).
In the third aspect, there is phenylalanine lyase (PAL) active engineered polypeptide packet by by the invention Containing the amino acid sequence having with SEQ ID NO:10 or the sequence identity of its function fragment at least 99%.
In the fourth aspect, there is phenylalanine lyase (PAL) active engineered polypeptide packet by by the invention Containing have with the sequence identity of SEQ ID NO:10 or its function fragment at least 95% and also include and SEQ ID NO:10 phase Than the amino of the amino acid residue difference at one, two, three, four, five or six or more amino acid position Acid sequence.
At the 5th aspect, the present invention provides polynucleotide sequence, the polynucleotide sequence coding is such as described herein Any one of engineering PAL polypeptide.
At the 6th aspect, the present invention provides pharmaceutical composition or industry group compound, described pharmaceutical composition or industry Composition includes to be engineered any one of PAL polypeptide as described herein.
In some embodiments, the present invention provides with phenylalanine lyase (PAL) active engineered polypeptide, The engineered polypeptide includes: a) amino acid sequence, and the amino acid sequence has and reference sequences SEQ ID NO:4 or its function The sequence identity of energy segment at least 85%;B) in one or more amino compared with SEQ ID NO:4 or its function fragment Amino acid residue difference at sour position;And c) engineered polypeptide shows selected from the following change compared with reference sequences Into characteristic: i) enhance catalytic activity, ii) reduce the sensibility to proteolysis, iii) it is increased to the resistance to of acid pH By property, iv) aggregation of reduction or i), ii), iii) or any combination iv).In some embodiments, when with SEQ ID When the amino acid sequence optimal comparison of NO:4, one or more amino acid positions are selected from: X39;X54;X59;X73;X91; X158;X112, X134, X180;X195;X240;X243;X245;X256;X257;X270;X290;X304, X305;X307; X308;X326;X349;X353;X364;X394;X399;X400;X404;X407;X443;X453;X459;X460;X463; X474;X509;X521;X522;X524;X528;X546;X564;And/or combination thereof.In some other embodiments, change Into characteristic be selected from reduce the sensibility and/or the increased tolerance to acid pH to proteolysis.In again other reality It applies in scheme, reference sequences are derived from the wild type PAL of Anabaena variabilis.In some other embodiments, reference sequences The amino acid residue of SEQ ID NO:4 corresponds to A39;T54;G59, S73;A91;Y158;S180;K195;A112;R134; Q240;T243;I245;A256;L257;N270;N290;Y304;R305;H307;E308;I326;L349;D353;L364; A394;S399;N400;P404;L407;F443;N453;Y459;T460;T463;N474;E509;Q521;K522;T524; P528;S546;And/or P564.In some embodiments, when the polypeptide optimal comparison with SEQ ID NO:4, with SEQ ID The amino acid residue difference that NO:4 is compared is one or more in following displacement: A39V;T54K;G59R;S73K; A112C;R134Q;A91V;Y158H;S180A;K195E;Q240R/W;T243I/L;I245L;A256G;L257W/A;N270K; N290G;Y304H;R305M;H307G/Q/M;E308Q;I326F;L349M;D353A/N;L364Q;A394V;S399N; N400K;P404A;L407V;F443H;N453G;Y459F;T460G;T463N;N474Q;E509L;Q521K/S;K522Y/F/ N;T524S;P528L;S546R;And P564G/L/M.In some other embodiments, engineered polypeptide has and reference sequence Arrange SEQ ID NO:4 at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or about 100% sequence identity.Some In other embodiments, engineered polypeptide have with reference sequences SEQ ID NO:4 at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% Sequence identity.In some other embodiments, engineered polypeptide has with reference sequences SEQ ID NO:4 at least about 90% sequence identity.In some other embodiments, engineered polypeptide has with reference sequences SEQ ID NO:4 extremely Few about 95% sequence identity.In some other embodiments, engineered polypeptide has with SEQ ID NO:4 at least about 90% sequence identity;With the amino acid residue difference at the H307 of position.In some other embodiments, engineering Polypeptide has the sequence identity with reference sequences SEQ ID NO:4 at least 90%.In some other embodiments, engineering Change polypeptide with the sequence identity with reference sequences SEQ ID NO:4 at least 95%.In some other embodiments, work Journey polypeptide has the sequence identity with SEQ ID NO:4 at least 90%;With the amino acid residue difference at the H307 of position. In some other embodiments, amino acid residue difference H307G/Q/M.In some other embodiments, amino Sour residue difference one or more combination selected from the following: A39;A91;Q240;A256;N290;Y304;R305;H307; D353A394;S399;P404;L407;Q521;K522;And T524.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Comprising have with SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about The amino acid sequence of the sequence identity of 99% or at least about l00%.In some embodiments, there is phenylalanine solution ammonia Enzyme (PAL) active engineered polypeptide includes to have with SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least about The amino acid sequence of 95% sequence identity.
In some other embodiments, having phenylalanine lyase (PAL) active engineered polypeptide includes tool There is the amino acid sequence of the sequence identity with SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least about 99%.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Comprising have with SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least about at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or at least The amino acid sequence of 100% sequence identity.In some embodiments, have phenylalanine lyase (PAL) active Engineered polypeptide includes with same with the sequence of SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least about 95% The amino acid sequence of one property.In some other embodiments, have phenylalanine lyase (PAL) active engineering more Peptide includes with the ammonia with SEQ ID NO:6,8,10,12 and/or 14 or the sequence identity of its function fragment at least about 99% Base acid sequence.In some embodiments, with phenylalanine lyase (PAL) active engineered polypeptide include have with The amino acid sequence of the sequence identity of SEQ ID NO:6,8,10,12 and/or 14 or its function fragment at least 95%.Some Include to have and SEQ ID NO with phenylalanine lyase (PAL) active engineered polypeptide in other embodiments: 6,8,10,12 and/or 14 or its function fragment at least 99% sequence identity amino acid sequence.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Comprising have with SEQ ID NO:4 or its function fragment at least about 90%, at least about 91%, at least about 92%, at least about 93%, At least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or at least about The amino acid sequence of the sequence identity of l00%, wherein the engineered polypeptide is deimmunized.The present invention also provides with benzene Alanine aminonialyase (PAL) active engineered polypeptide, the engineered polypeptide include have the function of with SEQ ID NO:4 or its Segment at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, The amino acid sequence of the sequence identity of at least 98%, at least 99% or l00%, wherein the engineered polypeptide is deimmunized. It in some embodiments, include to have and SEQ ID NO:4 with phenylalanine lyase (PAL) active engineered polypeptide Or the amino acid sequence of the sequence identity of its function fragment at least 95%, wherein the engineered polypeptide is deimmunized.One It is appointing in table 9-1 to 9-7 with phenylalanine lyase (PAL) active engineered polypeptide in a little other embodiments The variant PAL provided in one.In some embodiments, have phenylalanine lyase (PAL) active deimmunized Engineered polypeptide includes with the amino acid sequence at least 95% sequence identity of SEQ ID NO:6,8,10,12 and/or 14.
The present invention also provides polynucleotide sequence, the polynucleotide sequence coding is provided herein to have PAL activity Engineered polypeptide.In some embodiments, polynucleotide sequence is operably connected to control sequence.The present invention also mentions Carrier is supplied, the carrier includes at least one polynucleotides sequence that coding has the active at least one engineered polypeptide of PAL Column.The present invention also provides host cell, the host cell has the active engineering of PAL with coding as herein provided At least one polynucleotide sequence of polypeptide converts.
The present invention also provides the methods that engineering PAL polypeptide is generated in host cell, and the method includes being suitble to Condition of culture under culture comprising encode it is at least one engineering PAL polypeptide at least one polynucleotides host cell.This Invention additionally provides the method that engineering PAL polypeptide is generated in host cell, and the method includes in suitable condition of culture The host cell of polynucleotides of the lower culture comprising coding engineering PAL polypeptide.In some embodiments, this method further includes The recycling engineering PAL polypeptide from culture and/or host cell.
The present invention also provides composition, the composition includes at least one as herein provided with PAL activity Engineered polypeptide.In some embodiments, composition is pharmaceutical composition, and described pharmaceutical composition also includes pharmaceutically may be used Received carrier.The present invention also provides the purposes of these compositions.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Include: a) amino acid sequence, the amino acid sequence with with the active reference sequences of phenylalanine lyase (PAL) or Its function fragment at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, At least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, At least about 98%, at least about 99% or bigger sequence identity;B) polypeptide sequence, the polypeptide sequence include with phenylpropyl alcohol The active reference sequences of propylhomoserin aminonialyase (PAL) or its function fragment are compared at one or more amino acid positions at least One amino acid residue difference;And c) engineered polypeptide with have phenylalanine lyase (PAL) it is active refer to sequence Column are compared to showing improved characteristic selected from the following: the catalytic activity that i) enhances, ii) sensitivity to proteolysis that reduces Property, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), ii), iii), Iv any combination) or v).In some embodiments, reference sequences are protokaryon PAL, and in some other embodiments, Reference sequences are eukaryon PAL.In some embodiments, reference sequences are bacterial PAL (for example, Anabaena variabilis PAL), and In some other embodiments, it is people or other PAL.In some other embodiments, reference sequences are wild type sequence Arrange (for example, wild type Anabaena variabilis PAL), and in some optional embodiments, reference sequences be variant enzyme (for example, With the active engineered polypeptide of PAL).
In some embodiments, phenylalanine lyase (PAL) the active engineered polypeptide that has of the invention includes: A) amino acid sequence, the amino acid sequence have and reference sequences SEQ ID NO:4 or its function fragment at least about 85%, extremely Few about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, extremely Few about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or Bigger sequence identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:4 or its function fragment one At least one amino acid residue difference at a or more amino acid position;And c) engineered polypeptide and reference sequences SEQ ID NO:4 compare show improved characteristic selected from the following: i) enhance catalytic activity, ii) reduce to albumen water The sensibility of solution, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), Ii), iii), iv) or any combination v).
In some other embodiments, include with phenylalanine lyase (PAL) active engineered polypeptide: a) Amino acid sequence, the amino acid sequence have and reference sequences SEQ ID NO:4 or its function fragment at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, At least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger sequence identity;B) polypeptide sequence, institute State polypeptide sequence include compared with SEQ ID NO:4 or its function fragment at one or more amino acid positions at least one A amino acid residue difference;And c) engineered polypeptide is shown compared with reference sequences SEQ ID NO:4 selected from following Improved characteristic: i) enhance catalytic activity, ii) reduce the sensibility to proteolysis, iii) it is increased to acid pH Tolerance, iv) aggregation of reduction, the immunogenicity that v) reduces or i), ii), iii), iv) or any combination v).
In some embodiments, the engineered polypeptide with PAL activity (PAL) is included in following ammonia At least one displacement at one or more places of base acid position: 20,24,27,39,43,45,47,54,58,59,62,70, 73、80、82、91、94、98、104、105、110、112、115、117、118、119、121、123、124、125、126、127、 128、129、130、131、133、134、135、139、140、141、142、143、144、145、146、147、149、150、151、 153、154、156、157、158、159、172、174、175、176、177、178、180、187、191、195、199、205、206、 210、212、213、214、232、240、243、245、247、248、250、256、257、266、270、275、278、279、285、 286、289、290、292、304、305、307、308、309、319、321、326、331、332、334、349、353、355、364、 365、369、370、371、372、374、375、377、378、379、381、382、383、384、385、387、389、394、396、 399、400、403、404、407、417、418、425、431、432、433、434、435、436、437、438、439、443、446、 447、453、456、459、460、461、463、471、472、473、474、475、476、477、478、479、482、483、503、 507,509,521,522,524,525,528,538,546,547,551,558,560,564,565 and/or any combination thereof, Middle amino acid position is numbered with reference to SEQ ID NO:4.In some embodiments, the amino of reference sequences SEQ ID NO:4 Sour residue correspond to A39, T54, G59, S73, A91, Y158, S180, K195, A112, R134, Q240, T243, I245, A256, L257、N270、N290、Y304、R305、H307、E308、I326、L349、D353、L364、A394、S399、N400、P404、 L407, F443, N453, Y459, T460, T463, N474, E509, Q521, K522, T524, P528, S546 and/or P564.? Amino in some other embodiments, when the polypeptide optimal comparison with SEQ ID NO:4, compared with SEQ ID NO:4 Sour residue difference is one or more in following displacement: A39V, T54K, G59R, S73K, A112C, R134Q, A91V, Y158H、S180A、K195E、Q240R/W、T243I/L、I245L、A256G、L257W/A、N270K、N290G、Y304H、 R305M、H307G/Q/M、E308Q、I326F、L349M、D353A/N、L364Q、A394V、S399N、N400K、P404A、 L407V、F443H、N453G、Y459F、T460G、T463N、N474Q、E509L、Q521K/S、K522Y/F/N、T524S、 P528L, S546R and P564G/L/M.In some other embodiments, engineered polypeptide has with SEQ ID NO:4 at least About 90% sequence identity;With the amino acid residue difference at the H307 of position.In some embodiments, amino acid residue Difference is H307G/Q/M.In some still other embodiments, amino acid residue difference is selected from the following one or more A combination: A39, A91, Q240, A256, N290, Y304, R305, H307, D353, A394, S399, P404, L407, Q521, K522 and T524.In some other embodiments, the engineered polypeptide with PAL activity (PAL) changes Into characteristic be selected from reduce the sensibility and/or the increased tolerance to acid pH to proteolysis.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Include: a) amino acid sequence, the amino acid sequence have with reference sequences SEQ ID NO:10 or its function fragment at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or bigger sequence identity;B) polypeptide sequence, the polypeptide sequence include and SEQ ID NO:10 or its function fragment Compared at least one amino acid residue difference at one or more amino acid positions;And c) engineered polypeptide with Reference sequences SEQ ID NO:10 is compared and is shown improved characteristic selected from the following: i) catalytic activity of enhancing, ii) reduce Sensibility, iii to proteolysis) the increased tolerance to acid pH, iv) reduction aggregation, v) reduce immunogenicity, Or i), ii), iii), iv) or any combination v).
In some embodiments, the present invention also provides more with phenylalanine lyase (PAL) active engineering Peptide, the engineered polypeptide includes: a) amino acid sequence, the amino acid sequence have with reference sequences SEQ ID NO:10 or Its function fragment at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger sequence Column identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:10 or its function fragment at one or more At least one amino acid residue difference at multiple amino acid positions;And c) engineered polypeptide and reference sequences SEQ ID NO:10 compare show improved characteristic selected from the following: i) enhance catalytic activity, ii) reduce to the quick of proteolysis Perception, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), ii), iii), Iv any combination) or v).
In some embodiments, the present invention also provides more with phenylalanine lyase (PAL) active engineering Peptide, the engineered polypeptide includes: a) amino acid sequence, the amino acid sequence have with reference sequences SEQ ID NO:10 or The sequence identity of its function fragment at least 85%;B) polypeptide sequence, the polypeptide sequence include with SEQ ID NO:10 or its Function fragment is compared at least one amino acid residue difference at one or more amino acid positions;And c) the engineering Change polypeptide show improved characteristic selected from the following with reference sequences SEQ ID NO:10 compared with: i) enhancing catalytic activity, Ii the sensibility to proteolysis that) reduces, iii) the increased tolerance to acid pH, iv) reduction aggregation, v) reduce Immunogenicity or i), ii), iii), iv) or any combination v).
In some embodiments, the present invention also provides more with phenylalanine lyase (PAL) active engineering Peptide, the engineered polypeptide include with the amino acid sequence with the sequence identity of reference sequences SEQ ID NO:10 at least 85% Column, and at least one amino acid residue difference compared with SEQ ID NO:10, and the engineered polypeptide and SEQ ID NO: 10 compared to showing the improved characteristic of at least one selected from the following: the catalytic activity of enhancing, reduction to the quick of proteolysis It is perceptual, increased to the tolerance of acid pH, the aggregation of reduction, and/or reduced immunogenicity.
In some embodiments, the present invention also provides more with phenylalanine lyase (PAL) active engineering Peptide, wherein the amino acid residue difference compared with SEQ ID NO:10 is one or more in following displacement or displacement set It is a:
127E/V39A;I27E/V39A/R43L/V105C/A153R/L214E/P266H/L278D/C503Q;
I27E/V39A/R43L/L214E/A547D;
I27E/V39A/V105C/A112C/R134Q/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/V105C/A112C/R134Q/A153R/Q205T/L214E/P266H/L278D/C503Q/ A551D;
I27E/V39A/V105C/A112C/Q205T/P210C/P266H/C503Q/A547D;
I27E/V39A/V105C/A112C/Q205T/P266H/I285E/C503Q/A551D;
I27E/V39A/V105C/A112C/L214E/I285E/C503Q/A547D;
I27E/V39A/V105C/S131N/R134Q/Q205T/L214E/C503Q/A547D/C565N;
I27E/V39A/V105C/R134Q/A153R/P210C/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/Q205T/P210C/L278D/C503Q/A547D;
I27E/V39A/V105C/R134Q/Q205T/L214E;
I27E/V39A/V105C/R134Q/Q205T/L214E/A551D/C565N;
I27E/V39A/V105C/R134Q/Q205T/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/R134Q/P210C;I27E/V39A/V105C/R134Q/P210C/L214E;
I27E/V39A/V105C/R134Q/P210C/L214E/I285E/A547D;
I27E/V39A/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V39A/V105C/R134Q/L214E/L278D/A547D/A551D;
I27E/V39A/V105C/R134Q/L214E/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/P266H/C503Q;
I27E/V39A/V105C/R134Q/P266H/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/L278D/C503Q/C565N;
I27E/V39A/V105C/R134Q/L278D/I285E/C503Q;I27E/V39A/V105C/R134Q/L278D/ A551D;
I27E/V39A/V105C/R134Q/I285E/A547D/A551D;I27E/V39A/V105C/R134Q/C503Q/ A551D;
I27E/V39A/V105C/A153R/Q205T/L278D/C503Q/A547D/A551D;
I27E/V39A/V105C/A153R/L214E;I27E/V39A/V105C/A153R/I285E;
I27E/V39A/V105C/A153R/C503Q/A547D/C565N;I27E/V39A/V105C/A153R/A551D/ C565N;
I27E/V39A/V105C/Q205T/P210C/L214E/L278D/A547D;
I27E/V39A/V105C/Q205T/P210C/L278D/C503Q;
I27E/V39A/V105C/Q205T/P210C/L278D/A547D;
I27E/V39A/V105C/Q205T/L214E/L278D/C503Q/A547D;
I27E/V39A/V105C/Q205T/L278D/C503Q/A547D;
I27E/V39A/V105C/P210C/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/P210C/L214E/P266H/L278D;
I27E/V39A/V105C/L214E/P266H/C503Q/A547D/C565N;
I27E/V39A/V105C/L214E/L278D/L309P/C503Q/A547D/A551D;
I27E/V39A/V105C/L278D/C503Q/A547D/C565N;I27E/V39A/V105C/I285E/A547D;
I27E/V39A/V105C/C503Q/A551D;I27E/V39A/V105C/C503Q/A547D/A551D/C565N;
I27E/V39A/A112C/R134Q/Q205T/P210C/L214E/A551D/C565N;
I27E/V39A/A112C/R134Q/L214E/P266H/A551D;
I27E/V39A/A112C/R134Q/L214E/C503Q/A547D;I27E/V39A/A112C/R134Q/P266H/ I285E;
I27E/V39A/A112C/Q205T/L214E/P266H/C503Q/A551D/C565N;
I27E/V39A/A112C/Q205T/L278D/I285E;I27E/V39A/A112C/L214E;
I27E/V39A/A112C/L214E/L278D/C503Q/A547D/A551D;I27E/V39A/A112C/I285E;
I27E/V39A/A112C/A547D;I27E/V39A/R134Q;
I27E/V39A/R134Q/A153R/Q205T/L214E/P266H/C503Q;
I27E/V39A/R134Q/A153R/P210C/L214E/L278D/I285E/A547D/C565N;
I27E/V39A/R134Q/A153R/L214E/P266H/L278D/C503Q/A547D/C565N;
I27E/V39A/R134Q/A153G/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/A153R/L214E/C503Q/A547D;I27E/V39A/R134Q/A153R/L278D;
I27E/V39A/R134Q/A153R/L278D/A547D/A551D;I27E/V39A/R134Q/A153R/A547D;
I27E/V39A/R134Q/Q205T/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/Q205T/P266H/C503Q/A551D/C565N;
I27E/V39A/R134Q/P210C/L214E/C503Q;I27E/V39A/R134Q/P210C/C503Q/A551D;
I27E/V39A/R134Q/L214E/P266H/A551D;
I27E/V39A/R134Q/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/R134Q/L278D/C503Q/A547D;I27E/V39A/R134Q/C503Q/A547D;
I27E/V39A/R134Q/A547D/C565N;I27E/V39A/Q205T/L214E/C503Q/C565N;
I27E/V39A/Q205T/P266H/I285E/A547D/A551D/C565N;I27E/V39A/Q205T/P266H/ A551D;
I27E/V39A/Q205T/L278D/C503Q/A551D/C565N;I27E/V39A/Q205T/L278D/C503Q/ C565N;
I27E/V39A/Q205T/C503Q/A547D/C565N;I27E/V39A/P210C/T212S;
I27E/V39A/P210C/L214E/L278D/C503Q/A551D;I27E/V39A/P210C/L214E/I285E/ C503Q/A551D;
I27E/V39A/P210C/P266H/I285E/C503Q/A547D;I27E/V39A/P210C/P266H/C503Q/ A551D;
I27E/V39A/L214E;I27E/V39A/L214E/P266H/L278D/C503Q/A547D/A551D/C565N;
127E/V39A/L214E/L278D/C503Q;I27E/V39A/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/L214E/C503Q/A551D;I27E/V39A/P266H;I27E/V39A/P266H/L278D;
I27E/V39A/L278D;I27E/V39A/L278D/A547D;I27E/V39A/L278D/I285E/C503Q/ A547D;
I27E/V39A/L278D/C503Q/C565N;I27E/V39A/C503Q;I27E/G45D/Q205T/P266H/ C565N;
I27E/V105C;I27E/V105C/R134Q/A153R/P210C/L214E/C503Q/A547D;
I27E/V105C/R134Q/A153R/I285E/A547D;I27E/V105C/R134Q/A153R/C503Q;
I27E/V105C/R134Q/Q205T/P210C/C503Q;
I27E/V105C/R134Q/Q205T/L214E/P266H/L278D/C503Q/C565N;
I27E/V105C/Q205T/P266H/C503Q;
I27E/V105C/R134Q/P210C/L214E/P266H/L278D/A551D/C565N;
I27E/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V105C/R134Q/P210C/P266H/L278D/I285E/C503Q/A551D/C565N;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D/A551D/C565N;I27E/V105C/ Q205T;
I27E/V105C/Q205T/L214E/P266H;I27E/V105C/Q205T/L214E/P266H/A551D/ C565N;
I27E/V105C/Q205T/L214E/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/Q205T/C503Q/A547D/A551D/C565N;I27E/V105C/L214E;
I27E/V105C/L214E/P266H/C503Q;I27E/V105C/L214E/I285E/A551D/C565N;
I27E/V105C/L214E/A547D/A551D/C565N;I27E/V105C/L214E/A551D/C565N;
I27E/V105C/P266H;I27E/V105C/P266H/I285E/C503Q/A547D/C565N;
I27E/V105C/L278D/A547D;I27E/V105C/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/C503Q/A547D/C565N;I27E/V105C/C503Q/A547D/A551D/C565N;
I27E/A112C/R134Q/A153R/L214E/P266H/C503Q;
I27E/A112C/R134Q/L278D/I285E/C503Q/A551D/C565N;
I27E/A112C/R134Q/Q205T/L278D/C503Q;I27E/A112C/R134Q/Q205T/I285E/ C503Q;
I27E/A112C/Q205T/P266H/L278D/I285E/C503Q;I27E/A112C/P210C/L214E/ C503Q/A547D;
I27E/R134Q;I27E/R134Q/A153R/I285E/C503Q/A547D;
I27E/R134Q/Q205T/I285E/C503Q/A551D;I27E/R134Q/Q205T/P266H/L278D/ A547D;
I27E/R134Q/P210C;I27E/R134Q/L214E/C503Q;I27E/R134Q/L214E/C503Q/A547D;
I27E/R134Q/L214E/C503Q/A547D/A551D;I27E/R134Q/L214E/C503Q/C565N;
I27E/R134Q/L278D/I285E/A551D/C565N;I27E/R134Q/I285E/C503Q;
I27E/A153R/L214E/L278D/I285E/A551D/C565N;I27E/A153R/L214E/L278D/ A551D;
I27E/Q205T;I27E/Q205T/L214E/L278D/I285E/C503Q/C565N;
I27E/Q205T/L214E/C503Q/A547D/C565N;I27E/Q205T/P266H/L278D/I285E/ A551D/C565N;
I27E/Q205T/L278D/A551D;I27E/P210C;I27E/P210C/L214E/C503Q/A547D;
I27E/P210C/L278D/C503Q;I27E/P210C/C503Q;I27E/P210C/C503Q/C565N;I27E/ P210C/A551D;
I27E/L214E;I27E/L214E/P266H/L278D/I285E/A551D;I27E/L214E/L278D;
I27E/L214E/L278D/C503Q;I27E/L214E/C503Q;I27E/L214E/C503Q/A547D;
I27E/L214E/C503Q/A547D/C565N;I27E/L214E/A551D;I27E/P266H/L278D/C503Q;
I27E/P266H/A547D/A551D;I27E/L278D/C503Q/A551D;I27E/L278D/C503Q/A551D/ C565N;
I27E/A547D/C565N;V39A/G45S/L278D/C503Q/A551D;
V39A/V105C/R134Q/A153R/Q205T/A551D;V39A/V105C/R134Q/P210C/L214E/ A551D;
V39A/V105C/R134Q/L214E/C503Q/A547D/A551D;V39A/V105C/A153R/P266H/ A547D/A551D;
V39A/V105C/Q205T/C503Q;V39A/V105C/Q205T/A551D;V39A/V105C/P210C/A547D;
V39A/V105C/L214E/P266H/A547D/C565N;V39A/V105C/L214E/I285E/C503Q/ A551D/C565N;
V39A/A112C/R134Q/Q205T/L214E/L278D;V39A/A112C/R134Q/L214E/C503Q/ A547D/A551D;
V39A/A112C/A153R/Q205T/L278D/C503Q/A547D;V39A/R134Q;
V39A/R134Q/Q205T/L214E/C503Q/C565N;V39A/R134Q/P210C/L214E/A547D/ C565N;
V39A/A153R/C503Q/A547D;V39A/Q205T/L278D/A547D/A551D;
V39A/P210C/L214E/L278D/I285E/C503Q/A551D;V39A/P266H;
V39A/P275R/L278D/C503Q/A551D;V39A/C503Q;V39A/C503Q/A551D/C565N;V105C;
V105C/A112C/R134Q/Q205T/L214E/Y492H/C503Q/A547D;
V105C/R134Q/A153R/Q205T/L214E/C503Q;V105C/R134Q/Q205T/L214E/A547D;
V105C/R134Q/Q205T/P266H/L278D;V105C/R134Q/L214E/P266H/I285E/C503Q/ A551D/C565N;
V105C/R134Q/L214E/L278D/C565N;V105C/R134Q/L214E/C503Q/A547D;
V105C/R134Q/L214E/C503Q/A547D/A551D;V105C/R134Q/C503Q;
V105C/R134Q/C503Q/A547D;V105C/R134Q/C503Q/A547D/C565N;
V105C/A153R/Q205T/L214E/P266H/C503Q/A547D;
V105C/A153R/Q205T/P266H/I285E/A547D/C565N;V105C/Q205T/P210C/L214E/ C503Q/A547D;
V105C/Q205T/L214E/L278D;V105C/Q205T/L214E/C503Q/A547D/A551D/C565N;
V105C/Q205T/C503Q/A551D;V105C/L214E/P266H/L278D/A547D;
V105C/L214E/L278D/C503Q/A547D/A551D;V105C/L214E/I285E;
V105C/L214E/I285E/C503Q/A547D/A551D/C565N;V105C/L214E/I285E/A547D/ C565N;
V105C/L278D/C503Q/A551D;V105C/I285E;V105C/I285E/A547D;V105C/C503Q;
V105C/A547D/A551D;A112C/R134Q/A153R/L214E/L278D/I285E/C503Q/A547D/ A551D/C565N;
A112C/R134Q/L214E/C503Q/A547D/A551D/C565N;A112C/L214E/L278D;
A112C/L278D/C503Q/A547D;R134Q/Q205T/L214E/I285E/C503Q/A551D/C565N;
R134Q/Q205T/C503Q;R134Q/P210C/L214E/L278D/C503Q/A547D/C565N;
R134Q/P210C/L214E/C503Q/A547D/A551D;R134Q/L214E;R134Q/L214E/L278D/ C503Q;
R134Q/L214E/L278D/C503Q/A551D;R134Q/L214E/I285E/C503Q;R134Q/C503Q;
R134Q/C503Q/A547D/A551D;A153R;Q205T/L214E/I285E/C503Q/A551D;
Q205T/L214E/I285E/C503Q/C565N;Q205T/L214E/C503Q/A547D/C565N;
Q205T/L278D/I285E/A547D/A551D;P210C/L214E;P210C/L214E/P266H;L214E/ P266H;
L214E/P266H/C503Q/A547D/A551D/C565N;L214E/C503Q/A547D;L214E/A547D;
P266H/L278D/C503Q;P266H/C565N;L278D/A547D;C503Q;C503Q/A547D;
C503Q/A547D/A551D/C565N;C503Q/A547D/C565N;C503Q/A551D;C503Q/A551D/ C565N;
A547D;And/or C565N.
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: V80I/R134C/P564Q;V121C;A123G;A124G;M125L;L126I/T;L126M/R134L;L127A; A129G/L;N130Q;N130C/M370I R134W;M133R;R134I;R134N/G307C;G135C/S;And/or G135A/ A394E。
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: G20S/I144L;R43S;L47M/I144L;L47M/R146E;L47M/M147G/A383E;L47M/P157C; Q58H/L143V;Q58K/P157D/G369C;A62S/M147V;S82I/G135C/P157F/W279L;R94C/I149E; T110I/I139R;L118M/L141H;A119E/T156H/A289D;I139M/V;R140D/G/M;R140N/A199E; R140E/A334S/A551D;L141K/Q/P/T;E142H/P/V;E142D/G371D;L143F/M;I144L/N/V;K145N/ Q/R;K145G/P157T;R146H/L;R146W/D191Y;M147A;I149L/R;F150K/L/M;L151M;A153C/G; A153S/H250N;G154R;G154Y/L174M/Q321K/S456I/G483C;T156K/G483C;P157D/F/H/Y; Y158E;V159C/H/L/M;M247I;L319M;And/or Q389K.
In still some other embodiments, the present invention also provides with phenylalanine lyase (PAL) activity Engineered polypeptide, wherein amino acid residue difference compared with SEQ ID NO:10 is in following displacement or displacement set It is one or more: P117T/Y176Q;V172I/C/L;L174M;S175G;Y176E/I/M/R/V;I177M/V;T178L/ A477S;And/or S180C/T.
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: R43S/H374K;R43S/H374R;A112S/M370A/A507E;M147I/H374S;S187R/L381V; D191Y/H385N;A232S;Q240K/H374R;A256S/L381N;P275Q/M370S;P275T/H374R;Q332K/ Y377M;A334S/H374V;L349M;Q355K/H374S;M370G/I/S;G371H/N/Q/S;M372A/V;H374A/D/G/ L/N/R/S/T;H374Q/P396Q;H374R/G417C;L375I;L375M;Y377C/I/N;Y378C/D/E/I/L/N/S; Y378F/P404Q;I379C/H/L/M/N;L381G/V;L381M/Q560K;L382C/H/I/M/S;A383S/V;K384R; H385C/G/N;H385M/P403H;H385S/P403H;D387S;L418M;G425V;A447S;S461G;And/or S525L.
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: A24S/F434M;A62S/T433N;S98I;L213M/S438L;Q240K/T433Y;S286R/Y435T; A289S/L431E;S331I;L431C/E/G/P/S/V;L432C/V;T433A/I/L/N/P/Q/R/S/V/W;F434C; Y435L;Y435Q/H446N;G436M;G436D/T;N437E/G/Q;N437T/L538M;S438C/F/M/R/T;I439C/F/ L/V;And/or A477S.
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: A24E;Q58R/Y475H;A70S/N474E;L104M/V476L;A119E/G365A;L206M;P275Q; G276V;Q292H/A479G;Q355H/I478C;P404T/A477V;I471F/G/K/M/N/R/V/W;F472G;Q473H/K/ M/R/S;Q473H/A507S;N474A/H/R/W;N474D/R490H;Y475C/F/L/Q;V476C/I/L;I478N/S; A479G/S;F482C/L;G483C/H/S;G483A/S524I;G483R/G537C;And/or A558S.
In some other embodiments, the present invention also provides with phenylalanine lyase (PAL) active work Journey polypeptide, wherein one in set is replaced or replaced to the amino acid residue difference compared with SEQ ID NO:10 selected from following Or more: V39A/K115E/M133R/C565N;V39A/M133R/F472G/C503Q/C565N;V39A/M133R/ F472G/C565N;V39A/M133R/C503Q;V39A/M133R/C503Q/C565N;V39A/M147A/Y378E/C503Q/ C565N;V39A/M147A/Y378E/C565N;V39A/M147A/L381G/F472G/C503Q/C565N;V39A/M147A/ L381G/C503Q/C565N;V39A/M147A/F472G/C503Q/C565N;V39A/M147A/F472G/C565N;V39A/ M147A/C565N;V39A/G248C/L381G/F472G/C503Q/C565N;V39A/Y378E/C503Q/C565N;V39A/ Y378E/C565N;V39A/L381G;V39A/F472G/C503Q/C565N;V39A/C503Q/C565N;M133R/L381G/ C565N;M133R/C503Q;Y378D/C503Q;Y378E/F472G/C503Q/C565N;L381G/F472GC503Q/C565N; And/or F472G/C503Q/C565N.
In still some other embodiments, the present invention also provides with phenylalanine lyase (PAL) activity Engineered polypeptide, wherein amino acid residue difference compared with SEQ ID NO:10 is in following displacement or displacement set It is one or more:
I27E/V39A;I27E/V39A//R43L/V105C/A153R/L214E/P266H/L278D/C503Q;
I27E/V39A/R43L/L214E/A547D;
I27E/V39A/V105C/A112C/R134Q/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/V105C/A112C/R134Q/A153R/Q205T/L214E/P266H/L278D/C503Q/ A551D;
I27E/V39A/V105C/A112C/Q205T/P210C/P266H/C503Q/A547D;
I27E/V39A/V105C/A112C/Q205T/P266H/I285E/C503Q/A551D;
I27E/V39A/V105C/A112C/L214E/I285E/C503Q/A547D;
I27E/V39A/V105C/S131N/R134Q/Q205T/L214E/C503Q/A547D/C565N;
I27E/V39A/V105C/R134Q/A153R/P210C/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/Q205T/P210C/L278D/C503Q/A547D;
I27E/V39A/V105C/R134Q/Q205T/L214E;
I27E/V39A/V105C/R134Q/Q205T/L214E/A551D/C565N;
I27E/V39A/V105C/R134Q/Q205T/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/R134Q/P210C;I27E/V39A/V105C/R134Q/P210C/L214E;
I27E/V39A/V105C/R134Q/P210C/L214E/I285E/A547D;
I27E/V39A/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V39A/V105C/R134Q/L214E/L278D/A547D/A551D;
I27E/V39A/V105C/R134Q/L214E/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/P266H/C503Q;
I27E/V39A/V105C/R134Q/P266H/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/L278D/C503Q/C565N;
I27E/V39A/V105C/R134Q/L278D/I285E/C503Q;I27E/V39A/V105C/R134Q/L278D/ A551D;
I27E/V39A/V105C/R134Q/I285E/A547D/A551D;I27E/V39A/V105C/R134Q/C503Q/ A551D;
I27E/V39A/V105C/A153R/Q205T/L278D/C503Q/A547D/A551D;
I27E/V39A/V105C/A153R/L214E;I27E/V39A/V105C/A153R/I285E;
I27E/V39A/V105C/A153R/C503Q/A547D/C565N;I27E/V39A/V105C/A153R/A551D/ C565N;
I27E/V39A/V105C/Q205T/P210C/L214E/L278D/A547D;
I27E/V39A/V105C/Q205T/P210C/L278D/C503Q;
I27E/V39A/V105C/Q205T/P210C/L278D/A547D;
I27E/V39A/V105C/Q205T/L214E/L278D/C503Q/A547D;
I27E/V39A/V105C/Q205T/L278D/C503Q/A547D;
I27E/V39A/V105C/P210C/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/P210C/L214E/P266H/L278D;
I27E/V39A/V105C/L214E/P266H/C503Q/A547D/C565N;
I27E/V39A/V105C/L214E/L278D/L309P/C503Q/A547D/A551D;
I27E/V39A/V105C/L278D/C503Q/A547D/C565N;I27E/V39A/V105C/I285E/A547D;
I27E/V39A/V105C/C503Q/A551D;I27E/V39A/V105C/C503Q/A547D/A551D/C565N;
I27E/V39A/A112C/R134Q/Q205T/P210C/L214E/A551D/C565N;
I27E/V39A/A112C/R134Q/L214E/P266H/A551D;
I27E/V39A/A112C/R134Q/L214E/C503Q/A547D;I27E/V39A/A112C/R134Q/P266H/ I285E;
I27E/V39A/A112C/Q205T/L214E/P266H/C503Q/A551D/C565N;
I27E/V39A/A112C/Q205T/L278D/I285E;I27E/V39A/A112C/L214E;
I27E/V39A/A112C/L214E/L278D/C503Q/A547D/A551D;I27E/V39A/A112C/I285E;
I27E/V39A/A112C/A547D;I27E/V39A/R134Q;
I27E/V39A/R134Q/A153R/Q205T/L214E/P266H/C503Q;
I27E/V39A/R134Q/A153R/P210C/L214E/L278D/I285E/A547D/C565N;
I27E/V39A/R134Q/A153R/L214E/P266H/L278D/C503Q/A547D/C565N;
I27E/V39A/R134Q/A153G/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/A153R/L214E/C503Q/A547D;I27E/V39A/R134Q/A153R/L278D;
I27E/V39A/R134Q/A153R/L278D/A547D/A551D;I27E/V39A/R134Q/A153R/A547D;
I27E/V39A/R134Q/Q205T/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/Q205T/P266H/C503Q/A551D/C565N;
I27E/V39A/R134Q/P210C/L214E/C503Q;I27E/V39A/R134Q/P210C/C503Q/A551D;
I27E/V39A/R134Q/L214E/P266H/A551D;
I27E/V39A/R134Q/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/R134Q/L278D/C503Q/A547D;I27E/V39A/R134Q/C503Q/A547D;
I27E/V39A/R134Q/A547D/C565N;I27E/V39A/Q205T/L214E/C503Q/C565N;
I27E/V39A/Q205T/P266H/I285E/A547D/A551D/C565N;I27E/V39A/Q205T/P266H/ A551D;
I27E/V39A/Q205T/L278D/C503Q/A551D/C565N;I27E/V39A/Q205T/L278D/C503Q/ C565N;
I27E/V39A/Q205T/C503Q/A547D/C565N;I27E/V39A/P210C/T212S;
I27E/V39A/P210C/L214E/L278D/C503Q/A551D;I27E/V39A/P210C/L214E/I285E/ C503Q/A551D;
I27E/V39A/P210C/P266H/I285E/C503Q/A547D;I27E/V39A/P210C/P266H/C503Q/ A551D;
I27E/V39A/L214E;I27E/V39A/L214E/P266H/L278D/C503Q/A547D/A551D/C565N;
I27E/V39A/L214E/L278D/C503Q;I27E/V39A/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/L214E/C503Q/A551D;I27E/V39A/P266H;I27E/V39A/P266H/L278D;
I27E/V39A/L278D;I27E/V39A/L278D/A547D;I27E/V39A/L278D/I285E/C503Q/ A547D;
I27E/V39A/L278D/C503Q/C565N;I27E/V39A/C503Q;I27E/G45D/Q205T/P266H/ C565N;
I27E/V105C;I27E/V105C/R134Q/A153R/P210C/L214E/C503Q/A547D;
I27E/V105C/R134Q/A153R/I285E/A547D;I27E/V105C/R134Q/A153R/C503Q;
I27E/V105C/R134Q/Q205T/P210C/C503Q;
I27E/V105C/R134Q/Q205T/L214E/P266H/L278D/C503Q/C565N;
I27E/V105C/Q205T/P266H/C503Q;
I27E/V105C/R134Q/P210C/L214E/P266H/L278D/A551D/C565N;
I27E/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V105C/R134Q/P210C/P266H/L278D/I285E/C503Q/A551D/C565N;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D/A551D/C565N;I27E/V105C/ Q205T;
I27E/V105C/Q205T/L214E/P266H;I27E/V105C/Q205T/L214E/P266H/A551D/ C565N;
I27E/V105C/Q205T/L214E/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/Q205T/C503Q/A547D/A551D/C565N;I27E/V105C/L214E;
I27E/V105C/L214E/P266H/C503Q;I27E/V105C/L214E/I285E/A551D/C565N;
I27E/V105C/L214E/A547D/A551D/C565N;I27E/V105C/L214E/A551D/C565N;
I27E/V105C/P266H;I27E/V105C/P266H/I285E/C503Q/A547D/C565N;
I27E/V105C/L278D/A547D;I27E/V105C/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/C503Q/A547D/C565N;I27E/V105C/C503Q/A547D/A551D/C565N;
I27E/A112C/R134Q/A153R/L214E/P266H/C503Q;
I27E/A112C/R134Q/L278D/I285E/C503Q/A551D/C565N;
I27E/A112C/R134Q/Q205T/L278D/C503Q;I27E/A112C/R134Q/Q205T/I285E/ C503Q;
I27E/A112C/Q205T/P266H/L278D/I285E/C503Q;I27E/A112C/P210C/L214E/ C503Q/A547D;
I27E/R134Q;I27E/R134Q/A153R/I285E/C503Q/A547D;
I27E/R134Q/Q205T/I285E/C503Q/A551D;I27E/R134Q/Q205T/P266H/L278D/ A547D;
I27E/R134Q/P210C;I27E/R134Q/L214E/C503Q;I27E/R134Q/L214E/C503Q/A547D;
I27E/R134Q/L214E/C503Q/A547D/A551D;I27E/R134Q/L214E/C503Q/C565N;
I27E/R134Q/L278D/I285E/A551D/C565N;I27E/R134Q/I285E/C503Q;
I27E/A153R/L214E/L278D/I285E/A551D/C565N;I27E/A153R/L214E/L278D/ A551D;
I27E/Q205T;I27E/Q205T/L214E/L278D/I285E/C503Q/C565N;
I27E/Q205T/L214E/C503Q/A547D/C565N;I27E/Q205T/P266H/L278D/I285E/ A551D/C565N;
I27E/Q205T/L278D/A551D;I27E/P210C;I27E/P210C/L214E/C503Q/A547D;
I27E/P210C/L278D/C503Q;I27E/P210C/C503Q;I27E/P210C/C503Q/C565N;I27E/ P210C/A551D;
I27E/L214E;I27E/L214E/P266H/L278D/I285E/A551D;I27E/L214E/L278D;
I27E/L214E/L278D/C503Q;I27E/L214E/C503Q;I27E/L214E/C503Q/A547D;
I27E/L214E/C503Q/A547D/C565N;I27E/L214E/A551D;I27E/P266H/L278D/C503Q;
I27E/P266H/A547D/A551D;I27E/L278D/C503Q/A551D;I27E/L278D/C503Q/A551D/ C565N;
I27E/A547D/C565N;V39A/G45S/L278D/C503Q/A551D;
V39A/V105C/R134Q/A153R/Q205T/A551D;V39A/V105C/R134Q/P210C/L214E/ A551D;
V39A/V105C/R134Q/L214E/C503Q/A547D/A551D;V39A/V105C/A153R/P266H/ A547D/A551D;
V39A/V105C/Q205T/C503Q;V39A/V105C/Q205T/A551D;V39A/V105C/P210C/A547D;
V39A/V105C/L214E/P266H/A547D/C565N;V39A/V105C/L214E/I285E/C503Q/ A551D/C565N;
V39A/A112C/R134Q/Q205T/L214E/L278D;V39A/A112C/R134Q/L214E/C503Q/ A547D/A551D;
V39A/A112C/A153R/Q205T/L278D/C503Q/A547D;V39A/R134Q;
V39A/R134Q/Q205T/L214E/C503Q/C565N;V39A/R134Q/P210C/L214E/A547D/ C565N;
V39A/A153R/C503Q/A547D;V39A/Q205T/L278D/A547D/A551D;
V39A/P210C/L214E/L278D/I285E/C503Q/A551D;V39A/P266H;
V39A/P275R/L278D/C503Q/A551D;V39A/C503Q;V39A/C503Q/A551D/C565N;V105C;
V105C/A112C/R134Q/Q205T/L214E/Y492H/C503Q/A547D;
V105C/R134Q/A153R/Q205T/L214E/C503Q;V105C/R134Q/Q205T/L214E/A547D;
V105C/R134Q/Q205T/P266H/L278D;V105C/R134Q/L214E/P266H/I285E/C503Q/ A551D/C565N;
V105C/R134Q/L214E/L278D/C565N;V105C/R134Q/L214E/C503Q/A547D;
V105C/R134Q/L214E/C503Q/A547D/A551D;V105C/R134Q/C503Q;
V105C/R134Q/C503Q/A547D;V105C/R134Q/C503Q/A547D/C565N;
V105C/A153R/Q205T/L214E/P266H/C503Q/A547D;
V105C/A153R/Q205T/P266H/I285E/A547D/C565N;V105C/Q205T/P210C/L214E/ C503Q/A547D;
V105C/Q205T/L214E/L278D;V105C/Q205T/L214E/C503Q/A547D/A551D/C565N;
V105C/Q205T/C503Q/A551D;V105C/L214E/P266H/L278D/A547D;
V105C/L214E/L278D/C503Q/A547D/A551D;V105C/L214E/I285E;
V105C/L214E/I285E/C503Q/A547D/A551D/C565N;V105C/L214E/I285E/A547D/ C565N;
V105C/L278D/C503Q/A551D;V105C/I285E;V105C/I285E/A547D;V105C/C503Q;
V105C/A547D/A551D;A112C/R134Q/A153R/L214E/L278D/I285E/C503Q/A547D/ A551D/C565N;
A112C/R134Q/L214E/C503Q/A547D/A551D/C565N;A112C/L214E/L278D;
A112C/L278D/C503Q/A547D;R134Q/Q205T/L214E/I285E/C503Q/A551D/C565N;
R134Q/Q205T/C503Q;R134Q/P210C/L214E/L278D/C503Q/A547D/C565N;
R134Q/P210C/L214E/C503Q/A547D/A551D;R134Q/L214E;R134Q/L214E/L278D/ C503Q;
R134Q/L214E/L278D/C503Q/A551D;R134Q/L214E/I285E/C503Q;R134Q/C503Q;
R134Q/C503Q/A547D/A551D;A153R;Q205T/L214E/I285E/C503Q/A551D;
Q205T/L214E/I285E/C503Q/C565N;Q205T/L214E/C503Q/A547D/C565N;
Q205T/L278D/I285E/A547D/A551D;P210C/L214E;P210C/L214E/P266H;L214E/ P266H;
L214E/P266H/C503Q/A547D/A551D/C565N;L214E/C503Q/A547D;L214E/A547D;
P266H/L278D/C503Q;P266H/C565N;L278D/A547D;C503Q;C503Q/A547D;
C503Q/A547D/A551D/C565N;C503Q/A547D/C565N;C503Q/A551D;C503Q/A551D/ C565N;
A547D;C565N;V80I/R134C/P564Q;V121C;A123G;A124G;M125L;L126I/T;L126M/ R134L;
L127A;A129G/L;N130Q;N130C/M370I R134W;M133R;R134I;R134N/G307C;G135C/ S;
G135A/A394E;G20S/I144L;R43S;L47M/I144L;L47M/R146E;L47M/M147G/A383E;
L47M/P157C;Q58H/L143V;Q58K/P157D/G369C;A62S/M147V;S82I/G135C/P157F/ W279L;
R94C/I149E;T110I/I139R;L118M/L141H;A119E/T156H/A289D;I139M/V;R140D/G/ M;
R140N/A199E;R140E/A334S/A551D;L141K/Q/P/T;E142H/P/V;E142D/G371D; L143F/M;
I144L/N/V;K145N/Q/R;K145G/P157T;R146H/L;R146W/D191Y;M147A;I149L/R; F150K/L/M;
L151M;A153C/G;A153S/H250N;G154R;G154Y/L174M/Q321K/S456I/G483C;T156K/ G483C;
P157D/F/H/Y;Y158E;V159C/H/L/M;M247I;L319M;Q389K;P117T/Y176Q;V172I/C/ L;
L174M;S175G;Y176E/I/M/R/V;I177M/V;T178L/A477S;S180C/T;R43S/H374K; R43S/H374R;
A112S/M370A/A507E;M147I/H374S;S187R/L381V;D191Y/H385N;A232S;Q240K/ H374R;
A256S/L381N;P275Q/M370S;P275T/H374R;Q332K/Y377M;A334S/H374V;L349M;
Q355K/H374S;M370G/I/S;G371H/N/Q/S;M372A/V;H374A/D/G/L/N/R/S/T;H374Q/ P396Q;
H374R/G417C;L375I;L375M;Y377C/I/N;Y378C/D/E/I/L/N/S;Y378F/P404Q; I379C/H/L/M/N;
L381G/V;L381M/Q560K;L382C/H/I/M/S;A383S/V;K384R;H385C/G/N;H385M/ P403H;
H385S/P403H;D387S;L418M;G425V;A447S;S461G;S525L;A24S/F434M;A62S/ T433N;S98I;
L213M/S438L;Q240K/T433Y;S286R/Y435T;A289S/L431E;S331I;L431C/E/G/P/S/ V;L432C/V;
T433A/I/L/N/P/Q/R/S/V/W;F434C;Y435L;Y435Q/H446N;G436M;G436D/T;N437E/ G/Q;
N437T/L538M;S438C/F/M/R/T;I439C/F/L/V;A477S;A24E;Q58R/Y475H;A70S/ N474E;
L104M/V476L;A119E/G365A;L206M;P275Q;G276V;Q292H/A479G;Q355H/I478C;
P404T/A477V;I471F/G/K/M/N/R/V/W;F472G;Q473H/K/M/R/S;Q473H/A507S; N474A/H/R/W;
N474D/R490H;Y475C/F/L/Q;V476C/I/L;I478N/S;A479G/S;
F482C/L;G483C/H/S;G483A/S524I;G483R/G537C;A558S;V39A/K115E/M133R/ C565N;
V39A/M133R/F472G/C503Q/C565N;V39A/M133R/F472G/C565N;V39A/M133R/C503Q;
V39A/M133R/C503Q/C565N;V39A/M147A/Y378E/C503Q/C565N;V39A/M147A/Y378E/ C565N;
V39A/M147A/L381G/F472G/C503Q/C565N;V39A/M147A/L381G/C503Q/C565N;
V39A/M147A/F472G/C503Q/C565N;V39A/M147A/F472G/C565N;V39A/M147A/C565N;
V39A/G248C/L381G/F472G/C503Q/C565N;V39A/Y378E/C503Q/C565N;V39A/Y378E/ C565N;
V39A/L381G;V39A/F472G/C503Q/C565N;V39A/C503Q/C565N;M133R/L381G/C565N;
M133R/C503Q;Y378D/C503Q;Y378E/F472G/C503Q/C565N;L381G/F472GC503Q/ C565N;
And/or F472G/C503Q/C565N.
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide Include: a) amino acid sequence, the amino acid sequence have with reference sequences SEQ ID NO:26 or its function fragment at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or bigger sequence identity;B) polypeptide sequence, the polypeptide sequence include and SEQ ID NO:26 or its function fragment Compared at least one amino acid residue difference at one or more amino acid positions;And c) engineered polypeptide with Reference sequences SEQ ID NO:26 is compared and is shown improved characteristic selected from the following: i) catalytic activity of enhancing, ii) reduce Sensibility, iii to proteolysis) the increased tolerance to acid pH, iv) reduction aggregation, v) reduce immunogenicity, Or i), ii), iii), iv) or any combination v).
In some embodiments, the present invention also provides more with phenylalanine lyase (PAL) active engineering Peptide, the engineered polypeptide includes: a) amino acid sequence, the amino acid sequence have with reference sequences SEQ ID NO:26 or Its function fragment at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger sequence Column identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:26 or its function fragment at one or more At least one amino acid residue difference at multiple amino acid positions;And c) engineered polypeptide and reference sequences SEQ ID NO:26 compare show improved characteristic selected from the following: i) enhance catalytic activity, ii) reduce to the quick of proteolysis Perception, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), ii), iii), Iv any combination) or v).
In some other embodiments, the present invention provides with phenylalanine lyase (PAL) active engineering Change polypeptide, the engineered polypeptide includes: a) amino acid sequence, and the amino acid sequence has and reference sequences SEQ ID NO: 26 or its function fragment at least 85% sequence identity;B) polypeptide sequence, the polypeptide sequence include and SEQ ID NO:26 Or its function fragment compares at least one amino acid residue difference in one or more amino acid positions;And c) the work Journey polypeptide shows improved characteristic selected from the following compared with reference sequences SEQ ID NO:26: i) catalysis enhanced is living Property, ii) reduce the sensibility to proteolysis, iii) the increased tolerance to acid pH, iv) reduction aggregation, v) drop Low immunogenicity or i), ii), iii), iv) or any combination v).In some embodiments, of the invention that there is benzene Alanine aminonialyase (PAL) active engineered polypeptide, comprising having the sequence with reference sequences SEQ ID NO:26 at least 85% The amino acid sequence of column identity, and at least one amino acid residue difference compared with SEQ ID NO:126, and the work Journey polypeptide shows the improved characteristic of at least one selected from the following compared with SEQ ID NO:26: the catalytic activity of enhancing, Reduce to the sensibility of proteolysis, increased to the tolerance of acid pH, the aggregation of reduction and/or reduced immunogene Property.In some embodiments of engineered polypeptide, the amino acid residue difference compared with SEQ ID NO:26 is selected from underlying It changes or replaces and is in set one or more: A24E/G381L;L127V;A129I/V;S131C/T;H132L/S;R134C/ F/H/K;R134H/Y378E/G381L;R134H/Y378E/G381L/V388T;R134H/V388T;A136K;A289S; M372L;H374G/M/Q;G381A/C/F/I/L/M/N/Q/S/T;A383C/M;V388C/T;L431M;And/or L563M.
In some embodiments, of the invention that there is phenylalanine lyase (PAL) active engineered polypeptide to have With reference sequences SEQ ID NO:4 at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, At least about 95%, at least about 96%, at least about 97%, at least about 98% or at least about 99% sequence identity.In some realities It applies in scheme, of the invention has and reference sequences SEQ ID with phenylalanine lyase (PAL) active engineered polypeptide The sequence identity of NO:4 at least about 90%, and in some other embodiments, engineered polypeptide has and reference sequences The sequence identity of SEQ ID NO:4 at least about 95%.In some embodiments, of the invention that there is phenylalanine lyase (PAL) active engineered polypeptide have with reference sequences SEQ ID NO:4 at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity.One In a little embodiments, of the invention has and reference sequences SEQ with phenylalanine lyase (PAL) active engineered polypeptide The sequence identity of ID NO:4 at least 90%, and in some other embodiments, engineered polypeptide has and reference sequences The sequence identity of SEQ ID NO:4 at least 95%.In some other embodiments, engineered polypeptide includes of the invention Function fragment with phenylalanine lyase (PAL) active polypeptide (for example, any variant provided in table herein).
In some embodiments, of the invention that there is phenylalanine lyase (PAL) active engineered polypeptide to have With SEQ ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26 at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98% or at least About 99% sequence identity.In some embodiments, of the invention that there is phenylalanine lyase (PAL) active engineering Change polypeptide have with SEQ ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26 at least 90%, at least 91%, extremely Few 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence Identity.In some embodiments, having phenylalanine lyase (PAL) active engineered polypeptide includes having and SEQ The amino acid sequence of at least about 90% sequence identity of ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26.One In a little embodiments, the engineered polypeptide of the enzymatic activity with phenylalanine lyase (PAL) includes having and SEQ ID NO: 6,8,10,12,14,16,18,20,22,24 and/or 26 or its function fragment at least about 99% sequence identity amino acid Sequence.In some other embodiments, engineered polypeptide includes of the invention with phenylalanine lyase (PAL) activity Polypeptide function fragment (for example, SEQ ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26, and herein The function fragment of any variant provided in table).
The present invention also provides with phenylalanine lyase (PAL) active engineered polypeptide, wherein the engineering is more Peptide is the variant PAL that any of table 2-1 to 2-5 and/or table 9-1 to 9-7 are provided.
In some embodiments, having phenylalanine lyase (PAL) active engineered polypeptide is Anabaena variabilis Enzyme.In some other embodiments, it is heat-staple for having phenylalanine lyase (PAL) active engineered polypeptide. In some embodiments, there is phenylalanine lyase (PAL) active engineered polypeptide tolerance protein hydrolysis.Some another In outer embodiment, there is phenylalanine lyase (PAL) active engineered polypeptide tolerance to pass through at least one alimentary canal The proteolysis of enzyme.In some other embodiments, have phenylalanine lyase (PAL) active engineered polypeptide resistance to By the proteolysis by chymotrypsin, trypsase, carboxypeptidase and/or elastoser.In some other implementation In scheme, having phenylalanine lyase (PAL) active engineered polypeptide is that acid is stable.
The present invention also provides deimmunized to have phenylalanine lyase (PAL) active engineered polypeptide.One In a little embodiments, deimmunized engineered polypeptide include have with SEQ ID NO:6,8,10,12,14,16,18,20, 22,24 and/or 26 at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or bigger sequence identity amino acid sequence Column.In some other embodiments, deimmunized engineered polypeptide include have with SEQ ID NO:6,8,10,12, 14,16,18,20,22,24 and/or 26 at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger sequence identity amino acid sequence.Some In embodiment, deimmunized engineered polypeptide include have with SEQ ID NO:6,8,10,12,14,16,18,20,22, The amino acid sequence of 24 and/or 26 at least 95% sequence identity.In some embodiments, deimmunized engineered polypeptide Comprising having the amino of 95% sequence identity with SEQ ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26 Acid sequence.
In still some other embodiments, the present invention provides the engineered polypeptide of purifying, the engineering is more Peptide has phenylalanine lyase (PAL) activity.
The present invention also provides polynucleotide sequence, at least one tool as described herein of the polynucleotide sequence coding There is the engineered polypeptide of phenylalanine lyase (PAL).In some embodiments, polynucleotide sequence is operably connected To control sequence.In some other embodiments, polynucleotide sequence is codon optimization.
The present invention also provides expression vector, the expression vector includes that coding at least one has as herein provided At least one polynucleotide sequence of phenylalanine lyase (PAL) active engineered polypeptide.In some embodiments, table It also include at least one control sequence up to carrier.In some embodiments, control sequence is promoter.In some other realities It applies in scheme, promoter is allogeneic promoter.
The present invention also provides host cell, host cell coding has phenylalanine lyase (PAL) activity At least one polynucleotide sequence of at least one engineered polypeptide convert, and/or turned at least one expression vector Change, the expression vector includes that coding has phenylalanine lyase (PAL) active at least one engineered polypeptide at least A kind of polynucleotide sequence and at least one control sequence.In some embodiments, host cell includes to have phenylalanine The active at least one engineered polypeptide of aminonialyase (PAL), the engineered polypeptide is codon optimization.In some embodiment party In case, host cell is Escherichia coli (E.coli).
The present invention also provides the method that at least one engineering PAL polypeptide is generated in host cell, the method packets It includes under suitable condition of culture, culture has phenylalanine lyase (PAL) active at least one engineering comprising coding At least one polynucleotides of polypeptide, and/or there is phenylalanine lyase (PAL) active at least one engineering comprising coding The host cell for changing at least one expression vector of at least one polynucleotide sequence of polypeptide, so that engineering PAL polypeptide It is generated.In some embodiments, this method further includes recycling to have phenylalanine solution from culture and/or host cell The step of at least one engineered polypeptide of adnosine deaminase (PAL).In some other embodiments, this method further includes purifying tool There is the step of at least one engineered polypeptide of phenylalanine lyase (PAL).
The present invention also provides composition, the composition includes at least one as herein provided with phenylalanine Aminonialyase (PAL) active engineered polypeptide.In some embodiments, composition is pharmaceutical composition.In some embodiment party In case, composition is diet and/or nutritional supplement.In some other embodiments, pharmaceutical composition also includes at least A kind of pharmaceutically acceptable excipient and/or carrier.In some other embodiments, composition is suitable for phenylketonuria The treatment of disease.In some other embodiments, composition is suitable for being administered orally to people.In some embodiments, group Object is closed in the form of pill, tablet, capsule, caplets (gelcap), liquid or emulsion.In other other embodiments In, pill, tablet, capsule or caplets also include enteric coating.In some other embodiments, pharmaceutical composition is suitable Together in parenteral injection into people.In some embodiments, pharmaceutical composition and at least one effective chemical combination of other treatment Object co-administers.In some other embodiments, pharmaceutical composition includes at least one other treatment active compounds.? In some other embodiments, pharmaceutical composition is present in diet and/or nutritional supplement.
The present invention also provides the method for treating and/or preventing the symptom of phenylketonuria in subject, the side Method includes: to provide the subject for suffering from phenylketonuria, and provide at least one composition provided in this article to subject. In some embodiments, composition includes pharmaceutical composition, and in some optional embodiments, composition includes meals Food/nutritional supplement.In some embodiments of this method, the symptom of phenylketonuria is enhanced.In some other realities It applies in scheme, treated subject can eat in terms of its methionine, phenylalanine and/or tyrosine content ratio by table Reveal the more unrestricted diet of diet required for the subject of phenylalanine symptom.In some embodiments, treated Subject is (that is, be provided has phenylalanine lyase (PAL) active as herein provided comprising at least one The subject of at least one composition of engineered polypeptide) it can eat and contain in its methionine, phenylalanine and/or tyrosine Amount aspect is more unrestricted than diet required for the subject as being also not provided at least one composition as herein provided Diet.In some embodiments, the composition provided to subject includes pharmaceutical composition, and in some optional implementations In scheme, composition includes diet/nutritional supplement.The present invention also provides treated subjects, and wherein the subject has been It is administered at least one composition and/or pharmaceutical composition, at least one composition and/or pharmaceutical composition include extremely It is few a kind of as herein provided with phenylalanine lyase (PAL) active engineered polypeptide.In some embodiments, Subject is animal selected from the following: primate, rodent and lagomorph.In some other embodiments, by Examination person is mouse.In some other embodiments, subject is people.It is tested in still some other embodiments Person is human infant or children, and in some optional embodiments, subject is adult humans or youth.I
The present invention also provides comprising at least one provided in this article with phenylalanine lyase (PAL) active work The purposes of the composition of journey polypeptide.
Brief description
Fig. 1 provides the comparison of wild type PAL protein sequence: Anabaena variabilis PAL (NCBI YP_324488.1 (SEQ ID NO:4));Dotted nostoc phenylalanine/histidine aminonialyase " NpPHAL " (NCBI YP_001865631.1 (SEQ ID NO:30);Glue must Trentepohlia (Rivularia sp.) histidine aminonialyase " RspHAL " (NCBI YP_007056096.1 (SEQ ID NO:31);Oscillatoria (Oscillatoria sp.) histidine aminonialyase " Osp HAL " (NCBI YP_07108482.1 (SEQ ID NO:32);With Gloeocapsa (Gloeocapsa sp.) histidine aminonialyase " GspHAL " (NCBI YP_007127054.1) (SEQ ID NO:33)。
The A figure of Fig. 2 shows compared with wild type AvPAL, the variant 22 tested at pH 7.0 (SEQ ID NO: 8), the sensibility (table to proteolysis of the reduction of variant 30 (SEQ ID NO:6) and variant 36 (SEQ ID NO:10) It is shown as chymotrypsin and trypsin resistance), as further described in embodiment 4.
The B figure of Fig. 2 provides display compared with wild type AvPAL, in the lower variant 22 tested of pH 4.0 to 5.2,30 Number and No. 36 increased tolerances to acid pH figure, as further described in embodiment 4.
Fig. 3 provides the K of wild type PAL and variant 36MAs a result.
Fig. 4 provides the data of display wild type PAL and variant 36 amino acid specificities.
Fig. 5 provides the phase of the wild type PAL for being shown that exposure to people's chymotrypsin and trypsase and variant 36 To the result of stability.
Fig. 6 provides the opposite of the wild type PAL for being shown that exposure to pig pancreatic extract and variant No. 36, No. 42 and No. 43 The result of stability.
Fig. 7 provides the figure for the result that display embodiment 5 obtains.
Fig. 8 provides the figure for the result that display embodiment 8 obtains.
Invention description
The present invention provides engineering PAL polypeptides, mutant, bioactive fragment and the like, and include engineering Change the pharmaceutical composition and industry group compound of PAL polypeptide, mutant, bioactive fragment and the like.
The present invention provides engineering phenylalanine lyase (PAL) polypeptides and combinations thereof, and coding engineering benzene The polynucleotides of alanine aminonialyase (PAL) polypeptide.In some embodiments, engineering PAL polypeptide is optimized to provide increasing Strong catalytic activity, and the sensibility and the increased tolerance to acid pH level to proteolysis reduced.Some In embodiment, engineering PAL polypeptide is deimmunized.The invention further relates to the compositions comprising being engineered PAL polypeptide to be used for The purposes for the treatment of and industrial purpose.
Abbreviation and definition:
Unless otherwise defined, all technical and scientific terms used herein usually have general with fields of the present invention The normally understood identical meaning of logical technical staff.In general, nomenclature used herein and cell described below training Support, the laboratory procedure of molecular genetics, microbiology, organic chemistry, analytical chemistry and nucleic acid chemistry is it is well known that and Those of generally use.Such technology is known and retouches in many texts well known to those skilled in the art and reference work It states.Standard technique or its modification are used for chemical synthesis and chemical analysis.Herein refer to above and below all patents, specially Benefit application, article and publication pass through reference herein and are clearly incorporated herein.
Although it was found that using any suitable method similar or equivalent to those described herein in the practice of the invention And material, certain methods and material are described herein.It should be appreciated that the present invention is not limited to the specific methods, scheme And reagent, changed because these depend on those skilled in the art using its technical scope.Therefore, it will define below Term is more fully described below as a whole by reference to the application.All patents for referring to herein above and hereafter, specially Benefit application, article and publication, are clearly incorporated herein by reference herein.
In addition, unless the context clearly dictates otherwise, singular " one (a) " as used herein, " one (an) " and " this It (the) " include plural referents.
Numberical range includes limiting the number of the range.Therefore, each numberical range disclosed herein is intended to include falling in Each relatively narrow numberical range in such broader numerical, as such relatively narrow numberical range is all clearly write in this paper Out.It is also intended to the numerical limitations that each maximum (or minimum) numerical limitations disclosed herein include each lower (or higher), As such lower (or higher) numerical limitations are clearly write out in this paper.
Term " about " means the acceptable error of particular value.In some cases, " about " mean in given value range 0.05%, in 0.5%, 1.0% or 2.0%.In some cases, " about " mean 1,2,3 or 4 standard deviation in given value It is interior.
In addition, headings provided herein be not can be had as a whole by reference to the application it is of the invention each The limitation of a aspect or embodiment.Therefore, the term that will be defined below is more filled as a whole by reference to the application Divide ground description.Nevertheless, to facilitate the understanding of the present invention, many terms are defined as follows.
Unless otherwise instructed, nucleic acid is from left to right write with 5' to the direction 3';Amino acid sequence is respectively with amino to carboxyl It writes from left to right in direction.
Term " including (comprising) " as used herein and its cognate with it includes meaning used (that is, It is equal to term " including (including) " and its corresponding cognate).
" EC " number refers to Enzyme Nomenclature of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (biochemistry and molecular biosciences Learn internation combination naming committee) (NC-IUBMB) enzyme nomenclature.The IUBMB Biochemical Classification is that the chemistry based on its catalysis is anti- The enzyme numerical classification system answered.
" ATCC " refers to American Type Culture Collection (American type culture collection), It includes gene and bacterial strain that biological deposits, which are collected,.
" NCBI " refers to National Center for Biological Information (in national biological information The heart) and wherein provided by sequence database.
As used herein, term " phenylalanine lyase (PAL) polypeptide " refers to further include histidine aminonialyase and junket One in the aromatic amino acid lyase family (EC 4.3.1.23, EC 4.3.1.24 and EC4.3.1.25) of propylhomoserin aminonialyase Fermentoid.Because some PAL enzymes can be used tyrosine and phenylalanine as substrate, PAL polypeptide be also occasionally referred to as phenylalanine/ Tyrosine ammonia lyase.However, AvPAL and variant disclosed and claimed herein are without using tyrosine as substrate.PAL polypeptide Catalysis L-phenylalanine is converted into trans-cinnamic acid and ammonia.PAL activity refers to the enzymatic activity of PAL polypeptide.In some preferred implementations In scheme, PAL enzyme also includes co-factor 3,5- dihydro -5- methylene -4H- imidazol-4-one (MIO).The co-factor may be to urge Required for changing activity, and by the cyclisation and dehydration of conservative active site Ala167-Ser168-Gly169 tripeptides section come shape At.
" albumen ", " polypeptide " and " peptide " used interchangeably herein, with indicate not length of tube or posttranslational modification (for example, Glycosylation or phosphorylation) pass through at least two amino acid polymers of amido bond covalent linkage.
" amino acid " is pushed away by its commonly known three letter symbols or by the biochemical nomenclature commission IUPAC-IUB The one-letter symbol recommended is mentioned herein.Similarly, nucleotide can be mentioned by its generally acceptable single letter code And.
When about cell, polynucleotides or polypeptide in use, term " engineering ", " recombination ", " non-naturally occurring " and " variant " refers to following material or material corresponding with the natural or natural form of the material: the side to be naturally not present originally Formula is modified or same but generated by synthetic material or derivative and/or operate generation by using recombinant technique.
As used herein, " wild type " and " naturally occurring " refers to the form found in nature.Such as wild type Polypeptide or polynucleotide sequence are sequences present in organism, can from natural origin separate and not by manual operation by It modifies consciously.
It is immune that " deimmunized ", which refers to that operation albumen is created not as good as wild type or reference protein, as used herein The variant of originality.In some embodiments, it deimmunizes and is completed, wherein misfolded proteins do not stimulate the misfolded proteins to be administered Immune response into its patient.The response can be measured by a variety of methods, and a variety of methods include but is not limited to, Apply albumen after neutralize (that is, drug resistance antibody) presence or abundance, the presence of anaphylaxis response or cytokine release it is general All over degree (prevalence) or intensity.In some embodiments, misfolded proteins are exempted from than wild-type protein or reference protein Epidemic focus is smaller.In some embodiments, deimmunizing includes to the albumen (for example, epitope) identified by T cell receptor Modification.In some embodiments, t cell epitope is removed from wild-type protein or reference protein, deimmunized to generate Misfolded proteins.In some embodiments, deimmunized albumen is shown to the biochemistry of human immunity response and thin The response of the reduced levels of the predictive factor of born of the same parents' biology, the response include that the measurement of Dendritic Cells T cell activation or people are white thin Extracellular antigen (HLA) peptide binding assay.
" coded sequence " refers to the part (for example, gene) of the nucleic acid of the amino acid sequence of coding albumen.
Term " Percentage of sequence identity (%) " is used herein to refer to the comparison between polynucleotides and polypeptides, and It is determined by comparing the sequence of two optimal comparisons across comparison window, wherein the portion of polynucleotides or polypeptide sequence in comparison window Dividing compared with reference sequences may include addition or missing (that is, notch), with the optimal comparison for two sequences.Percentage can be such as Lower calculating: by determining in two sequences the number of the position of identical nucleic acid base or amino acid residue occur, with generation Number with position, by the number of matching position divided by the total number of position in comparison window, and by result multiplied by 100 to obtain sequence The percentage of column identity.Optionally, percentage can calculate as follows: by determining in two sequences identical nucleic acid base occur Or the number of position that amino acid residue or nucleic acid base or amino acid residue are aligned with notch, to generate the number of matching position Mesh, by the number of matching position divided by the total number of position in comparison window, and by result multiplied by 100 to obtain sequence identity Percentage.It will be appreciated by those skilled in the art that can be used for comparing the established algorithm of two sequences in the presence of many.For what is compared Optimal sequence compares following progress, for example, as known in the art, passing through the local homology algorithm of Smith and Waterman (Smith and Waterman, Adv.Appl.Math., 2:482 [1981]), pass through the homology ratio of Needleman and Wunsch To algorithm (Needleman and Wunsch, J.Mol.Biol., 48:443 [1970]), pass through the similar of Pearson and Lipman It spends search method (Pearson and Lipman, Proc.Natl.Acad.Sci.USA 85:2444 [1988]), pass through these algorithms Computer realize (for example, GAP, BESTFIT, FASTA and TFASTA in GCG Wisconsin software package) or pass through mesh It surveys.The example for being adapted to determine that the algorithm of Percentage of sequence identity and sequence similarity includes but is not limited to BLAST and BLAST 2.0 algorithms are (see, for example, Altschul etc., J.Mol.Biol., 215:403-410 [1990];With Altschul etc., Nucleic Acids Res.,3389-3402[1977]).Software for carrying out BLAST analysis is by American National biology Technology information centre website is public obtainable.The algorithm includes first by identification search sequence (query sequence) Short word with length " W " determines high scoring sequence to (HSP), when it is compared with the word of equal length in database sequence When, the short word matches or meets some positive-valued threshold scoring " T ".T is referred to as neighborhood word scoring threshold value (referring to Altschul Deng ibid).It includes them that these initial neighborhood words, which hit (word hit) to be used as the seed of starting retrieval to find longer, HSP.Then word hit along the both direction of each sequence extend to accumulation alignment score cannot increased degree.For core Nucleotide sequence, using parameter " M ", (reward for a pair of of matching residue is scored for accumulation scoring;Always > 0) and " N " (is used for mispairing The Penalty Mark of residue;Always < 0) calculate.For amino acid sequence, using rating matrix to calculate accumulation scoring.Work as generation Word hits the extension stopping in each direction: the value amount of having dropped " X " that accumulation alignment score is obtained from its maximum when following situations When;Due to the accumulation of one or more negative scoring residue alignments, when accumulation scoring becomes zero or less;Or reach any sequence End.BLAST algorithm parameter W, T and X determine the sensitivity and speed compared.BLASTN program (for nucleotide sequence) uses The comparison of 11 word length (W), 10 desired value (E), M=5, N=-4 and two chains is as default parameters.For amino acid Sequence, BLASTP program is using 3 word length (W), 10 desired value (E) and BLOSUM62 rating matrix as default (referring to example Such as, Henikoff and Henikoff, Proc.Natl.Acad.Sci.USA 89:10915 [1989]).Sequence alignment and % sequence The exemplary determination of identity can be used BESTFIT in GCG Wisconsin software package (Accelrys, Madison WI) or GAP program uses the default parameters of offer.
" reference sequences " refer to that the basic sequencing really compared as sequence arranges.Reference sequences can be the son of bigger sequence Collection, for example, the segment of full-length gene or polypeptide sequence.In general, reference sequences are at least 20 nucleotide or amino acid residue Length, the length of at least 25 residues, the length of at least 50 residues, the length of at least 100 residues or nucleic acid or polypeptide Overall length.Due to two kinds of polynucleotides or polypeptide can respectively (1) include between two sequences similar sequence (that is, complete sequence A part), and (2) may also include sequence different between two sequences, two kinds (or more) polynucleotides or polypeptide it Between sequence more usually by " comparison window " relatively sequence of two kinds of polynucleotides or polypeptide carry out, to determine and Compare the regional area with sequence similarity.In some embodiments, " reference sequences " can be based on primary amino acid sequence, Wherein reference sequences are the sequences can in basic sequence with one or more variations.For example, phrase " it is based on SEQ ID NO: 4 reference sequences on the residue corresponding to X39 with valine " refer to corresponding on the position X39 in SEQ ID NO:4 Residue (for example, alanine) has been changed to the reference sequences of valine.
" comparison window " refers at least about conceptual section of 20 continuous nucleotide positions or amino acid residue, wherein sequence It can be compared at least reference sequences of 20 continuous nucleotides or amino acid, and the wherein sequence in comparison window A part is compared with the reference sequences (its do not include addition or lack) of the optimal comparison for two sequences, it may include 20% or Less addition or missing (that is, notch).Comparison window can than 20 consecutive residues it is longer, and optionally include 30,40, 50,100 or longer window.
When in the context in given amino acid or polynucleotide sequence number in use, " corresponding to ", " about " and " relative to " refers to when given amino acid or polynucleotide sequence are compared with specified reference sequences, the residue of reference sequences Number.In other words, the number of residues of given polymer or resi-dues are designated about reference sequences, rather than by given Amino acid or polynucleotide sequence in residue real figure position be designated.For example, given amino acid sequence, such as The amino acid sequence for being engineered PAL can be by introducing vacancy to optimize the residue match between two sequences and reference sequences ratio It is right.In such cases, despite the presence of vacancy, the number of the residue in given amino acid or polynucleotide sequence about with Its reference sequences compared is made.
" amino acid of differences " and " residue difference " refer to the amino acid residue at the position of polypeptide sequence relative to reference sequence In column corresponding position amino acid residue difference.The position of amino acid of differences commonly referred to herein as " Xn ", wherein n Refer to residue difference based on the corresponding position in its reference sequences.For example, " residual on the X91 of position compared with SEQ ID NO:4 Basis is different " refer to the difference of amino acid residue at polypeptide position corresponding to the position 91 of SEQ ID NO:4.Therefore, if The reference polypeptide of SEQ ID NO:4 has alanine on position 91, then " compared with SEQ ID NO:4 on the X91 of position Residue difference " refers to any residue other than the alanine at 91 polypeptide position of the position corresponding to SEQ ID NO:4 Amino acid replacement.This paper in most cases, the particular amino acid residue difference at position is expressed as " XnY ", wherein " Xn " is corresponding residue and position (as described above) of specified reference polypeptide, and " Y " is the amino found in engineered polypeptide The single-letter identifier (that is, different residues compared with reference polypeptide) of acid.In some cases (for example, in the table of embodiment In), the present disclosure also provides the specific amino acids differences indicated by conventional expressing method " AnB ", and wherein A is in reference sequences The single-letter identifier of residue, " n " is the number of the resi-dues in reference sequences, and B is in the sequence of engineered polypeptide The single-letter identifier of residue displacement.In some cases, the polypeptide of present disclosure may include one relative to reference sequences Or more amino acid residue, by relative to reference sequences, there are the instructions of a column specific position of residue difference.In some realities It applies in scheme, when more than one amino acid can be in the specific residues position of polypeptide in use, workable various amino acid are residual Base separates (for example, X307G/X307Q or X307G/Q) by "/".Present disclosure include include conservative and non-conservative ammonia Base acid displacement either one or two of/or both one or more amino acid of differences engineered polypeptide sequence substitutions.
Term " amino acid replacement set " and " displacement set " refer to one group of amino acid replacement in polypeptide sequence.Some In embodiment, displacement set includes 1,2,3,4,5,6,7,8,9,10,11,12,13 A, 14,15 or more amino acid replacements.In some embodiments, displacement set, which refers to, is present in appointing in embodiment The set of amino acid replacement in any one for the modification A vPAL polypeptide listed in one table.For example, present in the variant 36 Displacement collection is combined into A39V/A91V/N290G/H307G/L407V/T524S, wherein the amino acid position is relative to SEQ ID NO:4。
" conservative amino acid replacement " refers to the different residue displacement residues used and have similar side chain, and is therefore usually directed to Amino acid in the amino acid replacement polypeptide in the same or similar amino acid for defining classification.By way of example rather than limit System, with aliphatic lateral chain amino acid can be replaced by another aliphatic amino acid (for example, alanine, valine, leucine and Isoleucine);Amino acid with hydroxyl side chains by with hydroxyl side chains another amino acid replacement (for example, serine and Threonine);Amino acid with beta-branched side is by another amino acid replacement with beta-branched side (for example, phenylpropyl alcohol ammonia Acid, tyrosine, tryptophan and histidine);Amino acid with basic side chain is set by another amino acid with basic side chain Change (for example, lysine and arginine);Amino acid with acid side-chain is by another amino acid replacement with acid side-chain (for example, aspartic acid or glutamic acid);And hydrophobic amino acid or hydrophilic amino acid are respectively by another hydrophobic amino acid Or hydrophilic amino acid displacement.
" non-conservative displacement " refers to apparatus there were significant differences the amino acid replacement in the amino acid replacement polypeptide of side chain properties. Non-conservative displacement can be used between restriction group, rather than the amino acid within them, and influence: (a) in displaced region The structure (for example, proline displacement glycine) of peptide backbone;(b) charge or hydrophobicity;And/or (c) volume of side chain.By showing The mode of example rather than limit, the displacement of exemplary non-conservation includes that acidic amino acid is replaced by alkalinity or aliphatic amino acid;Virtue Fragrant race's amino acid is replaced by p1 amino acid;It is replaced with hydrophilic amino acid by hydrophobic amino acid.
" missing " refers to the modification carried out and removing one or more amino acid from reference polypeptide to polypeptide.It lacks Mistake may include removing 1 or more amino acid, 2 or more amino acid, 5 or more amino acid, 10 or more A amino acid, 15 or more amino acid or 20 or more amino acid, the amino acid sum for up to forming reference enzyme 10% or up to amino acid sum 20%, while retain enzymatic activity and/or retain engineering transaminase improved characteristic. Missing can be related to the inside and/or end of polypeptide.In each embodiment, missing may include continuous section or can be not Continuously.
" insertion " refers to the modification carried out and adding one or more amino acid to reference polypeptide to polypeptide.Insertion Can polypeptide inside or arrive carboxyl or amino terminal.Insertion as used herein includes merging egg as known in the art It is white.Insertion can be amino acid continuous section or by one or more Amino acid scores in naturally occurring polypeptide every.
Term " function fragment " and " bioactive fragment " refer to following polypeptide: the polypeptide used interchangeably herein With amino terminal and/or carboxyl-terminal deletion and/or internal missing, but wherein remaining amino acid sequence and it in contrast Sequence (for example, overall length of the invention is engineered PAL) in corresponding position it is identical, and it retains full-length polypeptide substantially All activity.
" isolated polypeptide " refers to following polypeptide: the polypeptide and its natural adjoint other pollutant (for example, albumen, Lipid and polynucleotides) it is substantially separate.Term includes from their naturally occurring environment or expression system (for example, host Cell or external synthesis) in take out or the polypeptide of purifying.Recombination PAL polypeptide may be present in it is intracellular, be present in cell culture medium In, or prepare in a variety of manners, such as lysate or isolated preparation.Therefore, in some embodiments, provided herein heavy Group PAL polypeptide is isolated polypeptide.
" polypeptide actually purified " refers to following composition, and polypeptide species are existing dominant species in the composition (that is, mole or on the basis of weight, it is more richer than any other individual macromolecular substances in the composition), and When target species constitute existing macromolecular species by mole or % weight based at least about 50% when, it is usually substantially pure The composition of change.In general, the PAL composition substantially purified will constitute pressing for all macromolecular species present in the composition Mole or % poidometer about 60% or more, about 70% or more, about 80% or more, about 90% or more, about 95% or more Mostly and about 98% or more.In some embodiments, target species are purified to basic homogeneity (that is, passing through routine Detection method cannot detect pollutant species in the composition), wherein the composition is substantially by single macromolecular species group At.Solvent species, small molecule (< 500 dalton) and element ion species are not considered as macromolecular species.In some embodiment party In case, the recombination PAL polypeptide of separation is the peptide composition substantially purified.
" improved enzyme characteristic " refers to and refers to PAL polypeptide such as wild type PAL polypeptide (for example, having SEQ ID NO:4 AvPAL wild type) or another engineering PAL polypeptide compare and show the improved engineering in any enzyme characteristic PAL polypeptide.Improved characteristic includes but is not limited to such characteristic, such as increased protein expression, increased thermal activities (thermoactivity), increased thermal stability, increased pH activity, increased stability, increased enzymatic activity, increased It is substrate specificity and/or affinity, increased specific activity, the increased inhibition that substrate and/or final product are inhibited, increased Chemical stability, improved chemo-selective, improved solvent stability, it is increased to acid pH tolerance, it is increased to egg White hydrolysing activity tolerance (that is, the sensibility to proteolysis reduced), reduce aggregation, increased solubility, reduction are exempted from The temperature curve of epidemic focus and change.
" increased enzymatic activity " and " catalytic activity of enhancing " refers to the improved characteristic of engineering PAL polypeptide, can quilt It is expressed as compared with reference to PAL enzyme (for example, wild type AvPAL and/or another engineering AvPAL), the increase (example of specific activity Such as, product/time of generation/weight albumen) and/or substrate to product percent conversion increase (for example, specified Period uses the PAL of specified amount, the percent conversion of the substrate of initial amount to product).Determine the exemplary side of enzymatic activity Method is provided in embodiment.It can influence any characteristic relevant to enzymatic activity, including classical enzyme characteristic Km、VmaxOr kcat, it Change can lead to increased enzymatic activity.The improvement of enzymatic activity can be about 1.1 times from the enzymatic activity of corresponding wild-type enzyme To another up to 2 times, 5 times, 10 times, 20 for being engineered PAL for deriving PAL polypeptide compared to naturally occurring PAL or from it Again, 25 times, 50 times, 75 times, 100 times, 150 times, 200 times or more of enzymatic activity.
In some embodiments, engineering PAL polypeptide has k belowcat: at least 0.1/ second, at least 0.2/ second, extremely Few 0.3/ second, at least 0.5/ second, at least 1.0/ second, and be greater than 1.0/ second in some preferred embodiments.In some realities It applies in scheme, KmIn the range of about 1 μm to about 5mM;In the range of about 5 μm to about 2mM;In the about l0 μm model to about 2mM In enclosing;Or in the range of about l0 μm to about 1mM.In some special embodiments, engineering PAL enzyme is shown than reference It is improved in big 1.5 to 10 times, 1.5 to 25 times, 1.5 to 50 times, 1.5 to 100 times or bigger of the range of the enzymatic activity of PAL enzyme Enzymatic activity.PAL activity can be measured by any standard test known in the art (for example, by monitoring reactant or product Spectrophotometry property in variation).In some embodiments, the amount of the product of generation passes through high performance liquid chromatography (HPLC) the UV absorbance after combining o- phthaldialdehyde (OPA) derivatization or fluorescence detection are separated to measure.In some embodiments In, enzymatic activity is carried out using determining enzyme preparation, the measurement of determination under setting condition and the substrate of one or more determinations Comparison, as described in herein in further detail.In general, when comparing lysate, the albumen of the number and measurement of cell Amount is determined, and using identical expression system and identical host cell will deposit in generated by host cell and lysate Enzyme amount variation minimize.
Term " the improved tolerance to acid pH " means that compared with reference PAL, recombination PAL according to the present invention is showed Increased stability is (that is, after being exposed to the period [1 hour, up to 24 hours] that acid pH is persistently specified, in about pH out The activity of higher reservation under 7.0).
" physiological pH " means the pH range usually found in the small enteral of subject (for example, people) as used herein.It is logical Gradient pH in the range of often having about 6.0 to 7.5 from pyloric valve to large intestine.
The term used in the improved stability to condition of acidic pH or the increased tolerance to acid pH is " acid PH " means about 1.5 to 6.8 pH range.
Term " proteolytic activity " used interchangeably herein and " proteolysis " refer to that protein breakdown is lesser more Peptide or amino acid.The decomposition of albumen is usually the result of enzyme hydrolysis of the peptide bond by protease (protease).Protease include but It is not limited to, pepsin, trypsase, chymotrypsin, elastoser;Carboxypeptidase A and B and peptase are (for example, amino peptide Enzyme, dipeptidase and erepsin).
Phrase " reduction to proteolytic susceptibility " and " reduced proteolytic susceptibility " is interchangeable herein makes With, it is intended that in standard test (for example, as disclosed in embodiment), after with reason on one or more of protease, root According to engineering PAL polypeptide of the invention by enzymatic activity with higher compared with reference PAL.
" aggregation " means the condensation or precipitating of PAL polypeptide.Aggregation can lead to enzyme inactivation.Term " aggregation of reduction " means work Journey PAL polypeptide will be less susceptible to assemble compared with reference PAL.Method for evaluating aggregation is known in the art, including but It is not limited to contaminate with the fluorescence microscopy art of appropriate dyestuff (for example, thioflavin T or Nile red), dynamic light scattering, with appropriate Expect the flow cytometry of (for example, fluorine boron is glimmering (Bodipy)), by the filtering and analysis of SDS-PAGE, and/or western blot, glimmering The use of light correlation spectrometry and electron microscope.In the presence of evaluation aggregation be obtained commercially kit (for example,Albumen assembles assay kit [Enzo]).
The enzymatic that " conversion ratio " refers to substrate to corresponding product converts (or bioconversion)." percent conversion " refers to The percentage of the substrate of product is converted into specified requirements lower a period of time.Therefore, " enzymatic activity " or " activity " of PAL polypeptide Substrate is represented by within specified a period of time to " percent conversion " of product.
" Hybridization stringency " is related to the hybridization conditions in nucleic acid hybridization, the condition such as washed.In general, hybridization reaction compared with It carries out, is then different but compared with the washing of high stringency under conditions of property low strict.Term " moderate stringency hybridization " refers to fair Perhaps target-DNA combines the condition of the nucleic acid of following complementation, and the nucleic acid of the complementation is with the identity, preferably with target DNA about 60% About 75% identity of ground, about 85% identity are greater than about 90% identity with target-polynucleotides.Exemplary moderate is tight Glazing bar part is the same as at 42 DEG C hybridizing in 50% formamide, 5 × Denhart solution, 5 × SSPE, 0.2%SDS, then The condition washed in 0.2 × SSPE, 0.2%SDS at 42 DEG C." hybridization of high stringency degree " typically refers to condition below: partially The heat fusing temperature T determined under solution condition from the polynucleotide sequence for definitionmAbout 10 DEG C or less.In some implementations In scheme, high stringency degree condition refers to condition below, is only allowed at 65 DEG C and forms stable hybridization in 0.018M NaCl Those of body nucleic acid sequence hybridization (that is, if hybrid in 0.018M NaCl be at 65 DEG C it is unstable, it is such as Considering herein will be unstable under the conditions of high stringency degree).For example, can be by being equal at 42 DEG C in 50% formyl Amine, 5 × Denhart solution, 5 × SSPE, 0.2%SDS condition in hybridize, then in 0.1 × SSPE and 0.1% at 65 DEG C It is washed in SDS, and high stringency is provided.Another high stringency degree condition be with it is such as the following with condition in hybridize: Hybridize in 5 × SSC containing 0.1% (w:v) SDS at 65 DEG C, and is washed in 0.1 × SSC containing 0.1%SDS at 65 DEG C. Other high stringency degree hybridization conditions and moderate stringency are described in bibliography cited above.
" codon optimization " refers to that the codon of the polynucleotides of coding albumen becomes preferentially using in specific organism Those of codon so that encoded albumen is more effectively expressed in the organism.Although genetic code is due to most of Several codons that amino acid is referred to as " synonym " or " synonymous " codon represent and are degeneracy, it is known that The codon of specific organism, which uses, to be nonrandom and is biased to specific codon triplet.With regard to given gene, have altogether The collectin of congenerous or the gene, highly expressed albumen of ancestors origin relative to low copy number protein and organism genome For code area, this codon may be higher using deviation.In some embodiments, to the polynucleotides of coding PAL enzyme It optimizes, for being produced from the host organisms optimization being selected for expression.
" control sequence " refers to that the expression including polynucleotides and/or polypeptide to present disclosure is necessary herein Or advantageous all components.Each control sequence can be natural or external source for the nucleic acid sequence of coding polypeptide.This Class control sequence includes but is not limited to, leader sequence, polyadenylic acid tracts, propeptide sequence, promoter sequence, signal peptide sequence, Homing sequence and transcription terminator.At minimum, control sequence includes promoter and transcription and translation termination signal.One In a little embodiments, control sequence is provided with connector, for introducing the purpose of specific restriction sites, promote control sequence with Encode the connection of the code area of the nucleic acid sequence of polypeptide.
" being operably connected " is defined herein as a kind of following preparation: in the configuration control sequence relative to Interested polynucleotides are desirably placed (that is, with functional relationship) in a position, so that control sequence guidance or adjustment are compiled The expression of the polynucleotides of the interested polypeptide of code.
" promoter sequence " refers to the nucleic acid sequence by host cell identification for the expression of interested polynucleotides, such as Coded sequence.Promoter sequence includes the transcriptional control sequence for mediating the expression of interested polynucleotides.Promoter can be Any nucleic acid sequence of transcriptional activity, including mutation, truncated and hybrid promoter are shown in the host cell of selection, And it can be obtained from extracellular or intracellular polypeptides the gene homologous or heterologous with host cell is encoded.
" reaction condition appropriate " those of refers in enzymic transformations solution condition (for example, enzyme carrying capacity, substrate carry The range of amount, temperature, pH, buffer, cosolvent etc.), the PAL polypeptide of present disclosure can turn substrate under the above conditions Turn to desired product Compound.Illustratively " reaction condition appropriate " is provided at (referring to embodiment) herein.
" carrying capacity ", such as refer in " compound carrying capacity " or " enzyme carrying capacity " react start when reaction mixture in component Concentration or amount." substrate " in the context of enzymic transformations process refers to the compound generated by the effect of PAL polypeptide Or molecule." product " in the context of enzymatic conversion processes refers to the compound generated from PAL polypeptide to the effect of substrate Or molecule.
Term " culture " as used herein refer to microbial cell group using any suitable culture medium (for example, liquid, Gel or solid) suitable growth under suitable condition.
Any suitable method known in the art can be used to generate in recombinant polypeptide (for example, PAL enzyme variants).For example, There are many different induced-mutation techniques well known to those skilled in the art.In addition, mutagenesis kit is also available from many business point Sub- biology supplier.Method can be used for making particular permutation on the amino acid (fixed point) of restriction, in the partial zones of gene Specificity (region-specificity) or random mutation in domain or the random mutagenesis (for example, saturation mutagenesis) in whole gene. Many suitable methods for generating enzyme variants are known to the skilled in the art, and are including but not limited to lured using PCR, boxlike Change, gene chemical synthesis, fallibility PCR, reorganization and chemical saturation mutagenesis or any other suitable method known in the art carry out single The direct mutagenesis of chain DNA or double-stranded DNA.For the non-limiting example of DNA and proteins engineered method in following patent It is provided: U.S. Patent number 6,117,679;U.S. Patent number 6,420,175;U.S. Patent number 6,376,246;U.S. Patent number 6,586,182;U.S. Patent number 7,747,391;U.S. Patent number 7,747,393;U.S. Patent number 7,783,428;The U.S. and The patent No. 8,383,346.After variant generates, they can be screened any desired characteristic (for example, high or increased activity or Low or active, increased thermal activities, increased thermal stability and/or acid pH stability for reducing etc.).In some embodiments In, " recombination PAL polypeptide " (also referred to as " engineering PAL polypeptide ", " variant PAL enzyme " and " variant PAL body ") discovery Purposes.
As used herein, " carrier " is the DNA construct for DNA sequence dna to be imported to cell.In some embodiments, Carrier is the suitable control that may be operably connected to the DNA sequence dna that can influence to encode polypeptide and express in suitable host The expression vector of sequence.In some embodiments, " expression vector ", which has, is operably connected to DNA sequence dna (for example, turning base Cause) to drive the promoter sequence expressed in host cell, and in some embodiments, it further include transcription terminator sequence Column.
As used herein, term " expression " includes any step for participating in polypeptide and generating, including but not limited to, transcription, Posttranscriptional modification, translation and posttranslational modification.In some embodiments, which further includes the secrete polypeptide from cell.
As used herein, term " generation " is the generation that finger protein and/or other compounds pass through cell.It is intended that Term includes any step for participating in polypeptide and generating, including but not limited to, transcription, posttranscriptional modification, translation and posttranslational modification. In some embodiments, which further includes the secrete polypeptide from cell.
As used herein, if two sequences are unassociated in nature, amino acid or nucleotide sequence are (for example, open Promoter sequences, signal peptide, terminator sequence etc.) another sequence for being operably connected to it with it is " heterologous ".
As used herein, term " host cell " and " host strain " refer to for expressing comprising DNA provided herein The suitable host of the carrier of (for example, the polynucleotide sequence for encoding at least one AvPAL variant).In some embodiments In, host cell is the protokaryon for being converted or being transfected with the carrier constructed using recombinant DNA technology as known in the art or true Nucleus.
Term " analog " means with the sequence identity for being greater than 70% with reference polypeptide, but the sequence less than 100% Identity polypeptide (for example, be greater than 75%, 78%, 80%, 83%, 85%, 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity) polypeptide.In some embodiments, analog includes non-natural Existing amino acid residue and naturally occurring amino acid, the non-naturally occurring amino acid residue include but is not limited to height Arginine, ornithine and norvaline.In some embodiments, analog further includes one or more of D- amino acid residues Non-peptide bond between two or more amino acid residues.
Term " treatment ", which is directed to display pathological signs or subject's application of symptom, has beneficial or desired doctor The compound that curative effect is answered.
Term " pharmaceutical composition " refer to be suitable for medicinal usage in mammalian subject (for example, people) comprising by this The composition of engineering the PAL polypeptide and acceptable carrier of the included pharmacy effective dose of invention.
Term " effective quantity " means the amount for being enough to generate desired result.Persons skilled in the art can be by using normal Rule test determine effective quantity is how many.
Referred to using term " separation " and " purifying " and is taken out from least one other components that it naturally associates with it Molecule (for example, isolated nucleic acid, polypeptide etc.) or other components.Term " purifying " does not require absolute purity, and its mesh Be as opposite definition.
Term " subject " includes mammal, such as people, non-human primate, domestic animal, pet and experimental animal (example Such as, rodent and lagomorph).Its purpose is that term includes women and male.
As used herein, term " patient " means any subject for being evaluated, treating or just undergoing disease.
Term " baby " refers to the children in the period of first month to about one (1) years old after birth.As used herein , term " newborn " refers to the children in the period of 28 days from birth to life.Term " premature " refers to completion The baby that gestation is born after 20 weeks but before full-term, usually weighing~500-~2499 gram at birth." extremely low birth Weight youngster " is the baby that weighing is lower than 1500g at birth.
As used herein, term " children " refers to for agreeing to that treatment or search procedure are not up to the people of lawful age. In some embodiments, which refers to the people between birth and hebetic period.
As used herein, term " adult ", which refers to, has reached the people of lawful age (for example, in beauty for related judicial power State is 18 years old).In some embodiments, which refers to any fully grown organism.In some embodiments In, term " youth " refers to less than 18 years old but has reached sexually matured people.
As used herein, " composition " and " preparation " include purpose for any suitable purposes comprising it is of the invention extremely A kind of few product (for example, pharmaceutical composition, diet/nutritional supplement, feed etc.) for being engineered PAL.
Term administering " and " application " composition mean to provide the present invention to subject (for example, the people for suffering from PKU effect) Composition.
When about pharmaceutical composition, in use, term " carrier " means standard pharmaceutical carriers, buffer and excipient are such as Any one of stabilizer, preservative and adjuvant.
Term " pharmaceutically acceptable " mean can be administered to subject without cause any bad biological effect or with Harmful mode with wherein it by comprising component any one interaction and possess the material of desired bioactivity.
As used herein, term " excipient " refers to any pharmaceutically acceptable additive, carrier, diluent, assistant Agent or other compositions, rather than active pharmaceutical ingredient (API;For example, engineering PAL polypeptide of the invention).Excipient usually wraps Include preparation and/or application purpose.
When about disease/situation symptom in use, term " therapeutically effective amount " refer to improvement, decrease or eliminate disease/ One or more symptoms of situation or prevention or delay symptom (for example, PKU) breaking-out compound (for example, engineering PAL Polypeptide) amount and/or concentration.In some embodiments, which is used about the amount of composition, the amount of the composition Cause the biology (example of tissue, system or the animal subjects found by researcher, doctor, veterinarian or other clinicians Such as, medicine) response.
When about disease/situation in use, term " therapeutically effective amount " refers to improvement, decrease or eliminates the disease/situation Composition amount and/or concentration.
Its purpose is that term " treatment (treating) ", " treatment (treat) " and " treatment (treatment) " includes The treatment (for example, prophylactic) of prevention, and relax and subtract treatment.
It is engineered PAL polypeptide:
Parent's PAL polypeptide that engineering PAL polypeptide of the invention is originated from it includes bacterium bacterial strain such as Anabaena (Anabaena) (for example, Anabaena variabilis (A.variabilis)), Nostoc (Nostoc) are (for example, dotted nostoc (N.punctiforme)), Rhodosporidium (Rhodosporidium) is (for example, circle rhodosporidium toruloides (R.toruloides)), streptomyces (Streptomyces) (such as S.maritimus or S.verticillatus), quiver bacterium Belong to (Oscillatoria sp.), Gloeocapsa (Gloeocapsa sp.) and glue palpus Trentepohlia (Rivularia sp.).From this The PAL enzyme of a little bacterial strains has been accredited, and is well known.It is provided in Fig. 1 from Anabaena (Anabaena variabilis) ATCC 29413 and NCBI YP_324488.1;Nostoc (dotted nostoc) ATCC 29133 and NCBI YP_00186563.1;It quivers Trentepohlia (Oscillatoria sp.) PCC 6506 and NCBI ZP_07108482.1 and Gloeocapsa (Gloeocapsa sp.) The homologous enzyme sequence of PCC 7428 and NCBI YP_007127054.1.Dotted nostoc phenylalanine/histidine aminonialyase " NpPHAL"(NCBI YP_001865631.1(SEQ ID NO:30);Glue must Trentepohlia (Rivularia sp.) histidine solution ammonia Enzyme " RspHAL " (NCBI YP_007056096.1 (SEQ ID NO:31);Oscillatoria (Oscillatoria sp.) histidine solution Adnosine deaminase " Osp HAL " (NCBI YP_07108482.1 (SEQ ID NO:32);Ammonia is organized with Gloeocapsa (Gloeocapsa sp.) Acidolysis adnosine deaminase " GspHAL " (NCBI YP_007127054.1) (SEQ ID NO:33) and AvPAL (SEQ ID NO:4) have more In 70% homology.
In addition, when specific PAL variant (that is, engineering PAL polypeptide) is by reference to wild type PAL or with reference to the sequence of PAL When the modification of particular amino acid residue in column is mentioned, it should be understood that herein include in the another of equivalent site modification A kind of variant of PAL (determined by such as from the optional amino acid alignment between respective amino acid sequence).One In a little embodiments, PAL peptide source is engineered since the above bacterium bacterial strain is (that is, Nostoc [dotted nostoc], red winter spore ferment Mother belongs to [circle rhodosporidium toruloides], streptomyces [S.maritimus or S.verticillatus], Oscillatoria (Oscillatoria Sp.), Gloeocapsa (Gloeocapsa sp.) and glue must Trentepohlias (Rivularia sp.)) in the polypeptide listed any one Kind.In some other embodiments, engineering PAL polypeptide of the invention includes conservative active site Alal67- Serl68-Glyl69, and include with SEQ ID NO:4 at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% Or at least 99% sequence identity.In some embodiments, engineering PAL polypeptide not only include PAL activity, also have pair The activity of tyrosine and/or histidine substrate.
In some embodiments, engineering PAL polypeptide under conditions of being conducive to and generating engineering PAL polypeptide by training The microorganism comprising at least one polynucleotide sequence for encoding at least one engineering PAL polypeptide is supported to generate.In some realities It applies in scheme, engineering PAL polypeptide is then recycled from resulting culture medium and/or cell.
The present invention provides with the active exemplary work PAL polypeptide of PAL.Embodiment provides display and engineering The table of the Sequence Structure Information of the relevant specific amino acids sequence signature of the functional activity of PAL polypeptide.Structure-function correlation letter Breath with the particular amino acid residue difference of the reference engineered polypeptide relative to SEQ ID NO:4 and with exemplary work The form of the relevant experimentally determining activity data of PAL polypeptide provides.
It is in some embodiments, of the invention that have the active engineering PAL polypeptide of PAL include a) amino acid sequence, The amino acid sequence has the sequence identity with reference sequences SEQ ID NO:4 at least 85%;B) with SEQ ID NO:4 phase Than the amino acid residue difference at one or more amino acid positions;And c) the engineering PAL polypeptide with refer to sequence Column are compared to showing improved characteristic selected from the following: the catalytic activity that i) enhances, ii) proteolytic susceptibility that reduces, Iii) the increased tolerance to acid pH, iv) reduction aggregation or i), ii), iii) or any combination iv).
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, At least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and with SEQ ID NO:4 phase Than at one or more amino acid positions (such as with SEQ ID NO:4 or have and SEQ ID NO:4 at least 85%, extremely Few 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or the sequence of bigger amino acid sequence identity compare, 1,2,3,4,5,6,7,8,9, 10, at 11,12,14,15,20 or more amino acid positions) amino acid residue difference.In some embodiments, with SEQ ID NO:4 compared to residue difference at one or more positions include at least one, 2,3,4,5,6,7, 8,9,10 or more conservative amino acid replacements.In some embodiments, engineering PAL polypeptide is in embodiment Listed polypeptide in the table of offer.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, At least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and when and SEQ ID When the amino acid sequence optimal comparison of NO:4, in one or more amino acid positions selected from the following compared with SEQ ID NO:4 Set the amino acid residue difference at place: X39;X54;X59;X73;X91;X158;X112, X134, X180;X195;X240;X243; X245;X256;X257;X270;X290;X304,X305;X307;X308;X326;X349;X353;X364;X394;X399; X400;X404;X407;X443;X453;X459;X460;X463;X474;X509;X521;X522;X524;X528;X546; X564 or any combination thereof.In some embodiments, amino acid of differences are 1,2,3,4,5,6,7,8 A, 9,10,15 or 20 or more amino acid position.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:4 at least 85% (at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, At least 96%, at least 97%, at least 98%, sequence identity at least 99%), and it is included in the amino acid at the H307 of position Residue difference and optionally in 1,2,3,4,5,6,7,8,9,10 or more amino acid position The amino acid residue difference at place.In some embodiments, the amino acid residue difference at position 307 is H307/G/Q/M.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:4 at least 85% (at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, At least 96%, at least 97%, at least 98%, sequence identity at least 99%), and include selected from the following one or more At least one amino acid residue difference of a combination: A39;T54;G59, S73;A91;Y158;S180;K195;A112; R134;Q240;T243;I245;A256;L257;N270;N290;Y304;R305;H307;E308;I326;L349;D353; L364;A394;S399;N400;P404;L407;F443;N453;Y459;T460;T463;N474;E509;Q521;K522; T524;P528;S546;And/or P564.In some other embodiments, exist in 1,2,3,4,5,6 A, 7,8,9, the amino acid residue difference at 10 or more amino acid positions.
In some embodiments, the engineering PAL polypeptide for showing improved characteristic has with SEQ ID NO:4 at least 85% (at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, At least 97%, at least 98%, sequence identity at least 99%), and include to be selected from when with SEQ ID NO:4 optimal comparison One or more combined amino acid residue difference below: A39V;T54K;G59R;S73K;A112C;R134Q; A91V;Y158H;S180A;K195E;Q240R/W;T243I/L;I245L;A256G;L257W/A;N270K;N290G;Y304H; R305M;H307G/Q/M;E308Q;I326F;L349M;D353A/N;L364Q;A394V;S399N;N400K;P404A; L407V;F443H;N453G;Y459F;T460G;T463N;N474Q;E509L;Q521K/S;K522Y/F/N;T524S; P528L;S546R;And P564G/L/M.
In some embodiments, amino acid residue difference is selected from below one when with SEQ ID NO:4 optimal comparison A or more combination: A39V;A91V;A256G;N290G;A394V;S399N;P404A;L407V;K522Y/F/N;With/ Or T524S.
In some embodiments, the present invention provides the function fragments of engineering PAL polypeptide.In some embodiments In, function fragment include its engineering PAL polypeptide being originated from (that is, parent is engineered PAL) at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98% or at least about 99% activity.In some embodiments, function Segment include at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, extremely The parental array of few about 96%, at least about 97%, at least about 98% or at least about 99% engineering PAL.In some embodiment party In case, function fragment will be truncated less than 5, less than 10, less than 15, less than 10, less than 25, less than 30, it is small In 35, less than 40, less than 45 and less than 50 amino acid.
In some embodiments, the present invention provides the function fragments of engineering PAL polypeptide.In some embodiments In, function fragment include its engineering PAL polypeptide being originated from (that is, parent is engineered PAL) at least about 95%, at least about 96%, at least about 97%, at least about 98% or at least about 99% activity.In some embodiments, function fragment includes extremely Few 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least The parental array of 98% or at least 99% engineering PAL.In some embodiments, function fragment will be truncated less than 5, Less than 10, less than 15, less than 10, less than 25, less than 30, less than 35, less than 40, less than 45, be less than 50, less than 55, less than 60, less than 65 or less than 70 amino acid.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:6 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, At least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and with SEQ ID NO:6 phase Than, with SEQ ID NO:6 or have with SEQ ID NO:6 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger ammonia The sequence of base acid sequence identity is compared, in one or more amino acid positions (such as in 1,2,3,4,5,6 A, 7,8,9,10,11,12,14,15 or multiple amino acid positions) at amino acid residue difference.? In some embodiments, engineering PAL includes sequence identity with SEQ ID NO:6 at least 90%, and includes and SEQ ID NO:6 is compared at least one, 2,3,4,5,6,7,8,9,10 or more amino acid positions Amino acid of differences.In some embodiments, engineering PAL polypeptide is made of the sequence of SEQ ID NO:6.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:10 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:10, with SEQ ID NO:10 or have with SEQ ID NO:10 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions Set) at amino acid residue difference.In some embodiments, engineering PAL includes and SEQ ID NO:10 at least 95% Sequence identity, and include at least one, 2,3,4,5,6,7,8,9 compared with SEQ ID NO:10 A, 10 or more amino acid positions amino acid of differences.In some embodiments, engineering PAL polypeptide is by SEQ ID The sequence of NO:10 forms.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:12 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:12, with SEQ ID NO:12 or have with SEQ ID NO:12 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions Set) at amino acid residue difference.In some embodiments, engineering PAL includes and SEQ ID NO:12 at least 95% Sequence identity, and include at least one, 2,3,4,5,6,7,8,9 compared with SEQ ID NO:12 A, 10 or more amino acid positions amino acid of differences.In some embodiments, engineering PAL polypeptide is by SEQ ID The sequence of NO:12 forms.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:14 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:14, with SEQ ID NO:14 or have with SEQ ID NO:14 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions Set) at amino acid residue difference.In some embodiments, engineering PAL includes and SEQ ID NO:14 at least 95% Sequence identity, and include at least one, 2,3,4,5,6,7,8,9 compared with SEQ ID NO:14 A, 10 or more amino acid positions amino acid of differences.In some embodiments, engineering PAL polypeptide is by SEQ ID The sequence of NO:14 forms.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:16 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:16, with SEQ ID NO:16 or have with SEQ ID NO:16 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions Set) at amino acid residue difference.In some embodiments, engineering PAL includes with SEQ ID NO:16's at least 95% Sequence identity, and include compared with SEQ ID NO:16 at least one, 2,3,4,5,6,7,8,9, The amino acid of differences of 10 or more amino acid positions.In some embodiments, engineering PAL polypeptide is by SEQ ID NO: 16 sequence composition.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:18 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:18, with SEQ ID NO:18 or have with SEQ ID NO:18 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or multiple amino acid positions) The amino acid residue difference at place.In some embodiments, engineering PAL includes the sequence with SEQ ID NO:18 at least 95% Identity, and include at least one, 2,3,4,5,6,7,8,9,10 compared with SEQ ID NO:18 Or more amino acid position amino acid of differences.In some embodiments, engineering PAL polypeptide is by SEQ ID NO:18's Sequence composition.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:20 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:20, with SEQ ID NO:20 or have with SEQ ID NO:20 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or multiple amino acid positions) The difference of the amino acid residue at place.In some embodiments, engineering PAL includes the sequence with SEQ ID NO:20 at least 95% Column identity, and include at least one, 2,3,4,5,6,7,8,9,10 compared with SEQ ID NO:20 The amino acid of differences of a or more amino acid position.In some embodiments, engineering PAL polypeptide is by SEQ ID NO:20 Sequence composition.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:22 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:22, with SEQ ID NO:22 or have with SEQ ID NO:22 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions Set) at amino acid residue difference.In some embodiments, engineering PAL includes with SEQ ID NO:22's at least 95% Sequence identity, and include compared with SEQ ID NO:22 at least one, 2,3,4,5,6,7,8,9, The amino acid of differences of 10 or more amino acid positions.In some embodiments, engineering PAL polypeptide is by SEQ ID NO: 22 sequence composition.
In some embodiments, the engineering PAL polypeptide for showing at least one improved characteristic has and SEQ ID NO:24 or its function fragment at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and Compared with SEQ ID NO:24, with SEQ ID NO:24 or have with SEQ ID NO:24 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or the sequence of bigger amino acid sequence identity compare, in one or more amino acid positions (such as 1 A, 2,3,4,5,6,7,8,9,10,11,12,14,15 or multiple amino acid positions) The difference of the amino acid residue at place.In some embodiments, engineering PAL includes the sequence with SEQ ID NO:24 at least 95% Column identity, and include at least one, 2,3,4,5,6,7,8,9,10 compared with SEQ ID NO:24 The amino acid of differences of a or more amino acid position.In some embodiments, engineering PAL polypeptide is by SEQ ID NO:24 Sequence composition.
The variant to proteolytic susceptibility with reduction:
In some embodiments, engineering PAL polypeptide of the invention have PAL activity, show reduce to albumen The sensibility of hydrolysis, and include: a) there is the amino acid sequence with the sequence identity of reference sequences SEQ ID NO:4 at least 85% Column;B) amino acid residue difference compared with SEQ ID NO:4 at one or more amino acid positions.
In some embodiments, show reduce the sensibility to proteolysis engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, extremely Few 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and with SEQ ID NO:4 compare one or more amino acid positions (such as with SEQ ID NO:4 or have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, At least 97%, at least 98%, at least 99% or bigger amino acid sequence identity sequence compare in 1,2,3,4 A, 5,6,7,8,9,10,11,12,14,15,20 or more amino acid positions) at ammonia Base acid residue difference.
In some embodiments, show reduce the sensibility to proteolysis engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, extremely Few 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and when with When the amino acid sequence optimal comparison of SEQ ID NO:4, in one or more ammonia selected from the following compared with SEQ ID NO:4 Amino acid residue difference at base acid position: X39;X54;X59;X73;X91;X158;X112,X134,X180;X195;X240; X243;X245;X256;X257;X270;X290;X304,X305;X307;X308;X326;X349;X353;X364;X394; X399;X400;X404;X407;X443;X453;X459;X460;X463;X474;X509;X521;X522;X524;X528; X546;X564;Or any combination thereof.In some embodiments, amino acid of differences are 1,2,3,4,5,6,7 A, 8,9,10, the amino acid position of 15 or 20 or more.
In some embodiments, show reduce the sensibility to proteolysis engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity, and it is included in position X307;X326;X460;X307;And/or the amino acid residue difference at X528, Amino optionally at 1,2,3,4,5,6,7,8,9,10 or more amino acid position Sour residue difference.In some embodiments, when comparing with SEQ ID NO:4, amino acid residue difference is selected from Y304H/W; R305L/M;H307G/M/Q;I326F;Q240W;T460G;P528L;With any combination of these displacements.
In some embodiments, show reduce the sensibility to proteolysis engineering PAL polypeptide have with Any of SEQ ID NO:10,12,14,16,18,20,22 and/or 24 or its function fragment at least 85%, at least 88%, At least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence identity, and with SEQ ID NO:10,12,14,16,18,20,22 and/or 24 amino acid residue differences compared, the amino acid residue difference with SEQ ID NO:10,12,14,16,18,20,22 And/or 24 or have with SEQ ID NO:10,12,14,16,18,20,22 and/or 24 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, At least 99% or bigger amino acid sequence identity sequence compare one or more amino acid positions (such as 1, 2,3,4,5,6,7,8,9,10,11,12,14,15 or more amino acid positions) Place.In some embodiments, engineering PAL include with SEQ ID NO:10,12,14,16,18,20,22 and/or 24 at least 95% sequence identity, and include at least one, 2 compared with SEQ ID NO:10,12,14,16,18,20,22 and/or 24 A, 3,4,5,6,7,8,9,10 or more amino acid positions amino acid of differences.In some implementations In scheme, PAL include SEQ ID NO:10,12,14,16,18,20,22 and/or 24 sequence or by SEQ ID NO:10,12, 14,16,18,20,22 and/or 24 sequence composition.
In some embodiments, under substantially the same conditions, the proteolytic susceptibility of PAL polypeptide is engineered than wild The proteolytic susceptibility of raw type PAL (for example, AvPAL with SEQ ID NO:4) reduces compared with reference PAL polypeptide At least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90% or at least 95%.This field has can be used in proteolytic activity Any appropriate methodology for knowing measures, including but not limited to those of described in embodiment.
In some embodiments, when reference PAL and engineering PAL with reduced sensibility are in essentially identical item By relatively and when being exposed to the protease of substantially identical amount and type under part, there is the sensibility to proteolysis reduced Being engineered PAL polypeptide has the sensibility to the composition comprising one or more of protease reduced, the protease packet Pepsin, trypsase, chymotrypsin, Carboxypeptidase A and B, peptase are included but are not limited to (for example, aminopeptidase, dipeptidase And erepsin).
In some embodiments, it is ginseng that the engineering PAL polypeptide with the sensibility to proteolysis reduced, which has, Examine about 1.0 times, 2 times, 5 times, 10 times, 20 times, 25 times, 50 times, 75 times, 100 times, 150 of the enzymatic activity of PAL (for example, AvPAL) Times, the enzyme activity level of 200 times or more high magnification numbe.In some embodiments, when measurement work under the pH range 4.5 to 7.5 Property when, when under the pH range 4.5 to 6.5 measure activity when;When measuring activity under the pH range 5.0 to 7.5;When When measuring activity under 5.0 to 6.5 pH range;When measuring activity under the pH range 5.5 to 7.5;And/or also when 5.5 When measuring activity under to 6.5 pH range;Engineered polypeptide has enzymatic activity bigger compared with reference PAL.Some other In embodiment, engineering PAL polypeptide has the K within the scope of 1 μ Μ to 5mMmValue.
With the increased variant to acid pH tolerance:
In some embodiments, engineering PAL polypeptide of the invention has PAL activity, and tolerance acid pH is horizontal, and wraps Contain: a) there is the amino acid sequence with reference sequences SEQ ID NO:4 or the sequence identity of its segment at least 85%;B) with SEQ ID NO:4 is compared to the amino acid residue difference at one or more amino acid positions.
In some embodiments, it is shown compared with wild type AvPAL and/or another reference polypeptide increased to acid Property pH tolerance engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% Or bigger amino acid sequence identity, and compared with SEQ ID NO:4, one or more amino acid positions (such as with SEQ ID NO:4 or have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, extremely Few 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or bigger amino acid sequence The sequence of column identity compare 1,2,3,4,5,6,7,8,9,10,11,12,14, 15,20 or more amino acid positions) at amino acid residue difference.
In some embodiments, it is shown compared with wild type AvPAL and/or another reference polypeptide increased to acid Property pH tolerance engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% Or bigger amino acid sequence identity, and when amino acid sequence optimal comparison with SEQ ID NO:4, with SEQ ID NO:4 Compared to the amino acid residue difference at one or more amino acid positions, one or more amino acid position is selected from X39;X54;X59;X73;X91;X158;X112, X134, X180;X195;X240;X243;X245;X256;X257;X270; X290;X304,X305;X307;X308;X326;X349;X353;X364;X394;X399;X400;X404;X407;X443; X453;X459;X460;X463;X474;X509;X521;X522;X524;X528;X546;X564;Or any combination thereof.One In a little embodiments, amino acid of differences are 1,2,3,4,5,6,7,8,9,10,15 or 20 Or more amino acid position.
In some embodiments, it is shown compared with wild type AvPAL and/or another reference polypeptide increased to acid Property pH tolerance engineering PAL polypeptide have with SEQ ID NO:4 at least 85%, at least 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity, and include selected from the amino at following position Sour residue difference: X39;X54;X59;X73;X91;X158;X112, X134, X180;X195;X240;X243;X245;X256; X257;X270;X290;X304, X305;X307;X308;X326;X349;X353;X364;X394;X399;X400;X404; X407;X443;X453;X459;X460;X463;X474;X509;X521;X522;X524;X528;X546;X564 or its is any Combination;Optionally at 1,2,3,4,5,6,7,8,9,10 or more amino acid position Amino acid residue difference.In some embodiments, when being compared with SEQ ID NO:4, amino acid residue difference A39; T54;G59, S73;A91;Y158;S180;K195;A112;R134;Q240;T243;I245;A256;L257;N270;N290; Y304;R305;H307;E308;I326;L349;D353;L364;A394;S399;N400;P404;L407;F443;N453; Y459;T460;T463;N474;E509;Q521;K522;T524;P528;S546;And/or P564.In some embodiments, Show the increased engineering PAL polypeptide to acid pH tolerance has and SEQ ID when comparing with SEQ ID NO:4 The sequence of NO:4 at least 85%, at least 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% Column identity, and it is included in the amino acid residue difference at following one or more positions: A39V;T54K;G59R;S73K; A112C;R134Q;A91V;Y158H;S180A;K195E;Q240R/W;T243I/L;I245L;A256G;L257W/A;N270K; N290G;Y304H;R305M;H307G/Q/M;E308Q;I326F;L349M;D353A/N;L364Q;A394V;S399N; N400K;P404A;L407V;F443H;N453G;Y459F;T460G;T463N;N474Q;E509L;Q521K/S;K522Y/F/ N;T524S;P528L;S546R;And/or P564G/L/M.
In some embodiments, when every other determination condition is essentially identical, have compared with reference PAL polypeptide and increase The engineering PAL polypeptide of the tolerance to acid pH added has increased tolerance under following pH range: 1.5 to Between 6.5;Between 1.5 and 5.0;Between 2.0 to 5.5;Between 3.0 and 6.8;Between 3.0 and 5.5;In 4.0 Hes Between 6.5;Between 4.0 and 4.5;Between 4.5 and 5.0;Between 4.5 and 5.5;Between 4.5 and 6.0;In 4.5 Hes Between 6.5;Between 5.0 and 6.5;Between 5.0 and 6.0;Between 5.0 and 5.5;Between 5.5 and 6.0;In 6.0 Hes Between 6.5;And/or between 6.5 and 7.0.In some embodiments, the increased tolerance to acid pH is shown about 3.5, under 4.0,4.5,5.0,5.5,6.0 and/or 6.5 pH.
In some embodiments, the engineering PAL compared with reference PAL with the increased tolerance to acid pH is more Peptide also shows bigger PAL activity when measuring by standard test.Any suitable measurement has practical in the present invention Property, it is including but not limited to those of presented herein.
It is further contemplated that the engineered polypeptide (that is, No. 1-variant of variant 1010) of any illustration, such as by subsequent More wheels are evolved through Combination nova of the addition from other polypeptides described herein He a variety of amino acid of differences of other resi-dues, Has the purposes as the original amino acid sequence for synthesizing Other Engineering PAL polypeptide.In some embodiments, in addition Improvement by include run through former wheels evolution in be left intact residue positions amino acid of differences and generate. This is not intended to that the present invention is made to be limited to any specific method for generating engineering PAL polypeptide, because any suitable method is at this Has practicability in invention, method including but not limited to those of presented herein.
Encode polynucleotides, expression vector and the host cell of engineered polypeptide:
The present invention provides the polynucleotides for encoding engineering PAL polypeptide described herein.In some embodiments, more Nucleotide is operably connected to one or more heterologous regulatory sequences of control gene expression, can be expressed with creation more The recombination of polynucleotide of peptide.In some embodiments, by least one heterologous multicore comprising coding engineering PAL polypeptide The expression construct of thuja acid introduces host cell appropriate to express corresponding PAL polypeptide.
Such as those skilled in the art will be apparent, the codon of the availability of protein sequence and corresponding various amino acid Knowledge provide the description to all polynucleotides that can encode the theme polypeptide.The degeneracy of genetic codon, wherein Same amino acid is encoded by codon substitute or synonymous, allows to prepare maximal number purpose nucleic acid quilt, and all these nucleic acid are compiled Code engineering PAL polypeptide.Therefore, by the combination for selecting to select based on possible codon, the present invention provides for generating The method and group of every kind of the PAL polynucleotides for the coding PAL polypeptide described herein that can be prepared and each possible variation Object is closed, and all these variations will be considered being included in embodiment (for example, multiple for any polypeptide described herein In table) in present amino acid sequence be specifically disclosed.
In some embodiments, codon is preferably optimized to be used to the selected host cell benefit of albumen With.For example, preferred codon used in bacterium is normally used for the expression in bacterium.Therefore, coding engineering PAL is more The polynucleotides of the codon optimization of peptide full length coding region about 40%, 50%, 60%, 70%, 80%, 90% or be greater than It include preferred codon at 90% codon position.
In some embodiments, PAL polynucleotide encoding is with PAL activity, the work with characteristic disclosed herein Journey polypeptide, wherein the polypeptide include have with selected from SEQ ID NO:3,5,7,9,11,13,15,17,19,21, and/or The amino acid sequence at least 80% of 23 reference sequences or any variant (for example, the variant provided in embodiment), 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or bigger The amino acid sequence of identity, and the reference multicore with SEQ ID NO:3,5,7,9,11,13,15,17,19,21 and/or 23 One or more residue differences that the amino acid sequence of any variant disclosed in thuja acid or embodiment is compared (such as 1,2 A, 3,4,5,6,7,8,9,10 or more amino acid residue positions).In some embodiments, Reference sequences are selected from SEQ ID NO:3,5,7,9,11,13,15,17,19,21 and/or 23.
In some embodiments, PAL polynucleotide encoding is active with PAL, work with characteristic disclosed herein Journey polypeptide, wherein the polypeptide include have with reference sequences SEQ ID NO:4 at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity Amino acid sequence, when the polypeptide optimal comparison of He Dangyu SEQ ID NO:4, from following residual compared with SEQ ID NO:4 One or more residue differences at base location: X39;X54;X59;X73;X91;X158;X112, X134, X180;X195; X240;X243;X245;X256;X257;X270;X290;X304,X305;X307;X308;X326;X349;X353;X364; X394;X399;X400;X404;X407;X443;X453;X459;X460;X463;X474;X509;X521;X522;X524; X528;X546;And/or X564.
In some embodiments, the polynucleotides of coding engineering PAL polypeptide include to be selected from polynucleotide sequence SEQ The polynucleotide sequence of ID NO:3,5,7,9,11,13,15,17,19,21 and/or 23.In some embodiments, work is encoded The polynucleotides of journey PAL polypeptide have with SEQ ID NO:2,3,5,7,9,11,13,15,17,19,21 and/or 23 at least 80%, 85%, 90%, 93%, 95%, 96%, 97%, 98%, 99% nucleotide residue identity.
In some embodiments, polynucleotides can under high stringency with selected from SEQ ID NO:2,3,5,7, 9,11,13,15,17,19,21 and/or 23 reference polynucleotide sequence or its complementary series or coding are provided herein any The polynucleotide sequence of variant PAL polypeptide hybridizes.In some embodiments, the multicore that can hybridize under high stringency Thuja acid coding includes the PAL polypeptide of amino acid sequence, and the amino acid sequence has compared with SEQ ID NO:4 selected from following Residue positions one or more residue differences: X39;X54;X59;X73;X91;X158;X112,X134,X180; X195;X240;X243;X245;X256;X257;X270;X290;X304,X305;X307;X308;X326;X349;X353; X364;X394;X399;X400;X404;X407;X443;X453;X459;X460;X463;X474;X509;X521;X522; X524;X528;X546;And/or X564.
In some embodiments, the isolated polynucleotides of any one of engineering PAL polypeptide of this paper are encoded It is operated in many ways, to promote the expression of PAL polypeptide.In some embodiments, the polynucleotides packet of PAL polypeptide is encoded Containing the expression vector for regulating and controlling the expression of PAL polynucleotides and/or polypeptide there are one or more control sequences.Depending on institute The expression vector utilized, separated polynucleotides its be inserted into carrier in front of operation can be it is desirable that or it is necessary 's.It is well known in the art that the technology of polynucleotides and nucleic acid sequence is modified using recombinant DNA method.In some embodiments In, control sequence includes, and especially, promoter, leader sequence, Polyadenylation sequences, propeptide sequence, signal peptide sequence and turns Record terminator.In some embodiments, suitable promoter is selected according to the selection of host cell.It is thin for bacterial host Born of the same parents, the suitable promoter for instructing the nucleic acid construct of present disclosure to transcribe include but is not limited to from the following starting obtained Son: Escherichia coli (E.coli) lac operon, streptomyces coelicolor (Streptomyces coelicolor) agarase gene (dagA), bacillus subtilis (Bacillus subtilis) type froctosan saccharase gene (sacB), bacillus licheniformis (Bacillus licheniformis) alpha-amylase gene (amyL), bacillus stearothermophilus (Bacillus Stearothermophilus) maltogenic amylase gene (amyM), bacillus amyloliquefaciens (Bacillus Amyloliquefaciens) α solves amylase gene (amyQ), bacillus licheniformis (Bacillus licheniformis) blueness Mycin enzyme gene (penP), bacillus subtilis (Bacillus subtilis) xylA and xylB gene and protokaryon beta lactamase Gene (see, for example, Villa-Kamaroff etc., Proc.Natl Acad.Sci.USA 75:3727-3731 [1978]), with And tac promoter (see, for example, DeBoer etc., Proc.Natl Acad.Sci.USA 80:21-25 [1983]).For filiform The illustrative promoter of fungal host cells includes, but are not limited to from the promoter obtained for gene below: aspergillus oryzae It is (Aspergillus oryzae) TAKA amylase, rhizomucor miehei (Rhizomucor miehei) aspartic protease, black Aspergillus (Aspergillus niger) neutrality α powder amylase, Aspergillus niger acid stable alpha-amylase, aspergillus niger or aspergillus awamori (Aspergillus awamori) glucoamylase (glaA), rhizomucor miehei lipase, line protease, meter Qu Mould phosphotriose isomerase, aspergillus nidulans (Aspergillus nidulans) acetamidase and Fusarium oxysporum (Fusarium Oxysporum) trypsin like proteases (see, for example, WO 96/00787) and NA2-tpi promoter are (from for black The heterozygote of the promoter of the gene of aspergillus neutral alpha-amylase and Aspergillus oryzae triose phosphate isomerase promoter) and its mutation , truncated and heterozygosis promoter.Exemplary yeast cells promoter may be from for gene below: saccharomyces cerevisiae (Saccharomyces cerevisiae) enolase (ENO-1), saccharomyces cerevisiae galactokinase (GAL1), saccharomyces cerevisiae second Alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP) and saccharomyces cerevisiae glycerol 3-phosphate acid kinase.For yeast host Other useful promoters of cell are known in the art (see, for example, Romanos etc., Yeast 8:423-488 [1992])。
In some embodiments, control sequence is still suitble to transcription terminator sequences (that is, by host cell identify with Terminate the sequence of transcription).In some embodiments, terminator sequence is operably connected to the nucleic acid for encoding the PAL polypeptide The end 3' of sequence.Functional any suitable terminator has practicability in the present invention in the host cell of selection. Exemplary transcription terminator for filamentous fungal host cell can be obtained from for gene below: aspergillus oryzae TAKA starch Enzyme, aspergillus niger glucoamylase, aspergillus nidulans anthranilate synthase, aspergillus niger alpha-Glucosidase and Fusarium oxysporum pancreas egg White enzyme sample protease.Exemplary terminator for yeast host cell can be obtained from for gene below: saccharomyces cerevisiae alkene Enolase, S. cerevisiae cytochrome C (CYC1) and S. cerevisiae glyceraldehyde -3- phosphate dehydrogenase.For yeast host cell Other useful terminators be (see, for example, Romanos etc., ibid) known in the art.
In some embodiments, the leader sequence that control sequence is still suitble to is (that is, the translation to host cell is important The untranslated area of mRNA).In some embodiments, leader sequence is operably connected to the nucleic acid sequence of coding PAL polypeptide The end 5'.Functional any suitable leader sequence has practicability in the present invention in the host cell of selection.With It is obtained in the exemplary leader sequence of filamentous fungal host cell from following gene: oryzae TAKA amylase and aspergillus nidulans phosphorus Sour triose isomerase.Suitable leader sequence for yeast host cell is obtained from for gene below: saccharomyces cerevisiae alkene Enolase (ENO-1), saccharomyces cerevisiae glycerol 3-phosphate kinases, cerevisiae alpha-factor and Ethanol in Saccharomyces cerevisiae dehydrogenase/glycerol Aldehyde -3- phosphate dehydrogenase (ADH2/GAP).
In some embodiments, control sequence or polyadenylation sequence are (that is, be operably connected to nucleic acid sequence The sequence of the end 3' of column, and when transcription, it is identified as polyadenosine residues being added to transcription by host cell The signal of mRNA).Functional any suitable polyadenylation sequence has in the present invention in the host cell of selection Practicability.Exemplary polyadenylic acid tracts for filamentous fungal host cell include, but are not limited to for gene below: Oryzae TAKA amylase, aspergillus niger glucoamylase, aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin enzyme Sample protease and aspergillus niger α powder glucosidase.The polyadenylation sequence useful to yeast host cell be it is known (referring to For example, Guo and Sherman, Mol.Cell.Bio., 15:5983-5990 [1995]).
In some embodiments, control sequence or signal peptide are (that is, coding is connected to the amino of the amino terminal of polypeptide Acid sequence and the code area for secreting access that the polypeptide of coding is inducted into cell).In some embodiments, nucleic acid sequence The end 5' of coded sequence inherently includes signal peptide coding region, the coding of the polypeptide of the signal peptide coding region and coding secretion With the meeting translation reading frame natural connection of the section in area.Optionally, in some embodiments, the end 5' of coded sequence includes pair It is the signal peptide coding region of external source for coded sequence.The polypeptide of expression is inducted into the secretion access of the host cell of selection Any suitable signal peptide coding region have the purposes of the expression for engineered polypeptide.It is effective for bacterial host cell Signal peptide coding region includes but is not limited to from those of obtaining for gene below: bacillus (Bacillus) NC1B 11837 maltogenic amylases, bacillus stearothermophilus α-bud amylase, bacillus licheniformis subtilisin, lichens gemma Bacillus beta-lactamase, stearothermophilus neutral protease (nprT, nprS, nprM) and bacillus subtilis prsA. Other signal peptide is known in the art (see, for example, Simonen and Palva, Microbiol.Rev., 57:109-137 [1993]).In some embodiments, signal peptide coding region effective for filamentous fungal host cell include but is not limited to from The signal peptide coding region obtained for gene below: oryzae TAKA amylase, Aspergillus ni ger neutral amylase, aspergillus niger glucose Amylase, rhizomucor miehei aspartic protease, Humicola insolens (Humicola insolens) cellulase and fetal hair are rotten Mould (Humicola lanuginosa) lipase of matter.Useful signal peptide for yeast host cell includes but is not limited to come From for those of gene below: cerevisiae alpha-factor and Saccharomyces cerevisiae invertase.
In some embodiments, control sequence or propeptide coding region, coding are located in the amino terminal of polypeptide Amino acid sequence.Gained polypeptide is known as " preferment (proenzyme) ", " preceding polypeptide (propolypeptide) " or " proenzyme ".Before Polypeptide can be converted into mature active peptides, crack the propetide from preceding polypeptide by catalysis or autocatalysis.Propeptide coding region It can be obtained from any suitable source, the source includes but is not limited to for gene below: Bacillus subtilis alkaline Protease (aprE), Bacillus subtilis neutral protease (nprT), cerevisiae alpha-factor, rhizomucor miehei aspartic acid egg White enzyme and thermophilic fungus destroyed wire (Myceliophthora thermophila) lactase are see, for example, WO 95/33836).Work as letter When number peptide and propetide area are present in the amino terminal of polypeptide, amino terminal positioning and signal peptide area of the propetide area close to polypeptide Amino terminal close to propetide area positions.
In some embodiments, adjusting sequence is also used.These sequences promote life of the polypeptide relative to host cell The adjusting of long expression.The example of regulating system is to cause gene expression in response to chemical or physical stimulus, including adjust chemical combination The presence of object and those of open or close regulating system.In prokaryotic host cell, suitable adjusting sequence includes but unlimited In lac, tac and trp operon system.In yeast host cell, suitable regulating system includes but is not limited to ADH2 system Or GAL1 system.In filamentous fungi, suitable regulating and controlling sequence includes but is not limited to TAKA alpha-amylase promoter, aspergillus niger Portugal Saccharogenic amylase promoter and aspergillus oryzae glucose starch enzyme promoters.
On the other hand, the present invention relates to recombinant expression carrier, the recombinant expression carrier includes coding engineering PAL One or more Expression modulation areas of the polynucleotides of polypeptide and the type depending on its host cell being introduced into, are such as opened Mover and terminator, replication orgin etc..In some embodiments, various nucleic acid described herein and control sequence are connected to Together to generate recombinant expression carrier, allow to encode the core of PAL polypeptide it includes one or more convenient restriction sites Acid sequence is inserted into or replaces in these sites.Optionally, in some embodiments, nucleic acid sequence of the invention is by by nucleic acid Sequence or nucleic acid construct comprising the sequence are inserted into suitable expression vector to express.It is being related to creating some of expression vector In embodiment, coded sequence is located in carrier, so that coded sequence and the control sequence appropriate for being used to express are operationally Connection.
Recombinant expression carrier can be any suitable carrier (for example, plasmid or virus), be convenient to experience recombination DNA program, and cause the expression of PAL polynucleotide sequence.The selection of carrier generally depends on carrier and carrier and will be introduced into The compatibility of host cell.Carrier can be linear or closure cyclic plasmid.
In some embodiments, expression vector is autonomously replicationg vector (that is, carrying as existing for extrachromosomal entity Body is replicated independently of chromosome replication, such as plasmid, extra-chromosomal element, minichromosome or artificial chromosome).Carrier It may include any tool (means) for ensuring self-replacation.In some alternate embodiments, carrier is to work as to be introduced into When host cell, the carrier that is be integrated into genome and is replicated together with the chromosome that it is integrated into.In addition, in some realities It applies in scheme, single carrier or plasmid is utilized, or jointly comprise two kinds of all DNA of the genome of host cell to be introduced Or more carrier or plasmid and/or transposons.
In some embodiments, expression vector includes one or more selectable markers, allows to be easy choosing Select transformed cell." selectable marker " is gene, and product provides biocide or virus resistance, to heavy metal Resistance, to auxotrophic prototrophy etc..The example of the selectable marker of bacterium includes but is not limited to come from withered grass The dal gene of bacillus or bacillus licheniformis, or imparting antibiotic resistance such as ampicillin, kanamycins, chlorine are mould The marker of element or tetracyclin resistance.Suitable marker for yeast host cell include but is not limited to ADE2, HIS3, LEU2, LYS2, MET3, TRP1 and URA3.Selectable marker for filamentous fungal host cell includes but is not limited to AmdS (acetamidase;For example, the acetamidase from aspergillus nidulans (A.nidulans) or aspergillus oryzae (A.orzyae)), argB (ornithine transcarbamylase), bar (phosphine oxamate acetyltransferase;For example, coming from streptomyces hygroscopicus (S.hygroscopicus) phosphine oxamate acetyltransferase), hph (hygromix phosphotransferase), niaD (nitrate reductase Enzyme), pyrG (orotic nucleoside-5'-phosphate decarboxylase;For example, the Orotidine-5 ' from aspergillus nidulans or aspergillus oryzae '-phosphorus Acid decarboxylase), sC (sulfate adenylyl transferase) and trpC (anthranilate synthase) and its equivalent.In another side Face, the present invention provides host cells, and it includes at least one multicores for encoding at least one engineering PAL polypeptide of the invention Thuja acid, the polynucleotides are operably connected to for expressing the one or more of engineering PAL enzyme in host cell Control sequence.Be suitable for expression the host cell as used in the polypeptide of expression vector codes of the invention be it is well known that , and including but not limited to bacterial cell, such as Escherichia coli, vibrio fluvialis (Vibrio fluvialis), streptomyces With salmonella typhimurium (Salmonella typhimurium) cell;Fungal cell, such as yeast cells are (for example, wine brewing Yeast or Pichia pastoris (Pichia pastoris) (ATCC deposit number 201178));Insect cell, such as Drosophila (Drosophila) S2 and Noctua (Spodoptera) Sf9 cell;Zooblast, such as CHO, COS, BHK, 293 and Bowes Melanoma cells;And plant cell.Illustrative host cell further includes various Escherichia coli (Escherichia coli) bacterium Strain is (for example, W3110 (Δ fhuA) and BL21).
Therefore, on the other hand, the present invention provides the method for generating engineering PAL polypeptide, wherein the method packets It includes under conditions of being suitable for polypeptide expression, culture can express the host cell for encoding the polynucleotides of engineering PAL polypeptide. In some embodiments, this method further includes the steps that separating and/or purifying PAL polypeptide as described herein.
Culture medium and growth conditions appropriate for host cell are known in the art.It is contemplated that will use The present invention is used in any suitable method that the polynucleotides for expressing PAL polypeptide are introduced cell.Suitable technology include but It is not limited to: electroporation, bioloistic particle bombardment, liposome-mediated transfection, calcium chloride transfection and protoplast fusion.
Engineering PAL polypeptide with characteristic disclosed herein can be by making to encode naturally occurring or engineering PAL The polynucleotides of polypeptide undergo any suitable mutagenesis and/or it is as known in the art and/or it is as described herein orient into Change method obtains.Illustrative directed evolution technologies be mutagenesis and/or DNA reorganization (see, for example, Stemmer, Proc.Natl.Acad.Sci.USA 91:10747-10751[1994];WO 95/22625;WO 97/0078;WO 97/ 35966;WO 98/27230;WO 00/42651;WO 01/75767 and United States Patent (USP) 6,537,746).Workable other are determined To evolution program include staggeredly extension process (StEP), vitro recombination (see, for example, Zhao etc., Nat.Biotechnol., 16: 258-261 [1998]), mutagenesis PCR is (see, for example, Caldwell etc., PCR Methods Appl., 3:S136-S140 [1994]) and cassette mutagenesis is (see, for example, Black etc., Proc.Natl.Acad.Sci.USA 93:3525-3529 [1996])。
The method of mutagenesis and directed evolution can be easily applicable in the polynucleotides of coding PAL-, can be by table with generation The variant library for reaching, screening and measuring.Any suitable mutagenesis and directed evolution method are applied to the present invention and are that this field is ripe Know (see, for example, U.S. Patent number 5,605,793,5,830,721,6,132,970,6,420,175,6,277,638,6, 365,408、6,602,986、7,288,375、6,287,861、6,297,053、6,576,467、6,444,468、5,811238、 6,117,679、6,165,793、6,180,406、6,291,242、6,995,017、6,395,547、6,506,602、6,519, 065、6,506,603、6,413,774、6,573,098、6,323,030、6,344,356、6,372,497、7,868,138、5, 834,252、5,928,905、6,489,146、6,096,548、6,387,702、6,391,552、6,358,742、6,482, 647、6,335,160、6,653,072、6,355,484、6,03,344、6,319,713、6,613,514、6,455,253、6, 579,678、6,586,182、6,406,855、6,946,296、7,534,564、7,776,598、5,837,458、6,391, 640、6,309,883、7,105,297、7,795,030、6,326,204、6,251,674、6,716,631、6,528,311、6, 287,862、6,335,198、6,352,859、6,379,964、7,148,054、7,629,170、7,620,500、6,365, 377、6,358,740、6,406,910、6,413,745、6,436,675、6,961,664、7,430,477、7,873,499、7, 702,464、7,783,428、7,747,391、7,747,393、7,751,986、6,376,246、6,426,224、6,423, 542、6,479,652、6,319,714、6,521,453、6,368,861、7,421,347、7.058.515、7,024,312、7, 620,502,7,853,410,7,957,912,7,904,249 and the corresponding application of all relevant non-United States;Ling etc., Anal.Biochem, 254 (2): 157-78 [1997];Dale etc., Meth.Mol.Biol., 57:369-74 [1996];Smith, Ann.Rev.Genet, 19:423-462 [1985];Botstein etc., Science, 229:1193-1201 [1985];Carter, Biochem.J.,237:1-7[1986];Kramer etc., Cell, 38:879-887 [1984];Wells e etc., Gene, 34: 315-323[1985];Minshull etc., Curr.Op.Chem.Biol., 3:284-290 [1999];Christians etc., Nat.Biotechnol.,17:259-264[1999];Crameri etc., Nature, 391:288-291 [1998];Crameri. Deng Nat.Biotechnol., 15:436-438 [1997];Zhang etc., Proc.Nat.Acad.Sci.U.S.A., 94:4504- 4509[1997];Crameri etc., Nat.Biotechnol., 14:315-319 [1996];Stemmer, Nature, 370:389- 391[1994];Stemmer, Proc.Nat.Acad.Sci.USA, 91:10747-10751 [1994];WO 95/22625;WO 97/0078;WO 97/35966;WO 98/27230;WO 00/42651;WO 01/75767;WO 2009/152336 and the U.S. The patent No. 6,537,746.Its is all to be incorporated herein by reference).
In some embodiments, the enzyme clone obtained after mutagenic treatment by make enzyme preparation experience definition temperature (or Other determination conditions), and the amount of remaining enzymatic activity is screened after measuring heat treatment or other suitable determination conditions.So Afterwards by comprising encode PAL polypeptide polynucleotides clone separate from gene, be sequenced with identify nucleotide sequence change (if If having), and for expressing enzyme in host cell.Enzymatic activity of the measurement from expression library can be used known in the art Any suitable method (for example, standard biochemical techniques, such as HPLC are analyzed) Lai Jinhang.
For the engineered polypeptide of known array, the polynucleotides of codase can pass through standard according to known synthetic method Solid phase method preparation.In some embodiments, the segment of up to about 100 bases can be synthesized individually, then be connected (for example, passing through enzyme or chemical connection process (chemical litigation method) or polymerase-mediated method) is with shape At any desired continuous sequence.For example, using classical phosphoramidite method (see, for example, Beaucage etc., Tet.Lett, 22:1859-69[1981];With Matthes etc., EMBO J., 3:801-05 [1984]), such as the institute usually in automatic synthesis method Practice, polynucleotides and oligonucleotides disclosed herein can be prepared by chemical synthesis.According to phosphoamidite method, few core Thuja acid is synthesized and (for example, in automatic DNA synthesizer, purifies, anneals, connects and be cloned into suitable carrier).
Therefore, in some embodiments, it is used to prepare the method for engineering PAL polypeptide can include: (a) composite coding packet The polynucleotides of the polypeptide of amino acid sequence containing the amino acid sequence selected from any variant as described herein, and (b) table Up to the PAL polypeptide by polynucleotide encoding.In some embodiments of this method, by the amino acid sequence of polynucleotide encoding Optionally have one or several (for example, up to 3,4,5 or up to 10) amino acid residues missing, insertion and/ Or displacement.In some embodiments, amino acid sequence optionally has 1-2,1-3,1-4,1-5,1-6,1-7 A, 1-8,1-9,1-10,1-15,1-20,1-21,1-22,1-23,1-24,1-25,1-30 A, 1-35,1-40,1-45 or 1-50 amino acid residue missing, insertion and/or displacement.In some embodiments, Amino acid sequence optionally have 1,2,3,4,5,6,7,8,9,10,11,12,13, 14,15,16,17,18,19,20,21,22,23,24,25,30,30,35,40 A, 45 or 50 amino acid residue missings, insertion and/or displacement.In some embodiments, amino acid sequence optionally has There are 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,18 A, 20,21,22,23,24 or 25 amino acid residue missings, insertion and/or displacement.In some embodiments In, it is replaced into conservative substitution or non-conservative displacement.
Using any suitable measurement as known in the art, measurement including but not limited to described herein and item Part evaluates the combination of any desired improved characteristic or the characteristic of expressed engineering PAL polypeptide (for example, activity, selection Property, stability, sour tolerance, protease sensitive etc.).
It in some embodiments, will be using any one of known technology for protein purification or more Any being recycled from cell and/or culture medium for the engineering PAL polypeptide expressed in host cell, for the ripe of protein purification Know that technology includes, bacteriolyze enzymatic treatment, ultrasonic treatment, filter, saltout, ultracentrifugation and chromatography and other.
Chromatographic technique for separating PAL polypeptide includes, among others, RP chromatography, high performance liquid chromatography Method, ion-exchange chromatography, hydrophobic interaction chromatography, size exclusion chromatography, gel electrophoresis and affinity chromatography.For The condition part for purifying certain enzyme depends on the factors such as net charge, hydrophobicity, hydrophily, molecular weight, molecular shape, and Those skilled in the art will be apparent.In some embodiments, affine technolog can be used for separating improved PAL enzyme.It is right It is purified in affinity chromatography, any antibody of the interested PAL polypeptide of specific binding can be used.In order to generate antibody, pass through use PAL polypeptide or its a variety of host animals of segment injecting immune, including but not limited to rabbit, mouse, rat etc..In some embodiments In, PAL polypeptide or segment are by means of side chain functionalities or are attached to the adapters of side chain functionalities and are attached to suitable load Body, such as BSA.
In some embodiments, engineering PAL polypeptide is by including that the following method generates in host cell: having Polynucleotides of the culture comprising encoding engineering PAL polypeptide as described herein under conditions of being generated conducive to engineering PAL polypeptide The host cell (for example, coli strain) of sequence, and the recycling engineering PAL polypeptide from cell and/or culture.One In a little embodiments, host cell generates more than one engineering PAL polypeptide.
In some embodiments, the present invention provides the methods for generating engineering PAL polypeptide, and the method includes suitable The recombinant bacterial cell of polynucleotide sequence of the culture comprising coding engineering PAL polypeptide is under the condition of culture of conjunction to allow to generate It is engineered PAL polypeptide and optionally from culture and/or the bacterial cell recycling engineering PAL polypeptide of culture, the engineering PAL polypeptide has the sequence with reference sequences SEQ ID NO:4 at least 85%, 90%, 95%, 96%, 97%, 98% or 99% Identity, when the amino acid sequence optimal comparison of He Dangyu SEQ ID NO:4 one selected from the following compared with SEQ ID NO:4 Or more amino acid residue difference: X39;X91;X158;X180;X195;X243;X245;X256;X257;X270;X290; X307;X308;X326;X349;X364;X394;X399;X400;X404;X407;X443;X453;X459;X460;X463; X474;X522;X524;With X528 or combinations thereof.In some embodiments, host cell generates more than one engineering PAL Polypeptide.
In some embodiments, PAL polypeptide is engineered after recycling in recombinant host cell and/or culture, they It can be further purified by any suitable method known in the art.In some other embodiments, the TAL of purifying is more Peptide and other compositions and compound combination are to provide the appropriate composition for being used for different application and purposes comprising engineering PAL polypeptide With preparation (for example, pharmaceutical composition).
Composition:
The present invention provides be suitable for being engineered PAL polypeptide used in numerous compositions.These compositions are for being permitted It is multi-field, including but not limited to medicine, diet/nutritional supplement, food, feed and fine chemistry industry production.For example, some In embodiment, the present invention provides food and/or feed, the food and/or feed include that at least one engineering PAL becomes At least one polynucleotide sequence of body and/or at least one PAL variant of coding.In some embodiments, the present invention provides Drink, the drink include at least one engineering PAL variant.
In some embodiments, the engineering PAL variant in food, feed and/or nutrition/dietary supplements is sugar Base.In addition, engineering PAL variant is applied to any suitable edible enzyme delivery matrices.In some embodiments, Engineering PAL variant is present in edible enzyme delivery matrices, and the edible enzyme delivery matrices are designed to absorb In the digestive tract quick diffusion of the PAL variant in animal after variant.
The present invention also provides engineering PAL polypeptide, the engineering PAL polypeptide is suitable in fine chemicals and other It uses in the generation of industrial important compound (see, for example, U.S. Patent Application No. 2013/0340119,2013/ 0005012 and 2005/0260724 and WO 2012/122333).
Pharmaceutical composition and other compositions
The present invention provides engineering PAL polypeptide, the engineering PAL polypeptide is suitable in pharmaceutical composition and other groups It closes and is used in object such as diet/nutritional supplement.
Depending on method of application, these compositions of the engineering PAL according to the present invention comprising therapeutically effective amount are in solid The form of body, semisolid or liquid.In some embodiments, composition includes other pharmaceutically acceptable components, such as Diluent, buffer, excipient, salt, emulsifier, preservative, stabilizer, filler and other compositions.For what is prepared and apply The details of technology is well known in the art, and is described in the literature.
In some embodiments, engineering PAL polypeptide is prepared for using in combination of oral medication.In delivering work Any suitable form used in journey PAL polypeptide be applied to the present invention, the form include but is not limited to pill, tablet, Gel film (gel tabs), capsule, pastille, dragee, powder, soft gel, collosol and gel, gel, emulsion, implantation material, patch, Spray, ointment, liniment, creme, paste, gelling agent, paint, aerosol, chewing gum, moderator, bar, suspension (packet Include but be not limited to Oil-based suspensions, oil in water emulsion etc.), medicine slurry, syrup, controlled release preparation, suppository etc..In some embodiments In, PAL polypeptide is engineered to be suitable for the form of injection and provide (that is, in the form of preparation of injectable).In some embodiment party In case, PAL polypeptide is engineered with biocompatible matrix such as collosol and gel, including based on silica (for example, oxysilane (oxysilane)) collosol and gel provides.In some embodiments, encapsulation engineering PAL polypeptide.Some optional In embodiment, engineering PAL polypeptide is encapsulated in nanostructure (for example, nanotube, nanochannel (nanotubule), receiving Rice glue capsule or micron capsule, microsphere, liposome etc.) in.In fact, not expected limit the invention to any specific delivering system Agent and/or delivery apparatus.It is expected that engineering PAL polypeptide is applied by any suitable means known in the art, including but Be not limited to: parenteral, it is oral, local, percutaneous, intranasal, intraocular, intrathecal, via implantation material etc..
In some embodiments, engineering PAL polypeptide is passed through into glycosylation, Pegylation (that is, using polyethylene glycol [PEG] or the PEG etc. of activation modification) or other compound chemical modifications (see, for example, Ikeda, Amino Acids 29: 283-287[2005];U.S. Patent number 7,531,341,7,534,595,7,560,263 and 7,53,653;U.S. Patent application Publication No. 2013/0039898,2012/0177722 etc.).In fact, not expected make the present invention be limited to any specific delivery side Method and/or mechanism.
In some other embodiments, engineering PAL polypeptide is provided at the system comprising the stable enzyme crystal of matrix In agent.In some embodiments, said preparation includes that the crystallization of crosslinking is engineered PAL enzyme and with the reaction for being attached to enzyme crystal Partial polymer.The present invention also provides engineering PAL polypeptide with polymer.
In some embodiments, the composition comprising engineering PAL polypeptide of the invention includes one or more of normal Carrier compound comprising but it is not limited to sugared (for example, lactose, sucrose, mannitol and/or D-sorbite), starch (example Such as, corn, wheat, rice, potato or other plant starch), cellulose is (for example, methylcellulose, hydroxypropyl methyl fiber Element, sodium carboxymethylcellulose), gummy (for example, Arab, tragacanth, guar gum etc.) and/or albumen is (for example, gelatin, collagen Deng).Other component in oral preparation may include colorant and/or sweetener (for example, glucose, sucrose and mannose Alcohol) and lubricant (for example, magnesium stearate) and enteric coating (for example, methacrylate polymers, hydroxypropyl methyl are fine Tie up plain phthalic acid ester and/or any other suitable enteric coating known in the art).In some embodiments, it collapses Solution agent or solubilizer are included (for example, polyvinylpyrrolidone, agar, alginic acid or its salt of crosslinking, such as mosanom).One In a little embodiments, especially in liquid preparation, engineering PAL polypeptide and a variety of other group subassemblys are described other Component includes but is not limited to preservative, suspending agent, thickener, wetting agent, alcohols, fatty acid and/or emulsifier.
In some embodiments, especially in liquid preparation, engineering PAL polypeptide and a variety of other groups are grouped It closes, the other component includes but is not limited to preservative, suspending agent, thickener, wetting agent, alcohols, fatty acid and/or emulsification Agent.In some embodiments, PAL polypeptide will be engineered and is administered to subject with other compound combinations for being used to treat PKU, Described other compounds for treating PKU include but is not limited toTetrahydrobiopterin (BioMarin Pharmaceutical, Inc., Novato, CA), antiacid (such as Omeprazole, esomeprazole and other pyrazoles), with And any other suitable compound.
In some embodiments, the present invention provides engineering PAL polypeptide, the engineering PAL polypeptide is suitable for Reduce the concentration of the phenylalanine in fluid blood, celiolymph etc..It is administered to the engineering PAL polypeptide in animal Dosage depends on situation or disease, the ordinary circumstance of animal and other factors well known by persons skilled in the art.In some implementations In scheme, composition is applied to animal for single or multiple.In some embodiments, it is contemplated that engineering PAL polypeptide Concentration in the composition for being administered to animal (for example, the people for suffering from PKU) is enough effectively to treat, improve and/or prevent disease (such as the relevant situation of PKU and/or PKU, disease and/or symptom).In some embodiments, PAL polypeptide and its are engineered His drug and/or dietary composition are administered in combination.
Industry group compound
It is expected that engineering PAL polypeptide of the invention is applied in industry group compound.In some embodiments, Engineering PAL polypeptide is prepared for using in food and/or feed industry.In some embodiments, engineering PAL is more Peptide is formulated as granular or pellet product, and such as other enzyme is (for example, cellulase, laccase and starch with animal feed components Enzyme) mixing.In some optional embodiments, engineering PAL polypeptide is used for liquid animal fodder compound (for example, water The slurries of base or oil base).Therefore, in some embodiments, engineering PAL variant of the invention is heat-resisting enough and thermostabilization With bear for generate pill and other processing the processing of feed/food.
Engineering PAL variant of the invention is also used to prepare phenylalanine and/or phenylalanine derivative.
This application provides the following contents:
Project 1. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, and the engineered polypeptide includes: A) amino acid sequence, the amino acid sequence have the sequence with reference sequences SEQ ID NO:4 or its function fragment at least 85% Column identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:4 or its function fragment one or more At least one amino acid residue difference at a amino acid position;And c) engineered polypeptide and the reference sequences SEQ ID NO:4 is compared and is shown improved characteristic selected from the following: the catalytic activity that i) enhances, ii) reduce to proteolysis Sensibility, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), ii), Iii), iv) or any combination v).
Engineered polypeptide of the project 2. as described in project 1, wherein one or more amino acid position be selected from 20, 24、27、39、43、45、47、54、58、59、62、70、73、80、82、91、94、98、104、105、110、112、115、117、 118、119、121、123、124、125、126、127、128、129、130、131、133、134、135、139、140、141、142、 143、144、145、146、147、149、150、151、153、154、156、157、158、159、172、174、175、176、177、 178、180、187、191、195、199、205、206、210、212、213、214、232、240、243、245、247、248、250、 256、257、266、270、275、278、279、285、286、289、290、292、304、305、307、308、309、319、321、 326、331、332、334、349、353、355、364、365、369、370、371、372、374、375、377、378、379、381、 382、383、384、385、387、389、394、396、399、400、403、404、407、417、418、425、431、432、433、 434、435、436、437、438、439、443、446、447、453、456、459、460、461、463、471、472、473、474、 475、476、477、478、479、482、483、503、507、509、521、522、524、525、528、538、546、547、551、 558,560,564,565 and/or any combination thereof, wherein the amino acid position is numbered with reference to SEQ ID NO:4.
Engineered polypeptide of the project 3. as described in any one of aforementioned project, wherein the reference sequences SEQ ID NO:4 Amino acid residue correspond to A39, T54, G59, S73, A91, Y158, S180, K195, A112, R134, Q240, T243, I245、A256、L257、N270、N290、Y304、R305、H307、E308、I326、L349、D353、L364、A394、S399、 N400, P404, L407, F443, N453, Y459, T460, T463, N474, E509, Q521, K522, T524, P528, S546 and/ Or P564.
Engineered polypeptide of the project 4. as described in any one of aforementioned project, wherein when with the polypeptide of SEQ ID NO:4 most When good comparison, the amino acid residue difference compared with SEQ ID NO:4 is one or more in following displacement: A39V、T54K、G59R、S73K、A112C、R134Q、A91V、Y158H、S180A、K195E、Q240R/W、T243I/L、I245L、 A256G、L257W/A、N270K、N290G、Y304H、R305M、H307G/Q/M、E308Q、I326F、L349M、D353A/N、 L364Q、A394V、S399N、N400K、P404A、L407V、F443H、N453G、Y459F、T460G、T463N、N474Q、 E509L, Q521K/S, K522Y/F/N, T524S, P528L, S546R and P564G/L/M.
Engineered polypeptide of the project 5. as described in any one of project 1-4, wherein the amino acid residue difference be selected from Under one or more combinations: A39, A91, Q240, A256, N290, Y304, R305, H307, D353, A394, S399, P404, L407, Q521, K522 and T524.
Engineered polypeptide of the project 6. as described in any one of aforementioned project, wherein the engineered polypeptide has and SEQ The sequence identity of ID NO:4 at least about 90%;With the amino acid residue difference at the H307 of position.
Engineered polypeptide of the project 7. as described in project 6, wherein the amino acid residue difference is H307G/Q/M.
Engineered polypeptide of the project 8. as described in any one of aforementioned project, wherein the improved characteristic is selected from reduction Sensibility and/or the increased tolerance to acid pH to proteolysis.
Engineered polypeptide of the project 9. as described in any one of aforementioned project, wherein the reference sequences are derived from changeable fish The wild type PAL of raw meat algae (Anabaena variabilis).
Project 10. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide packet Contain: a) amino acid sequence, the amino acid sequence have and reference sequences SEQ ID NO:10 or its function fragment at least 85% Sequence identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:10 or its function fragment at one Or more at least one amino acid residue difference at amino acid position;And c) engineered polypeptide refers to sequence with described Column SEQ ID NO:10 compare shows improved characteristic selected from the following: i) enhance catalytic activity, ii) reduce to albumen The sensibility of hydrolysis, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), Ii), iii), iv) or any combination v).
Project 11. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, and the engineered polypeptide includes Amino acid sequence with the sequence identity with reference sequences SEQ ID NO:10 at least 85%, and with SEQ ID NO:10 phase At least one amino acid residue difference of ratio, and the engineered polypeptide is shown compared with SEQ ID NO:10 selected from following The improved characteristic of at least one: the catalytic activity of enhancing, reduce to the sensibility of proteolysis, increased to acid pH Tolerance, reduction aggregation, and/or reduced immunogenicity.
Engineered polypeptide of the project 12. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set:
I27E/V39A;I27E/V39A/R43L/V105C/A153R/L214E/P266H/L278D/C503Q;
I27E/V39A/R43L/L214E/A547D;
I27E/V39A/V105C/A112C/R134Q/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/V105C/A112C/R134Q/A153R/Q205T/L214E/P266H/L278D/C503Q/ A551D;
I27E/V39A/V105C/A112C/Q205T/P210C/P266H/C503Q/A547D;
I27E/V39A/V105C/A112C/Q205T/P266H/I285E/C503Q/A551D;
I27E/V39A/V105C/A112C/L214E/I285E/C503Q/A547D;
I27E/V39A/V105C/S131N/R134Q/Q205T/L214E/C503Q/A547D/C565N;
I27E/V39A/V105C/R134Q/A153R/P210C/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/Q205T/P210C/L278D/C503Q/A547D;
I27E/V39A/V105C/R134Q/Q205T/L214E;
I27E/V39A/V105C/R134Q/Q205T/L214E/A551D/C565N;
I27E/V39A/V105C/R134Q/Q205T/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/R134Q/P210C;I27E/V39A/V105C/R134Q/P210C/L214E;
I27E/V39A/V105C/R134Q/P210C/L214E/I285E/A547D;
I27E/V39A/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V39A/V105C/R134Q/L214E/L278D/A547D/A551D;
I27E/V39A/V105C/R134Q/L214E/I285E/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/P266H/C503Q;
I27E/V39A/V105C/R134Q/P266H/C503Q/A547D/A551D;
I27E/V39A/V105C/R134Q/L278D/C503Q/C565N;
I27E/V39A/V105C/R134Q/L278D/I285E/C503Q;I27E/V39A/V105C/R134Q/L278D/ A551D;
I27E/V39A/V105C/R134Q/I285E/A547D/A551D;I27E/V39A/V105C/R134Q/C503Q/ A551D;
I27E/V39A/V105C/A153R/Q205T/L278D/C503Q/A547D/A551D;
I27E/V39A/V105C/A153R/L214E;I27E/V39A/V105C/A153R/I285E;
I27E/V39A/V105C/A153R/C503Q/A547D/C565N;I27E/V39A/V105C/A153R/A551D/ C565N;
I27E/V39A/V105C/Q205T/P210C/L214E/L278D/A547D;
I27E/V39A/V105C/Q205T/P210C/L278D/C503Q;
I27E/V39A/V105C/Q205T/P210C/L278D/A547D;
I27E/V39A/V105C/Q205T/L214E/L278D/C503Q/A547D;
I27E/V39A/V105C/Q205T/L278D/C503Q/A547D;
I27E/V39A/V105C/P210C/I285E/C503Q/A547D/A551D/C565N;
I27E/V39A/V105C/P210C/L214E/P266H/L278D;
I27E/V39A/V105C/L214E/P266H/C503Q/A547D/C565N;
I27E/V39A/V105C/L214E/L278D/L309P/C503Q/A547D/A551D;
I27E/V39A/V105C/L278D/C503Q/A547D/C565N;I27E/V39A/V105C/I285E/A547D;
I27E/V39A/V105C/C503Q/A551D;I27E/V39A/V105C/C503Q/A547D/A551D/C565N;
I27E/V39A/A112C/R134Q/Q205T/P210C/L214E/A551D/C565N;
I27E/V39A/A112C/R134Q/L214E/P266H/A551D;
I27E/V39A/A112C/R134Q/L214E/C503Q/A547D;I27E/V39A/A112C/R134Q/P266H/ 1285E;
I27E/V39A/A112C/Q205T/L214E/P266H/C503Q/A551D/C565N;
I27E/V39A/A112C/Q205T/L278D/I285E;I27E/V39A/A112C/L214E;
I27E/V39A/A112C/L214E/L278D/C503Q/A547D/A551D;I27E/V39A/A112C/I285E;
I27E/V39A/A112C/A547D;I27E/V39A/R134Q;
I27E/V39A/R134Q/A153R/Q205T/L214E/P266H/C503Q;
I27E/V39A/R134Q/A153R/P210C/L214E/L278D/I285E/A547D/C565N;
I27E/V39A/R134Q/A153R/L214E/P266H/L278D/C503Q/A547D/C565N;
I27E/V39A/R134Q/A153G/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/A153R/L214E/C503Q/A547D;I27E/V39A/R134Q/A153R/L278D;
I27E/V39A/R134Q/A153R/L278D/A547D/A551D;I27E/V39A/R134Q/A153R/A547D;
I27E/V39A/R134Q/Q205T/L214E/P266H/I285E/C503Q/A551D/C565N;
I27E/V39A/R134Q/Q205T/P266H/C503Q/A551D/C565N;
I27E/V39A/R134Q/P210C/L214E/C503Q;I27E/V39A/R134Q/P210C/C503Q/A551D;
I27E/V39A/R134Q/L214E/P266H/A551D;
I27E/V39A/R134Q/L278D/I285E/C503Q/A547D/A551D;
I27E/V39A/R134Q/L278D/C503Q/A547D;I27E/V39A/R134Q/C503Q/A547D;
I27E/V39A/R134Q/A547D/C565N;I27E/V39A/Q205T/L214E/C503Q/C565N;
I27E/V39A/Q205T/P266H/I285E/A547D/A551D/C565N;I27E/V39A/Q205T/P266H/ A551D;
I27E/V39A/Q205T/L278D/C503Q/A551D/C565N;I27E/V39A/Q205T/L278D/C503Q/ C565N;
I27E/V39A/Q205T/C503Q/A547D/C565N;I27E/V39A/P210C/T212S;
I27E/V39A/P210C/L214E/L278D/C503Q/A551D;I27E/V39A/P210C/L214E/I285E/ C503Q/A551D;
I27E/V39A/P210C/P266H/I285E/C503Q/A547D;I27E/V39A/P210C/P266H/C503Q/ A551D;
I27E/V39A/L214E;I27E/V39A/L214E/P266H/L278D/C503Q/A547D/A551D/C565N;
I27E/V39A/L214E/L278D/C503Q;I27E/V39A/L214E/L278D/C503Q/A547D/C565N;
I27E/V39A/L214E/C503Q/A551D;I27E/V39A/P266H;I27E/V39A/P266H/L278D;
I27E/V39A/L278D;I27E/V39A/L278D/A547D;I27E/V39A/L278D/I285E/C503Q/ A547D;
I27E/V39A/L278D/C503Q/C565N;I27E/V39A/C503Q;I27E/G45D/Q205T/P266H/ C565N;
I27E/V105C;I27E/V105C/R134Q/A153R/P210C/L214E/C503Q/A547D;
I27E/V105C/R134Q/A153R/I285E/A547D;I27E/V105C/R134Q/A153R/C503Q;
I27E/V105C/R134Q/Q205T/P210C/C503Q;
I27E/V105C/R134Q/Q205T/L214E/P266H/L278D/C503Q/C565N;
I27E/V105C/Q205T/P266H/C503Q;
I27E/V105C/R134Q/P210C/L214E/P266H/L278D/A551D/C565N;
I27E/V105C/R134Q/P210C/L214E/C503Q/A551D/C565N;
I27E/V105C/R134Q/P210C/P266H/L278D/I285E/C503Q/A551D/C565N;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D;
I27E/V105C/R134Q/L214E/L278D/C503Q/A547D/A551D/C565N;I27E/V105C/ Q205T;
I27E/V105C/Q205T/L214E/P266H;I27E/V105C/Q205T/L214E/P266H/A551D/ C565N;
I27E/V105C/Q205T/L214E/L278D/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/Q205T/C503Q/A547D/A551D/C565N;I27E/V105C/L214E;
I27E/V105C/L214E/P266H/C503Q;I27E/V105C/L214E/I285E/A551D/C565N;
I27E/V105C/L214E/A547D/A551D/C565N;I27E/V105C/L214E/A551D/C565N;
I27E/V105C/P266H;I27E/V105C/P266H/I285E/C503Q/A547D/C565N;
I27E/V105C/L278D/A547D;I27E/V105C/I285E/C503Q/A547D/A551D/C565N;
I27E/V105C/C503Q/A547D/C565N;I27E/V105C/C503Q/A547D/A551D/C565N;
I27E/A112C/R134Q/A153R/L214E/P266H/C503Q;
I27E/A112C/R134Q/L278D/I285E/C503Q/A551D/C565N;
I27E/A112C/R134Q/Q205T/L278D/C503Q;I27E/A112C/R134Q/Q205T/I285E/ C503Q;
I27E/A112C/Q205T/P266H/L278D/I285E/C503Q;I27E/A112C/P210C/L214E/ C503Q/A547D;
I27E/R134Q;I27E/R134Q/A153R/I285E/C503Q/A547D;
I27E/R134Q/Q205T/I285E/C503Q/A551D;I27E/R134Q/Q205T/P266H/L278D/ A547D;
I27E/R134Q/P210C;I27E/R134Q/L214E/C503Q;I27E/R134Q/L214E/C503Q/A547D;
I27E/R134Q/L214E/C503Q/A547D/A551D;I27E/R134Q/L214E/C503Q/C565N;
I27E/R134Q/L278D/I285E/A551D/C565N;I27E/R134Q/I285E/C503Q;
I27E/A153R/L214E/L278D/I285E/A551D/C565N;I27E/A153R/L214E/L278D/ A551D;
I27E/Q205T;I27E/Q205T/L214E/L278D/I285E/C503Q/C565N;
I27E/Q205T/L214E/C503Q/A547D/C565N;I27E/Q205T/P266H/L278D/I285E/ A551D/C565N;
I27E/Q205T/L278D/A551D;I27E/P210C;I27E/P210C/L214E/C503Q/A547D;
I27E/P210C/L278D/C503Q;I27E/P210C/C503Q;I27E/P210C/C503Q/C565N;I27E/ P210C/A551D;
I27E/L214E;I27E/L214E/P266H/L278D/I285E/A551D;I27E/L214E/L278D;
I27E/L214E/L278D/C503Q;I27E/L214E/C503Q;I27E/L214E/C503Q/A547D;
I27E/L214E/C503Q/A547D/C565N;I27E/L214E/A551D;I27E/P266H/L278D/C503Q;
I27E/P266H/A547D/A551D;I27E/L278D/C503Q/A551D;I27E/L278D/C503Q/A551D/ C565N;
I27E/A547D/C565N;V39A/G45S/L278D/C503Q/A551D;
V39A/V105C/R134Q/A153R/Q205T/A551D;V39A/V105C/R134Q/P210C/L214E/ A551D;
V39A/V105C/R134Q/L214E/C503Q/A547D/A551D;V39A/V105C/A153R/P266H/ A547D/A551D;
V39A/V105C/Q205T/C503Q;V39A/V105C/Q205T/A551D;V39A/V105C/P210C/A547D;
V39A/V105C/L214E/P266H/A547D/C565N;V39A/V105C/L214E/I285E/C503Q/ A551D/C565N;
V39A/A112C/R134Q/Q205T/L214E/L278D;V39A/A112C/R134Q/L214E/C503Q/ A547D/A551D;
V39A/A112C/A153R/Q205T/L278D/C503Q/A547D;V39A/R134Q;
V39A/R134Q/Q205T/L214E/C503Q/C565N;V39A/R134Q/P210C/L214E/A547D/ C565N;
V39A/A153R/C503Q/A547D;V39A/Q205T/L278D/A547D/A551D;
V39A/P210C/L214E/L278D/I285E/C503Q/A551D;V39A/P266H;
V39A/P275R/L278D/C503Q/A551D;V39A/C503Q;V39A/C503Q/A551D/C565N;V105C;
V105C/A112C/R134Q/Q205T/L214E/Y492H/C503Q/A547D;
V105C/R134Q/A153R/Q205T/L214E/C503Q;V105C/R134Q/Q205T/L214E/A547D;
V105C/R134Q/Q205T/P266H/L278D;V105C/R134Q/L214E/P266H/I285E/C503Q/ A551D/C565N;
V105C/R134Q/L214E/L278D/C565N;V105C/R134Q/L214E/C503Q/A547D;
V105C/R134Q/L214E/C503Q/A547D/A551D;V105C/R134Q/C503Q;
V105C/R134Q/C503Q/A547D;V105C/R134Q/C503Q/A547D/C565N;
V105C/A153R/Q205T/L214E/P266H/C503Q/A547D;
V105C/A153R/Q205T/P266H/I285E/A547D/C565N;V105C/Q205T/P210C/L214E/ C503Q/A547D;
V105C/Q205T/L214E/L278D;V105C/Q205T/L214E/C503Q/A547D/A551D/C565N;
V105C/Q205T/C503Q/A551D;V105C/L214E/P266H/L278D/A547D;
V105C/L214E/L278D/C503Q/A547D/A551D;V105C/L214E/I285E;
V105C/L214E/I285E/C503Q/A547D/A551D/C565N;V105C/L214E/I285E/A547D/ C565N;
V105C/L278D/C503Q/A551D;V105C/I285E;V105C/I285E/A547D;V105C/C503Q;
V105C/A547D/A551D;A112C/R134Q/A153R/L214E/L278D/I285E/C503Q/A547D/ A551D/C565N;
A112C/R134Q/L214E/C503Q/A547D/A551D/C565N;A112C/L214E/L278D;
A112C/L278D/C503Q/A547D;R134Q/Q205T/L214E/I285E/C503Q/A551D/C565N;
R134Q/Q205T/C503Q;R134Q/P210C/L214E/L278D/C503Q/A547D/C565N;
R134Q/P210C/L214E/C503Q/A547D/A551D;R134Q/L214E;R134Q/L214E/L278D/ C503Q;
R134Q/L214E/L278D/C503Q/A551D;R134Q/L214E/I285E/C503Q;R134Q/C503Q;
R134Q/C503Q/A547D/A551D;A153R;Q205T/L214E/I285E/C503Q/A551D;
Q205T/L214E/I285E/C503Q/C565N;Q205T/L214E/C503Q/A547D/C565N;
Q205T/L278D/I285E/A547D/A551D;P210C/L214E;P210C/L214E/P266H;L214E/ P266H;
L214E/P266H/C503Q/A547D/A551D/C565N;L214E/C503Q/A547D;L214E/A547D;
P266H/L278D/C503Q;P266H/C565N;L278D/A547D;C503Q;C503Q/A547D;
C503Q/A547D/A551D/C565N;C503Q/A547D/C565N;C503Q/A551D;C503Q/A551D/ C565N;
A547D;And/or C565N.
Engineered polypeptide of the project 13. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: V80I/R134C/P564Q;V121C; A123G;A124G;M125L;L126I/T;L126M/R134L;L127A;A129G/L;N130Q;N130C/M370I R134W; M133R;R134I;R134N/G307C;G135C/S;And/or G135A/A394E.
Engineered polypeptide of the project 14. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: G20S/I144L;R43S;L47M/ I144L;L47M/R146E;L47M/M147G/A383E;L47M/P157C;Q58H/L143V;Q58K/P157D/G369C; A62S/M147V;S82I/G135C/P157F/W279L;R94C/I149E;T110I/I139R;L118M/L141H;A119E/ T156H/A289D;I139M/V;R140D/G/M;R140N/A199E;R140E/A334S/A551D;L141K/Q/P/T; E142H/P/V;E142D/G371D;L143F/M;I144L/N/V;K145N/Q/R;K145G/P157T;R146H/L;R146W/ D191Y;M147A;I149L/R;F150K/L/M;L151M;A153C/G;A153S/H250N;G154R;G154Y/L174M/ Q321K/S456I/G483C;T156K/G483C;P157D/F/H/Y;Y158E;V159C/H/L/M;M247I;L319M;And/or Q389K。
Engineered polypeptide of the project 15. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: P117T/Y176Q;V172I/C/L; L174M;S175G;Y176E/I/M/R/V;I177M/V;T178L/A477S;And/or S180C/T.
Engineered polypeptide of the project 16. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: R43S/H374K;R43S/H374R; A112S/M370A/A507E;M147I/H374S;S187R/L381V;D191Y/H385N;A232S;Q240K/H374R; A256S/L381N;P275Q/M370S;P275T/H374R;Q332K/Y377M;A334S/H374V;L349M;Q355K/ H374S;M370G/I/S;G371H/N/Q/S;M372A/V;H374A/D/G/L/N/R/S/T;H374Q/P396Q;H374R/ G417C;L375I;L375M;Y377C/I/N;Y378C/D/E/I/L/N/S;Y378F/P404Q;I379C/H/L/M/N; L381G/V;L381M/Q560K;L382C/H/I/M/S;A383S/V;K384R;H385C/G/N;H385M/P403H;H385S/ P403H;D387S;L418M;G425V;A447S;S461G;And/or S525L.
Engineered polypeptide of the project 17. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: A24S/F434M;A62S/T433N; S98I;L213M/S438L;Q240K/T433Y;S286R/Y435T;A289S/L431E;S331I;L431C/E/G/P/S/V; L432C/V;T433A/I/L/N/P/Q/R/S/V/W;F434C;Y435L;Y435Q/H446N;G436M;G436D/T;N437E/ G/Q;N437T/L538M;S438C/F/M/R/T;I439C/F/L/V;And/or A477S.
Engineered polypeptide of the project 18. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: A24E;Q58R/Y475H;A70S/ N474E;L104M/V476L;A119E/G365A;L206M;P275Q;G276V;Q292H/A479G;Q355H/I478C; P404T/A477V;I471F/G/K/M/N/R/V/W;F472G;Q473H/K/M/R/S;Q473H/A507S;N474A/H/R/W; N474D/R490H;Y475C/F/L/Q;V476C/I/L;I478N/S;A479G/S;F482C/L;G483C/H/S;G483A/ S524I;G483R/G537C;And/or A558S.
Engineered polypeptide of the project 19. as described in project 10 and/or 11, wherein the ammonia compared with SEQ ID NO:10 Base acid residue difference is one or more in following displacement or displacement set: V39A/K115E/M133R/C565N; V39A/M133R/F472G/C503Q/C565N;V39A/M133R/F472G/C565N;V39A/M133R/C503Q;V39A/ M133R/C503Q/C565N;V39A/M147A/Y378E/C503Q/C565N;V39A/M147A/Y378E/C565N;V39A/ M147A/L381G/F472G/C503Q/C565N;V39A/M147A/L381G/C503Q/C565N;V39A/M147A/F472G/ C503Q/C565N;V39A/M147A/F472G/C565N;V39A/M147A/C565N;V39A/G248C/L381G/F472G/ C503Q/C565N;V39A/Y378E/C503Q/C565N;V39A/Y378E/C565N;V39A/L381G;V39A/F472G/ C503Q/C565N;V39A/C503Q/C565N;M133R/L381G/C565N;M133R/C503Q;Y378D/C503Q;Y378E/ F472G/C503Q/C565N;L381G/F472GC503Q/C565N;And/or F472G/C503Q/C565N.
Project 20. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide packet Contain: a) amino acid sequence, the amino acid sequence have and reference sequences SEQ ID NO:26 or its function fragment at least 85% Sequence identity;B) polypeptide sequence, the polypeptide sequence include compared with SEQ ID NO:26 or its function fragment at one Or more at least one amino acid residue difference at amino acid position;And c) engineered polypeptide refers to sequence with described Column SEQ ID NO:26 compare shows improved characteristic selected from the following: i) enhance catalytic activity, ii) reduce to albumen The sensibility of hydrolysis, iii) the increased tolerance to acid pH, iv) aggregation of reduction, the immunogenicity that v) reduces or i), Ii), iii), iv) or any combination v).
Project 21. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, and the engineered polypeptide includes With with the sequence identity of reference sequences SEQ ID NO:26 at least 85% and at least one compared with SEQ ID NO:26 The amino acid sequence of amino acid residue difference, and the engineered polypeptide is shown compared with SEQ ID NO:26 selected from following The improved characteristic of at least one: the catalytic activity of enhancing, reduce to the sensibility of proteolysis, increased to acid pH Tolerance, reduction aggregation, and/or reduced immunogenicity.
Engineered polypeptide of the project 22. as described in project 20 and/or 21, wherein the ammonia compared with SEQ ID NO:26 Base acid residue difference is one or more in following displacement or displacement set: A24E/G381L;L127V;A129I/V; S131C/T;H132L/S;R134C/F/H/K;R134H/Y378E/G381L;R134H/Y378E/G381L/V388T;R134H/ V388T;A136K;A289S;M372L;H374G/M/Q;G381A/C/F/I/L/M/N/Q/S/T;A383C/M;V388C/T; L431M;And/or L563M.
Engineered polypeptide of the project 23. as described in any one of aforementioned project, wherein the engineered polypeptide has and ginseng Examine the sequence identity of sequence SEQ ID NO:4 at least about 90%.
Engineered polypeptide of the project 24. as described in project 23, wherein the engineered polypeptide has and reference sequences SEQ The sequence identity of ID NO:4 at least about 95%.
Project 25. is a kind of to have phenylalanine lyase (PAL) active engineered polypeptide, and the engineered polypeptide includes With the sequence with SEQ ID NO:6,8,10,12,14,16,18,20,22,24, and/or 26 or its function fragment at least about 90% The amino acid sequence of column identity.
Project 26. has phenylalanine lyase (PAL) active engineered polypeptide as described in project 25, comprising having It is same with the sequence of SEQ ID NO:6,8,10,12,14,16,18,20,22,24, and/or 26 or its function fragment at least about 99% The amino acid sequence of one property.
Project 27. has phenylalanine lyase (PAL) active engineered polypeptide, wherein the engineered polypeptide is table The variant PAL provided in any of 2-1 to 2-5 and/or table 9-1 to 9-7.
Project 28. has phenylalanine lyase (PAL) active engineering more as described in any one of project 1-27 Peptide, wherein the engineered polypeptide is Anabaena variabilis enzyme.
Project 29. has phenylalanine lyase (PAL) active engineering more as described in any one of project 1-28 Peptide, wherein the engineered polypeptide is heat-staple.
Project 30. has phenylalanine lyase (PAL) active engineering more as described in any one of project 1-29 Peptide, wherein the engineered polypeptide tolerance protein hydrolyzes.
Project 31. has phenylalanine lyase (PAL) active engineered polypeptide as described in project 30, wherein described Engineered polypeptide tolerance passes through the proteolysis of at least one digestive enzyme.
Project 32. has phenylalanine lyase (PAL) active engineered polypeptide as described in project 31, wherein described The proteolysis that engineered polypeptide tolerance passes through chymotrypsin, trypsase, carboxypeptidase and/or elastoser.
Project 33. has phenylalanine lyase (PAL) active engineering more as described in any one of project 1-32 Peptide, wherein the engineered polypeptide is that acid is stable.
Engineered polypeptide of the project 34. as described in any one of project 1-33, wherein the engineered polypeptide is gone to be immunized Change.
Deimmunized engineered polypeptide of the project 35. as described in project 34, wherein the polypeptide includes to have and SEQ The amino acid sequence of the sequence identity of ID NO:6,8,10,12,14,16,18,20,22,24 and/or 26 at least 95%.
Engineered polypeptide of the project 36. as described in any one of project 1-35, wherein the polypeptide is purifying.
In a kind of polynucleotide sequence of project 37., the polynucleotide sequence coding at least one such as aforementioned project 1-36 Described in any item engineered polypeptides.
Polynucleotide sequence of the project 38. as described in project 37, wherein the sequence is operably connected to control sequence Column.
Polynucleotide sequence of the project 39. as described in project 37 and/or 38, wherein the polynucleotides are codon optimizations 's.
A kind of expression vector of project 40., the expression vector include at least one as described in any one of project 37-39 Polynucleotide sequence, and at least one control sequence.
Expression vector of the project 41. as described in project 40, wherein the control sequence is promoter.
Expression vector of the project 42. as described in project 41, wherein the promoter is allogeneic promoter.
A kind of host cell of project 43., the host cell is at least one more as described in any one of project 35-37 Carrier described in any one of nucleotide sequence and/or project 40-42 converts.
Host cell of the project 44. as described in project 43, wherein the host cell is Escherichia coli (E.coli).
A kind of method that engineering PAL polypeptide is generated in host cell of project 45., the method includes in suitable training Culture has phenylalanine lyase (PAL) as described in any one of project 1 to 36 comprising encoding at least one under the conditions of supporting At least one polynucleotides of active engineered polypeptide, and/or at least one polynucleotides as described in project 37 and/or 39 The host cell of sequence, and/or at least one carrier as described in any one of project 40-42, so that generating at least one work Journey PAL polypeptide.
Method of the project 46. as described in project 45, the method also includes recycling from the culture and/or host cell At least one has the engineered polypeptide of phenylalanine lyase (PAL).
Method of the project 47. as described in project 46, the method also includes at least one that purifying generates to have benzene The step of engineered polypeptide of alanine aminonialyase (PAL).
A kind of composition of project 48., the composition include at least one having as described in any one of project 1 to 36 Phenylalanine lyase (PAL) active engineered polypeptide.
Composition of the project 49. as described in project 48, wherein the composition is pharmaceutical composition.
Pharmaceutical composition of the project 50. as described in project 49, described pharmaceutical composition also include that at least one pharmaceutically may be used The excipient and/or carrier of receiving.
Composition of the project 51. as described in any one of project 48-50, wherein the composition is suitable for phenylketonuria Treatment.
Pharmaceutical composition of the project 52. as described in any one of project 49-51, wherein the composition is suitable for oral apply With to people.
Pharmaceutical composition of the project 53. as described in any one of project 49-52, wherein the composition in pill, tablet, Capsule, caplets, the form of liquid or emulsion.
Pharmaceutical composition of the project 54. as described in project 53, wherein the pill, tablet, capsule or caplets also include Enteric coating.
Pharmaceutical composition of the project 55. as described in any one of project 49-51, wherein the composition is suitable for parenteral It is injected into people.
Pharmaceutical composition of the project 56. as described in any one of project 49-55, wherein the composition and at least one are another Outer treatment active compounds co-administer.
Pharmaceutical composition of the project 57. as described in project 56, wherein the composition includes at least one other treatment Active compound.
A kind of method for treating and/or preventing the symptom of phenylketonuria in subject of project 58., the method packet It includes: the subject for suffering from phenylketonuria being provided, and provides the medicine as described in any one of project 49-57 to the subject Compositions.
Method of the project 59. as described in project 58, wherein the symptom of the phenylketonuria is enhanced.
Method of the project 60. as described in project 58 and/or 59, wherein the subject can eat its methionine, Than by being also not provided with comprising at least one as what is illustrated in project 1-36 has in terms of phenylalanine and/or tyrosine content Diet required for the subject of at least one pharmaceutical composition of phenylalanine lyase (PAL) active engineered polypeptide compared with Unrestricted diet.
Method of the project 61. as described in any one of project 58-60, wherein the subject is baby or children.
Method of the project 62. as described in any one of project 58-60, wherein the subject is adult or young.
The purposes of composition provided by any one of 63. project 48-57 of project.
The foregoing and other aspects of the invention can be more fully understood in conjunction with following non-limiting embodiment.Embodiment is provided It is for illustration purposes only and is not intended to limit the scope of the invention in any way.
Experiment
Following embodiment is provided, including testing and obtain as a result, the purpose being merely to illustrate, and is not necessarily to be construed as limiting The system present invention.
In following experiment disclosure, using following abbreviation: ppm (parts per million);M (mole);mM (mM), uM and μ Μ (micromole);NM (nanomole);Mol (mole);Gm and g (gram);Mg (milligram);Ug and μ g is (micro- Gram);L and l (liter);Ml and mL (milliliter);Cm (centimetre);Mm (millimeter);UM and μM (micron);Sec. (second);Min (s) (point Clock);H (s) and hr (s) (hour);U (unit);MW (molecular weight);Rpm (revolutions per minute);Psi and PSI are (per square inch Poundage);DEG C (Celsius temperature);RT and rt (room temperature);CDS (coded sequence);DNA (DNA);RNA (ribose core Acid);(common laboratory E. coli strain, can be from Coli Genetic Stock Center by Escherichia coli W3110 [CGSC], New Haven, CT acquisition);HTP (high throughput);HPLC (high pressure liquid chromatography);CFSE (Fluoresceincarboxylic acid succinyl Imines ester);IPTG (isopropyl ss-D-1- thiogalactoside));PES (polyether sulfone);PHE and phe (phenylalanine);BSA (ox Seralbumin);PBMC (peripheral blood mononuclear cells);PKU (phenylketonuria);MHC (major histocompatibility complex); HLA (human leucocyte antigen (HLA));HLA-DR (by the MHC II class cell surface receptor of the HLA compound coding on No. 6 chromosomes); FIOPC (compared to the improvement multiple of positive control);LB (Luria meat soup);AthensResearch(Athens Research Technology, Athens, GA);ProSpec (ProSpec TANY Technogene, East Brunswick, NJ); Sigma-Aldrich (Sigma-Aldrich, St.Louis, MO);Ram Scientific(Ram Scientific,Inc., Yonkers,NY);Pall Corp.(Pall Corp.,Pt.Washington,NY);Millipore(Millipore Corp.,Billerica MA);Difco (Difco Laboratories, BD Diagnostic Systems, Detroit, MI);Molecular Devices(Molecular Devices,LLC,Sunnyvale,CA);Kuhner(Adolf Kuhner,AG,Basel,Switzerland);Cambridge Isotope Laboratories,(Cambridge Isotope Laboratories, Inc., Tewksbury, MA);Applied Biosystems(Applied Biosystems, part of Life Technologies, Corp., Grand Island, NY), Agilent (Agilent Technologies,Inc.,Santa Clara,CA);Thermo Scientific(Thermo Fisher Scientific A part, Waltham, MA);Corning(Corning,Inc.,Palo Alto,CA);Constant Systems (Constant Systems Ltd.,Daventry,United Kingdom);Megazyme(Megazyme International,Wicklow,Ireland);Enzo(Enzo Life Sciences,Inc.,Farmingdale,NY); GE Healthcare(GE Healthcare Bio-Sciences,Piscataway,NJ);Harlan(Harlan Laboratories,Indianapolis,IN);AB Sciex(AB Sciex,Framingham,MA);With Bio-Rad (Bio- Rad Laboratories, Hercules, CA).
Following polynucleotides and polypeptides sequence is applied to the present invention.(as follows) in some cases, polynucleotides sequence It is encoded polypeptide after column.
The polynucleotide sequence (SEQ ID NO:1) of pET16b-AvPAL expression vector:
The polynucleotide sequence (SEQ ID NO:2) of AvPAL ORF:
The polynucleotide sequence (SEQ ID NO:3) of WT AvPAL:
The polypeptide sequence (SEQ ID NO:4) of WT AvPAL:
The polynucleotide sequence (SEQ ID NO:5) that AvPAL variant 30:
The polypeptide sequence (SEQ ID NO:6) that AvPAL variant 30:
The polynucleotide sequence (SEQ ID NO:7) that AvPAL variant 22:
The polypeptide sequence (SEQ ID NO:8) that AvPAL variant 22:
The polynucleotide sequence (SEQ ID NO:9) that AvPAL variant 36:
The polypeptide sequence (SEQ ID NO:10) that AvPAL variant 36:
The polynucleotide sequence (SEQ ID NO:11) that AvPAL variant 42:
The polypeptide sequence (SEQ ID NO:12) that AvPAL variant 42:
The polynucleotide sequence (SEQ ID NO:13) that AvPAL variant 43:
The polypeptide sequence (SEQ ID NO:14) that AvPAL variant 43:
The polynucleotide sequence (SEQ ID NO:15) that AvPAL variant 1002:
The polypeptide sequence (SEQ ID NO:16) that AvPAL variant 1002:
The polynucleotide sequence (SEQ ID NO:17) that AvPAL variant 1008:
The polypeptide sequence (SEQ ID NO:18) that AvPAL variant 1008:
The polynucleotide sequence (SEQ ID NO:19) that AvPAL variant 1009:
The polypeptide sequence (SEQ ID NO:20) that AvPAL variant 1009:
The polynucleotide sequence (SEQ ID NO:21) that AvPAL variant 1010:
The polypeptide sequence (SEQ ID NO:22) that AvPAL variant 1010:
The polynucleotide sequence (SEQ ID NO:23) that AvPAL variant 1084:
The polypeptide sequence (SEQ ID NO:24) that AvPAL variant 1084:
The polynucleotide sequence (SEQ ID NO:25) that AvPAL variant 967:
The polypeptide sequence (SEQ ID NO:26) that variant 967:
Expression vector pCK100900i:
Embodiment 1
PAL gene obtains and the building of expression vector
Anabaena variabilis phenylalanine lyase (AvPAL) Plasmid DNA is obtained, and the synthesis gene for encoding AvPAL is close Numeral optimization in expression in escherichia coli, and clone into coli expression carrier pET16b to provide pET16b- AvPAL(SEQ ID NO:1).AvPAL open reading frame (SEQ ID NO:2) uses oligonucleotides: PAL-pCK-F and PAL- PCK-R is expanded by PCR, and is subcloned into expression vector pCK100900i (SEQ ID NO:27).
The Plasmid Constructs are transformed into the coli strain from W3110.It is usual using those skilled in the art Known directed evolution technologies generate genetic mutation library (see, for example, U.S. Patent number 8,383,346 from the Plasmid Constructs And WO2010/144103).
Embodiment 2
High-throughput (HTP) growth and measurement
The high throughput (HTP) of PAL and PAL variant is grown
The Bacillus coli cells of conversion are put down by being laid in the LB agar comprising 1% glucose and 30 μ g/ml chloramphenicol It is selected on plate.After being incubated overnight at 37 DEG C, bacterium colony is put into NUNCTM(Thermo-Scientific) 96 holes are shallowly flat In the hole of plate, which has the LB of 1% glucose and 30 μ g/ml chloramphenicol filled with the supplement in 180 holes μ l/.It will Culture (200rpm, 30 DEG C and 85% relative humidity overnight growth 18-20 hours in shaking table;Kuhner).By overnight growth Sample (20 μ L) be transferred to 96 hole Costar that the supplement filled with 380 μ L has the Terrific meat soup of 30 μ g/ml chloramphenicol In deep plate.Plate is incubated for 135 minutes (250rpm, 30 DEG C and 85% relative humidity in shaking table;Kuhner).Then by cell With the 10mM IPTG induction in 40 μ L sterile waters, and be incubated overnight in shaking table 20-24 hours (250rpm, 30 DEG C and 85% Relative humidity;Kuhner).Two repetition cultures are merged, makes cell precipitation (4000rpm x 20min), supernatant is abandoned It goes, and before analysis by cell freezing at -80 DEG C.
The cracking of HTP sediment
Firstly, by 500 μ L lysis buffer (20mM Tris pH 7.5,1mM MgSO4, 1mg/ml lysozyme and 0.5mg/ml Polymyxin B sulfate) it is added to cell precipitate.1.5h is stirred at room temperature in mixture, and is precipitated (4000rpm x 5min) then uses clear lysate in a variety of HTP measurements described herein.These lysates are logical Cross SDS-PAGE's analysis shows that, the albumen of overexpression exists with the apparent MW of~60kDa, consistent with the desired MW of PAL.
The analysis of clear lysate
PAL Variant Activity is determined by measuring the formation of cinnamic acid, is such as changed with time by absorbance at 290nm Determining.For the measurement, by 100 μ L 200mM Tris/50mM phenylalanine pH 7.5 or 200mM sodium phosphate/50mM benzene 7.0,80 μ L water of alanine pH and the 20 clear lysates of μ L be added to poly- 96 orifice plate of acrylate (Costar#3635, Corning in hole).By the of short duration mixing of reactant, and activity uses Plus384Or(Molecular Devices) light absorption microplate reader passes through (every 12- over time at 290nm 20s is primary, by 5-20min) absorbance is tracked to determine.
It is analyzed with the HTP- of the pretreated clear lysate of protease
PAL variant chymotrypsin and trypsase stimulate, to simulate lower intestinal tract (lower intestine) ring Border.Firstly, by 30 μ L proteinase mixtures (0.01-100mg/ml chymotrypsin (C4129Sigma Aldrich), 0.01-100mg/ml trypsase (T7409Sigma Aldrich), 1mM CaCl2With 1mM HCl), 0-30 μ L in 500mM 20mM natrium taurocholicum and the clear lysate of 90-120 μ L in sodium phosphate pH 7.0 are added to 96 hole round bottom plate (Costar# 3798, Corning) hole.Plate is sealed, and is incubated for 1h at 37 DEG C, 400rpm, 1 " throw, is then analyzed.For the survey It is fixed, by the 200mM Tris/50mM phenylalanine pH 7.5 or 200mM sodium phosphate of 100 μ L/7.0 He of 50mM phenylalanine pH The lysate of 100 μ L Protease Treatments is added to the hole of poly- 96 orifice plate of acrylate (Costar#3635, Corning).It should The of short duration mixing of reactant, and activity uses Plus384Or(Molecular Devices) light absorption microplate reader is by the way that (every 12-20s is primary, by 5-20min) tracks extinction over time at 290nm Degree is to determine.As a result it provides in the following table.
With the HTP- analysis of the clear lysate of acid processing
In the measurement, PAL variant is stimulated in acid condition, to simulate gastric environment.Firstly, by the 1M lemon of 20 μ L Sour sodium (pH 3.7-4.5) and 30 μ L water or the clear lysate of 50 μ L 400mM sodium citrate pH 3.7-4.8 and 50uL are added To the hole of 96 hole round bottom plates (Costar#3798, Corning).Plate is sealed, and is incubated at 37 DEG C, 400rpm, 1 " throw Then 1h is analyzed.For the measurement, by 100 μ L 200mM Tris, 50mM phenylalanine pH 7.5 and 80 μ L 1M Tris pH The cracking of 7.5 or 200mM sodium phosphate/50mM phenylalanine pH 7.0 and 80 μ L1.0M sodium phosphate pH 7.0 and the processing of 20 μ L acid Object is added to the hole of poly- 96 orifice plate of acrylate (Costar#3635, Corning).By the of short duration mixing of the reactant, and live Property use Plus384Or(Molecular Devices) light absorption microplate reader is logical (every 12-20s is primary, by 5-20min) tracking absorbance over time is crossed at 290nm to determine.As a result in the following table It provides.
It is analyzed with the HTP of the pretreated clear lysate of pepsin
In another measurement, PAL variant acid condition and pepsin are stimulated, under conditions of simulating gastric environment Further test variant.Firstly, the 0.01-100mg/ml pepsin by 50 μ L in 400mM sodium citrate pH1.5-4, and The 50 clear lysates of μ L are added to the hole of 96 hole round bottom plates (Costar#3798, Corning).Plate is sealed, and 37 DEG C, It is incubated for 1-12h under 400rpm, 1 " throw, is then analyzed.For the measurement, by 100 μ L 200mM Tris/50mM phenylalanines What the μ L 1M Tris of pH 7.5 and 80 pH 7.5 or 200mM sodium phosphate/50mM phenylalanine pH 7.0 and 20 μ L acid were handled splits Solution object is added to the hole of poly- 96 orifice plate of acrylate (Costar#3635, Corning).By the of short duration mixing of the reactant, and Activity uses Plus384Or(Molecular Devices) light absorption microplate reader By the way that (every 12-20s is primary, by 5-20min) tracks absorbance and determine over time at 290nm.
1. the activity that relative activity is calculated as activity/SEQ ID NO:4 (being encoded by SEQ ID NO:3) of variant.
2. the polypeptide sequence of variant 22 polynucleotide sequences and SEQ ID NO:8 with SEQ ID NO:7, and variant The polypeptide sequence of No. 30 polynucleotide sequences and SEQ ID NO:6 with SEQ ID NO:5.
The relative activity relative to wild type AvPAL of 3.+=0.1 to 1.0;
The relative activity relative to wild type AvPAL of ++=> 1.0 to 2.0;With
The relative activity relative to wild type AvPAL of +++=> 2.0.
1. activity/variant 30 activity that relative activity is calculated as variant.
2. the polypeptide sequence of variant 36 polynucleotide sequences and SEQ ID NO:10 with SEQ ID NO:9.
3.+=> 1.0 to 3.0 relative to variant 30 relative activities;
++=> 3.0 to 10 relative to variant 30 relative activities;With
+++=> 10 to 35 relative to variant 30 relative activities.
1. activity/variant 36 activity that relative activity is calculated as variant.
2. the polypeptide sequence of variant 42 polynucleotide sequences and SEQ ID NO:12 with SEQ ID NO:11, variant The polypeptide sequence of No. 43 polynucleotide sequences and SEQ ID NO:14 with SEQ ID NO:13.
3.+=> 0.5 to 1.5 relative to variant 36 relative activities;
++=> 1.5 to 3 relative to variant 36 relative activities;With
+++=> 3 to 10 relative to variant 36 relative activities.
1. activity/variant 30 (variant 53-73) or SEQ ID NO:4 that relative activity is calculated as variant are (by SEQ ID NO:3 coding) (variant 74-83) activity.
2.+=> 0.5 to 1.5 relative to variant 30 relative activities;++=> 1.5 to 3 relative to variant 30 Relative activity;With +++=> 3 to 10 relative to variant 30 relative activities.
1. activity/variant 42 activity that relative activity is calculated as variant.
2.- < 0,5 relative to variant 42 relative activities;+=> 0.5 to 1.5 it is opposite relative to variant 42 Activity;++=> 1.5 to 3 relative to variant 42 relative activities.
Embodiment 3
Determine the measurement of the albumen aggregation of PAL variant
The tendency of PAL variant aggregation uses according to the manufacturer's instructionsAlbumen assembles assay kit (Enzo) it determines.Briefly, by the PAL of 0-100 μM of purifying withDetection reagent (1:2000) mixing, and Pass through flow cytometry.WithClustering criteria product consistently evaluate fluorescent, and as known in the art (see, for example, Bershtein etc., Mol.Cell, 133-144 [2013]).
Embodiment 4
The lysate of freeze-drying from shaking flask (SF) culture
The HTP culture of the selection grown as described above is laid in mould with 1% glucose and 30 μ g/ml chlorine On the LB agar plate of element, and grown overnight at 37 DEG C.Single colonie from each culture is transferred to 50ml with 1% The LB of glucose and 30 μ g/ml chloramphenicol.Culture grows 18h, and being diluted in about 1:10 at 30 DEG C, 250rpm 250ml has subculture in the Terrific meat soup of 30 μ g/ml chloramphenicol, until final OD600 is 0.2.By culture 30 DEG C, It is incubated for 135 minutes under 250rpm, until OD 600 is 0.6, and is induced with 1mM IPTG.The culture of induction is in 30 DEG C, 250rpm Lower incubation 20h.After the incubation period, by culture with 4000rpm x 10min centrifugation.Liquid is discarded supernatant, by sediment weight It is suspended from the sodium phosphate of 50mM pH 7.5 of 30ml.Make cell precipitation (4000rpm × 10min), is resuspended in 12ml 50mM pH In 7.5 sodium phosphate, and using One Shot Cell Disruption system (Constant Systems) with 17000psi cracking.So that lysate is precipitated (10,000rpm × 30min), and supernatant is freezed and is dried to generate containing enzyme Powder.
The purifying of PAL from diastatochromogenes
Saturation is grown in diastatochromogenes by PAL variant 42, as previously discussed.By the culture of saturation by from The heart (4000rpm × 20min) precipitating, and cell precipitate is stored in -80 DEG C before purification.By cell precipitate in room temperature Lower defrosting, and be resuspended in the 25mM Tris with 130mM NaCl pH 8 with 5mL buffer/g cell.Sample slurries use The microfluidization device (microfluidizer) of pressure setting with 110psi cracks.By gained lysate by with 10, 000rpm is centrifuged 1 hour, is then clarified by 0.2 μm of PES filter (Millipore) filtering.
After filtering, under the existence or non-existence of 10mM Phe, gained lysate is heated into 1.5-2 at 70 DEG C -85 DEG C Hour.Lysate is removed from heat source, and is centrifuged 1 hour with 10,000rpm at 4 DEG C to clarify.Then it will include solubility The supernatant of PAL is then loaded into chromatographic column by 0.2 μm of PES filter filtering.
The lysate of the thermally treated filtering comprising 80-100mg total protein is used with 1.2M ammonium sulfate pH 8 25mM Tris dilutes twice.Sample is loaded into the HiPrep pre-equilibrated with the 25mM Tris with 0.6M ammonium sulfate pH 8 On 16/10 phenyl FF (hi sub) column (GE Healthcare).After sample loading, by the identical of column three times column volume Buffer washs, and is then washed with the linear gradient of the 0.6M-0M ammonium sulfate in the 25mM Tris of pH 8 of a column volume It washs.The PAL to combine closely had into 25mM Tris using three times column volume, the isocratic elution of pH 8 is eluted from column.It will packet Fraction containing activity and pure PAL is collected.
The PAL buffer-exchanged of purifying from phenyl column is entered in the 0.5M Tris of pH 8.5, and is concentrated.Concentration PAL is analyzed by SDS-PAGE, and is found with~60kDa band presence.The PAL sample of purifying uses 0.45 μm of PES filter It filters, and is stored at -80 DEG C until preparing to use.
Embodiment 5
The characterization of the PAL and PAL variant of purifying
In the present embodiment, the measurement for being intended to characterize wild type and variant PAL of progress is described.
To the tolerance of acid pH:
The powder of freeze-drying comprising PAL variant is dissolved in the 20mM sodium phosphate of pH 7.0 with 2g/L.Then, by 50 μ L Enzyme solutions are mixed with 50 μ L 400mM citric acids (pH 4.0-5.2) or 100mM sodium phosphate, and by reactant 37 DEG C with 400rpm (1 " throw) is incubated for 1h.Then, by 20 μ L solution and 80 μ L 1M sodium phosphate pH 7.0 and 100 μ L 200mM Tris/ The of short duration mixing of 50mM phenylalanine pH 7.5.Enzymatic activity in acid condition uses Plus384Or(Molecular Devices) light absorption microplate reader passes through (every 12- over time at 290nm 20s is primary, by 5-20min) absorbance is tracked to determine.As a result it is shown in FIG. 2.As shown in Figure 2, with wild type PAL It compares, variant No. 30 and No. 36 keep more activity after the lower incubation of pH about 4 to 4.8.
Measure KM:
In order to which whether the mutation evaluated in variant PAL has had changed PAL variant to the affinity of phenylalanine, open country is determined The Michaelis constant of raw type enzyme and variant 36.Firstly, by 15 μ g/ml PAL and 100 μ in the 100mM Tris pH 8.0 of 100 μ L The 0-32mM phenylalanine in 100mM Tris, pH 8.0 of L is added to poly- 96 hole plate (Costar# of acrylate 3625, Corning) hole.By the of short duration mixing of the reactant, and initial rate uses Plus384Or(Molecular Devices) light absorption microplate reader passes through (every 12- over time at 290nm 20s is primary, by 5-20min) absorbance is tracked to determine.The K of the PAL variant of each testMBy as known in the art Data are fitted to Lineweaver-Burke figure to determine.As a result it is shown in FIG. 3.As shown, the K of wild-type enzymeMFor 74 μM, and the K of variant 36MIt is 60 μM.
Amino acid specificities:
Some phenylalanine lyases show the activity for tyrosine and/or histidine other than phenylalanine.For Whether the mutation that evaluation is present in PAL variant has had changed PAL variant to the specificity of phenylalanine, has rated wild type The activity of enzyme and variant 36 to these three amino acid.Firstly, by 100 μ L in 10mM sodium phosphate pH 7.0 5g/L containing PAL's The 50mM phenylalanine or histidine or 2.5mM tyrosine of freeze-dried powder and 100 μ L in 200mM sodium phosphate pH 7.5 are added to The hole of poly- 96 orifice plate of acrylate (Costar#3635, Corning).By the of short duration mixing of the solution, and initial rate uses Plus384Or(Molecular Devices) light absorption microplate reader by (every 12-20s is primary, by 5-20min) tracks absorbance to determine over time at 290nm.As a result it is shown in FIG. 4. As shown, WT enzyme or the variant 36 detectable activity to histidine or tyrosine are not observed, show that these enzymes are benzene Alanine specificity.
To the tolerance of pig and bovine protein enzyme:
The PAL variant sample prepared as described in Example 4 is dissolved in 100mM sodium phosphate pH 7.0 with 2g/L.It will Porcine trypsin and bovine chymotrypsin (being 100mg) are dissolved in 2ml 100mM sodium phosphate pH 7.0, and in 100mM Continuous 11 2 times of dilutions in sodium phosphate.Then, by 80 μ L PAL variant enzyme solutions and 20 μ L trypsase and chymotrypsin Solution mixing.Reaction mixture is incubated for 1h at 37 DEG C with 400rpm (1 " throw).Then, by 20 μ L reactants and 80 μ L Water and 100 μ L 100mM sodium phosphates, 50mM phenylalanine pH 7.0 are mixed.By the of short duration mixing of every kind of solution, and activity uses Plus384Or(Molecular Devices) light absorption microplate reader by (every 12-20s is primary, by 5-20min) tracks absorbance to determine over time at 290nm.As a result it is shown in FIG. 2. As shown in the figure, the variant of all tests shows improved protease tolerance, No. 36 needles of variant compared with wild type PAL It is most stable to proteolysis.
To the tolerance of human protease:
As previously discussed, the PAL variant of some evolution has been screened to comment for Porcine trypsin and bovine chymotrypsin Valence they to the tolerance by the proteolysis of existing enzyme in the gastrointestinal tract.Also user's enzyme tests some evolution PAL variant, to confirm that they are resistant to people's homologue of pig or ox enzyme.In these measurements, by WT PAL and variant 36 jellies Dry powder (2.4g/L, in 100mM sodium phosphate pH 7.0) and mankind's chymotrypsin (Athens Research) 0- 80BTEE unit/ml or people's trypsase (ProSpec) (0-10,000BAEE unit/ml) are incubated for 2h at 37 DEG C.Then, will 100 μ L mixtures are added to the hole of poly- 96 orifice plate of acrylate (Costar#3635, Corning), and 100 μ L are then added 50mM phenylalanine, 200mM sodium phosphate pH 7.0.By the of short duration mixing of solution, and initial rate uses Plus384Or(Molecular Devices) light absorption microplate reader by pushing away at any time at 290nm (every 12-20s is primary, by 5-20min) tracking absorbance is moved to determine.As a result it is shown in FIG. 5.As shown in Figure 5, variant No. 36 more more stable than wild type PAL enzyme.
To the tolerance of pancreas crude extract:
The PAL variant of evolution is also tested for determine its tolerance to pancreas enzyme.WT PAL, variant 36, variant 42 Number and variant 43 freeze-dryings powder (preparation as described in Example 4;12g/L, in 50mM potassium phosphate pH 6.8) with Pig pancreatinum (pancreatin) (4x Sigma-Aldrich, St.Louis, MO) is mixed with 1:1, and is vibrated at 37 DEG C (400rpm, 1 " throw) is incubated for up to 23h.10 μ L aliquots of reactant are added to poly- by point at the appointed time 190 μ L 50mM phenylalanines, 190mM sodium phosphate pH in the hole of 96 orifice plate of acrylate (Costar#3635, Corning) In 7.0.By the of short duration mixing of reactant, and initial rate uses Plus384Or190 (Molecular Devices) light absorption microplate reader is by the way that (every 12-20s is primary, by 5- over time at 290nm Track absorbance 20min) to determine.As a result it is shown in FIG. 6.As shown in Figure 6, under these determination conditions, with wild type PAL enzyme is compared, and variant 36, variant 42 and variant 43 all show significant stability.
The influence of enteron aisle detergent:
Be also tested for the PAL variant of evolution with determine they under the existence or non-existence of intestinal bile acid and fatty acid it is right The sensibility of proteolysis, to evaluate whether these acid influence their stability.By the powder comprising variant 36 freeze-dryings (as described in Example 4) is dissolved in 0-16mM natrium taurocholicum with 50 μ g/ml, in 100mM sodium phosphate pH 7.0.By pig pancreas Protease and bovine chymotrypsin (each 80mg) are dissolved in the 100mM sodium phosphate pH 7.0 of 2ml, and in 100mM sodium phosphate In it is continuous 11 times dilute 2 times.For measurement, 50 μ L PAL solution are mixed with 50 μ L protein enzyme solutions.By mixture at 37 DEG C Under with 400rpm (1 " throw) be incubated for 1h.Then, by 50 μ L mixtures and 150 μ L 200mM sodium phosphates, 50mM phenylalanine pH 7.0 mixing.By the of short duration mixing of every kind of reactant, and activity uses Plus384Or 190 (Molecular Devices) light absorption microplate reader are by the way that (every 12-20s is primary, by 5- over time at 290nm Track absorbance 20min) to determine.As a result it is shown in FIG. 7.As shown in the figure, the natrium taurocholicum of addition increases Variant 36 sensibility to proteolysis.
Embodiment 6
The enteron aisle stability of variant PAL
In order to evaluate the stability and activity of the PAL variant when it is by animal intestinal tract, raised by force with the enzyme variants of purifying small Mouse.The healthy C57B1/6 mouse of 10-12 weeks age of mouse and weighing 20-26g is maintained in metabolic cage, and fasting 15h.Arbitrarily Water is provided.After overnight fasting, the 0.5M Tris-HCl pH 8.5 of needle 0.3ml is raised by force using 21-gauge and in 0.5M 8mg/ml WT PAL (as described in Example 4 preparation) in Tris-HCl pH 8.5 or in 0.5M Tris-HCl pH The mixture of 8mg/ml variant 42 (preparation as described in Example 4) in 8.5 raises by force animal.0.5,2 after strong raise Or 6h, animal is breaked end, using green top capillary blood collecting pipe (Ram Scientific) collect blood plasma, and collect stomach, Duodenum (apart from stomach~1-8cm), jejunum (apart from stomach~10-18cm), ileum (more than caecum~8cm) and colon (caecum Below~5cm) content.The weight of these contents is recorded, and these contents are stored in -80 DEG C before analysis.
Stomach or intestinal contents 100mM sodium phosphate pH 7.0 are diluted into 4X, of short duration mixing, and with 14,000rpm x 2min Centrifugation.Supernatant is transferred to 350 μ L, 0.45 μ Μ AcroPrepTMAdvanced 96- hole filters (Pall Corp), and by true Sky filtering removal particle.The enzymatic activity of the clear filtrate of evaluation as described in the previous embodiment is simultaneously evaluated by SDS-PAGE The presence of complete PAL albumen.The result shows that compared with wild type PAL enzyme and negative control, the enzyme activity of the PAL variant of evolution Property in duodenum and jejunum display be higher.
Embodiment 7
The level of plasma phenylalanine
The plasma sample that the mouse described in embodiment 6 is collected is evaluated, in the blood to determine tested mouse The amount of existing phenylalanine.It include 0.6mM dl- phenylalanine (ring D by mice plasma (50 μ L) and 250 μ L5) (that is, comprising The deuterium that is bonded with aromatic ring carbon atom and the phenylalanine of the version of the isotope labelling of non-hydrogen;Cambridge isotope is real Test room) acetonitrile merge.Sample is mixed into 5min at RT, 10min is centrifuged with 3200x g at 4 DEG C, and supernatant is shifted Sample analysis is used for plate.For analysis, the 10 each samples of μ L are injected into 3200LC/MS/MS system (AB Sciex) pass through C18Column (150x 2.1mM, 5 μm of pearls) (Supelco, present Sigma- Aldrich), eluted in water 0.1% formic acid (A) and acetonitrile (B).Across 5min gradient (t=0,97%A;3min, 50%A; 3.5min, 5%A;4min, 97%A;5min, 95%A) elution samples, find turning for the 166 to 120 of endogenous phenylalanine 171 to 125 transformation of the standard items of change and isotope labelling.The result shows that compared with wild type PAL enzyme and negative control, Plasma phenylalanine level in the mouse samples of PAL variant (that is, variant 42) for giving evolution is at 30 minutes It is lower at m-.
Embodiment 8
The treatment function of variant PAL
In order to evaluate whether PAL variant reduces intracorporal blood-serum P he level, the mouse model of PKU has been used.In these realities In testing, PAL albumen is fed by force to impacted animal.Firstly, by removing phenylalanine three days from the diet of mouse, then It is horizontal to establish consistent baseline Phe in mouse to inject the solution comprising Phe of known quantity.It will be with C57B1/6 background 3- to -6 months age of mouse homozygous PAH enu-2 mouse (referring to McDonald etc., Proc.Natl.Acad.Sci.USA 87: 1965-1967 [1990]) it is transferred to the diet (TD.97152, Harlan) of no phenylalanine, have and is provided in its drinking water 0.03g/L Phe continue 72h.Before beginning the process in time=0h, by the Phe of mouse injection 0.15mg/g weight (the Phe solution from 10g/L in water).55 minutes after injection, about 20 μ L blood are collected by tail portion-venipuncture, And system is put on filter paper.Then, after injection time 1h, 3h and 5h when, by mouse 0.3ml 50-100g/L WT AvPAL, WT AvPAL adds Aprotinin, BSA or variant 42 strong to raise.After injection when 6h, 7h and 9h, other blood speckles are collected On filter paper.It is analyzed by blood speckles drying and in the LC-MS/MS for carrying out Phe and Tyr level using methods known in the art Before be stored in -20 DEG C (referring to Chase etc., Clin.Chem., 39:66-71 [1993]).
As a result it is shown in FIG. 8.As shown in this figure, it is raised by force with inactive protein (BSA), leads to increased serum Phe is horizontal.In contrast, cause constant Phe- horizontal with the processing of the WT AvPAL of proteolysis sensitivity, and and protease The same protein that inhibitor Aprotinin combines causes Phe- is horizontal to be remarkably decreased.The result is also shown that in protease inhibitors In the absence of application engineering-PAL variant 42 lead to reduceds blood-serum P he level.
Embodiment 9
PAL's deimmunizes
In this embodiment, describe progress is intended to identify the multifarious reality for removing t cell epitope from PAL It tests.
Deimmunize multifarious identification:
In order to identify the amino acid that can remove t cell epitope when being mutated, it is thin to identify that prediction causes T using computer approach The PAL sequence of born of the same parents' response.Concurrently, the Experimental Research for permission, harmless mutation has also been carried out, especially for not piercing The Experimental Research of the amino acid of protein active is kept in sharp measurement (for example, the measurement described in example 2).Then it analyzes Active variant is to determine influence of the mutation to the immunogenicity of prediction.
The t cell epitope of computer identification presumption in modification A vPAL:
Use immune epitope database (IEDB as known in the art;Immune epitope database and analysis resource website (Immune Epitope Database and Analysis Resource website)) tool and proprietary statistical analysis Tool is (see, for example, iedb.org and Vita etc., Nucl.Acids Res., 38 (Database issue): D854-62 [2010] .Epub 2009Nov 11]) identify the t cell epitope estimated in AvPAL variant 36.By the AvPAL variant It is parsed into all possible 15-mer analytical framework, it is Chong Die with last 14 amino acid of each frame.Using in the website IEDB The method of upper recommendation passes through eight kinds of common II class HLA-DR allele for the human colony for covering nearly 95% with collective (DRB1*0101, DRB1*0301, DRB1*0401, DRB1*0701, DRB1*0801, DRB1*1101, DRB1*1301 and DRB1*1501) the combination pair of the prediction (see, for example, Southwood etc., J.Immunol., 160:3363-3373 [1998]) The 9-mer nucleus of 15-mer analytical framework scores, and evaluates the Immunogenic potential of 15-mer analytical framework.Using such as Statistical and analytical tool known in the art identify include in modification A vPAL potential T- cell epitope cluster (that is, comprising become The subregion with abnormal high Immunogenic potential in body AvPAL).The t cell epitope cluster of identification is directed to known epitope IEDB database and GenBank albumen database screened.These screening and identifications are in the modification A vPAL variant 36 The t cell epitope of a presumption of 10 (ten).Below these epitopes be referred to as TCE-I, II, III, IV, V, VI, VII, VIII, IX and X。
Deimmunize the design in library:
Firstly, creating includes in the t cell epitope region for 10 presumptions identified as described above from for creating Build all neutral and beneficial mutation combinatorial libraries that the directed evolution wheel of modification A vPAL identifies.Analyze these mutation pair The influence combined with the prediction of eight kinds of common II class HLA-DR allele.Predict it is multiple mutation remove or reduce TCE-I, II,VI,VII.These mutation are integrated in combinatorial libraries.Then library is designed using saturation mutagenesis, at remaining six Each amino acid of T- cell epitope (that is, TCE-III, IV, V, VIII, IX and X) interior mutagenesis.Finally, creating includes from target Aggregation shape of the beneficial diversity identified to the evolution of more wheels of TCE-I, III, IV, VIII and X together with the influence PAL variant of report The combinatorial libraries of two amino acid of C503 and C565 of state.Make from the library most preferably hitting experience targeting TCE-III and The mutagenesis of the other saturation mutagenesis of VIII and the other targeting at several positions.
Deimmunize the construction and screening in library:
The combination and saturation mutagenesis library designed as described above is constructed by methods known in the art, and such as Activity is tested in the measurement not stimulated described in embodiment 2.Identification and sequencing active variant.In following table 9-1 into 9-7 Provide the activity and mutation about AvPAL variant 36 and many AvPAL variants.
Deimmunize multifarious identification:
By the binding analysis for evaluating active variant and eight kinds described above common II class HLA-DR allele Active variant is horizontal with the immunogenicity for determining them.Total immunogenicity scoring and immunogenicity hit counting in table 9-1 to 9-7 Middle display.Total immunogenicity scoring reflects the immunogenicity of variant integrally predicted (that is, higher scoring indicates higher level The immunogenicity of prediction).Immunogenicity " hitting counting " instruction has abnormal high Immunogenic potential (that is, higher hit The higher Immunogenic potential of counting indicator) 15-mer analytical framework number.Immunogene with lower total prediction Property scoring and/or immunogenicity hit the immunogenicity that counts be less than with reference to variant to hit the mutation of counting be considered as " going to exempt from Epidemic diseaseization mutation ".Optimal deimmunized mutation recombination will be accredited as to generate many active and predict to compare start reference Modification A vPAL has the variant of significant lower immunogenicity.In following table, FIOP result is from the measurement not stimulated;It is right In total immunogenicity scoring (TIS) and immunogenicity hit counting (IHC), as a result be directed to entire PAL albumen (table 9-1,9-8 and 9-9) or for specified epitope (table 9-2 to 9-7) is shown.
The testing in vitro of deimmunized PAL variant:
Deimmunized PAL variant is tested in the measurement of dendron T cell, causes t cell response empirically to test it Ability.Using standard technique from people's donor separating periphery blood monocytic cell (PBMC).These cells are used as and are determining ingredient The source for the monocyte cultivated in culture medium, to generate immature dendritic cells.By these immature dendritic cells (DC) deimmunized PAL variant is loaded, and then by the way that further culture is induced at more mature in defined medium Phenotype, to provide the DC of antigen sensibilization.By the donor of the CD8+T cell depleting obtained from donor sample identical with DC PBMC is marked with CFSE, is then cultivated with the DC of antigen sensibilization 7 days, tested in octuplicate thereafter.Each DC-T cell training Supporting object includes one group of untreated control group (that is, negative control).The measurement further includes with reference to antigen control (that is, positive right According to), including two kinds of effective holoprotein antigens.Using from diversity major histocompatibility complex II class allele The statistics of the measurement ability that provides PAL variant initiation t cell response of cell of 50 people's donors separation relevant comment Valence.
Although describing the present invention by reference to specific embodiment, a variety of changes can be made and be alternatively equal Object, to adapt to specific situation, material, the forming of substance, one method, method step or multiple steps, to realize this hair Bright benefit, without departing from range claimed.
It each of is quoted in present disclosure and each publication and patent document is incorporated herein by reference and is used for All purposes in the U.S., as each such publication or file clearly and are respectively pointed out to be incorporated herein by reference Equally.The reference of publication and patent document is not considered as the instruction that any such file is the relevant prior art, It does not constitute to perhaps the date recognizes in it.
Sequence table
<110>Codexis Inc.
Ji Jiate W Ha Siman
Nicholas J A Jiade
Benjamin meter Jie Ci
Jonathan's not rom
Zhang Xiyun
<120>phenylalanine lyase polypeptide
<130> CX7-131WO2
<150> US 61/813,586
<151> 2013-04-18
<150> US 61/897,932
<151> 2013-10-31
<160> 33
<170>PatentIn version 3 .5
<210> 1
<211> 7407
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 1
tctcatgttt gacagcttat catcgataag ctttaatgcg gtagtttatc acagttaaat 60
tgctaacgca gtcaggcacc gtgtatgaaa tctaacaatg cgctcatcgt catcctcggc 120
accgtcaccc tggatgctgt aggcataggc ttggttatgc cggtactgcc gggcctcttg 180
cgggatatcc ggatatagtt cctcctttca gcaaaaaacc cctcaagacc cgtttagagg 240
ccccaagggg ttatgctagt tattgctcag cggtggcagc agccaactca gcttcctttc 300
gggctttgtt agcagccgga tccttaatgc agacacggca gaatgtcctg aacggcctga 360
acaataacac caccggctgc aatatctgca ctaatacgtg caatatgttc atccagaccc 420
tgttcattat cattccaaat atacggacga tctgaggtcg gtttctgacc aacaacatga 480
cgaactgcgc tatacagacg ttcggttgcc ggtgacagac aggcacgtgc atcataatga 540
ccggtttttt tgtaggtacg cagatcaact gcctgaacac caaacatcag ggcaatggca 600
acataattct gaaaaatatc aacgctacga cgtgccaggg ttgcgctggt ataaccctgg 660
ctgttaatat tctggttaaa ctgttcggca tgggtcggaa aacgatctgc aatactatta 720
ccataaaagg tcagcagcgg cataatgcta ttaccgcaaa tctgcagacc tttcagaccc 780
atattaactt tacgttcacg attacccagc agactcggag gcagaccatt gctaaattcc 840
ggtgatgcca gcagtgcaat ctgaacatcc agatgttttg ccagcagacc gatataatag 900
cgcagatgat ccatacccat accaacatac tgacccagaa aattaccacc atgatagctt 960
gcctgattat caacatcaat cagcgggtta tcggtaacgc tgttaatctc aatttcgatt 1020
tgtttggcaa tctggctaat accatcaaca atcggaccca gatactgcgg cagacaacgc 1080
aggctataac gatcctggat cagttcatga tcacgataat catgtttacc atccagttca 1140
tcacgaacca gctggctatt ggccagcagg ctaatcatct gatctgctgc ccacagctga 1200
cccggatgcg gtttgctgtt atggataaac ggatgaaagc tctgatttgt accattcagt 1260
gcctgaatat ccagtgcatg aacacccatt gcaattgcgg tcagaatctg ggtatcataa 1320
acacaatttg ctgcaatacc ggtcataacg ctggtgccat tcatcattgc cagaccttct 1380
ttcggcagca gggtcagcgg actcagattc agctgacgca gtgcggtcgg tgcgtccatt 1440
tctttgccat taaaatcaac tttaaagctc gggtccaggc caatcaggct accggtaata 1500
tagctcagcg gaaccagatc accgctggca ccaatgctac caaattcata aacatacggg 1560
gtaacaccgg cattcagaaa gatttccatg cgtttaatca gttccagacg aataccgctt 1620
gcaccacgca tgtggctatt tgcacgcagc agcattgctg cacgaacatc tgccagcggc 1680
agtttattac ctgcaccggt tttcagaaac caaaccagat tggtctgcag ttcgcttgcc 1740
tgttcacggc taattgcaac atttgccata ccaccaaaac cgctggtaac accataaatc 1800
ggttcaccgc tttcaactgc attattgata taatcacagc tggcctgaat accctgcaga 1860
atatcggtat tattggtcag gctaaccagg gtgccattac gggcaacacg tgcaacatca 1920
ttgatggtca gtttctgatt accaataatc acatttgcgc tgctattgcc ggtaaagcta 1980
aactgctggc tgctggtttt gctctgtgcc tggctcaggg ttttcatatg acgaccttcg 2040
atatggccgc tgctgtgatg atgatgatga tgatgatgat gatggcccat ggtatatctc 2100
cttcttaaag ttaaacaaaa ttatttctag aggggaattg ttatccgctc acaattcccc 2160
tatagtgagt cgtattaatt tcgcgggatc gagatctcga tcctctacgc cggacgcatc 2220
gtggccggca tcaccggcgc cacaggtgcg gttgctggcg cctatatcgc cgacatcacc 2280
gatggggaag atcgggctcg ccacttcggg ctcatgagcg cttgtttcgg cgtgggtatg 2340
gtggcaggcc ccgtggccgg gggactgttg ggcgccatct ccttgcatgc accattcctt 2400
gcggcggcgg tgctcaacgg cctcaaccta ctactgggct gcttcctaat gcaggagtcg 2460
cataagggag agcgtcgaga tcccggacac catcgaatgg cgcaaaacct ttcgcggtat 2520
ggcatgatag cgcccggaag agagtcaatt cagggtggtg aatgtgaaac cagtaacgtt 2580
atacgatgtc gcagagtatg ccggtgtctc ttatcagacc gtttcccgcg tggtgaacca 2640
ggccagccac gtttctgcga aaacgcggga aaaagtggaa gcggcgatgg cggagctgaa 2700
ttacattccc aaccgcgtgg cacaacaact ggcgggcaaa cagtcgttgc tgattggcgt 2760
tgccacctcc agtctggccc tgcacgcgcc gtcgcaaatt gtcgcggcga ttaaatctcg 2820
cgccgatcaa ctgggtgcca gcgtggtggt gtcgatggta gaacgaagcg gcgtcgaagc 2880
ctgtaaagcg gcggtgcaca atcttctcgc gcaacgcgtc agtgggctga tcattaacta 2940
tccgctggat gaccaggatg ccattgctgt ggaagctgcc tgcactaatg ttccggcgtt 3000
atttcttgat gtctctgacc agacacccat caacagtatt attttctccc atgaagacgg 3060
tacgcgactg ggcgtggagc atctggtcgc attgggtcac cagcaaatcg cgctgttagc 3120
gggcccatta agttctgtct cggcgcgtct gcgtctggct ggctggcata aatatctcac 3180
tcgcaatcaa attcagccga tagcggaacg ggaaggcgac tggagtgcca tgtccggttt 3240
tcaacaaacc atgcaaatgc tgaatgaggg catcgttccc actgcgatgc tggttgccaa 3300
cgatcagatg gcgctgggcg caatgcgcgc cattaccgag tccgggctgc gcgttggtgc 3360
ggatatctcg gtagtgggat acgacgatac cgaagacagc tcatgttata tcccgccgtt 3420
aaccaccatc aaacaggatt ttcgcctgct ggggcaaacc agcgtggacc gcttgctgca 3480
actctctcag ggccaggcgg tgaagggcaa tcagctgttg cccgtctcac tggtgaaaag 3540
aaaaaccacc ctggcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 3600
aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 3660
atgtaagtta gctcactcat taggcaccgg gatctcgacc gatgcccttg agagccttca 3720
acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca cttatgactg 3780
tcttctttat catgcaactc gtaggacagg tgccggcagc gctctgggtc attttcggcg 3840
aggaccgctt tcgctggagc gcgacgatga tcggcctgtc gcttgcggta ttcggaatct 3900
tgcacgccct cgctcaagcc ttcgtcactg gtcccgccac caaacgtttc ggcgagaagc 3960
aggccattat cgccggcatg gcggccgacg cgctgggcta cgtcttgctg gcgttcgcga 4020
cgcgaggctg gatggccttc cccattatga ttcttctcgc ttccggcggc atcgggatgc 4080
ccgcgttgca ggccatgctg tccaggcagg tagatgacga ccatcaggga cagcttcaag 4140
gatcgctcgc ggctcttacc agcctaactt cgatcactgg accgctgatc gtcacggcga 4200
tttatgccgc ctcggcgagc acatggaacg ggttggcatg gattgtaggc gccgccctat 4260
accttgtctg cctccccgcg ttgcgtcgcg gtgcatggag ccgggccacc tcgacctgaa 4320
tggaagccgg cggcacctcg ctaacggatt caccactcca agaattggag ccaatcaatt 4380
cttgcggaga actgtgaatg cgcaaaccaa cccttggcag aacatatcca tcgcgtccgc 4440
catctccagc agccgcacgc ggcgcatctc gggcagcgtt gggtcctggc cacgggtgcg 4500
catgatcgtg ctcctgtcgt tgaggacccg gctaggctgg cggggttgcc ttactggtta 4560
gcagaatgaa tcaccgatac gcgagcgaac gtgaagcgac tgctgctgca aaacgtctgc 4620
gacctgagca acaacatgaa tggtcttcgg tttccgtgtt tcgtaaagtc tggaaacgcg 4680
gaagtcagcg ccctgcacca ttatgttccg gatctgcatc gcaggatgct gctggctacc 4740
ctgtggaaca cctacatctg tattaacgaa gcgctggcat tgaccctgag tgatttttct 4800
ctggtcccgc cgcatccata ccgccagttg tttaccctca caacgttcca gtaaccgggc 4860
atgttcatca tcagtaaccc gtatcgtgag catcctctct cgtttcatcg gtatcattac 4920
ccccatgaac agaaatcccc cttacacgga ggcatcagtg accaaacagg aaaaaaccgc 4980
ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga 5040
gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct 5100
ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 5160
cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 5220
cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 5280
cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 5340
atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcgctctt 5400
ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 5460
ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 5520
tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 5580
tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 5640
gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 5700
ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 5760
tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca 5820
agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 5880
atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta 5940
acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta 6000
actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct 6060
tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt 6120
tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 6180
tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca 6240
tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat 6300
caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg 6360
cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc cccgtcgtgt 6420
agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag 6480
acccacgctc accggctcca gatttatcag caataaacca gccagccgga agggccgagc 6540
gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag 6600
ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctgcaggca 6660
tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa 6720
ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga 6780
tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca gcactgcata 6840
attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca 6900
agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaacacggg 6960
ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg 7020
ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 7080
cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag 7140
gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac 7200
tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca 7260
tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag 7320
tgccacctga cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta 7380
tcacgaggcc ctttcgtctt caagaat 7407
<210> 2
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 2
atgaaaaccc tgagccaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gcaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccaat agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatca tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctgct gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcctgtc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga cctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgtgtctgca t 1701
<210> 3
<211> 1701
<212> DNA
<213>Anabaena variabilis (Anabaena variabilis)
<400> 3
atgaaaaccc tgagccaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gcaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccaat agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatca tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctgct gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcctgtc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga cctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgtgtctgca t 1701
<210> 4
<211> 567
<212> PRT
<213>Anabaena variabilis
<400> 4
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Ala Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Asn Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp His Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Leu Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Cys Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Thr Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Cys Leu His
565
<210> 5
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 5
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgtacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcagggact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatca tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg tatcaccgga atttaacaat 1200
ggtctgcctg cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcctgtc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagtatccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgtgtctgca t 1701
<210> 6
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 6
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Val Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Gly
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp His Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Val Ser Pro Glu Phe Asn Asn
385 390 395 400
Gly Leu Pro Ala Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Cys Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Tyr Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Cys Leu His
565
<210> 7
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 7
atgaaaaccc tgagccaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gcaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccaat agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctgct gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcctgtc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga cctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgtgtctgca t 1701
<210> 8
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 8
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Ala Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Asn Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Leu Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Cys Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Thr Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Cys Leu His
565
<210> 9
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 9
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgtacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcctgtc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgtgtctgca t 1701
<210> 10
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 10
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Val Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Cys Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Cys Leu His
565
<210> 11
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 11
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgtacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata aagatattct gcagcgtatt 180
caggccagct gtgattatat caataatgca gttgaaaaag gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atatggatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
aaaaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgccgctgca t 1701
<210> 12
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 12
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Val Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Lys Asp Ile Leu Gln Arg Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Lys Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Met Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Lys Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Pro Leu His
565
<210> 13
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 13
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgtacgt 120
gttgcccgta atggcaccct ggttagcctg accaataata aagatattct gcagcgtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atatggatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
aaaaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 14
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 14
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Val Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Lys Asp Ile Leu Gln Arg Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Met Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Lys Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 15
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 15
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcgcgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc atggtgcaag cggtattcgt 420
ctggaactga ttaaacgcgc ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ggcctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat attggccaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 16
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 16
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met His Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Ala Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Gly Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Gly Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 17
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 17
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcgcgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc atggtgcaag cggtattcgt 420
ctggaactga ttaaacgcgc ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ggcctggcaa aacatctgga tacccagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat attggccaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 18
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 18
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met His Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Ala Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Gly Leu Ala Lys
370 375 380
His Leu Asp Thr Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Gly Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 19
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 19
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcgcgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc atggtgcaag cggtattcgt 420
ctggaactga ttaaacgcgc ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta tgaaatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat attggccaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 20
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 20
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met His Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Ala Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Glu Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Gly Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 21
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 21
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcgcgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc atggtgcaag cggtattcgt 420
ctggaactga ttaaacgcgc ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta tgaaatcggt 1140
ctgctggcaa aacatctgga tacccagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat attggccaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 22
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 22
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met His Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Ala Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Glu Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Thr Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Gly Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 23
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 23
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ccataccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgtacgt 120
gttgcccgta atggcaccgc ggttagcctg accaataata aagatattct gcagcgtatt 180
caggccagct gtgattatat caataatgca gttgaaaaag gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcat ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc agccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atatggatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ctgctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat atttttcaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
aaaaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgccgctgca t 1701
<210> 24
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 24
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser His Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Val Arg Val Ala Arg Asn Gly Thr Ala Val
35 40 45
Ser Leu Thr Asn Asn Lys Asp Ile Leu Gln Arg Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Lys Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Gln Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Met Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Lys Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Pro Leu His
565
<210> 25
<211> 1701
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 25
atgaaaaccc tgagtcaggc acagagcaaa accagcagcc agcagtttag ctttaccggc 60
aatagcagcg caaatgtgat tattggtaat cagaaactga ccatcaatga tgttgcgcgt 120
gttgcccgta atggcaccct ggttagcctg accaataata ccgatattct gcagggtatt 180
caggccagct gtgattatat caataatgca gttgaaagcg gtgaaccgat ttatggtgtt 240
accagcggtt ttggtggtat ggcaaatgtt gtaattagcc gtgaacaggc aagcgaactg 300
cagaccaatc tggtttggtt tctgaaaacc ggtgcaggta ataaactgcc gctggcagat 360
gttcgtgcag caatgctgct gcgtgcaaat agccacatgc gtggtgcaag cggtattcgt 420
ctggaactga ttaaacgcgc ggaaatcttt ctgaatgccg gtgttacccc gtatgtttat 480
gaatttggta gcattggtgc cagcggtgat ctggttccgc tgagctatat taccggtagc 540
ctgattggcc tggacccgag ctttaaagtt gattttaatg gcaaagaaat ggacgcaccg 600
accgcactgc gtcagctgaa tctgagtccg ctgaccctgc tgccgaaaga aggtctggca 660
atgatgaatg gcaccagcgt tatgaccggt attgcagcaa attgtgttta tgatacccag 720
attctgaccg caattgcaat gggtgttcat gcactggata ttcaggcact gaatggtaca 780
aatcagagct ttcatccgtt tatccataac agcaaaccgc atccgggtca gctgtgggca 840
gcagatcaga tgattagcct gctggccggt agccagctgg ttcgtgatga actggatggt 900
aaacatgatt atcgtgatgg tgaactgatc caggatcgtt atagcctgcg ttgtctgccg 960
cagtatctgg gtccgattgt tgatggtatt agccagattg ccaaacaaat cgaaattgag 1020
attaacagcg ttaccgataa cccgctgatt gatgttgata atcaggcaag ctatcatggt 1080
ggtaattttc tgggtcagta tgttggtatg ggtatggatc atctgcgcta ttatatcggt 1140
ggcctggcaa aacatctgga tgttcagatt gcactgctgg catcaccgga atttagcaat 1200
ggtctgcctc cgagtctggt gggtaatcgt gaacgtaaag ttaatatggg tctgaaaggt 1260
ctgcagattt gcggtaatag cattatgccg ctgctgacct tttatggtaa tagtattgca 1320
gatcgttttc cgacccatgc cgaacagttt aaccagaata ttaacagcca gggttatacc 1380
agcgcaaccc tggcacgtcg tagcgttgat attggccaga attatgttgc cattgccctg 1440
atgtttggtg ttcaggcagt tgatctgcgt acctacaaaa aaaccggtca ttatgatgca 1500
cgtgcccagc tgtcaccggc aaccgaacgt ctgtatagcg cagttcgtca tgttgttggt 1560
cagaaaccga gctcagatcg tccgtatatt tggaatgata atgaacaggg tctggatgaa 1620
catattgcac gtattagtgc agatattgca gccggtggtg ttattgttca ggccgttcag 1680
gacattctgc cgaacctgca t 1701
<210> 26
<211> 567
<212> PRT
<213>artificial sequence
<220>
<223>polypeptide synthesized
<400> 26
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe
1 5 10 15
Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys
20 25 30
Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val
35 40 45
Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Ala Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe
180 185 190
Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu
195 200 205
Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln
225 230 235 240
Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala
245 250 255
Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu
275 280 285
Ala Gly Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr
290 295 300
Arg Asp Gly Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln
325 330 335
Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Gly Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Arg Glu Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu
450 455 460
Ala Arg Arg Ser Val Asp Ile Gly Gln Asn Tyr Val Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly
485 490 495
His Tyr Asp Ala Arg Ala Gln Leu Ser Pro Ala Thr Glu Arg Leu Tyr
500 505 510
Ser Ala Val Arg His Val Val Gly Gln Lys Pro Ser Ser Asp Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln
545 550 555 560
Asp Ile Leu Pro Asn Leu His
565
<210> 27
<211> 4945
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<220>
<221> misc_feature
<222> (140)..(142)
<223>n is a, c, g or t
<220>
<221> misc_feature
<222> (1758)..(1758)
<223>n is a, c, g or t
<220>
<221> misc_feature
<222> (1761)..(1762)
<223>n is a, c, g or t
<220>
<221> misc_feature
<222> (1781)..(1781)
<223>n is a, c, g or t
<220>
<221> misc_feature
<222> (1784)..(1784)
<223>n is a, c, g or t
<220>
<221> misc_feature
<222> (1786)..(1786)
<223>n is a, c, g or t
<400> 27
tggccaccat caccatcacc attagggaag agcagatggg caagcttgac ctgtgaagtg 60
aaaaatggcg cacattgtgc gacatttttt tttgaattct acgtaaaaag cagccgatac 120
atcggctgct ttttttttgn nngaggttcc aacttgtggt ataatgaaat aagatcactc 180
cggagcgtat tttttgagtt atcgagattt tcaggagcta aggaggaact aaaatggaga 240
aaaaaatcac tggatatacc accgttgata tatcccaatg gcatcgtaaa gaacattttg 300
aggcatttca gtcagttgct caatgtacct ataaccagac cgttcagctg gatattacgg 360
cctttttaaa gaccgtaaag aaaaataagc acaagtttta tccggccttt attcacattc 420
ttgcccgcct gatgaatgct catccggagt tccgtatggc aatgaaagac ggtgagctgg 480
tgatatggga tagtgttcac ccttgttaca ccgttttcca tgagcaaact gaaacgtttt 540
catcgctctg gagtgaatac cacgacgatt tccggcagtt tctacacata tattcgcaag 600
atgtggcgtg ttacggtgaa aacctggcct atttccctaa agggtttatt gagaatatgt 660
ttttcgtctc agccaatccc tgggtgagtt tcaccagttt tgatttaaac gtggccaata 720
tggacaactt cttcgccccc gttttcacca tgggcaaata ttatacgcaa ggcgacaagg 780
tgctgatgcc gctggcgatt caggttcatc atgccgtctg tgatggcttc catgtcggca 840
gaatgcttaa tgaattacaa cagtactgcg atgagtggca gggcggggcg taactgcagg 900
agctcaaaca gcagcctgta ttcaggctgc ttttttcgtt ttggtctgcg cgtaatctct 960
tgctctgaaa acgaaaaaac cgccttgcag ggcggttttt cgaaggttct ctgagctacc 1020
aactctttga accgaggtaa ctggcttgga ggagcgcagt caccaaaact tgtcctttca 1080
gtttagcctt aaccggcgca tgacttcaag actaactcct ctaaatcaat taccagtggc 1140
tgctgccagt ggtgcttttg catgtctttc cgggttggac tcaagacgat agttaccgga 1200
taaggcgcag cggtcggact gaacgggggg ttcgtgcata cagtccagct tggagcgaac 1260
tgcctacccg gaactgagtg tcaggcgtgg aatgagacaa acgcggccat aacagcggaa 1320
tgacaccggt aaaccgaaag gcaggaacag gagagcgcac gagggagccg ccagggggaa 1380
acgcctggta tctttatagt cctgtcgggt ttcgccacca ctgatttgag cgtcagattt 1440
cgtgatgctt gtcagggggg cggagcctat ggaaaaacgg ctttgccgcg gccctctcac 1500
ttccctgtta agtatcttcc tggcatcttc caggaaatct ccgccccgtt cgtaagccat 1560
ttccgctcgc cgcagtcgaa cgaccgagcg tagcgagtca gtgagcgagg aagcggaata 1620
tatcctgtat cacatattct gctgacgcac cggtgcagcc ttttttctcc tgccacatga 1680
agcacttcac tgacaccctc atcagtgaac caccgctggt agcggtggtt tttttaggcc 1740
tatggccttt tttttttntg nnaaaccttt cgcggtatgg natnanagcg cccggaagag 1800
agtcaattaa gagggtggtg aatgtgaaac cagtaacgtt atacgatgtc gcagagtatg 1860
ccggtgtctc ttatcagacc gtttcccgcg tggtgaacca ggccagccac gtttctgcga 1920
aaacgcggga aaaagtggaa gcggcgatgg cggagctgaa ttacattccc aaccgcgtgg 1980
cacaacaact ggcgggcaaa cagtcgttgc tgattggcgt tgccacctcc agtctggccc 2040
tgcacgcgcc gtcgcaaatt gtcgcggcga ttaaatctcg cgccgatcaa ctgggtgcca 2100
gcgtggtggt gtcgatggta gaacgaagcg gcgtcgaagc ctgtaaagcg gcggtgcaca 2160
atcttctcgc gcaacgcgtc agtgggctga tcattaacta tccgctggat gaccaggatg 2220
ccattgctgt ggaagctgcc tgcactaatg ttccggcgtt atttcttgat gtctctgacc 2280
agacacccat caacagtatt attttctccc atgaagacgg tacgcgactg ggcgtggagc 2340
atctggtcgc attgggtcac cagcaaatcg cgctgttagc gggcccatta agttctgtct 2400
cggcgcgtct gcgtctggct ggctggcata aatatctcac tcgcaatcaa attcagccga 2460
tagcggaacg ggaaggcgac tggagtgcca tgtccggttt tcaacaaacc atgcaaatgc 2520
tgaatgaggg catcgttccc actgcgatgc tggttgccaa cgatcagatg gcgctgggcg 2580
caatgcgcgc cattaccgag tccgggctgc gcgttggtgc ggacatctcg gtagtgggat 2640
acgacgatac cgaagacagc tcatgttata tcccgccgtt aaccaccatc aaacaggatt 2700
ttcgcctgct ggggcaaacc agcgtggacc gcttgctgca actctctcag ggccaggcgg 2760
tgaagggcaa tcagctgttg cccgtctcac tggtgaaaag aaaaaccacc ctggcgccca 2820
atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg 2880
tttcccgact ggaaagcggg cagtgagcgg tacccgataa aagcggcttc ctgacaggag 2940
gccgttttgt ttctcgagtt aattaaggca gtgagcgcaa cgcaattaat gtgagttagc 3000
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 3060
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac ggattcactg 3120
gccgtcgttt tacaatctag aggccagcct ggccataagg agatatacat atgggccatc 3180
atcatcatca tcatcatcat catcacagca gcggccatat cgaaggtcgt catatgaaaa 3240
ccctgagcca ggcacagagc aaaaccagca gccagcagtt tagctttacc ggcaatagca 3300
gcgcaaatgt gattattggt aatcagaaac tgaccatcaa tgatgttgca cgtgttgccc 3360
gtaatggcac cctggttagc ctgaccaata ataccgatat tctgcagggt attcaggcca 3420
gctgtgatta tatcaataat gcagttgaaa gcggtgaacc gatttatggt gttaccagcg 3480
gttttggtgg tatggcaaat gttgcaatta gccgtgaaca ggcaagcgaa ctgcagacca 3540
atctggtttg gtttctgaaa accggtgcag gtaataaact gccgctggca gatgttcgtg 3600
cagcaatgct gctgcgtgca aatagccaca tgcgtggtgc aagcggtatt cgtctggaac 3660
tgattaaacg catggaaatc tttctgaatg ccggtgttac cccgtatgtt tatgaatttg 3720
gtagcattgg tgccagcggt gatctggttc cgctgagcta tattaccggt agcctgattg 3780
gcctggaccc gagctttaaa gttgatttta atggcaaaga aatggacgca ccgaccgcac 3840
tgcgtcagct gaatctgagt ccgctgaccc tgctgccgaa agaaggtctg gcaatgatga 3900
atggcaccag cgttatgacc ggtattgcag caaattgtgt ttatgatacc cagattctga 3960
ccgcaattgc aatgggtgtt catgcactgg atattcaggc actgaatggt acaaatcaga 4020
gctttcatcc gtttatccat aacagcaaac cgcatccggg tcagctgtgg gcagcagatc 4080
agatgattag cctgctggcc aatagccagc tggttcgtga tgaactggat ggtaaacatg 4140
attatcgtga tcatgaactg atccaggatc gttatagcct gcgttgtctg ccgcagtatc 4200
tgggtccgat tgttgatggt attagccaga ttgccaaaca aatcgaaatt gagattaaca 4260
gcgttaccga taacccgctg attgatgttg ataatcaggc aagctatcat ggtggtaatt 4320
ttctgggtca gtatgttggt atgggtatgg atcatctgcg ctattatatc ggtctgctgg 4380
caaaacatct ggatgttcag attgcactgc tggcatcacc ggaatttagc aatggtctgc 4440
ctccgagtct gctgggtaat cgtgaacgta aagttaatat gggtctgaaa ggtctgcaga 4500
tttgcggtaa tagcattatg ccgctgctga ccttttatgg taatagtatt gcagatcgtt 4560
ttccgaccca tgccgaacag tttaaccaga atattaacag ccagggttat accagcgcaa 4620
ccctggcacg tcgtagcgtt gatatttttc agaattatgt tgccattgcc ctgatgtttg 4680
gtgttcaggc agttgatctg cgtacctaca aaaaaaccgg tcattatgat gcacgtgcct 4740
gtctgtcacc ggcaaccgaa cgtctgtata gcgcagttcg tcatgttgtt ggtcagaaac 4800
cgacctcaga tcgtccgtat atttggaatg ataatgaaca gggtctggat gaacatattg 4860
cacgtattag tgcagatatt gcagccggtg gtgttattgt tcaggccgtt caggacattc 4920
tgccgtgtct gcattaaggc caaac 4945
<210> 28
<211> 56
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 28
ctagaggcca gcctggccat aaggagatat acatatgaaa accctgagcc aggcac 56
<210> 29
<211> 47
<212> DNA
<213>artificial sequence
<220>
<223>polynucleotides synthesized
<400> 29
gatggtgatg gtggccagtt tggccttaat gcagacacgg cagaatg 47
<210> 30
<211> 569
<212> PRT
<213>dotted nostoc (Nostoc punctiforme)
<400> 30
Met Asn Ile Thr Ser Leu Gln Gln Asn Ile Thr Arg Ser Trp Gln Ile
1 5 10 15
Pro Phe Thr Asn Ser Ser Asp Ser Ile Val Thr Val Gly Asp Arg Asn
20 25 30
Leu Thr Ile Asp Glu Val Val Asn Val Ala Arg His Gly Thr Gln Val
35 40 45
Arg Leu Thr Asp Asn Ala Asp Val Ile Arg Gly Val Gln Ala Ser Cys
50 55 60
Asp Tyr Ile Asn Asn Ala Val Glu Thr Ala Gln Pro Ile Tyr Gly Val
65 70 75 80
Thr Ser Gly Phe Gly Gly Met Ala Asp Val Val Ile Ser Arg Glu Gln
85 90 95
Ala Ala Glu Leu Gln Thr Asn Leu Ile Trp Phe Leu Lys Ser Gly Ala
100 105 110
Gly Asn Lys Leu Ser Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Leu Tyr Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Gln Arg Ile Glu Thr Phe Leu Asn Ala Gly Val Thr Pro His Val Tyr
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr
165 170 175
Ile Thr Gly Ala Leu Ile Gly Leu Asp Pro Ser Phe Thr Val Asp Phe
180 185 190
Asp Gly Lys Glu Met Asp Ala Val Thr Ala Leu Ser Arg Leu Gly Leu
195 200 205
Pro Lys Leu Gln Leu Gln Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Ala Lys
225 230 235 240
Val Leu Leu Ala Leu Thr Met Gly Val His Ala Leu Ala Ile Gln Gly
245 250 255
Leu Tyr Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Gln Cys Lys
260 265 270
Pro His Pro Gly Gln Leu Trp Thr Ala Asp Gln Met Phe Ser Leu Leu
275 280 285
Lys Asp Ser Ser Leu Val Arg Glu Glu Leu Asp Gly Lys His Glu Tyr
290 295 300
Arg Gly Lys Asp Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Ala
305 310 315 320
Gln Phe Ile Gly Pro Ile Val Asp Gly Val Ser Glu Ile Thr Lys Gln
325 330 335
Ile Glu Val Glu Met Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Glu Asn Gln Val Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val
355 360 365
Gly Val Thr Met Asp Arg Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys
370 375 380
His Ile Asp Val Gln Ile Ala Leu Leu Val Ser Pro Glu Phe Ser Asn
385 390 395 400
Gly Leu Pro Pro Ser Leu Val Gly Asn Ser Asp Arg Lys Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Ser Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Ser Phe Tyr Gly Asn Ser Leu Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Ile Ser Ala Asn Leu
450 455 460
Thr Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Met Ala Ile Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Met Lys Gly
485 490 495
His Tyr Asp Ala Arg Thr Cys Leu Ser Pro Asn Thr Val Gln Leu Tyr
500 505 510
Thr Ala Val Cys Glu Val Val Gly Lys Pro Leu Thr Ser Val Arg Pro
515 520 525
Tyr Ile Trp Asn Asp Asn Glu Gln Cys Leu Asp Glu His Ile Ala Arg
530 535 540
Ile Ser Ala Asp Ile Ala Gly Gly Gly Leu Ile Val Gln Ala Val Glu
545 550 555 560
His Ile Phe Ser Ser Leu Lys Ser Thr
565
<210> 31
<211> 571
<212> PRT
<213>glue must Trentepohlia (Rivularia sp.) PCC 7116
<400> 31
Met Asn Thr Val Arg Leu Thr Lys Asn Thr Val Ser Gln Lys Ser Phe
1 5 10 15
Ser Phe Leu Asn Asn Ser Asp Ala Ser Val Ile Val Gly Asp Arg Gln
20 25 30
Leu Thr Ile Glu Glu Val Val Ser Val Ala Arg Tyr Arg Ala Arg Val
35 40 45
Lys Leu Thr Glu Asn Leu Glu Lys Leu Ala Asn Val Gln Ala Ser Cys
50 55 60
Asp Phe Ile Arg Asp Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val
65 70 75 80
Thr Thr Gly Phe Gly Gly Met Ala Asn Val Val Ile Ser Pro Glu Ser
85 90 95
Ala Thr Leu Leu Gln Asn Asn Leu Met Cys Tyr His Lys Val Gly Ala
100 105 110
Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg
115 120 125
Ala Asn Ser His Val Ala Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile
130 135 140
Lys Arg Met Leu Ile Phe Leu Asn Ala Gly Val Thr Pro His Val Pro
145 150 155 160
Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Thr Pro Leu Ala Tyr
165 170 175
Ile Ser Gly Ala Leu Ile Gly Leu Asn Ser Ser Tyr Ile Val Asp Phe
180 185 190
Asp Gly Glu Glu Met Asp Ala Pro Thr Ala Leu Gln Lys Leu Gly Leu
195 200 205
Glu Pro Leu Gln Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly
210 215 220
Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Gln Asp Thr Arg
225 230 235 240
Ile Leu Leu Ala Leu Ser Val Ala Thr His Ala Leu Thr Ile Gln Gly
245 250 255
Leu Glu Gly Thr Asn Gln Ser Phe His Glu Tyr Ile His Lys Leu Lys
260 265 270
Pro His Ser Gly Gln Ile Trp Ala Ala Ser Gln Met Leu Glu Leu Leu
275 280 285
Ala Gly Ser Gly Leu Ile Arg Asp Glu Leu Asp Gly Ser His Asp Tyr
290 295 300
Arg Gly Lys Asn Pro Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro
305 310 315 320
Gln Tyr Met Gly Pro Ile Val Asp Gly Ile Glu Asp Ile Ala Lys Gln
325 330 335
Val Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val
340 345 350
Glu Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Ile
355 360 365
Gly Val Gly Met Asp Arg Leu Arg Tyr His Ile Gly Met Leu Ala Lys
370 375 380
His Leu Asp Val Gln Ile Ala Tyr Leu Val Ala Pro Glu Phe Asn Asn
385 390 395 400
Gly Leu Ser Pro Ser Leu Val Gly Asn Gln Gln Arg Thr Val Asn Met
405 410 415
Gly Leu Lys Gly Leu Gln Ile Thr Gly Asn Ser Ile Met Pro Leu Leu
420 425 430
Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu
435 440 445
Gln Tyr Asn Gln Asn Ile Asn Ser Gln Gly Phe Ala Ser Ala Asn Leu
450 455 460
Ala Arg Thr Ser Val Glu Ile Phe Gln Gln Tyr Ile Ala Leu Ala Leu
465 470 475 480
Met Phe Gly Val Gln Ser Val Asp Leu Arg Thr Tyr Ala Ile Ala Gly
485 490 495
His Tyr Asp Ala Arg Ala Thr Leu Ser Pro Ala Thr Gln Asp Leu Tyr
500 505 510
Met Ala Val Arg Asn Val Val Gly Arg Pro Pro Ser Lys Glu Arg Ala
515 520 525
Tyr Ile Trp Asp Asp Asn Glu Gln Gly Leu Asp Ser His Ile Ser Lys
530 535 540
Ile Ala Asp Asp Ile Ala Tyr Gly Gly Gln Ile Val Thr Ala Ile Ser
545 550 555 560
Glu Val Leu Ser Ala Leu Lys Ser Val Asn Asn
565 570
<210> 32
<211> 565
<212> PRT
<213>Oscillatoria (Oscillatoria sp.) PCC 6506
<400> 32
Met Ser Thr Asn Leu Thr Glu Pro Ile Ser Gln Lys Leu Leu Gln Trp
1 5 10 15
Leu Glu Pro Asn Leu Ser Ala Val Ile Val Gly Asn Arg Arg Leu Ser
20 25 30
Val Glu Glu Val Ala Arg Val Ala Arg Cys Gly Thr Arg Val Arg Leu
35 40 45
Asn Asp Glu Ser Asp Val Ala Glu Arg Val Gln Ala Ser Cys Asp Tyr
50 55 60
Ile Thr Asp Ala Val Glu Ser Gly Lys Ala Ile Tyr Gly Val Thr Thr
65 70 75 80
Gly Phe Gly Gly Met Ala Asn Thr Glu Ile Ala Pro Glu Glu Ala Ala
85 90 95
Ser Leu Gln Asn Asn Leu Ile Trp Phe Leu Lys Ala Gly Ala Gly Gln
100 105 110
Lys Leu Pro Thr Ala Cys Val Arg Ala Ala Met Leu Leu Arg Ile Asn
115 120 125
Ser His Leu Gln Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile Lys Arg
130 135 140
Met Ile Val Phe Leu Asn Ala Gly Val Thr Pro His Val Cys Glu Leu
145 150 155 160
Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ala Gln Ile Thr
165 170 175
Gly Ala Leu Ile Gly Leu Asp Asp Ser Phe Thr Val Asp Phe Asn Gly
180 185 190
Arg Glu Met Ser Ala Ile Gln Ala Leu Glu Leu Leu Asp Leu Pro Lys
195 200 205
Ile Asp Leu Arg Pro Lys Glu Gly Leu Ala Met Val Asn Gly Thr Ser
210 215 220
Val Met Thr Gly Ile Ala Ala Asn Cys Val Arg Asp Ser Gln Val Met
225 230 235 240
Leu Ala Leu Ala Met Gly Thr His Ala Leu Met Ile Gln Gly Leu Gly
245 250 255
Ala Thr Asn Gln Ser Phe His Pro Phe Ile His Lys Leu Lys Pro His
260 265 270
Phe Gly Gln Val Trp Ala Ala Ser Gln Met Val Glu Leu Leu Ala Gly
275 280 285
Ser Cys Leu Asn Arg Asp Glu Leu Asp Gly Gln His Asp Ala Arg Gly
290 295 300
Glu His Pro Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro Gln Tyr
305 310 315 320
Ile Gly Pro Ile Val Asp Gly Ile Ala Glu Ile Ala Gln Gln Ile Glu
325 330 335
Val Glu Val Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Thr Glu Asn
340 345 350
Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val Gly Val
355 360 365
Gly Met Asp Arg Leu Arg Tyr Leu Leu Gly Leu Leu Ala Lys His Leu
370 375 380
Asp Val Gln Ile Ala Leu Leu Ala Ala Pro Glu Phe Asn Asn Gly Leu
385 390 395 400
Ser Pro Ser Leu Val Gly Asn Thr Ser Arg Lys Val Asn Met Gly Leu
405 410 415
Lys Gly Leu Gln Ile Ala Gly Asn Ser Ile Met Pro Leu Leu Thr Phe
420 425 430
Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu Gln Phe
435 440 445
Asn Gln Asn Ile Asn Ser Gln Gly Phe Ala Ser Ala Asn Leu Ala Arg
450 455 460
Arg Ser Ile Glu Leu Phe Gln Gln Tyr Met Ala Ile Ser Leu Met Phe
465 470 475 480
Ala Val Gln Ala Val Asp Leu Arg Thr His Glu Val Ala Gly His Tyr
485 490 495
Asp Ala Arg Glu Cys Leu Ser Pro Leu Ser Leu Pro Leu Tyr Glu Ala
500 505 510
Val Arg Glu Val Val Gly Gln Pro Pro Asn Val Asp Arg Ser Tyr Ile
515 520 525
Trp Asn Asp Asn Glu Gln Ser Leu Asp Ile His Ile Ala Met Ile Ala
530 535 540
Ala Asp Ile Ala Gln Glu Gly Arg Ile Val Gln Ala Val Asn Gln Ile
545 550 555 560
Leu Ser Ser Leu Lys
565
<210> 33
<211> 559
<212> PRT
<213>Gloeocapsa (Gloeocapsa sp.) PCC 7428
<400> 33
Met Asn Thr Ala Val Gln Thr Gln Ala Ser Thr Asn Leu Asn Ala Asp
1 5 10 15
Thr Ile Leu Leu Gly Asp Arg Asn Leu Thr Ile Asp Glu Val Val Ser
20 25 30
Val Ala Arg His Gly Ala Lys Val Asn Ile Ser Thr Ala Asp Asn Val
35 40 45
Ala Gln Arg Ile Gln Ala Ser Cys Asp Tyr Ile Ala Glu Ala Val Ala
50 55 60
Thr Gly Arg Pro Ile Tyr Gly Val Thr Ser Gly Phe Gly Gly Met Ala
65 70 75 80
Asn Val Val Ile Ser Arg Glu Tyr Ala Asp Leu Leu Gln His Asn Leu
85 90 95
Val Trp Tyr His Lys Val Gly Ala Gly Arg Lys Leu Pro Leu Thr Asp
100 105 110
Val Arg Ala Ala Met Leu Leu Arg Val Asn Ser His Leu His Gly Ala
115 120 125
Ser Gly Ile Arg Arg Glu Ile Val Gln Arg Met Glu Met Phe Leu Asn
130 135 140
Ala Arg Val Thr Pro His Val Pro Glu Tyr Gly Ser Ile Gly Ala Ser
145 150 155 160
Gly Asp Leu Thr Pro Leu Ser Tyr Ile Thr Gly Ala Leu Ile Gly Leu
165 170 175
Asp Asp Arg Tyr Lys Val Asp Phe Asp Gly Glu Glu Ile Asp Ala Ile
180 185 190
Thr Ala Leu Glu Arg Leu Gly Leu Pro Gln Leu Gln Leu Gln Ala Lys
195 200 205
Glu Gly Leu Ala Met Met Asn Gly Thr Ser Val Met Thr Gly Ile Ala
210 215 220
Ala Asn Cys Val Tyr Asp Thr Arg Leu Leu Met Ala Leu Thr Met Gly
225 230 235 240
Ala His Ala Leu Ile Leu Gln Gly Leu Asn Gly Thr Asn Gln Ser Phe
245 250 255
His Pro Phe Ile His Lys Leu Lys Pro His Pro Gly Gln Lys Trp Ala
260 265 270
Ala Ser Thr Met Leu Asp Leu Leu Ala Gly Ser Arg Leu Ile Arg Glu
275 280 285
Glu Leu Asp Gly Thr His Glu Tyr Arg Gly Gln Ala Pro Ile Gln Asp
290 295 300
Arg Tyr Ser Leu Arg Cys Leu Ala Gln Tyr Met Gly Pro Ile Val Asp
305 310 315 320
Gly Val Ser Gln Val Ala Gln Gln Val Glu Ile Glu Met Asn Ser Ala
325 330 335
Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Gln Ala Ser Tyr His Gly
340 345 350
Gly Asn Phe Leu Gly Gln Tyr Ile Gly Met Gly Met Asp His Leu Arg
355 360 365
Tyr Tyr Ile Gly Met Met Ala Lys His Leu Asp Val Gln Ile Ala Tyr
370 375 380
Leu Val Ala Pro Glu Phe Asn Asn Gly Leu Pro Ala Ser Leu Val Gly
385 390 395 400
Asn Lys Glu Arg Ile Val Asn Met Gly Leu Lys Gly Leu Gln Ile Thr
405 410 415
Gly Asn Ser Ile Met Pro Leu Leu Ser Phe Tyr Gly Asn Ser Ile Ala
420 425 430
Asp Arg Tyr Pro Thr His Ala Glu Gln Tyr Asn Gln Asn Ile Asn Ser
435 440 445
Gln Gly Phe Ala Ala Ala Asn Leu Thr Arg Asn Ala Val Glu Ile Phe
450 455 460
Gln Gln Tyr Met Ala Ile Ala Leu Met Phe Gly Val Gln Ala Val Asp
465 470 475 480
Leu Arg Thr Tyr Ala Tyr Ala Gly His Tyr Asp Ala Ser Glu Cys Leu
485 490 495
Ser Pro Thr Thr Arg Arg Leu Tyr Gln Ala Val Arg Glu Val Val Gly
500 505 510
Gln Pro Ser Ser Ala Thr Arg Pro Tyr Ile Trp Asp Asp Arg Glu Gln
515 520 525
Pro Leu Asp Glu His Ile Ala Lys Ile Ala Ala Asp Ile Ala Ala Glu
530 535 540
Gly Val Ile Val Ala Ala Val Lys Asp Leu Leu Thr Ser Leu Lys
545 550 555

Claims (10)

1. one kind has phenylalanine lyase (PAL) active engineered polypeptide, the engineered polypeptide is by having and reference The amino acid sequence composition and the engineered polypeptide and SEQ ID of the sequence identity of sequence SEQ ID NO:4 at least 97% The amino acid residue difference that NO:4 is compared is one or more in H307G/Q/M and lower column permutation: F18H, LL47A, L214Q,C503Q,C565P;And the engineered polypeptide shown compared with the reference sequences SEQ ID NO:4 selected from Under improved characteristic: i) enhance catalytic activity, ii) reduce the sensibility to proteolysis, iii) it is increased to acidity The tolerance of pH, iv) immunogenicity that reduces or i), ii), iii) and any combination iv).
2. engineered polypeptide as described in claim 1, one of them or more displacement selected from F18H, L47A, L214Q and C503Q。
3. engineered polypeptide as described in claim 1, wherein amino acid of the engineered polypeptide compared with SEQ ID NO:4 Residue difference is H307G/Q/M, F18H and L47A.
4. engineered polypeptide as described in any one of the preceding claims, wherein the improved characteristic is selected from reduction to egg The sensibility of plain boiled water solution and/or the increased tolerance to acid pH.
5. engineered polypeptide as claimed in any one of claims 1-3, wherein the engineered polypeptide is Anabaena variabilis enzyme Variant.
6. engineered polypeptide as claimed in any one of claims 1-3, wherein the engineered polypeptide tolerance protein hydrolyzes.
7. engineered polypeptide as claimed in claim 6, wherein engineered polypeptide tolerance passes through at least one digestive enzyme Proteolysis.
8. engineered polypeptide as claimed in claim 7, wherein engineered polypeptide tolerance passes through chymotrypsin, pancreas egg The proteolysis of white enzyme, carboxypeptidase and/or elastoser.
9. engineered polypeptide as claimed in any one of claims 1-3, wherein the engineered polypeptide is that acid is stable.
10. engineered polypeptide as claimed in any one of claims 1-3, wherein the engineered polypeptide is deimmunized.
CN201811343013.3A 2013-04-18 2014-04-17 Engineered phenylalanine ammonia lyase polypeptides Active CN109486803B (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US201361813586P 2013-04-18 2013-04-18
US61/813,586 2013-04-18
US201361897932P 2013-10-31 2013-10-31
US61/897,932 2013-10-31
CN201480035201.3A CN105324483B (en) 2013-04-18 2014-04-17 It is engineered phenylalanine lyase polypeptide
PCT/US2014/034500 WO2014172541A2 (en) 2013-04-18 2014-04-17 Engineered phenylalanine ammonia lyase polypeptides

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201480035201.3A Division CN105324483B (en) 2013-04-18 2014-04-17 It is engineered phenylalanine lyase polypeptide

Publications (2)

Publication Number Publication Date
CN109486803A true CN109486803A (en) 2019-03-19
CN109486803B CN109486803B (en) 2022-10-25

Family

ID=51729192

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201811343013.3A Active CN109486803B (en) 2013-04-18 2014-04-17 Engineered phenylalanine ammonia lyase polypeptides
CN201480035201.3A Active CN105324483B (en) 2013-04-18 2014-04-17 It is engineered phenylalanine lyase polypeptide

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201480035201.3A Active CN105324483B (en) 2013-04-18 2014-04-17 It is engineered phenylalanine lyase polypeptide

Country Status (26)

Country Link
US (17) US9611468B2 (en)
EP (3) EP2986722B1 (en)
JP (5) JP6637410B2 (en)
KR (1) KR102218930B1 (en)
CN (2) CN109486803B (en)
AU (2) AU2014253900C1 (en)
CA (1) CA2909719C (en)
CY (1) CY1121891T1 (en)
DK (2) DK2986722T3 (en)
EA (1) EA034034B1 (en)
ES (2) ES2890955T3 (en)
HR (1) HRP20190928T1 (en)
HU (2) HUE043549T2 (en)
IL (1) IL242067B (en)
LT (1) LT2986722T (en)
MX (1) MX366196B (en)
MY (1) MY193198A (en)
NZ (1) NZ713396A (en)
PE (1) PE20151775A1 (en)
PH (1) PH12015502412B1 (en)
PL (1) PL2986722T3 (en)
PT (1) PT2986722T (en)
RS (1) RS58802B1 (en)
SG (1) SG11201508598YA (en)
SI (1) SI2986722T1 (en)
WO (1) WO2014172541A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114015636A (en) * 2021-12-08 2022-02-08 广东天地壹号食品研究院有限公司 Recombinant acetic acid bacteria and preparation method and application thereof

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
HUE043549T2 (en) 2013-04-18 2019-08-28 Codexis Inc Engineered phenylalanine ammonia lyase polypeptides
WO2015161019A1 (en) 2014-04-16 2015-10-22 Codexis, Inc. Engineered tyrosine ammonia lyase
CN106636048A (en) * 2016-12-14 2017-05-10 江南大学 Phenylalanine deaminase mutant with improved enzyme activity
CN106755160B (en) * 2017-01-17 2019-08-02 湖州颐辉生物科技有限公司 A kind of histidine aminonialyase and application thereof
NZ755628A (en) * 2017-02-13 2023-09-29 Codexis Inc Engineered phenylalanine ammonia lyase polypeptides
CN110914288B (en) * 2017-05-27 2023-05-05 宁波酶赛生物工程有限公司 Engineered aldolase polypeptides and uses thereof
CA3102968A1 (en) * 2018-06-12 2019-12-19 Codexis, Inc. Engineered tyrosine ammonia lyase
WO2020013951A1 (en) * 2018-07-12 2020-01-16 Codexis, Inc. Engineered phenylalanine ammonia lyase polypeptides
EP3874033A4 (en) * 2018-10-29 2022-08-03 Codexis, Inc. Engineered dna polymerase variants
EP3893860A4 (en) * 2018-12-14 2023-03-29 Codexis, Inc. Engineered tyrosine ammonia lyase
IL295353A (en) * 2020-03-20 2022-10-01 Synlogic Operating Co Inc Microorganisms engineered to reduce hyperphenylalaninemia
EP4271806A1 (en) * 2020-12-31 2023-11-08 Synlogic Operating Company, Inc. Microorganisms engineered to reduce hyperphenylalaninemia
CN114457151A (en) * 2022-01-27 2022-05-10 宁夏医科大学总医院 Detection kit for detecting gene mutation of phenylalanine hydroxylase and detection method thereof
CN117417925A (en) * 2022-07-18 2024-01-19 浙江泽科塔生物医药有限公司 PAL variants, pharmaceutical compositions comprising the same and methods for preparing the same
CN115518691B (en) * 2022-10-14 2023-11-21 天津大学 Artificial enzyme with laccase-like activity, and preparation method and application thereof
CN116478975B (en) * 2023-06-16 2023-09-01 苏州优信合生技术有限公司 High-activity phenylalanine ammonia-lyase mutant and expression strain thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090263369A1 (en) * 2007-08-17 2009-10-22 Biomarin Pharmaceutical Inc. Compositions of Prokaryotic Phenylalanine Ammonia-Lyase and Methods of Treating Cancer Using Compositions Thereof
CN101842482A (en) * 2007-05-25 2010-09-22 生物马林药物股份有限公司 Compositions of prokaryotic phenylalanine ammonia-lyase and methods of using said compositions
CN102753566A (en) * 2010-02-04 2012-10-24 生物马林药物股份有限公司 Compositions of prokaryotic phenylalanine ammonia-lyase variants and methods of using compositions thereof

Family Cites Families (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US603344A (en) 1898-05-03 Trough for well-curbs
US753653A (en) 1903-09-14 1904-03-01 American Car & Foundry Co Hopper-bottom car.
DK75593D0 (en) 1993-06-25 1993-06-25 Novo Nordisk As
US6335160B1 (en) 1995-02-17 2002-01-01 Maxygen, Inc. Methods and compositions for polypeptide engineering
US20060257890A1 (en) 1996-05-20 2006-11-16 Maxygen, Inc. Methods and compositions for cellular and metabolic engineering
US5834252A (en) 1995-04-18 1998-11-10 Glaxo Group Limited End-complementary polymerase reaction
US6995017B1 (en) 1994-02-17 2006-02-07 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US5928905A (en) 1995-04-18 1999-07-27 Glaxo Group Limited End-complementary polymerase reaction
US6406855B1 (en) 1994-02-17 2002-06-18 Maxygen, Inc. Methods and compositions for polypeptide engineering
US6117679A (en) 1994-02-17 2000-09-12 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US6395547B1 (en) 1994-02-17 2002-05-28 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US5837458A (en) 1994-02-17 1998-11-17 Maxygen, Inc. Methods and compositions for cellular and metabolic engineering
US6309883B1 (en) 1994-02-17 2001-10-30 Maxygen, Inc. Methods and compositions for cellular and metabolic engineering
US6165793A (en) 1996-03-25 2000-12-26 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US5605793A (en) 1994-02-17 1997-02-25 Affymax Technologies N.V. Methods for in vitro recombination
ATE206460T1 (en) 1994-06-03 2001-10-15 Novo Nordisk Biotech Inc PURIFIED MYCELIOPTHHORA LACCASES AND NUCLEIC ACIDS CODING THEREOF
AU2705895A (en) 1994-06-30 1996-01-25 Novo Nordisk Biotech, Inc. Non-toxic, non-toxigenic, non-pathogenic fusarium expression system and promoters and terminators for use therein
FI104465B (en) 1995-06-14 2000-02-15 Valio Oy Protein hydrolyzates for the treatment and prevention of allergies and their preparation and use
US6506602B1 (en) 1996-03-25 2003-01-14 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US6096548A (en) 1996-03-25 2000-08-01 Maxygen, Inc. Method for directing evolution of a virus
US6326204B1 (en) 1997-01-17 2001-12-04 Maxygen, Inc. Evolution of whole cells and organisms by recursive sequence recombination
JP4062366B2 (en) 1997-01-17 2008-03-19 マキシジェン,インコーポレイテッド Evolution of whole cells and organisms by recursive sequence recombination
US7148054B2 (en) 1997-01-17 2006-12-12 Maxygen, Inc. Evolution of whole cells and organisms by recursive sequence recombination
IL136574A0 (en) 1997-12-08 2001-06-14 California Inst Of Techn A method for forming a polynucleotide of desired properties
US6365408B1 (en) 1998-06-19 2002-04-02 Maxygen, Inc. Methods of evolving a polynucleotides by mutagenesis and recombination
JP4221100B2 (en) 1999-01-13 2009-02-12 エルピーダメモリ株式会社 Semiconductor device
US6436675B1 (en) 1999-09-28 2002-08-20 Maxygen, Inc. Use of codon-varied oligonucleotide synthesis for synthetic shuffling
US6917882B2 (en) 1999-01-19 2005-07-12 Maxygen, Inc. Methods for making character strings, polynucleotides and polypeptides having desired characteristics
US6368861B1 (en) 1999-01-19 2002-04-09 Maxygen, Inc. Oligonucleotide mediated nucleic acid recombination
US6376246B1 (en) 1999-02-05 2002-04-23 Maxygen, Inc. Oligonucleotide mediated nucleic acid recombination
EP1072010B1 (en) 1999-01-19 2010-04-21 Maxygen, Inc. Oligonucleotide mediated nucleic acid recombination
US6961664B2 (en) 1999-01-19 2005-11-01 Maxygen Methods of populating data structures for use in evolutionary simulations
US7702464B1 (en) 2001-08-21 2010-04-20 Maxygen, Inc. Method and apparatus for codon determining
US7024312B1 (en) 1999-01-19 2006-04-04 Maxygen, Inc. Methods for making character strings, polynucleotides and polypeptides having desired characteristics
AU3391900A (en) 1999-03-05 2000-09-21 Maxygen, Inc. Encryption of traits using split gene sequences
US7430477B2 (en) 1999-10-12 2008-09-30 Maxygen, Inc. Methods of populating data structures for use in evolutionary simulations
US6519065B1 (en) 1999-11-05 2003-02-11 Jds Fitel Inc. Chromatic dispersion compensation device
WO2001075767A2 (en) 2000-03-30 2001-10-11 Maxygen, Inc. In silico cross-over site selection
US6967097B2 (en) * 2000-07-24 2005-11-22 Pcbu Services, Inc. Phenylalainine ammonia lyase polypeptide and polynucleotide sequences and methods of obtaining and using same
US7747391B2 (en) 2002-03-01 2010-06-29 Maxygen, Inc. Methods, systems, and software for identifying functional biomolecules
US20050084907A1 (en) 2002-03-01 2005-04-21 Maxygen, Inc. Methods, systems, and software for identifying functional biomolecules
DK2278509T3 (en) 2002-03-01 2014-12-15 Codexis Mayflower Holdings Llc Methods, systems and software for identification of functional biomolecules
WO2003078583A2 (en) 2002-03-09 2003-09-25 Maxygen, Inc. Optimization of crossover points for directed evolution
US8003356B2 (en) 2004-04-20 2011-08-23 E. I. Du Pont De Nemours And Company Methods for the preparation of para-hydroxycinnamic acid and cinnamic acid at alkaline pH
AU2005286763A1 (en) 2004-09-17 2006-03-30 Biomarin Pharmaceutical, Inc. Variants and chemically-modified variants of phenylalanine ammonia-lyase
US20090038023A1 (en) 2005-03-10 2009-02-05 Verenium Corporation Lyase Enzymes, Nucleic Acids Encoding Them and Methods For Making and Using Them
US7531341B1 (en) 2006-06-12 2009-05-12 Biomarin Pharmaceutical Inc. Compositions of prokaryotic phenylalanine ammonia-lyase and methods of using compositions thereof
US20090011400A1 (en) * 2006-12-01 2009-01-08 The Salk Institute for Biological Studies and The Regents of the University of California Substrate switched ammonia lyases and mutases
US20090312196A1 (en) 2008-06-13 2009-12-17 Codexis, Inc. Method of synthesizing polynucleotide variants
HUE041367T2 (en) 2008-06-13 2019-05-28 Codexis Inc Method of synthesizing polynucleotide variants
US8383346B2 (en) 2008-06-13 2013-02-26 Codexis, Inc. Combined automated parallel synthesis of polynucleotide variants
CA2815341A1 (en) 2010-11-05 2012-05-10 Basf Plant Science Company Gmbh Methods for increasing yield and fine chemical production in plants
US9150884B2 (en) 2011-03-08 2015-10-06 Arizona Board Of Regents, A Body Corporate Of The State Of Arizona, Acting For And On Behalf Of Arizona State University Microbial conversion of glucose to styrene and its derivatives
US20130005012A1 (en) 2011-06-23 2013-01-03 Phytogene, Inc. Enzymatic system for monomer synthesis
HUE043549T2 (en) 2013-04-18 2019-08-28 Codexis Inc Engineered phenylalanine ammonia lyase polypeptides

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101842482A (en) * 2007-05-25 2010-09-22 生物马林药物股份有限公司 Compositions of prokaryotic phenylalanine ammonia-lyase and methods of using said compositions
US20090263369A1 (en) * 2007-08-17 2009-10-22 Biomarin Pharmaceutical Inc. Compositions of Prokaryotic Phenylalanine Ammonia-Lyase and Methods of Treating Cancer Using Compositions Thereof
CN101803492A (en) * 2007-08-17 2010-08-11 生物马林药物股份有限公司 The composition of prokaryotic phenylalanine ammonia-lyase and utilize its combination treatment method for cancer
CN102753566A (en) * 2010-02-04 2012-10-24 生物马林药物股份有限公司 Compositions of prokaryotic phenylalanine ammonia-lyase variants and methods of using compositions thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TSE SIANG KANG 等: "Converting an injectable protein therapeutic into an oralform:phenylalanine ammonia lyase for phenylketonuria", 《MOL GENET METAB.》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114015636A (en) * 2021-12-08 2022-02-08 广东天地壹号食品研究院有限公司 Recombinant acetic acid bacteria and preparation method and application thereof
CN114015636B (en) * 2021-12-08 2023-09-29 天地壹号饮料股份有限公司 Recombinant acetic acid bacteria and preparation method and application thereof

Also Published As

Publication number Publication date
EP2986722B1 (en) 2019-04-03
US11913045B2 (en) 2024-02-27
DK2986722T3 (en) 2019-05-13
US10941391B2 (en) 2021-03-09
US20200040325A1 (en) 2020-02-06
AU2019272057A1 (en) 2019-12-19
ES2890955T3 (en) 2022-01-25
US20190241882A1 (en) 2019-08-08
US20140314843A1 (en) 2014-10-23
MY193198A (en) 2022-09-26
JP2016516428A (en) 2016-06-09
AU2014253900B2 (en) 2019-09-19
PL2986722T3 (en) 2019-10-31
KR102218930B1 (en) 2021-02-23
LT2986722T (en) 2019-09-10
SG11201508598YA (en) 2015-11-27
NZ713396A (en) 2020-06-26
US20200199566A1 (en) 2020-06-25
US20190040377A1 (en) 2019-02-07
US20190055535A1 (en) 2019-02-21
US10160963B2 (en) 2018-12-25
EA201591995A1 (en) 2016-04-29
US20210222146A1 (en) 2021-07-22
EA034034B1 (en) 2019-12-20
CY1121891T1 (en) 2020-10-14
US10640759B2 (en) 2020-05-05
US20200032236A1 (en) 2020-01-30
PH12015502412A1 (en) 2016-02-22
CA2909719C (en) 2022-09-27
EP3521423A1 (en) 2019-08-07
EP2986722A2 (en) 2016-02-24
PH12015502412B1 (en) 2016-02-22
US10294468B2 (en) 2019-05-21
HUE055899T2 (en) 2022-01-28
AU2014253900A1 (en) 2015-11-12
JP2022183356A (en) 2022-12-08
DK3521423T3 (en) 2021-09-06
BR112015026160A2 (en) 2017-10-10
US10487319B2 (en) 2019-11-26
HRP20190928T1 (en) 2019-07-26
US10975367B2 (en) 2021-04-13
US10144924B2 (en) 2018-12-04
EP3521423B1 (en) 2021-07-21
AU2014253900C1 (en) 2020-04-02
US20170191050A1 (en) 2017-07-06
US10494623B2 (en) 2019-12-03
WO2014172541A3 (en) 2015-02-19
JP2020185017A (en) 2020-11-19
KR20150143816A (en) 2015-12-23
ES2729048T3 (en) 2019-10-30
WO2014172541A2 (en) 2014-10-23
US20200032234A1 (en) 2020-01-30
MX2015014662A (en) 2016-03-03
US10870843B2 (en) 2020-12-22
JP2020185016A (en) 2020-11-19
MX366196B (en) 2019-07-01
US20200231959A1 (en) 2020-07-23
US10358639B2 (en) 2019-07-23
JP6637410B2 (en) 2020-01-29
US9611468B2 (en) 2017-04-04
RS58802B1 (en) 2019-07-31
JP7129715B2 (en) 2022-09-02
US11136571B2 (en) 2021-10-05
HUE043549T2 (en) 2019-08-28
US20200032235A1 (en) 2020-01-30
US11130947B2 (en) 2021-09-28
SI2986722T1 (en) 2019-07-31
CN109486803B (en) 2022-10-25
US20180016568A1 (en) 2018-01-18
AU2019272057B2 (en) 2021-05-27
PT2986722T (en) 2019-06-21
US20200399627A1 (en) 2020-12-24
CN105324483A (en) 2016-02-10
CN105324483B (en) 2018-11-30
US20190055536A1 (en) 2019-02-21
US11401515B2 (en) 2022-08-02
CA2909719A1 (en) 2014-10-23
EP3971288A1 (en) 2022-03-23
US10717974B2 (en) 2020-07-21
PE20151775A1 (en) 2015-12-02
EP2986722A4 (en) 2017-02-22
JP2018166524A (en) 2018-11-01
US20200032233A1 (en) 2020-01-30
US20180037880A1 (en) 2018-02-08
IL242067B (en) 2018-03-29

Similar Documents

Publication Publication Date Title
AU2019272057B2 (en) Engineered phenylalanine ammonia lyase polypeptides
CN108271384B (en) Method for converting gram-positive bacterium genome sequence for specifically converting nucleobase targeting DNA sequence, and molecular complex used therefor
KR101786121B1 (en) Protein production in microorganisms of the phylum Labyrinthulomycota
KR101417146B1 (en) Production of isoprenoids
KR101659101B1 (en) IDENTIFICATION AND USE OF BACTERIAL [2Fe-2S] DIHYDROXY-ACID DEHYDRATASES
US6635257B1 (en) Particles of HCV envelope proteins: use for vaccination
KR20220076467A (en) New Nucleobase Editor and How to Use It
KR101522217B1 (en) Fsh producing cell clone
KR20220010540A (en) How to edit single nucleotide polymorphisms using a programmable base editor system
CN113943720A (en) Apolygus lucorum GRK gene, dsRNA thereof, synthetic method and application thereof
KR102009273B1 (en) Recombinant foot-and-mouth disease virus expressing protective antigen of type O-TAW97
KR102335519B1 (en) Vaccine composition for preventing or reducing clinical symptom of severe acute respiratory syndrome coronavirus 2
CN109929847A (en) A kind of PEX26 gene, albumen and its application
KR20230157342A (en) Multisubstituted psilocybin derivatives and methods of use
CN109852615B (en) Bidirectional promoter capable of expressing alkaline protease, application, plasmid and genetic engineering bacteria
CN116113691A (en) Engineered leucine decarboxylase
CN106795504A (en) Xyloglucan endotransglycosylase variant and encode its polynucleotides
CN111100841A (en) Establishment and application of genetic engineering cell strain and high-throughput drug screening model of anti-obesity drug target UCP1
CN112852651B (en) Method for increasing yield of hydrocortisone produced by saccharomyces cerevisiae biotransformation
CN108588100B (en) Inhibin B double-gene fragment combined expression vector and application thereof
US11987823B2 (en) Engineered lipase variants
CN114231566B (en) R26-e (CN 362-1) carrier and preparation method thereof
CN114703207A (en) Method for producing recombinant plasmid and recombinant virus
CN110129340A (en) The infection clones of zika virus MR766 strain and its application
CN114317536A (en) Preparation method for constructing uPA transgenic mouse based on CRISPR/Cas9

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant