RO118451B1 - Polypeptide with dikinase orthophosphate pyruvate activity, dna encoding the same, recombinant vector with said dna and process for modifying a plant by means of said vector - Google Patents

Polypeptide with dikinase orthophosphate pyruvate activity, dna encoding the same, recombinant vector with said dna and process for modifying a plant by means of said vector Download PDF

Info

Publication number
RO118451B1
RO118451B1 RO96-00667A RO9600667A RO118451B1 RO 118451 B1 RO118451 B1 RO 118451B1 RO 9600667 A RO9600667 A RO 9600667A RO 118451 B1 RO118451 B1 RO 118451B1
Authority
RO
Romania
Prior art keywords
amino acid
gly
ppdk
arg
sequence
Prior art date
Application number
RO96-00667A
Other languages
Romanian (ro)
Inventor
Shozo Ohta
Satoru Usami
James Nigel Burnell
Original Assignee
Japan Tobacco Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/JP1994/002022 external-priority patent/WO1995015385A1/en
Application filed by Japan Tobacco Inc. filed Critical Japan Tobacco Inc.
Publication of RO118451B1 publication Critical patent/RO118451B1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1294Phosphotransferases with paired acceptors (2.7.9)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8271Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
    • C12N15/8273Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for drought, cold, salt resistance

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Organic Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to a polypeptide with dikinase orthophosphate pyruvate activity, stable at low temperatures, containing the amino acid sequence from the 832 amino acid residue to the 955 amino acid residue, of the amino acid sequence number 5, excepting the fact that one or more amino acid residues are added, deleted or substituted, this conferring stability at low temperatures. The invention also relates to a cloned DNA encoding the previously defined polypeptide which has the nucleotide sequence presented in the serv. number 5, excepting the fact that one or more nucleotides are added, deleted or substituted. The said DNA is introduced in a recombinant vector which has a structure presented in figure 1 or 3, and which may express, in the host cell, a polypeptide having dikinase orthophosphate pyruvat activity, stable at low temperatures. The invention presents a process for modifying a plant of Flaveria bidentis or corn, for providing the stability at low temperatures, process where a previously defined recombinant vector is used as modifying agent.

Description

RO 118451 ΒRO 118451 B

Prezenta invenție se referă la o polipeptidă cu activitate piruvat ortofosfat dikinazică (denumită în continuare ca “PPDK”), un ADN care o codifică, un vector recombinant conținând ADN-ul, și la un procedeu de transformare a unei plante de Flaveria bidentis sau de porumb, prin intermediul acestui vector.The present invention relates to a polypeptide with pyruvate orthophosphate dikinase activity (hereinafter referred to as "PPDK"), a DNA encoding it, a recombinant vector containing the DNA, and a process for transforming a Flaveria bidentis plant or corn, through this vector.

Plantele C4 au capacități înalte de fotosinteză, în condiții de lumină puternică, temperatură ridicată sau CO2 scăzut. Cu toate acestea, capacitățile lor de fotosinteză sunt mult mai reduse în condiții de temperatură scăzută, cu excepția celor care sunt adaptate la condiții de temperatură scăzută. Deși PPDK (EC 2.7.9.1, care catalizează reacția în care sunt produse AMP, fosfoenolpiruvat și pirofosfat de la ATP, piruvat și ortofosfat) este una dintre cele mai importante enzime din calea C4, activitatea sa nu este suficientă în conformitate cu rata de fotosinteză din țesutul frunzei, astfel că ea este una dintre enzimele care determină rata fixării CO2 în fotosinteză C4. Mai mult decât atât, s-a evidențiat, o dată cu descoperirea acestei enzime, că ea este sensibilă la temperatură scăzută. în cazul PPDK de la porumb, activitatea enzimei are un punct de inflexiune la 11,7°C. Această temperatură coincide cu temperatura limită de creștere a porumbului. Din acestea se consideră că PPDK este una din cauzele care reduc rata de fotosinteză a plantelor C4 la temperatură scăzută. Prin urmare, prin îmbunătățirea sensibilității la temperatură scăzută a PPDK, temperatura limită de creștere a porumbului poate fi scăzută.C 4 plants have high photosynthetic capacities, under strong light, high temperature or low CO 2 conditions. However, their photosynthetic capabilities are much reduced in low temperature conditions, except for those that are adapted to low temperature conditions. Although PPDK (EC 2.7.9.1, which catalyzes the reaction in which AMP, phosphoenolpyruvate and pyrophosphate are produced from ATP, pyruvate and orthophosphate) is one of the most important enzymes in the C 4 pathway, its activity is not sufficient according to the rate of photosynthesis from the leaf tissue, so it is one of the enzymes that determines the rate of CO 2 fixation in C 4 photosynthesis. Moreover, it was revealed, together with the discovery of this enzyme, that it is sensitive to low temperature. in the case of PPDK from maize, the enzyme activity has an inflection point at 11.7°C. This temperature coincides with the limiting temperature for corn growth. From these it is considered that PPDK is one of the causes that reduce the photosynthesis rate of C 4 plants at low temperature. Therefore, by improving the low temperature sensitivity of PPDK, the limiting growth temperature of maize can be lowered.

Flaveria brownii, care este o plantă aparținând familiei Compositae, este clasificată prin tipul intermediar C3/C4 și se știe că PPDK-ul său nu este inactivat substanțial prin tratament cu temperatură scăzută la 0°C (Burnell J.N. “A comparative studyofthe cold-sensitivity of pyruvate, Pi dikinase in Flaveria species”, Plant Ceti Physiol. 31,295-2987 (1990)).Flaveria brownii, which is a plant belonging to the Compositae family, is classified by the C 3 /C 4 intermediate type and its PPDK is known not to be substantially inactivated by low temperature treatment at 0°C (Burnell JN “A comparative study of the cold -sensitivity of pyruvate, Pi dikinase in Flaveria species", Plant Ceti Physiol. 31,295-2987 (1990)).

Prin donarea genei care codifică PPDK de Flaveria brownii, stabilă la temperatură scăzută, și prin transformarea unei plante cu gena, este de așteptat ca rezistența la temperaturi mai scăzute să poată fi atribuită plantei.By cloning the gene encoding the low-temperature-stable Flaveria brownii PPDK and transforming a plant with the gene, it is expected that resistance to lower temperatures can be assigned to the plant.

Invenția de față se referă la o polipeptidă cu activitate piruvat ortofosfat dikinazică, stabilă la temperaturi joase, care conține secvența aminoacidă de la restul aminoacid 832 până la restul aminoacid 955, al secvenței aminoacide nr. 5, cu excepția faptului că unul sau mai multe resturi aminoacide sunt adăugate, deletate sau substituite, aceasta conferind polipeptidei stabilitate la temperaturi joase.The present invention relates to a polypeptide with pyruvate orthophosphate dikinase activity, stable at low temperatures, containing the amino acid sequence from amino acid residue 832 to amino acid residue 955 of amino acid sequence no. 5, except that one or more amino acid residues are added, deleted or substituted, this giving the polypeptide stability at low temperatures.

De asemenea, invenția se referă la un ADN donat, ce codifică polipeptidă defintă mai sus, care are secvența nucleotidică prezentată în secv. nr. 5, cu excepția faptului că una sau mai multe nucleotide sunt adăugate, deletate sau substituite.Also, the invention relates to a cloned DNA, which encodes the polypeptide defined above, which has the nucleotide sequence shown in seq. no. 5, except that one or more nucleotides are added, deleted or substituted.

Acest ADN este introdus într-un vector recombinant, care are o structură prezentată în fig. 1 sau 3 și care poate să exprime, în celula gazdei, o polipeptidă având activitate piruvat ortofosfat dikinazică, stabilă la temperaturi joase.This DNA is inserted into a recombinant vector, which has a structure shown in fig. 1 or 3 and which can express, in the host cell, a polypeptide having pyruvate orthophosphate dikinase activity, stable at low temperatures.

Invenția prezintă un nou procedeu de transformare a unei plante de Flaveria bidentis sau porumb, în care, în calitate de agent transformator, este folosit vectorul recombinant definit mai sus.The invention presents a new process for transforming a Flaveria bidentis or corn plant, in which, as a transforming agent, the recombinant vector defined above is used.

Invenția prezintă avantajul că pune la îndemâna specialiștilor noi mijloace eficiente pentru atribuirea stabilității, la temperaturi scăzute, a plantelor.The invention presents the advantage that it puts at the disposal of specialists new effective means for attributing the stability, at low temperatures, of plants.

în continuare, invenția va fi descrisă în detaliu, cu referire și la fig. 1...3:in the following, the invention will be described in detail, with reference also to fig. 1...3:

- fig. 1 prezintă schematic o metodă de construcție a unui vector de expresie care conține un exemplu de genă PPDK, conform prezentei invenții;- fig. 1 schematically shows a method of constructing an expression vector containing an example PPDK gene, according to the present invention;

- fig. 2 prezintă schimbarea activității enzimei în funcție de timp, când PPDK-ul de Flaveria brownii, Flaveria bidentis și porumb, care s-au exprimat în Escherichia coli, s-au menținut la 0°C; și- fig. 2 shows the change in enzyme activity as a function of time when PPDK of Flaveria brownii, Flaveria bidentis and maize, which were expressed in Escherichia coli, was kept at 0°C; and

- fig. 3 prezintă schematic o metodă de construcție al unui vector de expresie care conține un exemplu de genă PPDK, conform prezentei invenții.- fig. 3 schematically shows a method of constructing an expression vector containing an example PPDK gene, according to the present invention.

RO 118451 ΒRO 118451 B

Pentru a reuși donarea genei complete, s-a studiat intensiv PPDK a Flaveria brownii, 50 determinând secvența nucleotidică a genei și secvența aminoacidă codificată prin aceasta și, pentru identificarea regiunii din gena PPDK care dă stabilitatea la temperatură scăzută, prin aceasta completând invenția de față.To succeed in cloning the complete gene, the Flaveria brownii PPDK was intensively studied, 50 determining the nucleotide sequence of the gene and the amino acid sequence encoded by it, and to identify the region of the PPDK gene that confers low temperature stability, thereby completing the present invention.

Astfel, prezenta invenție prezintă o polipeptidă având activitate PPDK stabilă la temperatură joasă, aceasta având o secvență aminoacidă care este la fel cu secvența amino- 55 acidă a regiunii 1/6 a regiunii întregi C-terminală a polipeptidelor (1) sau (2) care urmează, cu excepția faptului că, cel puțin un rest aminoacid al numitei regiunii 1/6 este substituit cu un alt rest aminoacid, (1) o PPDK având o secvență aminoacidă arătată în SECV ID NR.:1 la 4 din Lista de secvențe; și (2) o polipeptidă care are o secvență aminoacidă cu o omologie nu mai mică de 50%, cu secvență aminoacidă menționată (1), numita polipeptidă având activitate PPDK stabilă la temperatură scăzută.Thus, the present invention provides a polypeptide having stable low-temperature PPDK activity having an amino acid sequence that is the same as the amino acid sequence of the 1/6 region of the entire C-terminal region of polypeptides (1) or (2) which follows, except that at least one amino acid residue of said 1/6 region is substituted with another amino acid residue, (1) a PPDK having an amino acid sequence shown in SEQ ID NO:1 to 4 of the Sequence Listing ; and (2) a polypeptide having an amino acid sequence with no less than 50% homology with said amino acid sequence (1), said polypeptide having low temperature stable PPDK activity.

De asemenea, prezenta invenție prezintă un ADN donat, care codifică polipeptidă având activitate PPDK stabilă la temperatură scăzută conform prezentei invenții. Supli- 65 mentar, invenția de față furnizează date referitoare la un vector recombinant care conține ADN-ul, conform prezentei invenții, care poate exprima, într-o gazdă, o polipeptidă având activitate PPDK stabilă la temperatură scăzută. Suplimentar, invenția de față mai prezintă o plantă care este transformată cu ADN-ul, conform prezentei invenții.The present invention also provides a cloned DNA encoding a polypeptide having low temperature stable PPDK activity according to the present invention. Additionally, the present invention provides data relating to a recombinant vector containing the DNA of the present invention that can express, in a host, a polypeptide having stable PPDK activity at low temperature. Additionally, the present invention features a plant that is transformed with DNA according to the present invention.

Prin invenția de față, s-a donat și secvențat o genă care codifică PPDK având stabi- 70 litate la temperatură scăzută. în plus, s-a identificat, de asemenea, regiunea din genă care dă stabilitate la temperatură joasă. Prin urmare, prin transformarea unei plante având PPDK sensibilă la temperaturi scăzute, cu gena conform prezentei invenții, PPDK-ul sensibil la temperatură scăzută poate fi schimbat cu PPDK stabil la temperatură joasă. Pe lângă acestea, prin încorporarea regiunii menționată mai sus, care atribuie stabilitate la temperatură joasă 75 în regiunea corespunzătoare a PPDK sensibil la temperatură scăzută, PPDK-ul sensibil la temperatură scăzută poate fi schimat la PPDK stabil la temperatură joasă. Prin aceasta, planta poate fi cultivată într-o regiune friguroasă, în care planta nu ar putea fi cultivată altfel. Prin urmare, este de așteptat ca prezenta invenție să contribuie mult pentru agricultură.By the present invention, a gene encoding PPDK having stability at low temperature has been cloned and sequenced. In addition, the region in the gene conferring low temperature stability was also identified. Therefore, by transforming a plant having low-temperature-sensitive PPDK with the gene of the present invention, the low-temperature-sensitive PPDK can be exchanged for low-temperature-stable PPDK. In addition, by incorporating the above-mentioned region conferring low-temperature stability 75 into the corresponding region of the low-temperature-sensitive PPDK, the low-temperature-sensitive PPDK can be mapped to the low-temperature-stable PPDK. By this, the plant can be grown in a cold region, where the plant could not be grown otherwise. Therefore, it is expected that the present invention will contribute a lot to agriculture.

Prin prezenta invenție, gena PPDK, având stabilitate la temperatură scăzută de la 80 Flaveria brownii, s-a donat și s-au determinat secvența sa nucleotidică și secvența aminoacidă dedusă, codificată prin aceasta. Secvența nucleotidică și secvența aminoacidă sunt arătate în SECV ID NR.:5 din Lista de secvențe. Așa cum s-a descris în detaliu în exemplele care urmează, această secvență s-a determinat prin extracția ARN-urilor totale din frunze de Flaveria brownii; prepararea unei bănci cADN conform unei metode convenționale; rea- 85 lizarea unei metode de hibridizare de placă, folosind o sondă preparată prin referință la regiunea genei PPDK de Flaveria bidentis și regiunea genei PPDK de porumb, regiuni care au omologie ridicată; selectarea și donarea clonelor pozitive; și secvențarea genei prin metoda dideoxi. Secvența are omologie înaltă cu gena PPDK de Flaveria bidentis care aparține aceluiași gen și are o omologie relativ ridicată cu gena PPDK de porumb. Suplimentar, sec- 90 vențele regiunii N-terminale, regiunii C-terminale și regiunile interne, ale secvenței aminoacide, deduse, sunt în totalitate aceleași cu secvențele corespunzătoare ale PPDK de la Flaveria brownii, purificate direct din frunzele verzi ale acestei plante. Ga urmare, este aparent că gena donată este gena PPDK de la Flaveria brownii. Gena PPDK de Flaveria bidentis s-a secvențat prin realizarea hibridizării de placă, folosind ca sondă cADN de po- 95 rumb; donarea clonelor pozitive; și secvențarea genei prin metoda dideoxi.By the present invention, the low temperature stable PPDK gene from 80 Flaveria brownii was cloned and its nucleotide sequence and the deduced amino acid sequence encoded by it were determined. The nucleotide sequence and amino acid sequence are shown in SEQ ID NO:5 of the Sequence Listing. As described in detail in the following examples, this sequence was determined by extracting total RNAs from Flaveria brownii leaves; preparing a cDNA library according to a conventional method; carrying out a plate hybridization method, using a probe prepared by reference to the Flaveria bidentis PPDK gene region and the maize PPDK gene region, regions that have high homology; selection and donation of positive clones; and gene sequencing by the dideoxy method. The sequence has high homology with the PPDK gene of Flaveria bidentis which belongs to the same genus and has relatively high homology with the PPDK gene of maize. In addition, the sequences of the N-terminal region, the C-terminal region and the internal regions of the deduced amino acid sequence are completely the same as the corresponding sequences of PPDK from Flaveria brownii, directly purified from the green leaves of this plant. Therefore, it is apparent that the cloned gene is the PPDK gene from Flaveria brownii. The Flaveria bidentis PPDK gene was sequenced by plate hybridization, using maize cDNA as a probe; donation of positive clones; and gene sequencing by the dideoxy method.

RO 118451 ΒRO 118451 B

Secvența aminoacidă prezentată în SECV ID NR.:5 este nouă, și 40 de resturi aminoacide ale acesteia sunt diferite de secvența PPDK de Flaveria bidentis, care aparține aceluiași gen. Circa 180 resturi aminoacide ale acesteia sunt diferite de secvența aminoacidă a PPDK de porumb. Astfel, în ciuda faptului că secvența aminoacidă a PPDK de Flaveria brownii, prezentată SECV ID NR.:5 are omologie ridicată cu secvența aminoacidă a PPDK de Flaveria bidentis, care aparține aceluiași gen, PPDK de Flaveria bidentis este sensibilă la temperatură scăzută, în timp ce cea de Flaveria brownii este stabilă la frig. Astfel, mica diferență din resturile aminoacide aduce cu sine o diferență importantă a caracterului. Invenția de față asigură o genă PPDK donată, care codifică secvența aminoacidă prezentată SECV ID NR.:5. Așa cum s-a menționat mai sus, această secvență aminoacidă este nouă și are un efect pronunțat, care este stabilitatea la temperatura scăzută. Gena conform invenției prezente nu este restrictivă numai la cea care are secvența nucleotidică prezentată SECV ID NR.:5, ci la orice secvență nucleotidică care codifică această secvență aminoacidă, aceasta fiind în cuprinsul genei conform invenției de față.The amino acid sequence shown in SEQ ID NO:5 is novel, and 40 amino acid residues thereof are different from the PPDK sequence of Flaveria bidentis, which belongs to the same genus. About 180 amino acid residues of it differ from the amino acid sequence of maize PPDK. Thus, despite the fact that the amino acid sequence of Flaveria brownii PPDK, shown in SEQ ID NO:5 has high homology with the amino acid sequence of Flaveria bidentis PPDK, which belongs to the same genus, Flaveria bidentis PPDK is sensitive to low temperature, while that of Flaveria brownii is cold stable. Thus, the small difference in amino acid residues brings with it an important difference in character. The present invention provides a cloned PPDK gene encoding the amino acid sequence set forth in SEQ ID NO:5. As mentioned above, this amino acid sequence is novel and has a pronounced effect, which is stability at low temperature. The gene according to the present invention is not restricted only to the one having the nucleotide sequence shown in SEQ ID NO:5, but to any nucleotide sequence encoding this amino acid sequence, which is included in the gene according to the present invention.

Pentru identificarea regiunii din gena PPDK de Flaveria browniiprezentată SECV ID NR.:5, inventatorii prezentei invenții au selectat regiunea care dă stabilitate la temperatură scăzută.For the identification of the region in the Flaveria brownii PPDK gene shown in SEQ ID NO:5, the inventors of the present invention selected the region that confers stability at low temperature.

Astfel, așa cum s-a descris în detaliu în exemplele care urmează, gena PPDK de Flaveria browniis-a împărțit în trei regiuni care au aproximativ aceeași mărime, prin enzime de restricție; fiecare dintre regiuni s-a schimbat cu regiunea corespunzătoare a genei PPDK de porumb, pentru a forma gene PPDK himerice; și s-a determinat dacă PPDK-urile codificate prin genele himerice obținute au sau nu stabilitate la temperatură joasă. Ca urmare, s-a confirmat că regiunea care dă stabilitate la temperatură joasă există în ultima 1/3 regiune a genei PPDK de Flaveria brownii. Suplimentar, această ultimă 1/3 regiune s-a împărțit printr-o enzimă de restricție în două regiuni care au aproximativ aceeași mărime și, prin aceeași metodă, s-au determinat regiunile care conțin regiunea care dă stabilitate la temperatură scăzută. Ca rezultat, s-a confirmat că funcția de a da stabilitate la temperatură joasă este codificată în regiunea din amonte a sitului Xhol a genei PPDK de Flaveria brownii, prezentată în SECV ID NR.:5. Astfel, s-a confirmat că funcția de a da stabilitate la frig este localizată în secvența aminoacidă între restul aminoacid arginină de la poziția 832 și restul aminoacid valină de la poziția 955, ale secvenței aminoacide prezentată în SECV ID NR.:5 (secvența aminoacidă de la cel de-al 836-lea rest aminoacid până la cel de-al 955-lea rest aminoacid poate fi denumită de aici înainte ca “secvența care dă stabilitate la temperatură joasă”).Thus, as described in detail in the examples that follow, the PPDK gene of Flaveria browniis split into three regions of approximately the same size by restriction enzymes; each of the regions was swapped with the corresponding region of the maize PPDK gene to form chimeric PPDK genes; and whether or not PPDKs encoded by the obtained chimeric genes have low temperature stability was determined. As a result, it was confirmed that the region conferring low temperature stability exists in the last 1/3 region of the Flaveria brownii PPDK gene. Additionally, this last 1/3 region was split by a restriction enzyme into two regions that are approximately the same size, and by the same method, the regions containing the low temperature stability region were determined. As a result, it was confirmed that the function of conferring low temperature stability is encoded in the upstream region of the Xhol site of the Flaveria brownii PPDK gene shown in SEQ ID NO:5. Thus, it was confirmed that the function of giving cold stability is located in the amino acid sequence between the arginine amino acid residue at position 832 and the valine amino acid residue at position 955, of the amino acid sequence presented in SEQ ID NO.:5 (amino acid sequence from the 836th amino acid residue to the 955th amino acid residue may hereinafter be referred to as "the sequence conferring low temperature stability").

Astfel, s-a dovedit că regiunea în legătură cu stabilitatea la frig a PPDK este localizată în regiunea 1/6 a întregii regiuni C-terminale. Pe de altă parte, în SECV ID NR:1 din Lista de secvențe sunt secvența nucleotidică a genei care codifică PPDK de Flaveria bidentis și secvența aminoacidă codificată prin aceasta, și în SECV ID NR:2 sunt secvența nucleotidică a genei care codifică PPDK de porumb și secvența aminoacidă dedusă, codificată prin aceasta {Biochemistry 29,10757-10765 (1990)). în SECV ID NR:4, sunt prezentate secvența nucleotidică a genei care codifică PPDK de Entamoeba histolytica, care este o bacterie, și secvența aminoacidă dedusă, codificată prin aceasta {MolecularandBiochemical Parasitology 62,153-156 (1993)). Așa cum s-a descris mai sus, s-a dovedit prin invenția de față că regiunea legată de stabilitatea la temperatură scăzută a PPDK este localizată în 1/6 regiunea întregii regiuni de la C-terminal; este posibil să se obțină PPDK stabil la temperatură scăzută, prin substituția a cel puțin unui rest aminoacid din 1/6 regiunea întregii regiuni C-terminal a secvenței aminoacide, arătată în SECV ID NR:1,2, 3 sau 4.Thus, the region related to cold stability of PPDK was shown to be located in region 1/6 of the entire C-terminal region. On the other hand, in SEQ ID NO:1 of the Sequence Listing are the nucleotide sequence of the Flaveria bidentis PPDK encoding gene and the amino acid sequence encoded thereby, and in SEQ ID NO:2 are the nucleotide sequence of the maize PPDK encoding gene and the deduced amino acid sequence encoded thereby {Biochemistry 29, 10757-10765 (1990)). in SEQ ID NO:4, the nucleotide sequence of the gene encoding PPDK of Entamoeba histolytica, which is a bacterium, and the deduced amino acid sequence encoded thereby {MolecularandBiochemical Parasitology 62,153-156 (1993)) are shown. As described above, it has been shown by the present invention that the region related to the low temperature stability of PPDK is located in the 1/6 region of the entire C-terminal region; it is possible to obtain PPDK stable at low temperature by substituting at least one amino acid residue in the 1/6 region of the entire C-terminal region of the amino acid sequence shown in SEQ ID NO:1,2, 3 or 4.

RO 118451 ΒRO 118451 B

Aici, termenul “stabil la temperatură joasă” înseamnă că activitatea enzimei după ce se lasă enzima să stea la 0°C, timp de 20 min, nu este mai mică de 60% din activitatea 145 inițială.Here, the term "stable at low temperature" means that the enzyme activity after allowing the enzyme to stand at 0°C for 20 min is not less than 60% of the initial 145 activity.

Așa cum s-a menționat mai sus, deoarece regiunea de la cel de-al 832-lea aminoacid, arginină, până la cel de-al 955-lea rest aminoacid, valina, din secvența aminaocidă prezentată în SECV ID NR:5 definește stabilitatea la frig, PPDK-urile sensibile la temperatură scăzută, având secvența aminaocidă prezentată în SECV ID NR:1-4, pot fi transfor- 150 mate în PPDK-urile stabile la temperaturi joase, prin substituirea regiunilor corespunzătoare a PPDK-urilor sensibile la frig cu secvența aminaocidă de la cel de-al 832-lea rest aminoacid, ariginină, până la cel de-al 955-lea rest aminoacid, valină, a secvenței aminaocide prezentată în SECV ID NR:5. Această constatare este foarte importantă, deoarece, prin utilizarea ei, orice PPDK sensibil la frig dorit poate fi transformat într-un PPDK stabil la frig. 155 Metoda de atribuire a stabilității la temperaturi joase unui PPDK nu este restricționată la metoda descrisă în exemplele de mai jos, în care secvența care dă stabilitate la temperatură joasă a PPDK de Flaveria brownii este schimbată cu regiunea corespunzătoare a PPDK, sensibilă la temperatură joasă, pentru a pregăti o genă himerică, ci stabilitatea la temperatură scăzută poate fi dată prin schimbarea regiunii corespunzătoare a PPDK, sensibilă la 160 temperatură scăzută, a unei plante, cu aceeași secvență ca care dă stabilitatea la frig a Flaveria brownii prin mutageneză dirijată in sit. Prin urmare, orice ADN care codifică o polipeptidă având activitate PPDK, care conține secvența care dă stabilitate la temperatură joasă, descrisă mai sus, este în cuprinsul prezentei invenții. în special PPDK în care al 869-lea rest aminoacid din secvența aminaocidă arătată în SECV ID NR:1 este substituit prin 165 prolină, și PPDK în care cel de-al 885-lea și cel de-al 952-lea rest aminoacid din secvența aminoaocidă arătată în SECV ID NR:1 este substituit prin leucină și, respectiv, valină, au stabilitate la temperatură scăzută.As noted above, because the region from the 832nd amino acid, arginine, to the 955th amino acid residue, valine, of the aminocidal sequence shown in SEQ ID NO:5 defines cold stability , the low-temperature-sensitive PPDKs having the aminicidal sequence shown in SEQ ID NO:1-4 can be transformed into the low-temperature-stable PPDKs by substituting the corresponding regions of the cold-sensitive PPDKs with the sequence aminoacid from the 832nd amino acid residue, ariginine, to the 955th amino acid residue, valine, of the aminoacidic sequence shown in SEQ ID NO:5. This finding is very important because by using it, any desired cold-sensitive PPDK can be converted into a cold-stable PPDK. 155 The method of conferring low temperature stability to a PPDK is not restricted to the method described in the examples below, in which the low temperature stability conferring sequence of the Flaveria brownii PPDK is exchanged with the corresponding low temperature sensitive region of the PPDK, to prepare a chimeric gene, but low-temperature stability can be conferred by changing the corresponding region of the low-temperature-sensitive PPDK of a plant with the same sequence as confers cold stability in Flaveria brownii by site-directed mutagenesis. Therefore, any DNA encoding a polypeptide having PPDK activity, which contains the sequence conferring low temperature stability, described above, is within the scope of the present invention. in particular PPDK in which the 869th amino acid residue of the aminicidal sequence shown in SEQ ID NO:1 is substituted by 165 proline, and PPDK in which the 885th and 952nd amino acid residue of the sequence amino acid shown in SEQ ID NO:1 is substituted by leucine and valine, respectively, have low temperature stability.

PPDK-urile la care poate fi dată stabilitate la temperatură joasă nu sunt limitate la cele arătate în SECV ID NR:1-4, putând fi oricare dintre cele care au omologii nu mai mici 170 de 50%, la secvențele prezentate în SECV ID NR:1-4. De preferință, secvența nucleotidică a genei care codifică PPDK, căreia îi poate fi atribuită stabilitatea la temperatură scăzută, are o omologie la gena PPDK de Flaveria browniiîntr-o cantitate nu mai mică de 48,5%, mai preferat, nu mai mică de 90%.The PPDKs that can be given low temperature stability are not limited to those shown in SEQ ID NO:1-4, and can be any of those that have no less than 170 homology to the sequences shown in SEQ ID NO: :1-4. Preferably, the nucleotide sequence of the gene encoding PPDK, to which low temperature stability can be attributed, has homology to the Flaveria brownii PPDK gene in an amount of not less than 48.5%, more preferably not less than 90 %.

Este binecunoscut în domeniu că există cazuri în care activitatea fiziologică a unei 175 peptide active fiziologic se menține chiar dacă secvența aminaocidă a peptidei se modifică până la o întindere mică și chiar dacă unul sau mai multi aminoacizi din secvența aminoacidă sunt substituiți sau deletați, sau chiar dacă se adaugă unul sau mai mulți aminoacizi la secvența aminoacidă. Prin urmare, polipeptidele care au aceeași secvență aminoacidă ca cea arătată în SECV ID NR:5, cu excepția faptului că polipeptidele au asemenea modifi- 180 cări, dar au activitate PPDK stabilă la temperatură scăzută, sunt incluse în cuprinsul invenției de față. Cu alte cuvinte, polipeptidele care au aceeași secvență aminoacidă ca cea prezentată în SECV ID NR:5, cu excepția faptului că unul sau mai mulți aminoacizi sunt adăugați, deletați sau substituiți, dar care au activitate PPDK stabilă la temperatură scăzută, sunt incluse în cuprinsul invenției de față. în mod similar, ADN-urile care au aceeași secvență nu- 185 cleotidică ca cea din SECV ID NR:5, cu excepția faptului că una sau mai multe nucleotide sunt adăugate, deletate sau substituite, și care codifică polipeptidele având activitate PPDK stabilă la temperatură scăzută, intră, de asemenea, în întinderea prezentei invenții.It is well known in the art that there are cases where the physiological activity of a physiologically active peptide is maintained even if the aminicidal sequence of the peptide is changed to a small extent and even if one or more amino acids in the amino acid sequence are substituted or deleted, or even if one or more amino acids are added to the amino acid sequence. Therefore, polypeptides having the same amino acid sequence as shown in SEQ ID NO:5, except that the polypeptides have such modifications but have stable PPDK activity at low temperature, are included within the scope of the present invention. In other words, polypeptides having the same amino acid sequence as that shown in SEQ ID NO:5, except that one or more amino acids are added, deleted or substituted, but having stable PPDK activity at low temperature, are included in of the present invention. similarly, DNAs having the same nucleotide sequence as that of SEQ ID NO:5, except that one or more nucleotides are added, deleted or substituted, and which encode polypeptides having temperature-stable PPDK activity low, also falls within the scope of the present invention.

RO 118451 ΒRO 118451 B

Modificările ADN-ului, care duc la adiția, deleția sau substituția secvenței aminoacide codificate prin acesta se pot obține prin mutageneze specifice in sit care sunt binecunoscute în domeniu (de exemplu, Nucleic Acid Research, voi. 10, nr. 20, p. 6487-6500,1982). în prezenta descriere, “unul sau mai mulți aminoacizi” înseamnă numărul aminoacizilor care pot fi adăugați, deletați sau substituiți prin mutageneză de sit specific.Modifications of DNA, resulting in the addition, deletion or substitution of the amino acid sequence encoded by it can be achieved by site-specific mutagenesis which are well known in the art (eg, Nucleic Acid Research, Vol. 10, No. 20, p. 6487 -6500, 1982). As used herein, "one or more amino acids" means the number of amino acids that may be added, deleted, or substituted by site-specific mutagenesis.

Mutageneza de sit specific se poate realiza, de exemplu, folosind un primer oligonucleotidic sintetic complementar la un fag ADN monocatenar, exceptând mutația dorită care urmează. Astfel, folosind oligonucleotida sintetică menționată mai sus ca primer, se produce de către un fag o catenă complementară și se transformă celule bacteriene-gazdă, cu ADN-ul dublu catenar obținut. Cultura de celule bacteriene transformate se depune pe agar și se formează plăci de la o celulă singulară, care conține fagul. Teoretic 50% din noile colonii conțin fagul, care are o catenă monobandă care poartă mutația, și 50% din coloniile rămase conțin fagul având secvența inițială. Plăcile obținute se supun apoi hibridizării cu o sondă sintetică, tratată cu kinază la o temperatură la care sonda se hibridizează cu ADN-ul care are exact aceeași secvență ca ADN-ul având mutatia dorită, dar nu cu secvența ADN inițială, care nu este complet complementară cu sonda. Apoi, se colectează plăcile la care s-a observat hibridizarea, se cultivă și se recuperează ADN-ul.Site-specific mutagenesis can be accomplished, for example, by using a synthetic oligonucleotide primer complementary to a single-stranded DNA phage, except for the desired mutation that follows. Thus, using the synthetic oligonucleotide mentioned above as a primer, a complementary strand is produced by a phage and bacterial-host cells are transformed with the double-stranded DNA obtained. The culture of transformed bacterial cells is plated on agar and plates are formed from a single cell, containing the phage. Theoretically 50% of the new colonies contain the phage, which has a single-stranded strand carrying the mutation, and 50% of the remaining colonies contain the phage having the original sequence. The plates obtained are then hybridized with a synthetic probe, treated with kinase at a temperature at which the probe hybridizes with DNA that has exactly the same sequence as the DNA with the desired mutation, but not with the original DNA sequence, which is not complete complementary to the probe. Plates where hybridization was observed are then collected, cultured, and DNA recovered.

în plus, la mutageneza în sit specific, menționată mai sus, metodele pentru substituirea, deleția sau adiția unuia sau mai mulți aminoacizi fără pierderea activității enzimatice includ o metodă în care gena se tratează cu un mutagen și o metodă în care gena este clivată selectiv, se recuperează o anumită nucleotidă, se adaugă sau se substituie și apoi gena este ligată.In addition, in the aforementioned site-specific mutagenesis, methods for the substitution, deletion or addition of one or more amino acids without loss of enzyme activity include a method in which the gene is treated with a mutagen and a method in which the gene is selectively cleaved, a specific nucleotide is retrieved, added or substituted, and then the gene is ligated.

Prin transformarea unei plante cu gena gena PPDK de Flaveria brownii sau cu un ADN care codifică o polipeptidă conținând secvența care dă stabilitate la temperatură scăzută, care are activitate PPDK, se poate obține o plantă rezistentă la frig. Exemple ale plantelor preferate care pot fi transformate includ, deși nu se limitează la acestea, porumb, trestie de zahăr, mei, iarbă de pășune și sorg.By transforming a plant with the PPDK gene of Flaveria brownii or with a DNA encoding a polypeptide containing the sequence conferring low temperature stability, which has PPDK activity, a cold-resistant plant can be obtained. Examples of preferred plants that can be transformed include, but are not limited to, corn, sugarcane, millet, pasture grass, and sorghum.

Au fost deja stabilite metode pentru transformarea plantelor, și metoda care utilizează Agrobacterium tumefaciens poate fi folosită în mod preferat. Metoda pentru transformarea plantelor utilizând Agrobacterium tumefaciens este binecunoscută în domeniu. Prin această metodă pot fi transformate atât dicotiledonate (de exemplu, cererea de brevet japoneză (Kokai) nr. 4-330234), cât și monodicotiledonate (WO 94/00977). Alternativ, ADN-ul se poate introduce în protoplaste de plante, prin metoda electroporării sau prin alte metode binecunoscute în domeniu. Suplimentar, transformarea se poate realiza, de asemenea, prin atașarea ADN-ului la particule de tungsten și alte asemenea metode, și prin implantarea particulelor în embrionul unei plante. Metode concrete de astfel de transformări sunt descrise în exemplele care se dau în continuare.Methods for transforming plants have already been established, and the method using Agrobacterium tumefaciens can be preferably used. The method for transforming plants using Agrobacterium tumefaciens is well known in the art. Both dicotyledonates (eg, Japanese Patent Application (Kokai) No. 4-330234) and monodicotyledonates (WO 94/00977) can be converted by this method. Alternatively, DNA can be introduced into plant protoplasts by electroporation or other methods well known in the art. Additionally, transformation can also be accomplished by attaching DNA to tungsten particles and other such methods, and by implanting the particles into a plant embryo. Concrete methods of such transformations are described in the following examples.

Prezenta invenție se va descrie acum concret, prin intermediul exemplelor. Totuși, prezenta invenție nu este limitată la exemplele de mai jos.The present invention will now be described concretely by means of examples. However, the present invention is not limited to the examples below.

Exemplul 1. donarea și secvențarea genei PPDK de Flaveria brownii (1) Prepararea băncii cADN și donarea cADN-ului de lungime totală (I) Prepararea băncii cADNExample 1. Flaveria brownii PPDK Gene Cloning and Sequencing (1) cDNA Bank Preparation and Full-Length cDNA Cloning (I) cDNA Bank Preparation

Prin metoda clorhidrat de guanidină/fenol, din frunze verzi (60 g) de Flaveria brownii s-au izolat ARN-uri totale. Prin această metodă s-au obținut 26,5 mg de ARN după precipitare cu litiu. Apoi, s-au obținut 118,9 pg de ARN poli(A)+ din 13,2 mg de ARN, conform uneiBy the guanidine hydrochloride/phenol method, total RNAs were isolated from green leaves (60 g) of Flaveria brownii. This method yielded 26.5 mg of RNA after lithium precipitation. Then, 118.9 µg of poly(A) + RNA was obtained from 13.2 mg of RNA, according to a

RO 118451 Β metode convenționale, folosind o coloană care conține Oligo dT celuloză Tip 7 (accesibilă comercial de la Pharmacia). Pentru prepararea băncii cADN s-au folosit TimeSaver cADN 235 Synthesis Kit (accesibil comercial de la Pharmacia), vector Lambda ZAPII (accesibil comercial de la STRATAGENE) și reactivul de împachetare atașat la sistemul de donare cADN Ăgtlo (accesibil comercial de la AMERSHAM). Folosind un linker EcoRI/Notl, s-a preparat o bancă cADN, în care fragmente ADN sunt inserate în situl EcoRI al vectorilor Lambda ZAPII.RO 118451 Β conventional methods, using a column containing Oligo dT cellulose Type 7 (commercially available from Pharmacia). The TimeSaver cDNA 235 Synthesis Kit (commercially available from Pharmacia), the Lambda ZAPII vector (commercially available from STRATAGENE) and the packaging reagent attached to the Ăgtlo cDNA cloning system (commercially available from AMERSHAM) were used to prepare the cDNA bank. Using an EcoRI/NotI linker, a cDNA library was prepared, in which DNA fragments are inserted into the EcoRI site of Lambda ZAPII vectors.

Mărimea băncii cADN preparată a fost 415000 pfu. Ca celule-gazdă, s-au folosit celule 240 XIII-Blue.The size of the prepared cDNA library was 415,000 pfu. As host cells, 240 XIII-Blue cells were used.

(ii) Prepararea sondei(ii) Probe preparation

Folosind un primer care are secvența 5': GACGGCTAAAAAGAGGGT (desemnat pe baza regiunilor PPDK de Flaveria bidentis și PPDK de porumb care au omologie înaltă) și un primer R care are secvența TATCGAGAAACCTTCTATAC (o parte a secvenței PPDK 245 de Flaveria bidentis, catena complementară), s-a amplificat un fragment ADN originar din ARN-ul de Flaveria brownii prin transcripție inversă PCR și fragmentul ADN amplificat s-a inserat în vectorul pCRII (accesibil comercial de la INVITROGEN). Folosind vectorul obținut ca matriță și folosind aceiași primeri ca cei menționați mai sus, s-a amplificat ADN-ul, și produsul PCR s-a supus la electroforeză urmată de recuperarea ADN-ului din gel, folosind 250 SUPREC-ol (accesibil comercial de la TAKARA SHUZO). Prin acest proces se poate obține un fragment ADN, care are o mărime de 428 bp începând de la cea de-a 24-a bp în amonte de N-terminal proteinei mature. Acest fragment s-a marcat cu 32P folosind sistemul de marcaj Multiprime DNA (accesibil comercial de la AMERSHAM) pentru a prepara o sondă.Using a primer that has the sequence 5': GACGGCTAAAAAGAGGGT (designated based on regions of Flaveria bidentis PPDK and maize PPDK that have high homology) and an R primer that has the sequence TATCGAGAAACCTTCTATAC (part of Flaveria bidentis PPDK 245 sequence, complementary strand) , a DNA fragment originating from Flaveria brownii RNA was amplified by reverse transcription PCR and the amplified DNA fragment was inserted into the pCRII vector (commercially available from INVITROGEN). Using the obtained vector as a template and using the same primers as mentioned above, the DNA was amplified, and the PCR product was subjected to electrophoresis followed by DNA recovery from the gel using 250 SUPREC-ol (commercially available from TAKARA SHUZO) . Through this process a DNA fragment can be obtained, which has a size of 428 bp starting from the 24th bp upstream of the N-terminus of the mature protein. This fragment was labeled with 32 P using the Multiprime DNA labeling system (commercially available from AMERSHAM) to prepare a probe.

(iii) Clonarea cADN-ului de lungime totală de la Flaveria brownii 255(iii) Cloning of full-length cDNA from Flaveria brownii 255

Biblioteca cADN s-a cercetat prin metoda hibridizării plăcii, folosind ca sondă fragmentul ADN menționat mai sus. Ca filtru de hibridizare, s-a folosit Hybond N+ (AMERSHAM), și hibridizarea s-a realizat în 6 x SSC care conține 5 x soluția lui Denhalt, 0,1% SDS și 100 pg/ml ADN din glande sexuale de salmon denaturat la 65°C peste noapte. Spălarea s-a realizat cu 2 x SSC care conține 0,1% SDS la temperatura camerei timp de 5 min; cu 260 2 x SSC care conține 0,1% SDS la temperatura camerei timp de 90 min; și 1 x SSC care conține 0,1% SDS la 68°C, timp de 90 min. Prin această operațiune s-au obținut 28 de clone independente pozitive. Din acestea, 11 plăci care au prezentat semnale puternice s-au selectat și s-au supus la un al 2-lea screening. Acesta s-a realizat în același mod ca și primul, cu excepția faptului că timpul celei de-a doua spălări a fost de 60 min. Ca rezultat, de la 6 265 clone s-au obținut plăci pozitive, independente, originare, de la un singur fag. Pentru verificarea mărimilor fragmentelor ADN inserate, s-a realizat PCR folosind primerul R, descris mai sus, M13PrimerM4 (GTTTTCCCAGTCACGAC, accesibil comercial de la TAKARA SHUZO), și M13PrimerRV (CAGGAAACAGCTATGAC, accesibil comercial de la TAKARA SHUZO), ca primeri, și folosind fagul ca matriță. Ca urmare, două clone au conținut cADN-ul de Iun- 270 gime totală. După aceasta, prin excizie in vivo, fragmentele ADN inserate s-au subclonat într-un vector plasmid pBluescriptIISK (accesibil comercial de la STRATAGENE). Plasmizii recombinați, obținuți prin subclonare, s-au numit p411 și p631.The cDNA library was screened by the plate hybridization method, using the aforementioned DNA fragment as a probe. Hybond N + (AMERSHAM) was used as the hybridization filter, and hybridization was performed in 6x SSC containing 5x Denhalt's solution, 0.1% SDS, and 100 µg/ml DNA from denatured salmon gonads at 65° C overnight. Washing was done with 2 x SSC containing 0.1% SDS at room temperature for 5 min; with 260 2 x SSC containing 0.1% SDS at room temperature for 90 min; and 1x SSC containing 0.1% SDS at 68°C for 90 min. This operation yielded 28 independent positive clones. From these, 11 plaques that showed strong signals were selected and subjected to a 2nd screening. This was done in the same way as the first, except that the time of the second wash was 60 min. As a result, positive, independent plaques originating from a single phage were obtained from 6,265 clones. To check the sizes of the inserted DNA fragments, PCR was performed using primer R, described above, M13PrimerM4 (GTTTTCCCAGTCACGAC, commercially available from TAKARA SHUZO), and M13PrimerRV (CAGGAAACAGCTATGAC, commercially available from TAKARA SHUZO) as primers, and using phage as matrix. As a result, two clones contained the total Iun-270 cDNA. After this, by in vivo excision, the inserted DNA fragments were subcloned into a plasmid vector pBluescriptIISK (commercially available from STRATAGENE). The recombinant plasmids obtained by subcloning were named p411 and p631.

Prin PCR descris mai sus, s-a dovedit că banca preparată așa cum s-a descris mai sus a fost o bancă conținând inserte suficient de lungi care a fost completă pentru căutarea 275 cADN. Astfel, pentru izolarea mARN-urilor de Flaveria brownii este avantajos să se folosească metoda descrisă mai sus și să se obțină o cantitate mare de mARN-uri prin tratarea o dată a unei cantități mari de ARN-uri așa cum s-a descris mai sus.By the PCR described above, it was shown that the library prepared as described above was a library containing sufficiently long inserts that was complete for the 275 cDNA search. Thus, for the isolation of Flaveria brownii mRNAs it is advantageous to use the method described above and to obtain a large amount of mRNAs by treating a large amount of RNAs at once as described above.

RO 118451 Β în plus, deoarece banca cADN descrisă mai sus conține un număr de inserte care sunt suficient de lungi, căutarea cADN de lungime totală a fost posibil să fie realizată cu ușurință prin prepararea unei sonde folosind un primer care hibridizează cu o regiune din vecinătatea regiunii de procesare a proteinei dorite.RO 118451 Β in addition, since the cDNA library described above contains a number of inserts that are sufficiently long, the search for full-length cDNA was possible to perform easily by preparing a probe using a primer that hybridizes to a region in the vicinity the processing region of the desired protein.

(2) Determinarea secvenței nucleotidice totale a cADN și compararea secvenței aminoacide deduse(2) Determination of the total nucleotide sequence of the cDNA and comparison of the deduced amino acid sequence

Pentru a determina secvența nucleotidică totală cADN inserat lui p631, s-au preparat mutante de deleție. Mutantele de deleție s-au preparat folosind Deletion-Kit pentru KiloSequence (accesibil comercial de la TAKARA SHUZO). Totuși, reacția prin exonuclează III s-a oprit prin transferarea amestecului de reacție la tampon Mung Bean Nuclease menținut la 65°C. Determinarea secvenței nucleotidice s-a realizat folosind un plasmid purificat, prin folosirea unui plasmid purificat Qiagen Plasmid Mini Kit (accesibil comercial de la Diagen), Taq Dydeoxy terminator Cycle Kit (accesibil comercial de la ABI) și Applied Biosystems 373A DNA Sequencer (ABI). S-au secvențat ambele catene, cu excepția unor părți ale acestora. Pe baza secvenței nucleotidice determinate, s-a determinat secvența aminoacidă. Secvența nucleotidică determinată și secvența aminoacidă sunt prezentate în SECV ID NR:5 din Lista de secvențe.To determine the total nucleotide sequence of the inserted p631 cDNA, deletion mutants were prepared. Deletion mutants were prepared using Deletion-Kit for KiloSequence (commercially available from TAKARA SHUZO). However, the exonuclease III reaction was stopped by transferring the reaction mixture to Mung Bean Nuclease buffer maintained at 65°C. Nucleotide sequence determination was performed using a purified plasmid using the Qiagen Plasmid Mini Kit (commercially available from Diagen), the Taq Dydeoxy terminator Cycle Kit (commercially available from ABI), and the Applied Biosystems 373A DNA Sequencer (ABI). Both strands were sequenced, except for parts of them. Based on the determined nucleotide sequence, the amino acid sequence was determined. The determined nucleotide sequence and amino acid sequence are shown in SEQ ID NO:5 of the Sequence Listing.

Ar fi de remarcat că, în prepararea clonelor de deleție pentru secvențarea cADN-ukui izolat, a fost important să se încălzească înainte tamponul Mung Bean Nuclease la 65°C, deoarece reacția nu se putea opri numai prin transferarea amestecului de reacție la tamponul Mung Bean Nuclease. Suplimentar, deoarece secvența la regiunea de până la circa 600-900 bp din vectorul mai mare și insertul nu s-au putut determina numai prin deleția insertului din aval, a fost necesar să se realizeze deleția de la ambele capete.It should be noted that in preparing the deletion clones for sequencing the isolated cDNA, it was important to preheat the Mung Bean Nuclease buffer to 65°C, as the reaction could not be stopped by simply transferring the reaction mixture to the Mung Bean buffer Nucleases. Additionally, since the sequence up to about 600-900 bp of the larger vector and the insert could not be determined by deletion of the downstream insert alone, it was necessary to perform the deletion from both ends.

Pe de altă parte, s-a purificat PPDK direct din Flaveria brownii și s-au determinat secvențele aminaocide ale regiunilor sale N-terminale, C-terminale și interne. PPDK s-a purificat după cum urmează. S-au mărunții frunze verzi într-un volum triplu al tamponului de extracție. După centrifugare, s-a adăugat la supernatant sulfat de amoniu până la 30% saturare și s-au îndepărtat proteinele precipitate. Ulterior, s-a adăugat sulfat de amoniu până la 70% saturare și s-au recuperat proteinele precipitate. Proteinele recuperate s-au aplicat la Sephadex G25 (accesibil comercial de la PHARMACIA) pentru îndepărtarea sărurilor. Ceea ce a rezultat s-a aplicat în coloană DEAE-Sepharose (PHARMACIA) și proteinele adsorbite la coloană s-au eluat prin HCI, care are un gradient de 50-400 mM. Fracțiunile care au activitate PPDK s-au combinat și s-au concentrat prin sulfat de amoniu de saturație 70%, urmat de desalinizare prin Sephadex G25. Ceea ce a rezultat s-a aplicat apoi la o coloană hidroxiapatită. Proteinele adsorbite s-au eluat prin tampon fosfat, care are un gradient de fosfat de la 10 mM până la 40 mM. Fracțiunile care au activitate PPDK s-au combinat și s-au concentrat prin sulfat de amoniu de saturație 70%, urmat de desalinizare prin Sephadex G25. Ceea ce a rezultat s-a supus la SDS-PAGE, și banda PPDK s-a decupat. S-a recuperat proteina din gel prin electroeluție. Prin acest proces s-a obținut proba PPDK, purificată într-o cantitate de circa 5-10 nmol. PPDK-ul purificat, obținut în acest mod, a prezentat o bandă unică în SDS-PAGE.On the other hand, PPDK was purified directly from Flaveria brownii and the amino acid sequences of its N-terminal, C-terminal and internal regions were determined. PPDK was purified as follows. Green leaves were minced in a threefold volume of the extraction buffer. After centrifugation, ammonium sulfate was added to the supernatant to 30% saturation and the precipitated proteins were removed. Subsequently, ammonium sulfate was added to 70% saturation and the precipitated proteins were recovered. The recovered proteins were applied to Sephadex G25 (commercially available from PHARMACIA) to remove salts. The resulting was applied to a DEAE-Sepharose column (PHARMACIA) and proteins adsorbed to the column were eluted by HCl, which has a gradient of 50-400 mM. Fractions having PPDK activity were combined and concentrated by 70% saturation ammonium sulfate followed by desalting with Sephadex G25. The resulting was then applied to a hydroxyapatite column. Adsorbed proteins were eluted through phosphate buffer, which has a phosphate gradient from 10 mM to 40 mM. Fractions having PPDK activity were combined and concentrated by 70% saturation ammonium sulfate followed by desalting with Sephadex G25. The resulting was subjected to SDS-PAGE, and the PPDK band was excised. Protein was recovered from the gel by electroelution. Through this process, the PPDK sample was obtained, purified in an amount of about 5-10 nmol. The purified PPDK obtained in this way showed a single band in SDS-PAGE.

Apoi, s-au determinat secvențele aminoacide ale regiunilor N-terminală, C-terminală și interne ale PPDK purificat obținut astfel. Secvența aminoacidă a regiunii N-terminale s-a determinat prin transferarea proteinei pe o membrană PVDF și apoi determinarea secvenței folosind un secvențier de aminoacizi în fază gazoasă. Secvența aminoacidă a regiunii N-terminale s-a determinat prin digestia PPDK purificat cu carboxipeptidază și estimarea secvenței aminoacide pe baza relației dintre compoziția aminoacizilor eliberați și timpul de digestie.Next, the amino acid sequences of the N-terminal, C-terminal and internal regions of the thus obtained purified PPDK were determined. The amino acid sequence of the N-terminal region was determined by transferring the protein to a PVDF membrane and then determining the sequence using a gas-phase amino acid sequencer. The amino acid sequence of the N-terminal region was determined by digesting purified PPDK with carboxypeptidase and estimating the amino acid sequence based on the relationship between the composition of the released amino acids and the digestion time.

RO 118451 ΒRO 118451 B

Secvențele aminoacide ale regiunilor interne s-au determinat prin determinarea secvenței aminoacide de la N-terminal al peptidelor generate prin digestia proteinei cu o protează. Detaliile acestei metode sunt cele care urmează. Inițial, ca și în cazul determinării secvenței aminoacide a regiunii N-terminale, PPDK din frunze verzi de Flaveria browniia fost parțial purificată și supusă la un SDS-PAGE obișnuit. Gelul s-a colorat cu Coomassie Brilliant Blue R250 și s-a decupat banda PPDK. Gelul decupat s-a echilibrat într-un tampon de echilibrare (Tris-HCI pH 6,8,125 mM, EDTA 1 mM, 0,1% SDS) și s-a inserat într-un godeu, fiind urmat de un al doilea SDS-PAGE. Aici, împreună cu gelul echilibrat, s-au adăugat o soluție de acoperire (Tris-HCI pH 6,8, 125 mM, EDTA 1 mM, 0,1% SDS, 0,01% BPB, 20% glicerol) și o soluție enzimatică (Tris-HCI pH 6,8, 125 mM, EDTA 1 mM, 0,1% SDS, 0,01% BPB, 10% glicerol, lizil endopeptidază 1-5 pg sau proteaze V8 0,01-0,1 pg). După realizarea electroforezei, pentru același timp, în gelul concentrat s-a realizat digestia proteinei (s-a întrerup alimentarea elelctrică, și gelul a fost lăsat să stea 45 min). Apoi, s-a repornit electroforeza, și ceea ce a rezultat s-a transferat la o membrană PVDF în același mod ca la determinarea secvenței N-terminale, și secvențele aminaocide ale fragmentelor obținute prin digestie s-au determinat de la N-terminalele acestora, folosind secvențierul de aminoacizi în fază gazoasăThe amino acid sequences of the internal regions were determined by determining the N-terminal amino acid sequence of the peptides generated by digestion of the protein with a protease. The details of this method are as follows. Initially, as in the case of determining the amino acid sequence of the N-terminal region, PPDK from green leaves of Flaveria browniia was partially purified and subjected to a regular SDS-PAGE. The gel was stained with Coomassie Brilliant Blue R250 and the PPDK band excised. The excised gel was equilibrated in equilibration buffer (Tris-HCl pH 6.8, 125 mM, 1 mM EDTA, 0.1% SDS) and inserted into a well, followed by a second SDS-PAGE. Here, along with the equilibrated gel, a coating solution (Tris-HCl pH 6.8, 125 mM, 1 mM EDTA, 0.1% SDS, 0.01% BPB, 20% glycerol) and a enzymatic (Tris-HCl pH 6.8, 125 mM, EDTA 1 mM, 0.1% SDS, 0.01% BPB, 10% glycerol, lysyl endopeptidase 1-5 µg or V8 proteases 0.01-0.1 µg ). After performing the electrophoresis, for the same time, the protein digestion was performed in the concentrated gel (the electrical supply was interrupted, and the gel was left to stand for 45 min). Then, the electrophoresis was restarted, and the resulting was transferred to a PVDF membrane in the same way as for determining the N-terminal sequence, and the amino acid sequences of the digested fragments were determined from their N-termini using the sequencer amino acids in gas phase

Secvențele determinate ale regiunilor N-terminale, C-terminale și interne sunt precum urmează:The determined sequences of the N-terminal, C-terminal and internal regions are as follows:

330330

335335

340340

Secvența N-terminală: N-terminal sequence: Asn Pro Val Ser Pro Pro Val (72-78) Asn Pro Val Ser Pro Pro Val (72-78) Secvența C-terminală: C-terminal sequence: Leu - Ala Ala * Val Val (948-955) Leo - Ala Ala * Val Val (948-955) Secvența intenă (1): Intense sequence (1): Lys Leu Tyr Glu Phe Leu Val Asn Ala Gin Gly - Asp Val Val Ala (349-365) Lys Leu Tyr Glu Phe Leu Val Asn Ala Gin Gly - Asp Val Val Ala (349-365) Secvența intenă (2): Intense sequence (2): Gin Leu Leu Ala Pro Pro Ala Met Ser Asn Ala Leu - Thr (592-605) Gin Leu Leu Ala Pro Pro Ala Met Ser Asn Ala Leu - Thr (592-605) Secvența intenă (3): Intense sequence (3): Leu Thr Ala Asp Thr Gly Met Ser Lys Asp Glu lle Tyr Ser Arg lle Glu (721-738) Leu Thr Ala Asp Thr Gly Met Ser Lys Asp Glu lle Tyr Ser Arg lle Glu (721-738) Secvența intenă (4): Intense sequence (4): Ala - - - Ser Phe Gly Thr Ala - - - Ser Phe Gly Thr

345345

350350

Asn Asp Leu Cis Gin Met Val Phe Gly - Ser (844-862) în secvențele aminoacide descrise mai sus înseamnă glutamină, dar nu s-a putut analiza prin analizatorul folosit, și înseamnă că rezultatul analizei a fost neclar. Numerele din paranteze indică numărul aminoacidului regiunii corespunzătoare din secvența aminoacidă prezentată în SECV ID nr: 5. Deși părți ale secvenețelor interne (2) și (4) sunt diferite de secvența aminoacidă prezentată în SECV ID nr: 5, se consideră că aceasta se datorează erorilor secvențierului de aminoacizi. Așa cum este bine cunoscut, erorile sunt produse de secvențele de aminoacizi cu o frecvență considerabilă, nefiind produse erori substanțiale prin secvențiere de ADN.Asn Asp Leu Cis Gin Met Val Phe Gly - Ser (844-862) in the amino acid sequences described above means glutamine, but it could not be analyzed by the analyzer used, and it means that the analysis result was unclear. The numbers in parentheses indicate the amino acid number of the corresponding region in the amino acid sequence shown in SEQ ID No: 5. Although parts of the internal sequences (2) and (4) are different from the amino acid sequence shown in SEQ ID No: 5, it is believed that this is due to amino acid sequencing errors. As is well known, errors are produced by amino acid sequences with considerable frequency, and no substantial errors are produced by DNA sequencing.

355355

360360

Deoarece secvența aminoacidă prezentată în SECV ID nr: 5 corespunde bine cu secvențele aminoacide parțiale, determinate direct în PPDK purificat, descris mai sus, s-a confirmat că secvența aminoacidă prezentată în SECV ID nr: 5 este secvența aminoacidă a PPDK. Secvența aminoacidă prezentată în SECV ID nr: 5 s-a comparat cu secvențele aminoacide ale PPDK-urilor cunoscute de Flaveria bidentis și porumb. Ca rezultat, în regiunea proteinei mature au fost diferite 40 resturi aminoacide {Flaveria bidentis) și respectiv, circa 180 resturi aminoacide (porumb). în plus, din rezultatele descrise mai sus, s-a considerat că în secvența aminoacidă prezentată în SECV ID nr: 5, secvența aminoacidă de la 1 până la 71-lea aminoacid nu există în proteina matură, dar există o peptidă tranzitorie, necesară pentru trecerea prin membrane, care este prelucrată după trecerea prin membrane. Diferențele dintre secvențele aminoacide ale proteinelor mature de Flaveria brownii și Flaveria bidentis sunt arătate în tabelul 1 de mai jos.Since the amino acid sequence shown in SEQ ID NO: 5 corresponds well with the partial amino acid sequences directly determined in the purified PPDK described above, it was confirmed that the amino acid sequence shown in SEQ ID NO: 5 is the amino acid sequence of PPDK. The amino acid sequence shown in SEQ ID NO: 5 was compared to the amino acid sequences of known PPDKs from Flaveria bidentis and maize. As a result, 40 amino acid residues {Flaveria bidentis) and about 180 amino acid residues (maize) were different in the region of the mature protein. In addition, from the results described above, it was believed that in the amino acid sequence shown in SEQ ID NO: 5, the amino acid sequence from the 1st to the 71st amino acid does not exist in the mature protein, but there is a transient peptide, necessary for passing through membranes, which is processed after passing through the membranes. The differences between the amino acid sequences of the mature Flaveria brownii and Flaveria bidentis proteins are shown in Table 1 below.

365365

370370

RO 118451 ΒRO 118451 B

Tabelul 1Table 1

375375

brownii the brownies bidentis bidentis Numerele aminoacidului §i Amino acid numbers and 5 Phe 5 Phe 6 Ser 6 Ser 380 380 aminoacizii care sunt the amino acids that are 10 Pro 10 Pros 11 Leu 11 Leo diferiț i. different i. 15 Asn 15 Asn 16 Arg 16 Arg 28 Thr 28 Thr 29 Asn 29 Asn 40 Pro 40 Pros 41 Ser 41 Ser 385 385 41 Ala 41 Ala 42 Ser 42 Ser 48 Arg 48 Arg 47 Leu 47 Leo 49 Arg 49 Arg 48 Thr 48 Thr 50 Lys 50 Lys 49 Pro 49 Pros 390 390 52 Ser 52 Ser 50 Ala 50 Ala 57 Ile 57 Islands 55 Pro 55 Pros 61 Thr 61 Thr 59 Ser 59 Ser 62 Gly 62 Gly 60 Ser 60 Ser 395 395 66 Leu 66 Leo 64 Pro 64 Pros 81 Thr 81 Thr 79 Ala 79 Ala 92 Asn 92 Asn 90 Arg 90 Arg 97 Lys 97 Lys 95 Arg 95 Arg 400 400 114 Ala 114 Ala 112 Ser 112 Ser 140 Lys 140 Lys 138 Ser 138 Ser 150 Leu 150 Leu 148 Ser 148 Ser 154 Gin 154 Gin 152 Asp 152 Asp 405 405 170 Ala 170 Ala 168 Pro 168 Pro 201 Asp 201 Asp 199 Ala 199 Ala 205 Ala 205 Ala 203 Gly 203 Gly 263 Ala 263 Ala 261 Val 261 Wave 410 410 265 Gin 265 Gin 263 Lys 263 Lys 323 Cys 323 Cys 321 Ser 321 Ser 356 Val 356 Wave 354 Ile 354 Islands 374 Val 374 Wave 372 Gly 372 Gly 415 415

RO 118451 ΒRO 118451 B

Tabelul 1 (continuare)Table 1 (continued)

brownii the brownies bidentis bidentis 420 420 Numerele aminoacidului Amino acid numbers 382 Glu 382 Glu 380 Asp 380 Asp și aminoacizii care sunt and the amino acids that are 385 Arg 385 Arg 383 Lys 383 Lys 392 Val 392 Wave 390 Glu 390 Glu diferiț i different i 396 Arg 396 Arg 394 Gly 394 Gly 465 Asn 465 Asn 463 Asp 463 Asp 490 Val 490 Wave 488 Cys 488 Cys 564 Val 564 Wave 562 lle 562 lle 430 430 583 Ser 583 Ser 581 Thr 581 Thr 605 Thr 605 Thr 603 lle 603 lle 672 Ala 672 Ala 670 Val 670 Wave 723 Thr 723 Thr 721 Ala 721 Ala 435 435 724 Ala 724 Ala 722 Val 722 Wave 730 Lys 730 Lys 728 Ala 728 Ala 736 Arg 736 Arg 734 Lys 734 Lys 739 Lys 739 Lys 737 Asn 737 Asn 440 440 777 Asn 777 Asn 775 Thr 775 Thr 803 Gly 803 Gly 801 Ser 801 Ser 818 Leu 818 Leo 816 Val 816 Wave 838 Asp 838 Asp 836 Glu 836 Glu 445 445 841 Ala 841 Ala 839 Gly 839 Gly 845 Glu 845 Glu 843 Asp 843 Asp 871 Pro 871 Pro 869 Gin 869 Gin 875 Ser 875 Ser 873 Ala 873 Ala 450 450 887 Leu 887 Leo 885 lle 885 lle 954 Val 954 Wave 952 lle 952 lle

Exemplul 2. Producerea PPDK de Flaveria brownii în E.coli și măsurarea stabilității la temperatură scăzută 455 Example 2. Production of PPDK by Flaveria brownii in E.coli and measurement of stability at low temperature 4 55

Pentru a confirma faptul că enzima stabilă la temperatură scăzută este produsă în acest moment de la cADN al PPDK de Flaveria brownii, izolată cum s-a descris mai sus, s-a realizat expresia în E.coli după cum urmează.To confirm that the low-temperature stable enzyme is now produced from Flaveria brownii PPDK cDNA isolated as described above, expression in E.coli was performed as follows.

Pentru îndepărtarea peptidei de tranzit și ligarea ADN-ului la un vector de expresie astfel încât cadrele de citire să coincidă, s-au introdus situri de restricție după cum urmează. 460To remove the transit peptide and ligate the DNA to an expression vector so that the reading frames coincide, restriction sites were introduced as follows. 460

S-a digerat p631 cu Saci și s-a reciclizat, pentru a obține p631Sacl (un plasmid la fel ca p631, cu excepția faptului că este deletată regiunea în aval a sitului Saci). S-a realizat PCR folosind p631Sacl ca matriță și folosind primer 4: GATATCAATCCGGTGTCTCCTCC, care conține situl EcoRV preparat pe baza secvenței din vecinătatea regiunii de prelucrarep631 was digested with Saci and recycled to obtain p631Sacl (a plasmid like p631 except that the region downstream of the Saci site is deleted). PCR was performed using p631Sacl as a template and using primer 4: GATATCAATCCGGTGTCTCCTCC, which contains the EcoRV site prepared based on the sequence in the vicinity of the processing region

RO 118451 Β și primer M13 RV (accesibil comercial de la TAKARA SHUZO) complementar la secvența vectorului și fragmentul amplificat s-a subclonat în pCRII. S-a decupat un fragment care conține regiunea N-terminală folosind enzimele de restricție EcoRV și Saci. Trei fragmente ADN, care sunt: fragmentul ADN astfel decupat, fragmentul Sacl-Hindlll din p631 (care conține regiunea rămasă a cADN de PPDK) și un fragment obținut prin digestia pKK233-2 cu Ncol, s-au ligat, după teșirea capetelor cu fragment Klenow și apoi digerarea a ceea ce a rezultat cu Hindlll (Fig. 1). S-a transformat E.coli MV1184 cu plasmidul obținut și s-a folosit pentru experimente de expresie. S-a diluat 1 ml dintr-un mediu precultivat cu 9 ml de mediu LB proaspăt (conținând 50 mg/l ampicilină) și rezultanta s-a cultivat cu agitare 3 h la 37°C. Apoi, s-a adăugat IPTG la o concentrație de 5 mM și s-a continuat cultura pentru alte 3 h urmat de colectarea celulelor prin centrifugare. Celulele s-au suspendat în 0,5 ml dintr-un tampon de extracție (50 mM Hepes-KOH pH 7,5,10 mM MgSO4, 1 mM EDTA, 5 mM DTT) și s-a adăugat lizozim la o concentrație de aproximativ 0,5 mg/ml. Suspensia s-a lăsat să stea în gheață 5 min și apoi s-a tratat cu un distructor de ultrasonare (Model UCD-130T (accesibil comercial de la COSMOBIO) pentru o perioadă de 30 s fiecare, în total 5 min, extrăgându-se enzima. Ceea ce a rezultat s-a centrifugat cu o microcentrifugă timp de 10 min, și supernatantul s-a aplicat la coloană Sephadex G25 echilibrată cu un tampon de coloană (50 mM Hepes-KOH pH 7,0,10 mM MgCI2, 2 mM EDTA, 10 mM DTT) pentru îndepărtarea substanțelor cu greutate moleculară scăzută. Ceea ce a rezultat s-a lăsat să stea la 25°C nu mai puțin de 30 min, prin aceasta realizându-se asocierea la tetramer, care s-a folosit apoi pentru măsurarea activității.RO 118451 Β and M13 RV primer (commercially available from TAKARA SHUZO) complementary to the vector sequence and the amplified fragment was subcloned into pCRII. A fragment containing the N-terminal region was excised using EcoRV and Saci restriction enzymes. Three DNA fragments, which are: the thus excised DNA fragment, the Sac1-HindIII fragment from p631 (containing the remaining region of the PPDK cDNA) and a fragment obtained by digesting pKK233-2 with NcoI, were ligated after blunting the ends with Klenow and then digesting the resulting Hindlll (Fig. 1). E.coli MV1184 was transformed with the resulting plasmid and used for expression experiments. 1 ml of a pre-cultured medium was diluted with 9 ml of fresh LB medium (containing 50 mg/l ampicillin) and the resultant cultured with shaking for 3 h at 37°C. Then, IPTG was added at a concentration of 5 mM and culture was continued for another 3 h followed by collection of cells by centrifugation. Cells were suspended in 0.5 ml of an extraction buffer (50 mM Hepes-KOH pH 7.5, 10 mM MgSO 4 , 1 mM EDTA, 5 mM DTT) and lysozyme was added at a concentration of approximately 0 .5 mg/ml. The suspension was left on ice for 5 min and then treated with an ultrasonic disruptor (Model UCD-130T (commercially available from COSMOBIO) for 30 s each for a total of 5 min, extracting the enzyme. the result was centrifuged with a microcentrifuge for 10 min, and the supernatant was applied to a Sephadex G25 column equilibrated with a column buffer (50 mM Hepes-KOH pH 7.0, 10 mM MgCl 2 , 2 mM EDTA, 10 mM DTT) to remove low molecular weight substances.The resulting was allowed to stand at 25°C for not less than 30 min, thereby making the association to the tetramer, which was then used to measure the activity.

PPDK-urile produse în celulele E.coli de la cADN-uri PPDK de Flaveria brownii, Flaveria bidentisși porumb (harvest Queen) au prezentat în principal aceeași mobilitate ca și enzimele originare din plante în SDS-PAGE. Deși greutățile moleculare ale enzimelor mature, care sunt așteptate de la cADN-urile respective, sunt aproximativ aceleași, greutățile moleculare aparente pe SDS-PAGE diferă substanțial. S-a dovedit că aceasta se datorează diferențelor din compozițiile aminoacide și nu prelucrării prin deleție a proteinelor sau modificărilor după translație, cum ar fi atașarea catenelor de zaharide. Stabilitatea la temperatură scăzută a fiecărui PPDK produs de E.coli a fost identică enzimei corespunzătoare, originară din fiecare plantă. Stabilitatea la temperatură scăzută a PPDK de Flaveria brownii este obținută fără un factor auxiliar specific plantei sau prelucrării după translație, astfel că este de așteptat ca PPDK stabil la temperatură scăzută să fie produs prin introducerea cADN-ului în porumb și exprimarea sa. Fig. 2 arată relația dintre intervalul de timp după plasarea enzimei la 0°C și activitatea relativă a PPDK.PPDKs produced in E.coli cells from Flaveria brownii, Flaveria bidentis, and maize (harvest Queen) PPDK cDNAs showed essentially the same mobility as plant-derived enzymes in SDS-PAGE. Although the molecular weights of the mature enzymes, which are expected from the respective cDNAs, are approximately the same, the apparent molecular weights on SDS-PAGE differ substantially. This has been shown to be due to differences in amino acid compositions and not to protein deletion processing or post-translational modifications such as the attachment of saccharide chains. The low-temperature stability of each PPDK produced by E.coli was identical to the corresponding enzyme originating from each plant. Low-temperature stability of Flaveria brownii PPDK is achieved without a plant-specific auxiliary factor or post-translational processing, so it is expected that low-temperature stable PPDK is produced by introducing the cDNA into maize and expressing it. Fig. 2 shows the relationship between the time interval after placing the enzyme at 0°C and the relative activity of PPDK.

în procesul descris mai sus pentru producerea de PPDK activă în E.coli, cADN-ul, din care s-a îndepărtat regiunea care codifică peptida de tranzit, s-a inserat în vectorul de expresie. Pentru aceasta a fost important să se împerecheze cu precizie situl de decupat cu situl corespunzător la N-terminal al enzimei originare din plantă.In the process described above for the production of active PPDK in E.coli, the cDNA, from which the region encoding the transit peptide was removed, was inserted into the expression vector. For this it was important to precisely pair the site to be cut with the corresponding site at the N-terminal of the enzyme originating from the plant.

Cantitatea de enzimă produsă în E.coli The amount of enzyme produced in E.coli Primer folosit Primer used (F. brownii) (F. brownii) MITAKKRVFTF... MITAKKRVFTF... - - 2 2 MIPVSPPVTTTKKRVFTF... MIPVSPPVTTTKKRVFTF... + + 3 3 MINPVSPPVI I IKKRVFTF... MINPVSPPVI I IKKRVFTF... ++ ++ 4 4 N P VSPPVTTTKKR VFTF... N P VSPPVTTTKKR VFTF... (enzimă originară din frunze) (enzyme originating from leaves)

RO 118451 ΒRO 118451 B

TabelTable

Cantitatea de enzimă produsă în E.coli The amount of enzyme produced in E.coli Primer folosit Primer used (F. bidentis) (F. bidentis) MITAKKRVFTF... MITAKKRVFTF... +4- +4- MIPVSPPVTTAKKRVFTF... MIPVSPPVTTAKKRVFTF... - - TAKKRVFTF... TAKKRVFTF... (enzimă originară din frunze) (enzyme originating from leaves) (porumb) (maize) MATKKRVFTF... MATKKRVFTF... ++ ++ TTKKRVFTF... TTKKRVFTF... (enzimă originară din frunze) (enzyme originating from leaves)

515515

520520

Mai precis, la momentul realizării experimentului de expresie a cADN de Flaveria brownii, expresia PPDK-urilor de porumb și Flaveria bidentis fuseseră reușite, cu referire la situl de tăiere la porumb în ambele cazuri. Astfel, în expresia cADN de Flaveria brownii, mai 525 întâi s-a tăiat ADN-ul la același sit, folosind primer 2, și el s-a ales pentru exprimarea cADN-ului după construcția unui vector de expresie care conține cADN. Cu toate acestea, nu s-a produs la toate PPDK. Apoi, pe baza secvenței regiunii N-terminale a enzimei originare din frunze s-a construit un vector de expresie în care cADN inserat conține o secvență suplimentară, care codifică extinderea enzimei spre direcția N-terminalului prin 7 resturi, folo- 530 sind primerul 3, și expresia s-a realizat folosind vectorul de expresie. Ca urmare, s-a confirmat producerea de PPDK. Cu toate acestea, chiar cu acest vector de expresie, cantitatea de enzimă produsă a fost mică, astfel că enzima obținută nu a fost într-o cantitate suficientă pentru măsurarea activității sale. Astfel, s-a construit folosind primerul 4, un vector de expresie în care cADN-ul inserat conține încă o secvență suplimentară care condifică extinderea 535 enzimei spre direcția N-terminal prin 1 rest, și s-a realizat expresia folosind vectorul de expresie. Ca urmare, s-a produs o cantitate mare de PPDK și s-a confirmat stabilitatea sa la temperatură scăzută. Secvențele nucleotidice ale primerului 2 și primerului 3 au fost precum urmează:More specifically, at the time of the Flaveria brownii cDNA expression experiment, the expression of the maize and Flaveria bidentis PPDKs had been successful, with reference to the maize cutting site in both cases. Thus, in Flaveria brownii cDNA expression, DNA was first cut at the same site using primer 2, and it was chosen for cDNA expression after construction of an expression vector containing the cDNA. However, it did not occur in all PPDKs. Then, based on the sequence of the N-terminal region of the enzyme originating from leaves, an expression vector was constructed in which the inserted cDNA contains an additional sequence, which encodes the extension of the enzyme towards the N-terminal direction by 7 residues, using primer 3, and expression was achieved using the expression vector. As a result, the production of PPDK was confirmed. However, even with this expression vector, the amount of enzyme produced was small, so the enzyme obtained was not in sufficient quantity to measure its activity. Thus, an expression vector was constructed using primer 4 in which the inserted cDNA still contains an additional sequence that conditions the extension of the 535 enzyme towards the N-terminal direction by 1 residue, and expression was performed using the expression vector. As a result, a large amount of PPDK was produced and its stability at low temperature was confirmed. The nucleotide sequences of primer 2 and primer 3 were as follows:

primer 2: CGGTGTCTCCTCCCGGATATCACGGCTAAAAAGAG 540 primer 3: TTGATATCCCGGTTGTCTCCTCCGGTA.primer 2: CGGTGTCTCCTCCCGGATATCACGGCTAAAAAGAG 540 primer 3: TTGATATCCCGGTTGTCTCCTCCGGTA.

Exemplul 3. Identificarea regiunilor care dau stabilitate la temperatură scăzută în gena PPDK de Flaveria brownii (1) Himere de Flaveria brownii și Flaveria bidentisExample 3. Identification of regions conferring low temperature stability in the PPDK gene of Flaveria brownii (1) Chimeras of Flaveria brownii and Flaveria bidentis

Vectorii de expresie s-au recombinat conform unei metode convenționale, folosind 545 enzime de restricție. Astfel, fragmentul EcoRI-Hindlll al pKK.-brownii s-a schimbat cu fragmentul corespunzător al pKK-bidentis (un plasmid obținut prin inserția cADN-ului de Flaveria bidentis în pKK 223-2 ca în construcția lui pKK-brownii) pentru a obține un plasmid pKK-011. Pe de altă parte, fragmentul EcoRI-Hindlll al pKK-bidentis s-a schimbat cu fragmentul corespunzător al pKK-browniipentru a obține un plasmid pKK-100. în mod similar s-au schimbat 550 fragmentele Ndel-Hindlll pentru a obține pKK-001 și pKK-110. în plus, fragmentul XhoHindlll al pKK-110 s-a schimbat cu fragmentul corespunzător al pKK-bidentis pentru a obține pKK-1101, și fragmentul Xhol-HindllI al pKK-bidentis s-a schimbat cu fragmentul corespunzător al p\<K-brownii pentru a obține pKK-1110. Pentru a recombina ulterior în mod fin fragmentele Xhol-Hindl11 fragmentele s-au lincat prin PCR (metoda POR de lineare). Astfel, s-au 555 preparat primerii complementari la secvența nucleotidică în fragmentul Xhal-Hindlll,The expression vectors recombined according to a conventional method using 545 restriction enzymes. Thus, the EcoRI-HindIII fragment of pKK.-brownii was exchanged with the corresponding fragment of pKK-bidentis (a plasmid obtained by inserting Flaveria bidentis cDNA into pKK 223-2 as in the construction of pKK-brownii) to obtain a plasmid pKK-011. On the other hand, the EcoRI-HindIII fragment of pKK-bidentis was exchanged with the corresponding fragment of pKK-brownii to obtain a plasmid pKK-100. similarly 550 changed the Ndel-HindIII fragments to obtain pKK-001 and pKK-110. In addition, the XhoHindllI fragment of pKK-110 was exchanged with the corresponding fragment of pKK-bidentis to obtain pKK-1101, and the XhoI-HindllI fragment of pKK-bidentis was exchanged with the corresponding fragment of pK-brownii to obtain pKK -1110. In order to subsequently finely recombine the Xhol-Hindl11 fragments, the fragments were ligated by PCR (linear POR method). Thus, primers complementary to the nucleotide sequence in the Xhal-Hindlll fragment were prepared,

RO 118451 Β secvență care este comună în bidentis și brownii, care este un primer link-F: GCAGAGATGATGTTGGCAAGși un primer link-R: CTTGCCAACATGATCTCTGC. Folosind fragmentul Xhol-Hindlll de browniisau bidentissubclonat în pBluescript SK(-) ca matriță, și folosind combinația de primeri link-F/RV sau M4/link-R s-a realizat primul POR. Fragmentele obținute (în total 4 tipuri) s-au purificat prin decuparrea lor din gel. Folosind amestecul de fragment care codifică fosta jumătate a brownii și fragmentul care codifică ultima jumătate de bidentis, sau amestecul fragmentului care codifică fosta jumătate a bidentis și fragmentului care codifică ultima jumătate a brownii ca matriță, și folosind primerii M4/RV, s-a realizat a doua POR. Fragmentele lincate amplificate s-au digerat cu Xhol și Hindlll, și rezultatele s-au schimbat cu regiunea corespunzătoare a pKK-bidentis, pentru a obține pKK-link01 și pKK-link10. Altă pereche de gene himerice s-a preparat folosind situl Pstl dintre situl de recombinare în lincajul PCR și situl Hindlll. Astfel, fragmentul Xhol-Pstl al pKK-link10 și fragmentul Pstl-Hindlll de pKK-bidentis s-au inserat în situl Xhol-Hindlll al pKK-bidentis (reacție de licăre a trei fragmente), pentru a obține pKK-link101. în mod similar, fragmentul Xhol-Pstl al pKK-bidentis și fragmentul Pstl-Hindlll al pKK-browniis-au inserat în situl Xhol-Hindlll al pKK-bidentis, pentru a obține pKK-link110.RO 118451 Β sequence that is common in bidentis and brownii, which is an F-link primer: GCAGAGATGATGTTGGCAAGand an R-link primer: CTTGCCAACATGATCCTGC. Using the browniisau bidentis Xhol-Hindlll fragment subcloned into pBluescript SK(-) as a template, and using the link-F/RV or M4/link-R primer combination the first POR was performed. The obtained fragments (4 types in total) were purified by cutting them out of the gel. Using the mixture of the fragment encoding the former half of brownia and the fragment encoding the latter half of bidentis, or the mixture of the fragment encoding the former half of bidentis and the fragment encoding the latter half of brownia as a template, and using primers M4/RV, a second POR. The amplified linked fragments were digested with XhoI and HindIII, and the results were swapped with the corresponding region of pKK-bidentis to obtain pKK-link01 and pKK-link10. Another chimeric gene pair was prepared using the Pstl site between the recombination site in the PCR ligation and the HindIII site. Thus, the XhoI-Pstl fragment of pKK-link10 and the Pstl-HindIII fragment of pKK-bidentis were inserted into the XhoI-HindIII site of pKK-bidentis (three-fragment annealing reaction), to obtain pKK-link101. similarly, the XhoI-Pstl fragment of pKK-bidentis and the Pstl-HindIII fragment of pKK-browniis inserted into the XhoI-HindIII site of pKK-bidentis, to obtain pKK-link110.

Cele 40 de situri, la care resturile aminoacide sunt diferite între proteinele PPDK de Flaveria brownii și de Flaveria bidentis, există în principal în regiunea N-terminală și C-terminală, și nu sunt atât de numeroase în regiunea centrală, cu alte cuvinte, în situl activ. Astfel, cADN-urile s-au divizat în trei regiuni care sunt regiunile frontală, mediană și din spate, utilizând situri EcoRI- și situri Ndel, care există în mod obișnuit în ambele gene, și aceste fragmente interschimbabile s-au schimbat pentru a prepara gene himerice. Prin verificarea acestor gene himerice s-a determinat care regiune este legată de stabilitatea la temperatură scăzută. Ca urmare, stabilitatea la temperatură scăzută este dobândită când proteina conține regiunea 1/3 din spate a cADN-ului de Flaveria brownii. Când proteina conține regiunea 1/3 din spate a cADN-ului de Flaveria bidentis, proteina a fost sensibilă la temperatură scăzută. Apoi, regiunea 1/3 din spate s-a divizat în 2 regiuni prin enzima de restricție, și fragmentele respective s-au introdus în regiunile corespunzătoare ale pKK-bidentis și s-a verificat stabilitatea la temperatură scăzută. Ca urmare, regiunea din amontele sitului Xhol (cu alte cuvinte, regiunea 1/6 de la C-terminal) a fost necesară și suficientă pentru obținerea stabilității la temperatură scăzută. Apoi, s-a preparat o genă himerică, care conține regiunea 1/6 din spate (fragmentul Xhaol-Hindl11 care conține 7 substituții de resturi aminoacide), prin metoda de lineare PCR și gena amplificată s-a introdus în pKK-bidentis, urmată de măsurarea stabilității la temperatură scăzută, a acestora. Ca urmare, o enzimă himerică pKK-link10, având majoritatea regiunii din spate conținând 4 substituții, a fost stabilă la temperatură scăzută, în timp ce o enzimă himerică pKK-link01, care are fosta jumătate a regiunii din spate care conține 3 substituții, a fost sensibilă la temperatură joasă. Apoi, regiunea din spate s-a recombinat folosind o enzimă de restricție Pstl pentru a prepara gene himerice care au două substituții aminoacide per genă și s-a verificat stabilitatea la temperatură scăzută a genei obținute. Ca urmare, toate genele himerice au prezentat stabilitate la temperatură scăzută. Astfel, s-a acceptat că sunt două sau mai multe regiuni în legătură cu stabilitatea la temperatură scăzută.The 40 sites, where the amino acid residues are different between Flaveria brownii and Flaveria bidentis PPDK proteins, exist mainly in the N-terminal and C-terminal region, and are not so numerous in the central region, in other words, in active site. Thus, the cDNAs were divided into three regions which are the front, middle and back regions using EcoRI- and Ndel-sites, which are commonly present in both genes, and these exchangeable fragments were swapped to prepare chimeric genes. By checking these chimeric genes it was determined which region is related to low temperature stability. As a result, low temperature stability is acquired when the protein contains the posterior 1/3 region of the Flaveria brownii cDNA. When the protein contains the back 1/3 region of the Flaveria bidentis cDNA, the protein was sensitive to low temperature. Then, the back 1/3 region was split into 2 regions by restriction enzyme, and the respective fragments were inserted into the corresponding regions of pKK-bidentis and checked for low temperature stability. As a result, the region upstream of the Xhol site (ie, the 1/6 region from the C-terminus) was both necessary and sufficient to achieve low-temperature stability. Then, a chimeric gene containing the 1/6 back region (Xhaol-Hindl11 fragment containing 7 amino acid residue substitutions) was prepared by the PCR linearization method and the amplified gene was inserted into pKK-bidentis, followed by stability measurement at their low temperature. As a result, a pKK-link10 chimeric enzyme, having most of the back region containing 4 substitutions, was stable at low temperature, while a pKK-link01 chimeric enzyme, having the former half of the back region containing 3 substitutions, a was sensitive to low temperature. Then, the back region was recombined using a PstI restriction enzyme to prepare chimeric genes having two amino acid substitutions per gene, and the low temperature stability of the resulting gene was checked. As a result, all chimeric genes exhibited low temperature stability. Thus, it has been accepted that there are two or more regions related to low temperature stability.

(2) Himere de porumb și Flaveria brownii(2) Maize chimeras and Flaveria brownii

Folosind un primer PPDK-F: CTCACTGTTCGAAGAGAAGC și un primer mNdel: CATATGCTCTGTCCGGCATAATC (catena laterală complementară) conținând sit Ndel ca primeri și cADN al PPDK de porumb ca matriță, s-a realizat PCR și fragmentul amplificat s-a subclonat în pCR II. Acest fragment s-a decupat din pCR II prin Saci și Ndel. Acest fragment, fragmentul Sacl-Smal (fragment vector) al pKK-PPDK, obținut prin digestia cADN alUsing a PPDK-F primer: CTCACTGTTCGAAGAGAAGC and an mNdel primer: CATATGCTCTGTCCGGCATAATC (complementary side chain) containing Ndel site as primers and maize PPDK cDNA as template, PCR was performed and the amplified fragment was subcloned into pCR II. This fragment was excised from pCR II by Saci and NdeI. This fragment, the Sac1-SmaI fragment (vector fragment) of pKK-PPDK, obtained by cDNA digestion of

RO 118451 ΒRO 118451 B

PPDK de Flaveria brownii, teșind rezultanta cu fragment Klenow și apoi digerând ceea ce a rezultat cu Ndel, s-au licat prin reacția de lineare a trei fragmente (fig. 3), pentru a obține pKK-mz/bro(Nde). S-a realizat PCR folosind primer PPDK și un primer mXho: CTCGAGGGATCTCAATCATTG (catenă complementară), și PPDK de porumb ca matriță. Fragmentul obținut s-a subclonat în pCR II, și insertul s-a decupat prin Saci și Xhol din pCR II. Fragmentul decupat s-a ligat cu fragmentul Sacl-Xhol (fragment vector) al pKK-mz/bro(Nde), pentru a obține pKK-mz/bro(Xho).PPDK of Flaveria brownii, cleaving the resultant with the Klenow fragment and then digesting the resulting with NdeI, were lysed by the linearization reaction of three fragments (Fig. 3), to obtain pKK-mz/bro(Nde). PCR was performed using primer PPDK and an mXho primer: CTCGAGGGATCTCAATCATTG (complementary strand), and maize PPDK as template. The resulting fragment was subcloned into pCR II, and the insert excised by Sac and XhoI from pCR II. The excised fragment was ligated with the Sac1-XhoI fragment (vector fragment) of pKK-mz/bro(Nde), to obtain pKK-mz/bro(Xho).

Ambele enzime himerice, care conțin regiunea 1/3 C-terminală (fragment NdelHindlll) a PPDK brownii în PPDK de porumb și enzima himerică care conține regiunea 1/6C-terminală (fragment Xhol-Hindl11) al PPDK brownii în PPDK de porumb au prezentat stabilitate la temperatură scăzută la fel de puternică ca cea a brownii. Astfel, deoarece stabilitatea la temperatură scăzută a putut fi atribuită PPDK de porumb prin acea secvență aminoacidă considerabil diferită de cea a F.brownii, s-a considerat că stabilitatea la temperatură scăzută poate fi dată la PPDK de la diverse plante prin introducerea regiunii 1/3 din spate sau regiunii 1/6 C-terminale. Deoarece PPDK himeric porumb/F.brownii preparat în acest fel conține peptida de tranzit a PPDK de porumb așa cum este, dacă o plantă are o problemă în legătură cu transportul PPDK spre cloroplaste, în care peptida de tranzit este, de asemenea, originară de la Flaveria brownii se apreciază că această problemă poate fi depășită prin introducerea acestei gene himerice în locul genei originare din F.brownii.Both chimeric enzymes containing the 1/3 C-terminal region (NdelHindlll fragment) of brownii PPDK in maize PPDK and the chimeric enzyme containing the 1/6C-terminal region (Xhol-Hindl11 fragment) of brownii PPDK in maize PPDK showed low temperature stability as strong as that of brownie. Thus, since low-temperature stability could be assigned to maize PPDK by that amino acid sequence considerably different from that of F.brownii, it was considered that low-temperature stability could be given to PPDK from various plants by introducing the 1/3 region of back or the 1/6 C-terminal region. Because the maize/F.brownii chimeric PPDK prepared in this way contains the transit peptide of the maize PPDK as it is, if a plant has a problem with the transport of PPDK to chloroplasts, where the transit peptide is also originating from in Flaveria brownii it is appreciated that this problem can be overcome by introducing this chimeric gene instead of the original gene from F. brownii.

(3) Clone mutate punctual(3) Point-mutated clones

Resturile aminoacide ale fragmentului Xhol-Hindlll ale PPDK browniis-au schimbat unul câte unul de la tipul brownii la tipul bidentis. în mod similar, resturile aminoacide ale fragmentului Xhol-Hindlll ale PPDK-bidentis s-au schimat unul câte unul de la tipul bidentis la tipul brownii. Introducerea mutației s-a realizat prin subclonarea fragmentelor Xhol-Hindlll ale cADN-urilor PPDK de F.browniiși F.bidentis în pBluescript IISK(-), și apoi substituirea nucleotidelor prin metoda Kunkel folosind Megalabel kit (accesibil comercial de la TAKARA SHUZO) și Mutan-K kit (accesibil comercial de la TAKARA SHUZO). Secvența primerilor folosiți pentru introducerea mutanților s-a prezentat în tabelul 2. După confirmarea secvenței nucleotidice mutate printr-un secvențer ADN, aceste fragmente s-au inserat în situl XholHindlll al PPDK-bidentis.The amino acid residues of the Xhol-HindIII fragment of PPDK browniis changed one by one from the brownii type to the bidentis type. Similarly, the amino acid residues of the Xhol-HindIII fragment of PPDK-bidentis shifted one by one from the bidentis type to the brownii type. The introduction of the mutation was achieved by subcloning the XhoI-HindIII fragments of the PPDK cDNAs of F. brownii and F. bidentis into pBluescript IISK(-), and then nucleotide substitution by the Kunkel method using the Megalabel kit (commercially available from TAKARA SHUZO) and Mutan- K kit (commercially available from TAKARA SHUZO). The sequence of the primers used to introduce the mutants was shown in Table 2. After confirming the mutated nucleotide sequence by a DNA sequencer, these fragments were inserted into the XholHindlll site of PPDK-bidentis.

605 §§ 605

610610

615615

620620

625625

630630

635635

Tabelul 2Table 2

Primeri folosiți pentru prepararea clonelor mutate punctualPrimers used to prepare point-mutated clones

______ Introducerea mutației de la tipul F.brownii la tipul Fbidentis ______ Introduction of mutation from F.brownii type to Fbidentis type 836DE 836DE 5'GCAATCTCTTCAGCAATC 5'GCAATCTCTTCAGCAATC 839AG 839 AG 5'GCTTCTTTTCCAATCTCATC 5'GCTTCTTTTCCAATCCATC 843ED 843ED 5'CGAAAAGAAATCGGCTTC 5'CGAAAAGAAATCGGCTTC 869PQ 869PQ 5'GAAAGATAAATCTGCAAAAACTTG 5'GAAAGATAAATCTGCAAAAAACTTG 873SA 873 SA 5'GCCTTGAGCAAGATAAATC 5'GCCTTGAGCAAGATAAATC 885LI 885 LI 5'TTCTGGTCAATAACCTCAATG 5'TTCTGGTCAATAACCTCAATG 952VI 952 VI 5'GCTTAAACAATGACTTGTGC 5'GCTTAAACAATGACTTGTGC 836ED 836ED 5'CCAATCTCATCAGCTATTAAAG 5'CCAATCCATCAGCTATTAAAG 839GA 839GA b'GC 1 1 U 1 1 1 1 GUAATCTCTTC b'GC 1 1 U 1 1 1 1 GUAATCTCTTC

640640

645645

RO 118451 ΒRO 118451 B

Tabelul 2 (continuare)Table 2 (continued)

Introducerea mutației de la tipul F.brownii la tipul F.bidentis Introducing the mutation from F.brownii type to F.bidentis type 843DE 843 DE 5'CGAAAAGAACTCAGCTTC 5'CGAAAAGAACTCAGCTTC 869QP 869QP 5'CAAGATAAATCGGCAAAAACTTG 5'CAAGATAAATCGGCAAAAACTTG 873AS 873 AS 5'GAATGCCTTGAGAAAGATAAATC 5'GAATGCCTTGAGAAAGATAAATC 885IL 885 IL 5'CTTTCTGGTCAAGAACCTCAAATG 5'CTTTCTGGTCAAGAACCTCAAATG 952IV 952 IV 5'GCTTAAACAACGACTTGTGC 5'GCTTAAACAACGACTTGTGC

Toate enzimele în care un rest aminaocid din regiunea Xhol-Hindl 11 a pKK-1110, aminoacid care este diferit între F.brownii și F.bidentis, s-a substituit la restul aminoacid corespunzător al tipului bidentis prezintă stabilitate la temperatură scăzută. Astfel, se consideră că există o multitudine de mutații care dau stabilitate la temperatură scăzută (deoarece stabilitatea la temperatură scăzută nu se pierde numai prin schimbarea unui rest aminoacid). Apoi, enzimele în care un rest aminoacid din regiunea Xhol-HindllI a pKK-bidentis, rest aminoacid care este diferit între F.brownii și F.bidentis, s-a substituit la restul aminoacid corespunzător al tipului browniis-au verificat dacă au dobândit stabilitate la temperatură scăzută. Astfel, s-a măsurat activitatea enzimei după tratament la 0°C, după 20 min. Ca un rezultat, prin mutația lui 860Gln—►Pro, enzima a dobândit stabilitate la temperatură scăzută (activitatea după tratament la frig este 60..70% din activitatea inițială). Cu mutația lui 885lle-+Leu sau 952lle->Val, pierderea de activitate la temperatură scăzută a fost puțin prevenită. Luând în considerare aceste rezultate și rezultatele enzimei himerice descrise la (1) (pKK-link110), s-a admis că enzima dobândește stabilitate la temperatură scăzută, când aceste mutații coexistă. Din aceste rezultate, s-a ajuns la concluzia că cele trei resturi aminoacide, 869Pro, 185Leu și 952Val sunt legate de stabilitatea la temperatură scăzută. Cu toate acestea, printre aceste resturi în legătură cu stabilitatea la temperatură scăzută a brownii, ca pentru 869Pro și 885Leu aceste resturi sunt ale tipului browniiale PPDK de la porumb. Prin urmare, stabilitatea la temperatură scăzută nu este în mod necesar obținută doar prin faptul că aceste resturi sunt ale tipului brownii, ci se consideră că aceste resturi dau stabilitate termică completă în secvența PPDK a brownii sau bidentis. Prin urmare, în cazurile în care stabilitatea la temperatură scăzută trebuie să fie dată unui PPDK originar de la o specie diferită, a cărui secvență aminoacidă este considerabil diferită de cea a brownii sau bidentis, este de preferat să nu se introducă mutații punctuale și să se prepare o genă himerică în care regiunea care dă stabilitatea la temperatură scăzută să fie introdusă așa cum s-a realizat pentru PPDK de porumb.All enzymes in which an amino acid residue in the Xhol-Hindl 11 region of pKK-1110, an amino acid that is different between F.brownii and F.bidentis, has been substituted for the corresponding amino acid residue of the bidentis type show stability at low temperature. Thus, it is believed that there are a multitude of mutations that confer low-temperature stability (since low-temperature stability is not lost only by changing one amino acid residue). Then, enzymes in which an amino acid residue from the Xhol-HindllI region of pKK-bidentis, an amino acid residue that is different between F.brownii and F.bidentis, was substituted for the corresponding browniis-type amino acid residue, were checked to see if they acquired temperature stability low. Thus, the enzyme activity was measured after treatment at 0°C, after 20 min. As a result, by mutating 860Gln—►Pro, the enzyme acquired stability at low temperature (the activity after cold treatment is 60..70% of the initial activity). With the mutation of 885lle-+Leu or 952lle->Val, the loss of activity at low temperature was slightly prevented. Considering these results and the results of the chimeric enzyme described in (1) (pKK-link110), it was assumed that the enzyme acquires stability at low temperature when these mutations coexist. From these results, it was concluded that the three amino acid residues, 869Pro, 185Leu and 952Val are related to low temperature stability. However, among these residues related to the low temperature stability of the brownie, as for 869Pro and 885Leu these residues are of the PPDK brownie type from corn. Therefore, the low temperature stability is not necessarily achieved only by these residues being of brownii type, but these residues are considered to confer complete thermal stability in the PPDK sequence of brownii or bidentis. Therefore, in cases where low-temperature stability must be given to a PPDK originating from a different species, whose amino acid sequence is considerably different from that of brownii or bidentis, it is preferable not to introduce point mutations and prepare a chimeric gene in which the region conferring low temperature stability is inserted as was done for maize PPDK.

Exemplul 4. Tratarea porumbului cu gena PPDK de Flaveria brownii în conformitate cu metoda lui Gordon-Kamm W.J. et al. (The plant Cell 2:603-618, 1990) sau Kosier M.G. et al. (Bio/Technology 11:194-200, 1993) particule fine de aur sau tungsten se învelesc cu pKK-browniiși particulele învelite se implantează în embrion imatur de porumb sau în culturi de celule în suspensie. De la celulele tratate, se selectează transformanții, și călușurile obținute ale transformanților se cultivă conform unei metode convenționale, după care se regenerează plante din călușuri. Metoda de transformare nu este limitată la metoda bombardamentului cu particule, putându-se folosi metode de transformare care includ electroporări (Rhodes C.A. et al., Science 240: 204-207, 1988), metoda PEGExample 4. Treatment of maize with the PPDK gene of Flaveria brownii according to the method of Gordon-Kamm W.J. et al. (The plant Cell 2:603-618, 1990) or Kosier M.G. et al. (Bio/Technology 11:194-200, 1993) fine gold or tungsten particles are coated with pKK-brownies and the coated particles are implanted into immature maize embryo or suspension cell cultures. From the treated cells, the transformants are selected, and the obtained calluses of the transformants are cultivated according to a conventional method, after which plants are regenerated from the calluses. The transformation method is not limited to the particle bombardment method, transformation methods including electroporation (Rhodes C.A. et al., Science 240: 204-207, 1988), the PEG method can be used

RO 118451 Β (Amstrong C.L. et al., Plant Cell reports 9: 335-339,1990), metoda de electroporare a țesutului (D’Halluin K. Et al., The Plant Cell 4: 1495-1505,1992), metoda Agrobacterium (Hiei V. și Komari T. WO 9400977) și altele asemenea. De la plantele obținute se recuperează semințe care germinează pentru a regenera plante. Se izolează PPDK din frunzele plantelor 695 obținute și se verifică stabilitatea la temperatură joasă. La plantele transformate și netransformate se verifică efectul temperaturii asupra ratei de fotosinteză. Stabilitatea transformării se păstrează prin proliferarea plantelor de porumb care prezintă rate ridicate ale fotosintezei pentru mai multe generații și măsurarea ratelor fotosintezei la diferite temperaturi, precum și măsurarea stabilității la temperatură scăzută a PPDK-urilor izolate din plante. 700RO 118451 Β (Amstrong C.L. et al., Plant Cell reports 9: 335-339,1990), tissue electroporation method (D'Halluin K. Et al., The Plant Cell 4: 1495-1505,1992), method Agrobacterium (Hiei V. and Komari T. WO 9400977) and the like. From the obtained plants, seeds are recovered that germinate to regenerate plants. PPDK is isolated from the leaves of the obtained 695 plants and checked for stability at low temperature. In transformed and non-transformed plants, the effect of temperature on the rate of photosynthesis is verified. Transformation stability is maintained by propagating maize plants exhibiting high photosynthetic rates for several generations and measuring photosynthetic rates at different temperatures, as well as measuring the low-temperature stability of PPDKs isolated from plants. 700

Exemplul 5. Transformarea Flaveria bidentis cu gena PPDK de Flaveria browniiExample 5. Transformation of Flaveria bidentis with the Flaveria brownii PPDK gene

Se introduce într-un plasmid Ti nevirulent al Agrobacterium tumefaciens un vector intermediar care conține cADN-ul de lungime totală prezentat în SECV ID NR: 5 și o genă raportor. Aceasta poate fi realizată prin metoda descrisă în Draper J. et al., editori Plant Genetic Transformation and Gene Expression-a laboratory manual, Blackwell Scientific 705 Publication (ISBN O-632-02172-1).An intermediate vector containing the full-length cDNA shown in SEQ ID NO: 5 and a reporter gene is inserted into a non-virulent Ti plasmid of Agrobacterium tumefaciens. This can be accomplished by the method described in Draper J. et al., editors Plant Genetic Transformation and Gene Expression-a laboratory manual, Blackwell Scientific 705 Publication (ISBN O-632-02172-1).

Pe de altă parte, țesut de frunză sau calus de Flaveria bidentis se infectează cu Agrobacterium tumefaciens menționat mai sus. Aceasta se poate realiza cultivând țesutul sau călușul împreună cu Agrobacterium tumefaciens. Celulele infectate sunt selectate pe baza rezistenței la medicament. Din călușul selectat se regenerează plante întregi prin me- 710 tode convenționale. De la plantele obținute se recuperează semințe și se germinează pentru regenerarea plantelor. Se izolează PPDK din frunzele plantelor obținute și se verifică stabilitatea la temperatură scăzută. Se verifică la plantele transformate și netransformate efectul temperaturii asupra ratei de fotosinteză. Stabilitatea transformării se menține prin proliferarea plantelor care prezintă rate ridicate la temperatură scăzută mai multe generații și prin 715 măsurarea ratelor fotosintezei la diferite temperaturi precum și a stabilității la temperatură scăzută, a PPDK-urilor izolate din plante.On the other hand, leaf tissue or callus of Flaveria bidentis becomes infected with Agrobacterium tumefaciens mentioned above. This can be done by culturing the tissue or callus with Agrobacterium tumefaciens. Infected cells are selected based on drug resistance. Whole plants are regenerated from the selected stem by conventional methods. From the obtained plants, seeds are recovered and germinated for plant regeneration. PPDK is isolated from the leaves of the obtained plants and the stability at low temperature is checked. The effect of temperature on the rate of photosynthesis is checked in transformed and non-transformed plants. Transformation stability is maintained by propagating plants exhibiting high rates at low temperature for several generations and by 715 measuring the photosynthetic rates at different temperatures, as well as the low temperature stability, of PPDKs isolated from plants.

RO 118451 ΒRO 118451 B

LISTA DE SECVENȚELIST OF SEQUENCES

SECV IO NR: 1SEQ ID NO: 1

LUNGIMEA SECVENȚEI: 2915SEQUENCE LENGTH: 2915

TIPUL SECVENȚEI: acid nucleicSEQUENCE TYPE: nucleic acid

DESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

CGATCTCCTT CTGCTATTGC TGATATCTCA AXTTCACAGG TGAAGAAGG ATG ATG ACT 58CGATCTCCTT CTGCTATTGC TGATATCTCA AXTTCACAGG TGAAGAAGG ATG ATG ACT 58

Met Met SerMet Met Ser

TCG TCG TTG TTG TCT TCT GTT GTT GAA GAA GGT GGT ATG ATG CTT CTT CTC AAG CTC AAG TCA । TCA GCC GCC CGT CGT GAG 1 GAG 1 TCG TCG TGC TGC 106 106 Ser Serum Leu Lion Ser Serum Val Wave Glu Goofy Gly Gly Met meth Leu Lion Leu Lion Lys Lys Ser . Serum. Ala That Arg Arg Glu Goofy Ser Serum Cys Cys 5 5 10 10 15 15 TTA TTA CCG GCC GCG GCG AGA AGM CTG CTG AAG AAG CAA CAA CGG CGG CGA CGA AAC AAC GCT GTC GAT NECK CTC CTC CGG CGG CGA CGA TTG TTG 154 154 Leu Lion Pro Pro Ala That Arg Arg Val Wave Lys Lys Gin Gin Arg Arg Arg Arg Asn Assn Gly Asp Gly Asp Leu Lion Arg Arg Arg Arg Leu Lion 20 20 25 25 30 30 35 35 AAC AAC CAC CaCl CAC CaCl CGT CGT CAA CAA TCG TCG TCG TCG TTT TTT CTC CTC CGG CGG TCT TCT TTA TTA ACT ACT CCG GCC GCG GCG AGA AGM 202 202 Asn Assn His His His His Arg Arg Gin Gin Ser Serum Ser Serum Phe Phew Val Wave Arg Arg Cys Cys Leu Lion Thr Thr Pro Pro Ala That Arg Arg 40 40 45 45 50 50 GTT GTT AGC AGC AGA AGM CCA Approx GAG GAG TTG TTG CGC CGC AGC AGC ACT ACT GGC GGC TTA ACT TTA ACT CCG GCC CCG GCC CGA CGA GCA GCA 250 250 Val Wave Ser Serum Arg Arg Pro Pro GlU Glu Leu Lion Arg Arg Ser Serum Ser Serum Gly Gly Leu Lion Thr Thr Pro Pro Pro Pro Arg Arg Ala That 55 55 eo is 65 65 GTT GTT CIT how AAT AAT CCG GCC CTG CTG TCT TCT CCT CCT CCG GCC GTG GTG ACG ACG ACG ACG GCT GTC AAA AAA AAG AAG AGG AGG GTT GTT 298 298 Val Wave Leu Lion Asn Assn Pro Pro Val Wave Ser Serum Pro Pro Pro Pro Val Wave Thr Thr Thr Thr Ala That Lys Lys Lys Lys Arg Arg Val Wave 70 70 75 75 80 80 TTC TTC ACT ACT TTT TTT GGT GGT AAA AAA GGA GGA AGA AGM ACT ACT GAA GAA GGC GGC AAC AAC AGG AGG GAC GAC ATG ATG AAA AAA TCC TCC 346 346 Phe Phew Thr Thr Phe Phew Gly Gly Lys Lys Gly Gly Arg Arg Ser Serum Glu Goofy Gly Gly Asn Assn Arg Arg Asp Asp Met meth Lys Lys Ser Serum 85 85 90 90 95 95 TTG TTG TTG TTG GGA GGA GGA GGA AAA AAA GGA GGA GCA GCA AAT AAT CTT CTT GCT GTC GAG GAG i ATG TCA and ATG TCA . AGC . AGC : att : att 1 GGT 1 GGT 394 394

RO 118451 ΒRO 118451 B

Leu Leu 100 Leo Leo 100 Gly Gly Lys Gly Gly Lys Gly Ala Asn Leu Ala Glu Met Ser Ser lle Gly Gly Ala Asn Leu Ala Glu Met Ser Ser lle Gly 760 760 105 105 110 110 115 115 CTA CTA TCA TCA GTT GTT CCT CCT CCT CCT GGG GGG CTC CTC ACT . ACT. ATT ATT TCA TCA ACT < ACT < GAA GAA GCA GCA TGT 1 TGT 1 GAG GAG GAA GAA 442 442 Leu Lion Ser Serum Val Wave Pro Pro Pro Pro Gly Gly Leu Lion Thr Thr lle Ile Ser Serum Thr Thr Glu Goofy Ala That Cys Cys Glu Goofy Glu Goofy 765 765 120 120 125 125 130 130 TAT DAD CAA CAA CAA CAA AAT AAT GGA GGA AAG AAG AGC AGC CTA CTA CCT CCT CCA Approx GCT GTC TTG TTG TGG TGG GAT NECK GAG GAG ATT ATT 490 490 Tyr Tyre Gin Gin Gin Gin Asn Assn Gly Gly Lys Lys Ser Serum Leu Lion Pro Pro Pro Pro Gly Gly Leu Lion Trp Trp Asp Asp Glu Goofy lle Ile 770 770 135 135 140 140 145 145 TCA TCA GAA GAA GGC GGC TTA TTA GAT NECK TAT DAD GTC GTC CAG CAG AAA AAA GAG GAG ATG ATG TCT TCT GCA GCA TCT TCT CTC CTC GCT GTC 538 538 ser serum Glu Goofy Gly Gly Leu Lion Asp Asp Tyr Tyre Val Wave Gin Gin Lys Lys Glu Goofy Met meth Ser Serum Ala That Ser Serum Leu Lion Gly Gly 150 150 155 155 160 160 775 775 GAC GAC CCG GCC TCT TCT AAA AAA CCT CCT CTC CTC CTC CTC CTT CTT TCC TCC GTC GTC CCT CCT TCG TCG GCT GTC GCT GTC GCC GCC ATA THREAD 586 586 Asp Asp Pro Pro Ser Serum Lys Lys Pro Pro Leu Lion Leu Lion Leu Lion Ser Serum Val Wave Arg Arg Ser Serum Gly Gly Ala That Ala That lle Ile 165 165 170 170 175 175 780 780 TCT TCT ATG ATG CCT CCT GOT Goth ATG ATG ATG ATG GAC GAC ACT ACT GTA GTA TTG TTG AAT AAT CTC CTC GGG GGG CTT CTT AAT AAT GAT NECK 634 634 Ser Serum Met meth Pro Pro Gly Gly Met meth Met meth Asp Asp Thr Thr Val Wave Leu Lion Asn Assn Leu Lion Gly Gly Leu Lion Asn Assn Asp Asp ISO ISO 185 185 190 190 195 195 785 785 GAG GAG GTC GTC GTA GTA GCT GTC GCT GTC CTA CTA GCT GTC GGC GGC AAA AAA ACT ACT GGA GGA GCA GCA CGG CGG TTT TTT GCC GCC TAT DAD 682 682 Glu Goofy Val Wave Val Wave Ala That Gly Gly Leu Lion Ala That Gly Gly Lys Lys Ser Serum Gly Gly Ala That Arg Arg Phe Phew Ala That Tyr Tyre 200 two hundred 205 205 210 210 GAC GAC TCG TCG TAT DAD AGA AGM AGG AGG TTT TTT CTC CTC GAT NECK ATG ATG TTT TTT GGC GGC AAC AAC GTT GTT GTA GTA ATG ATG GCT GTC 730 730 790 790 Asp Asp Ser Serum Tyr Tyre Arg Arg Arg Arg Phe Phew Leu Lion Asp Asp Met meth Phe Phew Gly Gly Asn Assn Val Wave Val Wave Met meth Gly Gly 215 215 220 220 225 225 ATC ATC CCG GCC CAT HOW TCA TCA TTA TTA TTT TTT GAC GAC GAA GAA AAG AAG TTA TTA . GAG . GAG CAG CAG ATG ATG AAA AAA . GCT . GTC ' GAA ' GAA 778 778 795 795 lle Ile Pro Pro His His Ser Serum Leu Lion Phe Phew Asp Asp Glu Goofy Lys Lys Leu Lion . Glu . Goofy Gin Gin i Met i Met . Lys . Lys Ala That i Glu and Glu 230 230 235 235 240 240 AAA AAA GGG GGG ATT ATT CAT HOW CTC CTC GAC GAC ACC acc GAT NECK CTC CTC ACT ACT ’ GCT ’ GCT GC1 GC1 ' GA1 ' GA1 ' CTI ' CTI 1 ΆΆί 1 ΆΆί l GAT l THROAT 826 826

800800

RO 118451 ΒRO 118451 B

805 805 Lys Gly 245 Lys Gly 245 Ile His Leu Ile His Leo Asp Thr Asp Leu Thr Ala Ala Asp Leu Lys Asp Asp Thr Asp Leu Thr Ala Ala Asp Leu Lys Asp 250 250 255 255 CTT CTT GTT GTT GAG GAG AAA AAA TAC cue AAG AAG AAC AAC GTG GTG TAT DAD GTG GTG GAA GAA GCA . GCA. AAG AAG GGC GGC GAA GAA AAG AAG 874 874 Leu Lion Val Wave Glu Goofy Lys Lys Tyr Tyre Lys Lys Asn Assn Val Wave Tyr Tyre Val Wave Glu Goofy Ala That Lys Lys Gly Gly Glu Goofy Lys Lys 810 810 260 260 265 265 270 270 275 275 TTT TTT CCC CCC ACA ACA GAT NECK CCA Approx AAG AAG AAA AAA CAG CAG CTA CTA GAG GAG TTA TTA GCA GCA GTG GTG AAT AAT GCT GTC GTT GTT 922 922 Phe Phew Pro Pro Thr Thr Asp Asp Pro Pro Lys Lys Lys Lys Gin Gin Leu Lion Glu Goofy Leu Lion Ala That Val Wave Asn Assn Ala That Val Wave 815 815 280 280 285 285 290 290 TTT TTT GAT NECK TCT TCT TGG TGG GAC GAC AGC AGC CCA Approx AGG AGG GCC GCC AAT AAT AAG AAG TAC cue AGA AGT AGM AGT ATT ATT AAC AAC 970 970 Phe Phew Asp Asp Ser Serum Trp Trp Asp Asp Ser Serum Pro Pro Arg Arg Ala That Asn Assn Lys Lys Tyr Tyre Arg Arg Ser Serum Ile authorities Asn Assn 295 295 300 300 305 305 820 820 CAG CAG ATA THREAD ACT ACT GGA GGA TTA TTA AAG AAG GGG ACT GGG ACT GCA GCA GTT GTT AAC AAC ATT ATT CAA CAA AGC AGC ATG ATG GTG GTG 1018 1018 Gin Gin Ile authorities Thr Thr Gly Gly Leu Lion Lys Lys Gly Gly Thr Thr Ala That Val Wave Asn Assn Ile authorities Gin Gin Ser Serum Met meth Val Wave 310 310 315 315 320 320 825 825 TTT TTT GGC GGC AAC AAC ATG ATG GGA GGA AAC AAC ACT ACT TCA TCA GGA GGA ACT ACT GGT GGT GTT GTT CTT CTT TTC TTC ACT ACT AGG AGG 1066 1066 Phe Phew Gly Gly Asn Assn Met meth Gly Gly Asn Assn Thr Thr Ser Serum Gly Gly Thr Thr Gly Gly Val Wave Leu Lion Phe Phew Thr Thr Arg Arg 325 325 330 330 335 335 AAC AAC CCA Approx AGC AGC ACC acc GGT GGT GAG GAG AAG AAG AAG AAG CTA CTA TAC cue GGG GGG GAG GAG TTT TTT TTA TTA ATC ATC AAT AAT 1114 1114 830 830 Asn Assn Pro Pro Ser Serum Thr Thr Gly Gly Glu Goofy Lys Lys Lys Lys Leu Lion Tyr Tyre Gly Gly Glu Goofy Phe Phew Leu Lion Ile authorities Asn Assn 340 340 345 345 350 350 355 355 GCT GTC CAG CAG GGA GGA GAG GAG GAT NECK GTT GTT GTT GTT GCT GTC GGG GGG ATC ATC AGA ACA AGA ACA CCA Approx GAA GAA GAT NECK TTG TTG 1162 1162 835 835 Ala That Gin Gin Gly Gly Glu Goofy Asp Asp Val Wave Val Wave Ala That Gly Gly Ile authorities Arg Arg Thr Thr Pro Pro Glu Goofy Asp Asp Leu Lion 360 360 365 365 370 370 GGG GGG ACC acc ATG ATG GAG GAG ACT ACT TGC TGC ATG ATG CCT CCT GAT NECK GCA GCA . TAC . cue l AAA l AAA . GAG . GAG I CTT I CTT GTG GTG GAG GAG 1210 1210 840 840 Gly Gly Thr Thr Met meth Glu Goofy Thr Thr Cys Cys Met meth Pro Pro Asp Asp Ala That Tyr Lys Tyr Lys Glu Leu Glu Leo i Val and Val Glu Goofy 375 375 380 380 385 385 AAC AAC TGC TGC GAG GAG ATC ATC TTA TTA GAG GAG GGA GGA > CAC > CAC TAC cue : AAA GA1 : AAA GA1 ' ATG ATG GAT ' ATG ATG GAT ‘ ATT GAA ' ATT GAA 1258 1258

RO 118451 ΒRO 118451 B

Asn Cys Glu Ile Leu Glu Gly His Tyr Lys Asp Met Met Asp Ile Glu Asn Cys Glu Ile Leu Glu Gly His Tyr Lys Asp Met Met Asp Ile Glu 845 & 845 & 390 390 395 395 400 400 TTC TTC ACA ACA GTT GTT CAA CAA GAA . GAA. RAC CANCER AGG AGG CTT CTT TGG 1 TGG 1 WG TTG CAA WG TTG CAA TGC ( TGC ( :ga . :yeah. ACA ACA GGG GGG 1306 1306 Phe Phew Thr Thr Val Wave Gin Gin Glu . Goofy. Asn Assn Arg Arg Leu Lion Trp i Trp i <et Leu Gin Leo Gin Cys J Cys J \rg \rg Thr Thr Gly Gly 405 405 410 410 415 415 850 850 AAA. AAA. CGT CGT ACT ACT GGT GGT AAA AAA GGT GGT GCA GCA GTG GTG aga ; yes ATT GCA CTA ATT GCA CTA GAT ATG ATG NECK GTG GTG AAC AAC 1354 1354 Lys Lys Arg Arg Thr Thr Gly Gly Lys Lys Gly Gly Ala That Val Wave Arg Arg Xle Ala Val Xle Ala Val Asp Met Asp Met Val Wave Asn Assn 420 420 425 425 430 430 435 435 855 855 GAA GAA GGG GGG CTA CTA ATT ATT GAT NECK ACT ACT AGA AGM ACA ACA GCA . GCA. ATT AAG AGG ATT AAG AGG GTT GTT GAG GAG ACT ACT CAA CAA 1402 1402 Glu Goofy Gly Gly Leu Lion Ile authorities ASp ASp Thr Thr Arg Arg Thr Thr Ala That Ile Lys Arg Ile Lys Arg Val Wave Glu Goofy Thr Thr Gin Gin CAT HOW CTA CTA GAT NECK CAG CAG 440 CTT 440 CTT CTT CTT CAT HOW CCA Approx CAG CAG 445 TTT GAG GAT 445 TTT GAG THROAT CCG GCC TCT TCT 450 GCT 450 GTC TAC cue 1450 1450 860 860 His His Leu Lion Asp Asp Gin Gin Leu Lion Leu Lion His His Pro Pro Gin Gin Phe Glu Asp Phe Glu Asp Pro Pro Ser Serum Ala That Tyr Tyre - - 455 455 460 460 465 465 AAA AAA AGC AGC CAT HOW GTG GTG GTA GTA GCA GCA ACC acc GGT GGT TTG TTG CCA GCA TCC CCA GCA TCC CCC CCC GGG GGG GCA GCA GCT GTC 1498 1498 865 865 Lys Lys Ser Serum His His Val Wave Val Wave Ala That Thr Thr Gly Gly Leu Lion Pro Ala Ser Pro Ala Ser Pro Pro Gly Gly Ala That Ala That 470 470 475 475 480 480 GTG GTG GGA GGA CAG CAG GTT GTT TOT ALL TTT TTT ACT ACT GCA GCA GAG GAG GAT GCA GAA GAT GCA GAA l ACA l ACA TGG TGG CAI repent * GCA * GCA 1546 1546 870 870 Val Wave Gly Gly Gin Gin Val Wave Cya Cya Phe Phew Ser Serum Ala That Glu Goofy Asp Ala Glu Thr Asp Ala Glu Thr Trp Trp Hia Hiya > Ala > Alas 485 485 490 490 495 495 CAA CAA GGA GGA AAG AAG AGT AGT GCT GTC ATC TTG ATC TTG GTA GTA ► AGG ► AGG ACC GAA ACi ACC GAA ACi L AGC L AGC CCA Approx . GAA GAT . GAA THROAT 1594 1594 Gin Gin Gly Gly Lys Lys Ser Serum Ala That Ile authorities Leu Lion Val Wave Arg Arg Thr Glu Thr Ser Thr Glu Thr Ser Pro Pro > Glu Asp > Glu Asp 875 875 500 500 505 505 510 510 515 515 GTT GTT GGT GGT GGT GGT ATG ATG î CAT in CAT GCA GCA . GCA . GCA . GCT . GTC ' GGA 'GGA ATC TTA ACC GCT ATC TTA ACC GCT AGJ AGJ L GGA GGC L GGA GGC 1642 1642 Val Wave Gly Gly Gly Gly Met meth . His . His Ala That i Ala take to i Ala take to i Gly and Gly Ile Leu Thr Ala Ile Leo Thr Ala Arc Spring j Gl; jGl; !f Gly !f Gly 880 880 520 520 525 525 530 530

ATG ACA TCA CATATG ACA TCA CAT

GCA GCG GTG GTG GCT CGC GGA TGG GGC AAA TGT TGTGCA GCG GTG GTG GCT CGC GGA TGG GGC AAA TGT TGT

16901690

RO 118451 ΒRO 118451 B

885 885 Met Thr Ser His Ala Ala Val Val Ala Arg Gly Trp Gly Lys Cya Cys Met Thr Ser His Ala Ala Val Val Ala Arg Gly Trp Gly Lys Cya Cys 535 540 545 535 540 545 GTT ICC GOT TOT GCT GAT ATT CGT OTG AAC GAT GAT ATG AAG ATT TTT 1738 GTT ICC GOT TOT GCT GAT ATT CGT OTG AAC GAT GAT ATG AAG ATT TTT 1738 890 890 Val Ser Gly Cys Ala Asp Ile Arg Val Asn Asp Asp Met Lys Ile Phe Val Ser Gly Cys Ala Asp Ile Arg Val Asn Asp Asp Met Lys Ile Phe 550 555 560 ACG ATT GGC GAC COT OTG ATT AAA GAA GGC GAC TGG CTT TCT CTT AAT 1786 550 555 560 ACG ATT GGC GAC COT OTG ATT AAA GAA GGC GAC TGG CTT TCT CTT AAT 1786 895 895 Thr Ile Gly Asp Arg Val Ile Lys Glu Gly Asp Trp Leu Ser Leu Asn Thr Ile Gly Asp Arg Val Ile Lys Glu Gly Asp Trp Leu Ser Leu Asn 565 570 575 GOT ACA ACT GOT GAA OTC ATA TTG GOT AAA CAG CTA CTG GCT CCA CCT 1834 565 570 575 GOT ACA ACT GOT GAA OTC ATA TTG GOT AAA CAG CTA CTG GCT CCA CCT 1834 900 ί 900 и Gly Thr Thr Gly Glu Val Ile Leu Gly Lys Gin Leu Leu Ala Pro Pro 580 585 590 595 GCA ATG AGC AAT GAC TTA GAA ATA TTC ATG TCA TGG GCT GAT CAA GCA 1882 Gly Thr Thr Gly Glu Val Ile Leu Gly Lys Gin Leu Leu Ala Pro Pro 580 585 590 595 GCA ATG AGC AAT GAC TTA GAA ATA TTC ATG TCA TGG GCT GAT CAA GCA 1882 -i 905 them 905 Ala Met Ser Asn Asp Leu Glu Ile Phe Met Ser Trp Ala Asp Gin Ala 600 605 610 AGG COT CTC AAG GTT ATG GCA AAT GCA GAC ACA CCT AAT GAT GCA TTA 1930 Ala Met Ser Asn Asp Leu Glu Ile Phe Met Ser Trp Ala Asp Gin Ala 600 605 610 AGG COT CTC AAG GTT ATG GCA AAT GCA GAC ACA CCT AAT GAT GCA TTA 1930 910 910 Arg Arg Leu Lys Val Met Ala Asn Ala Asp Thr Pro Asn Asp Ala Leu 615 620 625 ACA GCC AGA AAC AAT GCT GCA CAA GGG ATC GGG CTC TCT AGA ACT GAA 1978 Arg Arg Leu Lys Val Met Ala Asn Ala Asp Thr Pro Asn Asp Ala Leu 615 620 625 ACA GCC AGA AAC AAT GCT GCA CAA GGG ATC GGG CTC TCT AGA ACT GAA 1978 915 915 Thr Ala Arg Asn Asn Gly Ala Gin Gly Ile Gly Leu Cys Arg Thr Glu 630 635 640 CAT ATG TTT TTC GCT TCT GAT GAG AGG ATC AAA GCT CTA AGA AAG ATG 2026 Thr Ala Arg Asn Asn Gly Ala Gin Gly Ile Gly Leu Cys Arg Thr Glu 630 635 640 CAT ATG TTT TTC GCT TCT GAT GAG AGG ATC AAA GCT CTA AGA AAG ATG 2026 920 920 His Met Phe Phe Ala Ser Asp Glu Arg Ile Lys Ala Val Arg Lys Met 645 650 655 ATC ATG GCG CTC ACT CCA GAA CAA AGA AAA OTG GCT CTA GAT CTC TTA 2074 His Met Phe Phe Ala Ser Asp Glu Arg Ile Lys Ala Val Arg Lys Met 645 650 655 ATC ATG GCG CTC ACT CCA GAA CAA AGA AAA OTG GCT CTA GAT CTC TTA 2074 ! 925 ! 925 Ile Met Ala Val Thr Pro Glu Gin Arg Lys Val Ala Leu Asp Leu Leu 660 665 670 675 CTC CCA TAC CAA AGA TCC GAT TTT GAG GGC ATT TTC CGA GCA ATG GAT 2122 Ile Met Ala Val Thr Pro Glu Gin Arg Lys Val Ala Leu Asp Leu Leu 660 665 670 675 CTC CCA TAC CAA AGA TCC GAT TTT GAG GGC ATT TTC CGA GCA ATG GAT 2122

RO 118451 ΒRO 118451 B

Leu Pro Tyr Gin Arg Ser Asp Phe Glu Gly lle Phe Arg Ala Met Asp 680 685 690=Leu Pro Tyr Gin Arg Ser Asp Phe Glu Gly lle Phe Arg Ala Met Asp 680 685 690=

GGA CTT CCT GTA ACT ATC CGC CTT CTA GAC CCT CCA CTT CAT GAG TTT 2170GGA CTT CCT GTA ACT ATC CGC CTT CTA GAC CCT CCA CTT CAT GAG TTT 2170

Gly Leu Pro Val Thr lle Arg Leu Leu Asp Pro Pro Leu His Glu Phe 695 700705Gly Leu Pro Val Thr lle Arg Leu Leu Asp Pro Pro Leu His Glu Phe 695 700705

TTA CCC GAA GCT GAT CTA GAA CAC ATA CTG AAC GAA CTT GCA GTC GAC 2218 935TTA CCC GAA GCT GAT CTA GAA CAC ATA CTG AAC GAA CTT GCA GTC GAC 2218 935

Leu Pro Glu Gly Asp Leu Glu His Xle Val Aan Glu Leu Ala Val Aap 710 715720Leu Pro Glu Gly Asp Leu Glu His Xle Val Aan Glu Leu Ala Val Aap 710 715720

ACA GGC ATG ACT GCA GAT GAA ATC TAT TCA AAA ATC GAA AAT CTA TCT2266ACA GGC ATG ACT GCA GAT GAA ATC TAT TCA AAA ATC GAA AAT CTA TCT2266

Thr Gly Met Ser Ala Aap Glu lle Tyr Ser Lys lle Glu Asn Leu Ser940Thr Gly Met Ser Ala Aap Glu lle Tyr Ser Lys lle Glu Asn Leu Ser940

725 730735725 730735

GAA GTG AAC CCT ATG CTT GGT TTC CGT GGT TGC AGA TTA GGG ATT TCA2314GAA GTG AAC CCT ATG CTT GGT TTC CGT GGT TGC AGA TTA GGG ATT TCA2314

Glu Val Asn Pro Met Leu Gly Phe Arg Gly Cys Arg Leu Gly lleSerGlu Val Asn Pro Met Leu Gly Phe Arg Gly Cys Arg Leu Gly lleSer

945945

740 745 750755740 745 750755

TAC CCC GAG CTA ACA GAA ATG CAA GTT CGT GCG ATC TTT CAA GCT GCA2362TAC CCC GAG CTA ACA GAA ATG CAA GTT CGT GCG ATC TTT CAA GCT GCA2362

Tyr Pro Glu Leu Thr Glu Met Gin Val Arg Ala lle Phe Gin AlaAlaTyr Pro Glu Leu Thr Glu Met Gin Val Arg Ala lle Phe Gin AlaAla

950 760 765770950 760 765770

GTG TCT ATG ACC AAT CAG GGG GTG ACT GTA ATA CCA GAG ATC ATG CTT2410GTG TCT ATG ACC AAT CAG GGG GTG ACT GTA ATA CCA GAG ATC ATG CTT2410

Val Ser Met Thr Aan Gin Gly Val Thr Val Xle Pro Glu lle Met ValVal Ser Met Thr Aan Gin Gly Val Thr Val Xle Pro Glu lle Met Val

775 780 785955775 780 785955

CCG TTA CTG GGG ACA CCT CAG GAA TTA CCT CAT CAA ATC AGT GTA ATT2458CCG TTA CTG GGG ACA CCT CAG GAA TTA CCT CAT CAA ATC AGT GTA ATT2458

Pro Leu Val Gly Thr Pro Gin Glu Leu Arg His Gin lle Ser Val XlePro Leu Val Gly Thr Pro Gin Glu Leu Arg His Gin lle Ser Val Xle

790 795800790 795800

960960

CGT GGA CTA GCT GCA AAT GTG TTT GCT GAA ATG GGG GTG ACA TTG GAA 2506CGT GGA CTA GCT GCA AAT GTG TTT GCT GAA ATG GGG GTG ACA TTG GAA 2506

Arg Gly Val Ala Ala Asn Val Phe Ala Glu Met Gly Val Thr Leu Glu ®05 810815Arg Gly Val Ala Ala Asn Val Phe Ala Glu Met Gly Val Thr Leu Glu ®05 810815

965965

TAT AAA CTG GGA ACG ATG ATT GAG ATT CCT CGA GCT GCT TTA ATA GCT 2554TAT AAA CTG GGA ACG ATG ATT GAG ATT CCT CGA GCT GCT TTA ATA GCT 2554

RO 118451 ΒRO 118451 B

970 970 Tyr Lys Val 820 Tyr Lys Wave 820 Gly Gly Thr Met Ile Glu Ile Pro Arg Ala Ala Leu Ile Ala Thr Met Ile Glu Ile Pro Arg Ala Ala Leu Ile Ala 825 825 830 830 835 835 GAA GAA GAG GAG ATT ATT GGA GGA AAA AAA GAA GAA GCT GTC GAT NECK TTC TTC TTT TTT TCG TCG TTT TTT GGA GGA ACC acc AAT AAT GAT NECK 2602 2602 Glu Goofy Glu Goofy Ile authorities Gly Gly Lys Lys Glu Goofy Ala That Asp Asp Phe Phew Phe Phew Ser Serum Phe Phew Gly Gly Thr Thr Asn . Assn. Asp Asp 975 975 840 840 845 845 850 850 CTG ACC CTG ACC CAG CAG ATG ATG ACA ACA TTT TTT GGG GGG TAC cue AGC AGC AGA AGM GAT NECK GAT NECK GTT GTT GGC GGC AAG AAG TTT TTT 2650 2650 Leu Lion Thr Thr Gin Gin Met meth Thr Thr Phe Phew Gly Gly Tyr Tyre Ser Serum Arg Asp Arg Asp Asp Asp Val Wave Gly Gly Lys Lys Phe Phew 855 855 860 860 865 865 980 980 TTG TTG CAG CAG ATT ATT TAT DAD CTT CTT GCT GTC CAA CAA GGC GGC ATT ATT CTG CTG CAG CAG CAT HOW GAT NECK CCA Approx TTT TTT GAG GAG 2698 2698 Leu Lion Gin Gin Ile authorities Tyr Tyre Leu Lion Ala That Gin Gin Gly Gly Ile authorities Leu Lion Gin Gin H1S H1S Asp Asp Pro Pro Phe Phew Glu Goofy 870 870 875 875 880 880 985 985 GTT GTT ATT ATT GAC GAC CAG CAG AAA AAA GGG GGG GTG GTG GGT GGT CAG CAG TTG TTG ATT ATT AAG AAG ATG ATG GCT GTC ACG ACG GAG GAG 2746 2746 Val Wave Ile authorities Asp Asp Gin Gin Lys Lys Gly Gly Val Wave Gly Gly Gin Gin Leu Lion Ile authorities Lys Lys Met meth Ala That Thr Thr Glu Goofy 885 885 890 890 895 895 990 990 AAA AAA GGT GGT CGT CGT GCA GCA GCA GCA AAT AAT CCT CCT AAC AAC TTA AAG TTA AAG GTT GTT GGG GGG ATA THREAD TGT TGT GGG GGG GAG GAG 2794 2794 Lys Lys Gly Gly Arg Arg Ala That Ala That Asn Assn Pro Pro Asn Assn Leu Lion Lys Lys Val Wave Gly Gly Ile authorities cys cys Gly Gly Glu Goofy 900 900 905 905 910 910 915 915 CAT HOW GGT GGT GGG GGG GAG GAG CCT CCT TCT TCT TCT TCT GTT GTT GCA GCA TTT TTT TTT TTT GAT NECK GGA GGA GTT GTT GGA GGA CTA CTA 2842 2842 995 995 Hi s Hi s Gly Gly Gly Gly Glu Goofy Pro Pro Ser Serum Ser Serum Val Wave Ala That Phe Phew Phe Phew Asp Asp Gly Gly Val Wave Gly Gly Leu Lion 920 920 925 925 930 930 GAT NECK TAT DAD GTG GTG TCG TCG TGC TGC TCT TCT CCA Approx TTT TTT AGG AGG GTT GTT CCT CCT ATC ATC GCA GCA AGG AGG - TTG - TTG GCC GCC 2890 2890 1000 1000 Asp Asp Tyr Tyre Val Wave Ser Serum Cys Cys Ser Serum Pro Pro Phe Phew Arg Arg Val Wave Pro Pro Ile authorities Ala That Arg Arg Leu Lion Ala That 935 935 940 940 945 945 GCT GTC GCA GCA CAA CAA GTC GTC ATT ATT GTT GTT TAAGCTT TAAGCT 2915 2915 1005 1005 Ala That Ala That Gin Gin Val Wave Ile authorities Val Wave

950950

RO 118451 ΒRO 118451 B

SECV ID NR: 2SEQ ID NO: 2

LUNGIMEA SECVENȚEI: 288o 1010SEQUENCE LENGTH: 288o 1010

TIPUL SECVENȚEI: acid nucleicSEQUENCE TYPE: nucleic acid

DESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

CGGCGCAGTA GGGGATCGGA AGG ATG GCG GCA TCG GTT TCC AGG GCC ATC TGC 53CGGCGCAGTA GGGGATCGGA AGG ATG GCG GCA TCG GTT TCC AGG GCC ATC TGC 53

Met Ala Ala Ser Val Ser Arg Ala Ile CysMet Ala Ala Ser Val Ser Arg Ala Ile Cys

1 1 5 5 10 10 GTA CAG AAG GTA CAG AAG CCG GGC TCA AAA CCG GGC TCA AAA TGC ACC AGG GAC AGG GAA TGC ACC AGG GAC AGG GAA GCG ACC TCC 101 1020 GCG ACC TCC 101 1020 Val Gin Lya Wave Gin Lya Pro Gly Ser Lya Pro Gly Ser Lya Cys Thr Arg Asp Arg Glu . Cys Thr Arg Asp Arg Glu . Ala Thr Ser Ala Thr Ser 15 15 20 20 25 25 TTC GCC CGC TTC GCC CGC CGA TCG GTC GCA CGA TCG GTC GCA GCG CCG AGG CCC CCG CAC GCG CCG AGG CCC CCG CAC GCC AAA GCC 149 1025 GCC AAA GCC 149 1025 Phe Ala Arg Phe Ala Arg Arg Ser Val Ala Arg Ser Val Ala Ala Pro Arg Pro Pro His Ala Pro Arg Pro Pro His Ala Lys Ala Ala Lys Ala 30 30 35 35 40 40 CGC CGG CGT CGC CGG CGT CAT CCG CTC CGA CAT CCG CTC CGA CTC CGG CGC GGG ACG GGG CTC CGG CGC GGG ACG GGG CCA CAT TGC 197 CCA CAT TGC 197 Arg Arg Arg Arg Arg Arg His Pro Leu Arg His Pro Leo Arg Leu Arg Arg Gly Thr Gly Leo Arg Arg Gly Thr Gly 1030 Pro His Cys 1030 Pro His Cys 45 45 50 55 50 55 TCG CCG CTG TCG CCG CTG AGG GCC GTC GTT AGG GCC GTC GTT GAC GCC GCG CCG ATA CAG GAC GCC GCG CCG ATA CAG ACG ACC AAA 245 ACG ACC AAA 245 Ser Pro Leu Serum Pro Leo Arg Ala Val Val Arg Ala Val Val Asp Ala Ala Pro Xle Gin Asp Ala Ala Pro Xle Gin Thr Thr Lys 1035 Thr Thr Lys 1035 60 60 65 65 70 70 AAG AGG GTG AAG AGG GTG TTC CAC TTC GGC TTC CAC TTC GGC AAG GGC AAG AGC GAG GGC AAG GGC AAG AGC GAG GGC AAC AAG ACC 293 AAC AAG ACC 293 Lys Arg Val Lys Arg Val Phe His Phe Gly Phe His Phe Gly Lys Gly Lys Ser Glu Gly Lys Gly Lys Ser Glu Gly Asn Lys Thr i040Asn Lys Thr i 04 0 75 75 80 80 85 85 90 90 ATG AAG GAA ATG AAG GAA CTG CTG GGC GGC CTG CTG GGC GGC AAG GGC GCG AAC CTG GCG AAG GGC GCG AAC CTG GCG GAG ATG GCG 341 GAG ATG GCG 341 Met Lya Glu Met Lya Glu Leu Leu Gly Gly Leo Leo Gly Gly Lys Gly Ala Asn Leu Ala Lys Gly Ala Asn Leu Ala Glu Met Ala Glu Met Ala 1045 1045 95 95 100 100 105 105 AGC ATC GGG AGC ATC GGG CTG TCG GTG CCG CTG TCG GTG CCG CCA GGG TTC ACG GTG TCG CCA GGG TTC ACG GTG TCG î ACG GAG GCG 389 î ACG GAG GCG 389 Ster ile Gly Gly sterils Leu Ser Val Pro Leo Ser Val Pro Pro Gly Phe Thr Val Ser Pro Gly Phe Thr Val Ser Thr Glu Ala Thr Glu Ala

10501050

RO 118451 ΒRO 118451 B

110 110 115 115 120 120 1055 1055 TGC TGC CAG CAG CAG CAG TAC cue CAG < CAG < GAC < GAC < GCC i GCC i GGG ' GGG' CGC 1 GTC 1 SCC 1 SCC 1 CTC CTC CCC c CCC v ÎCG ICG GGG < GGG < CTC 1 CTC 1 CGG CGG 437 437 Cys Cys Gin Gin Gin Gin Tyr Tyre Gin . Gin. Aap j Aap j Ma i May Gly ' Gly' Cys J Cys J Ma : Me: Leu Lion Pro 1 Pros 1 Ma Me Gly Gly Leu ’ Leo' Prp Fr 125 125 130 130 L35 L35 1060 1060 GCC GCC GAG GAG ATC ATC CTC CTC GAC GAC GGC GGC CTG CTG CAG CAG TGG TGG GTG GTG GAG GAG GAG ' GAG' PAC CAP ATG ATG GGC । GGC । GCC GCC 4B5 4B5 Ala That Glu Goofy Ile authorities Val Wave Aap Aap Gly Gly Leu Lion Gin Gin Trp Trp Val Wave Glu Goofy G1U ' G1U' ryr ryr Met meth Gly . Gly. Ma Me 140 140 145 145 150 150 1065 1065 ACC acc CTG CTG GGC GGC GAT NECK CCG GCC CAG CAG CGC CGC CCG GCC CTC CTC CTG CTG CTC CTC TCC । TCC CTC CTC CGC CGC TCC TCC GGC GGC 533 533 Thr Thr Leu Lion Gly Aap Gly Aap Pro Pro Gin Gin Arg Arg Pro Pro Leu Lion Leu Lion Leu Lion Ser Serum Val Wave Arg Arg Ser Serum Gly Gly 155 155 160 160 165 165 170 170 GCC GCC GCC GCC CTG CTG TCC TCC ATG ATG CCC CCC GGC GGC ATG ATG ATG ATG GAC GAC ACG ACG GTG GTG CTC CTC AAC AAC CTG CTG GGG GGG 581 581 1070 1070 Ala That Ala That Val Wave Ser Serum Met meth Pro Pro Gly Gly Met meth Met meth Aap Aap Thr Thr Val Wave Leu Lion Aan Aan Leu Lion Gly Gly 175 175 180 180 185 185 CTC CTC AAC AAC GAC GAC GAA GAA GTG GTG GCC GCC GCC GCC GGG GGG CTG CTG GCG GCG GCC GCC AAG AAG AGC AGC GGG GGG GAG GAG CGC CGC 629 629 1075 1075 Leu Lion Aan Aan Aap Aap Glu Goofy Val Wave Ala That Ala That Gly Gly Leu Lion Ala That Ala That Lys Lys Ser Serum Gly Gly Glu Goofy Arg Arg 190 190 195 195 200 two hundred TTC TTC GCC GCC TAC cue GAC GAC TCC TCC TTC TTC CGC CGC CGC CGC TTC TTC CTC CTC GAC GAC ATG ATG TTC TTC GGC GGC AAC AAC GTC GTC 677 677 Phe Phew Ala That Tyr Tyre Aap Aap ser serum Phe Phew Arg Arg Arg Arg Phe Phew Leu Lion Aap Aap Met meth Phe Phew : Gly : Gly Aan Aan val wave 1080 1080 205 205 210 210 215 215 GTC GTC ATG ATG GAC GAC ATC ATC CCC CCC CGC CGC TCA TCA CTG CTG TTC TTC GAA GAA . GAG AAG . GAG AAG CTT CTT ' GAG 'GAG CAC CaCl ATG ATG 725 725 Val Wave Met meth Aap Aap Ile authorities Pro Pro Arg Arg Ser Serum Leu Lion Phe Phew Glu Goofy . Glu . Goofy l Lys l Lys Leu Lion i Glu and Glu His His Met meth 1085 1085 220 220 225 225 230 230 AAG AAG GAA GAA TCC TCC AAG AAG GGG GGG CTG CTG AAG AAG i AAC and AAC GAC GAC ACC acc ! GAC ! GAC : ctc : ctc ACG GCC ACG GCC : tct : tct GAC GAC 773 773 Lys Lys Glu Goofy Ser Serum Lys Lys Gly Gly Leu Lion Lys Lys Asn Assn Aap Aap Thr Asț Thr Ass > Leu > Leo Thr Ala Thr Ala i Ser and Ser Aap Aap 1090 1090 235 235 240 240 245 245 250 250 CTC CTC AAA AAA . GAG . GAG CTC CTC GTG GTG ggt ggt CAG CAG ί TAC и TAC : AAG : AAG r GAC r GAC ; GTC TAC ; GTC TAC : CTC TCJ : CTC TCJ l GCC in GCC : AAG : AAG 821 821 Leu Lion Lys Lys Glu Goofy . Leu . Lion Val Wave Gly Gly ’ Gir ’ Gir i Tyr Lys and Tyr Lys 1 Glu Va: 1 Glu Va: 1 Tyr Leu Șei 1 Tyr Leo Shei c Ala c Ala i Lys and Lys

RO 118451 ΒRO 118451 B

GGA GGA GAG GAG 255 255 260 260 265 265 1095 1095 CCA TTC CCA TTC CCC CCC TCA GAC CCC TCA GAC CCC AAG AAG AAG AAG CAG CAG CTG CTG GAG GAG CTA GCA GTG CTA GCA GTG 869 869 Gly Gly Glu Goofy Pro Pro Phe Phew Pro Pro Ser Serum Asp Asp Pro Pro Lys Lys Lys Lys Gin Gin Leu Lion Glu Goofy Leu Lion Ala That Val Wave 270 270 275 275 280 280 CTG CTG GCT GTC GTG GTG TTC TTC AAC AAC TCG TCG TGG TGG GAG GAG AGC AGC CCC CCC AGG AGG GCC GCC AAG AAG AAG AAG TAC cue AGG AGG 917 917 1100 1100 Leu Lion Ala That Val Wave Phe Phew Asn Assn ser serum Trp Trp Glu Goofy Ser Serum Pro Pro Arg Arg Ala That Lys Lys Lys Lys Tyr Tyre Arg Arg 285 285 290 290 295 295 AGC AGC ATC ATC AAC AAC CAG CAG ATC ATC ACT ACT GGC GGC CTC CTC AGG AGG GGC GGC ACC acc GCC GCC GTG GTG AAC AAC GTG GTG CAG CAG 965 965 1105 1105 Ser Serum Ile authorities Asn Assn Gin Gin Ile authorities Thr Thr Gly Gly Leu Lion Arg Arg Gly Gly Thr Thr Ala That Val Wave Asn Assn Val Wave Gin Gin 300 300 305 305 310 310

TGC ATG GTG TTC GGC TGC ATG GTG TTC GGC AAC ATG GGG AAC ACT TCT GGC ACC GGC GTG CTC AAC ATG GGG AAC ACT TCT GGC ACC GGC GTG CTC 1013 1013 1110 1110 Cys Met Val Phe Gly 315 Cys Met Val Phe Gly 315 Asn Met Gly Asn Thr Ser Gly Thr Gly Val Leu 320 325 330 Asn Met Gly Asn Thr Ser Gly Thr Gly Val Leu 320 325 330 TTC ACC AGG AAC CCC TTC ACC AGG AAC CCC AAC ACC GGA GAG AAG AAG CTG TAT GGC GAG TTC AAC ACC GGA GAG AAG AAG CTG TAT GGC GAG TTC 1061 1061 Phe Thr Arg Asn Pro 335 Phe Thr Arg Asn Pro 335 Asn Thr Gly Glu Lys Lys Leu Tyr Gly Glu Phe 340 345 Asn Thr Gly Glu Lys Lys Leu Tyr Gly Glu Phe 340 345 1115 1115 CTG GTG AAC GCT CAG CTG GTG AAC GCT CAG GOT GAG GAT GTG GTT GCC GGA ATA AGA ACC CCA GOT GAG GAT GTG GTT GCC GGA ATA AGA ACC CCA 1109 1109 Leu Val Asn Ala Gin 350 Leo Val Asn Ala Gin 350 Gly Glu Asp Val Val Ala Gly Ile Arg Thr Pro 355 360 Gly Glu Asp Val Val Ala Gly Ile Arg Thr Pro 355 360 1120 1120 GAG GAC CTT GAC GCC Glu Asp Leu Asp Ala 365 GAG GAC CTT GAC GCC Glu Asp Leu Asp Ala 365 ATG AAG AAC CTC ATG CCA CAG GCC TAC GAC GAG Met Lys Asn Leu Met Pro Gin Ala Tyr Asp Glu 370 375 ATG AAG AAC CTC ATG CCA CAG GCC TAC GAC GAG Met Lys Asn Leu Met Pro Gin Ala Tyr Asp Glu 370 375 1157 1157 1125 1125 CTT GTT GAG AAC TGC Leu Val Glu Asn Cys 380 CTT GTT GAG AAC TGC Leu Val Glu Asn Cys 380 AAC ATC CTG GAG AGC CAC TAC AAG GAA ATG CAG Asn Ile Leu Glu Ser His Tyr Lys Glu Met Gin 385 390 AAC ATC CTG GAG AGC CAC TAC AAG GAA ATG CAG Asn Ile Leu Glu Ser His Tyr Lys Glu Met Gin 385 390 1205 1205 1130 1130 GAT ATC GAG TTC ACT Asp Ile Glu Phe Thr GAT ATC GAG TTC ACT Asp Ile Glu Phe Thr GTC CAG GAA AAC AGG CTG TGG ATG TTG CAG TGC Val Gin Glu Asn Arg Leu Trp Met Leu Gin Cys GTC CAG GAA AAC AGG CTG TGG ATG TTG CAG TGC Val Gin Glu Asn Arg Leu Trp Met Leu Gin Cys 1253 1253

RO 118451 Β 1135 395 400 405 410RO 118451 Β 1135 395 400 405 410

Η AGG ACA GGG AAA COT ACG GGC AAA AGT GCC OTG AAG ATC GCC GTG GAC 1301 Arg Thr Gly Lys Arg Thr Gly Lys Ser Ala Val Lys lle Ala Val Asp 1140 415 420 425 j ATG GTT AAC GAG GGC CTT GTT GAG CCC CGC TCA GCG ATC AAG ATG OTA 1349Η AGG ACA GGG AAA COT ACG GGC AAA AGT GCC OTG AAG ATC GCC GTG GAC 1301 Arg Thr Gly Lys Arg Thr Gly Lys Ser Ala Val Lys lle Ala Val Asp 1140 415 420 425 j ATG GTT AAC GAG GGC CTT GTT GAG CCC CGC TCA GCG ATC AAG ATG OTA 1349

Met Val Asn Glu Gly Leu Val Glu Pro Arg Ser Ala lle Lys Met Val j 1145 430 435440Met Val Asn Glu Gly Leu Val Glu Pro Arg Ser Ala lle Lys Met Val j 1145 430 435440

GAG CCA GGC CAC CTG GAC CAG CTT CTT CAT CCT CAG TTT GAG AAC CCG1397GAG CCA GGC CAC CTG GAC CAG CTT CTT CAT CCT CAG TTT GAG AAC CCG1397

Glu Pro Gly His Leu Asp Gin Leu Leu His Pro Gin Phe Glu AsnProGlu Pro Gly His Leu Asp Gin Leu Leu His Pro Gin Phe Glu AsnPro

1150 445 450455 1150 445 450455

TCG GCG TAC AAG GAT CAA GTC ATT GCC ACT GOT CTG CCA GCC TCA CCT1445 j Ser Ala Tyr Lys Asp Gin Val lle Ala Thr Gly Leu Pro Ala Ser ProTCG GCG TAC AAG GAT CAA GTC ATT GCC ACT GOT CTG CCA GCC TCA CCT1445 j Ser Ala Tyr Lys Asp Gin Val lle Ala Thr Gly Leu Pro Ala Ser Pro

460 465470460 465470

1155 j GGG GCT GCT GTG GGC CAG GTT OTG TTC ACT GCT GAA GAT GCT GAA GCA 14931155 j GGG GCT GCT GTG GGC CAG GTT OTG TTC ACT GCT GAA GAT GCT GAA GCA 1493

Gly Ala Ala Val Gly Gin Val Val Phe Thr Ala Glu Asp Ala Glu Ala *75 480 485490Gly Ala Ala Val Gly Gin Val Val Phe Thr Ala Glu Asp Ala Glu Ala *75 480 485490

1160 TGG CAT TCC CAA GGG AAA GCT GCT ATT CTG OTA AGG GCG GAG ACC AGC 1541 Trp His Ser Gin Gly Lys Ala Ala lle Leu Val Arg Ala Glu Thr Ser 495 5005051160 TGG CAT TCC CAA GGG AAA GCT GCT ATT CTG OTA AGG GCG GAG ACC AGC 1541 Trp His Ser Gin Gly Lys Ala Ala lle Leu Val Arg Ala Glu Thr Ser 495 500505

CCT GAG GAC GTT GOT GGC ATG CAC GCT GCT GTG GGG ATT CTT ACA GAG 1589 1103 Pro Glu Asp Val Gly Gly Met His Ala Ala Val Gly lle Leu Thr Glu 510 515 520CCT GAG GAC GTT GOT GGC ATG CAC GCT GCT GTG GGG ATT CTT ACA GAG 1589 1103 Pro Glu Asp Val Gly Gly Met His Ala Ala Val Gly lle Leu Thr Glu 510 515 520

AGG GGT GGC ATG ACT TCC CAC GCT GCT GTG GTC GCA CGT TGG TGG GGG 1637 1170 Arg Gly Gly Met Thr Ser His Ala Ala Val Val Ala Arg Trp Trp Gly i 525 530 535AGG GGT GGC ATG ACT TCC CAC GCT GCT GTG GTC GCA CGT TGG TGG GGG 1637 1170 Arg Gly Gly Met Thr Ser His Ala Ala Val Val Ala Arg Trp Trp Gly i 525 530 535

AAA TGC TGC GTC TCG GGA TGC TCA GGC ATT CGC OTA AAC GAT GCG GAG 1685AAA TGC TGC GTC TCG GGA TGC TCA GGC ATT CGC OTA AAC GAT GCG GAG 1685

11751175

Lys Cys Cys Val Ser Gly Cys Ser Gly lle Arg Val Asn Asp Ala GluLys Cys Cys Val Ser Gly Cys Ser Gly lle Arg Val Asn Asp Ala Glu

RO 118451 ΒRO 118451 B

540 540 545 545 550 550 AAG AAG CTC CTC GTG . GTG. ACG : ACG: ATC ATC GGA GGA AGC i AGC i CAT < CAT < TTG ( TTG ( CTG । CTG CGC ( GTC ( 3AA 3a GCT । GCT । GAG 1 GAG 1 PGG i PGG i CTG CTG 1733 1733 1180 1180 Lys Lys Leu Lion Val Wave Thr Thr Ile authorities Gly Gly Ser : Serum: His 1 His 1 /al : / of : Leu ; Leo; Arg Glu Arg Glu Gly Gly Glu ' Glu' Crp Crp Leu Lion 555 555 560 560 565 565 570 570 TCG TCG CTG CTG AAT AAT GGG GGG TCG TCG ACT ACT GCT GTC GAG i GAG i GTG . GTG. ATC ATC CTT i CTT i GGG GGG AAG AAG CAG CCG CAG CCG CTT CTT 1781 1781 Ser Serum Leu Lion Aan Aan Gly Gly Ser Serum Thr Thr Gly Gly Glu Goofy val wave Ile authorities Leu ' Leo' Gly Gly Lys Lys Gin : Gin: Pro Pro Leu Lion 1185 1185 575 575 580 580 585 585 TCC TCC CCA Approx CCA Approx GCC GCC CTT CTT ACT ACT GCT GTC GAT NECK CTG CTG GGA GGA ACT ' ACT' TTC TTC ATG ATG GCC GCC TGG TGG CTG CTG 1829 1829 Ser Serum Pro Pro Pro Pro Ala That Leu Lion Ser Serum Gly Asp Gly Asp Leu Lion Gly Gly Thr Thr Phe Phew Met meth Ala That Trp Trp Val Wave 1190 1190 590 590 595 595 600 600 GAT NECK GAT NECK GTT GTT AGA AGM AAG AAG CTC CTC AAG AAG CTC CTC CTG CTG GCT GTC AAC AAC GCC GCC GAT NECK ACC acc CCT CCT GAT NECK 1877 1877 Asp Asp Asp Asp Val Wave Arg Arg Lya Lya Leu Lion Lys Lys Val Wave Leu Lion Ala That Asn Assn Ala That Asp Asp Thr Thr Pro Pro ASP ASP 605 605 610 610 615 615 1195 1195 GAT NECK GCA GCA TTG TTG ACT ACT GCG GCG CGA CGA AAC AAC AAT AAT GGG GGG GCA GCA CAA CAA GGA GGA ATT ATT GGA GGA TTA TTA TGC TGC 1925 1925 Asp Asp Ala That Leu Lion Thr Thr Ala That Arg Arg Asn Assn Asn Assn Gly Gly Ala That Gin Gin Gly Gly Ile authorities Gly Gly Leu Lion Cys Cys 620 620 625 625 630 630 1200 1200 CGG CGG ACA ACA GAG GAG CAC CaCl ATG ATG TTC TTC TTT TTT GCT GTC TCA TCA GAC GAC GAG GAG AGG AGG ATT ATT AAG AAG GCT GTC GTC GTC 1973 1973 Arg Arg Thr Thr Glu Goofy His His Met meth Phe Phew Phe Phew Ala That Ser Serum Asp Asp Glu Goofy Arg Arg Ile authorities Lys Lys Ala That Val Wave 635 635 640 640 645 645 650 650 1205 1205 AGG AGG CAG CAG ATG ATG ATT ATT ATG ATG GCT GTC CCC CCC ACG ACG CTT CTT GAG GAG CTG CTG AGG AGG CAG CAG CAG CAG GCG GCG CTC CTC 2021 2021 Arg Arg Gin Gin Met meth Ile authorities Met meth Ala That Pro Pro Thr Thr Leu Lion Glu Goofy Leu Lion Arg Arg Gin Gin Gin Gin Ala That Leu Lion 655 655 660 660 665 665 GAC GAC CGT CGT CTC CTC TTG TTG ACG ACG TAT DAD CAG CAG AGG AGG TCT TCT GAC GAC TTC TTC GAA GAA . GGC . GGC : att : att TTC TTC : CCT : CCT 2069 2069 1210 1210 Asp Asp Arg Arg Leu Lion Leu Lion Thr Thr Tyr Tyre Gin Gin Arg Arg Ser Serum Asp Asp Phe Phew Glu Goofy î Gly in Gly ’ Ile Ile Phe Phew ! Arg ! Arg 670 670 675 675 680 680 GCT GTC ATG ATG GAT NECK GGA GGA CTC CTC CCG GCC GTG GTG f ACC f ACC ATC ATC : CGP : CGP l CTC l CTC CTC CTC ; GAC ; GAC : cat : quantity CCI Bcc • TCT • TCT 2117 2117 1215 1215 Ala That Met meth Asp Asp Gly Gly Leu Lion Pro Pro Val Wave Thr Thr Ile authorities i Arg and Arg 1 Leu 1 Ron i Lei and Lions 1 Asj 1 Asj ) Hla ) lol i Prc and Prc > Ser > Ser

RO 118451 ΒRO 118451 B

1220 1220 685 685 690 690 695 695 TAC cue GAG GAG TTC TTC CTT CTT CCA Approx GAA GAA GGG GGG AAC AAC ATC ATC GAG GAG GAC ATT GAC ATT GTA AGT GTA AGT GAA 1 GAA 1 ΓΤΑ ΓΤΑ 2165 2165 Tyr Tyre Glu Goofy Phe Phew Leu Lion Pro Pro Glu Goofy Gly Gly Asn Assn lle Ile Glu . Goofy. Asp Asp lle Ile Val Wave Ser Serum Glu Goofy Leu Lion 700 700 705 705 710 710 1225 1225 TGT TGT GCT GTC GAG GAG ACG ACG GGA GGA GCC GCC AAC AAC CAG CAG GAG GAG GAT NECK GCC GCC CTC CTC GCG GCG CGA CGA ATT ATT GAA GAA 2213 2213 Cys Cys Ala That Glu Goofy Thr Thr Gly Gly Ala That Asn Assn Gin Gin Glu Goofy Asp Asp Ala That Leu Lion Ala That Arg Arg lle Ile Glu Goofy ί ! и ! 715 715 720 720 725 725 730 730 1230 1230 AAG AAG CTT CTT TCA TCA GAA GAA GTA GTA AAC AAC CCG GCC ATG ATG CTT CTT GGC GGC TTC TTC CGT CGT GGG GGG TGC TGC AGG AGG CTT CTT 2261 2261 Lys Lys Leu Lion Ser Serum Glu Goofy Val Wave Asn Assn Pro Pro Met meth Leu Lion Gly Gly Phe Phew Arg Arg Gly Gly Cys Cys Arg Arg Leu Lion 735 735 740 740 745 745 1235 1235 GGT GGT ATA THREAD TCG TCG TAC cue CCT CCT GAA GAA TTG TTG ACA ACA GAG GAG ATG ATG CAA CAA GCC GCC CGG CGG GCC GCC ATT ATT TTT TTT 2309 2309 4 4 Gly Gly lle Ile Ser Serum Tyr Tyre Pro Pro Glu Goofy Leu Lion Thr Thr Glu Goofy Met meth Gin Gin Ala That Arg Arg Ala That lle Ile Phe Phew 750 750 755 755 760 760 ί и 1240 1240 GAA GAA GCT GTC GCT GTC ATA THREAD GCA GCA ATG ATG ACC acc AAC AAC CAG CAG GGT GGT GTT GTT CAA CAA GTG GTG TTC TTC CCA Approx GAG GAG 2357 2357 Glu Goofy Ala That Ala That lle Ile Ala That Met meth Thr Thr Asn Assn Gin Gin Gly Gly Val Wave Gin Gin Val Wave Phe Phew Pro Pro Glu Goofy 765 765 770 770 775 775 ATA THREAD ATG ATG GTT GTT CCT CCT CTT CTT GTT GTT GGA GGA ACA ACA CCA Approx CAG CAG GAA GAA CTG CTG GGG GGG CAT HOW CAA CAA GTG GTG 2405 2405 1245 1245 lle Ile Met meth Val Wave Pro Pro Leu Lion Val Wave Gly Gly Thr Thr Pro Pro Gin Gin Glu Goofy Leu Lion Gly Gly His His Gin Gin Val Wave I and 780 780 785 785 790 790 ACT ACT CTT CTT ATC ATC CGC CGC CAA CAA GTT GTT GCT GTC GAG GAG AAA AAA GTG GTG TTC TTC GCC GCC AAT AAT GTG GTG GGC GGC AAG AAG 2453 2453 1250 1250 Thr Thr Leu Lion lle Ile Arg Arg Gin Gin Val Wave Ala That Glu Goofy Lys Lys Val Wave Phe Phew Ala That Asn Assn val wave Gly Gly Lys Lys 795 795 800 800 805 805 810 810 ACT ACT ATC ATC GGG GGG TAC cue AAA AAA GTT GTT GGA GGA ACA ACA ATG ATG ATT ATT GAG GAG ATC ATC CCC CCC AGG AGG GCA GCA GCT GTC 2501 2501 1255 1255 Thr Thr lle Ile Gly Gly Tyr Tyre Lys 815 Lys 815 Val Wave Gly Gly Thr Thr Met meth lle 820 Ile 820 Glu Goofy lle Ile Pro Pro Arg Arg Ala B25 That B25 Ala That CTG CTG GTG GTG GCT GTC GAT NECK GAG GAG ATA THREAD GCG GCG GAG GAG CAG CAG GCT GTC GAA GAA . TTC . TTC ! TTC ! TTC TCC TCC TTC TTC GGA GGA 2549 2549 Leu Lion Val Wave Ala That Asp Asp Glu Goofy lle Ile Ala That Glu Goofy Gin Gin Ala That Glu Goofy . Phe . Phew i Phe and Phe : Ser : Ser Phe Phew Gly Gly

ί Iи I

RO 118451 ΒRO 118451 B

830 830 835 840 835 840 ACG AAC GAC CTG ACG ACG AAC GAC CTG ACG 1260 CAG ATG ACC TTT GGG TAC AGC AGG GAT GAT GTG 2597 1260 CAG ATG ACC TTT GGG TAC AGC AGG GAT GAT GTG 2597 Thr Asn Asp Leu Thr Thr Asn Asp Leu Thr Gin Met Thr Phe Gly Tyr Ser Arg Asp Asp Val Gin Met Thr Phe Gly Tyr Ser Arg Asp Asp Val 845 845 850 855 850 855 GGA AAG TTC ATT CCC GGA AAG TTC ATT CCC GTT CAT CTT GCT CAG GGC ATC CTC CAA CAT GAC 2645 1265 GTT CAT CTT GCT CAG GGC ATC CTC CAA CAT GAC 2645 1265 Gly Lys Phe lle Pro Gly Lys Phe lle Pro Val His Leu Ala Gin Gly lle Leu Gin His Asp Val His Leu Ala Gin Gly lle Leu Gin His Asp 860 860 865 870 865 870 CCC TTC GAG GTC CTG CCC TTC GAG GTC CTG GAC CAG AGG GGA GTG GGC GAG CTG GTG AAG TTT 2693 1270 GAC CAG AGG GGA GTG GGC GAG CTG GTG AAG TTT 2693 1270 Pro Phe Glu Val Leu Pro Phe Glu Val Leu Asp Gin Arg Gly Val Gly Glu Leu Val Lys Phe Asp Gin Arg Gly Val Gly Glu Leu Val Lys Phe 875 875 880 885 890 880 885 890 GCT ACA GAG AGG GGC GCT ACA GAG AGG GGC CGC AAA GCT AGG CCI AAC TTG AAG GTG GGC ATT 2741 1275 CGC AAA GCT AGG CCI AAC TTG AAG GTG GGC ATT 2741 1275 Ala Thr Glu Arg Gly Ala Thr Glu Arg Gly Arg Lys Ala Arg Pro Asn teu Lys Val Gly lle Arg Lys Ala Arg Pro Asn teu Lys Val Gly lle 895 895 900 905 900 905 TOT GGA GAA CAC GGT ALL GGA GAA CAC GGT GGA GAG CCT TCG TCT GTG GCC TTC TTC GCG AAG 2789 GGA GAG CCT TCG TCT GTG GCC TTC TTC GCG AAG 2789 Cys Gly Glu His Gly Cys Gly Glu His Gly Gly Glu Pro Ser Ser Val Ala Phe Phe Ala Lys 1280 Gly Glu Pro Ser Ser Val Ala Phe Phe Ala Lys 1280 910 910 915 920 915 920 GCT GGG CTG GAT TTC GCT GGG CTG GAT TTC TCT TGC TCC CCT TTC AGG GTT CCG ATT GCT 2837 TCT TGC TCC CCT TTC AGG GTT CCG ATT GCT 2837 Ala Gly Leu Asp Phe Ala Gly Leu Asp Phe Val Ser Cys Ser Pro Phe Arg Val Pro lle Ala 1285 Val Ser Cys Ser Pro Phe Arg Val Pro lle Ala 1285 925 925 930 935 930 935 AGG CTA GCT GCA GCT AGG CTA GCT GCA GCT CAG GTG CTT GTC TGAGGCTGCC TCCTCG 2880 CAG GTG CTT GTC TGAGGCTGCC TCCTCG 2880 Arg Leu Ala Ala Ala Arg Leo Ala Ala Ala Gin Val teu Val 1290 Gin Val your Val 1290 940 940 945 945

SECV ID NR: 3SEQ ID NO: 3

LUNGIMEA SECVENȚEI: 261oSEQUENCE LENGTH: 261o

TIPUL SECVENȚEI; acid nucleicSEQUENCE TYPE; nucleic acid

BESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

RO 118451 ΒRO 118451 B

GAATTCTCAA TCCTTTGCTC ATCGCAGCAT ATCAATGTTA ACACATAAAC TTTAGGAGGA 60GAATTCTCAA TCCTTTGCTC ATCGCAGCAT ATCAATGTTA ACACATAAAC TTTAGGAGGA 60

AGAAAACTT ATG GCA AAA TGG GTT TAT AAG TTC GAA GAA GGC AAT GCA TCT 111AGAAAACTT ATG GCA AAA TGG GTT TAT AAG TTC GAA GAA GGC AAT GCA TCT 111

Met Ala Lys Trp Val Tyr Lys Phe Glu Glu Gly Asn Ala SerMet Ala Lys Trp Val Tyr Lys Phe Glu Glu Gly Asn Ala Ser

1 1 5 5 10 10 ATG ATG AGA AGM AAC AAC CTT CTT CTT CTT GGA GGA GGC AAA GGC AAA GGC GGC TGC TGC AAC AAC CTT CTT GCA GCA GAG ATG ACC GAG ATG ACC 159 159 Met meth Arg Arg Aan Aan Leu Lion Leu Lion Gly Gly Gly Lys Gly Lys Gly Gly Cya . Cya. Asn Assn Leu . Lion. Ala That Glu : Glu: Met meth Thr Thr 15 15 20 20 25 25 30 30 ATC ATC TTA TTA GGA ATG GGA ATG CCG GCC ATT ATT CCA Approx CAG CAG GGC GGC TTT TTT ACT ACT GTA GTA ACA ACA THAT THAT THAT GAA GAA GCT GTC 207 207 Xle Xle Leu Lion Gly Gly Met meth Pro Pro Ile authorities Pro Pro Gin Gin Gly Gly Phe Phew Thr Thr Val Wave Thr Thr Thr Thr Glu Goofy Ala That 35 35 40 40 45 45 TGT TGT ACA ACA GAG GAG TAC cue TAC cue AAC AAC AGT AGT GGA AAA GGA AAA CAG CAG ATC ATC ACA ACA CAG CAG GAA ATT GAA ATT CAG CAG 255 255 Cya Cya Thr Thr Glu Goofy Tyr Tyre Tyr Tyre Aan Aan Ser Serum Gly Gly Lya Lya Gin Gin Ile authorities Thr Thr Gin Gin Glu Goofy Ile authorities Gin Gin 50 50 55 55 60 60 GAT NECK CAG CAG ATT ATT TTC TTC GAA GAA GCT GTC ATC ATC ACA ACA TGG TGG TTA TTA GAG GAG GAA GAA CTG CTG AAC AAC GGC GGC AAG AAG 303 303 Aap Aap Gin Gin Ile authorities Phe Phew Glu Goofy Ala That Ile authorities Thr Thr Trp Trp Leu Lion Glu Goofy Glu Goofy Leu Lion Asn Assn Gly Gly Lys Lys £5 £5 70 70 75 75 AAG AAG TTC TTC GGC GGC GAC GAC ACT ACT GAA GAA GAT NECK CCG GCC TTA TTA TTA TTA GTA GTA TCT TCT GTA GTA CGT CGT TCC TCC GCG GCG 351 351 Lys Lys Phe Phew Gly Aap Gly Aap Thr Thr Glu Goofy Aap Aap Pro Pro Leu Lion Leu Lion Val Wave Ser Serum Val Wave Arg Arg Ser Serum Ala That 80 80 85 85 90 90 GCC GCC CGC CGC GCA GCA TCC TCC ATG ATG CCG GCC GGT GGT ATG ATG ATG ATG GAT NECK ACC acc ATC ATC CTG CTG AAC AAC CTT CTT GGT GGT 399 399 Ala That Arg Arg Ala That Ser Serum Met meth Pro Pro Gly Gly Met meth Met meth Asp Asp Thr Thr Ile authorities Leu Lion Asn Assn Leu Lion Gly Gly 95 95 100 100 105 105 110 110 TTA TTA AAC AAC GAC GAC GTT GTT GCA GCA GTA GTA GAG GAG GGC GGC TTT TTT GCA GCA AAG AAG AAA AAA ACG ACG GGA GGA AAT AAT CCA Approx 447 447 Leu Lion Aan Aan Asp Asp Val Wave Ala That Val Wave Glu Goofy Gly Gly Phe Phew Ala That Lys Lys Lys Lys Thr Thr Gly Gly Asn Assn Pro Pro 115 115 120 120 125 125 AGA AGM TTT TTT GCA GCA TAT DAD GAT NECK TCT TCT TAC cue AGA AGM AGA AGM . TTT . TTT ATC ATC : CAG : CAG - ATG - ATG TAT DAD TCC TCC GAC GAC 495 495

Arg Phe Ala Tyr Asp Ser Tyr Arg Arg Phe Ile Gin Met Tyr Ser AapArg Phe Ala Tyr Asp Ser Tyr Arg Arg Phe Ile Gin Met Tyr Ser Aap

RO 118451 ΒRO 118451 B

130130

135135

140140

13401340

GTA GTA GTT GTT ATG ATG GAA GAA GTT GTT CCG GCC AAG AAG TCC TCC CAT HOW TTC TTC GAG GAG AAA AAA ATC . ATC. ATC ATC GAT NECK GCG GCG 543 543 Val Wave Val Wave Met meth Glu Goofy Val Wave Pro Pro Lys Lys Ser Serum His His Phe Phew Glu Goofy Lys Lys Ile authorities Ile authorities Asp . Asp. Ala That 145 145 150 150 155 155 1345 1345 ATG ATG AAA AAA GAA GAA GAA GAA AAG AAG GGC GGC GTT GTT CAC CaCl TTC TTC GAT NECK ACA ACA GAC GAC CTG CTG ACT ACT GCC GCC GAT NECK 591 591 Met meth Lys Lys Glu Goofy Glu Goofy Lys Lys Gly Gly Val Wave His His Phe Phew Asp Asp Thr Thr Asp Asp Leu Lion Thr Thr Ala That Asp Asp 160 160 165 165 170 170 1350 1350 GAT NECK TTA TTA AAA AAA GAG GAG CTG CTG GCT GTC GAG GAG AAG AAG TTC TTC AAA AAA GCT GTC GTT GTT TAC cue AAA AAA GAG GAG GCT GTC 639 639 Asp Asp Leu Lion Lys Lys Glu Goofy Leu Lion Ala That Glu Goofy Lys Lys Phe Phew Lys Lys Ala That Val Wave Tyr Tyre Lys Lys Glu Goofy Ala That 175 175 180 180 185 185 190 190 ATG ATG AAC AAC GGC GGC GAA GAA GAG GAG TTC TTC CCA Approx CAG CAG GAG GAG CCG GCC AAG AAG GAT NECK CAG CAG TTA TTA ATG ATG GGC GGC 687 687 1355 1355 Met meth Asn Assn Gly Gly Glu Goofy Glu Goofy Phe Phew Pro Pro Gin Gin Glu Goofy Pro Pro Lys Lys Asp Asp Gin Gin Leu Lion Met meth Gly Gly 195 195 200 two hundred 205 205 GCT GTC GTT GTT AAA AAA GCA GCA GTT GTT TTC TTC CGT CGT TCC TCC TGG TGG GAC GAC AAC AAC CCT CCT CGT CGT GCA GCA ATC ATC GTA GTA 735 735 1360 1360 Ala That Val Wave Lys Lys Ala That Val Wave Phe Phew Arg Arg Ser Serum Trp Asp Trp Asp Asn Assn Pro Pro Arg Arg Ala That Ile authorities Val Wave 210 210 215 215 220 220 TAC cue CGC CGC CGT CGT ATG ATG AAC AAC GAT NECK ATC ATC CCT CCT GGA GGA GAC GAC TGG TGG GGT GGT ACT ACT GCA GCA GTT GTT AAC AAC 783 783 1365 1365 Tyr Arg Arg Tyr Arg Arg Met meth Asn Assn Asp Asp Ile authorities Pro Pro Gly Asp Gly Asp Trp Trp Gly Gly Thr Thr Ala That Val Wave Asn Assn 225 225 230 230 235 235 GTT GTT CAG CAG ACC acc ATG ATG GTA GTA TTT TTT GGT GGT AAC AAC AAG AAG GGC GGC GAG GAG ACC acc AGC AGC GGT GGT ACA ACA GGC GGC 831 831 Val Wave Gin Gin Thr Thr Met meth Val Wave Phe Phew Gly Gly Asn Assn Lys Lys Gly Gly Glu Goofy Thr Thr Ser Serum Gly Gly Thr Thr Gly Gly 1370 1370 240 240 245 245 250 250 GTT GTT GCC GCC TTC TTC ACA ACA CGT CGT AAC AAC CCA Approx TCC TCC ACA ACA GGT GGT GAA GAA AAA AAA GGC GGC ATC ATC TAC cue GGT GGT 879 879 Val Wave Ala That Phe Phew Thr Thr Arg Arg Asn Assn Pro Pro Ser Serum Thr Thr Gly Gly Glu Goofy Lys Lys Gly Gly Ile authorities Tyr Tyre Gly Gly 1375 1375 255 255 260 260 265 265 270 270 GAG GAG TAC cue CTG CTG ATC ATC AAT AAT GCA GCA CAG CAG GGC GGC GAG GAG GAC GAC GTA GTA . GTT . GTT GCA GCA l GGT l GGT GTC GTC CGC CGC 927 927

Glu Tyr Leu Ile Asn Ala Gin Gly Glu Asp Val Val Ala Gly Val ArgGlu Tyr Leu Ile Asn Ala Gin Gly Glu Asp Val Val Ala Gly Val Arg

13801380

RO 118451 ΒRO 118451 B

275275

280280

285285

ACA ACA CCA Approx CAG CAG CCT CCT ATC ATC ACC acc CAG CAG TTA TTA GAG . GAG. AAC AAC GAT NECK ATG ATG CCT CCT GAC GAC TGC TGC TAC cue 975 975 1385 1385 Thr Thr Pro Pro Gin Gin Pro Pro lle Ile Thr Thr Gin Gin Leu Lion Glu . Goofy. Asn Assn Asp Asp Met meth Pro Pro Asp Asp Cys Cys Tyr Tyre 290 290 295 295 300 300 AAG AAG CAG CAG TTC TTC ATG ATG GAT NECK CTG CTG GCC GCC ATG ATG AAG AAG CTG CTG GAG GAG AAA AAA CAT HOW TTC TTC CGT CGT GAC GAC 1023 1023 Lys Lys Gin Gin Phe Phew Met meth Asp Asp Leu Lion Ala That Met meth Lys Lys Leu Lion Glu Goofy Lys Lys His His Phe Phew Arg Asp Arg Asp 1390 1390 305 305 310 310 315 315 ATG ATG CAG CAG GAT NECK ATG ATG GAG GAG TTC TTC ACA ATC ACA ATC GAG GAG GAA GAA GGT GGT AAA AAA TTA TTA TAC cue TTC TTC TTA TTA 1071 1071 Met meth Gin Gin Asp Asp Met meth Glu Goofy Phe Phew Thr Thr lle Ile Glu Goofy Glu Goofy Gly Gly Lys Lys Leu Lion Tyr Tyre Phe Phew Leu Lion 1395 1395 320 320 325 325 330 330 CAG CAG ACA ACA CGT CGT AAC AAC GGC GGC AAG AAG AGA AGM ACA ACA GCT GTC CCG GCC GCT GTC GCT GTC CTT CTT CAG CAG ATT ATT GCC GCC 1119 1119 Gin Gin Thr Thr Arg Arg Asn Assn Gly Gly Lys Lys Arg Arg Thr Thr Ala That Pro Pro Ala That Ala That Leu Lion Gin Gin lle Ile Ala That 1400 1400 335 335 340 340 345 345 350 350 TGC TGC GAT NECK TTA TTA GTA GTA GAC GAC GAA GAA GGC GGC ATG ATG ATC ATC ACA ACA GAG GAG GAA GAA GAG GAG GCT GTC GTT GTT GTA GTA 1167 1167 Cya Asp Cya Asp Leu Lion Val Wave Asp Asp Glu Goofy Gly Gly Met meth lle Ile Thr Thr Glu Goofy Glu Goofy Glu Goofy Ala That Val Wave Val Wave 355 355 360 360 365 365 1405 1405 AGA AGM ATC ATC GAA GAA GCA GCA AAA AAA TCT TCT CTT CTT GAT NECK CAG CAG TTA TTA CTT CTT CAC CaCl CCG GCC ACC acc TTC TTC AAC AAC 1215 1215 Arg Arg lle Ile Glu Goofy Ala That Lys Lys Ser Serum Leu Lion Asp Asp Gin Gin Leu Lion Leu Lion His His Pro Pro Thr Thr Phe Phew Asn Assn 370 370 375 375 380 380 1410 1410 CCG GCC GCT GTC GCT GTC TTA TTA AAG AAG GCC GCC GGC GGC GAA GAA GTA GTA ATC ATC GGT GGT TCC TCC GCT GTC CTT CTT CCG GCC GCA GCA 1263 1263 Pro Pro Ala That Ala That Leu Lion Lys Lys Ala That Gly Gly Glu Goofy Val Wave lle Ile Gly Gly Ser Serum Ala That Leu Lion Pro Pro Ala That 385 385 390 390 395 395 1415 1415 TCT TCT CCT CCT GGC GGC GCA GCA GCA GCA GCA GCA GGT GGT AAA AAA GTA GTA TAC cue TTC TTC ACC acc GCT GTC GAT NECK GAG GAG GCT GTC 1311 1311 Ser Serum Pro Pro Gly Gly Ala That Ala That Ala That Gly Gly Lys Lys Val Wave Tyr Tyre Phe Phew Thr Thr Ala That Asp Asp Glu Goofy Ala That 400 400 405 405 410 410 AAG AAG GCT GTC GCC GCC CAC CaCl GAG GAG AAG AAG GGT GGT GAG GAG AGA AGM GTT GTT ATC ATC CTT CTT GTT GTT CGT CGT CTT CTT GAG GAG 1359 1359 1420 1420 Lys Lys Ala That Ala That His His Glu Goofy Lys Lys Gly Gly Glu Goofy Arg Arg Val Wave lle Ile Leu Lion Val Wave Arg Arg Leu Lion Glu Goofy

RO 118451 ΒRO 118451 B

415 415 420 420 425 425 430 430 ACA. ACA. TCT TCT CCG GCC GAA GAA GAT NECK ATC ATC GAA GAA GGT . GGT. ATG । ATG । CAT HOW GCA i GCA i GCC < GCC < GAA GAA GGT . GGT. ATC । ATC CTG CTG 1407 1407 Thr Thr Ser Serum Pro Pro Glu Goofy Asp Asp lle Ile Glu Goofy Gly : Gly: Met meth His . His. Ala ; Alas; Ala < Ala < Glu Goofy Gly Gly lle Ile Leu Lion 1425 1425 435 435 440 440 445 445 ACA ACA GTG GTG CGC CGC GGC GGC GGT GGT ATG ATG ACA ACA AGC AGC CAT HOW GCA GCA GCC GCC GTA GTA GTT GTT GCA GCA CGT CGT GGT GGT 1455 1455 Thr Thr Val Wave Arg Gly Arg Gly Gly Gly Met meth Thr Thr Ser Serum His His Ala That Ala That Val Wave Val Wave Ala That Arg Arg Gly Gly 1430 1430 450 450 455 455 460 460 ATG ATG GGA GGA ACA ACA TGC TGC TGC TGC GTA GTA TCC TCC GGA GGA TGC TGC GGT GGT GAG GAG ATC ATC AAG AAG ATC ATC AAC AAC GAA GAA 1503 1503 Met meth Gly Gly Thr Thr Cys Cys Cys Cys Val Wave Ser Serum Gly Gly Cys Cys Gly Gly Glu Goofy lle Ile Lys Lys lle Ile Asn Assn Glu Goofy 1435 1435 465 465 470 470 475 475 GAA GAA GCT GTC AAG AAG ACA ACA TTC TTC GAA GAA CTT CTT GGC GGC GGA GGA CAC CaCl ACA ACA TTT TTT GCA GCA GAG GAG GGA GGA GAT NECK 1551 1551 G1U G1U Ala That Lys Lys Thr Thr Phe Phew Glu Goofy Leu Lion Gly Gly Gly Gly His His Thr Thr Phe Phew Ala That Glu Goofy Gly Asp Gly Asp 480 480 485 485 490 490 1440 1440 TAC cue ATC ATC TCC TCC TTA TTA GAT NECK GGT GGT TCC TCC ACA ACA GGT GGT AAG AAG ATT ATT TAC cue AAG AAG GGC GGC GAC GAC ATC ATC 1599 1599 Tyr Tyre lle Ile Ser Serum Leu Lion Asp Asp Gly Gly Ser Serum Thr Thr Gly Gly Lys Lys lle Ile Tyr Tyre Lys Lys Gly Asp Gly Asp lle Ile 495 495 500 500 505 505 510 510 1445 1445 GAG GAG ACT ACT CAG CAG GAA GAA CGT CGT TCC TCC GTA AGC GTA AGC GGA GGA AGC AGC TTC TTC GAG GAG CGT CGT ATC ATC ATG ATG GTA GTA 1647 1647 Glu Goofy Thr Thr Gin Gin Glu Goofy Arg Arg Ser Serum Val Wave Ser Serum Gly Gly Ser Serum Phe Phew Glu Goofy Arg Arg lle Ile Met meth Val Wave 515 515 520 520 525 525 1450 1450 TGG TGG GCT GTC GAC GAC AAG AAG TTC TTC AGA AGM ACA ACA TTA TTA AAG AAG GTT GTT CGT CGT ACA ACA AAT AAT GCC GCC GAC GAC ACA ACA 1695 1695 Trp Trp Ala That Asp Asp Lys Lys Phe Phew Arg Arg Thr Thr Leu Lion Lys Lys Val Wave Arg Arg Thr Thr Asn Assn Ala That Asp Asp Thr Thr 530 530 535 535 540 540 CCG GCC GAA GAA GAT NECK AGA AGM CTC CTC AAT AAT GCC GCC GTT GTT AAA AAA CTG CTG GGT GGT GCA GCA GAG GAG GGC GGC : ATC : ATC GGT GGT 1743 1743 1455 1455 Pro Pro Glu Goofy Asp Asp Thr Thr Leu Lion Asn Assn Ala That Val Wave Lys Lys Leu Lion Gly Gly Ala That Glu Goofy i Gly lle and Gly lle Gly Gly 545 545 550 550 555 555 CTT CTT TGC TGC CGT CGT ACA ACA GAG GAG CAT HOW ATG ATG TTC TTC TTC TTC GAG GAG f GCT f GCT GAC GAC : AGJ : AGJ L ATC ATG AAG L ATC ATG AAG 1791 1791 1460 1460 Leu Lion Cys Cys Arg Arg Thr Thr Glu Goofy His His Met meth Phe Phew Phe Phew : Glu : Goofy i Ala take to Asp Asp ' Arc ' Spring [ lle [lle s Met Lys s Met Lys

RO 118451 ΒRO 118451 B

1465 1465 560 560 565 565 570 570 ATC ATC AGA AGM AAG AAG ATG ATC ATG ATC CTT CTT TCC TCC GAT NECK TCA TCA GTG GTG GAA GAA GCA . GCA. AGA AGM GAA GAA GAG GAG GCT GTC 1839 1839 lle Ile Arg Arg Lys Lys Met meth lle Ile Leu Lion Ser Serum Asp Asp Ser Serum Val Wave G1U G1U Ala ; Alas; Arg Arg Glu Goofy Glu . Goofy. Ala That 575 575 580 580 585 585 590 590 1470 1470 CTG CTG AAC AAC GAA GAA TTA TTA ATC ATC CCG GCC TTC TTC CAG CAG AAG AAG GGC GGC GAT NECK TTC . TTC. AAG AAG GCT GTC ATG ATG TAC cue 1887 1887 Leu Lion Asn Assn Glu Goofy Leu Lion lle Ile Pro Pro Phe Phew Gin Gin Lys Lys Gly Asp Gly Asp Phe Phew Lys Lys Ala That Met meth Tyr Tyre 595 595 600 600 605 605 1475 1475 AAA AAA GCT GTC CTG CTG GAA GAA GGC GGC AGG AGG CCA ATG CCA ATG ACG ACG GTT GTT CGC CGC TAC cue CTG CTG GAT NECK CCG GCC CCG GCC 1935 1935 Lys Lys Ala That Leu Lion Glu Goofy Gly Arg Gly Arg Pro Pro Met meth Thr Thr Val Wave Arg Arg Tyr Tyre Leu Lion Asp Asp Pro Pro Pro Pro 610 610 615 615 620 620 1480 1480 CTG CTG CAT HOW GAG GAG TTC TTC GTT GTT CCT CCT CAT HOW ACA ACA GAA GAA GAG GAG GAG GAG CAG CAG GCT GTC GAA GAA CTG CTG GCT GTC 1983 1983 Leu Lion His His Glu Goofy Phe Phew Val Wave Pro Pro His His Thr Thr Glu Goofy Glu Goofy Glu Goofy Gin Gin Ala That Glu Goofy Leu Lion Ala That 625 625 630 630 635 635 AAG AAG AAC AAC ATG ATG GGC GGC CTT CTT ACT ACT TTA TTA GCA GCA GAA GAA GTA GTA AAA AAA GCA GCA AAA AAA GTT GTT GAC GAC GAA GAA 2031 2031 1485 1485 Lys Lys Asn Assn Met meth Gly Gly Leu Lion Thr Thr Leu Lion Ala That Glu Goofy Val Wave Lys Lys Ala That Lys Lys Val Wave Asp Asp Glu Goofy 640 640 645 645 650 650 TTA TTA CAC CaCl GAG GAG TTC TTC AAC AAC CCA Approx ATG ATG ATG ATG GGC GGC CAT HOW CGT CGT GGC GGC TGC TGC CGT CGT CTT CTT GCA GCA 2079 2079 1490 1490 Leu Lion His His Glu Goofy Phe Phew Asn Assn Pro Pro Met meth Met meth Gly Gly His His Arg Arg Gly Gly Cys Cys Arg Arg Leu Lion Ala That 655 655 660 660 665 665 670 670 GTT GTT ACC acc TAT DAD CCG GCC GAA GAA ATT ATT GCA GCA AAG AAG ATG ATG CAG CAG ACA ACA AGA AGM GCC GCC GTT GTT ATG ATG GAA GAA 2127 2127 1495 1495 Val Wave Thr Thr Tyr Tyre Pro Pro Glu Goofy lle Ile Ala That Lys Lys Met meth Gin Gin Thr Thr Arg Arg Ala That Val Wave Met meth Glu Goofy 675 675 680 680 685 685 GCT GTC GCT GTC ATC ATC GAA GAA GTG GTG AAG AAG GAA GAA GAG GAG ACA ACA GGA GGA ATC ATC GAT NECK ATT ATT GTT GTT CCT CCT GAG GAG 2175 2175 1500 1500 Ala That Ala That lle Ile Glu Goofy Val Wave Lys Lys Glu Goofy Glu Goofy Thr Thr Gly Gly Xle Xle Asp Asp lle Ile Val Wave Pro Pro Glu Goofy 690 690 695 695 700 700 ATC ATC ATG ATG ATT ATT CCG GCC TTA TTA GTT GTT GGC GGC GAG GAG AAG AAG AAA AAA . GAG . GAG ) CTT ) CTT AAG AAG i TTC and TTC ; GTT ; GTT 1 AAG 1 AAG 2223 2223

lle Met lle Pro Leu Val Gly Glu Lys Lys Glu Leu Lys Phe Val Lyslle Met lle Pro Leu Val Gly Glu Lys Lys Glu Leu Lys Phe Val Lys

RO 118451 ΒRO 118451 B

705 705 710 710 715 715 GAC GAC GTA GTT GTG GAA GTA GTA GTT GTG GAA GTA GCT GAG CAG i GCT GAG CAG i GTT AAG AAA GAG AAA GGT TCC 2271 GTT AAG AAA GAG AAA GGT TCC 2271 Asp Asp Val Val Val Glu Val Wave Wave Wave Glu Wave Ala Glu Gin ' Ala Glu Gin' Val Lys Lys Glu Lys Gly Ser Val Lys Lys Glu Lys Gly Ser 720 720 725 725 730 730 GAT NECK ATG CAG TAC CAC ATC ATG CAG TAC CAC ATC GGT ACC ATG GGT ACC ATG ATC GAA ATT CCT CGT GCA GCT 2319 ATC GAA ATT CCT CGT GCA GCT 2319 Asp Asp Met Gin Tyr His lle Met Gin Tyr His lle Gly Thr Met Gly Thr Met lle Glu lle Pro Arg Ala Ala lle Glu lle Pro Arg Ala Ala 735 735 740 740 745 750 745 750 CTC CTC ACA GCA GAT GCC ATC ACA GCA GAT GCC ATC GCT GAG GAA GCT GAG GAA GCA GAG TTC TTC TCC TTC GGT 2367 GCA GAG TTC TTC TCC TTC GGT 2367 Leu Lion Thr Ala Asp Ala lle Thr Ala Asp Ala lle Ala Glu Glu Alas Glu Glu Ala Glu Phe Phe Ser Phe Gly Ala Glu Phe Phe Ser Phe Gly 755 755 760 765 760 765 ACA ACA AAC GAC TTA ACA CAG AAC GAC TTA ACA CAG ATG ACA TTC ATG ACA TTC GGC TTC TCC CGT GAC GAC GCC 2415 GGC TTC TCC CGT GAC GAC GCC 2415 Thr Thr Asn Asp Leu Thr Gin Asn Asp Leu Thr Gin Met Thr Phe Met Thr Phe Gly Phe Ser Arg Asp Asp Ala Gly Phe Ser Arg Asp Asp Ala 770 770 775 775 780 780 GGC GGC AAG TTC CTG GAT TCC AAG TTC CTG GAT TCC TAC TAT AAA SHUT TAT AAAA GCA AAA ATT TAT GAG TCC GAT 2463 GCA AAA ATT TAT GAG TCC GAT 2463 Gly Gly Lys Phe Leu Asp Ser Lys Phe Leu Asp Ser Tyr Tyr Lys Tyr Tyr Lys Ala Lys lle Tyr Glu Ser Asp Ala Lys lle Tyr Glu Ser Asp 785 785 790 790 795 795 CCA Approx TTC GCA AGA CTT GAC TTC GCA AGM CTT GAC CAG ACA GGC CAG ACA GGC GTT GGC CAG TTA GTA GAG ATG 2511 GTT GGC CAG TTA GTA GAG ATG 2511 Pro Pro Phe Ala Arg Leu Asp Phe Ala Arg Leu Asp Gin Thr Gly Gin Thr Gly Val Gly Gin Leu Val Glu Met Val Gly Gin Leu Val Glu Met 800 800 805 805 810 810 GCA GCA GTT AAG AAA GGC CGT GTT AAG AAA GGC CGT CAG ACA CGT CAG ACA CGT CCG GGC CTT AAG TGC GGC ATC 2559 CCG GGC CTT AAG TGC GGC ATC 2559 Ala That Val Lys Lys Gly Arg Wave Lys Lys Gly Arg Gin Thr Arg Gin Thr Arg Pro Gly Leu Lys Cys Gly lle Pro Gly Leu Lys Cys Gly lle 815 815 820 820 825 830 825 830 TGC TGC GGC GAG CAC GGC GAG GGC GAG CAC GGC GAG ATC CTT CTT ATC CTT CTT CCG TAGAGTTCTG CCACAAAGTA 2609 CCG TAGAGTTCTG CCACAAAGTA 2609 Cys Cys Gly Glu His Gly Glu Gly Glu His Gly Glu lle Leu Leu lle Leo Leo . Pro . Pro

835 840835 840

15051505

15101510

15151515

15201520

15251525

15301530

15351535

15401540

26102610

RO 118451 ΒRO 118451 B

SECV ID NR: 4SEQ ID NO: 4

LUNGIMEA SECVEJTEIE: 2722SEQUEL LENGTH: 2722

TIPUL SECVENȚEI: acid nuci eicSEQUENCE TYPE: nut eic acid

DESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

GAACTATTTA AGGAATTTGT AAGAATTTAG AGTTCATTCA GATAATA ATG CAA AGA Met Gin Arg GAACTATTTA AGGAATTTGT AAGAATTTAG AGTTCATTCA GATAATA ATG CAA AGA Met Gin Arg 56 56 1 1 GTA TAI GCT TTC GAA GAT GGT GAT GGA ACC AAC AAG . GTA TAI GCT TTC GAA GAT GGT GAT GGA ACC AAC AAG . AAA CTC CTT GGA AAA CTC CTT GGA 104 104 Val Tyr Ala Phe Glu Asp Gly Asp Gly Thr Asn Lys Val Tyr Ala Phe Glu Asp Gly Asp Gly Thr Asn Lys Lys Leu Leu Gly Lys Leu Leu Gly 5 10 15 GGA AAG GGA GCT GGA CTT TGC ACA ATG ACA AAA ATT 5 10 15 GGA AAG GGA GCT GGA CTT TGC ACA ATG ACA AAA ATT GGA CTT CCA GTT GGA CTT CCA GTT 152 152 Gly Lys Gly Ala Gly Leu Cys Thr Met Thr Lys Ile Gly Lys Gly Ala Gly Leu Cys Thr Met Thr Lys Ile Gly Leu Pro Val Gly Leo Pro Val 20 25 30 CCA CAA GGA TTT GTT ATT ACA ACT GAA ATG TOT AAA 20 25 30 CCA CAA GGA TTT GTT ATT ACA ACT GAA ATG TOT AAA 35 CAA TTC ATT GCT 35 CAA TTC ATT GCT 200 two hundred Pro Gin Gly Phe Val Xle Thr Thr Glu Met cys Lys Pro Gin Gly Phe Val Xle Thr Thr Glu Met cys Lys Gin Phe Xle Ala Gin Phe Xle Ala 40 45 AAT GGA AAC AAA ATG CCA GAA GGA TTA ATG GAA GAA 40 45 AAT GGA AAC AAA ATG CCA GAA GGA TTA ATG GAA GAA 50 GTT AAA AAA GAA 50 GTT AAA AAA GAA 248 248 Asn Gly Asn Lys Met Pro Glu Gly Leu Met Glu Glu Asn Gly Asn Lys Met Pro Glu Gly Leu Met Glu Glu Val Lys Lys Glu Val Lys Lys Glu 55 60 TAT CAA TTA GTT GAA AAG AAA TCA GGA AAA OTC TTT 55 60 TAT CAA TTA GTT GAA AAG AAA TCA GGA AAA OTC TTT 65 GGA GGA GAA GAA 65 GGA GGA GAA GAA 296 296 Tyr Gin Leu Val Glu Lys Lys Ser Gly Lys Val Phe Tyr Gin Leu Val Glu Lys Lys Ser Gly Lys Val Phe Gly Gly Glu Glu Gly Gly Glu Glu 70 75 AAT CCA CTT CTT GTT TCA GTC AGA TCA GGA GCT GCT 70 75 AAT CCA CTT CTT GTT TCA GTC AGA TCA GGA GCT GCT 80 ATG TCT ATG CCA 80 ATG TCT ATG CCA 344 344 Asn Pro Leu Leu Val Ser Val Arg Ser Gly Ala Ala Asn Pro Leu Leu Val Ser Val Arg Ser Gly Ala Ala Met Ser Met Pro Met Serum Met Pro 85 90 95 GGT ATG ATG GAT ACT ATT CTT AAT CTT GGA CTT AAT 85 90 95 GGT ATG ATG GAT ACT ATT CTT AAT CTT GGA CTT AAT GAT AAA ACT GTT GAT AAA ACT GTT 392 392

Gly Met Met Asp Thr Ile Leu Asn Leu Gly Leu Asn Asp Lys Thr ValGly Met Met Asp Thr Ile Leu Asn Leu Gly Leu Asn Asp Lys Thr Val

RO 118451 ΒRO 118451 B

100 100 105 105 110 110 115 115 GTT GTT GCT GTC CTT CTT GCT GTC AAA AAA TTA . VAT. ACC j ACC j MC j MC j hAT । hAT । GAA . GAA. AGA AGM TTT · TTT · 3CA TAT 1 3CA TAT 1 GAT 1 NECK 1 TCA TCA 440 440 1590 1590 Val Wave Ala That Leu Lion Ala That Lys Lys Leu Lion Thr ; Thr; han ; inn; Aan ' Aan' Glu Goofy Arg Arg Phe Ala Tyr Aap Phe Ala Tyr Aap Ser Serum 120 120 125 125 130 130 TAC cue AGA AGM AGA AGM TTT TTT GTT GTT TCC TCC CTC CTC TTC TTC GGA GGA AAG AAG ATT ATT GCT GTC CTT j CTT j hAT hAT GCT GTC TGT TGT 488 488 1595 1595 Tyr Tyre Arg Arg Arg Arg Phe Phew Val Wave Ser Serum Leu Lion Phe Phew Gly Gly Lya Lya Ile authorities Ala That Leu ; Leo; han . inn Ala That Cys Cys 135 135 140 140 145 145 GAT NECK GAA GAA GTT GTT TAT DAD GAT NECK AAG AAG ACT ACT CTT CTT GAA GAA AAC AAC AAA AAA AAA AAA GTT GTT GAA GAA AAG AAG GGA GGA 536 536 Asp Asp Glu Goofy Val Wave Tyr Tyre Aap Aap Lys Lys Thr Thr Leu Lion Glu Goofy Aan Aan Lya Lya Lys Lys Val Wave Glu Goofy Lya Lya Gly Gly 1600 1600 150 150 155 155 160 160 GTT GTT AAA AAA TTA TTA GAT NECK ACT ACT GAA GAA TTA TTA GAT NECK GCT GTC AAT AAT GAT NECK ATG AAA ATG AAA GAA GAA CTT CTT GCA GCA 584 584 Val Wave Lys Lys Leu Lion Aap Aap Thr Thr Glu Goofy Leu Lion Aap Aap Ala That Asn Assn Aap Aap Met meth Lya Lya Glu Goofy Leu Lion Ala That 1605 1605 165 165 170 170 175 175 CAA CAA GTC GTC TTC TTC ATT ATT AAA AAA AAG AAG ACT ACT GAA GAA GAA GAA TTC TTC ACT ACT AAA AAA CAA CAA CCA Approx TTC TTC CCA Approx 632 632 Gin Gin Val Wave Phe Phew Ile authorities Lys Lys Lya Lya Thr Thr Glu Goofy Glu Goofy Phe Phew Thr Thr Lys Lys Gin Gin Pro Pro Phe Phew Pro Pro 1610 1610 180 180 185 185 190 190 195 195 GTT GTT GAT NECK CCA Approx TAT DAD GCT GTC CAA CAA TTA TTA GAA GAA TTT TTT GCC GCC ATT ATT TGT TGT GCT GTC GTA GTA TTC TTC AGA AGM 680 680 Val Wave Aap Aap Pro Pro Tyr Tyre Ala That Gin Gin Leu Lion Glu Goofy Phe Phew Ala That ile authorities Cys Cys Ala That Val Wave Phe Phew Arg Arg 200 two hundred 205 205 210 210 1615 1615 TCA TCA TGG TGG ATG ATG GGA GGA AAG AAG AGA AGM GCT GTC GTT GTT GAT NECK TAC cue AGA AGM AGA AGM GAA GAA TTC TTC AAG AAG ATT ATT 728 728 Ser Serum Trp Trp Met meth Gly Gly Lya Lya Arg Arg Ala That Val Wave Aap Aap Tyr Arg Tyr Arg Arg Arg Glu Goofy Phe Phew Lys Lys Ile authorities 215 215 220 220 225 225 1620 1620 ACT ACT CCA Approx GAA GAA CAA CAA GCT GTC GAT NECK GGA GGA ACT ACT GCT GTC GTT GTT TCA TCA . GTT . GTT GTT GTT TCT TCT ATG ATG i GTT and GTT 776 776 Thr Thr Pro Pro Glu Goofy Gin Gin Ala That Aap Aap Gly Gly Thr Thr Ala That Val Wave Sex Sex Val Wave Val Wave Ser Serum Met meth Val Wave 230 230 235 235 240 240 1625 1625 TAT DAD GGT GGT AAT AAT ATG ATG GGT GGT AAT AAT GAT NECK TCA TCA . GCT . GTC 1 ACI 1 HERE * GGl * GGl ’ GTT ’ GTT ' TGT TGT TTC TTC ! ACI ! needles ' AGA AGM 824 824 Tyr Tyre Gly Gly Asn Assn Met meth Gly Gly Asn Assn Aap Aap Ser Serum Ala That . Thr Glj . Thr Glj r Val r Wave . Cya . Cya Phe Thi Phe Thi : Arg : Arg

RO 118451 ΒRO 118451 B

1630 1630 245 245 250 250 255 255 GAT i THROAT i CCA GGA AGA CCA GGA AGM GGA GAA AAT GGA GAA AAT ATG TTC TTC GGA GAA TAT CTT AAG AAT 872 ATG TTC TTC GGA GAA TAT CTT AAG AAT 872 Asp Asp Pro Gly Thr Pro Gly Thr Gly Glu Asn Gly Glu Asn Met Phe Phe Gly Glu Tyr Leu Lys Asn Met Phe Phe Gly Glu Tyr Leu Lys Asn 260 260 265 265 270 275 270 275 16351635 GCA GCA CAA GGA GAA CAA GGA GAA GAT GTT GTT THROAT GTT GTT GCT GGT ATT AGA ACA CCA CAA ATT ATT 920 GCT GGT ATT AGA ACA CCA CAA ATT ATT 920 Ala That Gin Gly Glu Gin Gly Glu Asp Val Val Asp Wave Wave Ala Gly Ile Arg Thr Pro Gin Ile Ile Ala Gly Ile Arg Thr Pro Gin Ile Ile 280 280 285 290 285 290 1640 TCA 1640 TCA AAG ATG GCA AAG ATG GCA GAA GAT CGA GAA GAT CGA GAT CTT CCA GGT TGC TAT GAA CAA CTT 968 GAT CTT CCA GGT TGC TAT GAA CAA CTT 968 Ser Serum Lys Met Ala Lys Met Ala Glu Asp Arg Glu Asp Arg Asp Leu Pro Gly Cys Tyr Glu Gin Leu Asp Leu Pro Gly Cys Tyr Glu Gin Leu 295 295 300 305 300 305 CTT 1645 CTT 1645 GAT ATT AGA NECK ATT AGA AAG AAA TTA AAG AAA TTA GAA GGA TAT TTC CAT GAA GTA CAA GAC 1016 GAA GGA TAT TTC CAT GAA GTA CAA GAC 1016 Leu Lion Asp Ile Arg Asp Ile Arg Lys Lys Leu Lys Lys Leo Glu Gly Tyr Phe His Glu Val Gin Asp Glu Gly Tyr Phe His Glu Val Gin Asp 4 4 310 310 315 320 315 320 TTT TTT GAA TTC ACT GAA TTC ACT ATT GAA AGA ATT GAA AGA AAG AAA CTT TAC ATG CTC CAA ACT AGA 1064 AAG AAA CTT TAC ATG CTC CAA ACT AGA 1064 j 1650 n.j 1650 n. Glu Phe Thr Glu Phe Thr Ile Glu Arg Ile Glu Arg Lys Lys Leu Tyr Met Leu Gin Thr Arg Lys Lys Leu Tyr Met Leu Gin Thr Arg 325 325 330 330 335 335 AAT AAT GGA AAG ATG GGA AAG ATG AAT GCA ACT AAT GCA ACT GCT ACT GTC AGA ACA GGA GTT GAT ATG 1112 GCT ACT GTC AGA ACA GGA GTT GAT ATG 1112 1655 1655 Gly Lys Met Gly Lys Met Asn Ala Thr Asn Ala Thr Ala Thr Val Arg Thr Gly Val Asp Met Ala Thr Val Arg Thr Gly Val Asp Met 340 340 345 345 350 355 350 355 ί GTT и GTT GAA GAA GGA GAA GAA GGA CTT ATT ACA CTT ATT ACA AAA GAA CAA GCC ATT ATG AGA ATT GCA 1160 AAA GAA CAA GCC ATT ATG AGA ATT GCA 1160 Val Wave Glu Glu Gly Glu Glu Gly Leu Ile Thr Leo Ile Thr Lys Glu Gin Ala Ile Met Arg Ile Ala Lys Glu Gin Ala Ile Met Arg Ile Ala 1660 1660 360 360 365 370 365 370 CCA Approx CAA TCA GTT CAA TCA GTT GAT CAA TTA NECK CAA TTA CTT CAT AAG AAT ATG CCA GCT AAT TAT 1208 CTT CAT AAG AAT ATG CCA GCT AAT TAT 1208 Pro Pro Gin Ser Val Gin Ser Val Asp Gin Leu Asp Gin Leo Leu His Lys Asn Met Pro Ala Asn Tyr Leu His Lys Asn Met Pro Ala Asn Tyr 1665 1665 375 375 380 385 380 385 GCA GCA GAA GCT CCA GAA GCT CCA TTA GTT AAA TTA GTT AAA GGA CTT CCA GCA TCA CCA GGA GCT GCT 1256 GGA CTT CCA GCA TCA CCA GGA GCT GCT 1256 Ala That Glu Ala Pro Glu Ala Pro Leu Val Lys Leo Val Lys Gly Leu Pro Ala Ser Pro Gly Ala Ala Gly Leu Pro Ala Ser Pro Gly Ala Ala

RO 118451 ΒRO 118451 B

390 395390 395

ACA GGA ACA GGA GCT GTT GTT TTT GAT GCC GAT GAT GCA GTT GAA CAA GCT AAA GCT GTT GTT TTT GAT GCC GAT GAT GCA GTT GAA CAA GCT AAA 1304 1304 Thr Thr Gly Gly Ala That Val Wave Val Wave Phe Phew ASp ASp Ala That Asp Asp . Asp Asp Ala That Val Wave Glu Goofy Gin Gin Ala That Lys Lys 405 405 410 410 415 415 1675 1675 GGA GGA AAG AAG AAA AAA GTT GTT CTT CTT CTT CTT CTT CTT AGA AGM GAA GAA GAA GAA ACT ACT AAA AAA CCA Approx GAA GAA GAT NECK ATT ATT 1352 1352 Gly Gly Lys Lys Lys Lys Val Wave Leu Lion Leu Lion Leu Lion Arg Arg Glu Goofy Glu Goofy Thr Thr Lys Lys Pro Pro Glu Goofy Asp Asp Ile authorities 420 CAT 420 HOW GGA GGA TTC TTC TTT TTT GTT GTT 425 GCT 425 GTC GAA GAA GGT GGT ATT ATT TTA TTA 430 ACC 430 acc TGC TGC AGA AGM GGA GGA GGA GGA 435 AAA 435 AAA 1400 1400 1680 1680 His His Gly Gly Phe Phew Phe Phew Val Wave Ala That Glu Goofy Gly Gly Ile authorities Leu Lion Thr Thr Cys Cys Arg Arg Gly Gly Gly Gly Lys Lys 440 440 445 445 450 450 ACA ACA TCA TCA CAC CaCl GCA GCA GCT GTC GTC GTC GTT GTT GCT GTC AGA AGM GGT GGT ATG ATG GGT GGT AAA AAA CCA Approx TGT TGT GTT GTT 1448 1448 1685 1685 Thr Thr Ser Serum His His Ala That Ala That Val Wave Val Wave Ala That Arg Arg Gly Gly Met meth Gly Gly Lys Lys Pro Pro Cys Cys Val Wave 455 455 460 460 465 465 TCA TCA GGA GGA GCT GTC GAA GAA GGA GGA ATT ATT AAA AAA GTT GTT GAT NECK GTT GTT GCT GTC AAG AAG AAA ATT AAA ATT GCT GTC AAG AAG 1496 1496 1690 1690 Ser Serum Gly Gly Ala That Glu Goofy Gly Gly Ile authorities Lys Lys Val Wave Asp Asp Val Wave Ala That Lys Lys Lys Lys Ile authorities Ala That Lys Lys 470 470 475 475 480 480 ATT Ile ATT authorities GGA Gly GGA Gly AGC Ser AGC Serum CTT Leu CTT Lion GAA Glu GAA Goofy GTT Val GTT Wave CAT His HOW His GAA Glu GAA Goofy GGA GAT Gly Asp GGA THROAT Gly Asp ATT Ile ATT authorities TTA Leu TTA Lion ACT Thr ACT Thr ATT Ile ATT authorities GAT Asp NECK Asp GGA Gly GGA Gly 1544 1544 1695 1695 485 485 490 490 495 495 TCA TCA ACT ACT GGA GGA TGT TGT GTC GTC TAT DAD AAG AAG GGA GGA GAA GAA GTT GTT CCA Approx TTA TTA GAA GAA GAA GAA CCA Approx CAA CAA 1592 1592 Ser Serum Thr Thr Gly Gly Cys Cys Val Wave Tyr Tyre Lys Lys Giy Gly Glu Goofy Val Wave Pro Pro Leu Lion Glu Goofy Glu Goofy Pro Pro Gin Gin 1700 1700 500 500 505 505 510 510 515 515 GTT GTT GGA GGA TCA TCA GGA GGA TAT DAD TTC TTC GGA GGA ACC acc ATC ATC TTA TTA AAA AAA TGG TGG GCC GCC AAT AAT GAA GAA ATT ATT 1640 1640 Val Wave Gly Gly Ser Serum Gly Gly Tyr Tyre Phe Gly Phe Gly Thr Thr Ile authorities Leu Lion Lys Lys Trp Trp Ala That Asn Assn Glu Goofy Ile authorities 1705 1705 520 520 525 525 530 530 AAA AAA AAG AAG ATT ATT GGA GGA GTT GTT TTT TTT GCT GTC GCT GTC GGA GGA GAT NECK CTT CTT CCA Approx . TCA . TCA . GCT . GTC GCT GTC AAG AAG 1688 1688 Lys Lys Lys Lys Ile authorities Gly Gly Val Wave Phe Phew Ala That Ala That Gly Gly Asp Asp Leu Lion Pro Pro Ser Serum Ala That Ala That Lys Lys 1710 1710

RO 118451 ΒRO 118451 B

535 535 540 540 545 545 AAA GCC AAA GCC CTT GAA TTT GGA GCT CTT GAA TTT GGA GCT GAA GGT ATT GGA CTT 1 GAA GGT ATT GGA CTT 1 TGC AGA ACT GAA 1736 TGC AGA ACT GAA 1736 Lys Ala Lys Ala Leu Glu Phe Gly Ala Leo Glu Phe Gly Ala Glu Gly Xle Gly Leu Glu Gly Xle Gly Leu Cys Arg Thr Glu Cys Arg Thr Glu 550 550 555 555 560 560 CGT ATG CGT ATG TTC AAT GCA GTT GAA TTC AAT GCA GTT GAA AGA CTT CCA ATT GTT AGM CTT CCA ATT GTT GTC AAG ATG ATT 1784 GTC AAG ATG ATT 1784 Arg Net Arg Net Phe Asn Ala Val Glu Phe Asn Ala Val Glu Arg Leu Pro Ile Val Arg Leo Pro Ile Val Val Lys Met Ile Val Lys Met Ile 565 565 570 570 575 575 CIT TCA CIT TCA AAT ACC CTT GAA GAA AAT ACC CTT GAA GAA AGA AAG AAA TAT CTT AGM AAG AAA TAT CTT AAT GAA CTT ATG 1832 AAT GAA CTT ATG 1832 Leu Ser Leo Ser Asn Thr Leu Glu Glu Asn Thr Leu Glu Glu Arg Lys Lys Tyr Leu Arg Lys Lys Tyr Leu Asn Glu Leu Met Asn Glu Leu Met 5Θ0 5Θ0 585 585 590 590 595 595 CCA CTT CCA CTT CAA AAA CAA GAT TTC CAA AAA CAA GAT TTC ATT GGA TTA TTG AAG ATT GGA TTA TTG AAG ACT ATG AAT GGA 1880 ACT ATG AAT GGA 1880 Pro Leu Pro Leo Gin Lys Gin Asp Phe Gin Lys Gin Asp Phe Ile Gly Leu Leu Lys Ile Gly Leu Leu Lys Thr Met Asn Gly Thr Met Asn Gly 600 600 605 605 610 610 CTT CCA CTT CCA GTC ACT GTC AGA CTT GTC ACT GTC AGM CTT CTT GAT CCA CCA TTA CTT GAT CCA CCA TTA CAT GAA TTC CTC 1928 CAT GAA TTC CTC 1928 Leu Pro Leo Pro Val Thr Val Arg Leu Wave Thr Wave Arg Leo Leu Asp Pro Pro Leu Leo Asp Pro Leo His Glu Phe Leu His Glu Phe Leo 615 615 620 620 625 625 CCA ACT CCA ACT CTT GAA GAG TTA ATG CTT GAA GAG TTA ATG AGA GAA ATC TTT GAA AGM GAA ATC TTT GAA ATG AAA CTT TCA 1976 ATG AAA CTT TCA 1976 Pro Thr Pro Thr Leu Glu Glu Leu Met Leo Glu Glu Leo Met Arg Glu Ile Phe Glu Arg Glu Ile Phe Glu Met Lys Leu ser Met Lys Leu ser 630 630 635 635 640 640 GGT AAG GGT AAG ACT GAA GGA CTT GCA ACT GAA GGA CTT GCA GAA AAA GAA GTT GTT GAA AAA GAA GTT GTT CTT AAG AAA GTT 2024 CTT AAG AAA GTT 2024 Gly Lya Gly Lya Thr Glu Gly Leu Ala Thr Glu Gly Leu Ala Glu Lys Glu Val Val Glu Lys Glu Val Val Leu Lys Lys Val Leo Lys Lys Val 645 645 650 650 655 655 AAA GAA AAA GAA CTT ATG GAA GTT AAT CTT ATG GAA GTT AAT CCA ATG ATT GGA CAC CCA ATG ATT GGA CAC AGA GGA ATT AGA 2072 AGA GGA ATT AGA 2072 Lys Glu Lys Glu Leu Met Glu Val Asn Leo Met Glu Val Asn Pro Met Ile Gly His Pro Met Ile Gly His ; Arg Gly Ile Arg ; Arg Gly Ile Arg 660 660 665 665 670 670 675 675 CTT GGA CTT GGA ACT ACT AAT CCA GAA ACT ACT AAT CCA GAA ATT TAT ATGATT TAT ATG L ATT AGA GCA TTC 2120 L ATT AGA GCA TTC 2120

Leu Gly Thr Thr Asn Pro Glu Ile Tyr Glu Met Gin Ile Arg Ala PheLeu Gly Thr Thr Asn Pro Glu Ile Tyr Glu Met Gin Ile Arg Ala Phe

RO 118451 ΒRO 118451 B

680 680 685 685 690 690 TTA TTA GAA GAA GCT GTC ACT ACT CGT CGT GAA GAA GTT GTT ATT ATT AAG AAG GAA GAA GGA ATT . GGA ATT. AAC AAC GAT NECK CAT HOW CGA CGA 2168 2168 Leu Lion Glu Goofy Ala That Thr Thr Arg Arg Glu Goofy Val Wave Ile authorities Lys Lys Glu Goofy Gly Gly Ile . Ile. Asn Assn Asp Asp Hls . Hls. Arg Arg 1755 1755 695 695 700 700 705 705 GAA GAA ATT ATT ATG ATG ATT ATT CCA Approx AAT AAT GTT GTT ACA ACA GAA GAA GTT GTT AAT AAT GAA GAA CTT CTT ATT ATT AAC AAC TTA TTA 2216 2216 Glu Goofy Ile authorities Met meth Ile authorities Pro Pro Asn Assn Val Wave Thr Thr Glu Goofy Val Wave Asn Assn Glu Goofy Leu Lion Ile authorities Asn Assn Leu Lion 1760 1760 710 710 715 715 720 720 AGA AGM AAG AAG AAT AAT GTT GTT CTT CTT GAA GAA CCA Approx GTT GTT CAT HOW GAA GAA GAA GAA GTT GTT GAA GAA AAG AAG AAA AAA TAT DAD 2264 2264 Arg Arg Lys 725 Lys 725 Asn Assn Val Wave Leu Lion Glu Goofy Pro 730 Pro 730 Val Wave His His G1U G1U Glu Goofy Val 735 Wave 735 Glu Goofy Lys Lys Lys Lys Tyr Tyre 1765 1765 GGT GGT ATT ATT AAA AAA GTA GTA CCA Approx TTC TTC TCG TCG TAT DAD GGT GGT ACT ACT ATG ATG GTT GTT GAA GAA TOT ALL GTT GTT AGA AGM 2312 2312 Gly Gly Ile authorities Lys Lys Val Wave Pro Pro Phe Phew Ser Serum Tyr Tyre Gly Gly Thr Thr Met meth Val Wave Glu Goofy Cys Cys Val Wave Arg Arg 740 740 745 745 750 750 755 755 1770 1770 GCA GCA GCA GCA TTA TTA ACA ACA GCT GTC GAT NECK AAG AAG ATT ATT GCT GTC ACA ACA GAA GAA GCT GTC TCA TCA TTC TTC TTC TTC TCA TCA 2360 2360 Ala That Ala That Leu Lion Thr Thr Ala That Asp Asp Lys Lys Ile authorities Ala That Thr Thr Glu Goofy Ala That Ser Serum Phe Phew Phe Phew Ser Serum 760 760 765 765 770 770 1775 1775 TTC TTC GGA GGA ACT ACT AAT AAT GAT NECK CTT CTT ACA ACA CAA CAA GGA GGA ACA ACA TTC TTC TCA TCA TAC cue TCA TCA CGT CGT GAA GAA 2408 2408 Phe Phew Gly Gly Thr Thr Asn Assn Asp Asp Leu Lion Thr Thr Gin Gin Gly Gly Thr Thr Phe Phew Ser Serum Tyr Tyre Ser Serum Arg Arg Glu Goofy GAT NECK TCA TCA GAA GAA 775 AAC 775 AAC AAA AAA TTC TTC ATT ATT CCA Approx 780 AAA 780 AAA TAT DAD GTT GTT GAA GAA CTT CTT 785 AAG 785 AAG ATT ATT CTT CTT 2456 2456 1780 1780 Asp Asp Ser Serum Glu Goofy Asn Assn Lys Lys Phe Phew Ile authorities Pro Pro Lys Lys Tyr Tyre Val Wave Glu Goofy Leu Lion Lys Lys Ile authorities Leu Lion 790 790 795 795 800 800 CCA Approx GCT GTC AAT AAT CCA Approx TTT TTT GAA GAA ATT ATT CTT CTT GAT NECK AGA AGM CCA Approx GGT GGT GTT GTT GGA GGA GAA GAA GTT GTT 2504 2504 1785 1785 Pro Pro Ala That Asn Assn Pro Pro Phe Phew Glu Goofy Ile authorities Leu Lion Asp Asp Arg Arg Pro Pro Gly Gly Val Wave Gly Gly Glu Goofy Val Wave 805 805 810 810 815 815 ATG ATG AGA AGM ATT ATT GCT GTC GTT GTT ACT ACT AAA AAA GGA GGA AGA AGM CAA CAA . ACA . ACA AGA AGM CCA Approx . GAA . GAA . TTA . TTA . CTT . CTT 2552 2552 1790 1790 Met meth Arg Arg Ile authorities Ala That Val Wave Thr Thr Lys Lys Gly Arg Gly Arg Gin Gin Thr Thr Arg Arg Pro Pro ' GlU 'GlU . Leu . Lion Leu Lion

RO 118451 ΒRO 118451 B

820 820 825 825 830 830 835 835 GTT GTT GGT GGT ATT ATT TGT TGT GGA GGA GAA GAA CAC CaCl GGA GGA GGG GGG GAA GAA CCA Approx TCA TCA TCA ATT TCA ATT GAA TGG GAA TGG 2600 2600 Val Wave Gly Gly Ile authorities Cys Cys Gly 840 Gly 840 Glu Goofy His His Gly Gly Gly Gly Glu 845 Goofy 845 Pro Pro Ser Serum Ser Serum Ile authorities Glu Trp 850 Glu Trp 850 TGC TGC CAC CaCl ATG ATG ATT ATT GGA GGA TTG TTG AAC AAC TAT GTT TAT GTT TCA TCA TGT TGT TCT TCT TCA TCA TAC cue AGA ATT AGM ATT 2648 2648 Cys Cys His His Met meth Ile 855 authorities 855 Gly Gly Leu Lion Asn Assn Tyr Val 860 Tyr Val 860 Ser Serum Cys Cys Ser Serum Ser Serum Tyr Arg Ile 865 Tyr Arg Ile 865 CCA Approx GTT GTT GCT GTC AGA AGM ATT ATT GCT GTC GCT GTC GCT CAA GCT CAA GCC GCC CAA CAA ATT ATT AGA AGM CAT HOW CCA AGA CCA AGM 2696 2696 Pro Pro Val Wave Ala That Arg Arg ile authorities Ala That Ala That Ala Gin Alas Gin Ala That Gin Gin Ile authorities Arg Arg His His Pro Arg Pro Arg

870 875 880870 875 880

GAA AAT TAAATTAACT TTTTTGGTTT 2722GAA AAT TAAATTAACT TTTTTGGTTT 2722

Glu AsnGlu Asn

885885

SECV ID NRs 5SEQ ID NOs 5

LUNGIMEA SECVENȚEI: 318o TIPUL SECVENȚEI: acid nucleic DESCRIEREA SECVENȚEI:SEQUENCE LENGTH: 318o SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION:

CTGAAATTCC CGTAATCTAT CATCATTTAC ACCACAAATC GATTCACATC CTCACCGAATCTGAAATTCC CGTAATCTAT CATCATTTAC ACCACAAATC GATTCACATC CTCACCGAAT

AGAAATCAAA TATCATTTAC TCCATCTCAC GATCTCCTTT TGCTATTGCT GATACCTCAAAGAAATCAAA TATCATTTAC TCCATTCCAC GATCTCCTTT TGCTATTGCT GATACCTCAA

TTTCGCAGOT GAAGGCGGAC G TTTCCGAGOT GAAGGCGGAC G ATG ATG AGT AGT TCG TCG TTG TTG TTT TTT CTT CTT GAA GAA GCT GTC ATG ATG CCT CCT Met meth Ser Serum Ser Serum Leu Lion Phe Phew Val Wave Glu Goofy Gly Gly Met meth Pro Pro 1 1 5 5 10 10 CTG CTG AAG AAG TCA TCA GCC GCC AAT AAT GAG GAG TCG TCG TGC TGC TTA TTA CCG GCC GCG GCG AGC AGC CTG CTG AAG AAG CAA CAA CGG CGG Leu Lion Lys Lys Ser Serum Ala That Asn Assn Glu Goofy Ser Serum Cys Cys Leu Lion Pro Pro Ala That Ser Serum Val Wave Lys Lys Gin Gin Arg Arg 15 15 20 20 25 25 CGA CGA ACC acc GGT GGT GAT NECK CTC CTC AGG AGG CGA CGA TTG TTG AAC AAC CAC CaCl CAC CaCl CGT CGT CAA CAA CCG GCC GCG GCG TTT TTT

120120

171171

219219

267267

RO 118451 ΒRO 118451 B

Arg Thr Gly Asp Leu Arg Arg Leu Asn His His Arg Gin Pro Ala Phe Arg Thr Gly Asp Leu Arg Arg Leu Asn His His Arg Gin Pro Ala Phe 1835 1835 30 35 40 30 35 40 GTC val GTC wave CGG GGG ATT TGC CGT CGG AAG TTG ACT GGA GTT AGC AGA ATA GAG Arg Gly lle Cys Arg Arg Lys Leu Ser Gly Val Ser Arg lle Glu 45 50 55 CGG GGG ATT TGC CGT CGG AAG TTG ACT GGA GTT AGC AGA ATA GAG Arg Gly lle Cys Arg Arg Lys Leu Ser Gly Val Ser Arg lle Glu 45 50 55 315 315 1840 1840 TTG Leu TTG Lion CGC ACC GOT GOT TTA ACT CTG CCA CGA GCG GTG CTT AAT CCG GTG Arg Thr Gly Gly Leu Thr Leu Pro Arg Ala Val Leu Asn pro Val 60 65 70 CGC ACC GOT GOT TTA ACT CTG CCA CGA GCG GTG CTT AAT CCG GTG Arg Thr Gly Gly Leu Thr Leu Pro Arg Ala Val Leu Asn pro Val 60 65 70 363 363 1845 1845 TCT Ser 75 TCT Serum 75 CCT CCG GTA ACG ACG ACT AAA AAG AGG GTT TTC ACT TTT GGT AAA Pro Pro Val Thr Thr Thr Lys Lys Arg Val Phe Thr Phe Gly Lys 80 85 90 CCT CCG GTA ACG ACG ACT AAA AAG AGG GTT TTC ACT TTT GGT AAA Pro Pro Val Thr Thr Thr Lys Lys Arg Val Phe Thr Phe Gly Lys 80 85 90 411 411 1850 1850 GGA Gly GGA Gly AAC AGT GAA GGC AAC AAG GAC ATG AAA TCC TTG TTG GGA GGA AAA Asn Ser Glu Gly Asn Lys Asp Met Lys Ser Leu Leu Gly Gly Lys 95 100 105 AAC AGT GAA GGC AAC AAG GAC ATG AAA TCC TTG TTG GGA GGA AAA Asn Ser Glu Gly Asn Lys Asp Met Lys Ser Leu Leu Gly Gly Lys 95 100 105 459 459 GGT Gly GGT Gly GCA AAT CTT GCA GAG ATG GCA AGC ATT GGC CTA TCA GTT CCT CCT Ala Asn Leu Ala Glu Met Ala Ser lle Gly Leu Ser Val Pro Pro 110 115 120 GCA AAT CTT GCA GAG ATG GCA AGC ATT GGC CTA TCA GTT CCT CCT Ala Asn Leu Ala Glu Met Ala Ser lle Gly Leu Ser Val Pro Pro 110 115 120 507 507 1855 1855 GGG Gly GGG Gly CTC ACT ATT TCA ACT GAA GCA TOT GAG GAA TAT CAA CAA AAT GGA Leu Thr lle Ser Thr Glu Ala Cys Glu Glu Tyr Gin Gin Asn Gly 125 130 135 CTC ACT ATT TCA ACT GAA GCA TOT GAG GAA TAT CAA CAA AAT GGA Leu Thr lle Ser Thr Glu Ala Cys Glu Glu Tyr Gin Gin Asn Gly 125 130 135 555 555 1860 1860 AAA AAA AAA CTG CCT CCA GOT TTA TGG GAT GAG ATT CTG GAA GGC TTA CAG AAA CTG CCT CCA GOT TTA TGG GAT GAG ATT CTG GAA GGC TTA CAG 603 603 Lys Lys Lys Leu Pro Pro Gly Leu Trp Asp Glu lle Leu Glu Gly Leu Gin 140 145 150 Lys Leu Pro Pro Gly Leu Trp Asp Glu lle Leu Glu Gly Leu Gin 140 145 150 1865 1865 TAT DAD GTC CAG AAA GAG ATG TCT GCA TCT CTC GGT GAC CCG TCT AAA GCT GTC CAG AAA GAG ATG TCT GCA TCT CTC GGT GAC CCG TCT AAA GCT 651 651 Tyr 155 Tyre 155 Val Gin Lys Glu Met Ser Ala Ser Leu Gly Asp Pro Ser Lys Ala 160 165 170 Val Gin Lys Glu Met Ser Ala Ser Leu Gly Asp Pro Ser Lys Ala 160 165 170 1870 1870 CTC CTC : CTC CTT TCC GTC COT TCG GGT GCT GCC ATA TCG ATG CCT GGT ATG : CTC CTT TCC GTC COT TCG GGT GCT GCC ATA TCG ATG CCT GGT ATG 699 699

RO 118451 ΒRO 118451 B

1875 1875 Leu Lion Leu Lion Leu Lion Ser Serum Val Wave Arg Arg Ser Serum Gly . Gly. Ala That Ala That Ile authorities Ser Met Serum Met Pro Pro Gly : Gly: Met meth 175 175 180 180 185 185 ATG ATG GAC GAC ACT ACT GTA GTA TTG TTG AAT AAT CTC CTC GGG GGG CTT CTT AAT AAT GAT NECK GAG GAG GTC GTC GTA GTA GAT NECK GGT GGT 747 747 1880 1880 Met meth Asp Asp Thr Thr Val Wave Leu Lion Asn Assn Leu Lion Gly Gly Leu Lion Asn Assn Asp Asp Glu Goofy Val Wave Val Wave Asp Asp Gly Gly 190 190 195 195 200 two hundred CTA CTA GCT GTC GCC GCC AAA AAA AGT AGT GGA GGA GCT GTC CGC CGC TTT TTT GCC GCC TAT DAD GAC GAC TCG TCG TAT DAD AGG AGG AGG AGG 795 795 1885 1885 Leu Lion Ala That Ala That Lys Lys Ser Serum Gly Gly Ala That Arg Arg Phe Phew Ala That Tyr Tyre Asp Asp Ser Serum Tyr Arg Tyr Arg Arg Arg 205 205 210 210 215 215 TTT TTT CTA CTA GAT NECK ATG ATG TTT TTT GGC GGC AAC AAC GTT GTT GTA GTA ATG ATG GGT GGT ATC ATC CCA Approx CAT HOW TCG TCG TTA TTA 843 843 Phe Phew Leu Lion Asp Asp Met meth Phe Phew Gly Gly Asn Assn Val Wave Val Wave Met meth Gly Gly Ile authorities Pro Pro His His Ser Serum Leu Lion 1890 1890 220 220 225 225 230 230 TTT TTT GAT NECK GAA GAA AAG AAG TTA TTA GAG GAG CAG CAG ATG ATG AAA AAA GCT GTC GAA GAA AAA AAA GGG GGG ATT ATT CAT HOW CTC CTC 891 891 Phe Phew Asp Asp Glu Goofy Lys Lys Leu Lion Glu Goofy Gin Gin Met meth Lys Lys Ala That Glu Goofy Lys Lys Gly Gly Ile authorities His His Leu Lion 1895 1895 235 235 240 240 245 245 250 250 GAC GAC ACT ACT GAT NECK CTC CTC ACT ACT GCT GTC GCT GTC GAT NECK CTT CTT AAA AAA GAT NECK CTT CTT GCT GTC GAG GAG CAA CAA TAC cue 939 939 Asp Asp Thr Thr Asp Asp Leu Lion Thr Thr Ala That Ala That Asp Asp Leu Lion Lys Lys Asp Asp Leu Lion Ala That Glu Goofy Gin Gin Tyr Tyre 1900 1900 255 255 260 260 265 265 AAG AAG AAC AAC GTG GTG TAT DAD GTG GTG GAA GAA GCA GCA AAG AAG GGC GGC GAA GAA AAG AAG TTT TTT CCC CCC ACA ACA GAT NECK CCA Approx 987 987 Lys Lys Asn Assn Val Wave Tyr Tyre Val Wave Glu Goofy Ala That Lys Lys Gly Gly Glu Goofy Lys Lys Phe Phew Pro Pro Thr Thr Asp Asp Pro Pro 1905 1905 270 270 275 275 280 280 AAG AAG AAA AAA CAG CAG CTA CTA GAG GAG TTA TTA GCA GCA GTG GTG AAT AAT GCG GCG GTT GTT TTT TTT GAT NECK TCT TCT TGG TGG GAC GAC 1035 1035 Lys Lys Lys Lys Gin Gin Leu Lion Glu Goofy Leu Lion Ala That Val Wave Asn Assn Ala That Val Wave Phe Phew Asp Asp Ser Serum Trp Asp Trp Asp 285 285 290 290 295 295 1910 1910 AGC AGC CCA Approx AGG AGG GCC AAT GCC AAT AAG AAG TAC cue AGG AGG AGT AGT ATT ATT AAC AAC : CAG : CAG ATA THREAD . ACT . ACT GGG GGG - TTA - TTA 1083 1083 Ser Serum Pro Pro Arg Arg Ala That Asn Assn Lys Lys Tyr Tyre Arg Arg Ser Serum Ile authorities Asn Assn . Gin . Gin Ile authorities Thr Thr Gly Gly ’ Leu ' Leo 300 300 305 305 310 310 1915 1915 AAG AAG GGG GGG ACC acc GCG GCG GTT GTT AAC AAC ATT ATT CAA CAA . TGC . TGC : ATG : ATG ί GTG GTG ! TTT ! TTT ' GGC 'GGC ! AAC ATG GGG ! AAC ATG GGG 1131 1131

RO 118451 ΒRO 118451 B

Lys Lys Gly Gly Thr . Thr. Ala That Val . Wave. Asn Assn lle Gin < lle Gin < Cys Met 1 Cys Met 1 ral Phe < ral Phe < siy ; yes Asn Met Asn Met Gly Gly 315 315 320 320 325 325 330 330 AAC AAC ACT ACT TCA TCA GGA GGA ACC acc GGT GGT GTT 1 GTT 1 CTT ' CTT' ttc ; ttc ACT J ACT J \GG 1 1 AAC । AAC । CCA Approx AGC ; AGC; ACT ACT GGT GGT 1179 1179 1920 1920 Asn Assn Thr Thr Ser Serum Gly Gly Thr Thr Gly Gly Val Wave Leu Lion Phe 1 Phe 1 Thr J Thr J Arg ; Arg; Asn Assn Pro Pro Ser Thr Serum Thr Gly Gly 335 335 340 340 345 345 GAG GAG AAG AAG AAG AAG CTG CTG TAT DAD GGG GGG GAG GAG TTT TTT TTA TTA GTC AAT GTC AAT GCT GTC CAG CAG GGA GGA GAG GAG GAT NECK 1227 1227 Glu Goofy Lys Lys Lys Lys Leu Lion Tyr Tyre Gly Gly Glu Goofy Phe Phew Leu Lion Val ; Wave; Asn . Assn. Ala That Gin Gin Gly Gly Qlu Qlu Asp Asp 1925 1925 350 350 355 355 360 360 GTT GTT GTT GTT GCT GTC GGG GGG ATC ATC AGA AGM ACA ACA CCA Approx GAA GAA GAT NECK TTG TTG GTG GTG ACC acc ATG ATG GAG GAG ACT ACT 1275 1275 Val Wave Val Wave Ala That Gly Gly lle Ile Arg Arg Thr Thr Pro Pro Glu Goofy Asp Asp Leu Lion Val Wave Thr Thr Met meth Glu Goofy Thr Thr 1930 1930 365 365 370 370 375 375 TGC TGC ATG ATG CCT CCT GAA GAA GCA GCA TAC cue AGA AGM GAG GAG CTT CTT GTG GTG GAG GAG AAC AAC TGT TGT GTG GTG ATT ATT TTA TTA 1323 1323 Cys Cys Met meth Pro Pro Glu Goofy Ala That Tyr Arg Tyr Arg Glu Goofy Leu Lion Val Wave Glu Goofy Asn Assn Cys Cys Val Wave lle Ile Leu Lion 380 380 385 385 390 390 1935 1935 GAG GAG AGA AGM CAC CaCl TAC cue AAA AAA GAT NECK ATG ATG ATG ATG GAT NECK ATT ATT GAA GAA TTC TTC ACA ACA GTT GTT CAA CAA GAA GAA 1371 1371 Glu Goofy Arg Arg His His Tyr Tyre Lys Lys Asp Asp Met meth Met meth Asp Asp lle Ile Glu Goofy Phe Phew Thr Thr Val Wave Gin Gin Glu Goofy 395 395 400 400 405 405 410 410 1940 1940 AAC AAC AGA AGM CTT CTT TGG TGG ATG ATG CTG CTG CAA CAA TGC TGC CGA CGA ACA ACA GGG GGG AAA AAA CGT CGT ACT ACT GGG GGG AAA AAA 1419 1419 Asn Assn Arg Arg Leu Lion Trp Trp Met meth Leu Lion Gin Gin Cys Cys Arg Arg Thr Thr Gly Gly Lys Lys Arg Arg Thr Thr Gly Gly Lys Lys 415 415 420 420 425 425 1945 1945 GGT GGT GCG GCG GTG GTG AGA AGM ATT ATT GCA GCA GTA GTA GAT NECK ATG ATG GTG GTG AAC AAC GAA GAA . GGG . GGG ί CTA и CTA ATT ATT > GAT > THROAT 1467 1467 Gly Gly Ala That Val Wave Arg Arg lle Ile Ala That Val Wave Asp Asp Met meth Val Wave Asn Assn Glu Goofy Gly Gly ' Leu ' Leo lle Ile t Asp t Asp 430 430 435 435 440 440 ACT ACT AGA AGM ACA ACA GCA GCA ATT ATT AAG AAG AGG AGG GTT GTT GAG GAG ACT ACT CAA CAA . CAT . HOW ’ CTA GAT ’ CTA GAT CAC CaCl ) CTT ) CTT 1515 1515 1950 1950 Thr Thr Arg Arg Thr Thr Ala That lle Ile Lys Lys Arg Arg Val Wave Glu Goofy . Thr . Thr Gin Gin i His Let and His Let i Asp and Asp > Gin Leu > Leo Gin 445 445 450 450 455 455

CTT CAT CCA CAG TTT GAG AAT CCG TCT GCT TAC AAA AGC CAT GTG GTA 1563CTT CAT CCA CAG TTT GAG AAT CCG TCT GCT TAC AAA AGC CAT GTG GTA 1563

19551955

RO 118451 ΒRO 118451 B

Leu His 460 Leo His 460 Pro Gin Phe Glu Asn Pro Ser Ala Tyr Lys Ser His Val Val Pro Gin Phe Glu Asn Pro Ser Ala Tyr Lys Ser His Val Val 465 465 470 470 1960 1960 GCA GCA ACC acc GGT GGT TTG TTG CCA Approx GCA GCA TCC TCC CCT CCT GGG 1 GGG 1 SCA SCA GCC GCC GTG GTG GGG GGG CAG CAG GTT ( GTT ( STG STG 1611 1611 Ala That Thr Thr Gly Gly Leu Lion Pro Pro Ala That Ser Serum Pro Pro Gly J Gly J Ma Me Ala That val wave Gly Gly Gin Gin Val ’ wave ITal ITal 475 475 480 480 485 485 490 490 1965 1965 TTC TTC AGC AGC GCA GCA GAG GAG GAT NECK GCT GTC GAA GAA ACA ACA TGG TGG CAT HOW GCA GCA CAA CAA GGA GGA AAG AAG AGT AGT GCT GTC 1659 1659 Phe Phew Ser Serum Ala That Glu Goofy Asp Asp Ala That Glu Goofy Thr Thr Trp Trp His His Ala That Gin Gin Gly Gly Lys Lys Ser . Serum. Ala That 495 495 500 500 505 505 1970 1970 ATC ATC TTG TTG GTA GTA AGG AGG ACT ACT GAA GAA ACA AGC ACA AGC CCA Approx GAA GAA GAT NECK GTT GTT GGT GGT GGT GGT ATG ATG CAT HOW 1707 1707 Ile authorities Leu Lion Val Wave Arg Arg Thr Thr Glu Goofy Thr Thr Ser Serum Pro Pro Glu Goofy Asp Asp Val Wave Gly Gly Gly Gly Met meth His His 510 510 515 515 520 520 GCA GCA GCA GCA GCT GTC GGA GGA ATC ATC TTA TTA ACC acc GCT GTC AGA AGM GGA GGA GGA GGA ATG ATG ACA ACA TCA TCA CAT HOW GCA GCA 1755 1755 1975 1975 Ala That Ala That Ala That Gly Gly Ile authorities Leu Lion Thr Thr Ala That Arg Arg Gly Gly Gly Gly Met meth Thr Thr Ser Serum His His Ala That 525 525 530 530 535 535 GCA GCA GTG GTG GTG GTG GCT GTC CGC CGC GGA GGA TGG TGG GGC GGC AAA AAA TGT TGT TGT TGT GTT GTT TCT TCT GGT GGT TGT TGT GCT GTC 1803 1803 1980 1980 Ala That Val Wave Val Wave Ala That Arg Arg Gly Gly Trp Trp Gly Gly Lys Lys Cys Cys cys cys Val Wave Ser Serum Gly Gly Cys Cys Ala That 540 540 545 545 550 550 GAT NECK ATT ATT CGT CGT GTG GTG AAC AAC GAT NECK GAT NECK ATG ATG AAG AAG GTT GTT TTT TTT ACG ACG ATA THREAD . GGT . GGT GAC GAC CGT CGT 1851 1851 1985 1985 Asp 555 Asp 555 Ile authorities Arg Arg Val Wave Asn Assn Asp 560 Asp 560 Asp Asp Met meth Lys Lys Val Wave Phe 565 Phew 565 Thr Thr Ile authorities Gly Asp Gly Asp Arg 570 Arg 570 GTG GTG ATT ATT AAA AAA GAA GAA GGT GGT GAC GAC TGG TGG CTT CTT TCA TCA CTT CTT ' AAT ' AAT GGT GGT TC» TC» l ACT l ACT 1 GGC 1 GGC GAA GAA 1899 1899 Val Wave Ile authorities Lys Lys Glu Goofy Gly Asp Gly Asp Trp Trp Leu Lion Ser Serum Leu Lion i Asn and Asn . Gly . Gly ’ Sex Sex : Thr Gly : Thr Gly Glu Goofy 1990 1990 575 575 580 580 585 585 GTC GTC ATA THREAD TTG TTG GGT GGT AAA AAA CAG CAG CTA CTA . CTG GCT . CTG GCT ca» that" L CCI L CCI * GCA ATG AGC * GCA ATG AGC : AAT : AAT GAT NECK 1947 1947 Val Wave Ile authorities Leu Lion Gly Gly Lys Lys Gin Gin Leu Lion . Leu . Lion î Ala take to Prc Prc > Prc > Prc > Ala > Alas i Met i Met : Sei : You are : Asn : Asn i Asp and Asp 1995 1995 590 590 595 595 600 600 TTA TTA GAA GAA . ACA . ACA . TTC . TTC ATG TCA ATG TCA l TGG l TGG i GC1 and GC1 ' GAT ' NECK CAA GCA AGC CAA GCA AGC } CGT CTC AAG GTT } CGT CTC AAG GTT 1995 1995

RO 118451 ΒRO 118451 B

Leu Lion Glu Goofy Thr Thr Phe Phew Met meth Ser Serum Trp Trp Ala That Asp Asp Gin 4 Gin 4 Ala , Alas, Arg . Arg. Arg : Arg: Leu : Leo: Lys 1 Lys 1 /al / al 605 605 610 610 615 615 2000 2000 ATG ATG GCA GCA AAT AAT GCA GCA GAC GAC ACA ACA CCT CCT AAT AAT GAT NECK GCA 1 GCA 1 TTA . VAT. ACA ACA GCC 4 GCC 4 AGA ; AGM; AAC ; AAC; AAT AAT 2043 2043 Net Net Ala That Asn Assn Ala That Asp Asp Thr Thr Pro Pro Asn Assn Asp Asp Ala That Leu Lion Thr Thr Ala . Alas. Arg . Arg. Asn 4 Asn 4 Asn Assn 620 620 625 625 630 630 2005 2005 GGT GGT GCA GCA CAA CAA GGG GGG ATC ATC GGA GGA CTC CTC TGT TGT AGA AGM ACT ACT GAA GAA CAT HOW ATG ATG TTT TTT TTC TTC GCT GTC 2091 2091 Gly Gly Ala That Gin Gin Gly Gly Ile authorities Gly Gly Leu Lion Cys Cys Arg Arg Thr Thr Glu Goofy His His Met meth Phe Phew Phe Phew Ala That 635 635 640 640 645 645 650 650 TCT TCT GAC GAC GAG GAG AGG AGG ATC ATC AAA AAA GCT GTC GTA GTA AGA AGM AAG AAG ATG ATG ATC ATC ATG ATG GCG GCG GTC GTC ACT ACT 2139 2139 2010 2010 Ser Serum Asp Asp Glu Goofy Arg Arg Ile authorities Lys Lys Ala That Val Wave Arg Arg Lys Lys Met meth Ile authorities Met meth Ala That Val Wave Thr Thr 655 655 660 660 665 665 CCA Approx GAA GAA CAA CAA AGA AGM AAA AAA GCG GCG GCT GTC CTA CTA GAC GAC CTC CTC TTA TTA CTC CTC CCA Approx TAC cue CAA CAA AGA AGM 2187 2187 2015 2015 Pro Pro G1U G1U Gin Gin Arg Arg Lys Lys Ala That Ala That Leu Lion Asp Asp Leu Lion Leu Lion Leu Lion Pro Pro Tyr Tyre Gin Gin Arg Arg 670 670 675 675 680 680 TCC TCC GAT NECK TTT TTT GAG GAG GGC GGC ATT ATT TTC TTC CGA CGA GCA GCA ATG ATG GAT NECK GGA GGA CTT CTT CCT CCT GTA GTA ACA ACA 2235 2235 2020 2020 Ser Serum Asp Asp Phe Phew Glu Goofy Gly Gly Ile authorities Phe Phew Arg Arg Ala That Met meth Asp Asp Gly Gly Leu Lion Pro Pro Val Wave Thr Thr 685 685 690 690 695 695 ATC ATC CGC CGC CTT CTT CTA CTA GAC GAC CCT CCT CCA Approx CTT CTT CAT HOW GAG GAG TTT TTT CTA CTA CCC CCC GAA GAA GGT GGT GAT NECK 2283 2283 Ile authorities Arg Arg Leu Lion Leu Lion Asp Asp Pro Pro Pro Pro Leu Lion H1S H1S Glu Goofy Phe Phew Leu Lion Pro Pro Glu Goofy Gly Gly Asp Asp 2025 2025 700 700 705 705 710 710 CTA CTA GAA GAA CAC CaCl ATA THREAD GTG GTG AAT AAT GAA GAA CTT CTT ACA ACA GCG GCG GAT NECK ACA ACA GGC GGC ATG ATG AGC AGC AAA AAA 2331 2331 Leu Lion Glu Goofy His His Ile authorities Val Wave Asn Assn Glu Goofy Leu Lion Thr Thr Ala That Asp Asp Thr Thr Gly Gly Met meth Ser Serum Lys Lys 2030 2030 715 715 720 720 725 725 730 730 GAT NECK GAA GAA ATC ATC TAT DAD TCA TCA AGA AGM ATC ATC GAA GAA . AAA . AAA . TTA . TTA TCC TCC GAA GAA . GTG . GTG i AAC and AAC CCT CCT ATG ATG 2379 2379 Asp Asp Glu Goofy Ile authorities Tyr Tyre Ser Serum Arg Arg Ile authorities Glu Goofy Lys Lys Leu Lion Sec dry ' Glu ' Dumb i Val and Val Asn Assn Pro Pro Met meth 2035 2035 735 735 740 740 745 745 1 1 αττ αττ GGT GGT TTC TTC CGT CGT GGT GGT TGC TGC CGA CGA . TT? . TT? i GGG and GGG - ATT - ATT TCÎ TCI l TAC CCC GAG l TAC CCC GAG ) CTA ACA ) CTA ACA 2427 2427

RO 118451 ΒRO 118451 B

2040 2040 Leu Lion Gly Gly Phe Phew Arg Arg Gly Cys Gly Cys Arg Arg Leu Lion Gly Gly lle Ile Ser Serum ryr ryr Pro Glu Pro Glu Leu Thr Leo Thr 750 750 755 755 760 760 GAA GAA ATG ATG CAA CAA GTT GTT CGT GCG CGT GCG ATC ATC TTT TTT CAA CAA GCT GTC GCA GCA GTG GTG TCT ATG TCT ATG AAC AAT AAC AAT 2475 2475 Glu Goofy Met meth Gin Gin Val Wave Arg Ala Arg Ala lle Ile Phe Phew Gin Gin Ala That Ala That Val Wave Ser Serum Met meth Asn Asn Assn Assn 2045 2045 765 765 770 770 775 775 CAG CAG GGG GGG GTG GTG ACT ACT GTA ATA GTA ATA CCA Approx GAG GAG ATC ATC ATG ATG GTT GTT CCG GCC TTA TTA GTC GTC GGA GGA ACA ACA 2523 2523 Gin Gin Gly Gly Val Wave Thr Thr Val lle Wave lle Pro Pro Glu Goofy lle Ile Met meth Val Wave Pro Pro Leu Lion Val Wave Gly Gly Thr Thr 2050 2050 780 780 785 785 790 790 CCT CCT CAG CAG GAA GAA TTA TTA CGG CAT CGG CAT CAA CAA ATC ATC GGC GGC GTA GTA ATT ATT CGT CGT GGT GGT GTA GTA GCT GTC GCA GCA 2571 2571 Pro Pro Gin Gin Glu Goofy Leu Lion Arg His Arg His Gin Gin lle Ile Gly Gly Val Wave lle Ile Arg Arg Gly Gly Val Wave Ala That Ala That 2055 2055 795 795 800 800 805 805 810 810 AAT AAT GTT GTT TTT TTT GCT GTC GAA ATG GAA ATG GGG GGG CTG CTG ACG ACG TTG TTG GAG GAG TAT DAD AAA AAA GTG GTG GGA GGA ACG ACG 2619 2619 Asn Assn Val Wave Phe Phew Ala That Glu Met Glu Met Gly Gly Leu Lion Thr Thr Leu Lion Glu Goofy Tyr Tyre Lys Lys Val Wave Gly Gly Thr Thr 815 815 820 820 825 825 2060 2060 ATG ATG ATT ATT GAG GAG ATT ATT CCT CGA CCT CGA GCT GTC GCT GTC TTG TTG ATT ATT GCT GTC GAT NECK GAG GAG ATT ATT GCA GCA AAA AAA 2667 2667 Met meth lle Ile Glu Goofy lle Ile Pro Arg Pro Arg Ala That Ala That Leu Lion lle Ile Ala That Asp Asp Glu Goofy lle Ile Ala That Lys Lys 830 830 835 835 840 840 2065 2065 GAA GAA GCC GCC GAG GAG TTC TTC TTT TCG TTT TCG TTT TTT GGA GGA ACC acc AAT AAT GAT NECK TTG TTG ACC acc CAG CAG ATG ATG ACA ACA 2715 2715 Glu Goofy Ala That Glu Goofy Phe Phew Phe ser Phew sir Phe Phew Gly Gly Thr Thr Asn Assn Asp Asp Leu Lion Thr Thr Gin Gin Met meth Thr Thr 845 845 850 850 855 855 2070 2070 TTT TTT GGG GGG TAC cue AGC AGC AGA GAT AGA GAT GAT NECK GTT GTT GGC GGC AAG AAG TTT TTT TTG TTG CCG GCC ATT ATT TAT DAD CTT CTT 2763 2763 Phe Phew Gly Gly Tyr Tyre Ser Serum Arg Asp Asp Arg Asp Asp Val Wave Gly Gly Lys Lys Phe Phew Leu Lion Pro Pro lle Ile Tyr Tyre Leu Lion 860 860 865 865 870 870 TCT TCT CAA CAA GGC GGC ATT ATT CTG CAG CTG CAG CAT HOW GAT NECK CCA Approx TTT TTT GAG GAG GTT GTT CTT CTT GAC GAC CAG CAG AAA AAA 2811 2811 2075 2075 Ser Serum Gin Gin Gly Gly lle Ile Leu Gin Leo Gin His His Asp Asp Pro Pro Phe Phew Glu Goofy Val Wave Leu Lion Asp Asp Gin Gin Lys Lys 875 875 880 880 885 885 890 890

GGG GTG GGT CAA TTG ATC AAG ATG GCC ACG GAG AAA GGT CGT GCA GCC 2859GGG GTG GGT CAA TTG ATC AAG ATG GCC ACG GAG AAA GGT CGT GCA GCC 2859

RO 118451 ΒRO 118451 B

Gly Gly Val Wave Gly Gly Gin Gin Leu Lion Ile authorities Lys Lys Met meth Ala That Thr Thr Glu Goofy Lys Lys Gly Arg Gly Arg Ala That Ala That 2080 2080 895 895 900 900 905 905 AAT AAT CCT CCT AAC AAC TTA TTA AAG AAG GTT GTT GGG GGG ATA THREAD TOT ALL GGG GGG GAG GAG CAT HOW GOT Goth GGA GGA GAA GAA CCT CCT 2907 2907 Asn Assn Pro Pro Asn Assn Leu Lion Lys Lys Val Wave Gly Gly Ile authorities Cys Cys Gly Gly Glu Goofy Bis encore Gly Gly Gly Gly Glu Goofy Pro Pro 2085 2085 910 910 915 915 920 920 TCT TCT TCT TCT GTT GTT GCA GCA TTT TTT TTT TTT GAC GAC GGA GGA GTT GTT GGA GGA CTA CTA GAT NECK TAI thy OTG OTG TCG TCG TGC TGC 2955 2955 Ser Serum Ser Serum Val Wave Ala That Phe Phew Phe Phew Asp Asp Gly Gly Val Wave Gly Gly Leu Lion Asp Asp Tyr Tyre Val Wave Ser Serum cys cys 2090 2090 925 925 930 930 935 935 TCT TCT CCA Approx TTC TTC AGG AGG GTT GTT CCT CCT ATC ATC GCA GCA AGG AGG TTG TTG GCC GCC GCT GTC GCA GCA CAA CAA CTC CTC GTT GTT 3003 3003 Ser Serum Pro Pro Phe Phew Arg Arg Val Wave Pro Pro Ile authorities Ala That Arg Arg Leu Lion Ala That Ala That Ala That Gin Gin Val Wave Val Wave 940 940 945 945 950 950 2095 2095

TAAGCTTTGA AAGGAGGATG GCTTATTTGC TTCATCTTTT CCGCCATTCT3056TAAGCTTTGA AAGGAGGATG GCTTATTTGC TTCATCTTTT CCGCCATCT3056

Val 9552100Wave 955 2100

ΑΤΆΤΤΑΤΤΤΤ GGTTTCATCC TTATTOTAAT GCTGAAAATG AACGATGTTT AAACAAAACA 3116ΑΤΆΤΤΑΤΤΤΤ GGTTTCATCC TTATTOTAAT GCTGAAAATG AACGATGTTT AAACAAAACA 3116

ACCCATTATA TTTTGCTTTG OTATGCAATA ATCTACTTTT CAAACAAAAA ΆΑΑΑΑΆΆΑΑΑ 3176ACCCATTATA TTTTGCTTTG OTATGCAATA ATCTACTTTT CAAACAAAAAA ΆΑΑΑΑΆΆΑΑΑ 3176

ΛΛΛΑ3180ΛΛΛΑ3180

21052105

SECV ID NRî 6SEQ ID NO 6

LUNGIMEA SECVENȚEI: 18LENGTH OF SEQUENCE: 18

TIPUL SECVENȚEI: acid nucleic 2110SEQUENCE TYPE: Nucleic acid 2110

DESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

GACGGCTAAA AAGAGGGT 18 GACGGCTAAA AAGAGGGT 18

SECV OI NR: 7 2115SECV OI NO: 7 2115

LUNGIMEA SECVENȚEI: 2oSEQUENCE LENGTH: 2o

TIPUL SECVENȚEI: acid nucleicSEQUENCE TYPE: nucleic acid

RO 118451 ΒRO 118451 B

2120 2120 DESCRIEREA SECVENȚEI: SEQUENCE DESCRIPTION: TATCGAGAAA CCTTGTATAC TATCGAGAAA CCTTGTATAC 2$ $2 2125 2130 2125 2130 SECV ID NR: 8 LUNGIMEA SECVENȚEI: 17 TIPUL SECVENȚE®· acid nucleic DESCRIEREA SECVENȚEI : GTTTTCCCAG TCACGAC SEQ ID NO: 8 LENGTH OF SEQUENCE: 17 SEQUENCE TYPE®· nucleic acid SEQUENCE DESCRIPTION: GTTTTCCCAG TCACGAC 17 17 2135 2135 SECV IDNR: 9 LUNGIMEA SECVENȚEI: 17 SEQ IDNR: 9 LENGTH OF SEQUENCE: 17 TIPUL SECVENȚEI x acid nucleic DESCRIEREA SECVENȚEI : CAGGAAACAG CTATGACSEQUENCE TYPE x nucleic acid SEQUENCE DESCRIPTION: CAGGAAACAG CTATGAC 17 17 2140 2145 2140 2145 SECV ID NR: lo LUNGIMEA SECVENȚEI : 23 TIPUL SECVENȚEI : acid nucleic SEQ ID NO: lo SEQUENCE LENGTH : 23 SEQUENCE TYPE : nucleic acid DESCRIEREA SECVENȚEI: GATATCAATC CGGTGTCTCC TCC SEQUENCE DESCRIPTION: GATATCAATC CGGTGTCTCC TCC 23 2. 3 2150 2150 SECV ID NR; 11 SEQ ID NO; 11 2155 2155 LUNGIMEA SECVENȚEI: 34 TIPUL SECVENȚEI ; acid nucleic DESCRIEREA SECVENȚEI: SEQUENCE LENGTH: 34 SEQUENCE TYPE ; nucleic acid SEQUENCE DESCRIPTION: CGGTGTCTCC TCCGGATATC ACGGCTAAAA AGAG CGGTGTTCCC TCCGGATATC ACGGCTAAAA AGAG 34 34 2160 2160 SECV ID NR: 12 LUNGIMEA SECVENȚEI: 27 SEQ ID NO: 12 SEQUENCE LENGTH: 27 2165 2165 TIPUL SECVENȚEI; acid nucleic DESCRIREA SECVENȚEI: TTGATATCCC GGTTGTCTCC TCCGGTA SEQUENCE TYPE; nucleic acid SEQUENCE DESCRIPTION: TTGATATCCC GGTTGTCTCC TCCGGTA 27 27

RO 118451 ΒRO 118451 B

SECV ID NR; 13SEQ ID NO; 13

LUNGIMEA SECVENȚEI: 2o TIPUL SECVENȚEI; acid nucleic DESCRIEREA SECVENȚEI: GCAGAGATGA TGTTGGCAAGSEQUENCE LENGTH: 2o SEQUENCE TYPE; nucleic acid SEQUENCE DESCRIPTION: GCAGAGATGA TGTTGGCAAG

SECV ID NR: 14SEQ ID NO: 14

LUNGIMEA SECVENȚEI : 2o TIPUL SECVENȚEI : acid nucleic DESCRIEREA SECVENȚEI: CTTGCCAACA TCATCTCTGCSEQUENCE LENGTH: 2o SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION: CTTGCCAACA TCATCTCTGC

SECV ID NR: LSSEQ ID NO: LS

LUNGIMEA SECVENȚEI : 2o TIPUL SECVENȚEI : acid nucleic DESCRIEREA SECVENȚEI;SEQUENCE LENGTH: 2o SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION;

CTCACTGTTC GAAGAGAAGCCTCACTGTTC GAAGAGAAGC

SECV ID NR: 16SEQ ID NO: 16

LUNGIMEA SECVENȚEI ; 23 TIPUL SECVENȚEI ; acid nucleic DESCRIEREA SECVENȚEI: CATATGCTCT GTCCGGCATA ATCSEQUENCE LENGTH ; 23 SEQUENCE TYPE ; nucleic acid SEQUENCE DESCRIPTION: CATATGCTCT GTCCGGCATA ATC

SECV ID NR; 17SEQ ID NO; 17

LUNGIMEA SECVENȚEI: 21 TIPUL SECVENȚEI : acid nucleic DESCRIEREA SECVENȚEI;SEQUENCE LENGTH: 21 SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION;

CTCGAGGGAT CTCAATCATT GCTCGAGGGAT CTCAATCATT G

SECV ID NR: 18SEQ ID NO: 18

LUNGIMEA SECVENȚEI; 18 TIPUL SECVENȚEI ; acid nucleic DESCRIEREA SECVENȚEI:SEQUENCE LENGTH; 18 SEQUENCE TYPE ; nucleic acid SEQUENCE DESCRIPTION:

21702170

2o2o

2o2o

2o2o

21752175

21802180

21852185

21902190

21952195

22002200

22052205

22102210

RO 118451 ΒRO 118451 B

2215 2215 gcaatctstt cagcaatc gcaatctstt cagcaatc 18 18 2220 2220 SECV ID NR; 19 LUNGIMEA SECVENȚEI: 2o SEQ ID NO; 19 SEQUENCE LENGTH: 2o TIPUL SECVENȚEI: acid nucleic DESCRIEREA SECVENȚEI: GCTTCTTTTC CAATCTCATC SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION: GCTTCTTTTC CAATCTCATC 2o 2o 2225 2230 2225 2230 SECV IO NR: 2o LUNGIMEA SECVENȚEI: 18 TIPUL SECVENȚEI : acid nucleic SEQ ID NO: 2o LENGTH OF SEQUENCE: 18 SEQUENCE TYPE: nucleic acid DESCRIEREA SECVENȚEI: CGAAAAGAAA TCGGCTTC SEQUENCE DESCRIPTION: CGAAAAAAA TCGGCTTC 18 18 2235 2235 SECV ID NR: 21 SEQ ID NO: 21 2240 2240 LUNGIMEA SECVENȚEI: 24 TIPUL SECVENȚEI: acid nucleic DESCRIEREA SECVENȚEI: SEQUENCE LENGTH: 24 SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION: GAAAGATAAA TCTGCAAAAA CTTG GAAAGATAAA TCTGCAAAAA CTTG 24 24 2245 2245 SECV ID NR: 22 LUNGIMEA SECVENȚEI: 19 SEQ ID NO: 22 LENGTH OF SEQUENCE: 19 2250 2250 TIPUL SECVENȚEI: acid nucleic DESCRIEREA SECVENȚEI: GCCTTGAGCA AGATAAATC SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION: GCCTTGAGCA AGATAAATC 19 19 2255 2255 SECV ID NR: 23 LUNGIMEA SECVENȚEI: 21 TIPUL SECVENȚEI: acid nucleic DESCRIEREA SECVENȚEI: TTCTGGTCAA TAACCTCAAT G SEQ ID NO: 23 LENGTH OF SEQUENCE: 21 SEQUENCE TYPE: nucleic acid SEQUENCE DESCRIPTION: TTCTGGTCAA TAACCTCAAT G 21 21 2260 2260 SECV IO NR: 24 LUNGIMEA SECVENȚEI; 2o SEQ ID NO: 24 SEQUENCE LENGTH; 2o

RO 118451 ΒRO 118451 B

TIPUL SECVENTcI: acid nucleicSEQUENCE TYPE: nucleic acid

DcSCRIEREA SECVENȚE®;DcWRITING SEQUENCES®;

GCTTAAACAA TGACTTGTGCGCTTAAACAA TGACTTGTGC

3ECV ID NR: 253ECV ID NO: 25

LUNGIMEA SECVENȚEI : 22SEQUENCE LENGTH : 22

TIPUL SECVENȚEI : acid nucleic □ESCRIREA SECVENȚEI:TYPE OF SEQUENCE: nucleic acid □WRITING OF THE SEQUENCE:

CCĂATCTCAT CAGCTATTAA agCCĂATCTCAT CAGCTATTAA ag

SECV ID NR; 26SEQ ID NO; 26

LUNGIMEA SECVENȚEI: 2oSEQUENCE LENGTH: 2o

TIPUL SECVENȚEI: acid nucleicSEQUENCE TYPE: nucleic acid

DESCRIEREA SECVENȚEI;SEQUENCE DESCRIPTION;

GCTTCTTTTG CAATCTCTTCGCTTCTTTTG CAATCTCTTC

SECV IO NR: 27SEQ ID NO: 27

LUNGIMEA SECVENȚEI: 18LENGTH OF SEQUENCE: 18

TIPUL SECVENȚEI : acid nucleicSEQUENCE TYPE: nucleic acid

DESCRIEREA SECVENȚEI»SEQUENCE DESCRIPTION»

CGAAAAGAAC TCAGCTTCCGAAAAAAGAC TCAGCTTC

SECV ID NR; 28SEQ ID NO; 28

LUNGIMEA SECVENȚEI; 23SEQUENCE LENGTH; 2. 3

TIPUL SECVENȚEI; acid nucleicSEQUENCE TYPE; nucleic acid

DESCRIEREA SECVENȚEI:SEQUENCE DESCRIPTION:

CAAGATAAAT CGGCAAAAAC TTGCAAGATAAAT CGGCAAAAC TTG

SECV ID NR: 29SEQ ID NO: 29

LUNGIMEA SECVENȚEI; 23SEQUENCE LENGTH; 2. 3

TIPUL SECVENȚEI ; acid nucleicSEQUENCE TYPE ; nucleic acid

DESCRIEREA SECVENȚEI;SEQUENCE DESCRIPTION;

GAATGCCTTG AGAAAGATAA ATCGAATGCCTTG AGAAAGATAA ATC

22652265

2o2o

22702270

22752275

22802280

2o2o

22852285

22902290

22952295

23002300

23052305

23102310

Claims (7)

RO 118451 ΒRO 118451 B SECV 1D NR; 3o LUNGIMEA SECVENȚEI; 24SEQ 1D NO; 3o SEQUENCE LENGTH; 24 2315 TIPUL SECVENȚEI; acid nucleic DESCRIE.ÎEA SECVENȚEI; CTTTCTGGTC AAGAACCTCA ÂATG 242315 SEQUENCE TYPE; nucleic acid DESCRIBE THE SEQUENCE; CTTTCTGGTC AAGAACCTCA ÂATG 24 2320 SECV ID NR; 31 LUNGIMEA SECVENȚEI; 2o2320 SEQ ID NO; 31 SEQUENCE LENGTH; 2o 2325 TIPUL SECVENȚEI; acid nucleic DESCRIEREA SECVENȚEI; GCTTAAACAA CGACTTGTGC 2o2325 SEQUENCE TYPE; nucleic acid SEQUENCE DESCRIPTION; GCTTAAACAA CGACTTGTGC 2o 2330 Revendicări 1. Polipeptidă cu activitate piruvat ortofosfat dikinazică, stabilă la temperaturi joase, caracterizată prin aceea că aceasta conține secvența aminoacidă de la restul aminoacid 832 până la restul aminoacid 955, al secvenței aminoacide nr. 5, cu excepția faptului că unul2330 demand 1. Polypeptide with pyruvate orthophosphate dikinase activity, stable at low temperatures, characterized in that it contains the amino acid sequence from amino acid residue 832 to amino acid residue 955, of amino acid sequence no. 5, except that one 2335 sau mai multe resturi aminoacide sunt adăugate, deletate sau substituite, aceasta conferind stabilitate la temperaturi joase polipeptidei.2335 or more amino acid residues are added, deleted or substituted, this conferring stability at low temperatures to the polypeptide. 2. Polipeptidă conform revendicării 1, caracterizată prin aceea că regiunea secvenței aminoacide a piruvat ortofosfat dikinazei, ce corespunde secvenței aminoacide dintre resturile aminoacide 832 și 955 ale secvenței aminoacide nr. 5 este substituită cu secvența2. Polypeptide according to claim 1, characterized in that the region of the pyruvate orthophosphate dikinase amino acid sequence corresponds to the amino acid sequence between amino acid residues 832 and 955 of amino acid sequence no. 5 is substituted with the sequence 2340 aminoacidă nr. 5, cu excepția faptului că unul sau mai mulți aminoacizi sunt adăugați, deletați sau substituiți, aceasta conferind stabilitate la temperaturi joase polipeptidei.2340 amino acid no. 5, except that one or more amino acids are added, deleted or substituted, thereby conferring low temperature stability to the polypeptide. 3. Polipeptidă conform revendicării 1, caracterizată prin aceea că are aceeași secvență aminoacidă cu secv. nr. 5, cu excepția faptului că restul aminoacid 869 este substituit cu prolină.3. Polypeptide according to claim 1, characterized in that it has the same amino acid sequence as seq. no. 5, except that amino acid residue 869 is substituted with proline. 23452345 4. Polipeptidă conform revendicării 1, caracterizată prin aceea că are aceeași secvență aminoacidă cu secv. nr.4. Polypeptide according to claim 1, characterized in that it has the same amino acid sequence as seq. no. 5, cu excepția faptului că resturile aminoacide 885 și 952 sunt substituite cu leucină și respectiv cu valină. 5. ADN donat, ce codifică polipeptidă definită la revendicarea 1, caracterizat prin aceea că are secvența nucleotidică prezentată în secv. nr. 5, cu excepția faptului că una sau5, except that amino acid residues 885 and 952 are substituted with leucine and valine, respectively. 5. Donated DNA, which encodes the polypeptide defined in claim 1, characterized in that it has the nucleotide sequence shown in seq. no. 5, except that one or 2350 mai multe nucleotide sunt adăugate, deletate sau substituite.2350 more nucleotides are added, deleted or substituted. 6. Vector recombinant, conținând ADN-ul definit la revendicarea 5, caracterizat prin aceea că are o structură prezentată în fig. 1 sau 3 și care poate să exprime în celula gazdei, o polipeptidă având activitate piruvat ortofosfat dikinazică, stabilă la temperaturi joase.6. Recombinant vector, containing the DNA defined in claim 5, characterized in that it has a structure shown in fig. 1 or 3 and which can express in the host cell, a polypeptide having pyruvate orthophosphate dikinase activity, stable at low temperatures. 7. Procedeu de transformare a unei plante de Flaveria bidentis sau porumb, caracte-7. Process of transformation of a Flaveria bidentis or corn plant, character- 2355 rizat prin aceea că acesta constă în introducerea, în genomul acestora, a unui ADN definit la revendicarea 5, în calitate de agent transformator, prin intermediul vectorului recombinant definit la revendicarea 6. Președintele comisiei de examinare: chim. Hăulică Mariela Examinator: biochim. Cretu Adina 92355 characterized in that it consists in the introduction, in their genome, of a DNA defined in claim 5, as a transforming agent, by means of the recombinant vector defined in claim 6. The president of the examination committee: chem. Haulica Mariela Examiner: biochem. Adina thought 9
RO96-00667A 1994-07-29 1995-05-30 Polypeptide with dikinase orthophosphate pyruvate activity, dna encoding the same, recombinant vector with said dna and process for modifying a plant by means of said vector RO118451B1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP19778094 1994-07-29
PCT/JP1994/002022 WO1995015385A1 (en) 1993-12-03 1994-12-01 Polypeptide having cold-resistant pyruvate phosphate dikinase activity, dna coding for the same, and recombinant vector and transformed plant both containing said dna
PCT/JP1995/001040 WO1996004369A1 (en) 1994-07-29 1995-05-30 Polypeptide having cold-resistant pyruvate phosphate dikinase activity, dna coding for the polypeptide, recombinant vector containing the dna, and transformed plant

Publications (1)

Publication Number Publication Date
RO118451B1 true RO118451B1 (en) 2003-05-30

Family

ID=16380231

Family Applications (1)

Application Number Title Priority Date Filing Date
RO96-00667A RO118451B1 (en) 1994-07-29 1995-05-30 Polypeptide with dikinase orthophosphate pyruvate activity, dna encoding the same, recombinant vector with said dna and process for modifying a plant by means of said vector

Country Status (6)

Country Link
CN (1) CN1174093C (en)
BR (1) BR9506291A (en)
RO (1) RO118451B1 (en)
RU (1) RU2136748C1 (en)
UA (1) UA28003C2 (en)
WO (1) WO1996004369A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0690128A4 (en) * 1993-12-03 2004-03-03 Japan Tobacco Inc Polypeptide having cold-resistant pyruvate phosphate dikinase activity, dna coding for the same, and recombinant vector and transformed plant both containing said dna
CN102653727A (en) * 2012-01-18 2012-09-05 江南大学 Construction method and application of pyruvic acid-phosphoric acid double-kinase recombinant expression strain

Also Published As

Publication number Publication date
WO1996004369A1 (en) 1996-02-15
CN1174093C (en) 2004-11-03
BR9506291A (en) 1997-08-05
MX9601212A (en) 1997-09-30
CN1135770A (en) 1996-11-13
UA28003C2 (en) 2000-10-16
RU2136748C1 (en) 1999-09-10

Similar Documents

Publication Publication Date Title
CA2106960C (en) Plant promoter involved in controlling lipid biosynthesis in seeds
AU2002333434B2 (en) Coffee plant with reduced alpha-D-galactosidase activity
Higgins et al. The sequence of a pea vicilin gene and its expression in transgenic tobacco plants
ES2275302T3 (en) NUCLEIC ACID MOLECULES FROM ARTICHOKE (CYNARA SCOLYMUS) CODIFYING ENZYMES THAT HAVE FRUCTOSIPOLIMERASE ACTIVITY.
JPH10501972A (en) Plant virus resistance gene and method
JPH05508774A (en) Increased starch content in plants
HU218048B (en) Process for producing transgenic plants altered in habitus and yeald with utilyzing plasmides
ES2265956T3 (en) MODIFIED PLANTS.
JPH06502759A (en) Recombinant ACC synthase
WO2009021448A1 (en) A plant height regulatory gene and uses thereof
US20040019933A1 (en) Application of aspen MADS-box genes to alter reproduction and development in trees
CN106834305B (en) Rice male fertility regulation gene OsSTRL2 and application thereof
JP2001512322A (en) Protein complementation in transgenic plants
AU737542B2 (en) Genes controlling floral development and apical dominance in plants
CZ136196A3 (en) Polypeptide nucleotide chain, dna vector, process for preparing such polypeptide and a plant obtained by genetic engineering method
AU2011273569A1 (en) Gene involved in the development of the seed
US20040111764A1 (en) Composition and method for increased meiotic recombination in plants
JP3329820B2 (en) Polypeptide having low-temperature-resistant pyruvate phosphate dikinase activity, DNA encoding the same, recombinant vector containing the DNA, and transformed plant
CA2258571A1 (en) Plant sterol reductases and uses thereof
US20030093835A1 (en) Chimeric genes controlling flowering
RO118451B1 (en) Polypeptide with dikinase orthophosphate pyruvate activity, dna encoding the same, recombinant vector with said dna and process for modifying a plant by means of said vector
HUT73740A (en) Debranching enzymes and dna sequences coding them, suitable for changing the degree of branching of amylopectin starch in plants
CN111788216B (en) Agent for controlling formation of salt vesicle cell and plant body into which the agent is introduced
JPH10512451A (en) Deoxyribonucleic acid encoding glutathione S-transferase and use thereof
JP3513192B2 (en) A novel rice starch branching enzyme gene