BRPI0312771B1

BRPI0312771B1 - METHODS FOR REDUCING TRANSGEN SILENCE AND DETECTING POLYNUCLEOTIDE IN DNA PLANTS, MOLECULE AND CONSTRUCTION AS WELL AS DNA DETECTION KIT

Info

Publication number: BRPI0312771B1
Authority: BR
Publication date: 2018-10-02

Description

(54) Título: MÉTODOS PARA REDUZIR SILENCIAMENTO DE TRANSGENE E PARA DETECTAR(54) Title: METHODS TO REDUCE TRANSGENE SILENCE AND TO DETECT

POLINUCLEOTÍDEO EM PLANTAS, MOLÉCULA E CONSTRUTO DE DNA, BEM COMO KIT DE DETECÇÃO DE DNA (73) Titular: MONSANTO TECHNOLOGY LLC, Sociedade Norte-Americana. Endereço: 800 North Lindbergh Boulevard, St. Louis, MO 63167, ESTADOS UNIDOS DA AMÉRICA(US) (72) Inventor: STANISLAW FLASINSKIPOLYNUCLEOTIDE IN PLANTS, MOLECULE AND DNA CONSTRUCT, AS WELL AS DNA DETECTION KIT (73) Holder: MONSANTO TECHNOLOGY LLC, North American Society. Address: 800 North Lindbergh Boulevard, St. Louis, MO 63167, UNITED STATES OF AMERICA (US) (72) Inventor: STANISLAW FLASINSKI

Código de Controle: BB86314C31EB417B C5DCAD1E72219BF9Control Code: BB86314C31EB417B C5DCAD1E72219BF9

Prazo de Validade: 10 (dez) anos contados a partir de 02/10/2018, observadas as condições legaisValidity Term: 10 (ten) years from 10/02/2018, subject to legal conditions

Expedida em: 02/10/2018Issued on: 10/02/2018

Assinado digitalmente por:Digitally signed by:

Liane Elizabeth Caldeira LageLiane Elizabeth Caldeira Lage

Diretora de Patentes, Programas de Computador e Topografias de Circuitos IntegradosDirector of Patents, Computer Programs and Topographies of Integrated Circuits

Relatório Descritivo da Patente de Invenção para MÉTODOSInvention Patent Descriptive Report for METHODS

PARA REDUZIR SILENCIAMENTO DE TRANSGENE E PARA DETECTARTO REDUCE TRANSGENE SILENCE AND TO DETECT

POLINUCLEOTÍDEO EM PLANTAS, MOLÉCULA E CONSTRUTO DEPOLYNUCLEOTIDE IN PLANTS, MOLECULE AND CONSTRUCTION OF

DNA, BEM COMO KIT DE DETECÇÃO DE DNA.DNA, AS WELL AS DNA DETECTION KIT.

FUNDAMENTOS DA INVENÇÃOBACKGROUND OF THE INVENTION

Campo da InvençãoField of the Invention

A presente invenção refere-se ao engendramento genético vegetal. Mais particularmente, a um método para construir um polinucleotídeo artificial e aos métodos de uso para reduzir o silenciamento de transgene em plantas. A invenção também se refere às células vegetais contendo o polinucleotídeo artificial em que uma célula vegetal é transformada para expressar o polinucleotídeo artificial e a planta regenerada a partir deste.The present invention relates to the genetic engineering of plants. More particularly, a method for constructing an artificial polynucleotide and methods of use for reducing the silencing of transgene in plants. The invention also relates to plant cells containing the artificial polynucleotide in which a plant cell is transformed to express the artificial polynucleotide and the plant regenerated therefrom.

Descrição da Técnica RelacionadaDescription of the Related Art

Os genes heterólogos podem ser isolados a partir de uma fonte outra que não a planta na qual eles serão transformados ou eles podem ser modificados ou planejados para ter qualidades diferentes ou melhoradas. Particularmente traços ou qualidades desejadas de interesse para o engendramento genético vegetal podem incluir, mas não são limitadas à resistência aos insetos, doenças fúngicas e outras pragas e agentes causadores de doença, tolerâncias aos herbicidas, estabilidade realçada ou vida de prateleira, rendimento, tolerâncias ao estresse ambiental e realces nutricionais.Heterologous genes can be isolated from a source other than the plant into which they will be transformed, or they can be modified or designed to have different or improved qualities. Particular traits or desired qualities of interest to the genetic breeding of plants may include, but are not limited to resistance to insects, fungal diseases and other pests and disease-causing agents, tolerances to herbicides, enhanced stability or shelf life, yield, tolerances to environmental stress and nutritional enhancements.

Os métodos biológicos moleculares tradicionais para gerar novos genes e proteínas no geral envolveram a mutagênese aleatória ou direcionada. Um exemplo de mutagênese aleatória é uma técnica de recombinação conhecida como embaralhamento de DNA como divulgado nas Patentes US 5.605.793, 5.811.238, 5.830.721, 5.837.458 e Pedidos Internacionais WO 98/31837, WO 99/65927, a totalidade de todas as quais é aqui incorporada por referência. Um método alternativo de evolução molecular envolve um processo de extensão escalonado (StEP) para a mutagênese e recombinação in vitro de sequências de molécula de ácido nucléico, como divulgado na Patente US 5.965.408, aqui incorporada por referência. UmTraditional molecular biological methods for generating new genes and proteins in general have involved random or targeted mutagenesis. An example of random mutagenesis is a recombination technique known as DNA shuffling as disclosed in US Patents 5,605,793, 5,811,238, 5,830,721, 5,837,458 and International Orders WO 98/31837, WO 99/65927, all all of which are incorporated by reference. An alternative method of molecular evolution involves a stepped extension process (StEP) for mutagenesis and in vitro recombination of nucleic acid molecule sequences, as disclosed in US Patent 5,965,408, incorporated herein by reference. a

lo exemplo de mutagênese direcionada é a introdução de uma mutação pontual em um sítio específico em um polipeptídeo.l example of targeted mutagenesis is the introduction of a point mutation at a specific site in a polypeptide.

Um método alternativo, útil quando o gene heterólogo é de uma fonte não-vegetal, é planejar um gene inseticida artificial que use o códon mais freqüentemente usado na tabela de uso do códon na planta do milho (Koziel et al., 1993, Biotechnology 11, 194 a 200). Fischhoff e Perlak (Patente US N⁹ 5.500.365, aqui incorporada por referência) relatam a expressão mais alta de proteína inseticida de Bacillus thuringiensis (Bt) comparada em plantas de safra quando a seqüência de polinucleotídeo foi modificada para reduzir a ocorrência de seqüências desestabilizadoras. Foi necessário modificar a seqüência de polinucleotídeo de Bt do tipo selvagem porque o polinucleotídeo de Bt de comprimento natural do tipo selvagem não expressou níveis suficientes de proteína inseticida em plantas para ser agronomicamente útil.An alternative method, useful when the heterologous gene is from a non-vegetable source, is to design an artificial insecticidal gene that uses the codon most frequently used in the codon use table in the corn plant (Koziel et al., 1993, Biotechnology 11 , 194 to 200). Fischhoff and Perlak (US Patent No. ⁹ 5,500,365, incorporated herein by reference) report the highest expression of Bacillus thuringiensis (Bt) insecticidal protein compared to crop plants when the polynucleotide sequence was modified to reduce the occurrence of destabilizing sequences . It was necessary to modify the wild-type Bt polynucleotide sequence because the naturally-occurring wild-type Bt polynucleotide did not express sufficient levels of insecticidal protein in plants to be agronomically useful.

Os genes heterólogos são clonados em vetores adequados para a transformação de planta. As técnicas de transformação e regeneração úteis para incorporar genes heterólogos em um genoma da planta são bem conhecidos na técnica. O gene pode ser depois expressado na célula vegetal para exibir a característica ou traço adicionados. Entretanto, os genes heterólogos que normalmente expressam bem como transgenes podem experimentar o silenciamento de gene quando mais do que uma cópia dos mesmos genes é expressada na mesma planta. Isto pode ocorrer quando um primeiro gene heterólogo é muito similar a uma seqüência de DNA de gene endógeno na planta. Outros exemplos incluem quando uma planta transgênica é subseqüentemente cruzada com outras plantas transgênicas tendo os mesmos transgenes ou similares ou quando a planta transgênica é retransformada com um cassete de expressão vegetal que contém o mesmo gene ou similar. Similarmente, o silenciamento de gene pode ocorrer se a implantação de traços utilizam os mesmos elementos genéticos usados para direcionar a expressão do gene transgênico de interesse. De modo a implantar traços, linhagens transgênicas estáveis devem ser feitas com combinações diferentes de genes e elementos genéticos para evitar o silencia3 mento de gene.The heterologous genes are cloned into vectors suitable for plant transformation. Transformation and regeneration techniques useful for incorporating heterologous genes into a plant genome are well known in the art. The gene can then be expressed in the plant cell to display the added trait or trait. However, heterologous genes that normally express well as transgenes can experience gene silencing when more than one copy of the same genes is expressed on the same plant. This can occur when a first heterologous gene is very similar to an endogenous gene DNA sequence in the plant. Other examples include when a transgenic plant is subsequently crossed with other transgenic plants having the same or similar transgenes or when the transgenic plant is re-transformed with a plant expression cassette containing the same or similar gene. Similarly, gene silencing can occur if the implantation of traits uses the same genetic elements used to direct the expression of the transgenic gene of interest. In order to implant traits, stable transgenic strains must be made with different combinations of genes and genetic elements to avoid gene silencing.

A N-fosfonometilglicina, também conhecida como glifosato, é um herbicida bem conhecido que tem atividade em um amplo espectro de espécies vegetais. O glifosato é o ingrediente ativo do Roundup® (Monsanto Co.), um herbicida seguro tendo uma meia-vida desejavelmente curta no ambiente. Quando aplicado a uma superfície vegetal, o glifosato move-se sistemicamente através da planta. O glifosato é fitotóxico devido à sua inibição do caminho do ácido chiquímico, que fornece um precursor para a síntese de aminoácidos aromáticos. O glifosato inibe a enzima 5-enolpiruvil-3fosfochiquimato sintase (EPSPS).N-phosphonomethylglycine, also known as glyphosate, is a well-known herbicide that has activity in a wide spectrum of plant species. Glyphosate is the active ingredient in Roundup® (Monsanto Co.), a safe herbicide with a desirably short half-life in the environment. When applied to a plant surface, glyphosate moves systemically through the plant. Glyphosate is phytotoxic due to its inhibition of the shikimic acid pathway, which provides a precursor for the synthesis of aromatic amino acids. Glyphosate inhibits the enzyme 5-enolpyruvyl-3phosphochiquime synthase (EPSPS).

A tolerância ao glifosato também pode ser obtida pela expressão de variantes de EPSPS que têm afinidade mais baixa ao glifosato e portanto retêm a sua atividade catalítica na presença de glifosato (Patente U.S. N^p5.633.435, aqui incorporada por referência). As enzimas que degradam glifosato em tecidos vegetais (Patente U.S. N^s 5.463.175) também são capazes de conferir tolerância celular ao glifosato. Tais genes são usados para a produção de safras transgênicas que são tolerantes ao glifosato, permitindo deste modo que o glifosato seja usado para o controle eficaz de ervadaninha com preocupação mínima de dano de safra. Por exemplo, a tolerância ao glifosato foi geneticamente engendrada no milho (Patente U.S. N^e5.554.798), trigo (Pedido de Patente U.S. N^Q 20020062503), soja (Pedido de Patente U.S. N^e 20020157139) e canola (WO 9204449), todas as quais são incorporadas por referência. Os transgenes para a tolerância ao glifosato e os transgenes para a tolerância a outros herbicidas, por exemplo, o gene bar, (Toki et al. Plant Physiol., 100: 1503 a 1507, 1992; Thompson ef al. EMBO J. 6: 2519 a 2523, 1987, fosfinothricin acetyltransferase, BAR gene isolated from Streptomyces; DeBlock et al. EMBO J., 6:2513-2522, 1987, glufosinate herbicide) também são úteis como marcadores selecionáveis ou marcadores registráveis e podem fornecer um fenótipo útil para a seleção de plantas ligadas com outros traços agronomicamente úteis.The glyphosate tolerance can also be obtained by the expression of EPSPS variants that have lower affinity for glyphosate and therefore retain their catalytic activity in the presence of glyphosate (US Patent No. 5,633,435 ^w, incorporated herein by reference). Enzymes that degrade glyphosate in plant tissues ^(s US Patent 5,463,175) are also capable of conferring cellular tolerance to glyphosate. Such genes are used to produce transgenic crops that are tolerant to glyphosate, thereby allowing glyphosate to be used for effective weed control with minimal concern for crop damage. For example, glyphosate tolerance has been genetically engineered maize (US Patent No. 5,554,798 ^and), wheat (US Patent Application 20020062503 C ^Q), soybean (US Patent Application 20020157139 ^and C) and canola (WO 9204449) all of which are incorporated by reference. Transgenes for glyphosate tolerance and transgenes for tolerance to other herbicides, for example, the bar gene, (Toki et al. Plant Physiol., 100: 1503 to 1507, 1992; Thompson ef al. EMBO J. 6: 2519 to 2523, 1987, fosfinothricin acetyltransferase, BAR gene isolated from Streptomyces; DeBlock et al. EMBO J., 6: 2513-2522, 1987, glufosinate herbicide) are also useful as selectable markers or recordable markers and may provide a useful phenotype for the selection of plants linked with other agronomically useful traits.

O que é necessário na técnica são métodos de planejar genes para a expressão em plantas para melhorar traços agronomicamente úteis «· ········What is needed in the art are methods of planning genes for expression in plants to improve agronomically useful traits «· ········

J-Λ que evitem o silenciamento de gene quando cópias múltiplas são inseridas e a recombinação com genes vegetais endógenos.J-Λ to avoid gene silencing when multiple copies are inserted and recombination with endogenous plant genes.

BREVE DESCRIÇÃO DOS DESENHOSBRIEF DESCRIPTION OF THE DRAWINGS

Figura 1. Comparação acumulada das mudanças das sequências de polinucleotídeo de duas versões da EPSPS de arroz artificial (OsEPSPS_AT, OsEPSPS_ZM) e uma EPSPS de arroz nativa (OsEPSPS_Nat) o polipeptídeo de cada uma modificado para ser resistente ao glifosato.Figure 1. Accumulated comparison of polynucleotide sequence changes in two versions of artificial rice EPSPS (OsEPSPS_AT, OsEPSPS_ZM) and a native rice EPSPS (OsEPSPS_Nat) the polypeptide of each modified to be resistant to glyphosate.

Figura 2. Comparação acumulada das seqüências de polinucleotídeo de uma EPSPS de milho nativa (ZmEPSPS_Nat) e uma artificial (ZmEPSPS_ZM) o polipeptídeo de cada uma modificado para ser resistente ao glifosato.Figure 2. Accumulated comparison of the polynucleotide sequences of a native corn EPSPS (ZmEPSPS_Nat) and an artificial one (ZmEPSPS_ZM) the polypeptide of each modified to be resistant to glyphosate.

Figura 3. Comparação acumulada das seqüências de polinucleotídeo de uma EPSPS de soja nativa (GmEPSPS_Nat) e uma versão artificial (GmEPSPS_GM) o polipeptídeo de cada uma modificado para ser resistente ao glifosato.Figure 3. Cumulative comparison of the polynucleotide sequences of a native soybean EPSPS (GmEPSPS_Nat) and an artificial version (GmEPSPS_GM) of the polypeptide of each modified to be resistant to glyphosate.

Figura 4. Comparação acumulada das seqüências de polinucleotídeo de um gene BAR nativo (BAR1_Nat) e duas versões artificiais com códon de eleição de Zea mays (BAR1_ZM) e Arabidopsis thaliana (BAR1_AT).Figure 4. Accumulated comparison of the polynucleotide sequences of a native BAR gene (BAR1_Nat) and two artificial versions with the codon of choice for Zea mays (BAR1_ZM) and Arabidopsis thaliana (BAR1_AT).

Figura 5. Comparação acumulada das seqüências de polinucleotídeo de CTP2 e CP4EPSPS nativas (CTP2CP4_Nat) e versões artificiais (CTP2CP4_AT, CTP2CP4_ZM e CTP2CP4_GM).Figure 5. Cumulative comparison of native CTP2 and CP4EPSPS polynucleotide sequences (CTP2CP4_Nat) and artificial versions (CTP2CP4_AT, CTP2CP4_ZM and CTP2CP4_GM).

Figura 6. Mapa plasmídico de pMON54949.Figure 6. Plasmid map of pMON54949.

Figura 7. Mapa plasmídico de pMON54950.Figure 7. Plasmid map of pMON54950.

Figura 8. Mapa plasmídico de pMON30151.Figure 8. Plasmid map of pMON30151.

Figura 9. Mapa plasmídico de pMON59302.Figure 9. Plasmid map of pMON59302.

Figura 10. Mapa plasmídico de pMON59307.Figure 10. Plasmid map of pMON59307.

Figura 11. Mapa plasmídico de pMON42411.Figure 11. Plasmid map of pMON42411.

Figura 12. Mapa plasmídico de pMON58400.Figure 12. Plasmid map of pMON58400.

Figura 13. Mapa plasmídico de pMON58401.Figure 13. Plasmid map of pMON58401.

Figura 14. Mapa plasmídico de pMON54964.Figure 14. Plasmid map of pMON54964.

1.31.3

Figura 15. Mapa plasmídico de pMON25455.Figure 15. Plasmid map of pMON25455.

Figura 16. Mapa plasmídico de pMON30152.Figure 16. Plasmid map of pMON30152.

Figura 17. Mapa plasmídico de pMON54992.Figure 17. Plasmid map of pMON54992.

Figura 18. Mapa plasmídico de pMON54985.Figure 18. Plasmid map of pMON54985.

Figura 19. Mapa plasmídico de pMON20999.Figure 19. Plasmid map of pMON20999.

Figura 20. Mapa plasmídico de pMON45313.Figure 20. Plasmid map of pMON45313.

Figura 21. Mapa plasmídico de pMON59308.Figure 21. Plasmid map of pMON59308.

Figura 22. Mapa plasmídico de pMON59309.Figure 22. Plasmid map of pMON59309.

Figura 23. Mapa plasmídico de pMON59313.Figure 23. Plasmid map of pMON59313.

Figura 24. Mapa plasmídico de pMON59396.Figure 24. Plasmid map of pMON59396.

Figura 25. Mapa plasmídico de pMON25496.Figure 25. Plasmid map of pMON25496.

BREVE DESCRIÇÃO DA LISTAGEM DE SEQÜÊNCIA SEQ ID NO: 1 OsEPSPS_TIPS Uma seqüência de proteína deBRIEF DESCRIPTION OF THE SEQUENCE LISTING SEQ ID NO: 1 OsEPSPS_TIPS A protein sequence of

EPSPS de arroz modificada para ser resistente ao glifosato, com peptídeo de trânsito de cloroplasto.EPSPS of rice modified to be resistant to glyphosate, with chloroplast transit peptide.

Seqüência de polinucleotídeo de um polinucleotídeo de EPSPS nativa do arroz modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an EPSPS polynucleotide native to rice modified to encode a glyphosate resistant protein.

Seqüência de polinucleotídeo de um polinucleotídeo de EPSPS de arroz artificial usando a tabela de uso do códon de Arabidopsis e os métodos da presente invenção e ainda modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an artificial rice EPSPS polynucleotide using the Arabidopsis codon usage table and the methods of the present invention and further modified to encode a glyphosate resistant protein.

SEQIDNO: 2 OsEPSPS NatSEQIDNO: 2 OsEPSPS Nat

SEQ ID NO: 3 OsEPSPS AT ·· ·»·····*SEQ ID NO: 3 OsEPSPS AT ·· · »····· *

SEQ ID NO: 4SEQ ID NO: 4

OsEPSPS ZMOsEPSPS ZM

SEQ ID NO: 5SEQ ID NO: 5

GmEPSPSJKSGmEPSPSJKS

SEQ ID NO: 6SEQ ID NO: 6

GmEPSPS NatGmEPSPS Nat

SEQ ID NO: 7SEQ ID NO: 7

GmEPSPS_GMGmEPSPS_GM

SEQ ID NO: 8SEQ ID NO: 8

ZmEPSPS_TIPSZmEPSPS_TIPS

SEQ ID NO: 9SEQ ID NO: 9

ZmEPSPSJMat ··· »· ···· • · · · · 9 · · · · • · · · • · · · · ··· ·· ··ZmEPSPSJMat ··· »· ···· • · · · · 9 · · · · · · · · · · · · ··· ·· ··

Seqüência de polinucleotideo de um polinucleotideo de EPSPS de arroz artificial usando a tabela de uso do códon de Zea mays e os métodos da presente invenção e ainda modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an artificial rice EPSPS polynucleotide using the Zea mays codon usage table and the methods of the present invention and further modified to encode a glyphosate resistant protein.

Uma seqüência de proteína de EPSPS da soja modificada para ser resistente ao glifosato, com peptídeo de trânsito de cloroplasto.A sequence of EPSPS protein from soy modified to be resistant to glyphosate, with chloroplast transit peptide.

Seqüência de polinucleotideo de um polinucleotideo de EPSPS nativa da soja modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of a native EPSPS polynucleotide from soy modified to encode a glyphosate resistant protein.

Seqüência de polinucleotideo de um polinucleotideo de EPSPS da soja artificial usando a tabela de uso do códon de Glicina max e os métodos da presente invenção e ainda modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an EPSPS polynucleotide from artificial soybean using the Glycine max codon usage table and the methods of the present invention and further modified to encode a glyphosate resistant protein.

Uma seqüência de proteína de EPSPS do milho modificada para ser resistente ao glifosato, com peptídeo de trânsito de cloroplasto.A sequence of EPSPS protein from corn modified to be resistant to glyphosate, with chloroplast transit peptide.

Seqüência de polinucleotideo de um polinucleotideo de EPSPS nativa do milho modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an EPSPS polynucleotide native to corn modified to encode a glyphosate resistant protein.

• · » · »·· ·• · »·» ·· ·

SEQ ID NO; 10SEQ ID NO; 10

ZmEPSPS ZMZmEPSPS ZM

SEQ ID NO: 11SEQ ID NO: 11

CTP2 ··· ·· ···· • · · · • · · · • · · « • · · · · • · · · · · ·CTP2 ··· ·· ···· · · · · · · · · · · · · · · · · · · · · · ·

SEQ ID NO: 12SEQ ID NO: 12

SEQ ID NO: 13SEQ ID NO: 13

CTP2 NatCTP2 Nat

CTP2 ATCTP2 AT

SEQ ID NO: 14SEQ ID NO: 14

CTP2 ZMCTP2 ZM

SEQ ID NO: 15SEQ ID NO: 15

CP4EPSPSCP4EPSPS

SEQ ID NO: 16SEQ ID NO: 16

CP4EPSPS_NatCP4EPSPS_Nat

Seqüência de polinucleotídeo de um polinucleotídeo de EPSPS do milho artificial usando a tabela de uso do códon de Zea mays e os métodos da presente invenção e ainda modificado para codificar uma proteína resistente ao glifosato.Polynucleotide sequence of an EPSPS polynucleotide from artificial corn using the Zea mays codon usage table and the methods of the present invention and further modified to encode a glyphosate resistant protein.

Seqüência de proteína do peptídeo de trânsito de cloroplasto 2 do gene de EPSPS de Arabidopsis.Protein sequence of the chloroplast transit peptide 2 of the EPSPS gene of Arabidopsis.

Seqüência de polinucleotídeo do peptídeo de trânsito de cloroplasto de EPSPS de Arabidopsis. Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a CTP2 usando a tabela de uso do códon de Arabidopsis e os métodos da presente invenção.Polynucleotide sequence of Arabidopsis EPSPS chloroplast transit peptide. Polynucleotide sequence of an artificial polynucleotide encoding CTP2 using the Arabidopsis codon usage table and the methods of the present invention.

Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a CTP2 usando a tabela de uso do códon de Zea mays e os métodos da presente invenção.Polynucleotide sequence of an artificial polynucleotide encoding CTP2 using the Zea mays codon usage table and the methods of the present invention.

A seqüência de proteína da proteína de EPSPS resistente ao glifosato da cepa CP4 de Agrobacterium.The protein sequence of the glyphosate-resistant EPSPS protein of the CP4 strain of Agrobacterium.

Seqüência de polinucleotídeo do polinucleotídeo nativo que codifica a proteína CP4EPSPS (Patente U.S. N⁹ 5.633.435).Polynucleotide sequence of the native polynucleotide encoding the CP4EPSPS protein (US Patent No. ⁹ 5,633,435).

• · · · · ··• · · · · ··

CP4EPSPS_AT • · ·· · · · · • · ····»···CP4EPSPS_AT • · ·· · · · · • ···· »···

SEQ ID NO: 17SEQ ID NO: 17

SEQ ID NO: 18SEQ ID NO: 18

SEQ ID NO: 19SEQ ID NO: 19

SEQ ID NO: 20SEQ ID NO: 20

SEQ ID NO: 21SEQ ID NO: 21

CP4EPSPS_ZMCP4EPSPS_ZM

BAR1BAR1

BAR1 NatBAR1 Nat

BAR1 ATBAR1 AT

SEQ ID NO: 22SEQ ID NO: 22

SEQ ID NO: 23SEQ ID NO: 23

SEQ ID NO: 24SEQ ID NO: 24

BAR1 ZMBAR1 ZM

CP4EPSPS_SynCP4EPSPS_Syn

CP4EPSPS_AT_p1CP4EPSPS_AT_p1

Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a proteína CP4EPSPS usando a tabela de uso do códon de Arabidopsis e os métodos da presente invenção.Polynucleotide sequence of an artificial polynucleotide encoding the CP4EPSPS protein using the Arabidopsis codon usage table and the methods of the present invention.

Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a proteína CP4EPSPS usando a tabela de uso do códon de Zea mays e os métodos da presente invenção.Polynucleotide sequence of an artificial polynucleotide encoding the CP4EPSPS protein using the Zea mays codon usage table and the methods of the present invention.

A seqüência de proteína de uma fosfinotricina acetiltransferase. Seqüência de polinucleotídeo do polinucleotídeo nativo isolado de Streptomyces que codifica a fosfinotricina acetiltransferase. Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a fosfinotricina acetiltransferase usando a tabela de uso do códon de Arabidopsis e os métodos da presente invenção.The protein sequence of a phosphinothricin acetyltransferase. Polynucleotide sequence of the native polynucleotide isolated from Streptomyces that encodes phosphinothricin acetyltransferase. Polynucleotide sequence of an artificial polynucleotide encoding phosphinothricin acetyltransferase using the Arabidopsis codon usage table and the methods of the present invention.

Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a fosfinotricina acetiltransferase usando a tabela de uso do códon de Zea mays e os métodos da presente invenção. Seqüência de polinucleotídeo de um polinucleotídeo artificial com códon de eleição de dicotiledônea.Polynucleotide sequence of an artificial polynucleotide encoding phosphinothricin acetyltransferase using the Zea mays codon usage table and the methods of the present invention. Polynucleotide sequence of an artificial polynucleotide with a dicotyledon codon of choice.

Diagnóstico de molécula iniciadora de DNA para o polinucleotídeo CP4EPSPS_AT.Diagnosis of DNA primer molecule for the polynucleotide CP4EPSPS_AT.

VV

SEQ ID NO: 25SEQ ID NO: 25

SEQ ID NO: 26SEQ ID NO: 26

SEQ ID NO: 27SEQ ID NO: 27

SEQ ID NO: 28SEQ ID NO: 28

SEQ ID NO: 29SEQ ID NO: 29

SEQ ID NO: 30SEQ ID NO: 30

SEQ ID NO: 31SEQ ID NO: 31

SEQ ID NO: 32SEQ ID NO: 32

SEQ ID NO: 33SEQ ID NO: 33

SEQ ID NO: 34SEQ ID NO: 34

SEQ ID NO: 35SEQ ID NO: 35

CP4EPSPS_AT_p2CP4EPSPS_AT_p2

CP4EPSPS_ZM_p1CP4EPSPS_ZM_p1

CP4EPSPS_ZM_p2CP4EPSPS_ZM_p2

CP4EPSPS_Nat_p1CP4EPSPS_Nat_p1

CP4EPSPS_Nat_p2CP4EPSPS_Nat_p2

CP4EPSPS_Syn_pCP4EPSPS_Syn_p

ZmAdhl iniciadorlZmAdhl initiator

ZmAdhl iniciador2ZmAdhl initiator2

GNAGIAMKSGNAGIAMKS

CTPEPSPSCP4_GCTPEPSPSCP4_G

MM

Diagnóstico de molécula iniciadora de DNA para o polinucleotídeo CP4EPSPS_ZM.Diagnosis of DNA primer molecule for the polynucleotide CP4EPSPS_ZM.

Diagnóstico de molécula iniciadora de DNA para o polinucleotídeo CP4EPSPS_Nat.Diagnosis of DNA primer molecule for the polynucleotide CP4EPSPS_Nat.

Diagnóstico de molécula iniciadora de DNA para o polinucleotídeo CP4EPSPS_Syn.Diagnosis of a DNA primer molecule for the polynucleotide CP4EPSPS_Syn.

Diagnóstico de iniciador de controle 1 para o gene Adh1 do milho endógeno.Diagnosis of control primer 1 for the endogenous corn Adh1 gene.

Diagnóstico de iniciador de controle 2 para o gene Adh1 do milho endógeno.Diagnosis of control primer 2 for the endogenous corn Adh1 gene.

Motivo que fornece resistência ao glifosato a uma EPSPS vegetal. Seqüência de polinucleotídeo de um polinucleotídeo artificial que codifica a proteína CP4EPSPS usando a tabela de uso do códon de Glicina max.Reason that provides resistance to glyphosate to a vegetable EPSPS. Polynucleotide sequence of an artificial polynucleotide encoding the CP4EPSPS protein using the Glycine max codon usage table.

SUMÁRIO DA INVENÇÃOSUMMARY OF THE INVENTION

A presente invenção fornece métodos e composições para planejar uma seqüência de polinucleotídeo artificial que codifica uma proteína de interesse, em que o polinucleotídeo artificial é substancialmente diver·· ········The present invention provides methods and compositions for designing an artificial polynucleotide sequence that encodes a protein of interest, wherein the artificial polynucleotide is substantially divergent ·· ········

Jl^ gente de um polinucleotídeo que ocorre naturalmente em uma planta ou um polinucleotídeo que foi introduzido como um transgene em uma planta e o polinucleotídeo artificial e o polinucleotídeo codificam um polipeptídeo substancialmente idêntico.A polynucleotide naturally occurring in a plant or a polynucleotide that has been introduced as a transgene into a plant and the artificial polynucleotide and polynucleotide encode a substantially identical polypeptide.

Os polinucleotídeos artificiais da presente invenção que codificam proteínas que fornecem fenótipos agronomicamente úteis a uma planta transgênica contendo um constructo de DNA que compreende o polinucleotídeo artificial. Os fenótipos agronomicamente úteis incluem, mas não são limitados a: tolerância à seca, rendimento aumentado, tolerância ao frio, resistência às doenças, resistência a inseto e tolerância a herbicida.The artificial polynucleotides of the present invention that encode proteins that provide agronomically useful phenotypes to a transgenic plant containing a DNA construct that comprises the artificial polynucleotide. Agronomically useful phenotypes include, but are not limited to: drought tolerance, increased yield, cold tolerance, disease resistance, insect resistance and herbicide tolerance.

Um outro aspecto da presente invenção são polinucleotídeos artificiais que codificam uma proteína de EPSPS resistente a herbicida, uma proteína de fosfinotricina acetiltransferase, uma proteína de peptídeo de trânsito de cloroplasto. Em modalidades preferidas da presente invenção, a molécula de polinucleotídeo artificial é selecionada do grupo que consiste na SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 e SEQ ID NO: 35.Another aspect of the present invention is artificial polynucleotides that encode an herbicide-resistant EPSPS protein, a phosphinothricin acetyltransferase protein, a chloroplast transit peptide protein. In preferred embodiments of the present invention, the artificial polynucleotide molecule is selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 and SEQ ID NO: 35.

A presente invenção fornece constructos de DNA que compreendem: uma molécula promotora que funciona em plantas, operavelmente ligada a uma molécula de polinucleotídeo artificial da presente invenção, em que a molécula de polinucleotídeo artificial é selecionada do grupo que consiste na SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 e SEQ ID NO: 35, operavelmente ligada a uma região de término de transcrição.The present invention provides DNA constructs comprising: a promoter molecule that functions in plants, operably linked to an artificial polynucleotide molecule of the present invention, wherein the artificial polynucleotide molecule is selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 and SEQ ID NO: 35, operably linked to a transcription termination region.

A presente invenção ainda fornece constructos de DNA que compreendem: uma molécula promotora que funciona em plantas, operavelmente ligada a uma molécula de polinucleotídeo artificial que codifica um peptídeo de trânsito de cloroplasto, operavelmente ligada a uma EPSPS resistente ao glifosato heteróloga, operavelmente ligada a uma região de sinal de término de transcrição, em que o polinucleotídeo artificial é substancial•V ········ ··· ··· ·* >··· • · · * • · · · • · · · • « · · · • ·· ·· ·· íe mente divergente na seqüência de polinucleotídeo dos polinucleotídeos conhecidos que codificam um peptídeo de trânsito de cloroplasto idêntico.The present invention further provides DNA constructs that comprise: a promoter molecule that functions in plants, operably linked to an artificial polynucleotide molecule encoding a chloroplast transit peptide, operably linked to a heterologous glyphosate resistant EPSPS, operably linked to a transcription termination signal region, where the artificial polynucleotide is substantial • V ········ ··· ··· · *> ··· • · · * • · · · · · · • «· · · • ·· ·· ·· ee divergent in the polynucleotide sequence of known polynucleotides that encode an identical chloroplast transit peptide.

A presente invenção fornece constructos de DNA que compreendem pelo menos dois cassetes de expressão, o primeiro cassete de expressão compreendendo uma molécula promotora que funciona em plantas, operavelmente ligado a uma molécula de polinucleotídeo artificial da presente invenção, operavelmente ligada a uma região de sinal de término de transcrição e o segundo cassete de expressão que compreende uma molécula promotora que funciona em plantas, operavelmente ligado a uma molécula de polinucleotídeo que codifica um polipeptideo substancialmente idêntico como o dito polinucleotídeo artificial e é menos do que oito-cinco por cento similar na seqüência de polinucleotídeo ao dito polinucleotídeo artificial, operavelmente ligada a uma região de sinal de término de transcrição.The present invention provides DNA constructs that comprise at least two expression cassettes, the first expression cassette comprising a promoter molecule that functions in plants, operably linked to an artificial polynucleotide molecule of the present invention, operably linked to a signal region of transcription termination and the second expression cassette comprising a promoter molecule that functions in plants, operably linked to a polynucleotide molecule that encodes a substantially identical polypeptide like said artificial polynucleotide and is less than eight-five percent similar in sequence polynucleotide to said artificial polynucleotide, operably linked to a transcription termination signal region.

A presente invenção fornece células vegetais, plantas ou sua progênie que compreendem um constructo de DNA da presente invenção. De interesse particular são as plantas de sua progênie selecionadas do grupo que consiste em trigo, milho, arroz, soja, algodão, batata, canola, gramado, árvores de floresta, sorgo em grão, safras vegetais, plantas ornamentais, safras de forragem e safras de frutas.The present invention provides plant cells, plants or their progeny that comprise a DNA construct of the present invention. Of particular interest are the plants of your progeny selected from the group consisting of wheat, corn, rice, soybeans, cotton, potatoes, canola, lawn, forest trees, sorghum grains, vegetable crops, ornamental plants, forage crops and crops of fruits.

Um método da presente invenção reduz o silenciamento de gene durante a geração das plantas transgênicas compreendendo as etapas de:A method of the present invention reduces gene silencing during the generation of transgenic plants comprising the steps of:

a) construir um polinucleotídeo artificial que é substancialmente divergente de polinucleotídeos conhecidos que codificam uma proteína substancialmente idêntica ea) construct an artificial polynucleotide that is substantially divergent from known polynucleotides that encode a substantially identical protein, and

b) construir um constructo de DNA contendo a dita molécula de polinucleotídeo artificial; eb) constructing a DNA construct containing said artificial polynucleotide molecule; and

c) transformar o dito constructo de DNA em uma célula vegetal; ec) transforming said DNA construct into a plant cell; and

d) regenerar a dita célula vegetal em uma planta transgênica; ed) regenerating said plant cell in a transgenic plant; and

e) cruzar a dita planta transgênica com uma planta fértil, em que a dita planta fértil contém uma molécula de polinucleotídeo que codifica uma proteína substancialmente idêntica a uma proteína codificada pela dita molé12e) crossing said transgenic plant with a fertile plant, wherein said fertile plant contains a polynucleotide molecule that encodes a protein substantially identical to a protein encoded by said molecule12

··· • * • · ··· • * • · ··· ·· • · · • · · ··· ·· • · · • · · • • • • • ·· ·····»·· ·· · · · · • · * · • ·· ····· »·· ·· · · · · • · * · ·· ·· ··· ··· • · · • · · ··· ·· • · · • · · ··· ·· • • · ·· • • · ·· • · · · • · · · ··· ···· · * • · · · • · · · ··· ···· · * • · • ·

cuia de polinucleotídeo artificial e em que a dita molécula de polinucleotídeo artificial e a dita molécula de polinucleotídeo são substancialmente divergentes.artificial polynucleotide cuvette and wherein said artificial polynucleotide molecule and said polynucleotide molecule are substantially divergent.

Um outro aspecto da invenção é uma célula vegetal transgênica que compreende dois polinucleotídeos, em que pelo menos um dos polinucleotídeos é um transgene e os dois polinucleotídeos codificam uma proteína substancialmente idêntica e são menos do que oito-cinco por cento similares na seqüência de polinucleotídeo.Another aspect of the invention is a transgenic plant cell comprising two polynucleotides, wherein at least one of the polynucleotides is a transgene and the two polynucleotides encode a substantially identical protein and are less than eight-five percent similar in the polynucleotide sequence.

Um outro aspecto da presente invenção em um método para reduzir o silenciamento de gene durante a produção de plantas transgênicas compreende as etapas de:Another aspect of the present invention in a method for reducing gene silencing during the production of transgenic plants comprises the steps of:

a) construir um polinucleotídeo artificial que seja substancialmente divergente de polinucleotídeos conhecidos que codificam uma proteína substancialmente idêntica ea) constructing an artificial polynucleotide that is substantially divergent from known polynucleotides that encode a substantially identical protein, and

b) construir um primeiro constructo de DNA contendo a dita molécula de polinucleotídeo artificial; eb) constructing a first DNA construct containing said artificial polynucleotide molecule; and

e) retransformar uma célula a partir da dita planta transgênica com um segundo constructo de DNA que compreende uma molécula de polinucleotídeo que codifica uma proteína substancialmente idêntica ao dito polinucleotídeo artificial e o dito polinucleotídeo e o polinucleotídeo artificial são substancialmente divergentes na seqüência de polinucleotídeo; ee) retransforming a cell from said transgenic plant with a second DNA construct comprising a polynucleotide molecule that encodes a protein substantially identical to said artificial polynucleotide and said polynucleotide and artificial polynucleotide are substantially divergent in the polynucleotide sequence; and

f) regenerar a dita célula da etapa d em uma planta transgênica que compreende tanto o dito polinucleotídeo artificial quanto o dito polinucleotídeo.f) regenerating said cell of step d in a transgenic plant that comprises both said artificial polynucleotide and said polynucleotide.

Ainda são fornecidos pela presente invenção métodos para a seleção de uma planta transformada com um constructo de DNA da invenção que compreende as etapas de:There are also provided by the present invention methods for the selection of a plant transformed with a DNA construct of the invention that comprises the steps of:

a) transformar uma célula vegetal com um constructo de DNA da presente invenção; e •· ········ dla) transforming a plant cell with a DNA construct of the present invention; e • · ········ dl

··· ··· ··· ··· ·· ·· ···· ···· • · • · • • • • • • • • 4 · 4 · • · • · • • • · • · • • ··· ··· • • t · t · • • • • • • • • • · • · ··· ··· ·· ·· ·· ·· « · «·

b) cultivar a dita célula vegetal em um meio seletivo contendo um herbicida selecionado do grupo que consiste em: glifosato e glifosinato, para matar seletivamente as células que não foram transformadas com os ditos constructos de DNA; eb) cultivate said plant cell in a selective medium containing a herbicide selected from the group consisting of: glyphosate and glyphosinate, to selectively kill cells that have not been transformed with said DNA constructs; and

c) regenerar a dita célula vegetal em uma planta fértil.c) regenerate said plant cell in a fertile plant.

Um outro aspecto da invenção é um método de detectar um polinucleotídeo artificial em uma célula vegetal, planta ou sua progênie transgênica que compreende as etapas;Another aspect of the invention is a method of detecting an artificial polynucleotide in a plant cell, plant or its transgenic progeny that comprises the steps;

a) contatar uma amostra de DNA isolada da dita célula vegetal, planta ou sua progênie com uma molécula de DNA, em que a dita molécula de DNA compreende pelo menos uma molécula de DNA de um par de moléculas de DNA que quando usado em uma reação de amplificação de ácido nuciéico produz um amplicon que é diagnóstico para a dita molécula de polinucleotídeo artificial selecionada do grupo que consiste na: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 e SEQ ID NO: 35.a) contact a DNA sample isolated from said plant cell, plant or its progeny with a DNA molecule, wherein said DNA molecule comprises at least one DNA molecule from a pair of DNA molecules that when used in a reaction of nucleic acid amplification produces an amplicon that is diagnostic for said artificial polynucleotide molecule selected from the group consisting of: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 and SEQ ID NO: 35.

(a) realizar uma reação de amplificação de ácido nucléico, produzindo deste modo o amplicon; e (b) detectar o amplicon.(a) carrying out a nucleic acid amplification reaction, thereby producing amplicon; and (b) detecting the amplicon.

Os reagentes fornecidos para realizar o método de detecção acima incluem, mas não são limitados a: moléculas de DNA que especificamente hibridizam com uma molécula de polinucleotídeo artificial selecionada do grupo que consiste em: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21 e SEQ ID NO: 22; e moléculas de DNA isoladas selecionadas do grupo que consiste em: SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 e SEQ ID NO: 27.The reagents provided to perform the above detection method include, but are not limited to: DNA molecules that specifically hybridize to an artificial polynucleotide molecule selected from the group consisting of: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21 and SEQ ID NO : 22; and isolated DNA molecules selected from the group consisting of: SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 and SEQ ID NO: 27.

A presente invenção fornece plantas e progênie que compreende uma molécula de DNA selecionada do grupo que consiste em: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO:The present invention provides plants and progeny comprising a DNA molecule selected from the group consisting of: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO:

• · • · · »· · · <ÀL• · • · · · · · <ÀL

21, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 e SEQ ID NO: 27.21, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 and SEQ ID NO: 27.

A presente invenção fornece pares de moléculas de DNA selecionados do grupo que compreendem: uma primeira molécula de DNA e uma segunda molécula de DNA, em que a primeira molécula de DNA é a SEQ ID NO: 24 ou seu complemento e a segunda molécula de DNA é a SEQ ID NO: 25 ou seu complemento e o par de moléculas de DNA quando usado em um método de amplificação de DNA produz um amplicon e uma primeira molécula de DNA e uma segunda molécula de DNA, em que a primeira molécula de DNA é a SEQ ID NO: 26 ou seu complemento e a segunda molécula de DNA é a SEQ ID NO: 27 ou seu complemento e o par de moléculas de DNA quando usados em um método de amplificação de DNA produz um amplicon, em que o amplicon é diagnóstico para a presença de um polinucleotídeo artificial da presente invenção no genoma de uma planta transgênica.The present invention provides pairs of DNA molecules selected from the group comprising: a first DNA molecule and a second DNA molecule, where the first DNA molecule is SEQ ID NO: 24 or its complement and the second DNA molecule is SEQ ID NO: 25 or its complement and the pair of DNA molecules when used in a DNA amplification method produces an amplicon and a first DNA molecule and a second DNA molecule, where the first DNA molecule is SEQ ID NO: 26 or its complement and the second DNA molecule is SEQ ID NO: 27 or its complement and the pair of DNA molecules when used in a DNA amplification method produces an amplicon, where the amplicon is diagnosis for the presence of an artificial polynucleotide of the present invention in the genome of a transgenic plant.

A presente invenção fornece uma planta e sua progênie identificada por um método de amplificação de DNA para conter no seu genoma uma molécula de DNA selecionada do grupo que consiste em: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 e SEQ ID NO: 27.The present invention provides a plant and its progeny identified by a DNA amplification method to contain in its genome a DNA molecule selected from the group consisting of: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6 , SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 and SEQ ID NO: 27.

A presente invenção fornece e considera kits de detecção de DNA que compreendem: pelo menos uma molécula de DNA de comprimento suficiente que seja especificamente homóloga ou complementar a um polinucleotídeo artificial selecionado do grupo que consiste na SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21 e SEQ ID NO: 22, em que a dita molécula de DNA é útil como uma sonda de DNA ou iniciador de DNA; ou pelo menos uma molécula de DNA homóloga ou complementar a uma molécula iniciadora de DNA selecionada do grupo que consiste em: SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 e SEQ ID NO: 27.The present invention provides and considers DNA detection kits that comprise: at least one DNA molecule of sufficient length that is specifically homologous or complementary to an artificial polynucleotide selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 4 , SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21 and SEQ ID NO: 22, wherein said DNA molecule is useful as a DNA probe or DNA primer; or at least one DNA molecule homologous or complementary to a DNA primer molecule selected from the group consisting of: SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26 and SEQ ID NO: 27.

» · · · · · I»· · · · · I

A presente invenção ainda fornece um método de detectar a presença de um polinucleotídeo artificial que codifica uma EPSPS resistente ao glifosato em uma amostra de DNA, o método compreendendo:The present invention further provides a method of detecting the presence of an artificial polynucleotide that encodes a glyphosate-resistant EPSPS in a DNA sample, the method comprising:

(a) extrair uma amostra de DNA de uma planta; e (b) contatar a amostra de DNA com uma molécula de DNA rotulada de comprimento suficiente que seja especificamente homóloga ou complementar a um polinucleotídeo artificial selecionado do grupo que consiste na: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17 e SEQ ID NO: 18, em que a dita molécula de DNA rotulada é uma sonda de DNA; e (c) submeter a amostra e a sonda de DNA às condições de hibridização severa; e (d) detectar a sonda de DNA hibridizada à amostra de DNA.(a) extracting a DNA sample from a plant; and (b) contacting the DNA sample with a labeled DNA molecule of sufficient length that is specifically homologous or complementary to an artificial polynucleotide selected from the group consisting of: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 18, wherein said labeled DNA molecule is a probe of DNA; and (c) subjecting the sample and the DNA probe to the conditions of severe hybridization; and (d) detecting the DNA probe hybridized to the DNA sample.

A presente invenção fornece um polinucleotídeo isolado que codifica uma enzima de EPSPS, a enzima de EPSPS contém o motivo da SEQ ID NO: 34. A presente invenção fornece um constructo de DNA contendo um polinucleotídeo que codifica a enzima de EPSPS com o motivo da SEQ ID NO: 34. Uma célula vegetal, planta ou sua progênie que é tolerante ao glifosato como um resultado da expressão de uma enzima de EPSPS que contém o motivo da SEQ ID NO: 34 é um aspecto da invenção.The present invention provides an isolated polynucleotide that encodes an EPSPS enzyme, the EPSPS enzyme contains the SEQ ID NO: 34 motif. The present invention provides a DNA construct containing a polynucleotide that encodes the EPSPS enzyme with the SEQ motif ID NO: 34. A plant cell, plant or its progeny that is tolerant to glyphosate as a result of the expression of an EPSPS enzyme that contains the SEQ ID NO: 34 motif is an aspect of the invention.

DESCRIÇÃO DETALHADA DA INVENÇÃODETAILED DESCRIPTION OF THE INVENTION

As seguintes definições são fornecidas para melhor definir a presente invenção e para guiar aqueles versados na técnica na prática da presente invenção. A menos que de outro modo mencionado, os termos devem ser entendidos de acordo com o uso convencional por aqueles versados na técnica relevante. As definições de termos comuns na biologia molecular também podem ser encontradas em Rieger et al., Glossary of Genetics: Classical e Molecular, 5^a edição, Springer-Verlag: Nova Iorque, (1991); e Lewin, Genes V, Oxford University Press: Nova Iorque, (1994). A nomenclatura para as bases de DNA como apresentada em 37 CFR § 1.822 é usada. A nomenclatura de uma e três letras padrão para os resíduos de aminoácido é usada.The following definitions are provided to better define the present invention and to guide those skilled in the art in the practice of the present invention. Unless otherwise mentioned, the terms are to be understood according to conventional usage by those skilled in the relevant technique. Definitions of common terms in molecular biology may also be found in Rieger et al, Glossary of Genetics: Classical and Molecular, 5th edition, Springer-Verlag:. New York, (1991); and Lewin, Genes V, Oxford University Press: New York, (1994). The nomenclature for the DNA bases as presented in 37 CFR § 1,822 is used. The standard one- and three-letter nomenclature for amino acid residues is used.

Substituições de aminoácido, variantes de aminoácido, são preferivelmente substituições de resíduo de aminoácido único para um outro resíduo de aminoácido em qualquer posição dentro da proteína. As substituições, supressões, inserções ou qualquer combinação destas podem ser combinadas para chegar em um constructo final.Amino acid substitutions, amino acid variants, are preferably substitutions from a single amino acid residue to another amino acid residue at any position within the protein. Substitutions, deletions, insertions or any combination of these can be combined to arrive at a final construct.

Um polinucleotídeo artificial como usado na presente invenção é uma seqüência de DNA planejada de acordo com os métodos da presente invenção e criada como uma molécula de DNA isolada para o uso em um constructo de DNA que forneça a expressão de uma proteína em células hospedeiras e para os propósitos de clonagem em constructos apropriadas ou outros usos conhecidos por aqueles versados na técnica. Programas de computador estão disponíveis para estes propósitos, incluindo mas não limitados aos programas BestFit ou Gap do Sequence Analysis Software Package, Genetics Computer Group (GCG), Inc., University of Wisconsin Biotechnology Center, Madison, Wl 53711. O polinucleotídeo artificial pode ser criado por um ou mais métodos conhecidos na técnica, que incluem, mas não são limitados à: PCR de sobreposição. Um polinucleotídeo artificial da presente invenção é substancialmente divergente de outros polinucleotídeos que codificam a proteína idêntica ou quase idêntica.An artificial polynucleotide as used in the present invention is a DNA sequence designed according to the methods of the present invention and created as an isolated DNA molecule for use in a DNA construct that provides expression of a protein in host cells and for the purposes of cloning into appropriate constructs or other uses known to those skilled in the art. Computer programs are available for these purposes, including but not limited to the BestFit or Gap programs from the Sequence Analysis Software Package, Genetics Computer Group (GCG), Inc., University of Wisconsin Biotechnology Center, Madison, W 53711. The artificial polynucleotide can be created by one or more methods known in the art, which include, but are not limited to: overlay PCR. An artificial polynucleotide of the present invention is substantially divergent from other polynucleotides that encode the identical or nearly identical protein.

O termo quimérico refere-se a uma seqüência de ácido nucléico ou de proteína de fusão. Uma seqüência codificadora de ácido nucléico quimérica é compreendida de duas ou mais seqüências unidas na matriz que codificam uma proteína quimérica. Um gene quimérico refere-se aos elementos genéticos múltiplos derivados de fontes heterólogas que compreendem um gene.The term chimeric refers to a sequence of nucleic acid or fusion protein. A chimeric nucleic acid coding sequence is comprised of two or more sequences joined in the matrix that encode a chimeric protein. A chimeric gene refers to multiple genetic elements derived from heterologous sources that comprise a gene.

As frases seqüência codificadora, matriz de leitura aberta e seqüência estrutural referem-se à região de tripletos de ácido nucléico seqüenciais contínuos que codificam uma proteína, polipeptídeo ou seqüência de peptídeo.The phrases coding sequence, open reading matrix and structural sequence refer to the region of continuous sequential nucleic acid triplets that encode a protein, polypeptide or peptide sequence.

Códon refere-se a uma seqüência de três nucleotídeos que especificam um aminoácido particular.Codon refers to a sequence of three nucleotides that specify a particular amino acid.

Uso de códon ou códon de eleição referem-se à freqüência ··· ··· ·· ···· · ·· ········ ·· ··· · · · · · · · •·· · · · · · · · · • ······· · · • ··· ·· ·· ······· · · de uso de códons que codificam aminoácidos nas seqüências codificadoras de organismos.Use of codon or codon of choice refers to frequency ··· ··· ·· ···· · ·· ········ ·· ··· · · · · · · • ·· · · · · · · · · ······· · · • ··· ·· ·· ······· · · · of use of codons that encode amino acids in the coding sequences of organisms.

Complementaridade e complemento quando se referem às seqüências de ácido nucléico, referem-se à ligação específica de adenina à timina (uracila no RNA) e citosina à guanina nos filamentos opostos de DNA ou RNA.Complementarity and complement when referring to nucleic acid sequences, refer to the specific binding of adenine to thymine (uracil in RNA) and cytosine to guanine in the opposite strands of DNA or RNA.

Constructo refere-se aos elementos genéticos heterólogos operavelmente ligados entre si compondo uma molécula de DNA recombinante e pode compreender elementos que fornecem a expressão de uma molécula de polinucleotídeo de DNA em uma célula hospedeira e elementos que fornecem a manutenção do constructo.Construct refers to the heterologous genetic elements operably linked together making up a recombinant DNA molecule and can comprise elements that provide the expression of a DNA polynucleotide molecule in a host cell and elements that provide maintenance of the construct.

Região de terminal C refere-se à região de uma cadeia de peptídeo, polipeptídeo ou proteína do seu centro até a extremidade que carrega o aminoácido tendo um grupo carboxila livre.C-terminal region refers to the region of a peptide, polypeptide or protein chain from its center to the end that carries the amino acid having a free carboxyl group.

O termo divergente, como aqui usado, refere-se à comparação das moléculas de polinucleotídeo que codificam a mesma ou quase a mesma proteína ou polipeptídeo. O código genético de quatro letras (A, G, C e T/U) compreende códons de três letras que direcionam as moléculas de tRNA para montar aminoácidos em um polipeptídeo a partir de um padrão de mRNA. Tendo mais do que um códon que pode codificar o mesmo aminoácido é aludido como degenerado. Os códons degenerados são usados para construir moléculas de polinucleotídeo substancialmente divergentes que codificam o mesmo polipeptídeo onde estas moléculas têm uma seqüência de nucleotídeos do seu comprimento completo em que eles são menos do que 85 % idênticos e não existe nenhum comprimento de seqüência de polinucleotídeo maior do que 23 nucleotídeos que sejam idênticos.The term divergent, as used herein, refers to the comparison of polynucleotide molecules that encode the same or nearly the same protein or polypeptide. The four-letter genetic code (A, G, C and T / U) comprises three-letter codons that direct tRNA molecules to assemble amino acids in a polypeptide from an mRNA pattern. Having more than one codon that can encode the same amino acid is referred to as degenerate. Degenerate codons are used to build substantially divergent polynucleotide molecules that encode the same polypeptide where these molecules have a nucleotide sequence of their full length in which they are less than 85% identical and there is no length of polynucleotide greater than than 23 nucleotides that are identical.

O termo que codifica DNA refere-se ao DNA cromossômico, DNA plasmídico, cDNA ou polinucleotídeo de DNA artificial que codifica qualquer uma das proteínas aqui discutidas.The term encoding DNA refers to chromosomal DNA, plasmid DNA, cDNA or artificial DNA polynucleotide that encodes any of the proteins discussed here.

O termo endógeno refere-se aos materiais que se originam de dentro de um organismo ou célula.The term endogenous refers to materials that originate from within an organism or cell.

Endonuclease refere-se a uma enzima que hidrolisa DNA de <ΧοEndonuclease refers to an enzyme that hydrolyzes <Χο DNA

filamento duplo nos locais internos.double filament in indoor locations.

Exógeno refere-se a materiais que se originam de fora de um organismo ou célula. Isto tipicamente se aplica às moléculas de ácido nucléico usadas na produção de células hospedeiras e plantas transformadas ou transgênicas.Exogenous refers to materials that originate from outside an organism or cell. This typically applies to nucleic acid molecules used in the production of host cells and transformed or transgenic plants.

Éxon refere-se à porção de um gene que é realmente traduzida em proteína, isto é, uma seqüência codificadora.Exon refers to the portion of a gene that is actually translated into protein, that is, a coding sequence.

O termo expressão refere-se à transcrição ou tradução de um polinucleotídeo para produzir um produto de gene correspondente, um RNA ou proteína.The term expression refers to the transcription or translation of a polynucleotide to produce a corresponding gene product, an RNA or protein.

Fragmentos. Um fragmento de um gene é uma porção de uma molécula de ácido polinucléico de comprimento total que é de pelo menos um comprimento mínimo capaz de transcrição em um RNA, tradução em um peptídeo ou útil como uma sonda ou iniciador em um método de detecção de DNA.Fragments. A gene fragment is a portion of a full-length polynucleic acid molecule that is at least a minimum length capable of transcription into an RNA, translation into a peptide, or useful as a probe or primer in a DNA detection method .

O termo gene refere-se ao DNA cromossômico, DNA plasmídico, cDNA, polinucleotídeo de DNA artificial ou outro DNA que codifica uma molécula de peptídeo, polipeptídeo, proteína ou RNA e os elementos genéticos que flanqueiam a seqüência codificadora que estão envolvidos na regulagem da expressão.The term gene refers to chromosomal DNA, plasmid DNA, cDNA, artificial DNA polynucleotide or other DNA that encodes a peptide, polypeptide, protein or RNA molecule and the genetic elements that flank the coding sequence that are involved in regulating expression .

O termo genoma como se aplica aos vírus abrange todas as seqüências de ácido nucléico contidas dentro do capsídeo do vírus. O termo genoma como se aplica às bactérias abrange tanto o cromossoma quanto os plasmídeos dentro de uma célula hospedeira bacteriana. Os ácidos nucléicos codificadores da presente invenção introduzidos em células hospedeiras bacterianas podem ser portanto cromossomicamente integrados ou localizados no plasmídeo. O termo genoma como se aplica às células vegetais abrange não apenas o DNA cromossômico encontrado dentro do núcleo, mas à organela de DNA encontrada dentro dos componentes subcelulares da célula. Os ácidos nucléicos da presente invenção introduzidos nas células vegetais podem estar portanto cromossomicamente integrados ou localizados na organela.The term genome as it applies to viruses encompasses all nucleic acid sequences contained within the virus capsid. The term genome as it applies to bacteria covers both the chromosome and the plasmids within a bacterial host cell. Nucleic acids encoding the present invention introduced into bacterial host cells can therefore be chromosomally integrated or located in the plasmid. The term genome as it applies to plant cells encompasses not only the chromosomal DNA found within the nucleus, but the DNA organelle found within the subcellular components of the cell. The nucleic acids of the present invention introduced into plant cells can therefore be chromosomally integrated or located in the organelle.

• · · · · ·• · · · · ·

• · · » · · »· · · ·• · · · · · · · · ·

Glifosato refere-se à N-fosfonometilglicina e seus sais, o Glifosato é o ingrediente ativo do herbicida Roundup® (Monsanto Co.). Os tratamentos de plantas com glifosato referem-se aos tratamentos com as formulações de herbicida Roundup® ou Roundup Ultra®, a menos que de outro modo estabelecido. O Glifosato como N-fosfonometilglicina e seus sais (não formulado como o herbicida Roundup®) são componentes de meios de cultura sintéticos usados para a seleção de bactérias e tolerância da planta ao glifosato ou usados para determinar a resistência à enzima em ensaios bioquímicos in vitro.Glyphosate refers to N-phosphonomethylglycine and its salts, Glyphosate is the active ingredient in the herbicide Roundup® (Monsanto Co.). Glyphosate plant treatments refer to treatments with Roundup® or Roundup Ultra® herbicide formulations, unless otherwise stated. Glyphosate as N-phosphonomethylglycine and its salts (not formulated as the Roundup® herbicide) are components of synthetic culture media used for bacterial selection and plant tolerance to glyphosate or used to determine enzyme resistance in in vitro biochemical assays .

Seqüência de DNA heterólogo refere-se a uma seqüência de polinucleotídeo que se origina de uma fonte ou espécies estranhas ou, se da mesma fonte, é modificada a partir da sua forma original.Sequence of heterologous DNA refers to a polynucleotide sequence that originates from a source or foreign species or, if from the same source, is modified from its original form.

DNA homólogo refere-se ao DNA da mesma fonte como aquela da célula receptora.Homologous DNA refers to DNA from the same source as that of the recipient cell.

Hibridização refere-se à capacidade de um filamento de ácido nucléico para unir com um filamento complementar por intermédio do emparelhamento de base. A hibridização ocorre quando seqüências complementares nos dois filamentos de ácido nucléico se ligam entre si. As sondas e iniciadores de ácido nucléico da presente invenção hibridizam sob condições severas a uma seqüência de DNA alvo. Qualquer método de hibridização ou amplificação de ácido nucléico convencional pode ser usado para identificar a presença de DNA a partir de um evento transgênico em uma amostra. As moléculas de ácido nucléico ou seus fragmentos são capazes de hibridizar especificamente a outras moléculas de ácido nucléico sob certas circunstâncias. Como aqui usado, duas moléculas de ácido nucléico são ditas serem capazes de hibridizar especificamente entre si se as duas moléculas forem capazes de formar uma estrutura de ácido nucléico antiparalela, de filamento duplo. Uma molécula de ácido nucléico é dita ser o complemento de uma outra molécula de ácido nucléico se elas exibem complementaridade completa. Como aqui usado, as moléculas são ditas exibir complementaridade completa quando cada nucleotídeo de uma das moléculas é complementar a um nucleotídeo da outra. Duas moléculas são ditas serem minimamenteHybridization refers to the ability of a nucleic acid filament to join with a complementary filament through base pairing. Hybridization occurs when complementary sequences in the two nucleic acid strands are linked together. The nucleic acid probes and primers of the present invention hybridize under severe conditions to a target DNA sequence. Any conventional nucleic acid hybridization or amplification method can be used to identify the presence of DNA from a transgenic event in a sample. Nucleic acid molecules or fragments thereof are able to specifically hybridize to other nucleic acid molecules under certain circumstances. As used herein, two nucleic acid molecules are said to be able to hybridize specifically to each other if the two molecules are capable of forming a double-stranded antiparallel nucleic acid structure. A nucleic acid molecule is said to be the complement of another nucleic acid molecule if they exhibit complete complementarity. As used herein, molecules are said to exhibit complete complementarity when each nucleotide of one of the molecules is complementary to a nucleotide of the other. Two molecules are said to be minimally

ΜΜ

complementar se elas podem hibridizar entre si com estabilidade suficiente para permitir que elas permaneçam recozidas entre si pelo menos sob condições de severidade baixa convencionais. Similarmente, as moléculas são ditas serem complementares” se elas podem hibridizar entre si com estabilidade suficiente para permitir que elas permaneçam recozidas entre si sob condições de alta severidade convencionais. As condições de severidade convencionais são descritas por Sambrook et al., 1989 e por Haymes et al., Em: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, DC (1985), aqui incorporada por referência em sua totalidade. Afastamentos da complementaridade completa são portanto permissíveis, contanto que tais afastamentos não impeçam completamente a capacidade das moléculas para formarem uma estrutura de filamento duplo. De modo que uma molécula de ácido nucléico sirva como um iniciador ou sonda ela necessita ser apenas suficientemente complementar em seqüência para ser capaz de formar uma estrutura de filamento duplo estável sob o solvente e concentrações salinas particulares utilizados.complementary if they can hybridize to each other with sufficient stability to allow them to remain annealed to each other at least under conventional low severity conditions. Similarly, molecules are said to be complementary ”if they can hybridize to each other with sufficient stability to allow them to remain annealed to each other under conventional high severity conditions. Conventional severity conditions are described by Sambrook et al., 1989 and by Haymes et al., In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, DC (1985), incorporated herein by reference in its entirety. Deviations from complete complementarity are therefore permissible, provided that such deviations do not completely impede the ability of molecules to form a double-stranded structure. In order for a nucleic acid molecule to serve as a primer or probe it needs only to be sufficiently complementary in sequence to be able to form a stable double-stranded structure under the particular solvent and salt concentrations used.

Como aqui usado, uma seqüência substancialmente homóloga é uma seqüência de ácido nucléico que especificamente hibridizará com o complemento da seqüência de ácido nucléico ao qual ela está sendo comparada sob condições de alta severidade. O termo condições severas é funcionalmente definido com respeito à hibridização de uma sonda de ácido nucléico a um ácido nucléico alvo (isto é, a uma seqüência de ácido nucléico particular de interesse) pelo procedimento de hibridização específico debatido em Sambrook et al., 1989, em 9.52-9.55. Ver também, Sambrook et al., 1989 em 9.47-9.52, 9.56-9.58 aqui incorporada por referência em sua totalidade; Kanehisa, (Nucl. Acids Res. 12: 203 a 213, 1984, aqui incorporada por referência em sua totalidade); e Wetmur e Davidson, (J. Mol. Biol. 31: 349 a 370, 1988, aqui incorporada por referência em sua totalidade). Conseqüentemente, as seqüências de nucleotídeo da invenção podem ser usadas quanto a sua capacidade para formar seletivamente moléculas duplex com trechos complementares de fragmentos de DNA. Dependendo da aplicação prevista, desejar-se-á utilizar condições variáveis de hibridização para se o/e>As used herein, a substantially homologous sequence is a nucleic acid sequence that will specifically hybridize to the complement of the nucleic acid sequence to which it is being compared under conditions of high severity. The term severe conditions is functionally defined with respect to the hybridization of a nucleic acid probe to a target nucleic acid (that is, to a particular nucleic acid sequence of interest) by the specific hybridization procedure discussed in Sambrook et al., 1989, on 9.52-9.55. See also, Sambrook et al., 1989 at 9.47-9.52, 9.56-9.58 incorporated herein by reference in its entirety; Kanehisa, (Nucl. Acids Res. 12: 203 to 213, 1984, hereby incorporated by reference in its entirety); and Wetmur and Davidson, (J. Mol. Biol. 31: 349 to 370, 1988, incorporated herein by reference in their entirety). Consequently, the nucleotide sequences of the invention can be used in terms of their ability to selectively form duplex molecules with complementary stretches of DNA fragments. Depending on the intended application, it will be desirable to use variable hybridization conditions to ensure that the

obter graus variáveis de seletividade de sonda em relação à seqüência alvo. Para aplicações que requerem seletividade alta, desejar-se-á tipicamente utilizar condições relativamente severas para formar os híbridos, por exemplo, selecionar-se-á condições relativamente de sal baixo e/ou temperatura alta, tal como fornecidas por cerca de 0,02 M a cerca de 0,15 M de NaCl em temperaturas de cerca de 50° C a cerca de 70° C. Uma das condições severas, por exemplo, é lavar o filtro de hibridização pelo menos duas vezes com tampão de lavagem de alta severidade (0,2X SSC, 0,1 % de SDS, 65° C). As condições de severidade apropriadas que promovem a hibridização de DNA, por exemplo, 6,0 x cloreto de sódio/citrato de sódio (SSC) a cerca de 45° C, seguida por uma lavagem de 2,0 x SSC a 50° C, são conhecidas por aqueles versados na técnica ou podem ser encontradas em Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. Por exemplo, a concentração salina na etapa de lavagem pode ser selecionada de uma severidade baixa de cerca de 2,0 x SSC a 50° C até uma severidade alta de cerca de 0,2 x SSC a 50° C. Além disso, a temperatura na etapa de lavagem pode ser aumentada de condições de severidade baixa na temperatura ambiente, cerca de 22° C, até condições de alta severidade a cerca de 65° C. Tanto a temperatura quanto o sal podem ser variados ou a temperatura ou a concentração de sal podem ser mantidos constantes enquanto a outra variável é mudada. Tais condições seletivas toleram pouco, se algum, desemparelhamento entre a sonda e o padrão ou filamento alvo. A detecção de seqüências de DNA por intermédio da hibridização é bem conhecida por aqueles versados na técnica e as divulgações das Patentes U.S. N³⁵4.965.188 e 5.176.995 são exemplares dos métodos de análises de hibridização.obtain varying degrees of probe selectivity in relation to the target sequence. For applications that require high selectivity, it will typically be desirable to use relatively severe conditions to form the hybrids, for example, relatively low salt and / or high temperature conditions will be selected, as provided by about 0.02 M to about 0.15 M NaCl at temperatures from about 50 ° C to about 70 ° C. One of the severe conditions, for example, is to wash the hybridization filter at least twice with high-severity wash buffer. (0.2X SSC, 0.1% SDS, 65 ° C). The appropriate stringency conditions that promote DNA hybridization, for example, 6.0 x sodium chloride / sodium citrate (SSC) at about 45 ° C, followed by a 2.0 x SSC wash at 50 ° C , are known to those skilled in the art or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, NY (1989), 6.3.1-6.3.6. For example, the saline concentration in the wash step can be selected from a low severity of about 2.0 x SSC at 50 ° C to a high severity of about 0.2 x SSC at 50 ° C. The temperature in the washing step can be increased from conditions of low severity at room temperature, about 22 ° C, to conditions of high severity at about 65 ° C. Both the temperature and the salt can be varied or the temperature or concentration of salt can be kept constant while the other variable is changed. Such selective conditions tolerate little, if any, mismatch between the probe and the target pattern or filament. Detection of DNA sequences via hybridization is well known to those skilled in the art and disclosed in the US Patents ³⁵ 4,965,188 and 5,176,995 are exemplary of the methods of hybridization analyzes.

Identidade refere-se ao grau de similaridade entre duas seqüências de ácido polinucléico ou proteína. Um alinhamento das duas seqüências é realizado por um programa de computador adequado. Um programa de computador amplamente usado e aceito para realizar alinhamentos de seqüência é o CLUSTALW v1.6 (Thompson, et al. Nucl. Acids Res., 22: 4673 a 4680, 1994). O número de bases ou aminoácidos emparelhados • ·· ········ »« · · · · • · · · • · · · • · · · ······· · · é dividido pelo número total de bases ou aminoácidos e multiplicado por 100 para se obter uma identidade percentual. Por exemplo, se duas seqüências de 580 pares de base tiveram 145 bases emparelhadas, elas podem ser 25 por cento idênticas. Se as duas seqüências comparadas são de comprimentos diferentes, o número de emparelhamentos é dividido pelo mais curto dos dois comprimentos. Por exemplo, se existem 100 aminoácidos emparelhados entre uma proteína de 200 e uma de 400 aminoácidos, elas são 50 por cento idênticas com respeito à seqüência mais curta. Se a seqüência mais curta é menor do que 150 bases ou 50 aminoácidos no comprimento, o número de emparelhamentos é dividido por 150 (para as bases de ácido nucléico) ou 50 (para os aminoácidos) e multiplicados por 100 para se obter uma identidade percentual.Identity refers to the degree of similarity between two polynucleic acid or protein strings. An alignment of the two sequences is performed by a suitable computer program. A widely used and accepted computer program for performing sequence alignments is CLUSTALW v1.6 (Thompson, et al. Nucl. Acids Res., 22: 4673 to 4680, 1994). The number of bases or paired amino acids • ·· ········ »« · · · · • · · · · · · · • · · ······· · · · is divided by the total number bases or amino acids and multiplied by 100 to obtain a percentage identity. For example, if two 580 base pair strings had 145 paired bases, they can be 25 percent identical. If the two strings compared are of different lengths, the number of matches is divided by the shorter of the two lengths. For example, if there are 100 amino acids paired between a 200 and a 400 amino acid protein, they are 50 percent identical with respect to the shortest sequence. If the shortest sequence is less than 150 bases or 50 amino acids in length, the number of matches is divided by 150 (for nucleic acid bases) or 50 (for amino acids) and multiplied by 100 to obtain a percentage identity .

Como aqui descrito uma proteína pode ser substancialmente idêntica às proteínas relacionadas. Estas proteínas com identidade substancial no geral compreendem pelo menos uma seqüência de polipeptideo que tem pelo menos noventa e oito de identidade de seqüência percentual comparada com a sua outra seqüência de polipeptideo relacionada. O programa Gap no WISCONSIN PACKAGE versão 10.0-UNIX da Genetics Computer Group, Inc. está fundamentado no método de Needleman e Wunsch (J. Mol. Biol. 48: 443 a 453, 1970) usando o conjunto de parâmetros de padrão para a comparação pelo modo de pares (para a comparação de seqüência de aminoácido: Penalidade de Criação de Intervalo = 8, Penalidade de Extensão de Intervalo = 20); ou usando o programa TBLASTN no conjunto de software BLAST 2.2.1 (Altschul et al., Nucleic Acids Res. 25: 3389 a 3402), usando a matriz BLOSUM62 (Henikoff e Henikoff, Proc. Natl. Acad. Sei. U.S.A. 89: 10915 a 10919, 1992) e o conjunto de parâmetros padrão para a comparação pelo modo de pares (valor de criação de intervalo = 11, valor para a extensão de intervalo = 1.). No BLAST, o valor E ou o valor de probabilidade, representa o número de alinhamentos diferentes com pontuações equivalentes ou melhores do que as pontuações de alinhamento bruto, S, que são esperadas ocorrer em uma pesquisa de banco de dados por acaso. Quanto mais baixo o valor E, mais significante o emparelhamen•» ··«·»··· • · · 9 • ·· ··· · · ···· t · · · · · • · · · · · • · · · · · · • · · · · · • ·«··· ··As described herein, a protein can be substantially identical to related proteins. These proteins with substantial identity in general comprise at least one polypeptide sequence that has at least ninety-eight percent sequence identity compared to its other related polypeptide sequence. The Gap program in WISCONSIN PACKAGE version 10.0-UNIX by Genetics Computer Group, Inc. is based on the method of Needleman and Wunsch (J. Mol. Biol. 48: 443 to 453, 1970) using the standard parameter set for comparison by the pair mode (for the comparison of amino acid sequence: Interval Creation Penalty = 8, Interval Extension Penalty = 20); or using the TBLASTN program in the BLAST 2.2.1 software suite (Altschul et al., Nucleic Acids Res. 25: 3389 to 3402), using the BLOSUM62 matrix (Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89: 10915 to 10919, 1992) and the set of standard parameters for the comparison by the pair mode (interval creation value = 11, value for the interval extension = 1.). In BLAST, the E value or the probability value represents the number of different alignments with scores equivalent to or better than the raw alignment scores, S, which are expected to occur in a database search by chance. The lower the E value, the more significant the match • »··« · »··· • · · 9 • ·· ··· · · ···· t · · · · · · · · · · • · · · · · · · · · · · · · · ··· ··

Μ to. Porque o tamanho do banco de dados é um elemento nos cálculos do valor E, os valores E obtidos pela aplicação de BLAST contra banco de dados públicos, tais como o GenBank, tem no geral aumentado com o tempo para qualquer emparelhamento questão/entrada dado. A identidade percentual refere-se à porcentagem de resíduos de aminoácido identicamente emparelhados que existem ao longo do comprimento daquela porção das seqüências que é alinhada pelo algoritmo BLAST.Μ to. Because the size of the database is an element in E-value calculations, the E-values obtained by applying BLAST against public databases, such as GenBank, have generally increased over time for any given question / entry pairing. The percent identity refers to the percentage of identically matched amino acid residues that exist along the length of that portion of the strings that is aligned by the BLAST algorithm.

Intron refere-se a uma porção de um gene não traduzido em proteína, embora ela seja transcrita em RNA.Intron refers to a portion of a gene that is not translated into protein, although it is transcribed into RNA.

Uma seqüência de ácido nucléico isolada é substancialmente separada ou purificada de outras seqüências de ácido nucléico com as quais o ácido nucléico está normalmente associado na célula do organismo no qual o ácido nucléico naturalmente ocorre, isto é, outro DNA cromossômico ou extracromossômico. O termo abrange os ácido nucléicos que são bioquimicamente purificados de modo a remover substancialmente ácido nucléicos contaminantes e outros componentes celulares. O termo também abrange ácidos nucléicos recombinantes e ácido nucléicos quimicamente sintetizados.An isolated nucleic acid sequence is substantially separated or purified from other nucleic acid sequences with which the nucleic acid is normally associated in the cell of the organism in which the nucleic acid naturally occurs, that is, other chromosomal or extrachromosomal DNA. The term encompasses nucleic acids that are biochemically purified in order to substantially remove contaminating nucleic acids and other cellular components. The term also encompasses recombinant nucleic acids and chemically synthesized nucleic acids.

Polipeptídeos Isolados, Purificados, Homogêneos. Um polipeptídeo é isolado se ele foi separado dos componentes celulares (ácido nucléicos, lipídeos, carboidratos e outros polipeptídeos) que naturalmente o acompanham ou que é quimicamente sintetizado ou recombinante. Um polipeptídeo monomérico é isolado quando pelo menos 60 % em peso de uma amostra é composta do polipeptideo, preferivelmente 90 % ou mais, mais preferivelmente 95 % ou mais e o mais preferivelmente mais do que 99 %. A pureza ou homogeneidade da proteína são indicadas, por exemplo, pela eletroforese em gel de poliacrilamida de uma amostra de proteína, seguido pela visualização de uma única faixa de polipeptideo no tingimento do gel de poliacrilamida; a cromatografia líquida de alta pressão; ou outros métodos convencionais. As proteínas podem ser purificadas por qualquer um dos meios conhecidos na técnica, por exemplo como descrito no Guide to Protein Purification, ed. Deutscher, Meth. Enzymol. 185. Academic Press, SanIsolated, Purified, Homogeneous Polypeptides. A polypeptide is isolated if it has been separated from cellular components (nucleic acids, lipids, carbohydrates and other polypeptides) that naturally accompany it or that are chemically synthesized or recombinant. A monomeric polypeptide is isolated when at least 60% by weight of a sample is made up of the polypeptide, preferably 90% or more, more preferably 95% or more and most preferably more than 99%. The purity or homogeneity of the protein is indicated, for example, by the polyacrylamide gel electrophoresis of a protein sample, followed by the visualization of a single polypeptide strip in the dyeing of the polyacrylamide gel; high pressure liquid chromatography; or other conventional methods. Proteins can be purified by any means known in the art, for example as described in Guide to Protein Purification, ed. Deutscher, Meth. Enzymol. 185. Academic Press, San

Λα>Λα>

Diego, 1990; e Scopes, Protein Purification: Principies e Practice, Springer Verlag, Nova Iorque, 1982.Diego, 1990; and Scopes, Protein Purification: Principies and Practice, Springer Verlag, New York, 1982.

Rotulação ou rotulado. Existe uma variedade de métodos e reagentes convencionais para a rotulação de polinucleotídeos e polipeptídeos e fragmentos destes. Os rótulos típicos incluem isótopos, ligandos ou receptores de ligando radioativos, fluoróforos, agentes e enzimas quimioluminescentes. Os métodos para a rotulação e orientação na escolha de rótulos apropriados para os vários propósitos são debatidos, por exemplo, em Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989) e Current Protocols in Molecular Biology, ed. Ausubel et al., Greene Publishing e Wiley-lnterscience, Nova Iorque, (1992).Lettering or labeled. There are a variety of conventional methods and reagents for labeling polynucleotides and polypeptides and fragments thereof. Typical labels include isotopes, ligands or radioactive ligand receptors, fluorophores, chemiluminescent agents and enzymes. Methods for labeling and guidance in choosing appropriate labels for various purposes are discussed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989) and Current Protocols in Molecular Biology, ed . Ausubel et al., Greene Publishing and Wiley-Interscience, New York, (1992).

Região codificadora de proteína madura, este termo refere-se à seqüência de um produto de proteína processado, isto é, uma EPSP sintase madura que permanece depois que o peptídeo de trânsito de cloroplasto foi removido.Mature protein coding region, this term refers to the sequence of a processed protein product, that is, a mature EPSP synthase that remains after the chloroplast transit peptide has been removed.

Nativo, o termo nativo no geral refere-se a um ácido polinucléico ou polipeptideo que ocorrem naturalmente (tipo selvagem). Entretanto, no contexto da presente invenção, alguma modificação de um polinucleotídeo e polipeptideo isolados pode ter ocorrido para fornecer um polipeptídeo com um fenótipo particular, por exemplo, substituição de aminoácido na EPSPS sensível ao glifosato para fornecer uma EPSPS resistente ao glifosato. Para propósitos comparativos na presente invenção, o polinucleotídeo isolado que contém uns poucos nucleotideos substituídos para fornecer a modificação de aminoácido para a tolerância a herbicida é aludido como o polinucleotídeo nativo quando comparado com o polinucleotídeo substancialmente divergente criado pelos métodos da presente invenção. Entretanto, o polinucleotídeo nativo modificado desta maneira é não nativo com respeito aos elementos genéticos normalmente encontrados ligados a um polinucleotídeo não-modificado que ocorre naturalmente.Native, the term native in general refers to a naturally occurring polynucleic acid or polypeptide (wild type). However, in the context of the present invention, some modification of an isolated polynucleotide and polypeptide may have occurred to provide a polypeptide with a particular phenotype, for example, amino acid substitution in the glyphosate-sensitive EPSPS to provide a glyphosate-resistant EPSPS. For comparative purposes in the present invention, the isolated polynucleotide that contains a few substituted nucleotides to provide amino acid modification for herbicide tolerance is referred to as the native polynucleotide when compared to the substantially divergent polynucleotide created by the methods of the present invention. However, the native polynucleotide modified in this way is non-native with respect to the genetic elements normally found attached to a naturally occurring unmodified polynucleotide.

Região de terminal N refere-se a uma região de uma cadeia de peptídeo, polipeptideo ou proteína do aminoácido tendo um grupo amino livre no centro da cadeia.N-terminal region refers to a region of a peptide, polypeptide or protein chain of the amino acid having a free amino group at the center of the chain.

• · ·• · ·

Ácido nucléico refere-se ao ácido desoxirribonucléico (DNA) e ácido ribonucléico (RNA).Nucleic acid refers to deoxyribonucleic acid (DNA) and ribonucleic acid (RNA).

Códigos de ácido nucléico: A = adenosina; C = citosina; G = guanosina; T = timidina. Códigos usados para a síntese de oligonucleotídeos: N = eqüimolar A, C, G e T; I = desoxiinosina; K = eqüimolar G e T; R = eqüimolar A e G; S = eqüimolar C e G; W = eqüimolar A e T; Y = eqüimolar CeT.Nucleic acid codes: A = adenosine; C = cytosine; G = guanosine; T = thymidine. Codes used for the synthesis of oligonucleotides: N = equimolar A, C, G and T; I = deoxyinosine; K = equimolar G and T; R = equimolar A and G; S = equimolar C and G; W = equimolar A and T; Y = equimolar CeT.

Um segmento de ácido nucléico ou uma segmento de molécula de ácido nucléico é uma molécula de ácido nucléico que foi isolada livre de DNA genômico total de uma espécie particular ou que foi sintetizado. Incluído com o termo segmento de ácido nucléico estão os segmentos de DNA, vetores recombinantes, plasmídeos, cosmídeos, fagomídeos, fago, vírus, et cetera.A nucleic acid segment or a nucleic acid molecule segment is a nucleic acid molecule that has been isolated free of total genomic DNA from a particular species or that has been synthesized. Included with the term nucleic acid segment are the segments of DNA, recombinant vectors, plasmids, cosmids, phagemids, phage, viruses, et cetera.

Variantes de Seqüência de Nucleotídeo, usando métodos bem conhecido, o versado na técnica pode facilmente produzir variantes de seqüência de nucleotídeo e aminoácido de genes e proteínas, respectivamente. Por exemplo, as moléculas de DNA variantes da presente invenção são moléculas de DNA contendo mudanças em uma seqüência de gene de EPSPS, isto é, mudanças que incluem um ou mais nucleotídeos da seqüência de gene de EPSPS sendo suprimido, adicionado e/ou substituído, tal que o gene de EPSPS variante codifique uma proteína que retenha a atividade de EPSPS. As moléculas de DNA variantes podem ser produzidas, por exemplo, pelas técnicas de mutagênese de DNA padrão ou sintetizando-se quimicamente a molécula de DNA variante ou uma porção desta. Os métodos para a síntese química de ácidos nucléicos são debatidos, por exemplo, em Beaucage et al., Tetra. Letts. 22: 1859 a 1862 (1981) e Matteucci et al., J. Am. Chem. Soc. 103: 3185- (1981). A síntese química de ácido nucléicos pode ser realizada, por exemplo, em sintetizadores de oligonucleotídeo automatizados. Tais variantes preferivelmente não mudam a matriz de leitura da região que codifica a proteína do ácido nucléico e preferivelmente codificam uma proteína não tendo nenhuma mudança ou apenas uma redução menor.Nucleotide Sequence Variants, using well-known methods, the person skilled in the art can easily produce nucleotide and amino acid sequence variants of genes and proteins, respectively. For example, the variant DNA molecules of the present invention are DNA molecules containing changes in an EPSPS gene sequence, that is, changes that include one or more nucleotides in the EPSPS gene sequence being deleted, added and / or replaced, such that the variant EPSPS gene encodes a protein that retains EPSPS activity. Variant DNA molecules can be produced, for example, by standard DNA mutagenesis techniques or by chemically synthesizing the variant DNA molecule or a portion thereof. Methods for the chemical synthesis of nucleic acids are discussed, for example, in Beaucage et al., Tetra. Letts. 22: 1859 to 1862 (1981) and Matteucci et al., J. Am. Chem. Soc. 103: 3185- (1981). Chemical synthesis of nucleic acids can be carried out, for example, on automated oligonucleotide synthesizers. Such variants preferably do not change the reading frame of the region encoding the nucleic acid protein and preferably encode a protein having no change or only minor reduction.

Matriz de leitura aberta (ORF) refere-se a uma região de DNA ou RNA que codifica um peptídeo, polipeptídeo ou proteína.Open reading matrix (ORF) refers to a region of DNA or RNA that encodes a peptide, polypeptide or protein.

Operavelmente Ligada. Uma primeira seqüência de ácido nucléico está operavelmente ligada com uma segunda seqüência de ácido nucléico quando a primeira seqüência de ácido nucléico é colocada em uma relação funcional com a segunda seqüência de ácido nucléico. Por exemplo, um promotor está operavelmente ligado a uma seqüência codificadora de proteína se o promotor causa a transcrição ou expressão da seqüência codificadora. No geral, as seqüências de DNA operavelmente ligadas são contíguas e, onde necessário para unir duas regiões codificadoras de proteína, na matriz de leitura.Operably Connected. A first sequence of nucleic acid is operably linked with a second sequence of nucleic acid when the first sequence of nucleic acid is placed in a functional relationship with the second sequence of nucleic acid. For example, a promoter is operably linked to a protein coding sequence if the promoter causes the transcription or expression of the coding sequence. In general, the operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the reading matrix.

Superexpressão refere-se à expressão de um RNA ou polipeptídeo ou proteína codificados por um DNA introduzido em uma célula hospedeira, em que o RNA ou polipeptídeo ou proteína não estão normalmente presentes na célula hospedeira ou em que o RNA ou polipeptídeo ou proteína estão presentes na dita célula hospedeira em um nível mais alto do que aquele normalmente expressado a partir do gene endógeno que codifica o RNA ou polipeptídeo ou proteína.Overexpression refers to the expression of an RNA or polypeptide or protein encoded by a DNA introduced into a host cell, where the RNA or polypeptide or protein is not normally present in the host cell or where the RNA or polypeptide or protein is present in the said host cell at a higher level than that normally expressed from the endogenous gene that encodes the RNA or polypeptide or protein.

O termo planta abrange qualquer planta superior e sua progênie, incluindo monocotiledôneas (por exemplo, milho, arroz, trigo, cevada, etc.), dicotiledôneas (por exemplo, soja, algodão, canola, tomate, batata, Arabidopsis, tabaco, etc.), gimnospermas (pinheiros, abetos, cedros, etc.) e inclui partes de plantas, incluindo unidades reprodutivas de uma planta (por exemplo, sementes, bulbos, tubérculos, frutas, flores, etc.) ou outras partes ou tecidos daquela planta podem ser reproduzidos.The term plant encompasses any superior plant and its progeny, including monocots (for example, corn, rice, wheat, barley, etc.), dicots (for example, soybeans, cotton, canola, tomatoes, potatoes, Arabidopsis, tobacco, etc. ), gymnosperms (pines, firs, cedars, etc.) and includes parts of plants, including reproductive units of a plant (for example, seeds, bulbs, tubers, fruits, flowers, etc.) or other parts or tissues of that plant reproduced.

Cassete de expressão vegetal refere-se a segmentos de DNA quimérico que compreendem os elementos reguladores que são operavelmente ligados para fornecer a expressão de um produto de transgene em plantas.Plant expression cassette refers to segments of chimeric DNA that comprise the regulatory elements that are operably linked to provide expression of a transgene product in plants.

Plasmídeo refere-se a um pedaço circular, extracromossômico, auto-replicativo de DNA.Plasmid refers to a circular, extrachromosomal, self-replicating piece of DNA.

Sinal de poliadenilação ou sinal poliA refere-se a uma se• · • · qüência de ácido nucléico localizada a 3' em relação a uma região codificadora que cause a adição de nucleotídeos de adenilato à extremidade 3’ do mRNA transcrito a partir da região codificadora.Polyadenylation signal or polyA signal refers to a nucleic acid sequence located 3 'to a coding region that causes the addition of adenylate nucleotides to the 3' end of the mRNA transcribed from the coding region .

Reação de cadeia da polimerase (PCR)” refere-se a um método de amplificação de DNA que usa uma técnica enzimática para criar cópias múltiplas de uma seqüência de ácido nucléico (amplicon). As cópias de uma molécula de DNA são preparadas transportando-se um DNA polimerase entre dois amplímeros. A base deste método de amplificação são ciclos múltiplos de mudanças de temperatura para desnaturar, depois recozer os amplímeros (moléculas preparadoras de DNA), seguido pela extensão para sintetizar novos filamentos de DNA na região localizada entre os amplímeros flanqueadores. A amplificação de ácido nucléico pode ser realizada por qualquer uma dos vários métodos de amplificação de ácido nucléico conhecido na técnica, incluindo a reação de cadeia da polimerase (PCR). Uma variedade de métodos de amplificação são conhecidos na técnica e são descritos, inter alia, nas Patentes U.S. N⁸⁵ 4.683.195 e 4.683.202 e no PCR Protocols: A Guide to Method e Applications, ed. Innis et al., Academic Press, San Diego, 1990. Os métodos de amplificação pela PCR foram desenvolvidos para amplificar até 22 kb de DNA genômico e até 42 kb de DNA de bacteriófago (Cheng et al., Proc. Natl. Acad. Sei. USA 91: 5695 a 5699, 1994). Estes métodos assim como outros métodos conhecidos na técnica de amplificação de DNA podem ser usados na prática da presente invenção.Polymerase chain reaction (PCR) ”refers to a method of DNA amplification that uses an enzymatic technique to create multiple copies of a nucleic acid sequence (amplicon). Copies of a DNA molecule are prepared by transporting a DNA polymerase between two amplimers. The basis of this amplification method are multiple cycles of temperature changes to denature, then anneal the amplimers (DNA-preparing molecules), followed by the extension to synthesize new DNA strands in the region located between the flanking amplimers. Nucleic acid amplification can be performed by any of the various methods of nucleic acid amplification known in the art, including the polymerase chain reaction (PCR). A variety of amplification methods are known in the art and are described, inter alia, in US Patents ⁸⁵ 4,683,195 and 4,683,202 and in PCR Protocols: A Guide to Method and Applications, ed. Innis et al., Academic Press, San Diego, 1990. PCR amplification methods were developed to amplify up to 22 kb of genomic DNA and up to 42 kb of bacteriophage DNA (Cheng et al., Proc. Natl. Acad. Sei USA 91: 5695 to 5699, 1994). These methods as well as other methods known in the DNA amplification technique can be used in the practice of the present invention.

Polinucleotídeo refere-se a uma extensão de moléculas de ácido desoxirribonucléico (DNA) e ácido ribonucléico (RNA) maiores do que dois, que são conectados para formar uma molécula maior.Polynucleotide refers to an extension of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) molecules larger than two, which are connected to form a larger molecule.

Fragmentos de polipeptídeo. A presente invenção também abrange fragmentos de uma proteína que carece de pelo menos um resíduo de uma proteína de comprimento natural nativa, mas que substancialmente mantém a atividade da proteína.Polypeptide fragments. The present invention also encompasses fragments of a protein that lacks at least one residue of a protein of native natural length, but which substantially maintains the activity of the protein.

O termo promotor ou região promotora refere-se a uma molécula de ácido polinucléico que funciona como um elemento regulador, usualmente encontrado a montante (5') em relação a uma seqüência codificado-The term promoter or promoter region refers to a polynucleic acid molecule that functions as a regulatory element, usually found upstream (5 ') in relation to an encoded sequence.

ra, que controla a expressão da seqüência codificadora controlando-se a produção de RNA mensageiro (mRNA) pelo qual fornece o sítio de reconhecimento para a RNA polimerase e/ou outros fatores necessários para o início da transcrição no sítio correto. Como aqui considerado, um promotor ou região promotora inclui variações de promotores derivados por meio de ligação a várias seqüências reguladoras, mutagênese aleatória ou controlada e a adição ou duplicação de seqüências realçadoras. A região promotora aqui divulgada e seus equivalentes biologicamente funcionais, são responsáveis por conduzir a transcrição de seqüências codificadoras sob o seu controle quando introduzido em um hospedeiro como parte de um vetor recombinante adequado, como demonstrado pela sua capacidade para produzir mRNA.ra, which controls the expression of the coding sequence by controlling the production of messenger RNA (mRNA) by which it provides the recognition site for RNA polymerase and / or other factors necessary for the initiation of transcription at the correct site. As considered herein, a promoter or promoter region includes variations of derived promoters by binding to various regulatory sequences, random or controlled mutagenesis and the addition or duplication of enhancer sequences. The promoter region disclosed here and its biologically functional equivalents, are responsible for conducting the transcription of coding sequences under its control when introduced into a host as part of a suitable recombinant vector, as demonstrated by its ability to produce mRNA.

Recombinante. Um ácido nucléico recombinante é feito por uma combinação de dois segmentos de outro modo separados de seqüência, por exemplo, pela síntese química ou pela manipulação de segmentos isolados de ácidos nucléicos pelas técnicas de engendramento genético.Recombinant. A recombinant nucleic acid is made by a combination of two otherwise separated segments of sequence, for example, by chemical synthesis or by manipulating isolated segments of nucleic acids by genetic engineering techniques.

O termo constructo de DNA recombinante” ou vetor recombinante refere-se a qualquer agente tal como um plasmídeo, cosmídeo, vírus, seqüência que replica autonomamente, fago ou seqüência de nucleotídeo de DNA ou RNA de filamento único ou de filamento duplo linear ou circular, derivada de qualquer fonte, capaz de integração genômica ou replicação autônoma, compreendendo uma molécula de DNA que uma ou mais seqüências de DNA foram ligadas em uma maneira funcionalmente operativa. Tais constructos de DNA recombinantes ou vetores são capazes de introduzir uma seqüência reguladora 5' ou região promotora e uma seqüência de DNA para um produto de gene selecionado em uma célula em uma maneira tal que a seqüência de DNA é transcrita em um mRNA funcional que é traduzido e portanto expressado. Os constructos de DNA recombinantes ou vetores recombinantes podem ser construídos para serem capazes de expressar RNAs de anti-sentido, de modo a inibir a tradução de um RNA específico de interesse.The term recombinant DNA construct ”or recombinant vector refers to any agent such as a plasmid, cosmid, virus, autonomously replicating sequence, phage or nucleotide sequence of DNA or RNA of single or double-stranded linear or circular filament, derived from any source, capable of genomic integration or autonomous replication, comprising a DNA molecule that one or more DNA sequences have been linked in a functionally operative way. Such recombinant DNA constructs or vectors are capable of introducing a 5 'regulatory sequence or promoter region and a DNA sequence for a selected gene product in a cell in such a way that the DNA sequence is transcribed into a functional mRNA that is translated and therefore expressed. The recombinant DNA constructs or recombinant vectors can be constructed to be able to express antisense RNAs, in order to inhibit the translation of a specific RNA of interest.

Regeneração refere-se ao processo de cultivar uma planta a partir de uma célula vegetal (por exemplo, protoplasto vegetal ou explante).Regeneration refers to the process of growing a plant from a plant cell (for example, plant protoplasts or explants).

3_Ύ 3 _Ύ

Repórter refere-se a um gene e produto de gene correspondente que quando expressados em organismos transgênicos produzem um produto detectável pelos métodos químicos ou moleculares ou produzem um fenótipo observável.Reporter refers to a gene and corresponding gene product that when expressed in transgenic organisms produce a product detectable by chemical or molecular methods or produce an observable phenotype.

Resistência refere-se a uma enzima que é capaz de funcionar na presença de uma toxina, por exemplo, EPSP sintases de classe II resistentes ao glifosato. Uma enzima que tem resistência a uma toxina pode ter a função de destoxificar a toxina, por exemplo, a fosfinotricina acetiltransferase, a glifosato oxidorreductase ou pode ser uma enzima mutante tendo atividade catalítica que não é afetada por um herbicida que rompe a mesma atividade na enzima do tipo selvagem, por exemplo, a acetolactato sintase, EPSP sintases de classe I mutantes.Resistance refers to an enzyme that is capable of functioning in the presence of a toxin, for example, glyphosate resistant class II EPSP synthases. An enzyme that is resistant to a toxin may have the function of detoxifying the toxin, for example, phosphinothricin acetyltransferase, glyphosate oxidoreductase or it may be a mutant enzyme having catalytic activity that is not affected by a herbicide that disrupts the same activity in the enzyme wild-type, for example, acetolactate synthase, mutant class I EPSP synthases.

Enzima de restrição refere-se a uma enzima que reconhece uma seqüência palindrômica específica de nucleotídeos no DNA de filamento duplo e cliva ambos os filamentos; também chamada de uma endonuclease de restrição. A divagem tipicamente ocorre dentro do sítio de restrição.Restriction enzyme refers to an enzyme that recognizes a specific palindromic sequence of nucleotides in double-stranded DNA and cleaves both strands; also called a restriction endonuclease. Diving typically occurs within the restriction site.

Marcador selecionáve! refere-se a uma molécula de ácido polinucléico que codifica uma proteína, que confere um fenótipo que facilita a identificação de células contendo a molécula de ácido polinucléico. Marcadores selecionáveis incluem aqueles genes que conferem resistência aos antibióticos (por exemplo, ampicilina, canamicina), complementam uma deficiência nutricional (por exemplo, uracila, histidina, leucina) ou comunica uma característica que se distinga visualmente (por exemplo, mudanças de cor ou fluorescência). Os genes selecionável marcadores dominantes úteis incluem genes que codificam os genes de resistência a antibiótico (por exemplo, a neomicina fosfotransferase, aad); e os genes de resistência a herbicida (por exemplo, a fosfinotricina acetiltransferase, EPSP sintase de classe II, EPSP sintase de classe I modificada). Uma estratégia útil para a seleção de transformantes para a resistência a herbicida é descrita, por exemplo, em Vasil, Cell Culture e Somatic Cell Genetics of Plants, Vols. I-III, Laboratory Procedures e Their Applications Academic Press, Nova Iorque (1984).Selectable marker! refers to a polynucleic acid molecule that encodes a protein, which confers a phenotype that facilitates the identification of cells containing the polynucleic acid molecule. Selectable markers include those genes that confer resistance to antibiotics (eg, ampicillin, kanamycin), complement a nutritional deficiency (eg, uracil, histidine, leucine) or communicate a visually distinguishing characteristic (eg, color changes or fluorescence ). Selectable useful dominant marker genes include genes that encode antibiotic resistance genes (for example, neomycin phosphotransferase, aad); and herbicide resistance genes (for example, phosphinothricin acetyltransferase, class II EPSP synthase, modified class I EPSP synthase). A useful strategy for the selection of transformants for herbicide resistance is described, for example, in Vasil, Cell Culture and Somatic Cell Genetics of Plants, Vols. I-III, Laboratory Procedures and Their Applications Academic Press, New York (1984).

O termo específica para (uma seqüência alvo) indica que uma sonda de DNA ou iniciador de DNA hibridizam sob condições de hibridização dadas apenas com a seqüência alvo em um amostra que compreende a seqüência alvo.The specific term for (a target sequence) indicates that a DNA probe or DNA primer hybridizes under hybridization conditions given only to the target sequence in a sample that comprises the target sequence.

O termo substancialmente purificado, como aqui usado, referese a uma molécula separada de outras moléculas normalmente associadas com ela no seu estado nativo. Mais preferivelmente, uma molécula substancialmente purificada é a espécie predominante presente em uma preparação. Uma molécula substancialmente purificada pode ser maior do que 60 % livre, preferivelmente 75 % livre, mais preferivelmente 90 % livre das outras moléculas (exclusivo de solvente) presentes na mistura natural. O termo substancialmente purificado não é intencionado a abranger moléculas presentes no seu estado nativo.The term substantially purified, as used herein, refers to a molecule separate from other molecules normally associated with it in its native state. More preferably, a substantially purified molecule is the predominant species present in a preparation. A substantially purified molecule can be greater than 60% free, preferably 75% free, more preferably 90% free of the other molecules (solvent-only) present in the natural mixture. The term substantially purified is not intended to cover molecules present in their native state.

Tolerante ou tolerância refere-se a um efeito reduzido de um agente biótico ou abiótico sobre o cultivo e desenvolvimento de organismos e plantas, por exemplo, uma praga ou um herbicida.Tolerant or tolerance refers to a reduced effect of a biotic or abiotic agent on the cultivation and development of organisms and plants, for example, a pest or herbicide.

Transcrição refere-se ao processo de produzir uma cópia de RNA a partir de um DNA padrão.Transcription refers to the process of making a copy of RNA from standard DNA.

Transformação refere-se a um processo de introduzir uma molécula de ácido polinucléico exógeno (por exemplo, um constructo de DNA, uma molécula de ácido polinucléico recombinante) em uma célula ou protoplasto e que a molécula de ácido polinucléico exógena seja incorporada em um cromossoma ou seja capaz de replicação autônoma.Transformation refers to a process of introducing an exogenous polynucleic acid molecule (for example, a DNA construct, a recombinant polynucleic acid molecule) into a cell or protoplasty and for the exogenous polynucleic acid molecule to be incorporated into a chromosome or be capable of autonomous replication.

Transformado ou transgênico referem-se a uma célula, tecido, órgão ou organismo nos quais um ácido polinucléico estranho, tal como um vetor de DNA ou molécula de ácido polinucléico recombinante. Uma célula ou organismo transgênicos ou transformados também incluem a progênie da célula ou organismo e a progênie produzida a partir de um programa de procriação que utiliza uma tal planta transgênica como um precursor em um cruzamento e que exibe um fenótipo aumentado resultante da presença da molécula de ácido polinucléico estranha.Transformed or transgenic refers to a cell, tissue, organ or organism in which a foreign polynucleic acid, such as a DNA vector or recombinant polynucleic acid molecule. A transgenic or transformed cell or organism also includes the progeny of the cell or organism and the progeny produced from a breeding program that uses such a transgenic plant as a precursor at an intersection and that exhibits an increased phenotype resulting from the presence of the strange polynucleic acid.

O termo transgene refere-se a qualquer molécula de ácido po-The term transgene refers to any acidic molecule

?)5>?) 5>

linucléico não nativa para uma célula ou organismo transformados na célula ou organismo. Transgene também abrange as partes componentes de um gene vegetal nativo modificado pela inserção de uma molécula de ácido polinucléico não nativa pela recombinação direcionada ou mutação específica ao sítio.linucleic acid not native to a cell or organism transformed into the cell or organism. Transgene also encompasses the component parts of a native plant gene modified by the insertion of a non-native polynucleic acid molecule by targeted recombination or site-specific mutation.

Moléculas de peptídeo de trânsito ou peptídeo alvejador, estes termos no geral referem-se às moléculas de peptídeo que quando ligadas a uma proteína de interesse direcionam a proteína a um tecido, célula, localização subcelular ou organela celular particulares. Os exemplos incluem, mas não são limitados a, peptídeos de trânsito de cloroplasto, sinais de alvejamento nuclear e sinais vacuolares. O peptídeo de trânsito de cloroplasto é de utilidade particular na presente invenção para direcionar a expressão da enzima de EPSPS ao cloroplasto.Transit peptide or targeting peptide molecules, these terms generally refer to peptide molecules that when attached to a protein of interest direct the protein to a particular tissue, cell, subcellular location or cellular organelle. Examples include, but are not limited to, chloroplast transit peptides, nuclear targeting signals and vacuolar signals. The chloroplast transit peptide is of particular use in the present invention for directing the expression of the EPSPS enzyme to the chloroplast.

O termo tradução refere-se à produção do produto de gene correspondente, isto é, um peptídeo, polipeptídeo ou proteína a partir de um mRNA.The term translation refers to the production of the corresponding gene product, that is, a peptide, polypeptide or protein from an mRNA.

Vetor refere-se a um plasmídeo, cosmídeo, bacteriofago ou vírus que carrega DNA estranho em um organismo hospedeiro. PolinucleotídeosVector refers to a plasmid, cosmid, bacteriophage or virus that carries foreign DNA in a host organism. Polynucleotides

Os métodos da presente invenção incluem planejar genes que conferem um traço de interesse à planta na qual eles são introduzidos. Os transgenes de interesse agronômico que fornecem traços agronômicos benéficos para as plantas de safra, por exemplo, incluindo, mas não limitadas aos elementos genéticos que compreendem resistência à herbicida (Patente US N⁹ 5.633.435; Patente US N⁹ 5.463.175), rendimento aumentado (Patente US N⁹ 5.716.837), controle de inseto (Patente US N^s 6.063.597; Patente US N⁹ 6.063.756; Patente US N⁹ 6.093.695; Patente US N⁹ 5.942.664; Patente US N^s 6.110.464), resistência fúngica às doenças (Patente US N⁹5.516.671; Patente US N⁹ 5.773.696; Patente US N⁹ 6.121.436; e Patente US N⁹ 6.316.407 e Patente US N⁹ 6.506.962), resistência a vírus (Patente US N⁹ 5.304.730 e Patente US N⁹ 6.013.864), resistência à nematóide (Patente US N⁹ 6.228.992), resistência bacteriana às doenças (Patente US N⁹ • · · •The methods of the present invention include designing genes that confer a trait of interest to the plant into which they are introduced. Transgenes of agronomic interest that provide beneficial agronomic traits for crop plants, for example, including, but not limited to, genetic elements that comprise herbicide resistance (US Patent N ⁹ 5,633,435; US Patent N ⁹ 5,463,175), increased yield (U.S. Patent No. ⁹ 5,716,837), insect control (U.S. Patent ^No.'s 6,063,597; US Patent No. ⁹ 6,063,756; US Patent No. ⁹ 6,093,695; US Patent No. ⁹ 5,942,664; U.S. N ^s 6,110,464), fungal disease resistance (U.S. Patent No. 5,516,671 ^9; ⁹ US Patent 5,773,696; US Patent No. ⁹ 6,121,436, and US Patent 6,316,407 , and ^9th US Patent 6,506 ⁹ .962), virus resistance (US Patent N ⁹ 5,304,730 and US Patent N ⁹ 6,013,864), nematode resistance (US Patent N ⁹ 6,228,992), bacterial resistance to diseases (US Patent N ⁹ • · · •

• <• <

• · ·• · ·

5.516.671), produção de amido (Patente US N⁹ 5.750.876 e Patente US N⁹6.476.295), produção de óleo modificado (Patente US N° 6.444.876), produção de óleo alta (Patente US N^a 5.608.149 e Patente US N^s 6.476.295), teor de ácido graxo modificado (Patente US N⁹ 6.537.750), produção de proteína alta (Patente US N⁹ 6.380.466), maturação de fruta (Patente US N⁹5.512.466), nutrição animal e humana realçada (Patente US N^s 5.985.605 e Patente US N⁹ 6.171.640), biopolímeros (Patente US N⁹ 5.958.745 e Pedido de Patente US N⁹ US20030028917), resistência ao estresse ambiental (Patente US N⁹ 6.072.103), peptídeos farmacêuticos (Patente US N⁹ 6.080.560), traços de processamento melhorados (Patente US N⁹ 6.476.295), digestibilidade melhorada (Patente US N⁹ 6.531.648) rafinose baixa (Patente US N⁹6.166.292), produção de enzima industrial (Patente US N⁹ 5.543.576), sabor melhorado (Patente US N⁹ 6.011.199), fixação de nitrogênio (Patente US N⁹5.229.114), produção de semente híbrida (Patente US N⁹ 5.689.041) e produção de biocombustível (Patente US N⁹ 5.998.700), os elementos genéticos e transgenes descritos nas patentes listadas acima são aqui incorporados por referência.5,516,671), starch production (U.S. Patent No. 5,750,876 and U.S. ⁹ ⁹ No 6,476,295), modified oil production (U.S. Patent No. 6,444,876), high oil production (U.S. Patent No. 5,608 ^to .149 ^s and US Patent 6,476,295), modified fatty acid content (U.S. Patent No. ⁹ 6,537,750), high protein production (U.S. Patent No. ⁹ 6,380,466), fruit ripening (U.S. Patent No. 5,512 ⁹ .466), enhanced animal feed and human ^(s US Patent 5,985,605 and US Patent No. ⁹ 6,171,640), biopolymers (U.S. Patent No. 5,958,745 and ⁹ of US Patent Application US20030028917 ^9), resistance to environmental stress (US Patent No. ⁹ 6,072,103), pharmaceutical peptides (US Patent No. ⁹ 6,080,560), improved processing traits (US Patent No. ⁹ 6,476,295), improved digestibility (US Patent No. ⁹ 6,531,648) low raffinose ( US Patent No. ⁹ 6,166,292), production of industrial enzyme (US Patent No. ⁹ 5,543,576), improved taste (US Patent No. ⁹ 6,011,199), nitrogen fixation (US Patent No. ⁹ 5,229,114), production of hybrid seed (US Patent No. ⁹ 5,689,041) and biofuel production (US Patent No. ⁹ 5,998,700), the genetic elements and transgenes described in the patents listed above are hereby incorporated by reference.

Os herbicidas para os quais a tolerância da planta transgênica foi demonstrada e o método da presente invenção pode ser aplicado, incluem mas não são limitados aos: herbicidas de glifosato, glifosinato, sulfoniluréias, imidazolinonas, bromoxinila, delapon, cicloezanodiona, inibidores da protoporfiriongênio oxidase e isoxasflutol. As moléculas de polinucleotídeo que codificam proteínas envolvidas na tolerância a herbicida são conhecidas na técnica e incluem, mas não são limitadas a uma molécula de polinucleotídeo que codifica a 5-enolpiruvilshiquimato-3-fosfato sintase (EPSPS, descrita nas Patentes U.S. N— 5.627.061, 5.633.435. 6.040.497; Padgette et al. Herbicide Resistant Crops, Lewis Publishers, 53 a 85, 1996; e PenalozaVazquez, et al. Plant Cell Reports 14: 482 a 487, 1995; e aroA (Patente U.S. N⁹ 5.094.945) para a tolerância a glifosato; bromoxinil nitrilase (Bxn) para a tolerância a Bromoxinil (Patente U.S. N⁹ 4.810.648); fitoeno dessaturase (crtl, Misawa et al, (1993) Plant J. 4; 833 a 840 e (1994) Plant J. 6; 481 a 489); para a tolerância ao norflurazon, acetoidroxiácido sintase (AHAS, akaThe herbicides to which the tolerance of the transgenic plant has been demonstrated and the method of the present invention can be applied, include but are not limited to: glyphosate herbicides, glyphosinate, sulfonylureas, imidazolinones, bromoxynil, delapon, cycloananedione, protoporphyriongen oxidase inhibitors and isoxasflutol. Polynucleotide molecules that encode proteins involved in herbicide tolerance are known in the art and include, but are not limited to, a polynucleotide molecule that encodes 5-enolpyruvylshiquimato-3-phosphate synthase (EPSPS, described in US Patents No. 5,627. 061, 5,633,435. 6,040,497; Padgette et al. Herbicide Resistant Crops, Lewis Publishers, 53 to 85, 1996; and PenalozaVazquez, et al. Plant Cell Reports 14: 482 to 487, 1995; and aroA (US Patent No. ⁹ 5,094,945) for glyphosate tolerance; bromoxynil nitrilase (Bxn) for tolerance to bromoxynil (US Patent No. ⁹ 4,810,648); phytoene desaturase (crtl, Misawa et al, (1993) Plant J. 4; 833 a 840 and (1994) Plant J. 6; 481 to 489); for tolerance to norflurazon, acetohydroxy acid synthase (AHAS, aka

ALS, Sathasiivan et al. Nucl. Acids Res. 18: 2188 a 2193. 1990); e o gene bar para a tolerância ao glufosinato e bialafos (DeBlock, et al. EMBO J. 6: 2513 a 2519, 1987).ALS, Sathasiivan et al. Nucl. Acids Res. 18: 2188 to 2193. 1990); and the bar gene for tolerance to glufosinate and bialaphos (DeBlock, et al. EMBO J. 6: 2513 to 2519, 1987).

A tolerância a herbicida é um fenótipo desejável para plantas de safra. N-fosfonometilglicina, também conhecido como glifosato, é um herbicida bem conhecido que tem atividade em um amplo espectro de espécies vegetais. O glifosato é o ingrediente ativo de Roundup® (Monsanto Co.), um herbicida seguro tendo uma meia vida desejavelmente curta no ambiente. Quando aplicado a uma superfície da planta, o glifosato move-se sistemicamente através da planta. O glifosato é tóxico às plantas pela inibição do caminho do ácido chiquímico, que fornece um precursor para a síntese de aminoácidos aromáticos. Especificamente, o glifosato afeta a conversão de fosfoenolpiruvato e ácido 3-fosfochiquímico em ácido 5-enolpiruvil-3fosfochiquímico pela inibição da enzima 5-enolpiruvil-3-fosfochiquimato sintase (daqui em diante aludido como EPSP sintase ou EPSPS). Para os propósitos da presente invenção, o termo glifosato deve ser considerado incluir qualquer forma herbicidamente eficaz de N-fosfonometilglicina (incluindo qualquer sal desta) e outras formas que resultam na produção do ânion glifosato na planta.Herbicide tolerance is a desirable phenotype for crop plants. N-phosphonomethylglycine, also known as glyphosate, is a well-known herbicide that has activity in a wide spectrum of plant species. Glyphosate is the active ingredient in Roundup® (Monsanto Co.), a safe herbicide with a desirably short half-life in the environment. When applied to a plant surface, glyphosate moves systemically through the plant. Glyphosate is toxic to plants by inhibiting the pathway of shikimic acid, which provides a precursor for the synthesis of aromatic amino acids. Specifically, glyphosate affects the conversion of phosphoenolpyruvate and 3-phosphochiquimic acid to 5-enolpyruvyl-3-phosphochiquimic acid by inhibiting the enzyme 5-enolpyruvyl-3-phosphochiquime synthase (hereinafter referred to as EPSP synthase or EPSPS). For the purposes of the present invention, the term glyphosate should be considered to include any herbicidally effective form of N-phosphonomethylglycine (including any salt thereof) and other forms that result in the production of the glyphosate anion in the plant.

Através dos métodos de engendramento genético vegetal, é possível produzir plantas tolerantes ao glifosato inserindo-se no genoma da planta uma molécula de DNA que cause a produção de níveis mais altos de EPSPS do tipo selvagem (Shah et al., Science 233: 478 a 481, 1986). A tolerância ao glifosato também pode ser obtida pela expressão de variantes de EPSPS que tenham afinidade mais baixa ao glifosato e portanto retêm a sua atividade catalítica na presença de glifosato (Patente U.S. N- 5.633.435). As enzimas que degradam glifosato nos tecidos da planta (Patente U.S. N^Q5.463.175) também são capazes de conferir tolerância celular ao glifosato. Tais genes, portanto, permitem a produção de safras transgênicas que são tolerantes ao glifosato, permitindo deste modo que o glifosato seja usado para o controle eficaz de ervas daninhas com preocupações mínimas de dano à safra. Por exemplo, a tolerância a glifosato foi geneticamente engen• · · « · ·· · • « • « • · · drada no milho (Patente U.S. N^Q 5.554.798, 6.040.497), trigo (Zhou et al. Plant Cell Rep. 15: 159 a 163, 1995), soja (WO 9200377) e canola (WO 9204449).Through plant genetic engineering methods, it is possible to produce glyphosate-tolerant plants by inserting a DNA molecule into the plant's genome that causes the production of higher levels of wild-type EPSPS (Shah et al., Science 233: 478 a 481, 1986). Tolerance to glyphosate can also be achieved by expressing EPSPS variants that have lower affinity for glyphosate and therefore retain their catalytic activity in the presence of glyphosate (US Patent No. 5,633,435). Enzymes that degrade glyphosate in plant tissues (US Patent No. ^Q 5,463,175) are also capable of conferring cellular tolerance to glyphosate. Such genes, therefore, allow the production of transgenic crops that are tolerant to glyphosate, thereby allowing glyphosate to be used for effective weed control with minimal concerns of crop damage. For example, tolerance to glyphosate was genetically engineered • · · «· ·· · •« • «• · · drained on corn (US Patent No. ^Q 5,554,798, 6,040,497), wheat (Zhou et al. Rep. 15: 159 to 163, 1995), soy (WO 9200377) and canola (WO 9204449).

As variantes da enzima de EPSPS do tipo selvagem foram isoladas que são resistentes ao glifosato como um resultado de alterações na seqüência codificadora de aminoácido EPSPS (Kishore et al., Annu. Rev. Biochem. 57: 627 a 663, 1988; Schulz etal., Arch. Microbiol. 137: 121 a 123. 1984; Sost et al., FEBS Lett. 173: 238 a 241, 1984; Kishore et al., Em Biotechnology for Crop Protections ACS Symposium Series N° 379. eds. Hedlin et al., 37 a 48, 1988). Estas variantes tipicamente têm um Ki mais alto para o glifosato do que a enzima de EPSPS do tipo selvagem que confere o fenótipo tolerante a glifosato, mas estas variantes também são caracterizadas por um alto K_m para PEP que torna a enzima cineticamente menos eficiente. Por exemplo, a K_m aparente para PEP e o K, aparente para glifosato para a EPSPS nativa da E. coli são 10 μΜ e 0,5 μΜ enquanto para um isolado resistente ao glifosato tendo uma única substituição de aminoácido de uma alanina no lugar da glicina na posição 96 estes valores são 220 μΜ e 4,0 mM, respectivamente. A Patente US N^e 6.040.497 relata que a mutação conhecida como a mutação TIPS (uma substituição de isoleucina no lugar da treonina na posição de aminoácido 102 e uma substituição de serina no lugar de prolina na posição de aminoácido 106) compreende duas mutações que quando introduzidas na seqüência de polipeptídeo da EPSPS de Zea mays confere a resistência ao glifosato à enzima. As plantas transgênicas contendo esta enzima mutante são tolerantes ao glifosato. As mutações idênticas podem ser feitas nas enzimas EPSPS sensíveis ao glifosato de outras fontes para criar enzimas resistente ao glifosato.Variants of the wild-type EPSPS enzyme have been isolated that are resistant to glyphosate as a result of changes in the EPSPS amino acid coding sequence (Kishore et al., Annu. Rev. Biochem. 57: 627 to 663, 1988; Schulz etal. , Arch. Microbiol. 137: 121 to 123. 1984; Sost et al., FEBS Lett. 173: 238 to 241, 1984; Kishore et al., In Biotechnology for Crop Protections ACS Symposium Series No. 379. eds. Hedlin et al., 37 to 48, 1988). These variants typically have a higher Ki for glyphosate than the wild-type EPSPS enzyme that confers the glyphosate-tolerant phenotype, but these variants are also characterized by a high K _m for PEP that makes the enzyme kinetically less efficient. For example, the apparent K _m for PEP and the apparent K for glyphosate for E. coli native EPSPS are 10 μΜ and 0.5 μΜ while for a glyphosate resistant isolate having a single amino acid substitution of an alanine in place of glycine at position 96 these values are 220 μΜ and 4.0 mM, respectively. ^And US Patent No. 6,040,497 reports that the mutation known as the TIPS mutation (substitution of an isoleucine in place of threonine at amino acid position 102 and a serine substitution in place of proline at amino acid position 106) comprises two mutations when introduced in the EPSEA polypeptide sequence of Zea mays it confers resistance to the enzyme glyphosate. Transgenic plants containing this mutant enzyme are tolerant to glyphosate. Identical mutations can be made in glyphosate-sensitive EPSPS enzymes from other sources to create glyphosate-resistant enzymes.

Uma variedade de enzimas de EPSPS nativas e variantes foi expressada em plantas transgênicas de modo a conferir a tolerância a glifosato (Singh, et al., Em Biosynthesis e Molecular Regulation of Amino Acids in Plants, Amer Soc Plant Phys. Pubs., 1992). Os exemplos de algumas destas EPSPSs incluem aqueles descritos e/ou isolados de acordo com a Patente U.S. N⁵ 4.940.835. Patente U.S. N^Q 4.971.908, Patente U.S. N° • · · · · · »······«A variety of native and variant EPSPS enzymes have been expressed in transgenic plants in order to confer glyphosate tolerance (Singh, et al., In Biosynthesis and Molecular Regulation of Amino Acids in Plants, Amer Soc Plant Phys. Pubs., 1992) . Examples of some of these EPSPSs include those described and / or isolated according to US Patent No. ⁵ 4,940,835. ^Q US Patent 4,971,908, US Patent No. • · · · "······"

MdMd

5.145.783, Patente U.S. N^s 5.188.642, Patente U.S. N⁹ 5.310.667 e Patente U.S. N⁹ 5.312.910. Elas também podem ser derivadas de uma classe estruturalmente distinta de genes de EPSPS não homólogos, tais como os genes de EPSPS de classe II isolado de Agrobacterium sp. cepa CP4 como descrito na Patente U.S. N⁹ 5.633.435 e Patente U.S. N⁹ 5.627.061.5,145,783, US Patent No. ^s 5,188,642, US Patent 5,310,667 , and ^9th US Patent 5,312,910 ^9. They can also be derived from a structurally distinct class of non-homologous EPSPS genes, such as the class II EPSPS genes isolated from Agrobacterium sp. strain CP4 as described in US Patent 5,633,435 , and ^9th US Patent 5,627,061 ^9.

Peptídeos de trânsito de cloroplasto (CTPs) são engendrados para serem fundidos ao término N da EPSPS bacteriana para direcionar as enzimas resistentes ao glifosato no cloroplasto da planta. Na EPSPS vegetal nativa, as regiões de peptídeo de trânsito de cloroplasto estão contidas na seqüência codificadora nativa (por exemplo, CTP2, Klee et al., Mol. Gen. Genet. 210: 47 a 442, 1987, aqui incorporada por referência em sua totalidade). O CTP nativo pode ser substituído com um CTP heterólogo durante a construção de um cassete de expressão vegetal de transgene. Muitas proteínas localizadas no cloroplasto, incluindo a EPSPS, são expressadas a partir de genes nucleares como precursores e são alvejadas ao cloroplasto por um peptídeo de trânsito de cloroplasto (CTP) que é removido durante as etapas de importação. Os exemplos de outras de tais proteínas de cloroplasto incluem a subunidade pequena (SSU) de Ribulose-1,5-bisfosfato carboxilase, Ferredoxina, Ferredoxina oxidorreductase, a proteína I e a proteína II do complexo de colheita de luz e Tiorredoxina F. Foi demonstrado in vivo e in vitro que as proteínas que não de cloroplasto podem ser alvejados ao cloroplasto pelo uso de fusões de proteína com um CTP e que a seqüência de CTP é suficiente para alvejar uma proteína ao cloroplasto. A incorporação de um peptídeo de trânsito de cloroplasto adequado, tal como, o CTP de EPSPS da Arabidopsis thaliana (Klee et al., Mol. Gen. Genet. 210: 437 a 442 (1987) e a CTP de EPSPS da Petunia hybrida (della-Cioppa et al., Proc. Natl. Acad. Sei. USA 83: 6873 a 6877 (1986) foi mostrada alvejar as seqüências de proteína de EPSPS heterólogas aos cloroplastos nas plantas transgênicas. A produção de plantas tolerantes ao glifosato pela expressão de uma proteína de fusão que compreende um CTP de terminal amino com uma enzima de EPSPS resistente ao glifosato é bem conhecida por aqueles versados na técnica, (Patente U.S. N⁹ 5.627.061, Patente U.S. N^s 5.633.435.Chloroplast transit peptides (CTPs) are engineered to be fused to the N-terminus of bacterial EPSPS to target glyphosate-resistant enzymes in the plant's chloroplast. In native plant EPSPS, the chloroplast transit peptide regions are contained in the native coding sequence (for example, CTP2, Klee et al., Mol. Gen. Genet. 210: 47 to 442, 1987, hereby incorporated by reference in their totality). The native CTP can be replaced with a heterologous CTP during the construction of a transgene plant expression cassette. Many proteins located in the chloroplast, including EPSPS, are expressed from nuclear genes as precursors and are targeted to the chloroplast by a chloroplast transit peptide (CTP) that is removed during the import steps. Examples of other such chloroplast proteins include the small subunit (SSU) of Ribulose-1,5-bisphosphate carboxylase, Ferredoxin, Ferredoxin oxidoreductase, protein I and protein II of the light harvesting complex and Thioredoxin F. in vivo and in vitro that non-chloroplast proteins can be targeted to the chloroplast by using protein fusions with a CTP and that the CTP sequence is sufficient to target a protein to the chloroplast. The incorporation of a suitable chloroplast transit peptide, such as EPSPS CTP from Arabidopsis thaliana (Klee et al., Mol. Gen. Genet. 210: 437 to 442 (1987) and EPSPS CTP from Petunia hybrida ( della-Cioppa et al., Proc. Natl. Acad. Sci. USA 83: 6873 to 6877 (1986) has been shown to target the EPSPS protein sequences heterologous to chloroplasts in transgenic plants. The production of glyphosate-tolerant plants by the expression of a fusion protein comprising an amino-terminal CTP with a EPSPS glyphosate resistant enzyme is well known to those skilled in the art (US Patent No. ⁹ 5,627,061, US Patent No. ^s 5,633,435.

·· ······ ····

• · • · ·• · • · ·

Patente U.S. N^Q 5.312.910, EP 0218571, EP 189707, EP 508909 e EP 924299). Aqueles versados na técnica reconhecerão que vários constructos quiméricos podem ser feitas que utilizam a funcionalidade de um CTP particular para importar enzimas de EPSPS resistentes ao glifosato no cloroplasto de célula vegetal.US Patent No. ^Q 5,312,910, EP 0218571, EP 189707, EP 508909 and EP 924299). Those skilled in the art will recognize that various chimeric constructs can be made that use the functionality of a particular CTP to import glyphosate-resistant EPSPS enzymes into the plant cell chloroplast.

Modificação e mudanças podem ser feitas na estrutura dos polinucleotídeos da invenção e ainda obter-se uma molécula que codifique uma proteína ou peptídeo funcionais com características desejáveis. O seguinte é um método com base na substituição do(s) códon(s) de um primeiro polinucleotídeo para criar um polinucleotídeo artificial equivalente ou ainda um melhorado, de geração secundária, onde este novo polinucleotídeo artificial é útil nos métodos de implantação de gene transgênico e expressão realçada. É considerado que as substituições de códon no polinucleotídeo de geração secundária pode em certos casos resultar em pelo menos um aminoácido diferente daquele do primeiro polinucleotídeo. A substituição de aminoácido pode fornecer uma característica melhorada à proteína, por exemplo, uma EPSP sintase resistente ao glifosato ou ela pode ser uma mudança conservada que não afeta substancialmente as características da proteína. O método fornece um polinucleotídeo artificial criado pela retro tradução de uma seqüência de polipeptídeo em um polinucleotídeo usando uma tabela do uso de códon, seguido pelas etapas de realçar as características do polipeptídeo artificial que o torna particularmente útil em plantas transgênicas.Modification and changes can be made in the structure of the polynucleotides of the invention and still obtain a molecule that encodes a functional protein or peptide with desirable characteristics. The following is a method based on replacing the codon (s) of a first polynucleotide to create an equivalent or improved secondary generation polynucleotide, where this new artificial polynucleotide is useful in transgenic gene implantation methods and enhanced expression. It is considered that codon substitutions in the secondary generation polynucleotide may in certain cases result in at least one amino acid different from that of the first polynucleotide. Amino acid substitution can provide an improved protein characteristic, for example, a glyphosate resistant EPSP synthase or it can be a conserved change that does not substantially affect the characteristics of the protein. The method provides an artificial polynucleotide created by retro-translating a polypeptide sequence into a polynucleotide using a codon usage table, followed by steps to highlight the characteristics of the artificial polypeptide that makes it particularly useful in transgenic plants.

Em formas de modalidades particulares da invenção, os polipeptídeos modificados que codificam proteínas resistentes a herbicida são considerados serem úteis para pelo menos um dos seguintes: conferir tolerância a herbicida em uma planta transformada ou transgênica, para melhorar a expressão de genes de resistência a herbicida em plantas, para o uso como marcadores selecionáveis para a introdução de outros traços de interesse em uma planta e para impedir a recombinação com um gene vegetal endógeno similar ou transgene existente permitindo ainda a implantação de gene sem o silenciamento de gene.In forms of particular embodiments of the invention, modified polypeptides encoding herbicide-resistant proteins are considered to be useful for at least one of the following: confer herbicide tolerance in a transformed or transgenic plant, to improve the expression of herbicide resistance genes in plants, for use as selectable markers for introducing other traits of interest in a plant and to prevent recombination with a similar endogenous plant gene or existing transgene while still allowing gene implantation without gene silencing.

É conhecido que o código genético é degenerado. Os aminoáci37It is known that the genetic code is degenerate. The amino acids37

dos e seus códon(s) de RNA são listados abaixo na Tabela 1. TABELA 1. Aminoácidos e os códons de RNA que os codificam. Aminoácido Códonsand their RNA codon (s) are listed below in Table 1. TABLE 1. Amino acids and the RNA codons that encode them. Codon Amino Acid

Histidina; His; H Isoleucina; Ile; IHistidine; His; H Isoleucine; Ile; I

Nome completo; código de 3 letras; código de 1 letra Alanina; Ala; A GCA GCC GCG GCU Cisteína; Cys ; C UGC UGU Ácido aspártico; Asp; D GAC GAU Ácido glutâmico; Glu; E GAA GAG Fenilalanina; Phe; FUUC UUU Glicina; Gly; G GGA GGC GGG GGU CAC CAU AUA AUC AUUFull name; 3 letter code; 1 letter code Alanine; Allah; GCA GCC GCG GCU Cysteine; Cys; C UGC UGU Aspartic acid; Asp; D GAC GAU Glutamic acid; Glu; E GAA GAG Phenylalanine; Phe; FUUC UUU Glycine; Gly; G GGA GGC GGG GGU CAC CAU AUA AUC AUU

Lisina; Lys; KAAA AAGLysine; Lys; KAAA AAG

Leucina ; Leu ; L UUA UUG CUA CUC CUG CUULeucine; Read; L UUA UUG CUA CUC CUG CUU

Metionina; Met; M AUGMethionine; Met; M AUG

Asparagina; Asn; N AAC AAUAsparagine; Asn; N AAC AAU

Prolina; Pro; P CCA CCC CCG CCUProline; Pro; P CCA CCC CCG CCU

Glutamina; Gin; Q CAA CAGGlutamine; Gin; Q CAA CAG

AGA AGG CGA CGC CGG CGU AGC AGU UCA UCC UCG UCU ACA ACC ACG ACU GUA GUCGUGGUUAGA AGG CGA CGC CGG CGU AGC AGU UCA UCC UCG UCU ACA ACC ACG ACU GUA GUCGUGGUU

Arginina; Arg; R Serina; Ser; S Treonina; Thr; T Valina; Vai; VArginine; Arg; R Serina; To be; S Threonine; Thr; T Valina; Go; V

Triptofano; Trp; W UGGTryptophan; Trp; W UGG

Tirosina; Tyr; Y UAC UAUTyrosine; Tyr; Y UAC WOW

Os códons são descritos em termos de bases de RNA, por exemplo, adenina, uracila, guanina e citosina, é o mRNA que é diretamente traduzido em polipeptídeos. É entendido que quando do planejamento de um polinucleotídeo de DNA para o uso em um constructo, as bases de DNA podem ser substituídas, por exemplo, timína ao invés de uracila.Codons are described in terms of RNA bases, for example, adenine, uracil, guanine and cytosine, it is the mRNA that is directly translated into polypeptides. It is understood that when planning a DNA polynucleotide for use in a construct, the DNA bases can be replaced, for example, thymine instead of uracil.

É desejável fornecer plantas transgênicas que tenham fenótipos agronomicamente melhorados múltiplos. Freqüentemente a tolerância a herbicida é usada como um marcador selecionável para ajudar na produção deIt is desirable to provide transgenic plants that have multiple agronomically improved phenotypes. Herbicide tolerance is often used as a selectable marker to aid in the production of

plantas transgênicas que podem possuir genes adicionais de importância agronômica. A implantação dos transgenes pelos métodos de cruzamento tradicionais ou pela retransformação de uma primeira planta transgênica com um cassete de expressão vegetal adicional pode incluir a introdução de genes ou elementos genéticos que têm seqüência de polinucleotídeo idêntica ou quase idêntica. A progênie contendo estes genes implantados pode ser suscetível à perda da expressão de gene devido ao silenciamento de gene. O método da presente invenção fornece uma molécula de polinucleotídeo modificada que codifica uma proteína resistente a herbicida. As moléculas de polinucleotídeo são planejadas para serem suficientemente divergentes em seqüência de polinucleotídeo de outras moléculas de polinucleotídeo que codificam a mesma proteína de resistência a herbicida. Estas moléculas podem então coexistir na mesma célula vegetal sem a preocupação do silenciamento de gene.transgenic plants that may have additional genes of agronomic importance. The implantation of transgenes by traditional crossing methods or by the retransformation of a first transgenic plant with an additional plant expression cassette may include the introduction of genes or genetic elements that have identical or almost identical polynucleotide sequence. The progeny containing these implanted genes may be susceptible to loss of gene expression due to gene silencing. The method of the present invention provides a modified polynucleotide molecule that encodes an herbicide-resistant protein. Polynucleotide molecules are designed to be sufficiently divergent in polynucleotide sequence from other polynucleotide molecules that encode the same herbicide resistance protein. These molecules can then coexist in the same plant cell without concern for gene silencing.

A seqüência de polinucleotídeo divergente é criada usando-se uma tabela de uso de códon construída a partir de seqüências codificadoras conhecidas de várias espécies vegetais. Por exemplo, as tabelas de uso de códon para Arabidopsis thaliana, Zea mays e Glicina max podem ser usadas no método para planejar os polinucleotídeos da presente invenção. Outras tabelas de uso de códon de outras plantas também podem ser usadas por aqueles versados na técnica.The divergent polynucleotide sequence is created using a codon usage table constructed from known coding sequences of various plant species. For example, the codon usage tables for Arabidopsis thaliana, Zea mays and Glicina max can be used in the method for designing the polynucleotides of the present invention. Other codon usage tables from other plants can also be used by those skilled in the art.

A primeira etapa no método para planejar uma nova molécula de polinucleotídeo artificial que codifica uma proteína de tolerância a herbicida é o uso de uma tabela de uso de códon para determinar o uso de códon percentual em uma espécie vegetal para cada aminoácido da proteína de tolerância a herbicida, seguida pela substituição de pelo menos um de cada oito códons contíguos com um códon diferente selecionado da tabela de uso de códon e ajustando o uso de códon percentual para cada aminoácido codificado pelo polinucleotídeo até substancialmente o mesmo uso de códon percentual encontrado na tabela de uso de códon. As etapas adicionais podem incluir a introdução de um códon de parada translacional na segunda e terceira matriz de leitura aberta da nova seqüência de polinucleotídeo; elimi-The first step in the method for designing a new artificial polynucleotide molecule that encodes a herbicide-tolerant protein is the use of a codon usage table to determine the use of percent codon in a plant species for each amino acid in the tolerance protein. herbicide, followed by replacing at least one out of every eight contiguous codons with a different codon selected from the codon usage table and adjusting the percentage codon usage for each amino acid encoded by the polynucleotide to substantially the same percentage codon usage found in the codon use. Additional steps may include the introduction of a translational stop codon in the second and third open reading frames of the new polynucleotide sequence; eliminated

nando alguns códons de partida translacionais na segunda e na terceira matrizes de leitura aberta; ajustando a relação GC:AT local para cerca de 2:1 em uma faixa de cerca de 50 nucleotídeos; rompendo sinais de poliadenilação potenciais ou sítios de junção de intron potenciais; removendo pelo me5 nos um sítio de enzima de restrição de seis nucleotídeos contíguos ou mais; e comparando a identidade de seqüência do novo polinucleotídeo artificial com um polinucleotídeo existente que codifica a mesma ou proteína similar de modo que a identidade de seqüência entre os dois polinucleotídeos não seja mais do que 85 por cento.adding some translational start codons in the second and third open reading matrices; adjusting the local GC: AT ratio to about 2: 1 over a range of about 50 nucleotides; disrupting potential polyadenylation signals or potential intron junction sites; removing at least one restriction enzyme site of six contiguous or more nucleotides at least me5; and comparing the sequence identity of the new artificial polynucleotide with an existing polynucleotide that encodes the same or similar protein so that the sequence identity between the two polynucleotides is not more than 85 percent.

Uma retrotradução de uma seqüência de proteína para uma seqüência de nucleotídeo pode ser realizada usando uma tabela de uso de códon, tal como aquela encontrada no Genetics Computer Group (GCG) SeqLab ou outros programas de análise de DNA conhecidos por aqueles versados na técnica de análise de DNA ou como fornecido nas Tabelas 2, 3 e 4 da presente invenção. A tabela de uso de códon para Arabidopsis thaliana (Tabela 2), Zea mays (Tabela 3) e Glicina max (Tabela 4) são exemplos de tabelas que podem ser construídas para espécies vegetais, as tabelas de uso de códon também podem ser construídas que representam o uso de códon de monocotiledônea ou dicotiledônea.A back-translation of a protein sequence to a nucleotide sequence can be performed using a codon usage table, such as that found in the Genetics Computer Group (GCG) SeqLab or other DNA analysis programs known to those skilled in the analysis technique of DNA or as provided in Tables 2, 3 and 4 of the present invention. The codon usage table for Arabidopsis thaliana (Table 2), Zea mays (Table 3) and Glycine max (Table 4) are examples of tables that can be built for plant species, the codon use tables can also be built that represent the use of a monocot or dicot codon.

Tabela 2. Tabela de uso de códon de Arabidopsis thaliana.Table 2. Codon usage table for Arabidopsis thaliana.

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Gly Gly GGG GGG 188335,00 188335.00 10,18 10.18 0,16 0.16 Gly Gly GGA GGA 443469,00 443469.00 23,98 23.98 0,37 0.37 Gly Gly GGT GGT 409478,00 409478.00 22,14 22.14 0,34 0.34 Gly Gly GGC GGC 167099,00 167099.00 9,03 9.03 0,14 0.14 Glu Glu GAG GAG 596506,00 596506.00 32,25 32.25 0,48 0.48 Glu Glu GAA GAA 639579,00 639579.00 34,58 34.58 0,52 0.52 Asp Asp GAT GAT 683652,00 683652.00 36,96 36.96 0,68 0.68 Asp Asp GAC GAC 318211,00 318211.00 17,20 17.20 0,32 0.32 Vai Go GTG GTG 320636,00 320636.00 17,34 17.34 0,26 0.26

················

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Vai Go GTA GTA 185889,00 185889.00 10,05 10.05 0,15 0.15 Vai Go GTT GTT 505487,00 505487.00 27,33 27.33 0,41 0.41 Vai Go GTC GTC 235004,00 235004.00 12,71 12.71 0,19 0.19 Ala Allah GCG GCG 162272,00 162272.00 8,77 8.77 0,14 0.14 Ala Allah GCA GCA 323871,00 323871.00 17,51 17.51 0,27 0.27 Ala Allah GCT GCT 521181,00 521181.00 28,18 28.18 0,44 0.44 Ala Allah GCC GCC 189049,00 189049.00 10,22 10.22 0,16 0.16

Arg Arg AGG AGG 202204,00 202204.00 10,93 10.93 0,20 0.20 Arg Arg AGA AGA 348508,00 348508.00 18,84 18.84 0,35 0.35 Ser To be AGT AGT 260896,00 260896.00 14,11 14.11 0,16 0.16 Ser To be AGC AGC 206774,00 206774.00 11,18 11.18 0,13 0.13

Lys Lys AAG THE AG 605882,00 605882.00 32,76 32.76 0,51 0.51 Lys Lys AAA AAA 573121,00 573,121.00 30,99 30.99 0,49 0.49 Asn Asn AAT AAT 418805,00 418805.00 22,64 22.64 0,52 0.52 Asn Asn AAC AAC 385650,00 385650.00 20,85 20.85 0,48 0.48

Met Met ATG ATG 452482,00 452482.00 24,46 24.46 1,00 1.00 Ile Ile ATA ATA 235528,00 235528.00 12,73 12.73 0,24 0.24 Ile Ile ATT ATT 404070,00 404070.00 21,85 21.85 0,41 0.41 Ile Ile ATC ATC 341584,00 341584.00 18,47 18.47 0,35 0.35

Thr Thr ACG ACG 140880,00 140880.00 7,62 7.62 0,15 0.15 Thr Thr ACA A CA 291436,00 291436,00 15,76 15.76 0,31 0.31 Thr Thr ACT ACT 326366,00 326366.00 17,65 17.65 0,34 0.34 Thr Thr ACC ACC 190135,00 190135.00 10,28 10.28 0,20 0.20 Trp Trp TGG TGG 231618,00 231618.00 12,52 12.52 1,00 1.00 End End TGA TGA 19037,00 19037.00 1,03 1.03 0,43 0.43 Cys Cys TGT TGT 196601,00 196601.00 10,63 10.63 0,60 0.60 Cys Cys TGC TGC 131390,00 131390.00 7,10 7.10 0,40 0.40

··· • · ··· • · • · · · · • · · • · · • · · · · • · · • · · 999 9 9 9 999 9 9 9 « ·_# 99999·99 99 9 9 9 9 * * * .«· _# 99999 · 99 99 9 9 9 9 * * *. 9 9 9 9 • · · • · · • · 9 99 · • · 9 99 · 9 • · 9 • · • · · · 9 · · · • · · · 9 · · · ··· ·· ··· ·· • 9 • 9 ···*··· 9 · ··· * ··· 9 · * *

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction End End TAG TAG 9034,00 9034.00 0,49 0.49 0,20 0.20 End End TAA TAA 16317,00 16317.00 0,88 0.88 0,37 0.37 Tyr Tyr TAT TAT 276714,00 276714.00 14,96 14.96 0,52 0.52 Tyr Tyr TAC TAC 254890,00 254890.00 13,78 13.78 0,48 0.48 Leu Read TTG TTG 389368,00 389368.00 21,05 21.05 0,22 0.22 Leu Read TTA TTA 237547,00 237547.00 12,84 12.84 0,14 0.14 Phe Phe TTT TTT 410976,00 410976.00 22,22 22.22 0,52 0.52 Phe Phe TTC TTC 380505,00 380505.00 20,57 20.57 0,48 0.48

Ser To be TCG TCG 167804,00 167804.00 9,07 9.07 0,10 0.10 Ser To be TCA TCA 334881,00 334881.00 18,11 18.11 0,20 0.20 Ser To be TCT TCT 461774,00 461774.00 24,97 24.97 0,28 0.28 Ser To be TCC CBT 203174,00 203174.00 10,99 10.99 0,12 0.12

Arg Arg CGG CGG 88712,00 88712.00 4,80 4.80 0,09 0.09 Arg Arg CGA CGA 115857,00 115857.00 6,26 6.26 0,12 0.12 Arg Arg CGT CGT 165276,00 165276.00 8,94 8.94 0,17 0.17 Arg Arg CGC CGC 69006,00 69006.00 3,73 3.73 0,07 0.07

Gin Gin CAG CAG 280077,00 280077,00 15,14 15.14 0,44 0.44 Gin Gin CAA CAA 359922,00 359922.00 19,46 19.46 0,56 0.56 His His CAT CAT 256758,00 256758.00 13,88 13.88 0,62 0.62 His His CAC CAC 160485,00 160485.00 8,68 8.68 0,38 0.38 Leu Read CTG CTG 183128,00 183128.00 9,90 9.90 0,11 0.11 Leu Read CTA CTA 184587,00 184587.00 9,98 9.98 0,11 0.11 Leu Read CTT CTT 447606,00 447606.00 24,20 24.20 0,26 0.26 Leu Read CTC CTC 294275,00 294275.00 15,91 15.91 0,17 0.17 Pro Pro CCG CCG 155222,00 155222.00 8,39 8.39 0,17 0.17 Pro Pro CCA CCA 298880,00 298880.00 16,16 16.16 0,33 0.33

tsotso

··· • · ··· • · ··· ·· • · · ··· ·· • · · ···· ···· t ·« ········ ·· · · · · t · «········ ·· · · · · ·· ·· • · • · • · · • · · • • • · · · • · · · ··· ··· • · · • · · • · · • · · • • · • • · • · · · • · · · • · · · • · · · »·· · · »·· · · • · • · ·»·«··· · · · »·« ··· · · • * • *

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Pro Pro CCT CCT 342406,00 342406.00 18,51 18.51 0,38 0.38 Pro Pro CCC CCC 97639,00 97639.00 5,28 5.28 0,11 0.11

Tabela 3. Tabela de uso do códon de Zea maysTable 3. Zea mays codon usage table

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Gly Gly GGG GGG 8069,00 8069.00 15,19 15.19 0,21 0.21 Gly Gly GGA GGA 7100,00 7100.00 13,37 13.37 0,18 0.18 Gly Gly GGT GGT 7871,00 7871.00 14,82 14.82 0,20 0.20 Gly Gly GGC GGC 15904,00 15904.00 29,94 29.94 0,41 0.41

Glu Glu GAG GAG 22129,00 22129.00 41,67 41.67 0,68 0.68 Glu Glu GAA GAA 10298,00 10298.00 19,39 19.39 0,32 0.32 Asp Asp GAT GAT 11996,00 11996.00 22,59 22.59 0,41 0.41 Asp Asp GAC GAC 17045,00 17045.00 32,09 32.09 0,59 0.59

Vai Go GTG GTG 13873,00 13873.00 26,12 26.12 0,38 0.38 Vai Go GTA GTA 3230,00 3230.00 6,08 6.08 0,09 0.09 Vai Go GTT GTT 8261,00 8261.00 15,55 15.55 0,23 0.23 Vai Go GTC GTC 11330,00 11330.00 21,33 21.33 0,31 0.31

Ala Allah GCG GCG 11778,00 11778.00 22,18 22.18 0,24 0.24 Ala Allah GCA GCA 8640,00 8640.00 16,27 16.27 0,18 0.18 Ala Allah GCT GCT 11940,00 11940.00 22,48 22.48 0,24 0.24 Ala Allah GCC GCC 16768,00 16768.00 31,57 31.57 0,34 0.34 Arg Arg AGG AGG 7937,00 7937.00 14,94 14.94 0,27 0.27 Arg Arg AGA AGA 4356,00 4356.00 8,20 8.20 0,15 0.15 Ser To be AGT AGT 3877,00 3877.00 7,30 7.30 0,10 0.10 Ser To be AGC AGC 8653,00 8653.00 16,29 16.29 0,23 0.23 Lys Lys AAG THE AG 22367,00 22367.00 42,11 42.11 0,74 0.74 Lys Lys AAA AAA 7708,00 7708.00 14,51 14.51 0,26 0.26 Asn Asn AAT AAT 6997,00 6997.00 13,17 13.17 0,36 0.36

:··« : ·· « ··« • ·· « • ·♦ • · · ♦ • · ··· · • ··· · • • ·· ········ ·· · · · · • ·· ········ ·· · · · · ·· ·· • · • · • • • · • · • • • · · · • · · · ··· ··· • • • • • · • · • · • · • • · • • · • · · · • · · · • · · · • · · · ··· ··· • · • · ·· ·· ······· · · ······· · · • · • ·

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Asn Asn AAC AAC 12236,00 12236.00 23,04 23.04 0,64 0.64 Met Met ATG ATG 12841,00 12841.00 24,18 24.18 1,00 1.00 Ile Ile ATA ATA 3997,00 3997.00 7,53 7.53 0,16 0.16 Ile Ile ATT ATT 7457,00 7457.00 14,04 14.04 0,31 0.31 Ile Ile ATC ATC 12925,00 12925.00 24,34 24.34 0,53 0.53 Thr Thr ACG ACG 5665,00 5665.00 10,67 10.67 0,22 0.22 Thr Thr ACA A CA 5408,00 5408.00 10,18 10.18 0,21 0.21 Thr Thr ACT ACT 5774,00 5774.00 10,87 10.87 0,22 0.22 Thr Thr ACC ACC 9256,00 9256.00 17,43 17.43 0,35 0.35

Trp Trp TGG TGG 6695,00 6695.00 12,61 12.61 1,00 1.00 End End TGA TGA 591,00 591.00 1,11 1.11 0,45 0.45 Cys Cys TGT TGT 2762,00 2762.00 5,20 5.20 0,30 0.30 Cys Cys TGC TGC 6378,00 6378.00 12,01 12.01 0,70 0.70

End End TAG TAG 411,00 411.00 0,77 0.77 0,32 0.32 End End TAA TAA 299,00 299.00 0,56 0.56 0,23 0.23 Tyr Tyr TAT TAT 4822,00 4822.00 9,08 9.08 0,31 0.31 Tyr Tyr TAC TAC 10546,00 10546.00 19,86 19.86 0,69 0.69

Leu Read TTG TTG 6677,00 6677.00 12,57 12.57 0,14 0.14 Leu Read TTA TTA 2784,00 2784.00 5,24 5.24 0,06 0.06 Phe Phe TTT TTT 6316,00 6316.00 11,89 11.89 0,32 0.32 Phe Phe TTC TTC 13362,00 13362.00 25,16 25.16 0,68 0.68 Ser To be TCG TCG 5556,00 5556.00 10,46 10.46 0,14 0.14 Ser To be TCA TCA 5569,00 5569.00 10,49 10.49 0,15 0.15 Ser To be TCT TCT 6149,00 6149.00 11,58 11.58 0,16 0.16 Ser To be TCC CBT 8589,00 8589.00 16,17 16.17 0,22 0.22 Arg Arg CGG CGG 4746,00 4746.00 8,94 8.94 0,16 0.16

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Arg Arg CGA CGA 2195,00 2195.00 4,13 4.13 0,07 0.07 Arg Arg CGT CGT 3113,00 3113.00 5,86 5.86 0,10 0.10 Arg Arg CGC CGC 7374,00 7374.00 13,88 13.88 0,25 0.25 Gin Gin CAG CAG 13284,00 13284.00 25,01 25.01 0,64 0.64 Gin Gin CAA CAA 7632,00 7632.00 14,37 14.37 0,36 0.36 His His CAT CAT 5003,00 5003.00 9,42 9.42 0,39 0.39 His His CAC CAC 7669,00 7669.00 14,44 14.44 0,61 0.61

Leu Read CTG CTG 13327,00 13327.00 25,09 25.09 0,28 0.28 Leu Read CTA CTA 3785,00 3785.00 7,13 7.13 0,08 0.08 Leu Read CTT CTT 8238,00 8238.00 15,51 15.51 0,17 0.17 Leu Read CTC CTC 12942,00 12942.00 24,37 24.37 0,27 0.27

Pro Pro CCG CCG 8274,00 8274.00 15,58 15.58 0,27 0.27 Pro Pro CCA CCA 7845,00 7845.00 14,77 14.77 0,26 0.26 Pro Pro CCT CCT 7129,00 7129.00 13,42 13.42 0,23 0.23 Pro Pro CCC CCC 7364,00 7364.00 13,87 13.87 0,24 0.24

Tabela 4. Tabela de uso do códon de Glicina maxTable 4. Glycine max codon usage table

Aminoácido Codon Número /1000 FraçãoCodon Amino Acid Number / 1000 Fraction

Gly Gly GGG GGG 3097,00 3097.00 12,82 12.82 0,18 0.18 Gly Gly GGA GGA 5434,00 5434.00 22,49 22.49 0,32 0.32 Gly Gly GGT GGT 5248,00 5248.00 21,72 21.72 0,31 0.31 Gly Gly GGC GGC 3339,00 3339.00 13,82 13.82 0,20 0.20 Glu Glu GAG GAG 8296,00 8296.00 34,33 34.33 0,50 0.50 Glu Glu GAA GAA 8194,00 8194.00 33,91 33.91 0,50 0.50 Asp Asp GAT GAT 7955,00 7955.00 32,92 32.92 0,62 0.62 Asp Asp GAC GAC 4931,00 4931.00 20,40 20.40 0,38 0.38 Vai Go GTG GTG 5342,00 5342.00 22,11 22.11 0,32 0.32 Vai Go GTA GTA 1768,00 1768.00 7,32 7.32 0,11 0.11

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Vai Go GTT GTT 6455,00 6455.00 26,71 26.71 0,39 0.39 Vai Go GTC GTC 2971,00 2971.00 12,29 12.29 0,18 0.18 Ala Allah GCG GCG 1470,00 1470.00 6,08 6.08 0,08 0.08 Ala Allah GCA GCA 5421,00 5421.00 22,43 22.43 0,31 0.31 Ala Allah GCT GCT 6796,00 6796.00 28,12 28.12 0,38 0.38 Ala Allah GCC GCC 4042,00 4042.00 16,73 16.73 0,23 0.23

Arg Arg AGG AGG 3218,00 3218.00 13,32 13.32 0,28 0.28 Arg Arg AGA AGA 3459,00 3459.00 14,31 14.31 0,30 0.30 Ser To be AGT AGT 2935,00 2935.00 12,15 12.15 0,17 0.17 Ser To be AGC AGC 2640,00 2640.00 10,92 10.92 0,15 0.15

Lys Lys AAG THE AG 9052,00 9052.00 37,46 37.46 0,59 0.59 Lys Lys AAA AAA 6370,00 6370.00 26,36 26.36 0,41 0.41 Asn Asn AAT AAT 5132,00 5132.00 21,24 21.24 0,48 0.48 Asn Asn AAC AAC 5524,00 5524.00 22,86 22.86 0,52 0.52

Met Met ATG ATG 5404,00 5404.00 22,36 22.36 1,00 1.00 Ile Ile ATA ATA 3086,00 3086.00 12,77 12.77 0,23 0.23 Ile Ile ATT ATT 6275,00 6275.00 25,97 25.97 0,47 0.47 Ile Ile ATC ATC 3981,00 3981.00 16,47 16.47 0,30 0.30

Thr Thr ACG ACG 1006,00 1006.00 4,16 4.16 0,08 0.08 Thr Thr ACA A CA 3601,00 3601.00 14,90 14.90 0,29 0.29 Thr Thr ACT ACT 4231,00 4231.00 17,51 17.51 0,34 0.34 Thr Thr ACC ACC 3562,00 3562.00 14,74 14.74 0,29 0.29 Trp Trp TGG TGG 2866,00 2866.00 11,86 11.86 1,00 1.00 End End TGA TGA 221,00 221.00 0,91 0.91 0,36 0.36 Cys Cys TGT TGT 1748,00 1748.00 7,23 7.23 0,49 0.49 Cys Cys TGC TGC 1821,00 1821.00 7,54 7.54 0,51 0.51

ϋΜϋΜ

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction End End TAG TAG 143,00 143.00 0,59 0.59 0,23 0.23 End End TAA TAA 256,00 256.00 1,06 1.06 0,41 0.41 Tyr Tyr TAT TAT 3808,00 3808.00 15,76 15.76 0,51 0.51 Tyr Tyr TAC TAC 3667,00 3667.00 15,17 15.17 0,49 0.49 Leu Read TTG TTG 5343,00 5343.00 22,11 22.11 0,24 0.24 Leu Read TTA TTA 2030,00 2030.00 8,40 8.40 0,09 0.09 Phe Phe TTT TTT 4964,00 4964.00 20,54 20.54 0,49 0.49 Phe Phe TTC TTC 5067,00 5067.00 20,97 20.97 0,51 0.51

Ser To be TCG TCG 1107,00 1107.00 4,58 4.58 0,06 0.06 Ser To be TCA TCA 3590,00 3590.00 14,86 14.86 0,21 0.21 Ser To be TCT TCT 4238,00 4238.00 17,54 17.54 0,24 0.24 Ser To be TCC CBT 2949,00 2949.00 12,20 12.20 0,17 0.17

Arg Arg CGG CGG 683,00 683.00 2,83 2.83 0,06 0.06 Arg Arg CGA CGA 964,00 964.00 3,99 3.99 0,08 0.08 Arg Arg CGT CGT 1697,00 1697.00 7,02 7.02 0,15 0.15 Arg Arg CGC CGC 1538,00 1538.00 6,36 6.36 0,13 0.13

Gin Gin CAG CAG 4147,00 4147.00 17,16 17.16 0,46 0.46 Gin Gin CAA CAA 4964,00 4964.00 20,54 20.54 0,54 0.54 His His CAT CAT 3254,00 3254.00 13,47 13.47 0,55 0.55 His His CAC CAC 2630,00 2630.00 10,88 10.88 0,45 0.45 Leu Read CTG CTG 2900,00 2900.00 12,00 12.00 0,13 0.13 Leu Read CTA CTA 1962,00 1962.00 8,12 8.12 0,09 0.09 Leu Read CTT CTT 5676,00 5676.00 23,49 23.49 0,26 0.26 Leu Read CTC CTC 4053,00 4053.00 16,77 16.77 0,18 0.18 Pro Pro CCG CCG 1022,00 1022.00 4,23 4.23 0,08 0.08 Pro Pro CCA CCA 4875,00 4875.00 20,17 20.17 0,37 0.37 Pro Pro CCT CCT 4794,00 4794.00 19,84 19.84 0,36 0.36

• · · · · • · ·• · · · · · · ·

Aminoácido Amino Acid Codon Codon Número Number /1000 / 1000 Fração Fraction Pro Pro CCC CCC 2445,00 2445.00 10,12 10.12 0,19 0.19

As tabelas de uso de códon são bem conhecidas na técnica e podem ser encontradas nos bancos de dados de gene por exemplo, banco de dados do Genbank. O Banco de Dados do Uso de Códon é uma versão WWW extendida de CUTG (Uso de Códon Tabulado do Genbank). A fre5 qüência de uso de códon em cada organismo é feito pesquisável através deste local da World Wide Web (Nakamura et al. Nucleic Acids Res. 28: 292, 2000).Codon usage tables are well known in the art and can be found in gene databases, for example, Genbank database. The Codon Use Database is an extended WWW version of CUTG (Tabulated Codon Use by Genbank). The frequency of codon use in each organism is made searchable through this site on the World Wide Web (Nakamura et al. Nucleic Acids Res. 28: 292, 2000).

Em várias formas de realização da invenção, as etapas podem ser realizadas em qualquer ordem ou simultaneamente. Qualquer ou todas as etapas podem ser realizadas no planejamento de um polinucleotídeo artificial da invenção. Cada etapa é descrita em detalhes abaixo.In various embodiments of the invention, the steps can be carried out in any order or simultaneously. Any or all of the steps can be performed in the design of an artificial polynucleotide of the invention. Each step is described in detail below.

Os códons diferentes para um aminoácido particular deve ser distribuído por todo o polinucleotídeo com base no uso de códon percentual aproximado para espécies particulares a partir de uma tabela de uso de có15 don. Cachos locais de códons idênticos devem ser evitados. Pelo menos um códon é substituído para cada oito códons contíguos para fornecer divergência suficiente de seqüências de polinucleotídeo que codificam proteínas idênticas ou similares. Exceto onde especificamente desejado, por exemplo, para fornecer uma enzima tolerante a herbicida, a proteína codificada per20 manece inalterada pela substituição de um códon no lugar de um outro códon que é traduzido ao mesmo aminoácido como listado na Tabela 1.The different codons for a particular amino acid must be distributed throughout the polynucleotide based on the use of approximate percent codon for particular species from a codon use table. Local clusters of identical codons should be avoided. At least one codon is replaced for every eight contiguous codons to provide sufficient divergence of polynucleotide sequences that encode identical or similar proteins. Except where specifically desired, for example, to provide an herbicide-tolerant enzyme, the encoded protein remains unchanged by replacing one codon in place of another codon that is translated to the same amino acid as listed in Table 1.

Em formas de realização da presente invenção, correções são feitas para a relação GC:AT local de um polinucleotídeo ajustando-se a relação GC:AT local para ser aproximadamente a mesma relação como o poli25 nucleotídeo de tamanho natural, mas não maior do que 2X em uma faixa de cerca de 50 nucleotídeos contíguos da molécula de polinucleotídeo. A faixa de relações GC:AT de um polinucleotídeo usando as tabelas de uso de códon de plantas dicotiledôneas deve ser de cerca de 0,9 a cerca de 1,3 e para as plantas monocotiledôneas de cerca de 1,2 a cerca de 1,7. A relaçãoIn embodiments of the present invention, corrections are made to the local GC: AT ratio of a polynucleotide by adjusting the local GC: AT ratio to be approximately the same ratio as the life-size poly25 nucleotide, but no greater than 2X in a range of about 50 contiguous nucleotides of the polynucleotide molecule. The range of GC: AT ratios of a polynucleotide using the codon use tables of dicotyledonous plants should be from about 0.9 to about 1.3 and for monocotyledonous plants from about 1.2 to about 1, 7. The relationship

GC:AT local pode ser importante na manutenção da estrutura secundária apropriada de RNA. As regiões que compreendem muitas bases A+T ou bases G+C consecutivas são prognosticadas ter uma probabilidade maior para formar estruturas de grampo de cabelo devido à autocomplementaridade. Portanto, a substituição com um códon diferente pode reduzir a probabilidade de formação de estrutura secundária autocomplementar, que é conhecida reduzir a transcrição e/ou tradução em alguns organismos. Na maioria dos casos, os efeitos adversos podem ser minimizados usando-se moléculas de polinucleotídeo que não contenham mais do que cinco A+T ou G+C consecutivos. O comprimento máximo da trilha GC local (sem qualquer nucleotídeo AT) não deve ser maior do que 10 nucleotídeos. Portanto os códons que codificam proteínas ricas em Gly, Ala, Arg, Ser e Pro podem ser substituídos para impedir cachos longos de nucleotídeos GC. Os códons ricos em GC listados podem ser usados em combinação com os códons ricos em AT para os aminoácidos Lys, Asn, Ile, Tyr, Leu, Phe e vice versa para corrigir a relação GC:AT local.GC: Local TA can be important in maintaining the appropriate secondary structure of RNA. Regions that comprise many consecutive A + T bases or G + C bases are predicted to be more likely to form hairpin structures due to self-complementarity. Therefore, substitution with a different codon can reduce the likelihood of forming a self-complementary secondary structure, which is known to reduce transcription and / or translation in some organisms. In most cases, adverse effects can be minimized by using polynucleotide molecules that contain no more than five consecutive A + T or G + C. The maximum length of the local GC track (without any AT nucleotide) should not be greater than 10 nucleotides. Therefore the codons encoding proteins rich in Gly, Ala, Arg, Ser and Pro can be replaced to prevent long clusters of GC nucleotides. The listed GC-rich codons can be used in combination with the AT-rich codons for the amino acids Lys, Asn, Ile, Tyr, Leu, Phe and vice versa to correct the local GC: AT ratio.

Uma checagem da identidade de seqüência usando ferramentas de alinhamento de seqüência de nucleotídeo tais como o programa GAP (GCG, Madison, Wl) pode ser feita imediatamente depois da retrotradução para garantir que a seqüência gerada tenha grau apropriado de diversidade de seqüência. A seqüência de polinucleotídeo contígua maior do que 23 nucleotídeos que têm cem por cento de identidade de seqüência deve ser eliminado realizando-se substituições de códon nestes comprimentos de seqüência.A sequence identity check using nucleotide sequence alignment tools such as the GAP program (GCG, Madison, Wl) can be done immediately after back-translation to ensure that the generated sequence has an appropriate degree of sequence diversity. The contiguous polynucleotide sequence greater than 23 nucleotides that have one hundred percent sequence identity must be eliminated by performing codon substitutions at these sequence lengths.

Os códons de partida translacionais (ATG do DNA, AUG no mRNA) presentes na segunda matriz de leitura (matriz b), a terceira matriz de leitura (matriz ”c”) e as matrizes de leitura reversa (matriz d, e, f). O segundo e terceiro códons de início de matriz podem iniciar a tradução, entretanto muito menos eficientemente do que o primeiro. Portanto, se um ou dois AUG são encontrados próximos à extremidade 5' de uma molécula de mRNA reside na matriz b ou c pode ser benéfico eliminá-los em uma região de polinucleotídeo que contenha pelo menos os primeiros três códons yThe translational starting codons (DNA ATG, AUG in mRNA) present in the second reading matrix (matrix b), the third reading matrix (matrix "c") and the reverse reading matrices (matrix d, e, f) . The second and third matrix start codons can initiate translation, however much less efficiently than the first. Therefore, if one or two AUGs are found near the 5 'end of an mRNA molecule that resides in matrix b or c, it may be beneficial to eliminate them in a polynucleotide region that contains at least the first three y codons.

Met na matriz a. Também, se a seqüência de proteína não tem mais do que um Met na matriz a, então eliminar tanto quanto possível das matrizes avançadas b ou c. Para realizar isto, por exemplo, os códons para aminoácidos, Asp, Asn, Tyr, His na proteína de interesse seguido por qualquer um dos aminoácidos: Gly, Glu, Asp, Vai ou Ala, pode ser substituído para eliminar um códon de partida na segunda matriz. A seqüência GATGGG codifica os aminoácidos Asp-Gly e forma um ATG na matriz de leitura b. Quando a seqüência é modificada para GACGGG, o início ATG é eliminado e a seqüência ainda codifica Asp-Gly. Uma estratégia similar é usada para eliminar códons de partida na matriz de leitura c. A combinação de um aminoácido selecionado do grupo de Gly, Glu, Vai, Ala, Arg, Lys, Ile, Thr, Cys, Tyr, Leu, Ser, His ou Pro seguido por Trp pode resultar na formação no ATG na terceira matriz de leitura do gene. Nesta situação, o primeiro códon pode ser mudado para ter um nucleotídeo outro que não A na terceira posição.Met in the matrix a. Also, if the protein sequence has no more than one Met in matrix a, then eliminate as much of the advanced matrices b or c as possible. To accomplish this, for example, the codons for amino acids, Asp, Asn, Tyr, His in the protein of interest followed by any of the amino acids: Gly, Glu, Asp, Vai or Ala, can be replaced to eliminate a starting codon in the second matrix. The GATGGG sequence encodes the Asp-Gly amino acids and forms an ATG in the reading matrix b. When the sequence is changed to GACGGG, the beginning ATG is eliminated and the sequence still encodes Asp-Gly. A similar strategy is used to eliminate start codons in the reading matrix c. The combination of an amino acid selected from the group of Gly, Glu, Vai, Ala, Arg, Lys, Ile, Thr, Cys, Tyr, Leu, Ser, His or Pro followed by Trp can result in ATG formation in the third reading matrix of the gene. In this situation, the first codon can be changed to have a nucleotide other than A in the third position.

A eliminação do códon de ATG no filamento de DNA complementar do gene nas matrizes alternadas (d, e, e/ou f) sem mudar a seqüência de aminoácido da proteína pode ser realizada em uma maneira similar. Esta modificação reduz a probabilidade de tradução mesmo se o transgene é integrado em um genoma vegetal em uma orientação que permite a transcrição do mRNA de complemento reverso de um promotor vegetal nativo. A tradução de qualquer matriz de leitura reversa pode ser minimizada pela introdução de um códon de parada em todas as três matrizes de leitura reversa como descrito abaixo.The elimination of the ATG codon in the complementary DNA strand of the gene in the alternating matrices (d, e, and / or f) without changing the protein's amino acid sequence can be accomplished in a similar way. This modification reduces the likelihood of translation even if the transgene is integrated into a plant genome in an orientation that allows for the transcription of the reverse complement mRNA of a native plant promoter. The translation of any reverse reading matrix can be minimized by introducing a stop codon in all three reverse reading matrices as described below.

A criação de códons de parada para todas as três matrizes do DNA complementar pode ser realizada como segue. Os códons Leu (TTA e CTA) e Ser (TCA) produzem três códons de parada diferentes no filamento de complemento reverso. Se aqueles aminoácidos podem ser encontrados no término C da proteína de interesse, seus códons podem ser usados para gerar paradas no filamento complementar na matriz de leitura d. Para gerar um códon de parada na matriz de leitura e de filamento complementar, encontrar os aminoácidos Ala, Arg, Asn, Asp, Cys, Gly, His, Ile, Leu Phe, Pro,The creation of stop codons for all three complementary DNA arrays can be performed as follows. Codons Leu (TTA and CTA) and Ser (TCA) produce three different stop codons in the reverse complement filament. If those amino acids can be found at the C-terminus of the protein of interest, their codons can be used to generate stops in the complementary filament in the reading matrix d. To generate a stop codon in the reading and complementary filament matrix, find the amino acids Ala, Arg, Asn, Asp, Cys, Gly, His, Ile, Leu Phe, Pro,

Ser, Thr, Tyr ou Vai seguidos pelos aminoácidos Gin, His ou Tyr da proteína de interesse. Por exemplo, a seqüência de polinucleotídeo de GCCCAC que codifica os aminoácidos Ala-His pode ser modificada para GCTCAC. A seqüência complementar, GTGAGC, terá agora o códon de parada TGA mostrado em itálicos. Quando a proteína de interesse tem um Ala, Ile, Leu, Phe, Pro, Ser, Thr ou Vai seguido por um Arg, Asn, Ile, Lys, Met ou Ser a matriz de leitura no filamento complementar pode ser modificada para ter um códon de parada na matriz de leitura ”f' do filamento complementar. A seqüência de polinucleotídeo ATATCT para Ile e Ser pode ser modificada para ATCAGT para gerar códon de parada no filamento complementar como mostrado em itálicos, ACTGAT. A combinação de códons para Phe seguido por qualquer um dos códons para os aminoácidos Asn, Ile, Lys, Met ou Thr sempre gerará códon de parada na matriz do filamento complementar e ou llfltSer, Thr, Tyr or Vai followed by the amino acids Gin, His or Tyr of the protein of interest. For example, the GCCCAC polynucleotide sequence that encodes the Ala-His amino acids can be modified to GCTCAC. The complementary sequence, GTGAGC, will now have the TGA stop codon shown in italics. When the protein of interest has an Ala, Ile, Leu, Phe, Pro, Ser, Thr or Vai followed by an Arg, Asn, Ile, Lys, Met or Ser the reading matrix in the complementary filament can be modified to have a codon stop in the reading matrix ”f 'of the complementary filament. The ATATCT polynucleotide sequence for Ile and Ser can be modified for ATCAGT to generate stop codon in the complementary filament as shown in italics, ACTGAT. The combination of codons for Phe followed by any of the codons for the amino acids Asn, Ile, Lys, Met or Thr will always generate stop codon in the matrix of the complementary filament and or llflt

Para criar um códon de parada na matriz de leitura avançada b, a matriz de leitura a deve terminar nos nucleotídeos TA ou TG. A pesquisa da proteína de interesse quanto aos aminoácidos Ile, Leu, Met ou Vai em combinação com qualquer um dos seguintes aminoácidos: Ala, Arg, Asn Asp, Glu, Gly, Ile, Lys, Met, Ser, Thr ou Vai. Por exemplo, se a seqüência de polinucleotídeo que codifica os aminoácidos Met-Ser é ATGTCT, ela pode ser modificada para ATGAGT para produzir um códon de parada TGA na segunda matriz de leitura.To create a stop codon in the advanced reading matrix b, the reading matrix a must end in nucleotides TA or TG. The search for the protein of interest for the amino acids Ile, Leu, Met or Vai in combination with any of the following amino acids: Ala, Arg, Asn Asp, Glu, Gly, Ile, Lys, Met, Ser, Thr or Vai. For example, if the polynucleotide sequence that encodes the Met-Ser amino acids is ATGTCT, it can be modified to ATGAGT to produce a TGA stop codon in the second reading matrix.

Para ser capaz de criar um códon de parada para a matriz de leitura c, a matriz de leitura a deve ter o nucleotídeo T na terceira posição e o códon seguinte deve começar de AA, AG ou GA. Para encontrar códons adequados para modificar, procurar a proteína de interesse para qualquer um dos aminoácidos: Ala, Asn, Asp, Arg, Cys, Gly, His, Ile, Leu Phe, Pro, Ser, Thr, Tyr ou Vai seguidos por qualquer um dos seguintes aminoácidos: Arg, Asn, Asp, Glu, Lys ou Ser. Por exemplo, se a seqüência de nucleotídeo para os aminoácidos Gly-Glu é GGAGAG, a seqüência pode ser modificada to GGTGAG para criar um códon de parada TGA na terceira matriz de leitura.To be able to create a stop codon for reading matrix c, reading matrix a must have nucleotide T in the third position and the next codon must start from AA, AG or GA. To find suitable codons to modify, look for the protein of interest for any of the amino acids: Ala, Asn, Asp, Arg, Cys, Gly, His, Ile, Leu Phe, Pro, Ser, Thr, Tyr or Vai followed by anyone of the following amino acids: Arg, Asn, Asp, Glu, Lys or Ser. For example, if the nucleotide sequence for the amino acids Gly-Glu is GGAGAG, the sequence can be modified to GGTGAG to create a TGA stop codon in the third matrix of reading.

Uma outra modificação útil nos métodos de planejamento de polinucleotídeo artificial da presente invenção é eliminar sítios de restrição não desejados e outros padrões de seqüência específicos. Os sítios de restrição podem interferir com a clonagem e as manipulações de gene futuras. Por exemplo, alguns sítios de restrição habitualmente usados na clonagem de gene incluem, mas não são limitados, às enzimas de restrição do Tipo il com 6 ou mais bases que não de N listadas na Tabela 5 abaixo que é um excerto do banco de dados de endonuclease de restrição da New England Biolabs, Inc. (Beverly, MA, USA). A procura quanto aos sítios de reconhecimento de enzima de restrição pode ser feita usando a aplicação da função Map encontrada no GCG SeqLab ou uma aplicação similar contida em outros programas de análise de DNA conhecidos por aqueles versados na técnica de análise de DNA. As enzimas de restrição também podem ser adicionadas à seqüência para facilitar a clonagem. Por exemplo, o sítio de restrição Clal é colocado em CP4EPSPS versão AT (SEQ ID NO: 17) e ZM (SEQ ID NO: 18) para gerar seqüências recombinantes pela mudança de fragmento e para facilitar a síntese de gene usando fragmentos de nucleotídeo que podem ser montados no gene inteiro. A seqüência de polinucleotídeo CTP2 de peptídeo de trânsito (SEQ ID NO: 12) é conectado com CP4EPSPS pelo sítio de restrição Sphl para facilitar a substituição de CTP2 com versões de nucleotídeo diferentes de CTP2 (SEQ ID NO: 13, SEQ ID NO: 14) ou polinucleotídeos que codificam peptídeos de trânsito de cloroplasto diferentes. Por exemplo, na EPSPS de arroz, o sítio de restrição NgaMIV é preservado em tomo da posição de nucleotídeo 205 em todas as versões artificiais para facilitar a mudança da região codificadora do peptídeo de trânsito de cloroplasto. Também, para a EPSPS da soja a seqüência de polinucleotídeo para o peptídeo de trânsito de cloroplasto é separada do peptídeo maduro pelo sítio de restrição para Sacll endonuclease.Another useful modification in the artificial polynucleotide design methods of the present invention is to eliminate unwanted restriction sites and other specific sequence patterns. The restriction sites can interfere with cloning and future gene manipulations. For example, some restriction sites commonly used in gene cloning include, but are not limited to, Type I restriction enzymes with 6 or more non-N bases listed in Table 5 below which is an excerpt from the restriction endonuclease from New England Biolabs, Inc. (Beverly, MA, USA). The search for restriction enzyme recognition sites can be done using the application of the Map function found in the GCG SeqLab or a similar application contained in other DNA analysis programs known to those skilled in the DNA analysis technique. Restriction enzymes can also be added to the sequence to facilitate cloning. For example, the Clal restriction site is placed in CP4EPSPS version AT (SEQ ID NO: 17) and ZM (SEQ ID NO: 18) to generate recombinant sequences by fragment change and to facilitate gene synthesis using nucleotide fragments that can be assembled into the entire gene. The transit peptide CTP2 polynucleotide sequence (SEQ ID NO: 12) is connected with CP4EPSPS by the Sphl restriction site to facilitate replacement of CTP2 with nucleotide versions other than CTP2 (SEQ ID NO: 13, SEQ ID NO: 14 ) or polynucleotides that encode different chloroplast transit peptides. For example, in rice EPSPS, the NgaMIV restriction site is preserved around the position of nucleotide 205 in all artificial versions to facilitate the change of the chloroplast transit peptide coding region. Also, for soybean EPSPS, the polynucleotide sequence for the chloroplast transit peptide is separated from the mature peptide by the restriction site for Sacll endonuclease.

É entendido que a modificação dos sítios de endonuclease de restrição não é requerida, mas é útil para outras manipulações das moléculas de DNA. A Tabela 5 fornece uma lista de endonucleases de restrição, aquelas de interesse particular para a presente invenção são marcadas comIt is understood that modification of restriction endonuclease sites is not required, but is useful for other manipulations of DNA molecules. Table 5 provides a list of restriction endonucleases, those of particular interest to the present invention are marked with

um asterisco. Outros sítios de endonuclease de restrição desejáveis para a eliminação ou adição a um polinucleotídeo artificial da presente invenção estarão evidentes àqueles versados na técnica e não são limitadas por aquelas listadas na Tabela 5.an asterisk. Other restriction endonuclease sites desirable for elimination or addition to an artificial polynucleotide of the present invention will be evident to those skilled in the art and are not limited by those listed in Table 5.

Tabela 5. Seqüências de reconhecimento de enzimas de restriçãoTable 5. Restriction enzyme recognition sequences

Enzimas Enzymes Seqüência de reconhet Recognition string Aatll Aatll G_ACGT^ACG_ACGT ^A C *Accl * Accl GT^AMK_ACGT ^A MK_AC Acc65l Acc65l G^AGTAC_CG ^A GTAC_C Acell Acell G_CTAG^AC-G_CTAG ^A C- Acll Acll AA^ACG_TTAA ^A CG_TT Acyl Acyl GR^ACG_YCGR ^A CG_YC Afel Afel AGC^AGCTAGC ^A GCT Aflll Aflll C^ATTAA_GC ^A TTAA_G ‘Afllll ‘Afllll A^ACRYG_TA ^A CRYG_T Agel Agel A^ACCGG_TA ^A CCGG_T Ahalll Ahalll TTT^AAAATTT ^TO AAA Apal Apal G_GGCC^ACG_GGCC ^A C ApaLI ApaLI G^ATGCA_CG ^A TGCA_C Apol Apol R^AAATT_Y ^The R AATT_Y Ascl Ascl GG^ACGCG_CCGG ^A CGCG_CC Asei Asei AT^ATA_ATAT ^A TA_AT AsiSI AsiSI GCG_AT^ACGCGCG_AT ^A CGC Asull Asull TT^ACG_AATT ^A CG_AA Aval Aval C^AYCGR_GC ^A YCGR_G Avalll Avalll ATGCAT ATGCAT Avrll Avrll C^ACTAG_G ^The C CTAG_G Bali Bali TGG^ACCATGG CCA ‘BamHI ‘BamHI G^AGATC_CG ^A GATC_C Bani Bani G^AGYRC_CG ^A GYRC_C Banll Banll G_RGCY^ACG_RGCY ^A C *Bbel * Bbel G_GCGC^ACG_GCGC ^A C BbvCI BbvCI CC^ATCA_GCCC ^A TCA_GC ‘Bell ‘Bell T^AGATC_AT ^A GATC_A Betl Betl W^ACCGG_WW ^A CCGG_W BfrBI BfrBI ATG^ACATATG ^TO CAT

Enzimas *BglllEnzymes * Bglll

BloHIIBloHII

BlpiBlpi

Bme1580lBme1580l

BmglBmgl

BpulOIBpulOI

BsalBsal

BsaAIBsaAI

BsaHIBsaHI

BsaWIBsaWI

BsblBsbl

BsePIBsePI

BseSIBseSI

BsilBsil

BsiEIBsiEI

BsiWIBsiWI

BsmlBsml

Bsp1286lBsp1286l

Bsp1407lBsp1407l

BspEIBspEI

BspGIBspGI

BspHIBspHI

BspLU11lBspLU11l

BspMIIBspMII

BsrBIBsrBI

BsrDIBsrDI

BsrFIBsrFI

BsrGIBsrGI

BssHIIBssHII

BssSIBssSI

BstAPIBstAPI

BstBIBstBI

BstEIIBstEII

BstXIBstXI

BstYIBstYI

Seqüência de reconhecimento A^AGATC_T CTGCA^AG GC^ATNA_GC G_KGCM^ACRecognition sequence A ^A GATC_T CTGCA ^A G GC ^A TNA_GC G_KGCM ^A C

GKGCCCGKGCCC

CC^ATNA_GCCC ^A TNA_GC

GGTCTCN^ANNNN_GGTCTCN ^A NNNN_

YAC^AGTRYAC ^A GTR

GR^ACG_YCGR ^A CG_YC

W^ACCGG_WW ^A CCGG_W

CAACACCAACAC

G^ACGCG_CG ^A CGCG_C

G_KGCM^ACG_KGCM ^A C

C^AACGA_GC ^A ACGA_G

CG_RY^ACGCG_RY ^A CG

C^AGTAC_G ^The C GTAC_G

GAATG_CN^A GAATG_CN ^A

G_DGCH^ACG_DGCH ^A C

T^AGTAC_A ^The T GTAC_A

T^ACCGG_AT ^A CCGG_A

CTGGACCTGGAC

T^ACATG_AT ^A CATG_A

A^ACATG_TA ^A CATG_T

T^ACCGG_AT ^A CCGG_A

CCG^ACTCCCG ^TO CTC

GCAATG_NN^A GCAATG_NN ^A

R^ACCGG_YR ^A CCGG_Y

T^AGTAC_A ^The T GTAC_A

G^ACGCG_CG ^A CGCG_C

C^AACGA_GC ^A ACGA_G

GCAN_NNN^ANTGCGCAN_NNN ^THE NTGC

TT^ACG_AATT ^A CG_AA

G^AGTNAC_CG ^A GTNAC_C

CCAN_NNNN^ANTGGCCAN_NNNN ^TO NTGG

R^AGATC_YR ^A GATC_Y

EnzimasEnzymes

BstZ17lBstZ17l

Bsu36lBsu36l

Btgl Btrl Btsl Cfrl CfrlOI *Clal Dral Drall Drdll Dsal Eael Eagl Ecl136ll Eco47lll Eco NI EcoO109l ‘EcoRI ‘EcoRV Espl *Fsel *Fspl FspAI Gdill Hael Haell HgiAI HgiCI HgiJII *Hincll Hindll ‘Hindlll *Hpal KaslBtgl Btrl Btsl Cfrl CfrlOI * Clal Dral Drall Drdll Dsal Eael Eagl Ecl136ll Eco47lll Eco NI EcoO109l ‘EcoRI‘ EcoRV Espl * Fsel * Fspl FspAI Gdill Hael Haell HgiAI HgiCI HgiJII * Hincll Hindll ll

Seqüência de reconhecimento GTA^ATACGTA ^A TAC recognition sequence

CC^ATNA_GGCC ^A TNA_GG

C^ACRYG_GC ^A CRYG_G

CAC^AGTCCAC ^TO GTC

GCAGTG_NN^A GCAGTG_NN ^A

Y^AGGCC_RY ^A GGCC_R

R^ACCGG_YR ^A CCGG_Y

AT^ACG_ATAT ^A CG_AT

TTT^AAAATTT ^TO AAA

RG^AGNC_CYRG ^A GNC_CY

GAACCAGAACCA

C^ACRYG_GC ^A CRYG_G

Y^AGGCC_RY ^A GGCC_R

C^AGGCC_GC ^A GGCC_G

GAG^ACTCGAG ^TO CTC

AGC^AGCTAGC ^A GCT

CCTNN^AN_NNAGGCCTNN ^TO N_NNAGG

RG^AGNC_CYRG ^A GNC_CY

G^AAATT_CL AATT_C

GAT^AATCGAT ^A ATC

GC^ATNA_GCGC ^A TNA_GC

GG_CCGG^ACCGG_CCGG ^A CC

TGC^AGCATGC ^TO GCA

RTGC^AGCAYRTGC ^TO GCAY

C^AGGCC_RC ^A GGCC_R

WGG^ACCWWGG ^A CCW

R_GCGC^AYR_GCGC ^A Y

G_WGCW^ACG_WGCW ^A C

G^AGYRC_CG ^A GYRC_C

G_RGCY^ACG_RGCY ^A C

GTY^ARACGTY ^A RAC

A^AAGCT_TA ^A AGCT_T

GTT^AAACGTT AAC

G^AGCGC_CG ^A GCGC_C

<2>3<2> 3

Enzimas Enzymes Seqüência de reconhecimento Recognition string *Kpnl * Kpnl G_GTAC^ACG_GTAC ^A C Lpnl Lpnl RGC^AGCYRGC ^A GCY Mcrl Mcrl CG_RY^ACGCG_RY ^A CG Mfel Mfel C^AAATT_G ^The C AATT_G *Mlul * Mlul A^ACGCG_TA ^A CGCG_T Mscl Mscl TGG^ACCATGG CCA MspA11 MspA11 CMG^ACKGCMG ^TO CKG Mstl Mstl TGC^AGCATGC ^TO GCA Nael Nael GCC^AGGC ^The GCC GGC Narl Narl GG^ACG_CCGG ^A CG_CC *Ncol * Ncol C^ACATG_GC ^A CATG_G *Ndel * Ndel CA^ATA_TGCA ^TO TA_TG *NgoMIV * NgoMIV G^ACCGG_CG ^A CCGG_C *Nhel * Nhel G^ACTAG_CG ^A CTAG_C NIÍ3877I NIÍ3877I C_YCGR^AGC_YCGR ^A G *Notl * Notl GC^AGGCC_GCGC ^A GGCC_GC *Nrul * Nrul TCG^ACGATCG ^TO CGA *Nsil * Nsil A_TGCA^ATA_TGCA ^A T Nspl Nspl R_CATG^AYR_CATG ^A Y NspBII NspBII CMG^ACKGCMG ^TO CKG *Pacl * Pacl TTA_AT^ATAATTA_AT ^A TAA *Pcil * Easy A^ACATG_TA ^A CATG_T Pfl11081 Pfl11081 TCGTAG TCGTAG ‘PfIMI ‘PfIMI CCAN_NNN^ANTGGCCAN_NNN ^THE NTGG PmaCI PmaCI CAC^AGTGCAC ^A GTG Pmel Pmel GTTT^AAAACGTTT ^TO AAAC Pmll Pmll CAC^AGTGCAC ^A GTG PpulOI PpulOI A^ATGCA_TA ^A TGCA_T *PpuMI * PpuMI RG^AGWC_CYRG ^A GWC_CY PshAI PshAI GACNN^ANNGTCGACNN ^TO NNGTC Psil Psil TTA^ATAATTA ^TO TAA ‘PspOMI ‘PspOMI G^AGGCC_CG ^A GGCC_C Pssl Pssl RG_GNC^ACYRG_GNC ^A CY ‘Pstl ‘Pstl C_TGCA^AGC_TGCA ^A G *Pvul * Pvul CG_AT^ACGCG_AT ^A CG

Enzimas *PvullEnzymes * Pvull

Rsrll *Sacl *Sacll *SallRsrll * Sacl * Sacll * Sall

SanDISanDI

SaplSapl

SaulSaul

Sbfl *ScalSbfl * Scal

ScilScil

SdulSdul

SexAIFriAI

SfclSfcl

SfelSfel

SfilSfil

SfolSfol

SgflSgfl

SgrAI *SmalSgrAI * Smal

SmllSmll

Snal *SnaBI *Spel *SphlSnal * SnaBI * Spel * Sphl

SpllSpll

SrflSrfl

Sse232lSse232l

Sse8387lSse8387l

Sse8647l *Sspl *Stul *Styl *SwalSse8647l * Sspl * Stul * Styl * Swal

TatlTatl

Seqüência de reconhecimento CAG^ACTGRecognition sequence CAG CTG

CG^AGWC_CGCG ^A GWC_CG

G_AGCT^ACG_AGCT ^A C

CC_GC^AGGCC_GC ^A GG

G^ATCGA_CG ^A TCGA_C

GG^AGWC_CCXL GWC_CC

GCTCTTCN^ANNN_GCTCTTCN ^A NNN_

CC^ATNA_GGCC ^A TNA_GG

CC_TGCA^AGGCC_TGCA ^A GG

AGT^AACTAGT ^A ACT

CTC^AGAGCTC ^A GAG

G_DGCH^ACG_DGCH ^A C

A^ACCWGG_TA ^A CCWGG_T

C^ATRYA_GC ^A TRYA_G

GGCCN_NNN^ANGGCCGGCCN_NNN ^TO NGGCC

GGC^AGCCGGC ^TO GCC

GCG_AT^ACGCGCG_AT ^A CGC

CR^ACCGG_YGCR ^A CCGG_YG

CCC^AGGGCCC ^A GGG

C^ATYRA_GC ^A TYRA_G

GTATACGTATAC

TAC^AGTATAC ^A GTA

A^ACTAG_T ^A CTAG_T

G_CATG^ACG_CATG ^A C

C^AGTAC_G ^The C GTAC_G

GCCC^AGGGCGCCC ^TO GGGC

CG^ACCGG_CGCG ^A CCGG_CG

CC_TGCA^AGGCC_TGCA ^A GG

AG^AGWC_CTAG ^A GWC_CT

AAT^AATTAAT ^TO ATT

AGG^ACCTAGG ^A CCT

C^ACWWG_GC ^A CWWG_G

ATTT^AAAATATTT ^TO AAAT

W^AGTAC_WW ^A GTAC_W

• · ······»· • · · · • · · · · • · * φ · · · · » (ÓA>• · ······ »· • · · · · · · · · · * φ · · · ·» (ÓA>

Enzimas Enzymes Seqüência de reconhí Recognition string UbaMI UbaMI TCCNGGA TCCNGGA UbaPI UbaPI CGAACG CGAACG *Vspl * Vspl AT^ATA_ATAT ^A TA_AT *Xbal * Xbal T^ACTAG_AT ^A CTAG_A *Xhol * Xhol C^ATCGA_GC ^A TCGA_G Xholl Xholl R^AGATC_YR ^A GATC_Y *Xmal * Xmal C^ACCGG_GC ^A CCGG_G Xmalll Xmalll C^AGGCC_GC ^A GGCC_G Xmnl Xmnl GAANN^ANNTTCGAANN ^TO NNTTC Zral Zral GAC^AGTCGAC ^TO GTC

Uma pesquisa padrão pode ser realizada para encontrar seqüências desestabilizadoras e sítios de poliadenilação potenciais e depois romper ou eliminá-las como descrito na Patente US N^? 5.500.365. Certos trechos longos de regiões ricas em AT, por exemplo, o motivo de seqüência ATTTA (ou AUUUA, como aparece no RNA) foi implicado como uma seqüência desestabilizadora em mRNA de célula de mamífero (Shaw e Kamen, Cell 46: 659 a 667, 1986). Muitos mRNAs de vida curta têm regiões não traduzidas 3' ricas em A+T e estas regiões freqüentemente têm a seqüência ATTTA, algumas vezes presente em cópias múltiplas ou como multímeros (por exemplo, ATTTATTTA . . .). Shaw e Kamen mostraram que a transferência da extremidade 3' de um mRNA instável para um RNA estável (globina ou VA1) diminuiu a meia vida do RNA estável dramaticamente. Eles mostraram ainda que um pentâmero de Al l IA teve um efeito desestabilizador profundo em uma mensagem estável e que este sinal pode exercer o seu efeito se ele estiver localizado na extremidade 3' ou dentro da seqüência codificadora. Entretanto, o número de seqüências Al I IA e/ou o contexto de seqüência em que elas ocorrem também parece ser importante na determinação se elas funcionam como seqüências desestabilizadoras. Eles também mostraram que um trímero de Al l lA teve muito menos efeito do que um pentâmero sobre a estabilidade de mRNA e um dímero ou um monômero não teve nenhum efeito sobre a estabilidade. Note que os multímeros de ATTTA tais como um pentâmero automaticamente cria uma região rica emA standard search can be performed to find destabilizing sequences and potential polyadenylation sites and then disrupt or eliminate them as described in US Patent N ^? 5,500,365. Certain long stretches of AT-rich regions, for example, the ATTTA sequence motif (or AUUUA, as it appears in RNA) have been implicated as a destabilizing sequence in mammalian cell mRNA (Shaw and Kamen, Cell 46: 659 to 667, 1986). Many short-lived mRNAs have untranslated 3 'regions rich in A + T and these regions often have the ATTTA sequence, sometimes present in multiple copies or as multimers (eg ATTTATTTA...). Shaw and Kamen showed that transferring the 3 'end of an unstable mRNA to a stable RNA (globin or VA1) decreased the stable RNA half life dramatically. They further showed that an Al 1A pentamer had a profound destabilizing effect on a stable message and that this signal can exert its effect if it is located at the 3 'end or within the coding sequence. However, the number of Al I IA sequences and / or the sequence context in which they occur also appears to be important in determining whether they function as destabilizing sequences. They also showed that an Al lA trimer had much less effect than a pentamer on mRNA stability and a dimer or monomer had no effect on stability. Note that ATTTA multimers such as a pentamer automatically create a region rich in

·· «·♦······· «· ♦ ·····

A+T. Em outros mRNAs instáveis, a seqüência ATTTA pode estar presente em apenas uma única cópia, mas está freqüentemente contida em uma região rica em A+T. Uma repetição de 11 pentâmeros de AUUUA mostraram alvejar transcritos repórter para a degradação rápida em plantas (OhmeTakagi et al., Proc. Nat. Acad. Sei. USA 90, 11811 a 11815, 1993). A seqüência ATTTA pode ser formada pela combinação de códons para o aminoácido Ile (ATT) e Tyr (TAT) como mostrado A11 /AT. Um outro exemplo pode ser os códons que terminam em AT como em Asn, Asp, His ou Tyr, seguido pelo códon TTA para Leu (por exemplo, AATTTA). Também o códon para Phe (UUU) quando colocado entre os códons que termina em A e começa em A formará o motivo ATTTA. Para eliminar este motivo usualmente a mudança de nucleotídeo único é suficiente como no exemplo mostrado: GCA7TTAGC muda para GCATTCAGC ou GCCTTTAGC. Todos os três polinucleotídeos mostrados codificam Ala-Phe-Arg.A + T. In other unstable mRNAs, the ATTTA sequence may be present in only a single copy, but is often contained in an A + T-rich region. A repeat of 11 AUUUA pentamers has been shown to target reporter transcripts for rapid degradation in plants (OhmeTakagi et al., Proc. Nat. Acad. Sci. USA 90, 11811 to 11815, 1993). The ATTTA sequence can be formed by combining codons for the amino acid Ile (ATT) and Tyr (TAT) as shown in A11 / AT. Another example can be codons ending in AT as in Asn, Asp, His or Tyr, followed by the codon TTA for Leu (for example, AATTTA). Also the codon for Phe (UUU) when placed between the codons that ends at A and starts at A will form the ATTTA motif. To eliminate this reason, usually the single nucleotide change is sufficient as in the example shown: GCA7TTAGC changes to GCATTCAGC or GCCTTTAGC. All three polynucleotides shown encode Ala-Phe-Arg.

Mais seqüências que atuam em cis que alvejam o transcrito para a rotação rápida em plantas e em outro sistema foram identificadas (Abler e Green, Plant Mol. Biol. 32: 63 a 78, 1997). Estas incluem o elemento DST que consiste em três subdomínios altamente conservados separados pelas duas regiões variáveis encontradas a jusante do códon de parada dos transcritos SAUR (Newmaan etal., Vegetal Cell 5: 701 a 714, 1993). A seqüência conservada de DST consiste em GGAG(N5)CArAGAn'G(N₇)CATTTTG7'A7', onde os resíduos altamente conservados são mostrados em tipo itálico. O segundo e terceiro subdomínios de elementos DST contêm resíduos que são invariáveis entre os elementos DST e são chamados de ATAGAT e GTA, respectivamente. Ambos destes subdomínios são necessários para a função DST. Novas seqüências de polinucleotídeo artificiais são tríadas quanto a presença de motivos conservados de elementos DST GGAG, ATAGATT, CATTT e CAI I I IGTAT. Estas seqüências são eliminadas pelas substituições de base de códons que preservam a seqüência de proteína codificada pelo polinucleotídeo. Os motivos de seqüência DST GGAG, ATAGAT, CATTT e GTAT que apareceram em cachos ou padrões similares à seqüência de DST conservada também são eliminados pelas substituições γMore cis-acting sequences that target the transcript for rapid rotation in plants and in another system have been identified (Abler and Green, Plant Mol. Biol. 32: 63 to 78, 1997). These include the DST element which consists of three highly conserved subdomains separated by the two variable regions found downstream from the stop codon of the SAUR transcripts (Newmaan etal., Vegetal Cell 5: 701 to 714, 1993). The conserved STD sequence consists of GGAG (N5) CArAGAn'G (N ₇ ) CATTTTG7'A7 ', where highly conserved residues are shown in italics. The second and third subdomains of DST elements contain residues that are invariable between DST elements and are called ATAGAT and GTA, respectively. Both of these subdomains are required for the DST function. New artificial polynucleotide sequences are screened for the presence of conserved motifs of DST elements GGAG, ATAGATT, CATTT and CAI II IGTAT. These sequences are eliminated by codon-based substitutions that preserve the protein sequence encoded by the polynucleotide. DST sequence motifs GGAG, ATAGAT, CATTT and GTAT that appeared in clusters or patterns similar to the conserved STD sequence are also eliminated by γ substitutions

··· • * • · ··· • * • · • · « · · • · · • · · • · «· · • · · • · · ···· • • ···· • • • ·· • • ·· • «· ········ • · · · • » * «· ········ • · · · • »* • · • · ·· · ·· · • · · • · · • • • • • · · • · · • · · • · · • · • · • • • · · • · · * · · · · * · · · · ·· ·· ··· ··· ···· · · ···· · · • · • ·

de base.base.

As seqüências de polinucleotídeo que possivelmente podem funcionar como sítios de poliadenilação são eliminados no novo planejamento de polinucleotídeo (Patente U.S. N° 5.500.365). Estes sinais de poliadenilação podem não atuar como sítios poliA apropriados, mas ao invés funcionam como sítios aberrantes que dão origem aos mRNAs instáveis.Polynucleotide sequences that may possibly function as polyadenylation sites are eliminated in the new polynucleotide design (U.S. Patent No. 5,500,365). These polyadenylation signals may not act as appropriate polyA sites, but instead function as aberrant sites that give rise to unstable mRNAs.

A adição de um cordão poliadenilado à extremidade 3' de um mRNA é comum à maior parte dos mRNAs eucarióticos. Contidos dentro estes transcritos de mRNA são sinais para a poliadenilação e a formação da extremidade 3' apropriada. Este processamento na extremidade 3' envolve a divagem do mRNA e a adição de poliA à extremidade 3' madura. Pesquisando-se quanto as seqüências de consenso próximas ao trato poliA nos mRNAs tanto vegetal quanto animal, foi possível identificar seqüências de consenso que aparentemente estão envolvidas na adição de poliA e na divagem da extremidade 3'. As mesmas seqüências de consenso parecem ser importantes a ambos destes processos. Estes sinais são tipicamente uma variação na seqüência AATAAA. Em células de animal, algumas variantes desta seqüência que são funcionais foram identificadas; em células vegetais parece haver uma faixa ampliada de seqüências funcionais (Dean et al., Nucl Acid Res., 14: 2229 a 2240, 1986; Hunt, Annu Rev. Plant Physiol. Plant Mol. Biol 45: 47 a 60, 1994; Rothine, Plant Mol. Biol. 32: 43 a 61, 1996). Todas estas seqüências de consenso são variações em AATAAA, portanto, todas elas são seqüências ricas em A+T.The addition of a polyadenylated strand to the 3 'end of an mRNA is common to most eukaryotic mRNAs. Contained within these mRNA transcripts are signals for polyadenylation and the formation of the appropriate 3 'end. This processing at the 3 'end involves dividing the mRNA and adding polyA to the mature 3' end. By searching for consensus sequences close to the polyA tract in both plant and animal mRNAs, it was possible to identify consensus sequences that are apparently involved in the addition of polyA and the divination of the 3 'end. The same consensus strings seem to be important for both of these processes. These signals are typically a variation in the AATAAA sequence. In animal cells, some variants of this sequence that are functional have been identified; in plant cells there appears to be an extended range of functional sequences (Dean et al., Nucl Acid Res., 14: 2229 to 2240, 1986; Hunt, Annu Rev. Plant Physiol. Plant Mol. Biol 45: 47 to 60, 1994; Rothine, Plant Mol. Biol. 32: 43 to 61, 1996). All of these consensus strings are variations on AATAAA, so they are all strings rich in A + T.

Tipicamente, para se obter expressão suficiente de transgenes modificados em plantas, a seqüência codificadora de polinucleotídeo estrutural existente (gene estrutural) que codifica a proteína de interesse é modificada pela remoção das seqüências Al l IA e dos sinais de poliadenilação putativos pela mutagênese direcionada a sítio do DNA que compreende o gene estrutural. Substancialmente todos os sinais de poliadenilação conhecidos e as seqüências ATTTA são removidos no polinucleotídeo modificado, embora os níveis de expressão realçados são freqüentemente observados apenas com a remoção de algumas das seqüências de sinal de poliadenila60Typically, to obtain sufficient expression of modified transgenes in plants, the existing structural polynucleotide coding sequence (structural gene) encoding the protein of interest is modified by removing the putative AI1A sequences and putative polyadenylation signals by site-directed mutagenesis of the DNA that comprises the structural gene. Substantially all known polyadenylation signals and ATTTA sequences are removed in the modified polynucleotide, although enhanced expression levels are often observed only with the removal of some of the polyadenyl signal sequences60

6# ção identificadas acima. Alternativamente, se um polinucleotideo artificial é preparado codificando a proteína objeto, os códons são selecionados para se evitar a seqüência ATTTA e os sinais de poliadenilação putativos. Para os propósitos da presente invenção os sinais de poliadenilação putativos incluem, mas não são necessariamente limitados a, AATAAA, AATAAT, AACCAA, ATATAA, AATCAA, ATACTA, ATAAAA, ATGAAA, AAGCAT, ATTAAT, ATACAT, AAAATA, ATTAAA, AATTAA, AATACA e CATAAA.6 # tion identified above. Alternatively, if an artificial polynucleotide is prepared encoding the object protein, the codons are selected to avoid the ATTTA sequence and putative polyadenylation signals. For the purposes of the present invention putative polyadenylation signals include, but are not necessarily limited to, AATAAA, AATAAT, AACCAA, ATATAA, AATCAA, ATACTA, ATAAAA, ATGAAA, AAGCAT, ATTAAT, ATACAT, AAAATA, ATTAAA, AATTAA, AATACA and CATAAA.

A seqüência de DNA selecionada é escaneada para identificar regiões com mais do que quatro nucleotídeos de adenina (A) ou timina (T) consecutivos. As regiões A+T são escaneadas quanto aos sinais de poliadenilação vegetal potenciais. Embora a ausência de cinco ou mais nucleotídeos A ou T consecutivos elimina a maior parte dos sinais de poliadenilação vegetal, se existe mais do que um dos sinais de poliadenilação menores identificados dentro de dez nucleotídeos entre si, então a seqüência de nucleotídeo desta região é alterada para remover estes sinais enquanto mantém a seqüência de aminoácido codificada original.The selected DNA sequence is scanned to identify regions with more than four consecutive adenine (A) or thymine (T) nucleotides. The A + T regions are scanned for potential plant polyadenylation signals. Although the absence of five or more consecutive A or T nucleotides eliminates most of the plant polyadenylation signals, if there is more than one of the minor polyadenylation signals identified within ten nucleotides together, then the nucleotide sequence of this region is altered to remove these signals while maintaining the original encoded amino acid sequence.

A etapa seguinte é considerar os cerca de 15 a cerca de 30 ou mais resíduos de nucleotídeo que circundam a região rica em A+T. Se o teor de A+T da região circundante for menor do que 80 %, a região deve ser examinada quanto a sinais de poliadenilação. A alteração da região com base em sinais de poliadenilação é dependente (1) do número de sinais de poliadenilação presentes e (2) da presença de um sinal de poliadenilação vegetal maior. Os sinais de poliadenilação são removidos pela substituição de base da seqüência de DNA no contexto da substituição de códon.The next step is to consider the approximately 15 to about 30 or more nucleotide residues that surround the A + T-rich region. If the A + T content of the surrounding region is less than 80%, the region should be examined for signs of polyadenylation. Alteration of the region based on polyadenylation signals is dependent (1) on the number of polyadenylation signals present and (2) on the presence of a larger plant polyadenylation signal. The polyadenylation signals are removed by the base substitution of the DNA sequence in the context of codon substitution.

Dois padrões adicionais não identificados na Patente US N° 5.500.365. são procurados e eliminados nas formas de realização da presente invenção. As seqüências AGGTAA e GCAGGT são seqüências de consenso para os sítios de junção 5' e 3’ de intron, respectivamente, em plantas monocotiledôneas e plantas dicotiledôneas. Apenas GT do sítio de junção 5' e o AG no sítio de junção 3' são requeridos para ser um empareIhamento exato. Entretanto, quando da condução de uma pesquisa quanto a estas seqüências de consenso, nenhum desemparelhamento é permitidoTwo additional standards not identified in US Patent No. 5,500,365. are sought and eliminated in the embodiments of the present invention. The AGGTAA and GCAGGT sequences are consensus sequences for the 5 'and 3' intron junction sites, respectively, in monocotyledonous and dicotyledonous plants. Only GT from the 5 'junction site and AG at the 3' junction site are required to be an exact match. However, when conducting research on these consensus strings, no mismatches are allowed

Ó5 para cada base.Ó5 for each base.

Depois de cada etapa o mapeamento de seqüência é feito usando o programa MAP da GCG para determinar a localização das matrizes de leitura aberta e identificar padrões de seqüência que ainda precisam ser modificados. A etapa final seria realizar a análise da identidade de seqüência usando por exemplo o programa GAP do pacote da GCG para determinar o grau de divergência de seqüência e a identidade percentual.After each step, the sequence mapping is done using the GCG MAP program to determine the location of the open reading matrices and identify sequence patterns that still need to be modified. The final step would be to perform the sequence identity analysis using, for example, the GAP program in the GCG package to determine the degree of sequence divergence and the percent identity.

PolipeptídeosPolypeptides

No geral, a proteína traduzida do polinucleotídeo artificial terá a mesma seqüência de aminoácido como a proteína traduzida da região codificadora não-modificada. Entretanto, a substituição de códons que codificam aminoácidos que fornecem um homólogo funcional da proteína é um aspecto da invenção. Por exemplo, certos aminoácidos podem ser substituídos no lugar de outros aminoácidos em uma estrutura de proteína sem perda apreciável da capacidade de ligação interativa com as estruturas tais como, por exemplo, regiões de ligação de antígeno de anticorpos ou sítios de ligação nas moléculas do substrato. Visto que é a capacidade e a natureza interativas de uma proteína que define essa atividade funcional biológica da proteína, certas substituições de seqüência de aminoácido podem ser feitas em uma seqüência de proteína, e, naturalmente, a sua seqüência codificadora de DNA subjacente, e não obstante obter uma proteína com propriedades semelhantes. É assim considerado pelos inventores que várias mudanças podem ser feitas nas seqüências de peptídeo das composições divulgadas realizando-se mudanças nas seqüências de DNA correspondentes que codificam os peptídeos em que os peptídeos não mostraram nenhuma perda apreciável de sua utilidade ou atividade biológicas.In general, the translated protein from the artificial polynucleotide will have the same amino acid sequence as the translated protein from the unmodified coding region. However, the replacement of codons that encode amino acids that provide a functional homolog of the protein is an aspect of the invention. For example, certain amino acids can be substituted in place of other amino acids in a protein structure without appreciable loss of the ability to interact interactively with structures such as, for example, antibody antigen binding regions or binding sites on substrate molecules . Since it is the interactive ability and nature of a protein that defines this biological functional activity of the protein, certain amino acid sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding sequence, not despite obtaining a protein with similar properties. It is thus considered by the inventors that several changes can be made to the peptide sequences of the disclosed compositions by making changes to the corresponding DNA sequences encoding the peptides in which the peptides have not shown any appreciable loss of their biological utility or activity.

Um outro aspecto da invenção compreende homólogos funcionais, que diferem em um ou mais aminoácidos daqueles de um polipeptídeo aqui fornecido como o resultado de uma ou mais substituições de aminoácido conservativas. É bem conhecido na técnica que um ou mais aminoácidos em uma seqüência nativa podem ser substituídos com pelo menos um outro aminoácido, a carga e a polaridade do qual são similares àquelas do amino62Another aspect of the invention comprises functional homologs, which differ in one or more amino acids from those of a polypeptide provided herein as the result of one or more conservative amino acid substitutions. It is well known in the art that one or more amino acids in a native sequence can be replaced with at least one other amino acid, the charge and polarity of which are similar to those of amino62

Ύ) ácido nativo, resultando em uma mudança silenciosa. Por exemplo, valina é um substituto conservativo para alanina e treonina é um substituto conservativo para serina. As substituições conservativas para um aminoácido dentro da seqüência de polipeptideo nativa podem ser selecionadas a partir de outros membros da classe a qual o aminoácido que ocorre naturalmente pertence. Os aminoácidos podem ser divididos nos seguintes quatro grupos: (1) aminoácidos ácidos, (2) aminoácidos básicos, (3) aminoácidos polares neutros e (4) aminoácidos não-polares neutros. Os aminoácidos representativos dentro destes vários grupos incluem, mas não são limitados a: (1) aminoácidos ácidos (negativamente carregados) tais como ácido aspártico e ácido glutâmico; (2) aminoácidos básicos (positivamente carregados) tais como arginina, histidina e lisina; (3) aminoácidos polares neutros tais como glicina, serina, treonina, cisteína, tirosina, asparagina e glutamina; e (4) aminoácidos não-polares neutros (hidrofóbicos) tais como alanina, leucina, isoleucina, valina, prolina, fenilalanina, triptofano e metionina. Os substitutos conservados para um aminoácido dentro de uma seqüência de aminoácido nativa pode ser selecionada de outros membros do grupo ao qual o aminoácido que ocorre naturalmente pertence. Por exemplo, um grupo de aminoácidos tendo cadeias laterais alifáticas é glicina, alanina, valina, leucina e isoleucina; um grupo de aminoácidos tendo cadeias laterais hidroxílicas alifáticas é serina e treonina; um grupo de aminoácidos tendo cadeias laterais contendo amida é asparagina e glutamina; um grupo de aminoácidos tendo cadeias laterais aromáticas é fenilalanina, tirosina e triptofano; um grupo de aminoácidos tendo cadeias laterais básicas é lisina, arginina e histidina; e um grupo de aminoácidos tendo cadeias laterais contendo enxofre é cisteína e metionina. Os grupos de substituição de aminoácidos naturalmente conservativos são: valina-leucina, valina-isoleucina, fenilalanina-tirosina, lisina-arginina, alanina-valina, ácido aspártico-ácido glutâmico e asparagina-glutamina. Contructos de DNAΎ) native acid, resulting in a silent change. For example, valine is a conservative substitute for alanine and threonine is a conservative substitute for serine. Conservative substitutions for an amino acid within the native polypeptide sequence can be selected from other members of the class to which the naturally occurring amino acid belongs. Amino acids can be divided into the following four groups: (1) acidic amino acids, (2) basic amino acids, (3) neutral polar amino acids and (4) neutral non-polar amino acids. Representative amino acids within these various groups include, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine and lysine; (3) neutral polar amino acids such as glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine; and (4) neutral (hydrophobic) non-polar amino acids such as alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. Conserved substitutes for an amino acid within a native amino acid sequence can be selected from other members of the group to which the naturally occurring amino acid belongs. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine and isoleucine; a group of amino acids having aliphatic hydroxy side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine and tryptophan; a group of amino acids having basic side chains is lysine, arginine and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. The naturally conservative amino acid substitution groups are: valine-leucine, valine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, aspartic acid-glutamic acid and asparagine-glutamine. DNA Contracts

O Material genético exógeno pode ser transferido em uma planta pelo uso de um constructo de DNA projetada para tal propósito pelos métodos que utilizam Agrobacterium, bombardeamento de partícula ou outrosExogenous genetic material can be transferred in a plant using a DNA construct designed for this purpose by methods using Agrobacterium, particle bombardment or others

• · · · · · · >1 métodos conhecidos por aqueles versados na técnica. O projeto de tal constructo de DNA está em geral dentro do conhecimento da técnica (Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer, Nova Iorque (1997). Os exemplos de tais plantas nas quais o material genético exógeno pode ser transferido, incluem, sem limitação, alfalfa, Arabidopsis, cevada, Brassica, brócoli, repolho, Citrus, algodão, alho, aveia, colza, cebola, canola, linho, milho, uma planta anual ornamental e perene ornamental, ervilha, amendoim, pimentão, batata, arroz, centeio, sorgo, soja, morango, cana de açúcar, beterraba, tomate, trigo, álamo, pinheiro, abeto, eucalipto, maçã, alface, lentilhas, uva, banana, chá, gramas, girassol, dendezeiro, Phaseolus, árvores, arbustos, videiras, etc. É bem conhecido que as plantas agronomicamente importantes compreendem genótipos, variedades e cultivares e que os métodos e composições da presente invenção podem ser testados nestas plantas por aqueles versados comumente na técnica da biologia molecular vegetal e do cruzamento vegetal.• · · · · · ·> 1 methods known to those skilled in the art. The design of such a DNA construct is generally within the knowledge of the technique (Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer, New York (1997). Examples of such plants in which exogenous genetic material can be transferred , include, without limitation, alfalfa, Arabidopsis, barley, Brassica, broccoli, cabbage, Citrus, cotton, garlic, oats, rapeseed, onion, canola, flax, corn, an ornamental and perennial ornamental annual plant, peas, peanuts, peppers, potato, rice, rye, sorghum, soy, strawberry, sugar cane, beet, tomato, wheat, poplar, pine, fir, eucalyptus, apple, lettuce, lentils, grape, banana, tea, grasses, sunflower, oil palm, Phaseolus, trees, shrubs, vines, etc. It is well known that agronomically important plants comprise genotypes, varieties and cultivars and that the methods and compositions of the present invention can be tested on these plants by those of ordinary skill in modern biology. lecular plant and plant crossing.

Um grande número de moléculas promotoras de DNA isoladas que são ativas como um elemento genético de um transgene em células vegetais foi descrito. Estas incluem o promotor da nopalina sintase (P-nos) (Ebert et al., Proc. Natl. Acad. Sci. (U.S.A.) 84: 5745 a 5749, 1987), a totalidade da qual é aqui incorporada por referência), o promotor da octopina sintase (P-ocs), que são transportadas em plasmídeos indutores de tumor de Agrobacterium tumefaciens, os promotores de caulimovirus, tais como o promotor 19S do vírus mosaico da couve-flor (CaMV) (Lawton et al., Plant Mol. Biol. 9: 315 a 324, 1987), a totalidade da qual é aqui incorporada por referência) e o promotor 35S de CaMV (Odell et al., Nature 313: 810 a 812, 1985), a totalidade da qual é aqui incorporada por referência), o promotor 35S do vírus mosaico de escrofulária (Patente U.S. N^s 6.018.100, a totalidade da qual é aqui incorporada por referência), o promotor indutível por luz da subunidade pequena de ribulose-1,5-bis-fosfato carboxilase (ssRUBISCO), o promotor Adh (Walker et al., Proc. Natl. Acad. Sci. (U.S.A.) 84: 6624 a 6628, 1987), a totalidade da qual é aqui incorporada por referência), o promotor da sacarose sintase (Yang et al., Proc. Natl. Acad. Sci. (U.S.A.) 87: 4144 a • · · · · ·A large number of isolated DNA-promoting molecules that are active as a genetic element of a transgene in plant cells have been described. These include the promoter of nopaline synthase (P-nos) (Ebert et al., Proc. Natl. Acad. Sci. (USA) 84: 5745 to 5749, 1987), all of which is incorporated by reference), the octopine synthase promoter (P-ocs), which are transported in tumor-inducing plasmids from Agrobacterium tumefaciens, the kaolinimovirus promoters, such as the cauliflower mosaic virus (CaMV) 19S promoter (Lawton et al., Plant Mol Biol. 9: 315 to 324, 1987), all of which are hereby incorporated by reference) and the CaMV 35S promoter (Odell et al., Nature 313: 810 to 812, 1985), all of which are herein incorporated by reference), the 35S promoter from Figwort mosaic virus (US Patent ^No.'s 6,018,100, the entirety of which is incorporated herein by reference), the light - inducible promoter from the small subunit of ribulose-1,5-bis phosphate carboxylase (ssRUBISCO), the Adh promoter (Walker et al., Proc. Natl. Acad. Sci. (USA) 84: 6624 to 6628, 1987), all of which are included here provided by reference), the sucrose synthase promoter (Yang et al., Proc. Natl. Acad. Sci. (USA) 87: 4144 a • · · · · ·

4148, 1990), a totalidade da qual é aqui incorporada por referência), o promotor do complexo do gene R (Chandler et al., Plant Cell 1: 1175 a 1183, 1989, a totalidade da qual é aqui incorporada por referência) e o promotor do gene da proteína de ligação de clorofila a/b, etc.4148, 1990), all of which is incorporated herein by reference), the promoter of the R gene complex (Chandler et al., Plant Cell 1: 1175 to 1183, 1989, all of which is incorporated by reference) and the a / b chlorophyll binding protein gene promoter, etc.

Uma variedade de promotores especifica mente ativos em tecidos vegetativos, tais como folhas, hastes, raízes e tubérculos, podem ser usados para expressar as moléculas de ácido nucléico da presente invenção. Os exemplos de promotores específicos de tubérculos incluem, mas não são limitados aos promotores de patatina classe I e II (Bevan et al., EMBO J. 8: 1899 a 1906, 1986); Koster-Topfer et al., Mol Gen Genet. 219; 390 a 396, 1989); Mignery et al., Gene 62: 27 a 44, 1988); Jefferson et al., Plant Mol. Biol. 14: 995 a 1006, 1990), aqui incorporada por referência nas suas totalidades), o promotor para os genes ADPGPP do tubérculos, tanto a subunidade grande quanto a pequena; o promotor da sacarose sintase (Salanoubat e Belliard, Gene 60: 47 a 56, 1987), Salanoubat e Belliard, Gene 84: 181 a 185, 1989), aqui incorporada por referência em sua totalidade); e o promotor para as proteínas de tubérculos principais incluindo os complexos de proteína de 22 kd e inibidores da proteinase (Hannapel, Plant Physiol. 10T. 703 a 704, 1993), aqui incorporada por referência em sua totalidade). Os exemplos de promotores específicos de folha incluem mas não são limitados aos promotores da ribulose bifosfato carboxilase (RbcS ou RuBISCO) (ver, por exemplo, Matsuoka et al., Plant J. 6; 311 a 319, 1994), aqui incorporada por referência em sua totalidade); o promotor do gene da proteína de ligação da clorofila de coleta de luz a/b (ver, por exemplo, Shiina et al., Plant Physiol. 115: 477 a 483. 1997; Casal etal., Plant Physiol. 116: 1533 a 1538, 1998, aqui incorporada por referência em suas totalidades); e o promotor do gene relacionado com myb da Arabidopsis thaliana (Atmyb5) (Li et al., FEBS Lett. 379: 117 a 121, 1996, aqui incorporada por referência em sua totalidade). Os exemplos de promotor específico de raiz incluem mas não são limitados ao promotor para o gene da quitinase ácida (Samac et al., Plant Mol. Biol. 25; 587 a 596, 1994), aqui incorporada por referência em sua totalidade); os subdomínios específicos de raiz do promotor CaMV35S que foramA variety of promoters specifically active in vegetative tissues, such as leaves, stems, roots and tubers, can be used to express the nucleic acid molecules of the present invention. Examples of tuber-specific promoters include, but are not limited to, class I and II patatin promoters (Bevan et al., EMBO J. 8: 1899 to 1906, 1986); Koster-Topfer et al., Mol Gen Genet. 219; 390 to 396, 1989); Mignery et al., Gene 62: 27 to 44, 1988); Jefferson et al., Plant Mol. Biol. 14: 995 to 1006, 1990), hereby incorporated by reference in their entirety), the promoter for the tuber ADPGPP genes, both large and small subunits; the sucrose synthase promoter (Salanoubat and Belliard, Gene 60: 47 to 56, 1987), Salanoubat and Belliard, Gene 84: 181 to 185, 1989), hereby incorporated by reference in its entirety); and the promoter for major tuber proteins including 22 kd protein complexes and proteinase inhibitors (Hannapel, Plant Physiol. 10T. 703 to 704, 1993), hereby incorporated by reference in their entirety). Examples of specific leaf promoters include, but are not limited to, ribulose bisphosphate carboxylase promoters (RbcS or RuBISCO) (see, for example, Matsuoka et al., Plant J. 6; 311 to 319, 1994), incorporated herein by reference in its entirety); the a / b light collecting chlorophyll binding protein gene promoter (see, for example, Shiina et al., Plant Physiol. 115: 477 to 483. 1997; Casal etal., Plant Physiol. 116: 1533 a 1538, 1998, incorporated herein by reference in its entirety); and the myb-related gene promoter from Arabidopsis thaliana (Atmyb5) (Li et al., FEBS Lett. 379: 117 to 121, 1996, hereby incorporated by reference in its entirety). Examples of root-specific promoter include but are not limited to the promoter for the acid chitinase gene (Samac et al., Plant Mol. Biol. 25; 587 to 596, 1994), hereby incorporated by reference in its entirety); the root specific subdomains of the CaMV35S promoter that were

identificados (Lam et al., Proc. Natl. Acad. Sei. (U.S.A.) 86: 7890 a 7894. 1989, aqui incorporada por referência em sua totalidade); o promotor da ORF13 de Agrobacterium rhizogenes que exibe alta atividade nas raízes (Hansen etal., Mol. Gen. Genet. 254: 337 a 343. 1997), aqui incorporada por referência em sua totalidade); o promotor para o gene específico da raiz do tabaco RB7 (Patente US 5.750.386; Yamamoto et al., Plant Cell 3: 371 a 382, 1991, aqui incorporada por referência em sua totalidade); e os promotores específicos da célula de raiz reportado por Conkling et al. (Conkling et al., Plant Physiol. 93: 1203 a 1211, 1990, aqui incorporada por referência em sua totalidade) e o promotor de POX1 (Pox1, pox1) (Hertig, et al. Plant Mol. Biol. 16: 171, 1991).identified (Lam et al., Proc. Natl. Acad. Sci. (U.S.A.) 86: 7890 to 7894. 1989, hereby incorporated by reference in their entirety); the ORF13 promoter of Agrobacterium rhizogenes which exhibits high activity in the roots (Hansen etal., Mol. Gen. Genet. 254: 337 to 343. 1997), incorporated herein by reference in its entirety); the promoter for the tobacco root specific gene RB7 (US Patent 5,750,386; Yamamoto et al., Plant Cell 3: 371 to 382, 1991, hereby incorporated by reference in its entirety); and the root cell-specific promoters reported by Conkling et al. (Conkling et al., Plant Physiol. 93: 1203 to 1211, 1990, incorporated herein by reference in its entirety) and the POX1 promoter (Pox1, pox1) (Hertig, et al. Plant Mol. Biol. 16: 171, 1991).

Uma outra classe de promotores específicos de tecido vegetativo útil são os promotores meristemáticos (ponta de raiz e ápice de broto). Por exemplo, os promotores SHOOTMERISTEMLESS e SCARECROW, que são ativos no desenvolvimento de meristemas apicais de broto ou raiz (Di Laurenzio et al., Cell 86: 423 a 433, 1996; Long, Nature 379: 66 a 69, 1996); aqui incorporada por referência em suas totalidades), podem ser usados. Um outro exemplo de um promotor útil é aquele que controla a expressão do gene da 3-hidróxi-3- metilglutaril coenzima A reduetase HMG2, cuja expressão é restrita aos tecidos meristemáticos e florais (zona secretora do estigma, grãos de pólen maduros, tecido vascular de gineceu e óvulos fertilizados) (ver, por exemplo, Enjuto et al., Plant Cell, 7: 517 a 527, 1995, aqui incorporada por referência em sua totalidade). Também um outro exemplo de um promotor útil é aquele que controla a expressão de genes relacionados com knl de milho e outras espécies que apresentam a expressão específica de meristema (ver, por exemplo, Granger etal., Plant Mol. Biol. 31: 373 a 378, 1996; Kerstetter et al., Plant Cell 6: 1877 a 1887, 1994; Hake et al., Philos. Trans. R. Soc. Lond. B. Biol. Sei. 350: 45 a 51, 1995, aqui incorporada por referência em suas totalidades). Um outro exemplo de um promotor meristemático é o promotor KNAT1 Arabidopsis thaliana. No ápice de broto, o transcrito KNAT1 está localizado primariamente no meristema apical de broto; a expressão de KNAT1 no meristema de broto diminui durante a tran66 ···Another class of specific promoters of useful vegetative tissue are meristematic promoters (root tip and bud apex). For example, the promoters SHOOTMERISTEMLESS and SCARECROW, which are active in the development of apical sprout or root meristems (Di Laurenzio et al., Cell 86: 423 to 433, 1996; Long, Nature 379: 66 to 69, 1996); incorporated herein by reference in its entirety), may be used. Another example of a useful promoter is that which controls the expression of the 3-hydroxy-3-methylglutaryl coenzyme HMG2 reductase gene, whose expression is restricted to meristematic and floral tissues (stigma-secreting zone, mature pollen grains, vascular tissue of gynecium and fertilized eggs) (see, for example, Enjuto et al., Plant Cell, 7: 517 to 527, 1995, hereby incorporated by reference in its entirety). Another example of a useful promoter is one that controls the expression of genes related to corn knl and other species that display specific meristem expression (see, for example, Granger etal., Plant Mol. Biol. 31: 373 a 378, 1996; Kerstetter et al., Plant Cell 6: 1877 to 1887, 1994; Hake et al., Philos. Trans. R. Soc. Lond. B. Biol. Sci. 350: 45 to 51, 1995, incorporated herein by reference in their totalities). Another example of a meristematic promoter is the KNAT1 Arabidopsis thaliana promoter. At the bud apex, the KNAT1 transcript is located primarily in the bud apical meristem; KNAT1 expression in the bud meristem decreases during tran66 ···

sição floral e é restrita ao córtex da haste de inflorescência (ver, por exemplo, Lincoln et al., Plant Cell 6: 1859 a 1876, 1994, aqui incorporada por referência em sua totalidade).floral location and is restricted to the cortex of the inflorescence stem (see, for example, Lincoln et al., Plant Cell 6: 1859 to 1876, 1994, hereby incorporated by reference in its entirety).

Os promotores específicos de semente e realçados de semente adequados podem ser derivados dos seguintes genes: MAC1 do milho (Sheridan et al., Genetics 142: 1009 a 1020, 1996, aqui incorporada por referência em sua totalidade); Cat3 do milho (Genbank N^s L05934. Abler et al., Plant Mol. Biol. 22: 10131 a 1038, 1993. aqui incorporada por referência em sua totalidade); vivparous-1 de Arabidopsis (Genbank N² U93215); Atimycl de Arabidopsis (Urao et al., Plant Mol. Biol. 32: 571-57, 1996; Conceicao et al., Plant 5: 493 a 505, 1994, aqui incorporada por referência em suas totalidades); napA de Brassica napus (Genbank N² J02798); a família do gene napin de Brassica napus (Sjodahl et al., Planta 197: 264 a 271, 1995, aqui incorporada por referência em sua totalidade).Suitable seed-specific and seed enhancers can be derived from the following genes: maize MAC1 (Sheridan et al., Genetics 142: 1009 to 1020, 1996, hereby incorporated by reference in its entirety); Maize Cat3 (.... ^S GenBank Accession No. L05934 Abler et al, Plant Mol Biol 22: 10131-1038, 1993 incorporated herein by reference in its entirety); vivparous-1 from Arabidopsis (Genbank No. ² U93215); Atimycl de Arabidopsis (Urao et al., Plant Mol. Biol. 32: 571-57, 1996; Conceicao et al., Plant 5: 493 to 505, 1994, hereby incorporated by reference in their entirety); napA from Brassica napus (Genbank No. ² J02798); the Brassica napus napin gene family (Sjodahl et al., Planta 197: 264 to 271, 1995, hereby incorporated by reference in its entirety).

O promotor específico de óvulo para o gene BEL1 (Reiser et al. Cell 83: 735 a 742, 1995, Genbank N² U39944; Ray et al, Proc. Natl. Acad. Sci. USA 91: 5761 a 5765, 1994, todas as quais são aqui incorporada por referência em suas totalidades) também podem ser usados. Os promotores MEA (FIS1) e FIS2 específicos de ovo e célula central também são promotores específicos de tecido reprodutivo úteis (Luo et al., Proc. Natl. Acad. Sci. USA, 97: 10637 a 10642, 2000; Vielle-Calzada, et al., Genes Dev. 13: 2971 a 2982,1999; aqui incorporada por referência em suas totalidades).The egg specific promoter for the BEL1 gene (Reiser et al. Cell 83: 735 to 742, 1995, Genbank N ² U39944; Ray et al, Proc. Natl. Acad. Sci. USA 91: 5761 to 5765, 1994, all which are hereby incorporated by reference in their entirety) may also be used. The egg and central cell-specific MEA (FIS1) and FIS2 promoters are also useful reproductive tissue-specific promoters (Luo et al., Proc. Natl. Acad. Sci. USA, 97: 10637 to 10642, 2000; Vielle-Calzada, et al., Genes Dev. 13: 2971 to 2982.1999; incorporated herein by reference in its entirety).

Um promotor específico de pólen do milho foi identificado no milho (Guerrero et aí, Mol. Gen. Genet. 224: 161 a 168, 1990, aqui incorporada por referência em sua totalidade). Outros genes especificamente expressados no pólen foram descritos (ver, por exemplo, Wakeley et aí, Plant Mol. Biol. 37: 187 a 192, 1998; Ficker et al., Mol. Gen. Genet 257: 132 a 142, 1998; Kulikauskas et al., Plant Mol. Biol. 34: 809 a 814, 1997; Treacy et al., Plant Mol. Biol. 34: 603 a 611, 1997; todas as quais são aqui incorporadas por referência em suas totalidades).A specific corn pollen promoter has been identified in corn (Guerrero et al., Mol. Gen. Genet. 224: 161 to 168, 1990, hereby incorporated by reference in its entirety). Other genes specifically expressed in pollen have been described (see, for example, Wakeley et al., Plant Mol. Biol. 37: 187 to 192, 1998; Ficker et al., Mol. Gen. Genet 257: 132 to 142, 1998; Kulikauskas et al., Plant Mol. Biol. 34: 809 to 814, 1997; Treacy et al., Plant Mol. Biol. 34: 603 to 611, 1997; all of which are incorporated herein by reference in their entirety).

Os promotores derivados de genes que codificam proteínas de armazenagem embriônica, que inclui o gene que codifica a proteínas de ar67Promoters derived from genes encoding embryonic storage proteins, which include the gene encoding ar proteins67

mazenagem 2S de Brassica napus (Dasgupta et al, Gene 133: 301 a 302, 1993, aqui incorporada por referência em sua totalidade); a família de gene de proteínas de armazenagem em semente 2s de Arabidopsis-, o gene que codifica a oleosina 20kD de Brassica napus (GenBank N^s M63985); os genes que codificam a oleosina A (Genbank N⁹ U09118) e a oleosina B (GenBank N⁹ U09119) da soja; o gene que codifica a oleosina de Arabidopsis (GenBank N⁹ Z17657); o gene que codifica a oleosina 18kD do milho (GenBank N⁹ J05212, Lee, Plant Mol. Biol. 26: 1981 a 1987, 1994), aqui incorporada por referência em sua totalidade); e o gene que codifica a proteína rica em enxofre de peso molecular baixo da soja (Choi et ai., Mol. Gen. Genet. 246: 266 a 268, 1995, aqui incorporada por referência em sua totalidade), também podem ser usados.2S storage of Brassica napus (Dasgupta et al, Gene 133: 301 to 302, 1993, hereby incorporated by reference in its entirety); the Arabidopsis- 2s seed storage protein gene family, the gene encoding Brassica napus 20kD oleosin (GenBank No. ^s M63985); the genes encoding oleosin A (Genbank N ⁹ U09118) and oleosin B (GenBank N ⁹ U09119) from soy; the gene encoding Arabidopsis oleosin (GenBank No. ⁹ Z17657); the gene encoding 18kD maize oleosin (GenBank N ⁹ J05212, Lee, Plant Mol. Biol. 26: 1981 to 1987, 1994), incorporated herein by reference in its entirety); and the gene encoding the low molecular weight sulfur rich protein of soy (Choi et al., Mol. Gen. Genet. 246: 266 to 268, 1995, hereby incorporated by reference in its entirety), may also be used.

Os promotores derivados de zeína que codifica genes (incluindo os genes de 15 kD, 16 kD, 19 kD, 22 kD, 27 kD e gama; Pedersen et al., Cell 29: 1015 a 1026, 1982, aqui incorporada por referência em sua totalidade) também podem ser usados. As zeínas são um grupo de proteínas de armazenagem encontradas no endosperma do milho.Promoters derived from zein encoding genes (including the 15 kD, 16 kD, 19 kD, 22 kD, 27 kD and gamma genes; Pedersen et al., Cell 29: 1015 to 1026, 1982, hereby incorporated by reference in their totality) can also be used. Zeins are a group of storage proteins found in the corn endosperm.

Outros promotores conhecidos funcionar, por exemplo, no milho, incluem os promotores para os seguintes genes: waxy, Brittle, Shrunken 2, Enzimas de ramificação I e II, amido sintases, enzimas de desramificação, oleosinas, glutelinas e sacarose sintases. Um promotor particularmente preferido para a expressão do endosperma do milho é o promotor para o gene da glutelina do arroz, mais particularmente o promotor Osgt-1 (Zheng et al., Mol. Cell Biol. 13: 5829 a 5842, 1993, aqui incorporada por referência em sua totalidade). Os exemplos de promotores adequados para a expressão no trigo incluem aqueles promotores para as subunidades de ADPglicose pirofosforilase (ADPGPP), a ligação de grânulo e outras amido sintases, as enzimas de ramificação e desramificação, as proteínas abundantes em embriogênese, as giiadinas e as gluteninas. Os exemplos de tais promotores no arroz incluem aqueles promotores para as subunidades de ADPGPP, a ligação de grânulo e outras amido sintases, as enzimas de ramificação, as enzimas de desramificação, as sacarose sintases e as glutelinas. Um promotorOther known promoters that work, for example, in maize, include promoters for the following genes: waxy, Brittle, Shrunken 2, Branching enzymes I and II, starch synthase, debranching enzymes, oleosins, glutellins and sucrose synthases. A particularly preferred promoter for the expression of the corn endosperm is the promoter for the rice glutelin gene, more particularly the Osgt-1 promoter (Zheng et al., Mol. Cell Biol. 13: 5829 to 5842, 1993, incorporated herein by reference in its entirety). Examples of promoters suitable for expression in wheat include those promoters for the ADPglucose pyrophosphorylase (ADPGPP) subunits, granule binding and other starch synthases, branching and debranching enzymes, abundant proteins in embryogenesis, giiadins and glutenins . Examples of such promoters in rice include those promoters for the ADPGPP subunits, granule binding and other starch synthases, branching enzymes, debranching enzymes, sucrose synthases and glutellins. A promoter

particularmente preferido é o promotor para a glutelina do arroz, Osgt-1. Os exemplos de tais promotores para a cevada incluem aqueles para as subunidades de ADPGPP, a ligação de grânulo e outras amido sintases, as enzimas de ramificação, as enzimas de desramificação, as sacarose sintases, as hordeínas, as globulinas de embrião e as proteínas específicas de aleurona.particularly preferred is the promoter for rice glutelin, Osgt-1. Examples of such promoters for barley include those for the subunits of ADPGPP, granule binding and other starch synthases, branching enzymes, debranching enzymes, sucrose synthases, hordeins, embryo globulins and specific proteins of aleurone.

Um promotor do tomate ativo durante o amadurecimento da fruta, senescência e abscisão das folhas e, em um grau menor, das flores pode ser usado (Blume et aí, Plant J. 12: 731 a 746, 1997, aqui incorporada por referência em sua totalidade). Outros promotores exemplares incluem o promotor específico do gene SK2 de pistol na batata (Sotanum tuberosum L.), que codifica uma endoquitinase básica específica de pistilo (Ficker et aí, Plant Mol. Biol. 35: 425 a 431, 1997, aqui incorporada por referência em sua totalidade); o gene Blec4 da ervilha (Pisum sativum cv. Alaska), ativo no tecido epidérmico de ápices de broto vegetativo e floral de alfalfa transgênica. Isto o torna uma ferramenta útil para alvejar a expressão de genes estranhos à camada epidérmica de brotos que crescem ativamente. O promotor E8 específico de tecido do tomate também é útil para direcionar a expressão de gene em frutas (Deikman, et aí, Plant Physiology 100: 2013 a 2017, 1992).A tomato promoter active during fruit ripening, senescence and abscission of leaves and, to a lesser extent, flowers can be used (Blume et al., Plant J. 12: 731 to 746, 1997, hereby incorporated by reference in its totality). Other exemplary promoters include the specific promoter of the piston SK2 gene in potatoes (Sotanum tuberosum L.), which encodes a basic pistil-specific endoquitinase (Ficker et al., Plant Mol. Biol. 35: 425 to 431, 1997, incorporated herein by reference in its entirety); the Blec4 gene of the pea (Pisum sativum cv. Alaska), active in the epidermal tissue of apexes of the vegetative and floral buds of transgenic alfalfa. This makes it a useful tool to target the expression of genes foreign to the epidermal layer of actively growing shoots. The tomato tissue-specific E8 promoter is also useful for targeting gene expression in fruits (Deikman, et al., Plant Physiology 100: 2013 to 2017, 1992).

É ainda reconhecido que, visto que na maioria dos casos os limites exatos de seqüências reguladoras não foram completamente definidos, fragmentos de DNA de comprimentos diferentes podem ter atividade promotora idêntica.It is also recognized that, since in most cases the exact limits of regulatory sequences have not been completely defined, DNA fragments of different lengths can have identical promoter activity.

Os promotores que são conhecidos ou são descoberto causar a transcrição de DNA em células vegetais podem ser usados na presente invenção. Tais promotores podem ser obtidos a partir de uma variedade de fontes tais como plantas e vírus de planta. Além dos promotores que são conhecidos por causar a transcrição de DNA em células vegetais, outras moléculas promotoras podem ser identificadas para o uso na corrente invenção triando-se uma biblioteca de cDNA vegetal para os genes que são seletiva ou preferivelmente expressados nos tecidos ou células alvo e isolando a região genômica 5' dos cDNAs identificados.Promoters that are known or are found to cause transcription of DNA in plant cells can be used in the present invention. Such promoters can be obtained from a variety of sources such as plants and plant viruses. In addition to the promoters that are known to cause DNA transcription in plant cells, other promoter molecules can be identified for use in the current invention by screening a plant cDNA library for genes that are selectively or preferably expressed in target tissues or cells. and isolating the 5 'genomic region of the identified cDNAs.

Os constructos ou vetores também podem incluir com a região codificadora de interesse um ácido polinucléico que atua, no todo ou em parte, para terminar a transcrição daquela região. Por exemplo, tais seqüências foram isoladas incluindo a seqüência Tr7 3' e a seqüência nos 3' (Ingelbrecht et al., The Plant Cell 1: 671 a 680, 1989, a totalidade da qual é aqui incorporada por referência; Bevan et al., Nucleic Acids Res. 11: 369 a 385, 1983, a totalidade da qual é aqui incorporada por referência).The constructs or vectors may also include with the coding region of interest a polynucleic acid that acts, in whole or in part, to end the transcription of that region. For example, such sequences have been isolated including the 3 'Tr7 sequence and the 3' sequence (Ingelbrecht et al., The Plant Cell 1: 671 to 680, 1989, all of which are incorporated by reference; Bevan et al. , Nucleic Acids Res. 11: 369 to 385, 1983, all of which are incorporated by reference).

Um vetor ou constructo também podem incluir elementos reguladores. Os exemplos de tais incluem o intron Adh 1 (Callis et al., Genes e Develop. 1; 1183 a 1200, 1987, a totalidade da qual é aqui incorporada por referência), o intron da sacarose sintase (Vasil et al., Plant Physiol. 91: 1575 a 1579, 1989, a totalidade da qual é aqui incorporada por referência) e o elemento TMV ômega (Gallie et al., Plant Cell 1: 301 a 311, 1989, a totalidade da qual é aqui incorporada por referência). Estes e outros elementos reguladores podem ser incluídos quando apropriado.A vector or construct can also include regulatory elements. Examples of such include the intron Adh 1 (Callis et al., Genes and Develop. 1; 1183 to 1200, 1987, all of which is incorporated by reference), the sucrose synthase intron (Vasil et al., Plant Physiol. 91: 1575 to 1579, 1989, all of which is incorporated by reference) and the element TMV omega (Gallie et al., Plant Cell 1: 301 to 311, 1989, all of which is incorporated by reference ). These and other regulatory elements can be included where appropriate.

Um vetor ou constructo também podem incluir um marcador selecionável. Os marcadores selecionáveis também podem ser usados para selecionar plantas ou células vegetais que contêm o material genético exógeno. Os exemplos de tais incluem, mas não são limitados a, um gene neo (Potrykus et al., Mol. Gen. Genet. 199: 183 a 188, 1985, a totalidade da qual é aqui incorporada por referência) que codifica a resistência à canamicina e pode ser selecionada para usar a canamicina, G418, etc.; um gene bar que fornece a resistência ao bialafos; um gene da EPSP sintase mutante (Hinchee et al., Bio/Technology 6: 915 a 922 (1988), a totalidade da qual é aqui incorporada por referência) que fornece a resistência ao glifosato; um gene da nitrilase que fornece resistência à bromoxiniia (Stalker et al., J. Biol. Chem. 263: 6310 a 6314 (1988), a totalidade da qual é aqui incorporada por referência); um gene da acetolactato sintase mutante (ALS) que confere imidazolinona ou sulfoniluréia; e um gene DHFR resistente ao metotrexato (Thillet et aí, J. Biol. Chem. 263; 12500 a 12508, 1988, a totalidade da qual é aqui incorporada por referência).A vector or construct can also include a selectable marker. Selectable markers can also be used to select plants or plant cells that contain exogenous genetic material. Examples of such include, but are not limited to, a neo gene (Potrykus et al., Mol. Gen. Genet. 199: 183 to 188, 1985, all of which is incorporated herein by reference) which encodes resistance to kanamycin and can be selected to use kanamycin, G418, etc .; a bar gene that provides resistance to bialafos; a mutant EPSP synthase gene (Hinchee et al., Bio / Technology 6: 915 to 922 (1988), all of which is incorporated by reference) that provides glyphosate resistance; a nitrilase gene that provides resistance to bromoxynia (Stalker et al., J. Biol. Chem. 263: 6310 to 6314 (1988), all of which is incorporated herein by reference); a mutant acetolactate synthase (ALS) gene that confers imidazolinone or sulfonylurea; and a methotrexate-resistant DHFR gene (Thillet et al., J. Biol. Chem. 263; 12500 to 12508, 1988, all of which is incorporated herein by reference).

Um vetor ou constructo também pode incluir um marcador triá•· ········A vector or construct can also include a triá • • ········

vel. Os marcadores triáveis podem ser usados para monitorar a expressão. Os marcadores triáveis exemplares incluem um gene da β-glicuronidase ou uidA (GUS) que codifica uma enzima para a qual vários substratos cromogênicos são conhecidos (Jefferson, Plant Mol. Biol, Rep. 5: 387 a 405 (1987), a totalidade da qual é aqui incorporada por referência; Jefferson et al., EMBO J. 6: 3901 a 3907 (1987), a totalidade da qual é aqui incorporada por referência); um gene do local R, que codifica um produto que regula a produção de pigmentos de antocianina (cor vermelha) em tecidos vegetais (Dellaporta et al., Stadler Symposium 11: 263 a 282 (1988), a totalidade da qual é aqui incorporada por referência); um gene da β-lactamase (Sutcliffe et al., Proc. Natl. Acad. Sei. (U.S.A.) 75: 3737 a 3741 (1978), a totalidade da qual é aqui incorporada por referência), um gene que codifica uma enzima para as qual vários substratos cromogênicos são conhecidos (por exemplo, PADAC, uma cefalosporina cromogênica); um gene da luciferase (Ow et al., Science 234: 856 a 859 (1986), a totalidade da qual é aqui incorporada por referência); um gene xylE (Zukowsky et al., Proc. Natl. Acad. Sei. (U.S.A.) 80: 1101 a 1105 (1983), a totalidade da qual é aqui incorporada por referência) que codifica uma catecol dioxigenase que pode converter catecóis cromogênicos; um gene da α-amilase (Ikatu et al., Bio/Technol. 8: 241 a 242, 1990, a totalidade da qual é aqui incorporada por referência); um gene da tirosinase (Katz etal., J. Gen. Microbiol. 129: 2703 a 2714,1983, a totalidade da qual é aqui incorporada por referência) que codifica uma enzima capaz de oxidar a tirosina a DOPA e dopaquinona que por sua vez condensa até melanina; e uma a-galactosidase.speed. Trieable markers can be used to monitor expression. Exemplary triple markers include a β-glucuronidase or uidA (GUS) gene that encodes an enzyme for which several chromogenic substrates are known (Jefferson, Plant Mol. Biol, Rep. 5: 387 to 405 (1987), the entire which is incorporated herein by reference; Jefferson et al., EMBO J. 6: 3901 to 3907 (1987), all of which are incorporated by reference); a R site gene, which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al., Stadler Symposium 11: 263 to 282 (1988), all of which is incorporated by reference); a β-lactamase gene (Sutcliffe et al., Proc. Natl. Acad. Sci. (USA) 75: 3737 to 3741 (1978), all of which is incorporated by reference), a gene that encodes an enzyme for which various chromogenic substrates are known (for example, PADAC, a chromogenic cephalosporin); a luciferase gene (Ow et al., Science 234: 856 to 859 (1986), all of which are incorporated by reference); an xylE gene (Zukowsky et al., Proc. Natl. Acad. Sci. (U.S.A.) 80: 1101 to 1105 (1983), all of which is incorporated by reference) that encodes a dioxygenase catechol that can convert chromogenic catechols; an α-amylase gene (Ikatu et al., Bio / Technol. 8: 241 to 242, 1990, all of which are incorporated herein by reference); a tyrosinase gene (Katz etal., J. Gen. Microbiol. 129: 2703 to 2714,1983, all of which is incorporated by reference) that encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to melanin; and a-galactosidase.

Introdução de Polinucleotídeos em PlantasIntroduction of Polynucleotides in Plants

Existem muitos métodos para introduzir moléculas de ácido nucléico transformadoras em células vegetais. Acredita-se que os métodos adequados incluam virtualmente qualquer método pelo qual as moléculas de ácido nucléico possam ser introduzidas em uma célula, tal como pela infecção com Agrobacterium ou liberação direta de moléculas de ácido nucléico tais como, por exemplo, pela transformação mediada por PEG, pela eletroporação ou pela aceleração de partículas revestidas com DNA, etc. (Po• · ·······*There are many methods for introducing transforming nucleic acid molecules into plant cells. Suitable methods are believed to include virtually any method by which nucleic acid molecules can be introduced into a cell, such as by infection with Agrobacterium or direct release of nucleic acid molecules such as, for example, by PEG-mediated transformation , by electroporation or by the acceleration of particles coated with DNA, etc. (Po • · ······· *

» · · · · · · trykus, Ann. Rev. Plant Physiol. Plant Mol. Biol. 42: 205 a 225 (1991), a totalidade da qual é aqui incorporada por referência; Vasil, Plant Mol. Biol. 25: 925 a 937 (1994), a totalidade da qual é aqui incorporada por referência). Por exemplo, a eletroporação foi usada para transformar protoplastos de Zea mays (Fromm et al., Nature 312: 791 a 793, 1986, a totalidade da qual é aqui incorporada por referência).»· · · · · · Trykus, Ann. Rev. Plant Physiol. Plant Mol. Biol. 42: 205 to 225 (1991), all of which are incorporated by reference; Vasil, Plant Mol. Biol. 25: 925 to 937 (1994), all of which are incorporated by reference). For example, electroporation has been used to transform Zea mays protoplasts (Fromm et al., Nature 312: 791 to 793, 1986, all of which are incorporated by reference).

Outros sistemas de vetor adequados para introduzir DNA transformador em uma célula hospedeira vegetal incluem mas não são limitados aos vetores de cromossoma artificial binário (BIBAC) (Hamilton et al., Gene 200: 107 a 116, 1997, a totalidade da qual é aqui incorporada por referência) e a transfecção com vetores de RNA viral (Della-Cioppa et al., Ann. Ν. Y. Acad. Sei. (1996), 792 pp Engineering Plants for Commercial Products e Applications, pp 57 a 61, a totalidade da qual é aqui incorporada por referência.Other vector systems suitable for introducing transforming DNA into a plant host cell include but are not limited to binary artificial chromosome vectors (BIBAC) (Hamilton et al., Gene 200: 107 to 116, 1997, all of which are incorporated herein by reference) and transfection with viral RNA vectors (Della-Cioppa et al., Ann. Ν. Y. Acad. Sei. (1996), 792 pp Engineering Plants for Commercial Products and Applications, pp 57 to 61, all of which is incorporated herein by reference.

A tecnologia para a introdução de DNA em células é bem conhecida por aqueles versados na técnica. Quatro métodos gerais para a liberação de um gene em células foram descritos: (1) métodos químicos (Graham e van der Eb, Virology 54: 536 a 539, 1973, a totalidade da qual é aqui incorporada por referência); (2) métodos físicos tais como microinjeção (Capecchi, Cell 22: 479 a 488 (1980), a totalidade da qual é aqui incorporada por referência), eletroporação (Wong e Neumann, Biochem. Biophys. Res. Commun. 107: 584 a 587 (1982); Fromm et al., Proc. Natl. Acad. Sei. (U.S.A.) 82: 5824 a 5828 (1985); Patente U.S. N^s 5.384.253, todas as quais são aqui incorporadas em sua totalidade); e a pistola de gene (Johnston e Tang, Methods Cell Biol. 43; 353 a 365 (1994), a totalidade da qual é aqui incorporada por referência); (3) vetores virais (Clapp, Clin. Perinatol. 20: 155 a 168, 1993; Lu et al., J. Exp. Med. 178: 2089 a 2096, 1993; Eglitis e Anderson, Biotechniques 6: 608 a 614, 1988, todas as quais são aqui incorporadas em sua totalidade); e (4) mecanismos mediados por receptor (Curiel et al., Hum. Gen. Ther. 3: 147 a 154, 1992, Wagner et al., Proc. Natl. Acad. Sei. USA 89: 6099 a 6103, 1992, todas as quais são incorporadas por referência em sua totalidade).The technology for introducing DNA into cells is well known to those skilled in the art. Four general methods for releasing a gene into cells have been described: (1) chemical methods (Graham and van der Eb, Virology 54: 536 to 539, 1973, all of which are incorporated by reference); (2) physical methods such as microinjection (Capecchi, Cell 22: 479 to 488 (1980), all of which is incorporated by reference), electroporation (Wong and Neumann, Biochem. Biophys. Res. Commun. 107: 584 a 587 (1982); Fromm et al, Proc Natl Acad Sci (USA) 82:..... 5824-5828 (1985); US Patent ^No.'s 5,384,253, all of which are incorporated herein in their entirety); and the gene gun (Johnston and Tang, Methods Cell Biol. 43; 353 to 365 (1994), all of which are incorporated herein by reference); (3) viral vectors (Clapp, Clin. Perinatol. 20: 155 to 168, 1993; Lu et al., J. Exp. Med. 178: 2089 to 2096, 1993; Eglitis and Anderson, Biotechniques 6: 608 to 614, 1988, all of which are incorporated herein in their entirety); and (4) receptor-mediated mechanisms (Curiel et al., Hum. Gen. Ther. 3: 147 to 154, 1992, Wagner et al., Proc. Natl. Acad. Sci. USA 89: 6099 to 6103, 1992, all of which are incorporated by reference in their entirety).

w • ·· ·· · ·· · • · · « • · · · • · · * • · · · ⁴ • ·· ·· ··w • ·· ·· · ·· · • · · «• · · • • · * • · · · ⁴ • ·· ·· ··

Os métodos de aceleração que podem ser usados incluem, por exemplo, o bombardeamento de microprojétil e outros. Um exemplo de um método para a liberação de moléculas de ácido nucléico transformadoras às células vegetais é o bombardeamento de microprojétil. Este método foi revisado por Yang e Christou, eds., Particle Bombardment Technology for Gene Transfer, Oxford Press, Oxford, Inglaterra (1994), a totalidade da qual é aqui incorporada por referência. As partículas não-biológicas (microprojéteis) que podem ser revestidas com ácidos nucléicos e liberados nas células por uma força de propulsão. As partículas exemplares incluem aquelas compreendidas de tungstênio, ouro, platina e outros.The acceleration methods that can be used include, for example, microprojectile bombardment and others. An example of a method for releasing transforming nucleic acid molecules to plant cells is microprojectile bombardment. This method was reviewed by Yang and Christou, eds., Particle Bombardment Technology for Gene Transfer, Oxford Press, Oxford, England (1994), all of which is incorporated by reference. Non-biological particles (microprojectiles) that can be coated with nucleic acids and released into cells by a propelling force. Exemplary particles include those comprised of tungsten, gold, platinum and others.

A transferência mediada por Agrobacterium é um sistema amplamente aplicável para introduzir genes em células vegetais porque o DNA pode ser introduzido nos tecidos vegetais integrais, desviando deste modo a necessidade para a regeneração de uma planta intacta a partir de um protoplasto. O uso de vetores de integração vegetal medidados por Agrobacterium para introduzir DNA em células vegetais é bem conhecido na técnica. Ver, por exemplo os métodos descritos por Fraley et al., Bio/Technology 3: 629 a 635 (1985) e Rogers et al., Methods Enzymol. 153: 253 a 277 (1987), ambas as quais são aqui incorporadas por referência em sua totalidade. Além disso, a integração do Ti-DNA é um processo relativamente preciso que resulta em poucos rearranjos. A região de DNA a ser transferida é definida pelas seqüências da extremidade e o DNA interventor é usualmente inserido no genoma da planta como descrito (Spielmann et al., Mol. Gen. Genet. 205: 34 (1986), a totalidade da qual é aqui incorporada por referência).Agrobacterium-mediated transfer is a widely applicable system for introducing genes into plant cells because DNA can be introduced into whole plant tissues, thereby deflecting the need for the regeneration of an intact plant from a protoplast. The use of plant integration vectors measured by Agrobacterium to introduce DNA into plant cells is well known in the art. See, for example, the methods described by Fraley et al., Bio / Technology 3: 629 to 635 (1985) and Rogers et al., Methods Enzymol. 153: 253 to 277 (1987), both of which are incorporated herein by reference in their entirety. Furthermore, the integration of Ti-DNA is a relatively precise process that results in few rearrangements. The DNA region to be transferred is defined by the end sequences and the intervening DNA is usually inserted into the plant's genome as described (Spielmann et al., Mol. Gen. Genet. 205: 34 (1986), the total of which is incorporated herein by reference).

Uma planta transgênica que resulta de métodos de transformação com Agrobacterium freqüentemente contém um único gene em um cromossoma. Tais plantas transgênicas podem ser aludidas como sendo homozigotas em relação ao gene adicionado. Mais preferido é uma planta transgênica que seja homozigota em relação ao gene estrutural adicionado; isto é, uma planta transgênica que contenha dois genes adicionados, um gene no mesmo local em cada cromossoma de um par de cromossomas. Uma •fr ········ • · · · · • · · • ···· planta transgênica homozigota pode ser obtida acasalando-se sexualmente (autopolinização) uma planta transgênica segregante independente que contenha um único gene adicionado, germinando algumas das sementes produzidas e analisando as plantas resultantes produzidas para o gene de interesse.A transgenic plant that results from Agrobacterium transformation methods often contains a single gene on a chromosome. Such transgenic plants can be alluded to as being homozygous for the added gene. Most preferred is a transgenic plant that is homozygous for the added structural gene; that is, a transgenic plant that contains two added genes, one gene at the same location on each chromosome of a pair of chromosomes. A homozygous transgenic plant can be obtained by sexually mating (self pollinating) an independent segregating transgenic plant that contains a single added gene, germinating (germinating) · germinating (germinating). some of the seeds produced and analyzing the resulting plants produced for the gene of interest.

Também deve ser entendido que duas plantas transgênicas diferentes também podem ser acasaladas para produzir progênie que contenha dois genes exógeno independentemente segregados adicionados. A autopolinização de progênie apropriada pode produzir plantas que sejam homozigotas em relação a ambos os genes exógenos adicionados que codificam um polipeptideo de interesse. O retro-cruzamento com uma planta precursora e o cruzamento com uma planta não -transgênica também são considerados, como o é a propagação vegetativa.It should also be understood that two different transgenic plants can also be mated to produce progeny that contain two independently secreted exogenous genes added. Self-pollination of appropriate progeny can produce plants that are homozygous for both added exogenous genes that encode a polypeptide of interest. Retro-crossing with a precursor plant and crossing with a non-transgenic plant are also considered, as is vegetative propagation.

A regeneração, desenvolvimento e cultivo de plantas a partir de um único transformante de protoplasto vegetal ou a partir de vários explantes transformados são bem conhecidos na técnica (Weissbach e Weissbach, Em: Methods for Plant Molecular Biology, (eds.), Academic Press, Inc. San Diego, CA, (1988), a totalidade da qual é aqui incorporada por referência). Este processo de regeneração e cultivo tipicamente inclui as etapas de seleção de células transformadas, cultivando-se estas células individualizadas através dos estágios usuais de desenvolvimento embriônico até o estágio de muda enraizada. Os embriões e as sementes transgênicas são similarmente regenerados. Os brotos enraizados transgênicos resultantes são depois disso plantados em um meio de cultivo de planta apropriado tal como o solo.The regeneration, development and cultivation of plants from a single plant protoplast transformant or from several transformed explants are well known in the art (Weissbach and Weissbach, In: Methods for Plant Molecular Biology, (eds.), Academic Press, Inc. San Diego, CA, (1988), all of which are incorporated by reference). This process of regeneration and cultivation typically includes the stages of selection of transformed cells, cultivating these individualized cells through the usual stages of embryonic development to the rooted seedling stage. Embryos and transgenic seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant cultivation medium such as the soil.

O desenvolvimento ou regeneração de plantas contendo o gene estranho, exógeno que codifica uma proteína de interesse é bem conhecido na técnica. Preferivelmente, as plantas regeneradas são autopolinizadas para fornecer plantas transgênicas homozigotas. De outro modo, o pólen obtido a partir das plantas regeneradas é cruzado com plantas desenvolvidas com semente de linhagens agronomicamente importantes. Ao contrário, o pólen de plantas destas linhagens importantes é usado para polinizar plantas regeneradas. Uma planta transgênica da presente invenção conten74The development or regeneration of plants containing the foreign, exogenous gene encoding a protein of interest is well known in the art. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from regenerated plants is crossed with plants developed with seed from agronomically important strains. In contrast, pollen from plants of these important strains is used to pollinate regenerated plants. A transgenic plant of the present invention contains 74

do um polipeptídeo desejado é cultivada usando métodos bem conhecidos por uma pessoa versada na técnica.A desired polypeptide is cultured using methods well known to a person skilled in the art.

A presente invenção também fornece partes das plantas da presente invenção. As partes de planta, sem limitação, incluem semente, endosperma, óvulo e pólen. Em uma forma de realização particularmente preferida da presente invenção, a parte de planta é uma semente.The present invention also provides parts of the plants of the present invention. The plant parts, without limitation, include seed, endosperm, egg and pollen. In a particularly preferred embodiment of the present invention, the plant part is a seed.

Os métodos para transformar dicotiledôneas, primariamente pelo uso de Agrobacterium tumefaciens e a obtenção de plantas transgênicas foi publicado, por exemplo, algodão (Patente U.S. N^e 5.004.863. Patente U.S. N^s 5.159.135. Patente U.S. N^Q 5.518.908, todas as quais são aqui incorporada por referência em sua totalidade), soja (Patente U.S. Ν^Ω 5.569.834. a totalidade da qual é aqui incorporada por referência) e Brassica (Patente U.S. N^e 5.463.174. a totalidade da qual é aqui incorporada por referência).Methods for transforming dicots, primarily by use of Agrobacterium tumefaciens , and obtaining transgenic plants have been published, for example, cotton (US Patent No. 5,004,863 ^and. 5,159,135 US Patent ^Nos. US Patent No. 5518908 ^Q, all of which are incorporated herein by reference in its entirety), soybean (US Patent 5,569,834 ^{Ν Ω.} the entirety of which is hereby incorporated by reference) and Brassica (US Patent No. 5,463,174 ^and. the entirety of which is hereby incorporated by reference).

A transformação de monocotiledôneas usando a eletroporação, bombardeamento de partícula e Agrobacterium também foi relatada. Por exemplo, a transformação e a regeneração de planta foi obtida no aspargo, cevada, Zea mays (Fromm et al., Bio/Technology 8: 833 (1990), Armstrong et al., Crop Science 35: 550 a 557 (1995), todas as quais são aqui incorporada por referência em sua totalidade): aveia; arroz, centeio, cana de açúcar; festuca alta e trigo (Patente U.S. N^Q 5.631.152, a totalidade da qual é aqui incorporada por referência).The transformation of monocots using electroporation, particle bombardment and Agrobacterium has also been reported. For example, plant transformation and regeneration was achieved in asparagus, barley, Zea mays (Fromm et al., Bio / Technology 8: 833 (1990), Armstrong et al., Crop Science 35: 550 to 557 (1995) , all of which are incorporated by reference in their entirety): oats; rice, rye, sugar cane; tall fescue and wheat (US Patent No. ^Q 5,631,152, all of which is incorporated by reference).

Além dos procedimentos debatidos acima, os técnicos estão familiarizados com os materiais de recurso padrão que descrevem condições e procedimentos específicos para a construção, manipulação e isolação de macromoléculas (por exemplo, moléculas de DNA, plasmídeos, etc.), a geração de organismos recombinantes e a triagem e isolação de clones, (ver por exemplo, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989); Mailga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995), a totalidade da qual é aqui incorporada por referência; Birren et al., Genome Analisis: Detecting Genes, 1, Cold Spring Harbor, Nova Iorque (1998), a totalidade da qual é aqui incorporada por referência; Birren et al., Genome Analysis: Analyzing DNA, 2, ColdIn addition to the procedures discussed above, technicians are familiar with standard resource materials that describe specific conditions and procedures for the construction, manipulation and isolation of macromolecules (eg, DNA molecules, plasmids, etc.), the generation of recombinant organisms and the screening and isolation of clones, (see for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989); Mailga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995 ), all of which is incorporated by reference; Birren et al., Genome Analisis: Detecting Genes, 1, Cold Spring Harbor, New York (1998), all of which is incorporated by reference; Birren et al., Genome Analysis: Analyzing DNA, 2, Cold

Í3Í3

Spring Harbor, Nova Iorque (1998), a totalidade da qual é aqui incorporada por referência; Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer, Nova Iorque (1997), a totalidade da qual é aqui incorporada por referência).Spring Harbor, New York (1998), all of which are incorporated by reference; Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer, New York (1997), all of which are incorporated by reference).

Tendo agora no geral descrito a invenção, a mesma será mais facilmente entendida através da referência aos seguintes exemplos, que são fornecidos por via de ilustração e não são intencionados a serem limitantes da presente invenção, a menos que especificado.Having now generally described the invention, it will be more easily understood by reference to the following examples, which are provided by way of illustration and are not intended to be limiting of the present invention, unless specified.

EXEMPLOSEXAMPLES

EXEMPLO 1EXAMPLE 1

Quando um polinucleotideo vegetal nativo isolado que compreende uma seqüência codificadora é reconstruído como um transgene, depois introduzido na planta pelos métodos de transformação vegetal existe um risco de que a expressão do gene vegetal homólogo endógeno interagirá negativamente com o transgene. Para evitar estas interações negativas pode ser necessário fornecer um polinucleotideo de transgene substancialmente divergente em seqüência do gene vegetal nativo. Uma molécula de polinucleotídeo artificial pode ser produzida pelo método da presente invenção e usada para reduzir a ocorrência de silenciamento de transgene.When an isolated native plant polynucleotide comprising a coding sequence is reconstructed as a transgene, then introduced into the plant by plant transformation methods, there is a risk that the expression of the endogenous homologous plant gene will interact negatively with the transgene. To avoid these negative interactions it may be necessary to provide a substantially divergent transgene polynucleotide in sequence with the native plant gene. An artificial polynucleotide molecule can be produced by the method of the present invention and used to reduce the occurrence of transgene silencing.

Este exemplo serve para ilustrar os métodos da presente invenção que resultam na produção de um polinucleotideo que codifica uma EPSP sintase vegetal modificada. A enzima de EPSPS e o peptídeo de trânsito de cloroplasto do arroz nativo (Oryzae sativa) são usados para construir uma molécula de polinucleotideo artificial que também inclui códons que codificam aminoácidos substituídos que não ocorrem naturalmente na enzima de EPSPS do arroz. Estes aminoácidos substituídos fornecem uma enzima de EPSPS do arroz resistente ao glifosato (OsEPSPS_TIPS, SEQ ID NO: 1).This example serves to illustrate the methods of the present invention that result in the production of a polynucleotide that encodes a modified plant EPSP synthase. The EPSPS enzyme and the native rice chloroplast transit peptide (Oryzae sativa) are used to build an artificial polynucleotide molecule that also includes codons that encode naturally occurring substituted amino acids in the EPSPS enzyme in rice. These substituted amino acids provide a glyphosate resistant rice EPSPS enzyme (OsEPSPS_TIPS, SEQ ID NO: 1).

As etapas descritas na Tabela 6 são usadas para construir uma tal seqüência de polinucleotideo artificial (OsEPSPS_AT, SEQ ID NO: 3) usando uma tabela de uso do códon de Arabidopsis e os parâmetros para a construção de uma molécula de polinucleotideo substancialmente divergente, que quando expressada em plantas codifica uma enzima de EPSPS doThe steps described in Table 6 are used to construct such an artificial polynucleotide sequence (OsEPSPS_AT, SEQ ID NO: 3) using an Arabidopsis codon usage table and the parameters for the construction of a substantially divergent polynucleotide molecule, which when expressed in plants encodes an EPSPS enzyme from

arroz modificada resistente ao herbicida de glifosato. A comparação da seqüência de gene de EPSPS do arroz nativo aludida como OsEPSPS_Nat (SEQ ID NO: 2) que foi previamente modificada para codificar uma enzima resistente ao glifosato com a molécula de polinucleotídeo modificada quanto ao uso de códon de Arabidopsis, OsEP3PS_AT (SEQ ID NO: 3) e com a seqüência modificada quanto ao uso de códon de Zea mays, OsEPSPS_ZM (SEQ ID NO: 4) por este método é mostrada na Figura 1. A Figura 1 mostra bases de nucleotídeo mudadas nos polinucleotídeos modificados comparados à OsEPSPS_Nat, SEQ ID NO: 2.modified rice resistant to the glyphosate herbicide. The comparison of the native rice EPSPS gene sequence referred to as OsEPSPS_Nat (SEQ ID NO: 2) that was previously modified to encode a glyphosate-resistant enzyme with the modified polynucleotide molecule for the use of Arabidopsis codon, OsEP3PS_AT (SEQ ID NO: 3) and with the modified sequence regarding the use of Zea mays codon, OsEPSPS_ZM (SEQ ID NO: 4) by this method is shown in Figure 1. Figure 1 shows changed nucleotide bases in the modified polynucleotides compared to OsEPSPS_Nat, SEQ ID NO: 2.

Tabela 6. Planejamento de polinucleotídeo para uma EPSP sintase de arroz modificada (OsEPSPS_AT)Table 6. Polynucleotide planning for a modified EPSP synthase of rice (OsEPSPS_AT)

1. Substituir os aminoácidos nas posições 173 e 177 para fornecer uma enzima de EPSPS do arroz modificada resistente ao herbicida de glifosato mostrada na SEQ ID NO: 1.1. Substitute amino acids at positions 173 and 177 to provide a modified rice EPSPS enzyme resistant to the glyphosate herbicide shown in SEQ ID NO: 1.

2. Retro traduzir a SEQ ID NO: 1 para gerar uma seqüência de polinucleotídeo artificial usando a tabela de uso de códon de Arabidopsis thaliana (Tabela 2).2. Retro translate SEQ ID NO: 1 to generate an artificial polynucleotide sequence using the Arabidopsis thaliana codon usage table (Table 2).

3. Realizar o alinhamento de seqüência com a seqüência de polinucleotídeo OsEPSPS nativa (SEQ ID NO: 2) e a seqüência de polinucleotídeo artificial para determinar o grau de identidade de seqüência, mapear as matrizes de leitura aberta, selecionar padrões para procurar e identificar seqüências de reconhecimento de enzimas de restrição.3. Perform sequence alignment with the native OsEPSPS polynucleotide sequence (SEQ ID NO: 2) and the artificial polynucleotide sequence to determine the degree of sequence identity, map the open reading matrices, select patterns to search for and identify sequences of restriction enzyme recognition.

4. Fazer correções nos códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e evitar a formação de cacho de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada de uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Arabidopsis, Tabela 2.4. Make corrections to the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon use in the polynucleotide sequence according to the use of Arabidopsis codon, Table 2.

5. A seqüência de polinucleotídeo é inspecionada quanto a regiões locais que tenham uma relação GC:AT mais alta do que cerca de 2 em uma faixa de cerca de 50 nucleotideos contíguos. A seqüência de polinucleotídeo5. The polynucleotide sequence is inspected for local regions that have a GC: AT ratio higher than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence

é ajustada como necessário, substituindo-se os códons nestas regiões tal que a relação GC:AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira esteja na faixa de 0,9 a 1,3.it is adjusted as necessary, replacing the codons in these regions such that the local GC: AT ratio is less than about 2 and the entire polynucleotide composition is in the range of 0.9 to 1.3.

6. Introduzir códons de parada às matrizes de tradução b, c, d, e e T. Os códons de parada de tradução são criados nas matrizes de tradução b, c, d, e e f substituindo-se um ou mais códons dentro de cerca de 130 pares de bases (pares de base) das extremidades do polinucleotídeo artificial que cria um códon de parada sem mudar a seqüência codificadora de aminoácido da matriz um.6. Introduce stop codons to translation matrices b, c, d, e and T. Translation stop codons are created in translation matrices b, c, d, e and by replacing one or more codons within about 130 base pairs (base pairs) of the ends of the artificial polynucleotide that creates a stop codon without changing the amino acid coding sequence of matrix one.

7. Eliminar os códons de ATG das matrizes de leitura aberta avançada (matrizes b e c) e reversa (matrizes d, e, T). As matrizes de leitura avançada e reversa são inspecionadas quanto a presença de códons ATG. Qualquer códon ATG nas matrizes b e c encontrado na seqüência de polinucleotídeo antes do terceiro Met na matriz a do polinucleotídeo é eliminado pela substituição de um ou mais códons que envolve o ATG mudando um dos nucleotídeos sem mudar a seqüência codificadora de aminoácido da matriz a”. Nas matrizes reversas, a substituição de ATG ou a introdução do códon de parada podem ser feitos para interromper matrizes de leitura potenciais.7. Eliminate the ATG codons from the advanced open reading matrices (matrices b and c) and reverse (matrices d, e, T). The forward and reverse reading matrices are inspected for the presence of ATG codons. Any ATG codon in matrices b and c found in the polynucleotide sequence before the third Met in matrix a of the polynucleotide is eliminated by replacing one or more codons that surround the ATG by changing one of the nucleotides without changing the amino acid coding sequence of matrix a ”. In reverse arrays, substitution of ATG or introduction of the stop codon can be done to interrupt potential reading arrays.

8. Eliminar os padrões de reconhecimento das enzimas de restrição não desejadas e outros padrões específicos (poliadenilação, junção de RNA, padrões de instabilidade de seqüência). A seqüência de polinucleotídeo é inspecionada quanto a presença de qualquer padrão de polinucleotídeo não desejado e os padrões são rompidos pela substituição de códons nestas regiões.8. Eliminate the recognition patterns of unwanted restriction enzymes and other specific patterns (polyadenylation, RNA junction, sequence instability patterns). The polynucleotide sequence is inspected for the presence of any unwanted polynucleotide patterns and the patterns are disrupted by replacing codons in these regions.

9. Checar a identidade de seqüência entre um primeiro polinucleotídeo e o polinucleotídeo artificial criado pelo método da presente invenção. Eliminar a identidade de seqüência em um polinucleotídeo contíguo que seja maior do que 23 pares de base. É desejável eliminar a identidade de seqüência maior do que cerca de 15 pares de base. É útil selecionar a partir de aminoácidos tais como, serina, arginina e leucina que têm 6 códons ou a partir de aminoácidos com 4 códons para eliminar a identidade de seqüên-9. Check the sequence identity between a first polynucleotide and the artificial polynucleotide created by the method of the present invention. Eliminate the sequence identity in a contiguous polynucleotide that is greater than 23 base pairs. It is desirable to eliminate the sequence identity greater than about 15 base pairs. It is useful to select from amino acids such as serine, arginine and leucine that have 6 codons or from amino acids with 4 codons to eliminate sequence identity.

«20«20

cia.cia.

10. Revisar a seqüência de polinucleotídeo artificial que resulta de qualquer uma das etapas de 1 a 9 quanto a qualquer uma das características de seqüência identificadas nas etapas de 4 a 9 e se a seqüência não estiver de acordo com as condições realizar substituições de códon adicionais à seqüência até que as condições das etapas de 4 a 9 sejam atingidas.10. Review the artificial polynucleotide sequence that results from any of steps 1 through 9 for any of the sequence characteristics identified in steps 4 through 9 and if the sequence does not meet the conditions, perform additional codon substitutions to the sequence until the conditions of steps 4 to 9 are reached.

11. Construir a molécula de polinucleotídeo artificial pelos métodos conhecidos na técnica, por exemplo, usando PCR com uma mistura de iniciadores envolvidos. Os iniciadores nas extremidades do gene podem conter sítios de restrição convenientes para permitir a clonagem fácil do gene no vetor selecionado. Na extremidade 5' usualmente Alflll, BspHI, Ncol, Ndel, Pcil ou Sphl são mais convenientes visto que a sua seqüência contém um códon de partida ATG, entretanto outras enzimas também podem ser usadas se um polinucleotídeo modificado é planejado para criar uma fusão com um outro segmento de polinucleotídeo, por exemplo, peptídeo de trânsito de cloroplasto e seqüência codificadora de EPSPS.11. Build the artificial polynucleotide molecule by methods known in the art, for example, using PCR with a mixture of primers involved. The primers at the ends of the gene may contain convenient restriction sites to allow easy cloning of the gene into the selected vector. At the 5 'end usually Alflll, BspHI, Ncol, Ndel, Pcil or Sphl are more convenient since their sequence contains an ATG starting codon, however other enzymes can also be used if a modified polynucleotide is designed to create a fusion with a another polynucleotide segment, for example, chloroplast transit peptide and EPSPS coding sequence.

12. Realizar uma análise de seqüência de DNA do polinucleotídeo artificial para confirmar a construção sintética resultante na molécula de polinucleotídeo desejada. Se erros são encontrados, então eliminar os mesmos pela mutagênese direcionada ao sítio para a qual muitos métodos são conhecidos por aqueles versados na técnica da mutagênese de DNA.12. Perform a DNA sequence analysis of the artificial polynucleotide to confirm the resulting synthetic construct in the desired polynucleotide molecule. If errors are found, then eliminate them by mutagenesis directed to the site for which many methods are known to those skilled in the DNA mutagenesis technique.

Uma versão do uso de códon de Zea mays (Zea mays, Tabela 3) da seqüência de enzima de EPSPS do arroz resistente ao glifosato (enzima de EPSPS do Oryzae sativa com mutações TIPS, SEQ ID NO: 1) é fabricada. O polinucleotídeo que codifica esta enzima inclui códons que codificam aminoácidos substituídos que não ocorrem naturalmente na enzima de EPSPS do arroz nativa. Estes aminoácidos substituídos fornecem uma enzima de EPSPS do arroz resistente ao glifosato. As etapas descritas na Tabela 7 são usadas para construir uma seqüência de polinucleotídeo artificial modificada (OsEPSPS_ZM, SEQ ID NO: 4) com base em uma tabela de uso do códon de Zea mays que codifica uma enzima de EPSPS do arroz modificada resistente ao herbicida de glifosato. A comparação da seqüência de • · · · · · ·A version of the Zea mays codon usage (Zea mays, Table 3) of the EPSPS enzyme sequence of glyphosate resistant rice (EPSPS enzyme from Oryzae sativa with TIPS mutations, SEQ ID NO: 1) is manufactured. The polynucleotide encoding this enzyme includes codons that encode substituted amino acids that do not occur naturally in the native EPSPS enzyme in rice. These substituted amino acids provide a glyphosate-resistant EPSPS enzyme in rice. The steps described in Table 7 are used to construct a modified artificial polynucleotide sequence (OsEPSPS_ZM, SEQ ID NO: 4) based on a Zea mays codon usage table that encodes a modified EPSPS enzyme from the herbicide resistant rice. glyphosate. The comparison of the sequence of • · · · · · ·

SY polinucleotídeo OsEPSPS_Nat (SEQ ID NO: 2) com a seqüência de polinucleotídeo artificial OsEPSPS_ZM (SEQ ID NO: 4) usando o uso de códon de Zea mays é mostrada na Figura 2.SY polynucleotide OsEPSPS_Nat (SEQ ID NO: 2) with the artificial polynucleotide sequence OsEPSPS_ZM (SEQ ID NO: 4) using the Zea mays codon is shown in Figure 2.

Tabela 7. Construção de polinucleotídeo para a EPSP sintase de arroz modificada (OsEPSPS_ZM)Table 7. Polynucleotide construction for modified EPSP synthase of rice (OsEPSPS_ZM)

1. Retro traduzir a SEQ ID NO: 1 para gerar uma seqüência de polinucleotídeo artificial usando a tabela de uso do códon de Zea mays (Tabela 3).1. Retro translate SEQ ID NO: 1 to generate an artificial polynucleotide sequence using the Zea mays codon usage table (Table 3).

2. Realizar o alinhamento de seqüência com a seqüência de polinucleotídeo OsEPSPS nativa (SEQ ID NO: 2) e a seqüência de polinucleotídeo artificial para determinar o grau de identidade de seqüência, mapear as matrizes de leitura aberta, selecionar padrões para procurar e identificar seqüências de reconhecimento de enzimas de restrição.2. Perform sequence alignment with the native OsEPSPS polynucleotide sequence (SEQ ID NO: 2) and the artificial polynucleotide sequence to determine the degree of sequence identity, map the open reading matrices, select patterns to search for and identify sequences of restriction enzyme recognition.

3. Fazer correções nos códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e evitar a formação de cachos de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada de uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Zea mays, Tabela 3.3. Make corrections to the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon usage in the polynucleotide sequence according to Zea mays codon usage, Table 3.

4. A seqüência de polinucleotídeo é inspecionada quanto às regiões locais que têm uma relação GC:AT maior do que cerca de 2 em uma faixa de cerca de 50 nucleotídeos contíguos. A seqüência de polinucleotídeo é ajustada como necessário, pela substituição de códons nestas regiões tal que a relação GC:AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira esteja na faixa de 1,2 a 1,7.4. The polynucleotide sequence is inspected for local regions that have a GC: AT ratio greater than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence is adjusted as necessary, by replacing codons in these regions such that the local GC: AT ratio is less than about 2 and the entire polynucleotide composition is in the range of 1.2 to 1.7.

5. Seguir as etapas de 6 a 12 da Tabela 6.5. Follow steps 6 through 12 in Table 6.

• · · ·····• · · ·····

Tabela 8. Identidade de seqüência percentual entre polinucleotídeos OsEPSPS.Table 8. Percentage sequence identity among OsEPSPS polynucleotides.

OsEPSPS ZM OsEPSPS ZM OsEPSPS AT OsEPSPS AT OsEPSPS Nat OsEPSPS Nat OsEPSPS ZM OsEPSPS ZM 100,00 100.00 73,51 73.51 71,58 71.58 OsEPSPS AT OsEPSPS AT 100,00 100.00 74,03 74.03 OsEPSPS Nat OsEPSPS Nat 100,00 100.00

Tabela 9. A composição de nucleotídeo e a relação GC:AT das seqüências de polinucleotídeo modificadas para a seqüência de gene de EPSPS do arroz.Table 9. The nucleotide composition and the GC: AT ratio of the polynucleotide sequences modified to the EPSPS gene sequence of rice.

A THE C Ç G G T T GC.AT GC.AT OsEPSPS. OsEPSPS. _AT _AT 377 377 336 336 444 444 391 391 1.02 1.02 OsEPSPS. OsEPSPS. ZM ZM 365 365 381 381 470 470 332 332 1.22 1.22

As duas seqüências de polinucleotídeo artificial de EPSPS do arroz (SEQ ID NO: 3 e SEQ ID NO: 4) são modificadas tal que a identidade percentual esteja abaixo de 75 por cento comparada com a SEQ ID NO: 2 ou em relação uma com a outra (Tabela 8). A composição de nucleotídeo e a relação GC:AT das seqüências de polinucleotídeo para a seqüência de gene de EPSPS do arroz são mostradas na Tabela 9. Estes polinucleotídeos podem ser selecionados para o uso em constructos de expressão vegetal junto com elementos reguladores diferentes ou eles podem ser combinados em uma única planta pela retransformação com um constructo de DNA ou pelos métodos de cruzamento vegetal. Problemas com o silenciamento e a recombinação de gene são reduzidos quando os constructos de DNA têm níveis reduzidos de DNA homólogo.The two artificial EPSPS polynucleotide sequences in rice (SEQ ID NO: 3 and SEQ ID NO: 4) are modified such that the percent identity is below 75 percent compared to SEQ ID NO: 2 or in relation to one another (Table 8). The nucleotide composition and the GC: AT ratio of the polynucleotide sequences to the rice EPSPS gene sequence are shown in Table 9. These polynucleotides can be selected for use in plant expression constructs together with different regulatory elements or they can be combined into a single plant by retransformation with a DNA construct or by plant breeding methods. Problems with gene silencing and recombination are reduced when DNA constructs have reduced levels of homologous DNA.

EXEMPLO 2EXAMPLE 2

O Milho (Zea mays) foi geneticamente modificado para ter resistência ao herbicida de glifosato (Patente US N^Q 6.040.497). Estas plantas de milho contêm um transgene com uma EPSP sintase do milho modificado quanto a tolerância ao glifosato. Os métodos da presente invenção podem ser usados para construir um novo polinucleotídeo artificial que codifica uma EPSP sintase do milho que é substancialmente diferente na identidade percentual ao gene da EPSP sintase do milho endógeno. O polinucleotídeo arti• · · $3 ficial da EPSP sintase do milho recém construído pode ser usado como um marcador selecionável durante a seleção de linhagens de planta transgênica que podem conter traços agronômicos transgênicos adicionais. Durante a produção de semente de milho híbrido, é útil ter ambos os precursores tolerantes ao glifosato usando transgenes que não interfiram.Corn (Zea mays) was genetically modified to have resistance to the glyphosate herbicide (US Patent No. ^Q 6,040,497). These maize plants contain a transgene with a modified EPSP synthase from maize for glyphosate tolerance. The methods of the present invention can be used to construct a new artificial polynucleotide that encodes a corn EPSP synthase that is substantially different in percentage identity to the endogenous corn EPSP synthase gene. The arti • · · $ 3 polynucleotide from newly constructed maize EPSP synthase can be used as a selectable marker when selecting transgenic plant strains that may contain additional transgenic agronomic traits. During hybrid maize seed production, it is useful to have both glyphosate tolerant precursors using non-interfering transgenes.

Tabela 10. Constructo de polinucleotídeo para a EPSP sintase do milho modificada (ZmEPSPS_ZM, SEQ ID NO: 10)Table 10. Polynucleotide construct for modified corn EPSP synthase (ZmEPSPS_ZM, SEQ ID NO: 10)

1. Retro traduzir a SEQ ID NO: 8 para gerar uma seqüência de polinucleotídeo usando a tabela de uso do códon de Zea mays (Tabela 3).1. Retro translate SEQ ID NO: 8 to generate a polynucleotide sequence using the Zea mays codon usage table (Table 3).

2. Realizar o alinhamento de seqüência com a seqüência de polinucleotídeo ZmEPSPS_Nat (SEQ ID NO: 9) e a seqüência de polinucleotídeo artificial para determinar o grau de identidade de seqüência, mapear as matrizes de leitura aberta, selecionar padrões para procurar e identificar seqüências de reconhecimento de enzima de restrição.2. Perform sequence alignment with the polynucleotide sequence ZmEPSPS_Nat (SEQ ID NO: 9) and the artificial polynucleotide sequence to determine the degree of sequence identity, map the open reading matrices, select patterns to search for and identify sequences of restriction enzyme recognition.

3. Fazer correções nos códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e para evitar a formação de cachos de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada de uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Zea mays, Tabela 3.3. Make corrections to the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and to avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon usage in the polynucleotide sequence according to Zea mays codon usage, Table 3.

4. A seqüência de polinucleotídeo artificial é inspecionada quanto às regiões locais que têm uma relação GC:AT maior do que cerca de 2 em uma faixa de cerca de 50 nucleotídeos contíguos. A seqüência de polinucleotídeo é ajustada como necessário, pela substituição de códons nestas regiões tal que a relação GC:AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira esteja na faixa de 1,2 a 1,7.4. The artificial polynucleotide sequence is inspected for local regions that have a GC: AT ratio greater than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence is adjusted as necessary, by replacing codons in these regions such that the local GC: AT ratio is less than about 2 and the entire polynucleotide composition is in the range of 1.2 to 1.7.

• · · • ·• · · • ·

I · » · · ·I · »· · ·

Tabela 11. Identidade de seqüência percentual entre polinucleotídeos ZmEPSPS.Table 11. Percentage sequence identity among ZmEPSPS polynucleotides.

ZmEPSPS ZM ZmEPSPS ZM ZmEPSPS Nat ZmEPSPS Nat ZmEPSPS ZM ZmEPSPS ZM 100.00 100.00 74.81 74.81 ZmEPSPS Nat ZmEPSPS Nat 100.00 100.00

A seqüência de nucleotídeo do gene de EPSPS do milho também é modificada para reduzir identidade entre o gene sintético e nativo e manter a relação GC:AT global típica para as monocotiledôneas. A relação GC:AT para a seqüência de ZmEPSPS_ZM é de 1,38. A identidade de seqüência é reduzida a cerca de 75 % entre a nativa (ZmEPSPS_Nat, SEQ ID NO: 9) e a sintética (ZmEPSPS_ZM, SEQ ID NO: 10).The nucleotide sequence of the EPSPS gene in corn is also modified to reduce identity between the synthetic and native gene and maintain the typical global GC: AT ratio for monocots. The GC: AT ratio for the ZmEPSPS_ZM sequence is 1.38. The sequence identity is reduced to about 75% between the native (ZmEPSPS_Nat, SEQ ID NO: 9) and the synthetic (ZmEPSPS_ZM, SEQ ID NO: 10).

A comparação de polinucleotídeos nativos que codificam a EPSPS indica que o peptídeo de trânsito de cloroplasto é o fragmento mais divergente do gene. A similaridade na seqüência de nucleotídeo de peptídeos maduros é maior do que 88 % para as enzimas do milho e do arroz e algumas regiões conservadas têm identidade de seqüência tão longas quanto 50 pares de base. O silenciamento de gene pós-transcricional foi observado para seqüências tão pequenas quanto 60 polinucleotídeos (Sijen et al., Plant Cell, 8: 2277 a 2294. 1996; Mains, Plant Mol. Biol. 43: 261 a 273, 2000).The comparison of native polynucleotides encoding EPSPS indicates that the chloroplast transit peptide is the most divergent fragment of the gene. The nucleotide sequence similarity of mature peptides is greater than 88% for corn and rice enzymes and some conserved regions have sequence identities as long as 50 base pairs. Post-transcriptional gene silencing has been observed for sequences as small as 60 polynucleotides (Sijen et al., Plant Cell, 8: 2277 to 2294. 1996; Mains, Plant Mol. Biol. 43: 261 to 273, 2000).

EXEMPLO 3EXAMPLE 3

A soja (Glicina max) foi geneticamente modificada para ser tolerante ao glifosato pela expressão de uma EPSPS de classe II isolada de Agrobacterium (Padgette et al. Crop Sci. 35: 1451 a 1461, 1995). Uma seqüência de gene de EPSPS da soja nativa foi identificada e uma seqüência de polinucleotídeo artificial planejada usando o método da presente invenção. O polinucleotídeo artificial codifica uma seqüência de proteína que é modificada para produzir uma enzima de EPSPS resistente ao glifosato (GmEPSPSJKS, SEQ ID NO: 5) pela substituição dos aminoácidos T para I, R para K e P para S dentro do motivo GNAGTAMRP, resultando em uma enzima de EPSPS de soja modificada com o motivo GNAGIAMKS (SEQ ID NO: 34), também aludido como o mutante IKS. A expressão de uma enzimaSoy (Glycine max) was genetically modified to be glyphosate tolerant by the expression of a class II EPSPS isolated from Agrobacterium (Padgette et al. Crop Sci. 35: 1451 to 1461, 1995). An EPSPS gene sequence from native soybean was identified and an artificial polynucleotide sequence designed using the method of the present invention. The artificial polynucleotide encodes a protein sequence that is modified to produce a glyphosate-resistant EPSPS enzyme (GmEPSPSJKS, SEQ ID NO: 5) by replacing the amino acids T for I, R for K and P for S within the GNAGTAMRP motif, resulting in in a soybean EPSPS enzyme modified with the motif GNAGIAMKS (SEQ ID NO: 34), also referred to as the IKS mutant. The expression of an enzyme

de EPSPS modificada nas células de uma planta pela transformação com um cassete de expressão vegetal transgênico, que contém um polinucleotídeo que codifica a EPSPS modificada com o motivo GNAGIAMKS conferirá a tolerância ao glifosato às plantas. As substituições de aminoácido adicionais para a arginina (R) no motivo também pode incluir asparagina (N). Tabela 12. Constructo de polinucleotídeo para o gene da EPSP sintase de soja modificado (GmEPSPS_GM, SEQ ID NO: 7).of EPSPS modified in the cells of a plant by transformation with a transgenic plant expression cassette, which contains a polynucleotide that encodes the EPSPS modified with the GNAGIAMKS motif will confer glyphosate tolerance to plants. Additional amino acid substitutions for arginine (R) in the motif can also include asparagine (N). Table 12. Polynucleotide construct for the modified soybean EPSP synthase gene (GmEPSPS_GM, SEQ ID NO: 7).

1. Retro traduzir a SEQ ID NO: 5 para gerar uma seqüência de polinucleotídeo artificial usando a tabela de uso do códon de Glicina max (Tabela 4).1. Retro translate SEQ ID NO: 5 to generate an artificial polynucleotide sequence using the Glycine max codon usage table (Table 4).

2. Realizar o alinhamento de seqüência com a seqüência de polinucleotídeo GmEPSPS_Nat (SEQ ID NO: 6) e a seqüência de polinucleotídeo artificial para determinar o grau de identidade de seqüência, mapear as matrizes de leitura aberta, selecionar os padrões para procurar e identificar as seqüências de reconhecimento de enzima de restrição.2. Perform sequence alignment with the polynucleotide sequence GmEPSPS_Nat (SEQ ID NO: 6) and the artificial polynucleotide sequence to determine the degree of sequence identity, map the open reading matrices, select the patterns to search for and identify the restriction enzyme recognition sequences.

3. Fazer as correções para os códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e para evitar a formação de cachos de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada de uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Glicina max, Tabela 4.3. Make corrections for the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and to avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon usage in the polynucleotide sequence according to the use of Glycine max codon, Table 4.

4. A seqüência de polinucleotídeo é inspecionada quanto às regiões locais que têm uma relação GC:AT maior do que cerca de 2 em uma faixa de cerca de 50 nucleotídeos contíguos. A seqüência de polinucleotídeo é ajustada como necessário, pela substituição de códons nestas regiões tal que a relação GC:AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira esteja na faixa de 0,9 a 1,3.4. The polynucleotide sequence is inspected for local regions that have a GC: AT ratio greater than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence is adjusted as necessary, by replacing codons in these regions such that the local GC: AT ratio is less than about 2 and the entire polynucleotide composition is in the range of 0.9 to 1.3.

» · « • · · • · · »··· · ·»·« • · · · · »··· · ·

Tabela 13. Comparação da identidade de seqüência percentual do GmEPSPS modificado ao nível da seqüência de polinucleotídeo.Table 13. Comparison of the percentage sequence identity of the modified GmEPSPS at the level of the polynucleotide sequence.

GmEPSPS GM GmEPSPS GM GmEPSPS Nat GmEPSPS Nat GmEPSPS GM GmEPSPS GM 100,00 100.00 72,43 72.43 GmEPSPS Nat GmEPSPS Nat 100,00 100.00

O gene EPSPS da soja nativa é modificado usando uma tabela de códon de soja (Tabela 4) e as condições do método da presente invenção. A relação relativa de GC:AT não é mudada no gene modificado, entretanto a identidade de seqüência entre as duas é reduzida em 72 %. EXEMPLO 4The EPSPS gene of native soybeans is modified using a soy codon table (Table 4) and the conditions of the method of the present invention. The relative ratio of GC: AT is not changed in the modified gene, however the sequence identity between the two is reduced by 72%. EXAMPLE 4

O gene do polinucleotídeo aroA nativo isolado de Agrobacterium cepa CP4 (Patente U.S. N^s 5.633.435, aqui incorporada por referência em sua totalidade) que codifica uma EPSP sintase resistente ao glifosato (SEQ ID NO; 15) pode ser modificada pelo método da presente invenção para fornecer um polinucleotídeo que tenha o uso de códon de Arabidopsis, Zea mays ou Glicina max. Para a expressão apropriada de CP4EPSPS para conferir a tolerância ao glifosato em plantas, um peptídeo de trânsito de cloroplasto é necessariamente fundido à seqüência codificadora de CP4EPSPS para alvejar o acúmulo da enzima nos cloroplastos. O peptídeo de trânsito de cloroplasto CTP2 é habitualmente usado para a expressão deste gene em plantas transgênicas (Nida et al., J. Agric. Food Chem. 44: 1960 a 1966, 1996). A seqüência de CP4EPSPS junta com o polinucleotídeo CTP2 (SEQ ID NO: 11) foram modificados pelo método da presente invenção. Outros peptídeos de trânsito de cloroplasto conhecidos na técnica podem ser fundidos a CP4EPSPS para direcionar a enzima aos cloroplastos.The aroA gene native polynucleotide isolated from Agrobacterium strain CP4 (US Patent No. 5,633,435 ^are incorporated herein by reference in its entirety) that encodes a glyphosate resistant EPSP synthase (SEQ ID NO: 15) may be modified by this method invention to provide a polynucleotide using the codon of Arabidopsis, Zea mays or Glycine max. For the proper expression of CP4EPSPS to confer tolerance to glyphosate in plants, a chloroplast transit peptide is necessarily fused to the CP4EPSPS coding sequence to target the enzyme accumulation in chloroplasts. The CTP2 chloroplast transit peptide is commonly used for the expression of this gene in transgenic plants (Nida et al., J. Agric. Food Chem. 44: 1960 to 1966, 1996). The CP4EPSPS sequence together with the CTP2 polynucleotide (SEQ ID NO: 11) were modified by the method of the present invention. Other chloroplast transit peptides known in the art can be fused to CP4EPSPS to target the enzyme to chloroplasts.

Tabela 14. Construção de polinucleotídeo para a seqüência codificadora da EPSP sintase de aroA:CP4 (CP4EPSPS_AT, CP4EPSPS_ZM ou CP4EPSPS_GM)Table 14. Polynucleotide construction for the coding sequence for the EPSP synthase of aroA: CP4 (CP4EPSPS_AT, CP4EPSPS_ZM or CP4EPSPS_GM)

1. Colocar a seqüência de peptídeo de trânsito de CTP2 (SEQ ID1. Place the CTP2 transit peptide sequence (SEQ ID

NO: 11) em frente de CP4EPSPS (SEQ ID NO: 15) como um polipeptídeo de fusão. Retro traduzir o polipeptídeo de fusão para produzir uma seqüência de polinucleotídeo artificial usando a tabela de uso de códon de Arabidopsis thaliana (Tabela 2), ou a tabela de uso do códon de Zea mays (Tabela 3) ou a tabela de uso do códon de Glicina max (Tabela 4).NO: 11) in front of CP4EPSPS (SEQ ID NO: 15) as a fusion polypeptide. Retro translate the fusion polypeptide to produce an artificial polynucleotide sequence using the Arabidopsis thaliana codon usage table (Table 2), or the Zea mays codon usage table (Table 3) or the codon usage table Max glycine (Table 4).

2. Realizar o alinhamento de seqüência com as seqüências de polinucleotídeo CTP2 nativo (SEQ ID NO: 12) e CP4EPSPS nativo (SEQ ID NO: 16) e a seqüência de polinucleotídeo artificial para determinar grau de identidade de seqüência, mapear as matrizes de leitura aberta, selecionar padrões para procurar e identificar seqüências de reconhecimento de enzima de restrição.2. Perform sequence alignment with the native CTP2 polynucleotide (SEQ ID NO: 12) and native CP4EPSPS (SEQ ID NO: 16) and the artificial polynucleotide sequence to determine the degree of sequence identity, map the reading matrices open, select patterns to search for and identify restriction enzyme recognition strings.

3. Fazer correções nos códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e para evitar a formação de cachos de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada do uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Arabidopsis thaliana, Tabela 2 ou a tabela de uso do códon de Zea mays (Tabela 3) dependendo da tabela em uso.3. Make corrections to the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and to avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon usage in the polynucleotide sequence according to the use of Arabidopsis thaliana codon, Table 2 or the Zea mays codon usage table (Table 3) depending on the table in use.

4. A seqüência de polinucleotídeo artificial é inspecionada quanto às regiões locais que têm uma relação GC:AT maior do que cerca de 2 em uma faixa de cerca de 50 nucleotídeos contíguos. A seqüência de polinucleotídeo é ajustada como necessário, pela substituição de códons nestas regiões tal que a relação GC:AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira estando na faixa de 0,9 a 1,3 é a Tabela 2 que é usada e de 1,2 a 1,7 se a Tabela 3 é usada.4. The artificial polynucleotide sequence is inspected for local regions that have a GC: AT ratio greater than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence is adjusted as necessary, by replacing codons in these regions such that the local GC: AT ratio is less than about 2 and the entire polynucleotide composition being in the range of 0.9 to 1.3 is the Table 2 that is used and from 1.2 to 1.7 if Table 3 is used.

Tabela 15. Comparação da identidade de seqüência percentual dos polinucleotídeos CP4EPSPS artificiais.Table 15. Comparison of percentage sequence identity of artificial CP4EPSPS polynucleotides.

CTP2CP GM CTP2CP GM CTP2CP4 _AT CTP2CP4 _AT CTP2CP4 _ZM CTP2CP4 _ZM CTP2CP4 _Syn CTP2CP4 _Syn CTP2CP4 _NAT CTP2CP4 _NAT CTP2CP4 GM GM CTP2CP4 100,00 100.00 75,66 75.66 74,12 74.12 75,15 75.15 74,37 74.37 CTP2CP4 AT CTP2CP4 AT 100,00 100.00 76,13 76.13 74,56* 74.56 * 72,93 72.93 CTP2CP4 ZM CTP2CP4 ZM 100,00 100.00 77,76* 77.76 * 82,58 82.58 CTP2CP4 Syn CTP2CP4 Syn 100,00 100.00 82,70 82.70 CTP2CP4 NAT CTP2CP4 NAT 100,00 100.00

*Α porcentagem de identidade diz respeito à CP4EPSPS e não incluem o peptídeo de trânsito.* Α percentage of identity concerns CP4EPSPS and does not include the transit peptide.

Tabela 16. A composição de nucleotídeo e a relação GC:AT das seqüências de polinucleotideo artificiais para a seqüência de gene de CP4EPSPS.Table 16. The nucleotide composition and the GC: AT ratio of the artificial polynucleotide sequences to the CP4EPSPS gene sequence.

A THE C Ç G G T T GC:AT GC: AT CTP2CP4_GM CTP2CP4_GM 382 382 375 375 442 442 397 397 1.05 1.05 CTP2CP4_AT CTP2CP4_AT 369 369 408 408 469 469 350 350 1.22 1.22 CTP2CP4 ZM CTP2CP4 ZM 312 312 487 487 577 577 290 290 1.65 1.65

A seqüência de polinucleotideo CTP2_Nat (SEQ ID NO: 12) mais CP4EPSPS_Nat (SEQ ID NO: 16) designada como CTP2CP4_Nat é comparada na Tabela 15 com as seqüências de polinucleotideo artificiais designadas como CTP2CP4_AT (CTP2_AT, SEQ ID NO: 13 fundida a CP4EPSPS_AT, SEQ ID NO: 17) e CTP2CP4_ZM (CTP2_AT, SEQ ID NO:The polynucleotide sequence CTP2_Nat (SEQ ID NO: 12) plus CP4EPSPS_Nat (SEQ ID NO: 16) designated as CTP2CP4_Nat is compared in Table 15 with the artificial polynucleotide sequences designated as CTP2CP4_AT (CTP2_AT, SEQ ID NO: 13 fused to CP4, fused to CP4 SEQ ID NO: 17) and CTP2CP4_ZM (CTP2_AT, SEQ ID NO:

14 fundida a CP4EPSPS_ZM, SEQ ID NO: 18) produzidas pelo método da presente invenção. A seqüência de polinucleotideo que é a mais divergente das seqüências nativas CTP2CP4_NAT e CTP2CP4EPSPS_Syn é a CTP2CP4_AT tendo cerca de 73 % e 75 % de identidade de seqüência, respectivamente. A seqüência de polinucleotideo CTP2CP4_ZM comparada com a CTP2CP4_Nat e CP4EPSPS_Syn tem cerca de 83 % e 78 % de identidade com estas duas seqüências, respectivamente.14 fused to CP4EPSPS_ZM, SEQ ID NO: 18) produced by the method of the present invention. The polynucleotide sequence that is the most divergent from the native CTP2CP4_NAT and CTP2CP4EPSPS_Syn sequences is CTP2CP4_AT having about 73% and 75% sequence identity, respectively. The polynucleotide sequence CTP2CP4_ZM compared to CTP2CP4_Nat and CP4EPSPS_Syn has about 83% and 78% identity with these two sequences, respectively.

Um critério primário para a seleção de transgenes para combinar em uma planta é a identidade percentual. A Tabela 15 pode ser usada para selecionar uma molécula de polinucleotideo de CP4EPSPS para a constru20 ção do cassete de expressão vegetal quando é conhecido que a planta receptora conterá mais do que um polinucleotideo CP4EPSPS. A relação GC:AT na CP4EPSPS nativa é de cerca de 1,7. A versão artificial com o códon de eleição de Zea mays é produzida para ter uma relação GC:AT muito similar. Na versão do códon de Arabidopsis, a relação GC:AT é diminuída em cerca de 1,2.A primary criterion for selecting transgenes to combine in a plant is percent identity. Table 15 can be used to select a CP4EPSPS polynucleotide molecule for the construction of the plant expression cassette when it is known that the recipient plant will contain more than one CP4EPSPS polynucleotide. The GC: AT ratio in native CP4EPSPS is about 1.7. The artificial version with the Zea mays codon of choice is produced to have a very similar GC: AT ratio. In the codon version of Arabidopsis, the GC: AT ratio is decreased by about 1.2.

A expressão de gene também é um critério para a seleção de transgenes a serem expressados. A expressão de um transgene pode variar em plantas de safra diferentes, portanto ter várias seqüências codificadoras ·· ♦·······Gene expression is also a criterion for the selection of transgenes to be expressed. The expression of a transgene can vary in different crop plants, therefore having several coding sequences ·· ♦ ·······

safra e um as87 de polinucleotídeo artificiais disponíveis para testar em plantas de genótipos, variedades ou cultivares diferentes é uma vantagem e pecto da invenção.crop and an artificial polynucleotide as87 available for testing on plants of different genotypes, varieties or cultivars is an advantage and aspect of the invention.

EXEMPLO 5EXAMPLE 5

A seqüência de polinucleotídeo bar (SEQ ID NO: 20) que codifica uma proteína de fosfinotricina acetil transferase (SEQ ID NO: 19) foi usada para modificar geneticamente plantas quanto a resistência ao herbicida de glufosinato. Duas seqüências de polinucleotídeo bar novas foram planejadas usando o método da presente invenção. O alinhamento de BAR1_Nat com os dois polinucleotídeos BAR1 novos artificiais é mostrado na Figura 4. Tabela 17. Construção de gene de polinucleotídeo para BAR1_AT (SEQ ID NO: 21) e BAR1_ZM (SEQ ID NO: 22)The polynucleotide bar sequence (SEQ ID NO: 20) encoding a phosphinothricin acetyl transferase protein (SEQ ID NO: 19) was used to genetically modify plants for resistance to the glufosinate herbicide. Two new bar polynucleotide sequences were designed using the method of the present invention. The alignment of BAR1_Nat with the two new artificial BAR1 polynucleotides is shown in Figure 4. Table 17. Polynucleotide gene construction for BAR1_AT (SEQ ID NO: 21) and BAR1_ZM (SEQ ID NO: 22)

1. Retro traduzir a SEQ ID NO: 19 para gerar uma seqüência de polinucleotídeo usando a tabela de uso de códon de Arabidopsis thaliana (Tabela 2) ou a tabela de uso do códon de Zea mays (Tabela 3)1. Retro translate SEQ ID NO: 19 to generate a polynucleotide sequence using the Arabidopsis thaliana codon usage table (Table 2) or the Zea mays codon usage table (Table 3)

2. Realizar o alinhamento de seqüência com a seqüência de polinucleotídeo BAR1_Nat nativo (SEQ ID NO: 20) e a seqüência de polinucleotídeo artificial para determinar o grau de identidade de seqüência, mapear as estruturas de leitura aberta, selecionar os padrões para procurar e identificar as seqüências de reconhecimento de enzima de restrição.2. Perform sequence alignment with the native BAR1_Nat polynucleotide sequence (SEQ ID NO: 20) and the artificial polynucleotide sequence to determine the degree of sequence identity, map open reading structures, select patterns to search for and identify the restriction enzyme recognition strings.

3. Fazer correções para os códons usados na seqüência de polinucleotídeo artificial para se obter a porcentagem desejada de identidade de seqüência e para evitar a formação de cachos de códons idênticos. Isto é especialmente importante para os aminoácidos que ocorrem em alta freqüência, isto é, alanina, glicina, histidina, leucina, serina e valina. Distribuição aproximada de uso de códon na seqüência de polinucleotídeo de acordo com o uso de códon de Arabidopsis thaliana, Tabela 2 ou a tabela de uso do códon de Zea mays (Tabela 3) dependendo da tabela em uso.3. Make corrections to the codons used in the artificial polynucleotide sequence to obtain the desired percentage of sequence identity and to avoid the formation of clusters of identical codons. This is especially important for amino acids that occur in high frequency, that is, alanine, glycine, histidine, leucine, serine and valine. Approximate distribution of codon usage in the polynucleotide sequence according to the use of Arabidopsis thaliana codon, Table 2 or the Zea mays codon usage table (Table 3) depending on the table in use.

4. A seqüência de polinucleotídeo artificial é inspecionada quanto às regiões locais que têm uma relação GC:AT maior do que cerca de 2 em uma faixa de cerca de 50 nucleotídeos contíguos. A seqüência de polinucleotídeo é ajustada como necessário, pela substituição de códons nestas regi• · ·····♦·· ··· ··· ·· • · · · · « · · · · ··· · · · • · · · ões tal que a relação GC.AT local seja menor do que cerca de 2 e a composição de polinucleotídeo inteira esteja na faixa de 0,9 a 1,3 se a Tabela 2 é usada e 1,2 a 1,7 se a Tabela 3 é usada.4. The artificial polynucleotide sequence is inspected for local regions that have a GC: AT ratio greater than about 2 in a range of about 50 contiguous nucleotides. The polynucleotide sequence is adjusted as necessary by replacing codons in these regions • · ····· ♦ ·· ··· ··· ·· • · · · · · · · · ··· · · · • · · · Ions such that the local GC.AT ratio is less than about 2 and the entire polynucleotide composition is in the range of 0.9 to 1.3 if Table 2 is used and 1.2 to 1.7 if Table 3 is used.

A identidade de seqüência de polinucleotídeos BAR artificiais está na faixa de 73 a 77 % (Tabela 18). O polinucleotídeo nativo é altamente rico em GC. A versão artificial (BAR1_ZM) com o códon de eleição de Zea mays reduziu a relação GC:AT a cerca de 1,3 e a versão artificial (BAR1_AT) com códon de eleição de Arabidopsis a relação é de cerca de 1,0 (Tabela 19).The sequence identity of artificial BAR polynucleotides is in the range of 73 to 77% (Table 18). The native polynucleotide is highly rich in GC. The artificial version (BAR1_ZM) with the Zea mays codon of choice reduced the GC: AT ratio to about 1.3 and the artificial version (BAR1_AT) with the Arabidopsis codon of choice the ratio is about 1.0 (Table 19).

Tabela 18. Identidade de seqüência percentual entre os genes bar ao nível de seqüência de polinucleotídeo.Table 18. Percentage sequence identity between bar genes at the polynucleotide sequence level.

BAR1 ZM BAR1 ZM BAR1 AT BAR1 AT BAR1 Nat BAR1 Nat BAR1 ZM BAR1 ZM 100.00 100.00 77.35 77.35 76.99 76.99 BAR1 AT BAR1 AT 100.00 100.00 73.73 73.73 BAR1 Nat BAR1 Nat 100.00 100.00

Tabela 19. Composição de nucleotídeo e a relação GC:AT das seqüências de polinucleotídeo artificiais para a seqüência de gene bar.Table 19. Nucleotide composition and the GC: AT ratio of artificial polynucleotide sequences to the bar gene sequence.

BAR_AT BAR_AT 139 139 130 130 144 144 139 139 1.01 1.01 BAR ZM BAR ZM 122 122 156 156 154 154 120 120 1.28 1.28

EXEMPLO 6EXAMPLE 6

Este exemplo serve para ilustrar os constructos de DNA para a expressão dos polinucleotídeos artificiais da presente invenção em plantas. Um cassete de expressão vegetal de DNA transgênico compreende elementos reguladores que controlam a transcrição de um mRNA do cassete. Um cassete de expressão vegetal é construído para incluir um promotor que funciona em plantas que é operavelmente ligado a uma região líder 5' que é operavelmente ligada a uma seqüência de DNA de interesse operavelmente ligada a uma região de término 3'. Estes cassetes são construídos em vetores plasmídicos, que podem ser depois transferidos em plantas pelos métodos de transformação mediados por Agrobacterium ou outros métodos conhecidos por aqueles versados na técnica da transformação de planta. Os »······· γThis example serves to illustrate the DNA constructs for the expression of the artificial polynucleotides of the present invention in plants. A transgenic DNA plant expression cassette comprises regulatory elements that control the transcription of an mRNA from the cassette. A plant expression cassette is constructed to include a plant-functioning promoter that is operably linked to a leader 5 'region that is operably linked to a DNA sequence of interest operably linked to a 3' end region. These cassettes are constructed in plasmid vectors, which can then be transferred to plants by Agrobacterium-mediated transformation methods or other methods known to those skilled in the art of plant transformation. The »······· γ

seguintes constructos de vetor plasmídico são ilustradas para fornecer exemplos de plasmídeos contendo cassetes de expressão vegetal que compreendem as moléculas de polinucleotídeo artificiais da presente invenção e não são limitadas a estes exemplos.The following plasmid vector constructs are illustrated to provide examples of plasmids containing plant expression cassettes that comprise the artificial polynucleotide molecules of the present invention and are not limited to these examples.

As moléculas de polinucleotídeo artificiais da presente invenção, por exemplo, CP4EPSPS_AT e CP4EPSPS_ZM são sintetizadas usando iniciadores envolvidos. O produto de tamanho natural é depois amplificado com iniciadores específicos de gene contendo projeções com Sphl (iniciador avançado) e EcoRI (iniciador reverso). Os genes são clonados no vetor pCRIl-TOPO (Invitrogen, CA). Os plasmídeos pMON54949 (CP4EPSPS_AT, Figura 6) e pMON54950 (CP4EPSPS_ZM, Figura 7) resultantes contêm o polinucleotídeo artificial e estes polinucleotídeos são seqüenciados usando métodos de seqüenciamento de DNA para confirmar que as modificações projetadas pelo método da presente invenção estão contidas nos polinucleotídeos artificiais. Na etapa seguinte, o polinucleotídeo artificial que codifica o peptídeo de trânsito de cloroplasto CTP2 é ligado à extremidade 5' dos polinucleotídeos CP4EPSPS. O promotor CaMV 35S com um realçador duplicado (P-CaMVe35S) e um intron de actina 1 de arroz (l-OsAct1) derivado de pMON30151 (Figura 8) pela digestão com Sphl e Hindlll ligados aos polinucleotídeos CTP2CP4EPSPS para criar os plasmídeos pMON59302 (CTP2CP4EPSPS_AT, Figura 9) e pMON59307 (CTP2CP4EPSPS_ZM, Figura 10).The artificial polynucleotide molecules of the present invention, for example, CP4EPSPS_AT and CP4EPSPS_ZM are synthesized using primers involved. The full-size product is then amplified with specific gene primers containing projections with Sphl (advanced primer) and EcoRI (reverse primer). The genes are cloned into the pCRIl-TOPO vector (Invitrogen, CA). The resulting plasmids pMON54949 (CP4EPSPS_AT, Figure 6) and pMON54950 (CP4EPSPS_ZM, Figure 7) contain the artificial polynucleotide and these polynucleotides are sequenced using DNA sequencing methods to confirm that the modifications designed by the method of the present invention are contained in the artificial polynucleotides. In the next step, the artificial polynucleotide encoding the CTP2 chloroplast transit peptide is attached to the 5 'end of the CP4EPSPS polynucleotides. The CaMV 35S promoter with a duplicate enhancer (P-CaMVe35S) and a rice actin 1 intron (l-OsAct1) derived from pMON30151 (Figure 8) by digestion with Sphl and Hindlll linked to the CTP2CP4EPSPS polynucleotides to create the pMON59302 (CTP2PS2) plasmids (CTP2PS2) , Figure 9) and pMON59307 (CTP2CP4EPSPS_ZM, Figure 10).

Para a expressão dos novos polinucleotídeos artificiais em plantas monocotiledôneas, os genes são colocados nos cassetes de expressão vegetal contendo na extremidade 5* do polinucleotídeo, um promotor e um intron, uma região não-traduzida 5' e na extremidade 3' do polinucleotídeo um sinal de término de transcrição. Para este propósito, pMON42411 (Figura 11) contendo P-CaMV35S:en, I-HSP70, CTP2CP4_Nat e NOS 3' são digeridos com enzimas de restrição Ncol e EcoRI. O pMON59302 (Figura 9) e pMON59307 (Figura 10) são digeridos com as mesmas enzimas de restrição. Os fragmentos são purificados em gel usando o kit de purificação em gel da Qiagen e ligados para formar pMON58400 (CP4EPSPS_AT, Figura • ·· ···« • · · · • · · · · » ·« · · ···<For the expression of the new artificial polynucleotides in monocotyledonous plants, the genes are placed in the plant expression cassettes containing at the 5 * end of the polynucleotide, a promoter and an intron, an untranslated region 5 'and at the 3' end of the polynucleotide a signal of transcription termination. For this purpose, pMON42411 (Figure 11) containing P-CaMV35S: en, I-HSP70, CTP2CP4_Nat and NOS 3 'are digested with restriction enzymes Ncol and EcoRI. PMON59302 (Figure 9) and pMON59307 (Figure 10) are digested with the same restriction enzymes. The fragments are gel purified using the Qiagen gel purification kit and ligated to form pMON58400 (CP4EPSPS_AT, Figure • ·· ··· «• · · · · · · ·» · «· · ··· <

12) e pMON58401 (CP4EPSPS_ZM, Figura 13). O vetor pMON54964 adicional (Figura 14), contendo P-OsAct1/ l-OsAct1 é fabricado pela substituição de P-e35S/l-Hsp70 a partir de pMON58400 (Figura 12) usando o fragmento Hindlll /Ncol de pMON25455 (Figura 15). Para criar um vetor de expressão de monocotiledônea contendo o promotor P-FMV, pMON30152 (Figura 16) é digerido com Nhel, as extremidades são feitas abruptas com T4DNA polimerase na presença de 4 dNTP-s (200 μΜ) e Ncol. Os fragmentos de DNA CPT2CP4_AT ou CTP2CP4_ZM são isolados de pMON59302 (Figura 9) ou pMON59307 (Figura 10), respectivamente digerindo-se com EcoRI, fazendose abrupto com T4 DNA polimerase e digestão com Ncol. Os fragmentos de DNA purificados em gel são ligados e novos plasmídeos pMON54992 (CTP2CP4_AT, Figura 17) e pMON54985 (CTP2CP4_ZM, Figura 18) são criados. Em cada caso a construção plasmídica bem sucedida é confirmada pela digestão com endonuclease de restrição, usando entre outras Clal (introduzida em ambos os polinucleotídeos artificiais) e Pst I (introduzida no CP4EPSPS_ZM). O CP4EPSPS_Nat presente nos vetores precursores tem tanto Clal quanto dois sítios de restrição Pstl na região codificadora em localização diferente daquela nos polinucleotídeos artificiais.12) and pMON58401 (CP4EPSPS_ZM, Figure 13). The additional vector pMON54964 (Figure 14), containing P-OsAct1 / l-OsAct1 is manufactured by substituting P-e35S / l-Hsp70 from pMON58400 (Figure 12) using the Hindlll / Ncol fragment of pMON25455 (Figure 15). To create a monocot expression vector containing the P-FMV promoter, pMON30152 (Figure 16) is digested with Nhel, the ends are made abrupt with T4DNA polymerase in the presence of 4 dNTP-s (200 μΜ) and Ncol. The DNA fragments CPT2CP4_AT or CTP2CP4_ZM are isolated from pMON59302 (Figure 9) or pMON59307 (Figure 10), respectively by digesting with EcoRI, abruptly using T4 DNA polymerase and digesting with Ncol. The gel-purified DNA fragments are ligated and new plasmids pMON54992 (CTP2CP4_AT, Figure 17) and pMON54985 (CTP2CP4_ZM, Figure 18) are created. In each case, successful plasmid construction is confirmed by restriction endonuclease digestion, using among others Clal (introduced in both artificial polynucleotides) and Pst I (introduced in CP4EPSPS_ZM). The CP4EPSPS_Nat present in the precursor vectors has both Clal and two Pstl restriction sites in the coding region in a different location than in the artificial polynucleotides.

Para a expressão de polinucleotídeos CP4EPSPS artificiais em plantas dicotiledôneas, dois vetores precursores são usados: pMON20999 (P-FMV/CTP2CP4_Syn/3'E-9, Figura 19) e pMON45313 (Pe35S/CTP2CP4_Syn/3'E9, Figura 20). Em cada plasmídeo, um fragmento de DNA contendo o polinucleotídeo CTP2CP4_Syn é substituído com CTP2CP4_AT ou CTP2CP4_ZM. Para criar pMON59308 (PCaMVe35S/CTP2CP4_AT, Figura 21) ou pMON59309 (PCaMVe35S/CTP2CP4_ZM, Figura 22), pMON45313 é digerido com Ncol e EcoRI e os fragmentos de restrição de DNA derivados de digestões com Ncol/EcoRI de pMON59302 (CTP2CP4_AT, Figura 9) ou pMON59307 (CTP2CP4_ZM, Figura 10) são ligados, respectivamente. Para criar pMON59313 (P-FMV/CTP2CP4_AT/3’E9, Figura 23) e pMON59396 (PFMV/CTP2CP4_ZM/3'E9, Figura 24) o plasmídeo precursor pMON20999 é digerido com Ncol e BamHI para remover CTP2CP4_Syn e os fragmentosFor the expression of artificial CP4EPSPS polynucleotides in dicotyledonous plants, two precursor vectors are used: pMON20999 (P-FMV / CTP2CP4_Syn / 3'E-9, Figure 19) and pMON45313 (Pe35S / CTP2CP4_Syn / 3'E9, Figure 20). In each plasmid, a DNA fragment containing the polynucleotide CTP2CP4_Syn is replaced with CTP2CP4_AT or CTP2CP4_ZM. To create pMON59308 (PCaMVe35S / CTP2CP4_AT, Figure 21) or pMON59309 (PCaMVe35S / CTP2CP4_ZM, Figure 22), pMON45313 is digested with Ncol and EcoRI and the DNA restriction fragments derived from digestions with Ncol / EcoRI_ of CM2 ) or pMON59307 (CTP2CP4_ZM, Figure 10) are connected, respectively. To create pMON59313 (P-FMV / CTP2CP4_AT / 3'E9, Figure 23) and pMON59396 (PFMV / CTP2CP4_ZM / 3'E9, Figure 24) the precursor plasmid pMON20999 is digested with Ncol and BamHI to remove CTP2CP4_Syn and fragments

··· • · • · ··· ··· • · • · ··· ··· ·· • · · • · · • · · • · · ··· ·· • · · • · · • · · • · · ···· • • • • · ···· • • • • · • ·· ········ ·· · · · · • · · · • · · · • · · · • ·· ········ ·· · · · · • · · · • · · · • · · · ·· ·· ··· ·· ··· ·· ·· ·· ······· · · ······· · · • •

de restrição Ncol/BamHI derivados de pMON59308 (CTP2CP4_AT, Figura 21) ou pMON59309 (CTP2CP4_ZM, Figura 22) são ligados, respectivamente.Ncol / BamHI restriction measures derived from pMON59308 (CTP2CP4_AT, Figure 21) or pMON59309 (CTP2CP4_ZM, Figure 22) are linked, respectively.

EXEMPLO 7EXAMPLE 7

Os polinucleotídeos artificiais são testados para determinar a eficácia para conferir tolerância ao glifosato às plantas transgênicas. Cinco cassetes de expressão diferentes (Tabela 20) com os novos polinucleotídeos CP4EPSPS artificiais são transformados no milho e as plantas de milho transgênico resultante comparadas com o padrão comercial (Roundup Ready® Corn 603, Monsanto Co.). O plasmídeo pMON25496 (Figura 25) contido no padrão comercial tem duas cópias do polinucleotídeo CP4EPSPS_Nat, a expressão dirigida pelos promotores P-CaMVe35S (P-CaMVe35S) e POsActl, respectivamente. Os plasmídeos contendo os novos polinucleotídeos de CP4EPSPS artificiais contêm apenas uma única cópia do polinucleotídeo a ser testado. A expressão destes polinucleotídeos são dirigidas pelo promotor P-CaMVe35S com o intron da proteína de choque térmico l-Hsp70 ou o promotor P-FMV com um intron da sacarose sintase do arroz (1-OsSS). O plasmídeo pMON54964 contém o promotor da actina 1 do arroz com o primeiro intron nativo (Patente U.S. N^Q 5.641.876, aqui incorporada por referência em sua totalidade).Artificial polynucleotides are tested to determine the effectiveness of giving transgenic plants tolerance to glyphosate. Five different expression cassettes (Table 20) with the new artificial CP4EPSPS polynucleotides are transformed into maize and the resulting transgenic maize plants compared to the commercial standard (Roundup Ready® Corn 603, Monsanto Co.). The plasmid pMON25496 (Figure 25) contained in the commercial standard has two copies of the polynucleotide CP4EPSPS_Nat, the expression directed by the promoters P-CaMVe35S (P-CaMVe35S) and POsActl, respectively. Plasmids containing the new artificial CP4EPSPS polynucleotides contain only a single copy of the polynucleotide to be tested. The expression of these polynucleotides is directed by the P-CaMVe35S promoter with the heat shock protein intron 1-Hsp70 or the P-FMV promoter with a rice sucrose synthase (1-OSSS) intron. Plasmid pMON54964 contains the rice actin 1 promoter with the first native intron (US Patent No. ^Q 5,641,876, incorporated herein by reference in its entirety).

Estes plasmídeos são transformados em células de milho por um método mediado por Agrobacterium e as linhagens de milho transgênico regeneradas em seleção com glifosato. As plantas de milho transgênico podem ser produzidas por um método de transformação mediado por Agrobacterium. Uma Agrobacterium cepa C58 desarmada (ABI) que abriga um constructo binário da presente invenção é usada. Esta é transferida em Agrobacterium por um método de acasalamento triparental (Ditta etal., Proc. Natl. Acad. Sei. 77: 7347 a 7351). Culturas líquidas de Agrobacterium são iniciadas a partir de estoques de glicerol ou a partir de uma placa recém riscada e cultivada durante a noite entre 26° C e 28° C com agitação (aproximadamente 150 rpm) até a fase de cultivo de hemi-log em meio LB líquido, pH 7,0 contendo os antibióticos apropriados. As células de AgrobacteriumThese plasmids are transformed into corn cells by a method mediated by Agrobacterium and the transgenic maize strains regenerated in selection with glyphosate. Transgenic corn plants can be produced by an agrobacterium-mediated transformation method. An unarmed Agrobacterium cepa C58 (ABI) that houses a binary construct of the present invention is used. This is transferred to Agrobacterium by a triparental mating method (Ditta etal., Proc. Natl. Acad. Sci. 77: 7347 to 7351). Liquid cultures of Agrobacterium are initiated from glycerol stocks or from a freshly scratched plate and grown overnight between 26 ° C and 28 ° C with agitation (approximately 150 rpm) until the hemi-log cultivation phase in liquid LB medium, pH 7.0 containing the appropriate antibiotics. Agrobacterium cells

são ressuspensos no meio de inoculação (CM4C líquido) e a densidade é ajustada até OD₆6o de 1. Os embriões de milho HillxLH198 e Hill Tipo li imaturos recém isolados são inoculados com um constructo contendo Agrobacterium e co-cultivados vários dias no escuro a 23° C. Os embriões são depois transferidos para meio de demora e incubados a 28° C durante vários ou mais dias. Todas as culturas subseqüentes são mantidas nesta temperatura. Os embriões são transferidos para um primeiro meio de seleção contendo carbenicilina 500/0,5 mM de glifosato). Duas semanas mais tarde, o tecido sobrevivente é transferido a um segundo meio de seleção contendo carbenicilina 500/1,0 mM de glifosato). Subcultivar o calo sobrevivente a cada 2 semanas até que os eventos possam ser identificados. Isto pode levar cerca de 3 subculturas em glifosato 1,0 mM. Uma vez que os eventos são identificados, avolumar o tecido a regenerar. As mudas são transferidas ao meio MSOD em vaso de cultura e mantido durante duas semanas. Depois as plantas com raízes são transferidas para o solo. Aqueles versados na técnica de transformação de milho podem modificar este método para fornecer plantas de milho transgênico substancialmente idênticas contendo as composições de DNA da presente invenção.they are resuspended in the inoculation medium (liquid CM4C) and the density is adjusted to OD ₆ 6o out of 1. The newly isolated immature HillxLH198 and Hill Type li embryos are inoculated with a construct containing Agrobacterium and co-cultured for several days in the dark at 23 ° C. The embryos are then transferred to the delay medium and incubated at 28 ° C for several or more days. All subsequent cultures are maintained at this temperature. The embryos are transferred to a first selection medium containing 500 / 0.5 mM glyphosate carbenicillin). Two weeks later, the surviving tissue is transferred to a second selection medium containing 500 / 1.0 mM glyphosate carbenicillin). Subcultivate the surviving callus every 2 weeks until events can be identified. This can take about 3 subcultures in 1.0 mM glyphosate. Once the events are identified, swell the tissue to be regenerated. The seedlings are transferred to the MSOD medium in a culture vessel and maintained for two weeks. Then the plants with roots are transferred to the soil. Those skilled in the art of corn processing can modify this method to provide substantially identical transgenic corn plants containing the DNA compositions of the present invention.

Cerca de 30 linhagens de milho transgênico para cada constructo de plasmídeo são testadas e a eficiência de transformação e os níveis de expressão da enzima CP4EPSPS são mostradas na Tabela 20. As linhagens de milho transgênico são tratadas com glifosato em uma taxa de 64 oz/acre (1,8 kg/acre) como plantas jovens, as plantas sobreviventes são ensaiadas pelo CP4EPSPS ELISA (Padgette et al. Crop Sei. 35: 1451 a 1461, 1995) para determinar os níveis de expressão de proteína de EPSPS CP4 (% exp CP4) mostrada na Tabela 20 e o nível de expressão é comparado com a planta do milho tolerante ao glifosato padrão comercialmente disponíveis (Roundup Ready® Com 603, Monsanto Co., St. Louis, MO) como uma porcentagem da quantidade de expressão de proteína determinada no padrão comercial. No geral, mais do que 50 % das linhagens de milho sobrevivem a pulverização com 1,8 kg/acre (64 oz/acre) de glifosato. As plantas sobreviventes são mostradas ter alto nível de expressão de CP4EPSPS queAbout 30 transgenic maize strains for each plasmid construct are tested and the transformation efficiency and expression levels of the CP4EPSPS enzyme are shown in Table 20. Transgenic maize strains are treated with glyphosate at a rate of 64 oz / acre (1.8 kg / acre) as young plants, surviving plants are assayed by CP4EPSPS ELISA (Padgette et al. Crop Sci. 35: 1451 to 1461, 1995) to determine the levels of protein expression of EPSPS CP4 (% exp CP4) shown in Table 20 and the level of expression is compared with the commercially available standard glyphosate tolerant corn plant (Roundup Ready® Com 603, Monsanto Co., St. Louis, MO) as a percentage of the amount of protein expression determined in the commercial standard. Overall, more than 50% of maize strains survive spraying with 1.8 kg / acre (64 oz / acre) of glyphosate. The surviving plants are shown to have a high level of CP4EPSPS expression that

variam de cerca de 75 a 86 % de padrão comercial 603.range from about 75 to 86% of commercial standard 603.

Tabela 20. Eficiência de transformação (TE), expressão de CP4 (% média) derivada da transformação de constructos CP4-alt diferentes.Table 20. Transformation efficiency (TE), CP4 expression (average%) derived from the transformation of different CP4-alt constructs.

pMON pMON Promotor/lntron # Promoter / lntron # TE (%) TE (%) CP4 exp (%)* CP4 exp (%) * 58400 (CP4 AT) 58400 (CP4 AT) P-CaMVe35S/IHsp70 P-CaMVe35S / IHsp70 5.4 5.4 75.5 75.5 58401 (CP4 ZM) 58401 (CP4 ZM) P-CaMVe35S/IHsp70 P-CaMVe35S / IHsp70 7.2 7.2 84.7 84.7 54964 (CP4 AT) 54964 (CP4 AT) P-OsAct1 P-OsAct1 8.2 8.2 78.1 78.1 54985 (CP4 ZM) 54985 (CP4 ZM) P-FMV/IOsSS P-FMV / IOsSS 11.5 11.5 85.7 85.7 54992 (CP4 AT) 54992 (CP4 AT) P-FMV/IOsSS P-FMV / IOsSS 11.5 11.5 78.2 78.2 nk603 (control) nk603 (control) P-OsAct1/P-e35S: P-OsAct1 / P-e35S: - - 100 100

* A expressão de CP4EPSPS é calculada como a porcentagem de controle (603) feita em plantas que sobreviveram à pulverização com glifosato (1,8 kg/acre (64 oz/acre)).* CP4EPSPS expression is calculated as the percentage of control (603) made on plants that survived spraying with glyphosate (1.8 kg / acre (64 oz / acre)).

EXEMPLO 8EXAMPLE 8

Três constructos plasmídicos são avaliados em plantas de algodão transgênico (Tabela 21). O constructo de controle (pMON20999) contémThree plasmid constructs are evaluated in transgenic cotton plants (Table 21). The control construct (pMON20999) contains

P-FMV/CP4EPSPS_Syn este cassete de expressão está contido na linhagem 1445 de algodão tolerante ao glifosato comercialmente disponível (Roundup Ready® cotton, Monsanto Co., St. Louis, MO). Os constructos plasmídicos, pMON59313 e pMON59396 contendo os polinucleotídeos CP4EPSPS_AT e CP4EPSPS_ZM, respectivamente, são ensaiados quanto a eficiência de transformação e os níveis de enzima CP4EPSPS em relação ao cassete de expressão tolerante ao glifosato comercial. Cerca de cinqüenta linhagens de algodão transgênico são avaliadas para cada constructo. O polinucleotídeo CP4EPSPS_AT artificial dirigido pelo promotor PFMV dá uma porcentagem mais alta de plantas com um único inserto e um aumento no nível de expressão da enzima CP4EPSPS em relação ao cassete de expressão de pMON20999 como medido pelo ELISA.P-FMV / CP4EPSPS_Syn this expression cassette is contained in the commercially available glyphosate-tolerant 1445 line of cotton (Roundup Ready® cotton, Monsanto Co., St. Louis, MO). The plasmid constructs, pMON59313 and pMON59396 containing the polynucleotides CP4EPSPS_AT and CP4EPSPS_ZM, respectively, are tested for transformation efficiency and CP4EPSPS enzyme levels in relation to the commercial glyphosate tolerant expression cassette. About fifty lines of transgenic cotton are evaluated for each construct. The artificial polynucleotide CP4EPSPS_AT driven by the PFMV promoter gives a higher percentage of plants with a single insert and an increase in the expression level of the CP4EPSPS enzyme compared to the pMON20999 expression cassette as measured by the ELISA.

Tabela 21. Eficiência de transformação (TE), expressão de CP4EPSPS média em linhagens de algodão RO derivada da transformação de constructos de CP4EPSPS diferentes.Table 21. Transformation efficiency (TE), average CP4EPSPS expression in RO cotton lines derived from the transformation of different CP4EPSPS constructs.

pMON pMON Promotor District Attorney TE (%) TE (%) CP4 Exp (%) CP4 Exp (%) 20999 (CP4EPSPS Syn) 20999 (CP4EPSPS Syn) P-FMV P-FMV 15,0 15.0 100,0 100.0 59313 (CP4EPSPS AT) 59313 (CP4EPSPS AT) P-FMV P-FMV 15,0 15.0 116,4 116.4 59396 (CP4EPSPS ZM) 59396 (CP4EPSPS ZM) P-FMV P-FMV 16,1 16.1 52,0 52.0

EXEMPLO 9EXAMPLE 9

Os constructos contendo os polinucleotídeos CP4EPSPS artificiais, CP4EPSPS_AT e CP4EPSPS_ZM são avaliados na soja (Tabela 22). Os constructos plasmídicas contêm todas o promotor P-FMV para direcionar a expressão dos novos polinucleotídeos CP4EPSPS e são comparadas ao P-FMV/CP4EPSPS_Syn contido em pMON20999. Cerca de 25 a 30 plantas de soja transgênica são produzidas para cada construção. A eficiência de transformação e os níveis de enzima CP4EPSPS são medidos. Um nível de expressão surpreendentemente alto de proteína CP4EPSPS é medido em plantas de soja contendo a seqüência codificadora de CP4EPSPS_ZM (Tabela 22).The constructs containing the artificial CP4EPSPS polynucleotides, CP4EPSPS_AT and CP4EPSPS_ZM are evaluated on soybeans (Table 22). The plasmid constructs all contain the P-FMV promoter to direct the expression of the new CP4EPSPS polynucleotides and are compared to the P-FMV / CP4EPSPS_Syn contained in pMON20999. About 25 to 30 transgenic soy plants are produced for each construction. Transformation efficiency and CP4EPSPS enzyme levels are measured. A surprisingly high expression level of CP4EPSPS protein is measured in soybean plants containing the CP4EPSPS_ZM coding sequence (Table 22).

Tabela 22. Eficiência de transformação (TE), expressão CP4 média derivada da transformação de constructos de CP4EPSPS diferentes.Table 22. Transformation efficiency (TE), average CP4 expression derived from the transformation of different CP4EPSPS constructs.

pMON pMON Promotor District Attorney TE (%) TE (%) CP4Exp(%) CP4Exp (%) 20999 (CP4 Syn) 20999 (CP4 Syn) P-FMV P-FMV 0,55 0.55 100,0 100.0 59313 (CP4 AT) 59313 (CP4 AT) P-FMV P-FMV 0,40 0.40 66,6 66.6 54996 (CP4 ZM) 54996 (CP4 ZM) P-FMV P-FMV 0,29 0.29 242,5 242.5

EXEMPLO 10EXAMPLE 10

As células de tabaco são transformadas com três constructos plasmídicos contendo seqüências de polinucleotídeo de CP4EPSPS dife20 rentes e regeneradas em plantas. Cerca de vinte linhagens transgênicas são avaliadas de cada constructo. A expressão de cada um dos polinucleotídeos CP4EPSPS é direcionada pelo promotor realçador duplicado P-CaMVe35S (Tabela 23). A eficiência de transformação e o nível de expressão da enzima CP4EPSPS é medido. Os constructos de polinucleotídeo de CP4EPSPSTobacco cells are transformed with three plasmid constructs containing different and regenerated CP4EPSPS polynucleotide sequences in plants. About twenty transgenic strains are evaluated for each construct. The expression of each of the CP4EPSPS polynucleotides is driven by the duplicate enhancer promoter P-CaMVe35S (Table 23). The transformation efficiency and the expression level of the CP4EPSPS enzyme is measured. The CP4EPSPS polynucleotide constructs

diferentes são mostradas realizar aproximadamente a mesma em tabaco transgênico quanto a eficiência de transformação e expressão.different are shown to perform approximately the same in transgenic tobacco in terms of transformation and expression efficiency.

Tabela 23. Eficiência de transformação (TE), expressão de CP4 média em linhagens de tabaco RO derivadas da transformação de constructos deTable 23. Transformation efficiency (TE), expression of average CP4 in RO tobacco lines derived from the transformation of constructs of

EPSPS de CP4 diferentes.EPSPS of different CP4.

pMON pMON Promotor District Attorney TE (%) TE (%) CP4 exp. (%) CP4 exp. (%) 59308 CP4EPSPS AT 59308 CP4EPSPS AT P-CaMVe35S P-CaMVe35S 35 35 100.0 100.0 59309 CP4EPSPS ZM 59309 CP4EPSPS ZM P-CaMVe35S P-CaMVe35S 35 35 91.0 91.0 54313 CP4EPSPS_Syn 54313 CP4EPSPS_Syn P-CaMVe35S P-CaMVe35S 35 35 100.0 100.0

EXEMPLO 11EXAMPLE 11

A Arabidopsis thaliana é transformada com quatro constructos plasmídicos pela infiltração a vácuo (Bechtold N, et al., CR Acad Sei Paris Sciences di Ia vie/life Sciences 316: 1194 a 1199, (1993) e a progênie V1 avaliada para comparar a eficácia das seqüências de polinucleotídeo CP4EPSPS diferentes e promotores diferentes para o uso na seleção de plantas em glifosato (Tabela 24). Cerca de 30 plantas V1 transgênicas (+) são produzidas para cada constructo. Os constructos direcionadas pelo PCaMVe35S com o realçador duplicado (pMON45313, pMON59308 e pMON59309) não mostram nenhuma diferença substancial no nível de expressão em folhas como determinado pelo ELISA. As plantas são transformadas com pMON26140 que contém CP4EPSPS_Syn dirigido pelo promotor P-FMV, estas plantas mostram o nível de expressão mais alto, os níveis de expressão detectados a partir das plantas dos constructos de teste são comparados a pMON26140.Arabidopsis thaliana is transformed with four plasmid constructs by vacuum infiltration (Bechtold N, et al., CR Acad Sei Paris Sciences di Ia vie / life Sciences 316: 1194 to 1199, (1993) and progeny V1 evaluated to compare effectiveness of different CP4EPSPS polynucleotide sequences and different promoters for use in glyphosate plant selection (Table 24), about 30 transgenic V1 plants (+) are produced for each construct. Constructs targeted by PCaMVe35S with duplicate enhancer (pMON45313, pMON59308 and pMON59309) show no substantial difference in the level of expression in leaves as determined by the ELISA.The plants are transformed with pMON26140 which contains CP4EPSPS_Syn directed by the P-FMV promoter, these plants show the highest level of expression, the levels of expression detected from the plants of the test constructs are compared to pMON26140.

Tabela 24. Avaliação de cassetes de expressão CP4 diferentes em ArabidopsisTable 24. Evaluation of different CP4 expression cassettes in Arabidopsis

pMON pMON Promotor/ District Attorney/ Plantas produzidas Plants produced CP4 exp.(%) CP4 exp. (%) 45313(CPEPSPS4 Syn) 45313 (CPEPSPS4 Syn) P-CaMVe35S P-CaMVe35S + + 82,1 82.1 59308(CP4EPSPS AT) 59308 (CP4EPSPS AT) P-CaMVe35S P-CaMVe35S + + 79,3 79.3 59309(CP4EPSPS ZM) 59309 (CP4EPSPS ZM) P-CaMVe35S P-CaMVe35S + + 77,3 77.3 26140(CP4EPSPS Syn) 26140 (CP4EPSPS Syn) P-FMV P-FMV + + 100,0 100.0

EXEMPLO 12EXAMPLE 12

Plantas de trigo transformadas com os novos polinucleotídeos CP4EPSPS são comparadas quanto a eficiência de transformação e a expressão da enzima CP4EPSPS determinada pelo ELISA (Tabela 25). O CP4EPSPS_ZM fornece a expressão da enzima CPEPSPS pelo menos sete vezes mais alta do que o CP4EPSPS_AT. A expressão média de CP4EPSPS nas folhas de plantas de trigo contendo o polinucleotídeo CP4EPSPS_ZM é de cerca de 64 % daquele encontrado no trigo resistente ao glifosato que contém um constructo de cassete duplo, pMON30139: Pe35S/l-Hsp70/CP4EPSPS_Nat e P-OsAct1/l-OsAct1/CP4EPSPS_Nat (WO/0022704).Wheat plants transformed with the new CP4EPSPS polynucleotides are compared for transformation efficiency and the expression of the CP4EPSPS enzyme determined by the ELISA (Table 25). CP4EPSPS_ZM provides CPEPSPS enzyme expression at least seven times higher than CP4EPSPS_AT. The average expression of CP4EPSPS in the leaves of wheat plants containing the polynucleotide CP4EPSPS_ZM is about 64% of that found in glyphosate resistant wheat that contains a double cassette construct, pMON30139: Pe35S / l-Hsp70 / CP4EPSPS_Nat and P-OsAct1 / l-OsAct1 / CP4EPSPS_Nat (WO / 0022704).

Tabela 25. Desempenho de polinucleotídeos CP4EPSPS diferentes no trigoTable 25. Performance of different CP4EPSPS polynucleotides in wheat

pMON pMON Promotor/lntron Promoter / lntron TE (%) TE (%) CP4 Exp. (%) CP4 Exp. (%) 58400 CP4EPSPS AT 58400 CP4EPSPS AT P-e35S/l-Hsp70 P-e35S / l-Hsp70 0.25 0.25 9,2 9.2 58401 CP4EPSPS ZM 58401 CP4EPSPS ZM P-e35S/l-Hsp70 P-e35S / l-Hsp70 0.35 0.35 64,0 64.0 30139 CP4EPSPS Nat 30139 CP4EPSPS Nat P-e35S:P-OsAct1 P-e35S: P-OsAct1 - - 100,0 100.0

EXEMPLO 13EXAMPLE 13

Este exemplo serve para ilustrar a detecção de polinucleotídeos artificiais diferentes em plantas transgênicas, especificamente CP4EPSPS_AT e CP4EPSPS_ZM. Os outros polinucleotídeos artificiais, OsEPSPS_AT, OsEPSPS_ZM, GmEPSPS_GM, ZmEPSPS_ZM, CTP2_AT, CTP2_ZM, Bar1_AT e Bar1_ZM podem ser todos especificamente detectados em plantas transgênicas pelos métodos que fornecem um amplicon de DNA ou pela hibridização de uma sonda de DNA a uma amostra de planta. Aqueles versados na técnica da detecção de DNA podem facilmente planejar moléculas preparadoras a partir das seqüências de polinucleotídeo artificiais fornecidas na presente invenção para permitir um método que especificamente detectará o polinucleotídeo artificial em uma amostra de planta. O uso de um método ou de um kit que forneça iniciadores de DNA ou sondas homólogas ou complementares aos polinucleotídeos artificiais aqui descritos é um aspecto da presente invenção.This example serves to illustrate the detection of different artificial polynucleotides in transgenic plants, specifically CP4EPSPS_AT and CP4EPSPS_ZM. The other artificial polynucleotides, OsEPSPS_AT, OsEPSPS_ZM, GmEPSPS_GM, ZmEPSPS_ZM, CTP2_AT, CTP2_ZM, Bar1_AT and Bar1_ZM can all be specifically detected in transgenic plants by the methods that provide a DNA amplicon to a DNA sample or by hybridization of a plant sample. Those skilled in the technique of DNA detection can easily design preparatory molecules from the artificial polynucleotide sequences provided in the present invention to enable a method that will specifically detect the artificial polynucleotide in a plant sample. The use of a method or kit that provides DNA primers or probes homologous or complementary to the artificial polynucleotides described herein is an aspect of the present invention.

Um método de detecção de DNA (reação de cadeia da polime• · · · · · • · • ·A method of detecting DNA (polyme chain reaction • · · · · · · · · · ·

rase, PCR) é planejado para detectar os polinucleotídeos CP4EPSPS artificiais em plantas transgênicas. Os conjuntos únicos de iniciadores de DNA mostrados na Tabela 26 são planejados para amplificar um polinucleotídeo de CP4EPSPS específico e para fornecer amplicons distintamente dimenci5 onados. Os amplicons diferem suficientemente no comprimento de polinucleotídeo entre os vários polinucleotídeos CP4EPSPS para tornar fácil a separação dos amplicons pela eletroforese em gel de agarose padrão. A presença de mais do que um dos polinucleotídeos artificiais pode ser detectada em uma planta usando-se um método de PCR multiplex.rase, PCR) is designed to detect artificial CP4EPSPS polynucleotides in transgenic plants. The unique sets of DNA primers shown in Table 26 are designed to amplify a specific CP4EPSPS polynucleotide and to provide distinctly dimensioned amplicons. Amplicons differ sufficiently in polynucleotide length between the various CP4EPSPS polynucleotides to make separation of amplicons easy by standard agarose gel electrophoresis. The presence of more than one of the artificial polynucleotides can be detected in a plant using a multiplex PCR method.

Tabela 26. Seqüência de iniciadores usados para a detecção de genes CP4 diferentes em plantas transgênicas.Table 26. Sequence of primers used to detect different CP4 genes in transgenic plants.

Par de iniciador Initiator pair Especificidade de gene Gene specificity produto de PCR (bps) PCR product (bps) SEQ ID NOs: 24 e 25 SEQ ID NOs: 24 and 25 CP4EPSPS AT CP4EPSPS AT 938 (940) 938 (940) SEQ ID NOs: 26 e 27 SEQ ID NOs: 26 and 27 CP4EPSPS ZM CP4EPSPS ZM 595 (600) 595 (600) SEQ ID NOs: 28 e 29 SEQ ID NOs: 28 and 29 CP4EPSPS Nat CP4EPSPS Nat 712 (710) 712 (710) SEQ ID NOs: 30 e 31 SEQ ID NOs: 30 and 31 CP4EPSPS Syn CP4EPSPS Syn 443 (440) 443 (440)

Os pares iniciadores de DNA (Tabela 26) são usados para produzir um diagnóstico de amplicon para um polinucleotídeo CP4EPSPS específico em uma planta transgênica. Estes pares iniciadores incluem, mas não são limitados à SEQ ID NO: 24 e SEQ ID NO: 25 para o polinucleotídeo CP4EPSPS_AT; SEQ ID NO: 26 e SEQ ID NO: 27 para o CP4EPSPS_ZM; SEQ ID NO: 28 e SEQ ID NO: 29 para o CP4EPSPS_Nat e SEQ ID NO: 30 e SEQ ID NO: 31 para a molécula de polinucleotídeo CP4EPSPS_Syn. Além destes pares preparadores, qualquer par preparador derivado da SEQ IDThe DNA primer pairs (Table 26) are used to produce an amplicon diagnosis for a specific CP4EPSPS polynucleotide in a transgenic plant. These primer pairs include, but are not limited to SEQ ID NO: 24 and SEQ ID NO: 25 for polynucleotide CP4EPSPS_AT; SEQ ID NO: 26 and SEQ ID NO: 27 for CP4EPSPS_ZM; SEQ ID NO: 28 and SEQ ID NO: 29 for CP4EPSPS_Nat and SEQ ID NO: 30 and SEQ ID NO: 31 for the polynucleotide molecule CP4EPSPS_Syn. In addition to these preparer pairs, any preparer pair derived from SEQ ID

NO: 17 ou SEQ ID NO: 18 que quando usados em uma reação de amplificação de DNA produz um diagnóstico de amplicon de DNA para o respectivo polinucleotídeo CP4EPSPS é um aspecto da presente invenção.NO: 17 or SEQ ID NO: 18 that when used in a DNA amplification reaction produces a diagnosis of DNA amplicon for the respective polynucleotide CP4EPSPS is an aspect of the present invention.

As condições de amplificação para esta análise é ilustrada na Tabela 27 e Tabela 28, entretanto, qualquer modificação destas condições incluindo o uso de fragmentos das moléculas de DNA da presente invenção ou seus complementos como moléculas iniciadoras, produzindo um diagnóstico de molécula de DNA de amplicon para os polinucleotídeos artificiais ···· ····The amplification conditions for this analysis are illustrated in Table 27 and Table 28, however, any modification of these conditions including the use of fragments of the DNA molecules of the present invention or their complements as initiator molecules, producing a diagnosis of amplicon DNA molecule for artificial polynucleotides ···· ····

aqui descritos está dentro da vesatilidade da técnica. As moléculas de DNA da presente invenção incluem pelo menos a SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 e SEQ ID NO: 35. As moléculas de DNA que funcionam como moléculas iniciadores em um método de amplificação de DNA para detectar a presença dos polinucleotídeos artificiais incluem, mas não são limitadas à SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30 e SEQ ID NO: 31.described here is within the vesatility of the technique. The DNA molecules of the present invention include at least SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 17, SEQ ID NO: 18, SEQ ID NO: 21, SEQ ID NO: 22 and SEQ ID NO: 35. DNA molecules that function as primer molecules in a DNA amplification method to detect the presence of artificial polynucleotides include, but are not limited to, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30 and SEQ ID NO: 31.

Em um método para determinar a presença de polinucleotídeos da presente invenção, a análise de amostra de extrato de DNA de tecido vegetal deve incluir um controle positivo conhecido conter o polinucleotideo artificial e um controle de extrato de DNA negativo de uma planta que não é transgênica ou não contém o polinucleotideo artificial e um controle negativo que não contenha nenhum padrão no extrato de DNA.In a method for determining the presence of polynucleotides of the present invention, the analysis of a plant tissue DNA extract sample should include a known positive control containing the artificial polynucleotide and a negative DNA extract control from a plant that is not transgenic or it does not contain the artificial polynucleotide and a negative control that does not contain any pattern in the DNA extract.

As moléculas iniciadoras de DNA adicionais de comprimento suficiente pode ser selecionada da SEQ ID NO: 17 e SEQ ID NO: 18 e condições otimizadas para a produção de um amplicon que pode diferir dos métodos mostrados na Tabela 27 e Tabela 28, mas resultam em um diagnóstico de amplicon para os polinucleotídeos artificiais. O uso destas seqüências preparadoras de DNA homólogas ou complementares à SEQ ID NO: 17 e SEQ ID NO: 18 usadas com ou sem as modificações para os métodos das Tabelas 27 e 28 estão dentro do escopo da invenção. O ensaio para os amplicons CP4EPSPS_AT e CP4EPSPS_ZM pode ser realizado pelo uso de um termociclador Stratagene Robocycler, MJ Engine, PerkinElmer 9700 ou Eppendorf Mastercycler Gradient como mostrado na Tabela 28 ou pelos métodos e aparelho conhecido por aqueles versados na técnica.Additional DNA primer molecules of sufficient length can be selected from SEQ ID NO: 17 and SEQ ID NO: 18 and conditions optimized for the production of an amplicon which may differ from the methods shown in Table 27 and Table 28, but result in a amplicon diagnosis for artificial polynucleotides. The use of these DNA preparation sequences homologous or complementary to SEQ ID NO: 17 and SEQ ID NO: 18 used with or without the modifications for the methods of Tables 27 and 28 are within the scope of the invention. The assay for the CP4EPSPS_AT and CP4EPSPS_ZM amplicons can be performed using a Stratagene Robocycler, MJ Engine, PerkinElmer 9700 or Eppendorf Mastercycler Gradient thermocycler as shown in Table 28 or by the methods and apparatus known to those skilled in the art.

.··.···;···: .. ··. ···; ···:.

Tabela 27. Procedimento de amplificação de DNA e mistura de reação para a confirmação de polinucleotídeo de EPSPS artificial CP4EPSPS_AT em plantas de milho.Table 27. Procedure for DNA amplification and reaction mixture for confirmation of artificial EPSPS polynucleotide CP4EPSPS_AT in corn plants.

Etapa Stage Reagente Reagent Quantidade Amount Comentários comments 1 1 Água isenta de Nuclease Nuclease-free water adicionar até volume final de 20 μΙ add up to a final volume of 20 μΙ - - 2 2 tampão de reação 10X (com MgCI₂)10X reaction buffer (with MgCI ₂ ) 2,0 μΙ 2.0 μΙ Concentração final de 1X de tampão, concentração final de 1,5 mM de MgCI₂ Final concentration of 1X buffer, final concentration of 1.5 mM MgCl ₂ 3 3 10 mM de solução de dATP, dCTP, dGTP e dTTP 10 mM dATP, dCTP, dGTP and dTTP solution 0,4 μΙ 0.4 μΙ Concentração final de 200 μΜ de cada dNTP Final concentration of 200 μΜ of each dNTP 4 4 Iniciador (SEQ ID NO: 24) (ressuspenso em tampão 1X TE ou água isenta de nuclease a uma concentração de 10 μΜ) Initiator (SEQ ID NO: 24) (resuspended in 1X TE buffer or nuclease-free water at a concentration of 10 μΜ) 0,4 μΙ 0.4 μΙ Concentração final de 0,2 μΜ Final concentration of 0.2 μΜ 5 5 Iniciador (SEQ ID NO: 25) (ressuspenso em tampão 1X TE ou água isenta de nuclease a uma concentração de 10 μΜ) Initiator (SEQ ID NO: 25) (resuspended in 1X TE buffer or nuclease-free water at a concentration of 10 μΜ) 0,4 μΙ 0.4 μΙ Concentração final de 0,2 μΜ Final concentration of 0.2 μΜ 6 6 Iniciador de controle (SEQ ID NO: 32) (ressuspenso em tampão 1X TE ou água isenta de nuclease a uma con- centração de 10 μΜ) Control initiator (SEQ ID NO: 32) (resuspended in 1X TE buffer or nuclease-free water to a con- 10 μΜ centering) 0,2 μΙ 0.2 μΙ Concentração final de 0,1 μΜ Final concentration of 0.1 μΜ

100100

Etapa Stage Reagente Reagent Quantidade Amount Comentários comments 7 7 Iniciador de controle (SEQ ID NO: 33) (ressuspenso em tampão 1X TE ou água isenta de nuclease a uma con- centração de 10 μΜ) Control initiator (SEQ ID NO: 33) (resuspended in 1X TE buffer or nuclease-free water to a con- 10 μΜ centering) 0,2 μΙ 0.2 μΙ Concentração final de 0,1 μΜ Final concentration of 0.1 μΜ 8 8 RNase, DNase livre (500 ng/μΙ) RNase, free DNase (500 ng / μΙ) 0,1 μΙ 0.1 μΙ 50 ng/reação 50 ng / reaction 9 9 REDTaq DNA polimerase (1 unidade/μΙ) REDTaq DNA polymerase (1 unit / μΙ) 1,0 μΙ (recomendado sacudir as pipetas antes da etapa seguinte) 1.0 μΙ (recommended to shake the pipettes before the next step) 1 unidade/reação 1 unit / reaction 10 10 DNA extraído (padrão): • Amostras a serem analisadas • folhas individuais • folhas agrupadas (máximo de 50 folhas/ agrupamento) • Controle negativo • Controle negativo • Controle positivo Extracted DNA (standard): • Samples to be analyzed • individual sheets • collated sheets (maximum 50 sheets / collation) • Negative control • Negative control • Positive control • 10 a 200 ng de DNA genômico • 200 ng de DNA genômico • 50 ng de DNA genômico de planta não transgênica • nenhum DNA padrão • 5 ng DNA plasmídico • 10 to 200 ng of genomic DNA • 200 ng of DNA genomic • 50 ng of genomic DNA from non-transgenic plant • no DNA standard • 5 ng plasmid DNA

Tabela 28. Parâmetros de PCR Sugeridos para termocicladores diferentesTable 28. Suggested PCR parameters for different thermal cyclers

Mistura suave e, se necessário (nenhum topo quente no termociclador), adicionar 1 a 2 gotas de óleo mineral no topo de cada reação. Processar com a PCR em um termociclador Stratagene Robocycler, MJ Engine,Gentle mixing and, if necessary (no hot top in the thermal cycler), add 1 to 2 drops of mineral oil on top of each reaction. Process with PCR in a Stratagene Robocycler, MJ Engine,

Perkin-Elmer 9700 ou Eppendorf Mastercycler Gradient usando os seguintes parâmetros de ciclização.Perkin-Elmer 9700 or Eppendorf Mastercycler Gradient using the following cycling parameters.

Nota: Os termocicladores MJ Engine ou Eppendorf Mastercycler Gradient devem ser conduzidos no modo calculado. Conduzir o termociclador Perkin-Elmer 9700 com a velocidade de elevação ajustada no máximo.Note: MJ Engine or Eppendorf Mastercycler Gradient thermal cyclers must be driven in the calculated mode. Drive the Perkin-Elmer 9700 thermal cycler with the lifting speed set to maximum.

101101

A Λ <······· * A Λ <······· * ··· 9 · ··· 9 · • · · · · • · · • · · · · • · · • · · · • • · · · • ·: · : · ·: ·: · • · • · 9 · 9 · • · · • · · • • • · · · • · · · ·· · ·· · • · · • · · • · · · · • · · • · · • · · · · • • · • · • • · • · ζ · · · ······* · · ζ · · · ······ * · · • · • ·

N° de Ciclos No. of Cycles Ajustes: Stratagene Robocycler Adjustments: Stratagene Robocycler 1 1 94° C 3 minutos 94 ° C 3 minutes 38 38 94° C 1 minuto 60° C 1 minuto 72° C 1 minuto e 30 segundos 94 ° C 1 minute 60 ° C 1 minute 72 ° C 1 minute 30 seconds 1 1 72° C 10 minutos 72 ° C 10 minutes

N⁹ de CiclosN ⁹ of Cycles Ajustes: MJ Engine ou Perkin-Elmer 9700 Settings: MJ Engine or Perkin-Elmer 9700 1 1 94° C 94 ° C 3 minutos 3 minutes 38 38 94° C 60° C 72° C 94 ° C 60 ° C 72 ° C 10 segundos 30 segundos 1 minuto 10 seconds 30 seconds 1 minute 1 1 72° C 72 ° C 10 minutos 10 minutes

N° de Ciclos No. of Cycles Ajustes: Eppendorf Mastercycler Gradient Settings: Eppendorf Mastercycler Gradient 1 1 94° C 94 ° C 3 minutos 3 minutes 38 38 94° C 60° C 72° C 94 ° C 60 ° C 72 ° C 15 segundos 15 segundos 1 minuto 15 seconds 15 seconds 1 minute 1 1 72° C 72 ° C 10 minutos 10 minutes

Todas as composições e métodos descritos e reivindicados aqui podem ser feitas e executadas considerando-se a presente descrição. Embora as composições e métodos desta invenção tenham sido descritas, estará evidente por aqueles de habilidade na técnica que as variações podem ser aplicadas às composições e métodos e nas etapas ou na seqüência de etapas dos métodos aqui descritos sem divergir do conceito, espírito e escopo da invenção. Mais especificamente, estará evidente que certos agentes que são tanto química e fisiologicamente relacionados podem ser substituídos no lugar dos agentes aqui descritos enquanto os mesmos resultados ou similares podem ser obtidos. Todos de tais substitutos e modificações similares evidentes àqueles versados na técnica são considerados estar dentro do espirito, escopo e conceito da invenção como definido pelas reivindicações anexas.All of the compositions and methods described and claimed herein can be made and performed with consideration of the present description. Although the compositions and methods of this invention have been described, it will be apparent to those of skill in the art that variations can be applied to the compositions and methods and in the steps or sequence of steps of the methods described here without departing from the concept, spirit and scope of the invention. invention. More specifically, it will be evident that certain agents that are both chemically and physiologically related can be substituted in place of the agents described herein while the same or similar results can be obtained. All such substitutes and similar modifications evident to those skilled in the art are considered to be within the spirit, scope and concept of the invention as defined by the appended claims.

Todas as publicações e pedidos de patente aqui citados são in15 corporados por referência em sua totalidade no mesmo grau como se cadaAll publications and patent applications cited herein are incorporated by reference in their entirety to the same degree as if each

102 publicação ou pedido de patente individual fosse específico e individualmente indicado como sendo incorporado por referência.102 publication or individual patent application was specific and individually indicated as being incorporated by reference.

Claims

1. Method to reduce the silencing of transgene in transgenic plants, characterized by the fact that it comprises the steps of:

a) obtaining a DNA construct comprising an artificial polynucleotide that encodes a chloroplast transit peptide operably linked to SEQ ID NO: 18;

b) transforming said DNA construct into a plant cell comprising SEQ ID NO: 16; and

c) regenerate said plant cell in a transgenic plant

10 fertile.

2. Method according to claim 1, characterized by the fact that said artificial polynucleotide is expressed in said fertile transgenic plant.

3. Method according to claim 2, characterized by the fact that said artificial polynucleotide provides an agronomically useful phenotype selected from the group consisting of: herbicide tolerance and resistance to diseases.

4. Artificial polynucleotide molecule, characterized by the fact that it comprises SEQ ID NO: 18.

20 5. DNA construct, characterized by the fact that it comprises: a promoter molecule that works in plants, operably linked to the said artificial polynucleotide molecule as defined in claim 4.

6. Method of detecting an artificial polynucleotide in a ve25 getal cell, plant or its progeny that comprises an artificial polynucleotide molecule, as defined in claim 4, characterized by the fact that it comprises the steps:

(a) contacting a DNA sample isolated from said plant cell, plant or its progeny with a DNA molecule, in which said

30 DNA molecule comprises at least one DNA molecule from a pair of DNA molecules that when used in a nucleic acid amplification reaction produces an amplicon that is diagnostic for that moleculePetition 870160006724, of 02/26/2016, p. 4/10 an artificial polynucleotide cuve, as defined in claim 4, wherein the DNA molecule comprises a fragment of SEQ ID NO: 18;

(b) carrying out a nucleic acid amplification reaction, thereby producing amplicon; and

(C) detecting the amplicon.

7. DNA detection kit, characterized by the fact that it comprises at least one DNA molecule of sufficient length to be specifically homologous or complementary to an artificial polynucleotide, as defined in claim 4, in which the DNA molecule comprises 10 a fragment of SEQ ID NO: 18 and the DNA molecule is not complementary to SEQ ID NO: 16, wherein said DNA molecule is useful as a DNA probe or DNA primer.

Petition 870160006724, of 02/26/2016, p. 5/10

1/52

MAGNET>

Figure 3A Figure 5A Figure 3B * Figure 5B Figure 3C Figure 5C 3D figure Figure 5D Figure 3E ’ Figure 5E Figure 3F Figure 5F Figure 3G Figure 5G

Fig 3 Fig 5

Figi 2

2/52 ·· ········ • · * - *

Ι- • • CD CD • CD CD ç <C CD CD • • ο ρ- • CD CD <c < LO CD • • CD CD • • ο • CD i — 1 P- • • i — l CD <c <c OR CD • 1- CD • • CD • 1- CD F- CD 1- • • I— • • CD < • O • • CD • • CD 1- 1- Ç • • CD • • CD • l · - Ι- • CD » • CD • <c CD • CD Ο • CD • < CD • • Ι- • <C ο ο 1- <C • • CD <c • σο Ι- CD ο • • <C • • CD • 1- CD σο • CD • • CD 1- • 1- • • Ι- <c • CD 1- • CD • • CD • σο CD • Ο • • CD • < CD <c CD CD CD • CD • • CD < 1- CD • • Ι- <c • CD ο 1- CD • CD CD <c • σο 1- CD CD • • F- • < CD F— • CD • CD CD • • CD < CD 1- • • F- • ç 1- • CD CD • • CD • • CD F- CD ο • • 1- • • CD ç <xz CD • • CD • • CD • • CD • • CD • « CD • <c 1- • • CD <c <c CD • CD CD • • F- • • CD • • CD • • CD • • CD < CD I— • • Ι- • • CD < CD CD • • <C • • σο CD CD Ο • • CD • • CD CD <c CD • • CD • • CD 1- F- CD • • CD • • CD • • CD • • CD • • CD CD <C • • CD • • CD CD 1- Ι- • • <c • • CD CD CD CD • • σο • • CD 1- l · - CD CD CD CD • • CD CD <c CD ο CD F- < Ç CD <c <c Ι- • Ι- < <C CD • CD CD • • σο • • Ο 1- 1- CD • • CD <c • CD <c • CD • • CD • • CD 1- CD Ι- • • Ο • • CD <c 1- CD • • σο • • CD • • CD • • CD • • CD 1- CD I— • • CD • • CD CD 1- CD • • <c • • CD <c P-. CD • • CD • • CD 1- CD 1- • CD • • CD • • ο • • CD • CD ç CD Ç • • <c • • CD < CD CD • • CD • • ο <c l · - CD • • CD • • CD • F- CD • CD • • CD <c CD CD • • CD • • CD < • CD • • CD • • ο 1- CD <c • • CD • • CD • <c ο • • CD • • CD <c CD CD • • ο • • CD • 1- I— • • CD <c <£ ο • • ç • • -1 CD • • i — 1 CD 1- CD Ρ- • • i — l CD • • O CD • l · - LO CD • • <c • • LO CD < < 1—1 CD • • i — 1 CD • • -Μ + -> 4-> 4-> Π3 ρ- Π3 H- Π3 1- Π3 1- sz 1 <c I ru I I I ru I I <c | ru | I <c I ru 1 to 1 tO 1 CO 1 to 1 to 1 to 1 to 1 to 1 to 1 to 1 to to Ο- CL CL Cl. CL CL Ci- cy- cy- Ο- Ο- Ο- σο LO to to to to to oo to σο σο σο α_ CL Cl. CL CL Cl O_ O_ cl cl O_ O_ LlJ LU LU LU LU LU LU LU LU LU LU LU L0 LO LO LO LO LO LO tr> LO <S) LO LO Ο O O O CD O O CD O O O CD

a a ·

3/52 • · · i-HH

O <C • O CD 1- • CD F— • The O CD The 1- LO CD <c <c O 1- • • LO CD 1- F- O CD The O CM <C • co <c • • CO O • The '' laughs * CD The The CD • • CD • 1- 1- • The CD The The CD • • CD • • 1- CD CD CD <c CD CD • • Ç • CD CD 1- 1- F— The The <C • CD CD • 1- < • The CD The F— CD F— <C <C • • < • The CD 1- CD 1- • • <c • • CD F— 1- CD The The <c • • CD • CD < • The CD The The CD F— CD CD • • O The <C CD The CD • • f— • • O • <c <c The The CD • • CD 1- O Ι- • The <c The The CD <c <c (- • • Ο • The CD CD <c <C • • CD • • O 1- 1- 1- The The CD • • CD CD CD Ι- • The CD The The CD <c CD CD CD • ο • The CD I— CD 1- • 1- <C • < • CD CD The The CD F— CD • • CD • The CD The The CD 1- F— <C • • <c The The CD The <c 1- • <C • • < The CD <c The The CD • CD • O CD • The CD The The CD 1- F— CD O • < • The F— <c CD 1- • 1- ç • CD ç CD F— The The <c « CD <c CD O • The CD The • CD <c CD • • O • The CD The The • CD • • O • <c 1- The The CD • <C 1- F— < • The <c The The CD <c CD • • O • • CD F— The <C • CD • • CD CD The <C • The CD • CD <c CD CD The The 1- The The CD <c CD 1- • • f- The The CD The The CD • CD F— • CD The CD The The CD • CD <c ί- The The CD The The CD • <C • • ο The The 1- CD CD <C • CD • • CD 1- CD 1- The The <C • π- <c O CD The The CD The • CD κ- 1- ι- • • O The The 1- The CD CD • ο • • CD <£ CD <c The The CD • O CD • CD The CD The The CD ι— <£ Z CD • • Ι- The The CD The <c CD • The CD • • Ο I— <c <c The CD • • O • CD F— The The CD The CD • • CD • • CD The The l · - The CD CD • • CD • • O CD CD CD The CD CD • • CD O O F— The • <C The 1- CD • • CD • • CD Ι- • CD F— The r — H CD • • t — 1 1- • • i — 1 CD Ο 1- "-1 <C The The O CD • • LO CD ç F— CD F— The The LO <C The The CM <C 1- f— CM F— • • CO CD The The CO CD The The -F -F -F -F Π3 1- Π3 F- rc F— Π3 1- I <c 1 ΓΜ I 2: I Ç I ΓΜ I I Ç I ΓΜ | I <c I ΓΜ 1 GO 1 GO 1 GO 1 GO 1 GO 1 GO 1 CZ) 1 GO 1 GO 1 GO 1 GO GO CL CL CL CL CL CL CL CL CL CL CL CL czo GO GO go GO GO GO GO GO GO GO GO CL CL CL CL CL CL CL CL CL CL CL CL LlJ LxJ UJ LU LU LU LU LU LU LU LU LU LO cn cn LO LO LO CO LO CO LO LO LO CD CD O CD CD CD O CD O O CD O

QC

O)

LL_

4/52

Ο Ι- • ο ο Ι- • • ο ο • • ο Ο 1- ο LO Ο • • ο Ι- • • LO I— < • ο Ο • • 1- • • LO Ο • ο LO ο • • LO <c ο ο C_J 1- <c Ι- • • < • • ο • ο • • ο • I— ο ο ο π- • ο • • <c ο • Ι- • • <c • ι- <c ο <c • • Ο • • <c • ο ι- • • ο • • ο ο ο ο • ο • • ο <£ 1- ο • • ο <c ο I— < Ι- • • ο • • <c ο Ι- • • Ο • • < 1- ο ο • ο • • <c • ο ο • • ο • <c ο ο <c • • ο • • ο • ι- Ι- • • ο • • <χ 1- π- Ι- • ο • • ο <c <c ο • • Ο • ί- <c <c <c • • < • • <c ο ι- ο • • ο • • ο • • ο • ο • • Ç ο • π- • • ο • <c ο • <£ • ι- • ο 1- • ο ο • • <c • ο Ι- • <£ • <c ο ο ο Π- ο ο Ο ο ο • <c • ο ο • 1- <c <£ π- <c ο <£ • • ο • < ο • ι- • <c • • 1- • ο ο • • ο • <c • ο <c • • ο • <c ο ο ο ο • • ο • • ο • Ι- • ο • • ο • <c Η- • • ο • Ι- <c ο <C • • < • • 1- ο ο Ι- • • <£ • • < ο • • Ο • • ο <c • ο • • ι- • <c • • <c • • ο • • ι- < ç • » ο • • ο • • ο • <c • • ι- • ο Η- • • <c • 1- • ο ι- • • < • • <c • <c • • ο • • < ο ο ο • O • • ί- Ç <c ο • • ο • <c ο 1- ο • ο • • I— ο ο O • • ο • π- ο • • O • • Π- ο ο ο • • <c • <c • ο ι- • ο • • <c ο Ι- <c • • ι- • • ο π- • ο • O • • ο • • < • • ο • O <c ί- <χ • • < • • Ι- <c Ο Ι- • Ç • • ο • <£ ο • Ο • ο ο 1- ο • • ο • Ι- • ο ο • • ο • Ι- • <c ο • ο ο • • ο ο • Ο • ΐ — 1 Ι- • < ι — 1 I— • ο r — 1 π- • • 1—1 ο • Ο ο 1- ο LO ο • • ο ι- ο • LO Ι- ο ο 1- • • Ή ο • • LÍO ο I— • LO ο • •

ο

-π -Π 4-> -Π Π3 ί- ΣΞ Π3 1- Σ ίθ Η- ΖΕΖ <Ό 1- I ο: I γχι I 1 I Γ-J 1 I Ç I Γ-4 1 I <c I Ι \ Ι I 1 ΟΟ 1 ΟΟ 1 ΟΟ 1 ΟΟ 1 οο 1 οο 1 οο 1 οο 1 οο I οο I οο 1 οο Ω_ α_ CL ο_ Ω_ Ο- α_ α_ α_ ο_ Ω_ α_ ΟΟ οο οο οο οο οο οο οο οο οο ΟΟ οο Ο- ο_ α_ α_ CL. α_ Ω_ ο_ α_ Ω_ ο_ Ω_ Ο LU LU LjJ LU LU UJ UJ LlJ LU. LjJ UJ ΟΟ οο 00 οο ΟΟ 00 ΟΟ 00 00 00 00 ΟΟ ο Ο Ο ο Ο ο ο ο Ο Ο Ο ο

• «• · ······· · ·

5/52

O <C • • LO CD • » CO CD 1- CD O • • CD • • 1- CD <c CD • • CD • • Π- CD CD ι- • • ο • 1- <c • • <c • • CD • • <t CD • <C • • <£ • • CD CD 1- l · - • • l · - CD CD CD 1- < CD • • CD • CD <c ç Ι- • • Ο • • 1- CD CD 1- • • CD • • CD 1- CD 1- • l · - CD CD CD 1- 1- Ç • • CD • • 1- • • CD • • CD • • 1- • CD (- • • < • • CD l · - CD CD • • CD • • < H- • CD • • < CD CD τ — l CD • < O < • * CD CD 4-> • • Π3 1- I <c I ru 1 OO 1 oo 00 Q_ CL CL OO oo OO CL. Cl CL LU LU LU to LO LO O O O

ο ο

r-.

LD

CO

<x <£ CD CD CD <C <C CD 1- CD • · • CD • · • · CD LO r ^. 1- CD <C CD CD < CD (- <c CD • CD • CD O CD CO Ι- ου 1- <c CD CD CD CD 1- l · - • • • • • <c CD • • • CD • • CD • • CD • CD CD 1- 1- • • 1- 1- • CD 1- • 1- < • • l · - • • • • • <£ • • í— <c CD CD CD CD CD 1- 1- CD • • 1- < CD 1- CD CD < • 1- CD CD CD • • 1- CD CD CD • • CD • • 1- • » CD • • CD <c • CD • • Ι- CD <c 1- • t— < <c ου • • 1- • CD CD • • Ι- • • Ι- <c <C CD • • ου CD <c ου • • <C CD 1- Ι- • • ass • • CD • • ου • 1- Ι- CD <c CD • • CD • • ου • «· CD 1- • <c • • CD • • CD ♦ • <c • • Ι- • ç 1- • • 1- CD CD ου • • <C CD CD 1- • • CD • • <c • • CD • • CD • • CD • • CD • • 1- • • 1- » <c <C • • <c • • CD • • <C • • CD • • <C • • CD <c <c Ι- • • CD 1- CD CD • • ου • 1- CD • • CD • • CD ass • CD • • 1- CD CD 1- • • 1- CD CD CD • • Ι- ass CD 1- • • CD • • ου 1- CD CD • F— Ι- CD <c CD • • CD • • ου • CD • * (- • • CD • 1- • CD 1- • • 1- CD CD CD ass ASS 1- • • 1- • • <c 1- Ι- CD • • CD • • CD 1- ου 1- • <c CD 1- 1- • • CD t- 1- CD • • Ι- ass CD <C • CD • • ου 1- CD • <£ 1- CD <c • CD < <c CD • • Ι- • • 1- • • i — 1 CD • • i — t ου • • CD • • O 1- CD CD LO <c • • 1- CD • t- 1- • • ass • • 4-> 4—> + -> Π3 I— Π3 1- ro 1- I <c I ru I I <c I ru I | <c I r-d 1 OO 1 oo 1 oo 1 OO 1 oo 1 oo 1 OO 1 OO oo Cl Q_ CL CL CL CL CL CL CL OO oo oo OO OO oo oo OO oo CL. CL CL CL Cl CL · O_ CL Cl LU LlI uu LU LU LU LU LU LU LO L0 L0 L0 LO LO LO LO L0 CD CD O O O O CD O O

6/52

1η γ

ο Ç • • O F— • • O <C • • O CD • • LO ο • • O CD • CD LO < • • O 1- <c CD 00 (- • • co <c • F— co <c CD CD CD CD • • <c • • F— CD CD CD • • i — 1 CD • • <c O CD <c • • CD • • F— CD <c <c • • CD • • 1- • <c CD • • ο • • 1- CD • CD • • CD • • l · - O CD CD CD CD CD • • CD F— • F— • • 1- <C Ç F- • • F— • « ο • • 1- • • CD • CD 1- CD • CD 1- F— <c • • l · - • CD • • <c • • CD • • <c CD CD 1- • • 1- • • CD • <c <c • • 1- • • Ι- CD CD <C • • <c • • I— CD CD ο • • CD • • CD F— 1- <c • • CD • • <C CD I— <c • • 1- • • 1- CD CD CD • • F— • • CD F— • 1- • • CD • • CD • <c CD CD • <c • • CD • <c <C • • <C F— • CD F— • <C • • <C • • CD • CD ο • • <C • • CD • CD • • Ι- • • CD F— • <£ • • CD • • ο <c <c F— • • CD • • CD F— • 1- • • CD • • CD CD <c CD CD CD <c • • (- CD < CD • • <£ (- F— <c F- CD CD • • CD • • • 1- F— • • CD • • < • CD CD • • ο • • F— • • CD • • 1- • • <c • CD F— • • CD • • CD F— • <c • • 1- • • CD • • CD • • <c • • 1- CD ç < • * CD • • CD 1- • CD • • < • • F— • CD <c • • CD • ç 1- <c CD Ç • • CD • • CD <c <c 1- • • CD • • 1- • CD <C • • < • • F— CD CD 1- • • CD • • F— CD CD CD • • ç • • CD • • < • • CD • • CD • F— F— • • Ι- • • <£ • CD F— • • <£ • • Ο • • <C • • <c • • CD F- • 1- • CD • • <c CD 1- • • I— • • Ι- ç CD <c • • 1- CD CD < 1- CD Η- • CD • • <c • CD CD • Ο • • O • 1- CD • • < CD CD F— • CD F— • • Ç • CD CD 1- F— <X • • <C • • CD <c • < • • 1- • • CD • ç 1- • • CD • • CD 1- CD ι — 1 <C • • x — 1 1- • CD -1 CD • • CD • • ο CD • • LO <c 1- • CD CD LO CD • • C0 CD 1- 1- 00 CD • • CO 1- CTi F— • • + -> 4-> + -> ro 1- Π3 1- Π3 F— 03 F— I <c ru I I <c I or I zz I <C I or I I <C I or 1 GO 1 GO 1 GO 1 GO 1 G0 1 G0 1 G0 1 G0 1 GO 1 GO 1 GO GO CL CL CL CL CL CL CL CL CL CL CL CL GO GO GO GO G0 G0 GO G0 GO GO GO GO CL CL CL Cl CL CL CL CL Cl Cl Cl CL LU LU LU LU LU LU LU LU LU LU LU LU LO LO LO LO L0 LO LO L0 LO LO L0 L0 O O O O O O O O O O O O

σ>

7/52 iM

ο LO Η— · CD · Ο • Ο <Γ O LO O CD The <c The <c O O CD 1- · · Ο <· · σ < • -1 <c ο 1- τ — 1 <c The The CM <C The The χ — 1 Ο <c ο γ — 1 ο The χ — 1 O The The x — 1 <C CD CD ο • • ο The Ι- < CD <c The The <c • • < ι— O ο The <c The The ο <c <c CD • The O The O 1- 1- ο • • ο • The <c The 1- <c The The <c • • ο • The O The The Ç The * Ι- • <c 1- • The O The The O The The Ο • • <χ • • <c 1- 1- 1- • • ο • • ο • The O The The <c The The Ι- ο • 1- • • O The The O 1- The Ο • <c • • Ι- The CD <c • The Η- • ο <c ç ο The <c • • 1- ο ο <ί The O • The O 1- CD ο • • ο • The Ι- The Ι- • The ο • • π- • The Ο The The Ο • The «X ο • ι- • The <c • The 1- The O • • ο • • <c 1- O <c • The ο • • <χ • O Ι- The The CD • The Ι- • ο Ι- The • Ο The The 1- CD CD Ι- • • Ο • • O < • 1- • The Ο • • ο • <c O The • CD • The <c • • <£ • • <c The The ί- CD <c ο • • ο • • <x CD O ο • The <£ • • Ι- ο <c l · - The The CD • The ο ο 1- Ο • The CD The • <C CD CD Ι- • • CD • • Ι- CD <c <C • The Ο • • 1- • O ο O O <C • The Ι- <χ • 1- • • <c 1- Ι- CD 1- The Ο • • I— • • O < Ο ί- • The Ç • • <c ο O O The • ο • The ο <c 1- Ç • • <c The The O 1- 1- ο <c The The O The 1- <c • The ο • • ο CD <c ç The The O • The <c • • Ι- • The O The The <c CD The ο • • ο The • Ι- <c <c <c • The ο • • 1- Ο O ο The • <c • The Ι- • ο <χ The The <c • The CD <c The ο • • ο The The O The The <c • The <c • • Η— The O O • The <c The The ο <c 1- CD The The 1- The The CD CD 1- 1- • • ο The • <c CD Ι- CD The The <c • • CD <c <c O The CD • The <c 1- 1- < The The <c The H- CD O γ-1 ο • • γ — 1 ο The • τ — 1 H- ç Ο -1 <C The • Ο ο • • LO CD The • CD l · - The The LO 1- • The ο Ι- ο <£ Ο 1- The • x — 4 O The -1 Ι- • O ι — 1 Ο • • τ — 1 1- O x — 1 O < • -1 ο The The -Μ + -> 4-> 4-> 03 I— 03 1- z: 03 1- 03 1- I <£ ι ΓΜ | I <x I r-j I I <c I ru I I <rz I ΓΜ 1 CZ) 1 CZ) I CZ) 1 CZ) 1 CZ) 1 CZ) 1 CZ) 1 CZ) 1 CZ) 1 CZ) 1 CZ) CZ) cl α_ α_ CL CL CL CL CL CL CL Cl CL CZ) CZ) (Ζ) CZ) CZ) CZ) CZ) CZ) CZ) CZ) CZ) CZ) Cl. Ο_ cl CL Q_ CL CL CL CL CL CL CL LlJ LlJ LU LU LU LU LU LU LU LU UJ LU ι /) CO CO co CO CO co CO co CO CO CO Ο Ο Ο ο O CD O O O CD CD CD

• · · • · · · ·

8/52

Ο LO CD · · CD The <c • < O LiO <c <c CD · • · O O CD CD O CD · CM Ι- CD <C CD CD • • OO CD • <- CD 1- CD t — 1 Ο • • i — 1 <£ • • r — 1 <C CD CD r-l CD • • O • • Ç CD CD <C • • Ç • • I— CD CD <C « • CD • • CD • <c <c • • CD • • 1- <£ CD 1- • • O • • CD CD F— 1- • • <c • • Ι- ç <c CD • • CD • • CD • • ο • • <c • • CD 1- CD • • CD • • <c • CD CD • CD <c • CD F— • ç • • 1- • <c CD F— CD F— • • CD • • ç CD 1- • Ι- • • CD • • CD • CD • Ο F— CD <C • • CD • CD <c • F— • • ç • • ç 1- 1- <C • • CD • • <c CD CD CD • • <£ • • <c CD 1- (- • CD • • CD • <c CD • • CD • CD 1- CD <£ • • CD • • <C CD CD 1- • • CD • • F— • CD CD • • CD • • CD (- CD 1- • <C CD • CD <c <c CD • • CD • CD • • <c • CD • • f— • CD • • <c • <C CD 1- F— • • 1- * • <c CD CD CD • • CD • • CD <c F— CD • • CD • • CD • <c CD • CD <c • • CD <c CD CD • • F— • <C <c CD F— CD • • CD • • F— <c • F— • • <C • • 1- • CD CD • • CD • • CD 1- < 1- • • CD • • CD <c <c F— • • CD • • CD 1- F- <C • • ç • • CD < • F— • • CD • • CD • • CD • • CD • • CD 1- • 1- • • <c • • F— • O CD • <c • • CD • » <C * • <C • • CD F— • F— • • CD • • CD <c CD CD • • <c • • Ç CD CD CD • • F— • • CD 1- CD CD • • CD • • CD « • CD • • <C CD • 1- • CD <c • • CD • • CD <C 1- 1- • F— • • F— • • I— • • <c • CD 1- « F— • • ç • • <c 1- F— Ç • • CD • • 1- CD • CD • • CD > • F— CD CD CD • • CD • • Ι- • <C • • CD ♦ • F— CD CD Ο • • i — 1 CD • • r-t F— CD CD -1 F— • • τ — 1 CD • • O F— • <£ LiO CD • • O CD • • LO F— CD • CM CD • • CM <C • • oo CD • • OO CD • • I — 1 CD • • I — t <C • 1- -1 1- • • T — 1 CD • • 4-> 4-> -4-> Π3 1- frog F— sz Π3 F— ΖΣΖ Π3 F— 2: I <c I r-M I 1 <C 1 r-j I oo <C | Γ-Μ I 1 <C 1 ΓΜ 1 OO 1 OO 1 CD 1 OO 1 OO 1 oo 1 OO 1 OO 1 OO 1 oo oo Cl. CL CL CL Cl CL Ql CL CL CL CL CL LT> OO OO OO OO OO OO OO oo OO OO OO Cl. CL Cy-. Cl. Cl. CL CL CL CL CL CL Cl LU LU LU LU LU LU LU LU LU LU LU LU CO CO CO co CO CO 00 to OO to OO to O O O O O O O O O O O O

9/52

1-3 (0

ο LO CD Ι- • CJ ç The CD • <c • · O < Ο • • LO < • • CO ç • • τ — 1 CD • • i — 1 CJ <c 1- CD • • Ο CD • CD • • LO Ι- • • Η- • • CJ <c • -1 Ο 1- • CJ • • CJ 1- 1- <£ • • CJ < CD CJ • • <C • • Ο • • < • • CD CJ CJ Ι- • • CJ 1- • CD • • ο • • CD • • <c CJ CJ 1- • • t— • CJ • Ç Ι- • • 1- CJ CD 1- • • ο 1- CD CD • • CD • • CJ • • CD • • CJ • l · - CD • • Ι- • • 1- • • CD • • Ο « • 1- • • 1- • • CJ • • 1- <c CJ <χ • • CJ H- 1- CJ • cj 1- CD < • • <C • ο • • CD • • CJ <c <C CD • • CD <c <c CD CJ CJ CD • • CD • • <C l · - 1- I— • • <C • • <C 1- • <c • • CJ • 1- P- • • CD <c <c 1- • • CJ • • CD • • <c • • l · - • CD <C • • CD <c • 1- • • CJ 1- • CJ • • CD • • <c • • < • CJ 1- 1- CJ • • CD CJ 1- <c • • 1- • • 1- • • CD • • <c • • CD • • CJ 1- • CD • • CJ 1- CD l · - • • 1- CJ JCJ CJ • • Ι- • « <c • CJ • • Ο • • CD • • CD 1- 1- <£ • • CJ • • 1- • • Ι- • • <C • • CD • • Ο • 1- Ι- • • CJ l · - • <c • • Ο <c 1- < • • <£ • • CJ • • CD • CJ CD 1- <c • • CJ <c CD CJ • • CJ 1- • CJ • • CJ • • <c • • CD • • CJ • • CD • • CJ 1- • Ι- • <—1 CJ l · - <c -1 CD • • -1 Ρ- • • Ο 1- • • LO 1- • • CJ Ο CD 1- ç • • CJ 1- CD LO CJ • • ΐ — 1 < l · - 1- r —— 1 CJ • • -1 <c • • -P P> + -> Π3 1- Π3 f— Π3 1- zz: I <c I r ^ J I I < I r ^ J I I <c I PJ 1 OO 1 oo 1 oo 1 OO 1 oo 1 oo 1 oo 1 OO OO Q_ CL Q_ Cl CL CL CL CL CL OO OO OO OO OO oo oo OO OO CL O_ CL CL Cl CL CL CL Cl. LU LU LU LU LU LU LU LU LU GO GO GO GO GO GO GO GO GO O O O O O O O O O

CJ)

10/52

1— cd o • F— 1- O O r -— 1 Ι- Ο CD • CD • O LO »—1 CD CD CD Ç • • CD CD OJ <£ CD CD • • <c O LO CJ CD CD CD • <c ç • CD • CD • <C • CD CD • CD <c I— • CD • CD <c CD CD CD • <c • CD CD 1- Ι- • CD • CD 1- CD • CD 1- Ο • CD CD 1- • CD • CD CD CD CD • CD • 1- CD <C CD CD CD <c CD CD CD • <c 1- <C CD • CD CD • CD <c CD <C CD • CD <C CD CD 1- 1- • CD • CD <c CD • CD CD • CD 1- CD • CD • CD CD CD <c CD CD CD <c CD • CD CD • 1- <C CD F— <C • <c <c • CD • CD • CD • CD F— CD 1- CD • CD • CD • CD CD • Ç • l · - • F— • CD CD • CD < CD • CD • CD F— CD CD CD • CD * CD • CD CD CD • CD • CD • 1- CD CD CD CD • CD • CD CD CD CD • CD • CD <c t— CD • CD • CD <c CD • <c CD • CD • CD • CD • CD <C CD ç <C • CD <c CD • <C CD • CD • CD 1- CD CD CD • CD • 1- • CD CD CD • <C • CD • CD F— <C <C • CD • CD F— CD • <c <c • CD CD CD • CD • CD CD CD CD • CD • <£ CD F— CD • CD <c CD • CD <C <C • CD 1- CD • CD CD 1- CD CD CD • CD • CD 1- CD CD • CD • CD 1- CD CD CD • CD CD 1- • CD • CD CD • CD • CD • CD • <c 1- • F— • CD <c CD • CD <c « CD CD CD • CD • CD <c CD 1- CD • CD • CD • F— CD • CD • CD <c CD • CD CD • CD O CD • CD • CD CD CD • CD • CD • CD <c 1- CD CD • CD CD CD • CD CD CD CD CD • CD • CD CD • CD • rH CD ç i — 1 CD • -1 1- l · - • -1 CD • O CD • LO CD • O <c <C • 1-0 CD CD I — 1 l · - • i — 1 CD • I CD 4-> 4-J + -> 4-> Π3 res (T3 res res | I I I I I 1 I I I I I I I 1 tO 1 tO 1 co 1 CO 1 co 1 to 1 CO 1 to 1 to to CL CL CL Cl CL CL Ci- Q_ Cl CL tO tO co CO (O to to (O to to Cl CL CL CL CL CL CL Cl CL Cl LlJ LU LU LU LU LU LU LU LU LU AND AND AND AND AND AND AND AND AND AND I I I I I I I I I I

<c c \ j

O)

Ll_

11/52

ο ο < ο 1- • ο ο • ο Γ— ο ο ο ο Ç • LO < • ο Ι- ο LD < • ο Ο co CD • CO ο 1- • τ} - ο • Ο • LO Ο CJ) 1- <c • ο • ο • I— Ο • Η- • <c ο < • <C Ι- • ο • <c • ο • <χ ο ο ο <Χ • <c • π- ο ο Ι- • Ο • ο I— ι- • ο ο • ο ο <c • ο • ο ο <c Ι- ο • < ο ι— CO • Ο ο ο ο • <χ ο • Γ— ο ο • ο • <c ο ο ο <c • ο • ο • ο CJ • ο • <£ • Γ- • ο CD • ο <χ <c • Ι- • 1- CJ ο ç • ο • Ο • ο Ι- • ο * ο ο Ç • ο ο Ι- Ι- ο Ι- • Ç • Γ— ο <χ Ο ο • Ο • <£ Ι- <χ I— • < • Ι- <c ο <χ <_3 • ο • Ο • ο ο ο ο <c • Γ— • Η- ο ο Ι- * <c • Ο ο ο • ο Ο • ο ο Ι- • ο • ο 1- ο • Γ— • Ο • Ι- ο Ι- Η- • ο • I— • ο • Γ- ο <χ • ο • ο • Ι- • ο Γ— ο Ç 1- • ο • ο Γ- ο • ο • π- < ο • Ι- CJ • ο 1- ι- • ο * Ο <χ ο • <c ο • Ι- • ί- <χ • <c • ί- ο Ι- • ο 1- <χ • Η- • ο • ο • ο ο • ç • <£ • Γ- ο <χ ο ο ο Ο • Ι- • ο Γ- <χ π- <c ο • Ο • ο Ι- • ι- • <c • <£. ο Γ— . · Ι- • ο • ο • Γ— • ο Ο • ο ο π- • Ο • <χ ο ο Ι- Ι- • ι- ο π- • <c ο Ο Ο • ο • ο • ο Ι- < < ο ο • ο • ο <Χ Ο < ο • ο • < ο <c <c • < • ο 1- ο • ο <c • < ο ο • <£ ο <χ ο ο • ο • < ο <χ ο ο < ο ο < * <c ί — 1 Ι- < ,-1 ο • ?-1 Ι- • <—I «X • τ — 1 ί- ο LO ο ο ο ο • LO ο π- ο ο • LD ο C \ J ο • CO ο • ΟΟ ο • ^ 1 " ο • * 3- ο

QC

C \ J σ>

Li_

-Η> + -> + -> -H> -n> Π3 Π3 Π3 Σΰ ro ΓΟ 2Ξ I I I r-j I χ: I r-j I | r-J I I r ^ j I 1 οο 1 οο 1 οο 1 οο 1 ΟΟ 1 OO 1 oo 1 oo 1 oo 1 oo ass Ass ass Ass ASS β- Ass Ass ass Ass οο οο οο οο οο ω oo oo oo oo ASS ass ο_ ass ass ass ass Ass Ass Ass Lul Lul Lul Lul Lul Lul 'Lul Lul Lul Lul Ε Ε ε ε ε AND AND AND AND AND r ^ j r ^ J r ^ j r ^ j ΓΊ r-j r < r ^ J r ^ j r-j

12/52 ò ò ο

LO

Ο

LO

O • • 0 O CD 0 CD • O <c • O Ι- CD CD • • 0 Ι- * LO Ι- • O 0 • LO ο 9 I— • • LO Ο • co Ο • x f— • X CD • CD • 9 1- 0 <C • CD • CD • O • • 1- • CD • CD <c f— • <c • • CD * Ι- 0 <C • <C • π- • • CD 0 Ο <c • CD P- ι- • • Ι- • ç 0 • H- • ο • • Ι- 0 0 <c CD • O • <c • • Ο P- CD • CD • CD • <c • • <c • CD • 1- <c 1- • O 9 • CD μ- 0 CD • P- 0 O • • O CD ι— • CD • 0 9 I— 0 CJ CD • O • Ι- <c 0 9 <c • • CD • 0 • ο • CD 9 <c • • 1- <c 1- • O • 1- 9 <c 0 Π3 t— • 1- • <c CD CD • O • • <c • t— 0 Ι- • <C 9 O • 9 0 • CD • Ο • CD 9 Ι- < Π3 0 • 1- • CD <c 1- 9 Ο • • 0 • 1- • CD • Ι- 0 O • • <c CD <c • CD • Ο 9 I— CD cn CD CD • <c CD <z 9 O • • <Z 0 1- • CD 9 P· 9 O • • CD • 1- • CD 9 CD 9 ί- • • <£ • CD • O • <C 9 ο • 9 CD • 1- 0 1- • O 9 O • 9 CD <c <c • <c • 1- CD Ι- 0 0 CD • CD • <c 0 CD O ο • • <£ • <C 0 CD • <c 1- <c • • CD • O • CD • 0 • P- 0 0 1- • CD • I— 0 CD 0 Ι- • • <c • 1- CD <c • <C P- ο • • <c 1- CD • • O P- l · - 0 0 CD • CD • 0 CD P- • O • • <C CJ 1- 0 1- • <c • O • • <c 1- Ι- • CD • 0 <c <c • « 0 • Ο • P- CD 0 • O • • 0 • CD • CD • Ι- • O • • ç 1- Ç O <£ ο • <c 0 frog 1- 9 O • 1- CD CD • O • • CD • CD • P· • CD • <c • • <C 1- Ç • CD • ί- 0 O I— cn CD • <C H- • ο Ι- • « CD • CD 0 O • 1- • Ι- 0 0 1- 0 P- • O • CD • Ο • • <c • l · - 0 <£ 0 1- • O • • you CD • rH <c CD τ — 1 O • r - 1 0 1- f— • • LO 1- • 0 CD • LO O • 0 CD • O • • LO 1- • LO CD • LO O • X <C • P P P + -3 P Π3 SZ LO frog SZ Π3 SZ Π3 sz ro sz IXl Σ3 IX! txl ΣΞΣ txl XI I I LO 1 1 1 1 1 l 1 1 CZ) CZ) ç CZ) CZ) CZ) CZ) CZ) CZ) CZ) CZ) Or CL ω O_ CL Cl CL CL CL CL Cl CZ) CZ) LO CZ) CZ) CZ) CZ) (Z) CZ) CZ) CZ) Cl CL ç O_ CL CL CL CL O_ CL · CL LU UJ 0 LU LU UJ LU LU LU LU LU ε AND 0 AND AND AND AND AND AND AND AND x IX] IX] rxi IXJ IXl 1X1 XJ 1X1 XJ

ο

C \ J

G)

L ± _

13/52

751 800 ZmEPSPS_Nat GCTCCTTTGG CTCTTGGGGA TGTGGAGATT GAAATCATTG ATAAATTAAT ZmEPSPS_ZM ..C ... C ....... G..A .. C..C..A ..... G ........ ... GC.G ..

O CD • O 1- < O CD • O CD • LO Ι- CD O CD • LO <C « O 1- <c co Ο • CT) CD • cn <C CD O CD • CD • <c CD CD • k — 1 <£ • 1- CD CD • F— CD • f- « CD • CD <c I— • 1- • CD • CD • CD • 1- <c <C • CD * 1- CD CD <c • 1- CD CD • CD ç 1- CD <c • <C • CD <c 1- • CD • CD <c <C • ç • 1- <c CD • CD • CD 1- CD • CD • CD • <c • CD • <C CD 1- • 1- • Ç CD CD • <C • CD • <C • CD • CD • 1- • CD • I— CD F— • 1- • 1- CD CD • 1- CD <c CD 1- <c • <£ 1- CD • CD • 1- CD CD • <C • H- CD F— • <£ CD CD • <t • <c • CD CD <c • 1- • ç CD 1- • CD • CD CD CD • H- CD CD • CD • CD • <c • CD • CD • 1- • CD * F— • l · - • CD • <C • CD • • CD • CD • CD CD ç • F— CD t— • <C ! - <c CD CD * <C • 1- • <c CD • <c CD ç • <c 1- CD <c • CD • 1- CD CD • CD • 1- CD CD • CD • O 1- CD CD • CD F— F— • H- CD <£ l · - • CD • ί- CD CD • f— CD CD F— ο; • 1- • CD • • CD • CD • F— « 1- • CD • <C • F— • CD t— <C • <c • 1- CD CD • CD • CD 1- <£ • CD • <C 1- <C • F— • F_ CD CD • 1- • CD 1- 1- • CD • <c • CD • <c • Ç • <c • <C • CD 1- ç • <c • <C CD r — 1 CD • i — 1 <c • k — 1 <c CD r — 1 CD • O Ι- • LO CD CD O <c • LO CD • CO Ο • CO F— • cn CD • cn CD • + -> + -> -P Π3 ro Π3 Π3 1 naked I Z. 1 naked I zzz I n-J | ί CZ) naked 1 (Z) 1 (Z) 1 CZ) 1 CZ) 1 CZ) 1 CZ) CZ) Cl CL CL CL CL Cl CL Cl (Z) (Z) CZ) CZ) CZ) CZ) CZ) CZ) CL Cl CL CL CL CL CL CL LU LU LU LU LU LU LU LU ε ε ε ε ε ε ε ε naked naked naked naked naked naked naked naked

Q

C \ 1

LL_

14/52

O CD • O CD • O <c • O CD ♦ O CD • ld <C • O • LO CD • O 1- • LO F— • O CD • τ — 1 <C CD -1 CD • ass CD • ass CD 1- <—1 Ι- • τ — 1 (- • — There CD • r "H (- ' CD I — í CD CD Ο • CD • CD <c O • 1- < CD • CD • <C • CD • (- • F— Λ CD CD <C • F— • CD » l · - • Ç 1- CD CD F— • CD • 1- • F— CD 1- • CD • CD CD CD <c CD • CD • F— ç F— • <C • <c • CD I— CD • CD • <£ • CD • Ç • <c • CD 1- CD CD <C • CD • CD • <c • 1- • CD The < CD F— • CD • CD • CD 1- <C • <c • <£ CD F— CD CD • <c • CD 1- CD • Ç • <c • CD • CD • <£ • CD • CD • CD • CD • CD • f— CD CD • <C CD CD • F— • CD • F— • CD CD 1- • <C • CD • <c 1- CD • CD • CD <c CD • CD • CD • F— CD CD • <C • <£ • 1- CD Ç • CD • O • 1- CD (- • CD • <C CD CD • 1- • F— • 1- CD CD • F— • CD • <c F— CD <C • F— • CD • CD • CD • CD CD F— CD <C • CD • CD • CD • CD • Ç • CD <c F— • CD • <C • CD 1- <C • <£ • CD • O CD CD • CD • CD • CD • CD CD • CD CD Ç • CD • <C <C CD CD • <c • 1- CD CD CD • CD • CD F— <c • O CD • CD CD <c • CD • < CD • CD • <c • CD 1- CD CD F— • CD • CD • CD • CD <c « Ç CD F— • CD • CD CD • CD • <c • 1- CD 1- F— • CD • CD • 1- • CD <C « CD < <C • F— • 1- CD <c CD • • <c CD CD 1- CD <C • CD • CD CD 1- • CD CD • t— CD 1- CD • CD CD <c CD • CD • CD • Ç F— • <C • F— • CD • i — 1 < -1 CD • t — 1 1- • r-H <. • r — H CD • O CD LO < CD O 1- • LO CD • CD (- CD O CD O 1- • i — 1 CD • r — 1 F— • ASS 1- • t-H F— t — 1 CD • t — 1 (- CD r — 1 (- • x — 1 CD • + -> 4-> + -> s: Π3 Π3 1ΣΞ <T3 ΖΣΞ Π3 σ I ru I I lu s: I ru 1 | ru 1 1 ru 1 OO 1 OO 1 OO 1 oo 1 OO l OO l oo 1 oo 1 OO oo CL CL Cl Cl CL CL Cl O_ CL CL OO OO OO OO OO oo OO oo OO oo Cl. Cl Cl CL CL CL CL Cl. CL CL LU LU LU LU LU LU LU LU LU LU ε ε ε ε ε ε ε ε ε ε ru r-u LU ru r-U ru ru ru ru ru

LU

CM

Í_U

15/52

Ο ο ç O • 1- O LO O O 1- • O O 1- · <c · The CD LO CD <C <C O CD O 1- CD • Ll_ Cri co O • co CD • laughs O • 1- • LO CD • ι — 1 <c • —I O < -1 O • -1 <c • r — 1 CD CD <c O CD • ç CD CD 1- Ι- • ί- • <c • O • CD • Ο • O) the mot ο • O • <c • <c • F— CD CD • 1- • O • CD O <c • Ll_ <c • ç O 1- Ι- • O • O • O <c <c • Ο • CD • CD • l · - • O • CD 1- 1- • O • <c • O • CD • F— • <c • O • <c • CD • CD 1- CD <c O O • CD <c <C • O • ί- • O • F— • 1- • O • ο • <c • CD • CD • O • <c • Ι- • CD <c <C • 1- 1- • ο <c <c • CD • ç CD 1- CD • O • CD • CD • ç ç • O 9 CD • O • O • O • CD • CD • CD • O • <c • CD • CD 1- Ι- O O • O • Ι- CD 1- • Ι- • O • CD • Ο • 1- • Ο • O O 1- • ί- • CD O O • O • <c • ο O CD • 1- O • O 1- CD <C • <c <c O CD • CD O <c O F— <c • O • CD <c <c • O • CD • O O CD • <c • r — l <C • Cri <£ O O • O • O • O CD LO <c O CD • <c • <c • 1- CD O • ι — 1 1- • < • O • O CD 1- CD • 1- CD O • 1- O Ι- • CD 1- CD • <c • O <c Ι- ο • CD • CD • <c • O Ο • CD • CD O <C • CD • ç • Ι- • <c • 1- <c CD • <c • O • ο O <c • CD 1- CD • ç • <c • t— <c O • 1- • Ι- • CD CD O • <c O 1- • I— • Ο 1- 1- The O • O • CD 1- CD • O • CD • <c • O • O • CD • O • CD • <c • <c O ç • CD • 1- <x 1- • <£ O O • <c • CD • CD • 1- • Ι- • O • O • 1- • CD • 1- CD Ο • O O <c 9 ç • CD 1- CD • r — 1 < O i "Í Ι- • -1 O • -1 CD <c -1 <C • ΐ — 1 ç • LO O O Ο • LO O < O CD • LO CD * CD CD ç Cri < • co O <c CO CD CD • CD <c LO CD CD τ — 1 O • r ^- —1 <c • —I CD -1 O • CD • "-1 <c F— 4-> -M 4-> 4-> 4U> Π3 Π3 zs: Π3 2Z Π3 Π3 Π3 I laugh I I laugh I I laugh I I laugh I 22 I laugh I | laugh 1 co 1 CO 1 co 1 co 1 co 1 co 1 CO 1 co 1 CO 1 CO 1 co co Q_ CL · Cl. CL · CL CL · CL · CL CL · CL · CL D_ co CO CO co co CO CO co CO co LO CO Cl. O_ Cl. Cl. CL CL CL CL · CL CL · Cl. CL · LU LU LU LU LU LU LU LU LU LU LU LU AND AND AND AND AND AND AND AND AND AND AND AND laugh laugh laugh laugh laugh laugh laugh laugh laugh laugh laugh laugh

: ........... fay

16/52

GmEPSPS_Nat ATGGCCCAAG TGAGCAGAGT GCACAATCTT GCTCAAAGCA CTCAAATTTT GmEPSPS_GM ..... T..G. .CTCTC.C .. T..T ..... C ..... G..T. .C..G..A ..

O AND- O O <c CD O AND- • O CD » O <c - LO <c • O CD • LO CD 1- i — 1 O • r — 1 <c • CM CD AND- CM CD » AND- • 1- CD CD • AND- 9 1- <c <c • CD • CD (- 1- • CD • CD <c CD • O • O • CD • <C • CD Ι- 1- • AND- • <c AND- CD ο <c • 1- CD CD CD f— <c CD 1- <c • 1- < AND- • CD • <c • CD CD <c • CD • <c • CD • <c • CD • CD • CD • O 1- • CD • CD • AND- • <c • CD <c Ç • CD • CD 1- <C • <£ • O 1- CD • <C • CD • O • CD • CD CD • H- • AND- c_> CD CD • <c O CD CD CD • <C H- <c • 1- Ç CD CD CD • <c • <c CD AND- • CD • O <c <c • CD • CD CD ί- • <c • f— CD CD 9 ο • <c AND- Ç • CD • <c • CD CD <C • CD 1- ç • 1- Π- • CD • ç • CD <C Ε- • CD • CD • CD • Ι— • CD <c < • CD • 1- CD 1- • ç • CD <c <£ • CD • CD <c CD • <C • CD CD O • CD • <C CD CD CD AND- • CD • CD • 1- <c O • CD • CD • <C CD <c • Π- CD • CD • <c • Ε- CD AND- CD • O Ι- Ι— <C • AND- CD CD Ο CD • Π- CD CD CD AND- <c CD CD Ε- • AND- <C Ι- CD CD • Ι— • V— ç Ο • CD • CD CD AND- • AND- • <C CD CD CD • I— • CD • <C Π- CD <c <c • CD <£ n- <C • CD • CD • CD • Ç • O < CD * CD • Ε- CD O • -1 <£ CD -1 AND- CD t — 1 Π- • t — 1 O • O CD 1- LO <C • O Ι— • LO AND- CD r — t 1- • t — 1 <C • CM CD CD -H> + - ^> + -> frog frog frog frog σ: | CD I I CD I I CD I I CD 1 oo 1 GO 1 GO 1 GO 1 GO 1 GO 1 GO GO CL CL CL C-L CL CL CL CL GO GO GO GO GO GO GO GO CL CL Cl CL CL CL CL CL LU LU LU LU LU LU LU LU F F F F F F F AND O CD CD CD CD CD CD CD

<c

QC

O) • · · • · · ······· ·

17/52 ο

ο co

LO

CD

cd I— l— <c o <c o I— c o <C • 0 • • • 1- CD O LO CO 0 · 1- The CD 1- · 0 · 1- · O O 'laughs * 1- F— <c 0 F— 1- <c 1— c CD • • • « <c • • • • O LO ç <c <c 0 <c CD 1— <c CD <C CD • • 1- • • 0 • • CD CD O LO CD <C Ι- ο <c 0 0 0 1— 1— O 1- • • • 1- • • 0 • O CD 1— O • • < • O • CD • CD <£ <c • F— • «SC • 1- 0 ç « CD • CD • F— • Ι- • CD • CD t— F— • CD • Ο • F— 0 1- • 1- • CD • 1- CD CD • CD • <c CD CD Ι- • Ç • F— ç CD 4 » O • Ο F— 1- • CD • CD • Ι- • CD • <c • O <£ CD 0 Ο • 1- • 1- • CD <C CD • ι- • 1- • CD F— 1- • CD ♦ ι- • F— • 1- • O • F— • ο 1- F— • 1- 0 ç CD CD • <c • <c • CD 0 CD • ί- • CD • <c • 1- • CD • ο 1- <C CD CD • 1- 0 1- • CD • <C • O <c 0 ♦ Ι- • CD • Ç • 1- • <c • Ο • • O <c <c • <c • O 1- <C • F— • < • 0 F— 0 • CD • <c • 0 F— <c • • F— 0 O CD 0 • CD • CD ç I— • O • F— • <c 0 CD • CD • O • CD 0 1- • <c • (- 0 <£ • Ι- • CD • <c 0 1- • <c • Ο • (- <c 1- • <c CD • F— 1- • 1- 0 F— CD 1- O CD CD • <£ CD 0 • I— • 1- < 1- • O • CD • I— CJ CD « 0 • CD • <C • CD 1- <C • <c • 1- CD <C • F— • <c <c 0 CD • O • CD • 0 < 0 • CD • <c • O 1- 0 • <c • 1- CD <c • I— • 1- <c • Ι- • <c • <c • CD ç CD • Ο 1- 0 1- CD <c CD • CD • CD • 0 • <C • CD • CD <c 1- • <c • CD • r — 1 <c 1- 1—1 <C • r — 1 <c • T — 1 ç 1- CD 1- O 0 • LO CD • O 0 (- LO 0 • O • CO 0 CO 1- 0 'laughs' <c • 'laugh <c • P -P -P -P -P Π3 fO Π3 2Ξ <TJ <T3 22 I CD 1 1 CD 1 1 CD 1 1 CD I I CD 1 CO 1 LO 1 CO 1 CZ) 1 CO 1 CO 1 CZ) 1 co 1 LO 1 LO CL CL HERE- Cl CL Cl Cl CL CL CL CZ) CZ) CO CZ) co CZ) CO co CO LO CL CL CL Cl CL CL CL CL Q_ CL LU LU LU LU LU LU LU LU LU LU AND AND AND AND AND AND AND AND AND AND CD CD CD CD CD CD CD CD CD CD

OQ

CO _σ>

........ ··. ···: ···::. * ·: ·. ·. ·

18/52

0 CD · O CJ 1- 0 1- • 0 < • CJ ç F— LO O CJ CD • LO F— CJ 0 0 • LO CD • LO I— · CO < • CO F— • CD • O- CD • <c · < 1- <C • CD • CJ * 1— <c O • CD • 1- CD CD • CD · CD • CD 1- 1- • CD • CD · 1- CJ CD • CJ ί- Ι- <c 1— <c < • CD • 1- ο Ο • CJ < • 1- CJ 1- • CJ * CD · <c l · - 1- 1- • F— CD

F— <C CJ • CD · CD · <c <c 1- CJ CD t— • F— O • ί- <C • F— • CJ • F— • <C ο <c CJ CD • CJ • F— • CD • CD • CD • CD <c <C • CD • CD • <C CD CD • CD • <C • F— • CJ <Z • F— 0 CD • Ι- • CD CD <c 1- • CD • Ο • ί- CD <C • CD • CD • CJ 1- ο • CD • F— 0 <C • F— • CD • <C • Ç • <C • F— • 1- • CD • CD • CD • <C CD 1- • <C CJ <C • CD • (- • CD • CD • CJ • CD • F— • CD CJ 1- • CD • F— O l · - CJ F— • < • F— < <C • <C « CD * < • CD • <C • <c • Ι- • CD • CD • ç CD CJ <c Ο • CJ <£ Ι- • 1- • F— • CD • O F— Ι- • CD • <C • < F— CJ • Ο • 1- ç CD CJ • CJ • ç CD CD • <c • CD • CD • <c • O <c CD • <£ CJ 1- • 0 • 1- CD 1- • CJ • CD • CD ç F— • <c • <c • CD CJ) • CD • CD • CD CJ CD • Ç • F— 0 <C • 1- * CD • 1- < O • <C • 1- CJ 1- CJ 1- • O •

Ο

CQ _CT>

Ll_

<c ·. O · <c • O · <c CD 1- CJ CD CD • 1— 0 0 • 0 CD 1— <c f— <c CD · 0 • 1— <C CD · F- • CD · F— 0 <C · <· O Ι- 1- · CD • <C · <c · <c ο O · Ι- • CD · CD · 1- • CD · Ο 1- CD C 1- · CD • 1— CD <c • i — 1 · t — 1 O • r— ((—— · »—L <c. • CJ <C LO <C cj <c • LO CD cj <c • LO 1— LO O co F • LO CD < 0 •

+ -> + ->

fO ΣΞ ra Σ

ΞΞΣ CD 2: CD

4-> 4— * ra Σ Π3 ΣΞ

CD CD + Play ΣΞ CD

tO CO to to ASS Cl. ASS ASS tO CO to to ASS ass ASS ass UJ LU LU LU

to to to to to to ASS ASS ASS ASS CL ASS to to to to to to ASS ASS ASS ass CL ass LU LU LU LU LU LU

CD CD CD CD

CD CD CD CD CD CD • · · · · · · ·

19/52 gone

O O I— o • The C O O 1- CD CD • the c LO CD O O CD K- • CD LO 1- · 00 O • 00 <C · cn CD <c (3rd CD < O <c • I— • CD · CD 1- · you CD • I— • 1— CD <C CD· · K- CD I— <c 1- · CD • CD Ι- CD • O • CD · CD Ι— · CD • CD • H ~ CD 1- • K— CD <C • I— • 1- · <c • CD Κ- <c • O • Ç · CD • Ι— · CD • <c • CD C 1- • K— 1- O CD <c Ç · 1- « CD · 1- I— CD · CD <c CD · CD I— CD K- · <C <CD K- O CD • 1- · <C • 1- · 1- • <c · CD CD <c · Κ- 1- • <C CD H- • CD · Ι— CD CD <C · CD • CD CD < CD · 1- CD CD CD CD CD · CD • 1- 1-

K- <c

CD CD CD • CD • <c • <c » <C 1- K- CD I— • Ç • CD • CD CD <c • <c • I— CD CD • CD CD CD • <c • CD • CD • Ç 1- 1- CD <c • CD • 1- <c Κ- CD CD • CD • 1- CD CD • Ι— • CD • Κ- CD CD CD CD • Q CD • 1- CD Ι— • <£ 1- 1- CD CO Ç CD 1- • CD • CD • CD CD CD • CD 1- 1- • <c • 1- <C 1- • 1- • <XZ • CD • CD <C CJ) <c • CD • 1- • <C • Ç • CD • CD· * ç 1- <c • ç Ll. CD • < CD CD • CD • CD • 1- CD 1- * CD • CD <c <c CD • 1- CD 1- CD- 1- • 1- 1- • 1- CD 1- • CD • CD •

LO

Γ-

CD <C CD • CD • Ι- CD <C • 1- • CD « 1- CD ο • <C • CD • 1- CD CD CD 1- • <C • ç • CD • 1- <c 1- • <C • <c • CD « 1- • 1- « CD • <c • < CD 1- • CD • 1- CD CD 1- CD • <c • <c • CD • K- • CD * CD • CD * CD • CD • CD • I— • CD • <£ • CD • 1- • τ — 1 CD K- I — 1 1- CD t— < CD • Ç • O <c • LO <c CD O K- • LO CD • <C • co 1- CD 00 <c • cn K- • cn 1- CD 4-> 4-> -K> + -> -K> Π3 2Z Π3 ΖΣΖ Π3. Π3 zez Π3 SZ 1 CD 1 I CD I CD 1 I CD I s: I CD I 1 oo 1 oo 1 OO 1 oo 1 oo 1 OO 1 oo 1 oo 1 oo 1 00 CL CL CL Cl Cl CL Cl CL CL CL oo oo OO OO OO OO OO oo oo oo CL CL CL CL Cl CL CL CL CL CL LU LU LU LU LU LU LU LU LU LU and AND AND AND AND AND AND AND AND AND CD CD CD CD CD CD CD CD CD CD

20/52

Al

O F— O O l · - · O <c · O CD F— O 1- · LO O • O 1- • LO O • O π- • LO O • O ç • <c CD -1 < • C \ J ι- O C \ J <C CD t—) O O r —- 1 < • -1 < • you ο • τ — 1 O • CD • ç • CD • 1- • <c • CD • <c • <C • CD • CD • (- • 1- • CD • <C CD 1- • CD • CD • <C F— <C • ç • CD • t— • O • <C • O 1- Ι- O <c • 1- • <£ • O • Ο • CD • CD • CD • CD • < • <C I— CD • O <c 1- CD F— <c CD • F— • F— • F— • F— • CD • <C • CD • CD • <£ • CD ç O • CD • 1- • <£ O <C • ç • Ι- • <c • CD • O 1- O ο CD CD • CD • <c 1- F— • F— O <c • <C • 1- • CD • f— • O • O • F— O CD • F— • O • CD • 1- O Ç • F— • CD • F— O CD O <c • F— O 1- • CD • <C 1- 1- ç <C <c • CD • O <c O • CD • CD • F— • CD O CD • <C • <C • O • <C ί- < O CD • Ç • CD • <c ο CD • O <c O 1- <C O O • CD • <C O <c • F— • ç • CD • O • <c • O • O • F— • O • CD * <£ O CD • <C • <£ F— F— • 1- • CD • CD <c O • <C • 1- O F— O <C • O • F— • 1- • CD * <C * <c • <C • ç • ί- • <c CD CD • <C • 1- • ο • <c CD • O CD 1- O CD • CD Ι- O 1- • CD O CD • 1- CD Ο • CD • ç f— 1- O F— • F— • 1- • O 1- <c • O Ι- (- CD <c • O • <c • (- Ο 1- • CD • CD • 1- CD F— • CD • 1- < l · - O F— • CD • Ι- O F— • CD • CD • CD Ο • <C • Ç • Ι- O < • <c • O «X <c CD Ο • CD • O <c CD • x — 1 O • χ — H <c • —4 Ι- • -1 1- • r — í CD • O Ι- • LO O <c O Ο • LO CD • O <£ CD O Ι- • O F— • -I CD • ι — H 1- O C \ J CD • <—1 Ο • -1 <c • - · ί F— O t — 4 CD • <—1 O ç 4D + -> 4-> 4-> O ro v Π3 ΖΕΞ ΖΣΞ Π3 Π3 CD I 2L I CD I -2Z I CD I Z. I CD I ~ Z I CD 1 oo 1 oo 1 OO 1 OO 1 OO 1 OO 1 oo 1 OO 1 oo OO Cl_ ASS Ass Ass ass Ass ASS Ass Ass Ass OO OO OO oo oo OO oo oo oo OO CL Q_ ASS Ass ASS Ass ASS ass ass ASS LU LU · LU LU LU LU LU LU LU LU ε ε ε ε ε ε ε ε ε ε CD CD CD CD CD CD CD CD CD CD

LU

CO

21/52

• · · • · • · · · · • · · • · · · • • · * ········ ·: ·:. ·. • · • « • · · • • · · · • * * • · · • • · · · »· · * · · · · • · * · «······ * · · • ·

The CD • CJ O LO <c ç CD • O 1— · The c O LO CJ CJ • 1- O CJ <c 1- P- • O CD CO Ι- • CO CD • CD <c • LO CD • t — 1 Ο • —I < CJ —Ϊ P- • i — I CD • X — í <C CD P • CJ • CD • P- CJ CJ • < • <C • P- CJ <c CJ • CD • CJ • CD • CD • 1- CD < CD CD • Ι- • l · - • I— • CD • Ι- • Ο 1- <c • CD • < • ο 1- <c • P- • P- CJ <C • 1- • 1- • <£ 1- <c • 1- • <c • P- CJ CJ • CD • < • <c 1- < • <C • CD 1- Ι- <c CJ • CD • CJ • CD • Ο • CD • ί- <c <c • CD • CD • <c CJ ο • CD • P- CJ Ι- CJ 1- • CJ • <C P- CD • Ο • ç • 1- <c 1- • 1- • < • CD • CD • <c • Ι- • CJ 1- P- • CD • Ι- • Ο • CJ • <c • <c • ο • CD • O • CD • <c • CD • P- • 1- ç CD • CD • <c CJ CJ • CD • <C CJ <c • CJ • CD • CD • CD <c <c • <c • 1- • 1- CJ Ç • CD • CJ CD 1- • ç • CD • π- CD P- • CJ • < • P- CJ ι- • CD • 1- ç Ι- <c CJ • ο • 1- • CJ • Ο • ç • <c CD <c • P- • CD • CD <c CJ • <c • CJ • P- CJ <C <c • CD CJ P- • 1- • CD • <c • 1- • 1- • P • Ç CD CJ • P- CJ <c • <c CD Ç • CD • <c CD CJ • Ι- • < • <C 1- <c • CD • Ο • P- • CD • <c • CD • <c CJ l · - • CD • CD <c 1- • CJ • CD • <C CD <c • <c • CD • <C 1- •P- CD • CJ 1- 1- CJ CD • CJ • Ι- • CJ • 1- • <C CJ CD • Ο • CD • CD • CD • ç • CJ • CD • 1- • CD • <c <c 1- I— • P • 1- • ç P· CJ • <C • CD • 1- <c CD • CJ • <c • t — 1 P • i — 1 CD CJ -1 <C CJ Ι- <c -1 CD • LO CJ • CD <C 1- LO CJ CD O ο • LO <C • C \ J CD • CO < 1- CO Ι- • ^ f · <c • CJ • x — 1 P CJ r -— t CJ « t ^- H Ο P- "-1 1- <c -1 ç P P P P P Π3 Π3 Π3 Π3 ΙΣΞ Π3 I CD I I CD I I CD I I CD I I CD 1 oo 1 OO 1 OO 1 OO 1 OO 1 OO 1 OO 1 oo 1 OO OO CL CL CL CL CL CL CL CL CL CL OO OO oo OO OO OO oo oo OO OO CL CL Cl CL CL CL CL CL CL CL LlJ LU LU LU LU LU LU LU LU LU AND AND AND AND AND AND AND AND AND AND CD CD CD CD CD CD CD CD CD CD

LL—

CO σ>

LJ_

22/52

··· • · »·· ·· • · · ···· • • ··. ······· ·· · · -. • · · · ·· • · • · · • i · · · • · · • · · • «· «·· · · * • · • · • · · · ······· »· • ·

ο <c · LO CD • LO f— • τ — 1 1- • Π- « Ο • <C • F— • CJ 1- Ç • CD • F— < O • CJ • F— CJ F— • 1- • <c • CJ • <c • CD • <C • <c • CO <c • CD • <c CD CD • LO π- • <C • i — 1 ο • CJ F— <c • CJ • CJ • <c • CD • CJ H- ç • CD • <c • π- • <c • ι- CD CJ • ο • <c • CD • ç CD Ι- ç H- • Ο • F— • CJ • CD <c F— CJ CD • <C • Ç • CD • CD ç CD • <C • <C • CD • <C • π- CD CJ • ι- • 1- • ο 1- ι ^- 1 < • i — 1 O 0 ο CJ CD LO 1- • LO CJ • LO CD • r-Η <c i — 1 <C CD 4-> ro res I CD 1 1 CD 1 to 1 CO 1 to CO Cl CL CL CL CO to to CO CL CL CL CL LlJ LlJ LU LU ε ε ε £ = CD CD CD CD

Ο co σο ΜΒΜ 1_1_

23/52

··· • · ··· ·· • · · ···· • . »« ········ ·: ·:. · • · • · • · · • í · · · ··· • · · • · · ··· ·· • • · ·· • · · 5 ······· · · ··

O LO <c CD CD O CD CD Ç CD O O • • <£ • • • • • • • • » 1- • • <c • • O O τ-Η CD CD CD <C CD CD < < CD < • • • • Ι- σο 1- • • • < • • CD CO Ι- σο • CD LO —f CD Ι- σο co <c CD CD <c CD CD • CD • · • · The <c O ASS CD O CD CD O ç CD CD Ι- • 1- • • • • • • • <c 1- l · - • • • • • O 1- • • • • • • <c • • CD • • CD • • σο • CD 1- • <C • • <C • • CD • CD • • CD • • CD • • <C • CD • • CD • • CD • • CD • Ι- CD <c 1- • Ι- • • O <c 1- σο Ç • σο 1- 1- O • CD <c CD • Η- <c • • CD • CD ç CD < • CD • • O 1- 1- CD • • Ι- • • CD • • Ι- • CD ç • σο • 1- <C • • σο • CD <c l · - <C • • CD • • O CD <c 1- • • CD • • CD • ç Ι- • <c • • O 1- • CD • • Ο • CD • 1- <C • • CD • • CD • <c • • < • • <C • CD CD • CD • • CD CD <c <c • • 1- • CD 1- 1- Ι- • • CD • • O CD CD CD σο • • CD <c • O • CD CD l · - • <C • • O • CD 1- 1- • • CD • • 1- • CD • • CD 1- • <c • CD • CD • <c CD • • Ι- • CD 1- <c CD • CD • • Ο 1- 1- CD • • <C • CD • • CD • O • <c CD • 1- <C • • O • <C CD CD CD • • CD • • CD • <c CD • • Ι- • • CD 1- 1- <c • CD • <c σο 1- 1- CD • • CD • <C CD • Ι- • Ç • • CD <c <c <C • • σο • • 1- O O CD • • CD • • CD 1- 1- CD » • O <c < <£ l · - • CD • • CD • • CD O ç O • • CD • • CO • 1- Ι- • • O • • CD 1- 1- Ι- • • Ο • 1- CD CD 1- CO • Ι- • • Ι- • • CD CD CO • • ο • 1- σο • • <C 1- • CD • • ç • • ο • • CD • • 1- • • ί — 1 <£ • • -1 ο CD <c 1- • • χ — 1 < • CD O <c 1- LO Ι- • • r — 1 <c • • LO CO • 1- τ — 1 1- • • r — 1 σο • • + -> O 4-> Π3 1- Π3 1- ro l · - Π3 1- 1 ru I <t I I ru I <c I I ru 1 <c I s: I ru I < 1 ί — H 1 "-1 1 t — 1 1 "-1 1 i — 1 1 t—) 1 r "l 1 i — 1 1 Γ “ 1 <—1 1 r-H CD QC QT CD CD CD CD CD Hi CD CD Qi <C <c <£ Ç <c <C <= t <C <c <C <C <c QC QC QC QC QC QC QC QC QC QC QC QC

<C

LL_

24/52

Í6 +

O CD * • O CD CD CD O <C The The O CD « The LO O The • CD CD The • LO CD • • CD CD F— F— CM <C • • CO < The The CO <C CD • • = T CD The The Ι- • • CD <c <£ CD • The <C • The ο ç 1- CD The The CD The The CD CD The O • • CD ç <c CD The The CD The The CD • • CD < <c <C The The CD The The O • • <C The The CD The • CD 1- F— <c • • CD • The CD <c The <c The The ç • • CD f— 1- 1- The The CD The The O <c F— < • The CD F— The CD The The CD • • CD • The CD The < <C The The CD <c The CD < 1- CD The • < The The <c F— I— CD • The 1- The • CD <c ç CD • CD ç The CD The The CD The The CD • • CD • (- <C • The CD The The CD • <c CD The The <C • The CD The The <C • • CD • The CD CD < F— The The <c • • CD <c The F— • The CD F— F— CD • • O • CD CD • 1- CD CD <C CD • • 1- The <X CD CD 1- CD The The f— • The CD The The f— • The CD The The CD CD CD F— The The CD • • CD The 1- CD • • CD The The CD 1- 1- F— The The CD • • CD The The Ç The • <C The The CD <c <c < The The CD • • CD CD 1- CD • The F— The The CD CD F— F— The The CD • The CD CD CD CD V The CD • The CD 1- 1- 1- The The <C The The F— CD <c O • The CD The The CD F— The CD • • CD • The O < 1- • CD The The CD • The CD The The F— • The CD CD CD < • The <C • The CD The CD F— • 1- • The CD CD CD F— • The CD The CD • F— CD The The CD • I— CD <C 1- CD • The F- The The CD CD The F— * The CD * • CD The CD The The CD The The CD • <£ <C The The <c • • CD F— F— (- • The CD The The CD CD The CD CD CD <C • The CD CD <c CD • CD < 1- F— CD CD l— CD The The F— • <C CD <c CD • • CD The The CD • CD <C The CD • • CD 1- <c CD • The Ç • CD 1- 1- CD The The CD • The CD The CD • The < The The CD CD ç F— The CD • • CD • The F— • • 1- The CD CD The CD The The CD The F— CD The ι — 1 1- • The -1 1- • The 1—1 <C CD • i — 1 CD The O CD • The LO CD 1- The CD CD • • LO CD The CM CD ç CM <C • The CO CD • The CO CD The Ç -F • F -F • F <T3 F— (T3 F— fO σ 1- Π3 s: F— I ΓΜ I <C I 1 txj I <£ 1 Z. I ΓΜ I <c I I ΓΜ I Ç 1 τ — 1 1 t — 1 1 r — H l t — 1 1 r— ( l t— ( 1 r — 1 1 t — 1 1 r ^- 1 1 ι — '1 1 i — 1 i — 1 QC QC QC CtC Qi oc cé. Qi Qi QC oc oc < <C <C <c <C <C Ç Ç <C ç QC QC QC QC QC QC QC QC QC QC QC QC

OQ 'xT

CD

Ll.

25/52 ο

LO 'laughs'

Ο ^ laugh-

CD • l · - O <c O 1- · · O • • O CD • • LO CD • • CD • • ιο CD • ♦ LO f— • • CD • 1- ί- • • <c • • CD • ο • • CD <c <c O <c • CD <c <c <C • CD CD CD <C • • CD • • 1- • • CD • • CD • 1- O • • CD • • CD • • CD • • CD • • <C • • f— • • 1- • • CD • F— Ç • • CD • 1- 1- • • O <c 1- 1- • • CD • • CD • 1- • • CD • <£ CD • 1- CD • CD • • O <c <c CD • • CD • CD • • CD • • CD <c F— CD <£ • CD 1- <c F— • • O 1- <c 1- • • CD • • CD • • CD • • CD CD <c CD • • CD • • 1- • • CD 1- F—. <C • • CD • • CD • • CD • • CD K- F— CD • • F— CD CD CD • • F— CD CD <C • • CD • • ç • • CD • • 1- • <c 1- • • CD • • CD • • <c F— • CD • • CD • <c CD • • 1- • • CD 1- 1- CD • • CD 1- 1- CD • • CD • F— <£ • • CD • • 1- • • <C • • CD <c 0 CD • CD 1- CD • • CD < • CD • • CD • • CD • CD • • <C H- F— CD • • CD • 1- 1- • • CD • <c < • • CD • • ç • • CD • • CD <c 0 CD • • CD • <az CD • • CD • • <C • • CD • • Ç • <C • • CD • 1- CD • • CD • • 1- • • CD • • 1- • • CD π- • H- • • 1- • • CD ι- <c <C • • CD <c ç CD ο 0 CD < <£ CD • • <C I— 1- CD • • CD • • CD • 1- CD <c • -1 CD 1- • -1 Κ- • • τ — 1 CD • 1- LO CD • • 0 Ι— • « LO <C CD F— • • 'laugh CD • • 10 CD • • LO CD Ç + -> + -> -K> -K> Π3 2Z 1- Π3 F— ro F— Π3 sz 1 ru 1 <c I 1 nri 1 I I nri 1 <C 1 I ru 1 5—1 1 τ — 1 1 ç — 1 1 t-H 1 τ ~ 1 1 τ — 1 1 τ — I 1 ,-1 1 χ — H 1 1—1 <—1 CD Hi Od QC Qi QtC oo CD. QC CL <C <C <c <5; <C <c <c <C ç Ç QC QC QC QC QC OQ QC QC QC QC QC

CD

BARl AT AG

26/52 <c

O CD • CD CD O <c CD CD t— O CD 1- F— O 1- LO F— • cm CD • 00 CD • • CD CD • r — H CD • <- ( CD • • C \ J CD F— CD CD CD f— CD CD • t— CD CJ « CD <c • • <C • • • 1- * • <C ç • • CD • • <c • • CD CD • CD * CD • » <c ç CD CD CD 1- CD • • Ç • • • 1- • • • 1- Ι- • • CD • • • 1- CD CD CD Ο • < CD <c < • CD • <c CD CD 1- • • <C • CD • 1- <c • • CD • <C • • <£ t— CD <c <£ CD O F— 1- CD « • • CD F— • • CD • 1- • • • CD CD • • Ç • Ç • • • F— 1- CD <c ç CD • CD CD <c 1- CD CD • • Ç • CD • • • CD F— • • <C • CD • • • CD <c CD F— CD CD 1- • • CD CD CD 1- CD • • CD CD

CD

CD c c CD <C CD CD I— CD <c

F—

CD l · - 1- <c F— <c • k 1- CD CD CD CD < <C CD • · 1- CD CD • CD < CD C CD CD F— <C • < <C • • 1- ♦ <C <£ F— CD Ç • CD • • F— • CD 1- • • • Ç • • 1- CD · CD • • • < • CD • · · CD • H- 1- CD CD <c • 1- • <c <c CD • • k 1- • • • CD CD · · 1- • • • 1- CD CD • 1- • · ·

1- CD <c <c CD 1- k ç CD 1- 1- I— • CD CD • F— • • • CD • • • CD O • CD ♦ 1- CD O CD • » • F— • • F— CD CD CD CD CD O < CD • • • CD • <C • • • CD • • • CD • • k F— • <C • • • CD 9 • • 1- • • 9 <C • CD • 1- • F— CD O CD CD CJ CD 1- CD k • CD CD • • CD 9 • • CD 9 • CD F— CD • 1- • • F— • • • 1- 9 • Ç CD k • CD 9 • < CD O O CD <c CD CD CD k ç 1- 9 • • <C • • • CD CD CD <C < 9 • • Ç * • • 1- • <c Ç <C k < 9 CD • CD C £ 3 ç F— CD CD ç F— k CD CD CD 9 • CD • • • CD CD • • F— CD « <£ 9 O CD • ç • 1- <C • k CD k • F— CD F— 1- <C O • I— CD CD <c 1- k CD <C CD • <c • • F— • k CD ç 9 <C • • O » • <C • k CD k • 1- <c ç • 1- CD • <c CD 1- CD t— F— k <c CD 1- • • CD • CD CD CD • CD CD k CD • Ç • 1- 1- CD • • CD k Ç < • CD CD • • CD 1- O k F— <£ Ç • • CD • CD <£ • k 1- CD • CD • * J— • <C • F— • ♦ CD • • CD • CD CD <c CD • • 1- O k ç CD k <£ CD • • <c • • • CD • k k 9 • CD » • < 9 • CD • • • CD • • CD • • CD CD ç O -1 < O CD CD —I 1- k CD CD 1- • -1 CD CD • 9 C \ J CD • 9 00 CD CD CD <c • UD F— <£ • -1 CD <c ç r — 1 1- CD <C Ç

LO cp

Ll_

Ç OJ <C CD Ç OJ Ç CD < ΓΜ CD <X ΖΣΞ <C CD 1 1 1 I 1 1 z: 1 1 1 OJ 1 1 'U' • u- ^ t · 'U M- 'U- M- 1 U- CL cl CL CL Cl CL CL CL CL CL CL cl CL M- CL cl CD CD CD CD CD CD CD CD CD CD CD CD CD CL CD CD CM C \ J CM C \ J CM CM CM CM CM CM CM CM CM CD CM CM CL CL CL CL CL CL Cl CL CL CL CL Cl CL CM CL CL F— 1- 1- F— F— 1- F— 1- F- F— I— 1- 1- CL F— 1- CD CD CD CD CD CD O CD CD CD CD CD CD CD CD CD

• · · · · ·

27/52

O 1- • CD O ç CD CD O O CD <c • O ι— O 1- • • co ç • • CN <c • • 00 <C • • - co < • • oo CD • n CD • • • Ν ' <C • • • O CD • 1- 1- <£ <c CD • CD CD 1- CD ass • • CD 1- • • CD • O < • • CD • < • CD • O • CD • CD I— 1- CD CD ç CD 1- 1- 1- f— • CD • O CD 1- • • CD • <C • 1- CD CD • • O CD • CD CD 1- <c CD • CD 1- <c O • • CD <C • • CD <C • • • CD • Ç • • CD < • CD • CD 1- 1- CD CD CD 1- CD 1- CD O CD CD • 1- • CD • <c • • CD • CD • • CD • • CD • • O CD 1- 1- 1- • ç CD CD CD CD <£ CD 1- <C CD CD • CD • CD 1- Π- <C • CD • <C • <x 1- CD CD CD <c H- 1- • CD CD • 1- • CD • <c • f— • • CD • • 1- CD • <c • <c • • CD • CD • CD • 1- CD • CD • CD • • 1- • • (- • • CD CD • 1- • <C 1- 1- O CD CD • CD CD H- • CD <c CD 1- CD • CD 1- • 1- • CD Π- • <C ç • • CD <C O 1- 1- 1- CD • CD 1- CD • 1- 1- CD • CD 1- Ç • • <C • 1- • <£ 1- • • CD • • CD • < • CD CD CD CD • 1- CD <c CD 1- • 1- < « • CD • CD <C CD • • <c • • t— • <C CD CD • • CD • CD <C CD • • CD CD 1- • ç <C • CD CD • • CD • CD • • • <£ • CD • <C <c CD <c CD • • • CD • CD 1- 1- CD <C • CD • CD • Ç CD <C < • • <£ • CD CD CD • <C • CD • 1- CD CD <c l · - CD 1- Π- 1- CD <C CD CD • • CD CD 1- CD • <C • • l · - <C CD CD • The <C l · - • CD <c 1- 1- 1- CD CD CD • CD H- • • 1- F- • CD <c • • CD 1- <C CD CD <c • CD CD 1- CD CD f— 1- CD 1- ç 1- CD • • CD O CD CD CD • • CD • • H- <£ <c CD CD • <c CD • <c CD ç <C CD CD 1- 1- CD CD • CD • <C • • CD • CD • • CD • <c <£ <C • <C • CD • • CD • 1- • CD l · - 1- CD • • CD <c 1- CD • CD O < • • 1- • • • CD • • < • 1- π- CD • • <£ • « • CD • The CD • <c ι- CD • • <c CD • <£ • CD • CD CD ο CD • • CD • • 1- • The <C 1- 1- CD • • CD • <C • ç • The i — 1 < 1- 1- r — 1 CD CD 1- <c i — 1 CD • • ι — 1 CD 1- ç H- Ν ' CD • O CD CO CD • CN CD • CN CD • • oo CD • oo <C • CD •

QQ

LO σ>

Li_

(- 2Ξ 1- 2Ξ H- 2Ξ 1- F— H- 1- H- < ΓΝ <c CD <£ on <c CD <£ ΓΝ <C CD Ç ΓΝ <C CD | 1 1 ZC 1 1 1 1 1 1 zsc 1 1 1 N N Ν ' N N ^- N ^- Ν ' N ~ Ν- N Ν ’ N Ν ' N Ν- Ν- Cl. CL CL CL . CL CL CL CL α- CL CL Cl CL O_ α- α- CD CD CD ASS ASS ASS ASS CD ού CD CD ASS CD ASS ού ού N CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CL CL CL CL Ο- CL. CL CL Ο- CL CL CL CL Cl Ο- Ο- l · - V H- 1- Ι— Ι- 1- Π- Ι— 1- 1- Ι- 1- Π- Ι— Ι— CD LU ASS ASS CD ου CD ου CD CD CD ου CD ου CD ass

28/52

Ο CD CD AND- <C O CD · CD 1- O CD CD AND- O CD · <C 1- Μ · CD • • O CD • CD LO AND- « • CM CD • • LO CD <C • CO 1- • <C LO CD • • O • • CD • 1- CD • CD CD • <c CD O O CD • • O • » <C • • O • • 1- • • CD • • • O • • <C • • CD • CD CD CD • AND- CD ç O <c CD AND- O O CD • • • <C 1- • O CD • • CD CD • • CD CD • • <c CD <c 1- CD CD 1- • CD O «< CD • • CD • • CD • • O • <C • • CD • • • CD • • • <£ • CD CD AND- • CD • • AND- CD • • • CD • • • O • • • Π- • AND- • • • <C • • CD • • • <c • <£ • • • <C • • • CD CD <c <£ CD • 1- <C CD O CD CD * O AND- CD Ι- • • Ç • • CD • • CD Ε— • CD • • O • 1- CD CD CD CD • CD CD CD CD O O <£ CD CD • • CD <C • <C • CD <c <c CD CD CD • CD AND- <c CD • CD <c CD • • CD CD CD 1- • CD • CD • <£ AND- Ι- CD AND- l— O • • CD • AND- 1- CD 1- « Ε— CD • CD • CD • • 1- • • <C • • CD • • Ι- • t— • • CD • • • CD • Ε— O O t— CD CD CD CD • 1- CD 1- • O CD 1- <£ • • Π- • Ç • O • • CD • • Ε- • <c • • <c • • • CD * AND- • Ι— CD CD • CD • AND- 1-. • • O 1- • • <£ AND- • • 1- • • CD • CD « 1- CD CD O CD • • CD 1- <C 1- CD 1- • CD 1- O O • CD • CD * • • <C * • 1- • O • • CD • • • 1- • • CD • O • CD CD 1- CD CD CD AND- CD • <C <c Ι- CD • CD O • • AND- • • • CD • • • Ι- • • • CD • • CD • • • CD • • ç Ο Ι- AND- CD • <c • CD CD 1- CD ç ç AND- <C <C • • CD • CD • CD CD • • CD * • CD • O <£ AND- CD CD CD • <£ CD CD - O AND- CD « • 1- • • 1- • Ç CD • • CD • • Ç • CD CD CD AND- <c CD CD CD CD <c <c <c 1- O CD • • AND- • CD • CD CD • • CD • • CD • CD • CD CD CD AND- CD <C <c • CD ç O Ε— AND- • AND- • • CD • • CD • • <C • • • CD • • CD • • CD < • CD • • • CD CD CD <£ CD • • CD <c Ç CD CD CD 1- • AND- • • <Z • <C • • CD • 1- 1- < • <C • CD • « CD 1- <£ • CD 1- I— 1- <C 1- AND- CD O 1- O CD • • CD • • CD • • O • • • CD • • <t • • <C • • 1- • • • τ — 1 CD <£ AND- i — 1 CD O 1- 1- Γ — t CD • • • c — 1 <c • • CD co CD • • 1- • • • O 1- • LO ç • 'rt' CD * • LO CD • • • LO CD • LO <c * AND- ΖΣΖ AND- 2Z AND- sz AND- sz 1- ΣΖ 1- AND- 1- 2Z <C ΓΜ <C CD <C ΓΜ <C CD <c rM <c CD <c ΓΜ <t? LD 1 1 1 72 1 1 1 1 1 1 1 1 M · M- M- • M- Μ ^- M ^ J · CL CL Q_ CL Cl CL Cl CL Cl Cl Cl Cl CL CL CL Cl CD CD CD CD CD LD CD CD O CD CD O O O O O CM CM CM CM CM CM CM CM CM CM CM CM CM CM CM CM Ο- Ο- Ο- Ο- Ο- CL CL Cl Cl Ο- CL CL CL CL CL CL Ι— Ι— Ι— Ι— Ι— 1- AND- 1- and- Π- 1- 1- Ι- Ι- Ι- Ι- CD CD CD CD CD CD CD CD CD Ο CD O Ο Ο Ο Ο

ο

LO cy>

LL_

29/52 yo

O O <x · <c O <· CD CD O 0 CD CD CD O CD • F— 1- 00 CD • • • laugh < • • 0 CD • • • CO <C * • f- CD • • 00 CD • • • cr> O < <C <c ΟΊ CD • • CD • CD F- CD CD • 1- CD 0 CD «X <C 1- O • • • 0 • • » P • • • CD • • • CD • • • < • • • CD • • • CD • • • O • CD • 1- • CD CD O • • 1- CD • 1- CD F— • • < • CD • CD • CD • CD • CD • • • O • • • CD CD CD • F— • O CD O • • • CD 1- • • Ι- • < • • • <C • • • F— • • • Ο • • CD • • • CD • • • CD • • • CD • <C 1- CD CD ç 1- CD F— CD • O F— F— • F— • CD • • O • <C • • • CD • CD <c <c • CD • CD • • • CD • CD <£ CD CD 1- 1- P 0 CD 0 CD * • 1- CD • CD • • < • • F— * • • CD <C • • CD • • <C • • CD <c < CD • • • CD • CD <c CD • • CD • « F— • • O • • F— * • P- • • • <t • • < • CD • CD • • • CD • • • CD ç ç «X • • <C • • • 1- • • • < • • <C • • <£ • ç • • CD • • CD • • CD O CD CD CD 0 O • 1- CD F— 1- CD P CD • CD CD • CD • CD • CD • <c CD • <£ • O CD CD • O P- 1- 1- <C • • • Ç • • • O • • O CD • • • CD • • • <t • • <C • <£ CD » F— CD • F— 1- P CD <C CD O CD CD • CD • • 1- • • 1- • 1- • • CD • • ç • • 0 • • CD • CD « < P- 0 CD CD CD 0 • F— • CD <c Ç <c CD • • 1- • < • • <C • • • 1- • • CD • < • • <X « • • CD <c CD <C CD • CD CD 0 • 1- 1- O 1- F— <£ CD • • • CD • • • 0 • • • CD CD • • • <C • * • CD • • CD CD • • CD CD ç 1- O • • P- O CD <c CD 1- • • CD • CD • • CD • • <c » • Ç • • <D • O <C <c <x CD <c ç CD • • t— P • CD CD CD • <t CD • • 1- • • • P • CD • • CD • • <c • • • 1- • CD * • CD • P F— <D • <c F— 0 CD 1- F— <C CD CD CD 1- • • CD • CD • <£ CD • • • CD • CD • • CD • CD • CD CD CD CD 1- CD CD • • • CD F— • CD • • CD • • • < • • • Ι- • • CD < <£ CD • • • O • • • ο • F— CD • • CD F— <c <c CD F— • 1- CD • • • <C • • • CD • 1- • • • CD • • Ι- • • • < • • O • F— • O • • Ο • CD 1- CD • • CD • O • (- CD • • <£ • • P • • • F— • • • < • <C • • • < • • • <C • • • t — 1 CD 1- F- x — 1 CD CD • CD x — 1 CD • • • r — 1 CD • <£ • oj 1- • 00 1- • • laugh < • O O • • <C • r ^. CD • » F— co <£ • ΟΛ <C F— 2ΣΞ 1- 2Z F— F— 2Z F— 1- F— (- σ: < laugh < CD <£ rxi <C CD <£ laugh ç CD Ç laugh <C CD 1 1 1 1 1 I z 1 1 1 1 1 laughs ^- 'laughs ^- laughs • laughs · * 4- laugh laughs laugh laugh laugh -ri- laugh laugh laughs ^- laughs ^- O_ O- CL O_ O_ O_ 0. Ο- O- Q_ Q_ O_ O_ O_ O_ CL CD CD (D CD CD CD CD Ο O CD CD CD 0 O CD CD create create Cri create create Cri Cri create create create Cri create create Cri Cri create Ο- CL Ο- Cl. Ο- 0. O_ O_ 0- Ο- Ο- CL Ο- CL- CL ο- Ι— Ι- Κ- Ι- Ι— 1- F— Ι- F- Ι— ι— 1- Κ- I- Ι- Ρ- CD Ο Ο Ο CD CD CD ο CD CD CD CD Ο CO ο οο

Q

LO i_i_

Ιγΐ

30/52

CJ CJ • CD <c O CD • • O CD • • • CJ CD • • • CM O • co F— • • • H- • • • CJ 1- • • • CJ <c • O <C • • • r — i CJ • • F— CM <c • • i — 1 O CD CD CD —I <C CD CD • -1 CJ 1- ί- 1- I-1 CD O O <0 Ι- • <£ • ç • • CJ • • Ο • CD • • • CD • • Ι- • • O • I— 1- CD • • • CD • ο F— Ι- CD <c • < • • Ç • • • CJ • Ο • CD • • CJ • • CD • • O • O H F— CD 1- • CD 1- • <C CD 1- <c CJ CD 1- • • F— • • • O • 1- < CJ • ί- 1- CD • • • CD • O CD <C CD • ο: <c CJ » F— • O • O CD • CJ • • • <C • • CD • • CD • <C • • CD • O • <c CD • 1- <c CD CJ CJ 1- <C • CD CD O 1- 1- (- CJ • • • F— • • <C • • <£ • • CJ • • • O • • CD • • CD • • • 1- CD CJ <c CJ • • <c CJ 1- <c F— <C • CD • F— • • 1- • • • CD • • <£ • CD • • <c • • • CD • • CD • • 1- CJ CJ CJ CJ CD • CD CJ • (- <C CD • 1- 1- 1- • • * F— • • CD • O • • • CJ • • • CJ • 1- CD • O • • CD • • 1- CJ • • <c CJ ç <c • CD O O <c 1- • • » CD • « • CJ • • • 1- • • • CJ • Ι- • CD • • • CD • • • CD • • • O CD ο F— CJ • • • (- CD CD <c CD • O co O • • CJ • • • Ι- • • • O • • • CD • • • <£ • • • Ο • F— • <C • • • CD • 1- CJ CJ CD <c <c CJ • CD <c O CD CD 1- CJ • • • CD • CD • • Ι- • • CD • « • CJ <C <c <c CJ • <C <c Ο • • F— • <c • CJ • F— H- CD • F— F— O 1- 1- • • • CJ • • • CJ • • • CD • • CD • • <C • • • CJ • • • CD • • CD CJ CJ CD <c 1- CJ 1- 1- CD • <c <c Ι- • • CJ • • <C • <C • • • Ο 1- • CJ • • <£ • • <C • • • CD CJ ç CJ • F— H- CJ • • • CD • O • CJ ç • ί- • • • Ι- • • CJ Ç • ο: • • * Ο • 1- CJ 1- CD - CJ CD F— F— CD O <C <c F— • (- • • (- • O • « • 1- • • <£ • CD • <C • • • CJ • CD CD CD CJ CJ O <£ CD CD • O F- F— 1- CJ • • Ι- ç • • O • CD • CD • • Ο CD • • 1- • <C • CD • Ç CD CJ CJ O CJ • l · - O « F— 1- CJ • F— • 1- • O • • < » • CD • <c • 1- • CD CJ O CJ CJ F— • CJ • 1- O F— CD CJ • • CD <C • • • <£ • CD • • • F— • • <£ <C • • • CD • O • • <C CJ CD CD • CJ CD CD ç CJ » Π- F— F— CD CD CD CJ • • • F— • • • CJ • • 1- • • • F— • • • CJ • • • -1 CD • • -1 CD • • • r-H CD • CJ O C \ J CJ • • 1- 00 CJ <c 1- 1- O • • CD UJ O • • O 1- • • • O CD • • • -1 CD • • • CT> CJ • • -1 <x • • • -1 CD • • • τ — 1 O • • • F— ZEZ F— 2- F— F— ZZ F— ΖΞ 1- zz F— ΖΞ F— zz Ç r <i <C CD <C ΓΜ Ç CD <C ΓΜ <£ CD <C ΓΜ <C CD 1 1 1 1 I I 1 1 1 1 1 Μ- Μ- M- Μ- Μ- M M- Μ- M- M- M M Μ- M M Μ- α- ου ASS ου ου ASS ass ου ass Q_ ASS Q_ ου Q_ O_ ου O CJ CJ CJ CJ CJ CJ O CJ CJ CJ CJ O O O O CM CM CM CM CM CM CM CM CM CM CM CM CM CM CM CM Ο- ASS Ο- 'ASS ASS ASS ASS ass Ο- ASS Ο- Ο- ass Ο- ASS Ο- Ι— Ι- Ι— Ι- Ι- 1- l · - 1- Π- Ι- Ι— Ι— F— Ι— Ι- Π- CJ Ο O Ο ο CJ O CJ Ο Ο CJ CJ O O Ο Ο

UJ

LO σ>

L_l_

31/52 í-Yr

1201 1260 CTP2CP4NAT ATCGACGAAT ATCCGATTCT CGCTGTCGCC GCCGCCTTCG CGGAAGGGGC GACCGTGATG

CTP2CP4_ZM ..A ..... G ..... T ..... G..C ........ T..G ..... C ....... C..G..C ...

CTP2CP4_AT ........ G. .C..C ........ C..T..T ..T..G..T. .C..G..C .. A..T..A ...

CTP2CP4 GM ..T ..... G. .... A ..... T..C ..... G ..T ..... T. .T..G..C ............

O CD · • <C O CD CD · <C O CD F— F— F— CM CD • • CO CD • • • <c • • CO CD • * CO CD • • • M- CD • • i — I 1- * • CD t — 1 CD • 1- CD "-1 F— • • <c • • CD • • <= c • • <c • • • CD • • • CD • • CD • CD 1- 1- • • • O • F— O CD * • CD • • • F— • • CD • • CD • • CD • 1- CD CD CD • CD CD CD CD 1- CD CD ç_) 1- • • F— • • • <c • • • CD • • CD • • • CD • • • CD CD O CD CD <c <c CD CD CD 1- CD • F— • • CD • CD • CD • • <X • CD O CD CD F— CD CD • CD <c CD • CD 1- • • CD • H- • <C CD • 1- CD • • CD CD CD CD CD Ç CD CD CD CD F— 1- • • • CD • • CD F— • CD • • F— 1- • • <C CD • • • CD • CD <C CD CD • 1- CD • F— • CD • • • CD • • CD • • CD • < <c <C • CD CD F— • • CD • ç CD CD 1- 1- <£ • • < • CD • CD • • CD • • CD • CD CD 1- • CD 1- 1- F— CD <c <c F— CD CD • CD CD • • • CD • • <C F— • F- CD • • • CD • • <c • CD • CD • <c • CD 1- • <£ • • • <C • • • CD CD • CD • • • CD • • • 1- <C CD • • 1- CD CD • CD • 1- F— <C • » <C • • CD • <c • • CD • • CD • C_> CD 1- CD CD * 1- CD • 1- • l · - • • • CD • • Ç • • • CD • • • F— • <C • • • CD • CD <c 1- CD CD CD • CD • CD • • <c • CD • • CD < <c CD • CD • • CD CD CD CD F— • F— CD • CD CD F— • • 1- • 1- • • • CD • 1- 1- CD • CD • F— 1- <C • CD CD CD F— • CD CD 1- <c • • • CD • • CD • CD • • • CD • • CD • <C CD CD CD F— CD CD CD • <C • • <C • • Ç • • CD • <C • • • <c • • CD F— 1- <c CD • <c CD • CD <c F— • • (- • • • CD • • CD • » CD • • CD • • F— • CD <c CD • • <£ CD F— • • CD • • • <C • • <C • • • ι — 1 CD • • • i — I < • • i — 1 CD • • • CO CD • • 1- CM CD CD CD 1- CO 1- • <c <c CM <C • • CO 1- co CD • • r — 1 Ç » • 1-1 CD . ι — 1 CD • • F— l · - F— 2Ξ 1- F— 2- F— 2Z < ΓΜ <c CD <C ΓΜ <C CD <C ΓΜ <C CD s: 1 l 1 22 1 1 1 1 1 M- 'ít M ^- M- M Q_ CL CL CL CL CL Cl CL CL Cl CL CL CD CD CD CD CD CD CD CD CD CD CD CD CM CM CM CM CM CM CM CM CM CM CM CM CL CL CL CL Ο- Ο- CL CL CL CL CL CL 1- I— 1- 1- Ι— Ι— 1- 1- 1- 1- F— 1- CD CD CD CD CD CD CD CD CD CD CD CD

Ll_

LO σ>

L ± _

ο Ο P · · O CJ CD CD μ- ο Ç • CO CD • • • LO CD LO CD • • ι — 1 CD CJ P O t — 1 CD CJ • ç P- • • • 1- • • • CD • • CJ • P- • CD CJ CJ < CD CJ CJ <c CJ • CD • • Ç • CD • • CJ CD CD CD CJ CD <c < 1- • CJ • • CD * • CD • • Ι- CD CD • CD • * Ο • 1- • • CJ • • ç • • - CJ • P P CD • CJ <c ç • • μ- • • ç • CJ • 1- ç CD CD CD • CJ 1- 1- ç • • Ç CD • • CD • • • CD ç CJ CJ CD • • • CJ CD l · - • • • 1- <C <C • CD O CJ <C CJ • l · - co <c • • 1- • • 1- • • • cr> CD • <c CD • • • 1- • • • LO μ- • CJ • P • CD • • i — 1 CJ 1- 1- Ι- • • <£ • • • CJ » • Ο • • CD • CD • • CJ P < P CD ç P 1- 1- CJ CJ CD CD • • • CJ • • CJ • CD * • • CJ • • • CD • • CD • • • CJ • I— • CD • • <C 1- • • • H- « • • <C • • <c • • • μ- • • • <c • • CJ CD CD CD CJ • CD μ- CJ 1- • • • CD • CJ O CJ CD • • • ç • 1- t— <C » CJ CD CD CD CD • CJ 1- l · - CJ • P • • CJ • • • <c • • CJ P • <c • • CD • • CJ • P P CJ CD P CJ 1- 1- I— μ- • • CJ CJ • • ι— • • CD 1- • • CJ • <£ P CJ <C CJ CD CD CD CJ CJ I— • Ι- <C P P <c • • Ο • CD • CD • • • ç • CD 1- • μ- • • <t • • <C • • • CD • • CJ CD CD CD CJ P CJ • • μ- CJ • CJ • • 1- • • CD • • <C • 9 <c • • CJ • <c CJ • P- ç CD • • 1- • • • CJ • • Ç • 9 τ — 1 <c • • • i — 1 CD • • • τ — i <c • 9 CJ • <c CD CJ 1- CJ • • co CD CJ CJ <c CD • • LO <c • • LO CJ • ι — 1 CJ <c x—! CD • • You CD •

ο

LO σ>

Ll.

<c X ç CD <C X <r CD <c X <c CD 1 1 1 1 1 1 1 1 1 ^ 1 " * xt • st xt- xt xt • = 4 · O_ O_ Q_ Q_ Q_ Q_ O_ O_ O- Q_ Q_ O_ CJ CJ CJ O CJ O O O O O O CJ X X X C \ J X X X X X * = 4 · X X Ο- Ο- Ο- Q_ Ο- Ο- α- O_ Ο- Ο- Ο- Ω- Ι— Ι— Ι— P ρ- Ι— μ- Ι- Ι— Ι— Ι— μ- CJ CJ CJ O ο O ο Ο O O O ο

1> M

33/52

• · · ·: ·. ··. ····· • · ..... ····: • · ·. · • · • · • · ·· · s *· * ·. • · . · Í: • · · · · · • · ·

Sphl1765

Fig 6

34/52

·· · ·: ·. ··. ····· ......: ···: ·: ·: ·. · • · • · • · • · · : · ’·. :. * ’·: ... ···· · • ·

EcoRI 325

Fig 7

35/52 • · · · ·

1¼

Fig 8

36/52

. ·· ···! • * · · • · • · · ·: ·. ··. ·· ”· ·: · ’· • • · ; · · • · · The • · · • · · 1 · · ♦ · '·' · «·· ···· »·» *

EcoRI396 indlll 3174 *

Pstl 3172

CP4EPSPS AT

ΡΜΟΝ59302 5413bp

Ncol1992

CTP2 l-0sAct1 P-ü35SL-TaCAB rrnxc

Fig 9

37/52 ···. · Ί ·· »· bYl

EcoRI396

Clal 1049

NcoI1992

HindlH3174 _stl3172

Fig 1 θ

38/52 • ···· í> 5

Hindll 10

BamHI1477

Ncol1491

EcoRI3O96

Fig 11

39/52

Ou

Fig 12

40/52

Hindlll458

Fig 13

41/52 lindlll 13

EcoRl399

BamH11110

Ncol 1424 Sphl 1656 pMON54964 CP4EPSPS_AT 8877bp

EcoRl3020

Sphl 3457

Fig 14

42/52

1 · 8Λ

Fig 15

43/52

Sphl 5629 Nhel 5872

Ncol 6672

Hindlll 8563 Sphl 197

Fig 16

44/52

Sphl 9174 Hindlll 9367

Sphl 3742

Ncol 1891 Sphl 2123

Fig ’17

1¾

45/52

........... • · • · · ·: ····· • · · · • ·: ’: ·. · • · • · • · * • · · • • : * ·: • · · • · · • · * ··· ·· • · • · • Ϊ ····· ·

Sphl 9174 Hindlll 9367

Sphl 3742

Ncol 1891 Sphl 2123

Fig 18 ·· ··. - · ·

46/52

Sphl212 ^Hindl11369

Kpnl1787

EcoRI2657

KpnI2673

BamHI2675

Ncol1050 Sphl1282 Nhel1400

Fig 19 ·· ····

47/52

Iwi

Sphl507

Fig 20

48/52

ΚρηΙ8601

Fig 21> · · ·· ··

49/52

BamHI 8591 Kpnl 8601 EcoRI 8609

Hindlll 2321 Sphl 2486 j— Sphl 1372 Ncol 1596

Fig 22

50/52

Hi

Fig 23 • ·. · ····

51/52

Fig 24

V

Kpnl 159 Hindlll 161 Hindlll193

Sphl5189