BRPI0714552A2 - identification of a type of sucrose synthase and its use in fiber modification - Google Patents
identification of a type of sucrose synthase and its use in fiber modification Download PDFInfo
- Publication number
- BRPI0714552A2 BRPI0714552A2 BRPI0714552-7A BRPI0714552A BRPI0714552A2 BR PI0714552 A2 BRPI0714552 A2 BR PI0714552A2 BR PI0714552 A BRPI0714552 A BR PI0714552A BR PI0714552 A2 BRPI0714552 A2 BR PI0714552A2
- Authority
- BR
- Brazil
- Prior art keywords
- leu
- glu
- val
- lys
- gly
- Prior art date
Links
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
IDENTIFICAÇçO DE UM TIPO DE SACAROSE SINTASE E USO DO MESMO EM MODIFICAÇçO DE FIBRA. A presente invenção refere-se a métodos e meios par modificar características de fibra em uma planta produtora de fibra, tal como algodão, com base no uso de um novo tipo de proteína sacarose sintase ou ácidos nucléicos relacionados.Identification of a type of sucrose synthase and use of same in fiber modification. The present invention relates to methods and means for modifying fiber characteristics in a fiber producing plant such as cotton based on the use of a new type of sucrose synthase protein or related nucleic acids.
Description
Relatório Descritivo da Patente de Invenção para "IDENTIFICA- ÇÃO DE UM TIPO DE SACAROSE SINTASE E USO DO MESMO EM MO- DIFICAÇÃO DE FIBRA".Report of the Invention Patent for "IDENTIFICATION OF A TYPE OF SACAROSIS SYNTASE AND USE OF THE SAME IN FIBER MODIFICATION".
Campo da InvençãoField of the Invention
A invenção refere-se ao campo de agricultura, mais especifica- mente, ao uso de técnicas de biologia molecular para alterar plantas produ- toras de fibra, particularmente plantas de algodão, e/ou acelerar a geração de tais plantas contendo fibra. Métodos e meios são providos para modificar a qualidade da fibra, por exemplo, aumentando ou diminuindo os teores de celulose, resistência da fibra, uniformidade da fibra ou micronaire (índice de diâmetro de fibra). São também providos métodos para identificar marcado- res moleculares associados com tais características em uma população de variedades de algodão e plantas progenitoras relacionadas. Técnica anteriorThe invention relates to the field of agriculture, more specifically, to the use of molecular biology techniques to alter fiber producing plants, particularly cotton plants, and / or to accelerate the generation of such fiber containing plants. Methods and means are provided for modifying fiber quality, for example by increasing or decreasing cellulose contents, fiber strength, fiber uniformity or micronaire (fiber diameter index). Methods are also provided to identify molecular markers associated with such characteristics in a population of cotton varieties and related parent plants. Prior art
Muito da fibra de alta qualidade para a indústria têxtil é provido pelo algodão. Cerca de 90% do algodão cultivado em todo o mundo são Gossypium hirsutum L., enquanto Gossypium barbadense é responsável por cerca de 8%. Consequentemente, a modificação de características de fibras de algodão para melhor satisfazer as necessidades da indústria é um grande esforço em geração através ou de métodos clássicos ou alterando geneti- camente o genoma de plantas de algodão. Objetivos a serem atingidos in- cluem comprimento da fibra de algodão maior, resistência, capacidade de tingimento, produção de fibra com penugem menor, razão de maturidade de fibra, teor de fibra imatura, uniformidade da fibra e "micronaire".Much of the high quality fiber for the textile industry is supplied by cotton. About 90% of cotton grown worldwide is Gossypium hirsutum L., while Gossypium barbadense accounts for about 8%. Therefore, modifying cotton fiber characteristics to better meet industry needs is a major generation effort through either classical methods or genetically altering the genome of cotton plants. Objectives to be achieved include longer cotton fiber length, strength, dyeing capacity, lower down fiber production, fiber maturity ratio, immature fiber content, fiber uniformity, and micronaire.
A US 6472588 e o WO 0117333 provêem métodos para o au- mento da qualidade da fibra de algodão produzida a partir de uma planta de algodão através da transformação com um DNA codificando sacarose fosfa- to sintase. As qualidades da fibra incluem resistência, comprimento, razão de maturidade da fibra, teor de fibra imatura, uniformidade da fibra e "micro- naire".US 6472588 and WO 0117333 provide methods for enhancing the quality of cotton fiber produced from a cotton plant by transformation with a DNA encoding sucrose phosphate synthase. Fiber qualities include strength, length, fiber maturity ratio, immature fiber content, fiber uniformity and micro-grain.
O W09508914 descreve uma planta produtora de fibra compre- endendo em seu genoma um construto genético heterólogo. O construto ge- nético compreende um promotor específico de fibra e uma seqüência de co- dificação codificando uma peroxidase de planta, tal como uma peroxidase de algodão.W09508914 describes a fiber producing plant comprising in its genome a heterologous genetic construct. The genetic construct comprises a fiber specific promoter and a coding sequence encoding a plant peroxidase, such as a cotton peroxidase.
O W09626639 provê métodos por meio dos quais seqüência de codificação de preferência direcionando expressão de gene em tecido de ovário, particularmente bem precoce em desenvolvimento de fruta, são utili- zadas para expressar hormônios de modificação de crescimento de planta em tecido de óvulo de algodão. Os métodos permitem a modificação das características da cápsula de plantas de algodão e provêem um mecanismo para alteração das características de qualidade da fibra tal como dimensão e resistência da fibra.W09626639 provides methods whereby preference coding sequence directing gene expression in ovarian tissue, particularly very early in fruit development, are used to express plant growth modifying hormones in cotton egg tissue. The methods allow modification of the cotton plant capsule characteristics and provide a mechanism for altering fiber quality characteristics such as fiber size and strength.
As US5981834, US5597718, US5620882, US5521708 e US 5495070 todas descrevem um método para engenheirar geneticamente uma planta produtora de fibra e a identificação de clones de cDNA úteis para i- dentificação de genes de fibra em algodão. Os clones de cDNA são úteis no desenvolvimento de clones genômicos correspondentes de plantas produto- ras de fibra para permitir engenharia genética de algodão e outras plantas usando esses genes. Seqüências de codificação desses genes isolados são usadas em orientação de sentido e antissentido para alterar as característi- cas de fibra das plantas produtoras de fibra transgênica.US5981834, US5597718, US5620882, US5521708 and US 5495070 all describe a method for genetically engineering a fiber producing plant and identifying cDNA clones useful for the identification of cotton fiber genes. CDNA clones are useful in developing corresponding genomic clones of fiber producing plants to allow genetic engineering of cotton and other plants using these genes. Coding sequences of these isolated genes are used in sense and antisense orientation to alter the fiber characteristics of transgenic fiber producing plants.
Os pedidos de patente US publicados US2002049999 e US2003074697 ambos descrevem plantas de algodão do gênero Gossypium com características de fibra de algodão aperfeiçoadas. A planta de algodão tem um cassete de expressão contendo um gene codificando uma enzima selecionada do grupo consistindo em endoxiloglucan transferase, catalase e peroxidase de modo que o gene é expresso em células de fibra de algodão para melhorar as características da fibra de algodão.Published US patent applications US2002049999 and US2003074697 both disclose Gossypium cotton plants with improved cotton fiber characteristics. The cotton plant has an expression cassette containing a gene encoding an enzyme selected from the group consisting of endoxyloglucan transferase, catalase and peroxidase so that the gene is expressed in cotton fiber cells to improve the characteristics of the cotton fiber.
A US5880110 produz fibras de algodão com características de fibra aperfeiçoadas através de tratamento com brassinoesteróides.US5880110 produces cotton fibers with improved fiber characteristics through treatment with brassinoids.
O WO 01/40250 provê métodos para aperfeiçoamento da quali- dade da fibra de algodão através da modulação de expressão de gene de fator de transcrição. O WO 96/40924 provê novos construtos de DNA que podem ser usados como sondas moleculares ou inseridos em um hospedeiro de planta para prover modificação de transcrição de uma seqüência de DNA de inte- resse durante vários estágios de desenvolvimento da fibra de algodão. Os construtos de DNA compreendem uma região reguladora de iniciação trans- cripcional de fibra de algodão associada com um gene, que é expresso em fibra de algodão. Também é novo algodão tendo uma fibra de algodão que tem uma cor natural introduzida pela expressão em células de fibra de algo- dão, usando tal construto, de genes de pigmento.WO 01/40250 provides methods for improving the quality of cotton fiber by modulating transcription factor gene expression. WO 96/40924 provides new DNA constructs that can be used as molecular probes or inserted into a plant host to provide transcription modification of a DNA sequence of interest during various stages of cotton fiber development. DNA constructs comprise a cotton fiber transcriptional initiation regulatory region associated with a gene, which is expressed in cotton fiber. It is also new cotton having a cotton fiber that has a natural color introduced by expression in pigment fiber cells using such a construct of pigment genes.
A EP0834566 provê um gene que controla o mecanismo de for- mação de fibra em planta de algodão e que pode ser usado para aperfeiço- amento industrialmente útil.EP0834566 provides a gene that controls the fiber forming mechanism in a cotton plant and which can be used for industrially useful improvement.
Ruan e outros 2003) (The Plant Celll vol. 15, 952-964) descre- vem que supressão de expressão do gene de sacarose sintase reprime ini- ciação de célula, alongamento e desenvolvimento de semente de fibra de algodão. Os resultados descritos aqui provêem evidência direta de que a expressão de Sacarose sintase na epiderme do óvulo é crucial para a inicia- ção e alongamento de fibras de célula única. Os resultados descritos tam- bém permitem concluir que a supressão de Sus no revestimento da semente apenas reduz crescimento de fibra sem afetar o desenvolvimento do embrião e tamanho da semente. O artigo conclui que novas percepções no papel de Sus no controle de desenvolvimento de célula de planta e semente oferecem oportunidades para modificar desenvolvimento de fibra e de semente através de engenharia genética de expressão de Sus.Ruan et al 2003) (The Plant Celll vol. 15, 952-964) report that suppression of sucrose synthase gene expression suppresses cell initiation, elongation and seed development of cotton fiber. The results described here provide direct evidence that sucrose synthase expression in the epidermis of the egg is crucial for initiation and elongation of single cell fibers. The results described also allow us to conclude that suppression of Sus in the seed coat only reduces fiber growth without affecting embryo development and seed size. The paper concludes that new insights into the role of Sus in controlling plant and seed cell development offer opportunities for modifying fiber and seed development through genetic expression of Sus.
O W00245485 descreve métodos e meios para modular quali- dade de fibra em plantas produtoras de fibra, tal como algodão, através da modulação e/ou expressão de atividade de sacarose sintase em tais plantas.W00245485 describes methods and means for modulating fiber quality in fiber producing plants such as cotton by modulating and / or expressing sucrose synthase activity in such plants.
Baud e outros 2004 (Journal of Experimental Botany, Vol. 55, N- 396, pp. 397-409) descrevem a estrutura e perfil de expressão da família multigene de sacarose sintase tendo seis membros em Arabidopsis.Baud et al 2004 (Journal of Experimental Botany, Vol. 55, N-396, pp. 397-409) describe the structure and expression profile of the multigene family of sucrose synthase having six members in Arabidopsis.
Nenhum documento da técnica anterior descreve o reconheci- mento do tipo atualmente reconhecido de sacarose sintase, nem seu envol- vimento em desenvolvimento parede de célula de planta secundária, tal co- mo tipo C em plantas. Este novo tipo de proteína Sus abre a possibilidade para métodos e meios novos para alterar características de fibra de plantas produtoras de fibra tal como algodão, que podem ser combinados mais com qualquer um dos métodos para alterar características de fibra. Tais métodos e meios são descritos nas modalidades e reivindicações descritas a seguir. Sumário da InvençãoNo prior art document describes the recognition of the currently recognized type of sucrose synthase, nor its involvement in developing secondary plant cell wall, such as type C in plants. This new type of Sus protein opens the possibility for new methods and means for altering fiber characteristics of fiber producing plants such as cotton, which may be further combined with any of the methods for altering fiber characteristics. Such methods and means are described in the embodiments and claims described below. Summary of the Invention
Em uma modalidade da invenção, um novo tipo de proteína sa- carose sintase é provido tendo uma seqüência de aminoácido compreen- dendo uma seqüência de aminoácido selecionada de uma seqüência de a - minoácido tendo pelo menos 50% de homologia de seqüência com a se- qüência de aminoácido de qualquer SEQ ID NO.: 2, 3, 5, 7, 9, 11 ou 13; ou uma seqüência de aminoácido compreendendo a seqüência de aminoácido de qualquer SEQ ID NO.: 2, 3, 5, 7, 9, 11 ou 13; uma seqüência de aminoá- cido localizada na parte amino terminal da dita proteína, a dita seqüência de aminoácido compreendendo pelo menos cerca de 60% de identidade de se- qüência com a seqüência de aminoácido da SEQ ID NO.: 16; ou uma se- qüência de aminoácido localizada na parte carbóxi-terminal da dita proteína, a dita seqüência de aminoácido compreendendo pelo menos cerca de 60% de identidade de seqüência com a seqüência de aminoácido da SEQ ID NO. 15. A proteína sacarose sintase pode compreender uma seqüência N- terminal hidrofóbica. Ela pode também compreender qualquer uma das se- qüências de aminoácido que seguem: a seqüência de aminoácido da SEQ ID N0.:3 do aminoácido 383 ao aminoácido 394; a seqüência de aminoácido da SEQ ID N0.:3 do aminoácido 270 ao aminoácido 329; a seqüência de aminoácido da SEQ ID N-:3 do aminoácido 549 ao aminoácido 737; a se- qüência de aminoácido da SEQ ID N9:3 do aminoácido 1 até o aminoácido 545; ou a seqüência de aminoácido da SEQ ID Nõ:3 do aminoácido 18 até o aminoácido 794.In one embodiment of the invention, a new type of protein saccharose synthase is provided having an amino acid sequence comprising an amino acid sequence selected from an amino acid sequence having at least 50% sequence homology to the amino acid sequence. amino acid frequency of any SEQ ID NO .: 2, 3, 5, 7, 9, 11 or 13; or an amino acid sequence comprising the amino acid sequence of any SEQ ID NO .: 2, 3, 5, 7, 9, 11 or 13; an amino acid sequence located at the amino terminal part of said protein, said amino acid sequence comprising at least about 60% sequence identity with the amino acid sequence of SEQ ID NO .: 16; or an amino acid sequence located at the carboxy terminal portion of said protein, said amino acid sequence comprising at least about 60% sequence identity with the amino acid sequence of SEQ ID NO. The protein sucrose synthase may comprise a hydrophobic N-terminal sequence. It may also comprise any of the following amino acid sequences: the amino acid sequence of SEQ ID NO: 3 from amino acid 383 to amino acid 394; the amino acid sequence of SEQ ID NO: 3 from amino acid 270 to amino acid 329; the amino acid sequence of SEQ ID NO: 3 from amino acid 549 to amino acid 737; the amino acid sequence of SEQ ID NO: 3 from amino acid 1 to amino acid 545; or the amino acid sequence of SEQ ID NO: 3 from amino acid 18 to amino acid 794.
Em outra modalidade da invenção, um anticorpo reconhecendo a proteína sacarose sintase nova isolada é provido.In another embodiment of the invention, an antibody recognizing isolated new sucrose synthase protein is provided.
Em ainda outra modalidade, a invenção provê uma molécula de DNA ou ácido nucléico isolado codificando a proteína sacarose sintase nova, tal como ácido nucléico compreendendo a seqüência de nucleotídeo de qualquer SEQ ID N9: 1, 4, 6, 8, 10, 12 ou 14.In yet another embodiment, the invention provides an isolated DNA or nucleic acid molecule encoding the new sucrose synthase protein, such as nucleic acid comprising the nucleotide sequence of any SEQ ID N9: 1, 4, 6, 8, 10, 12 or 14
A invenção provê ainda um cassete de expressão compreen- dendo as moléculas de DNA operavelmente ligado que seguem: um promo- tor expressável em planta tal como um promotor expressável em planta con- trolando transcrição de preferência em células de fibra de uma planta produ- tora de fibra; um DNA codificando as novas proteínas sacarose sintase pro- vidas; e opcionalmente uma região de terminação de transcrição e poliadeni- lação.The invention further provides an expression cassette comprising the operably linked DNA molecules that follow: a plant expressible promoter such as a plant expressible promoter preferably controlling transcription in fiber cells of a producing plant fiber; a DNA encoding the new sucrose synthase proteins produced; and optionally a transcription termination region and polyadenylation.
A invenção também provê um cassete de expressão compreen- dendo as moléculas de DNA operavelmente ligado: um promotor expressá- vel em planta um DNA que quando transcrito resulta em uma molécula de RNA a dita molécula de RNA compreendendo ou uma seqüência de nucleo- tídeo de pelo menos 19 nucleotídeos consecutivos tendo pelo menos cerca de 94% de identidade de seqüência com a seqüência de nucleotídeo de um gene codificando isoforma C de sacarose sintase endógena ou com o com- plemento da dita seqüência de nucleotídeo de um gene codificando isoforma C de sacarose sintase endógena ou uma seqüência de ácido nucléico de pelo menos 19 nucleotídeos consecutivos tendo pelo menos cerca de 94% de identidade de seqüência com uma seqüência de nucleotídeo de sacarose sintases de isoforma tipo C novas; e opcionalmente uma região de termina- ção de transcrição e poliadenilação.The invention also provides an expression cassette comprising operably linked DNA molecules: a plant expressible promoter a DNA which when transcribed results in an RNA molecule said RNA molecule comprising or a nucleotide sequence of at least 19 consecutive nucleotides having at least about 94% sequence identity with the nucleotide sequence of a gene encoding endogenous sucrose synthase C isoform or complementing said nucleotide sequence of a gene encoding sucrose C isoform endogenous synthase or nucleic acid sequence of at least 19 consecutive nucleotides having at least about 94% sequence identity with a new type C isoform sucrose synthase nucleotide sequence; and optionally a transcription and polyadenylation termination region.
Outra modalidade da invenção é um cassete de expressão com- preendendo as moléculas de DNA operavelmente ligado que seguem: um promotor expressável de planta, de preferência um promotor expressável em planta que controla transcrição de preferência nas células de fibra; um DNA, que quando transcrito dá uma molécula de RNA de filamento duplo capaz de reduzir a expressão de um gene SusC endógeno para a planta produtora de fibra, e a molécula de RNA compreendendo uma primeira e uma segunda região de RNA onde a primeira região de RNA compreende uma seqüência de nucleotídeo de pelo menos 19 nucleotídeos consecutivos tendo pelo me- nos cerca de 94% de identidade de seqüência com a seqüência de nucleotí- deo de um gene SusC endógeno ou com a seqüência de nucleotídeo das seqüências de nucleotídeo providas codificando proteínas SusC; a segunda região de RNA compreendendo uma seqüência de nucleotídeo complemen- tar aos pelo menos 19 nucleotídeos consecutivos da primeira região de RNA; as primeira e segunda regiões de RNA são capazes de emparelha- mento de base para formar uma molécula de RNA de filamento duplo entre pelo menos os 19 nucleotídeos consecutivos das primeira e segunda regi- ões; e opcionalmente uma região de extremidade 3' compreendendo sinais de terminação e poliadenilação de transcrição funcionando em células da planta.Another embodiment of the invention is an expression cassette comprising the operably linked DNA molecules that follow: a plant expressible promoter, preferably a plant expressible promoter that preferably controls transcription in fiber cells; a DNA, which when transcribed gives a double stranded RNA molecule capable of reducing the expression of an endogenous SusC gene to the fiber producing plant, and the RNA molecule comprising a first and a second region of RNA where the first region of RNA comprises a nucleotide sequence of at least 19 consecutive nucleotides having at least about 94% sequence identity with the nucleotide sequence of an endogenous SusC gene or nucleotide sequence of the provided nucleotide sequences encoding proteins. SusC; the second RNA region comprising a nucleotide sequence complementary to at least 19 consecutive nucleotides from the first RNA region; the first and second RNA regions are capable of base pairing to form a double stranded RNA molecule between at least 19 consecutive nucleotides of the first and second regions; and optionally a 3 'end region comprising transcription termination and polyadenylation signals functioning in plant cells.
A invenção provê a inda uma célula de planta compreendendo um promotor expressável em planta heterólogo operavelmente ligado a uma molécula de DNA selecionada das moléculas de DNA que seguem: um DNA codificando uma sacarose sintase nova da isoforma C ou codificando um RNA inibidor para uma gene codificando SusC. A célula pode ser uma célula de uma planta produtora de fibra tal como algodão. Também compreendidas pela invenção são plantas, sementes ou partes de planta ou tecidos que compreendem ou consistem essencialmente em tais células de planta bem como fibras produzidas por tais plantas.The invention further provides a plant cell comprising a heterologous plant expressible promoter operably linked to a DNA molecule selected from the following DNA molecules: a DNA encoding a new isoform C sucrose synthase or encoding an inhibitor RNA for a gene encoding SusC. The cell may be a cell of a fiber producing plant such as cotton. Also encompassed by the invention are plants, seeds or plant parts or tissues which comprise or consist essentially of such plant cells as well as fibers produced by such plants.
Em ainda outra modalidade da invenção, um método para modi- ficação das características de fibra de uma planta produtora de fibra é provi- do, o dito método compreendendo modificação, tal como aumento ou dimi- nuição do nível funcional de uma isoforma C de sacarose sintase ou uma sacarose sintase com características similares em células ou paredes de célula ou fluido apoplástico da dita planta. Isto pode ser convenientemente conseguido através da provisão das plantas com os cassetes de expressão descritos aqui.In still another embodiment of the invention, a method for modifying the fiber characteristics of a fiber producing plant is provided, said method comprising modification, such as increasing or decreasing the functional level of a sucrose C isoform. synthase or a sucrose synthase having similar characteristics in cells or cell walls or apoplastic fluid of said plant. This can conveniently be accomplished by providing the plants with the expression cassettes described herein.
A invenção também compreende uso de uma nova proteína tipo SusC ou ácido nucléico de codificação para modificar as características de uma fibra em uma planta produtora de fibra. Breve descrição das figuras e desenhos Figura 1. Gráficos de hidrofobicidade de proteínas SusC e prote- ínas sacarose similares. A. Gráfico para SusC de uma variedade Gossypium hirsutum; B: Gráfico para SusC de uma variedade Gossypium barbadense\ C: Gráfico para SusC de Gossypium arboretum; D: Gráfico para SusC de Gossypium raymondír, E: Gráfico para SusC de Gossypium Iongicalyx subti- po 1; F: Gráfico para SusC de Gossypium Iongicalyx subtipo 2; G: Gráfico para sacarose sintase de Vigna radiata; H: Gráfico para sacarose sintase de Eucalyptus grandis; I: Gráfico para isoformas de sacarose sintase 6 de Ara- bidopsis thaliana-, K: Gráfico de sacarose sintase de Populus tremuloides. Todos os gráficos de hidrofobicidade descrevem um terminal N hidrofóbico para sacarose sintases diferentes.The invention also comprises use of a novel SusC-like protein or coding nucleic acid to modify the characteristics of a fiber in a fiber producing plant. Brief Description of the Figures and Drawings Figure 1. Hydrophobicity graphs of SusC proteins and similar sucrose proteins. A. Chart for SusC of a variety Gossypium hirsutum; B: Chart for SusC of a variety Gossypium barbadense \ C: Chart for SusC of Gossypium arboretum; D: Gossypium raymondir SusC Chart; E: Gossypium Iongicalyx SusC Chart Subtype 1; F: Graph for SusC of Gossypium Iongicalyx subtype 2; G: Graph for sucrose synthase of Vigna radiata; H: Graph for Eucalyptus grandis sucrose synthase; I: Graph for sucrose synthase 6 isoforms from Ara-bidopsis thaliana-, K: Graph for sucrose synthase from Populus tremuloides. All hydrophobicity charts depict a hydrophobic N-terminus for different sucrose synthases.
Figura 2. Análise de expressão do gene SusC em tecido de fibra de algodão através de RT-PCR. Dpa: dias pós-anestesia; gDNA: DNA ge- nômico de controle. Faixas 1 e 8: marcador de 1 kb. Figura 3: íntrons e exons em genes codificando sacarose sintaseFigure 2. Analysis of SusC gene expression in cotton fiber tissue by RT-PCR. Dpa: days after anesthesia; gDNA: control genomic DNA. Tracks 1 and 8: 1 kb marker. Figure 3: Introns and exons in genes encoding sucrose synthase
de algodão. A distribuição de íntron éxon é esquematicamente representada para sacarose sintase tipo C e comparada com sacarose sintase tipos BeA de algodão. Caixas pretas representam os exons.cotton The intron exon distribution is schematically represented for sucrose synthase type C and compared to sucrose synthase types BeA cotton. Black boxes represent exons.
Figura 4. Análise de estrutura 2D da parte N-terminal de genes de sacarose sintase de algodão. Análise de estrutura analítica 2D (HCA + JPRED) da parte N-terminal do gene de susyC comparado com as outras sacarose sintases de Gossypium hirsutum. A seta mostra a influência da região específica de susyC na estrutura 2d circundante. (P) = sítios de fosfo- rilação putativos.Figure 4. 2D structure analysis of the N-terminal part of cotton sucrose synthase genes. 2D analytical structure analysis (HCA + JPRED) of the N-terminal part of the susyC gene compared to the other sucrose synthases of Gossypium hirsutum. The arrow shows the influence of the specific susyC region on the surrounding 2d structure. (P) = putative phosphorylation sites.
Figura 5: Análise de estrutura 2D da parte C-terminal de genesFigure 5: 2D structure analysis of the C-terminal part of genes.
de sacarose sintase de algodão. A estrutura de folha beta parece estar au- sente na estrutura susyC 2D.sucrose synthase cotton. The beta sheet structure appears to be missing from the susyC 2D structure.
Figura 6: Análise fenotípica de plantas de Arabidopsis transgêni- cas. Troncos fasciados, bifurcados, espessos são mostrados em 2 linhagens transgênicas independentes carregando genes quiméricos de CaMV35S- SusC (painéis A e B). Os painéis CeD mostram um fenótipo tricoma típico na geração T2 de genes quiméricos de CaMV 35S-SusC e S2A-SusC. O painel C é uma fotomontagem tridimensional.Figure 6: Phenotypic analysis of transgenic Arabidopsis plants. Fasciated, bifurcated, thick trunks are shown in 2 independent transgenic lineages carrying CaMV35S-SusC chimeric genes (panels A and B). CeD panels show a typical trichome phenotype in the T2 generation of CaMV 35S-SusC and S2A-SusC chimeric genes. Panel C is a three-dimensional photomontage.
Figura 7: Western blot de mudas de 12 com transgene de CaMV 35S-SusC usando anticorpo específico de SUS C. Faixa A: plantas com fe- nótipo leve; Faixa B: planta com fenótipo severo; Faixa C: planta com fenóti- po leve; Faixa D: planta com fenótipo severo; WT: planta do tipo selvagem. Descrição detalhadaFigure 7: Western blot of CaMV 35S-SusC transgene seedlings of 12 using SUS C-specific antibody. Lane A: light phenotype plants; Lane B: plant with severe phenotype; Lane C: plant with mild phenotype; Lane D: plant with severe phenotype; WT: wild type plant. Detailed Description
A presente invenção é baseada na identificação de uma nova isoforma de sacarose sintase (chamada SusC ou SuSyC) de fibras cujo perfil de expressão no nível de RNA é intimamente relacionado com o início de fase de desenvolvimento de parede de célula secundária em plantas produ- toras de fibra tal como algodão. A proteína é abundante durante a formação da parede de célula secundária e está localizada predominantemente no apoplasto. Sua localização, abundância e perfil de expressão indicam que este novo tipo de isoforma de sacarose sintase é a principal isoforma pre- sente na parede celular no estágio de síntese de parede de célula secundá- ria. Sem pretender limitar a invenção a um modo de ação particular, é pen- sado que esta isoforma de sacarose sintase está envolvida em seqüestro de açúcares do fluido apoplástico circundando a fibra e incorporação dos mes- mos como UDPGIicose em celulose e/ou calose. Deste modo, em uma primeira modalidade, a invenção provê umThe present invention is based on the identification of a novel sucrose synthase isoform (called SusC or SuSyC) of fibers whose RNA level expression profile is closely related to the onset of secondary cell wall development phase in producing plants. of fiber such as cotton. The protein is abundant during secondary cell wall formation and is predominantly located in the apoplast. Its localization, abundance and expression profile indicate that this new type of sucrose synthase isoform is the main isoform present in the cell wall at the secondary cell wall synthesis stage. Without wishing to limit the invention to a particular mode of action, it is thought that this sucrose synthase isoform is involved in sequestering sugars from the apoplastic fluid surrounding the fiber and incorporating them as UDPGlycosis into cellulose and / or callose. Thus, in a first embodiment, the invention provides a
método para modificação das características da parede de célula em uma célula de planta caracterizado pelo fato de que o nível funcional desta nova isoforma de sacarose sintase (SusC) ou sacarose sintases com característi- cas similares é modificado. O nível funcional da isoforma tipo C de sacarose sintase ou sa-A method for modifying cell wall characteristics in a plant cell characterized by the fact that the functional level of this new sucrose synthase (SusC) or sucrose synthase isoform with similar characteristics is modified. The functional level of sucrose synthase type C or isoform
carose sintases com características similares pode ser modificado através da expressão aumentada de tal isoforma na célula de planta. Isto pode ser convenientemente conseguido através da introdução de um construto de expressão compreendendo os ácidos nucléicos operavelmente ligados que seguem, por exemplo, moléculas de DNA:Carose synthases with similar characteristics can be modified by increased expression of such isoform in the plant cell. This can conveniently be accomplished by introducing an expression construct comprising operably linked nucleic acids that follow, for example, DNA molecules:
a) um promotor expressável de plantaa) an expressible plant promoter
b) um ácido nucléico codificando uma isoforma C de sacarose sin- tase ou uma sacarose sintase com características similares; e opcionalmen- te(b) a nucleic acid encoding a sucrose synthase isoform C or a sucrose synthase having similar characteristics; and optionally
c) uma região de terminação e poliadenilação de transcrição.c) a transcription termination and polyadenylation region.
Conforme usado aqui, "sacarose sintase" refere-se a uma enzima que é ca- paz de catálise da síntese de sacarose a partir de NDP-glicose (tal como uridina difosfato glicose) e D-frutose. A enzima pode também catalisar a hi- drólise de sacarose em glicose e frutose. A enzima é classificada como EC 2.4.1.13. Sinônimos para sacarose sintase são glicosiltransferase, uridina difosfoglicose-frutose, sacarose sintetase, sacarose-UDP glicosiltransferase, sacarose-uridina difosfato glicosiltransferase, Sus, SuSy, UDP-glicose- frutose glicostiltransferase, UDP-glicose:D-frutose 2-alfa-D- glicosiltransferase e uridina difosfoglicose-frutose glicosiltransferase.As used herein, "sucrose synthase" refers to an enzyme that is capable of catalyzing the synthesis of sucrose from NDP-glucose (such as uridine diphosphate glucose) and D-fructose. The enzyme may also catalyze sucrose hydrolysis into glucose and fructose. The enzyme is classified as EC 2.4.1.13. Synonyms for sucrose synthase are glycosyltransferase, uridine diphosphoglycosis fructose, sucrose synthetase, sucrose-UDP glycosyltransferase, sucrose uridine diphosphate glycosyltransferase, Sus, SuSy, UDP-glucose-fructose glycosyltransferase, UDP-glucose 2-D: alpha glycosyltransferase and uridine diphosphoglycosis-fructose glycosyltransferase.
Vários genes de plantas codificando enzimas sacarose sintase foram descritos. O W002/45485 (páginas 16 a 20) provê uma lista extensa de números de acesso no Genbank para seqüência de nucleotídeo codifi- cando genes de sacarose sintase de planta, partes da mesma ou seqüên- cias de nucleotídeo tendo similaridade de seqüência com genes de sacarose sintase.Several plant genes encoding sucrose synthase enzymes have been described. W002 / 45485 (pages 16 to 20) provides an extensive list of Genbank accession numbers for nucleotide sequences encoding plant sucrose synthase genes, parts thereof or nucleotide sequences having sequence similarity to nucleotide genes. sucrose synthase.
Ensaios enzimáticos para atividade de sacarose sintase foram descritos (vi- de, por exemplo, Counce and Gravois, 2006, Crop Science, Vol. 46: pp. 1501-1507, particularmente páginas 1502 e 1503; Anthon e Emerich, 1990, Vol. 92: pp. 346-351, particularmente página 347; ambos documentos aqui incorporados a título de referência).Enzymatic assays for sucrose synthase activity have been described (see, for example, Counce and Gravois, 2006, Crop Science, Vol. 46: pp. 1501-1507, particularly pages 1502 and 1503; Anthon and Emerich, 1990, Vol. 92: pp. 346-351, particularly page 347; both documents incorporated herein by reference).
A "isoforma C de sacarose sintase" ou "SusC" ou SuSyC" é ca- racterizada pela presença de uma seqüência de aminoácido N-terminal hi- drofóbica (Vide Figura 1). Os 44 primeiros aminoácidos formam uma região hidrofóbica, particularmente resíduos 26 a 44 da, por exemplo, SEQ ID NO.:3. Outras enzimas sacarose sintase de planta caracterizadas por tal se- qüência de aminoácido N-terminal hidrofóbica são sacarose sintase 6 de Arabidopsis thaliana (Número de acesso At1g73370); a sacarose sintase de feijão mungbean (Número de acesso D10266), a sacarose sintase de Eu- calyptus spp. (Número de acesso DD014303) e a sacarose sintase de chou- po (Número de acesso AY341026) (todas as seqüências aqui incorporadas a título de referência).The "sucrose synthase isoform C" or "SusC" or SuSyC "is characterized by the presence of a hydrophobic N-terminal amino acid sequence (See Figure 1). The first 44 amino acids form a hydrophobic region, particularly residues 26. at 44 da, for example, SEQ ID NO.:3 Other plant sucrose synthase enzymes characterized by such hydrophobic N-terminal amino acid sequence are Arabidopsis thaliana sucrose synthase 6 (Accession number At1g73370); mungbean bean (Accession number D10266), Sucrose synthase from Eucalyptus spp. (Accession number DD014303) and Sucrose synthase (Accession number AY341026) (all sequences incorporated herein by reference).
Alinhamento das seqüências de aminoácido de SusC diferentes e comparação com outras sacarose sintases de plantas desvrevem a pre- sença de extensões de 20-25 aminoácidos consecutivos localizados no ter- minal N (SEQ ID NO:16) ou terminal C (SEQ ID NO:15) respectivamente, que são conservadas dentro das seqüências de aminoácido de SusC e au- sentes das outras sacarose sintases de planta. A seqüência N-terminal (HKSQKLLSVLDKEAGNQALDGMVV; SEQ ID Nq 16) está geralmente loca- lizada entre a posição de aminoácido 36 e posição de aminoácido 59, en- quanto a seqüência C-terminal (AYQEQRGRKRYIEMLHAWMYNNRVKT; SEQ ID Nq 15) está geralmente localizada entre as posições de aminoácido 765 e 790.Alignment of different SusC amino acid sequences and comparison with other plant sucrose synthases disclose the presence of 20-25 consecutive amino acid extensions located at the N (SEQ ID NO: 16) or C terminal (SEQ ID NO: 15) respectively, which are conserved within the amino acid sequences of SusC and absent from the other plant sucrose synthases. The N-terminal sequence (HKSQKLLSVLDKEAGNQALDGMVV; SEQ ID No. 16) is generally located between amino acid position 36 and amino acid position 59, while the C-terminal sequence (AYQEQRGRKRYIEMLHAWMYNNRVKT; SEQ ID Nq 15) is generally located between. amino acid positions 765 and 790.
Proteínas sacarose sintase foram também descritas possuir uma região de ligação de actina putativa (Winter H e outros, 1998, FEBS Ietters 430, 205- 208; Winter, H. e Huber S.C., 2000 Criticai Reviews in Plant Sciences 19(1), 31-67) que em isoformas de SusC tem a seqüência de aminoácido consenso que segue: KDVAAE[V/I]TKEFQ (SEQ ID N53 da posição de aminoácido 383 para a posição de aminoácido 394). O resíduo Iisina (K) geralmente encon- trado na posição 383 e o resíduo fenilalanina (F)1 geralmente encontrado na posição 393, são também indicativos de uma proteína tipo SusC.Sucrose synthase proteins have also been reported to have a putative actin binding region (Winter H et al, 1998, FEBS Ietters 430, 205-208; Winter, H. and Huber SC, 2000 Critical Reviews in Plant Sciences 19 (1), 31 -67) which in SusC isoforms has the following consensus amino acid sequence: KDVAAE [V / I] TKEFQ (SEQ ID NO: 53 from amino acid position 383 to amino acid position 394). The lysine residue (K) generally found at position 383 and the phenylalanine residue (F) 1 generally found at position 393 are also indicative of a SusC-like protein.
Proteínas sacarose sintase foram também descritas possuir uma região de ligação de Uridina putativa que em isoformas de SusC tem a se- qüência de aminoácido que segue VVIMTPHGYFAQDNVLGYPDTGGQVVYLLDQVRALEEELLHRFKLQGLDIT PRILVITRL (SEQ ID Nq 3 da posição de aminoácido 270 até a posição de aminoácido 329).Sucrose synthase proteins have also been described to have a putative Uridine binding region which in SusC isoforms has the amino acid sequence that follows VVIMTPHGYFAQDNVLGYPDTGGQVVYLLDQVRALEEELLHRFKLQGLDIT PRILVITRL to amino acid position SEQ ID 3 at position 270
Usando pesquisas baseadas em computador os domínios ou assinaturas que seguem puderam ser identificados na seqüência de amino- ácido das isoformas de SusC. Esses domínios ou assinaturas são comparti- lhados com outras sacarose sintases de plantas.Using computer-based searches the following domains or signatures could be identified in the amino acid sequence of the SusC isoforms. These domains or signatures are shared with other plant sucrose synthases.
Glicosil transferase, domínio grupo 1: (1PR001296; PF00534) Proteínas contendo este domínio transferem açúcares ligados a UDP1 ADP, GDP ou CMP para uma variedade de substratos, incluindo glico- gênio, frutose-6-fosfato e lipopolissacarídeos. As enzimas bacterianas estão envolvidas em vários processos biossintéticos que incluem biossíntese de exopolissacarídeo, biossíntese de núcleo de lipopolissacarídeo e a biossín- tese do polissacarídeo da saliva ácido colânico. Este domínio corresponde à seqüência de aminoácido da SEQ ID Ne:3, por exemplo, das posições de aminoácido 549 a 737.Glycosyl Transferase, Domain Group 1: (1PR001296; PF00534) Proteins containing this domain transfer UDP1 ADP, GDP, or CMP-linked sugars to a variety of substrates, including glycogen, fructose-6-phosphate, and lipopolysaccharides. Bacterial enzymes are involved in various biosynthetic processes that include exopolysaccharide biosynthesis, lipopolysaccharide core biosynthesis, and polysaccharide biosynthesis of cholanic acid saliva. This domain corresponds to the amino acid sequence of SEQ ID Ne: 3, for example, amino acid positions 549 to 737.
Assinatura da família de sacarose sintase (IPR000368; PF00862): Esta assi- natura caracteriza uma família incluindo a maior parte das proteínas sacaro- se sintase. No entanto, a região carboxila terminal da sacarose sintase per- tence à família glicosil transferase. Esta enzima é encontrada principalmente em plantas, mas também aparece em bactérias. Este domínio corresponde à seqüência de aminoácido de SEQ ID NQ:3, por exemplo, do aminoácido po- sições 1 a 545.Sucrose Synthase Family Signature (IPR000368; PF00862): This signature characterizes a family including most sucrose synthase proteins. However, the terminal carboxyl region of sucrose synthase belongs to the glycosyl transferase family. This enzyme is mainly found in plants, but also appears in bacteria. This domain corresponds to the amino acid sequence of SEQ ID NQ: 3, for example, amino acid positions 1 to 545.
Assinatura da família sacarose sintase, plantas e cianobactérias (I- PR012820): Esta assinatura representa sacarose sintases, uma enzima que, apesar de seu nome, usos geralmente dificilmente produzem sacarose. Sa- carose mais UDP (ou ADP) se torna D-frutose mais UDP-glicose (ou ADP- glicose), que é então disponível para biossíntese de parede celular (ou ami- do). A enzima é homóloga à sacarose fosfato sintase, que catalisa a penúl- tima etapa em síntese de sacarose. Sacarose sintase é encontrada, até ago- ra, exclusivamente em plantas e cianobactérias. Este domínio corresponde à seqüência de aminoácido da SEQ ID NQ: 3, por exemplo, das posições de aminoácido 18 a 794.Signature of the sucrose synthase family, plants and cyanobacteria (I-PR012820): This signature represents sucrose synthases, an enzyme that, despite its name, uses generally rarely produce sucrose. Saccharose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyzes the penultimate step in sucrose synthesis. Sucrose synthase is found until now exclusively in plants and cyanobacteria. This domain corresponds to the amino acid sequence of SEQ ID NQ: 3, for example, from amino acid positions 18 to 794.
Esses domínios podem facilmente ser reconhecidos por pesqui- sas baseadas em computador usando, por exemplo, perfis PROSITE (PRO- SITE está disponível em www.expansv.ch/prosite). Alternativamente, o ban- co de dados e algoritmo BLOCKS (blocks.fhcrc.org) podem ser usados para identificar esses domínios. Outros bancos de dados e algoritmos estão tam- bém disponíveis (pFAM: http://www.sanger.ac.uk/SOftware/Pfam/ INTER- PRO: http://www.ebi.ac.uk/interpro/; os números PF citados acima referem- se a banco de dados e algoritmo pFAM e números IPR ao banco de dados e algoritmo INTERPO).These domains can easily be recognized by computer-based searches using, for example, PROSITE profiles (PROSITE is available at www.expansv.ch/prosite). Alternatively, the BLOCKS database and algorithm (blocks.fhcrc.org) can be used to identify these domains. Other databases and algorithms are also available (pFAM: http://www.sanger.ac.uk/SOftware/Pfam/ INTERPRO: http://www.ebi.ac.uk/interpro/; PF numbers quoted above refer to database and pFAM algorithm and IPR numbers to database and INTERPO algorithm).
A presente invenção prove várias seqüências de aminoácido e seqüências de nucleotídeo variantes (cDNA bem como DNA genômico) para isoformas de SUsC isoladas de cultivares de Gossypium hirsutum (SEQ ID NOs.: 3-4), cv de Gossypium barbadense (SEQ ID NOs.: 5 e 6), Gossypium arboretum (SEQ ID NOs.: 7 e 8), Gossypium raimondii (SEQ ID NOs.: 9 e 10) e Gossypium Iongicaiyx (SEQ ID NOs.: 11 e 14).The present invention provides various amino acid sequences and variant nucleotide sequences (cDNA as well as genomic DNA) for SUsC isoforms isolated from Gossypium hirsutum cultivars (SEQ ID NOs .: 3-4), Gossypium barbadense cv (SEQ ID NOs. : 5 and 6), Gossypium arboretum (SEQ ID NOs: 7 and 8), Gossypium raimondii (SEQ ID NOs: 9 and 10) and Gossypium Iongicaiyx (SEQ ID NOs: 11 and 14).
Outras seqüências de nucleotídeo variantes podem ser obtidasOther variant nucleotide sequences may be obtained
de outras cultivares de Gossypium hirsutum ou Gossypium barbadense ou de plantas progenitoras de algodão tal como Gossypium arboretum, Gossy- pium herbaceum e Gossypium raimondii e Gossypium Iongicalyx ou outras espécies Gossypium, especialmente espécies Gossypium compreendendo um genoma tipo D, tal como Gossypium aridum, Gossypium davidsonii, Gossypium gossypioides, Gossypium klotzschianum, Gossypium turberi, Gossypium tríiobum, Gossypium turneri, Gossypium mustelinum, Gossypium tomentosum, Gossypium darwinii. Seqüências de aminoácido ou seqüências de nucleotídeo variantes incluem modificações de uma seqüência através da adição, deleção ou substituição de aminoácidos ou nucleotídeos, respecti- vamente. Amplificacao por PCR de ácidos nucléicos condificando SusC é possível a partir de outras espécies Gossypium compreendendo genoma tipo D. Tais ácidos nucléicos amplificados por PCR parecem compreenderof other cultivars of Gossypium hirsutum or Gossypium barbadense or parent cotton plants such as Gossypium arboretum, Gossypium herbaceum and Gossypium raimondii and Gossypium Iongicalyx or other Gossypium species, especially Gossypium species comprising a D-type genome such as Gossypium arid davidsonii, Gossypium gossypioides, Gossypium klotzschianum, Gossypium turberi, Gossypium triobum, Gossypium turneri, Gossypium mustelinum, Gossypium tomentosum, Gossypium darwinii. Variant amino acid sequences or nucleotide sequences include modifications of a sequence by the addition, deletion or substitution of amino acids or nucleotides, respectively. PCR amplification of SusC-conducting nucleic acids is possible from other Gossypium species comprising type D genome. Such PCR amplified nucleic acids appear to comprise
um sítio de restrição de EcoRI característico para ácidos nucléicos codifi- cando Sus tipo C.a characteristic EcoRI restriction site for nucleic acids encoding Sus type C.
Variantes da isoforma C de sacarose sintase podem ser encontradas em hibridização estringente usando a seqüência de nucleotídeo de qualquer SEQ ID N2: 1, 4, 6, 8, 10, 12 ou 14 ou parte da mesma compreendendo pelo menos cerca de 25 ou 50 nucleotídeos consecutivos da SEQ ID N9: 1, 4, 6, 8, 10, 12 ou 14 ou as seqüências de nucleotídeo complementares da mes- ma, como uma sonda. Uma sonda útil particular pode ser a seqüência de nucleotídeo codificando a seqüência N-terminal ou C-terminal de SEQ ID Nos.: 15 e 16, tal como as seqüências de nucleotídeo de qualquer uma da SEQ ID NQ: 1, 4, 6, 8, 10, 12 ou 14 codificando a seqüência de aminoácido da SEQ ID NQ: 15 ou 16.Sucrose synthase C isoform variants can be found in stringent hybridization using the nucleotide sequence of any SEQ ID NO: 1, 4, 6, 8, 10, 12, or 14 thereof or comprising at least about 25 or 50 nucleotides. of SEQ ID NO: 1, 4, 6, 8, 10, 12 or 14 or the complementary nucleotide sequences thereof, as a probe. A particular useful probe may be the nucleotide sequence encoding the N-terminal or C-terminal sequence of SEQ ID Nos: 15 and 16, such as the nucleotide sequences of any one of SEQ ID NQ: 1, 4, 6, 8, 10, 12 or 14 encoding the amino acid sequence of SEQ ID NO: 15 or 16.
"Condições de hibridização estringentes" conforme aqui usado significa que hibridização vai geralmente acontecer se houver pelo menos 95% e de preferência pelo menos 97% de identidade de seqüência entre a sonda e a sequência-alvo. Exemplos de condições de hibridização estringen- tes são incubação da noite para o dia em uma solução compreendendo for- mamida 50%, 5 χ SSC (NaC1150 mM, citrato de trissódio 15 mM), fosfato de sódio 50 mM (pH 7,6), 5x solução de Denhardt, dextran sulfato 10%, e 20 MiTVhiI de DNA carreador cisalhado, desnaturado tal como DNA de esperma de salmão, seguido por lavagem do apoio de hibridização em 0,1 χ SSC em aproximadamente 65e C, de preferência por cerca de 10 minutos. Outras condições de hibridização e lavagem são bem conhecidas e são exemplifi- cadas em Sambrook e outros, Molecular Cloning: A Laboratory Manual, Se- gunda Edição, Cold Spring Harbor, NY (1989), particularmente capítulo 11."Stringent hybridization conditions" as used herein means that hybridization will generally occur if there is at least 95% and preferably at least 97% sequence identity between the probe and the target sequence. Examples of stringent hybridization conditions are overnight incubation in a solution comprising 50% formalin, 5 χ SSC (15 mM NaC1150, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6). , 5x Denhardt's solution, 10% dextran sulfate, and 20 MiTVhiI of denatured sheared carrier DNA such as salmon sperm DNA, followed by washing of the 0.1 χ SSC hybridization support at approximately 65 ° C, preferably by about 65 ° C. 10 minutes. Other hybridization and wash conditions are well known and are exemplified in Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY (1989), particularly chapter 11.
Tais seqüências variantes podem ser também obtidas através de amplificação de DNA usando oligonucleotídeos específicos para genes de sacarose sintase (SusC) como iniciadores, tal como, mas não limitado a, oligonucleotídeos compreendendo ou consistindo em cerca de 20 a cerca de 50 nucleotídeos consecutivos da seqüência de nucleotídeo da SEQ ID N9: 1, 4, 6, 8, 20, 12 ou 14 ou seu complemento. As SEQ ID Nos.: 17 e 18 mostram as seqüências de nucleotídeo de iniciadores que são particularmente ade- quadas como iniciadores específicos de SusC.Such variant sequences may also be obtained by DNA amplification using sucrose synthase (SusC) gene specific oligonucleotides as primers such as, but not limited to, oligonucleotides comprising or consisting of about 20 to about 50 consecutive nucleotides of the sequence. of nucleotide of SEQ ID NO: 1, 4, 6, 8, 20, 12 or 14 or their complement. SEQ ID Nos: 17 and 18 show the nucleotide sequences of primers that are particularly suitable as SusC specific primers.
Em adição a explorar a variação natural para isoformas de SusC, seqüências variantes podem ser também obtidas através da geração induzida de variação in vitro ou in vivo. Vários métodos para geração induzi- da in vitro de seqüências de nucleotídeo variantes estão disponíveis na téc- nica incluindo, mas não limitado a, técnicas de embaralhamento ou evolução direta de DNA conforme descrito na Patente US 5605793, patente US 5.811.238 e patente US 5.830.721. Métodos para geração induzida in vivo de seqüência de nucleotídeo variante são também bem conhecidos na técni- ca e podem incluir exposição de plantas de algodão ou plantas progenitoras de algodão a mutagênicos tal como radiação por ionização, SEM, MMS ou similar, seguido por isolamento dos ácidos nucléicos codificando SusC1 por exemplo, conforme descrito em outro lugar aqui.In addition to exploiting natural variation for SusC isoforms, variant sequences may also be obtained by induced generation of variation in vitro or in vivo. Various methods for in vitro induced generation of variant nucleotide sequences are available in the art including, but not limited to, shuffling techniques or direct DNA evolution as described in US Patent 5,605,793, US Patent 5,811,238 and US Patent 5,830,721. Methods for in vivo induced generation of variant nucleotide sequences are also well known in the art and may include exposure of cotton plants or cotton progenitor plants to mutagens such as ionization radiation, SEM, MMS or the like, followed by isolation of the Nucleic acids encoding SusC1 for example, as described elsewhere herein.
Tendo identificado as seqüências consenso C-terminais e N- terminais para uma sacarose sintase tipo SusC, os inventores identificaram as seqüências de nucleotídeo que seguem em bancos de dados codificando seqüências de aminoácido compreendendo tais seqüências consenso. U- sando a região consenso N-terminal (SEQ ID NQ:16) as seqüências que se- guem (identificadas pelo seu Número de acesso) foram isoladas: EM- BL.C0499214 (cDNA de Gossypium hirsutum; EMBL: C0499090 (cDNA de Gossypium hirsutum)·, EMBLCP498472 (cDNA de Gossypium hirsutum)·, EMBL:C)498387 (cDNA de Gossypium hirsutum)·, EMBL:C)497432 (Gossy- pium hirsutum cDNA); EMBL:C0497275 (Gossypium hirsutum cDNA); EM- BL:C0496887 (Gossypium hirsutum cDNA); EMBL:C0496593 (Gossypium hirsutum cDNA); EMBL.O0495954 (Gossypium hirsutum cDNA); EM- BL:C0495804 (Gossypium hirsutum cDNA); EMBL:C0495643(Gossyp/um hirsutum cDNA); EMBLC0495623 (Gossypium hirsutum cDNA); EM- BL:C0495601 {Gossypium hirsutum cDNA); EMBL:C0495511 (Gossypium hirsutum cDNA); EMBL:C0494831 (Gossypium hirsutum cDNA); EM- BL:C0494779 (Gossypium hirsutum cDNA); EMBL:C0494693 (Gossypium hirsutum cDNA); EMBLC0494536 (Gossypium hirsutum cDNA); EM- BLC0494182 (Gossypium hirsutum cDNA); EMBL:C0493935 (Gossypium hirsutum cDNA); EMBL.C0493780 (Gossypium hirsutum cDNA); EM- BL:C0493725 (Gossypium hirsutum cDNA); EMBL.C0493724 (Gossypium hirsutum cDNA); EMBL:C0493589 (Gossypium hirsutum cDNA); EM- BL.-C0493043 (Gossypium hirsutum cDNA); EMBL:C0493007(Gossypium hirsutum cDNA); EMBL:C0492969 (Gossypium hirsutum cDNA); EM- BL.C0492566 (Gossypium hirsutum cDNA); EMBL:C0490959 (Gossypium hirsutum cDNA); EMBL:BF272227 (Gossypium arboreum cDNA clone GA_Eb0014E15f); EMBL:C0493732 (Gossypium hirsutum cDNA); EM- BL:C0493111 (Gossypium hirsutum cDNA); EMBL:C0491540 (Gossypium hirsutum cDNA); EMBLC0493583 (Gossypium hirsutum cDNA); EM- BL:C0493765 (Gossypium hirsutum cDNA); e EMBLC0498054 (Gossypium hirsutum cDNA). Todas as seqüências citadas são aqui incorporadas a título de referência. Usando a região de consenso C-terminal (SEQ ID NQ:15) as seqüências que seguem (Identificadas pelo seu número de acesso) foram isoladas: EMBL:DT463218 (Gossypium hirsutum cDNA clone GH_CHX13D22-3); EMBLC0499051 (Gossypium hirsutum cDNA); EM- BLC0498850(Gossypium hirsutum cDNA); EMBL:C0498685 (Gossypium hirsutum cDNA); EMBL:C0498361 (Gossypium hirsutum cDNA); EM- BL:C0498044 (Gossypium hirsutum cDNA); EMBL:C0497702 (Gossypium hirsutum cDNA); EMBLC0497506 (Gossypium hirsutum cDNA); EM- BL:C0497291 (Gossypium hirsutum cDNA); EMBL:C0497151 Gossypium hirsutum cDNA); EMBL:C0496924 (Gossypium hirsutum cDNA); EM- BL:C0496907 (Gossypium hirsutum cDNA); EMBL:C0496748 (Gossypium hirsutum cDNA); EMBL:C0496747 (Gossypium hirsutum cDNA); EM- BL:C0496373 (Gossypium hirsutum cDNA); EMBL:C0496003 (Gossypium hirsutum cDNA); EMBL:C0495974 (Gossypium hirsutum cDNA); EM- BL:C0495570 (Gossypium hirsutum cDNA); EMBL:C0494691 (Gossypium hirsutum cDNA); EMBL:C0493817 (Gossypium hirsutum cDNA); EM- BL:C0493779 (Gossypium hirsutum cDNA); EMBL:C0493750 (Gossypium hirsutum cDNA); EMBL:C0492925 (Gossypium hirsutum cDNA); EM- BL:C0492917 (Gossypium hirsutum cDNA); EMBLC0492529 (Gossypium hirsutum cDNA); EMBL:C0492496 (Gossypium hirsutum cDNA); EM- BL:C0493902 (Gossypium hirsutum cDNA); EMBL:BQ412019 GossypiumHaving identified the C-terminal and N-terminal consensus sequences for a SusC sucrose synthase, the inventors identified the following nucleotide sequences in databases encoding amino acid sequences comprising such consensus sequences. Using the N-terminal consensus region (SEQ ID NQ: 16) the following sequences (identified by their Accession Number) were isolated: EM-BL.C0499214 (Gossypium hirsutum cDNA; EMBL: C0499090 ( Gossypium hirsutum) ·, EMBLCP498472 (Gossypium hirsutum cDNA) ·, EMBL: C) 498387 (Gossypium hirsutum cDNA) ·, EMBL: C) 497432 (Gossypium hirsutum cDNA); EMBL: C0497275 (Gossypium hirsutum cDNA); MS-BL: C0496887 (Gossypium hirsutum cDNA); EMBL: C0496593 (Gossypium hirsutum cDNA); EMBL.O0495954 (Gossypium hirsutum cDNA); MS-BL: C0495804 (Gossypium hirsutum cDNA); EMBL: C0495643 (Gossyp / um hirsutum cDNA); EMBLC0495623 (Gossypium hirsutum cDNA); MS-BL: C0495601 (Gossypium hirsutum cDNA); EMBL: C0495511 (Gossypium hirsutum cDNA); EMBL: C0494831 (Gossypium hirsutum cDNA); MS-BL: C0494779 (Gossypium hirsutum cDNA); EMBL: C0494693 (Gossypium hirsutum cDNA); EMBLC0494536 (Gossypium hirsutum cDNA); MS-BLC0494182 (Gossypium hirsutum cDNA); EMBL: C0493935 (Gossypium hirsutum cDNA); EMBL.C0493780 (Gossypium hirsutum cDNA); MS-BL: C0493725 (Gossypium hirsutum cDNA); EMBL.C0493724 (Gossypium hirsutum cDNA); EMBL: C0493589 (Gossypium hirsutum cDNA); MS-BL.-CO493043 (Gossypium hirsutum cDNA); EMBL: C0493007 (Gossypium hirsutum cDNA); EMBL: C0492969 (Gossypium hirsutum cDNA); MS-BL.C0492566 (Gossypium hirsutum cDNA); EMBL: C0490959 (Gossypium hirsutum cDNA); EMBL: BF272227 (Gossypium arboreum cDNA clone GA_Eb0014E15f); EMBL: C0493732 (Gossypium hirsutum cDNA); MS-BL: C0493111 (Gossypium hirsutum cDNA); EMBL: C0491540 (Gossypium hirsutum cDNA); EMBLC0493583 (Gossypium hirsutum cDNA); MS-BL: C0493765 (Gossypium hirsutum cDNA); and EMBLC0498054 (Gossypium hirsutum cDNA). All sequences cited are incorporated herein by reference. Using the C-terminal consensus region (SEQ ID NQ: 15) the following sequences (identified by their accession number) were isolated: EMBL: DT463218 (Gossypium hirsutum cDNA clone GH_CHX13D22-3); EMBLC0499051 (Gossypium hirsutum cDNA); MS-BLC0498850 (Gossypium hirsutum cDNA); EMBL: C0498685 (Gossypium hirsutum cDNA); EMBL: C0498361 (Gossypium hirsutum cDNA); MS-BL: C0498044 (Gossypium hirsutum cDNA); EMBL: C0497702 (Gossypium hirsutum cDNA); EMBLC0497506 (Gossypium hirsutum cDNA); MS-BL: C0497291 (Gossypium hirsutum cDNA); EMBL: C0497151 Gossypium hirsutum cDNA); EMBL: C0496924 (Gossypium hirsutum cDNA); MS-BL: C0496907 (Gossypium hirsutum cDNA); EMBL: C0496748 (Gossypium hirsutum cDNA); EMBL: C0496747 (Gossypium hirsutum cDNA); MS-BL: C0496373 (Gossypium hirsutum cDNA); EMBL: C0496003 (Gossypium hirsutum cDNA); EMBL: C0495974 (Gossypium hirsutum cDNA); MS-BL: C0495570 (Gossypium hirsutum cDNA); EMBL: C0494691 (Gossypium hirsutum cDNA); EMBL: C0493817 (Gossypium hirsutum cDNA); MS-BL: C0493779 (Gossypium hirsutum cDNA); EMBL: C0493750 (Gossypium hirsutum cDNA); EMBL: C0492925 (Gossypium hirsutum cDNA); MS-BL: C0492917 (Gossypium hirsutum cDNA); EMBLC0492529 (Gossypium hirsutum cDNA); EMBL: C0492496 (Gossypium hirsutum cDNA); MS-BL: C0493902 (Gossypium hirsutum cDNA); EMBL: BQ412019 Gossypium
arboreum cDNA clone GA_Ed0053A09r); EMBL:C0497086 (Gossypiumarboreum cDNA clone GA_Ed0053A09r); EMBL: C0497086 (Gossypium
hirsutum cDNA) EMBL:C0494597 (Gossypium hirsutum cDNA); EM- BL:C0494374 (Gossypium hirsutum cDNA); EMBLC0499615 (Gossypium hirsutum cDNA) ; EMBLC0490812 (Gossypium hirsutum cDNA); EM- BL:C0493178 (Gossypium hirsutum cDNA).hirsutum cDNA) EMBL: C0494597 (Gossypium hirsutum cDNA); MS-BL: C0494374 (Gossypium hirsutum cDNA); EMBLC0499615 (Gossypium hirsutum cDNA); EMBLC0490812 (Gossypium hirsutum cDNA); MS-BL: C0493178 (Gossypium hirsutum cDNA).
Formas variantes de sacarose sintase isoforma C adequadas para a invenção podem então ter uma seqüência de aminoácido que com- preende uma seqüência de aminoácido tendo pelo menos cerca de 60% ou cerca de 70% ou cerca de 80% ou cerca de 90% ou cerca de 95% de identi- dade de seqüência com a seqüência de aminoácido de seqüência consenso N-terminal (SEQ ID Ne:16) ou a seqüência consenso C-terminal (SEQ ID NQ: 15) ou ambas. As variantes podem ter uma seqüência de aminoácido tendo pelo menos cerca de 60% ou cerca de 70% ou cerca de 80% ou cerca de 90% ou cerca de 95% de identidade de seqüência com a seqüência de ami- noácido de qualquer uma das isoformas SusC de SEQ ID Ng: 2, 3, 5, 7, 9, 11 ou 13. Seqüências de nucleotídeo codificando tais variantes podem ter uma seqüência de nucleotídeo tendo pelo menos cerca de 60% ou cerca de 70% ou cerca de 80% ou cerca de 90% ou cerca de 95% de identidade de se- qüência com a seqüência de nucleotídeo de qualquer uma das seqüências de nucleotídeo da SEQ ID NQ: 1, 4, 6, 8, 10, 12 ou 14.Suitable isoform C sucrose synthase variant forms suitable for the invention may then have an amino acid sequence comprising an amino acid sequence having at least about 60% or about 70% or about 80% or about 90% or about 95% sequence identity with either the N-terminal consensus sequence amino acid sequence (SEQ ID Ne: 16) or the C-terminal consensus sequence (SEQ ID NQ: 15) or both. Variants may have an amino acid sequence having at least about 60% or about 70% or about 80% or about 90% or about 95% sequence identity to the amino acid sequence of any of the above. SusC isoforms of SEQ ID Ng: 2, 3, 5, 7, 9, 11 or 13. Nucleotide sequences encoding such variants may have a nucleotide sequence having at least about 60% or about 70% or about 80%. or about 90% or about 95% sequence identity to the nucleotide sequence of any of the nucleotide sequences of SEQ ID NQ: 1, 4, 6, 8, 10, 12 or 14.
Para o propósito da presente invenção, a "identidade de se- qüência" de duas seqüências de nucleotídeo ou aminoácido relacionadas, expressa como porcentagem, refere-se ao número de posição nas duas se- qüências otimamente alinhadas que têm resíduos idênticos (x100) dividido pelo número de posições comparado. Uma lacuna, isto é, uma posição em um alinhamento onde um resíduo está presente em uma seqüência mas não na outra, é considerada como uma posição com resíduos não-idênticos. O alinhamento das duas seqüências é realizado através do algoritmo Needle- man e Wunsch (Needleman e Wunsch, 1970). O alinhamento de seqüência auxiliado por computador acima pode ser convenientemente realizado usan- do programa de software padrão tal como GAP que é parte da Wisconsin Package Version 10.1 (Genetics Computer Group, Madison, Wisconsin, LI- SA) usando a scoring default matrix com uma penalidade de criação de la- cuna de 50 e uma penalidade de extensão de coluna de 3.For purposes of the present invention, the "sequence identity" of two related nucleotide or amino acid sequences, expressed as a percentage, refers to the position number in the two optimally aligned sequences that have identical (x100) divided residues. by the number of positions compared. A gap, that is, a position in an alignment where a residue is present in one sequence but not the other, is considered as a position with non-identical residues. The alignment of the two sequences is performed through the Needleman and Wunsch algorithm (Needleman and Wunsch, 1970). The above computer-aided sequence alignment can conveniently be accomplished using standard software programs such as GAP which is part of the Wisconsin Package Version 10.1 (Genetics Computer Group, Madison, Wisconsin, LI) using scoring default matrix with a spawn penalty of 50 and a spine extension penalty of 3.
Será claro que sempre que seqüências de nucleotídeo de molé- culas de RNA forem definidas através de referência à seqüência de nucleo- tídeo de moléculas de DNA correspondentes, a timina (T) na seqüência de nucleotídeo deve ser substituída por uracila (U). Se referência é feita a mo- léculas de RNA ou DNA ficará claro a partir do contexto da aplicação. O nível funcional da isoforma de sacarose sintase tipo C ou sa- carose sintases com características similares pode também ser aumentado através de variação de seqüência induzida in vivo. Para esta finalidade, mé- todos para variação de seqüência induzida conhecidos na técnica conforme descrito em outro local neste documento podem ser aplicados para plantas de produção de fibra, tal como plantas de algodão e plantas produtoras de fibra podem ser identificadas que exibem uma atividade maior para atividade da sacarose sintase isoforma C (por exemplo, usando o ensaio enzimático descrito aqui) ou que exibem uma concentração maior de sacarose sintase, particularmente de sacarose sintase isoforma C no fluido apoplástico ou pa- rede celular, particularmente durante a fase de desenvolvimento da forma- ção de parede de célula secundária, e particularmente em paredes de célula da fibra ou no fluido apoplástico circundando essas fibras.It will be clear that whenever nucleotide sequences of RNA molecules are defined by reference to the nucleotide sequence of corresponding DNA molecules, the thymine (T) in the nucleotide sequence must be replaced by uracil (U). Whether reference is made to RNA or DNA molecules will be clear from the context of the application. The functional level of sucrose synthase type C isoform or saccharose synthases with similar characteristics can also be increased by in vivo induced sequence variation. For this purpose, methods of induced sequence variation known in the art as described elsewhere herein can be applied to fiber production plants, such as cotton plants and fiber producing plants can be identified that exhibit greater activity. for isoform C sucrose synthase activity (for example, using the enzyme assay described herein) or which exhibit a higher concentration of sucrose synthase C, particularly sucrose isoform C sucrose in the apoplastic fluid or cell wall, particularly during the developmental phase of the sucrose synthase. secondary cell wall formation, and particularly in fiber cell walls or in the apoplastic fluid surrounding such fibers.
Para algumas plantas produtoras de fibra, pode ser vantajoso modificar as características da fibra através da diminuição do nível funcional de sacarose sintase isoforma C ou sacarose sintases com características similares. Isto pode novamente ser convenientemente conseguido através da introdução de um gene codificando um RNA de silenciamento.For some fiber producing plants, it may be advantageous to modify fiber characteristics by decreasing the functional level of sucrose synthase isoform C or sucrose synthases with similar characteristics. This can again conveniently be accomplished by introducing a gene encoding a silencing RNA.
Em uma modalidade, o gene codificando o RNA de silenciamen- to pode codificar uma molécula de RNA de silenciamento ou uma molécula de RNA inibidora, capaz de redução da expressão de um gene endógeno codificando uma sacarose sintase isoforma C para alterar características de fibra. Tal redução da expressão de um gene codificando uma sacarose sin- tase isoforma C deve acontecer de preferência através de silenciamento pós-transcripcional. No entanto, será claro que mesmo quando uma molécu- la de RNA inibidora diminui a expressão de um gene de SusC através de silenciamento pós-transcripcional, tal molécula de RNA pode também exer- cer outras funções dentro de uma célula, tal como guia da metilação de DNA do gene de SusC endógeno, novamente por fim levando à expressão menor deste gene. Também, expressão de genes de SusC endógenos pode ser reduzida através de silenciamento transcripcional, por exemplo, usando RNAi ou dsRNA direcionado contra a região promotora do gene de SusC endógeno.In one embodiment, the gene encoding the silencing RNA may encode a silencing RNA molecule or an inhibitory RNA molecule capable of reducing expression of an endogenous gene encoding an isoform C sucrose synthase to alter fiber characteristics. Such reduction of expression of a gene encoding an isoform C sucrose synthase should preferably occur by post-transcriptional silencing. However, it will be clear that even when an inhibitory RNA molecule decreases the expression of a SusC gene through post-transcriptional silencing, such an RNA molecule can also perform other functions within a cell, such as guiding DNA methylation of the endogenous SusC gene, again eventually leading to lower expression of this gene. Also, expression of endogenous SusC genes may be reduced by transcriptional silencing, for example, using RNAi or dsRNA directed against the promoter region of the endogenous SusC gene.
Vários métodos estão disponíveis na técnica para produzir uma molécula de RNA de silenciamento, isto é, uma molécula de RNA que quan- do expressa reduz a expressão de um gene particular ou grupo de genes, incluindo as chamadas tecnologias de RNA de "sentido" ou "antissentido".Several methods are available in the art for producing a silencing RNA molecule, that is, an RNA molecule that when expressed reduces the expression of a particular gene or group of genes, including so-called "sense" or "RNA" technologies. "antisense".
Então, em uma modalidade, a molécula de RNA inibidora codifi- cando gene quimérico é baseada na chamada tecnologia de antissentido. Em outras palavras, a região de codificação do gene quimérico compreende uma seqüência de nucleotídeo de pelo menos 20 nucleotídeos consecutivos do complemento da seqüência de nucleotídeo de um gene de SusC endó- geno. Tal gene quimérico pode ser construído ligando operavelmente um fragmento de DNA compreendendo pelo menos 20 nucleotídeos de um gene de SusC, que pode ser isolado ou identificado conforme descrito em outro lugar neste pedido, em orientação inversa a um promotor expressável em planta e região de formação de extremidade 3' envolvida em terminação de transcrição e poliadenilação. Será claro que não há nenhuma necessidade de saber a seqüência de nucleotídeo exata ou a seqüência de nucleotídeo completa de tal fragmento de DNA do gene de SusC isolado.Then, in one embodiment, the inhibitory RNA molecule encoding the chimeric gene is based on the so-called antisense technology. In other words, the chimeric gene coding region comprises a nucleotide sequence of at least 20 consecutive nucleotides from the nucleotide sequence complement of an endogenous SusC gene. Such a chimeric gene may be constructed by operably linking a DNA fragment comprising at least 20 nucleotides of a SusC gene, which may be isolated or identified as described elsewhere in this application, in reverse orientation to a plant expressible promoter and region of formation. 3 'terminus involved in transcription termination and polyadenylation. It will be clear that there is no need to know the exact nucleotide sequence or the complete nucleotide sequence of such an isolated SusC gene DNA fragment.
Em outra modalidade, a molécula de RNA inibidora codificando gene quimérico é baseada na chamada tecnologia de co-supressão. Em ou- tras palavras, a região de codificação do gene quimérico compreende uma seqüência de nucleotídeo de pelo menos 20 nucleotídeos consecutivos da seqüência de ácido nucléico de um gene de SusC endógeno da planta. Tal gene quimérico pode ser construído ligando operavelmente um fragmento de DNA compreendendo pelo menos 20 nucleotídeos de um gene de SusC, que pode ser isolado ou identificado conforme descrito em outro lugar neste pedido, em orientação direta a um promotor expressável em planta e região de formação de extremidade 3' envolvida em terminação de transcrição e poliadenilação. Novamente, ficará claro que não há nenhuma necessidade de saber a seqüência de nucleotídeo exata ou a seqüência de nucleotídeo completa de tal fragmento de DNA ou do gene de SusC isolado.In another embodiment, the inhibitory RNA molecule encoding the chimeric gene is based on so-called co-suppression technology. In other words, the chimeric gene coding region comprises a nucleotide sequence of at least 20 consecutive nucleotides from the nucleic acid sequence of a plant endogenous SusC gene. Such a chimeric gene may be constructed by operably linking a DNA fragment comprising at least 20 nucleotides of a SusC gene, which may be isolated or identified as described elsewhere in this application, in direct orientation to a plant expressible promoter and region of formation. 3 'terminus involved in transcription termination and polyadenylation. Again, it will be clear that there is no need to know the exact nucleotide sequence or the complete nucleotide sequence of such a DNA fragment or isolated SusC gene.
A eficiência dos genes quiméricos acima mencionados na redu- 10The efficiency of the chimeric genes mentioned above in reducing
1515
ção da expressão dos genes de SusC endógenos pode ser aumentada mais através da inclusão de elemento de DNA que resulta na expressão de molé- culas de RNA inibidoras não-poliadeniladas, aberrantes, ou resulta na reten- ção das moléculas de RNA inibidoras no núcleo das células. Um tal elemen- to de DNA adequado para este propósito é uma região de DNA codificando uma ribozima autocatalítica (self-splicing), conforme descrito no WO 0001133 (incorporado a título de referência). Outro tal elemento de DNA adequado para este propósito é uma região de DNA codificando uma locali- zação nuclear ou sinal de retenção de RNAj conforme descrito noThe expression of endogenous SusC genes can be further enhanced by the inclusion of a DNA element that results in the expression of aberrant non-polyadenylated inhibitory RNA molecules, or results in the retention of inhibitory RNA molecules in the nucleus of DNA. cells One such DNA element suitable for this purpose is a region of DNA encoding a self-splicing ribozyme, as described in WO 0001133 (incorporated by reference). Another such DNA element suitable for this purpose is a region of DNA encoding a nuclear localization or jRNA retention signal as described in
PCT/AU03O0292 publicado como W003O76619 (incorporado a título de re- ferência).PCT / AU03O0292 published as W003O76619 (incorporated by reference).
Um modo conveniente e muito eficiente de regulação descen- dente da expressão de um gene de interesse usa o chamado RNA de fila- mento duplo (dsRNA) ou RNA de interferência (RNAi)1 conforme descrito, por exemplo, no W09963050 (incorporado a título de referência). Nesta tec- nologia, uma molécula de RNA é introduzida em uma célula de planta, com o que a molécula de RNA é capaz de formar uma região de RNA de filamento duplo em pelo menos cerca de 19 a cerca de 21 nucleotídeos, e com o que um dos filamentos desta região de RNA de filamento duplo é mais ou menos idêntica em seqüência de nucleotídeo ao gene-alvo ("região de sentido"), enquanto o outro filamento é mais ou menos idêntico em seqüência de nu- cleotídeo ao complemento do gene-alvo ou da região de sentido ("região de antissentido"). É esperado que para silenciamento da expressão do gene- alvo, a seqüência de nucleotídeo das 19 seqüências de nucleotídeo conse- cutivas possa ter uma falta de combinação, ou as regiões de sentido e antis- sentido podem diferir em um nucleotídeo. Para conseguir a construção de tais moléculas de RNA ou dos genes quiméricos de codificação, pode ser feito uso do vetor conforme descrito no WO 02059294.A convenient and very efficient down-regulation of expression of a gene of interest uses so-called double stranded RNA (dsRNA) or interfering RNA (RNAi) 1 as described, for example, in W09963050 (incorporated herein). of reference). In this technology, an RNA molecule is introduced into a plant cell, whereby the RNA molecule is capable of forming a double stranded RNA region in at least about 19 to about 21 nucleotides, and with the that one strand of this double stranded RNA region is roughly identical in nucleotide sequence to the target gene ("sense region"), while the other strand is roughly identical in nucleus sequence to the complement of the target gene. target gene or sense region ("antisense region"). It is expected that for silencing target gene expression, the nucleotide sequence of the 19 consecutive nucleotide sequences may be mismatched, or the sense and antisense regions may differ by one nucleotide. To achieve the construction of such RNA molecules or chimeric coding genes, use may be made of the vector as described in WO 02059294.
Então, em uma modalidade da invenção, um método para alte- ração das características de fibra de uma planta produtora de fibra, tal como algodão, é provido compreendendo a etapa de introdução de um gene qui- mérico em uma célula da planta produtora de fibra, onde o gene quiméricoThus, in one embodiment of the invention, a method for altering the fiber characteristics of a fiber producing plant, such as cotton, is provided comprising the step of introducing a chimeric gene into a cell of the fiber producing plant. where the chimeric gene
20 compreende os elementos de DNA operavelmente ligados que seguem: (a) um promotor expressável de planta, de preferência um promotor20 comprises operably linked DNA elements that follow: (a) a plant expressible promoter, preferably a promoter
expressável em planta que controla transcrição de preferência nas células de fibra;plant expressible which preferably controls transcription in fiber cells;
(b) uma região de DNA transcrito, que quando transcrito dá uma(b) a region of transcribed DNA, which when transcribed gives a
molécula de RNA de filamento duplo capaz de reduzir a expressão de um gene de SusC endógeno para a planta produtora de fibra, e a molécula de RNA compreendendo uma primeira e uma segunda região de DNA onde i) a primeira região de RNA compreende uma seqüência de nucle-double stranded RNA molecule capable of reducing the expression of an endogenous SusC gene to the fiber producing plant, and the RNA molecule comprising a first and a second DNA region where i) the first RNA region comprises a sequence of nucle-
otídeo de pelo menos 19 nucleotídeos consecutivos tendo pelo menos cerca de 94% de identidade de seqüência com a seqüência de nucleotídeo de um gene de SusC endógeno;an otide of at least 19 consecutive nucleotides having at least about 94% sequence identity with the nucleotide sequence of an endogenous SusC gene;
ü) a segunda região de RNA compreende uma seqüência de nu-ü) the second RNA region comprises a sequence of
cleotídeo complementar a pelo menos 19 nucleotídeos consecutivos da pri- meira região de RNA;cleotide complementary to at least 19 consecutive nucleotides of the first RNA region;
üi) as primeira e segunda regiões de RNA são capazes de empare-üi) the first and second RNA regions are capable of matching
Ihamento de base para formar uma molécula de RNA de filamento duplo en- tre pelo menos os 19 nucleotídeos das primeira e segunda regiões; e (c) uma região de extremidade 3' compreendendo sinais de terminação de transcrição e poliadenilação funcionando em células da planta.Base line to form a double stranded RNA molecule between at least 19 nucleotides of the first and second regions; and (c) a 3 'end region comprising transcription termination and polyadenylation signals functioning in plant cells.
O comprimento das primeira e segunda regiões de RNA (região de sentido e antissentido) pode variar de a partir de cerca de 19 nucleotí- deos (nt) até um comprimento igualando o comprimento (em nucleotídeos) de um gene de SusC endógeno. O comprimento total da seqüência de nu- cleotídeo de sentido ou antissentido pode então ser pelo menos 25 nt ou pelo menos cerca de 50 nt ou pelo menos cerca de 100 nt ou pelo menos cerca de 150 nt ou pelo menos cerca de 200 nt ou pelo menos cerca de 500 nt. E esperado que não haja nenhum limite para o comprimento total da se- qüência de nucleotídeo de sentido ou de antissentido. No entanto, por ra- zões práticas (tal como, por exemplo, estabilidade dos genes quiméricos) é esperado que o comprimento da seqüência de nucleotídeo de sentido ou antissentido não deva exceder 5000 nt, particularmente não deva exceder 2500 nt e poderia ser limitado a cerca de 1000 nt.The length of the first and second RNA regions (sense and antisense region) may range from about 19 nucleotides (nt) to a length equaling the length (in nucleotides) of an endogenous SusC gene. The total length of the sense or antisense nucleotide sequence may then be at least 25 nt or at least about 50 nt or at least about 100 nt or at least about 150 nt or at least about 200 nt or at least minus about 500 nt. It is expected that there will be no limit to the total length of the sense or antisense nucleotide sequence. However, for practical reasons (such as, for example, chimeric gene stability) it is expected that the length of the sense or antisense nucleotide sequence should not exceed 5000 nt, particularly it should not exceed 2500 nt and could be limited to about 1000 nt.
Será compreendido que quanto maior o comprimento total da região de sentido ou antissentido, menos estringentes as necessidades de identidade de seqüência entre essas regiões e a seqüência correspondente em um gene de SusC endógeno ou seu complemento. De preferência, o á- cido nucléico de interesse deve ter uma identidade de seqüência de pelo menos cerca de 75% com a sequência-alvo correspondente, particularmente pelo menos cerca de 80%, mais particularmente de preferência pelo menos cerca de 85%, muito particularmente cerca de 90%, especialmente cerca de 95%, mais especialmente cerca de 100%, muito especialmente ser idêntica à parte correspondente da sequência-alvo ou seu complemento. No entanto, é preferido que o ácido nucléico de interesse sempre inclua uma seqüência de cerca de 19 nucleotídeos consecutivos, particularmente cerca de 25 nt, mais particularmente cerca de 50 nt, especialmente cerca de 100 nt, muito particularmente cerca de 250 nt com 100% de identidade de seqüência com a parte correspondente do ácido nucléico-alvo. De preferência, para cálculo da identidade de seqüência e projeto da seqüência de sentido ou antissenti- do correspondente, o número de lacunas deve ser minimizado, particular- mente para as seqüências de sentido mais curtas. dsRNA codificando genes quiméricos de acordo com a invençãoIt will be understood that the longer the total length of the sense or antisense region, the less stringent the sequence identity needs between these regions and the corresponding sequence in an endogenous SusC gene or complement thereof. Preferably, the nucleic acid of interest should have a sequence identity of at least about 75% with the corresponding target sequence, particularly at least about 80%, more particularly preferably at least about 85%, very particularly about 90%, especially about 95%, more especially about 100%, most especially being identical to the corresponding part of the target sequence or its complement. However, it is preferred that the nucleic acid of interest always includes a sequence of about 19 consecutive nucleotides, particularly about 25 nt, more particularly about 50 nt, especially about 100 nt, most particularly about 250 nt with 100%. sequence identity with the corresponding part of the target nucleic acid. Preferably, for calculating sequence identity and designing the corresponding sense or antisense sequence, the number of gaps should be minimized, particularly for shorter sense sequences. dsRNA encoding chimeric genes according to the invention
podem compreender um íntron, tal como um íntron heterólogo, localizado, por exemplo, na seqüência espaçadora entre as regiões de RNA de sentido e antissentido de acordo com a revelação do WO 9963050 (incorporado a- qui a título de referência). Em outra modalidade, a molécula de RNA de silenciamento outhey may comprise an intron, such as a heterologous intron, located, for example, in the spacer sequence between the sense and antisense RNA regions according to the disclosure of WO 9963050 (incorporated herein by reference). In another embodiment, the silencing RNA molecule or
de RNA inibidor pode ser uma molécula de microRNA, projetada, sintetizada e/ou modulada para direcionar e causar clivagem de genes de SusC endó- genos em uma planta produtora de fibra, tal como uma planta de algodão. Vários métodos foram descritos para gerar e usar miRNAs para um gene- alvo específico (incluindo, mas não-limitado a, Schwab e outros (2006, Plant Cell, 18(5): 1121-1133), W02006€44322, W02005O47505, WP 06009836, incorporados aqui a título de referência). Geralmente, uma estrutura de miRNA existente é modificado na porção de reconhecimento de gene-alvo de modo que o miRNA gerado agora guia o complexo RISC para clivar as moléculas de RNA transcritas do ácido nucléico-alvo. Estruturas de miRNA poderiam ser modificadas ou sintetizadas de modo que o miRNA agora compreende 21 nucleotídeos consecutivos da seqüência de nucleotídeo de uma seqüência de nucleotídeo codificando SusC1 tal como as seqüências representadas na Listagem de seqüência, e permitindo faltas de combinação de acordo com as regras descritas aqui abaixo. Seqüências particularmente adequadas das quais 21 nucleotídeos consecutivos podem ser escolhidos compreendem as seqüências de nucleotídeo codificando as seqüências de aminoácido específicas de SusC conforme aqui descrito.Inhibitor RNA may be a microRNA molecule designed, synthesized and / or modulated to target and cleave endogenous SusC genes in a fiber producing plant such as a cotton plant. Several methods have been described for generating and using miRNAs for a specific target gene (including, but not limited to, Schwab et al. (2006, Plant Cell, 18 (5): 1121-1133), W02006 € 44322, W02005O47505, WP 06009836, incorporated herein by reference). Generally, an existing miRNA structure is modified at the target gene recognition portion so that the generated miRNA now guides the RISC complex to cleave the transcribed RNA molecules from the target nucleic acid. MiRNA structures could be modified or synthesized so that miRNA now comprises 21 consecutive nucleotides from the nucleotide sequence of a SusC1 encoding nucleotide sequence such as the sequences represented in the Sequence Listing, and allowing mismatches according to the described rules. here below. Particularly suitable sequences from which 21 consecutive nucleotides may be chosen comprise nucleotide sequences encoding SusC-specific amino acid sequences as described herein.
Então, em uma modalidade, a invenção provê um método para regulação descendente da expressão de uma ou aumento da resistência de plantas a condições de crescimento adversas, compreendendo as etapas de a. introdução de um gene quimérico em células das ditas plantas, oThus, in one embodiment, the invention provides a method for down-regulating the expression of one or increased plant resistance to adverse growth conditions, comprising steps a. introduction of a chimeric gene into cells of said plants, the
dito gene quimérico compreendendo as regiões de DNA operavelmente liga- do que seguem:said chimeric gene comprising the operably linked DNA regions as follows:
i. um promotor expressável de planta;i. an expressible plant promoter;
ii. uma região de DNA que quando da introdução e transcrição em uma célula de planta é processada em miRNA, com o que o miRNA é capaz de reconhecer e guiar a clivagem do mRNA de um gene codificando SusC da planta; eii. a region of DNA which upon introduction and transcription in a plant cell is processed into miRNA, whereby the miRNA is able to recognize and guide mRNA cleavage of a gene encoding SusC from the plant; and
iii. opcionalmente, uma região de DNA 3' envolvida em terminação de transcrição e poliadenilação.iii. optionally a 3 'DNA region involved in transcription termination and polyadenylation.
A região de DNA mencionada processada em miRNA pode compreender uma seqüência de nucleotídeo que é essencialmente complementar a uma seqüência de nucleotídeo de pelo menos 21 nucleotídeos consecutivos de um gene codificando SusC, contanto que uma ou mais das faltas de combi- nação sejam permitidas:The mentioned miRNA processed DNA region may comprise a nucleotide sequence that is essentially complementary to a nucleotide sequence of at least 21 consecutive nucleotides of a SusC encoding gene, provided that one or more of the mismatches is allowed:
a. Uma falta de combinação entre o nucleotídeo na extremidade 5'The. A mismatch between the 5 'end nucleotide
do miRNA e a seqüência de nucleotídeo correspondente na molécula de RNA; b. Uma falta de combinação entre qualquer um dos nucleotídeos na posição 1 até a posição 9 do miRNA e a seqüência de nucleotídeo cor- respondente na molécula de RNA;miRNA and the corresponding nucleotide sequence in the RNA molecule; B. A mismatch between any of the nucleotides at position 1 to position 9 of the miRNA and the corresponding nucleotide sequence in the RNA molecule;
c. Três faltas de combinação entre qualquer um dos nucleotídeos na posição 12 até a posição 21 do miRNA e a seqüência de nucleotídeo cor- respondente na molécula de RNA contanto que não haja mais de duas faltas de combinação consecutivas.ç. Three mismatches between any of the nucleotides at position 12 to miRNA position 21 and the corresponding nucleotide sequence in the RNA molecule as long as there are no more than two consecutive mismatches.
Diminuição do nível funcional de Sacarose sintase isoforma C pode ser vantajosa, por exemplo, em células de fibra que levam à fibra com penugem em algodão para reduzir ou evitar a divergência de energia e me- tabólitos para a produção da fibra com penugem menos favorecida às custas de produção de algodão.Decreasing the functional level of isoform C Sucrose synthase may be advantageous, for example, in fiber cells leading to cotton down fiber to reduce or prevent energy divergence and metabolites for producing downward downward fiber. costs of cotton production.
Conforme aqui usado, um "gene endógeno" é um gene que a- contece naturalmente nas espécies da planta produtora de fibra que foram escolhidas para modulação de características de fibra ou um gene que acon- tece naturalmente em uma espécie de outra planta produtora de fibra, mas pode ser introduzido na espécie da planta produtora de fibra que foi escolhi- da para modulação de características de fibra através de técnicas de gera- ção convencionais.As used herein, an "endogenous gene" is a naturally occurring gene in the fiber-producing plant species that have been chosen for modulation of fiber characteristics or a gene that naturally occurs in a species from another fiber-producing plant. but can be introduced into the species of the fiber producing plant that has been chosen for modulation of fiber characteristics through conventional generation techniques.
Ficará claro que a expressão de gene de SusC endógeno pode ser também regulada descendentemente usando genes quiméricos confor- me aqui descrito, onde o DNA codificando o RNA específico-alvo tem uma seqüência de nucleotídeo de pelo menos 20 nucleotídeos consecutivos sele- cionada das seqüências de nucleotídeo codificando as seqüências de ami- noácido da SEQ ID N2: 2, 3, 5, 7, 9, 11 ou 13 ou seu complemento, ou onde o RNA específico-alvo tem uma seqüência de nucleotídeo de pelo menos 20 nucleotídeos consecutivos selecionada das seqüências de nucleotídeo da SEQ ID N9: 1, 4, 6, 8, 10, 12 ou 14.It will be clear that endogenous SusC gene expression can also be down-regulated using chimeric genes as described herein, where the DNA encoding the target specific RNA has a nucleotide sequence of at least 20 consecutive nucleotides selected from the sequences of nucleotide encoding the amino acid sequences of SEQ ID NO: 2, 3, 5, 7, 9, 11 or 13 or their complement, or where the target specific RNA has a nucleotide sequence of at least 20 consecutive nucleotides selected from the following: nucleotide sequences of SEQ ID NO: 1, 4, 6, 8, 10, 12 or 14.
Conforme aqui usado, o termo "promotor" significa qualquer DNA que é reconhecido e ligado (diretamente ou indiretamente) por uma RNA-polimerase dependente de DNA durante iniciação da transcrição. Um promotor inclui o sítio de iniciação de transcrição, e sítios de ligação para fatores de iniciação de transcrição e RNA polimerase, e pode compreender vários outros sítios (por exemplo, acentuadores), aos quais proteínas regu- ladoras de expressão de gene podem se ligar.As used herein, the term "promoter" means any DNA that is recognized and linked (directly or indirectly) by a DNA-dependent RNA polymerase during transcription initiation. A promoter includes the transcription initiation site, and binding sites for transcription initiation factors and RNA polymerase, and may comprise a number of other sites (e.g., enhancers) to which gene expression regulatory proteins may bind. .
Conforme aqui usado, o termo "promotor expressável em planta" significa uma seqüência de DNA que é capaz de controlar (iniciação) de transcrição em uma célula de planta. O promotor de genes quiméricos des- crito aqui pode ser naturalmente associado com as regiões de codificação ou ele pode ser um promotor heterólogo. O termo promotor expressável em planta inclui qualquer promotor de origem de planta, mas também qualquer promotor de origem de não-planta que seja capaz de direcionar a transcrição em uma célula de planta, isto é, certos promotores de origem viral ou bacte- riana tal como o CaMV35S, o promotor do vírus do trevo subterrâneo N5 4 ou Ng 7 ou promotores do gene de T-DNA e similar.As used herein, the term "plant expressible promoter" means a DNA sequence that is capable of controlling (initiation) transcription in a plant cell. The chimeric gene promoter described herein may be naturally associated with the coding regions or it may be a heterologous promoter. The term plant expressible promoter includes any plant origin promoter, but also any non-plant origin promoter that is capable of directing transcription in a plant cell, that is, certain promoters of viral or bacterial origin as such. such as CaMV35S, the underground clover virus promoter N5 4 or Ng 7 or T-DNA gene promoters and the like.
O promotor expressável em planta pode ser constitutivo ou ele pode iniciar a transcrição da região ligada a jusante de maneira espacial ou temporária. Um promotor expressável em planta que controla início e manu- tenção de transcrição de preferência em células de fibra é um promotor que direciona a transcrição da região de DNA operavelmente ligado para um ní- vel maior em células de fibra e nas células de epiderme de base do que em outras células ou tecidos da planta. Tais promotores incluem o promotor de algodão de um gene de β-tubulina específico de fibra (conforme descrito no W00210377), o promotor do algodão de um gene de actina específico de fibra (conforme descrito no W00210413), o promotor de um gene de proteí- na de transferência de lipídeo específico de fibra de algodão (conforme des- crito na US5792933), um promotor de um gene expansina de algodão (W09830698) ou um promotor de um gene de quitinase em algodão (US2003106097) ou os promotores dos genes específicos de fibra descritos na US6259003 ou US6166294. O promotor pode ser também induzível por composto químico (geralmente em combinação com um ativador transcrip- cional) conforme descrito, por exemplo, no W09321334, US5514578, EP0823480, W098O5789, W001/34821 ou W00220811.The plant expressible promoter may be constitutive or it may initiate transcription of the downstream bound region in a spatial or temporary manner. A plant expressible promoter that controls transcription initiation and maintenance preferably in fiber cells is a promoter that directs transcription of the operably linked DNA region to a higher level in fiber cells and base epidermis cells. than in other cells or tissues of the plant. Such promoters include the cotton promoter of a fiber-specific β-tubulin gene (as described in W00210377), the cotton promoter of a fiber-specific actin gene (as described in W00210413), the promoter of a protein-specific gene. - cotton fiber specific lipid transfer (as described in US5792933), a cotton expansin gene promoter (W09830698) or a cotton chitinase gene promoter (US2003106097) or specific gene promoters described in US6259003 or US6166294. The promoter may also be inducible by chemical compound (generally in combination with a transcriptional activator) as described, for example, in W09321334, US5514578, EP0823480, W098O5789, W001 / 34821 or W00220811.
Os métodos descritos aqui vão mudar as características de fi- bras produzidas pelas plantas modificadas incluído resistência, comprimen- to, finura da fibra, razão de maturidade da fibra, teor de fibra imatura, unifor- midade da fibra e "micronaire".The methods described here will change the fiber characteristics produced by the modified plants including strength, length, fiber fineness, fiber maturity ratio, immature fiber content, fiber uniformity, and micronaire.
- O "micronaire" da fibra (que pode ser determinado através de HVI) é uma medida sem unidade que depende ambos da maturidade da fi- bra (ou espessura da parede determinada por teor de celulose de parede secundária) e diâmetro da fibra.- Fiber "micronaire" (which can be determined by HVI) is a unitless measure that depends on both fiber maturity (or wall thickness determined by secondary wall cellulose content) and fiber diameter.
- Resistência do feixe da fibra (através de HVI) é expressa em unidades de (cN4ex). Ela é a resistência específica do feixe de fibra onde a- Fiber beam strength (via HVI) is expressed in units of (cN4ex). It is the specific strength of the fiber bundle where the
finura da fibra individual (tex) é calculada a partir do valor de "micronaire".Individual fiber fineness (tex) is calculated from the "micronaire" value.
- Finura da fibra (através de AFIS) é expressa como (mTex). Ela representa o peso, em miligramas, de um quilômetro da fibra. Mil metros de fibras com uma massa de 1 miligrama é igual a 1 millitex.- Fiber fineness (through AFIS) is expressed as (mTex). It represents the weight in milligrams of one kilometer of fiber. One thousand meters of fiber with a mass of 1 milligram is equal to 1 millitex.
- A razão de maturidade da fibra (através de AFIS) é uma ex-- The fiber maturity ratio (through AFIS) is an ex-
pressão do grau de espessamento de parede celular (dependendo da depo- sição de celulose na parede celular secundária). É a razão de fibras com uma razão de circularidade de 0,5 (ou mais) dividida pela quantidade de fi- bras com 0,25 (ou menos) de circularidade. (Fibras com paredes mais es- pessas são menos propensas a colapso e permanecem mais circularespressure of the degree of cell wall thickening (depending on the cellulose deposition in the secondary cell wall). It is the ratio of fibers with a circularity ratio of 0.5 (or more) divided by the amount of fibers with 0.25 (or less) of circularity. (Thicker wall fibers are less prone to collapse and remain more circular
quando da secagem). Quanto maior a razão da maturidade, mais maduras as fibras são e melhor as fibras são para tingimento.when drying). The higher the maturity ratio, the more mature the fibers are and the better the fibers are for dyeing.
- O teor de fibra imatura ("% IFC", através de AFIS) é a porcen- tagem de fibras com maturidade de menos do que 0,25. Quando menor a % de IFC, mais adequada é a fibra para tingimento.- Immature fiber content ("% IFC" through AFIS) is the percentage of fibers with a maturity of less than 0.25. The lower the% of IFC, the more suitable the fiber for dyeing.
- Várias unidades diferentes são usadas como indicadores de- Several different units are used as indicators of
comprimento de fibra, mostra valores para três dessas conforme agora des- crito. Média superior ("UHM", através de HIVI) é o comprimento médio da metade mais longa das fibras (base em peso).fiber length, shows values for three of these as now described. Upper mean ("UHM" through HIVI) is the average length of the longest half of the fibers (weight basis).
- O índice de Uniformidade de fibra ("Ul", através de HVI) ex-- The fiber uniformity index ("Ul" through HVI) expressed
pressa a razão do valor médio (Comprimento Médio) para o Comprimentohurry the ratio of the average value (Average Length) to the Length
Médio Superior. Ele é uma medida do espalhamento de comprimento da fi- bra dentro de uma população; se todas as fibras tiverem o mesmo compri- mento Ul seria igual a 100%. Teor de Fibra Curta ("% SFC1 através de HVI) é a porcentagem de fibras de menos do que [1,27 cm] [1/2]" de comprimento em uma base em peso. HVI é pensado medir o Teor de Fibra Curta confor- me determinado através de genética apenas uma vez que a medição não impõe estresse de quebra de fibra potencial adicional.Upper Middle. It is a measure of fiber length spread within a population; if all fibers have the same length Ul would be 100%. Short Fiber Content ("% SFC1 through HVI) is the percentage of fibers less than [1.27 cm] [1/2]" in length on a weight basis. HVI is thought to measure Short Fiber Content as determined through genetics only since the measurement does not impose additional potential fiber break stress.
Outros indicadores de comprimento de fibra incluem o compri- mento com base em peso ("L(p)" [cm] [pol.], através de AFIS) é o compri- mento médio de fibras calculado em uma base em peso. O comprimento com base em número ("L(n)M [cm] [pol.], através de AFIS) é o comprimento médio de fibras calculado através de número. O comprimento "L5% (n)" [cm] [pol.] (através de AFIS) é o span Iength 5% ou o comprimento estendido em 5% das fibras quando elas são paralelas e aleatoriamente distribuídas. O comprimento "L2,5% (n)" [cm] [pol.] (através de AFIS) é o span Iength 2,5%, ou o comprimento estendido em 2,5% das fibras quando elas são paralelas ou aleatoriamente distribuídas. O "UQL (p)" [pol.] (através de AFIS) é o com- primento quartil superior de fibras em peso, ou o comprimento excedido em 25% das fibras em peso. Finalmente, "SFC (n)" [cm] [pol.] e "SFC (p)"[pol.] (através de AFIS) são a porcentagem de fibras de menos do que 1,27 cm (0,50 polegada) de comprimento com base em número e peso, respectiva- mente. Em contraste com HVI, AFIS bate as fibras antes de tomar essas medições, que tem potencial para causar quebra de fibra. Valores de SFC AFIS são uma boa indicação das características das fibras após processa- mento normal.Other fiber length indicators include weight-based length ("L (p)" [cm] [in] via AFIS) is the average fiber length calculated on a weight basis. Length based on number ("L (n) M [cm] [in] through AFIS) is the average fiber length calculated by number. Length" L5% (n) "[cm] [in .] (through AFIS) is the 5% span Iength or the 5% extended length of the fibers when they are parallel and randomly distributed. "L2.5% (n)" [cm] [in] (across (AFIS) is the 2.5% span Iength, or the 2.5% extended length of the fibers when they are parallel or randomly distributed. "UQL (p)" (in AFIS) is the com - upper quartile length of fiber by weight or the length exceeded by 25% of fiber by weight Finally, "SFC (n)" [cm] [in] and "SFC (p)" [in] (by AFIS) is the percentage of fibers of less than 1.27 cm (0.50 inch) in length based on number and weight, respectively. In contrast to HVI, AFIS beats the fibers before taking these measurements, which has the potential to cause fiber breakdown. AFIS SFC values are a good indication of fiber characteristics after normal processing.
Ficará claro que os métodos e meios descritos aqui para alterar as características de fibras em plantas produtoras de fibra tal como plantas de algodão podem ser combinados com outros métodos para alterar as ca- racterísticas de fibra como conhecido na técnica.It will be clear that the methods and means described herein for altering fiber characteristics in fiber producing plants such as cotton plants may be combined with other methods for altering fiber characteristics as known in the art.
Em uma modalidade da invenção, os métodos e meios do pre- sente pedido são combinados com aqueles descritos no W02005/017157 (incorporado aqui a título de referência) ou W001/17333. É esperado que a expressão combinada dos genes vá resultar em um efeito sinérgico sobre o aumento do comprimento e/ou resistência da fibra. Os métodos e meios do presente pedido podem ser também combinados com outros métodos e meios direcionados à modificação das características da fibra conforme des- crito, por exemplo, no PCT/EP2006/005853 ou no W098/00549.In one embodiment of the invention, the methods and means of the present application are combined with those described in W02005 / 017157 (incorporated herein by reference) or W001 / 17333. The combined expression of the genes is expected to result in a synergistic effect on fiber length and / or strength increase. The methods and means of the present application may also be combined with other methods and means for modifying fiber characteristics as described, for example, in PCT / EP2006 / 005853 or W098 / 00549.
Os genes quiméricos podem ser introduzidos através de trans- formações subsequentes em células de uma planta ou podem ser combina- dos em células de uma planta através de cruzamento entre plantas compre- endendo um gene quimérico cada.Chimeric genes may be introduced through subsequent transformations into cells of a plant or may be combined into cells of a plant by crossing plants comprising one chimeric gene each.
Como o início de síntese de parede celular secundária em fibras, de modo que fibras de algodão se sobrepõem à fase de alongamento de fibra, pode ser ainda vantajoso retardar o início de expressão da isoforma C de Sacarose até um estágio posterior, deste modo prolongando a fase de alongamento de fibra. Este retardo na expressão do gene de SusC pode a- inda ser combinado com uma expressão aumentada da expressão do gene de SusC em um estágio posterior. Um modo de atingir este r-etardo seria "trocar" o(s) gene(s) de SusC endógeno(s) por um gene de SusC cuja ex- pressão é ativada em uma mudança mais tarde do que os genes de SusC endógenos, por exemplo, através da redução da expressão do gene de SusC endógeno, conforme aqui descrito, e introdução da região de codifica- ção de SusC sob o controle de um promotor que é ativado em um estágio desenvolvimental posterior de síntese de parede celular secundária, tal co- mo, por exemplo, o promotor E6 (John, M.E., Keller, G., Proc. Nati Acad. Sei. USA 1996, 93:12768-12773). A região de codificação de SusC quiméri- ca sob controle de um promotor expresso em um estágio posterior em sínte- se de parede celular secundária pode ser introduzida ectopicamente, ou o gene de SusC endógeno pode ser introduzido através de técnicas de re- combinação homóloga, conforme descrito, por exemplo, no PCTVEP2006/003086. Como o gene de SusC das cultivares de Gossypium barbadense, tal como variedades ΡΙΜΑ, parece ser expresso em um estágio mais tarde do que o gene de SusC de variedades G. hirsutum, "troca" do gene de SusC em variedades de G. hirsutum por genes de SusC de cultiva- res de G. barbadense é esperada prolongar a fase de alongamento de fibra e levar a fibras de algodão mais longas. A invenção também compreende os genes quiméricos aqui des- critos, bem como plantas, sementes, tecidos compreendendo esses genes quiméricos e fibras produzidas a partir de tais plantas.As the onset of secondary cell wall synthesis in fibers, so that cotton fibers overlap the fiber elongation phase, it may be further advantageous to delay the onset of sucrose C isoform expression to a later stage, thereby prolonging the production. fiber stretching phase. This delay in SusC gene expression may still be combined with increased expression of SusC gene expression at a later stage. One way to achieve this r-step would be to "exchange" the endogenous SusC gene (s) for a SusC gene whose expression is activated in a change later than the endogenous SusC genes, for example, by reducing expression of the endogenous SusC gene as described herein and introducing the SusC coding region under the control of a promoter that is activated at a later developmental stage of secondary cell wall synthesis, such as as, for example, the E6 promoter (John, ME, Keller, G., Proc. Nati Acad. Sci. USA 1996, 93: 12768-12773). The chimeric SusC coding region under control of a promoter expressed at a later stage in secondary cell wall synthesis may be introduced ectopically, or the endogenous SusC gene may be introduced by homologous recombination techniques, as described, for example, in PCTVEP2006 / 003086. As the SusC gene of Gossypium barbadense cultivars, such as variedades varieties, appears to be expressed at a later stage than the SusC gene of G. hirsutum varieties, "exchange" of the SusC gene in G. hirsutum varieties by SusC genes from G. barbadense growers are expected to prolong the fiber elongation phase and lead to longer cotton fibers. The invention also encompasses the chimeric genes described herein, as well as plants, seeds, tissues comprising such chimeric genes and fibers produced from such plants.
Métodos para transformar plantas são bem conhecidos na técni- ca. Métodos para transformar plantas de algodão são também bem conheci- dos na técnica. Transformação mediada por agrobacterium de algodão foi descrita, por exemplo, na patente US 5.004.863 ou na patente US 6.483.013 e transformação de algodão através de bombardeamento de partícula é rela- tado, por exemplo, no WO 92/15675.Methods for transforming plants are well known in the art. Methods for turning cotton plants are also well known in the art. Agrobacterium-mediated transformation of cotton has been described, for example, in US 5,004,863 or US 6,483,013 and cotton transformation by particle bombardment is reported, for example, in WO 92/15675.
Os genes quiméricos de acordo com a invenção podem ser in- troduzidos em plantas de uma maneira estável ou de uma maneira transien- te usando métodos bem conhecidos na técnica. Os genes quiméricos podem ser introduzidos em plantas ou podem ser gerados dentro da célula de plan- ta conforme descrito, por exemplo, na EP 1339859.Chimeric genes according to the invention may be introduced into plants in a stable or transient manner using methods well known in the art. Chimeric genes may be introduced into plants or may be generated within the plant cell as described, for example, in EP 1339859.
Os genes quiméricos podem ser introduzidos através de trans- formação em plantas de algodão das quais calos embriogênicos podem ser derivados, tal como Coker 312, Coker310, Coker 5Acala SJ-5, GSC25110, FIBERMAX 819 , Siokra 1-3, T25, GSA75, Acala SJ2, Acala SJ4, Acala SJ5, Acala SJ-C1, Acala B1644, Acala B1654-26, Acala B1654-43, Acala B3991, Acala GC356, Acala GC510, Acala GAM1, Acala C1, Acala Royale, Aeala Maxxa, Acala Prema, Acala B638, Acala B1810, Acala B2724, Acala B4894, Acala B5002, não-Acala variedade "picker" Siokra, "stripper" FC2017, Coker 315, STONEVILLE 506, STONEVILLE 825, DP50, DP61, DP90, DP77, DES119, McN235, HBX87, HBX191, HBX107, FC 3027, CHEMBRED A1, CHEMBRED A2, CHEMBRED A3, CHEMBRED A4, CHEMBRED B1, CHEMBRED B2, CHEMBRED B3, CHEMBRED C1, CHEMBRED C2, CHEMBRED C3, CHEMBRED C4, PAYMASTER 145, HS26, HS46, SICA- LA, PIMA S6 ORO BLANCO ΡΙΜΑ, FIBERMAX FM5013, FIBERMAX FM5015, FIBERMAX FM5017, FIBERMAX FM989, FIBERMAX FM832, FI- BERMAX FM966, FIBERMAX FM958, FIBERMAX FM989, FIBERMAX FM958, FIBERMAX FM832, FIBERMAX FM991, FIBERMAX FM819, FI- BERMAX FM800, FIBERMAX FM960, FIBERMAX FM966, FIBERMAX FM981, FIBERMAX FM5035, FIBERMAX FM5044, FIBERMAX FM5045, FIBERMAX FM5013, FIBERMAX FM5015, FIBERMAX FM5017 ou FIBER- MAX FM5024 e plantas com genótipos derivados dos mesmos. "Algodão" conforme aqui usado inclui Gossypium hirsutum ou Gossypium barbadense. "Plantas progenitoras de algodão" incluem Gossypium arbore- tum, Gossypium herbaceum e Gossypium raimondii e Gossypium longicaiyx.Chimeric genes can be introduced by transformation into cotton plants from which embryogenic callus can be derived, such as Coker 312, Coker310, Coker 5Acala SJ-5, GSC25110, FIBERMAX 819, Siokra 1-3, T25, GSA75, Acala SJ2, Acala SJ4, Acala SJ5, Acala SJ-C1, Acala B1644, Acala B1654-26, Acala B1654-43, Acala B3991, Acala GC356, Acala GC510, Acala GAM1, Acala C1, Acala Royale, Aeala Maxxa, Acala Prema , Acala B638, Acala B1810, Acala B2724, Acala B4894, Acala B5002, Non-Acala Siokra picker variety, stripper FC2017, Coker 315, STONEVILLE 506, STONEVILLE 825, DP50, DP61, DP90, DP77, DES119, McN235 , HBX87, HBX191, HBX107, FC 3027, CHEMBRED A1, CHEMBRED A2, CHEMBRED A3, CHEMBRED A4, CHEMBRED B1, CHEMBRED B2, CHEMBRED B3, CHEMBRED C2, CHEMBRED C3, CHEMBRED C4, PAYMASTER 145, HS, HS SICA- PIMA S6 GOLD BLANK ΡΙΜΑ, FIBERMAX FM5013, FIBERMAX FM5015, FIBERMAX FM5017, FIBERMAX FM989, FIBERMAX FM832, FIBERMAX FM966, FIBERMAX FM958, FIBERMAX FM989, FIBERMAX FM958, FIBERMAX FM832, FIBERMAX FM991, FIBERMAX FM819, FIBERMAX FM960, FIBERMAX FM960, FIBERMAX FM981, FIBERMAX FM5035, FIBERMAX FM175, FIBERMAX FM935 FM5024 and plants with genotypes derived from them. "Cotton" as used herein includes Gossypium hirsutum or Gossypium barbadense. "Cotton progenitor plants" include Gossypium arboreum, Gossypium herbaceum and Gossypium raimondii and Gossypium longicaiyx.
Não obstante, os métodos e meios da presente invenção podem ser também empregados para outras espécies de planta tal como cânhamo, juta, Iinho e plantas lenhosas, incluindo, mas não limitado a, Pinus spp., Po- pulus spp., Picea spp., Eucalyptus spp., etc.However, the methods and means of the present invention may also be employed for other plant species such as hemp, jute, lignin and woody plants, including, but not limited to, Pinus spp., Pulpus spp., Picea spp. , Eucalyptus spp., Etc.
A planta transformada obtida pode ser usada em um esquema de geração convencional para produzir plantas mais transformadas com as mesmas características ou para introduzir o gene quimérico de acordo com a invenção em outras variedades da mesma espécie de planta ou relacionada, ou em plantas híbridas. Sementes obtidas das plantas transformadas con- têm os genes quiméricos da invenção como um inserto genômico estável e são também compreendidas pela invenção.The obtained transformed plant can be used in a conventional generation scheme to produce more transformed plants with the same characteristics or to introduce the chimeric gene according to the invention into other varieties of the same or related plant species, or in hybrid plants. Seeds obtained from transformed plants contain the chimeric genes of the invention as a stable genomic insert and are also encompassed by the invention.
Em outra modalidade, o método é provido para identificação de variações alélicas das proteínas envolvidas em características de fibra em uma população de genótipos ou variedades diferentes de uma espécie de planta particular, de preferência uma espécie de planta produtora de fibra, que estão correlacionadas ou sozinhas ou em combinação com a quantida- de e/ou qualidade de produção de fibra. Este método inclui as etapas que seguem:In another embodiment, the method is provided for identifying allelic variations of proteins involved in fiber characteristics in a different genotype or variety population of a particular plant species, preferably a fiber producing plant species, which are correlated or alone. or in combination with the quantity and / or quality of fiber production. This method includes the following steps:
a) provisão de uma população de variedades ou genótipos diferen-(a) provision of a population of different varieties or genotypes
tes de uma espécie de planta particular ou espécies de planta de intercru- zamento compreendendo formas alélicas diferentes das seqüências de nu- cleotídeo codificando Sacarose sintase isoforma C, tal como seqüências de nucleotídeo codificando SEQ ID Nq 1, 4, 6, 8, 10, 12 ou 14. As formas aléli- cas diferentes podem ser identificadas usando os métodos descritos em ou- tro local neste pedido. De preferência, uma população de segregação é pro- vida, onde combinações diferentes das variações alélicas das proteínas SusC estão presentes. Métodos para produção de populações de segrega- ção são bem conhecidos na técnica de geração de planta.of a particular plant species or interbreeding plant species comprising different allelic forms of the nucleotide sequences encoding sucrose synthase isoform C, such as nucleotide sequences encoding SEQ ID NO: 1, 4, 6, 8, 10, 12 or 14. Different allelic forms can be identified using the methods described elsewhere in this application. Preferably, a segregating population is pro- vided, where different combinations of allelic variations of SusC proteins are present. Methods for producing segregation populations are well known in the plant generation technique.
b) determinação de parâmetros relacionados com características de fibra para cada indivíduo da população;b) determination of parameters related to fiber characteristics for each individual of the population;
c) determinação da presença de uma forma alélica particular das seqüências de nucleotídeo codificando SusC para cada indivíduo da popula- ção; ec) determining the presence of a particular allelic form of the SusC-encoding nucleotide sequences for each individual in the population; and
d) correlação da ocorrência de característica de fibra particular com a presença de uma forma alélica particular da seqüência de nucleotídeo a- cima mencionada ou uma combinação particular de tais formas alélicas.d) correlating the occurrence of particular fiber trait with the presence of a particular allelic form of the aforementioned nucleotide sequence or a particular combination of such allelic forms.
A informação resultante pode ser usada para acelerar varieda- des de programa de geração com características de fibra e resistência à se- ca particulares, através da determinação da presença ou ausência de formas alélicas, usando técnicas de biologia molecular convencionais.The resulting information can be used to accelerate generation program varieties with particular fiber characteristics and dryness resistance by determining the presence or absence of allelic forms using conventional molecular biology techniques.
Formas alélicas do gene de SusC associadas com característi- cas de fibra particulares podem ser também identificadas, isoladas e introdu- zidas em plantas, tal como plantas de algodão, com o que a expressão da SusC endógena foi reduzida ou eliminada. Tal redução de expressão dos genes de SusC endógenos pode ser convenientemente conseguida através de silenciamento pós-transcripcional ou transcripcional conforme aqui descri- to, ou pode ser conseguida através de inativação, tal como através de dele- ção, dos genes de SusC endógenos. Introdução das formas alélicas pode ser conseguida através de técnicas de geração ou através de transformação com os genes isolados.Allelic forms of the SusC gene associated with particular fiber characteristics can also be identified, isolated and introduced into plants, such as cotton plants, whereby the expression of endogenous SusC has been reduced or eliminated. Such reduction of expression of endogenous SusC genes may conveniently be accomplished by post-transcriptional or transcriptional silencing as described herein, or may be achieved by inactivating, as well as deleting, endogenous SusC genes. Introduction of allelic forms can be accomplished through generation techniques or through transformation with the isolated genes.
Em outra modalidade da invenção, anticorpos criados contra o novo tipo de genes de Sacarose sintase são providos, particularmente anti- corpos reconhecendo as proteínas SusC tendo as seqüências de aminoáci- do de SEQ ID Ng: 2, 3, 5, 7, 9, 11 ou 13.In another embodiment of the invention, antibodies raised against the novel type of sucrose synthase genes are provided, particularly antibodies recognizing SusC proteins having the amino acid sequences of SEQ ID Ng: 2, 3, 5, 7, 9, 11 or 13.
Conforme aqui usado, "compreendendo" deve ser interpretado como especificando a presença das características, inteiros, etapas ou com- ponentes declarados conforme referido, mas não impede a presença ou adi- ção de uma ou mais das características, inteiros, etapas ou componentes, ou grupos dos mesmos. Então, por exemplo, um ácido nucléico ou proteína compreendendo uma seqüência de nucleotídeos ou aminoácidos pode com- preender mais nucleotídeos ou aminoácidos do que aqueles realmente cita- dos, isto é, ser embebidos em um ácido nucléico ou proteína maior. Um ge- ne quimérico compreendendo uma região de DNA1 que é funcionalmente ou estruturalmente definida, pode compreender regiões de DNA adicionais, etc.As used herein, "comprising" shall be construed as specifying the presence of the declared features, integers, steps or components as referred to, but does not preclude the presence or addition of one or more of the features, integers, steps or components, or groups thereof. So, for example, a nucleic acid or protein comprising a nucleotide or amino acid sequence may comprise more nucleotides or amino acids than those actually quoted, that is, being embedded in a larger nucleic acid or protein. A chimeric gene comprising a DNA1 region that is functionally or structurally defined may comprise additional DNA regions, etc.
Os exemplos não-limitantes que seguem descrevem a identifica- ção de um novo tipo de sacarose sintase envolvida em síntese de parede de célula de planta secundária e genes quiméricos para alteração das caracte- rísticas de fibra em plantas produtoras de fibra tal como algodão e usos dos mesmos. A menos que de outro modo declarado nos Exemplos, todas as técnicas de DNA recombinante são realizadas de acordo com protocolos padrão conforme descrito em Sambrook e outros (1989) Molecular Cloning: A Laboratory Manual, Segunda Edição, Cold Spring Harbor Laboratory Press, NY e nos Volumes 1 e 2 de Ausubel e outros (1994) Current Proto- cols in Molecular Biology, Current Protocols, USA. Materiais e métodos pa- drão para trabalho molecular de planta são descritos em Plant Molecular Bio- logy Labfax (1993) de R.D.D. Croy, publicado em conjunto pela BIOS Scien- tific Publication Ltd. (RU) e Blackwell Scientific Publications, RU.The following non-limiting examples describe the identification of a new type of sucrose synthase involved in secondary plant cell wall synthesis and chimeric genes for altering fiber characteristics in fiber producing plants such as cotton and uses. of the same. Unless otherwise stated in the Examples, all recombinant DNA techniques are performed according to standard protocols as described in Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, NY and in Volumes 1 and 2 of Ausubel et al. (1994) Current Protocols in Molecular Biology, Current Protocols, USA. Standard materials and methods for plant molecular work are described in Plant Molecular Bio Laby Labfax (1993) by R.D.D. Croy, jointly published by BIOS Scientific Publication Ltd. (UK) and Blackwell Scientific Publications, UK.
Na descrição e nos exemplos, referência é feita às seqüências que seguem representadas na listagem de seqüência: SEQ ID N-:1: Seqüência de nucleotídeo de cDNA codificando sacarose sin- tase tipo C de algodão.In the description and examples, reference is made to the following sequences represented in the sequence listing: SEQ ID N-: 1: CDNA nucleotide sequence encoding sucrose cotton type C synthase.
SEQ ID NQ:2: Seqüência de aminoácido de cDNA codificando sacarose sin- tase tipo C de algodão.SEQ ID NO: 2: Amino acid sequence of cDNA encoding sucrose cotton type C synthase.
SEQ ID NQ:3: Seqüência de aminoácido de SusC de uma cultivar de Gossy- pium hirsutum.SEQ ID NO: 3: SusC amino acid sequence of a Gossypium hirsutum cultivar.
SEQ ID NQ:4: Seqüência de nucleotídeo de DNA genômico de SusC de culti- var de Gossypium hirsutum.SEQ ID NQ: 4: SusC genomic DNA nucleotide sequence from Gossypium hirsutum cultivar.
SEQ ID N9:5: Seqüência de aminoácido de SusC de cultivar de Gossypium barbadense.SEQ ID N9: 5: SusC amino acid sequence of Gossypium barbadense cultivar.
SEQ ID Ns:6: Seqüência de nucleotídeo de DNA genômico de SusC de culti- var de Gossypium barbadense codificando a proteína SUSc de SEQ ID N9:7. SEQ ID N-:7: Seqüência de aminoácido de SusC de Gossypium arboreum. SEQ ID Ng:8: Seqüência de nucleotídeo de DNA genômico de SusC de Gossypium arboretum.SEQ ID Ns: 6: SusC genomic DNA nucleotide sequence of Gossypium barbadense cultivar encoding the SUSc protein of SEQ ID N9: 7. SEQ ID NO: 7: SusC amino acid sequence of Gossypium arboreum. SEQ ID Ng: 8: Gossypium arboretum SusC genomic DNA nucleotide sequence.
SEQ ID NQ:9: Seqüência de aminoácido de SusC de Gossypium raimondii. SEQ ID NQ:10: Seqüência de nucleotídeo de DNA genômico de SusC de Gossypium raimondii.SEQ ID NO: 9: Gossypium raimondii SusC amino acid sequence. SEQ ID NO: 10: Gossypium raimondii SusC genomic DNA nucleotide sequence.
SEQ ID NQ:11: Seqüência de aminoácido de SusC de Gossypium Iongicalyx subtipo 1SEQ ID NQ: 11: SusC amino acid sequence of Gossypium Iongicalyx subtype 1
SEQ ID NQ: 12: Seqüência de nucleotídeo de DNA genômico de SusC de Gossypium Iongicalyx subtipo 1SEQ ID NQ: 12: Gossypium Iongicalyx subtype 1 SusC genomic DNA nucleotide sequence
SEQ ID NQ: 13: Seqüência de aminoácido de SusC de Gossypium Iongicalyx subtipo 2SEQ ID NQ: 13: Gossypium Iongicalyx Subtype 2 SusC Amino Acid Sequence
SEQ ID N-: 14: Seqüência de nucleotídeo de DNA genômico de SusC de Gossypium Iongicalyx subtipo 2SEQ ID N-: 14: Gossypium Iongicalyx subtype 2 SusC genomic DNA nucleotide sequence
SEQ ID NQ:15: Seqüência de aminoácido da seqüência consenso C-terminal de SusCSEQ ID NQ: 15: Amino acid sequence of SusC C-terminal consensus sequence
SEQ ID Ns:16: Seqüência de aminoácido da seqüência consenso N-terminal de SusCSEQ ID Ns: 16: Amino Acid Sequence of SusC N-terminal Consensus Sequence
SEQ ID NQ:17: Seqüência de oligonucleotídeo usada como iniciador especí- fico de SUS C (5'UTR)SEQ ID NO: 17: Oligonucleotide sequence used as SUS C-specific primer (5'UTR)
SEQ ID NQ:18: Seqüência de oligonucleotídeo usada como iniciador especí- fico de SUS C (3'UTR). ExemplosSEQ ID NO: 18: Oligonucleotide sequence used as SUS C-specific primer (3'UTR). Examples
Exemplo 1. Identificação de sacarose sintases do tipo C em espécie Gossy- piumExample 1. Identification of type C sucrose synthases in Gossypium species
Identificação de SusC. Foi mostrado que existem pelo menos 3 classes de genes de Sus expressos em fibra de algodão durante o desen- volvimento, chamados tipos A, B e C. Os tipos A/D são homólogos à se- qüência previamente relatada (Número de acesso U73588) e o tipo B mostra similaridade de seqüência forte com a seqüência publicada. As seqüências tipo C providas neste documento são novas e mostram expressão diferencial entre fase de alongamento e de parede celular secundária. O tipo C parece ser a isoforma que aparece em estágios posteriores em desenvolvimento de fibra. SusC é apenas 76% idêntica às proteínas dos tipos A e B no nível de aminoácido. Usando RTPCR e análise Northern foi mostrado que o perfil de expressão desenvolvimental do transcrito tipo C precede síntese de parede celular secundária mas mostra um padrão mais comensurável com um papel nesta fase de desenvolvimento de fibra enquanto os tipos AeB são expres- sos muito cedo em alongamento de fibra. O gene do tipo C é também ex- presso fortemente em tecido de caule. cDNA foi isolado de tecido de fibra em estágios diferentes deSusC Identification. It has been shown that there are at least 3 classes of Sus genes expressed on cotton fiber during development, called types A, B and C. Types A / D are homologous to the previously reported sequence (Accession number U73588). and type B shows strong sequence similarity to the published sequence. The type C sequences provided in this document are novel and show differential expression between stretching phase and secondary cell wall. Type C appears to be the isoform that appears at later stages in fiber development. SusC is only 76% identical to type A and B proteins at the amino acid level. Using RTPCR and Northern analysis it has been shown that the developmental expression profile of type C transcript precedes secondary cell wall synthesis but shows a more commensurate pattern with a role in this phase of fiber development while AeB types are expressed very early in elongation. Fiber The type C gene is also strongly expressed in stem tissue. cDNA was isolated from fiber tissue at different stages of
seu desenvolvimento: 5, 10, 15, 20, 40 dias pós-antese (dpa). Bibliotecas foram feitas a partir desses cDNAs e DNA foi isolado. Este DNA foi então usado para caracterizar a expresso de susyC usando 9 ng de cada DNA de biblioteca como material de partida para uma reação de PCR usando como iniciadores o iniciador específico do tipo SusC de SEQ ID Nos.: 17 e 18. O tamanho esperado do fragmento amplificado é 2550 pb para o cDNA e 3353 pb para o DNA genômico. Controles apropriados foram ativados para validar o caráter semiquantitativo deste ensaio. A Figura 2 mostra os resultados, e permite também concluir que mRNA de SusC é relativamente mais abundan- te a 20-40 dpa.its development: 5, 10, 15, 20, 40 days after anthesis (dpa). Libraries were made from these cDNAs and DNA was isolated. This DNA was then used to characterize susyC expression using 9 ng of each library DNA as the starting material for a PCR reaction using the SusC-type specific primer of SEQ ID Nos: 17 and 18. of the amplified fragment is 2550 bp for cDNA and 3353 bp for genomic DNA. Appropriate controls were activated to validate the semiquantitative character of this assay. Figure 2 shows the results, and also allows us to conclude that SusC mRNA is relatively more abundant at 20-40 dpa.
Fracionamento de proteína. Extratos de proteína de fibras em estágio de alongamento (8-11 DAF) e estágio de síntese de parede de célula secundária (15-25 DAF) foram fracionados para prover frações solúvel (cito- sólica) e microssomal (membrana de plasma, citoesqueleto e tonoplasto). A fração microssomal foi purificada mais para enriquecer proteínas de mem- brana de plasma. Esta fração PM não tem mais nenhuma contaminação ci- toesqueletal ou de parede de célula, conforme julgado através de western blotting contra actina. Também incluída foi também outra fração celular com- preendendo proteínas apoplásticas de fibras intactas obtidas através de Ia- vagem suave com tampão isotônico incluindo inibidores de protease e EG- TA. Essas preparações de proteína foram ativadas em SDS-PAGE e as pro- teínas Sus identificadas através de western blotting. Os fragmentos de gel foram excisados e submetidos à digestão tríptica seguido por espectrometria de massa em tandem para identificar a forma de Sus presente. Exemplos de peptídeos de assinatura usados para identificar a isoforma de SusC são mostrados nas Tabelas 1A e 1B. O peso molecular previsto de SusC é 90,3 kDa conforme oposto a 92,4 para as proteínas tipos A/D e B permitindo se- paração da forma C das isoformas A/B em SDS-PAGE. Esta separação foi confirmada através de análise dessas 2 faixas através de MS-MS de peptí- deo (a faixa maior contém peptídeos de assinatura de A e B enquanto a fai- xa menor sozinha tem peptídeos tipo C únicos).Protein fractionation. Fiber protein extracts at elongation stage (8-11 DAF) and secondary cell wall synthesis stage (15-25 DAF) were fractionated to provide soluble (cytosolid) and microsomal (plasma membrane, cytoskeleton and tonoplast). The microsomal fraction was further purified to enrich plasma membrane proteins. This PM fraction no longer has any cytoskeletal or cell wall contamination as judged by western blotting against actin. Also included was another cell fraction comprising intact fiber apoplastic proteins obtained by gentle washing with isotonic buffer including protease inhibitors and EG-TA. These protein preparations were activated on SDS-PAGE and Sus proteins identified by western blotting. The gel fragments were excised and subjected to tryptic digestion followed by tandem mass spectrometry to identify the present Sus form. Examples of signature peptides used to identify the SusC isoform are shown in Tables 1A and 1B. The predicted molecular weight of SusC is 90.3 kDa as opposed to 92.4 for type A / D and B proteins allowing for separation of form C from S / PAGE isoforms. This separation was confirmed by analysis of these 2 bands by peptide MS-MS (the larger band contains A and B signature peptides while the smaller band alone has unique type C peptides).
Tabela 1A: Peptídeos específicos de SusC detectados em apoplastoTable 1A: SusC-Specific Peptides Detected in Apoplast
apoplasto 8d apoplasto 17d apoplasto 25d Nenhum tipo C detectado 9.20e+005 (R)KAEEYLTPLSSDTPYSV FEK() 1.45e+006 (R)IQDVNSLQHALR() 8.17e+005 2.37e006 (K)LLTLTG VYG FSK(H) (R)KAEEYLTPLS SDTPYSVFEK() 1.32e+006 2.16e006 (K)YTWQIYSEK() (K)NLTGLVEFYAK(N) 6.90e+005 (R)STQEAVVSSPLVALAI R() 7.45e+006 (K)LQG LDITPR(I) 3.64e+006 (K)YTWQIYSEK() 5.53e+006 (R)VVDGIDVFDPK(F) 3.34e+006 (K) LLTLTGVYGFSK(H) 4.17e+005 (K)D TLGQYESHIAF- TLPGLYR(V) 3.50e+005 (K)LFVEEMPVAEYLR(L) Tabela 1B: Peptídeos específicos de SusC detectados em fração microsso- malapoplast 8d apoplast 17d apoplast 25d No type C detected 9.20e + 005 (R) KAEEYLTPLSSDTPYSV FEK () 1.45e + 006 (R) IQDVNSLQHALR () 8.17e + 005 2.37e006 (K) LLTLTG VYG FSK (H) RLS SDTPYSVFEK () 1.32e + 006 2.16e006 (K) YTWQIYSEK () (K) NLTGLVEFYAK (N) 6.90e + 005 (R) STQEAVVSSPLVALAI R () 7.45e + 006 (K) LQG LDITPR (I) 3.64e + 006 ( K) YTWQIYSEK () 5.53e + 006 (R) VVDGIDVFDPK (F) 3.34e + 006 (K) LLTLTGVYGFSK (H) 4.17e + 005 (K) D TLGQYESHIAF-TLPGLYR (V) 3.50e + 005 (K) LFVE L) Table 1B: SusC specific peptides detected in microsomal fraction
micros- somal 8d micros- somal 13d microssomal 20d microssomal 25d Nenhum tipo C de- tectado Nenhum tipo C de- tectado 6.06E+006 (R)KAEEYLTPLSSDTPY SVFEK() 3.94E+006 (R) KAEEYLTPLS SDTPYSVFEK() 7.64E+006 (K)LLTLTGVYGF SK(H) 1.13E+006 (K)LFVEE MPVAE- YLR(L) 2.31 E+006 (R)IQ DVNSLQHALR() 3.89E+006 (R)VVDGIDVFDPK(F ) 6.70E+006 (R)VVDGIDVFDPK(F 3.68E+006 (K)LLTLTGVYGF SK(H) 1.82E+006 (R)YIEmLHAWmYNNR( V) 3.97E+005 (K)SGFNIDPYNGDL AAETLANFFEK(C) 3.19E+006 (K)YTWQIYSEK() 2.63E+006 (K)NLTGLV EF- YAK(N) 1.99E+006 (R)STQEA VVSSPLVAL AIR() 1.26E+006 (R)STQEAVV SSPL- VALAI R( 1.05E+006 (R)LGESLATHPQQAK(S ) 2.05E+006 (K)NLTGLVEFYAK(N ) 5.72E+006 (K)LFVEE m PVAE- YLR(L) 3.01 E+006 (K)D TLGQYESHIA FTLPGLYR(V) 8.25E+005 (K)DTLGQYESHIAFTLP GLYR(V) 2.18E+006 (R)IQ DVNSL- QHALR() 5.25E+006 (K)LQG LDITPR(I) 8.02E+005 (R)YIEmLH AWmYNNR(V) 3.28E+006 (K)NLTGLVEFYAK(N) 3.60E+005 (K)TKYPDSDIN WK(Q 9.91E+005 (K)SIGNGmDFLNR(H ) 5.35E+005 (R)L GESLATHPQQ AK(S) 5.59E+005 (R)ALEEELLHR(F) 2.77E+006 (K)LFVEE mPVAE- YLR(L)microsome 8d microsome 13d microsome 20d microsome 25d No type C detected No type C detected 6.06E + 006 (R) KAEEYLTPLSSDTPY SVFEK () 3.94E + 006 (R) KAEEYLTPLS SDTPYSVFEK () 7.64E + 006 (K) LLTLTGVYGF SK (H) 1.13E + 006 (K) LFVEE MPVAE-YLR (L) 2.31 E + 006 (R) IQ DVNSLQHALR () 3.89E + 006 (R) VVDGIDVFDPK (F) 6.70E + 006 (R) ) VVDGIDVFDPK (F 3.68E + 006 (K) LLTLTGVYGF SK (H) 1.82E + 006 (R) YIEmLHAWmYNNR (V) 3.97E + 005 (K) SGFNIDPYNGDL AAETLANFFEK (C) 3.19E + 006 (K) YT E + 006 (K) NLTGLV EF-YAK (N) 1.99E + 006 (R) STQEA VVSSPLVAL AIR () 1.26E + 006 (R) STQEAVV SSPL-VALA R (1.05E + 006 (R) LGESLATHPQQAK (S) 2.05 E + 006 (K) NLTGLVEFYAK (N) 5.72E + 006 (K) LFVEE m PVAE-YLR (L) 3.01 E + 006 (K) D TLGQYESHIA FTLPGLYR (V) 8.25E + 005 (K) DTLGQYESHIAFTLP GLYR (V) 2.18E + 006 (R) IQ DVNSL-QHALR () 5.25E + 006 (K) LQG LDITPR (I) 8.02E + 005 (R) YIEmLH AWmYNNR (V) 3.28E + 006 (K) NLTGLVEFYAK (N) 3.60E +005 (K) TKYPDSDIN WK (Q 9.91E + 005 (K) SIGNGmDFLNR (H) 5.35E + 005 (R) L GESLATHPQQ AK ( S) 5.59E + 005 (R) ALEEELLHR (F) 2.77E + 006 (K) LFVEE mPVAE-YLR (L)
A Tabela 2 apresenta um sumário dos dados de espec. de mas- sa e SDSPAGE/Western. De nota particular é a pouca abundância de proteí- na Sus-C de frações solúvel, microssomal e PM. Isto parece paradoxal con- siderando os níveis altos de transcrito presentes nos estágios finais do de- senvolvimento de fibra. Lavagens apoplásticas no estágio final de desenvol- vimento descrevem que a proteína tipo C está presente em níveis altos (com relação a A) em western blots, o inverso das observações para as frações citosólica, microssomal e PM. Sus-C é altamente abundante em 25 DAF e não podia ser detectada em 8 DAF através de MS-MS de peptídeo. Em con- traste, Sus-A possivelmente Sus-B são ubiquitárias durante o desenvolvi- mento (em todas as frações). Sus-A é a principal, e possivelmente a única isoforma em frações PM e ativa como uma proteína de 85 kDa em SDS-Table 2 presents a summary of the spec. and SDSPAGE / Western. Of particular note is the low abundance of Sus-C protein from soluble, microsomal and PM fractions. This seems paradoxical considering the high levels of transcript present in the final stages of fiber development. End-developmental apoplastic washes describe that type C protein is present at high levels (relative to A) in western blots, the inverse of observations for cytosolic, microsomal, and PM fractions. Sus-C is highly abundant at 25 DAF and could not be detected at 8 DAF by peptide MS-MS. In contrast, Sus-A possibly Sus-B are ubiquitous during development (in all fractions). Sus-A is the major, and possibly the only isoform in PM fractions and active as an 85 kDa protein in SDS-
PAGE.PAGE.
Tabela 2. Sumário de dados de espec. de massa e SDSPAGBWesternTable 2. Summary of spec data Mass and SDSPAGBWestern
microssomal 8 dpa (faixa su- perior) Susy A+B microssomal 8 dpa (faixa infe- rior) Susy A ou A+B microssomal 15 dpa Susy A+C ou A+B+C membrana de plasma 8 dpa Susy A ou A+B membrana de plasma 15 dpa Susy A or A+B solúvel 8 dpa Susy A ou A+B solúvel 15 dpa Susy A+B apoplasto 8 dpa (faixa superi- or) Susy A+B (quantidade muito baixa) apoplasto 8 dpa (faixa inferior) Susy A+B (faixa predominante) apoplasto 25 dpa (faixa superi- or) Susy A+B+C *(quantidade muito baixa) apoplasto 25 dpa (faixa inferi- or) Susy C (faixa predominante)microsomal 8 dpa (upper band) Susy A + B microsomal 8 dpa (lower band) Susy A or A + B microsomal 15 dpa Susy A + C or A + B + C plasma membrane 8 dpa Susy A or A + B plasma membrane 15 dpa Susy A or A + B soluble 8 dpa Susy A or A soluble B 15 dpa Susy A + B apoplast 8 dpa (upper range) Susy A + B (very low amount) 8 dpa (lower range) Susy A + B (predominant range) apoplast 25 dpa (upper range) Susy A + B + C * (very low amount) 25 dpa apoplast (lower range) Susy C (predominant range)
A presença de C nesta amostra é mais provavelmente devido à contamina- ção de faixa inferior grande que corresponde à SusC.The presence of C in this sample is most likely due to the large lower band contamination that corresponds to SusC.
O número de peptídeos se compara com Sus-C na fração de apoplasto a 25 DAF e o sinal de western indica que esta isoforma é abun- dante, e apenas predominantemente presente no apoplasto a 25 DAF. Inte- ressantemente, SusA e B estão presentes como as formas dominantes no apoplasto em fibra de 8 DAF. Esses resultados implicam em um papel prin- cipal para Sus na parede de célula de fibra de algodão e a correlação de expressão de Sus-C no nível de RNA com formação de parede secundária e a abundância da proteína nas paredes a 25 DAF sugerem que Sus-C é a isoforma de parede de célula principal no estágio de síntese de parede se- cundária. Trabalho com imuno-ouro EM mostra claramente sinal de Sus na parede de célula secundária externa. Isto está inteiramente de acordo com a presença de proteína Sus-C no apoplasto do estágio de parede celular. Exemplo 2. Análise da isoforma SusC e identificação de diferenças entre as isoformas diferentes de sacarose sintase em algodão e espécies relaciona- dasThe number of peptides compares with Sus-C in the apoplast fraction at 25 DAF and the western signal indicates that this isoform is abundant, and only predominantly present in the 25 DAF apoplast. Interestingly, SusA and B are present as the dominant forms in the 8 DAF fiber apoplast. These results imply a major role for Sus in the cotton fiber cell wall and the correlation of Sus-C expression at the RNA level with secondary wall formation and protein abundance at 25 DAF suggest that Sus -C is the main cell wall isoform in the secondary wall synthesis stage. Immuno-gold EM work clearly shows Sus signal on the external secondary cell wall. This is entirely in accordance with the presence of Sus-C protein in the cell wall stage apoplast. Example 2. SusC isoform analysis and identification of differences between different sucrose synthase isoforms in cotton and related species
Sítios de fosforilação. Um sítio de fosforilação de serina situado na extremidade N da proteína foi bem caracterizado na literatura, tendo RXXS como uma seqüência consenso. A fosforilação deste aminoácido seri- na específico em Zea mays (com as posições que seguem: 12RXXS15) pare- ce se relacionar com a liberação da enzima susy da membrana de plasma (Winter, H, e outros, 1997 - FEBS Letters 420, 151-155; Hardin, S.C. e ou- tros, 2004, Plant Physiology 134, 1427-1438). Mutação deste mesmo amino- ácido serina na posição 8RXXS11 faz com que a sacarose sintase de mung bean seja mais eficiente em uso de sacarose para fazer UDPgIicose (= Km maior) (Nakai T. e outros, 1999, Proc. Natl. Acad. Sci., USA 96, 14-18). Fos- forilação neste sítio específico foi também caracterizada em soja e parece se relacionar com a perda em hidrofobicidade e então putativamente com o en- fraquecimento da ligação com a membrana de plasma (Zhang, X.Q. e ou- tros, 1999, Archives of Biochemistry and Biophysics 371, 70-82). Este sítio de fosforilação foi também caracterizado em outras espécies de planta: (sy- camoré) (Pozueta-Romero, J. e outros, 2004 - Physoiogia Piantarum 122, p. 275), pêra Japonesa (Tanase, K. e outros, 2002-Physoiogia Piantarum 114, p. 21) e Gossypium hirsutum (algodão upiand) (Haigler, C.H. e outros - 2001 - Piant Molecular Biology 47, 29-51). Conclusões desses estudos indicam que a fosforilação tem uma influência sobre o valor Km da enzima.Phosphorylation sites. A serine phosphorylation site situated at the N-end of the protein has been well characterized in the literature, with RXXS as a consensus sequence. Phosphorylation of this serine specific amino acid in Zea mays (with the following positions: 12RXXS15) appears to be related to the release of the susy enzyme from the plasma membrane (Winter, H, et al., 1997 - FEBS Letters 420, 151 Hardin, SC et al., 2004, Plant Physiology 134, 1427-1438). Mutation of this same serine amino acid at position 8RXXS11 makes mung bean sucrose synthase more efficient in using sucrose to make UDPglycosis (= Km higher) (Nakai T. et al., 1999, Proc. Natl. Acad. Sci ., USA 96, 14-18). Phosphorylation at this specific site has also been characterized in soybean and appears to be related to the loss in hydrophobicity and then putatively to the weakening of plasma membrane binding (Zhang, XQ et al., 1999, Archives of Biochemistry and Biophysics 371, 70-82). This site of phosphorylation has also been characterized in other plant species: (sycamore) (Pozueta-Romero, J. et al., 2004 - Physoiogia Piantarum 122, p. 275), Japanese pear (Tanase, K. et al., 2002). -Physoiogia Piantarum 114, p. 21) and Gossypium hirsutum (upiand cotton) (Haigler, CH et al. - 2001 - Piant Molecular Biology 47, 29-51). Conclusions from these studies indicate that phosphorylation has an influence on the Km value of the enzyme.
Uma seqüência consenso similar pode ser identificada em todas as proteínas sacarose sintase G isoladas com as posições de aminoácido que seguem: 27RXXS30. Este sítio de fosforilação é então situado mais para dentro da proteína. Uma assinatura similar pode ser encontrada em outros genes de sacarose sintase de planta conhecidos: L19762, AY205302, AY205085, AJ537575, M18745, U2487, X75332, Y76091, X82504, AJ131999. A maioria desses genes tem ambos sítios de fosforilação 8RXXS11 e 27RXXS30. Proteínas L19762 e AJ131999 apenas têm o último.A similar consensus sequence can be identified in all sucrose synthase G proteins isolated with the following amino acid positions: 27RXXS30. This phosphorylation site is then situated further into the protein. A similar signature can be found on other known plant sucrose synthase genes: L19762, AY205302, AY205085, AJ537575, M18745, U2487, X75332, Y76091, X82504, AJ131999. Most of these genes have both 8RXXS11 and 27RXXS30 phosphorylation sites. L19762 and AJ131999 proteins only have the latter.
Outro sítio de fosforilação de serina foi caracterizado na literatu- ra: Ser170 de susl (milho). Este sítio foi correlacionado com marcação de proteína susy para proteólise (Hardin, S.C. e outros, 2003, The Plant Jour- nal, 35, 588-603). Este aminoácido Ser está também presente em SusC. Apenas duas sacarose sintases não contêm este resíduo Serina: Atsus5 e AB018561.Another serine phosphorylation site was characterized in the literature: Ser170 de susl (maize). This site was correlated with susy protein labeling for proteolysis (Hardin, S.C. et al., 2003, The Plant Journal, 35, 588-603). This amino acid Ser is also present in SusC. Only two sucrose synthases do not contain this Serine residue: Atsus5 and AB018561.
Domínios e proteína, sítios funcionais e motivos específicos pre- sentes em SusC (usando o programa Inter Pro Scan). SusC contém dois domínios chamados respectivamente sacarose sintase (dos aminoácidos 1 a 545) e glicosil transferase (dos aminoácidos 549 a 737) e uma região de fa- mília chamada sacarose sintase (dos aminoácidos 18 a 794). Sítio de ligação de UDP: a seqüência que segue representa a região de ligação de Uridina nas proteínas sacarose sintase C: 270VVIMTPHGYFAQDNVLGYPDTGGQVVYILDQVRALEEELLHRFKLQGLDI TPRILVITRL329. Esta seqüência mostra uma homologia muito alta em todas as sacarose sintases de planta conhecidas.Domains and protein, functional sites and specific motifs present in SusC (using the Inter Pro Scan program). SusC contains two domains called sucrose synthase (from amino acids 1 to 545) and glycosyl transferase (from amino acids 549 to 737 respectively) and a family region called sucrose synthase (from amino acids 18 to 794). UDP Binding Site: The following sequence represents the Uridine binding region on sucrose synthase C proteins: 270VVIMTPHGYFAQDNVLGYPDTGGQVVYILDQVRALEEELLHRFKLQGLDI TPRILVITRL329. This sequence shows a very high homology in all known plant sucrose synthases.
Estrutura genômica. Uma comparação foi feita no nível genômi- co do gene de SusyC com os outros genes de susy de algodão isolados (Fi- gura 3). Ficará imediatamente claro que o gene de susyC tem uma estrutura genômica particular pelo fato de que ele tem menos íntrons comparado com os outros genes de susy de Gossypium spp. A estrutura de éxon do íntron do gene de SusyC é também consideravelmente diferente dos genes susy de Arabidopsis.Genomic structure. A comparison was made at the genomic level of the SusyC gene with the other isolated cotton susy genes (Figure 3). It will be immediately clear that the susyC gene has a particular genomic structure by the fact that it has fewer introns compared to the other susy genes of Gossypium spp. The exon structure of the SusyC gene intron is also considerably different from the Arabidopsis susy genes.
Hidrofobicidade. O gráfico de hidrofobicidade de susyC foi anali- sado comparando-o a outras sacarose sintases de planta conhecidas. A ex- tremidade N-terminal parece ser hidrofóbica, uma particularidade que não deve ser encontrada na maior parte das sacarose sintases de planta. As sa- carose sintases que seguem eram mais similares a SusC neste nível: SUS6 (Arabidopsis thaliana), sacarose sintase de mung bean (D10266), sacarose sintase de eucalyptus (DD014303) e sacarose sintase de choupo (AY341026).Hydrophobicity. The hydrophobicity plot of susyC was analyzed by comparing it to other known plant sucrose synthases. N-terminal extremity appears to be hydrophobic, a feature not to be found in most plant sucrose synthases. The following saccharose synthases were more similar to SusC at this level: SUS6 (Arabidopsis thaliana), mung bean sucrose synthase (D10266), eucalyptus sucrose synthase (DD014303) and poplar sucrose synthase (AY341026).
Região de ligação de actina. Uma região de ligação de actina putativa foi descrita na literatura e parece estar presente nas sacarose sinta- ses (Winter, H. e outros - 1998 - FEBS Ietters 430, 205-208. Winter, H. e Hu- ber, S.C. - 2000 - Criticai Reviews in Plant Sciences 19(1), 31-67). Esta regi- ão se liga especificamente à F-actina. O sítio de ligação de actina de SusyC putativo tem a seqüência de aminoácido que segue: 383KDVAAEITKEFQ394. Os aminoácidos destacados em negrito são os que são específicos para SusyC. A seqüência consenso para este sítio entre sacarose sintases de planta diferentes é: [^E,D]-D-[V,A]-[A,G,S,T]-X-E-[L,V,I]-[T,S1A1M]- [K,R,G,M,L]-E-[F*,L,M]-[Q,N].Actin binding region. A putative actin binding region has been described in the literature and appears to be present in the sucrose syntheses (Winter, H. et al., 1998 - FEBS Ietters 430, 205-208. Winter, H. and Hu- ber, SC - 2000). - Critical Reviews in Plant Sciences 19 (1), 31-67). This region binds specifically to F-actin. The putative SusyC actin binding site has the following amino acid sequence: 383KDVAAEITKEFQ394. The amino acids highlighted in bold are those that are specific to SusyC. The consensus sequence for this site between different plant sucrose synthases is: [^ E, D] -D- [V, A] - [A, G, S, T] -XE- [L, V, I] - [ T, S1A1M] - [K, R, G, M, L] -E- [F *, L, M] - [Q, N].
Interessantemente, 383K(=lys) tem uma cadeia lateral básica en- quanto os resíduos de aminoácido (E(=glu) e D(=asp) têm cadeias laterais ácidas. Como K é único para a susyC isto pode indicar que esta proteína específica não se liga aos filamentos de F-actina. Além disso, o aminoácido F(=Phe), único para susyC, é mais volumoso do que seus contrapartes L e M das outras sacarose sintases de planta. Isto poderia atrapalhar mais a ligação com actina.Interestingly, 383K (= lys) has a basic side chain whereas amino acid residues (E (= glu) and D (= asp) have acidic side chains.) As K is unique to susyC this may indicate that this specific protein does not bind to the F-actin filaments In addition, the amino acid F (= Phe), unique to susyC, is rougher than its counterparts L and M from the other plant sucrose synthases. .
Seqüências de aminoácido específicas para proteínas SusC. Quando do alinhamento das diferentes sacarose sintases de planta se tor- nou aparente que algumas regiões do gene de susyC eram muito diferentes e específicas. Essas regiões estão situadas respectivamente nas extremida- des N- e C-terminais do gene de susyC e têm a seqüência de aminoácido que segue:SusC protein-specific amino acid sequences. By aligning the different plant sucrose synthases it became apparent that some regions of the susyC gene were very different and specific. These regions are located respectively at the N- and C-terminal ends of the susyC gene and have the following amino acid sequence:
A (região N-terminal) = 36HKSQKLLSVLDKEAGNQALDGMVV59 B (região C-terminal) = 765AYQEQRGRKRYIEMLHAWMYNNRVKT790A (N-terminal region) = 36HKSQKLLSVLDKEAGNQALDGMVV59 B (C-terminal region) = 765AYQEQRGRKRYIEMLHAWMYNNRVKT790
A região N-terminal foi analisada examinando sua estrutura 2D prevista. Para este propósito os programas que seguem foram usados: HCA (análise de agrupamento hidrofóbico) e JPRED. Os resultados de ambos programas foram alinhados e comparados com as outras Sacarose sintases de Gossypium hirsutum (Figura 4). Está claro a partir desta análise que susC neste local é absolutamente única em estrutura. Os sítios de fosforilação diferentes são também indicados confirmando a característica única do gene de susyC. Uma análise similar foi realizada na região C-terminal (Figura 5). Novamente, a estrutura da SusC era diferente da estrutura das outras saca- rose sintases de algodão nesta região.The N-terminal region was analyzed by examining its predicted 2D structure. For this purpose the following programs were used: HCA (hydrophobic cluster analysis) and JPRED. Results from both programs were aligned and compared with the other Gossypium hirsutum sucrose synthases (Figure 4). It is clear from this analysis that susC at this site is absolutely unique in structure. Different phosphorylation sites are also indicated confirming the unique characteristic of the susyC gene. A similar analysis was performed on the C-terminal region (Figure 5). Again, the structure of SusC was different from the structure of the other cotton sacsase in this region.
As regiões específicas de SusC N-terminais e C-terminais foram usadas para identificar seqüências EST em bancos de dados. Os resultados são sumarizados na tabela 3 que segue. Óvulos Parede Parede Outras espé- Outras Total (-3 a 0) de célu- de célula cies de gossy- espé- la pri- secundá- pium cies de mária ria planta (0 a 15 (20 a 40 dpa) dpa) Região N- 0 0 35 1 (=7-10 dpa) 0 36 terminal Região C- 1 0 30 1 (=7-10 dpa) 0 32 terminalSusC N-terminal and C-terminal specific regions were used to identify EST sequences in databases. The results are summarized in table 3 below. Eggs Wall Wall Other Species Other Total (-3 to 0) Cell Cell Gossy Species Primary Maria Plant Species (0 to 15 (20 to 40 dpa) dpa) Region N - 0 0 35 1 (= 7-10 dpa) 0 36 terminal Region C- 1 0 30 1 (= 7-10 dpa) 0 32 terminal
Tabela: número de EST's extraídas de alguns tecidos e espécies de planta específicos.Table: number of ESTs extracted from some specific tissues and plant species.
Exemplo 3. Superexpressão de SusC em algodãoExample 3. Cotton Overexpression of SusC
Usando técnicas de DNA recombinante padrão, os elementos de DNA que seguem são operavelmente ligados:Using standard recombinant DNA techniques, the following DNA elements are operably linked:
• um promotor específico de fibra de algodão conforme descrito no W02004/066571• a specific cotton fiber promoter as described in W02004 / 066571
• uma região de DNA codificando uma sacarose sintase C tendo a seqüência de nucleotídeo da SEQ ID N-1• a DNA region encoding a sucrose synthase C having the nucleotide sequence of SEQ ID N-1
• uma região de terminador 3' nos.• a terminator region 3 'nos.
Este gene quimérico é introduzido em um vetor T-DNA junto com um gene bar selecionável. O vetor T-DNA é introduzido em Agrobacterium tumefaciens e usado para produzir plantas de algodão transgênicas confor- me descrito na US 6.483.013.This chimeric gene is introduced into a T-DNA vector along with a selectable bar gene. The T-DNA vector is introduced into Agrobacterium tumefaciens and is used to produce transgenic cotton plants as described in US 6,483,013.
Plantas de algodão transgênicas compreendendo o gene quimé- rico são analisadas quanto à expressão aumentada de SusC1 particularmen- te nas fibras, e as fibras obtidas dessas plantas são analisadas quanto à re- sistência de fibra, comprimento de fibra, razão de maturidade de fibra, teor de fibra imatura, uniformidade de fibra e "micronaire". Exemplo 4. Regulação descendente da expressão da isoforma SusC emTransgenic cotton plants comprising the chimeric gene are analyzed for increased SusC1 expression particularly in fibers, and the fibers obtained from these plants are analyzed for fiber strength, fiber length, fiber maturity ratio, immature fiber content, fiber uniformity and micronaire. Example 4. Down-regulation of SusC isoform expression in
algodão Usando técnicas de DNA recombinante padrão, os elementos de DNA que seguem são operavelmente ligados:cotton Using standard recombinant DNA techniques, the following DNA elements are operably linked:
• uma região de promotor CaMV 35S• a CaMV 35S promoter region
• uma região de codificação de RNA de sentido correspondendo à seqüência de nucleotídeo da SEQ ID Ng 1 codificando a seqüência especí- fica de SusC N-terminal de SEQ ID Ne 18 ou a seqüência específica de SusC C-terminal da SEQ ID Ne 17.• a sense RNA coding region corresponding to the nucleotide sequence of SEQ ID Ng 1 encoding the specific SusC N-terminal sequence of SEQ ID Ne 18 or the specific SusC C-terminal sequence of SEQ ID Ne 17.
• uma região de codificação de RNA de antissentido correspon- dendo ao complemento da seqüência de nucleotídeo da região de RNA de• An antisense RNA coding region corresponding to the nucleotide sequence complement of the antisense RNA region.
sentido.sense.
• uma região de terminador 3' nos.• a terminator region 3 'nos.
Este gene quimérico é introduzido em um vetor T-DNA junto com um gene bar selecionável. O vetor T-DNA é introduzido em Agrobacterium tumefaciens e usado para produzir plantas de algodão transgênicas confor- me descrito na US 6.483.013.This chimeric gene is introduced into a T-DNA vector along with a selectable bar gene. The T-DNA vector is introduced into Agrobacterium tumefaciens and is used to produce transgenic cotton plants as described in US 6,483,013.
Plantas de algodão transgênicas compreendendo o gene quimé- rico são analisadas quanto à expressão menor de SusC1 particularmente nas fibras, e as fibras obtidas dessas plantas são analisadas quanto à resistência de fibra, comprimento de fibra, razão de maturidade de fibra, teor de fibra imatura, uniformidade de fibra e "micronaire".Transgenic cotton plants comprising the chimeric gene are analyzed for lower SusC1 expression particularly in fibers, and the fibers obtained from these plants are analyzed for fiber strength, fiber length, fiber maturity ratio, immature fiber content. , fiber uniformity and micronaire.
Exemplo 5. Superexpressão da isoforma de SusC em Arabidopsis thaliana transgênicaExample 5. Overexpression of SusC isoform in transgenic Arabidopsis thaliana
Usando técnicas de DNA recombinante padrão, os elementos de DNA que seguem foram operavelmente ligados: · uma região de promotor CaMV35SUsing standard recombinant DNA techniques, the following DNA elements have been operably linked: · a CaMV35S promoter region
• a região de DNA codificando a SusC de SEQ ID Ne:2 (SEQ ID N6:1 de nt84ant2474)• the SusC encoding DNA region of SEQ ID Ne: 2 (SEQ ID N6: 1 from nt84ant2474)
• uma região de terminador 3' nos• a 3 'terminator region in the
Ainda, o elemento de DNA que segue era operavelmente ligado · o promotor específico de caule do gene S2A de alfafa (Abra-In addition, the following DNA element was operably linked to the stem-specific promoter of the alfalfa S2A gene (Abraham).
hams e outros, 1995, Plant Mo. Bioi., 27:413-528) que resulta em expressão vascular e tricoma em uma faixa de espécies de planta • A região de DNA codificando a SusC de SEQ ID NQ:2 (SEQ ID Ns: 1 de nt 84 a nt 2474).Hams et al., 1995, Plant Mo. , 27: 413-528) which results in vascular expression and trichome in a range of plant species • The SusC-encoding DNA region of SEQ ID NQ: 2 (SEQ ID Ns: 1 from nt 84 to nt 2474) .
• Uma região de terminador 3' nos.• A terminator region 3 'nos.
Esses genes quiméricos foram introduzidos em um vetor T-DNA junto com um gene bar selecionável. O vetor T-DNA foi introduzido em A- grobacteríum tumefaciens e usado para produzir A. thaliana transgênica u- sando um método de mergulho floral. Plantas foram regeneradas e fenótipos registrados na geração T2 com vários fenótipos interessantes observados rotineiramente em plantas transgênicas obtendo um dos transgenes acima descritos.These chimeric genes were introduced into a T-DNA vector along with a selectable bar gene. The T-DNA vector was introduced into A-grobacterium tumefaciens and used to produce transgenic A. thaliana using a floral dip method. Plants were regenerated and phenotypes recorded in the T2 generation with several interesting phenotypes routinely observed in transgenic plants obtaining one of the transgenes described above.
Muitas das linhagens transgênicas para ambos construtos mos- tram ramificação de raiz acentuada em cultura, incluindo crescimento de raiz lateral extensiva. Adicionalmente observados foram perda de dominância apical, ramos florais múltiplos, perda de simetria de roseta e rigidez, rompi- mento de folha e filotaxia floral e tempo retardado de florescimento.Many of the transgenic strains for both constructs show marked root branching in culture, including extensive lateral root growth. Additionally observed were loss of apical dominance, multiple floral branches, loss of rosette symmetry and stiffness, leaf breakage and floral philotaxis, and delayed flowering time.
É imaginado que a ramificação aumentada e a perda de dominân- cia apical sejam devido a alongamento de célula reduzido em zonas meristemá- ticas devido a espessamento secundário prematuro de paredes de célula.Increased branching and loss of apical dominance are thought to be due to reduced cell elongation in meristematic zones due to premature secondary thickening of cell walls.
Uma grande proporção de linhagens transgênicas com qualquer um dos genes quiméricos mostrou um efeito fenotípico sobre estruturas tri- coma. Ramificação maior e pedestais espessos são geralmente vistos (Figu- ras 6C e D) e densidade de tricoma parece ser maior.A large proportion of transgenic lineages with any of the chimeric genes showed a phenotypic effect on tritic structures. Larger branching and thick pedestals are usually seen (Figures 6C and D) and trichome density appears to be higher.
Caules e ramos florais de linhagens transgênicas Sus-C geral- mente mostram espessamento extremo e são bifurcados, possivelmente fasciados (Figuras 6A e B).Stems and floral branches of Sus-C transgenic lineages usually show extreme thickening and are bifurcated, possibly fasciated (Figures 6A and B).
Três métodos foram usados para relacionar fenótipo com ex- pressão de transgene:Three methods were used to relate phenotype to transgene expression:
1) RT-PCR para Sus-C e o gene de Sus endógeno1) RT-PCR for Sus-C and the endogenous Sus gene
2) Western blotting usando o anticorpo específico de Sus-C (vide Figura 7)2) Western blotting using Sus-C specific antibody (see Figure 7)
3) Ensaio de atividade de Sus3) Sus activity test
Linhagens transgênicas mostrando níveis altos de expressão de 35S-Sus-C, correlacionado com os fenótipos mais severos Listagem de seqüência <110> Bayer BioScience N.V.Transgenic lines showing high levels of 35S-Sus-C expression correlated with more severe phenotypes Sequence Listing <110> Bayer BioScience N.V.
Commonwealth Scientific and Industrial ResearchCommonwealth Scientific and Industrial Research
OrganizationOrganization
Ruan, Yong-LingRuan, Yong-Ling
Furbank, RobertFurbank, Robert
Brill, LizzBrill, Lizz
Van Thournhout, MichelVan Thournhout, Michel
Arioli, AntonioArioli, Antonio
<120> Identificação de um Tipo de Sacarose Sintase e uso do Mesmo em Modificação Fibra<120> Identification of a Type of Sucrose Synthase and Use in Fiber Modification
<130> BCS 06-2008 WOl <150> EP 06015433.3 <151> 2006-07-25 <150> US 60/834066 <151> 2006-07-28 <160> 18<130> BCS 06-2008 WO1 <150> EP 06015433.3 <151> 2006-07-25 <150> US 60/834066 <151> 2006-07-28 <160> 18
<170> PatentIn versão 3.3 <210> 1 <211> 2635 <212> DNA<170> PatentIn version 3.3 <210> 1 <211> 2635 <212> DNA
<213> Gossypium hirsutum <220><213> Gossypium hirsutum <220>
<221> CDS<221> CDS
<222> (84)..(2474)<222> (84) .. (2474)
<400> 1<400> 1
acgcggggtg tctttgattt ttatccatat cctaattctt catacattcc ttcccttctg 60acgcggggtg tctttgattt ttatccatat cctaattctt catacattcc ttcccttctg 60
ccatttcagg aacccaaaaa aca atg gct tca ate agt gtt tgt gag cgt ttg 113ccatttcagg aacccaaaaa aca atg gct tca till agt gtt tgt gag cgt ttg 113
Met Ala Ser Ile Ser Val Cys Glu Arg Leu 10Met Ala Ser Ile Ser Val Cys Glu Arg Leu 10
ggt gaa tet cta gct act cat cca cag cag gea aag tet ate ttg tca 161ggt gaa tet cta gct act cat cca cag cag gea aag tet by ttg tca 161
Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys Ser Ile Leu Ser 20 25Gly Glu Ser Leu Wing Thr His Pro Gln Gln Wing Lys Ser Ile Leu Ser 20 25
agg att gaa age ctc gga aag ggt att cat aag tet caa aag ctg ctc 209agg att gaa age ct gga aag ggt att cat aag tet
Arg Ile Glu Ser Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu 35 40Arg Ile Glu Being Read Gly Lys Gly Ile His Lys Being Read Gln Lys Leu 35 40
tcg gtt ctc gat aaa gag gcc gga aat caa gea ctt gat ggg atg gtg 257tcg gtt ctc gat aaa gag gcc gga aat caa gea ctt gat ggg atg gtg 257
Ser Val Leu Asp Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met ValSer Val Leu Asp Lys Glu Wing Gly Asn Gln Wing Leu Asp Gly Met Val
45 50 5545 50 55
gtg ggg gtc ctc agg tcc act cag gaa gct gta gtg tcg tet cca ttg 305gtg ggg gtc ctc agg tcc act cag gaa gct gta gtg tcg tet cca ttg 305
Val Gly Val Leu Arg Ser Thr Gln Glu Ala Val Val Ser Ser Pro Leu 60 65 70Val Gly Val Leu Arg Be Thr Gln Glu Wing Val Val Ser Be Pro Leu 60 65 70
gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag tac att gct 353gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag tac att gct 353
Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala 75 80 85 90Val Wing Leu Wing Ile Arg Ser Wing Pro Gly Val Trp Glu Tyr Ile Wing 75 80 85 90
gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt gct gag tat 401gtg gag gtc caa aag ctt tt gtg gag gaa atg ccc gtt gct gag tat 401
Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val Ala Glu TyrVal Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val Wing Glu Tyr
95 100 10595 100 105
cta cgg ttg aag gaa gaa ctt gtt gat gga age tcc aat ggc gag ttt 449cta cgg ttg aag gaa gaa ctt gtt gat gga age tcc aat ggc gag ttt 449
Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser Ser Asn Gly Glu PheLeu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser As Asn Gly Glu Phe
110 115 120110 115 120
atg ttg gaa ttg gac ttt ggt gea ttc aat aat tet gtt cct cgt cca 497atg ttg gaa ttg gac tt ggt gea ttc aat aat tet gtt cct cgt cca 497
Met Leu Glu Leu Asp Phe Gly Ala Phe Asn Asn Ser Val Pro Arg ProMet Leu Glu Leu Asp Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro
125 130 135125 130 135
tet ctt tca aag tcc att ggt aat ggc atg gac ttc ctc aac cgc cac 545tet ctt tca aag tcc att ggt aat ggc atg gac ttc ctc aac cgc cac 545
Ser Leu Ser Lys Ser Ile Gly Asn Gly Met Asp Phe Leu Asn Arg HisBe Read Be Lys Be Ile Gly Asn Gly Met Asp Phe Read Asn Arg His
140 145 150140 145 150
ctt tet gcc aag cta ttt caa gac aag gag aac ttg aac ttg ttg ctt 593ctt tet gcc aag cta ttt caa gac aag gag aac ttg aac ttg ttg ctt 593
Leu Ser Ala Lys Leu Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu 155 160 165 170Leu Ser Ala Lys Leu Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu 155 160 165 170
gaa ttt ctc caa att cac tgc cag aag gga aag ggt atg ctg ttg aat 641gaa ttt ctc caa att cac tgc cag aag gga aag ggt atg ctg ttg aat 641
Glu Phe Leu Gln Ile His Cys Gln Lys Gly Lys Gly Met Leu Leu AsnGlu Phe Leu Gln Ile His Cys Gln Lys Gly Lys Gly Met Leu Leu Asn
175 180 185175 180 185
gac aga ate caa gat gtg aat tcc ctc caa cat gea tta agg aag gcc 689gac aga till caa gat gtg aat tcc ctc caa cat gea tta agg aag gcc 689
2020
25 Asp Arg Ile25 Asp Arg Ile
gag gag tat Glu Glu Tyrgag gag tat Glu Glu Tyr
205205
gag aaa agg Glu Lys Arg 220 gct gag cat Ala Glu His 235gag aaa agg Glu Lys Arg 220 gct cat gag Ala Glu His 235
gca cct gat Ala Pro Aspgca cct gat Ala Pro Asp
1 5 gtc gcc aat1 5 gtc gcc aat
Val Ala AsnVal Wing Asn
aat gtt ttg Asn Val Leuaat gtt ttg Asn Val Leu
285285
gat caa gtt Asp Gln Valgat caa gtt Asp Gln Val
300 caa gga ctc300 caa gga ctc
Gln Gly Leu 315Gln Gly Leu 315
cct gat gct Pro Asp Alacct gat gct Pro Asp Wing
gga acc aaggga acc aag
Gly Thr LysGly thr lys
Gln Asp Val Asn Ser 190Gln Asp Val Asn Ser 190
ctg act cct cta tcc Leu Thr Pro Leu Ser 210ctg act cct cta tcc Leu Thr Pro Leu Ser 210
ttt ctg ggg att ggt Phe Leu Gly Ile Gly 225ttt ctg ggg att ggt Phe Leu Gly Ile Gly 225
gtt ctt gag atg ate Val Leu Glu Met Ile 240gtt ctt gag atg to Val Leu Glu Met Ile 240
cct gtc gca ctt gaa Pro Val Ala Leu Glu 255cct gtc gca ctt gaa Pro Val Wing Leu Glu 255
gtt gta ate atg act Val Val Ile Met Thr 270gtt gta until atg act Val Val Ile Met Thr 270
gga tac cct gac aca Gly Tyr Pro Asp Thr 290gga tac cct gac ac Gly Tyr Pro Asp Thr 290
cgt gcc ttg gag gaa Arg Ala Leu Glu Glu 305cgt gcc ttg gag gaa Arg Wing Leu Glu Glu 305
gac att acc cca cgt Asp Ile Thr Pro Arg 320gac att acc cca cgt Asp Ile Thr Pro Arg 320
gta gga aca act tgc Val Gly Thr Thr Cys 335gta gga aca tgc act Val Gly Thr Thr Cys 335
tat tet gat att ctt Tyr Ser Asp Ile Leu 350tat tet gat att ctt Tyr Ser Asp Ile Leu 350
Leu Gln His Ala Leu 195Read Gln His Wing Leu 195
tcg gat acc ccg tac Ser Asp Thr Pro Tyr 215tcg gat acc ccg tac Ser Asp Thr Pro Tyr 215
ttg gag aag gga tgg Leu Glu Lys Gly Trp 230ttg gag aag gga tgg Leu Glu Lys Gly Trp 230
cat ctt cta ttg gat His Leu Leu Leu Asp 245cat ctt cta ttg gat His Leu Leu Leu Asp 245
age ttc ctt gga aga Ser Phe Leu Gly Arg 260age ttc ctt gga aga Ser Phe Leu Gly Arg 260
ccc cat gga tac ttt Pro His Gly Tyr Phe 275ccc cat gga tac tt pro Gly Tyr Phe 275
ggt ggt cag gtt gtt Gly Gly Gln Val Val 295ggt ggt cag gtt gtt Gly Gly Gln Val Val 295
gag ttg ctt cat cgt Glu Leu Leu His Arg 310gag ttg ctt cat cgt Glu Leu Leu His Arg 310
ate cta gtc att act Ile Leu Val Ile Thr 325until cta gtc att act Ile Leu Val Ile Thr 325
ggt cag cgt ctt gag Gly Gln Arg Leu Glu 340ggt cag cgt ctt gag Gly Gln Arg Leu Glu 340
cgg gta ccc ttc aga Arg Val Pro Phe Arg 355cgg gta ccc ttc aga Arg Val Pro Phe Arg 355
Arg Lys Ala 200Arg Lys Wing 200
tca gtt ttc 737tca gtt ttc 737
Ser Val PheTo be Val Phe
ggt gat aat 785ggt gat aat 785
Gly Asp AsnGly Asp Asn
ctc ctt cag 833ctc ctt cag 833
Leu Leu Gln 250Leu Leu Gln 250
ate cca ttg 881until cca ttg 881
Ile Pro Leu 265Ile Pro Leu 265
gcc caa gac 929gcc caa gac 929
Ala Gln Asp 280Gln Wing Asp 280
tat ate tta 977tat till tta 977
Tyr Ile LeuTyr Ile Leu
ttc aag ctg 1025ttc aag ctg 1025
Phe Lys LeuPhe Lys Leu
cgg ctc ctc 1073cgg ctc ctc 1073
Arg Leu Leu 330Arg Leu Leu 330
aaa gtg tat 1121aaa gtg tat 1121
Lys Val Tyr 345Lys Val Tyr 345
act gag aag 1169act gag aag 1169
Thr Glu LysThr Glu Lys
360 10360 10
1515
gga att gta cgt cca tgg ate tca cga ttc aaa gtc tgg cct tac ctg 1217gga att gta cgt cca tgg till tca cga ttc aaa gtc tgg cct tac ctg 1217
Gly Ile Val Arg Pro Trp Ile Ser Arg Phe Lys Val Trp Pro Tyr LeuGly Ile Val Arg Pro Trp Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu
365 370 375365 370 375
gag act tac act aag gat gtt gea gct gag ate acc aaa gag ttc cag 1265gag act tac act aag gat gtt gea gct gag until acc aaa gag ttc cag 1265
Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Ile Thr Lys Glu Phe GlnGlu Thr Tyr Thr Lys Asp Val Wing Wing Glu Ile Thr Lys Glu Phe Gln
380 385 390380 385 390
ggc aag cct gat ctg att gtt ggg aac tac agt gat gga aac att gtt 1313ggc aag cct gat ctg att ggg aac tac agt gat gga aac att gtt 1313
Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser Asp Gly Asn Ile Val 395 400 405 410Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser Asp Gly Asn Ile Val 395 400 405 410
gct tet tta ttg gea cat aag ttt gat gtt aca cag tgc act att gct 1361gt tet tta ttg gea cat cat aag ttt gat gtt aca cag tgc act att gct 1361
Ala Ser Leu Leu Ala His Lys Phe Asp Val Thr Gln Cys Thr Ile AlaAla Ser Leu Leu Wing His Lys Phe Asp Val Thr Gln Cys Thr Ile Wing
415 420 425415 420 425
cgt gea ctc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1409cgt gea ctc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1409
Arg Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysArg Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
430 435 440430 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1457cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1457
Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp LeuGln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu
445 450 455445 450 455
att gct atg aac cat act gat ttt ate ate acc age acc ttc caa gag 1505att gct atg aac cat act gat ttt till till acc age acc ttc caa gag 1505
Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln GluIle Ala Met Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu
460 465 470460 465 470
att gct gga age aag gac act ctc ggc caa tac gag agt cac att gct 1553att gct gga age aag gac act ctc ggc caa tac gag agt cac att gct 1553
Ile Ala Gly Ser Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala 475 480 485 490Ile Wing Gly Be Lys Asp Thr Read Gly Gln Tyr Glu Be His Ile Wing 475 480 485 490
ttc act ctt cca ggg ctc tac cgc gtt gtt gat ggg ate gat gtt ttt 1601 Phe Thr Leu Pro Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phettc act ctt cg ggg ctc tac cgc gtt gtt gat ggg until gat gtt ttt 1601 Phe Thr Leu Pro Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phe
495 500 505495 500 505
gat ccc aaa ttc aat att gtc tcc cct ggt gct gat atg age ata tac 1649gat ccc aaa ttc aat att gtc tcc cct ggt gct gat atg age ata tac 1649
Asp Pro Lys Phe Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile TyrAsp Pro Lys Phe Asn Ile Val Ser Pro Gly Wing Asp Met Ser Ile Tyr
510 515 520510 515 520
ttc cct tac acg gaa gag aag cgg agg ttg aag aag ttc cac ccg gag 1697ttc cct tac acg gaa gag aag cgg agg ttg aag aag ttc cac ccg gag 1697
Phe Pro Tyr Thr Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro GluPhe Pro Tyr Thr Glu Glu Lys Arg Arg Read Le Lys Lys Phe His Pro Glu
2020
30 att gaa gag Ile Glu Glu 54030 att gaa gag Ile Glu Glu 540
gta cta aaa Val Leu Lys 555gta cta aaa Val Leu Lys 555
gac cga gtg Asp Arg Valgac cga gp Asp Arg Val
1010
age cgg ctg Ser Arg Leuage cgg ctg Ser Arg Leu
aga aag gaa 1 5 Arg Lys Glu 605aga aag gaa 1 5 Arg Lys Glu 605
tat gaa ctt Tyr Glu Leu 620tat gaa ctt Tyr Glu Leu 620
tcg tcc cag Ser Ser Gln 635tcg tcc cag Ser Ser Gln 635
tgc gac acc Cys Asp Thrtgc gac acc Cys Asp Thr
2525
ggc ttg act Gly Leu Thrggc ttg act Gly Leu Thr
aca tgt tac Thr Cys Tyr 685aca tgt tac Thr Cys Tyr 685
ttc aac atettc aac until
530530
ctt ctt tac age cct Leu Leu Tyr Ser Pro 545ctt ctt tac age cct Leu Leu Tyr Ser Pro 545
gac cgc aac aag ccg Asp Arg Asn Lys Pro 560gac cgc aac cc cc Asp Arg Asn Lys Pro 560
aag aac tta act ggg Lys Asn Leu Thr Gly 575aag aac tta act ggg Lys Asn Leu Thr Gly 575
agg gaa ctg gtt aac Arg Glu Leu Val Asn 590agg gaa ctg gtt aac Arg Glu Leu Val Asn 590
tcc aag gac tta gaa Ser Lys Asp Leu Glu 610tcc aag gac tta gaa Ser Lys Asp Leu Glu 610
ate gaa aaa tac aag Ile Glu Lys Tyr Lys 625till gaa aaa tac aag Ile Glu Lys Tyr Lys 625
atg aac cga gtg aga Met Asn Arg Val Arg 640atg aac cga gtg aga Met Asn Arg Val Arg 640
aag gga gea ttc gtt Lys Gly Ala Phe Val 655aag gga gea ttc gtt Lys Gly Ala Phe Val 655
gtt gtt gag gea atg Val Val Glu Ala Met 670gtt gtt gag gea atg Val Val Glu Ala Met 670
999 ggc cct gct gag Gly Gly Pro Ala Glu 690999 ggc cct gct gag Gly Gly Pro Wing Glu 690
gat cct tat aac ggtgat cct tat aac ggt
4848
535535
gtt gag aat aca gag Val Glu Asn Thr Glu 550gtt gag aat aca gag Val Glu Asn Thr Glu 550
att ctg ttt acc atg Ile Leu Phe Thr Met 565att ctg ttt acc atg Ile Leu Phe Thr Met 565
ctc gta gag ttc tat Leu Val Glu Phe Tyr 580ctc gta gag ttc tat Leu Val Glu Phe Tyr 580
ttg gtt gta gta ggt Leu Val Val Val Gly 595ttg gtt gta gta ggt Leu Val Val Val Gly 595
gaa aag gct gaa atg Glu Lys Ala Glu Met 615gaa aag gct gaa atg Glu Lys Ala Glu Met 615
ttg aat gga caa ttc Leu Asn Gly Gln Phe 630ttg aat gga caa ttc Read Asn Gly Gln Phe 630
aat ggt gaa ctc tat Asn Gly Glu Leu Tyr 645aat ggt gaa ctc tat Asn Gly Glu Leu Tyr 645
cag cct cct ata tac Gln Pro Pro Ile Tyr 660cag cct cct tac tac Gln Pro Ile Tyr 660
acc tgt gga cta cca Thr Cys Gly Leu Pro 675acc tgt gga cca cca Thr Cys Gly Leu Pro 675
att ata gtt cac gga Ile Ile Val His Gly 695att ata gtt cac gga Ile Ile Val His Gly 695
gat ttg gct gcc gaggat ttg gct gcc gag
cac tta tgt 1745cac tta tgt 1745
His Leu CysHis Leu Cys
gea agg ttg 1793 Ala Arg Leu 570gea agg ttg 1793 Arg Wing Leu 570
gcc aag aac 1841gcc aag aac 1841
Ala Lys Asn 585Wing Lys Asn 585
gga gat agg 1889 Gly Asp Arg 600gga gat agg 1889 Gly Asp Arg 600
aag aag atg 1937aag aag atg 1937
Lys Lys MetLys Lys Met
aga tgg ata 1985 Arg Trp Ileagg tgg ata 1985 Arg Trp Ile
cgt tat att 2033 Arg Tyr Ile 650cgt tat att 2033 Arg Tyr Ile 650
gag gct ttt 2081gag gct ttt 2081
Glu Ala Phe 665Glu Wing Phe 665
aca ttt gea 2129 Thr Phe Ala 680aca ttt gea 2129 Thr Phe Ala 680
aaa tcg ggg 2177aaa tcg ggg 2177
Lys Ser GlyLys Ser Gly
acc ctt gcc 2225 10acc ctt gcc 2225 10
Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala Glu Thr Leu AlaPhe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Wing Wing Wing Glu Thr Leu Wing
700 705 710700 705 710
aat ttc ttc gag aag tgc aaa gcg gat cca tct tat tgg gat gag ate 2273aat ttc ttg gag aag tgc aaa gcg gat cca tct tat tgg gat gag until 2273
Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp Asp Glu IleAsn Phe Phe Glu Lys Cys Lys Wing Asp Pro Ser Tyr Trp Asp Glu Ile
715 720 725 730715 720 725 730
tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg cag att 2321tcc cag gga ggg ttg aaa cgc ata cag gag aag tat acg tgg cag att 2321
Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp Gln IleBe Gln Gly Gly Read Lys Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile
735 740 745735 740 745
tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc ttt tcg aaa 23 69 Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe Ser Lystac tcc gag aag cta ttg act ctc acc gga gt tat ggc ttt tcg aaa 23 69 Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys
750 755 760750 755 760
cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att gaa atg 2417 His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile Glu Metcat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att gaa atg 2417 His Val Tyr Gln Glu Arg Gly Arg Lys Tyr Ile Glu Met
765 770 775765 770 775
ttg cat gea tgg atg tat aac aat cgg gtc aag act gtt cca cta gct 2465 Leu His Ala Trp Met Tyr Asn Asn Arg Val Lys Thr Val Pro Leu Ala 780 785 790ttg cat gea tgg atg tat aac aat cgg gtc aag act gtt cca cta gct 2465 Leu His Ala Trp Met Tyr Asn Asn Arg Val Lys Thr Val Pro Leu Ala 780 785 790
gtt gag taa agaaaaagca aaaggctgct tggaaaccaa ggaatttcca 2514gtt gag taa agaaaaagca aaaggctgct tggaaaccaa ggaatttcca 2514
Val GluVal glu
795795
agaatgattg ctattgaagt tttatttttg cttctttttt tcaatttcgt aaatgtttaa 2574 tttcaacttt ggggtttccc cttggaatgt tttgaactct gggtttggtc ccattgattg 2634 cagaatgattg ctattgaagt tttatttttg cttctttttt tcaatttcgt aaatgtttaa 2574 tttcaacttt ggggtttccc cttggaatgt tttgaactct gggtttggtc ccattgattg 2634 c
26352635
<210> 2 <211> 796 <212> PRT<210> 2 <211> 796 <212> PRT
<213> Gossypium hirsutum <400> 2<213> Gossypium hirsutum <400> 2
1515
2020
3030
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr 10 I5Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Wing Thr 10 I5
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu Gly 25 30 Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluHis Pro Gln Gln Wing Lys Be Ile Leu Be Arg Ile Glu Be Leu Gly 25 30 Lys Gly Ile His Lys Be Gln Lys Leu Leu Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Gly Val Leu Arg SerGly Wing Asn Gln Wing Read Asp Gly Met Val Val Gly Val Leu Arg Ser
50 55 6050 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu GluPhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu
100 105 110100 105 110
Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Asn Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile HisGln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile His
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Ser Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Leu GlyRead Be Ser Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Read Gly
210 215 220210 215 220
Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Ala Asn Val Val IleRead Glu Ser Phe Read Gly Arg Ile Pro Read Val Wing Asn Val Val Ile
260 265 270260 265 270
Met Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr Pro 275 280 285 Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuMet Thr Pro His Gly Tyr Phe Wing Gln Asp Asn Val Leu Gly Tyr Pro 275 280 285 Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu Leu
290 295 300290 295 300
Glu Glu Glu Leu Leu His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Leu Read His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly ThrPro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr
325 330 335325 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro TrpIle Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp
355 360 365355 360 365
Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Wing Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala HisVal Gly Asn Tyr Be Asp Gly Asn Ile Val Wing Be Read Leu Wing His
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Thr Ile Ala Arg Ala Leu Glu Lys ThrLys Phe Asp Val Thr Gln Cys Thr Ile Wing Arg Wing Leu Glu Lys Thr
420 425 430420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480Asp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Ser Lys Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu Tyr 530 535 540 Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu Tyr 530 535 540 Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Ile Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Ile Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Pro Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrGlu Wing Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro Tyr
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu LysLys Wing Asp Pro Being Tyr Trp Asp Glu Ile Being Gln Gly Gly Leu Lys
725 730 735725 730 735
Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuArg Ile Gln Lys Tyr Thr Trp Gln Ile Tyr Be Glu Lys Leu Leu
740 745 750740 745 750
Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln GluThr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu
755 760 765755 760 765
Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Ala Trp Met TyrGln Arg Gly Arg Lys Arg Tyr Ile Glu Met Read His Wing Trp Met Tyr
770 775 780770 775 780
Asn Asn Arg Val Lys Thr Val Pro Leu Ala Val Glu 785 790 795 <210> 3 <211> 796 <212> PRTAsn Asn Arg Val Lys Thr Val Pro Leu Wing Val Glu 785 790 795 <210> 3 <211> 796 <212> PRT
<213> Gossypium hirsutum <4 00> 3<213> Gossypium hirsutum <4 00> 3
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr 10 15Met Wing Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Wing Thr 10 15
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyHis Pro Gln Gln Wing Lys Be Ile Read G Be Arg Ile Glu Be Read Gly
25 3025 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluLys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg SerWing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu Arg Ser
50 55 6050 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu GluPhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu
100 105 110100 105 110
Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Asn Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile HisGln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile His
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Ser Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Leu Gly 210Read Being Ser Asp Thr Pro Tyr Being Val Phe Glu Lys Arg Phe Leu Gly 210
Ile Gly Leu Glu Lys 225Ile Gly Leu Glu Lys 225
Met Ile His Leu Leu 245Met Ile His Leu Leu 245
Leu Glu Ser Phe Leu 260Read Glu Ser Phe Leu 260
Met Thr Pro His Gly 275Met Thr Pro His Gly 275
Asp Thr Gly Gly Gln 290Asp Thr Gly Gly Gln 290
Glu Glu Glu Leu Leu 305Glu Glu Glu Leu Leu 305
Pro Arg Ile Leu Val 325Pro Arg Ile Leu Val 325
Thr Cys Gly Gln Arg 340Thr Cys Gly Gln Arg 340
Ile Leu Arg Val Pro 355Ile Leu Arg Val Pro 355
Ile Ser Arg Phe Lys 370Ile Ser Arg Phe Lys 370
Val Ala Ala Glu Ile 385Val Wing Glu Ile Wing 385
Val Gly Asn Tyr Ser 405Val Gly Asn Tyr Ser 405
Lys Phe Asp Val Thr 420Lys Phe Asp Val Thr 420
Lys Tyr Pro Asp Ser 435Lys Tyr Pro Asp Ser 435
His Phe Ser Cys Gln 450His Phe Ser Cys Gln 450
Asp Phe Ile Ile ThrAsp Phe Ile Ile Thr
215215
Gly Trp Gly Asp Asn Ala 230 235Gly Trp Gly Asp Asn Wing 230 235
Leu Asp Leu Leu Gln Ala 250Leu Asp Leu Leu Gln Wing 250
Gly Arg Ile Pro Leu Val 265Gly Arg Ile Pro Leu Val 265
Tyr Phe Ala Gln Asp Asn 280Tyr Phe Ala Gln Asp Asn 280
Val Val Tyr Ile Leu Asp 295Val Val Tyr Ile Leu Asp 295
His Arg Phe Lys Leu Gln 310 315His Arg Phe Lys Leu Gln 310 315
Ile Thr Arg Leu Leu Pro 330Ile Thr Arg Leu Leu Pro 330
Leu Glu Lys Val Tyr Gly 345Read Glu Lys Val Tyr Gly 345
Phe Arg Thr Glu Lys Gly 360Phe Arg Thr Glu Lys Gly 360
Val Trp Pro Tyr Leu Glu 375Val Trp Pro Tyr Leu Glu 375
Thr Lys Glu Phe Gln Gly 390 395Thr Lys Glu Phe Gln Gly 390 395
Asp Gly Asn Ile Val Ala 410Asp Gly Asn Ile Val Wing 410
Gln Cys Thr Ile Ala His 425Gln Cys Thr Ile Wing His 425
Asp Ile Asn Trp Lys Gln 440Asp Ile Asn Trp Lys Gln 440
Phe Thr Ala Asp Leu Ile 455Phe Thr Wing Asp Leu Ile 455
Ser Thr Phe Gln Glu IleSer Thr Phe Gln Glu Ile
220220
Glu His Val Leu Glu 240Glu His Val Leu Glu 240
Pro Asp Pro Val Ala 255Pro Asp Pro Val Wing 255
Ala Asn Val Val Ile 270Wing Asn Val Val Ile 270
Val Leu Gly Tyr Pro 285Val Leu Gly Tyr Pro 285
Gln Val Arg Ala Leu 300Gln Val Arg Wing Leu 300
Gly Leu Asp Ile Thr 320Gly Leu Asp Ile Thr 320
Asp Ala Val Gly Thr 335Asp Wing Val Gly Thr 335
Thr Lys Tyr Ser Asp 350Thr Lys Tyr Ser Asp 350
Ile Val Arg Pro Trp 365Ile Val Arg Pro Trp 365
Thr Tyr Thr Lys Asp 380Thr Tyr Thr Lys Asp 380
Lys Pro Asp Leu Ile 400Lys Pro Asp Leu Ile 400
Ser Leu Leu Ala His 415Ser Leu Leu Wing His 415
Ala Leu Glu Lys Thr 430Wing Read Glu Lys Thr 430
Leu Glu Asp Lys Tyr 445Read Glu Asp Lys Tyr 445
Ala Met Asn His Thr 460Met Wing Asn His Thr 460
Ala Gly Ser Lys Asp 465 470 475 480Gly Ser Lys Wing Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Ile Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Ile Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Pro Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrGlu Wing Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro Tyr
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu Lys 735 Lys Leu Leu 750Lys Wing Asp Pro Be Tyr Trp Asp Glu Ile Be Gln Gly Gly Leu Lys 735 Lys Leu Leu 750
Tyr Gln Glu Trp Met TyrTyr Gln Glu Trp Met Tyr
<210> 4<210> 4
<211> 3353<211> 3353
<212> DNA<212> DNA
<213> Gossypium hirsutum <220><213> Gossypium hirsutum <220>
<221> misc_feature<221> misc_feature
<222> (32) . . (34)<222> (32). . (34)
<223> códon de iniciação <220><223> initiation codon <220>
<221> éxon<221> exon
<222> (32)..(111)<222> (32) .. (111)
<223> exonl <220><223> exonl <220>
<221> éxon<221> exon
<222> (197)..(314)<222> (197) .. (314)
<223> exon2 <220><223> exon2 <220>
<221> éxon<221> exon
<222> (394)..(545)<222> (394) .. (545)
<223> exon3 <220><223> exon3 <220>
<221> éxon<221> exon
<222> (625)..(817)<222> (625) .. (817)
725725
Arg Ile Gln Glu Lys Tyr 740Arg Ile Gln Glu Lys Tyr 740
Thr Leu Thr Gly Val Tyr 755Thr Leu Thr Gly Val Tyr 755
Gln Arg Gly Arg Lys Arg 770Gln Arg Gly Arg Lys Arg 770
Asn Asn Arg Val Lys Thr 785 790Asn Asn Arg Val Lys Thr 785 790
730730
Thr Trp Gln Ile Tyr Ser Glu 745Thr Trp Gln Ile Tyr Ser Glu 745
Gly Phe Ser Lys His Val Ala 760 765Gly Phe Ser Lys His Val Wing 760 765
Tyr Ile Glu Met Leu His Ala 775 780Tyr Ile Glu Met Read His Wing 775 780
Val Pro Leu Ala Val Glu 795 <223> exon4 <220>Val Pro Leu Wing Val Glu 795 <223> exon4 <220>
<221> éxon <222> (897)..(1328)<221> exon <222> (897) .. (1328)
<223> exon5 <220><223> exon5 <220>
<221> éxon <222> (1399) . . (1689)<221> exon <222> (1399). . (1689)
<223> exon6 <220><223> exon6 <220>
<221> éxon<221> exon
<222> (1783) . . (1949)<222> (1783). . (1949)
<223> exon7 <220><223> exon7 <220>
<221> éxon<221> exon
<222> (2050) .. (2274)<222> (2050) .. (2274)
<223> exon8<223> exon8
<220><220>
<221> éxon<221> exon
<222> (2358) . . (3060)<222> (2358). . (3060)
<223> exon9 <220><223> exon9 <220>
<221> éxon<221> exon
<222> (3169) . . (3195)<222> (3169). . (3195)
<223> exonlO <220><223> exonlO <220>
<221> misc_feature<221> misc_feature
<222> (3196) .. (3171)<222> (3196) .. (3171)
<223> Códon de terminação<223> Termination Codon
<400> 4<400> 4
cccttctgcc atttcaggaa cccaaaaaac a atg gct tca ate agt gtt tgt 52cccttctgcc atttcaggaa cccaaaaaac atg gct tca till agt gtt tgt 52
Met Ala Ser Ile Ser Val Cys 1 5Met Ala Ser Ile Ser Val Cys 1 5
gag cgt ttg ggt gaa tct cta gct act cat cca cag cag gca aag tct 100gag cgt ttg ggt gaa tct cta gct act cat cca cag cag gca aag tct 100
Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys Ser 15 20Glu Arg Read Gly Glu Be Read Ala Thr His Pro Gln Gln Ala Lys Ser 15 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151until ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151
Ile Leu Ser Arg 25Ile Leu Ser Arg 25
tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206
Ile Glu SerIle Glu Ser
3030
ctc gga aag ggt att cat aag tct caa aag ctg ctc tcg gtt ctc gat 254ctc gga aag ggt att cat aag tct caa aag ctg ctc tcg gtt ctc gat 254
Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu AspLeu Gly Lys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp
40 4540 45
aaa gag gcc gga aat caa gca ctt gat ggg atg gtg gtg gag gtc ctc 3 02aaa gag gcc gga aat caa gca ct gat ggg atg gtg gtg gag gtc ctc 3 02
1 5 Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu 50 55 601 5 Lys Glu Wing Gly Asn Gln Wing Read Asp Gly Met Val Val Glu Val Leu 50 55 60
agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354
Arg Ser Thr Gln 65Arg Ser Thr Gln 65
atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 4 08atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 4 08
Glu Ala Val Val Ser 70Glu Wing Val Val Ser 70
tct cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 456tct cca ttg gtt gcc ctt gcc att cgt
Ser Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp GluSer Pro Leu Wing Val Leu Wing Ile Arg Ser Pro Wing Gly Val Trp Glu
75 80 8575 80 85
tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504
Tyr Ile Ala Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro ValTyr Ile Wing Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val
90 95 10090 95 100
gct gag tat cta cgg ttg aag gaa gaa ctt gtt gat gga ag 545gct gag tat cta cgg ttg aag gaa gaa ct gt gat gga ag 545
Ala Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115Glu Wing Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115
gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 10gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 10
1515
gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655
Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125
ttt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 703tt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 703
Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys SerPhe Gly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Be Lys Be
130 135 140130 135 140
att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751
Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys LeuIle Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu To Be Ala Lys Leu
145 150 155145 150 155
ttt caa gac aag gag aac ttg aac ttg ttg ctt gaa ttt ctc caa att 799ttt caa gac aag gag aac ttg aac ttg ttg ct gaa ttt ctc caa att 799
Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175
cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847
His Cys Gln Lys Gly Lys 180His Cys Gln Lys Gly Lys 180
cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905
Gly Met LeuGly Met Leu
ttg aat gac aga ate caa gat gtg aat tcc ctc caa cat gca tta agg 953ttg aat gac aga until hunting gat gtg aat tcc ctc hunting cat gca tta agg 953
Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu Arg 185 190 195 200Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Wing Leu Arg 185 190 195 200
aag gcc gag gag tat ctg act cct cta tcc tcg gat acc ccg tac tca 1001aag gcc gag gag ctg act cct cta tcc tcg gat acc ccg tac tca 1001
Lys Ala Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr SerLys Wing Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr Ser
205 210 215205 210 215
gtt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt 1049gt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt 1049
Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp GlyVal Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp Gly
220 225 230220 225 230
gat aat gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc 1097gat aat gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc 1097
Asp Asn Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu 235 240 245Asp Asn Glu Wing His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu 235 240 245
ctt cag gca cct gat cct gtc gca ctt gaa age ttc ctt gga aga ate 1145ctt cag gca cct gat cct gtc gca ct gaa age ttc ctt gga aga until 1145
Leu Gln Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Arg IleRead Gln Wing Pro Asp Pro Val Wing Read Glu Ser Phe Leu Gly Arg Ile
2020
25 1025 10
250 255 260250 255 260
cca ttg gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gcc 1193cca ttg gtc gcc aat gtt gta until atg act ccc cat gga tac ttt gcc 1193
Pro Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala 265 270 275 280Pro Leu Val Wing Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Wing 265 270 275 280
caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241
Gln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val TyrGln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr
285 290 295285 290 295
ate tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289till tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289
Xle Leu Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg PheXle Leu Asp Gln Val Arg Wing Leu Glu Glu Glu Leu Read His Arg Phe
300 305 310300 305 310
aag ctg caa gga ctc gac att acc cca cgt ate cta gtc gtgagtatct 1338aag ctg caa gga ctc gac att acc cca cgt until cta gtc gtgagtatct 1338
Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325
gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc ctc cct gat gct gta gga aca act tgc ggt cag cgt 1446 Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln Arggaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc cct gat gct gta gga aca act tgc ggt cag cgt 1446 Ile Thr Arg Leu Pro Asp Thr Gly Val Gly
330 335 340330 335 340
ctt gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494ct gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494
Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 345 350 355Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 345 350 355
ttc aga act gag aag gga att gta cgt cca tgg ate tca cga ttc aaa 1542ttc aga act gag aag gga att gta cgt cca tgg until tca cga ttc aaa 1542
Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Ser Arg Phe LysPhe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Be Arg Phe Lys
360 365 370360 365 370
gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag ate 1590gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag until 1590
Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Ile 375 380 385Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Wing Wing Glu Ile 375 380 385
acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1638acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1638
Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser 390 395 400 405Thr Lys Glu Phe Gln Gly Lys Pro Asp Ile Ile Val Gly Asn Tyr Ser 390 395 400 405
gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686
Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lys Phe Asp Val Thr 410 415 420Asp Gly Asn Ile Val Wing Be Read Leu Wing His Lys Phe Asp Val Thr 410 415 420
30 1030 10
cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739
GlnGln
tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794
Cys Thr Ile Ala 425Cys Thr Ile Wing 425
cat gca ctc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1842cat gca ctc gag aag ag aaa tac ccg gat tca gac att aac tgg aaa 1842
His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysHis Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
430 435 440430 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1890cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1890
Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp LeuGln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu
445 450 455445 450 455
att gct atg aac cat act gat ttt ate ate acc age acc ttc caa gag 1938att gct atg aac cat act gat ttt until till acc age acc ttc caa gag 1938
Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln GluIle Ala Met Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu
460 465 470460 465 470
att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989
Ile Ala Gly SerIle Wing Gly Ser
475475
cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc ggc caa tac gag agt cac att gct ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro 480 485 490cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc gg ca ca ca tac gag ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Being His Ileu
ggg ctc tac cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc 2146ggg ctc tac cgc gtt gtt gat ggg till gat gtt ttt gat ccc aaa ttc 2146
Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe 495 500 505 510Gly Leu Tyr Arg Val Val Asp Gly Ile Val Val Asp Phe Asp Pro Lys Phe 495 500 505 510
aat att gtc tcc cct ggt gct gat atg age ata tac ttc cct tac acg 2194aat att gtc tcc cct ggt gct gat atg act ata tac ttc cct tac acg 2194
Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr ThrAsn Ile Val Ser Pro Gly Wing Asp Met Ser Ile Tyr Phe Pro Tyr Thr
515 520 525515 520 525
gaa gag aag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242gag gag ag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242
Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540Glu Glu Lys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540
ctt tac age cct gtt gag aat aca gag cac tt gtgagatttt ggtcctattt 2294 10ct tac age cct gt gt gt aa gag aa gag cac tt gtgagatttt ggtcctattt 2294 10
1515
Leu Tyr Ser Pro Val Glu Asn Thr Glu His LeuLeu Tyr Ser Pro Val Glu Asn Thr Glu His Leu
545 550545 550
ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt gta cta aaa gac cgc aac aag ccg att ctg ttt acc atg gca 2403 Cys Val Leu Lys Asp Arg Asn Lys Pro Ile Leu Phe Thr Met Ala 555 560 565ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt cta aaa gac cgc aac ag ccg att ctg ttt acc atg gca 2403 Cys Val Leu Lys Asp L5 As5 Arg 5
agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat gcc 2451agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat gcc 2451
Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr AlaArg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr Wing
570 575 580570 575 580
aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499aag aac acts cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499
Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600Lys Asn Be Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600
gat agg aga aag gaa tec aag gac tta gaa gaa aag gct gaa atg aag 2547gat agg aag gaa tec aag gac tta gaa gaa aag gct gaa atg aag 2547
Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met LysAsp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met Lys
605 610 615605 610 615
aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttc aga 2595aag atg tat gaa ctt until gaa aaa tac aag ttg aat gga caa ttc aga 2595
Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe ArgLys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg
620 625 630620 625 630
tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2643tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2643
Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Arg 635 640 645Trp Ile Ser Be Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Arg 635 640 645
tat att tgc gac acc aag gga gca ttc gtt cag cct cct ata tac gag 2691tat att tgc gac acc aga gga gca ttc gtt cag cct cct ata tac gag 2691
Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Pro Ile Tyr Glu 650 655 660Tyr Ile Cys Asp Thr Lys Gly Phe Val Gln Pro Pro Ile Tyr Glu 650 655 660
gct ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 2739 Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro Thr 665 670 675 680gct ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 2739 Phe Gly Wing Leu Thr Val Val Glu Wing Met Thr Cys Wing Gly Leu Pro Thr 665 670 675 680
ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787
Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly Lys 685 690 695Phe Wing Thr Cys Tyr Gly Gly Pro Glu Wing Ile Ile Val His Gly Lys 685 690 695
tcg ggg ttc aac ate gat cct tat aac ggt gat ttg gct gcc gag acc 2835tcg ggg ttc aac till gat cct tat aac ggt gat ttg gct gcc gag acc 2835
Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala Glu Thr 10Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Read Wing Wing Glu Thr 10
700 705 710700 705 710
ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tct tat tgg gat 2883ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tct tat tgg gat 2883
Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp AspLeu Wing Asn Phe Phe Glu Lys Cys Lys Wing Asp Pro To Be Tyr Trp Asp
715 720 725715 720 725
gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931gag till tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931
Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr TrpGlu Ile Being Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp
730 735 740730 735 740
cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc ttt 2979cag att tac tcc gag aag cta ttg act ctc acc gga gt tat ggc ttt 2979
Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760
tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027
Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr IleSer Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile
765 770 775765 770 775
gaa atg ttg cat gea tgg atg tat aac aat cgg gtaaaacaat catgtctatt 3080 Glu Met Leu His Ala Trp Met Tyr Asn Asn Arg 780 785catg atg ttg cat gea tgg atg tat aac aat cgg gtaaaacaat catgtctatt 3080 Glu Met Leu His Ala Trp Met Tyr Asn Asn Arg 780 785
attattatta catcaaacat cattttactc tttcattgaa ctgaaatttt catgtttctg 3140 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3192attattatta catcaaacat cattttactc tttcattgaa ctgaaatttt catgtttctg 3140 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3192
Val Lys Thr Val Pro Leu Ala Val 790 795Val Lys Thr Val Pro Leu Wing Val 790 795
gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3245gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3245
GluGlu
2020
2525
ttgctattga agttttattt ttgcttcttt ttttcaattt cgtaaatgtt taatttcaac 3305 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3353ttgctattga agttttattt ttgcttcttt ttttcaattt cgtaaatgtt taatttcaac 3305 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3353
<210> 5 <211> 796 <212> PRT<210> 5 <211> 796 <212> PRT
<213> Gossypium barbadense<213> Gossypium barbadense
<400> 5<400> 5
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr 10 15 His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyMet Ala Ser Ile Be Val Cys Glu Arg Leu Gly Glu Ser Leu Wing Thr 10 15 His Pro Gln Gln Wing Lys Ser Ile Leu Be Arg Ile Glu Ser Leu Gly
25 3025 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluLys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg SerWing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu Arg Ser
50 55 6050 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu GluPhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu
100 105 110100 105 110
Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Asn Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Ser Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile HisGln Asp Lys Glu Being Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile His
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Ser Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Leu GlyRead Be Ser Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Read Gly
210 215 220210 215 220
Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Ala Asn Val Val Ile 260 265 270 Met Thr Pro His Gly Tyr Phe Ala Gln Tyr Asri Val Leu Gly Tyr ProLeu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Wing Asn Val Val Ile 260 265 270 Met Thr Pro His Gly Tyr Phe Wing Gln Tyr Asri Val Leu Gly Tyr Pro
275 280 285275 280 285
Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuAsp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu
290 295 300290 295 300
Glu Glu Glu Leu Leu His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Leu Read His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly ThrPro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr
325 330 335325 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro TrpIle Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp
355 360 365355 360 365
Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Wing Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala HisVal Gly Asn Tyr Be Asp Gly Asn Ile Val Wing Be Read Leu Wing His
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Thr Ile Ala His Ala Leu Glu Lys ThrLys Phe Asp Val Thr Gln Cys Thr Ile Wing His Wing Leu Glu Lys Thr
420 425 430420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480Asp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Ser Lys Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu Glu 515 520 525 Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrVal Ser Pro Gly Wing Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu Glu 515 520 525 Lys Arg Arg Read Lys Lys Lys Phe His Pro Glu Ile Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Ile Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Ile Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Ala Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Wing Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrGlu Wing Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro Tyr
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu LysLys Wing Asp Pro Being Tyr Trp Asp Glu Ile Being Gln Gly Gly Leu Lys
725 730 735725 730 735
Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuArg Ile Gln Lys Tyr Thr Trp Gln Ile Tyr Be Glu Lys Leu Leu
740 745 750740 745 750
Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln GluThr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu
755 760 765755 760 765
Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Ala Trp Met Tyr 770 775 780 Asn Asn Arg Val Lys Thr Val Pro Leu Ala Val Glu 785 790 795Gn Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Wing Trp Met Tyr 770 775 780 Asn Asn Arg Val Lys Thr Val
<210> 6 <211> 3352 <212> DNA<210> 6 <211> 3352 <212> DNA
<213> Gossypium barbadense <220><213> Gossypium barbadense <220>
<221> misc_feature <222> (32)..(34) <223> códon de iniciação <220><221> misc_feature <222> (32) .. (34) <223> initiation codon <220>
<221> éxon <222> (32)..(111)<221> exon <222> (32) .. (111)
<223> exonl <220><223> exonl <220>
<221> éxon <222> (197) .. (314)<221> exon <222> (197) .. (314)
<223> exon2 <220><223> exon2 <220>
<221> éxon<221> exon
<222> (394).. (545)<222> (394) .. (545)
<223> exon3 <22 0><223> exon3 <22 0>
<221> éxon<221> exon
<222> (625).. (817)<222> (625) .. (817)
<223> exon4 <220><223> exon4 <220>
<221> éxon<221> exon
<222> (897)..(1328)<222> (897) .. (1328)
223> exon5 220>223> exon5 220>
221> éxon <222> (1399).. (1689) <223> exon6 <220>221> exon <222> (1399) .. (1689) <223> exon6 <220>
<221> éxon <222> (1783) .. (1949) <223> exon7 <220><221> exon <222> (1783) .. (1949) <223> exon7 <220>
<221> éxon <222> (2050) .. (2274) <223> exon8 <220><221> exon <222> (2050) .. (2274) <223> exon8 <220>
<221> éxon<221> exon
<222> (2357)..(3059)<222> (2357) .. (3059)
<223> exon9<223> exon9
<220><220>
<221> éxon <222> (3168) .. (3194) <223> exonlO <22 0><221> exon <222> (3168) .. (3194) <223> exonlO <22 0>
<221> misc_feature <222> (3195) .. (3197) <223> Códon de terminação <4 00> 6<221> misc_feature <222> (3195) .. (3197) <223> Termination Codon <4 00> 6
cccttctgcc atttcaggaa cccaaaaaac a atg gct tca ate agt gtt tgtcccttctgcc atttcaggaa cccaaaaaac a atg gct tca till agt gtt tgt
Met Ala Ser Ile Ser Val Cys 1 5Met Ala Ser Ile Ser Val Cys 1 5
gag cgt ttg ggt gaa tet tta gct act cat cca cag cag gea aag tet Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys Sergag cgt ttg ggt gaa tet tta gct act cat cca cag cag gea aag tet Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Ala Lys Ser
15 2015 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt Ile Leu Ser Arg 25 tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206until ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt Ile Leu Ser Arg 25 tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206
Ile Glu Ser 30Ile Glu Ser 30
ctc gga aag ggt att cat aag tet caa aag ctg ctc tcg gtt ctc gat 254ctc gga aag ggt att cat cat aag tet caa aag ctg ctc tcg gtt ctc gat 254
Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp 40 45Leu Gly Lys Gly Ile His Lys Being Gln Lys Leu Leu Being Val Leu Asp 40 45
aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gtg gag gtc ctc 302aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gtg gag gtc ctc 302
Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu 50 55 60Lys Glu Wing Gly Asn Gln Wing Read Asp Gly Met Val Val Glu Val Leu 50 55 60
agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354
Arg Ser Thr Gln 65Arg Ser Thr Gln 65
atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 408atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 408
Glu Ala Val Val SerGlu Wing Val Val Ser
vograndfather
tet cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 456tet cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 456
Ser Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp GluSer Pro Leu Wing Val Leu Wing Ile Arg Ser Pro Wing Gly Val Trp Glu
75 80 8575 80 85
tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504
Tyr Ile Ala Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val 90 95 100Tyr Ile Val Glu Wing Val Gln Lys Leu Phe Val Glu Glu Met Pro Val 90 95 100
gct gag tat cta cgg ttg aag gaa gaa ctt gtt gat gga ag 545gct gag tat cta cgg ttg aag gaa gaa ct gt gat gga ag 545
Ala Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115Glu Wing Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115
gtaggtagtt ttctcttaca gctcctggtt tatgttatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655gtaggtagtt ttctcttaca gctcctggtt tatgttatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655
Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125
ttt ggt gea ttc aat aat tet gtt cct cgt cca tet ctt tca aag tcc Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser 130 135 140ttt ggt gea ttc aat aat tet gtt cct cgt cca tet ctt tca aag tcc Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Read Lys Ser 130 135 140
att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tet gcc aag ctaatt ggt aat ggc atg gac ttc ctc aac cgc cac ctt tet gcc aag cta
703703
751 Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu751 Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu
145 150 155145 150 155
ttt caa gac aag gag age ttg aac ttg ttg ctt gaa ttt ctc caa att 799tt caa gac aag gag age ttg aac ttg ttg ct gaa ttt ctc caa att 799
Phe Gln Asp Lys Glu Ser Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175Phe Gln Asp Lys Glu Being Read Asn Read Leu Read Leu Glu
cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847
His Cys Gln Lys Gly Lys 180His Cys Gln Lys Gly Lys 180
cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905 Gly Met Leuccttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905 Gly Met Leu
ttg aat gac aga ate caa gat gtg aat tcc ctc caa cat gea tta agg 953ttg aat gac aga until hunting gat gtg aat tcc ctc hunting cat gea tta agg 953
Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu ArgLeu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Wing Leu Arg
185 190 195 200185 190 195 200
aag gcc gag gag tat ctg act cct cta tcc tcg gat acc ccg tac tca 1001aag gcc gag gag ctg act cct cta tcc tcg gat acc ccg tac tca 1001
Lys Ala Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr SerLys Wing Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr Ser
205 210 215205 210 215
gtt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt 1049gt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt
Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp GlyVal Phe Glu Lys Arg Phe Read Gly Ile Gly Read Glu Lys Gly Trp Gly
220 225 230220 225 230
gat aat gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc 1097gat aat gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc 1097
Asp Asn Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp LeuAsp Asn Alu Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu
235 240 245235 240 245
ctt cag gea cct gat cct gtc gea ctt gaa age ttc ctt gga aga ate 1145ct cag gea cct gat cct gtc gea
Leu Gln Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Arg Ile 250 255 260Read Gln Wing Pro Asp Pro Val Wing Read Glu Ser Phe Leu Gly Arg Ile 250 255 260
cca ttg gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gcc 1193cca ttg gtc gcc aat gtt gta until atg act ccc cat gga tac ttt gcc 1193
Pro Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala 265 270 275 280Pro Leu Val Wing Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Wing 265 270 275 280
caa tac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241 Gln Tyr Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295 10caa tac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241 Gln Tyr Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295 10
1515
2020
2525
ate tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289 Ile Leu Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg Phetill tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289 Ile Leu Asp Gln Val Arg Wing Leu Glu Glu Glu Leu Leu His Arg Phe
300 305 310300 305 310
aag ctg caa gga ctc gac att acc cca cgC ate cta gtc gtgagtatct 1338aag ctg caa gga ctc gac att acc cca cgC until cta gtc gtgagtatct 1338
Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325
gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg etc ctc cct gat gct gta gga aca act tgc ggt cag cgt 1446gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg etc ctc cct gat gct gta aca act tgc ggt cag cgt 1446
Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln Arg 330 335 340Ile Thr Arg Read Leu Pro Asp Wing Val Gly Thr Thr Cys Gly Gln Arg 330 335 340
ctt gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494 Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Proctt gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494 Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro
345 350 355345 350 355
ttc aga act gag aag gga att gta cgt cca tgg ate tca cga ttc aaa 1542ttc aga act gag aag gga att gta cgt cca tgg until tca cga ttc aaa 1542
Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Ser Arg Phe LysPhe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Be Arg Phe Lys
360 355 370360 355 370
gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag ate 1590 Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Ile 375 380 385gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag until 1590 Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Wing Ala Glu Ile 375 380 385
acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 163 8acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 163 8
Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr SerThr Lys Glu Phe Gln Gly Lys Pro Asp Read Ile Val Gly Asn Tyr Ser
390 395 400 405390 395 400 405
gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686
Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lys Phe Asp Val ThrAsp Gly Asn Ile Val Wing Be Read Leu Wing His Lys Phe Asp Val Thr
410 415 420410 415 420
cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739
GlnGln
3030
tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794
Cys Thr Ile Ala 425Cys Thr Ile Wing 425
cat gea ctc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1842 15cat gea ctc gag aag ag aaa tac ccg gat tca gac att aac tgg aaa 1842 15
His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysHis Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
430 435 440430 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1890cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1890
Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp Leu 445 450 455Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu 445 450 455
att gct atg aac cat act gat ttt ate ate acc age acc ttc caa gag 1938att gct atg aac cat act gat ttt until till acc age acc ttc caa gag 1938
Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln GluIle Ala Met Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu
460 465 470460 465 470
att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989
Ile Ala Gly Ser 475Ile Wing Gly Ser 475
cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc ggc caa tac gag agt cac att gct ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro 480 485 490cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc gg ca ca ca tac gag ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Being His Ileu
ggg ctc tac cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc 2146ggg ctc tac cgc gtt gtt gat ggg till gat gtt ttt gat ccc aaa ttc 2146
Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe 495 500 505 510Gly Leu Tyr Arg Val Val Asp Gly Ile Val Val Asp Phe Asp Pro Lys Phe 495 500 505 510
aat att gtc tcc cct ggt gct gat atg age ata tac ttc cct tac acg 2194aat att gtc tcc cct ggt gct gat atg act ata tac ttc cct tac acg 2194
Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr ThrAsn Ile Val Ser Pro Gly Wing Asp Met Ser Ile Tyr Phe Pro Tyr Thr
515 520 525515 520 525
gaa gag aag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242gag gag ag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242
Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu LeuGlu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu
530 535 540530 535 540
ctt tac age cct gtt gag aat aca gag cac tt gtgagatttt ggtcctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leuct tac age cct gt gag aat aca gag cac tt gtgagatttt ggtcctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leu
545 550545 550
ttacaccctt tcactagatt ttgtgttcag tgttaacaat gtcttaaatt gttgtctcac 2354 ag a tgt gta cta aaa gac cgc aac aag ccg att ctg ttt acc atg gea 2402 Cys Val Leu Lys Asp Arg Asn Lys Pro Ile Leu Phe Thr Met Alattacaccctt tcactagatt ttgtgttcag tgttaacaat gtcttaaatt gttgtctcac 2354 ag a tgt gta cta aaa gac aag ccg att ctg ttt acc atg gea 2402 Cys Val Leu Lys Asp Arg Asn Lys Pro Ile Leu
555 560 565555 560 565
agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat gcc 2450agg ttg gac cga gtg aag aac tta act ggg ctc
2020
25 Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr Ala25 Arg Read Asp Arg Val Lys Asn Read Thr Gly Read Val Glu Phe Tyr Wing
570 575 580570 575 580
aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2498aag aac acts cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2498
Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600Lys Asn Be Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600
gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2546gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2546
Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met LysAsp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met Lys
605 610 615605 610 615
aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttc aga 2594aag atg tat gaa ctt till gaa aaa tac aag ttg aat gga caa ttc aga 2594
Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg 620 625 630Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg 620 625 630
tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2 642tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2,642
Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Arg 635 640 645Trp Ile Ser Be Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Arg 635 640 645
tat att tgc gac acc aag gga gea ttc gtt cag cct gct ata tac gag 2690 Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Ala Ile Tyr Glutat att tgc gac acc aag gga gea ttc gtt cag cct gct ata tac gag 2690 Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Ala Ile Tyr Glu
650 655 660650 655 660
gct ttt ggc ttg act gtt gtt gag gea atg acc tgt gga cta cca aca 2738gct ttt ggc ttg act gtt gtt gag gea atg acc tgt gga cta cca aca 2738
Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro Thr 665 670 675 680Phe Gly Leu Wing Thr Val Val Glu Wing Met Thr Cys Wing Gly Leu Thr Thr 665 670 675 680
ttt gea aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2786ttt gea aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2786
Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly LysPhe Wing Thr Cys Tyr Gly Gly Pro Glu Wing Ile Ile Val His Gly Lys
685 690 695685 690 695
tcg ggg ttc aac ate gat cct tat aac ggt gat ttg gct gcc gag acc 2834tcg ggg ttc aac until gat cct tat aac ggt gat ttg gct gcc gag acc 2834
Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala Glu Thr 700 705 710Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Read Wing Wing Wing Glu Thr 700 705 710
ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tet tat tgg gat 2882ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tet tat tgg gat 2882
Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp AspLeu Wing Asn Phe Phe Glu Lys Cys Lys Wing Asp Pro To Be Tyr Trp Asp
715 720 725715 720 725
gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2930gag till tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2930
Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740Glu Ile Being Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740
30 1030 10
cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc ttt 2978cag att tac tcc gag aag cta ttg act ctc acc gga gt tat ggc ttt 2978
Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760
tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3026tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3026
Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr IleSer Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile
765 770 775765 770 775
gaa atg ttg cat gca tgg atg tat aac aat cgg gtaaaacaat catgtctatt 3079 Glu Met Leu His Ala Trp Met Tyr Asn Asn Arggaa atg ttg cat gca tgg atg tat aac aat cgg gtaaaacaat catgtctatt 3079 Glu Met Leu His Wing Trp Met Tyr Asn Asn Arg
780 785780 785
attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt catgtttctg 3139 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3191attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt catgtttctg 3139 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3191
Val Lys Thr Val Pro Leu Ala Val 790 795Val Lys Thr Val Pro Leu Wing Val 790 795
gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3244gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3244
GluGlu
ttgctattga agttttattt ttgcttcttt ttttcaattt ggtaaatgtt taatttcaac 3304 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3352ttgctattga agttttattt ttgcttcttt ttttcaattt ggtaaatgtt taatttcaac 3304 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3352
<210> 7<210> 7
<211> 796<211> 796
<212> PRT<212> PRT
<213> Gossypium arboreum <4 00> 7<213> Gossypium arboreum <4 00> 7
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala ThrMet Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr
1 5 10 151 5 10 15
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyHis Pro Gln Gln Wing Lys Be Ile Read G Be Arg Ile Glu Be Read Gly
25 3025 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluLys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg SerWing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu Arg Ser
50 55 6050 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu GluPhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu
100 105 110100 105 110
Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Asn Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Ser Leu Leu Leu Glu Phe Leu Gln Ile HisGln Asp Lys Glu Asn Leu Be Leu Leu Leu Leu Glu Phe Leu Gln Ile His
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Thr Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Arg GluLeu Thr Be Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Arg Glu
210 215 220210 215 220
Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Thr Ile Pro Leu Val Ala Asn Val Val IleLeu Glu Ser Phe Leu Gly Thr Ile Pro Leu Val Wing Asn Val Val Ile
260 265 270260 265 270
Met Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr ProMet Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr Pro
275 280 285275 280 285
Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuAsp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu
290 295 300290 295 300
Glu Glu Glu Leu Leu His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Leu Read His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr 325 330 335Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr 325 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro TrpIle Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp
355 360 365355 360 365
Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Val Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Glu Wing Val Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala LeuVal Gly Asn Tyr Ser Asp Gly Asn Ile Val Wing Ser Leu Leu Wing Leu
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Thr Ile Ala His Ala Leu Glu Lys ThrLys Phe Asp Val Thr Gln Cys Thr Ile Wing His Wing Leu Glu Lys Thr
420 425 430420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480Asp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Ser Lys Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Val Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Val Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu 580 585 590Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu 580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Leu Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Read Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Ala Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Wing Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrGlu Wing Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro Tyr
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu LysLys Wing Asp Pro Being Tyr Trp Asp Glu Ile Being Gln Gly Gly Leu Lys
725 730 735725 730 735
Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuArg Ile Gln Lys Tyr Thr Trp Gln Ile Tyr Be Glu Lys Leu Leu
740 745 750740 745 750
Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln GluThr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu
755 760 765755 760 765
Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Ala Leu Met TyrGln Arg Gly Arg Lys Arg Tyr Ile Glu Met Read His Wing Read Met Tyr
770 775 780770 775 780
Asn Asn Arg Val Lys Thr Val Pro Leu Ala Val Glu 785 790 795Asn Asn Arg Val Lys Thr Val Pro Leu Wing Val Glu 785 790 795
<210> 8 <211> 3352 <212> DNA<210> 8 <211> 3352 <212> DNA
<213> Gossypium arboreum <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> <22 0> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222><213> Gossypium arboreum <220> <221> <222> <223> <220> <221> <222> <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> <22 0> <221> <222> <220> <221> <222> <223> <220> <221> <222>
<223> <220><223> <220>
éxonexon
(32) . . (111) exonl(32). . (111) exonl
misc_feature (32)..(34) ATG starcodonmisc_feature (32) .. (34) ATG starcodon
éxonexon
(197)..(314) exon2(197) .. (314) exon2
éxonexon
(390)..(541) exon3(390) .. (541) exon3
éxonexon
(621) .. (813) exon4(621) .. (813) exon4
éxonexon
(893)..(1324) exon5(893) .. (1324) exon5
éxonexon
(1395)..(1685) exon6(1395) .. (1685) exon6
éxonexon
(1779) . . (1945) exon7 10(1779). . (1945) exon7 10
<221> éxon <222> (2046)..(2270) <223> exon8 <220><221> exon <222> (2046) .. (2270) <223> exon8 <220>
< 2 21> éxon <222> (2355) .. (3057) <223> éxon 9 <220><2 21> exon <222> (2355) .. (3057) <223> exon 9 <220>
<221> éxon <222> (3162) .. (3188) <223> exonlO <220><221> exon <222> (3162) .. (3188) <223> exonlO <220>
<221> misc_feature <222> (3189) . . (3191) <223> Códon de terminação <400> 8<221> misc_feature <222> (3189). . (3191) <223> Termination Codon <400> 8
cccttctgcc atttcaggaa ccaaaaaagc a atg gct tca ate agt gtc tgt 52cccttctgcc atttcaggaa ccaaaaaagc a atg gct tca till agt gtc tgt 52
Met Ala Ser Ile Ser Val Cys 1 5Met Ala Ser Ile Ser Val Cys 1 5
gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100
Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys SerGlu Arg Read Gly Glu Be Read Ala Thr His Pro Gln Gln Ala Lys Ser
15 2015 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaag ttctggtctt 151until ttg tca ag gtataatgta aataagtgct gccttcaaag ttctggtctt 151
Ile Leu Ser ArgIle Read Ser Arg
2525
tctttgttat tttttgttac attgtttata tatgtcatat tgcag g att gaa age 206tctttgttat tttttgttac attgtttata tatgtcatat tgcag g att gaa age 206
Ile Glu Ser 30Ile Glu Ser 30
ctt gga aag ggt att cat aag tet caa aag ctg etc tca gtt ctc gat 254ct gga aag ggt att cat cat aag tet caa aag ctg etc tca gtt ctc gat 254
Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu AspLeu Gly Lys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp
40 4540 45
aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gta gag gtc ctc 302 15aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gta gag gtc ctc 302 15
Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val LeuLys Glu Wing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu
50 55 6050 55 60
agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtatgatg 3 54agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtatgatg 3 54
Arg Ser Thr GlnArg Ser Thr Gln
6565
agaatattgg actgatatgt taccattatc tatag gaa gct gta gtg tcg tct 407agaatattgg actgatatgt taccattatc tatag gaa gct gta gtg tcg tct 407
Glu Ala Val Val Ser Ser 70Glu Wing Val Val Ser Ser 70
cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag tac 455cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag tac 455
Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp Glu Tyr 75 80 85Pro Leu Val Wing Leu Wing Ile Arg Ser Wing Pro Gly Val Trp Glu Tyr 75 80 85
att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt gct 503att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt gct 503
Ile Ala Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val AlaIle Wing Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val Wing
90 95 10090 95 100
gag tat cta cgg ttg aag gaa gaa ctt gtt gat gga ag gtaggtagtt 551gag tat cta cgg ttg aag gaa gaa ct gt gat gga ag gtaggtagtt 551
Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser 105 110 115
ttctcttaca gctcctggtt tatgttatgt ttgagtttat gttgaacaat gctcaatgat 611 gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac ttt ggt gca 660ttctcttaca gctcctggtt tatgttatgt ttgagtttat gttgaacaat gctcaatgat 611 gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac ttt ggt gca 660
Ser Asn Gly Glu Phe Met Leu Glu Leu Asp Phe Gly Ala 120 125 130Ser Asn Gly Glu Phe Met Leu Glu Leu Asp Phe Gly Wing 120 125 130
ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc att ggt aat 708ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc att ggt aat 708
Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser Ile Gly AsnPhe Asn Asn Be Val Pro Arg Pro Be Read Be Lys Be Ile Gly Asn
135 140 145135 140 145
ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta ttt caa gac 756ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta ttt caa gac 756
Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe Gln AspGly Met Asp Phe Read Asn Arg His Read Be Ala Lys Read Phe Gln Asp
150 155 160150 155 160
aag gag aac ttg age ttg ttg ctt gaa ttt ctc caa att cac tgc cag 804aag gag aac ttg age ttg ttg ct gaa tt ctc caa att cac tgc cag 804
Lys Glu Asn Leu Ser Leu Leu Leu Glu Phe Leu Gln Ile His Cys GlnLys Glu Asn Leu Be Leu Leu Leu Glu Phe Leu Gln Ile His Cys Gln
165 170 175165 170 175
aag gga aag gtaaaagttt cactttcact atttgttttt cctttctgta 853aag gga aag gtaaaagttt cactttcact atttgttttt cctttctgta 853
Lys Gly LysLys Gly Lys
2020
2525
30 18030 180
tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg ttg aat 907tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg ttg aat 907
Gly Met Leu Leu Asn 185Gly Met Leu Leu Asn 185
gac aga ate caa gat gtg aat tcc ctc caa cat gea ttg agg aag gcc 955gac aga till caa gat gtg aat tcc ctc caa cat gea ttg agg aag gcc 955
Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu Arg Lys AlaAsp Arg Ile Gln Asp Val Asn Be Read Gln His Wing Read Le Arg Lys Wing
190 195 200190 195 200
gag gag tat ctg act cct cta acg tcg gat acc ccg tac tcg gtt ttc 1003gag gag tat ctg act cct cta acg tcg gat acc ccg tac tcg gtt ttc 1003
Glu Glu Tyr Leu Thr Pro Leu Thr Ser Asp Thr Pro Tyr Ser Val PheGlu Glu Tyr Leu Thr Pro Leu Thr Be Asp Thr Pro Tyr Be Val Phe
205 210 215205 210 215
gag aaa agg ttt cgg gag att ggt ttg gag aag gga tgg ggt gat aat 1051gag aaa agg ttt cgg gag att ggt ttg gag aag gga tgg ggt gat aat 1051
Glu Lys Arg Phe Arg Glu Ile Gly Leu Glu Lys Gly Trp Gly Asp AsnGlu Lys Arg Phe Arg Glu Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn
220 225 230220 225 230
gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc ctt cag 1099gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc ctt cag 1099
Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu Leu Gln 235 240 245 250Glu Wing His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu Leu Gln 235 240 245 250
gea cct gat cct gtc gea ctt gaa age ttc ctt gga aca ate cca ttg 1147gt cct gat cct gtc gea gt ct gaa age ttc ct gga aca till cca ttg 1147
Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Thr Ile Pro Leu 255 260 265Pro Wing Asp Pro Val Wing Wing Glu Ser Phe Leu Gly Thr Ile Pro Wing 255 260 265
gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gcc caa gac 1195 Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala Gln Aspgtc gcc aat gtt gta until atg act ccc cat gga tac ttt gcc caa gac 1195 Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala Gln Asp
270 275 280270 275 280
aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat ate tta 1243aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat till tta 1243
Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr Ile Leu 285 290 295Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr Ile Leu 285 290 295
gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc aag ctg 1291gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt
Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg Phe Lys LeuAsp Gln Val Arg Wing Leu Glu Glu Glu Leu Read His Arg Phe Lys Leu
300 305 310300 305 310
caa gga ctc gac att acc cca cga ate cta gtc gtgagtatct gaatattaat 1344 Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325caa gga ctc gac att acc cca cga till cta gtc gtgagtatct gaatattaat 1344 Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325
tcggtttcga tcattatgtt taatgcttaa tttggctgtt tatgtaccag att act 1400tcggtttcga tcattatgtt taatgcttaa tttggctgtt tatgtaccag att act 1400
30 Ile Thr30 Ile Thr
cgg ctc ctc cct gat gct gta gga aca act tgc ggt cag cgt ctt gag 1448cgg ctc ctc cct gat gct gta gga aca act tgc ggt cag cgt ctt gag 1448
Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln Arg Leu Glu 330 335 340Arg Leu Leu Pro Asp Wing Val Gly Thr Thr Cys Gly Gln Arg Leu Glu 330 335 340
aaa gtg tat gga acc aag tat tct gat att ctt cgg gta ccc ttc aga 1496aaa gtg tat gga acc aag tat tct gat att ctt cgg gta ccc ttc aga 1496
Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro Phe ArgLys Tyr Val Gly Thr Lys Tyr Ser Asp Ile Read Arg Val Pro Phe Arg
345 350 355345 350 355
act gag aag gga att gta cgt cca tgg ate tca cga ttc aaa gtc tgg 1544act gag aag gga att gta cgt cca tgg until tca cga ttc aaa gtc tgg 1544
Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Ser Arg Phe Lys Val Trp 360 365 370 375Glu Thr Lys Gly Ile Val Arg Pro Trp Ile Be Arg Phe Lys Val Trp 360 365 370 375
cct tac ctg gag act tac act aag gat gtt gea gct gag gtc acc aaa 1592cct tac ctg gag act tac act aag gat gtt gea gct gag gtc acc aaa 1592
Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Val Thr Lys 380 385 390Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Wing Glu Wing Val Thr Lys 380 385 390
gag ttc cag ggc aaa cct gat ctg att gtt ggg aac tac agt gat gga 1640 Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser Asp Glygag ttc cag ggc aaa cct gat ctg att gtt ggg aac tac agt gat gga 1640 Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser Asp Gly
395 400 405395 400 405
aac att gtt gct tct tta ttg gea ctg aag ttt gat gtt aca cag 1685aac att gtt gt tt tt tt ttg gea ctg aag ttt gat gtt aca cag 1685
Asn Ile Val Ala Ser Leu Leu Ala Leu Lys Phe Asp Val Thr Gln 410 415 420Asn Ile Val Wing Be Read Leu Wing Leu Lys Phe Asp Val Thr Gln 410 415 420
gtttgttaaa tgccaattaa ttgaacaagt tcctttctgt attttgattt ttcccgtgca 1745 atctaaaaac ttggttttca atgttttatg cag tgc act att gct cat gea ctt 1799gtttgttaaa tgccaattaa ttgaacaagt tcctttctgt attttgattt ttcccgtgca 1745 atctaaaaac ttggttttca atgttttatg cag tgc act att gct cat gea ctt 1799
Cys Thr Ile Ala His Ala Leu 425Cys Thr Ile Wing His Wing Leu 425
gag aag acg aaa tac ccg gat tca gac att aac tgg aaa caa ctt gag 1847 Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu 430 435 440 445gag aag acg aaa tac ccg gat tca gac att aac tgg aaa caa ctt gag 1847 Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu 430 435 440 445
gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt att gct atg 1895gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt att gct atg 1895
Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met 450 455 460Asp Lys Tyr His Phe Be Cys Gln Phe Thr Wing Asp Leu Ile Wing Met 450 455 460
aac cat act gat ttt ate ate acc age acc ttc caa gag att gct gga 1943aac cat act gat ttt until till acc age acc ttc caa gag att gct gga 1943
Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly 10Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly 10
465 470 475465 470 475
ag gtaatctatc tcccacttta ctacatttgg ttcccttgta cgttgtaaat 1995ag gtaatctatc tcccacttta ctacatttgg ttcccttgta cgttgtaaat 1995
SerTo be
gttgccacct atttgagtcc ctaacaatgt ttccattcca tgattttcag c aag gac 2052gttgccacct atttgagtcc ctaacaatgt ttccattcca tgattttcag c aag gac 2052
Lys Asp 480Lys Asp 480
act ctc ggc caa tac gag agt cac ate gct ttc act ctt cca ggg ctc 2100act ctc ggc caa tac gag agt cac up to gct ttc act ctt cca ggg ctc 2100
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
tac cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc aat att 2148tac cgc gtt gtt gat ggg until gat gtt ttt gat ccc aaa ttc aat att 2148
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
gtc tcc cet ggt gct gat atg age ata tac ttc cct tac acg gaa gag 2196gtc tcc cet ggt gct gat atg age atta tac ttc cct tac acg gaa gag 2196
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu Glu 515 520 525Val Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu 515 520 525
aag cga agg ttg aag aag ttc cac ccg gag att gaa gag ctt ctt tac 2244aag cga agg ttg aag aag ttc cac ccg gag att gaa gag ctt tac 2244
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
age cct gtt gag aat aca gag cac tt gtaagatttt ggtcctagtt 2290age cct gtt gag aa aca gag cac tt gtaagatttt ggtcctagtt 2290
Ser Pro Val Glu Asn Thr Glu His Leu 545 550Ser Pro Val Glu Asn Thr Glu His Leu 545 550
ttacacccgt tttcaccgga ttttgtgttc agtgttaaca atgtcttaaa ttgttgtctc 2350 acag a tgt gta cta aaa gac cgc aac aag ccg gtt ctg ttt acc atg 2397ttacacccgt tttcaccgga ttttgtgttc agtgttaaca atgtcttaaa ttgttgtctc 2350 acag a tgt gta cta aaa gac aac aag ccg gtt ctg ttt acc atg 2397
Cys Val Leu Lys Asp Arg Asn Lys Pro Val Leu Phe Thr Met 555 560 565Cys Val Leu Lys Asp Arg Asn Lys Pro Val Leu Phe Thr Met 555 560 565
gea agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat 2445gea agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat 2445
Ala Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe TyrWing Arg Read Asp Arg Val Lys Asn Read Thr Thr Gly Read Val Glu Phe Tyr
570 575 580570 575 580
gcc aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt 2493gcc aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt 2493
Ala Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly 585 590 595Wing Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly 585 590 595
2020
2525
30 1030 10
1515
gga gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg 2541gga gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg 2541
Gly Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met 600 605 610 615Gly Asp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met 600 605 610 615
aag aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttg 2589aag aag atg tat gaa ctt until gaa aaa tac aag ttg aat gga caa ttg 2589
Lys Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln LeuLys Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Leu
620 625 630620 625 630
aga tgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat 2637aga tgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat 2637
Arg Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu TyrArg Trp Ile Being Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr
635 640 645635 640 645
cgt tat att tgc gac acc aag gga gea ttc gtt cag cct gct ata tac 2685cgt tat att tgc gac acc aga gga gea ttc gtt cag cct gct ata tac 2685
Arg Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Ala Ile TyrArg Tyr Ile Cys Asp Thr Lys Gly Phe Val Gln Pro Wing Ile Tyr
650 655 660650 655 660
gag gct ttt ggc ttg act gtt gtt gag gea atg acc tgt gga cta cca 2733gg gt tt gg tt gt act gtt gt gt gt atg acc tgt gga cta cca 2733
Glu Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu ProGlu Wing Phe Gly Leu Thr Val Val Glu Wing Met Thr Cys Gly Leu Pro
665 670 675665 670 675
aca ttt gea aca tgt tac ggg ggc cct gct gag att ata gtt cac gga 2781aca ttt gea aca tgt tac ggg ggc cct gct gag att ata gtt cac gga
Thr Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly 680 685 690 695Thr Phe Wing Thr Cys Tyr Gly Gly Pro Wing Glu Ile Ile Val His Gly 680 685 690 695
aaa tcg ggg ttc aac ate gac cct tat aac ggt gat ttg gct gcc gag 2829aaa tcg ggg ttc aac until gac cct tat aac ggt gat ttg gct gcc gag 2829
Lys Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala GluLys Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Read Wing Wing Glu
700 705 710700 705 710
acc ctt gcc aat ttc ttc gag aaa tgc aaa gcg gat cca tet tat tgg 2877acc ctt gcc aat ttc ttc gag aaa tgc aaa gcg gat cca tet tat tgg 2877
Thr Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr TrpThr Leu Wing Asn Phe Phe Glu Lys Cys Lys Wing Asp Pro Be Tyr Trp
715 720 725715 720 725
gat gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca 2925gat gag till tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca 2925
Asp Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr ThrAsp Glu Ile Be Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr
730 735 740730 735 740
tgg cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc 2973tgg cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc 2973
Trp Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr GlyTrp Gln Ile Tyr Be Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly
745 750 755745 750 755
ttt tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac 3021ttt tcg aaa cat gta cgt tac cag gag caa cgt ggg cgc aag cgt tac 3021
Phe Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg TyrPhe Ser Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr
2525
30 1030 10
760 765 770 775760 765 770 775
att gaa atg ttg cat gca ctg atg tat aac aat cgg gtaaaacagt 3067att gaa atg ttg cat gca ctg atg tat aac cgg gtaaaacagt 3067
Ile Glu Met Leu His Ala Leu Met Tyr Asn Asn ArgIle Glu Met Leu His Wing Leu Met Tyr Asn Asn Arg
780 785780 785
catgtctatt attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt 3127 catatttctg aatgtttaat tgtgtgcttt acag gtc aag act gtt cct cta gct 3182catgtctatt attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt 3127 catatttctg aatgtttaat tgtgtgcttt acag gtc aag act gtt cct cta gct 3182
Val Lys Thr Val Pro Leu Ala 790Val Lys Thr Val Pro Leu Wing 790
gtt gag taaagaaaaa gcaaaaagct gcttggaaac taaggaattt ccaagaatga 3238gtt gag taaagaaaaa gcaaaaagct gcttggaaac taaggaattt ccaagaatga 3238
Val Glu 795Val Glu 795
ctgctgttga agttttattt ttgcttttat tatttttttt ttcaattggt aaatgtttaa 3298 tttcaacttt gggtttcccc ttggaatgtt tcgaactctg ggtttggtcc catt 3352ctgctgttga agttttattt ttgcttttat tatttttttt ttcaattggt aaatgtttaa 3298 tttcaacttt gggtttcccc ttggaatgtt tcgaactctg ggtttggtcc catt 3352
<210> 9<210> 9
<211> 796<211> 796
<212> PRT<212> PRT
<213> Gossypium raimondii <4 00> 9<213> Gossypium raimondii <4 00> 9
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala ThrMet Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr
1 5 10 151 5 10 15
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyHis Pro Gln Gln Wing Lys Be Ile Read G Be Arg Ile Glu Be Read Gly
25 3025 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys Glu 40 45Lys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu 40 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg Ser 50 55 60Gly Wing Asn Gln Wing Read Asp Gly Met Val Val Glu Val Leu Arg Ser 50 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Leu Glu Phe Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Leu Glu Phe Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu Glu 100 105 110 Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PhePhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu 100 105 110 Leu Val Asp Gly Be Ser Asn Gly Glu Phe Met Leu Glu Leu Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile ArgGln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile Arg
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Ser Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Leu GlyRead Be Ser Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Read Gly
210 215 220210 215 220
Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Ala Asn Val Val IleRead Glu Ser Phe Read Gly Arg Ile Pro Read Val Wing Asn Val Val Ile
260 265 270260 265 270
Met Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr ProMet Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr Pro
275 280 285275 280 285
Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuAsp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu
290 295 300290 295 300
Glu Glu Glu Leu Leu His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Leu Read His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly ThrPro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr
325 330 335325 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp 355 360 365 Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Leu Arg Val Pro Phe Arg Glu Thr Lys Gly Ile Val Arg Pro Trp 355 360 365 Ile Be Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Wing Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala HisVal Gly Asn Tyr Be Asp Gly Asn Ile Val Wing Be Read Leu Wing His
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Ser Ile Ala His Ala Leu Glu Lys ThrLys Phe Asp Val Thr Gln Cys Ser Ile Wing His Wing Leu Glu Lys Thr
420 425 430420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480Asp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Ser Lys Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Ile Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Ile Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu Lys 610 615 620 Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Leu Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Leu Ile Glu Lys 610 615 620 Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Ala Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Wing Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrGlu Wing Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro Tyr
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu LysLys Wing Asp Pro Being Tyr Trp Asp Glu Ile Being Gln Gly Gly Leu Lys
725 730 735725 730 735
Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuArg Ile Gln Lys Tyr Thr Trp Gln Ile Tyr Be Glu Lys Leu Leu
740 745 750740 745 750
Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln GluThr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu
755 760 765755 760 765
Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Ala Leu Met TyrGln Arg Gly Arg Lys Arg Tyr Ile Glu Met Read His Wing Read Met Tyr
770 775 780770 775 780
Asn Asn Arg Val Lys Thr Ala Pro Leu Ala Val Glu 785 790 795Asn Asn Arg Val Lys Thr Wing Pro Leu Wing Val Glu 785 790 795
<210> 10 <211> 3354 <212> DNA<210> 10 <211> 3354 <212> DNA
<213> Gossypium raimondii <220><213> Gossypium raimondii <220>
<221> éxon <222> (32).. (111) <223> exonl <220><221> exon <222> (32) .. (111) <223> exonl <220>
<221> misc_feature <222> (32) . . (34) <223> códon de iniciação <22 0><221> misc_feature <222> (32). . (34) <223> initiation codon <22 0>
<221> éxon<221> exon
<222> (197)..(314)<222> (197) .. (314)
<223> éxon 2 <220><223> exon 2 <220>
<221> éxon<221> exon
<222> (394) . . (545)<222> (394). . (545)
<223> éxon 3 <220><223> exon 3 <220>
<221> éxon<221> exon
<222> (625) . . (817)<222> (625). . (817)
<223> exon4 <220><223> exon4 <220>
<221> éxon<221> exon
<222> (897)..(1328)<222> (897) .. (1328)
<223> éxon 5 <220><223> exon 5 <220>
<221> éxon<221> exon
<222> (1399)..(1689)<222> (1399) .. (1689)
<223> éxon 6 <22 0><223> exon 6 <22 0>
<221> éxon<221> exon
<222> (1783)..(1949)<222> (1783) .. (1949)
<223> éxon 7 <220><223> exon 7 <220>
<221> éxon<221> exon
<222> (2050)..(2274)<222> (2050) .. (2274)
<223> éxon 8 <220><223> exon 8 <220>
<221> éxon 15<221> exon 15
<222> (2358)..(3060) <223> éxon 9 <220><222> (2358) .. (3060) <223> exon 9 <220>
<221> éxon <222> (3169)..(3195) <223> éxon 10 <220><221> exon <222> (3169) .. (3195) <223> exon 10 <220>
<221> misc_feature <222> (3196) .. (3198) <223> Códon de terminação <400> 10<221> misc_feature <222> (3196) .. (3198) <223> Termination Codon <400> 10
cccttctgcc atttcaggaa cctaaaaaac a atg gct tca ate agt gtt tgt 52cccttctgcc atttcaggaa cctaaaaaac atg gct tca till agt gtt tgt 52
Met Ala Ser Ile Ser Val Cys 1 5Met Ala Ser Ile Ser Val Cys 1 5
gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100
Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys SerGlu Arg Read Gly Glu Be Read Ala Thr His Pro Gln Gln Ala Lys Ser
15 2015 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151until ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151
Ile Leu Ser Arg 25Ile Leu Ser Arg 25
tctttgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206tctttgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206
Ile Glu Ser 30Ile Glu Ser 30
etc gga aag ggt att cat aag tet caa aag ctg etc tcg gtt ctc gat 254gg aag ggt att cat cat aag tet caa aag ctg etc tcg gtt ctc gat 254
Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu AspLeu Gly Lys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp
40 4540 45
aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gtg gag gtc ctc 302aaa gag gcc gga aat caa gea ctt gat ggg atg gtg gtg gag gtc ctc 302
Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val LeuLys Glu Wing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu
50 55 6050 55 60
agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354
Arg Ser Thr Gln 65Arg Ser Thr Gln 65
2020
30 1530 15
atgagaatat tggactgata tgttaccgct atctaatag gaa gct gta gtg tcg 408atgagaatat tggactgata tgttaccgct atctaatag gaa gct gta gtg tcg 408
Glu Ala Val Val Ser 70Glu Wing Val Val Ser 70
tct cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 456tct cca ttg gtt gcc ctt gcc att cgt
Ser Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp Glu 75 80 85Ser Pro Leu Wing Val Leu Wing Ile Arg Ser Pro Wing Gly Val Trp Glu 75 80 85
tac att gct ctg gag ttc caa aag ctt ttt gtg gag gaa atg ccc gtt 504tac att gct ctg gag ttc caa aag ctt tt gtg gag gaa atg ccc gtt 504
Tyr Ile Ala Leu Glu Phe Gln Lys Leu Phe Val Glu Glu Met Pro Val 90 95 100Tyr Ile Wing Read Glu Phe Gln Lys Read Phe Val Glu Glu Met Pro Val 90 95 100
gct gag tat cta cgg ttg aag gaa gaa ctt gtt gat gga ag 545gct gag tat cta cgg ttg aag gaa gaa ct gt gat gga ag 545
Ala Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly SerGlu Wing Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser
105 110 115105 110 115
gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655
Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125
ttt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 703tt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 703
Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys SerPhe Gly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Be Lys Be
130 135 140130 135 140
att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751
Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys LeuIle Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu To Be Ala Lys Leu
145 150 155145 150 155
ttt caa gac aag gag aac ttg aac ttg ttg ctt gaa ttt ctc caa att 799ttt caa gac aag gag aac ttg aac ttg ttg ct gaa ttt ctc caa att 799
Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175
cgc tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847cgc tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847
Arg Cys Gln Lys Gly Lys 180Arg Cys Gln Lys Gly Lys 180
cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905
Gly Met LeuGly Met Leu
2020
ttg aat gac aga ate caa gat gtg aat tcc ctc caa cat gca ttg aggttg aat gac aga until hunting gat gtg aat tcc ctc hunting cat gca ttg agg
953 Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu Arg 185 190 195 200953 Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Wing Leu Arg 185 190 195 200
aag gcc gag gag tat ctg act cct cta tcg tcg gat acc ccg tac tca 1001aag gcc gag gag tat ctg act cct cta tcg tcg gat acc ccg tac tca 1001
Lys Ala Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr Ser 205 210 215Lys Wing Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr Ser 205 210 215
gtt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggc 1049 Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp Glygt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggc 1049 Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp Gly
220 225 230220 225 230
gat aat gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc 1097gat aat gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc 1097
Asp Asn Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu 235 240 245Asp Asn Glu Wing His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu 235 240 245
ctt cag gea cct gat cct gtc gea ctt gaa age ttc ctt gga aga ate 1145ct cag gea cct gat cct gtc gea
Leu Gln Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Arg Ile 250 255 260Read Gln Wing Pro Asp Pro Val Wing Read Glu Ser Phe Leu Gly Arg Ile 250 255 260
cca ttg gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gcc 1193 Pro Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala 265 270 275 280cca ttg gtc gcc aat gtt gta until atg act ccc cat gga tac ttt gcc 1193 Pro Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala 265 270 275 280
caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241
Gln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295Gln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295
ate tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289till tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc 1289
Ile Leu Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg PheIle Leu Asp Gln Val Arg Wing Leu Glu Glu Glu Leu Read His Arg Phe
300 305 310300 305 310
aag ctg caa gga ctc gac att acc cca cgt ate cta gtc gtgagtatct 1338aag ctg caa gga ctc gac att acc cca cgt until cta gtc gtgagtatct 1338
Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325
gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc ctc cct gat gct gta gga aca act tgc gga cag cgt 1446gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc cct gat gct gta gga aca act tgc gga cag cgt 1446
Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln Arg 330 335 340Ile Thr Arg Read Leu Pro Asp Wing Val Gly Thr Thr Cys Gly Gln Arg 330 335 340
ctt gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494ct gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494
Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 10Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 10
1515
345 350345 350
ttc aga act gag aag gga att gta cgt cca tgg atettc aga act gag aag gga att gta cgt cca tgg until
Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile 360 365Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile 360 365
gcc tgg cct tac ctg gag act tac act aag gat gtt Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val 375 380 385gcc tgg cct tac ctg gag act tac act aag gat gtt Val Trp Pro Tyr Leu Glu Thr Tyr Lys Asp Val 375 380 385
acc aaa gag ttc cag ggc aag cct gat ctg att gtt Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile Val 390 395 400acc aaa gag ttc cag ggc aag cct gat ctg att gtt Thr Lys Glu Phe Gln Gly Lys Pro Asp
gat gga aac att gtt gct tet tta ttg gea cat aag Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lysgat gga aac att gtt gct tet tta ttg gea cat aag Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lys
410 415410 415
cag gtttgttaaa tgccaattaa ttgaacaagt tccttgctgt Glncag gtttgttaaa tgccaattaa ttgaacaagt tccttgctgt Gln
355355
tca cga ttc aaa 1542tca cga ttc aaa 1542
Ser Arg Phe LysTo be Arg Phe Lys
370370
gea gct gag ate 1590 Ala Ala Glu Ilegea gct gag until 1590 Ala Ala Glu Ile
ggg aac tac agt 1638 Gly Asn Tyr Ser 405ggg aac tac agt 1638 Gly Asn Tyr Ser 405
ttt gat gtt aca 1686 Phe Asp Val Thr 420ttt gat gtt aca 1686 Phe Asp Val Thr 420
attttgatgt 1739attttgatgt 1739
2020
2525
3030
tcctcgtgca atctaaaaac ttggttttca atgttttatg cag tgc tet att gct 1794tcctcgtgca atctaaaaac ttggttttca atgttttatg cag tgc tet att gct 1794
Cys Ser Ile Ala 425Cys Ser Ile Wing 425
cat gea etc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1842cat gea etc gag aag aca aaa tac ccg gat tca gac att aac tgg aaa 1842
His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysHis Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
43O 435 44043O 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1890cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1890
Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp LeuGln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu
445 450 455445 450 455
att gct atg aac cat act gat ttt ate ate acc age acc ttc caa gag 1938 Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu 460 465 47Qatt gct atg aac cat act gat ttt till till acc act acc ttc caa gag 1938 Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu 460 465 47Q
att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989
Ile Ala Gly SerIle Wing Gly Ser
475475
cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc ggc caa tac gag agt cac att gct ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro 480 485 490cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc gg ca ca ca tac gag ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ileu 480
ggt ctc tac cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc 2146ggt ctc tac cgc gtt gtt gat ggg till gat gtt ttt gat ccc aaa ttc 2146
Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe 495 500 505 510Gly Leu Tyr Arg Val Val Asp Gly Ile Val Val Asp Phe Asp Pro Lys Phe 495 500 505 510
aat att gtc tcc cct ggt gct gat atg age ata tac ttc cct tac acg 2194aat att gtc tcc cct ggt gct gat atg act ata tac ttc cct tac acg 2194
Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr 515 520 525Asn Ile Val Ser Pro Gly Wing Asp Met Ser Ile Tyr Phe Pro Tyr Thr 515 520 525
gaa gag aag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242 Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leuga gag aag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242 Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu
530 535 540530 535 540
ctt tac age cct gtt gag aat aca gag cac tt gtgagatttt ggtcctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leu 545 550ct tac age cct gt gag aat aca gag cac tt gtgagatttt ggtcctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leu 545 550
ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt gta cta aaa gac cgc aac aag ccg att ctg ttt acc atg gea 2403 Cys Val Leu Lys Asp Arg Asn Lys Pro Ile Leu Phe Thr Met Ala 555 560 565ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt cta aaa gac cgc aac ag ccg att ctg ttt acc atg gea 2403 Cys Val Leu Lys Asp Arg Asn P5 555
agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat gcc 2451 Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr Alaagg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat gcc 2451 Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr Ala
570 575 580570 575 580
aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499aag aac acts cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499
Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600Lys Asn Be Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600
gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2547gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2547
Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met LysAsp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met Lys
605 610 615605 610 615
aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttc aga 2595aag atg tat gaa ctt until gaa aaa tac aag ttg aat gga caa ttc aga 2595
Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg 620 625 630Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg 620 625 630
tgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat cgt 2643 Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Argtgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat cgt 2643 Trp Ile Ser Ser Gln Met Asn Arg Val Asn Gly Glu Leu Tyr Arg
635 640 645635 640 645
tat att tgc gac acc aag gga gca ttc gtt cag cct gct ata tac gag 2691tat att tgc gac acc aga gga gca ttc gtt cag cct gct ata tac gag 2691
Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Ala Ile Tyr Glu 650 655 660Tyr Ile Cys Asp Thr Lys Gly Wing Phe Val Gln Pro Ile Wing Tyr Glu 650 655 660
gct ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 273 9 Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro Thr 665 670 675 680gct ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 273 9 Phe Gly Wing Leu Thr Val Val Glu Wing Met Thr Cys Wing Gly Leu Pro Thr 665 670 675 680
ttt gca aca tgc tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787ttt gca aca tgc tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787
Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly LysPhe Wing Thr Cys Tyr Gly Gly Pro Glu Wing Ile Ile Val His Gly Lys
685 690 695685 690 695
tcg ggg ttc aac ate gat cct tat aac ggt gat ttg gct gcc gag acc 2835tcg ggg ttc aac till gat cct tat aac ggt gat ttg gct gcc gag acc 2835
Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala Glu Thr 700 705 710Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Read Wing Wing Wing Glu Thr 700 705 710
1 5 ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tet tat tgg gat 2883 Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp Asp1 5 ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tet tat tgg gat 2883 Leu Asa Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp Asp
715 720 725715 720 725
gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931 Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740gag till tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931 Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740
cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc ttt 2979cag att tac tcc gag aag cta ttg act ctc acc gga gt tat ggc ttt 2979
Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760
tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027
Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr IleSer Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile
765 770 775765 770 775
gaa atg ttg cat gca ttg atg tat aac aat cgg gtaaaacaat catgtctatt 3080 Glu Met Leu His Ala Leu Met Tyr Asn Asn Arg 780 785gaa atg ttg cat gca ttg atg tat aac aat cgg gtaaaacaat catgtctatt 3080 Glu Met Leu His Ala Leu Met Tyr Asn Asn Arg 780 785
attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt catgtttctg 3140 aatgtttgtt taattgtgtg ttttacag gtc aag act gct cca cta gct gtt 3192attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt catgtttctg 3140 aatgtttgtt taattgtgtg ttttacag gtc aag act gct cca gct gtt 3192
Val Lys Thr Ala Pro Leu Ala Val 790 795Val Lys Thr Wing Pro Read Wing Val 790 795
gag taaagaaaaa gcaaaaggct gcttggaaac taaggaattt ccaagaatga 3245gag taaagaaaaa gcaaaaggct gcttggaaac taaggaattt ccaagaatga 3245
GluGlu
ttactattga agttttattt ttgcttcttt tttttcaatt tggtaaatgt ttaatttcaa 3305 ctttggggtt tccccttgga atgttttgaa ctctgggttt ggtcccatt 3354ttactattga agttttattt ttgcttcttt tttttcaatt tggtaaatgt ttaatttcaa 3305 ctttggggtt tccccttgga atgttttgaa ctctgggttt ggtcccatt 3354
<210> 11 <211> 796 <212> PRT <213> Gossypium Iongicalyx <4 00> 11<210> 11 <211> 796 <212> PRT <213> Gossypium Iongicalyx <4 00> 11
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr 10 15Met Wing Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Wing Thr 10 15
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyHis Pro Gln Gln Wing Lys Be Ile Read G Be Arg Ile Glu Be Read Gly
20 25 3020 25 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluLys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg Ser 50 55 60Gly Wing Asn Gln Wing Read Asp Gly Met Val Val Glu Val Leu Arg Ser 50 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu Glu 100 105 110Phe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu 100 105 110
Leu Val Asp Gly Ser Ser Lys Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Lys Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser Ile 130 135 140Gly Wing Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile 130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile His 165 170 175Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile His 165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Thr Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Arg GluLeu Thr Be Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Arg Glu
210 215 220210 215 220
Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Ala Asn Val Val IleRead Glu Ser Phe Read Gly Arg Ile Pro Read Val Wing Asn Val Val Ile
260 265 270260 265 270
Met Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr ProMet Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr Pro
275 280 285275 280 285
Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuAsp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu
290 295 300290 295 300
Glu Glu Glu Leu Leu His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Leu Read His Arg Phe Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly ThrPro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr
325 330 335325 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro TrpIle Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp
355 360 365355 360 365
Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Wing Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala HisVal Gly Asn Tyr Be Asp Gly Asn Ile Val Wing Be Read Leu Wing His
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Thr Ile Ala His Ala Leu Glu Lys Thr 420 425 430Lys Phe Asp Val Thr Gln Cys Thr Ile Wing His Wing Leu Glu Lys Thr 420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Val Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480Asp Phe Val Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Ser Lys Asp 465 470 475 480
Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuThr Leu Gly Gln Tyr Glu Being His Ile Wing Phe Thr Leu Pro Gly Leu
485 490 495485 490 495
Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleTyr Arg Val Val Asp Gly Ile Val Val Asp
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Val Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Val Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Ala Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Wing Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly Pro 675 680 685 Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro TyrWing Met Thr Cys Gly Leu Pro Thr Phe Wing Thr Cys Tyr Gly Gly Pro 675 680 685 Wing Glu Ile Ile Val
69O 695 70069O 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 ΊΟ 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 ΊΟ 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu LysLys Wing Asp Pro Being Tyr Trp Asp Glu Ile Being Gln Gly Gly Leu Lys
725 730 735725 730 735
Arg Ile Gln Glu Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuArg Ile Gln Lys Tyr Thr Trp Gln Ile Tyr Be Glu Lys Leu Leu
740 745 750740 745 750
Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln Glu 755 760 765Thr Leu Thr Gly Val Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu 755 760 765
Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His Ala Leu Met Tyr 770 775 780Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Read His Wing Read Met Tyr 770 775 780
Asn Asn Arg Val Lys Thr Val Pro Leu Ala Val Glu 785 790 795Asn Asn Arg Val Lys Thr Val Pro Leu Wing Val Glu 785 790 795
<210> 12 <211> 3353 <212> DNA<210> 12 <211> 3353 <212> DNA
<213> Gossypium longicalyx <220><213> Gossypium longicalyx <220>
<221> éxon <222> (32).. (111) <223> éxon 1 <220><221> exon <222> (32) .. (111) <223> exon 1 <220>
<221> misc_feature <222> (32)..(34) <223> Códon de iniciação ATG <220><221> misc_feature <222> (32) .. (34) <223> ATG initiation codon <220>
<221> éxon <222> (195).. (312) 223> éxon 2 220> <221> éxon <222> (392).. (543) <223> éxon 3 <220><221> exon <222> (195) .. (312) 223> exon 2 220> <221> exon <222> (392) .. (543) <223> exon 3 <220>
<221> éxon <222> (623).. (815) <223> éxon 4 <220><221> exon <222> (623) .. (815) <223> exon 4 <220>
<221> éxon <222> (891) .. (1322) <223> éxon 5 <220><221> exon <222> (891) .. (1322) <223> exon 5 <220>
<221> éxon <222> (1393)..(1683) <223> éxon 6 <22 0><221> exon <222> (1393) .. (1683) <223> exon 6 <22 0>
<221> éxon <222> (1777) .. (1943) <223> éxon 7 <220><221> exon <222> (1777) .. (1943) <223> exon 7 <220>
<221> éxon <222> (2044) .. (2268) <223> éxon 8 <220><221> exon <222> (2044) .. (2268) <223> exon 8 <220>
<221> éxon <222> (2353)..(3055) <223> éxon 9 <220><221> exon <222> (2353) .. (3055) <223> exon 9 <220>
<221> éxon <222> (3160) .. (3186) <223> éxon 10 <220> 10<221> exon <222> (3160) .. (3186) <223> exon 10 <220> 10
1515
<221> misc_feature <222> (3187)..(3189) <223> Códon de terminação <400> 12<221> misc_feature <222> (3187) .. (3189) <223> Termination Codon <400> 12
cccttctgcc atttcaggaa cctaaaaaac a atg gct tca ate agt gtc tgt 52cccttctgcc atttcaggaa cctaaaaaac a atg gct tca till agt gtc tgt 52
Met Ala Ser Ile Ser Val Cys 1 5Met Ala Ser Ile Ser Val Cys 1 5
gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100gag cgt ttg ggt gaa tet cta gct act cat cca cag cag gea aag tet 100
Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys SerGlu Arg Read Gly Glu Be Read Ala Thr His Pro Gln Gln Ala Lys Ser
15 2015 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaag ttctggtctt 151until ttg tca ag gtataatgta aataagtgct gccttcaaag ttctggtctt 151
Ile Leu Ser Arg 25Ile Leu Ser Arg 25
tctttgttat tttttattac attgtttata tgtcatattg cag g att gaa age ctt 207tctttgttat tttttattac attgtttata tgtcatattg cag g att gaa age ctt 207
Ile Glu Ser Leu 30Ile Glu Ser Leu 30
gga aag ggt att cat aag tet caa aag ctg etc tcg gtt ctc gat aaa 255gga aag ggt att cat cat aag tet caa aag ctg etc tcg gtt ctc gat aaa 255
Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp LysGly Lys Gly Ile His Lys Be Gln Lys Read Leu Read Val Leu Asp Lys
40 4540 45
gag gcc gga aat caa gea ctt gat ggg atg gtg gtg gag gtc ctc agg 3 03gag gcc gga aa caa gea ctt gat ggg atg gtg gtg gag gtc ctc agg 3 03
Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu ArgGlu Wing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu Arg
50 55 6050 55 60
tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 352tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 352
Ser Thr Gln 65Ser Thr Gln 65
atgagaatat tggactgata tgttaccatt atctaatag gaa gct gta gtg tcg 406atgagaatat tggactgata tgttaccatt atctaatag gaa gct gta gtg tcg 406
Glu Ala Val Val Ser 70Glu Wing Val Val Ser 70
tet cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 454tet cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 454
Ser Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp Glu 75 80 85Ser Pro Leu Wing Val Leu Wing Ile Arg Ser Pro Wing Gly Val Trp Glu 75 80 85
tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 502tac att gct gtg gag gtc caa aag ctt tt gtg gag gaa atg ccc gtt 502
2020
25 1025 10
1515
Tyr Ile Ala Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro ValTyr Ile Wing Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val
90 95 10090 95 100
gct gag tat cta cgg ttg aag gaa gaa ctt gtt gac gga ag 543gct gag tat cta cgg ttg aag gaa gaa ctt gtt gac gga ag 543
Ala Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly SerGlu Wing Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser
105 110 115105 110 115
gtaggtagtt ttctcttaca gcCcctggtt tatgttatgt ttgagtttat gttgaacaat 603 gctcaatgat gtggttcag c tcc aag ggc gag ttt atg ttg gaa ttg gac 653gtaggtagtt ttctcttaca gcCcctggtt tatgttatgt ttgagtttat gttgaacaat 603 gctcaatgat gtggttcag c tcc aag ggc gag ttt atg ttg gaa ttg gac 653
Ser Lys Gly Glu Phe Met Leu Glu Leu Asp 120 125Ser Lys Gly Glu Phe Met Leu Glu Leu Asp 120 125
ttt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 701ttt ggt gca ttc aat aat tct gtt cct cgt cca tct ctt tca aag tcc 701
Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys SerPhe Gly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Be Lys Be
130 135 140130 135 140
att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 749att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 749
Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys LeuIle Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu To Be Ala Lys Leu
145 150 155145 150 155
ttt caa gac aag gag aac ttg aac ttg ttg ctt gaa ttt ctc caa att 797ttt caa gac aag gag aac ttg aac ttg ttg ct gaa ttt ctc caa att 797
Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175
cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 845cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 845
His Cys Gln Lys Gly LysHis Cys Gln Lys Gly Lys
180180
cctttataca tcaaaactca agaatcgttc aatgttatgc cacag ggt atg ctg ttg 902cctttataca tcaaaactca agaatcgttc aatgttatgc cacag ggt atg ctg ttg 902
Gly Met Leu Leu 185Gly Met Leu Leu 185
aat gac aga ate caa gat gtg aat tcc ctc caa cat gca ttg agg aag 950aat gac aga till cat gat gtg aat tcc ctc caa cat gca ttg agg aag 950
Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu Arg LysAsn Asp Arg Ile Gln Asp Val Asn Being Read Gln His Wing Read Arg Lys
190 195 200190 195 200
gcc gag gag tat ctg act ccc cta acg tcg gat acc ccg tac tcg gtt 998gcc gag gag tat ctg act ccc cta acg tcg gat acc ccg tac tcg gtt 998
Ala Glu Glu Tyr Leu Thr Pro Leu Thr Ser Asp Thr Pro Tyr Ser ValGlu Glu Wing Tyr Leu Thr Pro Leu Thr Be Asp Thr Pro Tyr Be Val
205 210 215205 210 215
ttc gag aaa agg ttt cgg gag att ggt ttg gag aag gga tgg ggt gat 1046ttc gag aaa agg ttt cgg gag att ggt ttg gag aag gga tgg ggt gat 1046
Phe Glu Lys Arg Phe Arg Glu Ile Gly Leu Glu Lys Gly Trp Gly Asp 10Phe Glu Lys Arg Phe Arg Glu Ile Gly Leu Glu Lys Gly Trp Gly Asp 10
1515
2020
2525
3030
220 225 230220 225 230
aat gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc ctt 1094aat gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc ctt 1094
Asn Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu LeuAsn Wing Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu Leu
235 240 245235 240 245
cag gea cct gat cct gtc gea ctt gaa age ttc ctt gga aga ate cca 1142 Gln Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Arg Ile Pro 250 255 260 265cag gea cct gat cct gtc gea ct gaa age ttc ctt gga aga till cca 1142 Gln Ala Pro Asp Pro Val Leu Glu Ser Phe Leu Gly Arg Ile Pro 250 255 260 260 265
ttg gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gct caa 1190 Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala Glnttg gtc gcc aat gtt gta till atg act ccc cat gga tac ttt gct caa 1190 Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala Gln
270 275 280270 275 280
gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat ate 1238 Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr Ilegac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat till 1238 Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr Ile
285 290 295285 290 295
tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc aag 1286tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt ttc aag 1286
Leu Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg Phe LysLeu Asp Gln Val Arg Wing Leu Glu Glu Glu Leu Read His Arg Phe Lys
300 305 3io300 305 3io
ctg caa gga etc gac att acc cca cgt ate cta gtc gtgagtatct 1332ctg caa gga etc gac att acc cca cgt till cta gtc gtgagtatct 1332
Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325Read Gln Gly Read Asp Ile Thr Pro Arg Ile Read Val 315 320 325
gaatgttaat tcggtttcga tcattatgtt taatgcttaa tttggctgtt tatgtaccag 1392 att act cgg ctc ctc cct gat gct gta gga aca act tgc ggt cag cgt 1440gaatgttaat tcggtttcga tcattatgtt taatgcttaa tttggctgtt tatgtaccag 1392 att act cgg ctc cct gat gct gta gga aca act tgc ggt cag cgt 1440
Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln ArgIle Thr Arg Read Leu Pro Asp Wing Val Gly Thr Thr Cys Gly Gln Arg
330 335 340330 335 340
ctt gaa aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1488ct gaa aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1488
Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val ProGlu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile
345 350 355345 350 355
ttc aga act gag aag gga att gta cgt cca tgg ate tca cga ttc aaa 1536ttc aga act gag aag gga att gta cgt cca tgg until tca cga ttc aaa 1536
Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Ser Arg Phe LysPhe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Be Arg Phe Lys
360 365 370360 365 370
gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag ate 1584gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag until 1584
Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Ile 375 380 385 acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1632Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Wing Wing Glu Ile 375 380 385 acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1632
Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr SerThr Lys Glu Phe Gln Gly Lys Pro Asp Read Ile Val Gly Asn Tyr Ser
390 395 400 405390 395 400 405
gat gera aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1680gat generates aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1680
Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lys Phe Asp Val ThrAsp Gly Asn Ile Val Wing Be Read Leu Wing His Lys Phe Asp Val Thr
410 415 420410 415 420
cag gtttgttaaa tgccaattaa ttgaacaagt tccttgctgt attttgatgt 1733cag gtttgttaaa tgccaattaa ttgaacaagt tccttgctgt attttgatgt 1733
GlnGln
1010
1515
2020
2525
3030
ttcccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1788ttcccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1788
Cys Thr Ile Ala 425Cys Thr Ile Wing 425
cat gea ctt gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1836cat gea ctt gag aag ag aaa tac ccg gat tca gac att aac tgg aaa 1836
His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysHis Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
430 435 440430 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1884cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1884
Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp LeuGln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu
445 450 455445 450 455
att gct atg aac cat act gat ttt gtc ate acc age acc ttc caa gag 1932att gct atg aac cat act gat ttt gtc until acc age acc ttc caa gag 1932
Ile Ala Met Asn His Thr Asp Phe Val Ile Thr Ser Thr Phe Gln GluIle Ala Met Asn His Thr Asp Phe Val Ile Thr Be Thr Phe Gln Glu
460 465 470460 465 470
att gct gga ag gtaatctatc tcccacttta ctaaatttgg ttcccttgta 1983att gct gga ag gtaatctatc tcccacttta ctaaatttgg ttcccttgta 1983
Ile Ala Gly Ser 475Ile Wing Gly Ser 475
cgttgtaaat gttgccacct atttgagtcc ctaacaatgt ttccattcca tgattttcag 2043 c aag gac act ctc ggc caa tac gag agt cac att gct ttc act ctt cca 2092 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro 480 485 490cgttgtaaat gttgccacct atttgagtcc ctaacaatgt ttccattcca tgattttcag 2043 c aag gac act ctc gg ca ca ca tac gag ttc act ctt cca 2092 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Thr
ggg ctc tac cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc 2140ggg ctc tac cgc gtt gtt gat ggg till gat gtt ttt gat ccc aaa ttc 2140
Gly Leu Tyr Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe 495 500 505 510Gly Leu Tyr Arg Val Val Asp Gly Ile Val Val Asp Phe Asp Pro Lys Phe 495 500 505 510
aat att gtc tcc cct ggt gct gat atg age ata tac ttc cct tac acg 2188 Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thraat att gtc tcc cct ggt gct gat atg act ata tac ttc cct tac acg 2188 Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr
515 520 525515 520 525
gaa gag aag cga agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2236gag gag aag cga agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2236
Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540Glu Glu Lys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540
ctt tac age cct gtt gag aat aca gag cac tt gtgggatttt ggtcctattt 22 88 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leuct tac age cct gtt gag aat aca gag cac tt gtgggatttt ggtcctattt 22 88 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leu
545 550545 550
ttacaccctt tttcactaga ttttgtgttc agtgttaaca atgtcttgaa ttgttgtctc 2348 acag a tgt gta cta aaa gac cgc aac aag ccg gtt ctg ttt acc atg 2395ttacaccctt tttcactaga ttttgtgttc agtgttaaca atgtcttgaa ttgttgtctc 2348 acag a tgt gta cta aaa gac ag aag ccg gtt ctg ttt acc atg 2395
Cys Val Leu Lys Asp Arg Asn Lys Pro Val Leu Phe Thr Met 555 560 565Cys Val Leu Lys Asp Arg Asn Lys Pro Val Leu Phe Thr Met 555 560 565
gea agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat 2443 Ala Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr 570 575 580gea agg ttg gac cga gtg aag aac tta act ggg ctc gta gag ttc tat 2443 Arg Wing Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr 570 575 580
gcc aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt 2491gcc aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt 2491
Ala Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val GlyWing Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly
585 590 595585 590 595
gga gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg 2539gga gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg 2539
Gly Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met 600 605 610 615Gly Asp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met 600 605 610 615
aag aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttc 2587aag aag atg tat gaa ctt until gaa aaa tac aag ttg aat gga caa ttc 2587
Lys Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe 620 625 630Lys Lys Met Tyr Glu Read Ile Glu Lys Tyr Lys Read Asn Gly Gln Phe 620 625 630
aga tgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat 263 5 Arg Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyraga tgg ata tcg tcc cag atg aac cgt gtg aga aat ggt gaa ctc tat 263 5 Arg Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr
635 640 645635 640 645
cgt tat att tgc gac acc aag gga gea ttc gtt cag cct gct ata tac 2683cgt tat att tgc gac acc aag gga gea ttc gtt
Arg Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Ala Ile Tyr 650 655 660Arg Tyr Ile Cys Asp Thr Lys Gly Phe Val Gln Pro Wing Ile Tyr 650 655 660
gag gct ttt ggc ttg act gtt gtt gag gea atg acc tgt gga cta cca 2731gg gt tt gg tt gt gt gt gt gt gt cta cca 2731
Glu Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro 10Glu Wing Phe Gly Leu Thr Val Val Glu Wing Met Thr Cys Gly Leu Pro 10
1515
2020
2525
3030
665 670 675665 670 675
aca ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga 2779aca ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga 2779
Thr Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly 680 685 690 695Thr Phe Wing Thr Cys Tyr Gly Gly Pro Wing Glu Ile Ile Val His Gly 680 685 690 695
aaa tcg ggg ttc aac ate gac cct tat aac ggt gat ttg gct gcc gag 2827aaa tcg ggg ttc aac until gac cct tat aac ggt gat ttg gct gcc gag 2827
Lys Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Leu Ala Ala GluLys Ser Gly Phe Asn Ile Asp Pro Tyr Asn Gly Asp Read Wing Wing Glu
700 705 710700 705 710
acc ctc gcc aat ttc ttc gag aaa tgc aaa gcg gat cca tet tat tgg 2875acc ctc gcc aat ttc ttc gag aaa tgc aaa gcg gat cca tet tat tgg 2875
Thr Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr TrpThr Leu Wing Asn Phe Phe Glu Lys Cys Lys Wing Asp Pro Be Tyr Trp
715 720 725715 720 725
gat gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca 2923gat gag till tcc cag ggg ttg aaa cgc ata cag gag aag tat aca 2923
Asp Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr ThrAsp Glu Ile Be Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr
730 735 740730 735 740
tgg cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc 2971tgg cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc 2971
Trp Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr GlyTrp Gln Ile Tyr Be Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly
745 750 755745 750 755
ttt tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac 3019ttt tcg aaa cat gta cgt tac cag gag caa cgt ggg cgc aag cgt tac 3019
Phe Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr 760 765 770 775Phe Ser Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr 760 765 770 775
att gaa atg ttg cat gca ttg atg tat aac aat cgg gtaaaacaat 3 065att gaa atg ttg cat gca ttg atg tat aac aat cgg gtaaaacaat 3 065
Ile Glu Met Leu His Ala Leu Met Tyr Asn Asn ArgIle Glu Met Leu His Wing Leu Met Tyr Asn Asn Arg
780 785780 785
catgtctatt attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt 3125 catgtttctg aatgtttaat tgtgtgcttt acag gtc aag act gtt cct cta gct 3180catgtctatt attattatta catcaaacat cattttactc tttcattgaa ctgaaaattt 3125 catgtttctg aatgtttaat tgtgtgcttt acag gtc aag act gtt cct cta gct 3180
Val Lys Thr Val Pro Leu Ala 790Val Lys Thr Val Pro Leu Wing 790
gtt gag taaagaaaca gcaaaaagct gcttggaaac caaggaattt ccaagaatga 3236gtt gag taaagaaaca gcaaaaagct gcttggaaac caaggaattt ccaagaatga 3236
Val GluVal glu
795795
ttgctattga agttttattt ttgcttttct tttttttttt ttttttcaat ttggtaaatg 3296 tttaatttca actttggggt ttccccttgg aatgttttga actctgggtt tggtccc 3353ttgctattga agttttattt ttgcttttct tttttttttt ttttttcaat ttggtaaatg 3296 tttaatttca actttggggt ttccccttgg aatgttttga actctgggtt tggtccc 3353
<210> 13 <211> 796 <212> PRT<210> 13 <211> 796 <212> PRT
<213> Gossypium Iongicalyx <400> 13<213> Gossypium Iongicalyx <400> 13
Met Ala Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Ala Thr 10 15Met Wing Ser Ile Ser Val Cys Glu Arg Leu Gly Glu Ser Leu Wing Thr 10 15
His Pro Gln Gln Ala Lys Ser Ile Leu Ser Arg Ile Glu Ser Leu GlyHis Pro Gln Gln Wing Lys Be Ile Read G Be Arg Ile Glu Be Read Gly
25 3025 30
Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys GluLys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu
40 4540 45
Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val Leu Arg SerWing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu Arg Ser
50 55 6050 55 60
Thr Gln Glu Ala Val Val Ser Ser Pro Leu Val Ala Leu Ala Ile Arg 65 70 75 80Thr Gln Glu Wing Val Val Ser Be Pro Leu Val Wing Leu Wing Ile Arg 65 70 75 80
Ser Ala Pro Gly Val Trp Glu Tyr Ile Ala Val Glu Val Gln Lys LeuSer Pro Wing Gly Val Trp Glu Tyr Ile Wing Val Glu Val Gln Lys Leu
85 90 9585 90 95
Phe Val Glu Glu Met Pro Val Ala Glu Tyr Leu Arg Leu Lys Glu GluPhe Val Glu Glu Met Pro Val Glu Wing Tyr Leu Arg Leu Lys Glu Glu
100 105 110100 105 110
Leu Val Asp Gly Ser Ser Asn Gly Glu Phe Met Leu Glu Leu Asp PheRead Val Asp Gly Be Ser Asn Gly Glu Phe Met Read Le Glu Read Le Asp Phe
115 120 125115 120 125
Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys Ser IleGly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Lys Be Ile
130 135 140130 135 140
Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys Leu Phe 145 150 155 160Gly Asn Gly Met Asp Phe He Read Asn Arg His He Read Be Wing Lys Leu Phe 145 150 155 160
Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile HisGln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile His
165 170 175165 170 175
Cys Gln Lys Gly Lys Gly Met Leu Leu Asn Asp Arg Ile Gln Asp ValCys Gln Lys Gly Lys Gly Met Leu Read Asn Asp Arg Ile Gln Asp Val
180 185 190180 185 190
Asn Ser Leu Gln His Ala Leu Arg Lys Ala Glu Glu Tyr Leu Thr ProAsn Be Read Gln His Wing Read Arg Lys Wing Glu Glu Tyr Leu Thr Pro
195 200 205195 200 205
Leu Ser Ser Asp Thr Pro Tyr Ser Val Phe Glu Lys Arg Phe Leu Gly 210 215 220 Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Ala Glu His Val Leu Glu 225 230 235 240Leu Be Ser Asp Thr Pro Tyr Be Val Phe Glu Lys Arg Phe Leu Gly 210 215 220 Ile Gly Leu Glu Lys Gly Trp Gly Asp Asn Wing Glu His Val Leu Glu 225 230 235 240
Met Ile His Leu Leu Leu Asp Leu Leu Gln Ala Pro Asp Pro Val AlaMet Ile His Leu Leu Leu Asp Leu Leu Gln Wing Pro Asp Pro Val Wing
245 250 255245 250 255
Leu Glu Ser Phe Leu Gly Arg Ile Pro Leu Val Ala Asn Val Val IleRead Glu Ser Phe Read Gly Arg Ile Pro Read Val Wing Asn Val Val Ile
260 265 270260 265 270
Met Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr ProMet Thr Pro His Gly Tyr Phe Ala Gln Asp Asn Val Leu Gly Tyr Pro
275 280 285275 280 285
Asp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Ala LeuAsp Thr Gly Gly Gln Val Val Tyr Ile Leu Asp Gln Val Arg Wing Alu
29O 295 30029O 295 300
Glu Glu Glu Leu Leu His Arg Ser Lys Leu Gln Gly Leu Asp Ile Thr 305 310 315 320Glu Glu Glu Read His Read His Arg Be Lys Read His Gln Gly Read Asp Ile Thr 305 310 315 320
Pro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Ala Val Gly ThrPro Arg Ile Leu Val Ile Thr Arg Leu Leu Pro Asp Wing Val Gly Thr
32S 330 33532S 330 335
Thr Cys Gly Gln Arg Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser AspThr Cys Gly Gln Arg Read Le Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp
340 345 350340 345 350
Ile Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro TrpIle Leu Arg Val Pro Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp
355 360 365355 360 365
Ile Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys AspIle Ser Arg Phe Lys Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp
370 375 380370 375 380
Val Ala Ala Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400Val Wing Wing Glu Ile Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile 385 390 395 400
Val Gly Asn Tyr Ser Asp Gly Asn Ile Val Ala Ser Leu Leu Ala HisVal Gly Asn Tyr Be Asp Gly Asn Ile Val Wing Be Read Leu Wing His
405 410 415405 410 415
Lys Phe Asp Val Thr Gln Cys Thr Ile Ala His Ala Leu Glu Lys ThrLys Phe Asp Val Thr Gln Cys Thr Ile Wing His Wing Leu Glu Lys Thr
420 425 430420 425 430
Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys Gln Leu Glu Asp Lys TyrLys Tyr Pro Asp Be Asp Ile Asn Trp Lys Gln Read Glu Asp Lys Tyr
435 440 445435 440 445
His Phe Ser Cys Gln Phe Thr Ala Asp Leu Ile Ala Met Asn His ThrHis Phe Ser Cys Gln Phe Thr Wing Asp Read Ile Wing Met Asn His Thr
450 455 460450 455 460
Asp Phe Ile Ile Thr Ser Thr Phe Gln Glu Ile Ala Gly Ser Lys Asp 465 470 475 480 Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro Gly LeuAsp Phe Ile Ile Thr Be Thr Phe Gln Glu Ile Wing Gly Be Lys Asp 465 470 475 480 Thr Read Gly Gln Tyr Glu Be His Ile Wing
485 490 495485 490 495
Cys Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe Asn IleCys Arg Val Val Asp Gly Ile Asp Val Val Phe Asp Pro
500 505 510500 505 510
Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr Thr Glu GluVal Ser Pro Gly Wing Asp Met Ile Tyr Phe Pro Tyr Thr Glu Glu
515 520 525515 520 525
Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu Leu TyrLys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Read Leu Tyr
530 535 540530 535 540
Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560Ser Pro Val Glu Asn Thr Glu His Leu Cys Val Leu Lys Asp Arg Asn 545 550 555 560
Lys Pro Ile Leu Phe Thr Met Ala Arg Leu Asp Arg Val Lys Asn LeuLys Pro Ile Leu Phe Thr Met Wing Arg Leu Asp Arg Val Lys Asn Leu
565 570 575565 570 575
Thr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu LeuThr Gly Leu Val Glu Phe Tyr Ala Lys Asn Ser Arg Leu Arg Glu Leu
580 585 590580 585 590
Val Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys AspVal Asn Leu Val Val Val Gly Gly Asp Arg Arg Lys Glu Ser Lys Asp
595 600 605595 600 605
Leu Glu Glu Lys Ala Glu Met Lys Lys Met Tyr Glu Leu Ile Glu LysRead Glu Glu Lys Wing Glu Met Lys Lys Met Tyr Glu Read Le Ile Glu Lys
610 615 620610 615 620
Tyr Lys Leu Asn Gly Gln Phe Arg Trp Ile Ser Ser Gln Met Asn Arg 625 630 635 640Tyr Lys Read Asn Gly Gln Phe Arg Trp Ile Be Ser Gln Met Asn Arg 625 630 635 640
Val Arg Asn Gly Glu Leu Tyr Arg Tyr Ile Cys Asp Thr Lys Gly AlaVal Arg Asn Gly Glu Read Tyr Arg Tyr Ile Cys Asp Thr Lys Gly Wing
645 650 655645 650 655
Phe Val Gln Pro Pro Ile Tyr Glu Ala Phe Gly Leu Thr Val Val GluPhe Val Gln Pro Ile Tyr Glu Wing Phe Gly Leu Thr Val Val Glu
660 665 670660 665 670
Ala Met Thr Cys Gly Leu Pro Thr Phe Ala Thr Cys Tyr Gly Gly ProMet Thr Cly Wing Gly Leu Pro Thr Phe
675 680 685675 680 685
Ala Glu Ile Ile Val His Gly Lys Ser Gly Phe Asn Ile Asp Pro CysGlu Wing Ile Ile Val His Gly Lys Be Gly Phe Asn
690 695 700690 695 700
Asn Gly Asp Leu Ala Ala Glu Thr Leu Ala Asn Phe Phe Glu Lys Cys 705 710 715 720Asn Gly Asp Read Wing Wing Glu Thr Read Wing Wing Asn Phe Phe Glu Lys Cys 705 710 715 720
Lys Ala Asp Pro Ser Tyr Trp Asp Glu Ile Ser Gln Gly Gly Leu Lys 725 730 735 Arg Ile Gln Glu 740Lys Wing Asp Pro Be Tyr Trp Asp Glu Ile Be Gln Gly Gly Leu Lys 725 730 735 Arg Ile Gln Glu 740
Thr Leu Thr Gly 755Thr Leu Thr Gly 755
Gln Arg Gly Arg 770Gln Arg Gly Arg 770
Asn Asn Arg Val 785Asn Asn Arg Val 785
<210> 14 <211> 3353 <212> DNA<210> 14 <211> 3353 <212> DNA
<213> Gossypium longicalyx <220><213> Gossypium longicalyx <220>
<221> éxon <222> (32) . . (111) <223> éxon 1 <220><221> exon <222> (32). . (111) <223> exon 1 <220>
<221> misc_feature <222> (32) . . (34) <223> códon de iniciação <220><221> misc_feature <222> (32). . (34) <223> initiation codon <220>
<221> éxon <222> (197) . . (314) <223> éxon 2 <220><221> exon <222> (197). . (314) <223> exon 2 <220>
<221> éxon <222> (394)..(545) <223> éxon 3 <220><221> exon <222> (394) .. (545) <223> exon 3 <220>
<221> éxon <222> (625)..(817) <223> éxon 4<221> exon <222> (625) .. (817) <223> exon 4
Lys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu LeuLys Tyr Thr Trp Gln Ile Tyr Ser Glu Lys Leu Leu
745 750745 750
Val Tyr Gly Phe Ser Lys His Val Ala Tyr Gln GluVal Tyr Gly Phe Ser Lys His Val Wing Tyr Gln Glu
760 765760 765
Lys Arg Tyr Ile Glu Met Leu His Ala Trp Met TyrLys Arg Tyr Ile Glu Met Read His Wing Trp Met Tyr
775 780775 780
Lys Thr Val Pro Leu Ala Val Glu 790 795 <220>Lys Thr Val Pro Leu Val Glu Wing 790 795 <220>
<221> éxon<221> exon
<222> (897) .. (1328)<222> (897) .. (1328)
<223> éxon 5<223> exon 5
<220><220>
<221> éxon <222> (1399) .. (1689)<221> exon <222> (1399) .. (1689)
<223> éxon 6 <220><223> exon 6 <220>
<221> éxon<221> exon
<222> (1783) .. (1949)<222> (1783) .. (1949)
<223> éxon 7 <220><223> exon 7 <220>
<221> éxon<221> exon
<222> (2050) .. (2274)<222> (2050) .. (2274)
<223> éxon 8 <220><223> exon 8 <220>
<221> éxon<221> exon
<222> (2358) .. (3060)<222> (2358) .. (3060)
<223> éxon 9 <220><223> exon 9 <220>
<221> éxon<221> exon
<222> (3169) .. (3195)<222> (3169) .. (3195)
<223> éxon 10<223> exon 10
<220><220>
<221> misc_feature <222> (3196) .. (3198) <223> Códon de terminação <4 00> 14<221> misc_feature <222> (3196) .. (3198) <223> Termination Codon <4 00> 14
cccttctgcc atttcaggaa cccaaaaaac a atg gct tca ate agt gtt tgtcccttctgcc atttcaggaa cccaaaaaac a atg gct tca till agt gtt tgt
Met Ala Ser Ile Ser Val Cys 1 5 10Met Wing Ser Ile Ser Val Cys 1 5 10
1515
2020
2525
3030
gag cgt ttg ggt gaa tct cta gct act cac cca cag cag gca aag tct 100gag cgt ttg ggt gaa tct cta gct act cac cca cag cag gca aag tct 100
Glu Arg Leu Gly Glu Ser Leu Ala Thr His Pro Gln Gln Ala Lys SerGlu Arg Read Gly Glu Be Read Ala Thr His Pro Gln Gln Ala Lys Ser
15 2015 20
ate ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151until ttg tca ag gtataatgta aataagtgct gccttcaaat ttctggtctt 151
Ile Leu Ser Arg 25Ile Leu Ser Arg 25
tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206tcttcgttat tttttattac attgtttata tatgtcatat tgcag g att gaa age 206
Ile Glu Ser 30Ile Glu Ser 30
ctc gga aag ggt att cat aag tct caa aag ctg ctc tcg gtt ctc gat 254ctc gga aag ggt att cat aag tct caa aag ctg ctc tcg gtt ctc gat 254
Leu Gly Lys Gly Ile His Lys Ser Gln Lys Leu Leu Ser Val Leu AspLeu Gly Lys Gly Ile His Lys Be Gln Lys Leu Read Le Be Val Leu Asp
40 4540 45
aaa gag gcc gga aat caa gca ctt gat ggg atg gtg gtg gag gtc ctc 3 02aaa gag gcc gga aat caa gca ct gat ggg atg gtg gtg gag gtc ctc 3 02
Lys Glu Ala Gly Asn Gln Ala Leu Asp Gly Met Val Val Glu Val LeuLys Glu Wing Gly Asn Gln Wing Leu Asp Gly Met Val Val Glu Val Leu
50 55 6050 55 60
agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354agg tcc act cag gtcacaataa aacaaaccct ttctttggtt atgtttcatg 354
Arg Ser Thr Gln 65Arg Ser Thr Gln 65
atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 408atgagaatat tggactgata tgttaccgtt atctaatag gaa gct gta gtg tcg 408
Glu Ala Val Val Ser 70Glu Wing Val Val Ser 70
tct cca ttg gtt gcc ctt gcc att cgt tca gct cct gga gtt tgg gag 456tct cca ttg gtt gcc ctt gcc att cgt
Ser Pro Leu Val Ala Leu Ala Ile Arg Ser Ala Pro Gly Val Trp GluSer Pro Leu Wing Val Leu Wing Ile Arg Ser Pro Wing Gly Val Trp Glu
75 80 8575 80 85
tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504tac att gct gtg gag gtc caa aag ctt ttt gtg gag gaa atg ccc gtt 504
Tyr Ile Ala Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro ValTyr Ile Wing Val Glu Val Gln Lys Leu Phe Val Glu Glu Met Pro Val
90 95 10090 95 100
gct gag tat cta cgg ttg aag gaa gaa ctt gtt gat gga ag 545gct gag tat cta cgg ttg aag gaa gaa ct gt gat gga ag 545
Ala Glu Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly SerGlu Wing Tyr Leu Arg Leu Lys Glu Glu Leu Val Asp Gly Ser
105 110 115105 110 115
gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tcc aat ggc gag ttt atg ttg gaa ttg gac 655 10gtaggtagtt ttctcttaca gctcctggtt tatattatgt ttgagtttat gttgaacaat 605 gctcaatgat gtggttcag c tc aat ggc gag ttt atg ttg gaa ttg gac 655 10
1515
3030
Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125Ser Asn Gly Glu Phe Met Leu Glu Leu Asp 120 125
ttt ggt gca ttc aat aat tct gtt ccc cgt cca tct ctt tca aag tcc 703ttt ggt gca ttc aat aat tct gtt ccc cgt cca
Phe Gly Ala Phe Asn Asn Ser Val Pro Arg Pro Ser Leu Ser Lys SerPhe Gly Ala Phe Asn Asn Be Val Pro Arg Pro Be Read Be Lys Be
130 135 140130 135 140
att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751att ggt aat ggc atg gac ttc ctc aac cgc cac ctt tct gcc aag cta 751
Ile Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu Ser Ala Lys LeuIle Gly Asn Gly Met Asp Phe Leu Asn Arg His Leu To Be Ala Lys Leu
145 150 155145 150 155
ttt caa gac aag gag aac ttg aac ttg ttg ctt gaa ttt ctc caa att 799ttt caa gac aag gag aac ttg aac ttg ttg ct gaa ttt ctc caa att 799
Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175Phe Gln Asp Lys Glu Asn Leu Asn Leu Leu Leu Leu Glu Phe Leu Gln Ile 160 165 170 175
cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847cac tgc cag aag gga aag gtaaaagttt cactttcact atttgttttt 847
His Cys Gln Lys Gly Lys 180His Cys Gln Lys Gly Lys 180
cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905cctttctgta tacatcaaaa cttaagaatc gttcaatgtt atgccacag ggt atg ctg 905
Gly Met LeuGly Met Leu
ttg aat gac aga ate caa gat gtg aat tcc ctc caa cat gca tta agg 953ttg aat gac aga until hunting gat gtg aat tcc ctc hunting cat gca tta agg 953
Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Ala Leu Arg 185 190 195 200Leu Asn Asp Arg Ile Gln Asp Val Asn Ser Leu Gln His Wing Leu Arg 185 190 195 200
aag gcc gag gag tat ctg act cct cta tcc tcg gat acc ccg tac tca 1001aag gcc gag gag ctg act cct cta tcc tcg gat acc ccg tac tca 1001
Lys Ala Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr SerLys Wing Glu Glu Tyr Leu Thr Pro Leu Ser Ser Asp Thr Pro Tyr Ser
205 210 215205 210 215
gtt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt 1049gt ttc gag aaa agg ttt ctg ggg att ggt ttg gag aag gga tgg ggt 1049
Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp Gly 220 225 230Val Phe Glu Lys Arg Phe Leu Gly Ile Gly Leu Glu Lys Gly Trp Gly 220 225 230
gat aat gct gag cat gtt ctt gag atg ate cat ctt cta ttg gat ctc 1097gat aat gct gag cat gtt ctt gag atg until cat ctt cta ttg gat ctc 1097
Asp Asn Ala Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp LeuAsp Asn Alu Glu His Val Leu Glu Met Ile His Leu Leu Leu Asp Leu
235 240 245235 240 245
ctt cag gca cct gat cct gtc gca ctt gaa age ttc ctt gga aga ate 1145ctt cag gca cct gat cct gtc gca ct gaa age ttc ctt gga aga until 1145
Leu Gln Ala Pro Asp Pro Val Ala Leu Glu Ser Phe Leu Gly Arg Ile 250 255 260 cca ttg gtc gcc aat gtt gta ate atg act ccc cat gga tac ttt gcc 1193Leu Gln Ala Pro Asp Pro Val Wing Leu Glu Ser Phe Leu Gly Arg Ile 250 255 260 260 cca ttg gtc gcc aat gtt gta until atg act ccc cat gga tac ttt gcc 1193
Pro Leu Val Ala Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Ala 265 270 275 280Pro Leu Val Wing Asn Val Val Ile Met Thr Pro His Gly Tyr Phe Wing 265 270 275 280
caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241caa gac aat gtt ttg gga tac cct gac aca ggt ggt cag gtt gtt tat 1241
Gln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295Gln Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gln Val Val Tyr 285 290 295
ate tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt tcc 1289till tta gat caa gtt cgt gcc ttg gag gaa gag ttg ctt cat cgt tcc 1289
Ile Leu Asp Gln Val Arg Ala Leu Glu Glu Glu Leu Leu His Arg Ser 300 305 310Ile Leu Asp Gln Val Arg Ward Leu Glu Glu Glu
aag ctg caa gga ctc gac att acc cca cgt ate cta gtc gtgagtatct 1338aag ctg caa gga ctc gac att acc cca cgt until cta gtc gtgagtatct 1338
Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325Lys Leu Gln Gly Leu Asp Ile Thr Pro Arg Ile Leu Val 315 320 325
gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc ctc cct gat gct gta gga aca act tgc ggt cag cgt 1446gaatgttaat tcagtttcga tcattatgtt taatgcttaa tttggctgtt tgtgtaccag 1398 att act cgg ctc cct gat gct gta gga aca act tgc ggt cag cgt 1446
Ile Thr Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gln ArgIle Thr Arg Read Leu Pro Asp Wing Val Gly Thr Thr Cys Gly Gln Arg
330 335 340330 335 340
ctt gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494ct gag aaa gtg tat gga acc aag tat tet gat att ctt cgg gta ccc 1494
Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 345 350 355Leu Glu Lys Val Tyr Gly Thr Lys Tyr Ser Asp Ile Leu Arg Val Pro 345 350 355
ttc aga act gag aag gga att gta cgt cca tgg ate tca cga ttc aaa 1542ttc aga act gag aag gga att gta cgt cca tgg until tca cga ttc aaa 1542
Phe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Ser Arg Phe LysPhe Arg Thr Glu Lys Gly Ile Val Arg Pro Trp Ile Be Arg Phe Lys
360 365 370360 365 370
gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag ate 1590gtc tgg cct tac ctg gag act tac act aag gat gtt gea gct gag until 1590
Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Ala Ala Glu Ile 375 380 385Val Trp Pro Tyr Leu Glu Thr Tyr Thr Lys Asp Val Wing Wing Glu Ile 375 380 385
acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1638acc aaa gag ttc cag ggc aag cct gat ctg att gtt ggg aac tac agt 1638
Thr Lys Glu Phe Gln Gly Lys Pro Asp Leu Ile Val Gly Asn Tyr Ser 390 395 400 405Thr Lys Glu Phe Gln Gly Lys Pro Asp Ile Ile Val Gly Asn Tyr Ser 390 395 400 405
gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686gat gga aac att gtt gct tet tta ttg gea cat aag ttt gat gtt aca 1686
Asp Gly Asn Ile Val Ala Ser Leu Leu Ala His Lys Phe Asp Val ThrAsp Gly Asn Ile Val Wing Be Read Leu Wing His Lys Phe Asp Val Thr
410 415 420410 415 420
cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739 15cag gtttgttaaa taccaattaa ttgaacaagt tccttgctgt attttgatgt 1739 15
GlnGln
tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794tccccgtgca atctaaaaac ttggttttca atgttttatg cag tgc act att gct 1794
Cys Thr Ile AlaCys Thr Ile Wing
425425
cat gca ctc gag aag acg aaa tac ccg gat tca gac att aac tgg aaa 1842cat gca ctc gag aag ag aaa tac ccg gat tca gac att aac tgg aaa 1842
His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp LysHis Wing Read Glu Lys Thr Lys Tyr Pro Asp Ser Asp Ile Asn Trp Lys
430 435 440430 435 440
cag ctt gag gat aag tat cac ttc tcc tgt cag ttt act gct gat ctt 1890cag ctt gag gat aag tat ttc tcc tgt cag ttt act gct gat ctt 1890
1 0 Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Ala Asp Leu 445 450 4551 0 Gln Leu Glu Asp Lys Tyr His Phe Ser Cys Gln Phe Thr Wing Asp Leu 445 450 455
att gct atg aac cat act gat ttt ate ate acc age acc ttc caa gag 1938att gct atg aac cat act gat ttt until till acc age acc ttc caa gag 1938
Ile Ala Met Asn His Thr Asp Phe Ile Ile Thr Ser Thr Phe Gln GluIle Ala Met Asn His Thr Asp Phe Ile Ile Thr Be Thr Phe Gln Glu
460 465 470460 465 470
att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989att gct gga ag gtaatctatc tcccacttta ctacatttgg ttcccttgta 1989
Ile Ala Gly Ser 475Ile Wing Gly Ser 475
cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc ggc caa tac gag agt cac att gct ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Ser His Ile Ala Phe Thr Leu Pro 480 485 490cgttgtaaat gttgccacct atttgagtcc ctaacaatgt tttcattcca tgattttcag 2049 c aag gac act ctc gg ca ca ca tac gag ttc act ctt cca 2098 Lys Asp Thr Leu Gly Gln Tyr Glu Being His Ileu
ggg ctc tgc cgc gtt gtt gat ggg ate gat gtt ttt gat ccc aaa ttc 2146ggg ctc tgc cgc gt gt gat ggg until gat gtt ttt gat ccc aaa ttc
Gly Leu Cys Arg Val Val Asp Gly Ile Asp Val Phe Asp Pro Lys Phe 495 500 505 510Gly Leu Cys Arg Val Val Asp Gly Ile Val Val Asp Phe Asp Pro Lys Phe 495 500 505 510
aat att gtc tcc cct ggt gct gat atg age ata tac ttc cct tac acg 2194aat att gtc tcc cct ggt gct gat atg act ata tac ttc cct tac acg 2194
Asn Ile Val Ser Pro Gly Ala Asp Met Ser Ile Tyr Phe Pro Tyr ThrAsn Ile Val Ser Pro Gly Wing Asp Met Ser Ile Tyr Phe Pro Tyr Thr
515 520 525515 520 525
gaa gag aag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242gag gag ag cgg agg ttg aag aag ttc cac ccg gag att gaa gag ctt 2242
Glu Glu Lys Arg Arg Leu Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540Glu Glu Lys Arg Arg Read Lys Lys Phe His Pro Glu Ile Glu Glu Leu 530 535 540
ctt tac age cct gtt gag aat aca gag cac tt gtgagatttt ggttctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu His Leuct tac age cct gt gag aat aca gag cac tt gtgagatttt ggttctattt 2294 Leu Tyr Ser Pro Val Glu Asn Thr Glu
2020
25 1025 10
1515
2020
2525
3030
545 550545 550
ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt gta cta aaa gac cgc aac aag ccg att ctg ttt acc atg gca 2403 Cys Val Leu Lys Asp Arg Asn Lys Pro Ile Leu Phe Thr Met Ala 555 560 565ttacaccctt ttcactagat tttgtgttca gtgttaacaa tgtcttaaat tgttgtctca 2354 cag a tgt cta aaa gac cgc aac ag ccg att ctg ttt acc atg gca 2403 Cys Val Leu Lys Asp L5 As5 Arg 5
agg ttg gac cga gtg aag aac tta act ggg etc gta gag ttc tat gcc 2451agg ttg gac cga gtg aag aac tta act ggg etc gta gag ttc tat gcc 2451
Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr AlaArg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Phe Tyr Wing
570 575 580570 575 580
aag aac age cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499aag aac acts cgg ctg agg gaa ctg gtt aac ttg gtt gta gta ggt gga 2499
Lys Asn Ser Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600Lys Asn Be Arg Leu Arg Glu Leu Val Asn Leu Val Val Val Gly Gly 585 590 595 600
gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2547gat agg aga aag gaa tcc aag gac tta gaa gaa aag gct gaa atg aag 2547
Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met LysAsp Arg Arg Lys Glu Be Lys Asp Leu Glu Glu Lys Wing Glu Met Lys
605 610 615605 610 615
aag atg tat gaa ctt ate gaa aaa tac aag ttg aat gga caa ttc aga 2595aag atg tat gaa ctt until gaa aaa tac aag ttg aat gga caa ttc aga 2595
Lys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe ArgLys Met Tyr Glu Leu Ile Glu Lys Tyr Lys Leu Asn Gly Gln Phe Arg
620 625 630620 625 630
tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2643tgg ata tcg tcc cag atg aac cga gtg aga aat ggt gaa ctc tat cgt 2643
Trp Ile Ser Ser Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr ArgTrp Ile Being Gln Met Asn Arg Val Arg Asn Gly Glu Leu Tyr Arg
635 640 645635 640 645
tat att tgc gac acc aag gga gca ttc gtt cag cct cct ata tac gag 2691tat att tgc gac acc aga gga gca ttc gtt cag cct cct ata tac gag 2691
Tyr Ile Cys Asp Thr Lys Gly Ala Phe Val Gln Pro Pro Ile Tyr GluTyr Ile Cys Asp Thr Lys Gly Phe Val Gln Pro Pro Ile Tyr Glu
650 655 660650 655 660
gct ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 2739gt ttt ggc ttg act gtt gtt gag gca atg acc tgt gga cta cca aca 2739
Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro Thr 665 670 675 680Phe Gly Leu Wing Thr Val Val Glu Wing Met Thr Cys Wing Gly Leu Thr Thr 665 670 675 680
ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787ttt gca aca tgt tac ggg ggc cct gct gag att ata gtt cac gga aaa 2787
Phe Ala Thr Cys Tyr Gly Gly Pro Ala Glu Ile Ile Val His Gly LysPhe Wing Thr Cys Tyr Gly Gly Pro Glu Wing Ile Ile Val His Gly Lys
685 690 695685 690 695
tcg ggg ttc aac ate gat cct tgt aac ggt gat ttg gct gcc gag acc 2835tcg ggg ttc aac until gat cct tgt aac ggt gat ttg gct gcc acc 2835
Ser Gly Phe Asn Ile Asp Pro Cys Asn Gly Asp Leu Ala Ala Glu Thr 700 705 710 ctt gcc aat ttc ttc gag aag tgc aaa gcg gat cca tct tat tgg gat 2883Ser Gly Phe Asn Ile Asp Pro Cys Asn Gly Asp Leu Wing Wing Wing Glu Thr 700 705 710 ctt gcc aat ttc ttc gag aag tgc aaa gat cca tct tat tgg gat 2883
Leu Ala Asn Phe Phe Glu Lys Cys Lys Ala Asp Pro Ser Tyr Trp AspLeu Wing Asn Phe Phe Glu Lys Cys Lys Wing Asp Pro To Be Tyr Trp Asp
715 720 725715 720 725
gag ate tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931gag till tcc cag gga ggg ttg aaa cgc ata cag gag aag tat aca tgg 2931
Glu Ile Ser Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740Glu Ile Being Gln Gly Gly Leu Lys Arg Ile Gln Glu Lys Tyr Thr Trp 730 735 740
cag att tac tcc gag aag cta ttg act ctc acc gga gtt tat ggc ttt 2979cag att tac tcc gag aag cta ttg act ctc acc gga gt tat ggc ttt 2979
Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760Gln Ile Tyr Ser Glu Lys Leu Leu Thr Leu Thr Gly Val Tyr Gly Phe 745 750 755 760
tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027tcg aaa cat gta gct tac cag gag caa cgt ggg cgc aag cgt tac att 3027
Ser Lys His Val Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr IleSer Lys His Val Tyr Wing Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile
765 770 775765 770 775
gaa atg ttg cat gea tgg atg tat aac aat cgg gtaaaacaat catgtctatt 3 080 Glu Met Leu His Ala Trp Met Tyr Asn Asn Arggaa atg ttg cat gea tgg atg tat aac aat cgg
780 785780 785
attattatta catcaaacat cattttactc tttcattgaa ctgaaatttt catgtttctg 3140 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3192attattatta catcaaacat cattttactc tttcattgaa ctgaaatttt catgtttctg 3140 aatgtttgtt taattgtgtg ctttacag gtc aag act gtt cca cta gct gtt 3192
Val Lys Thr Val Pro Leu Ala Val 790 795Val Lys Thr Val Pro Leu Wing Val 790 795
gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3245gag taaagaaaaa gcaaaaggct gcttggaaac caaggaattt ccaagaatga 3245
GluGlu
ttgctattga agttttattt ttgcttcttt ttttcaattt cgtaaatgtt taatttcaac 3305 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3353ttgctattga agttttattt ttgcttcttt ttttcaattt cgtaaatgtt taatttcaac 3305 tttggggttt ccccttggaa tgttttgaac tctgggtttg gtcccatt 3353
<210> 15<210> 15
<211> 26<211> 26
<212> PRT<212> PRT
<213> Artificial <220><213> Artificial <220>
<223> seqüência SusC de consenso C-terminal<223> C-terminal consensus SusC sequence
<400> 15<400> 15
Ala Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met Leu His 10 15Wing Tyr Gln Glu Gln Arg Gly Arg Lys Arg Tyr Ile Glu Met
Ala Trp Met Tyr Asn Asn Arg Val Lys Thr 25Trp Wing Met Tyr Asn Asn Arg Val Lys Thr 25
<210> 16 <211> 24 <212> PRT <213> Artificial <220><210> 16 <211> 24 <212> PRT <213> Artificial <220>
<223> seqüência SusC de consenso N-terminal <400> 16<223> N-terminal consensus SusC sequence <400> 16
His Lys Ser Gln Lys Leu Leu Ser Val Leu Asp Lys Glu Ala Gly Asn 10 15His Lys Be Gln Lys Leu Read Le Be Val Leu Asp Lys Glu Wing Gly Asn 10 15
Gln Ala Leu Asp Gly Met Val Val 20Gln Wing Read Asp Gly Met Val Val Val 20
<210> 17 <211> 22 <212> DNA <213> Artificial <220><210> 17 <211> 22 <212> DNA <213> Artificial <220>
<223> iniciador 1 de PCR específico para SusC <400> 17<223> SusC specific PCR primer 1 <400> 17
cccttctgcc atttcaggaa cc <210> 18 <211> 22 <212> DNA <213> Artificial <220>cccttctgcc atttcaggaa cc <210> 18 <211> 22 <212> DNA <213> Artificial <220>
<223> iniciador 1 de PCR específico para SusC <400> 18<223> SusC specific PCR primer 1 <400> 18
aatgggacca aacccagagt tcaatgggacca aacccagagt tc
Claims (34)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06015433 | 2006-07-25 | ||
EP060154333 | 2006-07-25 | ||
US83406606P | 2006-07-28 | 2006-07-28 | |
US60834066 | 2006-07-28 | ||
PCT/EP2007/006551 WO2008012058A1 (en) | 2006-07-25 | 2007-07-20 | Identification of a novel type of sucrose synthase and use thereof in fiber modification |
Publications (1)
Publication Number | Publication Date |
---|---|
BRPI0714552A2 true BRPI0714552A2 (en) | 2013-05-07 |
Family
ID=40925399
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BRPI0714552-7A BRPI0714552A2 (en) | 2006-07-25 | 2007-07-20 | identification of a type of sucrose synthase and its use in fiber modification |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN101495626A (en) |
BR (1) | BRPI0714552A2 (en) |
ZA (1) | ZA200900106B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106834342A (en) * | 2017-01-18 | 2017-06-13 | 华中农业大学 | The method that Culm of Rice lodging tolerance is improved using sucrose synthase |
CN110055232B (en) * | 2019-01-10 | 2020-08-25 | 北京理工大学 | Two glycyrrhetinic acid sucrose synthases and application thereof in synthesis of glycyrrhetinic acid glycosylated derivatives |
CN112280795A (en) * | 2020-11-17 | 2021-01-29 | 昆明理工大学 | Use of glycosyltransferase genes |
WO2023226978A1 (en) * | 2022-05-23 | 2023-11-30 | 苏州引航生物科技有限公司 | Microbial-derived sucrose synthase and use thereof |
-
2007
- 2007-07-20 CN CNA2007800286149A patent/CN101495626A/en active Pending
- 2007-07-20 ZA ZA200900106A patent/ZA200900106B/en unknown
- 2007-07-20 BR BRPI0714552-7A patent/BRPI0714552A2/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
ZA200900106B (en) | 2010-03-31 |
CN101495626A (en) | 2009-07-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1349446B1 (en) | Modification of sucrose synthase gene expression in plant tissue and uses therefor | |
Weschke et al. | The role of invertases and hexose transporters in controlling sugar ratios in maternal and filial tissues of barley caryopses during early development | |
US9834781B2 (en) | Polynucleotides and polypeptides involved in plant fiber development and methods of using same | |
EP1692285B1 (en) | Methods and means for altering fiber characteristics in fiber-producing plants | |
US8173866B1 (en) | Modulation of plant xylan synthases | |
US9057073B2 (en) | Differential expression of subgenome specific alleles in cotton and uses thereof | |
CN103443280B (en) | Seed specific promoters in cotton | |
BR112014013367B1 (en) | POLYNUCLEOTIDE, EXPRESSION CASSETTE, VECTOR, HOST CELL, POLYPEPTIDE AND USE THEREOF | |
BRPI0714552A2 (en) | identification of a type of sucrose synthase and its use in fiber modification | |
JP4880819B2 (en) | Method for enhancing cellulose biosynthesis and modifying lignin biosynthesis in plants | |
US20100167040A1 (en) | Identification of a novel type of sucrose synthase and use thereof in fiber modification | |
ES2701243T3 (en) | New preferred promoter in cotton fibers | |
US7498492B2 (en) | Modification of sucrose synthase gene expression in plant tissue and uses therefor | |
AU2002220363B2 (en) | Modification of sucrose synthase gene expression in plant tissue and uses therefor | |
EP4099818A1 (en) | Methods of controlling grain size and weight | |
ZA200303930B (en) | Modification of sucrose synthase gene expression in plant tissue and uses therefor. | |
AU2002220363A1 (en) | Modification of sucrose synthase gene expression in plant tissue and uses therefor | |
BRPI0913562B1 (en) | METHOD TO INTENSIFY THE PRODUCTIVITY OF A FRUTAN BIOSYNTHETIC PATHWAY AND GENETIC CONSTRUCTION |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B08L | Patent application lapsed because of non payment of annual fee [chapter 8.12 patent gazette] |
Free format text: REFERENTE AO NAO RECOLHIMENTO DAS 5A E 6A ANUIDADES. |
|
B08I | Publication cancelled [chapter 8.9 patent gazette] |
Free format text: ANULADA A PUBLICACAO CODIGO 8.12 NA RPI NO 2260 DE 29/04/2014 POR TER SIDO INDEVIDA. |
|
B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] |
Free format text: REFERENTE AS 5A, 6A, 7A, 8A, 9A, 10A, 11A, 12A E 13A ANUIDADES. |
|
B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette] |
Free format text: EM VIRTUDE DO ARQUIVAMENTO PUBLICADO NA RPI 2602 DE 17-11-2020 E CONSIDERANDO AUSENCIA DE MANIFESTACAO DENTRO DOS PRAZOS LEGAIS, INFORMO QUE CABE SER MANTIDO O ARQUIVAMENTO DO PEDIDO DE PATENTE, CONFORME O DISPOSTO NO ARTIGO 12, DA RESOLUCAO 113/2013. |