AU2008200749A1 - Promoters for regulation of plant gene expression - Google Patents

Promoters for regulation of plant gene expression Download PDF

Info

Publication number
AU2008200749A1
AU2008200749A1 AU2008200749A AU2008200749A AU2008200749A1 AU 2008200749 A1 AU2008200749 A1 AU 2008200749A1 AU 2008200749 A AU2008200749 A AU 2008200749A AU 2008200749 A AU2008200749 A AU 2008200749A AU 2008200749 A1 AU2008200749 A1 AU 2008200749A1
Authority
AU
Australia
Prior art keywords
aat
cat
gat
agt
act
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
AU2008200749A
Other versions
AU2008200749B2 (en
Inventor
Werner BASTIAN
Devon Brown
Paul Budworth
Hur-Song Chang
Bret Cooper
Bin Han
Xun Wang
Tong Zhu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Syngenta Participations AG
Original Assignee
Syngenta Participations AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2005247022A external-priority patent/AU2005247022B2/en
Application filed by Syngenta Participations AG filed Critical Syngenta Participations AG
Priority to AU2008200749A priority Critical patent/AU2008200749B2/en
Publication of AU2008200749A1 publication Critical patent/AU2008200749A1/en
Application granted granted Critical
Publication of AU2008200749B2 publication Critical patent/AU2008200749B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Description

Australian Patents Act 1990 Regulation 3.2 ORIGINAL COMPLETE SPECIFICATION STANDARD PATENT Invention Title "Promoters for regulation of plant gene expression" The following statement is a full description of this invention, including the best method of performing it known to us:- Case S-50015A16/78/NAD 00 PROMOTERS FOR REGULATION OF PLANT GENE EXPRESSION The present invention relates generally to the field of plant molecular biology. More specifically, it relates to the regulation of gene expression in plants.
Manipulation of crop plants to alter and/or improve phenotypic characteristics (such as 't productivity or quality) requires the expression of heterologous genes in plant tissues. Such genetic manipulation relies on the availability of a means to drive and to control gene 00 expression as required. For example, genetic manipulation relies on the availability and use of suitable promoters which are effective in plants and which regulate gene expression so as to give the desired effect(s) in the transgenic plant. It is advantageous to have the choice of a variety of different promoters so that the most suitable promoter may be selected for a particular gene, construct, cell, tissue, plant or environment. Moreover, the increasing interest in cotransforming plants with multiple plant transcription units (PTU) and the potential problems associated with using common regulatory sequences for these purposes merit having a variety of promoter sequences available.
Promoters (and other regulatory components) from bacteria, viruses, fungi and plants have been used to control gene expression in plant cells. Numerous plant transformation experiments using DNA constructs comprising various promoter sequences fused to various foreign genes (for example, bacterial marker genes) have led to the identification of useful promoter sequences. It has been demonstrated that sequences up to 500-1000 bases in most instances are sufficient to allow for the regulated expression of foreign genes. However, it has also been shown that sequences much longer than 1000 bases may have useful features which permit desirable, high, levels of gene expression in transgenic plants.
One desirable source for promoters which have different expression profiles is plant genomic DNA. Plant development is precisely coordinated and regulated through transcription and translation of different gene products in each cell. The expression level for each gene present in a cell not only reflects the physiological status of the cell, but also determines the range of different functions the cell can perform. Identification of genes expressed constitutively, in a specific cell type or tissue, or at a specific developmental stage, and the Case S-50015A/16/78/NAD 00 analysis of the abundance of the corresponding gene product can provide valuable insights into 0 basic molecular processes and identity promoters with desirable properties.
cDNA and high density oligonucleotide array technology allows analysis of mRNA transcripts of hundreds to thousands of genes in parallel (Schena et al., 1995; Chee et al., 1996; Lockhart et al., 1996; DeRisi et al., 1997; Lashkari et al., 1997). In some organisms with completed genome sequences, such as yeast, global gene expression profiling at the mRNA 'j level becomes possible (DeRisi et al., 1997). Genome scale transcription profiling enables not 0 only parallel monitoring of gene expression, but also a more subjective approach for gene 00 discovery because objective selection of gene probes to be put on microarrays is not required 0 (Lockhart and Winzeler, 2000).
c Microarray technology has been successfully developed for studying gene expression in plants (Schena et al., 1995; Desprez et al., 1998; Yuan et al., 1998; Giege et al., 1998; Kehoe et al., 1999). The microarrays used in those studies were cDNA microarrays on glass slides or filter membranes (Duggan et al. 1999; Southern et al. 1999). The DNA probes often consist of DNA fragments of expression sequence tags (ESTs) from various Arabidopsis EST projects Newman et al., 1994, Richmond et al., 2000, Schaffer et al., 2000). Microarrays with selected subsets of gene probes (usually in the hundreds) has been used to examine differences in gene expression during organ development (Yuan et al., 1998; Aharoni et al., 2000), and has revealed genes that are correlated or responsible for the defense response (Reymond et al., 2000).
There is, therefore, a great need in the art for the identification of novel sequences that can be used for expression of selected transgenes in economically important plants. More specifically, there is a need for the systematic identification of genes that are expressed in a particular manner, using microarray technology.
The present invention provides an isolated nucleic acid molecule (polynucleotide) having a plant nucleotide sequence that directs root-specific preferential) transcription of a linked nucleic acid segment in a plant, a linked plant DNA comprising an open reading frame for a structural or regulatory gene. The nucleotide sequence preferably is obtained or isolatable from plant genomic DNA. In particular, the nucleotide sequence is obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has at least -2- Case S-50015A/16/78/NAD 00 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 0 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, C- 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising any one of SEQ ID NOs: 1-51 or a fragment (portion) thereof a promoter isolatable from any one of SEQ ID NOs: 1-51) or to a polypeptide encoded by an Oryza gene comprising SEQ ID NO:825 or 843 or a fragment (portion) thereof a :t promoter isolatable from SEQ ID NO:825 or 843) which directs root-specific transcription of 0 a linked nucleic acid segment. Preferred root-specific promoters comprise DNA obtained or CK1 isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has 00 at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising any one of SEQ ID NOs: 518-526 and 536-544 (which are promoters corresponding to a gene comprising an open reading frame having one of SEQ ID NOs: 358-366), but preferably any one of SEQ ID NOs: 536, 537, and 539-54 or a fragment thereof which directs root-specific transcription.
Also preferred are root-specific promoters comprising DNA obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has at least 70%, e.g., 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising an open reading frame having any one of SEQ ID NOs: 358-366, or a fragment thereof which directs root-specific transcription, or to a polypeptide encoded by an Oryza gene comprising an open reading frame having SEQ ID NO:774 or 792, or a fragment thereof which directs root-specific transcription.
The present invention also provides an isolated nucleic acid molecule having a plant nucleotide sequence that directs constitutive transcription of a linked nucleic acid segment in a host cell, a plant cell. The nucleotide sequence preferably is obtained or isolatable from plant genomic DNA. In particular, the nucleotide sequence is obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, Case S-50015A/16/78/NAD 0 0 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising any one of SEQ ID NOs: 52-339 or a fragment thereof a promoter isolatable from any one of SEQ ID NOs:52-339) which directs constitutive transcription of a linked nucleic acid segment, or to a polypeptide encoded by an Oryza gene comprising any one of SEQ ID NOs: 826-842 or 844-875 or a fragment thereof a promoter isolatable from any one of SEQ ID NOs: 826-842, 844-875) which directs constitutive transcription of a 0 linked nucleic acid segment. Preferred constitutive promoters comprise DNA obtained or C isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has 00 at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, S 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene having any one of SEQ ID NOs: 477-515, 517 and 545-579 (which are promoters corresponding to a gene comprising an open reading frame having one of SEQ ID NOs:441-476 and 527-529), but preferably any one of SEQ ID NOs: 548, 550- 553, 555-558, 560, 565-568, 571-573, 575, 576, 578 and 579, or a fragment thereof which directs constitutive transcription.
Also preferred are constitutive promoters comprising DNA obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising an open reading frame having any one of SEQ ID NOs:441-476 and 527-529 or a fragment thereof which directs constitutive transcription, or to a polypeptide encoded by an Oryza gene comprising an open reading frame having any one of SEQ ID NOs:775-791 or 793-824 or a fragment thereof which directs constitutive transcription.
The present invention further provides an isolated nucleic acid molecule which comprises a plant nucleotide sequence that directs leaf-specific preferential) transcription of a linked nucleic acid segment in a plant. The nucleotide sequence preferably is obtained or isolatable from plant genomic DNA. In particular, the nucleotide sequence is obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has -4- Case S-50015A/16/78/NAD at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 0 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising any one of SEQ ID NOs: 693-773 or a fragment thereof isolatable from any one of SEQ ID NOs:693-773) which directs leaf-specific transcription of a linked nucleic acid segment.
Preferred are leaf specific promoters comprising DNA obtained or isolatable from a gene encoding a polypeptide which is substantially similar, and preferably has at least Se.g., 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising an open reading frame having any one of SEQ ID NOs:601-692 or a fragment thereof which directs leaf-specific transcription.
The invention also provides uses for an isolated nucleic acid molecule, DNA or RNA, comprising a plant nucleotide sequence comprising an open reading frame that is preferentially expressed in leaves, roots or constitutively, and which is substantially similar, and preferably has at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%, amino acid sequence identity, to a polypeptide encoded by an Arabidopsis gene comprising an open reading frame having any one of SEQ ID NOs:358-366, 441-476, 527-529 and 601-692 or the complement thereof, SEQ ID NOs:601-692 comprise the open reading frames corresponding to genes having promoters having one of SEQ ID NOs:693-773, or to a polypeptide encoded by an Oryza gene comprising an open reading frame having any one of SEQ ID NOs:774-824 or the complement thereof. For example, root-specific DNA having open reading frames which encode peroxidases, transport proteins, defense-related proteins, proteins involved in metabolism and DNA binding proteins, and constitutive open reading frames which encode cell cycle proteins, ribosomal proteins, transcription factors, defense-related proteins, stress-related proteins, transport protein, membrane proteins, structural proteins, proteins involved in metabolism, signaling proteins, kinases and synthases, may be useful to prepare plants that over- or underexpress the encoded product or to prepare knockout plants. Also provided are nucleic Case S-50015A/16/78/NAD o acid molecules comprising a nucleotide sequence having an open reading frame comprising C SEQ ID NO:457, 476, or 527 (constitutive) or SEQ ID NO:602, 604, 609-610 (leaf). These sequences, while being useful to over- or underexpress the encoded product, or prepare knockout plants, may be used as a control for genes that are constitutively expressed or in a leaf-specific manner.
The promoters and open reading frames of the invention can be identified by employing an array of nucleic acid samples, each sample having a plurality of oligonucleotides, and Seach plurality corresponding to a different plant gene, on a solid substrate, a DNA chip, 00 and probes corresponding to nucleic acid expressed in, for example, one or more plant tissues 0 and/or at one or more developmental stages, or probes corresponding to nucleic acid expressed in the cells of the leaves or root of a plant relative to control nucleic acid from cellular sources other than leaves or root. Thus, genes that are upregulated or downregulated in the majority of tissues at a majority of developmental stages, or upregulated or downregulated in one tissue such as in root or in leaves, can be systematically identified.
As described herein, GeneChip® technology was utilized to discover genes that are preferentially (or exclusively) expressed in various tissues including root and leaf, as well as those that are constitutively expressed, using labeled cRNA probes, determining expression levels by laser scanning and generally selecting for expression levels that were 2 fold over the control. The Arabidopsis oligonucleotide probe array consists of probes from about 8,100 unique Arabidopsis genes, which covers approximately one third of the genome. This genome array permits a broader, more complete and less biased analysis of gene expression. Using this approach, 51 genes were identified, the expression of which was altered, elevated, in root tissues, and 92 genes were identified, the expression of which was altered at least 4-fold in leaf tissue. Similarly, 288 genes were identified that were constitutively expressed.
Generally, the promoters of the invention may be employed to express an open reading frame from an insect resistance gene, a bacterial disease resistance gene, a fungal disease resistance gene, a viral disease resistance gene, a nematode disease resistance gene, a herbicide resistance gene, a gene affecting grain composition or quality, a nutrient utilization gene, a mycotoxin reduction gene, a male sterility gene, a selectable marker gene, a screenable marker gene, a negative selectable marker, a gene affecting plant agronomic characteristics, or an environment or stress resistance gene, one or more genes that confer herbicide resistance Case S-50015A16/78/NAD 0 0 or tolerance, insect resistance or tolerance, disease resistance or tolerance (viral, bacterial, fungal, oomycete, or nematode), stress tolerance or resistance (as exemplified by resistance or tolerance to drought, heat, chilling, freezing, excessive moisture, salt stress, or oxidative stress), increased yields, food content and makeup, physical appearance, male sterility, drydown, standability, prolificacy, starch properties or quantity, oil quantity and quality, amino acid or protein composition, and the like. By "resistant" is meant a plant which exhibits Ssubstantially no phenotypic changes as a consequence of agent administration, infection with a pathogen, or exposure to stress. By "tolerant" is meant a plant which, although it may exhibit some phenotypic changes as a consequence of infection, does not have a substantially decreased reproductive capacity or substantially altered metabolism.
In particular, root-specific promoters may be useful for expressing defense-related genes, including those conferring insecticidal resistance and stress tolerance genes, salt, cold or drought tolerance, and genes for altering nutrient uptake, and leaf-specific promoters may be useful for producing large quantities of protein, for expressing oils or proteins of interest, genes for increasing the nutritional value of a plant, and for expressing defense-related genes against pathogens such as a virus or fungus), including genes encoding insecticidal polypeptides. Constitutive promoters are useful for expressing a wide variety of genes including those which alter metabolic pathways, confer disease resistance, for protein production, antibody production, or to improve nutrient uptake. Constitutive promoters may be modified so as to be regulatable, inducible. The genes and promoters described hereinabove can be used to identify orthologous genes and their promoters which are also likely expressed in a particular tissue and/or development manner. Moreover, the orthologous promoters are useful to express linked open reading frames. In addition, by aligning the promoters of these orthologs, novel cis elements can be identified that are useful to generate synthetic promoters.
Hence, the isolated nucleic acid molecules of the invention include the orthologs of the Arabidopsis sequences disclosed herein, the corresponding nucleotide sequences in organisms other than Arabidopsis, including, but not limited to, plants other than Arabidopsis, preferably cereal plants, corn, wheat, rye, turfgrass, sorghum, millet, sugarcane, soybean, barley, alfalfa, sunflower, canola, soybean, cotton, peanut, tobacco, sugarbeet, or rice. An orthologous gene is a gene from a different species that encodes a product having the same or Case S-50015A/16/78/NAD Ssimilar function, catalyzing the same reaction as a product encoded by a gene from a reference organism. Thus, an ortholog includes polypeptides having less than, 65% amino acid sequence identity, but which ortholog encodes a polypeptide having the same or similar function. Databases such GenBank or one found at http://bioserver.myongjiac.kr/rjce.html (for rice) may be employed to identify sequences related to the Arabidopsis sequences, e.g., orthologs in cereal crops such as rice, wheat, sunflower or alfalfa. SEQ ID NOs:598-600, for Sexample, are the rice promoter, open reading frame and amino acid sequence for rice O polyubiquitin, the ortholog of the Arabidopsis gene comprising SEQ ID NO:155. For example, SEQ ID NOs:774 and 792 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:360; SEQ ID NOs:789-790, 799, and 813 are rice orthologs of the Arabidopsis Sgene comprising SEQ ID NO:441; SEQ ID NOs: 781, 804-805, 810, 816-817, and 822 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:442; SEQ ID NOs:777, 782- 783, 806, and 820 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:443; SEQ ID NOs:791, 793, and 808 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:446; SEQ ID NO:795 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:449; SEQ ID NOs:776, 784, 787, 800, and 807 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:450; SEQ ID NO:779 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:451; SEQ ID NO:803 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:454; SEQ ID NOs:788 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:458; SEQ ID NO:786 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:465; SEQ ID NOs:775, 778, and 814-815 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:466; SEQ ID NOs:785 and 798 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:467; SEQ ID NOs:794, 809, 812 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:471; SEQ ID NO:797 is a rice ortholog of the Arabidopsis gene comprising SEQ ID NO:472; SEQ ID NOs:780, 796, 802, 819, 821, and 823 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:527; SEQ ID NOs:811 and 824 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:528; and SEQ ID NOs:801 and 818 are rice orthologs of the Arabidopsis gene comprising SEQ ID NO:529 (Table 14). Additional orthologs of Arabidopsis genes herein are identified herein, such as rice orthologs for SEQ ID NOs:359-360, 441-443, 446-447, 449-450, 465-467 and 527-529; corn orthologs for SEQ ID NOs:360, 441-442, 465-467, 527, 529; wheat Case S.50015A/16/78NAD o orthologs for SEQ ID NOs:441-442; sunflower orthologs for SEQ ID NOs:441-442; and N alfalfa orthologs for SEQ ID NOs:365 and 529 (Table 15). Alternatively, recombinant DNA techniques such as hybridization or PCR may be employed to identify sequences related to the Arabidopsis sequences or to clone the equivalent sequences from different Arabidopsis DNAs.
The encoded ortholog products likely have at least 70% sequence identity to each other.
Hence, the invention includes an isolated nucleic acid molecule comprising a nucleotide sequence from a gene that encodes a polypeptide having at least 70% identity to a polypeptide encoded by a gene having one or more of the Arabidopsis or Oryza sequences disclosed OO herein. For example, promoter sequences within the scope of the invention are those which direct expression of an open reading frame which encodes a polypeptide that is substantially similar to an Arabidopsis polypeptide encoded by a gene having a promoter selected from the group consisting of SEQ ID NOs:1-339, 447-515, 517-526, 536-579 and 693-773 or a polypeptide that is substantially similar to an Oryza polypeptide encoded by a gene having a promoter selected from the group consisting of SEQ ID NOs:825-875.
Preferably, the promoters of the invention include a consecutive stretch of about 25 to 2000, including 50 to 500 or 100 to 250, and up to 1000 or 1500, contiguous nucleotides, e.g., to about 743, 60 to about 743, 125 to about 743, 250 to about 743, 400 to about 743, 600 to about 743, of any one of SEQ ID NOs:l-339, 477-515, 517-526, 536-579, and 693-773, or the promoter orthologs thereof, SEQ ID NOs: 825-875, which include the minimal promoter region.
In a particular embodiment of the invention said consecutive stretch of about 25 to 2000, including 50 to 500 or 100 to 250, and up to 1000 or 1500, contiguous nucleotides, e.g., to about 743, 60 to about 743, 125 to about 743, 250 to about 743, 400 to about 743, 600 to about 743, has at least 75%, preferably 80%, more preferably 90% and most preferably sequence identity with a corresponding consecutive stretch of about 25 to 2000, including to 500 or 100 to 250, and up to 1000 or 1500, contiguous nucleotides, 40 to about 743, to about 743, 125 to about 743, 250 to about 743, 400 to about 743, 600 to about 743, of any one of SEQ ID NOs:1-339, 477-515, 517-526, 536-579, and 693-773, or the promoter orthologs thereof, which include the minimal promoter region.
In a preferred embodiment of the invention said consecutive stretch of about 25 to 2000, including 50 to 500 or 100 to 250, and up to 1000 or 1500, contiguous nucleotides, e.g., Case S.50015AJ16/78iNAD C 40 to about 743, 60 to about 743, 125 to about 743, 250 to about 743, 400 to about 743, 600 N to about 743, has at least 75%, preferably 80%, more preferably 90% and most preferably Ssequence identity with a corresponding consecutive stretch of about 25 to 2000, including Sto 500 or 100 to 250, and up to 1000 or 1500, contiguous nucleotides, 40 to about 743, to about 743, 125 to about 743, 250 to about 743, 400 to about 743, 600 to about 743, of any one of SEQ ID NOs: 536-579, preferably of any one of SEQ ID Nos: 536; 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs thereof, which include the minimal promoter region.
00 Preferably, the nucleotide sequence that includes the promoter region includes at least 0 one copy of a TATA box and, for leaf-specific expression, preferably a light responsive element, SEQ ID NO:587. Thus, the invention provides plant promoters, including orthologs of Arabidopsis promoters corresponding to any one of SEQ ID NOs: 1-339, 477- 515, 517-526, 536-579, 693-773, SEQ ID NOs:825-875 and orthologs thereof. The present invention further provides a composition, an expression cassette or a recombinant vector containing the nucleic acid molecule of the invention, and host cells comprising the expression cassette or vector, comprising a plasmid. In particular, the present invention provides an expression cassette or a recombinant vector comprising a promoter of the invention linked to a nucleic acid segment which, when present in a plant, plant cell or plant tissue, results in transcription of the linked nucleic acid segment.
In its broadest sense, the term "substantially similar" when used herein with respect to a nucleotide sequence means that the nucleotide sequence is part of a gene which encodes a polypeptide having substantially the same structure and function as a polypeptide encoded by a gene for the reference nucleotide sequence, the nucleotide sequence comprises a promoter from a gene that is the ortholog of the gene corresponding to the reference nucleotide sequence, as well as promoter sequences that are structurally related the promoter sequences particularly exemplified herein, the substantially similar promoter sequences hybridize to the complement of the promoter sequences exemplified herein under high or very high stringency conditions. The term "substantially similar" thus includes nucleotide sequences wherein the sequence has been modified, for example, to optimize expression in particular cells, as well as nucleotide sequences encoding a variant polypeptide having one or more amino acid substitutions relative to the (unmodified) polypeptide encoded by the reference sequence, Case S-50015A/16/78/NAD 0 which substitution(s) does not alter the activity of the variant polypeptide relative to the unmodified polypeptide. In its broadest sense, the term "substantially similar" when used herein with respect to polypeptide means that the polypeptide has substantially the same structure and function as the reference polypeptide. The percentage of amino acid sequence identity between the substantially similar and the reference polypeptide is at least 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, and even 90% or more, 91%, 92%, 93%, 0 94%, 95%, 96%, 97%, 98%, up to at least 99%, wherein the reference polypeptide is an 00 Arabidopsis polypeptide encoded by a gene with a promoter having any one of SEQ ID S NOs:1-339, 477-515, 517-526, 536-579, and 693-773, a nucleotide sequence comprising an open reading frame having any one of SEQ ID NOs: 358-366, 441-476, 527-529 or 601- 692, or wherein the reference polypeptide is an Oryza polypeptide encoded by a gene with a promoter having any one of SEQ ID NOs:825-875. One indication that two polypeptides are substantially similar to each other, besides having substantially the same function, is that an agent, an antibody, which specifically binds to one of the polypeptides, specifically binds to the other.
Sequence comparisons maybe carried out using a Smith-Waterman sequence alignment algorithm (see Waterman (1995) or http://www hto.usc.edu/software/seqaln/index.html).
The localS program, version 1.16, is preferably used with following parameters: match: 1, mismatch penalty: 0.33, open-gap penalty: 2, extended-gap penalty: 2. Further, a nucleotide sequence that is "substantially similar" to a reference nucleotide sequence hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in 2X SSC, 0.1 SDS at 50°C, more desirably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in IX SSC, 0.1% SDS at 50 0 C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50'C with washing in 0.5X SSC, 0.1% SDS at 50 0 C, preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in 0.1 X SSC, 0.1% SDS at 50°C, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65 0
C.
The invention also provides sense and anti-sense nucleic acid molecules corresponding to the open reading frames identified herein as well as their orthologs. Also provided are 11- Case S-50015A/16/78/NAD C compositions, expression cassettes, recombinant vectors, and host cells, comprising the N nucleic acid molecule which comprises a nucleic acid segment which encodes a polypeptide g which is preferentially expressed in leaves or roots SEQ ID NOs:358-366, 441-476, 527- 529, 774, 729 and 601-692), or constitutively expressed, in either sense or antisense orientation.
In one embodiment, the invention provides an expression cassette or vector containing an isolated nucleic acid molecule having a nucleotide sequence that directs root-specific, constitutive, or leaf-specific transcription of a linked nucleic acid segment in a cell, which 00 nucleotide sequence is from a gene which encodes a polypeptide having, at least identity to an Arabidopsis polypeptide encoded by a gene having one of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579 or 693-773, preferably one of SEQ ID NOs: 536-579, more preferably one of SEQ ID Nos: 536; 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs thereof, SEQ ID NOs:825-875, and which nucleotide sequence is optionally operably linked to other suitable regulatory sequences, a transcription terminator sequence, operator, repressor binding site, transcription factor binding site and/or an enhancer. This expression cassette or vector may be contained in a host cell. The expression cassette or vector may augment the genome of a transformed plant or may be maintained extrachromosomally. The expression cassette may be operatively linked to a structural gene, the open reading frame thereof, or a portion thereof. The expression cassette may further comprise a Ti plasmid and be contained in an Agrobacterium tumefaciens cell; it may be carried on a microparticle, wherein the microparticle is suitable for ballistic transformation of a plant cell; or it may be contained in a plant cell or protoplast. Further, the expression cassette or vector can be contained in a transformed plant or cells thereof, and the plant may be a dicot or a monocot. In particular, the plant may be a cereal plant.
The present invention further provides a method of augmenting a plant genome by contacting plant cells with a nucleic acid molecule of the invention, one having a nucleotide sequence that directs root-specific, constitutive or leaf-specific transcription of a linked nucleic acid segment isolatable or obtained from a plant gene encoding a polypeptide that is substantially similar to a polypeptide encoded by the an Arabidopsis gene having a sequence according to any one of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579, or 693-773, preferably to any one of SEQ ID NOs: 536-579, more preferably to any one of SEQ ID Nos: 536; 537; 539-542; 12-
I
Case S-50015A/16/78/NAD 0 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs N thereof, SEQ ID NOs:825-875, so as to yield transformed plant cells; and regenerating Sthe transformed plant cells to provide a differentiated transformed plant, wherein the differentiated transformed plant expresses the nucleic acid molecule in the cells of the plant.
The nucleic acid molecule may be present in the nucleus, chloroplast, mitochondria and/or plastid of the cells of the plant. The present invention also provides a transgenic plant prepared Sby this method, a seed from such a plant and progeny plants from such a plant including hybrids and inbreds. Preferred transgenic plants are transgenic maize, soybean, barley, alfalfa, 00 sunflower, canola, soybean, cotton, peanut, sorghum, tobacco, sugarbeet, rice, wheat, rye, 0 turfgrass, millet, sugarcane, tomato, or potato.
A transformed (transgenic) plant of the invention includes plants, the genome of which is augmented by a nucleic acid molecule of the invention, or in which the corresponding gene has been disrupted, to result in a loss, a decrease or an alteration, in the function of the product encoded by the gene, which plant may also have increased yields and/or produce a better-quality product than the corresponding wild-type plant. The nucleic acid molecules of the invention are thus useful for targeted gene disruption, as well as markers and probes.
The invention also provides a method of plant breeding, to prepare a crossed fertile transgenic plant. The method comprises crossing a fertile transgenic plant comprising a particular nucleic acid molecule of the invention with itself or with a second plant, one lacking the particular nucleic acid molecule, to prepare the seed of a crossed fertile transgenic plant comprising the particular nucleic acid molecule. The seed is then planted to obtain a crossed fertile transgenic plant. The plant may be a monocot or a dicot. In a particular embodiment, the plant is a cereal plant.
The crossed fertile transgenic plant may have the particular nucleic acid molecule inherited through a female parent or through a male parent. The second plant may be an inbred plant. The crossed fertile transgenic may be a hybrid. Also included within the present invention are seeds of any of these crossed fertile transgenic plants.
The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate progeny plants. Depending on the desired properties different breeding measures are taken. The relevant techniques are well known in the art and include but are not limited to 13- Case S-50015A/16/78/NAD o hybridization, inbreeding, backcross breeding, multiline breeding, variety blend, interspecific N hybridization, aneuploid techniques, etc. Hybridization techniques also include the sterilization Sof plants to yield male or female sterile plants by mechanical, chemical or biochemical means.
Cross pollination of a male sterile plant with polen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines. Thus, the transgenic plants according to the invention can be used for the breeding of improved plant lines that for example increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow to dispense with said methods due to their oo modified genetic properties. Alternatively new crops with improved stress tolerance can be obtained that, due to their optimized genetic "equipment", yield harvested product of better quality than products that were not able to tolerate comparable adverse developmental conditions.
The present invention also provides a method to identify a nucleotide sequence that directs root-specific transcription of linked nucleic acid in the genome of a plant cell by contacting a probe of plant nucleic acid, cRNA, isolated from root as well as other tissues of a plant, with a plurality of isolated nucleic acid samples on one or more, a plurality of, solid substrates so as to form a complex between at least a portion of the probe and a nucleic acid sample(s) having sequences that are structurally related to the sequences in the probe.
Each sample comprises one or a plurality of oligonucleotides corresponding to at least a portion of a plant gene. Then complex formation is compared between samples contacted with the root-specific probe and samples contacted with a non-root specific probe so as to determine which RNAs are expressed in root tissues of the plant. The probe and/or samples may be nucleic acid from a dicot or from a monocot.
The present invention also provides a method to identify a nucleotide sequence that directs constitutive transcription of nucleic acid in the genome of a plant cell by contacting a probe of plant nucleic acid, cRNA, isolated from various tissues of a plant and at various developmental stages with a plurality of isolated nucleic acid samples on one or more, a plurality of, solid substrates so as to form a complex between at least a portion of the probe and a nucleic acid sample(s) having sequences that are structurally related to the sequences in the probe. Each sample comprises one or a plurality of oligonucleotides corresponding to at least a portion of a plant gene. Complex formation is then compared to determine which 14- Case S-50015A/16/78/NAD RNAs are present in a majority of, preferably in substantially all, tissues, in a majority of, C preferably at substantially all, developmental stages of the plant. The probe and/or samples i may be nucleic acid from a dicot or from a monocot.
C
The invention also provides a gene, the expression of which is useful to normalize the expression of other genes. When performing gene expression quantitative analysis, it is important to normalize the gene expression of the unknown to a known constitutive expressing gene. To achieve accurate relative quantification for the measurement of gene expression in Ssamples, the expression of the gene of interest is compared to the expression of a gene whose O expression does not vary with experimental treatment. This comparison is essential for 0 accurate relative quantification because this normalization process eliminates any remaining Serror that may arise from sample quality variance. Using methodologies described herein, two genes were identified, APX3 and TRX3 (ascorbate peroxidase and thioredoxin), whose expression does not vary upon virus infection, bacterial infection or between different tissue types. Probes and primer sets were prepared to measure the expression levels of these genes using quantitative PCR. Whereas the expression level of a pathogenesis related gene in infected Arabidopsis rises upon infection compared to the same gene in the noninfected control plant, the expression levels of APX3 and TRX3 remained consistent in mock and experimentally treated plants. APX3 and TRX3 gene expression levels also remained consistent between normal and cold-treated plants. These genes and their plant kingdom orthologs are useful as normalization standards for quantitative gene expression analysis in Arabidopsis, as well as other dicots and monocots.
The present invention also provides a method to identify a nucleotide sequence that directs transcription of nucleic acid in the genome of a plant cell in leaf tissue, by contacting a probe of plant nucleic acid, cRNA, isolated from leaf as well as other tissues of a plant with a plurality of isolated nucleic acid samples on one or more, a plurality of, solid substrates, so as to form a complex between at least a portion of the probe and a nucleic acid sample(s) having sequences that are structurally related to the sequences in the probe. Each sample comprises one or a plurality of, oligonucleotides corresponding to at least a portion of a plant gene. Then complex formation is determined or detected to identify which samples Case S-50015A16/78/NAD 0 represent genes that are expressed in leaf. The probe and/or samples may be nucleic acid from C" a dicot or from a monocot.
SThe compositions of the invention include plant nucleic acid molecules, and the amino acid sequences for the polypeptides or partial-length polypeptides encoded by the nucleic acid molecule which comprises an open reading frame. These sequences can be employed to alter expression of a particular gene corresponding to the open reading frame by decreasing or eliminating expression of that plant gene or by overexpressing a particular gene product.
0 Methods of this embodiment of the invention include stably transforming a plant with the 00 nucleic acid molecule which includes an open reading frame operably linked to a promoter capable of driving expression of that open reading frame (sense or antisense) in a plant cell. By "portion" or "fragment", as it relates to a nucleic acid molecule which comprises an open reading frame or a fragment thereof encoding a partial-length polypeptide having the activity of the full length polypeptide, is meant a sequence having at least 80 nucleotides, more preferably at least 150 nucleotides, and still more preferably at least 400 nucleotides. If not employed for expressing, a "portion" or "fragment" means at least 9, preferably 12, more preferably 15, even more preferably at least 20, consecutive nucleotides, probes and primers (oligonucleotides), corresponding to the nucleotide sequence of the nucleic acid molecules of the invention. Thus, to express a particular gene product, the method comprises introducing to a plant, plant cell, or plant tissue an expression cassette comprising a promoter linked to an open reading frame so as to yield a transformed differentiated plant, transformed cell or transformed tissue. Transformed cells or tissue can be regenerated to provide a transformed differentiated plant. The transformed differentiated plant or cells thereof preferably expresses the open reading frame in an amount that alters the amount of the gene product in the plant or cells thereof, which product is encoded by the open reading frame. The present invention also provides a transformed plant prepared by the method, progeny and seed thereof.
The invention further includes a nucleotide sequence which is complementary to one (hereinafter "test" sequence) which hybridizes under stringent conditions with a nucleic acid molecule of the invention as well as RNA which is transcribed from the nucleic acid molecule.
When the hybridization is performed under stringent conditions, either the test or nucleic acid molecule of invention is preferably supported, on a membrane or DNA chip. Thus, either a denatured test or nucleic acid molecule of the invention is preferably first bound to a support 16- Case S-50015A/16/78/NAD C and hybridization is effected for a specified period of time at a temperature of, between N and 70 0 C, in double strength citrate buffered saline (SC) containing 0.1% SDS followed by rinsing of the support at the same temperature but with a buffer having a reduced SC concentration. Depending upon the degree of stringency required such reduced concentration buffers are typically single strength SC containing 0.1 SDS, half strength SC containing 0.1% SDS and one-tenth strength SC containing 0.1% SDS.
A computer readable medium containing one or more of the nucleotide sequences of the invention as well as methods of use for the computer readable medium are provided. This 00 medium allows a nucleotide sequence corresponding to at least one of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579, 693-773 or 825-875 (promoters), and 358-366, 441-476, 527- I 529, 601-692 or 774-824 (open reading frames), to be used as a reference sequence to search against a database. This medium also allows for computer-based manipulation of a nucleotide sequence corresponding to at least one of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579, 693-773 or 825-875 and 358-366, 441-476, 527-529, 601-692 or 774-824.
In accordance with the present invention, nucleic acid constructs are provided that allow initiation of transcription in a "root-specific" or "leaf-specific" manner. Constructs of the invention comprise regulated transcription initiation regions associated with protein translation elongation, and the compositions of the present invention are drawn to novel nucleotide sequences for root-specific as well as leaf-specific expression. The present invention thus provides for isolated nucleic acid molecules comprising a plant nucleotide sequence that directs root-specific or leaf-specific transcription of a linked nucleic acid fragment in a plant cell. Preferably, nucleotide sequence is obtained from plant genomic DNA from a gene encoding a polypeptide which is substantially similar and preferably has, at least 70% amino acid sequence identity to a polypeptide encoded by an Arabidopsis gene having any one of SEQ ID NOs: 1-51, 518-526 and 536-544 (root-specific promoters) or orthologs thereof, SEQ ID Nos:825 or 843, or 693-773 (leaf-specific promoters) or a fragment thereof which directs root- or leaf-specific expression, respectively. Thus, these nucleotide sequences exhibit promoter activity in root or leaf tissues. Root-specific or leafspecific promoters may be obtained from other plant species by using the Arabidopsis promoter or corresponding genes sequences described herein as probes to screen for -17- Case S-50015AI16/78/NAD 0 homologous structural genes in other plants by hybridization under low, moderate or stringent N hybridization conditions. Regions of the tissue-specific promoter sequences of the present invention which are conserved among species could also be used as PCR primers to amplify a segment from a species other than Arabidopsis, and that segment used as a hybridization probe (the latter approach permitting higher stringency screening) or in a transcriptional assay to determine promoter activity. Moreover, the tissue-specific sequences could be employed to identify structurally related sequences in a database using computer algorithms.
SThese promoters are capable of driving the expression of a coding sequence in a target 00 cell, particularly in a plant cell. The promoter sequences and methods disclosed herein are useful in regulating tissue-specific expression of any heterologous nucleotide sequence in a host plant in order to vary the phenotype of that plant. These promoters can be used with combinations of enhancer, upstream elements, and/or activating sequences from the 5' flanking regions of plant expressible structural genes. Similarly the upstream element can be used in combination with various plant promoter sequences.
Also in accordance with the present invention, nucleic acid constructs are provided that allow initiation of transcription in a "tissue-independent," "tissue general," or "constitutive" manner. Constructs of this embodiment invention comprise regulated transcription initiation regions associated with protein translation elongation and the compositions of this embodiment of the present invention are drawn to novel nucleotide sequences for tissue-independent, tissue-general, or constitutive plant promoters. By "tissue-independent," "tissue-general," or "constitutive" is intended expression in the cells throughout a plant at most times and in most tissues. As with other promoters classified as "constitutive" ubiquitin), some variation in absolute levels of expression can exist among different tissues or stages.
The present invention thus provides for isolated nucleic acid molecules comprising a plant nucleotide sequence that directs constitutive transcription of a linked nucleic acid fragment in a plant cell. Preferably, the nucleotide sequence is obtained from plant genomic DNA from a gene encoding a polypeptide which is substantially similar and preferably has, e.g.
at least 70% amino acid sequence identity to a polypeptide encoded by an Arabidopsis gene having any one of SEQ ID NOs:52-339, 477-515, 517, 545-579, 826-842, 844-875 or a fragment thereof which exhibits promoter activity in a constitutive fashion at most times and in most tissues). Constitutive promoter sequences may be obtained from other plant 18- Case S-50015A 16/78/N AD 00 0o species by using the constitutive Arabidopsis promoter sequences or corresponding genes N described herein as probes to screen for homologous structural genes in other plants by hybridization under low, moderate or stringent hybridization conditions. Regions of the t'n constitutive promoter sequences of the present invention which are conserved among species could also be used as PCR primers to amplify a segment from a species other than Arabidopsis, and that segment used as a hybridization probe (the latter approach permitting higher stringency screening) or in a transcription assay to determine promoter activity.
SMoreover, the constitutive promoter sequences could be employed to identify structurally 00 related sequences in a database using computer algorithms.
SThese constitutive promoters are capable of driving the expression of a coding sequence in a target cell, particularly in a plant cell. The promoter sequences and methods disclosed herein are useful in regulating constitutive expression of any heterologous nucleotide sequence in a host plant in order to vary the phenotype of that plant. These promoters can be used with combinations of enhancer, upstream elements, and/or activating sequences from the flanking regions of plant expressible structural genes. Similarly the upstream element can be used in combination with various plant promoter sequences. In one embodiment the promoter and upstream element are used together to obtain at least 10-fold higher expression of an introduced gene in monocot transgenic plants than is obtained with the maize ubiquitin 1 promoter.
In particular, all of the promoters of the invention are useful to modify the phenotype of a plant. Various changes in the phenotype of a transgenic plant are desirable, modifying the fatty acid composition in a plant, altering the amino acid content of a plant, altering a plant's pathogen defense mechanism, and the like. These results can be achieved by providing expression of heterologous products or increased expression of endogenous products in plants.
Alternatively, the results can be achieved by providing for a reduction of expression of one or more endogenous products, particularly enzymes or cofactors in the plant. These changes result in an alteration in the phenotype of the transformed plant.
Definitions The term "gene" is used broadly to refer to any segment of nucleic acid associated with a biological function. Thus, genes include coding sequences and/or the regulatory sequences -19- Case S-50015A16/78/NAD o required for their expression. For example, gene refers to a nucleic acid fragment that expresses mRNA or functional RNA, or encodes a specific protein, and which includes Sregulatory sequences. Genes also include nonexpressed DNA segments that, for example, it form recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted C sequence information, and may include sequences designed to have desired parameters.
The term "native" or "wild type" gene refers to a gene that is present in the genome of 0 an untransformed cell, a cell not having a known mutation.
oO A "marker gene" encodes a selectable or screenable trait.
The term "chimeric gene" refers to any gene that contains 1) DNA sequences, including regulatory and coding sequences, that are not found together in nature, or 2) sequences encoding parts of proteins not naturally adjoined, or 3) parts of promoters that are not naturally adjoined. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or comprise regulatory sequences and coding sequences derived from the same source, but arranged in a manner different from that found in nature.
A "transgene" refers to a gene that has been introduced into the genome by transformation and is stably maintained. Transgenes may include, for example, genes that are either heterologous or homologous to the genes of a particular plant to be transformed.
Additionally, transgenes may comprise native genes inserted into a non-native organism, or chimeric genes. The term "endogenous gene" refers to a native gene in its natural location in the genome of an organism. A "foreign" gene refers to a gene not normally found in the host organism but that is introduced by gene transfer.
An "oligonucleotide" corresponding to a nucleotide sequence of the invention, for use in probing or amplification reactions, may be about 30 or fewer nucleotides in length 9, 12, 15, 18, 20, 21 or 24, or any number between 9 and 30). Generally specific primers are upwards of 14 nucleotides in length. For optimum specificity and cost effectiveness, primers of 16 to 24 nucleotides in length may be preferred. Those skilled in the art are well versed in the design of primers for use processes such as PCR. If required, probing can be done with entire restriction fragments of the gene disclosed herein which may be 100's or even 1000's of nucleotides in length.
Case S-50015A16/78/NAD 0 0 The terms "protein," "peptide" and "polypeptide" are used interchangeably herein.
SThe nucleotide sequences of the invention can be introduced into any plant. The genes to be introduced can be conveniently used in expression cassettes for introduction and L expression in any plant of interest. Such expression cassettes will comprise the transcriptional initiation region of the invention linked to a nucleotide sequence of interest. Preferred promoters include constitutive, tissue-specific, developmental-specific, inducible and/or viral promoters. Such an expression cassette is provided with a plurality of restriction sites for 0 insertion of the gene of interest to be under the transcriptional regulation of the regulatory C regions. The expression cassette may additionally contain selectable marker genes. The 00 cassette will include in the direction of transcription, a transcriptional and translational Sinitiation region, a DNA sequence of interest, and a transcriptional and translational termination region functional in plants. The termination region may be native with the transcriptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions.
See also, Guerineau et al., 1991; Proudfoot, 1991; Sanfacon et al., 1991; Mogen et al., 1990; Munroe et al., 1990; Ballas et al., 1989; Joshi et al., 1987.
"Coding sequence" refers to a DNA or RNA sequence that codes for a specific amino acid sequence and excludes the non-coding sequences. It may constitute an "uninterrupted coding sequence", lacking an intron, such as in a cDNA or it may include one or more introns bounded by appropriate splice junctions. An "intron" is a sequence of RNA which is contained in the primary transcript but which is removed through cleavage and re-ligation of the RNA within the cell to create the mature mRNA that can be translated into a protein.
The terms "open reading frame" and "ORF" refer to the amino acid sequence encoded between translation initiation and termination codons of a coding sequence. The terms "initiation codon" and "termination codon" refer to a unit of three adjacent nucleotides ('codon') in a coding sequence that specifies initiation and chain termination, respectively, of protein synthesis (mRNA translation).
A "functional RNA" refers to an antisense RNA, ribozyme, or other RNA that is not translated.
The term "RNA transcript" refers to the product resulting from RNA polymerase -21 Case S-50015A/16/78/NAD C catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect N complementary copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA sequence derived from posttranscriptional processing of the primary transcript and is referred to as the mature RNA. "Messenger RNA" (mRNA) refers to the RNA that is without introns and that can be translated into protein by the cell. "cDNA" refers to a single- or a double-stranded DNA that is complementary to and derived from mRNA.
S"Regulatory sequences" and "suitable regulatory sequences" each refer to nucleotide sequences located upstream non-coding sequences), within, or downstream non-coding O0 sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences include enhancers, promoters, translation leader sequences, introns, and polyadenylation signal sequences. They include natural and synthetic sequences as well as sequences which may be a combination of synthetic and natural sequences. As is noted above, the term "suitable regulatory sequences" is not limited to promoters.
non-coding sequence" refers to a nucleotide sequence located 5' (upstream) to the coding sequence. It is present in the fully processed mRNA upstream of the initiation codon and may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency (Turner et al., 1995).
non-coding sequence" refers to nucleotide sequences located 3' (downstream) to a coding sequence and include polyadenylation signal sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht et al., 1989.
The term "translation leader sequence" refers to that DNA sequence portion of a gene between the promoter and coding sequence that is transcribed into RNA and is present in the fully processed mRNA upstream of the translation start codon. The translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency.
The term "mature" protein refers to a post-translationally processed polypeptide without its signal peptide. "Precursor" protein refers to the primary product of translation of 22 Case S-50015A/16/78/NAD 00 an mRNA. "Signal peptide" refers to the amino terminal extension of a polypeptide, which is O translated in conjunction with the polypeptide forming a precursor peptide and which is required for its entrance into the secretory pathway. The term "signal sequence" refers to a fC nucleotide sequence that encodes the signal peptide.
The term "intracellular localization sequence" refers to a nucleotide sequence that encodes an intracellular targeting signal. An "intracellular targeting signal" is an amino acid sequence that is translated in conjunction with a protein and directs it to a particular sub- 0 cellular compartment. "Endoplasmic reticulum (ER) stop transit signal" refers to a carboxy- C] terminal extension of a polypeptide, which is translated in conjunction with the polypeptide and 00 Scauses a protein that enters the secretory pathway to be retained in the ER. "ER stop transit sequence" refers to a nucleotide sequence that encodes the ER targeting signal. Other intracellular targeting sequences encode targeting signals active in seeds and/or leaves and vacuolar targeting signals.
"Promoter" refers to a nucleotide sequence, usually upstream to its coding sequence, which controls the expression of the coding sequence by providing the recognition for RNA polymerase and other factors required for proper transcription. "Promoter" includes a minimal promoter that is a short DNA sequence comprised of a TATA box and other sequences that serve to specify the site of transcription initiation, to which regulatory elements are added for control of expression. "Promoter" also refers to a nucleotide sequence that includes a minimal promoter plus regulatory elements that is capable of controlling the expression of a coding sequence or functional RNA. This type of promoier sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a DNA sequence which can stimulate promoter activity and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue specificity of a promoter. It is capable of operating in both orientations (normal or flipped), and is capable of functioning even when moved either upstream or downstream from the promoter. Both enhancers and other upstream promoter elements bind sequence-specific DNA-binding proteins that mediate their effects. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even be comprised of synthetic DNA segments. A promoter may also contain DNA sequences that are involved in the binding of 23 Case S-50015A16/78[NAD 0 protein factors which control the effectiveness of transcription initiation in response to C physiological or developmental conditions.
The "initiation site" is the position surrounding the first nucleotide that is part of the transcribed sequence, which is also defined as position With respect to this site all other sequences of the gene and its controlling regions are numbered. Downstream sequences further protein encoding sequences in the 3' direction) are denominated positive, while upstream sequences (mostly of the controlling regions in the 5' direction) are denominated negative.
00 Promoter elements, particularly a TATA element, that are inactive or that have greatly reduced promoter activity in the absence of upstream activation are referred to as "minimal or core promoters." In the presence of a suitable transcription factor, the minimal promoter functions to permit transcription. A "minimal or core promoter" thus consists only of all basal elements needed for transcription initiation, a TATA box and/or an initiator.
"Constitutive expression" refers to expression using a constitutive or regulated promoter. "Conditional" and "regulated expression" refer to expression controlled by a regulated promoter.
"Constitutive promoter" refers to a promoter that is able to express the open reading frame (ORF) that it controls in all or nearly all of the plant tissues during all or nearly all developmental stages of the plant. Each of the transcription-activating elements do not exhibit an absolute tissue-specificity, but mediate transcriptional activation in most plant parts at a level of of the level reached in the part of the plant in which transcription is most active.
"Regulated promoter" refers to promoters that direct gene expression not constitutively, but in a temporally- and/or spatially-regulated manner, and includes both tissuespecific and inducible promoters. It includes natural and synthetic sequences as well as sequences which may be a combination of synthetic and natural sequences. Different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. New promoters of various types useful in plant cells are constantly being discovered, numerous examples may be found in the compilation by Okamuro et al. (1989). Typical regulated promoters useful in plants include but are not limited to safener-inducible promoters, promoters derived from the -24- Case S-50015A116/78/NAD 00 O tetracycline-inducible system, promoters derived from salicylate-inducible systems, promoters CN derived from alcohol-inducible systems, promoters derived from glucocorticoid-inducible system, promoters derived from pathogen-inducible systems, and promoters derived from y ecdysone-inducible systems.
"Tissue-specific promoter" refers to regulated promoters that are not expressed in al plant cells but only in one or more cell types in specific organs (such as leaves or seeds), specific tissues (such as embryo or cotyledon), or specific cell types (such as leaf parenchyma or seed storage cells). These also include promoters that are temporally regulated, such as in 00 early or late embryogenesis, during fruit ripening in developing seeds or fruit, in fully 0 differentiated leaf, or at the onset of senescence.
"Inducible promoter" refers to those regulated promoters that can be turned on in one or more cell types by an external stimulus, such as a chemical, light, hormone, stress, or a pathogen.
"Operably-linked" refers to the association of nucleic acid sequences on single nucleic acid fragment so that the function of one is affected by the other. For example, a regulatory DNA sequence is said to be "operably linked to" or "associated with" a DNA sequence that codes for an RNA or a polypeptide if the two sequences are situated such that the regulatory DNA sequence affects expression of the coding DNA sequence that the coding sequence or functional RNA is under the transcriptional control of the promoter). Coding sequences can be operably-linked to regulatory sequences in sense or antisense orientation.
"Expression" refers to the transcription and/or translation of an endogenous gene, ORF or portion thereof, or a transgene in plants. For example, in the case of antisense constructs, expression may refer to the transcription of the antisense DNA only. In addition, expression refers to the transcription and stable accumulation of sense (mRNA) or functional RNA.
Expression may also refer to the production of protein.
"Specific expression" is the expression of gene products which is limited to one or a few plant tissues (spatial limitation) and/or to one or a few plant developmental stages (temporal limitation). It is acknowledged that hardly a true specificity exist.s: promoters seem to be preferably switch on in some tissues, while in other tissues there can be no or only little activity. This phenomenon is known as leaky expression. However, with specific expression in this invention is meant preferable expression in one or a few plant tissues.
Case S-50015A/ 16/78/NAD The "expression pattern" of a promoter (with or without enhancer) is the pattern of N expression levels which shows where in the plant and in what developmental stage Stranscription is initiated by said promoter. Expression patterns of a set of promoters are said Sto be complementary when the expression pattern of one promoter shows little overlap with the expression pattern of the other promoter. The level of expression of a promoter can be determined by measuring the 'steady state' concentration of a standard transcribed reporter mRNA. This measurement is indirect since the concentration of the reporter mRNA is 0 dependent not only on its synthesis rate, but also on the rate with which the mRNA is 00 degraded. Therefore, the steady state level is the product of synthesis rates and degradation Srates.
c I The rate of degradation can however be considered to proceed at a fixed rate when the transcribed sequences are identical, and thus this value can serve as a measure of synthesis rates. When promoters are compared in this way techniques available to those skilled in the art are hybridization S -RNAse analysis, northern blots and competitive RT-PCR. This list of techniques in no way represents all available techniques, but rather describes commonly used procedures used to analyze transcription activity and expression levels of mRNA.
The analysis of transcription start points in practically all promoters has revealed that there is usually no single base at which transcription starts, but rather a more or less clustered set of initiation sites, each of which accounts for some start points of the mRNA. Since this distribution varies from promoter to promoter the sequences of the reporter mRNA in each of the populations would differ from each other. Since each mRNA species is more or less prone to degradation, no single degradation rate can be expected for different reporter mRNAs. It has been shown for various eukaryotic promoter sequences that the sequence surrounding the initiation site ('initiator') plays an important role in determining the level of RNA expression directed by that specific promoter. This includes also part of the transcribed sequences. The direct fusion of promoter to reporter sequences would therefore lead to suboptimal levels of transcription.
A commonly used procedure to analyze expression patterns and levels is through determination of the 'steady state' level of protein accumulation in a cell. Commonly used candidates for the reporter gene, known to those skilled in the art are 3-glucuronidase (GUS), -26- Case S-50015A/16/78/NAD 0 chloramphenicol acetyl transferase (CAT) and proteins with fluorescent properties, such as CK green fluorescent protein (GFP) from Aequora victoria. In principle, however, many more proteins are suitable for this purpose, provided the protein does not interfere with essential plant functions. For quantification and determination of localization a number of tools are suited. Detection systems can readily be created or are available which are based on, e.g., immunochemical, enzymatic, fluorescent detection and quantification. Protein levels can be Sdetermined in plant tissue extracts or in intact tissue using in situ analysis of protein expression.
00 Generally, individual transformed lines with one chimeric promoter reporter construct will vary in their levels of expression of the reporter gene. Also frequently observed is the phenomenon that such transformants do not express any detectable product (RNA or protein).
The variability in expression is commonly ascribed to 'position effects', although the molecular mechanisms underlying this inactivity are usually not clear.
The term "average expression" is used here as the average level of expression found in all lines that do express detectable amounts of reporter gene, so leaving out of the analysis plants that do not express any detectable reporter mRNA or protein.
"Root expression level" indicates the expression level found in protein extracts of complete plant roots. Likewise, leaf, and stem expression levels, are determined using whole extracts from leaves and stems. It is acknowledged however, that within each of the plant parts just described, cells with variable functions may exist, in which promoter activity may vary.
"Non-specific expression" refers to constitutive expression or low level, basal ('leaky') expression in nondesired cells or tissues from a 'regulated promoter'.
"Altered levels" refers to the level of expression in transgenic organisms that differs from that of normal or untransformed organisms.
"Overexpression" refers to the level of expression in transgenic cells or organisms that exceeds levels of expression in normal or untransformed (nontransgenic) cells or organisms.
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of suppressing the expression of protein from an endogenous gene or a transgene.
27 Case S-50015A/16/78/NAD S"Co-suppression" and "transwitch" each refer to the production of sense RNA C, transcripts capable of suppressing the expression of identical or substantially similar transgene or endogenous genes Patent No. 5,231,020).
"Gene silencing" refers to homology-dependent suppression of viral genes, transgenes, or endogenous nuclear genes. Gene silencing may be transcriptional, when the suppression is due to decreased transcription of the affected genes, or post-transcriptional, when the suppression is due to increased turnover (degradation) of RNA species homologous to the 0 affected genes (English et al., 1996). Gene silencing includes virus-induced gene silencing o, (Ruiz et al. 1998).
"Silencing suppressor" gene refers to a gene whose expression leads to counteracting gene silencing and enhanced expression of silenced genes. Silencing suppressor genes may be of plant, non-plant, or viral origin. Examples include, but are not limited to HC-Pro, PI-HC- Pro, and 2b proteins. Other examples include one or more genes in TGMV-B genome.
The terms "heterologous DNA sequence," "exogenous DNA segment" or "heterologous nucleic acid," as used herein, each refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling.
The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides. A "homologous" DNA sequence is a DNA sequence that is naturally associated with a host cell into which it is introduced.
"Homologous to" in the context of nucleotide sequence identity refers to the similarity between the nucleotide sequence of two nucleic acid molecules or between the amino acid sequences of two protein molecules. Estimates of such homology are provided by either DNA-DNA or DNA-RNA hybridization under conditions of stringency as is well understood by those skilled in the art (as described in Haines and Higgins Nucleic Acid Hybridization, IRL Press, Oxford, or by the comparison of sequence similarity between two nucleic acids or proteins.
28- Case S-50015A/16/78/NAD O The term "substantially similar" refers to nucleotide and amino acid sequences that Srepresent functional andlor structural equivalents of Arabidopsis sequences disclosed herein.
For example, altered nucleotide sequences which simply reflect the degeneracy of the genetic Scode but nonetheless encode amino acid sequences that are identical to a particular amino acid sequence are substantially similar to the particular sequences. In addition, amino acid sequences that are substantially similar to a particular sequence are those wherein overall amino acid identity is at least 65% or greater to the instant sequences. Modifications that Sresult in equivalent nucleotide or amino acid sequences are well within the routine skill in the 0O art. Moreover, the skilled artisan recognizes that equivalent nucleotide sequences encompassed by this invention can also be defined by their ability to hybridize, under low, Smoderate and/or stringent conditions 0.1X SSC, 0.1% SDS, 65 0 with the nucleotide sequences that are within the literal scope of the instant claims.
"Target gene" refers to a gene on the replicon that expresses the desired target coding sequence, functional RNA, or protein. The target gene is not essential for replicon replication.
Additionally, target genes may comprise native non-viral genes inserted into a non-native organism, or chimeric genes, and will be under the control of suitable regulatory sequences.
Thus, the regulatory sequences in the target gene may come from any source, including the virus. Target genes may include coding sequences that are either heterologous or homologous to the genes of a particular plant to be transformed. However, target genes do not include native viral genes. Typical target genes include, but are not limited to genes encoding a structural protein, a seed storage protein, a protein that conveys herbicide resistance, and a protein that conveys insect resistance. Proteins encoded by target genes are known as "foreign proteins". The expression of a target gene in a plant will typically produce an altered plant trait.
The term "altered plant trait" means any phenotypic or genotypic change in a transgenic plant relative to the wild-type or non-transgenic plant host.
"Transcription Stop Fragment" refers to nucleotide sequences that contain one or more regulatory signals, such as polyadenylation signal sequences, capable of terminating transcription. Examples include the 3' non-regulatory regions of genes encoding nopaline synthase and the small subunit of ribulose bisphosphate carboxylase.
-29- Case S-50015A/16/78/NAD 00
O
K "Replication gene" refers to a gene encoding a viral replication protein. In addition to Sthe ORF of the replication protein, the replication gene may also contain other overlapping or t non-overlapping ORF(s), as are found in viral sequences in nature. While not essential for replication, these additional ORFs may enhance replication and/or viral DNA accumulation.
Examples of such additional ORFs are AC3 and AL3 in ACMV and TGMV geminiviruses, respectively.
S"Chimeric trans-acting replication gene" refers either to a replication gene in which the 00 coding sequence of a replication protein is under the control of a regulated plant promoter other than that in the native viral replication gene, or a modified native viral replication gene, Sfor example, in which a site specific sequence(s) is inserted in the 5' transcribed but untranslated region. Such chimeric genes also include insertion of the known sites of replication protein binding between the promoter and the transcription start site that attenuate transcription of viral replication protein gene.
"Chromosomally-integrated" refers to the integration of a foreign gene or DNA construct into the host DNA by covalent bonds. Where genes are not "chromosomally integrated" they may be "transiently expressed." Transient expression of a gene refers to the expression of a gene that is not integrated into the host chromosome but functions independently, either as part of an autonomously replicating plasmid or expression cassette, for example, or as part of another biological system such as a virus.
"Production tissue" refers to mature, harvestable tissue consisting of non-dividing, terminally-differentiated cells. It excludes young, growing tissue consisting of germline, meristematic, and not-fully-differentiated cells.
"Germline cells" refer to cells that are destined to be gametes and whose genetic material is heritable.
"Trans-activation" refers to switching on of gene expression or replicon replication by the expression of another (regulatory) gene in trans.
The term "transformation" refers to the transfer of a nucleic acid fragment into the genome of a host cell, resulting in genetically stable inheritance. Host cells containing the transformed nucleic acid fragments are referred to as "transgenic" cells, and organisms comprising transgenic cells are referred to as "transgenic organisms". Examples of methods of Case S-50015A/16/7S/N
AD
O transformation of plants and plant cells include Agrobacterium-mediated transformation (De SBlaere et al., 1987) and particle bombardment technology (Klein et al. 1987; U.S. Patent No.
1) 4,945,050). Whole plants may be regenerated from transgenic cells by methods well known to the skilled artisan (see, for example, Fromm et al., 1990).
"Transformed," "transgenic," and "recombinant" refer to a host organism such as a bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome generally known in the art and 8 are disclosed in Sambrook et al., 1989. See also Innis et al., 1995 and Gelfand, 1995; and 0 Innis and Gelfand, 1999. Known methods of PCR include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific Sprimers, vector-specific primers, partially mismatched primers, and the like. For example, "transformed," "transformant," and "transgenic" plants or calli have been through the transformation process and contain a foreign gene integrated into their chromosome. The term "untransformed" refers to normal plants that have not been through the transformation process.
"Transiently transformed" refers to cells in which transgenes and foreign DNA have been introduced (for example, by such methods as Agrobacterium-mediated transformation or biolistic bombardment), but not selected for stable maintenance.
"Stably transformed" refers to cells that have been selected and regenerated on a selection media following transformation.
"Transient expression" refers to expression in cells in which a virus or a transgene is introduced by viral infection or by such methods as Agrobacterium-mediated transformation, electroporation, or biolistic bombardment, but not selected for its stable maintenance.
"Genetically stable" and "heritable" refer to chromosomally-integrated genetic elements that are stably maintained in the plant and stably inherited by progeny through successive generations.
"Primary transformant" and "TO generation" refer to transgenic plants that are of the same genetic generation as the tissue which was initially transformed not having gone through meiosis and fertilization since transformation).
"Secondary transformants" and the "TI, T2, T3, etc. generations" refer to transgenic plants derived from primary transformants through one or more meiotic and fertilization cycles.
-31 Case S-50015Ai16/78[NAD C They may be derived by self-fertilization of primary or secondary transformants or crosses of N primary or secondary transformants with other transformed or untransformed plants.
"Wild-type" refers to a virus or organism found in nature without any known mutation.
"Genome" refers to the complete genetic material of an organism.
The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form, composed of monomers (nucleotides) containing a sugar, phosphate and a base which is either a purine or pyrimidine. Unless Sspecifically limited, the term encompasses nucleic acids containing known analogs of natural 0 nucleotides which have similar binding properties as the reference nucleic acid and are 0 metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, Sa particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof degenerate codon substitutions) and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., 1991; Ohtsuka et al., 1985; Rossolini et al. 1994). A "nucleic acid fragment" is a fraction of a given nucleic acid molecule. In higher plants, deoxyribonucleic acid (DNA) is the genetic material while ribonucleic acid (RNA) is involved in the transfer of information contained within DNA into proteins. The term "nucleotide sequence" refers to a polymer of DNA or RNA which can be single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases capable of incorporation into DNA or RNA polymers. The terms "nucleic acid" or "nucleic acid sequence" may also be used interchangeably with gene, cDNA, DNA and RNA encoded by a gene.
The invention encompasses isolated or substantially purified nucleic acid or protein compositions. In the context of the present invention, an "isolated" or "purified" DNA molecule or an "isolated" or "purified" polypeptide is a DNA molecule or polypeptide that, by the hand of man, exists apart from its native environment and is therefore not a product of nature. An isolated DNA molecule or polypeptide may exist in a purified form or may exist in a non-native environment such as, for example, a transgenic host cell. For example, an "isolated" or "purified" nucleic acid molecule or protein, or biologically active portion thereof, is substantially free of other cellular material, or culture medium when produced by -32- Case S-50015A/16/78/NAD 00 recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Preferably, an "isolated" nucleic acid is free of sequences (preferably protein encoding sequences) that naturally flank the nucleic acid sequences located at the C 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 0 nucleic acid is derived. A protein that is substantially free of cellular material includes ,I preparations of protein or polypeptide having less than about 30%, 20%, 10%, (by dry 00 0 weight) of contaminating protein. When the protein of the invention, or biologically active C'i portion thereof, is recombinantly produced, preferably culture medium represents less than about 30%, 20%, 10%, or 5% (by dry weight) of chemical precursors or non-protein of interest chemicals.
The nucleotide sequences of the invention include both the naturally occurring sequences as well as mutant (variant) forms. Such variants will continue to possess the desired activity, either promoter activity or the activity of the product encoded by the open reading frame of the non-variant nucleotide sequence.
Thus, by "variants" is intended substantially similar sequences. For nucleotide sequences comprising an open reading frame, variants include those sequences that, because of the degeneracy of the genetic code, encode the identical amino acid sequence of the native protein.
Naturally occurring allelic variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction (PCR) and hybridization techniques. Variant nucleotide sequences also include synthetically derived nucleotide sequences, such as those generated, for example, by using site-directed mutagenesis and for open reading frames, encode the native protein, as well as those that encode a polypeptide having amino acid substitutions relative to the native protein. Generally, nucleotide sequence variants of the invention will have at least 40, 50, 60, to 70%, e.g., preferably 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, to 79%, generally at least 80%, e.g., 81%-84%, at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, to 98% and 99% nucleotide sequence identity to the native (wild type or endogenous) nucleotide sequence.
-33 Case S-50015A/16/78/NAD S"Conservatively modified variations" of a particular nucleic acid sequence refers to those nucleic acid sequences that encode identical or essentially identical amino acid sequences, or where the nucleic acid sequence does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given polypeptide. For instance the codons CGT, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an arginine is specified by a codon, the codon can be altered to any of the S corresponding codons described without altering the encoded protein. Such nucleic acid oO variations are "silent variations" which are one species of "conservatively modified variations." 0 Every nucleic acid sequence described herein which encodes a polypeptide also describes every Spossible silent variation, except where otherwise noted. One of skill will recognize that each codon in a nucleic acid (except ATG, which is ordinarily the only codon for methionine) can be modified to yield a functionally identical molecule by standard techniques. Accordingly, each "silent variation" of a nucleic acid which encodes a polypeptide is implicit in each described sequence.
The nucleic acid molecules of the invention can be "optimized" for enhanced expression in plants of interest. See, for example, EPA 035472; WO 91/16432; Perlak et al., 1991; and Murray et al., 1989. In this manner, the open reading frames in genes or gene fragments can be synthesized utilizing plant-preferred codons. See, for example, Campbell and Gowri, 1990 for a discussion of host-preferred codon usage. Thus, the nucleotide sequences can be optimized for expression in any plant. It is recognized that all or any part of the gene sequence may be optimized or synthetic. That is, synthetic or partially optimized sequences may also be used. Variant nucleotide sequences and proteins also encompass sequences and protein derived from a mutagenic and recombinogenic procedure such as DNA shuffling. With such a procedure, one or more different coding sequences can be manipulated to create a new polypeptide possessing the desired properties. In this manner, lbraries of recombinant polynucleotides are generated from a population of related sequence polynucleotides comprising sequence regions that have substantial sequence identity and can be homologously recombined in vitro or in vivo. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer, 1994; Stemmer, 1994; Crameri et al., 1997; Moore et al., 1997; Zhang et al., 1997; Crameri et al., 1998; and U.S. Patent Nos. 5,605,793 and 5,837,458.
34- Case S-50015A/16/78/NAD o By "variant" polypeptide is intended a polypeptide derived from the native protein by deletion (so-called truncation) or addition of one or more amino acids to the N-terminal and/or C-terminal end of the native protein; deletion or addition of one or more amino acids at one or Smore sites in the native protein; or substitution of one or more amino acids at one or more sites in the native protein. Such variants may result from, for example, genetic polymorphism or from human manipulation. Methods for such manipulations are generally known in the art.
Thus, the polypeptides may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such manipulations are oO generally known in the art. For example, amino acid sequence variants of the polypeptides can 0 be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide sequence alterations are well known in the art. See, for example, Kunkel, 1985; Kunkel et al., 1987; U.
S. Patent No. 4,873,192; Walker and Gaastra, 1983 and the references cited therein. Guidance as to appropriate amino acid substitutions that do not affect biological activity of the protein of interest may be found in the model of Dayhoff et al. (1978). Conservative substitutions, such as exchanging one amino acid with another having similar properties, are preferred.
Individual substitutions deletions or additions that alter, add or delete a single amino acid or a small percentage of amino acids (typically less than more typically less than in an encoded sequence are "conservatively modified variations," where the alterations result in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. The following five groups each contain amino acids that are conservative substitutions for one another: Aliphatic: Glycine Alanine Valine Leucine Isoleucine Aromatic: Phenylalanine Tyrosine Tryptophan Sulfur-containing: Methionine Cysteine Basic: Arginine I, Lysine Histidine Acidic: Aspartic acid Glutamic acid Asparagine Glutamine See also, Creighton, 1984. In addition, individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids in an encoded sequence are also "conservatively modified variations." "Expression cassette" as used herein means a DNA sequence capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operably linked to the nucleotide sequence of interest which is operably linked to termination signals. It also typically comprises sequences required for proper translation of the 35 Case S-50015A/16/78/NAD C nucleotide sequence. The coding region usually codes for a protein of interest but may also N code for a functional RNA of interest, for example antisense RNA or a nontranslated RNA, in the sense or antisense direction. The expression cassette comprising the nucleotide sequence of Sinterest may be chimeric, meaning that at least one of its components is heterologous with respect to at least one of its other components. The expression cassette may also be one which is naturally occurring but has been obtained in a recombinant form useful for heterologous Sexpression. The expression of the nucleotide sequence in the expression cassette may be under the control of a constitutive promoter or of an inducible promoter which initiates transcription OO only when the host cell is exposed to some particular external stimulus. In the case of a multicellular organism, the promoter can also be specific to a particular tissue or organ or stage of development.
"Vector" is defined to include, inter alia, any plasmid, cosmid, phage or Agrobacterium binary vector in double or single stranded linear or circular form which may or may not be self transmissible or mobilizable, and which can transform prokaryotic or eukaryotic host either by integration into the cellular genome or exist extrachromosomally autonomous replicating plasmid with an origin of replication).
Specifically included are shuttle vectors by which is meant a DNA vehicle capable, naturally or by design, of replication in two different host organisms, which may be selected from actinomycetes and related species, bacteria and eukaryotic higher plant, mammalian, yeast or fungal cells).
Preferably the nucleic acid in the vector is under the control of, and operably linked to, an appropriate promoter or other regulatory elements for transcription in a host cell such as a microbial, e.g. bacterial, or plant cell. The vector may be a bi-functional expression vector which functions in multiple hosts. In the case of genomic DNA, this may contain its own promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
"Cloning vectors" typically contain one or a small number of restriction endonuclease recognition sites at which foreign DNA sequences can be inserted in a determinable fashion without loss of essential biological function of the vector, as well as a marker gene that is suitable for use in the identification and selection of cells transformed with the cloning vector.
36- Case S-50015A/16/78/NAD 0 Marker genes typically include genes that provide letracyclne resistance, hygromycin 1 resistance or ampicillin resistance.
A "transgenic plant" is a plant having one or more plant cells that contain an expression vector.
"Plant tissue" includes differentiated and undifferentiated tissues or plants, including but not limited to roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of cels and culture such as single cells, protoplast, embryos, and callus tissue. The plant tissue Smay be in plants or in organ, tissue or cell culture.
OO The following terms are used to describe the sequence relationships between two or 0 more nucleic acids or polynucleotides: "reference sequence", "comparison window", (c) "sequence identity", "percentage of sequence identity", and "substantial identity".
As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full length cDNA or gene sequence, or the complete cDNA or gene sequence.
As used herein, "comparison window" makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well known in the art. Thus, the determination of percent identity between any two sequences can be accomplished using a mathematical algorithm. Preferred, non-limiting examples of such mathematical algorithms are the algorithm of Myers and Miller, 1988; the local homology algorithm of Smith et al. 1981; the homology alignment algorithm of Needleman and Wunsch 1970; the search-for-similaritymethod of Pearson and Lipman 1988; the algorithm of Karlin and Altschul, 1990, modified as in Karlin and Altschul, 1993.
37- Case S-50015A116/78/NAD Computer implementations of these mathematical algorithms can be utilized for C\K comparison of sequences to determine sequence identity. Such implementations include, but Sare not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, California); the ALIGN program (Version 2.0) and GAP, BESTFIT, BLAST, SFASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 Science Drive, Madison, Wisconsin, USA).
't Alignments using these programs can be performed using the default parameters. The O CLUSTAL program is well described by Higgins et al. 1988; Higgins et al. 1989; Corpet et al.
00 1988; Huang et al. 1992; and Pearson et al. 1994. The ALIGN program is based on the algorithm of Myers and Miller, supra. The BLAST programs of Altschul et al., 1990, are based on the algorithm of Karlin and Altschul supra.
Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., 1990). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always 0) and N (penalty score for mismatching residues; always For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when the cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below due to the accumulation of one or more negative-scoring residue alignments, or the end of either sequence is reached.
In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, Karlin Altschul (1993). One measure of similarity provided by the BLAST algorithm is the smallest sum probability which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic 38- Case S-50015A/16/78/NAD O acid sequence is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid sequence to the reference nucleic acid sequence is less than Sabout 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST can be utilized as described in Altschul et al. 1997. Alternatively, PSI-BLAST (in BLAST can be used to perform an iterated search that detects distant relationships between molecules. See Altschul et al., supra. When utilizing BLAST, Gapped BLAST, PSI-BLAST, g the default parameters of the respective programs BLASTN for nucleotide sequences, 00 BLASTX for proteins) can be used. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength of 11, an expectation of 10, a cutoff of 100, M=5, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, an expectation of 10, and the BLOSUM62 scoring matrix (see Henikoff Henikoff, 1989). See http://www.ncbi.nlm.nih.gov. Alignment may also be performed manually by inspection.
For purposes of the present invention, comparison of nucleotide sequences for determination of percent sequence identity to the promoter sequences disclosed herein is preferably made using the BlastN program (version 1.4.7 or later) with its default parameters or any equivalent program. By "equivalent program" is intended any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide or amino acid residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by the preferred program.
As used herein, "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties charge or hydrophobicity) and therefore do not change the functional properties of the molecule. When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have 39- Case S-50015A/16/78/NAD C "sequence similarity" or "similarity." Means for making this adjustment are well known to C those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for tf example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, as implemented in the program PC/GENE (Intelligenetics, Mountain View, California).
As used herein, "percentage of sequence identity" means the value determined by O0 comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity.
The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, or 79%, preferably at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, or 89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, and most preferably at least 96%, 97%, 98%, or 99% sequence identity, compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning, and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 70%, more preferably at least 80%, 90%, and most preferably at least Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions (see below). Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point for the Case S-50015A16/78/NAD 0 specific sequence at a defined ionic strength and pH. However, stringent conditions encompass temperatures in the range of about 1°C to about 20 0 C, depending upon the desired degree of stringency as otherwise qualified herein. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides they encode are substantially identical. This may occur, when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two Snucleic acid sequences are substantially identical is when the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the polypeptide encoded by the second 00 nucleic acid.
S(e)(ii) The term "substantial identity" in the context of a peptide indicates that a peptide comprises a sequence with at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, or 79%, preferably 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, or 89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, or even more preferably, 95%, 96%, 97%, 98% or 99%, sequence identity to the reference sequence over a specified comparison window.
Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman and Wunsch (1970). An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide. Thus, a peptide is substantially identical to a second peptide, for example, where the two peptides differ only by a conservative substitution.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
As noted above, another indication that two nucleic acid sequences are substantially identical is that the two molecules hybridize to each other under stringent conditions. The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture total cellular) DNA or RNA. "Bind(s) -41 Case S-50015A/16/78NAD 0 substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the j stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.
"Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern hybridization are sequence dependent, and are different under different environmental parameters. The Tm is the temperature (under defined ionic strength and pH) at which 50% of 00 the target sequence hybridizes to a perfectly matched probe. Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl, 1984; T, 81.5C 16.6 (log M) +0.41 0.61 form) 500/L; where M is the molarity of monovalent cations, %GC is the percentage of guanosine and cytosine nucleotides in the DNA, form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. Tm is reduced by about 1°C for each 1% of mismatching; thus, hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with identity are sought, the Tm can be decreased 10°C. Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point I for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4°C lower than the thermal melting point I; moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or lower than the thermal melting point 1; low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20°C lower than the thermal melting point I. Using the equation, hybridization and wash compositions, and desired T, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a T of less than 45 0
C
(aqueous solution) or 32°C.(formamide solution), it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, 1993. Generally, highly stringent -42- Case S-50015A/16/78/NAD 0 hybridization and wash conditions are selected to be about 5°C lower than the thermal melting N point Tm for the specific sequence at a defined ionic strength and pH.
SAn example of highly stringent wash conditions is 0.15 M NaCI at 72°C for about minutes. An example of stringent wash conditions is a 0.2X SSC wash at 65 0 C for 15 minutes (see, Sambrook, infra, for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium Sstringency wash for a duplex of, more than 100 nucleotides, is IX SSC at 45°C for Sminutes. An example low stringency wash for a duplex of, more than 100 nucleotides, is C N 4-6X SSC at 40 0 C for 15 minutes. For short probes about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.5 M, more preferably about 0.01 to 1.0 M, Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30°C and at least about 60 0 C for long robes nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2X (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or Northern blot is formamide, hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0. IX SSC at 60 to 65 0 C. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulphate) at 37 0 C, and a wash in IX to 2X SSC (20X SSC 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 0 C. Exemplary moderate stringency conditions include hybridization in 40 to formamide, 1.0 M NaCI, 1% SDS at 37 0 C, and a wash in 0.5X to IX SSC at 55 to The following are examples of sets of hybridization/wash conditions that may be used to clone orthologous nucleotide sequences that are substantially identical to reference nucleotide -43 Case S-50015A/16/78/NAD 0 sequences of the present invention: a reference nucleotide sequence preferably hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM 1) EDTA at 50 0 C with washing in 2X SSC, 0.1% SDS at 50 0 C, more desirably in 7% sodium Sdodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in IX SSC, 0.1% SDS at 50 0 C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in 0.5X SSC, 0.1% SDS at 50 0 C, preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM EDTA at 50 0 C with washing in 0.1X SSC, 0.1% O SDS at 50°C, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO 4 1 mM 00 EDTA at 50 0 C with washing in 0.1X SSC, 0.1% SDS at 65 0
C.
"DNA shuffling" is a method to introduce mutations or rearrangements, preferably randomly, in a DNA molecule or to generate exchanges of DNA sequences between two or more DNA molecules, preferably randomly. The DNA molecule resulting from DNA shuffling is a shuffled DNA molecule that is a non-naturally occurring DNA molecule derived from at least one template DNA molecule. The shuffled DNA preferably encodes a variant polypeptide modified with respect to the polypeptide encoded by the template DNA, and may have an altered biological activity with respect to the polypeptide encoded by the template DNA.
"Recombinant DNA molecule' is a combination of DNA sequences that are joined together using recombinant DNA technology and procedures used to join together DNA sequences as described, for example, in Sambrook et al., 1989.
The word "plant" refers to any plant, particularly to seed plant, and "plant cell" is a structural and physiological unit of the plant, which comprises a cell wall but may also refer to a protoplast. The plant cell may be in form of an isolated single cell or a cultured cell, or as a part of higher organized unit such as, for example, a plant tissue, or a plant organ.
"Significant increase" is an increase that is larger than the margin of error inherent in the measurement technique, preferably an increase by about 2-fold or greater.
"Significantly less" means that the decrease is larger than the margin of error inherent in the measurement technique, preferably a decrease by about 2-fold or greater.
Virtually any DNA composition may be used for delivery to recipient plant cells, e.g., monocotyledonous cells, to ultimately produce fertile transgenic plants in accordance with the -44- Case S-50015A/16/78/NAD O present invention. For example, DNA segments in the form of vectors and plasmids, or linear SDNA fragments, in some instances containing only the DNA element to be expressed in the plant, and the like, may be employed. The construction of vectors which may be employed in t conjunction with the present invention will be known to those of skill of the art in light of the present disclosure (see, Sambrook et al., 1989; Gelvin et al., 1990).
Vectors, plasmids, cosmids, YACs (yeast artificial chromosomes), BACs (bacterial artificial chromosomes) and DNA segments for use in transforming such cells will, of course, 8 generally comprise the cDNA, gene or genes which one desires to introduce into the cells.
00 These DNA constructs can further include structures such as promoters, enhancers, polylinkers, or even regulatory genes as desired. The DNA segment or gene chosen for cellular introduction will often encode a protein which will be expressed in the resultant recombinant cells, such as will result in a screenable or selectable trait and/or which will impart an improved phenotype to the regenerated plant. However, this may not always be the case, and the present invention also encompasses transgenic plants incorporating non-expressed transgenes.
In certain embodiments, it is contemplated that one may wish to employ replicationcompetent viral vectors in monocot transformation. Such vectors include, for example, wheat dwarf virus (WDV) "shuttle" vectors, such as pWl-11 and PW1-GUS (Ugaki et al., 1991).
These vectors are capable of autonomous replication in maize cells as well as E. coli, and as such may provide increased sensitivity for detecting DNA delivered to transgenic cells. A replicating vector may also be useful for delivery of genes flanked by DNA sequences from transposable elements such as Ac, Ds, or Mu. It has been proposed (Laufs et al., 1990) that transposition of these elements within the maize genome requires DNA replication. It is also contemplated that transposable elements would be useful for introducing DNA fragments lacking elements necessary for selection and maintenance of the plasmid vector in bacteria, antibiotic resistance genes and origins of DNA replication. It is also proposed that use of a transposable element such as Ac, Ds, or Mu would actively promote integration of the desired DNA and hence increase the frequency of stably transformed cells. The use of a transposable element such as Ac, Ds, or Mu may actively promote integration of the DNA of interest and hence increase the frequency of stably transformed cells. Transposable elements may be useful to allow separation of genes of interest from elements necessary for selection Case S-50015A/16/78/NAD O and maintenance of a plasmid vector in bacteria or selection of a transformant. By use of a C transposable element, desirable and undesirable DNA sequences may be transposed apart from each other in the genome, such that through genetic segregation in progeny, one may identify t.t plants with either the desirable or the undesirable DNA sequences.
DNA useful for introduction into plant cells includes that which has been derived or isolated from any source, that may be subsequently characterized as to structure, size and/or function, chemically altered, and later introduced into plants. An example of DNA "derived" Sfrom a source, would be a DNA sequence that is identified as a useful fragment within a given 00 organism, and which is then chemically synthesized in essentially pure form. An example of such DNA "isolated" from a source would be a useful DNA sequence that is excised or removed from said source by chemical means, by the use of restriction endonucleases, so that it can be further manipulated, amplified, for use in the invention, by the methodology of genetic engineering. Such DNA is commonly referred to as "recombinant
DNA."
Therefore useful DNA includes completely synthetic DNA, semi-synthetic DNA, DNA isolated from biological sources, and DNA derived from introduced RNA. Generally, the introduced DNA is not originally resident in the plant genotype which is the recipient of the DNA, but it is within the scope of the invention to isolate a gene from a given plant genotype, and to subsequently introduce multiple copies of the gene into the same genotype, to enhance production of a given gene product such as a storage protein or a protein that confers tolerance or resistance to water deficit.
The introduced DNA includes but is not limited to, DNA from plant genes, and nonplant genes such as those from bacteria, yeasts, animals or viruses. The introduced DNA can include modified genes, portions of genes, or chimeric genes, including genes from the same or different maize genotype. The term "chimeric gene" or "chimeric DNA" is defined as a gene or DNA sequence or segment comprising at least two DNA sequences or segments from species which do not combine DNA under natural conditions, or which DNA sequences or segments are positioned or linked in a manner which does not normally occur in the native genome of untransformed plant.
The introduced DNA used for transformation herein may be circular or linear, doublestranded or single-stranded. Generally, the DNA is in the form of chinmeric DNA, such as plasmid DNA, that can also contain coding regions flanked by regulatory sequences which -46- Case S-50015A/16/78/NAD 0 promote the expression of the recombinant DNA present in the resultant plant. For example, C1 the DNA may itself comprise or consist of a promoter that is active in a plant which is derived from a source other than that plant, or may utilize a promoter already present in a plant I genotype that is the transformation target.
Generally, the introduced DNA will be relatively small, less than about 30 kb to Sminimize any susceptibility to physical, chemical, or enzymatic degradation which is known to increase as the size of the DNA increases. As noted above, the number of proteins, RNA 8 transcripts or mixtures thereof which is introduced into the plant genome is preferably 0 preselected and defined, from one to about 5-10 such products of the introduced
DNA
O may be formed.
Two principal methods for the control of expression are known, viz.: overexpression and underexpression. Overexpression can be achieved by insertion of one or more than one extra copy of the selected gene. It is, however, not unknown for plants or their progeny, originally transformed with one or more than one extra copy of a nucleotide sequence, to exhibit the effects of underexpression as well as overexpression. For underexpression there are two principle methods which are commonly referred to in the art as "antisense downregulation" and "sense downregulation" (sense downregulation is also referred to as "cosuppression"). Generically these processes are referred to as "gene silencing". Both of these methods lead to an inhibition of expression of the target gene.
Obtaining sufficient levels of transgene expression in the appropriate plant tissues is an important aspect in the production of genetically engineered crops. Expression of heterologous DNA sequences in a plant host is dependent upon the presence of an operably linked promoter that is functional within the plant host. Choice of the promoter sequence will determine when and where within the organism the heterologous DNA sequence is expressed.
Furthermore, it is contemplated that promoters combining elements from more than one promoter may be useful. For example, U.S. Patent No. 5,491,288 discloses combining a Cauliflower Mosaic Virus promoter with a histone promoter. Thus, the elements from the promoters disclosed herein may be combined with elements from other promoters.
Promoters which are useful for plant transgene expression include those that are inducible, viral, synthetic, constitutive (Odell et al., 1985), temporally regulated, spatially regulated, tissue-specific, and spatio-temporally regulated.
-47 Case S-5001 5A/16178/NAD 00 Where expression in specific tissues or organs is desired, tissue-specific promoters may N be used. In contrast, where gene expression in response to a stimulus is desired, inducible promoters are the regulatory elements of choice. Where continuous expression is desired r' throughout the cells of a plant, constitutive promoters are utilized. Additional regulatory sequences upstream and/or downstream from the core promoter sequence may be included in expression constructs of transformation vectors to bring about varying levels of expression of heterologous nucleotide sequences in a transgenic plant.
The choice of promoter will vary depending on the temporal and spatial requirements for 00 expression, and also depending on the target species. In some cases, expression in multiple tissues is desirable. While in others, tissue-specific, leaf-specific, expression is desirable.
Although many promoters from dicotyledons have been shown to be operational in monocotyledons and vice versa, ideally dicotyledonous promoters are selected for expression in dicotyledons, and monocotyledonous promoters for expression in monocotyledons.
However, there is no restriction to the provenance of selected promoters; it is sufficient that they are operational in driving the expression of the nucleotide sequences in the desired cell.
These promoters include, but are not limited to, constitutive, inducible, temporally regulated, developmentally regulated, spatially-regulated, chemically regulated, stressresponsive, tissue-specific, viral and synthetic promoters. Promoter sequences are known to be strong or weak. A strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a very low level of gene expression. An inducible promoter is a promoter that provides for the turning on and off of gene expression in response to an exogenously added agent, or to an environmental or developmental stimulus. A bacterial promoter such as the Ptac promoter can be induced to varying levels of gene expression depending on the level of isothiopropylgalactoside added to the transformed bacterial cells. An isolated promoter sequence that is a strong promoter for heterologous nucleic acid is advantageous because it provides for a sufficient level of gene expression to allow for easy detection and selection of transformed cells and provides for a high level of gene expression when desired.
Within a plant promoter region there are several domains that are necessary for full function of the promoter. The first of these domains lies immediately upstream of the structural gene and forms the "core promoter region" containing consensus sequences, -48- Case S-50015A/16/78/NAD 00 O normally 70 base pairs immediately upstream of the gene. The core promoter region contains CN the characteristic CAAT and TATA boxes plus surrounding sequences, and represents a transcription initiation sequence that defines the transcription start point for the structural gene.
The presence of the core promoter region defines a sequence as being a promoter: if the region is absent, the promoter is non-functional. Furthermore, the core promoter region is insufficient to provide full promoter activity. A series of regulatory sequences upstream of the C core constitute the remainder of the promoter. The regulatory sequences determine expression OO level, the spatial and temporal pattern of expression and, for an important subset of promoters, O expression under inductive conditions (regulation by external factors such as light, temperature, chemicals, hormones).
A range of naturally-occurring promoters are known to be operative in plants and have been used to drive the expression of heterologous (both foreign and endogenous) genes in plants: for example, the constitutive 35S cauliflower mosaic virus (CaMV) promoter, the ripening-enhanced tomato polygalacturonase promoter (Bird et al., 1988), the E8 promoter (Diekman Fischer, 1988) and the fruit specific 2A1 promoter (Pear et al., 1989) and many others, U2 and U5 snRNA promoters from maize, the promoter from alcohol dehydrogenase, the Z4 promoter from a gene encoding the Z4 22 kD zein protein, the promoter from a gene encoding a 10 kD zein protein, a Z27 promoter from a gene encoding a 27 kD zein protein, the A20 promoter from the gene encoding a 19 kD -zein protein, inducible promoters, such as the light inducible promoter derived from the pea rbcS gene and the actin promoter from rice, the actin 2 promoter (WO 00/70067); seed specific promoters, such as the phaseolin promoter from beans, may also be used. The nucleotide sequences of this invention can also be expressed under the regulation of promoters that are chemically regulated. This enables the nucleic acid sequence or encoded polypeptide to be synthesized only when the crop plants are treated with the inducing chemicals. Chemical induction of gene expression is detailed in EP 0 332 104 (to Ciba-Geigy) and U.S. Patent 5,614,395. A preferred promoter for chemical induction is the tobacco PR- a promoter.
Examples of some constitutive promoters which have been described include the rice actin 1 (Wang et al., 1992; U.S. Patent No. 5,641,876), CaMV 35S (Odell et al., 1985), CaMV 19S (Lawton et al., 1987), nos, Adh, sucrose synthase; and the ubiquitin promoters.
-49- Case S-50015A/6/781NAD SExamples of tissue specific promoters which have been described include the lectin N (Vodkin, 1983; Lindstrom et al., 1990) corn alcohol dehydrogenase 1 (Vogel et al., 1989; g Dennis et al., 1984), corn light harvesting complex (Simpson, 1986; Bansal et al., 1992), corn heat shock protein (Odell et al., 1985), pea small subunit RuBP carboxylase (Poulsen et al., 1986), Ti plasmid mannopine synthase (Langridge et al., 1989), Ti plasmid nopaline synthase (Langridge et al., 1989), petunia chalcone isomerase (vanTunen et al., 1988), bean glycine rich protein 1 (Keller et al., 1989), truncated CaMV 35s (OdeU et al., 1985), potato patatin (Wenzler et al., 1989), root cell (Yamamoto et al., 1990), maize zein (Reina et al., 1990; Kriz Set al., 1987; Wandelt et al., 1989; Langridge et al., 1983; Reina et al., 1990), globulin-1 0 (Belanger et al., 1991), c-tubulin, cab (Sullivan et al., 1989), PEPCase (Hudspeth Grula, 1989), R gene complex-associated promoters (Chandler et al., 1989), histone, and chalcone synthase promoters (Franken et al., 1991). Tissue specific enhancers are described in Fromm et al. (1989).
Inducible promoters that have been described include the ABA- and turgor-inducible promoters, the promoter of the auxin-binding protein gene (Schwob et al., 1993), the UDP glucose flavonoid glycosyl-transferase gene promoter (Ralston et al., 1988), the MPI proteinase inhibitor promoter (Cordero et al., 1994), and the glyceraldehyde-3-phosphate dehydrogenase gene promoter (Kohler et al., 1995; Quigley et al., 1989; Martinez et al., 1989).
Several other tissue-specific regulated genes and/or promoters have been reported in plants. These include genes encoding the seed storage proteins (such as napin, cruciferin, betaconglycinin, and phaseolin) zein or oil body proteins (such as oleosin), or genes involved in fatty acid biosynthesis (including acyl carrier protein, stearoyl-ACP desaturase. And fatty acid desaturases (fad and other genes expressed during embryo development (such as Bce4, see, for example, EP 255378 and Kridl et al., 1991). Particularly useful for seed-specific expression is the pea vicilin promoter (Czako et al., 1992). (See also U.S. Pat. No. 5,625,136, herein incorporated by reference.) Other useful promoters for expression in mature leaves are those that are switched on at the onset of senescence, such as the SAG promoter from Arabidopsis (Gan et al., 1995).
A class of fruit-specific promoters expressed at or during antithesis through fruit development, at least until the beginning of ripening, is discussed in U.S. 4,943,674. cDNA Case S-50015A/16/78/NAD Sclones that are preferentially expressed in cotton fiber have been isolated (John et al., 1992).
,I cDNA clones from tomato displaying differential expression during fruit development have been isolated and characterized (Mansson et al., 1985, Slater et al., 1985). The promoter for polygalacturonase gene is active in fruit ripening. The polygalacturonase gene is described in U.S. Patent No. 4,535,060, U.S. Patent No. 4,769,061, U.S. Patent No. 4,801,590, and U.S.
Patent No. 5,107,065, which disclosures are incorporated herein by reference.
Other examples of tissue-specific promoters include those that direct expression in leaf cells following damage to the leaf (for example, from chewing insects), in tubers (for example, 0 patatin gene promoter), and in fiber cells (an example of a developmentally-regulated fiber cell protein is E6 (John et al., 1992). The E6 gene is most active in fiber, although low levels of transcripts are found in leaf, ovule and flower.
The tissue-specificity of some "tissue-specific" promoters may not be absolute and may be tested by one skilled in the art using the diphtheria toxin sequence. One can also achieve tissue-specific expression with "leaky" expression by a combination of different tissue-specific promoters (Beals et al., 1997). Other tissue-specific promoters can be isolated by one skilled in the art (see U.S. 5,589,379). Several inducible promoters ("gene switches") have been reported. Many are described in the review by Gatz (1996) and Gatz (1997). These include tetracycline repressor system, Lac repressor system, copper-inducible systems, salicylateinducible systems (such as the PRIa system), glucocorticoid- (Aoyama et al., 1997) and ecdysone-inducible systems. Also included are the benzene sulphonamide- Patent No.
5,364,780) and alcohol-(WO 97/06269 and WO 97/06268) inducible systems and glutathione S-transferase promoters. Other studies have focused on genes inducibly regulated in response to environmental stress or stimuli such as increased salinity. Drought, pathogen and wounding.
(Graham et al., 1985; Graham et al., 1985, Smith et al., 1986). Accumulation of metalocarboxypeptidase-inhibitor protein has been reported in leaves of wounded potato plants (Graham et al., 1981). Other plant genes have been reported to be induced methyl jasmonate, elicitors, heat-shock, anaerobic stress, or herbicide safeners.
Regulated expression of the chimeric transacting viral replication protein can be further regulated by other genetic strategies. For example, Cre-mediated gene activation as described by Odell et al. 1990. Thus, a DNA fragment containing 3' regulatory sequence bound by lox sites between the promoter and the replication protein coding sequence that blocks the -51 Case S-50015A/16/78/NAD 0 expression of a chimeric replication gene from the promoter can be removed by Cre-mediated N excision and result in the expression of the trans-acting replication gene. In this case, the Schimeric Cre gene, the chimeric trans-acting replication gene, or both can be under the control of tissue- and developmental- specific or inducible promoters. An alternate genetic strategy is the use of tRNA suppressor gene. For example, the regulated expression of a tRNA suppressor gene can conditionally control expression of a trans-acting replication protein coding sequence containing an appropriate termination codon as described by Ulmasov et al.
1997. Again, either the chimeric tRNA suppressor gene, the chimeric transacting replication 00 gene, or both can be under the control of tissue- and developmental-specific or inducible 0 promoters.
Frequently it is desirable to have continuous or inducible expression of a DNA sequence throughout the cells of an organism in a tissue-independent manner. For example, increased resistance of a plant to infection by soil- and airborne-pathogens might be accomplished by genetic manipulation of the plant's genome to comprise a continuous promoter operably linked to a heterologous pathogen-resistance gene such that pathogenresistance proteins are continuously expressed throughout the plant's tissues.
Alternatively, it might be desirable to inhibit expression of a native DNA sequence within a plant's tissues to achieve a desired phenotype. In this case, such inhibition might be accomplished with transformation of the plant to comprise a constitutive, tissue-independent promoter operably linked to an antisense nucleotide sequence, such that constitutive expression of the antisense sequence produces an RNA transcript that interferes with translation of the mRNA of the native DNA sequence.
To define a minimal promoter region, a DNA segment representing the promoter region is removed from the 5' region of the gene of interest and operably linked to the coding sequence of a marker (reporter) gene by recombinant DNA techniques well known to the art.
The reporter gene is operably linked downstream of the promoter, so that transcripts initiating at the promoter proceed through the reporter gene. Reporter genes generally encode proteins which are easily measured, including, but not limited to, chloramphenicol acetyl transferase (CAT), beta-glucuronidase (GUS), green fluorescent protein (GFP), beta-galactosidase beta- GAL), and luciferase.
-52- Case S-50015A/16/78/NAD 0 The construct containing the reporter gene under the control of the promoter is then N introduced into an appropriate cell type by transfection techniques well known to the art. To assay for the reporter protein, cell lysates are prepared and appropriate assays, which are well known in the art, for the reporter protein are performed. For example, if CAT were the reporter gene of choice, the lysates from cells transfected with constructs containing CAT under the control of a promoter under study are mixed with isotopically labeled Schloramphenicol and acetyl-coenzyme A (acetyl-CoA). The CAT enzyme transfers the acetyl group from acetyl-CoA to the 2- or 3-position of chloramphenicol. The reaction is monitored OO by thin-layer chromatography, which separates acetylated chloramphenicol from unreacted material. The reaction products are then visualized by autoradiography.
The level of enzyme activity corresponds to the amount of enzyme that was made, which in turn reveals the level of expression from the promoter of interest. This level of expression can be compared to other promoters to determine the relative strength of the promoter under study. In order to be sure that the level of expression is determined by the promoter, rather than by the stability of the mRNA, the level of the reporter mRNA can be measured directly, such as by Northern blot analysis.
Once activity is detected, mutational and/or deletional analyses may be employed to determine the minimal region and/or sequences required to initiate transcription. Thus, sequences can be deleted at the 5' end of the promoter region and/or at the 3' end of the promoter region, and nucleotide substitutions introduced. These constructs are then introduced to cells and their activity determined.
In one embodiment, the promoter may be a gamma zein promoter, an oleosin olel6 promoter, a globulinI promoter, an actin I promoter, an actin cl promoter, a sucrose synthetase promoter, an INOPS promoter, an EXM5 promoter, a globulin2 promoter, a b-32, ADPGpyrophosphorylase promoter, an Ltpl promoter, an Ltp2 promoter, an oleosin olel7 promoter, an oleosin olel 8 promoter, an actin 2 promoter, a pollen-specific protein promoter, a pollenspecific pectate lyase promoter, an anther-specific protein promoter, an anther-specific gene RTS2 promoter, a pollen- specific gene promoter, a tapetum-specific gene promoter, tapetumspecific gene RAB24 promoter, a anthranilate synthase alpha subunit promoter, an alpha zein promoter, an anthranilate synthase beta subunit promoter, a dihydrodipicolinate synthase promoter, a Thil promoter, an alcohol dehydrogenase promoter, a cab binding protein -53- Case S-50015A/16/78/NAD O promoter, an H3C4 promoter, a RUBISCO SS starch branching enzyme promoter, an ACCase Spromoter, an actin3 promoter, an actin7 promoter, a regulatory protein GF14-12 promoter, a Sribosomal protein L9 promoter, a cellulose biosynthetic enzyme promoter, an S-adenosyl-Lt" homocysteine hydrolase promoter, a superoxide dismutase promoter, a C-kinase receptor promoter, a phosphoglycerate mutase promoter, a root-specific RCc3 mRNA promoter, a glucose-6 phosphate isomerase promoter, a pyrophosphate-fructose 6phosphatelphosphotransferase promoter, an ubiquitin promoter, a beta-ketoacyl-ACP synthase promoter, a 33 kDa photosystem 11 promoter, an oxygen evolving protein promoter, a 69 kDa 00 vacuolar ATPase subunit promoter, a metallothionein-like protein promoter, a glyceraldehyde- 3 -phosphate dehydrogenase promoter, an ABA- and ripening- inducible-like protein promoter, a phenylalanine ammonia lyase promoter, an adenosine triphosphatase S-adenosyl-Lhomocysteine hydrolase promoter, an a- tubulin promoter, a cab promoter, a PEPCase promoter, an R gene promoter, a lectin promoter, a light harvesting complex promoter, a heat shock protein promoter, a chalcone synthase promoter, a zein promoter, a globulin-1 promoter, an ABA promoter, an auxin-binding protein promoter, a UDP glucose flavonoid glycosyl-transferase gene promoter, an NTI promoter, an actin promoter, an opaque 2 promoter, a b70 promoter, an oleosin promoter, a CaMV 35S promoter, a CaMV 19S promoter, a histone promoter, a turgor-inducible promoter, a pea small subunit RuBP carboxylase promoter, a Ti plasmid mannopine synthase promoter, Ti plasmid nopaline synthase promoter, a petunia chalcone isomerase promoter, a bean glycine rich protein I promoter, a CaMV 35S transcript promoter, a potato patatin promoter, or a S-E9 small subunit RuBP carboxylase promoter.
In addition to promoters, a variety of 5N and 3N transcriptional regulatory sequences are also available for use in the present invention. Transcriptional terminators are responsible for the termination of transcription and correct mRNA polyadenylation. The 3N nontranslated regulatory DNA sequence preferably includes from about 50 to about 1,000, more preferably about 100 to about 1,000, nucleotide base pairs and contains plant transcriptional and translational termination sequences. Appropriate transcriptional terminators and those which are known to function in plants include the CaMV 35S terminator, the tml terminator, the nopaline synthase terminator, the pea rbcS E9 terminator, the terminator for the T7 transcript 54- Case S-50OI5A/16/78[/AD C from the octopine synthase gene of Agrobacterium iumefaciens, and the 3N end of the N protease inhibitor I or 11 genes from potato or tomato, although other 3N elements known to those of skill in the art can also be employed. Alternatively, one also could use a gamma coixin, oleosin 3 or other terminator from the genus Coix.
Preferred 3' elements include those from the nopaline synthase gene of Agrobacterium tumefaciens (Bevan et al., 1983), the terminator for the T7 transcript from the octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease inhibitor I or II 0 genes from potato or tomato.
0o As the DNA sequence between the transcription initiation site and the start of the O coding sequence, the untranslated leader sequence, can influence gene expression, one may also wish to employ a particular leader sequence. Preferred leader sequences are contemplated to include those which include sequences predicted to direct optimum expression of the attached gene, to include a preferred consensus leader sequence which may increase or maintain mRNA stability and prevent inappropriate initiation of translation. The choice of such sequences will be known to those of skill in the art in light of the present disclosure.
Sequences that are derived from genes that are highly expressed in plants will be most preferred.
Other sequences that have been found to enhance gene expression in transgenic plants include intron sequences from Adhl, bronze], actinl, actin 2 (WO 00/760067), or the sucrose synthase intron) and viral leader sequences from TMV, MCMV and AMV). For example, a number of non-translated leader sequences derived from viruses are known to enhance expression. Specifically, leader sequences from Tobacco Mosaic Virus (TMV), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing expression Gallie et al., 1987; Skuzeski et al., 1990). Other leaders known in the art include but are not limited to: Picoravirus leaders, for example, EMCV leader (Encephalomyocarditis 5 noncoding region) (Elroy-Stein et al., 1989); Potyvirus leaders, for example, TEV leader (Tobacco Etch Virus); MDMV leader (Maize Dwarf Mosaic Virus); Human immunoglobulin heavy-chain binding protein (BiP) leader, (Macejak et al., 1991); Untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA (Jobling et al., 1987; Tobacco mosaic virus leader (TMV), (Gallie et al., 1989; and Maize Case S-50015A/16/78/NAD 0 Chlorotic Mottle Virus leader (MCMV) (Lommel et al., 1991. See also, Della-Cioppa et al., (N 1987.
Regulatory elements such as Adh intron 1 (Callis et al., 1987), sucrose synthase intron S(Vasil et al., 1989) or TMV omega element (Gallie, et al., 1989), may further be included where desired.
Examples of enhancers include elements from the CaMV 35S promoter, octopine Ssynthase genes (Ellis el al., 1987), the rice actin I gene, the maize alcohol dehydrogenase gene (Callis et al., 1987), the maize shrunken I gene (Vasil et al., 1989), TMV Omega element 00 (Gallie et al., 1989) and promoters from non-plant eukaryotes yeast; Ma et al., 1988).
SVectors for use in accordance with the present invention may be constructed to include the ocs enhancer element. This element was first identified as a 16 bp palindromic enhancer from the octopine synthase (ocs) gene of ultilane (Ellis et al., 1987), and is present in at least other promoters (Bouchez et al., 1989). The use of an enhancer element, such as the ocs element and particularly multiple copies of the element, will act to increase the level of transcription from adjacent promoters when applied in the context of monocot transformation.
Ultimately, the most desirable DNA segments for introduction into for example a monocot genome may be homologous genes or gene families which encode a desired trait increased yield per acre) and which are introduced under the control of novel promoters or enhancers, etc., or perhaps even homologous or tissue specific root-, collar/sheath-, whorl-, stalk-, earshank-, kernel- or leaf-specific) promoters or control elements. Indeed, it is envisioned that a particular use of the present invention will be the targeting of a gene in a constitutive manner or a root-specific manner. For example, insect resistant genes may be expressed specifically in the whorl and collar/sheath tissues which are targets for the first and second broods, respectively, of ECB. Likewise, genes encoding proteins with particular activity against rootworm may be targeted directly to root tissues.
Vectors for use in tissue-specific targeting of genes in transgenic plants will typically include tissue-specific promoters and may also include other tissue-specific control elements such as enhancer sequences. Promoters which direct specific or enhanced expression in certain plant tissues will be known to those of skill in the art in light of the present disclosure. These include, for example, the rbcS promoter, specific for green tissue; the ocs, nos and mas promoters which have higher activity in roots or wounded leaf tissue; a truncated (-90 to +8) -56- Case S-50015A/16/78/NAD 0 35S promoter which directs enhanced expression in roots, an alpha-tubulin gene that directs CN expression in roots and promoters derived from zein storage protein genes which direct expression in endosperm. It is particularly contemplated that one may advantageously use the 16 bp ocs enhancer element from the octopine synthase (ocs) gene (Ellis et al., 1987; Bouchez et al., 1989), especially when present in multiple copies, to achieve enhanced expression in roots.
Tissue specific expression may be functionally accomplished by introducing a constitutively expressed gene (all tissues) in combination with an antisense gene that is 00 expressed only in those tissues where the gene product is not desired. For example, a gene 0 coding for the crystal toxin protein from B. thuringiensis (Bt) may be introduced such that it is expressed in all tissues using the 35S promoter from Cauliflower Mosaic Virus. Expression of an antisense transcript of the Bt gene in a maize kernel, using for example a zein promoter, would prevent accumulation of the Bt protein in seed. Hence the protein encoded by the introduced gene would be present in all tissues except the kernel.
Expression of some genes in transgenic plants will be desired only under specified conditions. For example, it is proposed that expression of certain genes that confer resistance to environmental stress factors such as drought will be desired only under actual stress conditions. It is contemplated that expression of such genes throughout a plants development may have detrimental effects. It is known that a large number of genes exist that respond to the environment. For example, expression of some genes such as rbcS, encoding the small subunit of ribulose bisphosphate carboxylase, is regulated by light as mediated through phytochrome. Other genes are induced by secondary stimuli. For example, synthesis of abscisic acid (ABA) is induced by certain environmental factors, including but not limited to water stress. A number of genes have been shown to be induced by ABA (Skriver and Mundy, 1990). It is also anticipated that expression of genes conferring resistance to insect predation would be desired only under conditions of actual insect infestation. Therefore, for some desired traits inducible expression of genes in transgenic plants will be desired.
Expression of a gene in a transgenic plant will be desired only in a certain time period during the development of the plant. Developmental timing is frequently correlated with tissue specific gene expression. For example, expression of zein storage proteins is initiated in the endosperm about 15 days after pollination.
57 Case S-50015A/16/78/N
AD
o Additionally, vectors may be constructed and employed in the intracellular targeting of a specific gene product within the cells of a transgenic plant or in directing a protein to the 1) extracellular environment. This will generally be achieved by joining a DNA sequence encoding a transit or signal peptide sequence to the coding sequence of a particular gene. The resultant transit, or signal, peptide will transport the protein to a particular intracellular, or extracellular destination, respectively, and will then be post-translationally removed. Transit or Ssignal peptides act by facilitating the transport of proteins through intracellular membranes, vacuole, vesicle, plastid and mitochondrial membranes, whereas signal peptides direct oO proteins through the extracellular membrane.
A particular example of such a use concerns the direction of a herbicide resistance Sgene, such as the EPSPS gene, to a particular organelle such as the chloroplast rather than to the cytoplasm. This is exemplified by the use of the rbcs transit peptide which confers plastidspecific targeting of proteins. In addition, it is proposed that it may be desirable to target certain genes responsible for male sterility to the mitochondria, or to target certain genes for resistance to phytopathogenic organisms to the extracellular spaces, or to target proteins to the vacuole.
By facilitating the transport of the protein into compartments inside and outside the cell, these sequences may increase the accumulation of gene product protecting them from proteolytic degradation. These sequences also allow for additional mRNA sequences from highly expressed genes to be attached to the coding sequence of the genes. Since mRNA being translated by ribosomes is more stable than naked mRNA, the presence of translatable mRNA in front of the gene may increase the overall stability of the mRNA transcript from the gene and thereby increase synthesis of the gene product. Since transit and signal sequences are usually post- translationally removed from the initial translation product, the use of these sequences allows for the addition of extra translated sequences that may not appear on the final polypeptide. Targeting of certain proteins may be desirable in order to enhance the stability of the protein Patent No. 5,545,818).
It may be useful to target DNA itself within a cell. For example, it may be useful to target introduced DNA to the nucleus as this may increase the frequency of transformation.
Within the nucleus itself it would be useful to target a gene in order to achieve site specific 58- Case S-50015A/ 16/78/NAD O integration. For example, it would be useful to have an gene introduced through CN transformation replace an existing gene in the cell.
SOther elements include those that can be regulated by endogenous or exogenous agents, r by zinc finger proteins, including naturally occurring zinc finger proteins or chimeric zinc finger proteins (see, U.S. Patent No. 5,789,538, WO 99/48909; WO 99/45132;
WO
98/53060; WO 98/53057; WO 98/53058; WO 00/23464; WO 95/19431; and WO 98/54311) or myb-like transcription factors. For example, a chimeric zinc finger protein may include amino acid sequences which bind to a specific DNA sequence (the zinc finger) and amino acid o sequences that activate GAL 4 sequences) or repress the transcription of the sequences 0 linked to the specific DNA sequence.
The invention relates to an isolated plant, Arabidopsis and rice, nucleic acid molecule, which directs the expression of linked nucleic acid fragment in a plant, in root or leaf or constitutively, as well as the corresponding open reading frame and encoded product.
The nucleic acid molecule, one which comprises a promoter can be used to overexpress a linked nucleic acid fragment so as to express a product in a constitutive or tissue-specific manner, or to alter the expression of the product, via the use of antisense vectors or by "knocking out" the expression of at least one genomic copy of the gene.
Preferred sources from which the nucleic acid molecules of the invention can be obtained or isolated include, but are not limited to, corn (Zea mays), Brassica sp. B.
napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago saliva), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea ultilane), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond 59- Case S-50015A16/7/NAD 00 S(Prunus am ygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, Sduckweed (Lemna), barley, vegetables, ornamentals, and conifers.
Duckweed (Lemna, see WO 00/07210) includes members of the family Lemnaceae. There are known four genera and 34 species of duckweed as follows: genus Lemna aequinocialis, L. disperma, L. ecuadoriensis, L. gibba, L. japonica, L. minor, L. miniscula, L. obscura, L. perpusilla, L. tenera, L. trisulca, L. turionifera, L.
valdiviana); genus Spirodela intermedia, S. polyrrhiza, S. punclata); genus Woffia (Wa.
SAngusta, Wa. Arrhiza, Wa. Australina, Wa. Borealis, Wa. Brasiliensis, Wa. Columbiana, Wa.
00 Elongala, Wa. Globosa, Wa. Microscopica, Wa. Neglecta) and genus Wofiella (WI. ultila, SW1. ultilane n, W1. gladiata, W1. ultila, WI. lingulara, WI. repunda, WI. rotunda, and W1.
neotropica). Any other genera or species of Lemnaceae, if they exist, are also aspects of the present invention. Lemna gibba, Lemna minor, and Lemna miniscula are preferred, with Lemna minor and Lemna miniscula being most preferred. Lemna species can be classified using the taxonomic scheme described by Landolt, Biosystematic Investigation on the Family of Duckweeds: The family of Lemnaceae A Monograph Study. Geobotanisches Institut ETH, Stiftung Rubel, Zurich (1986)).
Vegetables from which to obtain or isolate the nucleic acid molecules of the invention include, but are not limited to, tomatoes (Lycopersicon esculentum), lettuce Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber sativus), cantaloupe (C.
cantralupensis), and musk melon melo). Ornamentals from which to obtain or isolate the nucleic acid molecules of the invention include, but are not limited to, azalea (Rhododendron spp.), hydrangea (Macrophylla hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiara), Douglas-fir (Pseudorsuga menziesii); Western hemlock (Tsuga ultilane); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis Case S-50015A/16/78/NAD o nootkatensis). Leguminous plants from which the nucleic acid molecules of the invention can be isolated or obtained include, but are not limited to, beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, and the like. Legumes include, but are not limited to, Arachis, peanuts, Vicia, crown vetch, hairy vetch, adzuki bean, mung bean, and chickpea, Lupinus, e.g., lupine, trifolium, Phaseolus, common bean and lima bean, Pisum, field bean, SMelilotus, clover, Medicago, alfalfa, Lotus, trefoil, lens, lentil, and false 8 indigo. Preferred forage and turf grass from which the nucleic acid molecules of the invention 00 can be isolated or obtained for use in the methods of the invention include, but are not limited to, alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and redtop.
Other preferred sources of the nucleic acid molecules of the invention include Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, cauliflower, Brassica, broccoli, cabbage, ultilan sprouts, onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, ultilane, and zucchini.
Yet other sources of nucleic acid molecules are ornamental plants including, but not limited to, impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saint Paulia, Agertum, Amaranthus, Antihirrhinum, Aquilegia, Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossos, and Zinnia, and plants such as those shown in Table 1.
-61 Case S-50015A116/7gfNAD 00 00 Table I
FAMILY
LATIN I COMMON
I
MAP REFERENCES
LINKS
RESOURCES
NA-ME
NAME
Cucurbitaceael Cucumis Cucumber salivus CCumsIS Melon ittp \VCu curbit .org/ http://genome.
cornefl.edu/cg, melo Cit rullus lanatus Cucurbita Watermelon Squash summer pepo Cucurbital Squash maxitma winter Cucurbita I Pumpkin pnoschata [bu tternu t Total http://www.na I. usda.gov/rp dic/Map proi Solanaceae jLycopersiconj Tomato 15x BAC on variety j-enomecorne escu len turn Heinz 1706 order from I H.edu/solpene Clemson Genome center (www.genome.clemson.e a11.6x BACof L.
cheesman-ii (originates
S
hitrrx/arsgenomecorne B.edu/cgibin/WebAce/ 62 Case S-50015A116/78fNAD
FAMILY
LATIN
NAME
COMMON
NAME
MAP REFERENCES
RESOURCES
from J. Giovannoni) wcbace?db=s available from Clemson olgenes genome center tt enome.
(www.pgenome.clemfsofl.e cornell.edu/tg du) c/ EST collection from http://tgrc.ucd TIGR avis.eduf (www.tipgr.orp,/tdb/ig~lind ex html) EST collection from Clemsom Genome Center (www.genome.clemsofl.e TAG 99:254-271, 1999 (esculentum x penneli) TAG 89:1007-1013, 1994 (peruvianum) Plant Cell Reports 12:293-297, 1993 (RAPDs) Genetics 132:1141-1160, 1992 (potato x tomato) Genetics 120:1095-1105, 1988 (RFLP potato and tomato) 63 Case S-50015A116/78[NAD LATIN COMMON MAP REFERENCES
LINKS
FAMILY
NAME
NAME
RESOURCES
*Generics 115:387-393, 1986 (esculentum x pennelli isozyme and cDNAs) Capsicum IPepper ann uurn htrp ://neptune .netimages.cO m/-chile/sciefl ce htmld Capsicum Chile pepper frutescens Solanum Eggplant ,n elongefla (Nicotiana (Tobacco) tabacurn) (Solanum (Potato) tu b e r o s u m (Petunia x (Petunia) 1x BAG of Petunia hybrida hybrida hoin.
Ex E. Vilrn.) 7984 available from Clemson genome center (w ww. geno me. clem-son. edu) Total l.usda. r~ov/rp dic/Map proj/ I I t Brassicaceae Brassica Broccoli http.L//res._ag-.
64 Case S-50015A116/78/NAD 00 00
FAMILY
LATIN
NAME
oleracea L.
var. italica B rassica oleracec L.
var. capil ala B rassica rapa Br-assica oleracea L.
var. boirytis Raphan us sativus var.
niger
COMMON
NAME
MAP REFERENCES
RESOURCES
LINKS
ca/ecorc/cwm t/crucifer/tralt s/index. htm htip://geneous .cit.corneUl.ed u/cabbage! abo utcab.htmI CEabbage Chinese Cabbage Cauliflower Daikon (Brassica napus) (Oilseed rape) http://arsgenome.corne 1.edu/cgibinlebAce webace?db=b assicadb http://arsgenomecorne I i I Arabidopsis 12x and 6x BACs on Columbia strain available
I
65 Case S-50015A116/78/NAD 00 00 FAMILY LATIN COMMON MAP REFERENCES
LINKS
NAME NAME
RESOURCES
arm Clemson genomne center ll.edu/cgi- (www.pgenome.clemson.edu) bin/WebAce! webace?db=a Total http://www.ni I. usda. govfpg dic/Map) p)ro i Umbelliferae Daucus Carrot ca rota Compositae Lactuca Lettuce sat iva 1-elian thus (Sunflower) annuus Total Chenopodiace Spinacia Spinach ae oleracea (Beta (Sugar Beet) vulgaris) Total Legumidnosae Phaseolus Bean 4.3x BAC available from http://arsvulgaris Clermson genomne center genonecorne (www.pgenome.clemson.edu) I.edu/cgi- 66 Case S-50015A/16/78[NAD FN-MILY LATIN COMMON MAP REFERENCES
LINKS
NAME NAME
RESOURCES
bin/WebAce/ webace~db=b eangenes Pisum Pea sativufl (Glycine (Soybean) 7.5x and 7.9x BACs htti://arsmax) available from Clemson genomecorne genome center l.edu/cgjzi- (www.gen ome.clemson.edu) bin/WebAce! webace?db=s oybase Total http://www.nal.usda.pov/ppd ic/Map pro if Gramineae Zea mays Sweet Corn Novartis BACs for Mo17 and B73 have been donated to Clemson Genome Center (www. genome.clemson.edu) (Zea miays) (Field Corn) http://www.a, ron. missouri .e d u/mn I! Total http://www.nal.usda.pov/ppd ic/Map proi/ Liliaceae A Ilium cepa Onion I Leek 67 Case S-50015A/16/78/NAD 00 C FAMILY LATIN COMMON MAP REFERENCES
LINKS
NAME NAME
RESOURCES
(Garlic) (Asparagus) Total http://www.nal.usda.gov/ppd ic/Map proi/ (N 00 Preferred forage and turf grass nucleic acid sources for the nucleic acid molecules of the invention include, but are not limited to, alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and redtop. Yet other preferred sources include, but are not limited to, crop plants and in particular cereals (for example, corn, alfalfa, sunflower, rice, Brassica, canola, soybean, barley, soybean, sugarbeet, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, and the like), and even more preferably corn, rice and soybean.
According to one embodiment, the present invention is directed to a nucleic acid molecule comprising a nucleotide sequence isolated or obtained from any plant which encodes a polypeptide having, e.g. at least 70% amino acid sequence identity to a polypeptide encoded by a gene comprising any one of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579, and 693- 773, preferably any one of SEQ ID NOs: 536-579, more preferably of any one of SEQ ID Nos: 536; 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs thereof, SEQ ID NOs:825-875, which include the minimal promoter region.. Based on the Arabidopsis nucleic acid sequence of the present invention, orthologs may be identified or isolated from the genome of any desired organism, preferably from another plant, according to well known techniques based on their sequence similarity to the Arabidopsis nucleic acid sequences, hybridization, PCR or computer generated sequence comparisons. For example, all or a portion of a particular Arabidopsis nucleic acid sequence is used as a probe that selectively hybridizes to other gene sequences.present in a population of cloned genomic DNA fragments or cDNA fragments genomic or cDNA libraries) from a chosen source organism. Further, suitable genomic and cDNA libraries may be prepared from -68- Case S-50015A/16/78/NAD O any cell or tissue of an organism. Such techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, Sambrook et al., 1989) and amplification by PCR using oligonucleotide primers preferably corresponding to sequence domains Sconserved among related polypeptide or subsequences of the nucleotide sequences provided herein (see, Innis et al., 1990). These methods are particularly well suited to the isolation Sof gene sequences from organisms closely related to the organism from which the probe sequence is derived. The application of these methods using the Arabidopsis sequences as 8 probes is well suited for the isolation of gene sequences from any source organism, preferably 0O other plant species. In a PCR approach, oligonucleotide primers can be designed for use in PCR reactions to amplify corresponding DNA sequences from cDNA or genomic DNA extracted from any plant of interest. Methods for designing PCR primers and PCR cloning are generally known in the art.
In hybridization techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments genomic or cDNA libraries) from a chosen organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 32 p, or any other detectable marker. Thus, for example, probes for hybridization can be made by labeling synthetic oligonucleotides based on the sequence of the invention. Methods for preparation of probes for hybridization and for construction of cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook et al.
(1989). In general, sequences that hybridize to the sequences disclosed herein will have at least 40% to 50%, about 60% to 70% and even about 80% 85%, 90%, 95% to 98% or more identity with the disclosed sequences. That is, the sequence similarity of sequences may range, sharing at least about 40% to 50%, about 60% to 70%, and even about 80%, 85%, 90%, to 98% sequence similarity.
The nucleic acid molecules of the invention can also be identified by, for example, a search of known databases for genes encoding polypeptides having a specified amino acid sequence identity or DNA having a specified nucleotide sequence identity. Methods of alignment of sequences for comparison are well known in the art and are described hereins.
-69- Case S-50015A/16/78/NAD 00 For example, to identify orthologs of the sequences described herein, similarity N searches are carried out in databases using a BLAST (see above) algorithm followed by analysis using SCAN (the Sequence Comparison Analysis, program version 1 .k licensed from the Los Almos National Laboratories) software with added filters.
A rice database is searched (Table 14) as well as a database constructed from GenBank (Table 15). Using a PERL script, a subset of the GenBank database (GenBank version 123.0).
The database contains all of the plant translated regions from GenBank, with the exception of Arabidopsis thaliana sequences. In addition, the GenBank subset database retains annotation 0O from following fields: product, function, note, as well as protein and nucleotide accession 0 numbers and organisms.
The BLASTX search algorithm, which translates a query sequence in all six frames and then carries out a protein comparison, is selected to conduct the search. Queries are executed using the "blastall" command with the following parameters: blastp", 50", 50", "-F Homologies to hypothetical sequences are eliminated by setting the default parameters of SCAN at the command line to 60 60" (60 identities and 60 percent identity, such that all of the results have 60 or more identities and that 60% of the alignment is made up of identities). In addition to SCAN, a E-value cutoff of le-4 is implemented.
It is specifically contemplated by the inventors that one could mutagenize a promoter to, for example, potentially improve the utility of the elements for the expression of transgenes in plants. The mutagenesis of these elements can be carried out at random and the mutagenized promoter sequences screened for activity in a trial-by-error procedure.
Alternatively, particular sequences which provide the promoter with desirable expression characteristics, or the promoter with expression enhancement activity, could be identified and these or similar sequences introduced into the sequences via mutation. It is further contemplated that one could mutagenize these sequences in order to enhance their expression of transgenes in a particular species.
The means for mutagenizing a DNA segment encoding a promoter sequence of the current invention are well-known to those of skill in the art. As indicated, modifications to promoter or other regulatory element may be made by random, or site-specific mutagenesis procedures. The promoter and other regulatory element may be modified by altering their Case S-50015A/16/78/NAD 00 O structure through the addition or deletion of one or more nucleotides from the sequence which C encodes the corresponding un-modified sequences.
O Mutagenesis may be performed in accordance with any of the techniques known in the It" art, such as, and not limited to, synthesizing an oligonucleotide having one or more mutations within the sequence of a particular regulatory region. In particular, site-specific mutagenesis is a technique useful in the preparation of promoter mutants, through specific mutagenesis of the underlying DNA. The technique further provides a ready ability to prepare and test sequence 0 variants, for example, incorporating one or more of the foregoing considerations, by 00 introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis 0 allows the production of mutants through the use of specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Typically, a primer of about 17 to about 75 nucleotides or more in length is preferred, with about 10 to about 25 or more residues on both sides of the junction of the sequence being altered.
In general, the technique of site-specific mutagenesis is well known in the art, as exemplified by various publications. As will be appreciated, the technique typically employs a phage vector which exists in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M 3 phage. These phage are readily commercially available and their use is generally well known to those skilled in the art.
Double stranded plasmids also are routinely employed in site directed mutagenesis which eliminates the step of transferring the gene of interest from a plasmid to a phage.
In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector or melting apart of two strands of a double stranded vector which includes within its sequence a DNA sequence which encodes the promoter. An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the second strand bears the desired mutation.
-71 Case S-50015A/16/78/NAD This heteroduplex vector is then used to transform or transfect appropriate cells, such 1 as E. coli cells, and cells are selected which include recombinant vectors bearing the mutated sequence arrangement. Vector DNA can then be isolated from these cells and used for plant transformation. A genetic selection scheme was devised by Kunkel et al. (1987) to enrich for clones incorporating mutagenic oligonucleotides. Alternatively, the use of PCR with commercially available thermostable enzymes such as Taq polymerase may be used to S incorporate a mutagenic oligonucleotide primer into an amplified DNA fragment that can then be cloned into an appropriate cloning or expression vector. The PCR-mediated mutagenesis 0o procedures of Tomic el al. (1990) and Upender et al. (1995) provide two examples of such 0 protocols. A PCR employing a thermostable ligase in addition to a thermostable polymerase also may be used to incorporate a phosphorylated mutagenic oligonucleotide into an amplified DNA fragment that may then be cloned into an appropriate cloning or expression vector. The mutagenesis procedure described by Michael (1994) provides an example of one such protocol.
The preparation of sequence variants of the selected promoter-encoding
DNA
segments using site-directed mutagenesis is provided as a means of producing potentially useful species and is not meant to be limiting as there are other ways in which sequence variants of DNA sequences may be obtained. For example, recombinant vectors encoding the desired promoter sequence may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence variants.
In addition, an unmodified or modified nucleotide sequence of the present invention can be varied by shuffling the sequence of the invention. To test for a function of variant DNA sequences according to the invention, the sequence of interest is operably linked to a selectable or screenable marker gene and expression of the marker gene is tested in transient expression assays with protoplasts or in stably transformed plants. It is known to the skilled artisan that DNA sequences capable of driving expression of an associated nucleotide sequence are build in a modular way. Accordingly, expression levels from shorter DNA fragments may be different than the one from the longest fragment and may be different from each other. For example, deletion of a down-regulating upstream element will lead to an increase in the expression levels of the associated nucleotide sequence while deletion of an up-regulating element will decrease the expression levels of the associated nucleotide sequence. It is also known to the skilled 72 Case S-50015A16/78"AD O artisan that deletion of development-specific or a tissue-specific element will lead to a Stemporally or spatially altered expression profile of the associated nucleotide sequence.
Embraced by the present invention are also functional equivalents of the promoters of the present invention, i.e. nucleotide sequences that hybridize under stringent conditions to any Sone of SEQ ID NOs: 1-339, 477-515, 517-526, 536-579, or 693-773, preferably to any one of SEQ ID NOs: 536-579, more preferably to any one of SEQ ID Nos: 536; 537; 539-542; 548; 'vl 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs thereof.As used herein, the term "oligonucleotide directed mutagenesis procedure" refers to template- 00 dependent processes and vector-mediated propagation which result in an increase in the Sconcentration of a specific nucleic acid molecule relative to its initial concentration, or in an increase in the concentration of a detectable signal, such as amplification. As used herein, the term "oligonucleotide directed mutagenesis procedure" also is intended to refer to a process that involves the template-dependent extension of a primer molecule. The term templatedependent process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the sequence of the newly synthesized strand of nucleic acid is dictated by the well- known rules of complementary base pairing (see, for example, Watson and Rarnstad, 1987). Typically, vector mediated methodologies involve the introduction of the nucleic acid fragment into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of the amplified nucleic acid fragment. Examples of such methodologies are provided by U.S. Patent No. 4,237,224. A number of template dependent processes are available to amplify the target sequences of interest present in a sample, such methods being well known in the art and specifically disclosed herein below.
Where a clone comprising a promoter has been isolated in accordance with the instant invention, one may wish to delimit the essential promoter regions within the clone. One efficient, targeted means for preparing mutagenizing promoters relies upon the identification of putative regulatory elements within the promoter sequence. This can be initiated by comparison with promoter sequences known to be expressed in similar tissue-specific or developmentally unique manner. Sequences which are shared among promoters with similar expression patterns are likely candidates for the binding of transcription factors and are thus likely elements which confer expression patterns. Confirmation of these putative regulatory elements can be achieved by deletion analysis of each putative regulatory region followed by -73 Case S-50015A/16/78/NAD O functional analysis of each deletion construct by assay of a reporter gene which is functionally 'i attached to each construct. As such, once a starting promoter sequence is provided, any of a number of different deletion mutants of the starting promoter could be readily prepared.
As indicated above, deletion mutants, deletion mutants of the promoter of the invention also could be randomly prepared and then assayed. With this strategy, a series of constructs are prepared, each containing a different portion of the clone (a subclone), and these constructs are then screened for activity. A suitable means for screening for activity is to attach a deleted a promoter or intron construct which contains a deleted segment to a selectable or screenable 00 marker, and to isolate only those cells expressing the marker gene. In this way, a number of 0 different, deleted promoter constructs are identified which still retain the desired, or even enhanced, activity. The smallest segment which is required for activity is thereby identified through comparison of the selected constructs. This segment may then be used for the construction of vectors for the expression of exogenous genes.
In order to improve the ability to identify transformants, one may desire to employ a selectable or screenable marker gene as, or in addition to, the expressible gene of interest.
"Marker genes" are genes that impart a distinct phenotype to cells expressing the marker gene and thus allow such transformed cells to be distinguished from cells that do not have the marker. Such genes may encode either a selectable or screenable marker, depending on whether the marker confers a trait which one can 'select' for by chemical means, through the use of a selective agent a herbicide, antibiotic, or the like), or whether it is simply a trait that one can identify through observation or testing, by 'screening' the R-locus trait, the green fluorescent protein Of course, many examples of suitable marker genes are known to the art and can be employed in the practice of the invention.
Included within the terms selectable or screenable marker genes are also genes which encode a "secretable marker" whose secretion can be detected as a means of identifying or selecting for transformed cells. Examples include markers which encode a secretable antigen that can be identified by antibody interaction, or even secretable enzymes which can be detected by their catalytic activity. Secretable proteins fall into a number of classes, including small, diffusible proteins detectable, by ELISA; small active enzymes detectable in extracellular solution alpha-amylase, beta-lactamase, phosphinothricin acetyltransferase); -74- Case S-50015A16/78/NAD and proteins that are inserted or trapped in the cell wall proteins that include a leader sequence such as that found in the expression unit of extensin or tobacco PR-S).
With regard to selectable secretable markers, the use of a gene that encodes a protein that becomes sequestered in the cell wall, and which protein includes a unique epitope is considered to be particularly advantageous. Such a secreted antigen marker would ideally employ an epitope sequence that would provide low background in plant tissue, a promoter- 1- leader sequence that would impart efficient expression and targeting across the plasma 0 membrane, and would produce protein that is bound in the cell wall and yet accessible to 00 antibodies. A normally secreted wall protein modified to include a unique epitope would C satisfy all such requirements.
One example of a protein suitable for modification in this manner is extensin, or hydroxyproline rich glycoprotein (HPRG). For example, the maize HPRG (Steifel et al., 1990) molecule is well characterized in terms of molecular biology, expression and protein structure.
However, any one of a variety of ultilane and/or glycine-rich wall proteins (Keller et al., 1989) could be modified by the addition of an antigenic site to create a screenable marker.
One exemplary embodiment of a secretable screenable marker concerns the use of a maize sequence encoding the wall protein HPRG, modified to include a 15 residue epitope from the pro-region of murine interleukin, however, virtually any detectable epitope may be employed in such embodiments, as selected from the extremely wide variety of antigenantibody combinations known to those of skill in the art. The unique extracellular epitope can then be straightforwardly detected using antibody labeling in conjunction with chromogenic or fluorescent adjuncts.
Elements of the present disclosure may be exemplified in detail through the use of the bar and/or GUS genes, and also through the use of various other markers. Of course, in light of this disclosure, numerous other possible selectable and/or screenable marker genes will be apparent to those of skill in the art in addition to the one set forth hereinbelow. Therefore, it will be understood that the following discussion is exemplary rather than exhaustive. In light of the techniques disclosed herein and the general recombinant techniques which are known in the art, the present invention renders possible the introduction of any gene, including marker genes, into a recipient cell to generate a transformed plant.
75 Case S-50015,A 16/78/NAD 00 0 Possible selectable markers for use in connection with the present invention include, but N1 are not limited to, a neo gene (Potrykus et al., 1985) which codes for kanamycin resistance and can be selected for using kanamycin, G418, paromomycin, and the like; a bar gene which codes for bialaphos or phosphinothricin resistance; a gene which encodes an altered EPSP synthase protein (Hinchee et al., 1988) thus conferring glyphosate resistance; a nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to bromoxynil (Stalker et al., 1988); a mutant acetolactate synthase gene (ALS) which confers resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals (European Patent Application 154,204, 1985); 00 a methotrexate-resistant DHFR gene (Thillet et al., 1988); a dalapon dehalogenase gene that 0 confers resistance to the herbicide dalapon; a mutated anthranilate synthase gene that confers resistance to 5-methyl tryptophan. Preferred selectable marker genes encode phosphinothricin acetyltransferase; glyphosate resistant EPSPS, aminoglycoside phosphotransferase; hygromycin phosphotransferase, or neomycin phosphotransferase. Where a mutant EPSP synthase gene is employed, additional benefit may be realized through the incorporation of a suitable chloroplast transit peptide, CTP (European Patent Application 0,218,571, 1987).
An illustrative embodiment of a selectable marker gene capable of being used in systems to select transformants is the genes that encode the enzyme phosphinothricin acetyltransferase, such as the bar gene from Streptomyces hygroscopicus or the pat gene from Streptomyces viridochromogenes. The enzyme phosphinothricin acetyl transferase (PAT) inactivates the active ingredient in the herbicide bialaphos, phosphinothricin (PPT). PPT inhibits glutamine synthetase, (Murakami et al., 1986; Twell et al., 1989) causing rapid accumulation of ammonia and cell death. The success in using this selective system in conjunction with monocots was particularly surprising because of the major difficulties which have been reported in transformation of cereals (Potrykus, 1989).
Where one desires to employ a bialaphos resistance gene in the practice of the invention, a particularly useful gene for this purpose is the bar or pat genes obtainable from species of Streptomyces ATCC No. 21,705). The cloning of the bar gene has been described (Murakami et al., 1986; Thompson et al., 1987) as has the use of the bar gene in the context of plants other than monocots (De Block et al., 1987; De Block et al., 1989).
Selection markers resulting in positive selection, such as a phosphomannose isomerase gene, as described in patent application WO 93/05163, may also be used. Alternative genes to -76- Case S-50015A/16/78[NAD O be used for positive selection are described in WO 94/20627 and encode xyloisomerases and CN phosphomanno-isomerases such as mannose-6-phosphate isomerase and mannose-1-phosphate Sisomerase; phosphomanno mutase; mannose epimerases such as those which convert carbohydrates Sto mannose or mannose to carbohydrates such as glucose or galactose; phosphatases such as mannose or xylose phosphatase, mannose-6-phosphatase and mannose-l-phosphatase, and permeases which are involved in the transport of mannose, or a derivative, or a precursor thereof into the cell. Transformed cells are identified without damaging or killing the non-transformed cells in the population and without co-introduction of antibiotic or herbicide resistance genes.
OO As described in WO 93/05163, in addition to the fact that the need for antibiotic or herbicide resistance genes is eliminated, it has been shown that the positive selection method is often far more efficient than traditional negative selection.
Screenable markers that may be employed include, but are not limited to, a betaglucuronidase (GUS) or uidA gene which encodes an enzyme for which various chromogenic substrates are known; an R-locus gene, which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al., 1988); a beta-lactamase gene (Sutcliffe, 1978), which encodes an enzyme for which various chromogenic substrates are known PADAC, a chromogenic cephalosporin); a xylE gene (Zukowsky et al., 1983) which encodes a catechol dioxygenase that can convert chromogenic catechols; an a-amylase gene (Ikuta et al., 1990); a tyrosinase gene (Katz et al., 1983) which encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to form the easily detectable compound melanin; a B-galactosidase gene, which encodes an enzyme for which there are chromogenic substrates; a luciferase (lux) gene (Ow et al., 1986), which allows for bioluminescence detection; or even an aequorin gene (Prasher et al., 1985), which may be employed in calcium-sensitive bioluminescence detection, or a green fluorescent protein gene (Niedz et al., 1995).
Genes from the maize R gene complex are contemplated to be particularly useful as screenable markers. The R gene complex in maize encodes a protein that acts to regulate the production of anthocyanin pigments in most seed and plant tissue. A gene from the R gene complex was applied to maize transformation, because the expression of this gene in transformed cells does not harm the cells. Thus, an R gene introduced into such cells will -77- Case S-50015A/16/78/NAD Scause the expression of a red pigment and, if stably incorporated, can be visually scored as a N red sector. If a maize line is carries dominant Oultila for genes encoding the enzymatic intermediates in the anthocyanin biosynthetic pathway (C2, Al, A2, Bzl and Bz2), but carries a recessive allele at the R locus, transformation of any cell from that line with R will result in red pigment formation. Exemplary lines include Wisconsin 22 which contains the rg-Stadler allele and TR112, a K55 derivative which is r-g, b, P1. Alternatively any genotype of maize can be utilized if the Cl and R alleles are introduced together.
It is further proposed that R gene regulatory regions may be employed in chimeric OO constructs in order to provide mechanisms for controlling the expression of chimeric genes.
More diversity of phenotypic expression is known at the R locus than at any other locus (Coe et al., 1988). It is contemplated that regulatory regions obtained from regions 5' to the structural R gene would be valuable in directing the expression of genes, insect resistance, drought resistance, herbicide tolerance or other protein coding regions. For the purposes of the present invention, it is believed that any of the various R gene family members may be successfully employed P, S, Lc, etc.). However, the most preferred will generally be Sn (particularly Sn:bol3). Sn is a dominant member of the R gene complex and is functionally similar to the R and B loci in that Sn controls the tissue specific deposition of anthocyanin pigments in certain seedling and plant cells, therefore, its phenotype is similar to R.
A further screenable marker contemplated for use in the present invention is firefly luciferase, encoded by the lux gene. The presence of the lux gene in transformed cells may be detected using, for example, X-ray film, scintillation counting, fluorescent spectrophotometry, low-light video cameras, photon counting cameras or multiwell luminometry. It is also envisioned that this system may be developed for populational screening for bioluminescence, such as on tissue culture plates, or even for whole plant screening. Where use of a screenable marker gene such as lux or GFP is desired, benefit may be realized by creating a gene fusion between the screenable marker gene and a selectable marker gene, for example, a GFP-NPTII gene fusion. This could allow, for example, selection of transformed cells followed by screening of transgenic plants or seeds.
Genes of interest are reflective of the commercial markets and interests of those involved in the development of the crop. Crops and markets of interest changes, and as developing nations open up world markets, new crops and technologies will also emerge. In 78- Case S-50015A/16/78/NAD O addition, as the understanding of agronomic traits and characteristics such as yield and heterosis increase, the choice of genes for transformation will change accordingly. General categories of genes of interest include, for example, those genes involved in information, such Sas zinc fingers, those involved in communication, such as kinases, and those involved in housekeeping, such as heat shock proteins. More specific categories of transgenes, for example, include genes encoding important traits for agronomics, insect resistance, disease resistance, herbicide resistance, sterility, grain characteristics, and commercial products. Genes C of interest include, generally, those involved in starch, oil, carbohydrate, or nutrient 00 metabolism, as well as those affecting kernel size, sucrose loading, zinc finger proteins, see, U.S. Patent No. 5,789,538, WO 99/48909; WO 99/45132; WO 98/53060; WO 98/53057; WO 98/53058; WO 00/23464; WO 95/19431; and WO 98/54311, and the like.
One skilled in the art recognizes that the expression level and regulation of a transgene in a plant can vary significantly from line to line. Thus, one has to test several lines to find one with the desired expression level and regulation. Once a line is identified with the desired regulation specificity of a chimeric Cre transgene, it can be crossed with lines carrying different inactive replicons or inactive transgene for activation.
Other sequences which may be linked to the gene of interest which encodes a polypeptide are those which can target to a specific organelle, to the mitochondria, nucleus, or plastid, within the plant cell. Targeting can be achieved by providing the polypeptide with an appropriate targeting peptide sequence, such as a secretory signal peptide (for secretion or cell wall or membrane targeting, a plastid transit peptide, a chloroplast transit peptide, the chlorophyll a/b binding protein, a mitochondrial target peptide, a vacuole targeting peptide, or a nuclear targeting peptide, and the like. For example, the small subunit of ribulose bisphosphate carboxylase transit peptide, the EPSPS transit peptide or the dihydrodipicolinic acid synthase transit peptide may be used. For examples of plastid organelle targeting sequences (see WO 00/12732). Plastids are a class of plant organelles derived from proplastids and include chloroplasts, leucoplasts, aravloplasts, and chromoplasts. The plastids are major sites of biosynthesis in plants. In addition to photosynthesis in the chloroplast, plastids are also sites of lipid biosynthesis, nitrate reduction to ammonium, and starch storage.
And while plastids contain their own circular genome, most of the proteins localized to the -79- Case S-50015A/16/78/NAD C plastids are encoded by the nuclear genome and are imported into the organelle from the CN cytoplasm.
Transgenes used with the present invention will often be genes that direct the expression of a particular protein or polypeptide product, but they may also be non-expressible DNA segments, transposons such as Ds that do no direct their own transposition. As used herein, an "expressible gene" is any gene that is capable of being transcribed into RNA mRNA, antisense RNA, etc.) or translated into a protein, expressed as a trait of interest, or the like, etc., and is not limited to selectable, screenable or non-selectable marker genes.
00 The invention also contemplates that, where both an expressible gene that is not necessarily a 0 marker gene is employed in combination with a marker gene, one may employ the separate genes on either the same or different DNA segments for transformation. In the latter case, the different vectors are delivered concurrently to recipient cells to maximize cotransformation.
The choice of the particular DNA segments to be delivered to the recipient cells will often depend on the purpose of the transformation. One of the major purposes of transformation of crop plants is to add some commercially desirable, agronomically important traits to the plant. Such traits include, but are not limited to, herbicide resistance or tolerance; insect resistance or tolerance; disease resistance or tolerance (viral, bacterial, fungal, nematode); stress tolerance and/or resistance, as exemplified by resistance or tolerance to drought, heat, chilling, freezing, excessive moisture, salt stress; oxidative stress; increased yields; food content and makeup; physical appearance; male sterility; drydown; standability; prolificacy; starch properties; oil quantity and quality; and the like. One may desire to incorporate one or more genes conferring any such desirable trait or traits, such as, for example, a gene or genes encoding pathogen resistance.
In certain embodiments, the present invention contemplates the transformation of a recipient cell with more than one advantageous transgene. Two or more transgenes can be supplied in a single transformation event using either distinct transgene-encoding vectors, or using a single vector incorporating two or more gene coding sequences. For example, plasmids bearing the bar and aroA expression units in either convergent, divergent, or colinear orientation, are considered to be particularly useful. Further preferred combinations are those of an insect resistance gene, such as a Bt gene, along with a protease inhibitor gene such as pinll, or the use of bar in combination with either of the above genes. Of course, any two or Case S-50015A/16/78/NAD 00 S more transgenes of any description, such as those conferring herbicide, insect, disease (viral, C' bacterial, fungal, nematode) or drought resistance, male sterility, drydown, standability, C) prolificacy, starch properties, oil quantity and quality, or those increasing yield or nutritional Squality may be employed as desired.
The genes encoding phosphinothricin acetyltransferase (bar and pat), glyphosate tolerant EPSP synthase genes, the glyphosate degradative enzyme gene gox encoding Sglyphosate oxidoreductase, deh (encoding a dehalogenase enzyme that inactivates dalapon), herbicide resistant sulfonylurea and imidazolinone) acetolactate synthase, and bxn genes 00 (encoding a nitrilase enzyme that degrades bromoxynil) are good examples of herbicide 0 resistant genes for use in transformation. The bar and pat genes code for an enzyme, phosphinothricin acetyltransferase (PAT), which inactivates the herbicide phosphinothricin and prevents this compound from inhibiting glutamine synthetase enzymes. The enzyme enolpyruvylshikimate 3-phosphate synthase (EPSP Synthase), is normally inhibited by the herbicide N-(phosphonomethyl)glycine (glyphosate). However, genes are known that encode glyphosate-resistant EPSP Synthase enzymes.
These genes are particularly contemplated for use in monocot transformation. The deh gene encodes the enzyme dalapon dehalogenase and confers resistance to the herbicide dalapon. The bxn gene codes for a specific nitrilase enzyme that converts bromoxynil to a non-herbicidal degradation product.
An important aspect of the present invention concerns the introduction of insect resistance-conferring genes into plants. Potential insect resistance genes which can be introduced include Bacillus thuringiensis crystal toxin genes or Bt genes (Watrud et al., 1985).
Bt genes may provide resistance to lepidopteran or coleopteran pests such as European Corn Borer (ECB) and corn rootworm (CRW). Preferred Bt toxin genes for use in such embodiments include the CryIA(b) and CryIA(c) genes. Endotoxin genes from other species of B. thuringiensis which affect insect growth or development may also be employed in this regard.
The poor expression of Bt toxin genes in plants is a well-documented phenomenon, and the use of different promoters, fusion proteins, and leader sequences has not led to significant increases in Bi protein expression (Vaeck et al., 1989; Barton et al., 1987). It is therefore contemplated that the most advantageous Bt genes for use in the transformation protocols 81 Case S-50015A/16/78/NAD 0 disclosed herein will be those in which the coding sequence has been modified to effect N increased expression in plants, and more particularly, those in which maize preferred codons have been used. Examples of such modified Bt toxin genes include the variant Bt CrylA(b) t C gene termed lab6 (Perlak et al., 1991) and the synthetic CrylA(c) genes termed 1800a and 1800b.
Protease inhibitors may also provide insect resistance (Johnson et al., 1989), and will thus have utility in plant transformation. The use of a protease inhibitor II gene, pinll, from tomato or potato is envisioned to be particularly useful. Even more advantageous is the use of OO a pinll gene in combination with a Bt toxin gene, the combined effect of which has been 0 discovered by the present inventors to produce synergistic insecticidal activity. Other genes which encode inhibitors of the insects' digestive system, or those that encode enzymes or cofactors that facilitate the production of inhibitors, may also be useful. This group may be exemplified by oryzacystatin and amylase inhibitors, such as those from wheat and barley.
Also, genes encoding lectins may confer additional or alternative insecticide properties.
Lectins (originally termed phytohemagglutinins) are multivalent carbohydrate-binding proteins which have the ability to agglutinate red blood cells from a range of species. Lectins have been identified recently as insecticidal agents with activity against weevils, ECB and rootworm (Murdock et al., 1990; Czapla and Lang, 1990). Lectin genes contemplated to be useful include, for example, barley and wheat germ agglutinin (WGA) and rice lectins (Gatehouse et al., 1984), with WGA being preferred.
Genes controlling the production of large or small polypeptides active against insects when introduced into the insect pests, such as, lytic peptides, peptide hormones and toxins and venoms, form another aspect of the invention. For example, it is contemplated that the expression of juvenile hormone esterase, directed towards specific insect pests, may also result in insecticidal activity, or perhaps cause cessation of metamorphosis (Hammock et al., 1990).
Transgenic plants expressing genes which encode enzymes that affect the integrity of the insect cuticle form yet another aspect of the invention. Such genes include those encoding, chitinase, proteases, lipases and also genes for the production of nikkomycin, a compound that inhibits chitin synthesis, the introduction of any of which is contemplated to produce insect resistant maize plants. Genes that code for activities that affect insect molting, such those 82- Case S-50015A/16/78/NAD O affecting the production of ecdysteroid UDP-glucosyl transferase, also fall within the scope of C the useful transgenes of the present invention.
Genes that code for enzymes that facilitate the production of compounds that reduce the nutritional quality of the host plant to insect pests are also encompassed by the present invention. It may be possible, for instance, to confer insecticidal activity on a plant by altering its sterol composition. Sterols are obtained by insects from their diet and are used for hormone synthesis and membrane stability. Therefore alterations in plant sterol composition by expression of novel genes, those that directly promote the production of undesirable 00 sterols or those that convert desirable sterols into undesirable forms, could have a negative effect on insect growth and/or development and hence endow the plant with insecticidal activity. Lipoxygenases are naturally occurring plant enzymes that have been shown to exhibit anti-nutritional effects on insects and to reduce the nutritional quality of their diet. Therefore, further embodiments of the invention concern transgenic plants with enhanced lipoxygenase activity which may be resistant to insect feeding.
The present invention also provides methods and compositions by which to achieve qualitative or quantitative changes in plant secondary metabolites. One example concerns transforming plants to produce DIMBOA which, it is contemplated, will confer resistance to European corn borer, rootworm and several other maize insect pests. Candidate genes that are particularly considered for use in this regard include those genes at the bx locus known to be involved in the synthetic DIMBOA pathway (Dunn et al., 1981). The introduction of genes that can regulate the production of maysin, and genes involved in the production of dhurrin in sorghum, is also contemplated to be of use in facilitating resistance to earworm and rootworm, respectively.
Tripsacum dacryloides is a species of grass that is resistant to certain insects, including corn root worm. It is anticipated that genes encoding proteins that are toxic to insects or are involved in the biosynthesis of compounds toxic to insects will be isolated from Tripsacum and that these novel genes will be useful in conferring resistance to insects. It is known that the basis of insect resistance in Tripsacum is genetic, because said resistance has been transferred to Zea mays via sexual crosses (Branson and Guss, 1972).
Further genes encoding proteins characterized as having potential insecticidal activity may also be used as transgenes in accordance herewith. Such genes include, for example, the 83- Case S-50015A/16/78/NAD O cowpea trypsin inhibitor (CpTI; Hilder et al., 1987) which may be used as a rootworm deterrent; genes encoding avermectin (Campbell, 1989; Ikeda et al., 1987) which may prove i particularly useful as a corn rootworm deterrent; ribosome inactivating protein genes; and even Sgenes that regulate plant structures. Transgenic maize including anti-insect antibody genes and genes that code for enzymes that can covert a non-toxic insecticide (pro-insecticide) applied to the outside of the plant into an insecticide inside the plant are also contemplated.
Improvement of a plant's ability to tolerate various environmental stresses such as, but not limited to, drought, excess moisture, chilling, freezing, high temperature, salt, and 0O oxidative stress, can also be effected through expression of heterologous, or overexpression of homologous genes. Benefits may be realized in terms of increased resistance to freezing temperatures through the introduction of an "antifreeze" protein such as that of the Winter Flounder (Cutler et al., 1989) or synthetic gene derivatives thereof. Improved chilling tolerance may also be conferred through increased expression of glycerol-3-phosphate acetyltransferase in chloroplasts (Murata et al., 1992; Wolter et al., 1992). Resistance to oxidative stress (often exacerbated by conditions such as chilling temperatures in combination with high light intensities) can be conferred by expression of superoxide dismutase (Gupta et al., 1993), and may be improved by glutathione reductase (Bowler et al., 1992). Such strategies may allow for tolerance to freezing in newly emerged fields as well as extending later maturity higher yielding varieties to earlier relative maturity zones.
Expression of novel genes that favorably effect plant water content, total water potential, osmotic potential, and turgor can enhance the ability of the plant to tolerate drought.
As used herein, the terms "drought resistance" and "drought tolerance" are used to refer to a plants increased resistance or tolerance to stress induced by a reduction in water availability, as compared to normal circumstances, and the ability of the plant to function and survive in lower-water environments, and perform in a relatively superior manner. In this aspect of the invention it is proposed, for example, that the expression of a gene encoding the biosynthesis of osmotically-active solutes can impart protection against drought. Within this class of genes are DNAs encoding mannitol dehydrogenase (Lee and Saier, 1982) and trehalose-6-phosphate synthase (Kaasen et al., 1992). Through the subsequent action of native phosphatases in the cell or by the introduction and coexpression of a specific phosphatase, these introduced genes will result in the accumulation of either mannitol or trehalose, respectively, both of which have 84- Case S-50015A116/78NAD 0 been well documented as protective compounds able to mitigate the effects of stress. Mannitol 1 accumulation in transgenic tobacco has been verified and preliminary results indicate that Splants expressing high levels of this metabolite are able to tolerate an applied osmotic stress (Tarczynski et al., cited supra (1992), 1993).
Similarly, the efficacy of other metabolites in protecting either enzyme function (e.g.
alanopine or propionic acid) or membrane integrity alanopine) has been documented (Loomis et al., 1989), and therefore expression of gene encoding the biosynthesis of these compounds can confer drought resistance in a manner similar to or complimentary to mannitol.
0 Other examples of naturally occurring metabolites that are osmotically active and/or provide some direct protective effect during drought and/or desiccation include sugars and sugar derivatives such as fructose, erythritol (Coxson et al., 1992), sorbitol, dulcitol (Karsten et al., 1992), glucosylglycerol (Reed et al., 1984; Erdmann et al., 1992), sucrose, stachyose (Koster and Leopold, 1988; Blackman et al., 1992), ononitol and pinitol (Vernon and Bohnert, 1992), and raffinose (Bernal-Lugo and Leopold, 1992). Other osmotically active solutes which are not sugars include, but are not limited to, proline and glycine-betaine (Wyn-Jones and Storey, 1981). Continued canopy growth and increased reproductive fitness during times of stress can be augmented by introduction and expression of genes such as those controlling the osmotically active compounds discussed above and other such compounds, as represented in one exemplary embodiment by the enzyme myoinositol 0-methyltransferase.
It is contemplated that the expression of specific proteins may also increase drought tolerance. Three classes of Late Embryogenic Proteins have been assigned based on structural similarities (see Dure et al., 1989). All three classes of these proteins have been demonstrated in maturing desiccating) seeds. Within these 3 types of proteins, the Type-Il (dehydrintype) have generally been implicated in drought and/or desiccation tolerance in vegetative plant parts Mundy and Chua, 1988; Piatkowski et al., 1990; Yamaguchi-Shinozaki et al., 1992).
Recently, expression of a Type-Ill LEA (HVA-1) in tobacco was found to influence plant height, maturity and drought tolerance (Fitzpatrick, 1993). Expression of structural genes from all three groups may therefore confer drought tolerance. Other types of proteins induced during water stress include thiol proteases, aldolases and transmembrane transporters (Guerrero et al., 1990), which may confer various protective and/or repair-type functions Case S-50015oA/16/78/NAD 0 during drought stress. The expression of a gene that effects lipid biosynthesis and hence ,I membrane composition can also be useful in conferring drought resistance on the plant.
Many genes that improve drought resistance have complementary modes of action.
C Thus, combinations of these genes might have additive and/or synergistic effects in improving Sdrought resistance in plants. Many of these genes also improve freezing tolerance (or resistance); the physical stresses incurred during freezing and drought are similar in nature and may be mitigated in similar fashion. Benefit may be conferred via constitutive expression of these genes, but the preferred means of expressing these novel genes may be through the use of 00 a turgor-induced promoter (such as the promoters for the turgor-induced genes described in Guerrero et al. 1990 and Shagan et al., 1993). Spatial and temporal expression patterns of these genes may enable maize to better withstand stress.
Expression of genes that are involved with specific morphological traits that allow for increased water extractions from drying soil would be of benefit. For example, introduction and expression of genes that alter root characteristics may enhance water uptake. Expression of genes that enhance reproductive fitness during times of stress would be of significant value.
For example, expression of DNAs that improve the synchrony of pollen shed and receptiveness of the female flower parts, silks, would be of benefit. In addition, expression of genes that minimize kernel abortion during times of stress would increase the amount of grain to be harvested and hence be of value. Regulation of cytokinin levels in monocots, such as maize, by introduction and expression of an isopentenyl transferase gene with appropriate regulatory sequences can improve monocot stress resistance and yield (Gan et al., Science, 270:1986 (1995)).
Given the overall role of water in determining yield, it is contemplated that enabling plants to utilize water more efficiently, through the introduction and expression of novel genes, will improve overall performance even when soil water availability is not limiting. By introducing genes that improve the ability of plants to maximize water usage across a full range of stresses relating to water availability, yield stability or consistency of yield performance may be realized.
It is proposed that increased resistance to diseases may be realized through introduction of genes into plants period. It is possible to produce resistance to diseases caused by viruses, bacteria, fungi, root pathogens, insects and nematodes. It is also contemplated that -86- Case S-50015A/16/78/NAD 0 control of mycotoxin producing organisms may be realized through expression of introduced C genes.
SResistance to viruses may be produced through expression of novel genes. For example, it has been demonstrated that expression of a viral coat protein in a transgenic plant can impart resistance to infection of the plant by that virus and perhaps other closely related viruses (Cuozzo et al., 1988, Hemenway et al., 1988, Abel et al., 1986). It is contemplated S that expression of antisense genes targeted at essential viral functions may impart resistance to said virus. For example, an antisense gene targeted at the gene responsible for replication of viral nucleic acid may inhibit said replication and lead to resistance to the virus. It is believed that interference with other viral functions through the use of antisense genes may also increase resistance to viruses. Further it is proposed that it may be possible to achieve resistance to viruses through other approaches, including, but not limited to the use of satellite viruses.
It is proposed that increased resistance to diseases caused by bacteria and fungi may be realized through introduction of novel genes. It is contemplated that genes encoding so-called "peptide antibiotics," pathogenesis related (PR) proteins, toxin resistance, and proteins affecting host-pathogen interactions such as morphological characteristics will be useful.
Peptide antibiotics are polypeptide sequences which are inhibitory to growth of bacteria and other microorganisms. For example, the classes of peptides referred to as cecropins and magainins inhibit growth of many species of bacteria and fungi. It is proposed that expression of PR proteins in plants may be useful in conferring resistance to bacterial disease. These genes are induced following pathogen attack on a host plant and have been divided into at least five classes of proteins (Bol et al., 1990). Included amongst the PR proteins are beta-1,3glucanases, chitinases, and osmotin and other proteins that are believed to function in plant resistance to disease organisms. Other genes have been identified that have antifungal properties, UDA (stinging nettle lectin) and hevein (Broakgert et al., 1989; Barkai-Golan et al., 1978). It is known that certain plant diseases are caused by the production of phytotoxins. Resistance to these diseases could be achieved through expression of a novel gene that encodes an enzyme capable of degrading or otherwise inactivating the phytotoxin.
Expression novel genes that alter the interactions between the host plant and pathogen may be useful in reducing the ability the disease organism to invade the tissues of the host plant, e.g., an increase in the waxiness of the leaf cuticle or other morphological characteristics.
-87- Case S-50015A/16/78/NAD 0 Plant parasitic nematodes are a cause of disease in many plants. It is proposed that it N would be possible to make the plant resistant to these organisms through the expression of novel genes. It is anticipated that control of nematode infestations would be accomplished by altering the ability of the nematode to recognize or attach to a host plant and/or enabling the plant to produce nematicidal compounds, including but not limited to proteins.
Production of mycotoxins, including aflatoxin and fumonisin, by fungi associated with plants is a significant factor in rendering the grain not useful. These fungal organisms do not cause disease symptoms and/or interfere with the growth of the plant, but they produce 0 chemicals (mycotoxins) that are toxic to animals. Inhibition of the growth of these fungi would reduce the synthesis of these toxic substances and, therefore, reduce grain losses due to mycotoxin contamination. Novel genes may be introduced into plants that would inhibit synthesis of the mycotoxin without interfering with fungal growth. Expression of a novel gene which encodes an enzyme capable of rendering the mycotoxin nontoxic would be useful in order to achieve reduced mycotoxin contamination of grain. The result of any of the above mechanisms would be a reduced presence of mycotoxins on grain.
Genes may be introduced into plants, particularly commercially important cereals such as maize, wheat or rice, to improve the grain for which the cereal is primarily grown. A wide range of novel transgenic plants produced in this manner may be envisioned depending on the particular end use of the grain.
For example, the largest use of maize grain is for feed or food. Introduction of genes that alter the composition of the grain may greatly enhance the feed or food value. The primary components of maize grain are starch, protein, and oil. Each of these primary components of maize grain may be improved by altering its level or composition. Several examples may be mentioned for illustrative purposes but in no way provide an exhaustive list of possibilities.
The protein of many cereal grains is suboptimal for feed and food purposes especially when fed to pigs, poultry, and humans. The protein is deficient in several amino acids that are essential in the diet of these species, requiring the addition of supplements to the grain.
Limiting essential amino acids may include lysine, methionine, tryptophan, threonine, valine, arginine, and histidine. Some amino acids become limiting only after the grain is supplemented with other inputs for feed formulations. For example, when the grain is supplemented with 88- Case S-50015A/16/78/NAD o soybean meal to meet lysine requirements, methionine becomes limiting. The levels of these C essential amino acids in seeds and grain may be elevated by mechanisms which include, but are i not limited to, the introduction of genes to increase the biosynthesis of the amino acids, decrease the degradation of the amino acids, increase the storage of the amino acids in proteins, or increase transport of the amino acids to the seeds or grain.
One mechanism for increasing the biosynthesis of the amino acids is to introduce genes that deregulate the amino acid biosynthetic pathways such that the plant can no longer Sadequately control the levels that are produced. This may be done by deregulating or oo bypassing steps in the amino acid biosynthetic pathway which are normally regulated by levels O of the amino acid end product of the pathway. Examples include the introduction of genes that encode deregulated versions of the enzymes aspartokinase or dihydrodipicolinic acid (DHDP)synthase for increasing lysine and threonine production, and anthranilate synthase for increasing tryptophan.production. Reduction of the catabolism of the amino acids may be accomplished by introduction of DNA sequences that reduce or eliminate the expression of genes encoding enzymes that catalyse steps in the catabolic pathways such as the enzyme lysine-ketoglutarate reductase.
The protein composition of the grain may be altered to improve the balance of amino acids in a variety of ways including elevating expression of native proteins, decreasing expression of those with poor composition, changing the composition of native proteins, or introducing genes encoding entirely new proteins possessing superior composition. DNA may be introduced that decreases the expression of members of the zein family of storage proteins.
This DNA may encode ribozymes or antisense sequences directed to impairing expression of zein proteins or expression of regulators of zein expression such as the opaque-2 gene product.
The protein composition of the grain may be modified through the phenomenon of cosuppression, inhibition of expression of an endogenous gene through the expression of an identical structural gene or gene fragment introduced through transformation (Goring et al., 1991). Additionally, the introduced DNA may encode enzymes which degrade seines. The decreases in zein expression that are achieved may be accompanied by increases in proteins with more desirable amino acid composition or increases in other major seed constituents such as starch. Alternatively, a chimeric gene may be introduced that comprises a coding sequence for a native protein of adequate amino acid composition such as for one of the globulin -89- Case S-5005A/16/78/NAD Sproteins or 10 kD zein of maize and a promoter or other regulatory sequence designed to N elevate expression of said protein. The coding sequence of said gene may include additional or replacement codons for essential amino acids. Further, a coding sequence obtained from another species, or, a partially or completely synthetic sequence encoding a completely unique peptide sequence designed to enhance the amino acid composition of the seed may be employed.
SThe introduction of genes that alter the oil content of the grain may be of value.
Increases in oil content may result in increases in metabolizable energy content and density of 00 the seeds for uses in feed and food. The introduced genes may encode enzymes that remove or reduce rate-limitations or regulated steps in fatty acid or lipid biosynthesis. Such genes may include, but are not limited to, those that encode acetyl-CoA carboxylase, ACPacyltransferase, beta-ketoacyl-ACP synthase, plus other well known fatty acid biosynthetic activities. Other possibilities are genes that encode proteins that do not possess enzymatic activity such as acyl carrier protein. Additional examples include 2-acetyltransferase, oleosin pyruvate dehydrogenase complex, acetyl CoA synthetase, ATP citrate lyase, ADP-glucose pyrophosphorylase and genes of the carnitine-CoA- acetyl-CoA shuttles. It is anticipated that expression of genes related to oil biosynthesis will be targeted to the plastid, using a plastid transit peptide sequence and preferably expressed in the seed embryo. Genes may be introduced that alter the balance of fatty acids present in the oil providing a more healthful or nutritive feedstuff. The introduced DNA may also encode sequences that block expression of enzymes involved in fatty acid biosynthesis, altering the proportions of fatty acids present in the grain such as described below.
Genes may be introduced that enhance the nutritive value of the starch component of the grain, for example by increasing the degree of branching, resulting in improved utilization of the starch in cows by delaying its metabolism.
Besides affecting the major constituents of the grain, genes may be introduced that affect a variety of other nutritive, processing, or other quality aspects of the grain as used for feed or food. For example, pigmentation of the grain may be increased or decreased.
Enhancement and stability of yellow pigmentation is desirable in some animal feeds and may be achieved by introduction of genes that result in enhanced production of xanthophylls and carotenes by eliminating rate-limiting steps in their production. Such genes may encode altered Case S-50015A/16/78/NAD forms of the enzymes phytoene synthase, phytoene desaturase, or lycopene synthase.
Alternatively, unpigmented white corn is desirable for production of many food products and may be produced by the introduction of DNA which blocks or eliminates steps in pigment production pathways.
Feed or food comprising some cereal grains possesses insufficient quantities of vitamins and must be supplemented to provide adequate nutritive value. Introduction of genes that enhance vitamin biosynthesis in seeds may be envisioned including, for example, vitamins A, E, O B 1 2 choline, and the like. For example, maize grain also does not possess sufficient mineral 0 content for optimal nutritive value. Genes that affect the accumulation or availability of compounds containing phosphorus, sulfur, calcium, manganese, zinc, and iron among others would be valuable. An example may be the introduction of a gene that reduced phytic acid production or encoded the enzyme phytase which enhances phytic acid breakdown. These genes would increase levels of available phosphate in the diet, reducing the need for supplementation with mineral phosphate.
Numerous other examples of improvement of cereals for feed and food purposes might be described. The improvements may not even necessarily involve the grain, but may, for example, improve the value of the grain for silage. Introduction of DNA to accomplish this might include sequences that alter lignin production such as those that result in the "brown midrib" phenotype associated with superior feed value for cattle.
In addition to direct improvements in feed or food value, genes may also be introduced which improve the processing of grain and improve the value of the products resulting from the processing. The primary method of processing certain grains such as maize is via wetmilling. Maize may be improved though the expression of novel genes that increase the efficiency and reduce the cost of processing such as by decreasing steeping time.
Improving the value of wetmilling products may include altering the quantity or quality of starch, oil, corn gluten meal, or the components of corn gluten feed. Elevation of starch may be achieved through the identification and elimination of rate limiting steps in starch biosynthesis or by decreasing levels of the other components of the grain resulting in proportional increases in starch. An example of the former may be the introduction of genes encoding ADP-glucose pyrophosphorylase enzymes with altered regulatory activity or which -91 Case S-50015A/16/78/NAD o are expressed at higher level. Examples of the latter may include selective inhibitors of, for N example, protein or oil biosynthesis expressed during later stages of kernel development.
The properties of starch may be beneficially altered by changing the ratio of amylose to amylopectin, the size of the starch molecules, or their branching pattern. Through these changes a broad range of properties may be modified which include, but are not limited to, changes in gelatinization temperature, heat of gelatinization, clarity of films and pastes, Theological properties, and the like. To accomplish these changes in properties, genes that encode granule-bound or soluble starch synthase activity or branching enzyme activity may be 00 introduced alone or combination. DNA such as antisense constructs may also be used to decrease levels of endogenous activity of these enzymes. The introduced genes or constructs may possess regulatory sequences that time their expression to specific intervals in starch biosynthesis and starch granule development. Furthermore, it may be advisable to introduce and express genes that result in the in vivo derivatization, or other modification, of the glucose moieties of the starch molecule. The covalent attachment of any molecule may be envisioned, limited only by the existence of enzymes that catalyze the derivatizations and the accessibility of appropriate substrates in the starch granule. Examples of important derivations may include the addition of functional groups such as amines, carboxyls, or phosphate groups which provide sites for subsequent in vitro derivatizations or affect starch properties through the introduction of ionic charges. Examples of other modifications may include direct changes of the glucose units such as loss of hydroxyl groups or their oxidation to aldehyde or carboxyl groups.
Oil is another product of wetmilling of corn and other grains, the value of which may be improved by introduction and expression of genes. The quantity of oil that can be extracted by wetmilling may be elevated by approaches as described for feed and food above. Oil properties may also be altered to improve its performance in the production and use of cooking oil, shortenings, lubricants or other oil-derived products or improvement of its health attributes when used in the food-related applications. Novel fatty acids may also be synthesized which upon extraction can serve as starting materials for chemical syntheses. The changes in oil properties may be achieved by altering the type, level, or lipid arrangement of the fatty acids present in the oil. This in turn may be accomplished by the addition of genes that encode enzymes that catalyze the synthesis of novel fatty acids and the lipids possessing them or by -92- Case S-50015A/16/78[NAD oo increasing levels of native fatty acids while possibly reducing levels of precursors.
C Alternatively DNA sequences may be introduced which slow or block steps in fatty acid 1) biosynthesis resulting in the increase in precursor fatty acid intermediates. Genes that might be t" added include desaturases, epoxidases, hydratases, dehydratases, and other enzymes that catalyze reactions involving fatty acid intermediates. Representative examples of catalytic steps that might be blocked include the desaturations from stearic to oleic acid and oleic to inolenic acid resulting in the respective accumulations of stearic and oleic acids.
SImprovements in the other major cereal wetmilling products, gluten meal and gluten 00 feed, may also be achieved by the introduction of genes to obtain novel plants. Representative possibilities include but are not limited to those described above for improvement of food and feed value.
In addition it may further be considered that the plant be used for the production or manufacturing of useful biological compounds that were either not produced at all, or not produced at the same level, in the plant previously. The novel plants producing these compounds are made possible by the introduction and expression of genes by transformation methods. The possibilities include, but are not limited to, any biological compound which is presently produced by any organism such as proteins, nucleic acids, primary and intermediary metabolites, carbohydrate polymers, etc. The compounds may be produced by the plant, extracted upon harvest and/or processing, and used for any presently recognized useful purpose such as pharmaceuticals, fragrances, industrial enzymes to name a few.
Further possibilities to exemplify the range of grain traits or properties potentially encoded by introduced genes in transgenic plants include grain with less breakage susceptibility for export purposes or larger grit size when processed by dry milling through introduction of genes that enhance gamma-zein synthesis, popcorn with improved popping quality and expansion volume through genes that increase pericarp thickness, corn with whiter grain for food uses though introduction of genes that effectively block expression of enzymes involved in pigment production pathways, and improved quality of alcoholic beverages or sweet corn through introduction of genes which affect flavor such as the shrunken gene (encoding sucrose synthase) for sweet corn.
Two of the factors determining where plants can be grown are the average daily temperature during the growing season and the length of time between frosts. Within the areas -93 Case S-50015A/16/78/NAD 0 where it is possible to grow a particular plant, there are varying limitations on the maximal time it is allowed to grow to maturity and be harvested. The plant to be grown in a particular area is selected for its ability to mature and dry down to harvestable moisture content within the tr required period of time with maximum possible yield. Therefore, plant of varying maturities are developed for different growing locations. Apart from the need to dry down sufficiently to s permit harvest is the desirability of having maximal drying take place in the field to minimize the amount of energy required for additional drying post-harvest. Also the more readily the 8 grain can dry down, the more time there is available for growth and kernel fill. Genes that 00 influence maturity and/or dry down can be identified and introduced into plant lines using transformation techniques to create new varieties adapted to different growing locations or the Ssame growing location but having improved yield to moisture ratio at harvest. Expression of genes that are involved in regulation of plant development may be especially useful, the liguleless and rough sheath genes that have been identified in plants.
Genes may be introduced into plants that would improve standability and other plant growth characteristics. For example, expression of novel genes which confer stronger stalks, improved root systems, or prevent or reduce ear droppage would be of great value to the corn farmer. Introduction and expression of genes that increase the total amount of photoassimilate available by, for example, increasing light distribution and/or interception would be advantageous. In addition the expression of genes that increase the efficiency of photosynthesis and/or the leaf canopy would further increase gains in productivity. Such approaches would allow for increased plant populations in the field.
Delay of late season vegetative senescence would increase the flow of assimilate into the grain and thus increase yield. Overexpression of genes within plants that are associated with "stay green" or the expression of any gene that delays senescence would achieve be advantageous. For example, a non-yellowing mutant has been identified in Festuca pratensis (Davies et al., 1990). Expression of this gene as well as others may prevent premature breakdown of chlorophyll and thus maintain canopy function.
The ability to utilize available nutrients and minerals may be a limiting factor in growth of many plants. It is proposed that it would be possible to alter nutrient uptake, tolerate pH extremes, mobilization through the plant, storage pools, and availability for metabolic activities by the introduction of novel genes. These modifications would allow a plant to more -94- Case S-500OI5A16/7"8IAD Sefficiently utilize available nutrients. It is contemplated that an increase in the activity of, for example, an enzyme that is normally present in the plant and involved in nutrient utilization Swould increase the availability of a nutrient. An example of such an enzyme would be phytase.
SIt is also contemplated that expression of a novel gene may make a nutrient source available that was previously not accessible, an enzyme that releases a component of nutrient value from a more complex molecule, perhaps a macromolecule.
SMale sterility is useful in the production of hybrid seed. It is proposed that male 8 sterility may be produced through expression of novel genes. For example, it has been shown 00 that expression of genes that encode proteins that interfere with development of the male inflorescence and/or gametophyte result in male sterility. Chimeric ribonuclease genes that Sexpress in the anthers of transgenic tobacco and oilseed rape have been demonstrated to lead to male sterility (Mariani et al, 1990).
For example, a number of mutations were discovered in maize that confer cytoplasmic male sterility. One mutation in particular, referred to as T cytoplasm, also correlates with sensitivity to Southern corn leaf blight. A DNA sequence, designated TURF-13 (Levings, 1990), was identified that correlates with T cytoplasm. It would be possible through the introduction of TURF-13 via transformation to separate male sterility from disease sensitivity.
As it is necessary to be able to restore male fertility for breeding purposes and for grain production, it is proposed that genes encoding restoration of male fertility may also be introduced.
Introduction of genes encoding traits that can be selected against may be useful for eliminating undesirable linked genes. When two or more genes are introduced together by cotransformation, the genes will be linked together on the host chromosome. For example, a gene encoding a Bt gene that confers insect resistance on the plant may be introduced into a plant together with a bar gene that is useful as a selectable marker and confers resistance to the herbicide Ignite® on the plant. However, it may not be desirable to have an insect resistant plant that is also resistant to the herbicide Ignite®. It is proposed that one could also introduce an antisense bar gene that is expressed in those tissues where one does not want expression of the bar gene, in whole plant parts. Hence, although the bar gene is expressed and is useful as a selectable marker, it is not useful to confer herbicide resistance on the whole plant. The bar antisense gene is a negative selectable marker.
Case S-50015 A16/78/NAD 0 Negative selection is necessary in order to screen a population of transformants for rare homologous recombinants generated through gene targeting. For example, a homologous Srecombinant may be identified through the inactivation of a gene that was previously expressed t" in that cell. The antisense gene to neomycin phosphotransferase II (nptll) has been investigated as a negative selectable marker in tobacco (Nicotiana tabacum) and Arabidopsis s thaliana (Xiang and Guerra, 1993). In this example both sense and antisense nptll genes are r introduced into a plant through transformation and the resultant plants are sensitive to the 0 antibiotic kanamycin. An introduced gene that integrates into the host cell chromosome at the 0 site of the antisense nptll gene, and inactivates the antisense gene, will make the plant resistant to kanamycin and other aminoglycoside antibiotics. Therefore, rare site specific recombinants may be identified by screening for antibiotic resistance. Similarly, any gene, native to the plant or introduced through transformation, that when inactivated confers resistance to a compound, may be useful as a negative selectable marker.
It is contemplated that negative selectable markers may also be useful in other ways.
One application is to construct transgenic lines in which one could select for transposition to unlinked sites. In the process of tagging it is most common for the transposable element to move to a genetically linked site on the same chromosome. A selectable marker for recovery of rare plants in which transposition has occurred to an unlinked locus would be useful. For example, the enzyme cytosine deaminase may be useful for this purpose (Stouggard, 1993). In the presence of this enzyme the compound 5-fluorocytosine is converted to 5-fluoruracil which is toxic to plant and animal cells. If a transposable element is linked to the gene for the enzyme cytosine deaminase, one may select for transposition to unlinked sites by selecting for transposition events in which the resultant plant is now resistant to 5-fluorocytosine. The parental plants and plants containing transpositions to linked sites will remain sensitive to fluorocytosine. Resistance to 5-fluorocytosine is due to loss of the cytosine deaminase gene through genetic segregation of the transposable element and the cytosine deaminase gene.
Other genes that encode proteins that render the plant sensitive to a certain compound will also be useful in this context. For example, T-DNA gene 2 from Agrobacterium tumefaciens encodes a protein that catalyzes the conversion of alpha-naphthalene acetamide (NAM) to alpha-napthalene acetic acid (NAA) renders plant cells sensitive to high concentrations of NAM (Depicker et al., 1988).
-96- Case S-50015A/16/78/NAD It is also contemplated that negative selectable markers may be useful in the Sconstruction of transposon tagging lines. For example, by marking an autonomous i§ transposable element such as Ac, Master Mu, or En/Spn with a negative selectable marker, one could select for transformants in which the autonomous element is not stably integrated into the genome. This would be desirable, for example, when transient expression of the autonomous element is desired to activate in trans the transposition of a defective transposable element, such as Ds, but stable integration of the autonomous element is not desired. The presence of the autonomous element may not be desired in order to stabilize the defective OO element, prevent it from further transposing. However, it is proposed that if stable 0 integration of an autonomous transposable element is desired in a plant the presence of a negative selectable marker may make it possible to eliminate the autonomous element during the breeding process. DNA may be introduced into plants for the purpose of expressing RNA transcripts that function to affect plant phenotype yet are not translated into protein. Two examples are antisense RNA and RNA with ribozyme activity. Both may serve possible functions in reducing or eliminating expression of native or introduced plant genes.
Genes may be constructed or isolated, which when transcribed, produce antisense RNA that is complementary to all or part(s) of a targeted messenger RNA(s). The antisense RNA reduces production of the polypeptide product of the messenger RNA. The polypeptide product may be any protein encoded by the plant genome. The aforementioned genes will be referred to as antisense genes. An antisense gene may thus be introduced into a plant by transformation methods to produce a novel transgenic plant with reduced expression of a selected protein of interest. For example, the protein may be an enzyme that catalyzes a reaction in the plant. Reduction of the enzyme activity may reduce or eliminate products of the reaction which include any enzymatically synthesized compound in the plant such as fatty acids, amino acids, carbohydrates, nucleic acids and the like. Alternatively, the protein may be a storage protein, such as a zein, or a structural protein, the decreased expression of which may lead to changes in seed amino acid composition or plant morphological changes respectively. The possibilities cited above are provided only by way of example and do not represent the full range of applications.
Genes may also be constructed or isolated, which when transcribed produce RNA enzymes, or ribozymes, which can act as endoribonucleases and catalyze the cleavage of RNA -97- Case S-50015A/16/78fNAD O molecules with selected sequences. The cleavage of selected messenger RNA's can result in C the reduced production of their encoded polypeptide products. These genes may be used to Sprepare novel transgenic plants which possess them. The transgenic plants may possess Sreduced levels of polypeptides including but not limited to the polypeptides cited above that may be affected by antisense
RNA.
It is also possible that genes may be introduced to produce novel transgenic plants Swhich have reduced expression of a native gene product by a mechanism of cosuppression. It 8 has been demonstrated in tobacco, tomato, and petunia (Goring et al, 1991; Smith et al., 1990; 00 Napoli et al., 1990; van der Krol et al., 1990) that expression of the sense transcript of a native gene will reduce or eliminate expression of the native gene in a manner similar to that observed Sfor antisense genes. The introduced gene may encode all or part of the targeted native protein but its translation may not be required for reduction of levels of that native protein.
For example, DNA elements including those of transposable elements such as Ds, Ac, or Mu, may be inserted into a gene and cause mutations. These DNA elements may be inserted in order to inactivate (or activate) a gene and thereby "tag" a particular trait. In this instance the transposable element does not cause instability of the tagged mutation, because the utility of the element does not depend on its ability to move in the genome. Once a desired trait is tagged, the introduced DNA sequence may be used to clone the corresponding gene, using the introduced DNA sequence as a PCR primer together with PCR gene cloning techniques (Shapiro, 1983; Dellaporta et al., 1988). Once identified, the entire gene(s) for the particular trait, including control or regulatory regions where desired may be isolated, cloned and manipulated as desired. The utility of DNA elements introduced into an organism for purposed of gene tagging is independent of the DNA sequence and does not depend on any biological activity of the DNA sequence, transcription into RNA or translation into protein. The sole function of the DNA element is to disrupt the DNA sequence of a gene.
It is contemplated that unexpressed DNA sequences, including novel synthetic sequences could be introduced into cells as proprietary "labels" of those cells and plants and seeds thereof. It would not be necessary for a label DNA element to disrupt the function of a gene endogenous to the host organism, as the sole function of this DNA would be to identify the origin of the organism. For example, one could introduce a unique DNA sequence into a plant and tlis DNA element would identify all cells, plants, and progeny of these cells as -98- Case S-50015A/16/78/NAD 0 having arisen from that labeled source. It is proposed that inclusion of label DNAs would C enable one to distinguish proprietary germplasm or germplasm derived from such, from unlabeled gen-nplasm.
I Another possible element which may be introduced is a matrix attachment region element (MAR), such as the chicken lysozyme A element (Stief et al., 1989), which can be positioned around an expressible gene of interest to effect an increase in overall expression of S the gene and diminish position dependant effects upon incorporation into the plant genome S (Stief et al., 1989; Phi-Van et al., 1990).
00 SPlant species may be transformed with the DNA construct of the present invention by the DNA-mediated transformation of plant cell protoplasts and subsequent regeneration of the plant from the transformed protoplasts in accordance with procedures well known in the art.
Any plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a vector of the present invention. The term "organogenesis," as used herein, means a process by which shoots and roots are developed sequentially from meristematic centers; the term "embryogenesis," as used herein, means a process by which shoots and roots develop together in a concerted fashion (not sequentially), whether from somatic cells or gametes. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue apical meristems, axillary buds, and root meristems), and induced meristem tissue cotyledon meristem and ultilane meristem).
Plants of the present invention may take a variety of forms. The plants may be chimeras of transformed cells and non-transformed cells; the plants may be clonal transformants all cells transformed to contain the expression cassette); the plants may comprise grafts of transformed and untransformed tissues a transformed root stock grafted to an untransformed scion in citrus species). The transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, first generation (or TI) transformed plants may be selfed to give homozygous second generation (or T2) transformed plants, and the T2 plants further -99- Case S-50015A/16/78/NAD 00 0 propagated through classical breeding techniques. A dominant selectable marker (such as npt CN 11) can be associated with the expression cassette to assist in breeding.
Thus, the present invention provides a transformed (transgenic) plant cell, in planra or Sexplanta, including a transformed plastid or other organelle, nucleus, mitochondria or chloroplast. The present invention may be used for transformation of any plant species, including, but not limited to, cells from corn (Zea mays), Brassica sp. B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum 00 vulgare), millet pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), S foxtail millet (Seraria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsuium), sweet potato (Ipomoea baratus), cassava (Manihor esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Cirrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea ultilane), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia inregrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, duckweed (Lemna), barley, vegetables, ornamentals, and conifers.
Duckweed (Lemna, see WO 00/07210) includes members of the family Lemnaceae.
There are known four genera and 34 species of duckweed as follows: genus Lemna (L.
aequinocialis, L. disperma, L. ecuadoriensis, L. gibba, L. japonica, L. minor, L. miniscula, L. obscura, L. perpusilla, L. tenera, L. trisulca, L.turionifera, L. valdiviana); genus Spirodela intermedia, S. polyrrhiza, S. puncrata); genus Woffia (Wa. Angusta, Wa. Arrhiza, Wa.
Australina, Wa. Borealis, Wa. Brasiliensis, Wa. Columbiana, Wa. Elongata, Wa. Globosa, Wa. Microscopica, Wa. Neglecta) and genus Wofiella (WI. ultila, WI. ultilanen, WI.
gladiata, WI. ullila, WI. lingulata, WI. repunda, W1. rotunda, and WI. neotropica). Any other genera or species of Lemnaceae, if they exist, are also aspects of the present invention.
Lemna gibba, Lemna minor, and Lemna miniscula are preferred, with Lemna minor and Lemna miniscula being most preferred. Lemna species can be classified using the taxonomic 100- Case S-50015A/16/78/NAD O scheme described by Landolt, Biosystematic Investigation on the Family of Duckweeds: The Sfamily of Lemnaceae A Monograph Study. Geobotanisches Institut ETH, Stiftung Rubel, Zurich (1986)).
Vegetables within the scope of the invention include tomatoes (Lycopersicon esculentum), lettuce Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber sativus), cantaloupe cantalupensis), and musk melon melo).
O Ornamentals include azalea (Rhododendron spp.), hydrangea (Macrophylla hydrangea), OO hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata), Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga ultilane); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis). Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are not limited to, Arachis, peanuts, Vicia, crown vetch, hairy vetch, adzuki bean, mung bean, and chickpea, Lupinus, lupine, trifolium, Phaseolus, common bean and lima bean, Pisum, field bean, Melilotus, clover, Medicago, alfalfa, Lotus, trefoil, lens, lentil, and false indigo. Preferred forage and turf grass for use in the methods of the invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and redtop.
Other plants within the scope of the invention include Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, 101 Case S5001I5A/16/78/NAD 0 pepper, cauliflower, Brassica, broccoli, cabbage, ultilan sprouts, onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, Sultilane, and zucchini.
i" Ornamental plants within the scope of the invention include impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saint Paulia, Agertum, s Amaranthus, Antihirrhinum, Aquilegia, Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, r Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossos, 0 and Zinnia. Other plants within the scope of the invention are shown in Table 1 (above).
00 Preferably, transgenic plants of the present invention are crop plants and in particular cereals (for example, corn, alfalfa, sunflower, rice, Brassica, canola, soybean, barley, soybean, sugarbeet, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, etc.), and even more preferably corn, rice and soybean.
Transformation of plants can be undertaken with a single DNA molecule or multiple DNA molecules co-transformation), and both these techniques are suitable for use with the expression cassettes of the present invention. Numerous transformation vectors are available for plant transformation, and the expression cassettes of this invention can be used in conjunction with any such vectors. The selection of vector will depend upon the preferred transformation technique and the target species for transformation.
A variety of techniques are available and known to those skilled in the art for introduction of constructs into a plant cell host. These techniques generally include transformation with DNA employing A. tumefaciens or A. rhizogenes as the transforming agent, liposomes, PEG precipitation, electroporation, DNA injection, direct DNA uptake, microprojectile bombardment, particle acceleration, and the like (See, for example, EP 295959 and EP 138341) (see below). However, cells other than plant cells may be transformed with the expression cassettes of the invention. The general descriptions of plant expression vectors and reporter genes, and Agrobacterium and Agrobacterium-mediated gene transfer, can be found in Gruber et al. (1993).
Expression vectors containing genomic or synthetic fragments can be introduced into protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced into intact tissue. General methods of culturing plant tissues are provided for example by Maki et al., (1993); and by Phillips et al. (1988). Preferably, expression vectors are introduced into 102- Case S-50015A116/78INAD O maize or other plant tissues using a direct gene transfer method such as microprojectilemediated delivery, DNA injection, electroporation and the like. More preferably expression Svectors are introduced into plant tissues using the microprojectile media delivery with the biolistic device. See, for example, Tomes et al. (1995). The vectors of the invention can not only be used for expression of structural genes but may also be used in exon-trap cloning, or promoter trap procedures to detect differential gene expression in varieties of tissues, (Lindsey et al., 1993; Auch Reth et al.).
SIt is particularly preferred to use the binary type vectors of Ti and Ri plasmids of 00 Agrobacterium spp. Ti-derived vectors transform a wide variety of higher plants, including monocotyledonous and dicotyledonous plants, such as soybean, cotton, rape, tobacco, and rice S(Pacciotti et al., 1985: Byrne et al., 1987; Sukhapinda et al., 1987; Park et al., 1985: Hiei et al., 1994). The use of T-DNA to transform plant cells has received extensive study and is amply described (EP 120516; Hoekema, 1985; Knauf, et al., 1983; and An et al., 1985). For introduction into plants, the chimeric genes of the invention can be inserted into binary vectors as described in the examples.
Other transformation methods are available to those skilled in the art, such as direct uptake of foreign DNA constructs (see EP 295959), techniques of electroporation (Fromm et al., 1986) or high velocity ballistic bombardment with metal particles coated with the nucleic acid constructs (Kline et al., 1987, and U.S. Patent No. 4,945,050). Once transformed, the cells can be regenerated by those skilled in the art. Of particular relevance are the recently described methods to transform foreign genes into commercially important crops, such as rapeseed (De Block et al., 1989), sunflower (Everett et al., 1987), soybean (McCabe et al., 1988; Hinchee et al., 1988; Chee et al., 1989; Christou et al., 1989; EP 301749), rice (Hiei et al., 1994), and corn (Gordon Kamm et al., 1990; Fromm et al., 1990).
Those skilled in the art will appreciate that the choice of method might depend on the type of plant, monocotyledonous or dicotyledonous, targeted for transformation. Suitable methods of transforming plant cells include, but are not limited to, microinjection (Crossway et al., 1986), electroporation (Riggs et al., 1986), Agrobacterium-mediated transformation (Hinchee et al., 1988), direct gene transfer (Paszkowski et al., 1984), and ballistic particle acceleration using devices available from Agracetus, Inc., Madison, Wis. And BioRad, Hercules, Calif. (see, for example, Sanford et al., U.S. Pat. No. 4,945,050; and McCabe et al., 103- Case S-50015A/16/78/NAD 0 1988). Also see, Weissinger et al., 1988; Sanford et al., 1987 (onion); Christou et al., 1988 I (soybean); McCabe et al., 1988 (soybean); Datta et al., 1990 (rice); Klein et al., 1988 (maize); SKlein et al., 1988 (maize); Klein et al., 1988 (maize); Fromm et al., 1990 (maize); and Gordon- Kamm et al., 1990 (maize); Svab et al., 1990 (tobacco chloroplast); Koziel et al., 1993 (maize); Shimamoto et al., 1989 (rice); Christou et al., 1991 (rice); European Patent SApplication EP 0 332 581 (orchardgrass and other Pooideae); Vasil et al., 1993 (wheat); SWeeks et al., 1993 (wheat). In one embodiment, the protoplast transformation method for 8 maize is employed (European Patent Application EP 0 292 435, U. S. Pat. No. 5,350,689).
00 In another embodiment, a nucleotide sequence of the present invention is directly transformed into the plastid genome. Plastid transformation technology is extensively described in U.S. Patent Nos. 5,451,513, 5,545,817, and 5,545,818, in PCT application no.
WO 95/16783, and in McBride et al., 1994. The basic technique for chloroplast transformation involves introducing regions of cloned plastid DNA flanking a selectable marker together with the gene of interest into a suitable target tissue, using biolistics or protoplast transformation calcium chloride or PEG mediated transformation). The 1 to 1.5 kb flanking regions, termed targeting sequences, facilitate orthologous recombination with the plastid genome and thus allow the replacement or modification of specific regions of the plastome. Initially, point mutations in the chloroplast 16S rRNA and rpsl2 genes conferring resistance to spectinomycin and/or streptomycin are utilized as selectable markers for transformation (Svab et al., 1990; Staub et al., 1992). This resulted in stable homoplasmic transformants at a frequency of approximately one per 100 bombardments of target leaves.
The presence of cloning sites between these markers allowed creation of a plastid targeting vector for introduction of foreign genes (Staub et al., 1993). Substantial increases in transformation frequency are obtained by replacement of the recessive rRNA or r-protein antibiotic resistance genes with a dominant selectable marker, the bacterial aadA gene encoding the spectinomycin-detoxifying enzyme aminoglycoside-3N-adenyltransferase (Svab et al., 1993). Other selectable markers useful for plastid transformation are known in the art and encompassed within the scope of the invention. Typically, approximately 15-20 cell division cycles following transformation are required to reach a homoplastidic state. Plastid expression, in which genes are inserted by orthologous recombination into all of the several thousand 104- Case S-50015A116/78/NAD O copies of the circular plastid genome present in each plant cell, takes advantage of the N enormous copy number advantage over nuclear-expressed genes to permit expression levels Sthat can readily exceed 10% of the total soluble plant protein. In a preferred embodiment, a t nucleotide sequence of the present invention is inserted into a plastid targeting vector and transformed into the plastid genome of a desired plant host. Plants homoplastic for plastid genomes containing a nucleotide sequence of the present invention are obtained, and are preferentially capable of high expression of the nucleotide sequence.
SAgrobacterium tumefaciens cells containing a vector comprising an expression cassette OO of the present invention, wherein the vector comprises a Ti plasmid, are useful in methods of making transformed plants. Plant cells are infected with an Agrobacterium tumefaciens as described above to produce a transformed plant cell, and then a plant is regenerated from the transformed plant cell. Numerous Agrobacterium vector systems useful in carrying out the present invention are known.
For example, vectors are available for transformation using Agrobacterium rumefaciens. These typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, 1984). In one preferred embodiment, the expression cassettes of the present invention may be inserted into either of the binary vectors pCIB200 and pCIB2001 for use with Agrobacterium. These vector cassettes for Agrobacterium-mediated transformation wear constructed in the following manner. PTJS75kan was created by Narl digestion of (Schmidhauser Helinski, 1985) allowing excision of the tetracycline-resistance gene, followed by insertion of an Accl fragment from pUC4K carrying an NPTII (Messing Vierra, 1982; Bevan et al., 1983; McBride et al., 1990). Xhol linkers were ligated to the EcoRV fragment of pCIB7 which contains the left and right T-DNA borders, a plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et al., 1987), and the Xholdigested fragment was cloned into Sail-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19). PCIB200 contains the following unique polylinker restriction sites: EcoRI, SstI, Kpnl, BgllI, XbaI, and Sail. The plasmid pCIB2001 is a derivative of pCIB200 which was created by the insertion into the polylinker of additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, Sstl, Kpnl, Bgil, Xbal, Sail, Mlul, Bcll, Avrll, Apal, Hpal, and Stul. PCIB2001, in addition to containing these unique restriction sites also has plant and bacterial kanamycin selection, left and right T-DNA borders for 105- Case S-50015A/16/78/NAD O Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization Sbetween E. coli and other hosts, and the OriT and OriV functions also from RK2. The SpCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing their own regulatory signals.
An additional vector useful for Agrobacterium-mediated transformation is the binary vector pC1B 10, which contains a gene encoding kanamycin resistance for selection in plants, T-DNA right and left border sequences and incorporates sequences from the wide host- range g plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its construction is OO described by Rothstein et al., 1987. Various derivatives of pCIBl0 have been constructed which incorporate the gene for hygromycin B phosphotransferase described by Gritz et al., 1983. These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin (pCIB715, pCIB717).
Methods using either a form of direct gene transfer or Agrobacterium-mediated transfer usually, but not necessarily, are undertaken with a selectable marker which may provide resistance to an antibiotic kanamycin, hygromycin or methotrexate) or a herbicide phosphinothricin). The choice of selectable marker for plant transformation is not, however, critical to the invention.
For certain plant species, different antibiotic or herbicide selection markers may be preferred. Selection markers used routinely in transformation include the nptll gene which confers resistance to kanamycin and related antibiotics (Messing Vierra, 1982; Bevan et al., 1983), the bar gene which confers resistance to the herbicide phosphinothricin (White et al., 1990, Spencer et al., 1990), the hph gene which confers resistance to the antibiotic hygromycin (Blochinger Diggelmann), and the dhfr gene, which confers resistance to methotrexate (Bourouis et al., 1983).
One such vector useful for direct gene transfer techniques in combination with selection by the herbicide Basta (or phosphinothricin) is pCIB3064. This vector is based on the plasmid pCIB246, which comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV 35S transcriptional terminator and is described in the PCT published application WO 93/07278, herein incorporated by reference. One gene useful for conferring resistance to phosphinothricin is the bar gene from Streptomyces viridochromogenes 106- Case S-50015A/16/78/NAD O (Thompson et al., 1987). This vector is suitable for the cloning of plant expression cassettes C containing their own regulatory signals.
SAn additional transformation vector is pSOG35 which utilizes the E. coli gene Sdihydrofolate reductase (DHFR) as a selectable marker conferring resistance to methotrexate.
PCR was used to amplify the 35S promoter (about 800 bp), intron 6 from the maize Adhl gene (about 550 bp) and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250 bp fragment encoding the E. coli dihydrofolate reductase type II gene was also amplified by PCR and these two PCR fragments were assembled with a SacI-PstI fragment from pBI221 00 (Clontech) which comprised the pUC19 vector backbone and the nopaline synthase terminator.
S Assembly of these fragments generated pSOG19 which contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus check (MCMV) generated the vector pSOG35. pSOG19 and carry the pUC-derived gene for ampicillin resistance and have HindIll, SphI, PstI and EcoRI sites available for the cloning of foreign sequences.
Transgenic plant cells are then placed in an appropriate selective medium for selection of transgenic cells which are then grown to callus. Shoots are grown from callus and plantlets generated from the shoot by growing in rooting medium. The various constructs normally will be joined to a marker for selection in plant cells. Conveniently, the marker may be resistance to a biocide (particularly an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, chloramphenicol, herbicide, or the like). The particular marker used will allow for selection of transformed cells as compared to cells lacking the DNA which has been introduced.
Components of DNA constructs including transcription cassettes of this invention may be prepared from sequences which are native (endogenous) or foreign (exogenous) to the host.
By "foreign" it is meant that the sequence is not found in the wild-type host into which the construct is introduced. Heterologous constructs will contain at least one region which is not native to the gene from which the transcription-initiation-region is derived.
To confirm the presence of the transgenes in transgenic cells and plants, a variety of assays may be performed. Such assays include, for example, "molecular biological" assays well known to those of skill in the art, such as Southern and Northern blotting, in situ hybridization and nucleic acid-based amplification methods such as PCR or RT-PCR; "biochemical" assays, 107 Case S-50015A/16/78/NAD 0 such as detecting the presence of a protein product, by immunological means (ELISAs and Western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole regenerated plant, for disease or pest resistance.
DNA may be isolated from cell lines or any plant parts to determine the presence of the Spreselected nucleic acid segment through the use of techniques well known to those skilled in the art. Note that intact sequences will not always be present, presumably due to S rearrangement or deletion of sequences in the cell.
00 The presence of nucleic acid elements introduced through the methods of this invention may be determined by polymerase chain reaction (PCR). Using this technique discreet fragments of nucleic acid are amplified and detected by gel electrophoresis. This type of analysis permits one to determine whether a preselected nucleic acid segment is present in a stable transformant, but does not prove integration of the introduced preselected nucleic acid segment into the host cell genome. In addition, it is not possible using PCR techniques to determine whether transformants have exogenous genes introduced into different sites in the genome, whether transformants are of independent origin. It is contemplated that using PCR techniques it would be possible to clone fragments of the host genomic DNA adjacent to an introduced preselected DNA segment.
Positive proof of DNA integration into the host genome and the independent identities of transformants may be determined using the technique of Southern hybridization. Using this technique specific DNA sequences that were introduced into the host genome and flanking host DNA sequences can be identified. Hence the Southern hybridization pattern of a given transformant serves as an identifying characteristic of that transformant. In addition it is possible through Southern hybridization to demonstrate the presence of introduced preselected DNA segments in high molecular weight DNA, confirm that the introduced preselected DNA segment has been integrated into the host cell genome. The technique of Southern hybridization provides information that is obtained using PCR, the presence of a preselected DNA segment, but also demonstrates integration into the genome and characterizes each individual transformant.
108- Case S-50015A/16/78[NAD It is contemplated that using the techniques of dot or slot blot hybridization which are ,I modifications of Southern hybridization techniques one could obtain the same information that is derived from PCR, the presence of a preselected DNA segment.
t" Both PCR and Southern hybridization techniques can be used to demonstrate transmission of a preselected DNA segment to progeny. In most instances the characteristic s Southern hybridization pattern for a given transformant will segregate in progeny as one or 1r- more Mendelian genes (Spencer et al., 1992); Laursen et al., 1994) indicating stable inheritance 8 of the gene. The nonchimeric nature of the callus and the parental transformants (Ro) was 00 suggested by germline transmission and the identical Southern blot hybridization patterns and intensities of the transforming DNA in callus, Ro plants and R, progeny that segregated for the transformed gene.
Whereas DNA analysis techniques may be conducted using DNA isolated from any part of a plant, RNA may only be expressed in particular cells or tissue types and hence it will be necessary to prepare RNA for analysis from these tissues. PCR techniques may also be used for detection and quantitation of RNA produced from introduced preselected DNA segments.
In this application of PCR it is first necessary to reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, and then through the use of conventional
PCR
techniques amplify the DNA. In most instances PCR techniques, while useful, will not demonstrate integrity of the RNA product. Further information about the nature of the RNA product may be obtained by Northern blotting. This technique will demonstrate the presence of an RNA species and give information about the integrity of that RNA. The presence or absence of an RNA species can also be determined using dot or slot blot Northern hybridizations. These techniques are modifications of Northern blotting and will only demonstrate the presence or absence of an RNA species.
While Southern blotting and PCR may be used to detect the preselected DNA segment in question, they do not provide information as to whether the preselected DNA segment is being expressed. Expression may be evaluated by specifically identifying the protein products of the introduced preselected DNA segments or evaluating the phenotypic changes brought about by their expression.
Assays for the production and identification of specific proteins may make use of physical-chemical, structural, functional, or other properties of the proteins. Unique physical- 109- Case S-50015A/16/78/NAD 00 Schemical or structural properties allow the proteins to be separated and identified by electrophoretic procedures, such as native or denaturing gel electrophoresis or isoelectric Sfocusing, or by chromatographic techniques such as ion exchange or gel exclusion Schromatography. The unique structures of individual proteins offer opportunities for use of specific antibodies to detect their presence in formats such as an ELISA assay. Combinations of approaches may be employed with even greater specificity such as Western blotting in which antibodies are used to locate individual gene products that have been separated by 8 electrophoretic techniques. Additional techniques may be employed to absolutely confirm the 00 identity of the product of interest such as evaluation by amino acid sequencing following 8 purification. Although these are among the most commonly employed, other procedures may be additionally used.
Assay procedures may also be used to identify the expression of proteins by their functionality, especially the ability of enzymes to catalyze specific chemical reactions involving specific substrates and products. These reactions may be followed by providing and quantifying the loss of substrates or the generation of products of the reactions by physical or chemical procedures. Examples are as varied as the enzyme to be analyzed.
Very frequently the expression of a gene product is determined by evaluating the phenotypic results of its expression. These assays also may take many forms including but not limited to analyzing changes in the chemical composition, morphology, or physiological properties of the plant. Morphological changes may include greater stature or thicker stalks.
Most often changes in response of plants or plant parts to imposed treatments are evaluated under carefully controlled conditions termed bioassays.
Once an expression cassette of the invention has been transformed into a particular plant species, it may be propagated in that species or moved into other varieties of the same species, particularly including commercial varieties, using traditional breeding techniques. Particularly preferred plants of the invention include the agronomically important crops listed above. The genetic properties engineered into the transgenic seeds and plants described above are passed on by sexual reproduction and can thus be maintained and propagated in progeny plants. The present invention also relates to a transgenic plant cell, tissue, organ, seed or plant part obtained from the transgenic plant. Also included within the invention are transgenic -110- Case S-50015A/16/78/NAD 0 descendants of the plant as well as transgenic plant cells, tissues, organs, seeds and plant parts CN obtained from the descendants.
Preferably, the expression cassette in the transgenic plant is sexually transmitted. In one Spreferred embodiment, the coding sequence is sexually transmitted through a complete normal sexual cycle of the RO plant to the RI generation. Additionally preferred, the expression cassette is expressed in the cells, tissues, seeds or plant of a transgenic plant in an amount that is different than the amount in the cells, tissues, seeds or plant of a plant which only differs in O that the expression cassette is absent.
00 The transgenic plants produced herein are thus expected to be useful for a variety of commercial and research purposes. Transgenic plants can be created for use in traditional agriculture to possess traits beneficial to the grower agronomic traits such as resistance to water deficit, pest resistance, herbicide resistance or increased yield), beneficial to the consumer of the grain harvested from the plant improved nutritive content in human food or animal feed; increased vitamin, amino acid, and antioxidant content; the production of antibodies (passive immunization) and nutriceuticals), or beneficial to the food processor improved processing traits). In such uses, the plants are generally grown for the use of their grain in human or animal foods. Additionally, the use of root-specific promoters in transgenic plants can provide beneficial traits that are localized in the consumable (by animals and humans) roots of plants such as carrots, parsnips, and beets. However, other parts of the plants, including stalks, husks, vegetative parts, and the like, may also have utility, including use as part of animal silage or for ornamental purposes. Often, chemical constituents oils or starches) of maize and other crops are extracted for foods or industrial use and transgenic plants may be created which have enhanced or modified levels of such components.
Transgenic plants may also find use in the commercial manufacture of proteins or other molecules, where the molecule of interest is extracted or purified from plant parts, seeds, and the like. Cells or tissue from the plants may also be cultured, grown in vitro, or fermented to manufacture such molecules.
The transgenic plants may also be used in commercial breeding programs, or may be crossed or bred to plants of related crop species. Improvements encoded by the expression cassette may be transferred, from maize cells to cells of other species, by protoplast fusion.
I1 Case S-50015AI16/79INAD 0 The transgenic plants may have many uses in research or breeding, including creation of new mutant plants through insertional mutagenesis, in order to identify beneficial mutants that might later be created by traditional mutation and selection. An example would be the introduction of a recombinant DNA sequence encoding a transposable element that may be used for generating genetic variation. The methods of the invention may also be used to create plants having unique "signature sequences" or other marker sequences which can be used to r> identify proprietary lines or varieties.
SThus, the transgenic plants and seeds according to the invention can be used in plant oo breeding which aims at the development of plants with improved properties conferred by the expression cassette, such as tolerance of drought, disease, or other stresses. The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate descendant plants. Depending on the desired properties different breeding measures are taken. The relevant techniques are well known in the art and include but are not limited to hybridization, inbreeding, backcross breeding, Eultilane breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization techniques also include the sterilization of plants to yield male or female sterile plants by mechanical, chemical or biochemical means. Cross pollination of a male sterile plant with pollen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines.
Thus, the transgenic seeds and plants according to the invention can be used for the breeding of improved plant lines which for example increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow to dispense with said methods due to their modified genetic properties. Alternatively new crops with improved stress tolerance can be obtained which, due to their optimized genetic "equipment", yield harvested product of better quality than products which were not able to tolerate comparable adverse developmental conditions.
The invention also provides a computer readable medium having stored thereon a data structure containing nucleic acid sequences having at least 70% sequence identity to a nucleic acid sequence selected from those listed in SEQ ID Nos: 1-339, 358-366, 441-515, 5 17-529, 536-579 and 601-773, as well as complementary, ortholog, and variant sequences thereof.
112- Case S-50015A/16/78/NAD o Storage and use of nucleic acid sequences on a computer readable medium is well known in N the art. (See for example U.S. Patent Nos. 6,023,659; 5,867,402; 5,795,716) Examples of such medium include, but are not limited to, magnetic tape, optical disk, CD-ROM, random access memory, volatile memory, non-volatile memory and bubble memory. Accordingly, the nucleic acid sequences contained on the computer readable medium may be compared through use of a module that receives the sequence information and compares it to other sequence information.
Examples of other sequences to which the nucleic acid sequences of the invention may be g compared include those maintained by the National Center for Biotechnology Information 00 (NCBI)(http://www.ncbi.nlm.nih.gov/) and the Swiss Protein Data Bank. A computer is an example of such a module that can read and compare nucleic acid sequence information.
SAccordingly, the invention also provides the method of comparing a nucleic acid sequence of the invention to another sequence. For example, a sequence of the invention may be submitted to the NCBI for a Blast search as described herein where the sequence is compared to sequence information contained within the NCBI database and a comparison is returned. The invention also provides nucleic acid sequence information in a computer readable medium that allows the encoded polypeptide to be optimized for a desired property. Examples of such properties include, but are not limited to, increased or decreased: thermal stability, chemical stability, hydrophylicity, hydrophobicity, and the like. Methods for the use of computers to model polypeptides and polynucleotides having altered activities are well known in the art and have been reviewed. (Lesyng et al., 1993; Surles et al., 1994; Koehl et al., 1996; Rossi et al., 2001).
The invention will be further described by the following non-limiting examples.
EXAMPLES
Example 1 GeneChip@ Standard Protocol Quantitation of total RNA Total RNA from plant tissue is extracted and quantified.
1. Quantify total RNA using GeneQuant O1D260=40 ug RNA/ml; A26o/A2so=1.
9 to about 2.1 -113 Case S-50015A116/78/NAD 00 2. Run gel to check the integrity and purity of the extracted RNA Synthesis of double-stranded cDNA Gibco/BRL SuperScript Choice System for cDNA Synthesis (Cat1#1B090-019) was employed to prepare cDNAs. T7-(dT) 24 oligonu cleot ides were prepared and purified by HPLC. GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG-(dT)24- 3
SEQ
ID NO:584).
Step 1. Primer hybridization: 00 Incubate at 70'C for 10 minutes Quick spin and put on ice briefly Step 2. Temperature adjustment: Incubate at 42'C for 2 minutes Step 3. First strand snthesis: DEPC-water- 1 ul RNA (10 ug final)-10 ul T7=(dT) 24 Primer (100 pmol final)-1I ul pmol strand cDNA buffer-4 ul 0. 1MN DTT (10 mM final)- 2 ul mM dNTP mix (500 uM final)-1I ul Superscript 11 RT 200 Uful- 1 ul Total of 20 ul Mix Well Incubate at 42'C for I hour Step 4. Second strand synthesis: Place reactions on ice, quick spin DEPC-water- 91 ul 2 "d strand cDNA buffer- 30 ul mlv dNTP mix (250 mM final) 3 ul E. colt DNA igase (10 U/ul)- I ul E. colt DNA polymerase 1-10 U/ul- 4 ul 114- Case S-50015A/16/78/NAD 00 O RnaseH 2U/ul-1 ul C T4 DNA polymerase 5 U/ul-2 ul M EDTA (0.5 M final)-10 ul Total 162 ul Mix/spin down/incubate 16'C for 2 hours Step 5. Completing the reaction: Incubate at 16 0 C for 5 minutes 00 Purification of double stranded cDNA S1. Centrifuge PLG (Phase Lock Gel, Eppendorf 5 Prime Inc., pi-188 2 3 3 at 14,000X, transfer 162 ul of cDNA to PLG 2. Add 162 ul of Phenol:Chloroform:Isoamyl alcohol (pH centrifuge 2 minutes 3. Transfer the supernatant to a fresh 1.5 ml tube, add Glycogen (5 mg/ml) 2 M NH 4 OAC (0.75xVol) 120 ETOH (2.5xVol, -20 0 C) 400 4. Mix well and centrifuge at 14,000X for 20 minutes Remove supernatant, add 0.5 ml 80% EtOH (-20 0
C)
6. Centrifuge for 5 minutes, air dry or by speed vac for 5-10 minutes 7. Add 44 ul DEPC H 2 0 Analyze of quantity and size distribution of cDNA Run a gel using 1 ul of the double-stranded synthesis product Synthesis of biotinylated cRNA (use Enzo BioArray High Yield RNA Transcript Labeling Kit Cat#900182) Purified cDNA 22 ul Hy buffer 4 ul biotin ribonucleotides 4 ul DTT 4 ul Rnase inhibitor mix 4 ul 20X T7 RNA polvmerase 2 ul -115- Case S-50015A/16/78/NAD 00 40 ul STotal 40 ul NCK Centrifuge 5 seconds, and incubate for 4 hours at 37 0
C
D Gently mix every 30-45 minutes Purification and quantification of cRNA (use Qiagen Rneasy Mini kit Cat# 74103) cRNA 40 ul DEPC H 2 0 60 ul 00 RLT buffer 350 ul mix by vortexing EtOH 250 ul mix by pipetting Total 700 ul Wait 1 minute or more for the RNA to stick Centrifuge at 2000 rpm for 5 minutes RPE buffer 500 ul Centrifuge at 10,000 rpm for 1 minute RPE buffer 500 ul Centrifuge at 10,000 rpm for 1 minute Centrifuge at 10,000 rpm for 1 minute to dry the column DEPC H 2 0 30 ul Wait for 1 minute, then elute cRNA from by centrifugation, 10K 1 minute DEPC H 2 0 30 ul Repeat previous step Determine concentration and dilute to I ug/ul concentration Fragmentation of cRNA cRNA (1 ug/ul) 15 ul Fragmentation Buffer* 6 ul DEPC H 2 0 9 ul ul 116- Case S-50015A/16/78/NAD 00 O *5x Fragmentation Buffer ,1 IM Tris (pH8.1) 4.0 ml MgOAc 0.64 g KOAC 0.98 g DEPC H 2 0 Total 20 ml Filter Sterilize 00 Array wash and staining O Stringent Wash Buffer** Non-Stringent Wash Buffer*** SAPE Stain**** Antibody Stain***** Wash on fluidics station using the appropriate antibody amplification protocol **Stringent Buffer: 12X MES 83.3 ml, 5 M NaCI 5.2 ml, 10% Tween 1.0 ml, H 2 0 910 ml, Filter Sterilize ***Non-Stringent Buffer: 20X SSPE 300 ml, 10% Tween 1.0 ml, H0O 698 ml, Filter Sterilize, Antifoam ****SAPE stain: 2X Stain Buffer 600 ul, BSA 48 ul, SAPE 12ul, H 2 0 540 ul.
*****Antibody Stain: 2X Stain Buffer 300 ul, H 2 0 266.4 ul, BSA 24 ul, Goat IgG 6 ul, Biotinylated Ab 3.6 ul Example 2 Characterization of Gene Expression Profiles During Plant Development using the GeneChip The Arabidopsis GeneChip provides a method to simultaneously scan over 30% of the genome for the expression profile of each gene on chip. By using RNA extracted from different tissue and developmental stages of development, a scan of the entire Arabidopsis plant is achieved.
The advantages of a gene chip in such an analysis include a global gene expression analysis, -117- Case S-50015A/16/78NAI) O quantitative results, a highly reproducible system, and a higher sensitivity than Northern blot C analyses. Moreover, a gene chip with Arabidopsis DNA has a further advantage in that the Arabidopsis genome is well characterized.
C
Using the recently designed Arabidopsis high density oligonucleotide probe array, a total of 8,100 Arabidopsis thaliana genes were surveyed for temporal and developmental expression profiling. The objective was to identify known and novel genes that are expressed in specific organs (spatial expression) or developmental stages (temporal expression versus 8 constitutive expression). The represented genes included approximately 1,000 known full 0O length cDNAs, a collection of approximately 500 ESTs or full length sequences, 3,500 annotated Genbank genomic sequences (the transcripts of which were confirmed by the Spresence of ESTs in the database) and about 3,700 annotated Genbank sequences with a predicted translated open reading frame with 2 or more "hits" with a protein in the protein database having a defined function.
Total RNA was isolated from 9 samples at different developmental stages for to prepare cRNA microanalysis. These samples were analyzed in 9 separate GeneChip® (see, U.S. Patent Nos. 5,445,934, 5,744,305, 5,700,305, 5,700,637, 5,945,334 and EP 619321 and EP 373203) experiments that included RNA from: 1) germinating seed, day 4; 2) root 2 week; 3) root adult: 4) leaf; 5) leaf adult; 6) leaf senescence; 7) stem; 8) immature siliques; and 9) flowers prior to pollen shed. The samples were hybridized to the Arabidopsis arrays and analyzed by laser scanning for relative expression level, fold difference, organ and developmental expression. All genes were expressed in at least one of the samples.
Seeds of wild-type plants of Arabidopsis thaliana, ecotype Columbia, were sterilized and germinated in soil. Plants were grown in conviron growth chambers with 12 hours of light at 22 0 C 12:12 light dark cycle in metromix. Samples from leaves of 2-week, 5-week, 6-week, 8-week, and 11-week old plants, and inflorescences, flowers and siliques of the 6-week and 8week old plants were collected (Table In addition, 4-day old seedlings and roots from 2week, 4-week, and 5-week old plants cultured in MS liquid medium were collected. Samples collected from over 30 plants were pooled and homogenized in liquid nitrogen. Total RNA was extracted using Qiagen Rneasy column (Qiagen, Chatsworth, CA).
-118- Case S-50015A/16/78/NAD germinating seedling germinating seedling leaf leaf leaf leaf leaf leaf root root root root flower flower siliques siliques siliques inflorescence inflorescence Table 2 4 days of development 4 days of development 2 weeks after planting 2 weeks after planting 5 weeks after planting 6 weeks after planting 8 weeks after planting 11 weeks after planting 2 weeks after planting 2 weeks after planting 5 weeks after planting 6 weeks after planting 5 weeks after planting 6 weeks after planting 5 weeks after planting 6 weeks after planting 8-11 weeks after planting 6 weeks after planting 5 weeks after planting Total RNA (5 pg) from each sample was reverse transcribed using an oligo dT( 24 primer containing a 5' T7 RNA polymerase promoter sequence GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG-(dT)24-3' SEQ ID NO:585) and SuperScript II reverse transcriptase (Life Technologies). Second strand ofcDNA was synthesized using DNA polymerase I and DNA ligase. Biotinylated complementary RNAs (cRNAs) were in vitro transcribed by T7 RNA Polymerase (ENZO BioArray High Yield RNA Transcript Labeling Kit, Enzo). cRNAs were purified using an affinity resin (Qiagen Rneasy Spin Columns) and randomly fragmented by incubating at 940 C for 35 minutes in a buffer -119- Case S-50015A/16/78/NAD o containing 40 mM Tris-acetate, pH 8.1, 100 mM potassium acetate, and 30 mM magnesium C1 acetate to produce molecules of approximately 35 to 200 bases.
The labeled samples were denatured at 990 C for 5 minutes, equilibrated at 45 0 C for Sminutes, and hybridized to the Arabidopsis GeneChip® genome array (Affymetrix) at 45 0 C for 16 hours on a rotisserie at 60 rpm. The hybridized arrays were then rinsed with 1X STT and stained with streptavidin phycoerythrin at 25 0 C for 10 minutes twice with a rinse in between.
SAfter staining, arrays were washed with IX STT at 25 0 C for 20 minutes and stained with 8 biotinylated anti-streptavidin antibody at 25°C for 10 minutes. The probe array was stained 00 with SAPE at 25 0 C for 10 minutes and washed with wash buffer A at 30 0 C for 30 minutes.
O All of the wash and stain procedures were completed using a fluidic station (Affymetrix). The probe array was scanned twice and the intensities were averaged with a Hewlett-Packard GeneArray Scanner.
Genechip Suite 3.2 (Affymetrix) was used for data normalization. The overall intensity of all probe sets of each chip was scaled to 100 so that the hybridization intensity of all arrays was equivalent. False positives are defined based on experiments in which samples are split, hybridized to GeneChip® expression arrays and the results compared. A false positive is indicated if a probe set is scored qualitatively as an "Increase" or "Decrease" and quantitatively as changing by at least 2-fold and the average difference is greater than 25. A significant change is defined as 2-fold change or above with an expression baseline of 25, which is determined as the threshold level according to the scaling. For example, the data from each chip was loaded into GeneSpring software and analyzed for fold differences with the leaf samples. The 2-week leaf samples were used to find genes expressed 4-fold or higher in the leaf sample at 2 weeks of age versus all the other tissues. The remaining leaf samples at 5, 6, 8, and 11 weeks were not analyzed at this stage, but were analyzed independently. The leaf sample at 5 weeks was also analyzed against all other tissues except the remaining leaf samples for genes expressed 4-fold or higher in leaf tissue at 5 weeks. The other leaf samples were analyzed in a similar fashion. This allowed the selection of genes that were at least 4-fold elevated in expression in a leaf sample in at least one stage of development. When these genes were combined, there were 92 genes that were preferentially expressed in leaf tissue.
120- Case S-50015A/16/78/NAD 00 0 Image analysis and data mining C' Two text files are included in the analysis: Sa. One with Absolute analysis: giving the status of each gene, either absent or present in the samples b. The other with Comparison analysis: comparing gene expression levels between two samples SArabidopsis Genome Array A high-density Arabidopsis oligonucleotide array was used that includes probes for oo 8,100 Arabidopsis genes and 40 probes for spiking and negative controls. For each gene, there are 16 probe pairs (probe sets) including perfect match probes and mismatch probes for nonspecific binding control. The Arabidopsis genes are represented by known genes, predicted genes and approximately 100 clusters of ESTs. Predicted gene sequences were extracted and confirmed computationally by matching the genome sequence with ESTs and protein sequences.
The reproducibility of the array was characterized by calculation of the rate of false changes (number of genes significantly changed over the total number of genes on the array; Lipshultz, 1999). Two cDNA and subsequently cRNA (the antisense RNA synthesized by in vitro transcription using cDNAs as templates in the presence of biotinylated ribonucleotides) samples were prepared in parallel from the same total RNA samples, and hybridized to two different arrays manufactured in the same lot or different lots. Genes that showed changes of 2-fold and a signal threshold above the background (calculated according to the setting of the global scaling factor) were counted as false changes. Data from 15 pairs of array experiments indicated that false changes between two experiments using arrays of the same lot is 0.17% (based on 8 pairs), while the false change using arrays of two different lots is 0.22% (based on 7 pairs). Further analyses of these genes indicate that the fold change and expression levels are low and close to the threshold (Zhu and Wang, 2000).
Selected housekeeping genes are used to ensure the quality of the array experiments, because the quality of the total RNA and subsequently synthesized cDNA and cRNA samples has direct impact on the array results. Sample quality, specifically, labeled cRNA quality was monitored by comparing the ratio of the hybridization signal of 3N and 5N probe sets for 121 Case S-50015A/16/78/NAD GAPDH and ubiqutinl 1. Only data with a consistent 3N/5N ratio (Zhu and Wang, 2000) was archived in the database and used.
Specific Selection Criteria The following criteria selection were employed to identify Arabidopsis genes that were constitutively expressed.
Baseline (background) relative expression level of Candidates were first selected for relative expression of 250 in all tissues for a given gene.
Relative expression range of the 346 genes which were expressed in all tissue 250- 6,765.
o Candidate genes were selected for 5 fold difference in expression 331 genes o Candidate genes were selected for 3 fold difference 276 genes For 174 selected genes which met the above criteria The expression for each gene was averaged: 'low' expression =250-750; 97 genes (55.7%) 'moderate' expression 750-2250; 70 genes (40.2%) 'high' expression 2250-6750; 8 genes 47 genes were selected for further analysis 'low' expression =250-750; 21 genes (44.6%) 'moderate' expression 750-2250; 24 genes (51.0%) 'high' expression 2250-6750; 3 genes The following criteria were used to identify Arabidopsis genes expressed primarily in root tissue.
Baseline (background) relative expression level of Candidates were first selected for relative expression of 300 in all tissues for a given gene excluding the germinating seed data.
Candidate genes were sorted by fold difference. Root 3 other tissue <10 (10 fold lower expression) 122- Case S-50015A/16/78/NAD 0 When the germinating seed data included was included with the 64 selected genes 39 Swere identified with relative expression 2 150.
CO Thirteen were selected for further analysis.
Abundance Distribution of Transcripts Knowledge of the levels of all detectable mRNA species in Arabidopsis is useful for evaluating the complexity of the transcriptome and its control. The abundance of the transcript 0 species and their expression level in 5-week-old Arabidopsis was analyzed by examining the mRNA transcripts present in four major organs, leaves, roots, inflorescence stems, and 00 0 flowers. Among 8,300 genes analyzed, over 5,000 transcript species were detected in each C,1 organ. Comparison of the transcripts presented in these organs revealed the number and percentage of the commonly expressed and specifically expressed transcripts in each organ at this stage (Table 3).
Table 3 Root Inflorescence Stem Leaf Flower Root 6,052 4,928 4,915 5,243 Inflorescence Stem 5,399 4,828 5,036 Leaf 5,416 4,995 Flower 6,097 Specific 426 55 89 380 Expression measurements (average signal difference between perfect-match probes and mismatch probes) of the genes in each organ were examined. Data were collected and log transformed, then plotted against their frequencies. A normal distribution of the transcript abundance was revealed for all four organs. The median of the distributions is similar to the profiles of yeast, mammalian, and E. coli (Lockhart and Winzler, 2000). Overall, the transcription profile is more complex in flowers than in the vegetative organs. It is evidenced by the elevated frequencies in almost every level of transcription. Root has the most complex profile among the vegetative organs, while leaf and inflorescence stem have very similar and simpler profiles.
123- Case S-50015A/16/78/NAD 00 O 2. Constitutive and Organ Differential Gene Expression SThe composition of the constitutively and organ differentially expressed transcripts were characterized. A total of 347 constitutive expressed genes with median or high-level n transcripts were selected from the commonly expressed gene pool. These genes are constantly expressed above median expression level (average difference greater than 500) for al organs s and developmental stages examined. Functional categorization indicated that majority of the known constitutive genes are involved in metabolism and ribosomal protein synthesis 8 followed by genes involving transcription signaling transport 00 membrane synthases membrane and stress and defense related (Table O About 15% of the genes identified have no function assigned.
Organ differential expressed genes were also analyzed. These genes were expressed at median level (average difference greater than 50) in certain organ at al developmental stages, compared to other organs, the expression level for these genes in the organ are 4-fold higher than others. By these criteria, genes differentially expressed in root leaf (94), inflorescence stem and flower (36) were identified, and functionally categorized. To examine the organ-specificity of the differential expression, the expression level of differentially expressed genes were plotted against represented samples. The root differential expressed genes are expressed almost exclusively in root and young whole seedlings. There were 51 genes that were expressed only in root. Twenty-three percent of these genes had no known function while peroxidases and defense genes represented 51% of the genes.
Similar experiments were conducted for root at least 3 hours after exposure to stress, salt, mannitol or cold (Tables 9-10). Twenty-five root-specific promoters were downregulated and 8 were upregulated in response to salt stress, 21 were downregulated and 17 were upregulated in response to mannitol, and 22 were downregulated and 7 were upregulated in response to cold. Ten promoters did not respond to any of the stresses.
3. Dynamics of Gene Expression During Leaf Development In order to examine the dynamics of gene expression at mRNA level during different organ development, genes with transcripts detected in various developmental stages were analyzed. A total of 5,247 genes expressed during leaf development were subject to cluster analysis. Various clustering methods, including self-organizing map (SOM, Tamayo et al., 124 Case S-50015A/16/78/NAD O 1999), hierarchical cluster (Eisen et al., 1998) and K-mean, generated similar clusters. Sixteen N groups of genes formed according to their expression patterns when SOM was used. Four groups of genes were examined in detail.
C Cluster 15 shows a group of genes down regulated during leaf development. Genes in this group generally have a very high transcription level. However, they reduce their expression level by least 2-fold toward senescence. Among 34 genes in the cluster, 28 of them were photosynthesis related. Interestingly, some of the genes related to photosynthesis are also found in cluster 6, which shows a more gradual reduction in expression. These genes, OO such as ferredoxin-NADP+ reductase and NADPH protochlorophyllide oxidoreductase
B,
have relatively low level of transcripts, and their reduction is not as dramatic as others.
Cluster 8 was also analyzed. The expression of this group of genes shows a dramatic increase towards senescence. Detailed examination of this cluster revealed 8 genes involved in senescence. Other senescence genes also increased their transcription level during late development, however, those changes were not as dramatic as the eight genes identified in cluster 8. These genes were found in cluster 2.
4. Function Characterization of Global Gene Expression Pattern Cluster analysis also identifies co-regulated genes, and organizes the samples or array experiments according to their overall expression patterns. In order to validate the expression data, cluster analysis was performed on 6,626 genes with an expression level above background (average difference greater or equal 25) in any of the samples. All data were normalized to their median, organized into a SOM, and into a hierarchical cluster using Cluster program (Eisen et al. 1998).
According to the similarity of the global expression patterns of each sample, samples form three major clusters: a cluster of leaf samples, a cluster of supporting axis, including root, inflorescence stem and seedling samples, and a cluster of the reproductive organ samples, including samples of flowers, siliques, and inflorescences (including flowers and siliques).
Similarly, genes also organized into several major classes according to their expression levels: organ-differentially expressed genes were easily highlighted.
It is worth noting that sample/experimental variations also contributed to the clusters.
For example, the leaf gene expression data were produced from 2 independent experiments.
125- Case S-50015A/16/78/NAD o One set of the leaf materials was collected in the morning at approximately 10 o'clock, and the
O
N other set was collected in the afternoon around 3 o'clock in the afternoon. The circadian regulated gene expression contributed greatly to form two sample clusters. These circadian ti' regulated genes matched the genes described in Hammer et al. (2000).
Regulatory Sequences To elucidate the regulatory elements of co-regulated genes, AlignACE was employed 0 (Hughes et al., 2000). A total of 49 promoters were found to share a few potential and known 0 cis-acting elements. Among these cis-acting elements identified from the ribosomal promoters, 0 the telo-box motif (AAACCCTA) was observed in 41 of these ribosomal promoters. Telo-
O
C
I boxes have been found in many Arabidopsis ribosomal genes and in eEFIA (Tremousaygue et al., 1999). This telo-box binds a protein related to Pura conserved nuclear protein that has been implicated in the control of gene transcription and DNA replication (Safak et al., 1999).
Another motif identified in the ribosomal promoter regions was the Dof binding site (AAAG).
The Dof binding site has been shown in the promoters of a diverse set of plant genes, suggesting various roles of Dof proteins in plants (Yanagisawa and Schmidt, 1999), including carbon metabolism (Yanagisawa, 2000). Additional motifs observed include a pollen specific motif (AGAAA) and a RAVI binding motif (Kagaya et al., 1999).
The promoter regions from leaf-specific genes were also analyzed by AlignAce software to discover putative cis elements. Those that were found include a GATA box and a light regulatory element "ACGTGGCA". These elements are known to be necessary for light induced genes. A putative element that did not contain a known binding site was "TGGTTCGGACC" (SEQ ID NO:586). This element was located in 16 of the promoters analyzed.
A global gene expression pattern composed of the transcription profiles of 8, 100 genes in 20 samples collected from different organs during Arabidopsis development was identified.
By 166,000 gene expression measurements, the mRNA populations in different organs during Arabidopsis development were characterized. In particular, constitutively expressed genes and organ-differentially expressed genes were identified.
The accuracy of the microarray data was validated by two measures. First, the microarray results were repeatable. By comparing 15 pair of independently prepared labeled 126- Case S-50015A/16/78/NAD Ssamples, less than 0.2% of the false positive rate was observed. The false positives occurred N randomly among the genes with a low expression level. Second, expression levels measured by the oligonucleotide array correlated well with data from previous gene expression analysis and measurement from other technologies, such as RT-PCR.
Identification of constitutively and organ-differentially expressed genes is important to isolate constitutive or organ/tissue specific promoters. Here, it is demonstrated that the microarray technology can be used for large scale screening of these promoters, especially at the genome level. Moreover, genes that are co-regulated can be analyzed to identify the 0 regulatory elements. In this study, constitutive and organ-specific genes were identified through the screening of 8,100 genes, but also regulatory elements, such as telo-box, Dof binding site, as well as other motifs, which are important for the constitutive expression of the ribosomal proteins. By a similar approach, organ- or tissue-specific gene promoter elements, and various treatment-induced gene promoter elements, have been identified. Such results not only facilitate the dissection of the regulatory pathway, but also provide an opportunity in genetic engineering of metabolic pathways. Methods such as chimeraplasty (Zhu et al. 1999, 2000) can be used to precisely modify these regions and thus regulate a group of genes of interest.
Identification of co-regulated genes is the first step towards understanding of the regulation of a gene expression network, and assigning function to new genes. Among the 8,100 genes analyzed, approximately 3,100 genes do not have significant homology to known genes. Functional characterization of these genes becomes the challenge for the Arabidopsis genomics. A straightforward approach can be used to assign gene function; mutant lines or treated biological samples and their controls can be transcriptionally profiled. By comparing alterations in the expression of the novel genes, potential function can be assigned. The functions can be further confirmed by reverse genetics. Alternatively, genes with unknown function in the identified co-regulated gene clusters can be computationally analyzed by support vector machines (SVMs; Brown et al. 2000).
Similar experiments were conducted for root at least 3 hours after exposure to stress, salt, mannitol or cold (Tables 9-10). Twenty-five root-specific promoters were downregulated and 8 were upregulated in response to salt stress, 21 were downregulated and 127- Case S-50015A/16/78/NAD 00 O 17 were upregulated in response to mannitol, and 22 were downregulated and 7 were N upregulated in response to cold. Ten promoters did not respond to any of the stresses.
Example 3: Further Analysis of Constitutively Expressed Genes A standard curve of 50, 10, 2, 0.4, and 0.08 ng total RNA was generated for each Sprimer/probe set tested. In this case, the 50 ng sample yielded a C, value of 24.5 and the 10 ng
C
I sample yields a C, value of 26.7. The C, value is defined as the threshold cycle whereby Samplification occurs at an exponential rate. A low C, value correlates with high gene Sexpression. The threshold is determined empirically from the standard curve. By raising or lowering the threshold, the data set is maximized to represent optimal exponential amplification. A correlation coefficient
(R
2 of the best-fit line from the standard curve) greater than 99% and a slope of-3.3 (most efficient amplification) is ideal. For accurate repeatable results, the previous criteria must be met and the unknowns must fall within the range of the curve. The expression levels of the unknown can be interpolated from the unknown C, values using the standard curve.
TaqMan chemistry employs three gene-specific oligonucleotides for the detection of nucleic acids. Two of the oligonucleotides are primers used for the amplification of the molecule and the third oligonucleotide is a probe that is labeled with a 5' fluorescent reporter dye (6-FAM) and a 3' quencher dye (TAMRA). During PCR amplification, elongation proceeds once the DNA polymerase binds to the primer. As it polymerizes in the 5' to 3' direction, the polymerase encounters the quenched probe. The 5' to 3' exonuclease activity of the polymerase allows it to degrade the probe in its path, thereby releasing the 5' reporter dye.
The thermocycler is equipped with a detection system to measure the fluorescence from the released reporter dye. Since fluorescence increases with amplification of the molecule, fluorescence can be directly related to the amount of molecules in the starting sample. The primers that were employed for one set were: TRX3T 5' 6-FAM agacttcactgcaacatggtgcccac TAMRA 3' (SEQ ID NO:587); TRX3F 5' gtgtggaaatgacacagattgtga 3 (SEQ ID NO:588), and TRX3R 5'agacgggtgcaatgaaacg 3 (SEQ ID NO:589); and for the other set were: APX3 T 5' 6- FAM cgcgaacaagaactgtgctcctatcatg TAMRA 3' (SEQ ID NO:590), 128- Case S-50015A/16/78/NAD O APX3 F 5'gccgtgagctccgttctct3' (SEQ ID NO:591); and APX3 R 5'tcgtgccatgccaatcg 3
(SEQ
SID NO:592). TaqMan chemistries were used with the ABI Prism 7700 Sequence system for Srelative quantitation of nucleic acid.
t To find a gene whose expression is constitutive, the gene expression data obtained from the Arabidopsis GeneChipTM was analyzed. Three sets of data were analyzed (Table Part A represents expression data for 2 genes from wild-type plants infected or not infected with SPseudomonas syrmgae pv. maculicola strain ES4326 at 30 hours post-inoculation. Part B 8 represents expression data from wild-type Arabidopsis plants infected or not infected with 00 different viruses at 1 and 4 days after inoculation, while part C represents expression data for 2 O genes in 9 different tissue types.
129- Case S-5OOI5AI16I78/NAD 00 0 'able 4 )LANTS TRX3 APX3 tn olurnbia infected 2481 484 Zolumbia mock 2362 495
B:
0N DAYS GENE Mock TVCV 0RMV TRV
CMIV
00 TRX3 2020 1991 1738 2006 1833 1 PX3 1 5 717 755 658
C.
TRX3 APX3 4 day seed 1282 488 2 week root 1467 4 3 Adult root 1 857 320 2 week leaf 1233 771 Adult leaf 1483 857 Senescing leaf 1312 805 Flowers 694 51 3 Inflorescence 691 461 Immnature siliques 614 508 130- Case S-50015A/16/78/NAD 0 0 After analyzing the data, 2 candidate genes were identified, thioredoxin (TRX3; Genbank Accession No. U35640) and ascorbate peroxidase (APX3; Genbank Accession No.
U69138), whose expression did not vary more than 2-fold between the treatments in all experiments (except in flowers, inflorescence and siliques for TRX3). These genes also met the criteria of not having significant sequence similarity to other Arabidopsis genes.
Probe and primer sets were prepared for ubiquitin 5 (UBQ5), PR1 (a pathogenesis Srelated gene whose expression is induced upon infection), TRX3 and APX3. TaqMan was used to quantify relative expression levels of these genes in Arabidopsis mutants and in c01 uninfected and P. syringae infected plants. Table 5 shows that the PRI expression increased rapidly upon infection. TRX3 and APX3 expression levels did not change as much as a commonly used gene for normalization.
Table 5. Gene expression in Arabidopsis infected with P. syringae at 34 hours post inoculation. Measured by TaqMan.
PLANTS PR1 UBQ5 TRX3 APX3 Columbia 10 15 1.2 1.4 infected Columbia .0033 2.7 .62 1.4 Mock Pad4 mutant 4.6 2.0 1.2 1.4 infected Pad4 mutant .00027 .79 1.1 2 Mock Additionally, Arabidopsis plants were cold treated for 48 hours and the gene expression of these plants versus plants left at room temperature measured. There was no significant gene expression difference for PR 1, TRX3, or APX3 (Table 6).
131 Case S-50015A/16/78/NAD Table 6 Room temperature plants Cold-treated plants PRI 2.6 3.2 TRX3 2.0 2.4 APX3 2.1 2.8 In summary, gene-chip data was employed to find genes whose expression is constitutive in several Arabidopsis mutants, in infected plants, and throughout different tissues.
00 TRX3 and APX3 expression levels varied less than UBQ5 in a comparison between infected O and uninfected plants. TRX3 and APX3 gene expression was not significantly affected by cold-stress. Thus, TRX3 and APX3 are candidates for normalization when determining unknown gene expression levels in plants such as Arabidopsis or using quantitative PCR or other gene expression measurement assays. Likewise, the plant kingdom orthologs of these genes in dicots and monocots can be used for the same normalization standards for plants unrelated to Arabidopsis.
Moreover, unlike actin and ubiquitin (actin mediates cellular division and cycling and the ubiquitin pathway is activated upon stress, all of which may result in changes in gene expression), which belong to gene families to which probes can cross-hybridize, TRX3 and APX3 genes do not have significant similarity to genes in the Arabidopsis genome database, and the respective primer/probe sets described herein did not significantly cross-hybridize with other genes in the Arabidopsis genome database. Additionally, the promoters for these genes may be useful for constitutive gene expression.
Example 4: Construction of Binary Promoter::Reporter Plasmids To construct a binary promoter:: reporter plasmid for Arabidopsis transformation a vector containing a promoter of interest the DNA sequence 5' of the initiation codon for the gene of interest) was used, which resulted from recombination in a BP reaction between a PCR product using the promoter of interest as a template and pDONRneo. The regulatory/promoter sequence was fused to the GUS reporter gene (Jefferson et al, 1987) by recombination using GATEWAYTM Technology according to manufacturers protocol as 132- 1 Case S-50015A/16/78/NAD o described in the Instruction Manual (GATEWAYTM Cloning Technology, GIBCO BRL, 1 Rockvile, MD http://www.lifetech.com/). Briefly, the promoter fragment in the vector is recombined via the LR reaction with a binary Agrobacterium destination vector containing the GUS coding region with an intron that has an attR site 5' to the GUS reporter (pNOV2374).
The orientation of the inserted fragment was maintained by the att sequences and the final construct was verified by sequencing. The construct was then transformed into Agrobacterium u rumefaciens strains by electroporation.
pNOV237 4 is a binary vector with a VS 1 origin of replication, a copy of the 0 Agrobacterium virG gene in the backbone and a Basta resistance selectable marker cassette between the left and right border sequences of the T-DNA (SEQ ID NO:581).
The Basta selectable marker cassette comprises the Agrobacterium tumefaciens manopine synthase promoter (AtMas et al., 1983) operably linked to the gene encoding Basta resistance (denoted here as "BAR", phosphinothricin acetyl transferase, White et al, 1990) and the 35S terminator. The AtMas promoter, BAR coding sequence and 35S terminator are located at nt 4211 to 4679, nt 4680 to 5228, and nt 5263 to 5488, respectively, of pNOV2374.
The vector contains GATEWAYTM recombination components which were introduced into the binary vector backbone by ligating a blunt-ended cassette containing attR sites, ccdB and chloramphenicol resistance marker using the GATEWAYT" Vector Conversion System (LifeTechnologies, www.lifetech.com.). The GATEWAYTM cassette is located between nt 126 and 1818 of pNOV2374. The promoter cassettes are inserted through an LR recombination reaction whereby the DNA sequence of pNOV2374 between nt 126 and nt 1818 are removed and replaced with the promoter of interest flanked by at sequences. The recombination results in the promoter sequence fused to the GUS reporter gene with intron (GIG) sequence. The GIG gene contains the ST-LS1 intron from Solanum tuberosum at nt 385 to nt 576 of GUS (SEQ ID NO:582) (obtained from Dr. Stanton Gelvin, and described in Narasimhulu et al, 1996). Shown below in Table 7 are the orientations of the selectable marker and promoter-reporter cassettes in the binary vector constructs.
133 Case S-50015A/16/78/NAD STable 7 RB--AC9 promoter fragment (SEQ ID NO: 548)+GIG gene nos x LB RB--AC1 1 promoter fragment (SEQ ID NO: 550)+GIG gene nos x LB In RB--AC12 promoter fragment (SEQ ID NO: 551)+GIG gene nos x LB RB--AC13 promoter fragment (SEQ ID NO: 552)+GIG gene nos x LB RB--AC14 promoter fragment (SEQ ID NO: 553)+GIG gene nos x LB S RB--AC16 promoter fragment (SEQ ID NO: 555)+GIG gene nos x LB RB--AC19 promoter fragment (SEQ ID NO: 556)+GIG gene nos x LB 00 RB--AC20 promoter fragment (SEQ ID NO: 557)+GIG gene nos x LB RB--AC21 promoter fragment (SEQ ID NO: 558)+GIG gene nos x LB RB--AC23 promoter fragment (SEQ ID NO: 560)+GIG gene nos x LB RB--AC31 promoter fragment (SEQ ID NO: 565)+GIG gene nos x LB RB--AC32 promoter fragment (SEQ ID NO: 566)+GIG gene nos x LB RB--AC34 promoter fragment (SEQ ID NO: 567)+GIG gene nos x LB promoter fragment (SEQ ID NO: 568)+GIG gene nos x LB promoter fragment (SEQ ID NO: 571)+GIG gene nos x LB RB--AC42 promoter fragment (SEQ ID NO: 572)+GIG gene nos x LB RB--AC44 promoter fragment (SEQ ID NO: 573)+GIG gene nos x LB RB--AC46 promoter fragment (SEQ ID NO: 575)+GIG gene nos x LB RB--AC47 promoter fragment (SEQ ID NO: 576)+GIG gene nos x LB RB--IB-I promoter fragment (SEQ ID NO: 578)+GIG gene nos x LB RB--1G-2 promoter fragment (SEQ ID NO: 579)+GIG gene nos x LB RB--lAMixl-C promoter fragment (SEQ ID NO: 577)+GIG gene nos x LB RB--AR promoter fragment (SEQ ID NO: 536)+GIG gene nos x LB RB--AR2 promoter fragment (SEQ ID NO: 537)+GIG gene nos x LB RB--AR6 promoter fragment (SEQ ID NO: 539)+GIG gene nos x LB RB--AR8 promoter fragment (SEQ ID NO: 540)+GIG gene nos x LB RB--AR9 promoter fragment (SEQ ID NO: 541)+GIG gene nos x LB RB--AR10 promoter fragment (SEQ ID NO: 542)+GIG gene nos x LB x AtMas BAR 35S ter 134- Case S-50015A116/78/NAD 00
O
.o 0 0 t(N
O
l o o
(N
00l For comparison of promoter activity an additional construct was produced with the known Arabidopsis ubiquitin 3 (Ubq3(At), (Calis et al., 1990) promoter plus intron operatively linked to the GIG gene and the nos promoter. The artificial sequence of the Arabidopsis Ubiquitin3 promoter plus intron (Ubq3 is provided in SEQ ID NO:583. Thus, the orientation of the selectable marker and promoter-reporter cassette in the binary vector construct was RB-- Ubq3(At) promoter with intron fragment+GIG gene nos AtMas BAR 35S ter LB Example 5: In vitro Promoter Assays and Arabidopsis Transformation Plant preparation and growth Arabidopsis seeds are sown on moistened Fafard Germinating Mix at a density of 9 seeds per 4" square pot, placed in a flat, covered with a plastic dome to retain moisture and moved to a growth chamber. Following germination the dome is removed and plants are grown for weeks under short days (8 hrs light) to encourage vegetative growth and production of large plants with many flowers. Flowering is induced by providing long days (16 hrs. light) for 2-3 weeks, at which time plants are ready for dip inoculation into Agrobacterium to generate transgenic plants.
Agrobacterium transformation, culture growth and preparation for plant infiltration The binary promoter::reporter plasmids are introduced into Agrobacteria by electroporation. The binary plasmid confers spectinomycin resistance to the bacteria allowing cells containing the plasmid to be selected by growth of colonies on plates of LB spectinomycin (50 mg/L). Presence of the correct promoter::GUS plasmid is confirmed by sequence analysis of the plasmid DNA isolated from the bacteria.
Two days prior to plant transformation 5 mL cultures of LB spectinomycin (50 mg/L) are inoculated with the Agrobacterium strain containing the binary promoter::GUS plasmid and incubated at 30 0 C for about 24 hours. Each 5 mL culture is then transferred to 500 mL of LB spectinomycin (50 mg/L) and incubated for about 24 hours at 30°C. Each 500 rnL culture is transferred to a centrifuge bottle and centrifuged at 5000 rpm for 10 minutes in a Sorvall Centrifuge. The supernatant is removed and the pelleted Agrobacterium cells are retained. The Agrobacterium cells are resuspended in 500 mL of modified Infiltration Media 135- Case S-50015A/16/78/NAD O (IM+MOD: 50g/L sucrose, 10 mM MgCI, 10 uM benzylaminopurine) to which 50 ul of 0 CN Silwet L-77 (Dupont) has been added.
Plant transformation by dip infiltration Resuspended cells are poured into IL tri-pour beakers. Flowering plants are inverted into the culture, making sure all inflorescences are covered with the bacteria. The beakers are gently agitated for 30 seconds, keeping all inflorescence tissue submerged. Plants are returned to growth chamber following dip inoculation of the Agrobacterium. A second dip may be performed 5 days later to increase transformation frequency. Seeds are harvested -4 to 6 00 weeks after transformation.
0 Selection of transgenic Arabidopsis Seeds from transformed Arabidopsis plants are sown on moistened Fafard Germinating Mix in a flat, covered with a dome to retain moisture and placed in a growth chamber.
Following germination seedlings are sprayed with the herbicide BASTA. Transgenic plants are BASTA resistant due to the presence of the BAR gene in the binary promoter::GUS plasmid.
Promoter Assays Promoter activity is evaluated qualitatively and quantitatively using histochemical and florescence assays for expression of the B-glucuronidase (GUS) enzyme.
Histochemical B-glucuronidase (GUS) assay For qualitative evaluation of promoter activity, various Arabidopsis tissues and organs are used in GUS histochemical assays. Either whole organs or pieces of tissue are dipped into GUS staining solution. GUS staining solution contains 1 mM 5-bromo-4-chloro-3-indolyl glucuronide (X-Gluc, Duchefa, 20 mM stock in DMSO), 100 mM Na-phosphate buffer pH 10 mM EDTA pH 8.0, and 0.1% Triton X100. Tissue samples are incubated at 37 C for 1- 16 hours. If necessary samples can be cleared with several washes of 70% EtOH to remove chlorophyll. Following staining tissues are viewed under a light microscope to evaluate the blue staining showing the GUS expression pattern.
B-glucuronidase (GUS) florescence assay For quantitative analysis of promoter activity in various Arabidopsis tissues and organs, GUS expression is measured fluorometrically. Tissue samples are harvested and ground in ice cold GUS extraction buffer (50 mM Na 2
HPO
4 pH 7.0, 5 mM DTT, 1 mM Na 2 EDTA, 0.1% Triton X100, 0.1% sarcosyl). Ground samples are spun in a microfuge at 10,000 rpm for 136- Case S-50015A/16/78/NAD o minutes at 4 OC. Following centrifugation the supernatant is removed for GUS assay and for C protein concentration determination.
To measure GUS activity the plant extract is assayed in GUS assay buffer (50 mM Na 2
HPO
4 pH 7.0, 5 mM DTT, 1 mM Na 2 EDTA, 0.1% TritonX100, 0.1% sarcosyl, 1 mM 4- Methylumbelliferyl-beta-D-glucuronic acid dihydrate prewarmed to 37 0 C. Reactions are incubated and 100 uL aliquots are removed at 10 minute intervals for 30 minutes to stop the reaction by adding to tubes containing 900 uL of 2% Na2CO3. The stopped reactions are then read on a Tecan Spectroflourometer at 365 nm excitation and 455 emission wavelengths.
00 Protein concentrations are determined using the BCA assay following manufacturers protocol.
GUS activity is expressed as relative fluorometric units (RFU)/mg protein.
Example 6: Determination of the minimal promoter fragment The full-length promoter sequence as given in SEQ ID Nos: 536-579, more preferably in any one of SEQ ID Nos: 536; 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571- 576, 578 and 579, or the promoter orthologs thereof is fused to the 3-glucuronidase (GUS) gene at the native ATG to obtain a chimeric gene cloned into plasmid DNA. The plasmid DNA is then digested with restriction enzymes to release a fragment comprising the full-length promoter sequence and the GUS gene, which is then used to construct the binary vector. This binary vector is transformed into Agrobacterium tumefaciens, which is in turn used to transform Arabidopsis plants (for further details of the binary vector construction see above example 4) The above plasmid can also be used to form a series of 5' end deletion mutants having increasingly shorter promoter fragments fused to the GUS gene at the native ATG. Various restriction enzymes are used to digest the plasmid DNA to obtain the binary vectors with different lengths of promoter fragments. In particular, a binary vector 1 is constructed with a 1,900-bp long promoter fragment; a binary vector 2 is constructed with a 1,300-bp long promoter fragment; a binary vector 3 is constructed with a 1000-bp long promoter fragment; a binary vector 4 is constructed with a 800-bp long promoter fragment; a binary vector 5 is constructed with a 700-bp long promoter fragment; a binary vector 6 is constructed with a 600-bp long promoter fragment; a binary vector 6 is constructed with a 500-bp long promoter 137- Case S-50015A/16/78/NAD C fragment; and a binary vector 7 is constructed with a 100-bp long promoter fragment. Like the Sbinary vector comprising the full-length promoter fragment, these 5' end deletion mutants are Salso transformed into Agrobacterium tumefaciens and, in turn, Arabidopsis plants (for further Sdetails of Arbabidopsis transformation and promoter assay procedures see example 5 above).
The presence of the correct hybrid construct in the transgeic lines is confirmed by PCR amplification.
By using the above protocol it can be determined, which portion of the promoter sequences given in SEQ ID Nos: 536-579, more preferably in any one of SEQ ID Nos: 536; 00 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter O orthologs thereof is required for gene expression.
Minimal promoter fragments having lengths substantially less than the full-length promoter can therefore be operatively linked to coding sequences to form smaller constructs than can be formed using the full-length promoter. As noted earlier, shorter DNA fragments are often more amenable to manipulation than longer fragments. The chimeric gene constructs thus formed can then be transformed into hosts such as crop plants to enable at-will regulation of coding sequences in the hosts.
Example 7: Determination of Promoter Motifs While a deletion analysis characterizes regions in a promoter that are required overall for its regulation, linker-scanning mutagenesis allows for the identification of short defined motifs whose mutation alters the promoter activity. Accordingly, a set of linker-scanning mutant promoters fused to the coding sequence of the GUS reporter gene are constructed.
Each of them contains a 8-10-bp mutation located between defined positions and included in a promoter fragment as given in SEQ ID Nos: 536-579, more preferably to any one of SEQ ID Nos: 536; 537; 539-542; 548; 550-553; 555-558; 560; 565-568; 571-576, 578 and 579, or the promoter orthologs thereof.
Each construct is transformed into Arabidopsis and GUS activity is assayed for 19 to independent transgenic lines. The presenceof the correct hybrid consstruct in transgenic lines is confirmed by PCR amplification of all lines containing the mutant constructs and by random sampling of lines containing the other constructs. Amplified fragments are digested 138- Case S-50015A/16/78/NAD O with restriction enzyme (e.g.Xbal) and separated on high resolution agarose gels to distinguish N between the different mutant constructs. constructs. The effect of each mutation on promoter activity is compared to an equivalent number of transgenic lines containing the unmutated Sconstruct. Two repetitions resulting from independent plating of seeds are carried out in every case.
The sequences mutated in the linker-scanning constructs, in particular those that Sshowed marked differences from the control construct, are then examined more closely.
O
O
00 References O Abel et al., Science, 232:738 (1986).
Aharoni et al., Plant Cell, 5:613 (2000).
Altschul et al. Nucleic Acids Res., 25:3389 (1997).
Altschul et al., J. Mol. Biol., 215:403 (1990).
An et al., EMBO 4:277 (1985).
Aoyama et al., Plant Journal, 11:605 (1997).
AtMas, et al, Plant Mol. Biol., 2:335 (1983).
Auch Reth, Nucleic Acids Research, 18:6743 (1990).
Ballas et al., Nucleic Acids Res., 17:7891 (1989).
Bansal et al., Proc. Natl. Acad. Sci. USA, 89:3654 (1992).
Barkai-Golan et al., Arch. Microbiol., 116:119 (1978).
Barton et al., Plant Physiol., 85:1103 (1987).
Batzer et al., Nucleic Acid Res., 19:5081 (1991).
Beals et al., Plant Cell, 9:1527 (1997).
Belanger et al., Genetics, 129:863 (1991).
Bernal-Lugo and Leopold, Plant Physiol., 98:1207 (1992).
Bevan et al., Nucl. Acids Res., 11:369 (1983).
Bevan et al., Nature, 304:184 (1983).
Bevan, Nucl. Acids Res., 12:8711 (1984).
Bird et al., Plant Molecular Biology, 11:651 (1988).
Bisaro, Homologous Recomb. Gene Silencing Plants, pp. 219-70, Paszkowski Jerzy (eds.) (1994).
139- Case S-50015A/16/78/NAD 00 O Blackman et al., Plant Physiol., 100:225 (1992).
C Blochlinger Diggelmann, Mol Cell Biol, 4:2929 (1984).
Bolet al., Ann. Rev. Phyopath., 28:113 (1990).
Bouchez et al., EMBO 8:4197 (1989).
Bouchez et al., EMBO Journal, 8:4197 (1989).
Bourouis et al., EMBO 2:1099 (1983).
Bowler et al., Ann. Rev. Plant Physiol., 43:83 (1992).
Branson and Guss, Proc. North Central Branch Entomological Society of America (1972).
00 Broakgert et al., Science, 245:110 (1989).
SBrown et al., PNAS USA, 97:262 (2000).
Byme et al. Plant Cell Tissue and Organ Culture, 8:3 (1987).
Callis et al., Genes and Develop., 1:1183 (1987).
Callis et al., J. Biol. Chem., 265:12486 (1990).
Campbell and Gowri, Plant Physiol., 92:1 (1990).
Castrsana et al., EMBO 7:1929 (1988).
Chandler et al., Plant Cell, 1:1175 (1989).
Chee et al. Plant Physiol., 91:1212 (1989).
Chee et al., Methods Mol. Biol., 44:101 (1995).
Christou et al. Proc. Natl. Acad. Sci USA, 86:7500 (1989).
Christou et al., Biotechnology, 9:957 (1991).
Christou et al:, Plant Physiol., 87:671 (1988).
Coe et al., In: Corn and Corn Improvement, Sprague et al. (eds.) pp. 81-258 (1988).
Cordero et al., Plant 6:141 (1994).
Corpet et al. Nucleic Acids Res., 16:10881 (1988).
Coxson et al., Biotropica, 24:121 (1992).
Crameri et al., Nature Biotech., 15:436 (1997).
Crameri et al., Nature, 391:288 (1998).
Crossway et al., BioTechniques, 4:320 (1986).
Cuozzo et al., Bio/Technologv, 6:549 (1988).
Cutler et al., J. Plant Physiol., 135:351 (1989).
Czako et al., Mol. Gen. Genet., 235:33 (1992).
140- Case S-50015A116/78/NAD 00 SCzapla and Lang, J. Econ. Entomol., 83:2480 (1990).
Datta et al., Bio/Technolog, 8:736 (1990).
SDavies et al., Plant Physiol., 93:588 (1990).
trn Dayhoff et al., Atlas of Protein Sequence and Structure, Nati. Biomed. Res. Found., Washington, C.D. (1978).
Cs De Blaere et al., Meth. Enzymol., 143:277 (1987).
>De Block et al. Plant Physiol., 91:694 (1989).
0 De Block et al., EMBO Journal, 6:25 13 (1987).
00 Defla-Cioppa et al., Plant Physiology, 84:965-968 (1987).
SDellaporta et al., in Chromosome Structure and Function, Plenum Press, 263-282 (1988).
Dennis et al., Nucleic Acids Res., 12:3983 (1984).
Depicker et al., Plant Cell Reports, 7:63 (1988).
DeRisi et al., Science, 278:680 (1997).
Desprez et al., Plant 14:643 (1998).
Diek-man Fischer, EMBO, 7:33 15 (1988).
Duggan et al., Nat. Genet., 21:10 (1999).
Dunn et al., Can. J. Plant Sci., 61:583 (1981).
Dure et al., Plant Mol. Biol., 12:475 (1989).
Eisen et al., PNAS USA, 95:14863 (1998).
Ellis et al., EMBO Journal, 6:3203 (1987).
Elroy-Stein et al., Proc. NatI. Acad. Sci. 86:6 126 (1989).
English et al., Plant CeUl, 8:179 (1996).
Erdmann et al., J. Gen. Microbiol., 138:363 (1992).
Everett et al., Bio/fechnolMg, 5:1201(1987).
Fitzpatrick, Gen. Engin ering News, 22:7 (1993).
Franken et al., EMBO 10:2605 (1991).
Fromm et al., Nature (London), 319:791 (1986).
Fromm et al., Bio/TechnolMg, 8:833 (1990).
Gallie et al., Nucleic Acids Res., 15:3257 (1987).
Gallie et al., The Plant CeUl, 1:301 (1989).
Gan et al., Science, 270:1986 (1995).
141 Case S-50015A/16/78/NAD 00 0 Gatehouse et al., J. Sci. Food Agric., 35:373 (1984).
C Gatz, Current Opinion in Biotechnology, 7:168 (1996).
Gatz, Annu. Rev. Plant Physiol. Plant Mol. Biol., 48:89 (1997).
Gelfand, eds., PCR Strategies Academic Press, New York (1995).
Gelvin et al., Plant Molecular Biology Manual, (1990).
Giege et al., Plant 15:721 (1998).
Gordon-Kamm et al., Plant Cell, 2:603 (1990).
SGoring et al, PNAS, 88:1770 (1991).
00 Graham et al., Biochem. Biophys. Res. Comm., 101:1164 (1981).
Graham et al., J. Biol. Chem., 260:6555 (1985).
Graham et al., J. Biol. Chem., 260:6561 (1985).
Gritz et al., Gene, 25:179 (1983).
Gruber, et al., Vectors for Plant Transformation, in: Methods in Plant Molecular Biology Biotechnology" in Glich et al., (Eds. pp. 89-119, CRC Press, 1993).
Guerineau et al., Mol. Gen. Genet., 262:141 (1991).
Guerrero et al., Plant Mol. Biol., 15:11 (1990).
Gupta et al., PNAS, 90:1629 (1993).
Haines and Higgins Nucleic Acid Hybridization, IRL Press, Oxford, U.K.
Hammock et al., Nature, 344:458 (1990).
Hemenway et al., EMBO Journal, 7:1273 (1988).
Henikoff Henikoff, Proc. Natl. Acad. Sci. USA, 89:10915 (1989).
Hiei et al., Plant 6:271 (1994).
Higgins et al., CABIOS, 5:151 (1989).
Higgins et al., Gene, 73:237 (1988).
Hilder et al., Nature, 330:160 (1987).
Hinchee et al. Bio/Technology 6:915 (1988).
Hoekema, In: The Binary Plant Vector System. Offset-drukkerij Kanters Alblasserdam (1985).
Huang et al., CABIOS, 8:155 (1992).
Hudspeth Grula, Plant Molec. Biol., 12, 579 (1989).
Hughes et al., J. Mol. Biol., 296:1205 (2000).
142 Case S-50015A/16/78/NAD 00 O Ikeda et al., J. Bacteriol., 169:5612 (1987).
SIkuta et al., Biotech., 8:241 (1990).
SIngelbrecht et al., Plant Cel, 1:671 (1989).
SInnis et al., PCR Protocols: A Guide to Methods and Applications, Academic Press, Inc., San Diego, CA. (1990).
s Innis and Gelfand, eds., PCR Methods Manual (Academic Press, New York) (1999).
r Innis et al., eds., PCR Protocols: A Guide to Methods and Applications (Academic Press, New SYork (1995).
00 Jefferson et al, EMBO J, 6: 3901-3907 (1987).
0 Jobling et al., Nature, 325:622 (1987).
John et al., Proc. Natl. Acad. Sci. USA, 89:5769 (1992).
Johnson et al., PNAS USA, 86:9871 (1989) Joshi et al., Nucleic Acid Res., 15:9627 (1987).
Kaasen et al., J. Bacteriol., 174:889 (1992).
Kagaya et al., Nucleic Acids Res., 27:470 (1999).
Karlin and Altschul, Proc. Natl. Acad Sci. USA, 87:2264 (1990).
Karlin and Altschul, Proc. Natl. Acad. Sci. USA, 90:5873 (1993).
Karsten et al., Botanica Marina, 35:11 (1992).
Katz et al., J. Gen. Microbiol., 129:2703 (1983).
Kehoe et al., Trends Plant Sci., 4:38 (1999).
Keller et al., EMBO Journal, 8:1309 (1989).
Keller et al., Genes Dev., 3:1639 (1989).
Klein et al., Nature, 327:70 (1987).
Klein et al., Bio/Technologv, 6:559 (1988).
Klein et al., Plant Physiol., 91:440 (1988).
Klein et al., Proc. Natl. Acad. Sci. USA, 85:4305 (1988).
Knauf, et al., Genetic Analysis of Host Range Expression by Agrobacterium In: Molecular Genetics of the Bacteria-Plant Interaction, Puhler, A. ed., Springer-Verlag, New York, 1983.
Koehl P. and Delarue Curr. Opin. Struct. Biol., 6:222 (1996).
Kohler et al., Plant Mol. Biol., 29:1293 (1995).
143 Case S-50015A/16/78/NAD 00 O Koster and Leopold, Plant Physiol., 88:829 (1988).
0 N Koziel et al., Biotechnology, 11:194 (1993).
a) Kridl et al., Seed Science Research, 1:209 (1991).
tr Kriz et al., Mol. Gen. Genet., 207:90 (1987).
Kunkelet al., Methods in Enzymo., 154:367 (1987).
Kunkel, roc. Natl. Acad. Sci. USA, 82:488 (1985).
Lamet al., Plant Cell, 1:1147 (1989).
0 Landolt, Biosystematic Investigation on the Family of Duckweeds: The family of Lemnaceae 00 A Monograph Study. Geobatanischen Institut ETH, Stiftung Rubel, Zurich (1986).
O Langridge et al., Proc. Nat Acad. Sci. 86:3219 (1989).
Langridge et al., Cell, 34:1015 (1983).
Lashkari et al., PNAS USA, 94:8945 (1997).
Laufs et al., PNAS, 87:7752 (1990).
Lawton et al., Mol. Cell Biol., 7:335 (1987).
Lee and Saier, J. Bacteriol., 153 (1982).
Lesyng B. and McCammon JA, Pharmocol. Ther., 60:149 (1993).
Levings, Science, 250:942 (1990).
Lindsey et al., Transgenic Research, 2:3347 (1993).
Lindstrom et al., Der. Genet., 11:160 (1990).
Lockhart et al., Nat. Biotechnol, 14:1649 (1996).
Lockhart and Winzeler, Nature, 405:827 (2000).
Lommel et al., Virology, 181:382 (1991).
Loomis et al., J. Expt. Zool., 252:9 (1989).
Lorz et al., Mol. Gen. Genet., 199:178 (1985).
Lyznik et al., Nucleic Acids Res., 21:969 (1993).
Ma et al., Nature, 334 :631 (1988).
Macejak et al., Nature, 353:90 (1991).
Maki et al., Methods in Plant Molecular Biology Biotechnology, Glich et al., 67-88 CRC Press, (1993).
Maleck et al., Nat. Genet., 26:403 (2000).
Mansson et al., Gen. Genet., 200:356 (1985).
144- Canc S-5015AI 16/78/NAD 00 0 Mariani et al, Nature, 347:737 (1990).
SMartinez et al., J. Mol. Biol., 208:551 (1989).
McBride et al., Plant Molecular Biology, 14:266 (1990).
SMcBride et al., PNAS USA, 91:7301 (1994).
McCabe et al., Bio/Techno ogy, 6:923 (1988).
SMcElroy et al., Mol. Gen. Genet., 23 1:150 (1991).
Meinkoth and Wahl, Anal. Biochem., 138:267 (1984).
0 Messing and Vierra, Gee 19:259 (1982).
00 Michael et al., J. Mo!. Biol., 26 :585 (1990).
SMogen et al., Plant Cell, 2:1261 (1990).
Moore et al., J. Mol. Biol., 272:336 (1997).
Mundy and Chua, EMBO 7:2279 (1988).
Munroe eta., n, 91:151 (1990).
Murakami et al., Mo!. Gen. Genet., 205:42 (1986).
Murata et al., FEBS Lett., 296:187 (1992).
Murdock et al., Phytochernisty, 29:85 (1990).
Murray et al., Nucleic Acids Res., 17:477 (1989).
Myers and Miller, CABIOS, 4:11 (1988).
Napoli et al., Plant Cel, 2:279 (1990).
Narasimhulu et al, Plant Cel, 8: 873-886, (1996).
Needleman and Wurisch, J. Mol. Biol., 48:443-453 (1970).
Newman et al., Plant Physiol., 106:1241 (1994).
Niedz et al., Plant Cell Reprts, 14:403 (1995).
Odel-l et al., Mol. Gen. Gene., 113:369 (1990).
Odell et al., Homnologous Recomb. Gene Slencing Plants 2 19-70, Paszkowski Jerzy (eds) (1994).
Odell et al., Nature, 313:810 (1985).
Ohtsuka et al., J. Biol. Chem., 260:2605 (1985).
Ow et al., Science, 234:856 (1986).
Pacciotti et al., Bio/TechnolMg, 3:24 1 (1985).
Park et al., J. Plant Bio.., 38:365 (1985).
145 Case S-50015A/16/78/NAD 00 O Paszkowski et al., EMBO 3:2717 (1984).
SPear et al., Plant Molecular Biology, 13:639 (1989).
Pearson and Lipman, Proc. Natl. Acad. Sci., 85:2444 (1988).
t) Pearson et al., Meth. Mol. Biol., 24:307 (1994).
Perlak et al., Proc. Nat. Acad. Sci. USA, 88:3324 (1991).
Phillips et al., In Corn Corn Improvement, 3rd Edition 10 Sprague et al. (Eds. pp. 345- 387)(1988).
O Phi-Van et al., Mol. Cell. Biol., 10:2302 (1990).
00 Piatkowski et al., Plant Physiol., 94:1682 (1990).
0 Potrykus et al., Mol. Gen. Genet., 199:183 (1985).
Potrykus, Trends Biotech., 7:269 (1989).
Poulsen et al., Mol. Gen. Genet., 205:193 (1986).
Prasher et al., Biochem. Biophys. Res. Comm., 126:1259 (1985).
Proudfoot, Cell, 64:671 (1991).
Quigley et al., J. Mol. Evol., 29:412 (1989).
Ralston et al., Genetics, 119:185 (1988).
Reed et al., J. Gen. Microbiol. 130:1 (1984).
Reina et al., Nucleic Acids Res., 18:6425 (1990).
Reina et al., Nucleic Acids Res., 18:7449 (1990).
Reymond et al., Plant Cell, 12:707 (2000).
Richmond et al., Curr Opin Plant Biol., 3:108 (2000).
Riggs et al., Proc. Natl. Acad. Sci. USA, 83:5602 (1986).
Rossi et al., Biophys. 80:480 (2001).
Rossolini et al., Mol. Cell. Probes, 8:91 (1994).
Rothstein et al., Gene, 53:153 (1987).
Ruiz, Plant Cell, 10:937 (1998).
Safak et al., Mol. Cell Biol., 19:2712 (1999).
Sambrook et al., Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New York) (1989).
Sanfacon et al., Genes Dev., 5:141 (1991).
Sanford et al., Particulate Science and Technology, 5:27 (1987).
146- Case S-50015A/16/78/NAD 00 O Schaffer et al., Curr Opin Biotechnol., 11:162 (2000).
N Schena et al., Science, 270:467 (1995).
SSchenk et al., PNAS USA, 97:11655 (2000).
In Schmidhauser and Helinski, J. Bacteriol., 164:446 (1985).
Schwob et al., Plant 4:423 (1993).
s, Shagan et al., Plant Physiol., 101:1397 (1993).
r- Shapiro, Mobile Genetic Elements, Academic Press, N.Y. (1983).
0 Shimamoto et al., Nature, 338:274 (1989).
00 Simpson, Plant Mol. Biol., 19:699 (1985).
O Skriver and Mundy, Plant Cell, 2:503 (1990).
Skuzeski et al., Plant Molec. Biol. 15: 65-79 (1990).
Slater et al., Plant Mol. Biol., 5:137 (1985).
Smith et al., Adv. Appl. Math., 2:482 (1981).
Smith et al., Mol. Gen. Genet., 224:447 (1990).
Smith et al., Planta, 168:94 (1986).
Southern et al., Nature Genet., 21:5-9 (1999).
Spencer et al., Theor. Appl. Genet, 79:625 (1990).
Stalker et al., Science, 242:419 (1988).
Staub et al., EMBO 12:601 (1993).
Staub et al., Plant Cell, 4:39 (1992).
Steifel et al., The Plant Cell, 2:785 (1990).
Stemmer, Nature, 370:389 (1994).
Stemmer, Proc. Natl. Acad. Sci. USA, 91:10747 (1994).
Stiefet al., Nature, 341:343 (1989).
Stouggard, The Plant Journal, 3:755 (1993).
Sukhapinda et al., Plant Mol. Biol., 8:209 (1987).
Sullivan et al., Mol. Gen. Genet., 215:431 (1989).
Surles et al., Protein Sci., 3:198 (1994).
Sutcliffe, PNAS USA, 75:3737 (1978).
Svab et al., Proc. Natl. Acad. Sci. USA, 87:8526 (1990).
Svab et al., Proc. Natl. Acad. Sci. USA, 90:913 (1993).
147- Case S-50015A/16/78/NAD 00 O Tamayo et al., PNAS USA, 96:2907 (1999).
Tarczynski et al., PNAS USA, 89:2600 (1992).
0 Thillet et al., J. Biol. Chem., 263:12500 (1988).
it Thompson et al., EMBO J, 6:2519 (1987).
Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Elsevier, New York (1993).
Tomes et al., Plant Cell Tissue and Organ Culture: Fundamental Methods, Springer Verlag, SBerlin (1995).
00 Tomic et al., NAR, 12:1656 (1990).
o Tremousaygue et al., Plant 20:553 (1999).
Turner et al., Molecular Biotechnology, 3:225 (1995).
Twell et al., Plant Physiol., 91:1270 (1989).
Ugaki et al., Nucl. Acids Res., 19:371 (1991).
Ulmasov et al., Plant Mol. Biol., 35:417 (1997).
Upender et al., Biotechniques, 18:29 (1995).
Vaeck et al., Nature, 328:33 (1989).
van der Krol et al., Plant Cell, 2:291 (1990).
vanTunen et al., EMBO 7:1257 (1988).
Vasil et al., Biotechnology, 11:1553 (1993).
Vasil et al., Mol. Microbiol., 3:371 (1989).
Vasil et al., Plant Physiol., 91:1575 (1989).
Vernon and Bohnert, EMBO 11:2077 (1992).
Vodkin, Prog. Clin. Biol. Res., 138:87 (1983).
Vogel et al., EMBO 11:157 (1992).
Walker and Gaastra, eds., Techniques in Molecular Biology, MacMillan Publishing Company.
New York (1983).
Wandelt et al., Nucleic Acids Res., 17:2354 (1989).
Wang et al., Mol. Cell. Biol., 12:3399 (1992).
Waterman M.S. Introduction to Computational Biology: Maps, sequences and genomes.
Chapman Hall. London (1995).
Watson et al., Corn: Chemistry and Technology (1987).
148 Case S-50015A/16/78/NAD 00 SWatrud et al., in Engineered Organisms and the Environment (1985).
C Weeks et al., Plant Physiol., 102:1077 (1993).
Weissinger et al., Annual Rev. Genet., 22:421 (1988).
In Wenzler et al., Plant Mol. Biol., 13:347 (1989).
White et al, Nucl Acids Res, 18 1062 (1990).
Wolter et al., EMBO Journal, 11:4685 (1992).
Wyn-Jones and Storey, Physiology and Biochemistry of Drouht Resistance in Plants, Paleg et Sal. pp. 171-204 (1981).
00 Xiang and Guerra, Plant Physiol., 102:287 (1993).
S Yamaguchi-Shinozaki et al., Plant Cell Physiol., 33:217 (1992).
Yamamoto et al., Nucleic Acids Res., 18:7449 (1990).
Yanagisawa and Schmidt, Plant 17:209 (1999).
Yanagisawa et al., Plant 21:281-288 (2000).
Yuan et al., Plant 15:821 (1998).
Zhang et al., Proc. Natl. Acad. Sci. USA, 94:4504 (1997).
Zhu et al., Nat. Biotechnol., 18:555-558 (2000).
Zhu et al., Plant Physiol., 124:1472 (2000).
Zhu et al., Proc. Natl. Acad. Sci. USA, 96:8768-8773 (1999).
Zukowsky et al., PNAS USA, 80:1101 (1983).
All publications, patents and patent applications are incorporated herein by reference. While in the foregoing specification this invention has been described in relation to certain preferred embodiments thereof, and many details have been set forth for purposes of illustration, it will be apparent to those skilled in the art that the invention is susceptible to additional embodiments and that certain of the details described herein may be varied considerably without departing from the basic principles of the invention.
149- Case S-50015Ad16/78/NAD Appendix: Table 8 provides a description of the corresponding genes for the A rabidopsis sequences which are expressed in a root-specific manner.
Table 8: A f-fccc-nn It AfTv Description Affy I I A71588-1 14015_s-at pirII1626 reticuline oxidase homnolog F21IC20.190 Arabidopsis thaliana >gi15262224IembICAB4585O. 11 (AL080254) reticuline oxidase-Like protein [Arabidopsis thalianal >gil7268880lembICAB79O84. ij (ALI 61553) reticuline oxidase-like protein (Arabidopsis thaliana] gblAAD25763.1I AC0070 6 0_21 (AC007060) Strong simidlarity to F1913.2 gi13033 3 7 5 putative berberine bridge enzyme from Arabidopsis thaliana BAC gbIAC004238.
A71596.1 1 106sa A71597.1 12079_s-at ".gblAAD25757. 1 JAC007060..15 (AC007060) Strong similarity to F1 913.2 giJ30333 7 5 putative berberine bridge enzyme from Arabidopsis thaliana BAC gbIAC004238. ESTs gbIF19886, gbIZ30784 and gbIZ30785 come from this gene" dbjIBAA82824.11 (AB023462) basic endochitinase [Arabidopsis thaliana] AB023448.2--- 12332_s-at AC001645.19 1565at
-I
gblAAC08601. 11 (AF054906) myrosinase-binding protein homolog [Arabidopsis thaliana]
I
AC00 1645.47 15996_at AC00 1645.50 1 15981_at AC002333. 199 13552_at gblAAB63635. 11 (ACOO 1645) jasmonate inducible protein isolog [Arabidopsis thaliana] gblAAB63635. i1 (ACOO 1645) jasmonate inducible protein isolog [Arabidopsis thalbana] gblAAB64044.11 (AC002333) putative endoch~itinase [Arabidopsis thaliana] spIQO629CH4-BRANA BASIC ENDOCHITINASE CHB4 PRECURSOR >gi l74353531pirl jS253 11 chitinase (EC 3.2.1.14) precursor rape >gil 1 7799IembICAA43708. 11 (X6 1488) chitinase [Brassica napus] AC002333.210-- 13154_s-at 150- Case S-5OOI5AII6/78/NAD 0 Accession Affy Description .nAC002391.15O 17842fi-at pirlIT047 3 1 cytochrome P450 homolog F6G 17.2 Arabidopsis thaliana >gil4468803lembICAB38 2
O
4 l1 (A035601) cytochrome P450-Lie protein [Arabiclopsis thalianal >giI72 7 0 7 19lembICAB 80402.11 (ALl 61591) cytochrome P450-like protein [Arabidopsis thalianal Cs AC003673.
2 Ol 1648-s-at pirJIT0 16 2 6 peroxidase (EC 1. 11. 1.7) ATP22a Arabidopsis thaliana >giJ300455 81gbIAAC090 3 1. 11 (AC003673) peroxidase (ATP22a) [Arabidopsis thalianal SAC004005.lO 4 19390.at pirIITOO681 hypothetical protein F6E13.14 Arabidopsis 00 thaliana >gi13 2 1285 81gblAAC23409. 11 (AC004005) unknown protein [Arabidopsis thaliana] AC004521.l 14 19195at pirl1T02393 hypothetical protein F411.19 Arabidopsis thaliana >giJ3 128201 JgbJAAC 16105. 11 (AC00452 1) unknown protein [Arabidopsis thalianal AC004521.l 19 20608-s..at pirllT02393 hypothetical protein F411.19 Arabidopsis thaliana >giI3 128201 JgbJAAC 16105. 11 (AC00452 1) !!=-,unknown protein [Arabidopsis thalianal AC004683.
7 9 16461_-i-at splP24102IPERE-ARATH BASIC PEROXIDASE
E
PRECURSOR >giJ8 l653IpirIJUO 4 58 peroxidase
(EC
1. 11.1. 7) E Arabidopsis thaliana >gil 1 66807 jgblAAA32842. 11 (M5 838 1) peroxidase [Arabidopsis thahana] AC004684.1 6 5 17907_s-at pirlITO254l hypothetical protein F13M22.25 Arabidopsis thaliana >gil32362571gblAAC236 4 5. 11 (AC004684) unknown protein [Arabidopsis thalianal AC005310.
6 17697_at pirlT02675 hypothetical protein F19D 11.2 Arabidopsis thaliana >gil35 10249 jgblAAC33493. 11 (AC0053 unknown protein [Arabidopsis thaliana] AC005560.13 6 16016_at pirlIG7 1401 probable major latex protein Arabidopsis thaliana >giJ22447621embICAB 10 185. 11 (Z9733 5) major latex protein like [Arabidopsis thaliana] >gil726 8 11 I embjCAB784 4 8. 11 (ALI 61538) major latex protein like I Arabidopsis thaliana] AC005560.1 4 7 12758_at pirIIG7 1401 probable major latex protein Arabidopsis thaliana >giJ22447621embICAB 10185.11 (Z973 35) major latex protein like [Arabidopsis thaliana] >gil7 2 6 8 11 I embICAB7844 8 (ALI 61538) major latex protein like [Arabidopsis thaliana] 151 Case S -50015 A/16/7 8/N AD IDescriDtion A jiLLI~IUEIW 1 Affv #t Aces on I ACOOS 967.50 17864_at AC006216.22 14050_at AC006216.2187_a AC006577.1 6 178ra embiCAAl 8195.11 (AL022 198) putative protein [Arabidopsis thafianal >gil7270000lembICAB798 16. 11 (AL16 1578) putative protein [Arabidopsis thaliana] gbjAAD 12680. 11 (AC006216) S imilar to giJ341 3 7 l 4 TI19118.21 putative myro sinase- binding protein from Arabidopsis thaliana BAC gbIAC004747 gbIAAD 12679. 11 (AC006216) Similar to giJ34l 3 7 1 4 Ti19118.21 putative myrosinase- biding protein from Arabidopsis thaliana BAG gbIAC004747. ESTs gbIT44298, gbIT42447, gbIR64761 and gbjIl100206 come from this gene" '.gblAAD25772. 1 JAC0065778 (AC006577) Belongs to the PF100657 Lipase/Acylhydrolase with GDSL-motif family. ESTs gbIT44453, gbITO48 15, gbIT45993, gbIR3O138, gbIA109957O and gbIT22281 come from this gene. [Arabidopsis thalianal' gbIAAD2 149 1. 11 (AC0065 87) unknown protein [Arabidopsis thalianal gblAAD25758. 1 JAC007060...16 (AC007060) Strong similarity to F1913.2 gi13033 37 5 putative berberine bridge enzyme from Arabidopsis thaliana BAG gbIAC004238 gblAAD4 1993.1 I AC006233.16 (AC006233) unknown protein [Arabidopsis thaliana] gbIAAF2O25l1. 1 ACO 15 4 501 2 (AC0 15450) unknown protein [Arabidopsis thalianal dbjIBAA2 1873. 11 (AB006068) acidic endochitinase [Arabidopsis thaliana] gblAAD 12259 .1I1 (A.F098631I) putative cell AaU-plasma membrane disconnecting CLCT protein [Arabidlopsis thaliana] AC006587. 164 15859_at AC007060.3 4 194_s-at AC007135.23 20176_at AC007584.48 20194_at ACHI 1252s at AF098630. 3 1911 8_s-at 152- Case S-5001 SA/16/78INAD 00 00 Aeep~inn Affy IDescription A I.sso AF128395.1 2 20395_at A]J133036.5 t1995a 5 pIP331541PR1_ARATH PATHOGENESIS- RELATED PROTEIN 1 PRECURSOR
(PR-I)
>gij322557jpirjjJQ1 693 pathogenesis-related protein I precursor, 17.6K Arabidopsis thatiana >gij 166861 IgbiAAA32863. 11 (M90508) PR- I -like protein [Arabidopsis thaliana] >gi1381I05991gblAAC6938 1. .1 (ACOOS 398) pat hogenes is- related PR- I-like protein [Arabidopsis thalianal" spIP24I11PERCARATH NEUTRAL PEROXIDASE C PRECURSOR >giJ816521pirIJU0457 peroxidase (EC 1. 11. 1.7) C Arabidopsis thaliana >gil11668271gblAAA32849.11~ (M58380) peroxidase [Arabidopsis thaliana] >gil6522555IembICAB6 1999.11 (ALl 32967) peroxidase [Arabidopsis thaliana] >gi17422471prflJ2009327A peroxidase [Arabidopsis thaliana] spIP426201YQJG-ECOLI HYPOTHETICAL 37.4 KD PROTEIN IN EXUR-TDCC INTERGENIC
REGION
(0328) >gi174659841pirlIC6509 9 hypothetical 37.4 kD protein in exuR-tdcC intergemuc region Escherichia cobi (strain K- 12) >giJ6060431gbjAAA579O6. II (U 18997) ORE_o328 [Escherich-ia coli] >gij 1 7894891gb jAAC76 137. 11 (AE000392) putative transferase [Escherichia coli] AL024486.15 1 629a AL035538.245 I 16514_at AL049500.57 _T 16914_s-at pirI1T05635 hypothetical protein F20D 10.200 Arabidopsis thaliana >gil4467 114IembICAB 37548.11 (AL035538) putative, protein [Arabidopsis thaliana] >gil 2 7 0 7 9 1lembICAB8O473. i1 (ALl 61592) putative protein [Arabidopsis thaliana] spIP50700IOSL3_ARATH OSMOTIN-LIKE
PROTEIN
OSM34 PRECURSOR >g'11362001 1pir115752 4 osmotin precursor Arabidopsis thaliana >giJ8873901embjCAA6141 1.1! (X89008) osmotin [Arabidopsis thaliana] I. AL049638. 193 20029_at pirlITO66l5 hypothetical protein F1 6J 13.150 Arabidopsis thaliana >gil458611 ljembICAB40949. 11 (AL049638) putative DNA-binding protein [Arabidopsis thaliana] >gi17267909lembICA.B7825 1.11 (ALI 61533) putative DNA-binding protein [Akrabidopsis thaliana] 153- Case S-5O15A16/78INAD 00 00 Affy I Description A rr~~dan Al D49730.104 18983_s-at AL080253.32145a A.L0802 82.74 1 8597_at ATAJ2596 16085_s-at ATHORF 16649_s-at ATP1N2 12932_s-at ATU 10034 15120_s-at "pl. 11S 4 255 2 proline-rich protein rape-rc >gij545029jgbjA.AC6O5 6 6 11 (S68 113) proline-rc 1 [Brassica napus=oilseed rape, pods, Peptide, 147 aal" gblAAF08575.1 IACOl 1623.8 (AC0 11623) unknown protein [Arabidopsis thaliana] pirlIT1062 4 reticuLine oxidase homolog F21C20.170 Arabidopsis thaliana >gi15262222lembICAB458 4 8. 11 (AL080254) reticuline oxidase-lie protein [Arabidopsis thalianal >gi17268878lembiCAB79082. 11 (ALI 61553) reticu line oxidase-like protein [Arabidopsis thalianal embiCAB 16787.11 (Z99707) patatin-like protein [Arabidopsis thalianal >gil7270656lembICAB803 7 3 11 (A.L161590) patatin-like protein [Arabidopsis thaliana] gblAAF16563.1IAC0I25 6 3 -16 (AC012563) putative Sadenosyl- L- methiollne: trans-caffeoyl- Coenzyme A methyltransferase [Arabidopsis thalianal gblAAD04377. 11 (AF089085) putative auxin efflux carrier protein; AtPIN1I [Arabidopsis thaliana] spIQ42521IJDCE I_-ARATH GLUTAMATE DECARBOXYLASE 1 (GAD 1) >gij497979jgbIAAA93 132. 11 (U 10034) glutamate decarboxylase [Arabidopsis thaliana] gblAAB47973.l11 (U57320) blue copper-binding protein 11 [Arabidopsis thaliana] dbjIBAA24282.1I (AB000094) inorganic phosphate transporter [Arabidopsis thaliana] dbjjBAA82824. 11 (AB023462) basic endochitinase [Arabidopsis thaliana] gblAAA62426. 11 (L4003 1) S-adenosyl-Lmet hionine: trans-c affeoyl- Coenlzyme A methyltransferase [Arabidopsis thaliana] gblAAF294O6. 1IAC022354-5 (AC022354) unknown protein [Arabidopsis thaliana] gblAAB64244. 11 (U72 155) beta-glucosidase [Arabidopsis thaliana] ATU57320 117sa ATU 62330 15623_f-at BCH1 j13211 _-sat CAYFEROYLCO 13215_s_at A
METHYLTRANS
1 U72 155.2 14170_at 15954_at 154- Case S.50015A116/78/NAD 00 xcession S 1294.2 67421.3 A ffV Descriution
I
20422-gat I 6489at 00 :98319.2167sa (98320.2 18312...s-at (98321.2 19595-s-at gblAADOO5O9.11 (U8 1294) germin-tike protein [Arabidopsis thalianal pirlIS53Ol 2 root-specific protein RCc3 rice >giJ7861I321gbIAAA655 13. 11 (L27208) RCc3 [Oryza dhaliAa89gi48. 11 (AeBIC293 323. beta (X78586) nosdr4 [Arabidopsis thalianal embICAA66963. 11 (X983 19) peroxidase [A-rabidopsis thaliana] >gij 1 42921 7lembICAA6731 11.11 (X98775) peroxidase ATP12a [Arabidopsis thaliana] >gij67 14469 jgbjAAF26 155.1 I AC00826 1. 12 gblAAB7 1452. 11 (AC000098) Strong similarity to Arabidopsis peroxidase ATPEROX7A (gbIX9832 1).
[Arabidopsis thaliana] >giJ2738254jgbjAAB9466l .11 (U97684) peroxidase precursor [Arabidopsis thaliana] gblAAFO3466. 1 JAC009327~5 (AC009327) putative peroxidase [Arabidopsis thaliana] embICAA6734O.11 (X98808) peroxidase ATP3a [Arabidopsis thaliana] pirlITO 1626 peroxidase (EC 1. 11. 1. 7) ATP22a Arabidopsis thaliana >gij3O4558jgbjAAC090 3 l 1.11 (AC003673) peroxidase (ATP22a) [Arabidopsis thalianal embjCAA72484.1I (YI 1788) peroxidase ATP24a jArabidopsis thaliana] pirIIE7 1418 hypothetical protein Arabidopsis thaliana >gil2244897lembICAB 10319. 11 (Z97338) HSR201 like protein [Arabidopsis thalianal >gi17268287lembCAB7858 2 .11~ (ALI 61541) HSR201 like protein [Arabidopsis thaliana] X98855.2 16028_at Y 11788.1 18946_at Z97338.321 16045_s-at 155- Case S-5001 SA/16/78/NAD I DescriDtion A Am v Aces on I Z97 340.345 17485_s-at 1spIP524O7IE13B-HEVBR GLUCAN ENDO-1,3- BETA-GLUCOSIDASE, BASIC VACUOLAR ISOFORM PRECURSOR ((I1->3)-BETA-GLUCAN ENDOHYDROLASE) ((1->3)-BETA-GLUCANASE) (BETA-i ,3-ENDOGLUCANASE) >gil2 12991 21pir11565 077 1 ,3-beta-glucanase (EC 3.2. precursor Para rubber tree >gil 1846681gblAAA87456. 11 (U22 147) beta- 1,3glucanase [Hevea brasiliensis]' gblAAC6I81 1. 11 (AC004667) putative AT-hook DNAbinding protein [Arabidopsis thaliana] emblCABl6788.lI (Z99707) patatin-like protein [Arabidopsis thaliana] >gi17270655lembICAB803 7 2 11 (AL161590) patatin-like protein [Arabidopsis thaliana] Z97344.151 19886_at Z99707.288 18326_s-at 156- Case S-50015A/16/78/NAD Table 9 shows expression results from an acute (3 hour) response to stress, either up or down, to cold, mannitol, or salt in roots but not in leaves. Of the nine root-specific promoters shown in Table 8, one (SEQ ID NO:8) did not show a response to any of the stresses, two (SEQ ID NOs. 47 and 48) were downregulated in response to cold, mannitol and stress, four (SEQ ID NOs:4, 7, 28 and 30) were upregulated in response to at least one of the stresses and downregulated in response to at least one of the stresses, and two (SEQ ID NOs:25 and 28) were only downregulated by salt stress.
Table 9: Accession Affy id Cold Cold Root3 Root27 Man Root3 Man Root27 Salt Root3 Salt Root27 Roots AC006577.16 ATU57320 X98808.1 U81294.2 Z97338.321 X98855.2 AC006577.16 -X78586.2 ATU62330 NOVARTIS51 AC005560.136 AF098630.3 AF128395.12 Z97340.345 AL035538.245 'X98322.2 ATU 10034 SAL049730.104 12778_r_at 15137_s_at 15985_at 20421 _at 16045_s_at 16028_at 12779_f_at 16048_at 15623_fat 14170_at 16016_at 19118_s_at 20395_at 17485_s_at 16514_at 17942_sat 15120_s_at 18983_s at -1985 -729 -2123 -19 -1068 -448 -672 56 -1274 -1058 93 228 -286 -691 200 -366 -102 322 -3753 -219 1183 2399 -694 -691 -763 603 373 537 643 422 -508 -1934 -498 54 134 -51 -2768 -1304 -1881 -1162 -1084 -595 -636 -576 -1054 -654 25 -52 -482 -357 798 -285 -336 -272 -363 992 -312 345 124 -589 -419 307 141 -14 628 -37 -115 -592 935 4 -80 -167 -4018 -2420 -2331 -1450 -1425 -1043 -976 -881 -817 -718 -648 -640 -621 -529 -490 -457 -456 -439 -1769 141 -343 371 -285 -559 -559 -588 439 16 -232 -117 261 -454 -118 3 -570 157 Case S-50015A/16/78[NAD Accession Ally id Cold Root3 Cold Root27 NIan Root3 Man Root27 Salt Root3 Salt Root27 I i AJ 133036.5 U72 155.2 X983 19.2 U81294.2 X67421 .3 Y 11788.1 ATPIN2 ACOOS3 10.6 ACOO7 135.23 AC0065 87. 164 AC004521 .114 X9832 1.2 AC002333. 199 AL0244 86.185 AC004521.119 A71597.1 AC0062 16.26 AC0062 16.22 AL080253.32 AC004683.79 X745 14.2 AL080282 .74 AC002333.2 10 X745 14.2
CAFFEROYLCOA-
METHYLTRANS
AC004005. 104
ATHORF
AC003673.201 15969_s-at 1 5954_at 1697 1 _s-at 20422_g-at 16489_at 18946_at 12932_s-at 17697_at 20176_at 15859_at 19195_at 19595-s-at 13552_at 16299_at 20608_s-at 12079_s-at 1857 1 -at 14050_at 19415_at 1646 1 _i-at 20239..g-at 18597_at 13153_r-at 20238_at 13215_s-at 19390_at 16649_s-at 1648 1 _s-at -316 52 -368 -96 446 100 -172 -99 -37 91 -410 -50 -205 -162 -201 -185 -46 -45 112 -145 13 -251 -5 288 42 -77 54 -38 -619 -178 9 530 200 146 -182 18 82 134 93 -149 -418 -165 96 -153 55 14 -132 -621 213 161 -186 553 33 0 112 -106 74 -86 -291 -272 -158 -58 -158 -97 260 29 -322 -66 167 -76 -119 79 23 -23 107 -136 60 -58 48 174 38 -121 43 16
I
-465 -447 -62 43 -41 -21 -67 -15 137 13 -36 0 101 -47 -7 -142 -26 -14 118 -164 -91 120 -82 115 -20 37 17 -22
I
-400 -388 -368 -341 -323 -199 -170 -139 -120 -117 -96 -95 -89 -80 -75 -74 -71 -62 -56 -17 1 4 9 10 12 13 16 17 -470 -252 -86 32 -357 124 -128 -23 -81 -8 73 -148 -8 -108 142 1 -24 -51 302 -56 -16 -8 -28 158 Case S-50015A./16/78/NAD Accession Affy id Cold Root3 Cold Root27 Man Root3 Man Root27 Salt Root3 Salt Root27 ATAJ2596 AC002333.2 10 AC004684. 165 AL049638. 193 A71588.1 A71596.1 Z99707 .288
ACHI
ACOOSS6O. 147 X98320.2 AC002391. 150 AC005967.50 AC007060. 34
BCHI
ACOO 1645. 19 AB023448.2 ACOG 1645.47 AL049500.5 7 AC007584.4 8 16085_s-at 13154_s-at 17907_s-at 20029_at 14015_s-at 14016_s-at 18326_s-at 12852-s-at 12758_at 18312_s-at 17843_s-at 17864_at 19840_s-at 13211 _-sat 15965_at 12332_s-at 15996_at 16914_s-at 20194_at 128 _-6 -154 45 -130 -104 150 -25 33 38 79 37 606 99 -323 170 -160 96 288 -137 -511 -52 41 138 99 -110 36 -822 29 170 133 1194 -554 -177 -704 -167 -2596 0
I
240 168 -3 35 164 132 309 97 362 293 26 41 304 337 141 421 215 366 848
I
64 -224 106 -42 -23 -15 19 -7 357 21 15 -37 -145 -242 -437 -130 -162 -818 259
I
30 31 40 64 79 98 99 114 121 131 177 196 286 312 355 370 445 541 1016
I
-47 -172 146 -14 -4 185 -275 -389 -374 -147 -1265 -116 Accession Affy id Cold Leaf3 Cold Leaf 27 Man Leaf 3 Man Leaf 27 Salt Leaf 3 Salt Leaf 27 Leaves AC006577. 16 ATU57320 X98808.1I U81294.2 Z97 338.321 X98855.2 12778_r_at 15137_s-at 15985_at 20421_at 16045_s-at 16028_at -89 63 -136 -8 -8 -16 -79 -93 -19 -4 -13 159 Case S.50015A116/78[NAD Accession Affy id Cold Cold Leaf 27 Man Leaf 3 Man Leaf 27 Salt Leaf 3 Salt Leaf 27 I I I I AC006577. 16 X78586.2 ATU62330 NOVARTISS 1 AC005560. 136 AFO9 8630.3 AF128395. 12 Z97340.345 AL-035538.245 X98322.2 ATU 10034 AL-049730. 104 AJ 133036.5 U72 155.2 X983 19.2 U81294.2 X6742 1.3 Y 11788.1 ATPIN2 ACOOS3 10.6 ACOO7 135.23 AC006587. 164 AC004521.1 14 X98321 .2 AC002333. 199 AL024486. 185 AC004521.1 19 A71597.1 AC00621 6.26 127793_at 16048at 15623_fat 14170at 16016_at 19118-s-at 20395at 17485_s-at 16514_at 17942_s-at 15120_s-at 18983_s-at 15969_s-at 15954_at 1697 1 _s-at 20422-gat 16489_at 18946_at 12932_s-at 17697_at 20176_at 15859_at 19195_at 19595_s-at 13552_at 16299_at 20608_s-at 12079_s-at 18571_at -83 69 -3 -188 1 1 3 103 15 -1I 10 -6 4 4 -4 12 -3 -177 -13 -3 8 -51 -35 2 4 -15 -18 -4 -1I -57 96 8 103 1 0 -9 1 -619 10 0 -85 13 13 4 3 0 21 -203 2 3 -62 2 -4 7 -139 -22 -47 149 -4 -258 7 -6 10 20 6 -2 -3 0 12 0 3 6 -5 -175 -2 -1I 0 -54 -12 -26 -15 -5 2 78 42 -311 7 1 3 -200 10 -2 -81 14 13 -7 -2 9 0 -204 -3 1 -47 0 2 -33 -2 -10 10 36 49 -310 4 -2 6 -54 5 2 -3 -4 25 4 11 -2 -158 -3 0 -56 -3 0 1 -31 2 5 4 -53 -34 81 -14 -195 -2 -521 -2 1 7 7 -2 -4 2 285 -6 -6 -21 2 6 -6 -7 160 Case S-5O15AI16/78/NAD Accession Affy id Cold Cold Man Leaf3 Leaf 27 Leaf 3 AC006216.
2 2 14050_at -2 1 -3 A.L080253.3 2 19415at 6 0 3 AC004683.
7 9 1646 19_iat 26 0 8 X74514.2 20239_gat -11 84 4 AL080282.
7 4 18597at -62 284 27 AC002333.
2 1 O 131 53-r-at 52 -23 41 X74514.2 20238at -9 218 0 ,Armoc1VffCA 121 s at 20 31 7 Man Leaf 27 Salt Leaf 3 Salt Leaf 27
METHYLTRANS
AC004005. 104
ATHORF
AC003673.
2 Ol ATAJ2596 AC002333.
2 lO AC0046 84.165 AL049638.19 3 A7 1588.1 A71596.1 Z99707 .288
ACHI
AC005560. 147 X98320.2 AC002391 .150 AC005967.5O AC007060.
3 4
BCHI
ACOO 1645. 19 AB023448.2 ACOO 1645.47 1 9390..at 16649_s-at 16481 _s-at 16085_s_at 13154_s-at 17907_s-at 20029_at 14015_s-at 14016_s-at 18326_s-at 12852_s-at 12758_at 18312_s-at 17843_s-at 17864_at 19840_s-at 1321 1_s-at 15965_at 12332_s-at 15996_at 8 47 3 0 74 17 -4 5 8 1 16 2 1 416 8 -80 44 -24 127 5 -3 39 0 -1 -63 -29 -18 -7 -3 2 -6 -2 -53 8 169 -94 -3 -172 -10 -3 9 0 -9 198 16 -6 2 9 487 5 106 -1 -22 9 6 -4 0 17 -60 36 35 -112 0 2 5 2 75 25 -5 -6 -2 3 9 10 5 239 10 105 -13 -4.
-10 -6 -2 2 14 -55 -40 -6 -180 1 4 -2 1 -3 0 13 -1 0 8 3 -2 184 -2 37 25 9 29 -13 -8 7 -8 -84 -9 -3 63 0 -54 -27 -133 2 6 21 -48 23 -42 -194 -8 161 Case S-5OOI5A/I6/78fNAD 00 SAccession .nA1049500.57 AC007584.48 Affy id Cold Cold Leaf3 Leaf 27 Man Man Leaf 3 Leaf 27 Salt Salt Leaf 3 Leaf 27
I
16914_s-at 265 20194_at 27 341 182 19 -7 78 -354 78 62 30 32 00 162 Case S.5001SA116/78/NAD Table lOA-D summrrarize the root genes up- or down-regulated in response to cold, mannitol or salt stress.
Table IlOA: Afl'v I Description Aeee~inn ccsso Acute (3 hr) manitol stress response downregulated root genes AC006577. 16 X98808. 1 ATU57320 U81294.2 Z97 338.321 ATU62330 12778_r-at 1gblAAD25772. 1 IAC006577~8 (AC006577) Belongs to the PF100657 Lipase/Acyihyd ro lase with GDSLmotif family. ESTs gbjT44453, gbIT04815, gbIT45993, gbIR3O 138, gbIAI099570 and gbIT22281 come from this gene. [Arabidopsis thalianal' 15985_at embICAA6734O. 11 (X98808) peroxidase ATP3a [Arabidopsis thalianal
I
15137_s-at gblAAB47973. 11 (U57320) blue copper-binding protein 11 [Arabidopsis thaliana] 1 4
I
2042 1 -at embICAB 10242. 11 (Z97336) germ-in precursor oxalate oxidase [Arabidopsis thalianal I i 16045_s-at embICAB 10318. 11 (Z97338) HSR201 like protein [Arabidopsis thaliana] I I- 15623_f-at dbjjBAA2 1503.11 (D8659 1) inorganic phosphate transporter [Arabidopsis thaliana]
I
AC006577.16 12779jf-at 1gbIAAD25772. 1 IAC006577~8 (AC006577) Belongs to the PF100657 Lipase/Acymhydro lase with GDSLmotif family. ESTs gbIT44453, gbITO48 15, gbIT45993, gbIR3O 138, gbIAI099570 and gbIT22281 come from this gene. [Arabidopsis thaliana]V
I
X98855.2 A17128395.12 Z97340.345 16028_at embICAA6736 1.11 (X98855) peroxidase ATP8a [Arabidopsis thaliana]
I
20395_at 1gbIAADl17355.l11 (AFI 28395) contains similarity to pathogenesis-related protein I precursors and SCP-like extracellular proteins (Pfam: PFOO 188, Score=79.
8 E=4.lIe-2 1, N= 1) [Arabidopsis thaliana]'
I
17485_s-at "embICAB 10405. 11 (Z97340) beta-i1, 3-glucanase class I precursor [Arabidopsis thaliana]'
I
ATU 10034 115 120-s-at gblAAA93 132.11 (U 10034) glutamate decarboxylase [Arabidopsis thaliana] 163 Case S-50015A116/78/NAD 00 00 Accession Affy Description AC004521.1 14 19195_at gblAAC16105.11 (AC004521) unknown protein [Arabidopsis thaliana] X98319.2 1697 1 _s-at embICAA66963. 11 (X98319) peroxidase Arabidopsis thaliana] X98322.2 17942_s-at em-bICAA66966.1I (X98322) peroxidase [Arabidopsis thaliana] U81294.2 20422-g-at embICAB 10242. 11 (Z97336) germin precursor oxalate oxidase [Arabidopsis thaliana] XL049730.104 18983_s-at embICAB4l72 1. 11 (AL0497 30) pEARLI 1 -like protein [Arabidopsis thalianal ATPIN2 12932_s at gblAAC84O42.I11 (AF087459) polar- auxin-transport efflux. component AGRAVITROPIC I [Arabidopsis I thaliana] X6742 1.3 16489_at embICAA478O7.1I (X67421) extA [Arabidopsis thaliana] AC004683.79 16461 _I-at gblAAC28766. 11 (AC004683) peroxidase [Arabidopsis thaliana] ACOO4005.104 19390_at gblAAC23409. 11 (AC004005) unknown protein [Arabidopsis thaliana] AC004521.1 19 20608_s-at gbjAAC 16106. 11 (AC00452 1) hypothetical protein [Arabidopsis thaliana] Manitol stress response upregulated in root genes only (acute response) AL080253.32 19415_at embiCAB458O5. 11 (AL080253) putative protein [Arabidopsis thaliana] A71596.1 14016_s-at embICAB42592. 11 (A7 1596) unnamed protein product [Arabidopsis thaliana) ACO0l 645.19 15965_at gblAAB6363 1 11 (ACOO 1645) jasrnonate inducible isolog_[Arabidopsis thaliana] A71588.1 14015_s-at embICAB42586. 11 (A71588) unnamed protein product [Arabidopsis thaliana] 164- Case S-50OI 5A116/78/NAD Accession AITy Description I I AC002333. 199 X745 14.2 ACOO 1645.47 ATAJ2596 ACOO7 135.23 13552_at gblAAB64O45. 11 (AC002333) endochitinase isolog [Arabidopsis thaliana]
I
20238_at embiCAA5262O. 11 (X745 15) bet a- fructofuranosidase [Arabidopsis thaliana] 15996_at 16085_s-at gblAAB63634.11 (AGOG 1645) jasmonate inducible protein isolog [Arabidopsis thalianal embICAB 16787. 11 (Z99707) patatin-like protein [Arabidopsis thafiana] j20176 at
I
gblAAD26967. 1 IACO7 1.35_3 (ACOO7 135) unknown protein [Arabidopsis thatiana] X98320.2 18312_s-at embICAA673 10. 11 (X98774) peroxidase ATP6a [Arabidopsis thaliana] Z99707.288 18326_s-at embICAB 16788. 11 (Z99707) patatin-tike protein [Arabidopsis thaliana] BCHI 13 211 _s-at dbjIBAA82825. 11 (AB023463) basic endochitinase [Arabidopsis thalianal AC005560.147 12758_at gblA.AC67329.11 (AC005560) putative major latex protein [Arabidopsis thalianal AL049500-57 16914_ s-at embICAB39936. 11 (AL049500) osmotin precursor [Arabidopsis thaliana] AB023448.2 12332_s-at dbilBAA828 10. 11 (AB023448) basic endochitinase [Arabidopsis thaliana] A.L035538.245 16514_ at embICAB37548. 11 (AL035538) putative protein [Arabidopsis thaliana] AC007584.48 20194_at gblAAD32907. 1 JAC007584_5 (AC007584) unknown protein [Arabidopsis thaliana] 165 Case S-5OOI5AI16/78/NAD 00 F able lOB: kccessiofl Affy#_ Description Salt stress acuti AC006577.1 6 12778_r-at erespone down regulated root only 0ATU5732' 00 SX98808.1 0 15 137_s-at 15985_at X98855.2 16028_at AC006577.1 6 12779_f~at IgblAAD25772.l I AC0065778 (AC006577) Belongs to the PF100657 Lipase/Acythydrolase with GDSLmotif family. ESTs gbIT44453, gbITO48 15, gbIT45993, gbIR3O 138, gbIA109957O and gbIT22281 come from this gene. [Arabidopsis thaliana]" gblAAB47973. I1 (U57320) blue copper-binding protein 11 [Arabidopsis thalianal gmbAAD27720.l (X988078) (AC006577) Belong to the opF100657LiasAcyhdoaewt
DL
emotif aBily. ES1 gbZ973 gbjTO48 15cuso oxbte59 thidsgene [Arabidopsis thaianaV' dmbICB 103.11 (Z97338) inoRgani pohte tastr[Arabidopsis thaliana gmbIAA673. 11 (X988556) putxiatie majo ae prte Arabidopsis thaliana 1gbAAD1575.1l AF128395) (c00 eontanglriyt emrclrpotiffml.eis (Pfarn:3 PI001, core=7 9 Eh4. Iene2 [AaI, os adpiha iana 1 (Z7586) bet[aIoi 3-luanacls Iprecus [Arabidopsis thaianal" X78586.2 ATU62330 S16048at 15623_fa AF128395.12 20395_at Z97 340. 345 ,17485_s_at 166 Case S-50015A/16/78/NAD A4cession Affy ,kL035538.2 4 5 16514_at X98322.2 17942-s-at ATU 10034 15120_s-at Description A.L049730-10 4 18983_s-at A] 133036.5 U72 155.2 15969-s-at 15954_at X98319.2 16971 _s-at U8 1294.2 20422-gat embICAB37548.l11 (AL035538) putative protein [Arabidopsis thaliana] embiCAA66966. 11 (X98322) peroxidase [Arabidopsis thaliana] ebIABIO42. 1 (Z9034) gerramat precursoroxlae ode[Arabidopsis thal iana] embCAA417 10. 11 (X604742 pEtAR [Arabioprote thairi hbaa gmbjAA84O42. 11 (X908777) poaruxin-e TPr apr th~aina] tahna gbAAC33493.11 (U72155 eukown proen (Arabidopsis thalana] gmbiAA6696. 11 (X983719 353xias (ACO7 poteins [Arabidopsis thaliana] X67421 .3 16489_at 12932_s-at ATPIN2 AC0053 17697_at ACOO7 135.23 20176_at
I
Salt stress acute respone up regulated root only AC005967.5O 17864_at AC007060.3 4 19840_s-at gblAAD033 87. 11 (AC005967) unknown protein (Arabidopsis thalianal gblAAD25759. 1 IAC0070601 7 (AC007060) Strong similarity to F1913.2 gi130 3 33 7 5 putative berberine bridge enzyme from Arabidopsis thaliana BAG gbiACOO423 8 EST gbJR9O5 18 comes from this gene.
dbjIBAA82825. 11 (AB023463) basic endochitinase [Arabidopsis thaliana] BCHI 131 s-at 167 Case S-50015A/16/7"/AD Accession ACOO 1645. 19 Affy 15965_at Description AB023448.2 12332-s-at AC001645.
4 7 15996_at AL049500.57 16914_s-at AC007584.48 20194_at gbIAAB6363 1. 11 (ACOO 1645) jasmonate inducible protein isolog [Arabidopsis thaliana] dbjIBAA828 10. 11 (AB023448) basic endochitmnase [Arabidopsis thaliana] gbIAAD6332O. 11 AC071645 (aC0075t 84)dunknown protein islo Arabidopsis thaa 168- Case S-50015A/16/78/N'AD I 00 0 Table IlOC: Accession 'nX98321.2 AC00621( 00 AC006214 Affv I Descriptionl :a
I
Genes expressed in root that have no acute response to stress .26 6.22 AL080253.32 X745 14.2 AC002333.2 10
CAIFFEROYLCO
AMETHYLTRAN
S
ATHORF
AC003673.20 1 AL049638. 193 19595_s-at embICAA66965. 11 (X9832 1) peroxidase [Arabidopsis thaliana] 1857Lat gbjAADl12681. 11 (AC006216) Similar to giJ3 4 l 3 7 l 4 T1I9L 18.21 putative myrosinase-binding protein from Arabidopsis thaliana BAC gbIAC004747. ESTs gb165870 and gbT208l2 come from this gene.
14050_at gbiAA.D12679. 11 (AC006216) Similar to giJ3 4 1 3 7 l 4 T191_18.21 putative myrosinase-binding protein from Arabidopsis thaliana BAG gbIAC004747. ESTs gbIT44298, gbIT42447, gbIR6476 1 and gbIl 100206 come from this gene." 19415_at embICAB458O5.lI (A.L080253) putative protein [Arabidopsis thalianal 20239-g-at embICAA5262O. 11 (X745 15) beta- fructofuranosidaSe [Arabidopsis thaliana] 13153_r at gbIAAB6432O.11 (AC002335) endochitinase isolog [Arabidopsis thalianal 13215_s-at gblAAA62426.11 (L40031) S-adenosyl-Lmet hionine: trans-caffeoyl-Coeflzyme A methyltransferase [Arabidopsis thaliana] 16649_s-at gblAAA62426.11 (L40031) S-adenosyl-Lmethionine:trans-taffeoy-Coeflzyme A methyltransferase [Arabidopsis thaliana] 16481 _s-at gbIAACO9O3 1. 11 (AC003673) peroxidase ATP22a [Arabidopsis thaliane] 20029_at embICAB4O949. 1 (AL049638) putative DNA-binding protein (Arabidopsis thalianail 169 Case S.50015Afl6/79/NAD 00 00 Figure I OD: Affv I Description tALL ~3 via I I- Down reg ulated with cold stress in root (acute response 3 hrs)
I
X98808.1 AC006577. 16 1 5985_at 12778_r-at AC006577.16 12779_fat embiCAA67 340.1 (X98808) peroxidase ATP3a [Arabidopsis thalianal .gblAAD25772. I AC00 6 5 7 7 8 (AC006577) Belongs to the PF100657 Lipase/Acyihyd ro lase with GDSLmotif famrily. ESTs gbIT44453, gb1T04815, gbIT45993, gbIR30l38, gbIAJ099570 and gbIT22281 come from this gene. [Arabidopsis thalianal" dbjIBAA21 503.1 (D8659 1) inorganic phosphate transporter [Arabidopsis thaliana) motif family. ESTs gb1T44453, gbjT048 15, gbIT45993, gbIR3O 138, gbIA.1099570 and gbjT22281 come from this gene. [Arabidopsis thaliana]" embICAA67361. 11 (X98855) peroxidase ATP8a [Arabidopsis thaliana] gbIAAC16105. 11 (AC00452 1) unknown protein [Arabidopsis thaliana] X98855.2 16028at AC004521-11 4 19195_at X983 19.2 1697 1_-sat X98322.2 192sa AC00 1645. 19 15965_at AJ 133036.5 15969_s-at AF128395.12 20395_at embICAA66963.11 (X983 19) peroxidase [Arabidopsis thaliana] embjCAA66966.11 (X98322) peroxidase [Arabidopsis thaliana] gbjAAB6363 1. 11 (ACOO 1645) jasmonate inducible protein isolog [Arabidopsis thaliana] embICAA67 313. 11 (X98777) peroxidase ATP I6a [Arabidopsis thalianal 1gbjAAD 17355. 11 (AF1 28395) contains similarity to pathoge nesis- related protein I precursors and SCP-like extrace~ular proteins (Pfam: PRO 188, Score=79.
8 E=4. Ie-2 1, N= 1) [Arabidopsis thaliana]" embICAB45 88 1. 11 (AL080282) berberine bridge enzyme-like protein [Arabidopsis thalianal A.L080282.747-- 18597_at 170- Case S-500I5AJI6/78INAD Accession AfTy AC002333.199 13552_at AC004521.11 9 20608_s-at A7157.1 1015s-at Description gbJAAB64045.l11 (AC002333) endochitinase isolog [Arabidopsis thalianal gbjAAC 16106. 11 (AC00452 1) hypothetical protein [Arabidopsis thalbana] erbICAB426.l1 (A71158) unnamed protein product [Arabidopsis thalana] Upregulated in root with cold stress AL035 538 .245 16514_at AF098630.3 19118 s-at AC007584.48 20194at X74514.2 20238at AL049730-104 18983-s-at X67421.3 16489at embiCAB3754 8 ij (AL035538) putative protein [Arabidopsis thatiana] embjCkB4 1725. 11 (AL049730) putative ceU wallplasma membrane disconnecting CLCT protein 171 Case S-50015A/16/78/NAD 00 Accession Affy Description AC007060.34 19840_s-at gblAAD25759.1IACOO 7
O
6 O-l 7 (AC007060) Strong silarity to F1913.2 gi1303 3 37 5 putative berberine bridge enzyme from Arabidopsis thaliana BAC gbiAC004238. EST gbJR90518 comes from this gene.
00 172- Case S-5O15A16/78[NAD Table I1I provides a description of the corresponding genes for Arabidopsis promoters which were constitutively expressed.
Table 11: Gene i1D Accession on chiD Affy Description -1 4- 1 A45785.
_-SAT
A45785.1 19852_s-at AB003522.2_AT AB003522.2 12381_at A.B004872.6-S_AT AB004872.6 15997 s at AB005560_SAT AB004872.6 15630_s-at AB006693.l IAT A.1006693.1 17438_at AB008105_SATl A.B00810 170 44s at embICAAO284O.11 (A45785) unnamed protein product [Arabidopsis thalianal dbjlBAA84392. 11 (AP000423) ATPase beta subunit [Arabidopsis thalianal dbjlBAA23547.11 (A.B004872) C0R47 [Arabidopsis thalianal dbjIBA.A22 504. 11 (ABOOS5560) AtGDI2 [Arabidopsis thaliana] dbjlBAA24536. It (AB006693) spermidine synthase [Arabidopsis thaliana] dbjlBAA32420.1 ll(ABOO8 105) ethylene responsive element binding factor 3 [Arabidopsis thaliana] dbjIBAA3l 143. 11 (AB010915) responce regulator 1 1Arabidopsis thaliana] dbjlBAA25248.11 (AB008854) 3ketoacyl-CoA thiolase [Arabidopsis thalianal dbjlBAA248O4.1I (AB010946) AtReri B [Axabidopsis thaliana] dbjlBAA32735.1I (ABOl 11545) GF14 mu [Arabidopsis thaliana] thaliana] gbjAAC 144 11. 11 (AF049236) putative acyl-coA dehydrogenase [Arabidopsis thaliana] AB008487_SAT AB0084 87 15127_s-at AB008854_SAT AB008854 14719_s-at AB0 10946_SAT AB0 10946 15200_s-at AB01I1545_SAT ABOl 11545 15163_s_at AB017643_SAT AB017643 15164_s-at 173 Case S.50015A116/78/NAD 00 00 Gene ID Accession on chip AB021858-SAT AB021858 AB024282_SAT AB024282 Affy 16540_s-at 15128_s-at AB027151.2-S-AT AB027151.2 19179_s-at AC0001I03.25_SAT ACOOOIO3.
25 20709_s-at ACOOO 104.l1ORAT ACOOO 104.IO0 13076_r-at ACOOO1O4.26AT AC000 104.
2 6 1277 1 -at ACOOO1IO6.1 3 _SAT AC000 106.1 3 17900 s_ Description dbjIBAA77759.II (AB021858) plastid heme oxygenase lArabidopsis thaliana] embICAB7lO7 4 fl (ALI 32962) cysteine synthase AtcysClI gblAAB615 17.11 (ACOOOIO3) F21J9.25 [Arabidopsis thalianal gblAAB70426.11 (ACOOO 104) Strong similarity to 60S ribosomal protein L17 (gbIXO1694).
EST
tgblAAB7O4O1.lI (AC000106) Similar to Glycine SRC2 (gbIAB00013O). ESTs gbIH76869,gbIT2 1700,gbjATTS5O 89 come from this gene.
[Arabidopsis thaliana] t gblAAC3322O. 11 (AC003970) Putative ribosomal protein L21 [Arabidopsis thalianal gbjAA395597,gbjATTS5 197 come from this gene. [Arabidopsis thalianal gbIAA.B60721. 11 (ACOOO 132) Similar to elongation factor 1 gamma (gbIEF1GXENLA). ESTs gbIT20564,gbIT45940,gbIT0 4 5 2 7 come from this gene. [ArabidopsiS thatiana] AC000132.1 6 _SAT AC000132.1 6 163 s-a AC000132.6_AT AC000 132.6 16420_at 174- Case S-5OOI5AJI6/78[NAD Gene ID A002131.48_S_AT Accession on Affy Description chin AC002131.48 12750_s-at AC002329.Z _AT AC002 329.
4 6 I 3074.at AC002330.
3 9_AT AC002330.
39 13574at AC002332.lOOAT AC002332. 10 0 13105at AC002332.7LAT AC002332.
7 1 17435 at AC002334.l 10-GAT AC0023 3 4 .1 10 16940-g-at AC002336.l~ 101CAT IAC002336.l101 12809-g-at gbjAAC 17620. 11 (AC00213 1) Identical to aspartic proteinase cDNA gbJU5 1036 from A.
thaliana. ESTs gbIN963 13, gbIT2 1893, gbIR30158, gbIT2 1482, gbIT43650, gbIR64749, gbIR65 157, gbIT88269, gbIT44552, gbIT22542, gbIT76533, gbIT44350, gbIZ34591, gblAA728734, g embICAA54O95. 11 (X7665 1) ribosomal protein S4 [Solanum tuberosum] gbjAACO429.1 I(IAC782694 (A03)putative synaptbrevi ATe[Arabidopsis tha ana l gbAAB87659.I11 (AC002332) ribosomal protein S263Arbdpi gbIAAB86O. I i (AC00233) prti Aiossthalianal gbAAC042.lI (AC00252 1 putative uyabrqiinnuail ezE2[Arabidopsis aaa 165O7at AC002339.51_AT AC0023 39.51 AC002343.3AT AC002343.
3 16447at AC002521.I 4 6 _AT AC002521.l 4 6 16917at 175- Case S-50015A116/78/NAD 00 Gene ID AC00256 Accession on Affy Description chir) I I I I I 18655_at p1.51 _AT AC002561 .51 18655at AC003672.6 4 _SAT ACO003672.- 6 4 20425 s-a~t gblAAB88646. 11 (AC00256 1) unknown protein [Arabidopsis thalianal gbiAAC27463. 11 (AC003672) putative small GTP-bmnd Ing protein [Arabidopsis thalianal gblAAC 14060. 11 (AC00398 1) F22013.34 [Arabidopsis thalianal 00 AC003981.3 4 _SAT A03981 3 1623sat AC004077.1 6 ;_S_AT AC 004077.- 16 6 17004_5_at AC004165.1OSAT AC004165.1OS 13125 at AC004218.
8 3 _SAT AC004218.8 3 1 3616_s-at AC004393.
2 2 _AT AC004393.22 16953_at gblAAC267O8. 11 (AC004077) ribosomal protein L18A [ArabidopSis thaliana] gbl AACI16961.l11 (AC004165) putative ubiquitin activating enzyme (UBAl) [Arabidopsis gblAAC27837. 11 (AC004218) ribosomal protein L23A [Arabidopsis thaliana] gbjAAC 1879 2 .l11 (AC004393) Simrilar to ribosomal protein L 17 gbjX62724 from Hordeum vulgare.
ESTs gbIZ34728, gbIF1 9974, gbIT75677 and gbIZ33937 come from this gene. [Arabidopsis thaliana] gbIAAC 17825. 11 (AC00440 1) unknown protein [Arabidopsis thaliana] gblAAB87096.21 (AC002391) unknown protein [Arabidopsis thaliana] gblAAC64298. 11 (AC004450) 3isopropylmalate dehydratase, small subunit [Arabidopsis thatiana] gblAAC64306. 11 (AC00445O) unknown protein [Arabilopsis thalianal AC004401.119AT IAC004401.1l9 13594_at AC004401.1 4 OAT AC004401.140 12767_at AC004450.1 I -AT AC004450.11 18882_at AC004450.83_AT AC004450.83 18262at 176- Case S.50015A/16/7SINAD 00 00 Gene ID Accession on Affy Description chip AC004481.
8 4 _AT AC004481.
84 13102_at gbiAAC274Ol .lII (AC0044 81) putative protein transport protein SEC61 alpha subunit [ArabidopsiS thalianal AC00455 7 .lOAT AC004557.l10 17436_at gbj AAC 80610. 11(ACOO 4 5 5 7 F 17 L2 1. 10 Arabido psis t h a hanal AC004557.
2 OAT AC00455 7 2 O 17374_at gbIAAC8O62.I
(ACOO
4 55 7 F17L21.20 [Arabidopsis thaliana) AC004557.
8 _AT AC00455 7 8 18874_at gblAAC80608.11I(ACOO 4 55 7 F171_21.8 [Arabidopsis thalianal AC004665. 121I _SAT AC004665. 121 18629_s-at gblA.AC28542. 11 (AC004665) tff?!remorin [ArabidopsiS thalianal AC004665.
3 lI _S_AT AC004665.
3 l 1597Ls-at gblAAC28529. 11 (AC004665) AC004669.3 4 _AT AC004669.3 4 16430at AC004747.l 6 OSAT AC004747.1 6 O 15506_sa AC00516 9 2 2 I _AT AC005169.
2 2 l 18283_at AC005287.2OSAT AC00528 7 2 O 16027_s_ AC005287.
5 2 _AT AC005287.
5 2 14073_a AC005309.
2 Ol _I_AT AC00530 9 2 Ol 15570 intrinsic protein IB) [Arabidops Is thaliana) gbiAAC2072O.l11 (AC00466 9 glutathione S-transferase [Arabidopsis thaliana] tgbIAAC61214I.11 (AC005169)40 ribowma protein 53[ArabidopsiS thaliana] gbIAAC621 1. 11 (ACOOS 169)40 puaeribosomal protein L28rbiopi [Ahaidosi haiaa atgbIAAC6365O. 11 (AC005309) punknveiosoa protein L28si thkaina] tahna 177 Case S-5OO15AI16/7gJNAD 00 00 Gene ID Accession on Affy chip 7AC0053019.
6 4 SAT AC005309 6 4 16009_5at AC005388.
6 _SAT AC00538 8 6 12783_s-at AC005662.
3 OSAT AC005662.3 0 1647 1 _s-at 16952_s-al Description gblAAC63629. 11 (AC005309) glutathione S -transferase (GST6) [Arabidopsis thaliana] gblAAC64875.lI (AC005388) identical to gbIL14814 DNA for tissue -specific acyl carrier protein isoform 2 from A. thaliana. ESTs gblAA59735 1, gbIT4 1805, gbIH3687l, gbIR302 gblAA04254 9 gbIZ47650, gbIH76304 and gbjAA5973 4 8 come from this gene. [Arabidops gblAAC62877.l1 (ACOOS 397) eukaryotic translation initiation factor 3 delta subunit [Arabidopsis Identical to gb1U65638 Arabidopsis thaliana vacuolar type ATPase subunit A m.RNA. ESTs gbIN96435, gbIN96106, gbIN96189, gbIN96091, gblAA04228 6 gbIF14324, gbIW43643, gbIN96027, gbIN96299, gbjR29943, gbIT43460, gbIT43544, gbIT2247 gblAAC79595. 11 (AC005727) unknown protein [Arabidopsis t gblAAC73028.I11 (AC005 82 4 acidic ribosomal protein P2 [Arabidopsis thalianal AC005679.lIOSAT AC00567 9 .10 12775_sa AC005727.19L AT AC 0057 27.1 9 l l69O1at AC00582 4 .lO 7 _AT AC005824.1O 7 16527_ai AC00582 4 .l 14-AT AC005824.l 14 17910_-a 178- Case S-50015A/16/78/NAD Gene ID Accession on Affy Description I AC005824.21
_AT
ACOOS 824.21 13089_at AC005896.150_SAT AC005896.150 18603_s-at AC005897.156_S_AT AC005897.156 1 3572_s-at AC005936.95_AT I AC005936.95 16416_at AC005990. IOAT 1AC005990.10 13069at gbjAAC73Ol 5. 11 (AC005824) putative dTDP-glucose 4-6dehydratase [Arabidopsis thaliaria] gblAAC98O6O. 11 (AC005896) putative protein translocase.
[Arabidopsis thaliana] gbjAAC97246.11 (AC005897) formyltetrahydrofolate synthetase [Arabidopsis thaliana] gblAAC9722 1. 11 (ACOOS 936) protease inhibitor 11 [Arabidopsis thaliana] gblAAC98042. 11 (AC005990) Strong similarity to gbIM95 166 ADP-ribosylation factor from Arabidopsis thaliana. ESTs gbIZ25826, gb1R90191, gbIN65697, gblAA7 13150, gbIT46332, gblAA040967, gblAA7 12956, gbIT46403, gbIT46050, gbIA1lOO39l and gbIZ25043 come from gbjAAD 15447. 11 (AC006068) unknown protein [Arabidopsis thalianal gblAAD30634. 1 1AC006085_7 (AC006085) Unknown protein [Arabidopsis thatiana] gbIAAD 14525. 11 (AC006200) ribosomal protein L7 [Arabidopsis thaliana] gblAAD20124.l11 (AC006201) ribosomal protein L2 [Arabidopsis thaliana] gbjAAD 15390. 11 (AC006223) putative hydrolase [Arabidopsi'S thaliana] AC006068.93_AT AC006068.93 18645_at AC006085.15_AT AC006085.15 20562_at AC006200.119_AT AC006200.119 13132_at AC006201.107-SAT AC006201.10 7 16924_s-at AC006223.65_AT I AC006223.65 14089_at 179 Case S-5O15A16/78/AD Gene ID Accession on Affy Description chip AC00623 4 15 6 _AT AC00623 4 15 6 14099_at gblAAD2O913. 11(ACOO 6 2 3 4 unknown protein [ArabidopSiS thalianal AC006260.5 2 _AT AC00626O.
52 12769_-at gbjAADl 8142. 11(AC 0 0 6 2 6 0 aquaporin (plasma membrane intrinsic protein 2B) [ArabidopsiS thallana] AC00626 4 .3 0 _AT AC00626 4 3 O 13095_at gblAAD29800. I AC00 6 2 6 4 8 (AC006264) putative signal sequence receptor, alpha subunit AC006300.l 12_AT AC006300.l 12 16948.at gblAAD2O7O8.11 (ACOO630 0 putative glucose regulated repressor protein [Arabidopsis thaliana] AC006300.
7 OAT ACOO630 0 7 O 16487at gblAAD20704.l1I (AC006300) putative d joxygenase [Arabidopsis thalianal AC006403.1 110_AT AC00640 3 .l 10 18223at gbIAADl18124. 11 (AC006403) unknown protein [Arabidopsis thalianal AC006438.
2 lAT AC048 2 24_t gblAAD41971.1IACOO 6 4 3 8 3 (AC006438) similar to cold acclimation protein WCOR413 AC006526.
5 '_AT AC00652161 5 51] 14103at AC006532.47LAT AC00653 2 4 7 1 9940at gbIAAD20O9O.l1 (AC00653 2 putative endosomal protein [Arabidopsis thaliana] 180- Case S-50015A116/7SINAD Gene ID Accession on Affy Description chin 1 4 AC006577.32_AT AC006577.32 1694 1 at AC006585.146_AT AC006585. 146 14565_at AC006586.141_AT AC006586.141 17390_at AC006592.15O_S_AT AC006592.15O 15980_s-at AC006841. 122-AT AC006841.1 2 2 19650_at AC006919.14OAT AC006919.140 12742_at gblAAD25780. IJAC00657716 (AC006577) Similar to gbIU55861 RNA binding protein nucleolysin (TIAR) from Mus musculus and contains several PF100076 RNA recognition motif domains. ESTs gbIT2 1032 and gbIT44 127 come from this gene. [Arabidopsis thaliana] gblAAD23Ol9.1I1AC0065 8 5 14 (AC006585) putative steroid binding protein [Arabidopsis thaliana] gblAAD22696. 1 1AC006586_5 (AC006586) 40S ribosomnal protein S16 [Arabidopsis thaliana] embICAA47427.l11 (X67034) Athb- 6 [Arabidopsis thaliana] gblAAD23 699.1 I AC00684 115 (AC006841) coatomer alpha subunit [Arabidopsis thaliana] gblAAD24635. I AC00691 9 (AC0069 19) enolase (2-phospho- D-glycerate hydroylase) [Arabidopsis gblAAD24640. 1 JAC006919_20 (AC0069 19) putative pyruvate kinase [Arabidopsis thaliana] gbIA-AD2 1434. 11 (AC00692 1) unknown protein [Arabidopsis thaliana] gblAAD31573.I AC0069 2 2 (AC006922) putative sadenosylmethionine synthetase [Arabidopsis thaliana] gblAAD31569. I AC0069 2 2 _1 (AC006922) putative aquaporin (tonoplast intrinsic proteiin gamma) AC006919.171 IAT AC006919.1 7 1 13070_at AC006921.52_AT AC006921.52 16511 -at AC006922.106_AT AC006922.106 12412_at AC006922.28_S-AT AC006922.28 15962_s-at 181 Case S-5O15AI16/78[NAD 00 SGenelID AC0069: Accession on ch in Affy Description I
I
~9.77_AT AC006929- 7 7 13150_at AC006951.'208-S-.AT IAC006951.
2
O
8 I13107_s-at ACOO7O 17 .278-SAT ACOO7O 17.278 20024_s-at AC007019.1OSAT AC007019.105 16022_at AC007070. i 67-AT AC007070. 167 13166_at AC007071.7 2 _AT AC007071.72 16933_at gbIAAD2 1502. 11 (AC006929) putative rubisco subunit bindingprotein alpha subunit [Akrabidopsis thalianal gblAAD25839. I JAC00695 1 -18 (AC006951) 40S ribosomal protein S 17 [Arabidopsis thaliana] gblAAD2 1476.11 (ACOO7O 17) unknown protein [Axabidopsis thaliana] gblAAD2O4O5. 11 (ACOO7O 19) putative ATP synthase [Arabidopsis thaliana] embICAA64728. 11 (X95458) ribosornal protein L39 [Zea mays] gbIAAD24852. 1 JAC00707 124 (AC007071) 40S ribosomal protein; contains C-terminal domain [Arabidopsis thaliana] gbIAAD23647.1I AC00 7 1 19_-13 (ACOO7I 19) 40S ribosomal protein [Arabidopsis thatiana] gbIAAD2697 1. 1 JAC007 135_8 (ACOO7 135) 40S ribosomnal protein S514 Arabidopsis thalianal gbIAAD22647. I 1AC007 138 11I (ACOO7 138) S-adenosylnethionine synthase 2 [Arabidopsis thalana] gbiAAD2564O. 1 JAC007 170_2 (ACOO7 170) cytoplasmic aconitate hydratase [Arabidopsis thaliana] tgbIAAA99933.1I J(L4458 1) vacuolar H+-pumping ATPase 16 kDa proteolipid [Arabidopsis [Arabidopsis thalianal AC007119.8 8 _AT AC007119.8 8 13080_at AC007135.5OAT AC007135.50 16919_at AC007138.25_SAT AC007138.25 12797_s-at AC007170.48_AT AC007170.48 17857_at AC007195.93_IAT ACOO7 195.93 16969_iat 182 Case S-5OOI5AiI6/7S/AD Gene ID Accession on Affy Description chip AC007357.17-S-AT AC00735 7 17 13104_s-at embiCAA74O29. 11 (Y 13695) multicatalytic endopeptidase complex, proteasome precursor, beta subunit [Arabidopsis thalianal AC007576.5_AT AC0075 7 6.5 12781_at gbIAAD39279. I AC0075 7 6 2 (AC007576) Unknown protein !!I[Arabidopsis thaliana] AC007659.93-R-AT AC00765 9 9 3 13169_r-at gbIAAD3283 1. 1 AC00 7 6 5 9 -13 (AC007659) putative GATA-type zinc finger transcription factor [Arabidopsis thaliana] AF000657.40-AT AF000657.
4 O 19623..at gbIAAB721 7 5.11 (AF000657) cytochrome C [Arabidopsis thalianal AF001394_SAT AF001394 15600_s at gbIAAD0o895. 11KAF00 1 3 9 4 fatty I acid desaturase/cytochrome f usion protein [Arabidopsis thahana] AF003096_FAT AF003096 14723_fat gbIAAC49769. 11 (A.F003096) AP2 72 a bIdomain containing protein RAP2.3 domai containn rti RA2 Arabidopsis tha tiana AF048_A AF026 125at gbjAAC497749.1(A00341 ethyleneisentive 3 [abdps thaln ca] i AF00439-S-AT AF004216 14714_s-at gbIAAB62924. 11 (AF004393)sat zt induced tonoDlast intrinsic AF01I3294.25SAT AF013294.
2 5 18650_s-at AF0l 3294.35AT AF013294.3 5 18573at gblAAB62855.11 (AF013294) similar to acidic ribosomal protein p1 [Arabidopsis thaliana] 183 Case S-50015AJ16178/NAD 00 00 ieIDAcsin#o Affy Description chip kF013959.4_AT AF01 3959.
4 16436at gblAAkB67234.lI (AF01 3 9 5 9 meta~othionein-R~e protein [Arabidopsis thalianal AY0O17641 _SAT AF017641 15165-sat gbjAAC 17 844.l11(AFO1 7 64 1) nucleoside diphosphate kinase type 1 [Arabidopsis AF017991 _S_AT AF017991 15150_s-at gblAAB97312.1(AF017 9 9 1) salt stress inducible small GTP binding protein Rani AF027172.3_SAT AF027172.3 16906_s-at gblAAC39334.11I(AFO 27 1 7 2 cellulose synthase catalytic subunit (Arabidopsis thalianal AF027174_SAT AF027174 15603_s-at gblAAC39336.11I(AFO 2 7 1 7 4 cellulose synthase catalytic subunit [Arabidopsis thaliana) AF034387_SAT AF034387 14727_s-at gblAAC33264.1I (AF034387)
AFT
protein [Arabidopsis thalianal AF034694_SAT AF034694 16544_s-at gbIAAB8769 2 .11 (AF034694) ribosomal protein L23a [Arabidopsis thaliana] AF043519_S_AT AF043519 15 130_s-at gbIAAC9516 1. 11(AC005 9 7 O) 2 0S proteasome subunit (PAA2) [Arabidopsis thalianal AF043528_SAT AF043528 16546_s-at gblAAC32064. 11KAF0435 2 8 2 0S proteasome subunit PAG 1 [Arabidopsis thaliana] AF044265_SAT AF044265 15668_s-at gblAAC00512. 11(AFO 4 4 26 nucleoside diphosphate kinase 3 [Arabidopsis thaliana] AF044313_SAT AF044313 14717_s-at gbIAACO5742. 11(AFO 4 4 3 13 anion channel protein [Arabidopsis thaliana] AF5929_S-AT AF059294 14736_s-at gbIAAF26761.11ACOO 7 3 9 6 (AC007396) T4012. [Arabidopsis thaliana] protein in budding yeast [ArabidopsiS thaliana] 184 Case S-5O0lSA16/78/NAD I 00 00 Gene ID Accession On Affy Description chip A D 1 2 8 11( F 6 5 9 AF061519_SAT AF061519 15581 _s_at gb /incAsupeox.ideAF 0 61 5 1 9 copper/ncspoxddsmte [Arabidopsis thahanal AF06248 5.1 -AT AF062485.l 17468_at gbIAAC29O67.11
(AYO
6 24 8 5 i celulose synthase lArabidopSiS thaliana] AF063901_SAT AF063901 14737_s-at gb1AAC26854. 11(AF 0 6 3 9 O 1 alanine: glyoxylate arninotransferase; transaminase [Arabidopsis thalianal 2 9 AF069299.1 9 _AT AF069299-1 9 16925at gblAAC193O5.lI WA06929 simyilar to ribosomal protein S 13 (Pfam; S 15.hrnim, score: 78.35); Identical to Arabidopsis ribosomal protein S513 (fragment) (SW: P49203A) except the first 32 amino acids are different [Arabidopsis thaliana] AF074375_SAT AF074375 15114_s-at gblAAC8324O.lI1(AFO 7 3 8 7 5 endo- 1 ,4 -beta- D-glucaflase KORRIGAN [ArabidopsiS thalianal AF076484_S_AT AF076484 16627_s-at gblAADO4627.11I(AF1OS 6 6
O)
C YTI protein [Arabidopsis Fthaliana] AF076641.
2 _AT AF076641.
2 16977at gbIAAD46O64.I1IAFO 7 6 6 4 1 -1 (AF076641) homeodomain leucine-zipper protein ATHB16 [Arabidopsis thaliana] AF077528-S-AT AF077528 15 15 2sat gbjAAB72l 16.11 (U69533) AtKAP alpha [Arabidopsis thaliana] AF080120.1 I _S_AT AF080120.ll 16935_s-at gbIAAC35545. 11(AFO8Ol 2
O)
similar to vacuolar ATPases [Arabidopsis thaliana] thaliana) AF082565SAT AF082565 15639_s-at gblAAD29109.1IIAF0 8 2 5 6 51 (AF082565) ATP dependent copper transporter IArabidopsis thalianal 185- Case S-5OOI5AI16/78/NAD 00 ene ID kY08333 Accession on Affy Description ch-in 6.2_S AT AFOS 3336.2 16932_s-at kF083337.3_SAT A.F083337.3 1693 1 _s-at AF1 18822_F_AT AR 18822 16080_f-at AF136152_SAT AF136152 15643_s-at AF144387_AT AF144387 12857_at AF1 67 98 3_S_-AT AF167983 15210_s-at gbjAAD 10030. 11 (AF083337) ribosomal protein S27 [Arabidopsis thalianal gblAAD 10030. 11 (AF083337) ribosomnal protein S27 [Arabidopsis thalianal gblAAD206I2. 11 (A.FI 18822) senescence-associated protein [Arabidopsis thaliana] gblAAD39465. 1 IA.F1361 521 (A.F136 152) PUR alpha- I [Arabidopsis thaliana] gblAAD35005. I 1AF1443871 (AE144387) thioredoxin- like 1 [Arabidopsis thaliana] gblAAF246O9. 11 AC01077 (AC010870) vacuolar membrane ATPase subunit G (AVMA [Arabidopsis thaliana] gblAAD55787. IJAR18 19661 (AFI 81966) methylenetetrahydrofolate reductase MTHFR I [Arabidopsis thalianal tgblAAF03749. I JAFR 868479- (AFI 86847) TIM 17 [ArabidopsiS thalianal AF18688__ATAF18688 17994_r-at AF181966_AT AFE181966 j17996_at AF 186 84 7_-S_-AT AF18 6 8 47 1l80 00_s a 186- Case S-5O1AI16/78INAD 00 0 e 1 D Accession on Affy Description 'ene IDchip GOSTAGOl 12977-sat gblAAD49755. I AC007 9 3 2 3 kGIS-AT(AC0079 3 2 Ident Ical to gbIU91995 Argonaute protein from Arabidopsis AJ001I342.
2 _SAT Ai0013 4 2 2 16923_s-at embjCAAl8846.11I(AL 0 2 3 0 9 4 Putative S -phase- specfic r- ribosomal protein [Arabidopsis.
thalianal 00 .1013 7 AJ0197 18011 _sat dbjlBAA225O4. 11(AB 0
O
5 5 6 0 AtGDI2 [Arabidopsis thalianal1 AJ00678 7 .1_AT AJ006787.1 19224_at embICAAO7251.lI (AJ006787) putative phytochelatin synthetase [Arabidopsis, thalianal AJ010456.
2 _AT AJ01045 6 2 17470_at embICAAO9195.11I(M0O1 0 4 5 6 RNA helicase [Arabidopsis thaliana] Ai0 1050 5 _SAT AJO10505 18018sat embICAB5483O.1I(M01 0 5 0 5 cysteine synthase [Arabidopsis thalianal AJ0168_IATAJO 128 18032 iat embjCAB5658O. 11(AJO 1 1 6 2 8 squamosa promoter binding protein- like 1 [Arabidopsis AJ 131205-AT AJ131205 18047jat embjCAA 10320. 1(AJ1 3 1 2 mitochondrial NAD-depefldeflt malate dehydrogeflaSe [ArabidopsiS thalianal ALO21636.178-AT AL021636.l 7 8 16499_at embICAA 16587. 11 (AL0216 3 6 putative protein [ArabidopSiS thaliana] AL02168 7 1 9 9 _AT AL02168 7 .l 9 9 19677at embiCAA167O 9 .lI (AL02168 7 putative protein [Arabidopsis thalianal 187- Case S-5O15AI16/78/NAD Gene ID Accession on Affy Description ch ip C A 1 7 81 11( L 2 7 AL021712.156_AT AL021712.1 5 6 20559_at embCA 68l1(O 2 7 putative protein [ArabidopsiS 7 thalianal AL021811.1 56 _AT AL021811.1 56 12776_at embICAA 16969. 11(AL 02 18 1 1 putative protein [ArabidopsiS thalianal AL021890.1 4 _AT AL021890-1 4 13591_at embICAA17l48.11(ALO 2 18 9 0 putative protein [ArabidopsiS thalianal AL219020~SAT AL021890.
2
O
9 12752_sat embICAA 763. 11 (AL021890) peroxidase prxcrl [ArabidopsiS thaliana] ALO22023.145-S-AT AL022023.14 5 16905_s-at embICAAl17773. 11(ALO 220 23 catalase [Arabidopsis thatiana] AL02214 1. 10-S-AT AL02214 1. 10 16976_s-at embiCAA185O7.11I(AL 0 2 2 3 7 3 ribosomal protein L2 [Arabidopsis thaliana] AL022224.182-S-AT AL022224.18 2 16021 _s-at embICAA 18251. 11 (AL02222 4 endome mbrafle- associated protein [Arabidopsis thaliana] AL022224.
7 2 _AT AL022224.7 2 13122at embICAA1824O.lI (AL02222 4 putative protein [Arabidopsis thaliana] AL022373.1 5 3 _AT AL022373.1 5 3 12802_at embICAAl8498.1I
(ALO
2 2 3 7 3 DnaJ-aie protein [Arabidopsis kL022580.1 8 8 _AT AL022580.1 8 8 17878at AL023 .094.2l 6 SAT AL02309 4 2 l 6 12234_s-at AL023094.323_SAT IA.L023094.3 2 3 16515-s-a embiCAA 1868.11 (AL02350) putative re iosoae le roteS6 [rt Arabidopsi thaliana embICAA 188491.11 (AL02309 4 putative protein [Afiabidopsis thaliana] 188 Case S-5O15AI16/78/NAD Gene ID ALO3 1326.1 38AT AL034567.18 9
_AT
AL035356.1 2 3
_AT
AL035394.11iLAT =AL0354401 9 1
SAT
AL035440.
4 4 7
_AT
AL035440.6&AT Accession on Affy Description chip AL031326.13 8 1793L1 at embICAA246 .11 (AL031326) water channel-like protein [Arabidopsis thaliana] AL034567.l 8 9 1 3088_at embiCAA22574.11I(AL 034567 ubiquinol-cylochrome c reductaselike protein [Arabidopsis thalhana] AL035356.1 2 3 13097at embICAA22994.I11 (AL035356) putative protein [Arabidopsis thalianal AL035394. 117 1 7384at embICAA23O2 9 Ij (AL03 5394) putative protein lArabidopsiS thalianal AL035440.1 9 l 13133_s-at ernbICAB3653O.11 (AL035440) ubiquitin-like protein [ArabidopsiS thaliana] AL035440.4 4 7 17011_at embICAB36546.11IIALO 3 5 4 4
O)
putative DNA binding protein [ArabidopsiS thaliana] AL035440.6 6 l866Lat embICAB365l7.1I (AL035440) putative protein [ArabidopsiS AL035526.l0l _SAT AL03 5 526. 101 13073_s at AL035540.3 4 8 _SAT AL035540.3 4 8 19961 sat AL035540.9 4 _AT AL035540.9 4 12804_at AL035656.1 2 6 _AT AL035656.1 2 6 17459_at ernbiCAB37458. 11 (AL035526) ribosomal protein LI 1, cytosoliC [Arabidopsis thalianal gblAAB24074.11 (S47408) glycinerich protein, atGRP j clone atGRP- 2) [Arabidopsis embICAB 37 50 7 11 (AL03 5540) probable H+-transporting ATPase [Arabidopsis thaliana] gblAAA99933. 11 (L4458 1) vacuolar H+-pumping ATPase 16 kDa proteolipid [ArabidopSiS [Arabidopsis thalianal AL035679.1 3
-SAT
AL035679.13 16967_s-at 189- Case S.50015A/16/78INAD Gene ID Accession on Affy Description chip AL035679.
2 3 2 _AT AL035679.
2 3 2 I 8905at embICAB38 828 11 (AL035679) putative proton pump [Arabidopsis go_ I IOd ALh368ianal XL035680. 1 10 SAT XL035680.l 10 17429_s at embICAB388 4 3 11 (AL035680) translation initiation factor [Arabidopsis thaliana] AL035680.5 3 _AT AL035680.5 3 13578_at embICAB38839.l11(ALO 3 5 6 ribosomal protein L14-Rie protein [Arabidopsis thalianal AL035709.
8 7 _AT AL035709.8 7 17389_at embICAB3893l1. 11(AL03 5 7 0 9 putative protein [Arabidopsis thaliana] AL049171.158.AT A.1049171.1 5 8 201 80_at No hits found less than or equal to 1 e- AL049171.
2 5 _AT AL049171.
2 5 17005at embICAB3895 2 11 (AL04917 1) putative ribosomal protein [Arabidopsis thalianal AL049480.1 7 8 _AT AL049480.l 7 8 13940at embICAB396O. 11 (AL04948O) putative acidic ribosomal protein [Arabidopsis thaliana) AL 049960 8.1l 84 _AT AL049608.I 84 12813_at emblCAB40778. 11(AL04 96
OS)
pulatlv p1 vL.-49v6 0'8'18'4- AL050300.15 _FAT XL050300.lS. 13129_fat AL050300.
2 7 _AT AL050300.2 7 16920_at AL050398.
4 _AT A.L050398.
4 19133at AL078464.3_AT A1-078464.3 7 14108at thalianal utaqtie/ ribosoal protei C2[Arabidopsis thaliana] embICAB4369O. 11 (AL050398) pon[Arabidopsis thalianal embICAB4383 6 11 (ALO7 8464) putative protein [Arabidopsis thatianal 190- Case S-50015A11I6/78/NAD ,ene ID Accession on Affy Description ~~L078468chi _A L748l 83_t embICAB43885. 1(AL 0 7 84 6 8 1-084681 IAT L07868.1 1830a acyl-CoA synthetase-like protein [Arabidopsis thalianal kL078637.47_SAT AL078637.
4 7 12803_-s -at embICAB45O57. 11 (AL078637) putative protein [Arabidopsis thalianal k~L096856.7_AT AL096856.
7 13093_at embICAB5lO6 1. 11(ALO 9 6 8 5 6 B12D-like protein [Arabidopsis !!nthabianalI AL096860.1 5 7 _AT AL096860.1 5 7 13079_at embICAB5l2O9. 11 (ALO 9 6 8 6
O)
40S RIBOSOMAL PROTEIN homolog [Arabidopsis thaliana]
AOS-S-AT
AGS 1288 1 -s-at.
A.P000423_AT AP000423 12847_at APX3SAT APX3 12885_s-at ATADHIIIAT ATADHIII 12893_at oxide synthase [Arabidopsis ebCAA84664. l (X9003) r ascbte peK nroias [Arabidopsis thalianal embICAA5797 3 11 (X82647) class III ADH, glutathione-dependent formaldehyde dehydrogenase.
[Arabidopsis thalianal dbjlBAA32420. Ij (A.1308 105) ethylene responsive element binding factor 3 [Arabidopsis vacuolar H+-pumping ATPase 16 kDa proteolipid [ArabidopsiS [Arabidopsis thalianal ATERF3_SAT ATERJF3 12906_s-at ATHADPRFAT.ATt ATHADPRFA 156 ATHAVAPSAT
ATHAVAP
1519 1 -s-a 191 Case S.50015A116/78[NAD 3ene ID Accession on chip ,%THAVAPASAT
ATHAVAPA
Affy Description 15584_s-at ATHAVAPCST ATHAVAPC 164_s-at ATHD 12AAASAT ATHD12AAA 15 134_s-at ATHDYNAGTP-SA ATHDYNAGTP 15585_s-at
T
ATHERDI3_SAT ATHERD13 15193_s-at ATHERD I SSAT ATHERD 15 15104_s-at ATHGFPSIASAT ATHGFPSIA 14734_s-at ATHHMGIAT ATHHMGI 12920_at gblAAD26493. I JAC007 1957 (ACOO7 195) putative vacuolar proton-ATPase 16 kDa proteolipid lArabidopSis thaliana] gbjAAD388O3. I AF153677-1 (AF153677) vacuolar H+-pumping ATPase 16 kDa subunit c isoform 4 thalianal gblAAA32782.II (L26296) delta- 12 desaturase [Arabidopsis thalianal gblAAB63528.11 (L36939) dynarnin-like GTP binding protein 1Arabidopsis thaliana] gblAAC2072 1.11 (AC004669) glutathione S-transferase [Arabidopsis thaliana] gblAAC23728. 11 (AC004625) dehydratilon- induced protein (ERD 1 5) [Arabidopsis thaliana] gblAAA32799.lI (L091 10) GF14 psi chain [Arabidopsis thalianal gbIAAA32814.1I (L19261) hydroxymethylgiutaryl CoA reductase [Arabidopsis thaliana] emblCAA3 3139.I11 (X 15032) hydroxy rnethylglutaryl CoA reductase (AA 1-592) embiCAB5247 1.11 (AL109796) xyloglucan endo- 1, 4-beta-Dglucanase precursor [Arabidopsis thaliana].
gblAAB9684O. 11 (L23574) acyl carrier protein precursor [Arabidopsis thaliana] ATHH MGCOAR-S_
AT
ATHHMGCOARI 12921_s-at ATHMEI5B 151_s-at ATHMTMACP. SAT ATHMTMACPl 16574_s- at 192- Case S-5OOI5AI16/78/1"AD I 00 00 Gene ID Accession on Affy chip ATHPRPHC_S_AT ATHPRPHC 15119_s-at ATHRP28ASAT ATHRP28A 16577_s-at ATHRPCASAT ATHRPCA 15155_s-at 15617_s-at
ATHSARISAT
ATHSAkRI
ATORNCXRB-SAT
ATORNCARB
15213_s-at ATTHIRED2-SAT ATTHIRED2 13184_s-at ATTHIRED3_AT ATTH1RED3 13185_at ATU01955_SAT ATU01955 151 35_s-a Description gbjAAD 10854.l11 (U60135) serine/threo nine protein phosphatase 2A-3 catalytic ribosomal protein S[Arabidopsis thaliana] gblAAA5699l 11 (M904 18) formerly called HAT24; synaptobrev in- related protein gmbIAA2715.11CO1 6
S
2 6 pruate dehaydrnserEl et sbt[Arabidopsis tha tiana gbIAAA9521. 11 (U35640) melthioeinlk pAraiotin tirxn[Arabidopsis thali ana t gbAAD 7157.11 AC06340)1 aisregulartein SAei (I A.A8) sbt[Arabidopsis thati ana] ATU09137-SAT ATU09137 1515sa 10_SAT ATU 15108 17078_sa ATU 15I3OSAT ATU 18410OS-AT ATU 15130 ATU 18410 15157_5_ 16156_s_2, 193- Case S-5O15AI16/78/NAD Gene ID Accession on Affy chip ATU 18675_SAT ATU 18675 15620_s-at ATU20347_SAT ATU20347 15649_s-at 15590_s-at ATU21214_SAT ATU21214 ATU21557_SAT ATU21557 16098_s-at ATU22340_SAT ATU 22340 15136_s-at Description gbIAAD47 19 1. 1 JAR 06084_ (AFI 06084) 4-coumarate:CoA ligase 1 [Arabidopsis thalianal gbIAAA9197 6 .l11 (U20347) nR-NA corresponding to this gene accumulates in response to gbIAAA86507.11 (U21214) pyruvate dehydrogeflase El alpha subunit [Arabidopsis thalianal ebICAB2585.1 (21) unnamed protein poducata
A
reuaoybntA[Arabidopsisthlaa gbAAB52530.11 (U22834) DatJ hmg[Arabidopsis thaliana l btnoecasto interact protein
I
hmg[Arabidopsis tha] .tgbAAB390.11 (U278121) proin 1[Arabidopsis haiana ATU36765_SAT ATU36765 15177_s-at ATU37235_SAT ATU37235 15195_s-at ATU37281_FAT ATU37281 16158_fa; ATU37587_SAT ATU37587 13205_s-a ATU39485_SAT ATU39485 15122s.a ATU43325_SAT ATU43325 15691sa 194 Case S.50015A116/78/NAD 00 00 lene ID Accession on chip ~TU43397_S_AT ATU43397 Atf) Description 15112_s-at ATU46665_SA ATU4666 14730_s at ATU49072_SAT ATU49072 15215_s -at ATU49259_SAT ATU49259 15652_s-at gblAAD09837.11 (U43397) cryptochrome 2 apoprotein [Arabldopsis thaliana] and cryptochrome 2 apoprotein (CRY2) (gbIU43397). ESTs gbIW4366l and gbIZ25638 come from this gene. [Arabidopsis thatianal gblAAC3 1617.11 (U49937) glutamate decarboxylase [AYabidopsis thalianal Arabidopsis thaliana. ESTs gbIW43856, gbIN37724, gbIZ34642 and gbIR90491 come from this gene.
gblAAB84353.l11 (U49072) IAAI 6 [Arabidopsis thaliana] gblAAF26982. I 1AC 183632 7 (ACO 18363) isopentenyl diphosphate dimethylallyl diphosphate isomnerase [Arabidopsis thabiana] gbi AABO0972 3. 11 (U 5 2851) arginine decarboxylase adosiethonin a bxls gblAAA57473.1I (UF055)1-3 3-Like protein 1 [Arabidopsis gblAAB5176.11 (U63633) S gei-like protein [Arabidopsis thalianal ATU52851_SAT ATU56929_SAT ATU52851 15197_s-at ATU56929 110sa ATU63633_SAT ATU63633 14721 _s-at ATU66343_SAT ATU66343 15654_s-at ATU68545_SAT ATU68545 14722_s-at ATU75191 _SAT ATU75191 15216-s-at 195 Case S-5OOI5AI16/78INAD 00 0 eeI Accession on Atfy Description chip SATU77381 _SAT ATU77391 I6106_s-at gblAAB82647. 1(U77381)WD- 4 0 repeat protein [Arabidopsis thalianal ATU7829 7 _FAT ATU78297 15 100_fLat gblAAB36949. 11 (U78297) plasma membrane intrinsic protein PIP3 [Arabidopsis thahianal SATU78870_SAT ATU78870 17030_s-at gbIAAB68038.l11 (U78866) ri gene 1000 [Arabidopsis thalianal 0 ATU79960_SAT ATU79960 16056_s-at gblAAB72112. 11 (U79960) 0 ~vacuolar sorting receptor homnolog [Arabidopsis thalianal ATU80186_SAT ATU80186 15627_s at gblAAB86804. 11(U 8 018 6 pyruvate dehydrogenase El beta subunit [Arabidopsis thatianal AT995-SA ATU91995 16170_s_at gblAAD49755.1IAC00 7 9 3 2 3 (AC00793 2 Identical to gbJU9 1995 Argonaute protein from Arabidopsis CATL SAT CATL 13218_s-at gbiAAC17732.11I(AF0 2 1 9 3 7 catalase 3 [Arabidopsis thalianal CYSPROLSAT CYSPROL 1 3230s-at embiCABlO39 8 .11 (Z97340) cysteine proteiase like protein [Arabidopsis thalianal D01027.1 _AT DO01027.1 18940at gblAAC2437O.l11 (U89959) [Arabidopsis thahiana) D I 1394.4_SAT Dl 11394.4 16011 s-at embICAA44 6 3 O. 11 (X628 18) Met alot hioflein- like protein [Arabidopsis thaliana] D13043.4_AT D 13043.4 15973at dbjlBAA0237 4 .lI (D13043) thiol protease [Arabidopsis thaliana] D83531 _SAT D83531 15113 s..at dbjIBAAl11944. 11(D8353 1) GDP dissociation inhibitor [Arabidopsis thalianal 196- Case S-5O15AI16/78INAD ;ene ID Accession on Affy Description chip I )88374_S_AT D88374 15 149_s_at dbjlBAI 3599.l11 (D88374) gamima subunit of mitochondrial Fl -ATPase [Arabidopsis [Arabidopsis thalianal aLUTATHIONEPER GLUTATHIONE 1 3259sat gbjAAD2483 6 .l I AC00707 1 -8 DXIDASEISAT PEROXIDASEI (AC00707 1) putati've glutathione peroxidase I Arabidopsis thaliana] GSTLRC_SAT GSTI 13263_s-at embICAA 10060. 11(AJ 0 1 2 5 7 1 glutathione transferase [Arabidopsis thalianal GST2SAT GST2 13264_s-at embICAA72973. 11 (Y 12295) glutathione transferase [Arabidopsis thalianal GST8SAT GST8 13267_s-at embICAA100 6 O.lt (AJ01257l) glutathione transferase (Arabidopsis thaliana] HSC7OL-SAT HSC701 13269_s-at embICAA544l 9 .l11 (X77 199) heatshock cognate 70-1 [Arabidopsis thalianal [Arabidopsis thalianal 305216.2_SAT J05216.2 16985_s-at gblAAA32866. 11 (J05216) ribosomal protein S I1 (probable start codon at bp 67) [Arabidopsis thaliana] L09755.2_SAT L09755.2 19682_s-at gblAAA32862.11KLO 9 7 5 5 ribosomal protein S28 [Arabidopsis thalia na] L14844_3_SAT L14844 12824_s-at No hitsfoundless than or equal to L15389_SAT L15389 18679_s at No hits founld.
197 Case S-5O15AI16/78/NAD Gene ID Accession on Affy chip L26984-SAT L26984 18682_s-at M21415.4_AT M21415.4 15988_at M55077.2_AT M55077.2 15993_at M64116_3_SAT M64116 12827_s-at Description gbIAAC274( putative sma [Akrabidopsis gbjAAA327 tubulin [Ara gbIAAA328 53.11 (AC003672) ~U GTP-binding protein thalianal 57. 11 (M21415) betabidopsis thalianal 68. 11 (M55077)
S-
M84703.2_AT M84703.2 16480..at ORYZAIN4_AT ORYZAIN4 14245_at ORYZAIN5 14246_at PHYA-AT PHYA 14622_at RAN I _S_AT RA i 14641_-s-at [Arabidopsis thalianal gblAAA32794.1l (M641 16) cystolic glyceraldehyde-3phosphate dehydrogenase (GapC) [Arabidopsis thaliana] gblAAA32884.l11 (M84703) beta-6 tubulin [ArabidopsiS thaliana] (bIAF0265) ATP 043 depeden choptoranptorr [Arabidopsis thaliana] embICAB38829.lI (AL035679) drought- inducible cysteine proteinase RD19A precursor gblAAB20558.1I (S69727) lightregulated glutarnine synthetase isoenzyme [Arabidopsis t haliana, Peptide, 430 aa] RDI9ASAT RD 19A 14644_s-at 569727.2_AT S69727.2 16503_at TH 1OLPROTEASE 1
S-AT
THIOLPROTEASE 14658_s-at embICAB38829.l11 (AL035679) drought- inducible cysteine proteinase RD 19A precursor [Arabidopsis thalianal 198 Case S.50015A116/78INAD Gene ID Accession on Affy chip THIOLPROTEASE3_ THIOLPROTEASE 14659_s-at SAT 3 TONOL-J_AT TONOL 14662_f~at 131 1256.2_AT U311256.2 16035_at U1I15108.2_SAT U315108.2 160 1 0_sat 18651 _sat 1320347.2_5_AT U20347.2 U21214_SAT U21214 1333014.2_SAT 1333014.2 18687_s_at 15955_sat 16032_s-at Description embICAB388 2 9 ij (AL035679) drought -inducible cysteine proteinase RD19A precursor embjCAA386 3 3 11 (X54854) possible membrane channel protein [Akrabidopsis thaliana) gbIAAA8221 2 .l11 (1311256) metallothionein [Arabidopsis thaliana] gblAAA5O25. 11 (U315108) metallothionein-like protein [Arabidopsis thafiana] gblAAA9197 6 .lI (1320347) mRNA corresponding to this gene accumulates in response to gblAAA865O 7 .lI (1321214) pyruvate dehydrogenase El alpha subunit (Arabidopsis thaliana] gbIAAB5392 9 .l11 (1333014) polyubiquitin [Arabidopsis Nhit fud es ha r qalt gbAAB0788O.lI (135681) asorbe ox Aid Aiopsis thalianal gblAAB86892.1I (U419988) at3 [Arabidopsis thaiana] 1335640.2_SAT 1335826.2_SAT U 35640.2 1335826. 195s-at 1341998.4_AT 1341998.4 16476_at U43224_S_AT 1343224 12842_s-at 1363815.18_5_AT U63815.18 16429.sat 1364912.1 _SAT 1364912.1 18989-s-at 199 Case S.50015A116/78/NAD Gene ID U6547 ILAT Accession on chip U65471 Affy Description 1 8692_at U84969-3_FAT U84969 12833_fUt U95973.108_AT U95973.108 18639_-at WT1O8ARCAT WTIO08A 14690_-at WT755_SAT WT755 14701 -s-at WT758_A WT758 4703_at X15550_SAT X15550 12843_s-at No hits found less than or equal to ebIAA52523. (8414) RCi14Ati [Arabidopsis thalianal gblAXD64O. 11 (U959_25 gbdoemAa045noe fromei thisgene [rabiopisg thaia] i [Aabidpss haiaa Notrsatound engan for eualt embICAA527 18.1 (X41 4) adnyat trnscto [Arabidopsis taaa ebjAA4253. om (fr8m ketoaidpi retoisea] e [Arabidopsis thaliana] X16432.2_SAT X16432.2 15992_s~a X52256.2_AT X52256.2 16443at X65052_AT X65052 16026at 549. 1 -AT! X655491, _56 a X68150.I AT X68150-1 1645 1 -at 200 Case S-50015A116/78JNAD Gene ID Accession on Affy chip X69294.2_S_AT X69294.2 16030_s-at X74604.2_SAT X74604.2 15953_s-at X74733.2_AT X74733.2 16463_at X75162.2_AT X75162. 2! 69at Afl 7~gi2 16446 at A /:)ZSZ51 I X75883.2_AT X78584.2_AT X75883.2 15989at X78584.2 16456_at Description embICAA49 155. 11 (X69294) transmembrane protein TMP-B [Arabidopsis thalianal embiCAA52 6 8 4 .l11 (X74604) heat shock protein 70 cognate [Arabidopsis thalianal embiCAA5275 .l11 (X74733) elongation factor- I beta AlI [Arabidopsis thalianal embICAA53005. 11 (X75 162) BBC1 protein [Arabidopsis thalianal thaliana] embICA.A53475.l11 (X7588 1) plasma membrane intrinsic protein I a [Arabidopsis thalianal embICAA575649. 11 (ALI8296) prota einphrpatae 2Arisi reatrunt [Arabidopsis taaa embICAA588 8 .lI (X78847) D1 [Arabidopsis thalanal X81697.2_SAT X81697.2 16918_s-at X82002.1 _AT X82002.1 2026 1 -at X84078_AT X84078 187 1 0at X831_ AT X8 4315.8 18659_at X84318_AT I X84318 18711 -at 201 Case S-5O15AI]6/78/1'AD 00 SGene ID X86962.: Accession on chin Atfy Description 1
I
3-AT X86962.3 19917_at X91398.2_AT X91398.2 16988_at X91958.1_AT X91958.1 16469_at X91959.l IAT X91959.1 15 890_at X92510.2_S_AT X92510.2 19706_s-at X94626.1_AT X94626.1 16508_at X99609.2_S_AT X99609.2 17430_s-at Y07765.7_SAT Y07765.7 16437_s-at Y09482.2_IAT Y09482.2 16036_i-at Y10157.3_SAT Y 10 157.3 19833_s-at Y 10863. 1 -1_AT Y 10863.1 19919_i-at Y1I2295.2_SAT Y 12295.2 16033_s-al Y14052.2_AT Y 14052.2 16506_at emibICAA6O525. 11 (X86962) protein kinase catalytic domain (fragment) [Arabidopsis thalianal embICAA62744. 11 (X91 398) transcription factor L2 [Arabidopsis thaliana] embICAA63O24.l11 (X91958) ribosomnal protein L9 [Arabidopsis thabanal gblAAF04877. IjAC010796 3 (AC010796) 60S ribosomal protein L27A [Arabidopsis thaliana] embiCAA63266. 11 (X925 10) allene oxide synthase [Arabidopsis thaliana] embICAA64329. 11 (X94626) A.ATP2 [Arabidopsis thatiana] embICAA67923. 11 (X99609) ubiquitin-like protein [Arabidopsis thaliana] No hits found less than or equal to embjCAA7O69l 1.11 (Y094 82) HMG1I [Arabidopsis thalianal embICAA7l239. 11 (Y 10 157) suffite reductase [Arabidopsis thaliana] embICAA7l879. 11 (Y 10986) hypothetical protein 194 [Arabidopsis thaliana] embiCAA72973.l11 (Y 12295) glutathione transferase [Arabidopsis thaliana] embICAA7438 1.11 (Y14052) ribosomnal protein S6 [Arabidopsis thaliana] 202 Case S-50015A116/78/NAD Gene ID Accession on Affy Description chin I 4 i Y 17053.2_AT Y 1705 3.2 15960_at Z1I2024_AT Z12024 18731_at Z14989.5_AT Z14989.5 17414_at Z15157.1_AT Z15157.1 16982_at Z28702.2_AT Z28702.2 16984_at Z97335.5_SAT Z97335.5 16504_s-at Z97336. I _AT Z97336.1 16930at Z97337.298_SAT Z97337.298 16934_s-at embICAA766O6.I11 (Y 17053) Athsc70-3 [Arabidopsis thaliana] embICAA7 8059.l11 (Z 12024) calmodutin [Arabidopsis thaliana] emblCAA7 8713. 11 (Z 14989) ubiquitin conjugating enzyme homolog [Arabidopsis thaliana] embICAA78856.11 (Z15157) Wilm's tumor suppressor homnologue [Arabidopsis thaliana] emblCAA82273.11 (Z28701) S18 ribosomal protein [Arabidopsis thaliana] embICAB 10 172. 11 (Z97335) hydroxymnethyltransferase [Arabidopsis thaliana] embICAB 1021 1. 11 (Z97336) ribosomal protein [Arabidopsis thalianal embjCAB 10279. 11 (Z97337) ribosomal protein [Arabidopsis thalianal .1 4 Z97340.298_SAT Z97 340. 298 159-72_s-at Z97341.130_AT Z97341.130 18230_at Z97341.407_AT Z97341.407 18614_at Z97343.270_SAT Z97343.270 16926_s-at embICAB 10398. 11 (Z97340) cysteine proteinase like protein [Arabidopsis thaliana] embICAB 10428. 11 (Z9734 1) symbiosis-related like protein [Arabidopsis thalianal embICAB 10447. 11 (Z9734 1) ribosomal protein [Arabidopsis thaliana] embICAB 10520. 11 (Z97343) ribosomal protein [Arabidopsis thaliana] 203 Case S-5OOI5A16/78/NAD Gene ID Accession on chip Z99708.65_AT Z99708.65 Description 1913 embICAB 16820. 11 (Z997 08) ubiquitin- -protein ligase-like protein lArabidopsis thatiana) 204 Case S-50015A116/78INAD 00 Table 12 provides a description of Arabidopsis genes for sequences which are expressed in a leaf-specific manner.
tfl Table 12: 12086-s-at AC002409.88_SAT 12095_at AC00622 3 9 5
_AT
120_t AF00657.30.A function Description novel gblAAC27838.I11 (AC0042 18) unknown protein [Arabidopsis thalianal novel gbIAAB86456.l11 (AC002409) unknown protein [Arabidopsis thaliana] novel gbjAAD 15394.l11 (AC006223) hypothetical protein [Arabidopsis novel 12115_at A034 5.2& T metabolism 12135at AC007230.
2 9_AT novel 12270_at A.L030978.7 9
_AT
kinase gblAAB72 170. 11 (AF000657) hypothetical protein [Arabidlopsis thaliana] embICAA22l5 2 .l11 (A.L033545) extensin-like protein [Arabidopsis thaliana] gbIAAD26875. IlJAC007230_9 (AC007230) ESTs gbjH76289 and gbIH76537 come from this gene.
embjCAA 19724. 11 (AL0239 7 8) seraie recorpri kinase rti [Arabidopsis thaliana] embICAA 18476. 11 (AL022347) putative protein [Arabidopsis thalianal n gblAAB87 103.11 (AC00239 1 putative MYB family transcription factor [Arabidopsis thaliana]1 12299at AL022347.26 5
_AT
12305_i-at AL022347.
2 1 9
_IAT
kin ase novel I 239Lat AC002391.lO 2 _AT transcrito 205 Case S.50015A116/7SINAD 00 0 fyI Accession function IDescription ,C 12788at AC00231 1 20 _AT defense "gblAAC00607.l11 (AC00231 1) similar to ripening- induced protein, gplAJOO 144912465015 and major#Iatex protein, gpIX919 6 1Il 107495 tArabidopsis thalianal" 13243-r-at EL132-RAT metabolism embICAB3753 9 .l11 (AL035538) cinnamyl- alcohol dehydrogenase EL13-2 [Arabidopsis 00 I 13352at AL030978.l 2 6 _AT novel embICAA 19730.l11 (AL030978) putative protein [Arabidopsis N thalianal I 362Q0at AL035605.
4 1 _AT metabolism embiCAB382 9 5. 11 (AL035605) formam-idase-hike protein f Arabidopsis thaliana] 137 19at NOVARTIS 106_AT novel No hits found less than or equal to I e- 13972 s at Z97344.134 SAT transcription embICAB1O5 6 1lii (Z97344) SUPERMAN like protein [Arabidopsis thaliana] 14l92at NOVAIRTIS66_AT novel gblAAC3433l.ll (AC004 122) Unknown protein [Arabidopsis thaliana] 142 l8at NOVARTIS87_AT novel No hits found less than or equal to le- 14242_s-at NRASAT metabolism gbIAAF19225.l I AC007505_1 (AC007505) nitrate reductase [Arabidopsis thaliana] 206 Case S-50015A116/78/NAD Affy MD 14248_at A ccessiOn function Description Accession I function i PAD3_AT I metabolism 14432_at AL035440.50 2
_AT
novel 14484_at U73462.2AT metabolism 14533_i-at AC007048.1 6 6 i_-AT novel 14600_at AC007576.49_AT novel 14603_at AL022347.28 2 _AT kinase 1462 1 -at PDF1.2_AT defense 14635_s_at PR. 1 _S_-AT defense 14682iat WT1O12A-RC_I_AT novel 14709_at WT788_AT novel 1.gbjAAD3lO62.IIACOO 7 3 5 7 -11 (AC007357) Strong similarity to gbIX97864 cytochrome P450 from Arabidopsis thaliana and is a member of the PF100067 Cytochrome P450 family. ESTs gbIN65665, gbIT141 12, gbIT76255, gbIT2O9O6 and gbIA1 10002 7 come from this gene.
embICAB36549. lj (AL035440) putative protein [Akrabidopsis thaliana] gblAAC32523.11 (U73462) carbonic anhydrase [Arabidopsis thaLiana] gblAAC32523. 11 (U73462) carbonic anhydrase [Arabidopsis thaliana].
gblAAD39297. 1 JAC007576_20 (AC007576) Unknown protein [Arabidopsis thaliana] embICAA 18477.11 (AL022347) serine/t hreo nine kinase-like protein [Arabidopsis thalianal gblAAC3-1244. 11 (AC004747) putative antifungal protein [Arabidopsis thalianal No its found less than or equal to 207 Case S-50015A/16/78/NAD I function IDescription A PPPc~1nn rw vr~ All~y 1LU I 14803_at 14808_i-at AC006550.
3 3
_AT
metabolism AC007230.
2 lIAT kinase 14862_at AC005770.2O 5 _AT transcription 15185_s at rAB024283SAT metabolism 1527 1 _at AC00407 7 .l 4 l _AT novel 1542_atAF069441.29_AT novel gbjA-AD258O7. I JAC006550-1 (AC006550) Strong simnilarity to gbIZ49699 glutaredoxin from Ricinlus communis. [Arabidopsis thalianal gblAAD26873. 1 1AC007230_7 (AC007230) Contains PF100069 Eukaryotic protein kinase domain.
[Arabidopsis thaliana] gbjAAC7962O. 11 (AC005770) putative RING zinc finger protein [Arabidopsis thaliana] dbjIBAA7856l.11 (AB024283) cysteine synthase [Arabidopsis thalianal gblAAC26689. 11 (AC004077) unknown protein [Arabidopsis thaliana] gblAAD36948.1IIAF069 4 4 1 8 (AF06944 1) hypothetical protein [Arabidopsis thaliana] gbAAC3523.l11 (AC005496)ES putive4 thimes bioynths esis prte Arabidopsis thaliana] gmbIAD383.11A14 9 5 1 subtate 1rt [Arabidopsisiaa ri bIBAA28535.11 (Z19602) HA4[Arabidopsis thaana] 15467_at AC000375.3 4
_AT
novel 1555_atAL096859.1 6 2 _AT novel 15613-s-at
AHMEASAT
metabolismr metabofisri 15837_at AC005496.1 7 5
_AT
16137_s-at AF149053-SAT metabolisn 16172_s-at D78603SAT metabolisn 208 Case S-5O15Ai16/78INAD Affy ID Accession function 16322_at AL09802OA novel 1 6323_at AC005957.3 5 _AT defense 16331at AC005957.23_AT defense 16365_at AC003974.1 3 6 _AT defense 16547-s-at AF5394-S-AT metabolism 16583-s-at ATHZFPH-.SAT transcription I 6687-s-at AC004044.6 4 _SAT novel 16845at AC006232.
8 7 _AT metabolismT 168569_-at AC004681.86_IAT metabolism 17019_s-at ATU28422-SAT transcriptic 17128-s-at ATHRPRPIASAT defense 17231_at AC004411.l 7 OAT novel Description embICAB5l2l5.lI (AL096860) putative protein [Arabidopsis thalianal gbAAC2723. 11 (AF0053941)no phttrop ices hypo ace 1-l ike [Arabidopsis thalianal gbIAAA83O 11 (L3095)zn figroen[Arabidopsis taaa gbAAD1545.l1 (AC0063) putative cysesesi ne protei n [Arabidopsis thaliana] gbAAC272936.21 (A00341)no puotatveicllulose synthase [Arabidopsis thalianal gbAAC69304 1 .1 (3965 3) n figprotein Arabidopsis gbAAC3422.11 (A0094421) hypothetical protein [Arabidopsis thalana] 209 Case S-50015A116/78INAD 00 0 fyI Accession function IDescription N.f -73a AF069298.
2 3 _AT kinase gblAAC19274.lI (AF069298) 1733 -atcontains similarity to a protein kinase domain (Pfam: pkinase.hmmn, score: 165.48), to legume lectris beta domain (Pfam: lectin-legB.hrlnm, score: 125.64) and legume lectins alpha domain I~*.(Pfam: lectnjegA.hmm, score: 16.72) [Arabi 00 1736L-s-at AF096373.2 8 _SAT metabolism embiCA.B39764.lI (AL049487) 0 sucrose-phosphate synthase-like 0 protein [Arabidopsis thaliana]
(N
17411 is_at X98926.1 _SAT defense embICA.A6742 6 .lI (X98926) thylakoid-bound ascorbate iperoxidase [Arabidopsis thalianal 17 81 s_at Z97342.284-S tZ93224SAT defense embICAB46O5O.11 (Z97342) disease resistance RPP5 Like protein (fragment) [Arabidopsis thaliana] 17835 at AF096370.l 4 _AT RNA gblAAC62779. 11(AFO 9 6 3 7 0) binding contains similarity to Arabidopsis protein thaliana reverse transcript ase-hlke proteins putative beta-amylase [Arabidopsis thalianal 18115..at AC005388.
4 3 _AT kinase gblAAC64891.11 (AC005388) Similar to TI 1J7.13 giJ28 8 putative protein kinase from Arabidopsis thaliana
BAC
gbIAC0O234O.
1 8296.at AC0025 10.60_AT kinase gblAAB84338.l1 (AC0025 putative Ca2+-ATPase [Arabidopsis thalianal -210- Case S. 500 15A/16/7 8/NAD Affy ID Accession function 1830L-s-at AL027 4 _SA metabolism 18469at AC006341.12_AT kinase 'QQQAL022604.205-AT novel I 8670...g-at AJ2 50341-G-AT metabolism 1 8778_at Z9338.384_AT novel 18811 -at AC002396.32 AT nove Description embICAA 18218. 11 (AL022223) fructose-bisphosphate aldolase [Arabidopsis thalianal gbjAAD3467 8 .1 IJAC006341 6 (AC006341) Similar to gblAJOl242 3 wall-associated kinase 2 from Arabidopsis thaliana.
embiCAA 18744.11 (AL022604) putative protein [Arabidopsis embICAB58103.11 (Z97538) hytetcalpreinm [Arabidopsis thaliana] bIAC1032. 11 (AC97338) hypothetical protein [Arabidopsis ptei Arbdss hanal gbAAD0163. 11 (AC003917) retroeeeient protein [Arabidopsis thalianal ebIAA6308.1 A0026 168 TMV070 reisace prtein N-Lk dfntolefud[Arabidopsisthlaa ebIAA 1929.11 (A.02 176) restanve prodtein channiel [Aoei rabidopsis thaliaa a 18835_at AC007260.34_AT novel 1 8844at AC0053 15.131 IAT transport I 8866at AC005917.1 7 8 _AT transposabl element 19034at AL021768.11 7
_AT
19465atAL021768.9 6
_AT
defense defense -211 Case S-50015AI16/78JNAD function I Description A rr~ A ccp~;ston ti L' Aceso 1958Lat AC006526.l02_AT transport 19704-i-at AJ005927.
2 _IAT metabolism 97 atAC0000098.16_AT transport -92at AC003979.28 AT hormone 1974a AC007 167.
2 4 8 _AT transport gbjAAD23O55. I JAC0065261 4 (AC006526) putative cyclic nucleotide-regulated ion channel protein [,Arabidopsis thalianal embjCAAO67 6 9 11 (AJ005927) squalene epoxidase homnologue [Arabidopsis thaliana] gbIAAB7 1447. 11 (AC000098) Similar to Arabidopsis Fe(II) transport protein (gbj122759 0 [Arabidopsis thaliana] gbIAAC255 17. 11 (ACOO3 979) Contains similarity to gibberelLinregulated protein 2 precursor (GASTI) homolog gbjU1 11765 from A. thaliana. [Arabidopsis thalianal gbIAAD3O549. I1JAF1 365801 (AFI 36580) iron-regulated transporter 2 [Lycopersicon esculentum] gbIAAD29795. I11AC006264_3 (AC006264) putative aux-inregulated protein [Arabidopsis thalhanal gbIA.AB91986. 11 (AC003033) unknown protein [Arabidopsis thalianal gbIAAB91985.l11 (AC003033) unknown protein [Arabidopsis thalianal gbIAAD22657.l I AC007 138_21 (ACOO7 138) predicted protein of unknown function [Arabidopsis thaliana] gbIAAC98O45. 11 (ACOO5 896) unknown protein [Arabidopsis thaliana] 19834_at AC006264.14-.AT hormone 19889.at AC003033.1I39-AT novel I99O1at AC003033.1 2 9 _AT novel 19992at AC007138.58-AT novel 20062_at IAC005896.23A T novel 212 Case S-5OOI5A16/78/NAD Affy ID 20063at Accession function 410 A~A C, A T I MpInhAfism 20232_s-at AL022347.l-S-AT kinase 20356_at AC004561.74-AT metabolism 20429_s at -936.6S .AT novel 20525_at ACOO7 169.89_AT transcription Description gblAAD 17422.11 (AC006284) putative esterase I ArabidopsiS gblA.AC95 191 .1I1 (AC00456 1) putative glutathione S-transferase [Arabidopsis thaliana] embICA-B10219.l11 (Z97 336) hypothetical protei [Arabidopsis thaliana] gbIAAD2648 1. 1 AC0071 69 1l 3 (ACOO7 169) putative CONSTANS-1ike B-box zinc finger protein [Arabidopsis nembiCAB388l69.11 (AL035679) putative zinc finger protein [Arabidopsis thaliana] SembICAB4O757. 11 (AL049607) glut athioue peroxidase-lie protein (Arabidopsis thaliana] 20537_at AL049608.65_AT met abolisrn 20544_at AL035679.6 8 _AT transcriptio, 20705at AL049607.66 AT metabolisrr 213 Case S.50015A116/78/NAD 00 00 Table 13 provides cumulative sequence Identifier numbers for the SEQ ID Nos disclosed in the sequence fisting. NOTE: please refer to cross referenced SEQ ID NOs Table since a single SYNGENTA NO: may refer to more than one SEQ ID NO.
Table 13: SEQ ID NOs 1-773 and their corresponding reference numbers SEQ ID NO: SYNGENTA
NO:
Root promoter reference numbers from the provisional application US 60/214087 1 AC006592.51 2 A71588.1 4 AC00 1645.1 9 AC00 1645.4 7 6 AC00 1645.50 7 AC002333.19 9 8 AC002333.2lO 9 AC002391.15 0 AC003673.
2 0l I I AC004005. 104 12 AC004521.11 4 13 AC004521.11 9 14 AC004683.
7 9 AC004684.1 6 F1 6 AC005310.6 214 Case S-5001 SA11 6/78/NAD SEQ ID NO: 17 18 19 21 22 23 24 26 27 28 29 31 32 33 34 36 37 38 39 SYNGENTA NO: AC005560. 136 AC005560. 147 AC005967.50 AC0062 16.22 AC0062 16.26 AC006577. 16 AC006587. 164 AC007060.34 ACOO7 135.23 AC007584.48
ACHI
AF098630.3 AF128395.12 AL035538.245 AL049500.57 AL049638. 193 AL049730. 104 AL080253.32 AL0802 82.74 ATAJ2596
ATHORE
ATPIN2 ATU 10034 -215 Case S.50015A11I6/78[NAD 00 SEQ ID NO: SYNGENTA
NO:
AT5720 Sy ATU57320 41 ATU62330 42
CAFFEROYL
43 NOVARTIS51 44 U72155.2 00 45 U81294.2 46 X98319.2 47 X98855.2 4 8 Z97338.321 4 9 Z97340.345 Z97344.151 51 Z99707.288 Constitutive promoter reference numbers from the provisional application US 60/213848 52 AC003981.3 4 53 AC004557.8 54 AC005287.
5 2 AC006085.15 56 AC007138.25 57 AC007576.5 58 AC007659.93 59 kF013959.4 AF0027172.3 216 Case S-50015A/16/7Sf1NAD :SEQ ID NO: SYNGENTA
NO:
61 AE0083337.
3 62 AF0123253.3 63 AC002332.71 64 AC002 334. 110 AC002 33 6.I101 66 AC002339.51 67 AC002521.l 4 6 68 AC002561.51 71 AC004165.1OS 72 AC004218.8 3 73 AC004401.1 4
O
74 AC004450.
83 AC004481.8 4 76 AC004665.31 77 AC004669.3 4 AC005309.6 4 81 AC005397.
4
O
82 AC005662.
3 0 83 AC005727.19l 217 Case S-5001 5A116/78/NAD SEQ ID NO: 84 SYNGENTA NO: AC005824.2 I
I
ACOOS 896.150
I
86 87 AC005897. 156 AC005936.95 88 AC006068.93 89 AC006200.119 AC006201.107 91 AC006223.65 92 AC006234. 156 93 AC006260.52 94 AC006264.30 AC006300.70 96 AC006403.1 97 AC006526.57 98 AC006532.47 99 AC006585.146 100 AC006586.141 101 AC006841.122 102 AC006919.171 103 AC006921.52 104 AC006929.77 105 106 AC006951 .208 I AC007017.278 218 Case S-50015A11I6/78INAD S EQ ID NO: SYNGENTA
NO:
107 AC00701 9 .lO 5 108 AC007070.
16 7 109 AC007071.
7 2 110 AC007119.
8 8 III AC007135.5O 112 AC007170.
4 8 113 AC007195.93 114 AF000657.4O 115 Z99708.65 116 kL035440.
6 6 117 AL021811.15 6 118 AL021636.1 7 8 119 AL049480.17 8 120 AL031326.1 3 8 121 AL035679.23 2 122 AL022224.72 123 AL035540.94 124 AL035356.123 125 AL050300.
2 7 126 AL02214 1. 127 AL035 526. 10 1 128 AL078464.37 129 AL034567.1 8 9 219 Case S-5O15A/16/78/NAD 220 Case S-50015A/16/78fNAD SEQ ID NO: 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 SYNGENTA NO: AI.035656. 126 AL049608. 184 U330 14.2 U4 1998 .4 U638 15.18 U95973. 108 A45785.1 AB003522.2 AB004872.6 AB005560 AB006693.1I AB008 105 AB008487 ABO 10946 ABO 1545 AB017643 A.B02 1858 A.B024282 AB02715 1.2 ACOOO 103.25 ACOOO 104. ACOOOI104.26 ACOOO 132. 16 -221 Case S-50015A/16/7gINAD 00 7SEQ ID NO: SYNGENTA
NO:
176 AC000132.
6 177 AC002131.
4 8 178 AC02329.
4 6 179 AC002330.3 9 IS0 AC002332. 100 00 181 AC00233 2 7 l 182 AC00233 4 .1I10 183 AC002 3 36.l101 184 AC002339.51 185 AC002343.
3 186 AC004165.IOS 187 AC004401.1 4 0 188 AC004481.8 4 1::897 AC006438.
2 1 190 AC006922.10 6 191 AF001394 192 AF003096 193 AF003105.1 194 AF004216 198 AF034387 222 Case S-50015A/16/78JNAD SEQ ID NO: SYNGENTA
NO:
199 AF034694 200 AF043519 201 A1F043528 202 AF044265 203 AF044313 204 AF059294 P 205 AF061519 206 AF063901 207 AF074375 208 AF076484 209 AF076641 210 AF077528 211 AF082565 212 AF118822 213 AF 136152 214 AF144397 215 AF167983 216 AF 18168 8 217 AF181966 218 AF186847 219 ACOl 220 AJ001397 2221 AJO10505 223 Case S-50015A/16/78INAD SEQ ID NO: SYNGENTA
NO:
222 AJ0 11628 223 AJ131205 224 AL096856 225 AL096860 226
AGS
227 APX3 228 ATADH I I 229 ATERF3 230
ATHADPRFA
231
ATHAVAP
232
ATHAVAPA
233
ATHDYNAGTP
234 ATHERDI13 235 236 ATHGFPS1A 237
ATHHMGCOAR
238 239
ATHMTMACP
240
ATHPRPHC
241
ATHRPCA
242
ATHSARI
243
ATORNCARB
244 ATTHIRED2 224 Case S-50015A/16178fNAD SEQ ID NO: SYNGENTA
NO:
245 ATTHIRED3 246 ATUO1955 247 ATU 15108 248 ATU 15130 249 ATU 184 250 ATU 18675 251 ATU20347 252 ATU21214 253 ATU22340 254 ATU36765 255 ATU37235 256 ATU37281 257 ATU37587 258 ATU39485 259 ATU43325 260 ATU43397 261 ATU46665 262 ATU49072 263 ATU49259 264 ATU52851 265 ATU56929 266 ATU63633 267 ATU66343 225 Case S-5OO]5AI16/78/NAD 00 SEQ ID NO: SYNGENTA
NO:
268 ATU68545 269 ATU75191 270 ATU77381 271 ATU78297 272 ATU78870 00 273 ATU79960 274 ATU80186 2E75 ATU91995 E 276 C ATL 277
CYSPROL
278 D01027.1 279 D 11394.2 280 D83531 P 281
GLUTATHIONEPEROXIDASE
282
GSTI
283 GST2 284 HSC701 285 IAA16 286 IAA8 287 J05216 288 L09755.2 289 L14844.3 290 L15389 226 Case S-50015 A/ I6/78/NAD 00 00 SEQ ID NO: SYNGENTA
NO:
291 L26984 292 M55077.2 293 M64116 294 ORYZAIN4 295 296
PHYA
297 RANi1 298 RD19A 299 THIOLPROTEASE 1 300
TONOL
301 U 11256.2 302 U 15108.2 303 U20347 304 U21214 305 U35826.2 306 U64912.1 307 WT755 308 X16432 309 X68150.1 310 X74604.2 311 X74733.2 312 X75162 313 X75881 227 Case S-5OOI5AI6/78[NAD 00 0SEQ ID NO: SYNGENTA NO: 314 X75883.2 315 X81697.2 316 X84078 317 X84318 318 X91398 00 319 X91959.1 320 X99609 321 Y07765.7 322 Y12295 323 Y 12295.2 324 Y14052 325 Y17053.2 326 Z12024 327 Z15157.1 328 AC002131.48 329 AC006577.32 330 ACOOO 104.26 331 AC000 132.6 332 AF080120.11I 333 AC007357.17 334 AC005 990. 335 AF069299.19 336 ACOOO 106.13 228 Case S-500) 5A116/78/NAD 00 0F S EQ ID NO:0: SYNGENTA NO: 33337 AC005 679. 338 AC004393.2 2 339 AC005388.6 Root primers 340 ARMi 00 344 ARF2 345 ARR2 345 ARRil 346 ARF13 347ARR61 34 Roo O 229 Case S-50015A116/78/NAD 00 SEQ ID NO: SYNGENTA
NO:
c! !358 AC001645.1 9 359 AC002333.l 9 9 360 AC002333.
2 361 AC007135.
2 3 362 AF098630.3 00 363 AL035538.2 4 364 AL080253.32 365 X98855.2 366 Z97338.321 Constitutive primer's 367
ACFI
368
ACRI
369 ACF2 370 ACR2 371 ACF3 372 ACR3 373 ACF4 374 ACR4 375 ACF6 376 ACR6 377 ACF7 378 ACR7 379, ACF8 230 Case S-5OOI5A/I 6/78/N'AD 402 ACR21 -231 Case S-5O1AII6/78fNAD 00 SEQ ID NO: SYNGENTA
NO:
403 AC F 22 2 404 ACR22 405 ACF23 406 ACR23 407 ACF24 00 408 ACR24 409 410 411 ACF26 412 ACR26 413 ACF27 414 ACR27 415 ACF31 416 ACR31 417 ACF32 418 ACR32 419 ACF34 420 ACR34 421 425AC3 232 Case S.50015A116/7gINAD 00 0F SEQ ID NO0:: SYNGENTA NO: 426 ACR39 427 428 429 ACF41 430 ACR41 00 431 ACF42 432 ACR42 433 ACF44 434 ACR44 435 P 436 437 ACF46 438 ACR46 439 ACF47 440 ACR47 Constitutive ORFS 441 WTr7 5 442 AF004393 443 ATU46665 444 D83531 445 ABO17643 446 ATU 56929 447 kB005560 233 Case S-5O15A/16/78[NAD 470 AC006403.l 234 Case S-5O15A/I6/78/NAD Constitutive promoters ACOO7 138.25 235 Case S-5OOI5Ai16/78[NAD 00 SE0D O SYNGENTA NO: Ic493 AC007195.
9 3 494 AF080120.11 495 AL02163 6 17 8 496 AL021712.1 5 6 497 AL022224.7 2 00 498 AL035440.
6 6 499 AL035656.
1 2 6 500 AL035709.
87 501 AL049608.1 8 4 502 AB005560 503 AB017643 504 AC002131.
4 8 505 AC00643 8 2 1 506 A1F004393 507 ATU46665 P 508 ATU56929 509 D83531 510 WF755 511 Z15157.1 512 U95973.108 513 Z97340.298 514 AC005309.
2 Ol 515 AC006300.11 2 236 Case S-5OOI5AII6/78[NAD 00 SEQ ID NO: SYNGENTA
NO:
Root promoters 518 AC007135.23 519 AF098630.3 00 520 A.L035538.245 521 AL080253.32 522 X98855.2 523 Z97338.321 524 AC00 1645.19 525 AC002333.199 526 AC002333.210 Constitutive ORFs 527 L14844.3 528 AJ001397 529 AC000104.26 Constitutive primers 530 18011 (forward) 531 18011 (reverse) 532 12771 (forward) 533 12771 (reverse) 534 12824 (forward) 535 12824 (reverse) 237 Case S.50015A116/78JNAD 00 SEQ ID NO: SYNGENTA
NO:
Cloned root promoters 536 AC002333.1 9 9 P537 AC002333 2 38 AC007135.
2 3 539 AL035538.2 4 00 540 AL080253.3 2 541 Z97338 542 AC001645 543 AF098630 544 X98855.2 Cloned constitutive promoters 545 AC002561 546 AC006234 547 AC006264.30 [548 AC006403 549 AC006526.5 7 550 AC007138 551 AC007195.9 3 552 AF080120.11 556 AL035440.66 238 Case S.50015A116178/NAD 00 SNET O 0SEQ ID NO: SNET
O
557 AL035656 558 AL035709.8 7 559 AL049608.18 4 560 AB005560 r 561 AB017643 562 AC002131 00 563 AC006438.
2 l 564 AF004393 565 ATU46665 566 ATU56929 567 Z97340 568 D83531 569 WT755 570 ATU63633 571 Z15157.1 572 AC005727 573 AC005309.
2 Ol 574 AC006300 575 AL021890 579 AJO01397 239 Case S-5O15AI16f78/NAD 00 SEQ 11D NO: SYNGENTA
NO:
580 Sequences from the PCT specification 581 pNOV2374 binary Gateway destination vector with GIG reporter gene 582 GIG, GUS intron GUS, GUS coding sequence with intron 00 0583 Ubq3(At) Arabidopsis thaliana Ubiquitin 3 (Ni promoter plus intron 584
GGCCAGTGAATTGTAATACGACTCACTA
TAGGGAGGCGG-(dT)24-3' 585 GGCCAGTGAATTGTAATACG
ACTCACTA
TAG GGAGGCGG- (dT)2 4 -3' 586 5'-TGGTTCGGACC-3' 587 TRX3T 5' 6-FAMI agacttcactgcaacatggtgcccac TAMRAY3 588 TRX3F 5' gtgtggaaatgacacagattgtga 3 589 TRX3R 5'agacgggtgcaatgaaacg 3 590 APX3 T 5' 6-FAM cgcgaacaagaacigtgctccialcatg TAMPRA 3' 591 APX3 F 5'gccgtgagctccgttctct 3 592 APX3 R 5'tcgtgccatgccaatcg 3 240 Case S-5OO15A/16/78/bNAD 00 0SEQ ID NO: SYNGENTA
NO:
F595 596- 597 P598 DNA for rice ortholog (0000026) 599 DNA (CDS) for rice ortholog (05000026) 00600 Amdio acid for rice ortholog (05000026) Leaf ORFs from the provisional application US 60/258692 601 EL-132 602 Novartis 10 6 603 Novartis66 604 Novartis8 7 605
NRA
606 PAD3 614 W7880 615 AID53941 616 Athzfph 241 Case S-5O15A/16/78JNAD 00 0SEQ ID NO: SYNGENTA NO: 617 ATU28422 618 at hrprplIa 619 AJ250341 620 AC002311.2O 621 AL035605.41 N622 AC007048.1 6 6 00 623 AC007576.49 624 AL022347,28 2 625 AC000375.3 4 626 AL096859.162 627 X98926.1 628 AC005560.1 6 629 -Z97342.384 630 AC0025 10.60 631 AL022223.48 632 AL022604.205 633 AL021768.96 634 AJ005927.2 635 AC006264.1 4 636 AC003033.139 637 AC003033.1 2 9 638 AC007138.58 639 AC005896.23 242 Case S-50015A/16/7gINAD 00 TSEQ ID NO: SYNGENTA
NO:
641 AL022347.1 2 642 ACO04561.7 4 C\643 Z97336.167 644 AC007169.
8 9 00 645 AL049608.6 646 AL035679.68 r647 AL049607.6 6 648 AC0042 18.86 649 AC002409.
8 8 650 AC0062223.
9 P 651 AF000657.30 652 AL033545.26 653 AC007230.
2 9 654 AL030978.79 655 AL022347.
2 6 656 AL022347.
2 19 657 AC002391.
1
O
2 658 Z97344.134 659 AL035440.50 2 660 U73462.2 661 AC005770.
2 0 662 AF06944 1.29 243 Case S-5O15A16/78/NAD SEQ ID NO: SYNGENTA
NO:
663 AC005496.1 7 664 AL096860.203 665 AC005957.3 666 AC005957.2 3 667 AC003974.136 668 AC006232.8 7 669 AC004681.8 6 670 AF069298.23 671 A.F096373.28 672 Z97342.284 673 AF096370.14 674 AC005388.43 675 AC006341.12 676 Z97338.384 677 AC002396.32 678 AC007260.34 679 AC005315.131 680 AC005917.17 8 681 A016.1 682 AC006526.1O 2 683 AC000098.16 684 AC003979.28 685 AC007167.24 8 244 Case S-50015 A1 6/78[N AD 00 SEQ ID NO: SYNGENTA
NO:
686 AL30978.126 687 AC005275.1 0 4 688 AC006550.33 689 AC007230.
2 l 690 AC00407 7 .l 4 1 00 691 AC00404 4 6 4 692 AC004411.
17
O
Leaf promoters from the provisional application US 60/258692 693 EL132 694 Novarisl0 6 695
NRA
696 PAD3 697 PDFI.2 698 PR.1I 699 Athhomeoa 700 AF149053 701 athzfph 702 ATU28422 703 athrprpla 704 AJ250341 705 AC002311.
2
O
706 AL035605.
4 1 707 AC007576.
4 9 245 Case S-5OOI5AI16/78[NAD 00 SEQ ID NO: SYNGENTA
NO:
708 AL022347.282 709 AC000375.
3 4 710 AL096859.162 711 X98926.1 712 AC005560.1 6 00 713 Z97342.384 714 AC002510.6O 715 AL022223.48 716 AL022604.205 717 AL021768.9 6 718 AC003033.1 39 P 719 AC003033.12 9 720 ACOO7 138.58 721 AC005896.
2 3 722 AC006284.5 723 AL022347.12 724 AC004561.7 4 725 Z97336.167 726 AC007169.
8 9 727 AL049608.65 728 AL035679.68 729 AL049607.66 730 AC004218.
8 6 246 Case S-500] 5A/16/78/NAD 00 SEQ ID NO: SYNGENTA
NO:
731 AC002409.
8 8 732 AC006223.95 733 AE000657.30 734 AL033545.26 735 AC007230.29 736 AL030978.79 737 AL022347.265 738 AL022347.219 739 AC002391.10 2 740 Z97344.134 741 AL035440.50 2 742 U73462.2 743 AC005770.
2 744 AF06944 1.29 745 AC005496.175 746 AL096860.203 747 AC005957.35 748 AC005957.
2 3 749 AC003974.136 750 AC006232.87 751 AC004681.86 752 AF069298.23 753 AF096373.28 247 Case S-5O15A/16/78/NAD 00 SEQ ID 754 755 r- 00 761 76 248 Case S-50015A/16/78/NAD Table 14 Identification of rice homologs to the Arabidopsis ORFs and their corresponding promoters The peptide sequences corresponding to the full-length Arabidopsis ORFs are formatted into a BLAST database. Then, a BLASTP comparison search is performed with the Arabidopsis sequences. Since there is no description associated with the predicted protein sequences, the stringency of the SCAN post process is increased. The default parameters of SCAN are set so that all of the results have 60 or more identities and that 60% of the alignment is made up of identities. An le-4 E-value cutoff is implemented and additionally no more than the top 5 hits are taken. Then the mRNA sequences for these predictions are retrieved and included in the listing along with the 2kb upstream promoter region. A PERL script carries out this process.
Table 14: Arabidopsis ORF (SEQ ID NO) 360 360 441 441 441 441 442 442 442 442 442 442 Homologous rice ORF (SEQ ID NO) 774 792 789 790 799 813 781 804 805 810 816 817 Promoter of rice gene with homologous ORF (SEQ ID NO) 825 843 840 841 850 864 832 855 856 861 867 868 -249- Case S-50015A/16/78/NAD 00 0
C
00
(N
O
O
o o
(N
Arabidopsis ORF Homologous rice ORF Promoter of rice gene with (SEQ ID NO) (SEQ ID NO) homologous
ORF
(SEQ ID NO) 442 822 873 443 777 828 443 782 833 443 783 834 443 806 857 443 820 871 446 791 842 446 793 844 446 808 859 449 795 846 450 776 827 450 784 835 450 787 838 450 800 851 450 807 858 451 779 830 454 803 854 458 788 839 465 786 837 -250- Case S.50015.A/16/78/NAD 00 0 Ar-abidopsis
ORF
(SEQ ID NO)
LL
466 466 466 466 467 467 471 471 471 E4 7 2 527 527 527 527 527 527 529 529 Homologous rice ORF (SEQ ID NO) 775 778 814 815 785 798 794 809 812 797 780 796 802 819 821 823 811 824 801 818 Promoter of rice gene with homologous
ORF
(SEQ ID NO) 826 829 865 866 836 EEE8 44 9 844559 860 863 848 831 847 853 870 872 874 862 875 852 869 -251 Case S-50015A/16/78/NAD 00 8'able 15.. Identification of homologous genes -lomologs are identified through the use of BLAST and SCAN software with some i dditional filters. The simplest way to identify homologs is to perform searches on a S)rotein level. The Arabidopsis sequences referred to in the table below are full length CDS which have an associated peptide sequence. A BLAST database that is a subset of 0 GenBank ver 123.0 (Release Date April 15, 2001) is created that contains all of the SPlant translated regions excluding Arabidopsis thaliana sequences. The subset is created with a PERL script. Then, a BLAST search (BLASTP specifically) is performed with all of the peptide sequences of the present invention against the 1 GenBank subset. SCAN (the Sequence Comparison Analysis, program ver licensed from the Los Almos National Laboratories) is then used with its default settings to post-process the BLAST results and to identify homologous sequences. In addition to SCAN, an E-value cutoff of le-4 is implemented. Finally, to determine if these sequences could be orthologs, another filter is implemented. This filter takes advantage of the fact that many of the Arabidopsis CDS already have description assigned by TIGR and its collaborators. When the GenBank subset is created, annotation from following fields is retained: product, function, and note (protein and nucleotide accessions and organism are also kept). For each homolog found by SCAN below the E-value cutoff, the words in the description to the text of the annotation are compared. If any of the words match, then the sequence is considered to have the same or similar function. Since many words in the description do not specify function to the following words are eliminated from being used in the comparison.
Excluded Words: The, like, protein, related, unknown, subunit, hypothetical, and, putative, precursor, clone, homolog, small, beta, class, dna, ma, alpha, gamma, has, not, been, from, to, by, long, type, induced -252- Case S-5O15AII6/78fNAD 00 STable SArabidopsis
ORF
kn (SEQ ID NO) 358 Homologous sequence CAA72271.1 YI 1483 Brassica napus DESCRIPTION: jasmonate inducible protein BA-A22966.1 D45 182 Chenopodium amaranticolor DESCRIPTION: chitinase CAA43708.1 X61488 Brassica napus DESCRIPTION: chitinase BABiI 377.1 AB05481 1 Oryza sativa DESCRIPTION: PR-3 class IV chitinase. Cht4. Catalytic domain BAB2 1374.1 AB054687 Oryza sativa DESCRIPTION: PR-3 class IV chitinase. Cht4. catalytic domain BAA19793.l AxB03 194 Oryza sativa DESCRIPTION: chitinase Ilb AAB65777.1 U97522 Vitis vinifera DESCRIPTION: class IV endochitinase. VvChi4B CAA43708.1 X61488 Brassica napus DESCRIPTION: chitinase AAB65777.1 U97522 Vitis vinifera DESCRIPTION: class IV endochitinase. VvChi4B AAB65776.1 U97521 Vitis vinifera DESCRIPTION: class IV endochitinase. VvChiAA BAB21377.1 AB054811I Oryza saliva DESCRIPTION: PR-3 class IV chitinase. Cht4. Catalytic domain BAB2 1374.1 AB054687 Oryza sativa DESCRIPTION: PR-3 class IV chitinase. Cht4. catalytic domain BAA19793.1 AB0O3194 Oryza saliva DESCRIPTION: chitinase Ilb 360 253 Case S-5O15AI16/78/NAD Arabidopsis ORF Homologous sequence (SEQ ID NO) CAA87072.1 Z46948 Sambucus nigra DESCRIPTION: hydrolyse internal glycosidic linkages of chitin.
pat hogenesis- related protein PR-3 type BAA22966.I D4518 2 Chenopodiumn amarant icolor DESCRIPTION: chitinase BAA22965.l D45 181 Chenopodium amaranticolor DESCRIPTION: chitinase BAA22968.1 D45 184 Chenopodium amaranticolor DESCRIPTION: chitinase BAA22967.I D45 183 Chenopodium amaranticolor DESCRIPTION: chitinase AAC35981 .1 AF090336 Citrus sinensis DESCRIPTION: chitin hydrolase. chitinase CHII1. chilI AAA33444.1 M841 6 4 Zea mays DESCRIPTION: chitinase A. seed chitinase CAA87074.1 Z46950 Sambucus nugra.
DESCRIPTION: hydrolyses internal glycosidic linkages of chitin.
pathogenesis-related protein, PR-3 type CAA53544.1 X75945 Beta vulgaris DESCRIPTION: chitinase. Ch4 CAA40474. 1 X57 187 Phaseolus vulgaris DESCRIPTION: chitinase. Chi4 362 BA.B16431.1 AB041519 Nicotiana tabacum DESCRIPTION: P-rich protein Nt-SubC29. Nt-SubC29 BAA 11855.1 D83227 Populus nigra DESCRIPTION: extensin like protein BAA1 1854.1 D83226 Populus nigra DESCRIPTION: extensin like protein 254 Case S-5O15A/16/78[NAD 00 SArabidopSis ORF Homologous sequence (SEQ ID NO) AAK3057 1.1 AF34665 9 Brassica napus In DESCRIPTION: extensin-like protein AAC6056 6 .l S68113 Brassica napus DESCRIPTION: proline-rich SACS 1. This sequence comes from Fig. 3 S365 CA.A62228.1 X90695 Medicago saliva 00 DESCRIPTION: peroxidase2. CAA0 98 81.1 AJ 01 1939 Trifoliu mre pens DESCRIPTION: peroxidase. prx2 441 AAC0481 1.1 AF037460 Fritilaria agrestis DESCRIPTION. GF14 protein.
GRF
AAF76226. I AF272572 Populus x canescens DESCRIPTION: 14-3-3 protein. 14-3-3P20-1 AAF05737.1 AF191746 Liliumilongiflorum DESCRIPTION: 14-3-3-like protein AAC49894. 1 U9 1726 Nicotiana tabacum DESCRIPTION: 14-3-3 isoform e. T14-3e AAB340395.1 U80070 Mesembryaflthemum crystaffinlum DESCRIPTION: C-box binding factor. 14-3-3-like protein.
GBF
AAB09580.1 U70533 Glycine max DESCRIPTION: SGFI4A. 14-3-3 related protein AAB07457.1 U65957 Oryza saliva DESCRIPTION: GF14-c protein, rice 14-3-3 protein hornolog; osGF]4c AAB33304.I S77133 Zea mays DESCRIPTION: GF14-6. GRF1. 14-3-3 protein homolog; This sequence comes from Fig. XAA9943 1.1 L29 150 Lycopersicon esculentumn 255 Case S.50015A/16/78[NAD 00 0 Arabidopsis ORF Homologous sequence N (SEQ ID NO) DESCRIPTION: 14-3-3 protein homologue CAA74592.1 Y14200 Hordeum vulgare DESCRIPTION: 14-3-3 protein AAB07456.1I U65956 Oryza saliva DESCRIPTION: GF14-b protein, rice 14-3-3 protein homolog; osGF14b 00 -AAD27827.
2 AF121198 Picea glauca ci DESCRIPTION: 14-3-3 protein. 14-3-3EB9D -AAD27823.2 AF121194 Populus x calesces DESCRIPTION: 14-3-3 protein. 1 4-3-3P20-2 BA.A0371 1.1 D16140 Oryza saliva DESCRIPTION: brain specific protein. S94 C.A44259. 1 -X62 .3-8 8 H .ordeu m v ,ulg are DESCRIPTION: 14-3-3 protein homologue CAA66309. 1 X97724 Solanum tuberosum DESCRIPTION: 14-3-3 protein. leaf specific CAA63658.1 X93 170 H-ordeumn vulgare DESCRIPTION: Hvl4-3-3b.
AAA85817.1 U15036 Pisum sativumn DESCRIPTION: 14-3-3-lie protein .CAB42546.
2 A1238681 Pisum sativum DESCRIPTION: 14-3-3-like protein. 14-3-3 CAA53700. I X76086 Cucurbita pepo DESCRIPTION: 14-3-3 protein 32kDa endonuclease. A215.
single polypeptide AAA33505.1 M9 68 5 6 Zea mays DESCRIPTION: regulatory protein. GF14-12 AAK26634.1 A.F342780 Brassica napus 256 Case S-5O15A/16/78fNAD ArabidopSis ORF Homologous sequence ernC in NO) DESCRIPTION: GF14 omega. 14-3-3 protein CAA44642.l X62838 Genothera elata subsp. hookeri DESCRIPTION: protein kinase C inhibitor homologue CAA72383.1 Y 11687 Solanumn tuberosum DESCRIPTION: 14-3-3 protein. 34G AAC49892.1 U91724 -Nicotiana tabacum, DESCRIPTION: 14-3-3 isoform c. T14-3c CAA72094. 1 Y 11211 Nicotiana tabacum DESCRIPTION: 14-3-3-like protein B CAA72382 1 Y1 1686 Solarium tuberosum DESCRIPTION: 14-3-3 protein. CAB42547.1 AJ238682 Pisum sativumr DESCRIPTION: 14-3-3-lie protein. 14-3-3 CAA7238 1.1 Y1 1685 Solanium tuberosurn DESCRIPTION: 14-3-3 protein. 16R AAC49891.I U91723 Nicotiana tabacum DESCRIPTION: 14-3-3 isoform b. T14-3b AAB07458. 1 U65958 Oryza sat iva DESCRIPTION: GFI4-d protein, rice 14-3-3 protein hornolog; osGFl4d BAB 11739 .1 AB042193 Triticumn aestivum DESCRIPTION: TaWINi1. TaWIN 1. TaWIN I is a member of 14- 3-3 protein famfly AAC49895 1 U9 1727 Nicotiana tabacumn DESCRIPTION: 14-3-3 isoform f. T14-3f CAA65147.l X95902 -Lycopersicon esculentum DESCRIPTION: 14-3-3 protein. tft3 gene 146.1 X95901 .Lycopersicofl esculentumn 257 Case S-50015A116/78/NAD Arabidopsis ORF Homologous sequence (SEQ ID NO) DESCRIPTION: 14-3-3 protein. tft2 gene CAB65693. 1 AJ270959 Lycopersicon esculentum DESCRIPTION: tft3 14-3-3 protein. tft3 AAC17447.l AF066076 Helianthus annuus DESCRIPTION: 14-3-3-lie protein CAA72095.1 Y 11212 Nicotiana tabacum DESCRIPTION: 14-3-3-lie protein A 148.1 X95903 Lycopersicon esculentum DESCRIPTION: 14-3-3 protein. tft5 gene CAC03467.1 Y19105 Chlamydomonas reinhardiji DESCRIPTION: 14-3-3 protein CAA55964. I X79445 Chiamydomonas reinhardti' DESCRIPTION: 14-3-3 protein CAA6O800. 1 X87370 Solanum tuberosum DESCRIPTION: 14-3-3 protein. RA2 15. root specific 149.1 X95904 Lycopersicon esculentum DESCRIPTION: 14-3-3 protein. tft6 gene BAB 11740.1 AB042194 Triticum aestivum DESCRIPTION: TaWIN2. TaWIN2. TaWIN2 is a member of 14- 3-3 protein family CAA72384.1 Yl 11688 Solanium tuberosum DESCRIPTION: 14-3-3 protein. A.AC49893.1 U91725 -Nicotiana tabacum DESCRIPTION: 14-3-3 isoform d. T14-3d 145.1 X95 900 Lycopersicon esculentum DESCRIPTION: 14-3-3 protein. tftl gene AAB0958 1.1 U70534.Glycine max DESCRIPTION: SGF14B. 14-3-3 related protein 258 Case S-5O15AI16/78/NAD 00 SArabidopsis ORF Homologous sequence idpss
R
(SEQ ID NO) S442 AAB51393.1 U92651 Brassica oleracea var. botrytis DESCRIPTION: tonoplast intrinsic protein bobTlP26-I.
TIP
BAAI27l 1.1 D84669 Raphanus sativus DESCRIPTION: water channel. VM23. VIPI. gammna-Tip homologue (Ni -AAD39372. I AF1 18381 Brassica napus 00 DESCRIPTION: tonoplast intrinsic protein. gamma-TIP2.
aquapoflfl BA]B12722.1 AB048248 Pyrus communs DESCRIPTION: gamrma tonoplast intrinsic protein. Py-gTIP CACl618.l AJ251652 Medicago truncatula DESCRIPTION: water channel. aquaporin. aqpl CAB145653.1 AJ243309 Pisum sativum DESCRIPTION: putative tonoplast intrinsic protein. tip AAF78757.l AF27 1660 Vitis berlandieri x Vitis rupestris DESCRIPTION: water channel. putative aquaporin TIP3. TIP3.
TIP-like protein AAF82790.1 AF275315 Lotus japorucu......
DESCRIPTION: a water-selective transport MIP. water-selective transport intrinsic membrane protein 1. aquaporin; LIMP I -BAA5O7.1
D
2 5 5 3 4 Oryza sativa DESCRIPTION: gamma-Tip. yk333 AAA02946.1 -L12257 Glycine max DESCRIPTION: putative channel protein. nodulin-26 AAC04846.1 AF020793 Medicago sativa DESCRIPTION: tonoplast intrinsic protein homolog MSMCP1.
msmcp I AAG44946.1 AF290619 Nicotiana glauca 259 Case S-50015Ai 16/78/NAD Arabidopsis
ORF
(SEQ ID NO) AAA02947.l L12258 Glycine max DESCRIPTION: putative channel protein. nodulin-26 CAA69353.l Y08161 Nicotiana tabacum DESCRIPTION: aquaporin 1. aqpl AAC09245.I AF037061 Zea mays DESCRIPTION: tonoplast intrinsic protein. ZmT1PI. water channel protein; aquaporin AAB 17284.1 U43291I Mesembryanthemum crystallinuni DESCRIPTION: tonoplast intrinsic protein. TIP. water channel protein AAD 10494.1 U86762 Triticum aestivumr DESCRIPTION: gammu-a-type tonoplast intrinsic protein, gamma-
TIP
CAA56553.1 X80266 Hordeum vulgare DESCRIPTION: gamma-TIP- like protein CAB6 1841.1 AJ242805 Sporobolus stapfianus DESCRIPTION: putative gamma tonoplast intrinsic protein (TIP) AAK26767.I AF326500 Zea mays DESCRIPTION: tonoplast membrane integral protein ZmTIPI-2 CA.A64952. 1 X95650 Tulipa gesneriana DESCRIPTION: tonoplast intrinsic protein. tipi AAD3 1847.1 A -3 33531 "Me-sembry'anthemum crystafi*-nurnm-*--,-,* DESCRIPTION: water channel protein MipI. CA.B39758.1 AJ133748 Picea abies DESCRIPTION: putative water channel. major intrinsic protein.
mnipfg. aquaporin-lie protein CAA06335.1 AJ005078 Picea abies 260 Case S-5O15AI16/78fNAD 00 SArabidopsis ORF Homologous sequence (SEQ ID NO) DESCRIPTION: aquaporin-like protein. MIPr 1394.1 U92652 Brassica oleracea var. botrytIis DESCRIPTION: tonoplast intrinsic protein bobTIP26-2.
TIP
AAG44945.l AF290618 Nicotiana glauca DESCRIPTION: putative delta TIP. MIP2 CAB55837.1 AJ245953 Spinacia oleracea 00 DESCRIPTION: putative aquaporin. delta tonoplast intrinsic c-i protein. dtip. highly expressed in leaf, petiole and root and not in epidermal and rneristemnatic cefls AAB04557. 1 U62778 Gossypium hirsutum DESCRIPTION: delta-tonoplast intrinsic protein. delta-TIP 185.1 X95951 H-elianthus annuus DESCRIPTION: aquaporin AAF78758.1 AF27 1661 Vitis berlandieri x Vitis rupestris DESCRIPTION: water channel. putative aquaporin TIP I. TIPI AAD31848.l AF133532 Mesernbryanthemum crystalliflum DESCRIPTION: water channel1p pr otein MipK. MipK CAB95746.2 AJ289866 Vitis vinifera DESCRIPTION: water chanel. putative aquapormn. delta-TIP AAB23597.2 S45406 Nicotiana tabacum DESCRIPTION: root-specific gene regulator. TobRB7. This sequence comes from Fig. I; conceptual translation presented here differs from translation in publication; mismatches (11, 13,4 8,7 6,8 3,9 5,103,19 7) gap (248 25 0).
CAA38634.1 X54855 Nicotianatabacum DESCRIPTION: possible membrane channel protein 184.1 X95950 Helianthus annuus 261 Case S-5O15AI16/78INAD Arabidopsis ORE (SEQ ID NO) 00 Homologous sequence DESCRIPTION: aquaporin AAkB53329. I U95008 Lycopersicon esculentumn DESCRIPTION: Rb7. RB7. putative water channel protein AAC39480.1 AF0471'73 Vernicia fordii DESCRIPTION: aquaporin AAB67881.l U65700 Solarium tuberosum DESCRIPTION: membrane channel protein. potRB7. putative CAA49854.1 X70417 Antirrhinum majus DESCRIPTION: integral membrane protein BAAO8 107.1 D45077 Cucurbita sp.
DESCRIPTION: MP23 precursor BAA19129.1 ABOO0506 Daucus carota DESCRIPTION: simrilar to EMBL Accession Number: X54855 187.1 X95953 Helianthus annuus DESCRIPTION: aquaporin. root specific;, homologue to TobRb7 AAK26769.l AF326502 Zea mays DESCRIPTION: tonoplast membrane integral protein ZmTIP2-2 AAD 10495 i U86763 Triticumn aestivumn DESCRIPTION: delta-type tonoplast intrinsic protein. delta-TIP BAA3 1452.1 ABO10416 Raphanus sativus DESCRIPTION: water channel of vacuolar membrane; The function a Xenopus oocyte system. delta-VM23. VIP3. a homnolog of delta-TIP CAA.65186.1 X95952 Helianthus annuus DESCRIPTION: aquaporin BAAO8 108.1 D45078 Cucurbita sp.
DESCRIPTION: MP28 AAX3 3 7 10.1 16 97 7 Pe t uni a x h yb r id a 443 262 Case S-50015A116/7SINAD Arabidopsis ORF (SEQ ID NO) Homologous sequence DESCRIPTION: glutamate decarboxylase. gad AAA33709.I 1-16797 Petunia x hybrida DESCRIPTION: glutamate decarboxylase. gad AAB40608.1 U54774 Nicotiana tabacum DESCRIPTION: glutamate decarboxylase. NtGADI. calmodulin regulated enzyme; calmodulin-binding protein AAK18620.1 AF352732 Nicotiana tabacum DESCRIPTION: converts glutamate to gamma- am-inobut yric acid.
Glutamate decarboxylase isozyme 3. GAD; GAD3; NtGAD3; calciura/calmodulin-depefldeflt enzyme AAC24195.1 AF020425 Nicotiana, tabacum DESCRIPTION: calmoduhin binding protein, glutamate decarboxylase isozyme 1. NtGADL1 calciu m-c amodun- depenldent enzyme AAC39483.1 AF020424 Nicotiana tabacum DESCRIPTION: glutamate decarboxylase isozyme 2. NtGAD2.
c alciu m-catrnoduhlin-dCeendenlt enzyme BAB32868.1 AB056060 Oryza sativa DESCRIPTION: glutamate decarboxylase.
GAD
BAB32870 .1 AB056062 Oryza sativa DESCRIPTION: glutamate decarboxylase.
GAD
CAA568 12 I X80840 Lycopersicon esculentumn DESCRIPTION: homology to glutamate decarboxylases; putative start codon BAB32869.1 AB.5661 .Oryza sativa DESCRIPTION: glutamate decarboxylase.
GAD
BAB32871 .1 AB056063 Oryza sativa. 263 Case S.50015Ai16/78INAD 00 0 Arabidopsis ORF H-omologous sequence NI (SEQ ID NO) DESCRIPTION: glutamate decarboxylase.
GAD
r 444 AAB69871.I AF016897 Oryza sativa DESCRIPTION: GDP dissociation inhibitor protein OsGDI2.
OsGDI2. GDP dissociation inhibitor-2 CAA0673 1.1 AJ005836 Cicer arietinurn DESCRIPTION: GDP dissociation inhibitor. gdi 00 AAB69870.1 AF016896 Oryza sativa DESCRIPTION: GDP dissociation inhibitor protein OsGDI 1.
OsGDII. GDP dissociation inhibitori 446 AAC497 16.1 U55035 Brassica rapa DESCRIPTION: small GTP-binding protein Bsarla. bsarla AAC32610.1 AF084005 Avena fatua DESCRIPTION: ras-like small monomeric GTP-binding protein.
SARI. SARI p 127.i AF048825 Malus x domestica DESCRIPTION: GTP-binding protein Sari AAF17254.1 AF210431 Nicotiana tabacumn DESCRIPTION: small GTP-binding protein Sarl BNt BAA13463.1 D87821 Nicotiana tabacum DESCRIPTION: NtSarl protein. NtSARI BA-A84612.I AP000492 Oryza sativa DESCRIPTION: ESTs AU0781II7(E I380),C72293(E 1380) correspond to a region of the predicted gene. siiar to SARI1/GTPbinding secretory factor. (AF001 308) CAA69699. 1 Y08423 Nicotiana plumbaginifolia DESCRIPTION: small GTP-binding protein AAC49717.1 U55036 Brassica rapa DESCRIPTION: small GTP-binding protein Bsarlb. bsarlb 264 Case S-50015A116/78INAD 00 00 Arabidopsis ORF Homologous sequence (SQ D O) AAA34 168.1 Li2051 Lycopersicon esculentumn DESCRIPTION: GTPase. SAR2 CAA69700.1 Y08424 Nicotiana plumbaginifolia DESCRIPTION: small GTP-binding protein CAA666 10.1 X97967 Nicotiana tabacum DESCRIPTION: GTP-binding protein. SARI1 447 AAB6987 1.1 AF016897 Oryza saliva DESCRIPTION: GDP dissociation inhibitor protein OsGDI2.
OsGDI2. GDP dissociation inhibitor2 AA.B69870.1 AF0 16896 Oryza sativa DESCRIPTION: GDP dissociation inhibitor protein OsGDI 1.
OsGD1I. GDP dissociation inhibitorl CAA0673 1.1 AJ005836 Cicer arietinumn DESCRIPTION: GDP dissociation inhibitor. gdi AAB8O7 17.1 AF012823 Nicotiana tabacumn DESCRIPTION: inhibits dissociation of GDP from GTP binding proteins. GDP dissociation inhibitor. GD! 449 AAB3 108.1 U55032 Brassica napus DESCRIPTION: aspartic protease. protease CAA54478.1 X77260 Brassica oleracea DESCRIPTION: aspartic protease. putative CAA56373.1 X80067 Brassica oleracea DESCRIPTION: putative aspartic protease BAA06875.1 D32144 Oryza sativa DESCR-IPTION: aspartic protease BAA06876.1 D32165 Oryza sativa DESCRIPTION: aspartic protease CAA39602.1 X56 136 Hordeum vulgare 265 Case S-5O15A16/78/NAD A rabidopsis ORF Homologous sequence (SEQ ID NO) DESCRIPTION: aspartic proteinase. includes put. pre- and prosequences, cleavage sites not determined CAA6 1253.1 X88774 Brassica oleracea DESCRIPTION: aspartic protease. putative 450 CAA56590. I X80362 Brassica juncea DESCRIPTION: S-adenosyI-L-methioflune synthetase. msams AAK29409. 1 AF346305 Elaeagnus umbellata DESCRIPTION: S-adenosyl-L-methionifle synthetase. SAMS1 AAK294 10.1 AF346306 Elaeagnus umbellata DESCRIPTION: S-adenosyl-L-methioline synthetase. SANIS2 CAA95856.1 Z71271 Catharanthus roseus DESCRIPTION: L-methi onine ATP S-adenosyl-L-methionifle PPi Pi. S-adenosyl-L-methionifle synthetase 1. CRSAMS1.
functional expression in Escherichia cob CAA80865.1 Z24741 -Lycopersicon esculentumn DESCRIPTION: S -adeno syl-L- methio nine synthetase AAG42490.1 AF32 1001 Suaeda maritima subsp. salsa DESCRIPTION: S -adenosytmethiolnine sythetase 2 CAA80866. 1 Z24742 Lycopersicon esculentum DESCRIPTION: S-adenosyl-L-methI fline synthetase CAA95857.1 Z71272 Catharanthus roseus DESCRIPTION: L-Methionine ATP S-adenosyl-L-methioninfe PPi Pi. S-adenosyI-L-methionine synthetase 2. CRSAMS2.
functional expression of in Eseherich-ia cob AAD48485.1 AF170798 Petunia xhybrida DESCRIPTION: S -adenosyI-L-rnethiolife synthetase AAD56396.1 AF183891 Petunia x hybrida DESCRIPTION: S-adenosyI-L-methioflife synthetase. sam2 266 Case S-5OO15A16/78/NAD 00 k rabid opsis ORF Homologous sequence (SEQ ID NO) A.AG 17666.1 AF27 1220 Brassica juncea DESCRIPTION: S-adenosy~methiorune synthetase. MSAMS2 CAA9585 8 .1 Z7 1273 CatharanthuS roseus DESCRIPTION: L- methionine ATP S -adenosyl-L-methioninre PPI Pi. S-adenosy1-L-methioriine synthetase 3. CRSAMS3.
functional expression in Eschenichia cobi 00 CAA81481.1 Z26867 Oryza sativa DESCRIPTION: S-adenosyl rnethionine synthetase BAA96637. I AP00248 2 Oryza sativa DESCRIPTION: Similar to Oryza sativa S-adenosylmethiofline synthetase I (P4661 1) AAA79831.1 U38186 Pinius banksiafla DESCRIPTION: S-adenosyl methionine synthetase AAG17036.I AF187821 Pinus contorta DESCRIPTION: catalyzes the reaction between methionilie and ATP to S -adenosylmethio nne. S -adenosyrlethio nne synthetase.
sams2 CA.B83039.I AJ277206 Camellia sinensis DESCRIPTION: s- adeno sylfle t hinonine synthetase BAA94605.l ABO4 1534 Camelia sinensis DESCRIPTION: s- adenosylflet hionhie synthetase. S AM AAA8137 7 .1 U17239 Actinidia ch~iensis DESCRIPTION: S -adenosy1Tet hJofnine synthetase AA.B38500. 1 U79767 Mesembryanthernum crystallinumn DESCRIPTION: S-adenosytffethionine synthetase. methionine adenosyltransferase AAA8 1378.1 U 17240 Actinidia chinensis DESCRIPTION: S -adenosylnethio nine synthetase 267 Case S-5O15AI16/78fNAD Arabidopsis
ORF
(SEQ ID NO) Homologous sequence CAA80867.1 Z24743 Lycopersicon esculentum DESCRIPTION: S -adenosy- L- methioflnine synthetase AAF42974. I AF1 27243 Nicotiana tabacumn DESCRIPTION: S-adenosyl-L-methiofllfe synthetase.
SAMS
CAA57696.1 X82214 Petunia x hybrida DESCRIPTION: methionine adenosyltransferase. sam I AAA2O1 12.1 M73430 Populus x generosa DESCRIPTION: S -adenosyl methionine synthet ase AAC05590.1 U82833 Oryza sativa DESCRIPTION: S-adenosyl-L-methioflife synthetase. pOS- S AM S2 AAB71138.1 AF004317 Musa acuminata------- DESCRIPTION: S-adenosyl-L-meth~iol.1e synthetase homnolog BAA09895.1 D63835 Hordeumn vulgare DESCRIPTION: S-adenosymethioflife synthetase AAA3 3274.1I M61 882 Dianthus caryophyllus DESCRIPTION: S-adenosylmeth-ionine synthetase. CARSAM2 CAA57581.l X82077 Pisum sativum DESCRI PTI ON: methionine adenosyltransferase. S AMs2 AAA58773 .1 L36681 Pisumnsativumn DESCRIPTION: S -adenosyirnet Nofninle synthase. precursor for ethylene and polyamine biosynthesis AAA58772.1 L36680 Pisum sativumn DESCRIPTION: precursor for ethylene and polyarnine biosynthesis.
S-adenosylimetJofnCn synthase CAA57580. 1 X82076 Pisumn sativumn DESCRIPTION: methionine adenosyltransferase. SAMs 1 268 Case S-50015A116/7SINAD Arabidopsis ORF Homologous sequence (SEQ ID NO) AAA81379.1 U 17241 Actinidia chneiSi DESCRIPTION: S -adeno symet hioinel synthetase AAA33857.I M62758 Petroselinum crispumn DESCRIPTION: S -adenosylmnethio ninle synthetase. SMS- I AAB71833.1 AF008568 Chiamnydomonas reinhardtii DESCRIPTION: S -adeno sylmethio nine synthetase.
CHRSAMS
AAA33858.I M62757 Petroselinum crispum DESCRIPTION: S-adenosylmethiorufle syrithetase. SMS-2 AAA73483.l U27348 Populus deltoides DESCRIPTION: S-adenosyl-L-methioflie synthetase. Sami BAA21726.I AB006187 Nicoriana tabacum DESCRIPTION: S-adenosyirnethiofline synthase. CAA65455.I X96680 -Catharanthus roseus DESCRIPTION: methionirie adenosyltransferase. S AM 1 CAA59508. 1 X85252 Cicer arietinum DESCRIPTION: SAM-synthetase. SAMs.
AAF78525.l AF195233 Pyrus pyrifolia.....
DESCRIPTION: S-adenosylmethioflife synthase. SA.MS 454 AAA34046.1 M83940 Spiniacia oleracea DESCRIPTION: 10- formyltetrahydrofolate synthet ase. sfs 1 465 CAA64455.I X94999 Mesembryanthemum crystallinumn DESCRIPTION: V-type ATPase c subunit. VmaclI AAC49473.1 U 16244 Kalanchoe daigremontiana DESCRIPTION: V-type H+-ATPase 16 kDa subunit. c subunit, presumed H+ conducting pore of vacuolar-type H+ ATPase; integral membrane protein, localized to vacuole and possibly other endomembranes AAA82977.I U 13670 Gossypium hirsutum 269 Case S-50OI5AI16/78[NAD Arabidopsis ORF Homologous sequence (SEQ ID NO) DESCRIPTION: vacuolar H+-ATPase proteolipid (16 kDa) subunit. cval6-4 AAA82976.1 U 13669 Gossypium hirsutumn DESCRIPTION: vacuolar l-+-ATPase proteobld (16 kDa) subunit. cval6-2 CAA67356.1 X98851 Beta vulgaris DESCRIPTION: proton channel, proteobld. subunit c of V-type ATPase BAA89595.1 AB036923 Citrus unshiu DESCRIPTION: vacuolar H+-ATPase c subunit. Cit-VATP c-2 BAA89594.1 AB036922 Citrus unshiu DESCRIPTION: vacuolar H+-ATPase c subunit. Cit-VATP c-i BAA75542.i AB024275 Citrus unshiu DESCRIPTION: protein translocation. vacuolar H+-ATPase c subunit. CitVATP c-2 BAA75515.1 AB024274 Citrus unshiu DESCRIPTION: protein translocation. vacuolar H+-ATPase c subunit. CitVATP c- I AAC 12797.1 AF022925 Vigna radiata DESCRIPTION: adenosine triphosphatase. c-subunit of V-ATPase AAF04597. 1 AF1938 14 Dendrobiumn crumenatumn DESCRIPTION: vacuolar H+-ATP synthase l6kDa proteolipid subunit. V-ATPase subunit AAC 12798.1 AF022926 Vigna radiata DESCRIPTION: adenosine triphosphatase. c-subunit of V-ATPase BAA89596.1 AB036924 Citrus unshjiu DESCRIPTION: vacuolar H+-ATPase c subunit. Cjt-VATP c-3 BAA755 16.1 AB024276 Citrus unshiu 270 Case S-50015 A/16/78INAD 00 SArabidopsis 0R Homologous sequence (SEQ ID NO) DESCRIPTION: protein translocation. vacuolar H+-ATPase c suburut. CitVATP c-3 *AAK01292.1 AF33 1709 Avicennia marina DESCRIPTION: vacuolar ATPase subunit c. V-ATPase subunit c CAA65062.1 X95751 Nicotiana tabacumn DESCRIPTION: proteolipid, proton channel. c subunit of V-type 00 ATPase. isoform 1 AAB64 199.1 AF010228 Lycopersicon esculentumn DESCRIPTION: vacuolar proton ATPase proteolipid subunit.
LVA-PI; induced by gibberellin AAA68 175.1 U27098.....za saliva DESCRIPTION: H+-ATPase. vatp-P1 CAA7.1930.1 Y 11037 Beta vulgaris DESCRIPTION: BV-16/1 CAA65063.1 X95752 -Nicotiana tabacum DESCRIPTION: proteolipid, proton channel. c subunit of V-type ATPase. isoform AAA327 12.1 M73232 Avena saliva DESCRIPTION: H+-ATPase. vatp-P1 BAA23351.1 AB003941 Acetabularia acetabulurn DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit .B A A 3 3 5 2 .1 A B 0 0 3 9 4 2 -A c e ta b u la ria a c e t a b u l u m DESCRIPTION: vacuolar type H+-ATPase proteohpid subunit BAA23350.l AB003940 Acetabularia acetabulum DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit BAA2 1683.1 AB003938 Acetabularia acetabulumn DESCRIPTION. vacuolar type H+-ATPase proteolipid subunit BAA2 1682.1 AB003937 Acetabularia acetabulum -271 Case S.50015A116/78JNAD 00 0 Arabidopsis ORF H-omologous sequence
(N
S (SEQ ID NO) DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit BAA23349 .1 AB003 939 Acetabularia a ceta bulu m DESCRIPTION: vacuolar type H+-ATPase proteotipid subunit CAA63I 18.1 X92374 Zea mays DESCRIPTION: V-type H+-ATPase. subunit
C
riCAA63119.l X92375 Zearmays 00 DESCRIPTION: V-type H+-ATPase. subunit
C
S466 BAA2 1682.1 AB003937 Acetabularia acetabulurn DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit BAA23349. I AB003939 Acetabularia acetabuluni DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit -AAF04597.I AF193814 Dendrobium crumenatumn DESCRIPTION: vacuolar H+-ATP synthase l6kDa proteolipid subunit. V-ATPase subunit AAC12798.1 A1022926 Vigna radiata DESCRIPTION: adenosine triphosphatase. c-subunit of V-ATPase AAC 12797.1I AF022925 Vigna racliata DESCRIPTION: adenosine triphosphatase. c-subunit of V-ATPase 4999 esemryanhe r u mcr yst alli num DESCRIPTION: V-type ATPase c subunit. Vmacl AAC49473.1 U 16244 Kalanchoe daigremontiafla DESCRIPTION: V-type H+-ATPase 16 kDa subunit. c subunit, presumed H+ conducting pore of vacuolar-type H+ ATPase; integral membrane protein, localized to vacuole and possibly other AAA82977.1 U13670 Gossypium hirsultum DESCRIPTION: vacuolar H+-ATPase proteolipid (16 kLa) subunit. cva 16-4 272 Case S-50015AJ1I6/78INAD 00 SArabidopsis ORF Homologous sequence (SEQ ID NO) AAA82976.1 U 13669 Gossypium hirsutum DESCRIPTION: vacuolar H+-ATPase proteofipid (16 kDa) subunit. cval6- 2 CAA67356.I X98851 Beta vulgaris DESCRIPTION: proton channel, proteohld. subunit c of V-type ATPase 00 BAA8 .915 95.I- AB036 923 .Cit -rus unsh iu c-i DESCRIPTION: vacuolar H+-ATPase c subunit. Cit-VATP c-2 BAA89594.I AB036922 Citrus unshiu DESCRIPTION: vacuolar H+-ATPase c subunit. Cit-VATP c- I -BAA75542. I AB024275 Citrus unshiu DESCRIPTION: protein translocation. vacuolar H+-ATPase c subunit. CitVATP c-2 BAA89596.1 AB036924 Citrus unshiu DESCRIPTION: vacuolar H+-ATPase c subunit. Clt-VATP c-3 BAA755 16.1 AB024276 Citrus unshiu DESCRIPTION: protein translocation. vacuolar H+-ATPase c subunit. CitVATP c-3 AAK01292.1 -AF331709 Avicennia marina DESCRIPTION: vacuolar ATPase subunit c. V-ATPase subunit c AAB64199.I A.F010228 Lycopersicon esculentumn DESCRIPTION: vacuolar proton ATPase proteolipid subunit.
LVA-P1; induced by gibberellin CAA65062.I X95751 Nicotiana tabacum DESCRIPTION: proteolipid, proton channel. c subunit of V-type ATPase. isoform 1 CAA7 1930.1 Y1 11037 Beta vulgaris DESCRIPTION: 273 Case S.50015A116/78JNAD 00 Arabidopsis OEHomologous sequence S (SEQ ID NO) AAA68175.I U27098 Oryza sativa DESCRIPTION: H+-ATPasC. vatp-P 1 CAA65063.I X95752 Nicotiana tabacum DESCRIPTION: proteolipid, proton channel. c subunit of V-type ATPase. iso form 2 AAA327 12.1 M73232 Avena sativa 00 DESCRIPTION: H+-ATPase. vatp-Pl BAA23352.1 AB003942 Acetabularia acetabulum DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit BAA2335O.l AB003940 Acetabularia acetabulum DESCRIPTION: vacuolar type IA+-ATPase proteolipid subunit BA.A21683.I AB003938 Acetabularia acetabuliim DESCRIPTION: vacuolar type H+-ATPase proteolipid subunit BAA2335 1.1 AB003941 Acetabularia acetabulum DESCRIPTION: vacuolar type H+-ATPaSe proteolipid subunit CAA63118.l X92375 Zea mays DESCRIPTION: V-type H+-ATPase. subunit
C
467 AAD56018.l AF180758 Vitis riparia DESCRIPTION: 60S ribosomal protein L 10. QM. similar to QM family proteins A.AG2743 1 .1 AF295 636 Elaeis guineensis DESCRIPTION: QM-hike protein. tumor supressor protein F34 765 A F227 62,0 E uph o r bia es u1-a DESCRIPTION: 60S ribosomal protein 1-10. belongs to the LIOE family of ribosomal proteins BAA19462.l AB001891 Solanium melongena 274 Case S-50015A./ I6/78INAD Arabidopsis 0R (SEQ ID NO) AAB66347.1 kF013804 Pinus taeda DESCRIPTION: WiLm's tumor supressor homolog. lp2O. AAA 17419.1 U06108 Zea mays DESCRIPTION: QM protein AA.A98698.1 U55048 Oryza sativa DESCRIPTION: QM. similar to human QM protein, a putative tumor supressor, and to maize ubiquinol-cytochrome C reductase complex subunit VI requiring protein SC34 CAA57339.I X81691 Oryza sativa DESCRIPTION: putative tumor suppresser. SC34 CAA57340.1 X81692 Oryza sativa DESCR-IPTION: Putative tumor supressor. SG 12 AAG 17477.1 AF106846 Oryza sativa DESCRIPTION: QM protein AAA99158.l U55212 Oryza sativa DESCRIPTION: putative tumor suppressor. Wilrns' tumor-related protein QM CAA78461.I Z14083 Nicotiana tabacum DESCRIPTION: HOMOLOGIE with Human WILM's tumorrelated protein HUMQM 13AA19414.1 AB001582 Solanumn melongena DESCRIPTION: QM famidly protein. TMOO2 CAA5241 1.1 X74403 Phaseolus vulgaris DESCRIPTION: cyclophilin. Cyp CAA69622. 1 -Y08320 Digitalis lanata DESCRIPTION: cyclophyLin BAA25755.l AB012947 Vicia faba 527 275 Case S.50015A116/78INAD Arabidopsis ORF Homologous sequence (SEQ ID NO) DESCRIPTION: vcCyP CAA69598.I Y08273 Digitalis lanata DESCRIPTION: cyclophlin. CYP18 CAA59468.1 X85185 Catharanthus roseus DESCRIPTION: cyclophilin. PCKR1I CAA76054.1 Y16088 Lupinus luteus DESCRIPTION: cytosolic form of cyclophilin AAFOO471.1 AF178458 Lupinus luteus DESCRIPTION: cytosolic cyclophiln.
CYCLOPH
AAA63543.1 M55019 Lycopersicon esculentum DESCRIPTION: cyclophilin. CyP. the published citation gene name is 'CyP', but the submission gene name is 'Rot I' AAD22975.l AF126551 Solanum tuberosum subsp. tuberosum DESCRIPTION: cyclophilin. cytosolic; peptidyl-prolyl cis-trans isomerase; Gyp, PPlase; romatase AAA62706. 1 M55018 Brassica napus DESCRIPTION: cyclophilin. GyP. The published citation gene name is 'GyP', but the author submission gene name is 'Rot I' AAF65770.l kF242312.Euphorbia esula DESCRIPTION: accelerate protein folding. cyclophilin. peptidylprolyl cis-trans isomerase;
PPIASE
CAA48638. 1 X68678 Zea mays DESCRIPTION: pept idyl-prolyl cis-trans isomerase. cyclophilin AAA63403.1 M55021 Zea mays DESCRIPTION: cyclophlin. CyP. the published citation gene name is 'GyP', but the submidssion gene name is 'Rot 1V 1386.1 U92087 Solanum cornmersonii DESCRIPTION: stress responsive cyclophilin.
SCCYPI
276 Case S-50015A/16/78/NAD Arabidopsis ORF (SEQ ID NO) Homologous sequence 0AA7045.1 L29469 Oryza saliva DESCRIPTION: cyclophilin 2. Cyp2 AAA57046 .1 L29470 Oryza saliva DESCRIPTION: cyclophilin 2. Cyp2 AAC05639.1 AF052206 Chiamydomonas reinhardtii DESCRIPTION: cyclophilin 1. cypi. immunophilin; peptidyl prolyl isomerase AAA57044.1 L29471 Oryza sativa DESCRIPTION: cyclophilin 1. Cypi AAA32642.I L13365 Albium cepa DESCRIPTION: cyclophilin. CyP. putative AAG01536.1 AF291 180 Capsicum annuum DESCRIPTION: cyclophiin CACYPI AA4430. 1 L32095 Vicia faba DESCRIPTION: cyclophilin AAGO3 106.1I AC073405 Oryza sativa DESCRIPTION: similar to Arabidopsis thaliana Peptidyl-prolyl cis-trans isomerase (P3479 3' incomplete CAA10766.1 AJ132763 Pseudotsuga menziesii DESCRIPTION: catalyze the cis-trans isomenzation of proline peptide bonds. cyclophilin AAB69871.l AF016897 Oryza saliva DESCRIPTION: GDP dissociation inhibitor protein OsGDI2.
OsGDI2. GDP dissociation inhibitor2 AAB69870.1 AF016896 Oryza sativa DESCRIPTION: GDP dissociation inhibitor protein OsGDII1.
OsGDI 1. GDP dissociation inhibitor I 528 277 Case S-50015A116/78/NAD Arabidopsis ORF Homologous sequence (SEQ ID NO) 529 CAA0673 1.1 Ai005 836 Cicer arietinum DESCRIPTION: GDP dissociation inhibitor. gdi AAB80717.l A.F012823 Nicotiana tabacum DESCRIPTION: inhibits dissociation of GDP from GTP binding proteins. GDP dissociation inhibitor. ODI AAB99756. 1 AF020272 Medicago sativa DESCRIPTION: malate dehydrogenase. cmdh AAB64290.I A.F007581 Zea mays DESCRIPTION: cytoplasmic malate dehydrogenase AAK2643 1.1 AF353203 Oryza sativa DESCRIPTION: cytoplasmic malate dehydrogenase.
oxidoreductase AAG 13573.1 AC037425 Oryza sativa DESCRIPTION: cytoplasmic malate dehydrogenase.
OSJNBaOO55P24.3 CAA65384. 1 X96539 Mesembryanthemum crystalinum DESCRIPTION: malate dehydrogenase. mdh CAB361618.l AJ251083 Beta vulgaris DESCRIPTION: putative malate dehydrogenase. putative cytosolic malate dehydrogenase. nrlI.
CAC12826.1 AJ299256 Nicotiana tabacum DESCRIPTION: malate dehydrogenase. md I 278 P %OE1-169S 1M -279- The present invention also provides the following promoters: Promoter lB_syn299 IG2_syn300 AClIIlsyn27I AC I2_syn272 ACl3_syn273 N1 AC20_syn278 00 AC22_syn28O 0 ~AC24_syn282 N ~AC26_syn284 AC3lsyn286 AC34_syfl288 AC38_syn290 AC40O-syn292 AC7-syn267 AC9_syn269 AF3_syn3l2 ARIO syn307 AR 13 syn309 AR-syl3OI AR2_syn3O2 AR5_syn303 AR6_syn3O4 AR8_syn3O5 ATU56929_Syn0O7 (AC32) PRISynO18 UBQ3-SynOI 6 Corresponding Sequences in Patent 289_N 479 N 534_N 535_-N 578_N 478_N 530 N531_N 579_N 56_N 385_N 386_N 492_N 550_Y 113_N 387 -N 388_-N 493_N551_N 332_N 389_N 390_N 391_N 494_N 552_N 153_N399_N400_N499_N 557_N 154_-N 403_-N404_N 501N _559_N 168_N407_N408_N503_N561_N 189_N 411_-N 412_-N505_N563_N 261 N415_-N 416 -N 507_N565_N 134 -N 277-N 419_-N 420_N513_N567_N 236_-N 307_-N423_-N 424_N51 ON5 69_N 327_-N 427_-N428_N51_-N 571_N 92_N 377 N 378_N 488_N 546_N 96_N 381 N 382_N 490_N 548_N 725_N 4_-N 352_-N 353_-N 524_N 542_N 47_-N 356_N 522_-N 544_-N 7_N 340_N 341_N 525_N 536_Y 8_N 342 N 343_N 526_N 537_Y 25_N 344_N 345_N 518_N 538_N 30_Y 346_N 347_N 520_N 539_N 34_N 348 N 349_N 521_N 540_N 265_N417_N418_N508_N566_N Expression Specificity Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Constitutive Fruitless Root specific Root specific Root specific Root specific Root specific Root specific Root specific Constitutive Inducible by SA, INA, BTH, pathogens Constitutive 698_N 703_N 583N These promoters comprise the sequences set out below: >lBsyn299 Internal TMRI Arabidopsis constitutive Contig L14844 Contig Length: 1270 bases ('test' 113-1 no restriction sites; base 823 T to
TACAAATCCAAAGAGATTCCAGATGAAGTAAAGAAGUTGTGCCTTATC
GATCCAAACGACAGAGATGTCGTTATACTTGGAACTCTGTGTAGTTCAGG
TTGCAGGATCCATCCTGTATTTGGGCGTGTGGATAACTTCTCCAAAGAT
GAAAAAACTAGTGAAAGGTAACAAGTGTTCTTCCATAGTAATATTGAC
AGACTATTTTGGGATTFIGGTGCCTTTTTTAAAATACGATTTAGTTGCAAGG
AAAAAGTGAAAACGGTTTCGTAACATTGCTGCTTCTTTTGTTTTGTCTCGA
CAGCTGCTGAGGTGCATACCTACTTAGAACCGTCCATAGATTCACTGAAGA
P XOPER\p msIl2689250 dm.22/ I 'M 00 -280- AAATAGCTGCGTTTCTGTATCCTGGATCACTTnAGAAACAAAACAAACATG
AGGACCATGCTTGAATGTGGTACGTATGTATTAGATTCCTTCCTTGATA
TGATTAAACCGGCTATTGTACCATTGGTATATGTTAGTCATATAATAGA
TATTCTCTTTATTTCATATCATAGCTTTAAAAAAATGTTCGGCTCATCG
CCACTCCTTTTGGGCCGCTCGTTGCTTTCATTTTTTTAAATTGCTTACCTT
00
AACAAATTCTTTTGATTGGTTCTCTCTCTGACTCTAGGCCGCAGAGG
AGTTCCGAATAATTCTCACTCAACTAACTTTTGATAATCACTTTCAT
N- TATTCTGATTTTTGAATTCCCTCTACTCTTGAACACGTTTACTTACTGA
GAAAAATTTAACCCTAAAAAGAAAACCACTCATTACAGCTAACATTT
AGGGGTGGACTATTGCGCAAAGCATTGATAGTGTTAATTGAAAGTCAG
ATATAGTATGCGTTACTACTAAAGTTTAACGGTTCAATTTTTTGATA
ACTGACAGTAAATAAAATTAATTTTTAAGATTAAAAGACGTTGTTAG
AAGTTGTTTAGAAATTGTGGGACACGTGTGGCACGTTGCTCCAGGG
CATATGCCAAGTCTGAGATACTCCAACGCACTGACTGACTGACCCCAT
AACCGGTGGTCAAACTCTTAACCTAACCACGGTTAAGATCTTAAAGCT
GAGATTTTCCCACATGTAATAATCTTGTTTATCTGTGAGATATTCGCGT
CCCCTTGGCCGGCTATAAATCGATAACCTCACCGATAAATCCTCTATTCA
CATCCACAACAAACCTCTTCTTCAGTCTGATAGAGATCTCACG
>1G2_syn30O internal TMRI Arabidopsis constitutive Contig AJ001397 Contig Length: 1116 bases ('test' I1G-2; base 872 A to C)
CCTCAGCAAATAAGAGGACGATAAGGATCGGTCTTCAGCTATAAACAG
AAAGAAAGTTGAGATTCGAAGACTCTTTATAAGTCATTGGATTTGA
ATAACAAATTAACAACACAACAAATTAACAACACATATACTACAATC
AGTTAAAAACCCCAATATAATATATGCATCGACTACTAAACGCGTCA
GACTGGTAAACATATGTAACTATCTCTGTTACATATTGAATGAATGTA
GTAATAGTAGATGCTAACATAAGCTCACAATTATTTGAATAATTAGC
TAAATAAAAATCATCTATAATGCGTGTAAGCTTGCATAAAAATACATT
TAACTTTTATTTAAAACTATTAAGTATCAACATCAATCGGAAAATGATT
CTTTTGAAGTTATTACAACTAGTTTATTAAAAAAATTGTTATCATCACC
ATTTTAATAATGCTATATATACTTAGTCTTTTATTTATTGTTATTGTAT
P XOPER~fls5 12699250 dM-72VI2A)5 00 -281
GCGAAAATGACTTGCAACTGAGTTGCTTACGGGCAAACCTGACCAAGATG
TGGGAAGTTCGAAACTGCAAATATGTATAATTCTTAATAAAAAAAAAAT
TATCCTACATTTCTTCATTTTTTTTTTAAAATACTAATATTTGCATACTTTGT
TGATTGAGTTTCTGAAAAATCATAATTGAGTTTTTAAATTAGTTGGTTGT
TGCATTTGACAACTTCCAATTTCTTTTAAATATATCACTTTTCATATATTCT
00 GTAGAGCTATAATTTTACAACAATAATTGAAATGTCGACCCAAAAATATA
ATTTAAAGGCATTTCGCTGATAAAAATCCAGTTAGATGTATTTGTATTA
c-I AGGGGAAACCAATTATATTATTGGTTAATATTTATTAGTCGATATTGGGTA
CATATGTATGTTCTTTTACGATTATGCCATCAAAAAATTTATTAGCCATTCG
AGAAACAAGGCATCTCTATTTTTGCTTCTTCTAATAGACTTCTTCGTCAC
TGATCTCCCACGACGATCTCCCAAACTCATTTCTCTACGTTCATCGATCTCT
CTCTTTCTCGTTTGCTCTACGAAAATCAGCCGTTTAAAC
>ACIIlsyn27l Internal TMRI Arabidopsis constitutive Contig AC007138 Contig Length: 1358 bases (ACI 11-2; base 233 A to no base; base 980 T to Y
TTAAGTGATGTTTGCAACTTTTAATGCAACATTTTTTCCAGCATATTTTAT
AATTGGTTGAAACAATTTAATTTAATTAAATTTGGTGTTTCTAACTTGT
ATATAAAAACCTTAAATGTCAATTGAAATGATAGAGAGAGACATTACTA
ATTATTGTGAAAAAGTATCACTATTTCTAAAGAATTGTTCTAGTAAAAAT
GGTATTAGTTAATTTTCAGACCATCATAAAAAGATGAIITAGATTAGTGA
AAAGAATAATCCTTCAAAAATACATATTTCGACACAAGTATACTTGGTAT
AAAATCTGTAAAAAAAAAATCAGAGCCATGACCAAATACAATATGTTAA
TTCATGTGACGTGAGATAATAAATTGATTTGATTCACTTTCCAATTGTGTT
ATAATTAACGCATTAAAAACACTAAAAAGCAAATAAATAAATGTAGCCG
ATAAGCCGATGGAAGTAAGAATTGAAGTCCAAAAGCAAAAACCTATAGA
CCGGTGGACAGTCAACAGTGTCATTnAATCCCTATAAATAGCTCACTCCCT
TGTCATCCACAAATCGTCCCCGTCTCGTCCTTCTTCGCTCGCTGTTCAGA
TTTTGCTTTGAGGCTTTAGGCTCCCCAGATCTCTAATCGCCGCAGGTTTCGC
TCTTCTTCTCCGTCTTATTGATTTCGAGTTTTTAGGCGATGCTTTACGGGT
TTGTTGTTAAATCTGAAACGAAATGAGATTTTTCTATGGGTTTCGATTCAG
ATTTGATAATATTCGAACCTTCTACGCCTGTTATATAATTAGATCTGCGA
P IOE4 31695 m212) 00 -282
AGTGTGTGACTATTGAAATGAGATTCTCAAGTTCTTAGGTTATATCGTTTGT
GATTTATACAGATTTAAAACGTATGTGGATCCGTTAATTTTCCAGTGCTGT
GTAGCAGATCTGCTTAATAGGTTTATCTTTTTTGCAAATGATTTTGATHTTC
GCANCGATCGTGTACTCTATGTAGTAGTAGTAGTATATGATTTGATAAATG
0 5 TAGTAGTAGTAGTATATGATCGTGTACTGAGCCATAAATGAGCCTTCCTCG
TTAATTATTGTCCATGAATTGTTAGTTAAGCTTGAAAGTTCCTTAAACGT
00
AATTAGATCCTTATCACTGACTGTTCCACTATGAATATCAGAAGAATCGAA
TCTCTTTGGATGAGATGCGTCTGTTTTTATGCTATTCCACAATGATTTGGAA
TCTTTCTTAGCTTTTTATGTCACTTGAGTGTGGAATCTTTTTTTTTGTTCTC
TTCCTTTCAATTGTAAAAAGTTTGTTATATGTGTATGATTTTTATGTGGTTG
CTGATTCAATTTTTCTTTTTG
>AC12-syn272 Internal TMRI Arabidopsis constitutive Contig AC007195.93 Coritig Length: 1301 bases (AC 12-5; base 503 T to C)
TGTGGAGATCAGJGCCTGATAAAGATAGCATTGCAATGATAATGTATGAT
GTGCAACGCATAAGACAACAATTGACATCAAGCACACCTCTTCTGGTGACT
GGAAATCAAACTAATAAGTTAGCTTATGAACTTGCACTAGAAACACTAGTT
TCAGAAATCAGCATAAGTATCGAAGAGAAAGCTCTAACATGTGACAAAAA
TTAAACGTGGAAkAGTACGTAAGCTGCAGGTATCATCTCTAATCACATTCTC
TAGACTCTAGCTACTATACATTAATTTTAATTTATCGTCGTGGAATGTTGAT
TATGTTTACGCCTAATGTTGTAATTTCATGGTTGATGGATATATATAGATGT
GGGTATTCCTTTTGCTATATGTGTGGAGTCGAATGGAAACAACGGCTAGGA
GCTGGTGGTTGCATTCATAGCAAAGCAGAGATTTATTTTATCATTATTTGTT
TTGCAGTCTTGTTTGGAGTGAACTTTTGTTTCTTTTTGATTGCTACTTTAATC
AATTGGGTTGTGAATTTATTCAAGTGATTTACCCAGAGACTTGTAAACGGG
ACATAAAAAGAAATAAAACCTTTCATCTATGTCTTATGATTGCATGAGT
AGCCCAAACATCTATGGTCTAGTGGTAGGAGAAGATTTAGGGAATAGTGA
AACTTGTAGATCCGAGTTCGATCCTCCCTGAAAACAAAAATCATATTTGTT
TTGAGAAGTCTCTCAGTTAGGCCTTGGGTCAATTGGTTTACCTGGTAGTTA
GAAATGCAGCCGGTCTGACTATCCCCTTTCATTAGTCGGAAAACATTTCAA
ATTCAGAACAGACAGTATGGTAGTCCTTCGGTGAGAAGTCCACTCTAAAAT
P %OPERj.,1'I2t.K25OdmC22'IM'5 00 -283
ATTTCGGTGCGTTTCTGCCGAGGCTGACCAGATTAGCCGGTAGGGTTTATC
AAAAAAAAAGAAAAAATGATTGCATGAGTACTTCTCAATTCTTCACGTTGT
CACAACAACTTGTTACATGCGACTAAACAAATTATATTGAATCCATATACA
~zJ- GATTTGCCAAATACTATTTCTATTTGGTCCCAATTAGTGATGTUTATATGGA 0 5 TTTAATAGCCCATTTAGTTATATGGGTCTGTTGTTAAAAAATAGCCCATGT N AGACCCGTTTATGGAAAAAGATAAATGGGCTTTAATTTCGACCCGGCCCA 00
AAATTACAACGTGTTCAACAACAACTCTATTATACAAACAGACTACGTCGT
TCTCTTCCACTCATCTGAAAACAAAATCCAATTCTCTCTCTCTCCCTCCAGA
TTCAAACGATCCGATCCAAAACT
>AClI3-syn273 Internal TMRI Arabidopsis constitutive Contig AF080120.1 1 Contig Length: 1368 bases (AC 13-3; base 328 insertion of T)
ACCCATTTGTCTGCCAACATCTCTTTTGGCTATATACTCATGAAACTTTAAA
AAATCTTCTTATTTGTATGTTCGAAACTCCCTGAAAGTTTCAGTCTTCTTAT
GTATGACAAGAATCGCGAGAGACTATGCAATGAACCTAATCAAATATAAC
TCTTCTCAAGAAATGATATGAAAAAGATTCATGAACATAAGAGTTGGTCCT
TGGAAAGCGACCTCTTCAAGTCTTCATTAATTAGACATTGATTCAGGTGCT
TAGGAGTTAGGACAATGTAAATTAATAAACAAAGGTGGTGCTTAAGGCGG
TCCATGACGTTGTAGAAAGTATTTTTTTTTCGTATAAGCCGACTATATACAT
ATGTGTTTTTCATTTACTTATCGCAAATAAGAAACACACACTAATCAACTA
TTTGTAAATTCAAATTCACCAAAATTATI7TATGTTATATGTTGAACCTACGA
AACTCATAGACACAGAATAAAACATAAGTGAAAAGACTGAATTAAACACT
TACTTATAAGTGAAAAGACTGAATTAAAATAACAAAGAATTATCAATAGT
ATTTTTAATAAAATTAAACATTTAAAAAATAAACTTATTTGAGGACGTAAC
CTAAAAATCTCTATATAGTTGTTnTTGACGAATATGAGTTTTATTATAAGAC
TAATTTTTCCAAGAGATAAATTTATAAAAAATATTAAATACGTAATATTTT
TTAACTCCAATAAAATATTGTAATTTCAAACCAAATATTTATTAATTAAAA
TGTGTAATGAGATACTTACATATCATCTAGACAAGTTGAGATTTCTTTAT
AGGGTTTTGTAAAAATTTGATGATTTTTAACAAGAAGAAATCCATAGGAAC
TAATAATAAAAAATACAATGCAATGATATTTAAAAAAACAACAACTGCAT
TGCAGTGAATTTCATCAAAATCCATTAAAACATTTCCAAACTCAAATAGAA
P 695 M212 00 -284-
ACAACTTCAAAACCTTAATCCAAAATGTTATAGATAGATATGCAATAGCTC
TTAGGCCTAGTACATAGCTAGATCTTGTAACTCGTGAAGGCAAATGATTGG
GACGTTGGTTCGGTTCTAGTGGTCGGGCTCAGCCTGGCGGAAAAAATTGTT
ATGGGTCTAAGGCCCATAAAGTGGCCCAGAAATAAACTCGTCGTATTTAC
ACACGTTGTCGTTTCTCTTATCTTCTAGAAAACTGTATCCCGTTTTTGTTCTT
c-i GTACTCTACACAAACAGACAACTTCAAATTACTCAACACCACGTCGTGAA 00
AATCCGATCTACGTCTCTGTCTCTCTCCAATCTCTCTGCGCCACAGAATTGT
N- GCGATTTACGAAAATCTCTGAAACCTCCGATCGTTAACGGC >AC2O-syn278 Internal TMRI Arabidopsis constitutive Contig AL035656 Contig Length: 1399 bases (AC2O-2; base 244 A to G)
CTTGGAAGCATTCAAGAGAGTCGTGGAGAGTGTGGCTCAGCGTCTCAATG
AACAGCCCGTGATCGTTGCTCACAGCGAAAACACCTTTGATGGGAGCGGT
ATCAGGAGGCTCTTGTCCAATAAATTCGAATTCGATAAGGTAAACTACCAT
ACATATATATGTTATCTAGCTTTTATGCTAAAGGAAAACTTTTTAAATGAT
GGTAACGAGTGATGATGATCCGGAACGGTTTGGTCGCAGGCGCTAAACGT
TGCCATGGAGACGATTCCAAAAGACCGTCAGGGTAAGGTGTCTAAAGGAT
ATCTACGAGCTGTGCTTGACACTGTTGCACCATCGGCCACTTTACCACCAA
TAGGCGCTGTGTCCCAGGTAAATAATGCCCCGTCTAAATTATTTTGTCTTTT
AAATTGTTTATTTTGCCTTTGAATTTACATGTTACAATATTTGTTAAACAA
ATGAAACCAGAATTAGTGTTTTAATCAAAAATTATTAGTGAATTTTTATTTT
TATTTTTTGAACGGCATTGATTAGTTAAGTTTGTTTTTGTTTATAAGATGGA
TAATATGATAATGGAAGCGTTGAAGATGGTGAATGGAGATGATGGAAATG
TGGTGAAGGAAGAAGAGTTTAAGAAAACAATGGCAGAGATATTGGGGAG
TATAATGTTGCAGCTCGAGGGTAGTCCCATATCGGTTTCCTCTAACTCGGT
GGTTCACGAGCCGCTCACCTCGGCTACCTTTCTGCCGTCAACTTCGACTGA
TACAGAGGAGCCTTCAAACTAATCATAGAAGGGAATAAGCAGCACTAGCA
GCAACAAATGTTATATGGTTTTGACTTTTGAGTGTTTACCCCCAAAAGTTTT
AGATTAATGAGGAAAACCGTCTTTACTTTCAGATGTATAAAATTGAAAGTT
TGGGGTTTCCTCTTGTTGGTGTGGTGATTCTACTCATGCCTTTTTTTTTTT
TTCTAATGACCATGGGATGCAATGTTTACTCTGTTTTTAATTTCGTTAAAA
P %OPERpA126K9Z5()d.-Z/ I VUS 00 -285-
TTTGTTTACGTTTATGATGCTTGAATGGCTATGATGAAACATTTGAGTTATC
TTTAAAAGTGTGAAATAAATATTCTGAAGTTAATTGAAGAATTTAAT
TGATTACAAGAGCTTGGCTAAAACTACAAGGAGACCAGATTAGAAA
ACTTAGCTAAATTTAATTAATTACGGTCATTAGCACAAAAAATTTG
TTTTATTATATTATTATTGGTAAGTGGAAACACAAAAGAGGACCAGT
N ~CCAAAAACGAATAAACTGTATCTCTCATTCGCCGGAGTTTCCAGCCTT 00
TTTCCGATTCTCGGATTTTTCCTGGGAATCAAACGCATCGCCGAATG
N AAGAGAGGGATAAGGTACCCAG >AC225sy28O Internal TMRI Arabidopsis constitutive Contig AL049608. 184 Contig Length: 1283 bases (AC22-1; base 52 insertion of A, base 636
TCACCAGAAAAACAAAAACTAGAAACCAGGAAAAACTTAGGAATT
AGAGTTAAGCAAAGTTAATCAACGTCATTAAGTTATTATATATACAA
TCTATATAATCTCTGTTTCGTCATTGTACATTTTGGTGACTGGAAGTTT
TCACGTGGTAAACAAGAACGTATTCGCCAACCTAAAGACTCAATCCT
GTCTACAAATTAAATACATTATCACGAAAAAAGCTTTATGTATTTAC
AACTACTTTATTCTCTCAAAACTATTGCATTGGTGTGCAAATAGTCC
GAGATGATATCATCAATCTTAATATCAACTTCAACTTTTAAATAGA
ACGTAAATTAACACG.GTCGTTCTAGCTTTGTAGCATCGAATGTAA
TGTCAAAAAAATGAGCGAAATAATTTATTCTTAATTATCTTTGCAAT
TTAAAAACCTTTAAGCATATATAATTCAACTAAAAGAATTTTAACTTG
TGACTATCTAGACTTGAAGCAAAAAGTCAAAAATGAGTAGACAACC
ATTCCTGCTGTTGATCCATAACTCAACAAATATGTGTTTAAATTTT
TTTGGTCAAACAATTCTTTCAGTTGTAAGCTAGAATATTACAAGTGT
AGATTAAAGAAATAGTCCCAAATAGCAAGCAACAAAACTAAAAATA
ACAAACAAATTCTAAAATAGAGACACAAACTTAACAAAGCTGTCA
GAAACCTCAATGAATTAATACTCGATATACTAATACCTTAAAATTTT
CTAGTTCTAAATTAATAATTTAACCTAAAAATATCACTTCTATATAT
AATTACGATAATTTAATGAAATTAGTAAACCATTAATCTCATTCA
ATTTATAGAGGTTTTACTAAATTGTAGAAACAACTAATTCGAGTAAfC
TGAATTAATAAATTTTAGAAATGTGAATTAACGAATACTTTTGTCGG
P %OElm 292SOd -VI2) 00 -286-
AATGGTTAAAAAAAGTTACTTATCAAGACAAGTATGAAGTATCACGTGAT
TAAACGTTTAATGACACCAACCTAATGACAATTTGTTTGATTTATTTGTCAC
CTAACTAGAGACTCTCTCACAGTCAACGCAGCTTATGTGTCATAGTAAGA
TTTTTGTCTACTATAGTAGAAAGACGAATTTATAACCCCTTTAGGTTTTTTC
0 5 TAACACACGCCTCTAATCTCCGCGCACACACACACACCCTCACGAAGAAG N AAGAAGACGA 00 >AC24-syn282 Internal TMRI Arabidopsis constitutive Contig AB017643 Contig Length: 1367 bases (AC24-3; base587 T to C)
TCGTGAACCCATCCATATTCTTTGCTTGACCGCTTCCATAAACAATCCACCC
CGAAGCTTTTACATCGTGATGTCTTTGTAAATTTAGGAAAACACAGACACA
GTTGGTCAATGATAATCATTACAGATTCTAAAAGAATTTGGTAGCCACTAG
TCAAAGAACTTAAAAGGCAAGATTTATCGGGACATTAGGACAAGGTAAAT
GAATGCATTATAAGAAAATAAAAAAACCCTTTAACATTTTGTUTAATAGAA
AAGAAGTAGAGGTTGATTAGTTATTGTTAAAGTAAAATGTGTTGGGCTTGT
CTTTTCCTCAAATGTCGCGAAGCTCAATGGTATAAGCGAAAGAGAAAGCA
TAGCATGATGGGCCATATATAAATAAAAACTCGAGTATGCTACAAAAACA
AGGTTTFCAATGCACTCATATCTCGTTTAACATTCTAATTTTATTCTTTTC
TGTGTCCCCCATTGGCTTGGCAATAAAGTTGAATTTGTATTGATTTATATCT
CATTCTCAGTACGAGCTAAAATTCTTAATTAAAATGAAAAATATGCTATAA
ACAATTTAAATGATTGCAAGTCCCACCTTGAACAACATCAGTTAATATT
TCCGTAGCATGTTGCATATAGCATAATTTTGGTCTTAAGTAACACCACCAC
CTCACACGTACGTACGACCAATTATGCATGTCTCAAATCCCTCCATGAIT
CTATATGGAAGACCAAGGmTCAAGATTAGCAATTTTAACGGAT-AAAACC GGTTCAAGATTTTATTTTTTATTnATTTTTGCTAAATCCTACAATTTGGTCTC
ATGACAAAAAAAATATAAAAACATAGAAACAAATAACAATGAATCTATCG
ACATCAACAAAAGCAATTAAACTTTCCGAATCAATGAAGCGATAACCGGT
AGTATCTTCGAGACTTCATATACGATCAAAATGCTAAAGTAACTATTCATA
ATCTTTTATTAATAATGAATTATCAAAGCTTCTATAATTCATACGACAAAG
ACAAAGGAATAGCAACAAGTTATGTTCATTTCGCTGTCGTTTAATTCAACA
ATGAAACGTTAACGAAACGATTUGTCGAGATTTTTAAACGTCTTTCAG
P %OPEft'jnU~I26H92S0 dO2V I 21115 00 -287-
GTTCTACGGCTAAAATTCCTAACATTTCATCACCTGTCGTTATCGTTAATAT
CGTCCTTGTCAGCAGAAAAAAATTGAAATCAGGATAAGTTGATAACTTCTA
TGAAAAAAACATTATCTTACAAAAATCCAAATACTCCGACTTAACCGGGTC
GGATCCTGGTGAGTACTAGTATCTATCTCATTACAATTCATATCCTTCCTTC
AACATTCGATCATCACGAAGCCAAAGAACAATTTCTCC
00 >AC26-syn284 Internal TMRI Arabidopsis constitutive Contig AC006438.21 Contig Length: 1343 bases (AC26-1; base 150 A to G)
GTGAGGTCATATTCAGGACCGATCCAACAATATTGAGGGT"'TTACTCCAAG
TAAAATTTTAGTTTTATTTTTAATTATCATAAACGACATAAATATAATATGG
AAAGATCACAAATACTGATTAAAAACTAAAATCATCAAAACGAAAAGGAA
AAAAGAAAAAATTGGGTTCAACTCTCATGAGTTATTAAACATTTTAGGTTT
TAGGCTTAAATCTTTAAAAAAAATCAGAACTGAAAAACGAAAAATTCTAA
TTTTATTTTGGACTCTGATTCATAGCTTATGTCGCTTATGTAGTTATGCTAG
GGATGAATCTGTATTTCGTTACCGTAATGAGAGTTCGATACTCTCTTACUTG
TTACGATTCTGGAGCATGTTACATTTITTCTTTCCGTCAACAACAACTTTA
ATATGGTAAAACAAAATTTATTTTTATTTGGCTGGTCCTACTCAAGACAAA
TCTTCTGCCGACATCACATAATCATATTAAAAACCATAACTTCTGCCACTC
TGTTTTTfTTTTTTTTTGTAACCATTAACTGATTGGATTTTGATCCATCTCAT
CTGATTTTTTAGCTCAACAATTTACTTGCACATTTCTATTGGTTTTATTTA
TACTTAGTTACATATATGATTATCGAACTAGTATCTCTTTATAATTAAGTAT
TTTTCTATTTTTTTTTAATTTAGATTTTTGTGAATTCATTTACAGTAGAAAAC
TGTAAAACCATATGGTCTAATTATAGAATGAAAACTTCAACGAATCCATAC
AACTTATTGGCTAAATATAATAAATCTGCTTGAAGCATATTGTATTATTTA
GTTGGATTTGACGATCTCTGACTTTAATGTATACCGACATACCCTATGATTT
AGATGTTGATTTTTCCCATTCTTAATATATCCATGTTAAGAGATTCCACCAT
AACATATCTAATTATTTGCATTGTAATAAATATTATCATTAAAAAAAAATA
CAACTGGACAGCTGGCTCGTCCCATTGTTTCTTACGTCCACCAATTACAfT
GTTAAAGCAAACTTATTAGAACGTTCATGTGTGAGAAGTTGGTGTCGACAT
GTGTCTAAGGTCTATGTCAGAAATCGGATAGCTTATTAAGTAAACTATAC
TATATCATTGTTAATATAGATAAAATATCTAGTTCGTCCAAATTAAACTAT
P m"695OK2/') N -288-
TTTCATAACTGCCACGTGGCGTAAACGTATCCATCGAGTCACTTGTAATAT
CTTTATAACCAAAGTCTTCCAACACATTCATCACCATCTATCTACTCTTTAC
TCTCTTCTCTTCTCACATCAATTATTCATAGTTCTCTCTTCTCCGGCAAGAA
AA
0 C1 >AC3 I syn286 Internal TMRI Arabidopsis constitutive Contig ATU46665 Contig 00 Length: 1296 bases (AC3I-3; base 556 A to G) c-I TCGGAATCTGCTGGTAATCTACGCAAAGTATACTTGTAATCAGCGACAGTG
AGAGTGATCTACAAGTAGAGAATAAGAGATTCAATGAATGAAUTGGAATG
AGGAAATGGTGAAATCAATAGAGAGATAAGGAAGATACGAACGGAGTAG
ATAGCGCGAGAAGAACGGACGACGCCTTCTACAGCCGTCGCTATTTTATTG
GAAGGTGAGTCTCGGAAGATGGACACGGCGGTGGCGCTGCCAGTGACGGC
GGTTAGAGCTAGGCCGGCGGTGACTGTGAAAGCAAAGATCGGAGACTTGG
ATCTCCCGAGAATTTTGAATTUGCGGAGAATCTCCATTTTTGTGGATTCT
GGGTTTCGTATTATTTTTTTCGTAGTAACGAAGAAGAGGACGGAGAAGCTA
CACATTTTCTAACTTACTTGCAAGTCGGGTCGGATCGGATTGATGGACAAT
CTAATGGGCCAGGATCCGGTTAGACTAATCGATGTGATTTTAATGGGCTAA
GTAAGCTGGGCTTGGCAAATAGCCAAATATAAAAGGTTAATTTAGTCAAG
AAAATCTCTCAATTTAAAATTAACTGACGTAAATCCCCCTTCAGTATCAAT
ACTGTAAAAATTGGATAGACACAGTAAAACGCAGTGTTTTACAGAATCTCT
TTTAATCGATTTGACATCACACAAACTTCAGAGAATCTCATTTTGATAAAT
TAAAGTTTTTTTTCCACTTTGTGAATTTTAAAGCCTAGGTAAATTAGTGCAT
ATATGTAATTTAAGTGTACATACTGTATCTCTCTGCAACGAATACAACCTT
CTTTTTTACCCACTACCACCTGTTTTCGCTAGGCTTGCTGGACTCAAATAAT
GTATTTTTATACGGCAAAATTATTCATTAAATTTCAACTTTACGTTATATAC
AACATTTTTTACAAAAAATTACTAACATATATGGAACCTCAAACCTCTTAA
TGTAGAAATATTAATAAATTTTTATTTAACCATTGGACTAAGGAGCTTCCA
CAATCTACTCTAATCTAATAAAGTGTATATCTCATGGGTATCAATTTTTTT
TTCAATAGGTAAGAAATCAAATCGTCTACATATCTTACGATCTTGTGAT
ATTTTACGAGCGAATATCGTCGACATAATATAAAACTCACAAAAAATAAA
ATAATAATGATACTCCATATAAAGGAAAAAGACAGCAAATATGTAGGGTC
P QP~p 129950dm22/1I 00 -289-
AATATAAACGCAGCCTCGTCGTCTCTTCATATATTCGTCTCTTTGTGTTCTT
CTTCCTCCTCAGATTCTCTTTCA
>AC34-syn288 Internal TMRI Arabidopsis constitutive Contig Z97340 Contig Length: 13 59 bases (AC34-3; basel163 T to C) N- GGATCGAACACTCTCTCGTACGTCAAGGAAAGCACTGTGATGCCAGTGAG 00 0 GATGACCTGGCTCGCGACGGAAAGGTTGCTGAGCCGAGTCGGTACGAGAA c-i ACGAGTCCGCATAGAAACAAGAGAAACGCGACGAATAGGAGAGAAGAAA
ACGAACTCGATCGCGAATCCGATCTCAACTCCATGACTGAAAAAAAACAA
CCGGAGATTTCGCTCACCTCCCGATTTTGGACTGGACTGGCGAGAGTCGCT
ACAAGTCGCTTACGGCGAGGGAGCAGAAATGGGAAAATTAAGGCTAATTA
CTAATTTACCCTCAAGTTTTATTATTAAGGTGACCTGACCTGCTCTGTCTAT
ATGTGATATTGTGACCTGCTTTGCCTATATGGCTATATGTGATACCTATAAT
CACAAGGATATTTCAGGTGGAGAATCAGAGAAAGAAATTGAAGCTGAATA
AGACACTATATGGGAGAGATTGAAAGGAAGCTGTTGGGCCATTTTGGTGT
AGCGGGTCGCAAGTCGAGCGTGAGACTTATTGCTGTGCCATTGCAGGAAT
GCAAACAGAGGAAAGATTTCACAAATGGGAAACGGATACATGCTCAGATG
GTTGTTTTGTTGTAGGAAATGCCTTTCAATGAGTATGTTAAACGCTAGCTG
TCCTGTTHAATGGACCGGTGTATGTCATCTTGTCTTGCACTGTGTGAGCACA
ACAACTTGCAATGTTTCCATTGATGCTGTAGCAGTCTCTCACATTAAGCTCT
GGTTTGGATGGCTATGAACAAGTTGATTGGTAGATAAGTTAAAATGTTGTG
ATTTGAATCTGGAATGAATAGAAAGATGTGATTGGTACTGATGTAAATTCA
ATGCTTTFAGAGAATGTATACAGGCAATAATATACCAATCATTATGTTATT
GCTGACTAAGAGCCACTCCTCTTTGCTGTTGCAATTCGGCAATCGTTCTAG
ATATGGTTTCC
ATTTCAAATCATGATATGCATTGACTTTTTCCATGTGGCGT
TCGGAAATCTTTCATCTATACTACGTCTACGTTGCAAGTTTTGCAAAATGTT
TAAATTAGTAGAATCTCACGTATATAAAAACTTTAGTCGCCAAATTGAAAA
TGGAGAATGAATGGTAAACTACTAGTTTACCCTCATATTTTAGCTGAAAAA
TATCGTCACAGCTGACGAAGAAATTAGAAACAACAAGCAACGTGTCACTT
CTCATGTCGTCGTTTTCCCCAAGAAATATCCAAACTAACACCCAATTACCT
AATGCCACGTGTTTACTCACACTCCTTTAAACAAGCTCGTAACTGTTTCATC
P IOPER'jn-n'I 2689250)dO.2IIM 00 190
TTCTTGTCCCCAAAGTCTCCTCTTCCTTATCTCTTGG
>AC38-syfl290 Internal TMRI Arabidopsis constitutive Contig WT755 Contig Length: 13 8 5 bases (AC3 8-3; No errors)
AAGATTTTCCGCTACGGGAATTTGAACCTGAAAATGCTGATTTAAA
AATTTAGCTAATGTGCTACATGAGATGTTTTTTTTGCTA
GTGGTA
00
AATTGGATATATACATCATTCAATTTATT[TTCTAATCTAGATCTC
cI CTAGGACAAATATAGGTACTGAATTATTAAAGAACATATTTTGAG
TATAAGTGAGTTTTTATATAATTTTGATGATTTTAGGTAAGTTAGGT
TACTGTAAAGTCTTTFTCAAATTCTATCTAAAACTATGAGATAGTCG
TATTTTTAACTAAAGAAGTCTTTTCAAATCCTTTCAAATCCTTAATA
TAAGAATCAAATCCACTAACTATTTTCAGTAACAGTAAAAAGTGTT
TAAATTTTTAATTTAATAACACTCAATTTCATTTAAAAATTA
ACAA
ATTAAATAACATAAAATTTATAAATTTTACATAAATTATTGTATC
ACCGCTCTAAATATACACAATGTAAAAAGTCTGTTTAAGATCAATTG
TTTAACTTAAATCTAAAGGCCCATAATACCGTCCTGTACATCTTGT
TCTCAAGTTGTAATACTGTAATACCCGTTGGGCCCAGTGGCCATTA
GATTTCAAATAACAGATCTCAACACTAGCATGGCTACACAGGCGT
CAATGCATCAGTCATATCTTCAGCATCCAACACTTGTCAACCTATG
ATCTCTTAACTCTACGCCTCGAAAACAGTTTTTATTTATTTATATCT
TCTCATTGTATCTTCATCAGTCTCTTCTTATTCCATTTTTTCAACATG
AAAATTCGAATCAGATCTTCTCTTCAATCGAAAAAAAAGAGTAT
TCTCTCTCTCTAATCATCGTTCGTTTCGTAGTTTCTTCTTCTACTAGC
TGATCTTTGATTGTATGTTTCTGGAGATCTCGATCTCATCGATCCGT
TTATCACTGATTCAGTGTGTTTGATATCTAAATCCGATTTGTTA
AG
TTAAAATTTAGGTTTCGGTTTTGTTTCTGCTTTTGAACGATTTGCAA
TTCGTTATCCGTGAAGAACATAGACGAGTATGTAGATCTTACTOAT
GCGTTGAAGAATTTTCTCTAGATTCGTCACCTATGAAGAAGATATT
TTCTTAATCTAGATGATTAGGTTATTGTTTCGACTCATTTGTTAGCA
TTTCTCTATGTTCTTAATCGGTGAAGAAATGTATCAATGTGGAGTT
GGTTCTGATTTTGTAGGATTTGCTCTAGATTGTTGAATCGAAG
P OPERjn,,126,992S dmc.22'IMSO 00 >AC4O-syn292 Internal TMRI Arabidopsis constitutive Contig Z 15157.1 Contig Length: 1356 bases (AC4O- 1; No errors)
CGCAACGATAGGTGCCTATGGAAACTGAATCAACAGATTTGGTTTTGATAT
CATATATCATCAGCTGTCTACTATTTGATCTAGGACAACACAAAAGCTTAT
c-i TCTTCTCCAAAATGGCTACTGGTAATGATTGCGTAACACTACGATTCACTA 00
TCGAATATATTTGTTCCCAGGTCTTGTTCTCTGAATTGAACGACCATATTAT
N- CATTTGTTGGAGAGGTTTACTAACCGATAAGCACAAACGGTTATTCAGGCT
GCGTGTGATAATGTTTCTATGATCTGCTTCCGCAAAAGGAGCTTTAGAGAT
AACTTGAAAAGTTTCGGTGTGGAGATCTAACGCTAAAACTTTAATTTCTTT
CTTCCCGGTTAACCAATAAAGCGATCCATCTACATACAGAGCATGCCCCCG
AGACGAGGAAGTATTAATCCGATAAGGAGAAGAAGGATTGATATACCTCC
AAGTGTTGGTGCTAAAATCAAAAACTTCACATGTAGTAGCGTTTTCTAGGC
CAAGTTCGGAAGAGTTATACATAACCAAACCGGTTTGTATATGCCACTGAT
TTTGTCTTTGCCAAATCCAAATTTAACGTGACTAAAATAACTTGGCTGCTC
AAGACAGATTTGTTGCAACCTGGAAACAGGGAAACGTCGATGCCATCGAG
TGGCGGGATTATAAACAATGTTGTTTAAGGTTTGGTAATCAAAGAGGCAA
ACAAGACCGTCACAACTATTGTGGAAAAGTTGGTAAATATGATATCGTCT
GATGATATCAACAkACACGTTAGTTTTAAGTGGAGGAGAATCAGCAGTAAC
ATGATGGGGCAACACCAACGTACTGGGTACTTCAGACACCAATACAAGAT
TTAGATCTTTCCCGCCAGCTGAGCAGATCAACTGTTTCGCCTGGAAATATT
GAGATTCGATTGTCAACTTCCATTGTTTGCAAGCAGACTTGAATCTGAGCA
GAGATTTCACCGGAACTCTCTCAAGAATATCCTCAACGGTGTCGTGGGGAA
GCAATTGCATTATTTCTCTGTCTATTGAGAGGATTTTGTTCTGAGTGATGGA
TAACATGAAAGATATGCTTATTTGTATCAATTCAATCCAATGTTGATTTTUT
CCTTGAGGAGGAAGATAAAAAAAAAAAAACGTATATACAATCGATGGGCC
CTAACCCTATCCCTAACAAATCTCTTTAATATGTAATGCGCTTTAATAGTTA
AAGCCCATTAGTTAAAAACCCAGAGCTATATGTTGACCTAGCAAATTTCG
GATCTATAAATTGAAGCCATTTTCTAGGTCATTAGTTTTTTCGTCGAGCAGC
CGCGCTTTTTGGCCGAGGAAGGATAAAGAGA
P 'OPER'jn.'I 26SV25U doc22'I 2AM 00 -292- >AC7_syn267 Internal TMRI Arabidopsis constitutive Contig AC006234 Contig Length: 1343 base (AC7- 1; base 426 A to G; base 606 A to G; base 9
ATGTGTGTAGCGAAAACCAATGACAACGTTAATTGACTCATACACTGCAC
AATGTTGAAAGTGTTTCAAAGTGAGATATAGAGAGTCACAAGAAGAGTAC
0 5 GAAAAGAATCAAAGTAAAACTCCGAAAAAAGTCTTTTGAATGCAAGAGAT
GTGAAAAATCTAGAGATGTGGTTGTGAACTTTGATTCCCCTATTGTGCGTT
00
GGTTTCAGGATGGACATGGTATACCCAACACCCCTCAAGGTTTGAAGAGG
GTTTTGATCGTCAGAACAAGCTGCGAGAAAGCCGATGUTTAATATGAAAC
ATTAGCTCCTAAACGAAAGAGACTAAATACTGTGAAGAAAGTCACTAAGT
TTATTGAAGACAATGAACAATTCAAAGACATGAAGATTGAAGAGGTTTCT
TTTACCGCACCCAAACAGCTGAAAGGGAAAAAGTTCTAAATAATGATGTT
ATAGTTGTTGATATTAAAACTTGAAAAATCAACAAGTTAAGGAAACTAAA
GAGACAGAATAAACCTTAACTTGTTGATCTTTTCAAGTTTTGTTATCGGTA
ACTACAACATCCTTACTTATATTTTTTTCTTTTCAGCCGTTTGGGTGCGACA
AGAGAAACCTCTTCAATCTTCATGTCTTTAAATTGIITATTGTCTTCAATAA
ACTTAGCAACTTCCTTCACAGTCTTTAGTCTCTTTCGTTTAGGAGATACTGT
TTCATAATAAACATCGGCTTTCTCGTAGCTTGTTCTGACGATTAAAACCTTT
TATAAACTTTGAAGGGTTTTGGGTATTACCATGCCCATCCCGAAACCAACG
CACAGTATGGAATCAAAGTTCAAACACATCTCCAGGTTCTTCACATCTCTT
GCATTCAAAAGACTTTTTCGGTGTTTTACTTCGAATCTCTTTGCATTGGATC
TTAATAATGTTTGAGCCGACCATGTTCTACATATGATGAACAAAACTCAAG
CACTAGCGATTATTAAGGCTTTTTTTTTATTTCTATCGATCTTTTTTTTTTAC
CTATTGATAATGTTGATGTTGAAATACTCAAACATGGAAGTGGAATTCAAA
AATACAACTAAAGATCTGTTTTCTTAGTACATACAGAATTGAGAAACAGA
GAGATGAAAAATGCCAAGAGTGTGAACAAAAGTCCACAAAACAAAAGCC
TCTGACGGAGAAGGAGGCTTTTAGGTGTTACCCAAACAAAACGCACACAA
TACGGCGTCGTTTAGAATCAGAAAAGACATTTCTTTATGGTCACTTGATTC
TCTCTTCCTTCATCAATCAATCTCGTCTCCTGGAAAACATTAGGGAGCCTCT
CAGATCCTCAAGAAAACCCTAA
P x0PERjnmsI261925U dO.22/12/flS 00 -293- >AC9_syn269 Internal TMRI Arabidopsis constitutive Contig AC006403 Contig Length: 1319 bases (AC9- 1; base 205 T to A)
TTTGTCACCAAAATCAGACAGGCAAAGCTGGCTCAAGCATCGCTTAAATCC
CTGTAAAACGCAACTATGTAATTAATATTGAGATATACTTGTTGCTTTCTG
0 5 ACTCTGATTTCATTCACTCGGCAGCATTCTCGTGCTCTCGGCTGCTGTTGCC N AAATCTTATGGTATCTTTCTCAAAAAGCTCATAGTACCGTTGTGTCTCCAA 00
CAGACTTTCATTGATGTAAGTCTTATAGGTACTACTAAGATCCATAATATG
TAAGGCCTCACTTGCTTCTTTCCCATCATACCATATGGCTCTTTCTCTTTTTC
CACCTCCCGGTACTGAATAACAGCATGTTGCTTGCTACAAAATGTGTGATT
AGTAGGAATGTCGGATCTTTCTCTCACGTCCAAAGAGGTAGCAAATTTGGT
AATGAATGCAAAGTGTCTCTTTCAGTGGTTCACCATCCTTAAAATGATAAA
GTCTCCATCTTTCTTTGGGTTTTCTATTATCTACGGGATTATTGAAGAGGAG
TGTGATACCTTTGTATATGTTGGTTTCTTCAGCAAGTTTCCTCGATAGCTCA
AAACATCGCACACCATGTTTTTCAATCATCAAACATGAATGAAGTAGCAGC
TATCGAGATTCTCTCTCTCGTCCGGTACTTAGCCAACACCCTTCCTCCGAAT
CTCTGAATGTCAGATTGATGTCTCCTGGCACTTGGAGGATAATCTAAGCAG
ACATGGATGGGTCCTAATCGATCAAGATAGTATCTTCCATATAGGTCTCAA
GAGTTTGAGAAGATGTCTTTCACCGTTTCATGCGGAAGTTGACTCCTTACT
TTAAGCAATTGAATGCATGATCGTAATTAGTGATATATTGGAGTTHTCGC
TTCCGGTTACTCTGATATGATATCTTTCCTCGACAACTATAACGAATGACC
AATATTTGTAATAGAGATAGTCTATTTTCGATCTCTCATTTGTTTCTTTCTT
TTTTAACATTACATTTTTTCATAGAATTCTAATACTCACAGATTGmTAATG
ATTTTTTCTTACAAAAAGTATCATTCAGATAATTTAATAAAAATGGTATCG
CAGTGCCTTTATTTACCTTTAGGAGTAAGTTTTCTTTCTTCCGATATCCTAA
ATTGTTCGACACGTGTCAATCACGAAACCACAACCAAAAAACCTTGTCGTC
TTCTCCAATCATAAAAAAAAAAAAAAAACAGTGTCCCAATTTGATCAAAC
AAAACAAATTCATAAATTCGGAGAAGAGAACGAAAAATCTTCTTGTTGGC
AAATCTCCGGCGAGATCATCTTTCTTATTTTGTTCC
>AF3_syn3l2 Internal TMRI Arabidopsis 'fruitless' NO EXPRESSION [N FRUIT ON GENECHIP CONTIG Z97336. 167_SAT LENGTH 1156 P IOPERMjn 1I26%)92S0 dmA.22/1 2
I'J
00 194
GCGAGTAAGACTTATTTGAAACATCGTCAAATTTACTTCTTTTGGTGTATAT
TTCTCATTATATGGCGTATATATCTGTTTATGTAAGAAATTGTTTCCAAAAA
TTACTGTATACTGACTTTGTAATCTTGTTTTGATATCAATGAATTTATAAGG
AAAAAAATAAAATAAAAATATAAAGTATGATGTACATGTAAAAAAAGTTG
0 5 TTTCAAGCGTAATTGTTTTTTGGCTAGAGAATGAATATACAGCAACAGTAA
ACTAATAAACTTGCGATGAACTAAAATTTCTGGTATTCCTACAATCAATGA
00
ATCACTAATTTATCTATAAGTTTTAGCTATATCCGCTTAAACCCCGCCTCAA
CTTGCTCTCTGGTCTGGGTATAGTTGGGCTACAACAGTGAAACCGTAATTA
GGAAAGAAATGATAAAAACCCAATCCAGAAGCTTACTGCAAGATAAAGA
GAAAGATCATGAAGAGGTAGGAGTGATTCATATAACAAACAGGGTCACGT
TGTCACTTTCTCCCAGAAAAATACAAATTTAGACTAACTATATAAGGAGAC
GACTTCAGAGTCTTCTAATGGGTTAGTATAACTCGGGTCATCTTTTAATCTC
TGGCTTTAAAGACATGGTAAGATTCCATATATATGAAAACTCTGTGTGTGG
TGGATTGCTTTTTTCATTTAAGGCAAAGATAGGTTTTAAGGCAGAAGACAA
GAACGACCTTTGGCTTATTTATAGGAGACCACCACHTTCACTTGAGTC
GAG
ACAGTAACGACATTTAGAATTTGCATTACTCATCTTGTCACTTTCTCCCAGG
AAAAAAAAATACAAATTTAGACCAACTATATTAGGAGACGACTTCAAAGT
CTTCTAATGAGTTAGTAACTGGGGTCATCTTTTAATCGCCGGCTTUCAAGA
CATGTACAATTTCATATGAAAACTCTGTGTGTGGTGGATTGCATCCAAGAC
AGTTTTAAGACAGAAGATAAGAACGGCCTTTGCTTATTTATAGGAGACCAC
CACTCCTCTCGATAACCATGACTCGAGACATTAACGACTTTTAAAACTAAG
GGACGAACCTTAAGCAAAAGCTCTTGCATACTCAAATTCTTCTGCCACTT
GGTAAGTCTTTTTCTCT
>ARlOsyn3O7 Internal TMRL Arabidopsis root specific Contig AC001645 -Contig Length: 1331 bases (ARIO-2; No Errors)
CCAATACATTCGAACACGTGATTGTTCGTTAATTTTCTTGATTCTGTAAGAG
AAACAAAAAATATAGATGTCCAACTTTTTTTTTCGGGTGGGAATATAGACG
TCCAGCTTAGCTACGTACTGAATAAIICAAAGTTCCAAACTAGTATATATT
AATACAATTGACGATAAGGTCATAAGGATCGATGGAIICCAACGATTCGA
TACAAGTATTAATGAAATAAGATAACACGATTGTGACAGCAAACTCTATA
P %OE~mlZS23 m'/2) 00 CK12 L7
TTGATATTTCTATTTTTTAATTAGCCATGCGTTGCACGATCAATTTACAAAA
TAATAAAAGAAAATGATCGATCAAAGAGCATTCCATTGAAATTTATC
ATCCTGTAATCACATAATTTTGGGCCCAATCCTATTTTTCAAATGACT
CTATTACATAGTCACATAGAACATCCTAAAATAGGGTTAAAATGTACTT
ATCTATTTGCAATTTTGATATTTTCCTUTCTGAAAAAGATTAGTATAGC
00
AATTATCTTTTAGATAAAAGATCTTTTGTTCTGACTATACATTAATTT
TAAAAAAAAAACTTAACAGATATATTTGCAAATACAAAATGATGAAT
N-
AAAAGGGATACCATAATCTAAAATCTGACAAAGAAAATATACAAAT
AATTACGATACTTAGAAAAAGAACTATATATTTTTGGGTAGGGAGTA
AAACAAATTACCGATTTGCTGACTATATGAGCAATTATTACATACTTA
TTATTTGTACAACAATTATTACACATACTTGTGTGGACCAACATGATT
TTTATATTGGCCATATGGTGCGTAGTAAATGTTATAATAACTTGAATA
ATAATAACTAAGCTCGACTCGATATATAGATCCAACCAGTAGCCTTA
TTCACACCTAATCTTCATCTTCATCTTCGCATTCATAGTCTCTACGACG
TAATCCCCCTCTCTCTATCTATCTTTCATATATGTGTGTATGTGTAACT
CTATATTCTGAAAATAGATCAATCAATTGATCTTTTCCTATCTCATGT
TCACAACCATCAGTTTGACTTTTGATCGTTTAAGGCTCGAGAGATTC
TTCACTGTAGTAAAGATAGTTTATACCAACAAACCCATTTGGTGTTA
GCTTTCAACATAAGTATGAGTTAGAGCTAGAACCGGATTAGTATTATT
ACTTGTACCTGTTCATAGTACTAACCAAAAATGATCCAAAAAATAA
TAACAAATAAACCATTTATGGTTATCACAGATAGATAAAAGAAGTAC
AGGA
>ARl3_syn3O9 Internal TMRI Arabidopsis root specific Contig X98855.2 Contig Length: 1272 bases (ARI 3-3; no errors)
CATUTTGAATGACATTGGTTTCCAGATTTAACTTCATATGTCTTGCCAT
AAATTTGTACGCATTGATATAGTATCATGGTCCTGACTTTAAGCATGC
ATGGGTAATGATATTAATGAAATATCGGCGAAATTTCTTGGATAAG
AAAGATTCGTACGCATGAAACCAATATGTGATGTTGGTTCCATATAA
AGCATTTGTAAAATTTAGAATAAAATCGAGTTTACGTCAGAGCCATCA
CATTACCATTAAAAATTGGATGAACTGATGAACAGGTTGAACCAGAAT
P NO ~~kIZ820a 212 00
GTCACTCAAAGTTAGAGCTTGGTTGATAGGTTCTTAAGACTAAACAGTTCC
TCATCAGTATGTAATATAATGAAATAATTTAATCTCTTTGTGTGTAAGGFrG
TTAACTGGACTACCACTAGACTGAATTTTTAATTAAGTCTTATCCTATAGTC
TTTTCAATAAGTTCACAATTGCAAGTACTTAATAAACAAAATAATTAATCA
0 5 GTTAATTATGACAAATTAGTCAACATCCGATCACATTCCACGTTGTAAAGT N TAAATTTTAAGGTAGATACTAATCATACTTGTAAAATGAATAAGATGAGTT 00
CTTCTTCTTGTCACCTCTCGATCTTGTTTTAAGTAGTTGGTGAGATGTGATA
AACTTGTAACATGCCACTGAGTTGTCAAAGACAAGCATCTATACAGTTATT
AAAAAAAAAGGTTAAGCATGTAAAAATACACACACATGTCGTAGTAAATA
TACACCTTTTTATAATTAATTATATTGTAACGAATTTGTTGTTTTGTTATAA
TATATAGATTAATGCATGATGTTTTGCGATTAAAGCCAGACGAGTTGTAAT
ATCCACAGCCTTGATAAGCTCTACATGCAGTGAACAATTTTATACATTTAG
AAAAAATAATCACTATCTCGACCATATAGACCAGGCCACTACATTACAGCT
AATCTCTGGATTTACTTGATAATTAAGACAAATATAGAACATTAAAATACT
AACTCGATGCCTCACCTTAGCCTCCTCTCAAATTGTCAATATCTAGATGGA
GTGTTACATCCACATTCCTAACAGTTTTTACTCTTTATTTTAATATATCCTTC
AACAGATCATCATCAGAATAATCATCAAATCATTATTATATATTTAACTAG
CCCAAATTGTACCATACCTATCAATTTAAATTTCTCTTTCTATCTACTATAA
AAAGTGACTCTCTAAGAACTCCAAAGATTAGAACATTGAATTGA
>ARl-syn3Ol Internal TMRI Arabidopsis Root specific Contig AC002333.199 Contig Length: 1371 bases (AR 1 base 251 AA to no bases; base 345
CTCTTTATTTGTCGTGACTCGCGAACCCCTTIITATTAACGTTAGTCAA
CACAACATTTCATTAATGATAATTCTACTACTATTAGTTTGCAATGTTAACT
AAACTCTTTTTACGTGAGAAAACTTAAGATTATCATTTCCAGACCACCGCA
AGTTCCTTGAAAAGATTGTTATATATATAACAGCTGCATATCTTAATACGG
ATTTATGGGCTTTAATTTGAAATCAATTGTATCAAATAGGTTTGAAAAAAA
ATCGTATCACATACCTTTATTTUTTGAGTGTAGTATAAGCAAGCAATATTG
ATGAATGCGTGAGTCTGCAAAATTTAACCCCAAAAAAAAGTAAGCAACAA
TATATATTCAGCAATCATGTTAGAAAGTATTTTAATCATGUTGAACTGAAC
GATCTCCGCGCTAATTAGTATTCCTAAGAGACACCAATCAGAAACTATTGG
P %OPER s$I2699250 dO.22/I2MU 00 -297-
ATAGTTCGACGGTTTAGAATTTGTCCAGTTGAGAATGGTTTTCAAACTATT
TTATAAAATTTTTTTAGCGAATTTCTAAAGTTAAGTTGACCGGCACATCT
TGGTTAAATGTTTCACTCGTCGTTGAAAAAGTCTTTTCAACAAAATCTTAC
TTCTGGATATAATTAATATCATATGTACAAAAATTGATTAATGGGTCTTAA
ACTATTTCATGTATTTACTATTTAGATAGAGACGTTTAAAAAAAAACTT
N ~TTCGTGTCTTTACTATTTAGATAGAGATTACACGACATGGAAATAATAGT
CATGGTCAAGTTTATATACGGACGACTCTCATGAAATCCTACAACAAGA
c-I AACAAAGCAACATATAGTATAATGTGAAATATACACTGTTAAGCAACT
TTACGTATTATAGTTATTTTTATGTTAATGACGTACAATGTACAAATTCTAG
TATTCTTCACCTGAATTATTTGATGCTAAACTACGTACGTCGTGGTTATT
CATTGTTCTTTAATTAGCCATCTCGAAATATAATTATTTCAATGTTACAAG
TTTTAGTCGCTCTAATAGGATGTTTATGAATTTAAACCGACCCAATCCGAC
TTGTTTTTTCTTCTAAAAAATATTATCTTGAAAATGATTTTATTAAATTCGT
TTTCGTCTTAGTCTAATTCAGCTATAAAGTATAAACGTTATGACCAAGTC
ATAATCAAATCATCATAGTATTTCTCCTTAATCACAACTACAAGAAAAGG
AATGGGTCATGACTTTCTTATAAAACATTAACTAAGATTTGACCAAACAT
ATTTTGTATTATCAATATTACACCATAAATACGGCCACATATCCTCCTAT
TCTTCACACAACTCTCCCCTCAAAACATTCCATCAAAGG
>AR2-syl3O2 Internal TMRI Arabidopsis root specific Contig AC002333.210 Contig Length: 1400 bases (AR2-2; no errors)
ACTTCCACCAGAAAAGGCGAAACCAAGAGCTTTGAATTGAATAGTCAAA
ATAATTGCTTCTACTCTTCATTCTTCAACGTATGCAACGTAACTTGTATA
TGTGCATTTATCATTTTTTATCGGCAAATTTTAGCCTTACACAAAAGACT
ATAAAGTATCATGGCCTTTTTTGTTGACATTGTCCTTCTCTTGTCAACAT
TTCCTGGTTTTAAGATACATATGGATGGTTCAGAATGTCATATAGTATAA
CTATTACTCTACACTTTGATTGATGACATCTTATTCCGATTTTATCCAAG
TCTTTTATAGATGAAAATCAACTGTTATTTATAAAAAAAATAAATAGTG
GATTTAAAGTGATTTGAGTGACATACATTAGGTGAAATTTGAAGGAATTTC
TTAGTTAAAAAATCAAAGATGCAAATCTTATAGTTTnAGGTGAGATTTTAG P QPER'j,.,% 12689250 "-.2VJ 1210 00 -298-
AGAATGTTAATAGCAATTTCCTAAAGTTCACTAAAACCATCTCAAAACTCA
TCAAAACTAAAATCACTCTCAATTCATCCTTCAATACAGCCCCTAAAACTG
AGAATCATTTTTHGTCAACTAAAACTGAAATTCATACAT'ITGATGGAATTT
GAGTAGCCGCCGGGTAATGGACCCAACCAAAGGCTCACATAGTCACATGG
0 5 TACCAAGATTTATAGTGATATTATGCGACATCTCTCTACCACATAGTCACA
TGGCACCAAGATTTATAGTGACAATATGCGACTTCTCTACTTAGGGCCGGC
00
TTTCAGAAACTCTAGAAGAACTAGCCACTGATCGACTATATTAGAGTAGTT
N AATTTTATCAATTACATTTGAAATGTTTATCTCTAGTAAGATAAATATCAA
ACAAACAAAACTTAATCCAAGGACTTGTCTTCATATCGTCTATTAAGAGTC
GAATTTAAGAAATGAAAAATAAACGATTCAAGCTTAAGTTAGTTTTAGGA
AAAAGACATACTGTTTGCTCCATATAGTTTGTACATGTATTAAATATAGAA
TAACAAAATATTTTAATGTTTGCTCCATTGACATTACATGTATTAAACAGTT
TAATAGAAAAACGAACATTTTTGTTTGTTCAATCATTGGGAAATCATAGAA
TTGTTCAAAATATGAAACAAAAGTGAGAAATATCAATTAATATAATAGTTC
TGTTTAACAAGAAAATTGAATTTAGACCAAGTCCACAATATTCATCTTGAG
TAAGAACACGACCAAAAGTCAAACTCGTTTCGAAATACATAAATATGTAC
CCCGCTATACAAAAAAGAAAAAGACATTTACATCCACTTATCCCAATAGA
CAAATGACCAAACTACCCAACATCTACCCCTATATATACCTCACCACCTTT
GCCCTCTCAACCACAAACAATAA
>AR5-syn3O3 Internal TMRI Arabidopsis root specific Contig AC007135.23 Contig Length: 1307 bases (AR5- 1; base 508 C to T
CGGATGGTTGAGGTAGTATGAGTGACCGTGACGATCAAACGTTCTCCAAA
GAAATCGATGTAGCGGTTTTCGTGGATATGGCGCTTTTGGATCTTCTTCTG
ATCCTAGCCATTTAATCTGCATAAAAGTGAGTATGAGAGAGAAGATTAAA
TAGATATCAATCCTAACTAATATTCAAGAAAACATAATATAGATCAATAA
ATTGATGAGAGTAAAAACACAAAGATGTTTAGAAATAATTATTGTCAAGA
CTCAAGTTTCTTCAAAATATCAAGAGGCGCTTGGAATAAGACCCTTATTCT
ACAATACATCAATCTATATAGAGATAAAGACTAAGCATAATTTTTAAAATA
GAAAAAATATAAACGTAAATAACACTTTTTGAGGTAATACTAAATTTCT
AAACATGAAATGTTACAAATCCACAATATTTCCATATAAATTTGTAAATAA
P %OPER~jn'.XI26992S0 dom.22J 12/05 00 -299-
TATTTTGTTAGATAATGTTAAATTTTCTAAACTGAAATATTAACAAATCCGT
AGTATTTCCATTATTAAATCTCGATUTTGTTTCAATGGGAGATTTGAATTTT
GAACCAAAAAAAAAAAAAAAGATTTCATCAAGATATCTAGGGGGATATTT
TGCTGGAATATAGCTTTGATGAGAATATUATATTTTGTATCTCTGAAAATC
0 5 AAGTTTAAAGGGGAAATGATTATGGGTTGAAATTTTGCAATCAAAAGCCC N TAATTTGCAAAAACTACATAAGTTTTTTGTTTGGGCTGGCGCTATCGGATC 00
CTTTTAGGCTTACATTTAACATCTGGTCCACTTAGAAAGAGTCACGTAGTA
TATGGTAATTGTCAACTTGATTTTTCAAGTTAAAAGAAATATGTATCAAAA
TGACTAAAAAGTAGTGAAATATTATGTATCTAATTTGTTTATTTACCAAAT
TAATGCTATAAAAATGTTCAACTGTACAATTGGCATGGAATAATAtGAACA
TAAATCATACATTATTAAGCACTTTTGCCTACGAAGGGATACCAACTTCAT
TAGTTTACATTTTCTTTTGTGTTCAATTGTTAGCTCAAACCCAATTAAGTGG
GGAAAGTAAGAAGCAACAACTCCTCTTCCCGGACCCCTAACAAATCAACT
AAACTCAATATCAAACCATTTTAAAAGAGCTCATCATTAACTAGCTACTAA
TTATTCTTAATCAATCACTGCTTAATACAAAGCACTATATATACACTTGTAT
CTTCCATTAGTTTCCCACCACAACTACAAAACATTCCAATACACAACACAC
AAAGCACACACTTTTTCTTTCTTTTAAACCCCA
>AR6-syn3O4 Internal TMRI Arabidopsis root specific Contig AL03338.24 Contig Length: 1324 bases (AR6-2; No Errors)
TTCCCTCCAATGTCCTACTGTCTCCTTCTCTGTGTGTTACCATGGTTTTACTT
CACCATGTAAGTCTCTCTCATTATCAAAATTCATCTTCTCTGTTTTCTTCCTC
CTCTGAATCAATCCTTTGTTTATTTCTTGTGTTGTGTGTGATGCAGTTAGAG
CAAGGGATGCGTCCGATTTCACGATGTTACAATCCAACCGCGTATTCGACA
ACAATGGGAAGAAGTTTCTTCGCAGGTGCAGCCACAAGCAGCAAGCTAII
CTCCAGAGGTTTCTCAGTCACAAAGCCAAAATCTAAAACCGAATCTAAAG
AAGTTGCTGCAAAGAAACATAGTGACGCAGAGAGAAGGAGACGGCTTCG
GATTAATTCCCAGTTTGCAACTCTCCGCACCATTCTTCCAAACTTAGTCAA
AGTAAGTTTAGCTCTGCATTCAATTACACAAAATGTTTCACCAGAGAAGTA
ACACTTTTTGTATTATGTTCAATGAAACTAAACAGCAAGATAAAGCATCTG
TGCTTGGAGAGACTGTCAGGTACTTCAATGAATTGAAAAAGATGGTTCAA
P /OPER'jn,'I2699250,doc.2211 00 -300-
GACATACCAACCACACCATCTTTAGAAGACAACTTGAGATTGGACCACTGT
AATAACAACAGAGACTTGGCAAGAGTCGTGTTCAGTTGTAGCGACAGAGA
AGGGCTAATGTCGGAGGTTGCAGAGTCAATGAAAGCAGTGAAAGCAAAG
GCGGTGAGAGCTGAGATCATGACAGTAGGTGGAAGAACCAAGTGTGCCTT
GTTTGTTCAAGGTGTCAATGGGAATGAAGGATTGGTGAAGCTCAAGAAAT
CGTTGAAACTTGTAGTGAATGGTAAATCATCATCAGAGGCGAAAAACAAC
00
AACAATGGAGGATCGTTGTTAATTCAGCAGCAATGAGTATTTTGTTTATAT
N ACTTGTACATCTCTGTTTCTCCTAGTCCATTAGAGAAGGTAGATGTAAAGG
TATAAAAGCCCATGTGTTATTGAAATTGGGTGGATACTTACAAGAGTCTAT
ATGAATAAAAATGATGCAATTCTTTCTTTGGAGATGGTGTGGATGTTATAA
CAAAATATGAATCATGTGAAATTTTTTGTCCCATCTTTGTTCTTACCCAATT
GTACCTTTTGAGATGAAATCCCATGGTTGCTTCTAGTAGATAGCTTTCTTCT
GGGAAACAAAGATTTGGTTTAATAAGTTGAACCAACGAATAACTCTTCAA
ACATTCCCCACCTACTTCTCATCAAACCTCCTTATAAATAGAGGATTCCAG
CACAAGTCTCTTCATCACTCAAACCAACAAGAAGTAGTCAAAGCACAATA
CAGC
>AR8-syn3O5 Internal TMRI Arabidopsis root specific Contig AL080253.32 Contig Length: 1276 bases (AR8-2; base 671 A to G)
TTGCTTGTTTTCTGAATCTGTGCGTGTCTTTTTTGAAATCGACAGCGCACTC
CAATCAGGTTGCCCATGCTCCTTCCAGTCAGGTTGCGCAGATCAATTGTGG
GCATTGTCGGACGACCCTCATGTATCCTTACGGTGCATCATCCGTCAAATG
CGCTGTTTGTCAATTCGTAACTAACGTTAATGTGATTATTCCTATCTATTAA
GCCACCTCTGCATGGTTGAGTTAAGTATAGAGATCTTTCTGTTGGAAATTT
TCATTTCTGATTCATTTTGCATCCTTAGATGAGCAATGGAAGGGTACCTCTC
CCAACTAACCGGCCAAATGGAACAGCTTGTCCCCCCTCTACATCAACTGTG
AGTTATCAAATTATGAATTTGTAATAGTTCTGTATATTCTTATGGAACTGGT
ACTTACTCTGTTCATCGATTTTTCATTTTACCAACAGTCAACACCACCCTCT
CAGACCCAAACCGTTGTTGTAGAAAACCCCATGTCCGTTGATGAAAGCGG
AAAGTTGGTGAGTATTTCTATCACCTGTGTTCTTCTTCTTATTTACCACATT
AGAGGAAGATATGACAAAGTGACTGAAACACACAAATTGCAGGTGAGCA
P IOPERlj-uI 26992VJ dm.22/1 00 -301
ATGTTGTTGTTGGAGTGACAACTGACAAAAAGTAATCAAGATGGG
ATCTTGAAGATCAAATCCAAATTCTTCCTCTATTCCTGCGTTTGGTTTGTGC
ATATTACATACGCGGAAAAACTGTATGTTATATATCTCTTGACTCCTTTTTA
ACCCAAGAGAAAAAGCTTATCAGAATCTCTTGTTACTGCATTATTGGGGTT
0 5 TATTCAAAGTTGAAGACACAAGGTTTTTGCTCGAATAATTTGGCAUTCTTTT NI GCTCCATGGAACTTGACCTTCTCTTCTGTTTGTTGACTTCTAAAACTCCATC 00
GGCCCTTGTGGCATTGTTAATGTATGTATGAATATAATCTGATACACCAAC
CAATCATTAAGATTTGGGTTTGAAATCTGTCTCTTCCGTGGATGAGATATG
CTACATGTCACAAGAACTGGTCTTAGCTTTGGTAGATAAGACTTGTCTTAG
AGCAAGTCTTGAAATCTGGAAATCTATTTTGCAGTAATCTTGTCACAACAA
CCATAACCTAATCAGTCAGTACCCTCCAAGAACATTAAAGTTAGATGATCC
GACAAAACCTCTCAACAAGACCAAACTCTTTCCATATAAATACTCTTTAAC
ACTGACACAAAGTTTCATCACTTTCTCTTGATCACTCACTGCATCA
>ATU56929_SynOO7 Internal NADII GeneChip Arabi dops is constitutive Consistent expression greater than 500
TCTCCCAAATAAAAATGAGAGCAAACACTAATCTAATATTAAATTGAATTA
AAAACTTTTAAATAGTGGAAATATATACCCTAAATTGGAAATAAAAAACC
CAAATATAATATTACAAACTAATTTTAAAATAAAAAATCTCTTTTAAATGG
TGAAAATATATACCCTAAATTGGAAATAGGAAACCCAAATATAATACCAT
AAACTTATATTAAAATGAATCAATATTTCTTTTAAATAGTTGAAATATATA
CCCTATATTGGAAATAGAAAACTCAAATATAATATTTAAATTTATTTCTAA
TTTATTTTGGTTGAATAGATTTTATATAAACTTGTGGTATTATTATTGTCCA
TAAAACTTGTTTTAGTGTTACTTTTAAGAATTTTTCAAATAATCATTTGAGT
GCTAATTATGTGTAAACAACTTTTTAATGCTATTTTTGTCCAAAAAACTTAA
AAATGTGCTATTTGTGGGAATTTTTCAATAAGATATAAATTTAAAACTGAG
TTGATTAATTAAAAGTGTCACACAAAAAAAAGTTTAATGTGAACAACAAC
ATTAATTCTTTTTTAAAAAATTTTGTTTTATACTATTATTCTATTAACATGTT
AATTAATATAACTAGAAAAAAAAATCAATCTACTAAAACTAGGTTTTTTAG
CATTTTATAAATATTTGTATGAGAACTTTCTCTAATTCAGTTCATCCAGTTA
ACCATTGTTCGCTTATTCTGCAATTCATTTATTTATCTGATATACCAGTTAA
P kOEk.1292Ud .12J 00 -302-
CCTTAAATGTTGTGTAATCAGTCGTAAAATTGTTTTGTGTAATGTTACATAA
ATTAATAGAATCAAATTTAAAATGTGTTCTAATTATGCTATGACGTTATAA
ACAAACGATAAATTCCGATTCATGATATGAAGTATTCAATTGAAAACAC
AAAAATCGACAAAATTTTAAAAATATTTTAGATCTTACATTACATACCTGT
0 5 ATTGTCGCAAAGGAAAATTTATTTCTTGTCCTAAAAGGCCATTTGGAACTT N GAGCTAATGTAAATATATAAATGGGCTTATTGGGTCCTCTAATGGGCTTGC 00
CTTTGACGTAGAAGACAGAAGCATCGTTGTGACTCCCGTTTGTGATTTAGG
AATCCGCACTGCTTGCCGTTTTCCGTTTCTACTACTTTTCAATTCAGAAA
CGCCTCTCTCGTCGTCTTCAAAGCTAAATTAGAAACCTGACGATCTCTCTCT
CTCTCTCTCTCTCGATCGGATAATATTTGAGCmTGTGGTTGGAGGATCTGA
GTTAGTC
>PRI-SynOI8 Internal: PCR Arabidopsis PRI promoter 1.2kb fragment Inducible by SA, INA, BTH, pathogens SeqLen: 1260
GGTTATTGTTGTGTTATGATTTTGGGGIICGTAAACATCGCTTATATAGAG
ATTTGAAAACTATTTTTTTCTTTTTTTTTTTGTTAACTATAGATCTCACGTTT
TTGTAAATACATGGTCCATGTGTGAGTATTTTAGTAATATTCATTGCAATTG
TCCAAATGAATAGAAGTTGTTTTCGTAACTATTTTTTTGTCAATCTTGTCCT
TACACACATTTTTCCTAATATTGTTTCGTATCGGTAGCTTTGCCATTGTTGA
TATATTTTTTTAGTATATATGTAAGTATACCCTAAATGAAGTTTATTAAGAA
ACATTGTATATAGTTGTTTCATGTCATTCAGTTGTTTGTGTTTTTTTTTCTT
CATGATTCTAATTTAAGTCTTCTATTTCAAATTTGAATTTCATATATTACTT
CATTCAAAATGTTGTGAAGATATCTTCCTGTAAATAATACAGAAAAATCGT
ATCGGACAGTTTGGCAATTAAGATTATATTTACAGTCAGAAAAAATAAAA
GTTTATATCTACAGTCAATTTTCAAATAAAAGAAAAAAAGTCAAGAATTAT
TTGTTTCTTAGTGTTTCATGCATATGAGTATCTCTATCACTCTTGCCTATGG
CTGAAAAGTCCTGAAGAATATATGCCGCCACATCTATGACGTAAGTAAAA
TAGTGACGTAGAGAAACAGTCAATAGATCACCCATTGAGATTTATCCAAA
AAGAAAAAAAAAAAAAAAAAAAAAAAAAAAGATCACCGATTGACATTGT
ATACACTTTGTTTTTTTTTTCCAAACACTAATACGCAGTTTAAATTGAAAAA
CTCTAGGTGACCGATCTACTTTTGTGTTCTTCTATCTTCAGTATACCTAATT
P %OPERjn'S%126H9'S0 doc221I 2MO c-I .303-
TTGTACCGCCTTCGTATATCATTTACCAATTTTGACTACTGATATGCACTGG
CTTTAAAATTTTCCAATCCTGATATGAATCTGTGATTCTAAGCAATAACAT
ATACTCCCTCCGAATCAGAAAAATTGATTTTTAAAGTTTTTGTATTAAA
AAGATTGAGTTTATGTATATTTTTATCAATCAATATAAAAGGTTATGAAT
TTCAAGAATCAATTAATTGAGAATTTTAAAATTTGATGAATTACTATTGGT
c-I TAATAGTTACGAGAAATAGTTTAGCATGAATAAATAGTAATTTATAACTAA 00
GCATTATTATTTTTTTAATCGGTATAAACATTCTATAAAATCAAACTTTTTT
N- ATATGGAGGGAGAATCATTTTATAAG >UBQ3_SynOI6 Internal PCR Arabidopsis Constitutive dicot promoter with intron Accession L05363 SeqLen: 1335
GGTACCGGATTTGGAGCCAAGTCTCATAAACGCCATTGTGGAAGAAAGTC
TTGAGTTGGTGGTAATGTAACAGAGTAGTAAGAACAGAGAAGAGAGAGA
GTGTGAGATACATGAATTGTCGGGCAACAAAAATCCTGAACATCTTATTTT
AGCAAAGAGAAAGAGTTCCGAGTCTGTAGCAGAAGAGTGAGGAGAAAT""T
AAGCTCTTGGACTTGTGAATTGTTCCGCCTCTTGAATACTTCTTCAATCCTC
ATATATTCTTCTTCTATGTTACCTGAAAACCGGCATTTAATCTCGCGGGT
AYTCCGGTTCAACATTTTTTTTGTTTTGAGTTATTATCTGGGCTTAATAACG
CAGGCCTGAAATAAATTCAAGGCCCAACTGTTTTTTTTTTTAAGAAGTTGC
TGTTAAAAAAAAAAAAAGGGAATTAACAACAACAACAAAAAAAGATAAA
GAAAATAATAACAATTACTTTAATTGTAGACTAAAAAAACATAGATTTTAT
CATGAAAAAAAGAGAAAAGAAATAAAAACTTGGATCAAAAAAAAAACAT
ACAGATCTTCTAATTATTAACTTTTCTTAAAAATTAGGTCCTTTTTCCCAAC
AATTAGGTTTAGAGTTTTGGAATTAAACCAAAAAGATTGTTCTAAAAAATA
CTCAAATTTGGTAGATAAGTTTCCTTATTTTAATTAGTCAATGGTAGATACT
TTTTTTTCTTTTCTTTATTAGAGTAGATTAGAATCTTTTATGCCAAGTATTG
ATAAATTAAATCAAGAAGATAAACTATCATAATCAACATGAAATTAAAAG
AAAAATCTCATATATAGTATTAGTATTCTCTATATATATTATGATGCTTAT
TCTTAATGGGTTGGGTTAACCAAGACATAGTCTAATGGAAAGAATCTTT
TTGAACTTTTTCCTTATTGATTAAATTCTTCTATAGAAAAGAAAGAAATTAT
TTGAGGAAAAGTATATACAAAAAGAAAAATAGAAAAATGTCAGTGAAGC
P \OPERijm\12619250 doc.22/12/O 00
O
O -304-
SAGATGTAATGGATGACCTAATCCAACCACCACCATAGGATGTTTCTACTTG
AGTCGGTCTTTTAAAAACGCACGGTGGAAAATATGACACGTATCATATGAT
TCCTTCCTTTAGTTTCGTGATAATAATCCTCAACTGATATCTTCCTTTTTTTG
TTTTGGCTAAAGATATTTTATTCTCATTAATAGAAAAGACGGTTTTGGGCTT
O 5 TTGGTTTGCGATATAAAGAAGACCTTCGTGTGGAAGATAATAATTCATCCT N TTCGTCTTTTTCTGACTCTTCAATCTCTCCCAAAGCCTAAAGCGATCTCTGC 00 O AAATCTCT
O
The reference to any prior art in this specification is not, and should not be taken as, an acknowledgment or any form of suggestion that that prior art forms part of the common general knowledge in Australia.
Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.

Claims (26)

1. An isolated polynucleotide comprising a plant nucleotide sequence that directs root-specific transcription of an operatively linked nucleic acid segment in a plant cell, wherein said nucleotide sequence can be obtained from a gene encoding a Spolypeptide which is substantially similar to a polypeptide encoded by an 0 Arabidopsis gene comprising an open reading frame having any one of SEQ ID NOs: 358-366, or a fragment thereof which directs root-specific transcription, or to a polypeptide encoded by an Oryza gene comprising an open reading frame having SEQ ID NO: 774 or 792, or a fragment thereof which directs root-specific transcription.
2. An isolated polynucleotide according to claim 1 comprising a plant nucleotide sequence that directs root-specific transcription of an operatively linked nucleic acid segment in a plant cell, wherein the plant nucleotide sequence has at least nucleotide sequence identity to one of SEQ ID NOs: 1-51, 518-526, 536-544, 825 and 843.
3. The polynucleotide of claim 1 wherein the plant of claim 1 wherein the plant nucleotide sequence has at least 90% nucleotide sequence identity to one of SEQ ID NOs: 1-51, 518-526, 536-544, 825 and 843.
4. The polynucleotide of claim 1 wherein the plant nucleotide sequence has at least 98% nucleotide sequence identity to one of SEQ ID NOs: 1-51, 518-526, 536-544, 825 and 843. An isolated polynucleotide comprising a plant nucleotide sequence that directs root-specific transcription of an operatively linked nucleotide acid segment in a plant cell, which plant nucleotide sequence hybridizes under high stringency conditions to the complement of any one of SEQ ID NOs: 1-51, 518-526, 536-544, 825 and 843. p:OPERlU\3045061 claims doc-l1412/ 2 00 00 O 306-
6. An isolated polynucleotide comprising a plant nucleotide sequence that directs root-specific transcription of an operatively linked nucleic acid segment in a plant cell which plant nucleotide sequence is selected from the group consisting of SEQ ID NOs: 1-51, 518-526, 536-544, 825 and 843 or a fragment thereof. 00
7. The polynucleotide of any one of claims 1 to 6 wherein the plant nucleotide c sequence is 25 to 2000 nucleotides in length.
8. The polynucleotide of any one of claims 1 to 7 wherein the plant nucleotide sequence is from a dicot.
9. The polynucleotide of any one of claims 1 to 7 wherein the plant nucleotide sequence is from a monocot. The polynucleotide of any one of claims 1 to 7 wherein the plant nucleotide sequence is a maize, soybean, barley, alfalfa, sunflower, canola, soybean, cotton, peanut, sorghum, tobacco, sugarbeet, rice or wheat sequence.
11. The polynucleotide of any one of claims 1 to 10 which comprises a TATA box, a CAAT box, or both.
12. A composition comprising the polynucleotide of any one of claims 1 to 11.
13. A recombinant vector comprising the polynucleotide of any one of claims 1 to 11.
14. An expression cassette comprising the polynucleotide of any one of claims 1 to 11 operatively linked to an opening reading frame.
15. The expression cassette of claim 14 operably linked to other suitable regulatory sequences. PAOPER'ns\30506I c.Wsydo.-14/022OO 00 -307-
16. The expression cassette of any one of claims 14 and 15 wherein the open reading frame is in an antisense orientation relative to the nucleotide sequence which directs transcription.
17. The expression cassette of any one of claims 14 and 15 wherein the open reading 00 frame is in a sense orientation relative to the nucleotide sequence which directs N transcription.
18. The expression cassette of any one of claims 14 to 17 wherein the open reading frame is from an insect resistance gene, a bacterial disease resistance gene, a fungal disease resistance gene, a viral disease resistance gene, a nematode disease resistance gene, a herbicide resistance gene, a stress resistance gene, a gene affecting grain composition or quality, a nutrient utilization gene, a mycotoxin reduction gene, a male sterility gene, a selectable marker gene, a screenable marker gene, a negative selectable marker, a gene affecting plant agronomic characteristics, or an environment or stress resistance gene.
19. The expression cassette of claim 18 wherein the stress resistance gene confers resistance or tolerance to drought, heat, chilling, freezing, excessive moisture, excessive salt, or excessive oxidative stress. A recombinant vector comprising the expression cassette of any one of claims 14 to 19.
21. A host cell comprising the vector of claim 13 or 20, respectively, or the expression cassette of any one of claims 14 to 19.
22. The host cell of claim 21 wherein the cell is a plant cell.
23. A transformed plant, the genome of which is augmented with the expression P:OPERjm\304(O061 claims doc-I41O/200 00 -308- IN Ccassette of any one of claims 14 to 19 or with the vector of claim 13 or respectively.
24. A transformed plant comprising a plant cell of claim 22. The transformed plant of claim 23 or 24 which is a dicot or a monocot. 00
26. The transformed plant of claim 25 which is selected from the group consisting of maize, soybean, barley, alfalfa, sunflower, canola, soybean, cotton, peanut, sorghum, tobacco, sugarbeet, rice, wheat and Arabidopsis.
27. A product of the plant of any one of claims 23 to 26 which comprises the vector of claim 12 or 19, respectively, or the expression cassette of any one of claims 13 to 18 or the gene product encoded by the open reading frame.
28. The product of claim 27, which is selected from the group consisting of a seed, fruit, vegetable, transgenic plant, and a progeny plant.
29. A method for augmenting a plant genome, comprising: a) contacting plant cells with the expression cassette of any one of claims 14 to 19 or the vector of claims 13 or 20, respectively, so as to yield a transformed plant cell; and b) regenerating the transformed plant cell to provide a differentiated transformed plant, wherein the differentiated transformed plant expresses the open reading frame in the cells of the plant. A method to identify a gene having a promoter, the expression of which is altered in root comprising: a) contacting a plurality of isolated nucleic acid samples on a solid substrate with a probe comprising plant nucleic acid corresponding to RNA isolated from root so as to form a complex, wherein each sample comprises a ?'OPER\js\304I560 cin, d.,M4102t2OI 00 O S- 309- Splurality of oligonucleotides corresponding to at least a portion of one plant t gene; and b) comparing complex formation in a) with complex formation between a second plurality isolated nucleic acid of samples on a solid substrate contacted with a second probe comprising plant nucleic acid corresponding O to RNA that is not from root, so as to identify which samples correspond to O genes that are expressed in root, wherein the identified genes are orthologs of Arabidopsis genes comprising a promoter selected from the group consisting of SEQ ID NOs: 1-51, 518-526, 536-544, or of Oryza genes comprising a promoter selected from the group consisting of SEQ ID NOs:
825-843. 31. The method of claim 30 wherein the probes comprise nucleic acid from a dicot or a monocot. 32. The method of claim 31 wherein the nucleic acid is from a cereal plant. 33. A method to alter the phenotype of a plant cell comprising: introducing the expression cassette of any one of claims 14 to 19 into a plant cell and expressing that open reading frame in the cell so as to alter a characteristic of that cell relative to a plant cell that does not comprise the expression cassette. 34. The method of claim 33 wherein the cell is a dicot cell or a monocot cell. 35. The method of claim 34 wherein the cell is selected from a cereal cell. 36. The method of claim 34 wherein the cell is selected from maize, soybean, barley, alfalfa, sunflower, canola, soybean, cotton, peanut, sorghum, tobacco, sugarbeet, rice or wheat. 37. The method of claim 33 wherein the expression inhibits transcription or translation P )PER\jmS\304506I claims dc 14,02/2008 00 O -310- C of endogenous plant nucleic acid sequences corresponding to the open reading frame. 38. The method of claim 33 wherein the open reading frame is expressed in an amount that is greater than the amount in a plant which does not comprise the expression ,i cassette. 00 39. A computer-readable medium having stored thereon a data structure comprising: a) a nucleotide acid molecule that has at least 70% nucleic acid sequence identity to a nucleotide molecule selected from the group consisting of SEQ ID NOs: 1-339, 457, 476-515, 517-526, 536-579, 602, 693-773 and 825-875 or the complement thereof; and b) a module receiving the nucleic acid molecule which compares the nucleic acid sequence of the molecule to at least one other nucleic acid sequence. A computer-readable medium having stored thereon computer executable instructions for performing a method comprising: a) receiving a nucleic acid molecule having at least 70% nucleic acid sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-339, 457, 476-515, 517-526, 536-579, 602, 693-773 and 825-875 or the complement thereof; and b) comparing the nucleic acid sequence of the molecule to at least one other nucleic acid sequence. 41. The isolated promoter lb_syn299 comprising the sequence TACAAATCCAAAGAGATTCCAGATGAAGTAAAGAAGTTGTGCCTTATG CTGATCCAAACGACAGAGATGTCGTTATACTTGGAACTCTGTGTAGTTC AGGTTTGCAGGATCCATCCTGTATTTGGGCGTGTGGATAACTTCTCCAA AGACTTGAAAAAACTAGTGAAAGGTAACAAGTGTTTCTTCCATAGTAAT ATTGACAAGACTATTTTGGGATTTGGTGCCTTTTTTAAAATACGATTTAG TTGCAAGGAAAAAGTGAAAACGGTTTCGTAACATTGCTGCTTCTTTTGT I P:%OPERs434506l clim dcoc.I4O212 0 0 8 00 -311- TTTGTCTCGATCAGCTGCTGAGGTGCATACCTACTTAGAACCGTCCATA GATTCACTGAAGAAAATAGCTGCGTTTCTGTATCCTGGATCACTTTAGA AACAAAACAAACATGAGGACCATGCTTGAATGTGGTACGTATGTATTA GATTCCTTCCTTGATGAGTGATTAAACCGGCTATTGTACCATTGGTATAT GTTAGTCATATAATAGTATTATTCTCTTTATTTCATATCATAGCTTTAA AAAATGTTCGGCTCATGCTGTCCACTCCTTTTGGGCCGCTCGTTGCTTTC 00 ATTTTTTTAAATTGCTTACCTCTCAACAAATTCTTTTGATTGGTTCTCTCT CTGACTCTAGGCCGCAGAAAGTGCAGTTCCGATTTCTCACTCAACT AACTTTTGATAATCACTTATTCTAGATTATTCTGATTTTTGAATTCCCTCT ACTCTTGAACACGTTTACTTACTATGAGGAAAAATTTAACCCTAG AAAACCACTCATTACAGCTAACATCTATGAGGGGTGGACTATTGCGAA AGCATTGATAGTGTTAATTGAAAGTCATGCATATAGTATGCGTTACTAC TAAAGTTTAACGGTTCAATTTTTTTGAATTTGACTGACAGTAAATAA ATTAATTTTTAAGATTAAAAGACGTTGTTTTTAGCAGTTGTTTAGAAT TGTGGGACACGTGTGGCACGTTGCTCCAGGAGGGGCATATGCCAAGTCT GAGATACTCCAACGCACTGACTGACTGACCCCTACTTAACCGGTGGTCA AACTCTTAACCTAACCACGGTTAAGATCTTAAAGCCGTTGAGATTTTCC CACATGTAATAATCTTGTTTATCTGTGAGATATTCGCCGCTTCCCCTTGG CCGGCTATAAATCGATAACCTCACCGATAAATCCTCTATTCATCATCCA CAACAAACCTCTTCTTCAGTCTGATAGAGATCTCACG 42. The isolated promoter I G2_syn300 comprising the sequence CCTCAGCAAATAAGAGGACGATAAGGATCGGTCTTCAGCTATAACAA GTAAAGAAAGTTGAGATTCGAAGACTCTTTATAAGTCATTGGATTTGTA GTAAATAACAAATTAACAACACAACATTACACACATATACTACA AATTCGAGTTAAAAACCCCAATATAATATATGCATCGACTACTAACGC GTTTCAATGACTGGTAAACATATGTAACTATCTCTGTTACATATTGAAT GATAGTTAATGTCACTACCCATTTA TAATTAAAGTCATAAATAAAAATCATCTATAATGCGTGTAAGCTTGCAT AAAAATACAGTATATAACTTTTATTTAAAACTATTAAGTATCACATCA ATCGGAAAATGATTTGCTTTTGAAGTTATTACACTAGTTTATTA I P OPIRpmWO3485061 claims (OC-14O0flOO 00 -312- AATTGTTATCATCATCTCGATTTTAATAATGCTATATATACTTAGTCTTT TATTTATTGTTATTGTATATGCGAAAATGACTTGCACTGAGTTGCTTA CGGGCAAACCTGACCAAGATGTGGGAGTTCGAACTGCAAATATGTA TAATTCTTAATAAAAAAAAAATATATCCTACATTTCTTCATTTTTTTTTT AAAATACTAATATTTGCATACTTTGTTGATTGAGTTTCTGAAAATCAT AATTGAGTTTTTAAATTAGTTGGTTTGTATGCATTTGACACTTCCAATT 00 TCTTTTAAATATATCACTTTTCATATATTCTTGTAGAGCTATAATTTTAC AACAATAATTGAAATGTCGACCCAAAAATATACATTTAAAGGCATTTCG CTGATAAAAATCCAGTTTAGATGTATTTGTATTATAGGGAACCAATT ATATTATTGGTTAATATTTATTAGTCGATATTGGGTACATATGTATGTTC TTTTACGATTATGCCATCAAAAAATTTATTAGCCATTCGAGAACAGG CATCTCTATTTTTTTGCTTCTTCTATAGACTTCTTCGTCACTGATCTCCC ACGACGATCTCCCAAACTCATTTCTCTACGTTCATCGATCTCTCTCTTTC TCGTTTGCTCTACGAAAATCAGCCGTTTAAAC 43. The isolated promoter AC 11syn.271 comprising the sequence TTAGTGATGTTGCAACTTTTAATGCAACATTTTTTCCAGCATATTTT ATATGTAAATTATAATAATGTTTCTA TTGTATATAAAAACCTTAAATGTCAATTGATGATAGAGAGAGACATT ACTATATTATTGTGAAAAAGTATCACTATTTCTAAGATTGTTCTAGT AAAAATTGGTATTAGTTAATTTTCAGACCATCATAAGATGATTTAG ATTAGTGACAAAGAATAATCCTTCAAATACATATTTCGACACAAGTA TACTTGGTATCAAAATCTGTAAAAAAAATCAGAGCCATGACCAAAT ACAATATGTTAAGTTCATGTGACGTGAGATATAAATTGATTTGATTCA CTTTCCAATTGTGTTTATAATTAACGCATTAACACTAAGCAAA TAAAAGACGAAGCGTGATAATGATC AAAGCAAAAACCTATAGATCCGGTGGACAGTCACAGTGTCATTTAAT CCCTATAAATAGCTCACTCCCTTGTCATCCACAAATCGTCCCCGTCTCGT TTCCTTCTTCGCTCGCTGTTCAGATTTTGCTTTGAGGCTTTAGGCTCCCC AGATCTCTAATCGCCGCAGGTTTCGCTCTTCTTCTCCGTCTTATTGATTT CGAGTTTTTAGGCGATGCTTTTACGGGTTTTGTTGTTAATCTGAAACGA P,%OPEk\jnms\304SO6 CIWjn4C-4/O22008 00 -313- AATGAGATTTTTCTATGGGTTTCGATTCAGATTTGATAATATTCGAACCT TCTACGCCTGTTATTATAATTAGATCTGCGATAGTGTGTGACTATTGA ATGAGATTCTCAAGTTCTTAGGTTATATCGTTTGTGAT1"IATACAGATTT C AAAACGTATGTGGATCCGTTAATTTTCCAGTGCTGTGTAGCAGATCTGC TTAATAGGTTTATCTTTTTTGCAAATGATTTTGATTTTCGCANCGATCGT GTACTCTATGTAGTAGTAGTAGTATATGATTTGATAAATGTAGTAGTAG 00 TAGTATATGATCGTGTACTGAGCCATAAATGAGCCTTCCTCGTTATTA TTGTCCATGAATTGTTAGTTAAGCTTGAGTTCCTTAACGTTTAATTA GATCCTTATCACTGACTGTTCCACTATGTATCAGAGTCGAATCT CTTTGGATGAGATGCGTCTGTTTATGCTATTCCACATGATTTGGAT CTTTCTTAGCTTTTTATGTCACTTGAGTGTGGAATCTTTTTTTTTTGTTCT CTTCCTTTCAATTGTAAAAAGTTTGTTATATGTGTATGATTTTTATGTGG TTGCTGATTCAATTTTTCTTTTTG 44. The isolated promoter AC I 2syn272 comprising the sequence TGTGGAGATCAGTGCCTGATAAAGATAGCATTGCAATGATAATGTATGA TGTGCAACGCATAAGACAACAATTGACATCAAGCACACCTCTTCTGGTG ACTGGAAATCAAACTAATAAGTTAGCTTATGACTTGCACTAGAAACAC TAGTTTCAGAAATCAGCATAAGTATCGAAGAGAAAGCTCTAACATGTG ACAAAAATTAAACGTGGAAAGTACGTAAGCTGCAGGTATCATCTCTA TCACATTCTCTAGACTCTAGCTACTATACATTATTTTAATTTATCGTCG TGGAATGTTGATTATGTTTACGCCTAATGTTGTAATTTCATGGTTGATGG ATATATATAGATGTGGGTATTCCTTTTGCTATATGTGTGGAGTCGAATG GAAACAACGGCTAGGAGCTGGTGGTTGCATTCATAGCAAAGCAGAGAT TTATTTTATCATTATTTGTTTTGCAGTCTTGTTTGGAGTGAACTTTTGTTT CTTTTTGATTGCTACTTTAATCAATTGGGTTGTGATTTATTCAAGTGAT TTCCGGCTTACGACTAAGATAACTC TTTCTATGTCTTATGATTGCATGAGTAGCCCAAACATCTATGGTCTAGTG GTAGGAGAAGATTTAGGGAATAGTGAAACTTGTAGATCCGAGTTCGAT CCTCCCTGAAAACAAAAATCATATTTGTTTTGAGAAGTCTCTCAGTTAG GCCTTGGGTCAATTGGTTTACCTGGTAGTTAGAAATGCAGCCGGTCTGA P WPER~ms30485061 chin~dm.I412/2008 00 -314- CTATCCCCTTTCATTAGTCGGAAAACATTTCAAATTCAGACAGACAGT ATGATCTGTAAGCATTAAATCGGGT CTGCCGAGGCTGACCAGATTAGCCGGTAGGGTTTATAAAAGA AAAATGATTGCATGAGTACTTCTCAATTCTTCACGTTGTCACACAAC TTGTTACATGCGACTAAACAAATTATATTGAATCCATATACAGATTTGC CAAATACTATTTCTATTTGGTCCCAATTAGTGATGTTTATATGGATTTAA 00 TAGCCCATTTAGTTATATGGGTCTGTTGTTAAAATAGCCATGTAGA CCCGTTTATGGAAAAAGATAAATGGGCTTTATTTCGACCCGGCCCAA ATTACAACGTGTTCAACAACAACTCTATTATACAAACAGACTACGTCGT TCTCTTCCACTCATCTGAAAACAAATCCATTCTCTCTCTCTCCCTCCA GATTCAAACGATCCGATCCAAAACT The isolated promoter AC I 3syn273 comprising the sequence ACCCATTTGTCTGCCAACATCTCTTTTGGCTATATACTCATGAACTTTA AAAAATCTTCTTATTTGTATGTTCGAAACTCCCTGAAAGTTTCAGTCTTC TTTTTAAGACCAAGCAGATACTACA TATAACTCTTCTCAAGAAATGATATGAGATTCATGACATAAGA GTTGGTCCTTGGAAAGCGACCTCTTCAAGTCTTCATTAATTAGACATTG ATCGTCTGATAGCATTATATACAGT GTCTAGGTCTAGTTAAATTTTTTGAA GCCGACTATATACATATGTGTTTTTCATACTTATCGCAATAGA CACACACTAATCAACTATTTGTAAATTCAATTCACCAAATTATTTAT GTAAGTACTCAATCTGCCGAAACTA TGAAAAGACTGAATTAAACTACTTACTAGTGAGACTGAATTA AAATAACAAAGAATTATCAATAGTATTTTTAATTACATTTAA AAAATAAACTTATTTGAGGACGTAACCTAAATCTCTATATAGTTGTT TTTGACGAATATGAGTTTTATTATAAGACTATTTTTCCAGAGATAA TTAAAATTAAAGATATTTATCAAATT GTAATTTCAAACCAAATATTTATTAATTAATGTGTAATGAGATACTT ACATATCATCTAGACAAGTTGAGATTTTCTTTATAGGGTTTGTAAAA TTGTATTACAAGAACAAGATAATAA PAOPER\jmWO3485061 clims doc- 14/02f 2 0 0 8 00 -315- ATACAATGCAATGATATTTAAAAAAACAACAACTGCATTGCAGTGAATT TCATCAAAATCCATTAAAACATTTCCAAACTCAAATAGAACAACTTCA AAACCTTAATCCAAAATGTTATAGATAGATATGCAATAGCTCTTAGGCC TAGTACATAGCTAGATCTTGTAACTCGTGAAGGCAAATGATTGGGACGT TGGTTCGGTTCTAGTGGTCGGGCTCAGCCTGGCGGAAAAAATTGTTATG GGTCTAAGGCCCATAAAGTGGCCCAGAAATAAACTCGTCGTATTTACAC 00 ACGTTGTCGTTTCTCTTATCTTCTAGAAAACTGTATCCCGTTTTTGTTCTT GTACTCTACACAAACAGACAACTTCAAATTACTCAACACCACGTCGTGA AAATCCGATCTACGTCTCTGTCTCTCTCCAATCTCTCTGCGCCACAGAT TGTGCGATTTACGAAAATCTCTGAAACCTCCGATCGTTAACGGC 46. The isolated promoter AC20O syn278 comprising the sequence CTTGGAAGCATTCAAGAGAGTCGTGGAGAGTGTGGCTCAGCGTCTCAAT GAACAGCCCGTGATCGTTGCTCACAGCGAAAACACCTTTGATGGGAGC GGTATCAGGAGGCTCTTGTCCAATAAATTCGAATTCGATAAt~GGTAACT ACCATACATATATATGTTATCTAGCTTTTATGCTAAAGGAAAACTTTTTA AATGATGGTAACGAGTGATGATGATCCGGAACGGTTTGGTCGCAGGCG CTAAACGTTGCCATGGAGACGATTCCAAAAGACCGTCAGGGTAAGGTG TCTAAAGGATATCTACGAGCTGTGCTTGACACTGTTGCACCATCGGCCA CTTTACCACCAATAGGCGCTGTGTCCCAGGTAAATAATGCCCCGTCTA ATTATTTTGTCTTTTAAATTGTTTATTTTGCCTTTGAATTTACATGTTACA ATTATTTGTTAAACAAATGAAACCAGAATTAGTGTTTTAATCAAAATT ATTAGTGAATTTTTATTTTTATTTTTTGAACGGCATTGATTAGTTAAGTT TGTTTTTGTTTATAAGATGGATAATATGATAATGGAAGCGTTGAAGATG GTGAATGGAGATGATGGAAATGTGGTGAAGGAAGAAGAGTTTAGAA ACAATGGCAGAGATATTGGGGAGTATAATGTTGCAGCTCGAGGGTAGT CCCATATCGGTTTCCTCTAACTCGGTGGTTCACGAGCCGCTCACCTCGG CTACCTTTCTGCCGTCAACTTCGACTGATACAGAGGAGCCTTCAAACTA ATCATAGAAGGGAATAAGCAGCACTAGCAGCAACAAATGTTATATGGT TTTGACTTTTGAGTGTTTACCCCAAAAGTTTTAGATTAATGAGGAA CCGTCTTTACTTTCAGATGTATAAAATTGAAAGTTTGGGGTTTCCTCTTG P:NOPER,jS\I30485061 claims. cim.I41O2/2003 00 -316- TTGGTGTGGTGATTCTACTCATGCCTTTTTTTTTTTTTTTCTAATGACCAT GGGATGCAATGTTTACTCTGTTTTTTAATTTCGTTAAATTTGTTTACGT TTATGATGCTTGAATGGCTATGATGAAACATTTGAGTTATCTTTAAAG TGTGAAATAAATATTCTGAAGTTAATTGAAGATTTGAAAATTTGATTA CAAGAGCTTGGCTAAAACTACAAGGAGACCAGATTAGTACAAACTT AGCTAAATTTAATTAATTACGGTCATTAGCAAAAATATTTGTTT 00 TTATTATATTATTATTGGTAAGTGGAAACACAGAGGACCAAAAGG TCCAAAAACGAATAAACTGTATCTCTCATTCGCCGGAGTTTCCAGCCGT TTCTTTCCGATTCTCGGATTTTTCCTGGGAATCAAACGCATCGCCGAGA ATCGGAAGAGAGGGATAAGGTACCCAG 47. The isolated promoterAC22_syn28O comprising the sequence TCACCAGAAAAACAAAAACTAGAAACCAGGAACTTAGGAAAATC ATAGAGTTAAGCAAAGTTAATCAACGTCATTAAGTTATTATATATAACT ACATTCTATATA.ATCTCTGTTTCGTCATTGTACATTTTGGTGACTGGAAG TTTGCCTGAAAGAGATGCACAAATA TCCTCTTTGTCTACAAATTAAATACATTATCACGAAAAGCTTTATGT ATTATAACCAAACTACTTTATTCTCTCAACTATTGCATTGGTGTGCA ATACGTTTTCTCGAGATGATATCATCATCTTATATCACTTCAACTTT TAAAGTAAAGCAAACGTAAATTAACACGGTCGTTCTAGCTTTGTAGCAT CGAAATAGTTTTAAATGTCAAAAAAATGAGCGTAATTTATTCTTAA TTATCTTTGCCAGATTTTTAAAAACCTTTAAGCATATATAATTCAACTA AAGAATTTTAACTATTTGTGACTATCTAGACTTGGCAGTCAA AATGAGTAGACATAACTCATTCCTGCTGTTGATCCATACTCAACAAA TATGTGTTTAACAATTTTTTTTTTTGGTCAACTTCTTTCAGTTGTA GCTAGAATATTACAAGATAGATGAGATTGAATAGTCCCAAATAG CAAGCAACAAAACTAAAACATTAACACAAATTCTAAATAGAGA CACAAACTTAACAAAGCTTGATACAATGCCTCATGATTATACT CGATATACTAATACCTTAAAATATTTTTTCTAGTTCTATTATATTT AACCTAAAAATATCACTTCTATAAATTAATAATTACGATAATTTAATGA AATTTAGTAAACCATTAATCTCAATATTCTTAATTTATAGAGGTTTTACT POpE 12sj304S061 cLdimdc.I42f2-O 00 -317- AAATTGTAGAAACAACTAATTCGAGTACATCCCTGTTATAATTTT __AGAAATGTGAATTAACGAATACTTTTGTTCGTGTGGTTAIAAA GTATACAAAGAGATTAGGTAAGTAT ACACCAACCTAATGACAATTTGTTTGATTTATTTGTCACCTAACTAGAG ACTCTCTCACAGTCAACGCAGCTTATGTGTCATAGTAAGACTTTTTGTCT ACTATAGTAGAAGACGAATTTATAACCCCTTAGGTTTTCTAACAC 00 0 ACGCCTCTAATCTCCGCGCACACACACACACCCTCACGAGAGAGA 0 AGACGA 48. The isolated promoterAC24_syn 2 8 2 comprising the sequence TCGTGAACCCATCCATATTCTTTGCTTGACCGCTTCCATACAATCCAC CCCGAAGCTTTTACATCGTGATGTCTTTGTATTTAGGAACACAGA CACAGTTGGTCAATGATAATCATTACAGATTCTAGATTTGGTAGC CACTAGTCAAAGAACTTAAAAGGCAAGATTTATCGGGACATTAGGACA AGGTAAATGAATGCATTATAAGAAAATAAAACCCTTTACATTTTG TTTAATAGAAAAGAAGTAGAGGTTGATTAGTTATTGTTAGTAATG TGTTGGGCTTGTCTTTTCCTCAAATGTCGCGAAGCTCATGGTATAAGC GAAAGAGAAAGCATAGCATGATGGGCCATATATAATAAACTCGAG TATGCTACAAAAACAAGGTTTCAATGCACTCATATCTCGTTTAACATTTT CTATTTTATTCTTTTCATGTGTCCCCCATTGGCTTGGCATAAGTTGA ATTTGTATTGATTTATATCTCATTCTCAGTACGAGCTATTCTTAATT AAAATGAAAAATATGCTATAAACAATTTAAATGATTGCAAGTCCCACCT TGAACAACATCAGTTAATATTTTTCCGTAGCATGTTGCATATAGCATAA TTTTGGTCTTAAGTAACACCACCACCTCACACGTACGTACGACCAATTA TGCATGTCTCAAATCCCTCCATGATTTCTATATGGAGACCAAGGTTTC AAGATTAGCAATTTTAACGGATTAAAACCGGTTCAAGATTTTATTTTTT ATTTATTTTTGCTAAATCCTACAATTTGGTCTCATGACAAAAATAT AAAAACATAGAAACAAATAACAATGAATCTATCGACATCCAAGC AATTAAACTTTCCGAATCAATGAAGCGATAACCGGTAGTATCTTCGAGA CTTCATATACGATCAAAATGCTAAGTACTATTCATATCTTTTATTA TAATGAATTATCAAAGCTTCTATAATTCATAGACAGACAAGGAAT P 'OPER~jnmsOO485061 claim~s dcc-I4/20O 3 00 -318- AGACATAGTATCCGCTTATACAGAC TTAACGAAACGATTTTGTCGAGATTTTTAAACGTCTTTTTCAGGTTCTAC GGCTAAAATTCCTAACATTTCATCACCTGTCGTTATCGTTAATATCGTCC TTGTCAGCAGAAAAAAATTGAAATCAGGATAAGTTGATAACTTCTATGA r- 5 AAAAAACATTATCTTACAAAAATCCAAATACTCCGACTTAACCGGGTCG GATCCTGGTGAGTACTAGTATCTATCTCATTACAATTCATATCCTTCCTT 00 CAACATTCGATCATCACGAAGCCAAAGAACAATTTCTCC 49. The isolated promoter AC26_syn284 comprising the sequence GTGAGGTCATATTCAGGACCGATCCAACAATATTGAGGGTTTTACTCCA AGTAAAATTTTAGTTTTATTTTTAATTATCATAAACGACATAATATAT ATGGAAAGATCACAAATACTGATTAAAAACTAAAATCATCAAACGAA AAGGAAAAAAGAAAAAATTGGGTTCAACTCTCATGAGTTATTAACAT TTTAGGTTTTAGGCTTAAATCTTTAAAAAAAATCAGAACTGAAAACGA AAAATTCTAATTTTATTTTGGACTCTGATTCATAGCTTATGTCGCTTATG TAGTTATGCTAGGGATGAATCTGTATTTCGTTACCGTAATGAGAGTTCG ATACTCTCTTACTTGTTACGATTCTGGAGCATGTTACATTTTTTTCTTTCC GTCAACAACAACTTTAATATGGTAAAACAAAATTTATTTTTATTTGGCT GGTCCTACTCAAGACAAATCTTCTGCCGACATCACATAATCATAT'AA AACCATAACTTCTGCCACTCTGTTTTTTTTTTTTTTTGTAACCATTAACTG ATTGGATTTTGATCCATCTCATCTGATTTTTTAGCTCAACAATTTACTTG CACATTTTCTATTTGGTTTTATTTATACTTAGTTACATATATGATTATCG AACTAGTATCTCTTTATAATTAAGTATTTTTCTATTTTTTTTTAATTTAGA TTTTTGTGAATTCATTTACAGTAGAAAACTGTAAAACCATATGGTCTA TTATAGAATGAAAACTTCAACGAATCCATACAACTTATTGGCTAATAT AATAAATCTGCTTGAAGCATATTGTTATTATTTAGTTGGATTTGACGATC TCTGACTTTAATGTATACCGACATACCCTATGATTTAGATGTTGATTTTT CCCATTCTTAATATATCCATGTTAAGAGATTCCACCATAACATATCTAT TATTTGCATTGTAATAAATATTATCATTAAAAAAAAATACAACTGGACA GCTGGCTCGTCCCATTGTTTCTTACGTCCACCAATTACATTTGTTAAGC AAACTTATTAGAACGTTCATGTGTGAGAAGTTGGTGTCGACATGTGTCT PAOPERj,,I\3O4I5fl61 ehims do.1410212009 00 -319- AAGGTCTATGTCAGAAATCGGATTAGCTTATTAAGTAAACTATACTATA TCATTGTTAATATAGATAAAATATCTAGTTCGTCCAAATTAAACTATTTT CATAACTGCCACGTGGCGTAAACGTATCCATCGAGTCACTTGTATATC C TTTATAACCAAAGTCTTCCAACACATTCATCACCATCTATCTACTCTTTA 5 CTCTCTTCTCTTCTCACATCAATTATTCATAGTTCTCTCTTCTCCGGCAG AAAA 00 The isolated promoter AC3 I _syn286 comprising the sequence TCGGAATCTGCTGGTAATCTACGCAAAGTATACTTGTAATCAGCGACAG TGAGAGTGATCTACAAGTAGAGATAAGAGATTCAATGATGAATTGG AATGAGGAAATGGTGAAATCAATAGAGAGATAAGGAGATACGAACG GAGTAGATAGCGCGAGAAGAACGGACGACGCCTTCTACAGCCGTCGCT ATTTTATTGGAAGGTGAGTCTCGGAAGATGGACACGGCGGTGGCGCTG CCAGTGACGGCGGTTAGAGCTAGGCCGGCGGTGACTGTGAAGCAAG ATCGGAGACTTGGATCTCCCGAGAATTTTGAATTTGCGGAGATCTCCA TTTTTGTGGATTCTTTGGGTTTCGTATTATTTTTTTCGTAGTAACGAGA AGAGGACGGAGAAGCTACACATTTTCTAACTTACTTGCAAGTCGGGTCG GATCGGATTGATGGACAATCTAATGGGCCAGGATCCGGTTAGACTAATC GATGTGATTTTAATGGGCTAAGTAAGCTGGGCTTGGCAAATAGCCAAT ATAAAAGGTTAATTTAGTCAAGAAATCTCTCTCATTTAAATTAACTGA CGTAAATCCCCCTTCAGTATCAATACTGTAAAAATTGGATAGACACAGT AAAACGCAGTGTTTTACAGAATCTCTTTTAATCGATTTGACATCACACA AACTTCAGAGAATCTCATTTTGATAAATTAAAGTTTTTTTTCCACTTTGT GAATTTTAAAGCCTAGGTAAATTAGTGCATATATGTATTTAAGTGTAC ATACTGTATCTCTCTGCAACGAATACAACCTTCTTTTTTACCCACTACCA CCTGTTTTCGCTAGGCTTGCTGGACTCAAATAATGTATTTTTATACGGCA AAATTATTCATTAAATTTCAACTTTACGTTATATACACATTTTTTACA AAAATTACTAACATATATGGAACCTCAAACCTCTTAATGTAGAAATATT AATAAATTTTTATTTAACCATTGGACTAAGGAGCTTCCACAATCTACTCT AATCTAATAAAGTGTATATGTCATGGGTATGAATTTTTTTTTTCAATAGG TAAGAAATCAAATCGTTCTACATATCTTTACGATCTTGTGATATTTTACG P 'fPER~n34SO61 cli c.IAJO2/2003 00 -320- AGCGAATATCGTCGACATAATATAAAACTCACAAAAAATAAAATAATA ATGATACTCCATATAAAGGAAAAAGACAGCAAATATGTAGGGTCATA TAAACGCAGCCTCGTCGTCTCTTCATATATTCGTCTCTTTGTGTTCTTCTT CCTCCTCAGATTCTCTTTCA N 51. The isolated promoter AC34 -syn 2 8 8 comprising the sequence 00 0 GGATCGAACACTCTCTCGTACGTCAAGGAAAGCACTGTGATGCCAGTG N AGGATGACCTGGCTCGCGACGGAAAGGTTGCTGAGCCGAGTCGGTACG AGAAACGAGTCCGCATAGAAACAAGAGAAACGCGACGAATAGGAGAG AAGAAAACGAACTCGATCGCGAATCCGATCTCAACTCCATGACTGAAA AAAAACAACCGGAGATTTCGCTCACCTCCCGATTTTGGACTGGACTGGC GAGAGTCGCTACAAGTCGCTTACGGCGAGGGAGCAGAAATGGGAAAAT TAAGGCTAATTACTAATTTACCCTCAAGTTTTATTATTAAGGTGACCTGA CCTGCTCTGTCTATATGTGATATTGTGACCTGCTTTGCCTATATGGCTAT ATGTGATACCTATAATCACAAGGATATTTCAGGTGGAGAATCAGAGAA AGAAATTGAAGCTGAATAAGACACTATATGGGAGAGATTGAAAGGAAG CTGTTGGGCCATTTTGGTGTAGCGGGTCGCAAGTCGAGCGTGAGACTTA TTGCTGTGCCATTGCAGGAATGCAAACAGAGGAAAGATTTCACAAATG GGAAACGGATACATGCTCAGATGGTTGTTTTGTTGTAGGAAATGCCTTT CAATGAGTATGTTAAACGCTAGCTGTCCTGTTTAATGGACCGGTGTATG TCATCTTGTCTTGCACTGTGTGAGCACAACAACTTGCAATGTTTCCATTG ATGCTGTAGCAGTCTCTCACATTAAGCTCTGGTTTGGATGGCTATGAAC AAGTTGATTGGTAGATAAGTTAAAATGTTGTGATTTGAATCTGGATGA ATAGAAAGATGTGATTGGTACTGATGTAAATTCAATGCTTTAGAGAATG TATACAGGCAATAATATACCAATCATTATGTTTATTGCTGACTAAGAGC CACTCCTCTTTGCTGTTGCAATTCGGCAATCGTTCTAGATATGGTTTCCA TTTCAAATCATGATATGCATTGACTTTTTCCATGTGGCGTTCGGAATCT TTCATCTATACTACGTCTACGTTGCAAGTTTTGCAAAATGTTTAAATTAG TAGAATCTCACGTATATAAAAACTTTAGTCGCCAAATTGAAAATGGAGA ATGAATGGTAAACTACTAGTTTAGCCTCATATTTTAGCTGAAAAATATC GTCACAGCTGACGAAGAAATTAGAAACAACAAGCAACGTGTCACTTCT P NOPER'jms\30485061 cloc.1402001 00 -321- CATGTCGTCGTTTTCCCCAAGAAATATCCAAACTAACACCCAATTACCT AATGCCACGTGTTTACTCACACTCCTTTAAACAAGCTCGTAACTGTTTCA TCTTCTTGTCCCCAAAGTCTCCTCTTCCTTATCTCTTGG 52. The isolated promoter AC38_syn29O comprising the sequence N ~AAGATTTTCCGCTACGGGAATTTGAACCTGAAATGCTGATTTTTAAAG 00 AAAATTTAGCTAATGTGCTACATGAGATGTTTTTTTTGCTAAGTATGA GTTTAAATTGGATATATACATCATTCAATTTATTTTTCTATCTAGAATT TGCTTTCCTAGGACAAATATAGGTACTGAATTATTAGAACATATTTT TTGGTAAGATATAAGTGAGTTTTTATATAATTTTGATGATTAGGTAG TTGATGTGATTTACTGTAAAGTCTTTTCAAATTCTATCTAAAA~CTATGAG ATTTAGATTTCTGTATTTTTAACTAAAGAAGTCTTTTCAAATCCTTTCAA ATCCTTCAAAATTAATAAGAATCAAATCCACTACTATTTTCAGTACA GTAAAAAGGTTGATTTTTAAATTTTTAATTTAATACACTCAATTTCATT TAAAAATTTAAAATCCATAATTAAATAACATATTTATAAATTTTAC ATAAATTATTAGATTGAATACACCGCTCTTATACACATGTAAA GTCTGTTTAAGATCAATTATAGATTTAACTTATCTAAGGCCCATAA TACCGTCCTGTACATCATATAGTTATCTCAAGTTGTAATACTGTAATACC CGTTGGGCCCAGTGGCCCATTTATCAGATTTCATAACAGATCTCAAC ACTAGCATGGCTACACACGTGTCAGATTCAATGCATCAGTCATATCTTC AGCATCCAACACTTGTCAACCTTCCATTGGATCTCTTAACTCTACGCCTC GAAAACAGTTTTTATTTATTTATCATTCCATTTCTCATTGTATCTTCATCA GTCTCTTCTTATTCCATTTTTTCAAACCACTTGCAATTCGAATCAGAT CTTCTCTTCAATCGAAAAAAAAGAAAGGTAATCTCTCTCTCTCTAATC ATCGTTCGTTTCGTAGTTTCTTCTTCTACGTGTAGATCTGATCTTTGATTG TATGTTTCTGGAGATCTCGATCTCATCGATTCTCTGTTCTTATCACTGAT TCAGTGTGTTTGATATCTAAkATCCGATTTGTGTGTAGGATGTTAAAATTT AGGTTTCGGTTTTGTTTCTGCTTTTGAACGATTTTGCTCTAGATTCGTTAT CCGTGAAGAACATAGACGAGTATGTAGATCTTACTTCGGATTCGCGTTG AAGAATTTTCTCTAGATTCGTCACCTATGAAGAAGATTCATTGTGTTCTT AATCTAGATGATTAGGTTATTGTTTCGACTCATTTGTTTATGCCTATTTT P %OPER~jms%304SWOI clai,doc14/022DDS 00 -322- CTCTATGTTCTTAATCGGTGAAGAAATGTATCAATGTGTGTATGTTTTGG GTTCTGATTTTGTAGGATTTGCTCTAGATTGTTGAATCGAAGA 53. The isolated promoter AC4O syn292 comprising the sequence 5 CGCAACGATAGGTGCCTATGGAAACTGAATCAACAGATTTGGTTTTGAT 00 TTATTCTTCTCCAAAATGGCTACTGGTAATGATTGCGTAACACTACGATT CACTATCGAATATATTTGTTCCCAGGTCTTGTTCTGATTGAACGACC ATATTATCATTTGTTGGAGAGGTTTACTAACCGATAGCACAAACGGTT ATTCAGGCTGCGTGTGATAATGTTTCTATGATCTGCTTCCGCAAGGA GCTTTAGAGATAACTTGAAAAGTTTCGGTGTGGAGATCTAACGCTAAAA CTTTrAATTTCTTTCTTCCCGGTTAACCAATAAAGCGATCCATCTACATAC AGAGCATGCCCCCGAGACGAGGAAGTATTAATCCGATGGAGAG GGATTGATATACCTCCAAGTGTTGGTGCTAAATCAAAAACTTCACATG TAGTAGCGTTTTCTAGGCCAAGTTCGGAGAGTTATACATACCAAACC GGTTTGTATATGCCACTGATTTTGTCTTTGCCAAATCCAAATTTAACGTG ACTAAAATAACTTGGCTGCTCAAGACAGATTTGTTGCACCTGGAAACA GGGAAACGTCGATGCCATCGAGTGGCGGGATTATAACAATGTTGTTTA AGGTTTGGTAATCAAAGAGGCAAACAAGACCGTCACAACTATTGTGGA AAAGTTGGTAAATATGATATCGTTCTGATGATATCAACAACACGTTAGT TTAGGAGGACGATACTAGGCAACAG ACTGGGTACTTCAGACACCAATACAAGATTTAGATCTTTCCCGCCAGCT GAGCAGATCAACTGTTTCGCCTGGAAATATTGAGATTCGATTGTCAACT TCCATTGTTTGCAAGCAGACTTGAATCTGAGCAGAGATTTCACCGGAAC TCTCTCAAGAATATCCTCAACGGTGTCGTGGGGAAGCAATTGCATTATT TCTCTGTCTATTGAGAGGATTTTGTTCTGAGTGATGGATACATGAAAG ATATGCTTATTTGTATCAATTCAATCCAATGTTGATTTTTTCCTTGAGGA GGAAGATAAAAAAAAAAAAACGTATATACAATCGATGGGCCCTACCC TATCCCTAACAAATCTCTTTAATATGTAATGCGCTTTATAGTTAAAGCC CATTAGTTAAAAACCCAGAGCTATATTGTTGACCTAGCAAATTTCGGAT CTATAAATTGAAGCCATTTTCTAGGTCATTAGTTTTTTCGTCGAGCAGCC p:\OEt*s34SO6l climsn, dc. 1410212001 00 GCGCTTTTTGGCCGAGGAAGGATAAAGAGA 54. The isolated promoter AC7_syn267 comprising the sequence ATGTGTGTAGCGAAAACCAATGACAACGTTAATTGACTCATACACTGCA 0 5 CAATGTTGAAAGTGTTTCAAGTGAGATATAGAGAGTCACAGAGAG N TACGAAAAGAATCAAAGTAAAACTCCGAAGTCTTTGTGCAA 00 GAGATGTGAAAAATCTAGAGATGTGGTTGTGAACTTTGATTCCCCTATT GTGCGTTGGTTTCAGGATGGACATGGTATACCCACACCCCTCAAGGTT TGAAGAGGGTTTTGATCGTCAGAACAAGCTGCGAGAAGCCGATGTTT AATATGAAACATTAGCTCCTAAACGAAAGAGACTATACTGTGAAGA AAGTCACTAAGTTTATTGAAGACAATGAACTTCAGACATGAAGA TTGAAGAGGTTTCTTTTACCGCACCCACAGCTGGGGAAAGTT CTAAATAATGATGTTATAGTGTTGATATTACTTGAAAATCACA AGTTAAGGAAACTAAAGAGACAGAATACCTTAACTTGTTGATCTTTT CAAGTTTTGTTATCGGTAACTACAACATCCTTACTTATATTTTTTTCTTTT CAGCCGTTTGGGTGCGACAAGAGAAACCTCTTCAATCTTCATGTCTTTA AATTGTTTATTGTCTTCAATAAACTTAGCAACTTCCTTCACAGTCTTTAG TCTCTTTCGTTTAGGAGATACTGTTTCATAATAAACATCGGCTTTCTCGT AGCTTGTTCTGACGATTAAAACCTTTTATAACTTTGAAGGGTTTTGGGT ATTACCATGCCCATCCCGAACCACGCACAGTATGGATCAAGTTCA AACACATCTCCAGGTTCTTCACATCTCTTGCATTCAAAGACTTTTTCGG TGTTTTACTTCGAATCTCTTTGCATTGGATCTTATATGTTTGAGCCGA CCATGTTCTACATATGATGAACAAACTCAGCACTAGCGATTATTAG GCTTTTTTTTTATTTCTATCGATCTTTTTTTTTACCTATTGATAATGTTG ATGTTGAAATACTCAAACATGGAAGTGGAATTAATACCTAA GACGTTTATCTCGATGGACGGGTAA ATGCCAAGAGTGTGAACAAAAGTCCACAACAAAGCCTCTGACGGA GAAGGAGGCTTTTAGGTGTTACCCACAACGCACACATACGGCG TCGTTTAGAATCAGAAAAGACATTTCTTTATGGTCACTTGATTCTCTCTT CCTTCATCAATCAATCTCGTCTCCTGGAACATTAGGGAGCCTCTCAG ATCCTCAAGAAAACCCTAA P OPER*j.WW&45061 Wt.,ndwc.14/02nOM0 00 -324- The isolated promoter AC9 syn269 comprising the sequence TTTGTCACCAAAATCAGACAGGCAGCTGGCTCAAGCATCGCTTAAAT CCCTGTAAAACGCAACTATGTAATTAATATTGAGATATACTTGTTGCTTT CTGACTCTGATTTCATTCACTCGGCAGCATTCTCGTGCTCTCGGCTGCTG TTGCCAAATCTTATGGTATCTTTCTCAAAGCTCATAGTACCGTTGTGT 00 CTCCAACAGACTTTCATTGATGTAAGTCTTATAGGTACTACTAAGATCC ATAATATGTAAGGCCTCACTTGCTTCTTTCCCATCATACCATATGGCTCT TTCTCTTTTTCCACCTCCCGGTACTGAATAACAGCATGTTGCTTGCTACA AAATGTGTGATTAGTAGGAATGTCGGATCTTTCTCTCACGTCCAAAGAG GTAGCAAATTTGGTAATGAATGCAAAGTGTCTCTTTCAGTGGTTCACCA TCCTTAAAATGATAAAGTCTCCATCTTTCTTTGGGTTTTCTATTATCTAC GGGATTATTGAAGAGGAGTGTGATACCTTTGTATATGTTGGTTTCTTCA GCAAGTTTCCTCGATAGCTCAAAACATCGCACACCATGTTTTTCAATCA TCAAACATGAATGAAGTAGCAGCTATCGAGATTCTCTCTCTCGTCCGGT ACTTAGCCAACACCCTTCCTCCGAATCTCTGAATGTCAGATTGATGTCTC CTGGCACTTGGAGGATAATCTAAGCAGACATGGATGGGTCCTAATCGAT CAAGATAGTATCTTCCATATAGGTCTCAGAGTTGAGAGATGTCTTT CACCGTTTCATGCGGAGTTGACTCCTTACTTTAGCATTGATGCAT GATCGTAATTAGTGATATATTTGGAGTTTTCGCTTCCGGTTACTCTGATA TGATATCTTTCCTCGACACTATAACGAATGACCATATTTGTAATAGA GATAGTCTATTTTCGATCTCTCATTTGTTTCTTTCTTTTTTTAACATTACA TTTTTTCATAGATTCTATACTCACAGATTGTTTAATGATTTTTCTTAC AAAAAGTATCATTCAGATAATTTAATAAAAATGGTATCGCAGTGCCTTT ATTTACCTTTAGGAGTAAGTTTTCTTTCTTCCGATATCCTAAATTGTTCG ACACGTGTCAATCACGAAACCACCCAAACCTTGTCGTCTTCTCC AATCATAAAAAAAAAAAAAAAACAGTGTCCCTTTGATCACAA CAAATTCATAAATTCGGAGAAGAGAACGAAATCTTCTTGTTGGCAA ATCTCCGGCGAGATCATCTTTCTTATTTTGTTCC 56. The isolated promoter AF3_syn3 12 comprising the sequence P ZPERjm9304&50b1 cl.amS d-1.4/2200 3 00 CK1 J24J GCGAGTAAGACTTATTTGAAACATCGTCAAATTTACTTCTTTTGGTGTAT ATTTCTCATTATATGGCGTATATATCTGTTTATGTAGAATGTTTCCA AAAATTACTGTATACTGACTTTGTAATCTTGTTTTGATATCAATGATTT ATAAGGAAAAAAATAAAATAATATAkGTATGATGTACATGTAA 0 5 AAAAGTTGTTTCAAGCGTAATTGTTTTTTGGCTAGAGATGAATATACA N ~GCAACAGTAAACTAATAAACTTGCGATGACTAAATTTCTGGTATTCC 00 0 TACAATCAATGAATCACTAATTTATCTATAAGTTTTAGCTATATCCGCTT N ~AAACCCCGCCTCAACTTGCTCTCTGGTCTGGGTATAGTGGGCTACAAC AGTGAAACCGTAATTAGGAAAGAAATGATAACCCAATCCAGAAGC TTACTGCAAGATAAAGAGAAAGATCATGAAGAGGTAGGAGTGATTCAT ATAACAAACAGGGTCACGTTGTCACTTTCTCCCAGAIATACAAATTT AGACTAACTATATAAGGAGACGACTTCAGAGTCTTCTAATGGGTTAGTA TACTCGGGTCATCTTTTAATCTCTGGCTTTAAGACATGGTAAGATTCC ATATATATGAAAACTCTGTGTGTGGTGGATTGCTTTTTTCATTTAGGCA AAGATAGGTTTTAAGGCAGAAGACAAGAACGACCTTTGGCTTATTTATA GGAGACCACCACTTTCACTTGAGTCGAGACAGTACGACATTTAGAATT TGCATTACTCATCTTGTCACTTTCTCCCAGGAAAATACAAATTT AGACCAACTATATTAGGAGACGACTTCAAAGTCTTCTATGAGTTAGTA ACTGGGGTCATCTTTATCGCCGGCTTTCAAGACATGTACATTTCAT ATGAAAACTCTGTGTGTGGTGGATTGCATCCAAGACAGTTTTAAGACAG AAGATAAGAACGGCCTTTGCTTATTTATAGGAGACCACCACTCCTCTCG ATAACCATGACTCGAGACATTAACGACTTTTACTAGGGACGAAC CTTAAGCAAAAGCTCTTGCATTACTCAAATTCTTCTGCCACTTGGTAAGT CTTTTTCTCT 57. The isolated promoter ARlIO syn3O 7 comprising the sequence CCATACATTCGACACGTGATTGTTCGTTAATTTTCTTGATTCTGTAAG AGAAACAAAAAATATAGATGTCCAACTTTTTTTTTCGGGTGGGAATATA GACGTCCAGCTTAGCTACGTACTGATAATTCAGTTCCAAACTAGTA TAATAAATGCAAGTAAGACAGATCA GATGTCATTATAATAAACCATTAAC P.QPRjm,30435O61 dcin~dw.4/O2OO 00 -326- AACTCTATATTGATATTTCTATTTTTTAATTAGCCATGCGTTGCACGATC AATTTACAAAATAATAAAAGAAAATGATCGATCAAAGAGCATTCCATT GAAATTTAATTCCATCCTGTAATCACATAATTTTGGGCCCAATCCTATTT TTCAAATGTAACATGCTATTACATAGTCACATAGAACATCCTAAATAG GGTTAAAATGTACTTTTATCTATTTGCAATTTTGATATTTTCCTTTCTGA AAAAGATTAGTATATGGCAAATTATCTTTTAGATAAAAGATCTTTTGTT 00 CTGACTATACATTAATTTATTTTAAAAAAAAAACTTAACAGATATATTT GCAAATACAAAATGATGAAAAATAAAAGGGATACCATAATCTAAATC TGACAAAGAAAATATACAAAAAGTCAATTACGATACTTAGAAAGAA CTATATATTTTTGGGTAGGGAAGTTCAAAAACAAATTACCGATTTGCTG ACTATATGAGCAATTATTACATACTTTTATTTATTTGTACAACAATTATT ACACATACTTGTGTGGACCAACATGATTAATTTTATATTGGCCATATGG TGCGTAGTAAATGTTATAATAACTTGAAATTAAATAATAACTAAGCTCG ACTCGATATATAGATCCAACCAGTAGCCTCTCTTATTCACACCTAATCTT CATCTTCATCTTCGCATTCATAGTCTCTACGATCAGGTAATCCCCCTCTC TCTATCTATCTTTCATATATGTGTGTATGTGTAAACTATCTATATTCTGA AAATAGATCAATCAATTGATCTTTTCCTATCTCAATTGTTTTCACAACCA TCAGTTTGACTTTTGATCGTTTAAGGCTCGAGAGAATTATCATTCACTGT AGTAAAGATAGTTTATACCAACAAACCCATTTGGTGTTGACCAGCTTTC AACATAAGTATGAGTTAGAGCTAGAACCGGATTAGTATTAATGTTACTT GTACCTGTTCATAGTACTAACCAAAAATGATCCAAAAAAATGAAATA ACAAATAAACCATTTATGGTTATCACAGATAGATAAAAGAAGTCAACA ACGA 58. The isolated promoter ARlI3_syn3 09 comprising the sequence CATTTTGAATGACATTGGTTTCCAGATTTAACTTCATATGTCTTGCCAAG TAAAATTTGTACGCATTGATATAGTATCATGGTCCTGACTTTAGCATT GGCGATGGGTAATGATATTAATGAAATATCGGCGAAATTTCTTGGATAA AAAGAAAAGATTCGTACGCATGAAACCAATATGTGATGTTGGTTCCATA TTCACATAGCATTTGTAAAATTTAGAATAAAATCGAGTTTACGTCAGAG CCATCCAACCATTACCATTAAAAATTGGATGAACTGATGAACAGGTTGA p OPER~jW34850 6 i Clim dox.I4O2/2OO 00 -327- ACCAGAAATTGTCACTCAAAGTTAGAGCTTGGTTGATAGGTTCTTAAGA __CTAAACAGTTCCTCATCAGTATGTAATATAATGATTTTAATCTCTT TGGGAGTTACGATCATGCGATTATA C ~GTCTTATCCTATAGTCTTTTCAATAGTTCACATTGCAAGTACTTAATA r- 5 AACAAAATAATTAATCAGTTAATTATGACAAATTAGTCAACATCCGATC 0 00 GTAAAATGAATAAGATGAGTTCTTCTTCTTGTCACCTCTCGATCTTGTTT TAAGTAGTTGGTGAGATGTGATAAACTTGTAA~CATGCCACTGAGTTGTC AAAGACAAGCATCTATACAGTTATTAAAAAGGTTAAGCATGTAA AAATACACACACATGTCGTAGTAAATATACACCTTTTTATATTAATTA TATTGTAACGAATTTGTTGTTTTGTTATAATATATAGATTATGCATGAT GTTTTGCGATTAAAGCCAGACGAGTTGTAATATCCACAGCCTTGATAAG CTCTACATGCAGTGAACAATTTTATACATTTAGAATATCACTAT CTCGACCATATAGACCAGGCCACTACATTACAGCTAATCTCTGGATTTA CTTGATAATTAAGACAAATATAGAACATTATACTACTCGATGCCT CACCTTAGCCTCCTCTCAAATTGTCAATATCTAGATGGAGTGTTACATCC ACATTCCTAACAGTTTTTACTCTTTATTTTAATATATCCTTCAACAGATC ATCATCAGAATAATCATCAAATCATTATTATATATTTACTAGCCCAA TTGTACCATACCTATCAATTTAAATTTCTCTTTCTATCTACTATAAAAG TGACTCTCTAAGAACTCCAAAGATTAGAACATTGAATTGA 59. The isolated promoter ARlsyn3O1 comprising the sequence CTCTTTATTTGTCGTGACTCGCGACCCCTTnTTATTACGTTTTAGTC AACACAACATTTCATTAATGATAATTCTACTACTATTAGTTTGCAATGTT AACTAAACTCTTTTTACGTGAGAAAACTTAAGATTATCATTTCCAGACC ACCGCAAGTTCCTTGAAAAGATTGTTATATATATAACAGCTGCATATCT TAATACGGATTTATGGGCTTTAATTTGATCAATTGTATCAAATAGGT TTGAAAAAAAATCGTATCACATACCTTTATTTTTTGAGTGTAGTATAAG CAAGCAATATTGATGAATGCGTGAGTCTGCATTTAACCCCAA AAAGTAAGCAACAATATATATTCAGCATCATGTTAGAAGTATTTTAA TCATGTTGAACTGAACGATCTCCGCGCTAATTAGTATTCCTAAGAGACA P IOPERj~\ 35061 clain SdOCI VDV2OW3 00 -328- CCAATCAGAAACTATTGGATAGTTCGACGGTTTAGAATTTGTCCAGTTG AGAATGGTTTTCAAACTATTTTATATTTTTTTAGCGATTTCTAAAG TTAAGTTGACCGGCACATCTTGTGGTTAATGTTTCACTCGTCGTTGA AAAGTCTTTTCAACAAAATCTTACTTTCTGGATATATTAATATCATATG 0 5 TACAAAAATTGATTAATGGGTCTTAAACTATTTCATGTATTTACTATTTA N GATAGAGACGTTTAAAAAAAAACTATTTTCGTGTCTTTACTATTTAGAT 00 0 ~AGAGATTACACGACATGGAAATAATAGTACATGGTCAAGTTTATATAC GGACGACTCTCATGAAATCCTACA GACAAGCACATATAG TATAATGTGAAATATACACTGTTAAGCAACATATTACGTATTATAGTTA TTTTTATGTTAATGACGTACAATGTACAAATTCTAGTATTCTTCACCTGA ATTATTTGATGCTAAACTACGTACGTCGTGGTTATTTTCATTGTTCTTTA ATTAGCCATCTCGAATATAATTATTTCATGTTACAGATTTTAGTCGC TCATGAGTAGATAACGCCACGCTTTT CTTCTAAAAAATATTATCTTGAAAATGATTTTATTAATTCGTTTTCGTC TTAGTCTAATTCAGCTATAAAGTATAACGTTATGACCAGTCCATAAT CAACTAATTTTCTACCATCAAAGAA GGGTCATGACTTTCTTATAAAACATTAACTGATTTGACCACATAA TTTTGTATTATCAATATTACACCATAAATACGGCCACATATCCTCCTAGT TTCTTCACACAACTCTCCCCTCAAAACATTCCATCAAAGG The isolated promoter AR2_syn3O2 comprising the sequence ACTTCCACCAGAAAAGGCGAAACCAGAGCTTTGATTGAATAGTCA AATAATTGCTTCTACTCTTCATTCTTCAACGTATGCACGTAACTTGTA TGATTGTGCATTTATCATTTTTTATCGGCAATTTTAGCCTTACACAAA GACATAATAAAGTATCATGGCCTTTTTTGTTGACATTGTCCTTCTCTTGT CAACAATCTTCCTGGTTTTAAGATACATATGGATGGTTCAGATGTCAT ATAGTATAATCTATTACTCTACACTTTGATTGATGACATTCTTATTCCGA TTTACAGCTTTAAGAATACGTTTAAA AAAATAAATAGTGGATTTAAAGTGATTTGAGTGACATACATTAGGTGA AATTTGAAGGAATTTCTTAGTTAAAAAATCAAGATGCAAATCTTATAG TTTTAGGTGAGATTTTAGAGAATGTTAATAGCATTTCCTAAAGTTCAC P OPER~.3063iO61 cIi.,do..1"2/2003 00 -329- TAAAACCATCTCAAAACTCATCAAAACTAAAATCACTCTCAATTCATCC TCGCCTAACGGACTTTTTAATAAT AAATTCATACATTTGATGGAATTTGAGTAGCCGCCGGGTAATGGACCCA C ACCAAAGGCTCACATAGTCACATGGTACCAAGATTTATAGTGATATTAT 5 GCGACATCTCTCTACCACATAGTCACATGGCACCAAGATTTATAGTGAC 0 00 CTAGCCACTGATCGACTATATTAGAGTAGTTAATTTTATCAATTACATTT GAAATGTTTATCTCTAGTAAGATAAATATCCAACACACTTAATC CAGATGCTAACTTTAGGCATTAAAG AAAATAAACGATTCAAGCTTAAGTTAGTTTTAGGAAAAAGACATACTGT TTGCTCCATATAGTTTGTACATGTATTAAATATAGATCAAAATATT TTAATGTTTGCTCCATTGACATTACATGTATTAAACAGTTTAATAGAA AACGAACATTTTTGTTTGTTCAATCATTGGGAAATCATAGATTGTTCA AAATATGAAACAAAAGTGAGAAATATCAATTAATATAATAGTTCTGTTT AACAAGAAAATTGAATTTAGACCAAGTCCACAATATTCATCTTGAGTAA GAACACGACCAAAAGTCAAACTCGTTTCGAAATACATAAATATGTACC CCGCTATACAAAAAAGAAAAAGACATnACATCCACTTATCCCAATAG ACAAATGACCAAACTACCCAACATCTACCCCTATATATACCTCACCACC TTTGCCCTCTCAACCACAAACAATAA 61. The isolated promoter AR5_syn3O3 comprising the sequence CGGATGGTTGAGGTAGTATGAGTGACCGTGACGATCAACGTTCTCCA AGAAATCGATGTAGCGGTTTTCGTGGATATGGCGCTTTTGGATTCTTCTT CTGATCCTAGCCATTTAATCTGCATAAAAGTGAGTATGAGAGAGAAGAT TAAATAGATATCAATCCTAACTAATATTCAAGAAAACATAATATAGATC AATAAATTGATGAGAGTAAAAACACAAAGATGTTTAGATAATTATT GTCAAGACTCAAGTTTCTTCAAATATCAAGAGGCGCTTGGATAGAC CCTTATTCTACAATACATCAATCTATATAGAGATGACTAGCATA TTTTTAAAATAGAAAAAATATAAACGTAAATAACATTTTTTGAGGTA TACTAAATTTTCTAAACATGAAATGTTAAAATCCACAATATTTCCATA TAAATTTGTAAATAATATTTTGTTAGATAATGTTAAATTTTCTAACTGA p \pER MS3485061 Jdw., do.- 14102/2003 00 -330- AATATTAACAAATCCGTAGTATTTCCATTATTAAATCTCGATTTTGTTTC AATGGGAGATTTGAATTTTGAACCAAAAAA AA AA AA A AA GATTTCATC AAGATATCTAGGGGGATATTTTGCTGGAATATAGCTTTGATGAGAATAT TTATATTTTGTATCTCTGAAATCAAGTTTAAGGGGAATGATTATGG 0 5 GTTGAAATTTTGCAATCAAAAGCCCTTTTGCA1ACTACATAAGTT TTTTGTTTGGGCTGGCGCTATCGGATCCTTTTAGGCTTACATTTAACATC 00 TGGTCCACTTAGAAAGAGTCACGTAGTATATGGTATTGTCAACTTGAT TTTTCAAGTTAAAAGAAATATGTATCAAAATGACTAAAGTAGTGAA ATATTATGTATCTAATTTGTTTATTTACCATTGCTATATGT TCAACTGTACAATTGGCATGGAATAATAtGACATAATCATACATTAT TAAGCACTTTTGCCTACGAAGGGATACCAACTTCATTAGTTTACATTTTC TTTTGTGTTCAATTGTTAGCTCAACCCAATTAGTGGGGAAAGTAAGA AGCAACAACTCCTCTTCCCGGACCCCTAACATCACTAACTCAATA TCAAACCATTTTAAAAGAGCTCATCATTACTAGCTACTAATTATTCTTA ATCAATCACTGCTTAATACAAAGCACTATATATACACTTGTATCTTCCAT TAGTTTCCCACCACAACTACAAAACATTCCATACACACACACAAAGC ACACACTTTTTCTTTCTTTTAAACCCCA 62. The isolated promoter AR6_syn3O4 comprising the sequence TTCCCTCCAATGTCCTACTGTCTCCTTCTCTGTGTGTTACCATGGTTTTAC TTCACCATGTAAGTCTCTCTCATTATCAAAATTCATCTTCTCTGTTTTCTT CCTCCTCTGAATCAATCCTTTGTTTATTTCTTGTGTTGTGTGTGATGCAG TTAGAGCAAGGGATGCGTCCGATTTCACGATGTTACAATCCAACCGCGT ATTCGACAACAATGGGAAGAAGTTTCTTCGCAGGTGCAGCCACAGCA GCAAGCTATTCTCCAGAGGTTTCTCAGTCACAGCCATCTAAAC CGAATCTAAAGAAGTTGCTGCAAAGACATAGTGACGCAGAGAGAAG GAGACGGCTTCGGATTAATTCCCAGTTTGCAACTCTCCGCACCATTCTTC CAAACTTAGTCAAAGTAAGTTTAGCTCTGCATTCATTACACAAATGT TTCACCAGAGAAGTAACACTTTTTGTATTATGTTCATGAACTAAACA GCAAGATAAAGCATCTGTGCTTGGAGAGACTGTCAGGTACTTCATGA ATTGAAAAAGATGGTTCAAGACATACCAACCACACCATCTTTAGAGA PAOPER\s3O4B5O61lamdo.-14102flODS 00 -331- CAACTTGAGATTGGACCACTGTAATAACAACAGAGACTTGGCAAGAGT CGTGTTCAGTTGTAGCGACAGAGAAGGGCTAATGTCGGAGGTTGCAGA GTCAATGAAkAGCAGTGAAAGCAAAGGCGGTGAGAGCTGAGATCATGAC AGAGGAGACATTCTGTGTAGTTATG AATGAAGGATTGGTGAAGCTCAAGAAATCGTTGAACTTGTAGTGAT GGTAAATCATCATCAGAGGCGAAAACACAACATGGAGGATCGTTG 00 TTAATTCAGCAGCAATGAGTATTTTGTTTATATACTTGTACATCTCTGTT N ~TCTCCTAGTCCATTAGAGAAGGTAGATGTAAAGGTATAAAGCCCATGT GTTATTGAAATTGGGTGGATACTTACAGAGTCTATATGAATAATG ATGCAATTCTTTCTTTGGAGATGGTGTGGATGTTATCAATATGA TCATGTGAAATTTTTTGTCCCATCTTTGTTCTTACCCAATTGTACCTTTTG AGATGAAATCCCATGGTTGCTTCTAGTAGATAGCTTTCTTCTGGGAAAC AAAGATTTGGTTTAATAAGTTGAACCAACGATCTCTTCAAACATTC CCCACCTACTTCTCATCAAACCTCCTTATA.AATAGAGGATTCCAGCACA AGTCTCTTCATCACTCAAACCAACAAGAAGTAGTCAAAGCACAATACA GC 63. The isolated promoter AR8_syn3O5 comprising the sequence TTGCTTGTTTTCTGAATCTGTGCGTGTCTTTTTTGAAATCGACAGCGCAC TCCAATCAGGTTGCCCATGCTCCTTCCAGTCAGGTTGCGCAGATCAATT GTGGGCATTGTCGGACGACCCTCATGTATCCTTACGGTGCATCATCCGT CAATGCGCTGTTTGTCAATTCGTAACTAACGTTATGTGATTATTCCTA TCTATTAAGCCACCTCTGCATGGTTGAGTTAAGTATAGAGATCTTTCTGT TGGAAATTTTCATTTCTGATTCATTTTGCATCCTTAGATGAGCATGGA GGGTACCTCTCCCAACTAACCGGCCAAATGGACAGCTTGTCCCCCCTC TACATCAACTGTGAGTTATCAAATTATGAATTTGTAATAGTTCTGTATAT TCTTATGGAACTGGTACTTACTCTGTTCATCGATTTTTCATTTTACCAAC AGTCAACACCACCCTCTCAGACCCAAACCGTTGTTGTAGAAACCCCAT GTCCGTTGATGAAAGCGGAAAGTTGGTGAGTATTTCTATCACCTGTGTT CTTCTTCTTATTTACCACATTAGAGGAAGATATGACAGTGACTGAA CACACAAATTGCAGGTGAGCAATGTTGTTGTTGGAGTGACACTGACA P OPERji.$\3048SO61 Ii, dow.l4JO2flOO 00 CK1 AAAAGTAATCAAGAATGAGTGAGATCTTGAAGATCAAATCCAAATTCT TCCTCTATTCCTGCGTTTGGTTTGTGCATATTACATACGCGGACTG TATGTTATATATCTCTTGACTCCTTTTTAACCCGAGAGCTTATC AGAATCTCTTGTTACTGCATTATTGGGGTTTATTCAAGTTGAAGACAC 0 5 AAGGTTTTTGCTCGAATAATTTGGCATTCTTTTGCTCCATGGAACTTGAC CTTCTCTTCTGTTTGTTGACTTCTAACTCCATCGGCCCTTGTGGCATT 00 GTATTTTTATTACGTCCACATATAA TTGGGTTTGAAATCTGTCTCTTCCGTGGATGAGATATGCTACATGTCAC AGAACTGGTCTTAGCTTTGGTAGATAAGACTTGTCTTAGAGCAAGTCT TGAAATCTGGAAATCTATTTTGCAGTATCTTGTCACACACCATAAC CTAATCAGTCAGTACCCTCCAGCATTAAGTTAGATGATCCGACA AACCTCTCAACAAGACCAAACTCTTTCCATATAAATACTCTTTAACACT GACACAAAGTTTCATCACTTTCTCTTGATCACTCACTGCATCA 64. The isolated promoter ATU56929.5yn007 comprising the sequence TCTCCCAAATAAAAATGAGAGCAAACACTTCTATATTAATTGAAT TAAAAACTTTTAAATAGTGGAAATATATACCCTATTGGAATAA AACCCAAATATAATATTACAAACTAATTTTATAAAATCTCTTTT AAATGGTGAAAATATATACCCTAAATTGGTAGGACCCAAATAT AATACCATAAACTTATATTAAAATGATCAATATTTCTTTTAAATAGTT GAATATATACCCTATATTGGAAATAGAACTCATATATATTTAA ATTTATTTCTAATTTATTTTGGTTGAATAGATTTTATATAAACTTGTGGT ATTATTATTGTCCATAAAACTTGTTTTAGTGTTACTTTTAGATTTTC AAATAATCATTTGAGTGCTAATTATGTGTACCTTTTATGCTATT TTTGTCCAAAAAACTTAAAAATGTGCTATTTGTGGGAATTTTTCAATAA GATATAAATTTAAAACTGAGTTGATTTTAGTGTCACACAAAA AAAGTTTAATGTGAACAACAACATTAATTCTTTTTTAAAATTTTGTTT TATACTATTATTCTATTAACATGTTTTAATAT CTAGAAAA TCAATCTACTAAAACTAGGTTTTTTAGCATTTTATAAATATTTGTATGAG AACTTTCTCTAATTCAGTTCATCCAGTTAACCATTGTTCGCTTATTCTGC AATTCATTTATTTATTGATATACCAGTTAACCTTATGTTGTGTAATC P OPERqms\3O495061 daiiodc-i4/2/2008 00 -333- AGTCGTAAAATTGTTTTGTGTAATGTTACATAAATTAATAGAATCAAAT TTAAAATGTGTTCTAATTATGCTATGACGTTATAAACAAACGATAAATT CCGATTCATGATTATGAAGTATTTCAATTGAAAACACAAAAATCGACAA AATTTTAAAAATATTTTAGATCTTACATTACATACCTGTATTGTCGCAAA GGAAAATTTATTTCTTGTCCTAAAAGGCCATTTGGAACTTGAGCTAATG N TAAATATATAAATGGGCTTATTGGGTCCTCTAATGGGCTTGCCTTTGAC 00 GTAGAAGACAGAAGCATCGTTGTGACTCCCGTTTGTGATTTAGGAATCC NI GCACTGCTTGCCGTTTTCCGTTTCTACTTTACTTTTCAATTCAGAAACGC CTCTCTCGTCGTCTTCAAAGCTAAATTAGAAACCTGACGATCTCTCTCTC TCTCTCTCTCTCGATCGGATAATATTTGAGCTTTGTGGTTGGAGGATCTG AGTTAGTC The isolated promoter PRISynOl 8 comprising the sequence GGTTATTGTTGTGTTATGATTTTGGGGTTCGTAAACATCGCTTATATAGA GATTTGAAAACTATTTTTTTCTTTTTTTTTTTGTTAACTATAGATCTCACG TTTTTGTAAATACATGGTCCATGTGTGAGTATTTTAGTAATATTCATTGC AATTGTCCAAATGAATAGAAGTTGTTTTCGTAACTATTTTTTTGTCAATC TTGTCCTTACACACATTTTTCCTAATATTGTTTCGTATCGGTAGCTTTGC CATTGTTGATATATTTTTTTAGTATATATGTAAGTATACCCTAAATGAAG TTTATTAAGAAACATTGTATATAGTTGTTTCATGTCATTCAGTTGTTTTG TGTTTTTTTTTCTTCATGATTCTAATTTAAGTCTTCTATTTCAAATTTGAA TTTCATATATTACTTCATTCAAAATGTTGTGAAGATATCTTCCTGTAAAT AATACAGAAAAATCGTATCGGACAGTTTGGCAATTAAGATTATATTTAC AGTCAGAAAAAATAAAAGTTTATATCTACAGTCAATTTTCAAATAAAAG AAAAAAAGTCAAGAATTATTTGTTTCTTAGTGTTTCATGCATATGAGTA TCTCTATCACTCTTGCCTATGGCTGAAAAGTCCTGAAGAATATATGCCG CCACATCTATGACGTAAGTAAAATAGTGACGTAGAGAAACAGTCAATA GATCACCCATTGAGATTTATCCAAAAAGAAAAAAAAAAAAAAAAAAAA AAAAAAAGATCACCGATTGACATTGTATACACTTTGTTTTTTTTTTCCAA ACACTAATACGCAGTTTAAATTGAAAAACTCTAGGTGACCGATCTACTT TTGTGTTCTTCTATCTTCAGTATACCTAATTTTGTACCGCCTTCGTATATC P \OPER MS\304S36W d.imdm.I4/2200 00 -334- ATTTACCAATTTTGACTACTGATATGCACTGGCTTTAAAATTTTCCAATC CTGATATGAATCTGTGATTCTAAGCAATAACATATACTCCCTCCGATC AGAAAAATTGATTTTTTAAAGTTTTTTTGTATTAAAAAGATTGAGTTTAT GTTTTTTATATTAAGTAGATCAATA 5 TTAATTGAGAATTTTAAAATTTGATGAATTACTATTGGTTAATAGTTACG AGAAATAGTTTAGCATGAATAAATAGTAATTTATAACTAAGCATTATTA 00 TTTTTTTAATCGGTATAAACATTCTATAAAATCAAACTTTTTTATATGGA GGGAGAATCATTTTATAAG 66. The isolated promoter UBQ3_SynO 16 comprising the sequence GGTACCGGATTTGGAGCCAAGTCTCATAAACGCCATTGTGGAAGAAAG TCTTGAGTTGGTGGTAATGTAACAGAGTAGTAAGAACAGAGAAGAGAG AGAGTGTGAGATACATGAATTGTCGGGCAACAAAAATCCTGAAGATCT TATTTTAGCAAAGAGAAAGAGTTCCGAGTCTGTAGCAGAAGAGTGAGG AGAAATTTAAGCTCTTGGACTTGTGAATTGTTCCGCCTCTTGAATACTTC TTCAATCCTCATATATTCTTCTTCTATGTTACCTGAAAACCGGCATTTA TCTCGCGGGTTTATTCCGGTTCAACATTTTTTTTGTTTTGAGTTATTATCT GGGCTTAATAACGCAGGCCTGAAATAAATTCAAGGCCCAACTGTTTTTT TTTTTAAGAAGTTGCTGTTAAAAAAAAAAAAAGGGATTACAACAAC AACAAAAAAAGATAAAGAAAATAATAACAATTACTTTAATTGTAGACT AAAAAAACATAGATTTTATCATGAAAAAAAGAGAAAAGAAATAAAAA CTTGGATCAAAAAAAAAACATACAGATCTTCTAATTATTACTTTTCTT AAAAATTAGGTCCTTTTTCCCAACAATTAGGTTTAGAGTTTTGGAATTA AACCAAAAAGATTGTTCTAAAAAATACTCAAATTTGGTAGATAAGTTTC CTTATTTTAATTAGTCAATGGTAGATACTTTTTTTTCTTTTCTTTATTAGA GTGTAATTTAGCATTGTATAACAAG TAAACTATCATAATCAACATGAAATTAAAAGAAAAATCTCATATATAGT ATTAGTATTCTCTATATATATTATGATTGCTTATTCTTAATGGGTTGGGT TAACCAAGACATAGTCTTAATGGAAAGAATCTTTTTTGAACTTTTTCCTT ATTGATTAAATTCTTCTATAGAAAAGAAAGAAATTATTTGAGGAAAGT ATATACAAAAAGAAAAATAGAAAAATGTCAGTGAAGCAGATGTATGG p.NopERjmsUO45O61 W. 141O2f28 00 -335 ATGACCTAATCCAACCACCACCATAGGATGTTTCTACTTGAGTCGGTCT TTTAAAAACGCACGGTGGAAAATATGACACGTATCATATGATTCCTTCC TTTAGTTTCGTGATAATAATCCTCAACTGATATCTTCCTTTTTnTGTTTTG GCTAAAGATATTTTATTCTCATTATAGAAAGACGGTTTTGGGCTTTT GGTTTGCGATATAAAGAAGACCTTCGTGTGGAGATAATAATTCATCCT N ~TTCGTCTTTTTCTGACTCTTCAATCTCTCCCAAGCCTAAAGCGATCTCT 00 GCAAATCTCT 12689250 Sequence Listing.txt SEQUENCE LI STI NG <110> Syngenta Participations AG <120> Prorroters for regulation of plant gene expression <130> S-50015A/16/78/NAD <140> <141> <150> US 60/213848 <151> 2000-06-23 <150> US 60/214087 <151> 2000-06-23 <150> US 60/258692 <151> 2000-12-29 <160> 875 <170> Patentln Ver. 2.1 <210> 1 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 1 gagagt aat t gagat caccc ccaaaaaaag aaaagt at ca gt act gt gaa t t gct ccaca ct t agct t t g ct t ggcgaag ggt gt t t gaa gacggt t t aa accacttttt tgt t t t gtaa at cgacaact ctt aggaacg t at agt cat t at t t t act cc ct at gact t a aaat at t at c t cgt t agct a gaat t t aggg t at t t at acc aaaat t t agg gat t aaaaaa gat t agt gga aaacaaaat c aagact ct t t gagcagcaca t cggt t t ct c aat aagct t t gacgt acat g gaaacggaac tcaaagagag agagct t aat gagt t ct t t a at t acaaat t ccgat at ct g gctt aacaaa ct t agct t cg t t ct t gagag gt cgggt cct t t t gaat gt t caat t ggaaa t acagat t at t aat t gagt a gat cat ct t t cat t gt gacg at gaaccgt a t t gt gggt aa taaaaaggag ggcaaaaaag at t t gtgttc caat t gat ag ct aat t ggaa aggaaacat g cgt t at t gt t aaat at t aaa t agact ct t t t caaagt t aa cgt caaaagt t t t t t t cat c t aagaaaact tt gct aaaaa at t ggt ct t c acaccacaaa gaaccat caa t ccggt t tag gaaggtcgt t ttacgacaaa t ct t ggacga t t cgat gggt t aagaaat aa t ccaaat caa t t t at gt aag at t t cct t aa agat t ct cgt aat ct at t t t Page 1 caaccctttt aat t caggt t aaacat t gt a ct t t act t ct caaagat aaa t t ct t ct cat at at cggaat aaaat t ct ac at t gagat t t t ctt caagcc agt aat ggt a agt aacggt g t ct t gt t cgt t gagagggt t aaaaaaaaaa t t t gtgt t gt t ct at at t t a ct t ct t ct ca ttaaaaaaac tgt t t gt t gg t gaat t cat a aaat t at gaa t gt t agt caa t agacaat aa t caagt cgt g cggt acat gt t t ct t cact c t ccggt t t t a ggagcgagga accaaat cct t act aacat t ct at t gct t t aaaggt t at t aaaagaagga t t ct t t t t t t ttaat t t t ca t t t caagt ct aaacaat gt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 12689250 Sequence Listing.txt gagagaaata tgaaatatgt ataatgtatt ccttgtttgt taaaaaaaac t aat cat ct a t t aacat gt t t t ccgccat g t aaaccat aa agt aaat at t t cgct t gt at cact cat cga at t t at t gat agt t ct t t t a at acat t aaa agaagt gacc aaaactt ct t cacaaact cc ct caaacccc t ct gacgcaa at t ct t gtta caat at cact t t aat t aaat t ccaat t aaa t t t t t aaaat agt t caaaca at act aat t t t t t gt gaaaa ct aggt t agc aat ct t act t caaacct act t t cat aat ct t aat ccccaa aaat t agaaa aaaaaaaaat at t t t aacat ctat t t gttg ct gat at t at taaaccaaaa aaggccaaac t ct aaat t t g t t t at t t t at t gaaat ct cc gt ggt t gt gg t ct cat caaa ct ccat cat c aat g t at gt at t ct taaaaaggag at at at act g gtt gaccaaa ttgct t t t ca acaaaccgaa caaaaggcaa t t t at caaat t t t at aat at at gt gt cat g aacaagt aac cct caaaacc act cacaat c t t at t aat t t at at gt gt t g tcgcaacaaa caaaat at t a cct at t ct ca ttgttttcgc aaccat cgga t at t t t t ct c t t aaat aaaa gt t gt t t ct a t ct aaat aag ct aact cct c aaacct ct t t acacact ct t caaaaaaaaa ct t ccggacc t aaaagct ca gt t gt aaaaa t cagat at t g t at acat agt t t at aaat t t act t t t t t gt t at aat acat gt t agaact t ttgcaacaac t at aaat aca ccagt gacaa 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 2 2010 DNA Arabidopsis thal i ana <400> 2 agt t aaat ct t gat gcat gt aaat ct act c cgct t cat t g at t t t gt t t t at gcat ggca t t t t t t ct t t ct t ct cct cc ct cgt t gaaa t t aat t t ct a tt aat ggaca aaat ggct t a t gt t at aat t at aat act ac t gagt t cat g aat at t t t t t t agt t ct t ac gt gt gaact t t t t gtgt t gt gccagaaaaa t t gggat caa t cat cagaga aaat t gaat g t at aaat gcg caaaat gt gt t aagaat t gt gct t agt cat t at aaat at a aat t t agaat t t t t ggt caa t t at at t t ca tgcccaaaaa t t at gagt aa acct t aat gg t at t ggt aca t at aagat t a tttttgcaga aaaagat t at at t t t aat at t t cgt caaca t t gat cat t t ct gt at ggaa gct t ct ct ct t t ct t aacaa ccacgt ct ag caat at gaaa gt caat act t aaaaaat t ac at gt cct t ag at at acgcgg cat caat gga gct act acat at t ct at gt a agat gact ag aaacaat t t g t ct t t gcttt t aaat t t t ct agt t t t t t t a aat ct at aga aaaat at caa t t t at at t t t gt caaaccag t t cat at t t a gt aat at t ca Page 2 act act cat g cgaaaacaaa t gaagaat ag t t t t t t ctaa gt gat t acga acgcact gat t aact t gt aa cgt acggt aa aat acct t ac agat t t t t gt ct aacat gcg t at t ct t t t c ct t ct t t at t ccgct caaat aacact t at t acat ct ct ac t t acgat cga gact gccaca t ggct cgaac agt aaaat ca caccacacgc cct t ct acct t agat t t gca aacaacat ct gtct t t t at t tgcat t t t t t ggt aacat at agcat at cag tgcggcccca gt at cct t ca at aat t acgt ct t gt agagt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 12689250 Sequence Listing.txt tccaaaacat tgtcacaaaa tatttataaa gaatttattt taactaatta ggtcgttaat tgtccaaggg t cat gggat c aaact cgaat tttgagagac aaat t gt gt t t t at at t t ct gt aaaaaat t at aaat cat t at gagcct aa aat at t aat a aat agt t t ca at t t t gat t t at gat aact t aat gt aaat t aacccacatt aaaccaaaag t caacgt t ca <210> 3 <211> 200; <212> DNA <213> Aral <400> 3 gt t aaat ct t gat gcat gt g aat ct act ct gct t cat t gg t t t t gt t t t t t gcat ggcat t t t t t ct t t a t t ct cct cct t cgt t gaaac t aat t t ct at t aat ggacag aat ggct t at tttttcatag gt aaat t act t t aat gcat c acagaaat ga t at aacaact at at t cat ag ct t t ggaaca t accgt t t at gat t t t gaat aat t gaacga aaagacaaaa t t gaat ccag t t at aat acg ct agaat at a t gact t t t ct gcct caacga at ct ct gaaa t t gat at agt act t cgagt g t ggagt gat a t t t t aat gga gcagat t caa aggcagt at a aggct gaaaa t aaagaat t a t t gt at t ggt t agagt t cac aaaaacaaaa t ct aat t t ga ttgacacacg t t gcgat cac t ggt caaat a cat t cat aaa at gaaat at g t ct gt t caaa t t gt aaaaaa ct at aaaagt at t aat at at t gct gat t t t ggt gt t gacc t t t act ccca t cgagat t t t aaaagaaat t t t act acat a t cggt t aact aaagt at at t caat t gt at a cacact aat c t t t t ggcat c ct cagt cat t t at agccat c aaat gaaact agggatgct c t t t cagt t t t at gagt ct ca caacat cgaa aat t t agct a agt ccaaacc gaacgaaaat gt caact agt act t ccgt ga caaaat ct t t t acaat at t c aaaat ct t t g at gcat gat c at at t t at t t ct t aat cgat t ct acat t ac t caggt cgca tcacaaaaaa cct at agaat agaacactt c t ccgat gaag aaaagagat t ttcagaaaaa gcct agct at cat aat t ct c aaat ccat t a t t gaat t t t a ggacaact t g t t ct ct at aa t t gt t gt at t 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2010 dopsis thalana bidopsis thaliana agt t ct t act t gt gaact t a ttgtgt t gt t ccagaaaaat t gggat caat cat cagagaa aat t gaat ga at aaat gcgt aaaat gt gt t aagaat t gt c ct t agt cat g at aaat at at t at gagt aaa cct t aat gga at t ggt acac at aagat t ag ttttgcagaa aaagat t at a t t t t aat at a t cgt caacat t gat cat t t t tgtatggaaa ct t ct ct ct a t ctt aacaaa t gt cct t aga t at acgcggc at caat ggat ct act acat t t t ct at gt ag gat gact aga aacaat t t gt ct t t gct t t c aaat t t t ct a gt t t t t t t aa at ct at agac aaat at caat Page 3 ct act cat gt gaaaacaaag gaagaat agt tttttctaaa t gat t acgac cgcact gat c aact t gt aat gt acggt aaa at acct t acg gat t t t t gt t t aacat gcgg at t ct t t t ca t acgat cgat act gccacaa ggct cgaacc gt aaaat caa accacacgca ct t ct acct t agat t t gcac acaacat ct c t ct t t t at t t gcattttttt gt aacat at a gcat at cagt 120 180 240 300 360 420 480 540 600 660 720 12689250 Sequence Listing.txt gttataatta atttagaatc cacgtctagt ttatattttc ttctttattt t aat act act gagt t cat gt at at t t t t t t ccaaaacatt gt ccaagggt cat gggat cg aact cgaat t ttgagagaca aat t gt gt t t t at at t t ct a t aaaaaat t c t aaat cat t t t gagcct aag at at t aat aa at agt t t caa t t t t gat t t t t gat aact t t at gt aaat t c acccacattt aaccaaaagg caacgt t caa tttggtcaac aatatgaaag tatatttcag tcaatacttt gcccaaaaaa aaaaattacg gtcacaaaat atttataaag ttttcatagt tgatatagtt taaattacta cttcgagtgt taatgcatct ggagtgatac cagaaatgat tttaatggaa ataacaactg cagat t caat tattcataga ggcagtatag tttggaacaa ggctgaaaat accgtttatt aaagaattat attttgaatt tgtattggta at t gaacgat agagt t cact aagacaaaaa aaaacaaaat tgaatccagt ctaatttgaa tataatacgt tgacacacgc tagaatatat tgcgatcacc gacttttctt ggtcaaatat cctcaacgac at t cataaac t ct ct gaaaa t g tcaaaccagc t cat at t t aa t aat at t caa aat t t at t t t ct gt t caaat t gt aaaaaaa t at aaaagt a t t aat at at t gct gat t t t a gt gt t gaccc t t act cccaa cgagat t t t a aaagaaatt g t act acat ag cggt t aact a aagt at at t c aat t gt at at acact aat ca t t t ggcat ca t cagt cat t a cgct caaat g acact t at t a cat ct ct acc aact aat t ag at agccat cc aat gaaact t gggat gct ct ttcagt t t t t t gagt ct cac aacat cgaaa at t t agct at gt ccaaacca aacgaaaatt t caact agt g ct t ccgt gac aaaat ct t t a acaat at t ct aaat ct t t gg t gcat gat ct t at t t at t t t gcggccccaa t at cct t cat t aat t acgt a t t gt agagt t gt cgt t aat t t t aat cgat t ct acat t aca caggt cgcat cacaaaaaaa ct at agaat t gaacact t cg ccgat gaaga aaagagat t a tcagaaaaaa cct agct at a at aat t ct ca aat ccat t aa t gaat t t t aa gacaact t ga t ct ct at aaa t gt t gt at t t 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> 4 <211> 2002 <212> DNA <213> Ar a <400> 4 ccaat t ggt t t t aagat t t t ct t t t t at aa t t t aaaaaat t t agaagat t t gacat t gt a gaat aat at t ct aat t t t aa acaat t t aaa bidopsis thal i ana aat at at gt t t t t aaact ct gt t t aaat at t aat at gt t t t aaat ct t t a aat aagt t ca t at t t aacaa gggat t gat t t aaaaagt t t cat t t t t aaa at at at t at a t aat aat gt t gt t t at aaag at t aaat at t aat acaaaat aaaact t t gt t gaagt t t gt aaagt aaaat at t t t gaat c t aat gat gt t t t at aaat t t t aat gacat a t t cat t aat a t t at t t at t t t t t aaaat at t aggat t aaa t aat t at t t t Page 4 at aaaat at g aaaaat t at g t aaat at at a at aact ct aa aat aat t aat aaacaaaaaa t gt t aacaaa t t t gat t gat ggaaaaaaaa at ggaaaaaa at at at caaa aat aat aat a aagcaaggat gt cagt gaca cgt t t agt aa t at gt t t at g aat gt gat t g at t aat gct a 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt atgacattag atcgtaaata agttcaggat ttatttgcta aaatagctac aaaaatgtat at at agat t t at gt t ct at c gaaacaaaaa agct acgt ac ggt cat aagg cgat t gt gac gat caat t t a at t ccat cct t t acat agt c t t t t gat at t atct t t t gt t caaat acaaa at acaaaaag caaaaacaaa gt acaacaat t ggt gcgt ag t at agat cca cat agt ct ct gt gt aaact a ttttcacaac gt agt aaaga at gagt t aga ccaaaaat ga at aaaagaag caaaat at ac caat acat t c at at agat gt t gaat aat t c at cgat ggat agcaaact ct caaaat aat a gt aat cacat acat agaaca ttcct t t ct g ct gact at ac at gat gaaaa t caat t acga t t accgat t t t at t acacat t aaat gt t at accagt agcc acgat caggt t ct at at t ct cat cagt t t g t agt t t at ac gct agaaccg tccaaaaaaa tcaacaacga t t t t t t aat c gaacacgt ga ccaacttttt aaagt t ccaa t ccaacgat t at at t gat at aaagaaaat g aat t t t gggc t cct aaaat a aaaaagatt a at t aat t t at at aaaaggga t act t agaaa gct gact at a act t gt gt gg aat aact t ga t ct ct t at t c aat ccccct c gaaaat agat act t t t gat c caacaaaccc gat t agt at t t gaaaat aac tg act at t t t t t t t gt t cgt t a ttgttcgtta t t t t cgggt g act agt at at cgat acaagt t t ct at t t t t at cgat caaa ccaat cct at gggt t aaaat gt at at ggca tttaaaaaaa t accat aat c aagaact at a t gagcaat t a accaacat ga aat t aaat aa acacct aat c t ct ct at ct a caat caat t g gt t t aaggct at t t ggt gt t aat gt t act t aaat aaacca cgt ggagt ag at t t t ct t ga ggaat at aga at t aat acaa at t aat gaaa t aat t agcca gagcat t cca t t t t caaat g gt act t t t at aat t at ct t t aaact t aaca t aaaat ct ga tat t t t t ggg t t acat act t t t aat t t t at t aact aagct t t cat ct t ca t ct t t cat at at ct t t t cct cgagagaat t gaccagcttt gt acct gt t c t t t atggtta ct gt ct acga t t ct gt aaga cgt ccagct t t t gacgat aa t aagat aaca t gcgt t gcac t t gaaat t t a t aacat gct a ct at t t gcaa t agat aaaag gat at at t t g caaagaaaat t agggaagt t t t at t t at t t at t ggccat a cgact cgat a t ct t cgcat t at gt gt gt at at ct caat t g at cat t cact caacat aagt at agt act aa t cacagat ag 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> acattttcat cttgatctat ttctacagtt tgtgattaac catccagacg aacatctagt ttctgtagaa ggttggtata gccctgaagg tatcattcaa ggaat t aagt t cat ct ccaa caagaagact tctgatgtca ttggatccga tgagggtact cact t cactc tacaagt t aa agacaagaag atcattggct ttcatgggtc tgccgggggc aatcttaatt ctcttggggc ttactttgct ccgttgacta ctacaactcc gttgactcct gccaagcagc taacggcatt Page 120 180 240 300 12689250 Sequence Listing.txt tggtagtgat gacggaactg tatgggatga tggtgcttac gttggggtta cgtt ggacaa ggaggtcaca ct t t ct t t gc cggt t t ccat t t gact at cc gt gat ggct c t t ggact t ga t ccat ggaag act gat gat g t ct aaat t t c t t t t acgcgt t aaaggaagt t t gccat t at gt at gaat t g tcgtat t t t c t gaagt t t t t aaaaaat acg at act ct t gg t gt acaat gc cct aaaaagg t t acaagt aa agct t gct gc t ggaagt aaa at at gcct cg t at at gcgga t cgat cacaa t t ggt t aat a gat t gat cag gcccaagat g ggagaagaac t ct t t t t gt t cat aat gt at aagt gaat ac agt cat aact agct ggcaca agccgatgt t gagct ct t t a t cat at gagg t gt ct ct t gc t t gt t ccaat t t ggt t ct ga t acaat t t aa t cagt aaaaa ggcct t ct cc aaccggtt ct t t ct aat t t c t gt ggaaaca gggt cact aa t aacgt at at cat cgt t t ca aaaggagt at t ggat cacga t at at ggt t a t ct act cact at t gaaaat t caagcaaaat gaat ct cagc atggaaagag gaat gt aaaa t caat t gt t t at caccgcag at gct caggt gt ct t cgaac ct gct ccaca t cct t gat t c t ct ct t cat g gat gt t gt t t ct gat ct t t a t act aat t t t aat gcaaat g aat cct gcac aaacat aaaa t gt t t ct aga t ct t aggt ga t gt t ct t aca t t gt agaat t aat aaaat t g ttggccaaca at gaat at t t aat aat agat t at gcaagaa t t t gaaggt c cacaat t t gt at g t gt t aagt t t t act ct act c at t at acat c tat t t ggttt t cgat ggcac t caagact aa tcaaagagga aaat t ggagt cat gcat ct a cct ct ggt ga t t t ct t t gt t gt t t ccct at act t gct at a t t aact t t t t aaagt t t t t a agt t at t agt t t gagcgt t a t t t caat t at agact t at ca ct gact t at a acgact ct gt ccaat at aaa gaacat gt at t at t at t t ct gt caat at aa at ct ct ct t t ttttttttcc gt gt acgaca ggat t cgaag agagaat t ca ccaat aact g t t at gat aaa taagcaaacg aggccacaag ccat gt t cgt at aagt t at c at ggt gat ct ct t ct t gt gc at gt t ct at a at t caaact a t ggat caaat t ct agaat gt t aaaagt at a cagaaaatt g act t t t act g aat cgt caac t aaat t gcaa gaat t t t aaa gacgat t agt t t ct t ccggt gct agt acga acacct at ga t cct ct ct at aat t at agt a agaaggt gt a aaagccct ga aggt at aat a aat t ct ct t c cagt t cgt ac at ct t t ggga t ct cct ccct at cgt t ggt t cct t t at cca aaat ct at ct t t gat aagt g t ctat t t t aa at act t act t aaat ct caac at at agt t gt gt gaccat at t gagt t ggcc t at t at at at t t t gaaat aa caat t agagt ct t gcaccct agaact aat t t gagaagct g aat gt t t t aa t caat gat cc t t cact t aaa t agt acaaaa act at agt aa 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 6 <211> 2060 <212> DNA <213> Arabidopsis thaliana <400> 6 ggaaggttaa ggtaggctaa ccgagcccac tctctaagtg tttaatttgt aaccttaatt cgtaaataaa agctgtttaa tttgtaacct ttattagata aaattttgtc tttttttttg Page 6 120 12689250 Sequence Listing.txt t caaaagat a at t t at at at acgaat ggaa aaaacgt aaa tgtgcgaaga aagact ggaa t t acgt at t t t gat t t t cat t gat gact ga t t aat t at aa at t act gaat cat caacaag aat at agt t a gat t gt t t ca t gt t t t t gt t t acgt accag t cgagct aga aat at t t agt ttgt t t t aga aaaaaagcag t t gt gat at t at t t t t t t t a t t aat ct t cg acat gt t cat t at at at gt t t t gat aaacg gccaacgcca t t t cat t at c t at aaat at g agt acgat ca agcaccgat a aaaat t ggt t ct at caagaa aaat gt t gt c t t gct t gt ca gcaat aact c t t t ggtattt cgat cgagaa aggt at at ct gt t t t aat t t at ct caat aa t gaagt gat g gt t t gt accc t t at t t t t t a aaat accaaa t agat gat ca cat t cgcaac aaat aggat g ct t t cat t cg aagctttttt accagt gt aa aat at t ct t c t t t t gaattc t t ggagt at t cggaat at t g gt gat t t gaa acaat aat t a gcaact t gca at t ct t t aaa at at aaagac taaaaaaaaa cct cgt ggat at at caagaa acaat ct act aat agt agat gcagaagat g aat cat t t t a t aaat t t t cc gacaaaaaga t t t t cccat t cgagaagaac t cat caaaat t t caaaagt c ct att gt t t a ggaacat gat act t cat t t t at aat aagt a aaccact agc aat agt t at t agat aat t at t t t t ct t gat at cgt gcgac gat aat t t at at gat acaat gcaaaatttt agaggaatt c ct t t at ggct t at gat at at t t aaacgt t a ccaaat ct t a act t t agccc ttttaaaaca gat t agt t ga aat gt at at g cacgaaagaa gt caat at aa ct ct t t aaaa at t aacaat t aaaacaaaga at t t t gt agt gt caaact aa aaaaaaaat a gat aagct ca at t cat gat t cact gt t ggc ct ct aagt t a t gcaat at t a aaat agat aa aat at gcagg t t acat at t t at t aat t act at aaat aaat t gat gat ct a t t ct t ggaat gaat cat at t t at aaat ct t at aat at t t c gt aaat t t cc agat ccat at ct agt ct cct ctgt t t t t aa accaat t aga t t t acaaat a at gaat t agc gaagt t gt gc aacat gcgt t t aaat t at t a at aaacat ac ggt ct t at ct t att t t t gca aaaaaacttt at t t t gaaaa agt acct t ca t at at at at t at acgcacga at ggt t gagt t ggct gt t gc aat t at t ggg aaat ct ct t t tgccaagacg t t t caact ag t gat cat at a t agct agt gt acat act aca gat caact t t at cccat ggt aagcgt ct ct tttaccaacg at aaat cagt ct aagagt t a gat ccggt t c ct t t ggt t ct at t at gat t a ct ct t aat aa at t aat cgt a t t gct gccat aagt agaagt t t t cccagt a t t t ccct t at aat t cact t t ct at t ct t ct act acagaaa tttggcaaga aaat at t t t a cat gcaaaac ct t ggcct t g acgt agat cg ttaat t t t gt t at ct t ggaa gt gct at t t a t t t at t t at a t aggt at at t aaaggt aaca agt at t gaat aat t t ct at c t gt t aacct t t aat aat t t a acat t t gcag caat t agat t aat acagaat t at aaaaaaa t t aaaact ag t t gt at ct ag aaact t ct aa aggct t t aaa ggagaacgt a cat aat ct aa cgt t t aat gg at at gaat at at gct t t t t a t at t t cat ct aat t gat cat ct at t at gt a t t cgt aagt a 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2060 Page 7 12689250 Sequence Listing.txt <210> 7 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 7 at at gaagag gt t t at agt c accacttttt at ct ggt t ca at aaaaaaca acat aagct c gat t t gat ct caaacacat c act t ct t aca gct t gagt t g aat t agcgct cgt t t t agt c aact aaact c ct t gaaaaga at t t gaaat c t t t t gagt gt caaaaaaaaa gt t gaact ga t ggat agt t c aat t t t t t t a act cgt cgt t at gt acaaaa acgt t t aaaa aaat aat agt aaacaaagca at agt t at t t at t t gat gct aaat at aat t aaaccgaccc t t aaat t cgt cat aat caaa t t t t at gat g t gt agt t agt aaaat acct t aat t at aaaa at accaat t a act caagat c gt gt at cgt g ttgaaaacaa t ct t gaaaaa act t t gaaat t aagt aaaac aacacaacat t t t t t acgt g t t gt t at at a aat t gt at ca agt at aagca gt aagcaaca acgat ct ccg gacggt t tag gcgaat t t ct gaaaaagt ct at t gat t aat aaaaact at t acat ggt caa acat at agt a t t at gt t aat aaact acgt a at t t caat gt aat ccgact t t t t cgtctta t cat cat agt at gcgaaagc t aagt t t gac t aaagt t act t t aaacat t t gcacat gaaa aaat ccgat t gaat cat cct ccacat ggct acaaagagt t t t gaat ggat ctct t t attt t t cat t aat g agaaaact t a t at aacagct aat aggt t t g agcaat at t g at at at at t c cgct aat t ag aat t t gt cca aaagt t aagt tttcaacaaa gggt ct t aaa t t cgt gt ct t gt t t at at ac t aat gt gaaa gacgt acaat cgt cgt ggt t t acaagat t t gt t t t t t ct t gt ct aat t ca at t t ct cct t gagctagccg gt aaacat t g agt at agcgg t t t at acaga aat agcggct t gagt t aaag aaat gt t at a gtt aaaaaca cct t cacat g agggccacgt gt cgt gact c at aat t ct ac agat t at cat gcat at ct t a aaaaaaaaaa at gaat gcgt agcaat cat g t at t cct aag gt t gagaat g tgaccagcac at ct t act t t ct at t t cat g t act at t tag ggacgact ct t at acact gt gt acaaat t c at t t t cat t g t agt cgct ct ct aaaaaat a gct at aaagt aat cacaact Page 8 aaaaat agaa at ct t gaagg aaaaaaaaaa aaaat t ggaa act gaat aaa t at t gcgat a at t t aagat t at t agt at t a aggccatgag caacgcatt c gcgaacccct t act at t agt ttccagacca at acggat t t t cgt at caca gagt ct gcaa t t agaaagt a agacaccaat gt t t t caaac at ct t gt ggt ct ggat at aa t at t t act at at agagat t a cat gaaat cc t aagcaacat t agt at t ct t t t ct t t aat t aat aggat gt t t at ct t gaa at aaacgt t a acaagaaaag ct aacgt t t c cat ggt t cag t t t gat t aac at agt t t t ag at aagt aaaa gt caagat t a acgaat caaa ct at t aact a cat gt gacat aaggat cct t t t t t t at t aa t t gcaat gt t ccgcaagtt c at gggct t t a t acct t t at t aat t t aaccc t t t t aat cat cagaaact at t at t t t at aa t aaat gt t t c t t aat at cat t t agat agag cacgacat gg tacaacaaga at t acgt at t cacct gaat t agccat ct cg t t at gaat t t aat gat t t t a t gaccaagt c gaaat gggt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 12689250 Sequence Listing.txt atgactttct tataaaacat taactaagat ttgaccaaac ataattttgt at t atcaat a ttacaccata aatacggcca catatcctcc tagtttcttc acacaactct cccctcaaaa cattccatca aaggaaaaaa atg 1920 1980 2003 <210> <211> <212> <213> 8 2003 DNA Arabidopsis thal i ana <400> 8 gt gaaat aag 0 ccagcgagt a Ci t tcataagaa 00 S ct t ccaat t a C1 aaatacagt t t ct cgt t t ct at agact t ag t at aat t gat t t t aaaaaat gaccagtgag cagaaaaggc cat t ct t caa aat t t t agcc ct t ct ct t gt t agt at aat c gtct t t t at t gt gat t t gag gat gcaaat c t t cact aaaa acagcccct a ggaat t t gag caagat t t at t t at agt gac t agccact ga ct ct agt aag ct at t aagag gaaaaagaca at at t t t aat cat aaaacaa cgt t gat t aa ct aat t ggaa gt t ct t at ga t ct t t gacaa t t t t at t cct cat gaagt t a cat t at gat t cat t gaaaag gcagtgaccg gaaaccaaga cgt at gcaac ttacacaaaa caacaat ct t t at t act ct a agat gaaaat t gacat acat t t at agt t t t ccat ct caaa aaact gagaa tagccgccgg agt gat at t a aat at gcgac t cgact at at at aaat at ca t cgaat t t aa t act gt t t gc gt t t gct cca agcct gaat t t t t gagat t t gt gcaat t ct t caat t gt t t t aaat cgaaa aat gct aact agaact aagg at ct ct t aaa aat gagt t gg agccggaacg gct t t gaat t gt aact t gt a gacat aat aa cct ggt t t t a cact t t gat t caact gt t at t aggt gaaat aggt gagat t act cat caaa tcat t t t t t g gt aat ggacc t gcgacat ct t t ct ct act t t agagt agt t aacaaacaaa gaaat gaaaa t ccat at agt t t gacat t ac t ct t t gat ag t at agt aaga agcgt t agaa t t ct t t gaca aacaaaacac cgcaacgaat t t t gagt aat ct t t aaaaat t t gt t gaat c cacaacgt cc gaat agt caa t gat t gt gca agt at cat gg agat acat at gat gacat t c t t t at aaaaa t t gaaggaat t t agagaat g act aaaat ca t caact aaaa caaccaaagg ct ct accaca agggccggct aat t t t at ca act t aat cca at aaacgat t t t gt acat gt at gt at t aaa Page 9 tgagcacgag act t t t ctta t gt gt ggat t at aact t cat agt cat ct t a t cgt ggt gat t t cat t gcga ttccacgcgg at acgat t cc ggt gcagt t g aaat aat t gc t t t at cat t t ccttttttgt ggat ggt t ca t t at t ccgat aaat aaat ag ttct t agtta t t aat agcaa ct ct caat t c ct gaaat t ca ct cacat agt t agt cacat g t t cagaaact at t acat t t g aggact t gt c caagct t aag at t aaat at a cagt t t aat a t cat t at t at agaagt at aa t agaat t gca t t t aat gagt t t t at ct act t cagt act t g t t gaaat cat cgact cgagt t gaaagt ccg t cact t ccac t t ct act ct t t t t at cggca t gacat t gt c gaat gt cat a t t t t at ccaa t ggat tt aaa aaaaat caaa t t t cct aaag at cct t caat t acat t t gat cacat ggt ac gcaccaagat ct agaagaac aaat gt t t at t t cat at cgt ttagt t t t ag gaat aacaaa gaaaaacgaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 cat t t t t gt t gagaaat at c cacaat at t c at at gt accc aat gaccaaa cacaaacaat t gt t caat ca aat t aat at a at ct t gagt a cgct at acaa ct acccaaca aacacaaaac 12689250 Sequence tt gggaaat c at agaatt gt atagttctgt ttaacaagaa agaacacgac caaaagtcaa aaaagaaaaa gacatttaca t ct acccct a t at at acct c at g Li st i ng. t xt t caaaat at g aaacaaaagt aat t gaat t t agaccaagt c actcgtttcg aaatacataa t ccactt at c ccaatagaca accacctttg ccctctcaac 1740 1800 1860 1920 1980 2003 <210> 9 <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> 9 t t ct ct gat g t gt caagcca agat t t t gt c t aggt t t t aa ccaaaaccca t agcaccgt g ct ggt aact c agt aacggt a t gt t at gagc caaat ct ct t ctgtgagagg gat ccaccag tggcgagagt t t aggt gacc t t cccat t t t gt at t agact at t gt t cttc aat at at aat ggcaagaaat t t at ct t at t aaaagagt gt aat t t t aaga at at t aat t g cat gct gaca at t t gagt at ct ccat acaa ct cat cagct at caaacat c t t t aacat ct agaaggt aac cact t t t t ga at aaact gag agct gcaaca t cctt cgat t tt cgt ggt ct gct at ggggt gaagt gt agg tt ggaacggt t t t t gt at ag t gaat t t cgt aacaat at t a ggt caacaga cat gtt ctt a aact aaaat t at att gagt a gat t t t gt ca agaaacatt c t gt t gt gt t c gctt ccgaaa t aat cat cag cccaaaagaa gat t caat t a t t caagcat t at caat gt cg t t aggat ct t aaacct gcac gct ggaaccc at t t t cgaaa ct t ggat cgt t cat cggt gg t t ggt t cgt g aat gct acca act t t t agt a at aaaaat t a agt gagat t t t t t aaaat t a ccaat t t ct a t t at at aat a caat t t t ct a t gt gat t at t t t aat aaaaa gt agagat t t gat at gt aaa ct t t ccacat at ggagaaac t aaccgat t c cgacgaatt c ggagat ct cg cgagatcggg aagct t caat at ggagt aaa agtt att ggc ctttaggcag atgagaggag cgagatgttt tt agaacgt g gt aat aat t c t gagt t t t aa t t gt aat at a tgat t t t ct a gcat aatt at cagaagtgt g at gat at t t a att att cct t ctt act ctt a gt aat ct at t aact gaagac at ct aggct t aagaggacca gagt t cat aa cgccat t agt gact t ct aaa gaaagagttt ccagaggatt aggcacagag t ccgact t t t accat cagag gt cacgggct ggcgaat t t g tgt t t ct t ga t t t t t aat ag t ctt agt ggt t catt at gat t att ctt gt a gt caacagcc aact t gt t t c at aaaaaaac at at at at at t at gt t at t a t cat t cat t g ct aact t gt t t t cccat ct a att gggat aa aat gaatt ag gt att gaaga aacggt aagc gt ct t cat gt gcacaaggaa at gat gcaat ct caat gact aagaacacaa gat t caat gg agcat t ggat ccct caat ac t aat t t agag at at at aaca tt gct aaccc aaat t gt t gt agct gcaaag aaat t agt t a at t gt gt t aa t at cct t t at at gaaaat gt t ct aacat t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page act agaat at aaat aaaaaa gt t t t t caat aat aat aaaa t gat acaat a cat t t t gacc ct caaagat c gt t t gat at a t gggat aaac ct at aact t c t aacagat t t gagat cacat caat t t acgg t at agt acat aat t t gt agt tctcat t t t t t t gaat agct t at t acacgt t ccat gacaa taacccccca 12689250 Sequence t ct at t agt a t t at t t at ac ttaaattgat aacatatgca gt aaaact aa ct ct gat act gttttttgtt acgaaacaag taactacaat gtatttattt tttgtgttgc ctattttcct ctatgctact tgtgctggcc gtttaattgc atgggacgtg taccaaaatt tgaactctcg tg <210> <211> 1881 <212> DNA <213> Arabidopsis thaliana <400> tatatataac aataatatat gatattctga gatttaaaat ttgcctaact caattttaaa ttgtgatagg ctcatgatta aaatttggat ttcgtatgaa aaaataaaaa gataacaaaa atct t ctgtc at t gcatata tcaatat t ag tttgtctaac ttgattataa ataaactatt ctccaacaac agtgttatcg actcatgatt gatttcaaaa gtat t catat gaaaagaatt aagtcaatac ttttagatct tctctcattg attaaattat ttgataggtc taatgtttta caatgttatc aaagactcaa aataaaatta agtacaacac aaaaaaatag t at aat aaaa tatcattaat tttttttatt tttttttatt gatatattta atttgct t at ttaatttgat cacatgtcat ttttttttgt gatgatgtgt at at at atga tttatat t ag tcaat t ggag ttactgccta tatctcatcc ctcgtgatga tcattcactt aattgcacag aaatatagta ccacgttcat tattttgcaa acttgcatga ataatgtatt ccacatacac aaaaacttac atcctcgttg taaacttaga tcttgagaag aaaaat t at g ct aaat ct t a t gacat caac at t aaat gt c agaaat gat c t ct agt t t ga aaat at at ag taaaaaaaaa cat at at cca aaggt t t at t ct aaact aat agagt at agt t t aaagaaat t aat at t t t a cat t cat t t g aaat t t acac t ct ccaat t t at gt t t gat c ct t gt caact t t t ccat cac gat gt aat ct Li st i ng. txt at at ct caca at t aaggaat aaaaat ct ct t t t cggat t c aat cat t at t aact t agct g aat ct gaat t aat at at ct t cgct t acgt t agcaact at g t t t t at gt ac t gaacaagat at t t aagt ca at gt ct t gaa t gt gt t cact aat t gat at a caaaaat t aa t at aacagaa at t t aaaaaa aagaggt t t g aaaagt agt g caat t aagag at t t aaaaaa gagagagt t g at aaagact a cat aaggt t t t t t t t at act t at at t gt at t agagt caac ggt aagat gg caaat aat aa gcact t t gt t gct t caat aa t gaat agt ga ttaccccgca aat t ct t gaa t at t aat aat tagtaacggg at aaact aag t t act t gt gt ttcaacaaca t t gaaaagt a at at t t t t t g t gat t t gaaa t gat at aat a aact gaacaa agcagcattt at gat cgt gt act t gaat gt gt aacgaaca ct t aat t t at t aggat t gat t at agaat ga gccaact t t c at caact t ga t t t caagt ga t t agt at ggt at cat caaaa t t t aaat cct at caaat cac 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 11 at gagt t ct t tt gagtt ggc t t t gcct t t g gt aact aat t t cat aaaaaa t gaat aacac t at at at aac ct agcct cac t t at cat aat ccgt aat t gt acct t aacat ct t ct t agt c aaagat aaga at t aagaaaa t ct t cat aat aaacagccat t ct caaagaa aacagt at aa at caaact cg t gt att ctt c aaat t t gat c t ct aagccat 12689250 Sequence cctct t gt t c aagtagt t gc t t aagat caa t t t aaact t g tct gat t aat gacacaaaat t at aat t t ca gt caat act a atattgaggg ttgaattat a acat acat aa t caaaagt cc t aaaat aaaa t at t at acaa caaaat at at gatt ggacat aaat t at aaa at aaact aaa aact ct at aa at ggagacaa Li st i ng. t xt t aagaagt gg acaccgagac at acat t t gt gt at act act tt gt acccaa t caat t aagc at cact at gt t acagagt gt t t gcat gt ac aaacacacac t aacat ggct t ggt t t agat t t t t at aact ct t t aat gca aat acat ggt aat t at agcc gagcct caac aacaaagt ca cat acgt aac aat gat caaa 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1881 <210> <211> <212> <213> 11 2002 DNA Arabidopsis thal i ana <400> 11 t t t act t t aa t cgagt t t cc ggat t aat gg t ggct t acag t aat aat cct agagaacagt gt t t gt ggt g ttct t ct t t a ttcacaaaaa at at cagaaa ct t t aatt ct t gtt agcgt a tttt act act t t t agt t t t a gacaaat t ct caactt ct ca t t ct t at aga t at acaaaca gaat t cagaa cgcaacact g cact agt aga acaacgagt g agggaatcag cgagaccaga gaat t t gaag t aaat gaaac gct at at t ag t t gat at t aa t t t aagt gt g t gt cat caga agaaacat ca aat ct act t c ttttcttaaa aacaat act g t cat acattt aat gaaat t t at ccacattt tat t t t gtga gctatggaaa tcagaattga gaatccgaat ctcgggttct ggagaaat ct gct t cat t t g aaat cacct g gct aaact aa t gt t t aat gc acactt aggt cat at t t at a aat at cagt t t t t t t t t ct t t aaaaat gt c t cagct t t gt t t t aagt ct t cgaaaaagca gt t agggaag gat gagt t t a t at cacat gt t cacat t agt cat cgaat t t t gt acaacgg gt gact t aag t t t ggt t gaa gaggt gaagt aact t aat ct t gt t t cat ac agt gaat at c t at accaaaa t aggt ct t at gct t t t gaat cgt cact ct t gt t t at t t at t ct t t aat gt t at caat act t gcgcgatgt t aagt act aa t acaat t t t a t ggt aat cct t gat gtt gt t aacagaggtt gaat gat at a gcactt gaag cagt gat gat ttttcttggg aaaat act t a t aaat ct caa ct t t ct agt t aat t caat at gt ct aaact t at t ccgat ga ggt ggt aact t t act t cgct t t agtt gat c t aaaaat ct t t cat acatt g aat gt at acg ggt gaaat ga agtgcagaag gagagaaggc gact t gaaca gt t at t cat c t aat t agt t t ggtttctttt gt t t t acagt t t gt cact t t ggt t agt gt t ct t t aaat t t ct t gaccgt c t gaacgt t ga at gt acgt ga t t caaaact a gct act at t a t cagaact ca at t t t gccat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 12 12689250 Sequence Listing.txt tatgatatta atattctttg gtatttggca aagtatctcg acttatttat ct t t ct t t ac aaaaaagt t a acaaaat ct t tcaatggggg act t t t t t ga cgct aat gat at cat at caa aat t acat at agat acaccg t t gt gt aaat t gacat aaat gaaagacaaa aaccat t aac caaaaagaga aagcat t aga t t ggagat aa cact gaaat a at aacgt gt g ggt ct caat c t t acgt t t aa gt at aaaaat aaat gt t t t g t aaccat at a gt gcaat t cc caaatgggag t at aaaagaa aaaaaccttt gt at cat caa ggt t at ct t a acct aaacaa t at t t t ggat caacagct aa acaaat cct t gt t t gt t ct t at at t aat ct aacat gacca act ct acaaa aaat aggt aa aagaaacaat gaat at t gga t gct t cat ct tg accagt ct t c aacaaaacaa t gt t gat at a at cat t t t t t t t t aat gt aa t aat ct gt t t cacgat caac agaaaat aat ct aagat aat ct at aaaaag act aaaaaag aggt aact aa caaagaacca t at t t agaaa acaaacaaac cacat acacc aaaaggaagt at at at gcat ct cgt t gaca cact t gct gt gat cat gaaa cat gt t gt t t agcct agt t a gt t gaggt t t gcaccaaacc caaagagcga t gat at ct ca aat at at t aa t t aaagt ct c aagt caaaga gacct gt t gg at gcat t gag acaacaaaag t aaat act t a t gt caaaaca at at aat t t a t t at aat t t g ct t gt at aaa caaagt caat aat aagaaaa 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> 12 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 12 ccctgaccac cctttaacct tgcttcattc tgcatgcggt gagtacggat ccggattcac tcatgtcgga tgtgcgttta tcccggagac cactcttctc tacaatacac catgcaaagg tgtttgtgaa gaagatatgt cggagaatct tgggacgcat gtgcattctt gcgcggttta agaagaagga gaagcgatga gctcggcggt ggatgagttg gcggcggtac agctcgaggc actcggttta tatgctgaac caaagcggag caatttaatg aaaacgcttg tgattgtttt ttatttaatt tgtaagtgtt attcatatac ggt at aat aa caagctgagt ggt t at at ac aacagt gat g at caacaat a t ggat ggct g aacgtgagat gtcgatccaa cactttccac aatatacaaa ttttcattga atcgtgcata cccaccat at t t acaat t gt cgtggagcgg ctgtgaggac at gggt t t at t gaagacgat t t cgaggat g acgt at gaag at at t at t gg tcact t t t t t t t t t ct at t t t ggcct at t t aaaaact t t g t ct acgact t t aat gt t gt g agt gagact t t ct gaat gt c gaagat cacg ggt gcgat gt t at t gcaaag gagccaaat a aagt cgt t ga cgagacgct a t gat t gaat g t t t t t t cct t t aat t gct at aaccat t t ac t t at gt gct a t t t cgcaat g at t t gt cct t acacgt gt ga aat at gat ct aacacccact t cat at gt aa aat gt gat t a atagaggagg t gaaggct ga at aat gct at aacgaat gat t t cgt gt gag acaaat gt at at at gt aagg t gcaat t t ct ct at ggaaat act ct t cat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 gcttctcttt accatacttt cattgcatta ttggttattc cttaatccgt agcatctctt Page 13 12689250 Sequence Listing.txt ctatct t t t t t ccgaaaat a gagggtgaga t t ccgaaat c at cgccct t c t t t gaggt aa aaggaagatt ct t caggat t cct ccggt t t at t agagaga t t aat ggct t t gt gaat gt t at t ct t t gat ttctagt t t t ct ggt gat t g t at t aaact a gaaat gacaa aacaaccaat ttttttttgg t gt agt agcg t t ggt gggt g t ccacagt aa ttggtat t t g ct cggaacat t aggagt t t t gacagat aag t ggcct t ccc t t t t gcaagt ggat t gat t t aacagtggga caagt cct t a gt cact at t t t t t cgt acgt caact t ggt g at cat ccaca cct at acaca at t t at act a gat ccgggt g act cgagaaa acgaat ccat ggagaaggga t t t ctgtgga agct t gcaaa tggaacaaaa at t gt gt t gt t ct aaaat gc gt t ggcagac t ct acat t t t t ccact t gaa cct aacaat c gt ccaagat t aat aat caaa tat t t t gct a at g at t cct gct a gcacggaaat gcat gt t ggc caccgt ct t t t t agat agag t gt agaat t t ccacaaaaca t at gaggat g t t t t gt t t t t t at t t gt aat t t t gt t t t gt cagt cgacga t t at at at gg accaaaatt g at gcaact ct accaaact ga t at at at t t t at gt t at t at t gtt aaagaa acaaaccgt g cccgcaagcg at ggt t aggg gcaaggt t t g aaaagat gaa act gaaat gt gct gcacaaa t t ggt at at g ggt agt cat t ct t cgaaagt gaagct caat gagat t t t t a aat gt caacc gaat t gt t t a ct aaacgt t t t aaaaat gt c agt t ggt ggt t gaagat cga tcgcagcgga t gt ct gat t t cat gagt at g aaagat t ggt cgaat t gagt t t act ggt t c t gaat at aaa t at t ct accg caat ggt t at t at t gat gaa aat ct t at ct gt ct t ccat a gcggaaaat g ct t acact ga 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 13 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 13 ggacaaaaca gagacat at a t gaagt t t ga gagatcaaag gatgacgtag atttcaatgg gagagagaga gagacaccga tagaagaagt ttttaatctc tggcttagga catgattgtc agttaaattg tttgatttat gttgtaagaa tatatatttt tcttggcatg cttgaccaac taattaacat atcttgacta ttaaacgcaa ccctatgt t a tcagtttatc actagt t cag tgacgaatta aat t aataaa gtcggtaatt tatgacgatc aaattaataa aagtcacttg tcgtcaaaac tatgaaaatg gatgatgaaa agtaaaaaaa aaaaaaaaaa actgatgtga cgagaaagaa cggtgcagac aggt ct t t gg t t ccaaat gt aat aaat t at cat t at at at acacact cga aagt act at c t agt agt at c t t t agt at t a aat t aat aac aaaccgacgt gagacct t t g cat gat gat c at at t t aaac t t aaggt t t g aaagaat at t t t at t t aaat at t cat aaaa at t t at t t gt gt t aat t t t t t cgt at ggca at t ct t aaaa at caaaccca gagagagct a ggtggcgaga caaaaactt t t ggt t gcgt c aat t agaaaa t aaacaaaat acaat caaat t t t acat ct t t t t aaccct a t t caat t gac aaaaaaaaaa agaaaaccga 120 180 240 300 360 420 480 540 600 660 720 Page 14 12689250 Sequence Listing.txt cggatccatc aaccagtccg ccactgccca ttcctaaaat ggtcctagca t t gagagact t ct cat t aat gat t at t aat ggt t t t caac aat gagcat c cacggcacac tggaaaaggt tgt t ggcaca caccgt ct t t t t agtgagag at t t t aggt g t t agct t gca agt ggaacaa cccact gt t t t t t t gt aagt t t gat at gt t gaggactat t t at acat ggg t at gt at at t aaaat agaaa gtt aaccaaa t ggct t ggat cat cggt ct a aaccggaacg at gat ggacc aaat ccact a gcaacat t gg t aaagaaagt aaccgt gt ga tccgcaagcg gagatggt t t gat gt agaaa aat cat gcaa t ggat gt t gg t gt t t t t gt t t t t aaaat gc ggaagacttt t aggcgat ga aagcacaat t aagcaacaac t cacaagt ca ttcagacaca ct t gt gggt t at cat aggac ttaacagacg aaat t t gt t g cact ct ccgc aagct agacg t ggt ggt gag agat caat gc ttgcagcgga ct gt cgaaag t t gcaaggt t caaaaaaat g at gact caac t gct gcacaa t at t agt aat gt t t t gt ggt ct t cgaaagt act ggcgaat t t ggt gaat a t ct acat at t at g gt ggcct at t t cgt aat cgt gcgt cgt acc aagt agct aa agct gt agag ccgcagcaga ggt gagat t g cgcaat ct cc at cgccct t c ggt gt ct gat t ggat gagt a aagaagatt g t gt cgaat t g at at ct t ct t t t ggt at aaa agt cat t t gt caat gat t at t t t at t gt t t at caaaacca t t gct at at a ggtt acaaga cggcaacat c aacaaact gt t t t t ct t ct a ccacgagt t a t cccggt ccg gt gggt gact acagt ggt aa ttggtat t t g t t t t t gaggt tgaaggaaga gt ct t gagga agt cct ct gg act at act ag aat aat t cag t accgt gt ga t agt cgt t ac t at caat at t aact at gt at t t t at agt t t at t cat gaag t t at t aat ca aaaggat acg ggt t gct t cc gcacat t aca gaccct agac ccaggtggca cgagaaggca gccaat ct at ggagaagaga aat t cggaac ct t aggagt t t t gaaagat g t t t t ggcct t t t t agat aat t cgct t ggat ct t t gt t t t c ccacat gaat ct gt aaaggt t at gt t aat g ct aaacgct t 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 14 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 14 taaggaataa cctttaaacg gctaaaact a taaaaagtga tcaaaaaaaa acaaagcaat agtgacgtgt tgttatttat cacaattaac actgtttttt tttttttctt atcacaaaca catggaagaa gctatgt t ag aaaatgagct tgacaaccat cgagaagagg gtgacaacga aaatgatgaa gggttcaagc aaaaaaaaac tttgtcatct tttaggtgag acaaaaaagt ttttatctag aagattaaac tttgattttt t ct gaat gt t at at t t at ct t gacagaat t gaaaat t at t ct agct t acc t acat gagcg t t ggtaagag ct t gat aaca ct cacct aaa Page t t gat at t t t t cacat gcct t t gt aaaacg t ccgt t ccaa ggt aat aggg at ggat agca tttgaaccca at gt gt gccg aggt gacaat at cagat t gt at t caat t t a t t accat gt g t ct t t ct t gc aaaccagt ct cat aagaat a aaacat caga gacacagtt t aaagt gagct 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt atgtagagtg tcgtcaatta tttgtccatc ttcttgtttg tttttttaat t gt ccaacca ct t gt t t t ca ct act t t at g ccact t t ggt agat t aggt t t t aggt aact gaaggtggct gt gct t t ct a aat cact gat gt caaaggca t t cgcggat t ct at cat caa t acat agaca gaat t caaat gt t t at aat t gt ct t gaaat t at acgt cga ct agt t t t ga aaagcgcgcg caaaaggat c aaat acat at aaaaaaaaac ct t ct t ggct t t t gtct t cc t t agact aat ggaat gact a aaacaat t ag t gt ct t agt t aaccaaaaag ttgt t t t ct c t cggaggct a gt t t gaaaat cacat aat at ct cgt cgat a t aacat at at caagaacat g t at ct t acgt t t gat at caa at t t at caaa aaat ct agaa t t cat agt aa aaaat act at aaaaaacaca ggt t aat t t a t gcct t aaaa t caat acct a t t agct aat t t cct acaaat at t t t cat t t at gcat t cga t t t t caattc agt t t at t ga t ct agcat at agagat t at t cat t t gcgt t aaaat caaag t acat t at t a t aacat aaag t t t t t t gtcc t cact gat aa t t aacat gaa t t t cgt acaa ct caat gat g agaagt at ag gt aaaaagt c at acaat t ga at aat aagt t t gat t t cact caaaacct ac at aat caaaa caccat at ac at g tgt t t ct t t c at gt t t t t t c aat t t cggt g tgagaagaaa ggact cgt at gagcaaaat g at t t acggga aaat at t ggt t gact t at ac t t ggacat t g at at t t t t at aaacat gt t t tcaacaacca t t t ctatct t cgt at t ct at gcgaaat t ga t acat ct gca agt gaacat g aaat t at gt g aat t t t t at t t agt ct t aaa at agt act ca ccacact cac t ct t t t gtct t t t t cct t t t at agt t t tag agacaaggag gaaat t gat a t t t at aat t t aatggaagga t aaaat agaa gat t t acaaa aaact gagaa accgt at cgt at cgcagcat t at ct t t agc act act t t ga t ct cagaagg gaaccat ct c at aaaaat gt aaaat ct t aa cacat aat ag at cct t caac acgaaaaaag agct t ccct a aagt cacaac tt ctt ct t t t t cgct t t cat at gt gcat t c ccat t t ct gt caaaagt at t t t t gt at gt a at cgat t agt caaaacacaa cacact t t at aacaat t cag t acgt t ggcg ct caacaat t gat gat at t a gaccct acac aat t t gt gt c agt cat gt at t t cat agaac at ggagt tag t t gt t t aat t at agcat t gt cact ct agt a aaaaaaaaaa t at aat gt t c t caaat aat c 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> gtaattaatg tagtacatat atctgtttta attttttttt gaaaccaatt ataaatatcc tcacggtaaa atggat t ata tatatatata tatatatata tttgctaaat gaaatatcat ttcaagaatt tagtttatta gaaatatttt attaaatgaa atataatagt ttattattaa actaaattgt ttcatttaac caaaatattt tctatgatgc tatttcatcc ctgatttttg tgattatgga tatattgtta actttataac aaaacaagag aattcaaggg actgaaacgt Page 16 120 180 240 300 12689250 Sequence Listing.txt tttatgactt ttaaatggat gactatatac aatggctgat ccaaaaaaac aaggcat at t t agagt aaaa t aaat gt gt a aaaat act gc t t t t gt aaaa ct gaccaagc gt t t agt t gt at t t t at t cc ct t cat acct aact t ggat t gt gat aat ga agt t t t at ct t ct t t t gttt at at gat at a agt caat cat aat at at at c acct t ct ct t t t t gacat ca at t at gat ac at t aat t aac aat t t t at ca act t aat t t g t caact t at g t agct t t cag at t caaact c agcaacaaat ctct t ct t t g cgt t agt at a cagt gagct c agct t t t t ag at at at aat a aaact agt t c aaaaaat gct gaggt t t t gg ccact aaaaa caat gaacaa t t acaat aaa t at t gacat a cacat gt t ct at caaat t t g t t caat gt t a at cacagt ca act caat t gg t t t t t aat at at t t t t t t ac t acgt t at t t act t cccat t t act at agt a tt at caaagc at aaat t t ct t t gaagat aa at t ct t cct a at t t aat at t cat aagacgc gt cgct acag caaagccgt c t aggct t agc aacaaaaaaa cat t t aat ga at aat cgaac act t t gt t t t gct t t aacca cct agt t ct a gcaaacaaga gt gat t aact tgt t t at t gt agtttttttt aggagaaaat ct gt t aacct t cat acaggc ct t ct t caat t t t gact at a acct caaaca t ct at aat aa at t caaagat t act t t act a agt ccagct g gt at aact t a at ccat t gga t acccaaat t t aaat acact cacgt gacaa caaaaat aac at g gccat at at g caacacct t a at t acaacaa t caat agaaa t ct gt t gt at tacaaggccc t aaggagat t agt t caat at cat t acaaat t t t agacat t tttttttttt at t t t agt gt t aaagt caag agat gaagat accaacggat cccaaat agt gt caaacgt c tccaaacaca t t t at agcac acaacat t t g cat accat t a acaaat at t a ccggccgtgg t cacat t caa aact aat gac t ct t gat cgc aat aaacaag t t acat caaa at t t at gt ct at ggt aaaac at agaaaaaa cggt aaagat acat gaat ac at t gcct t ga tttcttgtcg aaat t gt gag t t at ct t cat tttggacaaa t acaaaaat a t t t caat cag aaccat t aaa t aat t aat t c t t agtgggca at cat at at a t cccccacct aat t t t gt t t aaaaaat at a t caaaat t gg aat at gt gt t t t t t t at at t aggt t gct t a gt gagaaaac t ct at aaaaa at aaat ggag aat t t t gggt at t t at t t t g at t t t cgt t g t cat gct gat gaat t t at ac gagaacaaca aaagt at aga gaacaaaat g aact t t t t gt gaat act aga t t gt aacat c cagat gt t ct t at agaaagg at aaagaact tcaagacgat ct caat agct caat t t cgt a t aacaaat ct t t at aaat at tttaaaagca caaaaat t aa at gt ggat aa aat agt t at g agt t t gaat g t ct t t t t aac t t at ggat gc accaaacct g ggt ct aat t c 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 16 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 16 atatttaaat aagagattat caggaaaaca aaatgttgat atacagaatt ctacacgtaa ctttaatttg ttttcgttta taataataat caagaggaga cgcaagagag gttcagagat Page 17 120 12689250 Sequence Listing.txt gcgaatggga agtgtgtggg agccacaata atttcatcta cactgtttgc tccctcatac ggt t t gacca acgt acagag gt t at cct gt t t caagat gg at caaacaac cact t gt aaa t gcgct at gt acat aacat a aagaaat cga gaaact t ggc t aaaagat ag gaat t aaagg gt t cgaagga acaaagaggc gt t ct t aat t t at t t at t gt ct at agt gt t t t at at t cca t agat at ct g tcat t t t gt g at at cgcat g at cct cccca t at at at agt t t t att gaac aat at t aaat t aat at at ga gt ct t t aaga aat gt t at t a t gaat t t at c tgaaccggcc gacaaaact c acaat ggt aa aat at t ct t g cgt t ggaagc at t gat cacg t t t gagt gt c gat cgaacaa cacat cat ct t t accgagct gcaaat t gca gt t t gat gga t t at gat cca aaacgcgt ca t gct cat t gt t agggt t t gt t t cat t gct t gagcct aat a agt acgt aaa cagat ct at t t t ct ggt t ct cat acacagt accacacccc aaat gcgcaa caaat aagt a at at at acac catggcagcg aaccaagtt a ccat gt gat a gt cgt t at at aaccaaacat tttaagccac aat aaaaacc t t at t t ct t t ct t acct acc caaaaccgt a gcgt gt gcat cct ct caccc aaat ct ct gt cccgct t at c aaagacccgt t t gt t caaat gt gat t agaa aaaggt t t at at agt aaaag gctccggcga acacgt gt t g ct t at at cag at gt t at t gt t cat t t caat ct t gt t cct t ggct agct at t t ct t aagat aat t at t gt t cgt ct cgt gc t t cccact aa aat t t gcaaa ccgat gaggt aaagaaaat a t aaat gaaac at ct t acacg at t t t at gt a aaccact at a at g t cat t t t act t t t at gt t t c t gat gt cgt c gggactcacg aagt t aaaag gt ggcct t ga tagggaaaga taacaccgcg at aacgcat t agt acaacaa t ct cgagcga at gggt gt gc aaggct at t t at gggat aat t aacagccat gt t gcaacaa cggccgtgcg t t act at gaa aggt t t t cct gat acaat t a at aat t at t a ct t ccct t ga act at gcaag at at t aaat a t at cact gac t agaaaat ct t aacat t gac ct agcat gat aaat at caac t at accaaac t acacgt t t t aat aggcat t aggat cgt gt t at aaaaggc t t t t at cagc gct t cact at aaccgaagcc act ct acgaa gccacat t gg cgcacccgca at ggacagat at t ggagggt gtgtcgaccg aagt gt t at t cgt ct at at g t aat gt t t t a gaggaggat c aat t ggaaga agt aaat gt g t aat t t t t at ccgaat gaaa cgcccgt at a t cgt cgt acg t cgaaacat t at acaaggat acat t t act t agct caaaag gcat ct cat c aagct agcat at t t agaaac cat t gat t t t at at t cact g ct agacagcc gaat t t t t t c gacat caaat ggt at t ct aa ct agact t t g gat t t cat cg ggt aagaaca tt ctt gaaag cagat cct ag t t gt gt at at ggaaaagtgt t aaacagt aa cgt gggt caa t t gat t cgt g t t t t at t t at gt agt t at ag ggggat t t t a t t t t t at at c aacaaaaacc gaatgggcca t ct at cct t t t t t t aat gat acggt t t cca gat t t t gcaa t aat ggat cg cagcct t t t a gat gcat cac gcct ccat t a 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 17 <211> 2004 Page 18 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 17 gt at t at t ac gccct at at t gtgggtccaa t t t t ct t gt a t aat ct t cag t agt aaat t a t acat gcgt a t t ct acct ag ct cagaacat cgt t cgacca aat t at t t gc t t at at at t g gaagccaaat aaaagt t aca tagtagaaaa aaat t t cat t acact t at at cccaaccaat aaggaat t ag aaat t aagt g aagt at t t t a ttttttcacc at gaaaaat g aaaact caat ct t acacaat t cgt t t caaa t t t t ctatcg t cact aaat t t gt acgt acg gt t t cgagt t gaact t ggt a aaaagt aact t t at t agt gt cat t t at t ca t gt cct ct t a ccat t t t t at ct at gt at at at cat ct aca aatcagaaga acat ggacgt at at act t t t at aaat ct t c cat t t at t t t t ct ccgt t t t cacgaaaaca ccaat at aaa aact caacct tttttaaaaa aagt t cgact t t ct t aagt c tggcgggacc caaacct t aa t t caat gaaa ct ct t ct cat aactaaacgg t ct at ggt t c caat agt t at acact t t t ag gt t cacct at at ggt t t at a t aat at t t t t caagt t at aa aacgt gaat c aat gt aact t t gt aat aaaa ct at ggt t at aaggcatgt t t t t gt t t t gt at t at t agct t at gact t at acct aaacat tgaaccccat at t gt t t t t c gt ct ct t at t t at at ccgac t ggt t t aaac cacat t cgag gt t gt t gaaa cccaatcaaa gagaacat t c aaacctaaag aagt aat t cc aagaaaaat c act t gaacgg aaaacat at a aacgt aggt c ct ccct t t ca at t t at t t t g gt act at acc aaagatgct t t t t ct ct t t t t at t at aat t t t t gtgcgac at cat at caa acat t t t gt a at t t t t acgt t t agt at gt t t t at ct at at ct t t gat t gc gat t t t t ct t aat t t ggt ca acat gcat aa at t t cat t t a caacat at t a t gccgat gt g t gat t aat gt cgt ggact at gt cgat t caa aaagct t aga t t t cggaat c aat t acaaaa acacacactt at at t at cct t t t gctccgg t t at caaagt gt cat acaca t t ct at aaaa ccgaccatat acaccacatt aat ct aacca aaat aact t a t act t t ct aa gtagaaaggc at ct at ggat gt t t cagt ga t agt aat cca at aat gcaag t t t aaccgt t aaaat t t at a t t t gat at at t ggt t t gat t ccact t t t gc t at aaacgt a agt gagt at a gt act acat g aat t cat t gt gacaacaaaa gagt gt at ca aat t t ct t t t t t t t gat t ga act gaaat ga t t gt aagat t at agt aaaat ct agat t gat cat t aat t ag t ggaggcat t caaacaataa accaaatct g tttgaaggca t t gagat t t t accgacaccc t at t at t aag t t gaccaat c aacgatcgt t gat cgt gat t t gat t at gat aagt gt agt t aatct t t t t t at gacagaat at at t ct ct a t t t t aat at a gt gaat at t t at aaaat cat aaaat gt aaa gct t agt ct a aat gt at gt t t t at t gat t t ct gat at t t t gcat ggaat a ccaaaactt g t at t at t t gg cat t t gaat c aagagtgcaa aaaagat t gt at t cat ccac caaagataaa t cacaact t c ct caat gt t t aatggagaca ccaaaaaaaa t cat t t t cag at ccat cat a aat cagct t t at gt ggt t gg ctataaagaa t t cagt cagg gt t accggcc t t cctaggca gat agt gaat t t t gt gat ac cgt ggt aaga tataaacgaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 Page 19 12689250 Sequence Listing.txt ggagaaatga aagtattcat caagtaaaga aacaaacaaa caaacaaaaa acacacttca ctcgctacac aaggaagaga gatg 1980 2004 <210> <211> <212> <213> 18 2006 DNA Arabidopsis thal i ana <400> 18 t aaat aaat a ct aagagat g t t t aaagat a gt agaaaaat t act at t aat ggt caggat c t t caggat at agaat ggt ac agaagt ct at t t ggcgct aa t gt ct t t cct at gaagaat t at t t ggcacg cgggaatgga gaaat aat at acaaaaagcg at ct ct agaa ct cagaaaaa ttggcgccag t t ccaact gt aaggct t ct a at agaat ct t ct t agcaaaa agaagt gt at gt t t at at gt t agaaaaaat gcaggaaat c at at caacaa at gt t t at gc at caaaat t a t t t ggaat at t acaat cgt g t gggaat cat t agt gat t ct t gat at t aca t at acat cat cct cact aca caat caagac aaccgaat ga t aaaagt t t t aaaagtt ct a t ct t t aat t t t at ct cat t t act ct at t t t ccagcacttt gact t gt ct a gt aat t gcaa agct at ct t g cat t gt cact t t t t at caag t ct t aggat t t t t t ct t t ag gt t at t t gat gaacaaaat a t acggt at cg acaat aat t a t gt gct t gac t aact agaga t at at gaaat acagt t agct cggaaaattt t t aaat t t gg t ggat ggat c caaat ggaga t gcagaagt g gt agct t t t c ccagcctt ct ct t t t atct t t accaaat ca gagt t ct aac aaact agat a gt t aaat aga caacct agcc t gccat gt at t gt t t gaagc agt ct caggg gt gggcact c aaaat gt gt a aaaagcaat t atgt t t t t ct gat t ggt t t a gaagat at t g t t at ct ggt g tggt t t gttt aaaaaaaat g cat t at t aga at gaaagt at t aact caaac acaacctttt aaact cgaat at ggaaat aa gacgt caacc gaagcgctt c t aat agact a cagt gt at t t t aaat t t t ag cat cat at ca t aat at cggt cat aaat gaa t gaagagcct acat agt gat act t t cgct a ggtct t t t t g aagat cat aa gt gaact gcc t aacat at aa aaaat aaagc t aagt t aaga aaaaat gaac t t t t aaaaat gact t t t cgg aaggt t t aaa at t at gt gt a gcaaagt acc cagt t t aat a t t t at act aa gaaacccgca at ct caaacg agcgacaaat at gagaact g t ct gat gat g t t cagat cag ggaggat t t g t acaaat gt t cat at gcaaa tgacaaaaga acaacaacac cct ct agaag ggat t gat t a agaagt aaga ggggt ct agc ccat t gt gag agccaatttt gagcaccct c aaaccaacaa t gt t t t t ct t aaaaat agaa at caat gaaa cat t caagga t t aaaat t ga cgt acgt at g agat aagact aat act aaat aaact at ct a cgt at gt gcg at at t aat t a tgt t t ggagc caaat ct t t a aggt gt at ga gt gaagat gg aaaaacgaca aaagcggat a caacat t ct t aaaaagt t t a acccat acac at gaagt gat aggacaacac agggagtgat t act t cat cc ct t gt agaaa agaggt ct at aat agt gt at t ccaaagct a aagt t aagat gat at t gt t t gt cacagct g acaat acat c t gt at gcat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 tgtctactct aatccttatc attgattcat tatatgcttc tttttttgtt ggtaaaacca Page t t at at gcat aagaact t gg aagt at aaaa gaaggaagat t at t t ct t t c tttgtcaaac acaacgtgaa cccggctcat cacacgtgaa actacccaaa 12689250 Sequence Listing.txt ctatcct t at ct t aataaca aacaaaatac at t gtatgat at atgactca ttctaaatat attagcagta gatatcaaat ctattttacg tctaaaccgt tacacactct atatataaac aatagtcatc aaccaaagta agaagcgcac acacacaaca gagat g 1800 1860 1920 1980 2006 <210> <211> <212> <213> 19 2006 DNA Arabidopsis thal i ana <400> 19 aaaaaaaaag t t agct caaa t at t t t gt aa ggcat agaaa cgagt t t at a t gt aat at t t at t t t gt act aat gagaaac aat gcaccat gat ggact ac t at gggt aaa agccaat at t aaggt at at c cgagaccat t t t t ggt t t ca cgt cgt t agg t cggat aaat gt t aat caac agat ct agga t aact cct t g at aat cacgg cacat at t aa t aat t acacc t aaat at at t caaat cat t a aagaat aat a agat aagtt a t t at t aat t t aaaacagt ag at t t t agacc cat t ct cgga at acat aat t t t at t t acaa gt gt gt gt ga aagtgttttt cgaccagcgg t t t t gat cat t gt agat agt tt gggt agct t cat t ct cct t cgaat ggat t cacgt t t at tt ctt aggaa ttggacaagg gt ccat gt aa gcat gccat c caaaacacaa t aat t t aat t t aat t t t t gt aact acat ga act at ct at t t cat at t gat acgagt at cg aat t t cgaga att gt caat t t t gt at t aat cagtt aat ga t t t t aaaact acaaat t t gc ggt cgggct a ct gt gct ct t t ct cat t aaa caagt agct a ccaat gt aca t aaaat gat g t t t t caat gg ccggct aaat aacct t t t ga at at cct cct aat t t t t aga at cacccat a aaat ct cat t aat aaaaaat at agaaaaat aaacacaaaa t at aaagaaa agggaat t t c t ct t t gggat gct ct agcat at cat cgt ga aaat at at aa t caatt gcgg t at t t aacat agt accat gt gaagt aaat g t t aaccat ct at ggtt ggt t gatt aat cat t ct ct t gaat t act ct ct ga gt aagt aagt ggct cgaaca gt gaat at cc t gt at t t ggt agccacaatt t acagt at aa t aaat t at t t at gt aacaaa gaaaacaaca at aat aaaat Page 21 aagct ccacg at t t t aaaaa aggat gt t t g cat t gat gaa aaagat t at t t at gt t t cat aaaat t ct aa t cgt at at at aaaact aat t ggt t ggaaac cct aaatt ac t t ccagt gaa t gggt aacac t ccat cgat c aaagaaaaaa at t t t t gat t t t t t gaattc aagcct ct at t t t t ggtggg gaaacgt aag t at t t at at a cct t t ct at a at at aat t ga t t t at t t tac t gct aaacaa t cgt acgt ga gt t at t at ca t gat ct t ggg t gt acct gct aaaacat gaa t t t at gagt a at t t t t at at gagt t gaaca agct t accaa t ggt cat aga t t t aat ccaa t t t t gaagt t t t aggt ggt c caaagggtcg aat aggt t aa t t ggt at ggt cacccaaat g t t aat cgat c act aat t agg aaaggcact a agat t t t t t a aaaacat t at t caaaaat at aacaacaaaa t aaat gt t at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 ct aagaaaac ctt ctt at cc gaaaaagt at tttt cat acg agt t t aagac t gggtt gct a aaaaat gat a ct t t at aat a agcagaagaa t at aat t gt a at caat gt t t aat caaat aa t at t aagct g ggaaacgaaa accacaat ag at t aaaacct aagct at aaa acccat cat c 12689250 Sequence t acaaaagct agct att aca aatatatatt tgtatagtaa aactctttag atagcaatat t aaaaaat aa t gat t aagat caaaagat ga t aat cat cga aaaaacaaaa gt t ct t t t aa aaaat t aaaa t aat at at ct cctcaacatt tttcaacct c at cat g Li st i ng. t xt ct t agcacca aat aaaact a gt acat t aaa tt ggggt aaa tt cat cat ca ggt t t t aat a acaagact ac ttcccaaaaa t at aaat aaa gt t t gcat ct aaaagact t a gagaaaccaa cgat at aat a at at at cacc aacaat t ct g aaagt gt aga 1560 1620 1680 1740 1800 1860 1920 1980 2006 <210> <211> <212> <213> 2003 DNA Arabidopsis thal i ana <400> t t t at act ct t t ct aacact gagaaagaac aaacacat ga t t cat ggt ct ggt t t t cagt t t t ggt gt ga gaact gattt gat ct at aat t t at t t at t a aat t at at ct gagaagacgc gatt gtt aag aagat gt t ga aat t at at ga gagtaagaag t agct cat ct aact t cat gt at aaaaat gt t aacaat t t c aaagaagaaa t t aaggt aaa gagt att aaa t aaaccgt t t agaacgcggg tt cgt att ct at at at at at gact t t t t gt gggcat act t taaagcaaag at cagat cga t t cat aat t a agt gacaaga ccact ct at t at gacact at aacaacacaa acagaaaggg t aact aat cc gat aacaaaa at t t agt ct a act at t t caa t ct aaat t ac agtgtgaacg tatcatctct aactaaaact aaaaaat t gt aat cgccct a aaat at caac aaaagt aagt ttaacacacg t t cgt aat ca t aagagat at cat gct ccaa agat gt gaac t ct gat at t g ggcat t gat t t t accgt aat aaaat cgt t a aat gaacact at gcat aaca aagt at agaa aggt cagcaa t gat gat ct a ct gaaaat aa t at t t aaat t t at t t gat at cagt t t aaat at t t gagct a aaccaat t at at ct gagat t t gatt cgat t agt cat at ac gcat t t aggc ct at t t cct a aat cagat t a cagat at acg t gt gt ggct t t aat t t t t at aagaagt ct a at gaat ctt c t t aaat ct t c aacaat ct aa agt t t ct t ca aat gaaaat a aaaact cgag t t ct agtt ga t gccagaaat agaact t caa ct aat agt cc gaat t aat ca caaat aat ac at gt cacat g t t aagat at c t t gaccagat at t aaat gca t t t t ggact t aagaaaat t g t t ccat ct gc aaact t agat tcgaccacag t ccaacat gc cgcaacact c t ct cgaaat c aagt act t t a t aggt at t aa ct ggt caat g aaaagaaaac t ct t acaat t aaagt caaaa aat acaagt a gt t t t gcggt t gt at ct gat aacat t t t ga t acat ggt cc agt aggt t t g t t ggggat gt ttttttttgg aagt at t t g gt t t t t gggt agt cgaat aa aaact cat gc caat acaat c t gaaat cacg aaaaaat aaa t t aaat t aag at aaagt t ca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 aaataaatca aagtataata cttcatagtc acataaagtt ttgttttgtt tcttataatg Page 22 12689250 Sequence Listing.txt gt t t t aggt c t aaaggat t t caagaagat a t cagat t t t t aagcatctt c gt t cat cat c t gt ct ct t ag t ccaaat t aa ggct t t gat a t t gat gat t g at aat act aa aagat t caaa aat ct cagt t t t aat caaag aaaat t t gat at acat acac at cct caact t at ct t t t t a accat t aat g t aat t t t ct g t aaaat t t cc gat t aaaaaa cgact at gaa t ggggt t gca cacat t gt t c at t gt t t ct g aat t gaggaa t aat at gcaa t at ggat t ca t ct t t ct ct c t t t t t agat g t t aat t gaaa aaagaact t a acagt t gt t t at aat aaaac at g aaatatccta aagt t t t t aa t t caat t ggt aaggt t t t gt cat t at t t t a at gt gaaaga t caaacact a at ct gccat t aaagagaaca t t t ct gct t t aaacgatact t ct t t t gaat aaaat at aac t aaaacact a aaat gcct aa gccaaactgt acagcat aat at aggt aat t gct agaat t c t t t at cat ga t cct t ct gt t at t cat agt c ct aaat caat t aaat acct t t ggccact ag gt ct at aaat t at cgat cat aagct cgct a cct at aaagc cct caaat t c tagaacaaag ccaacataaa t at at gt t t a 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 21 2001 DNA Arabidopsis thal i ana <400> 21 t t t t cccaat at ct cat t t t t acaat ct aa acggcct t ag aaat t t t at a aat ct agt t c acaaaaattt t t cagaaact ct t ct aact g gggt gaagt c t t t gt acagt at ct aat gt g at at agt at a caaaacagat at t aat t t aa gaacctcgaa t gaagt t agg agt acat at t t ct cct t t aa cat accaaat agt t gcat ga act gaaact g gt aat ct t cc tact t cat ct aataacaaaa aaat t aaaaa t ct at gaaga t ct ggt t t gg t t t t agt at c at at t cccca t cacat t at c tccccatgag gat t ct gat t at cgacaat a cgaaaaaaga caggggagt a cactagacca t gt t aat aat t aaaagt gga aaact at agt agt cat gat g acgt t at t t t act aaaact a aacagcatcc gt gagat at a at ct cgt ct t aaat gat ct t t ct agt ccct caaagaaaaa ccatgagaaa t t ccct aat t ct at t at at a agt agt t aag at t aact aat cagt t gt gt g actcaagaac gaagaatcgg t caaagt t t t t t agat t t t t t t ctcaaaaa aaat ct aaaa at t t cat ggc ccatgaagag t agat ggcct aaaat ct at t cgaaat agt t ct t t cgaacc ct at gt t gac t gt gaaat ca t aat t aaat c t gt cat aaac t aagt gt t at ggcggttttt acct t at aag ct gat aaaag t ct at aat t a at t t at cct a gcaaaacttt tcgaaaaact caaaacat t a gt t caaagt t t at t at gact t ct t agct t t t t t catgccg aaccctaaaa caaat ct at g acgt ct t ct g gt t at ggt t c t caagt at ga aaact cat ca gt t t t t caaa ct agat agat tggagaagaa at t aggaaac t t at t t agga ttttttaaaa aaaaaccaaa gat at t aat g aat agaggat t t aagt cat c act t t cacca aat t t t t cat gcat ct at ac ggt t ct t aat cccat aaat c cact t t acgg acagaagggg t gaggat t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 23 12689250 Sequence Listing.txt tgaaagtgtt t tccagaaca tgcatgtgtg cat atagaaa atctaaaaaa cacgt acgat t gt t t t caag act t gt aaat t t t ct at at c aat act t t t a cact agt t at gaat cgt cag at at at t t ct t t t t at t t ac tacacacgcc aat gt t at aa aat at ggt at at t t t gcat a t t ggt at at a t accct aat t <210> 22 at t aat aagt ct caaaaacg t aat t t gat a gct t gaat cg aaaggcgaaa aaact agt aa ccat ggct t c ttttctctcc accact gat g aat agt ct t a t gagat aat a gaat t at t t aaattt ccag at at t t at t t t aaat t t t at t t aat t t aaa t t t t ggt t aa t ccaagcat t acgt t at t t t at aagaact g t t t ccat at g t t caccaat c t cagccat t a t t t aat act a t t ccct ct t c at t acat at g t gt t t gt t t t gcaagccatt tcgagacaga t gtt acacaa at gt aagcct at aaat ccat aaaat t aat g at aaacat t t tgaaagaccc accacagcac t aggt aact a aggagatcag t t cact aaaa aacat aacac t t ggcaacat cct t ct ct t t aagt at at t g tt aat caaaa gt at at aaaa gt t t t t t aag cat gcgt gt a t t ct at aat g agaact gt gt agcgat cat g agct ct cct t ct t cct t at g t at at at aac t at at at at a caaat at agg t aaaaacaat aat at aat t a cat at aagt c cat at t cat t aaaaaacaca aaat agt t t c agt gt agct a cct cat aggc gt at aaat aa gt t at cat ct at ccacact t at ccact at t act at cacat gt aagt t t t g t t t gt t ataa t ggt ggt gt c gaaaacacaa 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2001 <211> <212> <213> 2002 DNA Arabidopsis thal i ana <400> 22 ct ct at acca t aaaat aat a t gat aaacct acaagt t t cg t t aagaaggt tt ctt gcaag aagt t ccaga t t t t cat at g aggcgt t ct t t t t ct gcact aaat t gat aa gacact ct ct ccacaagttt t gt caacgca t gaacgagct at gt aacct c gacaat at ga ggccacat gc agt ggat t ca gt aat at t cg aaagat aagt aaat aaaacc cat at aaaac ct t t t agaaa t t act agaac caaaact t aa t t at aat t t a aaat t gagt t ct t ggaat t t aact ct gt t t accaaagcgt act ccat t ag at t t gagacc aaat gat t gg t t at acat ct aaaaat cat a t t cat ct t ca caaaat t gt t ttacaaacga t ct t ct ct aa t at at acat a at t ct t t ct t gaaat gcgct t t t ccat aag gt ccaat gac aacaaaat ga ttttttaccc caacaaagaa ggct t t gggg t at at agat a cgt aagcaaa cacct t t ct g at aaggacca t ct t cact gt act aagct at at t t aaat t t gt t at t aat t aaaaat ct ca ct gct t t gt g t aaat t t t gg t ct t t t t t t g aat aaaaagt ccggct at at at agat gaac tgtgt t t t at t cgt t ct aat act ct t ct gt aaat ct t aag t acct at act gaact at t aa t gt t gagat t at gagt aacg ct cagatt ag t ggaat caat t t t gcaat t a agaaat t at a t t acagt t aa caaacaaat g t t t t atctag gat t ccaat c at t t gat at a gaacaact ct at t aagat t g aaat at gt aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 ataattaacg aaaatctttg tcaaaacact attaagaaaa taatttcaaa aattaacatg Page 24 12689250 Sequence Listing.txt atgataaaat atagattgtt aattgttttt tttaattgtt ttcaagtaat at at gat gag aggt at gagt aat t at t at c gt gat at t t t aaat at ct at t aaat t at cc t t t t t t gtt g at t act gaat t t aat t ct ac gact t t acc t t t t ct gaat gt at t cagac t t t ct t ggt a gaaaccagt a gt t t gct agt t gt at agt aa aacgaat cgg t t ct ccct ct <210> 23 gat gat t aag tgt t t t t gca caaat t at ct at gt at ct at t t gat at gt a aaaagt t t t a aat t t ggct a ct ggt t t agt t t ct t gcct c cgatgagaag t at gat ccca t ct t gt gt aa ct cgt agt aa at gcgt caga gcgt t aaat a aat gaact at t ct acgagct gaaat t act a aaat t t t aat at aat t at cc at t at t cat a t t t t gat at t gcagaaat aa gt cagcaaaa gt t accat t t t aaat gcacg ggaat t gt gt aat t gt at t c t t gt aagt t g caat agat at cgt gat t aag ggt gt at agt t at t t t cat t t aat t t at ct aat t gat t t a gaacat at t c t at cgt at ca aaaaat at t t acat at ct ac t t t agt t t t a acaccct at a at ct t t t cag ggcagataga t at at t gccg t t gt t gaacc t t t gt t gaga cgaact at at aaaat gaacc gccaaat t gt t t t aaagaat agt t at ct aa gt ct ct t gt c gt at aat ct g t aat t cgagt gt at at gt t g t gcaaat ggt t t t ccgt gaa cact t cgaag at t gct aggg gat t gggt t a aaaaaaaaaa at t aaaaat a aat at aat t g t cgcgct aca agt act t gca t t aat caat a at ct t agcat acgt t at t aa t at t ct ggt a act t at t t aa t gt gt act cg gt gt t gaaat t act gcagaa at t t gt at gt aggt t aaagt aat ct ct t t c aagct t at gt aagcacgat c aagt agaaga at cgat aat c cgt t at t at t at at aaat at 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 aaggcacaaa agcaatcttc ttctcttttc acagttctgt ct ct aaat ga t g <211> <212> <213> 2002 DNA Arabidopsis thal i ana <400> 23 at caaaat aa acacaat caa gct t t gt aag at t gt at t t c t t aagct at a t gat t cat gt at accaat t c t t at at t at a ct agaat at t t gagaggt t a t t gat ct gaa aaacgcgt ga t ct aat ggga agct acgat t act aat cgt a caat ccaaaa t t ggat gat t aat aaggacg t t gat gaat t cat t cggt aa aaaat act aa at t caaaat c act t t aaaaa gcct gaccat cgt cacat gg ct t ct aggt a gt ct gagct a t gaagt cat g gt ggat gaag cgt t at at ga agt t at t aaa gaaaat gt t a t t t gt agt aa ccaact caga t ggat t ggga t t t ct t t t t g cat aaaagt g gt gat aaat g aaagct cat a t ct t t t t gac gagt gat act aat t cgaat a at ggt t gt gg aaat t gaat a at at gacaat gaagct at ag gccat t gt ga t at acaact a gaaggacggt aaaagct t t a at t t t t gt t g t t acaact ac t at aggccaa acat gacgaa t gt cgct t gt gaaagagt t a t gat t gat ga acgt t t t aaa gggaaact ca ct aat t aat g gaat ggggt c gt ccact t aa aat gaat t t c aaaacaaat t aat gt ggct c t t ggt t aat a 120 180 240 300 360 420 480 540 600 660 Page 12689250 Sequence Listing.txt gcttattttt gtttttctaa aaaatctaca aaaattggat aact t caaat t gat aaat ct cggt ccaaat t at t t aaacg aaagaaat gt cat ct agaaa t cacat t at c t t t agt gat g aacgt t at ca accaact ct a gact ct ct t t gt t t t at t t c cat gacccca gt ggt gt t t c t t aat aaaat accgaacctt t aat aat t aa act ccat ct c t agt aacat t t t at ct gt ac t ct t t t t t at t t t at at gt t gcagagaaac <210> 24 <211> 200" <212> DNA <213> Aral <400> 24 t t t cat ggga ccct gagct t gt t ct ggct c acaaaagcaa cggt t t agag ccct t acgga aggaaacat g t aaagat t gc aaat gcaat c ggt aaaacca t gt t t aaat c tttctttcac ct aact aaac t gt aaaaaac caat t t t cat t ccct t agac t agt t t at t a t gt t t ccat t at t t at cact agt at t caca t t aaagcat t at aat at gt g accacat t t c t t aat t t gga t ct t ct t at c t gt t ct aat a ct t t ct t t ct at at gt agat t t caact t ct t t gacat t t a t caat gt aga t at act at ac act t t acgaa caaagt caac t aaat t gt ga t aaaaacact aacat t gaag t aaggt t cat at gaaaat at agagt t t ggc t acct t gat c cacacacaca ct t gaat at t cccct agaaa t gagcaat t t t t t t ggagct ct t ct act t t agaaat t t gt gaat t aaaga ttgt t t ctta t t gt agat t t tg aaacat at t g t t t t aaact a at acat t t t t gaccgaactt aacaacccaa t cat at t gt c t t t ggat cat gagt t t ct aa aat gaacat t aat ggat t t t t act t caact cgt at t cat a cat gacaaaa ccat t gaccc ttccacagaa at ct t t act c t ct t t cct ct ct ct t gct t t accct t t aag aacact at at gct t ct t aca t ct aat act a ccagat t aaa ct ct at ggt a gct cgagact aaagct aaac act aaat ct t ct t aaat ct t t t ggact agt at at t t t gt t aaat aagat g t gt t t cgt ga agct t ctt ag gt acaaggt a accat gcat t gaccaat t cc tttctttgcc t ct cagcaaa aacaagaagc at t cat at ga t t at gt t t t a cacact aat g gct gt t aaaa cacaaact t g agcaat t aca agagaaaaca ttcct t t ct t t aaaat aaat act acacat c aaat at at ga ct at at at gc t t aat at gaa at t cct t t t t ct t t t aggt g aacacct cca t t at aaaat a ct gaat aaac t aat cat gt t t at aaat t aa aaaacattt t aagact agcc t t t t ct t aat caat aat t gt t t t t at ct t t 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 3 bi dopsi s t hal i ana at gt acct cg ggt ct gat ca ggt at cccac at ct acct ca t caat at t ca gggaggatga t t caagat t c ttgagccaga gaaact cat c aaagagagt g ccggt acagc agagaaaat c agat cat gac gt gagat t cc agt acgcagc cagaaagatt gaat ct t ct a t at t gaaat g accaacaacg cgat t acgt c agagaacgag at cgaccgag aaact ggt t c gt t cgaggca Page 26 gaaat cat ga aaat ggat cg agt at gct aa caaaaaccga aacgt cacaa accgct t t cc gt gccaggt g at gagccct t acgcgaaat t aat cggt t ct accggat ccc tttcacgaac t ggct t t t aa ct cacagagc aagcagtggc acgt gt caaa 120 180 240 300 360 420 480 12689250 Sequence Listing.txt gaatccaaga gaggcgtttt tgaactatag agacgt t gac at t ggaaaga gct t aaact c aacgt acgag ggt t aagat c agt gct ct cc t t t at at t t a agagt t aat t gaagatgagg t t at at gaca at t gt t at t t agaacaaaga t t agt t aaaa t caat t t at a cat agagagt gcaat at gat aaacaact t g att caaaaag t t at t aat ct at aggt t tag aaacaat acg aaacct tt ag agat aagaga ttggt t t t t a aaat t ggct a t t t cggat aa aaggt t t agt at ccaaacac gaaggtaaag aagagcagag agccact aga aat gt t aaca t gagct ct ct aagt agt cat agaaat t t at gt agt t t t ga aaccagtttt t gt t t cct ct aat t t t t aca cact cgt t t a acat t t t at g cacgat gcac aaaaact cga ccaaaagt ct at gat gacgc t aaaagt caa ct gt caaaac at gat t t aat tt ct t t gttt at caact t t t gat aat t gt c t aat ct ct t c caaaaacaaa t gt acggat t t t gat cccga gagactagag t gcaagaagt gt t t t t atcg t t gt t ct cct acct t t t ct t agacaaaatt t aact at t t g at t t at at at t gt cct t t aa t t t ct t at at acaaagat aa agat ct gat c t ccaaact ag t t ct t at gaa gt at t ggct a agt caaat at aacgt gaaaa cccct gact a t aat t t ct ca gaaaact act ggact aat ag t ct at at aaa at g t aagt at t t t caact t t t t c acacact t ct agt act at aa aat t cagact gt at at gt ac t at t t t cact caccagt gag cat at gt aat at at cagt ca cagaaagaaa agagaat aac tcaacggaaa aaat at at aa caacat cacg cact gcaaac t cgct t accg at t t agt caa cgat at t t gt t t acaat t t t t gacct at ag gt ct act t t g t t aat ccct t t at t cat aca aaggat aact cggtacgagc at t aacaat t ct at t t gat g t aaacct gt g t t t caaaact ccagt aagaa t gat aaact g t t at t t gt t g agcact at gt at gaat t t t t acact cacat cggt caagac ct ct t t aaca ct cacgcgt a acaacaact t gagt ggct ca ct at aaccat at at at cat c ggt gt aat aa agagaatt ag ct t aaat t ct gacaat ct t t ccagct t t ca t cgagaaat t agagcat t cc gct accaaaa t aaaat gcga t ct t ggat t a t at at t t aca at gt gt t gga at aat ggaaa caaat t at at at aagaaat g acat gt cat t gcat at gcat at aat t t gat t at ccaaaat ggct aaaaat gaaaagt cat t aaat acaat t aat cgggca aagaat cagt acagt ct ct a gt agt t t cga ct acact t ag gat at t at aa aaaat at at a 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> taacgttcat tatcaaaatt ccaaacaact aaaagaaaag acattgatta ttttcaagga aaagctcttt cctcagaaac ataccccaac aaaataaagt ttcgagctct ctttccagat atgttcctta acattcgagc tctctttcca gatacgttct ttaacattcg agcgctcttt ccagatacat tccttaacac cgagcttgca gcaaacatag acatcatctg acgcctgaac Page 27 120 180 240 12689250 Sequence Listing.txt aatctgatca tgggaaaggc tacgagcacc ccaattactc atacatatct cct cact ccc cct cgt aggc t t t t cgagc ttccaagaaa acagcggt t a ccat t ggacg gat gct at ga tt ct ccaaag gat cct agcc at cct aact a caaagat gt t ct t ggaat aa aat t t t t aaa t t ct aaacat t gt t agat aa t t aaat ct cg at t t cat caa t at t t t gt at at caaaagcc t cct t t t agg at t gt caact aaat at t at g acaat t ggca agggat acca at t aagt ggg aact caat at t caat cact g cacaact aca accccaat t g tt gt acccca agat agt gcc t t gccct ggt ctt cgaagga ccaacacat a t cgagt ccaa at ccaacggc aaat cgat gt at t t aat ct g at at t caaga t agaaat aat gaccct t at t at agaaaaaa gaaat gt t ac t gt t aaat t t at t t t gt t t c gat at ct agg ct ct gaaaat ct aat t t gca ct t acat t t a tgat t t t t ca t at ct aat t t t ggaat aat a act t catt ag gaaagt aaga caaaccattt ct t aat acaa aaacat t cca caagagaaaa aacact cct c t t at gt ct ag ct t ggct gt t cct caggaat gt t ggagaat caacgagagg ggat ggt t ga agcggt t t t c cat aaaagt g aaacat aat a t at t gt caag ct acaat aca t at aaacgt a aaat ccacaa t ct aaact ga aat gggagat gggatat t t t caagt t t aaa aaaact acat acat ct ggt c agt t aaaaga gt t t at t t ac t gaacat aaa t t t acat t t t agcaacaact t aaaagagct agcact at at at acacaaca tg t acaat ct t c aagt ct ccat ccagacgccg gcgt t t acag at ccggt gga gtgtgaggag ggt agt at ga gt ggat at gg agt at gagag t agat caat a act caagt t t t caat ct at a aat aacact t t at t t ccat a aat at t aaca ttgaat t t t g gct ggaat at ggggaaatga aagt t t t t t g cact t agaaa aat at gt at c caaat t aat g t cat acat t a ct t t t gtgt t cct ct t cccg cat cat t aac at acact t gt cacaaagcac tcaaacgaac at ct ccaact acaaaagt ga t gagacaact ggatcggaac cgaagacggc gt gaccgt ga cgct t t t gga agaagat t aa aat t gat gag ct t caaaat a t agagat aaa t t t t gaggt a t aaat t t gt a aat ccgcagt aaccaaaaaa agct t t gat g t t at gggt t g t t t gggctgg gagt cacgt a aaaat gact a ct at aaaaat t t aagcact t caat t gtt ag gacccct aac t agct act aa at ct t ccat t acact t t t t c cct t at ct t t tat t gaggag gatggcggaa t t gt ct cat c ggat gat gag caccgggtgt tgaagaaacg cgat caaacg ttct t ct t ct at agat at ca agt aaaaaca tcaagaggcg gact aagcat at act aaat t aat aat at t t at t t ccat t a aaaaaaaaag agaat at t t a aaat t t t gca cgct at cgga gt at at ggt a aaaagt agt g gt t caact gt t t gcct acga ct caaaccca aaat caact a ttat t ct t aa agt t t cccac t t t ct t t t aa 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> 26 <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> 26 gttgtaggaa aaacatctta tgaatttaga tccattaaat acgcttatat gcatgtgtgc Page 28 12689250 Sequence Listing.txt atgttcaatg tgtagataag ttctattctg aataaattga cgaataacaa ttattgattt caat t t cgt t t ccct at at a aagcct t gga ct t ct t aaga at t t at gt ct aat ct t t gat at at t t cat t act t agt caa gat aaaacct at aacat aaa t aat t t act c cct cat t gt c aaat agagcc gt gt agaaca ct cgt t agt t ct t ct t t t aa tt at aaacac caact accct t t t t gat t t t tttttgaaca t cgt ct gacc aat t aggt ca at t t agccaa t t aat aaat a cgat aat t t c agt acaat ag gt at t at t t a t caaaat act t t ct t ct gt a t at at t t t ct aagaagaaga taaagaacca t t gagt gt gc taccgaaaca t ct aact gat t t aacgt gt t t acgt gt t gc at t t ggggt c ttttcttaag caggt t caac t t t caggat c aat t agccga t t t t caagt a act ct t t t at ccat aact aa t t at aat t at tt cct t t t t t gt agagcccc gt gt agaacc agct t t t t gt gat t t t t caa gt aaat t agt at t t ct t gaa t t cat t t cag aacaaaaaaa aaaaagat at t at gt gacgg aat act gaat gt t acaacgt caaaact ct g agt t t at at a ct ct t ct cag agaagagat t cat aaaaaga at gt cat gt g at agact at c at t t cct t at at at at t at c at at gaaat a t accat t gat aaat t gt aaa at t t t at aat t cat gat at t caaaaagaaa at t at t caaa t t ccct ct ac cat t aggggt t t aaaat aaa t ccct ct aca t t aact aaca t ggcact gaa at cat cggt t t gt t gt gaag at t t t aaaca t gt t agt t aa t t t aagt caa aagat t t caa acat cat t ac t t t t t aaaat t ggt t t gt gt gaaaacacag gcat gt ct ca t aaat gt gt a gct t at t aat agagct t gag tg t ct gagat at aact t t gat c gt gagat aca t ct t t at t t t gat ccaat cg at gat t t act ct aat t ct t t gt at ccagaa t ct t aaaat a at at cat aat at aaat t tag aaaat t at at ccat at at t t at t t agaaca aaat t acat a cat t aggggt gt ct gat gaa actttttttt t t gagaagt t t gt gaat ct a t t t t t t agct at caacaat t cagt at aaat t t acgt aat t aaaaat aaat agggt caaac gt t cgt gct a att ccacaaa t at act caac t ct ct gcaga ccat aat at a gggaaaatga t at at at aca cat t aat gt c aaaacat gt t aat cagcct c t t gt t aat t a acat at at gc t cccagat ag t aacagct t a at aat t ct ga aaaact aat t at at gaagt a gcat t gt aat at aact t aat t at ggaat ag ccat at at t t t at at at cca t at cacaaga caacat gt t t tttgcaaacg ct agt t t t ga aggct cgt cc aat t cgt cct gt t t gtattt gt t gaat aac caacaaat at ct act ccaat t t at t gt t t a gt aaat at t t t aact gcaga t aat ccct t g aaaggaaact aat t gt agaa t t t t gt t t t g gaccaaaat a ccat at ct t g ct t gat gt t a at ggcaat t a t acct gt cct gt gat t t t aa aaat aact aa caat gt at t t gt t t t t t t t a tt at aaacac gt at t t cct c ttgt t ct t t a gcat t at aat t t t gt t t t ca aat t t ct cga t ggacagat g t aat gacgt g t t aat ct acc t aacgt t t t c t at caaaaaa t gat agt t t t aaaaaat agt ggaaagaaaa gat acgggt c at cat t ct gc gct t acct cg agaaagaaga at cat t agt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 Page 29 <210> 27 <211> 200 <212> DNA <213> Aral <400> 27 caacat aaca cgtaggcgag at t t gaat at aaaat t gcaa t aaggacct t t at gcacat g t aaaaccat a aat cat t gac gt t gt t t gct gacgaaccat aacaaat t t a aagcgaaaac t gt caagagt ccct t agaaa t gccct t t ac at caacat at act t t ggt ca t t ct gat t aa t gat aact t t t t t ggat t t t gact t t gaaa aagact t t ga t ccat ccat t gt cct at ccc at t t aaaaac gat t t ct aat gaaat gt gcc cgacgagcac gat t t gat cc at agt ggt aa ggaaagct at 12689250 Sequence Listing.txt 7 bidopsis thaliana t t aaat at ct t gt t t ct aat t cacat at aa t gct t t t cca t t at gct at a caact gcat g t ct t at gt gt gt aaagt t t t gt gct gt t gt gat aaat ggg tagct t t t t c at t aat t at c t gt t aaggt t at ggt ccagt aaat ggat ca t ct t t t t t t t acaat t t t cg t t t t at gt at acat t t t cca acat aaat t a t t t at cgagt aagt t t t t aa ggaagagaaa t cat ct t at a aaat ggaaaa t t gaat aat g aaat gt cacg agt gt gaggt caagtttttt gagat t ct ag at t at t t gaa aat aaat at g t ct at caaat caat t t aaac acat aaaaat t gagcct at a aat aat t t gt t t t t ggcaat t gcct t gat t gcat at act a aaaat gat t g t at t cccaat ccaat acaat t t aaaaacct tttggcccaa acaat ct ccc t t t t t gtct a acaat at at g t t t t aat t t a t act act caa atggggaggc caaat caat g t gat t t t aat at aagacct t aagaaat t at t aggat at t a gaaat t t aga taaaccagca t at gat at at tttttttttt at ggct t ct t gaaagaaaaa cct at gt aaa aggcacggct at gaaccacg taggggagaa t at gagagt a at at ct agt g gt aact t cac aat agt t gt t t t at agct gc t t t acaact t t gct t aacat aat gaat at a t aaact t caa at at t t aaca t ggat caat a aagaat caac gaaat t aggt cgat gt aat t gt ccaagt aa ct aat aaaat at t gt at t t t t t caaaaat t t t caaagct a t aat acgt t t ccat aat aat t caaaaat ag agaggacaaa accct ct gcg gaaat t t at t aat gt t t gag aacaaccaat t t t t at aaga ct t agat at a at gcgaaat g agcagaaaac t at t t aaagt at t aat at aa at gat ct gcg gt t at at t t t atgtgcggcc aat t t at t t t agccgt ct ca ttcaaacgcc t acaacct t c acat t t gggt t t t agtggcc at at t ct aaa t ggat t at ca cggact act a aat act at t g at act cggag t ggt aaaaac agt aaat gct gt t gat aaaa agggat t caa t at ggt t caa t t ccgact ca gt caacacca agact gcgac t t t t ct t t at at t t at at ct caaagt cat g aagt gaat gc t t at gcat ga t ct t aacgct t t t att aaga ct at at at gg t at at t aaac agcat ct t ac act aat at at aaggaacgat ct ccat t aca caat t agaga gaagggaaat acagt t caca t acgagt at t ggt t t cat ga tcaccaaaac t gcgact t t t at t t gt at t a t at at at at c t at at cat t t aat t at t at g ggt ct ggt t a aaagt t ct cg ttcacagaag caacaat t t c t agat aaat t caagagacga t gct at t act acacaat t ac agt t t aagt a caat gt gt gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 12689250 Sequence Listing.txt gagagacatt ataacataca tagataagat ataaaaatta aagcaaacaa aagtcatatt ttacttcttt tataaaaaaa gaagttaagc aataacaaac aaacacataa ccacaaagaa gacaaaacat ctttaaccaa aaacatg 1920 1980 2007 <210> <211> <212> <213> 28 2002 DNA Arabidopsis thal i ana 7 <400> 28 S ct t cgtgtga 0 cccgt gacaa rCi at aaaaaaca 00 Scct ggt t t t c rC1 ccagaaagaa aaaat act t c t aat cggcgt act aaat cct t ct gaat ct g aaat t t t t t a gat at t t acc aagaatcggg accaaaaccg t at t t cgaac at t gat aagt t t aaaacat g aaaat t acat t aat t t at at cat at at t ga t aaat t aat a t aaat t aat a t cat aaat t a at t ct t t at a t t at agt t t t t at gt t t t ac aaat t t t t aa at cgat aaat at agaggt t t gact gt gt ag acggat t aat at t at t t at t aaat aaccac acaaaagaac t aacaagt t g caacct aaca at at t cct t a aacat act ct ct t aaat acc t aaat t gt t t t at t t aat ag aaat cgaacc acggatggt t at gagt t t t a aaagt t t t t a aat acct gat agt t aaaat a t at acagt at aat t at t ccg agat aat at t at aat t gt at at at at t t at cat at t t t ac t t gt aact gg aat gt t acca t aat acct ct t act gt aat t at t aact t at gaaggat aat t t t agat at a at ct aaggcg at t ct t cat g at aaat aggt t t t t gaact g t t t ct t t t ac caaaat aagt cgcaat at aa t gggt at t t g t t ct aaaaat at t t t t at at cct accagat t caat gagca t t gt at aat a t t at t agagt ct at at t t ga aacct ct at a gt cccgagt t ttttttgaaa aaat gt at ca t at act t t t t t gt at caaaa t t ct agaaat aat t aggaaa ct aaat t aat cgt aaat t t t gt t t t agaaa aagat at gt t t t t t aat t t t t t agct agt a t aat t ggct t ct ggacat aa aat cccaacc t t at aat ccc acat act aat gt acat acac cat aat at t t t t at acat ga t t accct at t t ct gcct t aa t caagt aaaa t ggat at cga ccat at aat t t t at at t t t t aat t aat act aggaccagt g at t ct at gt a act at at at a t gct t aaat t ct t t t agt gt at at at t ct a at ct ct ct at aaaat t t cgc gagagtagt t Page 31 t aat t gt t at cacat ggcca aat t t gat t t at t t at gt ca ct gt gaaaaa aaat cggaac caaaaaat t a t aact t gaac aact t t agac aaat t t t t at t gagt at t t g t t t t at ccaa agat t t t at t gt t t ct t aat gt gagagt ct agaggct aat t acat aagac agt aat cgt g ct at aaat t a t aaaaaat ga aat ct at ggt t at at at at t cat at t t gt t tgt t t t ctaa t aat gaagga aaat t gat aa agt cccaaca tttaaccaac at aagt ct gc acgt gaaagc aaat at t gt a ttgaaaaaag t gaat t t aaa ccgagt act c gat t t at cct t ccat ct aaa t t gaat at cc ccaaaacct a t gt acccaaa act cgaaaaa ttat t t t t cc gt t cagacct t gat t ct gct t cagat at at tttccacgac aagact cgt t at aacct ct a cat aaat cga cct at caat a at gt aaaaaa ct t t gt t t ga acat t at t at aat ct t at ag at at t aat t t t t act aat t t ttacaccacc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 aat t t at t ag aatt aggacc t t t caaat aa aaaat caaca aaaact t aca t t t t aaact a agt acat at t at at act ct t t caaat cat a at t aaaacat ttt ct cccac cacaaacaaa 12689250 Sequence ttgattttcg atatgcaatc agaaat t gcc agctaat t t g t ccat at aaa aat aat t agt tctcatttcc tcctaatgaa caat ct caaa gcaaat t aaa tg Li st i ng. t xt tt gtt accaa ct aagt t gt t aacaatgacc at t t aagtgt caactaatcc cctattaaga t accct at aa at acccact a tacactacta cttcttgagc 1740 1800 1860 1920 1980 2002 <210> <211> <212> <213> 29 2003 DNA Arabidopsis thal i ana <400> 29 ct aat cgacg t aacaacaat at ct ggt gga cct t t gtt cc at t t aat gca caaat aaaat ccaaacattt t acagt agt t t gaat aat t a t t gt gt aaat aat aat t cat ggat gct tag t t cacccct t gcaat agat a agat t t ggcg at t gcat gt a caat t t ccaa tgt t t t t aaa t gaat aact a ct agt gat at gt t t at agaa ataaagcggg aagt t at gt g acgggt t act gaagcagtcg aat t cat gt t t ggt t agt gg acccct t ccc at ggat acgt gt gccgat aa cat gt aat gt t aagat at t t t gat t t ccct gt cgcat caa gt gt t t t aac t ggt t at ct t cccccaat gc cgt at caaat accat at t at at gt at gat c ct cacgccca t aacgt t ct a at cgcat aca aaat t agt gt ttaaaaaaaa t t act cat t a gct cat cagg cat t acct ac t cat caaact t t t t aacact ttctcct t t g ccaat gcaac at caaat at t gatt at gt ag at gat ct ct a ttccgcaaca ccat act ct c acat at aact at t at cgt ag at gt t at at t aacat t ggaa at t t t t ct ac gt agcaagt t t ct at aagt t ttgt t t t cat t gt t acagt t aagcaaacat t t aaat at ca aaat gaacaa t ct acgcat g aacaaaaaca gt at gt t t at t at aact gaa agcacat acg t t at at t t at att gggaccc tt cct acgga caagtt ggat caagt t gat a gt gctt acaa gaat ggct gc gaatt gt at t acgt cat gcg t at acgaaac cccgt t t at a gggt agattt gcat act aat gat act t t gt accgcgt ct g accaat gat c ct gt at at at at t ccat ct a at acat ct at t t t atgt t ct t at at aaat g gt t ccct aca t t gt at t aaa t cat gt gcga acgaaacat g gt at at aaca ct ct t t at at act aat gat t ct t t cgct ac gct t t t aagc t aat ct at gg t gaatt acat cgacgt gt cg aagttgtttt acact t t cgc acat t gt caa gatt act gat cgagt at at t cat at aggaa t t t t acct gt acgtt ccgt t t gaaat ct aa at at at cgt g cat cggt aat gcgtgtgttt t at agacacc at t gcat gca cat gt cgt ca ttgt t t ctgg ct ct cgccgt t gt caaat cg act gt t at aa caat t agggc cacat aat t t aat ccgt cgc gcat aaccac t t t at ct ggt t ggaat t gt t cgcat t t aat at cgcaaat a at agcgaaac gcccaat aca t t acat gt gt t agt ct ccat t t t ggcat at aaggt at gt g tgt t t t ataa t gaaat ct at t at aaat aaa t agt gt cgga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 32 act cacgt gt gct at t t at t t at ccagct a gtt gt agtt g aat ct ctt aa t cat t aagt a t cgt gaat at at acaat act t t t at t at t a ccact acaga t cccgcct at aaagt t t t ct at t gact ct a gt t t t at t t g t acat at t t c t gt t t cgat t gacggctct t acaagt gat c aagt at caca aaaaat t aaa 12689250 Sequence ctttccaggt tattttggtt gccgcatggc tattttgct a gttagtatat atactctaga t aagt aaaaa agaaaaact t t t aagat aaa at t t catt aa ttcat aaaaa t act t t at ag tgacctcttc gcctagttga ttt agt ggaa aaaat agat t ccttctttct atataaatca at g Li st i ng. t xt at t t cacacc at t t gt gt gt t t t at t ct at gaaaagacca aat acaat ac aat t t t t ct t ctt ggt aaaa t ct aat t aaa tcgt t t ctta ccaaaaaaag cgaat t t ggc t t gt cacat t gggaagaacc aat agat gt t t t agaat gga gagat aat ca gt t ct gaaag act ct acaaa 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 2001 DNA Arabidopsis thal i ana <400> ttgacgaaac aaagcct cgc gct at ct ccg aaacaaat t g at t at t t caa gat accat t g agatt gt t at t t t ct t gat a t t at t at t at t at t gct cat ctt ctt ctt a t ccct ccaat aagt ct ct ct t t gt t t at t t at gt t acaat cacaagcagc at ct aaagaa t aat t cccag t ct gcat t ca at gaaact aa t t ct t gat aa t agctt ggt c cat t t t t cgg aggt agt ggt t ggct cgaac aagaagaaga cct aaat t t a tcgt t t gttt t at gat t at c t aat caccca caatgt t t t t gt cct act gt cat t at caaa ct t gt gt t gt ccaaccgcgt aagct at t ct gt t gct gcaa t t t gcaact c at t acacaaa acagcaagat gaagcataac tatttcccat ttagccaaga agctgctgat t ct gct gt ct agagt gt agc t at t t ccgat tgcgagaggg agaaaact t a ct t t t caat a t cgt t t t cag t cct ct t t gg t t t at t at t a t agt ct t t at ct cct t ct ct at t cat ct t c gt gt gat gca at t cgacaac ccagaggttt agaaacat ag t ccgcaccat at gt t t cacc aaagcat ct g aagccaccca aaagt aaaac gacgacgacg t gt cgt agt t aggacaacgt at t aaact ct at t t t cat gt t t at t agt t a ccgacctttt at at at at at gt gt gt t acc tctgt t t t ct gtt agagcaa aatgggaaga ct cagt caca tgacgcagag t ctt ccaaac agagaagt aa t gct t ggaga tcaaagccca gaaagat ct t aggaggagt a aat ggaaaga t t aagt cct g t gaggt t gt a ct gaaaat ct t gacaaaat a ggt at ct t ct at acacaaca at ggt t t t ac t cct cct ct g gggatgcgt c agt t t ct t cg aagccaaaat agaaggagac tt agt caaag cact t t t t gt gact gt cagg t t acgat ct t t at ccccaag cggccat aag gagt gaaaat t cct gcct t a at att t cact ct ct caat cg t t ct cgt gct ccat at ctt t aat t t aacat t t caccat gt aat caat cct cgat t t cacg caggtgcagc ct aaaaccga ggctt cggat t aagt t t agc at t at gt t ca t actt caat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 aattgaaaaa gatggttcaa gacataccaa ccacaccatc tttagaagac aacttgagat Page 33 12689250 Sequence Listing.txt tggaccactg taataacaac agagact t gg caagagtcgt gt t cagt t gt agcgacagag aagggct aat ct gagat cat ggaatgaagg cat cagaggc t t t gt t t at a ggt at aaaag aaaat gat gc t gaaat t t t t ggt t gct t ct caacgaat aa ggat t ccagc agcagt agt a gt cggaggt t gacagtaggt at t ggt gaag gaaaaacaac t act t gt aca cccat gt gt t aat t ct t t ct t gt cccat ct agt agat agc ct ct t caaac acaagt ct ct gt agcaccat gcagagt caa ggaagaacca ct caagaaat aacaat ggag t ct ct gt t t c at t gaaat t g t t ggagat gg ttgt t ct t ac tttcttctgg att ccccacc t cat cact ca t gaaagcagt agt gt gcct t cgt t gaaact gat cgt t gt t t cct agt cca ggt ggat act t gt ggat gt t ccaat t gt ac gaaacaaaga t act t ct cat aaccaacaag gaaagcaaag gt t t gt t caa t gt agt gaat aat t cagcag ttagagaagg t acaagagt c at aacaaaat ct t t t gagat t t t ggt t t aa caaacct cct aagt agt caa gcggtgagag ggt gt caat g ggt aaat cat caat gagt at t agat gt aaa t at at gaat a at gaat cat g gaaat cccat t aagt t gaac t at aaat aga agcacaat ac 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2001 <210> 31 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 31 gctgcaccac taggatgtat gtgtgccaat catcgt t gaa gaaatttttg gaaaaagttt tttttaagaa aatcaatgaa taaatttgtt ctgtattagg ataaattatt atttacacac caagggggtc agttgacccc ataacatttt gtatataatg gtatattttg gcttgattca ctaatatttg accaccataa taaacttttc aaattaaaac gtgaagttga aattatgaat atattacttt caaacaaaag atacaaacat caaagaatga taaaatagta aaagttatgc attaagaaat ccattacttt ccttcgtaat gttggacatc tgaaccaatc ttgtgatata aacttaaatt agtagatctc tatacactca aggacttgca ttagaatttt gttcgttcga agagggtttt ttcctcttct ctaattgttt cgatataaaa tatatctttg ataagaaaat tgaaaaagta agt t agtaaa atgtccacat agat gt caca t gaat at acc aagt at gt t g cgt t gt ccag aaaat t ct t a t ct cagct ga t ggct ccgcc t cgt aacat t aat acaaacc t t acgt gt ga t gggcaat gg act t cgaagt agt ct caaga gt t gt aat ca t agaaat at a aaaaat agt a caat at acat tgat t t t t ca aat ggt t act ttttgttcac tggcggagct t t t acat gct cccct at gac act gccgt t g cat t t t t t t a at caaaacac aaaat t t t ac gct gcaccat cccaaggt cc t gt t ct ct ac agct gt t gat aat act t ct a aat gt ccgat gat agt cat c agacgagt at acgat gat t a at t aat ct ct aact cat aac t at t t aat t g t cat gt t aaa t t t at ggaca acat ct acgt aaaat aat aa t cgct aaaga gt t t t cagt t aagt at t t ac act at aat t a cggct t gct a gat t ggt t ga agaaaaaat t gaaact t aaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 34 12689250 Sequence Listing.txt aactacaaaa aatttggact atataataag aaatgcccgg taatctcaaa cctttttttt t cgat at t t c gcat gat cgt t at t aaat t a acaaaat t ga t t gaagat ct aaat t aagt g aat ct act t c aaagt t t t gt t at gaat t t t aaaat t aat a agt gt ct cct ccat caaaca t cat ct agat cgt aaccat t aaaagt agt a caacaacaaa at cat t t at c t at t t cgt gc taaggaagca ttaaagaaag t caaact t gt t gat accat t agct cgt aat t at ct t t t t t t t aagct at t ttgat t t gt t aatt ct acac t agt caagcc aact t caaaa t acct ct at a gaact aaaaa ct t at at t t a t ct ct aat at acgaat gaaa acat agt cag acaaaat ct t t ct t t caat g t accaaat ct cccact caat t t t t t t gat c t cagt at t t c t t at agt at t cccaat ct ag gccccaacct at gcaaat ac t aaaat ccat at g t act at t t aa agt acaat t a t act t gat at act gt aagac gaggaaat t c t t t t gct t ct gacact t tag ct t gat t t gt aaaat t agt t t ct t agt t ac t aact at agt aaat aagact at t aaaat ga t aat t aaaaa agt aat at t a cgaaaat aat ct t aaat t ag agt t gt ggt t t t gt gaaat t t ccat cagct ct t at ccaat ttct t t agt t aat ct caat c t t ct t at cat aact t aat aa aaaacagcaa t at gcgcaag t t aaagcat a at aaaat aat t caat agat c t gat at t t at aaaat t gaca t gt gcaccac at at at gt t t acat gt ggat cgat cgt gaa t gaaact cca cgt aat ct ca aaaat t aaca gcat t t aaaa ggt t gt at ct agt cact t gt ttagcgcaag t t aacat t ca caaact aat a acacat acat 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 32 <211> 1998 <212> DNA <213> Arabidopsis thaliana <400> 32 aagacttatt taaacatggt aatgaaaaga ct t atctaaa ctctggt t at tttat t aat c ct at t ggt ca at gact cgat gct t ggt ggt cgat t catga ttgct t ccat ttcctgtaat tatacagatt gagagaatta aatgaacttt aaaaaaaaaa actgaaaaga aaagaatat c ttatatttaa gtaatctaaa act t gtaact tgttttcaaa tgggcataaa ttagt t gggt atgtaaaagg aaagaaatga atgtaaagca gggtttgcaa aataaaagcc ct t cggattt act gt agt aa attctaacaa tcagat t gga ttaagtgttt aagcacccac caagtattcc ttttgttatg caccaaatta attacaaaac aaacat t aga ttcat t t t t c cccaaat aaa t t aat t aaaa cat t at at t a t agaaat t ga gaaaaact ac aagct at aat aacacaaaac ccacaaat ac at t caat t t t at t t aat t ac t ct at aat ca cgat t aat at t gcgaat tag at t gt at t aa t t t gtgt t ca t t at t agct a t t t t agtagg ggat t agagt tgtgacgaag aat act ccaa acgaaagaat cagaaaacaa t aat t t gt t a t cgacacagt at ggt t cgt t at t t ggat t t aat aat t t t a t caaat at t t aaagagccag t caacat t at t t at t act gc act t t gggaa ct aat t aaaa at aaaat ggt t t t t at cgaa gt accat t t t cgacct ggt g 120 180 240 300 360 420 480 540 600 660 720 780 840 atctacaaaa ctttaacttt tattccggta taggttttaa aacattgagt aaaagtgaac Page 12689250 Sequence Listing.txt atgtat t agg agat t catgt tcgtcgtaga taaact t gac caacactaca tat t agtat a t t accat ct c t cat cgat ca acat t gaaaa agt t at t at a aat t aat t t a act t gt cct t aaat t gaat a aagaaaacaa t at acccact t t ccct ccca t aat at aaat cat t at t gt a cat t at at aa t ct t caat ct t t t t ct t at t aaat aat t ct t at ct t t t aa gggt t ct t ga ct aat ggt ct t t gact cgt t cgt t t at gt c ttgaaagaaa gt t at gcgat aat t at t t cc t act ccaaag t t t at aaat a t at acgcagg t t aat t t t gt t cggat t cca ccagtgcgag t aaaagt aaa aaagat aaaa aat t cacaaa t t cct t cct a cagt t t gat t t cct t t aat c aaat ct agaa t aaat at g acat gt t t aa ctat t gt t t g t ggat aaat t t t aat acgt t at act aaat g cccgat t at t aaat aaagca ccat ct t t gt ct act at act gct aaaaaca ct at at t aaa aaaat ggaca t acct t at t t gcat aaat aa at t t ccaacc ttct t ct t ct t cat ct t t gt gaaat aaagg acat t t caat t t cagt ggaa gggagt t aga t t t gctggcc at t t at t t at aat t aaggaa cacgt aaaga ct t t cccaaa at t t ct t at t cgaat at ct a at aat cacgc gacat aaaca aat gacaaga at ct aat t t c t ct gt t ct t a agct ct t aag t t at ct t t aa aaacat aaca at acct aggt agaat t gggc t t t agat gt t t cat at at t g cat t t t agca aaagt at t t t aaat at at t t aaaaat t t aa t t at t at gt a t at at t cct a t at t aaagat aat ct t acaa t aaat aaacc ct t cact t t c gcaat at at t t at at t t ct t tcaaaaccca aaaat agaaa ttttaaccca t ct at gt ct t ggt t ct t t t a t t agat gt t a gcct agt t t t t caact ggag t at t at t t t t t t gaat t t t g t at t t at t aa aat gaaaat t ttcct t t t cg t at aaaaaat at t t t t t at t ct ct ccaaca t t t t ct ccaa t gt t gt t at t aaat t t acat gaaaaagaag 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1998 <210> 33 <211> 2013 <212> DNA <213> Arabidopsis thaliana <400> 33 acggggtcga atctacttgc caaaataaaa at t ataattt ccacgaacca tcacct t gac ttagttgggc tgatttgagc tgccacccaa aaatctgagg gttccagtat aattctttgg aacttttcca tagaaactat gtaattcgtt gcggagacgg agttcgccaa tagaagaaat actaccaagg gttaataaaa tgcgaacagt t gt t gat t ga aacct gaagt t agt gt ccaa gcataatatt gttgatgaca aaaaaaaaag gttgttttta atttcccaaa aataaaatgt cgtggggccg at ct t aaaga act aaat gat t aaaat t t gg gaaaact gt a ccat at t t t g t aat t agaat gaacccaaag aat aat at t g gaagggtaat Page 36 cagacgatt a acaat aat t c cgct aat aac t t at t t at t t gt at cct t t t t aaact t agc at gat t aat g ct at at t t gt t t gt t at t t g t aaagccaat cggccgaggc ct t gacat cc at gat t t aaa t at t t t t aaa t t at t t t t t a gcgtcgagaa gt t t act cca cat aagat aa at gaat t at a aaact t ccaa 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt at t actaatc tccaaacaca tgcctat t at ttacat t ctg ggctacacaa gt at gagt aa cgaat aagat gt acct acgt t at gagt ct a aaat aaaaaa t aagcgaat a gact gt t t t a agacgct cga t t aaaat aga caaaaccaat cat t at t t t t aaat at t t ga t t gat t aaat t cat t t at t a t at t t ggt at ccat aaat aa aaaaaacat g t aaact aat a cat gat t t aa aat aat acaa cagt aat gac ccct ct ct t c acct aaaat a t t at t gaat a aagt at at ag gt t t t t gttt caaat t acat aaaacaaaag t t ct aaat aa gaacaaat t a acaagt t aaa tcaaacccaa at aat at gaa caaagt gt t c gcaat act t t aaagagaaat act aaagaga at t t cacat c aagat ggat t tat t gt t t ct aaagt t t t ga cat t t gt t t c agat ccact t agct t t aat t at t t at ct t c gaaat t t gat cat gt gcgt c ttgt t t t cag ttgaccaaaa t acaat t aat at agat t gaa t t cagct gt t t t ct aaat t a t at t t t aaga aat aat at ga t caact t aag acagt t gaaa t at t t gt aaa acat at t agc aat acat at t aaaacgtt at ggt t aaagt t t ct t t t t gt a aaaagaaaaa at aat gcaat act aaaat at aact gt ccaa accact ct ct tttttgagaa gat t at agt t t cgaat aaac t aat gt at at ttaacggaaa at aagaaaat at acaaat t t at at t aaat c aat caaat aa at caact t aa aaaacat aaa at gcat at ac t gcat ct act t cat gt t aac at ct cat gt t t t t cagt t t t aaaaaccat c t agat gact a aacgat gact aaaaaaaaat aaat t t gt ct cccct t gaag t t t ggcaat a at g t at t t t gt t c t aacacgt cg at t acat gga at t aat aat t acagccgat g t acct acgaa aact cacat a gaaaat gt aa gaaaacat aa gtcgaacgga t gt at t aaaa t t acaat aaa t ccgt act aa aact ct gcac cat at at at a t at t at at t t acgaaaaact gcccaacagg aaagt at at c aat aat t ct t ccct aat at a ct t ct at ct t cacat gggaa t at aat t t t t acct aat cgt ct aat t t aaa aaggt t cat a at t at gaat t aagcat t gt c aaaaat t aat ttttttggat agt cgaacgt at gt aaaat c gt cat t at t g act agt at ca at aact agt a t aaat aagt a t at at at ggc agccaggaaa t t ct aact ac gt cat ct at c t t aaccact t caaaat t at a t at acgt aca aaaact t ct t 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2013 <210> 34 <211> 2005 <212> DNA <213> Arabidopsis thaliana <400> 34 ttgatgtatc ctagaggagc atctaatgt g gttcctcctc ctcctccacc tcacggtat c aggt t aatat gctctgcaat tgtat t ataa cacacattat atgtggtggt tgtagaacaa taagatgctc ttgctgtcaa actacgaacc atccatatca atccttttaa agaccatgt a ttagaatctt ctttcacact tttgtttgat cgt t gt gcgt gat t t ct t t g ct t gggt t ct t gct t at gt a t t gtgccagg t t at at t gct aacat t gt t c Page 37 t at gt aacac t t gaat t t ga gat t ct gaat t acgcgt ggg t at at t aat a t t at aaggt c t gt ggagat g t at caacat g at t gaggat g acagacat gg gct agt agcg at at cgt gac t t t t agt cct at gct t acgt 120 180 240 300 360 420 12689250 Sequence Listing.txt aacgtatttc cacttttccc aaagatgtat at gaat ct ga at t ct gaaaa t at ct gggat tt gt aaagca act act t gt t at ct t t aat t t gct t aagaa acat t gct t g aggt t gccca ccct cat gt a acgt t aat gt gat ct t t ct g agggt acct c gt gagt t at c ct ct gt t cat t t gt t gt aga cct gt gt t ct acaaat t gca t gagt gagat at at t acat a gaaaaagct t acaaggtttt ctgt t t gttg at aat ct gat t gagat at gc agcaagt ct t aat cagt cag gaccaaact c t t gat cact c gct gaaagt a agagct agt a agcgt ct aag cat t gct gat t t t t ct gaat t gct cct t cc t cct t acggt gat t at t cct t t ggaaat t t t cccaact aa aaat t at gaa cgat t t t t ca aaaccccat g t ct t ct t at t ggt gagcaat ct t aaagat c cgcggaaaaa at cagaat ct t gct cgaat a act t ct aaaa acaccaacca t acat gt cac gaaat ct gga t accct ccaa t t t ccat at a act gcat caa ctt aaaacaa gggt t t at t a ct gt t gt cat t act t cgt t c ct gt gcgt gt agt caggt t g gcat cat ccg at ct at t aag t cat t t ct ga ccggccaaat t t t gt aat ag ttttaccaac t ccgt t gat g t accacat t a gt t gt t gttg aaat ccaaat ct gt at gt t a ct t gt t act g at t t ggcat t ct ccat cggc at cat t aaga aagaact ggt aat ct at t t t gaacat t aaa aat act ct t t caat g agct t t t aga tt gt t t t gt t t t agct gt at at t agt at t t ct t t t t t gaa cgcagat caa t caaat gcgc ccacct ct gc ttcat t t t gc ggaacagctt t t ct gt at at agt caacacc aaagcggaaa gaggaagat a gagt gacaac t ct t cct ct a t at at ct ct t cat t at t ggg ct t t t gct cc cct tgtggca t t t gggt t t g ct t agct t t g gcagt aat ct gt t agat gat aacact gaca t ggt cccggt t gat ct acca gat t at cat t ct t ggat t t t at cgacagcg t t gt gggcat t gt t t gt caa at ggt t gagt at cct t agat gt cccccct c t ct t at ggaa accct ct cag gt t ggt gagt t gacaaagt g tgacaaaaag t t cct gcgt t gact cct t t t gt t t at t caa at ggaact t g t t gt t aat gt aaat ct gt ct gt agat aaga t gt cacaaca ccgacaaaac caaagt t t ca ggact aggt a t t agat t ct t t at ccat gac t ct agcat t a cact ccaat c tgtcggacga t t cgt aact a t aagt at aga gagcaatgga t acat caact ct ggt act t a acccaaaccg at t t ct at ca act gaaacac t aat caagaa t ggt t t gt gc taacccaaga agt t gaagac acct t ct ct t at gt at gaat ct t ccgt gga ct t gt ct t ag accat aacct ct ct caacaa t cact t t ct c 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2005 <210> <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> gtttgttgta gtcttgtaga gactagagat tgatcagttg atcacacttt gttcctatat gattagactg agcaatagca taact t attt gataaagtta ggactaccac atttgatttg taattaacta ttttcttata cggtatgtga ggtctgtgat ctgtcagact aattgaaatg Page 38 12689250 Sequence Listing.txt cat gt gg at at at ct ga agaat ccct g gctaaaactt gtgt t agt t a tgtc t at agt at ag t t gt gt gt gt t t gct cat ag acagggat ca ct t gct t act at gggt t t gc ttgcct t t t t aggt gt t t gt aaat at t caa t t gagcaat g t aaaact t at t t t t t aaaat caact t gat t caaggaaaac t t t aagcgt a caat aaact t t gaagat t gt ttagt t t t t a taacaaggac at t t t at t t t at t t t t gtgt aagt t gacaa t t t at t t at t at aat gagcc t at t t gt t aa ttttgacccg at aat agt ga at at agcct c taaacaagac aaccaaaaat cat cat t t ga t t cggt ct ct gt at at gcat gt ggggt t ga cagat ccct c t t t ct agct t t t ggt at gac ggt t gt gaga t gcat cacca ccat agagt t t t at aat ct t cct t t gaat a at at at t t t a t at gt at t t t agt cagat t g gt aagt gt aa tggcacaaaa ccaaat cct a aaaat acaaa ttccgaaaca aat aat at cc aaat act t aa t t t gaaat t a aaagt at t ct acgcgt at ac aagct accgt cgt t gat t t a ct gaact ct t aaaggat caa agaaaaagat tgacagcaaa accaacaat c gcat t t gt aa t aaat t t gat acgt t cagaa cagct at at c at at at at cc gaagt acaac aat caaagac gacat caagc cct cct aagt caagagt at g at t t gct cca aagaat aaaa t t aat ct aga gt gt t act ca aacaaaaaca t t agaat t t a at gt t acaca t ct act at t a at ct acacca aat ggat at a t t t t t t gtt g ccat caagt a at ggt gat ca ct t t gt t t t a at at cat gat ct t ct ct ct g gaacact act at g at t caaaagt gt t gt t gct t t gggagt t t c at t at t agaa aaat t ggacg t aat caaaat aaaacaagt g t acgaat ct a t t ccagaat g at gacgacgt gagat aaaat ggacat t caa gct at gt at a gt gt ggaat t aaat at t gt t ttctgt t t t a gat t t ct aag t at t t at aat aat t t caaat t t agt agt t c t gact ct t ga act gt aaaag gct t t t t gt a at aat t at ct ccat t ccaca gct act cgt c caaacaaat a aaact caat c tgt t ct t t gt cat gaagcat taaagaaaac gaacacacag agct t ct t ga gt t at t gt gt t t t gaccct a caaaaat ct t t t t gt aacac t aagaacgt t t aacgct t ca t gt at t aat a t t t t ct t t gt t aacgct t ac t at acaccgt at ccagaaga t aaagt t t at ggt t t gt aca gt t t t ct t ca at agacat ct t aat at gt t a caaaccaacg gaaat gccct acaat ggaag ccacgt t t ga t caact cgca aaaat aat ct accccaaaaa agt t t ct caa ct gagact t c aaaatggggg at t at aagat ct t at agat a ct gaat ct t a at ct t t t aaa t ct t ct cat c t t t t t t ctcg gat t cggat t t at at cat t c gacacgtttt t t t ct t t t t t t aagacgt ca tt cat act ag aagt t t t at a t t gt gt aggt gt gaaagacc gt at at t aat cat gct aaaa t t ct aat aac at ct t t at ga gct agaagt c t t aagct ct a t t aat t ct ga t ct acat ct a aaaat t gcaa aat agt ct ac t at t at at t c t gaaaaat ct cgaat ct t t c aaacacaaga 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 36 2005 DNA Arabidopsis thal i ana Page 39 12689250 Sequence Listing.txt <400> 36 acat at t t t g agt t t t cttt at acagt at t aacgaagat t caagaat at g t caat gct at ct aaat gaaa gt aacat ct t aaggagattt t t t agt t t t t at t t gaaacc acat gt aaat aat t aaat t a caggt gt t ac t aat t caaac agat ct cgt t t ct at t acaa at at t gcct c cat gat t agc agact t gaag gt t gt t t ggt gt t gct t t ga t t t t acat at t caat at t cc agacact aga ct agct agaa acagat gt ct t ccaaaaaat gt t at t cat t gct t at gaca ttcaaaggaa aaaagt ct aa t at at at ct t t caaaaagt a t t t gt t t t gt atcattaggc ccaaagaatg tttttgtgtc t cct aat aaa t gt aaaaat t aacacgcat t t aat t at aat t t agaacat t gaacaaat ag t at cat aaag agt act gt t g agagagt t t c t acat agagg agt caacaca cgat gat aag t t agaact ag t ct at t at ca tgt t atgttt t at at t ct cg cacact t aca acgaaat gt a t t gat at gt a cgagaaagcg t t cgct aat t act at t ct gt gct gacaaaa ct aact gat t aagagaggaa ttagaacaag ccaaact cat at t t caaagt gct caaggt c aat at at act ccaaat at t t gacct ct at a acagaagaaa ct gaacaaca agt gagt at c t aact aaaca t t ggt t at ga gt t gaaagct ct ggact gt c t cat t cgcca cat cagaat t gccgagtat t cagt gaaaat at ggaacaac t ct ggat agg aaaaaagt ca t t ggacagt c caaaat t aag t t at at aat t cgaagaaaag t t accaat at t gct t gcat g aacccaaaat t gat cgt gca ct aat at t t c cct act aat a aaacaat aaa t ct aaat agt cagt t t t t ga ccacgaaaga gacgacat aa agct at at at t t t t cgt t t t t ccaaaact t t aaacaat aa gat t t t caaa aaacgaat t c t t t t t aat t t t t at t aaaat at agggaaat t cgt t gt gat acgt caact t t gcaaacgt a t gat gaagcc at aaacat at agaagct aac gagggagat a gaaat aaat a gacgaaagaa ct t at at gt g ct aacgggaa t ct gaccaat at cat caaat ct at aat gct t t t gaaaat g t gcagagat a acgt gt t agt ggt t t t ct aa agcaaat ct t ggat t gagaa aact t t t gga aaat aat t gt ttgat t t t gt at aat cct t a t t t aat t aaa t accaaaat c gat t caact t act acgacat aaaaact t at t t aaat t aaa cgt gacacga aaat aaat ga at aaaacaac cact aat gat at t acaaat a caat cgt ct a gt aaat acgt cct gagat ca t gt gt cct ag t agt ct t gaa ct cct agt t a acagct agag t t at at acat t agt t t t at c t at t gat at a aat t cat at t act caaaaca ccacagtttt t t t t agt t t t t gt aaacaac gagat ct t ct gggaaat gt g cgcaaaact c ct cgt aaat t t cct t act at tt cgt acgaa t ct at acgaa t cgat t t at a t t cacagt t a t t aaaat t ag gt aaat at ac at gt t aaat t t ggat agt aa acggt t t at a gaagat aagt cact agt cct t t gggcct t a aagt cgaaaa t t gt gat cca t ct t ct at ga aaact t caaa t cagt t at gc at t aat at t a ct cagcgt t g aaact at aat ct acacgt ga at aat aagt t t aaat gct t t aacat acagt t ct gaaaact t t act aat t a at aagagact cagt caat t g tcacagcaaa gct t ggct gt aat t aat t aa aaat t at ct a at gt at gaag at t gt gcat t at ct t aact c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2005 aaaagaagat cgatcaacta ttatg Page 12689250 Sequence Listing.txt <210> 37 <211> 200; <212> DNA <213> Aral <400> 37 t aaaaccct g agccagattt agt act agt t ct t cat aaac gagaaaaaat agaagat cca t gagct gacg cagagaagac t t at agagt c ct gaagaaga ct ct aaact t gaat aaggaa gt gcgt t ggg gaagtcacgg at aacat t t t acat t t t acg aaaaat t t aa gt t t aaggat t t gat gat ga gacat cacat at ccaaaagg acaat gacag ct caagacat gaat t t t t gc gat t t ct t ca ccgt t gaagc ct ggt at t cg t at at t ct t t aat t t at at t t t t gt at gca dopsis thalana bidopsis thal i ana taaacaaaca accct agat c ct aaacact g caaat t cat a gaaact t t gc at t gt acaaa aat gaagct a ggt ggt gt cc agt agt agaa agaagaagac t agat gaaat aaat gaagag caagggatcg t t t t agat at acgacgt t aa acaat aaaaa at t aaat caa gt gaaat ct t ggat gcccaa t gagct t ggg t caact t at c at t acaaaaa t ct gt aact t gct t t gat ga t gcat act t g ggat t gaggg t t at gt t agc aaat aat t ct agt caat aaa acct t aagag cat ccacgaa aaccaact t a t t ccagat aa at caat ct t c t aaat acaat at t gaaaaga gagaaat t gc aagat aacga t cat caacca ccaccaccac t t t gaat t ct aat t t gggat t cgt t cacat t t t t aagt ct t aaaacat at aat at t t t ac at gaat gt t g aggaat t at t t ct cct at gc gt gagagt t c at at ct ct gg cccaaaagat ttgt t t t t gc t t cact aaac aaaaat t gt t at t gaaact g at t cgaat t c at act at at a act acat t gc agaaaaaaaa tcaggcacaa t act at cat a t agaagt t ga t ct aaaacct cacaccat gt gagaggagaa t gt gt t t aaa agct at t at c t at cgt aaca cgt ccgt cga gaaat t t ggg t ggggt t t at caaaggaaat ttggcccaac t t acaact t t aat t t t t aat tgtat t t t cg at t t t aaaat cacat cat gt accat aaact aacact t at t ct t t t ct t ag tttttatcga t t t aaat aca agaacat gt t caaaat t t ga aaat ccgt t t ct t t t t t t gc aaact gt t cg t cct t aagaa Page 41 gt ct act caa gagt ct t caa acaaaacat c at ccacacgt aagagt aaga gagaggact a gt at t t aggc t gcact t gga t t t t ct caag t t t caccat a at t gaat cga gtgggaagga ttcaaggaag agat at t t t t aat ggaaat a gaaaat agag aat gt t agga cat t t cgt ca t gcct aagat aaaagat t cc caacaaactt gaggt t t gag caacctt ct t t at ct t ggaa gaat at t at t at cat ct t t g t t t t agt acc at t t t aaat c cct gct t gca acat aaacgt agat ccaat g gat t gt gact t t t ct t cat g t gt aaaagca gacat aacaa acgt agat at aagagt t gaa ct ccat gaga aacggcgcca at cgagaaat t t agt cgcaa gat gaaact t agaggaagaa at t t aaaaaa gaaaagt aaa aagt t aat at ct t aat aagg t t t agggat c t agt t acat g agat t agaca t gct t aact t gaat cct ct c act t agt t gt ttttttaagt at t gt t t cat gagat t t at g t t caat t t ga t cacat acca t at t t caaca t aat at at t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 gcat caagt a gatcaat t gg gaagccaat a accacgagac ct gt ct agat t gt gtt agt c t at at at t ga tcaagagaca 12689250 Sequence Listing.txt aat t gatttt ctagtcccat ggt t gaaaag tcatggataa cttaatttgg gtcttattag tagctcaatt agtttgacca catacatctc tcgtttataa caacgaacaa tttgcacaca tg 1860 1920 1980 2002 <210> <211> <212> (7 <213> 38 1984 DNA Arabi dopsi s t hal i ana <400> 38 gcgcgagagc at gat ggt t c aaaagcagag t ct ggcaat g caacat ct ac t ct caaat gt ggt t t aaaac t aaggt gt t c cagagttt ac gt t t gaat ct ct t t caat t c aaaggct ttt ttct t gt t gg agat gacat a ct t cgat aca cgt t agat cc aacaat t gca cggt ccaat t taaaaacaaa ct gat t aaaa ct cgcaaaaa cgt t t t t gt t aat at aaaat ct gact ct ac cat t t aat t t t at cccaaga t gcgcgtt ca aaaat t t cga ccaaagat ga t caacact cg aaaccagat c at t aggt t t g catt cat t t c gat gaatt gc caccgt t gga t t caat agt t at gagtt aac at t ct t cttc t caat t cacc gagaccaacg t at t ggaact at aggt ccat at at gaagaa gccaat gt ct aat acaccgt at cagat gt t ccat gt gcat t t ct t at t aa ct aat ct aaa agat at t t t g t act t acaaa t aagat t ct t aat cat aaaa aaagct t t t c agaaaccgt a t cgt cgtt gc t cgt aagct t ct cact t t ct gt t ct ct ggc ccat gagaga aagt agt gaa t cat cct gt t aaaaaaacat ct ct t gt gaa gt t t t t t t t a aat t gat gga tcaaaagaga taggcccaaa aacacgacaa t t gact gt ag t t caagat t a tacagggaac gt ccat caac t at aaaacct t t t acaaat t aat ct aacat t aat gt t aga caaat t aaat agccgactt g aaat t t ct ct taaaggaaag t t ccct t ct c cacct t ct ct t cgt t ct t ga t ct gt gat ct t t t gt aat ct gtggccaaga t t at caggct ggat aaagt t gat gt aaaga aagagat at a gt ct t t gct a t t t t agt gat cgcccaat ca t t aaat at gg t t aat aat aa aaagat aagt cgcagat cgc ct t cgccgt c aat ct aaat t t gt at t acac tt gt gt aat t ccacgagaca at aacggct t ct at aaaagc t cat at cgcg ttgccgacgg gccggagcct t t ct gt acat t aat gcccat ctct t t t at t gatt gct t t t aggat gat t c acat t cact t t aaaact gag agaacaaact agaaat cgcg cggat t gt gg aat ct agt t a cgt ggcgt ga aagagaggct at cat t aaga t aat t acaat t cagaaagt g t caat agct t t acaaat t t g gt acaagt gg at aat t t at a gaaggt aaac tttgcagaag aaact t gt t g cgct cagt ca gaact ccgt c ccgt t gt cca at gt t agggt aagcgt gat c gt aat gcaca t act t t t gt g t gt aaacaaa t ggt t gt t ct at cact t at t ct t t t t aggt at gat cgt gt aaaaccat gt gt at ct ccat t aaat acgt t aat t at t cat gat t t t cgt t t gt at t t t t c t agt aagat g acacaaaaat t t t t ct t at t at ccaagt at at ct ct gaag caat ct t cct t aat aaagat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 agaaggaata tctagtatca acggaaaaaa agaaaaaatc tagagcactt cgaaaaatta Page 42 12689250 Sequence tggaccaaag aagattcaga ctgaaattat tactgtatta tgccgaggaa gaaaaaaggc attttggttt atattttgtt aacgatcctc t ctagctaag ct t agctata at t caatgt t ttaacacaaa caacattaat taaatatcgt ctcaaggaac tacgttattt cacaccacat atactcatct atatctctat tcgccggaaa aagtaaatca aaatgatcac cggcaaagac tat g Li st i ng. txt gt at t t gt cc t at t t gat at tgaacacgaa t t cact t cct t t t t ctt ct t at gt acgat g at t cacaaaa t caaat gt cc t cccat t at t t gt ccat aaa ct t ct ct ct c ttttagcggc 1680 1740 1800 1860 1920 1980 1984 <210> <211> <212> <213> 39 2035 DNA Arabidopsis thal i ana <400> 39 at t at gt aat t t aat t t t at cct t t t gct t at at gt aat a t t t ggct t t t accaaaggat at at t at aac t ccaaaacgt aaagt at at t at agaaaat t t at t caaat t t at t t at t t t ggaat at at g cat cat at at t t t ct t ctaa at at aact t c cacgat t ac cact t t t t aa aaaat t at gt agt t ct t agt t acgcaagat act acaat ag aagt ct at gc t gagat ggaa at gt gat at t tttcgaaaac t t at at at ca accct gat t t t cat t gaat a agt cat aaca aaaaat gat a t aat at gaag aacgat gct a aat t t t ct t a t cat act at t aggt agacca cact agat t a at t t caaat g aat at gaat a t t at gt cgat gat gcat t ag t ggat at aat gt agt aaat t ccacct t ccc at t t act gat gct aat gaac accaagct at aaat at at aa t t t gtcattt gt t aat gt t a t aaccacat a aat aacaaaa aaat agaat a t t gt gt ct ac aat cat t ct t gt at cat aat t t aaaat t ca gt at at aat a at ct t aact t t ct at at t t t cat gt t t at a at agt ct t t g gact caat aa aggt cagt aa acgaaaat at t at acat t at ccat t t gaaa t at t aaat at gat gt gagt t acaaat aaag at cat t t aca t gat at t t ga gt cat aacaa t t at t t aaag at at gt aact gaat t ct at t ct gat at t ga gt t t t ct t t c t at t t t at t a gaaaaaaaga cacctttttt t t aat gt ct a agct t t t aga at t t t t aaaa t agagact at aat at at t at at gat aaaca gact t ct aac t agcagaat a gaagaat t t g t gt at gt gat agaagcaaat aacct aaaag gcaaaact gt at caaaat aa gt t t t caat t t t gt t aacat t at gt t ct ag gt t gt gt cac t t gt at gat t at t ct t t t gt acacat t caa t t t gt cacag cat aaat at a t cat ggaat a aat t aaagat acat aat aac t acaaacat c aat t cgcat a att gaaaaaa t at gaagaaa t gaaat caaa aaat cct t t a at cat gt t t t gaaacaaaac t caaagat aa ggagt at aat aat aaaagt t t t t acaagat ctt ggaaaaa t t at t gt aca at gt t aagt a aggt gcacat cat agct agc at gat t t t ga t t aacat ct t cat t t cat ca t aaaat agt g gaaaaatt ag aat t gcagt t aact ct gcac ct acat t at g tttgcaaaca aaact agat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 tttggtgtaa attaacaaat taaaaacatt tggtctaatg Page 43 12689250 Sequence Listing.txt ttccgctaat ttaaacaaac atttggcctt ataagaaatt ttaggtaaga agaagt ctt t gt aaagct t c ccgt gt agt c gt gt t t t caa aacat aaact cact t ct caa t ct ct aat t c aaaat t ct t t acccat at aa ct t t ct ct ct <210> t t t att caca t t t t t t cctc tttttttaga t gt gct gt ca aaact t ggaa aaagat at t g aaacaaagt t t at at t at t c aggt gt ct ct ct t at cat ca t cacat agt c ggat cccat t aat cgt t gaa t acgat t cat gt t t gccaaa ggt caaagt c t agggct t gg cccaagcaaa cat cat cct c t ct cct ggt a aat at t caat cat t at t t t a t at caaaaac gaccct t t at aaaaaaaaaa t aaaact gaa gaagaaat at ccat t caat a cat t cccat a t t ct ct ct ct t caat gcat a ct t accat t g aagaagaaga gaat ct aat a aaaaaaaaac aacgaaact a tcacaaaaag t cat t gat t a cat cact t t c ct cat ct ccg aaaaat t ct g t aaat t t cat aat act t t ca aaaagaaacc cagagt t t ga t t ggaaat aa gaggcat t at gaagt t t ccg t t gat at aaa ct cgat ct ca t gat g 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 <211> <212> <213> 1996 DNA Arabidopsis thal i ana <400> aggctttttt acagaaaacg agaggtgaga gagat gt agc t t t catt t ca t t t t gtcat t aat t act t t a at at gt agt t t t at aaaaat cacgat aat g ct aaat t ct t aat act t agt at caaat act at t t gt t t ca gcct t agaac t acat t t t t t ct t t t t t t t t aact t t t gaa t gat acaat a tgaaagaggc t at acacgcg gt ggggt at g t cct t cct t c t gt acagt ag t gacgt gcat gaaaagt aat gt aat t acaa acct aat ct c at aat caaag t ct t ccgat c gat t at act a t at caaaat t cct aacat ct gt gt t aggca aat t t ct aaa t t ctgtgcga t aaggaact t at gagat t t t gagtgtgt t t ct caacacgc acat ggaat g ct at gt t aat t gaacagt ac agct t aacaa t at t t gcaaa t aacaaat t t tt agt t t t t t at caacat gg t cct aaat at t t t ggt t t t t gt t ct t t ccg gt acccat ct t ct t cct act agagct t gaa ct t ct gaccc agt caat ggt t gt acat t aa caaaacaagc gt t t ggt gag ct ct gat t ag at t t at t t ac taggaagccg t aacact aat cgt ggt t aaa gat cat gaca t t t cct t t ct cacgct caag gt t t t gt t t t gt t t t at cat t at gt at t ca t t gat cat t g at t at cat at t at aat gt ca aaagcctt cg caacacct t g cgct t t cat a cat t t ggt gg agt ggt t gt g ct t at aaaat ttctct t t t c t t t aagggt c gt cat ggat c cagaacaat a gt gaaaat aa t agccaat gg accgcct t t a ggt caaat aa t gact at t t c t at t t t ct aa acaaaat at a t acct aat cc t t t cgaat at acgattttt t ct aat t aaac tagt t t t t gc gt t agaaccc aaat cgcccg aaaaat agt a caaagt at ca aat t t gaat a t ggacaaaat t gt t caat t t gct t t t attt at gaggat ga gaagt cct ct at t gat aggt act t t t at aa t at t gt aaag t at t agaat g ccaat t t t at ct ct gt t cat ccaat ct gaa aaagt t ccat gat t caacag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 ataatcttaa aattaaggag tcctattgat aaagtcttgt Page 44 tcaaacgtac aaactcaat c 12689250 Sequence Listing.txt cacacaaaac cttcataaaa tacgatatag gaaataaaga ttgtttttgc gtgagaaaat act at at gaa t acacccgt g t t aaaat ct a aaat at t gaa at ct at aaca act t at at t a at aaaaaccc aaagaaagcc at aagaacaa aaaaagt aaa cgact t t t t c t at aat gaca cagt aaacaa ct caaaagat t aaaat t t t a cact t caaat at t t t agct g t gt t cat t aa t agt gat at t aaat aggaag at cgaact at agagt t gt t a acgaaaaat a at acat t cca ccat aaccat aaaat g tttaaaacaa agat t gt t t t actctgt t t t at ct t t t gct gt aat gcaat aacaaaccca aagccact ca t cgat t aat t cgcat cat ga aaaagtgagg t t t act t aat ttct t ctct t t t t gt at t aa t t t ct gaaat aaaggcat t a acaaat t t aa agt t at acaa t gt t ct cagc t aaggat aat at ccat t ct t caat gt ct t a at gaagt t gt t cct aaagt c cacaat ct t t t acat aaaca t ct t caagga aaaat aact g ggaat ct t gg t t at acat t a acact t t t ac gggt t t at at ttttttttta gaaaacaaaa t gaat gagt t ct t ct cacat acaagaat at at t gt t gt ga aact t at agc cgt t t cagaa cacct gcaga t t t gcat cat gt agaaaaac aat t cacagc gt t t gaat gt gaaat gaat a ggcgaggcgg ct ct t t gt t a ct ct ct t ct a 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1996 <210> <211> <212> <213> 41 2052 DNA Arabidopsis thal i ana <400> 41 ctgaacgggg aact ct gcac acggt t ct t a ttttttgcct at caaacaac cgat gat aat ct gt aaact t at gt aaaat a aagcaaat at gt caaat at a aaacaaaaca at ggct gact aaaaaaaaaa at t ggt t t t c aacat at t t t aat aggct ct gacaaaaccc t t at t gaaac t t gaacgt ca t gact t gat c at gccacgca t gt gat cgt t gat t t aaat a cgt at gaat t ct agct ct aa t cgaagct at aaagaaaaaa gt t at ggaaa accaaat at a aacaaaagag gct t aat gaa gt cgacaaaa gagaaccgaa cgaaaaccaa at cact ct at cagagagat g ttgact t t t g acaact at gt at act gat gg act agct t ag t t cacaagct at at at at at gt t t t t t t t g at gat gt t t t gt agact t t a t at agaat gg aagt caact c cgaaacaat c at caat t t aa aat t ct ccgt ct act act at ttacgcggcg caagaagcca aat aat gcat ggagt at at a t ggt agcct t aaccct aaaa t t aacat at t t gct cct cct t at t aat ct a gt acggagat aaaat cgt t a gat ct t t at t agact accct aagt t caccg t t agacggt t act t gaaaaa gcgat gcat t gcaact t t at at at at aagc cct t gat gca gt t act t gga gt t ggt t gga t gt t t t t gt t t t t t aat at c agt agt gcct acgt gaacat cat act gact at t t at gagg t t agaat at a at cact at ac t t ggt t caat caaaat t aaa aat cgat aaa at aaaacaat t at gt at t aa t agaat aaaa t t agt t t agc cat at acat c acat t aaaac t t t t at gaaa ttcaccaaag aact t gt t ag ct cat gaaca at aaaaat ca t t t aaaat t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 12689250 Sequence Listing.txt ccaaaaatat taatctttta ggaaattaca catgatgata ttttagcata t t acct ccaa t t t t gcaaat cat aat t t gt ct acat acac tgacaaaaaa cgt ct cacgt aagt aat t t t aaat t acacg t ccaagat t g caaaacat ag aat at aaat a aaat t t cagc t aat ct t tct at gcat ccaa ct t t t gt t t c at gt t at t ct at t at t at aa gaggct t aga <210> 42 ct ccaaat ga cat t t t at at aaat aat agt aaaaagt aat gat aaaagaa t cat t t caag tcct t at t t g aact at agga gaaat ct cag acgagcat ag t ggat at acc cgt ct cagcc gcccccacca at t cacat at t aaat t gt aa gt t ggt t gt a aat t at agt t tg ggat at t t t a gat t gt at at aat t cgt aga aaaacacaac t agct t t gt t at t acat gca t tct cact t g t acgggt act agt t gt cat g aaagagaagt ct t aagat t c aacacaat ca ttagcgcaca agat agat ag t cgct t aaat t t ggt act aa aagat at t t a gcat at ggcg accat agat t cat agat gat at t t t t gt t g t gt cctt t ct gaat t t gt ca aat at aaat t t at t t at t cc aat act aagt t gat caagt g cggagact at caaccaccac acggt gagat t acat at at a t at t t gat t t act aaaat aa t t t t t acttc gtct t t attc at aat cact a caaagt t ct t t at t aaat gt cgacaagttt aat gaagt ct aacgaaat at ct t gcggt aa t aaaat at gt at ggaaaaag gct t ct ccga ct ccct ccct t cgt t agat g gt t gt t t at a at gt at aggt t t t agt agca atgaagcagg t ggcggat at gt acgt act a ct ct t agt cc t gt aat ct t a gct t gt ccgt gt at t at aat aaat cat caa gat t caccat t acgaat agg gcagaccat a at t aaaat aa t ct cgt ct at ct ct ct ct ct t t t at t at ct t caat agaaa t t aat t at aa t ct t t ccaat aaagacaaga 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2052 <211> <212> <213> 2007 DNA Arabidopsis thal i ana <400> 42 t t t tct aaaa caat gagcca t gact agt ac t cat gct t ca aagcagagaa aacaaagaag gat at t gagc ttgaacagag t gagat t at a cgccact gaa gaaat ct ct a ccct gt aaac gat t t accct t agtt ct aaa t aaaccaaat aaaat gaaac at ccaat t gt t gacgaat ga aagacggtgg gagt cagt ag gaagaagaag aact t t agat aaacacat cc agat caacca cact gt t cca t cat aat caa t t t gct aaat acaaaat t ga agct agagaa t gt ccaagat t agaat cat c aagacccacc gaaat t t t ga acgaat cagg act t at act a gat aat agaa t ct t ct ct aa acaat cacac aaagagagag at t gct gt gt aacgaagct a aaccat at cg accaccgt cc at t ct gaaat cacaagt ct a t cat agagt c gtt gaacaaa aacct at cca cat gt aagag gagaagagag t t aaagt at t t t at ct gcac t aacat t t t c gt cgat t t ca t t gggat t ga ct caaagat c t t caagat t g acat ct tt ct cacgt t gt aa t aagagacat gact aacgt a taggcaagag t t ggact cca tcaagaacgg ccat aat cga at cgat t agt 120 180 240 300 360 420 480 540 600 660 cgcaagaata aggaaaaatg aagagaattt gggattgggg tttatgtggg aaggagatga Page 46 12689250 Sequence Listing.txt aact t gt gcg aagaagaagt aaaaaat aac gt aaaacat t aat at aaaaa t aagggt t t a ggat ct t gat acat ggacat agacaat cca aact t acaat ct ct cct caa gt t gt gaat t t aagt gat t t t t cat ccgt t t t at gct ggt t t t gat at at t accaaat t t caacat t t gt t at t cgcat c gat aagat ca gaccagaagc acacaaccac ttgggcaagg cacggt t t t a at t t t acgac t t acgacaat t t t aaat t aa aggat gt gaa gat gaggat g cacat t gagc aaaggt caac gacagat t ac gacat t ct gt t t t gcgct t t ct t cat gcat gaagcggatt at t cgt t at g t ct t t aaat a at at t agt ca at gcaacct t aagt act gt c at t ggt gt gt caat at at at gagact caag gat cgt cgt t gat at t t t t a gt t aat aaaa aaaaaaat at at caaat gaa at ct t aggaa cccaat ct cc t t ggggt gag t t at cat at c aaaaacccaa aact t t t gt t gat gat t cac act t gaaaaa gagggat t ga t t agcat t cg at t ct at act at aaaact ac aagagagaaa t agat aat t g t agt cct t aa at t gacat ac agacat g cacat caaag agt ct t t ggc cat at t t aca t t t acaat t t t gt t gt gt at t t at t at t t t t at gccacat agt t caccat t ct ggaacac aagat ct t t t t t t gct t t t t t aaact t t aa t t gt t agaac aact gcaaaa aat t caaat c at at act t t t at t gcaaact aaaaat cct t at t t t ct agt t t t gggt ct t at ct ct cgt t gaaatttcaa ggaagagagg ccaacagat a act t t aat gg ttaatgaaaa t t t cgaat gt aaaatcattt cat gtt gcct aaactaaaag ttattcaaca ct t aggaggt atcgacaacc at acat at ct at gt t gaat a t t t gaat cat cgt t t t t t t a t t t gcat t t t gt t cgcctgc aagaaacat a cccatggttg att agt agct tataacaacg t t t t t at t t a aaatagaaaa tagagaagt t t aggactt aa cgt cat t t ag aagatt agt t at t ccagat t aactttgct t ttgaggaat c ttct t actta tggaat t t t t t t at t at t gt ct t t ggagat gt acct t caa aaat ct caca ttgcatattt aacgtt aat a aaaagt cat g caatt agtt t aacaat t t gc 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2007 <210> 43 <211> 2013 <212> DNA <213> Arabidopsis thaliana <400> 43 atggatagac tagattccct cttggtgcta tgatcatggc atgcattggt ttatcaaaaa ctatcttttg gcagcctaat gttggatcgg aaagatgaaa gaggtgaaac atcaaat t gt atgacgcttc ttgactcttc acactctttc aatttgcttt acatgtttcc tcatccgttg tgtttatctt gatatctatt aatccttacc aaccaaaggt ttattcattt aatatggttt aagccgagt a agagct t gat t t t ct agt ag acaaaatcat tctct t atgt t at ct t ggt a ttct t ggttt agat at gct t Page 47 cat ct acgt a ttct t atcct aat t t aaagg aaccaatcaa atggt t ggt t gagacacaca aaat t t t aga cgct gct t t c ccat t t aaaa tgtat t atgt gcatagcaaa t at t t acaat atgtgacaac cacatacaca ctggacaact tacgaggaag 120 180 240 300 360 420 480 12689250 Sequence Listing.txt ccaatcc ttaatctcat atattacctg aaacccat ga t at gat ggat cat t ct t aaact a t t t ctgcttg ct ggt t ct gc t cct t agacc ggat gt t agc t t t t gaat aa accccat ct g caaat t t t aa ct accaagat ct t at aat gg cccacaat aa at caaact t a tt gat aaaaa t t t ctct t ct t t t aaacact ttggcaaaaa t aaat t t t ga t at ccaaaac act at gt at t t t agaaaat a t cagt at t at t aaat gt gaa t aaat caaaa aggaaat at a act agct t gc at cat aaaaa aaacct aat c ccacct t gag agt t t aat cc t t caat aat c at ct gt t t ga t t t t gacat a at at t gct ac agccct gt ct t cat gt aggt t at at gt aaa acct act aaa t aaaagt t gt ct at at acga t gct gcaat t t agat at t t t gt gt t gacaa act act t gt a gat gagt at a aat t at caaa t at gggt at c ct acgt cat t gaaaaact ca gaat cct t ct t acact t aac t t gat cat ga t t gt t gct ca aaacct gcca cat aat t t ct t aagct t t aa cacaattttt at t t ct t cat cccaat gt ct aaat t gat t gat gt at gat t at at at ct t agaaaaacat t at agt ct at t t t t catgt a acaaggaaag at at ccagaa aaat agat at ccaaaat at t ct aat gaat c aggat aggt t cat t gact t t caaaat t t ct at t t t t ct at cct t ct caaa at at caacca acat ct gt ct at t acagt t g ccaaaacgac ct gaagaat c at ct t gat gg t caagt ct aa gt ct t t t agg ttgt t t gtaa t at gt t agt c at ct t at aat aat t at at ct t agat gat aa ttat t t gtga t ct t ct cct c aagt t gacaa cat ccaaaat t t t t ct at at at at at at at t aat t aat t t cgagacgt t g aaacat at cg t ct t t ct t t t t gaat at t at t t ct cct at a at g cat acat aat gt agagat at gt gt acacga t caagct t t c t ct aat ct cc t gcggt t aat ttttttaacc ct t t gtgtta at gt aggt ga t aat cat gt t t aaaat at ac t aat ggcat c agat gt at aa at act t aat t agt ggaat gg ggaat t cat a aaaaaaaaaa at at at at at gt t t t t cct t act t t gat t g accgt t gaaa cagcat agaa t t gat t aat t aat caaccct gt aagct ct c caaaccat t g t accgt t t ct aagt t aat ga t at ct cat t t at t t acat gc t aat t gat t g aat t at gat a aaggccggat t gt at gat at ggaaat cacc aat t at ggt a t t t gt aat at aat at t gaca t gat t at aag t caat ccct a agt t t ccgat aat t ggt at c at at at at at gcaact t gt t agacct t t ac agt caaaact gt t t at t cgt t cat ggct t a t cacgt caaa 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2013 <210> 44 <211> 1996 <212> DNA <213> Arabidopsis thaliana <400> 44 aagtccatcc gcgataaaga gggatgagct ttaagttata ttttgaggat ttttatttct gttcaataaa aaaaacaact caacgtttat gcatgatgaa tattatacgt tcaagattta gaaaaataaa aaacaactca acgtttatat atcgtgcaat gaaatcatcc aatttaaaat atttaataat agtccttctg tataacaacc actagtgttc ttgtggaatt tttattttgt ttcaaaaaaa aacatgttat tttaaatttc caattttact tttcaaataa acttccacgt Page 48 120 180 240 300 12689250 Sequence Listing.txt ttattcttta atttattcat atttttgttt taaatttaac actcactctg attcattata t aggat at t t at t t t at t t g at t t t at aat t t gaaat aga t ct t t t t at t gagt t t t aga cat ggaat ga cat t aat aac ggcact aaat aaactttttt cacgcccaat gat aat gcaa gat cgagat c ct cat at agt ct gaat t t t t t t aagagt aa at aat aaagt gt t ggt t t aa gaggt t t t aa t t acggcaat ccaaat aaat t at at gaaca aggt t t at t a t caaat aat t t gt t ccaaac aaat agt t t t t at aaagaac act aaaaaca cagagat t t a at t at t t gac t aaaaat aaa ggaaat at ca cgcgcaaaat cacgaggact gt cat cct at t t aacgt cca gat ccat t ga t t t ggacat a at aat at gt t t gaat cat cc gt gccat gaa t t agt cat t c t cagaat aag t gt t cgt t aa ct ccaat t at atgcaacggg acgt t at cgg t t t at gat t g aat at acaat t agt t aat t a tgaaacacaa aaat ccaaaa t aagt aagaa act aaacaat t ccgat t gga aaaat g t t t aggat gt tat t ggt t t g t t t ct t aaag t ct aat aaag gt gt caat t a at at at t gga t aaacagt t g t aacaaaat a ct t agggccg cat gat at at ccaaact agg t at t t at t aa gcat cct at a at t agt ccaa gt at aact t t acaagt t aaa t t gagt at ca gt gt ggt t gt acat t t t aaa gct t agcagt t at t aaat t a t cgaaaccat at aat cat at tggatgaaga cacaacggt c t at at t t tag t gt aaacaaa t t t act aat a t at t t t gt at tgtgtgt t t t t aat ggt at a t aaaat at aa aaacaaaaaa t at t at at at agat ccaaaa gccgat t ggt t t t t aagt ca aaaaat aagt at agat t t ac t act at aaaa acat t aaat g t t t t cgaat t aaat at gt t t aaaat aggct cat t gt ggaa t gact ggt t t agat gcgaca t at aaaacac agacaaagt a t gggagat t a act t at at t a gaggt cat gc t cact cgt t a t cat cat aaa t caat gcaat t t t t aat at c aat ct aaaac t t t gt at agt agaggat at a gt aat gt aaa t t at at t t t a ct cgat ct ag t cgaggact c cgt t t t t at a aagaat t agt t aaact at at at agt ct t ac agagat cct t agaaact gat t t acaat t aa t gt t at t at t gt t aat ggaa acagt t aaaa gt ggt t t aaa caat at t at a cat aagagt t t gat at ccaa gtt ccacgca aat gaat cat acaaacaat c ct t gt t ct ct t t t t aaat at aagagaacaa at cat at aat t aat gat t t g at t t agt t t a ccat at agat gt cact aaca at ct at acga ct cat gct gt t t at at gt t c caat gat cga at aat acaat t aaat acat a t act t gct ac tt at gaaaga gt t t t gaaaa t agggt t t t c gt aat t ggt t at at gt gt at ccaaaaat t a t at t t at at a at t ccgaaaa aat ggact aa caat at aat a cct at at at a aaaat cgct a tccagaagaa 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1996 <210> <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> aaattaccgt ttaatcctct tagtgggacg gtgtggtttg attttttcac tttttggttg Page 49 12689250 Sequence Listing.txt ttattattat atttcagatt tcttttattt ttactgctag tagtaattgt t t t t at ggaa t ct t t ct cca at cgggat t t aaggt t t aaa t aagagt gga caat t gaat t gcacagt cca t t t t t t t gt c aaaagcccac t ccccagt at ct t t gt cgt g aaaacaaatt agt t at aaga agct at aat a at ccaaat t t at cagacat g at ct t aat at aact at acaa at ct t ct t at aagt agt aac tcatgt t t t a t t t gt caaag t caaaat cat ct ct t gcagt t caagt gcct t aaacat ggt acggt t t agt t ggt aacaat t caacagcat cat at ggact at cacat ct t at cgt ct cca <210> 46 aaaaact gt c cagct ccact t cggaat t at t at at agaca ccacgt agca tgct t t t t at ct t agt t t cg ggccaaagtt at t t ct ccgt t acggat t t g gcct ct t t gg gat t agaaaa gct acaat gt t acaaact gt tgt t t t gat a at gt t gagt t aaaaccaaaa ct at at at at at gcgt t t cc agcct ct ct c t cgct t aacg act ct caat g caggt at agt at aagt aact cct t t t t cag aacaagat aa t aaaggt t t a acaat aat t t t t t accgt t t t gtt caaacc cat t gacacc t acat acat a agt gaaaat t act gt ggcgt t t at t t at t c cagct at t t t ggagggaagt at gt t gt t ag cgt ct t ggt a at t aat gt aa t cat at at t t gt t gggat t t t t act t gt ct ct cat t agt a cgaaact aaa at gt ccgt gt t gaat ccgat ct t t aagct g t ct t t caat g at at at at at t aat ct ct ct t t t caat gat aat caagcaa t caact ct aa t cat gaaaac t at at aact c t t gaagat ct t gt t t t aaat ccat t t caaa aaat aat t ct caaact t at a gaaaat cat c acaaat aaga tg gt aat aat ga ggaagatcgg tt ct gt t t t t tcgcaggaaa tgatct t t t t acgat cgat t aat t cgggaa aagcccaatt gtat t gt t t t gggaagtat t t at t t caat c ct cgaaacat aaact t t ggt acat gagat a agtcgt t t t t tcaaaacaaa aaact t caca at at at at at acaaaact ag cagt ct at t a cccaaaacca aact cat gca ct gagaaaca t at caat gcc t t t gtgtgt t aact cct t ac ct t cgagat a caat at aat a gacaat gaaa t at aaat t ga aact aacaaa acat t at t ac gcat at ct ct at t gaat t ga at t agt gat g act cct t cgt aaaaat gt at t t ggat t gt t ggcagt at aa t t t t t t t t gt t gt aat t t gt ggt caat gaa at at t t gt ag gaaat at aaa gt cagt ggt g gcacat ct t a gat at aat gt t t at ct act t at gcgt t t aa gat at t ct t c acat ct at at ccaaaaatt c aat aaaat aa at cat at t at gacact t aac t aat t aagat at aat acgac at gaaagct a t gacct gagg at t acat t gg act ct t t gt c aacct t t t t g ggt agct t ct agt t t t gt ct t t cagt cat c t gt aat t aaa t t t gt gct aa t gt t gaat t g aaaagagat t t cgt t t t t t t t gat ggaaat t t t gtgtgga at ct t gaaat at aat t aaca aact t aagt g cacat gagct aggggaaat a t agggacat g t t t gt t t gt a t acgct at at at t t cacagt cgt gacaagt at t gat at gg acat t t gaca t caaat at t c t gaat aaacc at t t t aaat t act act aacg tt gat gcaac cact t aacaa gaacacggt t aat at acat a t cat aaat t t t ct t ct acaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 Page <211> <212> <213> 12689250 Sequence Listing.txt 2005 DNA Arabidopsis thal i ana <400> 46 t caat gat gc ggact t t acc ggaagacgcc t t gt gat ggg gt t t gaagat gcgt agat ac at t t cgcgt c t aaacaccat aagaaaaaaa ct t gcagat g cat gacggt a t cct t t gacc at at at gt ct t t t t agt caa gaat t ct t t t acgt t gcaat t t t agt t aat aaaaaat ct t at aact aat t ttttttcgac t t aat aagat caaat gat t g ggat gcat gt at at gt agat t t ccaact gt t gt t t t aact gt ct t agt ga t gggct t aaa agacaacat c ct ct t ct t at ccaccgcaga cacaaact ct at at t t caaa t gagaact t a ct t gt ccact cgt ggaacca ccagct cgt t agaggcagag aaggat ct at t ct cat t t gc gt t aacaagt t gacgaat aa gagacact ag gtagagagga at aat aat aa cgaat gt aag at t act agt c t t gggt gt gt ct at t t t gac ct gat t at cc agt gat at t t t gt caat t t a gct aat t ggt acgt aat t gg gggat caaat at t t t aat ct tgt t ggcaaa at at ccgaac cgt gagacac gaaat t t agg gacacaaat a t t ct acaaac ccaat gat ag t at aat at t g aggat t gaag agcaaggct g t acat t gct c cccaggacgc gt act gcaag aacaat t act gagcgaaacc t act caaaat aggcagagt a ggcgat t gaa t act acggt a tgaaacacca gt t gaat aac aggatgaagg t at aaaacaa t t agcaagt t t ct t t ctat t aaagt agat g t ct t cct gt a t gt ggat gat aacacat ggc aagaacat gc aat t gt t at c act t gat aag gt at t gt ccc t ct at at gt c acgaaatttt cagccatttt t t at at acac aat gat t ggt t caact acat caagt t agt t ccat t gagt t at at agt t ct aaaaagggt a t ct t t caggt cgat t agaag cggggat t ga ccggatgggc gctgtttttt t t ct cat gt g gaaaccgcgg ccgcgt at ct gt gt cgat aa aaat ccaacc at aacgaaaa aagaat at ct caagaaaagt cgt t t at gaa at cagagat t agt aaacgat t at t gct at t gt agt t t aga acgaaacat t gat aacgt cc at aaaaact t act ct at aag t t t t t ctt gt caat t ct t aa t t t t ggt ggt at at at at ag ct t aagaaca t t t gatgtta t aaact act t Page 51 t accat t aag t gt cggggt t caaagt t gct tgaaccgaga aacaagggtt t ct t gt t cgg agt gat aggt t ct t t ccttt ttat t ct t gg ct gt gat t ct caaaacgct a agt at caagt gaggcatgat ct aat aat t g gt cat t agca cat ccaaaac t gaat at t t c t gt gt ccat c gaacct ggaa caat ct gt t c aaaggt t aaa acaggaaacc aagt at caat aat t t t aat t gcaagt aat a t aagt agagg aaccccattt agcaat at ac t acgact t gt acaaaaat ac at t aat ct ca acat aat t ga caagat gat g tcccggacag aat gt gccgt aaagt t t t t g aaaacgct ag aaagaact ag aaacaaaact t aat gct agt ggt t gt ggcg gcggct gt ac ct aagcacac gaat aaagt a gaat cat at g t ggct act ct t t t agt t gca t t aat gt cat ggt aat t acc t t cgat t t at acat gcaagg at ccacat ga acgt t gcggt t at aat gcat t gat at gcat t aagt t at t c caaaat acat aact agaggt gt at t t aat t accat gcat a at caat t t gc aacaagcat t at ct t ct aac gt at aaaat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 12689250 Sequence Listing.txt gcttcaacct tcatcaacaa ttcattatca atcgagcaaa aaacaaaagc taagagcaac aaactaaaga ttttttaaga aaatg 1980 2005 <210> <211> <212> <213> 47 2003 DNA Arabidopsis thal i ana <400> 47 gt cat t gat g gat aat t at a aat t cat at a t act gt t at a cat t acagt g gtatgagaga acaat at t ac aat t t t ct t a act gt ccct a t ct aat acat cccaaacat g t gt at at at t acat t t t gaa t acgcat t ga at gaaat at c t gt gat gt t g gt cagagcca gaaat t gt ca t cagt at gt a ccact agact gcaagt act t at cacat t cc aat aagat ga t gat aaact t aaaaaggtt a at t aat t at a t t t gcgat t a gaacaatttt t aagat ct ct at at gt agaa aaat cat gcg at gaaat t t t acat gcct t g at at cat t aa agcat caagg cat t t t gttt at acggcaat t acaat at at t agt at t t ga t at cat ccaa t gacat t ggt t at agt at ca ggcgaaattt gt t ccat at t t ccaaccat t ct caaagt t a at at aat gaa gaat t t t t aa aat aaacaaa acgt t gt aaa gt t ct t cttc gt aacat gcc agcat gt aaa t t gt aacgaa aagccagacg at acat t t ag t aacaaact c aagt t gat ac t agt t gt gga gcgtgaaaag tcgctttttt t t aaaaacgt aacat gt t cg t at t gt ggat cagggat t t a gt at gaaaac t gat ct t t ga t t t at aat t a t t ccagat t t t ggt cct gac ct t ggat aaa cacat agcat accat t aaaa gagct t ggt t at aat t t aat t t aagt ct t a at aat t aat c gt t aaat t t t t t gt cacct c act gagt t gt aat acacaca t t t gtt gttt agt t gt aat a aaaaaat aat t t t t t t caat aat cat t t ac act t t at t t a ct t ct t act t ccagt t t agt taaaacaaaa tgaccgcgac ccct t aat aa at cat t t t ct gt t gcat cgt cgt aat t t t t gt agt gt aat aact t cat at t t t aagcat t aagaaaagat t t gt aaaat t at t ggat gaa gat aggt t ct ct ct t t gt gt t cct at agt c agt t aat t at aaggt agat a t cgat ct t gt caaagacaag cat gt cgt ag t gt t at aat a t ccacagcct cact at ct cg at at t t ct aa t t t caaat t a ct act att ag gt gaat t aat gt t t t t t t aa cat t at caac gcccattttt t t t cat agaa t aacgacgat ct cgcat gac ttttgttaga gat at cacaa gt ct t gccaa ggcgatgggt t cgt acgcat t agaat aaaa ct gat gaaca t aagact aaa gt aaggt gt t t t t t caat aa gacaaatt ag ct aat cat ac t t t aagt agt cat ct at aca t aaat at aca t at agat t aa t gat aagct c accat at aga t gt gaaat cg at t at cgt ct t aact t at t t t cgcat t t gg t cgt at at t c t caat t at t g t t at t aggat aact aaaaaa t aaaat t agg t t aat t gt ct gt at ct t t ga t acagt t ct t gt aaaat t t g aat gat at t a gaaaccaat a t cgagt t t ac ggt t gaacca cagt t cct ca aact ggact a gt t cacaat t t caacat ccg t t gt aaaat g t ggt gagat g gt t at t aaaa cct t t t t at a t gcat gat gt t acat gcagt ccaggccact 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 Page 52 acat t acagc ct aact cgat at ccacat t c agaat aat ca at t t aaat t t aacat t gaat t aat ct ct gg gcct cacct t ct aacagt t t t caaat cat t ct ct t t ct at t gaat t agcc 12689250 Sequence Listing.txt atttacttga taattaagac aaatatagaa cattaaaata agcctcctct caaat t gtca at atctagat ggagtgttac ttact ct t t a t t t t aat at a t cct t caaca gat cat cat c attatatatt taactagccc aaattgtacc atacctatca ctactataaa aagtgactct ctaagaactc caaagattag at g 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 48 2003 DNA Arabidopsis thal i ana <400> 48 t t t ct caaca gt acct ct t t at cagccgga cct agaact c cgatcggaga agagggagag accggcaat a ggcat cgt t a gaagaaat t a ct cgt t cgt c t t t t ct t t cg t aaat at t t t at aaacggt g agt at at aac cct t t gt at a t caaagagt g t gt gat agag gaaaact agg tttttttttt aaacagct ag gt t gt t ggag t gt gt gat gg aaaat t caac gt t ccat acc gt at t at cat t gt ccaccgg gaacggt t ca cgaat t cccg ccgtagggag gt t t cagacg gtagaagaag tcgaggaaag gct t ct t ct c t ccgt cgcca t agct t t ct g cct t t t t t ca at gt t aat gg agct aaat t t aat gact cgt gtcggggaaa act t gt aaaa aact ggat t t t ggcgt gt t t at at t gaaaa at at accaaa cat t t agt t g cat t agt ct a ccct t t t t ga ct aagt at t t t at aat act c gatcggcgaa acggat t ct c cagcgcgaga gagt gt aat c gt ggt ggt gg ttgaaacggc t t gatgaaga t cgt ct t ct c at t t t t t t ct ccgt caaact gcccaaaggc tcaaggaagc cgt gt t cat t agagt ct act aagt gt at gt act gt accct ct gaacaat g t t t aggaact at t t gat aat at ccaat t ag acct cact t a acaaat cat t t ccaaact at at t agct t ca t gt acggat a agt tt ct t t a agacggagaa gggagagtga cccggtaggg agcgt ct aga agt gat ct cg gaagct t ct t t t ct act at t t ct ct caagt t t ggcct t t t aaact gagaa gat cat t agg aaccat t gct agaaagagag t ct ggat aat t gt t t act aa agact t act c at caat at t c caat t act ga ct aaaat gca t t gaccat t t gat cagat ct t ccgaat cac t t cccagcgt t t ct ct ccag gcagat cgt g gat t gagaag t t t ggt t cgg t t ccaagt gt at gaaggagt ct gt t gct ag t t aat t t t ac ct t aacaaca agcct t acaa gaaaccacat t t cggat t t a ct at ccacat cat t t t t gaa gat ggt t t ga gt t ggat at t tatgt t t gt t aat t t cagca t ggaaagt t t accact at t a t agt aat t t t gt t gaaaat t t gccct cgcc gt t gacgact caccgcct ct atcgggaagg gagat t gt gc cggctgcggc gact t t cgag t t at cagat t at t gt t t t t t t at at at at g t acgt t t cga aaggccttt t t t t gt t gtt g tcgaggaacg gt t ccgat t c gt t agaaat c gat gt gt t ga gaaaagaaaa gaggact at g at agt t t ct t t t ggct t act t t t t agtgag t ggact aggg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 ctcagtctca tattcgatgt gagactttta acaaacaatc ctcttagaag ttttaccagt Page 53 t ggt at at gc gat t t t t t at aat t cat aaa t t t t t gt t t g ct at t t t aag gtt agcagac gagtt gaaac tt gt ct ctt c ttgact t t t a cact at cat t t t t at t t ct a cgct ct ct aa t t t agaagt c cat t t gaagt t t t at t t t t t t t gt at caac cat ct t ct cc aaggaagacg 12689250 Sequence tgctcttttt tctttctcat tatatataac tatatatttc aattttaaat acaat t aaat gaat t atatt act atcacca tttatttaat ttactaatat ct t aaat t aa at t t t gt t cc gatt caact a ccagt t t aaa t ctt gt caaa gt t t cct caa at g Li st i ng. t xt aacgt t t at a aaaat t at t t caat ct t caa at t aaat at t gaat gaatt a t agt gt ggac aggacacat c t act t caat c t t t t ct t t ct cccct t t t at t at at gt t t t acat agt cgt aat aaacat t acat ggat ag t ct gat ccat tt gt aacccg 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 49 2000 DNA Arabidopsis thal i ana <400> 49 t t t gt aat cg at aat t t aat gat agt agt g tttaaaaacc at at ccat t g t gt t at t aca t at gggat ct t t act gt t ag t ggct t t gca t aagt gaaca aggct gt gcg at ct t at at c t t t ct ct gat at ggct at t t ggcgaagaag agagaat aga t at gagt t t t tgttctcggg gt ct aat t at gat agt gt t g acgt gt gat a ctt at acggt gagt t t agat ct t at cagt a gt t t agaggg gt gt t accga aagcat t at g t t agctt aat ct at t t at cg ggaagcagca aact cagat c at gcgccaag t t aat ct t t a ct aact t aaa cggt t t gat a gaat gact ct ttgatagaag at acaat t ca tt t ctt gaat aaat t t gt cg aaaat t t gt a t t at aaaaag taaaggtcga atgtctgaac t t gaagtaaa ccaaggaat g cat t t t cct t t aaagagtt c ttt ct gcaac at ct gagttt cct t t gt t ag gcccggactt t cct gt t gat gat aacaat t ct at ct gacg t gt caacacg att cgt cat c gat caaaaga t agct cgt gt gt t t t t cgt t act acat t t t at t t ggagt t t gt t t caggg t t t t atgcgg t acgt ggagt t t gat aaat c gt ct ct cat a tt t gagaact t t aact t ct a t t t t t t gt t t ct act gaaaa gct aaccgaa agaaggtctt gt gt t t acag tt gct ccga gt gaagct gt aat caaggag gat t t ct t t c gt gt cat cga t gct gcaacc t caaat gt aa t t ct gt gt aa t ct t t t atga ggccgatct g ct agccgccg t aat ct t ct a t cat gaat ct t ct gaacaaa aaaccat t t g ct act gt t t g ccat t ct aga t t agt t at aa gaaat t t at a aaacgaggtt cccaact ct t ct t ct t ccag taaagaaaca t ct t gct t t t at t t t caggc cgaact gt ag cgt gt at t t g aaat at gaat t cat t t t ggt aaaat t cgt c gct gct at aa aaccaaatt g gatt ggtt aa cgcaagacct agt t t caaca t gt t at gt t t t gaat ct ct g actt act gag t t ct t t t at t cat cact ct g ccacgt acca gt gaat cat t ttct t t cgt t t aat gt gct t cacggct aga t caccgtt gg t gtt ccgct t act at gagac t t t t atgtgg cat aaaccat ct t t gct at a t t t t aat aat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 54 12689250 Sequence Listing.txt gctgtgttta cttcgtcatc gtatcgaatt atacggctgc tcttgttaag at ct t caaat ccaact t ggt t t at act t ga cgt aaaacac at t t acat ag t agcaaggt t aaagt gacat t aaat t gaac aat gact t at acaat ct at a t t aat t t gt a acgt aagat c aaagtttttt ccat cat agt t t caat gaca aaat at gcca acgcagt caa gat t t gt caa t tct ct accc gcat at t t gc t gact t t gat cgcat ct at a t at aaggagc t caaat at ga ct t t t gt aag t cat aaagt c ct gt t t agga aat t agacat at aaat caaa t gct ggcaaa at t t t t t gaa act caagct a t aat aaacaa t at at ct act taagaacaaa caaat at gat aaaaaat cag gct t at ct aa gt t t ct cgt c t t t cgt at ga aat aagcaat aagat at t ct acgt t at gt a agat at t agc aaaaaaat t g cccaat t agg aaaact acgt t aaccaat t c at t gcacaag at t ct aaaac taaaaaaaaa gt cct aat t t ct t t t gcat a ct t gat t act cact ccat t c t aagt acat a caact agcaa t gat gacaat t agct aat ag at gagt ggaa ttttgttaca at t aacaaat acaaaaaaaa ct caagt t gg t aagat t t ga tgacccgact t acat t at at cat at caat g t t gt aaaac 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> 2005 <212> DNA <213> Arabidopsis thaliana <400> tat cgt aaac aaaaacaaaa tcaagl tact cgt at a aggaaacact aat cal gact aacact tggaaaccga t aat g aacatatcat agtcaacata caaaal gaccaaatga ttcacatggc aagaa aaattgagct tctcatattt tctttt tatataagga aatcaaaaat gt t aal aacat t aata aaggcacgaa at t at caaagacggt caaagccaaa taaaa tccaaatggt tcagacaatc gtgga attctctctc tctaattaat tttta atggttaact aataaaatat ttatt taggtatatg tgttagccta agttt1 tttttttgtt gcaaggctta catct1 gcctgagtgt acaaaatatt aatta agaatgtttt gaaagacaga aaaga! accacatatt atcatgatgg cttcal aagt tagat at gaa aaaat at t t a t t ct gcaa ggt t at gct cccca acttt at gt ttttc aat t agttt ggaag t cat t ccaat at t t gt t t t caat a act t t at at a gagt cat at t agt aat t tct t t t gagct t t aacgt aaat t t gaaat gat t t agct ct cca cgcgccgct t gat t agt t t t ccaagat t aa t t caaaat ag t aggaaact g aaat aaat t a cat ct t aat g ct t aat t aaa t at t cact t g t t t cagt t t t agaaaacaaa aaaaaaaaaa gat gt aaacg at t t aat t ca t aat at gt at agat ccaact tgagcaccaa ct ct t ct ct c ggt caact t t t t aat gaat a ct t t gt t t at at at gat t gc aggcaaggca aaacccacaa ct t t aacct t aaaaat cgca t t aaaat gac gt t t gt t t ca aaaaaat ct a cct gat cact t acat gcat c t gaat gt aac aat t t ccaac gagaaaccca cccact t gct at agt t aacc agt gat t at a acacgt t at t t at at at aag at gat ggt gg gat aacccca t cat t t tct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 caaat at t t t aacttattat accaattaga tcacacccac tcacatacat ctccttccca Page 12689250 Sequence Listing.txt atttctcacg ctcctagaat atgcgtggag act act t ccg ttaggctact at aacaat gt gagaagaagt t t cat acct t at at acaaac aacat aat at aagaaaaaca at at t acacg aaacacacaa at t t t at t t t t t agacaagt t cat t aat gc t t cccat t t t t t aaat t t ca t t aat ct t gt ct t ct t t gct at t gct aact gt agcggt t t gaaat aat t a at gt t acaaa t t t at t t t at caaaagccaa ct ct aact at aacgacat aa t at t t t t at t gact gcat t t t t t at t t t ag agt ct t cct a at aaaat aga gt gt gt gt gt t t t gat gt gg t t ggaact t t t aaat ggggg at aat act t t caagtttttt t at aaat at c cagt gaaacc tggagaagaa aact caacga t t at t t t t at acat acat at t ct cacct t c t aact t ct t c aaaaaacat a gtgt t t t at a gcat g gt aat t t at c t gt at at gt t t ct agt caaa t t gt accaat ct at act at a t aaat at gt g gaagaagagt gaacaaaatt t t at caat at at ct at at at t cct t ct act t caat cct ct t acaaat ct a t aat t t t t at t agt t aaacc caaaaaaaga t at t t gt at g aat t at cgac t aagt t at aa t t t gct t at g ggagtt t at t t gaat t gaag ccat t t gt a at acaaacat t t t t at cat t ct cat at ct t cagagaagag tttttttcaa t gat acaat c aat caact at gagaagggaa gat cgt aaac t aaaat act a acat agcaaa gtagccgcgt gt cgagt cca t cgat accaa agagt t t agg t t t t t ctt ct t ccct t t ct c t t t t ct t agt aagct t t at t at t aaaat ct 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2005 <210> 51 <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> 51 tgtaatcctg ttatagactt tggtggccat taatcctgat atgggtaagc tcaagccgtt aggaacagga tcaacaaaaa gggaagagaa gatcatatct tggttatatg acgatggatc aagccgcgac atgatccatt atcacagctc caagtacctc cgaatcgatg taagagctac gattgatcgt ttaatttaca attgcaggat gatctagcga caaagtctaa ct t ggagaat aacagagtca tgcaaatgaa catcgacact accaatgatg aacagctaaa gaggtgatta acaagtaatc tcgatatttt ttatttatta aacaaaattg ggctgggcct aggacattac ggtatgcaaa aattctctcg gacgaaagga ttaaagattc atcaaatgaa tcacaagaga gacagctgt g aggt t t cgac gt acagcgca t act ccgat a tgt t gtgttt at at t aacat gat acat t gg ct t caaaaga ggt gt at at g agcaat t at g aaat aaaat a aaaagat t ag aat t aaggag t aaaat aaaa t ct aagcaga cggt t t ct cg aaaaaggct g t t agacat t a aaagccct ac at at gt at gt aaggagatgt ttggagagaa aacct gt t gc aat gt t t t t c aat t aat aat t aat ct cgat act aagaagc ggaaat cat t t t gtgaagaa t t at at cgat caaaatgggg ccat ggaat c aat ct gaaga cct t at t t ct aagcact at g gat gct gaca t gaaaat at t t t ct t ggat t gggcct ccca ct t t gat at a gacacaat ga cgt gct t t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 Page 56 12689250 Sequence Listing.txt tgtgaaattg tttgttgcat atgttttacg tggtgacgtc cattacttta gat gaaat t t acagcagaag acaagt ct ct t t t ct t cgt a t ggaaat at a ccaaat agt t acat aat t t a t t t t t ccgct t at at t t t ga gggt t t t act aat agat aac t agt t gt agt ggaaacgagt gat t aat t gc aact at t t ct ggt aat at t a agaaaagt ct at at t gacct agaagat t ga t t t aaat aaa gccagaagct cgacatggag aat t t agct c caaat t ct ac t t at ct t t t a aat act aat g aacct aat t t ct t t t t cgaa at t t aagaca t gat ct ccaa t t ccaat t ga cgcagaaaat aaagt ccacg cgt aaat t aa ct t gct t t aa gaccaaat ag ct acat ccaa t caact at aa at t ggt t at c aat cct gaga ggagat at ag at gacagct c aat cat gat t gt at at act a ct aacgt caa t t t t gaaat a aaact t caaa ct agat t aag gcaaaccaaa t agat gt ct t ccaaaaaaaa aaagaaaat a t t t gtgacag at t at cat at t gt t aaat ct aact t ct acc tg t aaat t t agt t cat t t t at t gt cct agat t aagct aaacc agcacgaaat ct acacgt ga t t cat at t t a at gacccaaa t t t cgct gat t gat t cct ac t aact cat t a t agaacaagt aacaaagt ca t t t gt t t ct t cccaagct cg t ccacat aaa at gcaaat ga aaat t t cgat t ct acaagt a ct at gacaag caaact ct at t acat t aaaa at gt at gaag agt t t at t t t aat gct t gt t acaaaaacac gaaact t aat caat t aggt t t t agt t at ga ct gaat agt g gt t t ct ggct aat t caaat t at gaccct aa acat at at ca aat t gt gcat t caaaat ct t t at gct gt ga at aagat gaa t gt act caga t acaat at at t aat t ccaca aaaagt ct ga t t t t gt caaa ggt t t t ggat agt t t t t aca cacgt at t ac t t ct aat gt a t t cgt ggcat gat t cagaag gt t t at t t t t t at ct at t gc t t aat gat ag gct t gt at ga t t gt at at at gt agt t caaa 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> 52 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 52 tttgcttccc ttttaatttc tttcctatct ccacaaaggc gagatttgaa tacgtaatcc tggcccaaaa tatctttcta ctttgtaat a taacattcac aaatctccta tatttccata t ct ct gt cca gct acct ct t acgt t t cagg ggctgcctat aact t atata tcatgttgca tatagccggc cgctcctcca ttagct t cca actcagtgat agcacccacg ttatgcatga aacttcaatt caagagcatc atccactttc aagct aaaat gaacat aat g aaaat aact a t t t gct t t t c t aat t at act t gt cgat gca t t gaat ct ct t gt t t ct at g at ct t t gt gt aat ggt t gaa t t t t t aagat ggct t aat at cact cact ga ccat t cat ct ct cat agaga t agaaaacat cagt ct ggat at t t at ccat gt gt t gt gt t t cagt ct gt a tt gat aagca cct at at t aa t t ggcaat t a ct ct cgct t t t t t ggt aat t aat ct t at t c cggt t t gt aa t ggct t cat t ggggagagct t cagat t t aa gactgt t t t t t t t ct t t aac ct aat at ct c 120 180 240 300 360 420 480 540 600 660 aaaacattaa gaagtatttt tttttttttt gggtgaaaag tcactgtctt tttcttaaag Page 57 12689250 Sequence Listing.txt ccaaaat t t a act gaccat g t aaaacaat c gat ggt t gga t at cat gt ca gt t t at at at ct agct aggt aat ccat caa act ct cat aa t t t ctgtat t t ct t agaaat t gt t t t acag gaaaagt ct a t aat cacgt a caaagagaga act t gat t ct t gt t t caaat t gt gct gat t gaaat gt t t t gat gt at cac at ggct t aga t t t ggt gcgt t gt ct gt t ga tacggaacga cagcct at t t aggt at t t gt agt ggaacaa cct gcaaaga caacgt t ct a t aat t aggag aggt t ccct t at at at at at t t gt t aaaat aat t at acga t at t tat cca t ct t ggat t a cgt acct acc cgaagcaaaa ttccaaaggg ct t agat cga t cgaact t t t t t t cgat t t a act ct t ct ga t cgt t gaat a at t aaagct t gct aat t t aa caaaacaaaa tagt t t t t aa t at ggagat a aat cacaat a gaat aat at g gact t at agg at gcat gcat ct t ct t ct cc at at gt t t at t agt t ct t aa cat acgt aat at caaat t t t t cct gacgt g cagct t t t gg acat t cagag gt t t cct gag t t ct t cact t ccgat t ccaa tgatcggcgg agat gcct t g t ct ggt at at agt t gagat t aat at gact a gtct t t t at a acaat t t ggg t t t t t t t ct a ccaaaatttt t gat cat cca ct agat aggc aaagaaat t a agat t at at t gat t t t at ga t t aaat t caa accacat cag gcttttttca acctcgtggg agaaaat t ca gt at gt t t gc t ggat t ct t a ct ccgccgaa at ct accgat acct t t t t at gt gt t ct t at gatgt t t t t g aaaact t ctt aagcgcaat c t acatt ggct aacgcataaa gt gt at aaac gat at aat ac t t at ct t t aa at at at agt c gct acat aca at t aaaaat a aat t t at aat t gacgat gac cagagaagag agt aagaaat ct at at agaa gcat t t t t ga t t t cct t t t t aat cgat t ct t ct acgt at a tcct t t gct t gtgaactagt aatgggatta ggt t t t caaa gt tat agt cg t agt gcct ct aacatacaaa gcataacaga cct ct t gt ag t aacatt aca aat t t gacag caaat gat at tgagccaacg tt aaactt aa t caaact t ag aaat t aat gc aacaagtaac tagctcaagc ggt t t t aat t t gt cct cgt c gatct t t gag gt t t t ct t t g aagagtt at t ggatctatgt ttctgt t t ct 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 53 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 53 atcctaaact ttcactctac tctggcattg tgatcggcag cttgcaatac ttggcgttta gat t atcaca atttatgcat cgtcccacag ttcgatatct tgcgggaact gctactcat g cgcttcatgc attctccgat gctgattggg atgcatatat tgtgtatctt ggctctacac gagtcgcacg ttcttccacc gaggccgagt ctctagacga cacgaccgga acat t cat t g gt at at t act ctggggacaa ccat agctt g acagggctgt Page 58 acccggagaa t att gcgt at gcaggctgca tcgctcaaac t gacgat t t t gagctcgaag tgcaaacaca tacaggacag gctgtgaat c aaacgcgtcc t tct ct ct ct gtttccacca aaacaaaagg actt cagaaa 120 180 240 300 360 420 12689250 Sequence Listing.txt ttagatgggt atgctctctt ct t actgagt taggcatcac at t accaaag t at at t gt ga t gaagcat ct gagt ct ct ca gt caacact t ggggcat gt a ct aat gt agt act at agt t t ccact gt t t c t agct ct t t a at aat aggag caat t gct t g gaagctgct t gt t t ct t agt t gact t gt ga aat t cat cat cat t t t ccaa gt cat t ct cg aaaat t aaat gat t ggaaca aact acgcgt at cgcgt gt c ttcat t t t ag tcgt t t t t gt t ct t ccct ct gcaaaccct t t ct t t t ggt t caat gt t ggt agcat t ggac cat at caaca t ct t cagt t t gagt at at aa ccct at at at gat cat gt gc t t t gaat gcg gat ggaact g ggct t ct gct t cat cagt at cgt ggt t t at t t t ggt t t ga aact t aaat g ggacct t aat cagcaagt ag caaaaagct a aaccaaaaaa aat t t aat ag cat aact ggg cccacgct cc at t t t t t t ca t gt gt gt aca aacccaat ca t agt t at ct t t gt gt ggt ga gccacat at c t at cat t t t a cat gaccaac tcaagcaaga gt aaaggat a at t gt aat t a t t act t t ccc t cacat gt t t aat aagt aaa t cacat t t gt ct gt at t t cc gt t t ctt ct t tttttcatac ggt t t t aat c gat aat at ct cct gat t t t t t cagt ct t t a at t aaacaaa agagagaggt t cccat ct t t t t ccct t t ac t t aat aaaaa gagagat t at aat t t gcggc ct t cct caag t ct ct gcaaa ttcgcgacaa t t gccgat gc t t ggagt gag gt t t agt aaa t at cat ct aa t t cat t t gcg gaacat gt gt agaggt cact aat cct ct ct at agat aact t gt gggct at tgggccctga gaat ggt aac t gagt ct act gaacct cagc t t t gat t cac aagcaaacaa t agat aagaa gt ct ct t t cc tttttctcgt aaaaaaagt c cgt gt t t t gt t ct t t ct t t t caat t t t t ct t ccggt t t t c t gt gagt gcc t ct aaccaag caaact ccct t cat t aat cc t gaat t at t c at act t caag gt t ct t t t t g ggat aagct c ct t t ct ct ga gaact cat t a t t t gct t t gc gcaaaacaat t t cat ct aat aat cat t t ca gat agacagc gaacat t at a gaaaacgact t gt ggt agaa ct t ct t cact t at ccaaaac ttttattgca t ggcaat cgc at ct t ct cgc t t t ggat t ag at gcct gt ga cat t caagaa ggt gcact ac ccat t gcct c ccat ct t gag ct aat t cat t agcct t caat gt t ct t gct g gat cgaagct t gt t act aga at t gat gt t g aagggcaacc act t t cact t t t t agt t at a t t gt t t gaat caaat cctt t t t aaagt gcg cacgaaagt c t ct ct ggt at gat aagaccc cgccaacccc acgacgatt c gaaat t t cga cat t aaacct ct cggt t t t a at t ggt gt t t 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 54 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 54 aatgtagtag agagacgagc cggacaaatc gagttctggt gaaagtatag acgagctggg agacgagata ggcggattgt gtagtcaatg gacgaactgg cagctaggtg aggagataga tttatgcact tgagggacga gctaacgact aagtgaggaa ttggacgagc tgactagctg acagctaggc gaggagatgg tcgttgagtg gtggagcttg aagacgagct gaacttaaag Page 59 120 180 240 12689250 Sequence Listing.txt acgagctgga accagcagag acgagcagag gcgagcggat acgt t t t aat aact aagct g t at t ct t gat aat caaagat accaccccgt gcat t at t gt aaaggatt ct agt t cact ac t t acct t aac t ctt acaaaa t at cacat t a tat t gt t ct t ccact ggt t a aact gggt ag t at t t t ct t g t t t ggt t gat t t gt t t t aat at at at aat c at ct at act a at t t aaaaaa aagagaat at t t at t t ccag ccat at at ct at ct cat t ga t t t gagat cc agaaagagaa gagct t aaag gagct t gagg gagct t gaga aagct t gagg at ggat t t ct gt gat at gga ttgggccgaa cagt aaaaaa gcaacgcgcg gt at act t t t ggaat at gag acgacaaat c ggt t t aaccc cgaaaagggt t gaagat t gt t gaaat aaac aaat t ct t t t t t gaaact aa gaat gt aacc at t t cat t t t cggat t at gc acacgt ct ga t at gcct t ca aat at ct aaa aat at t t ct t ttaaagaacc ttct t t ctac t agt gat aga aagt t t at cg aaaagaaaga acgagctgag gcgagctgaa acgagctgag acgagcagag t cct cct t t t ccaact t t t a ct aaaat gag aat cgaagt t ggt ct cct cc caaagt cat t aat at aat gt at acgcgcgc acat ggt t t g t ctat t t t ct ct t cat ccat t t t caaaat c tttttcacga agagt t caaa t t cact t ct t t ggt t at aaa at t t gaat t g gt t agt t t gt aat t at at ac agaaat aaaa t ct t t t aaag agt acagcac t at aaagt aa t t agt at t aa tcaaagcaac gagct t gagg gt gct t gagg aagct t gaag gagctagaga tt cctt gaga t gggct t t t t gggt gcaaat at aat t acat ct aat gaaaa cat at t at t g agtggccgac at t t acct t a gaact t t gga aagat gt at a t t ct ct cat a gaagat t t t c aat t t t cact ct t gaagt ct t gagat t aaa t t t t act aga ggtcgcaaag t t aaat ccaa t at ggt cct a at t gcaacca gaat at aat c at t ggaagaa t ct ct aagaa t t at ccat t t caaacacat a accagttgag gagcttgagg acaagcagag acgagctgag acgagctgag t gcct ct ct a gggat t gat g aacaaagat a actaacaaaa t aat t act t a ccgacacagc cat at t t agc acggtttttt t at t at t t t g t gaat t gt ca t t t ctatgt t aat t t cat t c aaaat t at aa at t t caaaga atgtgt t t ct ttgt t t cttg taacaaaact at t t t agt t c at at ggacat cat agt agt t t accccaat t t t aat ct t at aaatacaaac tgggctataa aaagagagat gagctagagg gagct t gaga gagct t gaag tttatagaaa ttgat t agt t ttcccct t t t ccaaaaacaa cat at at at t aat at at aac at t t at t cat tttttacaac gt caact t t t agcaagcacc tgccaaacaa gat at aat t g agacgatacg actgggtggt aagagt ct aa agacat ctt a at t t aaacaa ttttaatccc agt cagaat c gaaaat at ct aatt gt at ca ct t cgccacg caat aat aat atagagaccc ttaatacaaa 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> Page 12689250 Sequence Listing.txt taaaacgagc tgactaatat tggacaagct cgatgggttt gaggacgagc ct ct ct ct at aagaaccaag cat ggt t tcg t aagt t gaaa t at t ccaat t cat ct cct ct acaaacact t at at t aacca aaaaat gt ag tttttttcaa gccaat aaga t aaacgt cgc t t t t t t ct t t at ct t gt t cg ct t t at aat c t t cct cct t t t t t t at t gat gggat t tggt t ct cgat t gt t ggaaat t t g ct aagt t t t g gct at gt ggg t aagaaaggt ct t gaaat ga agt agt gt t g at t ct aat t t gt t gggt agc agagt t agca gat cct gaat gt aaggt at c tatgt t t t gc t cat cgt gaa gct gat t t t a gt caacat aa at aggacat a gct t t t t at a at agacccaa aaaacct gaa t aat at at aa gt caaacaat aaaagct t aa t gt ct aat t c aaaagt aaaa t acgt acacg tacaaaaaaa ct t t ct ct cc gagagat tcg t t t t ct gggt gtgtgggagg tagggaccga t t t at t t agg t t aat gat t t aat gaat t ga aaaat t t t ca ttttgtatgc agt gt t agat at t gct t t ga gat agaaat a gcct aat gaa t t at t gt t ag gagaaatggc tataggaagg cagat aagt t act gat t t t c t ctt ct gcga aat gt agaaa agt gt agat c t gt caaagt a gacgct aact t t acct t aat ttggaggaag at t t t t aaaa aaccat aat a t at cat t t t c cgt t t caaaa t gaat t gaaa t cat agccga at cgt cgaga t ct acct ct c t agggt aaag ct cgat t t t g cct t t gt t cg t ct at caagg gt t cat t t gt gat t gt t t gg t gt gaaaagt t t tctgtgga t t t t gaat t t ggaaat t aga aagt t t at ag agat t agct g t gt ggt act a aaaacaggt c t ct aggct t g t gct t at agg at t t cat gaa t aaggcaact at t t t t gt t t t t gtgacaag aat t ggt gga cat cgat t ag t at t at t gga cccaat t ct t t at aaaagat aaact ggaaa aaact at t gg aagcgagt t a t t t gaaat ga ct ct agcaca caaaccaaaa agct t ct gct ggt t tct ct t t t t t ct aat t gt caact t at ctt caagggc aagct gt t ga t t at at t t gc t gt aact ct t gagaggtagt t gaaat gaat t t agt at gt g ct t t acgct g ct ct aat gt t ggt gct aggt agcaagattt ct t t t gt t ct t agt gt ggt t aagaat t gt t gat gat act g tat t t t t cca act t gct at t t ct acct t cc gt t t t ct t cc cat t gt ccat ccct caacaa t at t at t aat aagaat at at t aat t aact a acat t t ggaa gt aaaaaaat aacggt gt ga aaaagcct aa ggat ct ct cc ct t ct ct at t t cat t t ccat gt gt tct gt t aaacaggt ga t gt gaat t ct at gt gggt t t gt t t t t gttg gt t ggat at g t gct at gaaa agt t t t gcat at at aagagg t t t aaaat ga aacct agaca gt t t gt ct gt ct t ct t t ct a at aaact at g cat ct gct t a at t gggt t t a t gt cact gag t aat agt gca t t gt at t t at gacct at t ga aaacgaattt t cct t gagt t ct at gt t acc t aagaagt t t t at at t ct ag at t gt tct t g gcct t aact c caaaagcaga aacgaaaat a aaagccgacc agcagacacg t t gct t t cac t t cgatt cga t t t gt ct ct t at t agct t t t t t t t t atggt t gt gt gt gt t at t t aaaagg agt gat at t a agt t gt aacc gt t t t agaag ct aat t t ct a t t cat t t at g ct t ct gt t t g t t caggt t ct t t t agt t aaa aagat cat gt t t t t cat gt g aat gt act ca act gaagat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 Page 61 12689250 Sequence Listing.txt <210> 56 <211> 200( <212> DNA <213> Aral <400> 56 t gt t gagct a act aat t aat at ggt aggt t t agt gt gaaa agagat ccaa t gat aat at g t t aat t ggcc gt ct cgt at t t ct t agt t at t t t at t gt t t ttagagaaaa t t ccagcat a t aact t gt at t at t gt gaaa at t t t cagac at acat at t t t gaccaaat a t t ccaat t gt ccgaat aagc gacagt caac gt ccccgt ct cccagat ct c t t aggcgat g ggt t t cgat t gcgat agt gt t t at acagat ct t aat aggt gt agt agt ag agccat aaat t t cct t aaac 0 bidopsis thal i ana ccggcacct c t cagagat t c aat t at t gt g agaaaat at g t ggt cat cat t t cacagt t c t cacct t at t cat t t t cct a at gt ccat ca cat at aaat a aaaat t gt t t t t t t at aat t at aaaaacct aagt at cact cat cat aaaa cgacacaagt caat at gt t a gt t t at aat t cgat ggaagt agt gt cat t t cgt t t cct t c t aat cgccgc ct t t t acggg cagat t t gat gt gact at t g t t aaaacgt a t t at ct t t t t t agt at at ga gagcct t cct gt t t aat t ag ccggt t t gt t gct cgat t ac gt aat at at a at t gagt gag tttttcctca acct accaaa at act agt ct act aat t t t a ttaggcaaga t t t t aaggt t cagt t t aagt ggt t gaaaca t aaat gt caa at t t ct aaag aagat gat t t at act t ggt a agt t cat gt g aacgcat t aa aagaat t gaa aat ccct at a t t cgct cgct aggt t t cgct t t t t gtt gt t aat at t cgaa aaat gagat t t gt ggat ccg t gcaaat gat t t t gat aaat cgt t aat t at at cct t at ca acat t aat t t aaact t t aat gcgt gat act gt aggt t aaa acaacaaaca act at gt cat t t t at t t t ct t t t t at t t t t agat t ct aca ct act t t t ct gatgt t tgca at t t aat t t a t t gaaat gat aat t gt t ct a agat t agt ga t caaaat ct g acgt gagat a aaacact aaa gt ccaaaagc aat agct cac gt t cagat t t ct t ct t ct cc aaat ct gaaa cct t ct acgc ct caagt t ct ttaat t t t cc t t t gat t t t c gt agt agt ag t gt ccat gaa ct gact gt t c t t t ct t t t t g gat aagt tag t at t gat t t t t ct at cgaaa t aat t at t t a gaaagt t t at t t ccaat gat gaaaat gaaa t at at at at a gt at cat t t t act t t t aat g at t t aaat t t agagagagac gt aaaaat t g caaagaat aa taaaaaaaaa at aaat t gat aagcaaat aa aaaaacct at t ccct t gt ca t gct t t gagg gt ct t at t ga cgaaat gaga ct gt t at t at t aggt t at at agt gct gt gt gcat cgat cg t agt at at ga ttgt t agtta cact at gaat t cagacat ac t agt t t aat c t ccct aaat a t agt t t aat c t aggaat gt t gt t ct acct t ct t at gat t a t aagt t ct aa ct t t accaac agacaat gt t caacatttt t ggt gt t t t ct at t act at at gt at t agt t a t cct t caaaa at cagagcca t t gat t cact at aaat gt ag agat ccggt g t ccacaaat c ct t t aggct c t t t cgagt t t t t t t t ct at g aat t agat ct cgt t t gt gat agcagat ct g t gt act ct at t cgt gt act g agct t gaaag at cagaagaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 tcgaatctct ttggatgaga tgcgtctgtt tttatgctat t ccacaat ga t t t ggaat ct Page 62 12689250 Sequence Listing.txt ttcttagctt tttatgtcac ttgagtgtgg aatctttttt ttttgttctc ttcctttcaa ttgtaaaaag tttgttatat gtgtatgatt tttatgtggt tgctgattca atttttcttt ttgttgttat agctttaaga 1920 1980 2000 <210> <211> <212> <213> 57 2000 DNA Arabidopsis thal i ana <400> 57 cggt t t ct ga caccgccacc ccaaaaggca aaaaagct ga t t at gt gt gt aaagat agat t gaggt t gat gat gat aact gaacagagga at t gct gct g aaggagaagg t gacact aaa gt gagagat g gt t acagaga agcaaat at a gact t ct cag agt gt t aat t t cat cagcag caaaact t cc caaaat t ggg ttagt t t t ca t acact accg gacagt ct t t at cat aat ga gt at gacaat t t t agt t agc aagacaaat a accaccacgg t cct cct cct gagt t ct gag act at ct aag t at t t acact t t gtgggaag aat at aat ga t gagt t cat t gaaagagagg gagccagagc ccggt gt t at acaaaggat t at at gggt t t gct ct caaag gcagct t caa t cact agat c t gat ggacct atgt t t cttt agt gaaacac ccact aat gt agccccat ct acaaaat caa aat ggaacac ccacgt aaga aact gat cac ct at cat caa at at gagt at at acaaccac ct t ct at cac cat act agt c at caaagcaa cacat at gt a cgaagaaaag ct t t gt at aa at t ct ct aac tttgcagaga tcaggcggag ccgt agt acc t ct gat aact t cat t aaaaa at act t t ct t t cacaacgt a aacagaaaag ct t gat t t gt t ccagt aat t t t gct aact a t gt gagct ca agt gt t at t t gt gat t ggat gat aat t cca aaat t t gt ga t aat aaat gt cgt ccaaat a t aaact ccat cgct t aggac ct t cgcct ct gaaaggatga ggt aagt agt ttacacaagc ggaaaaagct t ct aat ct ca at gt t ggt ac tttagagaag aaagacagac ggt aaact cc gt agct agct gaaagaagat aaaccaaaat aat agt aaaa gt t t cat aat t cct agaggc t aagt gat t a gt gaagct at at aact t gct tggt t t t gt c t t at cat t at t ggaagat at tcgagaagag ct t t t t cggc t aaact t t t c aaact aaagt acgat cagaa gcggct t cca t t ct acagct ct caat t cac aggt at gaga cgaaggaagc t aat cat t at at gaacagca acacagaat a agagcaaaga ct ggaaat gc at at gt at gt ctt gaaagcc caacacaaca aacat gat gc cacaat cat t gt t act gaaa acagat t cca t ct aagaaca t t t agt acac t t t at aaaaa cacccat caa t t at gt gat a aacaccacat aacgacagaa t aacat agag ggt gct gt ca cct cgt gct c cct agggaaa gat gct t ggg aaaaact t at agt t aaacag t ggacat at c at t at aact g gagcgaact a cat t gaacag gt t caaggt g at gct gt t t c at gt gt gt gc aaaagt gt at ct caaaagac agt t aat at g ccat t ct t ct ggt at cat aa t agt t t ccac t aaat aat gt tgtatcgagc gct t gt aat a agct t cgacc t t t cgt at ga t ggt cacact t gat at caac caacat t at t aaaaaaagaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 Page 63 aaact t aaaa ttatctcttt t t t t at t t t c at cacaat gt agat aat agt at at acacat tcaagagaaa gt gat t agag caat t t at ga aat aacaaaa at gt t at t t a at act t t t t c at ccat ct ct aaggaaaaaa 12689250 Sequence ttttatttat ttattaatat aat gtttt ca taatattttg aatt at aaat aaat gagt cg gaacaat aat t at agt ggct tttttctttg tgtggccaac taactct t cc atccaaaaaa Li st i ng. t xt t cat agt gat t t gt caaat a t t t t cacat t t aaaaat cat at at ccattt aacaaaacaa aaagagattt t ggagt att a t acgt ct aca t aat gaaagt t ct agt ct at aaaat t at at 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 58 2000 DNA Arabidopsis thal i ana <400> 58 tt agt aacaa at aaat t aaa cccgt t t ct t at t ct t aaaa tcat t t t t ag cat at acat g aatt ggt t t t at at t ct t ct ttcct t t t t a ct t gt gt gga ct at gt at at tccgccgaac ccct ct t cca ct t at t t t gc t at acgcat t acat at at ag t t cat at acg t agacccact gacgt at at a agt t t t aaac ccat at cat a aaat aaat t g t cat acatt a act aaaaaaa tt gt t t t t t c tctcat t t t c t aacgggaat ggaact t gca t aggt t t at a at at t t ggt a t ct t t acaat cat t cgt t t a t at gt caat a ttggaaaaaa aact ct t gt a t t at t cgt ct t at aact t t a aactggaggc aaat gat at t gacat gcgt a aaat aagt aa t at gt t gact gat cgat cat at t t t ct agc acaacaaat a aaacgtgtgt t acat t agt a t caat gt gat t t t aat t aaa agat t acgt t cggt gt t gca t t act t gcat ttactct t t t t t t at ggccc t t t t at t t t c act aaaat t g aaagggagag ct cgt caaaa gcact ct gca gt t t at t t t g gatggagcaa gt agcaaacg t gat t at t t g at aat at gt a gaat at ct t t ggat at aact t caat gcgat tgat t gt t t a t t cct agt t a t at gat t t ca caaaaccct a ct caat agt a t t t t cgt gca caacat at t c at cat at gag gt t at t aact ccacat ggag gt at aagat g t cggt act t a ggt agt gaaa aaaaaacaaa gcat at acgt cagct t agt t t gcaaat t gt ct gt ggt agc aaagaaaaag cat aat at at t t gt aat t at ct at t t gct g aacacaagat gat gaat t cc aat t t t at t t cat aact caa gcaagcgt ca t t t cgt t aag aat caat agt agt t caaaca gtt cat t t cc t at t t t t aat ccacgt ggct aaaacaattt ccat gaagca t t agct t aac aaaaaaat ca gt at t at t t c at aact aact acgt caat t g cact agccaa agcacaggga agt acagaag aagaaaaaac gt t ggaaaaa t ct at at gt t ct at t act ga t gaagaaat c at at t t t gt t cacgacacgg ttttgtgtag aat t aat t t c t at aaaaat t t gaaat at t c gcat cct ct a at at t aact t at ct t t agaa t gt gtt at gc t aat agt t t a gacaaat cgg gct aat t aac at at gt at ag t agt t t gt gt cgaacaaagt aat agat aga caat gaaaga aaaat at gt a aaat t gctt t aaaacaagt t at agagt t ac aggagaatga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 aggtgaaaca tacaatatta gctaatcaat tatcattttt tttttttttt caaaatcaac Page 64 t cat ggt gaa t cgat t at t t aaaaaaat at agt gaat t aa aact ct att c gt t t t aact c agagagaagt t t ct ct t gcc gagagtgaga act caat cat ct agcagtt g t cact t cgat caaaaacact t t t gt gt agt at ccgt gt ag t ct t ct t ct t gaat aaggt c ct ct ct ct ct aagagagaga aat ct at aac 12689250 Sequence aaagt t agt c at at gacat g t agt t ggcca t at aacat t g ctat t cctcc acaaaactca tcttaataac aatttgtata tataagtttt tgaacatttc at agat aaaa gt ggat gat a gagtcggtca ct cacggt gc ctcgttatct cccctattaa gacagagt ca gagt ct t cgt Li st i ng. t xt t aaaact cga ct at at act a cat agt t t t g aat at aagt g aat agct cag aat t aaaaaa at ggt cact c agact ct aac ct gcaagaga gt t t t ct t gt t aaacacat a t aact ct t at agt t t acact t aggt gcaaa t aaat aaaga t ct ct gcacg cagt cacaca gaacaat acc 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 59 1246 DNA Arabidopsis thal i ana <400> 59 t act gt agt t t gct ct gct a catct t t t t g ct t t gct cat t aagaaat gc at at t at aag ct acaat ct t tttttgggga gt t ct cccat at gat t gt at t cat at t t ca aat ct t t gag at t t aaaat t gagt acgt ca agcacgcct c at at gct aaa at at aaaaaa ggaaaaaagg ccaaaaaaga t aaat t acat gaggat aat g at t t t caaat gt ct t t gcaa t ctt ctt ggg cgaggacat t gat aat ct aa t cct caat ga aacaagat at aaaat at t gt tt ggat t t ga t at t t gggaa t aat cat gag gat ct at t at caat t t t gt c t at t t aaat a at t cat at at agagtt gt aa agt gaaat gg tt cggtt gt c agccat cgga aagttgtttg ttttattaag tcactaattt ctgaaacata tagagaacca gtt agt ggat acat at t cgt aat at t t cag gt gaaagagt ttaaaaaaaa at ctt acgaa at t t gaact t gt act att gg aaaggat at a tttcgcaaag aaaaacaaag tgggaggcaa t t t caact ag t t t gat t t ca t ggat t act a acacgagagg acat aat t t g tcacaaaaca aat t aat aaa agat t t t t at cat t t cgt ga aat t cagaca t t aaaat gga at gat t t cga acaaaaat ag agat t agat a aaat aagct t gagcact aca t t aaaat t ac at ctt gtt ca cgact t t aaa at t at t t at a gt t gt t aat c caaaaaaagg ttggcaacga cagagatt ct gracaacaac agtt gaaaga tcacaaaacg ct gaaaaaat at gt acacaa acagaat ggt t at aat ct t c t cat gcat ca aat caact t t aat cat cagt t t caaat cat cccgaat cat aaact t t aaa t t gt cacct a ct aat agt aa aact at at t c gaagaat aac aaggct t t t c ccact ct ct c aacaaacaag gacagaaagc acat gcat t t gat gt t t at g acgcaagt ga t t t t agt tat tt ct t t t t ct aat aat gt t a t at t ct t ct a aggacact ag at cct t t t gg aggagt at ag t ct t gt t cac t gat t t gagg t t ccaact ac caaat t t gt t gaat at agt g t ct ccat aac t at cct t ct a aact caaaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 12689250 Sequence Listing.txt cttcataact aaaacatcct ttaaagcctt ttcaaaaact caatca 1246 <210> <211> <212> <213> 2000 DNA Arabi dopsi s t hal i ana <400> agt at cagca aggagaagga agt gct t ct c gcccgt cgat agt gaccgt g ct cgt act ga t ct cgccaag ct t t agt aca aat t ggt gat aagaagaagc ccct at ccgt cct gagt cca ggggaaagca t t cgacat cc aat aat gat t agt aagt agt t gct ccaaga ttgccgacaa act t t cat aa cat cgt aat t cat at cact a ct t aat ggaa at ct gt at ac ggcat ccat t cct aat caac gct ggt acca aat cat act t gct ggcgcgt ttgagccgaa ggagt cct gt gt gt gcggcc ct aat gggcg acagaact ag acaggt gaat accgt ct t t t t cggt t caca t t gt t t cagc ccaaaagaat cgct cagt at agccgat ct c t cgaggt caa agat cagt gt t cggat aat a accct t aagt t t act t ct gt t t gt t aaaaa at gt t aact t aat t aagcat t at t t at t t g t accaccgt c t t t t t t t t gt t gaat t aaaa t aaaaat t at act agt t at t gagccaacaa t gt cct at ct ct agt gggga tagagccgag tagaggaaga t agat ct cct cat aaggaga t ccat ct cct agcccacaca t t at aacgt c agcccact aa t gat t ct t t a t at t ct t gat cgcct ggt ct cacct t acag gat caagat c caagat cacc gcccaggt gg t aagt gt t t g aaaacat aag t t t ct gt t ga ccat t at t cg aact aaat gt gat aat caat at t at cact a t aaaagt t t a ct t gt aaat a t t gcgaat t c t t gagt ccac caacgt ggct ct t t ct t gca agccagaacg ct gagct aaa agaagaat t a cct cct gacc gggagaggat gat cgat ct c t aat at aact t t ct ct gt ag gaggaagaac cat ct aaaag t t cacaact c ct gt t at cgc aaagagaagg aaagt cat at at ccaaggca gaaaaaggga t t ct t t t t ac cat t gt ct t t t gt t t gccga t t gcggagt t gacgt t t gt c at aat t t acg t cact aaat a ttgacacaaa taacaacaaa gcccgt aact cgacaggt ga t ct t ct t t t t at aagat t t t gct cact t t a acggt gggag gcagaagt cc gcagaagaag ttagccaaag ct t at cgct t ccccct t t ct at t t agaagt acct ccaagg t t t cat gaat gaaagt at gc aaccggcggt t caccat ccg t caaaat ct c aggt ct ccat t t agt agcct tgagaagaga t gcgt at gt t cat t t t t t ct ggt t t t ggt c accaaacttt at t t ct t cct t aaaaat gt t aaat gaat t a at gggat t aa t ct t aagct a t agcaaat aa t t t t t t t aat gccacgt cac aaaagt agag aggaagaggc agt t ccggca t t t gt caaga at t ct cat ac t agt gat agg gt t acacact cgcagaaggt t act t at cct at ggaact aa t caggt at ag acagccgcag ct agccacag ccat t gggac cgaagt cgga at gat t aat g t ggt aaagag t gt t t gat t a t t gt t gccat caat aat t aa agaacaacga acat at at at aaaat gat t t aaact cagaa aaaaagaagt acaat t agaa aaaagaacag at aat caaac at act aagaa agat gat aac t act act acc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 gtcaccatct ccggtaaaat aatgtacttg tcatttaaaa attaagaaaa aacacatcac Page 66 12689250 Sequence Listing.txt tctgcgataa aataggcaaa agcagatttg aagaagaagc agcttgagat atcaaataga gagagagagt gacagaggag tgtgtgaaca tcctttttta gtagatttgg gttttcgaga tgccgtattg aatcggctac gaatttccca attttgaatt ttgtgaatct ctctctttct ctgtgtgtcg gtggctgcga 1860 1920 1980 2000 <210> 61 <211> 2000 <212> DNA <213> Arabidopsis thaliana <220> <221> N region <222> (809)..(812) <223> n any nucleotide <400> 61 agcat cgaga agagct t acc agagact t ct ctagaaaggg t gcat t ct gt t t t acaagt t t t ct act agt t gt t caaaat t t gagt aact ct ggt cgt cg t t gt aat act at t cat gcaa aat t t t ggca gat acgaaaa cgat t gaaac ct t t cccaaa cat t t ccct g t ct gcgagat t cat t gt cca t ct cct gaaa gt gt t aaaga aact cat cct agat aaccct gaat acaat t agcat t t t cg t ct gaacaaa t cagt aat gg acaacgaagt gct caaaat g at at agt aaa gagaaacaac t gt aat act g taccgggaac cat cagaacc aggagtgagt cagaaagcgt t gt t aaaacg gat gaat aac cccat t gt t g gtgct t t t t c aagtggcgaa at gcccat t t accat agt t a gggagaggaa t gaact cat g gt ct at ct ag t gaat agat c ct t gt caaag gt t gacaaca gcct ct ggt t t aat gggct c t gaat t ct gg cagagt ct t g accaat ct ct ggaaggccat t t gact t aac cacaagt at c ttaagagcag t aagaact at at ccaat gnn t aact aagct agt gaaaagc t ggct ct gct ct t cgagt t g at caat at cc t at caat gga at gat cacct at t ct t ctta agct gat t cg cat t acaaat aagacat ggt ttgggacaaa gggct t t aaa t t t t gt gggt aacat agaat t t caaaat ga ct aat caaga ggagt gagt t cagcccct ac at at t cagt t caaat gcat t aacaagat t c nngt cggt aa cgggt gat ag t cgt ccaaac ccgaaaat gt act cgt gaga t t caccat cc gat at ct aag gcgggtacga ccgt aaaccc at at t ct cag gacat gat ag t t cgagt gac agt t at t aat at aaat ggt c gt t gt act ag gct cct t t ct gat gaaaaaa at t ct gt gca t aagt t t aag t act cgt ct a t t at t at aag caaaggagca accct t t t ga t gt caagt ag ggt ct cct cc t cgt aact t t acaagt ct t t t caccat caa t at t at at ct acaact aagg acat gat at t t gat t gacac aaacct caga agagagaacg ct agact gag t gt t cct caa ccggaatggt ggact t ct t a cgt t t ct cgt aaaaaagt ct agt t t acat t agcagct gaa tgtcgggaac at t aat caat acaaacttt t t t aat gagaa aat t cacaat aagt gat agg gcagt t gct a ggct t t ccaa at ct gat t ca cgcat t t gca ggagt gagt a agcgaggaga at t acaagga aggctcgaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 tcatcaggga gtgggcattc tacgacgtcc cagaagacga ctaccttatc ctcttcctct Page 67 agatacaaaa t gt t cct cca tcgaccaaca ct t ggt t t gg tcgaaacaga ctggtccgaa gt ggat gact caaaaggat a cctagaaacc t ct t t agct t aaggcggcaa cat act at t a cccgagaaat t t gggt t t aa gcaacaaaag agggcgtggg ct t ct t aaaa gcaaggtcag ct t t ct ct ct ct t gcgaaga 12689250 Sequence aat gcat t aa caat cagt at gagagataga gagagagaga cagggt t t t a gatctgcggc aaagaaatgg tccgggaact ttgatatggc cgagttggtc ttcaaatccc actgtcaaca taaaaagaga atgtaaaccc attatggtgt taatatataa cgcctat t gt tgccgt t gct Li st i ng. t xt ccagagaatt gct aaccagt ggtggaagag t t ggt t t agt taaggcgcca at t t act t t t t ccaat t cga aggagacatt t gct at ct t t act at t gat t ggt cat ggt t cgagctct ct aat t t agaaa gat t aaggt t t t gt ct t t t g t gagaaggt t aagt ggt aac cgccgt cgt c 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 62 2000 DNA Arabidopsis thal i ana <400> 62 gccaaaggcc aagat t aat a agaaat gat g gat t agt t gc gatgtgt t t t gagt agat gt gatcaaacaa t t t t aact t c at t cat at t a gt cat t t at t gaaaat t aaa gt at ct ct ct gcct t t ct t g at at t aat at ct ct ct ct t c gact t t acat gat ct t gat a t t aat ct aca aat cat t agg gt act gaat a aaagt t agt a t ggaagat t a at gt aat aag gt t t ct t aag ygggtaggag at t aaagt aa t t acaat ct t taagcaaaat aacat t gcaa gacgat act g tgatcacgag gagat at t aa t agt t cagt t act at agact t ct t gt gat t t t ct act gt g gt t t agt ct t at aat t t gct ct gaaat cgt t acaaaact a gaacatgtaa atgcattaaa ttgacaggtt gggagacaaa aaaaaatct t t aat at t aat gaaat t gt t t gcaagagtaa aaaat t gat c gt t t agagt t aagaaacaac ct t t aaat gg aact at aaac at t t cgaaca gaatgacagc at t t cacct c gtgtaaaaaa aat act cgt a gt acagct aa t gat st gt t g at cgat t caa ttaacagaaa t aaat t t t ca t caaat ccat acaaaactcg gctcaggaag t gat at t t t t t at gt caaat t at at at agt taagacccat t act agct ag accctaaaac gaagcaatat tgaagccagt t t caat gct g aaaaat cat a agt t at aagt gagaatagca gcagcaaat g at t at t t gaa acaaaagatt aaccaacaaa aaagtaaaca aagct cgt aa aaggtgagga at t t gagt t a t t gt t ct t t g gt cggt t t t g at aat t t t t a t t caaagat t ct ggaaaat a t at t ct at cc gaaaat aagt t t t t gt ct t c t act at agac at gaagct t c t act t aat gg gt at cggact act at t t t ag gt ggat ct t a t gt t agt gga gt at t t ct cg gt t aat t ggt t gacaaat ca t acaat agt c at acat aaga aaatacacag caaatcgcga agagt t t t t a aat t gt cact aaggcat t at t gt t cgt t gg at at caaaac t at at agt ct aat t t aaaat t t gat cat t t aaaaccgaat t cat t t gt ca t at at act aa cat t t t cat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 68 12689250 Sequence Listing.txt tgtcatggta cattatatag cattattcgg agaaaaccac t aat t at at t agat at at t t cat t aact at t t aggt t t gg at t t at ct t t t aat t gaat t at t ggcct gc aact gaaccg t ct aaaaagg st t t at caat cggaccagaa t at cgat cac accagat ct t at cagagaga <210> 63 gt t cacat cc t gt at acgat ttct t at t gg t act t t aat a aagaagatgg aaat cacaaa gt at aagt t g aact ct caac cgggat t cat gat t t ggt t g gt t acaaat g t at ccaat t t aat t aagct a ttggt t t t gt ct t aat cgt c ct gcgaat at aagt gagt t t gcccgt acat aat gt at t aa agagt t aacg t t gt gat gt t at agyt cgaa ct t at gacga t aaact cct g t gt cact cac t cgggct tag t gaat aaaaa t t t ct t t t at t t t t t t t ct t ggct caat at gaagcccaat at t t t gaat t at ct t ct aaa t at t aact ag aaat gacgaa at t t at caaa cgaat aat aa at t t gct gaa t t act ccat t t t t t t t t gt c t t t t t t t gt t act t gt cgt t at t t gt gggt t ggt aact ac t t at t t t gt t ctggacaggc acaacaagtt ggccccattt agct t t cat t aaacgct t at agt aagggt a aact t t aat g gt at t t t gt g gt t gt t gt aa gt agt ccact t cgaagt t t a t aaaccaat a accgct ct at caccacat t g aggaaagaaa t t at cacaaa t t t caat ct c 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 63 t agagact ac t acgagt cgt gt ct t caaat t t gt gt t gcc at agt agat c t ctt ct t t t t gt cacaaaag gt ccaaaaat tttttttttt gcgat t aaga gagaat t gca aaaacagctt t cat at aaag cact t t aat t gcat ct gt aa aat t acat aa aggct acat g acgt gt t at c aggt aaacac gat aaaaat a t t aagt gt t t at t t gt t ct a t ct at ct acc at t at t act t gaacacaaaa cacat at t at at aaat t gt t aagat caaag t ct gaaat gc t caagat at a t at at aat ca t ct caaacgt ct cgagat ct cat caaact a ggat t cact t aaat at t at t t t t gt at t t c at cat gagac at t t gaaaat cact t t caag t t gt gt at ga aaggat t at c t acagat t aa ct gat t t gat aat at at gca aaaccaat ac accat t aat t t at acgt acg cat t at cggt t aact t acga tttgtttaag t t aat ggt t t act t at t t ca at caacat cg cat t gt caag t cgt ct gcac gct t aaact a t gggt gt aga aat t t aacga gaat t gat ac t cacgcat t a ct aaaat gaa at ct at at at t at t at t tag acgt aagaca gtgt t t attt cccacgattt t gaaagcgt t t t acat ggaa cct t t cagt a agt ct at ct a aaact aaat t cact acacac agt t cagaat aaggaaat t a t t t t gaagat t ccaaat aga gaaaat t t at aat t acgat t ggt t gaaaac caaacaaaag cacat t t at a acat t t gt ac t t t ggt t act aat at agt t c t ct t at t t t g gt t at t ct at t t at at gagg gt gcat aat a t t gggat t ac aaat aaaagt at gaccat ct aaacgaccac at at gcat gt caccaat at t aaaacaaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 gtcatgcgta tatgaactat acatatgcgt cattagctcg ttcttcaatc taggtttaat Page 69 1020 12689250 Sequence Listing.txt atgaaaaata attttttctt cataaatatc tttttcttgg agcattgttt t t t t ct t t t t aagaat at aa cat cgt cct a at t t gt at at cagat aaacc at ct t agaac t at gt t gt t a aagataggcg cgt t ct t tag caatgggt t t t gcccaat t t aat t t caaat aaggcccaat ct t caaccat accct t gct c gt at cact cg t t ct t ggagc ct t t t at gca t ct t t agaca cgaacccgt g at at ct ct aa at at t t t t ag at t gt t at t t at t aagat gc at t aaaacat gt t t t at aat t agt t caaca t ct aat ggt t t aaat t gaat t t cccggct t cgccccact c t t ct t cct ac at t gt at t t g aat agaaagg t t aaaaat aa gaggt cgcat t t gt caat ga gat ct ct t ga gat t gt t ggt aaagaacaaa agt cgcaaca ct aat gagt a t gagt t aaaa ggat cgat t t ccct aat at a ccgcgttttt acaggaagtt t at gagagt t acacagct at aat accat t g t t gcaacgt g aat gt acaaa acggt t gat t gagt at ggag cct acct cgg acat at aaca aat gt acat g cagat t t t aa aaaagcccaa t caagt gaga gct t t gt agc ct acat cgac cgct t t cat c t gt gct t agg at t t t gacct aat ccat aac cat at at ccc ttgt t t gttg t gt t ggt ggg t ct cat t cga aaaaaaaat c aat caaccaa aact aat at a at at aagaag gt gt t aat cg gccggaccgg ct gagt t ct c t t t t t t t ct t t t t ggat cct at ggt accct t aggt at at a gat at accaa t t at t ct t t c tgatcgagga t aat gggt at t at t t gt gct t gt aacaaaa aat t act gaa aaacgt cgt c gt gat cgt t a t cgt ct ct ct agaagat aaa gt act t cgt c 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 64 2000 DNA Arabidopsis thal i ana <400> 64 cgt gt agct t ct gt t t t act gat agt gt t a t agagt t t aa gat t ct gct g aagagat at g gggt at gt gt agt t at t gat t at t gcat gg gaagt t aaag t t t at cgat a agt accagga t t gtgaagac t cgat at cat aaacct aat t act t t ct t at aatggaccgg ggaggcagat gtggtggaaa t t t t t gatct ggactt ct t t at cat cct ga gt gt t at gat t agt t t t at g t ct t at gat t at at t ct gt t gaagt t t t t g gggt t t t ggg cagcgat at a t t t t gt t t t c t cct at ggct ggct gcaact ct t t gat t gg t t ggct t ct c t gagat t agc ggaaaacatt t t gt aaacct cgt aaagcat gt t aact cgt gcat t agt gt t gat gt t gat t t gat gaat c gt t t gt gcag t t ct t ggaac gct caagcaa t at t gaaaag t aaaggt cga aagct t gct a gagaaggttt tttcgaacag aaact gt aga t t at gcat t c at t gt gt t ct ggcggatat g tat t gct t t a cct at t gt gt gagt t aagga acagct t gaa ggt ct at gat aact gaaaga aggt t aaagc gat t ct gacc agt t cagct t gaggaatgt t gt t t gcaggt t t at gt aact gt ct ct t t at t ct ct t aat a t gt t gcagt t ggat t t caac t aaagaat t t t cat gggcat gcat at gcag t caagt gt ct ct t t t t t ct t gt at gact t t t t at t ct cgg t ct t gaccgt 120 180 240 300 360 420 480 540 600 660 720 780 Page 12689250 Sequence Listing.txt ggtgagaaga ttgagctttt ggttgacaaa acagaaaacc ttcgctcaca ct aaat ggt t t act gat t gt caagat t t t a at aaagct ca t gt gggggat cgct ct gt gc t gt t t t gttt t act t ct t ga gacaat ct gc acgat act t g ggat t gcagt gt aat t aaca tt gct caaac gat t t t ggaa cgaggcgact gaaagt gat c gcat gt aaga gct t agt ct t gtgctcgccg tttttaagga caaagt caat tgt t t ct t gg gaacacaagg t t gt cct t gc t caact gcgg t t cccaagat tt gt t t t t gt aat t t gt t t t aat gt t gct a tt ctt ggcaa gaggt at aag cggt t t cgt c gct t t ct cct t t t ct t cgaa tgcggagcca t cct t at t gc acaaat gcaa cgat t ct ct a at at aaat ct accaaaat aa agacccacac gt t t ct aact aact caaat g aat t at cat t t aaat aagt c ct ct gagaat gat t t ct at a gt at aggct a t t gct ggt ac t t t act aat t cat agt ccac t gaccgat t c ct gct t ccct t t gat t gcca cct ccggct g ggcaaat cag gtt cggacaa cgcgt t t cag ct aat at cgg agagacat gc t t aaat t t ga agaagaaaga gcct t gat t c t ggaacat t t at ct t cat t c t t ct cat act t at at at t at t gacact aat t ct t t ggt ct agccggt t aa accagagt ct aat t t cggaa t gct gagct c at gt cgcggt ccact t cct g acgt gt t gt c cgt caagt t t t cccgt aaga aagagt t t gc act cgact t t t gt ggt t t ca t cat cat cat ct t cccggcg agt t cgt t gt at at gt t t ct acat at at gt t ct t gcaaac gt t gat gaga t at t aaat gg ccggt t agt t acat t cgcct aagat t t cga t at gt gt at t t cgact aaaa t t ggt cagt g t ct agat ct c t cgaat t aga ggt t t gt t t t agct t t ct ct at cacaggcg gaacat gaag cct ct caat t t t at cgact g gct acct t t t t ct t agat t g at gcat gt t c caat at t t gg ct ct agt t gc agat gaat at cat t aagggt gat t t t ct ga ct gct gaaat tgcaggaaat ct gt ggt gca gt t t t act t g t aaagct t t c t t cgcgt cgt 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> tccggatcat gcatcgtttc cgtgaccatc acatacaaat caaaccctct gaatgctaca tccataagtt tatatctggt ttggaaggaa cggctcgtat gatcctagaa ggact t gagg atctcaggcc atccaacgtt ctcctcttcc tcaagcttgc tgatttcggc tcatccaagg ttggtacggc gcctaagtac atgccaatag atccgagcct tgatatatat gctctagggt ctatccagga ctactttgac gaatattacg aggctagaga tttcttaagc cggtgcagta at cgcat cgt t ct at at gga aat ccgt acc ct ct t cact c cgt ccaagac aacct gat t c agtct t t t gg gt gt ggt t t a aat ggaat ct acat gcat cc Page 71 ccaagcct ca gt acgcgt cc agagt ct t t g ccat ggt t ac tcctggagag cgat t at ggt gcctaaggga t gagat gct t gcgt cgt cat acacaggccc aacaagt t t c gaaggt aat c gt ccaacgca gt t cact gca ccgt gggat c t ggat gt act gt gat cat cg ggagct at ac at ct ct ccgg accgcggcat 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt atctcttgaa ccatcctttc attactcaaa tacttccatc gccaacaaca gaagacaaca aggagat at c t ct at ct ct c tggcaaagcc t gt gt gt ct t t aagct t aaa caagt at at a aaacccaat g t gaat at cca gat ccaaaga ct aacat cgc caaacgagag at t at cagat taccgcggag t cgagt t t gt aaaggt t t t a ct gagt at at t t agagt gt a gt ct gt t aga t t t t t aat t g t t t t ggtcag aagcccaagc at cagagat t acgagt t cat t aggt cgt t g t t gt t t gcat t agccgagat t gt t t t caat t act agt aaa cact ct aat a aat at gt gt c t ggct t gaaa t aaat gt gct cacaact aaa t cgt gcgt ga cgat at t t ct aagaaagat c ttgt t acttg cct t t t t acc t t t at at t ag cat aaaat t c ttttgttgga at gaaagaag aat gaaaaag ccact acat g agacgaat ga cgagaagcaa at t ccaagt t aaaaaaaaag gt accaagga aat cgt gaaa ct aaaccaaa ct ccccct ca ct ccacaact ct gaaaat aa taagcgacca ccaagt at at gagat at gca ct acaacaag t t at ct t at t cat aacaaaa agt gt gaat g t cct gcat at aaggt t t act aaaaaat aac aaaaagaaga t cact ggcaa ttaaagccca gat cggcaac t ct cgaagct aaaaaaagaa t cggt t t gac t ct t cat t aa act act aacc agatgaaagg t t t t t t caat acgt t acgag gaat cat cga acact ct aat at cgagacca act t ccat ct t aagt gt t at aat t act ct t t gct gagt t c t t t atcggag ct at at t act t gt gat t t ac agaaagt ct t t gt t t at t aa agaagt aaca aagagcgt aa t ct t t t ggcc act ggat agt ct t gggaact gt gt aaaagt aaact t aacc aact t t cat c gagagaagca ct t ccggt gg cat gcgct t g at t t gt t ct t aagt ct t ct t ggt gat t t ca act t t agt t a aagt cagt at aacact cct t aat at ct t ca ct at ccaat a at aagt t t t t ct ct agaaac aaaaaaaaaa at aaagat ca accct agcag t aat cct ct c gaat t t gct a cgt t aaat t g t at acat gat acaact aaac t t ggact t ga t caat ggaca t ct t gt t agt cccaacaaag t cgat t cat a ccgt gat aag t t t t at t aca ggt t t at t ac t t t t t ct t t g cat gt t t aat t at at t at t a at t t acat t t tttttttttt aaaat ccaat aaaaaaaat a gct ct cct ct cagacacaac 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 66 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 66 gatggaggcg acactggggt tgtggacgga aaacggaaca aagctaaggt gggtatatat ataaacagct ttgaatattg agttattgct tttttttttt tttttgtata tgcagccatg at atgatatc aaaaaataat gttgatggat tatatttttt tacattcagg ctcttacttt aacggcgaaa at gt aaaagt gat at t gaaa cacaaaagcc t t gat cat ag t t acaact cg Page 72 cgtggtggca at aaat ct t c ct t t t t t t t t ccaacggt ac t t ggacaagt aaaagt ct ga aaact cat gc t ct t t aat t a tttttttttt gt t gaaacat gt t t t acgt a t agt gaagaa 120 180 240 300 360 12689250 Sequence Listing.txt tctgaaggtg agaaatgcac agcagatcca gatttcgatt gaaaaatgct ggt ct ct aat cact aacacc acat t aacat agagat ggat at agt t gt t t at gt t caaat t t t t gaat aa at t t t t gaaa tcaggcgtga act t accagg ccccat ct gt ct gct agcaa tcgaccaaga aat ct at cca at at t t t cag t agcgcat cg t gt gct t gac t gat aaagga t at caaacac t at act t gt g t at aggat t a ttgtat t ct t t at aaggt t a aggcccattt aaaccct aaa agggt t t t ct t t t gaact ac gt cgt ggt aa caaaacat t c tt gt t t t ct t t t t aaggct a t t t cct aaat caat gat at a t at at caaaa acat gagcag ct gt ggat gg t gaaact t ca t aat t gt t at t at t at at t t ct act gcgac at act cat ca aaat ccgcgg gaaaacgcaa agagt gaaca gct gt t ct gc at acat at ag gat at at at a t agt at agga acat act aaa t ggacct t at aagcccactt aat cct t at t ct t cct cacg t ct t cgacaa ct gcgcct gc gagt ct ccga tat t t t gtcc t at at aat aa at aggcgat g act t gcggt c gggt t cgat c tat t gggagc t gct aagct t caaact agt c t ccat t t aat caaaacatt c aagagcaaat t cat t ggt aa t ccaagt gaa t aacgt t t aa ttaaaggagg ct cagt gcaa at at at aaat t acat ggaca at acat gt at ttgt t t atct ggt at t at t t t ct gt t t aag gt agcagct g caaagcaat a ggat agt cct at ccat cat t gt gact t aaa tttacggaga at t gcat at c ccggtcacgg at aat ct aat ct t ggagat g t ccggt acag ct t cgat t t t ccaacccaat agat ggat aa gcact acaga cacat at t ct gaacgtggt g ct gcagcaag aaaagcaact ct ccact t ag acagat agat at aaat aaca aaacaat gt t aaagt t at at gt t ct gat t a ttaccaaaac gagcaat at a cat cgcagt c aacaccgat g ggaacaggtt aaagt t gct t t t gat ct aaa t at t gaaagt t at caggt at act aagaaaa acaat t caaa acaat ggagt aact aaact c t t atcaggga t gt t aagaat ggt t cact gc t ggat ct aat taccgggaca aact at ccat t gcaccaat g t t act t agat cgat t acaaa gat at t gt ct ct t gagat t t gaataaaggg acgt gt gaca aaacccaaca t at cccaat t t cccact gat ccaacgt t ca gt at t cat at act at t t t at gct t accaag gggagt t t t a ggat cacaaa caaat cat t g t t t gat at t t ggct t t t gt c aagaat caaa t caaaaat t t gggtcaggaa ccgat cat aa aat ct gat ca cat t t agt at t aagt ggcac gccaaggaat ct aat gt ggt cgct t agat c at at act aaa aact at gaat gt t gt t t t at at acat t ct t t t agaat gt c aggcct t tag tgccgcaact t cagcagct c 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 67 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 67 caaatatatc cctttccgct gcagctttaa tgacatgccc atggatctct ttgccgttct tcaaagtcca aatatctgaa caagcagata aaaggctagt gagacatttc aaactaggaa ccataactac tgatagcatt ctctcaaaga acttgaaagc ttcaattact ttccctaatt Page 73 120 180 12689250 Sequence Listing.txt gt gaaaagcc act gat cagt gaat t ccacg t cgct gaat c aggct t aagt ccct cggaat ccaact t t t c t aacagagt t ct gat t t cca caaact gaaa aaagact agc aaaact t cct at ccagagat tcgcagccaa aaact t ccat t at ccccaca ccct agcat c aaaccgct gc cat ccgt cac caacgaaaaa caagct t cgc aaat cacat c t t aggt t t at ct ccat t gt c ggt t t acaaa acaaaaaaat t t gat t agat t cat t t ct ct gact t t t t ct gaat ct ccac t t t ct gt t ct t cat cat cct t t gt gt agt t ggt gat gat c t t gcat at cc at t t gat aga aaacaact ca ccaagat at c acaccgacat ct ct t t t t t c gcaagcagtt cat gagat t g aaaagcat t g aacccact ct ct caaaacct accacccaaa cccaaacat c at t aacact c ct gct t cacc cccagt t t t g gcat gat t t a cagcgaagtt cat ct agaaa gt cgacaaca t ct ggt t aaa at ct gaagt c cgaagct t ct t t t ct ct cat ct cat cgt cg cgat ct t t t a acgat gat t c t t ggacat t t t t gt at agaa at t aggt t ag aagct ct gag gt aagagaag act gct gt ct aagt t cct gg t t cgaat aca at gacaagac at t gcat t aa aaaacact ag t aggt aacaa ccacat ct cg gat t t cat cg acact agcaa ct aaacgcat gcaat acccc t t cat at aca acgact t ggg agaagaggcg ccaccggt ca gat acaaaat aacaat gaac accggt t t aa t ct ct gact g agaagat cac cgct cggt ga t t gt t t gaat cct t t t cgca gt t gct agat t cgt gt gaat t t gat ct caa gt t at aaact at agt t ggt t cat gct gt cc t at ct t t cag t at caat gag cat gaagt t g caaaagt aac gaaccaaatt cact ct t at g aat acat t ga ccaaacaat g cggt aaccga ct ct acaaaa t ct caggcat t act caccaa cat gcagaat ggaat gt gaa cgagat t cga gt aaacgaga aaaat caaaa t t t cggt t aa tacacagaga accact gcgt at t cgaggat t ct ccggct g caacagt gag ccat t gat t t t gat ct caaa agct t cct ag t gct t ggcag t t t gaaat ac at t aat cat c ct cggt gaaa agcagt acca cct accat ac at cat t t ggt cat aact cca aggaaccttt aacaagagag caat t gcat c at t cat ccca cccat t ct cc t t cat cgagc agccgt t gcc ccgacct t gt ct t gt t agga gact cggt at gaacgaacac t caaaat cac gt gccact cg gagagat aaa gcgt t at acg cct cgagct t agagat gct t at ct ct ct ct t agt t t ct t t gct t cct agg t t gt gct t at at t caaat gg t gt aat ggt t at t ccagaga acaat gt aag accat t gt ct t ggagat t ca t ct t cact t g t t ct ccat ca t cgaacat t c gt t ccaacgt cct cct t caa gacccagaaa aaaagcccag acct t aagag gt gaacacat acaacat ct c gagt gagaca ct ct gt ct cg at caaagct c t ct t t t at t t cat ggcct ga t agat at gt g aact t ct aca t t t gat cgga gct t gacgac t gt t ct ct aa t cagat gt t a ttaaagagaa aagct at t t a t t t gagat aa ggt t act ggg 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 68 2000 DNA Arabi dopsi s t hal i ana Page 74 12689250 Sequence Listing.txt <400> 68 gagt t gccgc t ct ct ct gt g t aagct cgac ct acagaaaa ggccaagt t c ggcagat caa t ct cat ct cc act gt gct ct acaacaacaa cggt t acaat gggt at t caa agccagat aa accat cat ca ct t ct cacca t gaacgat t c at aaaccaga at gaaaat cc gaat ct at aa aggct t aagc t agact caga ccact t t caa aaaact t gaa gat t ct t at c cctt t ct t cc gtgaagagag ccgt at gat t caacaaaaca att ccaaagg at t t ct t ct a t caat gct t t t ct t t aaacc gact t t gaaa t t t t at at t t aggagactcg t t agct t ct c acat aagacc t ggt gat gct aagaacat t g t ccaat gaat agagat t at g ct ct ctt ct g caacaacaac ggct caacca ggacaat gat t t gt cagat a aagtaggaga t accaact gg t gcaat at ct caat t gt t gc aagt t at gag t cat cct agt cat agcat t a gat ct at cga agat t cgt cc ccat at at ac t t t at aaaaa t aaagat ct c aat at cat aa ct t gt aat t g t at t t cat t g t t agact t t c t t gt t agt ga cacaaact ag agat t at at t act t caat t t ttgt t gattg ct ggt cat aa gt t acgggag gat t cat gt a ggaat gaat g gat acaagag gt at t t agt c gacact aaac t caaat ccac aacaat acat ccacct gcac aat gat at gt agt agt ggca cagt acat gg t ct ct ct gac t at ct t t t t g cagat aat gg at cagat t ga at at t agt ca t cat cagct t ccaacgaagt t t t at ggat t t cat cagaag t ct t t cgt ca t cat cat ct t at gggct at a ggct tt ct ct agct cct cca ct gaat cct t t ggat t ccat gt t t cggaat gaagat ct gg ggt gaaat t c at ct t gt tct tgagcgacga gat cgcacct ct t t t cat cc gaagct tt ct tgatgaggcg aaggt t cagt t agagagct a at caaccaca ggcgagct t c ct agccagca ct cct gt t t t cggcaatggg aagat gagaa ttgtct t t gc ct t ct t t gt t ct t t t gattt tttctttagc aagcaggagc t gct caaact gaact gt t t t agct cccct acaact t aac gaaacagt aa t cat ggt cgt t aat gggccc ggct ct act g ct gaat cgct caat aat ct c tat t t t cct t t t t t gt t t ca aaaat gggt t t t t caaat cc ct gt t gaat g aggaagccac t cgct t t acg t t aaaagat a tgggaaccaa gccagt gt cg tggtggagga caagggaat t tgacaacaac t t caggt t t t t cagt at ct g gaat t t aggt t gagt t cgag cacaagggct at cagagaat t at t ct gt t a tgat t t gttg acat caaggt t aacat t acc caacat ct t a ttccccaaaa gt t t ccaaat t at ggcagct aagt aagacg cagagct t t g taagaggccc act cgt ct cc ct cat cct t c t ct t cgt t gt gt gt t acaac at ccaat gt c t at aat t t t g gcat ct t gt a t t t aact t gt agcct gcgt c gt t agat cct gat gt at at g gagat aggat t caccgt cat gggacaagct ggcgact caa aacaacaaca ggcccgatga aacccgcct t cgat acaccg t t at ct gat c t at gact ct t ct t ct t acaa t ct gct at ca t t t t atctcc t ccat at aat t t t t aact t a aacact t aat ccct aaccac gaaacct caa t t t caaat at at gat agaga ct acagagac acat gaat t t tgaagaacaa gct t ct t cga ctt ct gt aac t t t gat gagc t ct at t t ggg taaagaccgc aaaat cacat gt gcgt t agg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 12689250 Sequence Listing.txt tagaacagta gaagaaatca 2000 <210> <211> <212> <213> 69 2000 DNA Arabi dopsi s t hal i ana <400> 69 t cagaat cgt aat caagaga caat at cact gt t cct t cca cacccatttt aat t aaaaca t gagat gaag ggct gacaaa t t t ct t ggt c t t caaagat t caaat cgt at aaggt t t t ct aaaagt gat t agaaaaagga gt aaacgt at t t ccagct t g t cccaaat cg tcaacaaaaa aat gcaaaat agct acact t caacat cgt t aagct cct ac aaacaaaaga ggat cgact t ct aagat gat t gat t caat c gccat t t t cg cagaagagt c ccccct act a t cct aacaac t at at t gt t g acaat gt gt t ccct at ccat gt cccat t t a t ct aact t gt cagaaaactt caat t ct agt t t gcat at gt cgt ct t caat t aaagt at t a ttttaagaaa tttttgcaaa aggagaaacc t caat ggcga aat t cct ct t t t t t ggaaat t aaccct aac ccat aagaat act aaaccac t cagat acat at t gacaaat t aagaat cac t ggggaagct cat cgt gcga ct agggt t t g t ct ct t t ct t aagt agccaa t aaggct agc t at agt t t gc acgt t agat g t acaagat t c t t t gcat aat ct aaaaaat c ggat agagat gt ccaaat ga t aat agcacg at ccaagaga gt at gat caa act caagaaa ct ggaaccgt t cccat t at g at gagagagc gagagcaaga caat agccat gaacct cagg caaacaaccc gaccat at ca at t gagat ca t t ggt ct ct a at t caaggca gat t t ggcca ct t gacaaca agaaaat gac caccggaacc cgagct cgag aaggggcaaa ccaaat cct t gccat t gt ct t t t at t t agt at t gct gt t g t agct t agca gat t caaacg at t ggt gt at aaact act gt t t at gat gt a gacgat cagt gaccat gct a aagt gt caac t aaaagagt c gcat t at caa acacagcgat at cgat ct t g cact at ct cc agt cacct gt t t at t agaga aat caaat cc at aacct t ca t t t ccact ag aaat aat cat aat t t t at ga t cgagt act c cggat cgt ac ct caggt gac aat t agggat aat agt t cag acgt gggct t gt ct ct t t cg ct t t ct t t ag cggt t t cagc agt t t t t aat aaacct gaaa t t t ggt acaa ct gt at t cga at gt agt at a tttttaaaac agaat caagt tttcacaaga aat aacaat g agagagaaga t agct agaca t cagct t ct t act gt gt cca cgat at t t at gagt t gat ga at aggaacaa at t t act aac at caat acga t t t gt t ccca agat cgagct t at cgacgac caaggaaccc gaagaat t gc t t gat t t gat gt cat acgag at aaat gct g ggt t aaact t t cat t at gat at ct t gct t t at t gt aagt t t gt at ct cct at gct gt agt t ccaaaagat t aagat ccaa aagt at gt t t gt at t at at a ttcaagcaaa ct ct ccaaat acct cat aca t t ggat gat t t gt ct ggaat aact at ct aa cacaaagcca aggat t cat a caacacgat a aacact aat g aat caaccaa at aat gagga gacct t agag agct t ct ct g aat t t t agac at t t ct cagt gagagagaca at gacacgt g cccat aat gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 agtaaattcc ttaatcagct tctaagtttt tatatatttt gtaacttatc tcaaatagtt Page 76 t ct aat t t aa aacaaact ct at at cggaaa gagt t agat c 12689250 Sequence Listing.txt cataaatgac ccaat t gggt cactcgagga ataaactgca atttacgaat cactaataca gtatttgcaa ttccagagcg acgggatctc tctccggtaa cttttattct cagcaaagcg agatctcgac gtttctcaga tccgtccggc ct t gagaaat 1860 1920 1980 2000 <210> <211> 0 <212> r1 <213> 2000 DNA Arabidopsis thal i ana <400> ct gaaccgaa tcaaacgaaa cat t gaacca t t caat cgga gcacggtgtt t acaaat t at t gat cgt gat gaagt cat t g at aacgt gt t aat at cgcct t cggt at t at t t cct gtt ag aat at t t t at t gtt gat gt c acccat gttt ct gt caaggt t gat t at t t a agaagattt a t at t t ct at a cct t t ggt t a gat t t aat t g t t t t gacat a t ctt gatt aa cat caat ggc t aacgacct t t cat caccca ccggt t cgat ct acct agct cct cgt gct c cggt t gcgat ct t gt ct aag t ggt gat caa gt gt ct cgt g t t t t gccat t t cat gt at t t cgt ggt at at t gt acat t t g at t t t t acat ct at acgat t gcaaagt aaa t ct ct t agaa t t cacact t t t t t t aat gat ct at t at t gt aat t aat ct t agt t gt at at ct aat at t t a t gt at t gat t acagt acaat gt t ggcgt gt gat t acct t t ct aat t t act ccggaccggt tttggatggg t t cat cgct a gacat cat at cgcat cgt aa agt caat act aggt gagt t c aat ct gt act t at gt cat t t aacgacagaa gat cgt t aag ttcat t t t ct agcaagat aa caat t ccgaa aaaat t t gat caat t at at a at t t acaat t at at agt t at agt t t t ggt t aaacccccaa caat gcgaag t ct t gt caaa t t aat cat at cacaat t at g tttttttttt t cgt t ggat c ttagtgagac gt gcacacca t gt t ct cat c act gt cccac cgt at ccgaa cat gagt gt g cact agagcg gtct t at t t t tctctat t t t agccaaaacc at at act aat t at t t acat t aat aacact g at t t gat ggc t cct agt t t t ggt t ct agct t agccat aaa ct t at agt at t aagt t t t gg t t gt at t at t aaaaat at at caat t at gt t acaagt cct t caat gaat gt t t t ct t t ct t t cacaat gct aaggcaagag at gcgt aggc gt t gt t t gat gat at cgcca cct ct gat t t at t act at t a t t at gt ct t g gt t t ggaccc t cgat t caaa gt aaaagaat at acat at ac t t t gt aagat t gacgat agg act at ggt t g t tat tct ct t act aaccaaa agcaaagt ac agagagt at g t t t at ggct t t t ct aaat t a ct gaagat gc tggataggag t t gt cat cgt gt gagat aca gcgct cgt gt act at agagt gat caagt gt cagcgt t acg ttcaagagac aaggacgggt ggat t t t t gt tt gtt gtt cg t t gaat t t ga cagt gat t ct t agagat aac t cacacat t c aaccct at ag aat t aat at t t gt agt aaag ccat gaagt c t t t gt gt caa gt t t aagacc gt aaaaact g t acagat t ga at ggt t t aag at gagaat ga t gacat at ga t t gctgagca cgagagtct t cagt at gcaa t t gcgt gt cc ggat t ggacc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 77 t at at at gt g acagat aaag cccct ct t ca cact at t t gg aaat aat aaa at t aacagcc ct cat t t at a gt ct ct ct cc <210> 71 <211> 200( <212> DNA <213> Aral <400> 71 ct t t aat caa acgaagccat t t t cct t ct t t cagat agag t t t t t t t ct t gcct gt cgag agggt t t ccc ct ct t cgt t t t t gggaagat cct gaggt ga t ccat cct t t aagct agt t t aat t gacact t t t t agt t gg t gat gaaat c t t gcaacat c t t t acaat cg t t at t t t t gt agt t t t agt g agt at gt t aa at aaat t t t a t at t aaaaca t gt t t gggct t t t ggt agt t aat aaaat gg cct ggt aagg atgaggagga cat t gggct t aagaagaaga agacgaaaaa 12689250 Sequence ttgtcgtaac ccacatctgt tttcactttc atattttttg tttttttttt tttttttttt at ataatatt gaaact t gct ggaaataacc aaatttcggg ttatgatatc agttttagaa accctagccg cttatatttg Li st i ng. txt ct t ct gt cgt gt t aat gat t t t t t cccgt a acaaaaattt tgggctaaga t t agggt t t t gt t gaggt t t t t t gct t t aa t t cact t t ca t gt gt gt gt g gactagagag ct t gat t aca t gggcagat g ccaccgcat t 1620 1680 1740 1800 1860 1920 1980 2000 0 bidopsis thal i ana gt caat ct t c t t t t ctt ct t aaagct t cag cacgat cacc acaaat agaa ct cgacaaaa cagt t t t agt tgggagcacc at t t t gcaca gaaat t gaga t agccaat ca agaagaagt g gccccacttt at aaaagaag t cact cccat aact ccaaaa t t t at t aat a at aaact aac act t aaagt a aagt t t gt gt gat t at t gaa aaaaaat ct c t ct t gat gt a cct t agct t g at t t t gcttc t gacaagat a gagt t ct t ca t gcct t t aac tttctttgcc agt ggat t cc act ccat t t t gtgtagagac t ccacaaact ct cacaact t at aagagcac acccacgact ggactgcgat t ct acact t t t agt gat t aa cacgagt t aa t cat gt agaa t at t cat t aa t t cat act t t tgat t gt t t a t cacaggt ct ct t ccaggt t ct cat att ag act t at ct t t ct t accct ca t cgt at t gct act t t gt cag t t t ct t t t ac t gt t t t t t ct t t t t gccttt ct at gggt aa ct t gacccat cat t gcaat t tgt t gct t t c caaat at t t c t aaccaaact gt aagt t at c ggaggt gt t a t ct gt t gt t a agat t t gt aa t at aaagt ca t gat t t aaat t cacct acct t caact gcat cat ccaat gc t aagcat at c t t at cagt t a t ct gaagt ac cct t gt ct t t cagcat cat t t agggt gat g gt agt t at ac at cat cat ga t t gaaact ga aaaat at cca t ccct ct aga aagat cgt ct aat cat aagt aagt at t aat ttggt t t at g t t t aat caaa aaagt aat ct gt aaat t t t t t at t at gt t a t t t cgt agt c ct at t at gcc agcaccaat a t aaat t gct g gct caat act at t cacct t c at t agcaat c agt act cgt c aat gact t gg t ct gccaaac t acct t aagt t gaacat agt agat gaacat aat t gaaagt at ccat ggaa t aaat t t caa t t at agagt t at t t caaaag aaaat t t aaa aaaaat ct t c t gt gt t at t c t t ggt t cat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 actttctaca ttttctttca caaaataaag tcatagaaag tcatgaattt tttttcaaaa Page 78 12689250 Sequence Listing.txt gattgttttg agaaacttta gaagaaaatg tatttaatta taatattcat agctggaaat t gat ct aat a t t t t gaact t aaggat ccat cccccggt ca accgt t aaaa t ct ct t aaat t ct caaaaac acgt t t t at c ct gat cggat caggtttttt t t t aat t at a ggt aat t aaa aagaaat t aa gagagt t gcc cgt cat cgt t aaat ct t aga t t t t t ct ct t t t cct t ct ca t t t ct ct ct t cat cact t t a at at t aaagt aaaagaaagc ct aaaccgga gt cacaat ca t cat at cccc gat t t ct gca t t ccgat t cc t agct t ccga cagt gt t ct c t t ccagt t at aat at t t cag t ct ggt ct t g cgcggagaaa ccacaaccga aaat t cgt ca aat ct t ct ct agaat caat c t aat ct aat g ccaaaagaag at t t aat aca aaat agt gac gagcgacaaa acccct t t cg ct t caat t t c cgt ct gct t t t agggt t agg ggaat t t cac at gt gagt t t gat t cat aag ggacgcgt gt t caaacct t a t t cgcccgt c aat t t t caga cct caaaggt t at t t caat t at gt t t t ct c 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 72 2000 DNA Arabi dopsi s t hal i ana <400> 72 tcaacaagaa ct aacct t t g ccct caagca aacaccaaga t cacaaaat c t caaggat cg ct t cat aaca aaagaggccg aagaaaat gg t gt agt t t ac t t t aact gcc gaat ct ggac tttttttttt gagggt aaaa agacat t cct ccacgt ggcg acacgt gt ac agt cacat at cgacccaaga caaat gcat c acat t ccagg t ct ct act t g cacat agct t at at t t gt t g ggat t aat ga agagt aaaaa aat t caaat g aaat gcggag t cggagat t t aggaacggt c t agct ggct c tttgagagag t cggt aat t a ccacagagat t t ct ct caat agt t gagat t gt gccct ct c t at agaagaa at ggt t t caa aacaaagcca ct gct ggt aa cacaaaat ca acat t act t c act ct agact t at at ct t t g t t at cagct g aggaggagca ct t ct cct t g accgt t accg aaccattttt agagagagt g agaaagat t g act t ggacca ggat t t ggga caact gcaat aaact ct at t cct t t gaaaa caaat gt gac cgt t ct gcaa t aat ct gcca t at t t gat ct cacat ccaaa aat caat acc aagaat caaa ct gct aaaga aat t gt t t ac t gt t ccggt g t gaaact cga ccggcggcgg gt aaggaaat aggct agct a gcct t t agaa t ct cgcacgt taaacgaccc at gt agt at a at ggt agaag aagt agt agc gt t cat ccat cat t gt ct t c ct t cacaaaa agaaagat t a aagat t caac aat gct aaag t t gaaact ga ccaaacaacc aaacct ggaa gcagcgagat agagagactt gggaaat t t g ct acgact gt gagacgt ct c gat t ggagga cgt gacat ga aaaact gt at ccagt t cagc ct cat t t agg t t cagt aaag cggacct gaa t cat at t t ga gagacaaaaa t aaat ct caa at agcaaaca agaagacgat aagacagt t g at cat cgaaa ccct t cgt cg t gt at t t t t t aagagacggc t gt ggt agt a t gct aat t at gt cat cat ac cat ggt t at g t agat acaac t aaagct t t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 79 12689250 Sequence Listing.txt ttct t cacaa ttatcgtaaa at at t agaca tgtagagcag gct t gt t gt t caagcgt aag ccat t gt t cg t gact aaat t gacacaagag aaaagacacg t t aaat t t ac t acaat gt t g t gat ct t at g gt ccgat gga t gat gat t gt ccact at aaa caggt t cgt g gat t cat t ac t gaagaat ca t aaat gat t a at t t gct t t g t t at t ct gt a aagaagcaaa t t t gtatgt t gct cact t t c caagcccat a ct cat t t t ca cgat t at ggt ggtaaaaagg act t gt t gcg t ct cgat t t t t gagct t cgc tttcaagcca ttgt t t agt t gt t t ggt caa at aat t t act tatgt t t ct t t ct t acgcca t at t t gggt t t t at ct t t t a t t t t t t t ct a aaaaggccca cccat t at ag gcgt aagt t a gcaaact at c t t t ct gggt t t cgt gaccaa agat t gt at g t t ggaat ggc acgccaaaag at accat t ac t t acat gat g agt at ct t t t at gaaat t t t t t t at t caat acccaact aa gggt t t t gag t ct cgaaat c t ct at ct aaa ttggt t t gt t ct t t caagt c aaaaccaaag acacgt t t gt gt t aaaaat a at t t t ggt ca t gt gcaacca t cgt cact t a ccaaagct t a gt at ct t agg aat cagcggc gt cct acat t aat t t cacag ct ct t at t gt t t t t ggt t ag aaaccaaagc aaaggt cat a at ggt aagac aact agt aac aaggcct at t aaagaat cag t ggct t at gc ct gggt t aga gt t t cagt t t t t cacct ct c t ct t t ct t ca at t t cgt gt g 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 73 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 73 cacatatata tgaatcagaa aaagaaaat a taaatttatt gaatattagt attatatatt tgatcatata actttatata at t at t atat taggctttct tgcacaaaac gtcaaaaaca tttttgtttt ttttttttta aattgcagat atcttaacat aaaagtcacg tatatcgaga gtgtcagaaa t at acgagct at acgt gagc ggcaagcatt tagcaaaacg cataacaaga agcaagaagt gaaagaagtt aaaaacgacg act atgataa ct t ggaagat tacaagatga ctaaactcgg tatgggaggt ggagccaccg gcggtggagg cgctgcatcg acggatcttt ttccgtgaaa aactttatta taggatcgtc aataactgat caagtttcgg ttctttaacg taatgcaagt tttttttttt tcgtgtagag aat t t t t t ca gt aat agt ga t t t t agaaat at cgt at aaa gct t ct agga agct t act aa t at at aagca t ggaggat ca acct cgagt c aaggt t acgg at gct cccac ct t ct accga aacaaacaca t gt t t caaaa t t t aat gct a ggt agat gga aat t t t gt at aaat gat aaa act t aaaaac ccct agt t t a ttgccaccgg gcaagt gt aa gaaaaagcca t at caagacc agcacaaggc t cct t caggc t gcgat t aac aagggtagct aacaaaaaaa aat aat t ct c at gt ccaat a aat at t t ct g at t at aaaga t t t t aat gt t gat aacgt aa t at accgcac acct ct cacg ccaacgaccg ccat act t gg caccaagagc ggt ct t ggac cgtcagggcg t cgct t gt ct aaaacact t g aagt t t gaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tgatatcata ccaaataaat aggaacagtt ttaggacctg tttcgtactt caattttctc Page 12689250 Sequence Listing.txt atttcatatc cattcatata gttttttttt ttttatagtt aaagagatac gaacctaat c accat ct t ca t aact t ct t t acact act t c t agagt cat t t t ct caaat a ct agct t ct a t t gct gct ag t ct caccaat t gt aat agt c cacaaaagct aaacaccgt g t at caagt ga ttaaacccac cat aagt act aat t aacgt g cat ct cat ga taacaaaaca act ct at t cg t acat ct t gg t at t t t t aat t at aaaccac t t t acat gga t at ct gaacc t t caaccat a taaagaagaa t caaggat gt at gt caacga aaaat cat ct tttttaccgg tttaagaaaa acagccgaca t cggaat cac agcagaacct aaaaaaaaaa t agt t gt aga agt act at gt t t at t t t t t c ggaaaagcag aaagcagaat t ccaaggcat t at ct t ccac t cact aaat t t ct aaat gaa agacaaaat g t ctt ct t t t t t acct gt t t t gcccact gaa tcagcagaca acgagccacc aaaacct t t a t t aaggagct aacaaccct t gt caaacact aat act at gt act at gt caa gt t acact cc aat t aat t at aacat gt t ct aaggct caat caagt gt gga tttttttttg tcct t gt t t g aagcct act t acaagccacg gt gt act aga at agcaaacg ctt ccaagag t t t acact t c aaggt t t t cc caacaat aat caat aat t aa t cct aat t at aaat gccgct agt ct caaaa aaaaccaaat tat t t t t cag gt caaact ct gt t caact ca t aaaat t caa t cgat ct cga t agct ccat a aaaaat aaaa at t cat t ct a caaagagt cc gt t agagat t t t aagat t t t gt t t t t t t ct gt agaat t gg cct ct gat t t aaaaaaaaca aat gaat cat aaact gcgt a ct acagagt t at t at t cgcc ccaaaaggcc ggaaccgcgt aaagcagaca aagaagaaac 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 74 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 74 ttaatttgtg ttgaatttct gtagcagtga ctctcccctt caggctttga cagttttgga tatagacgag attagagaac gtgcatatca attttggaag tattcaacat caagagaatc ct t act t gt a t at at gt ct a t t t t cct t gt attccggtgg tagagatcaa ggaagcaatg tggtgaatga caaggaaaga atagctgaag agtaagtttg cttgtcattg tatctttctg gtctcggtgg acactcgttt gatttcttgt ctcggtagtc tattaattgg tagtactggc aagtttttat ctcgagaaaa aaaaatgaca catatcaacc tacattattt tacaacaggt ct aaagt at t gt acat ggt a aat t t cggt a ct t t agt gct ggcagacatt ttaggaagaa tcagacagaa gaact ggt gg t gct agt ggc act t t aact g tgtct t t t gg at cgcagct c Page 81 t cat gct gt t ggccatggat at gact acct at at ct cagt gt ccgat t t t at cacagagc ggct gct gct at agt t t t ct t t acaat t gt t at t agct t c t t cat t caaa agcaccaggt t acct ct t ct cagaacgt gt t gagt acat t agt t gact at cagt at at t g ct ggt ggct t aat agagat a ct ct gt gt ga t at t at gct t t gaat t t t ga at at gct gaa gggat gt at a 120 180 240 300 360 420 480 540 600 660 720 12689250 Sequence Listing.txt agccttcagg cggatatggg gataaatatg attacggaag ccgggatgaa gt t at ggaag at cgt cat t c acaggggaag agcgggagga t gaaat aaaa t t t at t t t ga t ct t agt ct a cagt ggt gct ct gaagt t t a at ggt agagg at gaagaagc ct t t gaaaat gat ggtgggg gt t gct gct c act t t t gtta agtgcgt t t t at t t t ct t ag t aact at aac t cat gt ct gt gt gt t cct t t t cct ccaact agaaagagaa cagggact ct gagcagaagt t gat ggccat aaggaagggg at act agcgt gt t ccat t t t cgagct gat g ggcat t ct t t ggggctccag t gt cagt gat t gaaagct ac agaccccaca cagaggctgc at gaat cacc caggt acaga tttttcaaac t aat t cagt a at ggt at at t act t t t ccag gt gact t ct a t at ggt t aca gaagaccggt gt t gat aact t ct t cat cac aaaat t t t ga agat gt cct t caaat t t t aa at aat t ct ca t at t gt at ga aggaagtttt tcacgcagcc ct t ggt t t t t agt t accgct t t ct cct cca tt ct cagaaa at act caaac t cct cggcag aat gt t t caa t ct at cccat ct ggccct cc gagat gat ga atgggagaga at gggt cacg ggt at gt t aa agat t at caa gat t t t aaaa at ggat acac ggatgggagg caact cat ct ct gaacaaaa ct gt at at ag at t ggt gct g ccaggagct g act ggaacca gt t gagact t t ct t gcaat t cat agagat t ggct t agt t g t aaact ct gc agcat at gca t agaaat agt t ggt aat agg aggt aggagt at t t caagt c ct ccat t t gg at cat t t cga tcct t t t t gt t aat t t ct ga cacct t ct t t t at t ggcgct t gaaaggt ga at gaat ct t t ct t ct cct cc acacagccaa t cgat gaat t ct t t agt t gt acat gaaaat gagggt gt at t t gt t ctgag t ct acagat g gagcgaagt a cgt gat ggag gat gat gat t tcggaaagag gcct aaat cc at agaat at t accgcaat ct cct gt agggg t aaat t t ct a cgggat t at a ccacccagt t t cat t t t ct g ct at gccagg tcccccacaa t acaact gct t gat ccacgc at ct t gagat ct t t aat caa gat t gt at t c t gt ggccgt t gt gt t acagc 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> tcctgggttc tcgactgccc agaaatggat gtacacaatt ttgagatatg ttagcagctc aggaaagcta ctatgataag agacatagaa ctgaaaccat ttgtaacaga gaagacaata cgtcagcgct gcgaatgaag tgtgaggagc gtttct t gaa tgt t gtaaga gatgt t cttg taaccgacaa agagt at at g gagagt at at ttgt t ataaa aagt ct gt gg aagctacgaa gtcttcgttt ctttctactc attataacat gagaaact t g t gagaat agt t ggt aacct a ggaagaatgt t t gt agt gat aaagagcacc ct gggct at c aat t t ct gac gt t t t gacct Page 82 t gt ct caaat aaggggcat g t gat aagat g t aat cagcgg acaagat gat tt ct ccaaaa t act gct gt t aaagaaggt a gt ggct act g gggagagggc gtt aggaaaa t t gggt gagg agcacat at g gat accaaat t at gaagt at caaaat gaca t gt at t ct t c at aaaaat aa 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gaaatcttgg gtattttttg cagttccatt tgattgattc aaagatattg ctgaacgaat ttacacaagg t cct t ccgt t gggaact cac ggt aaact ga agct aagaag gt t t t ggt gt t gaggat cat t ct t t t at gt t t gt t t t agg gagaagt ggt t t gccgt cca caccaaat cc ccccaaaaaa cgagcat at t gaaaat ggat t ct ccgt agg gt ct ct at t t caat ct cct c ct t t t ct at t t agat at cga gt t t t t t t gg t t agct gcga gt t t ggat t t aagt t gat cg agagct acat t gt gaaaaag t t ggt gt t t g agaaaat gat agagct ct gt t gt t ct t t gg t ct t gcagat t ct cgggaaa at ct gat gca gagaacacat aaaaacagac aaagcccatt aaaaaact t a t t t ct cgcgt t at aaggt ga aaaggggaag ct ct ct t cat aat t gct t t c cagat ct gt g at t t ct at gt at cgact gag aaact gt gat at t gagct t a gat act agaa gaat t cccaa tttttacagg t t act t aat t cat gaaaact aaaagaggt a gt t aaat t t a gaagaagat a act t t gt acc t at acgt t t g t t t t at caaa gt t aggt ggt t gacccgcct aagcagacgt cgccacgatt cct t cgt ct t caaat aaat t acat at ct cg t cct t t aat g t gat t gt t ga at gt aat acg t caaaaat ga t cgt gaat t t ct t ggaagt g agct at cagc t t t at caact ccaacacaac aaagaagat g aaaat ct aat gt t gt gt t t t aact t cgagt cgagagaaaa t gt gt ct caa gt gaaaat t g cgt t t agat t tacaaaacaa ggcagaccct t ggcaat acc t ggt t gacca at cgaagccc t aagct t ct c t t t t cgt t t c t ct ct gt t ct at gt agat cg t t gact cagt t gt agct aga at t t gt gat t aat t gagt at t acggaaat t at aat t t gt t act t cagt t a gaat cct at c t t at gcaggg aggagt t gt t ct agt ct gaa gagagat t ga t aaaat gcct t gact aat t g aacat cggcc gat t cgt t ac ct gagaaat a t caaaaact a act ggt t t cg ct ccct ct t t t ccaat cgt t t agat ct gt t gat gcgt t at t t acgaaat t t t at gaat gt t at t at t aca cgcat ggt gg at caat caca ct t ct cgcag ct gaact aga aacagt t gt t ct acaat t t t tagaaaacag aagt t ct agt gaaacacatt ct t aaaat t g t t t at gt cga caat ct gt gt act at cgt gt aat aaaaat c caacgct ct t gagct gat cg t at ct t ct t c at t t gt gagt t cgcgt t t t c t gt agat ct g gt t cagt t ac ggt t gcat gc gaagcagat a 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 76 <211> 2000 <212> DNA <213> Arabi dopsi s tha i ana <400> 76 gtaacgcaac gggaatattg ggagtgggca tgcaattatt gcactcatga aaaataaaac taaaaatgat tttactattt ggctgaaggt gacaagtgtt tgggtggctt ggtcggtttt tgtatacgta agtttatgcc acgtgtcctc ttatccatgg attggacggc ttggaatcgt ggaatatttt attctcatac caaagtccaa atatattaca acctccccct ttttttccct tcacaaaaag ctaaaagcca ttgcttaaaa aaccaagaaa tactaaaagg atttggaaaa Page 83 120 180 240 300 12689250 Sequence Listing.txt gtagcaatcc tgattttgat tgatagtact ataatggaac accagtagtt acat aggt t g cat gacaaaa aat t gcaaaa cat accaaaa t cat t at gga t at acaccgt ggt ccat at g t ggt aat ggg caacaaagca cgt t gt ct t t ct t ct t t aac aaact act t g t t gcat at ag tagacaaaca aat at t gt ca at ct cat t t c act cgacact aaat aaagt a at gt t at aaa atgtct t t t a ggt t cat ct c t gt cat aat c acaat aacga t ccacat act agat gat aac t t at gt t t t a caccact ct c tagagagaaa aat ggaat t g t ggt at t gaa t at aaaagac t aaat at agg gcat gt acat ttaaagaggg agagct t gga act agaagaa acgaagcaac cct t t t t ggc ct cat t aaaa gt at at at at gcaat t t at t aaat t cggct t ggat aaaaa t t t t t t gat t aat aat aact at t gt t ggga gt at t t aagt ggt gcggt ga gaccacat at t acat t t aaa cat t t gat t a t t aat t at ca gt t aat t at a at t t t gt t t c t ct gct t ct c gagagagaga t aacat aggt t t gt aaat ga t at t ct aat a ccgt t caat a ggt t tgggac aat t t accag cggt aat aaa aacagcacgt cat aat t gt t aaat agt gga cccact at t c gcaaat gaat gat caat act gt cat gct t g t agt gaagat aat aaaagga aaaat t gt t g at ggagcat g aagaaaat gt gt ccat gt gc t caact gct t cgt t aaaaaa gagt aaagaa t t aat caat a ct at cat gca cat cat ct t c t gcaacacat t t at t t t at t act t t t caga t cgacagaat aat aat t t ca acgggaaagg ggaaat t t gg agaaggcaag gggtaggaca t agt cct t t t t t gctgccga at cat gcat t at gcat gt gg t ggt gt agt t at t gat ct at gat aacaaaa t at aaagt ca gagaat t aaa t aaaat t at c t gt agat aat t t at cct gac t t t t aat at g t gt ccacaat aat agt t gca acaagcatt a at t aagt t aa caacct t gag aacccact ca cccttttttt at t gt gt gaa t t act aat ca t t at ggat gt cgat aacat a t gcat ggt ca cgat agaaca t agt gt t aca t t t ct t t ct t at at t acact t at t t t acac at ggt acat g ggt acat t aa agat gat t t c agaacagaac tt agt t t t t t agt aagaaag act cat aact t t gt t aaat g agcggt ccaa at t t t ctgt a t t t at t t at t aagcgggatt t cagt at t ca ct aat t aact t t t cggt cac cagaaaaacc gaaat t agat t t t gt gaat a tggccaaaaa agat caaact gagt t t aact act t ct gcac gat caaat t t t aacgaaat t cccaaaaaga t t ggct t aaa at ccaat ct t at t cat ggt g gcgt t t gat t agt t gcat t a at aat aaaaa acaaagaaga t at t cgt ct c caat gct at a aaaat t agca aggt gt ccct ct t aaccggc ttttcttacc t t at t agggt t gaaact ct g gcagcagcaa at cat ct t gt t at aaaaagc t agaaagct c 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 77 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 77 aggtaacttg tgggtcttgc ttgagactca cggagttttc acctatcttt caatagttga ctttttcact aaataacaaa actgttttaa aatagtaact aagtggagat gcggggtatc Page 84 120 12689250 Sequence Listing.txt gat ccccgt a t gact at t t a cgagcggtgt t at t aat aga gt gt gat t ga gat ct agggt aat t ggaggg acaaaagt aa t ct acat gt c aaaacat at t at t ggt gt at cat t gaat at acaaat aat a gt ggt t agct ttcaaaccac t t ct cat aaa t at t at at t c ct aat t gact t t cat ggaaa at t t gcat t g t ct aaat gt t acccaaagat t caact ccga at ccacacaa aagt aacaac at t caacaga aat t at at gc at at t t t at g t t gat t gt at ccacgt t ccg ct t gt gt cgt aaaaagagaa cct ct cgcat t t aaaat at t gt gt t gaaaa gaaat cagaa ct aaat t t t a aaaat t t t ca attttttttt t t t gt t t t t c at gaat t t t t agaagaagt a t cagct t at a cat at gat t t agt ccagt ca aaacggatgg t aaccat ct t aaaaacaaag aacgaaacct gt t t gagt t t act t t gat t c t t cgact t ca t t at t t t cat gggt t agggt caagat agaa gagt agcaaa t aacaagt gt t t gagat t t t aaagaggat a t aat t agt t t ggat t t gat a ccacgagt cg ggacact ct t gaaagaaaaa gctaagcgag acaat at t t a gggact aat g act aaagt cg gaat agt t at gt t t gtatta ct t t ct t t ca aat ccact at gt cat t t cat aat acct aat aaat aat t aa tt agt t t t t t t acgcacct a gct cat t ct c tgt t t cattc cat cat caat agttcttttt agaat ct agt t aaat at t gt at gaaat gac cggt t gacca gggt gacat g at at ct aagt t aat t t gt gt t cagt t caca gat t t cat ac agt aat at t a t ccgaaat ag at aacacacg cgat t ct ct c aaaact t cac cgct ct acca at act t t caa at t gagagaa gat act at ct gt t ggaaaga agt aaaat aa gct gat t agt t gt t acgt ca gt at agct t c agat aat ct t t at gcat cat tttttttttt ccaaaagaac t cacgt t t t a t ct acgt ct t caat t t t at t ct t aaacgt c tttgaaccca at gt agaagt at at act t ct gat at t acga gat agat at t t at t at t at t t ct acaaaat tttttaaaaa aacat t aaaa at gt t ggaaa t t at at ct ga tttccaccaa t cact ct ct c tcaacaaagc tttgagctac atccccacga cgt gt aact g t t ggagat aa aat cagt aaa gaat aaagaa gt t t t t act c gt agt at aca at agacat t a cat t t at at a tact t t gttt at t t gt t t t t cat t t accct caat t gaggc t t t t ggct cc t cct t cat t t t t t at t at gg aaaaacaat g aaaaat gat a at cct caat a aat t gagt ga cgt t ct t aca gacct gt ggg t ggt aacaaa ggt gacaat a t caacat t ca t t ct t aat at at cagt t t t c t aaat t t t gt tcaacaacgg t cccat ct t t t t aaccat aa at gcaaaat a t gact aacga gt agagat gt aagt t aggag t t act at ct c t caagct aaa at agt t acac ttaaaaaaaa at t at t t at g acaaaaagt a cgat ct t aca at gt ggt t ca acct acccac gat t t t cct t gat ccct t t t agt t ct t t t a ct t at at aac t acaaagt ag gat agaaat a agagt t t t ga ct ct ggccaa at t gt at at t ttcat t t t ca aaaaaat aaa agaaaat aaa aat accagt a cat aat t t t t aaacgcaacg ct ct at aaaa act gt gagt a 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 78 <211> 2000 Page 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 78 ct t t gaaaaa gcgggtaaac ct t t aaccaa t t ct t at t aa caaagt ct t a agat t aat aa gat at ct cct at t t caaact gt act ccaat t gt aagcct t act cccgt at t ct t t gt gag t acct at gac ct ct ct t act t t t aaaact c aaat at act a at t t aacat a ccaacaacgt caaat gt t t t t agt t aat aa ct t t at t t t a aacat cact g tgtcggagcc gt t t t ggaaa aaaagaaagt gaaaaaaaaa aaaaaaagaa gt aaaaagag at caccat ca agt aat t aag acat agt t aa t ct at t t caa t t t t t at t t t at gt ct ct ag gct t act t t c at t t ct ct aa accaaaacat aaagat t t aa t act ccaaca gagat ct ct c t t gt cgaact gaaat gat at agct at agca ttcagcaaga aagct t agga at at ggt cca ggat t agcat at cct t ct t a caagt t t ct a gaat at gt t t at aaaaact a aaat t t t aaa t agat ct ct t cct gt t agt t aggt gcgt t g ct at t gt gaa agaacat at t cat act t aaa aagaat t caa t cat t at t at cgattttttt caaaagt aaa t caagcaat a gaat aat t t a tttcaaaaaa t t at t aagct act cat at ag act aggat t a cat aat cct t t t ct cccct t at ct ccaaga ccgagaacat ct t cgat gga gat t t gt t at agt at t aat a aact t act ca ccacaacct g t agat aact c t t ct t t aat g t gt t t gat gt aat ccct aca cggt t gaagt t gt ggt t ggc t gat t agcgt t cgt gat t t g gt cat cggt c agt aaggaaa t aaaat t t ga act t gat gga caacat cat g t caaaaat ga ct t aaaaaaa aat ggaaaag t cgt gt aat a t t t t caat ca gat t aagaac t aggaaaat t tcaaccacag gcat t agat a ctt att cttt t aagct t gaa act t gat ct t gt t ct act t c t gt gt t t gct caact cgaat t gat t aagaa aaacct aaac ggact aat at ct aacat at a act t at t gct at ct t t cgct gcacggggt g at cat at t cg t aat at t gat aggct aat cg gt t t ccat t g gat acaacca caaaaat cac tggagaagat gaagat caag at aaact t t t at at aaaact aacat aat t a cggt aaaaat aaagcaaggt agcat t ct t t t caaagat at act caaaacg ct t gggact c act cct aaca aat gact t at aact tct t t t cccaagcgcc aact t gacca t ct cccgt ga ct t t act ct c cact t aggat aat ct ct aag at at agagat t ct t t agat a caagt cat t t tt gtt acaag at gt t t t t t c gt aat at aat gt at gat at t t gat t t t ggc gt ct ggat t a at aat ct gaa caaaagccag caagagaat c agaat ccgac caat ccacaa aaat t aaat c tgtagagaga aaaaagt aaa at at aaat aa t ct cgaacct t gact t t t t g aagaat t act t aaacaat ct at at at at ag t at at ct t t a t gct caagt c gacacct ct t t t t gt t agt a t t ct caacac aacaccgggt t cacacat t g t t aagaat aa t ct t aat ct c t aat aaacca t t t cct at ag caaact t at t gaat acaaaa gaagt aacat t t gt at gt at acat gt t gt t t gact ggggt at t t t gact g at aat t gt aa gagaact t ga cgat gagaaa ct gaaacaca ccaagccaaa acaaccacgt gaaaaaagt t aagggaaaaa t cgcgt gt gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 tatctgaata tatatataac agcaaggttt ttgtccgcat Page 86 12689250 Sequence Listing.txt gttcacctct cagttgtttt ccttcttctt ccttatctcc gcctctttct agggtttctc tcaaatcagc tcttcaacca 1980 2000 <210> <211> <212> <213> 79 2000 DNA Arabidopsis thal i ana <400> 79 caacct ccca acgct gaat c gt t cccct ca t agt ggt gt g at aaacagaa gt t gt cat aa aagaaaacaa agggt t t gac agaaaccaag cat cgggagt gt t acgat t a gcgt t ct gt g gcaat cat t a gact t ggaag tagacaaaaa t cacaaact a acagacgaag t t t ct ccaac caat ct t ct g aaccaat ct c cagct caagg caaaagt gac t agggaaat g ttctct t t ca gaagct t t cc ct ccccat t g tcaaaaagaa cgt t t ct cag gaat at gacg t gct cat ct t ttct t t act c at t ct gct t g t agaaaat aa t agaaat aaa acaaacaaac at at caaagt aacct gacca t gt aat cact gt t gcat at t t at t gcccga accct t cct t t t ct t gat gt act agct gat t caact t t ga at cgaaacaa cat agat caa caact t cat a t t acat cgag accaacaaaa t aaaaccaat aat t aagacc t gat t t t gag ct cgct t ct c at cgaaaat t t cgat aaccc aat t gt gggt at at ct t t gc gt cct t ct gt t gact gaat t gt ct t gat t g ct at t t gcga t t t t gt t ct c aaacaat gt c ccct ccat ca cccacat t cc at at aacccc cagaact agt t ct cat t t gc gt agat caac accct at aca t ct gcggt t t aaagaact ac act t aagt t a gat gt aact t gt t aaccaag t gat cacaca caat agagac gt cgacgaaa caat t agacc aaacgaagac agcat t cacg ccgat t t ct a ct aaagt t ca at t t cgacat aaggctcgga t cat at at at t gat at t ct t gt t t t ccttg agagaaagat t aat t ct cca gaaaaagcca gagt t t ctt t gt t t aat agc act gcat at a gt agaaccga ct t aacgt cc agt t at t ct t t aat acat ca gagcat agaa t t caaat t cc gaccat aaac at ccat aaga aacaaagct t t ggaaat t ct t at ggat cat ct gat aaat c agaagaaaga at t acggt ag at cgt t ct ca at acaaat t c acagagcagc ttcgaaaaac gcgct t cgt g at t ct t t cac acacgt t at a t ct t t t t t t t t t t at aat ac t ct t gct gaa gacaaat gac ccgt at t gca t t ct t cat ga acaaat t aaa acctggcgcg t gcct at aag ccaact ct gt acat acat t t at ggt at ct t t aact aaaaa taaacaggag at caaaaaac aat cat t ggg t gagct at ac t aat cat act t aaacaagt a agaagt gaaa agacaggttt at gcat cgt t t at aat cgaa aggaaatggg at t aggggt a ct cccagt cg acggaaaaca cagat t gaag ttttgtgtgg cat t cgaaat act gcagt gc at gt t at ggc gt t caat gaa t ccaaaat gc gaaggt t t aa t t ggaagt gt t gagagct t t gt ggat cat g t gt aaagt aa t at act t cag cat acaaaac cagat at gac agaaact ttt accat gt t gt gt gt t aggat t cact acaag gat gaaaaat aat ct cacct caact ccacc caat at t ct c tgacaacgaa t t t at t ccaa t t t t agact t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 ttcacgttca cttacatgag ctagggtttt atttttttca gaggtcatat ataaatagct Page 87 ccat t cgt ct aaat t t cacg t cgt agct t a agcct t t agg t ggt gaaggt 12689250 Sequence Listing.txt tctaacaaag ctccggcgcc accattaaaa tctctcgatc tctatctgcg gtgagcttag tattactctc aattgttctc agttcagtat gtcttgattt gatgatctga ttttgtgtag atattttgat tcagcgagct ctatagtagt aatcgatctg ttaatgtatg tgttgttgat tggtagtatc atttttggtg aggt at t t ga 1800 1860 1920 1980 2000 <210> <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> agaagaaccc agt acat aaa aat t ggt t t g t caaat agt g t aaaaat gag acagaaacct t t t acct t t t t t cat gt t gt cacct gat gc cat at acct t ct at cat caa gt t gt cagt g tgagaagaac t aat gt cct a caaaggccag gt ct gaagcc t at at t aat c t ccggt t agt gt aaat gt t g t gagaat act t t acaagagt aat t ct t gac aaat ct gat g gggat t gat t acagat t t t a aggat t t t gc at aact t t cg gaat gat t t g acggccggct ct ct aaccaa t t t at ct gat cggt ct gaaa aat t cat gct aaaccccaaa acct t gt ct gct cagt t cc acagat t ct t caagt cggat t t t t gat cca gcct t ct gt g t t gt t gagat t at ggaat t g aaccact aat at t t t gcat a aaaat t gcat at gt at at aa aggagggaga at t gt t at t g cacaaaattt aggt t aacag at t t t act t g agt t gggt aa gt agaat aat at t gct t t t a t t cagat t t c at caat t ct g gct t t at cac ccagt t ct c t t gt ccat at ggt t ggt cga gggat ggat g cagcagcttt gtacacggag ttttgaaaca t ct ggat t t t t cggt caat c cact t aat ga t at at t t t t c atgaagagag t caccat at a t ggt t ct gt t t t at t gat cc at t ct aat t t t t at t agct t gaaat t aact aat ccggct c at gacgacgt gt t at at t ct t ccagaat ct gat ct ccagg cgt cat cat c cgt agcat t t aggagt gt t a aggat ct t aa ggaggtaaga gt t t gat t t t gagagatgt c acaagt cagc aggct t gt gt tcgaggcaag t t at t t agt t tgaaaaaaac t gat aat gaa act gt t t cca t ccagcaact t t ct t ct cct gaat at t acc t t t agt aaca at t acaagaa agat gat aac gaagcaaaag t ct t cgt ct t t cagat t ccc gaaggacaca t ct at cat t a t gagacgt ca ccaaagt at t t acat agt gg gaaccat acc t gggt gt t t t t cagct cgag at t ct ct t ac ct t t ggt aac aagagagagt aat t acgt t g aacaaaaaga agt t caagt g aat t gt agat agat t t agt t gt t gaat t gg aaacaggtgg cat aaaacaa aact gat at g aaat ct aat t aat t acaaaa caggagtgcg gacgct act c at ggccact g tcccgaaagg ct cagaggt a t cat t ct t at t cagct caga cat t t ggaca ct aacct t aa gcagct t ct a t at cat aact t t cct t t at a at at gt ggt t at gt at at at gat ct cat at aaggt caaca t t t gt t t ct g ggt aat gcaa at aacat aag at t gagagt t gct t t gt ct g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 agttggtcaa aaggtgttat gggttttaga aaattaataa Page 88 t acat gt cct t aaaaagt ct gt t gact t t t t ct acact t t gt gat gt t t g ct at at t gac at t t t gt t t c ataaagggga gt gagaagat t aaat t at t t t t t ct t t t ga t caat ccaca t ggat t t gt a t gt ct t t ct a at gaat ct aa agt caat aaa cccaat aaaa t ct ggaaaaa 12689250 Sequence gttttgggag tagtttaata gtcgtcgtct caggaaagat cttataactc tagaatctta tgtttacaaa ggaaggaagg tttttattca at gaaat t t c ctgatttttt tgtttgcaaa tccgtgattt ctatatgtta aaaaaggaaa gagagacgt g Li st i ng. txt t cgaat gat a gagat t cgt t t gt cat t gat at t acct t ag t t t acat gt a at aacaaat t gat ggat aac cct gt gggt a cgt t t acacg t ctaggagag gacgacct cc aacagt t t at cat agatt ag agtgagggga aat gcaat aa gt gggt ggt g 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 81 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 81 aagat t cat a t at t ct t aac t aaaat at t a t aaat at gcc gt ggagt at a aaagtt acaa t ct t gt t act aaat t t t gga t t t gagtgt a accgcatct c t t t agtcttg ct t agccaat cgagt t t t t g gtttcccgaa gt t cggattc atcgtcggcg aggatcgaac aagagt aat g tcggtaagaa aggaggagga gtt cat ct ga cgccacctga at aaaaat ca cacacaaaat cccct cagt g aaacacgaaa gt at acagat gt at agt t ac t gt aat t aac ttgagaatgc aaatt aacgg gcaacaacgt cct ct ct cca ggacct t gt c ccaacgctca tt cat ct caa agt t t gaagc cagaggtaga cagatgcaac gtaggcatag ggagaggaca t aat ct gcca acaatatttt ccacatt aca aat ct at ct c aat t t gat ca aaat aaaat t gtt agat cat cct gcaacgt atcaaaaacc at t t aggtga cgagcatt cc taatgaccaa cgagctgatt ggaat ccat t aatcaacggc cgcgggcaac cgtcgagacg aaacagctgt agcaacagaa ggacggcttt tgt t t ggat c t ct t t t t t ct cccaccacca t t at gt t t ac acacaaaat a aataccccaa acaaaagaaa cgt gtt t t t c aat aat t aat aaaat cct aa at cagt at ca gat ct t aat a at ct t t gct c ct t caccgt c cat cgt cgt c agtgt t t ccg ggcggtggcg gaaggaagcg tccgaccaag acgacaacag t ct cat aggt t ct gt aacct ct t aat t at t ttacacaaac t gcccaaat a at aat aaaac aaat aaaat t gaagataaca caaat cct cg aat ct cat at tt act gat aa agct t gt t ca ccgccgacac t gaaat t t cg gt ct ccact t t ct cccat cc tagtaaaaaa gagagagacg aggctgaaag aggaggaaga ct t ct acgt t tgt t gt t gaa ggtggctttt cacct ct ct g atgtcgaaaa ataacccaca aaacactcgc aat aacat gt t at ct acagt aat cctt ccg aaaagt t t ag ttttgtacca agctaacgcc ct aat ccaac cagcgagt ct t cacgct cgt cggt ct cat c cgagt t t cga cgccgtcggg tgggaagct t tgaagagaac t t t t ct t t gg tgggaggtgg gaagaaccgg acgat ct cct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 gataaatcgt atcaaggaag atcaaaacga aatcttcaaa atggagagag agatgctatg Page 89 12689250 Sequence Listing.txt atgatgatga ttgcaggcaa acttggtttc taagagattt tgagtttttg gaagagagag agagagaaga gct gagacgt gct gacgt gt t t at t agat t t ggcat gggc gccacgtggc cact t aaagc gt cgcct cct gt at gt gcac ct t t t gt t ct t caaaccct a gaatgt t t t t gaggaaggga t ccct ct t t t agt cgat t aa ct acagat t a t ct at t cgga cct caaaat c ccgt cat gac at at ct ct t c t t ct at t gaa gaaacgagga t t gt gt t gcg t t t cgt t t t a ttctct t ct t t t t at aact g gat gt t t agg gt t at t t acc tcagagaacg aat t gagt gc ct ccgt t t gg t t gat t t cgt t t gt ct ct t c ggcccgagac cct acat t ca at aat cat at ccaggtggac aat ggt acgc ct cgct gat c gt t aat cccg t cat gt act t at ct gt t t gc t ct ct cgt t a tcacgcgcgc t t gaaat at t t gagt t t gat t ct gt t t cga t caccggt t t aaat ccgt cg aaggt ct t cg gt t t ccgat t aagat gaaaa atggaagaag cgt t t t t gt t caaat t t cag acat ccaaag gcccat t act aacgat t cac t cct t caacc att caaccaa aat gt at t at gct cct ct ca 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 82 2000 DNA Arabidopsis thal i ana <400> 82 acaat t ct gc gct at agt t g aacgagaat a t t aaagat ct t acaagt aat cat aaat t ca cagat gcct a gcat gat gaa aaaacgat ac aaacacggt t caaat at gca ggat ccaaaa t at t t agt ca agt t acact a t t acaaat ac gaat gat aac ggct gt at ca cccaagt agt accacct at t ttccat t t t a ct cct aaacg tttgaaagca at accat aat gact t at t ga caat gat t ga t ct t t cat cc aaagaaaatt t gat gt t gt t t aat gt t t t g ct gcaacat c gaagct t aaa gagaat aaag tgagaaggca t gcaat t t t a aat t t t gt t a gcgt ct t acg gt caat ct ca aacct acacc t ct t gaggt t t gt t gaggt g at t t gt t t t g gact t gt t at agt cat aat a act agacaat agcat gat cg acaagt acaa t t t t t ct t t a gat gaagcaa gt aaacgt t t ggt ct agggg acaaaacaaa t aat at ct t c t cagt t ct ac t t ct ct agat aact t aaaac t at t gt cat t t aacgagagt gaat cgt agc at gtt t t t ct aagacgggt a gaaagat t cg at agt ct ct t ct aggt aagc ct ct t ct agt aaat ct caca t cact t agat cgat t t ggac gt t t cat aga gact t t at ag gaat gt caag acacaaat ca at t t cct aat ct t aaat aaa t at gt t t gca act cct aaac cat t at gt at gt t gt agaat ct ct t gt cat aaacgat ct a cgaagat at c aaat at t cac gct t gat gaa caat t gt cgg tttccaggag aaaggcct ag cgcact act t at t t gcct cg gat aaagt t a ttttttaaaa t t t t cat agc ct gat ct gga t ct gt t t gt c ct ct t gaggt t t cct t agt t cggggaatct t aaaaacgt c t t ct ccaat c cgat at aat t gcat gaagca t ct gaat cac acat ggaaga acaagagagt ctt gcaaaaa gt gccgt t aa t at cat t ct a t aaact at t c ccaat gacgg cgaaact gt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 12689250 Sequence Listing.txt gaatattttt ttgccaatgc tttcgaaact tttggttcat aaatccttta gaacat acaa t gagaaat t a accgat aggc at t aaaccat gaaaat t agg t cagaaccat cagacaat t a caacacaaaa ct t t t gcaca acaaaaaaaa aat t ccat at gt gt t ggagg gt ggaagct g t t cct ct caa t ct t at ct t c cgt gt t t ggc cacaat aggc t t t cgcgt gt at aat aaaat t t ct t gagt c at t t acgaat cat t t t t gt t accact agat ttaaaaaaaa cggaaat aaa aaaat aagat t gt at aagt a ccat ggct t c accaaaaaac t t ct ccgat a cacgccgagg tttttgaaaa cacgaaaaag gaaaacct ca at gact agac ct aat ccaaa t ct t t act ca cat t t t gat a aaaaaaccgt t at t t t aaca agccct aaaa t caaat aaaa t ct ct t t ct c aagaagaaga t t t agcact a t at t gat t t t agagt gaacc t cgaaat t cc gaccact gca at aacacaaa act at t cat c t at t t gt t t c t gaaat t at t aaggaaat ga taaaagaaga caccaaacca t ccccat t t a at caat act c aaaaaggct c ttaaaaacca aaact t t acg acaccaaaat cct gcaat t c acat aat cct cccccgtttt ct acccgaga t t aaaaat gg at t agt cat a at gaaaat ga ct t ct ct cct at accct cct aaat ct caat acgaaaaaaa at gcaaat gt caat t t t ct t ct aagt acaa aaaat aaaaa t gt gacgat g at gt agt aat at aaaaaaat t acgt t gaaa caaat gt at a t t agt aat aa at gaat gat a aggaagctt c t cct cgt t ct cat ct ct ct t 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 83 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 83 aaaccaaaca t t at at t t t c t t t t gaat ga at gt at t at c t aacat accc agat t t ccgc atcggtagt g agaaatcttt ttgggat t aa ccaatcacca acaccacgaa gacgtcgttt accgtctccg t ccggctt cc gagaat aat a acagaaaaaa t acaat t at c ccgaat aat a t t caaagt t t t aaagt gt t a aaat t t t gaa t ccat t cct a aagat cgcct accggatcgt ccaaaccatt t ct at acaat act t t aaaat ccgtctccac tcatcgccga gaaccgtaga t gt aaagat a t aagt aagt a at t t t at t t g t gcgat aat a t t ct at aat t accaaaccat t aaat at gt a ct t aaat t t c accgaacatt gaaacggagc ccacgtgaaa cctccaaaca cgccgcattt t t t ct t cct t agaggat aat tacggaaacc t aat gcat at aat aaaagt a tacaaaaggc t gat ctt at t gatcggtat a cacat acat c t aaat cccaa aaaccctaaa t ct acaaat c cacgtgcttg aatcacacac cgtgagtct c tggtagcttc aaagacgacg Page 91 agat aaaat t aat t gt t aac tgt t agtcga t aaaaaat t t aaaat t aat a gt gtt acaaa acagt t t t aa agacagtaaa cct gct ct ct ggacacgtca cat ct cacaa acgt cagct t acggcgggag gggtacatta caaagaagt c gt t t aatt ag t act t t aat a t aaaat t aac at aaaat t at acat aaat t t aaaaaccgt a atacccggag aat at cgaaa t ct ccct ct a gctt at aaaa gaact ct aac t aact ccgt c gtaatggct c cgaaacagag tcaacacgat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 12689250 Sequence Listing.txt t gat t ct gat acaact t gat aaat t gat t t tttttttttt att aacgcaa aat at t cgag t at t t cat t t aat gcgcagt at ggat gat t caaat ccct c aat acacaaa t at t t cccaa ggccat t ggc t cct aacgat t cgat t ggaa cggcaaat t a at ct gggt aa gat t t ggt t a tttttctagg cccat cact g gccatttttt gagt t tat t g t gt t at agt g t t t agt t at a ct aat t aaaa t at t at t t gt t t t gt t t gt t acagcagt aa tcaacgcagc at t t ggagga t ct t t at aat gaagaacaaa t cct t cact c ccct aggat c gt at gt gagc t t t cat t ggg ggt at t aat c gt t t cgaaaa t cgacat ct c t at t t t aaga aat t gt gggg t t aat t acga gt t at gcgga t t gt t aat t g cat gaagat a t t t t atctgg aaaagaaaaa caggcccagc t at aaat gt a at accct cga gt t t gaacct gaat caggt a t ct gagt t t t t t t t gat gat t t gt gaaat t aat ct at at t cggtgacgcc aat t aaat t g at gcat agt g t t agt gt gt g gaggaatgt g gaggt t aat c acacaaaact agtttttttt cacat at t aa cgaat t at gt aaat aacaaa gact t t ccaa t t t ct t cat c aact at t t ac t ct t t cacat t t ggggct t g t ct gt gt aat t ct ct gt t at gccgccacgt aatccccaca gct t at t gga at gt t ct t aa atgaactggt gcgggaagga at aaacaagt aact aaat t t gt t t cagt t a ct act cagcc gacaaaact a agctaaaggg t ct t t ccttt cat ct ct ct c t ct t ctcttt t t aggat ct a aaaccctaga tgtat t t t gg tggt t t aggt aaat t t t gt g at at ggat t t gaaaaactga ttaagaggag tatgagt t t t aaatt aagct ct at t t gat g aggcccagcc aaaat gat ga cggtt ccaac t t gcct caga cgt cgct t ct ct ct t t agga t ccct t t aat t t t cagctt a agaagatat t ttgtgaaggt 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 84 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 84 aagcaacgag ggtcgtcagc at ccacaat t ttgttgaaaa tttttttttt t aat t aaaat t aat t aaat t gctgggtcaa ct t ct caat g ct t t t caat a gggaagtcgt aaat t agcat tcactgtgag ccacat aat g aactt ct gat gacacactat aacgt caat a at t t cct aac gt t t t ct t ct t aaat t t gat t aaaaat t aa ggtaccggag ct t t t t t t ca gcaacacat g gt t agcgat a act acgacat t t aagaaact aaat cat at t cagt t t aagt acgtagaaaa tttatcagaa t aat t t gact agagtgcggc t cat at gt gt aat ccaatt a t t cat at aaa gcaaatt aga act t aagagt t t gt t t aaat t t cgt aat gc gact t t t t gt at aat aaccc t gat at caat t ctt cct at g at gt at at ca gtt ct caaaa ct ct ct t cat agcagt att a t aagagt t aa tat t t ccct t aaaat ggaat att gatt ccc aaaaaaaat a cggaaatt ac gct t ggaat g at gat cgaac t t t atccat t aat gat t t t t agaaactt ac gactgaagct tttgggcaaa ct ct aat gt t caaagccatt ct ccaaat gt aagagagtat cagcaagtt c 120 180 240 300 360 420 480 540 600 660 Page 92 12689250 Sequence Listing.txt tgatgacgaa tgctgagagg aggaggaggc atctaagtga ttcggacatt gt aaaggagc t t t ggat ccg agt ggt t gt t t at t t gct t t aagacaaat g ct cat gt gat cgt ggt cgaa at ccgggt t t gt t t t gct ct at ggcct ct a agat t gt t gt t acgat t gt g gggaagct aa t t cat t ct ct gagct acggt gact t t gt ct ct agt t aaaa at aat t aat a ttaaacgaga aaaat t cgac caccgt cacc t t ct gat ct g tgaaaaaaca ct t agt t cct t gct aat ct g gt ct at t t gg t cgagat cgg agt caggaca acccgaggaa agcgt ct gct gaacat t gga cat t gct t gg gggt t t t t t g ccat gact t c ccgt ct t gct t agt t t ggt t t t cacgt caa ccct t ggct t agaagttttt at cat acaaa t cacgt ggaa gaaccaattt act ct ct ct t agaaaaaaaa at cat acat a gcgggt aagc ggt gat aggc t gggcct gga gt gcgt t t ct ct gagt gggg ggt t ggt gga ggt ggt gt gc gt t t gt t cag gagcgacgag aagat aggga at at caaggg gacgggt t ag ccggat t ct t gt gcgaat gt acccaaaaaa t ct t aact ct at gat t agt c gcgagt acac gcct ccgt t c t t ct t cgt ga gccgagagtt ggaagact t g aat ggcggt g t aaaggat ct gt cat ggaga aact t aat ac t acgagat ga cacct t t ggc t cacacggt t t aaacgaggt act ggagagt ccaact at gc t acgt t t t at aat t t t ct t t aaaaat cgaa t aaacacaat gaacaaat aa accacgcgcc acat gcgat g t t gt cct gct ct t t acacaa t gagagt act t ggcaacat t ggct agggag acgggt ggaa ggat ggggct ggaaggagct agagct at gg agagat t gag acat ccgct g ccgt at t t ca at t t t cgt t a t t t gt t ggat t t cat t aat a agt t acaaag t at t agact a aat at cgcat gat acagaga cat ct t ct ct t gccaaat ct at ggaaggca t cgt t at t gg t ggt caact t t t t ggt gt t c acgt ccat gg agact gat cg t ccagaggt a tggagaggag ggagt gt act gt ggact cag t ct t t cct gg cacgt t t at a ccat t aggt t gacact agt g aat ct t ggga aacgat t t at ct aaacat t c cct t t ccaat gt gagat t t c ct t aaat at c 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 cctctttaga ttctctctct acagagaact tgattatttt <210> <211> 2000 <212> DNA <213> Arabi dopsi s tha i ana <400> ccaaccttgt cgaaattgaa aagacggtct ttgacttctc ttctcatttc attacaagag t ct t cact t a at ggt t t gca t cgt agaat c t cgagacggt tagagaaaga at caggt cgt gcaggt gat g gaat cact cc t act ccact t atcttcgatt tctgtgaaat caaattcaaa gaatcccaga gat t cgaat t ct aaat aaga gagatcgaaa gagagacttc t t t t caagt t act t gct t at at gact ct ag act t gaacat ct t ct ct cga t t cct t gat a t t ccct aat c t t cact t aca tgagagagaa Page 93 t agcaat agt gaacact aga t gact t ct t t gacgaggagt ccacct ct gg aat t gagaaa ggaaacagag tttgaagaac at gaagaaca t t t ct gct ct at t at ct ct a tagaaacggt cgt aaacgac ggt t ggat ca ccgaat cct g aaaaaaagat 120 180 240 300 360 420 480 12689250 Sequence Listing.txt t agggt aagt cact acct t t cagt at t cat caaaaaat t g aaaaaaaaag ccaat acaat t t ggaat cag cat t gat t t g at gagagt at t t gtgaagac gagagt t aat aacagacaac ct ccaaact t gggct t t t at gcaat t gaaa aat aaaaaat agagagggaa aaaaaat t aa aaccct aat a acct cct t ca t cct cgt t ag t caat t ct ct cgat cgt aaa agt at t t aat ct aaat t t ct cat t gcct ga aaccgttggg agtggaagca gatccaaatt cgaattacta gagagctcaa aat t t caaaa at at t gct ag t t ccaat aat gaat t gt aaa cat at at at g cact at t gga acaaaat t ag t t aat t cagt caat at at at gacat t ct t g aacat cct t t gggct act cc t aat aggt t t aaact t t t gg aaagcaacca at t aaaaaaa acgcaaaat a aaccct gacg t t t t t t ct ct t ct caggt at t agat t cat t gt acggat ct t ggaat t agt t act at t t gt ggcat ct aat gt t t t t t t aa at t at t t at t ccaacaact t t ct act act t t gagat ggat gat ggat aaa agagaaagaa ct ccaact gt t aacaat ct g cat t at aaag aat ct t t t ct aaat t gggct aat t act t t c ctcctttttt t att cgaaaa aaaaaaaaaa gt t t ggt cgc ccgt t t ct t c ct ct ct caaa gct t ct t cga t ct at acaca t t gat aaacc cagagat ct a t aat gaat ca aagt cgcaaa t acaggt at t t at gat t t gt tctgat t t t a t t act act t c t t aaaaaaat aagaaaacac t ct at gt gt g t at acat t at gagct t t t aa t agt ct at gg t ct ct act t t t ct ct caggt aaat t t t act gaaaagaaaa acgcaacagt t gt aact t t t t t t t accttc ct cgt ct ct a t t ct cat ct a aagt t t ccac ct aat ct t ct t t t gt t gat t at at t cat t t t at ct t ct t c at gat at t aa aaacat ggaa t t ct t aaat a tttttttttg gat cat t agc aaaagaattt at t aat ccaa acact aacgt gcgt gagt t c gct act ccaa aaat ct t ct a ct t t ct t t t a gaacgaaat t at gagcgat g gt cggaaaaa t t gt t t at t t ct t ct ct cgc aaact cgt cc t cat caacaa ct t t t t ctat t cgt t cgat t t cgcagt gac gat t acaggt t t t t t t gt ct at at t t cagt aaaagggaaa aacact t t t g gggaaagaat t cat t at t at gt aat t gaat at gt t gt aac accgt t gt aa aagacgggt a act t gggct a ct aact at t t act agt t t gg at cct aaaac ccggcaaaaa t aaagct gac t t t t ct agcc t caccgt cgc gt cgat t aat cagt gct t t c ggt t t gct t t gt at at gt t c ttacaaaacc t t gat agaat 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 86 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 86 ggagagaatt tcttctttat ctcctttaga cgtcgaactg cgaggtcatt gcccagtcct ctacagt t cc aagcggctgt cctcatggag gaagaggggg ttctgggccc accgcgcttc tccccacgtc attcggctga accgtctctc cggagcaacc actgctttcc ccagcctcat ggatcaccat cttagaaggc tcgtgttctt catcttcttc gatggtgtcc tctctcttgt Page 94 120 180 240 12689250 Sequence Listing.txt gtttgcccct atccgctgcc acatcctcac gaacatagtc tctgt t cact t t t t catct c cggct t cat g t t t at t t t gt t aaat ct gga t t t t t agttt aat t t t gt gc t gaaaggcat aacaaaacaa at t t at aat c ttgaaagaca aaat cgagaa gtat t t t t ct t cgt acat at t t t t at at t t gcat t aaaaa aaat at t t t t t gt cgcaccg aat aat t ct t taggaagcga aagaagccat ct t gccaat a t caaat t aaa cat ct ccct a cat t aaact g t ct gaacat g t at t at t at t t t t aat t caa at t gaaaat t ggcacat aaa t ccat ggt ac at gggcat t c t t t t aat gt a t at aaccct a t t t agaaaat ct cacaagt c aagat at t t t cagct at cat t t t act t t at t at gaagact gt t t t caaaa ct aaat at t t gt t t acaaac t gt t at aaaa aat t at aggt aaaaaaat at ct t acccct c aaaaaccgga gaactaggga t ct ct aagaa t at ct t t t t c gggggatgag caaacaagt t acat ct t agt gct caat t gt t t t agt t ccc acat acacca act t at aaga cagat t gaga gt ggagt aac at cact aat g tttttttttt t aaat t at aa aaaaat ct at acgagat t ac agt t act t t t ggat t t t t at t at cact t ct ttaacccaaa ttttaaacga gacaagattt aat gt ccgct t t t t t t gt at ccat t ct aaa at gt ccct t a t at t t aaacc ct aagt aat g agaaaactt g aat t t accat aaaagttttt ct t t ccggat cat cat aaac gt t t aaagac gact cct ct c ct t aaat at c ttccaaagag gacat ggct c agagt t at t t t t t t gccttt gat at act cg t ct aaaat t a t t agct cgga at gct t t aca gacat cat ca at t t t gt t t t aagct aagt a t gt caaccag ttttaccccc caacaat t t c gat aat t t ag t t t t at t t t t at t gt t aaaa ggcactgt t t aaaat t t t aa aagcaccaca t aagat t gac t t t t cat aac gat act t aga cagt cgt agg at t t t t ct t g aagaaaattt cacat aat aa acct t t gat a ccaat cct gg t at t t at t at t acagt t cct t ccacat aat gagagaaaat agggt t t t gg at t t acat t a at t t acgt gt aat gagt t ca ttgtct t t ga t at t t t t t at gcat t agct a at gggt gt ag ttaacaaaaa gt gt t at at c cat gggt t t g acaaat t aaa at t acaaaaa tt agt caaag t at gcact t t t cat at t aag cct gagt t ga gt ccaaagac gt gt t ct aaa aaaggact t g ttactgt t t g gt t t t ct t t a ccat t at t t g aaat ct t cca t gt ccacgt a aat ccagat t at t at caagt caagt gagac gct ct acgt g ct agagt aca gt cat t t gag agat ct t t t c t gaaact t t g t t at aaaaca aaacccatt t gat aat t t t t aaaaaacaat t aat t t t t aa gat t ct t ggc cat t at agaa t gaagaat ac agat cgaaga at t t aaaaca act t t cat ct gt caaaat cg t t t cacgat c acat at at ct gggaggact a tt gt t t t ct t ct aaaat at t 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 tttattatta acatgtttta gattgaaaat tgctcataca <210> 87 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 87 gtcaaaagaa gaaaaatgtt gacaaatgtt acgaatttgg tttaaatcaa aagtgaatta Page 12689250 Sequence Listing.txt tccgagaaag atgatattta cccccaaaat at ct aaacaa at act t aaaa at gcaat t ct ct aat agact tgtacacaca tctccaaaca ttcgtcgcca ct t ct t ctat at cgact ggc t t at t aagt a cat gcat gag tagggatggg agggtaaat g gaggtgattg at cgat t ct c t ct aat at t c aagtct t t gg taaaagaaaa caagt t aaca t t t at t t t at t t aaaact ct at t act aat t tgt t t gattt aat t t t at t t gagt t aagt c gct t ggaaca t att gtt gac act cagt cag t ct t t t t t t t agaccaaaaa acaagaaaat t cacacct ca t t t t ct t t t a at agt gccca ct ct ct ct at at caaat at c cacacataac t gaagt t ct c caggt t cgt t tttcat catt ct t t ccattt gcaat aat at aacatgaaac t ccagt cacg cgtgagcgat ccgtggattc at gcat gact cgtacgacaa t t t gtgtgt t taaagggttc at at gaagt a t t t at t aat t aacacaatca t ct att gt aa t t t t t t ctgc tgct t gatga t t gaat ct aa tacgaacaaa aaaaaagcaa tgtggactcc t gt t at t t t a t t acat t t ac gt gaaaaaat gtggcagaaa gt aaaagtt c ct t cct cat t ccct ct caaa ccgat acct c act t at t t cc t at gcgt t t g t act cgt t ac t ggt cat at a at acgat t t t caaat t ct aa aaagct at at gtcgaggcac acaaactgcg cgtcgtcgt t caat cgt cga cat at cgt aa ttcagt t t t a gaat at at gt t aaaaacgt t ct t t cat t gt t t t t at t t at cagt t t t cat tt agt t t t t t aaat at t t ca aat accaat g caaaaat t gt aaaaaagtgc t cat t at ct g aat aaaat aa tacagtgtcg caggaaaaat t aaaaagt ga aatt at ct cg agcat t ct ac t t t at at agt aact ct gt ct atctcagccg cat at at ct a t at at at at g agt t at at ag cat t aact t t at agt ctt aa gcacgtgtgc aaaacgtgt g gct t ct gcac t ct at t gt gt cgt gt at gcg at gt aacgt t t ct t gt t gca t t t ct t aaat ggt t ctctgt aaagt t at ga tcatgat t t t ct t t aact t g acaaaatcag aat cggcgat tttcgcgaga t gcgt t t at a act aact caa aaatt gcttt gtcaagcaaa aagaacaaga at act gt at a agcgtt ctt a agt ct gcaca gccat ctt ca ct ct caat t t tt ct t t t ct t act t t gcaca cacat at t t a tt agccat aa caagat t agt accaagt t t c gtcgcagagc ccacaacgaa cagaaat t gc t t t gt t act t tgt t t agtgt acat caaat a aat at gccag tatggatgt c tttttttttt ttctaacaca t t t cgct ct a at t t t ct t ct at gt gat t t a gatggaagt c tggctagcac ataccaaaaa t t at t t act t aat ctt gt ca t aaat cgaca aaaaaaacga at ggaat aaa t acct t t cca ct t at cact t t cct t agt aa t ct t t ct ct t ggt gat gat a ccacccat gt agt ggt t ct t at ggat gcct t aat at gt at t t t t gt act c caaagat t ca ggctt ccccg t gat ccat ca ct t t ccgt cg gtt ctt gaat at ggct t t t a aat t t t aat t t gt aaaaaat at gt ct at ac caaat agt aa tttttttttt ct ct t t t t t t t gcaat cat a taggagcaga aagtgtagt a t ct at cat cg at t t t at agg ct gccat t t a aaat t gcaaa caagt cgt cg aagat at t cg gtctct t t at ct t t cgaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 Page 96 <210> 88 <211> 200( <212> DNA <213> Aral <400> 88 t t ggat caaa tt t gagcat a gtt gcct gt aat t caacac ccaat gcaga at ct gagaga ctt gaacaac gcaaaagttt gaaat ct t ac ggcgggaacc cat cggcaag gt gt gaaaaa t gatt caat a t t acaagt t t gaaagtt cca t t t aaaact a gcccatt act t gtt gacttt t agcct agt t t gaat t t aat t t ggctcaag at gggt aaat aaaggat ct t ggcct t gt gc t t gact t t ca t ct cat t gaa aaaggt t gag t t ct t agggc cggatttttt gt t t caggcg t gct t cct ct 12689250 Sequence Listing.txt 0 bi dopsi s t hal i ana agt agcggt t acat ccaat t gcct t gt cct caaaat t ct a tccaagaagc cgaagct cgt t ct t t cct ac cgaaat t cac cggat cat t a ggcccagacg gt gat caact t t ggat aaat agaaagaatt gaacagaaag agt at t t at c at caagct ct acact ct cgt agt aaat cac gt aaat t ct t at t t t agagg aagt ct t gaa t aat t gt t ct at ccaaaaca gacgcat gac cat aaaagct aacact gaga t agagat tag gat t ct t ct c t cagt ct t ct aaat at ggt g t gact t t at c gt ct caaggc at ct gcat ct t t aat t gcag at t aaaccac ct ggt t gat c t aat caact t t cat cacagc at cggaaat g gcgacgaggc at ct t cat at ct ct ccat t g caaagaaaga cgtt acagaa agagact aat at t t at at cg ct ct aat cat gggct gt aga t ccaat at aa ttt act cgaa act t agat ac t t t t aacaat tttaaaaaac aaaaaaagag t ct t t ct cca acaat aagag tccttctttt gcaaaaacaa t t t ggt cat t ccgt ct cgt c gt gat ct ccc gcgat t t cga ct t gaacat t t agt agct aa gcacaaacat ccaagggaaa cat gaaagat ccaagcgt ga t ct t t t gct t tcgccggaaa gagacat acg cgt cgacgt t tt ct ct ct at gaagccact a aaaacagagg gt aat aaagg gat t ct aat g ct acacat ca t aat t t gct c t aaagcct at agt at t accc t t t t t ct t aa t aaaaacat a t t aaat t aat ttaaagaaag t aaat cggca gt gaaacgct t t t ggt t ct c aat t ct cgat t ggcgaat t t gt t accggcg t gaagat t t t t t acgt gt ag agagccat ga agacaaccac t ccct at cca t t gt aaat t t ct caat t t ga acact t t t cc t t ct caaaga aacagagaaa at t t gaaaca gaact ct gct t gt ct cagat aaaaagct ct at t t t gt t ct taacaaaaaa aaagacgt ga t agt t gccat agct act t t t t t gcagt aat aggactagt t gct caaagag at t t t ct t t t t t gat act t a gat aat cct a aaacacgt gt t gt cagt t t a ct ct ct t aaa ccccaaattt gat t t at at c t t t gat cgga gaat t caaaa act t t ct t ct ccaagct cag t t t t gtt gt t ccacaaaacc t gt gt t at ac gt cct t cct c t caacgt gt t cct t aacat g ggaggaagga tcgaaagcgt t t gt t cct ct t ct cgagt gt t acct t t ggg at t t t t agga at t ggt aat a t aagat t t gg t agcccat t a gattttttt t t t ct at t t t g tt gt t t gtt t aact t t at t t cat t t caaat aaaat t aaag at at t at ccg caacagagca gcaaaaaccc gagaaaacct cgaaaact t g cat cggt gat t cat cgt t t t t t t ct t cggt t ggct t t gt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 97 12689250 Sequence Listing.txt ttcgaatcaa t t aggat t gt gaataatcgt gtctgtgtat ataagactgg tgat t gatta ttagggcatc ggttggcagg agaatagatt gttttttttt ttggtgtgtg ttggatcttt agat ct t t ga ct gaggaat a 1920 1980 2000 <210> <211> <212> <213> 89 2000 DNA Arabidopsis thal i ana <400> 89 t gcaat acag t gact t act t cact t gct t g acccaat t t a cgt at cat t c aacaagt t gt tggatccaaa at caat caca cgaat t caaa t gt t gt aggt gat gt cat t g cat t gggggt gcat t ct t gt t gaagt t gt g ggt t gt t gt c aaacgaattt at acct at ac ctagcagcag t t at t t aat c ctatccgaaa gacat gct aa at gagaaagt aat aat t t cg at ccat t cgg t at t cat t ca gt at ccact c t t cggt cat t gt ct t ggacc cggt caagct ct gcaat at g cagaataccc cctacacaca aggt t t acac at gt t ccaat at gacaacat t t gt t t gt ag gaaggcagt g cct gt gt t t g tcggagcacc cat gt cagcc acgcact t t g agat t caggt t gt t t ggt ca cat gt t gt ct cat act t acg t aaaaat aaa t at t t t gat a t gt ct ggt t a t cact t t ggc ct cgaaacct t t t t agt at a gt t at t t t ga ggt t at t at a t t t ggat at t tcgcagctca ggt t t t ct gt t t gt t cat gt acaggaccga t ct t t t ggag tgt t ccacaa t gat t agagt aaat aggt gt gat aat agt t aacaat caat ggct t gct t c agct cgt gaa accctgat t t agagagt t t c ttaaaaacga gat aat gt at t gcaat agaa gt t at t t t cg t aat at t t t g cgt agccat t aagagtaacc t gcat gacct t t aat ct t t t t at t cat t cg at at t t gt t t ct t t t aat at t t cagt t at t gt t t cccgag t gcat ggt at ct ct aaaaat cactgaagga gtaaaccaaa ct t acact at gacat gt ggc gt t cat t aga t ggt agat ag gagcct t acg at acaagt t c ct ct ct gt t c cct ct t ct t c aaaaagaaac gt t gat t t t t gt gaaccgt g act t t t ggag aat at t t t t a aaaat at at t cggt t t cgat acagat t gcc t ccct t aggc t t agaaaat c gt t at cgggt cggt t ct gat t t t cggat ag t t cggat agt ct aat gacgc t gt t gt t t gt t cat gt gat a cct ggat gct act ct t t t at at ct aat cag t agagt agca agaaatagag aact at t gac caaccaaggg agaggt t cat caagccgct g t gccgt cgt t t caaaact gt at gt ct gt aa ggct t acct c t aaaaat gt g at at t t at at t at at t t cag t t t t cggt t a at at ct cat a t at at at ccc caat aact t g t t t t cggt t a t ct t t cggt t t t cgat t at t t caat t at t t t gagaagct g at t t t gcat g ct t t gcaggt t gt t t ct t ga t t atcaaaac aat act t gag t cat t t gact act gt t ggt g aacgt t t ggc agagcgagaa tatggacacc acaaatggct gagt agggt a t gat at ggag gt t at agat a gt gt t gt t gt aat gt t caat at at t t t t aa at at t t t t gg t agaat t t gg t gt t ct ct cc aaacagctat gcgt t cggt t t aaagt t t ag t t aaaat t t a t t gggagat t t aaat at t t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 ataaatattt tcaactattt ttaattattt aatctagaaa gaaataatat ttttgaaaat Page 98 gt gt t t at at aat t t agt at ccat aggt t a gat aat t ccg agt ct ctt ct gtt gat ct gc 12689250 Sequence ttcggatatt tttgatacgt agccat t cga ccttgggctt tagaaaccat gggcttttta tttagttaac ctaatttgag aagcatatat cctctgagaa attgcgttcg attacagct c ctccactcat catctatcga tttatcgtcg aggat t cgga Li st i ng. t xt tttcagtttt tcgattatag agtaaacggg ctcctgaggc at aaaact ca catcgccgct cacagaggtt agtgtcgtcg ttaatcattg aatctgggt t 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 2000 DNA Arabi dopsi s t hal i ana <400> accagagt cc aaaaaat t ga t cgt aaaaga t gt cct aaca cat t cct t aa ct t t t t t t ca at t act at ca aat aacaat t t t ct aacaat at cat agat g cagt t t cat a at ct at aat c aaat at t at g t ggatt ggac gt at t t gt ct t gt t at t t t g agat gct agt at ct at at at aat aat at ac gat t caaagt t at aagt aaa t t t gact t t t cgaaat t t t c act aaact aa ct acat at ac agagat ct t c agaaat acat aagt agat t g t accagt ct a t ct ct aact t t at at ct gga t aaaaaat gc t t t t t at t gt ct agt ggat a cat cat cat c agctt gat at agt t t t ct aa aaagt acaaa gct cagaagc gagagggttt gat at accat t t at cat at g cactt gtt ca at t t at t aag t aaaact t t t tttttttttt gacggatt cc at ct gat gaa t t gt at ct gt cgt aat at ag gt gt at t t ga gt at cat t t g ct t gat gagt t t gt at t t t a t t t aaat t gc at t gt ct gct t t ct t ct gag t at cat cgac at at t at aga t aact t at t c t t t gat t cgg ccgt gggt t a t agct t gccc t t gcat t aaa caccaaaatt act cagact a cgt t t t aacg aat t t acgt t ttcgacgaaa t aat t at t t t aat gat caaa at at t t t t ga gcgaat t t at at gaact aat caagat ggt a caaaat at gt ct ct cct ct t t gaaat gt t c cat accgt at cagaagct ag ggt t t t t gca gacgaact cg acat at acct t aaagt t caa ttttcgccgg gt at t agt aa at at t at gt g t at gt t ct t g agaat t gcat t cat gagacg tttgaaaaac ct t gaaaagt t t t ct gat ga t t gacgat aa aat t cgt caa cggct t t cca agtt gtt acc t t gat gt ccc cat agaccgc cact t aacat tttgat gagc at agact t t a tcacgagaca ct t gcccat a t t aaat at gt agtt gctt aa t t at cat at g at cagt t t t a gt t aacct ga cat ccaat t t tt ct ccaaag t gt t gacat t t at cat catt catttttttt at gat t t t gc gat t at act g t aaat aaaag at t t t t cgt t t aat t t ccaa accaaaat at t aagct acaa at t at gt t t c t aaaccat ca t aat gtt ct t t agt ct cgca ttcaaggcac at t t at at gt tt gt gt gtt c t at t gt t gac t aat t at t t c t at ct caaaa act t t t aaga aat cggaat c gggctt gccc aat t gt t t at at t gat t at t t t at at at ca aagt at t at g tact t t t t ac t at at at aac gt cggat at a gaaaaat cat cgat at t ct g t t cgt t gt ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 99 ct t gt caaaa cggt aat at a aaat t t t caa t aacaat t at t gatt cggt t aaat t ct aat gatt ctt aac gacccat at c agaat aaaaa act t cgccgt at ccgt t aga acgt gtt aaa t cat at at at t t agt t at t c agcat ccgga at gat t t ggt t cggt ct t t t aaat aaagat gggt t cat t t cgagagcat c 12689250 Sequence at act aagca act t t t cgac aat at gat aa aaaaaaaaac ct at t at t ca t at at t t cat tggtatcgtg taattatatt taaatctggg ttgggctttt tagcatccgg at t agtctag gtatgggttt gaacaattac ccactt aggc ccatt agggt tcctcactct caagttggat Li st i ng. t xt agat t t t cag t t gat gaat c t cat t t t at t cat at gat t t t aactt ggt t tttggtaggc t acaccattt tagaacaaac ct caaaaccc caaaaat at t t act aaaact at t t t t ctct at t ct gat at ttt ct aagaa ct gcct t t gt agatt ctt ct at gaggtt gc t aat at ct ga 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 91 2000 DNA Arabidopsis thal i ana <400> 91 agaagaat ct tttttttttt ttcagacaaa ttgcaaagga aat ct ct gga ct gaat t t ca t at gtt gt ct t caaagt t ac at t ggt t t t a t at t aaaaat gat gat gaaa t agaaat at t at t gaaat t t ct t t cacgat gt gacat ct t ct at caatt a t cgt at t t ct t ggcat t t t t cat gagat t t tgaacgaagc ggaagacgca t t t ggt t gga agaaactttt agaacacgac aaacaagt ga gaaaaat t t c caggt t t t gt at gaaat aaa aaat at t tag t t t at at at t gt t gt t gttg aatt gt cat t t aat aaat ga agaagat gaa ttgatgggga at aagt at ac t at gt gt gaa tt ctt ct t t t t t gct aat ca at at t ct gt c at gaaggaac at t gaaat t a t ggt at at t t at aagcccca ct actt ct ct gacgt aat t t act acact ac t agt t agat t at aaaaaaaa t t t t acaact ct agact cca ct t ct ct ggc at aat gaaag gaagt t gt ag agt gct gat g t at t t t at t t aagaacacga t t t cgt ggt t t t ct ccaat a aacgcgt agt ttgttctggg cct acat cct cct ggaaact at aat t gaat gaccacgcat ct t t gcaat t t caaat at gt gt att gcaga t at aaat t at t at t t aggt g gatgaagaga t ccaat ggt t cagaaaat aa aaaat gt cca tttgggagag aaaaaaat ca t t cct aat t t t t t aggaat t at ct ggt gt a agcct act ag acagcaaaat at cgt cgt t a t cggt gcct a ggagaaaaga agggggaaag caaggaaaat ttaagacaag at ccaat t t g t att agaact agt ct t at ct acaaat gt ga at t caaat aa t aagagt at c accaat aacg gt ccacagac gaacatt at c ct gct ggtt a t gt aat gct a gcat gt gaaa gaatt gat at ct gt at gt ct at aat t aat a aacacggaag at t t t ct aag gaat t t cgt c t t gcct caaa t t act aat ac t aaaat t aga t aaaat at t c agt t acat ac t gat t t ggac gt at at gat a at gtt ggaat caacat at t g aact caacaa act agcaat a gat agat t ca at cagt agt a cacgt acaac t at t t t t t gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 gtgaaacaca cgtgttttgt gtgcacaaac ttaattaggc tagtgagtcg atcgtgacat Page 100 12689250 Sequence Listing.txt cat at t t gaa t t t cacat aa aggcgat gt t gt t aagaaaa cat ct aaaag ct aaaagt t a aaacct aat g cgcgt t gcct aagt t aaaat t t aacat t aa cgaat aagt a aacacgt t ag t t gcaagt t t t t at aat gt a t gaaccat ga aagaaaagac t t gt t gt t t g t t act t act t at ggacat t t aat t t t t ct g at t ggaacgt aaaaaaaaaa at at t t cat a gt t aat t aaa gaaacacaaa at ggat gt t c gt cagcagt c t cgt ct t cct ct at at aaat gt t ct ct t gt cat at gat ac t aact t t t ct t at agt aaca aaaaaaaaca gcgt t t t aat aagt ggt t t a t gaggat aga aaagaaaat g agt at ccat t at aaat acct cct ccaat t t t t cat cgat g t gt gat ggca t t t act at ca t ggacgaagt acaagt t aat acat t gt t t a at at gat ccg t at aaccaga at t aaaaaat t caaacat ct ct caat at t c t t t agt acaa t t gaaaaact accat ggacg t ccat gct cg t agaacat t t at t aagat ag agt aaat at c t aaaat agac agagaaaaac ccct t t ct t c t aaacaaat t act at agcga aaat t ct at t aagt t t aagt ct gaagacat t t agagact t aaagcggcca aaat cat at t gaaagggatt aaacact aaa acgcct ct t a at at at t t t c 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 92 <211> 2000 <212> DNA <213> Arabi dopsi s tha i ana <400> 92 gt t at t t gaa cgact t t gga aacagl ataagcgttt accaagaaag agaat gaaaaaaaga ggaaaat aaa t gaaa ctaaacaagt ggtaaagagc aaagc t aaaccct aa at cgat ccat gt ct t gcgacgaaga t gaaat ggag act cc~ aagaagaaga t gat gat at g t ct t ct gagtttattg agtaaaaacg aaattl taaagccaaa ccacaaaaaa aaatct acaact act c aaggt acat c acgt a aaaattcgtt ttgtcataat cttaa gacaacgtta at t gact cat acacfl agagtcacaa gaagagtacg aaaag gcaagagatg tgaaaaatct agaga ggt t t cagga t ggacat ggt at acc( tcagaacaag ctgcgagaaa gccga gact aaat ac tgtgaagaaa gt cact :aata tcatttatat actactatta agatttaaga at t ca agaga aaaac agaaa aat t a aat a at ag ct t c agat a ct ct gcaca aat ca gt gg caaca gt t t aagt gt cct ct caa gagacgcttt ct t t t agt t t cgat agaaat gagaagaagg acccat cagc gagat aat t t at ct ccct ca t t gacaat t t at t t gat ct a at gt t gaaag aagt aaaact t t gt gaact t cccct caagg aat at gaaac t t at t gaaga at gggaaat c gt t ct at at a caat gt ct aa t ccgct gaag tgaagaagac tgcaacagag t t cct t cgt c aaact t gaaa t t t at t t t t a t gt gt gt agc t gt t t caaag ccgaaaaaag t gat t cccct tttgaagagg at t agct cct caat gaacaa gaaacgt cat taaaggaacc at t ct at t ca at at ct t caa t ct t cgt cgg aagcaaaaga at t t caacca t at ggt gaaa t cat t gcat g gaaaaccaat t gagat at ag t ct t t t gaat at t gt gcgt t gt t t t gat cg aaacgaaaga ttcaaagaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 101 12689250 Sequence Listing.txt tgaagattga agaggtttct tttaccgcac ccaaacagct gaaaaggaaa t aat gat gt t gagacagaat t cct t act t a ct t cat gt ct t agt ct ct t t t ct gacgat t gaaaccaacg t gcat t caaa t gt t t gagcc ggcttttttt aaat act caa at acagaat t acaaaagcct acggcgtcgt cat caat caa ct aaat caaa at agt t gt t g aaacct t aac t at t t t t t t c t t aaat t gt t cgt t t aggag aaaacctttt cacagt at gg agact t t t t c gaccat gt t c t t t at t t ct a acat ggaagt gagaaacaga ctgacggaga t t agaat cag t ct cgt ct cc agt cgagat c at at t aaaac t t gt t gat ct ttttcagccg t at t gt ct t c at act gt t t c at aaact t t g aat caaagt t ggt gt t t t ac t acat at gat t cgat ct t t t ggaat t caaa gagat gaaaa aggaggcttt aaaagacat t t ggaaaacat t t gaaaaat c t t t caagt t t t t t gggtgcg aat aaact t a at aat aaaca aagggt t t t g caaacacat c t t cgaat ct c gaacaaaact tttttttacc aat acaact a at gccaagag t aggt gt t ac t ct t t at ggt tagggagcct aacaagt t aa t gt t at cggt acaaaagaaa gcaact t cct t cggct t t ct ggt at t acca t ccaggt t ct t t t gcat t gg caagcact ag t at t gat aat aagat ct gt t t gt gaacaaa ccaaacaaaa cact t gat t c ct cagat cct aagt t ct aaa ggaaact aaa aact acaaca cct ct t caat t cacagt ct t cgt agct t gt t gcccat ccc t cacat ct ct at ct t aat aa cgat t at t aa gt t gat gt t g t t ct t agt ac agt ccacaaa cgcacacaat t ct ct t cct t caagaaaacc 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 93 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 93 tatatat t ga ctcaacatat catgal ctgttattgt ggattcattg ttagcl atttactatg aaatgaaaga acattt attaacccat ttttccacag agaacl tttaacaaaa cagaaagcga caaag cgagagagag agagcacctt tctga cgctacaacg ataaggatat tgtga ccacattcct tttgtctatg tacact agtgat t agt gaatactgaa taat a aggagtgaga gcaaatgatt gttga, gttgggacgg ctgaactgga gttgg ggagctaatc aattctatat ggggc! attttctact ttaataaaaa cataa agaa aagca attt aat ac at gaa cgt gg cgt gg tttt tacgt at at g act t a ggttt cccgt t t acat at ga t at at at acg gt t t ggaaca cat t cgt aac agaaaacaaa aaat at aat c caaaacatt g at t t t t ccaa aagaaaatt g gggact aaac at ct gt at gg ct ccggt cca gaagcgacgc gt cagat gt a t gt gt t caga aaaaat gt t t caat at cgag aat aaaacca caat aagagc gccat gaaca t t t at t t t t g caat t ggaaa aacgt ggcat acggtgccga gt ggacccaa cgtttctttt at t t t at gat agact aaaaa at cat aacaa t agacaat t c ggggagagac agagaaat gt ccacat t aca tagacagact t t t ggaat t g aagaggagt g t gcaat t gac ct t t cat cat at cat gt cca 120 180 240 300 360 420 480 540 600 660 720 780 840 v vv tgtgataaat tatgtttttg ttatatggta gggttagctg agagctatca aaagactctt Page 102 12689250 Sequence Listing.txt t t t t at ccac t t act aat t a at gaaggat g gt agt t caaa aggt t cat aa t t ctaggggg t agt at at t a t t aaat t aac ct at t gat ag agt t acagat t t aat t t t gt gaaaaact ct gaaat t ct gc t ct at t gat t ct aggt aat t aat t ct at at aaacat t at c agcaaaat t a cgt ct ct t ca aacaaaaaaa <210> 94 <211> 200( <212> DNA <213> Aral <400> 94 ct t t t t t cct atcct t t t t a gaat ct cat c tgt t ggagga ttgtcct t t t act t acgct g ggt gat ct ct aact t t agct t gaacgt gaa at t t t ct agt ct aat agat t cat gcat aag at agaat ct a aagct t t at c ct t cat at t t aggacaaggc gggtcggtcc cacaaaact c ct ccacaat c t t gt at caat caacaaaaca t agcagt agc ttaaacaaaa at gt agcaat aat t agt gaa tcat t t cat t gt aat at aat at gagagcac gt ct t cacaa gt t at agaaa t gat t t gt aa aact agt at a gat aat caga gt t aacaagt t t t at t t at t t cat gacat a aaat gaaaac cat t at caac at t ct t t t aa agt t gaagt t at acct t aac t aat t ct t ac aaaaaacagt gat t agat t g agaaagt t t c gaaacgaat a caat caat aa t cact at agc acacaacat a cgt t aagagc act at at aag aat t t t act a aacaact t t c t t at gt gt aa ggacaagaga aacgt t t agg cat aat t t t a at aat cagaa gaaaccaaaa t at cat at t a t at cat cagt ggaacat gaa gt t t agat t a cacaaaaat a at aaaaacaa cacgt at aca t at agt ct ct t ccacaat ac ataggaagtc aatttaatcg gagct cct t c t t gat caat c aagt at t gcc aagagt gaca aagaaaaat a tatggggcgg gaat t aaaag t ct caaat aa t aat aat aat tgacaaacac t aat t at act t at at t aagc t at at cat ca at cat aat cg ccat aagcct at t at t aacg ct at at aaac aaaacacaac t agaaat t aa t agct at ct c caaat agat a gt ct at at t a t agaagcat a cgaggctaag gt ct ct gt t c gt t cat ct t t t t agt t at ag t aat t gagat aat gt at at g aaaat cagt t t gacagct ag t cat acacac accaaaagga t at at t gaca aact t t cat t t t t cat at at 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 0 bidopsis thal i ana t t t t gt aaaa aaat gt at gg at gt at t aca t t t gagat gc t gt aact t ct cggaaaact t agct ct ct t g gt ggt gt t t t aggctgaaga t acct t ct aa gtt gaaaaaa aagt t gacaa gacat t aat t aaat agct t g at cat gt aca ggcaaggacc t t t at ct t t a gaggt t gct g agcagtggct t t t cagt t gt t at t t at ct a aat at agt ct ct acaaat gt ct gcgt agga ct at t t at gc cct cagt gga t ct t cct t gt t t t cagt gag t cgt gaacaa aat act t gat Page t ct t t cgcag ccat ct t t t g agat t ggaat at t t at aat a agagagt t ct aacgaat t t c ct gct t t t gc agggaggcgg gagcagat t a tcaaggaaga at t t aat aaa cagt aat aaa at t gct t ggt t cat ct aat g gggacaggga ct gcct gaca agt aagt ct c aggaacaagc aaagt t gt cc t t gt ct aact 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt attttttgtt tttcagacga gcctcttgag attacttaca gttactggga cat agacgag gt gt gt gat c cccaat t ggg tcggacggca gagaaccct t t ggggt at t t agggggaaga at t gt aaccg t gcat gaaga ct gacccgat t gt aaaat gt agaagaacaa t ct gcact t t gat t t acgac cagaacat gc gt ggaat gt t gt t t t t t aat tgat t t t t ca gagcacacaa acaaaat cac ct at ggt t gt aaagcat aaa at t t gat aga t t at t caggc at t t act t ac aat t t t ct gc tcagtggaga t aaact t acc t t t t gt aaat gtgggccggt agact cact t cgt gagaaca cacat gagaa ggt gt t aat g gcat at at t c t cat t ct cag ccgacaaaga t t t t gatgga cat ggcat t t ttttcttaaa t gt cat gt ga at t aagaat g acgaaat aaa gt t gt t at ct t agcat cgag at caagcgat aagcacaaac ct t t at gat a gggct gt t ca at t t act at a aagt t at cct gt agcaacac at gcct t gt c ct cct acat t at agcagat g aagagagtag aagt cgcat g cct gct t cca ct agact t t g aatgggaacg tcgt t t gttg t t aat gct ca at aat ggaac aaat t t agt a ttagaaacca ttgagagacc tttttttttc agagacact g t acacct t t g t at acccat t gcagcagct t t gt aaaggaa ccct t t act a agt t t ct at g gacacttttt gt t ct t gtta caacgat aga t t t gat gat g cgggt aaagt gat gggaggt aat ct t at t g gt acacagt c acat gat t ga ttct t ct t cc tgt t t gattt t gagat ct t g at t t gt gaat agacaaaacg agaaat aaat t t ct cat t ca t cat act gct t t acaggt cc gcccct gact gat ct t at t a t at aggact t agct gat cat gaaat t cagt acagct gt t t gaaagacgag gaaagt t gt t tgtggagagg aact ct caaa t t t t gcatta cat ggggat t t cagat t t gg ct caat gt ac ct t t agt acc t gat t acaaa cccct gat gt ct gt cgt t t t t aat t t agt g t t cgcaaat t tgggactggc t gagcct t ac gcaagggtga tccgggagat t t ccacat gt aacgat aaca taacaaagcg t t at at agaa cact t t gat g gt at gt t t cc t ct aaagcat cat t ggt acg t aat ct t t t c t aat t gt t ca aat t ct ggac t gt aat agt a cat t acact c cgt t at t t t g t t t at aat ga at acaaggca gcgagggttt tgaaagagca cgt t gaccat 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> tttgatgtat gagaaagagg aatcgtcgga tagagaatat ataaagagct ttttggcggc ggagaaaggg aatgtagtgg atgtgtctct tcatat t ata ttgactagta gtgat t agag gtgactcgtt agtgtagatt tgttaggatg aggatggtgc atgtggttcg ttataatgtt aagaaaaagt tatcatttgt catggcttag gaagttggtt tgtgagggt a gattccggtg gcggagttgg tct t catgag agact t caga tttttttttt ttttttctga gt gagagat a t t caat ggga ct t gagagaa t gt gt t ggat gtactaaaat ccaatcaggg Page 104 aaaagt gt t g cggcaaagga t t gat t t gca ct at t gat t g gt gt agat t t at aat aagt c t t t ct t t t t g 120 180 240 300 360 420 12689250 Sequence Listing.txt acatgttttc gagtacattc ctcagccgag aacaaaaccc gaacccaaac ctaatgtagg tgatgt t gga taagaagaga agt caacat g gtct t t t t t c at t t at aggc ct t t cat t t t tgtcgggttt aagcacaaga at t ccagaaa cactt cattt ttct t t gtag gatcgt t t gg ct aat att gc agct t t acac agt at at aaa acgtgcagt g aaaagacaac cat at caaca aaat t aat cc ttaaaaaaaa t aact ct aag t aat t t t t at t t acacgt ct t gat t at t t a t at at cgaga aaat at aaaa aat act t aat agaagaaaga aaaggaagag ct t t t t t t t t t aaagat agt at caaagtt a gcacat t caa caaat aagt g t t gaaagat a tgt t t t gtta tttgtataca gtgagt t aaa gat t t t aat a at at caaaca t t at ct aaaa gaact t ct ag t aat aaat at t caaacaat a at at accaat aaaat at t t a agcacttctt cat t t at t aa ctct t t ctta gtt att gaaa ggcctaagac taaaaaacac cct actt gac agatgt t t gt aagataaagg ggtcaacgaa cggtagagat t aatt agaga at at t t t at a t aaaaaat aa ttgtgactgg t acat acgga aaggaaaaaa agcat t cgac gt at at aaat cgtccaccac t at ct caacc accggtgaac t t at t aact t ttatagcaaa tcat t t t gt t at cat ct acc acctt gaaac cctaagaaac t aacat at ca t t gaaat t at aacact t aca aagatcaaga ttgtgtaat g ct t t t gggcc cgagcgcatg t t t aggt t t g gat t t aact t ttatgtatgg tatgaaaggt t t at t gaaat gaggccacac ataaaaggag gagatgaacc t t t at aaaat aat t t t ct t a t aat agt t t a actaaacaga aacagat t t a gataatgtga t t gt t at at a cagt aaaact gt ct ctt act agct aat aaa acatcaaaca ccacacacca cgt ct at ct t gaagatgaga aagaaaagaa at gt t gcaca aat cct cagt at aaat gaga t t t at caaat gt aaaaact t t at gct aaat t gaat t t gat acgtctcttt accccatggc aaccaccaca at ct caacct t ct aat at t g cagct t t acc t t t t t at cat cacgtctctt ttat t t ggt t tatcgagagg acgccacat a t aaat t aat a t at t t at t aa at at t at agc at t cat t t t g t ct t t ccttt agaagaatcg gaagaagaag t gaat cct t a caat t at t at t aaaat t aaa at t t t gtgt a t caaggt caa gt agat agaa cctgacacac tt ct t t t t ct gggaaaatac at t t t ct t at aat aat t t aa cgat t t t aat cat at caaac t t at t aacct t t t ct t atgg at t gt aat t g t t at t gt gat accaccacaa at t aaat ct t t ctaagagac aaagat aat g t t t t gt t at a gtataccaaa 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 96 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 96 ccattatctt caatatgctc tcaaaaacag aaacttcaca aggaatcctc aagacaccgg cttgttgaaa cccgaactct tcttctgcct cacgcagaag aacatagaaa gcttgatggc tgaggtactc ggttggtatt gtgtacctct tcttctctaa cccgaccgaa acagctagat Page 105 12689250 Sequence Listing.txt accctttagg cacagctgtt acatctgtaa atgacagtgt cctct t caga t ct t gct t cc caat t gct t g t gt t t gact t atgt t t t gt c gagaatcggg t gct t aaaat aat t t gt t ga caaat t t t gt aaaacgcaac t cact cggca aaaaagct ca act act aaga t ct t t ctct t t gat t agt ag aat gcaaagt t t t gggt t t t ggt t t ct t ca caaacat gaa ccct t cct cc agcagacat g t t gagaagat t gcat gat cg t ct t t cct cg t ct ct cat t t cagat t gt t t t at cgcagt g gt t cgacacg aaaaaaaaaa gaagagaacg ttccacggaa cccgccact g t t t t cgccat ct t ggct t cc t agcgt ct gt gat at at aga at t gt gct aa caagcaaaat caccaaaat c t at gt aat t a gcat t ct cgt t agt accgt t t ccat aat at t t t ccacct c gaat gt cgga gt ct ct t t ca ct at t at ct a gcaagt t t cc t gaagt agca gaat ct ct ga gat gggt cct gt ct t t cacc t aat t agt ga acaact at aa gt t t ct t t ct aat gat t t t t cct t t at t t a t gt caat cac aaaaaacagt aaaaat ct t c gt t t cagaca ct at t gt t gt t t ct t gagaa atggtggaga ct ct t t ct at ggt ggt gacc t cat at gct t t aagcagct t agacaggcaa at at t gagat gct ct cggct gt gt ct ccat gt aaggcct c ccggt act ga t ct t t ct ct c gt ggt t cacc cgggat t at t t cgat agct c gct at cgaga at gt cagat t aat cgat caa gt t t cat gcg t at at t t gga cgaat gacca ttttttaaca t ct t acaaaa cct t t aggag gaaaccacaa gt cccaat t t t t gt t ggcaa ct t cgt t gt t gct gt t gaag tagaagacaa gt gt gt agt g acggaat t ct t ggt t t t t t t gaact t agaa agct ggct ca at act t gt t g gct gt t gcca cagact t t ca act t gct t ct at aacagcat acgt ccaaag at cct t aaaa gaagaggagt aaaacat cgc t t ct ct ct ct gat gt ct cct gat agt at ct gaagt t gact gt t t t cgct t at at t t gt aa t t acat t t t t agt at cat t c t aagt t t t ct ccaaaaaacc gat caaacaa at ct ccggcg gt t gt t ggct ct t aacgat c aagacaagaa at t aggacgt gat t t aat t a t aat gat t t a aat gat at t t agcat cgct t ct t t ct gact aat ct t at gg t t gat gt aag t t cccat cat gt t gct t gct aggt agcaaa t gat aaagt c gt gat acct t acaccat gt t cgt ccggt ac ggcact t gga t ccat at agg cct t act t t a ccggt t act c t agagat agt t cat agaat t agat aat t t a t t ct t ccgat t t gt cgt ct t aacaaat t ca agat cat ct t aact t gat gc gct t t t gat g t ct ct gat ct gt t t ggt act ggt gaat at t gt t t t gt t t c aaccaactt g aacat aacct aaat ccct gt ct gat t t cat t at ct t t ct c t ct t at aggt accat at ggc acaaaat gt g t t t ggt aat g t ccat ct t t c t gt at at gt t t t t caat cat ttagccaaca ggat aat ct a t ct caagagt agcaat t gaa t gat at gat a ct at t t t cga ct aat act ca at aaaaat gg at cct aaat t ct ccaat cat t aaat t cgga t ct t at t t t g 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 97 2000 DNA Arabidopsis thal i ana Page 106 12689250 Sequence Listing.txt <400> 97 t aat agt ct c gt cgaagat t caaggacgag cacagacgaa t t aat t t t ct ggaactaggg cgat t ccaac gt t gagct t c tt ct aaaaac t t cagt t cca ggaaagt at c at ggct ct at ccaaaaat cc t ct t t t gct a t gcacgcat c gat cct t ccc ccgaagacga agagt acat g at aagcct t g cat gagat t g ct t t ct cct g gt t gagaat t ggacaggatt t ct t aat ggc t agt at agt a at gat t t t gg agt t ggat t g agt t aaagt t gt at gat t ga aaaaagt gga caaaagaat a at gggccat g tt cat cat ca ct t t at gt ca gaaaccaat a t accat cat a act cacat ct gaagt gaat g aaagagct t g aaaacct t ga aaccaagaca gt ct t caaaa aact gt gct c ggagaagaac t gat gat ggt t aggt gat t a t at t t gct aa gat t cacaca t t t t cgaagt cgacgaact t ttacagagca aagat ct gga agaacggtgt caacat cagg ccgaaact gc ggt t gt ggt c agt aat t at a aaaat t at t t cct t t t t cat t act t t at ga gt t aat t ct t ct t at gaaaa aaact ct t cc t aat t t t t t c agcccagat a t aagaat ct t at t t ct ct cc at aaat acaa gaat ct ccat cccaccaat c caaagat gat ct aaccgat t t ct ct gat at agaacat cct agaggt aaat at gt gt ct ga acagagaaag cact gct gga t cat t cat t c at cct at gt c agact gt t t a gaagt at ct c ct ct gt t t ca at cggaagac gaacaagaag gt ggaaaagt gt ct gcaat t agat t cgt at gt agct t t ga gat ggt cagt t t gccgat gg tgt t t t t cag caat t aat aa at gat t aaat cgct t t gt aa at ct t t caaa at t t acaact aaaact at ca t cagat agt c aagat cct t c aaat aaaaat cat t cat gt c t cact caaaa gatggaagag gat gaacaat cct ccgt at t taagcgaagc t gagct aaaa t t cct ct gt t t t aaggt t t t gaaaat at gg at cat ct ggg tctgt t t t t g gcagtgaagc ggaaaccaca ttgacgaaca at gaaaccga aacat t t t ca aacct t t t ct accagcgagg t t ct cgt ct t at aat t gat t t t aat t t aag at aaaaat t g cagggt t t at agaaaggatt t gat agaaag aagt gat gca ttttcaagca ttttgtctaa agcccagct t ggagaagacg tcttttccac acaacttctg t aat act t at t at aaat aca gaaaat ct aa aaact t gt ga ct caaacaca t accagaat g ct t gagat cg gat t ggt t t t tttttttttg t gtt gcaaca gcaaaaagag aat t t caaag tttacagagc aagt t cagaa ct t gt aacaa cgaacat ct t ccaagt ccga gaacgt t t t c t gggaaat t t t t t t atctgc t ggacaat at t ggt gaat gt ggt t at t agt gct t atagga gaat t at gat t t at at aagg acat t t caaa at t at t agga gt gt aaaaat gaaaagaaca cgagaagaga aaaacgaaag t gt ct aagca t ct agat act gaaaagat t c t caacgaat t ct t ct t ct gt ccat t t t cat at ggaaaaga t t gt ct ct ct gt gat t t t ca gaacaagaaa at t cat ggat t t t t aat t t t at at t acaga at cagacaca cat cacat cc tgaaggaaac agaagt gat g t t t ct ct aac cgtggaagat acct gct gct t at cgat t t c aat gt ct ct c t at t act t at ct aaagt t ac t t ccct at ga t t t at t t gt g t gt cacat t a t gaacagat t t aat t ccagt t at cgt cat t ct ct gt cgat ttccgaagaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 gaagaagaag aaggagaaaa Page 107 12689250 Sequence Listing.txt <210> <211> <212> <213> 98 2000 DNA Arabidopsis thal i ana <400> 98 gaccaagagt aaat aat at a gt gaat gt ga at t gaaat t g agaaat at t c at aagacct c t t agaat t ag t gagt gt gt t ggat t agcct acgt t aaat t t t t caatttc acaaggt at c ccgt cgat cc t caagat t t g caat cagct t ct aat ggt aa caggggatag tccaggcgag gt gt at gat t caaaaat aaa t agt gt t ct t at at aaat t a aat act t t t t aaaaacgagg t cat gat t t t at aaaact cc at t gct act t ttat t t gct t t aaat aat t t ccaaaaat aa at at t agaac aat at aat at ct at ggt at c agt t t at t t g gt agt t ggt t t t act gt t gt acgact agcc t act t cgt gg gact ct at t c t at aat at t t gcat at caga aagt gt gcat t aat gaaat g ct t at agagt acgaagattt at t ct acaat t agat gt t t g aagat t caga cgccaccacc ct t t at aaat at t t at ct gg ct acat cat t act t gct t t g cact at t gat gt t aaagagt t at aaat agt at t aaaaaaa cagt t t at at aaaat t t t gt aat at at aaa cagaat t aga at agt t t gt t t t cacaat ga t aat aagt at t t t t gacat t act t t t t t ga t accaaaaat agt t t gat gt ggct gat cca t t t aaact t t at aacaccaa gtt aaaagac ct caaagat a agt gat t gt t t t caagagt a aggaaaagag t t t t gt t ct a gagaact cgt at caaagaag aaat aaaat t t t t ggt t gca cat t t t at gt ct t t aat caa ct t t t t t ct t aaaaagaaca agaaagcaca aaacgacaat cact t gcct c gaat t aat ac aat aat cat t t aaat t aat g ttggat t t t a aat t agt gct t ccaaaacct t t gggacat a at aaacaat t acat t t at aa gt t t aaggt t aaacat t aat caat t at gct t t at cat t t t t gt t t gggt t gaat ccaaca agt t agt tag gagat gct aa t t t acat t ga aggct t at gt t gat ggat t a gt gcaaaat c at t gt ct gt g aaaaat t gag acaaaaaat t t t t cat aaac t t gaccacct t t act gat t g agaacgagag t gt aacaaaa t gccat t cac gaat t t t gt t t caat t t ct a gaaat at aat t agat t gat g t aat t t gt gt ggt t t ggcct t t t t t ct t at gcaaat gt aa t gt ggat t t t t cgaaaat at ct gat acct t t agct gat gc cat gct accc agt at t gat t t aaagaat ct t t t gt t at t c caacat gcat t t act ct at c t t at gct ct g cat at cat gg aat aagt at c t aat t aacat t t t agt t aag t ggt t t at at gagat act at t t t t at t t gt t t at agt cac aat cact ct a aat at cgat t aagt t t ct t a t agaagat t t aaaaaaat at gat gacaaaa t cat at agat acaagaatt t taaaaggaaa gat t t aat t g at t ggct at t gat at gt gt g t t gagaat t t t caat t ct t c acaacaacat agaagcacct aat cgaccaa ct at t ggt at t t t gat t t t c aat aat t t gt at t agt agt t gaaaact ggt t cgct gat ga agagct gt t a ct t aat t t t a at aat caggt t ccaat t gat t gt gaaaaaa at ct t cat ca at ggaat gca ccaat agaaa at agaaagt t cgaaacgt at aagaaaacaa t aaaat gt cg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 Page 108 cagat t t t at agaagaaaat ccat aagcgt ct cacat t aa 12689250 Sequence Listing.txt tttcttacat aatgcttttt attcgcaata ttttttctta agcatttagt tcgttaaaaa gagagataaa ataaataaaa gtaatataat cgt cct t t t g t gct ct caaa aat at aaagc aaact cagat ct cacgat t t t ct t ct t cct cacacat cat 1860 1920 1980 2000 <210> <211> <212> <213> 99 2000 DNA Arabi dopsi s t hal i ana <400> 99 t t cgt cgt cg aaaaagt cca agct ct gcac cacacgcaat cagat cat ct tttaccagcc at aat gt acg aagct ct gt c gaact ct ccc t t ggt aaagt t cagccgt aa t aact caggc aacct at ct c at aacat aac aaaccat aac gaaact cagc gt aaggt t aa aaat cat aaa aaat cggcca t acgcat cgg gt ct t act t c cgt aaat gcg cgaaagaaat ttttttggga t ggagat aat t gt t t t t t ca aaaact ct t t at ggat agaa gt t act aagc t cact gaaga at at t t aaac acgat acacc gt ct t accag t ct t t cagt t agct t t at at ct t ggagggt gcat t ccat a aaaacat aaa agccat agat t cagccat aa at aacaacag cat aacat aa accat aact c t cagt cggca ccat at gt ca aaaacgattt at t gaaagat act ccggcgt cgat t t cgt t gat gat gaac t aat gacgt g t t t aat t t t t t ggcact ct c gagt caat aa cct gcct aag acgct t t gt a cagt gat at g aaagct cct t accagagaaa ct gcccat at t at gat t aat act ccct t gc act cagt cat t ct aact cag aacat aact c gat aaat ct a ct cagt t aaa acaaact cag agt cacagca aacaaacagt at caacgt cg cagt gacggg act caccgga ct gaat t t cg ccgt cgagag agt aaagct a gat t at ggt a agt t aaaaac aaat ggt at c cccaaaccat t t cat ct aga gt cagct t ca at gaaat t cg ct t cagt at t at gact at ca t t ct t ct cct gcct t t accc t t ggt ct t ca aacat aaaac ccat agat aa agt cat aaca act cagccat acat aaacca ccgt cat at a taaaaaaaac aact cagccg acgt t aaaac aat cact t ga act t agat gt gccggaacag agaaagt gga agaaat aaga at gacat gga gaat t aaat a ct agagt t ag ct acgt t t t t t t aggacct t ggct caaact t t caat ccaa ct cagct gt c acaagct t ag at t gat t ct c t ccagat t ag gacat ct gt a aaact cat cc cat aact cag t aaacct at c at t t t aacag gact cagct a acagaaact c tcagacacgg t caaat caaa ccagct t t aa gat t gaact t cgaat gt gac acgagaaaag agaaagcgac aacccaaacg at aat gacat at aat aat t a t at gcaagcc cagt aggcca caccggt t gg t t t ct gt agg accggat ggg t gcat agt ag caat t aat cc t aat t gcagg aat ccccct c t aaccat aaa acagat acca t cat aacat a tcagccacac aaact cagt c t at caaaaca agacat cgat at gaat t gat t agat ct aag aaaccat cgt agat gt caga t t cgcgct t a agcaaaacgg agt t t t t t t t t t at ggagt t gt at aat cac at t aaaact a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 attaggggtg ttaattgtaa ttacacaata actgaaaaca tctgagtaat aaaaaaaaga Page 109 t gt gt t ct ac aaat t gt ct t t aagct cct t aaaat at t t t cgacacaaaa aaaaat cct c at cagagct t 12689250 Sequence t gt aggaat a aacat acaaa t gt gt t ct at ttcccaaagc cgtcctttaa acccgacgaa gaaagattaa gcatttctgg aatttttatc ttgatattcc ttgtaaaaat ctgaagaata aaagttgttc cacttattaa aaaaaagtta tgttatatat ctctcactat cttttatttg taaagaaaaa Li st i ng. t xt ct t aggaat t gt t gt t t aag gact t t aat t aagt t t act c t t cagtt at a gt t ct t t cca agt t caaat t cagct t aaat agagaaaaat cact t at t ac at t t t t t t aa gaaacaat t g 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 100 2000 DNA Arabidopsis thal i ana <400> 100 gt t t t t t t t t att aaaggcg gaaactgcga gacccat t at aatcgaacag t t t aaaat t t at t t t t aacc t aat t at caa t t t gt t t t t g gaataaaaga tccggcaaaa aaaccggatt tgagct t t t t acaat cat ga t ct t ggt gaa cagat t cgat agct t ct t ct agaacaaaat t t ct gt acac ggat at t t gt t gt t ggat ca at t t cgat ca gaagt t t t at t gggaaat ac agtacccaaa aagcgaacga t t t aaaat ct ctaaaaacaa gggtcaaaag ttatggcaaa at aat agagt t t t aact aag cacagagcac acacat acat t gat t caagt caagtcaaac tgat t atct t t ct aagct t t ggtgggcttg tctgt t gtta ccagagatcc tcgtcggtta gtcaccggaa ct aagt acac acaat ct gat cagtagcaga gaaatcgaaa t t t t t aat ac ctaaaaacaa gggtcaaaag tt caatt aaa gatccgcgcg t t t at t at t t agat t at aat ctcacagcga cat at t at t c agaccaattt gggt ccct aa tagcgaaaca gat cgt ct t t t t ct caagat aggat t t gaa aaaatctttt at t t ccct at atccgcaaca ccgt t gagat t at cagacat t ctt actt ca agatgagaag ggctaaaaac caggt aaaat tt caatt gaa gatccgcgtg tgaccaagat tttctaaaaa t agcgaaat t t gaat agt at cacaatt ccg ttttagccca ttggt t t act accgtcggt t gct at cagac agat ggact a ct gt ct cat c ctt att acac t gt t ct cat g gt aacgct ct gt cacacgt c aat t at t aca ggt t t ct t ga acataagaaa at aacgcaat aact caat t c gatccgcgt g tggccaagat t t t t gtaagt t agaagtt at ggt t t t t t t c cat gcat act aaacatattt aat att agca aaaat t t cca gacgaaattt t t aaacaat a ctagacaaaa ttaggcatag gcagagatca t t gt t cat gt act t t gt act ct t at cat ct accgaat ct a tccggtgttg tttaaaaccc ctataaagaa aagct t gtga aatggccagg t t t t ctaagt t agaagtt at gaaaccccga gacccatt at at gacat aaa tt att aaacg t cat t t t at t attggagaaa caagtacaaa t cagat caga ttttcaacga cat t agaaac at ct aagcat aat gt ct t gt gt ct cgccat t cacagct t c ctgtgtacga gct cat cgct aaatt aacat tgttgagaaa accaagagga tatatct t ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 Page 110 ttgt t t t gt c att ggaat ac t t t cat ct cc gct t t t t at c cacggaaaaa gagaaaaaaa gtgcaggcga tgacgtggaa aacgccgcgt gcat t t t t ca gcaaggagaa t ct gt t t ct g attttttttt gagaaaaat t t t t ggct t t t acat t agaag aaagt at t gt aggagaagca t at t gat t t t at t agagagg t ccgact ct c accct aacca 12689250 Sequence aaacagagaa atggttttta ttttaacgtt cgttttgcca t cacaaaaca aaaccatt gc ctt gaaaaca aat agaaaaa ttcataaaaa tt cact cgaa t t t ggaaat a t at gaggaag atgtttggtg ttttataaga aattttttag ttttatttta atgccctagt ttaatgtctt tgcgttaggt ttcattttag Li st i ng. t xt aaaagggat t ct ct ct gt ct aaaaacagag gt t accgt t t t ct ct cgtt a agaaaat gag gagacgatag ttt att gaac ggt at aaaag cggcagacga ctt aaaagaa t ct aaatt ac t at t aaaaat ct at t t act t ctt cagaaga agaagaagt a t agt aact ag at t t t at caa cct at at t t t gagagcaaaa 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 101 2000 DNA Arabidopsis thal i ana <400> 101 at agagcaga acagaaggac t aagact cgg taaacaggga at at t ct gaa caagagat cc ct act acat c agat t ct gt g caacgggaaa cggcaggtaa t agat gt at t acaaaccagg t ct ccaagcg cact ct aaac gaagtt gt aa t t t t ccat gt t cct gaggaa cacacct gat t gt aagt aac ct t t t gat ct t at gaggagt gt t t ctcttt t aact gagag agggat t t t a t gaagccat c t acacat t t t at t t gat at c aagaagggca gt gct acaat ct t t t gaat g gt t at t ggct t agct acgca cat t t t ct ac aagccgagca act at agacc caat ggaaca gagct at ct g at accat gac ct t t t gt t t c t agct acaac ct t ct gt ggt t t cat t at at agacct gct a aat ct aat ga t t at at at ac gt cat ggt t a at t agat gaa tt gt t t t gt t t t t gaggct g at gct gcaaa aagaaaggt g gaacat aat c ttttcttgaa ccagccagac agct t t t cgt ct gt t at caa acat t t gct t cgct aat gct t gt gat gact t t t at t gtct ccct t t agga t agat ggagt aaaaggt t t a aacct at at t ct gagat cag ggat t t ct t t t cact t ct ga acgatt gt t t aagt t ct t ca at t t gcccaa t cat ccact a act ct gat ca t t cacat cgg agacgggt t a gaaacgaat c gcat t gt t at tt gaat acgg at agtt gact ct t ct t cttc aaagat agag ggaagagatt t act agtt gt cagaagggtt t gt att aaca cagaagt t aa ggt ct t t cct tcaggcaaaa act ct at t ct ttttgaaaac t agct at t gt aaggtctgt t t t t at t at gt acaat cggaa gagcgcacac t at t ct ct t c t t t gt t t ct t gt gt t gt gaa agcgt t t act acgt ct act g t cat ggccgc t ct t cact t t at ct ggt ct t ggaaat t at a t gct cagct c t t acagt gaa gaagat aaag gct act at cc t at gccaagt cacagggaac ct gcaaccac gt gt t t cagc aaggagat at t gat t cgaac aggagtgaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 gcgaggctaa ccgggttttg ggttggggtt ttaaccagat attgttttct aaaattcatc Page 111 12689250 Sequence Listing.txt at t ct aacca gaccgagt at t t aat gt cgt caat t aat at ct gat t t aaa aaaaaat gt a at t aaat t ag tacgacgaaa gt t t ctcttg t gagct t gat cct t ct t cat t t t t caat t t gaact gat ct t gt t t cgt aa caact at t ct ct caaaggaa aaat cat gt a at t gt t t t ct aat at gaagc aagt t gat gc cgagacgcaa ct gaat ct ca ct ct cgcat c cgct t cccct t gt acgaat c cacgt ggat t t t ct ccat ga t ct gt gaat a gat cat acca t cgagt ccag t t t at ct at a ct cat gt aat t ct gt agt ca t t ccat at aa att ggagaca ccgt ct t ct t cacgct t ct c t gt gaact ga t cgat t t ct c acgttttttt t t t t cat t ct ggaagggggt ggct aat gaa cat t t t t gt g aat t t t t t ga aaat t t t aca aat aaaagt a gagat aact g ct t ct t cttc t ggt aat gct t ct gat ct ga ggat ct t gct t ct t cct t t t ct gt t gat ct tcataccagc agcgaatttt at t at gaaat t aat at cagt t t gagcat t t gat t t t aat t aaaat gagt a agcggaggga t t ct t ct aga act gat cgt c ggt gaat ct c gct t ct t at c gcgggttttt ggt t t t t gt a tactccaggg t caat aaccc gaagtgt t at at ctt ggaat at t cat gcca tt cagat ct a cgat cct t gt accgtgaagc tat t t ct t ct gt t agatct g cgat t t cttt gagt t gt t t c 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 102 2000 DNA Arabidopsis thal i ana <400> 102 ggagaagagg at ct ccggt c caccgacggc t t gt cgat t t t agccat gt t t t at gaagcc t at gt t t t ag acgcccat t g t t t t ggt t ga t t at gat t gg acaaat aaat at t gat agt a t aaaagt gaa gt t gaggt ca gt t gaaaaat acgaaat t aa ccgacgatgg ggt gaggat t t aagt t t ggt t t aaggt t ct ttgtgt t t t c gt t t caat aa at gaat t at t at t agt gct t agagt cagcc cagat t aagt ggat t cct t t at at t t t at g at t acgct t t acaagagcaa cat t cat t ca act t ggggag cggaagt t gt t t gat gagt c t t ct gt ct gt t gct t agaat act gt t t gt g taaaacccca t t gcaat t gc ct agaacggt ct t cggat t t aaat t aacga ttcat t t ct g t t at gggt t t cat aat t t ca gt t t aat ggt ct at t cct cg t act aaaagt t aagat ggt t acgaaact ca cgat gt t gt a ccct gat t t g at cgt aacgg t t t gt t act c t gct at t t aa ggat gct t ct gacct t t cgt at ggt caagc gaagcggaat ct gat agt t g t t aagccat g at cgaat agt gcgcgtgagg ct aaaccagc gaggagat t a at gt ct ccgt gaat acgt at cgaact t gt c ggt t gt gt gg tttaaagccc tt at agaacc gt gt ccact g gt acct t t t a acat gat t ag ct t gt ct aac aagact t gaa at ggcaagt t gcgt aact ga ccaat t t gac aacgt cgct t gagt t gagca cgt t ggct ac cgt t t ggacc ggagtttttt t t t t gtggat at t aggccca t aaaact cga gt aat t gt ct agcccat t ga ccaaat cat g aat t t t t t t a at gt t gaact ttttttttgg aat aaaact t ccct t ccccc t t cat gt gcc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 112 12689250 Sequence Listing. t gtacg atctgggacc gtcgttgt ttaatcacac ccttttctgc cacg t t gt agat cc t t t agt t t t t aaaat caaat at aat at gt t t t aat t t t aa agt gaact at cagcgt at t a at agagcgaa t aaact gcgt cgaact t t t c t cgaat aaag t ct cgaggaa at t t ct cct t t t gat t at cg at t t gt t t cg ggct t t gact t ggt t gt ct c ct t t ct agat t gat act t t a t aaat ccat a act aaat t t a t ct t ct t t ga t ct aaaaat t taaaccaacg acaaaagtt g cgt at t t gga t ccgcct ct t cct gagat ct ccgat cagga ct ct cgct ct at t acgt t t g t t t gtcgtgg aagat t gt t g aaagct ggaa gt t cat t aaa at ccct gt t g gat t aagat g tggtat t t t t t t t ctgt t aa aacaaaat ca ggggat agt t aggct t gt t t ggcat at t cg ct ccact cca ccgaat t ct g t gact t caat gtgtgt t t t c gat ct gt t gc tttttgatcc ggact ct agg at t act t aat t t t t aat gat tacat t t t t g cgct ct t gt a t t t t ct t t ga tttaaaccga t gcat t aggt t t caacat t t ccggt caagc aaat t ct t ct at cagcaact cct cgaacct t gt ggt t gt t gat at t t cat gct t gat gt t t ccgaaat ct t t at at cc act t cat t aaaat aat t t gact t t at t cat t t caaaaaaa at t ggaca tcat t t t a gt at caaa ccgt ct t t at gcat t c t ct aaat c ttgatttc ccat t t t g at t t cgca cgat ccgt txt gc tcat t gacat ag t cctt ct act ca gt at ccat cc aa atggtttat t t a cat ggat aca ta tacatttatt tac tttaaaagat t c tggggggtaa itt gtgtgctatt t a cccaatcacg ct cccgtcaccg aa gt cat ct cct ag tgagctattt ct tagctcgttg laa gcagtgagag iga gtttcttccc cg aggtgattgc 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 103 2000 DNA Arabidopsis thal i ana <400> 103 t at acgt aac aat t acgct t t gct gct t ca gt t t at at t a t t t cat t t t g gt t gt gct t a act at aaacc cgact gt gaa act t at cagg aaat gct cat gccgaacaca acacacaaat at aaaaagcc ct t t t at t t t t t agaaagt t at t t gt t t t g t ct ct at gt t cgt aaaat ga cagt aat ggg agcaccaaca t t gat acat t gccat aat ga t gt aaaagat gaaat t gt ag caaat t at at t ct t ct t ct c at t t aaagt t t t gat gt t gg t at at cat gt ttgaagacaa t t acat agac t aat gat gca t at agacgac t act gagct t aat t agat aa ccgacat aaa t gggcct aaa gagt caat at gt gt aagt at t t t gt gat at caaaaat aat gggt ggt t at t t t cct gaaa at ct ct t t ct t aat at ccac gcacct t aca at t aaacact accaat aat g at aaaat cgg at t gaggt aa at gt acat ct t t t t ctgat t ct ct cggct c gagt agt t cc agaagact t g caaagt aaat gcact at cca gtcggaaaga act t ct gaaa aaggt ccat g t act aacaac cct at cat ca t aagagt aat gaaat at t ac t ct t t gt at t t ccagggt aa at gggt at gt cacat gt t t a agt cact aac t aaat at ct g t aat aat aaa t caaaagat a 120 180 240 300 360 420 480 540 600 660 720 tcataaattg acaaaacttg gaaagtaaag tgggaacatg ggacggacga gaatcttggc Page 113 12689250 Sequence Listing.txt ggccgaaaat gggattatgc caattgttat agcccaacct tctccattat ctctctattt gt t t aaacac t t act agat t t ct t aagcaa aaagaaaact agt ct t gt t g aagact aaga acaaagat at aat t ggaaat ccat at ggt g agagtgagga t t t t t ct t t g t at at agt ca act gt t t caa at agacct ct t cat cat ct t t t at agaat t ttctctcttt aaacgcagga aat t ct ct ct gat ct gt t ct t act t ct gat t t aat t t at g cact t gt ct a at at t t gt t c taaacaaagc t t t at gaacc ggat aaat gc aaaaaacaaa acat cgaaag t aagat t act ct acacaat a t ccaat t gaa aaat caat aa caaaaat at c cat cat cat c cagt gaat ca t t at at t t t t aact t t t ct g t ct t t gt t t t t t t aaat aat t aaacgt at a ct aact cgac ccacat t t gc ct at gt at at aat t t t ct ac at t aaat t at aact t t gt at t at cat ccaa ggt at gt gaa agt ggat gga tacat t t t t g t gat t t t t t t t agaat t t gg at agaat t aa at t t cgccgt acaacaacaa gt gt cccaat caaaaaaact gt t t cagat t aacgat t t ac t aaat at at a acat act at t gt at gt t agt aact t at at t aat ct agt ga caacagt gat ttttgggcaa gt t gt gaaca ct t caaat t t taacccccca t t t ct t t t aa caccaccaaa agaat act t t t aagt t t ct c ggt aacacat ct ct t ct t ca cggt gaccat cat t act t t t t act cat t gt t t agat t aga at t acct t ac ct aat t t aaa tagaaaaaaa t aaagaaact at at aaaat a aat at t aat a caaaact at a ggct cct ct t t acacat t t c ccaaggt at c caaacatttt t t at t t t at t t ct ct ct ct g ct ct ct ct ct t t t ctgt t t g at t caaaact ct ccaaaaaa t ct gaaaaca t ggaagt gga at t ct cacaa aact ct at ag aaat cacat g agt cagct t g act t at caaa gt t t t t gtgc t aaaat agag ccaacct gt c aacat aaaac at t at caaac t t t t gt aaaa t gt t t gt cca tttccaccca ct ct ct ct ct gt t gccgaga t t cccacgt t ct t t ct t caa 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 104 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 104 gtggccggaa aaatcgtcgg agct c tgggagatag agagagagac tgaga! aaaat ct gt a atggtttttt t ct t g cacggtactg tacttactct cttaa< ttttcttctt taaactgata cacat attgaaacat gcaataaact acgga ttaattt t aa ctaaat t ctt taat a ttattttatg gtagtttaat ggtta ttaaatttca gaattttaat acactt cggt g gagac acgt a aaat c gt at c at at c aaat a at aat ttaa gt gagt t tag tgagagagaa t t aaggt ggg tat t t t gtcg aat at ccaaa taaaaaacaa aaat aaaat c tgt t t t t t ca ct t aat t at t Page 11 at at gt at gt t aat gat t gc t t t t at cagt gt t t t t t t cc t gaaacacat t aaaat t aac at aagt ct at t at agaacat t gt t t at at g aaaccgaccg gt aaagccgt ct t t ct at t a acgaaaat t a t t t t at acaa ggccct aaaa aaagt at t ag aggaat gt gg aggcact t t a 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt agttttagaa aactataaac acatatgcaa aacatcaaac caaat t atgt ct t t t gt t gc cgt t t caat c cagt gt t at t at cggacaac t t ct acat t g gt cat ctt at at t t t t gtgt act caat caa t t gaaact gc t agt gaattt at act t at at at gt agcaat tt at gagagt act t aacat g t ggt ct aaat t t cagat t at t t t at at t t t at acaaaat g gagt acatt c t ct t gagt ac aggt gaaaaa gatccttttt cact at t cgt gct ccct cct at gct ct at a acaaccact c t at t t act ct t gt t agt gt g t acat t t t gg ggt aagct gc t acgt aact t at caaaccca cact t t t gt c at t cat ggga gaaaaaagat t aggt gat t t ttctgt t t t a t at t t t at aa at t act ccct aagat gt t t t gcagt t t t ca acct aat at g agt aagagaa tgt t t ctat t ggct t at t aa ct ct cagct a t t acact aac t t tct ct caa t at cct aact aagaaaat aa ttttgtcgta t t t t t t t gt a cagt caat t t at ct t t t gag ccct t t t gt t acgccaaact agat t t at t a aact aat t t a ttagaaacaa ggagt ct at t ctt agcgggt t ct cat ggt t ttgt t t cat t acaagt t t aa t t at cat t t t t at ccat t t a t ggat aaaaa att gggccat gat caacggt ct ct ct ccat cact t cct t a at t t ttgct t t t acaat aga at gct t t aaa cat caggaca tacggcacgg gaat aat aca at t aat t aag gcaact attt t ct cact t cc at gcat ggt a att gt att gg ct ct gt t t ag aaat ct act c t t cacaagt c at at t ggat g at t t aaat t t ttttaacaaa cct t aaacct cgaat t t gga aaacagt gag t ct ct cgtt a t t at ct aat a t ccacaccaa agt agt ccac t t act aaaac t t acat gt ct act gt t agt t t t t ccat t aa t gt agacgt c tgt gt t t at t at gt gatt ga caact t t t ct t at gat gcgt ct t aact aaa aaaagt agt a t aagct aggt t t aaat t gt a t t t t agggat aat t t gact a at ct aaat ac ct t at at t t t at acacaaac acaaacaggc cgaat gcaaa aaccct aacc t ccagaacgt t t t t gcat gt t aaat t gtt t aagt t aacaa ct t t ct t t t c t act cat gt t agat at ggac cct at t t at a t ggt caat t c cgctt cagag ttt ccgt aac acaacaagca t caat t aat c act gt t gt ct gt t t t cat cc ct t at t agt t t t t at t t t gt t t t gt gact a t at t t t t t at aaaacagagt t at aaaagt t ccat agaagt gaat ct t ct g cct cct ccca t cct caaaaa 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 105 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 105 gacaacttga agaaattacg gtaatcaccg tttttcaaac tcttacaacc atggaagaag aagt aaat ct tgcaccaaca t agcaaaaat t caaaagat a acat gaat cg gct caaaat t ccagacctgg gcttctgcat aaaacgtttt gaaaatgaga atgggcctaa gcccacttgg ct ccgt ggag gacct aggt g gagagagtat at act aaggg gcaccacct a cagat t aggt Page 11E gcgggatcat t t t ct agt gg gct ct aat t g agagcaagaa t at t gcaaat caaat cggga aat gccgct c cagggtt ct c acgt ccacaa aaggt t t gga gagcct cgaa gacat caaaa 120 180 240 300 360 12689250 Sequence Listing.txt gcaattaggt ctgcgccgcc gctcaccttc tcttccttca ccggagtaag ccttggccgg aaact cccca atcgcaaggg agggagatcg tt ct ccact t gaaacagat a gcgt gggct a gagagct at c gaaat gggt t agt t gat gt a t t aaaaccat aat t t ggccc gggaaaattt aat cgaaat c t gat aaat cc t t t gagt aat at t agat t gt at gt t gaat t t aat agt gaa t gt gaat t gt t t t gt at ct g t gt at at t at t t t cat t cac t at aaaat t t t gt t gat gt a acat gt ggt t cat ct at act cat caggt aa acaat aagaa gaggtgaaga aggaagaacc t aagacgct t gcaat gt gaa caccggcgt c acagagcgt c gat t t at t at t caat agct t gggcctgcca aaaaaccat t t ggaaact cc gct ct ct gcc gat ct ct gac gggt t t gaca ccct agaagt t gggt t t cat cacgt t aca t at aggt t gc at t aat t aag accaactt gt caaaagt gt g ggct t at at t t att actt ga cct gt gt t t g tggt t gt t t c gt aact gaaa tcccgagagc agt aact acc agcgt cgat a gat t acact a at agagagaa ggagaaggt g t att gct ttt t gaat att ga at at t gt t t g ggt ccaatt a t gggt ccaac aaaaccaaaa t ct ct t cgt c aacat t agat acat t agat t cactgt t t t t ct act aact a t gact t t t ga ttgat t t t ga at t ct aat gc gaagaggt t c t cgt gagt t a gt t t gat aag ct cat t t t ca t t t agtgttc ttgat t t t ga tcgaagacga ggcggcgaga gat ct gagt a gat gat t gag gcagggaagg gt caagat ag gact t t agt t t gtt gt at ct at aat aacac agt at ggaca aacaat t aaa t cgt cgct ct t ccgt ct cca t t cgat t gt a t at att gccc gct ct act t a agat gaaat c ct t gat agt t t gtt cat gt g agtt gagttt tgt t gt t t t c at at t ct t t t aacact agac at t ct ct t ga aacct gaat g at cct t aat a ggcaagaggg t acggt ggaa tgatggagga gaaacggccg aaaaat cccg gt t t ct cgat t t t aaat t ca t cat act t ac t t aggat t gt agcccaaaat ggct agggt t ct gcct ct ct t ct t t gt ct c agt t ct t tag act cccact t t cgct ct t t c aat t t gagaa t gct at t ct t t caaat t t ga t t ggt ct t aa t gaat t t ct c aggct gaat a ct ct t t caag caat at t t ga t t gaagt t t a agacat ggaa at t t ggt t ga gaaggt t t aa t gaaat gccg cacaagaggg acgct ggcaa t att ggacca at aaaagt gt t t t t agt t t t cat t t ggt t c gaaaggattt t cagt at at g cgcat ct cca cggt gagt at gt ct gt agat gt t t gt gt t g aat t act gt a gact agt gga gt t gt at ct g t aat gt agct gat t aggcat t t at t t t gt g t aaat gt at g t t gagcgt t a cat t gcat t t agacat aaat t t t ctct t ct 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> 106 2000 DNA <213> Arabidopsis thaliana <400> 106 aaacttgtgt tgatcaattt tgctcccaat ctcccagaaa ttcaggtgat tatcactctc tactcactgt ttggctggtt tcgtcaagat tctaattttg tgcttcaagt atatgtatta Page 116 12689250 Sequence Listing.txt ttatgtgttc ttatagttca tggcatctag taagtaaacc at atgt t aga aat at caagg t t ggt agat c t at gt aaagt t aat t gct t g t t aacagt t a t gt gat cat t gctgtgcgga gtaggggaag acaggcgcaa gagt ccaaaa t gt t at at t g aactggaggc taacaacccc t ct caat gt c aaggagcagt acgaact agc gt cagaagaa ggcagct t aa aaaaaaaagg aact ct gct t t gcaat t act at t cgt t gct at aacgt t gt gt aaacgt aa gt at t cct ga t t ggggatct t t ct ct caat caaat gaaat ct t t t t agt c t at t t t t aaa t ct t gt ggt g ct at t t cat t at t gt at act t t at ggct ag gt t t gat aaa cat cgt t t gc t aat gcaggt t t ccat gt ga ccat t gt cgg ct ct t t gccg aggt aagaaa at t ggggt t t atgggagaca at t t gat gt g t at ggt t gt t cccgaggtt c caagaaagct gct t t gct aa gccggagact gt t cat t gt t caaaaat gt g t caat ggt at t ct t t t t gt c gaat gt aat a cat gagaaac ccaaagcat c aaat gcct gc cct cct caca gact gt cgaa ccat act t t t aacagt t t cg t caagt caac gcagt act t a act at t t gt t aat aat at ac gt t gt gt at g at t t gt gct t t caat cgat t ggt gct gaag gacgtggaag cgaagt t cca accgaaat ga at gt gt gt at at agcggt t g at gaagacga gt at cgat t c t t ct gggt gg at gcagaaga t gggat t t t a cacct ct t t t t t gt t gtgca at t t t t t gcc acagt agcct accaaacat a cact caaacc t caacaat t c aaact t gggc agt aat aaga act t cagt ag t t cagt cacc t t t aaat t ca agt gaat aaa gct t cct t ga gcat ct cct t act t gat gct gt t t ct gaag t ct gat ct gt gcat cat t ct caacggt t gc caagat ggt c ct at acgt t g t t ggat t at c t gt agat ggt gggcggtgt c gaat gat gac t gcgt aat ga ct cct ct agg acgaagat gc ct t t gct t t g gt t aaaagct ttgaggacag ct ct ct acaa t aaat cgcca agagacat t g aaat gat t ct acat at gt t t t cgt at t cag gat t ccaact taaaaaagac ggt aaacacc caacaaaacc aaagct caga t gcaat ggt t ggt t ct aaag gct ggggct a t agaacccat gt t t t gt act gcagcacact aggct ggcat caagt ggct t t t ggcat ggg at aagggt ca ggcgcaagct aggaggtat t tgcaacgcca gggaccgct a t gct at gaac agt gct agcg ct gct acgaa act t t gcaaa t t t cat t t ac ttggt t gttg t at t acgaat gcaaat gt t g gt caat acaa t acaaaat ct gagt t at t t t t t ggt act t c aat cact t t g ct t at at gt t cat t at t gga t gcaat cat c ggaaaaat cc gacat ggt ag t t ct agt agt aat ggt ct ct ccgct t t t gg cggt aaccaa att aggcaca gt t t aacaat t t t ccgtggg act gt at gca ct act gt aat ctgggaagag gcagcagt t g gggagaccga ggt t t gt t ca t t t gct gggt gat cagct t g gt tttttggg at t t t gacaa t ct t t at aaa ct t t gacagt ct t aaggt aa gt aaaaagaa aagcaact ct act t t ct gt g gt ccacgt ac aacacgt t t c aagggaaaca ctgt t at t t t gt t t t ct gat act t t cat t t 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 107 <211> 2000 <212> DNA Page 117 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 107 agt agt act c acaaacacat cct t t t at ag ccaat aat at ttttaaacca aaacgt aaat t gaacat t ct t t t agagt aa at aat caat t gtt ggaaacc t ct t t at aca cggaagcaat ctagagagag aaat at t caa t ggact gcaa gagaagat aa t at gcgcaag t at gt caaac t caact gt t g acact at t ga t caaaggat a aaaggaagca t aaagt ccat ct aat ggt ct at t gt t gaat t t t gct t t ag gat t t at caa cct t gt ct ca ct t ggctgac agt t gaccga t gt aat t gt a agccgt t gat cat t t gat t a t t t ggt gt ag gct caaccaa t t gt at t gga aaaaacaaca t t t gtt aagc cagt at ggga aat ct t act t at gt acaagt gaact t aact t ggacaaat t t cccaat t ag t act t ggaga tcgggaaagt tggtggaacg gt gat ggaat cat gagt t t g t at gagt t t g aacacat acg at agat t caa aacgt t aaaa accct ccaac gaaacaaaag gt ccgaacct ct at t act t a gt at ccat at ct at t t at aa t t t ggcaaat act t at caag tcgaaaaaag aaaaat aaaa t act at caac ttct t t t t ga agt ct t t ccg t t t t gt at t t t gt cat aaaa accaact t t g gt t at gt gaa t t t ct at caa tttttttttt t gat t gcat a cat ggt at t g ct t aat gaac acat at gat g t cat caggag aat acat gat aaat ggct ag tggggggggg agggt t gct a at aat aagac at gagagt gt cat at t gaaa gaaccacctt t aaaat ggt t ct t t aggt aa t t t cagt t t t t aat t t t t t t agt t t t gacc at acct t gaa at t t cct at a aact ct cct t gt at t aaaaa t caaat t at t ggt gt acaat aagcgt aacg cct ct gcgca ct cct t agt g gagact ct t g ct ct t t cgct t t t ccgact c gt ct t t t gaa gcaacat gt a at t t t t ggt g t ct t gat ggg t agct agcat agct t ggaaa gcat at gt ag t t gt t aaact t aat gt at at gggggggggg t gt ct gagt t t at at gt aat t gaaaaagt a accaaaaaat t t cagt cgca gcaat t t gat cagt aat aat caat gat cag t gact at t t a gacct cgat g accgct act a ggt t caact t gt aat cact t aaaaaaat ag t t cagat at t t act gcct t t t t t aggt gt g act act t ct c ctct t t t t t a t t at ct ct t g t t t agt t gca aaaaat caat gat gt ggt t t t at ct t at t t t agcaaaagt ct t gat t cct ggt cagt t t t gat gagt gaa t gggacaaat gt aaaaat gg t gcat gat ca t t aaaggt cg gt gagt t at g gagagt gt t g aaaaat cat a gggagaaaag gaaggagagg cat caaaacc aagagct t t a t cacagt t t t t gt t aat at a at cat gt t at gacat t ggag at t agt agaa agt t act t cc aaaaat t aag ccgt at t ggg gcct gt t act gcat t aaaaa ct t aat gct c t t gact ct aa t gaacat t gt act cat t t t c t gacat t cac ct ct gat cca tt at ccaccg aggt t t aaac cct at caagt gt accaagt c t gt gat t ct t gt t t at agaa cgaaagt aca t gggaat aag acgaccaaaa at gact gaga aat aaat t at t gt ggat t aa at gt at gct c ct ct agt ct t ggt ccgt gac gat aaat ggt gat t ct aat t ccaaat t at t at t ct gt gga t aat ccct ca at gacaat gt at t at ggaaa at t t t cat ag aat aaat ct c t gt t ct gct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 cgtcgctcag ataggatctc aacaagacac cacaaaccct aaatttcgtc aactccacag Page 118 12689250 Sequence Listing.txt cgactcgatt cgatcaagga <210> 108 <211> 2000 <212> DNA <213> Arabidopsis thaliana 2000 <400> 108 gaaaaaat t a gt t t gggtcc atccaacaaa tggcctaggc aaat gt t at t att gacact t aaacat cat a ttacacgaaa aagatacaaa gacggcaaac caacaaaact gaagatatgt t t t agat aag ccct cat gct at ggt t cct t t t t aat at gg gt t t cat acc ttagaggat a at aaagaaat gat t t aaat a t acaaacat c cact at gat a t t gcgt t aaa aggt t agct a aat t t aat ct at ctt gaaaa at at t at aac aatgggcct t ct cacct cca aagagt t t t g at gat at aac ct t t gct t t a caat acat cc ggaaccagca gat aact cac gagat cact a ctcgtat t t t att aagtt gc atgtggattt ccaaaat at t gt t t gat t aa t t t gtggact t gat t cgt t a t t t ggatt cc aaacgtacac agacaatt ca cctaccaaaa gtct t t ggat ctaagcacac t ct aaact ag tt ct cgaat t cgat t aagt t tgatgt t gag t t t accat t c cat caatt at cgaaggcctg tgtggcccaa t cgct t t ct a cgat aagt aa act aat t aac ct aact caca tctcgggat g acct ct aggc agt t t t t aga gtagatgat t cgaaatgttg t gact at at g aagt t t gct t gt t gat t t gg at at act aac gcgatgaagt t t t t ctctgt agctaacaaa caat t cat t g aaacat gt ga at t aat at cc gaaat at t t c at at agt gt g ttttacaagc actt act t t c ct t aat cgat tcaatcaacc at t t t t t cct ccaacatttt aaagt aagt t t ct at aaaac caagt t t ct t gatat t t gt t caat agtt gt t ct t t t t t ct ggactaaagc t t agct aaaa t gt agt t t ct t cat caact a t acat t gt at ct aaacct at cggtttgttt tggctacaaa at agt t t t ga tatgaagcca gat t t gat at agaaaaggtt cgagat t ggt ttgat t t ct c t ccaaaat t a aaaat t at t t acaaaaatat at t cgaaaac taacaagaaa gtaatcggt g acat cccct c t ct acaggt t tgtgctggac acaacgagat ct aaccct ag ct ccagattt t ct t gt t ggt ggcct t gtgg t t t gaat aac at t t t at aga t at t t t acca ct aaat cat g gt t ct at at t gat at aaact agacacgtga cagt t t gt t c aaaat t at aa t cgt at t agg cacaat at gt gt at gcaaat t gaaaat gt c gcat t act cg aact t t cat t t t t ggccat t t t at t cat t a gtgat t t aag acagct ct aa agat t gt t gg tactagacga cacaaatcca cgaaatcgaa cgaagccaat agtaagaaga cat cgt gt cc agggt t t ct c ttaagagtcg gaggtaaat g taagaggttt at t t t ct t t c gt t t t gagct cat at aaaat cgat acgat t ttagaaaggg acaat gagat t agt t t gct a t at aaaaat a t agagccat a attttttttt t t at t t at t a caactacaaa ct t gct t t t t ct ct aat cat gacat t acaa aaacaataat gat ct cagga caaaccaaat agat t t t t at ct t aaagaat ggagagt t aa gt ccggt cca ggcaggccag t gct t aagt a ccat at aaaa agtgt t t cgg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 Page 119 ct cggcccag t ct t aat cct t t t gaat cat t t aat gt t ac cagccgt cgc 12689250 Sequence Listing.txt aacaagatcg caagacttca agatggtaaa tcctttcaca tctccttcat tatacttttc tattatcaat tagtatgtaa atgttacgga agttcgagac tttgatgatc ttctctgctt caaccttaat catctacact gaattttatc gaaagtttaa gactttgaat catttttgat gatatcttct ttgtatgaaa acaagt cgt t 1800 1860 1920 1980 2000 <210> <211> <212> <213> 109 2000 DNA Arabidopsis thal i ana <400> 109 cgat acct ac gt cact t ct t cacaaaaat a t acact t t t a aaacgcagct accagcatcc cagtgcagga t ct gaat gt a t gct aat t ac aaat ct gaaa cat t t t caaa catcaaaaca gcacatagag t cact at ct c aagt caagt g t t ggct cact tt caccaact agaat t t t aa cagtgtagag ccat agct t t cact ct t aac cat ct ccgat ct t t aaact g ccat t aaagc ggcgt t t gag act gt aacaa agaat gat at actt ct ggca agt t t acact act at aacat t t t aaagct t acagtcgacg caatggaaac aaaccagcat ct t ggt t t at gggtgatat a gacaactt ca caat at t t t a gat gt ccaaa tcaatcccaa acataacaca at aat at gca aaccaaataa ctt ct gctt a agct t t gcat accat catt g at ct ccct ct act aat ccca ccaat aat t g aagctccgac aat ct gt aga agtaaggcag acgctaacag cct t ctgtgg ggagct t t gg t agccacat g caagt gt t gg aat gcagagt acaaact t ac gt t t t aat ac gt t at gct aa agcat ct gcg acacaaggca gct aagact t at cct t t ct g aaacagcttt aagaacacaa agggaaagtg ct at cctt ga ccat acat ac tt agt aacat tgaccaagcc t actt ccacc t ct ct at aac cgggt t atgt cgatt aagag agat cct cac ct caact cag at t t t t t cgt gggaggcttg ct t t t gggct ttttccacaa taggtgagaa ct at at at ga t accacaat g t act cat at c t cagt t at gt act act gaat ctcccaaacc caaact t gaa gcagt t t caa ctt gaagcat aaacaaaccg gaat at gcaa aactcccaca t gt ct ccact t ccgt ct ct t aat aat ccct act t ggccag gactggagcg cgcaacataa ct t t t ccct t ct t gtat t t t t t cgct t gaa ct t gt t gaag ttctggaagt agtaatggt c tt gctt aaat aat ct aggt t ggact at t ct ct gact at t t gtatcaacca ct at aaat ga ctatgaaaaa aaaaaaagag ttcat cat ca caatt acttt t t t aat gacg tt gacagt at at t act at t c acctt caaca at t cgat t ga gtgt t gtgcc ctgct t cagg tgcgaggat t tacgt t t t at ct gggacat a ct t ct t t t ac t at gat gcac agacat t at g gt at att aca ctgctggggc t at cat t at a gt aaaaact c aacacaaat a t acaaat aca gaacaacaaa ttaccgacga ctttgaaaaa aaacaat ct c ct acaacat c ggt t t atacg tttgtgcaac ttcgtaagcg gcat act gat gcatcaccac ttat t t t cgt t t gacat cct t t gt gt gat c t cat ct t ct t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 gtgacatcgt aatcggtgaa ttcgatgtcg acggtaagcc gctggtgcta gggttaggat Page 120 t t aaggt aga cagcgccgga t ct t ggaagc ct t gtcggga at at t aaagc at t ggat gt g taacgaacac gat t t t caca gt t t cgaat t ggaatacgga t gacgccat t at gaaat t at ccat aaaat g agacct aagt gagct t accg t ccacaagt a 12689250 Sequence aggggctcgg ctgaggtgga ggccgtggtg gtttcgagga gccgtcgtcg gaaaatgaca t aat t t t t aa aaaat aat at tccattgggc tgacatattc t t ct gaat t a ggt caat at a attctcttcg tctcaacatt Li st i ng. txt agccact gcg gaaagggttt gagct t caaa t agt act t aa aacaaat agg aat act aaaa t agggt t t ca gt t gt t gat g t gct gaat gt at cagat gt t at t t t aaaat cccaaact ca t cat at ct ga gagatcggct 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 110 2000 DNA Arabidopsis thal i ana <400> 110 tgt t t t act c ct aaat t gcg aat at caaaa ct at t agt t t at aact gat g cat caat aaa acat t t at gg t cacgcct ct ttcgacgcaa gat t agct t c acacaatttt t gat agt t aa ct gt gt ct gg aaat ggt t t t tcat t t t t ca at at at t agg t gaaat gt cc at t t gat gt g t at gt ct cag t ct caat ct c t aaact aaat aat at t t ccg aagt ggt t aa tt at cacgca aaaat at at t gt at caat t t t at ct at t aa aacttttttt gt t at t t t ct t aagt t t t aa at t gat t t gt agccat ct at aat t t gt gt t aact ct gat a t t ct t aat ga gaaat t t agt ggt t t t gat g ggt gaaacat at gt aat t gt ct t t ggct t t tt ct cacacg t ggt cagt t t ttttggaaaa gggaaat gt a gtccatttga cttcaccttc cccactgaag ttcgactcta t aat agagat aacagat gt t t t aaaaaaat t t t t t gt t aa tttttttttt t aaat agagg ccagacgt ag ccct cat cag t cat ct cct t aaacat gt at ct acat at at gt at t ggt t c gtatgt t t t g gat aagt cat t t t t t agat t t ct ct cgt cg t t aagt t ct t aaat gcaaga t ct cat gt t t at gcagaagt at gaaacgt t t aat t at gga t acagt aaac taaaagaaaa gat cact t t t ggt gaat aat t aacct aaaa t t aggt at at gt aagt t t t g t at cct agt t ct gcact t at gt t ggt acat t cat gt t aca t gaat t t ct a ggat aaacaa at gcat t t ga aat at aagt c acacaaat t g gt acct cct g aacaat acca ggaggctat g ccact gcgat t t ggact t t g t aaat aaaat t agaaat t t a t aaaat aat t aaaaact gat aaaaat t gag ct at t t t t t t t gat gaat ca ttgt t t ctct aacaaagt at caat t t t gt t t agat t t at t aagt t t at ag accacat t t a t aaat t caaa aagagaagat gat at t acac at at t gct cc caat gt aat t gaat gcat ga cat gaat gca t cgat t aaaa gat t ccat t t cat cat t t ag caaaat t t t a gt t aat at at at cat cat ca t ct t t ggggt t t gat t ct ac gt aat gt at a at at ccacct t t agt ct agt act agat t ct agt gt acaaa t ct caagat t gt agacgt ga at cgaaggt g gaagaagat g ct agt gct gc at t cat ggaa agt cgaccaa gt aact t t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 Page 121 12689250 Sequence Listing.txt aagat t gtat tgtgataatg aatgagaata catttgt t ct gt t t t caat t t t t t gt gttt agaaact t at at ct ct gt at gaacct at ct aat t agat ag aaaagagt at t cacat t t t c cat t aat ggg cct ct t aacg cact gt cgt c cgt t act ccg gat t acaaag cact t t ggaa t t gcaacaat cgat act t t t aaaaaagt ag t t t ctat t cg tt cat aaacc ctt gtt gaac ct gagagact t gct ct t aag t cgaaat cac t aat aat aat at t t t at t ag aat at ct aga agt t t gggag aaaacat aga t aagtt ct ct at aat t t aat aaagaagaaa cagaaaccct cct ct cgt ct at ggtt aaga t t aaaat gt t t at t at agt a ct acat aaat aaacgt gaat caatt gt gt c t gggctt aat acat t aat gg aat at at aaa t t gcct t t cg gt t t t t t ct a t agat at t at t gat agagct at ct t caact caacaaagca ct aaagt t t t aaat at at t g gct t t ct ct g taacaaaacc t ct ct t ct ct ct t t t t gtt c gaaat act t t gct aggaaaa t aaggat aaa act cat t t gc gat t aat aat ccgacgt t t c ggt t t at aaa agaaat gaag aaaact t cct gct cgccgat 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 111 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 111 cactctttta atagcttaca atgttttgag cattgactgt tgtgttcttc ttttcacatc tttggtagag tgtgccgagg catggaggtg aacaccgaca ggtaaagt t g at t at t cat t aaaaacccaa ctaatcgaag gaatctcttt agtgaagatt ctaaggacca aagtgatcga agaagaagaa gcatttagta aaccgtttat tccct t gata ctat t gtaga ctggtttttg aaat gat t ga aat acgt t aa at cact ct ct caattcaacg gcttaaaaaa gaacacgtca ct aaaaagct at ct ct cact atgtgaagga acgctcttcg ttccacctcc agtagccgt c tcaattcctt gcctgctcca agttcttccg aagagggcaa ttgtagcaaa tgcttcgccg gacggagttt ctctgggtac tatgaagctt actcttctat ttcaggtaaa tttgtagcat gcgtgtggct gttaaccaca aggtcggagg taacttttac tttttttttt tgttttgcta at cat t t t cc t ct at gcaac at caagaggc t ggaat caac t ct att ct ca t t gat t gggg gacat gaaga t ct aagt ct t at t t ct t t t t t caagat t t t cgcgacagt a cgat ct ccgg act aat ggt g ggaaacggt g cccgt agat a gcaat agct g t t cgacccct caacagtggg at gt t t act a tt ct aggaaa t cggt agt gt aagaat t aac at t t ccaggc cagaggagtt act at aact c gact ct act t gt aat t t gcc cat gt t t caa ggaaaatggc ct agatt cct cagcgagt cc gcgt t aaggc ccgat ct cgc gaat agct ca cct t ct agcg cgaat agt ct agagaactt g acacacaat a ccaaaccgat caccgagt ga cgat ccat ga t t t gtcaagg t gt gt act ga t gat ggaaaa t cgt t cat at ct cat at cct tttcgcagcg t ggccgt ct c ccggcaagt g cgaggaagaa t agat t cgaa gt t ggt t aga atct t t t ct t ttgccaagga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 gccaatctac ctctacctgt tcctctcaag gtactaatct tgcgattact caaaaattat Page 122 12689250 Sequence Listing.txt atgtcaacaa atttggctac ggatcatata aattgtaaaa agctaagaag ccactttttt t t t t gt t at a at gaaggcaa at ggt gt agt aagaaagaat t cacat aaac gt ct tt ct ag aaagt t t t aa t aaat aat cg cagat t t agg t t t t at at t c cccaacaagt aagaagt ct c t t gcaaaact act t t aat cc <210> 112 ggtggacaga gaccgatgt t gt t ccaagcg cat gagaagt at t t gcct ct t t t ct t act g agt t t ggggt tgt t gcggag ct ct ggt ccg t t t t gt ccag aagat t cggc at aaaccct a ct gt gt gt ag cct t gaacca ataagcggag ccggt t t aca acgcgt aat g ct ct t aggag t t cagt t gaa ct act act ac cacat gagga ct at caacaa aaagggcgt g at at cgt t aa ccaaat t t ac at at at aaaa gcggaact ac gagcgaggtt t t gact gt t t ggcggaagaa ccct caaaaa accgaat t t c t t at ct gaga aacaacact a at gacaat t t ggt t caaat c aacgt cgt cg gt ggaaaccc gcgat t tct g agaaccaaga aggat t cat a ggt t t t ccaa agat aaagca agct gt ggag gaat at ct ct cat gat cact gcaact aaag ggccgagtgg ccacagt t gt t t acaaagt g t aaacgct cg at t ct t ccat gcct ct tt ct gtgatggaag acaacagaga cctccgggag at t gctcgag t t t gct ct ac ggt aaat aat t at t at gct a t ct aaggcgc caaat at t t t ggccgaagaa t t t tctcact cgt t tct ct c aat cggagt t 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 112 gtcatttttt t at gat t aga agcaattttt aat t aat gt c t agct gaaaa t at cggt ct a tgt t cat gag caaaaaaaag t at aat gaaa t t aagaat ag ttgaaaagaa acact t gt t a aaaacat t gt ttttattgag t t t t at t aaa t gt gt gat ga t t tct aaat g accaaact aa aat agcat ga at gt at at gt act aat gat g aat aat at at t cat t at ct a gt agt gct t a gat t aat cat t gt acaaact t t t t ctctta at t cagt t t a at cacacgt g at cacat at a t gt gt cat t c at cat t gccg t t t t gat t t a cat t gt aaat agat t t t aga aat cgcgaag act aaact aa gt agt gat t a at t t at t aac acat t aat t aacct t gagt t t acat agt a at at at ct t t at t tct t t aa at t t t t gt t t at at ggaaag t aat at t agt cat ct t t aat aagt t caagt t ct ct aaat a gacaacgtt a t aagaggt t t t t gat aat aa at aat t t t t t t t agagt at t cacagt at ct t t t at aaaaa t t at cat gat aaat at t ct t t aagat t ct g agtt ggccaa aaagat gt gt t aaat at t t c acaagat t t a agat t aacat gaaaact aat ggt agcaaac aaat t t aat a t t t at t t t at t at aat aaat t gt cggaaac gcacat gct a at t t t gt agt at t t t t t agt t t cat gt aaa ct t t cat at a ccagt gt t t t at t t at t aat t t t gt t aaag aagaaat ct g gacgaaaat a aact acaaca aaat aaagag t t t gaat caa gact at agat at gt t ggaga ttttttggaa aat acacgt t aat acat ggt agaat t t cat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 123 12689250 Sequence Listing.txt aataatgttt taaaaaaaaa aattcataac gtattatgaa atagcatgtt at at aacat g gt t t act t t t cgtgggtcca t gt t t t at t t t at agaat ga at ggagagt g cgt t gt ct t t t t aaat t aaa gtggaggaga t ccct t gcga t ct ccacacg t caaaat cct t ct gt cagt a t cat ct t ccg t ct t t ccgat cgt agat ct t aaaccat ct c ct ccagagaa t gaaat t t t a cat ct t acaa tggt t at t t t ggt t at ct ac t at at gt caa t t gaccaat t ttcaaacagg at aaat aaaa ggggctgcgt gagagggaag cct ct ct ct c ccat gat cga t ct gaat ct g ct t ct t cct c cggt t t t at c t t ggt accat cgt t t cgct t cct t ct ct t c at gt t acat g caaat aaagg gt aacaaaaa t aat at t acc act t t t gt gt t t cat at at a t aaaaat gt t at at at at t a agat t aaaat at t gact t at t ct ct cgct t t cccaat t ga cat ct ggaag cat cat ccga t ccct ccgt t t t cccct gct cacct ct cag at at t t t gaa at agct t t at act t t at ct c t t t aat t t t a gat gct t t t g t at gat t at a at gagat t t a t t t t aat aat ccaaaaaat a tggcacgcag ct t caat cat t caat caat a cgt gat t t ca gct gct t ct t t cat ct act t t t ccgt cgct at cagagct g aaat t acacg tat t t ccttt t gct agt t t c t t at t t t t t a t gt gat ggt g t at t aagaaa t t t gat t t t t aggggctaaa ccgt aaacaa ccct aggcat t cat cact cc acct t at ct t t cat cat gt a ct cgct ct t c ct ccgt cgt c ggagccat t c t t t ct ccagt t t t aaact aa gat gt gt t t c ct t ccgacaa t t aat t t gt a aat t t aaaaa t gt cat t cat ct at at at t a aagt aat t at t agaaat t at aaaaaaat at t t t cgt t t cg gacgt cagat ct t cat ct t c t t t aaccgct ttctct t t t c t ct t ct cgct ct t t cact cg ct t agat cgt 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 113 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 113 t agact t aaa aaggaaact a t t t at t t t ca t aagagt cat agat ggccaa agggaggaaa aat ct ct t gt gaact t at gg t caat ct act acat t t ct t t cat t t t aacc t aact aat cg aaacat aat c aaagact aat at aacct aag aaaccgt cac caagagggtt t t caagt gaa cat t gaagct gcgat cat t t aggt t t t gat aaaacaaaaa gtgaagagca t aaat aat ag tcaaaaccaa gaaaacaacc cagt t ct gat tgtggcagga aagt t cact t aggact aact t cagat t t t t t agt t t t gca t t t at gat ga agt ct t ct cc agcaacaagt acaagaacaa t t aaagat ac t t at at agac t t t ggtgttg cat gact ccg gaagccgtga gaact ggt aa t gccaaact t cgagaaaaat aacaaact t a t act cct aca act t at t ct c t t ggaggat a t t t act t caa caat t t gcgg caat t acagt gt t t ggggat at t gt t cttg gt t t t t t aaa ct cct aat t g gt agat aact at ct aggaga acaagt gt at t gggaact ca gggt t t gat a ccaagaggac t t t ggaggct cact cat at c aaat at t t ac at aaat t t aa agt aggt t t t 120 180 240 300 360 420 480 540 600 660 gtccttatca tctttttagg ttatgtggag atcagtgcct gat aaagat a gcat t gcaat Page 124 12689250 Sequence Listing.txt gataatgtat gatgtgcaac gcataagaca acaat t gaca tcaagcacac ctct t ctggt gact ggaaat aat cagcat a acgt aagct g t t t t aat t t a t gat ggat at aacggctagg gt t t t gcagt gggt t gt gaa t aaaacct t t tggtaggaga aacaaaaat c ct ggt agt t a aat t cagaac gcgt t t ct gc gat t gcat ga acaaat t at a agt gat gt t t gcccat gt ag at t acaacgt cat ct gaaaa aacttt gaga <210> 114 <211> 129( <212> DNA <213> Aral <400> 114 gaat t ct at c caaggagat g t ct t t t t gct acgccaaagc at ccggt gaa gacat acact at ggt t caag t t t at cagct caaact aat a agt at cgaag caggt at cat t cgt cgt gga at at agat gt agct ggt ggt ct t gt t t gga t t t att caag cat t t ct at g agat t t aggg at at t t gt t t gaaat gcagc agacagt at g cgaggctgac gt act t ct ca t t gaat ccat at at ggat t t acccgt t t at gtt caacaac caaaat ccaa tccgagaaga agt t agct t a agaaagct ct ct ct aat cac at gt t gat t a gggt at t cct t gcat t cat a gt gaact t t t t gat t t accc t ct t at gat t aat agt gaaa t gagaagt ct cggt ct gact gt agt cct t c cagat t agcc at t ct t cacg at acagat t t aat agcccat ggaaaaagat aact ct at t a t t ct ct ct ct t gaact t gca aacat gt gac at t ct ct aga t gt t t acgcc t t t gct at at gcaaagcaga gt t t ct t t t t agagact t gt gcat gagt ag ct t gt agat c ct cagt t agg at cccct t t c ggt gagaagt ggt agggt t t tt gt cacaac gccaaat act t t agt t at at aaat gggct t tacaaacaga ct ccct ccag ct agaaacac aaaaat t aaa ct ct agct ac t aat gt t gt a gt gt ggagt c gat t t at t t t gat t gt t act aaacgggaca cccaaacat c cgagt t cgat cct t gggt ca at t agt cgga ccact ct aaa at caaaaaaa aact t gt t ac at t t ct at t t gggt ct gt t g t aat t t cgac ct acgt cgt t at t caaacga t agt t t caga cgtggaaagt t at acat t aa at t t cat ggt gaat ggaaac at cat t at t t t t aat caat t taaaaagaaa t at ggt ct ag cct ccct gaa at t ggt t t ac aaacat t t ca at at t t cggt aagaaaaaat at gcgact aa ggt cccaat t t t aaaaaat a ccggcccaaa ct ct t ccact t ccgat ccaa 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 0 bidopsis thal i ana at t t ccggt t t t gt ct ct g cgggtgggaa t acaaggact at cat agct g t ct t t gat cg gat at ggaca t gcagccat t t t gcat acaa gat t t caccc acct gcaaca gt t t aat cct caaagcgagt at ggat at gg gat ccggt at ccaat t t agt cgagagat ca aaat cacat t tgggaaagag gt ggaact cc at t t gat t cg caggctgggt caaacct gat gcgtgaagga Page 12E gaagaaacct acgct t gct a t t t cact gct ct t gt cgat a atgaggaaga aaaggagaag cat gt aact a cact ggt t gt ct t t cct gct gcat t ct ccc acat cct gag t gt at gcaaa gggacaaagt t cgcat t ggc t ggt t gcagt t t acgaagat 120 180 240 300 360 420 480 12689250 Sequence Listing. tcgcct tgaacactat tct t gtat ggagcatgtg tttggtatac gcct ttgcagagca at caagt gct t ggggagt gg t at gt t gct t gact t t act g t t cggagt t a gcaaagt t ca at at t t t at c t gcat gat t t agt aacaagc gggcccat t a ccct aaggaa t gt t cct gat <210> 115 ggt t acct gg at gt gt gct a gct gcagat a gcagat at gt agt gact t gg gatggagaga gat gaagaac aaact ggaga cct gt aaaca at aaacagca agat t ggaga actt cggcaa cgcgt agct c at aaagccag ct ct act gaa agt t act act at gcagt aac gcgt gcaaaa at aat aagcc gt ct t gt aga at caat t t gt t aagt t t aca t t at at at ga t act gct t t t gct ccagaga aagcgaaaaa agat at at t c agcgt gcct c agaaacaaag t ggt t ct t gg ggct cat gag gat gaacgat agt aggat aa acat aaat ct t gaact aggg aat aaat at g ttaagagagg aacgagagtt cat acaat at ccacgg ccagaaca agcaagct t t t gcttt gat t ct gt agaat caa t at at aca gat ggt cc agcccacc t cact t t t ccgt aat c txt gg ttgatcttta ac cttacgagcc ga acacaaacat ict t aggt cact a tg taacggtgaa ga tggaaacaga ca tcaatcagga iat t ct gat agaa tac aagt t t t aat t t cgcaagt at g t a aacccatt aa tc cacagtgaaa at tttctcttgt 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1290 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 115 cgaaccact c gcgaaagaga t t ggt cggt t aacgacat gg cct ccgacga agaagagagt t ct aaagct t gat ggt t cag gaagat gat g caagat ct gg acaacaat cc agat t t gaga caaaat t at g ggt t gt gt t a aagcat ct t t tttttatagc tcaccggaga gaagcgcgt c ttccacggcg t gt gaat cat gtt cacaagc aacgcat cgc t gt t gact aa at t ggcgt ag aaacggtgcg at ct ggat t t at gaacct ag at t t t t t at a aat ggagaat cggaatgggc t t t ct t t t t t at aaagat ga aacaacat ca t ccggct aaa gagat cgt cg ctcgacggcg agcgatgcag t t cgt ggat c t t cggct t t g at t gt ct t ct t t t t gat cga ggt aaggat t agt cact gaa t t agt at t t a t ggat t ggcc ct t gt t agt g cactgt t t t t t at t t t t ct g gaact cgt cg acagcgacgt t t at ccat ac caagct gct g agaact ggt c t t gagt ggct gt gat gat gt t t t gt aacaa aaggagatgg gaagat ggat gccatggcga tttttttttg at t agt gaga gt ggagat ga gt t t t at t gt gct t agat t a ccgaagct aa ct t caccaaa aagggagat c gcat agcggt t aact ct t t t cacggagagg acgacat gaa cggaagaaga gt t t agt aag gat gat gat t t t t ct gaaat ttttttttgg agctaggaag cgat ct t cac t t t t aagt cc cagat aaat t at gct cgaaa cact t t gt gg gt cgt ggat c t gat t ct t ca gccaccggcg aacagct gaa at cgaaagag aacgat t gaa agt t at agga gt ggt ggt ga ct gaaat t t t aaagtggagt t aacagat ag t t t t at t gaa at ggaaaat g act t gt cagt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tttttagatt tttttctttt tattttcaat tctataccac aaatatatac tattgttgtt Page 126 1020 12689250 Sequence Listing.txt t t ggt aaaat acat t gaat t acgaaat t ag ttttacacga aact accaaa gggccaagct gagaaaaaag gt cgt t t agc gacct t t t t a act t gt aagc ct t t cgat gt aagt ct t cga at t t ct gaga tgtgtgaaag agt t gcagt t t cat agaat t ttaagagaag atcaaaatct taaacacata aatctcatct gttatttttt t agat gat t a t aaaaat ggc gct cgt t gat t t t at t gt t g ggaact caac ct acgat t gt ttcagaggac at t cgct cgt ttct t ct t ct t t cgacct ag t ct t t gaacc t t t t ggt t at t ct aat aagc gt gat t t gat aat cat t cat tcagaaacaa cgt agt gat t at aaaaaat g ct ct gacaac gaagat agat aaacaggt ac gtgt t t atta gacaat ccgg aat t cgt t t t t ct t cct t ca tcgt t gt t t t t gt t aaagt a agct t t gt t t t at gt t acca at at cgaaag t ct t t gattt aat gat t at c cat acgggt c t gct t ggt t t at aact gaaa gagat t t t t g tt att gcaac taggcgagag gat t t t cgt g ttctct t t gt gagat t t ct g gat t agct t g t agggat ct t agat t aat at gat cgat t t t gt cacagct g ttgt t t t gt a acaagt ccca t aact t at t a t gggcct aac aat t at t ct g cat t t ct ct g agagagat t a tttttgcaac t t gat t gt ct at t ct act t a t ct t gt aat a at gat t t ggg aaaagtgggc t t aaggt ct t t gt gt gt ggg t ct at accac gat t t gt t ac ct t gt ggaca aaagcgcgca gggct t t aat at t cagcgt c ct gt t t ct t c at caat cggc gaat t t aaca t ggt gt t t aa tgtggaagga acgcgt t t t c at t caat t at at at gagt t t ct t t gat ct g t gt gagt t t c 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 116 2000 DNA Arabidopsis thal i ana <400> 116 aaggt t cct t t ccacct at c gggt t at t ct gt caccaagg gt t t acct ga gaagaggt t g ct t gt t acct aaat agcaat gat cct aat t t t cagat t cg t gccccat ct ggccgt t ggt gagagagcga t ct t t acaac acct t ct agt t cct caacct gtgaaacagg at caaagct t aat gct acat gaagat at cg t gct cagat g ttaacggcaa t cgt caact c gt cacagaag gat gcggt t t gat cacat at t act t caaac gaat caaat t t ctatgggcg t t t t cact ac at t t gat t at acat agtt ag t ggaggt aag agatgtgt t t gt at aat gt a agagcaccac aggaggaagt ct gaaat cac at act gat at act t accaca aaaat t ct cc gt t gaaccaa t ct t t t t act t at at ggt t a t gt cgcaaga aact gagaga t ggt at gt t g ct ggagt ct t aacgacgt gc ct cct cat t g cacggt ct ca acat ct ct t g at agt at t at aggacaagag gagt t gt gaa aaggt at agt t t gaaacgt a aagct cggt t ct at aat gt a t gacaggt ga cagagacat t accagcacag agcggtgaag gaggaagcat ct at agcaaa at aaagt at g t ct at cgagt ct ggaat ct t t act t act cc cagat gat ga gt t cggt t t t t cat cat ccg at cagaagat cacgggagag cct cgt cccc t cacgagct t ccat cgaat a gt agct gt t a 120 180 240 300 360 420 480 540 600 660 720 780 Page 127 12689250 Sequence Listing.txt taatagtgac ttctctgtaa tcttttgttt ttttgtttca tcaactaaaa t aat aacaat t t gt t t agt a cact acgcaa gat agat agc aggact t gaa cggt cgcat g act t gaat at at ggct ggt t at ggacgt ag t t t ct ct at a t t t t gt t t gt at t gt t t gct at t t t aaacc t t at t ccaac cacaaat caa ggt cacaaat tacaaaggca cct ct t cct c t t t caat cga t ct ct cat t g gt t acaaaca ct t aaaaagt acaaccat gg tcaacagacg aaccgt ggcg gt gcaaacgt t gat ct acca cgt t aat gt a aacct cact t aaacagagca at ct t t at t t t gaat t caaa at aaaaaaaa t ggaaggat a t aagt t t t cc gt aaaaat t g aaat t acccc gaggaact t c aacct ct gt g ggat cagaca ct caat caca taggaggagg at cagacgac at gaagat ag t t gggcacct cat gaagcac tgaaacaaag gat at t caac cacat t ct aa aagcaggct a gagat cat gt t t t agt ct gt agt caat t gt gagcaaaaaa aaat t t t gt t gt at aaacaa t gagaaaat c aat t ccaat c agat cgct t t tcgagaaaag aaaaagt gt a gat gat ccac agaagat gga ct aagat caa ccat t gaaaa agagt cct ct aagt cact t c t at gcat t gc cat t t t t t t t ct t ct t t t t t t t t at aaaaa aaat acaaaa aaaagaat aa t ct t t agcag ggt t t t aaca t ggggcct t g t gt gagat ct t gat t t t t gt t aaagt aaaa cat ct t at t t aacagt caag acgagacat g ct acat ggat ggt agt t aat aat aaagt t t t ggagaaat t act gcagcag ct t ccaagt a t t t cgt at at at gaaaccat caccaaaggc t ct ggcgcaa aaaat caaaa aaaaaagaaa acacgacgat ct gcacaaca aat t ct cgt c ttaccaaaaa t t gt gaaaat tgaaaacaga gcaaacaaga cagacaaaag cct aggat ca t at at acaaa t t t t t t t gt a t gcgt gggca ct gat gagag gat aaat t ct acaat t gt t c aat ct t t act caat t aagat aaat gact t a at gt t at t t t aat at at at a cct ccat cat t ct t gt ct at gcaggat at c 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 117 <211> 2000 <212> DNA <213> Arabi dopsi s thai i ana <400> 117 t caccaacca gagaaat cct t gaacagt ca gaagcagaat acgaagcact tctcgcagga cagatcaaag ct t t cagt ga ct ct caact c gctaagaacg aaagaatgcg ggctcatctt atggcattgt agattggttc cgatcttcag gttgctgtaa aagagcacgg gaagagggag aaagtcgccc gtactacaaa gaaagcccaa aaggagcaag aggacctcgt cgggccaaag gaggagatag t gggcat cct tatgcagaga ggccttaagg agttaaccaa aagggcgacc ct t cggct cc ct t cgact cg gt agt cagt c t caat ggt cc aaagagat ca t gcaacaggt gct cgt ct t g gt ggat gct g ggt gcgaaga gacgaagt t a Page 12E agt t t aaggc ct aaggggct ggt t cagcgg acgt ccaagg gcgat ct t ca t gcggaat ga at agagt gaa agaat caagc t t gcggcat c at gcct t gaa at cgaat aac cggagccaaa ggaat t cgaa act t cccaga gat cgcat t g ccgagct gct ggct t at ct c cagaggcgct t gaact cagc cgt t at cgag 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt ctcggggacg atgatctgaa catgtcccca gatcagctcg gtttttcgag gcaaacctcc caagt t gct c at t cgacgt g ct at t t t gaa ccgt gaaccc gt cgt at gag t t t t t t aaat t t t gt t gcag gaggcgatct aggat aat gg ct cccat cga t t agt gct gg acaggt t acc cct cgat t t c accgaccgt c act gaagt ca gat gt t t agc aagcaaagca gt t at t aacg ct cat gct t t cct cat cgt t ct ct t t acga ct at t gat t t aagcgt t t cc cagtggcgga t aagcgt t ga at at t gat cg act t t cgggg cgt ccgt t ct agat agggt a at cgat acag t ccgagct cg ggaaggtcag agaggaaaga ccct t ccggt tccggggcga aggaaggcga cgagat ggat agct at agcg aaagcacgt g cgt gt t gt ga t ggt t aacgt cat t at cgaa ttaccggagc acgt t gt at c caaact ct gt t cggaat caa ccagcacggc agaaacgaac t ggacccaat cggt t t t t aa cggggaggaa aagat at gt c gcct agt ccg t cgt cgt acc gaaat cct t c cagcct tt ag ggcgaactgg t t cct gt cat t cagct ct t c ccgact cgt c t t caggggt t t t gt gagct g gctggcgagc ttact t t ct t t aat ct ggt t t ct cgt cgag aagt gt gt t a t t at t cat t t t caaat gt cg cagacgacct tttcggggcg t t t t t acttc aaggat gt gg gat cccgat t gat t t t cccg tggaaagacg at t agct t t t acggagaggc t t t gaagt t t at cct t cgt a ct t t at ct ct gat gt t gacc aat ct ggcga gcgagctgga t ggcaat gca tacct t t t t g gt aat ct ct c t ct ct t gt ag at cggagt ac ct ggt t t acg acct cgt t ag gaat aggt cg at ct caaaac tacgtcaggg acggagt t t c agat t ct cat t cgagt t cga t cacgaggt c aaacat gcca ggct gcgt ca ggat cct t at gaccaagcaa cct t ct agct t ggat aat t g gct ggaaat a t at gt gaagt aat ggacgt t t at aagagca t ct agagact gt gagaagt t t ctat t t t ca t t aacact gg cggaagtgat t ct t t gt aat t ct t at t t aa t cgacct cgc ct ggt agat t agct t t t t ct aggagt t t t a gcgcgt caac cct cccccag aat cct cccg t gagct t at g aagat gacga cgct t ccgat t t at aggaaa t gaagt cagg cagggatgt t aacggcgat c acat gt t at g t ct agat t t c gat at cgcgt t ct ct ct gt t t at cagagcc 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 118 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 118 acctaatgcc cctacctctt cccaaaacca tctttgttgg tttcatttat gaatacacta ccttaaattc ataattttac aatatttata gatatatgaa tataatttta at t agtttat catcaattta tttttaat t g accaaatagc actagttggc gagaaaaaaa aatacaaaga at aact aggc cat at t at t g t t agt at t ac gcccaaaat a t t t agt gat t agat gaacaa Page 12 c gt t t t t t t t t aaaaataaca gt t t t t t gca cacagaaagg ct t at t at t a gtgaaacaaa gt cccat t t t caaatgccct ct aat cact a gcatcaaaag gt aat t t aat ctt gagaaat 120 180 240 300 360 12689250 Sequence Listing.txt atccgtggtc aaagaagata gacgagtggg ggtggcaata gagtaaatag gaagggt t t g agagtgggac ct ct caccct aat t t aaat a t t acct at t c ttagaaacac cat aat cgt t agat at aaag t at t at t at g agat at at t a acaaat gcgt at t caat t at gt t t gt caac gggggt gat a gt t t t gggat t gct t t aaaa t gt act t at a aat t acct t t at t t gaat ac t gaat gat aa cat at at ct t t at aaagt t a gacaact aag aagct gt t ct ct t ct cat t a at t t t t t ct g at ct gcct t t t t t gtaagga ccat aaacgc at aaaccact aat t at t at t ccaacacat a tt at acgaac aggat ct t t g at t ggat t t a gat gt acaat t gcct aat gg aat ggaggat t t cgt at t at aat t acaaac t accat t t ct cagcagccat t at gcat agc gat gt aagat gaact t t t ga t t t t aat t t t cacacat at g att cacaacc aaggaaaaaa cgcgt gt agt t ggacacgct t at t cat t t c agagat aat t ct caat t agc at at gt t aat gt agct gagg t ct t t ct cac at t at t gt gg t at t ct act c ct gt t acat a ggt t t ct caa t at acat aaa gt at gggaat ggt at t at t t t t t gt t gct t t att gacaac at ct gcacaa t at t caaaat at gt cct ct t t t t t t t t ct a at t aggt t t a at t t ggagat t t gaaat ggt t gat gagaat at agt at t ga aaaaaaaaaa tcacaaacca gaagcaaatt cat ct t t ct a t aacaaat t t aatggaaggg t t t t ggtaag gcgcct ct t t t cgaccact t cacct t aaga t t t gcaaggt t gt t t aat ag t acaaat t t a aat gt aaagc t aat gt t t aa ggat act gt t t t ggct t aat t aacct t t ga at t gaaat ag caaat t at at t t t t t tat aa aat t t t aat c t gt aat cat a at aggaaat g aaaaaagaaa t ccat t gat t aaaagct aaa gaagccgaga t aat cgt gt a at t t at ct t t ct t ct t cttc cgaagaaaaa gat cgt cgt g cacat t t at t ct t t caaaag at at ct at t t cgt at caat a t ct t t t acct aaat t agcgt t t act caaag cgt at gt gat cggcaaagt a t at t t agt gt t gt gact t ca t cct cagt gt at gt t t at aa agt aat t t aa t ct ggcct t t cat cacacag aaaaat gt aa agaat acaga aacat at caa caaaaaacaa gt cggt t aag t aaaact at c ccat t t ccga ttctgt t t ct aaaggaacac gatgagacag ggt t cct cct at t acacct a t t gt t gt cat cct t t t gt ca t gt t t at cga ct cgt t aat t t t caat at ca ccaaagggcg cat gaaagcg gct t t t t t t a t aat cat t t t t t t t t gtgt a aagaagt aga agt t at ct aa t gct t t caat aat t t t gaca aat at cacat agct t ct t at at t t at gt ga ggaaagt aat aaaaggaaga aaaccgt ct t ct t ct t ccac gccgt t gaga gaaccaccaa 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 119 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 119 agtctcaacc tcccagaacc tctgtct t ac aaatccatgt gagggaagtg gaccggtgt t actgtactgc acttccataa aaaacatttc attaacaaaa ctataccctt caacaacaaa gaaaagtttt tttttctata aacacatact tgcggaaaca atactggaat tcctccacga Page 130 120 180 12689250 Sequence Listing.txt atcggtgt t g gaggctggaa tatagcctgc acaatacgta acacatgttt t gct t t ggaa aagct at act aaagaagat g agcaaaacaa ccaat cagag gact gaagt t act gact t ga caat at ccaa ct aaaccaaa at ct t gt caa gct act act g cccat cat ga t aaat at ggc gaaact t aga aat t t gt cca gccaat at t t aaat at t t t a agagaat aaa at aact aagg t gagagat t t agt agaaaat t t aaaat at a t t t cat ggct at ggaaaaac aaat gact ct aagt gaaaga ct caaaagt g gagacct cca taaaaacaaa ttttagcaga ct cat acaca t at ct at t ca t aaagat t ca aaaaaaaaac aacaat at t t acct t gct ac cct ccat aca aagat gt t t t ccgt aagagt gaccat t gac caaccaccag act t t gct t g aaaggaaaga aat ccaat ca ct gat t t gct act t t ct t gt t t at at at ca at t t at ggca aggat t aaag gagt cagt t t t agt t t gt ga aaccaat aga t agact t aag at agt t gaga cgt agaat ct acct t ggt gg gcccat t t aa aaaaact agg accact ct cg cat t gaagaa cacat aaaat tttacaagcc t ct t t gcat a at t at cgcag gt agcaat ca t cat aacaag agt aaacct g gaagccaagt gagt acct ca gcct t t agt c agcccagaag t t gaaaat t a aat t aaaaaa t t t at at t t a caaaagt agt t aaagagt cc gcaacaat aa aggaaat t ag at t at ggat g t t aggat at g t t t ggtgaca aaaccaattt agt t acaat a gt t gt gat ac gat aaat t t t gt t gt t gct a gacct at aag gt t ct t aact at cgt cact t t act act act t ct aat gaag at t t act t ag at acagccaa t at gat cct a t aact cct ct aat aaacaaa t t t t t gatgt gcagatcggc ct ct cat t gc aagat cacaa t t gaaaat t c agt ccacgag gact t gacca cat t acgcca at t caact at act t t act t c at act t aaaa at t t t gt gga t ggt ggaat t t aaagat gac act at gt gag t cat gt ggt g t ct aat aat c gt t gat ct t t t t cact ct ag t ccact at gg tct t t t t gag tttccagaaa ttagaacaaa t aaagat t ca t caact at t g tcagagaaca aaaggagat a ccat t ct cat gaaacaaaat aaat gacaca cacgacggt c gacgat gct c at ccgagat t t t t ct ct ct c t gagcct aaa at gaaagat t cat ct ct t ga at t t agat t t caat t aagt t t aaaat ggt t at t t aggaaa ggt agat gt t at gagt acag ttct t t t act aat ggt gcga t cat cat ct c gct t t t t t gg gcct at at at gt aaaacagt ct t t cgt t ca t ct ct cact a t agat cact c gcagat t at t aaacaaaaaa cacat acaaa at at t t t t ag gagt gaat ct ttttccaaga ct gcat cat g at caat agat gcggat gat g gat ggccgt g t cgct t at t c t t t t at t t t a aacaggaaat ggaaagtct t cct t at agac t t aaacat ga t agct aaact t t t t gat at c t t t t agattt ttgt t t t aaa t at aat t t t g t caat gt t ga t gcat agt t t at gt gt t agg tttttcaaag at t cat t act ct t t t act ca t cat cgt at a t cgccat cga 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 120 2000 DNA Arabidopsis thal i ana Page 131 12689250 Sequence Listing.txt <400> 120 gcaagact aa ccat agt ggt gat aat caaa cgt gt aat aa agacact t gt t t caagt at a aaat ggcaga t gt gggt t t c t agat at t t t gat caaaagc gt cgaaat ca gt t gt aat at gaaaggct aa t ct t cat ct c t gagagct at at caaggat c t gt gt cacat aaat gt at ct t t t t t aat gt at ggat caat agt agt gaca taaggcgaac t aat t t cct c at cat t t gag gt ccat ct t c cct ct aat t c t agat agaaa t t t at aaacc ct t gagt gcg cagat t aagt t at t t at t t a at t t t ggaaa t aaaacct cc at t gat cagg cat ct t t gt a aaat at t at a gaaat gat t c cct ggat agt at t ct t t t ga gaact agt t t aact gcat t t t t cgt ggt t t t agct gt t t t aaagcaagaa t gt t acaat a at t t act t gg ttctttctcc t gt t gat aaa t t aat ct at a aaat ct gaga ccct ct at ct actttttttt tgaccccaaa caaaggaact at t t t ccat g t at aat at ca ct gt aat at a t tct at gcat t t at ct t aaa accgt t at t a acgagt t at g act ct at at c t aat t t agt c t t t acacat t tt agt agcaa acat acacca t gagacgt t t act aat t gt g agcat t gt gg t t acat gat g t gaat cgt aa caat gact gt t at gagaagc at at t t caat gt aggat aaa t gaat gt cga aaaat t ggt g gt caacaat t aaact t t t ga t ct t t cgaag gtt agcaaga tttttcaaaa t cat t t at aa ct at t gccca gt at aaagt a aaaagttttt t aaat aaacc t gaagt gt ct t t cacat aat ttctgt t t t a cgt t t t cgcg acaacatttt aacat t aaac at t gacaat a aaccat t gca t aaat t cact t tct gat aat at t t aat t aa gt cat caaat ggat t ccat a gcaaact aaa acaagt t aca accagt t aaa ggat t gcaaa cccaact gga gt t gt t at ct ccat ccaaac cct t aat aga t gcagt caaa t ggt ct gaaa t t t gt aat gt ct gcaggt t t gtat t gt t t t t aat t agt t t gaaaaaat ga ccaccaaaga t at gt agat g t at t ccat aa agat caaagc at gaat t gat t ct t t cat ct t gat agaaac acaaat t at a cct agt t gt c t t aat t t aag gaat t aaaag aaaaaaat gc at t aaaat t a aat t t at t t t t t agaaat t t t aat t at t t t ttacagagac cgt t aat gt t at ccgaagag aggcgat ct c t caat t t tct cgt t gagct c caact t at gt t t ggt ccat t at t aat aaga at ggt tct t g t at at aacca at aaat gt aa t t t at gaaaa at t gagat ag gt gact ct ct at t ggaat ac agcagaacaa acct at acac ct aaagt aag gaaaaaggaa ccaat at aaa ct at aaacag at aat at t t t at t at gt t ag t ggt agt t gc cagt ccat t t t at t at gct c t ct t acat gg aaat cat caa t ct at cacaa ct at aat t ag gcat gaat aa t gcct aaat g aacaaact aa t cct t t t gct aaaaagaat a aat t caact a t gt cct t t gc gt at aagat c cat ct t gt cg t aggt at agt at t t t at t t a agcaaaggat t gt t cagt ct ct t gat gat a tttaagaaaa gt aagt agct ccaact aaaa ct cat gat ga ct at at t t at agt aaat gac at t t ctcttt aagctt gtt t aaaaaaacac t agagat cga t gacat ccaa aat t gt ccac t t aat ct t at caact acct a aaagact aac aaaat gt agg tcaaaagaga aaat t t t aga t aat t aact a caaat at aag aaccaaact a agt t ggt ggt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 132 12689250 Sequence Listing.txt gatagagtga gagagaaaca 2000 <210> <211> <212> <213> 121 2000 DNA Arabidopsis thal i ana <400> 121 acaaacact g ttaaaaacca t ccggt ggt t gt t at aaagt aacaggct ag at gt t ct cgt t gt cggccat t t t t gat at a at at aat t t a t ct ct ct cga ct t gcat t aa t t t t gagttt gt t gat ct ca gt t t gcat t t aaaaaaaaga t gt agat ct t t t t t agacat at caat t at t cct aacaaaa t aat t t gct c t t aat ct cga tgagcggagt cgt ggaccat gat ct cagt g aaaat aagaa t t t gaaaat a at at gct t t c gagacct gaa cat t t t gaaa ccat gt at t t t agct t t at g t gt at cggt t ggcct t aat g acat gat cat agt aat ccca at t gt t cgaa aaat aagt ag tgaggaagaa t t ct cgaat c aaat ct at ac gggt t t caac tttttttttt gat gt acat a aaaggt t ct t gagt cacat g t cct t ct t at cat cgat gga t ccaat cat t t gct aat t ga at t gt cacat at t aat t at a t t cct gacat t at at t aat a aaagaaaaaa cgt t t gt gt t t t gaaat t t c t ct t ggaact agcacat cac gat at t ggt t aact t caat t ct gt ct aact accgt t gaga cgct t gt t ca at aaccaaac t t t aaat gag agt ggat ct g gacgacaaaa t t t cccct cg at t cat t t t g gaagt aact t tttttatcca aagaaaaaca t gcgt gat t t gat gt act ac aaaat agt at ct ccaat t aa cat cgat at a ct at t ct t cg t aat t t t cat t cgt t t gt ct gt t t at t gct cgt acaat cc t cgt at gaga t acgt t at at gaccgggat a aagt t at t t g t at t t caacc t cggt at aaa t at t ct t aca t gat t gaaaa at t gagat t t tgtatct t t t at act cgt at tttttttttg at ct t t t gaa aacact t t t a t t act t at t t act gact ct t cgt caggact agct gct gcg aacaaaccag t aaaagagat t ggact t t t t gcatgggacc t t t caaat t t t cact gt cct at t t cgat at at t t t t gat c ttgtct t t gt t ct t t caat c cact t t agct gct t t t gtgg acacat t t ac t t at at gcct ggggt gt cac caat t aat ga cct aaccaac gat t t t ggt t t ggt acaat a agacat gat c t aat gaat ga t aat at aagc gt at aat aaa gaat aat aat gaggaagaag t t t t t t t gt c t gaagaaaat t ggt act aaa t ggaat cat a gt aacaaat c caaatttttt gaaaaggtt c t t t accaat t gt t t ccat at tgt t t t gtgt ctat t t t ct t agt cggt gcg ct t aaat cca at gt ccgt cc t t t at t t ct g t ccaaat t gt gat t t acat c t gt ccaagaa t cat gggcca gagt cccat a cat agct t at cagt t t ct at t gct gt aat c t aat t gt at t aat t t ct t aa t t t gcggtca cgct gaat ac t t t agat aat at at ggat ga aact t ccct t t t t t aagaat aaacgat at t t cgt ct t t t g gt t t t caaag at at aat at t t t cgt gcact aact act acc t t t t gtgtaa t aaat aat gc gt t accaagt t caat t t t t g acaatgggt t aaat cccgt a t ct t caaagg gact aat t t t t ccaat t gt t aaaagaaaga at at t gggt c ct aacgct ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 tgttactatt gtaccgaacc agttttgaag cccattaaaa gcccaatacg aaagtacgtt Page 133 t gt aaat t t g at aat t t aca cgt cagacaa gct cat cgac 12689250 Sequence Listing.txt ggagcccaag aggggtaaat atggaataaa aaatacgcga tgtggacaaa gctactatct ctcctccgtc ttggcttcgt ggtggtctct ctgacattat ataatttcta aatatctctc ctctcctctc gtttgattcc aaaaaaaaaa gaacgaacga 1860 1920 1980 2000 <210> <211> 0 <212> 1- <213> 122 2000 DNA Arabi dopsi s t hal i ana <400> 122 ct ccgaat t c gt t cgagct t t gcgt t t ct t agaagacgat aacagaggat at caacggag gaaggcggcg gt t ggt gt t g agt t t ggcat at aaaagaaa gat at t t t aa at cgaaacag aaact t t gga agagcaatgg attttttttt at t acct t t t t t aaaaact t agcat aat t a tctctat t t t t ct t t t cttt ttggt t t gt a ct t t t t acaa aaat gaaat c aaaat accaa at t t acagt c ct at cagaat t ct agagcgt ggt ggt t gct gacgggt t ct gat gggacgt t t gt t ggaag acgct acagt acgt gt at gt ttgtctcttt gaagacattt aacacaaacc t gctct gt t t at t aaacccc t aat agat aa agaaaaaagt agat t t gat g ggagaaagaa gaaaaaaggt t ggcat t ct c t t t t t cacat gact cgaat g t t caat cct t caaat t cat t t acat aagat at gaagt t cc t t t gact cat caaaat aaat t t gat cccat gcgt t gt cct cgggt t gacc ct t gccgaga aacat t ccgg t t t cgagat g t t cggacat t gggt gcat ct ct acct caga ct at t t aat c t ct t at t t t t at t t gat gac gaggaat t cg ttgaagaaaa t t at t aat at at aat ccaaa tcatagggga t aaat t aaag t t act gat aa gat t t t t t t a t gggat t t t a aat t at at gt t aaccat gca at t gat t cat gt aat t agt t t t agat t ct t agaactt gt t cgcct cccat t gaat t cgga t t cct ct gt t aacagct t t g at ct ct t aca gt t gt t attg ctcgtgagga ggacgccat t ggat ct t t ct t gacccct t c agagtttttt aagagacaga aat aaaagt c cat ccaccaa t at agat t ag gaagagagac t t gact t at t gt gat aacat aaat t aat t a gt gt t at gt c aacagaat ct t at aat gct a t gt act at at aaaat at aat t gact aat ct ct gct at cga at ct gaacga t ccgagt cgg ttcgagagcc at ggaat cat aaaacat t ga ttgt t ct t ct cgact caaga t ccat caaca t t t atgtct t tggggaagaa caaacacaaa acaaaggaaa ttgctgt t t t t at t aat t t a t cgat gat t c agt gt caaac aaat t gat ac t agaagcact aaaccat t gg aaagat t t t g gt gcat at cc t t t ct agaaa at ggt aact t at ggt agct a gacgggaaag t t agggat t g gagaagaagc at cct aaaga gat gat ct ga caacaacat c gat t ct t ct c t ct t gt t gt t cgcaat t aaa accaaat caa tttttttttt gaacat gagc caat acaat g acgaagagac t ct ct at t ca ggt aaat aat at gt aacact aaagagggag at gaat acac t t t acat t gt at at gct t ca agt t at ct aa cat gat t at g aat t act at a aaagt aat aa tttttatagc gtt aaaaaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 134 aacagat aga gaact t aact aat t cacct a aaaaaaaagt aaacat agca gcccat at ga agt ccgagt a agaagat ccg gagt t t t ggc aat t t aat t c aat t t t agca agt gctt ct t gat agcgtt c gggcgataca gt cactt aac aagacgaaac 12689250 Sequence gt aaat t aca ccggagt caa tatccaaaag at atgtgtat tttggtagtt tttaactttc tttttactgt ctaaacttac cat at cattt atgtaaccca ctgaaatggc aatgtatgt g cggagatcgg agactcatcg Li st i ng. t xt act t t gt t cc ct t gct t aag t gtt aat gt t t aacat t t t c aggt ct aggc gggaaaacat ttgcaacaac aat t t aaaac t at agaagat aaaaaaaaaa caaaacaact ccaagcccaa tgtgaagct a aacaaat t t c 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 123 2000 DNA Arabidopsis thal i ana <400> 123 at ct at agag ggt t ct gaaa gt t ggact aa t gact at act cagggaacca tgcagagccg at cagat gga tccgcgaaac at gcacat aa gcgct acaag agaggagtga aacct agt ga gt gaat ct ac ggcgcct cct gt aact at at cat cat caag tt agct gt gt att gctt gct cat ggcct t a t ct cagct ct gat t t t t t ct t at caat caa t cagt gagac cat aacgt ct atctgt t t t t caggt t t caa ct gt at gcaa gcgatcacgg ctgtgggaac gt gagaacac t aat aaggac aagcggcaaa ggagacattt gcagaggaag ct cacaacac gact at t gaa at aaat at at tgggaaagga act ggagaaa t t t t aaggt a caaaacacat cat at aat ct ct cat t at t a t act t gat t t gggaact t ca ggcgagtaaa gt at gt ct t a gat ccat t gg aat t ccggct t gcat acact at at gagcaa acaccccact t ct t t aat gg gaagagagag ccacgat gac t at gt t gaga cct ggcgcct t t t t t acggt at at t t gat t t at at cat aa gaaaggtgac t t act gat t c ggcct t t ggt cgcctgagag aat ct ct cgg at t ct t t t ct ggccct gcat aggcat cat t caat t cct gt t gat gt gt at gaggtcaccg ggagccgcac ccaagaagca ct gt t t gt ac caggggatag at gagat act at aacagt ga ggaccagccg tgcaccacgc gt cct t aaaa at t at t at ca t caagt ggt t aagaat t ct t agact agaga at gcagt t gt t t t t caact t aaat gcaaat tttttgaaaa ccggat cat c caggt at t t a t gt ct t t ct g t t gaaaaggt ttcagcaagc gat cagt t ca gt agacat ag t act agact a caaagcggct cagacct gaa ttgt t gtct t t gt cagt gag ccact caagc tt ct t t ccaa t t gact ggga agt t t t gttt ct t ct t cttc aact cagct t gt t ct gaacc t t ct t aaaat agcaaagagt aaaat aaaga cagat at t gt at aaact aga agt ggct t t a cagagt t caa catt act gag t t at at gt gc t ccagaat ca t at at aaaga ggt gaaagt a aaagat agac ct t t gat aca aggagcgggt t gct gct gct gat cctt cct aaggat at at gtt ccct t t g at t aat t t t c att ct cagac aat caat aag gaaat t t t t a t t aacacat a aaat t at t aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 attttgataa atactatttt gtattaatta ctaattagta aacttatgat tttctaaaag Page 135 accat aat t t t t t ct t at aa t t ggat at gt gt gat aat at t gaatt gaaa gct cct t t ct acaat gt ct t t gaat t t gt t at t cat at t g cat cgct t t c t ct gat t t ct t ct gt aaaat ct t aggt gt t t t t t t t t ct c gggaaagtaa t cggt ggt at t cct ct acac t cccat t t ct gt t aggt t t t at agagct ct ct t t gagt ct gaagat ct aa 12689250 Sequence ctaaaaatag cgggacaaaa t t aaat aaaa t aaaaaaat c ttcaaaaatt ttaaagtata aaggct at aa at gt aat t ag gt at gt at ag tccaccaaca agat cttt gg attcagaaga ctacttgtcg attattccca at gct ct gag ggt aagt aat aaaattgggt aaaatggaac ctgattcttt tatttgttgc Li st i ng. t xt t aaat ct t ac at accaaat t aagat gagaa ct gaaagt t a t agat ct ct c aagaggtacg t ccat t t gt t cccgt t gat c gaatt aagt a t gacat gt t c caaact aggt aagt agaacg aat gat t aat t gaaact gat t ctt gct gt g t acact ct aa at ctt cgtt t t gat cgagaa gatt cacgt c gt t t gt t t gc 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 124 2000 DNA Arabidopsis thal i ana <400> 124 t gt ggcat ac ct t at ggat c aaaaaacgac t ccaggt t ga ttggaagagg at caccat ca acaaacct gt gt t ct cagac gat accaaca gaaact aaga ttgcat t t t g gctgagagga gagagt t t at gcct t t t gt a aaat at gat t aat at aat t t t aat at at at aat at at gat acaat t at t t agct aat agg tcaggaaaaa at cagct gat gcaaaat ccg gcaggagtca gaggccacct t gggt ct cat agaaggt cca gcaaaggggt t t ggcct ct t agat at ggcg gacct at ccg t at t at at gt t acact agat at gt t t t t t g t t t t at t t t a at t t at at ct at at t gat ac at t ccat at a acaggt aagt aat gt t gt t g acacaagccc aat agt at gt caaggaggtt tcaaacagag act t at gat g agat acagag act agt ggga at t gt t at t t at t gaggaag aggaggtat t t t cagt cact gaat act cat gt t gat t t t g t gat t agat t at t at t ct aa aat gt at gcg t t ct ct t gt c at ggt ggt cg gat acgagca agt t gcaaaa tccaaaagaa ggggt t t acc at agacagaa gagaacgtag agaagggaca gaaacggtgg aggt agt aag ct aat t cgt a ggcagatt aa t gt t t gaat a gagtattttt at gtt cattt t at ct at aac t at t t t at t g t t t t gaaact at caact aca t gaaagt aca t caagggt t t ccgat ct acc cacagggcag ggcgcaagag gcaaaat ct g tcgagaacaa ggggcagcaa t cat ggat t c agct gccat g ggcacat agc aaaaaaaaag t gat t at aac gt t t t t aaag t gat aaat t c t cgaaat at c t cat gat cac t gt ct aaat t caat cacat c agggacaaac act acagaac aataaggagg ggtcgacggt aat at gcacc cat t at gaat tcgaaagaga aggcagggt g act ggt gat a ct gaggat ac tt gggagat t t ct t t gt ct t t act gt agt a t t gctt aaaa ct at ct att t aat t ct agt a at at aaaaac aat t gt acaa at gat at gct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 136 12689250 Sequence Listing.txt agtaattcaa gtcacacttt aattcttttc acttgcatat agccatatag tttttttttt t at caat gt t at t at at t ca aggt t aggt a t t ct caaact agt t gaagt t at gt agt at t aat at ct agt gt t t gat t t g aagt cagaaa t ct gt gt aat aagcggatt a gct agat act ct gat ct gat gt at gt cgaa act aagt t at caaaacgt aa gcaaacat aa t cat aaat t a t t aat aaaat t t at aagaac t at at gt act t ct t t ct gga t t aat t aaat gat at at agt cgggt acgct ct t t t t cttc ct ccgt t acc t gt cgaat gt tagaacgcac t aat aagaaa gagt t t t aag gagt t t aaaa aaaat agt at t cgaat t ggt cgt at ct ct t t aagccagt a aaacaaaagg at aaaacgcg t ct t ct aaga aat t agggt t ct caaat gac caaaat gt ac at at ct aat t agt t t ggat t t gt t t t t t t a ct gat cat t t caat t t t caa gat at t t t t g gt aaccaaaa gcccat t aaa accgt ct aaa t aaat ct ct c t ct t ct cgct t at t at at t a agaaaaagt t aaat t t act a t t ggaat t gg t t t t t t gtgg tat t ggagaa aat gt t agt a gct t t aact t aagt caaaat t t aat t at t t ttct t ggttg cccgat t ct c t gcct t t t aa aat ct cat at t aaaat t gt a cat at gat aa aat t t at at g t ggt gt t t gt t cggcat aag aacat aaaag t gaaaaat aa agggat gt t a ct t gt t ggct t aat aaat at tttacgagaa tct t t t t ct c tt ct ct ccag 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 125 2000 DNA Arabidopsis thal i ana <400> 125 at ggcct cct t ct ccgt ct c t t gat at at c gat ct gagga gt gaat t cag gt ggt cat t g cgt ggt gat t acct t gt gaa t at t t gt gat agaacagct a t gaagt agag gt aaagt ct c t t t gt t t t ag gt at agt t gg cat t t t t aag cggt t ccggt ct ct gct t ct gat t gaaat t accatgggag ct aaggat t t gt cact ct t t t t ggt gaat c t t ct t at cat ct t t t aat ct t gggt act ag aaagt t ct ga at t ct t t at g ct t aggaat t t gt at aagt a at t t agat aa cgcaact gga ggt acgct t c t ct t gt gt gt at ct gcggaa agct gat t t g gggaggtaaa t gct t ct cct gt at gcaagt gat aat gat g act ct gt ccc agact t t gca t ggt t ct t gg ccaaat t gat t accat ct ct gt gggaat aa gat cgt t t t c t ct ct gt gat gt acat t cag gtggaagggc gt t aaagcaa gt ggcgt t gc cct aaacagg t t aagagct c at gat gacca t ggggaagt c gagt t t acct t gt t t gt ct a t cccgcat at ct gat t gagc agagcat gct t cgct ccct t t agt at ct t g at t ggaagat tt aat ccgcc gtggctggaa agt t cat gga t act ggacca at t gt t gttg t agcaaaggt aaggcagaaa t cat caat cc at t at ct aca gaaat ct cga t gt t act gcg t at at gat cg gcat cat ct c agct t t t cat gat at t ggt t t cat gat ct t t t ggcct gat gagt t gt gct aagt t act t c t gagat gcaa t gt t t ggt gc aaagt gat gg ctt ct cgcaa ct ct ggact c gt gaacgt t a gat t ct agga ct t ct agct g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 attatgttac at at ataaat gaggctcatt tcagcttgca gagtatagaa tccaaccttg Page 137 12689250 Sequence Listing.txt gtgtcactca aacattgttt tctgattcta ggtggcttgt tgatcgtatg gtcgaacttg ggt t t t caag agacat ggac ccct t cct aa t t act t t cct acagagat aa cggct t gaaa ct t ct t cgt a gt gagt ccca ct t t gct t ac aact at t t t g cct t t caaca cat aact t t a ccagt aagaa cact t t t at g cat aaaat ca t ct ct t t cgc t acct ct ct c at cat t at cc ct t t aacct t gaaccaat ga act gcagaga at t t t gt gat caat agct aa act ct ggcca act t ct t ct c aat t t cact c acgatgcgcg at gt acaaac t acacaagat gact agagac ggccgt aat a ct act t t aac t agct aaagg t gtt ccaaac gagt ggat t g gacggagct g ct at gat t at gact t ct t ac tgcagaaaag ccagagacag t t gggt acac cccacacgag ct gt t cagt a gt t t t t cccg gt t t aggagg ttgaaagaca gat t aaaaat t gt t gggt ct cct aacaaaa taaaaggcgg gcagcaat ct t ccaaat gt t at t at gaacc t ggt ct t t gc agt gaccgat aat gt agccg acagacaat c t agacgat gc t ct at at gca t t t gtgt t t c at ct aat aac at gact gct a ccaagat gct gt at at at at at t cagact a aacaacaaaa caagagat ct caact ct t ac t t cct cgaga tagagaaccc gggacaacga aagggaaggt caaagggact gct acgaaaa gaaat agcaa ct aat t t t gc at caat aaac cgt t agt acg caaaaact ag t gggt ct caa ttaaagcaaa gaaaccct ag ggagactcag aggt t t at t t t t t t gacttt gccaaaagag tacaaccaag t gcgact cat cct agagat t t ggcat t agt at ccaat cgc at t agcat at t t at cacat t aggcacaaca cact t t t gct t t aaat t at c t t ct gat ccg ct t ccct t t g 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 126 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 126 cttctttgga agcaacgtga agatcgtact gaagattagt cttcaacttt tccacatcag ttactatcat ctcagatcgt tcttgcaacg cccggtgtag tccttcactc aattcatcga cccattgact ggtcttagca agaatcacat caatccttat ctgccttgta cactctttca ctgcagtgag ctttaagagt gtaacagatt catcagcctt ctcccatcct aaaaaaagaa atctcaaaac aaacaaaaat gctgaaataa tttggcatgt tgaagaatca aagattcttt tttttcttta agatctttaa cttgaacttc cat at gt gt g t at gaat cca t agt t at ct g t cagat aact ct t cagct t t at ct agt gag t ct t t ccttc cct gcagct t acgcat cat c cgagt t gacg caaaagact a cat t t t acct agt gt t gat c aaacaat t t g aacat agt t t aat ct cct t c agacct ct cc t gct gcat t g aagct cggct ct t at cact c caaat gcgat t t t aagt gcc t aat cagcaa gaaacagctt t cagagt gag acct gat cct gtgt t t t t ac tcagccaact t cact aat cc t cagaagct g t t gat ct t at t ctt ctt t ca gct cgat cct aaagt t t ct g caaaaaaaac ctt cagcgac caagagtgag ccat t ct act act t ct cct g 120 180 240 300 360 420 480 540 600 660 720 Page 138 12689250 Sequence Listing.txt acacacaaat caaacacaca aaaaaagact tcaaatgaaa atttctatct gacagat t at ct caagagaa t t t ct cagt c t t t t t t t gct t t t aat aaaa aaaaggaaac t at at t t at t cgcagt gat t t ct act aaaa acct gat cat tcaaacaaac t at cggaaag tcgaagaaca aagaaggagc ccacaaagac t t aaat acat cggaat acga t t gact t at c at aaggccca t aaaat t t ga ct cccacaaa cacaaaact a gccaagt agg gt ggct t t gt ccaaaat aat aaacagacaa aaaat ct t cc at act gt at c cacat t t aat aaat aat t ca aat cat cat c t t t cgct t ga aaat t acat a t t cgt cgaga agt ggat gat acaacaacga tttaaaaaaa gggt gcagt t gt t t at ccac at t agt ggca t t t t gt t cat gaat cat cca t at aaaacag acaaagaaca ccgaagactt agt act ct gc agaact gat a gt aaat gt t a acct caacaa gat t t ct t t c aat cccat cg agct accaca caat t t caat ccgagact aa gaaagaagaa t t at gcgct t cat ct ct ct a t t aacaaat a at cct ct acg aaaagat t ca aat ccat cac t gt t gt cat t aggt t t ggt g aacaggagtt t ct ct t ccat agaat ccgt a gaagt t t t ga ct t t caat t t t gaccgt t t t caaaact at g caat gggcgc gt at cgcact cgaaaccat a t gat t ccaca gaaagt ggga gggct gt cgc tttcaaaaaa aat cgccat t agcaggt gca aaat ct t ccc agt ccact ag cgccgt aat c t aaaat t aaa gact cgat t c ggccaacct c gaagaaggaa aat t t gaact cat ccgacaa gt t gt t t ct t t aaccaaat t gagcgagt cc tagaacacac t ct t t t caag gact t gagag ggagcgt gt a t t t t t t t caa gt t t t at t t t t gt t gt t ct t t gt aat acca ggt t t t t aag ggt t t t t gag tgaagaagac cacagaaact gcacct gat t caacaact gg t accct ccat gaaaat aagc aacgt acaaa aat cccgct a t ct t t t t t t c t at t gaaaag t t cacat t aa acaat caaat t t t agct aac agcgcaaaga t gt gccagag ct t t gacgct t t t at t cat t ttttcaaaaa t ct gat ggat t t t gggct t c agact ggat a gaagcaaat c 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 127 <211> 2000 <212> DNA <213> Arabi dopsi s thai i ana <400> 127 agat t ct aca aacat ct t t g gct aaagt cg cacgagtgac tgcttgtgat ccttgggttg agaacttagg ttgtttcatg ggacctcaaa tgggagatag taacaatcca gatttgagga agagattagt gaagatggag aaggaagaga aaaggattaa agagagagca agatttaagg cagatatgat catcatttag ttagctgcag gaatcagtat tcgtgtcaat tttctatcac aat acgat ga t cgt t t t caa gt t gct t cat ct t ct gaggt t ggct gt at c aagcaaagt a ggaaagt gt t t ggggagt ga aagagagt cg agat t t agt c aaact cgaga t gaat ggt gt Page 13 t gt t gacacg t gt ggaaact ccggt ct at t act t ggt gag gaagat t cag cgt gaagat g at gagt t t ca t at t agt aga t t ct aaat ct gagat gaaga gcaat gt t t g ct gt t caaca at t agt gggg aaggaggt t c ct t t cat cag t t at t gt at a t t gt aat t t c gcgt t t aaag 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt aatctagtca aagtgtttcg tttcataaga gggagctctg cagtttatgc agtttaaggg gt ct t t ggt t t gcagccat t ggct acaaaa gaagaaat cc cacat ct gt g aacgct cat t gct ggagcat gt ct aagt t t aaat gt aat g cact t cgaag aact t cccca t t t t caagt t agt t t agt ca t caccaaagt t acagt act t t gact cat t t gat t ct ct ca gagaagtcag gggcaacaat ggt t cgagt c t cgt cat t ac agccgacgca ct t gagat gg at t t aact t a t t t t ct t t t g aat t cccgga t cct cat t gt gt t gt at gaa gcacaaggt t t act t aat gg cacct gaact ct t gat t caa t accat t aca tttttcctcc aat ct gcaaa t ggt gaaaac gaagt t t t at gt t t cat aag t t gaat t t t t t ct t acaaag t t at t cat t c agaat gat ga ccgt t gt agt ccggcaacgg ct t at t t gaa at t cgt gt gc t aaagt t t ca ct at t t acga aat gt t t gt c gaaaat gct g at gaagt t gt t ccacgcat t cgtgtct t t t aaggcct t t c agt caacaag t ct ggaacat cgagccacaa t cgaacaagt aacaaaaact aat ct cat ct at agt at t t g agt t t gt gca gt ct t ct at t gcaaaat ct t at t gt cct t g t t ct t aacat ct agct ggt c agtttttttt gaaaccct ag cact ct at t t t cct t t t gat t t ct at t cgc t acaat ggt t t gt acat agg acacagattt ctct t t t t ac ccacct ct ct t t gat ccaga aaagaat ct g ggt cat agt t t t gat ggt ca ct caagt t gt t t t ctt gttt agcagaaggt gt t aagggt c at t accaaat aaat act t t t gt cct t acca gt t gt t t t aa aggat act cg at aat t gt t a t gt ct t t cat gct cat ct cc ct at ggt t at at t aggact c cct t gcaaat t t ct t ggt ac caggt t t ccg at acat t ct t t caact t caa ccat at caaa cacaaagaca gaaggaaaaa aagccagct g at ct gct t at ct t t t acgca gaagaat ct a t t t ggt t t t t t t t at gaaat t t cct gt at t t at t cat caa t t aaat gt aa gct ct caccc accgt aagt c at aat t at ca t t cgat t t cg ct t cgt t cct at t gt t ct t a t acat t ct t t accaat t cga cagat aaagg gagagaacca acaagaagt a acacaat t ag at aact ct t t gtggaggaca caagcaaacc t t ct ct t ct c t at gt aaat g gt caaagt ga ct t t t t t t ca ggt t t gt t t a t ct gcgt aaa aggt agct ga t agcat gaga aagagacccg aggggtact t cacact acgc aacccct aat cgt t t t gggg 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 128 <211> 2000 <212> DNA <213> Arabidopsis tha <400> 128 tctttgttct gcattgtaca gatt ct gcaa caacctt at g atgtcagctc cattgcttat atgagctttt cctccatctc tctaactctt cttgactaga Il i ana tggatctcct ctcatgtgca tggccatata aggaacccca catattcggg tctaaggttc ccgcagatac atcatttaga tgct t ctgaa gcaacctcag agt t gaaagt gtccacagat tggcatacca cgaacagcct ctaaaactgg tagtagtct a aattcttgag gccattgggc gtgtggactg tgctccaata Page 140 120 180 240 300 12689250 Sequence Listing. )atcatt gagcgaaccc gagaaact t caat t at at gat t gaaat t t gagt t t t gt t ct t gt ccaa t gt at cgcga tcaaccaaag t t at cagaac ccat acct aa agct cat gag t cagt cacat ccacgct t ca cct aaagcaa gaagat gaga aagaactttt caaagaat t g aaat caagt c gaggact gaa acgaat agt a gt ggt t ct aa aaggagccat t gt gct gat g at ccgt at t t aaccat aat t gaat ggt cga t gaat t at ga aaact ggat a t t ct accct t t ggat t aggg aaaggaagaa ct gcaccct c agac t ccct ccat c t agaaaaat c t cagt gact c gcgaat gcca gt gct aacac t aat t ct cat t accat cct t gt ccaagct t gaaccggagc t caat cgcaa t cacaact t c aagct gagt t ct aat t ggca aat cgt t cac at gt acaat g taaagaggaa t cgat ggaag gaagcgacgg ct t gggaaat at gact gt cg ct aact cgga at t t aaaat a t t at t aat t g aaat aaaccg agacaaccct ct ct gt t gt t ttaaaagaac gagat cgaaa act aaagct g ccacagct t a gccacccagt at gagcaact aaat gacct t ct t cccat ag acgt cccat g agt gacacct cgt t t cgt ac agcct ct ct g at gct ct gt a acat aat gct aaat caaat c aagct t ccac aagt gaagag cagt gcaagc cagacgcgac agaaggaat c agaaagt aga act gaggaag ct agat ccga gcggt t ct cg aaaccagt cc aat ct aaagc t at t t aaccg t t t t at ct t c cat t t t t at t t caggt gt aa t ct cct at t g ct ct cccacg gt at cat t at t cccagagt c aacaat at gt t ct ct ct caa ct cact gcag aaacaact gt aaat t at t ca gat t gt act t gcaacaacaa t gcagt ggaa at t caagaac gaagct gaat caagt ggaga tcgcagccac t gccggt gt g gagagat aga acgat gct gg cccgt agaat at ccgt t ccg cgat aaaccg aacgcaaaat aaat caaacc ggaaagt ct c ct cgt cgcgc gat t aagt gt aaaacc ct t ggaaa caacagct t ct cat gt ccaagt ca t agt ct t g cat t gaga gcct ggt t t ccggt t t cgat agt t gt cacaaa t caaat ca t agct gag t t gt caat cacgagag agcagaga t at t aagg gat agat g tgaaacgg gcaaat gc acccgaat ggaaat at caaccgaa aacgat t c at t t ct cg aacaacaa t xt gc agaat ct at t at acccatcaca ct ttgtatacca tat tccaccagga ga tccaagcaaa gg tatgatcagc tag cggt ct t ggt ag aacacttaat taa cct gggt t gg ac acaaatgcca cc aatgttactt gt acttgttgcc ttc acacaccttt t c t agcaact ct ac ttttatttct t g cagat acct g ;aa agagacttcg ttt actgccggag ;tt tgtgacagca ca at ct gaaggt ;gt accaactgag aa taaccaaaaa ta aaactgtacc tt tgattaaact t a t cat t aaacc ct ttattggcaa at ccccaattcg tat ccagatcgaa 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 129 <211> 2000 <212> DNA <213> Arabi dopsi s tha i ana <400> 129 aagaatctaa aacattgtat cctaaatatc tatctgtaga cactttattg tgtttttata aagcaacaat aatcaaatca aagttcacaa tcaatctaat gtccaagcaa aaacaattaa Page 141 120 12689250 Sequence Listing.txt agt gct act a ggcaat t t gc cact at caaa aat aaagat a ct at aat t t c gcaaacgt ga ct at gaaagc acaat t t t t a at aact t at a gat aaat at a ct aaaaccgg t acat gccca caaacaaaac gaaaacaaaa t t t t acgat a t cgt gat caa t gcat gat t a t gt t t aaaat t cacaacat t agt t aat t t a ccggaat cca t cat ct at at cacaact ggt t at acat aac t t t t t aagt a gt cact aaaa at t ct accac t aaat acaag t at t act acg ct t aaggccc ggagt t gt t g aat t gagaga aagt t ccat t at t cat cat a cact t at gaa act act at ca cat t t t aaaa t at t aat t ag t gt t t at cag aaacat t t ca t acgcat t t g t at ct aat t a t t t at act gt cat gcat t aa aaact t t t ac at t at gaact aggat gat ga aaat cccacc t ct ct t t aac gct t t gct aa t t aacct t aa at cat aagca at t cgt ct ag aacaagat t a at at caat t t at t cacat ga ggat cgaat a at t t gcacca t agaccact g t t at cact gg t gagt acgt g aaat aaat ac tagacacagc gacagagaaa t ct t aat gat t aaat t aaat at at t ggaat at at gacaac aaaaaat t ga t t act gaat a cgaat t at aa gt ccagt aaa ct t gaat aat aat t aacgt g t caaaacat c ccact cagt c aat t cat agt cat gaagct a cgacgact t c aaact aaaaa ct aacct t aa gaaagtggga at t acact t g gacaaaaaca agt aaagaat t cat t t ct ca t gt gat t aat gt t t agt gga agagt t aaat gccgggaat c gt gct t ct t g gcct t aaat g acct at at at gtgaagcaga agct t ggt ct ct acaaagt g acaact t t t c gt t gt gt aaa aat t t gt gac gacggccaaa at t aact gcc cat t t t gt gt gt t aat cat c t ct gccat ac agct t agcag aaat at t gag agacgagt t a t gt gcgt t at aat gggact g gt ggct aaaa t at at caaac t t aaagt t ag acacaagatt caact t gaga t gcct t t caa gaagat acaa t t ccagct at ct agt t acca t t t t acaact gcat gt at t t gaacccgggt t gaat t at t a acgct acagc gt caaaaat a gct t at gt ga ct t t cat t gg ttgtttatct aataaattaa agt gt aat at agt gt aaaaa agt acaaaat aagggccaat at t gaggcca aaaagaaaga agt t cat cac aat at aat t t agagagtat t acaagct cat t at at ggt t g t at cggaaca aact t t aggc aat agcaat t t aaat caagt t t t agat at t t gagagt at t cgat act t aa ct ct caagt a t t at aat t gt t aaat t aaac t t cat acaac t aaat t t ccc gt gaaacacg ct gt accgt g agacaagtt a t gagcgact t cgt gaagct t ttat t t cct c cacaaagaga at at ggacct t caaat ct t a t t aaact aaa taaaaagaag aaaccat gt g ccaagt ct ac t ccaaat t t t t at gt gat t t at gt gct aag at agt cact a t ct t t t gaaa caat t at t t g ct act t cat a aggat at ggg acct act aca gacaagaaac t t t gtt aaag t cact t t t t a gt ggt gt t ag t t aact t acc cctctttttt t agct t t gcc ccattttttt aaacaaaagg gcagggtact gt t t at aat c aaggcccaaa agatgggcca t ct caggt ca gaaaat ct ga 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 130 <211> 2000 Page 142 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 130 agt ggagcat aaact caat a aaact t t aat t gact t t ct c tagagaggaa ct gaaat aag gaacccgacc cggct aagat at t gggt t t c ggaggt t t t g t accagt gag gaaat ggggt at gt t at ggg aaaagt caga gct ct t aat g tgaacaaaaa at t acat t t c at t t gt t t t a ccat gt acaa gt gt act t ca t gt aat gggc gct t t at t gg t t t t acacct t t gcct at t a t t t agaat at cagacagaaa t t aaaaaaat gt t act t agc cggcgt caca tgcct t t t t c gt at ccgaac t t cgat t t ct cagt cccaga cat aaaccaa caat t t cat g aat aacct ca aaaagagt aa cct ct t gt t t cgct t t acgc cgt ct ct gcg t t ggat at ga t agggt t t t g agagagagac t t ct ct aat a ct t t aact t a t gct t aat t c t at accacaa at at gact cc t cagacgt t g ttat t t gtgt ggt aat t gt c t t at t t at gg ccat gagaga gct ccact ct at t t t cact t tcgt t at t t t t acgt t t at t gaacgtgacg t agat cgt ac t cagacat cg t gt cgaccat cccaaggttt caaccaacgg ttct t ctat t gt gcaaat ga t aat ccaaac agtgagagaa tcagaaccaa cccaaat gat cct ccgt gt t t ct t t cgct g t gt t t t ggat t ct agggat a ggt agt gt t t gacgacgaag aaacgacgt c gct caaccca aacggat at t aggct t t agt at t t aagt ag cagt acct aa t gt gt t ct ct t gagt t at t c accat aact t act gat act g t t ct cact ct t t ct at gt gt ct t t gccgac at aat at t aa cat gt acgaa cgcat cat gc ccacgt at ac tcgt t t t gt g ggct ct aat t ct cagatt ag t t aggct t cg at ct gaaat c acaccact aa at ggagact t t ct cagct cg t cacaagct t gt ccat cct c at at gct ct t cgggtcgagg gagct gcgt t ttgcacgacg t gagcaaat g gt cgt t t ct a t t at aact gc ccat t t gt t g ggaat at at t ct cgt ct t gt aat at t t t t t cat ggagct t gaaacacaat gaggt t t cag ggct t gat ca cacatttttt t t t gt t t t ac t at cat t t t t gaat t t gt t t t ggct gaat c gtcagccgcg gt ccct cat g aaat gt at t t aagt aaacga t t t cat at t t agat ct t t t c acat agcat a accaaacat t t accgct ct a aacct caaca cct gat ct ga t t gt t cgaat gccggagaca gaatccaggg cgccgccgga gagcaacat t ggt t t aggag gat t at t caa at t t gat t t g tttttgtaaa at ggt aaggg t ct ct cgaca cat cat gcca ct act at t t c acat cggt t t tttgtttccg gat t gat gt c acaaggtttt acct at t at t t t ct agacgt gat t gt gt cc t t cct ccacg t gacacct ga acgggaaacg t gt gt t ccca acct gt agct t caacggct g tt ct caaaag acat t t ct aa gaaaaaat gc ggcaaggct c tcaaagagcc t t cgt ct gaa at gat cgaag gacct t ggaa aggt aagt t a gagaat aacc gt t t gct t t t ct gt gt t aca t gggct t t aa aacaaagggc t gaccaat t t t gaagt cgt c aagaaaaaaa t t accct at c act t t gt ct t gagcattttt agcat ct t t t cat t at t t gt ctcttctttt at at t t t t ct t at at act gt ttttttgacg t gt cct gcaa t cat aacct a t t aact ct ga t t t ggct t t t t cggaagt t c agat t agagt gt aaaaact t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 Page 143 12689250 Sequence Listing.txt tgggaagcag ctaaaatact gtgttctttg tttcttccat cgaaatcgaa aaaagctaag atctttgcgt tttgaaacga 1980 2000 <210> <211> <212> <213> 131 2000 DNA Arabidopsis thal i ana <400> 131 at t ct aat at t cgaat aaat t t accagcct agt at t ct gt at ct gccaat ttgaaaagac tccaaacccc aaccgagggc gt gt agt tag ttct t gt t t a tgacagagaa gagagt at cg atgggcgaaa t cgat t aagt at aat gaat a t t at aacat a agt t gt cat t gt gact t cgt ct t t aaaat c t t t cgt t t aa t ccgcaaaat aaat acat t a aat ct t caca t t t gcat t gt gaaaggt t aa aaacact ggg ccgat t t at a t gt t t gt aat t t aat at t t t aaat aaacac t t t t t ct t t a gt gt t t acga accact aggc gat t t t gacc at gat cat gc acgat gcaat t t aat t agt a t aaaaat gt g t at ct aagac aaaaat aaca gagt t ggat a gat ct agt ca gtgaggcaaa aacat aaaca gaaaat t at a gat t ct t agt agat t act at gaccgt at aa t cat at gt t t t t aacat t t a acat agat ga t aacacat aa act at gaaaa ct gaat caag gggat t t at a t act t t t t at tgtattcata ttttacgtag atcgtttaat tttctacatg t at t aaat t g ct gagt aaat agaaaat gaa act acgt t ac ct caccgact cat t ct ct ag aat aat aaat t gt ccacgaa t gt t t aat gg t t t agtggaa at t tgt acac t gt t ct ct aa ggat gct aag gt gat gct gt aggat at agt acgt aaacat gat gat t gat gaaaat acaa agt t t t at t t aaat t cat cg aggggt t aat aaat t t t gt g t acat ct t at act at agagt at cat gggcc cagttttttt t t t ct accaa act t t aacgt at t acct gaa gat ct t cgt c t at t t aat ga cacat gcat a caaat at t aa aaat t t gt t a aaaaggt t t c ct ct t t t cat t aacaaact t at t t cat t t g ct at aat gt g caaagaccac agt t acacat ct t at agagg aaacaaggat gaat at gaac t t t t aat t t t agct t ggaga t gaat act t c cacat aat at gt cgacaat g ttacaccaga t t ct t at t aa ct caaagccc tttttttttg caaat t caat t caaact t aa aaagacattt acat cacat g ccacat aat c caaaaccacc at act t aat a accat aacaa aaagggaggc at gct aaat t at t t t t t ct c gcat t t t caa at gt t aat t t caat t t ct t a at gat gagt t gt acggt t ac at agt cat at t aagat t cat gcgt t at ccg t at gt gagag ct t t gt gaaa tgaagcggat t t gacgt t gg t gt acgacca aat gt aat t a t t t t t ccaat gaat caat ag t t ct gat gt c aagt t t act t t t ct aact t c t t gacct cag aaat ct t aaa t act ct cct c t t ct t t gat c ct cct caat c t accaact at at aaaat t t g at t agaaact cat aagcaaa ggat at ggt a t aat gt t ggg gt cact gaaa acat at gat g agagggtacg ccgcaagt aa t t t t caagt g act at t t t t a aagct t aaa aaat t gtt ga t t t aat t acc t t t gagt t t t ct cct at ct a t t t aat t cct at t at at t t t aaaaaaat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 aaataatccg agagagagat ttacccaaca caaccttaaa tgcaattggg ccttaagtat Page 144 ccat ct aat c cacact aagc gaggacaaat t cccact t ct at t caacaat 12689250 Sequence Listing.txt ataattaatt aagggaaaca aatattggta agagggtaaa atagtcaaaa ttgttctcct accacacaca cctaaccaac taacaccaaa attagcaaag ccgtaattta ccccattttc tactcctctt ttaaacccaa ccgcctctct cact ct ct ga t ct cagat ct ct ccct ccct ct t t ct ccga t ct cagat cc ct acaaagaa 1800 1860 1920 1980 2000 <210> <211> <212> <213> 132 2000 DNA Arabi dopsi s t hal i ana <400> 132 t t ct cagact acagcagaag act at gcact ggat ccgggc t gcat t caga cat t t t ccag gct gt ct ct a t gcaat at t t gat at ccat a t gcat ct t ag tcaacaagca caaccacact caagaagat c ct t cacaat t cccagaaat a caaaacaaca gaaact t t ac t agt cgaaag aagt aacacc caggct ct t t t cat aat cgt t gt agaact c t gt acccaac aaggt gat t t gct ccggt gg ggagt t t gat cagact cat c t t gct t t t gc tcgcacacag gaat agaaac aagat aaaac aagggt acat ccct ggt t ga act acaat aa ct t caaat cc t caat ct t cc agt aagt t aa gt t ct t acaa ct ccgt t t ct aaact t gcgt aagt acacac ct t agt gct t aagat cat aa at ct t gat ca cccacaacca gcat t t at t c t t caagacct at cagct ct g ct ct t t ccgg agat ggat ct aat gaact ca acccccat ac cccat act ct gat cat t aga aacact aaat aat ggaacaa gagaat ct cc t ct t agcagt ct t t agt aga agt ct gccca cccaacagtt aacagagat a t gcagt gact ccact t t gac t t t t cgcct t ct t t aaagcc gct cct t t ag at ct cct cat gaat ct ccca aacat agt at ccaact t t ca t ct t gct ccg act cgt accg t cagggt aat t t agat ccgt t ccat ggcag t t at ct t t gt agagt acaga at ct t t gact at ct t caaga gaat t t cacc at t t gct at a ct gt t gaat a at caaacaaa gct t ggt ct a ggcacat caa agct gcat aa ccgagaactt aat t t ct t t a t t t ccccat t aat t caat gt gccaaccaat t at aaacct c at at at ct ct gact ct t t cc ct cct t t aat agaaagaaac t t t gat t at c ct cggat ccg ct gggt t t gg ct at t ct t ga t t ggagt at g gagt ct t gt g t gt cat ct t c aaaccaagca tggagaagca gat t ccacaa at gagagat a ttgagagagg t at t t t caga gt at gat ct a agct at gcat ct t t cgt t ac gaaat ct t cc act t caagcc at act aaagt acct aaat t g aagaacagt a caaagaacga agcaccagt a cct ct ct t ct accgt cgagt t gt gt t gact gccgat t act ggt t t t t gag t cccaaaat c agacccacga cat t t cct t t aaaagagt cc acacgt aaaa t ggt cagct t ct cgct t cag t aagt cagt t aaacat t act accagccat g caaaaagaaa acacat caat aaagt gt act cagcct ct gt t cacct gt aa t at caat caa t t act act at act t gaacaa t acacaat cc ggt ccat aca at aaact t ct gt gaagt ct c t gt aagat cg t caact ggat t ggaaat t ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 Page 145 gacgt t gt t t aagat ggt gt cagagagcgt aggt t t gaag agt t t t t gt c ttact t t ct t t t t t t gt gt t at aagct ct g gagct t cacc t gt t t gt gga cggt gccat t tagagagaga tttgaaacaa cacgggccac ttaaaaaagg t at agaggcc t t act t t agg gaagagcgaa 12689250 Sequence gt t ctgat t a aggtgtattg gttgaagaag tttcgaaaga gaaaat aaaa aaggggat t a agaaacaaac ataaccgttg t gat t at t aa caat t t t t at gaagtgagat gggcctgact caatgtttgt agtattaggg tttttat t aa aggt t t t gcg Li st i ng. txt agt t t gat t t gt gaaat t ga gggt t t gt ga t gt t gt at ct tttcggcaac t t t t gat t aa t t t t ct caaa gcagtggaag gat ct t gaag aat t gagat t gt gat t cacg ct aaacggct t agaat at at gt t aat gggc t gt ct ct ct t acgacggcac 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 133 2000 DNA Arabidopsis thal i ana <400> 133 t t cgt t aagt aat t ct ct t c t ct t t t aat t agaat aacct t t at t t at t a agaaat at aa cat t cacgat t gt aaaccat t t aaat aaat ct acgaaaaa gt at at gat t t t t t aagt t t gaaaact t aa t aact ggagg gat ct cat t t cacct at at a t at aat aat c gct agaaaag at aaacaaat gt aaat t t t g aaggt t t t cg t aaaat aaaa t gtt gacaaa aaaat t at ac t t t t t t agt g cat at t gct g t t t cctgcgg t aaaaaaaat acat aat t t c t at t ggat aa caat t t aat t ct aat t agt a at gct t t t ac t t gcat agag gacaagt t at gct aat t t at ct t act t gcg t gcaaat at g t caagcat gc at at t gat t g t gat cgaaaa aaat t t t cgt gagaat aaat aaat caaaag agt at gt t ac ct gt ct gct t t gt ct act aa cat t t gaaaa t at t t t t t t a at aat t t gca ct acgagaat gat at agcca agacgat cct t t gt t t gaaa att aacaaaa at ggt ccat g t at acacaaa t agacacgat at gt t aacag aat t t t t aat ctgcgagaag cgaaagt aga gacaat t aaa t aaaaat gt t t aaaaat at t t aaaat t agt t t t t at t t t a t at t agt gat acat cgaccg acat at gaga t t t ct aggct t cgt ct t at t t at aggt t at t ct aaat at a ct cct caacc t t aaat acac t gaacggat t t t caaat ggt at aaccggcc t ccat aat t a t at aaat at t aagt t t ccaa cat gt t t gaa t t gat aat t t ttct t t t gt c ccact t t ccg gat agt gat t t t at t ct ct c cat aaacat t at t t cgt gt a ccact at aaa t cagat t t t a t ct gt aat t a accacat cca t gat t ccacc aaaccaaat a at cgacgaaa t cat ccggt c ct at t t t t t a aagcaaagt t t agact at ag t gacacat t t ct t t gt t cgt agt ct t aaaa t gagt at gga gt caaggaaa aacttttttt acaaaagat a t at t t t aat a t t ct t at cat at ccacact a caat t at aca t at gaaaaca t t t t ggat aa ct aact act a tat t t t ctcc agt t agt t at t aaaat aaaa ggt t caat t t t aaaat at ga t t ct aaaat a t ct cgcagt g t agt agaat t at gat gaaag t at t ggt aga act t gat cga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 aaaattagaa gattttgaga aaaagacagc atgtttttat gcgatacaat gtatagtttc Page 146 12689250 Sequence Listing.txt acaaacaat t ccaat aat aa t at gat aaat gtataatttt gtagatagta taagtgtaat t t t gt aaaag t gagt aact a gt t t t at caa aat t ct t t t g ttct t t t t ac tct t t gt t ag cgt aagaaaa taccaggccg caat aact ga t ct cat agaa t ct accaaac t t agt cact a aaat t agcag aaccgt t act t aaat t t t gc agt t t t gt at ttcaaaaaaa aaaaaaaagg t gt gat aaga aact aggt t c t ccgccact t gaagcaacaa t acat agt gg t caaat at ag aaaagt t t t a act at t t caa t at caat ggg aaagt t t t t g aaaact agaa at at aaagcc t ct ccgcat c agggt t cat a t agt gct at g agt t ggat ag ct t t t t t t t t aaat act t ac at ccat aaca t aat aat aac t t agt at gac caaat t aaaa cat t t t ct gc gcagccagag agcaagccaa ct t caacacc tttttacagc aact ccagct gt t gt at aat t ct t t t gaag ccaat aact g at gggt ct aa at t act at t a agagagacaa aaaacat at a aat cat aaaa t ct ct cgt at aacaact t aa t t t at at at g t t t gt acgaa aagct aggt t cct aaaggcc aagccgcat t gtgagaggga 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 134 2000 DNA Arabidopsis thal i ana <400> 134 gcaaat gcaa t acat at at a act ggt t aaa at at caagt g aggaact ct t cccacaagca t cat agt gt g tgcacgaaaa ggt gat t ct a cagact aact cacgggagt g cact gt gat g tacgagaaac aact cgat cg t cacct cccg gcagaaatgg acct gacct g gat acct at a at cagct aaa at at cggat g ggt t t at aaa gct cgcccgt ctagt t t t t g ct at at t cac aaat t caat t aagct at gat cagaaccaaa gat gat gat g aagcat acgg ccagtgagga gagt ccgcat t gaat ccgat at t t t ggact gaaaat t aag ct ct gt ct at at cacaagga t caaat acaa ttgaagcagg at t act caca ct ct t ggcat aggat ccact ttccaaagga ct cgat gt gt cat agt cgt g tt act caacg at gat gat t c cggacatcgg t gacct ggct agaaacaaga ct caact cca ggactggcga gct aat t act at gt gat at t t at t t caggt gaat ccagaa t aaat cat aa ggaaagat at gaaggcct at gaat acct gt acct gaaat g cact caaact aagcaacaag at t t ctt ct c acat aat t t a at cgaacact cgcgacggaa gaaacgcgac t gact gaaaa gagt cgct ac aat t t accct gt gacct gct ggagaatcag ct t gaat aga acaacacaaa t at aaacaac gt gt t gaaaa aacct ccat c gt acct aaac cagccaat ct aacgct t t t a ct t ct t cttc agct ggagat ct ct cgt acg aggt t gct ga gaataggaga aaaacaaccg aagt cgct t a caagt t t t at t t gcct at at agaaagaaat aat ct t at cc at t cgt gaag t cgat gct ca act ggct ct t t ccat t caaa aaaacaagct aacaaccat a at t t cact t c t agct aaat t t gt acct cgg tcaaggaaag gccgagtcgg gaagaaaacg gagat t t cgc cggcgaggga t at t aaggt g ggct at at gt t gaagct gaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 147 12689250 Sequence Listing.txt taagacacta tatgggagag attgaaagga agctgttggg ccattttggt gcaagtcgag t cacaaat gg at gagt at gt cact gt gt ga aagct ct ggt t t gaat ct gg aat gt at aca ct t t gct gt t cat t gact t t gt t t t gcaaa t gaaaat gga cgt cacagct tttccccaag t cct t t aaac ct ct t ggaaa <210> 135 cgt gagact t gaaacggat a t aaacgct ag gcacaacaac t t ggat ggct aat gaat aga ggcaat aat a gcaat t cggc t t ccat gt gg at gt t t aaat gaat gaat gg gacgaagaaa aaat at ccaa aagct cgt aa ccaagaagca at t gct gt gc cat gct caga ct gt cct gt t t t gcaat gt t at gaacaagt aagat gt gat t accaat cat aat cgt t ct a cgt t cggaaa t agt agaat c t aaact act a ttagaaacaa act aacaccc ct gt t t cat c catt gcagga tggt t gt t t t t aat ggaccg t ccat t gat g t gat t ggt ag t ggt act gat t at gt t t at t gat at ggt t t t ct t t cat ct t cacgt at at gt t t accct c caagcaacgt aat t acct aa t t ct t gt ccc at gcaaacag gt t gt aggaa gt gt at gt ca ct gt agcagt at aagt t aaa gt aaat t caa gct gact aag ccat t t caaa at act acgt c aaaaact t t a at at t t t agc gt cact t ct c t gccacgt gt caaagt ct cc gt agcgggt c aggaaagat t at gcct t t ca t ct t gt ct t g ct ct cacat t at gt t gt gat t gct t t agag agccact cct t cat gat at g t acgt t gcaa gt cgccaaat t gaaaaat at at gt cgt cgt tt act cacac t ct t cct t at 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 135 aaaat t ggat t cat gt gt t g acgt at aat a agat at t gat gcatgggtcg t gaacgt t gc aggt t aat gg gt caaacaat at t t aaat ca at ct ct cact at aaggt t aa tggccaaaca gaaaat acac t t t gat at aa t t t aat t at c agt ct t t gaa t gat at gt aa gat cccat ag aggtcgt t t t aaacaaaagc agct gat aaa acact t gat g at t gt t gttc t t ct aat t aa t t acat t t gt at at t aaaat acat acact g cat t gaaaat acacaaacaa acgt t agat t t at t acaat t aggtggacga ggagacagcg t gaat t gaag taaaacacaa t acat at t t a t gat t t t aaa gt t t t t t t ag at t cat cat a t t at gt t cat t aat t t gaga t aggt cat aa aat aaaaat c gt cct gat aa t accacat at t aat t ggcgg ttacaaggaa cat agat t at acact t t ggt t agacgt aga aat t act aat aaaggt t gat t t t aacat at cat at t t aac cacgt t t aaa at cgat at aa agaagt ggt a cat gt t t gga at at at t ct c aacggcggt g aat gct gagt gagcagccat ggct t at t t t ggccgataag at aaaacaaa t t cct cacat t act at t t t t aat gaaaat g acat gat cat ttaagaaaac t aat t t t agg t gaat t gt gc gagcgagagg gaaggtgaca ct t t t gat ga at at t t gt aa t t gt cact ct aagt t at acc t gat t t agat gaggt at aca t cat t t t t t t at at t aat aa gcct at t t t g aaagt at at c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 tccattaacc aatgataata gctgaaagca tcacatccat caccaaatat gaattcgatt Page 148 12689250 Sequence Listing.txt ggtcactaca ataactatga aggataacga ttttaagaag attaagactt cagtggacaa aat t act t ca aaat acgggc t at t t at t t g at t caact ag t t t cggt aac at t t at act t tttttttttt act aaaat gc gat gcaaacc gt caaacgat ct ct ccccgt taaaaaaaag ct t t t t gtgc at gat gaaat tgaaaaagaa t gggcct t t a at aaaaccct cggct ct gcg <210> 136 t gat t at acc caaat t ct t t at t ct ccat g aaaagt ct aa t ct cgt t gt t gt t t gt gt ca t t t t t gaat t aaact gat t g aaact ct cat t t ct gacgt t gtct t ct t t g gat aaaat t a agaaaat agt at t t t t t t gt at t aaagggc t t t at t aat t aat ct t ccac aat ct t cgaa aat gcaaaaa t t aagat t ac acaaaaacaa aacat act t g cacaat aaac aagccaacga t aat gct aaa aaact t t t at t ccat cct t a t t t gt ct at c gt t cct t t ca gt gaact t t t t t act at t t a act caat aca ct t aat ccga caagt agccc t t t aaat cag acgaaaaat a aacct t t gaa aaaacaaaaa t cgagt t gt t at gt ctt t t t ccaact at ag t t at at t caa gaaaaagaaa t act t cagaa aaact t at aa t t cct t t gac aat cgat cat t acat aat aa agt t t t gttt at t t caaagc aat t t cgt at gaacaaaccc aat gat t cgg gt t t gaacca gcgt at act c act t gt t act t gct agt gat t cgt t gggac agaaat aaaa ct t ct ct cct caccaccacc gt ct ct ccgg aat gat t t t t cat at gaaaa tgt t t t t ct a t gt gt gt gt g ccaat agt gg aat at aaagg t t cact gcgc aaat at gaca gat ggt t t cg t gt t t t gcat t acgt ctt ag cgt t gt at t c aat gat t t t t cgt t ct t t ac ct aagcacga act cat cgt a ct gggat ggg t gcat cact c aat agt t t ac at t t t gt caa t ct gt t cgt c aagt ct t t t a t ccat t ggag cagt gt t gt g 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 136 at aat t t at g gt t t ggt t t t cat cagcat c at t accgt t t cat t cat t t c t at aaagat t gt ggcggt gt at t t caat ga t caaagct aa taccaaaacc caaaact at c t gaaggt act aaaact gct a gact t gggga gt ggagcagt t cggt aat aa ct ct aagt ca t gt cct ct cc aagagcaaac at act t gggg gcggccaccc t gcacct t gt aacaaacaac ct gcgccact t accagt agg gt t gaat aaa ccgt acacac gt cat accag at t cct act c aaaat at t t g agact gaagc at cct t ccag ct gt aacat g t cgccat gt t tggcaaggag t ggt t t aact at gt cct gag t cgagacat t aaacat t gca aaacaaat ac gacaat aat a at at aagt ca ccat acccct aaagaaaaat aat t t agt ga t aagagccat ct ct t aacat acagaaaat g act t aaccaa ccgat ggcaa aat cagt ct c att aacagaa at cat at aca cccat aacct acgaat gt t a caacgt act c t t ggggt t gc caaagt t agg at gt t cagaa at t aaaaat a acat agcct g t at at ct t ac cat ccgagt c ggaat at aat ct acccgggc t gcaaaat gt 120 180 240 300 360 420 480 540 600 660 Page 149 12689250 Sequence Listing.txt atgtcaagat ctcatttagc atgctaatgt ccatcagttc aagaagatta ccagt ct agt t gcagaat ca at cat t ccca at at t t gt gt cat cagcagt t cggcct gca caact aat aa gagaaatt ag agggcgtcca at ccagat at ct at accaac tcaagaacag aact t cact t t t t act gaag t act t cggaa t t t gat aagt catt gcacaa t t gggcat t a aat t t t t t ct at ccct cgca aaat acaat c acct t aaccc gct ggagt gt t gacat aggt aact cat cga agaaggatt c aat aacat t g aaat gt agt c t gaaat gcgc gt at t aaagt gggacccggc t ct gaaat t g agagaat cgt t agt ct caga t t at t t t t t t cagt t agat a gaaagccat a gaat act gat t at ct t ct ag gagat gacaa t t gagt aaac ct ct t aaact t gt at t act a aggccaaaaa caat t aat gc t at t agaat g at t cat ct t c cat gccaat a t t cat at cag at t t acat ca agct t agagc t at agagaaa gaat t agcac caaaaaaaaa agt t aagggt agt aat caca t at t caact a ct gaacaat c aaccat ggaa cacaat caga gt gaat agga t gt caaggt c caact gt t at t gt aat at at at t t at t aaa ct t t agct t g gact gct ggg aacct at aag aaaact gaga ct t gaacgt t t gacacat gg gaaagaacac t t act ct t ct agaagt at t t aat aagt at g ttct t t ctaa t t t aacat at acaaggaagt cat ggccaga at t t t gacaa ccagaat cca taagaaggag t t t accact t aaaaggacca t gccat t gga acacaccaga gt t t ct t cat gcat caggaa gt agat t aag aagccaact c t t ccact t t a gaat ct ggt a aaagat t gaa ct ggt aat aa t t t ggtatta aaggccacca t gat t cacaa aagct aat ac accaggt aca t t aacagt gc agcaagt aat acct agat aa gaagacaaac cccaaat gct t ccat ggaga at t aat ct ga at ct act ct c ct t act gat g cat cagct gg t t gaat acaa t t gt aaat ca acgggagat g t cct cgact c t aat t t caca caact cagt t t t ct gat gcg acat t gct t c cat at t gcaa ccacaaccaa agt acgcaag t aacat ccat t aaat gt gat aaccat at at caacaaaaca ct t cat t agt t agt t t cgt a t ggt at cat a t aat gaat aa tttgaaaaaa 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 137 <211> 1149 <212> DNA <213> Arabidopsis thaliana <400> 137 tctcctttgc ccccacagtt taatatattt gttccaaatt tgaaaaaata caacaaacaa cattggtagt gttacatgtg tcgtccttcg t gacaacgt a agt gt agcct gt gat t cat a atcttttttg gaaatcgtac cgcgtaagaa ttttctcttc caatttcgtt tcgacatatt atcttttttt cttttagatc gttaattttt ctttaagctt gtcttctctc ttgattcgat cat aaacacc t t cat gaaaa t gct gaaaat gt aaaat at a acat gt at aa t cgcgat t cg gat cagcgat ct gct gaaaa Page cccacgagaa t at t caat ct t t ggat agca aggact aaaa at acct at aa t ct aaggt aa t cgct ct t ct cct agaaat t at t aaaaacc at aagcaaaa t t gt t aat t a at aaaaat at ggt ct t at t t aaaaaaact c gat ct gt gt t t t t gat t t t t 120 180 240 300 360 420 480 12689250 Sequence Listing.txt ttgtttgttt tgctccatgt gtatggggat atttacgatt ttaacaaaac aaaaatatga at t gaggt t t aggt gagt ga t t t acct gt c ct t ggt gt at at aacat t gc gt t t gaaact t gt act t gat at t aact gca t t act t gagt caat t gt gac gagaaaaca t t t at t t agc t aaagt at ag at gt gt t at t gcaat t t gat t ggt t cat ct caggct t ggt t ct t t gat ga t t t gat ct ct t t ggt gt t cc ct aaat t gat gaat t gggt t aact t t ct t a gat t gat ct c t gggt t t act gat t t ct cat t t gt gcagt t aagct t gaag gaccaaaat c t gcat gat cc gt t t t ct t gt t t aat t gt t c t gct t aggat at t t at at t g ggagat t gat ct ggt gt gt c t gat t gat t t t t aat gt t t g agt gat t aat cct gagct t g ggt at ct t t t acat t sgt t t ct t aaat t cg t gt ct gct t g ct gt acct aa t gct cgat cc at ct cat t ac gt t t gt gcaa t acat t gct t aagt t cat gc cgtaggt t t t ggct ct ct cg agt t ct t t ga at gt t t aaag t caat gagt g cct gaagaaa t t gact ct gg t at gat t ggg ggt t at ct ca t t ggt t t gt c gt t act gat t 540 600 660 720 780 840 900 960 1020 1080 1140 1149 <210> <211> <212> <213> 138 1315 DNA Arabidopsis thal i ana <400> 138 aaaagaaagg t t t t aaggt t cat aat ggt a gt t acat gt a t t gtgtggac t acccat at t t acact at t g gat t t t t aaa aat t aagt at ct t at t aaaa t t ggcccact aaaagaggaa t gggt t ccct gaaaaat t gt t t cagt t t ca t at ct acaaa at t ct t t t t t t ct ct at cgt t t t t gaaggt cct t t at t aa act t gt aaga acaat cct at at t t t at t t t gt ct aaccaa tttttttttt t cggat t t t g aagt t t agag tgacaaaaag t t t ct t t t ca at gagaaaag ct t t t t agt t aaat aat gca at t at agt t c gaagcccat a tctgt t t t ga at caaaaaaa t aaaaat t gg at t ggaagga ct t t agat t c ccacgcgt ct t ggct t gcca t aat t gacca cact t at caa gct caagt t t at gat t t gt t aat gat t t ga at at t at t aa ggat aat aat caaat at aac ct at t t act a aacat ggagt aat cact t at t t ccgt t t ca t aat t t ggat aaat t aaaat at cat aat aa t at at t at ca t t t acaat ac at t agagat c t cacat at t c aat at ct caa ttgt t t t t ga cat t t at t t g gt t ct t t t ct gaaaact t ga aacaaacaaa at gagt aat a t t t at t at ct t at t t t t t t g ct t cat t t at act t gt t gga t t t t cct at c t aat t aagga tagagacaac aaaact cat a gt gagaacac agt gacgt ct acat t t at ag ccgaccct ga at at t t aaga aagt t t t t ga tttttggctc ct cat aaat t aaaat gat t t gt ggt aaat g gat t t gct ac tcaaacccca t t t agat cca t acat at aac t t t t t t t t ct aaaat t aat t aat ct aat t t agaat ct cga gcgct cct at cat ct ct t cc at t t cat ct t aat t aat t ag aaat aaat ac tttttttttt aagat t t at t t t t gt t ggga gat gt t ggt t t acat t at t t t ct t t t t t t t t at gat at ct ct t at t t t t a t t t t at t gt t t ct aacat ac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 151 ggat t t t t t c at t t t agaaa t acact t at c tcgtct t ct t ctttccgaaa attaaaaaga t cat t t ggct cat cat t t t c 12689250 Sequence Listing.txt ttgaactttc taataaaaaa attcggaaca agtcaccatt gtgacgaacg aaaacattaa aatagaaaac taacaattat tttatatata ccccaccacc atctcttgtc catcatacaa tcttcttctt ctccaagaaa acactgaaga agaaa 1140 1200 1260 1320 <210> <211> <212> (7 <213> 139 2000 DNA Arabidopsis thal i ana <400> 139 aaaaagcttt t t t t atgct a t t cat t t at a cct t t caggt t t t cgt aact aat t aaggct gat at cagga ct t cct t gca at aaacaat t gat t acaaaa att aaaacaa aaaaat ct aa ct t acaaaat ct at t at t at ggacatt cct gcat cacgga at gaaat aac cat gtt acaa ggtcggggca aat gtt gt aa t at at at at a gt at ctt t ga at aaaat ct t tt t aagt cac aat t t t t gaa t aat aggct a t at gt at gt a ggt gcat cat caacgt gaag ct gt gt gct a agagt t t t ca t cat t t gt aa ct t cggt ct g gt t t gtt aag aaagaaaaat t aaaaaact t ct at gtt t ga t t agagct at gctt at aaaa tt gt gatt at acat ggcgt t agaat at gat ccct t caaat ggtt ctt ct a actgacggga t agaaaat ac accaaaaat a cgt t ggt t aa t at cagt agc gtgt t t t t ca ggt aat caaa at t t ccgt ct ct aacaat gt aactt gt at a tgt t t t cat t gct aat t t t c t act at cgaa at gt agaaca cgagagcaat ct t t t ggaaa t aaaaat t aa tgt t t t gat t aaaact cgat aat t aaaaat t t aaaaat at gcaaaaaaaa ttgt t t gtcc at agcaggt g t ggct gct gg at t gaaggt g gaagtgggga act gat at t t cact acaat g ttgat t t t ca at gcat cat t aaat t t aaga ct t aat at t t t aaat t t t at acaagt t at a cagagctgtt t at acaaaaa cct aat t t t a at t cggt t aa acaat ggat g gt gt t t ct ct at act t aaac aaact t t ct a aat caaat aa t acgt gaat t at aat t t t t t t ct at aaagt t agaat gcat t at at ct t at ggt t t act ca cgcct ct aat t gt cgcccaa aaggt t aat g act at agcat t t gt aat aga cat caagacg t gggct t t at tt t acatt ag at ggaccaaa aaat gaaacg at gt at gt at agt at aaat t agaaaaagaa t gct gagttt t t t t gtt gt t at ct cgaaag t aact at t gg t ct aaaact t at gt aaggt g at gagt t ggc t aat gaat gg t at t t t t t ct t t t aat at aa aaat at t t t a acat t t cat c ggcgtcat t t act aggt gcc t cgt gaact t aaagaaaagg accgagat ac aaact t at cc aaat gt t at g gat ct cgaaa t at t t gat t a at t caaaaaa aaat cagt ca at t t gtt tag ccccaccgt t t gaaactt gg ct agt t at t a t aaggt t t t t aagact t gga ggcggtaaca t gat t acaaa ctt gaactt t t t t gct at cc ttttgcaggc ttttttttt g t ttttat at a t t t at at at a gt cgt gt t t c gt t t t act ct at aat t at aa ct gccgacag agt t aact ag gact t t ct t g at t ct gt t at t t t t t gact a t at t gt gt aa t agagt agaa t aaat t at aa t gcaaat at a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 tatgctttgg cctttgtaat tgttatttat catctgtcgg caaaataagc ggaataatca Page 152 aagagct aga aacctt caaa at gt ggt t gg t t t ggt t t cc t ctt cagat c gt at cat cac agagagagag at t agacgat ttaaagaaaa t t t agat cgc acaaagt act acaggccct g gagacgacca agagagagag 12689250 Sequence aaatcttttt cactgatcaa attcaccaag ccctaaaatt acgagtcacg agaccaaaac cacgcgcatt cagcagcgcg ataatttctc ttctccgaca tcggaataac agagaaagag Li st i ng. txt at cat t agca tcagt t t t t t taacgaagat tgat t ggcaa gagagagaga agaaaaagag t acgt t cgt a tttttgacga acgt t gt cgt t t cct gaat a gaagcagat c agagagagag 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 140 2000 DNA Arabidopsis thal i ana <400> 140 aacaagt aat caaacaccag t aaccat aag at caagat t a cct t gacat t tgggctagac t t t t t gggct t t t gagcct g t ggt cgacat tgt t t t gttg t at aaccaat at t ccat aat ggatgaagaa gat gaact t c cat ct cgt ca cat at t ct t c gccacgt cac accggat aag t aat at t t at tttttttaag agt t ct cggg gt t agt t t at cgat cat cct ct at at t cgt gt gagt t t gt gt t gaaat t t ggaaatggaa cagat aat t c aagt ct at t c ct t at t t t t t gt t t caat cc cat gt aacct agaat t cggc t cacat gcca at at gt gt gt tgagcgagcc cat ct cccaa ccgct t gggg ccaagaccat gaaccacagc ttct t t t gt c t acagaat at aacct aaat t aaggt t caaa t aagt t t ggt t t at ggat t t t t t gt gt at c tcaaaaagac gt agt agct a agcccatttt gt aat gggt t t t cagaacgt gtt ggct t t a gt gt ggt t gt aatt cagt ac t ct ct t aaaa t aaaaat t t c agagggttcc ggggcaaaac cgtt cacact act aaaccac ccaat gacgt gt cct t ct t c at at at gaca cggat cgcat at t aat cgt t gt t ccct aat t ggt ct aat t ct t ct t attt gacacgt gct gct actt aat gat gt at t ag t caat cat ct at ggaacaag aggt ctt aat act t gt acgt cat gt at at g gaaat acaag tt ct att gt c aaggtgggcc aaaaact t aa at ct cat cac t t aaaaaggt cct cat t t at ct t gct t t t g at at cgcaat t gacgat gt t t at caaaaac at t cat at t t at t at t aaaa ttaaccaaga gagaaaaat a cgtt gt at at cgacaat t ct catt gt t t ct at cacat gt t aacgt acgat t cact act t g t t t t t ct t aa cact cat agc caaaat caac at t gggaact gaaat aat ca acgt ggaat c t t t gt aaaa t t t at t t t ca t t accat at a cgt gggt t ca gat ct at caa t acat gt aag caggaccaaa t agct ggt t g aaacaaaaat aggaact t ac t gt t ct t gat ct t t aagagt t t t gat agca t cagcacat t t aat t at gt a acat t agt t t ttggt t t ct t ccgaccat at cat aggt ggt agct t gat ca t gat aaat ct at t caaaagg caacaaagca ggccaaaaaa t t gaat t gcc aaaaaaact t gct t at ct t a accacagcca t t ggct t t gt taaaaaccaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 tcaacgaatt tttagaaaaa cgtaaaacga tataatttgg Page 153 t gccat ggat t t gt t t t aat t gat gt aaat ct aat t cat t aat t aaaact aaaat at at a ttt ct cgcca ct aaaat aaa aaaaagct aa t agt gcct ca aagct ct ctt tt cat ctt ct catt att ct a caaaagt aca t agat t agt t act t t t t ct g t gaaacaat t caat at at aa gaaaaaacga aaaggcagt a t cat cat cag ct cat caaac 12689250 Sequence att acat cat agaat agat t catt ct ct aa t aat aat aga gttatttgct atgagtgtct t at aat ccaa t t aacaaaat aat t t gaat a aat agt aggt cgt ggt act g t gat cat aaa catttgtttt tttctttcca t aacct t t aa aaat t ccccc atacacacgc catt at aaa taccatctcc tcgttttgat Li st i ng. t xt caat t ct gt a ttttgttaaa gagat t aaaa aaaaat gagc aaccggtggt t aat act t ga gt at t t at ca ct caaaat aa t aaggcaagt aaaaat ct ct agggt t gact ct t gt t aat g gat agt cgca gaat aaaaca t ggat t aaga cgt t cct gaa cat gaaaaaa agaaaaaaat gaat at aaaa gaat ct ct cc 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 141 2000 DNA Arabidopsis thal i ana <400> 141 gt cggact gc t t agat gaaa t t at cct cct cccat cgat g acat cagt aa gt ccaaaaca caaaaaagat gtt ct gcttt gct t t gt t ct t gt t ct gat c gt gt ccgt gg t t cctgcgag t t gct t ct ct gaat ccaaat aaaaaagttt t t ct ggct gt aaat t ct ccg ggt aacacat t gat cat caa catgagagga t aaaact caa t gccat t t t t tt cct agaga cat caat at t caagaaagct ct t agaat gt t ggaaaggt a gat caat acg t gt gat agaa att cgaacag caaat t gaag gct ct cct ct aggct atttt t t t gggtt at cact agcgt g gccaccggag at t cct t at t t aaaact aac agt t t t ccgc acagacagac t at ct t gcag tt ggat at gg cagaggt aaa t caccgt cca t t t gct gaat aacgt t aact t t aaacat at act ggt aaac aagacaagag aacaacaaaa at t t aaagcc ctct t t cttt t t at t t t ct c gcaaat ccaa t gt t acgat c ct at gt t ggt acaaagt t ga tcaacggaca gaaacat t at t t at t t gcag gat ct cccaa aacat t cgt a at acacaaga tt gt gcaggt ttcgacccag acacacacac at ct aacaat caccgcgt cc cct aaagat g aggaaaagt c ct t at t t cat t ct aacaaat t gt ccgcgca ttttccaacg t t t gtgtacg at gct acaaa t at t t ccaag ttctgt t t ag accacgagt c gaaggactgt at agagtt ag aagctt cacc ct t t caacgg agaaat ccgg t t t acact t a t at gat t gca att aat ggca act ggaagaa ccat t t agac t cat agagca tt gt agcgt t cact t aggcg gcggct aat g gcgt gggcct gt agaaagt g tcgaacgagg t gt t ggt t ct cagt ct caaa t t act t cgga t t aat caaat ttctct t t gc agcagat aca aaaat ct at t ggaccaat ac ggat t cgt at aggt t gcat t agcaaacgca at aat aacag gaagcgacac t t gggt ct t t cat t gt caat cgat at t t cc t act agacaa aagaaaaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 aatatagaca ttgccgaatg ccaaccaaaa agaaaagcaa aatcatcaaa tcgaacaaag Page 154 12689250 Sequence Listing.txt t at t t t t at a ttgt t t t t aa cat ggat at a gt agat t aaa at act cat gt caagt aggt g t aaaagt at c caagagt agc t t t t at caag t at t at ct cg t ggat t aaat gcacgt gt ca agt t t t t t aa ct t ct ct ct c at t act ct cg t aaaat act a act t gaaaaa aacat t agat t t agcat gt g agagagaat c t gaaccaat g at ct aaacca agt aaaagt t aact t at cca t agct gaaaa t gcacgt gat cacacgcccc t ct gaagcga act ct t gagc aacat gt t at acat gt gt gt t at accagga cccaacacaa act gt t at t t at t at ccgcc at cact t t ga ccat t t t cat acacgcgcgt gt aaagcggt agcgcacgt t ct t at t cgat ataataatct ctttgattaa gcatgaaaca at gt t aagaa gaagaacaga gcaaagt agt cact t acacc acct t t t t gc acgt agat ac at at at ccaa t t ct cat aat t t aat caacc cggt t t t acg agct t t gacc t t ct caccca aaat t gaat t aaagat t at a tagcaccagc t t t t gggcct cct cat gt at t ct ct gggt g gat gccat cc cagat t cgt a gt t ggat t ct cgct t ct t ct at t gtct t t t cccaaagcca at t t t t gtaa t aat aat at g aaaaact cac cacgaat ggc at t ct ct cgt caccaagt t g cggt t caagt ttcacgcgcc caatggacgg gct agccgcc gt t ct ct at t ct cact ct ct 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 142 2000 DNA Arabidopsis thal i ana <400> 142 cat act agaa ct t ct t gt ct aagacaat t a ct ct at at ac at aaacaaat ct cgaaaaag agaaaaggaa gaccacaaga gat caacgt t t gat gat t ga t t gggtaaag ccacaat gca caaagaat at cacagat aaa gaacat t ct t aaaagt t gca ggacacagca t cat agacga agt t ggagac t t cagct cct cacct t agaa agt t t t gat t gagcagct t g t t ggt gt ct g t at aagaaaa agaagt gaat cagaacact a aggt t ct t aa tgaaaagaca cacgt t t t ct aact gct t ca cact t at t gt aacaaaacca cat gt aggt a aagt agacga t at t gct gat gt at aaat aa t gaat at acc accagct t ca cat at t gcaa cat aagt gag ggaacccat c gccaaggtt a gacat aagt g aagaat t cat ggat t agt t c agcacgct gc gat t t cat aa tacgaggagc t t at t at t ac cgaacct t t c cgagtgcccg t t gt acat t g t t gaat t cac aagagt acca agt ct gaaga gaacctt ct t cact t aggac at gct cct ac gat t cagacc gaaaagagaa ct t ct gcaat t t t cat caat aat ct ct t ga t t agaat t gt ct t cagt gga aaccgt ct ct agaccaggat gagt t ct at c aggt aacacc t t ct gt cat c ct cagaaaca agcct ct t cg ggcacct cct aagaaaatt c gat at aaaaa acaacct cct ct cct ggt ct tggagagccc aaat gcaaga caat ct t ct t acat t t ct t t t gat ccat ga at gcct gcag at gcat t t gt ggct t ct t t g agt gccat ga caaaaacat t ggaaact gac ccaagagttt t t at t aggcg ggt ct ct ct a t ggct t aat c at aat ggcat t aacat acat gt gctct gct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 155 12689250 Sequence Listing.txt ct t t acacaa t ct gct gaac aagt gt t gat t ct agat aat gcgt cat aag ccaaaaggt g aggcgt t aga cgt caaacca ttttttcaga agat aat caa ct cct t caca gaaccct gt a gaaaat agaa agagccaact caggt ccaga agt t ccgt ga ct t cgt ct gt at t t t gt ct c acaagagt ga acat agct gc gaat ct cgcg gaat t gggt a ct aact t gga t agat caaga agaacgcat c cacat at gga at t gt gt aca tcacagaggc gcccaaat ca aaggaagaac agcgt t act t at ct cccaac agaagaagaa gt at t cct t c t caaagt ct t aaagt accaa agcccttttt agagacagaa aact t t t cga gct accagcc gt gaagt gaa accgcaccat ttaagagaca t t t gact aaa aaat aat aac aaccat aaac gt act gt t gc aact ccgagg ccgt gaagcc agaaagat aa gcat gt t aaa cgcgacgact aat accct cg t gt t agt gt t agggacgatt cat t aact at gt aat accac ct gcacaaga tgacaccaac t gt ct t t caa gct aaat caa cat gaacaac aggt aaccgg at t t gaat t g acaagat gaa agaagat gaa at cgcct gaa t t caaagt ag gt t gt cgt t t ttaagccaga at t ggt t ggt t t gccaat t c tttaaaagga aagt at agaa agat accaca at t aacagaa aat ggcat ga gaaat caat g at t gaat gga agt t t ggat g agt t cgagcc aagt gct cac act t t gccat aaact acaaa t at gt ccgaa cagggagaga gaaat t t aga gt t t aacaaa at act at aag agt t caggaa agacat aagc gagaaat caa t gt gacaaat agt ggagacc aat gcaaat c gcagaaggga ggt t cat t ag at cgt gct t g t at cgat agg t t caat ggcg gt ccact agg acgacat cgt gaagggagaa gcat t t t t gg 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 143 2000 DNA Arabi dopsi s t hal i ana <400> 143 tcaggagaga ggt t cact t g gat aat gt ca gaagct cgag at agct ggat at agat t t gg cat t cgggt t at cagccct g ct cgaagaag ggt gcat t gc gagagt ct t t t at gcagct g aaccagat ct agact ggaaa t gat t t gcaa acat at t cga at gct t t gaa at t at aaacc ct ct ggagaa gact agacca cat t agaact t aaat gcct g t t aact t aga ct ggt at gt g agttacctta ctctccctca tttcaggttg tggaaaattt at ggat agat t gct ct t at t t aacacgcct cggt at ct t t aaaccacat a aggt t gggag t t act ct t gt cat ccgt aac caagat ccac accgcaaat g ggat ggt t t t gct agagcag gat at gt act gagaaaact g ct cgaagct t acat t cct t g t at t t ccaca at ggt agat c at gt cagct a cgt aat gt aa gcagct cct t gcgaggat t a at at at at gg caaaat gt gg t t gt cacat g t gaaact t t t ccgt cct t ca t cat gaaaca t t ct t ggaag aacccgat gc agat agct ga at gt agaaat gat caat t at gt gcaagaga aagt at acat gact acaat g t agcaagat g agct t gt gct agt at acaac aaaaggaaag t ggcat at gg gcaagccgct ggcaaacat a gaaacagagg 120 180 240 300 360 420 480 540 600 660 720 aacat t aaga agt at ccagg t gaaagcgt g at ccaggt aa acggaaagaa t cacagt t t c Page 156 12689250 Sequence Listing.txt act gt aggag aacat ggt ca t gt agagaat gaagtcatat actttacgct gaatggtttg agt t t gt t t g gaact t t t ca tagt t t t gt g at at gt aaaa t at t t t at t c t agacaact t t agaact ccg cct t t t t gat cacat gt at c at aaagt gat ggagt at aga aaat acgagg t gt aacgcgt t agct t t t gt cggt aat t t t ct cgat ct at gt aaat cat c t aat t cct gt gagat t at gt ggt t t aaagt ccaaagat aa t t t gat gaca at ggggaat c ggaaaaggt a accact aat a t aaat aat t a t gt ccat t gc t t t cgtattc caacaaaaac t gt at t cat g t acgat t aaa act agaaaac t aagt aat t a t ct t ct gt t c t cct t t t gat at act caat g t at t cct t gg gat t t t t act ct t agat t t g aaaaaagcaa acacgt at t g at gact t t t t t agt gt t at c at t at gat t t t at t t ct gga cat aacat t a gcgt t cccac t cgt t t aaac tat t t t t t ca at t aat t t gt at t acaaaga aat t aaaat a at at t t t t gg caat t ccaat cgcatttttt at ct t at at g t aaat t at t c gat gt aat ct ccaaat gt t a t at aaagat g agt ggat t t g aacat gt gt t gtggaggacc at t t at t aac agaaaat at a gacgt agat a cccagt act t t gcat cat aa aact gt t t t g ct acaat gct t aaaat agat at acat ccat t cct t ct gaa t t t t at ct t t t gt at gat ca ttcgat t t t t ggt gt t gat t agt t t gt t t g t gt acaaaga t t gt t at t ac t gaat aat t a agat t ct t t t t t aaaaggat t t t t t cctta t t t t ct t t at t ctt aaacac gt gcaaaaca t t t t acgttt t t t t at t t at aagat at gct ccat at at at at at t cat cg gat ct t ccgt t gt aat t t gg t t t ct gat at gt t gt gaat t at ct t t gaca acaaagt t at t t t t aggat a at at t acaag ggcat t act t aaagactttt accat gt t ct t at act t t ac ccaacagagc ct cat aaaaa t t t t gt at t t gat t gt ct t a t at t caaaaa cct t ct ccat caat t cgt at at t aaacgt g aagt t gaaat aggt t aat gg ct t gat ct at aaat at t gt a 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 144 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 144 gctggatcaa ggt t ctttcc acgt t ccaac taatgtat t g tttcttcaaa tgtgat t aac gttacaagtt gcagagaaat ttgattatat acgtatacgt aatctgtgtg tatggaatta ggaagatgac atgaaagctc aaaatgctga tggtgtcaat cttggacgtc acttacgatt tatgtttact tggcttttga agatgtcaaa aaaatacaag atgtttgtca tcattcacac atgaataaat ttggttgatg aagagtacta gt ct ct at ca gt t t at ccct ttttagcttt atagcagcta aaggt t atgt taat t cacat tgtattattt tatttccctt tggaagggaa tgtgtcaccc gttttattcg atttatacat agact t gaag atagtaagt t aaaaaaaagg taaaaaaaaa caagtttatt tatctaggaa Page 157 ct t t gt t t t a at at act aat ggat acgt at t at at t t t at acgaat t t gg gt aaaaat gt ctgtcttttt aaggat t agt caagt gaat c 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gatgggcact agttaccaca ccaattggat tttgggatgt cgaccaaaca gaagaagtag ccggaat t at aat t t t gat t t t t caacgt a t cat t ct aaa aaaaact aat ggt t ct t t t g act t t t cttc acat t gacaa t gat cat t t g t gt gagct aa at t t ccat t c tggtggaggc t gt ct t ct ca t at t t t t t at t t t ccacct t act aaaaacc t t aat at at t act at gaat a gat aagcct t aact t aat cc accaat t aat t t caat ct t t aaaacgt t t c agat t at at g ggt ct at t gg gt aat aggca at t gt t t gat at t at t act c gacccat act aat t accacc t at t t gaat g ccat t agt aa t at t t t at cg gaaaacaat t ct at agt aag aat ct t t aaa at at ccaaca aaat aaaaga aaaat act ct aaaacgt gat t t t gt aagaa at at ct t acg t t t t atctga cct at at ccg gact aat cgc ccct ct cct c at at agcaac at t acaat ag t t t ct acgt g cgagct ct t t cacaagctt a at at agggt a t at aaaaaat tcccgcagaa tttagcaaaa t ct at t at aa t aat ct cgt a t ct cacgaat aaaat aat aa t acagt agag aacaat acat aaaaaaaat c t ct gct aaag t agt at t t ct t at gt t aaaa t act acat t c ct cagat cac t t t t agt at g ct ct t cgcct t t t gct t aca at cat ct t gt t cat t t t cat aaaagctt gt t ccat aaaac gagt t t aat t gaat acaaaa gaaaaaat aa t aagat acat caaacaccgt caaat t t gca agcacat aaa act t t t ct ca ggt ccaaat t t act cgt aat aaaat t aaaa gt gcaaat t a t gacaat t ac t t at ct ct t c aggat ccgac t aat t aat aa at aaat t aaa acaat cacaa t t aagaat t a agt t ggt ccc gagt t gt gat t t caaaggt c t aat ct ccat at t t aaaaat accgact t t c t t gaaaacat t agt t ggt at gat t accat t t at t aaaaaa at ct t ct aaa acgt ct ccat aagaagatt c aaaaaaaaaa aat at t ggat agccacagaa t gcaat t at t ccgact t t t a act at t caaa cccct ccat t cgt ggt t gga t ct t ct agaa aggagagtcg gt t t aagact tatct t t t gt agt t t agt ca ccgt gat t cc aat t cagt cg ct t gat t at a t t at at t act cat gaaat t c ggct t t t t at aat gt t t gt a aat ct at t t c acaccaat t a t aat t t cgaa at gat aaat a t cct t t ggca caat t t ggcc t ccccaact g cccgacccgt at ct t aat t a acct t t ct t c 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 ctcgaacaaa aacaacaaac gcagagaaac tcaaaactcg <210> 145 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 145 gttgaaataa gatttaaaaa atggat t atc tctggtacat tcaatgaaca ctctttttaa agaagaaaca acattataag atgggcttgg caattttatt taacaaacaa gatgacaaaa agctatggag tatatgaggg taaagtcgt c tgtcgtcttc gaaagttctc aaagagaagg cgtt aatt ca t agt t gaaaa gct t t t t gt t aaaaaaaaaa at t t t acaaa cat ct gagaa Page caaaaatt cc t aaacat t t g gtat t gt t t t ggactccaaa aagcattttt accccaatct t agaggt ct c ggaagaact a t t at gt t t t a agt ccaaaac at ct t cgt t t ct aat ct t ct 120 180 240 300 360 12689250 Sequence Listing.txt tttcaat t ca atactttttt t tctcaatct ctcctctcaa atcaaacaac tttgat t gat ct t cgcaat c cacaacgaga ttagacgaga t cat ct t gcg at ct gt act c t t ggcgct t g aat ccgcgt g cgat act acg acaaat t at t agat gt gat g aacagt t gt g t gat t aaaag aagt t aagt t ggt aagat t a accaat t t gg gct at gt aat act ct gt cgg tgt t t gattt gagt gat agc gaat cat t at aat aat gaaa t t gat aaaaa t gat t acaaa ttttgcgaaa gat at t t t t c t ct ct ct agc at ct gaacat gat t ccacat caaacaagaa at ct agt gt a t gagcaagat gt cact ggac gt ggct t ccg ct aggt t cgg agaagatgag agagt t t ggt t t caat gct g t t ct gt aat a t agat t t aat ggt ccgt agt agaact gt t a t aat aaacct gt t gct t at t at ccagt t t t ggt t at t ct t t acaagcct a cgccgt aacc t aaagaggt g cact cct t t t gaact t at aa aat aagaaga at at at aaaa caaat t act t cgagaaaaaa gaagcgcagt gat gaagagt cgaggt gt t g ct ggcacaaa t aacat cggc aagact ccac taaggacgag t t t cact aag t t t ggagt t t gcaat aaaat at agagggt g at caaat gt g t ggacaat ga gt gacgat t c aacat t cacc t ct t t at gt c gccgt ct agt gcgat ggt t a cacat cat t a act t t t gt ca t aagt aagag ct gt t ct t gg accact t at g t t t t gt t cat at gacaat aa ggccacgacc act accgact acagaggaag aaacacgt cg acggcacaag tgcgggcaga t ct ct t t acc ct t aaact ca agacct ct t c gt gt at t t gg t t gt t ggt at at gagaat t t at t cgat t t t tt gt aacaaa t gt t t t at t a act t t aagt c ct t cgt t t gt ttttacagcg gcct t aat t t ccat gt acca t t t t ct t t t t t caacaaaat t t t t at at ga t ggcat act a ct t t t t t ggt acacat t aca at aaacacaa ct gat t t ggc aagaaat cgg acgcaaagac acgagcggct accagct cag t ct ggcct ct ct ct t t ct ct cagaat ccaa agt t t gt t ga ccat aagcca t t aat t act g gt t gt aat gg at cgat t cgt t t caat gaaa caaaccggaa aacat t t ccc gt acgat aaa cgct agat t t ccat ct cct t t t at at gt cg aat t t aagag aat t t at aaa aaat t t aaca aagat gccag aaagacaaac aacaccgctt cggt gat gct at t ct ccaat ct t agct at g t t gggagct g at ct gt t gt g ct ct aaaccc t ct ct caat t at aat t gact t t ggt aagt c gact gt t agc t aat cggagc t t ct gt cgt g gat ccagt t t aacaat gt t c t caacgt caa ggt ggt t act t ct caaat gt at gt gt at t t at t caaat ca gt aaagaaag ct t gt gt gaa caaact cgag aaaaaaaat g t t t agct aaa cccacacggc agct t ct aaa 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 146 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 146 aacttcaaat gcctattaca agtttatgta agagttcaac attaaatttc cgtaatagaa aggcaaaatt ttacctaaat ttatttcctg aagtacaaga ggatcatagt tgggttgacc Page 159 12689250 Sequence Listing.txt attttcagta attacaggtg cggtagtaaa tcttccactt ccaatgacag ct t acact ct ttcacaaaac t gcgat acat tctct t t ct t cct ct aat t g agt t at caaa acaaacat ac gt acaaat ac t aaat at gac t t at t cat ga t gt aaaggt t at ct t cct at agct t at gt c gaaaaagat c at ct cct cat agacaat gca cagcaat cac ct agaat aag t t t gaat ct g t ggct cacct at ccaaat gc agaagat cga gat t gagaat ct t t t t t t t t gt t t ct t gt a at agt t gt t g t ccacat gaa acgacgccgt at t t agt aaa acct aat cga t ccaat ct ac t cct t ccaag t gaat cgat g at at at agga cgaat gt gct at ggat ggat gat acagaac ccct ct t cgg agcct gccat ccgt gt t gt c t acat t at ga caaaat t at t ct t at gt at c t ggaaaaagt cagt gggat c aaggaat cag ttttcgaaaa at at cgt at t caaaaat cgg gt ct t gggga gcgcct cccc tgagcaaaac t caat gagca ttaaggggac t aggt t t ct t t cat t cct t t cct ccacaaa agagacact g t t t ctctct t gat aggccca t t t t t t cct c cggcggcgaa aagcat act g gt t t at gaat t at gaggaca cat t gggt ca t ccct at aga aaat gaaagg at gcgt ct t a t t t ctt acac agcaacat ag ccacaagaca caat ct agt t tt cct aaaga gacaat ct ca aagaat at at at t t at gaac cat agat aac cgt aat cat a agaaaat gaa cat cggt gca acat ct ct at agaaagagat at gat caat g ccagct t gga t ct gcgt at a t t t ctggaga t aacggagt t t gt ct t t t t t t accat at t g aaccacaaaa t gct gt cgct caat ct gaaa gaact gaaag aaat t ct acc taacaaccaa t t gat t gt aa at aaat cagt gagt ct t gcc t t agagt t t t caaaaact t c t ggaat gat a ccaaat t agt ccct ggagt c ct aaaacaat at gagaccct at t acaat gt cagagaat ct aaacagagac gaagaaggga ct caggacga caaact t t cc acacttt gga aagaaat t aa gct t agt gga t tat t t t cct t gat gat tga t ct ccat cac tttttttccc ggcgt t taat ccct agaat g cgt ct acat t gaagagaatt aaacat gt aa t t ct gcgt t g gt aggaat t c aggaat at t a ct t t acaaag aat at ggaga accaggct gc ct ggcct caa t aggct t t t a accagt caag aagaagt t aa cat t cat gt g gaacagct cc caggaacaaa ggat gt aat t gacccaacga tcgaaagaag act t ggt agt cgggaaaaaa t cggggt ccc aggaaat t t a t t ggagt ct a gt cgt ct at c t gagct ggt c t t ct ct ct aa ccaaaaat at gt t gggcct t aagat t at at t acact cgga gaaacat t t g agcggtagga t t at gccaaa t cgaat t t gc ccat act t cg aagt aacat a act t at ccaa gaccct t ct t acaaacat ca cat gat gcaa t ggcct gt aa cacat aacct aat cct t t ca at gct act ga t aagat caat ttcacaacag ggaaccgaat agaagcaaaa t aacct t gat aagaat ct gc aagaaat caa ct caaaact c gggagaaaaa aagacagaaa t at caact aa t cat t acaca t t t atct t ct ccaacat aaa t gt gat at t t at agt gcaaa gct t agacct 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 147 <211> 2000 <212> DNA Page 160 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 147 act gcagcaa agat t at at g cagt gt t t ga t gaaaagt ga t ct ccat t t c gt acccacgc agct ggt at g aaat gcaaga t ct t acgct a gt gt cat gat gagct ct gca t t at aaagaa ccct cgacga t t agt at gt a gggt ggat t c gaggct t at t aggat cagt t aaccct cgag gct t ggcgat t gat ct t tag ttat t gt t ct t ggat t t ct t tgt t t t gtgt aagt aaaaaa gagaat cat a aat gt at at t cccaaagt t a t t at caaat g t t t t gagtta cat aaaat ca t ct ccgt cac ct t t ctcttt cccat aat ca aat t agcat g ggaacgaaga agagct ct at t ct gat acat gagccagagg gct act gcac aagact at cg ccagaagat g gt t t t t cgt t ggt t atcgcc agct ggt gt t gct t t t aaac t cct gagat g gact t cgcat agat t gat ca gctgaaccgg ct gaacaaga ggt at cact a ggt cat at ga at gt gat at t caaat t t caa ct t t t ggcaa at gaat gggg ccact agacc aaat aat t t a aaaaat ccaa t t aaaaat cg agaaacact t act agggt t a caccggct ct ct t at ct t aa at aaagaacc gct at t t aat t t gtgagaca aaagt t t gt c t t ct gt gaaa t t ctcaggga cagacgctgg aggt t ggggt gcaaggtat t gt t t t gcttc at t gacat ga gaacacaaaa aat gt acaac gtct t t t t gt t t gt ggat gc aggt t ggt gg act cgt ccac aact ct cggc tttgcaggag t t gtgagccc t t ggt t cct g gat t at aaca aact ccaaat cgtt ccgaga aaacgcccac gt t t t gt gat cgt t t gt t ca t t agacgt t a agt t gggcca gt t ct t cttc gt at t t cggc tctcct t t t t agt agt agt t at at gct t t t caat ggct aa ct t acat act aat gt t gt ag gctt aggaac ccagt t gat g t t t cact gga t gcat agat a t gt at cat t t at agagact c t t gat t t caa tt cat act t c t t t gatgttt agacaagct g gat cat agt g gcccgagtgg t gat cagcgt gt t at at t ga t t t at at t ga t gt gaat gt g gacagt t acg t ct gaacat t at cgaact cg at t gat at at at aaacagt a aaaat aagaa at cacaat at t aaaaat at a t t atagagcg at t t gt ggag t aaaat t t gt t t ct t cagt t t t cct t gagc ggatgaagcc cat ct ct acg t at at act gg at t act cat a gggatgctgc t act ct ct t c tgaccaaacc ct gcaact ga gt at gaaat a agagt ct gaa t t t gtct t t c gtgatgcaga aat t at t gga t at gat aaca aggatagaag gt gcagat ct gcgagt at t g gct at t gt ga aacct t t t ct aagaaacaca gct ct aaaat ggacct ct cg agt t t gct ag aacagaacaa aaaaact at c aaaggaact c ggt ct agt ag ccgcat at aa agaagaaggt gacaaagt aa at gcaagcaa agagact t t a aagggat t gc t t agt t cact agact agt gt accaccct ca t gaat ct ggt t cct cact gc aaaagaaaaa t t t ggt acac ggat t gccgg gct ct t ccag aaaaaaagaa aagt gaat ga act at cat ga ct ct ct gggg tcaagaaagc cgcaagct gc ttgt t ct t t g t gcaagagt g at caagt t t g act cat t t t a ct aat at aaa cacccaaagc ct t t t aat ac gat at t t t ca caaagt t aaa tttttttttt at gaaaggcc gct t cat cct act ct cat cc aat gagct aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 tctcgtcttt aagatcatat gtcaaaactg aatttgagtt ttgatatatt caggtaaaga Page 161 12689250 Sequence Listing.txt gagaagatag aaacataaaa <210> 148 <211> 2000 <212> DNA <213> Arabidopsis thaliana 2000 <400> 148 t cacaat t cc aact at agac gcat caagat t gt aagctt c taaaaacaaa ttgcggcgag tagcgggagc gcat t gtgat at agcagcgt ctcacggtgg att ct gaaag t aaat t t t at aaat t at t t t aagaaataaa aagt gagaat ggt t t acat g cggt t t t gag acaaagcgag gacacgcgcg ttggtgcgag gatgt t gct c gtgtggt t t t tgcat t t t gt caaagaatcg ggaacaagtt at gacat at g gt t t t t gcaa t t t t ctctct aacat t t gat ttacat t t t a t aaact gcgg at at ccgaag at agagact c caaat ct t ga agaagt t gt a aggaatagt t agt t gaagga ttgggaaat g at ct gct gcc caagtgacga t t t gt t t agt t t at t aact a acaacagaaa ct t t agt t t c gt t t t aaaat aaaaaaaaaa t agat t agaa aaacgcgagg tgggcgaat t ct cct ct cct t agaggt cac tgtgcagcca tagatgaaga t t t t ctcat t gaggtgaact ct cggt t aca ct ct aaaat g t acacat aag tggtacctta actatatact tctcttagac taagttatga ttttccgagc aacgggagac att cgct t t a at at ct t gt g gt t gaagtat gggatgataa ggat t gt at g ct t ggt gat g ggagctctct t gat gat t ct cat gt ct at t gaaatgaaaa aaaaaacttt caagcttctt caaaat t aat aaaaaaaat g at aaacat ga tcggtgcat c ccggagcact cct ct at t at caaaaaaaat tttgt t acac aat cgaagt g ttgctagttt at acaaaggt agcactcagt t t at agat ac act t t aacat atgt t t t agg tgtaaacaaa gaaat t t cga t gaaccat at gcgatgtagt agaagct t gg at cat t at ag agtt at caga t cct t gccgc t t t t t gtagt t aat t acaat t aaaatt cca t t aaaaaat g t at cat ccca ttgggttcgg ct t t cgagat t t t t gtgggt tggt t atgaa ctgat t ggct taat t t t t cg ct at t t t gag aggt t gaagc agtt gaat at cctgt t t t t a tgttgcaacg ggact t t t gg gaat cct t t g aat ccaact t t ct ccagat a aagctcaagg tagtct t t t t gcagat aat g cgcggaacga aagat t agag ggtat t t aga t cacgt cgt c atgtggcgac t ctt acaat c t t aagt t aaa aat t t at t t t aat gt at t at act att ccaa t t t gcat tat cacgcgttgg t t acccct ac atggagcgcg gaaaaaatag tcgt t ct t ct at act aaaaa t t at aact ga tttctgaaca tgt t t t cttg at aacatt ct ccaagacaat tt gaat aaag t t t t t t at at tggctgcaat aagt t t t cgg gcgt t t gtat gat t caact c gcggctcgtt aagaaaatga aact at ctt c att gagcat t ggt cat caag aaataagaac at t acaat at ggt t t t t aga t cact agt ga t ct t t ggt t c at aat t aat a t at t t t ggaa agaaat t gat tgaggtgaag aaat agt agt t ct gaaagt t tat t t cgttt aaat t ggat t t at gaaaat t act t t aggag cctt aatt ca tttttttttt gaaaaagtt g gaagctacaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 Page 162 acaagat t t a ctt caat gac tgggctaaaa cgccagat t a gaacaaggaa 12689250 Sequence Listing.txt aaacatcaaa gat t ccatct aaact t catt catct t caat ct t caacat c tagtatgtat gtacataagt aaaattgttg ataagaaaac aaaacaatga tagcccataa aaggcccatt aaacttgggt ttagacttta gattcaacga gtgagtcaca taaccctctt ggaaagagtc tcaacact t g cagagaaaaa gatcccggaa 1800 1860 1920 1980 2000 <210> <211> <212> <213> 149 2000 DNA Arabidopsis thal i ana <400> 149 accaat cct c ggt aaggt cg agt t t t ct at cgcaat caca t gt t caacca gct at at at c aaat t ccagc t gt aaccct t cct t aacat c at aggatt ac cact at cat t cact agctt c acact t aaat ct t t cgt aag at t accagct gct t cat cct gaat t cact a agat at t ccc at gt caat cg gct t t t cact cgaaggtgga at gcgagaaa t t aaccct cc gat aagt at t at at caaaat t t t t t t gt t t acagagacaa gt cgcaagt g at ct t t gct c t cgaaaat t t ttaggaaacc aaccat attt at t cccct ct t gggaaagt a aagaat caca agaaacagat act agct agc agaaaaggt a ct aaacgat t gat cagaaac ct t ct gcgt c cgaaaacacc ct aaaaacat t aaat t cct t gaccat t acc tttcaagaca gct t aagggt at t t at gt at aat t ct at t a gt t act t gat ct caaat t ct cact gaat ca cagt acaagc gggagaagct ct ct t agt cc aat t aacact acct cat ggc ggt gt acat g cact cat ct c aaagcagtt a acacact act caaaacct ca gt caat t t aa t at caccaac at acat aagc aagcct t t cg agat gccat t gaaat t gggt aaaaaaaat g aggt t at t ga t agat aaacg ttggaagaaa gt t at at aaa ccct t t t ccc aat gt acat g t gct t at at g t ct ct actt g ct t t gaaat t tacaagccaa acagct ccaa at gaaat cag cat cat gaca t aat gcat ga ctt cat ct cc at gaat at t a gaaat t aat a gagact gt ac ggt cagaat c caaggagtga tt t aaaact g at at at t cat gt cgcaat t a aaaat aaaag aaaccct aga t ggagtt ct c aaggcagaga gaaggat ct a ggaagggact t ccact at ca t t aat at t aa gt t t t t t at g tt ctt ccaca at ggtt gt aa agtt ccaaac aagct t cat a at t t caaaat agat aagcaa gaaaat t gct acccaacacc aacccct aaa gaaact aat t aat aacaat c caat t t caag aaat gt ggaa at caagagga t gat cgt t at t t ct t cacaa ct at t cgaaa t t at t cgcaa agact t gt aa gagagagaaa gcggat gt t a t t at cgacat gcaact aagc t t t gt at cga aat t gaaaat t t t gggtcag gaat t t t cag t t t gagt t cc accagaacaa acat aacct g caagcagt gc t gcct t t t ca aaat ccgct g cccaaat aac aagagacaca caacacact c acgt t t gt t t gagt at t aaa gaaagagaac agt t cct t ca at t cccacct gact caaacg aacat caaac aagccct aaa ttggggaaaa aaat t ct t aa tttcccgcca at aagcacaa gat t aaacgg t t at at t t at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 tgataactaa tgtttaacaa at at cat atc tgtgtgcaac tagtcaaatc Page 163 t at ccacaaa t gat aacct c t t t aaat caa at at t ggct a t acgt at acg t gagt agcaa gaaaacgcgt agagagagag cacacatact taaaaaaaaa aatcacacac at at aaat t t gcccaataag ccaatcgcag cagaagt t t c agagagagat 12689250 Sequence aat agagt aa t aagcat at t attaaattgg gggataaaat tttttgtttt gtttcttgac caatttggat aatatgact t aggtt gaat a t gact t t gac t cgt agt t ga aagt ct ct at tcgt t ctaca gatagagaga Li st i ng. txt t t t aat t at c at t gat aaaa at ct t aat t a tat t gggcct t t t t t caggt t aact t ct t c gaagagaacg gt agat t aat t ct aaagt t a t aaaat ct aa aat at gat ga tt at ggt cag t ccgat ct ga agaacgaagt 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 150 2000 DNA Arabidopsis thal i ana <400> 150 t ccaacatt a t gat aat cga at gt aat t aa aacgat t t cg cgcat aat t g gt gt t t at gt act at aaagc taagcaggag at ccacaaaa t cgt aagaac t acgagaagt ct cat caagc gat aaccaca agaggt gt t c agt gagaaac ct ccacaaaa t t gagat at g t ct ct t gacc cgat t gt gga actct t t t t c actctttttt gaat aaat t a ct at at agt a t gcaagcat a t t t at acat a t t aaat cccc t t t aat ct ca ggcctt gt ct t acat gt at c acaaaagaaa t gt gagaggt aagaacaaaa gccacgat ct cggt gcat gt ct ct t ct t gt cat aaccccc gaggt at agg gt t t gat gct t t gat aaat t ccgat gt aat t cgt cat t ag ccat cgccgg t caccagat c aagt t t aaat t at at at t at at t aat t t ag t cgaat t act at gaaagt aa ccat t t gcat ccgt t t t at t tacacaaaaa acat ccaagt t t ct t agcat tgaaagagcc t t t gaagat c agagagcat t t t at ct ccaa aaacgaagga tttgtgagaa cct t ccct t t t gagaaact t gaagcct agt gt gt gt aat c at ccat cat t cat catt aga t t t t t gt t ga t gat aaat ct acat t t at t c aaaggt t aat at t t aggcaa at caat ct t t gct act t t gt gagagt t t at t aaaacat ga at aaagat at aaaaact aaa t t cct gat t t t t t ggcact c ccacaagt cg gat gcaagt a at ct ct ggag agct t t agcc ccat t t gcgt t gct aaaagt at cat cgt ca gctct t t t t c act cct t aca at cct t at at t gagatt cga agcaaaaaaa t t t caggct a at gt at gat g aagggaaaca gt cat act ca t t agt at gct agcagcaaaa t gat at aaat cccat ct t t c t gct t t gcaa at ct ct t t ct gct aggct t c agagagccaa gcct t acct a agt agt t gaa t t gacgct cc t t aaaaacct t t at t gct gg t caccggat c at t t t at aat t t t t t at t gt cat at t t at t aaaat t aggt at cgt agat a ct at gt ct gc t cct t t t aat aaat aagt t g tgcttgggaa gagagt gtt c ct aat cat t c act t t t t aga cct t t gt aat t aaact cat g t at ccgt t t t agaaacacca ggcaagt gt a gct t ct t agc t ct cgcat ga t gaaccat gt at ccat cat c cat cat cat c act aaat t t t t caat ggt gg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 Page 164 12689250 Sequence Listing.txt aattcttaag tggatttctt taaaaaaata aatatataat attataatat acat aat aat act at aaaaa t gt cat at at caagt t gagt t t gt t gagt a aagt gagt gt at caacagt c gt t cgagt cc t gt ct t aat t ggaaacagaa cggaatcagg t at t agaaat t gaaaaact t at t caaccaa aat cct t aaa ct t at aat t t at aat gaaaa cgt t gt agt c cggcaacgga caagct ct aa ggt t t t ccac aagacaaaaa at t aaaat t c gaaccacgat at cagct ct t agaaaaaaaa t t at t at cga gacagat acg t agct ggt ca gt t t gt t t t t tcaaaggccc t gt aaaagat t aaacat aca aagt gt t aaa t at t gt t cac aaagt t t gat at ggt ggagt t agt gt t aaa ggat act cgg acct cct t ct at t t gaaat t gaaaagagt g t aat at aaaa ct t gat t cgt gct t agt ct a aat t t t gaag aagt gaat t t t aat t t at t t ct ct cacccg t t t t t t aat g t t ct t ct cgg t agagagact at t t caccaa acaat aaaaa gcaaaat t t g at cacgaaat t t t agat t t t tttttgggaa ct t t gact gt agagacccgg ggccgatttt t ccct ccaac cgaaat caat 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 151 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 151 gcttaagtct cctttttcta taattgttgt tgtgattgag gtcatctctt atggcttaca ttgtaaagaa aatttgagaa at ataaattt tattcatgtt ttatgtgaaa cccatatgca tttcggatcg atcatttttc cgaggaaata ttgagatgta acaat t at t c cttttatttc aattctagat cttaattact cgtgttataa tttaaaattt tgttacttta cgaaaaatct tcgatcggtg tgcagtgcac cgacactttt cttatttaat tgtgattcat agatttttgt ttataacacg actgtgaaaa gtttaacaaa act t t aaaaa ctgaatatct aaatgaatat gcataatctc attttatttt tatcgattgg at t gat aaaa aaaacaat t a tacaacttag act aacacat gt gcacgct a t gccagt cac gaaaacttta tgcaaattaa agcaaaagag aagtttgatt ttttatttgc aacttgtgga tgactggctt ccataaactt ttaaagctct at acaaat t t cgt t gagct t gt at t gct ac at t at gct at gtggagaaaa gt aaaat t t a t aaaaagat c ct aacat gt t agt cct cgt g tttacaaaga agaaagacgc gcat at agaa t t aacat gaa t t t ctt gacg aagt at ct t a t aaat aaaaa t at gt at ct a ct ct t t caga agt at t aat t cgt t at at ag gaat ct t aac ct gt at t at t t caat gaaaa acgaat t ct a aaaaagagaa t t ggct gt t g ccaaaccat a t aacaagct a at ct t at aag t cagt at t ct agaaccaat t t t ggat aacc taaaaagaaa agcaaat t aa t at gat gaag t t cct ct ct a gt cacagt t t t cgagccat t t t t t t t gt at tttcacccgg ggaat t acac at t at at t t t aggt ct gaac cgcagct aac ct t t t t at t t ggat gaat t a at t gat gat a caaggt ccga tcgaaaacac t ggcgaat ct aaagaaaaaa aaagaagaaa aagacat caa t ccat t t cat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 taacaccaaa acctttttct tctctctaac caaacaagaa gaggactcca ttttcgcaag Page 165 12689250 Sequence Listing.txt gtatgcttca tttgtatcga tctttttata gttctgcaag tttgatatgt actcagaat a aat t gaat at t t t aggat t t gt t act t cgg t t gaat at ga aaagggtat t tttttttttt t gt aat gat a t agaaaagt t t ct t t t t gt t act t t t gaat at ggat t aca caaaat aaat t agaaat gt a t at t t t ccaa <210> 152 t t t gt t t t ag caagaagat t ggaat t gat a at aaggggat t t t ggt at ct t t t t t t t gt c at aat t gt t t ggt ct gagaa t gt t t gt t t t agt ct aaat t t aat agt at a aaat t t gt ca ct at t at at t aaaat gat ag t ct t gt t agt gcat at t aat at t t gaaacg gat at gat t t caagat acaa aaaccgatt a agt gct ct ac aagct at aag gt t ct t ggct t ccaat t t gt at t at aaaaa gt ccgcat gt at t at t gaaa t agaact aca at cagat cat at t caacaca t t ct t acat a gcat cccat a gcaat gt t aa act ct gagca at at t t aaag agt t at caca aat aacgt gg at at t at cca t accat at ga t t aaccaaat gt at at at t a t aat gt gaat t t t ct ct gcg gt aaaagt t g at agcgat t a gt aat gaat t aggt t at aat acgcacat gg t t t gacat t t att cgaggaa ct cgaaaat g aagcagt aat cgt aagct t g ct at t agcca gacat cgaag at ccaacaag agat gat gt g gcaatttttt at gat agat t tacagacaac acgat ggat g ct gat aat ct t ctt aaaaca t aggt aat t t t aat t cgcat t t t at t t at t 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <211> <212> <213> 2000 DNA Arabidopsis thal i ana <400> 152 agt gacat ca gt gt cggt gg aaaaaat t gg t gat ct t ggt aggat t aaga t t acat at t t gaaacat aat gt t t acct ct aat t t t gcag aaacaacgtt at t t aact t c aacgt gt t t t t acgaat ct a aacaaaaagt t act t gaaaa ct agaaggt c t cacagt t ca aaaat ccaaa aaaagcaccg aaagaagaag at t t agt t t t agt t ccgcaa at gaaaat t a ggacat at t a t at gt t aaaa aacccacaaa at t acat cac ct gacaat t a aat t gaat at aat aagaat c cccagagact ttcat t t t t a t at ccaaat t at act t cccc aat cgaat t a aat cat t ct a caaat aat ag gt t aagaat a gt at acct at cat aggaaaa cagat aaagt at ggat caag ct acaat aag gt acaaat gg cacct t gagt cat t t ggcga at t gaaaaaa t gaagt gaat cagat gaaaa gt agt aat gt aacaaaat t a t ggagcct t t t gaaagat aa t t acct t ct t gt t caaaat g at agggaaca at t cat t caa t aaaat agt t aat t at at ct t cat cgaaaa cact t ggat a t caaat t t gt t t gaat ct gc aaaaaaat aa cct ct cat t t gt cct t t gt t agct t t t aat cat aat t t ca agt t t at gt g t act ct aaac ct caagagt g t at t t gagaa ggat at at aa t gaact aaat gaaagat t cc aggaaacaaa t cat t t aaaa agat t caat g aaaaaat t ct t at agct ct g act gaaacga t agat at ct a gat t t t t ct t aaaaat at ca t aaat t caat aagagt ggt g gacttt acga at aact acag aat cgaaat g at aacct aaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 166 12689250 Sequence Listing.txt aggagcggtt tatccgttgt attaaaaatt taacaataac aaatgtttca aat cgaccca agccgccccc t t t t agt cat ggt t t t t t gt aggt aaact a t cat agcact at agt aacaa gcat acat t t at at gt t t t g act t t gt t at at gat t t t t t aat t acat aa t aat cacgac caaagt gaaa aacagt agca t t at t t t at g gct t gct cac atgggaggca t gaat at at t t t t gt gt gt a aggggccact ttgacaaaaa t ct aacaagc gt gt aggact agt t gggt t t gagt aat t t c gcaacagt cc tttttctcaa t t t ctgt t t a gt t t ct t at a tgaacggaac at gaat aaag aat t caccaa aaaaat aaat t gact caact aaggt gt gat t t gact tt ag t t at t aacaa gat gccact c aat gt gaat t cat ccacat t ct aact agca t agat t t ct t aat gaaact c at at gaaagt at t caaact t aagct t at aa at aagt ct ac at acaat t at ct t aat gat a ct t gcct t t a aaagaaaact caagagtggt ct at t ccacc aagaagcat a aat gagacca gt t t agt t aa aat at ct cca t gat ct at gg accgaaat t a cacaaat t aa t cat acat t a acact aat t t t gt gt ct gca gct t t t caat t gt agt ct t a t at gat ct gt at t t act at a ccat t t ccct gt agt gagag ccat aaat at agtttttttt aaat ccaaaa ccaaaat act acct aaact t t cact caccc aat t t ccagt tttttatcag aaaaaaaaaa agct gt cgca at t t gat aaa t t t cat t gaa t aat aat at t cacgt act t a acct t caaat t t at t t aagg aagaaagat a ggt aat t gca t at t t gt t at gt aagat t ct ct t agt gat t ct t ct aaat g at t t at t t t c gt cct t cgaa agaacgt ct a t t t ggaaaat acgt agaaaa t caat t aagt t aat aaaaaa t at agat t ct gt t ggt t t ct aacat at aga t aagct t ct t aggct gcat t gagaaaaaaa 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 153 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 153 ggct gct ggt ggagt t t gt c cagt agcaag t gat cat gga t t t ctgt t aa aaat aaagca t act t t t cgg agt acat at g gacaact gag aaccaat t gt agagatt ct t agcccgt gat ct gaaacgt g cacggccct g gacgcat cgc at gcct cct t gct at ggat t t at aat at t a gt gt t t t t t t at acgt aat a t t t t gcct aa t gact ct t gt ggaagcatt c at cct at cgt gat at gaaat t ccgt gat t g ccaacgaccc t t cct aat t t agaaccgttt t t t gt t gtac t gt at ccaag aaagcaaagt ct t gat gaag aagagagtcg cat cct cagg cgagagcat t t at cgt caaa at gggt acct aact t caaat gt t t t t gtat aaaat at cag ttcgtat t t g cct t gt aaca aagat aaaag t ggagagt gt at ggacggt g tctgt t t t ct gct ct ccagt at t t at t t t a ct aaat t t at ttttaccaaa aacgat act t at gt t at at a ggt gat gagc agagaaaagc ggct cagcgt aggat ct ct c cagagct ct c ct ct t t ct gt t t t t t at at a at gat t gat a accgt aaaaa acat ggt ggg t gt t gct aga aacat cgt ag gcgt ct cagg ct caat gaac 120 180 240 300 360 420 480 540 600 660 cgttgctcac agcgaaaaca cctttgatgg gagcggtatc aggaggctct Page 167 12689250 Sequence Listing.txt t gt ccaat aa t t at gct aaa ggtcgcaggc ct aaaggat a t aggcgct gt t at t t t gcct t gt t t t aat c t t aagt t t gt at ggagat ga t ggggagt at ttcacgagcc ct t caaact a t gact t t t ga cagat gt at a cctttttttt gt t aaaat t t t t aaaagt gt agct t ggct a t t acggt cat acacaaaaga t t ccagccgt tcggaagaga at t cgaat t c ggaaaacttt act aaacgt t t ct acgagct gt cccaggt a t t gaat t t ac aaaaat t at t t t t t gt t t at t ggaaat gt g aat gt t gcag gct cacct cg at cat agaag gt gt t t accc aaat t gaaag t t t t t t t ct a gt t t acgt t t gaaat aaat a aaact acaag tagcacaaaa ggaccaaaag t t ct t t ccga gggat aaggt gat aaggt aa t t aaat gat g gccatggaga gt gct t gaca aat aat gccc at gt t acaat agt gaat t t t aagat ggat a gtgaaggaag ct cgagggt a gct acct t t c ggaat aagca ccaaaagttt t t t ggggt t t at gaccat gg at gat gct t g t t ct gaagt t gagaccagat aaat aat t t g gt ccaaaaac t t ct cggat t actaccatac at at at atgt gt aacgagt g cgat t ccaaa ct gt t gcacc cgt ct aaat t t at t t gt t aa t at t t t t at t at at gat aat aagagt t t aa gt cccat at c t gccgt caac gcact agcag t agat t aat g cct ct t gt t g gat gcaat gt aat ggct at g aat t gaagaa t agt acaaaa t t t t t at t at gaat aaact g t t t cctggga at gat gat cc agaccgt cag at cggccact at t t t gtct t acaaat gaaa ttttgaacgg ggaagcgt t g gaaaacaat g ggt t t cct ct t t cgact gat caacaaat gt aggaaaaccg gt gt ggt gat t t act ct gt t at gaaacat t t t t gaaaat t act t agct aa at t at t at t g t at ct ct cat at caaacgca t at ct agct t ggaacggttt ggt aaggt gt ttaccaccaa t t aaat t gt t ccagaat t ag cat t gatt ag aagat ggt ga gcagagat at aact cggt gg acagaggagc t at at ggt t t tct t t acttt t ct act cat g t t t t aat t t c t gagt t at ct t gat t acaag at t t aat t aa gt aagt ggaa tcgccggagt tcgccgagaa 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 154 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 154 acaattattc gcttccatct attctggtgg ccttattcac ccttatccat tttttttttt tgaacagttt ggtttttaca tggacctcaa agatttcaga gttgggaaag tgtcaatct t accgcttgtt tgatctttct ggcattgatt ttaaaattct tgatctctct atcaattttt tggttgggta ggacgtttta acgctgcttc agcttcctct gagtgcgtgg ggacctgttt t t gct aact t t t t t gctat g cct aat aaaa cagt gcacca aaaat t ct t g t t gaagat aa cat at agcat gt t ct t cct t Page 16E ggcat gacgt aat gt t aat a gct aaaaaca aagact cat a at ct at ct t t t t gatggcca aaat cacaaa ct t gt ccaag t t t t t t t t ct t cat at gat a agaaat caat agcaccgt ga ct ggcat t ga aat gagaat c ttgggcaaca aaacaacgca 120 180 240 300 360 420 480 12689250 Sequence Listing.txt aaccaagaaa tcgatcgatt gcatcgcaac aacttgaaat ggtagctcat t gcat t aaag t cct at at aa gagaaagt at caaaaact ag gt cat t aagt t t ggt gact g caat cct ct t caaact act t at at cat caa gt cgt t ct ag t t at t ct t aa aagaat t t t a at aact caat t t t ggt caaa aaat agt ccc gagacacaaa ct aat acct t t ct at aaat t ct t aat t t at t aat aaat t t gt t act t at c at gacaat t t t t at gt gt ca taggt t t t t t gaagaagacg aacgct t aaa at t aat ggct gt aat agt t a aaaccaggaa t at t at at at gaagt t t t t g t gt ct acaaa t at t ct ct ca t ct t aat at c ct t t gt agca ttatct t t gc act at t t gt g t cct gct gt t caat t ct t t c aaat agcaag ct t aacaaag aaaat at t t t aat aat t acg agaggt t t t a t agaaat gt g aagacaagt a gt t t gat t t a t agt aagact ct aacacacg aagaagaat c aat at t gt t t tttttttttg t aaat t agct aaact t agga aact acat t c t cacgt ggt a t t aaat acat aaact at t gc aact t caact t cgaaat agt cagat t t t t a act at ct aga gat ccat aac agt cgt aagc caacaaaact ct t gat acaa t t ct agt t ct at aat t t aat ct aaat t gt a aat t aacgaa t gaagt at ca t t t gt cacct ttttgtctac cct ct aat ct at ct aaagt t t acaaat gt a t aaccagt aa aaat cat aga t at at aat ct aacaagaacg t at cacgaaa at t ggt gt gc t t t aaagt aa t t t aaat gt c aaaacct t t a ctt gaagcaa t caacaaat a t agaat at t a aaaacat t aa t gaaacct ca aaat t aat aa gaaat t t agt gaaacaact a tact t t t gt t cgt gat t aaa aact agagac t at agt agaa ccgcgcacac acaccgcct a aat t t cat ca aacat t act c gtt aagcaaa ct gt t t cgt c t att cgccaa aaagct t t at aaat acgt t t agcaaacgt a aaaaaaat ga agcat at at a aaagt caaaa t gt gt t t aac caagat agat cacaaacaaa at gaat t aat t t t aacct aa aaaccat t aa at t cgagt ac ct t gt gaat g cgt t t aat ga t ct ct cacag agacgaattt acacacaccc aat cat gt gt gat t aaaaga aaaagaaaat accagaaaaa gt t aat caac at t gt acat t cct aaagact gt at t at aac t ct cgagat g aat t aacacg gcgaaat aat at t caact aa at gagt agac aatttttttt gagat t aaag t t ct aaaat a act cgat at a aaat at cact t ct caat at t at ccct gaat gt t aaaaaaa caccaacct a tcaacgcagc at aacccct t tcacgaagaa 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 155 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 155 ataacgtcaa tcccgttgct ttcacaagag tttacaaagt cattgtcgta cagcgcttca ttggtaatga tgttcaataa ccttcctttg ctaaacgatt ttgacatgaa tttcattctc tcttgataaa ggtgacttcc taactttgga taaaactcta aaagtttgaa tgcaacaggg tgattagttt tacacttgtg gccagtagaa taaaactttc ttaagcattt tttacatctt attgaaaacg aaccatttat caccattctc cgaaactgac aaaaatctct gacatgaagc Page 169 120 180 240 300 12689250 Sequence Listing.txt cccatcaagc catgtggggt ttttcccacg tacgtacacc caaagttcac atgcatgcaa t gct aaaaaa at at ct t ct c aaagct ct t a at cat ct ccc ccaaagggt c aat gact t t g gcagt gact c gct cgt ct t g gaggt cct t g gt t t act t ga t t t cagct aa ccagggct cc gaagccacaa act aat t aag gt t at t t cga at gagt t gt t act aat aagc act aat aagc t t t gggcct t t at t t t ct ag aaaaaaaaac gt t t ctctta at gt cct t gg ttct t t gat t cgt agt t cgt caat t t ct t g gt t t gt t t t g t t gctct ct t at aagat gcc agt ct ct at a t ct at aaaac t cat t t caac ct caaagccc cat t t accgg cagcct t t gc act t agccgg t agt cat t ca agact ct caa act ct gggaa aaat t ct acc cgt t cctt ag ct t ct ct t ag gt t cct tt ag at t gt t ct ca at t at at t gc aact at t ag t t gggccaat cgaagt at t a ct agat cgct t t gt t at t t c at gat t gaat t at gagt t at caggt t t t t a t t t gt t t gt g at t t ct t at c cctt cgcaga t ct agat t gc taaaccagac at t ct aaaga catctct t t t at caggt t ac cct cggcaca t cct gct t ac t ggt ggt gt c aggcaccat t cagaggagat aggccct gcc at t t gt ct g cgat gcagaa t t t cat t gt t cat t t t gagt at gt t t cat t agat t t ct aa ccgat gt t ca t aat t aaaac t ct t ct gct c ct t cacat ct t at t t gat t t cgggaaat t a ct cgat t t t a t gt t t gt agt caat t ct cat acagt t t ct a ttcaagacca t gcct ct ct t aaaat gaaga gcct ct gt t c t ct t gcaaga gct ggaaaca gcaggcat ca at ccct ct cc tgcgccggat t ct at ggt gt t t ggcct t t g tttgcaaacg gtt aagaagc cat t t t t t t a ttgt t t cttc t t at agt ct c gt agct t gt c aaaaaat t t a t cggt t t t gg t gat t ct t ca ctt cgaacga cgcgt ct gat t t gat t ccgt t gat t t t agg gat ct t t gt t at aagat t ag gt t t ggt t at t cct cacgt t t ct gcct cat t gat aat cca aagact t t t g accct gacca ct t ccaacat at ggcct t gg acact cat cc t t at ct cct c tt cct caagg t cgcat t cgg at t t gcct t c t t aagggt gt aaaat gt ggt t aat gat t t g t t t ggct t ca aact at at ct aaat aaat aa t caccat t at t ct ct ct cac ttctctcttt ct ct cgt t t a t t cat ct at g aat act aaga t t gat aagaa ggt t ct agat t t t t gct gat aaccat at ca aact t caaca aat t t t ct t c t gt agct gac agt caccgaa cat t aaagcc cgt at ct ct g t ggt gct t ct cgccaacaaa act t ct t cat aagct cct cc agaact t gt t gct t ggggga t gt gcgt t t t gt t t t t ctt t aat t gt aat a cgacat t t gc tgt t t t ggcc at aagaaggc agaaaaaagt caaggt acct ggt ct gt at g ct t t t ct cga t ct ct caat t gt aat at cgt cagt t ct ct g cgat t act ac 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 156 <211> 1217 <212> DNA <213> Arabidopsis thaliana <400> 156 caactatttt tatgtatgca agagtcagca tatgtataat tgattcagaa tcgttttgac Page 170 12689250 Sequence Listing.txt gagttcggat gtagtagtag ccattattta atgtacatac taatcgtgaa at gaaacat t t cacggt ct g t gt at gaaat cggt t t aagt acagt cat ga t aaaaat t ag t gt ggt cgaa agt t gt aaga t ct cgt t gt c aacaaggaag t gaat ct t cc ggat ct act t t cgt t t aat t agt t gaccga gat ccat gt t gaagt t agat agat gaagt t agaaagct at t t gcagct ca gt at ct t at t aat t aat t at ct aat t gaac t aaccact aa agccat caaa ttaacacgag at gat t cgt g gat aaacccg ct cct cact t aagact aaga t caat ct cat t at t t gct gg t ggat ct gt g t cagt t agct cat gt t acct t gaat ct gaa t gt gt at aga t t ct gat t ca taaaaaa gt at aaat at gat acaat t c aagccaacca aaaaacggag gcaaaagaac ggaaaaggct t ct gt cgat t cct at at aaa t cat cagccg gagaaagt aa ct t ct t ccgc at ct cgat ct aacct ccact cgat t at agc gggaaat gat cact gt caat tt ctt cgaaa at cagggt t t ccat aaacac t aat agaaaa cgacgacgac ct gt cat gt a t aat ccaagg gt ct gacagc t t aat t at t t t t cat at at t t t t t gaat ct gagat aat cc t ct t t ct t t c tgt t t t ctca aaat ct t t t g t accagaat t t t gt at at gt gt t agat t ga ct t t aggat t at t t gact gt at cat gaaag cgaat t aaat t aacgt t gcc acacgcggat gct gagat ga caggt cacgt ttttgaaagg t t cct ct ccg ccggcgact t aggagat t ca caaggt aat a at t t cct t ga gt t t t act ag t ggct t gacc gaat t gaaat at ct gaacac t gt agt gt cg at t gaact ct t agt gat at g acact tt ct t t acgt t gaat t ggat t gact cgagcaggt c t t aat t agt t t at ct t t acc ccgaaaat aa ct t t gaat t g gacagagaag t tct ccgt t t ggaact t tct gat ct ggaat aat cgat ct a t t gatggaga ct gaact gt t t gt t t aagt t t acgt t gaac t t t t gtgtgt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1217 <210> <211> <212> <213> 157 2000 DNA Arabidopsis thal i ana <400> 157 gt cacct at t aaagt ggt aa ct ct t gt gt g act at ct cac aaggat caga aaggctcagg gagt caaaat gcatt ct ccg gaccat gt ct aaagt aggcc gcggt t gggt acat acaaaa ct aagt aaac aaacct gcat gaaggt gat c aaccacaaaa act aagaaac caat t at gat gcagaaat ca acacaat aga cgat caat ag gcgagat t at agaacct gt g t at cgat cca aaacct gat a acat at cact aat gcaaat t ccaaacct t a gt agt aacac at t gaggaat gt ccct agga t ct caagt ct t gaaagt aaa att ct cccat at t at gt aga gaat at t t ca ct ct cat gaa accaaaaaca aagaat t t ca caagaaaaca accaat agt a cat t t caat g acacct ct ga ct t at at ct t t gt agct gaa aat t agct t c aaggct ct ca act accct at t gagat at t a gat acat aca agt aaaaagc t caaaact ct act cat t t t c ct t acat at t gct aagat at agcat t t t at cat t gct t t t aagagt at aa aagaagaaga ct t gggaagg 120 180 240 300 360 420 480 540 600 660 accagcacag cagacgtctg ttactactaa ctaactcaga cttgtcattg aactatatta Page 171 12689250 Sequence Listing.txt tatacggctc acttgttttg ctgcagtaac tggcttatct tcttttctgg cttgcgttaa t aacat gagt t t ct cgaat C aaacagagga ct t aat aaat at gacccacc at cat at cct gat gaaat ca at t t cgt caa gt t agt gggc aat aat at t g t ccat t t t aa cgat at ccgt t at at t t cgc tgccagccac aact ct ct ct ggat cat t gt cgat t cgt t g gat at t gct g t t ct gt agct t t agaggct g t t ggat t t ga gt t ccagagc agagagaaaa tttcgacaag cat t cgt t gt gt ggacaat t aat caaggag t agt ccaat t cggaaat acc t t t act gaag cct ccacacg aaggat t at t agccgt gcgt cat ct aacgg at t t ct t ct c t cgcat t t ct ct ccct ct ct t at gct t ggc ct gcgat cgg agt t t ct agg cact cct gct t t gt t gt t t a t ct gt gcgt t t t agct aaga cacgact t cc ccaagcaaaa t acct at t ct aaccaaaccg cgat gggt ga gggat ct t cg act gcccaac aaaat t caac aagcat gt at ct t t t cccat t t aat cat ga cat ct acaga ct caacgct c ccagat t t t a gat ccgt cgt t t gat ct gt g cgt t at t at c ct aat ct t ac t gat t t aggt t gat t ct cag t at t t agt ag gt cgagt aga gt aggt gggc aat aaccgat t gcat t gct a gaaacct aat t ct gcgt gag gt gt t cgcaa at caacgccc cct ccaat t g ct at at acca t cgt caat t a t ct accagaa t cat aaaaag t t at cct t cc t gct gct t cc ct t aat gat g gt cagct ct c gaat ct gt ga tttttttttt ct t gt t at gt t gaggct cat t t cct at caa at gaaaat t t t caaaat t cc ggt ct aaacc aacact gct g acgcgt t cac t aaaagt cct t t t t t gat aa cat at t gcca ccaaccct aa t at t ggcaaa cgt t ct cat t t agt act cgt tcgaaacaag at t t t cat ct t gt t t t t cgt t cgt t t ct gt aagt t t t t ga t ccact cgt a at agt t ggat gt at at at at acaacaccgg gt t ccccat g gaat cgcaga gcat t ggt t t ct gt gact t t acct gcgaca ct gat gcct a tttccccaaa at t at t t cct gat ccgaacg tttgaccaca cat gact ct a gt ct t act cg gt at gacgga t gact cgat c agt agat cca t ct gt ggt t c gagat t t ggc t gaagct gt t t t cgt gagt t ct gt gt at t t 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> 158 2000 DNA <213> Arabi dopsi s tha i ana <400> 158 gaaatatctc gcctggaaaa gagag ct gat act aa at aaaaggag gt t ca( caaagaccat gcagattatc ctaaa at t t t t caaa t ggcaagct c acctc cgaatagttc accaccagca gcatat ccacct at aa aat ccat aaa cccaa( aacaaacttt ttcataacaa aaaca( aaagg cat gt act a at cct t cca caagg cacc at t agaacgt t t t ct ct t gt gt aat t t cat cact aaat ct t ggcaat ggc aat gcaagat cat gt caagc Page 17' agaccct aac t ccct t cct c at t cgcaaaa t ccagcact g aagat gggt t gaaagat at a aacaagaaat t gt t t aat gc ct aat ct t t c gcacagcaca cat at acgct ggagt caaca agagat t at c t cact aacct 120 180 240 300 360 420 12689250 Sequence Listing.txt ccttgaaccg gattatattc ggatggcgaa gtgatctgtg attaatgatc cat t ct cat c ttcaggaaaa acgat aagaa at gcat at ct aggaat at ac t gacct t cat act t gt ccat caaacaaaaa gaat t t cacg gat gcgat t c gacgatggaa aact t tct t t at t aat agag tcatgt t t t t acaagt t t cc gt t t t t t t t t at t t acacga atagagagag aacct t t t aa acaaaaaaag gt t cgt gcgc ct t gt t agaa t t t t aaact t gaagt t gt aa t gagct gagg gat cgt ccgc aat ct gt at c at at t acaac at t t acaaaa gggt at agga ct t aggacca gagcctggca t ct cgct cga aaaaat ct cc t gaaaat gaa cgagat ct cg ggt t aat t t g t t ct t t t t t t aaat aat gat t t t t ccattt at gcgat cca gt caccaacg cacgagtggt t cat t gt cca t gaat t t at g aagaagt t at gt at at gaaa cgaaaggccc aat at acgt t gcct t at cgt at ct cgct ct cggt t t t cac aaaaaaaaca t acaagat aa aaaaat cgac t caccagct c cgct cgat gt act ccaaaat caaccct aat aaaat t ct cg acaagat t aa cgat t cgcaa agat t t ggat t t t ct t gttg gagaatgaag at t t aat t aa t acat gt t t g cgcttttttt cagcagcaag t aaat cacat t cat gat t t t gt t cccat gt acat t aat ag aat caat t aa t ct t agt act tgt t t t t gt a ct ct cgat t g aacaaagcaa ct aacgt t ac t ct t ct gct t atggagagca act t cat ggc tcccagcacc tccaaacaca t t gaaat t ca aaaaaaaaaa agaaagtggg t ggagagat t gaat t t t t ct aagaat ct cc t acgagat t t t t t ccct t gc ct ccct aat t caaat t cgcc t t t t ctctat cat ct act t t gt gagt gt at acct t at at g tagagcaaac acaagcccaa ttacccaaaa gt at ct ct cg at act cat t g gt t aaacacc aat agcaat t at gaaaacac aacaagt t cc t at gt ct t t c caaaacagaa agt ccagt t c aacagt at cc aaaaaaaaag t caagcct aa t t t t t ct t t t cct gcct aaa tttaaaagaa t gat t cgagt caacact t ca t t agat t ct t gaagat t gt c t aacat t t t t at t t t aat t g ggccgct aaa at t aaaagac t t aat t aat a t ct ct acaaa ct ccgt cct t t ct ct t gcca t acaaat ct t caat cagct a cat t ccaat t aaaaacagag t t agagt t t t accagct cgt aggaaaaacc t agcagct aa caaaat caag ggaaagagat gcgaagaat c t t ggggt gaa caaact ccca agagggaccc gt at aact ac act at acact t t ggt t ct at act caaacgt caacat at at aaat gacat a ct aat at aca ct t at at ggg gagcaaaaaa gt gaaaacac cct cagat ct 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 159 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 159 attgaaatct aagcatttgc atgatatata gttacgtgtc acttacatac atgtcttgta ttgtttagtt tcactgctga gagtgagttt ctctatgtac tgtaatgaac ttggccgatc gtgttttact agtttttctc ttgaaagaaa aattcgtatt actggtactt ggactaaaat ttggtctcca caccactctc ctctaatcat tttacaaaca attctatatt tgagagagga Page 173 120 180 240 12689250 Sequence Listing.txt tat t at t gaa catgatacaa at t aaataat attttatgag tacgtatttg catcttgtga gaaagaat t a gt t t t acct a caaagaagat ccggtggtgc gct cct t aat at act t t at a ct t at t gat a aaaagagagt acaat agt at gat t t cgt t a ct at at acgt gact gaaact agt t agt aga t aaaat t t aa gcct gaaaaa gacacaccat t gggt acaaa cggagcgacc tttttttttt caggt ggt gg gt cat t gt gt t cagaaat t a aat t ggat t a t aagt t t gt g caaagaat gg aaaaat aaaa at t t at t t cc caact ccgct t cgact t gt a agt gt t t t ct t aaggat gt a at aat t at ga gt ct t t gt gt t t at t t gt aa agct t t cgt c t t agct t aaa t ct t t at caa gt aat t gt cc aacat at at a acat ct t ct t agt gaggt t t t t t t t aggt g gagact t gaa ct aaacaagg t cagct aaac gat gggt t aa cat t t ccaaa t t t t aaat cg t gaaat aaac gt at t t t t t t aggttttttt cat aaaact a aaaat aaaat at at t cacaa at caaacagc t t ct t acat a t at aaact t t t ct at cat cc aagggaaat a t aat aat at a t t ggt aggt t t ggcgt gat t t t gt at ggaa cat at t aaaa agat cgt at a aaacaaacaa t t acat aaaa gt t ct cgat a aact t t t t t a ct t aaaagca gat agt aat a gt gt t cct aa aggggtagaa gacat t gt t t act gggct ag at aaaaaaat t t gcat gcag gacgat cat t t t ggt gt gt t aat caat caa gat act at ag cat gat t gaa aaaaaaaaag cat cagt t ct t aaaaaaaat ct ct ct t ct g at gg t t aacaat gg aagagagagg at ct acat gc gat aat t t aa gcat cat caa t aaacaat t t aacaaact t a act aaaaaga aaaaaat at g t t t at caaca aaaat at aaa gat at at aca t at t gat gga gagt aaat t t t aat t aat ca aat ct ct aac gcccaaat t a gt t t t t t at a gt t ggt ggcg gaat at t t t t gt aacgacga t gccat gt ga agaagaggca cagat at aat t t ggat aat t gacgaaaaga ct gaat cggc aaaacagat c gt aagt gaaa aaaagaaaaa t gaccaaagt cccggccacc at caact cca ttttttttta t at acgat ca gacgat aaat agccacgcca aagacgat gt aaggt gct t c ct gaccacgt caat at t t aa aat t at acat t ct cagcagc caaact gggc at cagaacat t at at at at a at at t t t t t g ggt gt ct t gt t cat t gat t g aat t gagaaa cat at cagat t cact t gat a aagaaaat ag gcccaccacg t caggct cgt t ct ct act t t gt aacagaat gccaaat aga cacat t t t t a act t gcat ct t aaat at aag t aact t ct t t aacaaact aa gaaagagat a aat ct aaaac aat t cgt aat agat t caagg t caggt t aat at acagact a t t t aat cat g t t t ccat ct g tagaccgaac aaat t t cgaa t at at at at a gt t t t gt cac at at gt t t gt t at at gt t t g cct t gaagaa t ggt ggat t a caaaaaaaag aat at aaaat gat ct gt cag gt agcct cac gct t ct t cac 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 160 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 160 Page 174 12689250 Sequence Listing.txt agtacagaat catctccaaa gatctcggtc aaagcaggca tatgccaaac cct gaagcca cggct t cgat t ccct gt ct c agacgt aaag acagcgt gca aaact agt at aat t ct ct gg aaat aat gcc aat aagaaac gt aaaat caa aaccccaat t t ct ct t t caa at t cgcagat at cgaggt aa t acgcaat aa t aacgat caa t cagcagct a aat gct gcca t act ct t t aa agt ccct ccc act cgacat a t at caact aa t t at t gt at a ct t t gact t t ct t gacagt a cat gaaaat g gat at gct aa ct aaagt t cc aaaaaat aaa aggt t t cagt at t ccacat c at t aaccgat t t t aacat at caggcagaac ct t t t t caac ct t caagt t t ct t t agct ag t t gcacggt g t t gcggt gaa caaat acagc ct t t gat t t c ggt ct ct cca gt ccaccacg t t ggt t t aat ct t ggat acc cct ct agacg acat gt t agt at t gagt t t c ggct ggt aag ccgcagcccc agat at cagt caccagcttt t acaagt cat aat t aggaat t cagaat t at tt ctt tcat a t t at aaat cc at at ct gt t g aaaaacaagg aat gaaaat a gt t t cgaat t agact t t ccg t agt t gagct cgct t gggt g t aat t t gct a t t t t t t t at t accaggt agt at aat cat cg acct act act tacacggaag gat gt gaaga t ccccct gt t t ct t t t gat c acct gt t t ca acgcat aaat t agacat t ca agt acat cct at gaggt ggt tagagcagcc aaccgaacct tt ct cct gga cccat cggt c t gct t ct t ca at cct t ggt t gaacccaaca gaat t t agaa agat t t aat t t ct t t at t ag t gt at agcgc aaat at t t aa t at at gt aaa gggggt t gct cgaaaaaaaa cgct agat aa caaaaaattt t gaaaat gac ggaccaacga t cgaagat t t at ga gagacccaat cgcagt aaat gt acccgcgt t gcat accat agt aggccat aagt agt cat at t t ct t cgc gcct gt gat t ggt t gggagt t aaact gct c aataggggac cctt ggaaag agggct t t ga t ct t caaaaa acgggctcga cacacagt t g ggt ggaact c t cat at t cag ct t gct tt ag t t ct t gcaat aacct t t t t c accat gat at aacct aat ac t at t aaaat t t cct agat at aaagggatga ggct aat gag aacaaaggct t t t t at t cat t at t cct t ca aat cgagt gc t t t ctgtat t ct t gagt gaa caacaaagcc gaat at gat c gat t ct t ct g t at ct cggca gcat t acgat at gt acccgc t at aaat agc t cacat t ct c t accat agt t gt ccat act t t t t t agt at a acccaaat ac ggt ct aaggg t gt ggt agca t ccat gt acc caggt t gagg gagt at agt a t ct ct gt t t g aaaacaaaac acaggaat ct t t gat t cgcc ttct t t t t ca aaaat at t aa gacaat at gc at aaat ggga at cgaaat aa t gcct at t cc at at t t t t t a t t gaat gaat t t act cccac cgaaaaaatt gt gaat accc aaagat accg caaagt t gac tccaccagat t ct at caat a at aat gagac aggaact ccc agt agcat t c ttcggcacaa at cat ct t t g t t t cgcggat gt t caat t t a agcaggaggg att acccaca at aagct aca t cgt cct t t g agt agaagat agt t act cgg agt caat t t a t ggt gacat a aacaaggt ct t t cacaaaat aaat acat ca agt t t t gaat t aaaat cact ggaat t cat c cgcat aaccg t gaat cat aa t at t at gat a t t t t aaaact aaaaaat t ga t t t t t at t ga cgt aacaaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 Page 175 12689250 Sequence Listing.txt <210> <211> <212> <213> 161 2004 DNA Arabidopsis thal i ana <400> 161 agt aat ccaa t ccgt cgt t c gggat t t t aa aaccaggacg gt gat ct cgt aaacgagacg t t cgct at gg cacgat at ga gt t t t ggt ag ct gaaaacgg gagaaat at c ggggct t t ca t ct gat gt gg acgggagagc t t gct t t gct aaccat t t ca at aact t t t t t gat ggat t t gagt t caaaa t gt t t gt gt a at t t t aaaaa gagat ct t t g t gat aaaat a gaaat t t t aa aact agaagc at at t t cat a taaaacacaa aaagaagcag gagaaacaag cct t ct ct t c ct ct cgt ct t tcaccgacga gcgagatggt acacggcgat ct caccggaa at at t t t cat gcaccgt agc t ggat gcggt ct at gat t aa t gaggt gt gg cgacggt t ga cgaat t cagc aggcgaggat t ct ggct caa t gcgat agaa aat caat gt a gt gaaacaaa t gcat at ggt agt t aggt ag t at acct t gt ccaagaagct ct acat cgat caaaggct at aat t at aaag t ct t ct gat t actttttttt cat gt t cat g gacaat at cc t cat t gact t t t t acct t gt cact acgcgc cgaagt t t ac gaagaaagaa gat gct t t ac t t t aacagct ct gt accgt c at t gggt t ca ggagaagcac cgacgcagat tggagcgccg t at act gcag ggaggagagc cgt ggat ccg gggccct t ct gt aat aact t t aagat aaag t at agcaaga t t t t ggact a ct ccat at aa t acgt aagct acaaat gaaa gaagatgggt caggt t cat a at ct aacgat act cat ccgg t t cct ccaat t at gt at gct aat ggct t ac cat ct ct aag t at aat gt t t cagct ggcac caggagct ca ccgagtgggc tcgtcgggaa cacgt ggct a ccgat gt t cc acggt ggt ga cgagccacag ct gat caagg ttgagcaagg gggt at gct t cggagat at g aat acgggt c at t t ct aaag gat cat t t t t at aaat t aaa act cgt at t t gt gt gaat aa t ggaaaat ga t t at t aat t g ccat t aat t t tt ct t t t t gt aacat ct at c aagt aat t ga acat t ct cga ttaaacgcca t t t gtgtacg gt ggt t cgaa caggt gt gct t t aggt ggt a caaaact ccc ct agcgcaat aacgagt cag ccacgggacc gat t cat at c acacct acgg t cct ccggag ct ct ggct t t ct aagt acga aggtgacgga t gacggaat c gcacggcggg ggt t cat ggg gt aaaat cac t t ggtcaaag ttgt t at t ga gat t t ccaag gt gaat t t t c cgacgt gaaa acaaacgt aa agagggatgg t gt ggat cga ttcat t cttt t aaacaggt g t gccgccgt t tgat t t t t gc t act gcaaag aacacgagag t t t aaaat ca aat t agt t t t cgt cgct at c cagagtcgt t ggaccgagt c gagcaaagga ggat aat t t g t t t att gacc at t t cagct c ggcgccacct t ct gagct ca agggt t t ct a caacggcgga gacgt t gacg aat t aaccaa t agat agt t t gcaaaaact t gat caagt t t t t t t t t ctag at t t t t gtac gat cccgat t aat gaaggt t ct aaaaact g ttgaaagaac cacaaacagt cct ccagct g at t ccgt aaa ct t at aaacg ttttccggac caat ct ct ct caaaacaaca at gat agt t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 gtttcaaaag attactaaaa ttttgagcag gtagcaccat ataagaagat aagaaaggtc Page 176 12689250 Sequence Listing.txt tcatttataa actccatacc aaaaactgca tccggtaaga cgct t cgcaa ggatctaat c aaattggcta cttctaaact ttgaaaggat cttatttgct agattctcac ttttacttgt agt t gaaat a act gggat gt at gt 1920 1980 2004 <210> <211> <212> <213> 162 2004 DNA Arabidopsis thal i ana <400> 162 gcat cat cga aaaat ccacc cact at ct ct at cat cat ca cacccgat ca gt t at acaaa aaacct ggac aaccct gt gc acaaaaat at t at agat t ct aggggaact c t t t t aggaat at t t t aaat a gcat cact aa tt ct t t gttt aat aagagga aagact ct t t acaacacat a aaacgcgttt at gat gt aat t aaaaat cat aaact at t aa t t t at t aaaa t t at t t at t g ct gaccaaga t at at cct ac gagt t t ct ga aat ct acaaa aacacagtt c t gaccat ggg tcagcaaacg gaaaaaaacg agaagcagca aagagt t cca at cgggt t ct t ct ct t gat t ctct t ct t t c aaact aat ga cat t caact t agt t gcgt ca cat t t t t t t c agt ggt t at t cgat aaggat at aagt cat t t act acaaat caat gact gg agt agat gct ct at aat gcg gt at caacat aaat t gt t at t t at t gt aat t gt gggaagt at t t ct t cat aaaat cat aa accaaaggcc at t ct aat ca cat aggaaat gaaat ct at c aat cgagcat cat gaaccat ttgagacaga gt t t t t t t at t at t aacat t t gt cgct gt t ct acaaat ca at aagt gt t c act at acaaa ct t t ctattt at t acat t aa cggt ct t cag ggat t t gt ag t cgagt t aaa t aaacat at g aacat aagct t gt aagct t g caat cggaaa cat cat ct cg at gcgaaaat t cgaaact gc ttttttttta ttgagt t t t t aaaact caaa gcgt aaat cc caacaacaaa aat t cgat t a cagaaaagag agaagagaaa gagagagaaa t t t t at t t ga t gt cgaat ct aat ct cccat tatggaggag t cat t t t t t t aacat t t ct t ct gt agt t t a gaaggaaaaa ct at aaacaa t aaat aacaa aaccccaat a t aact at ct c cacaat t at t cat aaaaat a at gat t t gct at t t t aat aa gact t gcaac aaat at gt at aaat act aat aaat t agt t g acccggaact t cgat gcct t ccct aaaaat acagt gaaca cgaagaacaa aagagagaga gagaaagaga caaaagt act t t agt t gat t t gt aat t cag acaaaaagac gt t gt ccct t at t t t t at t c aaagact cgg gt aaaacaaa gt aaagaaag at t aacaaca t aat at at gc t gt t acat at t gaat aat t a cagt at at aa t t t gaagt t a t gct at at at t gagt t gct t aat t ct t aat at t t gcat ac gt t t gt at gc gat ccct at c ct ct t aaacg gaaacaat aa cgaaat cgga cgaaat t agc gaaaat ggt g ggctgagaaa gct t t t t aat aaat t ct aat aat ct at ggg t t ggaat t at t agaaat t ag t at ct at at t gtctttttac t cat cat at a t t gagat t cg caacaaat t a at cgact act t gaat gaat g aagt cat aaa ct t t t at t t a t t acaact ag act t agt ct t acgggcaaac aaaaaaaaaa t t t gt t gat t at t t gacaac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 Page 177 t t ccaat t t c caat aat t ga agt t t agat g t cgat at t gg att cgagaaa t ct cccacga t t gt ct acg t t t t aaat at aat gt cgacc t at t t gt at t gt acat at gt caaggcat ct cgat ct ccca aaaat cagcc 12689250 Sequence atcacttttc atatattctt caaaaat at a cat t t aaagg at aggggaaa acaat t at at atgttctttt acgattatgc ctattttttt gcttcttcta aactcatttc t ctacgttca at gg Li st i ng. t xt gt agagct at cat t t cgct g t at t ggt t aa cat caaaaaa at agactt ct t cgat ct ct c aat t t t acaa at aaaaat cc t at t t at t ag ttt att agcc t cgt cact ga t ct t t ct cgt 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 163 2004 DNA Arabidopsis thal i ana <400> 163 t t t t caat at tctgt t t t ct acggt agggt gat at ggct a gct t ct gt cc ttttacaagc at t t ct ggcc t cagct cat a gt aat t t t ca aggaacaagt t t gctt gaac cgcaagtggc t t t t aggct t tt agt agtt g at t at t t t ga aggccagttt aagcaagt ct at aaacat gg t gaagaat ca t gt cct ccac t ccacat t t g aacat t agct aagt t aacac t t t t gcat ct t t t gt t t t t t cccct gt t at gat t t ctt ca aagt cgct t g ggt gt t t ct g ct caaagat t cat ggtt aga tt ggtt ct gt tcaggaacgc aaagct t aag t at t t t t gt c t ct ct ct at t at at t t gct a at act t cct t gt t at cacaa cat aacat ct gagact ccac gagt gt gt gc gt t cat cat c t ct ccagcca aacat aaat a acct t caaat at t t gct t at agt t gat gat t cgt t ggt t c at t t gt cact agct t cat t t t t t t caggt t acaaat ccac cat t at t aca t gcagt t t t a ct gct t ct t g aagaaggaag t gt ccat at a t cct t agat t tccat t t t t a t ct gt cacgc gt t t gcaagc agaagaggtt t ccaaact cc at ggcat ct c at t t t cat cc acaagt at t g ct ct t t t gcc t t t ctt aaaa t cat t t aaag ct aaaagcaa t gggaggt t g ggaacat caa t ct gat at gt ccctt ggaag aaagcat at g aagtcttttt accaact aga caatt cacga agct t t t gat tt at gtt agt tt gtt aaaaa t aagat gaga gt att ggcat ccat t gt ct t gtt gagcat a at t acct t ca t ccgaaaat c att att gct t ct ggt at aag t ct t at caaa aacat ct at a t t at aat aac at accgagt a t caaagct t t aggt aaaacc act agt t t t t gctt caaggc gat ct ccgga t t t t t cagt t cct ccct gag agccaat gaa taagagcaag at att agt cc t agt t gcaaa aaat t acat a att cct caac aat at ct caa ccaagt gt gt ccacct gcct ccacacaggg act cgcct t g ct t t gat agt aaaaaaaaat t gcaat aaca aat ct t gt t a t accagct ac cagcaaagaa aat at t t gca ctt ct ggtt g act gcaaggt gcggctgcca aaacgt ggt c t at caat ct a ggt t t t gggt aaat agaaaa at t gt t t t t t ct ct caaat a act gct ggt c caacat acga ct gt ct ccag tccgaggagg att cagcaaa t ctt gat ct t t cct t gacgt t t gt aaaat a gat gctt cac gaat t caaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 tataatgcaa taatgataag aaaacatttc tcctccatca accaaaacgt gaattaacat Page 178 ct at cgaaaa gagcgcttgt at caact t t t t aaaaacgt a aaaat t aaag acaat gcat t tt cat cat ca t t at aacagt gt caacaaga ct gt ct ct ct t aggt gctt c t cat aacgag ggaaat at t c t acact aaac t t aaaat t t a ggat cggct t caaaat aat g cgaat cgt ac gaacaaagcc t ct t caat cc 12689250 Sequence acaagt t gac acact t gaca tttaaagaat cgcaaaacac aatactcttt tgttgataca agtttttttt tttttttttt aacgat aaaa gt at t t aat t taaaaagcaa ccaaaaaaaa aaaattgcct tattcacgta ggcagatatt tgttggcccc acagagagag agagacacta at gt Li st i ng. t xt cacaaaat ct act cat att a t gt agt ct t c ggggt caat a gggacgt t t a aagagct gt c at aacaaaaa t cct ct t at a aaaccct aat acaagt gt ga at aat aat ca t aagat aaaa acagaat ct a t caaaat gaa at gat t gat a gaaaat gat t t gat at cat a ct ct t act ca 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 164 2004 DNA Arabidopsis thal i ana <400> 164 t t at t t agt g ccaaat t t t a t caat at gt t at ct aacaag t agt acgt ac caaat gaaat at t gct acaa t ccacgt aat t t act cgat g ccgctggccg t agtt gt gac t gt aaat gag t gt aaagt ca cgct ct t at t accact cct a t agt aat ct c aactt aat cg aat t caaaag gaagtt gt aa ttaaaaaaaa caagaat ct c acaaaaat ga t t t ct t aat t ct caacaact gagt aat at g gcaacat aac aagt t t gagt cat t t t agac at ggacaaaa agat t t t ggt t gtt gaacca aact t t ggt c aat att gagc cact t cct ca gat t t at ggt aaacct ct at tttacagaca t at t gt at cg cct aagat ca agagt t t t at ttt at caaca gat actt at a t gt t t at aaa t t t aaat t t t gaccct t aaa gt at at gagt at caaaaaat gagcaat ctt caact atttt aatt cttttt agt t t t at t t t t aat ggat t gt gagaaaat acaccat aaa t gacat gaaa t t gaat acac at caaat agt t t agagct gt at gaat caca caact cgact at t at agagg gagcat cgaa aacgaaat aa caagt agat a at ct acacca tt ggtt ct gt t at at cgat a t att att gcc t aaaagt ct t agcaact t aa aagt t aagt g ggt t caat t c aagt cgct ct aat aagagt t tgat t t gttg gt t t caagt a tt t aacaat a gt t t t cgaat t aacat t at c cct t t cat gc aaaat agct t t aaact gcaa t agct t ct aa acact ccgaa cacgat at gt t agt aaacaa gaat cctttt ttagtcgggg gat t t t cagt t t t gagattt ggctt gat aa gtggtagagc gt t t t acat c gtt gagagat aaaaaat aaa t gt ct t ct t c tctct t t at t act gacaaaa acat aagat g t t t cct t aac cct t gct caa act gt ct ggc ccaat gaaac aat gt caat a t at cat gt ca att ct t t gac agccaaat t a acat aaact a t t at at gtt t aaagcat ggt aat t gaat t g t acgagaaat t cagaagact caagt cat t c ttcaaaccag at cgat t aag agaaat aagt t cagt t t aga att caaaggg aat t at gat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 179 12689250 Sequence Listing.txt ttaatttcaa taatttttgt agtgttgaga aatttatgtt caaaataggg cgagt aagaa gaat acaaca acaaaat ttt t at t ct t acc tggctaggag t t t t gt aat c gaaaagaaag gagcat t cga t ct t aaaaaa aaat t acaaa agagagagtt ct ct t t ct ct t t t ctct t aa <210> 165 t gt gt caat c t t at agt aaa ttttttttca ct t at t at t a aacagaaaaa t ct aaaat cg agcaaaat ga gagcgcacaa t t t caaaat a aaaaaaagaa t cct at ct t c ct t t ct caac at ccaccat c agaaaat gt c at t t t gcttt aaat aaccga t t at t ct t t t aggt t at t ga gt t gat caaa taagaaaccg aaaaaaaaac tcacaacaaa aaaaaaagaa t ccat t cct c aat ct ct at t at ga aaacaaat aa ct acggt at t t t t t ct aaaa gct aat ct t g ccaccaattt acaaat aaac agt t ggt cgg t at t acaacc aaaaacaat t aagaagaaga ccaccat ct c agat ct t t ct at cat agaag t t gt gt t gat t agt t t aaat gaat t gcat g agct t gacaa agaagaagaa t ct t gtgggg aaaat at t ac t at t t at t aa ggaagagctt cct cat ct t c ccat t accat aat caagt gt cat gacaat c ct gcaacaaa gagt t t at t t act aat at t t t aat ggt t t g aaaacgaaaa cgggcgtgaa aaat t t aaat aaacat t t t g t t t gtgt t ga at ct t cct ct t acct ct ggc 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 165 ct at gt agaa gt accaacat ccaaagt t cc t t at ggat t t at t cgt t ggc t t at t t t at a ttct t t ctac t at caat t ag gat cgat aac aaat gt cat a t t ct ct agaa t at acct at t at t gt cat aa tttcaacaaa aagagaaagc agcaaat t at aat at t t aca aaaat t aat t tagaaaaaag ttttgttcaa ccat at cgt t aaaat gggca t t t gt t agat aagaaacaaa t t ct aagcaa gt at t ggat t aaaat aaaga t t t t ct t aat cat t t t cat a aagaaat aat aaaagcgat c gt t t ggaaga at aat gat t a aaagggaaaa at agat t acg ccct cgat t t t gaaaaat ca at at at ccat gt t t t t ggt t agt ct t t t ga cct t gaaacc t ggt cgt ct t gccaaagaat agt agct ct t aaat t t t t aa aat gat at t t t ccaagat t t ttaaagaaaa gaat at aaag t gt caat gt c at t t t at gt g t gagt t gaca at t ct caaat t gt aagat t t aat gt agaaa t aaaact ct a tgatct t t t g t agt t t acat cat t at at at t at t at caag aaat aaaat a t at at aaaaa gat acggt gt aaaagaagaa at gct ccaaa t t t aat t t t g cgacacgtt g t cat gaagat caat ct t aaa gt ct at t t ca aacgat t aaa ct t t gat cat agct t t aact t caat ct gga at t t at t gt t agat t t t t aa gaaaat caaa aat at aat aa ggt gt ct aaa gagaggtaaa cct caat cga t aat t gat gt at gact act t t t gat ccat c t t t cgcgt aa cgcgt cat t t ttagagacac ccacgact gt t t aagt agga at aaat t t gt ct cat gt t t a aaacacct at acat t t accc at gggt ggga agagt gaaac gt ccaat t cc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 ccctttcatt aatacaaatt gttcccaaca aaatatttat gggtccctct tttagatttc 1020 Page 180 12689250 Sequence Listing.txt cctgggtccc gcggatccaa attttaatgt ggacgt caaa t cct t t t t t t t t gt ccact t agt gt t t gga t at act aat t t t gaaccat a t agt ggt t ag acacgct act t at t at t ct t gt aat gacga t t ggat t cct t t t ct ct at g at act at t gt aaat at at aa at t t t t caaa agaacaaaaa acagat gt ga cagt t act at t cct ct t ct t t aacat aaaa t t ct cgt caa aat t t t aaat t act t aat ac at gaat t gt a aacagt gagt gat t t t gact t t gt acggat at at t cgcat t t t caaagat t t t t cacat t at aaaaat t g aat cagaaat gct ct t ct t c aagct cgt ct ct t t t t t t t t t t t ct agagt cccacaaaac t agaaact ga aaat aagacc at ggt gt at g gat t ccat t t aagaagt t gt t t ct t ct cct aggccct cca aaaagt t caa aat aggaacc aaaaaaccaa at aaaaaact t t cgt ct t ct at gg t t t t t gccat cat at ggat g ccgat ct t aa ccaat cacat aat ccgaaga accaaaat t a t cagt t t t t t at at at ct ca ct at t at t t a ccggat t t t c ttttttcaac aaagat t t t g aaagacacac gt ct t aagga ttgaaaacga gat at act ac t at t at t ct a ggaacaat at accgagccgg gct t ct t t aa t t t t ccaat c cgat ggt at a t t cgat t t t a cat aaaat ct cct aaaagca t t ggat t t t c t cat aaaaga agagaaagga t t at t at t at t at aaat aaa t agt t aggcg t gaat t gcat aaaat t gt ct t t aagt t t aa t ct t ct ggt t acact aat ga t t t t t at t t t ggaat at t at ct at t t at t a cggcacat aa ct cgct ggag t t t at t t t ag acaaaagaaa 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 tctctctatt ttattctcat cctctcctca <210> <211> <212> <213> 166 2004 DNA Arabi dopsi s t hal i ana <400> 166 acgact cccc cat t t at t ct acat cat t t c at t cct aaac gaacaaaatt accaccggct cacagt gt t t cct t t at cag caacaagat c t aact accgg acggaat ccg gagt ggat gg gt ct ct ggt g aaaccagt gg gaat acagaa t aagct aaaa t aacacat ca acagt t t ct t ggt gcct t gt gaaaagagaa t at cagt ct c agt aaaaaaa gt gat t cgt a aat t ccgaag t ggagacagt gt agt t aagc aggccaagt g t at aacaaca acaagact t g t t at ccacaa gagat gct cc caggt gt ct g ttttgaagag cgccgcacgc aaat cgaaga gat caacgt c agagaggact gt t gagaat c cggagact gg act ggt t t t g aact ct t t t t t cct caaaag at at gat aat t t t gat accg agat t t aact agagaaaat a acaacaaat c aaacat t cat gagaaaccag ccgact cgat gaagt cat ag agt ggt gt ga t aagct gaag agct aacaaa agat t acct t t t aaaagat t agaagct ggt gccgt cggaa aaaacaact a t ccgccct ac caat ct at cc aacgccggcg cgacggaggt aaccgacgga gt ct gt gagt acgaat ct t c caacat t t t c t agt aggaaa ct gaggagat t aat gct t ga at t caaacgc aaaaat cat a gagagat cca aacagaat gc agt gaact ca aact ct ggaa agagagaat c t gct gt t t cg 120 180 240 300 360 420 480 540 600 660 720 780 Page 181 12689250 Sequence Listing.txt ggaaacgagt ggt t gtccgt taccggagat aagtaatgtc tacgt t acag ct t t gat t ca ttagt t t t gt cccaaacaaa ccaaat aaag t cat aaaat t at at t aacaa at t gt at cat at t act ct ac t gt ggaaat c act at t aact gat acat ct a taagcccgcg t t t t t aact t t t acct t caa ggt gat ggag ct gt aacaat agcccact ag aacaat cat t acagcat cat aaat ct cat a at aact gcca tggacaaagg acccat t agg cccaaaaaat t aacaat aaa aat aaat at a acat t aaat a t acat aagt c ct at t t agag at t at aagat gt t gt agt t g at at at ct t t t ct aat aaaa gt ct ct at t t at t cgct t t a t t gtccggac t at ccggt t g gacatttttt cat cat cat c at t t acgt t a tgtggcacga acct t cat t g gt t t cat t ga aaat ct aat a aat acaaat a at t t caaat t at caat aaca gt ct t gat t g t agact cact att aagacaa at at ct t aaa t aat at at at at at caacct t t act acat g aat gt ct acg gaaagact at agagacacgt t t t gt t t ct c at cct t caaa at gg t cct ct t ggc aagcagaat g agcagaatgg t t aat at ggg cacaat t t t a t caaat ct t t t at at ct at a aat agat t t c t caat gaaag aaagaat aat aacaaaaaat cct at aaat a t cagt ct ct t t ggaacaat c t aacagt t t t tgggcacagt ggcaacagt c ct gat ccagt ttat t t t t cc aat act agt t ggcccggttt gcccagt t t a at gat at at a aaagct aacc t aat t at aaa gt t gct caat ct gat gt ct a ct at t act t c aaat ct aaca t cat gt at t t t aat at t aac cat gt t ccct aaat caagga ggtggcct t t t t aaat t t ca aact t t gaag t ct t aat t ga t at t t t gaaa ttttggcggc gt gt cggat g aaat ccaaag aat cgaaaac ccgggtgaaa aaaact aaca at cat cccga acctccgggg gt t t t cct ac t t t at t aat g aggggat t at gat aaaaat a t gaaat aat t caaggcaaaa agagt cggt a gat t t aact a ggcccaaaga aaaaat aat g acaaact cca aat t t at t t g 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 167 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 167 tgaaggtggc aaaagaatac ggagtcaagc ttggacgagg aggtggacct acccatcttg atgggcaatt gagggtaacg gttcaaggtg acttatgctt taggactctt cagcgtttca caccggtttc ccctaagcct gagtggcgtg ct gaagaat a ccgt t ct gt t gt ct t caagg taagt t ct t a accagctcaa t t actcctct ctaagttcag gtttgggttt gtattatagg acataggaag ccgaccatca aaacgtaaac tcccgtggat ctttgcgtgg actcagacga t gacaat gt t ctat t t t gt c aagt t at t ga cagct gcaac t cct cat gga agccccgttt ctt aaacaaa caacaccaga caagcggagg ggt t cact t a Page 18' ccacggaaga t cagcct ccg acagt ct t t c act t gagcat t gaaat ggct t gt t gagt ac t t gt agt t aa gct cgagt at aat cgagt cg cccgt gt ggc ggtgggaccg gat accat t c ggagaagagc ggaat gcat c at aat t gcca t t ccgt ct gg aaaagagagt ggaaggatga ct gcgt gcaa t t ggct t t gg 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt aggagcat t c gt acaaccaa aggagat ccc at t cggt gaa act t act t cc cat acaagct ttttagaagg cat t gaacgt aagt ccggcc aact gaat cc agggt at cgc t t ccct aaat t cct at cccg t gt t accat a t t ct aagat g ccaaaaaat a at t cat t t t t aat gt t at at at t aaaat ga gaagaaccaa accat t gagt gt ct caaaga gagat at aca gagaat ct t g aaacgcgt ga t ggccat t ct ggaat t gcgg caact t cgag act at aacca aact t aaccc t gaccct t ac gt gt caagcc acat ct ct ct aaagagt gaa t gct ggt at g ct acgct at g t acgct t t t t t ct aggaaat t t gaaat t ac aact ct aaat tttggaaaaa agt ccat t at gt ggact ggt at ggat ccca agt aat t acc t ct ct ct ct t t act cggat c aaaat cagt c tacagaagga t ccgt gt cac ct ct gt at ga t t aact acca cat t at agat gt gcgt t t cg ttgaggcaaa t at acact ca aaggact aca tacgcaccgg caaaacaccg t aat gt at t a aat gagt at g at at t t ct ga cat t cgt aat tttaaacacg cagt t cct t a t t gaat aaaa ggat agagag cgaggaat aa ccgt t t aacc ct ct ggagaa agt cgt caaa at gg cagt aagaat aat t gat ct a ccgcct cct c agagaccaga gt cccat ccg t acct t t cca ggct gcagct agcagat ccg t ggagt ct ag gact t gaaga gt t aaggcag t gt t ct at ga at aat t t ct t aagaaacaag at at aaact c aaaat t t gga gt t at t ccct t at aaaat t g t gat gacat g t at aat gaat cgcgt t aat t aaaat aaaac agct t t ggaa ct caacat gc t caaagagat gt cgaaat gg gt ct ct gaag cgcct cct cc gt t t ccat gc ggt t gcaggt t cgt gaccca tgacccaagc t ccagcggct t acggt t at c t t t aaaacgt t gt gat gaaa gt gt t at t ct aaaagaaat c caaaaat t aa caaaat aggt aat gaagat t t gct gagaag t cat gat at g aaaat cact t acgct t gt gt aaaagaacag t t t gat act t ttttcgccaa aact t caacc t ccaggt at t ggt gt cagac cacaaagaca t acat cacga t t ccacgt ca gagct cgt ga ct caccat ga t ct t gt acca t ct ct ccaac at t gt t gt t a t t t ct t t t cg acact t aaat at gt cat t aa aat t cct t ct at aaagt aca gt t agacaga act gacacag ccct acgcgc cgt t cgccga t t gat t t t t c 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 168 <211> 2004 <212> DNA <213> Arabi dopsi s tha i ana <400> 168 aacgataaag atacaaactt ttttcattcc aaaccaattt ct act t acca tttttggaaa gtccaccaca t gt ct t at t t ggt ggagcgt cgcgcgtcta aat t at t at t at at act at t t aaccaacca aat ct caaga t gt ggagat g cttaacttat ctgaaatgcg ttcacatcac caaaccct aa cccat caaga ggt t cacaaa ccaaagacgc cat t cacaca acgt at agt a Page 18 at aat ggt gc t gcgt aaat c aat gt t t gaa gt t cacat t t tacaacaaga gt accaaaga ttttgagaga t agt t gcaac agt t aacaaa aacccaat gt t at at at ct a ct t gccat ga 120 180 240 300 360 12689250 Sequence Listing.txt ataaagaaga taagatgtct agacagatca atttgtaacc gtcatcatct cat gacgt cc acgt gt gact acat agat t c acacacact a caat ccaccc agt t ggt caa ct t aaaaggc taaaaaaacc t aaagt aaaa cgaaagagaa aacaaggttt gt cccccat t cgagct aaaa t ct cacct t g t ggt ct t aag ccct ccat ga aaccggt t ca acaaaaaaaa caat t aaact gat caaaat g t at aat t cat t aat t caaca ggt t ct acgg gt cagcagaa t t acaaaaat t ct cat t aca t ct ccgt cac aaaagat cac at at t t t t gg t ct t at aat t gt gat t cct c cgaagctttt t gat aat cat aagat t t at c ct t t aacat t t gt gt t gggc agcat agcat caat gcact c ggct t ggcaa t t ct t aat t a aacaacat ca taacaccacc t t t ct at at g agat t t t at t t at aaaaaca t t ccgaat ca ct aaagt aac acgacaaaga at gaaacgt t ct aaaat t cc aaaaat t gaa ccaaat act c at t cat at cc t at cat aacc t t gaact t t t ct cat cact t ggcaaaat cc gt gaacccat acat cgt gat t acagat t ct gggacat tag t t gt t t aat a ttgtct t t t c gat gggccat at at ct cgt t t aaagt t gaa aaat gaaaaa gt t aat at t t acct cacacg gaagaccaag t t t t at t t at tagaaacaaa at gaagcgat t at t cat aat caaaggaat a aacgaaacga t aacat t t ca at caggat aa cgact t aacc t t cct t caac at gg t t cat t t cgt t t t t gtagt g at t aaaccga ccat at t ct t gt ct t t gt aa aaaagaattt gacaaggt aa gaaaagaagt ct caaat gt c at at aaat aa t aacat t t t c t t t gtat t ga t at gct at aa t t ccgt agca t acgt acgac gt t t caagat t t t t gct aaa t aacaat gaa aaccggt agt ct t t t at t aa gcaacaagtt t t t t gtcgag t cacct gt cg gt t gat aact gggt cggat c at t cgat cat ggt gaagat t cgat cat ct a t aacaaaat a t gct t gaccg at t t aggaaa ggt agccact at gaat gcat agaggt t gat gcgaagct ca aaact cgagt t aat t t t at t t t t at at ct c acaat t t aaa t gt t gcat at caat t at gca t agcaat t t t t cct acaat t t ct at cgaca at ct t cgaga t aat gaat t a at gt t cat t t at t t t t aaac t t at cgt t aa t ct at gaaaa ct ggt gagt a cacgaagcca t ccat t t t t g ccat t ggaca ccat ct t gca t cat aat gt c ct t ccat aaa acacagacac agt caaagaa t at aagaaaa t agt t at t gt at ggt at aag at gct acaaa ct t t t cat gt at t ct cagt a t gat t gcaag agcat aat t t t gt ct caaat aacggat t aa t ggt ct cat g tcaacaaaag ct t cat at ac t caaagct t c cgct gt cgt t gtct t t t t ca t at cgt cct t aaacat t at c ct agt at ct a aagaacaat t 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 169 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 169 gtttttcgtt cttttctcat atgttctaat cattcgtagc ccgaaagccg acgttgagta aacagaacaa gctcacatgt tggacaattg atagttttta gatgttaaaa ctcgtttaca aggtcaagct ctagttttta gcatttccta gtaagtgaaa ccgtgtcaat gggcaatatt Page 184 120 180 12689250 Sequence Listing.txt tttagaagtt ttgtgaaaga ttgattagta gggaacttga gttagtatta acattacgtc t at gt t aaga cat gaaggaa ct t t t t t ct t at t gt gt t ct ggt gct t t aa gacct act ca aacaaaagt c ggaacggacg gagagct t gt ggt agct caa gat t cat gga at t at t t t t a cgct t acagc t t aaact t gg aagact gt aa ct t at gt at t t at t t ggt cc aaact gt t ct t t at at gggc at t t t agact at t gat caaa gt act t caaa agat caccaa t t t t t at at a t at t aaaaat gacat t caga gct cct t gt c acaat at t ct cct ct t ct ct t t t t ct at t c gt cat at t ct t at gct gt aa at aaat aact at t ggt t t ac t ccgt t t acc acacat t t gt ggaat ct gga at t ct gaact at at gaagag cagt t agcgt agaagaaagt aat ct ct ct t t gt aaaagat t t gcgt gt ac caccact t t c gcat t t t agc aat t t aaggg gact t ct gt g t t t gt aat at acgct t t gcg ct at ggct gg aat caccat t t t t t ct t at t t t t t aact aa at aact gt ca cat gt gcaac caaaaact t c gt t t t at t aa at t cct t caa cgat caaacc ccacgat ct t tttgtttaac t gaggt t t at aggat t ggca at t t at ccaa gaaagagtgc tgaagaagga t ct t agagt a cact gt gt cg ct t ct cgt t g t ggt gt cct t t t t t t t t gt a gaat caaat a t t t t t ct t t t aagcagagag at ct gt gggt cagct cat t c aat gt t gcaa at gggcct t t t ccct aagt g gcagtgggca tttttttttt t t at aact aa aaaagact ag ct gaat cggc act caacgca at t caccct c t at t t t cgcc gccaact aaa at gg t gt gt t act t t ggccact ca gcct t t at t c aagat t ct ca aaggt t t t gc gaggct at gc gagccgacaa ccaaaagaac gcat t aaaag ccaccgt t ac gaacaagt ag ct t t ct t t aa aaacat agaa gct gcat at c aacaat gaaa caat cggt ct aat cacaaag gat t gt at ag t t t atgtggg cggccact ca aaacaaacga ttttttttac t t aat t ct t t aagt t at t ac t ct aaat ct c ct gt cgat at t cat cgt t at t at t t aaaaa act acat t gc gt t aaat aga t cgcagat gc t t at t cgct a aaaagt at ga aagaaccttt t ggaccgcct ct t ct ggaat t gt ct gagat t at t aaagga cggcaagtgg cgaaat gaat at agt acaag gt gt gat cat at gt at gaaa t acat acgcc ct gt act cct aat caaat at ct gct t ct t g ct t agat aag gt aaat cat c accagaaat c t at t at aaca caagaaat at t t t t t ct t t t ct cgact t t c t acat agacc ct t cat t t gg t t caat t aga gt gact t ct t aat gt gccca t t t ct t t at a aat gt gt aga caaaagaacc ct t cacaact ct t t ccgt ct ggt caagaca t gagt acat g aat ccgcagt at t agaagat t at gagt at t t aat t gt at a ct t t caacat aat ct t ct gt at t t cgaaat aact aggt ga gt t gcagagg t t cggccct t gctt cgacaa t acgt cact t ct aat t t t gt at act t t t t g t t aat t cact t gacccat ac aact cct aca cgaaat aaaa aaacaact aa at agt gt at c ct gt aaccat 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 170 2003 DNA Arabidopsis thal i ana Page 185 12689250 Sequence Listing.txt <400> 170 ggt caccgaa t gat at t t gc agt at acgcc aaagct at aa cat t t gt at c ct aat at gaa caggtggaca tt act agagc t ggt t aaagg ct cat ggt t a t at acacagt t aagt t acca aact t t t aag aat t gct aga t gat t ct caa ct t gcaccac act ggcacag cat t cct ggg ggt t t cgagc t t gggt t ggc agccat cccc ggcctt gt cc t gt gt t t at c agaat t at t t ggat at t ggt cggt t aggt a aat ggaat t g ttttcaacaa aagt t aaaaa agt acgaat a cacat agt t c t gat gcaat g caccaat cac cat gt gt gct ct t agt ct ac t t t t t actgt t ctt gagaaa t gat t caaga t t gt gt cct t cat caaccca t ct gt act ac gt t ccagcct caccaaggga ct t ct cagca ct cct t gt aa ct gcct tt ag aacaat t gaa t at t ggat t t t cccaat cgg gcat caaccc at gaccacgt at t t t t ccct ccct t cat t g t t caagt cca t t agt t gct t t at gt gccct gcaaat t at g aaacgagaac t t gt aat ggt agacacagcc agcacgaaga t t t act agaa aaact ct gac caacaagcaa t at t t gt ct c t cct cct ct a t ccgt cggaa t gt accgct g aat t aggt t t at t gtgt t t a aat ct cagaa t agcct aat c gcggt t act t at agt gat gc aagcaat acc agt ggt ct t g act gacgcca cat at ct gct t gt gact ct c at gagt gaac gt t gct t acc gt t t gcggt t agct agaagc gagt t t ccac gacaat t t cg gagct gcact gaagct aagt ct at agt t ct ct t aat gcaa at gt t t aaca ct at at aaac at t t t aaacg accaaaaacc tt cctt gacc acaggt gt aa aagaacactt ct t gt at t at gt t t t ct cat aact t t t ct t t ccaaggaat gt at ct ccgg ccgagtgt t t gat t t at agc agct acaat c t t ggaat gt a t t ggt ct gt t agt gct t ggg aggct ct agg gagct gagat agagaaat gc aat agct t ct at gt ccccgt t t gaagct gc ggct t gagt t ttct t ggttc ct t ggagct g acact at at a tttttttttg t gct gct ct t aaaaacaaga at t gt t gat c t at t t gt aaa t gt t t at t ac ccaaat t aag t gt cagt gt c at cagaaaaa gat agaaaaa gaagt t gact cact t accac t at at t ct t a ccat t t at at cact ct ct t g cgct t gggct t aagt cact c gggt t gt gaa t t ct t agcct t t ggt ggaat t t t t t t t t at ct t agcccgg agct at ct gt aggaggagct catt ggcaca t cgt gact ct t gaaacaat t aagt aaccag cat t agt gt g t t gt aat aat act t ggcaac caat cat ct a ccaaat cgca t cgt cat t cc t accat gt gg cat caagt cc t act acat gt caaggaagaa agat caaaga at ggt ct at c accaacct cg t ggt t ccat t agt aaact gt gt aagt t t ga aaat ccacaa acaccaat gg gat t gct t t c t ct ct t caaa t t cggt ggt a t act t ct cca agaagacgat gaact t gt t c t t gcagt t t t at gt aact ct aagct gt cgc ggtgctggag aacact gt gg t t cgt t ct t g cat gt ccct g gaaat gagt g t ccat gt aac cact aaagac t gt gcagat t cat cccaat c caacaaagac t t ccaaact c agt gggt gt t t t gt cat cag t ct ggt t t ct at gt gt gcaa gt t ccaaagg t gt t t aacat accgct t aac cct aaat cct t gt caaat ac at aagaaaag t cat t ccacg ttact t t t ct t aat ggat ga t t ct ccacct ct t cggt t ac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 186 12689250 Sequence Listing.txt tccgttttag tcgacaacaa tgg 2003 <210> <211> <212> <213> 171 2004 DNA Arabidopsis thal i ana <400> 171 gagact cat a ct agat caat t gat ccggaa acat t at gat tcgcggtggc at aat cgagt ccacat t t at ct t atct t t g aggaattttt t agat t gcaa t gct ct ct gt t gat t t cat t agct t ct ct t t agat cact a t gagat t cac gcat t t cat a aaact t gat t t cat t gcat c t t ct t cat cg t at t at gct t t cccct t t t g gat t gact t g t t ct cct t ct cat at ct at a cgaaccaagt at t t t ccat c acat t gt t ac t t aaaacgt g t gt aacct t t acat ct gaag cgt cat t ct g gct gt gt aaa t gagact t ca t t t act cgt g t ggcggt gat t gat ccagt c ct agt t t gag t t agat t t gg gt gct ct ggt t t t ctggaga gt t agaat ga at ct gt gaga t cat cacat a t t caaat t ag cat ccccaat t cat t t gat t aat gacct at t cct cgggt g acaaaggat a aat ct ct cct t t t t ct cttt t ct t gt caaa aat at acacc aat at gggat aaat t cct ct t act agagt t at t ct t at gt t at ct t t t t g acct aaaat t ct t t ct t ct a at ggcgt acg t acaccgt ca ttcggaaccc cgaggccacg t t ggagt aga agat t t agaa t gaagt t at g t agct acggt t t t ggatgaa t t agt aaact taaaaaaggg ct gt t at t ga ct agt t t gct gcaat act gc tgctgt t t t c cat t ct ct gt t ct t ct gt t c caaaggt t t c act caaaat g t gt gt agt t t t aaat t cgag cggt t caaat act t ggagt g ggat gcagt a t caat ct at a gaat t t t t at t cat caacat caaacagt ac gat cggt cct t cgaccat gc at aaccgacc t aggaat t gt gt agcgat t t aat t gggaat aacagt aat t t t at aat t t t ggagt gt aat tagacacaag t ccgat agaa aaggat ct cc t at at cccaa ct t ct t cct c aaagcgaaag ttccgt t t t g t t gat at t t t at t cccgggt t cct t ct caa t at agagat g t at agaat t t tctgtgacag t gat at t t gt t t cacat gt t t t t aat t gat t cat gat ggt gt t caacat t gaat gt t cct gat t t cat ct gat t gt t t cc gt t ct cgat a gccggtgaag ct ccggt t t c ct t cacct t t t cat gagaga gat t t cagct ggt aat t aag t t cgt t t cag ttttccaaag act agt at at gt t ggat t cc t t ct gct t gt t gggct t gt a caagt at cag ccggaact ac t gat ggt gaa t t t act at ac acat accgcc t t t t at t act gagt ccagt c t t t agt caga t t ct at aaag gct t t cagat t acaaat t t g gt at t ggt gt at t gt t t t ct t at aacat gc t t ct cgagct cat t gat ct t acggacgagg gagat cgcac t t cat ggct t ct t act t at t at at gt ggga agat t gt t t t ct gat agat a ct t ggcct t t gcat acgat t gat gat t ct a at t aact t t t aacat agaca at t cat t aat cat t gt act t ttct t t ctca accagggat t acggat t gcg cgt t t aggt g ct at agcaat ct t t ct t act t t t at act t a at t t ccaat t gt aat t gt t c gccaact t ac t at at caaaa t t gggtcaaa act t t att ag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 cctgtttaaa gcaaaacgtg acaaaacata aatattcaat tgctatatac ggaaagcgat Page 187 ccagat at t t at t aacct aa aat at agt t t ct t ct cat t c gaat gat t aa aagct t aaac t ct ct ct ct C t ct agaaaaa 12689250 Sequence Listing.txt t gggcccaaa at at ct ct ct acgt cacat a cat aat ccca cgact agccg gt t ct aagac at t t ccggt t t aagt at ct g t cact cgt gt cct ct cct ca aaat cct cag ccgcat ct ca at gg 1860 1920 1980 2004 <210> <211> <212> <213> 172 2004 DNA Arabi dopsi s t hal i ana <400> 172 act ct gcaca t t t at t ccac aat ggt gt t t t t t t t aact c at gaagcaga aat aagt at c ct cagcaaat acaaat caaa agt t ccat ca ccccct cacc t at ct t gct a ct t t t t caag ggt ct gct t t t gaaaaat ga ccaat agt gg agcagcaat g t acat cacct aat t t gat t a caaaat gt ga t cgct gaat g gt t ccct at a ct at aat t t c aat cggagt c gat gaat t ga ggagaaat aa t gat ggat ct t gagagat gt aaat t cct t t cacgat t t ct ggat t cggt t agagt t gt gt cggt gact gc ggaaaacaga gagt agt t ca gct cacacaa gat t cct t ca aat gt aacag t cat at gt ct cagccagcgg at cagacccc t t ct t t t at c aacaaacct c accaacccaa aact agagaa tccaccaaag cgaagaaggc cct acagaat ct at gt accg t gat ccaaac gcgat agt ga t gagt ggaag agt caggct t ct t caccat c gt at act act t t acat at gg gt aaat ct gt ct at gcat gt t gt agcact t at ct gat gag t t t at caacc gcct ccaaaa gcact ct at t ct aaaagaat acat acct gg at gt t ct gca agagat cagg aaaccagt ca agct t gacga accact caca agt at ccaac cct t ct t gt a at t cct ccag ct aggaaacc tttcgaacca agagat at aa aagaggt gac acgaagacgt t aaat cgt gc at act aggga t at t t t caaa t aat ggt aat t t t ggt act c t ct t gacgt t ccaagcat t c aact agaaat aaagt aaaca gaccct caac t gt act cat g ct cat at ct t ggaggggt at ggaagt t cat at accagagc agt t agaaga ccagcat gt a t at t aat cgt ttacaacaga aaccaat agc acccaat aag at gt aaagat aagat t gagg agagagaacg ggcggaaat g agccgt ggct tagcgaaaga acaat gat ga t aaaat cacg gt t aaacaag t ccagt ggct agct t gt agc caccagcaac t cat t gat at caaact aggg t ct gt t ccca at t t cccct a gct agt cat a gaccaagaag at acaat ct g t caaat gcat tgaaacaaag at ct aacacc cact t gat t c t gccaaaat c tacagcgaga ct gt t t gaga cct t t ct cat aaccaggcac taccaggagg agaggat aat gt agaat cct agaagaagaa acat cacaat t t at t t t t gt aat aaat at a t caat aggca t at cct gaaa gacct gaat t caacacggtt t ct gt t ct ag at t acacgat act ct gt aac t at gaaaaga ttcccgcaaa at t gagaacc cat gacaaaa gaaaggt t ag ggt agcat ca ct t aat agcc aaagcaat gg gct gt aacgg t cccat ct cc ct aacgt at a at at agat t t t gaagagt t t t cgt gaaaat ccat ggccga aat gacaccg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 188 aaacgacaat t ct accgt t c act gct ccac ccgat acaaa cgacgacct g cgct t t t t t g ct t t t ctctc act t t aacaa gat ccat aaa aat cct ct t t aaaact at at acccgaat ac ccccaact at aaat t ct at a ttct t ctt ct cagat t cgcc 12689250 Sequence acgt cgact a at t ccct t t c gtctttcttt ttttttcttt att at acaat t acccgcat c ctaggtcagt tctattcgtg aaacatgact tttacacact tattattatt gtttgagcag ccact gggt c gatctttacg at gg Li st i ng. t xt aaact ccaat gaccaatttt cgaacaaat a aaat cgagt c t gacat aact aaact aagac aggtt cact t ggcgt ct acg ttttttaaac t gaaccgt at t cct ctt at t t cgt t t gt at aaaat t aaga gt acggatt c 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 173 2004 DNA Arabidopsis thal i ana <400> 173 ctgagacggc gcat ct t t aa tt cct aaaga cat cccct t g caaatccagg cat cccgat c cctcaaagcc ggaggcaaga agtgaggaga aaaaaacaga cgacaacaaa cat gaaaact tctct t gttt gaaaaacaaa gaaact t gt t aaaagaagat acggt gt cct gt t t caaact at cat t gat t gcaat t t cag act at aacag caaat aat gg ccct acaact ccctagcgcc cccgaatcca at ggt cacca tgatgatcga aagaccatca aaccat gt cg aggaaataga gagaaggagt gaaaaatgag t t t ctctgt g t aaccat aat t t cgt aaat t at ccaacct c act caacat a aagat t cggg t t aat t gt t a tt caatt aag ct t t cagt t a at t t t aggat aagat t at ga attcaccaga gtccctactg tcgacattcg cctagaagga gatcgtttaa cgt t t t act c gagccgt t ag gcat t gtgct catgcggcga agct t cccga aaaaaggact aaccagacgg tatgtgt t t t gaggtggagg ctaaggaagg t agcaaat ct ccat at gt at act t t t gttg ct t t t gcacc t caat actt a act ggat aac gt caact at a t ct t t t aact t cgacaaat a agaaatacag aaccaatcca ggct t gt t gg at gat ccaga tggagtagaa aaaagatct c atggccaaag ccctccgaga atacgggttt taataaggga aagacgagag agagaggatg cct at t ccaa t ct ct t ccat acaacagaat tggt t gt t gt at ct caatt c t at at t t t t t t at caat caa t t gagaaat a tt gct aat ca t t at t acaac aat at ct t at t gt t gt caac t cct cct cct tgtgagagca atacgcggt t aagat cggt a atgacgaacc aagaaatcag aaaatgagag acaact t gga t gagt aaaat gt aact t t t g at acacaat c ccagcataaa t gt agtt gat tgtctgcagc at gat ct t at gtggcactgg t ggaat ct ca aatt gagtt c t t acgaccat at aacct caa act t ct t t t c cct cct att c t t t ct acct c t cacggatt t tcaggacaaa atcacggagg atgtccgaga acaacgaaca aaaagctagg gaggaagcct ccaagccgaa ttttgagaga acat t gt t gg gtatccggag agtt actt aa gatgggattt t caggacct g t aat cct caa tatgaaaaca gaaact t gt a agctt att ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 atcgattcta gatagcaaac tcgttcatat atccatcaac aggcaaaaaa cacaaagact Page 189 ct aat agagg ccgggt t at a cgagaccaaa aat aggcgt a at agtt cgct at at at act a t agt gat aag t act t t gaac aaccct acaa gccgt caaca t acat t t t ca at at t t t aaa gat t t aat t g aat t aaact g t gccact t ag at gt gct cac aagt t t ccct acgcgt ct ga tt gggct aaa gaagaatgcg at gt cgaagc gtt cat caat 12689250 Sequence taactgtttg cattttaact t ctaacagtt agagt t gcca agagatgttg ggacccacag cgtggagagc acacggtatt cacatggctg cgtggttcga t ggcaaact c t ct caccttt t t t aat t t aa aact ct gct t aaacgaaatc gtttcttctt t cact at at a tt aaaccgaa gaggt t cgca ct t t t cct ct at gt Li st i ng. t xt gggt at at ac caaaggatt c t aaagt ggga ct t t agt aaa gaat at ggag ccgatgtggg gggct t t t at ccat t t t t t a at cgt t t ct t at ct cagt t g ggt t t aat aa t cagat act t accct ccaaa aggcgaaaga agt gaagt ct act ct ccaaa gggcct aat a agcct act ga ct t cct t caa t t agat ct gt 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 174 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 174 tgcatttttt gtagcttctt ggtct aagaggggaa taagcaagca agccal atcaactgtg gttttaaccc aaaacl ctctcaaggt attctagaaa gattt cattctggat gctcacaatc tttggl t cact ccgt a t gaagttt ca tt cga! gaacacttca aaactctgtg tttgas gagtagcact acctttcctt caact t ggcacact a ggt ct ccat t t cgac! tcgatctggc ctggactgaa aaaaa! aaaacgaaat atttt gcgca t gaaal gttaagacat aaagcaaaaa taagg at cagtt cca aggcgctt aa gactt acgcagaatc tcaccactgt ccctt agt t gccaaa act at ct gt a t acag gattaggttt tgctgatatc atat a tggttgatac cttgtcacgt tgct g taccttgcgt ctccttcttc atcggl atttaccaca aactaaacaa at tcc gt agc acat t aaaag acct t tagt c gacac ggt t a at gag gt t t c gat ca agaa cgat g t ct c at gt a aaaca catt c agact tat gg agaaa caagt at t t c gcaatttttt agt t t agaac cat accgaca ctt cgagttt cgat at agcg t at aact caa at cct gaaag at agt aaaaa gt gt cat gat agacact t gc t t accagt gt aacact t t ct agct gacct t at gaacat ca gt gaccaat a t caaccagct gat act ggaa gct aaacaca ccagacaacg gggt ggcat a at gct cct cc t ct aact agc t gct gt gt t a aacct gt t gc t at aacat ct gct ct t agt t tcaccgaaga t t t ct gacat t caaacaat g at t t gaagca t t gat at t ag t ct gagt acc at at caat t t t gcat ccaaa act gat at aa at aaacaat g acagt gct aa agccgcct cc ct t ct accag gacacactt g aagct gt aac acaaact ccg agaat t cgag gat t t t t aag gct cagcgaa gt ggt acgt a tt gat gat ga t agaat at ga agt caat gt a cagcat ct ac cacat actt t t t at aact t c gt at gat aaa agat ccgtt t at actt gaaa att caccaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 v v v Page 190 12689250 Sequence Listing.txt attatatata aagtataaaa cattcacata aacacttgca atcaaattca agagaaagat gccct ct t ca aaat gt cat c t gcaggagat t cggaat ct c ct caat at t c cgaaagact c t aagat gacg t t at ct act g taagaggccc aggt ct aat c ggcggt acat aagct gact c t aat ct ccga gaacaat gaa at t gcat aac t gaaat t cca aaagagagat ct aat t t cct t cct ccgagc ct cgaat gt t ccat accgga t t t t gat t ca at t agagt ca t cagat t t cg cct t cgat at t acgct ct ct t ct agat ccg gat aaaaact t cagcat ct c aat t t cat ca t gt t ct t aca at act ccat a at t cggcgat act gct gct g ggt t gt cgt g cgt at t t gaa aaaat at at a at t t at t t ct ct t cgat aaa caaaagt cga at gg cacagct t ca at gagat t ct aaact t caaa gt gcccat ag gcat t ct t cg agagt aacac accaccggag gacgt ggt t c t act ct gggc t t gaat aat a t cgagat aat tacgcaagag at at cccaaa gcagt gt caa t t ct cagt at at t gaacaat t aact t cgct t cgacgct t t ggcggcgcgt t gaccgt t ga t t ct cgcct t t aaacgat t t t ct t gt t t t c at gat t at at t gagct t cac t t cgat ccgt ccaggcaaga t gcaat t gat t t t gct cccc ggact acaat gat gt t gaga cgccgaaact gaat gccaaa gt aggt gacg at cgccacca gggcct aat g ct aat at gat t agat gacgt ggt cagacaa tttccggcga 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 175 <211> 2004 <212> DNA <213> Arabi dopsi s tha i ana <400> 175 ttcaatcgta aggctgcaga gcaatcagct tccagtgagc taagccaatg tgtttcacaa cgctcgtgat tattgttaag ttcactgcaa gcgaat t gt a gct t t gccat ct cagcagt a ttaaagggaa gct cat ggt t agt t t cagag cgttaggggg cattctgtat ttgtctaggc gt t t aaagaa t ccact cat t aat aaagt gt caacttctag agggaagatt cttaggctgt gaat ggct t a ttaatgcagc acgaaacggg t caaaagat g aat t acgcga t t ccat t gt a agt t acacaa t t cacat gt a ct gt agagat aagaactaaa aaagat gct g agat t agcgc agacactaaa aacaaccttg ccaactacaa gaagacaata caggcagcta tcccgcttaa ggctcagttc aagaagaaag ctcagaaagg gccgaggt t g cct gt t gt aa aact at t gaa gt cct t at gc t t gaaggt t t t ct t ggt t gg ct t ct gt ct t t t ct gt aagt at t at gcaag t cagt gccag t ggaggcat a t gggagaact cact cagct t agaaaccgt c aaaacgcaca ct ct ccggga gcacct cat c cct agcct ag t t agt t t gag aaaggggagc acaat gat cg at cggat t t a t at t gct ct t aacct act t c aaggt cgaaa aagt acacag gct ct t t t ag act gt act t c aagact ct t a gt agct aaaa at t agcaaaa t t t ct t at t t gggt t t agca cgt t gaaggt t gt gagaat a agat t caaga t gt gt at t gc gt ct aact t t aagaat acgc ct ct t gggag gagct gcaac cgat ccagt c ct t gt gagaa aagccaggaa at cct gaaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tat t at cat c cct cct caac ct act gat ca t t gt caaaac gat cagt cag agaaaat t ga Page 191 12689250 Sequence Listing.txt gacaacgcca gagt gt ggag cgggt t aaag t ct ggaacct agagaaaatt ggcagcattt t t cat gcat g gacaacgcca gagt gt ggag t cat agt t ac cacagat t ga t t ggt ggt cg at ct acat at acaaacccaa aacaaaagt c aaaaacagaa cagagat at a t t gt cgcacg aat ct ggaac acagagaaaa gaggcagcat cct t cat gca gagacaacgc gggagt gt gg aacgggt t aa aat ct ggaat acagaagcaa t at at ggt t a agagaagt aa at t gagt cat aaat gaaaat cagaaggt cc caat agaagg gcccaact t g aat acact ct aggaggaaaa ct t ct t cat g ttgagacaac t t gggagt gt t gaacgggt t caaat ct gga agacagagaa aggaggcagc ct t ct t cat g gccacgcat a cgt agccat t t at t t gagcg t cgaat aaag t act ct caaa at t accct ca t t cat t accc ct gacccaac gcct t cct ca at gc cat gaacggg gccaaat ct g ggagacagag aaaggaggca acct t ct t ca aat t gagaca at t t gggagt cat gagcggg acagt gt caa t t gt gat t cc gaaaat caaa t aat gt gggg ct t cat t at t ct ct t cat t a t cact ct t ga aggt ccgt t t t t t cat t t ct t t aaaggagg cagcat t t gg gaacct cct t aaaat t gaga gcat t t ggga t gcat gaacg acgccaaat c gt ggagacag ttaaaggagg gat ggagt ct t at ct agt ct act aat at ga ttgcaaaaca t gt gggccgg t t t gt gggct t t at t t gt gg acgaaaccct t cggcgat ct cat gcat gaa caacgccaaa gt gt ggagac ggt t aaagga t ggaacct cc agaaaat t ga cagcat t t gg t gt gagt t t g agt agt aat t t t ggt acaat gaagct t cat t t at aaaaac ggt t ct aaaa gct ggt t ct a aat ct caaaa agggt t t t ag 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 176 <211> 2004 <212> DNA <213> Arabi dopsi s thai i ana <400> 176 aaaaatattt ttcgtttcca aaatttgttt tataaaaaat tatctttgag ttatattatc aacgcaagcc ttgaaataaa gtctcataga aaagcctttc aagacgatga gcaacaaggc acatcaattg tttggccttc cttgatgttg gacttcttgc tgatgatcct ataaacctcc t t t aat gcct cgagagccga t gt ct cgat g t gagcat cct ctgtggcaac t gct ct gaga at gat cact a t at t ggaat c t gcat ggt ct tcaaacgttg ttggcttggt gacatcatag t aggcgct ag t t at ggct ct gt at cgt t ct acagtccttc cttccacctt agaacaaaga t gt gt gaacg aacaacagaa t at t t ggct t t t ct t agcgt gcat t t gcag gagagaat cg aaagat aagc t gct t cagat ct t agt t ct t accaaaagag tgcccagccg t cat aggt t t at t ccgaact aaaagaaaga t t t t t aagt g t t gat t caga t t gt ct ggt c t ct gaaaagc ct t ct t t ct c ct gt ct t gt t t t aaccacct ct ccaagt gc t gt cccat at t t t aat aggt ccacaat t ga aaacaaat ac caacacaaac agt agcagcc t gaagat at c ct t t t ct acg t gcat agct c cccaat caac gct t acat t c acct ct gt aa ct gagct t t c t aat t cagat 120 180 240 300 360 420 480 540 600 660 720 Page 192 12689250 Sequence Listing.txt ctaatgagca gcagactcta tgat t agtgt gtctgcaaca acaaaat t ag gct aaat t cc cat act t caa cct t aaacaa at cgacaaat at ct cgacca cccaaat gt t cgaat cgaaa cgat t acact agcgaat t cg agagaggaga gt cgt at t ct t t t gggat t t ct t t cccct t cct t t t cttg t gct aat aaa aaccaaacaa ct t ct t cat c t t ct t ct ct c at cgaacaat gagct cact g gct t gaat t c acaaacaacc aagaaaat ca ggt t t t agat t t gggaacat aat gaaaaaa cct acacaca agcaccggat at at act at a acaccaat gg t t agact t gc t cgt ccggt c t gaaat agaa ggat ct t ct c t caagt t ggg agggaaagt a aaccgagcca at cat ccct t t cgat t t gct ct t t t t t t ct ct t gat t t t t cgt ct gagca cagt aagaaa t gt t cgagat ct gacat gt g t at caat aaa acaggcacca at gact gct t gct gct t aac t at at agaga t ggat t t gga cgacaccgga t t ct cgccat aaggagaaga t at t caacct ct t act aaaa at t agagct t aaccagagga caacact cct ct cacggaag ccagt t agct t t t cct t gga at gg act agaaact caat t t caga agggagcctt acgct t aaat cacat gt aaa ccgagt at t c t t agaact aa gagat gt gt g t t ct aagcag gt ct ccgat c t gt t acact t t gacagaaat t ct cggct t g aagcct acat aaccggaaaa at acact caa t t at agct t t at t ct t caga cat at t cat a accagat ccg gt gaaacaac t gaagagat t acact accac t aagagat ac cat caaaat t at acgaacac acct agat cc cgt act t gga aact cgt t gc aaaaccactt t t ctcgagga aaat aaat aa tt att gccca at at gt aat t taaacaaaaa t acagat t t t t ct ccgcct c at at ct ggt a t t t at gat ca at at cagaag t aact t cgca aaccat gat t ct t agat ct g gt accagagt caaagcaaaa caaaaccaag agat acggat agcaacattt gagtacgagt gagt gaat ct t aaacaagt a gt gct t t aga t t t aat t t ct acgacgaaga cat t t at aca t cagaaat aa aaaaaccct a aact t t t t gc agat ccat cg t gt caat t gc ccccaagt t c 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 177 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 177 catgagaagg gagaagaaga taatgtaaga agtttagatt tgattattgg gtagttttta tgt t gagtaa gct t ggaatg tct t gt t aaa gttaatattg cttattttag gtttcggttt taaagtattc caagaggact cgtggttttg tatgtcaaca aacgtactat tcttttttct atgt t atcta aaaaccattt aacaaaaaaa aaatagtaat ttcagtatat aatagttttg gggttgtgat ttgtgaagta atctttggaa gt gaagt gga gt t at gagat gagt t t aaga gt t t t t aaaa t t t cagat t a gat at t t t ac at aaaat gt a aat t t t ct t t gat aat ct ca Page 19, gacat t gt t c t at aat t t gg at caat ct t g t t t cacct t t t t t gaaat at aacaaaattt t gagaacct t t at t t agaat at gt aaacaa aaaaat t agg t ggt gt gaag t cat gt gct t ct t t t t agt a t t acat acaa gt t ct t t aac at aaaat aca t t cgt t t aaa cacat t gggc 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt ttctcatagg cccaatataa acaacacatt gatcat t atc catgtaaggt gaactcttta at cgaaaccg at ccact ct a cct t cct at c at aacgaaac gcct ccacct cacagt cat c aaaat cct ca aact t at cga gt cct t aat a gt t accgaat act cgct t ag at gaat t t at t t t at cgat t ct ct t gt t at gt gagt ct t g caaaact t t c at ct gggt at t t ct ct agca tt gctt ccac agt t t gt aca gagt t agcct act ct ct gat t t t t t t t ct c gt t gt caat g t t caat t t ca aat t cgt aaa cat ccgt caa att agcccac gt aat t gt cc t ct t t ct t cg t t t ct ct gag t t ct ct t ct c ct cgt ggat t t t cat gt t ct at t t t gt gat t t ggggt t t t at agt t gt ga t gt cgcgat t gt gcaat t ga cat t t at gt t ttgct t t ct g aaagaagt ct gct at t gct t at at at t gcg at t t t t agt c at aaat at at t gt gat ggt c aat aggt aaa gt cat ct caa cacgt gat t t cgat t aaat c acgaaat ct c at t t aggaga caaat aaat g aat ct ggt gc gt t gat t gt c t gt t cat aaa t cgat t t t ga ct gt gt at gg aggt t t gt t t tcgtttttag gt ggcgat t t gat t gt t t cc gaaaaggttt at t t cggcaa t t agt cat t t cgt ggct gat t gt aacat t g act t t acaaa gt at acat ac t ct t gat t ct at ga t ccaacggt t ct cact aaat caat cgt t ca ct t t t t t t t t cagcgcggtt t aaccat ct c t t t agct t t g t t at act t t t tcgt t t t ct g at ct ggat ga t t t gt ggct a gat t acgt aa t t gat t gaga t cagt gt ct t t t t t t t t act ataggaacgg gcaaat gt cg gcaaagt caa tgagagagac t cct t cgt t a t t t t ct t t t a t ggt t gact t aact t t ct t t agt at t ccaa at aaaact t g at at agagat t t gacgggat aat t t agcat t ct t ct cgt t gt gagct ct c agt t ct ct gc agcagt t agg t t gt at gat g at t t t ggttt t cggact gt t agaagaaaat at gaat caat agat t gct t g at gt t t aggg at t t caaggt gagagttttt caat t aaact t ct gt t t gag t gt t gcat ca t t at gggt at gt t t t atgt t ggagcacgt a at acgt t acg ccaacggct a t at acagt gg t at ccct cct tt ctt caaca t t t ct t atga t t aat t t at t t aagagt t t t t gat t agt ag cat cggt ggt t act t gaaga aaat t t cat g t at gt t gcat t ct t at t gaa t t t t gt t gt a t t at at cact act t t at t t a aat aat t t gt gaggaggacc t t caat at ga ggt t ct ct gc t t t ggt t t t t 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 178 <211> 2004 <212> DNA <213> Arabidopsis tha <400> 178 gcacaacctt ctact t cttc ggtgctgctt tgccaaaaga t at ct ccct a at ccct accc catgatagat actaacacca cacctttcat tctcttcatg Il i ana cctatgt t ct gaaaagtgaa acccct t agg tttgctact g ccaagctaac tataacagga gaagaaattc taaatgctag aaaaacattt gtttattgtc tctcatcaat caccataaat ataatcactc act catgcat cctagcatct ttaccctggc ccacaagatg tttacacatt caagcaaaac cacatgaaaa Page 194 120 180 240 300 12689250 Sequence Listing.txt gaaagat atgt t ccggt gt t gcccct g atccttaaga acagaagaaa acgg aaacat ct at aat t t caaca ggct t gaagt aat aat t t ac agaacaaat c at t t ctt agc gagcaccat c ccagatt ct c atcagaacaa t caaaat t t g tcgctcagcg gat ccgat ct cact aat t t a cct cct t cct tccgtct t t t agct t gcgt g t ccgct t t ct tatat t gggc tgttaaaaac at t gagact g gt aat ctt cg t t t ggt t t t g ct ct t t gt aa t gaatt cct t aggact caat ggtct t gagt t t t t at t t t t t caat gct cc act aat caaa tgtcacacag gatcagacaa acaaaccaag t t t t cct t t t ctt agagaat t t gat t at cc t ct ct ccat c aat cagt at c aaaaaaaaac act t cacgct t gt at at cag att ct cagcg cacactaaac actt acggcg agaactgagg aat ctt ctt c ct gt act t aa t t aaaat tag gct t gct tag t ct ct t t ct c gat agt t t gt t ct agat agc t gt gcgat ct t agt cagat a gatgagact t gt t cgat at t taagcat t gg ct aaacccat caatcaacaa act ct ct t t c cat catt cca gtgccctaaa ctaagacacc tgat t cgacg tt caacct cg cccacacaga gaacctcacg cacgct ct t c cgt caagt gc acat cgt ct t gct gct acac gcgct t t acc aagaaaggga agagtgggaa cat aagt cac ggct t t gtgt ccgcggagaa t aagt ct cgt ct ccat ct gt at t t ccat gt gtaaaatggg t gaaacct cc ttagtgattg ct t t t gcagg atgc catt cacaca aagt t t gaat cagaaaacac acaatt ccca t t t agaat cc gat at t t ct c cagt ccccat ctt cct gt ct tttctccgga gcgagagagc t at at aact c tt gctt aagc cgaagcaaat at aaat acaa gacggcgaga cagagacatt aaat aaat aa acgtgtggca ttcgct t ct t ct cagaact c attcacaagc gaat t cat t t aat t atgggt ttagggattg ctggcgt t at agct att at a caagaggttt t accgt gat c agagcaagaa aat act t t ct tt gcat ccat agat t ccaat accactcaca aagaacact c agcgat ct cc ttctccaaaa t cagct t t cc t gcat ct ct c tcctccacaa t ct aact at a t ct ctcct cct t tt ccgat ct t cacgaccttt aagcgggctc gcggattttt ct caaaccct at t cgct gca gtat t gcgt c tgtgcgggat aat t gt t aat at at t gagat t ct ct t gt t c aacat t t at t gaagaagcat t at t t cct ac t aaaaacct g agcacct t ga t cat t gaat t cataaacgca t t ccaacct c gt aat t t cca t t ct cct t ct gt ct gt caaa gccctaaaaa aggcagctt c t ct cct t cat actt at ccac t acaaaat ct ct t ct ggt t t t t ct cgt t ga t gt t t t t t t t ttattgggcc ggat t acaca agaagat at a aacat ggt t c atgtgt t t gt t at aaaat ct gt t t at aaag ct caaaagat tgtgact t ga gtgt t t t gat ttgaagaggc 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 179 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 179 ggtcattgtt gatcagaaca gaattaccac tctcatcctc tgattcattc acatttgtct tcctccgcgg tgaccttccc tgttctattc ctcctcctcc tcctccagag tttctccgat Page 195 120 12689250 Sequence Listing.txt cat ct t t aga t ct t t t gagc ttaacgacgg ccaat ct ct t gact t ct ccc cat t caacat gagt agt cac t cgagt t cca t aggct caag t cct aat ct c ct aaat t ccc aaccaagcaa t aaat t ct t t t at cct caag cagcaaat cc aat gagaaac acccggaacc cgat gt gaat cgggt t gaat ggt t agat t t agagagagat cccat aaacg aat ccgccat t t gct t t t gt aagaaaagag act t gcact c agcaat t at t at t gaat cgt aaat ct at ct gt aaaat ct a t t acccaat t cgt gt gt agt gaccaagt cc at t aggat ca t t t at caacc caaccat t cc t gaaat aat c t ccggt act c at agct cat c t t gct t gt ca acct t cgt gt ccaagt aaga at t at caaca t ct caccaga ct cagct t gt aacacct ct a at t agt agaa ct caggaccc agt ccaacct t t t gt t at cg t ggaggt cga agaagaagac gagaacgagt taaaccaaaa t gcagct cac t ct gct t cac at gt cgaaaa at cat t at t t aaaggagggt gt t ct gaaca at caaat caa t gagct t t t g t cgat aat t g t gaat caaat t cagct t caa acacat ct t a at cct cggat acaagat t ca t ccat aacaa gcgt at t cag tccgaaccca agcaagat at aaat acat t a ggact ct t aa t act cgt aaa t t ct t gt gt c cct ct gt t gt t acacaat cc acct caagct gacaacat aa cct cgt t ccg t ct t ct ct t t at ct ct t gaa t caagagat t aaaagaacga at ggaagt gt t gact ct t ct at at aat aat gacaaacgt a t t agt at at a agt at cgt t a agt t t t t t t t at cgaagaag ct t t at ct t c t ct t ct t cgc at gg gcat at gaat aagct accaa ccaaaact cc ct t ct cccgg gaacaccgaa gagccacat a agagct t agc t act cgat t t accct t t ggc accccaaacc ct aacat cct t t acgcgt cc t gagcagat t cat aacct cc ct ct caacgt gt aaaccct g ggt acgcaat gt t ggt t t t g t ct cct t gga t gt t ct t gt t t cgcggct cc gt ct cgagag t ct t ct t ct t cat cacagga ccct cgaaga t t t acact t t t t aaaggt at t t gcgcgt gg gagagaagaa gaat ct aaga t ggat t gat t gatatgaccc atttttggcc aagggt acgc t t ccgcat ct agcccggct a gct at aaaca cccaaacgt a taacccaaag aat at ct cgg t gt t ccaaga accaccat ga at gagct cct aat agct t cg ct t t at agcc t t gt ccaat c at accaat gt at cacccgac ccgat gat ct gt gt ggct cc aacaacggga gct t cgt cgg aacgcaaacg ct gat t gt t g ct t cct ct ga ggt t t t t ct g ggcaagct gg t gccact t t t agt caggggt t ct t t ggt t g cagagaaaaa t caggaat t g t t t gt t ct gg t t caat gacc cgat t cgt ca t aat ccacag t cgct cct ct cccat cacac t cagaaact t t gaacaact t acaat gt t ca at ccat t gt t tcaacgcaat act t cgact t accat t gat t acat t ct cgt ccccaacct a ccggacccag t t accaat t t ggt gt t gggt acaat cgt at t aaat gaacc ccgagaacaa act gagt gag ct ct ct ct t t aagt agagac ct t t gat ggg act gt t aat g aat at t gt ca t gat t t acac gaggt aat ga at cagggt t t aat t gt t t at 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 180 <211> 2004 Page 196 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 180 ttttagcaaa t cgt ct t cgt at cat cct t g acgt ct t t t g gcgagat t gt acggcaccgt ttaccaaaac aaat t aagt g gt at at t at c cct t t caaga t ctt gaaaaa at t agcagct gagt gt t at g t t gggagact agaaaaaaaa aacccaattt ct t ccggt at ttgtgat t t t t t at t gccgg t at gacaccg aat agaat t t t t aacct cga aagaggt t t g t act aaaaac acat acacgg gt t at ggcag aagatggt t t aaat t gccaa gcgaat aaag agt t t gggga gagaagatgt at gat t aaaa t t gaaaat aa t t t ct t t t aa t t at at cgt t cat aagaaga aaaagagt aa t agct ccggt cat gt aagt g gt t acgt t aa cat t t t t aaa t t t gt agt gt gat acat aat ccagat acag caacaacat g t at aagggt a aaaat t caaa cct ct at ct t ggaaaat aat aat t t at gt t t t t at at cga aaggaaggt a t cat gat act gaacat at t t gaat gt aacc at t at t ggt t t aat aagt t a gct t t caaac cct at ct aaa agagagggcc ct t t t t gaag t t t gt at aat tacaaaaaaa t ggt t cat gg ccacgt ggat at gt t t acac t t act t at aa aacagagagc gagagagaga caccagt cag t gaat gt cca at gaaact ac agt t at ggt g tcaacacaag aat ct aacgt agcat ccgaa ttgct t t ct t at t at t act t t aat t gt at a t t ggacct t a ccagagaccc gt gat caat t caacat acac acat t aacat caat t t t gt a agcaaat acg gagggaaat c at t agt t gat acact aaat c aaagt t gaat gaagct gt ga aat t ct cgca aaaagcatt g t aat gt gt t t aaaaaaagac gct at cat gg ggaccat at t cact act t cc aggat cacga at t at caat t atggcaggag cgaaaagt ac aat t t at gat aaaaacat cc aggacat gaa t cacaat aac acaaaaat ga tggaacagaa cgaccaaaac gt t at agt ct aaaat t gcag ct cat aaact t t cgaat t t a t t at gt aaaa aaagccaagc at at gt aaaa aaat t gcgat cgct aacggt gaat ggcaat t t t t t t gaat ct aacgaaat ggat act act gaggt t gct t aacaagact a t gt aat t t ag gt aaact at g t aat aagaag gcct aaaagg at at gt gat c t gacacgt gt acaccaaaac at t aacaat t aggcagaggc ggaacgattt t t t aat gt gt aact at t aac cat gt t t caa ttaaagaaaa aaaacagaca ggt cacgat a gacgatggaa ctt caaagac gat t t cgt ga t agcgt t cag at t t gaat t t agagt t gggt at gggagt ga t at aat at cc t gt t t at agg t aaagat aag cgat cct t t t gaaat t t at t ct gct aaat t ct at at t ct t t gat ggaagt agaat gt gt g ct t ct t t cgt t t t caaagt t aat t t ggt at cct t t t ccat at at t gct t g ccct at t cac at caat gt gt acacaagaca tttggccacg ggaggaaaca aacaagcggg at gacaaat c at t aagt gt a aat gt aat t t tggcaagagc gcaaaggaat t cgt ct at cc t t caat ct t c cct t ggt t t c ct acgt t aca t at t at t at t gccat cacca ttcgagcacc aat aat t t t a t at gt cccag t t at cat t t a ggat ggt t ag agt t gat t t t t gt t gt gt t g gctaaggacg t t at t t gaac t gt t t ggt t a ct t gt t at t c at at aaat aa t gt t cat t t c aggt t t aaac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 Page 197 12689250 Sequence Listing.txt cctaatcctt tcttctcctt catttttcta cttcgatcag acgccaaaca tcagaagccc tagagcttga gccgtcgaaa atgt 1980 2004 <210> <211> <212> <213> 181 2004 DNA Arabidopsis thal i ana <400> 181 agagact aca acgagt cgt a t ct t caaat a t gt gt t gccg t agt agat ct ct t ct t t t t a t cacaaaagt t ccaaaaat a tttttttttg cgat t aagac agaat t gcaa aaacagct t a cat at aaagt act t t aat t t cat ct gt aat at t acat aat t cat gcgt at t gaaaaat aa t t t ct t t t t t agaat at aac at cgt cct at t t t gt at at c agat aaacca t ctt agaaca at gt t gt t aa agataggcga gt t ct t t aga aat gggt t t g ggct acat gc cgt gt t at cc ggt aaacacg at aaaaat aa t aagt gt t t t t t t gt t ctaa ct at ct acca t t at t act t c aacacaaaat acat at t at a t aaat t gt t t agat caaagc ct gaaat gca caagat at aa at at aat caa ct caaacgt t at gaact at a t t t t t t ctt c t ct t ggagca t t t t at gcaa ct t t agacat gaacccgtgg t at ct ct aat tat t t t t agg t t gt t at t t g t t aagat gca t t aaaacat a t t t t at aat c t cgagat ct c at caaact at gat t cact t t aat at t at t t ttgtat t t ca t cat gagaca t t t gaaaat c act t t caagt t gt gt at gag aggat t at ct acagat t aaa t gat t t gat g at at at gcat aaccaat acc ccat t aat t a at acgt acgt cat at gcgt c at aaat at ct ttgtat t t gt at agaaagga t aaaaat aaa aggt cgcat t t gt caat gaa at ct ct t gaa at t gt t ggt g aagaacaaac gt cgcaacaa t aat gagt aa at t at cggt a aact t acgag t t gt t t aagc t aat ggt t t t ct t at t t cat t caacat cgc at t gt caaga cgt ct gcaca ct t aaact ac gggt gt agaa at t t aacgaa aat t gat act cacgcat t at t aaaat gaag t ct at at at a at t at t t agg at t agct cgt ttttcttgga at gagagt t c cacagct at t at accat t ga t gcaacgt ga at gt acaaac cggt t gat t t agt at ggagt ct acct cggt cat at aacaa at gt acat ga cgt aagacac t gt t t at t t c ccacgat t t a gaaagcgttt t acat ggaaa ct t t cagt at gt ct at ct ag aact aaat t t act acacacg gt t cagaat t aggaaat t aa t t t gaagat a ccaaat agaa aaaat t t at a at t acgat t c gt t gaaaaca t ct t caat ct gcat t gt t t t gct t t cat ct gt gct t agga t t t t gacct t at ccat aacg at at at ccct tgt t t gt t gt gt t ggt gggt ct cat t cgat aaaaaaat ct at caaccaaa aaacaaaagt acat t t at ag cat t t gt act t t ggt t act a at at agt t ct ct t at t t t gg t t at t ct at g t at at gaggt t gcat aat ag t gggat t acg aat aaaagt a t gaccat ct t aacgaccacc t at gcat gt g accaat att a aaacaaaat g aggt t t aat a t t t t t t ct t t t t ggat cct a t ggt accct c aggt at at aa at at accaac t att ctt t ca gatcgaggat aat gggt at a at t t gt gct c gt aacaaaac at t act gaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 gcccaatttt agttcaacat gagttaaaac agattttaaa actaatataa aacgtcgtca Page 198 at t t caaat t aggcccaatt t t caaccat t ccct t gct cc t at cact cgt ct aat ggt t g aaat t gaat c t cccggct t c gccccact ca t ct t cct acg 12689250 Sequence Listing.txt gatcgattta aaagcccaaa tataagaagg tgatcgt t aa cctaatatat caagtgagag tgttaatcgt cgtctctct c cgcgtttttg ctttgtagcg ccggaccgga gaagataaaa caggaagttc tacatcgacc tgagttctcg tacttcgtcg at ga 1800 1860 1920 1980 2004 <210> <211> <212> <213> 182 2004 DNA Arabidopsis thal i ana <400> 182 act ct t ggca t gt at at acc gct aact t at aagt t t at t g acaaggt cga aat aat t t t c agt aat caaa at ct caagca cagct agcgt aacaat agt t gt t t t cct aa gccaagcttt t gt t t t atga t t t ct agcaa aacaacgaaa t at ct gt aat t at cat aat c at gat t at t g t t gat t gt at ggat t gt gca at t caacgt t cggat ggaat at acat at ac t t cgt acgt c accccacaca aat gat t t t a at t t gaat aa t at t at t at a aat at t ct cg gt at ct ct t t t t ggat acaa at aat ct cac agt agt cat t t gt at t t t at t at gt t aagt aggagaacat gact caacat gcgcat cgcc act aaggt t t t t t t ct at at t aaaccgt at aacacaaat t taaagaacaa t cgaaaaat a t aat t t ccaa ct ct at at gc at gcat gt at at acact gat t ct ct gt agg t gt ccat gt c gcaat caaat acaat t gt ag t at at acagt t gaat ccaaa t t at at t t t t t t acaat agc at aagcat ca aaacaaaact aagagt at aa tagaagaacc ct t agt cgaa gcgt caat aa cact t gt t t t t gaat ggaaa act gt at at a at ct ct act c at at at at at t t at t at agc t agaat ggt t t at at aacac aat t gt at t c acgat ggt gt acacacgt at at ccaaaagc accaccaat c taagcccaca t t t ggt gggt t caaat agt t aaaaaaaaat gagaaatttt t t gtaacgga at ct cgagat ggt t t t cact t aaaaat aac accaat at at gt t gct t aga gt t aagat t t t ct t at t acg t t caact gca t t t t at t at t aaaat ct aaa at at gt at at t at acgt aac gat ccat gca cgt at t t aaa acgat ct aag at acat acat cacgt at acg gtgcaaagga caat cccat t at acaaaat g aaat aat t t g aagt t gaagt gact aacgcg gaat t at t t c gat at gcgt a aacact t t aa acaagagt cc aat at aat ga gat t aat caa agct ct cgga t cat t at t t t at gcat aaac t agt gt aat g at t agaaccg aaat aaat t a at aaat t cat t t aacggat t t gat gcat ct at t ct t gct a gct t t gaat g ggat at at aa cgt at at acg t at at at ggt t t t t ccact a t cct t at agt at t act cgt t agt cggagt c tcgtat t t t t gcat gt t t at t aagt at t aa at ct t at cga aacgat t acg t at t caaat t gt t at t ggga gat cagat ct at t aaat act aaat agt cac cgat t ccaat t accacagt t t at at at t t t cgt at aaaat tgt t t t gtgt aat ct agcat at aat t aat c aaaacgt gaa cgcgcct at a t at t cgt agt ggat aat gaa acat t t gt aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 Page 199 ccacgt gaga t t ct t t cacc aat ct t t cac caagct gt gc t ggct t at gt aaat gt t aat t t cct aat ct at t t t gccac t t t t acggt a ct at t caaag gt t ct t t at t t acgt gggaa at t t t t t cca gagagagat g aat t gat ggc gt aagcccat gt cacagt aa ct gagt cgcc 12689250 Sequence act ccat gat t caat aacac ttctcattgg tttctacaac ccatatatat ggtgttcatc t t agaacagt aact t gct ct gcccat aaaa aaagaaaagt tttatgggct ttttaacaat at at ct gaga t t t cgat t ga acat t at cag aaaccataag at gg Li st i ng. t xt t gcat t t gt a gt t agagt ca aaaact ct t a t t acat gt gt aat t t gt caa t act gat aag agt agaaacg t cct t t acca t ct at agat t caaat aat ga acgt aat cct t gggagt t t g aaaaagat gt cccat gt t at aacggt t gaa gt t agt caga 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 183 1998 DNA Arabi dopsi s t hal i ana <400> 183 cgacgt t gcg cggat cat gc at acaaat ca cat aagt t t a gct cgt at ga ct caggccat aagct t gct g ggt acggcgc ccgagcct t g at ccaggact gct agagat t ct ct t gaacc gagat at ct a t at ct ct ct t gcaaagcct a t gt gt ct t t g agct t aaat a agt at at aca acccaat gaa aat at ccat g t ccaaagat a caaagaagt c at cgt t t ccg aaccct ct ga t at ct ggt t t t cct agaagg ccaacgt t ct at t t cggct c ct aagt acat at at at at gc act t t gacga t ct t aagccg at cct t t cat ggt cgt t gat gt t t gcat aa gccgagat gt t t t t caat aa ct agt aaact ct ct aat act t at gt gt cct gct t gaaact aat gt gct t a at cgccgat g t gaccat cat at gct acat c ggaaggaaaa act t gaggct cct ct t cccg at ccaaggaa gccaat agag t ct agggt gt at at t acgaa gt gcagt aac t act caaat a t ccaagt t t c aaaaaaagaa accaaggat c t cgt gaaat c aaaccaaaac ccccct caag ccacaacttt gaaaat aaac agcgaccaga cagcacaaac cgcat cgt cc t at at ggagt t ccgt accag ct t cact ccc t ccaagact c cct gat t ccg t ct t t t gggc gt ggt t t at g t ggaat ct gc at gcat ccac ct t ccat cgc t cgaagct t c aaaaagaaac ggt t t gacct t t cat t aagt t act aaccaa at gaaaggaa t t t t caat ga gt t acgagct at cat cgaca agat t ct cga aagcct caaa acgcgt ccga agt ct t t ggt at ggt t acgt ct ggagagcc at t at ggt t g ct aagggagt agat gct t gg gt cgt cat at acaggcccac caacaacaga t t t t ggcct a t ggat agt ga t gggaact cg gt aaaagt t a act t aaccac ct t t cat ct t gagaagcat c t ccggt ggt c t gcgct t gcc gaaggaact c caagt t t cac aggt aat ct c ccaacgcacg t cact gcaat gt gggat ct c gat gt act t t gat cat cgat agct at acct ct ct ccggag cgcggcat at agacaacaag at cct ct ct c at t t gct at g t t aaat t gt g t acat gat t a aact aaacca ggact t gaaa aat ggacat g t t gt t agt ga caacaaagct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 aacat cgcca caact aaacc aagt at at ac act ct aat at ttgttctttc gattcataca Page 200 12689250 Sequence Listing.txt aacgagagtc gtgcgtgaga gatatgcaat cgagaccaaa gt ct t ct t cc gt gat aagat t at cagat cg ccgcggagaa gagt t t gt t t aggt t t t acc gagt at at t t agagt gt aca ct gt t agat t t t t aat t gat t t ggtcagaa gcccaagccc cagagat t ag at at t t ct ct gaaagat ct t gt t act t gca tttttaccag t at at t agt c t aaaat t caa t t gt t ggaaa gaaagaagaa t gaaaaagt c act acat gt t acgaat ga acaacaagac at ct t at t t a taacaaaaaa t gt gaat gt g ct gcat at t t ggt t t act ct aaaat aact g aaagaagaag act ggcaat g aaagcccaag t t ccat ct gg agt gt t at ac t t act ct t aa ct gagt t caa t at cggagaa at at t act ct t gat t t acat aaagt ct t ct t t t at t aaaa aagt aacaat t gat t t cat t t t t agt t agg gt cagt at t t cact cct t ca t at ct t cat a at ccaat aat aagttttttt ct agaaacaa aaaaaaaaaa aaagat cagc t t at t acat a t t t at t act c t t t ct t t gaa t gt t t aat ct t at t at t at t t t acat t t gt tttttttttt aat ccaat t t aaaaaat aaa t ct cct ct at 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1998 <210> <211> <212> <213> 184 2004 DNA Arabidopsis thal i ana <400> 184 atggaggcga aacggaacaa t aaacagct t tttttttttt t at gat at ca at at t t t t t t ct gaaggt ga gt ct ct aat g act aacaccc cat t aacat t gagat ggat t tagt t gt t t t t gt t caaat c t t t gaat aat tttttgaaaa caggcgtgac ct t accaggt cccat ct gt t cact ggggt t agct aaggt g t gaat at t ga t t t t gt at at aaaaat aat g acat t caggc gaaat gcaca t cgt ggt aac aaaacat t cg t gt t t t cttt t t aaggct at t t cct aaat a aat gat at aa at at caaaag cat gagcagt t gt ggat ggt gaaact t cac aat t gt t at t gtggacggaa ggt at at at a gt t at t gct g gcagccat gc t t gat ggat t t ct t act t t t gcagat ccag t gcgcct gcg agt ct ccgaa at t t t gt ccg at at aat aat t aggcgat ga ct t gcggt cc ggt t cgat ca at t gggagcc gct aagct t t aaact agt cc ccat t t aat c acggcgaaac t gt aaaagt a at at t gaaac acaaaagccc t gat cat agt t acaact cga at t t cgat t g gat agt cct a t ccat cat t g t gact t aaaa t t acggagat t t gcat at ct cggt cacggt t aat ct aat a t t ggagat ga ccggt acaga ttcgat t t t a caacccaatt gtggtggcaa t aaat ct t ct tttttttttt caacggt acg t ggacaagt g aaagt ct gat aaaaat gct c acaccgat gg gaacaggtt a aagt t gct t g t gat ct aaag at t gaaagt g at caggt at c ct aagaaaat caat t caaag caat ggagt a act aaact ct tatcagggag aact cat gca ct t t aat t aa tttttttttt t t gaaacat a t t t t acgt at agt gaagaat caacgt t cag t at t cat at c ct at t t t at a ct t accaaga ggagt t t t aa gat cacaaaa aaat cat t gt t t gat at t t a gct t t t gt ct agaat caaaa caaaaat t t c ggtcaggaac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 201 12689250 Sequence Listing.txt tgctagcaat attatatttc aaaacattca gatggataat gttaagaatc cgaccaagac at ct at ccaa t at t t t caga agcgcatcgg gt gct t gaca gataaaggag at caaacaca at act t gt gg at aggat t at t gt at t ct t a at aaggt t at ggcccat t t a aaccct aaaa gggt t t t ct c t t gaact act <210> 185 t act gcgaca t act cat cat aat ccgcggt aaaacgcaat gagt gaacat ct gt t ct gcc t acat at aga at at at at at agt at aggaa cat act aaat ggacct t at g agcccacttt at cct t at t g t t cct cacgc ct t cgacaat agagcaaat g cat t ggt aac ccaagt gaag aacgt t t aac taaaggagga t cagt gcaac t at at aaat a acat ggacaa t acat gt at a t gt t t at ct a gt at t at t t g ct gt t t aagt t agcagct gg aaagcaat ac ggcc cact acagag acat at t ct t aacgt ggt gt tgcagcaaga aaagcaact t t ccact t agt cagat agat c t aaat aacag aacaat gt t c aagt t at at g t t ct gat t aa taccaaaaca agcaat at at at cgcagt ct gt t cact gca ggat ct aat c accgggacat act at ccat g gcaccaat gc t act t agat c gat t acaaaa at at t gt ct a t t gagat t t g aat aaaggga cgt gt gacat aacccaacaa at cccaat t t cccact gat t cgat cat aat at ct gat caa at t t agt at a aagt ggcact ccaaggaat t t aat gt ggt t gct t agat ct t at act aaat act at gaat t t t gt t t t at t t acat t ct t t t agaat gt ca ggcct t t aga gccgcaact a cagcagct ct 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 185 ct ct aggggt ggat t t gaat aacaagatt g agt t gct caa cat t gat t ca tggggggcac cagaaaagac t gcaacaggt ct t t t ct cca gt cat caccc cccagt gat g acaacacat g aaact t t gat gaggcagttt ggagagtct t t gt at gaacg t t t t t aagt c caat caggca cgt gat cagg aaat t ggaag t ct gt acct c t t acacat gg gcccct at gt at gt at gt t c at gccgcaaa t t cat gagct cagaact ct g ct at cacct t cacat gaagc aggacgatga acaat ggt ca aggaagt aaa gt ccct act a aaccaggt at agat ggct t c gaagaacggg at ggct acaa ct ggaact gc t aat cggat c tggcgggtgg ggt t t ggact cacaaagcgg cagt ct gcga gat gcct cca gcaaagcgca ct t t gat t t g tgggcggcac at ct at ct t g gt t ct t gt cc aaact ct t t g cggat t gact t at t ccat ac agcgccgt at at cgcct agt t gagat agga ggct at ggga ggat caagca cct t ct gagt t cacct gt ct aat gacaggc ccat ggagt a ggt acaaaag aat ggcaagt ggt ct agccc ggt cct cct g at ggt ggat t gt gcagccac acgaat ggat aacagagagt gagcat t cag agt t caact t cgaagatgga cct cat cgt c cacagttttt cagcct cct a t ggaagct ga ccct cgagcc ct ggt gt ct c gt t t gt caat caagaggt ac cat t ccct ca caat gcggcc cgct aaat t t gagcaaat gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 agagccctca tcgagttctt ctatcagcat tggtggaaag cgaaaagagc cggagccacg Page 202 12689250 Sequence Listing.txt t t gggagt t t aggt t gat ga aagt gt ct t g ggttttttat gcttcttttt aaaaggt t ct t t at gct ct c aggaaggaag ccat t t t t ga aaat t gacaa caaacgt aat t t t gt at ccg t aaaacaact t gat t t agca cgt cgct cca acaat acgat gaaact aaag cggat ct at c accaaggct c <210> 186 cct ccat gga at t gat t gat ct gaact t t t t t t at t t cat aacct t gt t g agt ggt t t ga at t t ct gaac t ct ct cacgt accgcgcat a at t gt at cgc aacct t ggca acat caat aa aaat at agcc t act t t gat a cgt caagaac ccgt t agat c t gt acgcaac gagct gaat c cgat ct aaca ggt gagt gat tggtcaaagg ccttcttttt aacat at agg ggtttttct c t t aat ct gt t aact gtt cat gt t gt at ct t cat at t gct a ct t t aggcga agcat aat ca aagcat agca gt ccgat cgt at t t aaaagt aaaacct t gc t gacacgt t g t aaacagt t c at cat cat ca at ga ct acaaagt g at gat gacaa t t t t t at t t c aggat t agt t cat aaat aaa gcgt ccat t t ggaagaaagc t caat aaat a ct acggat ga gact t t agcc t cgt t gt aca t cct aat at c cccaat t agg cgt ccat cca t ccaat acaa gcaaat cat g acacat cact ct gagt aacc gtggagatgg gttcttgaat gt gt t t t gag ct cagt t t t c caaacat t gg t at at agt t t t cgt t ccat t agcgaaggag at gggaccaa gt gat gact a at caaccgat aagaaaaaca caat agt t at acgt aaacga cgccacaacc aact accacg acgaagct aa gaat cat t t c cgaaagcaaa gt aact t at a ccat t t t t ca t at at gt caa t aagat acag cgt cgat at g gt ct ct ggt g at agacat t a t gat ct t ccc taaagccagc tcat t t t act t t gt cacat g at at caccgt ccgaaat cca t ggct at ct c ccaat cagga ct at t at at a gactgagaga 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 186 act t t aat ca cacgaagcca ct t t cct t ct at cagat aga gt t t t t t t ct t gcct gt cga cagggt t t cc cct ct t cgt t ct t gggaaga gcct gaggt g ct ccat cct t agt caat ct t t t t t t ctt ct t aaagct t ca gcacgat cac t acaaat aga gct cgacaaa ccagt t t tag ttgggagcac t at t t t gcac agaaat t gag t t agccaat c ct ct t gat gt t cct t agct t gat t t t gct t ct gacaagat agagt t ct t c at gcct t t aa t t t t ct t t gc cagt ggat t c aact ccat t t agt gt agaga at ccacaaac at cacaggt c gct t ccaggt cct cat at t a aact t at ct t act t accct c ct cgt at t gc cact t t gt ca ct t t ct t t t a tt gt t t t t t c ct t t t gcct t t ct at gggt a t t cacct acc t t caact gca gcat ccaat g t t aagcat at at t at cagt t t t ct gaagt a gcct t gt ct t ccagcat cat t t agggt gat t gt agt t at a aat cat cat g t t t t cgt agt t ct at t at gc cagcaccaat ct aaat t gct agct caat ac cat t cacct t t at t agcaat t agt act cgt gaat gact t g ct ct gccaaa at acct t aag 120 180 240 300 360 420 480 540 600 660 Page 203 12689250 Sequence Listing.txt cacaact tcttgaccca tttgaaactg taagctagtt tagaagaagt gctc t aat t gacac t t t t t agttg t t gat gaaat at t gcaacat at t t acaat c t t t at t t t t g gagt t t t agt aagt at gt t a cat aaat t t t ct at t aaaac gact t t ct ac agat t gt t t t t t gat ct aat t t t t t gaact gaaggat cca t cccccggt c aaccgt t aaa ct ct ct t aaa at ct caaaaa t acgt t t t at t ct gat cgga ccaggttttt t gccccact t gat aaaagaa ct cact ccca caact ccaaa gt t t at t aat t at aaact aa gact t aaagt aaagt t t gt g agat t at t ga aaaaaaat ct at t t t ct t t c gagaaacttt at t t aat t at t ggt aat t aa t aagaaat t a agagagt t gc acgt cat cgt t aaat ct t ag ct t t t t ctct ct t cct t ct c t t t t ctctct t cat cact t t t at aagagca gacccacgac tggactgcga at ct acact t at agt gat t a ccacgagtt a at cat gt aga t t at t cat t a at t cat act t ct gat t gt t t acaaaat aaa agaagaaaat aat at t aaag aaaaagaaag act aaaccgg cgt cacaat c tt cat at ccc agat t t ct gc t t t ccgat t c at agct t ccg t cagt gt t ct at gc ccat t gcaat ttgt t gcttt t caaat at t t ttaaccaaac agt aagt t at aggaggtgt t at ct gt t gt t aagat t t gt a t t at aaagt c at gat t t aaa gt cat agaaa gt at t t aat t t t t ccagt t a caat at t t ca at ct ggt ct t acgcggagaa cccacaaccg aaaat t cgt c caat ct t ct c aagaat caat ct aat ct aat t aaaat at cc ct ccct ct ag caagat cgt c t aat cat aag caagt at t aa at t ggt t t at at t t aat caa aaaagt aat c agt aaat t t t t t at t at gt t gt cat gaat t at aat at t ca tccaaaagaa gat t t aat ac gaaat agt ga agagcgacaa aacccct t t c act t caat t t t cgt ct gct t ct agggt tag gggaat t t ca at gaacat ag aagat gaaca aaat t gaaag t at ccat gga t t aaat t t ca t t t at agagt gat t t caaaa aaaaat t t aa t aaaaat ct t t t gt gt t at t at t ggt t cat ttttttcaaa t agct ggaaa gat gt gagt t agat t cat aa cggacgcgt g at caaacct t gt t cgcccgt caat t t t cag t cct caaagg gt at t t caat cat gt t t t ct 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 187 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 187 tcttcaggca catcttcgag atctttgccg gaatgtgcac cataacaaca tccct t aat g gttttacgac aaactccaca agaaaaatca cggtgttggt ggcgggtgat cttaatgaca atgtagttac aaggaagaca agcgtaaact caaacatcac aaggtaaagg gattcgtcta tttaggtgag tgatagtgag aggtggtggc ttgcaaacag aacaatgata gtatatatct tcccacacgt ttttattcgt caagaataat gtccgtagct ccggaaataa gagaggaaat cgaggtagat taatacaagc ggatcatgag cgtcat t gag ggaaaaagag tgagaatgtg tttatcgtac aagtgaaatt tcaaaacgtt tctgacaaca Page 204 t gcacat t t g aatgtcgat g gagagacagg t ct at gaacc agacaaccca gctgtggct t gaggctgaag t t t gcat t t t 120 180 240 300 360 420 480 12689250 Sequence Listing.txt ccataatata tatctgatag atcgttatcg tcatcatcat catcatcgtc atcatcatca agat caacat cgacgacgaa gaatcagagg t ct aaggaga aagacgt gac t t t ccct cca aaacggcggc gt t ggt ggcc at gt t at gt c t t ccgcagt g t t t at cat t t t caact at t c acgaagggcc ttat t ct t ct aat gact at t agt gaat aag t t t t t t cat g at gaat caga t gaat at t ag aact t t at at ttgcacaaaa tttttttttt t aaaagt cac at at acgagc ttagcaaaac cat cat cat c gaagacgacg acgacgat t c t gagggt gag aacct ct at g agt gt t t gcc gt t ggagct t gt t gct gct g t agat cgcag aagccat at g t gt acaat cc gat gct t cca gagggccaat at t t aat t ga caat at aagg acgt t t ggt g at gcat t gt c aaaagaaaat t at t at at at aat t at t at a cgt caaaaac aaat t gcaga gt at at cgag t at acgt gag gcat aacaag at cat ct t ca acgaccacca t ccccat t ct tgggtgagaa gaat t ccaaa t t t gcat ccg gt gcggt t ca cgt gt t gccg ct t gt ccaag ggt t t gt gt t at t t t agact gt t ccagct g aat t t ggct a t aat at t t ag t aat aaat ga agaagt gt t g at t cat ct ac aaat t t t t t c t gt aat agt g tttttagaaa aat cgt at aa t gct t ct agg aagct t act a ct at at aagc at gg acat caaaaa aact cat cat t gaggt at ct aggtgaagag ttgcaaagag aagcat t cag act ggat cgg ccat t gccaa cggcggcgga t gt t t acaat t gaggct ct c tagt t t t gt t t at t t t cgat ct gt aaat at ct agat cat g acat acaaaa cat t cat cga aggt agat gg aaat t t t gt a t aaat gat aa aact t aaaaa accct agt t t at t gccaccg agcaagt gt a aaggact aaa cact gt caaa t agaggt gt c ggt gt t t cat gacaaaagt a agagt t gaga agaaagggca acct t t gcct acct gt t t aa t ggat aaaga acgt t gagac tttttaacgc t at acaaaga cat act aat a at cgt t ct ct t gaagat aat at t t at t t t a aat gt ccaat t aat at t t ct aat t at aaag ct t t t aat gt agat aacgt a gt at accgca aacct ct cac aaaacgagca agaacct t ct aaaat t agga ct gct gt ggt at agt acct t ct gact t agt aaggt gt t t a t acagat cga gggaagctt c agcagccaat t cat t t t gat gct t t agt t t caaccact cc t t at t t gaat t gaat gagt t t t t gt t cgt g t cacat at at at aaat t t at gt gat cat at at aggct t t c t t t t t t gt t t aat ct t aaca cgt gt cagaa gggcaagcat 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 188 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 188 ctcctgggtt ctcgactgcc cagaaatgga tgagaaactt gtgtctcaaa tgggagaggg cgtacacaat tttgagatat gttagcagct ctgagaatag taaggggcat ggttaggaaa aaggaaagct act atgataa gagacataga atggtaacct atgataagat gttgggtgag gctgaaacca tttgtaacag agaagacaat aggaagaatg ttaatcagcg gagcacatat Page 205 120 180 240 12689250 Sequence Listing.txt gcgtcagcgc tgcgaatgaa gtgtgaggag cttgtagtga tacaagatga tgt t t ct t ga ttaaccgaca at t gt t at aa cgt ct t cgt t agaaat ct t g tttacacaag gt cct t ccgt agggaact ca gggt aaact g aagct aagaa tgt t t t ggt g t t gaggat ca gtct t t t at g t t t gt t t t ag t gagaagt gg gt t gccgt cc acaccaaat c tccccaaaaa t cgagcat at cgaaaat gga t t ct ccgt ag ggt ct ct at t ccaat ct cct t ct t t t ctat ct agat at cg ggt t t t t t t g ct t agct gcg cgt t t ggat t aaagt t gat c at gt t gt aag aagagt at at aaagt ct gt g t ct t t ct act ggtatttttt gagagct aca t t gt gaaaaa ct t ggt gt t t aagaaaat ga gagagctct g ttgt t ct t t g tt ctt gcaga t t ctcgggaa gat ct gat gc tgagaacaca aaaaaacaga caaagcccat aaaaaaact t t t t t ct cgcg t t at aaggt g gaaaggggaa t ct ct ct t ca caat t gct t t t cagat ct gt aat t t ct at g gat cgact ga aaaact gt ga t at t gagct t ggat act aga agat gt t ct t ggagagt at a gaagct acga cat t at aaca gcagt t ccat t gaat t ccca gt t t t t acag gt t act t aat t cat gaaaac t aaaagaggt ggt t aaat t t t gaagaagat aact t t gt ac at at acgt t t tttttatcaa cgt t aggt gg ttgacccgcc aaagcagacg t cgccacgat acct t cgt ct gcaaat aaat t acat at ct c ct cct t t aat gt gat t gt t g t at gt aat ac gt caaaaat g t t cgt gaat t act t ggaagt at gg gaaagagcac t ct gggct at aaat t t ct ga t gt t t t gacc t t gat t gat t aagct at cag gt t t at caac tccaacacaa t aaagaagat aaaaat ct aa agt t gt gt t t aaact t cgag ccgagagaaa gt gt gt ct ca agt gaaaat t t cgt t t agat ttacaaaaca tggcagaccc t t ggcaat ac t t ggt t gacc t at cgaagcc gt aagct t ct gt t t t cgttt at ct ct gt t c gat gt agat c at t gact cag t t gt agct ag gat t t gt gat ct t ct ccaaa ct act gct gt caaagaaggt t gt ggct act caaagat at t caat t gagt a t t acggaaat cat aat t t gt gact t cagt t t gaat cct at t t t atgcagg t aggagt t gt act agt ct ga agagagat t g gt aaaat gcc t t gact aat t aaacat cggc t gat t cgt t a cct gagaaat at caaaaact cact ggt t t c cct ccct ct t ct ccaat cgt t t agat ct gt ggat gcgt t a t t t acgaaat at t at gaat g t t at t at t ac t gat accaaa at at gaagt a t caaaat gac at gt at t ct t gat aaaaat a gct gaacgaa t cgcat ggt g t at caat cac t ct t ct cgca act gaact ag caacagtt gt gct acaat t t ttagaaaaca aaagt t ct ag agaaacacat t ct t aaaat t gt t t at gt cg ccaat ct gt g cact at cgt g aaat aaaaat acaacgct ct ggagct gat c t t at ct t ct t t at t t gt gag t t cgcgt t t t t t gt agat ct t gt t cagt t a t ggt t gcat g agaagcagat 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 189 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 189 ctaaactcta ttccagcaaa attatttcaa aattagatat atgggagaat ttttacaggc Page 206 12689250 Sequence Listing.txt cat t ct agat cgacgtcgt t caaat ct ct t gcaaggcaca at t t t t cct g aacggat aaa gct ct gt caa cacaaagt ac at t t t t cccg gt gt cagcac gt cat at t ca t at t t t t aat aaact aaaat t at t aaacat aaat t ct aat gggat gaat c t ct ggagcat at t t at t t t t t at t aaaaac t ggat t t t ga ttggt t t t at agt at t t t t c t aaaaccat a t aaat at aat gact t t aat g t at ccat gt t t cat t aaaaa t t acat t t gt t gt ct aaggt t t aat at aga cgt aaacgt a cat caccat c t ctt ct ccgg t t t cat t aac t gcaat t gaa ct t ct t t t ct aagt t cat gc t t t t t gt t t c t cacat ct t c cgaaagtt ag aaagt t t at g t t ct ggt t aa t cagcagct t ggaccgat cc t at cat aaac cat caaaacg t t t aggtttt t t t at t t t gg t gt at t t cgt gt t acat t t t at t t ggct gg cat aact t ct t ccat ct cat t t at act t ag t at t t t t t t t t ggt ct aat t aaat ct gct t t at accgaca aagagat t cc aaaat acaac taaagcaaac ct at gt caga t aaaat at ct t ccat cgagt t at ct act ct caagaaaaga gct gt gat t t acaaaaat cc t ct t cgaat t tttaaagcac aat t gcaaac t aat t gct ct t gat t t t at c tttaaagcac act aagt t t c at t t aacgt g aacaat at t g gacat aaat a aaaaagaaaa aggct t aaat act ct gat t c t accgt aat g tttctttccg t cct act caa gccact ct gt ctgatttttt t t acat at at t aat t t agat at agaat gaa gaagcat at t t accct at ga accat aacat t ggacagct g t t at t agaac aat cggat t a agt t cgt cca cact t gt aat t t act ct ct t tgc at ccgt t aat ccaat t caga cact cct gt a t gcaaat ggc gacgtcgt t t gt t t at t cag t gaagt caac cacaaat t gc t at t at t t aa at act ct t t a agggt t t t ac t aat at ggaa aagaaaaaat ct t t aaaaaa at agct t at g agagt t cgat tcaacaacaa gacaaat ct t tttttttttt agct caacaa gat t at cgaa t t t t gt gaat aact t caacg gt t at t at t t t t t agat gt t at ct aat t at gct cgt ccca gt t cat gt gt gct t at t aag aat t aaact a at ct t t at aa ct ct t ct cac aagtacgatt aagagtaaaa aataagaaac t t t t gaaat c gaagt cat ga t gct ct t aaa ttgatgt t gt acgt aatt ga gaagtcgtga t at ggat at t accgagtaac t ccaagt aaa agatcacaaa tgggt t caac aat cagaact t cgct t at gt act ct ct t ac ct t t aat at g ct gccgacat ttttgtaacc t t t actt gca ct agt at ct c t cat t t acag aat ccat aca agt t ggattt gat t t t t ccc t t gcat t gt a ttgt t t ctta gagaagt t gg t aaact at ac ttttcataac ccaaagtctt at caat t at t cct agat cct aaagt gaat t t t t at at ggc cggt ct cat t gact aat t t g t gat t t t aat t t t aaat gac gt ggagcgt a aact agt gag at t t t agt t t t act gat t aa t ct cat gagt gaaaaacgaa agt t at gct a t t gt t acgat gt aaaacaaa cacat aat ca at t aact gat cat t t t ct at t t t at aat t a t agaaaact g act t at t ggc gacgat ct ct at t ct t aat a at aaat at t a cgt ccaccaa t gt cgacat g t at at cat t g tgccacgtgg ccaacacat t cat agt t ct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 Page 207 <210> <211> <212> <213> 12689250 Sequence Listing.txt 190 2005 DNA Arabidopsis thal i ana <400> 190 t gt gt gat gg gt ggct at gg t t acccaaat at t t gcct t g t agccggaat ctt ccgacaa gat t ccaaac t gagat at ca t t acaggact aggt aaagt a t caagaaact gggagcggga agt t t ct aat caaccatttt gct t gt aaac act act agt c acaccaaaat ct t t at t aaa acaccat cac aaat aaat t t at ggt ccat g gt caaaat cc accaacgaca t ct t cat t aa at cagagct c aaacaaacag tctct t t ct t ct ct ct act t ct gt t cat t t t gacgct cgg at t gat cgaa gt t ggat t t g aacat acct a aat ct gact a aaat t t t cag gt t t t t cttt acccat ct t c t t gt acaaac at t gggcgac t at aggt ct t ct t gcacgat cgt t gaat gt t aat ggcgct t at t ct t t ac tttctttaaa t aat gt ct t a acat at acac t aaact aaat aaaagt t gat caaccaacat at t t t at t t t t ggt at ccaa gggt cgggt a acggggt t gg aagat caacg cggcaacaat agagcgagt c ct ct at cgct cctt caacag t t gct t t ggt acgt t gt t t c t t agat t ct g t cat at gt t t ggct t ccgga t agt agt aca gaggagaagg t t cgccct t g gaaaggt at a aacact t at g t gat at ggt t ggcct at t ga ct t gcaat gt gcaggagaaa t t t cct cgt c t gat ct ct t c gcct t aaaat t at at at act aaagaatttt tt cct acaaa t aaat gaat t ct t t at at aa cgaaaaaat c aggat ccggt t t gt cgggt a t gagat t aca gt ct aaaaac at aaaccgaa acgcgccgag ttcat t t t t c gt t t ggt t t c agct aat gt a ct t agat t t g t aact ggaaa t ct ct caggc ttcgt t attc agaagcgt t g caaaggcaaa gagcaactgg tttttctgac ct at ct agaa gcgaat t t gt cggt t caaac gaagcct t ga cccgaact ga cat gct gct t t ct t atgttt t t caat aaaa aaggatgt t t act t cct aga caaacat aag aaagagt t t c tgaaaagaaa t aaaccact c t cggat at at t t act ct t ct t t agt t gt ct at gagaat ac gagat gt gaa t t t t cct cac t cct ccagat t ccgat ccat at cact cgt t t t t agct t aa at t t t ggaat agt ggt act t t gat gacat t t t gt t t aaac agacct gcac t ggt gt t at a ct agt aaact gatctct t t t gcaat gt agt aat ct t acca t gat t t at t a ggaacgtgca t cggcct t ca gt at t ct t ca gaagat at at t t at t ggct g gt t t t t at t t gcaaat tt ag aaat gcggaa act aat t aaa ct cct t gat t aaccaagat c ct ct caccaa acat ct cacc t gt agat cat gcacgagaga tt gtt ct caa ct cgat t cca cgt cact ct c t cgcgat t t g t t gct caaat cact ct gcag t t t gct at gg gt aagact at tgt t t t ctt c ccaaagctt c t ccct t ct t a ct cat t t act aact t agt aa ccgcacgct g t ct t t gt t ca agt t at agga cgggatcct t gct cggt ct c gct gcaacca agagaaaaac ct aat ct gct t at ct t caga ctt ct ccaaa at t aaaat aa tccaaagaaa t gat gccagc t aat t acgat ccacacccat gt ccat t cat aaaacacgt g agagcgggca t aaat ct ct c at ct ct ct cc t t cgt t agat aaact agat c ct gat t agaa aaaat gt gt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 208 12689250 Sequence Listing.txt ggtggaagtg aaactataca tttttttttt ttgaaatttg ttgcttaatt tttcaaattc gatcggaaat tgatggaatt agatct t gaa actggaatct tcgcttgtgt agtgttgatt ttgtttttgt gtatgtgtag aaatg 1920 1980 2005 <210> <211> <212> <213> 191 2004 DNA Arabidopsis thal i ana <400> 191 agccagtgga at aat ccat c caaaat ct ca t agaat gacg cgt t caat gt t aacat t aac accgt cct at t t aggaat t t at gt aggt t t t t aagat t t c t t gact gaac tt ctt gaaaa gaat cgt at g ct at at t at a t t cct t acac ct t t t gct at ct cct ccat g accat aaat c agaagcagtt acggaat at g t t acat t t ca t gct aat at t t gt t t t t t t c t cggt t t t gg t t t ggagcgt ct aaggt aac t t cct cct t t t cgt cacgcg at gt acaaca t t at ct gcgc act t at caaa t cat at agga at aat t t tag t t aaggccgt aggaaat cat cat cgcggcc agat caagac at caaaact t caaaacaaaa cat t t t at aa aaaat gaat a caaacaaaaa t t aat t t gt t gat aaaat aa agaaaaagat agacaacat g ccaaaagt ca t aaact tt ag cgggt aat t t t gt aaaagaa cat t t ggt ga agaaaaaaaa acgt ggt ct g ct aat t acca at gt caat at at t t ct ccga t agt cat at t t t t gaat gt g cgt aagagaa aaaagggaaa acat gt t ct t t t ccat at at ct t t t ggat t ccct t t t cca caaaat at ct t at at gcacc t t t t act t at accaaccat c aaaaaagt ct ct t t t agt aa agat at ccca atggaagaga gacggtgct g t caat at at c gt t agt t t at at agt aaaaa t aaggct t t a t act ct t at t ttgacggaca t ct t gcat t t t aact aacca cgact t gaac cat ct t cat c tttttatacg ct ct t t aaag aaaaacggaa at t cacat at aat t t t t t ct ccaatt at ct t gagt t t cca acccgt t gt g t ct t ct acaa aat t t t cat t aaacaaaagt t t cccact gc t ct t at t t cc t at t at caat ct t gaggaag acagctgtgt gt t gcat t ca gt aaaaacca gt t t ccat at t gaaat t t t t ttgaccacaa t at t t t ct t a ccaaccaat g cat gat t act t t t t ct t aat ct t t ccact t aacct t t cgt agaaat t aca t t t t t t at t c aaaaaaaaga acggt at aaa aaat caacgg at at at gt aa gaat aaggcc taggcccaaa aat t ggcat t t gt cacacgt gacat cagaa t t t cccaagt cagt t gagca gt gt t t at ct aaaat at ct t at ct t t gaga aaaat ct t t c t at gct t ct a cact cat gac aaaggt aaaa t t agaggat a at gt t gacat acagaacat a tacgagggac t gcccaat aa gt gagt cagc gcct caacct gagaaaaat a aggcact acc aagt aaaat t t t t gt aat ag t t ct gaact a t gt at at act cat t t t caaa t aaat t t cat cgt aat t at c t gt aggct t c at agt at ct c ct t ct t t gt a at t t ccct cc gact t t t at t ggaagagt t a ccaagt ct t g t at acat aaa t t t at aaat t t caacacat a cct gt at t t t ct t t t ct t t t t act aact t t cagaaaat at agcact aagt t ggt ccct ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 ttaccaattc tgatatacca cccaaaatta acatcaaacc caatatttaa agtcccacac Page 209 gt gagaacga at t aat ccgc ggagt cgaag agat ccat t c tcggaaaaag gagct t t t gg ct gacgt cgg gaaaat t cac at aagcggag ct gggcaaac caaat cgt gg t t cgt cgt ca 12689250 Sequence cacgct t t ag ccacgt aat t t aaat at aaa acaaacccat agagagagag agacagagag aaaggttggt gtttctctaa gaagagattc atcttctctc at gg Li st i ng. t xt t cggt ct cca aaaat t caat aat at aagga ggat aaggaa agat t caaaa at ccgat t cc tctcaaagct tttttcaaat t gt gcgt t ca t cggat ct cg 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 192 2004 DNA Arabi dopsi s t hal i ana <400> 192 gt cct t caat gt gat ccat t aggt t aacaa acagt ggcgg cgaaat caaa tgcaaccaag at t ccaaacc t t gacggt gg t at cagct ac at gccat aaa gat cagccga aagct gt agc at ggaccat g at t ggaat ca gaaact t gac aat act gt gc aaacagt t ct gt t t ct t gt a gct t cacaga gaagaat gt g act cggagct aagt act gt t tcaaagaaaa t accgt t t cc gaaaaat at g t ggt t ct t t g caat gcacag t t ct acggga cacat t ct ca t ggt ccacaa ct cagct gca ggct ggt t cc at cagct aag gt ggct ct ct ggt t ct t t ca cgact gcaca gt t cat at ca gt t t aaggac aat at gagat acgt at agaa t gggat cgcg gact cagaac t gt t t t t t gt accat t ct gg gaagacct aa caccgcct aa aaaacaaaaa t ct ct cat cc gct gggggca gt t ggt t t cg aat gat act t acaact act c t caggaggt a t ct t cgagt g ccacct acgc accaccgagc aacaagggca t gggct gt ga acaagggctt aaggt at gt t t t ct cat ct c t ct t gat gaa t t aggt t ct t agt t t t agt g t cat cgt agc aaaaccaaca t cact t t gaa ttcaacgagc ggat ggt t ct ccaaaat ggc accct caaat at ggaccat g at ct aaat gg gat caaaat c t t gggaat t t aaat caacga gct t caat gc caaacgat ga cacagaccga t t gat ggt t t t t gat aacca t t ct t at gga cgt gt t acaa aagaaaggca agat cct t ag at t gat act g at ggaat ct t t ct t gct t t g t agt t gat t g t ct t cgaaac aacgt gat t g act gt t at ac aaagat t gat t ggt t gat cc at ct at aat c cgt gt gt gt a gt ct ggt ct t at caggaagc t caaggagt g ct t t gggggt t t caaacgt t t ccat t agga agat t gggga gccaccgcct gagacaaggt t agagct ggt agaagt t gga cat t t t t ct t t t t acgaagt at aagagt t t aagat agat t t aggaaaat a gt t gt t at gc t t gt t ct gat caggt act t g gagagcat gg cccgggaat a t t cat at at a ct t cacat t g t t t t t t t t ct cct t cgaat a aaaccaggaa t cgaacct t a t t ccaggct t gat t t t ggt g at gt t ct ct a t t t gaaagt t ccaccgggt g cagt at gct g gat gaagct g gagt at aaga t ggat gcat c ct ct agt t gt t caggt at at acgagt t aga t accacaact t ccat agat g t gt cagt t at at cacgacaa agaaat acaa gaat cgct ag cggat act ac ct t t gct caa gagat t t at t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 210 ggtt cgt ct a t ggt gcattt at gt t t gat c t cgcaaagat cat act att a at cacat t gt at gt ggatt a aat t at t at t at t t t gcgcc cacagcaaca t aat gcggca at at gt t gt t at cgat gt ca gaacct t act aatt accaga gat acgt t t t gcctt caat t ct aat at aaa t at aaaggca aacagagaaa 12689250 Sequence gct t ct t gt a gt aagt gggt gat t t t ct t a cacact aat t t at gaaat gc caaagatgac ttgagacgct gcctgcgtga cgctcgtgga cacaacactt gtattcagag agccaaaaaa tgttgtaacc gtcatgtat c gtaat t catt tcactcacac tttcagctcc accgtaggaa at gt <210> 193 <211> 2012 <212> DNA <213> Arabidopsis thaliana <400> 193 ttaaagaaga aaatctctct gataaataaa atgggagata ttttaaatgt atgtgacagg gaacaactca tttagtcacg tgttgaatat ttggaattat tgttttactt gtcaagtgat ttattgtttt atagtgtcac actgtcacga at t aaaaggc caatcaat t a aactaaatgc ctatcttagc gattcggccc aaaaagacaa at ccat ccaa t gatt ccaat taatgagaag gaacaaaaaa tcaactactg gaatttgttt aacataaggt ttatatctat aaacacaaag gt at agt caa tt att aaaag tt aaaat agt ttgacatcat tatctgtttc tttgagataa aaatgatttg gaccctacct attttattca ttaatatata aatatatagt acatgat t gc at at at t at a t gt agt at aa ct at acacgt cttttaactg aatttccaat agttgtttat atatttcacc ataaaacaac aaaatatggt tactgaatat tccgtaattt acgatgcctg gtaaacagaa ttacaaggaa atttt aaat t at gctagtta tcaaacaaaa ggacaaatat t t att cgat t gacat agaca gt acaggcca cact t t gt t c gt aact aagc aagt gcct at ctt gcctt gt t t at gt t cag t gct t ctt at aaat t gt act gaaacaaat a gct aat t t ag t ct gt ct t t c act aat at at at gaagtt ga acat aat t t a t t ccacgt ac at aat t t gga gagt t t aaga at t t t t cgga Li st i ng. txt cat gagt gt a ttttttgagt cgt aaat t t c t ggacgt gag caat accgaa aggct t acaa aaggaacat a t t t at t acca act t t ct ct t tgct t ct t t t ct ct t t ggt t aaat act gt a aacccaat t c aggt t cgt gg aact cgacct gt aaaat gat at t t aagaat aaat t t ggt t aat gt acagt at t gct ccat gaact t gt ct t t t gcgt t ca at at at at at at t t t at aaa gaaagt t gca acat acacat cct aat aaag aaact agt aa t t at t t t t aa tttttttgga t t at cat gaa gcat cgt aac acgcat aacg aact aaact a t t t t gtcgt t aaact at t ag t aaaacat t t gaaagaaacc t t t t t t at gt gagaagat at ttttaaacaa cccaact aat t ct cggccca at gaagccca t t cact t t gt t ccagt t agt agaaaaacaa tccgaaacga gaat gacat a caat cct gt a caat t t gat c at at at at at at aaaat t t t accat aaaat aacacat acc t t agt t aaga t t gat t t gga taact t t t t t 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 gtttgtcacg aaacgacaaa agctttgtca taaaaaccaa atcctatcaa cttgctcaaa Page 211 12689250 Sequence Listing.txt aagct t caat t t t acaat t t aaat t acat t t gt gcct at g aat at ct gt t t gt t t t t aat aaaat caat t t ct gact aaa aat t t acaat cat caagat a ct aaaat ct c agat at aacc aacat t t aca aat at cgt ac gt gt cct gca cacgcagat t cat aaat gt g t t t ct at gct t agcct t at g caaaagt t gt t t aaaacat t ct gcgt t ggc gcct aat cac gacgt t aggt t t t t t aat ct at agat ct ac agggt at aat t t t ct gaaga cccaaaagt a ccct caagaa cat aacagca t gt at agt t t t t cat aat t c t act cct gaa at t aat agat aaaaat t t gt t at t t act gc agt aaat gaa agt aaacaac aaacaacaaa t caat t agt t t cgccgccat t aaggt ct t a t t cat t at at at t t at aat a tgcaaacaac act agat ct a t t agact at t aact aact at t at aaaagt c cat cgt act c gcaaagcgtt t ccat t t t cg gt at ggt cct ct t t t caat aat t t at t t aaat at t gcaaaag agggaaat ag t t t gt ct acc t aat aaact a t ggt agt acc gct t cat ct t gaagagagaa ccgct aagat ct cct aat ca t cat t t at t t t t aaat gt t a t at t t agt t g act t t at gaa agcagacaac at aaaat cct gt aaat agt a t t at t t t act gaagaagcaa t ct gt t t t cg 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2012 <210> 194 <211> 2004 <212> DNA <213> Arabi dopsi s tha i ana <400> 194 aaaat agt ca agagact gca agccc! t aaaaaagat aat cagt gt c ct t cc( aaaactggaa at gct t gcag at aca aaacaaat ga cagaagaaca t aaaa( at gct t gat a aagt at ct aa acaga gagcatgtag aacaggtctt tttttl ctgcatcatc acttttaagt cattc gagat ggcat t agagt at gc aacac tatctccaac acttagatcc caaca~ gtcttcaggc tcatgttgta aaaca cagtgttttt tgggatattt taaga( atttaacact atgtttggtt agttt aaacaaaaca atcaagaagc atgaa~ acagatttgt cataatcgat aatagl aaaat at aat tgtaaaaaaa gt gt g tatggataat ggatcttttc ttttct gtagcacaac aataaataaa actcc! gtgaa caacaagagc gaaaacgcag tttgagcagg cat t t cccc caaag aat c :tttt t gct aaagt aat cc at t t caaac caaca aact t t gt c cgt t t aaaa gcgt a t at gct ccaa gat gcaacgg agaaaacgtt acaaaagaaa gcat caat ca gaat at at gt ggagt aagt g aact t cat ag gcat ct ct ct act t gaaaag aat gt gt cga t t at t aaaat gat gat gacg t t t t t t ct ct cccat t at t c agaaacaatt ccct gaaaaa gggcagt gt a acggagagga cat t gaaaga ct t ct cagca ggt ggt t t ca aagaacccag at ct t cat ag at t agt caat tggaaaaaag agaacgt gat agt t t t ct ca aaact t t t t c ct accgat t t caaaaat gaa aat t acat ac aaagat aacc cgaggt t ct a agacat agt t aaagct agct t aagt t t ct a ggt ggact ac agaaaat at a t ccaact t t t t ct t t t ct t t aagacat aaa aat aat t aat cct ct t gaga t act t gt aat t aat gagagg aat aaaaaac cgcgt aaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 212 12689250 Sequence Listing.txt aaaataaaat aaaaaaacaa acgaagagca gtgagtattg ccaaaaggcg t acat at t aa gaaat t aaaa gagt at at at at aaacaaaa t ct t cct ct t gt t t at t at c at cat t t t t a t t ct at t gat gt t t t act ag agt t t at gct t gat ct at t t cct at gt t ga cgat t gat t a aaact t t gat agt aagt t gt at t t gt t aca t t gt gt aat a t gat at at at act at gat ct gt cgct t t ct cct cat ct cg t t ct cct t ct t t ct cct t ct t t t t t agggt at cgat ggt t act t t t agt t gaaaat gt t t agat ct cat c t t t agt t t ag t gaat t cgaa t t gat t caga ggt t acaat g at aat aat aa gt at at t at a at ct t cct ga agccacgt ga t at ct ct aac t t t t cccat c t ct t ct gct g t at t at t gat t t act t t agt ttttttttgg tctct t t ct t ct t t t t t t gg gagt gat t aa aagccccttt agaaat at aa at gt at gat gt ct t t at ct aaaca t ct acagaga t ct t t cgt cg t t t t gt cgaa accat agaaa ttcat t t ct c act gaagat g t t act agt gt t t gaat t t t t at t cat at at aaat t gaat c aat acgat ct t t t at aat t t t t gt act gat aaat t t t at g t at at at at a gact ccacaa act t t t cttc gt t ct t t t ga aggcagagac caggt act at at gat aggt t t t acacgat c gt t t at t gt t gat cct t t ct t gt t gat aat gat t at gt gt agggt t t gat tagt t t t gt t ct t t gt t aat t gt aagaaat t at at aaat a agaaacgcaa ttct t ct t ct t gaaact agg ct t t t t ctt c acgct t ct t c t at t cat agg t aat t t cat g t at aaat cgt at at t t ggt t t t t t at t at c t t at t act t a gattttttt t t gt gt at t t g 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 195 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 195 caaccatgag ttttttaaaa atagcaacaa taccaaaaca gaaatgatcg agttcttcga ggaagaagta gaggtgtgag agatgat at c taactttgtt ttttagggct acagtaacta tttgcagcca cgcgaatttc aaaacggtat gaagcgacaa ttcacgaatg tcaaatcgaa ttatcgtcaa aataaccagg tgatttactt aggt agt at a tct t ctgccc ccact t gcct gtcaact t aa cgacctgcca ttctgtagcg acaattaacc aatgtcaaat cgaagtgtta tcaaaataac caggtgattt acttagtctt tatatcttct gcccccactt gcctattcgt ttaacgacct gccattctgt agcgtatagc acccgaccaa t gt t gt t gga t t gccat ct t aagat at t t c gat t t ccct c gt ggt aagcg agt ct t t gct at t cgt aat c t at agcat aa agt gt t ct t t t gct t t acac aat ct t gacg at aagcacac aagaacat t c t acgt agt gt t ct t ct t t t a t agt aacagt ccaacacccg t t ct t t at t t t t acacat t t t t gacgt cat gcacacaaac at t t gt at t g at t t t cgt ga t cat t gacca aaacct t gct at t aat t gca gt gct t t at a t agaaaaggt t gct t aacca agggaggcgt gt at t gcat g t cgt gaaccg tgaccaacgt ct t gct t t ac cat gt t at cg accgaggtag acgt gt caac t t acacaat t 120 180 240 300 360 420 480 540 600 660 720 780 840 aaccaatgtc aaatcgaagt gttaagtgtt ctttatttgt attgcatgtt atcgtcaaaa Page 213 12689250 Sequence Listing.txt t aaccat gt g agt at at ct t act t at cgac t gct acact a cct ct gacgc t t at t gt cac t ccgat at ga t t accaaat c t gaat aat ca at act ccat t gaaat aaat c cgt t cagt ac t aaat agt t c at acgt t gt a ggaat at ggt acgt t at at a gt cat t cagt t ggat t t t at acat aagaaa cgt t aaat ct at t t act t ag ct gcccccac ct gccat t t t aagaaaaggg cgcat t aaat at ccgggaat t t at t gt gat gt agt act t c at ccagt aca t caat act aa ct t t cat agt aaact t agaa at gct t t at t at cat t gccc tttgccgccc cccaactttt aaat t aaccc ct at aact at aacaaaccaa t t ct ccgat c tct t t gcttt t t gcct at t c gt agcgt at a t t t acacat a gcgt ct gat a t t t t ct aat a t t t aaaacat t gct ggt cat at act caccc t t t t t t t gt t acat gct gct agt t gt agt a t agt agt aac t acaat ct ct at ccgt agag ctgggagaag t t aaaaaaat caagaagaag t cact t cact at gc acacatttcc gtgaaccgag gtatagattc gt aat ct t ga gcat aagcac caacaat at a cgt at ct cca at t t t ctct t cacat gt t gt agat t caagg gaaaaagt t a t t t t aat gat gcagt t gt aa at gaaagct c t gcat aagt a agct cccct a gct at at cgg gt cat t ct ct gcaaaat aat aaaacact cg ct ct ct aat c cgt cat t gac acaaacct t g t t acgt t gga t ct ct ct t t g atgt t t t ct t t t ccgt agt a taaaagccca ccagcccaat t at t t t t t t a at t gcat gt a t aaat t gaga at gcat gct t cat ct acaag at at t t cgca t aat aaggat ttgt t t atgt gct at at at a aaaaagcttt caacgt gt ca t gt t aaaaag agct gt at at ct cgcgct t t t ccaagt t gc ct t gt gct gt t t cacct t gt at cgt ggt at aaaaaaaaat t t t t at gaaa t ct acact ca ttttttttt g t ct gct gt cg t ggt aaaaaa gt caaat t t g gacaaagaaa t acgt acaac t aacct cagc 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 196 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 196 aatattttat cgaattgggc cttaaaataa tgcagcgaat attctagctc tacaatgttc agactatata taactcattt tgcctcccat aactttcatc atgtagaaaa cttgtctttc gagt t aggca agatgtgt t a gcgacagtgt aacaagtggt tatgtacaaa tttatgcgaa tgagtagcaa gagatggaga aagagaactt tttatcaatt tttgaagtaa ttggctacat tatatcaatt catatacttg atatatatat ttttgtttag aaaatgtgtt gtgtatatct t aat aagt ga t t cggt at aa gggt cat aag t t t ct t t aat t t t cgaat gt t agat gat aa t gct ct at t a cgaat aat t t t at aaaaaat t caacat gag Page 21L agact caat a t t agagt t aa caagcaaaaa acat at t t ga gat gagaaat agt agat aca aaaaat gt gt at gt act t at aaaat cct t a agaat gcat c acact aaaaa aaat t t gt gg aaat ggagca aggt ct aaaa aat ct t t aag acaaaat gat t t gagat cat at at agt t t c aaact acaat aaagt ggt gt 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt t aacaa at at at t agg gttttgtgcc gatcaaactc ttttaatcca aatc aaaaat at t t at at caaaag ct t agat t t t t ct t t t cgga t aat gcact t at at t gt gaa at ccgaaat c gccat cat ca t t catcgagg ggt t t gat cc at aat gt t at t t aacaaaat gt t caaat ct t t t t aat aac ct at aggt ga aaaaaaat t a t aaaat aat a t t t aat aat c ttgcgaacga ttgcaaaaaa t aacat aat a ttcaacagaa t cacaagt aa ttttttgcaa tcacccacag gcaaaccgt a aaat aat aca aaccaat t gt cat t gcat t c t ct at t cacg acaatt t gt a aaagct ct ga gat cgt aacc t gt t gt at t c cact ct gct c at t t t ccaac ct at at at aa ct t t t gagat ct t agagaat at aagt aaaa at gt acagat ct t t t aat at at at t at t t g ataagggaag t t agggt t ac taagaaaaca cat aacct t t t at aaccacg t aagt at gt a t t act agagt gct gcaagat t gggt t t ct t t gaact agt t gagcaagcaa gacgaggcgt gct cgcgct g gat t cat gt t aat t ggt t aa at t at t t t gc acat acggt t cgcagacct g tttttggcac gccgtttttt t t at at gat t cacagt at t a t gt t t t t t t t cccaat aat t atgtgcggac at gg cat at t aaat t gcat agat t t at aagt t at ggat t ggaac ttat t gt t ga cat ct t ct t a t gt t t at cac gggct cat t c gggtggaaga t t t gt caagt t aaact at at gt aat ct t t g gt t t t t gttt t gcaaaat ct cat t t at t t a aaaat ccaat tt gtt aaaag t gcaaaat ct at ct acgt t t at gaagccca t gt agcaaat t ct at at aaa acat gt aggt t t at at aat c t aaact acgt caat aacct a t at t gcat ac ggaaat t agg cact gat t t c t cat cgcaat at t gagcgt g t t t at gt t t a at at t aat cg aat agt t caa aaaggct caa at acacgt ct act t acat ga t t t t at aact aaggct caaa at acacgt gc gt t t gacat a at at t t t at t at t ct agggt t ct t t t t t gc aacaaccct c caat act ct g t t t gggcgt g cgt t t gagca t aat act aaa cgat aaat at t t t att agca at t at ct t t g t aat ct at t c t t cagact t g caat t ct gaa gt t t gt aaag t at gccat gc aacaacgt ag aagagat at a t t gagt t aca aat t gaacct acgacgt aat aagagt t at a aat aat t aag t aat t gggcc t agaat at t c ct cct acgaa 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 197 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 197 ttggctatct gatttcttta ttat t aaccg ccttataatc ttagtcgaaa aggaatattc tcattcgaat ttaaatcttc tacccaattc caatatttct gttcaataat ctctcgaatc tgccaaacta agtttaaagc aatcatctta atattttccc aagacataat ccgaaaagaa aatacgtctt tcccatatgc ataattcatt t t gat t t t at ccaat ct t t t t aagt aat ca at t aaact gt at at t t ccct t at t t t cgac caccaat at t Page 21E t t t t at ccga t t gaat t at c aat at t t t aa t t ct t aacat at at t t at gt ct agccat t c gcat at t ct t acagat cat a t t gt gt t at a caat at t cct gt t ccat at t gat t t agaca t act ct t at a cct cat acga 120 180 240 300 360 420 12689250 Sequence Listing.txt aacaaaaaaa agaagtaaaa tgatattgtt ggctaatcct tttccattcc ccccccccac cccaccccac caggt t aagt t t t gaacct t gt t gatcaag aaaat cgat t t gt aaaaat a ccat t caaat tgct t t t gcg t t gt at acac aat t cgt t ct agt at cct at aact aaat gg at t gt t t t t t at cgat acaa t t t aactt cg gt at t at at c gat act aact t cct aagt t t at at at t gat cat ct t act a t t t aaggt aa tccacgtgt g t ggacact t a gactt aat gt ggt t atgtgt cgttgcagca ccccaccccc agt t t caggt caagggacaa catgccaaga tttacaaagg t aaaat t gt c gcat t t gat g agaat at aat ct t aaat at t t t cgt t ct gt gcaacccat c t cacaat gt t t ct cgt gat c act cat t t t g t aaat t at at aataagcgaa t agt t at t t a ct aat t t aga gt att aacct t at agt t t t c tggtggcaaa aacat t t aat cgagaagaga agtt aaagag t agat ct t t c aaacgccggc tt gaaat ct c ttggaggcaa tagt t t ggca at ct at t gat t aact caact gagtgaagt t t t t t ccaagt tttatcaaag t t t at gcagt aggagacaac atggaagttc ttttttttta aaaggtaagt at act at at t aaat t aacat at t gat agac tt aaagact g aaaaaaatac aaaaat t aaa ct t t at t ggc cgt t atggcc tcgcaagaaa acaaagtgag ct ct ct aaat t t t t ggagt g atgg t caagt ct t a t t ct ccat aa t gt act ct t t ggt at cgt t g gaact at act t at t t aaaat cat t gt t gat ccact acaat at t t t cgtta ttgcaccaaa ttgt t gccac aagtgt t t ga agat gt aaac t aaat gat ac ttacagacaa aaat t aaaaa at t ct at ct t cacaatcgt t actatgtgt g atgt t t t gt t actgacagag agaaaat at a act gt gacag t ccgcat t t g t gcat t acga caaggaatgc caact ggtgt aggtaaagt t t t t gt t t gat ct t t t caact at acgt acaa acaacgt t gg t at t t t t gt t gaat aacat t gat t gaagat tgaggtatta agat ct at ga cagt t t gt ac caat aat aaa acacaatatt at t aat ccgt agatgcagcc cggtgaacca acgt gt at ct aaat t at gag actt agaacg t ggaaacgt a agt agat at a aat cgat ggc agacgt t t t a at ggcat at t t t gt aat agt t at t gagat t t cgct t t gcg at t ct at t at at act at t ag at gcact ggc aaat ct at aa aggaggaaca gat at t gat g act at gaaaa at at gt t t at t gt ct ct cca t t gat aaact agt caaat t a gcaaaccat g aagcaagcca t cagt ct cgt t at at at t t c t t t gt gt gat gt cgacgact aacaat aaaa gat t gt t gt a gaat t t t t ag agcgct t aag 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 198 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 198 gaaatcaggg ttctgagcca aagcaagtcc aataaacttt ccataggcca ttcctgctgc cccgccaata aagagagaag gagcatagta tccaccgaca agtccagaag cccgacacca tgcggttgca gctatcttga ccgctaccag ctgaagcaaa agatcagctg aaagaccctt Page 216 12689250 Sequence Listing.txt cacaaatgga cgtttctcca acaaaatatc cacattctga aaaccccagt agggt at acc t ggt at ccca agat aat gcc t gaaaat cag agaat ggcaa aggt t cagag cat agaagt t ggat t caact caaagt cct c at at cagat g gccagt ct ga t t caacact t aacagt ct t a t ccagt agat gat agt cgga cgaagct cca gt t gaat aga aagct cct ga cgccggt aaa gacgacat t g gacgggagat t gaagt gacg ggct gat ct t ccaat aacaa tggagaagca gat aaat aaa caagt t t cat t t cct t t t ct cgct t at t t t gt agggt t t t aaagct at ga gcat cct t gt aacgagacca t gaccagct c t gcat at t ac ccgagaccga gt gt t t ggaa gcaaagaagc t t aacgaat c aacaaaccag ggact t t t at ggacct t ccg aggaaaggac tttccagcag acaaggat aa cgat caggaa accacact aa gacggct gct cgcgggaaaa aact gaaaag cggagggcag cgt gt t t at c at t at t caaa t aaat t t gat gt ggct gt t g t aat t t t t ct gagcgt t t gc t act aaat t c cgt t gaat ct agt cat act t t accaacact t aagact gt c agccacacag t cat aagt t t ct ccaggaga tttcggaaac gt gaagt t ga at ccagccac aat caagagt aggaaat gcc tgaacagaga gccccagcga gcaacact gc at t ct cgaag cacgcaacca t cccat ccca ct ccagt gag gat caaat cc t t ct accgcc gaacgt cggt cacagagt gg cat t aacgga cgcat gggct t aaaaat aaa t ccct cgcag ct t gt cgt ca caccacaaca t ct t aggt ag ct t gt t t aca aat g t aat ccaccc aacagcagat agcgcccaat ggt t t ct ggt gcggaagt ca cacagaagca t gaat cagt t agct gcat t g t ccat acgca agcagct gag at t cacacct at t t ccagt c ct t t acgcga ct gat t gagg at t ggaaccg ggaaaagt ct aacaccaacc aacct cat cc aat agcgacg t t t gt gaat t aagcgt ggct t aat t t gat c t gggcct t ga aggcct ct t a t t gat ct t ct t at t aaaaaa t t gt ct t t t c t t gt cgat t c cct t t t cct g at t acaggaa gt cat ggat g aaaagat aga t ccct t gcgt t agt caggaa gt aacagcac gat gaagaag aaccct gt aa agagagaaca ccagcggcaa t t agcgat t g ccaagcgt ca t cgagact ag at gct cacca at cggt gcct cgaagcaagt aaacaagccg t gat cagt ct gaaccgaaaa ggagcgaat c gccat gaacg t t aaat caca ggt aaacagg gct t t aggt t t ct caat cca aagaaagatt gggt t aat t t gat t t ct t t g aaaacaaaat at aat act t c at acagcct t at gt acat cg gt ggaagt t c aaaaagcat a cct t aaacgc t aagaat aac gccacaaaac caacat at aa acat t agcaa gaagt gagaa acgct ccaat cacat gcggc aat gagaat c ccaaaccgcc ct ct aagcca gaacacagtt acgct at cgc cct gct t cgc acggt gaaag tccgggaaga gct gct t ccg aat ct t acgt t t t at at ggg caaact t caa aaat ccagct at ct gt at t t ct t ct t ct t c t ct t cct ct t t gt gt t t t t t 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 199 2004 DNA Arabi dopsi s t hal i ana Page 217 12689250 Sequence Listing.txt <400> 199 at caacaaga gct aacct t t gccct caagc aaacaccaag at cacaaaat at caaggat c act t cat aac aaaagaggcc t aagaaaat g gt gt agt t t a at t t aact gc ggaat ct gga tttttttttt cgagggt aaa aagacat t cc t ccacgt ggc cacacgt gt a gagt cacat a ccgacccaag ct t ct t caca t caagcgt aa gccat t gt t c ct gact aaat agacacaaga caaaagacac ct t aaat t t a t t acaat gt t gt gat ct t at cgt ccgat gg at gat gat t g t ccact at aa ccaggt t cgt agat t cat t a acaaat gcat gacat t ccag at ct ct act t acacat agct cat at t t gt t gggat t aat g aagagt aaaa gaat t caaat gaaat gcgga ct cggagat t caggaacggt ct agct ggct ttttgagaga at cggt aat t tccacagaga gt t ct ct caa cagt t gagat t gt gccct ct at at agaaga at t at cgt aa gt aaat gat t gat t t gct t t t t t at t ct gt gaagaagcaa gt t t gt at gt cgct cact t t gcaagcccat gct cat t t t c acgat t at gg t ggt aaaaag aact t gt t gc gt ct cgat t t ct gagct t cg cat ggt t t ca gaacaaagcc gct gct ggt a t cacaaaat c gacat t act t aact ct agac at at at ct t t gt t at cagct gaggaggagc t ct t ct cct t caccgt t acc caaccatttt gagagagagt aagaaagatt t act t ggacc t ggat t t ggg t caact gcaa caaact ct at acct t t gaaa aat at t agac at t gt t t agt ggt t t ggt ca aat aat t t ac at at gt t t ct t t ct t acgcc ct at t t gggt at t at ct t t t at t t t t t t ct taaaaggccc gcccat t at a ggcgt aagt t t gcaaact at ct t t ct gggt acaaat gt ga acgt t ct gca at aat ct gcc at at t t gat c ccacat ccaa t aat caat ac gaagaat caa gct gct aaag aaat t gt t t a gt gt t ccggt gt gaaact cg tccggcggcg ggt aaggaaa gaggct agct agcct t t aga at ct cgcacg ttaaacgacc t at gt agt at aat ggt agaa at gt agagca t t cgt gacca aagat t gt at t t t ggaat gg tacgccaaaa aat accat t a t t t acat gat aagt at ct t t aat gaaat t t at t t at t caa gacccaact a agggt t t t ga ct ct cgaaat t t ct at ct aa caagt agt ag cct cat t t ag agt t cat cca acat t gt ct t t ct t cacaaa aagaaagatt caagat t caa aaat gct aaa at t gaaact g cccaaacaac gaaacct gga agcagcgaga gagagagact t gggaaat t t act acgact g agagacgt ct t gat t ggagg ccgt gacat g aaaaact gt a gccagt t cag ggct t gt t gt at t ggt t t gt gct t t caagt caaaaccaaa gacacgt t t g cgt t aaaaat gat t t t ggt c t t gt gcaacc t t cgt cact t t ccaaagct t agt at ct t ag gaat cagcgg cgt cct acat aaat t t caca t t t cagt aaa ccggacct ga at cat at t t g agagacaaaa ct aaat ct ca gat agcaaac aagaagacga caagacagtt aat cat cgaa t ccct t cgt c t t gt at t t t t gaagagacgg t t gt ggt agt ct gct aat t a agt cat cat a acat ggt t at t t agat acaa ct aaagct t t t ct ct t at t g t t t t t ggt t a caaaccaaag gaaaggt cat t at ggt aaga aaact agt aa aaaggcct at aaaagaat ca at ggct t at g act gggt t ag ggt t t cagt t ct t cacct ct t t ct t t ct t c gat t t cgt gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 gtgaagaatc atttcaagcc atgt Page 218 12689250 Sequence Listing.txt <210> <211> <212> <213> 200 2004 DNA Arabidopsis thal i ana <400> 200 aat t t t t t ga aaaaaacaaa accat at gga aat t gt at t t t acat t t at a aat t at t t ac t aaat t t t ac aacgt aat t c t t t t acggt t aat t t t ccag agt gagaaaa at cggt gaaa tttttcgtca tcgaaaaaag tttctttaaa t gggct t gcc caact t ggcc t t t gacacat at at at acat gaggaggatt t t t ggct aaa cat t gt caac t ct agt agat tt ctt ct t t t cat t gcggcg act t t at act at at agt caa ttttaaaaaa t caaaat t t a t at aat t at t t aat t ct gaa t ggt ct aat t caacgggct g t t at aat t t t agt t t t t act gat t t t aaag ggt t t t gacg t gcaaat t t t ttgacgggaa ccaaaat cgt cacaat t ct c aaaat aat t t aaat cgt gat acaagt ct t c t t t at gat t t at at ggacat cat aaaaaaa ct agaaacaa t t t gat acaa at ct t gt t ga caaagt at at aat agagacc t cat ct acaa act t gt t t cg t t agt t aat t ct at gt ccaa t t t t t acaat t aaat agt t a aaagt t t aat ct gcat t ct t ct t t agacca gggcat at ga cccat gggca aat t acaat t t t t ct gaaaa gaaagat agt ggaaaaat ac ccaccaaaat aacgcaatt c gat t t t at t a cgat t t t acg t acgat t t t a t t t at t gt at gat t t ct aca t t gat t t at c ggt ct t t at t ct gt ccgt ac ccgt cagcag cct acacaaa t agt gccat c tatctct t t t t cct at gt cc at gt aaacat t t t cagt t cg aaagt at t t t aaaaaacaaa t t t acat at t aact act cat aat at caaaa t t aacgt gaa aat gcat at g ccagt t t t gt t gaat t gagt acaact aaat t t t t aaat t t at t ccacaat aat t ct ccga cgt gat t t t a tctgat t t t a tat t ggcgaa gt t t t agcgg ccgt t t t ggc t ggt aagaaa at t t t t acgg aat aat aaca gggcatggac ggacacgccc t agccat t cg aacaaagaca aaaaacaaga cat at acct g t t t gtat t ca ggt t gt ct t c t t ct t gat cc gt gcaaaaaa at aat t at at gt aaacgt aa at caagaat t aaaat t at at at gt aaagt t gacgagct at ccacggat aa caat agacat aat t t t t aaa t aat gaat t a t t t ct ggaat t t t t acgat t t t at at t ggc cggt t t t ggc aaacacaatt t t aaat cgt t gaaaaaagca aacacaattt aaaaat gcaa at aat aaaat at t t t t cgga aat ggt ct t t t t aagagt t t aagagagaaa gt t agt t at t t caat t acat ct ct t ggct t gccgcgaggt t gcaagat cg aat t t ccgt c gaaat t t aaa aaacat t t gt t t gat t t gat tgtgtat t t t gcagccccag at t ct ggacc t at gggct t g gt ct t cat t t t aat at at t t t t t ct t at t a acgcaatttt ttgacggaaa gagaacacaa gggaaaacac t t acagt t t t at t t t t t t gt at t ccat aat t accat t t t g t t t ct caat t aggct aacca ccaaaccgt c ggacggacca cagaat t ccg at ccaact at cacct gaagg ct t gacgct g t ct t aagt ac cgt t t gcat g ggcgtct t ct caat t t t gca at aat acgt a t t t t gaaat t at at t at t t a t t t aat t t t a ggagtat t t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 Page 219 agagtttttt t gact at t t t t t t ct ct aca t ct t ct ct ga cgaagat aat t t t t t gggt t gatcacacag gaagaagaag 12689250 Sequence Listing.txt tgtaatctta acttattatt tcaagggtat tttgggagat ctgtcttgaa gacaaagaaa gctttcttct ataatacat c aagcaaaaat tccatctccg atttcggaag agagt t gttc at ga 1860 1920 1980 2004 <210> <211> <212> (7 <213> 201 2004 DNA Arabidopsis thal i ana <400> 201 t cagagt cag gt t ccgt t ac t t t t gacct a aaaaccacac t t gt aat t t a t gt at caaat att aaacaaa t gt gt caat t at t at t at cg t cat caaat c gcat caaat t agt ct at t at cgt ct t t gt c gat ct gccat gt t ggt gt gt ccat t aggcc t gt t ct ccac at gt at at at gct t t gaat g t t caacgagt t gt gt t t acc cat t at aaga cat ct aggag t t aagcat ca ct gt cat act aaacaaactt t gt cct t cat cat cagcat c at gat ct cat at t t ct ct ca ct ct aat aac gaacacaaaa aaat ct acaa ct aat cacaa agat agat cc t cgcgt t t aa cct aaagcca gaat gaaaat gaaaaggct a t t t t t ggttt gt aat gagag caat aacaat t gggcct t at t ccct t gt ga gt t t agt t ca t ggt agaat g at ggat t t t a aaat t t agaa ttaggacaaa t gacct t cac tttttgatcg at t gt at cca ct t cct agcc t act t cgt t t cact gt t cct aat cagt aac acacaaaat c t ct t caaat t at gt caat gt t t agt aaaat gat t gcgat t t gt t aat gt g agact aagt g t at acaaaag aaagct t cct agat gagaaa gat t gt t gt g aat t t cgaat tgtgt t t t ca aaagaaat ag t at aagt t ac ggat t act ag aact act t t t at agt agt ac aat cact act t t aaagat t t ccacaact at t cacaat t at at t aggt t ca at cat at ct t agct ct t t cg agat t t t aac t t caaat t t c t cat t gt aac aat ct act t c ct ggaagact at aat t t at a aat at ccaga at act caaaa aggaaagat c t gaact caga gagagaagaa t ggaat t at a ct gat t ct ct at t cgact ag t att gtt gt t t gat ct t acc t gat at aaaa t aat t aagt t t t t t t at t t a ttgt t gt t gg gat at t aaac gt ct ct cagt aggt aat t t a gaaact cagg gt aact ct gc t t gt gat gca accaaat ct t at t t t agt t a ct act caaat gat t at t gaa aaaacacgat at caacgaac gat t t t at gg caccaaaaaa tgagagagaa gat ct gat ca gaagaagaag aagaaggaaa t t t t t at t ct ct t t act t aa ttgccgacaa at gcgt acca cggt aaaaaa aaccaaagaa ctgt t t t t gt aagggt cacc aagat ccagt t t ggctagag ct gt ct t ggt gaagt ct at g t t ct gt t gga acct aaacac caaat t t t ca ct ct aacaac aact gaat ca t t ct aagat t t aagt t agag gat aaagat a acaacgaaga aacgcaagt g gaaccat ct c t cggt gagct aagaat caac aagaaaaggc t ct t t t gtt t gagggaggt g ccat t t acat at act aagt a ggt t t agct t aaagt t t tag aat t aat aat t aat gggt ag tttcgcaaaa at gt agt aaa tttaaaaaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 tgtaatgttg agcatcccaa agtttgtgtg tcccaatgct agagcatatt atagtct t ac Page 220 ct ct agt gag t t t ct t t ct t aaat at t cac agcccgt t aa acccggagt c ccccttt ct t gaaaaaaaaa t t cat aaaga cct cat gt gg gat accgt t a t aaccgaat c agacggagct cgat ct cagc at caaggaag 12689250 Sequence agtgttgtat gatcatatat ttgattattc aaaattatac aaagattacc aaagaatggg gacccacccg acaaaacaac ccaaagattt ggcttgtctc ttaaatcact cgacgaagaa at ga Li st i ng. t xt t t t gt t gagt aaaagaggaa ccaaat cat a ccgaat cgt t t ggctt gaga gaaat cagaa aagagaaat a aat cactt aa at at t aat aa t t aat at cca acaaagaaaa aagagt caga 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 202 2004 DNA Arabidopsis thal i ana <400> 202 ggagt caact caat ggct cc gaacaagagt t t t ct t t t aa tcggaagaaa agaaagat t a agaaaat cat t ggctt cttt cggctgcggt cgt t ggagag t at t t t ct t a ttct t t t t gg tttttttgca at t t gt t t t c at gt t cgct t t ct gt gt at c t t acat aaaa aaaact at ac aaaaat acat t aacgt t t t t gct accaat c t caaacat gt tgt t gacaag at ct gt ct ct at gatt ct t a t t t t t t ctt g t gt gggtt ct caat gaaaat caaaagaaaa t ct t t t ccag t ggagt t t t t ggcaaaagaa t gt cat t gaa t t at t cgagg gat t at gct a t at gagat ct cgt cgt aat t t cat gt t agt t t t t ggcat t gat t at cat a aaaat t t agt gaaagt t gt a gt gat t agt a aaagccgat t cct t ct t t ga t ct acacat g cccct cct ct aagat caaaa ggcat at ccc aaaact ccat att gct gaag gat cgaaagt ctt aat gcag ctt agagaac gagaagt ct a t ggcct gaag ttttcaaaca tt gcct gttt t gggt gt t t g t t gt gt gt ca t t t ct ct cat tttacaaaaa aaagt t t t aa t gt aaaat t t t aaaaagct g at t gt t aaag t gt at acaac aact t at t t t ggt t aact t t cttt gcacac tcaaccacac aagt t t cct t t aaaat t gat aagaagaaga t gaagatt gt acggt gcaaa tagaggcaga gccggagatt aagt at agt t t t t t at at t a t tat tttacc aaat gt ggca t cat aaat aa gt gt aagtt a t gtt aaaaga t t aaaaaat g t acgaaat aa aaaat aat at t t at acacct t ct t t cct ct gat at cgt t c t at at t t t aa acacat t at a agaagagaaa t t t ct t t t ct cat aat ggga aaaat t t gaa t ct aaggaga caaaggagga aagat t ggcc atgtcgaagg ctct t t attt caagaagaaa aacct t t t t c t t ggt gt at t agt t cat t gt t t aat at gt t aacacaaat c at gggt ct aa t acat aat t a t ct t t t gtaa t aat act aca gat aat gaca gaggagaat a t at ccat ct c t at at t acca aaaggaagaa tt catt ctt c aact gt t t gg gaagt gat ga gat gagt t ag gacgccaagt ggagaagct g tggaaaccgt t at t t t at t t aaaaaggaat cgagattttt accacaaaca t t cgagt gaa cat gt t ct t c gt ccgat at c t t agt aat ac t gt at t gt aa agt ct t t aat t cagt at agg ccaaccaat c at at t t t t t a aat aaggat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 Page 221 12689250 Sequence Listing.txt ctttgtggta gattgtagtg aatgaaattg acgaaaatga agaagacaat acaat gat ga t t at t t t ct g at t at agct t t aagat t cat t aggt gt at t t ggaaaat t g acat t t t acc gt t t ct aacc caacagat ct act ct ct gca <210> 203 gct caat ct t t ggt caaaaa t t cct aaaca at gt t t t gct at t t t gt cat gagt agact t aaacacat ga cgaat ccgat cat cat ct cc aaaaaacaaa ggt ct caact t aaaagct ga t acat t ggga ct t aat ct t c gt aaat t t aa cat gaagt cg ct cgt gat aa at aacccgac at t ct ccaca aat g ct caagt act act at ct ct t caagaaact t cat at at ggt cacaaaat ca t aaat ct ct g ttctttctgc ccgaaacacg ttttccaagg aaagat gat t t ct t aat t t c t at gt t gaac ct t at ggt t a gcaaaaat aa act aagccca ct act t at gc t gggct at aa ct t cagt t t c ggt gct t aca t at ct at cat ct t t ct at gt aat at t gt t a t gat ct t gaa aaagagagt t cgt aacacat t t t gggtagg aaacccgacc act t cat cac 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 203 t at aaat cac taaaaagggc t t at cgaacg t gat ct gt ca agagat at ag t caagaat ga t at t t gat t t t at gaaaacg ct act t at t c t t at t gt gat t gcaaat t gt gt t t t at aca t t t gt cgat c at at t t aaaa gt gat at t aa t t t ggact t c aat t at acaa t gt ct gccac ct ct t caact gt gt t t ggag t gt t cgat ga aat gt gt aag gacct agt gg t cact t ggcc at t at at gaa ct t ct t t cca gat gaaat ga aggt aat gt g ct accat gt g agagt at t ct tgaaaacaga caaaacggaa tgtctat t t t t agt t gt tag cacat at gca gt t gcact at aaaaat t ct t t t acgt caaa aat aaat gat cct caagggt at t acacat g ct gagt t gt c agt acgcaac ttagaagaaa t at t t t t aaa t at at gaaaa t gat gt at aa tacacaagca at gagt aat g aaagat t t gt aagt at ggag aaaat aat t a ggact at t ac tttttgacag gcgcccaaag aaaagt t acc at cacat at t t agagat aaa cggaat t ct t t t cat act t a at t t gacaca caaat gggca aat aat t aca agt t aagt aa aagt acat gg t gaaaaat ga accgaaacaa t t t t t t atgg caat gcct t a agcgt t ggt c t gt at t t gat t t agagaaat t act t t t caa at t t t ggaac t t aat gt t t a gat t t gt aat t act cgt aca t t gat cgt t g t t ggt t acaa agt caccaat aat ggct gat aaat ct t at t aaagt t t gaa gagaat aaaa aggaat t aaa at t t t gt t t a acaat gt t t g gagat ccaca t gt gcaagaa aaat t aat aa t t gggat at g gaaaccct at t at cgacgt a t gt aaat cat gaaagaaat a agagaaaat t gat ct t ggaa aggt t t t t gc ttaacagaag caat gaagaa t aaat gt t ga t cgat at t t a aagat agaag aagaagat t a t gt cccct ag aat at cat t a t gcaaccgt c at t cat gaaa at aact aat t gagt agt at t t at gaaacca aaacat t t t a t t t t ctt t cc aaaaaat t ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 gtacaaatta atagctagat aatagtcgat ttctacacac acaaaaaggg ttatatcctt Page 222 12689250 Sequence Listing.txt agt ggat aag caaact aat t t at gt at at a gat aagaat a act t caaaac at agct agag tttttttttt aat at agacc ct gt gact cc cat at aat gg t cat aaaagt tagt t t t gt a gt t ct t cttc cat t agagag aattatttca atataaaaaa t caaat t t at gct aaagaca aat aaact aa caacaat gga aat agt agat tttttgatga aat gat t gt c t cct cacaac t gt at at t ga gt t t ct ct ag acagagagat tt ctt ct t t t acagct at cg t t t gggt at t aagt agat t t t at agggt t t t t t caaaact gcct acacaa caacct t t at t t at ct t t ag aaat aacat t t gcat gcat g aaaaaacgac agat agaaag cat aaat t t t at gg at t cgt cgt t aat at acat g gt acacacaa gt at acagaa at at act gt a acacacat at t t t aggt t at t gagaat agc t aaat t aaat t at cat at gt aaaat t cgca ct ct t gaaga cggagtggt c gt t caactac gtcaaaatca aaaaacgagt caaaat ggac t t at aact t t t t cact at t c cat gcct t ca at t at t t aag at at ggt acc gt t atgt t t g agagagagat gagaaaat cc t ct aaccaaa ct at t t at t g aaaacaaaag t aaaaat at t ct t cct t ct a agat t gaaac t t t t aggttt t aggcaat at cat ccaccca t at caccat a ct at aaagct aaaccaaaag at ct act t t t t t t ct t ggat 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 204 2000 DNA Arabidopsis thal i ana <400> 204 gt gat aaaaa cat t t t aaat t gaat gcat t t t at ct t gat at at ct aaga cat t at at aa t t t agt t t t t t cggt t cggg t ccggat t aa aaaaaaat gg aat at acagt agggt t t t ga gcagt gcct c ttaaacgccc aaaagat cca at t t cagaaa t at aat at t a tacaaaaaac t caaaaaagt gt at at gat t at at at at t a at at t gt agc cggggttttt t cggt t cgga accgattttt act t acaaaa t acaat t aca gt t t t gacaa t ccaat ct ag gt t t gt t at c t aaaaat aat tt at agagac t at aaaaat g cat gt aaat a t accat at aa agt t at aact cccat t aaga cct gggagt t ggat t t t t t a cat cggt t t t t t t ct aat t c at gaaat ct a aat ccacaac aaaaaat at g t gat ct acca cat gaagt aa ggat aat ct c agaggaagag t t at aat gct aagaat aaaa t cat at aagt agaaaagaaa aat aaat gaa cggat at at c at t t ct agcc t t cggt t cgg ggt t aat t t g at aaaaaaca acaat t at gg t acgact aca ttccgacgac gaaaaaacat agat t t ggaa ctt agagagc t t t t t t gt at t acgt t at aa aagcat at aa acaaaacaaa agagagaat a ggt t t aat cg gaat t gaacc t t at t gggt a aaccaaat aa aacaacaggc gt caggt cca act ccaaact gt aat gccct gagatggaag gt gt aaagcc aaagaat caa gacacat gt c ct aaaaaaac at cat acacc gt t gaat at t aaat ccacag gt t cggt t t a gaacaaaaat t aaat t aaat accacaaat t ccaat acaca t t acaaaat t gcgacgtcga acgagacct g t gt aaaccat t aact ct aaa acaaaacaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 223 12689250 Sequence Listing. ttaccc taagaaagtt gaggttgt tgat t aaaga gaaataagaa gagt tgagcaaagg t gat gaat t c at caaagt t t att aggtt aa accgat t ggt ggccgatcgg tcggt t cggt accaaaaact t aaaaat aaa t actt at cct gaaaaaaaat cggatcaaag aact att gaa caaaat t t ga t t at caaaat cgct t ct cca t ct t cgt t t c t t t t t t gtaa atgaaggaga ct t t gt t gt t gact cat caa t ccgat t acc tt cgat ct cc t t t ctcgggt agtt agat at tgt t caaaaa acggtgattt at at at aaaa t t aat at t t c aacaaaattt aacaat t t cc tagggat t gt aaat t t t ct t cat aaccat g gaccaactga agagacgagt ttcgcaagag tt aacctt at gaaccgaccc tt aatt cggt t gaaccgt t a at t at t t at t acat gat ct t ttggt t cttt t cat at at t a caaaagaaaa aaaat caat a t aat t t ccct t t t at at at a caaagaacaa gat t agagag t gat gat t ac gcagagagcc atatggctgc gaaccgactt tgctgccgaa tcccaggcct at gacat at t aaagctgaca t act t t t t t t t t aat aat ga atagaacaga ggat t gattt aaact t gcga aagagagacg atctcccaaa aatgaaag t aat t caa aaaaacga agatt ct c ccgaat t c aaccaaaa aaaat at t aat at t aa aagct gat gacaatt a gat aaat a gt cct aat ctattttt ct t t t t aa cat ct ctt t ct aaaat txt gg gttgaatct c ag gcttggaaga t c gaagaaaaat tag aagaaacct a gg tatgggtgaa tt aaattaattt itt ttcgtttttt gt atgactcttg ict t aat at aaaa tt gat t gactaa t g gagctggat g ica acgaat t aaa tt cattaattt c ct tttagaaaaa ica at cgaccgac ta tttcattcat ct ttctcttctc 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 205 <211> 480 <212> DNA <213> Arabidopsis thaliana <400> 205 aagttataaa ccacagaaag aagtcaacta agaatctacc aaataagatg actgtctttc tatgaatgaa aacagctttg ctatttctag atagtttcat gccaagtttg tcactaaatt tttaacattt tcgcataaga taataacata tacttatatt aatcaaaagt agcaataaag tgaagaaaaa aagtctaat a ttttaagtcc taaagttgga aaaatggtaa agaagcactc tgtatgtgtg ggccgaattt gagaaaccag tgacaccaat tcacacaagt atttgtgcct actcatttcc tccaaacgtc agaaaaaaag gt cacacat a ggt caaaat a tttttaagaa aagt gt agaa aaaaaaaaaa aaaaacctca aaacatagca gaat t t cttt agataggaaa tattggcaag ct aat t t gcc at t aat caaa aaaaaaat t g t ct t t caat c gcagccatgg 120 180 240 300 360 420 480 <210> 206 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 206 gtattttaca aagtctccat attctataca ttcaatagat attcacctaa atatagtttg Page 224 12689250 Sequence Listing.txt aat t atgtgt cgtctagtgc aataataat ct aaacataa aacaagagag t gaagaagt g gt gt ggct ct t ct t act t t c caaacctttt tttgccaaaa caact gcgga ccgcacaat g t ggat t cgaa t at ccaact a aaacat aggt at t t t at t t c at t gaact ca aggat accaa t at gat t t t g at ct aaaacc ct ccaat ct t t t t t aaat t a t agat aacat t ct t cgt caa tttaagaaga t t gat t at t a agagagaaga t caaagat t t act t gat t gg aaat aaat aa gacgaaggag t ct t cct t gc t gaaacat ct t at t t gt t at t agcaat agt gat t at ct gg ccaaaagaaa <210> 207 ccccgaggat ct ccct t t t a t t at at t aca caaagt caac aat t cgt gat t aaagt acaa cggt t cgat c t caaaat t ct t cat cct t t t t gcat gt at g gact aat gt t at t at at agt gct t t t agga at gat t t tag gt at aat t t g t caaaat t aa t t agt t t aat aaaat t t gt a t aaacaaat t t t agt cagt g ttat t t t gaa agggcaaat a t t cacgt t gc gat aaact aa at aaaat aaa t ccaaaacct aaat t t act t t t t t at agt a t gt t at t aac acat at t aca t gaaagat ga agaggaaaaa t cggcgt t t g t ct cct t at c at at ccct cc t ccgt ct ccc ggat cct aat t gcagt t ct a accgt t cgga cat ct t t caa t t cat t ct ga cat gact t ac t t t aagt gct t t t ggtaaga gat gt at t gg gaaagt t gat gat t t ct at a t aagaat caa aacact aaat aat t t t acac tagtaagcgg gt aagagagt cacgacaaaa t gcaaaggt a t ct t at ct gc at aacaaaat aaaaagaaaa ccct caggag ct t cgt t t t c cacat at at g aaaaaaaat a t ggt cat at a gat t gt gagt at gg gt cgt gt ggt t ct cct ct t c ct aat t ccat accccact ct ct t cgt ggat at agt aacga acct t t t t t g at t aact at t at t t t acaga t aat act at t t tcat ct t t a aat t aact at aaaat gat ct acgaattttt t t t t t aact a at t gacaaac t t t at t at t a aaat cact t a gt t at acggc ct t gat ct ct caaact aat g acacaaat gc cacgcggcca aaact act t a agaagaagat ct t ccgcct t t ccgct t gt t ttgtct t at t acat t t gt t a t gcat at ggc t t gt aagt t t ct t t gt t gca cct t t t ccac t at at at t t t ct cat t at at cagaat agt a aaat at ct ac gaaacaact t agaccaagaa aaaaaat gt g at cat ggt t c aacaaaat ct gaaagat gat gagt t t t agt at aaaat ct t aaat gt t t t t cgt t t t caat aaat t at t aa aaat act aga gggt acact c gat t aaaaaa t ggaaaaact at t gt gagcc agat t ccaat caaaaacat c at at t aaaga at t cagaggt t gt gt ct at g ggaaat at ga t t gt t caccg t acat gt t t g t t gacagt gt cgagagatag aaaacaggt t ct ct t at ct t gt t t t at ccc aaat at at at acgt gcagt t at acact gca t cct act aag cct ct t ggt t gt ggct aagt caattttttt gagcctt gt a gt ct agt caa t gagt t t t t a t t caaat t t c ct caaat ct t aacaat aaat aat ccat aat t t aaat acac accaagagt t aaagagt gat gaaaaat at a at acgt aagc aaggcaagct at gt aat t t a ggt t t gt gaa t aaact t ct c at ct t gcct a t cggct aaca gagt t gat ag agt t at at t c agaggaggat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 Page 225 <211> <212> <213> 12689250 Sequence Listing.txt 2002 DNA Arabidopsis thal i ana <400> 207 gat ggt t cac gt cggt act a cat t ggat t t aacaacgt gg t t t t ggaggt agcaaat ggc agagct t aag caccagaccg cgt t t t t t t a cgt t cggt t c tct t t gattg at gggaaat g cgaccgt agc t ccact t caa ggaaggaaat t at ggaaaca agaaat caaa t ggacat t t a at at t t t cgg at aat t acga act ct gt gag t gat ct acca aagt gaaat t t t gt t at aat t gt aaaacac gcggagatgt at at gcaat t aaat t t t aat at agt t ggcc at aaaccaaa aaaaaagagt aaggt ggt gc gt t ggt acag at t t caat ga tggtacaagg ct t gt ggt t g aacgacaagg ggct t ggcaa gcct t t agag t t t at aaaat t t gaat ccga gt t cat gt aa ggggagtgt g caaagaaat a cagccacagt tcgaagacaa gct at ggt t g tgtatgagca at at t aat t a aat t t t t t aa at gt at t t aa ggat t agagt t caat t t gt c t t gt ggt aga ttttttttaa cat aat cat a ct t t t t ccat ttttgttaga t gt at gaaaa aaaaat at ac caaaacat at aaaaaaccaa at cct t ct gt gggat aacat t cacat t gt t gact t ct at t t t t t ct ggca t agat t t gaa tagaagaaga gt at acact a t at agaaact ct t t t ggt gt t at t t t t at t gat gt t ggag cggt cacat a t t cgat ct ct at t gat gcat at at gt gt at t ct at ggt t t aat gcaagct agt agagat a t t t caaat t a cgt aaaaat g ggacacact t ct at cat t t t aaat t at gt c aaggat aaaa ccacccgaca aaaaacggt g at aaat acac t att ccaaaa aat aacaaaa aaccagat ct caagct acca at ggt ct gga ggacat ct t t t gt t ggt t gt tcgctgggaa tggagaagaa t gt ccagaat caaaat t agc gaaat cat t c agat at t t gg t t gt aat t gc t gat agt at g gcat ct ggcg aacgat gct t at at t t t agg gt t t atcttt ct agt t ct ac gat aagt cct t at t cat at a aaagt t agat accacaagat at ct t at aag caagt agaag at t act t aga t aaaat t at c t t agcccat t cat aaaaaat aagat t acaa t t agagt tag t aat t gaagt ct caaagagt cggaccaacg ct ct at ct aa t acat aaaaa at ggt t gcaa cacaaaaccg acccat aat c t acaccact a ggt gt caagt ggaacggttt ct cgat t cgg agagat at t t cggtccagcg at cggcgaat tcaaggccaa t aat ct ct gt cagt t t t cga act t gt t ggt at aat t acat t at at gaggg t ct cat aat t gt aat t t t t t t t t t aaat t t t agaat t t aa gcccaccaga at cccaccga gct t t act ag aat at t t t aa gt t ggccaaa ct gt aacgt a gagt t t t aaa ct t t gt acat gat gt t ct at t t at at ccac gat acaacat gt gt ct t gat gt gaagt gga cat ct gcagc t t cgt t at gg ccagt t at ct gacacggttt t t aagt t aaa gagt cat t ga act ct t caga cat cct ct t t agaacat t at cagt at gt ac t gt at aaagt gt t t ct acat agct gcccat at t agat t ct ct cct t act t t t t aggt cat t aacaaat gt gt aacct t cc aat gat gacg cagt t t at t a t aaacaaagt t gt ct ccgac acgct caat c acat gt acaa aaggaaaaaa t at t gact ag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 agagagcttt ttagcttcac atttcttcac ttccacacac ttttacttct ttctctcttc Page 226 12689250 Sequence Listing.txt tcttctcttc tccagatctg atcccaaacc tttgattcat tgttgttgtt ctctgctgct ttatcagaga gcatcatcat gt 1980 2002 <210> <211> <212> <213> 208 2004 DNA Arabidopsis thal i ana <400> 208 ttggggagga gct gat aaaa aacaccct aa t agt t t gt gg aagt t t aat a cacaat aat t aaat aat t t a at t t ggat t c agtttttttt cagt aaaaga t t gat t t act aat t agt agg aaat at at at t ggt t ct agc ct agat ggat aacacagaat aacat t aat t t cgt at t t at at t aat t aat t act t ccccc t aagt aagt a caacggt gt a acgaacgtt c t ccaat t t ac at gcccct gc cgt t t gt gat ggaacagatt t t at gt t t aa aggaat acaa t agt gt at aa aaagat ct cc aaagaaaaac aaat acct ac caat t ct t ct tttttttaaa aaat t t t caa t t t ggct aat aat at gacca t acat t t aac acaact caaa agt t t agt ag t t t t t aat t a gct ct t caaa t t t t t t t at c ctgt t t gttg t t t gt caaga ggct t gaaga aaaaaagt ca ccat at t at t t at aaacaaa t t t ct t ctta agccat t t cc aat t t t gct t t t gaat t ct c at gggat t t g t ct t gaaat g at aat ct t t t gt t cat t t t t t t ct gat gt a t aact aacaa agaaattttt aaat gt agat t gaaaaat t c agagaactt g aaaaacggt a t at t t gaaaa t aat t t t t aa ccact aaacc at gcact t at ccaaagt gaa t t cgt t ct aa at ct ct gaaa t t ggcat t t a accct t gt ct aagat t ggt g acat t caaca aaat t at t t a aggagt ct cc at cacagct c cagct cagat act t ct ct gg at t t t gt at t t at t cgaat c at ggact t t t t cgacact ga t t at aacgt a gaaat at t at aat aacacct gacgat gat a t ct at t aat a at aact ct aa t agt t at gca aaat cat gt t t t t acct aaa ggt ct gt t ga at t at act t t cat cacact c aaaact gact aaaaaact ga caaat cat t t caat t gcaaa ct aaaat at t t at aagcgt c t gt gaat aaa t t t t ggt aaa tttggaaaaa agcct gacgc ct ct gat ccg t t gt gat at g t gct gt t gt t t t cgat t t ga at ccgat ct g at agt t t cct attttttttt acccat acac t caat t cgac ccact accat t ggt cgat t g at at acaaaa t at acat aaa at t gat aaaa aact at aat g t cgaact cgt aaggat t t at accaat t gga t t t t ct aaaa cat aat acca acagt agt aa at cat t t t ct cat agaaaaa t acgt gacct aat caat at t t acgcact ca act t gcct at aaccgct cag gt gagat ct c cat gt t ct t c ggt t t t t aat t gacat aat g ggt t t aaagc t t gaat gaaa t ggt aat gt g at cgat aagt aaaagagcat tcgccaaaac t cat t at t cc gcaat t gct g aaggt aagt c aat t t gaaaa at gt aat agt t acct t gaac gt aaaat t t c t gt caacacc aaaat t cgt t t ccct t t aaa at gt caacac cat t t at t at gaaaagagcc t t aat t aat t ggt t t ct aag at t t t t ctct cat t t t gcca gct gat ct ct t ct caaggt a gaat t t t cat t cgat t t t cc t cccagcct t t ggaat t t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 Page 227 att gt gggt a t att gagct c at t t ggat ct ct t t gat t ag t gt t t ct agc ggagt t agag ct at t aggt t tt gtt gt ct g gaat t cat t c at ct gaat ga at gt t t t ggt cat cat caag 12689250 Sequence Listing.txt tcattgattt attgcttggt ccaacatttt tagcagctgg aattttggaa agaactattt ttgttgtatc gttttgattt acctttttct ctgattattg ttttgtgtcg gttgcatcca atcatttttt tatgtgctca agttattgta tggattgttc tagacattgt taagatctga cgtttgcatt ttcaggaaaa at ga 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 209 2004 DNA Arabidopsis thal i ana <400> 209 at t ct t agt t t t gt at t t t t tgt t ctgccg act ccaccat t aat aat t ca t aat aat cca ct t t at at ct at t at t t t cg tgtacgggaa accggt ct t a gggt ccat t a t aagt t ct ac t cacaaat ga t aat t gt gt t cgt cat agga aaaat t agaa at t t gagt t g cct gt at gt t gagcccat ag ccacccacaa gt t t t t aat a t at aact t at caacaaacaa gt t t t t caga cact cat t ga ggtt at t t at act t at t t t a gt at caccct aat t t gat at t agagacat a t caaagt gca aagct gat cg cgtt gaatt c t acaat aaat t ct gaaact g tt ctt acaca aat gaat at g ct aaaaacat tt gt t t cttt gaaaaaaaac t gacat at gc t t t t acaggt tttttggaca gtctttcttt at at at ccca cct at cgaat at t at agt t t ggat gt t at a aggt t t t t ct actttttttt ccgagaattt t agcggt t gc ggat aaat ga cat ggcagaa t gt gct cggg agt agct at a gacct atttt ggatcaggcc ggt cat aat g t t t cat t t t c acaaat cttt tt catt gt ct ggt t t gcgt t aat caccaaa t at agt at at cgt t caaaac at t caat gaa ccact cccct cat ct aat at t t t aagt t t g at cat t t t gt t t t t aaaaat gct t at t t t c t t t t ggt t ct t agt t cct t c t at t t cat ga gaat t t ccga t t t aaaat ct cat cat cat c t acat at t aa aaat t t aaac t caggcct at cagaggcct g t ct cat t t ag aaaat t gaat ct cggaaaag at gt act t t c t at agt at t g t gt at t t at c aagaat t gat ccaaacccaa gcgt t gccaa at gggat t aa ct caggaaaa cat caaat ct ctact t t t t a t aagt t t t ag tgt t t t t aga t gct ggt t t t at aat t caga at ctt ccaca ct t t catt ct caaat t ccgt at t gt aacat ct gat accaa aacaaaaaac cct t gccat c at aacgt t t c aagt gat gac ct ccgt t t ca act t agaaga aact t t t ct a act at accat at t t t t t gac at t gcgt cag caat aacat a at at gccat c t t acat t t gg t ct gt gacgt aat gcat ct t ct t at ggt at gat at t att t t gt aact t gt cagaaaaat a ctt gaactt a tttt cat cat t t at act aat gt at gaaacg gaccaacaaa t acagt t tag t ct t t at acc t cgaaat t ga ccaaagcct t aagagagaac acgcaaat gc t aaagct t cg t t t aaact t t ggt gccgt t c t ct t gt t ccc t aacact aat cgt at aat t t t t t cat t aat acct t ct gac acgt at ct ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 ctactatccg ctgtttatta atttctctga atttttaagt ttaagaaaat agaaacaccc Page 228 ttaaaaaaaa t ccaccact t ct t agt t aca t t at t t aaag t ct agat t aa ct ct t t t gga agt ct cct cg t t gt t gt aat aact at t t t a t aaggt t aag gt t ct aagcc t t t t aaccct aaagagaaag aagct t ct ct ct t t ct caat t t t t gt t cat t t caaaact t aagt gt cat c 12689250 Sequence cat t accaaa aat aaaccca gagccgagcc aagccgtata ttctctaagt ttggtttatt tgtgaaacca act t gt t cat ctctctctct ctctctctct ccatatccct ctctctgttt gttgttgttc aagtgaagaa cataagaatt tctctgataa at ga Li st i ng. txt t acggt gacg act at t ggt t taaagaaaga at t gaaaaga ct ct ct ct ct t t t ct aat t t gct caat t at taaagaaaaa t at cacagac caaact t aac t aagt t t ggt accat t gct c ct ct ct ct ct gagcct ct aa gt t t acaaca gct ggagt ag 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 210 2004 DNA Arabidopsis thal i ana <400> 210 t t at t t act a t gggat t t t c caaat gt t t g t cacgagt t t t t cct t gggc aagaagt t gt t aaat ggcct ccat t at cgc gagat gggt t ct ct at t gca t t ggagccat t gt ccct t ct gt t t t gaat a gt gt ct t t ct ggaggtggcg at t ccct t t g gt caggt ggt aaaaat ct cg aat cagaaac t t cgat cat a gt t acct ct g gtat t t t gt t gat t t ct t ct at gaaat gct ct aat acgt g t at ggcgt t t t gt t gggt t c gt t at acaac gagacat cct t gt gct t gt t aagct t ct t c t ggacggct g agt t ct ct t c acaggt t ct g t ct t t at t ct gt t t at t cga cct act t agg t t t cct t ggg t t t cct gt ca gt gggaat ca accacactt c ct t ggat t t t at gagt gat g ctgt t t cttg ctctgt t t t c t t t gtgt t aa at ct t gaaca aagagagat t ggt cat ct t g gct gggt t t c t t at ggaaat ggggaacct c ct t t t ct t t c ct t cgt ccac t t t gt t atct t ct gccgt t t aaact cacct aat t t t cct c at at cact t c act gt t t cgc at aagct t cc t agccacact cccgt agcgc tttttgggga t aat at gggt at t gt aggac t gagagcat c cat act gcct at ggaat gt c t gtt t t t cat aggct gacat at aacaagat ct aggct agt t ggat aacaa cct at gt aac t at ct acagg t ct t t t cgct gt cat t ct t g act ct gcgt c at aacaaaga gact aaggca t t caat gt gg ct gcat ccct caacccagct t t at at t t gt ttaagaagaa gct ccagaga acct t acgt g t t ct t t acat t accaagt cc gaggat accc ct t gat t aag cgt t t gt gag t ggt gt t gct at ct t t t gat t t aat t acca ct ct t accat ggat t gcaat gat acat gat aaat ct cgt t aact gt cct g tggcggggaa ct gct t caag t act t gt ct at t gagccgt gt cat ggt gt t ct cgcacac gact t ccagt agaat ggt t g cct t t gggcc gcat gt t t ct at acacgagc aat gt ct t ag t ggt at t at t gt t cacgaac gt aagt ct aa t at ggagcat cagcat at gt agcaat t gt a gacgct gt ca t ccagccat a cct cct gagc agt t caagag at gt gat t gg t gaagt ct t t t t at ct ggcc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 229 12689250 Sequence Listing.txt ctggtt catggccaag caaacttatg ggcgttgtta gccttgattg cggt t t act t t cat gact gaggat cat t ggaaag gt aaccaagg aacaaaagga acacaat agt t aaat agt gg caaagat t aa t ccct cgct g t t t aggt t t t accagt ct t t t t t ct t t gt c aggt t ggt ct at ct cgct ag gaaaaaat gt t ct at at acg t ct t caaat t aaggggctaa caaaact gt a aaat aaat aa cct ct gct t c t t t t at aat t gcgt gt gct t gagcaagagc t cgat gat cg agagaaat aa at aaacacaa t t aat at t aa gt t gagat at t t at gt aaat aat aagt gaa aaaagt gct t t t aact ct t t at aat t t at t gt gt gcat t c cat g gt at gt agt c ct aat ccat a acgt agat t c gat aat cgca aaagaagaat aacct aat t g t at t t t agca tcat t ct t t g ct t t at cact t gctct cggc at t gat cggg ggt t aaagag t at gacaaac aaaact caaa gt ccact t gt acat t agct t t ggacacct g at t acagt at gat gcgct gt t t t ccact cc aggat ct at a t t t at at ct t at ct cgt cag t ggacaat ct aacgact ttt aagt t at at a aagt cat gaa gcct caaaga t t t gcat aaa aaat t aagaa t gt caact t g caaat caat t t aat acacag t t cat cggat 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 211 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 211 agattaacgc acatattatt agcttcacat tgat t aatct aagaatataa ttaatatct a attaacgcct atattattta gcttcacatt tgacgccatg gtatactcgt ttccagtact ttgacttttg tctttgttag tttttttttt caatgttttt ttttaccaac tctagtctac aaagaaataa act at acat a t t aat t t t t c aaaaaatagt tcaattggta aattttgtat cattggattt tcttaaagat atctaaatga catct t gaat ttgaataata ctaatctgct atatatttgt tctaaaagtc atttataatt actgacaagt tacaacatgc tatcaaaaac atatattatt gcattttttt cctcattttc tgggtgcatt tatttgacta ctactactat ccactcctcg ggatcagact aggctacat c aaacagtaca gct gt aact t cct at at aaa ctactagtct actactatcg tatttttttt t gcat gat at atggcaagag gct t gt t at c at t agact aa gt aat t gt t t t act at gt t t aat gt t t t t t gcattttttt t act at t t t g act aaaaaat gaaaagatt c t agaaaat t t aacgt t t aag t t aaaaat at cat t gaacca cacaaagat c cct t cgt ct a cat t cgt t ac ct aagaat t a acat ggt aca ct t ct at cat t at gcaat t a agt agt ct at gt at gcat t t caacaaact a at acaat at t t agt aaaat a t t at aat t t g at cgaaat t a t at aaat t aa aaat aggt ct t agt aaaat t cat gt gccca t gact acact aat t act at t gaat t ct t ag t t agat t aat agt t t t t at t t t t t accaat at t agt cagt t t at t gggt a aaaat agt at t t t t t aacat t t ggcaaat c at t t aaaat t gat t t caaac tt at aaacaa cacgagt ct a accgt ggaaa t aaacat t t a aggt t gt t t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 attttggaat caaattaatt agtagacgac gacttcaaac aataattatt taacatacac Page 230 12689250 Sequence Listing.txt gttttgaaca actgtatgca ataattaaaa aatcaacaag accaatcaaa atcattaaca aat gct gt aa aaaaaagt at ttcagt t t t t t t t gt t aact t aat agat t t cggat aaat a gt acaat gac t aaaat t gt c t at ct ct ct a aatttttttt ct gagat t t t ct t t atcttc tagacgggat ggaagaagtt t aaaat cgaa aaaaaaccat agat caaagc t t t t t aaat g ctt gaacaag ct cct at aaa gat cct aat c gat cacaaca at t at ct gat cgaaaat aat t ctt caaaaa caaat cacca t ccgat t t ct t t acagct t a ggt ct t ct cg gaaggaagag t aacaaat gt t ct agt gct c t t t t cagat a t act at t aat at gct t t t ct aat aaat aaa aaaaagccca acgt gt t t t a at t t t t gagt t t cct ccat t t cat cat cat caacct t ccc ct ccggt cac at t cgt at ca at gt cat t t aacat tagat t t t t c t t t t aaaact at t t cct t t a t t t t t t act t t agat t cagt t aaat at aat gt t gaaaaat t t t gt t t t t a gt t at t acat cgt t gaagaa gt ct gct t gt cggaggt t ct caacgaagcg gt t t ct aaga at t aaaaat c ct at t aat at at t acat acg cagct cct at t t t t ct t t t g t t act caaag t gct at ct t c ct t t t t gttt aaaggaaaaa gacgaagaag cgt aaagaaa t ctt cccaga aacgccgat g at at t t aat g t at t aat at t t t ccagat t t t acat agt aa at aaaat act at t t agt cgt t gat t aaggg cacct t ct t c ttttttttaa agaaaggt ct at acct t cct tggcgccgag t cagt gat at at at t t t gac 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 212 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 212 ttttcaacaa aaaggtcaga ggtga< aatggcgctg tatccagctt ctttt tgggt t ccct gcaacagcgt gtgt c acccatttca gccaatctca tcat c acaagcagct agcagagcac gccaa! taactcatac gct t cctcta actgc atgttcaact ttgggcgtga agct g ttccattacc aaccctccat ggct a gggtctcact ttgcagttct cttccl t agt ccat ga gct ccat aac ccgag cattctattg aaaatctcaa cagct1 gagtgcagtt ccaagtattg catcc< agtacgtcct acgaatgctg cttca( agatgaattc ggtttcattt tttcal atgaa cgccagagat tgtctactct at t ctatttc ct ccc ccagc acact gccgt cccgc taagc caagc tccat at cat :tttc aaagc ct gt a tact t t t t att caac t agaagaat t ttcccccaaa t gaat cact t t cgacct aaa ct caaccat c at t cagcaca ct t at t gaat cgcggtccag aagcaat cct t att ctt t cc cgcacaggaa cat ct gcct c t cat t at ct a gcat ct gccg t ct gcat t cc gt gat cggt a agat caacca ct ct t aaagc accaagaagg agt gt t act g gact t t acat act t t t gcat t ct t ccaat a gacaacagcc agt agccat a at gat t t t t c gat gagt ct c cgt at act ct agt t cct t at cacaaccat a at ct gat t cc t aat ct cat t ct t ct ct t gc cct t at cct t acat at caac gat cagcaac caacaaat gt cacact ct t c 120 180 240 300 360 420 480 540 600 660 720 780 840 Page 231 12689250 Sequence Listing.txt aaggagaccc gttttggcat attggtctat catacagttc catgtaacaa aat ggcgcaa t at caaagct agcagat t ct cgt act gaca cgct t t ct t a act ct gcggc aaaat gt at a cccat gcaat gat aaacgag gct cgct ccg t t agt gt t gg aaagcaagaa ccat gaat t c t gacat t gt g agccgt gacg at t t gt t t t t ggt ccaaat c agt t acgcct tcgaaacaac t caaagat cc gt gat t aaat gct ccagaca ttaacgacca gaaacct gaa at t t cgt cga agagcat t cc ccct caccaa aat cggt cca gct cat cgct agacat gct c gct t gct cac gagaaact t c gagacaacaa t aat cgccaa t aagct t aaa cagt t acgt g t t ct t ct t ct aat caacgcg t gcgt gct ga gcaagt ct aa aat ct cct ag cct cact t t t gat acccat t acact t t ccg t caaat cagt ttgaaacaca ggt caaacct aat cgagt aa gaagat cgac t gcgaaat cg gacggt gt ct cgat t caaga t gt cat caga at at caaat t gcaaagagaa t ct t ct t ct t at gg act gat accc at caaggcca at cact aat a t ct cat aat c cat caaagt a agcat cact a aaaaaccat a caat t cacga t t agct ct ca cct ct gat ca gaagcgt at c t cct t gt cga ct gcaagacc t t cct acgac aagaaaaaaa gggct t t agt t t agggct ct ct t ct t ct t c ccagt t t t gc at ct t gat gc gccgaaagaa ct aaacaaat ct aaacgt aa at ct t t ccac aacccagat c gaacat gat t at t ggt t gaa t agt gt t gaa gaat gt caag gt cccgt t t t t t agat cgt t ggaacacgaa agt cgccgga t cct t aat gg t t gt t ct t ct t t ct t ct t ca cat cct t t cg cgt acat ccc at aacacat g agct aagcaa ccaaagccaa cagcat caac aaacacagt a t caaagcaat t aagagt cgt aacagaaaat cat aaagaga tacagaggaa caccat gt aa gat aagt t t c at t gcgcct g at aaacacgg gcct t at t t t t at t t aaaaa t t t ct aggat 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 213 <211> 2004 <212> DNA <213> Arabi dopsi s thai i ana <400> 213 gattcgacca agct t t at ag caat acaagt gagttcaatc cgggttgaac cgggtgagtt t gaat aaaat at ct cgagt t gt gaat gt at ctctctctat tataactatc ccaatctctt agacaaggaa aagctcataa attcgataca t gaggcgt aa gct gt cacaa gt t aacat ca actgattatt tatacaatac ttttatttca gttcaaggct catgatcgac ttgctacctc aatctgtaag ttgaaacaga caggatcgct tcgatagacc aaagt act aa agcat caat g caaccggaag t at at gacca ggt t gt t ct t cggt gt t gt a t ccaacat t c aacct agaca t t ct at aagt t aat ccgt t t gaaagat aat cat at aaat a t t agaaat ga aaacaact ct agt t gggt ct at agagacat aaagcacat g t at agat t t t agt gat acca cct gcaaaat cct agaacac t at aat acac t acgt t t gat t cacaat aaa ct at cgt cct gggaact act acacgaactt at t t caacaa cat at t t cag ttgacgacag at gt t ccgga aagaact gt a 120 180 240 300 360 420 480 540 600 660 atttgcaaat gcttgtgctg aataagaaaa gtagacatat at ct gaccaa gt t at aat ag Page 232 12689250 Sequence Listing.txt ttattaagcc ggcttaagag caaaacagag attaattacc atcaggatca aagaagaaaa ct t gct t gac t cat aacaag aact gt acca ggaaagagt c gat cct t aac t ct ggat aat caaaat cagg cat t ggt t t c aggcat ccaa gagacaaagc at t ct aat cc taacaacaca aat t gaaaac t ct gat gat t ct aat caaga acgt ggcat c t t gact at ca t t act ccaag ggacccc gac gct ccgt aat at cagacgt c at cgat caat t t t t ccat ca gacgagaaag t caaaagct c gaaat t t ggg cgct gaggt a at gcat t gca act t t cgat c agaacaaacc aagt ggacac cacaat agca agat gggt aa aacccagaac gt cgat t agg ct ct ggct at agaacaact c ggct cct t at gt gaaact aa aaaaat t aat caaacaat gt ct ct cacct c aat t t caact t gaagat t ga ggaagagact gat cgat t aa at gt aaacaa acagagaaac gcact gt aag aaagcacct g t cct cgaacc acgaagat ga t ct caat aga acacaaat ct t cgact cagt agagaat acg t gt cgt t acc at ggccaaga agaaacaaaa aacacat ct t aaccaacttt ct gt t gct t a at ccggt t t a caccaat t cc ct t cagt gaa aat g t ct gaaaagt at aact ct ga gccaaagaaa agat at gat g gacct t ct gg gt aagt t t ag caaacacct g agat aacact aact agaaac ct aaat aaaa gcgt t t gt t t aagagagat a t ct t t gt aaa ct cgccat gt at ggt cgaca t aat caaacc aaaaaaaaga at ct gaat at aaacat ccgg t cat t ct t cc gagat t t ct t t t ct at ccct aagat accaa caat acct t g acccat t ggg aagat t t gt t ccacaccacc acaat t at ac aact t aat ca gagat aagt g agccaaat t a t t t t gggt cg agaaaaagaa act gggcgag t t ggt t ccaa aaaaagagat at aaggt t ga at t at t at ac ccgaccaat c t ccggt at t a aat cgcact t t caccgaat t t t ct cct aca t cat at ccat agagaat gaa agat ggct ag gaagggt t t c t gt aggt ct c at at at aat c aaaggat t aa acaagagaaa caaaccat ag aaat t ggt ag aagt gt aaga gcgt gt gat g gcggacaaag t agt aaaagt accgat cggt gt aat t gcag agaaat gt t c aaccct t at t cct t t accag t t ct act ccg 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> 214 2004 DNA <213> Ar a <400> 214 aaat t cat gt t gt at gt t t t gaaaat at ag at t ggt gacg t at acaaaaa t ct t t t gt t a t aat at t t t a )i dopsi s t hal i ana gaat t t ct t t agt gccgacg ggact aaat t at t t acgaaa t t gt ggt t ct cgat acacct at t ct aat t t cgaaact t t a t gt gt gat t g gt aat t at gt gcaaagaaga t gt gt t t t gt t agt t gaacc cgt at gt aaa aaat aat at a agacagcttt gt at gacat c ggt gt t ct t a t t t t t at gt t at t caaat t a cgact ct cca Page 23C gggact aaat aaagat at cg gct t t t gt cg gct gcacat t ggaaaact t a t t aggat at t at aaaacat c t gt aat t aga aaact gagaa acaaat cat c t gt caccat t t ggt at agt t t t caaat at a t ccact agct 120 180 240 300 360 420 12689250 Sequence Listing.txt tgttggtctc taaacttgaa atcagttttt atttttcttg taatattttc caat at cgt a ccaaaatttt t aat at t t at aaact gt gt c t t at aaaat a ttaaaacaaa aaact t t t t g gt t t gccgcc at t gt t t t gg at cgat cgaa aaagggcaat aagt t cat aa gat gt gt t gg tt ct t t t gt t at ct t t cat t ggat gaaact t t t gt t t aat ct t caaaat a gt aaaaaacc tgt t cct t t a gact aaacct cggt caagat ggcct t t t t c at t t t t cct g t cgt ct ct gt t t ct t ct gat gacaaatttt gact aat cac t gt agt cgca act acat t ac gt at t agt t t t t t gt t agga t t t t ct aat t caaaaaaat a ct gt t caact aact t gt gca ct ccaat t at aaaaggt t aa caaact at ct t ggt agt aaa t t ggt ggt t c gt at cat t t a at t aat at ag at at at at cc aaacat at cc t at ct gcgag gat aat cact cact t aacga t ct t t ccttt agaaagaaag gagaactttt t agagaat aa aaat aaaat g at t aaat t cc at t caat at t t at aacgt t a accgacaaaa ttgt t t t gt c aaaaaat t at t t acat aacg ct agt cacat at gt ggaat t t t t gt t aaag aggagagagt t cct t aaaat at cct t cact at cgcct aat tttttctcaa gt aaat t t aa act t aacat a at t t aaat gt aagt cat gaa aat ccaccgt aaaagaaaac aaactttttt cgaaact t cg t ct t t t t ct t tat g t acat cat at t t at gat at t t at t ct gt aa gt at gt act a at at at agat t at t ct acat tctgt t t t t g ct t gt t t t ac t t gt t acgt t gagggggt t a aggt ggcaat t gt ggt ggag ccaaat ccat agt t gat t gc ttgt t gt t t a aat t gt t t gg aat t cgt t ac gt t aaaact c at t caaagt c cct t at ccat t agat ct t aa agt t t cct gt t t t t cgatta ct ggcgact a ct ggt at at t ct ct at aaac cgaat aaaaa t gct t gaaaa aat t t t gt aa t acgt aacgc t cgt cacat t t aat t aaat a at t aaaat aa t gt t ct act t gat gt aaacc t gcaaat ct c agagtgagac t t at ccat ca agat gact aa ct t gat t gat t t cat gacct t t aaat t ggc t gt at t caga at gagaaaat t t at t agt t t aat t ccaaat ccagaaat gt t aaat t t t ct aagat t gct g ctggagacag aaaat t gt t t t t aat aat ac aat aat t t aa atgt t t t t aa cgt t t cat gg ttgttttaca t gt t t t gtt t t agt aaat t t agt t gct t cg at t gt cgt ga aaaaaact aa caat aat at a ccacgtgggt ctactttttt t aaagt aat a aaat gt t t ga at at aat cca t aaacat act t aaaagcat g t t cct at act t ct t ct gacc ccaaaat gga ct aacct t ga cgt gaagct a agaagacacg at at t gt t t t 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 215 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 215 gataaacctg tttgtgaaat tttcaaaaat tggattccat ttattttggt attattgcca tcttaatacg tttcaagcga actaaaaaaa tgttaactac tagattttaa tccgcggtgt accgcggaga caattcattt tttaaaatta atatatataa aaatttataa attattttat agtttatcat tgttactaag taacgtcctg ccaaacccgt ccacaaaaaa aaactattta Page 234 120 180 240 12689250 Sequence Listing.txt t t t t at t aat cgt t gcggga aaat t t cgaa t aaat aat ga at gt t t t at g t t agat aat g at at t t t cat at gat t t gt t t t t t gt t t t a agt t t gt t t g gt aaaat t aa ccaact ct ag at aaat t t t a caacactt at t aaact t gt a t at t gt aat t gt t t ggt cac t t t t at caac t t cagat t t g aat gccaat a gt at ggaat t ct t t gaat t t aaggt aacaa t agaat t at g cagt at t act agacgat aaa cagat t aggt at cacgct t t acat t t t t gt t ct t t ggttt <210> 216 gt t gat ct t a cgt t gaaccc t cacaaaat a t at t aat t at aagt t t aaat at at at t aac t cct aaat aa t at t taaaca aaat at t gt t aat t aaat t t ttattttgga gat t t at t t g t gat t aaaag agt t t at aag ttcaccccaa t gt aaaaaaa aat at acat t cacgt gt t t c ggt t t gccca t t caat gt t t gaaat gact a aat t t gt t t t acaaaatttt agcaaaaaag aat t gggccg gt t aact gca t caccgt acc t cct t cgat t t t at at gcat t ct cggaaaa t t t at aat at acacattttt t gacgaat t t t t t t gat aga at acgaat aa t ct aaaagca t t aat gt cag aaaaacgttt aat aaaat ac gat t gat aat acaaagattt t t aaaat gac t t acat t caa cagct ct t at aaacggat gt aat gt aaagt t gat aaaat a act t gat gag cat at gacat ggt t t t caaa at gagcacat tcaccaaagg gt at t ggaaa aaaaagaaaa agcagt t t t g at t t agt t gg ggaacaaacc t ct ccgct ct acat aat aat at g gt t t at gaat cgccaat t gg gt t t t t gaga t t aaact t t t t at t at t t aa aaaat t t aga t agcat gaca agt aagaaca gt t t gt gct a gt gat t gaca aat gct aat g t gt aaaaat g at t t caaaac gat aaaaat c ttcat t t ct t gggggatat c at ggt cgt cg t agt t t at at accgacat ag agagaat cat gcaat t ggt g t acct aat ga aaacattttt gaaagaat aa ggct ct t gat t t caggt gag ggat t t at ca t ct ccact ct aat acact ct cat t gt t aaa at at t t t agt t t aat at at g tttttttaaa t at aaat t ca aaaat aat at agat t t aaat t t gt aaat aa at at t t at t t at t t taaggg at t t aaat t a acat t gaat t t at acat aga cat act t aat t t t ct gagt t at t t t t t act at gaaaaaca cgt gat t tag agt t aacat g aaggt t aaat t t ct t t at at ct at ct t aaa aaccct t t ca t ggaat at at t gagcat aat cat gt ct agt ct accaaat c aaat cct t aa t ct t ct ct gt t gt caggat t t t cat t ct t a ct ct at at at aat at t aat a t t t t t acct t t t t t aaat aa gt t gaaat ac aaacaaaaaa at t gat gt ga aaaat t t aaa gt aaat aagt t gggt agagt t ct aaaaaaa at agct ct gt t ggagt at t t acgt cact t t ttgat t t t t g att cggccac ccacgtggga gat ct caaaa aaccgaacgt t t aaaaaat a aat t t ggt aa aaagcct t t a aat ct t aaac caaaaat acg gt t at acgaa t ct at cgcag t t t gat t ct c 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <211> <212> <213> <400> 2004 DNA Arabidopsis thal i ana 216 Page 235 12689250 Sequence Listing.txt tcaacgcctc agccttttgc ttagctcgac ttgttcctgc ctgcgacaga gaggt at cgc gt agcaat at ccagcaccgg acagact cat caacaagcgg acaacgagt a ggt t caccaa gcgatagacg aagccct gac agagagatag agat cagaga gt t t gt t ct t aggaggaat c aagagaggag t gt t acaat c ccat gt aggt at caat acaa t ggt ct ggt c gagagagagt t aaat aaat a gcgt gt ct ca aat gt ct at t at t agat aga acaaaggt aa aaact gt t t t at ggt gt t aa t act t ggt ga t cat cccaat at ct gct t t g agt t agt cag at gt gt aggt t gtt gtt tag agact t t t gt t cct t ct ct a cgacacagcc aact cct cct cacaaacgcc ct t cat aat t cagagccgt c cagaggaat c gagcagagca aagcggct t a at t caagat t aat caacggc ggataagagc aagat gcgac gagacgacgg gct gaaagca gaaact gccg aaacaaagaa t gt caggaat ggaat aat aa t at at t t t at t ggt t act t t aaaagtgggc cgt gt t gat a t gt gt ct t ct gact t ct t t c gagt t t t gat t ct t agat t c ttgat t t t cc aat t at gggt t ct cgt aat t t t at gagcaa gt t t t cct ga gt aat aaagc gccaccatt g at ct ct t t ct t cct ccacaa gat t t at cca cccgat t gca gacgcgt cct gct ccggat c caagcagcgt at cgcaccgg gcagt gacac t t aat cgcac ct gat ct cca acgagat gat ctctcggagg gaggagtcat gggcggtgat cgt ggat gct at t t gt t t gg t acct t at t a gt t t t gt aat gggct t t t t g ccact gct ga gat cct cgat t ct ct t caat ggaaat t ccg t t t t gt t t ga t aat t aat cc act t gt t t t g aaat t at gt t cat at gat cg aat t t t aggt t gcat t at t g at gg t t ct at acac gt ct ct gt gt t cgccggt t t ccat gt t t ga cggct ct gat t ct t cgct ct t cccgat ggc t ct ct t t agc aagaagcaat cat act cct g cggcct t ggc t agcagct t g t gat gagat c aagcagtggg cgt t gagact t ct ccat ct c t t t agt t t ag aggcaagaat t aaaat t aat ggcccggtcc t agat aagac t gt t aat cat t t t ctctct t gcgat cggt t gt gat t caac at at caact a ttttttcggg ct t ccgact t t ct ct aat cc gccgaatt ag at gt t t t gcc gacct t at at aacact ct cc t cccacct ct cgat t ccggc t ccgaaat cc t t t gt t ctct gaat ccgcct gact t t gt t c agt cggt gt t cgact ct t t g aagct gaaga gat t t t gat c ct t ct gct ca at cggaat t c gaat t cgccg t aaat cgct g cat ggat at g ggt t t ggt ct gagct aat t c t aat ggt t t t cat t at t at t ccat t at aag acaagt accg ct ct ct t gt c t cagt t t t ct gt ggt aggat at t t at acat at ct t agat c at ggat t gct t gt t at gt cc t gat cagt ga t t t atggaag at t aacagat gccact agcg t cacaaagct act at ct cca accgacat t a gccat caat t t t agct gagc gt t t ct agaa t cct cgat ct cccat t t t ca t t ct cgt cgc tccgaagaag cgat t ct cag t cgat cgaat tcagaggcgc gat ct gt cgc aat t t gcggc agat cagacc gt ct t acgag agagt ct aga cat t cat gt a acgaggct cc aat gggct t c agagcagcag ct cggaat ca at cggaagt c cgt t gt t agg ggat t t cact t at ct gat t c agagt aacag t t aagt t agt cagt gagat t t t caaat t t t tt gt t t t t gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 Page 236 12689250 Sequence Listing.txt <210> 217 <211> 2001 <212> DNA <213> Aral <400> 217 ggt gatt gat taacacagag ct gt t caaaa at t gat ggcg t t gact t acg t aaat gt t t a aagat t gct a ct aagaacac cgcccaaggt agt aact t gt t agt at t aat aat t t ggat g gact gcat t g t ct t t t t t t c gaaggtt t t g at gagt t t t a acat t ct at c t gaggat ggt t t gat at t gt gt caagaaat gt t gt at gt a ttggaccaaa acaaaaaat c ct cct t ggca cgccggtgac acct acct aa act agt t t t t gt gaggaaat t aacagaaat t at t at gt aa dopsis thalana bidopsis thaliana gct act caca at gcagaaat caat t gagca agacaagaat at t accagt a gt t gt gt gat ct gt ggt gca aacct t agaa aaat t t gagc cct t t caaac t at t t ggct c cgt caggaca gt gcggt gat tcaaaaacca act ct agt ag aat aagcact t cagt t t acg gt t accgaga gact gt t cca t t t gt aacca t ccaact t ga taaagaaaag t cat acccgg at gct gct at cgacagtgga ct aaat t t t c t gt aaaggaa at gt at t tag gct gt gct aa ttagaaaaag agggt aat aa ggt t cgt t t g aagt t at aga cggcattttt agaacat t t c act aat aaat gt gt gt t gca gaagct gt ga t t t gcct t ac at at ggt tag gat t at cgac agct t ggaac ct ggt t agca at t t ct t cca t t gacat t t t t ggt at ct at aaat ct aaag t cat cgacat agt ct gat t t accaaaccgt at cagt agt t tgt t t t t gga t t t gctct gt at cact agcc t t t gccgt t a t t t t t ct t at aaggat aaaa cacaat aaaa at t t ggat t t gaaaat t caa at caagat ac t aact t at t a t cat gaacct gct acccgt t ct t gcat cac at agcaggt t gaaaaaagct agccagtggc ttttttttag ggaaaaact c aagct t ct ca aat cat agcc cgt cccat ga caccaaagaa t caggt gt t t t cat t t gact aat gaacat t gtgcagagaa cat t ccct t a gt cggt t ggc cagt agt aca ttttaaaccg ct at aact cg t act gt ccgt ccggt ggt ag gt t t agt t gg t t act at t at agt gat agca ct t t gt t gaa aat agat gt g at caat caca t ccat ct t ct ttttgttccc t cat aaacaa aacaat aaag t gt t cagt t t aggt gcaaaa at gt aaagt g t cat cct gt g gagggagacg at t cagaggt agaggaaaat accaaaagt a cagt at t aca t ggaat cat t t at t gccaac gact t t gt cc gt t t ggaagg cat aggat t a aaat aggt t t t t acact gt a gggt t t t ggt agt t t gt t aa t gt agcggat gt aat gt ggg aaaat aagaa ggaaaagat a at gt gagaaa aat at at t t a tagaaaaaca gct gcagt cc cact t t t ct t at t t aggat a aggcgagcag t at t t gaat g ggt gcagat c cct t gcaaaa acct ggaaaa at t gct aaac cat t gct ggt aagagaaacc at gt t gccgg t aact at ct t cggct cagat acacaat at g act gt t gt t c aggt cat gt a t t gt aact gc cact t gaat t agact t t gt t t gt ggaat ct t at t t aagag gcacgtgat t gat acccacg gt cgt agct c aaat at aaaa aat t ccaaaa aaaaacgt ga cat t gat aaa acaat gt t ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 taatataatt ttttatctaa agtttttacc ggttaccagt t accaccaaa gtt gtt ggat Page 237 12689250 Sequence Listing.txt ctctatcttc atttttagag attagtataa atgagtgagt aaacaaacca gagagaccaa acttgcctct ccaccaccac ttcctctacc tatctaat t c cgaatctgga tttggatttg gatctcattt cgattcaacg atga 1920 1980 2004 <210> <211> <212> <213> 218 2004 DNA Arabidopsis thal i ana <400> 218 caacct t gt c t gact t ct ct ct t cact t aa cgagacggtt caggt gat gg t ct t cgat t t aat cccagag agat cgaaag agggt aagt a act acct t t a agt at t cat a aaaaaat t gt aaaaaaaagg caat acaat c t ggaat cagc at t gat t t ga t gagagt at t t gt gaagacc agagt t aat g acagacaaca t ccaaact t g ggct t t t at t caat t gaaaa at aaaaaat a gagagggaaa aaaaat t aaa gaaat t gaaa t ct cat t t ca t ggt t t gcat agagaaagaa aat cact cct ct gt gaaat c at t cgaat t c agagact t ct accgt t ggga at t t caaaag t at t gct aga t ccaat aat c aat t gt aaat at at at at gt act at t ggag caaaat t aga t aat t cagt c aat at at at t acat t ct t gc acat cct t t a ggct act cca aat aggt t t a aact t t t ggc aagcaaccat ttaaaaaaaa cgcaaaat ag agacggt ct a ttacaagaga cgt agaat ca t caggt cgt c act ccact t t aaat t caaat t aaat aagat t t t caagt t t gtggaagcag tttttttaaa t t at t t at t t caacaacttt ct act act t t gagat ggat t at ggat aaat gagaaagaaa t ccaact gt t aacaat ct gt at t at aaagg atct t t t ct t aat t gggct t at t act t t ct tcct t t t t t a att cgaaaag aaaaaaaaaa t t t ggt cgct ct t gct t at t t gact ct agg ct t gaacat t tt ct ct cgag t cct t gat ac t ccct aat ca t cact t acag gagagagaat at ccaaat t c agt cgcaaat acaggt at t a at gat t t gt a ct gat t t t at t act act t ct t aaaaaaat g agaaaacaca ct at gt gt ga at acat t at a agct t t t aag agt ct at ggg ct ct act t t a ct ct caggt c aat t t t act g aaaagaaaaa cgcaacagt g gt aact t t t t agcaat agt a aacact agat gact t ct t t a acgaggagtt cacct ct ggc att gagaaag gaaacagagc ttgaagaaca gaat t act ag at ct t ct t ct t gat at t aaa aacat ggaaa t ct t aaat aa ttttttttgg at cat t agct aaagaat t t g t t aat ccaaa cact aacgt a cgt gagt t ca ct act ccaaa aat ct t ct ac t t t ct t t t aa aacgaaat t a t gagcgat gc t cggaaaaat t gt t t at t t t t t ct ct cgct t gaagaacat t t ct gctct t t t at ct ct at agaaacggt g gt aaacgaca gt t ggat cag cgaat cct gg aaaaaagat t agagct caac t t t t t gtct c t at t t cagt c aaagggaaaa acact t t t gc ggaaagaat t cat t at t at c t aat t gaat a t gt t gt aact ccgt t gt aag agacgggtaa ct t gggct ac t aact at t t g ct agt t t ggg t cct aaaaca cggcaaaaaa aaagct gaca t t t ct agcca caccgt cgca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 accctaataa accctgacgc cgtttcttct tttaccttcc Page 238 cct cct t cat cct cgtt agt caat t ct ct t gat cgt aaag gt at t t aat t t aaat t t ct t att gcct gag t t t t t ctctc ct caggt at g agat t cat t t t acggat ct t ggaatt agt c act at t t gt t gcat ct aat a 12689250 Sequence tctctcaaac t cgt ct ct aa cttcttcgat tctcatctat ct at acacaa agt t t ccacc tgataaaccc taatcttctt agagatctat ttgttgattt aat gaat caa t at t cat t t g at gg Li st i ng. t xt aact cgt ccg cat caacaac t t t t t ct at g cgt t cgat t g cgcagt gact at t acaggt t t cgat t aat t agt gct t t ct gt t t gct t t c t at at gt t ca tacaaaaccc t gat agaat c 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 219 2004 DNA Arabidopsis thal i ana <400> 219 t at t at aat t t gagat agac t t cgt cagca agcaaccaat t gtt acgagt aaagagct aa acagccacca cgaaat cgca at at t cat at acct ct at ac cgcat gat t t act act cgat cacaagt acc agccaaaaat t ct act aat t act aact t t a caact aact c at acat at ac t t t t cgtct t ccacgt cat c aaagacccaa ccgt aact t a ct t t gagt t a aat gt t t at a t t acagt t at t gt aat t act t t t gt t at at at at gaat t t ccat caaat a cat t cct aaa ggt cat at t t ttgccccacc t at aat cat t t gcct t t cgt cagt cgcct t aat t t t aaac t at gat ct aa t t gaacact t caagat t aat t t gaagat t a t gt t agt aaa gt t gt t t t cg agcgcat t ac ccct gact cg ct ct aaccac t ct cgt t t gt at gaaaagaa cacaaaaaga cgt gacat at at gt at aaat t t t t gaggt a aagt aaat t g gt t agaaaat ct at gaaaac acccat gacc cact t t at ct t gcat gaaaa gact cacgac t gt t gt act t t cagt t act a gt gcat gcat cgaggt at at at cgaggt aa at aaat t aaa agaagctt ct aaacaat gaa ccacgt cgac agaaccct t g t cggagt tag t cat t at t t c aat ct aaacg t ct ct agct c cat ct t t gac cagaagcaaa aat t aaacaa t ggcaaacct gaaagagagt cat ct cagt t t at at at agt at cct cct t g at t t t ctct a aact gt at aa cagat t aat a t at ct ct cgt t at agt ct aa cgcat cacac aaagat t t gg t t t t ccggat agact acaca at at gccagc aat t t aaagt agagagagag ccaaagcaaa ttttccccaa at at t t ct t c aat t at at at t caact agt g acaaat cat a gt aaaaaagc t at ct at at a aat aaat agt ct at aaaact acaat gt aca gccact gt t t cat cct t t at at aagat t aa t gaaat agt a aacaaat at t t aat cat at t t agat aaacg taaaaaaaca cgacct t ggg t at t aat gaa gt agcagt aa aaagat at ag cgaat cat t t gat t t agcct ttcacgcacg aat acagat t aaact aaat t ccacgt gat g at cact t ccc gt aaat ggct t ctt ct att c aaacgt aat t at t cgt t agt t at ct t caca acgt aacgt a taacggccgg aat t aat gca t aaat t aat g t aat cggat a acaaaaat gt t t at gcaat t t cagcat t cc gagggt aat t aat ct cat t t agagaacaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 gagaggcgag agcgacgtag ggttggtgtt tcgtacggat tttctcggtc aatcttagtt Page 239 t ct ccggcga ct t aat ct gt ttcgt t t t gg ggat ct at t a tttttttttt t gt t t aggt c tggt t t t ct a agct agt t ag tgt t t gcttc ttattttcac gagat t gct t gt t cgat ct g t gt t at ggt t gct t cgt agc gt t gct t caa t t t gat t gt t t ct ct ct agc ggt t t t cttc ttat t t t caa aggaat cat c 12689250 Sequence ttcaggtaaa aat ggcggcg cgtatgaact cgatttttcc cactgatagt gttgtatgtt tgttttggtg tgtgtgttcg atttcttact gtttcgtgct tcttactagg ttttctatgt taatcactgg ttttcgtgct tgttgcttct atatttcgat ggataatttt aaaatgaccc at gg Li st i ng. txt gt t t t ggt gt act cgat t gg t agt t gat ct t ct t cgaaga tgt t t t ctta t t gct t t ggg t ct gagt cgt agt t t t gt gc actgt t t t t g ttgt t ct t ca gt ggat ccgc t t gcgat t t c t at t t aggt t ct t cgt t t t c ct gt t gaat t ttgt t t t t ga ttgt t t t caa ggt ggt t gt g 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 220 2004 DNA Arabidopsis thal i ana <400> 220 gcat cat cga aaaat ccacc cact at ct ct at cat cat ca cacccgat ca gt t at acaaa aaacct ggac aaccct gt gc acaaaaat at t at agat t ct aggggaact c t t t t aggaat at t t t aaat a gcat cact aa tt ct t t gttt aat aagagga aagact ct t t acaacacat a aaacgcgttt at gat gt aat aat ct acaaa aacacagtt c t gaccat ggg tcagcaaacg gaaaaaaacg agaagcagca aagagt t cca at cgggt t ct t ct ct t gat t ctct t ct t t c aaact aat ga cat t caact t agt t gcgt ca cat t t t t t t c agt ggt t at t cgat aaggat at aagt cat t t act acaaat caat gact gg agt agat gct accaaaggcc at t ct aat ca cat aggaaat gaaat ct at c aat cgagcat cat gaaccat ttgagacaga gt t t t t t t at t at t aacat t t gt cgct gt t ct acaaat ca at aagt gt t c act at acaaa ct t t ctattt at t acat t aa cggt ct t cag ggat t t gt ag t cgagt t aaa t aaacat at g aacat aagct aaaact caaa gcgt aaat cc caacaacaaa aat t cgat t a cagaaaagag agaagagaaa gagagagaaa t t t t at t t ga t gt cgaat ct aat ct cccat tatggaggag t cat t t t t t t aacat t t ct t ct gt agt t t a gaaggaaaaa ct at aaacaa t aaat aacaa aaccccaat a t aact at ct c cacaat t at t acccggaact t cgat gcct t ccct aaaaat acagt gaaca cgaagaacaa aagagagaga gagaaagaga caaaagt act t t agt t gat t t gt aat t cag acaaaaagac gt t gt ccct t at t t t t at t c aaagact cgg gt aaaacaaa gt aaagaaag att aacaaca t aat at at gc t gt t acat at t gaat aat t a gat ccct at c ct ct t aaacg gaaacaat aa cgaaat cgga cgaaat t agc gaaaat ggt g ggctgagaaa gct t t t t aat aaat t ct aat aat ct at ggg t t ggaat t at t agaaat t ag t at ct at at t gtctttttac t cat cat at a t t gagat t cg caacaaat t a at cgact act t gaat gaat g aagt cat aaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 240 12689250 Sequence Listing.txt taaaaatcat ctataatgcg tgtaagct t g cataaaaata cagtatataa aaact at t aa t t t at t aaaa t t at t t at t g ct gaccaaga t at at cct ac gagt t t ct ga t t ccaat t t c caat aat t ga agt t t agat g t cgat at t gg at t cgagaaa t ct cccacga t t gt ct acg <210> 221 gt at caacat aaat t gt t at t t at t gt aat t gt gggaagt at t t ct t cat aaaat cat aa t t t t aaat at aat gt cgacc t at t t gt at t gt acat at gt caaggcat ct cgat ct ccca aaaat cagcc caat cggaaa cat cat ct cg at gcgaaaat t cgaaact gc ttttttttta t t gagt t t t t at cact t t t c caaaaat at a ataggggaaa atgt t ct t t t ct at t t t t t t aact cat t t c at gg at gat t t gct at t t t aat aa gact t gcaac aaat at gt at aaat act aat aaat t agt t g at at at t ct t cat t t aaagg acaat t at at acgat t at gc gct t ct t ct a t t t gaagt t a t gct at at at t gagt t gct t aat t ct t aat at t t gcat ac gt t t gt at gc gt agagct at cat t t cgct g t at t ggt t aa cat caaaaaa at agact t ct ct t t t at t t a t t acaact ag act t agt ct t acgggcaaac aaaaaaaaaa t t t gt t gat t at t t gacaac aat t t t acaa at aaaaat cc t at t t at t ag t t t att agcc t cgt cact ga 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 tctacgttca tcgatctctc tctttctcgt <211> <212> <213> 1999 DNA Arabidopsis thal i ana <400> 221 accgaacat g at t t gcct t a t acgcct t t t ct at aat ct t t gt at ct gat t at gaat t gt t ggacacat c t agagctct g t aaagggt t c t ggt t acacc cacagt ct t c t t accact cc t t t aagct gc gct agaaaca t ct caat at t caccact ccc t gt gct t ccg gt ct act gt a t act gt aat t gagaaaat t g t caagaaat c gt cct t t agc aacccagcgg t act acat ag cagcct aagc aagggaagt g t cagcaact g t t gt aacat a ct t t agt gt g at t gaaat ga ggat t t gt t g aat cgggt t t t cggaat cca ccgct ggt at aggt t t ccga t gt t t agat t t cagaaagct ct aat ct t gg ttact t t t gg t gat gcagt g aat accaggc gt ct t ggagc acgccaagag t ct gct aat a act ct cat gt gt gaact t ga ct t accggct gcggt t t t ct aggaat cgct ct ccggt aag gt gt t t gggt t at agct t ct acaat ct t gg aat gt at t t t t ct gt t ct t a ct t gggagct t ctaggagga t gagat cat t aaat gct cgt gct t ct t gaa ccccgt aagt agct gccat t t gagt t t t gt t ggt t cact t t gggct t t cg t cact ct act t gt gaaagaa t agcct gaac t ggaat t t gc t t t t at at gt gcccggaagc at ct gt ggt g ggagct aaca ggcacat t cg gact ct cat g acaat t gaaa aaccagt cca agt gt gcact aat aat t gt g ggcaaccat c gt ggt at gat t ct ccaagt a gacgat aaag t t gt t ccat t agt t t t ct aa aact ct cagg t gt cgct t ac ct ggagt ggt ct gt ggct ca t t ct t gt at a t ccct gt aag t gagt gaact t gt aacaat t aaagact gat cagat t ct t g ccaat cact g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 gcacaggcat caacccagct agaagccttg gagctgcaat catctacaac aaagaccat t 1020 Page 241 12689250 Sequence Listing.txt cctgggatga ccacgtgagt ttccacacac tatataccaa atcgcattcc aaactcggtt t cgagcat t t gt t ggcccct at cccct t ca t t gt cct t ag t t t at ct at g t t at t t gcaa at t ggt aaac t aggt at t gt gaat t gagac caacaaagca t aaaaat t t a cgaat aaaac t agt t ccaac gcaat gt at t aat cact cct t t t t agt cga t t ccct gaca t cat t ggagc agt ccagaag t t gct t ct at t gccct ct t a at t at gat gt gagaacct at aat ggt at t t acagccacca cgaagat t cc ct agaaacag t ct gacaaga aagcaactt g t gt ct cgt t t cct ct aaact caacaat gg at t t cgt t t t t gcact t gct ct aagt aaaa agt t ct at t g at gcaat at t t t aacat gt t at aaacccaa t aaacgt gt c aaaaccat ca t t gaccgat a gtgtaagaag acact t cact t at t at t at a t ct cat ccat t t t ct t cact tttttgtcgt gct ct t t acc acaagacat c t t gat ct act t gt aaacaag t at t acagat at t aagat gg agt gt cacca gaaaaat ggt gaaaaaagt a t t gact gt aa t accacaaat tt ctt aacac t t at at gat t ct ct t gt ct c cat t ccagt g at gt ggt t gt aagt cct ct g acat gt at gt gaagaagt t c caaagat gt t t ct at caccg acct cgcct a t ccat t t gt c aact gt at aa gt t t gat cat ccacaat t ac caat ggt aat gct t t ct t ct t t caaact t c ggt gt t t t gg cat cagagcc gt t t ct ggcc gt gcaat gt g caaaggagaa t aacat ggat ct t aaccggt aat cct aat g aaat act t t t gaaaagaagt t ccacgagt a t t t t ct caca ggat gat gat ccacct cacc ggt t act ccg 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1999 <210> <211> <212> <213> 222 2003 DNA Arabidopsis thal i ana <400> 222 t acat gat at at t t t ccttt t t ggt aat t a t t t at at aaa gaaaaaagac aat ct cgt t g t ggt gagt aa at gt t cgaga aat at ct t t c gat t at at at acact t at ga ttgaacggaa t aaaat gt gg aat aat t aca gt t t gt t aat at acgt at ag agt gt aagaa t t caaat gt t ct t t cgt acg t gggt cgt at cgaat cat gg t cgaaccat a gggaat t at t t t t gtctct t ggt at at caa aat aat at at at cat aat aa t agt gccat a t caaat act a ccaat act ag acct t t t aaa aaaaagacaa at gt gggagt tgtct t t t aa t gat cat gat agccggaact t at t cggt t c t gt cat acca act t t t aagg ttttcttaaa t ct t t t agat ggcat cct ac ctt aaaagga t t cagt t t at gaaaat agaa cgt t at cgt g t t t ggt gact t agccgt t t c t t gccgct gc ct aat t cagt ccaat aat aa acgat aat ga t gacgct t t c at aaaact at gt t acgt t t c aacgagt aga gt t t ccgcgt at aaat t aga cat gt gaaca acat t at t t t t at at t t t ct ggt t ggt aat at acat at t g t cagcat cat gat t at at aa t at at ggagt t aact at gac t at gcat t t t at t t cgt aat t acct t t t ga aagggtggt t aaat at ggcc aacact aat c ct t t t aaat a acct t t t aat gggct t gagt gcatggt t t t ttgaaaggaa 120 180 240 300 360 420 480 540 600 660 720 780 Page 242 12689250 Sequence Listing.txt ttagatccat tgagattcta aagctaccaa gacgaggaac atggaagacg gat t ct t cgt act aacaaaa at aaaaacaa at gt aagt gg gagaccgaaa cgcgt t t aca gcacggaaca t cacaacat a cat gagt ggg aat gagat ac t at t aaaat c ttat t t t t gc ttct t t ctct t t gt act agc at cacat ct t t t t ct ggggt gtt caagagc t aggcat ct g t gat t aggac t gt t gaat t g t cgagt t t at caaaat gt t a gaaaccaaat gccaaat aaa ggt t gaccca aacaccgaca ct cgat t t ct gacaat aaaa gaaat gt ggg acat aat t ca agt gt agaag t ggact aaat gt t t t gtat c at aaccat t a ct t ct t gttc t act t at t cc t agct t t agg t gt t t ggct t caaagt t t gt gat cgagt ga cat cat at ca at at cccacg ct aat at act aagagtgggc at t agaaaac ct t t ct t t gt tcccaagaag act ct t ggaa gt aagt t aac caaaat at aa t ggt ccccac aaaat t ggt t t ct ct ct ct c ccat acat ac at agcat ct c ttct t t gtat gct at at at t acat ggat cc gt t t t t at t t tgg t gt gact ggt t t aaat t aaa at t t t at aat caagaaagcc t t gt t at at c caact agcgg tcgcaaccaa tgt t ccgcga aagt acact g t t t t t at ct t cact at t t t g tttttttttc t ct cact ct t t ct t ct ct at cat cact at t t gat t agt gt aagaggt t t c tttttttaag gt t t ggat ct ct ct ct t t ct t gat t agagg aaaaaat t aa cagaccat aa t t t aggagt c t t accgcggt cact t act at aacaggt gat ct t cacat gg agacaatttt t t t t t cctta t ct ct ct gt c t gt ct ct ccc cacgccacaa gggctctct t t gaaaaggt g cagaggtttt gt t t t ct ggg acaaagat ca t ggcgcat t g t t gt act cac ccat caaagg aagcgggttt t gaccacat g aaagt caaag t gt t cact ag gt at ggt aca t at t gact ac gaaaaagaaa t t t aaaat aa aaat t aat t a ct aact ct t t ct aacct t t a acgcagat t c caaaggt gt a t ct ct gt t gc t t t t ggat t a caagact aat at t aggct gt 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 223 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 223 aaaagcccac atgtgaaaag aaaatataat caaagtgata aagtataggt acat t acat a gtagacagat taaattttta cagctaataa ctcttcgtgg ttgtggaagc tctaattcac aatgagattg ttttgtgcca tttgctttta tagtct t cat cgaggaagaa agtaaagagt cttgcagtat ggtttacaac gatgaggata ttaaccaacg tttagtcgat gttgttatta gagttggaga tgggttggat tctgaaccat gactgctttt ctttccgatg gcggcaacag tagcccaaaa agt aggcaca at at agaaat t t gt agat t t gt ct t caaag at at agt aag agaaaaat t a at gt cat gaa t ccaaaat gg caaaaacaaa Page 24 aaat t at act caaat cat t t gct aat t aat gt t t ct t t t a agt gaaat t c agccaccat a caat caaccg aaaaat t cgt gcat gt gagg gaaagt ct aa t caaaaat gc t t t t gt aaac t t t t at t aac gcaact ccga t gat t at t at aact ct at ac gt gt caagt g gaat agt aga cccaaat cat at gt t t t gaa 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt act gcat at t t gggt t ct ga t aagt t aaaa ccacgtgggg t at gat t agt at t ct gt at t t ccgt acaga aaaccat t t g cgat gt at ca t t t t agcat c t at aaggt ag ttagt t t t ag at t t t t caat at at acaaac aaat caagaa aat aaaagt t t t agat at at gagaaaccaa acaagt gt at aat acaat t t at t t t t t cac cat gccagt g agt cact gac t accat t aat t acgt gat at aat aat gt aa aggagaaaag t ggct cgacc ct cct aaaaa at t t t t gat a t caagat t t g ggct t t ct ga gt t t cact cc cat t at t gag gagt gt gaag at t t ct at t t t t agcaaat t ggaacccaaa ccgaat t aga at at at at at t t aggt aaga at ct aaat ga t t t cct t t gt gt gt t t gt at gcaccat gct t t gcagcgt a gt cact t t t c t t t t act at t cagaaat t ag gaaaat act a caaaaacaaa t ct gt t gt ac t ct t aagt t c t ct t gaat at agagggaaaa at t t ct t cgt t t t ggaact t t gaat at gt g aagaaaaaaa t t t aat ct at ct aaat acca t t act agt at t at ccgat cg t t t t t t gat a caat t at t aa aaccgat ccg caagt t t t at aaat aaat ga t t aaat t gga at cacat ct a at ggact agc at gt aat cgt t aag ct gt at agt a at gaaat agg t at cat gt gc t gaacgaat t aat at aggaa agat agaaaa t t ct gct t at gcat t t t cct t aaat cagt a at aat aat aa at at acaccc aacgt ccaaa at ct gaat ga t t t t gaaaat ct t t gagt at agt acat gaa aaagcaat cg at t t agt acg tt agt cacaa t t ctgggaca acagagaaac gt cat ct t t g taaatttaat ataatgtact cacat t t t ct aaggat t t t g t gaat ct aaa cact t t t t cc ttacaaaaaa cat t ct aat a ct ct agat t c t t t t t gt t t t aat gat acat at aaggt agg acacat t t at ccaaat caaa t t caat accg agaaattttt t t t t t at aat t gaagt ct aa acct agt at t gat cagt t t a acgt aaacgt at ggcacat t cagt gat gat accaaat ct a t gt aat gt aa t cat t at gt t gagcat t cat aaat t t t gaa aaaaaaggt g agagt gact g ct ct t t t gga caacat t t t c at ccacat ac agt gt ct gat ct ggt t t t ca accaacat aa aact cgaat c t at caagaaa t t gaat t at t acct at aacc t cacat at t t t t t at gggt t gt aaaacct a ggcacaat ga ggt t act ct g cct t t gct t t 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 224 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 224 atatcacaat tccattattc gaataaggac atatgtaaat tttgaagaat tattttatat taggactaat catgggccgt gaccaaaaca actttatctg aaactaaatt taaacccaaa gctcaataaa gaaaagattt ggaatcaaaa cctactatat tattttagga agtacatttt t at caaat t t t t at at at ag aacct t ct t a at aaaacgat gat cat aagc aaat gt aacc Page 24z t at ct at t t t agt aagccat t aggcct at a t t gat gt gaa acgcacat ac t t aat t t t t a t t at at t t t t agaagt t t t g aat at act gg t aat t t t aaa t aat t gct at t aat t aat t a 120 180 240 300 360 12689250 Sequence Listing.txt cataataata tcattagaaa aatttaatca aaaacaaaat cctttaatga gtt accaaaa agacgggtca t ct gt t acca t caat t aaat at t t at caat t at t t t caat cct at caaaa gat t t gggt a cact cct at g at t aat at t a gat caacat t t t aat t t at t aagat t at at aact at aaaa at t ct t aaat t agaat aggt t aaaat t ct a acaat t gt cg t t aaat at t t at cgt cccgc at t gt t gcca aagaagat aa aaacagt t ca ggat t t aagc t ct cacaaaa cgt ct ct t ca t cat ct gct c t t at t t aact t caaagat t t aat t at at ac ct t aat caca t ct ct gt gat ttgtat t ct t t caact at cg aaagt gcat t t t at t agct a at aaaat aaa aagaaaaagt at t t t t t aag aaaaccaaaa t at at at t ag t t t aaaaat t t t t at t t t t g aaat at t at t cat agat t cg t at aaaaat a ggt at accgc at t agat at t gaat cgt at c aat aaaacgg at aacgcct c t t ct ccaaaa acat t ct cag t gcgat t t ca acaat at t ca cct aaact ga aaat aagt t c t aaaaat t ag at at gagt t t t aact aat t t t t t ct t atct cgggt cat at aaat gt t ag tggct t t t t g aaaat at aat act act aat a at gt t at gt g agaat gat ac act act t t aa gaat t gagt t aat at gat gt t ggt at agcg aaat t t acaa ggat t aaaat accaat aaag t t at t acaac t gt ct cgaaa gct gccaat t aacct aaaag agaagccgt c at gg t t t ct t aat t agcaaaat at gt aaaat aat cacaaaaaat t t at act at a aagct aact a ct at agt at t ggt gcat t cc at t agaaat c gcaagacgga t aat ccaccg t t aaacat at gt at gt at aa aat t t acaaa aaaaaat t ca at at ggt gga t t t aat aagg t t act t aat a gt t t t aat at ct agt t t gct aaat t t aaaa t t gccaat t t cact aaat ag agaat t cagc t caaat aaat t t ct t cct cc at agggct t a accgaat at t t t aat at t t t aaagt at t ca t t cgaat aaa at at at t t gt tttgcaagac cat at t aaat aaagt cat at tttggaggaa t t t caat acg caaat cat cc t gt t act at a act t t t at at cggaacgggt t gt at t t gaa at t aaaact t acaat t at aa at at t aact t at aaaaaaag t at at gggt t gct at cgt t t acagat gt ca at t ccaat t t at aaagt gag t t caat ct ct t gt cat t aag t t ggat ct aa caaat at ct a at at acaaaa acaat t aaaa gt gt aat at a t aat ct t at a t t t t t t ggt g aaat at ggt g cat aaat aaa t gggt t t t t g ggt t aaat ct t aat t t agaa t at aaaat t a at aat aaat a aaagaaat t a t caat at t t a cagt t t t t t a act gaaaaat t aaaat at aa t aacgt aaaa gaaaaaagag cgt aacagct at acct cat t cat t t t at t t gt ct t t t ct a ct cgt t cgt a 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 225 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 225 ttcttaagtg ggttttaaaa ttttaatatt catttttgtt ggtgtcgctt tatctaaaac gtctcttaca atttcttgca aaggcacgtc aaaatgcaaa gaagccaaca atagctatat gcacgtgatg cgtcaattgt gatccataaa tttttggggt cacgaactta cgatacggcg Page 245 120 180 12689250 Sequence Listing.txt gccgacatat atatcacttt gtagtggttc tatttaaaat tcaactcatt accaaaaaaa aaact caact ct ccaat t gc cct t gt at at cagt at t agt ggcaacaat a aaagt t act a aact caact c t cact t t at t aat at t gt gt caccgat t ca aagat cgcat aagt aggat a gt at ggat aa cgaccggt cc ct t t t t gagc cagaat t aaa ct t agct gcc t gat gat t at ct cat t caac cct t gacgcg cgaat t acac cat t t gt aca aagacgt aca tccacaaacc t t t gt t gct t ggt ct t cct t cct ct cccgg t gt gggt t at gt t gggagat gataaagagg ct t at ct t aa aact ct t at t at t act t gt t ct at t agaaa at t at at gca t acggcccat t t t t ct t at a accct t t t t c t act t cat at cgt ggt cgt g t ct t t gt cac aat t gat aac t gccat aagg t ggcaat aaa t cggcct t t t at gact cgct acagt t act c cgaaat t aaa at t t t at gt t agcacgact c ccact aat at aaacat agat acgacaccat caaacaacgc t caagt aaaa ctt caccacc ct t aaaaacg at gaat cggc t t gt t ct gt g t gt aacaaaa aagt at at ct t t agt cagac t cact t t gt a t gt t t t caga t gt gat gagt accat at ct t tctcat t t t a t acat gat at at gacat aag agt at t gat c at gt caat gt aat at t aat t aacaacattt t t at agaat t ctat t t t t gt act t t cact a aaggcat agc at ggt at t ac t caagccat g ttttcttacc at at t ct gaa ggagact t ga agagcgggcc aaagaaaaat acaaaaat ga tt cct ccgca gt aaat at t c aact at aaat t t gat t gaat cat g cact t t at at aat t gaagaa t t t t acat ga aaat aaggca aaat gcgat c gct at at aga at t t cagt cg t gt gt t act t t cagaact gt gcat t at t t t t aaact aggt t t t aact t t t t t t t ggtat g t aagt t t gga ttgctat t t g aagagagact t t t t cct cac t ct cat t cat at ct act t ct cacaact t t g t agt acccat agt ggat gt t t aacat aaca at t at ggt cc aaact t t aat gtt gcacaca at ct t t t t t t gat aaaaat a ct gt t ct t at aaagt aaaaa aaaact cact ct cgaagt gt cat cgaaat t cat aaagt t c t t t gt aat t g act at aaat t at t act t at t t ggaggat ga tt ct t t t t ct aact acat ac t ccgt ccaag ct t t at acat tttggaagca t t gt t ggt at t aaat t t t at t caaacct ct ct t ct cct ac tat t ggaaga aaagt t t t cc tcct t t t t gt t t t gt t aact ggt t t t t gac aaaact aaaa t aaaaccct a aat t cacaac ct ct t t ggat cgat ct t t t g at at gt gact at t act at at tttttttttt t at t gaact t tgatggagaa t ggggt t act at gt at t cag aact aaaat a agcaaaaat a aggccact ac t t t gact aag gt cat gt agg aaaaaat t aa caagt aact t aat aat gaag gat aat t aga ggaagat ct c t cat ggat t t aaat gt gaac aaaaaaaact ct cct aaaaa at caagaaat at at aaaacc aggcct agat cgt caccgt t gagaat aaaa cgcaaaaaat cct t aat t t t t t gaagt t gt t t t gcagagt 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 226 2004 DNA Arabidopsis thal i ana Page 246 12689250 Sequence Listing.txt <400> 226 at t t ct t ct a aaaaaagt ac cat t t at gt a t act aaggcc gt ct t agaac gaggct aat g at ct t t t aag act t ccat gt aact cggaag t at gt t at ga aaaacaggt a gt gaat aat t aacgaacgct t t t ccagat a at at t aat t t aaaaagat ac gaaaat gact t t t ct cat aa t t at aaaaaa gat t t t ggt a gggatggagc ggcct ccgat t ggt t aaccg t aacagt t t g t t aat aat t t accaaaaaaa gt acat t t t c acagagcaac acat t accaa aggaaaaaag gat ct ct t t a act at t t t ca at cat t caat t gact cat t t aat ggt gat a aaccact caa agaat t aat t aaacagat ca t agt t t ggt g cacaagacac aacgccct cg tgt t t agtta ggt t t at t ca aacgct t gag t agaagct ag t gggacgt gt aaat acacat t acgat t t ga aaaacagagt aaacat t at c aagt caaaac at at ct at at t agaat caac t aaaagt gca t at t aact ga t ccggt t t at gt t t ggt t t a t acggat at t aact at t caa aggat at at t aagct cgat a t t t t t t aat a agt caaagca at aact gcgt t t cacat t t a acacat aat t gct gt t gat c t at at t t ggt caaagaacct act acct t gt gct acat gt t t ct ct t at ga at gcgat t aa at gcaaat t a t gacaat caa aagagaagga acaaat at t a at at gt t cgt t t t aggt aca accat t t t aa agat at at at aacaaact ct t ct gcct t t t t at caat t ct t aagt t ct cg t t at aat cat gt ct accaaa at cggt t t t t aagagt cagc tcgaaccaaa t t acgt aat g at at t t t t aa at aagaagag aaat at gaga t gt cagaaaa cgt ggct aaa ggt ct gaaaa ttat t t t cct tact t ct t t c aaacagt t aa t agt t t gacc ct agcaagcg t cacat gt t g aagct ct acg aaaagct act agt agcat t g aaaat gt t gt aaact t aaga gacct aagt a t cat gt t t gt caat aaacaa aaact t caaa caacgt gt gt tgtcagaagg at aaaat at g ggaact cat c t aaat t cat t at at t at t t a ggat caagt c cct caagt gt ct ggat caaa cagt cagt t a cggt t aacca at at caaaaa agt ct caaat gaat at t aaa at aaat agt a aat aaaaaaa t gaat cggcc aggaat ct t c t t at aaat ac t t t at aact a gaaaact ggt agt gagaat g tcggacaaga t at gat cat c accaaaat gt ccacagaggc aggact t cgt t t t cat caat gaat t t t ct g t t t t ctt tcc ccaaaaat ag t aaaat gcgc aaaat ct t t a gt t t at acaa ggaaaagaaa at aat at aat aat aat t at a ct aat t cact ct t ccat t aa gt ggaat aac t t gt aact t t ccaaact gag gat t t t gaaa t t t gt t t aaa t t ct t caaaa gct t gt at aa caaagaaaca ccaact t t t a gt accaact t ggtggccaga ct t ccacggc aaat t cat t t ccat at t ct c aagct t t acc cat t t t at ca agt t t aagat gt agcat aat t gaagggct c t t caaat at t gat t gt gat t agccact cct t gt at t ccaa aacaaggcaa t gagt at cat t aggat acaa tcgagacaga aaat aacaaa aaaacaaaac acat aat t gg t at t t t t t t a caactttttt t t t at caaat at at gct caa gt at aagct a at t t ggggt t t aaggt cgga at t t t ccaat cact acgt ga act at t gat a aaaat acaaa cacaacaaaa t at aaaat ga gt ct ccaat a cact aaat t c ct acacaat a aat cacaaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 247 12689250 Sequence Listing.txt ctcgccactg tttcgaatag atgg 2004 <210> <211> <212> <213> 227 2003 DNA Arabi dopsi s t hal i ana <400> 227 ct t caaggt t t gct agt ct t gt gcaagaat ggaagt ggac aaagcaggag t aacaaacat ggt t ct t aca ccat ct t agg aggt t t at aa t gat ccgt t t gaggaaggtt t ggt accagg t gt agaagt t taagcgcagc aaggcgt t gg agt at cacat aagagcct aa gt t ct t acgt aagt t caagc agaagat cgc t cacact t gg at at ct t gt t aat ct at t gc tttcaacaag aagt gt ggt c gt at t gt gt t gt at t t gat t aaacat aaat t acaaaat t t cgt t t t t t gt cacaagaat c ggacacgcgt gat ct acaag at ggaagcca agaat cat ct aat t caat at ggacagt gga at t gact t ct acaat t t ct C t t t t cgat ca t aaagaaggt taagagacaa t t ct aaggaa aacgat t cgc aagat at t gg ccct agat ct t gt gaagcct tggagaaaac cat cacagat agcagt t gag t act t gt aat t ct t ct t gct agcact gt t t acaagaagga gt gggaagaa t agt agacca t ct t caat t t caccggaaaa aaggaat gaa act t acct ac t gcacgct t g t act acaaca cgaagagaag t gt aat gct t ct at agat ga t ct t acat gg ggaacact t g cgt t at t gt a t t agat cggt cgaat ggaaa aaacaaaaat t gt gaaacat at gcat ggaa caat cgt t ct t t ct t aaagc gagct t at gg t acact ccaa aacgt gagt c t t cgct agct gct cct gt t t at at agt t t g ggggaaagct t t cat gt aag gaact gaaga at t t caaagt aact ct aat a aat aat ggt t cat gcacgct caagaaagga gct t aacgga at gagct at a agt t t t gt t t ggt gt t ct ca agat t t gt t t agcat agat a at ct cact ga t t t caacat t at ggaaggt g aaat t caccg caaact at ac t gcagacat a tgggaaaagg t gt ct ccat t ccaagaagaa act t t gccat gt gt t agaca t cacggct ac tcggaaaccc caaagcaagt t acct t at ga gaat gt gt ga aacgt t gt ga gt gt at at ca gt t gt gacat cacgagt aat t at gt ccat t gt t t aacaaa gt t aagact t gaat acct t g at aat gat at tccaaaaaca caggat t ccc cgt t aat gt t caccggt cct gagcagcaaa gagt at gcat aaaaact t gg gaaaagcacc agagt at gat ct t t gat gca gt t gat t t gg aggt caacct cct t ct t gt t cgt t aacaga ct ggact agg t ct t aaat gc t at t ggt gt t cat cgagaag t aggagt aac t t ccggt gaa agat gct t gc t cacat t aca cgat t t t t gt at at t cgaat t t t at t caat t aact agcaa t t gt gaggat gt aagaat t t ct t ct t aaca ggt t acact a agt aagt aat t gggat t t t t cgt ggt at t t tggggaaaca t t cct t ct t C aat ccacact aggact t aca t ggt at gat a ccagagagcg gt gaacggag acacaaat ag at ct t t gagg gacact gt t t aagaaagat c t caggcact a t gt ggaaact gt aaaat at t caat aaaaaa caagagcacg cgt gcaagt c t at ggt t cat cgt t ct gt t t at gat ccaga gt t t ccat t t act cgt gt gc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 gt aacaaaat gacat gcaag t cacgt gcca acaat cgt at ct aaaaagcc Page 248 aat gaggt cc aaat cccgt t aaaat ct t ca at cct aaacg accgaaagat t ct cat t ggg cacacat cct agt aggagct 12689250 Sequence Listing.txt gtggctgagg aaaacaggag ataactgaca ccgcccat t c tggttttatc tctttggccc aaatttcggt gtaaccaaaa tgaccatttt cttgttcatc gctctctttc ttcttcacaa at g 1860 1920 1980 2003 <210> <211> 0 <212> r1 <213> 228 2003 DNA Arabi dopsi s t hal i ana <400> 228 caaggat cca at cgt ccat c att gaaaagc aagt t at at a t gatt gcgt t ctt aaaat gt ggagt ggt ga cggagaagat aat gt ct agg att cagccat acgaggtt cc act ccgct gt act t t t ggaa agcaacggga at gt cgt ggc ctcacgggag cgcaagat ca at aagacagt at t at t gt t t acgcat attt at at acat aa cgaat gacac at gt cct gga t t t at ggt aa tttctttcag gaaact t t ga aaaaaact aa aact at agat cgat at at ag at at ccaaca at t t ct t acc t t t t cggct a t ggt t t cagg t gt t ggt t ac t ggcgat aaa cggagcaccg agggct t caa cat t gt cct c atgt t t t t gg agt acaagaa cgat t t t ggt acgagaat cc t aaaggt t ct ggcat t agcc t caat gt t t t ttcaat t t t c t t gt gaaat a gaaaat cgaa t cgat aaggt cgagaaagaa t cct at t caa aaat gcacca aggctt ggga at t ggcct aa t t t at agt t a aaat gt at gt at t aaaaaac t at agat t at gaggt gact a tt at caaaag gcacagt at g gcgcacacag gt gt t caagg att ggggat c ggt t t ggttt cgt gct acac agagcct aag t cccaagt t t t cgt gatt ga aat agct t aa gt t t t t at at t t t t ggt t t a aacct aat t g t t t gaagt ac t cccagt ct a aggaagt gt a ct gat t acac ct at agt gt t cgat cat ct g caat cct t ca t aaacct t aa aat t aaat ag att cat caag t at gt agct g aggagt acac ggt t agggt t t gat gaggat at t t t t gtgg at gaccat t g agat cat ggt t t aaaat gt g agat caat aa cgt ggt t t gg gagt ct t t aa t gct aaacct acaaaaaaaa ttttttcaga at t aat gat a ct at gt t ct t gagacat gat gct acgt aca t ccaat cacg at gt cacat g aacaat gagc at t cacaat a gatt acaggt cat t gt at aa t at acaagt t t t t t t acat t aaaat t aat a aaggaatgt t acggcgtgag caat t act at act cgcact t gtt cgacgt t ccgt t t gt gt t gaat t gcat tggatgcgaa tcgtgggacc ct t t t gaaga cagcaaaat c gt act agagt at aaaat at a aaat gat t gc aatt gt caag tt gat ct gt g ct agt t t ggt at gaat cgt t t gcgccaat t cggcgt gat a t ct ggccacc at t gaacat c gagct cat gt acat gaaact ct t cat gat a gt at t at t t g acaaat ct aa gcctt gcaat ccgccat cgg ctt gt ct cca gagt at at ca ttggt t t t t a gcagaggat g aaagacaaga tttgcctgag ct at gt at ac at gt gt t gt g acaacct ct t t aat ggtt cg aact ct at aa aat t t at ct t t at at t gt ac cagggatagc t t ct aat ct a ggat gct gt t ccat gt t t t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 249 t t t at t t t cg t gt ct ct ggg cccgact cga accacact ac ct gcgt caat ct cgcact ct tctgat t t t c aagcct ct gg aaaggt gt at cct act t t aa cccggctcgg t ct ct ct at c ggcgact caa cact at ct gt at t t cat t t t t cat cgaaga 12689250 Sequence ccattattgt aattttggac atcaaatcgg acttttctca accggataat tacatttgt a tctctttctc tctatctctc ggt caggt t a t cact t gcaa agacat gt t a t t gaaaaacc gacgccgatt cacatagctg tgt Li st i ng. txt caaagcat at ct cgat t at t gct at gagaa t ct at ct ct c aggt acct t t ct at ct ccga cggt ggct t a gacat t gat g aacgacccga gaaaaagaag t cat t t ct t c at cct cgat c t t at t agt t t cgagccgaac 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 229 2004 DNA Arabidopsis thal i ana <400> 229 t t at t t agt g ccaaat t t t a t caat at gt t at ct aacaag t agt acgt ac caaat gaaat at t gct acaa t ccacgt aat t t act cgat g ccgctggccg t agt t gt gac t gt aaat gag t gt aaagt ca cgct ct t at t accact cct a t agt aat ct c aact t aat cg aat t caaaag gaagt t gt aa ttaaaaaaaa t t aat t t caa cgagt aagaa caagaat ct c acaaaaat ga t t t ct t aat t ct caacaact gagt aat at g gcaacat aac aagt t t gagt cat t t t agac at ggacaaaa agat t t t ggt t gtt gaacca aact t t ggt c aat at t gagc cact t cct ca gat t t at ggt aaacct ct at tttacagaca t at t gt at cg cct aagat ca agagt t t t at t aat t t t t gt t gt gt caat c t t t at caaca gat act t at a t gt t t at aaa t t t aaat t t t gaccct t aaa gt at at gagt at caaaaaat gagcaat ct t caact at t t t aat t ct t t t t agt t t t at t t t t aat ggat t gt gagaaaat acaccat aaa t gacat gaaa t t gaat acac at caaat agt t t agagct gt at gaat caca caact cgact agt gt t gaga agaaaat gt c at t at agagg gagcat cgaa aacgaaat aa caagt agat a at ct acacca t t ggt t ct gt t at at cgat a t at t at t gcc t aaaagt ct t agcaact t aa aagt t aagt g ggt t caat t c aagt cgct ct aat aagagt t tgat t t gttg gt t t caagt a t t t aacaat a gt t t t cgaat t aacat t at c cct t t cat gc aat t t at gt t aaacaaat aa aaaat agct t t aaact gcaa t agct t ct aa acact ccgaa cacgat at gt t agt aaacaa gaat cct t t t t t agtcgggg gat t t t cagt t t t gagat t t ggct t gat aa gtggtagagc gt t t t acat c gt t gagagat aaaaaat aaa t gt ct t ct t c tctct t t at t act gacaaaa acat aagat g t t t cct t aac caaaat aggg at cat agaag cct t gct caa act gt ct ggc ccaat gaaac aat gt caat a t at cat gt ca at t ct t t gac agccaaat t a acat aaact a t t at at gt t t aaagcat ggt aat t gaat t g t acgagaaat t cagaagact caagt cat t c ttcaaaccag at cgat t aag agaaat aagt t cagt t t aga at t caaaggg aat t at gat c aat caagt gt cat gacaat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 gaatacaaca ttatagtaaa attttgcttt ctacggtatt ttgtgttgat ctgcaacaaa Page 250 acaaaat ttt t at t ct t acc tggctaggag t t t t gt aat c gaaaagaaag gagcat t cga t ctt aaaaaa aaat t acaaa agagagagtt ct ct t t ct ct t t t ctct t aa ttttttttca ct t at t at t a aacagaaaaa t ct aaaat cg agcaaaat ga gagcgcacaa tt t caaaat a aaaaaaagaa t cct at ct t c ct t t ct caac at ccaccat c 12689250 Sequence aaat aaccga t t t t ct aaaa ttattctttt gctaatcttg aggt t at t ga ccaccaattt gt t gat caaa acaaataaac taagaaaccg agttggtcgg aaaaaaaaac tattacaacc tcacaacaaa aaaaacaatt aaaaaaagaa aagaagaaga tccattcctc ccaccatct c aatctctatt agatctttct at ga Li st i ng. txt t agt t t aaat gaat t gcat g agct t gacaa agaagaagaa tcttgtgggg aaaat at t ac t at t t at t aa ggaagagctt cct cat ctt c ccat t accat gagt t t attt act aat at t t taatggt t t g aaaacgaaaa cgggcgtgaa aaat t t aaat aaacat t t t g t t t gtgt t ga at ct t cct ct t acct ct ggc 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 230 <211> 1542 <212> DNA <213> Arabidopsis thaliana <400> 230 aagcttatgt caaaaatatt taattaaaat agtatcaagt aaaaacccta atccgttatt acgaaaaaaa aaagaacatc tagaattttc aatttggatg tatgaaatat tttgtcgtcc cat gaaggct agacatccaa tgtctaaaaa ttctttatca tgtctcactt ctgtttctat ctctttacta taaaat t att atataaacat acgtatagga atttttggag cctcaagat t agctaaat t a aacgcctaaa tcat t accat tcagacacaa ggaccgacca attcgaaaac gcttgacaac aaactcggtt ttggctggtt ataatttctt tcgtcgtcgt tttatttgaa ttggttgcaa aagcccaaca ggcctgtgga aattgggctg acacatgacg agaatgtggc ccaaaaatgt tgcagaagtg atatagtatt tttttaacgc tcactggatt tataagtaga tcatcgtgaa acgt t cgaag gccattt tct ttagatctcg tgccgtcgtg cgacgttgtt ctctgtctct ctcgattcac tgctacttct at at gt aat t aaaat at caa gat at t t gat act t at acaa t act at at at ccctcatttt cat t t t gat t gt aat aat gt t aaat aaat g aat gaat gga aacct agact taggtgcgt c gat agct t t t t at agaaat t t at t t aat t a gat t t t t t gt ttggacgacc ttccggtacg gt t t ggattc t at at gtt ga t gat t at aac cct caagtt a t aaagt at ga aat gct t t t g aaatagccaa gaact acct a ct cat agt t t aact t t t gt a t at gatt cat cggttt attt aaaaataaaa t agat t gat t gt t agtgaga aaaacatat t gtctcacaaa atcggcgtta t t t at t cct g ctttcgcgcg ttgagt t at g gt at t t at aa aactt ggaaa aacat ggat g gtagggtct t t at aat t t ca aaaggaagaa gact t gcaaa cgcaat t gat cctt at gaaa aaaccagaca gct gaaatt c aaatgggccg gggtccgggt at t cgacgt a aacaaaaaaa aggagagagc tt gatt cct t at ct ct ggat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 251 ccgt gcgt t a at gagt ggat gt gggaat t c t caggt ct aa t t t gaggat a agat gat t at t gagagt t t t t t cat t ggct gcgt t t t ct t aaagt gat ag t ggt t gat ga t gaat gagat ccaat t gcgt actctgt t t t 12689250 Sequence cgtcgttttc agatctgttg gtgattcgct tgtttgtaat tagttgatat tttttccaga t t ct gcggaa t t at agat ct tcgtaggtcc acaaaggtct ttcgtagtta tttttatgga gtgaacaggc ttgatcaaaa Li st i ng. txt cgt t t ct t ct gct ggat ct g t caggcat gt aagat ct t ga t gt t at ct ct t t caaggaat tg gt t t t ctgt t t at ct gcgt c t ct cgt at aa t t gat t t aga gct gct agat t gcgt gt aat 1200 1260 1320 1380 1440 1500 1542 <210> <211> <212> <213> 231 2003 DNA Arabidopsis thal i ana <400> 231 cgccggt caa cgaggagagt cagacacggt t ct ct t ct ct accct aat t g ggagtggaaa t gat t aaact tttgacagaa t t t gagagt a gt aggt ggt t t gcggggt t g gagat t gacg aagggat t ct agt t agt aaa gagt gt t gag t gt gct at at t accat gt ac aat act ggac ct caacagaa aaagt aat gg cggt aagt ca ct t agt gcat t ct aat agaa cct ccgt acg t aagat ggt g tttacaacac caccaccatt gaaact ct t g t t t agggt ct t at gct at ac gcaat t cgt a t t at t act t c aaagt at at g aaat cgaaag cct gaggt ca aat at at ct t t gaat gt ct c t cat gaact a tagaaaccgg ggt t at cact aaaat aaaag gat gat caag t ct gaagccg t cat at gt gc gt cagt cat a acaagt gt t a aat ct ct ccg gct aagcaga gct gt ct t ac agcgat gt t g t gt aat gt at t t cgaaat at t gt ggct t gc ttatgtgttt agagt aat gt t at ccgt gt t ct aagt acgt ggt t t at t ga t cact t aaag aat gat gcgg gt t gat ct t g aact t ggt gg at gagcagaa ctgagaaagg at t gggaagt tttaagcaac t caact cat c tgaacagaaa at aaat caac tgcgcaggac t aat gagaga ct gaagct gc aagt ct ct aa t cat gt gggt t t gat t ccag acggat t gaa t at gat t t gt t agaagct ga t ggagat gat t aggagt gag ggatgaagcc t aaact agcg t caaaat ct c t t gat t gt ct at gct gat at act gaat t gg aagcggtgaa ggat gacccc gagt gcat at t at gagct t g gcat t ggat a gcat gact ct cgt acggt gt act ct ccgac act t ggagct t gat t t gcag ct t gct ct at aat t gct t aa t cct gat t ct gtgctggacc aaat ggt t t c agagggaaag ct t gggaagc atggaaagag t t t aat gaaa t at ct t gcat t ct ct gaaaa agt cagt caa agact gt t t c gggaaaacag gacgaagaca t act ct gat a gt t t aggaga at at at gct g gaagagaat c at ggcgaacc at gct t ct aa gaccgct acc gtgatgagaa t t at gcagct ct t gat ct t t t t t at cagt t t agccgct gg t t t t ggggt t acgt t gcaat gaat gaagt t gaagcagggt t t t agt ct t c t cat agt cat t cacct ct t c gcaat t gt aa aggt aat t gc aaccat ct ga t aat ct at gt gaacgct ccg gat cat t cat ct gaaaagca at t t t ccat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 ttttgtgtga agttgttgga caagactaag gcggaatgga gaaccggaga agatgaatac Page 252 cgat ggt agt t ccaacaaat ct gcct cgt t aat gact caa t gt gt catt g aacct aat cc at ct aaaat a gt t ccct t cc t at at ctt ca ct cagat aca t act cat t at t gt t at t t t g gt t t caat gt gt agaaat t a ggcccgt at c at t gt t t at g ggcccaagcc act cat ct ga t ct ct ct ct c aaact ccgac 12689250 Sequence gaagcct gaa agct t ct t t g gat aat at ca gt gt aagggt tttgtccttg tttggtgtta accgattcgg tttcatatat tcatgtatgc agtaacattt ttacttatac tacatcggt a aaaatgacgt ggtcgtcctt aaacaaaaac aacaaacaaa t ct cgccaga t t t t cat cga at g Li st i ng. t xt gt ccat t act t t ggt cat ga cagt t aat gt gcct agat ga taggcccaaa t ct t agaaat gtt acagaac at aaat aaat ct cgat t t ca cggggatct c gct caagt t t cagct ct cac gt t gt gt act caaaaat cag t gggct t t t a aagcagcgt t tcaacagcga gat t t aagat 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> <211> <212> <213> 232 2010 DNA Arabidopsis thal i ana <400> 232 aacacaat gg accaaaacga agct at aaag agt gt at aac aat t gggt at cat caat t aa agct t at cag t at t t gat gg acat t t aaag at gt at cgaa acat t t t agt at t t tggagc t aat gt acgc t acaaat act t at t t agt t a at t aaagt at gt t t t gtat g t gat t t gt aa aat t t at ct t caggtagcgg t act t gccaa aact act t gt at aat ct t t t t aaaaggttt aacagaagag gcaat t gagc ggt cagt t t t agt t t aat t t aaaacct caa ct t t aact t t t t gaaat cca ct cacgt aca at t at agat t aaact t t t ca act t at acgg aat acaat ca t t aaagt cgt gct ct cacgt t act ct aagc t t gt caaat t at gat t t t gt t at t t t aaaa acgt at t ct t at aacaaaag taaggaggga agacacacac gt aat ct cat ccgct accat agat at t gaa at gat gaaag aat ct agact caaat cagga at t t cgt t t a t at at aaaca aaaagt agt a t ct at aaaaa cat aagaaat gt t t ct ct at at cagaaaca t t t ccggtag gt cggct gct at cagaggaa cat t gt ggaa aaat aaaaag ct caccaact ccacct t t gt at t cct acct gcat aact aa ct acaaagt g caaggt gat a ct at aat t t g agt cgt t aaa aat agtt aga ct cat t cct t t gagt ct caa t ccat t t at t at t at cacgt gaat ggt t gg gaagct t ct g t gat cat t at ct aaat ccaa at ggatt at a t at gcaaat a acaagagaac ggtt gtt aag t t t atct t ct at aat gacgt aagt at at cc ct caagaagt agaacaaat a t ggcaggt aa agt t t cccac t t t ggt t t t t caactttttt gt caat gaca t gt gt gt gt g aat caagaaa cacacaaaaa ccgcaaaaaa t t t at t t t at t ct ccaacaa t t at t agt gt caaacat gt a t t agaat aaa t cat t t aat c t cct t aaacc t tct at gt t t gt t at at caa t ct at t aaaa gt t t aaaaaa t aat t t cact gat accat gg t t t t t t aat a ct t t t ggt t g acacaacat c gaat t gt t aa agaaaat t gg aaat cat at t t t aaaaat t a t gt t gat at t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 253 12689250 Sequence Listing.txt atttttaaaa attgggttgg tgtctaatgt cccaatgata gcat aat aat gt t t t at t gt t cgcct ct t t t cgaaact ct ct ct aggt t t t caagt t ct t at t t t t t gt a agat gaaat c act cact acg t t agggt t t t acaat t cct g t t at caagaa gaaact gt t t t aacct ct ct <210> 233 t ct t gtggac ct ct at ct ct tctgct t t t g t t t t t ct t cc gat t ct t gat t act t t ct gt at caagacac t t gagct aca t t t ct cct ct t at t gggt t t t t t gact t t a at gt at gat a t agat t ccat t gaaaaaaat ct cgggat t g t gt t t cct ga t t t t aact t t t ct t gat at g gt t t t t t t t c t t t act t ct t caat t t ct t t caat t t cacc t t ct gggt t t at cct ccgaa t at aaaaaaa cagaaatttt t t gt t t at t t ggt ct caaat ttcagccaca gat t ct ct cc at t aat ct t g t ccct t gct t t at gt aat aa t t act ct gca t ccat agat c ct t cagt t ct t aat t t ggt c at ggt t t ggt aagaat t t t g ct t t t t t t gt t ccacact ca ct caggt agt t act gt gt t t t t t ggaaat t t t gaat ct t c tttttttttc gggt at t cct tcacgaaaaa cgat t ct gt t gt gaaat gat agt t gggt t g caat t at t aa ct t caact gt caaacacaat t t t t aaaaat ct t gat t at t ct cct acaag t ct gct t t t g caacct caaa ct acgat t ac cat t t gct t t t gt t gct t t c t acgat gt t t 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2010 <211> <212> <213> 2001 DNA Arabidopsis thal i ana <400> 233 ct gt gaat aa t t t gt gt agt act t accaaa ccgt gggt t g gcggctcgga aat aaaat gg ct aggggt ag t t aggagt gg agcagccgac t gt at aaaac gaacgt gt ga gagctggaaa t gt gt at t ct aggggaagtt at ct t gat gg gt t gt t at ag aat gt gt t gt aacat cat aa t gat at t t t t gt gt cgt caa aaagagagag gt t ccgagca ttttcttcac t aaat aagag agaaaat act cat caggaca t caagt cgag ct gaact aag t t t at t t t t g gt act caat a agt gt acgt t cgt ct gagt t aattagtcta tgatatgatg ttggagctat aaaatgctaa t act t at gt t t gct cagat t ccgt t cccaa ggagtact t t tttggcaaag caggt t ct t t tttcagggcc aagaaact ac gagcct aat t aat t cct ggc t cgcct t gga at t gaat t gc at ggagat at t t gact ct t t gat at t t ct a ccct cat t aa ctggaaggga gcagat at t a t ccaat act a at gct ct cca at aat aagt a agt gt t t t ca gaagt t at gg gcaacaattt at t cagt cac aagcct at t g gct at caaga gt cggct t t t t t act t caaa ct ct t gacct tgt t t at t gc gat ct t t t aa acaagaat gt ct gagt at ag aggt acgt gc agt t ggcgt t cgat t gct gc caat gct t ag tat t t gt t gc t t at t aacaa cagct gat gc t t t act t aat t gat caaat a t gt t t aagat t ct gaaact a gt gcaccgct act t aaat at cgacat gat t gcacct t gct tttcacgagg gt gaaat t t g t aat t t t ct t cacgt t t ct c gcagcat ct a aacagt at t a aggggtaaat gt t ct gat gc ttcaaagagc t acaccat ga caaaaggaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 attgtaattt aagactatgg ttgattacac aggcgtgctg gtggtgaaaa agtgtacaac Page 254 12689250 Sequence Listing.txt gtgtttgata accagcttcc tgcggctctg aagagactcc aatttgacaa gcagctagcg at ggacaaca cct gagcaag gaagcat ct g acct aacaca aggt t cat gc ct t t t t t ct t acaat gt t at ggaggtgaca aacact gcag t ccccaggat t t cct at ct c t at at ct cac t ggt ct gt gc aagcgaagcg ct ct ct gcct t at ggt gat a t ccggaagct gt t accgt cg t t gacaccgt aaacaat aag t at ct t aaag t t agt aat at agacggt t t g aatgcggcga ct ggt t gaca gt t gagaagg agacgaat cg aaat t t t aga tggcctgcgg cagt ct cct c t t t t t ct t aa t ccaacagat ggt cact gag t ct cat t gag at gaact ct t ct t ct ct ct c gat ct ggt t c at at at at at aacat gt gca t agagt cgct t ggagt gcag gt ggt aaccc gt aaggt t aa cat t t ggcaa aat t caat cc gaccat t t ct cct aaagaag gct gat ggt t t ct t ct at t g t act t t cat c cgtttttttt acaagt ct gt at at at at ct ggaact aaaa ggat aaaat g t t acct cact cacacact cc cccat at aaa caggat ccaa ccaagt ccat t t gcggagct t gt cat t cgg accagcct ca t ct ccat cag t ct aaaaaga t t t t t ct at g gaat gaaact ccaacaact a caat acccag cgggaaggaa gt t gat t t ct at t t t cgacc t aaat aat t a tgt t t t gtct cgt at act gc cggt accat g gt aaacact g ct t gat t gct aggccct gct t gt at caaaa gt t aaat at t gt ggt acgt t gt act t t t gt ct ct gagagt gt aagaaagc t caggaaact gct acaacga t ct ggt accg t acgt gaaca caagt ccgag gat gt aagt g agt caact aa 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2001 <210> <211> <212> <213> 234 2003 DNA Arabidopsis thal i ana <400> 234 accaagtt ct t ggt t gt at g t gcggt gat g aaccgt t gaa cccaccgct a tgatgagaag ggcacat ct c t cacct cccg taggaaacac tgt t gccaag gt t t t at gag ct ct gt gt t c aat t t t aagg gt t t t aggt c t at ct at agt aggt acgt ag gacagaggt c t t gaact t aa ct gat caagg tcaaagagca t t cact gat t gt gagcgcgt t at t cat t cc aggt t t aat a t ct t t ccttt aaat gat ct t aagt t ct at a gat gct aaga ct gagaagt a aagt t gaaca cgct t cacat agagtgaaga agt act t ggc act t ggt t gg ggt gggat ga cagct t aaga aaagt ggaac t gt agt t t aa t t cccct agt gt t at gat t t gatggt t t t t caggt cacaa at ggct t gat aat gt t cgca gaagct t gcg cggt gact t c t ccgat t ggg t at t agcagc t gt gt t cat c t aaat gt acc t aagt at ct t t ct act act t gt gt t ggct c gat ct t t t ga ggacct gat c gtggaagcga t cagt cat gg ggt gt t ct t g gt gagct t gg aaagct t aca cgt cct gcgt t t ct t ggt ga t ct t aat gt a t at ggct t t g gaat gat t t a atgt t t gttt cagagt cccg ttttggggaa ccact t acca gat t cccat c at gt ct acga ct gat t t ggc t gat caaaga ggaaggagac t gt ggt t t gt at gt t gccac t gaggct t t c act agct t aa 120 180 240 300 360 420 480 540 600 660 720 780 Page 255 12689250 Sequence Listing.txt tatgaatgaa aggtcggatt agcgcgcgag acgggagacc gaaaat t aat aat ct at ct t aat t gat gat gaact t ccac caaagatt ct caagaaacac t t t t cat gaa t cagacgat a aaaggt t t at t aat gaaagt cgt t gat aaa at cacgt t t t t t t t ccattg t cgt aact at ttacaacaaa t at t aggct t aat t t gat gg ccgccact ag cgact gcgt t t ct t t ccct c aaaaaaaaga at t gat t gca acct t caaac aact at aaaa tggt t t t gt g t ct accagt a gct t t t t gag cact gat t t t t ggt ct t t cg t ggt t t cgat gt aat gt t at at aaat act t cct gat cat t t t t at t aaaa gt ggggaat a at t aaaaat g gacgt t ggaa ct agccgcca t agcct act a t at aaaat cg t at cgact ct t agt t t gcgg t ct aggt t gg accaagacca aaacaaagt a agaagcct ct t t t t t aacat gggccaatga aat t agat ca ct gt cagaac t ct t gt accc t t t t t t t t at t gt gt t gat t t ct t t aat ca aat aaat aac at t t cgt gct cct at accga ccgcaaacgc caat aat gga t t t aggt t gg ct t ct t t t ac t t cgaat at t gagagaggt a cagctt at t t ct ct accaaa t ggat agt at ct aat t t t t g at aaat t aca accaacct aa aaccaaaaca caaaaaagt g aat at act t a at t gt t aaaa gagaaaagt c cacgt at aaa agct aaccag t aggt act ga aacat ggt ct t cggcct t t t ggaagaagt g ccct aagaat at ggat aat a tttgacaaaa aaat t aaagt t t t aaggcat aaaat aagag t t t t aat cgg aaacaaagt g agaat cat t t caacaagt gg aacacacaca aaat at gacg t aat aat at a at ccct t t t t t agt cgat gg agt at gaaat gat aacaaaa gat t cat aac act gat ct ag aaaat ct t ga aagt gcat ca gt gaaagaaa at agacact a ct aat aat t g ggct cat t cg ct aact t agg t aat caagt c t ggcat ct t a t ct ct ct ct t t gt aat t cat t gaaacgt t g ct aat aagt g ct ct ct ct t t 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 cgtcctgcta tagtgagttt agaaagaaac tgatagtaga gagaagaaaa atg <210> 235 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 235 taatcagtac caaattaagc agatcl taaaagtaat cgaaaataga aggaal ttgattgtac tggctcattg tttccl gagatcacga accttagatc gaaaal aggagagtgt tttaagtata acgcgl taactttagg ctagcgttca atacg aattaaaaca agattcacgc ggttt ttgtttaata gaact t aatt aattg! tgttttttct aatgttctgt atgaa tcaaatttcc ttaaaatata aaaat1 taaga agagg :ttga agttt :tttg at gaa ctgcc gagaa at ct a tcatt aaaaatt aca acct t gagga agaaaaagt g atagaagcga cagtt at t t g t t aaaat gaa ttttgcagcg caaagat t at gt gat gct t a t t gt t t t at c Page aat ct aaaaa ggaggaacac t aaactt cgt tagagaagga at t t t t ct t g act accaat a at at t at t t c tgat t ggttg at t t ct t t t t gagtcgattg ccgaaactt c caacgggagg agagaagaag caaggaaaag tttttatcca aaaaaggt t a t gt t t t t t t t t t t at t ctgt gt at ctt act actt aagaca 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt cat t acaaca aaccccactt acat t t t aag tggcctggac t act t cat ct agaagaaaat at gt t cat t t at t acct t ag aaaaat gcac acaat accga taaaacaaaa gt t t cct cct aat t t aact t gccaacagt a t ccagacct a t cgt caccga gt t ct ggt ga act t gat caa gt t gt ct t ct t gat t ggt t t t t at ct t cag aaact t at t g gt t gt gat t g gcagat ct ag t at gat aaat acct act t cc gt t t ccaat c t t cact acaa act agt t t t g at gat cagt c cgat aat cct aat t t ggt t c at ct cagaca aaacgt aagt t cat gagt gg acct t at cct t t t agt gt ga acgcgt ct ac cccact t gt g acct t act t t at gat cct aa ctt gagcaag t ct ct act cg ccat ggt at t t t ct ct cgat t t gct at gt t agt t gt t aga agt t t t gaga agat gt acac aat gt at act t ct t gaaaag cat t t t cct t ccact gct at at t ccgt t t t agt t gt t cga t t cat t gt ca t t acgt aaat accaaaagt a gtt aggccca ct t t ccgacc gt agccaaca acgtggacgg t ct cacct t t at at at t agt t ccgt t t agc ttgcagcaga at caat t t ct agat ct ct gt t t caaact t g t act gat cgt t t t gaat t t g at gg aat agat aaa aaaaaagagg gct t gct gaa cccgccaat t t ct aaaact g gcaacat aac cacct t aat g t t ggt t agct cct agaat ct gat t ct agga t acggt cat a aaat t t aaga ggat t ggaac t t gat t act c aat cct gcct cct t ct t ct t ttttaaaaaa tat t t t cct c at at t t gaag gat ggt t gt t t act t gt cct acagat ct at aat t t gaact gtaaaagtgt ctacat t ct a agat at t cat t cacat t gt g acacat cgt t accgt aagt g aat gaagcag at ct t ggt gt t t t caaact t act t t t ggt a ttaggcccaa aat gggct t a t at t t t at ag gaaaaagaaa agaaccat gt cgacgt t acg ct cat ct t ct at aagaagac ggccaaggt a t t cgt ggat t gat ct at ct t act t caagt g t gat ct aagg gagct t gt t t gt acat at at aaagt agcaa ct ct t t gagc gt t gat gcaa aaaat t at gt cct t gt at ag gat gt cct at ct t acacgat aat gat gat g t at t ggat t a aaagaat aat gt caat aaaa t agct at ggg aaccccagct cgcaacgt ag at at t t at ca aagat ct gat t t gaaggat t aaaact t gat at ct aat cgg at t aagct at gt t t gt gt gt 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 236 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 236 tcccatggtc agaaaactca ttcgataacc ctcgt t cgat ctacaagttt tgtggtgagg gtacttcagg taaattcttt tcaatactta ataaggacga ggcctccata gcgt t cagt g cttggatttc aagaggtatg acagtttgtg atttcaacaa tttatttatt tttagaacca ct acat t t ga t gat caaagt agact t ct ca gt t t t t ct t c gtggaatttt tcgctaactg ctttaacaat gccgtatcca ttgtgatagg agccttcatt aaagattttt caaaaatttt Page 257 ggcacggaag t agt gt ggt t t t t t gtgtaa ct ct gct aat aaat t aaat a gat t at caaa 120 180 240 300 360 12689250 Sequence Listing.txt aaaaaatcca acagattttc tgttaaaatg aaaatatata ttagattgtt at cat ct gcc t gct agt gca t t at t t t aaa t t aaat t t aa t aaagaaaat at t ggat at a at at aggt ac t aat t t t gat ct aaaact at caaat cct t c gt t gat t t t t aat t aaat aa aaat at acac ggcccat aat gt t gggccca acacacgt gt ccat t ggat c ct cat t gt at aat cagat ct gt t cgt t t cg gat ct cgat c ccgat t t gt g t t t gt ct ag t cgcgt t gaa t ct agat gat aat cggt gaa t ct agat t gt ggaat t t gaa t at gt t t t t g accaaaaaaa aaaaaaaaga t t agct aat g t acat cat t c t gaat t at t a gat t t t aggt gagat t t aga aaaat t aat a aaat t t t t aa cat aaaat t t aat gt aaaaa accgt cct gt gt ggcccat t cagat t caat t ct t aact ct ct t cat cagt t ct ct t caat tagt t t cttc t cat cgat t c t gt aggat gt at t cgt t at c gaat t t t ct c t aggt t at t g gaaat gt at c t gaat cgaag t ct ggacat g ct aaggt at g at t t aaaaag t t t t ccgct a t gct acat ga aat t t at t t t aagaacat at aagt t gat gt t t t ctgtat t agaat caaat t t t aat aaca at aaat t t t a gt ct gt t t aa acat cat at a t at cagat t t gcat cagt ca acgcct cgaa ct ct t ct t at cgaaaaaaaa t t ct acgt gt t ct gt t ct t a t aaaat t t ag cgt gaagaac t agat t cgt c t t t cgact ca aat gt gt gt a at gt ct t at t t t ca aat t t aaat t aaact t agcg cgggaat t t g gatgtttttt t ct aat ct ag tttttggtaa gat t t act gt t t t aact aaa ccact aact a ct caat t t ca cat aaat t at gat caat t at gt t at ct caa caaat aacag t at ct t cagc aacagttttt t ccatt t t t t gaaaggt aaa agat ct gat c t cact gat t c gt t t cggt t t at agacgagt acct at gaag t t t gt t t at g t gt t t t gggt aaaaaacatt ggat at at ag ggat at aaaa aacct gaaaa t t gct aaagt aat t t gct t t gat at aagt g aaagt ct t t t gaagt ct t t t t t t t cagt aa t t t aaaaat t t agat t gaat agat t t aact gt t gt aat ac at ct caacac at ccaacact at t t at t t at caaaccact t t ct ct ct ct c t t t gat t gt a agt gt gt t t g t gt t t ct gct at gt agat ct aagat t cat t cct at t t t ct tctgat t t t g tttaaaaaaa t t agct aat g at caat t t at t aaat at at a t gct gat t t t at gagt t t aa cct aggacaa agt t t t t at a caaat t ct at caaat cct t t cagt aaaaag t aaaat ccat acaccgct ct t aaat ct aaa t gt aat accc t agcat ggct t gt caacct t cat t ccat t t gcaaaat t cg t ct aat cat c t gt t t ct gga at at ct aaat t t t gaacgat t act t cggat gt gt t ct t aa ct at gt t ct t t aggat t t gc 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 237 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 237 gttagtgtag gaagtttgtc cggatatgca attttttttt tgtaacgtgt atcatctcgt taagaactca tatgtattat atgaacaaaa tcaatcatgt atattaaaaa tggtttgccc taaagatatt ctatgttttt ttcgtgaata ggtgttaagg atattaaaca aaaataccct Page 258 120 180 12689250 Sequence Listing.txt tatttattct tacgactatc ttgtgtctat agcactctag cttctacgtt ggtatcattt at at t t t aaa gt t t aat aaa at ggt t gacg t t cgt aacgt gt gt at t aaa t t t t t t t at a ct cagt t t ga tagt t gt t t a t t gt aaagt a aaaat gt t at agaaaat gat ct t gcaacca t cgt aaaagg cat cat aat g tttgcaacag t t gact cat a acaaat gat c aaaaat acat t t aat t t gt a at act gt t t t aat cgt t t t a gaat cat t ca ct t agt at ac gt t ggct cat caggct t t t a ct caaaaaaa aaat aaat aa agcaagctgg ct t cacact t gagat t at t c aagat t t t gt act aaacat a aaaaagagt g gt at ct cgt t at t ggt t gcc gat t t gat t c t acat acact aaaggat t gt aacaaagcca aaagaggtt a at acat gct g ct t ct at act at cggat at g t at ggagat g aaagt cagaa t at aaagt ag at gacat t ct t ccacacat a acggt aaaca t ggt caaat a t t t t t t act t aaaaaaaaca at t t gcaaac t agaaagcat t gaat caat c aaacaacaac act aaat cat tccccaacag at cacgccac at t ccct cca gat at t at t a t t aat aat t t at t agt aat g at aagaact c ct aagaaaat t ct t ct t ggg at at t t t t t a gt aaacgagt t gcat cct ct t ct t t at caa cat t t t t t t g aat t gt t t t a t t aaccgt ct t at caagt at at at ccat ac agt t aat t ag t t t t t ct t aa gt t agaact t aaaacgat aa t at at gt cac t gt ct t gaat gat ct cct ag cat aact t ca at caaat t aa at cat cat t c gt gaaaacct agt at t acaa acat gcgagt ct caccacct at gg t t t act ggt a aaaat t at t t gaagt t t gt c at at gt at t a t caat gt t t c at t t ccacat t at at t gt t c t ggat agcca aat t caacaa t t act t gt aa gt t t ggt agt gat aagat ag ttttccaaac ct ct t acgac t at aact at g gt gt at agcc tttcccaaaa agaagaggt c aggat aat ag gacagt agat ct t gt ct act t gat at gaat t aat t t t gaa t t gtaaggac acat cct cgt aaaat t agat aacaaaat ga t cgt ct cct a ct ct cct ct c t aaat t t t aa t act t t t gat t ggat at gca t at gagccaa t t t cgt gaaa t aaaat t at t aacgt at agt aaact at gt t t t t ggt ccca acaggacatt t t t acat gct t acct t ct t g gt t at at act cat t ggt ct t ct at t t gaaa aaat t at gga gaat t gt aat caat caat ag t acaaact aa gaaaat accg ct t caat acc caaagt t t at t aaacgat aa acaaat at ct at aat t gt ct gagt t acgaa aaacgaaaaa t aaaact t t g t cct ct ct ct aat t t ct cac t t t t at t gat ttttttttt c at caaat cgt gaaagt ct at at ct t agagg gact gaat gg aact t gat cc ttaccccaag at ct t t at aa gcat t t t gt a t t t gat caca ct t caacact t gcggcct aa at t at t cat t t gaaaat t ag atcagaaggg gcggt aaaaa t t ggcaat t a t t t t t ggtt g gt t t t t cgt t caat aaaaat t gt aagacag at aat t aat g aat t t cgt ac aat t aggcaa at gaagaaaa t t aat ct ct c cccccct gga 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 238 2004 DNA Arabidopsis thal i ana Page 259 12689250 Sequence Listing.txt <400> 238 cat t t t t ct a gat t t t ct ga acaggagaca acaat gagga at at t cat t a t ct t t t at t t at t t gt caaa cact ct aacc t aaaat aaat ct cagt ggt t t act t t ggt t at caacaat t at ggt t gcac t t aat t gact agaaagagaa t ct t t t aacc cacacaat aa at t gccaaac gacgt gagt c t aggat acat acat at t ct c aat at ccct c aagct t t at g gat gcaacca caagaaaaat t gt t at t gct ct gt ct caat t gt t at gat a t aaacaacat aaaaaaat t a cagct gt aat t ct t ct ct t c ccccctt ct t t at t ct at ag aat aat t tag aaaat agcaa t gaaaat agc gt cat t t cca gt ggt t cct c t gt gt at acc acaaccact a t at t t t acct ccgttttttt cat at t ct t t at at cat t t a at at t gat t a cccat cat t a ggaaagcgt g cat acact at t at ggagt ac aaaat gt aag aaagcaagtt acaaact t ga t t cat at t t c t agacct ct g aaaacat t t g at t at at agc t at t aaagt a caaacacagt t t t aat at t t aaat acggaa at t t ct aaca aggat ggt aa acat ct t gca t t ct t ct cat ct ct ct t cct t t t t at gat a accact t at t aaaat t agaa t t t ggt gct t at aaat at ca agat t aat t aat caaat t a at gcgt gt gc t at t ggt t ct t aaagt aaaa t t t t ct t t gg at aggat tag gt gat t aat a at t agt gacc t gt gt ggt t a act cat act t aat at t at t t ct t aaaaact gt t t t t at t c aaagt t at aa aaccaaatt c gt t t ct t ct t aat t t cgaga aaacgat at t t at cct agca at at aat t t t t agt cggt ca t t cct cccct aat aat at gc acat aat ct g t gcaat at ct t t at t t at at cacat t cct c t t gt t t ggca t gt t at t aca at at caaaca gaaaacaat a gt agt t gccc ct agct t aag aat t t ct gaa at gcacacaa t aagat t aaa cat acaaat a gt t cgt at t c aaagaaaaca tagt t t t gt a aaccaaaaga ct t gct t gca ttct t cgttg t aat cacat c t t t aat gt t a caaaaccat g aaaaat t caa gagaat t t ga t cct t cct at t t cgt gt aaa at t aat aaag t t t act aat a agagat t t ct t cagt at cgt aat gacaat c t cat gcat gc ct t accaaat t gat at t agc gt t aaact ct accaaaccct at gat gt gaa act t t t t t ga ttact t gttc cct t t gagt t t cat agacaa t act gt ggac t t t gat caac t at at t aat a ct gaat at ct t t acgt at at t t t ct cat ac aat aaagt ag gt at t t t gca aat t gacct t act t t gct aa cat ct at t ct gat cat agat aaat gt ccaa gt ccagt t ac gaaaaaaaga agt cgaacaa aggt ct cgcc t acat aaat a at t at t aaat gaagagt ct a t t ggt t t acg ccaaacaat c aaat t cgaat t aat at t t ac gat ggagat a ct cct t t t gt ct cccact at ct ccaaaaca aact cagt at t t ggcat gaa ct t t at at ga t ggaccact c acat t aaat t agct gt t t t a aaaagaaaac t at gat t ct a acat gt at at aat t cagct t aaaagt t t ac acct at aggt at gaaacat c t t gat t t aga cgcaat caaa cat t t cat t t t cat gat aat agaaaagct c gaccccat aa t at t cacaag t t ccacacgt gt agt gat ct aat aacaaaa t at aacacac at t t t aat t t t caaaaaagt at t t t act t t tcaaaagagc t gt aacat t t ct caagagt c t cacccact c at at at ct ct cacccacacg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 260 12689250 Sequence Listing.txt tacgcacaca cacaaagaca atgt 2004 <210> <211> <212> <213> 239 2004 DNA Arabidopsis thal i ana <400> 239 aaggcccaat at aaagaaac gt agggt cgt gt t t gt act t cagagcgtgt at t t gcacat ct t t t gt cca t ct gt cct aa cagt ggaaaa aat cgaaacg ggaaaat at t caat aacagc gcccagct gt t gcaat gt t a agt ct gaccg gcct gat gat gacgat aagc agaacaagat at aacat gat t t ct cgact t gat t gt t acg cgcaaaacca cccgt ccat g cct caaaat a aat t t acct g tgccgacgga ccat gat t t c t gct gagaat aat act gt at at t gggt cat t ggcagt aac cct gt aaaga t t ggcaacat gagt aaaaca ttaaacaagg cat ggt t t aa ct agat cagt cct cat cgat caaaat aaat cat aaat t ca cat gt gt t t g aacggaggaa at ccct gt ca t t ct t cagt a t t cgat ct t c t at gaaacaa aagcatggt t cagat gctt t caaagaaaca t gt acaat aa t cct at acat aacgcccgat aaaacggat c gaagcaaacg gt aagt gt t g t t gat ct ccc t caaggat ct aaaagt t t cc t agt gagagt aaat gt agt a t at gcagt t g act caat acc aaaaagacac aggact acca caagaaacaa tttgtttccc ccact t agt a ct cgagt t t c t aagat t act agagaacgag t ccct gat at t at gat t t cg t t cgaaaat t aaggt acat t cat aaaaact cacat at t at cat t cat caa t cct cat at t acgagagtga t gact aaaac cacgt aaacg t caaaat caa t cgt gcat cg t cgaat t t gt agaaacacca aact cgccgg gat at t t acc ttgt t at t cg ggcaccat ca agcagcaaat accagaaaag ccat t cgt ct aat t t cagt c gtagaggaag acaagcacga gt gt t t agga acaacgcaca gaaaaccaca t gaaggt aca agct cggaat gct ct cgat g gaaagaggct gt ct t t gat a act t agaaaa t aggagct t a t t gagaact a agt gt t act g gcgat t acga acgat ct aca agt aagaact t caaaccct g t cgagat aat cgt acat gaa gct t at act t acggat ct ga ggagagagat t aaat t gcag agt gt caaag gat t t t ccat t aagacgt gg t gt caacat a t caat t gct a agat ggt aga ct at gacat c aggact gcct at ggat caac t ccccaagat cat act t gca t agact cct g t t aat cat ca caagat ct t a gaaaat caat ggaagaacat aacgt aat t c at t cgat t ag cgat ct t ct a t gccaaaat c aat cagat ca t cgaact gaa ggat t t t ccc acct agat t t t cgagt aat g agcgagtgcc gaaacgt cga cccgt cgct c t t t at t cct c cgt gt t gact t t at t at ct t tttaacaaca ct t acgaaat t t gaacagt a t cat t at t ac act accacgt gcct at at gt at aaagaaga ct cagat at g acat cat aca aat cgct ct t aaaaat gat c ccacaact ga accaat t gt g t ct gt aaaat t cat aat cac at aaacaat g t t ggact gaa t cat t cagct cacat aat t a aaacacgat t gact at aat c gaacaat gga at act ggt t t gagaccggag gat ct cgct t t ggaat t gca gagt gaccca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 ttatgttaaa ttagaaacta tttgagataa gttacaaaat at ataaaaac Page 261 ttagaagct g tttgggctag ccct t t t ggc gctcgaagaa at t aaggaat cctt at agt a t act t gact c agagacgaaa 12689250 Sequence Listing.txt ttactacatt atgggcagca tttataagcc cacgtaagga gggggcacgt gtcatctcgt atgacctgaa ctatttttgc ttctgtgtct ctctcatcaa atcaaatccc taattctcga at gg 1860 1920 1980 2004 <210> <211> 0' <212> r1 <213> 240 2004 DNA Arabi dopsi s t hal i ana <400> 240 ct ggat t t gt gaaaaagcca acat gagct a caacat gt t c gt aagt ggag agct caggt a gcat acct ca at t t ct ccac gaagcat t t c t cct t at cag ggaacaacat aat gcat t at ccat t ct t t t t t ct cagt ct at t t ct t t ca ccaaat t gat t t ct ct t caa t ccacaat cc at at cagaga t cat ct ccaa ccaacct t at ccaggacact t caat agaat gact t t act t ggcaaagaaa cccgacccaa cact t t gaga t t t gagt gac agt aat ct t a aact aaaaac ggt t accgag ggagt ct agc t cgact ccat t gtt acgaaa cct t at caat t gt gt gcagt caaacaacgc cct t aact ct cgat ct t at g cccaaagt at t at cagagat cact t t ct gt gaaacgtgga t ct gaccat c t agat acgac gt cgat cgcc t ct ct t caag ccacat t cat ccgaat ggat gct t ct t t gg cgat ct t t ag t act t t cacc t gaccatt ag acacacaaac act t ggagt g aggcaacacc gat act ct t g at acaagt ca t t ggt t cct a gt cagaaact cggt t t ct ct t act ggt ct c agt at cat ca cact ct t t cc gat at cagaa t t gaacat ca agaaccgacg agaaat ct ca accat ct ccg agaat ct t gt gaacccat ca t t ct gt agt a aat cgt ggaa cgccacggt t gat t t ct cca gct t at cat c cgagt ct t ct ggt at caaat agaaggaaat aaaaat gcag acat at t t ct acaaaat gca t gatt ctt ca ct ccacagt t gagt acat t a tt ggtt gacg act ccaact t t t agaat gat accaaat caa acct t ct cac gccct agcga gaat ct t gt c caat cat cct gagact t cct at t t cagcga ccct t ct t ct gt ct caacaa gaaat ct cag ccat t t t cag t ct t caagag t cct t ct caa t ct t caacag gcaaat t cat ct cgt cgcca caccat aaaa cagat at agc at t agacct a gcagct aaga aacct t cgt a t ct t ct gat t t ccgcaact g cat t cct t t c t ct t at gct t ct t t acct t t t at t cacaga gct gat ct t t gcgt gt cccc cct t ct t ct g ct at ct t gt a t aat ct gt t g acaat at t at ccgagt t caa cat cct t t at agct t t cct t gt acagagac t cgt t t cgga aaacagaagt agt cgt cgt t cccgagaaat t cacagagat ccgt agcaaa t t aaggt t at aaagt t t cct t aaaagt aaa ct cagt cacc cgcaat gt gg ct t cccat t c aacaat gct a t ct t ggt gca ct t gt agaaa t ct t ccaaca agacccatt t at cat t ct t c agagt t caaa cacat cat ct cagt t cct t a ct t ct cagcc act t t ct t t c ctt gagcaca aat ct ct t ca t act act act gaaagaaat g aaaaggcgaa ct caaccggc cgaagt t t t c aaaaaagaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 262 gagcgagact gacgat t at a at ggct at t g aaagat acaa ct gcacacaa agagagagag t t aggact t c cgagagacga gt gagact ag t aagct gaaa caccaat t aa accact cgct agcgaagaga aaagt ggggg ccct aat t t g at t aacgaaa 12689250 Sequence agagagagag ct t t at gaag ccggt t cagt t aaaccgt ag agtagagggg t at t at ggt a t ct ct cct ca cgcgcct t ct caaagaagag agagagagag aaaaaaggcg aagaatacgc gaaacctagt cct gt t t cct at gg Li st i ng. t xt cgaaggcaaa aagt t agt at at agcaaaag caaaat t cga agagagagag agat t ccat c gat ct gt t t t agt ct ct cga t aaaccggt g aaagagct cg t ct t ct t t ca agagagagag caat t t aggg ccggat cgga 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 241 2004 DNA Arabi dopsi s t hal i ana <400> 241 caat aaat aa at t aacct ct ct t t at t gt a t at t t t gt t g aaat ct t t t c ttacaaagag ct at gggt t t t cgt agcat c ttgaaaaaca caat cccat c ggcact t aga t t cagt ct t a t at agt agt c at aaaacat g gt t t agcgat t at aat ct aa t aaat at at a at at agt aac t agt aaccaa cacact t cca act acaaaca cggagt aacc gaaat t at t t t t ct at t aat t cggaaat aa t t gt t aat at caaat t gt ca ct ct caaaaa t at acccagt aagaact gga at ct t cgacg cct t ct agaa t gggt gaagg t cgcat gaaa cat at at t at gggt t t cat a t t agaat t ag t acat gat t c aaat t t agct aagat acaaa caaat t t t aa act t t gaagc t cgat aaat t at at cat t at t gt t t at at a t t t gt t gt t c gaaaat gaaa at act ccaaa agat aaat ga agt t gcaaaa t aaat aaaat ct gat agat c t cgt gacact at ct ggt gct t t ggaccgct t caat t agt a gt agt t t t t g cat gagcgga caaat agt gt t caagcct ct t aaat ct aaa aaaacat cca ct t t ct t t ca t t acat acaa t t t at caat a aagt aaacat t t caaact ct t t aat at at t t t t gt t aaca t t gaat t agt act gt aat ct aaaaaaaacg aagt t at gaa t agt aggt gg t agt t aat cg cgacgat t aa cggt ccct t a aact acaat t ct t at gagt g gt aagaggt g t aaaaat gac aaat gagaaa t t t caaaat g t gt gt t acct cgacaaaact t acggt ggca accat at cga aat gaaat t t t t t t at t aat aaaaaaaaga aact at t aaa t aat aagt aa t cagaat aga t aaaaat aaa act ggat aaa accgct t gac at ccact ct t aacccgct ct t aacct ct ca cat cct aaca cacat aat ga t t gt act aga acaaagaaaa t at t aaat ct at aggaact a t aagt t at at t t t gccgt ct gcagt aacac t cat t t gaca t t gct t caag at at t aaact agt t aaaat t ct aat t t t gt aaggct t t ag gaat agt ggc aat t aggt ct aaaaat ct aa cccaaggt gt t gggat caat t t gt gaat t t accacaaagg aaaagaaagc t gat aacaca ct aaagat t a at t t t gaat g agat t aagt t ct at aaaat c t t at t ccct t acaat t caca acaccat t at aaaccaccca t gaact gt ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 acatgtgagt gtgttatcgg atttagctgt tgtcttttct t t t t at acga aacaaaaaat Page 263 12689250 Sequence Listing.txt aaat aaaacc at t cggccca at t gat ct aa t t t t t agct g t aaaacct t t gcccat at t t t aaagct cat t t cgt ct t t c t acct t agcg at cat cact g gt agaagaag t t t at caaat t at t t gagaa t acaat t aat t ccaccagt t at ct aat t t g gagt gt t cac t t ct ct ct t g t ct cat ccct agt ct gt ct a t t t t t gat ct aagaagaaga t t gt acaaac ttcacaaaac t act gt gt t a t ccaat ct t t t aaaaacat t aaaacct aga cagt cct cat t t cct ct t cc gt t t agcgaa ct gt t t ct t t at gt ct ct gggct t ct agat act c at at t at t t a gaaat acgaa t gt t gggct t t act ct t act cgt t ggagct t t t gcgt gaa t ct aggat aa ct ct aat t gt t at t t aggcc caat aacaaa t t act at aaa t act caat ct acaaaaaat a taggcccaaa at aaat t caa t agaagccgt ttcaccagaa t ct ccggaag gt t gt t gt at t t ggaacat c gaaat gt ct t aat aaaagt a acaaaagt cg act agggt t t cggcacaagg aat gt at at c acact t t t ga t act gt t gat 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 242 <211> 2012 <212> DNA <213> Arabi dopsi s tha i ana <400> 242 gcttttccac caccgtatct cttgtt ggt at ct gcc tcccagcaga t t caa gaaatgagaa at ct t at t gt cacati agct t t ccaa acat aaaaat ct t ca attaaaatca gaacgaaaac aacag agaaaatctg aatataacac aatgc( t caaagct ac act gt t ct cg aacat1 tgaaccaagg cataaacaga gaaat cttactgaag ccattatcag cgaga tttgttgttc gaggacggaa gtttc ct t gaact cg gt at act cgg cgagg cgat tgt tgt cccat tct tc ct t t agaacaccac aaaaatccga tcgaa acaagggaac gaggggtagc gatgg~ atatgagagc gctggatttg cgcga! aaat t t t gaa t t t t t aaat a taaaa cgaaaattgc cttaataaac tttatt tcttcatgtt tttacaccac ggtgg( t gagcat t t t aaaaaacaaa t at at gaaa t cct cct t ga ct ct ct ccaa gaaagccat t aaca ccaa caat a at t gc cacaa t gat acaga aat t ggag at cac at cca t gat agaaa gt aac agt aa t t gg ct cgt at at a acaacacaat ct aaagcaaa aacat t ccaa at at ccaat c gaat cgagag t t cgaaact a gcaaaat cga gaaagt gt ga acact gagcg cgt accgcga gat gagagaa gagagagatt gcaggacgca agagacat t a at aaaaagaa t aaat aat t c t t t t caat t c t at at at at a aggct gcat a t t cgt gcat t agaagccat g aagacat ggc caacgaat at agcat t cat a cacagat cgg ccgt cgcaat gcgacggagg gcgacgaagc agat aaaaaa agt aagcgat gagat cggag t t t t ccct t t aaat t cat ca acgt ct aat t aaat gt gaaa t t at at at t g acaacagagg ggacaaccaa cat aaagct g at t gat t aga cct acgaaat agaaaacaga gagat agaga t gt aggt aaa t gaaat t gcc t gt agat caa acgaaacct a gt aaaaat ac aagagaaact ggt aacgt t c t t t gt at agt aat aat agt a t gt t t at caa at t t t t t at a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 264 12689250 Sequence Listing.txt aagaaaattg cgatgtgtaa gggatatttg agttgttaaa ttttctataa at gt t act t a aat aat gt t g ct aact acct aaat t aat t a gaagt t cgaa agaat t aaga act gagct aa aagacagaga ct aat t t gt g t t ct cgacat aggt t t t at a gct t t t caac t t ct agat ct tggatcggag aaagaacgtt caaacctttt aaat t t caat gcaggt t ct a t t t t gt t gct aat t t ggt t a t act ggaat c cagact ct ct ccat cggcag ttgt t t gat c ctat t t t t ga t t t t t ctt ct gtagtttttt at t t aaggt a caat t gat aa ct t t cat t cc t at t t gat t g aacaat gcag gt aacaagat at t acaacat act gat caat t cgaccat gg agt ccacgag t ct ccgt t t t cccaat ct ga cccat gt cca tccgcgaccg aagaaaaaat t gtt gcaaac aat t cact aa t t gat caaaa agcaat t gt a act at gact a t t gt t t t t at gat gaat t t g gacct gaact ct t t t t ggct gaaatt gt t t ct acct t aaa ct t t agt cct cccaaaaaat gg ct t cct acca cct t ct at t c t ct t t aggaa gat gacaaga t gagct cccc gt t t aagcct gaaaaaaaga t gt cct aaag ggatccgggc t t t t t t agt c t gacaact t t cgt t t t gttc t ct ct gat ca at t at at gt t aaaaaat t aa taaccacaac cacgt t aaac gat aacggt a ccgt ccat t a t t t gt t cgt a t at t t t gagg t t ggt aat t a gggt ccaact ct t aaat t ca accct t gccc t t t cat agt c gacggtcgat 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2012 <210> 243 <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> 243 agcaagagcc gatccaccac caccactatg agcttctcga aggcgtttgg cgttttgcgt ggaagcgagt tgggctaaac gctcacgacg gtgagtttga caatgaaagc ctcggctttt atctttct t a gct t gatttc cacaatcttg gaaccctccc ggctcatgat cggagactac gttgttgttg ctgctgcttc cgctaggaac tgaggcatgt tgttgctgtt gttgttgttg aggccaaatc tcgaaaccct tgttgttatt attgctcttg tcttgatgat gatgatcctt gttgctgttg ttgtctctcc ctcctagata taatcaacat cacacaccca acacaccaca atttgatttc cttttctttg ctataaagat ctgcaaacgc cgccatggaa ttagtctttc tctgaaagtg gaagtttgct ggatttactc gt ct t t at ca t t cacggct g tttagcagca acaacaagtt acaat t cat a gt t gaagt at cat t ccaaat t t gat gt t ct at agat ct cg gt ct act t ga gaagaaccct aagagaaat c t gacggt aga t t cat cact t ggt ggat t aa t cat t at t at gaggctgagt ggaacccaag ct acat ct ca cct cct t gt c aaact ccggc gagt agaagt t gt t gt t gga t ct t t gt aaa t gat gat ct t gccat t t ct t aaagaaaagg tgatgaggaa aagat aact c gagagagaga caccaccact ggt gct gcaa t gct ct t aac t at gaggaca t cgt caccgt t acggt t at t ttgcaggagc agt at t gcgg gat aaagat a gct t gt t gt t t ccccagt gg gt aaat ct t g gctt aagaaa aagt t t gat c gagagaaaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 aagtttttaa gacatagaga gtcccatgaa gatcttttgc tttgggcact tcactacagt Page 265 12689250 Sequence Listing.txt tatcttccag tttgagtgaa tccccatcac ataaatatca ctcaaactat aaaagaaaaa agagt aat ca t at t t t cat a acgcgt t gt t at ct aat cga gccgaaccaa t t aagat gt g ct agaaaat a at agagt gcc ct ccccaccg aagt t t gaga aaat gaaat c t cgt cacgga gacat aact t at t t at t cgt cgt agaaact aaaaacaaaa aaaaggaat a t ggat aaat a t t ccat t cca t t aat at gt a atccct t att acaatctaat tt ct t t act a t gt aggaagt tggttttagg attcaaaaca cgtatcgaat ttaataatga ttaattaaaa gaaatttctt gaaaatagct cgctttttaa acgtgttaga acagggagtt tttatctttt tacttcggtt aaacaaaaaa ggcaaagat a tgagaaaaaa tgaaaaggat cggtctcatc gtgaggttta tttcaacgac gcct t cgtgc ctaatcagaa atacgtggt a taaacacttt ttgaatatga aaaaaactat gt gat aggt gac t at at gct t a t aat acat gt cat t t t t t t g at at gcat ga ct cgt t t t cg agcgaaaat g gct cat agt g t t t ct ccact t at ct at t t t aaagt ct gaa agt ct aacgt ccgct t aat c gt t aat t t t t t t aacat t ga act at aagaa at t t t t t aag t t ct ct acat aagat gat aa t ccacagat g aggagt at aa t t t t agt aga tt gt t t cttt t t aat t t t t a cgt ccct ccc gt gat gaaat at aaagaaaa cacat t cct t acggcgatt a cct t t t t t ga at cgacat ga aact at gt gg t gt t gaaaaa t aat t t t gt t t t t t cat agg aaat t aaat g caaat t at at aat at at t ac t t t t aaaaat t aaaact aaa t caat ggt ga ccgcaaaacc t at at at ggg aggaaaaaaa t gacat t t t a gact t t caat t aat t t t gt t agagggcgat tgtgtgaaaa 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 <210> 244 <211> 2007 <212> DNA <213> Arabidopsis thaliana <400> 244 taaaaaaata aacggggact agat t ctgt g tacttaatat ggtcacaact catatgagat tatatatatt aagcataaat actatatttt tttgtgatgg taagtttacc aaattgattg tattgtttag aaatattttt atatgtgagc ataaaaaaac acgtcgtgat ttggaaaaca cacgtttaat attatgatac cagatcggaa agggagcaat ttaagatgga aacgtaaata caatcgaaaa at t gatcgaa t ggt caagat gcccaaatct tgagagcaat gacatgcaaa gtgaaaaaag aaccactctc tcgttacata tcagactagg aaaccaagtc tcgcattgga ccggt gat cg t agt acccaa t t t t t gt t ag t agat t aaaa ct aaat t act cgat gat gag aaagat gat a at t at t t gt t at gaccgaga at gt t t t gaa gagggt agt a at acgact t t gagaggcgct aat aaat t aa t t cat aaat t aaaaaaat ca acagt aaat c gt at t t t at g t gt t at gt gt gagaaagaat agt cat cgga agt gct ct ca t at aaagaaa ggt t t aaaca t t acaaaat t aat at at at a t t at aat t aa acaaat aat a ttttacacaa at gacaat ac t t t agagct t cgact agaag aaagt t gt ga t gaat t aaaa t t t acct at c t gt gagct ga 120 180 240 300 360 420 480 540 600 660 720 Page 266 12689250 Sequence Listing.txt ctgacaaatc gattttatgt gctttacaac ttagatataa ctgaacatat t gaggt t gga cgct t aaagg t t t t t gaat c caaaat t aca ggt agacat c aact cgt t at t t acct caag cacat t aat c gat t at t t ca aaagcccaat aacaaat t t c aaat t at aga aaat t acaat aaaat at agg aaaaaat aaa ct at t t t caa t caat ct caa t t t t t t t aat t at t at t cca ct ct at aaga aaat caaaca cgat agact c at t t t t t t t g aact t t acag t t t act caaa ccat t gaaag agt ct ggaca at at ct ct t t t gct cat t t c ct aact agga aact t gt gt t at at t gat aa ttct t t t gat at gaaagt t g act gaact ac aagaaat act t gagcat aaa ccgt t agat c t t gagt at t a t t t gcat t aa accgacacag cagagaagag at agacacaa at gaagt t t a at ccat at t g agaacagat g aaaacaaat t aat t gt cat g gt t ct t cct t aacat t aat t aaacaat cca t agaagt cac at t t t aaaag ct t t t ct gat ct t caaat t a caaaggcaat caaat ct cat t t acacgcgt aacaaat ccg at t ct t t t cc aat at aat aa agacgaagaa aaaat gg t cgt t gt ct c t ggt gat ccc t aact aat ga cggt t gtt ga gccat ggt at gt agggt t t c gt gat agt t c t t ct ct at t g tttctttgac at gat caaat at t t cct at t at t gt t t ct c t agat t ct t t aat at t caat aacct t t t t g cct cagaagc gcat cgaat t t at t t gacaa aaagat gt aa gaat aat cac gaaaaat cac gt t cgagaac t t at agact a t ggcaaaact ggaat t cat g at t gctct t t cacct t gat a t t t t t aaat t acat gggct t ct caagacct acaat at t t a aaat aaaat a tctgat t t t g gt gaccaaca act aaat at a t gaat ccgat cct ccgcgt g cat gt t t t at gat t ct ct cc t cgaaaact c t ggt at t gga gagt t t ct ct at cacaat t g cat agt t t ac t t t t t ggaat at aat gat t t ct t t gggt ag t gt acaaccc gt gt gt at aa t act t aaaca at t at aaat t aagt t gct ac t aggaat acc t t t ct caaat aaaaaaaaat t caat t t ct t t t gcaat t ct gt t t ggt cca t aat t at t at gcct cct t ct at caaat caa 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2007 <210> 245 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 245 atttaccgtt tgaattcggt cagaattcga gcaatactaa ttaagctgac tgacggaaca gttcaatttt ttttttttgt ttatcgtttt taataggatt agatcaaata tttctaatga t at at t t t t a at aaact ct a t t at t t t t ca atttttgatt caggtcctat aaactgaaac gaaatctttg tacaaattta t cttttaat t tcatactttt ttccaaagct tatgaatgct tgccaacaaa taaaaatttt caaatgcaat tccggacttt gtaacgttta ataccaat t a acccaat t t a aaacattatt attgtaggta t gat at aat a gat aaaaaaa gct caat t t a gt t t t act t a ctaaataatt tgatccgtaa ct t aacaaaa t t gt t t at ct ccacaacaaa atacaact t g aacacattat gtgt t atact Page 267 at at gact ga t gat agt t at aact act t ca ct aaaaaat a aat t aaaat t t t cat caaat cct aaat t t t gt gaacat t t at gct t t aac 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gactaatttt ttgcgacact aaaatgaaaa cagtagtttg atacatttgt aaactgatgc at t t t t aaaa aat ct aaaat at t t gat t at gt caaaaaaa ct t gccaact t t agct t t cg gaat ct gcaa ct t t t t gagc gt t t at t t gg tat t ggaaag agcat t cct a agt ccat aca t ggt t gcgag aggt t t ct t g aaagacatt g at ct t caaga t t ggaact aa at t t at agat t t t t agt t t g gagt cact t a at t t t at aaa aggt gggcat taaacgacac aagcgagacg agat t gaat a cgaaagagct acagt t t cag gct ccacaag tcct t t t ctc act t agaagt ct t cat ct ag tagaaagaag at gagccaag aaaaccaaca t cacgacagt t gaact ggt t gat t t gcgga cagt gat cat at gat t t gca t cagaaaat t gaagaat aag at t agt gt ct t t t t at aaaa aat t gct t t t t ct t aaaat t gcccat agt t gat cgt t gac agagacgat a aat t t t acat acaat ccaaa t t t ggtccgg cgaat cat ag at aat ct t t g agaaagat cc gat ccaagat ct t agaaagt gt t acat aga cagat t t ggt agcat t t gag gct t ccgaag at at gaaggt gt t t t aaaac accaat at gc caagt gcacc aaact t at t g t t cgat t ggt ggtt agacac t t aat t t at a aaat aat t t t gaccagct ct caaagaaagc at gg t ct ct ct t aa t ct t t ggt at t t ct t t at t t ttgagagagc t agagt t ct t acat act cat t gcacccat a gat gt ggaca aaggct t t t c aact t agcat tctgct t t t a aact t t caga aaaaat at t g gaat t cat ac aaggatgat t at gaaacaaa t gt t t gaaaa aagaat t t ga at t ct t gat t t ct aaat ggg aaat ggggt a agt ggt agaa agggt cgt t c aaaaaccat a t aat at t t cg t t cacat t aa cgacaaaagc t caat acat t aagct t at aa gccaaaat ga acat gaagac gaaat t t gga ct at ggt gca gcat cagagc ct ct gct t t g t gt agt t at g t t t gaat t gt gcaat t agt a t gt agaagaa ct t gaacat g aat t t gat gg t agt agt t gt t at at gagt t gat t t t aaat acggcacgat ttaaacgcaa ctt acaacaa gt t t ggt t cg tat t gagaga t ccacat aaa t t t gagt t ct gt t t t t t caa t at t gt aat c gcaatt agat t gt t t t gagc t gacat act t acgggt ct cg t act cat aat ttttaaaacg aggt accct a cccgagcaga gcaacat gt t t t t gagt t t g t aagt gt t ga t t t gtgt t cc t aaat gggca gggct aaat g gt t t gaat at tcagagagag 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 246 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 246 gattaccatg agatcaccgt acgtccatga gttgtctgac attgatggaa tatcttttac cacattctcg agtgacatct taaggcgcaa tttatttatc acaggagaga cttgagatga ggtatgtttc cgatagtcat gcacctacaa tagtatgtgc caaaaat t aa agaaaacat g atgatgtaag atggtacaca gcagtcaatt aaaaattagt ataactaggt tttttatgtt tacttctggt cccagttact ccattatgtt tcagaaacaa tcaaatacat gtacaattga Page 268 120 180 240 300 12689250 Sequence Listing.txt gt t t ggaaga t aaact t cca t gt act t at a at t acgacat t t t t t t aat c gt agat aat t gaact t at gg accccat cca t caat agcct gaaagt gt at caagt gggga caccaaaact ccgaaaggag acgat at ct g t aaaact caa ct cagaggt t at gat ct gca t at gt gt caa cgaaaacccg at t t t ct at a ct aagt t aaa gccaat t cag acacccat gg cgat acagaa ct agagaaat at agaaagcg ggaat t t gt g gagagat t at t t gt gt ct ct t aat t t t ggt t t gcaaat t a ct ct t cgt gt agaagaagag t ct gt t gagc aaggct t t at caaat t t gca at aaagcgt a ttaacaaaac aaaaaaacat t t gat t acat accct ct gct t t gcat agcc at acaagaaa t t t cct at ag gagaaaccaa agt ggt t cgt gaaacat cgc t aact t t cca t t acct t t gg t agaaat ct t at t ct ct cca at aaaaaaga t gaaacct t a agagaaat ca aaacacaggt t gt t agggt t gt gcacact t t agct t cgat gggt t t t gct gat t t act ga agat aagaaa t ct t t t t aaa cccaaaaaat aggt at at ct caaagggatt gcgagt aagt cat ct t caca gt t caaat ca gccct at cat t ggt at t caa t cacaagat a agt acccat t aat gcact ca t gagat ccaa caat at t gct aaat aagcga t t t cccagat aggt cgaaac gagagat gaa cacgagaatt t gagct t gt t aaggt gact a gggaaatttt ct t ct t ct aa ggat t t cggt t aaaat t t gg t gat at at aa gt gaaaaaag at gg t at at t t at a aat t gcagat t ccaaaat gt aaaaacattt agt gcagt ga t t t act aaat agt ct at aat acggat gt aa aaact caaat aat aaat t ca t ggat agagg at t caacat a acct ggagat at cat cagt g at t t agt t gg agaacgat t g t t t ct accaa gct caaacct gaagaaat cg acagagaaga agaat agagc t gaagaagat ct t caaagt a ttaagacaaa gt aaccgagt aat cct t cac ct t at cgcgg t acgt t t gga aact cacct c ccccaggaag act cagt aga aat gcat t at ct aagt act t aacat agt t c gacct t t gga t t gt gaaaac t at agact ct ct caat at ga gct agaat t t at at cgggt a gcct gaacca aacacaat ac ct t t t gggga cct t cgat at gt ct aggt ac aagaat gaat t cat caaaca aat gaagaat gcgat cggaa gcagaaaaag acggaagt t g t t gggggt ag cct t at aagc cgat t agt t c t t gat t aagt ct gt t t ccaa acat at aacc cct gccagcc at gaaact ca t cgggt agaa t gaat t cat t gaaagat t t a acat ct cgaa gt gaaggt t t cagt acat ac aggaaact t c agcaaaacat aagat ggct t aacgt aat aa aact aaaat t at caggagt a ct t aaacgag t gt caat aaa tttaagaaag at t ggat cgg t cat aaacat gagaaaggaa aaat t at ct t aaccagt t at aat agacat t ct t t cgt cgc ccggcgaaga 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 247 <211> 2004 <212> DNA <213> Arabi dopsi s tha i ana <400> 247 ct t gagagag t caaagaaga aagt ccat t a gagacct t gt t cat caccga aagaggact g agt ct t ct gg t ccct gt t ca gct caat aag gaacat agca caagt gt t ga agaggt gaat Page 269 120 12689250 Sequence Listing.txt ctttttgcga gtcctgat t c agtgttttgt tttgcgaaca ttagacacta acgatctttt gatgtgacag gaat aat aga acaggt at at gt t t t t aggg t act at t aca tcctcagcca at cgccgt ac att cct aat c cgaggt t gac agaagtcct c ct t aagat t c at ggact t t c gt t t ctgtga at ct t t t gaa gatttttttt at t t t aacga agaat aat t t agt t accgt g ttgct t t t t g at agt t at ac t ct gat gaac cacactgaac tcagaaacga at at t t t t cc acacgaaat a atgaaaggcg t ct cgaacgt at t at t gaac gt t t t ggaga t agctt at aa t t t gaat t t t gaagtgagaa gtgaaaggt t at at act cat ttggat t at t aaacctggt t aaatggaacg at cacgt gct ct ct cat ccg t cat t cat ct ct t agagct g gaaacgcaat at gaat tgct ttgtgat t t c gagact ct t a tttttttttt tgt t gcattt at at aact at aat gaaaat a cagat at gct act t t t gact taggataagg accct t t t aa caaaaaagaa tgctaat t t t ccaagtggac gtgagaatct acaagaatcg cgacgtacga taagagcgga at t t gggagc ctcgagaaaa gaatcaagaa at ct caaagt gat t t t aat c aggt ct acaa t gt t t t t at t aggcgcttct ct ccgt cgct at ggact at a cagcct t ccc ccaagaaat a ct t t gat t aa agagatacca actt gaagga ct aaagt aac ttggat t t cg gt gt cccat a cct act agca acagact t ga t gccact aat aaacaaacaa aatt cggaac gat t t t ct cc gt ct aacaat at ct agt gt a gactgaggt t aaacggtgat acgacacaca gaat caat gc gaaagat t gc tt cct ct at c atgt ggaacatcct cact cgt gca agat agt t t c ggt t t gtaga cgat t t gat a agagatgttc acaccaccat cgaat actt c tgaaggcgat t ggt aat act act cacgt gc caaggagat c agaat t t gga t t t t agaat t ctgaggt t t t t acgt t at t g aagctaacgc act t t at aat aaagt cat aa aat cggt t t a t gaat t t t gc tt ct ct t t t t at caaacaat gacaaaccca aat agat ct a aagaccataa aacactccac gctgagggt a gacacat gt a at t aat t t t c gat gat at at t t t ggcgct g aaaat t t t gc aat at caaaa atagt t t t gt agaat agact agact ct cat tcgaacgagg cct gct caac aagaacaaaa attttttttt gtcggaggt a gat ctt ccat gat t caat ct accat t t t gt ttagtgaaaa aaat t t t gaa act cgt agt a at t t t at at t gcaaaagaaa t acgtt ct ct caacgt aat t t t t t t t at ag aat at acgat gccgt agaat cacacggaac aat t at t t ga aagacgtaaa t ggt caat at at t cat aaat t agcgat aga gat t t ct t aa at gt t t t at a tgcagaaaaa t caat gt ct t acgtcggaac cacgtgacaa aagccatct t at ct cat t ca cgcaaacgca ttttttaggt t cat gacgat gt gaat act a t t t t agaat a t act cagcat at at aat gt a ct cgaactt t t acgt aat t t tt cat aaact aagt t act t t ctggaccaca tatct t t t ga at at t t t t ag t at t at aaaa aaagat ct gc at cggt acgc acactggaca t gaagaact a t aat ct cat t ttttcttcaa 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 248 <211> 2008 Page 270 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 248 at t t aact at t gcaact at a t t gat at t at gt caaaaaat caagaagcca ccccat ct ac cct aagaagt aaaagt t t at at cagt cgct t ggaaaat gt acaaact at a t gcat cgact t gat t t t t t t gt caacat t t gat act t aca at at at at aa gat act gat a aaat aat t t t ggat cgt aaa gggt cgt cgt at aagt t t t c ggat aat gt t aacgttt gac aaaagt aat a t gccgt t t ac t t t t acat at t gct aaaact t aaat agaaa aaaaagcaat at at acgt aa aat ct ct ct c cat aagaaga ttaagacaac t aat ct aaat aggccaagtt aat at t t t t g act aaaacac gat t t at t cc t agact aaca at t cct accc cat cgacaga t ct t cacggc act aat t cca gaat ct t caa t t t at gat t t tttaaaaaaa aaat at t t t t gat t t t t t gc t gacaaat at t ggact t gt g aaact cat at ccaaat gacc ct gacgcat a t ct cct act a t gt t at t aca t agt aaggaa aat t gccaat t cgccaat aa caat aaaaca at gat gggca ccaat t at at ct aaaact t g t t t ct t ctat agaagaaact at gt at aaaa gt ct at t at c at t gat cagt tgatggaacg cagccccaac ttctat t t cc ggt t t gt gt g aaact t t cgg ctt aacacca t ccacct acc t agt t acgt a caaat gcaat caat t t t cat t ct t t aaacg aagaaatttt t t t t at caaa gat act gat c at act ct aaa at aat at t aa acaagaagat agat aacat t t at aagat gt t gat t aat ac at gaaaagag t gcgat agt t t t t gacgt t t t t aaat t act agt t gat gt t at ct cat at a cggaaat aga aaat agt ggc acaat agt t a cct at at ct t t t aaat aaat aaaacgaggc t aaat t cacc cct agt gcaa ct t t acgt aa gt t ct t aaat caccagagac caaaat cat c aaaat t t cga at ct t aact t caaacacaca ccggcact t a gat at ct gat t t agt at t at t gt gaccaat aaacat at t c aat at caccc t aagt agt ag ttcaaaacag at t acaagat gt acat ct ga gat at aaat a gcat gaagca act ct t ct t g t ct at t agt t t t ct t gaat g at t cgt aaat t acaat t t ct aaat gccacg cat t cccat t at caat caaa aact aaaaac aaaaat gcca cggcacgtag aat t caggac at t accat t c agtagcgacg t t t t gggtaa aaat t aggt t agaat t t ct t t at caagct c cgaat t caac aaat t gct ac gt ccaaaact ct aagagcat t t at aat gt t cagaagaaac t aat t gct t t at at acat gg aagagcgtag aggaaaat at tcagaaaaag aaaaat at ga t t aacat t t t t gcct ct t t t cgt gt acgac t gt t t gat ac aagct ggaac t t at t t agat t at ct t act t t gt at ggt gg ggt t gaaat c gagaagtaag t ct at at t at ct accact t t gaact aggaa ct t gaagggg t at at aat ga t t cgt gaat t aaat t aggaa t gt t cagaaa tgggacaaaa aaaagcat ac aacacat gca caaaaaaat a t t t t t t gt gt gt t cat aggt t gt t t aat aa cacgt cagat act aat at aa t ct aat at at accat gt cct t t ct cat t aa aaaggt gaaa at at at t t gt t t t t caaaat t t t ggtcggc t t t t gt t t t t t ct gt t gt ct aaat ct aaca t at at t at at t gt caat gt c acat aat ccg acaaaagcat agaaat ggca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 Page 271 12689250 Sequence Listing.txt gat t ctaact gtggatgtgg ctcctcctgc aaatgtggtg actct t gcag gtaaacccct agattctctc ttctttacat ttatatgc 1980 2008 <210> <211> <212> <213> 249 2003 DNA Arabidopsis thal i ana <400> 249 at t t t gt t t a at aat gggt t gaaacaagaa cggt gacgt a cat gt gaaca gctt ccaacg at t t t cat t t t ct cct cact t at ggt at ct cagccggcaa t t gct aat t g cgt aaat t t t gt aat t t t ca cccat aact a accact cagc caaaagaaaa at aat t t t aa aaaaagaat a t aat at t t t c at aacaacga t cact t act c acct aat ct a catgt t t t t g t t t t ccat gg t cct ct ct ct ggt act agt t t caat ct ggg t agt t t at t t tagagacaaa ct ct aat at a caat aaaaag t gat gt gaaa acagt gaagc t gt ct ct t t g gct ct cat t t agt t t aact a at t at t gt ac aacaaagt t c ct ct cagccc cat t t t t t ac t t ggcggt at t t gaaact ct ct ct t aaat a t at accat aa t aaaat at aa acagagacaa at gt t t t t t t cat gt at t t a at t ct caacc t cct t aat ac t t t ctctct t acggacctt g t t ct agt gt t t caat t t ct c t t t gt t t gt t t t ct cgt aag aaaacat gaa at ct aacct t agtgggagac gaaaggaaga t gcaaagcct at cat gacct t cacat t gat aat aat t at a at t t cat aac aat t gct gct at aat t at aa cact t gt t aa gcatat t t t t gat t at aat g t t ggat ggt c aacaat ct ga taaacacacg aaaaaagt ca aagct t cct t t t at t at t t c gt cagat ct a cct t act t t t ct t ct t gat t acct ct ct gc t gt t t gat cc taaaaggaac t ggt t ct t gt t aagt gat t t gt aat gaaca cgagtgagaa aagagggtga gcgcgggtga gat t t ggt t g t ct t ct t caa t gt gt t ct t a cat t gat t t t t at accacac t t ct t gt ct c acgagt agaa at gct ct gt c agacagacaa gaaaacat at ccat t t ct at t t t gat t t t a cat gt t at t t t t t caaat at acaaaacaga t ccat t agat act acagt ac gt agt t t at t gt t gcgct t t t t ccct t cgt t ct gcgt act t at t gaaacc agct t ct t t g agt t gt gat t ttaaaaaaaa aaaaat gaac t aaat ggt at t gat ggaat t t t t ccct t ct at at acaact gaat t t t gt t t gt t gt gagt caaaagt aga t ct cgt at ca gaat at t t aa tt act t t t t t aat t t aaacc t aaat t t at a ct aaat cat t ct t at t gct a gt ggacct ac at aaat caaa aacct gaaac ct aacgact c tccacaccac act aaat t t a t gt t agct t t gcat t t t ct c ct gt t ct t ct t cct t ct ct a gat t cat t at ct gaat t t t a t ct t agt aat at aaaaaaaa agctacggga aggtgacagg ct t t aagcgt ct t cact t ct cct cgt gcaa ct act t aaat gcat at t aag t cact ct cct cactct t t t t t t t at ccat t gat acgt ggt ggt at t t tag ttaacaacaa aaaccat at a aaaat aaaat t t aaaat t aa cat gt t acct agaat cact c at acgaacaa ccact cct t c ct ct ggt t ca t ct ct cact a t ct ct ct t ca gt t t caat t t cat t t gggt t gggaaatgct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 gaagaagtta gagtaagaaa tttttggttt ttgaatggaa agtgattgat tttggggaag Page 272 at gagtt ct g ct t t atctta ggt t ct t ct t aat t agat t g cct gcat at t ggaacgat aa ctt gtt aagc aaaat t gaga at t t t gggt t gaagat ct ga 12689250 Sequence Listing.txt ggt t gaat t c t ggct at gaa acct aggt ta aagt tt ct aa aaatctaacc caattctcat ctgaaaccct aatttttct g attttgttca tgcatctata gaaagtgaag atttttatgt gttcttcttg ttgttgtgat gctaagcaga taaaacaagt tgt 1800 1860 1920 1980 2003 <210> <211> <212> <213> 250 2004 DNA Arabi dopsi s t hal i ana <400> 250 t ct att ctt c at acaaat gt t t at t at gt a cat at at at t act t at at t t att gt ct ct t t t t gt gt ggt ttacgaacaa gaat t t t t ga cat ct t gcat at t aat t t ca aaggatgagt cat gatt at t gaagat t t at acat gggccc cact ctt ct t aagct t t aca aagt t gaat t t t cat at acc atgtct t t t a t gat acact c at t t t acat g t t t t t t t ct t t cgct t at at gt gaagaaac at gt t t aaaa t gat cagtt c at aaat gat t tttttgtcaa t gacat cgct ttct t t t gct gaatcttttt at caaaccaa t gagat t t t a t t t ggt cat t aaat t t t cgg ttggtgaagg acgat t t t t a cgt caaat at act ct t ccac ct ctt accaa ct t t t agccc aaat ggtt gt gtt gacgtt c tggaaccaaa at at at accc at t t t t t t gg agaat at at a ggt aaaaat t at at t caat t ct at t at t t g gt agcat t t g cat gt cgaaa agcct t acat ttcacaaaaa gacacaat t g ct aagcct aa gt t gt gat gg cgt t t t at at t gt aaat at t t ct ct aat ac ct at at t gt c t at aaaagaa tt caat t t t t caaagcccaa acccat t t cc at aact t t ct agaaat aaaa aat t gt t cat at agaccaaa aaat at agca ct t at ct at t t gcccaaaat ggaaaaacat acggt aat ac at aagaaaat aat t aat aca agt aat agt a t t ggcat at t t aaat agct a gt t gt aaat g t agt aaat aa gt t agt act t agat t t t t ct t t t t t agtct ct ccggt t t t cat t gat t t t t at t aaaaat act t ggact a t caat at at c t t t at t t cca t t t t at ccaa t aaaat t at a at acgggt t c tcaacaacaa t at at t t at a aaaagt at ca ggaaaagaac acaaat cat t gt t at caaaa at gt act aat t t ct caat ca t cact gt cca gacgaagcag t at at gat t a caat gccaat at ct caat ac agt agcccgt cagaaaacaa taaaaaagac aaaaaaaaac ggagat at at ggt ggggt t g t t gct t cggt t ct cgct at c accct acccc at ggat t t ga cgggt t caat cgt ggt cgt t at gaagaaat at at aact t t t acaaact gt at at gccaag act t t at t t a catttttttt atgt t t t t gc cttt caagca t t acat t t gg t t t t gt at t c t t at ccatt a at ccat agca aagaacccat t t gaaat gt a aaaat t ct t g ccaaat t ct t at at cagtt g gt at t at ggt gt gaagaaat t at at cgt ca ttcaccaacc t t t at t t ct c ct ggt ct cca t gtt caatt g ggt aat at at t gtt agagt a t ggct at gt c t t t t act t ct gtt gat t t t a at t aaat cat t acat t aat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 Page 273 gtt acat t t t t t aacat aaa aaaat ccagt t at gt gattt ccat act caa ct aacat t t t t acaact cct tttttttcca aacaagcagt ttttttttgc aat aaat aaa taat t t t t ca taaaagaaag aaaagat t aa caccaacccc gt t t cct t ca cct aaat ct t t t ct caggt g 12689250 Sequence aaat att ctt aaat aaccat at ataacatt tcaacaaaga gaaaaat aca aat t t gct t a aaatgcagct taccaaacgc atgacaaaat caccctcagc accgtctact ccggtgaat t ggccaaagcc taaaattcac tgaatatcac aatatttact at gg Li st i ng. txt t ct t t t t t t a aat t t gct t a t aaat at at t aacgt gaaaa aaaat cat ga gt ct at at ga acaaccaaaa at t t acaat g t t t act at aa t gaaaaat ac accact agt t tttgagaaac aacaacaaca act cct ccga aaaccaacct gcgccacaag 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 251 2004 DNA Arabidopsis thal i ana <400> 251 t at t t aggac t accaat gct aat t t gggga aaaaaaaagt tatct t t ct t gt aat catt c t aat gagaca aat t aaaat a at at agt t t c acat gat ccc aggt aaat ca t aat at ggt a t ct t t at aga caact gaaat t gaagct t t a act ct t aat a aat t at cat g ggaagaaaat gact gt at at t cat t aaccc ct ggt at aaa cact ct t cag t t at at t aat aaaaat at t a ggt acaat aa at t t t gt t t g t gt at ct aac aaaat gtttt t gtt at gcaa gacct t aaga t t ctt caat c cat gaaaaat ct caacaaca t gt t t cact t cat t t t agaa ggct t t agca tttttacaaa t t t at t aat t t at t aat t t a at t ct aagct caaaagaaat at ct cat aaa t gat agct t g t at t ct t gga gt acaat gca tgcaaagaga ct aat t act a att cacat at gt t t t at t t g caaact at ca tt aagt t t ct ct gct gcaac ggccacat gc aaaaaaaat a t at aaat aga at ct t at gcg ct t t ct aaaa t t gaaat agc t at agagaca at gagat at t ttgt t t t aag gat t t gt t t t aagt t t t at c ct aacaaat c cgt act gtt a cagaaaaat a agaaaagggt at t gcaaaaa aaaacagaac at gat cat gg t gt gt ct at g ttgacagaaa gat gt aacgg aaaaat t at g t t aacaat aa ct gt aagt t t t gcagaat aa cct cgt t t t g aat t gt aat t at at t t gggc aat t t at t ga aaaat t t aat agct t ggcca t t cat t aacc aagt caat t a t at t aagaca aagggt acaa acaat aat gc atgt t t t t gt t aaacaattt t t t t at gt t t t at acat aac gt cagaaaca tt at gcaaaa cccggt t cca at acgt gt t t ttaaaacgca at att ggt gt agcaaagt ca t aat cagt aa t aaaat at ca at at t aat t t ccgt t t aaaa at agt cat gt caaagaaat g cgt aat gaac ct gatt ggt t t aat gtt gag aat t gaat t g gct gaaaaa t gcat t t act tt ct aaaaga ct at gat aca gtt at ct t t c at t t act gat act t ct aggt gt ctt ctt ct at aagaaaaa t at gt gcat t ttcaaaagga aacct ct at a at t t gagaac at agat t t t t agat t t at ct at ggcccaat at t t gt at t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 agatgggcca atagtcaagt gtggcccaat taggagtaaa atctcataaa cgaaatgggc Page 274 12689250 Sequence Listing.txt ccataatggc ccaatatttg tacaccacaa tcggaaaccc cttttgggtc atttcaagtt tgacgagaaa ct t ct cgaaa t ct cgaat cg ttgt t gt t gt cgagat ct at gcgcgat tag t t t t at cgat t aat t gaat t t t ggct t gat gct at aat ct ct at gt act t cgt aaat cgc at t t cct gcg at gct t gt t t t agt cgt ct g gaat cgt t at gaagaat aaa tgtgtgaaaa gaat t gt t t g t agggt t gaa gat at aagt t caggt t aagg tat t t ct t ct cct ct ccgt g t gt gaaat ac t gt agt t t at ct gaaacaaa acgagat t t g aat t caccat att cagcagc t at gcct t ga cgt t t t ct cc at gt at t cct ccct t ct gt gaaat cgat t ct at c t cacct t t t g act cgaaact t t ggtagcag t cagat gct t t t aat t at ct aact t cat t t ttttgtgtgg aaaaat ct ac cgat t caggt agat ct at t t t t t cgat t t t agcagat gat t gagt t t ct t t t at t gat t t aat ggt t t t a caat cccact ct t gat t at g caacgagat t aaat cact ct gat aaat t t g t gggt cct ca at gt t at t t a agatcggt t t gt gt ct t ct a t gt gat t gt a at acact at a t t t gat at at 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 252 2004 DNA Arabidopsis thal i ana <400> 252 caaaccgct a acat t cagag gt t t at t t t c at t t gaaagc acaaagt t t g ttggacaagc t ct t gact ct gt ggct t gga t gcagct at a at gagaat gc t ct ct t gat a t t t ct gt gca at cagcct ga act t t ct t t c agcaat gat t cct t agcat a gggat gt cac t gcagt t ct t at t ct gct ga aggaaatgga t aagcat cat t t aggt t cat aagat gaat g t agt t caaaa gct t ct gt t c cat gct t cag t gcat acgag t cgagccct t t gt gt ggat a gt cat ct gcc agt t acagct gt t t gt cagt agagcat aac ct ct t t t cag gt gcct t gga t caaggt aaa t t t t ccagat aagct ccaga t ggt act t t t ggaagccctt t cgagct at t gt t t gt t aag t ct ct cct t t ct t cacgcag aact acagaa ct ccat gat c cct gt t t at a caaacccgt c gt aacagt gg gatgaagaga ct t t t t t cgt gaacaaggca aggat t gaac at act t t ct t gt t gt acgag t agcacat ct t aat t gaaaa t acaat t t ac at cggagcac cat gt aagct at t t t t gt t t t cgcagct ga aaccaggaag agaacat at a t gt t cccat a t cggaat t ca aaccaggttt aacccggact ct t cagt t gt aaat t aagt g at t at caat g gt gaat ct ct agaat acaat act gat t ct t aat at t at t t t t gat ggct c aat cat at gt t ccat t t t ac gct ccgcact tgagacagac at t ct t t gat caggat t gaa t act aat ct g act t at gaac t gct aat t at at t t ct gaaa ggt agt acag gtccgggtga aggt t gaat g t gcact gat a cagcaaagat at accaggt g cct aact gat t agcgacaat t ct ct t caca ct t gt t at cg aat cat t t aa accaagct t c at agt gt acc t at gt at gct t at t t gcaaa agct ggaat g ct gcagaat g aggt gaat ga t t agt aat ca agaat ct ct a t aagat agct at ct t t t ct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 275 12689250 Sequence Listing.txt ttttcacatg tatatatgat tgatgagagt ttacaatctt cattaggtca acacacagca acct gaaaat ct t t cacat g t aaccct ct t aact cacgat agt agt agt a aggct caaag agctggagcc t gct aaaat t t t aaat t ct c t t at ccat t t agagaat at g gtgtacggaa t t cact gaaa t t cgaat caa <210> 253 gat ct t t t at gt t aaaacct agt ct t aacc gtct t t t at t aaat t gt gca gagagagact agat t cacat gccagt t t t g ggatat t t t t t t t caaat t a ggt t at t t at aat at at at t aacct t ct t a t t gt cct ct c cggcggagaa at agaagaaa ct gat agt ag gcagact t gt t gcct gt ggt ggt gt gaagt agagaat aac ggcaacaacc ct gt at agt g aat t cat t ca at aat ct ct t ct t t aat cac gact ct t t t a t cct ct caga agt t cct acg at gg gcagaagaaa t gaact ct cg cgcct t gcct gat caat act t ct t t gct ga t gt ct t t gac at agt t t t t a t t agaat at c t t gaacaat t t aat cacat g at t t gagt t t t cacaggt gt gt ct ct t at a cgaagt t t t c gcaacact ga agaaagaaac t aact caaga gcaaat cat c gaat gagat a aaagat t ct t t ct t ct acag tttttttttt at t gaat t t t t t t t at t t at gat t at at aa cacgt gact c t t aggat cgc aacagat t cg agt at gaacc at ccaact gg gaat at cgcg acgccgt aag aat aat ct ct t gcaagaat g gcat cat t t t t ct acact gc t t aaggat ga cct t ct aat g ttat t t t ggt t at gat t t t a acgtgaggct t at ct act gc ccggagaat c 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 253 aagt cat t gt t acacaaact at t aaat ct a at t t ct t gct gt t at gaaat t t gact t t gt at gagccat c at agcat gt a act cct act t caaat ct t ca aagt aaaaga t t caaaat gt cct gagt aat ct t accaat a acaaaacat c at agt gat t g t t t t t t t cgt t aaact t cca at caacat t a t t t ct ct cac t t ct gat t gc t ccccaacat t t at gt gt ct aaaaat ccga aagat gaaat t t t caagat a cct t at t at a t gacgct ct c gaacgaaaat acgat gct t t at t t gat ct t t at cgcat ac t gat accaaa cat t at at cc aat t at at at ccat gt gt at tggt t t t t ag aat aagt at c gt cagt agt g acaccaat ca aat cccagag t aaagat gaa t at aacgt aa t gacat at aa acgct t at gc cgt at t t aaa act t ccgt at cgagaat aaa at at cacat t ct ct t ccaac agccacacaa aaacaat at c at gacaccaa at t gaaggga aagat ggt aa gagaaagaaa gcaccat aaa gt ct t act cg aaat t at gt t accat at gct cat aat cact t t at gacgt c gt t ct ccaca aaat ct at ct t cct caact a t caat t t gat acaat t t t gg tggtgaagaa aat cagt gat ccgt cat cgt at gagaaaaa t t aacagat t t ct ggat agc t ct gat aaca t t aact at ct t cagt at agt t cat ccaat g ggat cat at a gagagt t t cc t t t ct t ctat gt t gaggaac aacat t cgga gat caaat t a tttcaaccaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 caaattgcta taaccacgaa tgtgttcttt agagacaaaa ccaattgatg gagatagagt Page 276 12689250 Sequence Listing.txt t t gtgagaac gat cgcgt t a aacct cat ct aaat ct at aa at acaaat t a aaccgacaat aat aacaaaa at t t t t t t ac aaat t t t t t a t at gt gagt t t t at t at gt t tttttgaaaa caat at aat c cccaaat aat t ggaaat at c aaaacatt ct cct ct t cat a tcgaaagaga t t cgt t acaa <210> 254 gat aaagaag gt agt gaaga t gat agt t ag aaat caaagt t t t act t t ag t gt cagt gac t t aaat t t ct gct ct t t aac gt t at gt t ga t t at aaaaag gct cgt at aa aaat cgt cat t t agt agt cg at aat t t t cc aaat at agt a agagagt t cc agagagaagt gacact t t t c cggcgaaaag at cggcgacc t agaat cgat agagaagtgt t caat cgct a aagagaat t a t ct aaaaaca t t cacgaaat aaaact ct aa gaagaat aac t aacgt gagt aaat t t at t t aaaaaaaaag t t gt t gt aca t ct t t t gtat at aat gt t t t aaact ccaca cct ct cagag t ct ct ct ct t at gt aagggacaca agaaaat cgt gat ggt gagc gt gt cacct t agaaaaaaaa aat at acaaa at at t t t t ct t aat t agt t t acaatttttt t acat aaaac act t aat at t t caat at aat acaaacat ct t caaaaaaat taagagaaaa aaacct caaa aaacggcagc t ct ct at ct c cgatggagag cactttgat c aacaaaactt gat cgagt aa t t acagt t gt gaaaaaaaga gataaaaaac aaat t at caa t t t t t t gtga t t t ct t t t ac t cat gt agt t gtaacaaaac ct t t t t t t t g acaat at act aat aat t t at aaaggaaaaa caaaagcaaa cacatcaacc ctt ccaat ct gggaatccag gcggat aat a aactccgaca gtcgccgacg caaacaaaaa at gaaaaaat at cgat t gt a at aaaaaact t t at aat t t g aaaacaaatt tcaaacaagt t t t ctgtct a ct ct acat gt agaaagtaat cgcaat t t ag aacccaaaca ctctagggt t 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 254 t agt ggat gg cat agct t aa ct ggt agt gc ct t t caagt t t gat t accac aagat cccga tatgtct t t t at t ct gt gct gt cccct gaa cagaggtat t t t t at t agct aat ggat t ca cgat t agt gt t gat cagact t ggt gcccct t gat cact t c agaccgt aag t t gt at gggt t gt cct t caa ccaaaccat t t caact t t ct t aaacct t at gt t t aggat t t t t gt t t gt g gcaaagct gt gcaaggtct g gt gggaact t t t gt caat t c aaaaggaat t agt cct gat g gt t agt ggt g t aaggcct t t gt ggt gct ac t aagt cgt t g at t t ct t agg gggat gt gaa t ggat t t ct c cct ct gct at t aagt t aaaa gat t t aat gt gaaagaagaa gt gaagat gc ct t t t gtct a t gccat t gt t t ct t t gt t gt agact cgt ct at ct ggcaaa t gt t ggt gat t cat gt caaa t ct act t at g aat t gat gt a gat caat aga t gct at caga t caat at ct c act agact gg t t t agtcttt agat t gat ca gaat t gt t ca cat ct t gcag cgcat t gcag ttttttagcc gaggt t ggt g gct gt t t ggg at ct gggat g t ct t t t t t t t aaaat t gct t 120 180 240 300 360 420 480 540 600 660 Page 277 12689250 Sequence Listing.txt cacaag gaggccatta catccctctg aagcaatcag atgaggaagt gggt gat gact ct c t t t t act t gt gagt gt t ct t act t acacca t gcact aat g ct t t gt gaat cagct gt gac t t agt gt t gc t gt gt at t t g at t t t ggacc t t t gat t t t g ct gagt gt t t gact gcat ca t t t ccat t t t t gt act acca at agacat at caaaat aggc gaagccact g t aaaagt gt c cgt ct ct ct c t caat t at t t caaaccct aa act t cct t ac t t caaaaat a t t t gt t gat t ct gt ggt gcc ct t ct t gaaa gt t t ct gct c t accact gat agt aaaacag gt ct t at aat t at t aat gct at t gat t gat t t t ggaat gt t t t t gact cc t t cat cat t c aat t accaat gcacccacaa gcaacgggcg agct t t gct a aaaagagagg gt ct t caagt ttct t gtct t gaaact gaag aggt t cacat t gct ct at ac ct t t cagct t t gt aaat gct t t caaat t cg t cgct ct t t t cat cgt gct g t t t t at t t aa at at agat t c t t ggcat t ca acaat cct ca at t t t cagt t aat t act t ca cacat aat at at act act t t aaaaaaaacg aggt t cgaac aact gt ggca acaaagt t ga gaacaggtga t t gat t gat t at ga gacaaaact g at at t acat a t gggacat ga gt cgccat gt gt t cccaaaa aggt t gt gct ggaagt t t ga aaact t gt aa tgcaagagga gt cct gat gg gt t gat gaga t ct ct agt gg acat caagat t t t ctctct t gat gat aaag t at gaaact g t cat caaat a t ct cgagt aa t t t cctaggg agaagaaagc cgat t t cgt a caaaggcat g t agagat aaa gaacgct gac ct ccact t ct gcaacaagt c t ggaggt ggt agct aagt t t cct ggaaaga aat t ggt ggc gaagaggtaa gtgt t t t t t a aggtgaagac t t agat t ct t t t t aacat t t ct t gtt gttt ct t t t gagaa gggaagaagc t t t aaaaaat t t t t t gcact cgt t t t ccgt t ct gt t t caa caaagcagct t gct t ct t t c t cat ct aat a t ct t at t aag caaccat gt a t t gat gt t at caagat gcat tacgacacgg gat at gt aac gt gaaaggt c act aacaaaa ggct t t gt t g ggct acgt ga t gaacat gt c t cagaat t ga gt ct t t t caa aggaaaaaaa t aaaaacgct tacaaaaggg t ct t cct cgt aat cgt t t ct agat t cgcat 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 255 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 255 aaaatacgtc acaaatataa tact aggcaa gt t gt t gtaa aat t gatttt t tgatat t ga aatggtaagc ccttgtactg tgccgcgcgt tttcaaaaaa cacaaaaaaa aaataatttg ctcaatacgt tcaatttttt tctttctttc taggtaaagg gataaggata atggtttttt aaggataaaa aaacagtgat atttagattt tttgattttt tgctaaatgt caacacagag at aat t at t t aagagt t cat at at t t t aac ttttcttaac acat ggt t t c ctct t gt t t g ct t t gat t aa acacaaacgt Page 27E t at t at aagt ggacggatgt caccact agt ggcgt caaat t cat agct t t t t t t atcct t aaaagt cat t aat gcact gt caat agagt g gt at gcgcca t gt t t ct ct t ctgacggcgt gcat t gacca at t at t caaa gaaat t cat a cgccaat at t 120 180 240 300 360 420 480 12689250 Sequence Listing.txt catggatcat gacaataaat atcactagaa taattaaaaa tcagtagaat gcaaacaaag cat t t t ct aa t gt cgt aat t aaaat gt t ca at agaccaac cct t aat t ca ct t t t gagaa acaaaaaagg at caat aat a at acaaact a gcact cgagt actcgt t t t t gt gacgaat c cat t t cagga t aaaagact t gat t t t gat c aagt at t gac t at cccct ca t t t t gt t ggt at acaaagag accaaacact ccaagcccca t t caccaact accaaccaaa gct t ct t ct g t t gcggat ac gt aaaacagt ggaat t t cct aaact at t gg gaaat t cggc aaat act aat at aat caat g aat caccat a ct ct t t gt gc t t agat t aca gt t gt cct t a at ct t ct t ct t at at gt t ac gt at t acaaa gaaaact t gc gaat t at t ca agt aacct ag ct t at t cat a t at t gt t aat aact t t agt t t t t acat aca ct cacacaca cccacgt acc cct ct ccct c ct t t caat t a acaaact at a ct t t t at at t t t at caaacc cgggt t ggt c at acgt gt t t gt aact t t at t accaat aaa t agt acat gg at aact t t t t aact gt gct c agt t aat ct t ttttttacca t act t agt at act cct aaga t aaaacgaat t agct t t gt a t act at act a tgat t t t t ga gagcat at t t aat agat at g ct t t gt ggt c cgcaat cacg t aacgccgt t t t at aaaat c ct ct cgccga at gg cacgt aat t g caaagt ccaa t at ccgaat t t t t t t t t t gt t gaacgt gca gaagat gt ag t agacaat ga t t gt cgt ct c agat acat t a aagat at ct t tgagcaaaaa gt gt caat ca gtgagaacga t t gcgaaaat ggat gaact t t ct at gt t ag agcaact act agt cgt t t ct aacat aat ct tttctttacc t t aaat t t aa t acct t t t gc ct ct ct ccct cgat t t t ct c gaat t t cct t aacaat cggc gaagat ct t t tttgaaaacc t ct aaaaat t t acat acat t aaaact t t aa gagt t t at at agt t aat ct t gaggt aaat a agat gaaat a t t aaat cggg ct acat agt a at aat cat ac aat t aaat aa aat at gat t a tt cgt t t t t t t aat t ccact cacat cct cc t accaccat c cgccgt t t at cgt t ggt cct t ct t t at t t c accggaaaaa tttttttttt aat gt t t t gc t ct ccat at g ct t t aaacaa t t gaact t t g at aat t aaat aacat at aca t t gagt act t at at acaaga gaaat agt t g agt t caaaac aaaact t cat cat at t t t ga aagt gccagt t at ct cacaa t gat at aat t t aacat t t t c gaaat agaaa t cct acct t c aacaacaaca t at ct cat ca cat t t ct caa t t cct cagca aacaat at ca 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 256 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 256 ttttgattat ttaagttgcc aaccaagcta gaccactttc caacttttat agacaaagtg taagaactta gcacatgctt tccaacaagg aatatgaaac tttatttcaa aaagattatt gttaaagctt ttaatcattc taaaaggtga ttaatatttg aattattgga agtaacttgt t t t ggacata taccaaat t a atcaatagta tcaatggaca tataacatgt tgtgagact a Page 279 120 180 240 12689250 Sequence Listing.txt tatcaatgtt atattccaag ccagttaaac caatttataa tgataccatt caat acact t gt gact t at a agt t gat at g t caat t aact at t t aat at a t aaacat t t a at t t aat ct a aaaat t cat a t ct t t gt t ct ct acccgt ca gt at t aaaaa aaaaaccaca t t gt t at t t c cct aagccga gaaccgaat c gcgt cggt ga t t t t t at t t g agct at at at gt t t ct cct c t ct t ct t ct t t at t gt t gat agaaaat ct t t ct gagct aa t ct gt cgat c aat gact t cg t t t acaagt g caat aat caa aagt t gt gat t act t t t t t t t acacacat t aaaaat gct t t at ct t gct c cat t aaaaca t t t t t at t t t t caaaaaat t t at aaat t at aaaat gagag aaat t gct aa aat gt t t act ttcacaagaa gt ccgt t t gt gagaat aaaa gcct aact t t at aat caacg gct aaaaaga ggaaat aat a t cgcacat gt cat t t ct ct a caaggtgagt t ct ct at gcc tgt t t t gggg t t ct t aaggt t ct cgct gt t t gt at gct cg aat t ct agt g cat at gaagc ct gat t cgt g t t agt gaaaa at t t t ct t ag caaagat cct ct at t cgggt t t t at aacat at t t t at t t t cacct t acga aaaacat at c t t t t gaagct taaacgacca aat t t at aag t aaacaact c ct gct ct t ct t t t gaacgat agccgt aacc gt t t agat ca gtgtgaaagc gaagaaat ag act cgt t t cg t ct t t ct ct c ct ct agat cc gat t t cgct a gt t gct t gt t t t at gt gt t a at t t t t gt t t at t gat ct gg t t t t ct cttt gat gt t t gag t t t act t cat at gg aat t gggt ct agt ct t t tag t t t ggt ct ag aaaaagat at t t gt cacaat t t aat t t gt t gat aat ct gt ct t aaaat gg t t t ccgt caa t gt gaagt t t cat agat t t t agt t t t t at t gt ccgaacca at cagt cacg aact caaaac caggt cacca aaaaaaat aa ct t t cct t ag t cgct gct t c gt t cgct t ga gat ct gt t t a at gt gat t cg gat ct at gga t t t t cagt ga t t t t aat ct t gagat ct gt g t t t caat aaa gagct t at cc cat at agaac t gt gcat t cc aat t t t ggt t at cacgaaaa aaaat ct aga t t at at t t t g gt t t t t cttt act agt caac t t ct cct t gg gaat t at gaa caaaaaaaca at t t t t ct at caaaagccga gctcccgggc aat ct aacgg t agcat t gt c aagagtgaga t gt t agct gc t cgaat ct t c t t t t gct gct gcat gcgt t g at ccgt gct t gt t t gaggat agt gaagt t g cgat ct gt t a aagt t t gaac cgct gct aat aat t cat t t c t at agat aaa act cacaaag t caaaat cgt t caggaat t t ccgat caat c caat agt gt t t gat gat gca t t ct acat at aat t ggccaa t t gcaacagt agacgaaat c gt cacgagaa t aat agt t t t gccgat aaat t aat t cat t t caacat agac t ct cccagat aaaat cgt ag t gccgct gt t t gt at cat ct cgt t agt cgt t ggt t t t at g gt t ggat cga t ct t ct cgct t t t agt t cga ggt gt t gat g ct agt t t t ct ct t cgaaact ggt t t cat t t 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 257 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 257 tcaggggcac agtgaggaac ttagaggact cgtcagttaa aatcgaagtc gcagtgaagc Page 280 12689250 Sequence Listing.txt agctcggtaa aagagggttg caggtatgtc tgttttttcc aatcttgttc tcaagatctg t t act cat cc aat gggt t ac t t ggct at t g t gcct aaccg acct cagact t ggagt t t ca t aact t t t at t t t ggat gag t gaaggat t a gat t gaaaaa at gggt t at g t ggggt t at g agacccaagg aagt t caagc aagct agcag agt gaggt t t cagct agt t c ggt ggcggt g t gact gct t a t ggagat t cc aagat aat t a t gaact t at g ct t at cagag t aat gt gt t a aat aat t at g agaat at t ag cgaaaaagaa agt aaaacaa gaat aaagga agaaaat ct g gaagaaacca cgt t aat ct c act ggt t t aa ggaagt gaac t gcagaagat aagt gt t gaa gagaat cgcc ggt aat at ga t t at t ggct t gact ggaaag acccat gt t t caggt at gt t cagcacct ga gagt ct t cct gagagcagaa t cat at t aga t t gt ggccaa t agagat ggt cgct gaacag aaggaggt t g t agaat t caa gaaggt t gaa t at at gaaga aat agt aacc t t t t gt t cca agagt t cct a ggct t t aagg t cgat aggt a gt t t ct agaa t aggggt cgg aaaaaaaaga t gct ct t t ca aagaagaaga aaact caat c cact at t at c t t cct cggaa gat gaacgt g t t ccact t at caagat gcag t gact ct gct t t cagat aat caaagct ct c ct act gat gt tttgaacggg at acat t caa ct at gagct t gct t ct agaa t ccaaggct g cagat gt t t g gaacaaaatt t gt aaaagct gt t t ggcaag cct ct t ct t a gat t ct agcg t t t gt gaaag acagct aaat aagct gat ca t t aat t acca t ct ggaggac ggaaat t gat t t ct aat cac t t t gat at t a gct ct ct t t t gat cccat t a aaaagct ct c at gt t cat t t ct t g t agt t gagca gaat ccaacg cccct cggt c ct cgcggat t acgt t act ga at t ccgcgat agat t t cggt aagt t ct gca t t t ct t t ggc accggccgt c at cact ggga t gggt gagac gaagggaagt gt t aggaact gt ggaagct t tcgagagacg t t at ggaat c cagcaaaaaa agat caagat at t ct aaaat t agt aagat a at ct act cgt t aagt aaat c t act gaaat t at t gct t gt g at aacat aaa ct agaagat a ccaacaagaa t cacaaat cc aact t t ct t c gat gt t gcag t acaaact t g gct t t t ggt t act cacagt t aacat acct g t ct cat t t ca t t caagt cct t t ggct cgt t t t cact t ct t t t t at t aggt t cacat cgaa ggcgaccggt ct t at t t at c acccgat caa caaaggcacg ct t caggaaa cgagaggaaa caaagacaat aagagt caag gaagct cat a gat gaaat at t gt at at t aa gct t aagt gt acaaacat aa t gggagaagt gaat ggagga t agggt gaat agaaacaaaa acgt agagag at ct ct ct ct gat t t ct cag gggcat aagg gt gaaat t gc t at gaat aca ct t act t ggg cat gaagaaa ct ct ct t gac cgaacat t ct t aggt ccat c t t ggt t gcaa t gt aggt aca aagcgat gt g ggacagaaac tgacacaagg at cggt t caa t cccaagat g tggcagccca gaacaat ggt aagagct t gt aaaggt t t ca t acat acat a gat t t t t gt a gcaagaacgg at at t t gt gg at aaaat gaa agt t ggaaaa aaaaat t gaa at t t gggaaa aggaaaat aa at at aat t ag ct ct ct caga ggaact ct t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 Page 281 <210> <211> <212> <213> 12689250 Sequence Listing.txt 258 2004 DNA Arabidopsis thal i ana <400> 258 ggt t ct t agt gat t t agt aa atggaggcag cggcct aacc t t t aagct t a ccaaacgacc at gt act gag aacat caaca gcat aat t t t acgt gt gat t t t ccat t gat agat gaaaac t caccat t gc acaagaagaa acaaat ct ag at aaaat gt t ggggat t gt g gacccaaaat t gat caaat c ttttggccgg t gt t cgt at c t t t t gaat t t cacgt t aaac ccaaattttt agt t t ct aac t t gaacat t c gact agccgc at at at at ac caaagaaat a t aact caaat at t cgggt cc acat t t aat a t gaact aat a gacaacgaaa aaat ct ct aa aacgagt gt c ct t ggcgt at t t t ct agaat cat gt cat t c t t t aat t aga t aat cgat ct cat at at t t g t gat at at t a aact at ccat gct cat at t c at agcat t gg gt acaagat t gat caat gaa aacat t gt t g act t ggaaaa t ct at cacca cat t t gt t t t t gaagct act cggccat at t gact aat t at caat aaccac t acaat t ct a t at t agat t a acat at at t a gagt t t t ct t cacgaat aga aacgt aaccc gaaaaat gat at gcaagt ca at caagcat a gcaat t gt t g t aat acat ag t t ggt aaggt t agt aagaac cagt t aacat gaaaaacagc aacaactttt aagt agcaaa t gt aaat ct c ggt agcaacc aaat t t ct t a at t cat cagt gt gaaat aac t at gat ggag aaat ggaat a aaaaagat ca t acacat t gt ct t ct ct ct c ct gt cct t t g act t t aaacc aagacat acc t at agact aa ct t at at t gt t gaat gcgt a aaat gt at t a acgt agat gt act t at gt ag accacaccac t at t t acagc acagat ct aa t t aaact ct t t t t at gt at t acgt at at ga agcagt gat g cat aagagt t acat at at ct acact at gca t at t ccct t t aat t ggt aaa aagt t t t t gt gt at at ct ca cgt t caaacg aagagt t aaa aaat aat at t aat gt t cat a t aat cat aaa agt cacat gc t t acagct ca t act ct aat c ggt ccaccct gaccgt t t t c gccgct at ac aagat t t caa gt agt t caaa t gaat acat t ct at at aat a cat at at t t t caaaat gt aa agaaact cat t t t at t agag aat t ggaaag t aagat t cgt at aat at agt caat t aat t a t gct t aat t a t t aaaaagt g at at at at gt aaagat cgt t t at at t acca at t t t agt t a aat gcat aag gcat at gat a aat aact t ga at cgt acagc gt aaat t t at gaagaggtaa t gagt cacat aaccagt aac cact t gt ccc t cat ct t t at ccaccacct c t act aat agg cat gat at t g at aat t t gat t aat gagaac at acat t at a at t t gtctta agt cat t t t a acacat gaaa t aaat agt at gct at t gat t at t at t ggca at aat t ccac t caccat ct a at gat ccgat ttaaccaacg t aaaacgt aa t aacgt gt t t cat at at ct a aaat at t t at aaaatgaggg aaaaat gt t a caaaaaaaaa t t gct aaaca at t gt gaat c caaaaaaaaa at aat agagg ct t caaaat c cacaat cat a t cct t at at t gt t t at ccct t aat t t caat aaaat ccgac tcaaagaaaa t gat act gca at at cggaaa gt at at at at gt at agt act ggcacct t cc t cat t t at t a t t t t t tat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 282 12689250 Sequence Listing.txt aacttaagtt atatgcatgg atacatgaac gatactccta tataaagaga acagcattca aaaggtctta tcatcttctt cactaaacaa aaaaaaaccc ttcaaaacat ttccttattc tttcttcttc atctacaaca atgg 1920 1980 2004 <210> <211> <212> <213> 259 2004 DNA Arabidopsis thal i ana <400> 259 aat t t ggt t t at cacgat ca ccaggacaat t gaaccat ct t t t t gggttt agccaagttt aagaagccaa at t t gtgttt t gt t aat t aa t at gt aat cg t gt cggt t ca agt t t t t t cg taaccaaaac t cagt t at t g t t t t gaaat t aaaact aat g gt at ct cct a agact at at t t ct aaaat ga acct accat g accacaat ac tggaccacac t t t gagat ct ct ggcat at t agt t t t t gag gaaagt at t a gt t t t gt t gt gaaact caaa agaaaacagg gcaacat ggt acaat gt t t t t ggt t t t ccg aacagaaagc gact t at aag ttaaaaaaaa t at ct cat t t ggt gt t ggac cat acat gca at gt t t at t t aagaaagt ac cat acat t aa aat at ct agt gcgt t t t t at at t t ct t t t c t t cat at t t t ccagccat ga at cgagagaa cact aaat t a gt aaat gcaa t gct at agca aat t t t t t t c t t t ct t t t t t t gat aaagaa t aaagt at ag accat t aat a acaat caat g tgtcgagagg at at ct ct t t t t t ct at aat t t aagcct aa act t t t gttt at t aat t t t a ct t cat at ac at at aat agc cat t gt t acc t gacaagact caaaccagaa gagat t t ggt gt t caagaca cagt t gaacc at gat aagca gaat ct t gt g ttgcgagaac ct ct gaact c t t t ct caagt t at t t ggt gt t at ccact ct tttttttgga t t t t t t t t gt ggat t aaaaa ct aagt aaag t aat t ggaac gt gt t t ggt c t t gggt gat t ct ct at aaat ttaat t t t gg ggt t gcagat ct t t t t t ctc t agt at t gt t at accaaagc t at ccat gcg ggaggaagaa acgaat gt aa gt at t t cgat t t t cct t gag t caagt t t ac aat at cacaa ct aaat agat aaacagaagg t cgaact caa at aat t t at a t t gat t aaaa aacgcaat aa gt at t t gaag at aaaaaact taaacaaaca aaaaaaaat c t at gt t at aa ttgggcaggg t gt acgaaga act t gat cga cat t aaacct gt ct t cct gg aat aacact a tttttcatgg gcgt at at at cat cat ggat t t t at cgct t tggaggacat acat gaaaat t at t cgt t ct tttagaaaca at t gct t t ga aaccct caat gggat ct t aa t at t gt t gt c aat t t t caat at t ccat gat aat t gaaaat t at at gat t g agaacaaat g t t at t t t t at t ccat t t gga t t ct t aat at at ggt gcat g agctggggga gccagt gact acagggtct t ttgt t t ctct t t t ctacttt agcct gt gt g t t at t gat t a aaact t gt ca aacaat t aat gaatcgacgg gat aact cct aaaact cgac t t at t t at t t ct at ct ccac t tat t t t agt at t cat gt at at t gaaat ag aat t aaaat g t cct at aat g aat t ct t t t a acgaaagt ga agaat ct caa at gt t gcct t aat t gggaaa aaaaat t aaa agct t acaat at t t t t t at t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 tttgattaaa aagtggtaaa tgatattttt ttcctccatt ttgcattttt acactttgta Page 283 t gat ccaat t accatt act a ggt aaagcat agt t gacgaa at aaagct ca cgaagaggaa tgct t t t at t aaact aaaat t at t at t t at agtttttttt caaagagaga gaagaagaag 12689250 Sequence t at ct acat a t aat aaat ct t at aat ggaa aaat at t at t tttgcttatt ttaagggcta att aattt at aaagcacaac attgagaaga aacaaact cg at gt Li st i ng. t xt ct at aat aaa ccat t t acat atgt t at t t a t t gt t act t t ataattaatt gaaattaagc atttccttgt ctacacgatc tcggagaatt cagtactcgc 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 260 2004 DNA Arabidopsis thal i ana <400> 260 ggagctgcaa t gt gaccaca cgt t at t t gc caaatt gt ca cat caggact aaggt aaat a gcagaccct a aagat ggat t t ctt aaaccc cagggcttt g ct t t ggt ct c t caat gt gt c t t t t gcgat t t gggt t t gt g aat gct act c gt ggat t at t aagt t gacac aaat cgaat c tttgagagaa gat t at agat agact t t aga ccaccgt cct t ct t cact gc t at ct cacaa t cat caaggc t ccgt gact g agt gct gt t t gggt acgt t c t at ct act cc t gacgcact t at gt gt at t t t gact gcaga t t ctggagca t ccgaat aag t t t gat gagt t gat t t ggt t at t t t t t t ct caat t gt gat t t gcagat t c t t aat t t t ac caaaat at ac at aaaccgt a ct t t t t t at c t ct act t ggt t at t t aat gg cggccccaac gct t t aggcc gagaaagtt c gaggaagttg tctagtgcgc tctctgctgc tagctctgct ggt cct t gga t t t t gt aaca gt t t ccat gg t t ccct gt aa acaaact caa tgt t t ggtta ggagctcaag aaat t t t cat aagtgtgt t t at t gt ccct g t acccat gaa gct t t agcaa t t gtgaggaa at accaaaac t t ct at t t t t at ccaaaaaa ggcaat ggt c t t t acct t t t aat gact aat at t ct acaaa gt t aaaaaac cact caacat t cccgat aaa act ccagagg gt t aat ct aa gagt at act c ctt gt cgcaa at gct cat ca ggcct t ccga gaagagaagg ct at aaggt g gaaagat ct c gt ct gt t t ct aagaagaaaa ttagcccaaa t ggt gtt at t caaat cact a ctt cacccaa ctt agatt gc cat t aat aag tcct t t t gt a gagaat caag agt gggcat g t t t t t t ct ct tcaaacaaaa at cgt agat g t t cat aat aa t at gagt t ct cgat ggct ct t ggagact gg at t at gaaat tt gat gaagt actt ggcgt a t gagat cagt caaacact gt aaaccaggat tt ggcagtt g att gat gagc gt gtt gcaat gat t aaact g at at t t t at a t t ct at t t t a gggt at acga aagt aaaagc t t ct cacct t accagaat cg agcat aagaa t t cgat ct ct agcaaat caa aact ct t ccc ttct t t gat t t acagcgtt c agt at t gt cc cct agatt ct at caaggaag ct cat gcct c t aaagcctt t tt t at aacct atcgtttttt cat ggtt at c t ct t cgat aa tgcaaaaaga aaat t ct aat accaaacacc cgact at ccg ggat t caaag agt ggat t t t t aagccat at agt t cat gaa gat aagagt a t at aaaaaaa agat gagat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 284 t cggt aaaaa t t t t gt ct cg at t gt t t t t g aagaagagt c gagagat at c ct ct caat t g t ct ct gt t at t t at caat ca at cat ggaac t gat cat aat aaacgct at t t t gt gat t at gt ct ccgt t a aagt cact aa cact ct gat t aat t t ct ct t gt t t t t t t gt t t ct ct gt t a aact t ggt t a aact at gaag 12689250 Sequence ataagaacac acgccactag tatattgcac aattattaac gatctcaaat ctttaacttt aataactttc acagtctttg tgagttctgg taaaaagttt gcttgctctg tttaaaagga agattatttt ggattttagg t t gt t at aga act at t ct ct gagt gt ggaa t t t t agct ga at gg Li st i ng. t xt t t gt ggat aa at t cat act c gcat acccaa at t cgaagat ccagct t t t g t t aggat t aa act caaagat ggaggaggtt t t t gat ct t t agt t gt ct ac t cat at gggt aat aggaaga ct t t gt cgcc at t t t t gt t t at cgaat ct c t acagat t t c gaggt ct gaa gat t cat ct g <210> 261 <211> 2004 <212> DNA <213> Arabi dopsi s tha i ana <400> 261 t gat gt act t gagcaat aga t gct g( acattttaac taatagcaat gtcaa t gacat t t aa ctctaaaaga ct t ac aacttgcttt atttcttgaa agtta agaacaaaca atccgaaaat gttta gcaacgagga aagagccaac gagta( gaccagcttt cacatagaaa cctttt accgtgaatg aacctacaca atcaa~ gaagtatact gcaaacttta aacaa! caatgacgaa t aagccgat t agaat t t gat aact a caat t ccat g aaacc~ cacctctgtt aat ct ct gaa gat act ct t gt aat ca gcgacagt ga gagt g~ at t ggaat ga ggaaat ggt g aaat c agcgcgagaa gaacggacga cgcctl tcggaagatg gacacggcgg tggcg gactgtgaaa gcaaagatcg gagaci ctccattttt gtggattctt tgggtt acggagaagc tacacatttt ctaac aatctaatgg gccaggatcc ggtta ccgca at t ggt t cct cat caaaaga t aaat at ct g aacct at ct c caagg cagac ct ct t at t ac aacga gcaat gt aat aaaac cat c at ct a aat ag ct ac ct gcc t gga t cgt t act gact a ct t ccaaaaa t gt gagat t t aact gcct ac t caact at aa tcggaacgag tttcacacag ccaaat gt t a cgt at cact a t caaact aca ttaaacacga ggaat ct gct caagt agaga agagat aagg agccgt cgct agt gacggcg t ct cccgaga at t at t t t t t t gcaagt cgg at cgat gt ga cagacgt caa t gacccaagt aagat aaaca cagt at t ct t ct t caacgt a t t t aagaat t ccat at caca ct ct aat aac agaaaat cgg at gt aagat a ggt aat ct ac at aagagat t aagat acgaa at t t t at t gg gt t agagct a at t t t gaat t t cgt agt aac gt cggat cgg t t t t aat ggg aaaacaaat c t act t gt cag aat t aggaat acct t gt cct gccacaaact cgt t t agcag t agt act agt t t gcct t cac cct aat gcca t gt aat t gt a gcaaagt at a caat gaat ga cggagt agat aaggt gagt c ggccggcggt tgcggagaat gaagaagagg at t gat ggac ct aagt aagc 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 tgggcttggc aaatagccaa atataaaagg ttaatttagt caaaaaaat c t ct caat t t a Page 285 12689250 Sequence Listing.txt aaat t aactg acgtaaatcc ccct t cagta tcaatactgt aaaaat t gga tagacacagt aaaacgcagt at ct cat t t t t t agt gcat a ct t t t t t acc at acggcaaa aat t act aac t t aaccat t g t gggt at caa t ct t gt gat a t aat aat gat agcct cgt cg t t cact aaac gttttacaga atct gataaattaa agtt tatgtaattt aagt cactaccacc tgtt1 attattcatt aaat1 at at atggaa cct c gactaaggag cttc( tttttttttt caat ttttacgagc gaat actccatata aagg; t ct ct t cat a t at t( agaaacaaag atgg t t t t a ttttt gt acat t cgct t caac aaacct cacaat aggt aa at cgt c aaaaag at cgat t t ga ccact t t gt g act gt at ct c aggct t gct g t t t acgt t at ct t aat gt ag ct act ct aat gaaat caaat gacat aat at acagcaaat a cat cacacaa aat t t t aaag t ct gcaacga gact caaat a at acaacat t aaat at t aat ct aat aaagt cgt t ct acat aaaact caca t gt agggt ca act t cagaga cct aggt aaa at acaacct t at gt at t t t t ttttacaaaa aaat t t t t at gt at at ct ca at ct t t acga aaaaat aaaa at at aaacgc 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 cgt ctc t t tgt gt tct t ct tcct cct cagat tct ct <210> 262 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 262 tccatgaata ataaatttat gggttl cacatttacc ttaatatttt tcagcl atcttgtttt atcattcagt tttgt aacaaatcag atcacatata ttcat ctgtaatgtt tgttatacac aacaa! ggt cct aaca tctctcaaca gggt t ttctcctcct cctcctcacg tttat tatttatata aaat t ct t ga catgt ggacct t caa ttttttatgt cagat gatgatgttt agcaataaat gatgtl taatttgcat ggagccattt atttal atactacatt accatatttt agggt ccacatatgg ttgaccattt actgt1 tatataaccc aatacacata agaatl cactagat t a aat t at t ccc agcat1 ttcctaatta cataat t ggt aatcal cctaattatc tattgacttc acaat tggt t at gt a tcgat agcat gacaa ggt gt at gaa agat t ctt c tgat g caac cgacg gagt tagt t at ac tgttt aat cc gt gt t t aagc t t at t gt at t t t aagct cct t t aaggacct tgat t t t t ga ttttgggccc at t t t t cacc at aagat t aa cat cgt aat c at cat t at gc gct at t at t a aaat gggcat t agt cgt gt t agcat t t gca t at at t at aa aacgcacaat at gaat t t ag cat gt t gt t c acgat at at a t ct t at t aat agt gt gt cac ctgggcccag aggtggaccc t at at aaat t gat gaagt cg at gcat gggg t gt t at aat t t t at gct cct gacat gt acc t ggccct ccc cct gcat aga t cat t gat cg t act t gaat t agccaagaca t ct acat ct a cagt ggt gaa ggt aaaat aa acgt gt gaac cccacacgag t t t t atctca t aat caat t t t t gt caat t t gcat gaacat t t at t at at a t t gt gaaaaa gccgcccat a caccgaagat aaaaaat t ag gaaggaatgt aacat gaat c aggccacagg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 286 12689250 Sequence Listing.txt ggcacatggg aagtatagta cgaaaatgct atggcagacc aggactttta act t t aaaat aaaat agt ct aagaaggaca ct t t cgat t t at t t t cat at t at at at t t a tt ct gt t t t t gt act aaaaa t t t gat t aga gt t ggcct t t cat cat t ct c cat cacct cc acacgat cca t ct ct ct t ca agaagccgaa aat cat t at a t t at t t gaat taaaaaaagg acgaacgaat ct t t cccat c aaaccat cac gtct t t t t ct gat at t t t t t gaagaggtat at t t at t t t g gcccact ggc tcccgacaac gt t caaat aa at at t t agcc aacgt t t tag gaat ct cct a t cgagaaat a t caat t agt a caagcat agg cgaat ccacg t gt gggct cc caat t aaat a t ct t t t t t t t caaaagact c acct t t t t at ggt ggat aaa ttggccgcaa act cct t acg t cat cct caa ct t t gt t t ct ttccccaaaa ct cgt t gt cg tga t t t gtcggca ccaat aggcc ggaat caaaa at t at gt ct t aaaat aat ct ct t t gct at t gaat at ct t t gcgagt ggt a ttcgacacag t t at t ct cag gt ccaaccaa t gat caat t c ct acacacat aaat at caaa at cggat cac t t t t at aat a t t gt gaat gg cacat gt t t g agct gt cat t ccgacaaaac at t t t t cat c t gccgt aaaa aaaat gat t c aaaaggagag gaaaat gacg t cccagcat c cat at at aat ct cacat ccc agaact aaaa ct aact aat t gt t cgt t t aa t aagaaat t g cggaccccat aagct gacca t t t t aat t cc act ct t t at t aaat t gat at act gat t aaa at at t t caag act gt gccaa ccat t cgt ga cacct ggcaa t t t at accac cct at ct ct c gaaaaaccct act cgt t gat 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 263 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 263 ctagagttta aaaagagagc aaccaagtgg tcggtgcctg gattattctt taagtgggat agaagtttta ttgtgttttt tagaaataga tctgatgcag gggaggtcat ttttaactca ttattttatt ggtcaaactc ctaaagttag ctttggaata accat t agca ataatataag cttgtaaaga gttcttcgat aatctaatca gtagaaat t a gatcaaaatt agcaagaact aaaaactgat ttatgtccag aaaacctctc cattcggtag tatactttcg gtatgaaatc taagagtatc gagtcagctt ctaatatat a taatctagca catgtgt t ag gaaaaatat g gattagcagc attcttttac taaaacagcc cat at gt t cc aagt t t t agt aat gt at aga at gact cat t ct t aat ct t g t ggt acagca t t t at aaaga t gcagt acaa t agccaat ca aat t at t aaa ccat gt gcac ttttcaaaga at at t at at t gt gt t ct gat ttaccccaaa aagt t t at t c cat gct t at g tt gt t t aaac at aat accat gt t ct t aaat t t t t caaat c aact gaat t t acagagaaaa t gcagt acaa t aat t aaaga cat ct at t at t t gagt t t aa agccgt aacc t t cat t acaa ttgct t t at t t t caaagt ct gt t t t aaat t aat caaacca aaaagaaaaa t aaaagat aa cagaggaggt ggt at at gac aat acaagt t at at t t agct 120 180 240 300 360 420 480 540 600 660 720 780 840 ttttgatttc cattctcaga atgaaaaagt gtgcagctat aattgacatt gtctatgtaa Page 287 12689250 Sequence Listing.txt ct at gt aaca ct gaccagaa at t cat cacc t cggat aaaa t at agct aag gcaaat cat c tttcaaacaa ttgt t gattt acat gat aga aat t t t at aa gt t t cgat t c aaat gat at t t aat t t t gca cagcgatt cc t ct cct t cgc agggat gaat ttctgat t t a gaaat aat ct aat gact gt g cat gcgacac cat aat t gac ct aaaccgag cacgt at cgg act caagat c aaaat t ccat cggacct aag t aagaact aa aacat gact a t t t t ggggt a aat t caaaca agct act ggt aggat gct t c at gaggacac agct t t t cct t gt at gct t c ttgccaggaa ggacgcgaga t at ct gat t t gacct aaat a accaacacca aaagaat aag aaaaat ccat at aaaaact c aagaacgcaa gt cat t t t ca gt t t ccgct g t ct t gat gat caaagct at t agaagt agca t agagt t act gct aaaacag gat t cgt gat t cgt act t ag ccgcct cgat t t t ggct t gg gggagaagcc gccgat at ga tgt t t t t t gg aaat aaaacg at gt caaaaccggg aaccgaacca acct cct acc at cagat t ga gaaaacaaga t t t atgt tcg ct aaaact t g t aagat t cgc cat t acact c t ct t caagct cat ggagat t ggat ct gcat cgt ggcct cc t agt caacgc cct t accagt ct aaccct aa at at at at ac ggt gt t at ag cagtaggcag aaaattctat cgtaattttt aat ct t acct t agaaat cat caat t acgca t gt ccgt gca aacagaaaca t ct gaaaaaa t aaaccagat t t t aacat ca t cccct gat a cacgagcttt cgt ct t aaga t t ct ct t t t agcat cgat c cat ct t caat ccct aat t t g t aat t t t at t tgtgtggaga aaacgcat ca aagaat cat c cacccaagt a at ct t ct agt t acgt act t t gagt act t ca gaaatagagg gacat caaca aaagat aaca ccaggt t tag gat t cct cat cctggagccg ccggt ccgat t t cgct t gaa ggcggctaaa t gct ct act c t t t aaaaat g ggt gcat ggt gt cat cacca 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 264 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 264 ttatccataa tgttttatat gt t ca aaataagatt ccggtggcta tgaagl ttttttcaaa cgaagaaaaa aatgt tatttgggaa gtacatttta aatgt cattagaaat aaaattttaa agagt atactaattt aattaattaa attta taataaccaa tcaaaatcaa cat aa< aat t agaaaa acgaaataca t t at t ttagatttga tatctaaatt ataat atctttatat aattctattt ttacc caat t at ga caaag aacct aaat t at t aa aagat aat t c at t aa gaaaa aggaat t t gg at aat cat t a aaaaaacaaa t aat t t t t gt at t t t at ct c aaaacgaaat t t gat at ct a ccaaaaaagt t t t t t t ctt a at aat at cat Page 28E gt t gaat agt t t t ggt caaa t acaggt t at aagt aat t ac t t t gaggat t acat t at t aa aat t t t aat t aaccaat t aa at t act at ag cat t at agaa t gaaaacat g cat aaat t t c cct at t at at at aggt at gc aaaaagt caa ttttcaaaaa aat t t aat t t aat caacaaa t t cact t ct c aaat aat t ag 120 180 240 300 360 420 480 540 600 v V 12689250 Sequence Listing.txt agttttttcg catatgccat aatttaaatt tttaaaaacg aatataaatt t gaaaaaat a t cat aat t t g at at act aca at cct agaat t ct t aaat t a at caacaaaa cat at act t t cct gcggtat cat caaagat aat ggt t cag gct t gat t t a t t t aaagaat t t t t at t t aa aaaaaggt ag acct t ct ct c t t ct caccat t caat cct ct t t gt t gaat a aaaaagcttt gcgggt t t ca ggct ct t cgc t ct ccat t t t gt t cgt t gaa t t ataacggg aaat at t t at at aagt aat a t gacgggt t t aaaat caagc t at at acaac t aaat cat ct act gcaagt t t t cct aaact acccat at t a t caggcgt cc t gaaact t t a t aaaaat at a agaat cacaa ccaccagcca ct ct ct ct t c at t t t ct cga cagat t t gga ct t ct gaacg cggt ggt gat ctcggcgggt t t t t gt t t gt gaagaagaag t gaaaaat ag at t caat t t t acat at t aga agat cgat at t acaagt gt a t cacagt t t a t t acaat aaa aat at ct aga gaagcaaaat taaaaggacc aaaat t aaaa aaat gt t t ca ggcggt t aat aat agggccc ccact ct ccc t t t cacgt ct gaaaat t ct c aact t cat t t t t aat cgt gg agat t cacct t t caaat t cc t gaaaat t t g at gc at t t aagaaa at acat at t a t gt gat t caa t aat acaaga t aat ct t aaa ct at acat t t agt t t t t agt t t aaaaggct at act gaat a aaaat gat t c t agt t at ct g agaaat cat t at ct aat t at aagcccaaaa ct at at aacg ct ct t ccct g at t ct cat t t gt t t t t t t cc agagt t t cgg cct gct t cct cccaaat ct a aat ccacaaa act t t ct act t at t gaaaat t t t t t aat ac t ggt at act a cact aaacaa aaaat caaat t gaact aaaa t at t gaat ct t t caaat at c t aat t acgaa at t t t act cg ct aaaaaat a t at t at t t aa t ccaacct ca aacct t cct c aaact t t ccc t ct t t caggg gt t t t t aacc gt cgggt cgg tctgaggggg acaaact cac t t t t gaattc gt t caat aca gagt aacat g t t at aat ct t aagct gaaaa caggt gaaac t at agat aat aaaat aat at t at aagcct a aaagacaagt t at ct gt t ac tgagcaggaa gat t t t t t at t t caat t at t gt aaaat at t t ccgaat ct c ct t cat gcat gagaaaat t c aaaagt t t t g aaaat t t t t a gt cgt aaaag ttagccccga agat t cat t t at t t gt aacc 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 265 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 265 tttccatatt agattatact tctcc< aaggtatatc atctgatcat caacal tttgttataa tat t agt t at gtaat catacactta gacgctgaca ttt gt cgttctattt ttgttggggt tcaagl ttttgatatt taactcactc tattt( ttgtaaatat cattatacac ataat1 atact ctgatgtttt tgaca acat at cacg tattg tattgcataa gctta tctcattttt 1 tttca acgactaaaa caaaa tt cgccgact :gatt tgttcaattt Page 289 ct at t t aaaa acaat agaca t at t agct t t ggcgacct ggaat caat t aat cat t t t c taaacgaat a cgaagacaac at t at ct t ca ct t t t gt t t c t at ct t t ct a aaaacaaat a aggt agat t c t aat at aaat 120 180 240 300 360 420 12689250 Sequence Listing.txt tgtgccggca taaaatataa at at t gcggt t aaaaaaaaa aaaact t t gc cat gaaaat a t gaagaggt a cgtggcagaa t at t ccaaaa ct ct at aaat aaaaat gaga aat at at acc at aaaaaat c at at aat acc at accct at a t t ggt t gaat gt gt t act t t t t t aat gct a agat at aaat tgaacaacaa t gt t aat t aa t at aaat at t t at t ct gcaa t cgt aaaat t t aat t at gct caat t gaaaa ct gt at t gt c aat gt aaat a acagaagcat cgt t t ct act gaaacct gac t t ggaggat c tt cgcgt aat aagt at aat a at ggcat ct t t aat t caat a gcaaacact a ct aaatt gga t ct t t t aaat at aaact t at ttggaaatag agat t t t at a t aagaat t t t tttttgtcca tt aaaact ga cat t aat t ct t at aact aga tgtatgagaa t t cat t t at t gt t t t gtgt a at gacgtt at cacaaaaat c gcaaaggaaa t at aaat ggg cgt t gtgact ttact t t t ca gat ct ct ct c tgagt t agt c t gt gt aaat a agatagggcg tttttgaaac at aaagt t t t at ct aat at t aataaaaaac ggt gaaaat a at t aaaat ga aaaactcaaa t aaactt gt g t caaat aat c aaaaact t aa gtt gatt aat tttttaaaaa aaaaaaaat c ct t t ct ct aa t at ct gat at at gt t acat a aaacaaacga gacaaaattt at t t at t t ct ct t at t gggt cccgt t t gt g attcagaaac t ct ct ct ct c atgt gat t t gatt a at t gacacaa t t ccaaaaat tttattacca aaat t gaat t ccaaat at aa t at accct aa at caat at t t t at aat at t t gt at t at t at at t t gagtgc aaat gt gct a t aaaagt gt c at t t t gt t t t aat ct act aa t t cagt t cat accagtt aac aat t aat aga t aaatt ccga t aaaaat at t t gt cct aaaa cct ct aat gg at t t aggaat gcct ct ct cg tctcgatcgg aatt cgt cac at agcact aa agt at at aaa tttgtatcaa aaaaactttt t att acaaac at t ggaaat a ct t t t aaat a aaat t t at t t t gt ccat aaa t aat t at gt g tttgtgggaa acacaaaaaa at act at t at aactaggttt ccagt t aacc ctt aaat gt t at caaat t t a tt cat gatt a t t agat ct t a ggccat t t gg gct t gcct t t ccgcactgct t cgt ctt caa at aat at t t g aaaaaaaaac aaaact t t at aat t t t aat c t ct cccaaat aaat agt gga t aat t t t aaa ggaaacccaa gtt gaaat at ct aat t t at t act t gt t t t a t aaacaact t t t t t t caat a aagt t t aat g t ct at t aaca t t t agcattt at t gt t cgct gt gt aat cag aaat gt gt t c t gaagt att t cat t acat ac aactt gagct gacgtagaag t gccgt t t t c agct aaatt a agct t t gtgg 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 266 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 266 tctctgat t g tttttagcaa tttcgtggaa aagaagaagg catgctccag ctgatgtata gatagagaga gagaccagtg ttcctagaga gatatcacta atgtgaaaag taaatcatat tgtttaaagg tatttttatt ctttaaggtt catggtacca aactagtcca aaatctcaga Page 290 12689250 Sequence Listing.txt attttaaata cgacttatgg ttgaacaaaa aagatttgga ataaataatt caat agt gat cat gagt cat gcct t t gaga gcct cgagt t ggt t aat at a gt caaaaaat ggaaaaagca ct cgcacct a ggaacggcaa gt at aaggt c ct t t ct acct t caact t aat t ct gaat cat gt agat ct gt aggt t t gagg at t cgcgat c t cgt t ct t cg aggt t t t ggt caacgactt g tgt t t t t agt t t gaaaat t g gt gat ct t t t t t t gt ct aat agt ccagcag t t cgt ccaaa t ct at aat gc aact at gt t t cgat ct t cat aagct cct t t cgct cacaca t ggt t at aat gagt gagt ca aat agaat t t at ccgct t ca gcgt gt t ggc aat aaagaaa aat t t gt t aa ccgat t ccaa at t ggaaat a aaat acgt aa ttct t t t cct cgt t t ct ct c ct ct at ct gg t ccgt cgt t g gt t t cgt t t g t t gat ct at c at t gat t cat at aagagat t gt t t cgat t t t at ggat ct g aat ggt t ct t gaat acct gg acagaaaagc t agt agt t cc cggt ggaat c t at at ct gat tgct t t t t gt aat at agt t a t ct gct ggt t acaaggcgag t t t gat acat cggggaggcc gat tct t t t a acggcacacg ccat ct ct gt caaat ct ct a aat agt t ggg cgt aagt aaa at caacggct t t gtcaggga cct cacgaac aaat t t aggg t gagt ct ct t aggat aat gt at at ct ggag gat ct t cat t t agcct agct gat at t cgca tt gt t t t gt t ct at aat aat at ggt t t at a t at t t t t at t gt gaat gaga ttat t t t acg aagaaat t ca t t ct t caat c t gcagt gct c t agct t t ct t cccggat cca at gg t t t at t t gt t caat t t t gt c ggccccacgg t at ct t ct aa t t t at at t t t aaat t aaaat gct at t t at c t ggt t gaagg cagat agct t gcat aaccct gcat ct cat t t t t t ct cttt t t t t gaattg t t t t t aat cg aaaggggttt t at at at gt a aggt t agat a t gcat ggt t a acat gt at ga t ggt t t ct ac gat t cgt at t t t t cat ct ga t t at gat gga aagct cccct aat ct t ct gt t ct at ggt at caagaggcca t act t t ccag at cgt t ct ct gat t t t t gac at gt agt gga ct acgct acc acacgt gt ca gt t gt t ccgc aat t t ct act t gcggt cagg gt aat t t ggg cacgcgat at agt act t at a gct t cat cat t ct cgaaagt t agat ct gcg gt gaat t ct g ct ggaaacaa agt at ct ct c gacat cgt t c ggt t gact t a t t gcct at t g t cat cat t cg gt cct gat aa t ct t t t t cct gt cgaaaggt cggt t acagc ct act caaac ccat t at gt a t cct gagt ac t t t at aat t t cct cct act a acacact ggt aaat ggt gga gcagt t gaga gt gacagct c at ct t t gat t aaaagat aat tgcggcgaaa aat ggagat g t aaaaacct c aaacgct aag t aaaggcact t accaaat ca ct t gcggt t t t t t t ctctct ggt t t t t t t c ggagt t cat a acgat t gacg t t at cgt cct t acct gaaac t ct t aggct t agat at acaa acat t gt t gt t t ggat t t gt ggt aaaaaga att gaagacg gt gagt t t cc gt ct ct t ct t cagcgt gcac t ctt ctt t ca caagt cct gt 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 267 2049 DNA Arabidopsis thal i ana Page 291 12689250 Sequence Listing.txt <400> 267 gat t t t at at ccagt t ct ac aggcat ggat t cgggt gt ag accacccat c gt at t caaat at ct t t ccaa t t ggat aat t acaacct aac accat t aacc ccaccgccgg ccaccgt cgc ccaccgccgt gacaaccacc cat cgccgga gaat caccac ccggaaccac cct acccct t t ct caaaaag acaat aaaca t aat aat ct t ccat ct ct aa t caacaat ag t aacgcaaat t t gt aagaaa gaact t ggt g tttttttttt t aat ct t ccc ccat t t ct t c aagaaacatt t t ct t ct t ct cgt ccgcaaa gt t cgat gat t t t t acaat t gaat accact at t t cct t ct cct aat caaa t t accaaat t t aat at t t ct gt t cggat ac acgt at agga aggt t t ggat cgaat t acca aaat aat at a t gcagat at a aat at aacat accact acaa ccaaccacct cggaaaaccg tgccggaaaa gt caccat cg caaccgccgc cgccggaat c caccgccat c aagt t aat t a t gt t at t cat t at t agat t t ct t ct aaaac t at at at cat ct agacagt a t caaaat at c ct gaat cacc t t t acgagt t t t t t t t t ct a aacaagaaac gaagt cacgt t cacgct ct c t cacgt cgca acat t t ct ca caaaggat cg t t t t ct t ct a ccat t cgaat t ccgt t cgga cgggt at ggg agt t at t t t c caact t t aat acct t aaaat at gt aat at a ccaccgccgg ccgt cgccgg ccgccgccgg ccgcgccgcc ccggacaacc cgccggaact act accgccg gt cgct ggaa ct t aat aat t gcaaat cacc gat at ct aat t t t aaat at t rrt gagccat c t at aat gaga aacct agaaa t ct t cacacc caccact t gt acgat aaat c ct t ccat t gc t gact t t gaa t gggaat ct g cct ggagat a gt cat t t acg agagaagcca aacgt caat a t cggt t cgga t at t t at aga t gt cact aat at ct t at aaa t aaat t at aa t t t t at aaat t at t at t aca cggct aacca acaaccaccg aat caccacc accgccggac accgccgccg accaccat cg ccgt cgccgg ccaccaccac acat at at ag ct t at aaat t at gagagt t a aggct t gacc aat t gt gcat act t ct at gt caacacgt ga gagccaaggt gagct gt at t t gt t t t acac gcgt t t gcat t t t t t ct t ct t gat t t t gaa caat gt ct t c caaacaattt aat cat cgcc aaat at caaa t at t t t aggt t ct cggat cg t t caat at ac aaagaaat t a aaacaaaaat t aacaaact a gt t t t at ccc ccgccgccgg ccgt cgccgg at aacgacca aaccaccgt c ttgccggaca ccggacaacc aat caccgt c caccacct t a t gct at at t c ttagaaaaaa t t ct t aaat t t aat gt acaa gt aaccat at gact ct gt gt at cacaagaa gaaagaagag t ct aaat aga at at gt caca t at t accgga ct act t t cgt cct ct cat gg acgagt t t cc aact acaat g ccagct ct ca agt aacaaat t t t t ggat at gat t cggat a cgt t act t t c acaaaaagat acat t at t aa accaat aaaa cact act aac ccaaccacct acaaccacaa tcgccggcaa accat cgccg accaccgcac acct ccaccg gccgccgt cg gat cat t t t a t t aat t t t t c at gaccagt c t t t gt gt t ct gt aaat aagc t ccat at at a t cagt t t t ga t t t gagct cg at t gaaaaga t t aaaat ct g t agcat gcat gcccacat t c t t acgt t cca ct t ct t ct t c at ggaccaga gaat t acaat cggaagcgat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 t agagaat ca aggat cgct a t cgt at t gct ct cgaagaac t at gct t ct t ccagt t ggt g Page 292 12689250 Sequence Listing.txt t t t ggat ga 2049 <210> <211> <212> <213> 268 1803 DNA Arabi dopsi s t hal i ana <400> 268 caat acgt ga gacaat cgga gagaaagt ca gt ct caact a t t t ct t cct g cacaaagt gt t aagct ccac aacat agt cg cgcgagagtt tttcccgccc t ct t at t t t g t cgct gagt t at gagt cat c t ggt t t ct at cat at agcgt t t t cgt cct t aaacct gt aa gt t t t t t gca agt ccaaagt gt cgat gcaa aaaaagt ct c cact act act t act t t agt c tgcaacaaca ccat at t agg ct gact cact t cccaaccat tcaaacgccg t t t t t aat ag agcccat t ag t cat cagact gagagcacat t gt ct t t gag tgacgagaag t t t t cact t t gct t t act t a ct t t ggccga cgaat ct ct c tttaaacaaa gt ggcaacag t at at at at a act aat ct aa caacat at t t ttacaaaagc t aacgaat t c at cat cat at gt gat aaaga aaaggccgaa aaaaaaat t c ct t t ct cgga aact at t gat t t t t aaat t a aaaaaaaagg cct aat aaag aaaaaact at cccact cgcc t cgt at aat c cccacct gac t t aacaagat aacat agat a aagaagacag gccat t cat t aagaagaact accacaagca t gt aaaacaa gt ggt t aagg aggcgacgat caaccgct ca t gat gaat t t t at at at aca ct acaccat g t t t ct ct ct t at agaccgaa ccgat t aagc aacaat aaca gt t gt gaaag agaaaacat a ct t ct ggact ct t t gt ct t t t t ccat t t t g t gt act t t at t gt gt at t ac gcccaat cag t at cgt ccat aacacaaacc gaaagt gaac t ct t t gccct gaagt gaat c t gt t aat ggc t aaat t gaag ggct gt gaga gaagagaact caacgcat ac cat t at at ag cgt gt gcct g at t t t t t t aa gt t gaccaaa gt at at caga t t t t at t aac cccaact ct t ct t ct ccaag gct ccat t at t t t gcat t cg agat aagaat agaaaaagaa acggccgaga cacggt gacg t t at t t t t ca gt t aat at ac t t ct ct t t aa aaact t cat a agt t ct gact t cact cgct c acgct cct ca acact t gcgc t t at t ct t ct gaaaacat ga t t cgct cgt c gt caaacct t gct gcgagga ggaccaat cg gt acgcgcaa t agct aat cc ct aagt acat t at t t t gt ct gcgcccct t c t t t at agaaa aat agaacca at agct gaat t agt acagt a caat ct t ggt agt cacaaat caaaaaggca aaagaaaaat gacggagctt gt t gaccaca ct t t t t t cct t t t aat at t a gaaaaagttt agt at t ggaa t t aaaact ag t t at cacaaa ct ct acgaaa gt gggt t t aa t ct ct ct t t c t gat t t t gt a at aagaggag aaaat t t ct c acagaacatt aaaaacaaac cat acgcgag ttcagcaaac ggggt t t ccc ttcaaaacaa t ct ct t t cat ct t gaaacga aaat gaacat gt t caaagt a t gcagt t t t c gcct ccaaat at ct cat ct a t at cct t at t gagt t gt gaa gt t cccat cc t gaacgt at t ttgcagacga aaagaat at a t cgcat t t t a t t cat aaggc t gact agaag ccccaaaccc gact at aaca t at acat ct a t ct t t cccat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 Page 293 12689250 Sequence Listing.txt tttcttcaaa ttttccccat taaacaaaaa aaaatcaaat ctctctcttt ctctctctaa tgg 1800 1803 <210> <211> <212> <213> 269 2002 DNA Arabidopsis thal i ana <400> 269 aat t t aat t a aaaaat aat g aacat t ggag aaaat t gt t a gacat t t gca gctgaagaga t cct ct t aat cgcagccaag t at agaaact ccagt cat ag cat acat cat aat gt gact g aggt t aagaa at gt at at at t t aggt t t aa t gcaagat t a t cagaagt at gaaagt ct ca t cagat t t t g t ggaaaggt c aaat t t aat c ct agt aaaaa gcaat aaaga at gaggct t t aacccat t aa cct gaaacgc gat t ct t ct c taagacaaca t t t ct ccat g acacggt aca at ct aacat g cccgct cct t t t t gt t ccat at gagt cct g gaaat t t gaa aaaat aaagc caat t at aat t cat aagaga ggacat gat t at agaccaca at t aagat t t t t aaggat ga aact agt t t t t at t aat agg agagt t cat t ggaat ct t ca ct t gcaagag t t caat aat t aat ct t t t ca aaaat t t gcc at t caaaacg t aacact aaa agacaacgcg ggacgt t gca t cacgat gat t t t at t acct cgt cat gaaa aacgaaattt cat at cagaa tagcagcagc t ggact cgt t cgagact act t gaaat gat t at ccgt ct t g tgaagaccag gat t t at t aa cat gat ct t g act t t ggggc t cacgaaact t t t gacact c agaacaagt c gt t t at t aat at at aat t cc aacat t t t ga ttcaaaaagc gt gat t aagc ct aat aaat t tcgaaacaaa at t t gt t gt t aagcccaaac t agct t cacg gat t aacggt caaaat acag gat acat act at gacaat ca cgt aaaaaaa at ct aaat ct ggct t ggacg gaaggt t t t c ccaaccccaa at ggagacgt gtct t t t gt a agagtgagag gat at t t t ct ccat t t gt aa ct aaacct cc gccacctt gt at gaagt t gt agt t t t gt ga t t caaat t ca aat t t t ct t t aaaaat ggt c t act agacaa t t t caat t ac t aaccaat t c aaaaaacaca ccgt ct t caa caacggt at c cgct t cacga gaggcatgaa aagccaacaa t t at t aggt g aaaact t t gt act t gaaaac caat cgaaat agcgt gagt c at t t ccccat tacaaaagcg act agagt t g gt at gcat at agagagaatt t t cct t aat t at cacaagat at t acgct cc at ct t t cgt t cact t cat t t t t gggct t t t cact at t act t caccaaaat ct gaaaagt g agat t t cat a cgt t t aaaga tat t t t ctct cgat gccgt a t gcat t acgc gggt at t at c t t cgcgt gag cgagcgtgt t gt aaaat t ag at aat aat t g aat aat gat g t aacgt t t ga gaat ggt aag ct accat cag gt gcaagaat t t t t cat t t c t cagt at at a gt t t t t ctt a gcat t aaaga agaaaaagt a cgt act gaag at t gat cat t gt at at t t t c cat caat agg at t t aagt gg cat at aat ag ct gcagaaca gaaagaaat t caaaaaagt g ct t gt aaat a acat cat gca taggaaggaa gt cgt at tag acct aaaccc aagaact aac aat at at aat t t t at gcaga at aat t gt t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 tttataagcc actaataatt aaataaaaat aatatcagta ccgtgtaaaa gcaaaactag Page 294 t gt aaacat a ct t t gacat t t t ccct caac cat aat t cat accact acca 12689250 Sequence Listing.txt aattttcaat tggcaaagct ctgatccatc cgttttagta attcgttgac tcacttagac agatgtgtgc aagaaagaaa aacagcatgt tgcttgcttt gagaaaagtc accctctctt caattcttca attctcctcc ctttataaac taattctctt ctctctcaac aactacttgt ttagagagag tcccaccacc caat cccaat gg 1800 1860 1920 1980 2002 <210> <211> <212> <213> 270 2004 DNA Arabidopsis thal i ana <400> 270 cat ct ccat t atcggagcag tttaaaacca t gct gt t t gg agagtttttt aaaaacaaag t gaagt aaag at aaggt t t t cact ct t ct t at ct t t agt t acat ggat ga t t t t t atggt t t aat t ct aa aat aaagt cg accat t gt at ct ct at at ct t t t t aaat aa t ggct t at t t t t aaagt gt a ccct t t agca t at at aagat acat t acat c t ct ct gt acc ct at acaaaa gt t gt at at t ggaagaaccc at t agt t cgc aaat gt t aca t gcct cat ag gat t gt at ga gaagaagaag ct gct aagct t t t t at ggat t ggat t gaag ct t t t aagaa t ggat ct t ga ttgtact t t g t gt t t gaaga t t t aagt gt c aaagat at ga ct ct cacact gt t t agccga gat gcat cca aaacaacggt agt t t t at t g t t t ct gct at t aaccaat cc acat at aaga at aat gacca t gt gcgccat gt gt agcat g ct at gt agct t ct t t gt t t t t t t aggt tag at at at gagt aggaagagct at ggat ccca gccat at ct c ggagct ct t c t gact t t t ca gt aggat t t t t at t t ccat t cat t t aaagt t acaagt at a t ccgct t t t a gt t cgaat aa agaaaat t at gat t gcaaaa at at at at aa ct t t cct at a t t caat acga agat t gat aa aat t t t gct t cgat aaat ga t cat gat t ca t at at gaggt cggagt cct c aagact cat g aagcat gt aa t agt caaagc cct t gaagt t t aagcct t at aggagct t t c at ct ct ccat t t t t t t gt gt t t at t t t at t agt t t cgt ac t t at act t ct t accgt gt t a t t t aagt gaa at t t aat act cat t gt t gaa ct at at t t t t gt caaact gg at ccat aat a t gt t at act a ct at at t t ga ttttcacaaa gt at at acat t t ccacgt at gggagat t t c at gt t agat a agt cgt caca ct gccaat t a agat ccacac caatggaggg caaat t caat act t acccct ccct at ct ct t gt ct gt t gc tt ctt gt gac t t t t cagtta ct aaat t act at act t t t ca t caat t aat t caacagat ct caaact t at t gaat acagct t t t t t t t t gt t t t acat cca tttcggcaga t gt aat ct cg agagt t gt t a gt acgt aaga at at t gt t t g cagt gt caac acaaccgct g cagaaggcct aaagaggt ca gat t at at ag tttagcaggg at aat t gat g t t aat ggct t ct ct ct ct ct aggt gt at aa at at at gcaa t gt at cat t g t t t t cgact c at t cat at cc t acagt at t c agt t t gaact aaat t cggat ggaaatgt t t agat t t ct ct aaacagcaat t gt acgt act aat t agt caa gaaat t t gac t aaat gt aaa t t cgaat caa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 Page 295 agcaaact ca tt agcgt gcc t at agcat cc aaacaact ca at accaat t g at at gggct g at at t aaat g t cacgccct t ggcat ct cca gaaaat gaaa cat t at gt ca accaaaacac t agcccat at gagaacacgc agat aagccc aaccct aat c ct t ct t t gt a gacaccgaaa 12689250 Sequence acgctgaatg acgataaaaa cgaacgtaga tt caagt ct t gagaaatcat tctcaaaaac aataaggtct gtagtgggat tttcttacac atggaaacaa at t aacat aa gacccagt aa catttttagt ttctatcata tttccgcctc ccccgagaaa at gg Li st i ng. txt tagggacgct ggccat aaaa agt ct ct cca gcgt t t t t gg att acgaacg t aaat at at a t aaat agct t acgaaaaccc t t t gt t cgaa caggat t cct aat ccgaact ct t t t agaac ctt ccaccaa at at gagccc ct t gtgt t t t t agt t t caga 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 271 2004 DNA Arabidopsis thal i ana <400> 271 cgcaggct cc t t aat gt t t g agcagat at a aat cat cat t t at caaat ca act t aggaag gt at t acaca aat aat t t t g tgaaaggaaa gccagcactt gct t gcact a aaaaaat gac t at t gct at a aaaggaat ag t agt at at ac gt ct t caaat t agt t t at at cat aat t aat at at acct t c at at gt t aca agacaaagat ct at t t at t a cact gcgt aa at t t t gt t at t t ct at gat t aat at t t ct t t aaagt t act aagt gat aaa at t agggct a agagt aaact cggat t at cc tggat t t t t c cat t aagt t t t acaaacat a caaat gagt a t cccat acat ttcaacgaca gcggggtgaa agact ct t at t t cgt agat t aacat t t gac at t gagat aa gt t t caaact t gt t gcct ga acgaaat cgt gct cat cgac t gt cgt act t t ccat t t cat gt aaat t aag cgaggataag aagct t t aga gt cagt ccac t t t at gcaat t aat gt acat accat t caaa gat aaat t t t cat t t aacca aacacat t aa t gt t t gt ct a ct t t t gct gg at t aat ct t t ct aat t t gaa ccgt t gaaga caat t gaaat act gt at at t at agt ct t ct ct ggcat t t c t act t gaat t at at ct aaat t aat aat t gg gacaaacgt c t at t aacact t at t t at gct t ct caat ct a t t gt at caac t t aat at ggc t t gat caaaa aat t t t aaaa t gt gagt at a t acaagt t t c t at at gaact t t cat gt aca aaaat gt t ac gagt t t ggga gaaaat gt gt t aat aaat t t t cat ggacaa cat t accaaa aat gat at t a cacat t t gcg ct agggat at gt at t aacgt cat t acggga agt t t t t ct t aat gaccat t t t t ct aat at t ct ct caaag agaat aagat t t gat t at aa aat t agcat a gt caaaaaaa ct t t t t aaaa gt t gcct t t a caaaaaagat t gacaggct g at t ggt ggt t accacaact t agaggaaggc ggcagcat aa t cgt gggt cg t aaaat t t aa t aacaaaat a at aaaaaat g t t cct aaat t t t cct t gt ga aaact t t aga gccacgt ccg gaaaaaaat a gcact agt at t ccat aaat a gact act at a cggcaccaac at t gccgcga gcagat t at c ttggaaaaaa acgagaaaac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 catacacctc atttctatca ttaaccctca catggcttat ctgtacatct ggaccccacc Page 296 12689250 Sequence Listing.txt accttctact tttatctccc attttttttt tcaaat t att actacccaaa aaaatattca acaaacct ac at t gt t ccat gccat gcaaa aacaaaaatt aaat gcat t a t at t at t t t g ttccaaggaa ct at t t aat c t aat accgaa aaaat ggcag agagagt aaa aaaaacacat at t t at t agt t t at acccca t gat t gggcc aaacacaaat t cagt aat aa cat t t agt ag gt act aat aa agt accct t g aagagat aca aagaacaaag gtct t at t t t aggaaaaat a ct t t gat t gc agt aat aat g t t gact at t a t gcat at t aa t t t at at t t t t gat t at aag gt t t at ccct ct gt cgt cgt at gt t at t t at t t a t t cacat t t t t at t t t at ac cat at aact a ttttgcagga t cgt at gt ga t agt act aag cagaat t at g ggcat t acaa cacacagcac aacat t t gt g acagaact gc act at at cat t t gat gt tag acat t cgt ga t t t t aact aa t t agt t t aga t agact at ac gggt t t cct t cacaaact cc t at t t at ccc aaaagat t t g acagt acaag caagagaaaa t aat t t t gac aact ggaaag gggt t at t aa aaat agt acg t t t cccct t t at agct agct 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 272 2004 DNA Arabidopsis thal i ana <400> 272 gtggtagaga at cgaagcag ccat t cgcca aagcaccatt aact t t gcaa tgtct t t t gg agct at t ct a caaaagcgt t act t gagat g t t gat at acc aat ct at t t g gaat t t ggt t gat aat ct t t t ggat t ggt c t t t t aat aat at t ct ct ggt t gt t gt t ggt t aaacat t t a t t at agaaag gt cacgat ct at act agt t t t gccagt act at t aat gagg aact t cct t g agagt t t gag gaggt aggt a ct at agcgt c t t t ctgattt aggt aacaaa t at t t t ct t c ggct t t gggg t gcat gat at ct t t t ccaaa gggagcaaga ct accacaaa caact t t gt c cct t at gaca cct cat ggaa gagaccct cg agt t t gagat t agacaat gt t t aaaat t t a acct t accga t t acccgt aa at t t t at t t t aaagaat aca t at acaacag gagt t ct ct g t t aggt at aa ct gat t t ct a ct ccaaggct act gt t t t gg gccggatggc tttttttttt ggagatcgag ggt ct ggt ca t ggaaggt ct aaggt gt t gt gat accccct t ggat aaaat t t ct cct t ca aact t t t aca at t t t t gtct gaat cgggat aagt cat cct t ct gcact t a ct gt t gcat t t cgggt aaac agagcat t at t ggt ct gt gt gt ct t t gt aa tttttttttt gagtgggaga cagaagggt g ggt cacgaaa cact at gt ct t t ggt t t cat ttgtct t t ct t cggt t at t t t t t t cat t gt t t t ggt acac t gt t gggat c t aat agaagt ggt gt at aca t ct t ggaat c t cact ccgt t agt aaaacag t gaggt t cca t cct ggct t a t gccgat t t c gaggaccgag tcgagaagcc gggt gt t gag tggggccagg aagcatgtgt accggat cac gt t aagaggt at t t ct cgat cat t t caagt gaat gacacg t t t gggt t t t cagaaggaat t agcct t caa gt t t at gaac t aggt aaggt agaacagct g aat cct gat c t cct ct acag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 297 12689250 Sequence Listing.txt tctatttgat tcaaacaaga gctcagcata tgaaaccatt gtttcggggc cagct at aga aat aaaagt t t agt t ggat a gt t t t t t gt a gaaact t t at t gaat t t ct t act t gt acgt aat cgt gcct act ccggct a t ggaaat t t c gt t gagt gag gt t t ggt t t a aaaaaaaat a t at ggt t cgt gat ct aat t t <210> 273 t aagct t ct t gt t gat gaga ccaat gat aa gcccaatttt cgt t t t ggca acact aacag ggagggataa at aat gacat aaact at caa t t ct gcagt t tcaacggaag aagcacagaa gtgaagacga aaagat ct ct t ggat t t cag t t at at ggca gt t gaagagc t ct t t gacgg ct aat gcat t t at at at aca t t caaat ct g aacgaaaat t gaggt t aact at cggaaaac aaacgt cat c agaagaagac acagccctt c cgaacct t t g t t t t ctctct at gt aact t gt aag at gcaat agt t t t cgt t ggg gatcagcggg aaaaaaaact gccaaaacca agt t gt t t gg aaaat t t gaa aat gt gaaag gt t t t acat a caaat t agt c aat aagt t t c aggacact ct aat gacgcaa t ggat at gaa at cagat t aa ttaacaacca t t aaat cat t t caaat ct gc cgt act t aca t t t ctt acca aagt caacaa at gcgt agct t t aaact cgt aat gt cat aa t cggt t t ct c t cct ggt t t g cat t t gt caa gagcat gcaa aaaagt t cac t t t at t caat t aact aaaat aaacagaacc cat aat t t t g t accaat t t a at t t t aat t a t aat t aat ga t t aggt aaca aaccaaaaaa caat cct t ca 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 ttttctccat ttttcaattc aattttggct <211> <212> <213> 2004 DNA Arabidopsis thal i ana <400> 273 aat agacgag aaaggcaaaa t acat at aaa caat t t ct at ttcagaagag gat t t ggat c at cgccgt t g at cgagacag aaaccaaaca aaaact t cat gggaacggcc t act at at t a gcat aat at a aagaaagttt t gtt tat gt t at caaaagt a agccct aat t t aaaccct aa at t ggagat g aat gacgaat atcgagacgg at cgat at cg cgt gaat gcc ggaccaat aa cagat ct at t agaact ct at at caaaagag aagagcat ca t caat t cat a at aacgact g at caat cat a at aat gaaga agt at caat a cgacacact a at cgccacac gagagat aca caaaaaaaac t at t t agcca t t t gcat t t t aagaccat cc t at gt gaat a t t at ggt t gt cacaaaccaa at t at gt gaa aaaccacaat t ct aaacact aat t ct ccga t cgccgt cga aat t gccgt c at cgccgt aa t aaacact t g cgt at gggt c ct at t t aat t tt att gcaag gt t aagaaga ct ct t t gat t at caaat t ca cacaaaaccc tcacgaaacc caaat caaat t t t cgaacct ttgagacgga ttgt t ctct t aaagt cgccg t aaaagat t a cat gaagt at ttgggt t t t c t ct ct t act a gt t t ct t agc agt t ct t agg t at caat ct a t aat t t cgt t ct aat t t caa gaat cat acc t aaact ct cc tcgacacact t gagct aat g ctgcagagag cgt t t at cga ccct t cgat a tggaagaacc t aaat ctt ag t aaaagat aa at aagaacat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 ctcattttaa ctcttatata agagacaaat gacaaaatgt ct ct t at at a agagt caaaa Page 298 12689250 Sequence Listing.txt tgagatgttc ttagtaatta atgatcaaaa aaaattctaa gaactaacct aagaaactac cat t aat gaa at aaaacaat t t ccat t gaa t at aaagt ct aat t acaaac t act aaat ct t ct aaat caa agat t t gaca aact t at aac ct t t aat t t a aatttttttt acagt at act t acat t gt ct t aagcgt cgt ct ct cgggaa t t t caaat cc aaat t t at t g gcagct t ct a <210> 274 gct ct aagag t cat t aagat cgt gct ct aa at at agat gt t t caat t caa aacact cct c aat at at gga t gt gaaaaat ccat t t aat t t at t t at t ct ct at t at aat t t at gt at t t caat agaat t ct at agt ct t aacact ccaa aat cat caat t t gat ccgt t t gt t at ct t c ctt caaacca t caaaat t aa gact t at t aa gagaagtgt c gat accat aa agt t t tct at caaat t aaat t cccaaat ct aat at at aca caat at at aa agagat t tag gt t gt cacaa t t at aat gag ccact ccaat ct acgaat ct catctct t t t t gaat t t ggt cat g t t acaaaagt gagcct at ca t tct aagt ct t t t t agt aag aaagaaaat t t t at t t gat a gaat t t acac aaat gct aag at gcat at aa aagt aaaaat aaaat agt t a atcct t t t t a t t t cgt cgt c ct aagacgt t acaat cat cg gaat gat ccg gt t t gt t gag t ct t agt t ac t t aaggt t ac agt cat gt t t at aat cat aa aaat aaaaca ttttaggagg t aat t act t c aaaagct t ga t at at t t t t t ccat aaaaca taaacacaca t at at t t at t gat at ct gt t aaacaaacca agcacat aca at t ccat ct a gcggagatct aat t t t t t ca t ct aagggac gagt t t t gt c gaat aat t t c t agcacgat t aaaact t t at aaat aaaaaa caagt aacat t aat aaaaga aat t act ct t at t acat t at t at t t gt t t t t gact t t gaa t ct ct cgat t aagaggaaga ct t gct cct t cgaagat gaa 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2008 DNA Arabidopsis thal i ana <400> 274 t aaacgat aa t gccat ct t a ggt gt accgc t t t at agt t t at t t at t t t a t t agt cgt t g t ct t aaaat t t at at t aaat t aat aat gt t acct t t t aga aat aaat at t acct gt t t gt at acgt t t ca ggagacaat t at cat t gt t a t t aat gt t ga cgggacgt t g t cgaat caca aat gat at t a t t at gaagt t t aat gat at a t t cat t cct a gaaat t t t ca agcgaact aa cat t t t t t aa ct aagt aacg t ct t at t t at aacccacaca aaat at gacg at t at t t t t g t aaat at acg t t aact ct aa aat aat t aat aaaat t ggat aaaaat gt t a aat t aat at a t cct gccaaa aat at gt t t a tttttcgcca aat t t gt t t t at agat t aaa aat aat at t a aagcaaaaat gt cagt agca t ccat t t at t act act agat t at aaaaat t cccgt ccaca t gaat cat t g at t ggt t aat t gagat t t t t ct t t t t at aa t t t aaaaaat t t agaagat t t gacat t gt a t t ggt at t at t t t aat ccgc t at aaat t at aaaaaaaact t t aaaat at t at at gt t cat t t aaact ct a at t caaat at aat at t t t t t t aaat t t t t a aat aagt t ga 120 180 240 300 360 420 480 540 600 660 Page 299 12689250 Sequence Listing.txt aatacatgat ttgtttattt aaacaaaaaa cgtttagtaa gaacaatatt aaaaat t t t g t gt gaagt t t t t aaagt aaa t aagt ccaac agagt at aaa aaaaacaaca t ct gt t aaac t at t t t at t g act t t gt t t g t t t t gt t t t a gccact t cag t gggaaat gc caaaagt at g aacgt ct t t g aaat aaaggt ggt aat agaa ct t t acagt a taaacagacg at acgcagat acgaaat cac cgcagacatt ttctctcttt t t t t aaaat a gt t t gaat t a at t aat t at t t ct aggatt t t t t t at gat t ct t at agt t t t t gt at t cac t aat t t gt aa gt cacaat at t caaccacgt at t t gggt t t caat at t caa gaat t gaaat aat t t aat t t aacaaacaaa t t at gagcaa t t act aat t g at aaagt t aa t aggt t cacc gct t t t cct t t t t gt t t at a ggt t t t ct cg t t gt t aat aa aat t t gat t g ttggaacaaa at t t gt t aaa aaaagt t aca at aagcagct cccaaaaacg aaaaaaat gt acat t t gat a gt t t cact t g gcccacat at tgt t t ggttt gact aat gag gt t t t t cacc at t t t gt at t aaaagaaaaa ggccgagcag ct gcaat t t a gt accggaac cgat t t ct cc t gcat acat a gaaaaat g aat acgt t t g at aat gt gat gat t t aat gc at gact gt aa t t caaat t t c ct t at gat aa gat gt t t cat aaagtggggg aaat aat ggt at gagt agt t gacat accga tcaaaagaga cacat gcaat aaaggt acct ggaaaaaaca gaaaagaaag t t t t gggct c gt t ggt t cag aaaccggatt gct ct t ct cc at aat aat ac t gct aat t t t t gacaat t t a t aat gacat t aaat gt at ac aaaaccat ac aaat ct t t ct t t ct t at t t t at at cat gaa cgtcgcgtga t at at agt t a cat agaaggt at cat t t ct t t ggt gct at c aat gaaaccc ttttttggaa aat aat gagc t t gat cat gt gt gagct acc t at caaaat c act ct t ct t c act ct t gt ca t at t t aaaca aagggat t ga aat t aaaaat gaat t gt aaa at agat gggt t t aat t ct aa gagt t at agc t t act t ggag aaacaacgt c t t t agt t gat acat gat t cg t aaat ccacg t at at gat ct ttaaaaaccg t t t cat t aaa t at at aat t t at aat aaagc ct agt aat ct aaat ccaaaa ct t aagt t at t ct gt t ct at ggat t t t t ga 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2008 <210> 275 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 275 tattataatt aatgtttata atgaaaagaa tgagatagac ttacagttat cacaaaaaga ttcgtcagca tgtaattact cgtgacatat agcaaccaat tttgttatat atgtataaat tgttacgagt atatgaattt ttttgaggta aaagagctaa ccatcaaata aagtaaattg acagccacca cattcctaaa gttagaaaat cgaaatcgca ggtcatattt ctatgaaaac t cat t at t t c aat ct aaacg t ct ct agct c cat ct t t gac cagaagcaaa aat t aaacaa t ggcaaacct gaaagagagt Page ccaaagcaaa ttttccccaa at at t t ct t c aat t at at at t caact agt g acaaat cat a gt aaaaaagc t at ct at at a cgaat cat t t gat t t agcct ttcacgcacg aat acagat t aaact aaat t ccacgt gat g at cact t ccc gt aaat ggct 120 180 240 300 360 420 480 12689250 Sequence Listing.txt atattcatat ttgccccacc acccatgacc catctcagtt aataaatagt acct ct at ac cgcat gat t t act act cgat cacaagt acc agccaaaaat t ct act aat t act aact t t a caact aact c at acat at ac t t t t cgtct t ccacgt cat c aaagacccaa ccgt aact t a ct t t gagt t a gagaggcgag t ct ccggcga ct t aat ct gt ttcgt t t t gg ggat ct at t a tttttttttt t gt t t aggt c tggt t t t ct a agct agt tag tgt t t gcttc ttattttcac t at aat cat t t gcct t t cgt cagt cgcct t aat t t t aaac t at gat ct aa t t gaacact t caagat t aat t t gaagat t a t gt t agt aaa gt t gt t t t cg agcgcat t ac ccct gact cg ct ct aaccac t ct cgt t t gt agcgacgtag gagat t gct t gt t cgat ct g t gt t at ggt t gct t cgt agc gt t gct t caa t t t gat t gt t t ct ct ct agc ggt t t t cttc ttat t t t caa aggaat cat c cact t t at ct t gcat gaaaa gact cacgac t gt t gt act t t cagt t act a gt gcat gcat cgaggt at at at cgaggt aa at aaat t aaa agaagctt ct aaacaat gaa ccacgt cgac agaaccctt g t cggagt tag ggt t ggt gt t t t caggt aaa cgt at gaact cact gat agt tgt t t t ggt g at t t ct t act t ct t act agg t aat cact gg t gt t gct t ct ggat aat t t t at gg t at at at agt at cct cct t g at t t t ctct a aact gt at aa cagat t aat a t at ct ct cgt t at agt ct aa cgcat cacac aaagat t t gg t t t t ccggat agact acaca at at gccagc aat t t aaagt agagagagag t cgt acggat aatggcggcg cgat t t t t cc gt t gt at gt t t gt gt gt t cg gt t t cgt gct t t t t ctatgt t t t t cgt gct at at t t cgat aaaat gaccc ct at aaaact acaat gt aca gccact gt t t cat cct t t at at aagat t aa t gaaat agt a aacaaat at t t aat cat at t t agat aaacg taaaaaaaca cgacct t ggg t at t aat gaa gt agcagt aa aaagat at ag t t t ct cggt c gt t t t ggt gt act cgat t gg t agt t gat ct t ctt cgaaga tgt t t t ctta t t gct t t ggg t ct gagt cgt agt t t t gt gc actgt t t t t g t ct t ct at t c aaacgt aat t at t cgt t agt t at ct t caca acgt aacgt a taacggccgg aat t aat gca t aaat t aat g t aat cggat a acaaaaat gt t t at gcaat t t cagcat t cc gagggt aat t aat ct cat t t agagaacaca aat ct t agt t ttgt t ct t ca gt ggat ccgc t t gcgat t t c t at t t aggt t ct t cgt t t t c ct gt t gaat t ttgt t t t t ga ttgt t t t caa ggt ggt t gt g 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 276 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 276 aaagcttctg gcattttttg acttttgtcg tcactaaacg aagagacttg accgtggtaa gacgatttca acaactttat ctgataaggg gactatgata ttttgtgtca aaaaaaaacg ggggctatga tattttttac atagctacaa tttctttttc tttttttggg gtgaatctag at atcaggct tgggttcatt attacttcag gtttaggatt aatat t ggat agcatatat a Page 301 120 180 240 12689250 Sequence Listing.txt tcctctaatt tgtaattaac tcaacatttg ataacacgaa gatatattct at t at ct at c taacaccaca at aacacaca cggcat act g aaat t t at aa tgt t gt t t t c cct t t aaaga at t gt ct t ct agct ct agt t gggaccccaa t aat t at cgt t t cagct ct c at agat t cat aaaagccgt a ct aact t at t t t t at at cct tacagacacc acaaaaattt tt ct at gcga agt aat aaga aaaat cat t t ttgt t t t t ag aagat at t gt ct t gtcgggg aacct agcaa gt aaacaaca ccct t ccact t cacat agga at caacct t c at caaaaat g t t t ggt t at g cacaaaaaac gagaggat t a ct acgagagt gt aacagagc aat at at aaa aat gat t gcc gt aaat aat a gt t accct ca cat at gct cc t ct t t t t gt a gat cat ct at cgcaagct ca ttat t t t gaa t t cgacggt a t t t gt t gtaa aaaacaaaga t aagat cgaa t ccaaat acg ct agaat t at act t t at gat at ct t t t t t t t cagcgat t a aaaaat at ct at ggcccat t t cgggaat cc t ct caacact t at cat cacc gagccaat aa t cat at gt gc t gcat t t gag aaagt cct aa at ct ccct ac t act acaat g agaat gcgaa act t t ct gag at at caaact at caaaaat t acaacat cga t t t t t at aaa t aat aat t ca aat cggat aa t agt agt at t t t agt t at t t cacat gt t ga aacct cagat cat gt at at c gaaat aaaca t t t t t agat c t t at t t t t ct t at ct ct t ga t t at t agt ca aagtgggggc t t t act gt at at t at t gt gc ct cact aaac at gg ct t t gaaaca t ccacgt gat t t at t t t agt t aact ct aaa t ccact t t t a t t t atggagg at gaat t ct g aat agt at gt aagt aacaag gt ggat cagt ct ccccacca gaat gt at t t t t aaat act t gact aaaat a aat t act t t t t cat gt t t ca ttaat t t t cc ct acagt t cc agcagacttt act t ggaaag at t t ct t t t t t t t gt t atca gaagcacagc ccgaacgaat act acat t at cgagaaatt g at aacacacg caaaat cct c acacgt t t t a gt ggt ccat g cat t gcact t t gt ct aaat t t ct ct t at ag ggggcaact g ttaaacggag t act gt t gat agcaact t gt t cat cat cga t acccact ca aaat t t t caa t t ct gcat at at at caccca gt t caact ac cgcat t t t t g gaaact at at acct cgt gt c act at agat g tgaatggaag t t ggagcat c gcacaat aat acaacaacga ttttcttggg t gt t t t aaca at ct cgaaag ct gt t t cact aaaagcct at gt gat t t gaa ccagt gat ag t ggat acaaa aat at at at g at aaacaagt ct t at t ggat cgcaat at gt at agat gaag t gt t gt acct t gacact aaa agt aacgt gt at aat gt caa cccat aat t t t gat t t at t t gt t aagt t aa t ccaat act g t aat t gcaaa act t t t gaag tgacacgagt aact cat at t ttttaagaga cat t agagt t t gat t t t caa gcaat gct ga ttggagccaa ccaaggaaaa gacaaat gt a at at at act t t t gggggat c 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 277 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 277 cgcaaatgca aatcagctaa atcaaataca agaatccaga acttgaatag aaatcttatc Page 302 12689250 Sequence Listing.txt ctacatatat aatatcggat gttgaagcag gtaaatcata aacaacacaa aattcgtgaa gact ggt t aa aat at caagt t aggaact ct acccacaagc t t cat agt gt at gcacgaaa cggt gat t ct t cagact aac gcacgggagt gcact gt gat gt acgagaaa gaact cgat c ct cacct ccc agcagaaat g gacct gacct t gat acct at at aagacact cgcaagt cga t t cacaaat g aat gagt at g gcact gt gt g t aagct ct gg t t t gaat ct g gaat gt at ac t ct t t gct gt gcat t gact t agt t t t gcaa t t gaaaat gg t cgt cacagc ttttccccaa ct cct t t aaa t ct ct t ggaa aggt t t at aa ggct cgcccg t ct agt t t t t act at at t ca gaaat t caat aaagct at ga acagaaccaa t gat gat gat gaagcat acg gccagt gagg cgagt ccgca gt gaat ccga gat t t t ggac ggaaaat t aa gct ct gt ct a aat cacaagg at at gggaga gcgt gagact ggaaacggat t t aaacgct a agcacaacaa t t t ggat ggc gaat gaat ag aggcaat aat t gcaat t cgg t t t ccat gt g aat gt t t aaa agaat gaat g tgacgaagaa gaaat at cca caagct cgt a accaagaagc aat t act cac t ct ct t ggca gaggat ccac ct t ccaaagg t ct cgat gt g t cat agt cgt at t act caac gat gat gat t gcggacat cg at gacct ggc tagaaacaag t ct caact cc t ggact ggcg ggct aat t ac t at gt gat at at at t t cagg gat t gaaagg t at t gct gt g acat gct cag gct gt cct gt ct t gcaat gt t at gaacaag aaagat gt ga at accaat ca caat cgt t ct gcgt t cggaa t t agt agaat gt aaact act at t agaaaca aact aacacc act gt t t cat at gg aggaaagat a t gaaggcct a t gaat acct g aacct gaaat t cact caaac gaagcaacaa gat t t ct t ct cacat aat t t gat cgaacac tcgcgacgga agaaacgcga at gact gaaa agagt cgct a t aat t t accc t gt gacct gc t ggagaat ca aagct gt t gg ccat t gcagg at ggt t gt t t t t aat ggacc t t ccat t gat t t gat t ggt a t t ggt act ga t t at gt t t at agat at ggt t at ct t t cat c ct cacgt at a agt t t accct acaagcaacg caat t acct a ct t ct t gt cc t t at aaacaa t gt gt t gaaa t aacct ccat ggt acct aaa t cagccaat c gaacgctttt cct t ct t ct t aagct ggaga t ct ct cgt ac aaggt t gct g cgaat aggag aaaaacaacc caagt cgct t t caagt t t t a t t t gcct at a gagaaagaaa gccat t t t gg aat gcaaaca t gt t gt agga ggt gt at gt c gct gt agcag gat aagt t aa t gt aaat t ca t gct gact aa t ccat t t caa t at act acgt t aaaaact t t cat at t t t ag t gt cact t ct at gccacgt g ccaaagt ct c ct cgat gct c aact ggct ct ct ccat t caa caaaacaagc t aacaaccat aat t t cact t ct agct aaat t t gt acct cg gt caaggaaa agccgagt cg agaagaaaac ggagat t t cg acggcgaggg t t at t aaggt t ggct at at g t t gaagct ga t gt agcgggt gaggaaagat aat gcct t t c at ct t gt ct t t ct ct cacat aat gt t gt ga at gct t t aga gagccact cc at cat gat at ct acgt t gca agt cgccaaa ct gaaaaat a cat gt cgt cg t t t act caca ct ct t cct t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 Page 303 <210> <211> <212> <213> 12689250 Sequence Listing.txt 278 2006 DNA Arabidopsis thal i ana <400> 278 t gcgggt aga caagagct cg t ct ct t gt t g gt t caggt ct ccct t aact c t ct caaact t tttttttttt at ggt t gcag gcct at ct ag cat gaggact ttttgttgga ttttgtgtca cat at act t g gat gcat gt t caaagt at ca cgccct cgga t at caagagt ggcggat t ac gcact agt ct tgtggcagga ggat caccac ct agt t agt t agt t acagca t ggt gt at aa t caaggt gat ttttttttac t caact cagt gaaat cgaca agcct t acct aat at agat g t agat ccaac t t ct gaaaga agat cgt ct t at t gccagag cctctct t t t t at t ggact c t aacat tt ag t t t agt t t ct gact t gaaat agaagaaaat t t gt gact ga t t t gggtgaa acat cccat g ctgt t t t ct t t ct t t t ccct aaaact agct ccat t caaac agct t t t t at t cacagcct a cct t t acct t t caggt gcat t ct cgt t at t agt t acct t t gct t t gggt t t t t at t gt t a ggat gat aag at ggat t t t a ct t cagagac aggcat caac t gt t t t aacc t caat caaaa at ct gggaac agacgat aac gggt aat gag cagt aat gat t ctt ccaccc gat t at at at ggt t t ggct g gt at t t at t t gtct t gt t t t cagt t t gt ga agt t t ggt t g agt gggt at g t gaagaacct at cacaaat c caagccaat t ggat t agt cg t gct cct cca gaatcgggaa gacagaggac gt t gt agt gt ggt t agggcg cct t t gat ct t aagat ggt c aagagagat a gaaaacgagt t ggct cacaa acgt agcct t t gagt gt aag tggagagaga t cct cct cca agcaagcaac cacaaaacaa aat t ggccgg cat at ct cat cct gaaggac ct t gcct t ac t t aggact ct t t aacact ca gt gct ct aac t t cgct cat c cat gt caaga t t t t t at gt c t aagat gt t c t t cat gaacc at t cact ct c t aaact aagg at aat t agga aaacccgt aa gaacaat gga gcacct gcca gt t gcagaga gggactggct cacccct gct ggggaaacat aacgcgaagc ccgcat t t ga t gcgggat t c t ct cct ccat t aaat t t gt g gct t ct act g gaaagagt t a aat t ggaaaa t ct ccaat t g aacct gaccg t cgct acat c t t cgcat ct t cat ct t t t ct gcct t t t aac t ccct t acat at at t cct t t t ct ct gcact cgcgat gaat t ct ggt t ct c t acct caact acaacct at c cct aat gt ac aat gt gat ct t t gt t t ccat at ct ggt t t t ggat t cat t t t t cat cct ca gt cacat gca t gt gt t t gt g caggcaccaa t agt agct gg cat at at at t t at t cgat at ct acat ggat at agct t t ga tttgtttggc gaagatggga gagt at at ct at gaat caag ct gagt ggat tgt t ggaaag t aagat t gga ct act at ct t t t t acct aat gt at t cact g ggt t ct gt ag t gaaact t t c t caaaat caa t t gt t gggaa t gt t t t t t t t t t t ggt cct t t t ct aaaaac at ct acat at at ct aaaat a t ct aaaaaaa gcat t gaaga gt cagt gt ca ggt t aagcac t at t t cat ct cacagaaaag ct agcat cag t t agt t agt t gacaat t gga t gcat t gcgt tttttttttt ct t t aagaga agacggtgct cccgagt cca cccaact t aa aagact agt a cgtaggaagg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 304 12689250 Sequence Listing.txt gccaatttac aaccctaaac ttgtgaaaga agaatgaatt ttaatcttgg cgttattggg cctccgctcc tctggactct gtcgggcttg atccattttc gttgctgagt ggatcgtagg aaccctaaac ttgtgaaaga agaatg 1920 1980 2006 <210> <211> <212> <213> 279 2012 DNA Arabidopsis thal i ana <400> 279 caaaaggagc agaggactga gaggt gaat c cgat ct t t t g agcgatagag at t t ct t aaa t gt t t t at ag gcagaaaaat caat gt ct t t cgt cggaaca acgt gacaaa agccat ct t c t ct cat t caa gcaaacgcac t t t t t aggt a cat gacgat g t gaat act aa t t t agaat ag act cagcat a t at aat gt aa t cgaact t t a acgt aat t t t t cat aaact a agt t act t t t tggaccacac at ct t t t gat tat t t t t aga t t gagagagt gt ct t ct ggt tttttgcgag at gt gacagg aat aat agag caggt at at a t t t t t agggt act at t acaa cct cagccaa t cgccgt aca t t cct aat cc gaggt t gact gaagt cct cc t t aagat t cg t ggact t t ca t t t ct gt gat t ct t t t gaag attttttttt t t t t aacgat gaat aat t t a gt t accgt ga tgct t t t t gc t agt t at aca ct gat gaact acact gaaca cagaaacgac tat t t t t cct caaagaagaa ccct gt t cag t cct gat t ca aagtgagaag t gaaaggt t a t at act cat g t ggat t at t a aacct ggt t t aat ggaacga t cacgt gct c t ct cat ccga cat t cat ct c t t agagct gc aaacgcaat c t gaat t gct a t gt gat t t ca agact ct t ac tttttttttt gt t gcat t t g t at aact at c at gaaaat aa agat at gct t ct t t t gact a aggataagga ccct t t t aag aaaaaagaag gct aat t t t a agt ccatt ag ct caat aagg gtgt t t t gt t aat caagaag t ct caaagt c at t t t aat ca ggt ct acaag gt t t t t at t c ggcgct t ct a t ccgt cgct a t ggact at ac agcct t ccct caagaaat at t t t gat t aaa gagat accac ctt gaaggaa t aaagt aact t ggat t t cgc t gt cccat at ct act agcaa cagact t gaa gccact aat a aacaaacaaa at t cggaact at t t t ct cct t ct aacaat a t ct agt gt ag agacctt gt t aacat agcac t t gcgaacat gaacat cct g act cgt gcat gat agt t t ca gt t t gt agaa gat t t gat aa gagat gt t ca caccaccat a gaat act t ct gaaggcgat c ggt aat act a ct cacgt gca aaggagatcg gaat t t ggag t t t agaat t g t gaggt t t t a acgt t at t gt agct aacgca ct t t at aat a aagt cat aaa at cggt t tag gaat t t t gct tctct t t t t c t caaacaat t acaaacccaa cat caccgaa aagt gt t gaa t agacact aa at gat at at t t t ggcgct gg aaat t t t gca at at caaaat tagt t t t gt t gaat agact a gact ct cat c cgaacgagga ct gct caaca agaacaaaac tttttttttt t cggaggt at at ct t ccat g at t caat ct t ccat t t t gt t t agt gaaaaa aat t t t gaac ct cgt agt at t t t t at at t t caaaagaaaa acgt t ct ct c aacgt aat t t tttttataga at at acgat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 attataaaaa cacgaaatac caagtggacg actgaggtta atagatctag ccgtagaata Page 305 aagat ct gca t cggt acgct cact ggacaa gaagaact ag aat ct cat t t t t t ct t caat tgaaaggcgg ct cgaacgt a t t at t gaacc t t t t ggagat agct t at aaa t t gaat t t t c 12689250 Sequence t gagaat ct a aacggt gat a caagaatcga cgacacacaa gacgtacgag aatcaatgcg aagagcggag aaagattgcg tttgggagct tcctctatca tcgagaaaaa tg Li st i ng. t xt agaccataac acacggaaca acact ccaca at t at t t gaa ct gagggt aa agacgtaaat acacat gt at ggt caat at t t t aat t t t ca t t cat aaat t 1740 1800 1860 1920 1980 2012 <210> <211> <212> <213> 280 2004 DNA Arabi dopsi s t hal i ana <400> 280 aagaacat ct t t at ct ct ag aact t t t gaa agacgct cgt at ct caccct at agat t t aa taaccaacca at cat aacag aaaaagggga t acagt at ca ggaagt cct a caat t cagt g aat acacaac t t gccaat cc t ct at t aagc agagcat t t g at cact ct ct ct ccat ccac cct t t gcaat t t gt accaaa aagaaaagaa gagcgaaaag t aaaat aaaa t t cggccat g tgaaagcaga ggccct ggca ct t t agcacc gcaacct gga t cccagct aa caaaaagcga at gt aacat g aaaccaactt aacat ct t at gcaagact ca agt gt agct c at t ccccgt a agat cat t ca t gt gagct aa t t ct t ct gt t gt cggt t gag t gaggt t cca t cagat t t cc aacaat act g at caaaat cc aaaagagact ggt ct gaat t acagaaact g aaaat t gcat aagct ggagt t cgggagt ga agct t ct gct aacgccgaga ccaaagat aa t t t ct aagaa taagacacaa ct ccaat aat aagagaaagt cagt ccggct t agt at agt c act cat gct c aagt t t gt gg cagcacct t g gct caagacc t gat gat acc agt cagaacc aaaat caat a t at accaaca agact t gcgc agagat t t cg ttagcaaaac at cccaat ca cggaaagaag t ggat ct cag aaacct at ac t t gaggacct tgggaaaaaa accaggagga aagt t aacaa t gt agt gat c ccaaaagt gt at caaat gt g at ggcaagat ct gagcacca t gt gct caag at cagccagc gaaaagccgt cct gcat t t c t acaagaggt t acaggct ca at ccaaaaac aat ccct t cc aat caaccac t t cagaaaat gt agt caaca gaat ccat gg aagaaagat g cagct gagt c at t t gagt gc cat caccgt t t ggat t t at a at aagaacat cact aaaaca at at aaagca t t t accat aa t acct t t gag gaaacacaag t gacat at ga aaccaaat ag t gaat aaat c cccaat t cga agt at ct t t g gact ct ct t g t caggat aca t t gt t t t gac t cct t gt aca t ccaagaaat gcggt agaat gaagcat cca ccgagcacag t gaat t t gca t t ggt t agct ccat t cagt a acact ct t gg t agaaagggt aagagat act ct t gcagat g aaaacaaggt t t at act gag ccat aacagt aagcaagaaa t ct t agcaat aat ct aaact t ct ct gaaag t t ct at cct g cagt agat cc caaaaacaac t gt cat cgt t at t cgaaaaa agcaaat gt c caaacccagg cgat t cacga aacct t aaaa caat t gt gaa aat t accaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 306 gaagaacat t ggtcaagaga aaaat t aat t ct at t t t cat aaaacggaat cat t at t gt a at at t t agat taagaaagaa t ct ct ct ct c ct ct t t ggag cgtagaggag acaat gagag aaat aaat t t at t t gtcttt t gt t t t agaa aat ct at t t a t t at ccaaat agcat ct t t g t ct ct ct cga gat cagcaac 12689250 Sequence atgggaagat gagcgaacct aatttttttt tattttttt a gctttgcact cacagaacag ttggtttaat cgactggtta aatgctaatt gaattattag ttacacacac aaaatt act t ttcaatagcc aaataagaaa gacttgtctg attttctcta gtctctcgac ggtcgcatat tat g Li st i ng. t xt ggacgagatt t t gt aaaaat aacagaagat t t t at t t at t att gctt cag ttaagt t t t a aaaaaaagag t agacgcaat caaaagt cca t t cat t gaga aat aat t gac gacat t t t ct accct aaaaa acagcaaaaa ct t ggt aat a aaaaagcat a caccgccgat t t t ct t t ct t 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 281 2004 DNA Arabi dopsi s t hal i ana <400> 281 act caat ccc aagt agt caa cacaaacacg t acct gt aat ct t t aaat ct gccaaact gg ct gt t aact c gt t t ct t act act t gat at t tcagggaaga t t ctgggaag act t cact t g t ct t t t caac t agaaaagt t gat agat at g t ct t gcgt t a agagaccagt gt ct at t aag ggt t agat t c t gt at at at t aagat t act t t gct t agaat agagaaaaat ct t t cgaat g at gcat agaa tt ct gctct t gt aaat ggct gt t t ct at ca t gt agagct g aactggaagg aacacct aca t aagact aat at gt gt t at a at t ggt t t t g t aaaacat t a cgt aaaagt a at t at t aat a t cggt ccaac t gagat at gt t ggt gaaaaa aaaggt act t ggt t ct t at c gagct gct t a ggt t ct t cgg agat t at gat at cgt at cgt aaat ccat at aact cat caa t ct gaat at t agaagtggt c cat t ct t agc tt att gcaaa cct t agt t t t t at gt t aaat aat t cacacc act t t at cct at t t t ct at c t act t gat aa t t t t gt t ct c t t t gt at at a t gt t at aat g ttactct t t g gccgt caat t t t t t ccgt ct acagt ggact at gaat caat accaaacct g gcagt cgt ga ct t t t t cagg gt ggat cat t t act cat aaa t t ct t aat ct t cacat gt at acat t t gt gg at acct gt gc ct ccat t t at at gt t gt t ga gacct at aga at t ccat t t t act t t t aaat gat t gaaaaa t gaat gacat t ggaatt gag aaaggt t agt ggctgagaaa t aagt t act c aaat gt aact agat ct t ct t aagct gagaa gcaacat cat cagct gcat t ccaatt agca gt caaccat t aagacgagag accgaccaac t t t gt t aat g at agt act ag ct aaaat t t t aaat t ccgac t at t caaccg aat t at t ct c t cccagggt a t acaact cat at t gcct t aa ct t t at aagt aaacct t t gg acct gccgag ccgt t gcaga t ggagt agt t cgaacgct gt t t agt at t t t cagct t t act gt t aagatt t aagggagat a t gct aact t c t at aat cact t aat t caaat caggtccggt t t t at gt act agcaaaagt a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 tacttttttg tttgtttgcc acacattgat tgatttatta cttttaagta tatcaattta Page 307 12689250 Sequence Listing.txt acagggaaat ataactgaat ctagtaaata taataact tt cttgatttgg gtctcaagt c tgagaaccaa gt aaat cagt ct at aat t aa cagatttttt ttttggaaaa caat t acgt t aaaaaaaaag aat t gagt t c gaagggtaag t cgt cat t t a at t t gt t aac agagaccaac aaacccaat g at ct at aaat gaaaaat cag ccat t aat ca t at gaaaggt aat at t t aaa aaaaaaaaaa at gt accaac caat caat at aagcat t ct g aaaaagattt gacact gcaa at at t t t gat t ct agat gt t gt caaacat t t at cgt t aat ggcaat t t ga t aaat t ccgt aaagagt t at t aat caat ga gaagt cgt gg at ct t at aac at at t t ct t c at gg aaaat act at t t cct at aat ggaaagcat a at ct aaaat g t at gt t t t t t gt aaat at t g at at t t gaga aagacagaaa at gct t accc at t t at caac t ct t ct t t gt t t t t t acgt g t t t ct t gct t at t ggcagcg aaagt caat t t t t gt agt ga at act cgaca t t t ct t gat t t aagaaaaat caccat t cgt cat at at at a t cat aact aa acaact t gca tttgaacaag aat cct at t a gt agt ggt cg aat t t at t t c caacacgcaa t t aaat aagg at aaaaat ga cacaagagat cacacaaaat gaact cagaa 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 282 <211> 2008 <212> DNA <213> Arabidopsis thaliana <400> 282 acataagcac tcgcatttca cagtagccaa acacttcatg gccactcttt ttttccggga ccattttttg atttataacg tgcatactgc tcaaatatgt aatcatacta gggaaaatct cttttaaata gtatattcgc cgaaaaatac acattagaat atacaaaatc aagtaacgag tacagtgcgt agggtccatt cgatgtttct cagatacaga tacacattcg aaatcatcaa cctaatgtgt gtgtgctttt ttttagtcaa gttatgacat gagtatgttg ctacgtgtcg atgttgagct agtttcttct tatgaaacaa tatctgcctg aaaaaaatcc cgtatattac attttgcttt ctctggtgac atatccaatt gt t t ataata gat ct agct a tgt t aaagat gcgagtttat atcttttgga attaaaaata tttggccaca ggagttctga aaggtcaggt ttgtttttat taatgggttt tcaaataaga t cgat t gat t act at ct t t t tgtgtct t t t cat caagat c gt t gat gt ct t t t gt aggt t agat caaagt t gaaaaat aa ctat t t t gt g t gagat at cg t cat at at gt t cgacgaaat t ggct aaat t act aaagcat agagaat t t a at gat t t t t t aaaact gt t t ct aat ct t t g ttttttgaca t t ct aaat aa at acgt at t t cggaat t t ga gaaact aaaa ct acacact a caat t t aagt t at at t gt aa aacccaacgc ct at aat gaa at aaat accc t gt t aact ag cagt t acat a aaaat aagaa t ct t gct cgc ttcgaagccc aacat gt aat tggaaaaaga at agt t t t t a t t gt acat ga cgcct caaaa t gccacgt aa t agagact t c at ct t ct t at gat gt at cat agat at gagt t agat cacat aat gt agct g t ct at t at ag aat t t t t ggc gat cat t t t g t ct t at gat t ggt t cagat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 308 12689250 Sequence Listing.txt cattgttttt tgtaaaatat aggcccaatt caccataagt ccatgaccaa agat agaacc t ct cgagat c t acagt t cca t ccacct ct t at t t t t t t gc at at t t t cat at t aat t gt t at aat ct t t t gat t t agaaa aaat ccaaat t agagt ccac aaaaaaggcg tttaaaaagc agt cagcaca acaggt agt g t t gt gagct t aat act gaac at gt ggt cac t cgaaat at c caacct acac acgt t t t t ca caggt aat ga cacat aat t t ccat caact t t ccat t at gg t t gct act at acgaat at t t aat gaagat a t t t aaaat ct at ccat t at t tcccaaccaa t agcgat cgt caggat ct t c gt caat t gt a t cgaaaat t t agct t t ct ag act aagt t t c t acat t gt gc aaaact gt gt t t aaaat aat t t t ct ggt ca gat aat agat gcaagt t act t gt t acgt aa ggccacat gt gt at cat aaa acaaat cgag aacaat gg t ct cgct t t c t aaat acaat ccagt t aat t aaat t t ggct aat at gaat c cgaaat aaaa t aat ccat cc t aaacgcagt act gaaat cc t t cagacgat at at aaaaca aaagaaaat a ttttaaagag t at t ct cat c aaagacgaac gt gat caat g t at t gacgt a ct ggt aacgt cgct t t t ct a at t t ct t ct a cgt caat act agt t at t t t c gct aagaaat at aat t t cct t t t t t t t ct t ct at aat ggt ct gt aat t at t ggt gt gacg t at aaat t ac act t acaaaa aacaaaaat a tcgccaagct acacaat ct c gaacgt at ct agt cct ct gt t aaat aaat g cat t agt caa ttacaacaaa ct aaaat ct t t t aacat cca t t t t caat ca caacagat aa aaat t at t ac t aacgact ag ct aaaccct t aaaaat ct ct 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2008 <210> 283 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 283 tcaggtaact tgtgggtctt gct t gagact gactttttca ctaaataaca aaactgtttt tcgatccccg tacctctcgc atgctaagcg gatgactatt tat t aaaata ttacaatat t tacgagcggt gtgtgttgaa aagggactaa gatat t aata gagaaatcag aaactaaagt gtgtgtgatt gactaaattt tagaatagtt aggatctagg gtaaaatttt cagtttgtat tcaattggag ggattttttt ttctttcttt aaacaaaagt aatttgtttt tcaatccact actctacatg tcatgaattt ttgtcatttc aaaaaacata ttagaagaag taaatacct a tgattggtgt attcagctta taaaataat t cacggagttt aaaat agt aa agcgct ct ac t aat act t t c t gat t gagag cggat act at at gt t ggaaa t aagt aaaat cagct gat t a at t gt t acgt at gt at agct at agat aat c aat at gcat c t cacct at ct ct aagt ggag cat t t gagct aacgt gt aac aat t ggagat ct aat cagt a gagaat aaag aagt t t t t ac gt gt agt at a caat agacat t ccat t t at a t t t act t t gt at at t t gt t t t t caat agt t at gcggggt a acat ccccac t gat gcaaaa aat gact aac aagt agagat aaaagt t agg t ct t act at c cat caagct a t aat agt t ac t at t aaaaaa t t at t at t t a ttacaaaaag 120 180 240 300 360 420 480 540 600 660 720 780 840 tacattgaat atcatatgat ttttagtttt tttttttttt ttcatttacc ctcgatctta Page 309 12689250 Sequence Listing.txt caacaaataa taagtccagt catacgcacc taccaaaaga accaattgag gcatgtggtt cagt ggt tag act t caaacc t t t t ctcat a t t t at t at at t act aat t ga act t cat gga agat t t gcat t at ct aaat g gaacccaaag aat caact cc t t at ccacac caaagt aaca aaat t caaca aaaat t at at t aat at t t t a t t t t gat t gt cgccacgtt c aact t gt gt c taaaaaagag ct aaacggat act aaccat c aaaaaaacaa tcaacgaaac ct gt t t gagt aaact t t gat t gt t cgact t t t t t at t t t c at gggt t agg gacaagat ag aagagt agca act aacaagt gat t gagat t gcaaagagga t gt aat t agt at ggat t t ga cgccacgagt gt ggacact c aagaaagaaa gggct cat t c t t t gt t t cat agcat cat ca ct agt t ct t t t t agaat ct a t ct aaat at t caat gaaat g at cggt t gac gtgggtgaca aaat at ct aa aat aat t t gt gt t cagt t ca t t gat t t cat t aagt aat at t t t ccgaaat t aat aacaca cgcgat t ct c t t aaaact t c aat g t ct cacgt t t t ct ct acgt c at caat t t t a t t ct t aaacg gt t t t gaacc gt at gt agaa acat at act t cagat at t ac t ggat agat a gt t at t at t a gt t ct acaaa cat t t t t aaa acaacat t aa t aat gt t gga agt t at at ct cgt t t ccacc t ct cact ct c act caacaaa t at t t t ggct t t t cct t cat t t t t t at t at tcaaaaacaa caaaaaat ga gt at cct caa ct aat t gagt gacgt t ct t a t t gacct gt g t t t ggt aaca at ggt gacaa aat caacat t aat t ct t aat aaat cagt t t gat aaat t t t aat caacaac t ct cccat ct gct t aaccat ccacct accc ttgat t t t cc gggatccct t t gagt tct t t t act t at at a t at acaaagt gagat agaaa caagagttt t ggct ct ggcc aaat t gt at a t at t cat t t t caaaaaaat a at agaaaat a t caat accag gt cat aat t t ggaaacgcaa tt ct ct at aa aaact gt gag 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 284 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 284 tcaggtcttc tctgtagctc tgttacttct agt t agctaa aatgaatttc tccatataat gttagcttta tctgcatcca aagttttttc ctatgtttat aactttatac agtctggttc tactcattca tcctttggta actctcaagt acttattgtc tattgcatca atcttctaat tgtttcattc ttaaacctct gtgtctcctt tgtctttctc ttctatagat atgtagtctt gtagtcttgt tagagagtta gt t gagatat cggttatgac caatttgttg tagctccttg at cacagt t a cat ggt t t ac cat gat gt t a act ggagt t t t t aggt t gt t gcaccaccct gct aaat ggt gct agat agt t acct ct t aa t aagt agaac Page 31( t cgggt at t t t acaggt t t a t gt cat at gt ct gt gat t at t gaat t gcct agact at t t g cat gct t t aa t agt t ct aca aagt at cct t t t actgggac gagaaaaaag ct t gat t cgc gat accgt t a gt t gagt aca ct gt t gt gat aacaaagagc t gt ct t cacc g tctctct t t gaacgct t t c cagcgagaca 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt gtttatgtga atgttcatgc ttaagtgtcg aacgtatcta tctctactat t ct t gt t aga acct ct t ct c t ct gt agt ct tctcct t t t g gt t at gacca taagaaagaa gt aact ct at gaaact ggac accgt ccat t aaaagcggtt t act cacat t aaat t aacaa t aaaat cat a acaacaacaa tttttttttg ccgaact ct c at caat gcat act aat ggt a gggt t t t acc at ct aaact c act ct at at a t t at t cgt ct t t t ccaaat c cagt t agt t t t t caaagt at t gct agat ag t agcct t gct at t t gt t gt a cct agt at gt agct ct t t gt ct aacaacag t cat t t at cc caat cagat t aaaaggat t a at at t ct t t g ct cccat t aa ccaaaaat at tcaagaaaag t t aat t aaca t aaaacaact cct ct cacaa t cacaaccat t cat cgacca aagaaat ct t t ct t cgt t ct t t ccgat aaa t at at ct cca cct t gaacgc ttagt t cttt acagagt aag gct cct t gt a ggcat aacca t t t gt t caga t cat t ggct t gt t t caaacc at gt t t t aat t t ggat aat g aaat agaaga aaagat t t t a t t at t t t t t t gggagat t at t ct t caaat a tgcacgtgga accaat caaa cgaacatt ct at t t t t gacc t t cct t cgt t ctagtttttt aat g t t t t t t t gt a t caccggt t a agct ct ct t t at gggat at t agt agaact t gat t gcaggc aagaaccagt t gaaat caag agcccat t ac t t t accaaat t aaaaat t ct aaaagccttt at gt aaaat t cct t t t t t ac gt aaacagat aggaaaat t a aagagagact at act gaat a cgaaacattt gt ccgat gga at t gct t acc cct cagt ct c gt ct t gct ag t gaaat ct ct t t gt agcct a acct cct t ga aggat agagt t ct gt ct cgg gat t ggat ga ccacaacaat at t t cgt ccc t ct t t at gaa gaacaat t ac ttcct t t t ga ct gaat at aa agcaacaaga aaaacaggga t gat ccgcat at acgct cca at gccaacgt t aaacagcct aact ct agcc aaat acaaac t gt t ct t aga agct ct gt ag t t gagat at t acact at agc gt t ct t t agc acgct ct ccg gagt caact t ct acagt aac t t cgt cct t a gcct at at ga at t gat aacc gt t t aaat t a t gat t t t gga caacaacat a gat at t t t t t aggaaaaact aaat aact aa at t t aggaag cacaagt t gc gt acaaat t a ggcgccatag tcaacccaaa cct agccgcc t ccct t gt ag 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 285 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 285 tatccatgaa taataaattt atgggtttgg tacacattta ccttaatatt tttcagcatg aaatcttgtt ttatcattca gttttgttcg aaaacaaatc agatcacata tattcatagc acctgtaatg tttgttatac acaacaagac agggtcctaa catctctcaa cagggt t ggt cattctcctc ctcctcctca cgtttatatg ttgtgt t t aa t at t at t gt a at t t aagct c at t t aaggac aat gat t t t t gt t t t t gggc aaat t t t t ca Page 311 gccat gt t gt t t acgat at a ctt ctt att a ct agt gt gt c gactgggccc ccaggtggac cct at at aaa t ct ct acat c t acagt ggt g at ggt aaaat acacgt gt ga agcccacacg cct t t t at ct t t t aat caat 120 180 240 300 360 420 12689250 Sequence Listing.txt tttatttata taaaattctt gacatgtaga ttataagatt aagatgaagt cgttgtcaat t t ggacct t c at gat gat gt t at aat t t gc aaat act aca t accacat at at t at at aac agcact agat gt t t cct aat t ccct aat t a ggggcacat g aaact t t aaa t gaaaat agt at aagaagga cact t t cgat ccat t t t cat t t t at at at t at t t ctgttt aagt act aaa agt t t gat t a aagt t ggcct gacat cat t c aacat cacct acacacgat c t ct ct ct ct t ct agaagccg at aat cat t a aat t t t t t at t t agcaat aa at ggagccat t t accat at t ggt t gaccat ccaat acaca t aaat t at t c t acat aat t g t ct at t gact ggaagt at ag at t t at t t ga ct t aaaaaaa caacgaacga t t ct t t ccca at aaaccat c tagtct t t t t t t gat at t t t aagaagaggt gaat t t at t t t t gcccact g t ct cccgaca ccgt t caaat caat at t tag caaacgtttt aagaat ct cc t at cgagaaa gt cagat t ct at gat gt t ga t t at t t at ca t t agggt cga t t act gt t ga t aagaatt ag ccagcat t at gt aat cat gt t cacaat aat t acgaaaat g at t caat t ag ggcaagcat a at cgaat cca t ct gt gggct accaat t aaa ctt ct t t t t t ttcaaaagac at acct t t t t t gggt ggat a gct t ggccgc acact cct t a aat cat cct c ccct t t gt t t agtt ccccaa t act cgt t gt tat g t ccat cgt aa t gat cat t at acgct at t at cgaaatgggc gt t agt cgt g t t agcat t t g act at at t at ttaacgcaca ccat gaat t t ct at ggcaga t at t t gt cgg ggccaatagg cgggaat caa ccat t at gt c t aaaaat aat ttct t t gct a t cgaat at ct at gcgagt gg aat t cgacac aat t at t ct c cggt ccaacc aat gat caat ct ct acacac aaaaat at ca cgat cggat c t cat gcat gg gct gt t at aa t at t at gct c at gacat gt a t t t ggccct c cacct gcat a aat cat t gat at t act t gaa agagccaaga ccaggacttt cat t t t at aa cct t gt gaat aacacat gt t t t agct gt ca ct ccgacaaa ttat t t t t ca t t t gccgt aa t aaaaat gat agaaaaggag aggaaaat ga aat cccagca t ccat at at a at ct cacat c aaagaact aa acct aact aa gggcatgaac t t t t at t at a ct t t gt gaaa ccgccgccca cccaccgaag gaaaaaaat t cggaaggaat t t aacat gaa caaggccaca t agt t cgt t t t at aagaaat ggcggacccc t gaagct gac t t t t t t aat t acact ct t t a t caaat t gat aaact gat t a t cat at t t ca agact gt gcc cgccat t cgt t ccacct ggc at t t t at acc cccct at ct c aagaaaaacc t t act cgt t g 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 286 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 286 gaattttgtt tatagagaca aaaaaacatg aagtaatgaa cattaaaaaa aatcttagta atataatggg ttctctaata taatctaacc ttcgagtgag aaaaaaatga acataaaaaa aagaaacaag aacaataaaa agagtgggag acaagagggt gataaatggt atagctacgg Page 312 12689250 Sequence Listing.txt gacggtgacg tatgatgtga aagaaaggaa gagcgcgggt gat gatggaa ggcat gt gaa gt gct t ccaa ct at t t t cat aat ct cct ca at t at ggt at agcagccggc ct t t gct aat t t cgt aaat t t t gt aat t t t gt cccat aac agaccact ca aacaaaagaa t aat aat t t t at aaaaagaa aat aat at t t ct at aacaac t ct cact t ac aaacct aat c t ccat gt t t t cat t t t ccat t at cct ct ct caggt act ag t t t caat ct g t t t agt t t at ct gaagaagt agat gagt t c aact t t at ct t gggt t ct t c gt aat t agat gt cct gcat a caacagt gaa cgt gt ct ct t t t gt ct cat ct agt t t aac ct at t at t gt aaaacaaagt t gct ct cagc tt cat t t t t t cat t ggcggt t at t gaaact gcct ct t aaa aat at accat aat aaaat at taacagagac tcatgt t t t t gacat gt at t t cat t ct caa t at cct t aat t gt t t ct ct c ggacggacct ct t t ct agt g t t t caat t t c ggt t t gt t t g t t t t ctcgt a t agagt aaga tgggaacgat t act t gt t aa t t aaaat t ga t gat t t t ggg t t gaagat ct gct gcaaagc t gat cat gac t t t cacat t g t aaat aat t a acat t t cat a t caat t gct g ccat aat t at accactt gt t at gcat at t t ct gat t at aa t at t ggat gg aaaacaat ct aat aaacaca aaaaaaaagt t t aagct t cc t at t at t at t ccgt cagat c accct t act t ttct t ct t ga t gacct ct ct t t t gt t t gat t ct aaaagga t t t ggt t ct t agt aagt gat aat t t t t ggt aaggt t gaat gcaaat ct aa gaat t t t gt t ttgt t ct t ct gat g ct gat t t ggt ct t ct t cttc at t gt gt t ct t acat t gat t act at accac ct t t ct t gt c aaacgagt ag aaat gct ct g ttagacagac t ggaaaacat t cccat t t ct gat t t gat t t cgcat gt t at cat t t caaat ttacaaaaca t ct ccat t ag t aact acagt ttgtagt t t a t t gt t gcgct gct t ccct t c cct ct gcgt a act at t gaaa gt agct t ct t t t agt t gt ga t t t t gaat gg t ct ggct at g cccaat t ct c cat gcat ct a t gt t gt t gt g t gt t t ccct t aaat at acaa t agaat t t t g t t t gt t gtga accaaaagt a t ct ct cgt at aagaat at t t t ct t act t t t aaaat t t aaa at t aaat t t a at ct aaat ca t act t at t gc t t gt ggacct at at aaat ca gaaacct gaa at ct aacgac act ccacacc t t act aaat t t t t gt t agct gt gcat t t t c ct ct gt t ct t cct cct t ct c t ggat t cat t t t ct gaat t t aaagt gat t g aaacct aggt at ct gaaacc t agaaagt ga at gct aagca t t aggt gaca ct ct t t aagc ct ct t cact t t t cct cgt gc gt ct act t aa gagcat at t a cat cact ct c aacact ct t t tttttatcca ccgat acgt g t aggt at t t t ttttaacaac t aaaaccat a acaaaat aaa aat t aaaat t accat gt t ac t cagaat cac acat acgaac t accact cct t t ct ct ggt t t ct ct ct cac ct t ct ct ct t t agt t t caat at cat t t ggg t agggaaat g at t t t gggga t aaagt t t ct ct aat t t t t c agat t t t t at gat aaaacaa 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 287 2004 DNA Arabidopsis thal i ana Page 313 12689250 Sequence Listing.txt <400> 287 t t gat caat c cccgaggccg aaagct at t g at t acat at t t t t ct t cttc at ct aat at a t gt t at at t t ct acggaaga t gat t gat ac at aat aat ag cctgcggaag t agt t cat t t gct gt t t t gg gct gagaaat t t gat gaat t t at gt aat ct agct t gggaa agt gt aat t c t t ccact t t t t aaat aagaa aggaagctga ccgaagaaat t ct t ct gat c agaaagaaga aagt t acat g gt aat t t gt a at t at t t cgt caaaat at t g act at t t aag agt aaaacaa at t t t ct at a gt at t gt aca t t cat t t cag gt aat acaca t cgcat cgga t t ct cgt cgt t aaat t gcaa tt ctt cacaa t t t t ggccat tcggcaaaga caagaaagt a at aaccagga t t aat at cag aaaaaat at t t t ct ct agat t t agact cga ccaaagt gga t ccaaat t t t cat t t at t aa aacagcaaga t t t at t ct t t cacat t at t c gaaagcacac ggagaagaga ggctgcaaag ggagaaatgg gat t at t ggt gccat aat gt cagct t aacc t t caagt t t t t t acat t t aa at t agat t t t aacacaccct ct t gggcct a at t t t cat gt t aat cgct gc agaaaccat g gccat cacca cgct gcaagt aaact act t c t at agt t t at gt aaacaaaa act gt aacct ggt t cagt t c aaacaaaat a t gaagt t t at aat t aagt t t tttttttttc gcaagat aag gaacaagt ga t agt t ccat g ct t t at gaaa aagct t ct gt caat t t t gt t t aat t t at cc tacacagagc gcgatgaccg t accgt gcca t ct t t gat gt agt t gaagt g aacgt ct t at t t gagagact ct t ct t caat gt t t agt aaa t aat at at at t accaaaat t aat act aagg agaaaat gt c cgcact at cg act t t agagg ccat ccaagg aacaaat ct t aact at t t ca aacactt gt t t gt t t t t t t t t t t caat t t t accgaggt aa t t t t aact t t ccaat at ct a at ct aat t t c t gat aat t ca aggat at ct c gt cct aaaga at at t t t t t t cagagct cag ggaagctgag gt t t t at t t t gct ggaaat a aaat gaagaa aagct aaacg ccggaact gc t gcat t gt gg gt t ggt gt gg t at gt t t gga t ct t t t gaac ct cat ggt cc aacaat t aag t t aaat aat a at gt caagt c cccgt aat t a t t at at aaag t ct cct acct agcagaagaa agtcataat g aggagaagtc t t aaat t t ca tat t gt t cgt ggt at aatt a t ct t t t ggct tgt t gt t gt t t caacgt t at t t at t t gat t act t t t agcc agt aat t t ac tggtgtgtgt t aat caaagc agt t t at at a t t gt gt t ccc aagaagattt ctaaaaaaga t aact t t t ac t gaat gat aa caagatagct cggagaagat tccaaccaag t t t cat t t gt gt t t cat t t g gtaatgtgt t t t t ct aaact aagt aaaaat aaaaataacc ttat t gt t cc aaagcatcct aaggcccaat aggtct t gca cct t t gcct a cgacgatt cg ctgaacaaga t t t ctctaat t t gt t t at at aaat t aat ac gcagaagaac agt t agat ca t t t t t acct g aaaaat t t ca at gat aat at gtatgaagat ttgggaagag at ccatt gt t t at at at t t g ct t cagt t gg t cgaggt aaa at gaaat gct caggagcaac caaatccaca gt t ct caaag ct at t t ggat t at t aact t g gcaacataaa ttact cat gt t aaact ccca acaccaatac aaaat t t t gc aaaaacaaaa cgt t t t aggg aacgt t t gt g aat t agggt t caagct t t ca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 cgcgggagct caacatcagc catg Page 314 12689250 Sequence Listing.txt <210> <211> <212> <213> 288 2007 DNA Arabidopsis thal i ana <400> 288 ggat gat t ga t t gt ggggct ct gaagt t gg gtct t t t t ca t cgaggt acc cgt t t gggt t tgaaaagaga ttgt t gcttt aaagat caaa gcgt at at ac cagact aat g aaagact gt t act ggct t t c t t ct t t t t t t t at at gt gt t ct caaaat ca agaaact agt ttaagaacgg aggt gt t gaa t t t ct t t cat at aggt agaa agcccat at g at at t t gt at acgt t agct t agt t t t ccga at ccgccgct gctt cagccg t gcgt ggt gt gt ccgt t ct a t aggt t gagt agtgagagaa t gt gagt aat tagacacgaa t ct ct gt cat t acct t at t t t act at gt t t t t t aagat aa cact t t at ca gt ct t gat t g t aat aat cac gagagaagca t ct ct gat t t gt ct ct gagc tgt t cct t t c ct aact t ct a t gacct t at g caat t gt gaa gt aat gaagg cgat ct act c t cat caagag gct agccgac t t at ct aaag tt cat act aa ct t aagt gt g t t ct agggt t t ccgt ct aca ct t ct t cttc t t ct ggagt t cat t t t t t aa t agagaat cc ggat ct t ggt ccaccat aca ccgaaactt g ggggct t ct c t cact ct gt t ttctct t t gt t cgt t ct t gt gt aagaat t t gt caat t t t c at caaccat g aagcaagat g gaagat agt t ttttttacac ct t gct ccct at t gaact aa at aacggaat agaaaat aca act t gt gagt t acacaggt g ggctagacgc aaaaaggt ag t at gggct t t ct ct caaaag gcccat t tag tttctttggc ggt t cgt t cc t ct t t t agga t gaat t t ct t t t aacacgt t acaaat cgag t t gagcct t t t accaagt ga cat t ggat gg ggat gct t ca t ct ccaaaat ct ct gat act gt at gagt cc ggcgt t ct aa at t at at gt t aaagat t aat at cgt agat t tctgat t t t g aact cgct gt at act t gcag at act gt at g cgt cat ggaa cat t t t gaaa t gt gat gt gt t gggct t aca gcct t t cgaa aagct aat gt t t ct ct t aaa t at aaaat at agt at at aaa aaact cgat t at t gt t ccgt t t ct cat cct agat ct ct cc gct cgat t at aaagat ct gt gaaagat ct t t gacat t cca t ggt at t gat acct ggt ggc at t t agt aga aaaaccggaa gaagaaacaa gaagat t at g tat t t ggttg cct at t t gt t acat gat gac ct ggaat caa agt gt act t c t act aat aat cct t t t gt ga gaat t t ggaa cgat acat t a caaaaat t gg gggctagggc ccagt ct t t a gaat t t caat aggaccat ac aaat at gggc caccaat aaa t cgcct agct cgct ct gt t a aaacat gt t c ttgtggt t t t cgat cgt act cgaact act c gaaggaaaac ggcct acaag ggaact gat a t t t t t cgtt t aagat acaat at gt t agt at t t gt t gct t a at t gccact c gaagatgt t t t at gat gat g cagt gat ct c t cggt t cgt g ccaat act t t t t aat t gaac agt t at t at t gt agt t agt t gagat t t t ga t t t gtgccgc tcagaaaccc t acct gt gaa ggact t aaaa tcaaaagccc t t aaaaat t g t ccat cgct c t ctt ct gagc at at cgt cgt gt agaat cat gt at at agt t agcat aagat t gt gt t gt ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 Page 315 gtt at agagc t agt gat t t g aagat t t agc t ct t gacat t ct t t t t t gt g at ct gt ggaa ct t ct t aat a ct acat t t gt 12689250 Sequence Listing.txt taatttcaga ttgttttgtg ttgaacataa gcattggttt cagtaacaat catgtattct acaacttttt tcttagataa caatcacata tttcaatttg agctctggtt ggtatgaaga gaaaat g 1860 1920 1980 2007 <210> <211> <212> (7 <213> 289 2004 DNA Arabidopsis thal i ana <400> 289 acgt ggact t t t agct agcc gaaat gacaa cagagccaca aaaagaccgt gt t gggaaac acaacagaac at at at agag acgagggcag gagat t t t ac tttttttttt t gt at at t aa aat ccaaaga agat gt cgt t ggcgt gt gga t ccat agt aa gt t gcaagga t cagct gct g gcgt t t ct gt t ggt acgt at gt at at gt t a t gt t cggct c ct t acct ct c cagt t ccgaa at t t t t gaat aaaaagaaaa ccct t gct t a att ggagat c t acat ct ct t t at ct cat ct ct t t cat caa ccaagat cac at cct gggt c t ggct t ct ct aggaagtggg at ggt ct aca tttttttttt aat gcaggac gat t ccagat at act t ggaa t aact t ct cc t at t gacaag aaaagt gaaa aggt gcat ac at cct ggat c gt at t agat t gt cat at aat at gct gt cca aacaaat t ct t aat t ct cac t ccct ct act ccact cat t a t ct gagagt a at gggt t aaa ggcct t t at c gt t at gccct taccaaacac t gcagt at ct t gt t cct caa at t gt t gaaa at at gt cgat ggt acgt t aa ttttttttta caagcgacgg gaagt aaaga ct ct gt gt ag aaagact t ga act at t t t gg acggt t t cgt ct act t agaa act t t agaaa cct t cct t ga agt at t at t c ct cct t t t gg t t t gat t ggt t caact aact ct t gaacacg cagct aacat accgagcat a at t at gaagg agcaaaggcg ccagt agcga ggatggt t t t at ct cgact g t cgcaat t t t t t t ct gt gct at t gct cat t ttcgagacca at cat cct ct cgat agt cgc agt t gt gcct t t caggt t t g aaaaact agt gat t t ggt gc aacat t gct g ccgt ccat ag caaaacaaac t gagt gat t a t ct t t at t t c gccgctcgt t t ct ct ct ct g t t t gat aat c t t t act t act t t atgagggg ct ct cct t ct at at ggat t c ct caacggct aagaagagtt cgct agcacc ct ct agt t gt cagact cagt tacaagcaga t ccct gagct agaacggt t a ct t t gct t gg t gagct at gt t at gct gat c caggat ccat gaaaggt aac ct t t t t t aaa ct t ct t t t gt at t cact gaa at gaggacca aaccggct at at at cat agc gct t t cat t t act ct aggcc act t at t ct a at gaggaaaa t ggact at t g gct gt gt gca t ccat t aaga t agagagtct t gat t cct gc t t t agt t t gt taggggagac t gcgat acag gggt gt t gt t tccagagccg ct ggt t t ct a t act acaat g gat aat t aca caaacgacag cct gt at t t g aagt gt t t ct at acgat t t a t t t gt ct cga gaaaat agct t gct t gaat g t gt accat t g tttaaaaaaa t t t t aaat t g gcagaaagt g gat t at t ct g at t t aaccct cgcaaagcat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 tgatagtgtt aattgaaagt catgcatata gtatgcgtta ctactaaagt ttaacggttc Page 316 aat t t t t t t g tttttagcaa t at gccaagt aaact ct t aa aat ctt gttt t caccgat aa ct cacgt t t c aat t t gaact gtt gt t t aga ct gagat act cct aaccacg at ct gt gaga at cct ct at t aaacaacaag 12689250 Sequence gacagt aaat aaaat t aat t aat t gt ggga cacgt gt ggc ccaacgcact gactgactga gtt aagat ct t aaagccgt t tattcgccgc ttccccttgg catcatccac aacaaacctc aat g Li st i ng. t xt t t t aagat t a acgt t gct cc cccct act t a gagat t t t cc ccggct at aa t t ct t cagt c aaagacgt t g aggaggggca accggt ggt c cacat gt aat at cgat aacc t gat agagat 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 290 2003 DNA Arabidopsis thal i ana <400> 290 at at agat t t gacaagat t a at ctt aact a t aaat aaaaa gaggccggca t caccaat t c t gcaaatt ac cgt aaagt ag t aaat t t t t g gagacaaat t t cat cagaat t t cgat at ca aact t cgaat acacaaaatt act t agt cca ct gat ct aag at t at t t at a ccaat cagaa t at t ct aat t cacccat at a agtagaagag aacagaggaa aagat t caga t ct aat t t cg caaaat aat c aaaact ct at t gccact acc cgt aggaact aggacct t ga cat t ct at at cgacgtt cgt ggt aaaaat t aggt t t gtt c ttctttggga agct caaaag tcaacaacac gct accaaaa aaactttttt agcat gtt ca atgt t t gttt gaaaccacgt gct t t act aa cat ggt ct aa cgt agaccat aat at t t ct c aaaagaaagg gcagat t t t g t t t t aat t t a at t at t gcaa act t t t t gat aggaagt caa aggggcaaga aat gacccca gaat t cct aa aggaaaaaag agaaaat cag caaaat ggaa cat acacaaa at gcat gcat aaat at gat t t gt gt gt caa t aggt gat ac aat aaat at a cagat gat ac t at aaaaat a t at at ggat c gt cct gggt c at t aaat aag t gaaaggat a agt aaat at c act at t t aag ct at at aat c at t at aggcc aaaat aat at agccaact aa t ct acgattt gaagt t agac t t t at at t cc t cgct cat cg aat gtt ctt c ct at aact aa cgact gaat c t t t t t t t t at cat t t t t t aa t t acaaaat a t at aagat t t t gat at gaca at t t t t ggac gt aaaaaact gt cgt ccaaa tttt cct gac at gt t t ct cc caaaaaaat c acaacat gt a t aaat gt ct a aagt t at t ga t t t t gtgat g aacaccagcc at t cct t ct a t aacaggt t t t acccaaact acagact t aa acggct ccac t t ccat agt t t t caacaaat gat t t caat t aaaaat ct t t tttttaagaa t t t gct t t t a aat at gat ac t t gt gat act cat at at aat tgaccacaag gcat aagat a t act at at aa t cact at ct t t aaaacct at t t at ctt aaa t cagt aaaac gaacgt aaat ccaaccct ag t t t ccct t t a gt gt ggt t ct ttcggcacca caccacaaaa ct accaaaat acgt aat ct t gcaat caaac tt cat ccggc aaacggat at at t t t t t agt t caaat gt ga t gat caaaca ct aaaaat at at t aat aagt aagat t t caa acat t at t ac gat gt gt aca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 Page 317 t ct gaaaaaa aaat at t aac aagcat gcct t ct t gcgt gt tagt t t gttt gaat gaagct t aaat t t at t t t t ct t atct ccacgt gt at ccatt ggtt g tcaaagagaa <210> 291 t at gaat at a at t t t t t t t c ct t t t t t t gg acgact t t t g gat act ct gt ggaacaaat c t agat t at at t act t t gt ca ggt ggacat a aaat cacaaa gt aagagaaa 12689250 Sequence tttgtaacgt ttgactgtta aaaataaaag t aat at agt a tcggctgccg tttacaattg ttttttttta catat t cgcc t gt ct t gct a aaact caat a t aacat aaat agaaaatgat t at at aaaaa gcaat ccaat at gt cat at a cgt aact aaa atccgaatct ctctctttct agcat cat aa gaagaagaag tgg Li st i ng. t xt t t acat gat t aggaaat gaa ccaat t gcga aat aat t t ga aaacat t aaa gggcaagt t g t at at at ct c actt gcggaa t ct at aaat a aaact acaat aat acgat at aagaggcat g t agt t act ct cgt t t t ct at ttact t t ct t at gt t at t cg at at at acaa at agaaaat g gt ggccatt c agt t aat caa 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <211> <212> <213> 2004 DNA Arabi dopsi s t hal i ana <400> 291 cagaat cgt t at caagagat aat at cact a tt cctt ccac acccat t t t g at t aaaacat gagatgaagc gct gacaaac t t ct t ggt ct t caaagat t c aaat cgt at t aggt t t t ct t aaagt gat t t gaaaaaggaa t aaacgt at t t ccagct t ga cccaaat cgt caacaaaaat at gcaaaat c cct aacaact at att gtt ga caat gt gt t t cct at ccat t t cccat t t ac ct aact t gt g agaaaact t g aat t ct agt t t gcat at gt a gt ct t caat g aaagt at t aa tttaagaaac t t t t gcaaat ggagaaacca caatggcgag at t cct ct t c t t t ggaaat g aaccct aacc cat aagaat g at agt t t gcg cgtt agat gt acaagat t ca t t gcat aat t t aaaaaat cg gat agagat a t ccaaat gaa aat agcacgt tccaagagag t at gat caag ct caagaaaa t ggaaccgt t cccatt at gg tgagagagca agagcaagaa aat agccat c aacct cagga aaacaaccct accat at caa ccat t gt ct g t t at t t agt c t t gct gt t gc agct t agcaa at t caaacga t t ggt gt at t aact act gt c t at gat gt aa acgat cagt t accat gct aa agt gt caact aaaagagt ca cat t at caaa cacagcgat t t cgat ct t gt act at ct cca gt cacct gt c t at t agagag at caaat cca t ct ct t t cgg t t t ct t t agt ggt t t cagca gt t t t t aat a aacct gaaat tt ggt acaaa t gt at t cgat t gt agt at at ttttaaaaca gaat caagt g t t cacaagat at aacaat gc gagagaagaa agct agacat cagctt ct t t ct gt gt ccaa gat at t t at c agt t gat gaa taggaacaac gt t aaact t a catt at gat c t ctt gct t t g t t gt aagtt c gt at ct cct a t gct gt agt t ccaaaagat g aagat ccaat agt at gt t t t t at t at at ac tcaagcaaaa t ct ccaaat a cct cat acaa t ggat gatt g gt ct ggaat t act at ct aat acaaagccat ggat t cat aa aacacgat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 gctacactta ct aaaccaca ttgagatcaa t aacctt caa tttactaaca acactaatgc Page 318 12689250 Sequence Listing.txt aacatcgttt cagatacatt tggtctctat ttccactaga tcaatacgaa atcaaccaaa agct cct aca aacaaaagat gat cgact t t t aagat gat c gat t caat cc ccat t t t cgt agaagagtca cccct act at gt aaat t cct ct aat t t aac acaaactct c tatcggaaac agtt agat cc tt gacaaat a aagaatcacg ggggaagctc atcgtgcgaa tagggt t t gc ctct t t cttc agtagccaaa aaggctagcc t aat cagct t at aaat gacc act aat acag t t t t at t ct c tt gagaaat c ttcaaggcaa atttggccaa t t gacaacat gaaaatgacc accggaaccc gagctcgaga aggggcaaaa caaat cctt a ctaagt t t t t caat t gggt c t at t t gcaat agcaaagcga atgg aat aat cat t at t t t at gaa cgagt act ct ggat cgt acc tcaggtgacg at t agggat t atagt t cagg cgtgggctta at at at t t t g actcgaggaa tccagagcga gat ct cgacg ttgttcccaa gatcgagct g atcgacgaca aaggaaccca aagaat t gca tgat t t gat g tcatacgaga t aaat gct gc t aact t at ct t aaact gcaa cgggatctct t t t ct cagat taatgaggaa acct t agagg gct t ct ct gc at t t t agact t t t ct cagt g agagagacac t gacacgt gc ccat aat gt a caaat agt t t t t t acgaat a ct ccggt aaa ccgtccggcg 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 292 2002 DNA Arabidopsis thal i ana <400> 292 t at ccgacaa at t t t ct t t a t t t cat t aga agat at at t a caaaagagga t t gaaggat c gt aaaacaat at t acacaca t ccat t t t aa accaaacccc ggt caccat t acaat t aaaa at t t t cat gt ct t t gt t t ct aact aaaaaa t at ggt at t t caat agt caa t aaact t aac aat cacaaaa t gagcgagt t gt ggct acaa t t at at t gt t aaagt at t ga cacacacaaa agaat aaagt ccaaagct ct aagt ct cacc gt ct aaaat c gt gcgagt ga t agt t gcat t gact at t gt t at t t gaat t t t gaggat gag aaagaaaact agt aaat aaa agaaat acac acacct t ct a tgt t t catta t gacaaat gg t aat aaaat g t aagat acgg at at cat t ca aat acccaca t t cacct acc gt aat gagt a gaaaacgttt t gat at aat t t t at agt cct caat t gcgca ct t t t t t ct c t aaaaaat t t t acccacct c agat gt t t gc acagct t t t a t aact ct agt aagaat at t a t agat caaat t t aat cagcc at t t t t t t t g acaaacct ct at gcct t t gt t cct t at ct c t t t gat gt gt tttttcctaa t gggt t t t gc aat cact aat act t act gcg gccacccagt act t t t t gt t t at at ct gt t cat at at aaa aggat gt t t a gat gt at gt c gt t aaacaaa t t t ggt cat c t gcagagcat agt aat t aaa act t gt at ca aacccggt cc at t acagt ct acaccat cga cagct t t act t aact t at ga cacccact ac t ct t t t gtt t t cat gt aaaa at t at gaaga gagat t t t aa aaaat cct t c at aat ccaac accaacaacc gaaagt gt gt ct at t ct at t aaccaaacga aacaat t t ca tacagacaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 319 12689250 Sequence Listing.txt attattttct ttgaaatgaa aaatccgatt ggttcacgtt atataacaag cggt ccccaa ccaact ccaa caaat aaacc t ct gact cac aaaaaggct c cat aat ct cc gt at t gt t ca t t gacgat gg cagct act ct aat at t ct ca gt at gcaaga t at gat t t aa t aagcat aag tagctttttt t gt cacat t c t t t t aagat a ttctat t ct t t agt cagaaa agccaaat ac ggt gaccagt t t t ct cgt ct gagccacaga cat cgaagca t t t t t gt t t t aagt agat t t t t aaccat t t gat gt gt ggg t t t t t gagat cct gt t agt t agt caact t g gt cact t gag t t t agaat t g gcat gat gct at t t t gaaaa at at at at t t cat t agt gaa caaaccgtt c gat ct gagt t ggt at t gt t c ccagt t gat t aagggact at t gt t at t gga at t t gagt ag t t t gaat t cg aaat gt gcat t t t t gct gaa ct t aat gat t agaat ct aag t aat gccaaa ct t t acaagt ccgt aaaagg ct t ctct t t t t ct ct t ct t c t t cgt at t cg t t caagt ct g t gact gt at c tcagcagcga agt t t at gt t t t cat gt gt g agacaat aag aagat at cac ct t ct gaaat aaaagct t ac aaat agt aaa ccaaagt agg cct at aaat a aacactt gt t gagat ct gat ct t ct cgt t t acgt gcgt at tatggt t t t t t t t cgt tat a gcgt t ggcat t at gt gt gat t aacat acga t t at gat t t t t cgat t ct t t aggat at ggt tcggaccaat t at t ggt agg gt ggt t t aac accaacgt gt ct t t t aaggc cgccgaat ct ct ct ct cgat t t t t t ccgat aat t agat t t gat t ct caaa gat t t gaat a t gt agct t ga agcgagt cac cgaat cat t t gt t t ggt t t a gaaact at t c 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 tcttttatga ttctacagtg gctaagtcat tttttttttg t gt agcagaa t g <210> <211> <212> <213> 293 2004 DNA Arabidopsis thal i ana <400> 293 gt gt t t gcaa ttccaacaga ct t gacagt g ct t gt ggct a acggt ccgt t gt t t ggt t t c t cat gaaat c ggagagaagc t t aat at t ga caat t aaat c t ccat t aat g at caat gaaa ttttgacaga t ct t t gt at g aaaaagaaaa gcaagtggag aat ccggaat t caaccat t g ccat cct t t t t acagaaaat ct t caagt t a at cagat ggt cagt t gaggc t acct t at ct gagacaagt a ttgat t t t ac gccaagggat ct t t ct t t gt t t cagggct c t at t t gct ca gt at at t t t g gt cat caaga cagcgcaaac tttcggggga ct at gcacaa cagt at cct t t gat gcagca t t t gccgt ac gt ggacct t c ct t aact ggg ttggggacaa cagt aagt gt at ct at aaca t t ct t agt ca t t gacat caa t ct acgacgt ct t t t gcgag at cggaaat t gt t ggt gat a act gacat t g t t t gat cct t at t gt t gttt caact t agt a t t at gt t aga aat t acgcac t agt t t gggt ccaagaccat caat gact gc at ggaact ct at ccgaat ga t caccat cac gt at t ggaat t cgaaaaat c ggt t agt t ga t gat gctct g gt act act ct at t ct t ggca t t t t gtggt g t t ct cgcat g gaagctcgga t aat cat gt c t t t cgt aat g 120 180 240 300 360 420 480 540 600 660 720 acagatagag tgactaatac caatggcttt ggctttgtat gtacccttct cctcaaacct Page 320 12689250 Sequence Listing.txt ct cat t t t ct ct cagat gt t t aagat cat t t aaat gt aca agcgat t cac t cgt act agc t t caaagct t cgggacagga t t gt t t t t t t gat t caccca gggact t t t g t acgcct at c t at act t t t c act aaaat gg at ccaaat ac agat t agcgc at aacagagc aagaaaagt a at t ccgaccc cact cacct a ct ct cgt t t t ctccattatt tggttactta ttcttgttac gtactaattt atgtttacct ccagaaaggt gggaat gt t g ttccaacacc ct t ccgt gag cct acat ct c ct at aagt aa ct cat aagt a t t cccgt t t a t cacgaagt a ct t t t t t gaa aaat at ct t c t aaaat at aa gccgct cgt g gct gcgt t t c gt gagat gca t acaacggt a t t gaat caga ct cct ccacg t ct cact ct a cgat t ct aca t cggat t t gg aaagacat gg gaggaagt t g t t gcgcggt t t t t cat acgc aaagt gat cc acaacaaagt aat t cacat g t t t t t t gaat t at t gct t at ttttaaagaa t t t t at t t gg aact cgt ggc at gct t accc gt t gccat gt aat aaggaaa t ct gact t t t t t ct t ct ct c aaat ct ct ct at gg t t cct aaagt agaaaaaat g cat ct accaa t gt t cat cat gt caagaggt at cgt t cat a acact t cgaa t aat aat t t a t aaat acat c caaat caaaa agt ct cct aa gcgt t acgt t agt caaacac gacccgt ct t ct cgcct cag at gat t aagg tggccaagaa t t t t aaat aa ct gccaat ct at cgcgagaa gt t t caaaaa cgacgat gat tgcgggagct at cacgact a agct ct act a acaaat t t ca at t cacgt aa aact aat cga t t t t caaat t agagt t gaaa ct agaaaat g tggtcggcgc aaat at t t aa gaat gacgac gcaat t t ggt aaact ct cag cct ct t cacg cat ct t caac at cgcgaagc ct ggat t cac gaggcat ct a gct cat gt t c tgcaccaaac t agcaat t ga cat gt aat ac at act aaagt gt t t t t gat a ct t gt ccat a act t gaaat a gaacccgt ct at aaaagcat agaat at t cc at t t gccaaa ct t t t aggt t ccact agat c gaaccct t ct ct ct ct ct aa 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 294 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 294 ttaggaacac taattgagtt ttccccttta caaaacaatt gtctacattc atcttcttct tatctccaat gatcct t ctc ctagcaatga tcatctccta cgacgaaaac caccacatca ttgagagaat ctacgaagcg tggatggtgg gtcttggtgc agagaaagat caaagattcg atgaacataa caccaaaaac ctcagt t at a ccaatgagga gt accgat ct atgtaccttg ccagtgatcg gtatcaggcg cgtgttggt g t at acaccac tctt caaaca t cggcgt t t c ccaccgagac aacacgggaa agat ct t t aa aact t ggt t t gt gct aaacc acgcgctt cc Page 321 cat caaat ct aaaaat gggt at at gccat g cagccgt agc gaagaagat g agat aat ct t aacccggttt gaccaagagg t gat t cggt t ct at ct cgat t t ct t aaagt gacat gt caa gactcggagg aat caaaacg cgt t t cat cg gct gat t t aa gt t t t gaaaa gactggagaa 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt aagaaggtgc cgtcgctgat gt t aaagatc aaggaagctg tggtaagaat t ct t t t gat t t ggaaaaat c aaatgggt t t t t t cat aaat ct ggggaat g aaacaaagat acacaaact t gcacgt t t cg t t t at ggacc agat t t gt t t gt gt t t agat t t gt t gggca t t t aat at ct t aat ggt ggt cgaagcggat agt gt at t t a t t t gt t atgt ct gagaacag ttgaagcggg caaaaaact g ggct at t t t g t t gt ggct gt ggggaaacag caggaaagt g t t at t cat t t cct caaat cc ct at gaaaat act ccaagaa at gagt t t gg t t t gaat t t t ct gat gat at gggagctaat at ggt ct t t a t t t gt caaga t gt gacaat t t t ct caact a tt gt ct gaac t t aat ggact t at cct t aca gt t t at aaaa tttgcagaaa cgaggcgtct t ggt cgcgct ct agacaacc t ct t cagggt gggt t at gga atggggagag t ggaat cgct t t aaaat t t a t gat t at t t t t at cgaaaaa act t t t gat a t ct gaaat gt t at ggagat g t t t gtgt t t a at t t t ggtta aact t t aaag at t t aaat ga t gat t gat aa t t ggagcagt aagaat t ggt at gcgt t t ga aggct gct ga t gaaact caa aat gct aagg ct aaagaaag t t ccagct ct at t t gcact a gt at t t gat g accgagaacg agcggat aca at gg acaaaaaaaa acggatgggt gt agt aaaaa aat aat t cag t gact t t t ac aat aat t at g t gcat caaaa aat ct gagaa t t ggt agt gg atggtgggga t aacaact aa ggaaggaat a t gat t gt gac gt t t at t at c t ggt cgt t gt caat ggct t t t t gt t accat ct t t ggct ca at t ct t cggt t ct t gat t t c gact t t gt gg gaaaggact a t aaagat ggc acat t at t ag cagt t t t t t c ct gcgaaaat agat at ct t g t t t t aggt gt gt t aat aaat caaaat gt t g aat t gat gat gacaat at gg cgat gat t t g t ctt ctt gt t aacaagatt g acat ct t at a aaaaat ggt g gaccagaaca gaat t t gaaa t gat t cat at ccagcct at t at gaaat aac aagat gt t aa gacggagt t a ct ggat t gt g acgt aacat t aaat t ct t t t at t t cgaat c t t t agaat t g t ct t ggat t g aaact t t t gt acgt t t cgaa ctt gagaaag aat t t t aggt at t t t gggcc acccat t aca gt at gt gt t a gt t t agggag tgacaggaga accaaggat g gt at t gat ac gggt t agt t t t t t act ct gt gaagat gt cc agt gt cgcca ct aat t gt t t aat gt t ggt t gaccacggt g agaaact cat gaagcgccaa 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 295 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 295 ttctcacacc attatatatg gtgatggaag ttattaatgt ttaaactttc tatcagttct tcaagatgta gcagaaatgt ctaaatattt gat t aaaaga gaataaggat catgt t gtaa ccaaagataa caatgaatat tattgtcttt aaaaagcaaa gactatgact ataaggatca at gacat t gg gt t gt aaaca t t t t t gt t t a t aacaaat ca agagt agt gc agt ggat cca Page 32' acaat gt gct agt t t t t at t t acat ggt t t gggct ct t at at aat gaaag t t t gtgagca t cacat t t ct t at gt aacac t t gt gt ct gt gaaaacagaa agagacaaat gt t t agt ct t 120 180 240 300 360 12689250 Sequence Listing.txt atttcttcct tcaatttcca tgagcatgcc taaagaacct ggagaggaac cctctctttt accat cagca aact t gacaa aacaactttt ct t at gcaga gagt act gt t tgcgagacgg ccagaaagag gt ct aacgt t gt t ggagaga ttgt t cattt gt t at t t t cg gt t t aaaaaa t gt t t aaat c at agt gt t t c t t t at t t t ct ttact t gtta t t t at aaaaa t at aat gct t gt t t t gaat a t gat t cacat gtcaagaggg cat at aaaat acagaaat at at t cct at t t t cacaat aca cacgt gt aaa taaagaacca gcgccaat ca tttgaggaag t t ggct at t g agt at t acgg ccggaaatgg aagaagagga at gccacgag gagagagat g t caat ggt ga gactaggcga t aaaaat aaa aaaaaaaaag t t ct cagt gt aact aggt t t t gt gt gaat g t gcat at t at t t t at at t ac gt t t gggt t a cat t t gt t ga at caat gt t t ct t t gggct c t cat ggt ct c t t ggt aaaat ccaacacgat cgt cagt aat ct ggcat ct t gagagaagcc cgat cct aat gagaagatgt t t ggt gact g t cgt ggagat caaagaagt g t t ccgat aat aaccagaggt t gagat cat a t t t t gat t t t t t t t ct t gt a t t t t act t t a aagagt t t cc gt at t t agaa act ct at act gcaat t gt cg gt t t gt t at a tt gt t t cgca act gat cggg at acacaaaa agcacgtttt t at cat t cac cact gaacat t at t t t t t at aaagt aat aa ccaat t at t a ct t ct t cttc at gt cgcaat ct ct aacagaacaa aacaat ccac gagaagt t gg gat gaaaccg agt t gt ccaa t ct act ct cc t gaagt t aaa gt t agt t at c gt aat t aaac aat t t t gaaa at t t aaat t g at agggt t gt t t t at t ctct aagccat at a t ggat t cct a agct aaaaat t t t at t at ga t t cct at ct t gaacaagcaa ct aaaat ct c gt acct cact t agccct t at ct aaacaat g aagccaaaag t act t ct t ga t ct gcat cac gt accagct t at gaggt t gt at gat ccaag gact t gt ggc agagagtgga at ct ccat t t ggt t aagaga t agt ggat t t t at t t t t t t t ttagaaaaca t t t cggaaat ct gcccagt t at cct t t t ca aat t t t aaat t t at aagt cc cct aact t t g tt gtt tact a t t at t gat t t agaaat aaca tcacacaaaa at t aaat gat t t t aat t t at t t ccgt t at a t t t aaccaat ct t cgaagct t t t cct t caa t agt gt gat c aacct t t ggt gt aaggt aat cat t ggt ggt at aggat t ga t t t gtt t t t a ggt at gt at t gt t t t t t t ct tt ct aacaaa t ccat t t gat t ct t aaaat t cccct gt t at ct ct at acaa t agaaggt t a aat agaat at t at ct caat g t t t gt cat t c at t at t t at t gcaaaagct g gat t t gacaa ccaaaaaat a ct t gt cacag aat gt caacg aaaagct cac t t t gt t agag 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> 296 2004 DNA <213> Arabidopsis thaliana <400> 296 cgattcgatc ttctccggag atttactgca aaaaacttag ggggagattt aactttaaga ggatagaagc aaaataccta agaaaacatg gtaacgtgga tttgcgataa gaacgaatcg Page 323 12689250 Sequence Listing.txt aagtttgaca atgatgttaa aacgatatgg attttgtttg acgaaaaaaa t aat aacgat aaact t aaat aaaggaaagt agaaaacgt g t ct ggt t act t t t t t t at t t cgat t ct ct t t ct t t t t t cc agaaagt t gc tttagcgagg gtct t t t at c at cgcccat a t ct gt gt tag ggccaagat t at ct gat cat ttcct t t t cc ct t cgat t ga aat t gat t ct ct t gt t t t ct gt t t t act ca t ct cat cct t cct t t t gaaa at gt at gt ag agt at t t gt g t agt caagat t ct gact ggt gat t cct gca t caagt t cat agt t t at gct t at ggcat ga t gt ct ct gt g ttttttaccc t agt t t t aca cccaact t t g aagcccat aa gcct t cat t a t at t aat aat caat cggat t t cat t acaca caagat t t ac aaaat gt t ac caaat t aagc ct ccgagaca ggagcacaaa t agaat t t aa ggct t ct t ga caaagt t t cc t t t ctgctgg aact t t gaag gcgat at aaa act t t t t at a t t at cgct t t ggt at agct a ct gt gt t t ga t agt ggat t t gt t gt t gat t t cagt t act t t ct gt t gcgt aactct t t t t t at gcat t cg at gt t t t at t tcaggaaaaa t t t t t at at a aat t t t at t a aagt aaat cc gact at cgt t aacaact acc ct t t t t t t t t t at aagaat c gt ct t agt ca aagt agt agt cggcgaagcc t t t t t t ctt c t gggcat acg t aat agagag ct at aacaaa aact t ct t ct t cct t t t t t t gagct t acca t t aat t t ct t ggt t t t gt ag gt gt gt gat c t aact cct ac gct at aaaga at ct t gagag accct gt t aa t t ggat t gt a t gaaaat gat cct gt t ct g act gt gt gt t ct t gt at gac ccct ctt t t a aat g aaaaat aaaa ct t t ct t t ca cagaaacaga gt cgt ct t cc accat ct t t c t t t t t t act t aaccccattt cat cggt gaa act t gat at a gaccct gt t c t t t t at t t t a t at t cggggt gcgt agcaca aagcct ct ga t ct t ct t t ct t ct t t t t t t a gct agct act aaagct at cc gt ct ct t t cg at t t at gaga gat t gct t t c ct ct t aagag gct gat at t a ct at caact a ggat gact t a aaat cat t ga ggt t t agct a t gaaat t ct g t ggt gat gaa t cagagt t t t caaat t t aaa acgat aaaaa ttacaaaacg tcaaaaccca t t t ct ct ct c t t gat ct ct t t gtt ctt ct t aat at caat t t ct t aagccc t t gat at gt g t t ccat t at a ggt cccat t c agagagaagg cgagt gt gac t ct ct t t aaa act at at at a ggt ct cact c t ct gt t gt ca at ct ggct t g agaagcat cc cctgtgggga t t t at t at t g gct cagat cc ttat t t gtgc gaaaggaagt acct t ggaat t gt at t ct cg aat t ggt t ct tgt t t t t gt g gcactt at t t aat aaaacct t t ct t cgaat aaaat acaaa caaaaagccg act t t ct act t ct t t ct cgt at cct t t gga ccact t t act aat at t cccc act gt t ct gt t gt gaagt t t aaat at acaa at cgt cgt cg t ggt gat cga t agt cacaag ggt acct act t t gaagct t g t t ct t act ga t caat accct at gt gt gt t t aat t aat t aa ct t t att gt t agt ct act t t tcgt t t t gt t gact t gt t ca gt t cat ggaa at gacacgat at ggagt t ag t agt gt t t ca t t cat at gat t t t gt t gtct 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 297 <211> 2004 <212> DNA Page 324 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 297 accaaaaaaa caat agat at t ccaat ggca t t gcat gat a aat ggcaaga t gct t gt t at t at t agact a t gt aat t gt t ct act at gt t caatgttttt t gcat t t t t t tgcatttttt at act at t t t t act aaaaaa t gaaaagat t ct agaaaat t caacgt t t aa t t t aaaaat a ccat t gaacc acacaaagat t cct t cgt ct cgact t caaa aaat caacaa t cat t t aaca ct agat t t t t at t t t aaaac tat t t ccttt t t t t t t t act at agat t cag at aaat at aa agt t gaaaaa t t t t gt t t t t t gt t at t aca gaagaagaag aaaat gaaga at agct agt a t cat t cgt t a gct aagaat t cacat ggt ac act t ct at ca t t at gcaat t t agt agt ct a t gt at gcat t t caacaaact gat acaat at t t agt aaaat ct t at aat t t t at cgaaat t gt at aaat t a t aaat aggt c at agt aaaat ccat gt gccc at gact acac caat aat t at gaccaat caa t gt t t ct aag cat t aaaaat t ct at t aat a aat t acat ac t cagct cct a t t t t t ct t t t t t t act caaa t t gct at ct t act t t t t gt t taaaggaaaa agt t at agat aaaccaaaac t cagaat t ct caat t act at agaat t ct t a at t agat t aa t agt t t t t at at t t t accaa t at t agt cag t t t at t gggt aaaaat agt a ttttttaaca at t ggcaaat gat t t aaaat agat t t caaa at t at aaaca t cacgagt ct t accgt ggaa at aaacat t t t aggt t gt t t t t aacat aca aat cat t aac aat at t t aat ct at t aat at t t t ccagat t gt acat agt a t at aaaat ac gat t t agt cg gt gat t aagg ccacctt ct t ttttttttta aagaaaggt c aaact cat t a caaaaccat a t agat t aacg t t gat t aat c gat t aacgcc t t gacgccat t t t gact t t t t caat gt t t t t aaagaaat a aaaaaaat ag t cat t ggat t t cat ct t gaa cat at at t t g t act gacaag cat at at t at at gggt gcat accact cct c aaaacagt ac act act agt c aat t t t ggaa cgt t t t gaac aaat gct gt a gaaaaaagt a t t t cagt t t t ttttgttaac at aat agat t t cggat aaat t gt acaat ga gt aaaat t gt ct at ct ct ct aaattttttt t ct gagat t t gat at gcgat gct at at t t t cacat at t at t aagaat at a t at at t at t t ggt at act cg gtct t t gtta tttttaccaa aact at acat t t caat t ggt t t ct t aaaga t t t gaat aat t t ct aaaagt t t acaacat g tgcatttttt t t at t t gact gggatcagac agct gt aact t act act at c t caaat t aat aact gt at gc aaaaaaacca t agat caaag t t t t t t aaat t ct t gaacaa t ct cct at aa agat cct aat cgat cacaac cat t at ct ga acgaaaat aa t t ct t caaaa t caaat cacc t t t ggaaat a t t t aagaat a t agct t caca at t aat at ct agct t cacat t t t ccagt ac gt t t t t t t t t ct ct agt ct a at t aat t t t t aaat t t t gt a t at ct aaat g act aat ct gc cat t t at aat ct at caaaaa t cct cat t t t act act act a t aggct acat t cct at at aa gt at t t t t t t t agt agacga aat aat t aaa t t aacaaat g ct ct agt gct gt t t t cagat gt act at t aa aat gct t t t c caat aaat aa aaaaaagccc t acgt gt t t t t att t t t gag at t cct ccat at cat cat ca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 tcgttgaaga agacgaagaa gataccttcc tctttatctt ctccgatttc tcaaccttcc Page 325 12689250 Sequence Listing.txt cgtctgct t g tcgtaaagaa atgg 2004 <210> <211> <212> <213> 298 2002 DNA Arabidopsis thal i ana <400> 298 act t at t t ga t t cgt gt at c t gt cat gct t t at t gt ggaa at at ct gt ga ct gat t gagt ct t t caccca aaat agt gat ct t t t t t t t t t t t gatgtta atagaggaag aat t t gcaca t cat t t caga aat aaat t t a aat t gt aagc caaagt at at aggt gaat gg caggt t aaca gt gt aat ct a at gagggt t t t t t ct cat ct gaaaacaaaa gt gaact ct a cat at t caaa aaaaccagga gggaacat aa acat t t gt ct aat t aat at t at t act ccat aact ct gt t t cat gaggt cc t act t cggt g act gcagat g t t aacat t ct cgct gt t aag t gt t agct t g cgt t t gat ct t gt aaagaga cagcaaact a ct aaaaagcc act ct aat t c aaat at ct t t aggt at t t aa t ct t ggact g gt gt cat t aa aaagcgcagc t aaact t gt a caaaggtggt aggt t t gt t g cct t t agt t g caaacaaaca gaaaccct t c gagacat at t ggt acact aa at aaat gaat cagaaaaaga at ccaagaca acccat t caa t aat ct t ct t t agt t gat at at gct ct t cc aat aggaaca aat gt at at g gt t t gggt t t ct at at gaca t ggaagt t gg ggat t aaggt acat t ggct c agat aact t c aact t ccaat agt t t t t gag gt t gaccgt t at aaaat at a ggt caaagac ct ggaggt aa aaggct at ag t t t gt t t t ct gct t gat t gt t cacat at aa aacaaagaga at t ct t t at t t gat cat cgt aaat caat aa cagagaagag ggaagataga acacggcact t gat t aacac gagt cacaat gct at t gagg t t act cat gt t aagct gat a t t gcggat t t t t ct gcagct caagt aacgt aact gt t aag t t t t t gt t t c aggt t t cat c t act gcgaag aacagagat g ggat ggt caa cgt t ccacat ccaact t aag aacct t gt ga ct aaat cgaa at at ct ggt t t t ccct ct ct t gt at aat t t aat t t gggt a gt gt t ggt t c agt t t t t cat gt ct t cat aa aat t acat t a aaacat ggca act at t t t t t gct aacaaca gt ggt t ct ac ct ggaat t gc t t gct t gaga tcatgt t t t g ct ct agt ct g t caaat t t t c t ct t agt t t g aaat gaagt c t gt t aagct t gaat gt ct t t ct t ct t gt at agt gt t cccc ct aaggt at t acccgt t gca at aaagct ag gcagaat aaa aat gt t t t gt gaacaaacac ggt gagt ct g agt t t t t t ct at at gt at ag aaat ggt t t g ct t at t t gt t t ct t t at t aa t t t ggt t caa caaat t at t t act accgat c aaagt t t t t a agt t gaacct at cat t ggt t t cgt at gt t t t gaat gct t t t ct t at ct ga aacgt t t gt a t t acacgt t a t gcct t gggt at act gaaac aat ct t t t t a gt cagt t gcc aggaacaaca acaacagcag t ct ct at t t c aat aggct t g cct t gccaac cacat aaacc cct aact aac act aagcgac agt at caat c t t ct t aaaaa cgt at ggt gg t t t at aat aa tgt t t t cat t t t ct aaaacg agt acagcca t t agt acact at gagct t ct gaaaagagag at cccat t cc aact ct ct cc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 Page 326 t t t cccct t g t t ct ccgact aagct aat ct cacgt t gcgt caat ct aat t gat aaggcat at aat aat ga ccaaagggaa gt gat aat ct gcaaat caaa 12689250 Sequence Listing.txt ct t aaaataa taataaatac cataaatatc ttccgatttt tgacaacaat atcttttaaa attatcttaa aagaaaccga aaagacgtgt agcagagacc tatacataca cctatacat t actgagggtt ctatctatat taatccttct ctgcttcctt tg 1800 1860 1920 1980 2002 <210> <211> <212> <213> 299 2004 DNA Arabidopsis thal i ana <400> 299 cact t at t t g t t t cgt gt at t t gt cat gct at at t gt gga aat at ct gt g act gat t gag t ct t t caccc caaat agt ga actttttttt ct t t gat gt t aat agaggaa gaat t t gcac ct cat t t cag gaat aaat t t caat t gt aag ccaaagt at a caggt gaat g ccaggt t aac cgt gt aat ct aat gagggt t gt t t ct cat c agaaaacaaa t gt gaact ct gcat at t caa aaaaaccagg aaact ct gt t ccat gaggt c t t act t cggt aact gcagat at t aacat t c t cgct gt t aa at gt t agct t t cgt t t gat c t t gtaaagag acagcaaact gct aaaaagc aact ct aat t aaaat at ct t aaggt at t t a ct ct t ggact t gt gt cat t a gaaagcgcag at aaact t gt acaaaggtgg t aggt t t gt t t cct t t agt t acaaacaaac agaaaccctt agagacat at aggt acact a t t aat ct t ct ct agt t gat a gat gct ct t c gaat aggaac t aat gt at at ggt t t gggt t gct at at gac t t ggaagt t g aggat t aagg aacat t ggct cagat aact t caact t ccaa tagt t t t t ga agt t gaccgt gat aaaat at aggt caaaga cct ggaggt a aaaggct at a t t t t gt t t t c ggct t gat t g gt cacat at a aaacaaagag cat t ct t t at t t gat cat cg aaaat caat a t gagt cacaa t gct at t gag ct t act cat g at aagct gat gt t gcggat t t t t ct gcagc acaagt aacg gaact gt t aa t t t t t t gt t t caggt t t cat ct act gcgaa t aacagagat gggat ggt ca t cgt t ccaca accaact t aa caacct t gt g act aaat cga gat at ct ggt t t t ccct ct c t t gt at aat t aaat t t gggt agt gt t ggt t tagt t t t t ca t gt ct t cat a aaat t acat t t ct ggaat t g gt t gct t gag ttcatgt t t t act ct agt ct t t caaat t t t ttct t agttt t aaat gaagt gt gt t aagct cgaat gt ct t cct t ct t gt a gagt gt t ccc gct aaggt at aacccgt t gc t at aaagct a ggcagaat aa aaat gt t t t g agaacaaaca t ggt gagt ct tagt t t t t t c t at at gt at a aaaat ggt t t cct t at t t gt t t ct t t at t a at t t ggt t ca acaaat t at t ct cgt at gt t at gaat gct t gt ct t at ct g gaacgt t t gt ct t acacgt t gt gcct t ggg cat act gaaa t aat ct t t t t t gt cagt t gc taggaacaac cacaacagca t t ct ct at t t aaat aggct t gcct t gccaa acacat aaac t cct aact aa cact aagcga gagt at caat t t t ctt aaaa gcgt at ggt g gt t t at aat a ttgt t t t cat at t ct aaaac aagt acagcc t t t agt acac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 tgggaacata aataaatgaa tcagagaaga gaaacatggc aactaccgat catgagcttc Page 327 t acat t t gt c gaat t aat at cat t act cca ct t t cccct t ttt ct ccgac aaagct aat c t cacgtt gcg t caat ct aat tcagaaaaag t at ccaagac t acccat t ca ggataaggca t at aat aat g tccaaaggga t gt gat aat c t gcaaat caa 12689250 Sequence aggaagatag aactattttt aacacggcac tgctaacaac at gat t aaca cgt ggt t ct a t ct t aaaat a at aat aaat a at gacaacaa t at ct t t t aa aaaagacgtg tagcagagac tactgagggt tctatctata at gg Li st i ng. t xt t aaagt t t t t aagt t gaacc cat cat t ggt ccat aaat at aat t at ct t a ct at acat ac t t aat cctt c agaaaagaga t at cccatt c t aact ct ct c ct t ccgatt t aaagaaaccg acct at acat t ct gctt cct 1620 1680 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 300 2004 DNA Arabidopsis thal i ana <400> 300 tt t agat agt aaaat gat t g accaat caat aaat act t cg act ggaacat gt gcaccat c cct aacaat c gt t t cacat g t at caaat t t acacacacac caat t t at at act t cgaact aaat aaaaat aat t aaacaa aat at gacat aat t gt aaac aat t at t at t tggcccacaa t gt gagat t g gaaaaat t gg agt t gaaaga t aat gct t t g t t ct caat ca aaagt gt cac gct act agct t gt cact aga aat gtt cat t caccccaat c t t t at caat c gt cccgt gt t gact cat gt t aaat ct gcgg gt gcaaaaaa t agcacact g aagaaaaacg t t t aaat t t t ct aaaccgcc t t t cct t aag aat aaaat gc t ct cgaat gt tgccacaaac at t gcgat t a aaat ctt agt agt t t t aaat caaat at at g at gtt t t t ga acgt t cgt ga at gt gaacca t aggt agtt c acaggt caca gataggaaag t cat gt gtt a gct ctgtttt ct t caaaat g t t aaaat t t g aaat t at cac gt t t ggaaat caaaagggac tggagggccg t aaaat aaat t aaaat t t gg aagat aacat gaat aaagcg t agt t at aca aagaaggaga aggaat t ccg gaagt acgt a ttgtacgcag cccaaaaagg t aaaat caca gt at gt t t ga tt ggt t t ggt aaaaaaaaaa t at at aaaga gt t caact at t agt at t at a gtct t t t at t t aat cact t t agat t gcat c t cacacaat a at at aat ct t tgccacaaac ct t at aat t t t agt gat t t g aaggt aaggt aaaagt caac at gat t agct cat gt ggt t g t at t cat gcc caaaaacaat cgact t acat aagagat gat at cgt at t at gcaat cgt t t aaat t ggat c t t t gtcaagg tttaaaaaaa t at t at t at t ct ggcagt ac gt ggt ggt ag ct t t t t gccc t ct gt at gat aat at aaat a t ct cgt at ac gcat t gaat a act t at ggt t aaacggt cca aaggaaat gc agcgt t t t ga gt aaaagaat agtt gat gt t ggat cat cca caaaggaaaa agaaaaaaga aat t t t ct ct t aat ccctt c agtt gt at ga ct t gt aaaat aaaaagat ca t t aaat aaat ct ct cagccg gacat t cat a aat agat aat t gt gaaaaaa t at aat caca ttggagagcg t cat t t at gc agcat gt aat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 tacataattc tacttagcga taatataaa aaatatcat a Page 328 12689250 Sequence Listing.txt gtcactatac gaatatcaaa ttcataaaac tagccgtgac ttatctacac at gt t t cgag gat t at aagt at ct t t agt t cacacat at a ttggagaaaa gagagaat at cgccacgt ag gt t t t caaaa aat aat at at cacat agt ag t t aaat ct ca aat aaagacc aaggaagaaa at aat t t t t g t caagact aa acaagaat aa gaaaaaggga at t ggt cggt t t at t t aaat cat ccggt aa accagaaaaa gccgt ccgat caagagaaaa caaat gact a t t t t ggagct at t ct accac aat cgt at t a caaaagct ct t ccgct t cgc ct ct aaccgc ccggt cacca at cat caaat cat g caacgacatt t agat t agat t gct ccgt at t aat caagt a t t t t cctct a at acat gt ac t ggaat t at a t ccaat caaa aaat ggcgcc ccat ct ccca at aacaaaaa at gaaat at t act cagt t ga aaaacct ct t gaact at cga aaacat gaca gcgat gt act t aat t t ct cc acct cagat a ct ct t ct agc agaat t at t t aacaacgaca t t t agt aacc gtat t t t t ac agcaaaat ag t cat t cgaga agt cat t at c t gt t t agcct accaaat at t t aagt aagag at ct t gaagc 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 301 <211> 2010 <212> DNA <213> Arabidopsis thaliana <400> 301 gtgacaacga caacaacagg atgtatgttc gaaggcggta actgcaacca cttgct t ct t atctacgaaa ggtcttacca ttattatcct ccaatgccgg tggggtatag tgttttgaaa cgttacaagc catacgacga acaacaatcg agagacgatg gatggatgga tttgcagttc gatttgatct tcgaggagct tctggaagaa aactcgaatc atttcttcat ttttcttcta ataaaagaga aaatatggtg tttgtatgag ttaaaagaaa ttgtagagga taaatttttg t acat t cat a gt t t at at aa t at ggt at ac ttaacaaaag cccgtctagc tcagttggta ggttcgagcc ccacggtggg cgatactttt gtaatcgtga taatattaac tacgcaactg tcgaaactac acaaatcaaa tcacgagatc cttttttata gaattagact tatcgacact tatttttaac attacttttt acatagatcg gagattcgat tcaaataccc acatagaacg caagat ccag t t cgat ct aa aaaccgtt at gaggt t t ggg t t t gataagg gagagt gaat t t gct t at at ct t t gt t gct t t t t t gtt gt gaat at acaa t at ggt at aa gagcgcaagg t at gcgat t t t t cat t t caa ct t t cct t t t ccat gat gat aat t t t t gga gacaaggt ca aat gt t caag t caacgaggt cat ccct at g t t agaat aag t t at gagt ag gt gt t gggat cat gt t aat a t t gct t ggag t gt t t at gat aaat at t t t c t aacat t gt g ct ct t aacct t gagat t t gt tt ctt gt gaa t t cat at aga t cccat at gg aat gt at cga at aat gggcc aaat gaagaa act aat cgaa t aagat ccat ct gt t act t g ggat t t gt cg t gaagt t gag ggcat t caag t t t aaggt ac aaaact t t gt aggt gat t ga ct acaagat t t gt ggt cgt g gt ct t aaaat cgt gt aaacc t act ct t t t a at ggt t ct ca aaaaagagga cact at t ct t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 ttgagagccc atctcaacaa gttgaatcat cttttttgtg tactctcact tcaccttttg Page 329 12689250 Sequence Listing.txt acaaaaagt c gt t gaat acc aat aaaacaa gat aact aat t t t cct t ct a caaaact t cg cgaaact gaa t t aacat gt t aat at act t t t aaagt ct at tttgtttcca act aaaat t a acacgt ggac aagt t t cat t gat ct ct t cc <210> 302 at ggaagcca t t t t t caaat agaagt gat a gaaaaat at g aaat t t t t t a gaaaccaact aaagt gaat t agaacat aga acact t t t at gat t t t gtta act ct aat cc t t t act t gag ggt ct acaaa cact gat t at aat t t t ct ag aact t t ggaa tggaaaccaa gct aaagt ct aaaat t t t cg t t agacaat t t at t acat aa t aacgat t t t aaat at t gaa t t gt at t aat ctt ct t t t t t t caat at t t t gaacgat t at t t ct aat t t t cgt t t aaggc aaaaaacat g cactacttta ctttgccaac atct t gtttt agacat at ct aaact aat ct cgaggat t aa ggt at t acaa gcaaaat t at t cct t t cgat aaaaaat t t a tgt t act t t g t ct t gt aaat at t t t t ctgt t t aggt gat a gcct at aaat aaat t aagat aaagggaaaa caat at at gt tgaacaaagg t gacaact ac gt gt cat at a aagct t t at t t gt t t aaat t ct ct cct agt ct t t at acat tt att ggaaa agagtggaca at caaagct c cat ct t cat a cct gt aggac aaat caaaga aatttttttt gaagct aat a ct gaaaat t a cgct aacat g gt caaat gga t at aaagaag gt aaaaagct agggct gt aa aagat cgt t g ct gaat at gt aat ct t ct ca 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2010 <211> <212> <213> 2012 DNA Arabidopsis thal i ana <400> 302 caaaaggagc agaggactga gaggt gaat c cgat ct t t t g agcgatagag at t t ct t aaa t gt t t t at ag gcagaaaaat caat gt ct t t cgt cggaaca acgt gacaaa agccat ct t c t ct cat t caa gcaaacgcac t t t t t aggt a t t gagagagt gt ct t ct ggt tttttgcgag at gt gacagg aat aat agag caggt at at a t t t t t agggt act at t acaa cct cagccaa t cgccgt aca t t cct aat cc gaggt t gact gaagt cct cc t t aagat t cg t ggact t t ca caaagaagaa ccct gt t cag t cct gat t ca aagtgagaag t gaaaggt t a t at act cat g t ggat t at t a aacct ggt t t aat ggaacga t cacgt gct c t ct cat ccga cat t cat ct c t t agagct gc aaacgcaat c t gaat t gct a agt ccat t ag ct caat aagg gtgt t t t gt t aat caagaag t ct caaagt c at t t t aat ca ggt ct acaag gt t t t t at t c ggcgct t ct a t ccgt cgct a t ggact at ac agcct t ccct caagaaat at t t t gat t aaa gagat accac agacctt gt t aacat agcac t t gcgaacat gaacat cct g act cgt gcat gat agt t t ca gt t t gt agaa gat t t gat aa gagat gt t ca caccaccat a gaat act t ct gaaggcgat c ggt aat act a ct cacgt gca aaggagatcg cat caccgaa aagt gt t gaa t agacact aa at gat at at t t t ggcgct gg aaat t t t gca at at caaaat tagt t t t gt t gaat agact a gact ct cat c cgaacgagga ct gct caaca agaacaaaac tttttttttt t cggaggt at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 330 12689250 Sequence Listing.txt catgacgatg tttctgtgat tgtgatttca cttgaaggaa gaatttggag t gaat act aa t t t agaat ag act cagcat a t at aat gt aa t cgaact t t a acgt aat t t t t cat aaact a agt t act t t t tggaccacac at ct t t t gat tat t t t t aga at t at aaaaa aagat ct gca t cggt acgct cact ggacaa gaagaact ag aat ct cat t t t t t ct t caat <210> 303 t ct t t t gaag attttttttt t t t t aacgat gaat aat t t a gt t accgt ga tgct t t t t gc t agt t at aca ct gat gaact acact gaaca cagaaacgac tat t t t t cct cacgaaat ac tgaaaggcgg ct cgaacgt a t t at t gaacc t t t t ggagat agct t at aaa ttgaat t t t c agact ct t ac tttttttttt gt t gcat t t g t at aact at c at gaaaat aa agat at gct t ct t t t gact a aggataagga ccct t t t aag aaaaaagaag gct aat t t t a caagtggacg t gagaat ct a caagaat cga gacgtacgag aagagcggag t t t gggagct tcgagaaaaa t aaagt aact t ggat t t cgc t gt cccat at ct act agcaa cagact t gaa gccact aat a aacaaacaaa at t cggaact at t t t ct cct t ct aacaat a t ct agt gt ag act gaggt t a aacggt gat a cgacacacaa aat caat gcg aaagat t gcg t cct ct at ca tg t t t agaat t g t gaggt t t t a acgt t at t gt agct aacgca ct t t at aat a aagt cat aaa at cggt t tag gaat t t t gct tctct t t t t c t caaacaat t acaaacccaa at agat ct ag agaccat aac acact ccaca ct gagggt aa acacat gt at ttaat t t t ca at ct t ccat g at t caat ct t ccat t t t gt t t agt gaaaaa aat t t t gaac ct cgt agt at t t t t at at t t caaaagaaaa acgt t ct ct c aacgt aat t t tttttataga at at acgat t ccgt agaat a acacggaaca at t at t t gaa agacgt aaat ggt caat at t t t cat aaat t 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2012 <211> <212> <213> 2003 DNA Arabidopsis thal i ana <400> 303 t at t t aggac t accaat gct aat t t gggga aaaaaaaagt tatct t t ct t gt aat cat t c t aat gagaca aat t aaaat a at at agt t t c acat gat ccc aggt aaat ca cact ct t cag t t at at t aat aaaaat at t a ggt acaat aa at t t t gt t t g t gt at ct aac aaaat gt t t t t gt t at gcaa gacct t aaga t t ct t caat c cat gaaaaat t gat agct t g t at t ct t gga gt acaat gca tgcaaagaga ct aat t act a at t cacat at gt t t t at t t g caaact at ca t t aagt t t ct ct gct gcaac ggccacat gc ct aacaaat c cgt act gt t a cagaaaaat a agaaaagggt att gcaaaaa aaaacagaac at gat cat gg t gt gt ct at g ttgacagaaa gat gt aacgg aaaaat t at g aagt caat t a t at t aagaca aagggt acaa acaat aat gc atgt t t t t gt t aaacaat t t t t t t at gt t t t at acat aac gt cagaaaca t t at gcaaaa cccggt t cca cgt aat gaac ct gat t ggt t t aat gt t gag aat t gaat t g gctt gaaaaa t gcat t t act tt ct aaaaga ct at gat aca gt t at ct t t c at t t act gat act t ct aggt 120 180 240 300 360 420 480 540 600 660 taatatggta ctcaacaaca aaaaaaaata ttaacaataa atacgtgttt gtct t ct t ct Page 331 12689250 Sequence Listing.txt tctttataga tgtttcactt tataaataga ctgtaagttt ttaaaacgca ataagaaaaa caact gaaat t gaagct t t a act ct t aat a aat t at cat g ggaagaaaat gact gt at at t cat t aaccc ct ggt at aaa agatgggcca ccat aat ggc tgacgagaaa ct t ct cgaaa t ct cgaat cg ttgt t gt t gt cgagat ct at gcgcgat tag t t t t at cgat t aat t gaat t t t ggct t gat gct at aat ct ct at gt act t cat t t t agaa ggct t t agca tttttacaaa t t t at t aat t t at t aat t t a at t ct aagct caaaagaaat at ct cat aaa at agt caagt ccaat at t t g cgt aaat cgc at t t cct gcg at gct t gt t t t agt cgt ct g gaat cgt t at gaagaat aaa tgtgtgaaaa gaat t gt t t g t agggt t gaa gat at aagt t caggt t aagg at ct t at gcg ct t t ct aaaa t t gaaat agc t at agagaca at gagat at t ttgt t t t aag gat t t gt t t t aagt t t t at c gt ggcccaat tacaccacaa tat t t ct t ct cct ct ccgt g t gt gaaat ac t gt agt t t at ct gaaacaaa acgagat t t g aat t caccat at t cagcagc t at gcct t ga cgt t t t ct cc at g t gcagaat aa cct cgt t t t g aat t gt aat t at at t t gggc aat t t at t ga aaaat t t aat agct t ggcca t t cat t aacc t aggagt aaa tcggaaaccc at t cct ccct t ct gt gaaat cgat t ct at c t cacct t t t g act cgaaact t t ggtagcag t cagat gct t t t aat t at ct aact t cat t t ttttgtgtgg at at t ggt gt agcaaagt ca t aat cagt aa t aaaat at ca at at t aat t t ccgt t t aaaa at agt cat gt caaagaaat g at ct cat aaa ct t t t gggt c aaaaat ct ac cgat t caggt agat ct at t t t t t cgat t t t agcagat gat t gagt t t ct t t t at t gat t t aat ggt t t t a caat cccact ct t gat t at g t at gt gcat t ttcaaaagga aacct ct at a at t t gagaac at agat t t t t agat t t at ct at ggcccaat at t t gt at t a cgaaatgggc at t t caagt t caacgagat t aaat cact ct gat aaat t t g t gggt cct ca at gt t at t t a agatcggt t t gt gt ct t ct a t gt gat t gt a at acact at a t t t gat at at 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 <210> 304 <211> 2005 <212> DNA <213> Arabidopsis thaliana <400> 304 ccaaaccgct aattctgctg attttccaga tacattcaga gaggaaatgg aaagctccag ggtttatttt ctaagcatca ttggtacttt tatttgaaag ct t aggt t ca tggaagccct tacaaagttt gaagatgaat gtcgagctat attggacaag ctagttcaaa agtttgttaa gtcttgactc tgcttctgtt ctctctcctt agtggcttgg acatgcttca gcttcacgca t gt t gt acga at agcacat c t t aat t gaaa t t acaat t t a t at cggagca gcat gt aagc t at t t t t gt t gt cgcagct g Page 33' gagaat acaa t act gat t ct aaat at t at t ct t gat ggct caat cat at g ttccat t t t a t gct ccgcac at gagacaga tcagcaaaga t at accaggt t cct aact ga ct agcgacaa t t ct ct t cac cct t gt t at c t aat cat t t a caccaagct t 120 180 240 300 360 420 480 12689250 Sequence Listing.txt ctgcagctat atgcatacga gaactacaga aaaccaggaa gattctttga cat gagaat g t t ct ct t gat at t t ct gt gc gat cagcct g gact t t ct t t aagcaat gat acct t agcat agggat gt ca t t gcagt t ct ct t t t cacat cacacacagc gacct gaaaa gct t t cacat gt aaccct ct t aact cacga gagt agt agt t aggct caaa cagctggagc at gct aaaat gt t aaat t ct t t t at ccat t aagagaat at tgtgtacgga ct t cact gaa ct t cgaat ca ct cgagccct at gt gt ggat agt cat ct gc aagt t acagc cgt t t gt cag t agagcat aa act ct t t t ca cgt gcct t gg t t caaggt aa gt at at at ga agat ct t t a t gtt aaaacc gagt ct t aac tgtct t t t at t aaat t gt gc agagagagac gagat t caca cgccagtttt t ggat at t t t ct t t caaat t tggt t at t t a gaat at at at aaacct t ct t at t gt cct ct acggcggaga t ct ccat gat acct gt t t at ccaaacccgt t gt aacagt g tgatgaagag cct t t t t t cg ggaacaaggc aaggat t gaa aat act t t ct t t gatgagag t at agaagaa t ct gat agt a cgcagactt g t t gcct gt gg aggt gt gaag t agagaat aa tggcaacaac gct gt at agt t aat t cat t c aat aat ct ct t ct t t aat ca t gact ct t t t at cct ct cag cagt t cct ac aat gg cagaacat at at gt t cccat ct cggaat t c gaaccaggtt aaacccggac t ct t cagt t g aaaat t aagt cat t at caat t gt gaat ct c t t t acaat ct agcagaagaa gt gaact ct c t cgcct t gcc t gat caat ac ttct t t gct g ct gt ct t t ga cat agt t t t t gt t agaat at at t gaacaat t t aat cacat cat t t gagt t at cacaggt g agt ct ct t at gcgaagtttt acaggat t ga at act aat ct aact t at gaa t t gct aat t a t at t t ct gaa t ggt agt aca ggt ccgggt g gaggt t gaat t t gcact gat t cat t aggt c agcaacact g gagaaagaaa t t aact caag t gcaaat cat agaat gagat caaagat t ct at ct t ct aca ct t t t t t t t t t at t gaat t t gt t t t at t t a t gat t at at a t cacgt gact at t aggat cg caacagatt c t at agt gt ac at at gt at gc gt at t t gcaa cagct ggaat t ct gcagaat aaggt gaat g gt t agt aat c aagaat ct ct gt aagat agc aat ct t t t ct aagt at gaac aat ccaact g cgaat at cgc aacgccgt aa caat aat ct c at gcaagaat t gcat cat t t gt ct acact g t t t aaggat g t cct t ct aat t t t at t t t gg at at gat t t t cacgtgaggc ct at ct act g gccggagaat 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2005 <210> 305 <211> 2008 <212> DNA <213> Arabidopsis thaliana <400> 305 gatttattta ccgtttgaat tcggtcagaa ttcgatccgg actttgtaac gtttaatatg actgagcaat actaattaag ctgactgacg gaacaatacc aattaaccca atttatgat a gttatgttca attttttttt tttgtttatc gttttaaaca ttattattgt aggtaaacta cttcataata ggattagatc aaatatttct aatgatgata taatagataa aaaaactaaa aaatatatat ttttaataaa ctctattatt tttcagctca atttagtttt acttaaatta Page 333 120 180 240 300 12689250 Sequence Listing.txt aaattatttt tgattcaggt cctataaact gaaacctaaa taatttgatc cgtaat t cat caaat gaaat at t t t t cat a cat t t t gcca t t aacgact a gat gcat t t t aacaaaat ct gt t cgat t t g agagagt caa at aaact t gc gt t ct t t agc t t caagaat c t aatcct t t t t agat gt t t a t gagct at t g t act t agcat t ct cgagt cc at aat t ggt t aaacgaggt t ccct aaaaga gcagaat ct t at gt t t t gga gt t t gat t t a gt t gat t t t a gt t ccgagt c gggcaatttt aaat gaggt g aat at t aaac gagagaagcg ct t t gt acaa ct t t t t t cca acaaat aaaa at t t t t t gcg t aaaaagat t aaaat cgaaa at t at acagt aaaaagct cc caact t cct t t t t cgact t a t gcaact t ca t gagct agaa t t t ggatgag gaaagaaaac t cct at cacg at acat gaac gcgaggattt t ct t gcagt g cat t gat gat caagat caga act aagaaga t agat at t ag gt t t gt t t t a act t aaat t g at aaat ct t a ggcat gccca gacacgat cg agacgagaga at t t at ct t t aagct t at ga at t t t caaat acact aaaat gaat aaat t t gagct acaat t t cagt t t gg acaagcgaat t t ct cat aat gaagt agaaa t ct aggat cc agaagct t ag ccaaggt t ac caacacagat acagt agcat t ggt t gct t c gcggaat at g at cat gt t t t ttgcaaccaa aaat t caagt at aagaaact t gt ct t t cga t aaaaggt t a ct t t t t t aat aaat t aaat a t agt t gacca ttgaccaaag cgat aat g t aat t ct t aa at gct ccaca gcaat aacac gaaaacagt a t acat t ct ct ccaaat ct t t t ccggt t ct t cat agt t gag ct t t gt agag gat ccacat a aagat t gcac aaagt gat gt at agaaaggc t t ggt aact t t t gagt ct gc cgaagaact t aaggt aaaaa aaaacgaat t t at gcaagga gcaccat gaa tat t gtgttt t t ggtaagaa gacacat t ct t t at at ct aa at t t t aaat g gct ct agt gg aaagcagggt caaaat t gt t acaaaat aca at t at gt gt t gt t t gat aca ct t aaaaaaa ggt at t aat a tat t t t t cac agagccgaca t t ct t t caat ct cat aagct ccat agccaa ggacaacat g t t t t cgaaat agcat ct at g t t t t agcat c t cagact ct g t at t gt gt ag cat act t t ga t gat t gcaat acaaat gt ag gaaaact t ga t t t gaaat t t t gat t t agt a at gggt at at gggt agat t t tagaaacggc cgt t ct t aaa t at ct cct aa act t ggt gaa at act at gct t t t gt aaact ccat act t ac t t t cggt t t g at t aat at t g aaagct ccac acat t t t t ga t at aagt t t t aat gat at t g aagacgcaat t t ggat gt t t gt gcat gaca agagcacggg ct t t gt act c t t at gt t t t a at t gt aggt a t agt acccga aagaagcaac acat gt t t ga gat ggt aagt gt t gt t t t gt gagt t t aaat t aaat gggct acgat gt t t g cgcaat caga 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2008 <210> 306 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 306 aagtcattgt acaaaacatc gaacgaaaat tataacgtaa gcaccataaa atgagaaaaa Page 334 12689250 Sequence Listing.t atgcttt tgacatataa gtct t act tacacaaact atagtgattg acga at t aaat ct a at t t ct t gct gt t at gaaat t t gact t t gt at gagccat c at agcat gt a act cct act t caaat ct t ca aagt aaaaga t t caaaat gt cct gagt aat ct t accaat a caaat t gct a t t gtgagaac gat cgcgt t a aacct cat ct aaat ct at aa at acaaat t a aaccgacaat aat aacaaaa at t t t t t t ac aaat t t t t t a t at gt gagt t t t at t at gt t tttttgaaaa caat at aat c cccaaat aat t ggaaat at c aaaacat t ct cct ct t cat a tcgaaagaga t t cgt t acaa <210> 307 t t t t t t t cgt t aaact t cca at caacat t a t t t ct ct cac t t ct gat t gc t ccccaacat t t at gt gt ct aaaaat ccga aagat gaaat t t t caagat a cct t at t at a t gacgct ct c taaccacgaa gat aaagaag gt agt gaaga t gat agt t ag aaat caaagt t t t act t t ag t gt cagt gac t t aaat t t ct gct ct t t aac gt t at gt t ga t t at aaaaag gct cgt at aa aaat cgt cat t t agt agt cg at aat t t t cc aaat at agt a agagagt t cc agagagaagt gacact t t t c cggcgaaaag at t t gat ct t t at cgcat ac t gat accaaa cat t at at cc aat t at at at ccat gt gt at tggt t t t t ag aat aagt at c gt cagt agt g acaccaat ca aat cccagag t aaagat gaa tgtgt t cttt at cggcgacc t agaat cgat agagaagtgt t caat cgct a aagagaat t a t ct aaaaaca t t cacgaaat aaaact ct aa gaagaat aac t aacgt gagt aaat t t at t t aaaaaaaaag t t gt t gt aca t ct t t t gtat at aat gt t t t aaact ccaca cct ct cagag t ct ct ct ct t at g acgct t at gc cgt at t t aaa act t ccgt at cgagaat aaa at at cacat t ct ct t ccaac agccacacaa aaacaat at c at gacaccaa at t gaaggga aagat ggt aa gagaaagaaa agagacaaaa aagggacaca agaaaat cgt gat ggt gagc gt gt cacct t agaaaaaaaa aat at acaaa at at t t t t ct t aat t agt t t acaatttttt t acat aaaac act t aat at t t caat at aat acaaacat ct t caaaaaaat taagagaaaa aaacct caaa aaacggcagc t ct ct at ct c aaat t at g accat at g cat aat ca tt at gacg gt t ct cca aaat ct at t cct caac t caat t t g acaatttt tggtgaag aat cagt g ccgt cat c ccaat t ga cgatggag aacaaaac gat cgagt t t acagt t gaaaaaaa gat aaaaa aaat t at c t t t t t t gt t t t ct t t t t cat gt ag gt aacaaa ct t t t t t t acaat at a aat aat t t aaaggaaa caaaagca cacat caa ct t ccaat :xt cg ttaacagat t tt tctggatagc ct tctgataaca ct ttaactatct t c t cagt at agt ca t cat ccaat g ct ggat cat at a t a gagagt t t cc at tttcttctat gg gttgaggaac aa aacattcgga at gatcaaatta gt tttcaaccaa tg gagatagagt ag cactttgat c tt gggaatccag aa gcggat aat a gt aactccgaca ga gtcgccgacg ac caaacaaaaa aa at gaaaaaat ga at cgatt gt a ac ataaaaaact tt ttataatttg ac aaaacaaat t t g t caaacaagt ct tttctgtcta at ct ct acat gt aa agaaagtaat aa cgcaat t tag cc aacccaaaca ct ct ct agggt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 Page 335 <211> <212> <213> 12689250 Sequence Listing.txt 2004 DNA Arabidopsis thal i ana <400> 307 t cccat ggt c ct cgt t cgat gt act t cagg at aaggacga ct t ggat t t c at t t caacaa aaaaaat cca at cat ct gcc t gct agt gca t t at t t t aaa t t aaat t t aa t aaagaaaat at t ggat at a at at aggt ac t aat t t t gat ct aaaact at caaat cct t c gt t gat t t t t aat t aaat aa aaat at acac ggcccat aat gt t gggccca acacacgt gt ccat t ggat c ct cat t gt at aat cagat ct gt t cgt t t cg gat ct cgat c ccgat t t gt g t t t gt ct ag t cgcgt t gaa agaaaact ca ct acaagt t t t aaat t ct t t ggcct ccat a aagaggtat g t t t at t t at t acagat t t t c ggaat t t gaa t at gt t t t t g accaaaaaaa aaaaaaaaga t t agct aat g t acat cat t c t gaat t at t a gat t t t aggt gagat t t aga aaaat t aat a aaat t t t t aa cat aaaat t t aat gt aaaaa accgt cct gt gt ggcccat t cagat t caat t ct t aact ct ct t cat cagt t ct ct t caat tagt t t cttc t cat cgat t c t gt aggat gt at t cgt t at c gaat t t t ct c t t cgat aacc t gt ggt gagg t caat act t a gcgt t cagt g acagt t t gt g tttagaacca t gt t aaaat g t ct ggacat g ct aaggt at g at t t aaaaag t t t t ccgct a t gct acat ga aat t t at t t t aagaacat at aagt t gat gt t t t ctgtat t agaat caaat t t t aat aaca at aaat t t t a gt ct gt t t aa acat cat at a t at cagat t t gcat cagt ca acgcct cgaa ct ct t ct t at cgaaaaaaaa t t ct acgt gt t ct gt t ct t a t aaaat t tag cgt gaagaac t agat t cgt c ct acat t t ga agact t ct ca gt ggaat t t t ct t t aacaat t t gt gat agg aaagattttt aaaat at at a ct t at t t t ca aat t t aaat t aaact t agcg cgggaat t t g gatgtttttt t ct aat ct ag tttttggtaa gat t t act gt t t t aact aaa ccact aact a ct caat t t ca cat aaat t at gat caat t at gt t at ct caa caaat aacag t at ct t cagc aacagttttt tccat t t t t t gaaaggt aaa agat ct gat c t cact gat t c gt t t cggt t t at agacgagt acct at gaag t gat caaagt gt t t t t cttc t cgct aact g gccgt at cca agcct t cat t caaaaatttt t t agat t gt t aaaaaacat t ggat at at ag ggat at aaaa aacct gaaaa t t gct aaagt aat t t gct t t gat at aagt g aaagt ct t t t gaagt ct t t t t t t t cagt aa t t t aaaaat t t agat t gaat agat t t aact gt t gt aat ac at ct caacac at ccaacact at t t at t t at caaaccactt t ct ct ct ct c t t t gat t gt a agt gt gt t t g t gt t t ct gct at gt agat ct aagat t cat t ggcacggaag t agt gt ggt t t t t t gtgtaa ct ct gct aat aaat t aaat a gat t at caaa tttaaaaaaa t t agct aat g at caat t t at t aaat at at a t gct gat t t t at gagt t t aa cct aggacaa agt t t t t at a caaat t ct at caaat cctt t cagt aaaaag t aaaat ccat acaccgct ct t aaat ct aaa t gt aat accc t agcat ggct t gt caacct t cat t ccat t t gcaaaat t cg t ct aat cat c t gt t t ct gga at at ct aaat t t t gaacgat t act t cggat gt gt t ct t aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 tctagatgat taggttattg tttcgactca tttgtttatg cctattttct ctatgttctt Page 336 12689250 Sequence Listing.txt aatcggtgaa gaaatgtatc aatgtgtgta tgttttgggt tctgattttg taggatttgc tctagattgt tgaatcgaag atgt 1980 2004 <210> <211> <212> <213> 308 2005 DNA Arabidopsis thal i ana <400> 308 gaggaat ct c acccaaaacc at gt at cat t gt t at gagag tttggagaaa t aaat gt ct c gat at t gt ga act accggaa cccat t aat t at agat t t at agagagtttt aat t at aagc cat t t t t gt a cgtaagggca gt agct at aa t t cgtagccg gt gt ct ccga t t t agct ct a t t ct ct agct t aat cct ct g t gt cgat ct g t t agat t t ga gt t t t gt t t c gat ggt t gat at gggt aaag tcgaccacaa aggt t cgaga gacaaact t a t t gt ggagt t gaaaacat at t ct caact gt t t aat gat ga at t t at t t at ct t t aaccat t t caacgat t t t t at t caat aggct gt aat caaagccaac t t t at aaaaa t at t t aacat at t ct t t gt t aat t agt aaa gt accct cga caagact cct t ct t cat ct t t at t gact gt t t agat ccgt at t aat t gat t at ct gct ct t at acct gt t agt ggat ct g gtat t t t t gt agaagt t t ca ct ggt cact t aggaggctgc aggccgagcg t gat at t t gc cct at at aaa gat t gagt t g t t gat gacat aat ct ct cat agcaaaccaa aat t at cgt a aact ccat t g t t cgat t t at t cat gacggc aat t t at aca t t caaat t ga gggt aaat t c agt agaacca gtcgaccagg t t cagat t ct ct t at gat t a gat t at cgct t t act cgt cg agcgt t t agt ggt t at agt t gt ct cgt ggt tgt t t gatta t gat t ct gat cat t aacat t gat ct at aag t gagat gaac t gagcgt ggt gagt at aat c t t t cat t at g act at t gaaa acacact cct aaat t ct ccg ct t aaaaat t at gcat at t g t gt gact gca caat t t at at t agggt t t t c at t at acaat aaaaaaaaat acaaccaaaa caaagagaaa at t agggt gc t act t gcagg t t gt agct gt tat t ct t t gc at caat at t g t t t gat at cg gat t ct gat g t t gat at gat t at t gt t gac gt t t cgat t t gt ggt cat t g ct t ggt ggt a aagaggt cct at t accat cg t t t gaact t g agagt aaaat acat at ct t a t t at gat ggt t t at t agt t g t agat t t t aa at t at gt aaa t t t aaat at a act agt at t a cgt cacct t t t t ct t aacca gt at gagaat aaat agaaag gcgaaaaccc gct ct cat at t t agat at t t t t agggt t t a t gt t gt t at a t t cct at t ga t ct t cgcat g tat t t ggttg agct caact g gt t t t ggttg tt gt t t t t gt gt cat gt t ga ttgacaagcg t caagt acgc at at t gt ct t gt agat t gt t gt t t gt t t t gat aagt t t c gat t caacgt aat aaaat ct agt t aagat g at aaaat ct a tgt t t t atgt at t t aat t cc t cgat cat ca aacaacacat t t t gt ggat c gcccaaaacg t agacacct c t t ct cacat t t ct ct ct t t a gat t ct t agt ct gct t t t ga gt ct gat gt a t t t t t t atca gt gat gt t cc gt gat at gt g t t gt at agt t t t t gacagct tt ct ggaaaa t gt cat cgag at gggt gt t g at ggaagt t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 Page 337 gagaccacca aacat gat t a ggaggt t t t g t t caccct t g t cat cat t gt t t t t t ct at t agt act act g ct ggt acct c aggct ggt at gt gt caagca at ct gct t t t t t cgcct aac 12689250 Sequence Listing.txt cacagtcatt gatgccccag gacatcgtga tttcatcaag ccaggctgat tgtgctgttc ttatcattga ctccaccact ctctaaggat ggtcagaccc gtgagcacgc tct t ct t gct gatgatttgc tgttgtaaca aggtaatata actttcgtat gtgattgatt gattcgtttt atttgttgct aacttgcgtc agat g 1740 1800 1860 1920 1980 2005 <210> <211> <212> <213> 309 2005 DNA Arabidopsis thal i ana <400> 309 tcaagacaaa agt t at gt gg t t ccccct ct t at aat gcat cacgt cccac t ct at ct cca agccaagtt g aaggt t aagc cgagt t t ct g ct cct aact a gaagctgcgg act t t gcct a cat t gat aca aat aacaaca at cct at aac cagagagacc t at cacaagc acat gt gcaa gt caagt t t t taacaccaga accaagt gaa aaat t at aaa aggcact t gg aggat cagt g acagat t at t t caact t t t t ct t agaaat g t ct t ct cct t t cat ct gct c t ct t ccaat g agt t gat aca t accat t cat at cat t t gct cat at at cag ttttccttgg gcat ccccgt t caagct cca gt t cacaat g acaat gacca ccat caagat ct t gaat ct a t cgat cacca caacat at gc t t gaat t t ac ct t gcaaaga at cacat ct c aacagagcaa gaagactt ct ttttgttaga gt t acct gct cct gagcgt a t gct caccac t gcaaggat a gat t act t ga aagcgaat gt agt cat acat aagccgaagg aat gact gt t aggt at ct gc aaagt aacag at gt t gt cca t ct at gat t t t gt aacaat t at cact t agc t cat acgaaa t t gcaaat cc aaat t aagac gt t at aact a aaggat gt aa at gat aagcc t ct ct at agc t gt ct t cct c ct agct gcac act gt cgat t aagt t cccct t t t caaagct aaat t t at ac t gcact t t t t aact agat aa act t cacgat gt t t at cggg t ct at cat aa at acacaaac acacagaaga t t acgaat ga cat caaggt a t cacgagt at at gat at t gg t ct cct t gaa aaacaaaatt aact t cct ca at t at cagat ct t gagcat a aagaaaagtt t t ct ct ct cc cat agt cgaa at cat gagaa act gct t ct a gt ggt t t t at cct t cccat g aaaaaaagt a caagt at ct t t gaccat ct t t ggt t caat a aaaaggggtt aggt aagaaa aaaaagacaa aagcat aaac gaat t t t at a gccaat t t gg ct caaaagct t t aaaat t aa aagat t t t t c t cgagt gat c at accat t gc ccaaaacat c aaagacacaa accaaat aaa t cgt t at cat gaagacgaag gagcaagt t a caggatgaag ct act agt t t cccat t t gt t t cagct gat g ttaccaaaga gat gcaaat t agat t cgt t c t t gt aaaaat gaacct gt ac t gaagaggt t gat aaaat aa gt t aaacct a aat ct cat ac agt at gat t t t gt actct t t gct gat agt t aagact caac aagaaat ct a t cct aaaat c aact cat caa cat agcat t c cgacat t cac ct t gat ccat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 ctcagaaaca aagaatttaa cttggagaaa aacccagatg agactaatcg atcaaacaaa Page 338 cccgt aaat c cgaagacgaa at ct gagct a t acgt t gact gt caacat t g t t t aat gggc t t t t at t t t t t aact t ct ct caat t t gct a t t aaat gt t a gaaacgt aga ct t caaaacc t t gaaaccct aagcagt t ga ct t t t aaact ct cat at ct g cacact gact caactt agaa 12689250 Sequence agt cttt ccc gaact gt gat ggtttcgcgt taggtgtgtg at cagat ct a agct acct ct gat at agact agacacctat t ggaccaaca ccaaccaat a aat aggcct a ggcccat t at tctcttggtg gtgatgagta ctgcgaccct aatttgtgag aat gg Li st i ng. t xt t acaacaat c t agt gt at ac gt t aaat ct g gt at cct cca gaaagaacgt aaat acat gg ggt aaact gt ct ct ct t ct c t cacgagt cg cat t act t ca aaccgt ct at ccgctgggaa ccacat t t ac at t t gt t t t a gt gcgctt at tttgcgcagt 1560 1620 1680 1740 1800 1860 1920 1980 2005 <210> <211> <212> <213> 310 2019 DNA Arabidopsis thal i ana <400> 310 ggat t t ccca t at t t gagaa gt t t act t ga t at gt gat ac att at gtt ga t gcct ct gt t at t t gaacaa t t t aat gt ct ct acagct ct t cct t gaacg gggaccagcg act at agct c gct agt t gag t ct ct acact gcct agt t ct ctt gaacgct agagt gagt c ct cggct aca gat gatt cgt acaat gcct a gt cccat t ga gtt ctt cagg aaaagagt t a tt cgcgtt ag cgt t act at g gt acat act c gt gat act t a agagctgttt t cacct gt ct ct t t t gt agt ct t t ccggt t agacagt t t a t gt agt ct t g at at t acct c at agct ct gt t t agct ct cc ct ccggt t at aact t t aaga gt aacgt aac cct t agaaac t at gaaccgt taaccaaaag t ctt ct ct gt gct aaaat ga ct t t at ct gc t t t at aact t att cat cct t tt gt ct att g catt ctt aaa tt ct ctt ct a ctt gtt agag at gaccaat t t gt gaat gt t t t agacagt t t t ct ct t caa agt ctt gct a tttt gt agcc gaccaat t t g aagaacct ag t ct at agct c t ggacct aac ccat t t cat t cggt t caat c agct ct gtt a at t t ct ccat at ccaaagt t t at acagt ct t ggt aact ct cat caat ct t cct ct gt gt c t agat at gt a agtt agtt ga t gtt gt agct cat gctt aag agt t t t at at agt at cct t g gat agtt agt tt gct acaga t t gt agct cc t at gt ggcat t t t gt t t t gt aacagt cat t t at ccgt t t c agat t at gt t ctt ct at cac at aat cat gg t t t t ccat ga ggt t cact gg caagt t t agg ct aat gcacc t cctt gct aa gt ct t gct ag gat att acct cct t gt aagt t gt cgaacgt ct ccat t t t t aacgct cacc t ct t t agct c gt aagat ggg t t gt aagt ag aaccagat t g tcagaaagaa ggct t t gaaa aaaccagccc t t aat t t t ac agtt at cggg t t t act acag t gt t at gt ca agt t t ct gt g tt gt t t gaat accct agact at ggt cat gc at agt t agt t ct t aaaagt a agaact t act at ct at ct ct tt gt agt ct t ggtt atgaaa t ct t t t t gt a at at t acct c aactt aggat caggctctgt ccagt gat t g tcaagccaca at t acat t t c caaatt ctt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 339 12689250 Sequence Listing.txt atgaagttta aattatactc acattaaaag gattattgga taatgtaaaa at t act gat t tttgacaaca t at aagat at caagaaggaa agggaaaat a cgcat at t t a ct ccacacaa aacgt gt aca agcctggcgc t agcct caac caaaccct ag t t agat ccct t t ggaaaat t acat at aaaa tttttacaac aaactttttt act aaccgaa ggaagat caa gt t gcact aa aat t agggt t cat agat ct a ccaaaact ct ccgcct t at t t gt agt t t cc aacaaat at t t cat act ccc aacaaccaaa t t t t gt caag ct ct ct t aat t gcat t aaaa t ggt acct ct t t acct caca aact ct cat c at at aaagaa cgt ct t ct t c aaat ct t ccg ct t t gaaat a at t aaaaaga aat at t t at t aaaaggggag t aacat ct t c caact t gcac cacaaaccaa accat cgaac gaccaatttt at ct t t t cct gt t ct ct agt at aaaaat g gaagaaaaag t t t t aat gt a t t t t t ccttt at t at gt aaa aaat aaggaa gtggaaagag t caaaat act at t ct cgaaa t gaccgt ccg t cgt t at t gc tttttcctca at t ct gaaca cct t t t t cct aaat t ct gaa tttacagcaa cagat aaaac aat t at gat c agact at acg gaat aat gcc cat t t t aaac at ggaaact c t t accaaat a gt ct ct gt t c 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2019 <210> 311 <211> 780 <212> DNA <213> Arabidopsis thaliana <400> 311 aataatatgc aagaaacgta catgtgtgat aaaaaaaaaa aaatcatctt aaacaatcga gaaaaaaata ttttagttta ttttaagtgt tatatttgaa aataggtaaa ataggttctt ctaattcggt gtcatccata caaaaaacaa at ccgct aaa at gt aaat gt taggcccaaa ttaaatacat caccttctcg acgctaattc t ct t caagtc ggctccttta ttgctacagg cgcctccaat tagggtttgt ttcgacctca aatttaaatt tcggtgtttg tgaatctaaa atacgtttta acccaatggt tcgtctttta aaatagaaag tttcgttttt gatgaatcgt gatctgatct gatctctctg ctttatatca t gt gt ggt t a act t gat t aa t at aaaaaat aat at gt aac at ggcaaaaa t t cggcccat ct ccact acg t t t gt t gct t aat t at gcaa t ct agt t gat gat ct t ct gt t ggt gggt t t gt gaaaaagt ctgt t t t t t g t t cat aagcc t t gat aat t a at gt acat t t t cagaat gga tacacaaaaa t ct ct ct ct c ct t cgat t ca ct t t gggat t t agcat t ct t t gt cat gat c t t cct t act t t t cgagct t t agaat t caaa aacaaaat t g t t t t agt tat t t gcat at t t t agagaat t a tctcagaggg t t gt ct ccc at t ctt cttt at gt gat tag at t gt t gt t c t ggt t t t at g gat ct gat ct agaaacaat g 120 180 240 300 360 420 480 540 600 660 720 780 <210> 312 <211> 2002 <212> DNA <213> Arabidopsis thaliana <400> 312 aaaggtgcta agtgtaagga aagacatttg gctgaagtta gagagtccat aatgttctca Page 340 12689250 Sequence Listing.txt ttggctattt caaaaacggc actaatttta gcctccacta gctctggcgc aaacaaaact at cacat agc gatggcaccg t ctt caagac acat ggt aga agaat gcaat ggaaat ct t a cgggaaaat g t t caaat gca t ct gaaaacg caaagagat t gacgacaat a act aaacct c gaaaacaat g t gt t ccat t t ct t caacct c t aat ggt gac cacct aggct cgt t t t t t t c aat t gat t t t t ct t cact at aagaagacga t t cgt t gct a gt gt t ct gct t t at t gat ct cagaaagt at at t t gat t gg aact ct cgaa agat t t aat a t gt t ct ct at t t cct t gagc ggcagct t ga <210> 313 caagaccatt aaacccgtcg ctccaacacc ttctccatag caatagtata actctccaca tagggcagcc acataaaagg gatacagctt tctaagagac acttgagttc caagatttca caaagctttc ctccagcaat caattccaat caatatccca aactatctag aacaggagct tataatgata atgatactta tccat t gttt agacaaaaca tgagcagaag caacgcttca gatttctcaa accttcacca aaagggattc cgagaaaaaa tcgtttaaca gagacatcaa aaatgcaaac gcagccgaac gagattcatc ttcaaaatta agatacaaaa ccctaacaga caatgaaacc aaaatccaaa ttttttttcg gcccaccggc ttaaagctcg taaaccctag ttctctctgt gagttttcgc ctgtctcgtt ttgtgtataa gctagctcct gtagattttt gatataaatc gtccgt t gat cgtccttgga atgagtaat a ttttgataac atccttactg actttgattc tttaagagtt agagtgatct ttgaaatact at ct t t gaga t ggt ct t t t g tgtttaaagt ttcatcattt gttgaagaaa tg at gt t caagt t aat cat cat t cggt cgt gc ttgcaaacgg ttcaaacaaa agt gt at cat gt aat t t t ca acat ccaaag t t at gt gaaa gaat caaact ct agt ggcaa ggcaact cac ccgat t t t gc cat agacgaa t t t t gct aaa acacagaat c t at ccact t g at aaaaacag cccaagacat ccaat aacga act t ggat at t cct ct ct ac gt agggt t ct ct aagt t cct t t gt t at t t t t t acact t t c t at t acgact t gagat t gt c t caagat t cg t t t t t gaagt gat t t aaagt at t t caaaga at at ggt t ag t at gt ct at g at t ct t cgt c ccagagaagg t acaact aca gt acaaat t t cat cat cct t t caaagacct t gagat t t gg t cacagt t t c t gat cct gt c t gt gaaaaaa acaaagaaac aat cct t gag cagct agagc tgt t t gatta t gat caaat c ttacaagccc at agt t aaac act t cagt ct t t gt cgt ct c tgct t gt t t t tgt t t t gttc gt t at gct t t agt gat gaaa gt t aat gt aa acaagt ct ca tatctgt t t t t t gaggt t t t gt gt ct gat t ct t ct caat c aggagcattt t ct ct gt aat cacgaccaaa gt t aaat ct c aaaat ct aaa ccaagcact c acgcat at gg at ct t caact gaaaacat t a caccaact t c t gt t ggaagt ct cct cat t c aaagt cagaa at accat t t t at t act aaca t t agat at ga gt aaaagaat cgagccggat aaagaat aaa t gagt t t agg agagccgcgc t ct gat t at c ct ct cat cgt at t t cgagt t at t gat t t t g ct t t t cgttt t gt gcat aca cct t t agt at act ct at t t a at cat t t gat gatgtttttt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 Page 341 <211> <212> <213> 12689250 Sequence Listing.txt 2003 DNA Arabidopsis thal i ana <400> 313 agaaat cat a cct ggat cct tt ct aaagca gacggcggag aaaaat agct ttagaaccac ccgaagt t ga aat agt agt g aagt cgcaac t ct t at t aat caat gaat at aat gcaat ca aagat aaat g at ct t at cgt aaat cacact t t agt ggct a cagct t aaat at gt ggat t t at t acagt gt cct t ggat gc t aagcat gt t acgt t t t gt g t gat gat aga aaagt t agt t t t t cccaact t aat gaat gt acgccagat c at t agt gaca cat t act gat caaacggt t a act acgaaga t agt ggcaaa t cacaaagct t ggt aaaat t gaaat at t t g aaat aat t t g gtgtacgagc t gagt ct caa t t gt t agct t t t t t t t t t at t t cat gt t ag tttttttttt t gaat agcag gt cggt t t t g cgat gcat t g t cacacat t c gt gat t t tag at acat at aa t t agt t t gag caaaat gat t tacaagagcg t t t gt t t t ct gt aggaacaa aaaaat t t at t agacgat ca ct gt aggt ac gcat ggt gct at ct caact g at cat gaggg aaaact acaa aat t at cact t aacat t aac gagt t aaagt ct t gtat t t g ggt ggat t aa aat ct ggcgg ggaat t t t gt cgt gat agct gagccacgt g aagcccgt at t gt t t t t t t t gaggacat t g tatggagccg ccacaat aaa gat at gt aag gacggctat t t t t t at cat g ct gat gt t t c aaat aat aat aaagt at ct c gat aaggagt t agggccat a t t aacat t gt at aat t t gt a t cact agt t c tgaagagaga act cat t aat t at ggt acag gaggat cat g at t at t aaaa aaat agct gg gacccaat cc aaaat t t t at at t ct t act a act cgct t ct gt t t gt gagc aggagaacgg t aat t t t aag cat gt ct t ga t acgagcct t at gt at ggcc t agt t t at t a gt aat t t t cg t gacgcaaat at t aat aaat gcgccggcat aaat aat cag t cat gaggct aacat t t t gg t t aaagat t g acat gagcct t ggaaaat ac gggcccatt c ct t t t ctat t at ct t ggcac gat aacgat t aaaaacagaa cat t at ggag at gt t ct t aa t t t t gt t cgt acagacat t g aat ggagcat aat aat t aga gt t at t gt ca at t cgaggga ccat t gat at cggaggaggt ct agcgt cgg t t cgagagat gat ggt t caa gt t t aaccgg cat t t at t gt taggcagacg ctt gt t t t t t gggggtggct t gat cgagt a t aggt ggt at t ct ccaacat t aat gggt cg t t t t aagt gt aaggt cct ca t t t t t at cat t aggt ggt aa cagaacaat a t ggcaat t ag aaaccagt gc at caat cat c t t at t ggt ac aat gt cat ag ct t ct t at cc aacagt t agt at ct at t t t t t caaaact ct t t t at t t agt at t t gt cat c t ccaagagt t at gcccaagg aaggagt gt c cagat cagac aat gaggt gg cagt t cacac gcct gcgat c ct cgagct aa at aacat t t g t gt gt t t ggt aggtggggca cat at gt cat gact ccacgt aaaaat t cca gt ct t t t t t t t t ct at t t t g aat cgt gt aa t aat t act t a t caagat ggc tagcaaccac t acact act t t t t gt aact t at aaat aaaa t at ct ct t aa t cat ggt aca tggctggcaa aact aaccaa gat t agaaaa gt ccaaagt c aagt at gat t at cgt ct t t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 ttttttattt ctcaactttc tttgcattac atcttccaac tatataatta accactctac Page 342 12689250 Sequence Listing.txt tttctcaaat cccacactca gagaagaaga agaagaaaaa aaacagagct ttacaatttc t ct ct acaga gat cgaagat at g 1980 2003 <210> <211> <212> <213> 314 2004 DNA Arabidopsis thal i ana <400> 314 gat aaggt t t ggt t gt caca t ct aat t gt c at gat t ggt g at gct gagat cgcgcgccac gt ggaat ct c t gat at acca agt aat at ca t t act t at cg t aat at t gt g t gacacct ca acaaaaat gt t t t t gacttt accat gcat g at aaggt t t t aaaaagt gga t ct gaaaaat at at agt at a t aagt t aaag t t gacat aaa cact gat t ga t at aaagt ct at agacgcat aacgt t aaat aaagaaattt agt ct gaaat at at caagca gccacat ggc at t t gt cat t agcaat t t t c gggacaacaa ggat t ggacg ccat t cacgt gacccacccc acacaacaat act t aaat gc t caat t aagt t cat aat t t t aacat act t a t t at aat gt a ggtgaacaag at t agt t gga ggagt t t t gt cgt aagt t at cgacaggct a t t gggt cggt taaacaaaaa aact aat aaa t act t cct at tgt t t t atga gt t caat aga agaat aaact act ggct aac t aaact aggg at cat t aaat aaaacgat aa t t t at t ggga gcaaaagat a acgt ggcat t gt gacgat ac gt cacct t t t acaaaacggc t at gaaaaag t aat t t gagt t t gaaat gaa gagt t ccggc at aacat at g agagaat gac aaaaat gt t c t t t ggagt ga tatgtgt t t t t t t aaaat ga t gat at t at a cct aaagaaa aaacct ccat t t at t t caaa t gct acgaga t at cat aagc t t t ggat caa at t ct aat t t t t at t t gaat ttct t gtct t agaat t ggt t ccat ggaact t t t at t gt t t t gaaat t t t g at gt gagt gg gt t t gt cgga gccaacacag accat aact t ggt t cgt t t a t at gt gt gat agt t t gt gaa t ct aacat ct t t ggt gat ag cat at at ct a t ct at at at a agaaact t ag caagaat gt a t t t ggcat ac cgacaaaaag at gacat t t a t at t t accag gt t gt ggt t c gat aaaact a t agct aat cc t ct t t t t gt c aggat ct aaa t t t t caaat t t acgt att ag caaaat t t at gcat ct cct g t gt t at at t t gat at gaaga ttgagcgaac gggatcgggt accat at gat t acaccct gg ggct t t ggt t aagat at acg cgaaat agt t t caagt aggg aaaact agac t ctat t t t ga caat gagaca ccat gct t cg aat gct gat t aaaaact gt g aagagagcac ggt gat gggg aaaaacat at t gat agct ct t gcaagt t t a cat at cat t a at ggact t gg at aaact cga t caacgat t g t aact at gt a ttcaggacag t ct t t t gtt t t cgt t t t cag gaaaat gcaa aaacggct at cggt t agt ca t t t aacct ca ttttagagag act t t at gt t acgaat caat t at gct t ct t t t t t gt t t t a ct t ct agact aaacaaagt g ct cgaat at a ttggagcaaa cgat aaaaaa gct t t t t aaa cat at t gat g gcggcgaggc aaaaat t t ga ccaccat cac t at gagt gga t cat gt at t t t gcaaact at at caaat t t t aaaaact ct a t gat at t gat ct agt t cat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 Page 343 at cat t t t gt t t at at ccaa caaaaaacca aagcaat aaa gcacat gt t c tgaaagaaga caaagct act t t t t t t cat t aaaaaacat a aaagt at aca t t ct cact aa agaagt t aac 12689250 Sequence Listing.txt tccctcataa taataataat cataactcac tcgtatactt gaaaatacaa actttcattc caaaaaaaaa cctcaataca taacgttggc aaaaatacta aaaatatttt ttaaaagata aacaaattaa caagaaggag agctctcact ataaaacaga ccactccaac agagaagcaa aagaaaaaaa agctattgtt tat g 1740 1800 1860 1920 1980 2004 <210> <211> <212> <213> 315 2007 DNA Arabidopsis thal i ana <400> 315 tcacaaacgc gt cact cacc acaggagccc cggt cagt t a cagt t t acca acaat gacaa t t t t gtctct t aacact ct c t t at acat at agat t cat ct ggct cat t ga tacaccaaaa cgagctggag gat acgaagc cact t t gat c t gcagt cgt a gt t cgact t c ct at gat cct t acggagt gg t t t gagtcgg t gt gt caagt t gt agat agt ttaaaccgaa ttaagggaaa t t acgt ct cc t caat acggt t ccat accaa tccgacagca t cct cacccg aggaaaaaag acct cgt t t t act ct t t ct g ct at t cact t taaaccagga aat caagat t aaaagact t c agacaagagt acat agaagc caccgt t ccg gggaacgtca caat t cagt c t ccggt ggt g gaaggagaag gagaggaaga gagagt gat g ggt gggacat ttaaccaaga tat t t t t t cc t ct t t cccac gt accagcat t act cacccg t at ggt acct caccaccacc aagat ggt aa t gt t gt aat c gt t ct gaat t t cct aat gt t at gt t t cact t cgaagt ct c t t t agt ct aa t ct t cct cag gt gt agccac gt cacgagt t aggccagagt gt ct t t ccgt agaaagt t cc gaat t ggaga gt cgccat ga act gt t t gt g aaaaaccgaa t ct aat aaaa aaaacagttt cgcct ccacc acaccacat c aggcggt t ca act gcagt cc accat at t ca gt aaccaacc t ct ct t t t t t agt ct accct t caat caacg gt gat t at gt at agcaagat at caagaaga ggat cat ct t gt t caaccac t cgct ggaca caagagcctt cgt gct ct ac ggcggt t t gt gat aaggt t t gagacagaga gt gaat acgg aaccgactt a t gaat t t t aa gaacaaat aa aacat t cat c cccacct acc t ggt t ct t ac ggt ggct gct gcat gct t ac act gat ct ct t at at cgt at t gt gt gt t t a aat t t agt t c t at gt at cac t cgt t acaaa t gat t agt cg aat gt gacaa at cat cact t agt cat acaa agat agaat c caccact t t a t gt t gt t gag agggagagag gacggagaca agat aaggt t aaccgaaatt aat t t t aat t at t cagt at t agat cccaaa at ct acagca caaacct ct c ccaccccct c t act gat t ag ggt gt ct acg cat t t gt agc t t aat ct aaa agt act t aat t gt t t gt at t gt gt t cgt at t t ggcagt gg t cagaagt t g agcat cccac acaccgagat gt ct cgt ct g t at gcccgag agt gt t gcgt at t gt agaag aaagagttt t gat cat cggt cat t aat gca at at t cgt aa at t t t t ccat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 actcaaaact ccaaatcgga ctaactgaaa aaaagaaacc gaatgttcaa accccagtta Page 344 t ggaaaat at ct ggt gacca agagat cat a ct t caaaacg gt cccat act at cggat aat gagt gt t gt a at gat gt gt t aat t t aagca t aaccggact accaccgcgt t at ct cat ca at t cgggt ca t t t t gat t ca t gt at gggct t aact ggat t t t t ggt t t ga gt gaagct t g 12689250 Sequence t aat acggat at gaccaaat ttttctaaat tgatagcgtg ttcgaagatt ttattccata aggt t at t ga ct t t ct cat t cttgcttatt cctctgccgg taagcttatt tcgctgttaa ggt gct cat a cat ct ggaga ggat gat t t a t t cct t t ggt aat cat g Li st i ng. t xt cagaccgaga t gccat gt ga caaccgattt caggt at cat t t t cgatt at at ct gt t gat t t t at gt t t g ct t act t t t c agt agt aat t ct gacaccaa t ct ccat t t t ct t ct t cct c at t t gt t t gg t t ggt gt t aa at gat t t t gt tct t t t t gt g 1560 1620 1680 1740 1800 1860 1920 1980 2007 <210> <211> <212> <213> 316 2004 DNA Arabidopsis thal i ana <400> 316 at agat t at a at at ggct cc aaaacaaaac gct t ct ct at gaagcaact g ct cggat ct a agct acgat t aat t t cact a cact t cat cc cacat cat ca ccct t ct aac aacaacggga t at cagct gg agagt agaat at cgt t aaga gt act gt aac act ct gaggt aagggatttt aaaagggaac ttacat t t t c cat agt t t ag agcagcat ct caatt gagac ttcccacaga tggaacccag cct aagt cca ccagct aagt gcaaat ggca acaacccaaa t ct t cat t t c aagct caaag gat aacagcg aagaaacacg ctgatggcgg at acgagt ga aagagct gct ct gggt t t ac gaat t t t ggg gaat aaagag aaat t gagt t tcaaagaacc t gagaat gac at ct ggaagc at gcaaagt a at at cat ct g aat caaacct aaccaacgcc t gaacacct c ct cgact caa caggt t cat c cat ct ct cct at acct t ct t aagt t aacct agt agaacca aaat ggagt c t gat cgagt c aaatcggagg aagt t ggaag agggaat ccc cact ccaaaa aaccactccc at ct gt caca ttatgaaaaa gctaaagaaa aagcacagat cgagcacaaa acaggactca caaacaatcc atcagcaaaa gacacagtac cct agaaat a aacaccggaa aacggaatca gacgagctcg gaaatcggag attgagaagc ttgcat t t t g gaaacagaga aat gt t aaac acgtacaaga at t aat gaag agaacaagaa ggagt cact a aagt aacact t cct gcagat ctgagcaaga gaat cgat ct t caact ct ac ctaaacccag at gct aat at actaaacgac aacgaatcgg tcccggaaag agaggtggtg aaat t ggt t c ggt t t cagaa ggaat t t t cc aaccct att g ataaaaaaaa ct act agt at gatt aacgaa ttagatgcag cacat gat t g gt cacat aat caccaaaaaa ct t t at ct ag aaacctgaga ctttacgcaa aat t at ct t c agaccatcaa agct cacaat t caact ggt a aaaacgaaac ct aact t gga gttcgggaag gt ccgcct cg at t cgaggt g ggtgacagag aaatt caaag aat gt gt aca at t t gt t t t a aatt gct agt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 ttttcattat ggcaaatacg ataaatgatt tataagtgtg Page 345 12689250 Sequence Listing.txt tactaaatga tttatgaagt aaggataaat gcaaatttcg aaaattggct t t t gcct aaa t ct t t t cttt t acaact t ct aaaaagt gat cgaacat gaa gt gat t t ct c at ct at t t t t t aat act t t t caagagt aaa aggt gaaaaa aat ggt t aga agagacaaat agaaat aaag t t ggt cat at aat caagaga tt ct ccagaa tttttaaaac caat t cat at t t t t ct t cct aaagat t t ga t t at t at t gc aaat gat acg gact cgt t t g ct aaat ggcg agaaaat gct gaaact t t at aact t aaacc t ct aaaccct aaat agaaag caaaat acaa t t t ct t gat a t aaat t t t ct caagt cat t g at t acgagt c gt t at cacaa at ga at gact t gat caat acaact gacagacaaa at gaacaaaa at t t t t cct a t t ggct ct ac t cacgagt ca gt ct aaat ag t t t t at gaaa at at aaat ac ct ccacgacg ccat t t t t ag t at gt agaat at aaagt t t t gact t aaaac at t at act gt acat t ccat a tacaaccaca t t t t t gt aat t aaat gaaat t t cgccacct agaggaaaaa aat cagct aa cct t t ct act cat ct t aaaa cgt aat t acg gt cgt aaat a t at acat cat t aaat t t t ct at caaaaact gagaaagt t a act gt acaaa gcgt t t agt a aaaaact cag 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 317 <211> 2004 <212> DNA <213> Arabidopsis thaliana <400> 317 ctactggaaa gaaaaggttt gaaataaaac agacgttatg ttgcataggt taatgatctt agattatttg attcgtgaaa cctatgcata atggagtacc tttggctgat aatacttatg act t agcgta aaagatggca gaagcctatc ttccatttga aacgatcaag gcctatgttg atgttttgct tcattgtttg gaacagcctc atgtgtttag gccaagggag aaagttcaca tttatgcatg tgcgacattt tttctattaa aggctttact tcatcttcag atgcagagga ttcgacaggt gagtgaggca cctgaacctt aatttacgtt tacagtgcct tcttttagtt gttaatgatc tgttagtgat tcttttatgg tgttattatc aggttagacg caatctaggc aaggtgttct atatgacagt tttagtgttt ttgctaattt gatgttaccc attccgtaga gtgaggttag ccttcagaga atccaactcc t t at gt gat t at t t t acggg gt t ct t gt ga cat at gt t t c t t acggcgt t gaaaggcat c ccccaccggt gact ccacac aat ccaact g gagaaaacaa t at ggt t ct t t ct ct ct gga t t t t ct at t g caagcacaga at ct t aacaa gagaagagaa aat acagggt gt t acaagt t gt gagat aag at gacaaagc gt t t t ctgt t t acaggt t t g t t acct cgat t aat gacacc aagaagggt a gagaaagaaa t gt gcagt ca t aat t t cct g caacat t gt t gat t ct gaat gcat t ct gga gcggat t gca gaagagggat act t act t ct t t ct ggat ac acggt t t aat agt t ccacat t t t t atgtga t at t at t t at t at t t agt t a aat ccct at a ggctgtct t t aat agt gat a tttgaaaagc aaat ct gagc ttttctgctt acat gggat a ggct ct cat t tt ct ccaaag gt cat ggaca act t cct ct a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 aaactctctg ttttccgttt tccgggtcta aaattgtttt tttttttgca gcatgaaaca Page 346 12689250 Sequence Listing.txt gagct ct t gg aagat agct t ggct at gcct ggatttcaac ccactacaac atcatacaaa t t t ggct caa acaagacct g cccggaggca ct ggt aaaac agcaaacct g aaccgct ggt caccaggt ca t cgat agat g ct ccaaat ca aat t t t ccat t t gt aact aa t t aat t caaa caaat t gt aa t t t t aat cgg ggaat t t ggg <210> 318 gcgat gat ga act g cggttattcc tagtt tcaaacaaga agtt~ cttctcttac aaca taatgatttt gttc( t cccgat aaa ct gg acct ccaagt cggt t gaatccgttg atgc~ ccggtctacc aactt caccaatggt gt at ctggttattg tgtgt atccggttct atcg! taatatcttc aact( at ggac ct cgt agaaga cct aag ct t agg cagct g t cagg aat ct c cgat t gcaat a at t t c gcggtt caaagg t ct gat gat c t t caccaact cgacagt cac ct ct ct ct ag at cct aat ga cagggat t gt ggaggat t gg acat t aat t g gagat at t t t acgat t ct ca t gt t t at t t t tggaaaacca acaaaat agg acaccagt ac t aaat ct gaa agcat ggat g ct act at aat accggt aat g t cct ccaaca gcgt ggt ggc t ggaaact ct cacaat ct t g gct aaat aag cggt t aagga aaaact at t g at t t t agggc t cgt gt acgt t gcat ct cat gct t cacaga caat cat t t a ct at t cacaa ccggat t cat cgaat t at t t t at t acat t g ct t t gt aaag t t t t agcact aagaat ct gt taccagggac t ct t t at t ga 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 aagt t caggt t gct ccat ca t t gct at gg catca atcgcagtga agaaggaagg aacctaaatc gt at a act aat caaa t gat cat caa gat t caacat <211> 2002 <212> DNA <213> Arabi dopsi s tha i ana <400> 318 gcaatcgtgc tatcggttag agatg! tactaatgag tttaactact gcaaa agtaaaaaca ccttcaagag at t ac! cat cacagga gcagccgcaa at ct g! tcaggtgact tttccctaga cttca~ ttagtgtacc ttttgttcaa gtaaa~ at t gacact a t aat t t ccac aat gt t gatgaaggct ctatactttc aattcc catcggatat tgggcttatg gatccl cccact ct gg gtcaaagccc t t gct gcacgtaagt ttcttagcgc caagt( at at gt ct ga ct caat t ct t t gt t gt cacaaagtat gggatcaaag gaaac ggcgagaggc gggtacatag cggta~ aat ct gagat gt t t t aagt t acct g aggc acagc caac aacgt ct acg agat ccat t agcac aat t ct t gct acaaggat ca cgcat t caac caat t gt t t g cat ct t aaac aacagt gagg gggt gt t t t a ct cgacct at ct ct gct at c ct t t t t t aac at t t gcgagt t gcgat aaag gct t at ct ca t t gt t t t gt t t cat t gagca accgggat gc t t t gt gt t t g aaagt t t gt t caaccagt t g ccgat gt acg ct act aaaca ct t caat ct g aaagact gt c ccaacat at g aacct gct gt gcgct gt t gc t gt t t ct agg aact t t t t ac t t ccagaaat ct t t t at gt a t gt t aat cat t t aaaaacat cagt t acat t gt gt t aat gg t t at ct ct t t gt ct ct aact agt acat gga t t aggat cat cgt t cct cgg 120 180 240 300 360 420 480 540 600 660 720 780 840 Page 347 12689250 Sequence Listing.txt tgact t catg agcctaactg gtgcagtgag cacat t ccct ctcacat t ca ccacat gt ac gct t aacgt t cat cgccgt t t agt caat t g ct ct ct at t t gcat at aaaa gt t gcat gat acacct aact gt t ggt t t aa t aat t t agac aaaaacgaaa t cat ct t cat t caacct gca gat cgacgaa aaat cagat t at caaaat at t gt gaaaagt gt t t t gaaat aaagaagat c tataaggcaa ag gtcttcttta gt gat t ccaaaa at tgtatccttt at gtct t accat ta gaccactttc at t aaagt ct ct at taaaaaaacc tt caaaaaccta at aggccaaagt gg gcgt t agat a t t cttcacgcag cg tttctctctt ct gaggtaacca tt tttttttgca at gtggttatgg at caagaatgtg tt gttgttgtta ca tcaaaaacga tg aacaat aa t t gat gt c t t ccat gt t ct t aggc gagcct aa cat aat ca t agaaat g aat t t agt ct t t caat aaccccac ttttatca act t ct t t caat t t t g t t gccgt t gct t gt ag ccat ggat aat gt agt gct gaat gct t gt t gct gca t t t t gcagat aggt t t at t t at aaat aaat t t ct gaaaat t aaat cat ag gagatagaga gat t agct gt t gt ct ccacc at cgt gct cc gat t ct ct ct ggtttctttt t gt t agt t ct aaaaagatt c tgat t t gttt t t t t t t t gt t at gcaaaagc gccat t gcag t t gt aat t ca ct cct gt aag caagt t ct aa ttaccaacaa t at t agagaa agagaagaaa t t aat t t gaa aaagct ct ag gt aat cat t a t t t t at t ct c t t t gat at ct t aacgat t t t gat t t t t gt g gaact gat gc aaacct ct t t t t ct agccaa t at ggcat t g ct gt cagact t t at at t at a ct t gt at t ct aat ct aaaaa agaat act aa gaacat aaaa cagaat t gt t at t at aacaa at aaaaaaaa cct t t ct t ct agct ccaat t cat t t gat ca at t ct gt t aa t act gat t ga aaat ggct aa gt gt t gt t gg 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2002 gaggaaga aggtttgagc tgagagatct ttggaggtag <210> 319 <211> 2003 <212> DNA <213> Arabidopsis thaliana <400> 319 ccaggtgtat aatcaattat atcttcacag tttggtagaa tctcacatcc cataaacgaa caatgtgcac agcccaaggg gccaagacaa ctaaaagaaa gaaggtaaac atacattcag cgagatggaa at t atat t gt agcagaggac atcatttccg atgaccctcg agacttgctg accaacagtt tgagagtcga aaaatgatat aaagtagcta aaaagatatt aaaggtaaac actttcagga cctggaagag aagagtggag agagtagggt gagatcccat catatcatgt at t agt at t c cataaccgca aat t agcaaa aagtcaacca ctgaagaaca acagt ct gag at cct gat gc at gagct acc tatagtgttt tctgagaagg t t aat aaaag gccaaaatac t aaat caat g tgcaaaaaac t t gcgaaat a ccaagt ct gc aaaatccagc aagatt caaa ccct cat t cg t aagact act tgagacaagt agcat at cac cccct gt acc acactt cagg t cat gt t cag tt gtt aagt c caagt t aacg acagctagaa t t t cacct ac gagtagattt 120 180 240 300 360 420 480 540 600 660 aagcagaaaa taaaaaggaa ctcacaagta tcatgttagc aatcccaaag aagcatcctc Page 348 12689250 Sequence Listing.txt gt at gccact t ct caaaaac aat aaaacca acacat aacg gccgagaaga gt aaacaaaa agt agagat t act at aagag at ccgact t a tcggagaaac cggat t t t ct gct act gct c gat t gaagag gt gt cact gg cagccgagt a gt cacgacgc cgt cgt t t cg aaaccgaggt acaacaccgg ct ct ct t ccc agcccaaaat t t at aaact c agt gt agcag gaaattatga gtt! aacaagttgg ttaa gagacaagtg ttt! taaccaagag ctt! t cgaagct gt t aa~ tgtataagaa aag! cagaatactt cta tggaaaacgc agce acgcacgcca aac~ t ccaccagct t cc~ cagcaagtga aac! cattcaccgt cga! aaaacgacaa tct t ggt aat aac aga~ cttgctgagc catt ttcaagcttt ttc aaattgaaga aga~ ggaacccacg gag~ cgcgtcagaa agg cct ct acgat at a gt cgt aaacc ct a agt gat at ct t aac atcggcgaaa atg gaacaaa gacagca gat cat a gacattt aaaat gc gacgatt caaat ac gaagat a agt caca acct ggt gt t at ct gcaat t a cgggttt acaccgg t at gt t ct t ggt t aagacgt atccacc cgcgt ga atcgggc gagggt a gagt cagat g agagt aact c caagat ct t g cgat gaaaca ggaat agt t a ct gat t cct c at aaccaagg acccaacgat ggct t agcaa aaaat cgt cc cct t ct cct t at cgaaggca ct aacgaat c tgaagagaaa cggt ggact a cat at gaaat cacgt aagga accgaaggca aggt t caat c ct aaaggccc aaat aggaaa gaaat cct t t t ct aat t caa acct gcaaat cagcgat at c t ct ct gaaag at t agaact a aagaaccacg cct cagccac t gaacct acc ggaggaaat c cgt aat at t c gacgat aat t ggagat t ct t cgcggat ggg aagat gaaga gcagaagaag agaat cgt gt t gaaccacat at gt ggt t gt at ct cgcgt t cgt aaaaat c t at aaaagaa ct t gaaacca gcct gaagt a accact t t gt ct gt aaacaa agaat t t ct a caccgcggct aagt t cccac at caact t ca aat acat t gc cgcagt t t cc t ct agt t gaa cct t ct gat a gcggct cgt g agct t cacga aat t aaaacg caaccaaaat ggt aacaaca ggt agat gat at aact aaat ct t at cgt ct 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 gaaaccc tagccgcaaa gagagaaagg gagggaggag <210> <211> <212> 320 2004 DNA <213> Arabi dopsi s thai i ana <400> 320 acgt t aagt g gctccgtcgg t ct t aacggt agtccggcgt tatacgccgc cgcgttagat acggaaggcc accctgtttc cgtcaccatc aatgccgcgt aaacggcgtc tatttgagcg ccggaat ct a tattcccggc gt t t t ct t t a gcgtttgctg cgtaagcgaa aaacggataa tgaagcagag caagcattgg tttaacaacc gt t cct t t ac t cgct t gcac acct t aacgc tcaaagagac aacagagcgt gcat t t acca ggt t cgat t a Page 34 ccgt t aacac caat ct cat t ct t t aaat cc t gt t gt act t agt ct aaaga t aagat acga at t ccggt t t t ct ct t caca ct cat ct cca gacggcggac caat ccggt a gat t t t at cg cgacgt t t gt aaaagaacca 120 180 240 300 360 420 12689250 Sequence Listing.txt gaggaaggcg ggtacgaatt agcaagcgcg ctaagagcaa taggagacga gct t t gt cga t acggcgt ga gtcgctggga gaggcggcgg gcaagtgcgg gat t t cagga t t gact ccga aacgt agt ag acgt gcgt ca cat ggt t at t t t gat agt t a t at t t at acg cggaagaat a tagcaaaaag tttttctaac acct t gact t t gcgcat at t t t at ct cct t aat t t at t t a gagt ggt gga accaaat gaa cgaaacgaca t at gaaccgt tacaaaagcc agcat cat t c ccggacggag gt t t at act t t cgt cggat c agt at ggcat aggagagaag t t agt acat t ggt t cact ac ccat t ccaga acacggcaaa at ggct gaga ct cgagt gt a tggagacgaa at t t aggt gt gct gaaat aa t t t t act t t a at at gaat gt t t at t gt t t c t cgt t aat t c caact aat ca t t t ct t ctat taggaggact at t gt ct agt gat t t gaaca t ggat agat g acggccaatt t t t gat t gt g aacct t t cac aaccagagag gacgaat act t at gt gggt t ct cgt t aggg ct t at cggt g t t t ct ccggc at ct gt ggaa tacagagaca ggaaggaaag t ctt cagaaa ggaaagtttt aaacaagt gc t agat t t aac ccat aat ct t t t t cat agt t ttaaacaaga at cat cat t t t t t t gt t aat gt caaat t gt cacgt cacgt gggt t agat g t aact agcag ccct acgat t agagagattt t gaat ct ct c gat g gt gt gaat gt t cgt t accga t t aat ccagt agagcgacga t cgaagat t t gacgggagat acaaaaacaa gacat aggt a aat cagaaaa cagagcatt a t gt t gcgt t t caagtggggg t aaaact gt a cagt t aaagg gat t t t ggt t gt caagacct ct t t ct t t t g acacaaagt g caagt ct t ca gagatcagga t ggcccaat a ccgt t ggat t t at gat agag t cagagt cag at cgt t ct ct t ct t cat cgc cggcgatagc tgtcggcgaa t t act t t gat t gat acggt t t gt t agct at aaaaaacgca t at gaagaga t aaggt aaga aacgcat aga gggacggagc t t gt gggat a t aat t t t aaa aact ct ct at at aacagaaa aggt t ct t t a t gaagagt gt acat gat t t a acagat gat g act gcaacct aaaat at aag aaaagagaac aggaat ccga t ct t t t t at a caaat t t cgt gat ct t gat g att gacaaga t t cgat ct cc t gat t ggt gg t t t ggaat t t gat t cct t ga ccgaccat aa aaat aagaac t t aaaaact a aagaaacggc t gagt gt cgt t ggt ct acag act aat aaca gct ct at t t a aat t t t gat g cct aacaaaa t at gt t gcat gt gacagact t agat t t aac aaat cat gga aaaat at t t a cccat t at aa ct agagagaa t gcgt ccaag t t cat ct ct c gt t t t t gtaa 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <210> 321 <211> 2006 <212> DNA <213> Arabidopsis thaliana <400> 321 agctctctga ttgtttttag caatttcgtg gaaaagaaga aggcatgctc cagctgatgt atagatagag agagagacca gtgttcctag agagatatca ctaatgtgaa aagtaaatca tattgtttaa aggtattttt attctttaag gttcatggta ccaaactagt ccaaaatctc agaattttaa atacgactta tggttgaaca aaaaagattt ggaataaata attacacact Page 350 120 180 240 12689250 Sequence Listing.txt ggtcaatagt gattggttat aattttgata cattttattt gttgattttt gacaaatggt ggacat gagt agagcct t t g ct cgcct cga at t ggt t aat aat gt caaaa aaaggaaaaa at gct cgcac ctcggaacgg aaggt at aag act ct t t ct a t cat caact t t t t t ct gaat t ct gt agat c t t caggt t t g at aat t cgcg acgt cgt t ct cct aggt t t t aaccaacgac ct t t gt t t t t caat t gaaaa t gt gt gat ct tgt t t t gtct agaagt ccag acgt t cgt cc t cct ct at aa ct t aact at g caccgat ct t t caaagct cc t gt cgct cac <210> 322 cat gagt gag agaaat agaa gt t at ccgct at agcgt gt t aat aat aaag gcaaat t t gt ct accgat t c caaat t ggaa gt caaat acg cct t t ct t t t aat cgt t t ct cat ct ct at c t gt t ccgt cg agggt t t cgt at ct t gat ct t cgat t gat t ggt at aagag t t ggt t t cga agt t at ggat t t gaat ggt t t t t gaat acc aat acagaaa cagt agt agt aaacggtgga t gct at at ct t t t t gct t t t cat aat at ag ttttctgctg acaacaaggc tcacggggag t t t gat t ct t tcaacggcac ggcccat ct c aaacaaat ct t aaaat agt t caacgt aagt at aat caacg t aat t gt cag cct cct cacg ct caaat t t a t ggt gagt ct t t gaggat aa t t gat at ct g at cgat ct t c cat t agcct a at t gat at t c t t t t t gt t t t ct gct at aat ct t at ggt t t t ggt att t t t agcgt gaat g t cct tat ttt at caagaaat gat t t ct t ca t gt t gcagt g t t at agct t t gt t cccggat gagat g gcccaatttt ttaggcccca acgt at ct t c t gt t t t at at ct aaaat t aa ggggctat t t aaat ggt t ga gct cagat ag ggagcat aac aacgcat ct c gggt t t t ct c ct t t t t t gaa tgt t t t t t aa gagaaagggg at t t at at at gct aggt t ag gcat gcat gg gt t acat gt a aat t ggt t t c at agat t cgt at t t t t cat c agat t at gat acgaagct cc t caaat ct t c at ct ct at gg ct ccaagagg ct t t act t t c ccaat cgt t c gt cat gt agt cggct acgct t aaacacgt g t t t gt t gttc aat aat t t ct at ct gcggt c agggt aat t t ctt cacgcga cct agt act t at t gct t cat t t t t ct cgaa t t gt agat ct t cggt gaat t t t t ct ggaaa gt aagt at ct at agacat cg t t aggt t gac t gat t gcct a t act cat cat at t gt cct ga t gat ctt t t t ggagtcgaaa cct cggt t ac t gt ct act ca t at ccat t at ccat cct gag cagt t t at aa t ct cct cct a ggagcagt t g accgt gacag t caat ct t t g cgcaaaagat act t gcggcg aggaatggag gggt aaaaac t at aaacgct at at aaaggc cat t accaaa agt ct t gcgg gcgt t t t ct c ct gggt t t t t caaggagt t c ct cacgat t g t t ct t at cgt t t at acct ga t t gt ct t agg t cgagat at a t aaacat t gt cct t t ggat t ggt ggt aaaa agcat t gaag aacgt gagt t gt agt ct ct t t accagcgt g t t t t ctt ct t ct acaagt cc 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2006 <211> <212> <213> <400> 2003 DNA Arabidopsis thal i ana 322 Page 351 12689250 Sequence Listing.txt caggtaactt gtgggtct t g ct t gagactc acggagtttt cacctatct t act t t t t cac cgat ccccgt at gact at t t acgagcggt g at at t aat ag t gt gt gat t g ggat ct aggg caat t ggagg aacaaaagt a ct ct acat gt aaaaacat at gat t ggt gt a acat t gaat a aacaaat aat agt ggt t agc ct t caaacca t t t ct cat aa t t at t at at t act aat t gac ct t cat ggaa gat t t gcat t at ct aaat gt aacccaaaga at caact ccg t at ccacaca aaagt aacaa aat t caacag aaat t at at g aat at t t t at t t t gat t gt a gccacgt t cc act t gt gt cg aaaaaagaga t aaat aacaa acct ct cgca at t aaaat at t gt gt t gaaa agaaat caga act aaat t t t t aaaat t t t c gatttttttt at t t gt t t t t cat gaat t t t t agaagaagt t t cagct t at t cat at gat t aagt ccagt c t aaacggat g ct aaccat ct aaaaaacaaa caacgaaacc t gt t t gagt t aact t t gat t gt t cgact t c t t t at t t t ca t gggt t aggg acaagat aga agagt agcaa ct aacaagt g at t gagat t t caaagaggat gt aat t agt t t ggat t t gat gccacgagt c t ggacact ct agaaagaaaa aact gt t t t a t gct aagcga t acaat at t t agggact aat aact aaagt c agaat agt t a agt t t gt at t t ct t t ct t t c caat ccact a t gt cat t t ca aaat acct aa aaaat aat t a t t t agt t t t t at acgcacct ggct cat t ct ttgt t t cat t gcat cat caa tagt t ct t t t t agaat ct ag ct aaat at t g aat gaaat ga t cggt t gacc t gggt gacat aat at ct aag at aat t t gt g t t cagt t cac t gat t t cat a aagt aat at t t t ccgaaat a aat aacacac gcgat t ct ct t aaaact t ca at g aaat agt aac gcgct ct acc aat act t t ca gat t gagaga ggat act at c tgt t ggaaag aagt aaaat a agct gat t ag t t gt t acgt c t gt at agct t t agat aat ct at at gcat ca tttttttttt accaaaagaa ct cacgt t t t ct ct acgt ct t caat t t t at t ct t aaacgt ttttgaaccc t at gt agaag cat at act t c agat at t acg ggat agat at t t at t at t at t t ct acaaaa at t t t t aaaa caacat t aaa aat gt t ggaa gt t at at ct g gt t t ccacca ct cact ct ct ct caacaaag taagtggaga at t t gagct a acgt gt aact at t ggagat a t aat cagt aa agaat aaaga agt t t t t act t gt agt at ac aat agacat t ccat t t at at ttact t t gt t t at t t gt t t t t cat t t accc ccaat t gagg at t t t ggct c t t cct t cat t t t t t at t at g caaaaacaat aaaaaat gat t at cct caat t aat t gagt g acgt t ct t ac t gacct gt gg tt ggt aacaa t ggt gacaat at caacat t c at t ct t aat a aat cagt t t t at aaat t t t g at caacaacg ct cccat ct t ct t aaccat a t caat agt t g t gcggggt at cat ccccacg gat gcaaaat at gact aacg agt agagat g aaagt t agga ct t act at ct at caagct aa aat agt t aca at t aaaaaaa t at t at t t at t acaaaaagt t cgat ct t ac cat gt ggt t c cacct accca t gat t t t cct ggat ccctt t gagt t ct t t t act t at at aa at acaaagt a agat agaaat aagagt t t t g gctct ggcca aat t gt at at at t cat t t t c aaaaaaat aa t agaaaat aa caat accagt t cat aat t t t gaaacgcaac t ct ct at aaa aact gt gagt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2003 Page 352 12689250 Sequence Listing.txt <210> 323 <211> 200i <212> DNA <213> Aral <400> 323 agct caggt a gt t gact t t t gt at cgat cc cacgat gact aaat acgagc aacgat at t a gat gt gt gt g aggaggatct at ct caat t g ct aaaacaaa t acact ct ac aaaaaaaaac t t at gat t gg aagt acat t g ttacaacaaa gt t cagt ggt cccact t caa tcct t t t ctc ct t t t t at t a t t t t act aat at aact t cat agt agat t t g aaat at ct aa tttgaaccca gccaat caac at at t at cca t t t caaagt a at aaaat t ca at aaaaat t a cagt aat at t 8 bi dopsi s t hal i ana act t gt gggt t cact aaat a ccgt acct ct at t t at t aaa ggt gt gt gt t at agagaaat at t gact aaa agggt aaaat gagggatttt agt aat t t gt at gt cat gaa at at t agaag t gt at t cagc aat at cat at t aat aagt cc t agct aaacg accact aacc at aaaaaaaa t att caacga t gact gt t t g ggaaaacttt cat t gt t cga at gt t t t at t aagat gggt t tccgacaaga cacaagagt a acaact aaca acagat t gag t at gcaaaga t t at gt aat t ct t gct t gag acaaaact gt cgcat gct aa at at t acaat gaaaagggac cagaaact aa t t t t agaat a t t t cagt t t g t t t t t ct t t c t t t t caat cc t t t t t gtcat aagt aaat ac t t at aaaat a gat t t t t agt agt cat acgc gat gggct ca atct t t gttt caaagcat ca aacct agt t c agt t t agaat gat t ct aaat ct t caat gaa t t cat cggt t agggtgggt g t agaaat at c gcaaat aat t agt gt t cagt at t t t gat t t ggat aagt aa agt t t t ccga act cacggag t t t aaaat ag gcgagcgct c at t t aat act t aat gat t ga agt cggat ac gt t at gt t gg t at t aagt aa t t t cagct ga act at t gt t a t t cat gt at a ct aat agat a at t aat at gc tttttttttt acct accaaa t t ct ct cacg cat t ct ct ac t caat caat t t t t t t ct t aa ct agt t t t ga at t gt at gt a at gacat at a gaccagat at acat ggat ag t aagt t at t a t gt gt t ct ac t cacat t t t t cat acaacat t at t aat gt t aat agt t at a t t t t cacct a t aact aagt g t accat t t ga t t caacgt gt gagaat t gga t at ct aat ca aaagagaat a aat aagt t t t t t agt gt agt cgt caat aga gct t ccat t t at ct t t act t at cat at t t g t t t t t cattt agaaccaatt t t t t at t t t g gt ct t t cct t t t at t t t t at acgt caaaaa acccaaaaaa gaagt at cct ct t ct aat t g t acgacgt t c at at t gacct ttat t t ggt a aaaat ggt ga aaaaat caac t aaaat t ct t ggaaaat cag t ct gat aaat t ct t t caat a gagatgcggg gct acat ccc aact gat gca gat aat gact gt aaagt aga aagaaaagt t t act ct t act at acat caag cat t aat agt at at at t aaa t gt t t at t at tttttacaaa accct cgat c gaggcat gt g gct ccacct a cat t t gat t t t at gggat cc caat gagt t c t gat act t at caat at acaa agt gagat ag t t acaagagt gt gggct ct g acaaaat t gt caat at t cat at t caaaaaa aat at agaaa t t t t caat ac t t t gt cat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 tttttttgat tgtatggatt tgataataac acacgtttcc accaatcaac aacggaaacg Page 353 12689250 Sequence Listing.txt caacgccacg ttccgccacg agtcgcgatt ctctctcact ctctctccca t ctttctct a taaaacttgt gtcgtggaca ctcttaaaac ttcactcaac aaagcttaac cataaactgt gagtaaaaaa gagaagaaag aaaaatgg 1920 1980 2008 <210> <211> <212> <213> 324 2002 DNA Arabidopsis thal i ana <400> 324 gt gcat at at cagt t gt t cc caat t cacat aat cacaaac aat cat t aaa gcaaagt t aa gacat gt t aa ccaacat t aa aaaacgagaa gt t t t ggtta aat ggat aag aaat t at aca aagaaacatt caaaaaccag gt t gaaat t t gcaaaat t aa cgcat gcat t at gt t at at g at t t t ctgt t t cgt gat cca tcgcgggtag t at gcct t t t agat gt aaaa at at aagaaa aggcgt aacc t cgt t gaaat gat gcgcat a aaaaaacaat t t ct t gagct gaaatttttt gacagaacat gaagaagcct tggcgagcaa t t act t ggat acact aagac agt accgt gg at t t aacgcc t at t aacgac ct t t t ct t t t cat acaat aa t aacaaaat a ct t t t t t t cc t acat agt ag t gccacaaat ct aact t aaa t ccagt t t t g t ggt gaaaaa gt aagt gat a agagaagaca aacatt gt t t acgt agt at t gact t ggt t t at t agt gt ga at t agcggt t at act at at a gct t t cat t t accagccgct ggt ccgagat gct caagt t g t gt cgt at t c gacgt cgt t g aat gt at aca aaaaggtgt t cct t t ct gct t t gt t t aat t agt act t at c ggt t t t t at t at t t t aact t acgt t at aac t aaat ct ct g agt t aacacg t at aaaaaca t aagt t t cat t cat gcaacc aat at at t t t aat t aacaag cat t caccat gtaaggaaga at aaggccac gcat gaacga t agt t agt at gcgt t t cacg gcagcgccaa tctggacgag gaggcagt t g t gt ccct t cc agat gt ct ct t ct t gaaaat t ggt t cgaag aat gacgt aa gaagat at aa t t at t aaact aat t aaccag gacaagggca t cccat gt t a ct caagt ggc gt t act t t t c caaaagt aac cagt aaaact ccgt t t ct ga act t at ct aa acgaat gt ag cacccgt gac gagat agt ca at at gat acg gaaaacagaa aacgt ggaaa t t t act t at a caacat cgt c aagagt aaaa t cccaacat c ccaat t t cac tcat t t t t ct at t t t gaact aagt t t t t gt t at aaaagt t at cct t acat at t aaat aac t act ggact a t t at t ggacc acat at t aaa cgcaaat t t c at t cct cggt aaaat t at ag aaat aaagaa t t aact aat a cat acagt t t t agt aaaaga ccgaaacat t aacaacaaaa t t ccgt gat t cat at at gat cccacaagga cggaaaaaat gaaacaat cg t ct agagt cg gtctgagaga agt cacaacc gccgaagat a t t t gt t t t t g cat at t t at a t t t t agaagt acat t t ccat cat ct ct t gt t aat aat at g acgt acctt t t ccct acaca ct cat at cat agt at ct cat ct caagt ct t at gcagcaag agcat at at a t t acagaact t t aagt t t t g cat t gcagt a t t at t t t at a agaaagaat t gaaacaggt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 agtcagatat cgtactcccg gaaaagaaaa taaaaaacga Page 354 gct t aagaga agtt gcaaaa at at at t t t t caaat t t t ct aat t t aggaa at at aaat cc gaaaccct ag at aat cat t g aaaggtcaaa t at t t t t t t t t caaaat aaa gaagaaaat g accaaacgaa acgtcgaaga 12689250 Sequence gcagaaattt gt t t aacacg caaaat t aca cat t at t t t g t gaacacgga aggt t t at at ctaatgctat gttttttctg ataaaagggc catcgggccc tcttctctct tggtataaat tg Li st i ng. txt aact t t t t t a aggagaaaag at at at at at t t t gaaat t a at aat cgcag at gct gt ct t at caaggcca gt t t at at at at at at gt at t t at t t cat g aaaacggccc gcgt t acaga 1680 1740 1800 1860 1920 1980 2002 <210> <211> <212> <213> 325 2005 DNA Arabidopsis thal i ana <400> 325 gt acaaagt g t cgct cgagc at gt aggaat acgt at at at gcagctgtct ccagt ct ct g gggaagagac agt t ggaagg cccat at at c t gat t gt gt c at aaaccgaa gggacat ct c at act ct aaa aagt t t ct cg at t t aaaat g aagaagaaga t cgagagcat t aat t t ccat t at t t t at t t at agaaat t t at t t cat t gc t acgt at acc tcaaaagagg gt t aagct ca cat gtt gt gt act t gt t aga aaggt gt gat t t t aact cca t act gcat ct act gat at t t gaaact aat t cggt t t cggt t t ct t at t t t at aaaact t g t aaaatt gag act aaaacat agagt t t aaa tggtagaccg agt gat at ga t aat cacaag ggcat aat ca ccaat t t gt g t aat ct gaaa at t t at aaat gt ccact t t g aaacccaaaa cgat gagaca cgcgcat t ct t aaagat gat gggat t t aaa agt act t t ca ggtt agt t t c caaat agt t t t t aaccgaat t t gact t aca ccgat acttt aat cact t t t gcacgt at ag aat ggt t t ca at gt t at aat aaaaaagggt aagacat t ag ct gcacgct t at t t ggt cct gct acaaat t at gagt t ccc t at t t t ct t t cat gctt ct t ct ct t t act c agccatggcg t t t aacggat gagatgaagg gt ggcaacgt t t t t ct at at ggt t t aggca gggt t aaat t ggat t acggt ct gt t ct agt t cgt at at at t ggat acagc gt t ggaaacg aat act aat t t t ct ccacgt aagat t aaat t ct cat caac cat ggt t agt act ct at cat t ct gt gccac tttttttttg gagaat gt t a ccacgt at t g t ct cgt caat gt t cgcagag tgt t gt t t t a t t at ggt gag caaagagtt a t gt gt agt t a gt gcct cat a aggccat gcc t t agt t t ggt t t t ggt acca cgaagaaaat t t ggt cagat agt caaaacc gat aaaacag aat cgacct t at aggaat aa agt t cgagt t at t gt t t cct at act t ggt a aaaaat t cgt gt t gagt t t c aagt t at t ac t t t ggggct t ct ct t t caat ccaggccaag gt ggat t t gt caagaagact agact t t t ga ccaaaccaaa cat caaagga at at acaaat t t aggccat g at t t caagt t gt aact ggt g t aaaccaaat agct ggt t t c tat t t t ctac t cat t t ct cg gacaaaagag caagt agaga ctt caccagg ct t gt acacc ttttttttca t at at t ggt t at agt t acgt at t t t agct a t gt gt gt aca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 attctttggg cttattgtga atatagctcg agcaactcaa actattcaaa ctcatggacc Page 355 at gt caacaa at aaat aaac gcct acagct gt cact gacg ccaat gaccg aaacggcttt accgt cagat aaact cgagt aaacaaacct t t aat cct ga ct gccct gaa acat t aact t agt t t t ggga gt gaacggt g act t t t gat t cct t at t t gt ccaact ccaa acaagact ct agccgcaatt gt agcgt t aa 12689250 Sequence ttatcatttg tgtaaaaggc tttcttgttt ttcaaatttt aaaaaaacag aaaaacgaaa gaccgactca aagccaatga atggactaag ccatcgaacc t aaaggaaaa ggaaaagt ag aagatcgtgc ggttacatag at atagactt ctcct t cgac cacaattcct ctttcacaat caat g Li st i ng. txt aaaaacct at ct ccaagat a aggaaaaaag gaacgcaaga gaacat t ct c gagaaaagaa cgagccat cg ct gat at t t t ct t t ct t t cg gcagcat aca agcccat t gg ct at caat ga aaat gt t gag gaacctttt t t cat t cct gg aacct t ct cg ggaaaaaaga at t t t t t agc 1500 1560 1620 1680 1740 1800 1860 1920 1980 2005 <210> <211> <212> <213> 326 2004 DNA Arabidopsis thal i ana <400> 326 acaat agat c cacat acct a aacaat aaga ct t gt gaat t t ct t t aaat a aacgt t t gga t at caagat a gct t t act at t t agt aaaaa at t t t t t gga gt ct gat tag aat aat t gt a aat caacgca cgt aaaat ca t at aat t t t g agaat gt aca gt aat t acat tatctct t t t t t t t at ggaa aat acacat t aaagct at at ttcaaaaaga agt cagaat a cgt gt t at ac gat acaagt t cat caat at a ccat aacaat tagcagaaaa t t aagt t aca at acgggt t a aaaagat t at aact at aaaa at t ggcggga t t ct caat cg aaat gt gat a aact aacct t aaat at cct t t at ct t gt t a aaaacatt ct t t t t at t aag at ccggcaag gct accat ca caaagggaaa cgt t act t aa gt aaacct t a gaat at cagg aat acagaaa t t cat gat ct aacct aaaaa aat ct t t aat at aaaat caa at aat t gat g aat t t t t ct a t ct act aat a aaat aaagag gagt cacagt t gt aacat gt t t t t ctctta at t cagt t t a at cacacgt g t caaaat t cc caaacat aaa cat at aact g t aacaat t at at at at act a t t acat caat t at caaat t c ggacat aagt acaaat t t gt t t at t at t t t aaaat at t t t acaat t at at caaaaatttt t t t at t gtct aat t t at aat agct t gt cgg tttttttccg t t acat agt a at at ct ct t t at t t ct t t aa t ct gcat t t c aaaaagt cag cact gt t t at aaact at aaa acgt t aaaat gt at caaat t aat at gaaaa agt t agt aga cacaaaact a t agcaccact gt ggt at agt aaat t t t t t a ct act aaat t aaccat t t t c aaat gat t at aaacat at t g aaat acat gc t t t at aaaaa t t at cat gat aaat at at t c t cgcat t t ca agaagcct cc t aacaat t gt at at aat t t t cgcct agacc caat at gaaa tctgat t t t t t t agcct aag gaatttttt t t at caaat ca gct act t aat aaaat cgt aa acggaaaaac t t t ct t gtgt agat t t gaaa gagaacacat gat aacat aa gcacat ggt a at t t t gt agt t t at t t t t t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 356 12689250 Sequence Listing.txt gtaatacacg gttttattaa aatcacatat aaattttttt taaggattat gaaaat t t ca t t aaaact aa aat gt gt t t c ctt ccgacaa t t aat t t gt a aaat gaat t t acat caat at ct at t agcag aaaact aagt t ggt ct gaga cct cat t t t a cacgcgctt c at ccgaagaa <210> 327 t aat aat gt t at at aacat g gt t t acagt t cat gggt cca t gt t t t at t t t at caaat t g at at cat acc aaagt t cat g t gcaaacct a aat t aaaaga cct gt agct g aact t t ct cc gacaaaaaaa ttttcttaaa t gaaat t t t a cat ct t acaa tggt t at t t t ggt t at ct aa cct acaccaa aagat accat at ct ggacat aaaaacaaat gaaagacgtt gt gagat ct c agaaacaaga at gg aat t cat gac at gt t acat g caaat gaaag gt aacaaaaa t at t accct a t gt t t ggat a aacaat aat a aagt agt tag t t ct cccaaa taggagggca at t t t at aaa aaat at caca gt t t t t t t aa ttat t t t gaa gt agct t t gt act t t ct ct c aaat aaact t t caat gt at a caaaaat at c t agat t agcc act agaat t t aaat cgt caa ggagaaggaa gcaaaaattt gt t cat gt aa at agcat gt t aaat t acacg t at t t cat t t t act agt t t c agcaaat aga at at caggt t aaat t caaaa t aaaat agt a ct t aat t t t t t t caaaaat a cat t cact ct t ct cgagaaa 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2004 <211> <212> <213> 2001 DNA Arabidopsis thal i ana <400> 327 aaat ct gcgt at aagaaagg t t t gt gt caa gatgaaggag t at act t t t c gct t t t t gt t t t t cggacat gcaat at t t t aaat t t aat t aaat agat gg agat t aaact ct gaat caac gacaacacaa gat t cact at tcat t t gttg t aat gt t t ct t aggt at t gg t t t cagagt t gcgcagaaca ct gat aaagg atct t gt t t t tatctgt t t t acaaaagcgg t cagaccct g aat aat gaaa at ccacgaga ct cgaaat aa agat t t ggt t aagct t at t c cgaat at at t gagaggt t t a at gat ct gct cat at t gcat aacat t t t gt aat cagcgt a cggagcaaac t t ccgaggt t ct t t gaaaat at ct t ggt t t t acacact t a aagct gcaaa agcaaacaaa caaaaaacac t t gat at cat t t ct ccaaaa tgt t cccagg ct aaccgat a tccgcaaaag t ct aaact ga gtcaagcgga gat gt aaaca gccat cggt a t t t aaat t ct t t aggt t gt g cagaact ct a ct caat caac ct t ggacaac aagaagaaaa t at ct ccgca at at cat cag t ggct act gg t ct t gt t ct c agcacaaacg gagct t t aga agt gt t gt t t gaacaaat t a gagcagtgca ttgct t gttg cat ccat cat gggagaagat agct t caaag act gt aat ct aagaat ct aa t ggt gaaagt acgat aggt g ct gt ct act a t aat gat t gc t gaat t gaac gt t at t cagg gat aact t ga cgt t t aaat t acgt t aacat t gt cggagat gt gcgccaaa aacacat agt gt gcagacat at t t t gt t ct gt ct ct t t aa aat t t aaat c t acaat at ag cct at ggaaa t t t gat ct ag gt aacact ac gaccat at t a ct gcgt gt ga aaagt t t cgg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tgtggagatc taacgctaaa actttaattt ctttcttccc ggttaaccaa taaagcgatc Page 357 1020 12689250 Sequence Listing.txt cat ct acat a gat t gat at a ct aggccaag gt ct t t gcca t t gcaacct g gt t t aaggt t gt aaat at ga agcagt aaca t agat ct t t c t gt caact t c ct caagaat a gaggat t t t g ccaat gt t ga t gggccct aa gcccat t agt t gaagccat t aggat aaaga cagagcat gc cct ccaagt g ttcggaagag aat ccaaat t gaaacaggga t ggt aat caa t at cgt t ct g tgatggggca ccgccagct g cat t gt t t gc t cct caacgg t t ct gagt ga t t t t t t cct t ccct at ccct taaaaaccca t t ct aggt ca gagacgccat ccccgagacg t t ggt gct aa t t at acat aa t aacgt gact aacgt cgat g agaggcaaac at gat at caa acaccaacgt agcagat caa aagcagact t tgtcgtgggg t ggat aacat gaggaggaag aacaaat ct c gagct at at t tt agt t t t t t aggaagtat t aat caaaaac ccaaaccggt aaaat aact t ccat cgagt g aagaccgt ca caacacgtt a act gggt act ct gt t t cgcc gaat ct gagc aagcaat t gc gaaagat at g at aaaaaaaa t t t aat at gt gt t gacct ag cgtcgagcag aatccgataa ggagaagaag tt cacat gt a t t gt at at gc ggct gct caa gcgggat t at caact at t gt gt t t t aagt g tcagacacca t ggaaat at t agagat t t ca at t at t t ct c ct t at t t gt a aaaaacgtat aat gcgcttt caaat t t cgg ccgcgctttt gtagcgt t t t cactgatttt gacagattt g aaacaatgt t ggaaaagtt g gaggagaatc at acaagat t gagat t cgat ccggaactct t gt ct att ga t caat t caat at acaat cga aat agtt aaa at ct at aaat tggccgagga 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2001 <210> <211> <212> <213> 328 2000 DNA Arabidopsis thal i ana <400> 328 atgagaaggg gt t t agat t t gt t gagt aag t t aat at t gc aaagt at t cc at gt caacaa t gt t at ct aa aat agt aat t ggt t gt gat t t ct cat aggc t cgaaaccgt t ccact ct aa ct t cct at cc agaagaagat gat t at t ggg ct t ggaat gt ttat t t t agg aagaggact c acgt act at t aaaccat t t a t cagt at at a t gt gaagt aa ccaat at aaa t caat t t cag at t cgt aaac at ccgt caac aat gt aagag tagt t t t t ag ct t gt t aaag t t t cggt t t g gt ggt t t t gt ct t t t t t ct g acaaaaaaaa at agt t t t ga t ct t t ggaag caacacatt g t cat ct caat acgt gat t t c gat t aaat cc t gaagt ggag t t at gagat t agt t t aagaa t t t t t aaaat t t cagat t at at at t t t aca t aaaat gt at at t t t ct t t t at aat ct caa at cat t at cc ccaacggt t a t cact aaat a aat cgt t caa acat t gt t ca at aat t t ggt t caat ct t gt t t cacct t t c t t gaaat at t acaaaat t t g gagaacct t a at t t agaat t t gt aaacaac at gt aaggt g gt at t ccaag t aaaact t ga t at agagat c aaaat t agga ggt gt gaagt cat gt gct t g t t t t t agtat t acat acaat tt ct t t aaca t aaaat acaa t cgt t t aaag acat t gggct aact ct t t aa gagcacgt aa t acgt t acgc caacggct aa 120 180 240 300 360 420 480 540 600 660 720 780 Page 358 12689250 Sequence Listing.txt taacgaaaca ttagcccaca cgaaatctcc tttttttttt tgacgggat t cct ccacct g acagt cat ct aaat cct cat act t at cgat t cct t aat ac t t accgaat t ct cgct t aga t gaat t t at t t t at cgat t a t ct t gt t at t t gagt ct t gg aaaact t t cc t ct gggt at t t ct ct agcaa t gct t ccacg gt t t gt acaa agt t agcct a ct ct ct gat a t t t t t t ct ct t t gt caat ga t aat t gt cca ct t t ct t cgc t t ct ct gaga t ct ct t ct cg t cgt ggat t t t cat gt t ct t t t t t gtgat c t ggggt t t t a t agt t gt gat gt cgcgat t g t gcaat t gag at t t at gt t g t gct t t ct ga aagaagt ct t ct at t gct t c t at at t gcgt tttttagtca t aaat at at g gt gat ggt ct at aggt aaaa tttaggagac aaat aaat gt at ct ggt gct t t gat t gt ct gt t cat aaat cgat t t t gaa t gt gt at ggt ggt t t gt t t g cgt t t t t agt t ggcgat t t t at t gt t t cct aaaaggt t t a tttcggcaag t agt cat t t g gt ggct gat t gt aacat t gt ct t t acaaat t at acat act agcgcggt t a aaccat ct ct t t agct t t gg t at act t t t a cgt t t t ct ga t ct ggat gat t t gt ggct aa at t acgt aat t gat t gagaa cagt gt ct t a t t t t t t act a taggaacgga caaat gt cga caaagt caag gagagagacc cct t cgt t at t t t ct t t t at ggt t gact t t at t t agcat t ct t ct cgt t t t gagct ct ct gt t ct ct gct gcagt t aggt t gt at gat gt t t t t ggt t t c cggactgt t t gaagaaaat a t gaat caat t gat t gct t gt t gt t t agggt t t t caaggt t agagt t t t t a aat t aaact a ct gt t t gagg gt t gcat cat t at gggt at g at acagt ggg at ccct cct c t ct t caacaa t t ct t at gaa t aat t t at t g aagagt t t t g gat t agt aga at cggt ggt a act t gaagat aat t t cat gc at gt t gcat g ct t at t gaac t t t gt t gtaa t at at cact t ct t t at t t at at aat t t gt a aggaggaccg t caat at gaa gt t ct ct gct 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 cttgattcta actttctttg ttttatgttt ttggtttttg <210> 329 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 329 gttcgcttgt tctcatggac gacactacct taataattat tagcaacaac catcataact caaatctata agaatgacat ttgtgagtat ctatcactaa ttttggtttg gataaatgat ctaaaatgga ttattggtat atgtagattt at t ccatcca aact t cat t a ttgt t ggttc tttgttatta ttgatattct aatttttttt ttggctctct agt t ataatt gatataaaat aataagaatg gagaaataaa tgttatttat aaagaacatt gatatgttct tttatccacg ccat ggt aaa aagact at ct t t cgt t t cat agat cat aaa t aaat at t t t aagctttttt aaacct at t g aact t t aaat at caat gaga tgagaaaaaa Page acct t aat ag acat t ggt t a t t t t t aaagt t ct act at t c at t ggt t t at at at at t t t t t t gt t acaat act t aaaaat gaagagaaga agaat t cat t agt ct t gat g cat gat gaat gt aat t at ca t aagt aaaat t gat t ct t aa at t cat at at t ct ct gt at t acaaat ct ag at gcacaaat t t ggagt t aa 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt aacatttcca tcaatacatg atttactttg aaaataaaat gttaggaaat ggttgataaa at t ccat at a t acacaat ca acaagaat ct aat t at at aa t aat cgcgt t cct gcaagat t cacct t gaa t cat cat cca aacaaacact t caaggt t aa gt t gaaacag aaaaaaat ct gaaaaaattt t act aat aag caaacaat ga aat caaaat t gaaacgaatt t gat t ggt ca t ct ct ct cga t t act cct t t t ct t t t cttt gat t t t gaac agtgagaaag aact cact ag act t accaat t gct gacgt a gaggat aat t gt aat ct cag ggaaat aat c gcacat t gca gat gaat ct t t cacaaaat c aacgccgaaa t t t t t t t gt g t t t aaaat t t ggt aat at at tttaaaaaaa at t at gt aag t ct aagt at a tgt t cgggcc gt cact aaaa agaaat cgt c agagagagcg t t t t t cacct cct agct t aa aaacacaaaa aaat cat cgg aagt accat a caaaacgaac t t t at aat at ct t aact agt cct t gt t aaa t at cgagct a taaaaagagg aat gaat t cc aagaacaaac gaaagaaaag aaaat t t t ct aact t gt t t g t ct t gccgac aagat aat gt t at t agt t t g t t gggct t ga cgaat at cac acact cct at t gccat t t t t tttttttttc ggggaatttt t aat aat at a ct gt gaat t t gat t aaat at t t aaat aat c at ct t acact gt gat gaaca caaacaagag t t ct t t at ac t t gat act at acat t ct t t c at aaaat gt a aaat cgt aca t gaat at at t aaacaaaacg tttttagtgc ggctgggccg t ccgt caaca agctgccgt g aaat ggt cac at t t t t ctct ct t t t t t t t t ct cgggaaac taagaaaaaa gt gaagat at aagagat aaa t ct cct at gc t gaagat caa t at caat at c t t t ggt ct ag t cgact t gat cat gt t at t g act t t ct gga tgcaaaaaca t aat caact t t gat caaat a aacaaacat a agt t t cagt c t caacaat t g acat caaggt agat t agt at ttgt t t cgt a t t ct t ct ct c ct t t ct t ct t aaaagagat a t t at aaat t g t t gat cct t t ccaaacaat a agt t t cagct caaagacaaa cgcaacaaca gt gaggat t t t t t ccgat t a t at agcat aa gaat act gt c t cgt ct t gt a acaaact t gt ct ct aaaaag agggat aaaa aat caaagaa gt cagt cact tcagaacaaa acagt ct at c cggcct t t ct at t t t t at t t ct t ct t act t tttttatcgc 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 330 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 330 gcattttttg tagcttcttg gtctgtagcc agaggggaat aagcaagcaa gccaacattg tcaactgtgg ttttaaccca aaacaaaaga tctcaaggta ttctagaaag atttaccttc attctggatg ctcacaatct ttggtagtcc cactccgtat gaagtttcat tcgagacacc aagt at t t cc caat t t t t t g gt t t agaaca at accgacat t t cgagt t t t gat at agcga Page 36( cagacaacga ggt ggcat ac t gct cct ccg ct aact agca gct gt gt t aa acct gt t gca gccgcct cca t t ct accaga acacact t gc agct gt aacc caaact ccgt gaat t cgagg 120 180 240 300 360 12689250 Sequence Listing.txt aat t at ataactcaat ataacatct a aacact t caa aact ct gt gt t t g agt agcact a ggcacact ag cgat ct ggcc aaacgaaat a t t aagacat a t cagt t ccaa cgcagaat ct gtt gccaaaa at t aggt t t t ggt t gat acc acct t gcgt c tttaccacag t tat at at aa gagaaagat g ccct ct t caa aat gt cat ct gcaggagat a cggaat ct cc t caat at t ct gaaagact cc aagat gacgc t at ct act gt aagaggccca ggt ct aat ct gcggt acat c agct gact ct aat ct ccgat cct t t cct t c gt ct ccat t t t ggact gaaa t t t t gcgcat aagcaaaaat ggcgct t aag caccact gt c ct at ct gt at gct gat at ca t t gt cacgt t t cct t ct t ca act aagcaga agt at aaaac aacaat gaag t t gcat aact gaaat t ccaa aagagagatt t aat t t cct a cct ccgagca t cgaat gt t a cat accggag t t t gat t cac t t agagt caa cagat t t cga ct t cgat at c acgct ct ct c ct agat ccga aact at gaga cgacgt t t ca aaaagat cag gaaat agaaa aaggcgatgt act t t t ct ca cct t at gt aa acagaaacaa t at acat t cg gct gagact t t cggt at ggg ttccagaaag at t cacat aa at aaaaact c cagcat ct ca at t t cat caa gt t ct t acag t act ccat ag t t cggcgat a ct gct gct ga gt t gt cgt gg gt at t t gaat aaat at at at t t t at t t ct t t t cgat aaat aaaagt cgaa t cct gaaagg t agt aaaaat t gt cat gat t gacact t gct t accagt gt a acact t t ct t gct gacct t t t gaacat caa t gaccaat at caaccagct a at act ggaaa ct aaacacaa acact t gcaa acagct t cag t gagat t ct t aact t caaaa t gcccat agt cat t ct t cgt gagt aacacg ccaccggagt acgt ggt t ct act ct gggct t gaat aat at cgagat aat a acgcaagagt t at cccaaat ct ct t agt t g caccgaagag t t ct gacat t caaacaat gt tttgaagcaa t gat at t agc ct gagt accc t at caat t t t gcat ccaaag ct gat at aaa t aaacaat ga cagt gct aaa t caaat t cac cagt gt caat t ct cagt at t t t gaacaat g aact t cgct g cgacgct t t c gcggcgcgt g gaccgt t gag t ct cgcct t a aaacgat t t g ct t gt t t t cc t gat t at at t gagct t cacg t cgat ccgt t at t t t t aagg ct cagcgaat t ggt acgt at t gat gat gaa agaat at gag gt caat gt aa agcat ct aca acat act t t a t at aact t cg t at gat aaat gat ccgt t t t t act t gaaaa ttcaccagaa caggcaagaa gcaat t gat g tt gct cccca gact acaat t at gt t gagat gccgaaact c aat gccaaac t aggt gacgt tcgccaccat ggcct aat gt t aat at gat a agat gacgt g gt cagacaaa t t ccggcgat 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 331 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 331 aaaatatttt tcgtttccaa aatttgtttt gtgtgaacga ttccgaactc cacaattgat ataaaaaatt atctttgagt tatattatca acaacagaaa aaagaaagaa aacaaataca acgcaagcct tgaaataaag tctcatagat atttggcttt ttttaagtgc aacacaaaca Page 361 120 180 12689250 Sequence Listing.txt aagcctttca agacgatgag caacaaggct t ct t agcgt t t gat t cagaa gt agcagcca cat caat t gt act t ct t gct t t aat gcct c gagcat cct c t gat cact at caaacgt t gt aggcgct agt cagt cct t cc t aat gagcag ct aaat t cca at act t caaa ct t aaacaag t cgacaaat t t ct cgaccaa ccaaat gt t c gaat cgaaaa gat t acact a gcgaat t cga gagaggagat t cgt at t ct t t t gggat t t t t t t cccct t g ct t t t ct t gt gct aat aaaa accaaacaaa t t ct t cat ca t ct t ct ct ct t cgaacaat c agct cact gc ct t gaat t cc t t ggcct t cc gat gat cct a gagagccgat t gt ggcaact at t ggaat ct t ggct t ggt g t at ggct ct g t t ccacct t a cagact ct at caaacaaccc agaaaat cat gt t t t agat c t gggaacat t at gaaaaaaa ct acacacaa gcaccggat g t at act at at caccaat ggt t agact t gcc cgt ccggt ct gaaat agaaa gat ct t ct ct caagt t gggc gggaaagt aa accgagccaa t cat ccct t c cgat t t gct c t t t t t t t ct c t t gat t t t t t gt ct gagcaa t t gat gt t gg t aaacct ccg gt ct cgat ga gct ct gagat gcat ggt ct c acat cat aga t at cgt t ct t gaacaaagat gat t agt gt g agt aagaaaa gt t cgagat c t gacat gt ga at caat aaaa caggcaccac t gact gct t c ct gct t aact at at agagag ggat t t ggat gacaccggag t ct cgccat t aggagaagat at t caacct t t t act aaaaa t t agagct t a accagaggaa aacact cct t tcacggaaga cagt t agct c t t cct t ggaa cat t t gcagt agagaat cgt aagat aagcc gct t cagat c t t agt t ct t t ccaaaagagc gcccagccgt cat aggt t t t t ct gcaacaa ct agaaact g aat t t cagat gggagcct t a cgct t aaat t acat gt aaac cgagt at t ca t agaact aaa agat gt gt gc t ct aagcaga t ct ccgat ca gt t acact t t gacagaaat a ct cggct t gt agcct acat a accggaaaat t acact caat t at agct t t t t t ct t cagaa at at t cat at ccagat ccga t gt ct ggt ct ct gaaaagcc t t ct t t ct ct t gt ct t gt t c t aaccacct g t ccaagt gca gt cccat at c t t aat aggt t caaaat t agt tgaaacaaca gaagagat t c cact accacg aagagat acc at caaaat t c tacgaacaca cct agat cca gt act t ggag act cgt t gcg aaaccacttt t ct cgaggag aat aaat aat t at t gcccaa t at gt aat t c aaacaaaaat acagat t t t a ct ccgcct ca t at ct ggt aa t t at gat cat t at cagaagc gaagat at cg t t t t ct acgt gcat agct ct ccaat caaca ct t acat t ct cct ct gt aat t gagct t t ca aat t cagat c aact t cgcag accat gat t c t t agat ct gc t accagagt a aaagcaaaaa aaaaccaagc gat acggat c gcaacat t t c agt acgagt a agt gaat ct a aaacaagt ag t gct t t agat t t aat t t ct c cgacgaagac at t t at acat cagaaat aaa aaaaccct ac act t t t t gct gat ccat cga gt caat t gcg cccaagt t cg 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 332 2000 DNA Arabi dopsi s t hal i ana Page 362 12689250 Sequence Listing.txt <400> 332 aaggt aat ga ct t ggt ccat t aaat t acga t accgt gcgt at t act at t t ct t aaaacat aggaagtaag t aaat gt cga t t accgct cg cact t gt t t g t t cgct t aat aaaaaat ct t acaagaat cg at at gaaaaa cat t aat t ag ggt ggt gct t at at acat at t t gt aaat t c acacagaat a t gaat t aaaa t aaact t at t t t t t at t at a at t t t t t aac t aat gagat a aat t t gat ga at gat at t t a t t t ccaaact gcaat agct c ggacgt t ggt aaggcccat a t at ct t ct ag aaat t act ca ct gcgccaca t gagcct t ct gt aat at t at t aat cgt aat t t t aagt agt aaaccacacc at t agct cac agt t t gat t a agaat at aag caccgt t agt ttgt t t cat g t acccat t t g ct t at t t gt a cgagagact a gat t cat gaa acat t gat t c aaggcggtcc gtgt t t t t ca aaat t cacca aaacat aagt taacaaagaa t gaggacgt a agact aat t t t ccaat aaaa ct t acat at c tttttaacaa aaaaaacaac caaat agaaa t t aggcct ag t cggt t ct ag aagt ggccca aaaact gt at acaccacgt c gaat t gt gcg t cat gacacg gat t at gaca aat gt at act aacaactttt agat t t t at c gt t at t ggga at t t t gaacc gacaagagtt cccgcaaacg t gact t ct t c t ct gccaaca t gtt cgaaac t gcaat gaac cat aagagt t aggt gct tag at gacgt t gt t t t act t at c aaat t at t t a gaaaagact g t t at caat ag acct aaaaat ttccaagaga t at t gt aat t at ct agacaa gaagaaat cc aact gcat t g caact t caaa t acat agct a t ggt cgggct gaaat aaact cccgt t t t t g gt gaaaat cc at t t acgaaa aaaat aaaac aaat cat aga t aggt t t t ga t t t t aat at a t gcagt t gaa gat gt agct a aagt at aaga ggacgt t agc gcgccgt t ct gtctcgaggc tctct t t t gg t ccct gaaag ct aat caaat ggt cct t gga gagt t aggac agaaagt at t gcaaat aaga t gt t at at gt aat t aaacac t at t t t t aat ct ct at at ag t aaat t t at a tcaaaccaaa gt t gagat t t at aggaact a cagt gaat t t acct t aat cc gat ct t gt aa cagcctggcg cgt cgt at t t t t ct t gt act gat ct acgt c at ct ct gaaa aaaat aagt a gaaagttttt at aat gact c t at at at at a ccgt acat t t t cgat at gga aaaaaaagag tagtaggaag ct cgt t t cga aaact t gact ct at at act c t t t cagt ct t at aact ct t c aagcgacct c aat gt aaat t t t t t t t cgt a aacacacact t gaacct acg t t act t at aa aaaat t aaac ttgt t t t t ga aaaaat at t a t at t t at t aa t ct t t at agg at aat aaaaa cat caaaat c aaaat gt t at ctcgtgaagg gaaaaaat t g acacacgtt g ct acacaaac t ct gt ct ct c cct ccgat cg at gat gt gcc t t aat agcgg aaccgcacat t at t t t t t t t t cat cgaagc t t aat ggct t accgt acgga t gggcct cac gccat cat cc ct t ct t cact at gaaact t t ct t at gt at g t caagaaat g t t caagt ct t aat aaacaaa taagccgact aat caact at aaact cat ag gt gaaaagac at t t aaaaaa cgaat at gag aat acgt aat t t aaaat gt g gt t t t gt aaa at acaat gca cat t aaaaca agat agat at caaat gat t g t t at gggt ct t cgt t t ct ct agacaact t c t ccaat ct ct ttaacggcga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 Page 363 12689250 Sequence Listing.txt ccgccgattt tcccggtaaa 2000 <210> <211> <212> <213> 333 2000 DNA Arabidopsis thal i ana <400> 333 t t ct agct ca acccagggt g t gaat cgcct t aat ccaaac t ggct at gga aggcccaaag gat cgt cct a ct caccat ac t gt t ct aaat t gt ggact t c acacat cgag cagcagt cag t ggt ccagt g gat gagat ac agggaccat g t gaat acat t t gt at t t t t c t gt at t t cga t gaagat at a t t t aat aat g aagt gt t t aa at t act gaga cat t t gt caa acat aacact aaaaaaaaac t t at t at cat tt gat agccc agaaacgat c t ct t ct ct gc t ggaact ct c aaaat aaaga t cct caagac t ct gccaggc t ct gaaccaa aggaccaaac at t t t at cag t t cgacat ct ggagat ct cg agct acgt ga ccat t cat t g gt acaaat t c t t gt t gaat t t ct t at t ggt cgagcaagaa gct t ct t t t c t gct act t t t at t at agaat t t at ct aaaa t agaaat aag ct t t gt gcat aat at cat aa caat caacat aaaccat at a ttaaagacaa gt t t at t t ca aat at gat t t t ct t t ct t ct t aagcct t t c cgcaccatca ccagatccca caggctcact cagttgctcc ctcggggacg cct t gccact cat t gcaat t ct ccat ggag ccat gacat t gt ct cat t ct cagcagcaac cggt t gt ggt t gt t cgagct taccaaaggg agat gat t ca tgagaggaac tgcacacaca gat gcaat ac t t gat gcgt t agaaat aaca t gat ct t aaa at ccat aat t at ct t at at t gtt acacgaa gaacaaaaat gcaact t gt a at t t t at t t t t at t t ct at g gt acaaacca t acagt t t at t ct t ct cgt c ct ct cgact t ccaccagact gagaccggag gagccctgag aagagct cca gccagccaca cct ct t agt c cct aaacacc aaact t caca ct act t ct ac aat gt caat g gagtcaggac gt t t cacgct gt gcagcat c aaaacgct ag t gat gat aat aaat gacaaa t t gt at gt aa at t cat t t t g act t t t gt aa aagaaaaagg t t ct t gaaaa t at t acat gt at t t caaact aaat t aat aa cagat aagaa ggcccat at t gt cgt t t gt c gat act t t t c gagccacct g gaaccat t ac gagcaacgac acacgaccag at ct gct gt g t acct cgcca gccaat ct gg aat ccaagca aacacact ca t t cacaagct t t gcagct gc cgct cgaacc t ct t t aaat a cgat t t gcag at aggcaaat ggggaaaat g t t t gat t at a at t at aat gt aagt gaaat a at at at cccc t t at t gcaat t t t aaaact a act aaaat t t aaat gt at ca aaagt gat cg t gt ggt t t gc act t t agagc t ccagt aagg aaacaat cca caccacgcca ct ccacaccg ct t at cagca caat t ct cct accgt ccacg acat gggat a agaaaagcag t agcgact ga t ccat ct cgt agct cggaac t t gggt cgt t cccct cct gc cat t t ct gac agat agt t t t cat t t gat t t at at t agaga t ct agct agc t gt gcgaat a t aaaacagt c cct agat t aa aat t acgaat act at gt t t a t aat at t gat gcccat t aaa ccggagct ca t act t ct t ct t acgt t t ccg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 atccttaatc gagctctagt atcattgatt tggctatgtt aatgaagatg agggtttagg Page 364 gt t ct t at ac act ct t t gat agct t ct ct g t t ctgtaggg 12689250 Sequence Listing.txt agacaccaat gtatctcgtg ctagatctga tttgaagt t a tgagagaaaa tgacatctga ttttggttta tctccttctg gatcagctta tttttattgt attttcgatg ggtacttata tttttttttt caatttggga tttttcatga tttgcaaaga 1860 1920 1980 2000 <210> <211> 0 <212> r1 <213> 334 2000 DNA Arabidopsis thal i ana <400> 334 t agaagat at at gacgt cga cat t caagac t caaagcat a agat gacaat t at acaat t t gt ct gat at t t t t at t t t cc t ct act gt t c t at t gt ct ac gt cgat gcat at gggt t gt t aact aagct t t acaat t t t t actggagaga at ggaggat c gagt t ggt t a caaat gt gt t caagt t aact t aaaat t t gg gt gt t ct aga aagat gt t ga ccat cacgca t at t ct aggt aaagat ct t c t ct ct t t caa ggat t t agca ggt act act a at t at t aagt caggt t t cag ggaaat ct t a ct t ct t cat a accaacaact t act acaat c aagt aaaact t gcagat cac cgcacaaaag act gaaaaat t at accagat aagcat t t t g act cgt t gt t at act t t t at t att ggcaca t t t gctattt ct t ccaat at t t at cgat ca aagaaaatt g aaat gacat g t gt aaaccgc at t accggt g ttccgt t t t t at t ct gt t cg gt aagagct t ct at aaccct aat ct aacat caacat agat cagaacaaga aggaagat t g gt gct at t cc act at cct t a ctgt t t t ct c t ccat ct t ct cgacat t cat t aggt gacac ttggacaaag aaat gt t gt a ggat t at at a gt t t at ct gt t t t gtt ccaa gat t agacca t cat act caa t at t cat acg agagaaacat gcggt t t t gg gt aagcacaa at aat t t at g gcaat t acgg t ct ct cgcga gt ct t cat gg t t t aaact ca t t caggt gaa ggt ct t gt gg aagt gt cggc t agt acacat ct gagct cca acgcaat cgc at t t cct agt tgtct t t t gg cgggt t act a t ct caagt ct gt act ct t aa act aat t t t a t at t ct t ct g t t t t acagt c ct gt t t agat aat t agagat aat gt aat t t t aacat t t t a acaaaaaaaa at t ggt t aga ggcccct cca t gaat t gacg cgt cgcgcat t ct ct ct ct c t t ggaaat t t agcaaaacat aaat gaat t g accagct aat ct ggaagaaa ct t cacat t t agt t cct aaa t acacct cgg at gat aaaaa at ct t at t cg gaggcaggac aagt t aat ag ct t aaaaaca gt t aacat t t cggat gt t ac t t gt gcat t t t agact t t t g t ggat ct t gg t t t aaat ggt t gaat t t gt a caagt caacg t gcaat ccac cgaaaacatt t t t t t atcct t t agcgt t gt t ct ct cagat gt ct at gt cc aacaagattt ccaagt acat ctatggagga t t gt at ct ca gt ct t ct act t gggcaact g t aacaacat c gt t t gacat a agaat gt aat gggt t aacga gt aaagct ac cacacacaat t gcagact ca gat t t cgcat t t t gt t gct g t at t t gagag t t act t aaac gt t at gat ct t t t gccaat t t caat gat t t ggccaat gac cgtcgagggg t t gcct ct at t t t gaagct c agct ccgaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 365 ct t t cacgca t t cagt gt gt gat gat ccgt gat t t cgat c acct t t t t ct aggt t t t gt a acat t t ggag gt t agt t aag gaggt at t at t ct t aggt t g t gt at ct t t g t gct gct gca cgt t gt aat t aacgt t gt t g t t ct cgct gg tgaagcagga 12689250 Sequence tgatctagat ctgagctaag ggaattgaat attcttgctt ggtaatgctt tggatttcga tcgtttcaag tctggatcta gctt caatt a gatt ct gaat catctgtttc gatctgcgtt tgatctctct ttatgtgata Li st i ng. txt gcat t ct caa cgat t t agga tgt t t t t ggt t gaacat t at t ccgcaagat at t ct aat ga t t gat act ct t t t t gat t gt t t t t t ctt at t gaaaat t ct ct t cgat t t c ct at acgat g t t t at agt ca gt t t ct gt gt 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 335 2000 DNA Arabidopsis thal i ana <400> 335 gaagt gt t gt at agcgat gt gggt gat t t c aat gt act gc act gat at cg cagt t gaaga t t t act gcag gaccgt at ct t ggt t ct gt g ct acct at ag t aat cgt t t c t t t ct ccaaa acat caat gg caaaacagat t t t t gat t t a gt t at cat ct caggaggcaa cagcaagcca cagct gagaa caaaacctt a ggt t gt at gt t at gt gcct g cat t gcct ct t aaat t ggt t aat t t gt gaa t ct ggt gaaa t cct t t ggca accacat ggc at gt gaaagt cat gt ct t ag aaagaggttt cat gat aat c at gt t t aaac ct cagagcaa t t agt agaat ccct ggt aac t t aaacaaat t gat gaggat t ccgt t gcac t ggt t ccaca agcccacgaa acat at acct gcaggggat a t ct gat t ct t t gt t ct gaag t cagaat ct c t accct t at g t at ccct t gt t gggaaaact agct gt at ca aagt t t t t gt caagt gct t t caagagt t at cat ct ccaac caat gct at g at cat caccg tgt t t t t aaa at aaggggat aacct gaaag ccaaagactt t at gat cgt c t t gcagt ggt ggt gat agat caaaact at t t cacat gcat t ct t gt at gt gt at gtt t t t t agt t t aaaa t t caat at aa t t t t t cctta gggaggaatt ccaaat ggac at aaat t caa gt aggt gt gg gcaact t aaa gat at agat t gt at ccagag ct t caaaaga t gt ct gcat c ct gt t ct gga gcat t gt gt t t gaaggt gt t t cagct t at g t at gt gct ga agct ct caac caaacaagt g at ct t gggca aaagt t gt at ccat ggct ag ct t gt gt at a gt t ct t gt at aacct t gt t a t gggacat gt gt t t t t t ggc accat cat ga gaaat t gt gt ggccacaagg t t t t ct cact t gcagt gact t ggt t caat a at t ct gat t a at t t at aat a gt gt aat ct g cccgat t cca cccagaggga aact ggaaat at t t ctcttt at agt aaat t cccaaggct a gt gat ct gaa t gt cact gt c aat t gct aaa t gct gt t caa ttcgccaggg t gat acaaac agct gcagca gt gt at gt gt at cagaaaga t at t at t cca t t t t gtctt g t ggt t at gct agagt ct gga t t gccaaat g gat ggaat ga gt t gcagt cc ct t t gcgact aagat at t gg gt ct t ggaca gcaacct at t t accacggt t t t acagt t gg t ct cat gt gg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 tgttgttggt tctcagggga aagacatgcc atggttttgg ggacatcagg cgatgacaag Page 366 aaagt gaagc t ccct t gat g ggt t t t gttt accaagt gt c acagt ct t ct t at gaat cga ct acat acat at caaaaat g ccagaaat at aacagccaga agt gt gagaa tgtgggaagc acatgaggcg cgagt cccaa t t t t aacttc at gt t t t cca t t aaaat t at at ggt aat t t t gt ct ggt cg gaaaggccca att agggttt t ct gccacca 12689250 Sequence t ccaaagt cg caat cttt gt tatttaaacg gtacaacttg gt t gggatca aat gaagggt tgaagttttg aattttgctc aatt aaggaa at t t aaaaag tatacgcatt ttgttttctt ggtaaataat acaaaaagaa tggactccga t gggct t t t t cgtaagtccc gaatataacc ctctttgtct tcagcagtca Li st i ng. t xt aggct ct cgc cgaggaaaaa ggt t t aaagt at aggact ag cacacaacaa t gt at cagaa t ct acacaac gaggcaaat t taaaaagcac gt gcgcat cc at acacgt ag aact ct aat a cct t ct t gt t t acaacact c cact aagaaa aaat gat agt caagt gaaaa gaaaactggg act ct ct aat gtaggagaaa 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 336 2000 DNA Arabidopsis thal i ana <400> 336 acct at cat g ggat at t ccc act cggatt c t gt t t cagt g ggat at aaac t gat at gtt g ctt accaat t gat t ct aaac aagaat t at t att aat gact at t t t gt agt aaaat t t t ca cgt gt at gt t aat ggt gagt tagaagaaaa cct ccaat at at aaagt ggc at t aaaagga t agaaaacgt t t caggat ac t t t ccgct t a agacagaaca tggtgt t t t t aaacaaaagt t caaagaat t gat gaaacac ct gt t gact t t ggt t ct aat at gt cat at t t act gaaaat tt ggt caaaa t gtt gggttt t gat gat t ag at caggt aaa at ggact aac aaagaat at g t gatt cgat c at gggt t at t t t t t ct t at a ggt t t t cct t t ggat at ggg t gat t caaaa t aagggaaat at at at aaat aaaaaaaaaa gt aat gcat c gt aaat t at g cgt t t t ct aa aacgat gagc act t t t aagt t t agt t at ca t t t caagact ct t t t t t t gt gt gt at at t t gat agt gcgt gagaaaat ga ccacgcgt at ctt ct gt ct t t t t t cct cct tt cacttgtt gt ggat cccc at gt at acaa at at t aat at gt cctt gat a cggaat t t gg tt cat t t cgt ttttcagcga aact gaacgt catt ggcaat tt gagt gat t t gcaagt agt gt t at at at c gcgt t t aact caggaccacg gaaccaat cg t gct ct t gac tttttttttt ct gt aact aa t t t ctt cttt at t gt acagt at t t caaat t t gcaat t t t a aggttgtttt at t t cggat a at t t gct aat ct aacgat t a gat t caaaat ggt gt cgttt tttttgctaa t gagt gat ga aaaat t t gt g t aaat gtt cc cgacgtgagg cctt cgagaa t act gaagaa tttttttttt gaagaagt aa t ct t t at t t t t accaaat at t aaaat at t t t t aaagt aaa acagt aatt t gt t t cat aga ggt aat t aac aatt gt gt t c act caacgct gaaat at gat acaaat t ct g gat caagt ac aat at ggt ct gtgaaggaga ct at cgt t ga taagcaaaaa gaaaccaaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 367 12689250 Sequence Listing.txt aagatgcttg agattattct acatttcaaa agattccacc ttatatgctc gcaat acat g t caact aat t t aaat t t at t agaat t cgt a ccaat cct ag aat aat aagt caagt aacca t t t aaaacgt t t agat acac aagat aat t a ttaaagaaac aaaaact ct t ccact t t t ct t caat t t cga cat ggt t cca accat cat at aaaacatttt ct cgt ccact cagagt aaaa ct aact ct cc ggct t acgga t t at aat t t c at gagt t t at aact t cagat cgcgt cct t a t gt t cct t aa t t t cct cat c t ccgagt aac agt t t t aagt at accaccat cat t t agagt t gat at t ct t t gcgat at ga ggcgtgtct t gaacaccgaa cat at ct t t g t at t t gaacc aagaagaaac acaaaaaaaa accgact t t a at aaaaaaca t agt aact t t t agt t ggt t t t t acggt aaa t t t cgaat at t t act acat a t gt cct at t t t t gat cat gg aacgt t t aca at ggt caaat t aat at t aga t ct cgt gcgt t cat aaat gc tt ct cagagc tttccgaaga t gt t aaact g ct t t cgaat g gt t agt gat g t cat gat at c t acgt acct g t ct t at cat t t t gccat t at aat aat t t ac aggaaaaaag gt ccct cccg at at at at cc aaat t t caag at t cat t at c caact aat ct t aat t caat g gt aaagt t gt t t gat cat at aacat caaac at t acgt t gt t gagacat t c cact aaact c taaaaacagc ct at t t gcat t gt gt aaaaa t t t gccct ct t t t att tcat 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> <211> <212> <213> 337 2000 DNA Arabidopsis thal i ana <400> 337 t ggt t t gcaa aat gct aaga t ccacat t aa at gaaat gct gt t t gagaca aat gaaact c t acacct gat gaaaaagcaa at gcat t t at at at t gat gc cat agt aagt gt t gct gcaa at gt t act t g aaaagt t at a acacaat gag t at t t gacac aat ccgacat aat gaaact t act t at aagg aaat gt gaat t gt at aaat t at gagt cgcc aaaacaaaca at ggt aact g t aat gcat t t t ggt aacaaa ccct cagaac ct gt gt cct g ggat t gt aaa cagaagat aa aggcaagacg at t ggt at gt t gt at aagt t taagagcaca gct t t aaaat t gagat gaaa caagcgct ga t at t cagt at ggt aaagaca t gat acact c ggacacactt accct ct gct t gat at t t gc t cat gt ggt a t gat at cat t aagggt t t cc aat gct act c tgagacaaaa ct ct t ccaca t caat gaaat t gt gaat gct t aagaagt aa agagcacaat ct agggt t ct at gat ccaag acct t gacaa gct ccaacgg cact ct agt g t gat aaagca gaacat t ct a caaat ggcca at aaggt aac t gt gaat gct t t agaat gaa act acact ct t t acct gcat aggcaaat t a t act gagt t a aagagaat ct aat agt aacg aaagt ct ct t t at t agct aa gat ccact gg agggt ct t ga ggat t at gac act t t t gt t a agct cact ct t t at aat t cc ct t t gt at aa t ccacat t aa aagcaacggt cgacct aaag acaat t agca ct gt ggt gat gt at ct t gt a gt cat gt cca gcgtgcaacg t accct tt ag agagt t at ac t at gcat t ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tcacagagaa cggaagatga tagtaatcaa ataagatcga tgcatcattg attcacccag Page 368 12689250 Sequence Listing.txt aaat acaat g aat caagt t a t at at cgagc ct cggt cacc accagagaga ggagaccgag gaagagtcgc ccat cgcct t t t ct t cact t ggct t gat ca cact t at t gg taacgacaaa gat t t cgt ca cct ct ct ct t at ccct ct ct aacct t aaga at ct cggcat gaat ct ct ga agaact agt t acat aact ga aagt at t caa aaacagcaaa tttaggcgaa aatggcgaga gt t acgat at ccccat t t ct t cagaaaat t gagagagaga gccgt aagt a gt ccagat ct at cagat caa ct t ct t cttg ct ct gat cct gt t gt ct gat t t t ctaggag t t gctgagaa t cagt t cat g ttgcacaacc act t gt caca t gact aaat c aaat gt gagg gagcat aaga gcacaat ggt cccat ct ct c tccccgagaa gagat t ct ca t at at t at aa acgcgcgt at aact t ct cca t gaccact ct t t t t gt t t ct t t t ggcgat t gt gaat t t t g cct t cagaga acat t acaaa aat caaagga agagaccaaa gaact t acga gaacaat gaa aat acacaag t ccccact t c aacaaaact g agccgcat gt agcccaat aa cgt gt gt agt act ct t t caa t cct gt ct gt ttctgat t t a t ct gat t t gt at t t t t t t gg at acat gaaa t caat t t cat gt acaaccca gccaacaaat gaacaaaat c gacat t gagt aacat cggag cgt cccgct a aaaaaaaact at cact t t t g ct t t gt t gac at aaact gt c cat gt accgg t at ct at ct a t cact ggt t a t t gt at cgct t gtt gtt gt t ccgat cggat at t aaacaat t t cgt at ct a caat accgt a tggcgaaaaa aagagagaga at cagcaat c t gct t t ccct t gaat t t t cc taaacggacc acgt ggct at at cact ccac cgt gcat cca ggagat ct ct ct ct t t gaaa gt t t gat cgg ttgagt t t t t 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 338 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 338 gttagaggca agtagtgaag aagaagtgag tgatgaagaa act atacatt tgaagggtct cactagtttt ctttctactt acttgttctg tatttctaaa acat t agaga ataaataaat tttgacatcg gaaccgcata ctctatttat aagtacattt tctttgt t ga tatagtaat t ataatcttat actttgattg gtgcaactat taatttaact tcatgatcaa aagatatagt tagcattatt gtctaccgag agtaatacca gaataagcta aataacggac ctaaatacaa aaaaatttaa aagagtaagc taaacaaccg ccacataaat aacaaacaca tgaaaaacca agt agat gt a tttcccaaaa at cgt agact ccgt t ct t aa acct aaat ca gt gt acat t t agt ct t at at ctt gt t t t t t caaat t gat a gt ggt cct t a t act gacct a at agaagt gt gct gt t ct t g t ggt ccgat g at t t cat at t catgggaaag t act aacat g act t ggt t ct cct t aat t t a gt t t t gt t ga gt t t ct ct gc caat t gcaat aat aacaaac ct gt t acaat at caacgt ct t cat t gcct a ct cct t t t ga tttttttttt agcaagt ct t t t gat aat t t act t t at cct gaagt t t cat at t t t t aaga t t gcaaat ga acat gaaaaa t acaaagt cc 120 180 240 300 360 420 480 540 600 660 720 Page 369 12689250 Sequence Listing.txt acgtacatgc cacgaggtaa tgaataagtc taattgcaac aaaataaata at at aaat aa t gt gacaaga t gt gt cct at aact cct ccc gat acgct t a gat agt at ac caagat ct t a ct t ct at at t acat at gat a t aaccat t cg caaat gat ct t t t t gat aag agacccgaaa t aggccat at at acaaaaca ttcaaaaaaa aat gaagaga gt aaggat at cat t at t aga ttat t gcttc ccgat t ct t c tttttcaaaa gt aagact ct cat t t ct aag t cgt ggt gca t cagcgat at t t agct agt t t t ggct t ct a t t cat t acat t aact gagt a t gat t gt t gt acaact aaaa t aagt agaat act act cacc t acat agt aa aaat agt gaa t t gt ggt t at gat t t t act t caacct t t gt aagcccatt a act ct t at at t att agccgc t at t gt t t t t at aaacgaca ttgagccacg ccat caaat a cagt agat gt t ct cccccat cat at gaagc at aat at t t t agt aagaaat ct cat at t t a caacat act c t ct at t gaac at gt t t at t t caact aat t t aat t ggat ct aagaacgaat acat aaaat a aat aagt aat aaaat ggaca aaaaact ct c at t t t t t t cg gct at t t t at aaggt caacg at cgt caat c t gaat caact gaaat cgat a aat cat t act acgt acaagt t acgt at aac t at at at aaa caccaat at a ctgatgagag t agagat t t a caact aaaag ct agct cat t t aat caat t g aat acat ct g t gt t aat t ac aggcccattt gt ct t cacaa t t at ct t t t g cgt ggt gcgc t t t ccgat gc cact cgt t aa ct t caggt t t gagct t cct c cat gt agcaa aagt caaat g aat t aagt ca acaaaaaat g t acaaaat ac t t gagcat aa at gat ggat t aat gcaagt t t t ccgat t aa cat t t ct t at gcct gat gaa caat t cat ag aaacct aggt at ct cacat c at at t gt t t c gact caaat a cat caaat aa at accaccaa t at gt gt gat caat gat at c ccaaggt at t act t gt t aat acat gacat g aat gacat ag aagt t t t aag gt acact t t t ct ct at t ct t ggacagtttt ggt t agt t t a t t aact t at t aaaaaaaagt acaaaaat ga t ct t gat ggg t t cgcagggt t ct t ct gcag 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 339 <211> 2000 <212> DNA <213> Arabidopsis thaliana <400> 339 tctagatggg tgaattttat aagtttatta gtctatatat at at actgta tattgcgtgc atctttttta tttatttatt t ctttttt gg ctattaaaat gatttaaata gatacatata at t gt t ggaa ttataatctg aaatcgtacc aaatatccta ataataagag ttttgggcaa aagtgtatga actcaacatt t tcgagaat t ttgtaacgga tcgtaattac cacacttacc aatatgttgt tatataattt tattttttta cat gt aat at ct t t gt aat a t cagacat t t t act t aat t g cgaaaat t ca cat gt t t at c t t agt cgcca at t ct ct gca aagt gt at t t Page 37( t acat cat gg t at agaat t c t t t t aaaat t gcaat ccat a ggtacgaggc aagaaaatt g t t t gccat at t t agct ggt a gt t act act t aagt t gat at aat aaaacct cgt t t t t caa aaat agt t ga at at t aaat t aagt t gt at g at aat t caac gt t cagat cg act agt at at 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gtatatcatt ggtgtcaaaa ccatatagtt ccacaatata gagattttgt cttcatgtca t cagt t gaaa t t cacat t gg aggtcaagag gt t at t aat t agt ct accat aaagaaaaaa ct cagt t gct cact ggt gat t agact t t ca at ccagt t t a t t t at t t aat t t t t t caat a acaaaaaccg at t at t aat a acacat agt a gacaacgt aa at aaat t t t c gt aaaaaat c tttaaaaggc gcat t t t gt t t ccacct gac t t t t t gt t t g cact ct cact t ct acgat t c <210> 340 <211> 28 <212> DNA <213> Arabi <400> 340 ctct t t attt <210> 341 <211> 28 <212> DNA <213> Arabi <400> 341 cct t t gat gg gtt gaaccac caccaagaga cgaaaat t cc aaaaat at at aagct aaagc ct ct t ct cag agcct t gt t a gaagaat t cc ct caaaat t t aat aagcaaa t agt t agat t acat acaat g ct aagat t at t t t acat gt t aat t gat gcg act t t at aat t gaaat t gt t t t t gtgattt at aaaat gt a t ct t gt caat t t gt ct gt gt t t t t ggt t gt gt t t ct cat c at t cgt t ct a act t act t t t t gct at at t g t agacat at a t t aat t cact cact aagt t t ttggt t t t t c gcct t caaag ggtgagacac t gagat t t t c acaaat gt ag gt aat at t gc cgt gagt t ca gacccat gca at t caacat a t acaaaat ga aacacaat aa t t cgt at aaa t t t t t t t t gt t aaacagat a aggt t t t at t ct gt t gcgt t t gcgagt t t g t ct t cgt ct t t ct ggt gcaa ct t t t t at t c cct t t t t t t a aaagagaaat acacct cat a t ct t t t caag aaagct gct a ct t t at at ag ggaaattttt t aaaccact a aaat t t t aaa aat gt t gat g cat t aat aca caat gaaaat at gt aaat t g aat agt aat c aaaaat t t aa aaaaagacgt aaaaaacat a t t ccaacct g cact t t cat t t gact at aaa ct ccgcct cc aagt aagat a acaaat gt aa aaaaaaagt c t t t gt cacaa gacaaact t a agt cgaagt c gt ggt gt aag t cgt ggct gg gactagagag t t gaaaaagt at t at ct aaa t t at t gcaat t t t atgt t ga aaaat ggt aa att acagaca aaaat t at ac acagt aaaac agt agaat t t t aaagat t t c cat ct gccaa ggaat ct cag acct ct ccca t t caat ct ca agaaaaacat at t gt t t t t c t at cacat aa aat aaaaagc gaacct gaaa t caaact ct t gt ct t t gt t a gt t t aat gaa gt t acaaat g gt t at t t cgg t gt t cgt t at aacaacact a gaat agt ct g aaat gt at t t t aacagaaat aat t gcacga at aat t aaat acaaat t aaa t t aat at t t t agcacccaac at t t t t t at t ct t ggt t ct t ct ccgat ct c 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 dopsis thali ana gtcgtgactc gcgaaccc dopsis thali ana aatgttttga ggggagag Page 371 12689250 Sequence Listing.txt 00 O 00 O (N) O tq OO in O- t^ <210> <211> <212> <213> 342 27 DNA Ar abi <400> 342 act t ccacca <210> <211> <212> <213> 343 28 DNA Ar abi dopsis thal i ana gaaaaggcga aaccaag dopsis thal i ana tggttgagag ggcaaagg <400> 343 t t at t gt t t g <210> 344 <211> 28 <212> DNA <213> Arabi dopsi s thal i ana <400> 344 cggatggttg aggtagtatg agtgaccg <210> <211> <212> <213> 345 DNA Ar abi dopsis thali ana <400> 345 t ggggt t t aa <210> <211> <212> <213> 346 28 DNA Ar abi aagaaagaaa aagtgtgtgc dopsis thali ana tgtcctactg tctccttc <400> 346 t t ccct ccaa <210>347 <211> <212> DNA <213> Arabi <400> 347 gct gt at t gt <210> <211> <212> <213> 348 28 DNA Ar abi dopsis thali ana gctttgacta cttcttgttg dopsis thali ana tctgaatctg tgcgtgtc <400> 348 ttgct t gttt <210> <211> <212> <213> 349 29 DNA Ar abi dopsis thali ana <400> 349 tgatgcagt g <210> 350 <211> 28 <212> DNA agtgatcaag agaaagtga Page 372 00 00 12689250 Sequence Li st ing. txt iana <21 3> Ar abi dops is t hai <400> 350 at aaacggt g <210O> <211 <21 2> <21 3> 351 DNA Ar abi at gt t aat gg gcccaaag dopsi s t hal i ana acaagat t ga agt at t gagg <400> 351 t caacgggt t <210O> 352 <21 1> <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 352 ccaat acat t <210O> <211 <21 2> <21 3> 353 DNA Ar abi <400> 353 t cgt t gt t ga cgaacacgt g at t gt t cgt t dopsi s t hal i ana ct tctt t tat ct at ctgt ga dopsi s t hal i ana at cgggt at t t aat agtt c <210O> <211 <21 2> <21 3> 354 29 DNA Ar abi <400> 354 ccc aaaaag a <210O> <211 <212> <213> 355 28 DNA Ar abi dopsi s t hal i ana <400> 355 t gt t t gt gt a <210O> <211 <212> <213> 356 28 DNA Ar abi gt t t aaaagc t caagaag dopsi s t hal i ana gacat t ggt t t ccagat t <400> 356 cat t t t gaat <210O> <211 <21 2> <21 3> 357 28 DNA Ar abi dopsi s t hal i ana <400> 3 t gt t t gt <210O> <211 <21 2> <21 3> ~57 gt a gt t taaaagc t caagaag 358 1158 DNA Ar abi dopsi s t hal i ana <400> 358 Page 373 12689250 Sequence Listing.txt atggctttca caaaaatctc cttagtcctt cttctctgcc tcttaggttt act gt caagt t gt ggt accg ggaaccccga at caaccaag aacgccgct a at gt t t gct c ct t t at at ac gact cat t cg t aaaacgt t t aagt at t t t a agt aagaaat t ct t ct gt t t ggagcaacac ggct act t cg caaagt ct cg gct t t cagga tttggagcca gt caacgcaa cct aacct t a ct caaaact g acgat gcat a ccggagggt c ct ggt aat gg at act t t ccc at t t cact ca t caaaaat aa aagt t t at aa t gcat t at t t aggaaaacaa gacat t t gac cat gt gcat c gt aact act g gt cgt ggt cc gt ct t gacct cgggt t t gt g ccat t agagc ggat t ggat a gt t gct aa cggt t gcgct ct gcggt gt t ggt cggt agc ttgcgcgggg caact t t gcc cgagaccgga at t aat at t t ct t t agt aaa caaaacaaat t t ggt t at aa t t gat ct caa t ccat acgt a ccagagcagc gat ccaact a t ct acgccag gt t t t ggat g t at t aat gga ct at agagac ccaaacct ct ggat gccgat at t gt gacac aaaagat t ct aat t ct gt t a cgt aagat t t t gt gt ct gt g ct aaacgaat gaggat t at a gaccat at aa gt aact aaga t t agat t t ct aacacacaat t cat ggaact cccgaact t g aat agcgt aa at ggaat gt a t at t gt ggac gt t gcagt ca caggt cct t g aaggt t t ct t acacccgt ga ccagacgt ga at gt ct t t ct t at acat ct a gat t ggagga t t at at t at t gat t aaacct t gat cct aat gct acat aga acccat gt gc acaact acgg t gggt agcaa ggccggt t ct acggt ggt aa agct t ggt gt ct t t t ct gaa gt t cggt t ac t agaggt agt t aacaat at t ct ct t t cgt t aat t gct acc t ct ct t ct at ct t t agt cat t t t gat at t t at aacacgt t t t gat at t gt cct aaat aac agagat t aac accgggaaaa agcgt gt ggt cccaact gt a gaaccaaggg t t ccggt gca ggaccct ggt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1158 <210> <211> <212> <213> 359 1127 DNA Arabi dopsi s t hal i ana <400> 359 at ggcgacac at cat gacag t gct gcagca agt ggacct t aat gct ggt c at gagt aaag gcagccgct g at gt t ggct c t t ct t t t t at aat t t at at a aaaat gcgat gaaccgcgtt ggt ggggat a gcaact ct aa ct cgt ggt ac tcggaagcgg aat cgt t cgc agt t ct ct ca gt at cagaaa aat at ct gat t ct gaaaaaa ttcacaaaac t t gt ggt acc acct aaacct cat agcaagc ttgcccagcg agcct acaaa cgaat ct gga at gt ct aaaa t t at cagt t t gct ct cat ca t gt ggt acaa acaaaagcct act ccgact c gt t at cacac aaagggtttt ggaact gt t g agt aagt t ca aggaat t at t gcat t aaaaa t t t t cct ct t acgggt gt aa act gt ggcac caagcggt ag cagcgt t ct t acact cgcca ct aagcgt ga ccgt t t act a aaagt at t at ct t t at t t t g t act ct aacc aggcaacat g t gggt gccag t ggcggt ct a caacagcat c ggct t t cat c gat t gccgcc cat t ct at t a cat gt t t aaa at aat t gaag 120 180 240 300 360 420 480 540 600 660 aaaaatatag agacaggaat attttcctca taagaaataa ggatttgcta gctatatata Page 374 t t accaaaat ggt t at gt aa gcacaacat a cat ggaact a cagat at ggt agaat gt gcg gt gaat gt aa t ct gcaagaa gt gt t ccaag aggt t t t t gt t cct t gt caa caact at ggt ggct cgt agc t ccggt t t t g t ggt gggcgt gct t ggggt c 12689250 Sequence aat gt gt ct a aaacaagt t t tacaaagaag aaatagccag cccggaaaga actactatgg gcagccggaa agt t cct t gg ccaact gt gg ct t t ccagt g agccaagggt ttggtgcaac ccagctgcag tgcagagcag act cct ggaa ccaat ct ct c Li st i ng. t xt aaaat aaagt aggaaggt ac t cgcggt ccg act t cct ct c t gccat gt gg cacgaggagg ggt t aaccat at gt t ga t t at ct gt t t t gct caccaa at ccaaat ca ttgaaggacc ttttggaaca at caacggt g t act t ggact 720 780 840 900 960 1020 1080 1127 <210> <211> <212> <213> 360 2494 DNA Arabi dopsi s t hal i ana <400> 360 at gt ct t ggg at cat ct act ggct ccgt t g t t t t acagat acat aat cag at cgt t gagt t ggacct t ac gat ct ccggt t cccat acct t ggcact gga cggaggaact cggaggaact cact ggaacc aaccggcact t ggt accggt at gggacgat aggt at t caa acacggt aaa t caaggt t t c aat t t t t t gg ggt at gaaac acgat ggat c caat ccaggt gt cccaaat c at aat gt gt a t t cact t t gg acgcaagaag ggt accaaat t t cct t ggt a act ggaggaa act ggcact g ggcact ggaa ggcact ggaa ggcaccggag gggaccggca t caggt gct c ggat ccgat t t acgt caagt caacaaagt a t aact at at t gt agt t t gt g cgt cat gct a acacgcaaag cacgt acgac t gct gaggt t at t t t t t t gt ct ccggacga t t at t acggc ccggt t t cca ccagcggcaa ct ggcaccgg gcaccggaac ct ggcaccgg ccggcact gg gaact ggcac ct gggaccgg agaaat t gga acgat ggcgt t cgact at gt gacaaaat cc t t at acaaag at t aaccat c ggaat ccagt gt gaagaaag ggagccaccg t gt ggaaaaa t aagt t aat t gt acat aacg gt t gact t t c gat t t ct gct t gt t ct caac aggaact ggc t ggcact gga t act ggagga gact ggcact tggaaccggc caccggagga agcacaaggc gaccaagat a gaaaggt gga aagagaggt a at t at at t ac cggat gagt a tcaaaacgaa t gcagct aac ct ct t caat c aacacact aa aact t t aat g gccct t t ct g acaaccaaca ccggaagcca accat cgacg accggaact g act ggcact g act ggcact g ggaaccggca accggaggaa act ggcact g aacagt acag t acgcaagct gt aacaaaac agct t acgt a t gacat t gt g t ct t gt t t ct ct t gaacact t ct t caagt t ct t cgat gag ccagct t cgc gat cact t at t cat at acat ct t acggcaa agact t ct t a ccggt aaaca t t cact at t c gcaccggaac gaact ggcac gaact ggcac ct gggact gg ct ggcact gg gaaccggaac gaggt acat c acggt gggga aaggagt t ct caact t t ct t aacccttttt gt t gagggt t t at gaggt ca caagacaaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 gcatctatcc tttcgaacct agtactgata ctaaatttac Page 375 12689250 Sequence Listing.txt agatcattgg atttcacggt ttcgcgggca accatgttaa ttccattgga t t ccaaaat c gtgagacagg gacaaggcca tttttggaga agaccaact a t ct at gt agt aaaaact t t g acagct ggac aagat cact g gct ccat cag ggt agt gagg gtaggacaag gt t gt agaag at t agt t ct t ct t t t t t gt g agt t gat ggc cacgact aat caagaaggac at t t ggagt c ct ccaccact agct gt at gg agat ggt gt t t gaacgt gga t t t t t at t gg t cgagct t ga gagt t gat ac cgt t t gggat ggt t ccacgg gcaccacccc caggaacttt cccaagat gg gaaaagaaca gct t t ct ct t ct t ct t gaat acct acgat g aagcgagt at ggt caaaaga cat gt cgcgc cct gt t cct t gacgatggt t gcagct gt ca acgagaacat t t t ccct t t t at cagacgaa t gt agt gaca t gt at ct ggc acgagctggt t t t gact cct at gggat gat tat t ggagcg tggaaaaccc t cgt cgt t t a t gcagt t t ga caat ct t t gg ct at t cct t t t agt t ggt t t caat aaccaa caaccccttt ct cacgat ga agt t t gaat a t gct aggat t caact aat t t t at at aacat acgct gat t t acaaagt t t g gagt at gt t a gcaacacaat ggt gct t t cg gt ct cgt t t g acat t gct t g accgt at agt gct t gact at gaat gaacca t ggaat agga ccat ggaaga at aa gaagct gacc t gt t aagaaa caaaaat ggt cgaagaggt a cact at t cat ccgtggaagg t caagact ag aat t caagaa at gct at t gg cccaaaagct acggt gt aag t gt acgacaa gat t t gaaga t t at t at cct ccaagt gaat at cgt aaat a gctggcacag gct ggt gat c gct t act t cg gcagaaggt g gt t t at gt ag t ct caggt t g t gcacgct t a t at gcat t t a at act acgaa caagaacaaa ggaaggt t ac agct t at t t a agaaggagct gaaagt gt ct agcaggtcag ggt at act t t caat t t t gt t acat cacggc t gct t aggt t cct t cgaat t t act t cacaa 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2494 <210> 361 <211> 1561 <212> DNA <213> Arabidopsis thaliana <400> 361 atggagaaga at act t ctca aaccatcttc tcatgtgtct ctgctcagct ccggacaggt accat t gtac gtaacgctgt ccgtcagaaa actcttcgcc tcttcttcca cgattgcttc ttttcgtaca at ataagaag tttatttct g tagagtttac gatgtttatc ttctatgtat t gaat gt at a gggat gt gat gcgt caat aa cagatgacat gtcattggcc ggagacggat ttgatagcaa tcccaat t gc cgcaacaaag ctcgtgaagt cgtcgttttg gtacactaat t ccaact t t t tt ct accaga ttccagcaga gt t cgt gt ac at cgaaat gt cat t t t caaa t gat agcat c t cgacacggt t ct cat gt gc at at at at at t t ct t ct cct act cat gt cc ct t t cgt t ac gt t acaat t t at ccccaaaa t aaaacat t a accat cggag ggtgaaggcg t gacat t t t g at at at at at t ct cct t t ct gaacgtggaa cgct ccggcc t ct t t t gat a cat ct agat t t ct t t acgt a agagaccat c aagcaagccg gctct cgcca at at at at at 120 180 240 300 360 420 480 540 600 660 atatatatat atattttaca tcacttagtt attcctctat t at t cct t t a at acat at t a Page 376 12689250 Sequence Listing.txt tataatgtcc gaggtattga aaattcgttt tttctacagt atatatcgtt t t gt gcgaaa gt gacact t t cat at agt aa ggaggaccaa agcgt t caaa agccgt cacg aacct at t at acgt t at aat aat at acaat t cagct t aag gaccagt cca gt t cacgt ca t gccaat agt ggt t ggt gt t <210> 362 aagct aat ca t t at t ggcgg t ct gat caga gct acccggt gccaat t gcc gcct ct ct ca at t t t t ct ac tcaggagcac t t t agccct a caaat gt gcc cgt act t t cg gat caaat ct gaaggagctt t t gact ggt a t at cat t aaa t t t gt aat t t t aat aacaat ggagct aggg tcagcccgag aaccgat at g tttaaccaaa acact at agg caacacgt at cgat cggt gt at aat gct t a t gt t cacaga ttagacaagc at gct ggt ga t at aat acaa aagacaagt a agt aat ct ga aggagagat g t t t aacct aa at t gccct ct t t ct t at ggt at t t gcacat cgacccgagt cgacgt aaga t t t caagaat t caacggt ca t t t cat caca gat t cgaagg gt acaagt aa caagt aacaa ct aaat aaaa gcagaat ct c accagct caa caggt accaa at agt at t aa t gt ggaaaaa at aaaccgt g at cgcaat ca ct ccaacaag agat ct acag gcgat cacga gat t gt t cac t t cagt t t t c caact acaca ct acacaggt at t gcagacc cacgaaggcc cggcat gt t t aaccgaactt t gt t at t gt g t gt caaagag gat acgt ggt acat ggat cc gaaagggttt t t aat t cgt t agt t aggt cg gt gt caat t a 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 <211> <212> <213> 1341 DNA Arabi dopsi s t hal i ana <400> 362 at ggagacga ccaact ccaa gt t t acacag tcccacaaac agaat caccg gt caat aact t t gct t cct c ct cgt gaagg aaaat cgcgg g gagagaacg ccagccaat g acaaagagat gaagaaacag t t cgt cgcat t t gcggt cca t gacgat gaa at aat ct cca t agcct t t ct t caagact t c gagt aaccgt gt cct ct cac t agat gt t gt caact t act t acgcagcct c at gccgct gc aggct t t t aa t t gt gt t cga tagaccaacc cct caaagac agat acct t c ggt t gaaacc aact ct ccaa ct t ct acacc t ct gt ct gaa cgat t gt acc t gaat t t ct c agacaaccca cggat gcgga cat ct cgact gat ggaaagt gct acct gcg agct t ct aag t acgcgggt g aact act t gt cct t ct gcaa at t agt aaag ct ct caat t t aaaaat gat t accct gacca gat gaaggag aagt gccct g t acgt ggct g ggcat ggcca t t cat t cggt cct gt at t t g gacgaacaag gt ggaagat c gagagcgt t a gat cacaaag gaaagct ct a aaat cat aaa acgat cacat t gat ct ct ca agt t ct accc ct at ct t t gt at t t cgacgc ct gccacgt g t aggaat ct g cct gggccgc ct ggt gcgaa ccggcaagag tcaggaccaa ct gcgct cat t gct ggt cca t cggaaat ct gccat ct t cg cct t cct cca agaacacact t ct t gccgga cgat gct cgt cct ccaacag gcct t t gct g cat cact cac cacggct cga t t t ct accca aagcagcatt agccgct agt ct ggaaat gc gct t gct aac cat gt t ct ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 377 t ct gt ggt ct ttacgaaaaa t cat ct t cca agct at gaga gct agct t t g aact t ggcca cct gaagaga aaccct agt g t gagt at t gg aaaaggagga t gat cggt t c ct cat gaacc gat gggat t c t gt t gat aga acat gt cgt c t ct t ggt t t a 12689250 Sequence tcgaggaggg gaagttaaaa gt t aggaact gt gat cct ag gaaactagca aat ct gat gc ct at accgt g agt agct ggt tccggtttgg gttgttggga ttccaaggac ggacaaggaa cttcgagcag aacccagagt Li st i ng. t xt ttgaagaggc acgagggt gg t caccaact a gt aagct acc at gt gt cccc t t gaagcgt t t gct cgcct t t t gcact aac caacaccagt caccggt t cc tgccacagcc aggt gcagcc act t t cct t t cgt t agagac gt cat ct gac t t ct cggt t g t ct t t acgag cgt gt t aggc cgt cacact g t gct accat g at t cacaagc gact cagcct t t ct ct acca cact gggt t g cacaat ccct caacat t cct <210> 363 <211> 387 <212> DNA <213> Arabi dopsi s tha i ana <400> 363 at ggct cct c t caagaact c ct t t gt aaca ttcttcacca gcct at ccgc t cat cgt cac ccagct t t aa cattcccacc tctgcccaaa actccaggac aacaaacgtt gccacagcca ccaccgatgc cgagcacaca gataccatca aacattccac agatcaactt ccctagcaac ttccttactc caccaccttc aaagtaa <210> 364 <211> 336 <212> DNA <213> Arabi dopsi s thai i ana <400> 364 at ggct ccaa gaacccccct t gcact ct t c acct ct gcaa ccacagggac t t gt cct aaa gt gct caat c t agt ggacct aacat t ggga at ccaaggct t ggct gacct tgaggccgcg at t ct t ggaa t t gt caat at t aacct t cct agtaggaatg ctccaaagag tttccagtgc 960 1020 1080 1140 1200 1260 1320 1341 120 180 240 300 360 387 t ct ct ggt ca ct t ct ccaat accact at gc caaccaact c ttgcccaacc t t t ccct t ca gt t t ct ct ca acct cct ct t ct t cact t ac aattccatag agatcggtac ttgtgtcact aacccacctg t aaagccat g t t gct cgct c gt ct gcct t t gcact gcagt caaggctagc at caat ct ca gcgt act cct caat gt t t gt gcgt aa 120 180 240 300 336 120 180 <210> 365 <211> 802 <212> DNA <213> Arabi dopsi s tha i ana <400> 365 at ggccagag agaagat t gt ggt ggct ggt ggt accacaa agagct ggaa act act ct t g gggctgagaa tatttgcatt catggctact ttagctgcag ccattgtaat gtcactaaac aaagagacaa agacct t ggt t gt ggccacc at t ggt act g t t cct at t aa agccact t t a Page 378 accgct aagt gat t cgat ac caggt t ct t t t cagat t t t c t gacat ggt a t t t gaacgt t aaagct aaac agggaagaac ct gt gat cac cgt gt ccgcc caccacaacc ttcagcacac at at t gaaag gt t at agct a agccggaaac t aaacat t gg tggaaaaaaa gcaacact ag ggaaacaagc ggcgcaggag gt ct ct at t t t ccgt cgt ct 12689250 Sequence accggctttt gtgtaagcga caaaatgttt ttgagttatt atgtaatggt gagct t ccac t ggagt acaa aggt ct t cgt ttttcaaatc aatttactgg gctaaacggt ttct t gataa tatctgcggc tgcaaacgcg acgccaagtg gaacaaagtc caatcatcgc agcattcgcc cccgcctctt aatcaat t ct aa Li st i ng. txt gt aaaaccct aacat t gat t aact t gt t ga ct cct ct ct a t t t gatt t ca t ct aaaaaaa gcggt gt t cg tgcgacaggt ggagt cat t c aaaaact t ct at acgaaat c ct t ggt t t t a t gat t gt ggt t cgccat t ct at t gaatt ag cgt t ggaaaa tggcagagct t caccact t a t aat gct cct ccaccaccgc 240 300 360 420 480 540 600 660 720 780 802 <210> 366 <211> 926 <212> DNA <213> Arabidopsis thaliana <400> 366 atggcgaaag agtccaccac catcgacgtc agccatgtcg taaaggacgc gaagaagaag gccaagagag gtttggctat attcgatttc at t ggggctg cctctgtcat gtacaccgcc ctccagttcc aagccggtta cgatgacctt cataattact tattgacata atgtttacaa accatataga tact t ctaat ttgt t atagt tattagtaag acaagtcatg ttccggttat aatatactcg ttttttgaag gtactttgtg gtcctttcac ttccattctc catcgtatcc ct gat cct cc t cat t t gcga t act gt aact ataatttgat ataactcaca ctcttttatg cgctcaacac atcagcagca gcagcggcag accaaagcac caactggctc cctatctgtc gcaccgcggt tgtggctgat tctatcgcga cagccatcgc cctcaagagg cattga <210> 367 <211> 28 <212> DNA <213> Arabi dopsi s thal i ana <400> 367 ggcgagccaa ggct t t gt gg ct cct ccgt t gaggaaact c cct gcgt t t c at t t aat gca t t t ctt gaaa at cat t gt gc at agccgt ag at t gt ccgt c t acct t at aa t t at gat t gt cat caat cac agcagt t t gg ttctct t ct t gcact gt t ac cagt cgcct c tggcggccat t t ccct t ct t agt acgt ccc t at t t t at at t caaat at t t ggt t agt aac ccgt agt cgc cacat gct gt ct ct at t ct t at at at aaag ct acct t gca agact t ct gc cat cgt t ct t caaaagt t ca aagaggtggt agcagt cact t act cagt t c t t cct at ct t ct t t aagt ga aaat t cgagt ct aat gaaat t agct at ct c cgcgccccgg cat t t ct t t c ct ggt cgt ga cacaacggca cagaacgt t a at cat cat ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 926 Page 379 t cagt at ct g <210> 368 <211> <212> DNA <213> Arabi <400> 368 cct aacgcac <210> 369 <211> 28 <212> DNA <213> Arabi <400> 369 actt ctt aag <210> 370 <211> <212> DNA <213> Arabi <400> 370 caacagacaa <210> 371 <211> 28 <212> DNA <213> Arabi <400> 371 cacat at caa <210> 372 <211> 28 <212> DNA <213> Arabi <400> 372 ccacacaaac <210> 373 <211> 28 <212> DNA <213> Arabi <400> 373 at ct t cgat g <210> 374 <211> 28 <212> DNA <213> Arabi <400> 374 ttgagagaaa <210> 375 <211> 28 <212> DNA <213> Arabi <400> 375 gaagggtttt <210> 376 aacccgcct t dopsis thal acaagt t aaa dopsis thal cgcaat cact dopsis thal gaaacagaat dopsis thal cacat gacca dopsis thal caaaagaaaa dopsis thal gat gt gt t t g dopsis thal ccct agaaag dopsis thal gggct ct acg 12689250 Sequence Listing.txt gggt at t c iana cat t caacag iana gaccat gc iana ttgaaaaccc iana act t gccg iana caccaat c iana ct t ct ccc iana aggcggag iana tgtgaaag Page 380 12689250 Sequence List ing. txt <211 <212> <213> 29 DNA Ar abi dopsi s t hal i ana t t tat gtgcc t gtat gagc <400> 316 t ct caat ct g <210O> 377 <21 1> 28 <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 377 at gt gt gt ag <210O> <211 <21 2> <21 3> 378 28 DNA Ar abi cgaaaaccaa t gacaacg dopsi s t hal i ana ct t gaggat c t gagaggc <400> 378 t t agggt t t t <210O> <211 <21 2> <21 3> 379 28 DNA Ar abi dopsi s t hal i ana <400> 379 gggact ggcc <210O> <211 <21 2> <21 3> 380 28 DNA Ar abi at agacgagt t at t cagg dopsi s t hal i ana aat t t gcgaa t gaat gag <400> 380 at ggt caacg <210O> <211 <21 2> <21 3> 381 28 DNA Ar abi dopsi s t hal i ana <400> 381 t t tgt cacca <210O> <211 <21 2> <21 3> 382 29 DNA Ar abi aaat cagaca ggcaaagc dopsi s t hal i ana aagaaagat g atct cgccg <400> 382 ggaacaaaat <210O> <211 <21 2> <21 3> 383 28 DNA Ar abi dopsi s t hal i ana <400> 383 t gat ggt cac <210O> <211 <212> <213> 384 28 DNA Ar abi t gct ggagaa aat at ggg dopsi s t hal i ana Page 381 12689250 Sequence Li st ing. txt t tcgtt t tcg t ct tctcc <400> 384 ct t cggaact <210O> 385 <21 1> <21 2> DNA <21 3> Ar abi <400> 385 t t aagt gat g <210O> 386 <21 1> 28 <21 2> DNA <21 3> Ar abi <400> 386 caaaaagaaa <210> 387 <211> 29 <212> DNA <213> Ar abi <400> 387 t gt ggagat c <210> 388 <211> 28 <212> DNA <213> Ar abi <400> 388 agt t tt ggat <210> 389 <211> 28 <212> DNA <213> Ar abi <400> 389 acccat t t gt <210> 390 <211> 26 <212> DNA <213> Ar abi <400> 390 gccgt t aacg <210> 391 <211> 26 <212> DNA <213> Ar abi <400> 391 gccgt t aacg <210> 392 <211> 28 <212> DNA <213> Ar abi <400> 392 ggcagat t t g dopsi s t hal t t t gcaact t dopsi s t hal aat t gaat ca dopsi s t hal agt gcct gat dopsi s t hal cggat cgtt t dopsi s t hal ct gccaacat dopsi s t hal at cggaggt t dopsi s t hal at cggaggt t dopsi s t hal gt ggt t caga iana t taat gcaac iana g caac Cac iana aaagat agc iana gaat ct gg iana ct ct t t t g iana t cagag iana t cagag iana aacagaag Page 382 12689250 Sequence List ing. txt <210O> 393 <21 1> 28 <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 393 aat t ct t at t <210O> <211 <21 2> <21 3> 394 29 DNA Ar abi <400> 394 cgaaaagtt t <210O> <211 <21 2> <21 3> 395 28 DNA Ar abi aggcgccaca at gcaagg dopsi s t hal i ana acccaat t cc caaaaat gc dopsi s t hal i ana ct tct gggga agaagaac dopsi s t hal i ana at ct tctgaa at t tgt tg <400> 395 t tt t tgaccc <210O> <211 <21 2> <21 3> 396 28 DNA Ar abi <400> 396 t cgt ct t cgg <210O> <211 <21 2> <21 3> 397 28 DNA Ar abi dopsi s t hal i ana <400> 397 aggaagcat c <210O> <211 <212> <213> 398 28 DNA Ar abi cat cgaat ag agagagcg dopsi s t hal i ana t gagagagat at cct gcg <400> 398 ct gat cccaa <210O> <211 <212> <213> 399 28 DNA Ar abi dopsi s t hal i ana <400> 399 ct tggaagca <210O> <211 <21 2> <21 3> 400 28 DNA Ar abi t t caagagag t cgt ggag dopsi s t hal i ana t ct ct t ccga t t ct cggc <400> 400 acct t at ccc <210O> 401 <21 1> 28 Page 383 12689250 Sequence Li st ing. txt <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 401 aagat t ct ga <210O> <211 <21 2> <21 3> 402 28 DNA Ar abi <400> 402 cgggat ct t c aagcaagt ga cgat gat g dopsi s t hal i ana ct tgt tct tt t tctct gc dopsi s t hal i ana aacaaaaact agaaaccagg <210O> <211 <21 2> <21 3> 403 DNA Ar abi <400> 403 t caccagaaa <210O> <211 <212> <213> 404 28 DNA Ar abi dopsi s t hal i ana <400> 404 t cgt ct t ct t <210O> <211 <21 2> <21 3> 405 28 DNA Ar abi ct tct tcgt g agggt gt g dopsi s t hal i ana t gggagggga act caaac <400> 405 t cagaat ct a <210O> <211 <21 2> <21 3> 406 29 DNA Ar abi dopsi s t hal i ana <400> 406 c aaac gag aa <210O> <211 <21 2> <21 3> 407 28 DNA Ar abi agagagagat cgat gaacg dopsi s t hal i ana at ccat at tc t t tgctt g <400> 407 t cgt gaaccc <210O> <211 <21 2> <21 3> 408 28 DNA Ar abi dopsi s t hal i ana <400> 408 ggagaaatt g t tct t tggct t cgt gat g <210O> 409 <21 1> 28 <212> DNA <21 3> Ar abi dops i s t hai i ana Page 384 <400> 409 t gat acgt t a <210> 410 <211> <212> DNA <213> Arabi <400> 410 acct at t cat <210> 411 <211> 28 <212> DNA <213> Arabi <400> 411 gt gaggt cat <210> 412 <211> 28 <212> DNA <213> Arabi <400> 412 ttttcttgcc <210> 413 <211> 28 <212> DNA <213> Arabi <400> 413 caactt aacg <210> 414 <211> 28 <212> DNA <213> Arabi <400> 414 tgatcggaga <210> 415 <211> 28 <212> DNA <213> Arabi <400> 415 t cggaat ct g <210> 416 <211> 29 <212> DNA <213> Arabi <400> 416 tgaaagagaa <210> 417 <211> 28 <212> DNA <213> Arabi <400> 417 t ct cccaaat cgcct t cct a dopsi s t hal tgacaacaaa dopsi s t hal att caggacc dopsi s t hal ggagaagaga dopsi s t hal acct gccat t dopsi s t hal aagat t t aac dopsi s t hal ct ggt aat ct dopsi s t hal tctgaggagg dopsi s t hal aaaaat gaga 12689250 Sequence Listing.txt t ccat ccg iana aaccaaaaac iana gat ccaac iana gaact at g iana ct gt agcg iana ggctgagg iana acgcaaag iana aagaagaac iana gcaaacac Page 385 12689250 Sequence List ing. txt <210O> <211 <212> <213> 418 DNA Ar abi dopsi s t hal i ana caaagct caa at at t at ccg <400> 418 cct ccaacca <210O> <211 <21 2> <21 3> 419 28 DNA Ar abi <400> 419 ggat cgaaca <210O> <211 <21 2> <21 3> 420 DNA Ar abi dopsi s t hal i ana ct ct ct cgt a cgt caagg dopsi s t hal i ana aggaagagga gact t tgggg <400> 420 ccaagagat a <210O> <211 <21 2> <21 3> 421 28 DNA Ar abi dopsi s t hal i ana <400> 421 agt gat t ccc <210O> <211 <21 2> <21 3> 422 DNA Ar abi cgt aact cat gct ct gt g dopsi s t hal i ana caaagagaag aaagaaat gg <400> 422 gct gat cct c <210O> <211 <21 2> <21 3> 423 28 DNA Ar abi dopsi s t hal i ana <400> 423 aagat t t t cc <210O> <211 <21 2> <21 3> 424 29 DNA Ar abi gct acgggaa t t t gaacc dopsi s t hal i ana aacaat ct ag agcaaat cc <400> 424 t ct t cgat t c <210O> 425 <21 1> 28 <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 425 gt aat t t ggg <210O> 426 <21 1> 28 t aaaaacct c ggaacggc Page 386 12689250 Sequence List ing. txt <212> DNA <213> Ar abi <400> 426 t gt gagcgac <210O> <211 <21 2> <21 3> 427 27 DNA Ar abi <400> 427 cgcaacgat a <210O> <211 <21 2> <21 3> 428 28 DNA Ar abi <400> 428 t ct ct t t at c dopsi s t hal i ana aggact t gt a gt aggagg dopsi s t hal i ana ggt gcct at g gaaact g dopsi s t hal i ana ct tcct cggc caaaaagc dopsi s t hal i ana ggagagcaat gaaaacac dopsi s t hal i ana ggacggagcg agagat ac <210O> <211 <21 2> <21 3> 429 28 DNA Ar abi <400> 429 accagct cat <210O> <211 <21 2> <21 3> 430 28 DNA Ar abi <400> 430 atct gaggaa <210O> <211 <212> <213> 431 28 DNA Ar abi dopsi s t hal i ana <400> 431 t cacacacac <210O> <211 <212> <213> 432 DNA Ar abi gt cagct t ta act ccgt c dopsi s t hal i ana acct aaacca at aacagaga <400> 432 acct t cacaa <210O> 433 <21 1> 28 <212> DNA <21 3> Ar abi dops i s t hai i ana <400> 433 acaacgt gt t <210> 434 <211> 28 <212> DNA <213> Ar abi t gat gt gtca ccgt tct t dopsi s t hal i ana Page 387 <400> 434 t t t acccct t <210> 435 <211> 28 <212> DNA <213> Ar abi <400> 435 ccacat gggg <210> 436 <211> 28 <212> DNA <213> Ar abi <400> 436 ct caaaact c <210> 437 <211> 28 <212> DNA <213> Ar abi <400> 437 t agat gcgct <210> 438 <211> 28 <212> DNA <213> Ar abi <400> 438 cagt gacat a <210> 439 <211> 28 <212> DNA <213> Ar abi <400> 439 t cagt t at aa <210> 440 <211> 28 <212> DNA <213> Ar abi <400> 440 aaaat at gt g 12689250 Sequence Listing.txt t aaat at g t t cct t gt ac dopsi s t hal i ana gattttgaag atatcgaa dopsi s t hal i ana ccaggaaaaa cgaaaacg dopsi s t hal i ana aggagat t cc ct t ct cca dopsi s t hal iana t ccaat t t ca aaacgccg dopsi s t hal iana acctttggga accaccgc dopsi s t hal i ana cttccttgga acaccgga <210> 441 <211> 1116 <212> DNA <213> Arabi dopsi s tha i ana <400> 441 at gt cgt ct t ct cgggaaga gaat gt gt ac tatgaggaaa t ggt t gagt t catggagaaa act gt cgaag agagaaacct ct t gt ct gt t gct t cct gga ggat cat at c t t ccat t gaa cat gt t t cca ttatcaagga ctacagagga gatggaatac tcaatcttct ggattctcac t t agccaagt gt t gcaaaga gct t acaaga cagaaggaag aagat cgaaa ct t gt t ccca Page 38E t agct gagca ct gt t gacac acgt cat t gg aaagcagagg ct gaact cag ct gcat ct t t agct gaacgt cgat gagct t t gct aggaga aaacgat gat caaaat ct gt ggccgagt cc 120 180 240 300 360 12689250 Sequence Listing.txt aaagtctttt acctcaaaat gaaaggagat t accacaggt acct t gct ga gt t t aagact ggagct gaga aact ct t t aa t gt cat gat g t agct cct ac agat t ct caa acgggat gt t gat at t ct ct aggagaagaa cct t t ggaac act ct t at at gt t gct gat t ccggaagagg ggaaagaagc gt t t caacat aat gct t at g t cat ccgat t ct cacct gat at gat gct t c caat t gt t t t t cat acaaag t ct gacat ca at gcagt ct a t t t aggat ga ggaaaccagc t gct gagagc ct at ct at ct t gt gt t t t t c agact gggac cgt gcct gca cagact t t gt gacaggcttt acagt acgt t at gt acacct agct t ct t t t ggcgggcggt tgagacaggg act ct ggt t g t at t t t gct g t t ct aat gt a t t gct ct t aa gt ct cgcaaa aacact t at t t gat gaggcc gat aat gcaa t t gt ct t cct act cagaaat gat gagat ca cagt ga ct t acaagt c ct t gagacca ggat at t gca ct t ct ct gt c acaggt t t gt ccagcaccaa at t t ct gagc ct t ct ccgt g cct t t t gct t t t ct ct caaa aggaggcgt c agct caggt a gt aaaggagc ct t gct gat t t t ct act acg at t t t gt cat at t gct t aat t ggat acat t acaat ct gac t t ccct at aa cgct gat t gt aaaacat gag 420 480 540 600 660 720 780 840 900 960 1020 1080 1116 <210> 442 <211> 847 <212> DNA <213> Arabi dopsi s tha i ana <400> 442 at gccgat ca gaaacatcgc cat t ggccgt aaggcggcgt tggctgagtt catttcaact ggcatggctt tcaacaagct cactgaaaac gct gcagt gg ct cat gcct t t ggact ct t c ggtggacacg ttaaccctgc cgtcactttc ct ccgt ggt a t cct ct act g gat t gct cag cttaaattcg ccaccggtgg cttggtatgt ctttttggac ctaacttctt tgtcaagtaa gccggctttt ggt ct ct ct g ct ggagt agg gatgacattc gggcttgttt acaccgtcta tcttggaaca at t gct ccca tcgcaatcgg aggagctttc tctggagcct ccatgaatcc ct ggacat gg accaaccact gggt ct act g tggactcatc tacgaagttt tcttcatcaa ct act ga ccagat gaag t t gat ct t t g ggagccacca gt cgct gt ct ggt gct t t ca ct t ct cggct caaact accc ct aat t aacc agt gt t gaac cgct acagcc t t t cat t gt t cgccgt ggct ggccggacct caccacacac ccacccgt cc t cgt cgccgg ct cct t ct gg cagt t ggt gc t t ggt ggt aa ccgt cgt cgc t t t agccct t t t t t gaaaat gct t t cgt t t at t gacccca ggagccaaca ttcggaccag ct cgt cggcg gagcagct cc cgat gcct t a t t caggct ct t ct cgt agct caacat ct ct cat cact ct c t t gcct cat c t at t agat ct t t t aggct gt t cgagat cgt aaaacgggag t ct t agct gg cggt ggt gag gt ggaat cgc caaccacaga 120 180 240 300 360 420 480 540 600 660 720 780 840 847 <210> 443 <211> 5160 <212> DNA Page 389 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 443 at ggt t t t ga t at gt t cgca gaat act t t g t t ggtgagaa t t gat ggt aa gt gacaaact t cacaact ga cat agt at aa at t gct t aat aat t t gcat t t at aat cat a ggagacaaat acgt t t ct ag t t t t gt aagt caat act agt gt cggt at at aagt agaaaa aat aat t gat ccacat cacc cact at gt gg gacaccgt ga cat t aaaat a gggagt t t ac ct gt gt aat a t gat t t t t t t t at acat at a t at caat at g ct at aat t t g at agt caat t at aacaaat t t acat at at t t agt gt acaa caaaaaccgc ct acact t cc ct t ct t aat g t t cgat accg cccaaggct t cat cat ggac gct ccaggt c t t aact aat g ggt t t cat t a caaagt t agc ttcat t gttt t at aat t at a ct at t at t gg t gt t ct ct t g ggcccactt g t cat ccggt t cat ct aaat c tagccgaagg agt t caccac t accat t t ga at gcgt ggt a aagggt at gt t at t cat aat gaaaaaaagt t t cact ct ac t at t t t t t ct aat t t at t t t at at gcat t t aat t at at gt t at acaat t c t gt at at t t t at aaat aaat aacgaat gat caagt aagt a t t gat gt at t aaagacgct g aacct agct t t ct at caat a t ct ccat ct c gt caat ggt g gaat at agga gagggat t t g agt at t agt c t t t agt act a ccgacat at c at cat aaat a at t t gaact t t t at at t t at t t t gagaat c acaaaaact a aaacaaattt t t gagt t t at aacgt t gt at t ggcaaaat a t ggagaat t a gggacataga t ct ct act t t t t at t t t t t a t t aaat t gat t gct aat aat acacactttt aat t t at t at t t act at t t g gt acacagcg gaat ct gt ct aat t aaccaa t t at t t aat t cgt at cagat ct t t t gt gac agaact acgt t cgat ct t t a ct agat ct aa gt cat t ct aa gaagt t gaaa ccacat t t at t agt t t t t t t cat gct t t ca at t ggt at t g t gaat accaa caaaaat gaa aaagt at gt g cact t ct gt c t ggt t t t gat ct at at at t t gaaat t aat g gt cact t t t c ggt acat t ac aagagaaat g ct acacgcac t aacct ct ca t cagat acat t t t aaat at a at t t gt acag t t aacaacat at t cgt acac t acact t aca gcaccat gt t at gact at t g gat t gaaaaa cat aaaagat t acat ggat g t gat at ggat t at t t cct aa t t t ggt t t ga t gt at aacga aat gat gt t t t t ct ccaaaa tttgagaaaa gt aacaagt g act aat gt t a at t cggt t at t caat caagt aggaaaagat ggcaacct ct t ggt ccact c aagt t gaaaa tttgacaaaa agaact t aat act at t act c gaaat t acca cctctttttt aat ct aat aa aaaaat ct at t t acat agac cat aat aaat at ct caat t t ttagt t t t gt t ct gt acact cggat ct cgc aaaacatttt aggt at gaga gagct gat gc gaaccagagt gagt accct g gt act at at c aat ggt t act aaaggt at ag gat aaact ca gaat at aaaa t at gacccat gact t gct cg aaat t t t cat cgt t aaacca agt at gat t a ggtaaagcag ct t t t at tcc aact t t t t ct gt gaact at g gaaaaccat g t gt aaat at a t aat agaaat t aggggcct t t t t t t tat aa t t gaaat gct at at t t t t ct at t t t t atca at t cat at t t t t t tat at aa acaacatttt aaaagt gt t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 tacaaattta agtgtacgaa tcaaactatg aaaagtcact act t t gcaaa t cgt gaat ac Page 390 12689250 Sequence Listing.txt ccgat acat g agacat aaaa aat t t t cacc aaacgagttt at at aaat at aat aggt gat t aaat act cg cgt t t t aat t caaaaccct a t t t t t aagt t at aat ct aca tgt t t t ctgt t aat accgaa cat agat aac aat gt gaaag caat cacat a aaagt t gat t aat ct aat t a t cacaact t a gact t gt gat at t gt cacat ttgt t t gttc t gaat acgat at at t ggt t t ccct t t cgt t gat t cact aa gat agt aaat agat t cat aa tgt t t ggaaa gaggaatct g gccggat t gg gacaaaccca at ct gact ag t t t gt acgt g t acaat at gt t aagct acat t t ct aaaaaa ct aat t cat a at aaat gt ct caaat gcat t aaat t acagg gtagtcgccg t t gt t cat cg gat t t ct acc t t t gt caaaa t t ct aaaat t aaaaact t ca t ct ct aact t t t cact gaat t ggggt at t a t gaat agt at at cct aaaat aaaacct agg acat t ct at c at aaat ggt t aat aact agt t t t gcat aaa t gat aaaaaa t t t cgt acca gagcatggaa t t t t gaat at aacagaaccg agacggcggt cct t caaaag acat t gt cac t t agaccgt t t t t at t gt t a at gt ct gaaa ggaat cgt aa at t aggat gc at at t cat aa aat cgaat at ct ct ccaccc ggt at aat cg t t cct ct gt a t t t t ctcat t t t aaaact ga acaccgacca ccat at aaga aaat t caaca t cat ggccga aat gact t cc ttgat t t t gt act gcct agt aaat t agaat agagcccacc at gt gaaact at gt gat ggc aaaaaat at c aat at at at a t ct aaagat a at agt ct t t t t t t gat t t gt ttaggaacag at gt gt aaac gggagtaggg aaaat ggcag cggagccaat at aat aaat a gacgat t t t a at at t t at t t gcggat at t c at gt at at at t at at aaaaa agt t gt at ga aggt t cgat c t aaat t t aaa aagttttttt caact t aacc agt t t t ggac at aaaaat gt aaat t ccaaa gat ct gat t c t t at t t t t ct t gt agat tag at t t ggt agg t ct at aat t a t at acat at a caccat t aaa t caacaat t t at cact gt gt at agt act t t gcccat t at c ct t t gat caa ct t t t ggt aa cat aacact t aacaaat gt g at t at agct c acagt t ggt t aacaaacgca gt t caagt aa gaaat t t t gg t t t t gt acgt gct t gacggt ggaagcattt t ggaagt at g tttgaaaaga aagt t cat gt ct at ggagt t cacaact t ct cct t ct t ct t aaaat gt gaa aact t t t t t c ct t t acat ca aaaat t ccaa cggcaagat a t agat t gact t ggcat gagt gacaaagagc t gt t t caaaa t agaggat at at ct cgt ggt caccat t aag gcct t t gat a at t gt gat t a aat act at aa t t agat cact aagct acaga at at caat ct t aaacat t at gact gt t caa ct t cagaagc aggctgaggg gt at at at at ct ct caat t t t gcat t agt t t t t aatt tgt ggaagcggag t gt at aaat t t at t cagaat ggcgt agt gg t at t t t t t t a t ct t cccaaa ct t t t t t gaa agat ct at ca ct caccgat t ccagcat t t c ct ct at gct a caccagaact gcaaagt caa t gt aat t aat agt gact aaa agt gact aat aaacagt t gt cgat cggt ct aagt t ggt ac ct t gat agt g t t at ct t gt c at aaaaat t a gaaat t cgt a t aagt t at gt tttaaaaaga aat t t t gaat t gcgccact c cat cat gt t a t aaaccct at at acagct t a caggt t t gct 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960 4020 gggagaaatt cgctcggtac ttcgaggtgg agctaaagga agtaaaccta agtgaaggtt Page 391 12689250 Sequence Listing.txt act acgt gat cagccat at t t gct agt caa gt ggagggt t t ggt t aagag gggt cgt gt g t t ggt gct ga t at t t t at ct ttggt t gttg t t cgaggt ga aat t t t ct t t gat t aggggt ggt at agaga gcct t ct ct c cgt t t t ggct ct gcgt gt t g at t t cgaagg atagaaggga gt t at t gttg <210> 444 ggat ccagac gggat ccaca gaaaaacgag cat agct ccg cat caacgt g gagggcagca t caacccact aggt caagt a t aggat cgag gt ct at t t aa cgt t t gt t ac acaaaaat gt aaacagagcg tcaaggacca ggat cgt ccc t cat caggga t gct t cat ga tcgcggaaaa gatggaggaa aaagcagcag ct caacggt g gagactggt t t t t at ct at c agt ggt caca gaggat t t ac t t cact ct ca at t at gat gt ccaaat t at t gccgaaaat a gt t t at gact gat ggagaat t t t caacat a t agt t t ccac agct t acact agat t t ct ca gct agat acc tgtaaaggag gt t t gt gaag aaat ggt aga agt t cgaaga ggaacacacc ct gaat t aga agt at ggact ct gaagagct at t t ct ccaa t at t t t acgt gct caat act tgt t t t gtta ct ggt at t t g t gcat agaga gt ct caaagg aacgagt t cg at gcct gccg agaacact cg t t gcct t cca aagaagatgg gagaggaaga cgagaacaca at ct gt gt cg cgt gaaacgt gat ccacgt g at gggact t t agt ct at gct t at ct t t cat gggt at gt t t agcat t ggt t accagct cat cat t t t caat t t acat cgaa acat ggt ggt accaaggagt agat ct ct ga at gcacagca cggagagact agat at ct aa agaaggagat agat gaat gg ct caat gact gat gcagcaa aggct t cct t ggt at t ggt t at t aat t at c at aact t t ca at gaacgt t t t cgt ct t gga t t at aat gt c tgt t t gattt t ct caaagaa gccagt cgt c gat gct acgt cat cacggt t t gt t gct gat gaagatggga t ct gat ggaa t gt gt gct aa 4080 4140 4200 4260 4320 4380 4440 4500 4560 4620 4680 4740 4800 4860 4920 4980 5040 5100 5160 <211> <212> <213> 3061 DNA Arabidopsis thal i ana <400> 444 at ggat gaag ggt ct t ct ct ct ggagat t t at t at t at ct caat t ggct t tt gcgt aaaa gct gact t t g t ct at at t cc t t ct ct t aat aagt t at aat tcaggggaga agt acgaagt ccgt cgat gg t caat act ct ccgat t gt gt cgcgat t t t g agact aaat t t cgccaagt t aat t aggt ac ct caat cagg cagcgagagt agagaaggct cat agt t ct g t ct t aaggt c t aat t cgt t a gat t t aaagc acaat gat ct tt cat cat ct tact t ct t t t t t cacat gga t caat at t t t t at t gaat gt cct gct cat t ggt act ggt c t ct ct ct ct c ct t acat gat t ggt t gct t t gaat t t aat g ccact cct at t at aagt aat caggaat gac t gt t aaat at tgct t t gttt t aggt t ct ag t caaggagt g t t t t t gaat c gaacaacgga gt gagct t ca cat t t agt t a at ct t ct at g ct aat ggt ct t act acggt g at at gaagac t aaacagct g cagagact ac cat cct t agt cat gct t gat t ct agt t gag acat gagaat gt t at t gaga t gat t gat t a act aat t t ca gagagt caac tttttgtggt tggaagaagt aat gt t gaca 120 180 240 300 360 420 480 540 600 660 Page 392 12689250 Sequence Listing.t agct act t t ccagct t a t ct agct t tgatgccaaa ggt t atcaat at t t at gaat t ac gct t gt t cgt t ggaagct at gt gcct cat t cat gt t t gat t t t t t t ctag atggaggccc t t cagct at g aggagagt t a t ggt gt t gt c aggaaat t cg t gt aat gaca t gct ct cat g cagt at t t ca t gcacgt t t t t caggt t t t t t t ggact at a gt gct gt ct a t t cct t at ct at at ggat gc agggt aaggt gcgat cct t c t agct t t gt t agcct t t t ag at aat ccaat agccaccct a caat t gggcc aaat gt gt t t agat acct t c act ct at aaa at gat t acat tt gct cccaa aaaccgagct t aact gt t ct gt cct t at t c gt t t t t gtcc ct ct t t t t t t ct t ggt caaa t t at at t aac t caaat ct cc t t caagaat a caact aagga aat at t t gt t gt ct t aaaga at cat ct cca ggact t t aaa t gt t t gact c caaggaggtt tt ctt ct t t t gt t t ct aaat tggtgggacg t t gt ggct t t t agat ct gat t t ct ggt gt t t t acct aacc gcat gcat at gcaat t t t aa t caaacaggt ttccaaacac gcaaat caga gaat aat gt g ct t gct acac act t ggcagt ct t t at t gt a gggaaagt t c ccagcct gga ct gt caat gt at acggat gt aaggcaaggt t ccat agat t agct t t t cac t gt t t at at g t ct t at gggt cgacgagaag t t t gat t gcg t t agact at c agat act at t t caacct gcc gt gt ggat cc tgt t gct t t a caccat at at gt acat t gct t ct ct ct t at t at at gt t ga tttgtttcgg aacct aat at acat ct gagg aacaaggt aa at ct ct cat c caat gt gagc caggaagat t caat gat t ct cat gt gagt t t t agct aat c act ct t ct ca gt gt aacat t act gacaggt at agcat t t g at cgacct t c ggcgcaccag gaccaagt ac acat t gat t a t caagt ct gt t gat cat t ct ct acaggt cc at t t t t gaga gacccaaaaa t aagct gat g t gct gct aac gact t t at t g t at gat act g cct t gt t t t c aat gt t t t at ct at cct ct c ttat t t ggt t ct t cct t ggc acaaacct ga t cgt t t ct gc tgct t t t at a gagagactgc t ggct gt gac tctgt t t t at ct gt at gt t a ggcagggt t g cagtcggtgc ct ct t at t ca t t t gt ct gaa act at at t at gagt cgat t g at gt ct t ct g t gt caacaga t t ggt cct gt t t t at gat tt gt cttt ct cat t t t t t gt gcat agct t t gc aaaaagt g aacgt cga cacat gat ttctcttg t at caat t gt cat gca t aat gaga gaagt ctt cagct ct a t at gggt t ggtgcaac t caggcat gtgcaagg aacttttt acaggtag caaat gca cggat at c ggt ct gt t acat t at t ctcgggcg aagt cat t t cct t t gg t t at t ct c t t ct t gat at at at t t t t gt t cat cgct gaaa xt cc ttgttaccaa gg ctaacggaaa t a aagct gt t ga ca tttttatctt ct ttttttctcc ag ct t aagt aat cc agcaactccc gc tggaaagttt gg aatggatttg tc ttttgcatta t a acact t t ggc gt ggcact t cac at gaaggtgaat ct tttttgctct cg cagagtccct gg gagaattgcc ta ttctggcgt t tt gcacgactta tg cgtctttgac ct gtcgt t aact ag tttgacgagg ag aaagttgtct ct acagt aacat gt tatgaatgt g tt aaaatgtttt at tgccat t at g ct accccaaaaa tt tcacttatgt tt gagcttgctg t a caacacatac gt ttctcggctc at tcacacaacg ct gat aaccct c 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 tgatgagctg ttcttcgaca Page 393 t ct at gat ag ttagt t t ctc t t t t gt t cat gat gt gtt ga at cggt t t t t gt t gcaggaa at at gagcct t acccct t t t gt cct gaaca acat gt at aa at t aat t gac ct ggat ct aa 12689250 Sequence gt caacgaac ccact t t gga tt agt gagca t ct t aaagt t gagtt at gat gctacaacac att gat cacc ggaaaggtaa ggcgacatat tctgtaattg gtgtggatct taacgcagct Li st i ng. t xt caact gct t c ggct ggt t t c act t t gacac t gt t ct agaa gggagt gat c agt gct gcag at at caacgg t aat aacct t aact gt t gt t t cacaaat at at t gt t t t gt aggaggaat g 2760 2820 2880 2940 3000 3060 <210> <211> <212> <213> 445 3356 DNA Arabidopsis thal i ana <400> 445 at ggcggt gc acggt at act t ct t t t t t t t t t gt gt t gat gat ggt gaat t t gaaact t a ggt t t t gagt gt aat gagaa cat t t cct ca agaggat t t a t acagct t ca gaagaaagt g t cat gt t cga cat t gt t ct t cact ccaaag ct gt ct ct gg t gact t ct t a cagaaat agc gcat gct cac t gt cct agt a at agcact ct aat act gt gg aat agt gt t t t at aaaggct t t t cat ct gc t gt t t cgt ct t t t gtcgttt t aggt caat g ttttcgcaga t ggaat ct gt gt t gagaat a gaaggtgaag agcaact cca t at t t t gat g gact at t at c agggaatgca cgt gt t t t gt cct t t gggt t ct t ggggct a gt t t t t gttt gggt t at gga t agagt t gat t at t ggt aag aagat cat t c gt ggat caga ct t gt t gggt t at t gt gt t a ttgacagagc agat cgaggt t cct cat cgt t ggat gat t t agat t t t gt t ct gt t t cat t t gt at at agg t t gat t at ga agt t cat act gcct ct acat at t t at t gt g at t t caat ga tggagaaaga t t t t t ct t t g t t act cagt a t gggt gt t gc gt t t gat t at t gt cct ggt c gcaagt t gt t agcacct aat t aggat t t gc agcacagaag gagt acgt gg t acaaacat c ccgacaat gg t cgt t t t cgt t t gatgcagc at gt gt cgat t t gcgt gt t a t gcagat gat aat t agaaac aat aagaat t t t gat t t gcc t t ccacct t g t t t ct ct gt c t ct act gact agt t gct cca t t t gct t t ga ctgggagaaa t ggt ggct cg gt ggat agaa t ct ccat cac cgact t t cat at t t at t t ct at aagt t t at gagaagtat t at ct t ggaat ct gt t gat ac aagcgat gca acccat act c tat t t t gct c cat t t cgat c gt t t accgt a gat gct aggg gagat t at gc gt gt t t t ggt acct at ggaa t agt aaggat t cat t t gt gg ccggaagaac at aat gacag t t gt at agt t gcagaat t t c at caaggt aa gct t cagt at cgccaat gca t t t ggt gcat aagct t gt t t t t cct t t gat t gcct t ct t t t agat gat t t cat t acct at agt ggt ct ag ttcct t t t t c gct cgt t cga aaacaggt ga at t t gacgca at cat gat t g at t gcgt t t t ct t gcagct a at gt ct gt ag ttct t ct t cc at at act gt t aagct at ccg aggt agat ct agcgt t at ct cat t ccat at t t t gcaat gc t gat t t ggt t at t gccacag t ct t ct t t gg aat at t t agt t t t gt caaga ggct caat t g gcat ccct t g at t ggact t c gaacgact gc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 aacaaaggtg gggctgcctt ctctcctagt attcagttac ttgtatctat ctttctatac Page 394 12689250 Sequence Listing.txt ttatgcaagt acagtaatgt ttcattgttt tgactggttt actgtatcag gttgaaggag gt t ggaaaat t cat ct t t gc t ct t agat ct aat gaggt gg at gct aaat a at caagt gca t gcat t agaa cct aaaggct t ct act acag ggacaccagc act t t at cac ct acact t ga t at caat ggg aat cct agt a aggt at ct ga cagaagct t g t gcaagct gt taacaagaaa t agccgaaat cat t t ggt ag at gt gt agca at ct gt at t g at gcgaaaat aaaat at t gt gct cagt aaa gat t t cat ca aat t ct agca ggct at t at t gt gacct t ga ggagggaagt t aat ggacaa gaggaat aca ct cagat t t t ct ct t gat t t ggt t gct acg t act ggact a act t at cagc act aagat cc aat gt ct t t g aaggt act gt cagaaaaacg at aggt t ct g aat ct acgat t gagct t t ca aggagaggaa t gcagat gct atgagacggg tcct t t t t ca aagt aggggt aagacct gt c acat acaat a cgct t agaga at t gt at cga att gagacaa cat t gaat aa aaagcgagag gat t t t ct gg gaagaaagat acccat t t at aacgggtat t aagcgt t gga acaact aacc gt at ct t agg cat ct t at gg caat t gaat g at gt ct ggct aaccct ct t t caaat aaaat t t ccagat ga t gt gat aat t t cacat ct t c gct gt ct cac at gt gccaca t t t cgt t act acagt t t gga gggt aacgt t t cagat gact aaat agagaa aagat at t t a at at t t t ct a aaggt gt t t g ccagt cacgt aat gcaggaa gat acat t t t t at cgt t t gc aaact gct t c t agcaaaggt cat ggt t aag acat acgaag gcgagt t t ca ttggaaacag aaat caacgg t gt agaggt c t agt gct gt a t t ct gaacaa cct cat t cat cagat t cat a aggt t t acgt ggat cggt t a ct act acat a ctct t t t ct t gt gt aat ggt ggt at t t ccg t acagt t t t c gcaccgt t gg caagcgat gt ccaggt caag acat gt t gt c aacaacgat a t gagat gcat t gt at agaga ct cct ggt at gt t cat gct t gct t gt gat c at aaccaat t gct aggt cgg gagcaaat ct t cat gaat t t ggact t at ga aaccggct ac cacct t t gca gt aagt caga gt t cccagaa aaaat ccat t t aaat t t ggt ct t at at aga gt caagaaag at ggt t caaa cct ggggt aa accccaaaat t gt t at aat t ggcctggcaa ggaaaccgcc cat t aat at g ct gct t t cca t t ct aat ggg ccagt t t agg t gat t acat a ggt gct cat g cact t gat t c ttagggaaag t cgt gt at ag at t at act at t at t ct act a at gt gt t t cc gaat t act t g t cat at t t ga ggaaaat t t g t at aaacacc acgt agccgt gat ct gt t ga t t at t ggct t aact gaaggt tt cat aaaga aaggaccgt t ct at at t t cc at gcgcct gg at ggagat at at t ct t t t ca gct ct aaaat aaat gt t t gt ccaat cggca agat ct ct gt tat t gt t t ca gtt aaaccaa t t ggcgt ct c aaaggt at t a gt cagt acgc agt t at gaac ct t gaat aga t t gt gaat gt t cgt aat cgt cgaaat gcag at caaacat a act aggcat g gt gggaat gg ct t ggt t gt t caggct t t ct t t agt aacag ct ct aa 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 <210> <211> <212> 446 994 DNA <213> Arabidopsis thaliana Page 395 12689250 Sequence Listing.txt <400> 446 at gt t cat ga gct aagat ct aaagacgagg cct aacct t a t gat t t agga gct t aact ga agcct acgca t t gat t t ggg gt at t aaaga act t gt t at t t caacct act tctgt t t t t g t gcagaat ca at t cct cat t t t accat ct c gaacgt t agg aggt t t caaa t cgat t ggt t t gt t t ct ggg t t cgct ct ca t t cgat ct cc at ct gagt t t t cgat aat cg gcat ccaact t ggt caccag t t t ctcattt gagt act t gt t aagct at t t caggtggacg aaaaaggaac ct aggaaaca ggcct ct cca ccat t ggagg t gggt t t ct c ct at ggcgt t act cgat aat ct t t ct t at t aat t t t gt t t act gct t gaa aacat ggt t g t ct gaagaac at t gct cgca gt t at accag caaagat t ga cagtggaaga ct gt ggt ct a t t gat gcact agat agacat act t cact ac t t t t catgt g aat acat caa ct cgct t cgt gct ggt aaaa cagat ct t t c t t at at gat t t t t t gagt cc aat t ggt gaa t cagcat t gg gggt ct ggaa t t t agct cac gt cat agagg gt t t t at gaa cct agt t gat t ct ct cagac accgt at gct aggaaagggt cagcat t gt c gt aa t agggt t at g ccact ct gct t t at ct aat c ct ggat t aga t t gagt agt g acagagatt g gaaaat caag ggat t act at acgcaat gca att cagaaac ccgtcggt t t gctt acgaca gaat ccct ag gcatcagagg aaagt gaat c aggaaaatgg gcagaaagag t cacat gt t g gat ccaat t c t gct t aaat t t gt t t t aat t gt acagcat c t t t aaggct t gct aaggt t g t cat at at t c t ccaaat gt t t gat gat t ac aagagagat t ccagcgt t cc acgagct ccg t aacggat t c gt t acggaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 994 <210> <211> <212> <213> 447 3065 DNA Arabidopsis thal i ana <400> 447 at ggat gaag ggt ct cct t t gt t t t t cttc t gct t t gat c gt at gat ct g t gagcat gga tgt t cccgcg ggaat gact a tcaggggaga t gat gcct aa t gt cact t ac gct t gt t cgt agt acgaggt ccgt cgat gg tctct t attt cggat t caaa agct caggt t gat gat t gaa at ct gat t at ct at ggt gga agagaaggct ggt at t at ag t aact ct t gt accct t at t c t at t gt t ct c t gt caaggt c t caat t gaac aaaacggat c t t cgat ct t g aat gt t t t gg tggt t t t gt g gaat caact t cct gagcat t ct agct gcca ct ct caat gc at acagat gt ggcaccggt c t cct t cct ct ct t t gaat t c t cggt gagat at aat gat ga ggct t t t aca gat gt gacga ccct t aat ct t aggcgct ag t t t gt t t t gt cat t gaccag t acaaagt at t caaggagt g ct t t ccccga at gat t gaat t agat t agat t gagat t t ga t at gt acat t gt aggt gct t caat cagct t ccgggat t ac t t t t atgct a t t t at gat gg t t gt cct t ca t at cct cagc ttcgt t t t t c t t cgcat aat ct ccccgat t t t gt ct agt t gacct ctt ag cacat ggaca tggaagaagt aat gt t gaca at ct t at caa gaaat ggcaa aagct gt t ga 120 180 240 300 360 420 480 540 600 660 720 tggaagctat gttttcgtca aaggaaaggt acaaacttct catttgatgt atttgttgct Page 396 12689250 Sequence Listing.txt gacattttaa aatatcattt tgctgataac tcttaaattg ttgatgaaaa ttttcctgca ggt t caaaag t gagaaacgt aaaaacacac t t gat gt t ct t ggct aacca ct t t at t ggt t gat act gt a t t gt acat t t ct at gcggaa gt t gggagaa caagt t t t at t t caggcat t agt gcaaggt agt ct gcaag gcaat gt t aa tctgagggag aaggt aat at acgtcaggag t gt t cat t at ct t ct t gt aa t gtt gt t t t t ct cgggccat aggt cat cat aact t t gat a t t t t t ct t t t gt gct at aag ccccct t tag tgt t t t gtca ct ct acact c gaaaat t cat agcct ggaac acgagcct gt gt gccagct a cgagccggca gat ggaat gg aat gcct t ct t t t at gt gac cacgcagtgg at gagaat ga agt t ct gaag t ct ct t gcac ct accccagg at gat t acat t gcacgact t at t t t ctct t aggggaatgc t ct t at gcca agact gct aa t gcagt t ct t gat cgt t at a gt t ggt t gat t gact agact t at at gat ga cgccat t at g accccagaaa agccagtggc t ct gt gt t ac at t t t ccttg t ccat agct t t at gt aat ca t gacaggt at t gcat t t gt g tgatct t t t g caacgagcca ct cct at gga agt t t t t cag at t t gaccag agagt t t at g act t t t gcag cact t cacac aggt gaaat t cagct aagt t gt t t ccaagg t t t t t ctgt c t t t t gat t ct agt gct gt ct t gt ct t cat g accgt aagat ggt agagt t t at gcaaaaag agcat gt t t a aagt t gct gt t gccat agt t gt ggt ct t gt t t aat cacat agccacccga cagt t ggccc t gaaaggact at aagct cca gt acat gt ag gaat t gcaca ct act ct ct c gt ct t ct gt t t ct acagat g ggt cct gt t g gagt t ggaca ggccct gaaa ttttgttcaa agt t acaaca gt gt t act ga gaaat at ggt gaat gaccaa t t gt cagact gat at t ct t g aacat ct cca at t ct gacct t ct gt gact t atggtggcac acaact ct t t t gt at t ggag gacgagggag at t gt gt gt g aat t t t caat t cat gaaact t t t at at t ca t caggat t t t t caaacaggt ttccgaacac gcaaat cgga t gaaaagt t g t ct caat cat ccggaaaat g t acacagaat t ct t t t t t gt gt t cgt act c cagagact ga at gagat at t act gct t t at t ct t ct ct ca gaat acgat g aaggaact ga ct t t caagt g ct t gatggca cat ct cgacc tatct t t ct t ct t t aaat gg t at at t t at c t t gcct t ct g ct ct ct ct ct at at at gt t g gt t t t ct t t c gaacgct gt a gt aaggt t at accct t cat a t act gt gt cc t gt gact t t t t t cacacact t ggaaaaat c t aggaagat t caat gat t ct t at gt aagt t aaat acct ag t t t gagat ac aaagt aat at aat t t cgccg t ggat ct gat ccacaacgt t t aaccct caa ct t cgacat g t t caacggt g t gggcat at t agaaggaccc t t gcgtaagc aggat at acg acact at t ga aacccgcct t gt t at aacag ct ggct agct ct ct ct at gg t at gact t t t ct t ct cct t g aacaaacct g t t gt t gt gca ct cat at at a t ggt gt aaca cct gccgaac at act t at cc gccgt t ggcc t agt t ct t ct tat t cct t t t ggcagggt t g cact cagt gc t t t aaact t t t gcat t t t t c gt t t t t aagc t att caggaa t agt gat gt t gt t gt gaaac gct cccaagg accgaact aa t at gat agat agcgagat ct 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 ctttagattt tattgctgcc tactttaagt ttgatgggtt ttaagtgtct aaccctgtcc Page 397 12689250 Sequence Listing.txt ttggtcct t a cagagctatg atgctacaac acactttgag acaactgt t g cggatgtgtt gaacatgtat accctaatca ccggaaaggt aaagtttgtg aaattcgcta cggatatttg ttgatgcggt ggaaagacat ctcaggttt ct cgacat tt ttcacaacaa gttttgttt c ttcttatgtt gcagcaactg gacctaagtg ttgacctgag tgccgcaagt gctgcagagg aat ga 2880 2940 3000 3060 3065 <210> <211> <212> <213> 448 850 DNA Arabidopsis thal i ana <400> 448 at gcct at ga ttcaacgagc t t aggct t t g t ct ct gt t t c at gt aat aaa cacat cact t caggt at aac ct aact t gt g gt t gt t gt ac ct ct ccat t a at t agaat ac gat t gct ggc t t act t gct t caat ggt at c t at ct t t t aa agt cgt t gag t cact at cgc gt acct ct gt t at gt ct ct t t gcagat act ct cat t cct t acaacaaaag t ct gaaact t aact ct t ct t caagcaacaa ct ggaat t ac acat t cagag caagaacaca tccaacaccg gaat gaccat t gct aagaac cct cgaat gg t cct ct at cc t gt t ggt gt t acat ct t ct t ccagagactt tgacagagga t cct aaacac gact t t gat t cggtggct t t acagt t ggat t t agagct t c t cgggat cat ggaaccct t a ct agct acac gt t gct t caa gt t t ct t gt g ggat cgaacc cagt ct t cct t t t ctt tact gaaat t ggt a gccagaggtt ct gat t t ggc ggt t ct t ct c t ggt t t agct aggt ggat t c ct gt ct cgt c aggccat gat acgcct t cac tcgccgccat t t gat ct t ga aat t ggaaga t cct t gat ct t at cat t at g aat ggat t gc t gt ct act t t t t at ct ct gt gccgt ggt gg at at gt t tag agaaat gcat gt ct t ccccg cggat ccgat act cact ggt gt aaccat t c t gaggt t t t g cgaat at gct ttggtat t t t tt cct aaacc t t t cgt agct t t t t cat tca t gt t t gt gt t caccaaatt t gt at t ggat g tt act aaagc t ct gggct ct 120 180 240 300 360 420 480 540 600 660 720 780 840 850 <210> 449 <211> 2756 <212> DNA <213> Arabidopsis thaliana <400> 449 atgaagatat actctagaac ggt t gctgt t tctgcctttg ctgagcgcaa tgacgggacc gat t cgaaaa atcggct t gc agcacgcgtc tacagacttg gagattctgg agatgctgat cagtactatg gtgagatcgc cat t ggtact actgggagct ctaacctctg ggtgccatca tggactattg ttgttagctg tggttgagat t cact cat t g t t cagagt t g gaat ccaagc gt t gt t gt gc ccacct caga t caaaat gct act t t t gagt Page 39E t gt cat t cct gact gaaaaa aagaaaagcc t t aagaat t a agt t cact gt at t t ct cagt agaaaact aa cct gt gt t t c act caagt t g cct gagagct t ct agat gct ggt t t t t gac aagcaacat t t t gt gt t ct a 120 180 240 300 360 420 12689250 Sequence Listing.txt tctcttttgc agcttgcatg tctcttgcat cccaaataca agtcgtctcg ttcaagcaca t at gagaaga gt t ggt agat aaaccat t gt t gct cat ct t gt t t t t t t ag at at gt ccga t t t gt ct at c t agct aaat t ct cct gt t t g agt gt t aat g caaggagccg act t gt at t t gacacaaaag acgt t t t cca gat gt t ct t a t aacacat at ggt gacaact t ct gcgatag aat t gct t at ttttttccct tgt t agccag t t t gtctgag t gct gt gcaa agaaaat ct g gt t at aat ct t t t ccgt gt g t t gagt cggt gt t ct gcat g aagagcgcat gt gaaacagt t aacagct at ct gt caacaa at ggt t t gt t act t t acagt t aaaagat gc tgatgcagga t aat gat gct ccct aat cag t cat caggag t gat ggt at c gt at gcct gc ct aat at gt c gt t t t t t cat ggaggt gt t g ggct act ggc gt gacgat ct t t ggcggt gc at at agct t t ggt t ct aaag cagat t ct gg agct t t at ag gt gt agact a cagt gcaaga gt t agt gt gc at t at t t t gg ct cgcagat t gt agt cat t g at ct gct cat ggtggacaag t gagat ggct at t gaact ac agt t at t gt t gt gaacgcct t gcccact gt ct ct ct ct t a t cact caat g aacat cact a aaagct gccg gt cacagt t g t act ct t aca t t t atcgagg ct t ggt ct t g acat t ct t ct tt gt t t t t t a t t t ggct t aa at ccaaat ca aggt t t gt ga aact caagct acccact ggt t cct gacaaa t t t gagt t ac t acat ct t t g aat t at ggt t t aat caccat ct gt t gt gga t t aat at t gg ct t accat t c ggt ct gt gt a ccat t aagct at t t agt at g gaaaacgcca gt t gt gt gga gtcaacgagg gt cct gat t a ccccagccca t t cact t acc act t gaaat t t agcct ct ag ct gat agcag caat t cat t a gcgat t t agt acat acacat caaccaagga gatt ccaaga ct t gt t t gga caggt acaac ccgt aat gca tttcaagggc t acat cact t at at at at t t at gt gt aaca ttcaaaaagg t gt t at acag ct t gccggt c t t t t aact ag gat aaaccat t caat acggg act agct aat at ccccct t g ct t t t gat gg aggat ct t ca at t t t gt t t a aat t gt ct aa tccagagcca t aaccaat aa ct gat t at aa at gggagagt at t ggaggca t t t t gcagt t aat t cat gt t ct gct aggt t cggcactgga t gt caaggat caat gacct c gcct ggt at a gat ct ct gt t ct gagt t act at gct caagc gatgaagaag aaacat acat cct t gt ct gg t gt t at agt t ct gat t ggcg t t t t gctat t ggt t ct gt ga caacggt aag t at at gt agt gct at t ggag cagaccattt aaaat ct gt g cgcct t gcag t acccgt ggt t t t t ggaat g t aaat t at ct t ggt gt t gga gt t gaggcaa gt t gt cat t t at cgct act g ct gcagt t ga aagt at t t ga ggat t t ccca aat gt gcaga at aact aact gcaat t gct g caggt aaact ct t ct t aaca acat t t gt t g ggaaaagct g acat t cct gt aaggcctt at aaggt ggt ga at gt t cct gt t t t t ct cat a t gacat gggt ttgt t ct t t a t at gt t t gct aagt ggct gt caagaat t aa aat at t t caa cagctggagt t ggat t t act acaagt cat a acccaaccga gt caggt aaa ct t gt t t gt t agt at gggca gat gct gcgt aacat gact c caagactttt t gggt t t t ct ct gt gcacaa t ct t gct cca 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 gaagaggtat gatagtgaac tatttaaaga caacattcat attatgtagt gaatatgacc Page 399 at ct act aat cct gt ggcac ct ct ggt aaa aaccct aaga accacaccgt ct gt t gt act agt gcat cag ccat t gaat c acggt ggt aa at t t gact t t 12689250 Sequence ggt at gat gt gagcagtatg tggttttatt gctcttgacg tattactctg tagcacaatt aaat gt aaca ggat ct t ggg ggtaacgaac aggtcgggtt Li st i ng. t xt t t ct gaaagt tggtgaggga ttgctccacc t cgt ggacct tcttcttgct ttctttttct agat gt gt t c at gggcaaat tgcagaggca gcctaa 2520 2580 2640 2700 2760 <210> <211> <212> <213> 450 1182 DNA Arabi dopsi s t hal i ana <400> 450 at ggagact t gaccagat ct gct t gt gaga gct acgat t g gacgat gt t g ccagacat t g ggt gaccaag agt cat gt cc t gccgt t ggt ggcgct at gg gt t accaat g gagaaat acc gggggaccac ggat ggggag agt ggagct t agggct ct t g gacact t acg t t cgat t t ca aggt t t caga t cct at t cac ct gat gcagt cat gcaccaa act acgagaa gt ct t gat gc ct caaggt gt gacacat gt t ttgcaaccaa taagaccaga t t ccagt ccg acgagat t gc ttgacgacaa acggt gat gc ct cat ggagg acat cgt gag t ccaggt ct c gaacagggtt gaccaggaat aaacggcagc at ct gagt ct cct t gat gcc gaccaacat g gat t gt ccgt t gacaaat gc t cacggt cac t ggt t at gcc gat t ggt gct tggcaagacc t gt ccacacc gcgt gacct c aaccat ct t c t ggt t t aact t ggt gct t t c gcaagcagca at acgccat t gat t ccagac gat gacaat c gt at ggacat gt gaacgaag t gcct t gagc gt cat ggt t t gacact t gcc aaagt cct t g ttcaccaaac act gat gaaa cgt ct cact g caagt cact g gt cct gat ct aaggagcat g cacct caacc ggacgt aaga tcaggcaaag aagagt gt gg ggagt acccg aaggagat cc aact t ggact ttcggaagag gacaccct ga aagaccct ga t cggt gagat gt t ccat t gg t t aacat t ga gt ccagaaga cccct gagct aagt caggaa t t gagt act a caacccagca t gat caaacc cat caggccg t cat cat t ga acccaaccaa t ggct aat gg agccat t gt c t gaagat cgt tgaagagagg acgaccct ga caagct t t gc t agcaaagt t caccaccaag at t cat ct ct acaacagagc cat t ggagct cat gcct t t g gaat ggaact caat gacaat cgat gaaacc aat cat ccca gt t t gt gat c cacat acgga agt cgacaga cat ggct cgc t gt ct t cgt t gaaagagaca aggaaat gga ct t cacct gg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1182 120 gaggtcgtga agccactcaa gtgggacaaa cctcaagctt aa <210> 451 <211> 3596 <212> DNA <213> Arabi dopsi s tha i ana <400> 451 atggcttctg ttatctcttc ctctcctttt ctatgcaaat catcctccaa ggttattcat ctatccttca tttcttttct ctctcgattt cacattttct caatttctta taggttttgt Page 400 12689250 Sequence Listing.txt cgattttcgc agagtgattt ggggatttct tcgtttccta aatct t ctca gatttcgat t cat cgat gt c gat cggt ct c ct cgt t t gat gacaat gacg t aacat t t gg t ggt at ct t c t ct t ct t ct t at acaaat t g at ggt t gt t t gagaagat t g aat gt t gaca act gacct t g gat gaagt t t ggt gt t t gcc t ct t t cact g gaaat gat gt ct ggagcat t gcact ggaaa at agt t t t at t t gt t agaat ct t cgcaggt aagcaaagga ccagct t ccg ct gat gt gt t gat agt t t ac t at t gaact t acaagacgat t at t t cct ac gt t at cat t g at gacat t gt gcgacgacat t t t ctt caag agaagaaat c ct ggaact ac t t t act t caa gagaagatt c gt t aat gt t g aagagagagt ct t ct t cttc agaagagttt agt gt aact t t t gt gat t cc tcatgaggga gaaat t t t aa aatgct t t t t at gt t gcact gat at gat t g gt t t t t t ct a t ggt caat t t aat cct cct t t cagt gt gt t agt t t ccccc tccaccaacg tctgat t t t a t aat ct t t ca ct agt cat at t t at at agt g gct t caaact ggagt t cagt at t at gggt c at t t at aaat gcaacat ggt t aaat t at gt gat cat t at a gat ct cacgg aggat cggt a tggt t t t gt g t agct agagc at gt t ct t at t t ggtgaaaa ttct t ct t ct t gat aggt t t at t gt gat t g ggat cat t at gcat t gcagg ggt at gat gg tgt t t gt t t c tgcacaagaa gct t t ct t gt t t at aggt t t gctacaggga aaggt t t gt a aaat ggaaat t ct ccaaat t at gaggt t t a caagt gagt g cagat aat t t gt ct t cacct at t act ct t t t t gt agat t a ggt acaact a cct t ct ct t a ggt t gct t ga t gt ggaagct t gaggcat gt t t gat gt at g aagatt gt t t t aagct t ct t t gt t ggat t t ttcggagaag gact cat gat agccaaggt a t ct t ct t ct t agt t t aggt a t t t act t act at t t t cact g gaacagaat a t t t gaat t ga act cat ct gt ggt cat t gca gat t gaat at tgt t aggaac ttggaaacac gaat agct ga cagt gat gcc at act gt agt t ct t ggat gg t gat at gt t t t t t acagt ag aat t gcat at t gggat t ct a t t ggt gaaat t cgaaagt ct t agaaat aaa at ct t t t t t g gggggaaaga at act gt ct t t ct t act cct ccgt t at ggc ct t ct t cttc t at caggt ga t cat t agt gg gt t t gt ggt c cgt at t ct t c ct t ct t cttc t t agt gaat t t gct gaaggt ct gat aagcg t t aagt at t t ttctgt t t t g aggct aat cc ggccaggaga ggat t t gt t t agact cacac t gat gcaggt t t cct t t gat t gt t t caact gt ct gaagct t gagat gcct t act gcat t t t at gaggt at t act gt gcag t ct ggcgt cc at ct gt t gct gagt gt aaga aat t aat agc gct gct agat at ggt gt cat gt t t t ct t gt gat ct t at at t cct caaaag ttct t t ctct aaact gggat t t cct ggt ga ct ggt gct t t ttct t ct t ct t t ct gt aat t ggt gat caca t t gggat ccg tgcaaaccgg ct at gat at c ct t gcat t cg t gact acaaa ggt aaagat c gat t at gt t c act t gt act g t t t gtgt t ag t ct ct at gat tgaaaaaggc cat at cct ga agt t at t t gc at t t cct t t t at t gaacct t t ct ct aaat t t t cat ct gga ggt gcaact t aat cat t ct a t gcat at gat ggaagaaaga ccct cct gat aaat cat at g ct t ct gcaga 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 ataggacatc tgtaccattt gagccagtat atagtgatgg aaatgcaagg tacgatcaca Page 401 12689250 Sequence Listing.txt tggtactcta gaatttctaa acagtgggca gagtaaatat ggactgctgg tatcgctat t t gaat t t t t t gaacct act c aaat t gt at t gt t ggct aac aaagctggag t t t ct gccac acaaccgagc ct t gt act gg t aagt t ccac gcggaagaaa at ct gaagct at t t t ct at g at gt t gt t ga at caccacaa aaggt caaag cat t gt t act t ct ggt cat c gaaagacat g gt ggt gct t g gt acccat t a act aacaaag act t ccccgg ct gcagcct c gt ct at ct gg t t cat at gag ttttgggcgc ct t t ct t t t t cct gt ggt gg t t caggt t ac t ct agct aga tgggaagaca t t ct t gt ct g ct acaat cgt aagact t cat t cat gcat ag t att aggaaa tttgtttgcg t t ccaact t t at ct caagt g t t at caggt g t gcgcagat c cct t ggt ggc ct t accct ac acgt gt t t t a t cggat ggga ggct t t aacc aaat aact ca t aat acaact aat t t t t ggt gt aat t acag ct aaggt ggg t t at t t t gt t gaat gcaaag gaggat t t t a t at ct t agt a caat ct t gca caaat t gaaa ttatgtgttt aagt t at t ga t t agt ct cat cct t gt cccg gt t t aat caa t ggat ggat g tttgaagaag ccagcagaca t t at ccaat t aaat gt t t gt cacaaagaag ggccgt gt cg gt act ct gga t gat agaat g t gcggat t gc ct t t gt cgca t t t acgat t a aat gt ct t t t acgt gaaaat tggctgcagc ct cat gct t a aaaggat ct g t t gt t ccat t t gt t t gt ct g ttct t t ctct gaaaat ct ac gct act caga agaat acaaa t gt at gct ct ct ggct gt ga cct acgct cg at ct gcact t t acaggt gt g ggcaaat t t a ct gacccaag t gt gt aacac gt cacact ct cct ct gt t gg gat t at agat ccaat t t aaa t t t t cagcct t gacagagt a t aaact t t t c gct ct gt t gt t t at ct t gt a cagt gat agt gt t gagt cag aaggtt ct t t ttgtgt t t t a aggt t agct c t accaaat gc cccggt acct cacaccagcc ct t gaat gaa cacct at t ct t gt ct caacg ct t agcgt ct ggagt t ct t g aacat gt t t c ct aagt aaaa t t at ct ct cc t t gacgt gt c cact gagt t a cat t ct cct g t acat cggt t cacgcagcag act t gat t t t t t agt at at g t t t ct t t at t t t t t gctggc tct t t t ctct tcagggaaga t acat t t cga at t gt at ct a ggagcaggt g agccct agct cct caagt ac t gaaagt gag acaaacagga cct t acact g cagt ag 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 <210> 452 <211> 1777 <212> DNA <213> Arabidopsis thaliana <400> 452 atgaatctta gggttttgtt cctcgctct c aggtctcctc tctcttctct ttttgttttt gtttccttcg cgttttctga ttggatttca taaaggatct ggtaacgt t a tgtcct t at t acaatacttt gccttcgaac attgtagctc gcaatgtggt ttcaaagata caaactttaa ctt ct cctt g ccat t gat t t cat t t gat ct cat cgat ccg tgtgt t t t gt aacgt agat c Page cat ct cct ct t gt agat ct a ct gt t t ccga agtgaaattg gtgt t gctgc gaat cccct a gct t caaggt gat ccgt t t t t ct ggt t t ct agct cct t t a caat cct aga gt aat agat t 120 180 240 300 360 12689250 Sequence Listing.txt t cttta ttttct t act ttacactcac cttcaatagt gtcttcgttg catg gggtagagga ct gat gt gt t at cat t caag at gaccat ga tccccaaaaa cgt t gt t gt g ctggagaaga agct t t agct t t ggct gt ga at ct t cct t a t aggaaact a t gcat t ct at ccat acat ct at cat t ct cg t ct t gct t gg tggaaggaaa gaggt ct t ct t cct cggt ct ct t t t t ctgg gaaaaccaaa t gaat ggct c t gat t gaat a gacat cgt ct aaaat t gcaa t t cat gt t gt t ct t gt t gac t t t ggat at g cagt gccaaa at ct t aaaga gact gagct c ct ct t t gt gt ct t aacaggg t gat cat aag t t t gt ct aaa t gcagagact t t gcagt cag ttcat t ctta tgt t gcacag gccat accag cagcggcgaa at gggcat ac t t cat cat t t aaggt t t cca gaggt t cgct at ct gat t ga at t gt aacat ttct t t t gat gat gt t gt gg aact t gt cct t gt at gt gt t t t t ggt t t t t ct t gt t ggt c t gagt t t t ct aaaact agag ttgt t ggttc t at t t gt t at t aacaat gca ccagt act t g aaat acaat t cct ggagcat agt gt ct t ct tctgt t t t cc agccaagt t c gt gcagt t aa aggt ggaagt t ct ct ct t t g agggt at gt t t t t ct gaaga cagt t gct ag gagaaaacac cat t t ccagg t act cct cct aat gct ct t t t t aaaaat ga gt t t t gt t gt t aggt gt gat agaat ct aac cat t t t at t g t ccat cccaa caggt aaaca gt t att gaga t t gat ct ggt acaat ggaac t t t t aacact agcgt ct cac acat t cct t t aggaactagg t act ct t cct t t t t t at ct t agacgt t at g at gt caat cg t gat gatgcg agt t gagact t t ct t acgt a gt t t cct gca aggt agct aa caaaact aaa gggaat t agg cat gt t ggt a t t t t t t gtt g cgt ct ct t ca t att ccacaa aaaaact ccg gggt t at at c cat agaggt a cggaatcggt caaggt aaac gat aat ct cc t ct accgagg cct ct t cct c cagggaact a cat t gt t ct t ct t t agt gac gatgccgagg gt t gaggaag gt t t gtgttt t t t at ct at c gt ggt t ccag t acct ct t t t t at ct ccaaa gccagcgt t c agcat t t gac t t cacat gct agcaacat t c t agacaat ac aaagat ct aa at at acgat g gt t gaat ct g ct cct cct cc cat t ggat ct t ct t t t acca cat cact cga ct ct t cagt g ct t t ggccaa 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1777 gggaaaacaa agaacaagaa gaactaa <210> 453 <211> 438 <212> DNA <213> Arabidopsis thaliana <400> 453 at gcct at aa gt gcgacaat ggt cggagct aacgccctcc gcaagctccc ttatatgcgt caacaaccct aatttttgtt ttctgatttc tgtgt t ctga ttggatat t a atatatgtta cttggtgctg tgtttgcgaa ccagctcgtg gatgtgatgc tcgctaaggc tagagctgct cctcttcttg tttttggttt atcagcttct ttgctgggt c cgtaagtcgt gat t gtt gag gatccgtggg aaatgggat g aatgagcgcc caaat t t t gc Page tcggtaccca ccaccaatcg aaaat gcgat agcatgtggt t gaagct t aa gt t act t t gg ttttttttag gat gt act cc aaaaat gct a t ccgat t cat gggtatggga ggaagatctt t at at at t at aaaaatt ggt 120 180 240 300 360 420 12689250 Sequence Listing.txt gacgtttttg gtgtatga <210> 454 <211> 273 <212> DNA <213> Arabidopsis thaliana <400> 454 atgtacacac aacaaggctt ctcgal ttctcacatg atgcatcaaa gaaag! gttagaggaa gcattggagc tggttt cctggacttc ctacaagacc ttgtt1 tcccttcacc gtcgttcctc acttc 438 120 180 240 273 atctt cccatatgca tgtcgaaaac gcagtactca gagca ccttcagggt ttgtattgcc aattcgggat cata tatccactgg ttggtacaat gagtacaat g :ctat gagattgata tcagtcaaag cttcaaggct ct caa t aa <210> <211> <212> <213> 455 2271 DNA Arabidopsis thal i ana <400> 455 atgacgccgg gagat t t gt t ct aaaaggag t t ccagaaat t t t t agt aag tggcgtcaga ct act gt cac gt t aat t caa gaagct t ct a cact t gct cg tgtgagggaa cggt cgt at c t ct ct gact c aaat t gaat c gt aact ct t t ggt gt t at gg t cct t at aat t gt cacaaat gt t at at at t tgctgttttt aaact cacat at t acgt ggc agagagaaag t gacact aaa aaggt t gcaa ct gaaaacgc t gct t at t at aacagat aaa agct t t aacg cagat t cgat gt gat t gaaa t t at ggct ca tcacggcggc at gaaagcct t t t gtgt t t c at t ggaat t g ct t at gat gt ct ct acct aa ttcct t t att at t agt at t c cat at gat at ggcgccggt g agt cgt t t gt aggagtgaac at ct cgccgc cgagacgttt aat t ggt t cg cat gcgaaaa tagaagaagc t t gggt gt t t ct gaat cgat agccgt agaa t gt agt cacc t t agct t t gc t gt t at gt at gggt ggct aa aacat t t t gt at cct accct t ccgaaagat t gagaaat gt agt ct t cct t aagcgcacgg t caaat t caa act aaaagga t t t act t t at t t aagat t t t t gt aaaacac aaaagagcga t gat t t aaat ttggt t t t gg t gt gat t gt g gagt ggt aca accgtcgggt t ct gat t gat gt ggat ggt t cgaat ggt t a ggaacatttt t gt ggt gaag gggt aaaat t t cat at ct t t agat t act ag cggt t cgt t t aat t cgaaag gt t t at t gat t cgat at gaa t t t t cact gc gcgct t t ct t acct t t aacc cccaaat t ca at t t ggt aaa at t gt gat t g agcagat gcc gt t cgct cga ct gat t agt t t t gt t t t t tt agt t t agagc t t t gt t t t gc caat accagt at ct t cagct ct cat ct t gg gaat gat cat t cat ccat cg acagt aaaca at t ct aacgg cat cgt gacg aaagaaacgc at t cgt t gag agt cgct t t c t t t caacgt a gct t ct cggt t gat t gt gat gat t at aacc ggt t t gaagt t t t cct t gaa gat gt aaaag t aagt gt t gt agat t at at c t ct ggcgt ct ct t t gt gt t g act at t gt t g gat t gct aat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 aagtgatgaa taattcctag ttttgtctga ataatttagt gcttagaagt Page 404 12689250 Sequence Listing.txt gt cat agt t a agt t t t t gaa t gct ccct gt t at caat ct t aaact gct cg ttcggggcaa gt t t cat t ct at t t ct caat t ct at aacac ccacat gagt gact t t t gca t t accaaat t at ggggggat t t t ct gt gca ggccacgcat aaaact ccat gaagat gt ga t gt t t ct agc gat t cat t t t aaggat t t at t t gcagat t t aagaaaact c ct gt t ct aac ccaaaat cat accgct at t t t t t t cctat g t t cct t ggcc t t ct ct t cac ct gaaat t ac t t t ct ggt ac t cat cat cct act act t ct t ct t t cct caa gat t t gct ca t acggat aag t t gt gat at g t t t ct t t gt g ggact t t t t g ct t cagggga t ggt at t gt t ct t cct cagc t t acgaat ac t t ct gt aggt t ct t cact t t ggaagt t ccc aggt act t ct gct at ct ct c t at ccgt aac ggcat t t gt g ggccct at t t t gcgcct t t t gt ct agct ac ct ccat t gct at t gt t aat t t t ccat at gt aagaccacgg ct cat t ggcg aat t cat t ga t caaat gat t ct at gt at gg cacagcagct t aact agt t t t ggat t ct ct ct aat t cgct ct t t at ccaa t at ccccgaa gct gat gagc gat gaaat cc aact t gt t ga t gggct t t t a cct t gacat c agaagct gac tttttctagc at t t cct t t a gaat gat acc ct t t cat gat t t gaat t cat agt aaacaaa t at t t accat gat gct t t aa at cct t gt t g t t cacat at a at t t at aggg t gacggat cg cagt cgt gat accaagact g t ct agt act a ttgt t ct t gt t cgat act gc cat gct ct t a ttat t t gtct ggt aaat t ct ct caagacca at cct t acat gggt t agt t g at cgcct ct t gt gcaagt gc t ct t ct t ccc aat gat agcg acgt ccct t g tgcaaggcca 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2271 <210> 456 <211> 788 <212> DNA <213> Arabidopsis thaliana <400> 456 atggcgtcct tcgctcgccg tgccatgagc tctggagccg ct t ctat t gg tcagcgtcgt ctccctcttc cttctctttt cgatatttac atagcttatt tcgcacattt tatgtctaat atgt t acaac tttgtgaaga ttttgtatat actgtataca aaaaattgag tttttgagca tcaccatgga tctaccaagg ttgact t ct g agaggagcat gtatgtatca atcatgtgtt gtgatttttt gt t gaat t ga ttcct t ctgt tctctgtctg gctggggctt acttttctac aaagaggtac ttactcttgc tcttcagctc ggttccttcc cttatacata gttttttttg tcattgtcac caggaagacc ttttatggag t t agct caga t gcct cgccg gat t t t t t t c ttgtgt t t t c cat t ggcgt t t t t t gt ggat gaaacagccc t gt gacact g t acct gacca agt ggat aca t t gctct t ct cagacgcct g agacct ct ca t cccat ct gc gcgccgctgg ggt at caat t acgaat t t t a gt caggt t cc t t gt ct gt gt acgaacccag t gt t t t t gt t at ct gcagt t agct cgccac t ccat t gcct t ggagagt ac t t ct cat gca t cgt at t t cc t aaat t ct ct t t ggt ggcgt gat gaaccaa aaat t gaat c act gt gcaga gaaact ggaa aat gaaagct t gt gct t at t tgggggaaaa agaggct t cg acaat gagt t ct t t gt t t t c 120 180 240 300 360 420 480 540 600 660 720 780 Page 405 12689250 Sequence Listing.txt cct t t tag <210> 457 <211> 553 <212> DNA <213> Arabidopsis thaliana <400> 457 atgatttacc gaaagtggag tttgctgtcc attgctgccg tcgccgtcgg attcgtcttg ttggtcgtta gatctatatc ttcattggct tgtgttgcgc tcctcctttg atcttattcg ttggcctttt gaaacattga ggcacgaaag ttcatagtct tcgttgatgg atcaggattg atttgatccg tatcaggcga ctaagcttct atacataaac caatttcatc tttttttgtt agtctgactt t tggtgtctg atgatcaggt aacaaacaag tga ggcccaccgg caaaacgt at gct t t t t ctc gat cacaaat gaacaacctt cct t t at t t g t t ct cgat ct ggt aact t t g aacgaaaggg t t at cct cgg t ct cct t ct c t cct t ct t ca act ct t gat g t aat t gaaaa t gt gt t gat a t ct ggagat t at ccaaagat gagcagaaga cggagcggt a t t t ctct t gt caat t t t t ca agt t agt aat t t t gtgt t ct aaggt t at ga at at gt t t ga t t agat t t gg agaat gt ct c 120 180 240 300 360 420 480 540 553 <210> <211> <212> <213> 458 3761 DNA Arabidopsis thal i ana <400> 458 atggcggacg ccaaccgct c t ct gct t cag cct cct ccga ct t ct t aat a ggt gacaaca tt ct t t t gt t t gat t t agag t agct t t caa t ggt aaaggt t act agagat ccct gat cag ct t agt ct t t t gt at gat t a ggccgtagga aaggt act t a aat ct caat a ct cct ccat c t t gat cccac t ggat gaaat gcgct gat gc aaggat t tag t at ggaat t t ggtaagcaga ggaact act a ggagagact a at ggat gcag at ct caaagg gt t gct aat g t t t t ggt t t a tgggtggaca at gct ct gga ct cat cggat aact cgcaga cgcacct acc ccagat t gct t aaacgt cct ct cat at ccc agggt t t gga tgtcagggac agaagat t ga t t aagt at ct accct aat t g ct gaacagt t at agt gaat t ggctgaggca agcaggggct gatt gaccaa act t act cca cct accggct ggt ct t ccac aaacaaaaag cgt gt t gaca t ct ggt t cgt at t gaaaact ggt t ccgt ct t at t ccgaat t cagct t cag t gct act agg gat cact gac t aggt gat t g ggcaat acag gat caat t t g tttttggtaa acaaacgcaa t ct ct t ct gg ctt ctt ctt a cacaagaaat at ggt gct t c t ct t t aaaat t act gat t gt t cgat accgg at gagagt t g t ct ggagct a act gt t gacc gt cct t caag at gt agat t t ct ggt t cagg t t at gaaaat t at gt gt ct t at acgaagaa t ccgat ccca caacagcgt t cgct gct cgt t t at gat t at ct ct t t t aac gat t t gat ct t t t cgt at gg gt gt t at cat agat t caggt t aact ggt ac aggt agt agt act aat ggt g tggaggaggc t ccgaat aac gt at gt aat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 406 12689250 Sequence Listing.txt tgtaaactgt t ctaggt t gg tttgataatt ggtaaaggag gtgaaacaat caagct aaga t t t gt ct cca t t t acat t t g cgaacagatt gat t gt t gt g at t t gat at t t acgt cat ac aat at at t ag ct ccacgat a ggaaggt t ct acaaaaaat a gct t ct t ct t gt t t ctgtta t ct t t t t caa ccaagaaat a ttgtctattt gccaat cct a t gct gt gat a cat t t ct t gg accaccaaag t t t gct aacc ccgt t t t t ct gt t gaacact at aggt t cca gt t t t t t ct t ccaat aat t t t at gat ct t g ct caat t t ag at gct gt gt c t t t ggt t t at gt ggt t at ca ct ggt t at gg cacct t acgg ct ggagct ag gct ggt aaaa ccccct ggag gaacat gct a t t t gt t t at a gt gt t gt gat t ct at t at t c ct gct aaaac at t gcat ct g t t at t t t at t t t ccat ct t g cct ct gct ca ct at ct t at t t at ccaat t t tt ct t t agca t t ggagt at a cat t ggt t t g gt gt t t ct aa gt caacat aa gt gggcgact gagt aact gt t t acct t cac agact ct t t a t gt gt at at a gcacccattt caccaat cag gt t gat ccga t ct t t gt t at at t gt t t t at gcagaaccgt agcccgccca t ggt t acat g aagt t accct aat t caggt a tgt t t at t ca acccaacgcc aacaat t agt gat t at gaca t acat t at t t agat aat agg ct at at act c cat t ggat t g ttaaaccacg at at at ct t a t gct cat t t t gt ggcagt t t acagt gt gct t at t cact t t t at gcagaca ggatgcct t t act gt t t t at ct t ccgt t ca gacat t t gaa t at t at t ct g cat t at at t c gt t ggt at t a t at gt t gct t at ct cgaat a t acggt at gt t t t ggact ga ccgct t t at t t t t cctgtta at gagaaact ccct caagct caaccaggag caacaaactt t gt gt t ct t g t cccgt ggt t agaacggact t aat gaaat c ggct t ct gct agt t gt cat t t gat gt aggt t t ct t at gcc cat t gt t ct t t gaagat at t ct t t gt t at t gt t gccgct a t t at gggcat t agcat gat t t caaat gct a t t t t cat cgc gt gaggct gc acaat act gt gct agt act t gacct at t ag cct t ggt act t ggt aat at a ggt t at t t t g tt gt ct gcaa gggaaaggga aggcat cacc t at t ggt aaa t gt ct ct gcc aaaagt gacc cagcaatggg gggcaccacc cat at ccagg cagct ggt t a t caat t at gt at t t gt ct gc t t gcagat t g at cagt ggcg gt gt ct at t t agct t t aggt t at act agt t t t cat t t cct ct ct t ccgct t t t t gtgat t gcat t gt aag cct t cct aca t cgct t at at ct t cat aact t t t t cat agt t t ct ggt cac t gact agt t t ttttgttaca ct agt t t caa ccgt t gt gt t t t ccccaat t ct t ccat gag ct gcacaacc aagct t ct at t gggct t ct g at t accct t c t agcgt at cc t ct gt gt t ct t aagct at t t t ggaggctat t ggt ggt ccg t ccacct cag ct at gat cag caaat ct at g t t aggt t ct g aggt t at t cc at gggat aac aggt at gt t t t t at acat ga ct at ct agt a t cact t agt c t t gat at t t g t at t t gcagg t t ct t aact t at t at at gt t cgagt t t cat ct act t t t ca cccat t gct a agaaat ct t g agccaaggt g cagt ccaacc t t gt gt t cat t cat caaggt gt gt caat aa t t t ct t aaat t at t gacat t ct t t gagaaa agt at t cgt t caggagt aac t agaat at ga t t gt at gcat at gt gt t t at at t ct aacga ccacaacaag ccagcacaac t at ggt caat t cct ct gt gc 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 Page 407 12689250 Sequence Listing.txt caccatccca gcagagcgcg caaggtgagt atgattatta cggtcagcaa aaccaagcag agcat gct t c acaat gcct c acggcagcac cagct cagt c gt act ggt ca cagcagcagc at ggccagag cccaaccagc agggat at gg acgggt ct gc ct ccacct gc t ggt ggt agc t ggt t at ggc gcagcaat cg cact aat cca t ggacaggct ggcagggt at t t acaat t ct ccagcagt ct agct t caggg gt ct t at gga t ggagccact tggaccgccc tcagccccac caagct ggt c ggat at ggt c agt caagagg gggt at ggt a ggagct cct c gggt at ggag ccaggt gct c t at ggacaac ggat acacac gct ggt ggt g aaagcat ccc caacagat ac agggat acca aagct gct gg aagat gcat c caact ggt ca caact t ct ca caccaccacc ct gggagct a ct ccagcgt a aacct gct gc gt ggt ggt ac cgaaaagt t g cacagggt ac gcaagat ggg gt at gat caa t caagccgct acagccgcct ggct ggt t ac t gct t caaag t ggt agt cag t gggt at ggt t ggt ggaggt accagct t ca cagt ct cagc aat t act acc t at ggagct t cagggt ggt t ccaccat cgt gct caaggt a agcagccagc ccaccgactt t ct gggt at g caagcgccac t act ct t cag cagagt gct g 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3761 <210> 459 <211> 3359 <212> DNA <213> Arabi dopsi s tha i ana <400> 459 atgagcgcga cgcagacgca gacgc ctctgcaaac cgt ct t t cca cgt cg~ gccggcgagg atttctccga cgatc( tctgccgacg ct gt cgact c cgat ct accgat ccaa t cggt ct ct c t ggt ci tctctctttc gtttctcctg cagati atatatatat atattgctgt ctgatt ct cggagaaa cat t ct gt ag ct at at gat gt t acca t t aaggt t cc t cacc t ct at ct act aat cgat t ga t t at c t atat at ctt t cgt t t tc t t tgg! t gct gt ggaa t act t ct aag aagtti aacgt cagaa ccaaaacgat gct aa! aaattcagac agagaggcag aggati cgatacggac aggtggacgc tatgal ct cacacgt g ggt t t t ct ct gact ct gccatgaccc tcattcttct cttag! atgga ccgcactcgc tggccttcag ggtgatgcga at cct cct ct cagc t ct g :tttg ct ga cagc gct t act aa gt ct t t t gt gct at ct ac t t ca ct t c gt t gg cccct ccgca t ccgcct cct t accgaaat a ct t ct t cct c t act at at at t t gt t t at t t gt caacaat a ccaacatttt gat t cgt cac gct ct t at t t t gt ct gct t c gt at ct ccca t t ct ggat ac t agt t gaaca ct t ct t ct t g t t t gct ct gc t t gaccct t t t gt t ccgccg gat t cct cct agt cct t cgg at at at at at ggt t gcgcag gt t ccact t c gt t t t ccct g ct gcgt act t cact cgt t ca gt ct t ggacc t gt at t at t a at ccaagt ca cgat gt gaaa t cct t t t aaa at t gt acaat cgat ct t ct c ccacgt at ct caaccat cct gt t ct ct ct c at at at at at agcgat ct at agaagt t cgg aaat ct gaaa cat at at at g ct t caat ct c t t gaat t gca t t gt aggcag cct gt t gaat gagct t ggag t t gct ccat g gacgct gat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 gtgagcgtaa atatcttccc cagtttttca agtttgtcgt t gcaaaccca ct ct cagt ga Page 408 12689250 Sequence Listing.txt ggaccaaggt gattatacta gtcaattacg tatcagcatt tttatttgtc atttgttttc agt cct ct ct aggt aggt t c t t cct t gt ac at cat gat t g at t cgt caat t at aat cct c gt t ccgccct t aaaat caaa t agaggct t g aacct gct aa caaccaggt c caaat cgt cc at acct aaac t t aaacccat aaat t t caaa at t ct cggcg tat t t gt t ga agggct t t ga cagt t agaac t cact acct t ct t t t at gct tttttttttc gt t at acact gat acccaaa t aggcat act ct gt cacaag gt aacat at c t gacagaat t gct agt cgaa aaact ct t gc t gacaaact g at ct at t t gc t t at gct t gt gagaat ct cc ct ct t aat t t gcagt t at ac gat cccct gg gt ccacaat a t t t ct gct gt at acaagct c cat t gagaac gcaat ggagt t gt cagat t t t aat t t gct g cgccagt t at ct gcagat gt t aacat ggcg ct gt aagt t t ttgt t t gct g gat at t gct t aact aggaaa gcat t at aga t cat at gaga agccagt aag t aaat agacc gat t t aat t g t gaat ct cac at gaaacaca t gaact t t ct gt gaat cacg gct gat t cat agat gt t acc accacct cct agaat ct t at ct aat t gt t g act t cact ct ggt gt t t t gc cgccat gt ca aaaat t gat t t ctt gaacaa ct gt gt agt t at gt gct gt t cat acaaaag gct gt aagat t cccagaat c t t ccact cac aat ccgat ct ttccgggcaa cacaaat t t g acgct t t ct a t t aggat t t t cctggggaag act caggt t g cct gacaaag t gct cat ct c ccgt aaagag ct t t cgggt a aact t acacg aaaccaaact at t ggagaag gt t t t cct aa t ct t ggct at cact aaaccg caggat t gaa aagagct ct c t gcct ccaag ct at t t ct ga gt gagt gct t at ggat cat a t t agaat t gg act t ct gt t c gt ggt t t t gt t aacct gt at gat gt ct ggg caaacct ct t t acagaat ga gt t at gt gca t ct ct gt gt c ggcgggggt a acaaaat t cc ggt gaacct g ct ct gt gaca gt t ct aagt c ccaat gcagg caccaat at c gt t t act t aa at cgt ct aac at t aat at gc agacaact gt ggct t cacac gat aggcaac ccggt t ggt a accat gt agt gcaat gt gt a t t t t t at t at gcat t t ggct at cct t ct t a ct gggagt cc t at gcaggt t ct t t agaagc gat gggaggt atctct t t t c t ggt cct cac gacact aagc t t at t t gt t t acaccaggag t at ggaccaa agact caaca t cat aagcca t cagt ggt ct t ccacaact a agggaagtaa gt cgct t aca acat gacat g t gaat gt at t gt at cagt t a t t cat t t agc t t t aaaat t t aaact t ccct gagt t gt aga t t t t t t ct ct t gagt gt aaa t gggaccct t t t aat ggt ct t agaact t t g tggaccaaaa acaaggaact ccaat gat t t gaat gt aaaa agaaaat cgc cgagt t gt aa t t t agt t t t t t t t t ct aaat t t gact t at t t aagt aat ct aggat t cact gt at t ggt t g acgacattt t gt t gat t t t g gaagat cct c ct cat t ct at t agt ggact g t ct ct acaag t at cct gggt aacacaacaa aat t t gt gcc t ggt t t cct t gat t aggt gg agt ct t at ac t ct gt at ct c ct aat t t t t t ggt t ccagct ct aat gt t t g ct gt t t t cca cgaggt ct ca ccagact ct g ccgt acacgg ct t t gcaat g gacaccat cg ccaat t ggt a t ct gat at at t gggat cacg 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 gccttggaca caagagagaa aaaaacctac gaacttgttc cagatatgga ggtcagtgta Page 409 ct aacct t t t t ggt t cacca t aagt t at aa t cct ct gt gt t t gt at at ct t aat cgat at gacagcct cc t gt t aat t gt 12689250 Sequence Listing.txt t ct gggt t ct aggt ccct ct ct t t agt gaa aacaggaaat tgttgttggc attatactgc agatatttgt agagacagac ccgagtggtg ttcttcctcg acaggttttg ttattgcacc ttttgtttct tagtagaaaa taatttggct cctttgtag 3180 3240 3300 3360 <210> 460 <211> 546 <212> DNA <213> Arabi dopsi s tha i ana <400> 460 at gggcgt cg ct gt t ct aaa t ccccaggat caacat cgt a at cct agcgc at gt cccaac acacgccgga gt cct cct cg t agt caat cc ccgct t cct c ct cct cgt gc ggct gt ct ct agt ct t agca at accgt t gc cgttagccag cccaaggaga ct t cagat ct ggt t gt ggaa ggaccagatc cgggtttgat tccgggtcag gcaccgtttt acgccggtcc tgtgaccatg ccagctttct tcacgaagaa ggcaacaaat gcct ga t gct t gaaag aggcagaaga t ccagat ct t gct t t cgt t c gt gagaat cc at gt cagat c at ccgt t t at acct cgccgc gat ct cat ca at ccat t ct c agacggt ct c ct t ct cct cc caaagggaac ttaagcgcgg t gggat ct ac ct ggccgcaa ct cct agcga ggat cct t cg t cacat gaaa caacaaccgt t gt agcgccg cct t aagaag cgaagaaat c t cgt cggat t at cgaaat ca t gt cccgct t cct agacat c 120 180 240 300 360 420 480 540 546 <210> <211> <212> <213> 461 1453 DNA Arabi dopsi s t hal i ana <400> 461 at gt t gt gt a t at gt t t at g ttgaacagca t ct t t at aag t gat t gagt t ggcct t aagc t gcat gt ct a t t t t ggagct act t gt t at t gcccct gagg ggt at gcct t aaat t t cgct gaaccat t t g tttttacaga gt gagggt gt agt gct t t at ct t t gt t t gg ct ggt ct t ca ct ggt aact c t at at t t at g aaat t cct at at gct aat cg ct caacat t t act ct ct caa t t t t t t t ct g tcacaaaggc t acggggact aagat ct t t g gt t t t t t gca t ggt t t ccat t t at gcat ac t t t cgt at t g t t t aggt cca acat gct ggt gt at ccct ca at t t gat t t t gat ct t t act caagt aacaa at ct t t t t ca aagcct at ct ggt gt gacca gt ccat gct c ct ct ct gcaa ggaact t t aa cat t t caacc gat ct aggaa t ct caagct t gcagt ct t ag t at t t gt at a tggcgaaagg cccaggaagg t t t t t t t cct ct gt gagt gg t t ggt gacac at t t t caat g t t at cgccaa ccgat ggt aa acat cact gt at t t agagct t t gt at agt a t gt at t ggt t agt t gcagt t cgat ggt agt t cct ct caaa aacagt t t ct cact aacggt gat t agaaac aagt aact ga aacacacggt t ggagat gat gt t t t gcat c cagt t gt cac 120 180 240 300 360 420 480 540 600 660 720 ctttgaccag cggtagtgtt catggattta cataaaagct tttgttttgc ttttctcagg Page 410 12689250 Sequence Listing.txt aact gccacc t t t cct aat c aacagat t cc accct gat ga t acccat at t agaat gct t a t t t acat t t c ct t gcggt aa t ct at t t aat ct cggggt t c t gaat ggt aa t ct ccagggc t t cacaat ca caaacct gct t ct t act gga cct cggaaag at gct aacag t gat t caaca aggaggccat gt aaaacaaa ct t gat cat t at ct t ggt t t aat t at caaa t aa ct gat t gcca gt agt gaacc ccaaact ct a ggt aat gacc aat t t aaaag t ccat cct aa gaact cagcc at t acat t t t accat cat t g at ct at t gac ct gat t gacc ggt t t at agc aat ct ct t aa t t gt t ggt ag t t t gaccat t t t agccagt c gaacagt t t a t ggct act gg t ct cacat ct t cacct t aga t at t at t at c t t t t gt ct at t gt gt acct t gat acat t t t ggct gt t gt t t at t t t ct t g cgt t t gaggg t aacct agt g aaacgcaggc cct gt gaaaa ct ccat t gat t t gcat t t t g gact t cgcag t t t at gct t t gt gt t ggct t gt ccat gcag agat t t t at t cat ccaat gt t cat t ggt gt ggccgt gt t g ccat gt at at t ct ct ggt aa ggggt ccaac gcat cat t gg 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1453 <210> 462 <211> 999 <212> DNA <213> Arabi dopsi s tha i ana <400> 462 at gccggcga cagcaggaag agt gcgt at g gcgct t caga ct cat ggaat at ggcagagt agtaaagaag agcctaagac gacacaacag agttttcagg gacttttggc tcttgctagg ggt t ct t gt a agaagt gt gg ccgt gt t ggg agt act aagg aggataagga gaaagatcct ttggagaaga ttaggagagg t gt t ggt aaa gaagaggaga gt gagagt t c t gat t ct gat gagaggtttg ggaagaagaa aggtgggagt aagaagaagc gt gt t t ct ga t gagt ct gat aggcggaggt cgatgaagaa gcggagttcg gatgaggaag agggaaggag taagagaaga gatgattctg atgagtctga ggatgaagat gagaagagga gaaggagaag ccgtcgtaat gaagatgata gaagacagaa gcggaggaac aatgtttcag gcgatgatgt ttcacgcgtt aagagcagga aacgtcatca caggaaagag cccgcaaaca gcgat t ggt t aaaaccgaag at t acgggt t cat t t gacgt ggt gcgat t g ggt gaggt t g gt t gat t ct g agt gt t aaga t ct gat t cgg cat aagagaa aaggagagga gacagacgt g cat t ct gat g aaagt cgcag ggacgt ggt t cgagagt ag at cgggt gca at gat cct t a at cccgagaa cgaat aacga t t cagt gt ag aagct gcggt aggaggt gag agat ggagag agacgagt t c at t ct ggaga ggagt t t gag gaggaagaaa tgaagagaaa at t ct gat t c ct t ct t ct ga ct t ct aagcg t agt agt gca tgcaccaact ct cgt acgcg t gaggct cgt gaat t t ct t g t t t gt cgggg t agt gaagag gat t at t gct ggt t aggaag taggaagaga t gagt ct gag gagagat gaa gagt agaaag agagt ct t ct ct cagaagca gt ct gagaag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 999 <210> 463 Page 411 <211> <212> <213> 12689250 Sequence Listing.txt 1606 DNA Arabidopsis thal i ana <400> 463 at gaacgacg gcggaagaga ct gt t t ccat t gat t gt t at gct t gt t gag t gat gt t cgc t aagt t agat at gat t t at a at gaat aaga t t agt ct gat ct aat at t gt gt at t act t t t aaat gcat c accaagcagc t caaggat ct at t t gat at t ggcaat ct t t ct t t t ggt ct cct t ct gt gt gacgcaaagg acaaaaat at aat t t cat gt t t t agat ccg ct t ct cgt ga t ccgt at gaa t gat ccat ca tt gtt aacag gagat gt at c aagcaaacga t t at cagat c cgat t t ggt t gcggagaaga aagaagat gt cgct t gt t ct gat t t gaagt aacat agt ct gt aagat t at t agt t ct ct g gccagaaaat aaggat caaa t aaggat ct t aat t gt t cag t ggaact t t t t t t t gtgtgt gt t aaagat t t gt t gcgt t g aggaat at gc t cct t ccccc t t ct cat cat gt t t gat agt t gggaaaat c act cccagt g ct cact gt ct at ccgt aagt gaggcagat c gat ct cggt t t t at gat at g t at t t gaat a agaagat cag gagt t t t ct g ggat t acaat t t t gact t ga ggt t t agt ct ggt gt agt ag t gaaaat t at aact gt cat a gt t ct t caag ct gaacgt t a gt t ggt act t t t t t gt at at t t t gat ccgt ct ct ct t gaa tcgtgaagag t ggt aaagca t cct cct aaa t act t t acat gt at t t t gaa gtgtgcgaga gt at gaaaaa aggt t t act t cgt t gt t cgg cagcagatgg t ct gct gaag aat ct ct aat at cgt aggaa gcaagat t at at t t caat t t at ct ct gaac t t gat t t caa aat cggcaaa t gt act gt gt gt t gat ct t g catgttttt t cgcaagacga gt cgcgat ga aaaacaaat c cct act ggag t ct gct gat g t t gt t gcagt gact t gggt t aaggt t cat g t ct aat gat c t t aaat t t t t cat gt aat ag acacact t ga caaat t ct ct t t t t act t t g ccaagt t act t gagat t t at aagt at gat c t caat cgt aa t t caacat t g gagaagaagg t t t t gat ct t tttgaaaaaa act t gt aat t act gaagact t gat ct agct t t cat t t act cagt gat t ac t at t gt caat gt at gcct ac cct gat gcat at t at gt aat t cat t t t gca gt t t gct t cg tagtggaagc ccccagaggt ct cat ggt ct at aaaagaac ct ct ggaggt t gcgcgt ct t accgcat aga gt t ct ct gat gcct ga ccgccaagaa t t t agtgttt gcggt t t at c agaaact cca agaagcaagc ggt t aagt t t t t agaagt t t t t ct t cat t g aat t t agcaa t t caaat t ct gt gt t at cgt t ct at gcagc gccat gaaag aaacagct cc t caaat aaat cgt gt cggat t t t cgaat ac gtt gaaagaa t gt act t gat t gct gt t gac ccact ggt aa at t at cagt g gt t gt gt t ag gat gt ggcat t t act t gt t a t cact cat t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1606 <210> 464 <211> 2258 <212> DNA <213> Arabidopsis thaliana <400> 464 atgtctcaga tctttagttt tgccggtaag cttcctgatc tctgaatttt ttttcatctt Page 412 12689250 Sequence Listing.txt tgaatttgat ttatcagatt gaaatttttt atttctctgt tagatttaga t t t gaggt ga ct t gat t ct t agt t t t agt t t at t ggat ct t t agct cat a gcgcaat at a at gt t t at t t tcgagaaaaa ggcgt t t gt c cgct t gt aag gt gat gat t g gat t ggcagc gaggtagcaa ct t ct ct cgg agat t gct ga ctgacaaggg aggct t gt t c t gaagt cagt t gt ct aacac t gt ct ct t gc t at cacacgt t aaact ct ct gaaagggct g acgggt t ggt gccacagggt aggct t t ct c cat ct cat t g caaat ct t gc tcaaggggat act t cagt gg ggt ggact ct t ct ccat t gc cagat t cagg t t t acgt acg cct agat ct a t t ggccgt t g gagaat cat t agaat t t ct c cccaaaccca ccct ccaagc gccagt t t cc tggaccacaa t agcat ccgt t gaagggt gt aat t gct gt t ct ct gct aaa gt ct ct t t ca gggt t gt gct t t t ggt agct aagaat t gga ct cgat caag cacct gt t cc t gt t gt gggt gagcgagaag gaccat caca cccgaacat g ct ct t t t gca t acccaat t t t ct ggt gaac cagcgct ct g agccatcggg aacagagt ct t t gt t ccaat t gaagt ct t a agcaaact gc gat t caagca aggaat aaga t agat at at a acgagct at a aggt t aat t t ct t t ggt at t aaggat gct a aagagat cac at t gat gt gc gagaggagt g caaaaggaga t t gtctagga ggaact gct g gt t t cagat c ctgtggaacg caact t gaga at t gct aaga gat gagggt t aact gt cct c ttggcaaaac cat t acggct ggat t ct ggg gcct gccaag aaaaaggcga aaagctt ct t gggt t t t t t g t gt t t gagt a cgct ct t t gt aagt t gt gcc ggt t t cct ac t t gact gat a aacat cgat g cagat t ct ca t t ggcct cct t t t ggct t at t at at at at a gat ct t at ct act gaat ct g t t acaggt ga gt ct t t t gt t gt gt t gt t gc t accagat ga ct t gcgct t t t t gat gt t cc gct t agat gg gtcgtggggg t t ggt ct t cg t t t ct accat agct t gagct gct gccccaa t gct agccat t t gtcaggga t t aagct t ca t gt cgat cac t cat gggaaa gagt gact ga t cat cagt aa t at cact t ga gt t ccct t t t t t agagat ct ct at t cgt aa ct cagct cga at ct gat t ca gagt gat ct c gat gt t ccaa gt gat t t gga ct gat aagct aggat agt gt t at at at at a gcaat t t ct c at t t t gat t c aaat gat t t t at cgct t ggt acct acgat c gtgtcttttt t gt ct ccaaa t t ccaagat a gaagaaggca act t ggaaaa gt ct at t ggt t act gacaat gaaccgct gc ct t gact gag t gcaagat cc t caaggaat c gat gct gaat t gat ct t gt g t ggt gt cggg cat ggggct t at cccct t t g gagt ct t cag gaact gt ggt caccacagga ct gccct ggc ggat at t gat gagct ct ct t t gccat cact t at cact gac t at t t cgaaa caaact gcag cgt t t ccgt g t caccgat ga t at agat ct g t gt t t t ggt a aat gat ct ct t accgt cgt g agt t t cgct g t t cagt gct t gagat ct t t a cagt ggct t a act gaagat g acagat gt t a t t gt cgat t c cgt agct gcc ggact t t t gg t ct acaat ca ct gacat t gg t gct ccaagc gcct ct ct ac gt cact gat g ct cgct ggat ct gcaaaaat gaat ct gt t g t t at ct gaca ct t gaagaat gaaaagt t ga t t gcct gct t t t t ggt gat g ct gt gt gggc gt gaagat ca gct cgt aacg gccagcctgg t gcgcaat ct at cct at cag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 Page 413 12689250 Sequence Listing.txt ttgcaggttg ctctatggtt acagacaaga gcttgccagc catcgtcggg ttgggttcca ctctattggg attaaacctc caacagtgtc gatccatttc caattccact gtcgacttct tagtcgagcg t ctttacaaa tgtgacatcc tctcctga 2160 2220 2258 <210> <211> <212> <213> 465 1524 DNA Arabidopsis thal i ana <400> 465 at gt ct acct gcgct cgt ct ct ct t t ggt t caaagt t gct tgat t t t t ct t at at gt ct c t at ggaact g gt agt ct agg t t t ct t ggt g t at cgaaat t t act aat t gc t ct at gt caa agt t agat t g cgtacgggac t ggt gat gaa t t at t gct gt acggat at gc t ggccat agg agat t cact c t at gaat cag aat at gt gca ct t ct gt gag t at caat t ca t t t gat gaca at t t t cgct g gct ggt cagt <210> 466 <211> 805 tcagtggcga t ct cat gt at ct gaat t gt t t t t gtgcttt t agaaat t t c t cct t t t cga ct ct acgct g ct gat ct t ct tccccgacaa acct agggaa at aagt acag ttatgct t t t t gacct t aaa agcaaagagt gt ct at t gt c t at cat cagt t cat ct ct ct t at cgt t ggc t t at agcct a aat at cat ga at t cact gac at gt ct t gt g at t gt gagt g gggccaatgc aagct ct t gc ccagagccga t gagaccgct at ccat ct at cact gat ct t agat cggccc t ct ct agat t aaat t ct agt gt t t t aaat c agaaact aag aaaaaaaaga at t t gct gga at t t accgt t t ct t cat at c gagt gt gt at ggtgtgggag ccagt t gt t a accgggatt a t ct ggt ct t g gat gct ggt g at t t gct aaa gt t t agt gt c t gt t gaat ct ccat ggcgt a t ct at cccct tcaacagcca t ct at acggt at ga cct t t ct t cg act ct ct t at gt gt gt gt t t t gagaagt t c t gt t t aggct t t at t at ct t ct t t t t cttt act t t t t ggc at t t ct t ggt gt t gccaggt gt t accat ca gaaggcatgt ct t t gtgtta t ggcat ct at t ggct ggt gt accccaaggc ct t gt gggct t caggt acga agact caat t acaat t caac at aat ggaag t t t ct ct agg gt t at at cat aagct t t t cg ct cat cgt cg gct t cct t gg aat ct ccat c gt t gt t ct ca accgat ct gt aagcgat at c t agt cagt gg gtct t t t ctc t t agat ct gt gt cct gt t ct t t t gtgattg aaat t gct ga t gat t act t g t t aacaaggt gggt gt gat g gt t gggt at c t aaat cct ac agct ggt ct t t cct ct gt gt t t at acaagt t t gat gaaaa aaat at cagt at gaaat at a t t t gat acgc t t ggt at gat ggat cat t t t cgct gccgct tt ctt ct t t t t gat ct ccct gt gt t gct ca act gaact at at caaat t ct cat t ggt gat at gt gt t gaa t t gt ccaaca ct gt gact gg gt t gct t t ca t gt aaaccat at gggagct g aggccggagt t at ggt t t ga t acct ct t cg t cagct ggaa cct t ct t t gc at at ct ccct t t t ggtaggg t cat t acat g at ggaagaaa aaat t t gt gg t ct t at cct c gt cct cccga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1524 Page 414 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 466 at gt ct acgt gct ct cgt ct t at cat t ccc ggcct act aa at t t gt gat t ggt at gggag at gagacct g at t t at ggac t act acct at ct ct cagct g agagaatgt t cgat t ct ct c t at ct t t gcg agct ggt cag <210> 467 tcagcggcga t ct cct gt ac t t t gat ct ga t t gat cgat g gt gat t gggt ct gct t at gg agt t ggt t at t t at t at t gc t t gacggat a gt at ggct at t gaat ggaaa agggcaaat g gaagcgctcg tccagagcag t gaaact gct gccat t t cct t ccagat gat t at t agt gt t cgggcgtaaa aacagcaaag gaagt ct at t t gt t at cat t cgct cacct t t ggaat t gt t acgt t aacgt ct cagcagcc ct ct at acgg aat ga ccgt t ct t cg t ct cct ct ca t gct t ct aga ct t agat cag at gagat gt t agt ggt gt t g gt ccct gt gg agt accggca t cgt ct ggt c ggt gat gcgg cat ct gt at c t aagct at t t cct aat t gt a gct t cct t gg ct cgat t ct c gact t t t t at aat ct ggat t gt t act ggat gt gt ggct t c t t at ggct gg t caat cct aa t t gct t gt gg gt gt t aggt t t t gact t gt t gt t gggat ga ggcat cat t t cgccgccgcc t t ccaat t cg gagaatcgt t t t aggt at t g t t t ggt t t t a aat gggagt g t gt t t t gggt ggct aagt ct t ct t gct gga t gt t cct cac t t t ggt t t t g t t ct t at cct t at ct t ct cg 120 180 240 300 360 420 480 540 600 660 720 780 805 <211> <212> <213> 1139 DNA Arabidopsis thal i ana <400> 467 atgggaagaa t t ccgat t cc t ct gcgat t c gat accaat t gt ct gagct g agatcaaggg t caggat ct a at t t ggt gt c ct t gcaacaa t t catcct t t t t cagactgg t t ggacaggt ct ct t cgt cg ggt at gt t ca gt aagt t t ct t ccat ggat t agt t t t t ct a gat t gat t ag aaat ct gaat t aagccat ac cgat gt t ggt atgggagaag gt acat ggt g ccat gt t ct c t at gagaggt t ct t t t gtct t gct aagt t c aagat t gat c ct cgat ct ct t aat t aacca t t cat cggat gt t t t t at at aat ct gaat a ccaaagt ct c atgaagagga gagaat gt gt aagt ct gct g aggat t aaca gct t t t ggt a gt t cgt t gca aagt t ccct g ct t gaat cat tgtgacttct ctgtcaggat ttctgtgttc aat gt t ct ct aaat gat t ag act gt gat ga t t gat t t agg gct act gt cg agggt gt t ga caagt gaagc gaaaagat gc agat gct t t c aagct t t ggg aggat gccca gt cgt caaaa t t t gtt gttt ct t t gt t gcg gt t t at at ag t t t t gagct a acctgcgagg t ggt gt gcca t gagt t t cca act t gaagct t t t t cat t t g gt gt gctgga t act t gt gct t ggt caccat gat t at t gt c gat tgt t t t a at t t t gagat t t t ctct t t a t at gagt t t g t gt t accgt c gat ccaaaaa t t ct gt gt cc gcccgt at t g aggat t aggg gct gat aggc cgt gt t gct a gctcaagagg agcaggaaat t ggat acaag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 taagctaact catcaagtct ttgacttttt tttcaggggc ttcacgaagt ttaacagagc Page 415 t gact t cacc at t at t ggct ggaaat at ca gccat ggacc aagt t gaggc t t acat caga t t t gt t gttg t t t ggct aac 12689250 Sequence Listing.txt aagagaagcg t gt t gt ccct gat ggt gt ca acgct aaggt attgtcactc tttgatccta tagcagtgat agttatcatt aattgaacaa tgttttcgtt tttttttcag ttcctctcat cgtcagccgg gaagtgcctt tttgccagcc cactactga 960 1020 1080 1140 <210> <211> 0 <212> r1 <213> 468 1091 DNA Arabidopsis thal i ana <400> 468 atggcaacgg acat ccgt cg t t ct ct cct c tttggagaac ttcacccgga t ggt caaat t t acaacat t t t t att cat ca gat aact t at ccat cat cac at t at aact t aacggat aga gat t t cgttt cat gaaccat agt t t cct t g t t t cgt gat c t cat caaaga ct gacgct t t aagagcact g cgatt gt acg ct cct aagcg t ct at t t t t a gcct cgat ga tt ccaat t t t t gaaatt gat ct ct gt t gaa aagt t t t gt t ct gggt at t g ggcgaagacc ct gat t t gga agt at gcaga at ct cat t ca t att gt ct ac gggt at gt gt t t cagct t t g tt at ct ct gt ct at t t gcca t t cagct ct t aaact t t t cc ccat t t t cca gt t t at agat t gaaccgat a t t t t ct ccat aggat cgat t ct t t t t aat a ct agt t gcac ct cct gt aag at ct cacct a t gat gagaat ttctgct t t t t t gt t gcagg act t t t ccag at aat gct gt aat at ccgt a t t ct caggt c tcccgagcag t ct t ccgccg ttgacgacga t cgt agat t g cct aat t t t g aat at ct gaa t t t t t t t t ct t t t cacagat t gct ct agct ct t ct ct t ac gtgtgagaca gtt acgaaag t aagt t t aag cct at ccgca t t t t cct at c t t agt t gt ct at gggt cggt cggat ggt ct t gact cgcgc gccat gacga t ct aggt t t t gt t t t gagat aat t gat t t g gcgt ct t at t t ggaacat ga gaagctgcga gt ct at gt t t aact cct cca at gt t aat at ct acat t t t t cat acat t gc t at gcacat c gt ccat ct t a ct cct aaaaa t t ct gaat ca gt t t gaggt g agct ccgaag t gct t gt aat ct gat t t gat t cagt at aat gt agat cgat ggat caaat c t aact t t t ga agtgggagaa t at ccaaggg tt ggt t t agt caccagt gaa t gcct t t act ct gaaact gt cgcaacaagg ct gt ct t t ca cctt aagaga gt t t ct gaat aagcacaaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1091 <210> 469 <211> 1642 <212> DNA <213> Arabidopsis thaliana <400> 469 atggctcctc atggagatgg attaagtgac atcgaagaac ctgaggtcga tgctcaatcg gagattcttc gaccgatctc ctcagtcgtc ttcgtcatcg gtactttatg atttatgaag tttccgattc gatcgatttc ttcttcgtct tcttacttac tttttgtgtg tttctgacgg agctgcagct atgcaagcgg aggctcttcc tttggtcaac aagttcggac tctctgaaac Page 416 120 180 240 12689250 Sequence Listing.txt tactgattcg ccgttaagag ctctcctttt ttcgattttg cttgtttgga tttgatctcc at gaat ct ct ct gt at cacg gct t t aggt a t cagcat t t t aaat gaat gc ggaact gt t c at aat caat g ct gt ct t gac gt t t gggt t t aaagat ct ct t gt aat gaaa t at t cct t gt ct t cct t t ct aagt t cct ga cgt caggcat ct caagat gc t t ggcaggt t t t gccaat ga t gaat ct aac t cagggt gct agccgt gacc gacagt t gt g gagaaacct t cact t caaag gcgt gcat aa agct t t gcac agt at agt ag act t t t t t t a cagct t ct ct ccggaacct g t t ccagcat c gat t t t gtta ct ct ct ct cc gt gct gat t g at ct gat gt t tt ct t t t t t c gcaat gt t t g t ct cgacacc t act t act ct at ct act ggt t gct acgct a cagaaaat ca gccgtggcgt gat ct agt gg accgct gcat t cggacct t t t t t act t t t t agat ct t cga t caat gct ct ct t t ggt t t c t gct act aac cat aact t t t cggt ggct t c ct t gct aaga ct gct cgt t t at at ct ct at gct gaacat t gt gt t t cat g cat ct t t agt t t t at ct ct c caat ct cct c t t t aact at t gact cgt t gg aaggacatgg at gaaact gt at gt ggct ga acggagat aa tagagggaac aa t gcaggct t g at caat gt ag caaagat at g t caagt cgct t gagat t t ct gct t ccat cc aaggt acagt accacgt cct act caact ct t t t t at gt t g t gcaggt caa at agaagaat gat gat t ccc t agat gt t t g aaggaact ca t caat t t t ga at at gt ccac aggt agt gt a t gt t t t gaaa t ct t ct gaaa acct acagca t gct act aaa gt aaaggat t tttgccccgg at aat t agcc gcat t gt t t g ct ccagggat aagcat t aaa tct t t t act t ctct t ct t t c gagat t t gag t gaaagt t gg aggagccaac accaat t ccg tttttgccag at ct gt at gg at t t gaaggt caaaat t ggg gcaagat gaa at gt t aacaa ct ct t t t ggc at accagt cg gaagagt t ct gt gat caact gccct gggt t aagagat gca t gt gaat ct c gtatgaggga cgat agt gt t acct gacat c ct cgt gt gct t t t act t gt g t t gcat cat a t t t at t t t ca at aggcgat g gt aggt t cat t t cat ct t gc agt t ggt ct c at t t aat aga gt ggt t caga acat t gat ca ttct t at t ac t ct ggat t ct t gt t cct caa t gcagaact t t cat caat gg 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1642 <210> 470 <211> 2478 <212> DNA <213> Arabidopsis thaliana <400> 470 atggcggccg cgaaactcgt ggcgttgtta acttgtatct ttgccgacgc tggaatcgac ggcggtgata tcgaattgga tcagcttaat aactttacag ttacttcaat tttacttcta gtgaatttct caaattaatt tttgttttct gatgataaaa ccaaggaact taaaggcaga cttcttctcg ccttggtttt ggcggcgatg aaccgaaact gctaaaatcc gtgccctagg tgtcaatcgt gaaaatttgc tcttttttgg ttggtgaaga gaagagttag taacagagaa Page 417 caat t t cacc t agat ccgac t t acact cag t at ggct t gt at ct cagat t ggagaaact t 120 180 240 300 360 12689250 Sequence Listing.txt cttcaagaga gacaagacaa agttgcttct ttggagactg aggtttcttc t gcat gcat c at at at caat agct caagca t t t t at at gc t t aagaaat t aaaccgagaa t ct ct t ggt t agt agaaagt aat gaagagc t gacgacgct cgact ct t at caaaggct aa act ggagt ag agat t t t ct c acgcccacgg gt at gt t gat aggt gacgt t aaact gt aag t t acgaaat t ttgagccaca gt gcggt t ac agt at at t t t ct t aat t t gt t caagt ggcc cacaacgaag ccaggt t aga at t t ct agag gccaat gt cg t t cat t t ggt gct t cct t t a t t acccat ct gaat cat t at t gct ccat cc t t t t agtgt c ct t t gt t t cc cgagct act g acgt aagt t t tctggagcag gaagct aaac cat t t t ct ct at gt t cat t a agaagaacaa ct t ggat t aa caaaat aaaa ggaact gat g t t t t caggt g aat t aact t g caaaccagtt t acat gact c agcaaaaaat t aact t acag ct at t t ccag cgt t caaaca accacat at t t t t ct at ct a t t t gt t t at g acagccacaa act gt ccact atcggaggca gaat gt gt t g aaagt aaact t t gcagt at g t cagct t aaa t cat agt gt a t t at aagat t t t gat t ccac t cact gaaat agaaaaaagg aat t agagaa ct aaat t ct a aaaaacaagg gaat t gaact t gacacacat t gcaaat t t t aat acgaaaa gt t t agt t cc t t ataggaag gaggt t cat g agat t aat ag gacact t cct atggagaaag tgagagcaaa caagct gaaa gcaat gt gt g aaat at at t c ct at ct acca gt caagt t t c gat at t aat c ct ct ccagga aaccacat gt at t acaaaga ct gt gt at aa t ct t t t aat c gaagagccat t t ct ct cgt c at t gat aaac caaat t cct c t aaat t agt t caaat gacct t gct t ccat g t t cat cagat gcaggtgagt cat t at act t agaaagaat t cgagagtgga t at t at t cat cacaat ct at ct t gagcgt g ccct t aact t aaat gt t gcg gggcgtggct at t t t at ct t ggact cagac tt act caaaa ct t at gcct a aat gggct aa t t at t act ca cagccat aaa aagcgaaaga aagaacat gt t t t cagct cc ggct aaaaag t gat aaggt g gt t cct ggaa t t t at aggca t gat aact ct gagct gt t gg t t gcact ct a t gt t cact gt t gt t cgct ct t t cagt at at ttcgt t t gt t t t cagat at a tctgtggaac t cat t at t t g gt gaat t cag gat agaagct gaaagt at gt t t caaat cat t cct gcagct ct ct t aaaat t cggt t t ct g gacaaagcat t ccccct t gg t t acgact t t t gt t gct ggg ggt at t gt t a act aagt t t g accgcacat g ggaggctct t agagactgt t ggct t accat agat ccct at at cgat gat g tttagcaagc agagct acca t ct gcct caa t t at agt t at gt at at at gt aacct t t t gc agacaacat g aggcct ct gc t ct ggcaagt t gctct gt ag t gt t gt at gt act aagggt a gat t gcct t t t gt t gagt aa agaacccat c gt t gaggt ac cagact agt g t cat t at aca t t aaat agaa tcacaagaca t t ct gaggt a caat cact ct gaagccacca t t t gct gt ac ct t gcat cat acacact ggg aat ccaaat t at cat ct t t t gct aat gt t a at ggt t at gc aaaacgcat g gct t ccaaga t accaggt ga acaat at gcg cgt acgt t ga t gaagcct t a ct t at cacca gagt agt gt c gaagct t caa caccaaagag ct t t t t t gt t t t t gt t ggcc t t ct t at aac tt ctt aaaga tt att gcagc 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 Page 418 12689250 Sequence Listing.txt acaaagacga agaagcctat ccgacagtct catcaacata ctcgccgtaa ggccagaagg ggtcattctg acaaatga 2460 2478 <210> <211> <212> <213> 471 4301 DNA Arabidopsis thal i ana <400> 471 at gccgt ct t cct acgt t cg t t t ct t t t aa t t t gcgat t g t t ct at t cat t cat t gt gat agat t ct cgt ct gagt t gt g at agt t ccga gt gt cacaca t gt gat at t g t cgcaagt ag t agat aaagg t at t t ct at g gaacgt agcc at ggaat t t t gct t t t ct gt ggt caccct a t t acagt ct t t gaagt t ct a gt t t caat t c t t at t ct ct a at ct gt ccgc agct act ggt aat t cat gaa tttggaaaca t aaat t at t c gt aggat at a cct cct ccgc cat ccgat t c t gcagct cga aatttttttt t gt gat t tag t t aggct t t t t gat t t ct gc t t gt aat t ga ggt cgggt t g t ggct t ct t t gagat cat ga t gt gacat t t cgggt t at gc gaaccat cat t t t at cagt a ggt t ct t act caagt caat t t gggt aaaca ccat t t t gcc ggt ggaaat g t t at acaagc gt t acaat ac t t gaact cga t cgaat t t t t aat aat gct g t t t t gtatct cgat at ct aa t aat ct t aca cgccgt cct c agat cacaag aat cgagcac cat ggat ct g gcgt at ct t t cacaat ct cg acgt ggt agc gt ggcat t t a at gat t t t t t t gct gat ggt gat ct agct t aaagt cgat t at t at t caag t t gtgtgaag t ct gt ct gt c ct ct ct t act ttttttgcct aagt t ggccc gcccat ct gg agct gat t ga ttct t t t tca ct t t t ctgt t t gaagct aag t at gggt at g at t t at t at c gct gagt t t t ct t cact gca t aact t ct t t gt gt t cct t c gt aagact cg t gt gt ccgt a t t cgt t t acg t cgt t gcaag at t t t aat aa ct gt t t gat g at gt at t agc t caat t ggt a cagt t t cagc t aat gct t t t ccaact at at agcaat ct gg act t t aact c t acgt t t gac agt t aat ccc t cacct t t t t at at aacaat aaacaat gt t cagt gagat t t act agt t aa ttttccacag gt caagcat t t t t cacgt ct t at gat gt t g cct t t ggt t c t t accaat at ggat t gct aa ttctcgt t t c gat ct ccagt gat ccgt gat gat t gct agg t ct aggcgt a gaaat t t t ga t t caaat act at ct t cgaga t ct aaaat gt t caaaat t gt gct cct ct gc at t t t ct gcg t agcat t at t ctgat t t t t g at t t t gt t ct at aaact t t t cagt at caag ccacaagaaa cacaaat ggg gcgat aaaat cagt t at cgt agaat gt cga t t aaagat gc gt t gt acat a caat t t cct a at t t t t acat cat t t acggc cct t at t t ct cct t ct cact cgt t t t t t ct t t gcat ggt g at cct cgagc t t t t gtctat t gt t ggt t t g t ggat caaca t gt agat ct t gat act gaaa t at t gcct t t ct t ct t acgc t gaaagat ga caagagccat aaat gat at t gt t ct ct ct a acat t t ct gt ccgaggagca cat acaact a gt ggt ct t gg t cat gagt ac acat t cgt ac gagaagt gt c cat t gaat ct t t accact ga ct at gt aaag cat gcaaat c at t ggat gt t cacct t t t t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 tgtggcatct gattggcgtg gactttcatc gggtggt t ct aacttcagat gatctacctt Page 419 12689250 Sequence Listing.txt t gt ggggt at t gt t t t at at cct gacaaaa aaat acaaca gt gaact aca caagacaacc t ggat t ccaa tttgaacacc ct gt gat t cc t t gt t cat aa t ct ct t caag t ccact ggt t caat gat at t at ct t gagag t t t t agct gt aaggct t cat agaat ct ggt gct ct cagcg aat ggccat c accaat t t t g cct ct t at t c ct ct gact t c t gact t act c t t t aaat gct t ct t cccct t gat ct ct t gc t t t cct t ccc acaat ccat g ct ccat cagt t gt act t t gt acaaagt t at t gagagact c t aaccagcgt gaact t t gt a agt t t t aat a at agt gaaaa aagaccaggt t ct gt gat ct caaggccat t caaat gt cac aggt aact t t aaat ggt t ca gt gcaagcca at t t ctgttc ct ccat ct t t gat gaggact ct t ggt gat g t aat act ct c agccat t t ct t ggaaact t g gt agt t ggt a gt gggaact c tgt t gt t gt t t aat t at t t c t t t t gtctct gact t ggcgg ggt t t at t ca t t t gtgcttc ggct at t cct ct t agccct c ccgt gt gaag cgt ct ccct g ct t cacat cc ggaat t gcaa t caact t t ca t t gt aat cca t at t t act t g ct gt ct cat t cggcaagcat cggt t gt t ac t at gt at ct a ggaagcaggg at t t gct cgc t agcat gct a ct ggt t t at a t aat t at t ct aagct t t at a aact cgt t ca ct cagaaat g t t t cct caac t agcagt ct g gacgt gagt a tgcatggaga caggt gcaca t ct acgt t gg t t ccct acat aggagaggag ggt t at gt ga acccct ccag ggt aaacat t ggaat t ggct tttgggacaa ctcggaacgg act at t ccac at ggggggt t t t ct ggaat t at t ct t at t c acgaaacccc gccct cact g aaacatccaa aatttat t at gt t ctttgt t gct gt t gt ag gt t ct t t aca cacgccat t a t t t t aat aga aaaaaaat gg cgct t t gacg t gaat t agt c at gt at caca t ccaagaat c agcct t ct ga t gat ggt t at at t at gcgaa t t t t at t t gt t ccaacact t t t gatgggca t gt gt t cagg gt t ggcgt t g gt at gt at t a gcggt t ggct caat cgt cac gcggt ggaat tgt t act t t c ggat caagt g tt ct cct aaa t ggt ggt t gt t t gt t ggaag gt ccaat ccc t gct accct t acaaggt aaa caat t ggcca t aaaaat ct a aat gcat at t gct t t gt ggg cacat aagaa agcaat agt a t cat t cat gt acct gacgt a t at at ct gga t t gt accct t ct gaat accc t accaaagt t gagt t t gt t c t t t cct cacc gt at gct aga cat t agt t gc ct t t ct t t ac caggaacggg ccagcaagt a ct ggt t ct t c cact at t ct t t gt at at t gg aact t t cat a gt act caaga ct ct ct aact cat ggt cct c cacaat t gcc gt t t gt aat t aaat t ggagc tgagaaaaaa t ggaagcat c t at agt t t ct aaccggagag cccccct t ca aaccaat cac t gagct gcat cat t gt t gt c t ggat at cat t aat ct cact t t ct gt gcaa t t at ccgt t c ct gct act gt cat t t cat gc ct at at t cca tttttgcaga ggt t t ggt ct gaagat gat g acaatt gt t t t t ct aaat gc at gt aagt ga gt ct agt t ct t t gt cat at t t t ct ct t gag at t t t gct ca gt gt gct at g agcgggggt a at t caagt aa acggct t ct c at at t ct at g t gggggt t ca ggt gct cct a t ggt act t ga t t cat t gaga ct t aat aaga at t t gatt ag gcaat ct gcc aacagct aca 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 ttaagaccta cgctagttgt agtagacttc tagctaaagt ttgagcagct gtacttcttt Page 420 aagt aggact tttttgaaag t act ggt t t t t cct gct gaa cggct gt ct a gat t ct t cca t cct t t gcgg t gact aaacc t gt t t gt aag aact t t gt ct ttgtctgttt t gt gat t ct c tgcagagaac t gt gt act t a gacaagct t c t gagct cact t t cgt t at gt gaggat ct ac 12689250 Sequence t t gct aat ca t at aacgacg ttgtttgttc gacaggtct a gtcatagtga cggt gt gt gt t at cact ggc agt ggactt c tactccatct attactacta t acttt ggat acaccatgat ctcttcactc tcttcaaact gaaaatttt g caggagctgt agaaacatca agtgcgact a Li st i ng. t xt aaagt aaat t ct at gt gt at gacaat cgt g at t t t t ctct cgt aaagacc gtt ct gt ct t agcgagtggt t ggct acct g gt agt ggct a ggat t cat gt ggt acat at t gct gct t cga aagat gt ccg ggcct t ggaa gt t at t gat t ggat caaat c 3840 3900 3960 4020 4080 4140 4200 4260 4301 <210> <211> <212> <213> 472 950 DNA Arabidopsis thal i ana <400> 472 at ggat ccag aaaggt agt a aaagat gt t c at gat gggaa cagct ggtt c gt cgct ggag cgtt cgt t t t at gt gat t t t tgtgt t t t ac aaaggaat t g cct agt gt ag aggaaact ct t gat gct at a ggt t t t cct c t agct gt t ac gagaat act a ct gaaat gag t cact ggt t t caagagtgga cccat gggct agaat t t t ag ct t ct gt gct cat t gt gttt ggtgaagacc gagctt agt a taatgggaag caat t ct at a caaaat cat a gat t cgt gca tttttttaca ct ct gct t t g t cct t acacc at at t t ggaa t ggt gct ggg gagaaat gt g gact t t t gct at ccaagaga t ggct at aga gcat aaaaca at t t gaagt t gt t at t gat g aaat t gat t c ggat agcaca t at at gt aaa t t ct t at gat gcaaggagca at t gat t ct g gt cgagaaaa gaagaggat g act at t t acg gct ct t ccag gct at aggag gat t t t t aca ggt aat ggt t at gt ct cgat act ggaat t a gacaaacaat t t cct t acct agt gt caaga gat cgaaagt gct at act t a tcccgacagc gaggtcagac gagctgaagc gt ccgtt gat gaaccatttt gactt att ag gt gt t t acat at ggt gct at t t t ct t cttc ggt t aagt ag gcagt at gt a gat ct cgagt caagt t aat g ct t t gat at t acat agct t t at t gt t at t t gat agct gca cacaagagt a t gat t cct ga gaagacaat c agccacat gg aacact gaag cggt gt t gaa t ggt ggt t t t act cat ggt t tt t agaact g gt t cact t ca gggagtaact tt at t t ggt g gagt gt at ac gagtgagaga ggt t t actt t ggt gcaacac gacaat ggca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 950 <210> 473 <211> 964 <212> DNA <213> Arabidopsis thaliana <400> 473 atggcttttg ttgtaacatc cctgatattc gctgtcgtag gcatcattgc ttcgatatgc actagaatct gcttcaacaa aggcccctcc accaatctgt atgtcaattt tcattccttt Page 421 120 12689250 Sequence Listing.txt ttgcttcttc tccctttgat ttagtcatct ttttgctgtc tttgatccat aagt t aat aa aact at t at t t t t gt t cct t act ct t gt gc act t gt t t gt t ct gct gt t g act gct t t gc ct ct ct t gt t t ct agcct t t t cat t t at ct t ct acaacat caat t cat t c ggcaat t gt a gt ag agt gt t ct t c ggt t gat gac cct gt gat gg t ggaaaaagt ct t gt t t ct a gat gat gt aa at t cacaacc t gat t gt t gc act act t cca gt ggaact t a gaacat t at t at t gt ct t ac t acat t gcgc ggt at caat t t t t at at t t g gt t t t t t t ca gt t gat ct t t t gaaacaggt gt t t atgttt at gggaat t t t gat caacat t t t t t gtt gt t ct ct agt ca ggt ct t ct ac at at cat t ac agat gaaccc gagat cat t a ct cat t gact at gt aagcaa ttccct t t t t t acat ct t ac cat gcact at gt at ccagcc t aaccat caa t gat aggat a at gaat t agt t aaacgct gt atcgt t t t t g t ct cat t gt c t t t at ct ct c t t t gct aat t ct agt accgt t t at cat t ga at t ggt cat c gt t t cct ct a cgt t at t t gt act at caaga acgt aat at c at t cat gt t a gcaaat cct t t t t ct t cgca cct at ct t aa t gct t cgt ca t gagct t aga tgt t t t t gaa cacagt t ct a ct t ct t at gt accgcaact g ct t gct agcc t t at ccaacc t at agat cct t t gacact t c agagaaagt t t cct t at at g t t ggcaggt g gcgaggtgga 180 240 300 360 420 480 540 600 660 720 780 840 900 960 964 <210> 474 <211> 789 <212> DNA <213> Arabidopsis thaliana <400> 474 at gat gagaa ggcaacgaga t gaacl gttttgaatc ttctccgttc tcct c ccgcccgcta gaattcgtcc tccct ttggctcata tttctccgtc tggat atgctatgtg gatctgtgac tttct ctgattttgg ttttctatgt cgctg! at t ctctgtt acgtct t gac gccac gttgaatttt agggttaagt ttgtcl tctcggatct gtttagattt taacc ctgggttttg gttctgtcct tatttt aaaagcacac cgatgtttgt tctgt gtggatcata ttattcatgt tcacal at t agtttca tgt t ct t gca ataat ct t ct gt ga aacaa tctagggttt tctacgatct at catctctt t at g t t ct cgcg cat c gcat c ct t ct aaat t aaagt tgat a t t t g t act t t t g ccgat t t ct t t cgcat cgt t t cgt t gct gc gggt t t ct ct gt t t ct gcca cct t ccggaa ggt t t caat g ttcct t attg gaaatggagg act gaaacga t t t gt t gtct t gt t t gacat t acct gat ca t t t cgtcttc t t ggt at at c t gat gcct t g t t t ct at ggt aggagat t t c at t caagct t at cat ct aga t t cgaatt ag t att gtt ct t gaaat t aaac gt ct at t cgc t t t ccccgat t t cgcct t ct t gt ggct ct t ggt t ct t gct cgggaggtcg t ggt aat t t t t ct act t t cg aat gat t cca ggt t t t at t g t t gt t t cct c ct at gt t t gt agaatggaag 120 180 240 300 360 420 480 540 600 660 720 780 789 <210> 475 Page 422 <211> <212> <213> 12689250 Sequence Listing.txt 1196 DNA Arabidopsis thal i ana <400> 475 at ggcgact t t caat t ccac t t cat gt aaa t gt t gt t agt t cct gat t t g t agt t ct cga t aat gggt t t t caaacaat a gt ggggt at t t cct caacaa ct gt t t t gaa t gat cagt t t t t gcat cagc cggt aagt t t ct t at cat ga at ct gact t c t gt t gct at g gaacat ct ga aaat aagcat t gat ct acag cgaagct t ca gat gat t agc at t gacgat t t t t gt ct t t t t t t ct act aa ggat ggat t g cat aagt gat agact ct cat agcat t gcca at cggt at gt t t gt ggact t cat t gt gt t g ggt t acat gc cct t acat t c t ct t ct ccct t t t gat t ccg gcagggacag t t t t ctt ct t t t ggt t gat t acacgat t at agct at t t gg t t t cct cat c caat t at gga at aagt t cca aaaaaaaaca at t t gt t gct agat t ccgt t t t t gt gt t t t acat t gcaga at t at t ct at t t at aggct t cact gcct t c act ggt gt t a cct t t caaaa t gcgt t t gcg act ct t gt t t gcat at acca act t t t acat agt t gaagt t gcaaccgaag aat caccct g gat t t cgat g t aat t gagca ct t t ct cgt c aacacttttt t at ggt ccag accaat t cac ct cagt t cat ct t t gccaaa cat ct t ct t a at cat at t cc aat gt act t t t t t ggt ct cg t t caat ct at at gagat gt t acagaaaaac gct t gct cgt gt t at gt aca t t ct ccct t g ct gagcct gc ccggacct aa at t cat t t t t at ct acat ct t t t t cgattt gat gaat t t t gagaat cact gt cact cgt g t t ctgggcac cct ccagaga at ct gcagaa ccat agt t gt caaat at t ct t t acagcat g t t t t agt at t agt gat t gt a t ggaacct ct aaaat aaagt t ctt aagaca ccaaact gat cgt t gcgagc aaccagt gag t t t t t gt t gg cct t ggaat c t t ct ggt gaa gact t t cgat ct t t gct t t a t ct cagaaca caacgt t caa aact t t cct a ggt t act cct cat t gt t cct ct gat at t t c gt aat caat c gt aagact ct t t cat ggt t g t cagt gt t aa aagat cgaga cgact t agt c acact t gct t gaat ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 476 <211> 349 <212> DNA <213> Arabidopsis thaliana <400> 476 tcaacttaat cgtttctctc aaatttaggg tctgaatcat ctctatctgt ttgagggttt aaacaaggag ttcataattc gcgatcttga gaatgagatt atgatggagt cgaaaggtgg attttacgaa gctcccctcg gttacagcat gaaat t caaa t ct t ct gt ct act caaact g ttttctcttt tctcgaaagt cttgcggttt cgtttgagat ctggagaaag gggtttctgg t ctatcgatc ttcatttata taaaaagcgt taaaaagaag tccagcagta gtagt t cct t tgaagacgtt cgtccaaacg gtggaatcaa ctccaagagg ccatcctga 120 180 240 300 349 <210> <211> <212> 477 1211 DNA Page 423 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 477 cacgcagaat t agt t gccaa cgat t aggt t at ggt t gat a t t acct t gcg aat t t accac aat t at at at aagagaaaga t gccct ct t c caaat gt cat ttgcaggaga at cggaat ct t ct caat at t acgaaagact gt aagat gac at t at ct act gtaagaggcc t aggt ct aat t ggcggt aca aaagct gact at aat ct ccg ct caccact g aact at ct gt t t gct gat at cct t gt cacg t ct cct t ct t agact aagca aaagt at aaa t gaacaat ga aat t gcat aa ct gaaat t cc taaagagaga cct aat t t cc ct cct ccgag cct cgaat gt gccat accgg gt t t t gattc cat t agagt c ct cagat t t c t cct t cgat a ct acgct ct c t ccct t at gt at acagaaac cat at acat t t t gctgagac cat cggt at g gatt ccagaa acat t cacat agat aaaaac ct cagcat ct aaat t t cat c ttgt t ct t ac t at act ccat catt cggcga t act gct gct aggt t gt cgt acgt at t t ga aaaaat at at gat t t at t t c t ct t cgat aa t caaaagt cg aagct gacct aat gaacat c cgt gaccaat ttcaaccagc ggat act gga agct aaacac aaacact t gc t cacagct t c cat gagat t c aaaact t caa agt gcccat a agcat t ct t c t agagt aaca gaccaccgga ggacgtggt t at act ct ggg at t gaat aat t t cgagat aa at acgcaaga aat at cccaa t t ct gagt ac aat at caat t at gcat ccaa t act gat at a aat aaacaat aacagt gct a aat caaat t c agcagt gt ca t t t ct cagt a aat t gaacaa gt aact t cgc gt cgacgct t cggcggcgcg gt gaccgt t g ct t ct cgcct ct aaacgat t atct t gt t t t t at gat t at a gt gagct t ca at t cgat ccg ccacat act t t t t at aact t agt at gat aa aagat ccgt t gat act t gaa aat t caccag accaggcaag at gcaat t ga t t t t gt ccc t ggact acaa t gat gt t gag tcgccgaaac t gaat gccaa agt aggt gac t at cgccacc t gggcct aat cct aat at ga t t agat gacg cggt cagaca ttttccggcg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1211 <210> 478 <211> 1116 <212> DNA <213> Arabidopsis thaliana <400> 478 cctcagcaaa taagaggacg ataaggatcg gagat t cgaa gactctttat aagtcat t gg acaaattaac aacacatata ctacaaattc cgactactaa acgcgtttca atgactggt a aatgaatgat gatgtaatag tagatgctaa gt cat aaat a aaaat cat ct at aat gcgt g tttatttaaa act at t aagt atcaacatca acaactagtt tattaaaaaa attgttatca gt ct t cagct at t t gt agt a gagt t aaaaa aacat at gt a cat aagct ca t aagct t gca at cggaaaat t cat ct cgat Page 42z at aaacaagt aat aacaaat ccccaat at a act at ct ct g caat t at t t g t aaaaat aca gat t t gct t t t t t aat aat g aaagaaagt t taacaacaca at at at gcat t t acat at t g aat aat t aaa gt at at aact t gaagt t at t ct at at at ac 120 180 240 300 360 420 480 ttagtct t t t gggcaaacct aaaaaaaat a t gt t gat t ga t t gacaact t ttttacaaca aaaaat ccag t t t at t agt c t at t agccat gt cact gat c t t t ctcgttt <210> 479 at t t at t gt t gaccaagat g t at cct acat gt t t ct gaaa ccaat t t ct t at aat t gaaa t t t agat gt a gat at t gggt tcgagaaaca tcccacgacg gct ct acgaa 12689250 Sequence at t gtaatat gcgaaaatga tgggaagttc gaaactgcaa ttcttcattt tttttttaaa aatcataatt gagtttttaa ttaaatatat cacttttcat t gt cgaccca aaaat at aca tttgtattat aggggaaaac acatatgtat gttcttttac aggcatctct atttttttgc at ct cccaaa ct cat t t ct c aatcagccgt ttaaac Li st i ng. txt ct t gcaact g at at gt at aa at act aat at at t agt t ggt at at t ct t gt tttaaaggca aat t at at t a gat t at gcca t t ct t ct aat t acgt t cat c agt t gct t ac t t ct t aat aa t t gcat act t t t gt at gcat agagct at aa t t t cgct gat t t ggt t aat a t caaaaaat t agact t ct t c gat ct ct ct c 540 600 660 720 780 840 900 960 1020 1080 1116 <211> <212> <213> 1270 DNA Arabidopsis thal i ana <400> 479 t acaaat cca acagagat gt t t t gggcgt g t t ct t ccat a t t t agt t gca t cgat cagct agctgcgt t t aat gt ggt ac at t ggt at at aaaat gt t cg at t gct t acc agt gcagt t c tctgat t t t t ccct aaaaag gcat t gat ag gt t caat t t t gt t gt t t t t a ggcat at gcc ggt caaact c aagagat t cc cgt t at act t t ggat aact t gt aat at t ga aggaaaaagt gct gaggt gc ct gt at cct g gt at gt at t a gt t agt cat a gct cat gct g t ct caacaaa cgaat aat t c gaat t ccct c aaaaccact c t gt t aat t ga t t t gaat t t g gcaagt t gt t aagt ct gaga t t aacct aac agatgaagta aagaagttgt gccttatgct gatccaaacg ggaact ct gt ct ccaaagac caagact at t gaaaacggt t at acct act t gat cact t t a gat t cct t cc t aat agt at t t ccact cct t ttct t t t gat t cact caact t act ct t gaa at t acagct a aagt cat gca aact gacagt t agaaat t gt t act ccaacg cacggt t aag gt agt t cagg ttgaaaaaac t t gggat t t g t cgt aacat t agaaccgt cc gaaacaaaac t t gat gagt g at t ctct t t a t t gggccgct t ggt t ct ct c aact t t t gat cacgt t t act acat t t at ga t at agt at gc aaat aaaat t gggacacgt g cact gact ga at ct t aaagc t t t gcaggat tagtgaaagg gtgcct t t t t gct gct t ct t at agat t cac aaacat gagg at t aaaccgg t t t cat at ca cgt t gct t t c t ct gact ct a aat cact t at t act at gagg ggggt ggact gt t act act a aat t t t t aag t ggcacgt t g ct gacccct a cgt t gagat t ccat cct gt a t aacaagt gt t aaaat acga ttgt t t t gt c t gaagaaaat accat gct t g ct at t gt acc t agct t t aaa at t t t t t t aa ggccgcagaa t ct agat t at aaaaat t t aa at t gcgcaaa aagt t t aacg at t aaaagac ctccaggagg ct t aaccggt t t cccacat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 taataatctt gtttatctgt gagatattcg ccgcttcccc ttggccggct ataaatcgat Page 425 12689250 Sequence Listing.txt aacctcaccg ataaatcctc tat t catcat ccacaacaaa cctct t ct t c agtctgatag agat ct cacg 1260 1270 <210> <211> <212> <213> 480 1343 DNA Arabidopsis thal i ana <400> 480 t agat gcgct caaagccaaa gcaaaat ct a gagaacat ga ct t t gagat t tttccaagag t aagat ct ag gccat t t ct t at at at aacc t cct t cat t c t ct at at t ct at aaaagaca cat caaaaat cat ct aaact aggaagagt c t agggct t ct t at ct ct ct t cgcaat t t t c at ct aagat t caaat gct t t ccat t gaagg agaaatgcgg t t t t gaaat t aggagattcc ctt aaaatggaaa at t agaaagtacc acat ggggtgagcc acc tggctctatc aatt ct t catagta ccct caaggccatc gtg! gatctgagtt aact ctaaacgaag cca caaaatcagt tac gaaacaagaa caat ctattttgat aca ctccttgcaa ctt acacacatct ctt aatctctgaa cgtt ccacatt gt a tca ccccttacct aaat cgggttttca aac cttctctttc gt c gagccgcctc gac! acgacggaaa tt a atacggagat tt a ggatatgtca ct g ct ccat g agccgca at aagt at ct at c acaaga ggt ct g ggaaaag t cact a aact t gg aagaggg ttgaca catggag cccaaac aaagatt t t t t ct act ccaa tcccaa aagaaat aact cag gat gt t g gaagacg ct aat at cat t acat ca gt aaacat ga t ggt ggct ag ccat caat cc accagcaaca t at aaaccct aact gct t ac tcagacaaaa t t at agct ca cat t gt ggt t aaaaaaact t at at gt t t ac t caagt gaaa ct ct caacgt caccacacca cagat t t act caccagact c tgagccacac cct t gcgt ga at acct t aga aagat t t t gg gaggacaat a aaaaagcgac act t t at act cat act t gag gaat cct ct t caaccat ct t t aatt accat aggaat cacc t cagt gcct t t cat t t t t at aaaat t agcc cttt gaaaca tt gcatt ct t agaat act t a acct aaat t c at t t cct aaa gt t ct cgaac agt aagt gat agat t gagat ccgagat gca ggt t t gcttt at t ct at gt t aaact aaaca agat t t ct ct cgaaaggcag caaaat caca gt t at aaaca ctt caagaac accaaagat g gat ggcct t c cct ccct t ca aacaaacagt cgagt cat ca t gct t caaat gaaccagt ca ct caacct t t gcct caccac ggat cgcaac ttagcaagag t gct gaagt t t aagaat gaa gat gcagt ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1343 tcgaattcac aacggaagta cgccgcggcg <210> 481 <211> 1343 <212> DNA <213> Arabidopsis thaliana <400> 481 tcagttataa acctttggga accaccgcgc ttggcacacg acaaaatgta gttaagattg ttttcacggt atcgaagagc atttaacgca ctttctgtct ccatgatagc attatctctt Page 426 120 12689250 Sequence Listing.txt t cagagagag ct t t gt cccg ct gct caagt gct t cat cac gagaggct aa t gct t cct t a gt t gct gcaa at gat ct t ct tgcacacaca t t t t agt gat at cgggct t g t gcaat ggag aaat act cca cgt at cat at gagcaagt aa agagagct cc gcat gct gaa caacgagcct at ggt gt ct g act aat at ac t at ccat ct t caaacaggat gaat aaggca cgct t cgct g ct caat cct t t gt t ccaagg ccgct t cgt t t at t cat cac cacagt t t gt ct gct t ct t c t agcgcccat aaat t gat at aat cat aagc cccacaagt c gat acgcggg cagaagaacc at t aacagat tcaggagaaa t gt aat gt gg cct t gt ct t a act cgt aaaa at gat t t t gg act aat ct gg t t gt agt agt gt at at t caa t ct t t cct t g caat gcat t a act t aggt t t aagt gt t t t t t t t cat act g caaacat t gt agt t ccat aa agcgt aat ac aacat at at a aaaat t cgaa aggt caagaa caagcaacgg caagaagt gg aaact gct aa at gcaaagac t t ct ggt gga gt cgaat acg agt at at ct c acagct gcgt t gt t gct cct cat t caaagt gct t accaca cccaccact c gaaact aggc cat acat t t a cat ccaaacc gagt t cgaat gaaaat gaaa cacgat t aac aaat cgagaa aagacgagcc acgagagcaa gt t t ct gt t t t cat t ct agt ggt agacccg at cat caat c ccct ct cagc t t t t cggcat aacgaaaaca gat t gt gt t c t ccat cct ct t gacgagt aa acccaaat aa aaacgat gaa t t caagt t aa at t gt agaga gat cgaagac agat ct gagg aggt act ct c gcaagaaggt caat ct ct cc gggccgt at g ggt t gagat c aggct t t t ga aaggat ggag cacat t ct ac t gt t t t t t t g ct t t gt agt a t act t t caac ct aaaaacag at cgagaaat gaaaacaat g agcaacgacg gaaaaact t g t t acggat t t at act t ggaa ggt t caat t t t at t at t gt c aaat at aagc at gggcct aa ccgacgt gt g gcct cggaaa 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1343 cggagagaga tctgcgagag aaagagagat cagattccgg aagcacatat ttt <210> 482 <211> 1400 <212> DNA <213> Arabi dopsi s tha i ana <400> 482 tcagtatctg aacccgcctt gggtattcaa gaatttaggt cgatacaccg agccagataa tgagttcgag ttatctgatc accatcatca cacaagggct t at gact ct t ct t ct cacca at cagagaat ct t ct t acaa t gaacgat t c t at t ct gt t a t ct gct at ca ataaaccaga tgatttgttg ttttatctcc atgaaaatcc acat caaggt t ccat at aat gaat ct at aa taacattacc ttttaactta aggcttaagc ggacaatgat aat gat at gt t t gt cagat a agt agt ggca aagtaggaga cagtacatgg t accaact gg t ct ct ct gac tgcaatatct tatctttttg caat t gt t gc cagat aat gg aagt t at gag at cagat t ga tcatcctagt atattagtca cat agcat t a t cat cagct t Page 427 ct cct gt t t t cggcaat ggg aagat gagaa t t gt ct t t gc ct t ct t t gt t ct t t t gat t t t t t ct t t agc aagcaggagc t gct caaact 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt caacatct t a aacact t aat tagactcaga gatctatcga ccaacgaagt ttccccaaaa gt t t ccaaat t at ggcagct aagt aagacg cagagct t t g taagaggccc act cgt ct cc ct cat cct t c t ct t cgt t gt gt gt t acaac at ccaat gt c t at aat t t t g gcat ct t gt a t t t aact t gt ccct aaccac gaaacct caa t t t caaat at at gat agaga ct acagagac acat gaat t t tgaagaacaa gct t ctt cga ct t ct gt aac t t t gat gagc t ct at t t ggg taaagaccgc aaaat cacat gt gcgt t agg ccact t t caa aaaact t gaa gat t ct t at c cct t t ct t cc gtgaagagag ccgt at gat t caacaaaaca att ccaaagg at t t ct t ct a t caat gct t t t ct t t aaacc gact t t gaaa t t t t at at t t agatt cgt cc ccat at at ac t t t at aaaaa t aaagat ct c aat at cat aa ct t gt aat t g t at t t cat t g t t agact t t c tt gtt agt ga cacaaact ag agat t at at t actt caattt ttgt t gattg t t t at ggat t t cat cagaag t ct t t cgt ca t cat cat ct t at gggct at a ggct tt ct ct agct cct cca ct gaat cct t t ggat t ccat gt t t cggaat gaagat ct gg ggt gaaat t c at ct t gt t ct gaact gct t t t agct cccct acaact t aac gaaacagt aa t cat ggt cgt t aat gggccc ggct ct act g ct gaat cgct caat aat ct c tat t t t cct t t t t t gtt t ca aaaatgggt t t t t caaat cc ct gt t gaat g 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1400 <210> 483 <211> 1286 <212> DNA <213> Arabidopsis thaliana <400> 483 acttcttaag cgcaatcact gaccatgcag attggctgtt atagtcgtaa aacaatcagg gcataaatag tgcctctgat ggt t ggaagt tataaacaac atacaaatat catgtcacct ataatacgca taacagagtt tatatatcaa tctttaacct cttgtagcta gctaggttaa tatagtctaa cattacaaat ccatcaaagg acatacaaat ttgacagact ctcataaata aaaaatacaa atgatatttt ctgtattttg ttataat t ga gccaacgtct tagaaataat cgatgactta aact t aatgt tttacagtat agaagagtca aact t aggaa aagt ct at ct aagaaataaa t t aatgctaa t cacgt acgt tatagaaaac aagtaaccaa agagagacga tttttgatag ctcaagcact tgattctttc cct at t t tag t at t t gt t at ggaacaaaat gcaaagagaa cgt t ct agac t t aggagat g t t ccct tct t t at at at at a t t aaaat t ag t at acgacat t t at ccaat c t ggat t at cc acct acccag agcaaaaaca caaaggggtt t t t t t aagt c ggagat aaca cacaat at t t t aat at gcca t t at aggt ga cat gcat ct a ctt ct ccaaa t gt t t at aga t t ct t aagat acgt aat t t a aaat t t t acc t gacgt ggct ct t t t ggacc ttcagagaga t cct gaggt a t t t t at at ac at t t gggaac t t t tct agt g aaat t t t gat t cat ccat t a gat aggcat a gaaat t agct t t at at t at t t t t at gaaat aat t caat ga acat cagcag t t t t t caagt t cgt gggct a aaat t cagca tgt t t gcttt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 cctttttggt tttaatttgt ttcaaatctt agatcgattc ttcactttgg attcttaaat Page 428 cgat t ct t gt acgt at agat t t t gct t gt t aact agt aag gggat t agga t t t caaat t c cct cgt ct gt ct t t gaggaa t t ct t t ggat agt t at t at g tct atgt t t t tgt t t ct t gt 12689250 Sequence gctgatttcg aacttttccg atgttttttt cgatttatga gtatcacact ct t ctgaaga gcttagatcg ttgaatatct ggtgcgtatt aaagcttagt ct gt t g Li st i ng. t xt att ccaact c cgccgaatct tcggcggatc taccgat t cc tgccttgacc tttttatgtg ggtatatgtg ttcttataat tgagattgat gtttttgggt 1020 1080 1140 1200 1260 1286 <210> <211> <212> <213> 484 1389 DNA Arabidopsis thal i ana <400> 484 cacat at caa t t t ctt cagt t agagt at at gt ccct at at t t gat cat gt t ct t t gaat g t agat ggaac agggct t ct g t gt cat cagt tt cgt ggttt gt t t t ggttt gaaact t aaa at ggacct t a aacagcaagt cgcaaaaagc at aaccaaaa caaat t t aat gt cat aact g t ccccacgct agattttttt gt t gt gt gt a ct aacccaat t t t agt t at c t t t gtgtgg cacat gacca tttcaagcaa aagt aaagga at at t gt aat gct t act t t c cgt cacat gt t gaat aagt a ctt cacattt at ct gt at t t atgt t t cttc gat t t t t cat t gggt t t t aa at gat aat at agcct gat t t t at cagt ct t aaat t aaaca agagagagag ggt cccat ct cct t ccct t t cat t aat aaa cagagagatt caaat t t gcg t t ct t cct ca act t gccgat gat t ggagt g t agt t t agt a t at at cat ct cct t cat t t g t t gaacat gt aaagaggt ca gt aat cct ct ccat agat aa t t t gt gggct act gggccct t cgaat ggt a ct t gagt ct a t t gaacct ca t at t t gat t c aaaagcaaac gt t agat aag ttgtctcttt act t t t t ct c aaaaaaaaag at cgt gt t t t gctct t t ct t agcaattttt gct ct aacca agcaaact cc aat cat t aat aat gaat t at cgat act t ca gtgttctttt ct ggat aagc ct ct t t ct ct ct gaact cat at t t t gcttt gagcaaaaca act t cat ct a ct aat cat t t gcgat agaca acgaacat t a aagaaaacga aat gt ggt ag ccct t ct t ca gt t at ccaaa t ct t t t at t g gt t ggcaat c t t at ct t ct c ct t t t ggat t agccatt gcc ct ccat ct t g ccct aat t ca t cagcct t ca aggtt ctt gc tggatcgaag t ct gt t act a gaatt gat gt taaagggcaa gcact t t cac at t t t agt t a at t t gt t t ga cacaaat cct gct t aaagt g tacacgaaag ct t ct ct ggt aagat aagac ct cgccaacc acacgacgat cagaaat t t c gccatt aaac gcctcggttt agatt ggt gt t cgt caacac agggggcat g t t ct aat gt a at act at agt t gccact gt t ctt agct ct t gaat aat agg t gcaatt gct ccgaagct gc ttgt t t ctt a t at gact t gt at aat t cat c ttcat t t t cc cggt cat t ct t caaaat t aa at gat t ggaa ccaact acgc ccat cgcgt g t ct t cat t t t gatcgt t t t t ct t ct t ccct tagcaaaccc t t t ct t t t gg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1389 Page 429 12689250 Sequence Listing.txt <210> <211> <212> <213> 485 1366 DNA Arabidopsis thal i ana <400> 485 at ct t cgat g cagat t t gt t gaagt at t aa gaaact t act caccacaacc at t agat aac t at t ct t t aa t at gt t t gat t t aat ccct a t acggt t gaa aat gt ggt t g t t t gat t agc t t t cgt gat t t ggt cat cgg aaagt aagga t t t aaaat t t aaact t gat g aacaacat ca at t caaaaat t t ct t aaaaa aaaat ggaaa acagcaaggt ttcct t cttc gat gt gt t t g at caact cga t at gat t aag caaaacct aa t gggact aat t cct aacat a t gact t at t g gt at ct t t cg cagcacgggg gt at cat at t gct aat at t g gt aggct aat t ggt t t ccat t cgat acaac aacaaaaat c gatggagaag gagaagat ca t gat aaact t gaat at aaaa aaaacat aat agcggt aaaa t t t t gt ccgc t t cct t at ct ct t ct cccgt at ct t t act c aacact t agg acaat ct ct a at at at agag t at ct t t aga ct caagt cat ct t t gt t aca tgatgt t t t t cggt aat at a at gt at gat a cgt gat t t t g t ggt ct ggat caat aat ct g accaaaagcc at caagagaa agagaat ccg t t caat ccac ct aaat t aaa t at gt agaga at aaaaagt a at at at aaat ccgcct ct t t gaaacaccgg t ct cacacat at t t aagaat agt ct t aat c at t aat aaac t at t t cct at t t caaact t a aggaat acaa t cgaagt aac at t t gt at gt t t acat gt t g gct gact ggg t aat t t t gac aaat aat t gt aggagaactt t ccgat gaga acct gaaaca aaccaagcca tcacaaccac gagaaaaaag aaaagggaaa aat cgcgt gt ct agggt t t c gt t ct t t gt g t gt acct at g aact ct ct t a t ct t t aaaac caaaat at ac agat t t aaca ttccaacaac aacaaat gt t at t agt t aat at ct t t at t t t t aacat cac gt t gt cggag t ggt t t t gga aaaaaagaaa gagaaaaaaa aaaaaaaaag cagt aaaaag aaat caccat gt agt aat t a t t acat agt t aat at ct gaa gt gt t cacct t ct caa agagct at ag act t cagcaa ct aagct t ag t cat at ggt c t aggat t agc t aat cct t ct gt caagt t t c t t gaat at gt aaat aaaaac t aaaat t t t a t gt agat ct c cccct gt t ag aaaggtgcgt gt ct at t gt g aaagaacat a aacat act t a agaagaatt c cat cat t at t agcgatttt t aacaaaagt a t at at at at a ct cagt t gt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1366 <210> 486 <211> 1288 <212> DNA <213> Arabidopsis thaliana <400> 486 tcacacacac gtcagcttta actccgtcac cgtctccgcc gtctccaccg ccgcatttcg tgagtctcac ggcgggaggt aatggctctc cggct t cctc atcgccgatt tct t cctttg gtagcttcgg gtacattacg aaacagagga gaataataga accgtagaag aggataataa agacgacgca aagaagtctc aacacgattg attctgatcc catcactgtc gacatctccg Page 430 120 180 240 12689250 Sequence Listing.txt gtgacgccgc cgccacgtaa tccccacaac aacttgatgc cattttttta t t aaat t ggc gcat agt gat agt gt gt gat ggaat gt ggc ggt t aat cat acaaaact aa t t t t t t t t gt cat at t aact aat t at gt ga at aacaaaag ct t t ccaat c t ct t cat cca ct at t t act c t t t cacat t t ggggct t gaa t gt gt aat t g t ct gt t at t g tt att ggaaa gt t ct t aaat gaact ggt ga gggaaggatt aaacaagt t a ct aaat t t aa t t cagt t act act cagccag caaaact aaa ctaaagggcg t t t cct t t t t t ct ct ct ccg ttctct t t ct aggat ct at c accct agat t t at t t t ggag gt t t aggt t t at t t t gt gaa at ggat t t t t aaaact gaat aagaggagaa tgagt t t t t a at t aagct aa at t t gat gat gcccagccca aat gat gaaa gt t ccaact a gcctcagagg t cgct t ct t c ct t t aggat c cct t t aat cg t cagct t aat aagat at t ga gt gaaggt at t gat t t ga tttttttttg t aacgcaat t t at t cgagct t t t cat t t t a t gcgcagt t t ggat gat t ac aat ccct ct c t acacaaaat t t t cccaat c ccat t ggcga ct aacgat t c gat t ggaacc gcaaat t agt ct gggt aat t t t t ggt t agg gt t t at t gaa t t at agt gt t t agt t at agt aat t aaaat t ttat t t gtca t gt t t gt t t t agcagt aaaa aacgcagcca t t ggaggat a t t t at aat at agaacaaagt ct t cact cga ct aggat ct c at gt gagct t t cat t gggt t t at t aat caa ttttaagaaa t t gt ggggat aat t acgat t tatgcggaga gt t aat t gga t gaagat aac t t at ct ggag aagaaaaaca ggcccagccg t aaat gt aaa accct cgaga t t gaacct t t at caggt aaa t gagt t t t t c t t gat gat t t gt gaaat t t c t ct at at t t c 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1288 <210> <211> <212> <213> 487 1362 DNA Arabidopsis thal i ana <400> 487 gaagggtttt caat t t acat caat t t acgt t t aat gagt t tat t gtcttt agt at t t t t t ccgcat t agc t cat gggt gt agt t aacaaa t t gt gt t at a aacat gggt t t t acaaat t a gggct ct acg t act agagt a gt gt cat t t g caagat ct t t gat gaaact t at t t at aaaa t aaaacccat aggat aat t t aaaaaaaaca t ct aat t t t t t ggat t ct t g aacat t at ag tgtgaaaggc caaacaaaac agat t t at aa t ctt gaaaga t gaaat cgag cagt att t t t t t t cgt acat t t t t t t at at at gcat t aaa aaaaat at t t gct gt cgcac aaaat aat t c at aagat at t aacagct at c t ct t t acttt cat at gaaga aagt t t t caa ct ct aaat at at gt t t acaa t t t gt t at aa aaaat t at ag t t aaaaaaat cgct t acccc ttaaaaaccg t t agt t act t at ggat t t t t at t at cact t ct t t aaccca aat t t t aaac t t gacaagat acaat gt ccg aat t t t t t gt gt ccat t ct a at at gt ccct t ct at t t aaa gact aagt aa t t at gct t t a at gacat cat ct at t t t gt t aaaagct aag gat gt caacc ttttttaccc ct caacaat t at gat aat t t aat t t t at t t t aat t gt t aa ccggcact gt t gaaaat t t t 120 180 240 300 360 420 480 540 600 660 720 aaattacaaa aatgaagaat actaggaagc gagaactagg gaagaaaact tgaagcacca Page 431 cat t agt caa act at gcact act cat at t a gacct gagt t gggtccaaag t ggt gtt ct a t t aaaggact aat t act gt t tagt t t t ctt t agat t gaaa agagat cgaa t t at t t aaaa agact t t cat gagt caaaat act t t cacga aaacat at at tggggaggac tgt t gt t t t c t act aaaat a at t gct cat a 12689250 Sequence gaaagaagcc at t ct ct aag cactt gccaa t at at ct t t t ct t caaat t a aagggggat g cgcatctccc tacaaacaag tccattaaac tgacatctta ct t ct gaaca t ggct caat t tatat t at t a t t t t t agt t c tttttaattc aaacatacac tt att gaaaa t t actt at aa caggcacat a aacagat t ga Li st i ng. t xt aaaat t t acc t caaaagttt agct t t ccgg tt cat cat aa gt gt t t aaag gt gact cct c ccctt aaat a cat t ccaaag gat t t at t at ga at t aagat t g t t t t t t cat a at gat act t a accagt cgt a acat t t t t ct t caagaaaat t ccacat aat agacct t t ga t aacat gtt t 840 900 960 1020 1080 1140 1200 1260 1320 1362 <210> <211> <212> <213> 488 1345 DNA Arabidopsis thal i ana <400> 488 at gt gt gt ag gt gt t t caaa tccgaaaaaa t t gat t cccc gt t t gaagag catt agct cc acaat gaaca tgaaaaggaa caacaagt t a tt gtt at cgg gacaaaagaa agcaact t cc at cggct t t c gggt att acc ct ccaggtt c ct t t gcat t g t caagcact a ct at t gat aa aaagat ct gt gt gt gaacaa cgaaaaccaa gt gagat at a gt ct t t t gaa t at t gt gcgt ggt t t t gat c taaacgaaag at t caaagac aaagt t ct aa aggaaact aa t aact acaac acct ct t caa t t cacagt ct t cgt agct t g at gcccat cc t t cacat ct c gat ct t aat a gcgat t at t a t gt t gat gt t t t t ct t agt a aagt ccacaa t gacaacgt t gagagt caca t gcaagagat t ggt t t cagg gt cagaacaa agact aaat a at gaagat t g at aat gat gt agagacagaa at cct t act t t ctt cat gt c tt agt ct ct t t t ct gacgat cgaaaccaac t t gcat t caa at gt t t gagc aggctttttt gaaat act ca cat acagaat aacaaaagcc aat t gact ca agaagagt ac gt gaaaaat c at ggacat gg gctgcgagaa ct gt gaagaa aagaggttt c t at agtt gt t t aaacct t aa at at t t t t t t t t t aaat t gt t cgt t t agga t aaaaccttt gcacagt at g aagacttttt cgaccat gt t t t t t at t t ct aacat ggaag tgagaaacag tctgacggag t acact gcac gaaaagaat c t agagat gt g t at acccaac agccgatgtt agt cact aag ttttaccgca gat at t aaaa ct t gt t gat c ct t t t cagcc ttat t gtct t gat act gt t t t at aaact t t gaat caaagt cggt gt t t t a ct acat at ga at cgat ct t t t ggaat t caa agagat gaaa aaggaggctt aat gt t gaaa aaagt aaaac gt t gt gaact acccct caag t aat at gaaa ttt att gaag cccaaacagc ct t gaaaaat t t t t caagt t gt t t gggt gc caat aaact t cat aat aaac gaagggtttt t caaacacat ct t cgaat ct tgaacaaaac ttttttttac aaat acaact aat gccaaga t t aggt gt t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 432 12689250 Sequence Listing.txt cccaaacaaa acgcacacaa tacggcgtcg tttagaatca gaaaagacat ttctttatgg tcacttgatt ctctcttcct tcatcaatca atctcgtctc ctggaaaaca ttagggagcc tctcagatcc tcaagaaaac cctaa 1260 1320 1345 <210> <211> <212> <213> 489 1329 DNA Arabidopsis thal i ana 7 <400> 489 t gggactggcc 0 gagcct t acg C1 caagggtgac 00 0 ccgggagatt C1 t ccacat gt g acgat aacat aacaaagcga t at at agaaa act t t gat gt t at gt t t ccc ct aaagcat t att ggt acga aat ct t t t ct aatt gtt cag at t ct ggacc gt aat agt ag at t acact cg gt t at t t t gt t t at aat gag tacaaggcaa cgagggttt c gaaagagcaa gt t gaccat at agacgagt t gt gt gat ca ccaat t ggga cggacggcat agaacccttt ggggtat t t t gggggaagag tt gt aaccga gcat gaagac t gacccgat c gt aaaat gt g gaagaacaag ct gcact t t t at t t acgacc agaacat gct t ggaat gt t c t t t t t t aat t gat t t t t cat agcacacaaa caaaat caca t at ggt t gt g aagcat aaat t att caggca t t t act t acc at t t t ct gcg cagtggagaa aaactt acca t t t gt aaat g t gggccggt a gact cactt c gt gagaacaa acat gagaaa gt gt t aat ga cat at att cc cat t ct cagc cgacaaagaa t t t gat ggat at ggcat t t t ttt ctt aaaa gt cat gt gaa t t aagaat gt cgaaat aaat ttgt t atct t agcat cgaga agcacaaact t t t at gat at ggct gtt cag t t t act at at agtt at cct c tagcaacaca t gcct t gt cg t cct acatt g t agcagat gc agagagtagt agt cgcat gc ct gctt ccag t agact t t ga atgggaacgg cgt t t gtt ga t aat gct cat t aat ggaact aat t t agt at tagaaaccaa tgagagacca ttttttttca gagacact gt acacct t t gt at acccat t t cagcagctt g gtaaaggaag cct t t act at gt t t ct at ga acact t t t t g ttct t gt t aa aacgat agag t t gat gat gg gggt aaagt t at gggaggt a at ct t at t gt t acacagt cc cat gat t gat t ct t ct t ccc gt t t gat t t c gagat ct t gt t t t gt gaat c gacaaaacgc gaaat aaat t t ct cat t cat cat act gct t t acaggt ccg cccct gact t at ct t at t at at aggact t a gct gat cat t aaat t cagt t cagct gt t t c aaagacgagg aaagtt gtt t gtggagaggc act ct caaat t t t gcat t at at ggggat t a cagat t t ggt t caat gt acc t t t agt accc gat t acaaat ccct gat gt a t gt cgt t t t g aat t t agt gt t cgcaaat t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1329 <210> 490 <211> 1319 <212> DNA <213> Arabidopsis thaliana <400> 490 tttgtcacca aaatcagaca ggcaaagctg gctcaagcat cgcttaaatc cctgtaaaac Page 433 12689250 Sequence Listing.txt gcaactatgt aattaatatt gagatatact tgttgctttc tgactctgat cggcagcatt gct cat agt a t aagat ccat ctct t t t t cc agt aggaat g aaagt gt ct c gt t t t ctat t ctt cagcaag at gaat gaag cct ccgaat c acat ggat gg aagat gt ct t gat cgt aat t cct cgacaac cat t t gt t t c t gt t t aat ga cagt gcct t t acacgt gt ca aaaaaaaaaa gaacgaaaaa ct cgt gct ct ccgt t gt gt c aat at gt aag acct cccggt t cggat ct t t t t t cagt ggt at ct acggga t t t cct cgat t agcagct at t ct gaat gt c gt cct aat cg t caccgt t t c agt gat at at t at aacgaat t t t ct t t t t t t t t t t t ctt a at t t acct t t at cacgaaac acagt gt ccc t ct t ct t gt t cggct gct gt t ccat cagac gcct cact t g act gaat aac ct ct cacgt c t caccat cct t t at t gaaga agct caaaac cgagat t ct c agat t gat gt at caagat ag at gcggaagt t t ggagt t t t gaccaat at t t aacat t aca caaaaagt at aggagt aagt cacaaccaaa aat t t gat ca ggcaaat ct c t gccaaat ct t t t cat t gat ct t ct t t ccc agcat gt t gc caaagaggt a t aaaat gat a ggagt gt gat at cgcacacc t ct ct cgt cc ct cct ggcac t at ct t ccat t gact cct t a cgct t ccggt t gt aat agag t t t t t t cat a cat t cagat a t t t ct t t ct t aaacct t gt c aacaaaacaa cggcgagat c t at ggt at ct gt aagt ct t a at cat accat tt gct acaaa gcaaat t t gg aagt ct ccat acct t t gt at atgt t t t t ca ggt act t agc t t ggaggat a at aggt ct ca ct t t aagcaa t act ct gat a at agt ct at t gaat t ct aat at t t aat aaa ccgat at cct gt ct t ct cca at t cat aaat atct t t ctta t t cat t cact tt ct caaaaa t aggt act ac at ggct ct t t at gt gt gat t t aat gaat gc ct t t ct t t gg at gt t ggt t t at cat caaac caacaccct t at ct aagcag agagt t t gag t t gaat gcat t gat at ct t t t t cgat ct ct act cacagat aat ggt at cg aaat t gt t cg at cat aaaaa tcggagaaga t t t t gt t cc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 <210> 491 <211> 1305 <212> DNA <213> Arabidopsis thaliana <400> 491 tgatggtcac tgctggagaa aatatgggca gtgat t atca ttcat t catc atctgggaat ttgctaaatc ctatgtctct gtttttgttt tcacacaaga ctgtttagca gtgaagcaag tcgaagtgaa gtatctcgga aaccacactt cgaacttctc tgtttcattg acgaacacga cagagcaatc ggaagacatg aaaccgacca atctggagaa caagaagaac attttcagaa acggtgtgtg gaaaagtaac cttttcttgg catcagggtc tgcaattacc agcgaggttt aaaagagatt catggatcca ttcaaagttt taatttttct acagagcat a t t acagat gc ttcagaaatc agacacagat gtaacaacat cacatccccg acat ct t t ga aggaaacaga agtccgaaga agt gat gat a cgttttcttt ctctaaccat gaaatttcgt ggaagatctt tatctgcacc tgctgctgtt Page 434 aaaat cct ag t t t gct at at acgcat cgat cct t ccct t t aagacgacga gt acat gt t a agcct t gaag gagat t gaga t ct cct gcaa gagaat t ccg 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt aaactgcaga ttcgtatttc tcgtctttgg acaatattat cgatttcgga caggattggt t gt ggt cgt a aat t at agat at t at t t t t g t t t t cat t gt t t t at gacaa aat t ct t at g at gaaaacgc ct ct t ccat c t t t t t t cat t ccagat aaaa gaat ct t t ca gct t t gaat a ggt cagt t t a ccgat ggat a ttttcagcag t t aat aaaga at t aaat t ga t t t gt aaaag t t t caaat t t t acaact t t t act at caagc gat agt cgga at t gat t t gg at t t aagggt aaaat t ggct ggt t t at gaa aaggat t t t a tagaaagaca t gat gcaat t t caagcagt g t gt ct aagaa ccagct t cga gaagacgaaa t gaat gt aat t at t agt t at t at aggact a t t at gat t t c t at aaggt t t t t t caaat gt at t aggat ga t aaaaat t aa aagaacat at gaagagact c acgaaagt t c gt ct ct ct ct t act t at t ag aagt t acat g cct at gaagt at t t gt gagt cacat t agt a acagat t aaa t t ccagt caa cgt cat t at g t gt cgat t t c cgaag t aat ggcagt t at agt aaaa at t t t ggcct t ggat t gt ac t aaagt t gt t t gat t gact t aagt ggaaaa aagaat at aa ggccatgagc at cat cat aa 660 720 780 840 900 960 1020 1080 1140 1200 1260 1305 <210> <211> <212> <213> 492 1359 DNA Arabidopsis thal i ana <400> 492 t t aagt gat g gaaacaattt t gt caat t ga ct aaagaat t t gat t t agat t t ggt at caa cat gt gacgt cat t aaaaac at t gaagt cc cct at aaat a ct cgct gt t c t t cgct ct t c gt t gt t aaat t t cgaacct t gagat t ct ca gat ccgt t aa aat gat t t t g at aaat gt ag t t t gcaact t aat t t aat t t aat gat agag gt t ct agt aa t agt gacaaa aat ct gt aaa gagat aat aa act aaaaagc aaaagcaaaa gct cact ccc agat t t t gct t t ct ccgt ct ct gaaacgaa ct acgcct gt agt t ct t agg t t t t ccagt g at t t t cgcat t agt agt agt t t aat gcaac aaat t t ggt g agagacat t a aaat t ggt at gaat aat cct aaaaaaat ca at t gat t t ga aaat aaat aa acct at agat t t gt cat cca t t gaggct t t t at t gat t t c at gagat t t t t at t at aat t t t at at cgt t ct gt gt agca cgat cgt gt a at at gat cgt at t t t t t t cc ttttcttaac ct at at t at t t agt t aat t t t caaaaat ac gagccat gac t t cact t t cc at gt agccga ccggtggaca caaat cgt cc aggct cccca gagt t t t tag t ct at gggt t agat ct gcga t gt gat t t at gat ct gct t a ct ct at gt ag gt act gagcc agcat at t t t t t gt at at aa gt gaaaaagt t cagaccat c at at t t cgac caaat acaat aat t gt gt t t at aagccgat gt caacagt g ccgt ct cgt t gat ct ct aat gcgatgctt t t cgat t caga t agt gt gt ga acagat t t aa at aggt t t at t agt agt agt at aaat gagc at aat t ggt t aaacct t aaa at cact at t t at aaaaaaga acaagt at ac at gt t aagt t at aat t aacg ggaagtaaga t cat t t aat c t cct t ct t cg cgccgcaggt t acgggt t t t t t t gat aat a ct at t gaaat aacgt at gt g ct t t t t t gca at at gat t t g ct t cct cgt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 435 aat t at t gt c t t at cact ga tctgt t t t t a gt gt ggaat c at gat t t t t a cat gaat t gt ct gtt ccact t gct at t cca tttttttttt t gt ggtt gct 12689250 Sequence Listing.txt tagttaagct tgaaagttcc ttaaacgttt aattagatcc atgaatatca gaagaatcga atctctttgg atgagatgcg caatgatttg gaatctttct tagcttttta tgtcacttga gttctcttcc tttcaattgt aaaaagtttg ttatatgtgt gattcaattt ttctttttg 1140 1200 1260 1320 1359 <210> <211> <212> <213> 493 1301 DNA Arabidopsis thal i ana <400> 493 t gt ggagat c taagacaaca t agct t at ga aagct ct aac t aat cacat t tt gatt at gt tat t cct t t t att cat agca aact t t t gt t tttacccaga t at gat t gca agt gaaact t gaagt ct ct c t ct gact at c gt cct t cggt att agccggt ctt cacgtt g cagat t t gcc agcccat t t a aaaagat aaa t ct at t at ac t ct ct ct ct c agt gcct gat at t gacat ca act t gcact a at gt gacaaa ct ct agact c t t acgcct aa gct at at gt g aagcagagat t ctt t t t gat gactt gt aaa t gagt agccc gt agat ccga agtt aggcct ccct t t cat t gagaagt cca agggt t t at c tcacaacaac aaat act at t gt t at at ggg t gggct t t aa aaacagact a cct ccagat t aaagat agca agcacacct c gaaacact ag aat t aaacgt t agct act at t gtt gt aat t tggagtcgaa t t at t t t at c tgt t act t t a cgggacat aa aaacat ct at gtt cgat cct t gggt caat t agt cggaaaa ct ct aaaat a aaaaaaaaag t t gt t acat g t ct at t t ggt t ct gt t gt t a tttcgacccg cgt cgt t ct c caaacgat cc t t gcaat gat ttctggtgac t t t cagaaat ggaaagtacg acat t aat t t t cat ggtt ga tggaaacaac at t at t t gt t at caatt ggg aaagaaat aa ggt ct agt gg ccct gaaaac ggt t t acct g cat t t caaat t t t cggtgcg aaaaaat gat cgact aaaca cccaat t agt aaaaat agcc gcccaaaat t t t ccact cat gat ccaaaac aat gt at gat t ggaaat caa cagcat aagt t aagct gcag t aat t t at cg t ggat at at a ggctaggagc t t gcagt ct t t t gt gaat t t aacct t t cat taggagaaga aaaaat cat a gt agt t agaa tcagaacaga t t t ct gccga t gcat gagt a aat t at at t g gat gt t t at a cat gt agacc acaacgt gt t ct gaaaacaa gt gcaacgca act aat aagt at cgaagaga gt at cat ct c t cgt ggaat g t agat gt ggg t ggt ggt t gc gt t t ggagt g at t caagt ga tt ct at gt ct t t t agggaat t t t gt t t t ga atgcagccgg cagt at ggt a ggctgaccag ct t ct caat t aat ccat at a t ggat t t aat cgt t t at gga caacaacaac aat ccaat t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1301 <210> <211> <212> <213> <400> 494 1367 DNA Arabidopsis thal i ana 494 Page 436 12689250 Sequence Listing.txt ctgccaacat ctcttttggc tatatactca tgaaacttta acccat t t gt t t at t t gt at gagagact at at t cat gaac cat t gat t ca aggcggtcca tgt t t t t cat aat t caccaa aacat aagt g aacaaagaat gaggacgtaa gact aat t t t ccaat aaaat t t acat at ca ttttaacaag aaaaacaaca aaat agaaac t aggcct agt cggt t ct agt agtggcccag aaact gt at c caccacgt cg aat t gt gcga gt t cgaaact gcaat gaacc at aagagt t g ggt gct t agg t gacgt t gt a t t act t at cg aat t at t t at aaaagact ga t at caat agt cct aaaaat c t ccaagagat at t gt aat t t t ct agacaag aagaaat cca act gcat t gc aact t caaaa acat agct ag ggtcgggct c aaat aaact c ccgt t t t t gt t gaaaat ccg tttacgaaaa ccct gaaagt t aat caaat a gt cct t ggaa agt t aggaca gaaagt at t t caaat aagaa gt t at at gt t at t aaacact at t t t t aat a t ct at at agt aaat t t at aa caaaccaaat t t gagat t t t t aggaact aa agt gaat t t c cct t aat cca at ct t gt aac agcctggcgg gt cgt at t t a t ct t gt act c at ct acgt ct t ct ct gaaac t t cagt ct t c t aact ct t ct agcgacct ct at gt aaat t a t t t t t cgtat acacacact a gaacct acga t act t at aag aaat t aaaca tgt t t t t gac aaaat at t aa at t t at t aat ct t t at aggg t aat aaaaaa at caaaat cc aaat gt t at a tcgtgaaggc aaaaaat t gt cacacgtt gt tacacaaaca ct gt ct ct ct ct ccgat cgt t t at gt at ga caagaaat ga t caagt ct t c at aaacaaag aagccgact a at caact at t aact cat aga t gaaaagact t t t aaaaaat gaat at gagt at acgt aat a t aaaat gt gt t t t t gt aaaa t acaat gcaa at t aaaacat gat agat at g aaat gat t gg t at gggt ct a cgt t t ct ct t gacaact t ca ccaat ct ct c taacggc aaaaat ct t c caagaat cgc t at gaaaaag at t aat t aga gt ggt gct t a t at acat at g t gt aaat t ca cacagaat aa gaat t aaaat aaact t at t t t t t at t at aa ttttttaact aat gagat ac at t t gat gat t gat at t t aa t t ccaaact c caat agct ct gacgt t ggt t aggcccat aa at ct t ct aga aat t act caa tgcgccacag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1367 <210> 495 <211> 1340 <212> DNA <213> Arabidopsis thaliana <400> 495 aagttgttgt catttaccta ttcccaacac tttccttttg tcattagaaa cacttatacg atatgtttat cgacataatc gttaggatct cctctcgt t a at t agatata aagat t ggat cgtttcaata tcatat t att atggatgtac aagccaaagg gcgagatata ttatgcctaa gatcatgaaa gcgacaaatg cgtaatggag gtagcttttt ttaattcaat tatttcgtat at at at t ct a ct ccacct t a aacctgttac atatttgcaa ttgggtttct caatgtttaa t t at at acat aaat acaaat aatgtatggg aataatgtaa t ggggt at t a t t t t aat gt t gattttgttg cttggatact tattattgac aacttggctt Page 437 agaat at ct a ggt cgt at ca t agt ct t t t a t t aaaat t ag agct t act ca t aacgt at gt gtt cggcaaa aat t at t tag 120 180 240 300 360 420 480 12689250 Sequence Listing.txt t gt t aat cat tcat t t t t gt t gt aagaagt t aaagt t at c t aat gct t t c t t t aat t t t g cagaat at ca t aaagct t ct agaat t t at g caaggaaagt caaaaaagga aagaaaccgt at cct t ct t c cgagccgt t g t ct gaaccac <210> 496 tttgtttgtc aacaattaca aacatctgca caataacctt gt agggggt g agagt t t t gg t aat gct t t a aat t gt act t acaaat t acc cat at t t gaa t at t gaat ga t gacat at at aat t at aaag agagacaact ct t aagct gt cacct t ct ca agaatttttt caaat ct gcc at at accat t gat cagcagc aaat at gcat at agat gt aa t t t gaact t t t act t t t aat t aacacacat ct t at t caca ttaaaggaaa aagcgcgtgt t ct t ggacac t t at at t cat ct gagagat a t ct t at t caa cat at gt cct agcttttttt gat at t aggt t gaat t t gga t t t t t gaaat at gt gat gag accat agt at aaaaaaaaaa agt t cacaaa gct gaagcaa t t ccat ct t t at t t aacaaa aat at t gaaa ct t caaat t a ct at t t t t t a t t aaat t t t a gat t gt aat c ggt at aggaa aat aaaaaag t gat ccat t g aaaaaaagct ccagaagccg at t t aat cgt ct aat t t at c t t t ct t cttc t gat gt gact t agt cct cag t at at gt t t a t aaagt aat t at ct ct ggcc at acat caca at gaaaaat g aaaagaat ac at t aacat at aaacaaaaaa agagtcggt t gt at aaaact t t t ccat t t c t t ct t ct gt t 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1340 <211> <212> <213> 1354 DNA Arabidopsis thal i ana <400> 496 aat t ct t at t aaaaaggt ct ct cct t ggct ggt ct at aat agccct aaca caagagt gct at aaggt t t a aat t cat act aaggaagt t c tagaaagaca ccacagagaa cagaat gt ga cat gaagt gg caat gaat gc atggagcaga aggcgccaca ct ct acaaag t aat ccacag ggcat gaaca t acat aaaag ct gct ct t t a acaaaccaaa ataagaggcg aggaacgt ca t aagct t t t t at caaagat a caaat ct cct agaccgaacc aaat cgaaaa agggaagagc at gcaaggt t aat at t gaaa at aaacacgt t t ct t aact g t t gcacact t cacaat ct gc aggt gct aac t t agat agat aaccaagaac t cagacacat at caaat t gt t cacat caca ct gt agccca tagaaaagga caact agcgt ct t aagacat agacaaagaa t t t ct ggat t ct t caagcac at t gt gat t t t gaacaagt g t t ggagct ac caagagt gaa gcat caccgc at ggat t aag gt acat t t ga gaggcaaat a aat caaacca agaacgt act t act t aact c aagt ggat t c t t cat gaaaa agt t cct t ct gct gct t t ca cat aaaat ct t t gat t ct ag cagcccat t a gt gaagt aat accat ct gca agacat gaca ct aaat gt ct at aacgct aa t aaaccat ga gt t gcaggt a cgaggat t t g agaccgat at gagaaacaac gcaat ct cct t caat t ggag ct t gaaaat g at aat gcgt c act at t t gcc accact t t aa caagaaagt a ccaacagat a t t caaat t aa at caaaat gg acaacgaaat accggat t ga aat t gagt t t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 438 ggat gggt t c gagccat cgt ct cact at cg gccat t t caa acaaagt cca ccgaaacgac agagagaagg t t agagcat t at t agcaggt gct t gagt t c at aggct t cg t ggcgat t t t ct aggacaag at cgt acat a gagaagaat c t t t gggaat t 12689250 Sequence ccagaatctc ccaacccgtg cgtgaagaag aagaaagaaa tctgtgtatt ccttcgcatg gt ct ct caaa gt ct t cgcga agtgaaaagt accaaaatac gctgcagccc ttttttgtta tcgcgagaga cagaaaggga gggtaaactt ttcg Li st i ng. txt aagccacaag gataaagaag t t aaaat cgc cgact t t caa cctcggt t gt gtgt t t t aag cgat t at t gg at gaaagt t c at gaaaagt g ct gaaact t t agt agaaact cgt t t t atgt ccagacaggg ttggtgaaat 960 1020 1080 1140 1200 1260 1320 1354 <210> <211> <212> <213> 497 1371 DNA Arabidopsis thal i ana <400> 497 tttttgaccc t gacagagt t ttcgaagaga aaaaaat aaa at at cat cca caaat at aga gggagaagag aaagt t gact at aagt gat a t t t aaaat t a t t t agtgtta at gt aacaga t gcat at aat t cat t gt act agt t aaaat a t ct t t gact a tacaccggag aaagat at gt agt t t t t aac ct gt ct aaac at t t at gt aa t ggcaat gt a ct t ctgggga ttttcaaaca cagaacaaag agt ct t gct g ccaat at t aa t t agt cgat g agacagt gt c t at t aaat t g acat t agaag at t aaaacca t gt caaagat at ct gt gcat gct at t t ct a at at at ggt a t aat at ggt a atctgacggg t caaact t t g gt at ct t gct t t t ctgt t aa t t act aacat cccaaggt ct tgtggggaaa agaagaacat caaacaat ac gaaaacgaag t t t t t ctct a t t t aggt aaa at t cat gt aa aaacaaagag at acat gaat cact t t t aca t t ggat at gc t t t gagt t at at cccat gat gaaaaat t ac act t aaagt a gct att t t t a aaaggt t aaa t t ccaat t t a t aagt at aga t gt t aaaaaa tttccaaaac aggcccaagc acat t gt gaa gagcat cgaa aat gaaact t agacagagca t t caat t t t t t aat at t acc cact t t aaaa ggagagcat a acact ct ct a tt gtt ct t t t t t cat t ggt t ct aact t t t t t at gaaat ga t at aaaaat a at aaat t t ac t agcct at ca aaaaaacaga aaacgaact t agat aat t ca aaaaaaaaaa aact aaacat ccaagcccat gct aagt ccg acagat t aaa t ggat aat ag at ggagaaaa t t t t agat t t ttttggagaa act t gaaaaa at t at ggcat tttttttttc ct t t gact cg t gt at t caat acaacaaatt aat ct acat a ccaaat gaag agt ct t t gac gaat caaaat t agagagt t t aact aat t t a cct aaat t t t aagt agt gct agcagat agc atgagggcga agt agt cact ccccat t t ga at aagaggaa aagt t t gaag gat gt t at t a agaaat aat c aggt t cat ag t ct ct aaat t acat t t act g aat ggat t t t cct t t gggat cat t aat t at agat t aacca t t ccat t gat t cat gt aat t aaat t t agat t ggcgt aaat at t ct at cca agcat t t ggt t ct t t t t t t a gt t ccat at c t acact gaaa taaccggaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1371 tcggagactc atcgttgcaa caacaacaaa tttcagaaga tccgaagacg Page 439 12689250 Sequence Listing.txt <210> <211> <212> <213> 498 1296 DNA Arabidopsis thal i ana <400> 498 aggaagcat c t at agcaaag caact aaaat aaagt aaaat at ct t at t t t acagtcaagg cgagacatgc t acat ggat c gt agt t aat t at aaagt t t t ggagaaattt ctgcagcagc t t ccaagt ag t t cgt at at a t gaaaccat a accaaaggcc ctggcgcaaa aaatcaaaaa aaaaagaaaa cacgacgat c tgcacaacat at t ct cgt cg cat cgaat ag t agct gt t at taccaaaaat t gt gaaaat t gaaaacagac caaacaagag agacaaaaga ct aggat cac at at acaaaa t t t t t t gtaa gcgtgggcaa tgatgagagt at aaat t ct t caatt gtt ca at ct t t act a aat t aagat t aat gact t ac t gt t at t t t g at at at at at ct ccat cat c ct t gt ct at t caggat at ct agagagcgag aat agt gact aat aacaat g t gt t t agt ac actacgcaaa at agat agct ggactt gaaa ggtcgcatgg ctt gaat at t tggctggttc tggacgtaga tt ct ct at aa t t t gt t t gt a ttgt t t gct t ttttaaacca t att ccaact acaaat caat gt cacaaat g acaaaggcaa ct ct t cct cg tt caat cgaa ct ct cat t gg at cacat at a t ct ct gt aat ttacaaacac t t aaaaagt t caaccatgga caacagacga accgtggcgt t gcaaacgt c gat ct accat gt t aatgtag acct cact t c aacagagcaa t ct t t at t t g gaatt caaat taaaaaaaaa ggaaggatag aagt t t t cca t aaaaatt gg aat t acccct aggaact t ca acct ct gt ga gatcag t act gat at a ct t t t gt t t t t caat cacat aggaggagga tcagacgacg t gaagat aga t gggcacct c atgaagcacc gaaacaaaga at att caaca acat t ct aat agcaggctac agat cat gt c t t agt ct gt t gt caat t gt a agcaaaaaaa aat t t t gt t t tataaacaag gagaaaatct at t ccaat ct gatcgctttt cat ct ctt gc t t t gtt tcat cgagaaaagt aaaagt gt ac at gat ccaca gaagatggaa t aagat caac cat t gaaaag gagt cct ct a agt cactt ct at gcat t gca at t t t t t t t c t t ct t t t t t t ttataaaaaa aatacaaaac aaagaataat ctttagcaga gt t t t aacaa ggggcct t ga gtgagatct c gat t t t t gt a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1296 <210> 499 <211> 1393 <212> DNA <213> Arabidopsis thaliana <400> 499 cttggaagca ttcaagagag tcgtggagag tgtggctcag cgtctcaatg aacagcccgt gatcgttgct cacagcgaaa acacctttga tgggagcggt atcaggaggc tcttgtccaa taaattcgaa ttcgataagg taaactacca tacatatata tgttatctag cttttatgct aaaggaaaac tttttaaatg atggtaacga gtgatgatga tccggaacgg tttggtcgca ggcactaaac gttgccatgg agacgattcc aaaagaccgt cagggtaagg tgtctaaagg Page 440 120 180 240 300 12689250 Sequence Listing.txt atatctacga gctgtgcttg acactgttgc accatcggcc actttaccac caataggcgc tgtgtcccag cct t t gaat t at caaaaat t t gt t t t t gt t t gat ggaaat t at aat gt t g gccgct cacc ct aat cat ag t gagt gt t t a at aaaat t ga tttttttttt t t t t t t acg t t t tttgtttacg t gt gaaat aa ct aaaact ac cat t agcaca agaggaccaa cgt t t ct t t c agagggataa <210> 500 gt aaat aat g t acat gt t ac at t agt gaat t at aagat gg gtggtgaagg cagctcgagg t cggct acct aagggaat aa cccccaaaag aagt t t gggg ct aat gacca t t t at gat gc at at t ct gaa aaggagacca aaaaaat aat aaggt ccaaa cgat t ct cgg ggt ccccgt ct aa aat t at t t gt t t t t at t t t t at aat at gat aagaagagt t gt agt cccat t t ct gccgt c gcagcact ag t t t t agatta t t t cctcttg t gggat gcaa t t gaat ggct gt t aat t gaa gat t agt aca t t gt t t t t at aacgaat aaa at t t t t cct g at t at t t t gt t aaacaaat g at t t t t t gaa aatggaagcg taagaaaaca at cggt t t cc aact t cgact cagcaacaaa at gaggaaaa t t ggt gt ggt t gt t t act ct at gat gaaac gaat t t gaaa aaaact t agc t at at t at t a ct gt at ct ct ggaat caaac ct t t t aaat t aaaccagaat cggcat t gat t t gaagat gg atggcagaga t ct aact cgg gatacagagg t gt t at at gg ccgt ct t t ac gat t ct act c gt t t t t t aat at t t gagt t a at t t gat t ac t aaat t t aat t t ggt aagt g cat t cgccgg gcat cgccga gt t t at t t t g t agt gt t t t a t agt t aagt t t gaat ggaga tat t ggggag t ggt t cacga agcct t caaa t t t t gacttt t t t cagat gt at gcct t t t t t t cgt t aaaa t ct t t aaaag aagagct t gg t aat t acggt gaaacacaaa agt t t ccagc gaatcggaag 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1393 <211> <212> <213> 1400 DNA Arabidopsis thal i ana <400> 500 aagat t ct ga aact aaat t t t at aaat t at agaaagaaat t gaaagt gag t t cggt t t ac at acggt t t t gaaacaaagc gat gacacgc aagt t ggt gc agt gat gt t g aagcaagt ga t at t t t gt t t t t t t t at t aa aaaacaacag aat ct t t agt at ggt t t t aa gagaaaaaaa gagt agat t a gcgaaacgcg gagtgggcga ct cct cct ct cgat gat gat agt cat gt ct ct agaaat ga aaaaaaaaac t t ccaagct t aat caaaat t aaaaaaaaaa gaaat aaaca aggt cggt gc att ccggagc cct cct ct at t ct t t t t t gt at t t aat t ac aaat aaaat t tttttaaaaa ct t t at cat c aat t t gggt t at gct t t cga tgat t t t gt g at ct ggt t at act ct gat t g t at t aat t t t agt t ct t aca aat t t aagt t ccaaat t t at at gaat gt at ccaact at t c cggt t t gcat gat cacgcgt ggt t t acccc gaaatggagc gct gaaaaaa t cgt cgt t ct at caaat aag aaaat t acaa t t t ggt t t t t t at t cact ag caat ct t t gg t at at aat t a t ggt at t t t g t acagaaat t gcgt gaggt g t agaaat agt t ct t ct gaaa 120 180 240 300 360 420 480 540 600 660 Page 441 12689250 Sequence Listing.txt gttgtgtggt ttttagaggt caccaaaaaa aatctatttt gagatactaa t t t t gcattt at t caaagaa at t ggaacaa gagat gacat tcagt t t t t g t t t t t t t ct c t t gaacat t t caaacaagat at cct t caat t gat gggct a cgacgccaga aaagaacaag tgt t gtgcag t cgt agat ga gt t t t t t ctc at ggaggt ga caact cggt t t ct ct ct aaa gat t acacat t t aaaacat c gact agt at g aaat agccca t t agt gagt c gaagat cccg ccat t t gt t a agaaat cgaa at t t t gct ag act at acaaa acaagcact c at gt t at aga aagact t t aa aaagat t cca t at gt acat a taaaaggccc acat aaccct cacaggt t ga gt gagt t gaa t t t cctgttt ggt t gt t gca agt ggact t t t acgaat cct cat aat ccaa t ct aaact t c agt aaaat t g at t aaact t g ct t ggaaaga agct t at aac tat t t t ctga t t at gt t t t c acgat aacat tggccaagac t t gt t gaat a ct t t t t t t t a at t cat ct t c t t gat aagaa ggt t t agact gt ct caacac aaat at t t cg t gaaaat t gg acat at gaaa t t gact t t ag t ct cct t aat aatttttttt aaggaaaaag t at gaagct a aat ct t caac aacaaaacaa t t agat t caa ttgcagagaa 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1400 <210> 501 <211> 1283 <212> DNA <213> Arabidopsis thaliana <400> 501 tcaccagaaa aacaaaaact agaaaccagg aagttaatca acgtcattaa gt t at t atat tcat t gtaca ttttggtgac tggaagtttt aacctaaaga ct caat cct c tttgtctaca atgtat t ata accaaactac tttat t ctct tttctcgaga tgatatcatc aatct t aat a taaat t aaca cggtcgt t ct agctttgtag gagcgaaata atttat t ctt aat t atcttt taattcaact aaaagaattt taactatttg aaatgagtag acataactca at t cctgct g acaatttttt tttttggtca aacaattctt atgagat t aa agaaatagtc ccaaatagca aattctaaaa tagagacaca aacttaacaa at actcgata tactaatacc ttaaaatat t aaaaat at ca ct t ct at aaa t t aat aat t a aatctcaata ttct t aattt atagaggttt acatccctga attaataaat tttagaaatg aaaaact t ag at aact acat t gt cacgt gg aat t aaat ac caaaact at t t caact t caa cat cgaaat a gccagatttt t gact at ct a t t gat ccat a t cagt cgt aa agcaacaaaa agct t gat ac t t t t ctagt t cgat aat t t a t act aaat t g t gaat t aacg gaaaat cat a t ct at at aat taaacaagaa at t at cacga gcat t ggt gt ct t t t aaagt gt t t t aaat g t aaaaacct t gact t gaagc act caacaaa gct agaat at ct aaaacat t aat gaaacct ct aaat t aat at gaaat t t a tagaaacaac aat act t t t g gagt t aagca ct ct gt t t cg cgt at t cgcc aaaaagctt t gcaaat acgt aaagcaaacg t caaaaaaat t aagcat at a aaaaagt caa t at gt gt t t a t acaagat ag aacacaaaca caat gaat t a aat t t aacct gt aaaccat t t aat t cgagt t t ct t gt gaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 tggttaaaaa aagttactta tcaagacaag tatgaagtat cacgt gat t a aacgt t t aat Page 442 gacaccaacc agtcaacgca ttataacccc cctcacgaag t aat gacaat gct t atgtgt t t t aggt t t t aagaagaaga 12689250 Sequence Listing.txt ttgtttgatt tatttgtcac ctaactagag actctctcac catagtaaga ctttttgtct actatagtag aaagacgaat ttctaacaca cgcctctaat ctccgcgcac acacacacac cga 1140 1200 1260 1283 <210> <211> 0 <212> r1 <213> 502 1397 DNA Arabidopsis thal i ana <400> 502 t cagaat ct a agact t ggaa cct t t agaaa at t ct at ct a t cgggt ct t t caaat cat ca aaagt t gaga aacacaacaa at gcat cgac at at t gaat g at t aaagt ca at aact t t t a gt t at t acaa at at act t ag gct t acgggc t aat aaaaaa at act t t gt t at gcat t t ga ct at aat t t t gct gat aaaa t t aat at t t a aaaat t t at t t t ct t cgt ca t ct ct ct t t c tgggagggga t t at t t t t ag ttagat t t t a t at t gcat ca ttact t cttt t at aaat aag t t cgaagact att aacaaca t act aaacgc aat gat gat g t aaat aaaaa t t t aaaact a ct agt t t at t t ct t t t at t t aaacct gacc aaaat at at c gat t gagt t t caact t ccaa acaacaat aa at ccagt t t a t t agt cgat a agccat t cga ct gat ct ccc tcgt t t g act caaact a gaat cat t ca aat aagt t gc ct aacat t t t gt t t agt ggt aggacgat aa ct t t at aagt cat at act ac gt t t caat ga t aat agt aga t cat ct at aa t t aagt at ca aaaaaaat t g at t gt t at t g aagat gt ggg ct acat t t ct ct gaaaaat c t t t ct t t t aa t t gaaat gt c gat gt at t t g t t gggt acat gaaacaaggc acgacgat ct at gact acaa act t at aagt gt caact at a t t t cct t t ct t at t at t aca ggat cggt ct cat t ggat t t aaat t cgagt ct ggt aaaca t gct aacat a t gcgt gt aag acat caat cg t t at cat cat t aat at gcga aagt t cgaaa t cat t t t t t t at aat t gagt at at at cact gacccaaaaa t at t at aggg at gt at gt t c at ct ct at t t cccaaact ca at cat at gga gt t ct cat t t caaaaacatt at t t ct gt ag ttaagaagga t cagct at aa gt agt aaat a taaaaacccc t at gt aact a agct cacaat ct t gcat aaa gaaaat gat t ct cgat t t t a aaat gact t g ct gcaaat at t t t aaaat ac t t t t aaat t a t t t cat at at t at acat t t a gaaaacaat t t t t t acgat t t t t t gct t ct t t t ct ct acg ggagacaaaa t t t t gt t gt c t ct t at t t t t tttaaaagac aaaagt aaaa acaagt aaag acaaat t aac aat at aat at t ct ct gt t ac t at t t gaat a aat acagt at t gct t t t gaa at aat gct at caact gagt t gt at aat t ct t aat at t t gc gt t ggt t t gt t ct t gt agag aaggcat t t c at at t at t gg at gccat caa t ct aat agac t t cat cgat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1397 <210> <211> <212> 503 1367 DNA Page 443 12689250 Sequence Listing.txt <213> Arabidopsis thaliana <400> 503 t cgt gaaccc t t acat cgt g at t acagat t t cgggacat t t t t t gt t t aa gct t gt ct t t at gat gggcc t cat at ct cg aat aaagt t g t aaaat gaaa cagt t aat at ccacct caca tggaagacca t t t t t t at t t cat agaaaca caat gaagcg act at t cat a gacaaaggaa ttaacgaaac cct aacat t t aaat caggat t ccgact t aa cct t cct t ca at ccat at t c at gt ct t t gt ct aaaagaat aggacaaggt tagaaaagaa t cct caaat g at at at aaat t t t aacat t t aat t t gt at t aat at gct at t t t t ccgt ag cgt acgt acg aggt t t caag at t t t t gct a aat aacaat g at aaccggt a at ct t t t at t tagcaacaag gat t t t gt cg cat cacct gt aagt t gat aa ccgggtcgga acat t cgat c t t t gct t gac aaat t t agga t t ggtagcca aaat gaat gc gt agaggt t g t cgcgaagct aaaaact cga t ct aat t t t a gat t t at at c aaacaat t t a cat gt t gcat accaat t at g at t agcaat t aat cct acaa aat ct at cga gt at ct t cga aat aat gaat t t at gt t cat agat t t t t aa cgt t at cgt t ct t ct at gaa t cct ggt gag at cacgaagc cgct t ccat a aaacacagac ct agt caaag at t at aagaa at t agt t at t caat ggt at a gt at gct aca ttct t t t cat t cat t ct cag aat gat t gca at agcat aat cat gt ct caa t t aacggat t t t t ggt ct ca cat caacaaa gact t cat at t at caaagct t t cgct gt cg acgtct t t t t aat at cgt cc aaaaacat t a t act agt at c caaagaacaa aacaat ccac acagt t ggt c aact t aaaag aat aaaaaaa gt t aaagt aa agcgaaagag aaaacaaggt gt gt ccccca t acgagct aa agt ct cacct t t t ggt ctt a at ccct ccat aaaaccggtt tgacaaaaaa agcaat t aaa acgat caaaa t ct at aat t c t t t aat t caa caggt t ct ac t t gtcagcag t ctt acaaaa t at ct cat t a tttctcc cccgaagct t aat gat aat c gcaagat t t a ccct t t aaca aat gt gt t gg aaagcat agc t t caat gcac t t ggct t ggc aat t ct t aat t gaacaacat agt aacacca gat t t ct at a caagat t t t a aat at aaaaa ct t t ccgaat t gct aaagt a at acgacaaa caat gaaacg ggct aaaat t aaaaaaat t g at ccaaat ac caat t cat at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1367 <210> 504 <211> 1289 <212> DNA <213> Arabidopsis thaliana <400> 504 tgatacgtta cgccttccta tccatccgtc atccaacggc taataacgaa acattagccc attatacagt gggcctccac ctgtaat t gt at t atccctc ctcacagtca tctctttct t ttttcttcaa caaaaatcct catttctctg tctttcttat gaaacttatc gattctcttc aacgatt aaa acacgaaat c ccat t t agga cgcaaataaa agaatctggt tcgt t gattg Page 44z t ccaat cgt t t cct t t t t t t gacagcgcgg t gt aaccat c gct t t agct t tct t atactt caat at agag ttttgacggg tt aat t t agc t ct ct t ct cg t ggt gagct c t t agt t ct ct 120 180 240 300 360 12689250 Sequence Listing.txt gct t aattta ttgtcct t aa tactcgtgga tttgt t cata aatcgttttc ggt aagagt t t gt gat t agt t t cat cggt g t t t act t gaa at aaat t t ca at t at gt t gc t gt ct t at t g ggt t t t gttg gt t t at at ca t t act t t at t ct aat aat t t aggaggagga cat t caat at at ggt t ct ct t t t t t ggttt <210> 505 tt gtt accga agact cgct t gt at gaat t t gat t t at cga t gctct t gt t at gt gagt ct aacaaaact t t aat ct gggt ct t t ct ct ag t at t gct t cc gt agt t t gt a ccgagtt agc gaact ct ct g gcttttt t t c tt gtt gt caa at t t cat gt t agat t t t gt g at t t ggggt t t t at agt t gt at t gt cgcga t ggt gcaat t t ccat t t at g at t t gct t t c caaaagaagt acgct at t gc caat at at t g ct at t t t t ag at at aaat at t ct gt gat gg t gaat aggt ct t cgat t t t at ct gt gt at t t aggt t t gt gatcgt t t t t t t gt ggcgat gagat t gt t t t t gaaaaggt t gat t t cggc ct t t agt cat t t cgt ggct g cgt gt aacat t cact t t aca at gt at acat t ct ct t gat t gaat ct ggat ggt t t gt ggc t t gat t acgt agt t gat t ga t t t cagt gt c cct t t t t t t a t t at aggaac aagcaaat gt t t gcaaagt c at t gagagag t gt cct t cgt aat t t t ct t t act ggt t gac ct aact t t ct t gagcagt t a gat t gt at ga t aat t t t ggt aat cggact g gaagaagaaa t t at gaat ca ct agat t gct ggat gt t tag cgat t t caag aagagagttt accaat t aaa t at ct gt t t g t at gt t gcat t t t t at gggt t t gt t t t at g 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1289 <211> <212> <213> 1343 DNA Arabidopsis thal i ana <400> 505 gt gaggt cat agt t t t at t t at t aaaaact t gagt t at t a acgaaaaat t t gct agggat acgat t ct gg acaaaat t t a aat cat at t a ct gat t ggat t ct at t t ggt aat t aagt at aact gt aaaa t t ggct aaat att caggacc t t aat t at ca aaaat cat ca aacat t t tag ct aat t t t at gaat ct gt at agcat gt t ac t t t t t at t t g aaaaccat aa t t t gat ccat t t t at t t at a t t t t ct at t t ccat at ggt c at aat aaat c gat ccaacaa t aaacgacat aaacgaaaaa gt t t t aggct t t t ggact ct t t cgt t accg at t t t t t t ct gct ggt cct a ct t ct gccac ct cat ct gat ct t agt t aca t t t t t t aat t t aat t at aga t gct t gaagc t at t gagggt aaat at aat a gaaaaaagaa t aaat ct t t a gat t cat agc t aat gagagt t t ccgt caac ct caagacaa t ct gt t t t t t t t t t t agct c t at at gat t a t agatt t t t g at gaaaact t at at t gt t at t t t act ccaa t ggaaagat c aaaat t gggt aaaaaaat ca t t at gt cgct t cgat act ct aacaact t t a at ct t ct gcc tttttttttg aacaat t t ac t cgaact agt t gaat t cat t caacgaat cc t at t t agt t g gt aaaat t t t acaaat act g t caact ct ca gaact gaaaa t at gt agt t a ct t act t gt t at at ggt aaa gacat cacat t aaccat t aa t t gcacat t t at ct ct t t at t acagt agaa at acaact t a gat t t gacga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 tctctgactt taatgtatac cgacataccc tatgatttag atgttgattt ttcccattct Page 445 t aat at at cc t at t at cat t accaat t aca acat gt gt ct catt gtt aat cgt ggcgt aa acatt cat ca tt ct ct ctt c at gtt aagag aaaaaaaaat t t t gt t aaag aaggt ct at g at agat aaaa acgt at ccat ccat ct at ct tccggcaaga 12689250 Sequence att ccaccat aacat at ct a acaactggac agctggctcg caaact t att agaacgt t ca t cagaaat cg gatt agctt a tatctagttc gtccaaatta cgagtcactt gtaatatct t actctttact ctcttctctt aaa Li st i ng. t xt at t at t t gca t cccatt gt t tgtgtgagaa t t aagt aaac aact at t t t c t at aaccaaa ct cacat caa t t gt aat aaa t ct t acgt cc gt t ggt gt cg t at act at at at aact gcca gt ct t ccaac t t at t cat ag 960 1020 1080 1140 1200 1260 1320 1343 <210> <211> <212> <213> 506 1519 DNA Arabidopsis thal i ana <400> 506 caact t aacg aat t aaccaa aaaat aacca t at ct t ct gc aacgacct gc ccaat gt caa accat gt gat t at at ct t ct t t at cgacct ct acact aaa t ct gacgccg at t gt cacat cgat at gat t accaaat cgt aat aat caat act ccat t t c aat aaat cct t t cagt acaa aat agt t cat acgt t gt aat aat at ggt t t gt t at at acc acct gccat t t gt caaat cg ggt gat t t ac ccccactt gc cat t ct gt ag at cgaagt gt t t act t agt c gcccccactt gccat t t t gt gaaaagggtt cat t aaat gc ccgggaattt att gt gattt agt act t ct g ccagt acaat aat act aat t t t cat agt ac act t agaaag gct t t at t t a cat t gccct a tgccgcccat caact t t t ct ct gt agcgt a aagt gt t aag ttagtct t t g ct at t cgt aa cgt at agcat t aagt gt t ct t t t gct t t ac gcct at t cgt agcgt at agc t acacat aca gt ct gat acg t t ct aat aat t aaaacat ca ct ggt cat ag act cacccga t t t t t gt t t t at gct gct gc t t gt agt aat gt agt aact g caat ct ct ag ccgtagaggc gggagaaggt t agcat aagc t gtt ctt tat ct t t acacat t ctt gacgt c aagcacacaa t t at t t gt at acat t t ccgt aat ct t gacg at aagcacac acaat at at t t at ct ccat c t t t ctct t at cat gt t gt t t at t caaggt a aaaagt t acc t t aat gat t a agt t gt aaat gaaagct ct a cat aagt aat ct cccct aca t at at cggat cat t ct ct t a acacaaacct t t gt at t gca t t t cgt gaac att gaccaac acct t gct t t t gcat gt t at gaaccgaggt t cat t gacca aaacct t gt g acgt t ggaag t ct ct t t gct gt t t t ct t t c ccgt agt act aaagcccat t agcccaat at tttttttaaa t gcat gt at t aat t gagat c gcat gct t t t t ct acaagt c at t t cgcat g at aaggat gt t gct t t acac t gt t at cgt c cgaggt agt a gt gt caact t acacaat t aa cgt caaaat a at agat t cag acgt gt caac t t aaaaagt g ct gt at at cc cgcgcttttt caagt t gct c t gt gct gt t t cacct t gt t g cgt ggt at at aaaaaaat ga tt at gaaacg t acact cat a t t t t t t t gat t gct gt cggg gt aaaaaaac caaat t t ggt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 Page 446 cat t cagt aa gat t t t at ct at aagaaaaa t t aaat ct t t 12689250 Sequence Listing.txt attaaccctt aaaaaaatgc aaaataattt gtttatgtga caaagaaatg at aact at ca agaagaagaa aacact cggc t at at at at a cgt acaacac caaaccaatc acttcactct ctctaatcaa aaagctttta acctcagccg ct ccgat ca 1380 1440 1500 1519 <210> <211> <212> (7 <213> 507 1296 DNA Arabidopsis thal i ana <400> 507 t cggaat ct g t acaagt aga agagagat aa acagccgt cg ccagtgacgg gat ct cccga gt at t at t t t ct t gcaagt c t aat cgat gt ggt t aat t t a t at caat act t aat cgat t t t t ccact t t g at act gt at c ct aggctt gc act t t acgt t ct ct t aat gt at ct act ct a aagaaat caa t cgacat aat agacagcaaa t ct t t gtgt t ct ggt aat ct gaat aagaga ggaagatacg ct at t t t at t cggt t agagc gaat t t t gaa tt t cgt agt a gggt cggat c gat t t t aat g gt caaaaaaa gt aaaaat t g gacat cacac t gaat t t t aa tct ct gcaac t ggact caaa at at acaaca agaaat at t a at ct aat aaa at cgt t ct ac at aaaact ca t at gt agggt ctt ctt cct c acgcaaagta tact t gtaat cagcgacagt gagagtgat c t t caat gaat aacggagtag ggaaggtgag taggccggcg tttgcggaga acgaagaaga ggat t gat gg ggct aagt aa t ct ct caat t gat agacaca aaact t caga agcct aggt a gaat acaacc t aat gt at t t ttttttacaa at aaat t t t t gt gt at at ct at at ct t t ac caaaaaat aa caat at aaac ct cagat t ct gaat t ggaat atagcgcgag t ct cggaaga gt gact gt ga at ct ccat t t ggacggagaa acaat ct aat gct gggct t g t aaaat t aac gt aaaacgca gaat ct cat t aatt agt gca tt ct t t t t t a tt at acggca aaaat t act a at t t aaccat cat gggt at c gat ct t gt ga aat aat aat g gcagcct cgt ct t t ca gaggaaatgg aagaacggac tggacacggc aagcaaagat t t gt ggat t c gct acacat t gggccaggat gcaaat agcc t gacgt aaat gt gt t t t aca t t gat aaat t t at at gt aat cccact acca aaat t at t ca acat at at gg t ggact aagg aatttttttt t at t t t acga at act ccat a cgt ct ct t ca t gaaat caat gacgcct t ct ggt ggcgct g cggagactt g t t t gggt t t c t t ct aact t a ccggt t agac aaat at aaaa ccccct t cag gaat ct ct t t aaagtttttt t t aagt gt ac cct gt t t t cg t t aaat t t ca aacct caaac agct t ccaca t t caat aggt gcgaat at cg taaaggaaaa t at at t cgt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1296 <210> 508 <211> 1277 <212> DNA <213> Arabidopsis thaliana <400> 508 tctcccaaat aaaaatgaga gcaaacacta atctaatatt aaattgaatt aaaaactttt Page 447 12689250 Sequence Listing.txt aaatagtgga aatatatacc ctaaattgga aataaaaaac ccaaatataa t aat t t t aaa ggaaacccaa gt t gaaat at ct aat t t at t act t gt t t t a t aaacaact t t t t t t caat a aagt t t aat g t ct at t aaca t t t agcat t t at t gt t cgct gt gt aat cag aaat gt gt t c t gaagt at t t cat t acat ac aact t gagct gacgtagaag t gccgt t t t c agct aaat t a agct t t gt gg at aaaaaat c at at aat acc at accct at a t t ggt t gaat gt gt t act t t t t t aat gct a agat at aaat tgaacaacaa t gt t aat t aa t at aaat at t t at t ct gcaa t cgt aaaat t t aat t at gct caat t gaaaa ct gt at t gt c aat gt aaat a acagaagcat cgt t t ct act gaaacct gac ttggagg t ct t t t aaat at aaact t at t t ggaaat ag agat t t t at a t aagaat t t t tttttgtcca t t aaaact ga cat t aat t ct t at aact aga tgtatgagaa t t cat t t at t gt t t t gtgt a at gacgt t at cacaaaaat c gcaaaggaaa t at aaat ggg cgt t gt gact ttact t t t ca gat ct ct ct c ggt gaaaat a at t aaaat ga aaaact caaa t aaact t gt g t caaat aat c aaaaact t aa gt t gat t aat tttttaaaaa aaaaaaaat c ct t t ct ct aa t at ct gat at at gt t acat a aaacaaacga gacaaaattt at t t at t t ct ct t at t gggt cccgt t t gt g at t cagaaac t ct ct ct ct c t at accct aa at caat at t t t at aat at t t gt at t at t at at t t gagt gc aaat gt gct a t aaaagt gt c at t t t gt t t t aat ct act aa t t cagt t cat accagt t aac aat t aat aga t aaat t ccga t aaaaat at t t gt cct aaaa cct ct aat gg at t t aggaat gcct ct ct cg t ct cgat cgg t at t acaaac at t ggaaat a ct t t t aaat a aaat t t at t t t gt ccat aaa t aat t at gt g t t t gtgggaa acacaaaaaa at act at t at aact aggt t t ccagt t aacc ct t aaat gt t at caaat t t a t t cat gat t a t t agat ct t a ggccat t t gg gct t gcct t t ccgcact gct t cgt ct t caa at aat at t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1277 <210> 509 <211> 1331 <212> DNA <213> Arabidopsis thaliana <400> 509 agtgattccc cgtaactcat gctctgtgct caacagatca ttcaaagttt gtggatcagc atcctgtgag ctaacagcac cttggaaaag aagcttcttc tgttgctcaa gacccctgca tttggtcggt tgagtgatga tacctacaag ctct t gaggt tccaagtcag aacctacagg ccactcagat ttccaaaatc aataatccaa caataacaat actgtatacc aacaaatccc caaaatcaaa atccagactt gcgcaatcaa agaaaaaaga gactagagat ttcgttcaga caagaaccaa cagct gaat a ccgt cccaat t t t cagt at c aggt gact ct ct cat cagga aaactt gt t t t t cct cct t g ccact ccaag aaat gcggt a Page 44E at agaat ct a aat ct ct ct g t cgat t ct at t t t gcagt ag ctt gcaaaaa t acat gt cat t gacat t cga tacaagcaaa aaat caaacc gaat cgat t c aact aat aca aaagt t gcca cct gt ct at t at ccagagca caacat cact cgt t ct ccat aaaacct t t g t gt ct t gt ac caggaagaaa acgagagcga 120 180 240 300 360 420 480 540 600 12689250 Sequence Listing.txt aaagggtct g aaaaacagaa cat gaaaat t cat t cgt aga gagaacaat g aat t aaat aa t cat at t t gt gaat t gt t t t t gt aaat ct a agat t t at cc agaaagcat c t ct ct ct ct c ggaggatcag <210> 510 aat t t t agca act gat ccca gcat cggaaa ggagatggga agagaatttt at t t gct t t g ct t t t t ggt t agaaaat gct t t t att acac aaat t t caat t t t ggact t g t cgagt ct ct aaacgt agt c at cagaat cc gaagaagaaa agatgagcga t t t t t at t t t cact cacaga t aat cgact g aat t gaat t a acacaaaatt agccaaat aa tctgat t t t c cgacggtcgc aacagaagca atggccgagc gat gt gaat t acct ggacga t t t at t gt aa acagaacaga gt t at t t at t t t agat t gct act t t t aagt gaaaaaaaaa t ct at agacg at at caaaag tccaaacctt aaaataaaat acagcaatt g t gcaaat t ac gat t t t cat t aaat aat aat agat gacat t t att accct a tcagacagca t t t act t ggt agagaaaaag caatcaccgc tccat t t ct t tgaat t cggc caaagaagaa gagaggtcaa t gacaaaat t ttctctattt aaaaaaaacg aaaacat t at aat aat at t t cat at aagaa cgatt ct ct c tct t ctcttt 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1331 <211> <212> <213> 1385 DNA Arabidopsis thal i ana <400> 510 aagat t t t cc aat gt gct ac at t caat t t a at t aaagaac aggt aagt t g t agat t t ct g aat aagaat c t t aat t t aat at t t at aaat aaaagt ct gt ct gt acat ca cat t t at cag caat gcat ca ct ct acgcct cagt ct ct t c caat cgaaaa ct t ct t ct ac gctacgggaa at gagat gt t t t t t t ct aat at at t t t t t g at gt gat t t a t at t t t t aac aaat ccact a aacact caat t t t acat aaa t t aagat caa t at agt t at c at t t caaat a gt cat at ct t cgaaaacagt t t at t ccat t aaaagaaagg gt gt agat ct t t t gaacct g t t t t t t gct a ct agaat t t g gt aagat at a ct gt aaagt c t aaagaagt c act at t t t ca t t cat t t aaa t t at t agat t t t at agat t t t caagt t gt a acagat ct ca cagcat ccaa t t t t at t t at ttttcaaacc t aaat ct ct c gat ct t t gat aaaat gct ga aagt at gagt ct t t cct agg agt gagt t t t t t t t caaat t t t t t caaat c gt aacagt aa aat t t aaaat gaat acaccg aact t aaat c at act gt aat acact agcat cact t gt caa t t at cat t cc act t gcaaaa t ct ct ct aat t gt at gt t t c tttttaaaga t t aaat t gga acaaat at ag t at at aat t t ct at ct aaaa ct t t caaat c aaaggt t gat ccat aat t aa ct ct aaat at taaaggccca acccgt t ggg ggct acacac cct t ccat t g at t t ct cat t t t cgaat cag cat cgt t cgt t ggagat ct c aaat t t agct t at at acat c gt act gaat t t gat gat t t t ct at gagat t ct t caaaat t t t t t aaat t t at aacat aaa acacaat gt a t aat accgt c cccagt ggcc gt gt cagat t gat ct ct t aa gt at ct t cat at ct t ct ct t t t cgt agt t t gat ct cat cg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 449 at tct ct gt t at gt t aaaat t at ccgt gaa t ct ct agat t at t gt t t cga t at caat gt g gaaga ct t at cact g t t aggt t tcg gaacat agac cgt cacct at ct cat t t gt t tgtatgt t t t 12689250 Sequence attcagtgtg tttgatatct gttttgtttc tgcttttgaa gagtatgtag atcttacttc gaagaagatt cat t gtgttc tatgcctatt ttctctatgt gggttctgat tttgtaggat Li st i ng. txt aaat ccgat t cgat t t t gct ggat t cgcgt t t aat ct aga t ct t aat cgg t t gct ct aga t gt gt gt agg ct agat t cgt t gaagaat t t t gat t aggt t t gaagaaat g t t gt t gaat c 1080 1140 1200 1260 1320 1380 1385 <210> <211> <212> <213> 511 1356 DNA Arabidopsis thal i ana <400> 511 cgcaacgat a t cagct gt ct ct ggt aat ga t ct ct gaat t aacggt t at t t agagat aac t cccggt t aa t at t aat ccg aaact t caca cggt t t gt at act t ggct gc agtggcggga gt cacaact a gt t agt t t t a t act t cagac cgcct ggaaa gagcagagat t t gcat t at t t at gct t at t aaaaaaaaac at gt aat gcg ct agcaaat t gcagccgcgc ggt gcct at g act at t t gat t t gcgt aaca gaacgaccat caggctgcgt t t gaaaagt t ccaat aaagc at aaggagaa t gt agt agcg at gccact ga tcaagacaga tt at aaacaa t t gtggaaaa agtggaggag accaat acaa t at t gagat t ttcaccggaa t ct ct gt ct a t gt at caat t gt at at acaa ct t t aat agt t cggat ct at tttttggccg gaaact gaat ct aggacaac ct acgat t ca at t at cat t t gt gat aat gt t cggt gt gga gat ccat ct a gaaggat t ga t t t tct aggc t t t t gt cttt t t t gt t gcaa tgt t gt t t aa gt t ggt aaat aat cagcagt gat t t agat c cgat t gt caa ct ct ct caag t t gagaggat caat ccaat g tcgatgggcc t aaagcccat aaat t gaagc aggaaggat a caacagattt acaaaagct t ct at cgaat a gt t ggagagg t tct at gat c gat ct aacgc cat acagagc t at acct cca caagt t cgga gccaaat cca cct ggaaaca ggt t t ggt aa at gat at cgt aacat gat gg tttcccgcca ct t ccat t gt aat at cct ca t t t gt t ctga tt gat t t t t t ct aaccct at t agt t aaaaa cat t t tct ag aagaga ggt t t t gat a at t ct t ct cc tat t t gt t cc t t t act aacc t get t ccgca t aaaact t t a at gcccccga agt gt t ggt g agagt t at ac aat t t aacgt gggaaacgt c tcaaagaggc t ct gat gat a ggcaacacca gctgagcaga ttgcaagcag acggt gt cgt gt gat ggat a cct t gaggag ccct aacaaa cccagagct a gt cat t agt t t cat at at ca aaaat ggct a caggt ct t gt gat aagcaca aaaggagct t at t t ct t t ct gacgaggaag ct aaaat caa at aaccaaac gact aaaat a gat gccat cg aaacaagacc tcaacaacac acgt act ggg t caact gt t t act t gaat ct ggggaagcaa acat gaaaga gaagat aaaa t ct ct t t aat t at t gt t gac t t t t cgt cga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1356 Page 450 <210> <211> <212> <213> 12689250 Sequence Listing.txt 512 1296 DNA Arabi dopsi s t hal i ana <400> 512 accagct cat ct cgat gt ac t ccaaaat t c accct aat t c aat t ct cgt t aagat t aaaa at t cgcaaag at t t ggat t g t ct t gt t gga gaat gaagaa t t aat t aat a cat gt t t gt t ct t t t t t t ct gcagcaagca aat cacat t t at gat t t t ca t cccat gt gt at t aat agac t caat t aat a t t agt act ac t t t t t gt at t ct cgat t ggt g gagagcaat t t cat ggcaa ccagcacct a caaacacaca gaaat t caag aaaaaaaaaa aaagt gggaa gagagat t t c at t t t t ct t t gaat ct cccc cgagattttt t ccct t gct g ccct aat t ca aat t cgcct t t t ct ct at ga t ct act t t t a gagt gt at at ct t at at ggg gagcaaacat aagcccaatt acccaaaat c at ct ct cgct gaaaacacaa caagt t cct t t gt ct t t cac aaacagaaag t ccagt t ct a cagt at ccca aaaaaaaggg aagcct aagc t t t ct t t t t t t gcct aaaca taaaagaaag at t cgagt gt acact t caac agat t ct t t t agat t gt cac acat t t t t ca t t t aat t gaa ccgct aaact t aaaagacct aat t aat aga t ct acaaagt ccgt cct t cc aaacagagag agagt t t t t g cagct cgt ac gaaaaaccca gcagct aaga aaat caagga aaagagat ga gaagaat caa ggggt gaaat aact cccat c agggacccac at aact acgt t at acact at ggt t ct at at t caaacgt aa acat at at ac at gacat agt aat at acact t at at gggt t gcaaaaaaga gaaaacact g t cagat gaat at acct acct t cat ga t t gt ccat t c aacaaaaaaa at t t cacgt g t gcgat t ccg cgat ggaagg ct t t ct t t t t t aat agagaa at gt t t t t t t aagt t t ccat t t t t t t t t gt ttacacgaca agagagagt c cct t t t aat g aaaaaaagaa t cgt gcgcgt t gt t agaacg t t aaact t aa agt t gt aagc agct gaggat taggaccacg gcct ggcaac t cgct cgaca aaat ct ccaa aaaat gaaac agat ct cgcg t t aat t t gag ct t t t t t t t t at aat gat ga t t ccat t t at gcgat ccat a caccaacgcg cgagt ggt ca at t gt ccat a aat t t at gt c gaagt t at gt at at gaaaac aaaggcccaa t at acgt t t c ct t at cgt t g ct cgct ct ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1296 <210> 513 <211> 1359 <212> DNA <213> Arabi dopsi s thai i ana <400> 513 ggat cgaaca ct ct ct cgt a cgt ca~ ctcgcgacgg aaaggt t gct gagcc gagaaacgcg acgaatagga gagaa! catgactgaa aaaaaacaac cggag gagagtcgct acaagtcgct tacgg ctaatttacc ctcaaatttt attatt aggaa gagt c gaaaa at t t c cgagg aagg agcact gt ga ggt acgagaa cgaact cgat gct cacct cc gagcagaaat t gacct gacc Page 451 t gccagt gag acgagt ccgc cgt gaat ccg cgat t t t gga gggaaaat t a t gct ct gt ct gat gacct gg at agaaacaa at ct caact c ct ggact ggc aggct aat t a at at gt gat a 120 180 240 300 360 12689250 Sequence Listing.txt ttgtgacctg ctttgcctat atggctatat gtgataccta taatcacaag gatatttcag gt ggagaat c gaagct gt t g gccat t gcag gat ggt t gt t t t t aat ggac t t t ccat t ga gt t gat t ggt at t ggt act g at t at gt t t a t agat at ggt aat ct t t cat t ct cacgt at t agt t t accc aacaagcaac ccaat t acct t ct t ct t gt c agagaaagaa ggccat t t t g gaat gcaaac t t gt tgtagg cggt gt at gt t gct gtagca agat aagt t a at gt aaat t c t t gct gact a t t ccat t t ca ct at act acg at aaaaact t t cat at t t t a gt gt cact t c aat gccacgt cccaaagt ct at t gaagct g gtgtagcggg agaggaaaga aaat gcct t t cat ct t gt ct gt ct ct caca aaat gt t gt g aat gct tt ag agagccact c aat cat gat a t ct acgt t gc t agt cgccaa gct gaaaaat t cat gt cgt c gt t t act cac cct ct t cct t aat aagacac t cgcaagt cg t t t cacaaat caat gagt at t gcact gt gt t t aagct ct g at t t gaat ct agaat gt at a ct ct t t gct g t gcat t gact aagt t t t gca at t gaaaat g at cgt cacag gt t t t cccca act cct t t aa at ct ct t gg t at at gggag agcgtgagac gggaaacgga gt t aaacgct gagcacaaca gt t t ggat gg ggaat gaat a caggcaat aa t t gcaat t cg t t t t ccat gt aaat gt t t aa gagaat gaat ct gacgaaga agaaat at cc acaagct cgt agat t gaaag t t at t gct gt t acat gct ca agct gt cct g act t gcaat g ct at gaacaa gaaagat gt g t at accaat c gcaat cgt t c ggcgt t cgga at t agt agaa ggt aaact ac aat t agaaac aaact aacac aact gt t t ca 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1359 <210> <211> <212> <213> 514 1318 DNA Arabidopsis thal i ana <400> 514 acaacgt gt t gccacgacag ct agt gccag acct gcagga acggagaagg ct gt ggat t c gt ct cgcat c aaat agt at a t gcacaact a gt aaat t t gt aagaat ccgt agaggaggat t caat act t t t gat gt gt ca cgt t gacgat t t ct t gcgag cgat acggt g agaagt gaat t gct gcggt g t t gat ct gt c t at ct agct t at cgat at ga t t t gat t cgt at aaaat ct g gt t at act aa t t t ggt t t gt ccgt t ct t gc gat aaat gcg acgt ct t gcg aat cat gaag agct acat aa gt gagt gaga t t t at t aat t agcgt agt gt t at ct accca caagagaaaa ttaaaaggag cat t act aat act aat t aac ttttagaagc t t aaggat t a tgagtcagag aaagagacgc gat gt gggag t ggat caaaa aat gt cat ct t ct t gt ct gt t t aacct t aa aaagt ggt ct aaacaat at a t t at gct cac at at at at at at ct gcagat cgat ct t ggt aacgagt t t g cggagatgaa aagccaacgt t cgaat gt t c cat cat caac gt t t cccct t t t agt cccct aagggactga ct at act agt ccagt t t aaa at at at at t c t ccgaagct g cat gagt ct t t t aggt t t t g gaagacgaag gaaaat ct gg t gggaagct t agagaaaaca t ggct t t t aa t t agcgt agt ct agt t ggag ct t t t ct cgt t aaaat aaaa ccaaaat t aa 120 180 240 300 360 420 480 540 600 660 720 780 Page 452 agat t ccat g agt t t t act t t t at ct at t t t ggact cacc t t aat t gat g t at at aagt t agaat aact t t acgt ggttt gaaacaaaga gct aacagac t t act at at t gat agt ggt a aacaagaat c at cact t t t t gggct att ct at acacatt a t gggt t t gt c acggcccatt 12689250 Sequence ctaaagaaga agaagaacaa catctttgtc tttattgacc tactttaagt tgctttttaa tgtgtgcagt cagtgtgctt cacctt cat a cat gagagct at t at t aaca gt t t gat t t t ccacccaaaa aaaaaaaaaa tagcgacctt ttatgggcca at aat gt aaa cat at t t agt Li st i ng. t xt acacaat aag aat aaagt aa t t gct ggt gt gt t act at t g gaatt cat ca t gacagt aaa gaagaaccat at gat at gag acaaggaaaa agatt ggt gt tt t ct cccct acaat t at at t t t t atctca ct t t gtt agt ttt gt ct gcc t t t t ggacat gcccagt aag ggggtaaa 840 900 960 1020 1080 1140 1200 1260 1320 <210> <211> <212> <213> 515 1280 DNA Arabidopsis thal i ana <400> 515 cacatggggg cagt cggt aa t t aat gt gca gt aagt t at g aact t t at aa at cggatt at cggaaacaaa t t t ctt gcgt gtcgacccgg cat at t t t t g gt ct cat t t a gt acgt t t at aaaaaaacac gt t gt t ccag cct cat aact ct t ct t t ct t t cagct caga ct ct t act ga ct ct ggt t t t at gt gcat ct gt cggat ct c at t t t gaaga t aagt t gat t cgagact at a gt t t t gt aaa aaagt ct cgg catt at catt agagagt t gg t acat t t t ac tgcaagaagg tgggccgggg t gt gaagt at gagt act t ac acaacat aaa gaaccaaaca t gcct t aacg t t gt gt gt t g t acat t gcat ccgat cat t t t t t t t gccct ct ct ct t t gg t ct t t t t t gc tatcgaaata agatgaagaa aataatacaa aatttgtcat at cgt gtt ac gaaacat cag at ggagtt gt t t at aat t t t t t t ccct t t t agt agt t tag tttccgccaa acggacaaag aaacaactt g agaact aggc aagt t acaac cgat gct gac aaaacaattt cgct ct caaa t gggaat aaa gt gaaat t t c cgt t t t cct g at ccat aggg t at aat cgga t ct t t cccac cat t t gt gt t t at gt t gggt t gcct t aacg t gt t gcgt t t t ct t t t gat a gat ccaact a cacct ct t t a tggcacagag accacat t ca at gat ct t gc acacat gat t gt cggccaca aat act at aa ggt gaaaaaa at aaacgaaa aact aagt at t t t t t t t t ct ttttcttcgc gct aat t cgt gggt t t t cgc ggaacgact g at t t ggggat cat ct ct aaa act t t t gt aa t ccat gagat gt t t agcat g acacactt aa t cact cact t t aagat agat gt acccaat t t cagct caaa aacgt agcaa ct aaact cgg aact ggt ct a t t aaat acga at aat aact c cct t cct ct c t ct ct at ct c tcgt t t t ct t t t gagaaat c t t ct cgt at a t agat t t caa ct at cagt aa at t t t t t cat t t ggatt aat at t at ggt t t ggagat t gt g t t t gcgt t t g gtt gtt ggaa gggat cat t a act cccaaga aaagagt aag taaccaaaca t ctt aagcaa accccaat t c at at ct t gac ct ct ct ct ct ct cgat ctt c ct t cat ct t c agcgt t t t cg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1280 tttttcctgg gagttttgag Page 453 12689250 Sequence Listing.txt <210> <211> <212> <213> 517 2006 DNA Arabidopsis thal i ana <400> 517 agct ct ct ga at agat agag t at t gt t t aa agaat t t t aa ggt caat agt ggacat gagt agagcct t t g ct cgcct cga at t ggt t aat aat gt caaaa aaaggaaaaa at gct cgcac ctcggaacgg aaggt at aag act ct t t ct a t cat caact t t t t t ct gaat t ct gt agat c t t caggt t t g at aat t cgcg acgt cgt t ct cct aggt t t t aaccaacgac ct t t gt t t t t caat t gaaaa t gt gt gat ct tgt t t t gtct agaagt ccag acgt t cgt cc ttgt t t t t ag agagagacca aggtat t t t t at acgact t a gat t ggt t at cat gagt gag agaaat agaa gt t at ccgct at agcgt gt t aat aat aaag gcaaat t t gt ct accgat t c caaat t ggaa gt caaat acg cct t t ct t t t aat cgt t t ct cat ct ct at c t gt t ccgt cg agggt t t cgt at ct t gat ct t cgat t gat t ggt at aagag t t ggt t t cga agt t at ggat t t gaat ggt t t t t gaat acc aat acagaaa cagt agt agt aaacggtgga caat t t cgt g gt gt t cct ag at t ct t t aag t ggt t gaaca aat t t t gat a tcacggggag t t t gat t ct t tcaacggcac ggcccat ct c aaacaaat ct t aaaat agt t caacgt aagt at aat caacg t aat t gt cag cct cct cacg ct caaat t t a t ggt gagt ct t t gaggat aa t t gat at ct g at cgat ct t c cat t agcct a at t gat at t c t t t t t gt t t t ct gct at aat ct t at ggt t t tggtat t t t t agcgt gaat g tcct t at t t t at caagaaat gaaaagaaga agagat at ca gt t cat ggt a aaaaagattt cat t t t at t t gcccaatttt ttaggcccca acgt at ct t c t gt t t t at at ct aaaat t aa ggggctat t t aaat ggt t ga gct cagat ag ggagcat aac aacgcat ct c gggt t t t ct c ct t t t t t gaa tgt t t t t t aa gagaaagggg at t t at at at gct aggt tag gcat gcat gg gt t acat gt a aat t ggt t t c at agat t cgt at t t t t cat c agat t at gat acgaagct cc t caaat ct t c aggcat gct c ct aat gt gaa ccaaact agt ggaat aaat a gt t gat t t t t gt cat gt agt cggct acgct t aaacacgt g t t t gt t gttc aat aat t t ct at ct gcggt c agggt aat t t ct t cacgcga cct agt act t at t gct t cat t t t t ct cgaa t t gt agat ct t cggt gaat t t t t ct ggaaa gt aagt at ct at agacat cg t t aggt t gac t gat t gcct a t act cat cat at t gt cct ga tgatct t t t t ggagtcgaaa cct cggt t ac t gt ct act ca t at ccat t at cagct gat gt aagt aaat ca ccaaaat ct c at t acacact gacaaat ggt ggagcagtt g accgt gacag t caat ct t t g cgcaaaagat act t gcggcg aggaatggag gggt aaaaac t at aaacgct at at aaaggc cat t accaaa agt ct t gcgg gcgt t t t ct c ct gggt t t t t caaggagt t c ct cacgat t g t t ct t at cgt t t at acct ga t t gt ct t agg t cgagat at a t aaacat t gt cct t t ggat t ggt ggt aaaa agcat t gaag aacgt gagt t gt agt ct ct t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 tcctctataa tgctatatct gatttcttca atctctatgg Page 454 ct t aact at g caccgat ct t t caaagct cc t gt cgct cac t t t t gct t t t cat aat at ag ttttctgctg acaacaaggc 12689250 Sequence Listing.txt t gt t gcagt g ct ccaagagg ccat cct gag t accagcgt g ttatagcttt ctttactttc cagtttataa ttttcttctt gt t cccggat ccaatcgttc tctcctccta ctacaagtcc gagat g 1860 1920 1980 2006 <210> <211> <212> (7 <213> 518 1307 DNA Arabidopsis thal i ana <400> 518 cggat ggt t g t agcggt t t t gcat aaaagt aaaacat aat t t at t gt caa t ct acaat ac at at aaacgt caaat ccaca tt ct aaact g caatgggaga ggggatat t t t caagt t t aa aaaaact aca aacat ct ggt aagt t aaaag t gt t t at t t a at gaacat aa gt t t acat t t aagcaacaac ttaaaagagc aagcact at a aat acacaac aggt agt at g cgt ggat at g gagt at gaga at agat caat gact caagt t at caat ct at aaat aacact at at t t ccat aaat at t aac t t t gaat t t t t gct ggaat a aggggaaat g t aagt t t t t t ccact t agaa aaat at gt at ccaaat t aat at cat acat t t ct t t t gtgt t cct ct t ccc t cat cat t aa t at acact t g acacaaagca agt gaccgt g gcgct t t t gg gagaagat t a aaat t gat ga t ct t caaaat at agagat aa t t t t t gaggt at aaat t t gt aaat ccgcag gaaccaaaaa t agct t t gat at t at gggt t gt t t gggct g agagt cacgt caaaat gact gct at aaaaa at t aagcact t caat t gt t a ggacccct aa ct agct act a t at ct t ccat cacacttttt acgat caaac at t ct t cttc aat agat at c gagt aaaaac at caagaggc agact aagca aat act aaat aaat aat at t t at t t ccat t aaaaaaaaaa gagaat at t t gaaat t t t gc gcgct at cgg agt at at ggt aaaaagt agt t gt t caact g t t t gcct acg gct caaaccc caaat caact at t at t ct t a t agt t t ccca ct t t ct t t t a gt t ct ccaaa t gat cct agc aat cct aact acaaagat gt gctt ggaat a t aat t t t t aa ttt ct aaaca t t gt t agat a at t aaat ct c gat t t cat ca at at t t t gt a aat caaaagc at cct t t t ag aat t gt caac gaaat at t at t acaat t ggc aagggat acc aat t aagt gg aaact caat a at caat cact ccacaact ac aacccca gaaat cgat g cat t t aat ct aat at t caag t t agaaat aa agaccctt at aat agaaaaa t gaaat gtt a at gt t aaat t gat t t t gttt agat at ct ag t ct ct gaaaa cct aat t t gc gctt acatt t ttgat t t t t c gt at ct aat t at ggaat aat aact t cat t a ggaaagtaag t caaaccat t gct t aat aca aaaacat t cc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1307 <210> 519 <211> 1344 <212> DNA <213> Arabidopsis thaliana <400> 519 cccaaaaaga atcgggtatt taatagttct aaaaatttat acatgatttt atccaaactc Page 455 12689250 Sequence Listing.txt gaaaaaacca aaaccgaaat cgaaccattt ttatatttac cctattagat t t t t cctat t agacct at t g t ct gct t t aa at at at aaaa cacgact aat ct cgt t cat a cct ct at aaa aat cgat aaa t caat at cat aaaaaaat t c gt t t gat t at t at t at t at g t t at agaaat t aat t t at cg t aat t t at ag accaccaat t gt t gt t aat t aagt gt t t t c ttaagaaaaa ccact aaaaa t t gagct t t t tcgaacacgg at aagt at ga aacat gaaag t t acat aat a t t at at agt t t at t gat at a t t aat aaat t t t aat aagat aaat t aat aa t t t at aat at agt t t t cat a t t t t act t gt t t t t aaaat g at aaat t aat aggt t t t act t at t agagt a aggaccat at aaat aat caa t caacaat t a ct t acat t t c aaact acaca at ggt t cct a gt t t t at caa t t t t t at t gt cct gat t t at aaaat act at cagt at aacc at t ccggt cc aat at t t t t t t t gt at aaat at t t at t at a t t t t actgt a aact ggt t ct t t accaaat t acct ct ct aa gt aat t cgt a cat at t t t ga act ct t agaa at cat at cca aaacat t ct c t cccaccaat aaca ccagat t ct g t gagcat caa at aat at gga t agagt ccat at t t gat t at t ct at aaat t cgagt t agga t t gaaaat t c gt at caact a ct t t t t t gct t caaaact t t agaaat at at aggaaaat ct at t aat aaaa aat t t t gaga t t t t cgat at at t gccagct t at aaaaat a at t t cct cct ct caaagcaa cct t aagt t t gt aaaagt ga t at cgaagag at aat t t aca at t t t t agt a aat act ct at ccagt gt aaa t at gt aaat c t at at at at a t aaat t cat a t agt gt t gt t at t ct at aat ct ct at aaat t t t cgcagt c gtagt t t t t a gcaat ct t gt aat t t gaaca at t agt caac aat gaat acc at t aaat aca t t t at t t t at ct t aat gt t c gagt ct t gat gct aat t cag t aagact t t c at cgt gaaga aaat t aat aa aaat gacat a t at ggt cct a t at at t at gt t t t gtt ctt t t t ct aaacat gaaggaaat c t gat aaat at ccaacat t ac accaact t ac t accaact aa at gaccat t t t aat cccct a ct at aaat ac ct act act t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1344 <210> 520 <211> 1324 <212> DNA <213> Arabidopsis thaliana <400> 520 ttccctccaa tgtcctactg tctcct t ct c taagtctctc tcattatcaa aattcatctt tttgtttatt tcttgtgttg tgtgtgatgc gatgttacaa tccaaccgcg tattcgacaa ccacaagcag caagctattc tccagaggtt aatctaaaga agttgctgca aagaaacata ttaat t ccca gtttgcaact ctccgcacca ctctgcattc aattacacaa aatgtttcac aatgaaacta aacagcaaga taaagcatct t gt gt gt t ac ctctgt t t t c agt t agagca caatgggaag t ct cagt cac gtgacgcaga t t ct t ccaaa cagagaagt a gt gct t ggag Page cat ggt t t t a t t cct cct ct agggatgcgt aagt t t ct t c aaagccaaaa gagaaggaga ct t agt caaa acact t t t t g agact gt cag ct t caccat g gaat caat cc ccgat t t cac gcaggtgcag t ct aaaaccg cggct t cgga gt aagt t t ag t at t at gt t c gt act t caat 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gaat t gaaaa t t ggaccact gaagggctaa gct gagat ca gggaatgaag t cat cagagg t t t t gt t t at aggt at aaaa aaaaat gat g gt gaaat t t t t ggt t gct t c ccaacgaat a aggat t ccag cagc agat ggt t ca gt aat aacaa t gt cggaggt t gacagt agg gat t ggt gaa cgaaaaacaa at act t gt ac gcccat gt gt caat t ct t t c t t gt cccat c t agt agat ag act ct t caaa cacaagt ct c agacat acca cagagactt g t gcagagt ca tggaagaacc gct caagaaa caacaat gga at ct ct gt t t t at t gaaat t t t t ggagat g t t t gt t ctta ct t t ct t ct g cat t ccccac t t cat cact c accacaccat ctttagaaga caacttgaga gcaagagtcg atgaaagcag aagtgtgcct t cgtt gaaac ggatcgt t gt ct cct agt cc gggtggatac gtgtggatgt cccaat t gt a ggaaacaaag ct act t ct ca aaaccaacaa tgt t cagttg tgaaagcaaa tgt t t gt t ca ttgtagtgaa t aat t cagca attagagaag t t acaagagt tataacaaaa cct t t t gaga at t t ggt t t a t caaacct cc gaagt agt ca tagcgacaga ggcggtgaga aggt gt caat t ggt aaat ca gcaat gagt a gt agat gt aa ct at at gaat t at gaat cat t gaaat ccca at aagt t gaa t t at aaat ag aagcacaat a 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1324 <210> <211> <212> <213> 521 1276 DNA Arabidopsis thal i ana <400> 521 ttgct t gttt t t gcccat gc t cat gt at cc t t aat gt gat ct t t ctgttg gt acct ct cc agt t at caaa t gt t cat cga tt gt agaaaa gt gt t ct t ct aat t gcaggt gt gagat ct t t t acat acgc aaagct t at c aggt t t t t gc t t t gt t gact t ct gaat ct g t cct t ccagt t t acggt gca t at t cct at c gaaat t t t ca caact aaccg t t at gaat t t t t t t t cattt ccccat gt cc t ct t at t t ac gagcaatgt t aaagat caaa ggaaaaact g agaat ct ct t t cgaat aat t t ct aaaact c t gcgt gt ct t caggt t gcgc t cat ccgt ca t at t aagcca t t t ctgattc gccaaat gga gt aat agt t c t accaacagt gt t gat gaaa cacat t agag gt t gt t ggag t ccaaat t ct t at gt t at at gt t act gcat t ggcat t ct t cat cggccct t t t t gaaat c agat caat t g aat gcgct gt cct ct gcat g at t t t gcat c acagct t gt c t gt at at t ct caacaccacc gcggaaagtt gaagat at ga t gacaact ga t cct ct at t c at ct ct t gac t at t ggggt t t t gct ccat g t gt ggcat t g gacagcgcac t gggcat t gt t t gt caat t c gt t gagt t aa ct t agat gag ccccct ct ac t at ggaact g ct ct cagacc ggt gagt at t caaagt gact caaaaagt aa ct gcgt t t gg tcct t t t t aa t at t caaagt gaact t gacc t t aat gt at g t ccaat cagg cggacgaccc gt aact aacg gt at agagat caatggaagg at caact gt g gt act t act c caaaccgt t g t ct at cacct gaaacacaca t caagaat ga t t t gt gcat a cccaagagaa tgaagacaca t t ct ct t ct g t at gaat at a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 457 at ct gat aca gat at gct ac aagt ct t gaa cagt cagt ac caaact cttt at cact cact ccaaccaat c at gt cacaag at ct ggaaat cct ccaagaa ccat at aaat gcat ca 12689250 Sequence Listing.txt at t aagattt gggtttgaaa tctgtctctt ccgtggatga aactggtctt agctttggta gataagactt gtcttagagc ctattttgca gtaatcttgt cacaacaacc ataacctaat cattaaagtt agatgatccg acaaaacctc tcaacaagac actctttaac actgacacaa agtttcatca ctttctcttg 1020 1080 1140 1200 1260 1276 <210> <211> <212> <213> 522 1272 DNA Arabidopsis thal i ana <400> 522 cat t t t gaat acgcat t gat t gaaat at cg gt gat gt t gg t cagagccat aaat t gt cac cagt at gt aa cact agact g caagt act t a t cacat t cca at aagat gag gat aaact t g aaaaggt t aa t t aat t at at t t gcgat t aa aacaat t t t a cat t acagct t aact cgat g t ccacat t cc gaat aat cat gacatt ggt t at agt at cat gcgaaat t t c t t ccat at t c ccaaccatt a t caaagt t ag t at aat gaaa aat t t t t aat at aaacaaaa cgt t gt aaag ttct t ct t ct t aacat gcca gcat gt aaaa t gt aacgaat agccagacga t acat t t aga aat ct ct gga cct cacct t a t aacagt t t t caaat cat t a t ccagat t t a ggt cct gact t t ggat aaaa acat agcat t ccat t aaaaa agctt ggtt g t aat t t aat c t aagt ct t at t aat t aat ca t t aaat t t t a t gt cacct ct ct gagtt gt c at acacacac tt gtt gt t t t gt t gt aat at aaaaat aat c t t t act t gat gcct cct ct c t act ct t t at t t at at at t t act t cat at g t t aagcatt g agaaaagat t t gt aaaat t t tt ggat gaac at aggt t ct t t ct t t gt gt g cct at agt ct gt t aat t at g aggt agat ac cgat ctt gt t aaagacaagc at gt cgt agt gt t at aat at ccacagcct t act at ct cga aat t aagaca aaat t gt caa t t t aat at at aact agccca t ct t gccaag gcgat gggt a cgt acgcat g agaat aaaat t gat gaacag aagact aaac t aaggt gt t a t t t caat aag acaaat t agt t aat cat act t t aagt agt t at ct at acag aaat at acac at agat t aat gat aagct ct ccat at agac aat at agaac t at ct agat g cct t caacag aat t gt acca t aaaat t t gt at gat at t aa aaaccaat at cgagt t t acg gtt gaaccag agt t cct cat act ggact ac t t cacaat t g caacat ccga t gt aaaat ga ggt gagat gt t t at t aaaaa ct t t t t ataa gcat gat gt t acat gcagt g caggccact a at t aaaat ac gagt gtt aca at cat cat ca t acct at caa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1272 tttaaatttc tctttctatc tactataaa agtgactctc taagaactcc aaagattaga acattgaatt ga <210> <211> <212> <213> 523 1264 DNA Arabidopsis thal i ana Page 458 12689250 Sequence Listing.txt <400> 523 at aaacggt g agt at at aac cct t t gt at a t caaagagt g t gt gat agag gaaaact agg tttttttttt aaacagct ag gt t gt t ggag t gt gt gat gg aaaat t caac gt t ccat acc ct cagt ct ca t ggt at at gc gat t t t t t at aat t cat aaa t t t t t gt t t g ct at t t t aag gtt agcagac gagt t gaaac t t gt ct ct t c ttga at gt t aat gg agct aaat t t aat gact cgt gtcggggaaa act t gt aaaa aact ggat t t t ggcgt gt t t at at t gaaaa at at accaaa cat t t agt t g cat t agt ct a ccct t t t t ga t at t cgat gt cact at cat t t t t at t t ct a cgct ct ct aa t t t agaagt c cat t t gaagt t t t at t t t t t t t gt at caac cat ct t ct cc gcccaaaggc tcaaggaagc cgt gt t cat t agagt ct act aagt gt at gt act gt accct ct gaacaat g t t t aggaact at t t gat aat at ccaatt ag acct cact t a acaaat cat t gagact t t t a tgctct t t t t t at at at aac aat t t t aaat gaat t at at t t t t at t t aat ct t aaat t aa gat t caact a t ctt gt caaa t t ggcct t t t aaact gagaa gat cat t agg aaccat t gct agaaagagag t ct ggat aat t gt t t act aa agact t act c at caat at t c caat t act ga ct aaaat gca t t gaccat t t acaaacaat c t ct t t ct cat t at at at t t c acaat t aaat act at cacca t t act aat at at t t t gt t cc ccagt t t aaa gt t t cct caa agcct t acaa gaaaccacat t t cggat t t a ct at ccacat cat t t t t gaa gat ggt t t ga gt t ggat at t tatgt t t gt t aat t t cagca t ggaaagt t t accact at t a t agt aat t t t ct ct t agaag aacgt t t at a aaaat t at t t caat ct t caa at t aaat at t gaat gaat t a t agt gt ggac aggacacat c t act t caat c aaggccttt t t t t gt t gtt g tcgaggaacg gt t ccgat t c gt t agaaat c gat gt gt t ga gaaaagaaaa gaggact at g at agt t t ct t t t ggct t act t t t t agtgag t ggact aggg t t t t accagt t t t t ct t t ct cccct t t t at t at at gt t t t acat agt cgt aat aaacat t acat ggat ag t ct gat ccat tt gt aacccg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1264 <210> 524 <211> 1331 <212> DNA <213> Arabidopsis thaliana <400> 524 ccaatacatt cgaacacgtg attgt aatatagatg tccaactttt ttttt( ctgaataatt caaagttcca aacta gatcgatgga ttccaacgat tcgat cagcaaactc tatattgata ttt ct acaaaataat aaaagaaaat gatcgl tgtaatcaca taattttggg cccaal cacatagaac atcctaaaat agggtl :cgtt aattttcttg cgggt gggaatatag gtata tat t aataca acaag tattaatgaa atttt ttaattagcc at caa agagcat t cc :ccta tttttcaaat taaaa t gt act t tta Page 459 at t ct gt aag acgt ccagct at t gacgat a at aagat aac at gcgt t gca at t gaaat t t gt aacat gct Sct at t t gca agaaacaaaa t agct acgt a aggt cat aag acgat t gt ga cgat caat t t aat t ccat cc at t acat agt at t t t gat at 120 180 240 300 360 420 480 12689250 Sequence Listing.txt t t t cct t t ct t ct gact at a aat gat gaaa gt caat t acg at t accgat t t t at t acaca gt aaat gt t a aaccagt agc t acgat cagg at ct at at t c ccat cagt t t at agt t t at a agct agaacc at ccaaaaaa gt caacaacg <210> 525 gaaaaagat t cat t aat t t a aat aaaaggg at act t agaa t gct gact at t act t gt gt g t aat aact t g ct ct ct t at t t aat ccccct t gaaaat aga gact t t t gat ccaacaaacc ggat t agt at at gaaaat aa agt at at ggc ttttaaaaaa at accat aat aaagaact at at gagcaat t gaccaacat g aaat t aaat a cacacct aat ct ct ct at ct t caat caat t cgt t t aaggc cat t t ggt gt t aat gt t act caaat aaacc aaat t atctt t tagataaaa gatcttttgt aaaact t aac ct aaaat ct g at at t t t t gg at t acat act at t aat t t t a at aact aagc ct t cat ct t c at ct t t cat a gat ct t t t cc t cgagagaat t gaccagct t t gt acct gt t at t t at ggt t agat at at t t acaaagaaaa gt agggaagt t t t at t t at t t at t ggccat t cgact cgat at ct t cgcat t at gt gt gt a t at ct caat t t at cat t cac t caacat aag cat agt act a at cacagat a gcaaat acaa t at acaaaaa tcaaaaacaa t gt acaacaa at ggt gcgt a at at agat cc t cat agt ct c t gt gt aaact gt t t t cacaa t gt agt aaag t at gagt tag accaaaaat g gat aaaagaa 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1331 <211> <212> <213> 1374 DNA Arabidopsis thal i ana <400> 525 ctct t t attt t t cat t aat g agaaaact t a t at aacagct aat aggt t t g agcaat at t g at at at at t c cgct aatt ag aat t t gt cca aaagt t aagt tttcaacaaa gggt ct t aaa t t cgt gt ct t gt t t at at ac t aat gt gaaa gt cgt gact c at aat t ct ac agat t at cat gcat at ct t a aaaaaaaaaa at gaat gcgt agcaat cat g t at t cct aag gt t gagaat g tgaccagcac at ct t act t t ct at t t cat g t act at t t ag ggacgact ct t at acact gt gcgaacccct t act at t agt ttccagacca at acggat t t t cgt at caca gagt ct gcaa t t agaaagt a agacaccaat gt t t t caaac at ct t gt ggt ct ggat at aa t at t t act at at agagat t a cat gaaat cc t aagcaacat t t t t t at t aa t t gcaat gt t ccgcaagtt c at gggct t t a t acct t t at t aat t t aaccc t t t t aat cat cagaaact at t at t t t at aa t aaat gt t t c t t aat at cat t t agat agag cacgacat gg tacaacaaga at t acgt at t cgt t t t agt c aact aaact c ctt gaaaaga at t t gaaat c t t t t gagt gt caaaaaaaaa gt t gaact ga t ggat agt t c aat t t t t t t a act cgt cgt t at gt acaaaa acgt t t aaaa aaat aat agt aaacaaagca at agt t at t t aacacaacat t t t t t acgt g t t gt t at at a aat t gt at ca agt at aagca gt aagcaaca acgat ct ccg gacggt t tag gcgaat t t ct gaaaaagt ct at t gat t aat aaaaact at t acat ggt caa acat at agt a t t at gt t aat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 460 gacgt acaat cgt cgt ggt t t acaagat t t gt t t t t t ct t gt ct aatt ca at t t ct cct t t aact aagat cat at cct cc gt acaaat t c at t t t cat t g t agt cgct ct ct aaaaaat a gct at aaagt aat cacaact ttgaccaaac tagt t t cttc 12689250 Sequence tagtattctt cacctgaatt ttctttaatt agccatctcg aataggatgt ttatgaattt t t at ct t gaa aat gat t t t a at aaacgtt a t gaccaagt c acaagaaaag gaaatgggtc ataattttgt attatcaata acacaactct cccctcaaaa Li st i ng. t xt at t t gat gct aaat at aat t aaaccgaccc t t aaat t cgt cat aat caaa at gact t t ct t t acaccat a cat t ccat ca aaact acgt a at t t caat gt aat ccgact t t t t cgt ctt a t cat cat agt t at aaaacat aat acggcca aagg 960 1020 1080 1140 1200 1260 1320 1380 <210> <211> <212> <213> 526 1400 DNA Arabidopsis thal i ana <400> 526 act t ccacca ct act ct t ca t at cggcaaa acat t gt cct at gt cat at a t t at ccaagt gat t t aaagt aaat caaaga t cct aaagtt cct t caat ac cat t t gat gg cat ggt acca accaagattt agaagaact a at gt t t at ct cat at cgt ct agt t t t agga at aacaaaat aaaacgaaca acaaaagt ga accaagt cca at acat aaat gaaaaggcga t t ct t caacg t t t t agcct t t ct ct t gt ca gt at aat ct a ct t t t at t ag gat t t gagt g t gcaaat ct t cact aaaacc agcccct aaa aat t t gagt a agat t t at ag at agt gacaa gccact gat c ct agt aagat at t aagagt c aaaagacat a at t t t aat gt t t t t t gt t t g gaaat at caa caat at t cat at gt accccg aaccaagagc tttgaattga atagtcaaaa ataattgct t t at gcaacgt acacaaaaga acaat ct t cc t t act ct aca at gaaaat ca acat acat t a at agt t t tag at ct caaaac act gagaat c gccgccgggt t gat at t at g t at gcgact t gact at at t a aaat at caaa gaat t t aaga ct gt t t gct c t t gct ccat t t t caat cat t t t aat at aat ct t gagt aag ct at acaaaa aact t gt at g cat aat aaag t ggt t t t aag ct t t gat t ga act gtt attt ggt gaaattt gt gagat t t t t cat caaaac at t t t t t gt c aat ggaccca cgacat ct ct ct ct act t ag gagt agt t aa caaacaaaac aat gaaaaat cat at agt t t gacat t acat gggaaat cat agt t ct gt t t aacacgacca aagaaaaaga at t gt gcat t t at cat ggcc at acat at gg t gacat t ct t t at aaaaaaa gaaggaattt agagaatgt t t aaaat cact aact aaaact accaaaggct ct accacat a ggccggcttt t t t t at caat tt aat ccaag aaacgatt ca gt acat gt at gt att aaaca agaat t gt t c aacaagaaaa aaagt caaac cat t t acat c t at cat t t t t t t t t t t gtt g at ggtt caga at t ccgat t t at aaat agt g ctt agtt aaa aat agcaat t ct caatt cat gaaat t cat a cacat agt ca gt cacat ggc cagaaact ct t acat t t gaa gact t gt ct t agctt aagt t t aaat at aga gt t t aat aga aaaat at gaa t t gaat t t ag t cgt t t cgaa cactt at ccc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 aatagacaaa tgaccaaact acccaacatc tacccctata tatacctcac cacctttgcc Page 461 12689250 Sequence Listing.txt ct ct caacca caaacaat aa 1400 <210> 527 <211> 519 <212> DNA <213> Arabi dopsi s tha i ana <400> 527 atggcgttcc ctaaggtata cttcgacatg gt gat ggagc t gt acaccga t aagact ccc accggagaga aaggtgttgg cggtaccgga caccgt gt ga t ccct aact t cat gt gccag ggcggtgagt cgat ct acgg gagcaagttc ggaccgggga t cct gt cgat ggcgaacgcc atctgcaccg tgaagaccga t t ggct t gat gaaggcttag acgtggtaaa ggccatcgag aagcct gt gg ttgttgccga ttgtggtcag accat cgacg aggact gccg aaacccct t c ggaggagatt gaggacgaga ggt gcaaaca gggaagcacg aaggt t ggat ct ct ct t ag gccagcccgc agaat t t cag act t caaggg tcaccgccgg at t t cgagag cgaacggat c t ggt gt t t gg cat cat ct gg gggaaggat c agct ct ct gc at ct aagt t t gaacggaaca gaagcacacc t cagt t ct t c gcaggt cgt g aaagccgacg 120 180 240 300 360 420 480 519 <210> <211> <212> <213> 528 3065 DNA Arabi dopsi s t hal i ana <400> 528 at ggat gaag ggt ct cct t t gt t t t t ct t c t gct t t gat c gt at gat ct g t gagcat gga t gt t cccgcg ggaat gact a tcaggggaga t gat gcct aa t gt cact t ac gct t gt t cgt t ggaagct at gacat t t t aa ggt t caaaag t gagaaacgt agt acgaggt ccgt cgat gg t ct ct t at t t cggat t caaa agct caggt t gat gat t gaa at ct gat t at ct at ggt gga agagaaggct ggt at t at ag t aact ct t gt accct t at t c gt t t t cgt ca aat at cat t t gt gccagct a cgagccggca t at t gt t ct c t gt caaggt c t caat t gaac aaaacggat c t t cgat ct t g aat gt t t t gg t ggt t t t gt g gaat caact t cct gagcat t ct agct gcca ct ct caat gc at acagat gt aaggaaaggt t gct gat aac ct cct at gga agt t t t t cag ggcaccggt c t cct t cct ct ct t t gaat t c t cggt gagat at aat gat ga ggct t t t aca gat gt gacga ccct t aat ct t aggcgct ag t t t gt t t t gt cat t gaccag t acaaagt at acaaact t ct t ct t aaat t g ggccct gaaa t t t t gt t caa t caaggagt g ct t t ccccga at gat t gaat t agat t agat t gagat t t ga t at gt acat t gt aggt gct t caat cagct t ccgggat t ac t t t t at gct a t t t at gat gg t t gt cct t ca cat t t gat gt t t gat gaaaa t ct t ct ct ca gaat acgat g t at cct cagc t t cgt t t t t c t t cgcat aat ct ccccgat t t t gt ct agt t gacct ct t ag cacat ggaca tggaagaagt aat gt t gaca at ct t at caa gaaat ggcaa aagct gt t ga at t t gt t gct t t t t cct gca t gggcat at t agaaggaccc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 aaaaacacac gatggaatgg atttgaccag agttacaaca aaggaactga ttgcgtaagc Page 462 1020 12689250 Sequence Listing.txt t t gat gt t ct t ggct aacca ct t t at t ggt t gat act gt a t t gt acat t t ct at gcggaa gt t gggagaa caagt t t t at t t caggcat t agt gcaaggt agt ct gcaag gcaat gt t aa tctgagggag aaggt aat at acgtcaggag t gt t cat t at ct t ct t gt aa t gtt gt t t t t ct cgggccat aggt cat cat aact t t gat a t t t t t ct t t t gt gct at aag ccccct t tag tgt t t t gtca ct ct acact c gaaaat t cat agcct ggaac acgagcct gt ct t t agat t t t t ggt cct t a gaacat gt at t t gat gcggt ttct t atgt t aat gcct t ct t t t at gt gac cacgcagtgg at gagaat ga agt t ct gaag t ct ct t gcac ct accccagg at gat t acat t gcacgact t at t t t ctct t aggggaatgc t ct t at gcca agact gct aa t gcagt t ct t gat cgt t at a gt t ggt t gat t gact agact t at at gat ga cgccat t at g accccagaaa agccagtggc t ct gt gt t ac at t t t ccttg t ccat agct t t at gt aat ca t gacaggt at t gcat t t gt g tgatct t t t g caacgagcca t at t gct gcc cagagct at g accct aat ca ggaaagacat agagt t t at g act t t t gcag cact t cacac aggt gaaat t cagct aagt t gt t t ccaagg t t t t t ctgt c t t t t gat t ct agt gct gt ct t gt ct t cat g accgt aagat ggt agagt t t at gcaaaaag agcat gt t t a aagt t gct gt t gccat agt t gt ggt ct t gt t t aat cacat agccacccga cagt t ggccc t gaaaggact at aagct cca gt acat gt ag gaat t gcaca ct act ct ct c gt ct t ct gt t t ct acagat g ggt cct gt t g gagt t ggaca t act t t aagt at gct acaac ccggaaaggt ct caggt t t c gt gt t act ga gaaat at ggt gaat gaccaa t t gt cagact gat at t ct t g aacat ct cca at t ct gacct t ct gt gact t atggtggcac acaact ct t t t gt at t ggag gacgagggag at t gt gt gt g aat t t t caat t cat gaaact t t t at at t ca t caggat t t t t caaacaggt ttccgaacac gcaaat cgga t gaaaagt t g t ct caat cat ccggaaaat g t acacagaat t ct t t t t t gt gt t cgt act c cagagact ga at gagat at t act gct t t at t t gat gggt t acact t t gag aaagt t t gt g ct cgacat t t ctttcaagtg aggatatacg ct t gat ggca cat ct cgacc tatct t t ct t ct t t aaat gg t at at t t at c t t gcct t ct g ct ct ct ct ct at at at gt t g gt t t t ct t t c gaacgct gt a gt aaggt t at accct t cat a t act gt gt cc t gt gact t t t t t cacacact t ggaaaaat c t aggaagat t caat gat t ct t at gt aagt t aaat acct ag t t t gagat ac aaagt aat at aat t t cgccg t ggat ct gat ccacaacgt t t aaccct caa ct t cgacat g t t caacggt g t t aagt gt ct acaact gt t g aaat t cgct a ttcacaacaa acact at t ga aacccgcct t gt t at aacag ct ggct agct ct ct ct at gg t at gact t t t ct t ct cct t g aacaaacct g t t gt t gt gca ct cat at at a t ggt gt aaca cct gccgaac at act t at cc gccgt t ggcc t agt t ct t ct t att cct ttt ggcagggt t g cact cagt gc t t t aaact t t t gcat t t t t c gt t t t t aagc t at t caggaa t agt gat gt t gt t gt gaaac gct cccaagg accgaact aa t at gat agat agcgagat ct aaccct gt cc cggatgtgt t cggat at t t g gt t t t gt t t c 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 gcagcaactg gacctaagtg ttgacctgag tgccgcaagt gctgcagagg Page 463 12689250 Sequence Listing.txt aat ga <210> <211> <212> <213> 3065 529 1850 DNA Arabidopsis thal i ana <400> 529 atggcgaaag at ccgatt ag cagat ct ct g tccgt t ggat aaagt t attt t t t aagt t gt aggacaaat c ccaacctgtt taagatggag t gct gt ggat t gaat at t t g gt act ggagt ggaaggatgt agcatgccgc cact gcat cc gt t ct agt t t gt t ct t gttg t caat ccct g cagat ct ct g aaccactcat gagaagcctg cttaagaccc tctgtaggt t aggcgaggaa actgggtcct gt t t t t t t gt gt t cgt t t cc ct cctt ccct actt acaaac aaccagt t cg at cgt gt aac cat agt at ct t caacgct ga cat ggat cga gact t t agat ggatat gct c atcctccaca ttgatcgat g at t t gcat t c at gt gt at t a caat gt tgct gat gt ccaag t cct aact gc aat ct t ct ct acgact ct ct tggccaaccc aaaagaacat agaggt t gag cct cacagt a tccgtgagct t t t ctct t ct ggacggagaa gt t gtctagt t ggaact cca aacagtt aat at gggagt at gt aact t gt c t caaat gct c tgtgctcgt t tt ct agat ct ggt t gt t cca ccaacgatt g act ct gat t t ct cct cagct t t gt acct at t gt t ggat at ct gct t t ccc cgt gt cat aa t t at caggt g gt t at ggt t g aat gt t t cca aaggt gat ac gt t cat t gga tgctcacaca ggcaaacacc ctct t gt t t g cgtgccagt g cccagatgt c cgtcaaggac ctggaagcaa t t t atctct a gcgctctct g gaggt t cat a ct aat at gag act ccgat gg gcaatggaga at caat t at g actggagct g ctggctagt g t agat caat g at t gagt t gg agaaaccgtt act gaaagt a gat t gcaagg t cct ccagct t ct t ct t aaa gt t gt t t t gt t t gt t gct ac gtggt t t ccc t ct acaagt c gatctgt t t g gtgtaacaga t t cgct ct cg aacgcatt ga acaaggct t g t ct gat gt t a aaccat gct a gatgcatggt aat ct agttt cagt t caaca ct gct agct c at aaaact ct tt ct t t cttt ct ct t acagc ctggagtat t aaat cct aga caggt at aac aat cgctt gc aaat at aggt t cgcat at ct gt t t at at ag act at t gat t ggt at cat gc gctgaagct t ggtagt t aag ataagtggga aact gat gcc gaggaaagaa tcaggctgct ct t t t t aagt tgct t t gtgt t t t t gt cttt tcctcaagga accacaacag agaacgtgat aagtgcagac at gt at aaca cactt aaat a acgtggagct tgct t gtgac t ccccgt t at gatt caaat t gt t ccat cag gt ccaaggt a ttctgcagac t t t cat ct at t gaat ct agt caact ct t gt aat t t t agt a t t ggat agag gt gat t at gc t t ggt gct ga t gaacggt gt ggt t t t gttt aaggagaaga gt t gagggat ggt at ggaga gcct t ggaga t aaat gaagc t ct ct at t gg tttttttcag at t t gcacca ggct t t ggga cat ct gggga ct cgt ct gga t gccat agaa t at t gct cat gcaat cat ca cacat ccgt g t t gcagt gct gt cagggt ac gact t at ct a aat at gacgc cct aat gt gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 Page 464 00 12689250 Sequence Listing.txt attttgtttg gttaggcctt ccgattgatg aagtatcaag gaagaagatg gatttgactg 1800 cagaggagct caaggaagag aaggacttgg cgtactcatg cctctcttaa 1850 <210> 530 <211> 32 <212> DNA <213> Arabidopsis thaliana <400> 530 cctcagcaaa taagaggacg ataaggatcg gt 32 S<210> 531 0 <211> 33 0 <212> DNA CK1 <213> Arabidopsis thaliana 00 0 <400> 531 0 gtttaaacgg ctgattttcg tagagcaaac gag 33 (N <210> 532 <211> 28 <212> DNA <213> Arabidopsis thaliana <400> 532 cacgcagaat ctcaccactg tcccttat 28 <210> 533 <211> 26 <212> DNA <213> Arabidopsis thaliana <400> 533 tcggagatta tcgccggaaa acggat 26 <210> 534 <211> <212> DNA <213> Arabidopsis thaliana <400> 534 tacaaatcca aagagattcc agatg <210> 535 <211> <212> DNA <213> Arabidopsis thaliana <400> 535 cgtgagatct ctatcagact gaaga <210> 536 <211> 1371 <212> DNA <213> Arabidopsis thaliana <400> 536 ctctttattt gtcgtgactc gcgaacccct tttttattaa cgttttagtc aacacaacat ttcattaatg ataattctac tactattagt ttgcaatgtt aactaaactc tttttacgtg 120 Page 465 12689250 Sequence Listing.txt agaaaact t a t at aacagct aat aggt t t g caat at t gat t at at t cagc t aat t agt at t t gt ccagt t gt t aagt t ga caacaaaat c t ct t aaact a gt gt ct t t ac t at at acgga t gt gaaat at gt acaat gt a cgt ggt t at t aagat t t t ag t t t t ct t ct a t aat t cagct t ct cct t aat ct aagat t t g at cct cct ag agat t at cat gcat at ct t a aaaaaaaat c gaat gcgt ga aat cat gt t a t cct aagaga gagaatggt t ccggcacat c ttact t t ct g t t t cat gt at t at t t agat a cgact ct cat acact gt t aa caaat t ct ag ttcat t gttc t cgct ct aat aaaaat at t a at aaagt at a cacaact aca accaaacat a t t t ct t caca ttccagacca at acggat t t gt at cacat a gt ct gcaaaa gaaagt at t t caccaat cag t t caaact at t t gt ggt t aa gat at aat t a t t act at t t a gagat t acac gaaat cct ac gcaacat at t t at t ct t cac t t t aat t agc aggat gt t t a t ct t gaaaat aacgt t at ga agaaaaggaa at t t t gt at t caact ct ccc ccgcaagt t c at gggct t t a cct t t at t t t tttaacccca t aat cat gt t aaact at t gg t t t at aaaat at gt t t cact at at cat at g gatagagacg gacat ggaaa aacaagaaaa acgt at t at a ct gaat t at t cat ct cgaaa t gaat t t aaa gat t t t at t a ccaagt ccat at gggt cat g at caat at t a ct caaaacat ct t gaaaaga t t gt t at at a at t t gaaat c t t gagt gt ag aaaaaaagt a gaact gaacg at agt t cgac ttttttagcg cgt cgt t gaa t acaaaaat t tttaaaaaaa t aat agt aca caaagcaaca gt t at t t t t a t gat gct aaa t at aat t at t ccgacccaat aat t cgt t t t aat caaat ca act t t ct t at caccat aaat t ccat caaag aat t gt at ca t at aagcaag agcaacaat a at ct ccgcgc ggt t t agaat aat t t ct aaa aaagt ct t t t gat t aat ggg aact at t t t c t ggt caagt t t at agt at aa t gt t aat gac ct acgt acgt t caat gt t ac ccgactt gt t cgt ct t agt c t cat agt at t aaaacat t aa acggccacat 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1371 <210> 537 <211> 1400 <212> DNA <213> Arabidopsis thaliana <400> 537 acttccacca gaaaaggcga aaccaagagc ctactcttca ttcttcaacg tatgcaacgt tatcggcaaa ttttagcctt acacaaaaga acattgtcct tctcttgtca acaatcttcc at gt cat at a gt at aat ct a t t act ct aca ttatccaagt cttttattag atgaaaatca gatttaaagt gatttgagtg acatacatta aaatcaaaga tgcaaatctt atagttttag tcctaaagtt cactaaaacc atctcaaaac t t t gaat t ga aact t gt at g cat aat aaag t ggt t t t aag ct t t gat t ga act gt t at t t ggt gaaat t t gt gagat t t t t cat caaaac Page 46E at agt caaaa at t gt gcat t t at cat ggcc at acat at gg t gacat t ct t t at aaaaaaa gaaggaattt agagaatgt t t aaaat cact at aat t gct t t at cat t t t t t t t t t t gtt g at ggt t caga at t ccgat t t at aaat agt g ct t agt t aaa aat agcaat t ct caat t cat 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt ccttcaatac agcccctaaa actgagaatc attttttgtc aactaaaact cat t t gat gg cat ggt acca accaagattt agaagaact a at gt t t at ct cat at cgt ct agt t t t agga at aacaaaat aaaacgaaca acaaaagt ga accaagt cca at acat aaat aat agacaaa ct ct caacca aat t t gagt a agat t t at ag at agt gacaa gccact gat c ct agt aagat at t aagagt c aaaagacat a at t t t aat gt t t t t t gt t t g gaaat at caa caat at t cat at gt accccg t gaccaaact caaacaat aa gccgccgggt t gat at t at g t at gcgact t gact at at t a aaat at caaa gaat t t aaga ct gt t t gct c t t gct ccat t t t caat cat t t t aat at aat ct t gagt aag ct at acaaaa aat ggaccca cgacat ct ct ct ct act t ag gagt agt t aa caaacaaaac aat gaaaaat cat at agt t t gacat t acat gggaaat cat agt t ct gt t t aacacgacca aagaaaaaga accaaaggct ct accacat a ggccggcttt t t t t at caat t t aat ccaag aaacgat t ca gt acat gt at gt at t aaaca agaat t gt t c aacaagaaaa aaagt caaac cat t t acat c gaaat t cat a cacat agt ca gt cacat ggc cagaaact ct t acat t t gaa gact t gt ct t agct t aagt t t aaat at aga gt t t aat aga aaaat at gaa t t gaat t t ag t cgt t t cgaa cact t at ccc 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1400 acccaacatc tacccctata tatacctcac cacctttgcc <210> <211> <212> <213> 538 1307 DNA Arabidopsis thal i ana <400> 538 cggat ggt t g t agcggt t t t gcat aaaagt aaaacat aat t t at t gt caa t ct acaat ac at at aaacgt caaat ccaca t t ct aaact g caatgggaga ggggatat t t t caagt t t aa aaaaact aca aacat ct ggt aagt t aaaag aggt agt at g cgt ggat at g gagt at gaga at agat caat gact caagt t at caat ct at aaat aacact at at t t ccat aaat at t aac t t t gaat t t t t gct ggaat a aggggaaat g t aagt t t t t t ccact t agaa aaat at gt at agt gaccgt g gcgct t t t gg gagaagat t a aaat t gat ga t ct t caaaat at agagat aa t t t t t gaggt at aaat t t gt aaat ccgt ag gaaccaaaaa t agct t t gat at t at gggt t gt t t gggct g agagt cacgt caaaat gact acgat caaac at t ct t cttc aat agat at c gagt aaaaac at caagaggc agact aagca aat act aaat aaat aat at t t at t t ccat t aaaaaaaaaa gagaat at t t gaaat t t t gc gcgct at cgg agt at at ggt aaaaagt agt gt t ct ccaaa t gat cct agc aat cct aact acaaagat gt gct t ggaat a t aat t t t t aa t t t ct aaaca t t gt t agat a at t aaat ct c gat t t cat ca at at t t t gt a aat caaaagc at cct t t t ag aat t gt caac gaaat at t at gaaat cgat g cat t t aat ct aat at t caag t t agaaat aa agaccct t at aat agaaaaa t gaaat gt t a at gt t aaat t gat t t t gttt agat at ct ag t ct ct gaaaa cct aat t t gc gct t acat t t ttgat t t t t c gt at ct aat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 tgtttattta ccaaattaat gctataaaaa tgttcaactg tacaattggc atggaataat Page 467 at gaacat aa gt t t acat t t aagcaacaac ttaaaagagc aagcact at a aat acacaac at cat acat t t ct t t t gtgt t cct ct t ccc t cat cat t aa t at acactt g acacaaagca 12689250 Sequence at t aagcact tttgcctacg t caatt gtt a gct caaaccc ggacccctaa caaatcaact ct agct act a at t at t ct t a tatcttccat tagtttccca cacacttttt ctttctttta Li st i ng. t xt aagggatacc aact t cat t a aat t aagtgg ggaaagtaag aaact caat a t caaaccat t at caat cact gct t aataca ccacaactac aaaacattcc aacccca 1020 1080 1140 1200 1260 1307 <210> <211> <212> <213> 539 1324 DNA Arabidopsis thal i ana <400> 539 t t ccct ccaa t aagt ct ct c t t t gt t t at t gat gt t acaa ccacaagcag aat ct aaaga t t aat t ccca ct ct gcatt c aat gaaact a gaat t gaaaa t t ggaccact gaagggctaa gct gagat ca gggaatgaag t cat cagagg t t t t gt t t at aggt at aaaa aaaaat gat g gt gaaat t t t t ggtt gctt c ccaacgaat a aggat t ccag cagc <210> 540 t gt cct act g t cat t at caa t ctt gt gtt g tccaaccgcg caagct at t c agt t gct gca gt t t gcaact aat t acacaa aacagcaaga agat ggtt ca gt aat aacaa t gt cggaggt t gacagt agg gat t ggt gaa cgaaaaacaa at act t gt ac gcccat gt gt caat t ct t t c t t gt cccat c t agt agat ag act ct t caaa cacaagt ct c t ct cctt ct c aat t cat ct t t gt gt gat gc t att cgacaa t ccagaggt t aagaaacat a ct ccgcacca aat gt t t cac t aaagcat ct agacat acca cagagact t g t gcagagt ca tggaagaacc gct caagaaa caacaat gga at ct ct gttt t at t gaaat t t t t ggagat g t t t gt t ctta ct t t ct t ct g catt ccccac t t cat cact c t gt gt gt t ac ctctgt t t t c agt t agagca caatgggaag t ct cagt cac gtgacgcaga t t ct t ccaaa cagagaagt a gt gct t ggag accacaccat gcaagagtcg at gaaagcag aagt gt gcct t cgt t gaaac ggat cgt t gt ct cct agt cc gggt ggat ac gt gt ggat gt cccaat t gt a ggaaacaaag ct act t ct ca aaaccaacaa cat ggt t t t a t t cct cct ct agggatgcgt aagt t t ct t c aaagccaaaa gagaaggaga ct t agt caaa acact t t t t g agact gt cag ct t t agaaga t gt t cagt t g tgaaagcaaa tgt t t gt t ca t t gt agt gaa t aat t cagca at t agagaag t t acaagagt t at aacaaaa cct t t t gaga at t t ggt t t a t caaacct cc gaagt agt ca ct t caccat g gaat caat cc ccgat t t cac gcaggtgcag t ct aaaaccg cggct t cgga gt aagt t t ag t at t at gt t c gt act t caat caactt gaga tagcgacaga ggcggtgaga aggt gt caat t ggt aaat ca gcaat gagt a gt agat gt aa ct at at gaat t at gaat cat t gaaat ccca at aagt t gaa t t at aaat ag aagcacaat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1324 Page 468 <211> <212> <213> 12689250 Sequence Listing.txt 1276 DNA Arabidopsis thal i ana <400> 540 ttgct t gttt t t gcccat gc t cat gt at cc t t aat gt gat ct t t ctgttg gt acct ct cc agt t at caaa t gt t cat cga tt gt agaaaa gt gt t ct t ct aat t gcaggt gt gagat ct t t t acat acgc aaagct t at c aggt t t t t gc t t t gt t gact at ct gat aca gat at gct ac aagt ct t gaa cagt cagt ac caaact ct t t at cact cact t ct gaat ct g t cct t ccagt t t acggt gca t at t cct at c gaaat t t t ca caact aaccg t t at gaat t t t t t t t cattt ccccat gt cc t ct t at t t ac gagcaatgt t gaagat caaa ggaaaaact g agaat ct ct t t cgaat aat t t ct aaaact c ccaaccaat c at gt cacaag at ct ggaaat cct ccaagaa ccat at aaat gcat ca t gcgt gt ct t caggt t gcgc t cat ccgt ca t at t aagcca t t t ctgattc gccaaat gga gt aat agt t c t accaacagt gt t gat gaaa cacat t agag gt t gt t ggag t ccaaat t ct t at gt t at at gt t act gcat t ggcat t ct t cat cggccct at t aagat t t aact ggt ct t ct at t t t gca cat t aaagt t act ct t t aac t t t t gaaat c agat caat t g aat gcgct gt cct ct gcat g at t t t gcat c acagct t gt c t gt at at t ct caacaccacc gcggaaagtt gaagat at ga t gacaact ga t cct ct at t c at ct ct t gac t at t ggggt t t t gct ccat g t gt ggcat t g gggt t t gaaa agct t t ggt a gt aat ct t gt agat gat ccg act gacacaa gacagcgcac t gggcat t gt t t gt caat t c gt t gagt t aa ct t agat gag ccccct ct ac t at ggaact g ct ct cagacc ggt gagt at t caaagt gact caaaaagt aa ct gcgt t t gg tcct t t t t aa t at t caaagt gaact t gacc t t aat gt at g t ct gt ct ct t gat aagact t cacaacaacc acaaaacct c agt t t cat ca t ccaat cagg cggacgaccc gt aact aacg gt at agagat caatggaagg at caact gt g gt act t act c caaaccgt t g t ct at cacct gaaacacaca t caagaat ga t t t gt gcat a cccaagagaa tgaagacaca t t ct ct t ct g t at gaat at a ccgt ggat ga gt ct t agagc at aacct aat tcaacaagac ct t t ct ct t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1276 <210> 541 <211> 1264 <212> DNA <213> Arabidopsis thaliana <400> 541 ataaacggtg atgttaatgg gcccaaaggc agtatataac agctaaattt tcaaggaagc cctttgtata aatgactcgt cgtgttcatt tcaaagagtg gtcggggaaa agagtctact tgtgatagag acttgtaaaa aagtgtatgt gaaaactagg aactggattt actgtaccct t t ggcct t t t aaact gagaa gat cat t agg aaccat t gct agaaagagag t ct ggat aat Page 46 c agcct t acaa gaaaccacat t t cggat t t a ct at ccacat cat t t t t gaa gat ggt t t ga aaggccttt t t t t gt t gtt g tcgaggaacg gt t ccgat t c gt t agaaat c gat gt gt t ga 120 180 240 300 360 12689250 Sequence Listing.txt tttttttttt tggcgtgttt ctgaacaatg tgtttactaa gttggatatt aaacagct ag gt t gt t ggag t gt gt gat gg aaaat t caac gt t ccat acc ct cagt ct ca t ggt at at gc gat t t t t t at aat t cat aaa t t t t t gt t t g ct at t t t aag gtt agcagac gagt t gaaac t t gt ct ct t c ttga <210> 542 at at t gaaaa at at accaaa cat t t agt t g cat t agt ct a ccct t t t t ga t at t cgat gt cact at cat t t t t at t t ct a cgct ct ct aa t t t agaagt c cat t t gaagt t t t at t t t t t t t gt at caac cat ct t ct cc t t t aggaact at t t gat aat at ccaat t ag acct cact t a acaaat cat t gagact t t t a tgctct t t t t t at at at aac aat t t t aaat gaat t at at t t t t at t t aat ct t aaat t aa gat t caact a t ct t gt caaa agact t act c at caat at t c caat t act ga ct aaaat gca t t gaccat t t acaaacaat c t ct t t ct cat t at at at t t c acaat t aaat act at cacca t t act aat at at t t t gt t cc ccagt t t aaa gt t t cct caa tatgt t t gt t aat t t cagca t ggaaagt t t accact at t a t agt aat t t t ct ct t agaag aacgt t t at a aaaat t at t t caat ct t caa at t aaat at t gaat gaat t a t agt gt ggac aggacacat c t act t caat c gaaaagaaaa gaggact at g at agt t t ct t t t ggct t act t t t t agtgag t ggact aggg t t t t accagt t t t t ct t t ct cccct t t t at t at at gt t t t acat agt cgt aat aaacat t acat ggat ag t ct gat ccat t t gt aacccg 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1264 <211> <212> <213> 1331 DNA Arabidopsis thal i ana <400> 542 ccaat acat t aat at agat g ct gaat aat t gat cgat gga cagcaaact c acaaaat aat t gt aat caca cacat agaac t t t cct t t ct t ct gact at a aat gat gaaa gt caat t acg at t accgat t t t at t acaca cgaacacgt g t ccaact t t t caaagt t cca t t ccaacgat t at at t gat a aaaagaaaat t aat t t t ggg at cct aaaat gaaaaagat t cat t aat t t a aat aaaaggg at act t agaa t gct gact at t act t gt gt g at t gt t cgt t t t t t t cgggt aact agt at a t cgat acaag t t t ct at t t t gat cgat caa cccaat cct a agggt t aaaa agt at at ggc ttttaaaaaa at accat aat aaagaact at at gagcaat t gaccaacat g aat t t t ct t g gggaat at ag t at t aat aca t at t aat gaa t t aat t agcc agagcat t cc t t t t t caaat tgtact t t t a aaat t at ct t aaaact t aac ct aaaat ct g at at t t t t gg at t acat act at t aat t t t a at t ct gt aag acgt ccagct at t gacgat a at aagat aac at gcgt t gca at t gaaat t t gt aacat gct t ct at t t gca t t agat aaaa agat at at t t acaaagaaaa gt agggaagt t t t at t t at t t at t ggccat agaaacaaaa t agct acgt a aggt cat aag acgat t gt ga cgat caat t t aat t ccat cc at t acat agt at t t t gat at gat ct t t t gt gcaaat acaa t at acaaaaa tcaaaaacaa t gt acaacaa at ggt gcgt a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 gtaaatgtta taataacttg aaattaaata ataactaagc tcgactcgat at atagatcc Page 470 aaccagt agc t acgat cagg at ct at at t c ccat cagt t t at agt t t at a agct agaacc at ccaaaaaa gt caacaacg ct ct ctt at t t aat ccccct t gaaaat aga gact t t t gat ccaacaaacc ggat t agt at at gaaaat aa 12689250 Sequence cacacctaat ct t catcttc ctctctatct atctttcata tcaatcaatt gatcttttcc cgttt aaggc t cgagagaat catttggtgt tgaccagctt taatgttact tgtacctgtt caaataaacc atttatggtt Li st i ng. txt at ct t cgcat t at gt gt gt a t at ct caat t t at cat t cac t caacat aag cat agt act a at cacagat a t cat agt ct c t gt gt aaact gt t t t cacaa t gt agt aaag t at gagtt ag accaaaaat g gat aaaagaa 960 1020 1080 1140 1200 1260 1320 1331 <210> <211> <212> <213> 543 1344 DNA Arabidopsis thal i ana <400> 543 cccaaaaaga gaaaaaacca t t t t cctatt agacct at t g t ct gct t t aa at at at aaaa cacgact aat ct cgt t cat a cct ct at aaa aat cgat aaa t caat at cat aaaaaaat t c gt t t gat t at t at t at t at g t t at agaaat t aat t t at cg t aat t t at ag accaccaat t gt t gt t aat t aagt gt t t t c ttaagaaaaa ccact aaaaa at cgggt at t aaaccgaaat tcgaacacgg at aagt at ga aacat gaaag t t acat aat a t t at at agt t t at t gat at a t t aat aaat t t t aat aagat aaat t aat aa t t t at aat at agt t t t cat a t t t t act t gt t t t t aaaat g at aaat t aat aggt t t t act t at t agagt a aggaccat at aaat aat caa t caacaat t a ct t acat t t c t aat agt t ct cgaaccattt at ggt t cct a gt t t t at caa t t t t t at t gt cct gat t t at aaaat act at cagt at aacc at t ccggt cc aat at t t t t t t t gt at aaat at t t at t at a t t t t actgt a aact ggt t ct t t accaaat t acct ct ct aa gt aat t cgt a cat at t t t ga act ct t agaa at cat at cca aaacat t ct c t cccaccaat aaaaat t t at t t at at t t ac ccagat t ct g t gagcat caa at aat at gga t agagt ccat at t t gat t at t ct at aaat t cgagt t agga t t gaaaat t c gt at caact a ct t t t t t gct t caaaact t t agaaat at at aggaaaat ct at t aat aaaa aat t t t gaga t t t t cgat at at t gccagct t at aaaaat a at t t cct cct ct caaagcaa acat gat t t t cct at t agat ccct aagt t t gt aaaagt ga t at cgaagag at aat t t aca at t t t t agt a aat act ct at ccagt gt aaa t at gt aaat c t at at at at a t aaat t cat a t agt gt t gt t at t ct at aat ct ct at aaat t t t cgcagt c gtagt t t t t a gcaat ct t gt aat t t gaaca at t agt caac aat gaat acc at t aaat aca at ccaaact c t t t at t t t at ct t aat gt t c gagt ct t gat gct aat t cag t aagact t t c at cgt gaaga aaat t aat aa aaat gacat a t at ggt cct a t at at t at gt t t t gtt ct t t t ct aaacat gaaggaaat c t gat aaat at ccaacat t ac accaact t ac t accaact aa at gaccat t t t aat cccct a ct at aaat ac ct act act t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 Page 471 12689250 Sequence Listing.txt ttgagctttt aaactacaca aaca 1344 <210> <211> <212> <213> 544 1272 DNA Arabidopsis thal i ana <400> 544 cat t t t gaat acgcat t gat t gaaat at cg gt gat gt t gg t cagagccat aaat t gt cac cagt at gt aa cact agact g caagt act t a t cacat t cca at aagat gag gat aaact t g aaaaggt t aa t t aat t at at t t gcgat t aa aacaat t t t a cat t acagct t aact cgat g t ccacat t cc gaat aat cat gacat t ggt t at agt at cat gcgaaat t t c t t ccat at t c ccaaccat t a t caaagt t ag t at aat gaaa aat t t t t aat at aaacaaaa cgt t gt aaag ttct t ct t ct t aacat gcca gcat gt aaaa t gt aacgaat agccagacga t acat t t aga aat ct ct gga cct cacct t a t aacagt t t t caaat cat t a t ccagat t t a ggt cct gact t t ggat aaaa acat agcat t ccat t aaaaa agct t ggt t g t aat t t aat c t aagt ct t at t aat t aat ca t t aaat t t t a t gt cacct ct ct gagt t gt c at acacacac tt gtt gt t t t gt t gt aat at aaaaat aat c t t t act t gat gcct cct ct c t act ct t t at t t at at at t t act t cat at g t t aagcat t g agaaaagatt t gt aaaat t t t t ggat gaac at aggt t ct t t ct t t gt gt g cct at agt ct gt t aat t at g aggt agat ac cgat ct t gt t aaagacaagc at gt cgt agt gt t at aat at ccacagcct t act at ct cga aat t aagaca aaat t gt caa t t t aat at at aact agccca t ctt gccaag gcgat gggt a cgt acgcat g agaat aaaat t gat gaacag aagact aaac t aaggt gt t a t t t caat aag acaaat t agt t aat cat act t t aagt agt t at ct at acag aaat at acac at agat t aat gat aagct ct ccat at agac aat at agaac t at ct agat g cct t caacag aat t gt acca t aaaat t t gt at gat at t aa aaaccaat at cgagt t t acg gtt gaaccag agt t cct cat act ggact ac t t cacaat t g caacat ccga t gt aaaat ga ggt gagat gt t t at t aaaaa ct t t t t ataa gcat gat gt t acat gcagt g caggccact a at t aaaat ac gagt gt t aca at cat cat ca t acct at caa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1272 120 180 240 tttaaatttc tctttctatc tactataaa agtgactctc taagaactcc aaagattaga acat t gaatt ga <210> 545 <211> 1400 <212> DNA <213> Arabidopsis thaliana <400> 545 tcagtatctg aacccgcctt gggtattcaa ggacaatgat aatgatatgt ctcctgtttt gaatttaggt cgatacaccg agccagataa ttgtcagata agtagtggca cggcaatggg tgagttcgag ttatctgatc accatcatca aagtaggaga cagtacatgg aagatgagaa cacaagggct tatgactctt cttctcacca taccaactgg tctctctgac ttgtctttgc Page 472 12689250 Sequence Listing.txt acgattc tgcaatatct tatctttttg atcagagaat ct t ct t acaa tgaa t at t ct gt t a tgat t t gttg acat caaggt t aacat t acc caacat ct t a ttccccaaaa gt t t ccaaat t at ggcagct aagt aagacg cagagct t t g taagaggccc act cgt ct cc ct cat cct t c t ct t cgt t gt gt gt t acaac at ccaat gt c t at aat t t t g gcat ct t gt a t t t aact t gt t ct gct at ca t t t t atctcc t ccat at aat t t t t aactta aacact t aat ccct aaccac gaaacct caa t t t caaat at at gat agaga ct acagagac acat gaat t t tgaagaacaa gct t ctt cga ct t ct gt aac t t t gat gagc t ct at t t ggg taaagaccgc aaaat cacat gt gcgt t agg at aaaccaga at gaaaat cc gaat ct at aa aggct t aagc t agact caga ccact t t caa aaaact t gaa gatt ct t at c cct t t ct t cc gtgaagagag ccgt at gat t caacaaaaca att ccaaagg at t t ct t ct a t caat gct t t t ct t t aaacc gact t t gaaa t t t t at at t t caat t gt t gc aagt t at gag t cat cct agt cat agcat t a gat ct at cga agat t cgt cc ccat at at ac t t t at aaaaa t aaagat ct c aat at cat aa ct t gt aat t g t at t t cat t g t t agact t t c t t gt t agt ga cacaaact ag agat t at at t act t caat t t ttgt t gattg cagat aat gg at cagat t ga at at t agt ca t cat cagct t ccaacgaagt t t t at ggat t t cat cagaag t ct t t cgt ca t cat cat ct t at gggct at a ggct t t ct ct agct cct cca ct gaat cct t t ggat t ccat gt t t cggaat gaagat ct gg ggt gaaat t c at ct t gt tct ct t ct t t gt t ct t t t gattt tttctttagc aagcaggagc t gct caaact gaact gt t t t agct cccct acaact t aac gaaacagt aa t cat ggt cgt t aat gggccc ggct ct act g ct gaat cgct caat aat ct c tat t t t cct t t t t t gtt t ca aaaatgggt t t t t caaat cc ct gt t gaat g 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1400 <210> 546 <211> 1343 <212> DNA <213> Arabidopsis thaliana <400> 546 at gt gt gt ag cgaaaaccaa tgacaacgt t gtgtttcaaa gtgagatata gagagtcaca tccgaaaaaa gtcttttgaa tgcaagagat ttgattcccc tattgtgcgt tggtttcagg gtttgaagag ggttttgatc gtcagaacaa cattagctcc taaacgaaag agactaaata acaatgaaca attcaaagac atgaagattg tgaaagggaa aaagttctaa ataatgatgt caacaagtta aggaaactaa agagacagaa ttgttatcgg taactacaac atccttactt aat t gact ca agaagagt ac gt gaaaaat c at ggacat gg gctgcgagaa ct gt gaagaa aagaggt t t c t at agt t gt t t aaacct t aa at at t t t t t t t acact gcac gaaaagaat c t agagat gt g t at acccaac agccgatgt t agt cact aag ttttaccgca gat at t aaaa ct t gt t gat c ct t t t cagcc aat gt t gaaa aaagt aaaac gt t gt gaact acccct caag t aat at gaaa t t t att gaag cccaaacagc ct t gaaaaat t t t t caagt t gt t t gggt gc 120 180 240 300 360 420 480 540 600 660 gacaagagaa acctcttcaa tcttcatgtc tttaaattgt ttattgtctt caataaactt Page 473 12689250 Sequence Listing.txt agcaacttcc ttcacagtct ttagtctctt tcgtttagga gatactgttt cataataaac at cggct t t c gggt at t acc ct ccaggt t c ct t t gcat t g t caagcact a at t gat aat g agat ct gt t t gt gaacaaaa caaacaaaac act t gat t ct t cagat cct c t cgt agct t g at gcccat cc t t cacat ct c gat ct t aat a gcgat t at t a t t gat gt t ga t ct t agt aca gt ccacaaaa gcacacaat a ct ct t cct t c aagaaaaccc t t ct gacgat cgaaaccaac t t gcat t caa at gt t t gagc aggctttttt aat act caaa t acagaat t g caaaagcct c cggcgtcgt t at caat caat taa t aaaacct t t gcacagt at g aagacttttt cgaccat gt t t t t at t t ct a cat ggaagt g agaaacagag tgacggagaa t agaat caga ct cgt ct cct t at aaact t t gaat caaagt cggt gt t t t a ct acat at ga t cgat ct t t t gaat t caaaa agat gaaaaa ggaggctttt aaagacattt ggaaaacatt gaagggtttt t caaacacat ct t cgaat ct tgaacaaaac ttttttacct at acaact aa tgccaagagt aggt gt t acc ct t t at ggt c agggagcct c 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1343 <210> <211> <212> <213> 547 1329 DNA Arabidopsis thal i ana <400> 547 gggactggcc gagcct t acg caagggtgac ccgggagatt t ccacat gt g acgat aacat aacaaagcga t at at agaaa act t t gat gt t at gt t t ccc ct aaagcat t at t ggt acga aat ct t t t ct aat t gt t cag at t ct ggacc gt aat agt ag at t acact cg gt t at t t t gt at agacgagt t gt gt gat ca ccaat t ggga cggacggcat agaacccttt ggggtat t t t gggggaagag t t gt aaccga gcat gaagac t gacccgat c gt aaaat gt g gaagaacaag ct gcact t t t at t t acgacc agaacat gct t ggaat gt t c t t t t t t aat t gat t t t t cat t at t caggca t t t act t acc at t t t ct gcg cagtggagaa aaact t acca t t t gt aaat g t gggccggt a gact cact t c gt gagaacaa acat gagaaa gt gt t aat ga cat at at t cc cat t ct cagc cgacaaagaa t t t gat ggat at ggcat t t t t t t ctt aaaa gt cat gt gaa agcacaaact t t t at gat at ggct gt t cag t t t act at at agt t at cct c tagcaacaca t gcct t gt cg t cct acat t g t agcagat gc agagagtagt agt cgcat gc ct gct t ccag t agact t t ga atgggaacgg cgt t t gt t ga t aat gct cat t aat ggaact aat t t agt at acacct t t gt at acccat t t cagcagct t g gtaaaggaag cct t t act at gt t t ct at ga acact t t t t g ttct t gt t aa aacgat agag t t gat gat gg gggt aaagt t at gggaggt a at ct t at t gt t acacagt cc cat gat t gat t ct t ct t ccc gt t t gat t t c gagat ct t gt cat act gct t t acaggt ccg cccct gact t at ct t at t at at aggact t a gct gat cat t aaat t cagt t cagct gt t t c aaagacgagg aaagt t gt t t gtggagaggc act ct caaat t t t gcat t at at ggggat t a cagat t t ggt t caat gt acc t t t agt accc gat t acaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 474 12689250 Sequence Listing.txt ttataatgag agcacacaaa ttaagaatgt tagaaaccaa tttgtgaatc ccctgatgt a tacaaggcaa caaaatcaca cgaaataaat tgagagacca gacaaaacgc tgtcgttttg cgagggtttc tatggttgtg ttgttatctt ttttttttca gaaataaatt aatttagtgt gaaagagcaa aagcataaat agcatcgaga gagacactgt tctcat t cat tcgcaaattc gt t gaccat 1140 1200 1260 1320 1329 <210> <211> <212> <213> 548 1319 DNA Arabidopsis thal i ana <400> 548 t t t gt cacca gcaact at gt cggcagcatt gct cat agt a t aagat ccat ctct t t t t cc agt aggaat g aaagt gt ct c gt t t t ctat t ctt cagcaag at gaat gaag cct ccgaat c acat ggat gg aagat gt ct t gat cgt aat t cct cgacaac cat t t gt t t c t gt t t aat ga cagt gcct t t acacgt gt ca aaaaaaaaaa gaacgaaaaa aaat cagaca aat t aat at t ct cgt gct ct ccgt t gt gt c aat at gt aag acct cccggt t cggat ct t t t t t cagt ggt at ct acggga t t t cct cgat t agcagct at t ct gaat gt c gt cct aat cg t caccgt t t c agt gat at at t at aacgaat t t t ct t t t t t t t t t t t ctt a at t t acct t t at cacgaaac acagt gt ccc t ctt ctt gt t ggcaaagct g gagat at act cggct gct gt tccaacagac gcct cact t g act gaat aac ct ct cacgt c t caccat cct t t at t gaaga agct caaaac cgagat t ct c agat t gat gt at caagat ag at gcggaagt t t ggagt t t t gaccaat at t t aacat t aca caaaaagt at aggagt aagt cacaaccaaa aat t t gat ca ggcaaat ct c gct caagcat tgt t gct t t c t gccaaat ct t t t cat t gat ct t ct t t ccc agcat gt t gc caaagaggt a t aaaat gat a ggagt gt gat at cgcacacc t ct ct cgt cc ct cct ggcac t at ct t ccat t gact cct t a cgct t ccggt t gt aat agag t t t t t t cat a cat t cagat a t t t ct t t ct t aaacct t gt c aacaaaacaa cggcgagat c cgct t aaat c t gact ct gat t at ggt at ct gt aagt ct t a at cat accat tt gct acaaa gcaaat t t gg aagt ct ccat acct t t gt at at gtt t t t ca ggt act t agc t t ggaggat a at aggt ct ca ct t t aagcaa t act ct gat a at agt ct at t gaat t ct aat at t t aat aaa ccgat at cct gt ct t ct cca at t cat aaat atct t t ctta cct gt aaaac t t cat t cact tt ct caaaaa t aggt act ac at ggct ct t t at gt gt gat t t aat gaat gc ct t t ct t t gg at gt t ggt t t at cat caaac caacaccct t at ct aagcag agagt t t gag t t gaat gcat t gat at ct t t t t cgat ct ct act cacagat aat ggt at cg aaat t gt t cg at cat aaaaa tcggagaaga t t t t gt t cc 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 <210> <211> <212> <213> <400> 549 1305 DNA Arabidopsis thal i ana 549 Page 475 12689250 Sequence Listing.txt tgatggtcac tgctggagaa aatatgggca aaaagagatt catggatcca gt gat t at ca t t gct aaat c tcacacaaga t cgaagt gaa cgaact t ccc cagagcaat c at ct ggagaa acggt gt gt g cat cagggt c aaact gcaga t gt ggt cgt a aat t at agat at t at t t t t g t t t t cat t gt t t t at gacaa aat t ct t at g at gaaaacgc ct ct t ccat c t t t t t t cat t ccagat aaaa gaat ct t t ca tt cat t cat c ct at gt ct ct ct gt t t agca gt at ct cgga tgt t t cattg ggaagacat g caagaagaac gaaaagt aac t gcaat t acc ttcgtat t t c gct t t gaat a ggt cagt t t a ccgat ggat a ttttcagcag t t aat aaaga at t aaat t ga t t t gt aaaag t t t caaat t t t acaact t t t act at caagc gat agt cgga at ct gggaat gt t t t t gttt gtgaagcaag aaccacactt acgaacacga aaaccgacca at t t t cagaa ct t t t ct t gg agcgaggttt t cgt ct t t gg at t gat t t gg at t t aagggt aaaat t ggct ggt t t at gaa aaggat t t t a tagaaagaca t gat gcaat t t caagcagt g t gt ct aagaa ccagct t cga gaagacgaaa t t caaagt t t acagagcat a t t cagaaat c gt aacaacat acat ct t t ga agt ccgaaga cgttttcttt gaaat t t cgt t at ct gcacc acaat at t at t gaat gt aat t at t agt t at t at aggact a t t at gat t t c t at aaggt t t t t t caaat gt at t aggat ga t aaaaat t aa aagaacat at gaagagact c acgaaagtt c t aat t t t t ct t t acagat gc agacacagat cacat ccccg aggaaacaga agt gat gat a ct ct aaccat ggaagatct t t gct gct gt t cgat t t cgga gt ct ct ct ct t act t at t ag aagt t acat g cct at gaagt at t t gt gagt cacat t agt a acagat t aaa t t ccagt caa cgt cat t at g t gt cgat t t c cgaag aaaat cct ag t t t gct at at acgcat cgat cct t ccct t t aagacgacga gt acat gt t a agcct t gaag gagat t gaga t ct cct gcaa gagaat t ccg caggat t ggt t aat ggcagt t at agt aaaa at t t t ggcct t ggat t gt ac t aaagt t gt t t gat t gact t aagt ggaaaa aagaat at aa ggccatgagc at cat cat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1305 <210> 550 <211> 1358 <212> DNA <213> Arabidopsis thaliana <400> 550 ttaagtgatg tttgcaactt ttaatgcaac gaaacaattt aatttaattt aaatttggtg tgtcaat t ga aatgatagag agagacatta ctaaagaatt gt t ctagtaa aaat t ggtat gatttagatt agtgacaaag aataatcctt t ggt at caaa at ct gt aaaa aaaaaatcag atgtgacgtg agataataaa ttgatttgat at t aaaaaca ctaaaaagca aataaataaa ttgaagtcca aaagcaaaaa cctatagatc at t t t t t t cc ttttcttaac ct at at t at t t agt t aat t t caaaaat aca agccat gacc t cact t t cca t gt agccgaa cggtggacag Page 47( agcat at t t t t t gt at at aa gt gaaaaagt t cagaccat c t at t t cgaca aaat acaat a at t gtgt t t a t aagccgat g t caacagt gt at aat t ggt t aaacct t aaa at cact at t t at aaaaagat caagt at act t gt t aagt t c t aat t aacgc gaagt aagaa cat t t aat cc 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt ct at aaat ag t cgct gt t ca t cgct ct t ct t t gt t aaat c t cgaacct t c agat t ct caa at ccgt t aat at gat t t t ga t aaat gt agt at t at t gt cc t at cact gac ctgt t t t t at t gt ggaat ct t gat t t t t at ct cact ccct gat t t t gct t t ct ccgt ct t tgaaacgaaa t acgcct gt t gt t ct t aggt t t t ccagt gc t t t t cgcat c agt agt agt a at gaat t gt t t gt t ccact a gct at t ccac tttttttttg gt ggt t gct g t gt cat ccac t gaggct t t a at t gat t t cg t gagat t t t t at t at aat t a t at at cgt t t tgtgtagcag gat cgt gt ac t at gat cgt g agt t aagct t t gaat at cag aat gat t t gg t t ct ct t cct at t caat t t t aaat cgt ccc ggct ccccag agt t t t t agg ct at gggt t t gat ct gcgat gt gat t t at a at ct gct t aa t ct at gt agt t act gagcca gaaagt t cct aagaat cgaa aat ct t t ct t t t caat t gt a t ct t t t t g cgtctcgttt ccttcttcgc at ct ct aat c cgat gct t t t cgat t cagat agt gt gt gac cagat t t aaa t aggt t t at c agt agt agt a t aaat gagcc t aaacgt t t a t ct ct t t gga agct t t t t at aaaagt t t gt gccgcaggtt acgggt t t t g t t gat aat at t at t gaaat g acgt at gt gg ttttttgcaa t at gat t t ga t t cct cgt t a at t agat cct t gagat gcgt gt cact t gag t at at gt gt a 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1358 <210> <211> <212> <213> 551 1301 DNA Arabidopsis thal i ana <400> 551 t gt ggagat c taagacaaca t agct t at ga aagct ct aac t aat cacat t t t gat t at gt tat t cct t t t at t cat agca aact t t t gt t tttacccaga t at gat t gca agt gaaact t gaagt ct ct c t ct gact at c gt cct t cggt at t agccggt agt gcct gat at t gacat ca act t gcact a at gt gacaaa ct ct agact c t t acgcct aa gct at at gt g aagcagagat t ct t t t t gat gact t gt aaa t gagt agccc gt agat ccga agt t aggcct ccct t t cat t gagaagt cca agggt t t at c aaagat agca agcacacct c gaaacact ag aat t aaacgt t agct act at t gt t gt aat t tggagtcgaa t t at t t t at c t gct act t t a cgggacat aa aaacat ct at gt t cgat cct t gggt caat t agt cggaaaa ct ct aaaat a aaaaaaaaag t t gcaat gat t t ctggtgac t t t cagaaat ggaaagtacg acat t aat t t t cat ggt t ga tggaaacaac at t at t t gt t at caat t ggg aaagaaat aa ggt ct agt gg ccct gaaaac ggt t t acct g cat t t caaat t t t cggtgcg aaaaaat gat aat gt at gat t ggaaat caa cagcat aagt t aagct gcag t aat t t at cg t ggat at at a ggctaggagc t t gcagt ct t t t gt gaat t t aacct t t cat taggagaaga aaaaat cat a gt agt t agaa tcagaacaga t t t ct gccga t gcat gagt a gt gcaacgca act aat aagt at cgaagaga gt at cat ct c t cgt ggaat g t agat gt ggg t ggt ggt t gc gt t t ggagt g at t caagt ga t t ct at gt ct t t t agggaat t t t gt t t t ga atgcagccgg cagt at ggt a ggctgaccag ct t ct caat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 477 ct t cacgt t g cagat t t gcc agcccat t t a aaaagat aaa t ct at t at ac t ct ct ct ct c tcacaacaac aaat act at t gt t at at ggg t gggct t t aa aaacagact a cct ccagat t 12689250 Sequence Listing.txt ttgt t acatg cgactaaaca aat t atattg aatccatat a tctatttggt cccaattagt gatgtttata tggatttaat tctgttgtta aaaaatagcc catgtagacc cgtttatgga tttcgacccg gcccaaaatt acaacgtgtt caacaacaac cgtcgttctc ttccactcat ctgaaaacaa aatccaattc caaacgatcc gatccaaaac 1020 1080 1140 1200 1260 1301 <210> <211> <212> <213> 552 1368 DNA Arabidopsis thal i ana <400> 552 acccat t t gt t t at t t gt at gagagact at at t cat gaac cat t gat t ca aggcggtcca gtgt t t t t ca aaat t cacca aaacat aagt taacaaagaa t gaggacgt a agact aat t t t ccaat aaaa ct t acat at c tttttaacaa aaaaaacaac caaat agaaa t t aggcct ag t cggt t ct ag aagt ggccca aaaact gt at acaccacgt c gaat t gt gcg ct gccaacat gt t cgaaact gcaat gaacc at aagagt t g ggt gct t agg t gacgt t gt a t t t act t at c aaat t at t t a gaaaagact g t t at caat ag acct aaaaat ttccaagaga t at t gt aat t at ct agacaa gaagaaat cc aact gcat t g caact t caaa t acat agct a t ggt cgggct gaaat aaact cccgt t t t t g gt gaaaat cc at t t acgaaa ct ct t t t ggc ccct gaaagt t aat caaat a gt cct t ggaa agt t aggaca gaaagt at t t gcaaat aaga t gt t at at gt aat t aaacac t at t t t t aat ct ct at at ag t aaat t t at a tcaaaccaaa gt t gagat t t at aggaact a cagt gaat t t acct t aat cc gat ct t gt aa cagcctggcg cgt cgt at t t t t ct t gt act gat ct acgt c at ct ct gaaa t at at act ca t t cagt ct t c t aact ct t ct agcgacct ct at gt aaat t a t t t t t t cgt a aacacacact t gaacct acg t t act t at aa aaaat t aaac ttgt t t t t ga aaaaat at t a t at t t at t aa t ct t t at agg at aat aaaaa cat caaaat c aaaat gt t at ctcgtgaagg gaaaaaat t g acacacgtt g ct acacaaac t ct gt ct ct c cct ccgat cg t gaaact t t a t t at gt at ga caagaaat ga t caagt ct t c at aaacaaag t aagccgact aat caact at aaact cat ag gt gaaaagac at t t aaaaaa cgaat at gag aat acgt aat t t aaaat gt g gt t t t gt aaa at acaat gca cat t aaaaca agat agat at caaat gat t g t t at gggt ct t cgt t t ct ct agacaact t c t ccaat ct ct ttaacggc aaaaat ct t c caagaat cgc t at gaaaaag at t aat t aga gt ggt gct t a at at acat at t t gt aaat t c acacagaat a t gaat t aaaa t aaact t at t t t t t at t at a at t t t t t aac t aat gagat a aat t t gat ga at gat at t t a t t t ccaaact gcaat agct c ggacgt t ggt aaggcccat a t at ct t ct ag aaat t act ca ct gcgccaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1368 <210> 553 <211> 1340 Page 478 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 553 aagt t gt t gt t t t cct t t t g at at gt t t at cct ct cgt t a cgt t t caat a aagccaaagg gat cat gaaa gt agct t t t t t gt t aat cat tcat t t t t gt t gt aagaagt t aaagt t at c t aat gct t t c t t t aat t t t g cagaat at ca t aaagct t ct agaat t t at g caaggaaagt caaaaaagga aagaaaccgt at cct t ct t c cgagccgt t g t ct gaaccac cat t t acct a t cat t agaaa cgacat aat c at t agat at a t cat at t at t gcgagat at a gcgacaaat g t t aat t caat t t t gt t t gt c gt agggggt g agagt t t t gg t aat gct t t a aat t gt act t acaaat t acc cat at t t gaa t at t gaat ga t gacat at at aat t at aaag agagacaact ct t aagct gt cacct t ct ca agaatttttt caaat ct gcc ttcccaacac cact t at acg gt t aggat ct aagat t ggat at ggat gt ac t t at gcct aa cgt aat ggag t at t t cgt at aacaat t aca at at accat t gat cagcagc aaat at gcat at agat gt aa t t t gaact t t t act t t t aat t aacacacat ct t at t caca ttaaaggaaa aagcgcgtgt t ctt ggacac t t at at t cat ct gagagat a at at at t ct a aacct gt t ac t t gggt t t ct t t at at acat aat gt at ggg t ggggt at t a gat t t t gttg t at t at t gac aacat ct gca t ct t at t caa cat at gt cct agcttttttt gat at t aggt t gaat t t gga t t t t t gaaat at gt gat gag accat agt at aaaaaaaaaa agt t cacaaa gct gaagcaa t t ccat ct t t at t t aacaaa ct ccacct t a at at t t gcaa caat gt t t aa aaat acaaat aat aat gt aa t t t t aat gt t ct t ggat act aact t ggct t caat aacct t aat at t gaaa ct t caaat t a ct at t t t t t a t t aaat t t t a gat t gt aat c ggt at aggaa aat aaaaaag t gat ccat t g aaaaaaagct ccagaagccg at t t aat cgt ct aat t t at c t t t ct t cttc agaat at ct a ggt cgt at ca t agt ct t t t a t t aaaat t ag agct t act ca t aacgt at gt gtt cggcaaa aat t at t tag t gat gt gact t agt cct cag t at at gt t t a t aaagt aat t at ct ct ggcc at acat caca at gaaaaat g aaaagaat ac at t aacat at aaacaaaaaa agagtcggt t gt at aaaact t t t ccat t t c t t ct t ct gt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1340 <210> 554 <211> 1354 <212> DNA <213> Arabidopsis thaliana <400> 554 aattcttatt aggcgccaca atgcaaggtt aaaaaggtct ctctacaaag aatat t gaaa ctccttggct taatccacag ataaacacgt ggtctataat ggcatgaaca ttct t aact g agccctaaca tacataaaag ttgcacactt caagagtgct ctgctcttta cacaatctgc ct t aagacat agacaaagaa t t t ct ggat t ct t caagcac at t gt gat t t t gaacaagt g Page 47 c aagt ggat t c t t cat gaaaa agt t cct t ct gct gct t t ca cat aaaat ct t t gat t ct ag agaccgat at gagaaacaac gcaat ct cct t caat t ggag ct t gaaaat g at aat gcgt c 120 180 240 300 360 12689250 Sequence Listing.txt ataaggttta acaaaccaaa aggtgctaac ttggagctac cagcccatta actatttgcc aat t cat act aaggaagt t c tagaaagaca ccacagagaa cagaat gt ga cat gaagt gg caat gaat gc at ggagcaga ggat gggt t c gagccat cgt ct cact at cg gccat t t caa acaaagt cca ccgaaacgac agagagaagg t t agagcat t at aagaggcg aggaacgt ca t aagct t t t t at caaagat a caaat ct cct agaccgaacc aaat cgaaaa agggaagagc at t agcaggt gct t gagt t c at aggct t cg t ggcgat t t t ct aggacaag at cgt acat a gagaagaat c t t t gggaat t t t agat agat aaccaagaac t cagacacat at caaat t gt t cacat caca ct gt agccca tagaaaagga caact agcgt ccagaat ct c cgt gaagaag t ct gt gt at t gt ct ct caaa agt gaaaagt gct gcagccc tcgcgagaga gggt aaact t caagagt gaa gcat caccgc at ggat t aag gt acat t t ga gaggcaaat a aat caaacca agaacgt act t act t aact c ccaacccgt g aagaaagaaa cct t cgcat g gt ct t cgcga accaaaat ac t t t t t t gt t a cagaaaggga t t cg gt gaagt aat accgt ct gca agacat gaca ct aaat gt ct at aacgct aa t aaaccat ga gt t gcaggt a cgaggat t t g aagccacaag gat aaagaag t t aaaat cgc cgact t t caa cct cggt t gt gt gt t t t aag cgat t at t gg accact t t aa caagaaagt a ccaacagat a t t caaat t aa at caaaat gg acaacgaaat accggat t ga aat t gagt t t at gaaagt t c at gaaaagt g ct gaaact t t agt agaaact cgt t t t at gt ccagacaggg t t ggt gaaat 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1354 <210> <211> <212> <213> 555 1369 DNA Arabi dopsi s t hal i ana <400> 555 tttttgaccc t gacagagt t ttcgaagaga aaaaaat aaa at at cat cca caaat at aga gggagaagag aaagt t gact t aagt gat aa t t aaaat t aa t t agt gt t at t gt aacagaa gcat at aat g ct t ct gggga ttttcaaaca cagaacaaag agt ct t gct g ccaat at t aa t t agt cgat g agacagt gt c t at t aaat t g cat t agaagc t t aaaaccat gt caaagat t t ct gt gcat a ct at t t ct ag agaagaacat caaacaat ac gaaaacgaag t t t t t ct ct a t t t aggt aaa at t cat gt aa aaacaaagag at acat gaat act t t t acat t ggat at gct t t gagt t at c t cccat gat t aaaaat t act gagcat cgaa aat gaaact t agacagagca t t caat t t t t t aat at t acc cact t t aaaa ggagagcat a acact ct ct a t gt t ct t t t c t cat t ggt t t t aact t t t t a at gaaat gaa at aaaaat ac acagat t aaa t ggat aat ag at ggagaaaa t t t t agat t t ttttggagaa act t gaaaaa at t at ggcat t t t t t t t t ca t t t gact cga gt at t caat c caacaaat t c at ct acat aa caaat gaagt ccccat t t ga at aagaggaa aagt t t gaag gat gt t at t a agaaat aat c aggt t cat ag t ct ct aaat t cat t t act ga at ggat t t t t ct t t gggat t at t aat t at a gat t aaccat t ccat t gat t 120 180 240 300 360 420 480 540 600 660 720 780 Page 480 cat t gt act a gt t aaaat at ct t t gact aa acaccggagt aagat at gt g gt t t t t aact gt ct aaact t t t at gt aacc gcaat gt at g ggagact cat t at at ggt aa aat at ggt ag tctgacggga caaact t t gt t at ctt gct t tt ct gtt aat act aacattt caaggt ct ag tggggaaaac cgtt gcaaca 12689250 Sequence ct t aaagt aa t aaat t t aca ctatttttat agcctatcag aaggt t aaaa aaaaacagat t ccaat t t aa aacgaactt a aagt at agaa gat aatt cac gt t aaaaaaa aaaaaaaaaa t ccaaaacaa ct aaacat ag gcccaagccc aagcccatat attgtgaagc taagtccgag acaacaaatt t cagaagat c Li st i ng. t xt gt ct t t gact aat caaaat a agagagtttt act aat t t aa ct aaat t t t a gt agt gctt c cagat agcgt gagggcgat a t agt cactt a cgaagacga cat gt aat t a aat t t agat t ggcgt aaat t t t ct at ccaa gcat t t ggt a t t t t t t t act t ccat at cat cact gaaat g accggagat c 840 900 960 1020 1080 1140 1200 1260 1320 1369 <210> <211> <212> <213> 556 1297 DNA Arabi dopsi s t hal i ana <400> 556 aggaagcat c t at agcaaag t caact aaaa t aaagt aaaa cat ct t at t t aacagt caag acgagacat g ct acat ggat ggt agt t aat aat aaagt t t t ggagaaat t act gcagcag ct t ccaagt a t t t cgt at at at gaaaccat caccaaaggc t ct ggt gcaa aaaat caaaa aaaaaagaaa acacgacgat cat cgaat ag t agct gt t at ttaccaaaaa t t gt gaaaat tgaaaacaga gcaaacaaga cagacaaaag cct aggat ca t at at acaaa t t t t t t t gt a tgcgtgggca ctgatgagag gat aaat t ct acaatt gt t c aat cct t act caat t aagat aaat gact t a at gt t at t t t aat at at at a cct ccat cat agagagcgag aat agt gact t aat aacaat ttgt t t agt a cact acgcaa gat agat agc aggact t gaa cggt cgcat g act t gaat at at ggct ggt t at ggacgt ag t t t ct ct at a t t t t gt t t gt at t gt t t gct at t t t aaacc tt att ccaac cacaaat caa ggt cacaaat tacaaaggca cct ct t cct c at cacat at a t ct ct gt aat gtt acaaaca ct t aaaaagt acaaccat gg tcaacagacg aaccgtggcg gt gcaaacgt t gat ct acca cgt t aat gt a aacct cact t aaacagagca at ct t t at t t t gaatt caaa at aaaaaaaa t ggaaggat a t aagt t t t cc gt aaaaat t g aaat t acccc gaggaactt c t act gat at a ct t t t gt t t t ct caat caca taggaggagg at cagacgac at gaagat ag tt gggcacct cat gaagcac tgaaacaaag gat at t caac cacat t ct aa aagcaggct a gagat cat gt t t t agt ct gt agt caat t gt gagcaaaaaa aaat t t t gt t gt at aaacaa t gagaaaat c aatt ccaat c cat ct ct t gc t t t t gt t t ca tcgagaaaag aaaaagt gt a gat gat ccac agaagatgga ct aagat caa ccat t gaaaa agagt cct ct aagt cact t c t at gcat t gc cattttttt t ctt ct t t t t t t t t at aaaaa aaat acaaaa aaaagaat aa t ct t t agcag ggt t t t aaca t ggggcct t g t gt gagat ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 ctgcacaaca tcttgtctat tttcaatcga aacctctgtg agatcgcttt tgatttttgt Page 481 12689250 Sequence Listing.txt aattctcgtc gcaggatatc tctctcattg ggatcag 1297 <210> <211> <212> <213> 557 1399 DNA Arabidopsis thal i ana <400> 557 ct t ggaagca gat cgt t gct t aaat t cgaa aaaggaaaac ggcgct aaac at at ct acga tgtgtcccag cct t t gaat t at caaaaat t t gt t t t t gt t t gat ggaaat t at aat gt t g gccgct cacc ct aat cat ag t gagt gt t t a at aaaat t ga tttttttttt tttgtttacg t gt gaaat aa ct aaaact ac cat t agcaca agaggaccaa cgt t t ct t t c agagggataa ttcaagagag cacagcgaaa t t cgat aagg t t t t t aaat g gt t gccat gg gct gt gct t g gt aaat aat g t acat gt t ac at t agt gaat t at aagat gg gtggtgaagg cagctcgagg t cggct acct aagggaat aa cccccaaaag aagt t t gggg ct aat gacca t t t at gat gc at at t ct gaa aaggagacca aaaaaat aat aaggt ccaaa cgat t ct cgg ggtacccag tcgtggagag acacct t t ga t aaact acca at ggt aacga agacgat t cc acact gt t gc ccccgt ct aa aat t at t t gt t t t t at t t t t at aat at gat aagaagagtt gt agt cccat t t ct gccgt c gcagcact ag t t t t agatta t t t cctcttg t gggat gcaa t t gaat ggct gt t aat t gaa gat t agt aca t t gt t t t t at aacgaat aaa at t t t t cct g t gt ggct cag tgggagcggt t acat at at a gt gat gat ga aaaagaccgt accat cggcc at t at t t t gt t aaacaaat g at t t t t t gaa aatggaagcg taagaaaaca at cggt t t cc aact t cgact cagcaacaaa at gaggaaaa t t ggt gt ggt t gt t t act ct at gat gaaac gaat t t gaaa aaaact t agc t at at t at t a ct gt at ct ct ggaat caaac cgt ct caat g atcaggaggc t gt t at ct ag tccggaacgg cagggtaagg act t t accac ct t t t aaat t aaaccagaat cggcat t gat t t gaagat gg atggcagaga t ct aact cgg gatacagagg t gt t at at gg ccgt ct t t ac gat t ct act c gt t t t t t aat at t t gagt t a at t t gat t ac t aaat t t aat t t ggt aagt g catt cgccgg gcat cgccga aacagcccgt t ct t gt ccaa ct t t t at gct t t t ggtcgca t gt ct aaagg caat aggcgc gt t t at t t t g t agt gt t t t a t agt t aagt t tgaatggaga tat t ggggag t ggt t cacga agcct t caaa t t t t gacttt t t t cagat gt at gcct t t t t t t cgt t aaaa t ct t t aaaag aagagct t gg t aat t acggt gaaacacaaa agt t t ccagc gaatcggaag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1399 <210> 558 <211> 1400 <212> DNA <213> Arabidopsis thaliana <400> 558 aagattctga aagcaagtga cgatgatgat tcttttttgt agttcttaca accaaataag aactaaattt tattttgttt agtcatgtct atttaattac aatttaagtt aaaattacaa Page 482 120 12689250 Sequence Listing.txt tataaattat tttttattaa ctagaaatga aaataaaatt ccaaatttat agaaagaaat t gaaagt gag t t cggt t t ac at acggt t t t gaaacaaagc gat gacacgc aagt t ggt gc agt gat gt t g gt t gt gt ggt t t t t gcattt at t caaagaa at t ggaacaa gagat gacat tcagt t t t t g t t t t t t t ct c t t gaacat t t caaacaagat at cct t caat t gat gggct a cgacgccaga aaagaacaag aaaacaacag aat ct t t agt at ggt t t t aa gagaaaaaaa gagt agat t a gcgaaacgcg gagtgggcga ct cct cct ct t t t t agaggt tgt t gtgcag t cgt agat ga gt t t t t t ctc at ggaggt ga caact cggt t t ct ct ct aaa gat t gcacat t t aaaacat c gact agt at g aaat agccca t t agt gagt c gaagat cccg aaaaaaaaac t t ccaagct t aat caaaat t aaaaaaaaaa gaaat aaaca aggt cggt gc at t ccggagc cct cct ct at caccaaaaaa ccat t t gt t a agaaat cgaa at t t t gct ag act at acaaa acaagcact c at gt t at aga aagact t t aa aaagat t cca t at gt acat a taaaaggccc acat aaccct tttttaaaaa ct t t at cat c aat t t gggt t at gct t t cga tgat t t t gt g at ct ggt t at act ct gat t g t at t aat t t t aat ct at t t t cacaggt t ga gt gagt t gaa t t t cctgttt ggt t gt t gca agt ggact t t t acgaat cct cat aat ccaa t ct aaact t c agt aaaat t g at t aaact t g ct t ggaaaga at gaat gt at ccaact at t c cggt t t gcat gat cacgcgt ggt t t acccc gaaatggagc gct gaaaaaa t cgt cgt t ct gagat act aa agct t at aac tat t t t ctga t t at gt t t t c acgat aacat tggccaagac t t gt t gaat a ct t t t t t t t a at t cat ct t c t t gat aagaa ggt t t agact gt ct caacac t t t ggt t t t t t at t cact ag caat ct t t gg t at at aat t a t ggt at t t t g t acagaaat t gcgt gaggt g t agaaat agt t ct t ct gaaa aaat at t t cg t gaaaat t gg acat at gaaa t t gact t tag t ct cct t aat aatttttttt aaggaaaaag t at gaagct a aat ct t caac aacaaaacaa t t agat t caa ttgcagagaa 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1400 <210> 559 <211> 1283 <212> DNA <213> Arabidopsis thaliana <400> 559 tcaccagaaa aacaaaaact agaaaccagg aagt t aatca acgtcat t aa gt t at t atat tcattgtaca ttttggtgac tggaagtttt aacctaaaga ct caat cct c tttgtctaca atgtattata accaaactac tttattctct tttctcgaga tgatatcatc aatct t aat a taaattaaca cggtcgt t ct agctttgtag gagcgaaata atttattctt aattatcttt aaaaactt ag at aact acat t gt cacgt gg aat t aaat ac caaaact at t t caact t caa cat cgaaat a gccagatttt Page 48 gaaaat cat a t ct at at aat taaacaagaa at t at cacga gcat t ggt gt ct t t t aaagt gt t t t aaat g t aaaaacct t gagt t aagca ct ct gt t t cg cgt at t cgcc aaaaagcttt gcaaat acgt aaagcaaacg t caaaaaaat t aagcat at a 120 180 240 300 360 420 480 12689250 Sequence Listing.txt taattcaact aaaagaattt taactatttg tgactatcta gacttgaagc aaat gagt ag acaatttttt at gagat t aa aat t ct aaaa at act cgat a aaaaat at ca aat ct caat a acat ccct ga t ggt t aaaaa gacaccaacc agt caacgca tt at aacccc cct cacgaag <210> 560 acat aact ca tttttggtca agaaat agt c tagagacaca t act aat acc ct t ct at aaa t t ct t aat t t at t aat aaat aagt t act t a t aat gacaat gct t at gt gt t t t aggt t t t aagaagaaga at t cct gct g aacaat t ct t ccaaat agca aact t aacaa t t aaaat at t t t aat aat t a at agaggt t t t t t agaaat g tcaagacaag ttgt t t gat t cat agt aaga tt ct aacaca cga t t gat ccat a t cagt t gt aa agcaacaaaa agct t gat ac t t t t ctagt t cgat aat t t a t act aaat t g t gaat t aacg t at gaagt at t at t t gt cac ct t t t t gtct cgcct ct aat act caacaaa gct agaat at ct aaaacat t aat gaaacct ct aaat t aat at gaaat t t a tagaaacaac aat act t t t g cacgt gat t a ct aact agag act at agt ag ct ccgcgcac aaaaagt caa t at gt gt t t a t acaagat ag aacacaaaca caat gaat t a aat t t aacct gt aaaccat t t aat t cgagt t t ct t gt gaa aacgt t t aat act ct ct cac aaagacgaat acacacacac 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1283 <211> <212> <213> 1396 DNA Arabidopsis thal i ana <400> 560 t cagaat ct a agact t ggaa cct t t agaga at t ct at ct a t cgggt ct t t caaat cat ca aaagt t gaga aacacaacaa at gcat cgac at at t gaat g at t aaagt ca at aact t t t a gt t at t acaa at at act t ag gct t acgggc t aat aaaaaa tgggagggga t t at t t t t ag ttagat t t t a t at t gcat ca ttact t cttt t at aaat aag t t cgaagact att aacaaca t act aaacgc aat gat gat g t aaat aaaaa t t t aaaact a ct agt t t at t t ct t t t at t t aaacct gacc aaat at at cc act caaact a gaat cat t ca aat aagt t gc ct aacat t t t gt t t agt ggt aggacgat aa ct t t at aagt cat at act ac gt t t caat ga t aat agt aga t cat ct at aa t t aagt at ca aaaaaaat t g at t gt t at t g aagat gt ggg t acat t t ct t at gact acaa act t at aagt gt caact at a t t t cct t t ct t at t at t aca ggat cggt ct cat t ggat t t aaat t cgagt ct ggt aaaca t gct aacat a t gcgt gt aag acat caat cg t t at cat cat t aat at gcga aagt t cgaaa catttttttt at cat at gga gt t ct cat t t caaaaacatt at t t ct gt ag ttaagaagga t cagct at aa gt agt aaat a taaaaacccc t at gt aact a agct cacaat ct t gcat aaa gaaaat gat t ct cgat t t t a aaat gact t g ct gcaaat at t t aaaat act ggagacaaaa t t t t gt t gt c t ct t at t t t t tttaaaagac aaaagt aaaa acaagt aaag acaaat t aac aat at aat at t ct ct gt t ac t at t t gaat a aat acagt at t gct t t t gaa at aat gct at caact gagt t gt at aat t ct aat at t t gca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 tactttgttg attgagtttc tgaaaaatca taattgagtt tttaaattag ttggtttgta Page 484 t gcat t t gac t at aat t t t a ct gat aaaaa t aat at t t at aaat t t at t a t ctt cgt cac ct ct ct t t ct aact t ccaat caacaat aat t ccagt t t ag t agt cgat at gccat t cgag t gat ct ccca cgt t t g 12689250 Sequence t t ct t t t aaa t at at cact t t gaaat gt cg acccaaaaat atgtatttgt attatagggg t gggt acat a t gt at gt t ct aaacaaggca t ct ct atttt cgacgatctc ccaaactcat Li st i ng. t xt t t cat at at t at acat t t aa aaaacaat t a t t t acgat t a t t t gct t ct t t t ct ct acgt ct t gt agagc aggcat t t cg t at t at t ggt t gccat caaa ct aat agact t cat cgat ct 1080 1140 1200 1260 1320 1380 1396 <210> <211> <212> <213> 561 1367 DNA Arabidopsis thal i ana <400> 561 t cgt gaaccc t t acat cgt g att acagat t t cgggacat t t t t t gt t t aa gctt gt cttt at gat gggcc t cat at ct cg aat aaagt t g t aaaat gaaa cagt t aat at ccacct caca tggaagacca t t t t t t at t t cat agaaaca caat gaagcg act at t cat a gacaaaggaa ttaacgaaac cct aacat t t aaat caggat t ccgact t aa cct t cct t ca at ccat at t c at gt ct t t gt ct aaaagaat aggacaaggt tagaaaagaa t cct caaat g at at at aaat t t t aacattt aat t t gt at t aat at gct at tttt ccgt ag cgt acgt acg aggt t t caag atttttgct a aat aacaat g at aaccggt a at ct t t t at t tagcaacaag gat t t t gt cg cat cacct gt aagt t gat aa ccgggtcgga acat t cgat c t t t gctt gac aaat t t agga ttggtagcca aaat gaat gc gt agaggt t g t cgcgaagct aaaaact cga t ct aat t t t a gat t t at at c aaacaat t t a cat gtt gcat accaat t at g at t agcaat t aat cct acaa aat ct at cga gt at ctt cga aat aat gaat t t at gtt cat agat t t t t aa cgtt at cgt t ct t ct at gaa t cct ggt gag at cacgaagc cgct t ccat a aaacacagac ct agt caaag at t at aagaa at t agt t at t caat ggt at a gt at gct aca ttct t t t cat t cat t ct cag aat gat t gca at agcat aat cat gt ct caa t t aacggat t t t t ggt ct ca cat caacaaa gact t cat at t at caaagct t t cgct gt cg acgtct t t t t aat at cgt cc aaaaacat t a t act agt at c caaagaacaa aacaat ccac acagt t ggt c aact t aaaag aat aaaaaaa gt t aaagt aa agcgaaagag aaaacaaggt gt gt ccccca t acgagct aa agt cccacct t t t ggt ctt a at ccct ccat aaaaccggt t tgacaaaaaa agcaat t aaa acgat caaaa t ct at aat t c t t t aat t caa caggt t ct ac t t gtcagcag t ct t acaaaa t at ct cat t a tttctcc cccgaagct t aat gat aat c gcaagat t t a ccct t t aaca aat gt gt t gg aaagcat agc t t caat gcac tt ggctt ggc aat t ct t aat t gaacaacat agt aacacca gat t t ct at a caagat t t t a aat at aaaaa ct t t ccgaat t gct aaagt a at acgacaaa caat gaaacg ggct aaaat t aaaaaaat t g at ccaaat ac caatt cat at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1367 Page 485 12689250 Sequence Listing.txt <210> 562 <211> 128 <212> DNA <213> Aral <400> 562 t gat acgt t a at ccaacggc t at acagt gg t at ccct cct tt ctt caaca t t t ct t atga t t aat t t at t t aagagt t t t t gat t agt ag cat cggt ggt t act t gaaga aaat t t cat g t at gt t gcat t ct t at t gaa t t t t gt t gt a t t at at cact act t t at t t a aat aat t t gt gaggaggacc t t caat at ga ggt t ct ct gc t t t ggt t t t t 7 bidopsis thaliana cgcct t cct a t aat aacgaa gcct ccacct cacagt cat c aaaat cct ca aact t at cga gt cct t aat a gt t accgaat act cgct t ag at gaat t t at t t t at cgat t ct ct t gt t at gt gagt ct t g caaaact t t c at ct gggt at t t ct ct agca tt gctt ccac agt t t gt aca gagt t agcct act ct ct gat t t t t t t t ct c gt t gt caat g t ccat ccgt c acat t agccc gt aat t gt cc t ct t t ct t cg t t t ct ct gag t t ct ct t ct c ct cgt ggat t t t cat gt t ct at t t t gt gat t t ggggt t t t at agt t gt ga t gt cgcgat t gt gcaat t ga cat t t at gt t ttgct t t ct g aaagaagt ct gct at t gct t at at at t gcg at t t t t agt c at aaat at at t gt gat ggt c aat aggt aacgat t aaa acacgaaat c at t t aggaga caaat aaat g aat ct ggt gc gt t gat t gt c t gt t cat aaa t cgat t t t ga ct gt gt at gg aggt t t gt t t tcgtttttag gt ggcgat t t gat t gt t t cc gaaaaggttt at t t cggcaa t t agt cat t t cgt ggct gat t gt aacat t g act t t acaaa gt at acat ac t ct t gat t ct t ccaat cgt t t cct t t t t t t cagcgcggtt t aaccat ct c t t t agct t t g t t at act t t t tcgt t t t ct g at ct ggat ga t t t gt ggct a gat t acgt aa t t gat t gaga t cagt gt ct t t t t t t t t act ataggaacgg gcaaat gt cg gcaaagt caa tgagagagac t cct t cgt t a t t t t ct t t t a t ggt t gact t aact t t ct t t caat at agag t t gacgggat aat t t agcat t ct t ct cgt t gt gagct ct c agt t ct ct gc agcagt t agg t t gt at gat g at t t t ggttt t cggact gt t agaagaaaat at gaat caat agat t gct t g at gt t t aggg at t t caaggt gagagttttt caat t aaact t ct gt t t gag t gt t gcat ca t t at gggt at gt t t t atgt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1287 <210> 563 <211> 1343 <212> DNA <213> Arabidopsis thaliana <400> 563 gtgaggtcat attcaggacc gatccaacaa agttttattt ttaattatca taaacgacat attaaaaact aaaatcatca aaacgaaaag tgagttatta aacattttag gttttaggct acgaaaaatt ctaattttat tttggactct tat t gagggt tttactccaa gtaaaatttt aaatataata tggaaagatc acaaatact g gaaaaaagaa aaaattgggt tcaactctca taaatct t t a aaaaaaatca gaactgaaaa gattcatagc ttatgtcgct tatgtagtta Page 486 120 180 240 300 12689250 Sequence Listing.txt tgctagggat gaatctgtat ttcgt t accg taatgagagt tcgatactct acgat t ct gg acaaaat t t a aat cat at t a ct gat t ggat t ct at t t ggt aat t aagt at aact gt aaaa t t ggct aaat t ct ct gact t t aat at at cc t at t at cat t accaat t aca acat gt gt ct cat t gt t aat cgt ggcgt aa acat t cat ca t t ct ct ct t c agcat gt t ac t t t t t at t t g aaaaccat aa t t t gat ccat t t t at t t at a t t t t ct at t t ccat at ggt c at aat aaat c t aat gt at ac at gt t aagag aaaaaaaaat t t t gtt aaag aaggt ct at g at agat aaaa acgt at ccat ccat ct at ct tccggcaaga at t t t t t t ct gct ggt cct a ct t ct gccac ct cat ct gat ct t agt t aca t t t t t t aat t t aat t at aga t gct t gaagc cgacat accc at t ccaccat acaact ggac caaact t at t t cagaaat cg t at ct agt t c cgagt cact t act ct t t act aaa tt ccgt caac ct caagacaa t ct gt t t t t t t t t t t agct c t at at gat t a tagat t t t t g at gaaaact t at at t gt t at t at gat t t ag aacat at ct a agct ggct cg agaacgt t ca gat t agct t a gt ccaaat t a gt aat at ct t ct ct t ct ct t aacaact t t a at ct t ct gcc tttttttttg aacaat t t ac t cgaact agt t gaat t cat t caacgaat cc t at t t agt t g at gt t gat t t at t at t t gca t cccat t gt t tgtgtgagaa t t aagt aaac aact at t t t c t at aaccaaa ct cacat caa ct t act t gt t at at ggt aaa gacat cacat t aaccat t aa t t gcacat t t at ct ct t t at t acagt agaa at acaact t a gat t t gacga t t cccat t ct t t gt aat aaa t ct t acgt cc gt t ggt gt cg t at act at at at aact gcca gt ct t ccaac t t at t cat ag 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1343 <210> <211> <212> <213> 564 1043 DNA Arabidopsis thal i ana <400> 564 caact t aacg agt gct acac at cct ct gac t t t t at t gt c gct ccgat at gt t t accaaa gt t gaat aat at at act cca at gaaat aaa aacgt t cagt cat aaat agt t gat acgt t g acct gccat t taaagaaaag gccgcat t aa acat ccggga gat t at t gt g t cgt agt act caat ccagt a t t t caat act t cct t t cat a acaaactt ag t cat gct t t a t aat cat t gc ct gt agcgt a ggt t t acaca at gcgt ct ga at t t t t ct aa at t t t aaaac t ct gct ggt c caat act cac aat t t t t t t g gt acat gct g aaagt t gt ag t t t agt agt a cct acaat ct t agcat aagc t acaacaat a t acgt at ct c t aat t t t ct c at cacat gt t at agat t caa ccgaaaaagt t t t t t t aat g ct gcagt t gt t aat gaaagc act gcat aag ct agct cccc acacaaacct t at t acgt t g cat ct ct ct t t t at gt t t t c gt t t ccgt ag ggt aaaagcc taccagccca at t at t t t t t aaat t gcat g t ct aaat t ga t aat gcat gc t acat ct acg t gt gt t aaaa gaagct gt at t gct cgcgct t t t ccaagt t t act t gt gct cat t cacct t at at cgt ggt taaaaaaaaa tat t t t atga gat ct acact tttttttttt agt ct gct gt 120 180 240 300 360 420 480 540 600 660 720 cgggaatatg gttttgccgc ccatccgtag aggctatatc ggatatttcg catggtaaaa Page 487 aaacgt t at a t ggt catt ca aat ggat t t t acacat aaga gccgtt aaat 12689250 Sequence Listing.txt tacccaactt t tctgggaga aggtcat t ct ct t aataagg atgtcaaat t gtaaattaac ccttaaaaaa atgcaaaata atttgtttat gtgacaaaga atctataact atcaagaaga agaaaacact cggctatata tatacgtaca aaaacaaacc aatcacttca ct ct ct ct aa t caaaaagct tttaacctca ctttctccga tca 840 900 960 1020 1043 <210> <211> <212> <213> 565 1296 DNA Arabi dopsi s t hal i ana <400> 565 t cggaat ct g t acaagt aga agagagat aa acagccgt cg ccagtgacgg gat ct cccga gt at t at t t t ct t gcaagt c t aat cgat gt ggt t aat t t a t at caat act t aat cgat t t t t ccact t t g at act gt at c ct aggctt gc act t t acgt t ct ct t aat gt at ct act ct a aagaaat caa t cgacat aat agacagcaaa t ct t t gtgt t ct ggt aat ct gaat aagaga ggaagatacg ct at t t t at t cggt t agagc gaat t t t gaa tt t cgt agt a gggt cggat c gat t t t aat g gt caagaaaa gt aaaaat t g gacat cacac t gaat t t t aa tct ct gcaac t ggact caaa at at acaaca agaaat at t a at ct aat aaa at cgt t ct ac at aaaact ca t at gt agggt ctt ctt cct c acgcaaagta tact t gtaat cagcgacagt gagagtgat c t t caat gaat aacggagtag ggaaggtgag taggccggcg tttgcggaga acgaagaaga ggat t gat gg ggct aagt aa t ct ct caat t gat agacaca aaact t caga agcct aggt a gaat acaacc t aat gt at t t ttttttacaa at aaat t t t t gt gt at at ct at at ct t t ac caaaaaat aa caat at aaac ct cagat t ct gaat t ggaat atagcgcgag t ct cggaaga gt gact gt ga at ct ccat t t ggacggagaa acaat ct aat gct gggct t g t aaaat t aac gt aaaacgca gaat ct cat t aatt agt gca tt ct t t t t t a tt at acggca aaaat t act a at t t aaccat cat gggt at c gat ct t gt ga aat aat aat g gcagcct cgt ct t t ca gaggaaatgg aagaacggac tggacacggc aagcaaagat tt gt ggatt c gct acacat t gggccaggat gcaaat agcc t gacgt aaat gt gt t t t aca t t gat aaat t t at at gt aat cccact acca aaat t at t ca acat at at gg t ggact aagg aatttttttt t at t t t acga at act ccat a cgt ct ct t ca t gaaat caat gacgcct t ct ggt ggcgct g cggagact t g t t t gggt t t c t t ct aact t a ccggt t agac aaat at aaaa ccccct t cag gaat ct ct t t aaagtttttt t t aagt gt ac cct gt t t t cg t t aaat t t ca aacct caaac agct t ccaca t t caat aggt gcgaat at cg taaaggaaaa t at at t cgt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1296 <210> <211> <212> <213> 566 1277 DNA Arabidopsis thal i ana Page 488 12689250 Sequence Listing.txt <400> 566 t ct cccaaat aaat agt gga t aat t t t aaa ggaaacccaa gt t gaaat at ct aat t t at t act t gt t t t a t aaacaact t t t t t t caat a aagt t t aat g t ct at t aaca t t t agcat t t at t gt t cgct gt gt aat cag aaat gt gt t c t gaagt at t t cat t acat ac aact t gagct gacgtagaag t gccgt t t t c agct aaat t a agct t t gt gg aaaaat gaga aat at at acc at aaaaaat c at at aat acc at accct at a t t ggt t gaat gt gt t act t t t t t aat gct a agat at aaat tgaacaacaa t gt t aat t aa t at aaat at t t at t ct gcaa t cgt aaaat t t aat t at gct caat t gaaaa ct gt at t gt c aat gt aaat a acagaagcat cgt t t ct act gaaacct gac ttggagg gcaaacact a ct aaat t gga t ct t t t aaat at aaact t at t t ggaaat ag agat t t t at a t aagaat t t t tttttgtcca t t aaaact ga cat t aat t ct t at aact aga tgtatgagaa t t cat t t at t gt t t t gtgt a at gacgt t at cacaaaaat c gcaaaggaaa t at aaat ggg cgt t gt gact ttact t t t ca atctaatatt aaattgaatt aaaaactttt aat aaaaaac ggt gaaaat a at t aaaat ga aaaact caaa t aaact t gt g t caaat aat c aaaaact t aa gt t gat t aat tttttaaaaa aaaaaaaat c ct t t ct ct aa t at ct gat at at gt t acat a aaacaaacga gacaaaattt at t t at t t ct ct t at t gggt cccgt t t gt g att cagaaac ccaaat at aa t at accct aa at caat at t t t at aat at t t gt at t at t at at t t gagt gc aaat gt gct a t aaaagt gt c at t t t gt t t t aat ct act aa t t cagt t cat accagt t aac aat t aat aga t aaat t ccga t aaaaat at t t gt cct aaaa cct ct aat gg at t t aggaat gcct ct ct cg t att acaaac at t ggaaat a ct t t t aaat a aaat t t at t t t gt ccat aaa t aat t at gt g t t t gtgggaa acacaaaaaa at act at t at aact aggtt t ccagt t aacc ct t aaat gt t at caaat t t a t t cat gat t a t t agat ct t a ggccat t t gg gct t gcct t t ccgcact gct t cgt ct t caa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1277 gatctctctc tctctctctc tctcgatcgg ataatatttg <210> 567 <211> 1359 <212> DNA <213> Arabidopsis thaliana <400> 567 ggatcgaaca ct ct ct cgt a cgt caaggaa ctcgcgacgg aaaggttgct gagccgagtc gagaaacgcg acgaatagga gagaagaaaa catgactgaa aaaaaacaac cggagatttc gagagtcgct acaagtcgct tacggcgagg ctaatttacc ctcaagtttt attattaagg ttgtgacctg ctttgcctat atggctatat gtggagaatc agagaaagaa attgaagctg agcact gt ga ggtacgagaa cgaact cgat gct cacct cc gagcagaaat t gacct gacc gt gat acct a aat aagacac Page 48. t gccagt gag acgagt ccgc cgcgaat ccg cgat t t t gga gggaaaat t a t gct ct gt ct t aat cacaag t at at gggag gat gacct gg at agaaacaa at ct caact c ct ggact ggc aggct aat t a at at gt gat a gat at t t cag agat t gaaag 120 180 240 300 360 420 480 12689250 Sequence Listing.txt gaagctgttg ggccattttg gtgtagcggg tcgcaagtcg agcgtgagac gccat t gcag gat ggt t gt t t t t aat ggac t t t ccat t ga gt t gat t ggt at t ggt act g at t at gt t t a t agat at ggt aat ct t t cat t ct cacgt at t agt t t accc aacaagcaac ccaat t acct t ct t ct t gt c gaat gcaaac t t gt t gt agg cggt gt at gt t gct gt agca agat aagt t a at gt aaat t c t t gct gact a t t ccat t t ca ct at act acg at aaaaact t t cat at t t t a gt gt cact t c aat gccacgt cccaaagt ct agaggaaaga aaat gcct t t cat ct t gt ct gt ct ct caca aaat gt t gt g aat gct t t ag agagccact c aat cat gat a t ct acgt t gc t agt cgccaa gct gaaaaat t cat gt cgt c gt t t act cac cct ct t cct t t t t cacaaat caat gagt at t gcact gt gt t t aagct ct g at t t gaat ct agaat gt at a ct ct t t gct g t gcat t gact aagt t t t gca at t gaaaat g at cgt cacag gt t t t cccca act cct t t aa at ct ct t gg gggaaacgga gt t aaacgct gagcacaaca gt t t ggat gg ggaat gaat a caggcaat aa t t gcaat t cg t t t t ccat gt aaat gt t t aa gagaat gaat ct gacgaaga agaaat at cc acaagct cgt t t at t gct gt t acat gct ca agct gt cct g act t gcaat g ct at gaacaa gaaagat gt g t at accaat c gcaat cgt t c ggcgt t cgga at t agt agaa ggt aaact ac aat t agaaac aaact aacac aact gt t t ca 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1359 <210> 568 <211> 1331 <212> DNA <213> Arabi dopsi s tha i ana <400> 568 agt gat t ccc cgt aact cat gct ct gt gct caacagatca t t caaagt t t gt ggat cagc at cct gt gag ctaacagcac cttggaaaag aagcttcttc tgttgctcaa gacccctgca t t t ggt cggt t gagt gat ga tacctacaag ct ct t gaggt t ccaagt cag aacctacagg ccactcagat ttccaaaatc aataatccaa caataacaat actgtatacc aacaaatccc caaaatcaaa atccagactt gcgcaatcaa agaaaaaaga gact agagat ttcgttcaga aaagggtctg aattttagca aaacgtagtc aaaaacagaa actgatccca atcagaatcc catgaaaatt gcatcggaaa gaagaagaaa cattcgtaga ggagatggga agatgagcga gagaacaatg agagaatttt tttttatttt caagaaccaa cagct gaat a ccgt cccaat t t t cagt at c aggt gact ct ct cat cagga aaact t gt t t t t cct cct t g ccact ccaag aaat gcggt a aacagaagca at ggccgagc gat gt gaat t acct ggacga t t t at t gt aa at agaat ct a aat ct ct ct g t cgat t ct at t t t gcagt ag ct t gcaaaaa t acat gt cat t gacat t cga tacaagcaaa aaat caaacc gaat cgat t c t ccaaacct t acagcaat t g t gcaaat t ac gat t t t cat t aaat aat aat aact aat aca aaagt t gcca cct gt ct at t at ccagagca caacat cact cgt t ct ccat aaaacct t t g t gt ct t gt ac caggaagaaa acgagagcga aaaat aaaat t gaat t cggc caaagaagaa gagaggt caa t gacaaaat t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 aattaaataa atttgctttg cactcacaga acagaacaga agatgacatt ttctctattt Page 490 tcatatttgt ctttttggtt gaattgtttt agaaaatgct t gt aaat ct a tttat t acac agatttatcc aaatttcaat agaaagcatc ttt ggactt g t ct ct ct ct c t cgagt ct ct ggaggatcag 12689250 Sequence taatcgactg gttatttatt aatt gaatt a tt agatt gct acacaaaatt acttttaagt agccaaat aa gaaaaaaaaa tctgattttc tctatagacg cgacggtcgc atatcaaaag Li st i ng. t xt t at t accct a tcagacagca t t t act t ggt agagaaaaag caat caccgc t ccat t t ct t aaaaaaaacg aaaacat t at aat aat at t t cat at aagaa cgat t ct ct c t ct t ct ct t t 1020 1080 1140 1200 1260 1320 1331 <210> <211> <212> <213> 569 1385 DNA Arabidopsis thal i ana <400> 569 aagat t t t cc aat gt gct ac at t caat t t a at t aaagaac aggt aagt t g t agat t t ct g aat aagaat c t t aat t t aat at t t at aaat aaaagt ct gt ct gt acat ca cat t t at cag caat gcat ca ct ct acgcct cagt ct ct t c caat cgaaaa ct t ct t ct ac at t ct ct gt t at gt t aaaat t at ccgt gaa t ct ct agat t at t gt t t cga t at caat gt g gctacgggaa at gagat gt t t t t t t ct aat at at t t t t t g at gt gat t t a t at t t t t aac aaat ccact a aacact caat tt t acat aaa t t aagat caa t at agt t at c at t t caaat a gt cat at ct t cgaaaacagt tt att ccat t aaaagaaagg gt gt agat ct ct t at cact g tt aggt t t cg gaacat agac cgt cacct at ct cat t t gt t tgtatgt t t t t t t gaacct g t t t t t t gct a ct agaat t t g gt aagat at a ct gt aaagt c t aaagaagt c act at t t t ca t t cat t t aaa t t at t agat t t t at agat t t t caagtt gt a acagat ct ca cagcat ccaa t t t t at t t at ttttcaaacc t aaat ct ct c gat ct t t gat at t cagt gt g gt t t t gt t t c gagt at gt ag gaagaagat t t at gcct at t gggt t ct gat aaaat gct ga aagt at gagt ct t t cct agg agt gagt t t t t t t t caaat t t t t t caaat c gt aacagt aa aat t t aaaat gaat acaccg aact t aaat c at act gt aat acact agcat cact t gt caa t t at cat t cc act t gcaaaa t ct ct ct aat t gt at gt t t c t t t gat at ct t gct t t t gaa at ct t act t c cat t gt gt t c t t ct ct at gt t t t gt aggat tttttaaaga t t aaatt gga acaaat at ag t at at aat t t ct at ct aaaa ct t t caaat c aaaggt t gat ccat aat t aa ct ct aaat at taaaggccca acccgt t ggg ggct acacac cct t ccat t g at t t ct cat t t t cgaat cag cat cgtt cgt t ggagat ct c aaat ccgat t cgat t t t gct ggatt cgcgt t t aat ct aga t ctt aat cgg t t gt ct aga aaat t t agct t at at acat c gt act gaat t t gat gat t t t ct at gagat t ct t caaaat t t t t t aaatt t at aacat aaa acacaat gt a t aat accgt c cccagt ggcc gt gt cagat t gat ct ct t aa gt at ctt cat at ctt ct ct t t t cgt agt t t gat ct cat cg t gt gt gt agg ct agat t cgt t gaagaat t t t gat t aggt t t gaagaaat g t t gt t gaat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 Page 491 12689250 Sequence Listing.txt gaaga <210> <211> <212> <213> 1385 570 1287 DNA Arabidopsis thal i ana <400> 570 gtaat t t ggg cacgcgatat agt actt at a gct t cat cat t ct cgaaagt tagatctgcg gtgaat t ct g tggaaacaag gt at ct ct ca acat cgt t ct gt t gact t at t gcct at t gt cat cat t cga t cct gat aaa ct t t t t cct t tcgaaaggt g ggt t acagca tactcaaacg cat t at gt ag cct gagt acc t t at aat t t t ct cct act ac t aaaaacct c aaacgctaag taaaggcact t accaaat ca ct t gcggttt t t t t ctctct gt t t t t t t ca gagt t cat aa cgat t gacgt t at cgt cct a acctgaaacc ctt aggcttt gat at acaat cat t gt t gt g tggat t t gt t gtaaaaagaa ttgaagacgt t gagt t t cct tctct t ctta agcatgcacc ctt ctt t caa aagt cct gt c ggaacggcaa gtataaggt c ct t t ct acct t caact t aat t ct gaat cat gtagatctgt ggt t t gaggg t t cgcgat ct cgt t ct t cga ggt t t t ggt a aacgact t gg gt t t t t agt t t gaaaat t ga tgatct t t t g t t gt ct aat a gt ccagcagt tcgtccaaac ct at aat gct act at gt t t t gat ct t cat a agctcctttt gctcaca at t ggaaat a aaat acgt aa ttct t t t cct cgt t t ct ct c ct ct at ct gg t ccgt cgt t g t t t cgt t t ga t gat ct at cg t t gat t cat t t aagagat t g t t t cgat t t t at ggat ct gc at ggt t ct t a aat acct ggt cagaaaagcg agt agt t cct ggt ggaat ca at at ccgat t gct t t t t gt t at at agt t at ct gct ggt t c at caacggct ttgtcaggga cctcacgaac aaat t t aggg t gagt ct ct t aggat aat gt t at ct ggaga at ct t cat t t agcct agct a at at t cgcat tgt t t t gtta t at aat aat t t ggt t t at ag at t t t t at t t tgaatgagat t at t t t acga agaaat t caa t ctt caat ct gcagt gct cc agctttcttt ccggatccaa cagat agct t gcat aaccct gcat ct cat t t t t t ct ctt t t t t t gaat t g t t t t t aatcg aaggggt t t c at at at gt aa ggt t agat ag gcat ggt tag cat gt at gat ggt t t ct act at t cgt at t g t t cat ct gat t at gat ggag agct cccct c at ct t ct gt c ct at ggt at c aagaggccat act t t ccagt t cgt t ct ct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1287 <210> 571 <211> 1356 <212> DNA <213> Arabidopsis thaliana <400> 571 cgcaacgata ggtgcctatg gaaactgaat caacagattt ggttttgata tcatatatca tcagctgtct actatttgat ctaggacaac acaaaagctt at t ct t ctcc aaaatggct a ctggtaatga ttgcgtaaca ctacgattca ctatcgaata tatttgttcc caggtcttgt tctctgaatt gaacgaccat attatcattt gttggagagg tttactaacc gataagcaca Page 492 120 180 240 12689250 Sequence Listing.txt aacggt t att caggctgcgt gtgataatgt ttctatgatc tgct t ccgca t agagat aac t cccggt t aa t at t aat ccg aaact t caca cggt t t gt at act t ggct gc agtggcggga gt cacaact a gt t agt t t t a t act t cagac cgcct ggaaa gagcagagat t t gcat t at t t at gct t at t aaaaaaaaac at gt aat gcg ct agcaaat t gcagccgcgc t t gaaaagt t ccaat aaagc at aaggagaa t gt agt agcg at gccact ga tcaagacaga t t at aaacaa t t gtggaaaa agtggaggag accaat acaa t at t gagat t ttcaccggaa t ct ct gt ct a t gt at caat t gt at at acaa ct t t aat agt t cggat ct at tttttggccg t cggt gt gga gat ccat ct a gaaggat t ga t t t t ct aggc t t t t gt cttt t t t gtt gcaa tgt t gt t t aa gt t ggt aaat aat cagcagt gat t t agat c cgat t gt caa ct ct ct caag t t gagaggat caat ccaat g tcgatgggcc t aaagcccat aaat t gaagc aggaaggat a gat ct aacgc cat acagagc t at acct cca caagt t cgga gccaaat cca cct ggaaaca ggt t t ggt aa at gat at cgt aacat gat gg tttcccgcca ct t ccat t gt aat at cct ca t t t gt t ctga tt gat t t t t t ct aaccct at t agt t aaaaa cat t t t ct ag aagaga t aaaact t t a at gcccccga agt gt t ggt g agagt t at ac aat t t aacgt gggaaacgt c tcaaagaggc t ct gat gat a ggcaacacca gctgagcaga ttgcaagcag acggt gt cgt gt gat ggat a cct t gaggag ccct aacaaa cccagagct a gt cat t agt t aaaggagct t at t t ct t t ct gacgaggaag ct aaaat caa at aaccaaac gact aaaat a gat gccat cg aaacaagacc tcaacaacac acgt act ggg t caact gt t t act t gaat ct ggggaagcaa acat gaaaga gaagat aaaa tct ct t t aat t at t gt t gac t t t t cgt cga 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1356 <210> 572 <211> 1288 <212> DNA <213> Arabidopsis thaliana <400> 572 tcacacacac t gagt ct cac gtagcttcgg agacgacgca gtgacgccgc t t aaat t ggc gcatagtgat agtgtgtgat ggaatgtggc ggt t aat cat acaaaactaa gt cagct t t a ggcgggaggt gt acatt acg aagaagtct c cgccacgtaa ttat t ggaaa gtt ctt aaat gaactggtga gggaaggatt aaacaagt t a ct aaat t t aa act ccgt cac aat ggct ct c aaacagagga aacacgat t g tccccacaac at t t t gt gaa atggat t t t t aaaact gaat aagaggagaa tgagt t t t t a att aagct aa cgt ct ccgcc cggct t cct c gaat aat aga at t ct gat cc aactt gat gc at t gat t t ga tttttttttg t aacgcaat t t att cgagct t t t cat t t t a tgcgcagttt gtctccaccg at cgccgat t accgtagaag cat cact gt c cat t t t t t t a gt t t at t gaa t tat agt gt t t agt t at agt aat t aaaat t ttat t t gtca tgt t t gcttt ccgcat t t cg t ct t cct t t g aggat aat aa gacat ct ccg ttttaagaaa ttgtggggat aatt acgat t tatgcggaga gt t aat t gga t gaagat aac ttatctggag 120 180 240 300 360 420 480 540 600 660 ttttttttgt ttcagttact atttgatgat ggatgattac agcagtaaaa aagaaaaaca Page 493 cat at t aact aatt at gt ga at aacaaaag ct t t ccaat c t ctt cat cca ct at t t act c t t t cacat t t ggggct t gaa t gt gt aat t g t ct gt t at t g act cagccag caaaact aaa ctaaagggcg t t t cct t t t t t ct ct ct ccg ttctct t t ct aggat ct at c accct agat t t at t t t ggag gt t t aggt t t 12689250 Sequence gcccagccca aatccctct c aat gat gaaa t acacaaaat gt t ccaact a t t t cccaat c gcctcagagg ccattggcga tcgcttcttc ctaacgattc cttt aggat c gatt ggaacc cctttaatcg gcaaattagt tcagct t aat ctgggtaat t aagat att ga t t t ggtt agg gt gaaggt Li st i ng. t xt aacgcagcca t t ggaggat a t t t at aat at agaacaaagt ct t cact cga ct aggat ct c at gt gagct t t catt gggt t t at t aat caa ggcccagccg t aaat gt aaa accct cgaga t t gaacctt t at caggt aaa t gagt t t t t c t t gat gat t t gt gaaat t t c t ct at at t t c 780 840 900 960 1020 1080 1140 1200 1260 1288 <210> <211> <212> <213> 573 1316 DNA Arabidopsis thal i ana <400> 573 acaacgt gt t gccacgacag ct agt gccag acct gcagga acggagaagg ct gt ggatt c gt ct cgcat c aaat agt at a t gcacaact a gt aaat t t gt aagaat ccgt agaggaggat t caat acttt at t ccat ggc t t t t act t t t at ct at t t ga gact caccaa aat t gat gat t at aagtt gg aat aact t at t gat gt gt ca cgt t gacgat t t ct t gcgag cgat acggt g agaagt gaat t gct gcggt g t t gat ct gt c t at ct agct t at cgat at ga t t t gat t cgt at aaaat ct g gt t at act aa t t t ggt t t gt t aacagacct act at at t ca t agt ggt at a caagaat ct g cact t t t t ca gct at t ct at acacat t acc ccgt t ct t gc gat aaat gcg acgt ct t gcg aat cat gaag agct acat aa gt gagt gaga t t t at t aat t agcgt agt gt t at ct accca caagagaaaa ttaaaaggag cat t act aat act aat t aac aaagaagaag t ct t t gtct t ct t t aagt t g t gt gcagt ca cct t cat aca t at t aacagt acccaaaaaa ttttagaagc t t aaggat t a tgagtcagag aaagagat gc gat gt gggag t ggat caaaa aat gt cat ct t ct t gt ct gt t t aacct t aa aaagt ggt ct aaacaat at a t t at gct cac at at at at at aagaacaaac t att gaccaa ct t t t t aat t gt gt gct t gt t gagagct ga ttgat t t t t g aaaaaaaaga at ct gcagat cgat ct t ggt aacgagt t t g cggagatgaa aagccaacgt t cgaat gt t c cat cat caac gt t t cccct t t t agt cccct aagggactga ct at act agt ccagt t t aaa at at at t ccc acaat aagag t aaagt aat t gct ggt gt ac t act at t gt t at t cat cact acagt aaat t agaaccattt t ccgaagct g cat gagt ct t t t aggt t t t g gaagacgaag gaaaat ct gg t gggaagct t agagaaaaca t ggct t t t aa t t agcgt agt ct agt t ggag ct t t t ct cgt t aaaat aaaa aaaat t aaag at t ggt gt ag t ct cccct t t aat t at at t g t t at ct cat t t t gt t agt t a tgtct gccag t t ggacat t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 494 12689250 Sequence Listing.txt cgtggttttg ggtttgtcta gcgacctttt atgggccaat gatatgaggc ccagtaagga aacaaagaac ggcccattat aatgtaaaca tatttagtac aaggaaaagg ggtaaa 1260 1320 <210> <211> <212> <213> 574 1282 DNA Arabidopsis thal i ana <400> 574 ccacatgggg t cagt cggt a at t aat gt gc agt aagt t at aaact t t at a t at cggat t a tcggaaacaa ttttcttgcg ggtcgacccg gcatat t t t t agt ct cat t t agt acgt t t a aaaaaaaaca ggt t gt t cca acct cat aac act t ct t t ct ct cagct cag cct ct t act g t ct ct ggt t t cat gt gcat c cgt cggat ct cgt t t t t cct gat t t t gaag at aagt t gat acgagact at ggt t t t gt aa aaaagt ct cg t cat t at cat aagagagt t g ttacat t t t a gtgcaagaag gtgggccggg at gt gaagt a t gagt act t a cacaacat aa ggaaccaaac t t gcct t aac t t t gtgtgt t at acat t gca accgat cat t ttttttgccc t ct ct ct t t g ctct t t t t t g gggagt t t t g at at cgaaat t at cgt gt t a agaaacat ca aat ggagt t g gt t at aat t t t t t t cccttt gagt agt t t a ct t t ccgcca gacggacaaa gaaacaact t t agaact agg caagt t acaa acgat gct ga aaaaacaat t gcgct ct caa gt gggaat aa t gt gaaat t t t cgt t t t cct t at ccat agg gt at aat cgg ct ct t t ccca ag aagat gaaga ccat t t gt gt gt at gt t ggg t t gcct t aac t t gt t gcgt t ttct t t t gat ggat ccaact acacct ct t t gtggcacaga gaccacat t c cat gat ct t g cacacat gat cgt cggccac t aat act at a aggt gaaaaa aat aaacgaa caact aagt a gt t t t t t t t c gt t t t ct t cg agct aat t cg cgggt t t t cg aaat aat aca t ggaacgact t at t t gggga gcat ct ct aa tact t t t gt a at ccat gaga agt t t agcat aacacact t a gt cact cact at aagat aga cgt acccaat t t cagct caa aaacgt agca act aaact cg aaact ggt ct at t aaat acg t at aat aact t cct t cct ct ct ct ct at ct ttcgt t t t ct ct t gagaaat aaat t t gt ca gt t ct cgt at t t agat t t ca act at cagt a aat t t t t t ca t t t ggat t aa gat t at ggt t aggagat t gt t t t t gcgttt t gt t gt t gga t gggat cat t aact cccaag aaaagagt aa gt aaccaaac at ct t aagca aaccccaat t cat at ct t ga cct ct ct ct c cct cgat ct t t ct t cat ct t caagcgtttt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1282 <210> 575 <211> 1343 <212> DNA <213> Arabidopsis thaliana <400> 575 tagatgcgct aggagattcc cttctccatg cattacatca gaggacaata attctatgtt caaagccaaa aaaatggaaa attagccgca gtaaacatga aaaaagcgac aaactaaaca gcaaaatcta agaaagtacc acatataagt tggtggctag actttatact agatttctct Page 495 12689250 Sequence Listing.txt gagaacatga ggggtgagcc accatctatc ccatcaatcc catacttgag ct t t gagat t tttccaagag t aagat ct ag gccat t t ct t at at at aacc t cct t cat t c t ct at at t ct at aaaagaca cat caaaaat cat ct aaact aggaagagt c t agggct t ct t at ct ct ct t cgcaat t t t c at ct aagat t caaat gct t t ccat t gaagg agaaat gcgg t t t t gaaat t tggctctatc aatt cttcatagta ccct caaggccatc gtg! gatctgagtt aact ctaaacgaag cca~ caaaat cagt t ac~ gaaacaagaa caat ctattttgat aca ctccttgcaa ctt acacacatct ctt~ aatctctgaa cgtt ccacat t gt a tca ccccttacct aaat cgggttttca aac~ ct tct ct ttc gt c gagccgcctc gac! acgacggaaa tta! atacggagat tta ggat at gt ca ct g acaaga ggt ct g ggaaaag t cact a aact t gg aagaggg ttgaca cat ggag ccaaac aaagatt t t t t ct act ccaa t cccaa aagaaat aact cag gat gt t g gaagacg ct aat at accagcaaca t at aaaccct aact gct t ac tcagacaaaa t t at agct ca cat t gt ggt t aaaaaaactt at at gt t t ac t caagt gaaa ct ct caacgt caccacacca cagat t t act caccagact c tgagccacac cct t gcgt ga at acct t aga aagat t t t gg gaat cct ct t caaccat ct t t aat t accat aggaat cacc t cagt gcct t t cat t t t t at aaaat t agcc ct t t gaaaca t t gcat t ct t agaat act t a acct aaat t c at t t cct aaa gt t ct cgaac agt aagt gat agat t gagat ccgagat gca ggt t t gct t t cgaaaggcag caaaat caca gt t at aaaca ct t caagaac accaaagat g gat ggcct t c cct ccct t ca aacaaacagt cgagt cat ca t gct t caaat gaaccagt ca ct caacct t t gcct caccac ggat cgcaac ttagcaagag t gct gaagt t t aagaat gaa gat gcagt ga 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1343 tcgaattcac aacggaagta cgccgcggcg <210> 576 <211> 1344 <212> DNA <213> Arabi dopsi s tha i ana <400> 576 t cagt t at aa acct t t ggga accac( ttttcacggt atcgaagagc atttaa tcagagagag ctttgtcccg ctgct( gt t gct gcaa ccgct t cgt t t ct t t atgatcttct tattcatcac caatg( tgcacacaca cacagt t t gt act t a ttttagtgat ctgcttcttc aagtgt atcgggcttg tagcgcccat tttcat tgcaatggag aaattgatat caaac~ aaat act cca aat cat aagc agt t c cgcgc t t ggcacacg acaaaat gt a gt t aagat t g acgca caagt cct t g cat t a ggt t t :tttt act g at t gt cat aa ct t t ct gt ct gct t cat cac acagct gcgt t gt t gct cct cat t caaagt gct t accaca cccaccact c gaaact aggc cat acat t t a ccat gat agc gagaggct aa ccct ct cagc t t t t cggcat aacgaaaaca gat t gt gt t c t ccat cct ct t gacgagt aa acccaaat aa at t at ct ct t t gct t cct t a aaggat ggag cacat t ct ac t gt t t t t t t g ct t t gt agt a t act t t caac ct aaaaacag at cgagaaat 120 180 240 300 360 420 480 540 600 660 cgt at cat at cccacaagt c agcgt aat ac cat ccaaacc aaacgat gaa gaaaacaat g Page 496 12689250 Sequence Listing.txt gagcaagt aa agagagct cc gcat gct gaa caacgagcct at ggt gt ct g act aat at ac t at ccat ct t caaacaggat gaat aaggca cgct t cgct g ct caat cct t t gt t ccaagg gat acgcggg cagaagaacc at t aacagat tcaggagaaa t gt aat gt gg cct t gt ct t a act cgt aaaa at gat t t t gg act aat ct gg t t gt agt agt gt at at t caa aagcacat at aacat at at a aaaat t cgaa aggt caagaa caagcaacgg caagaagt gg aaact gct aa at gcaaagac t t ct ggt gga gt cgaat acg agt at at ct c cggagagaga ttta gagt t cgaat gaaaat gaaa cacgat t aac aaat cgagaa aagacgagcc acgagagcaa gt t t ct gt t t t cat t ct agt ggt agacccg at cat caat c t ct gcgagag t t caagt t aa agcaacgacg at t gt agaga gat cgaagac agat ct gagg aggt act ct c gcaagaaggt caat ct ct cc gggccgt at g ggt t gagat c aggct t t t ga aaagagagat gaaaaact t g t t acggat t t at act t ggaa ggt t caat t t t at t at t gt c aaat at aagc at gggcct aa ccgacgt gt g gcct cggaaa cagat t ccgg 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1344 <210> <211> <212> <213> 577 1211 DNA Arabi dopsi s t hal i ana <400> 577 cacgcagaat t agt t gccaa cgat t aggt t at ggt t gat a t t acct t gcg aat t t accac aat t at at at aagagaaaga t gccct ct t c caaat gt cat ttgcaggaga at cggaat ct t ct caat at t acgaaagact gt aagat gac at t at ct act gt aagaggcc ct caccact g aact at ct gt t t gct gat at cct t gt cacg t ct cct t ct t agact aagca aaagt at aaa t gaacaat ga aat t gcat aa ct gaaat t cc taaagagaga cct aat t t cc ct cct ccgag cct cgaat gt gccat accgg gt t t t gat t c cat t agagt c t ccct t at gt at acagaaac cat at acat t t t gct gagac cat cggt at g gat t ccagaa acat t cacat agat aaaaac ct cagcat ct aaat t t cat c t t gt t ct t ac t at act ccat cat t cggcga t act gct gct aggt t gt cgt acgt at t t ga aaaaat at at aagct gacct aat gaacat c cgt gaccaat ttcaaccagc ggat act gga agct aaacac aaacact t gc t cacagct t c cat gagat t c aaaact t caa agt gcccat a agcat t ct t c t agagt aaca gaccaccgga ggacgt ggt t at act ct ggg at t gaat aat t t ct gagt ac aat at caat t at gcat ccaa t act gat at a aat aaacaat aacagt gct a aat caaat t c agcagt gt ca t t t ct cagt a aat t gaacaa gt aact t cgc gt cgacgct t cggcggcgcg gt gaccgt t g ct t ct cgcct ct aaacgat t at ct t gt t t t t at gat t at a ccacat act t t t t at aact t agt at gat aa aagat ccgt t gat act t gaa aat t caccag accaggcaag at gcaat t ga t t t t gct ccc t ggact acaa t gat gt t gag tcgccgaaac t gaat gccaa agt aggt gac t at cgccacc t gggcct aat cct aat at ga t t agat gacg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 taggtctaat ctcagatttc gatttatttc ttcgagataa Page 497 12689250 Sequence Listing.txt t ggcggt aca tcct t cgata tct t cgataa at acgcaaga gtgagct t ca cggtcagaca aaagctgact ctacgctctc tcaaaagtcg aatatcccaa attcgatccg ttttccggcg at aat ct ccg 1140 1200 1211 <210> <211> <212> <213> 578 1270 DNA Arabidopsis thal i ana <400> 578 t acaaat cca acagagat gt t t t gggcgt g t t ct t ccat a t t t agt t gca t cgat cagct agctgcgt t t aat gt ggt ac at t ggt at at aaaat gt t cg at t gct t acc agt gcagt t c tctgat t t t t ccct aaaaag gcat t gat ag gt t caat t t t gt t gt t t t t a ggcat at gcc ggt caaact c t aat aat ct t aacct caccg agat ct cacg aagagat t cc cgt t at act t t ggat aact t gt aat at t ga aggaaaaagt gct gaggt gc ct gt at cct g gt at gt at t a gt t agt cat a gct cat gct g t ct caacaaa cgaat aat t c gaat t ccct c aaaaccact c t gt t aat t ga t t t gaat t t g gcaagt t gt t aagt ct gaga t t aacct aac gt t t at ct gt at aaat cct c agatgaagta aagaagttgt gccttatgct gatccaaacg ggaact ct gt ct ccaaagac caagact at t gaaaacggtt at acct act t gat cact t t a gat t cct t cc t aat agt at t t ccact cct t ttct t t t gat t cact caact t act ct t gaa at t acagct a aagt cat gca aact gacagt t agaaat t gt t act ccaacg cacggt t aag gagat at t cg t at t cat cat gt agt t cagg ttgaaaaaac t t gggat t t g t cgt aacat t agaaccgt cc gaaacaaaac t t gat gagt g at t ctct t t a t t gggccgct t ggt t ct ct c aact t t t gat cacgt t t act acat ct at ga t at agt at gc aaat aaaat t gggacacgt g cact gact ga at ct t aaagc ccgct t cccc ccacaacaaa t t t gcaggat tagtgaaagg gtgcct t t t t gct gct t ct t at agat t cac aaacat gagg at t aaaccgg t t t cat at ca cgt t gct t t c t ct gact ct a aat cact t at t act at gagg ggggt ggact gt t act act a aat t t t t aag t ggcacgt t g ct gacccct a cgt t gagat t t t ggccggct cct ct t ct t c ccat cct gt a t aacaagt gt t aaaat acga ttgt t t t gt c t gaagaaaat accat gct t g ct at t gt acc t agct t t aaa at t t t t t t aa ggccgcagaa t ct agat t at aaaaat t t aa att gcgcaaa aagt t t aacg att aaaagac ctccaggagg ct t aaccggt t t cccacat g at aaat cgat agt ct gat ag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1270 <210> 579 <211> 1116 <212> DNA <213> Arabidopsis thaliana <400> 579 cctcagcaaa taagaggacg ataaggatcg gtcttcagct ataaacaagt aaagaaagtt gagattcgaa gactctttat aagtcattgg atttgtagta aataacaaat taacaacaca Page 498 12689250 Sequence Listing.txt acaaattaac aacacatata ctacaaattc gagttaaaaa ccccaatata cgact act aa aat gaat gat gt cat aaat a t t t at t t aaa acaact agt t ttagtct t t t gggcaaacct aaaaaaaat a t gt t gat t ga t t gacaact t ttttacaaca aaaaat ccag t t t at t agt c t at t agccat gt cact gat c t t t ctcgttt acgcgt t t ca gat gt aat ag aaaat cat ct act at t aagt t at t aaaaaa at t t at t gt t gaccaagat g t at cct acat gt t t ct gaaa ccaat t t ct t at aat t gaaa t t t agat gt a gat at t gggt tcgagaaaca tcccacgacg gct ct acgaa at gact ggt a t agat gct aa at aat gcgt g at caacat ca at t gt t at ca at t gt aat at t gggaagt t c ttct t cattt aat cat aat t t t aaat at at t gt cgaccca t t t gt at t at acat at gt at aggcat ct ct at ct cccaaa aat cagccgt aacat at gt a cat aagct ca t aagct t gca at cggaaaat t cat ct cgat gcgaaaat ga gaaact gcaa tttttttaaa gagt t t t t aa cact t t t cat aaaat at aca aggggaaacc gt t ct t t t ac at t t t t t t gc ct cat t t ct c ttaaac act at ct ct g caat t at t t g t aaaaat aca gat t t gct t t t t t aat aat g ct t gcaact g at at gt at aa at act aat at at t agt t ggt at at t ct t gt tttaaaggca aat t at at t a gat t at gcca t t ct t ct aat t acgt t cat c at at at gcat t t acat at t g aat aat t aaa gt at at aact t gaagt t at t ct at at at ac agt t gct t ac t t ct t aat aa t t gcat act t t t gt at gcat agagct at aa t t t cgct gat t t ggt t aat a t caaaaaat t agact t ct t c gat ct ct ct c 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1116 <210> <211> <212> <213> <223> 581 11181 DNA Arabidopsis thal i ana artificial sequence vector pNOV2374 binary Gateway desti nat i on vector <400> 581 act gacagaa cgccagt acc agt t t gt aca t agat t t t gc tggcggccgc ggat t t t gag aaat cact gg cat t t cagt c ttttaaagac cccgcct gat t at gggat ag cgctctggag ccgcaacgct gagct t t gca aaaaagct ga at aaaaaaca at t aggcacc t t aggat ccg at at accacc agt t gct caa cgt aaagaaa gaat gct cat t gt t caccct t gaat accac gcaggaat t g tgcctgcagg acgagaaacg gact acat aa ccaggct t t a t cgagat t t t gt t gat at at t gt acct at a aat aagcaca ccggaat t cc t gt t acaccg gacgat t t cc gccgcagcgg t cgact ct ag t aaaat gat a t act gt aaaa cact t t at gc caggagct aa cccaat ggca accagaccgt agt t t t at cc gt at ggcaat t t t t ccat ga ggcagt t t ct ccat t t aaat aggat cccct t aaat at caa cacaacat at t t ccggct cg ggaagct aaa t cgt aaagaa t cagct ggat ggcct t t at t gaaagacggt gcaaact gaa acacat at at caat t gggcg cgaat caaca t at at t aaat ccagt cact a t at aat gt gt at ggagaaaa cat t t t gagg at t acggcct cacat t ct t g gagct ggt ga acgt t t t cat t cgcaagat g 120 180 240 300 360 420 480 540 600 660 720 tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg gtttattgag aatatgtttt Page 499 12689250 Sequence Listing.txt t cgt ct cagc acaact t ct t t gat gccgct t gct t aat ga ggat ct ggat t gcggt at aa tgaagcagcg t gat gt caat tgccgaacgc aat gaacggc cct at aaaag cgcccgggcg cccgt gaact at at ggccag aaaat gacat t t at acacag t at gt agt ct gt t t ct cgt t gt agaaaccc cgcgaaaact at t gct gt gc ggcaacgt ct gt gct gcgt t at ggagcat c gggaaaagt g at t agt agt a aat t gct t t t accaaaattt act at cccgc cat gat t t ct acct gggt gg gt t gact ggc caggt ggt t g caat ccct gg cgcccccgtt ggcgat t cag at t acaacag ccggct t act gaat at at ac t at t acagt g at ct ccggt c tggaaagcgg t ct t t t gct g agagagccgt acggat ggt g t t acccggt g t gt gccggt c caaaaacgcc ccagt ct gca gt t t t t t at g cagct t t ct t caacccgt ga gt ggaat t ga caggcagttt ggt at cagcg t cgat gcggt agggcggct a t acgt aagt t at at aat at t ct gt agt t t a gt t gat gt gc cgggaat ggt t t aact at gc acgat at cac aggt ggt ggc caact ggaca gt gagt t t ca t t caccat gg gt t cat cat g t act gcgat g aaaagccaga t gat at gt at acagt t gaca t ggt aagcac aaaat cagga acgagaacag t at cgt ct gt at ccccct gg gt gcat at cg t ccgt t at cg at t aacct ga ggt cgaccat caaaat ct aa gt acaaagt g aat caaaaaa t cagcgt t gg t aacgat cag cgaagt ct t t cact cat t ac t acgccat t t t ct gct t ct a t caaat at t t t aagt gt gt a aggt at cacc gat t accgac cggaat ccat cgt ggt gacg caat ggt gat aggcact agc ccagt t t t ga gcaaat at t a ccgt t t gt ga agt ggcaggg t aacagt at g acccgaagt a gcgacagct a aaccat gcag agggat ggct gggct ggt ga t t gt ggat gt ccagt gcacg gggat gaaag gggaagaagt t gt t ct gggg agt gact gga t t t aat at at gt t gat t cga ct cgacggcc tgggaaagcg t t cgccgat g at accgaaag ggcaaagt gt gaagccgat g cct t t gat at t t t t caaaat t at t t t aat t gt t t gt gt ga gaaaacggca cgcagcgt aa cat gt cgcgc gt cagcgt t g gggact t t gc tttaaacgtg gccaatatgg tacgcaaggc t ggct t ccat cggggcgt aa cgt at t t gcg t gt caaaaag t cagt t gct c aat gaagccc gaggt cgccc aat gcagt t t acagagt gat t ct gct gt ca ct ggcgcat g ggct gat ct c aat at aaat g t at gt t gt gt t gat at t t at ggt cgaccat t gt gggcat t cgt t acaaga cagat at t cg gt t gggcagg gggt caat aa t cacgccgt a at at at aat a aaaagaat gt t at aact t t t acaacgaact agaaaaagca t gct ct acac aagact gt aa aact gcgt ga aagt ggt gaa gacaaggt gc gt cggcagaa agat ccct ag cgct gat t t t aggt at gct a aaggcat at a gt cgt ct gcg ggt t t at t ga aaggt t t aca at t at t gaca gat aaagt ct at gaccaccg agccaccgcg t caggct ccc t t t acagt at at cat t t t ac ggt ccgt cct cagt ct ggat aagccgggca t aat t at gcg ccagcgt at c t caggaagt g t gt t at t gcc at t at cat t a agt at at agc ct aat at at g gaact ggcag gt ct t act t c cacgccgaac ccacgcgt ct t gcggat caa t ccgcacct c 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 tggcaaccgg gtgaaggtta tctctatgaa ctgtgcgtca cagccaaaag ccagacagag Page 500 12689250 Sequence Listing.txt t gt gat at ct ct gat t aacc cgt ggcaaag ggggccaact gat gaacat g ggcat t ggt t ggggaaact c cacccaagcg cgggaat at t acct gcgt ca gt gct gt gcc gagaaggt ac at caccgaat agt gaagagt gccgt cgt cg cgcgt t ggcg t t t ct gct gc aaacaat gaa t ccccgaat t gt t gccggt c at t aacat gt t t at acat t t cgcgcggt gt t ggt gt at cg agat at aat a acggt cacgc ggct t ggcgt t ggaat agaa at aaacgacc gcact gat t t gat at t cat t tgagcccaga t ct gcaccat acccgct t cg acaaaccgtt gat t cgat aa cct accgt ac gcat cgt ggt tcgaagcggg agcaagcgca t ggt gat gt g t cgcgccact at gt aat gt t t gaaccgt t a tggaaaaaga acggcgt gga at cagt gt gc gt gaacaggt gt aacaagaa aaaaacgct g t caacaact c t ccccgat cg t t gcgat gat aat gcat gac aat acgcgat cat ct at gt t agat t ggt t a ggaagcaagg gcat t ccgt t gt cagcgt at acagaat acc aaat t agt ag gaaaaat ct c aat ct t at ct acgacgcccg cgt caaccac cgt cggcat c ct act t t act cgt gct gat g ct cgcat t ac gat t gat gaa caacaagccg ct t acaggcg gagt at t gcc ggcggaagca ct gcgacgct t t acggat gg act t ct ggcc t acgt t agcc at ggct ggat at ggaat t t c agggat ct t c gact ggcat g t cct ggcgca t t caaacat t t at cat at aa gt t at t t at g agaaaacaaa act agat cgg t gaaat t cag ct at t t at cc ct t gct gt aa ct at t caaaa cgcgaaat t c aaaaat aaaa aat at aaaca agt t t ct caa gccgacat cc t acat cgaga cggt cagt gg ggct t t ggt c gt gcacgacc cct t acgct g act gct gct g aaagaact gt at t aaagagc aacgaaccgg acgcgt aaac cacaccgat a t at gt ccaaa tggcaggaga gggct gcact at gt at cacc gccgat t t t g act cgcgacc aact t cggt g ccat cgt cgg t ggcaat aaa t t t ct gt t ga agat gggt t t at at agcgcg gaat t gggt a at gct agt gt at t t ct gaaa agcgt t gt t t gt cgt t aat g aggcccggtt act gact cgg aagacggcca aaaaat t cat gccgt gccac caagcacggt cagtgaaggg cgaacagttc gt cat gaaga acgcat t aat aagagat gct t cggct t t aa acagcgaaga t gat agcgcg at acccgt cc tcgacccgac ccat cagcga gcggcgattt aact gcat ca caat gt acac gcgt ct t t ga cgacct cgca gcaaaccgaa aaaaaccgca ct acagcct c gt t t ct t aag at t acgt t aa t t at gat t ag caaact agga ct ggcgcgcc aat gt at t gg aggcgaaat g ggt acact t t gct gcggat c gccat gt cct at act t acgt caagaaaaaa at ct t ccaca cgaggcggac caact t ccgt t gcggact t g ggact ggat t cgact gggca cct ct ct t t a ggcagt caac tgacaaaaac gcaaggt gca gcgt ccgat c t ct ct t t gat ggaaacggca gccgat t at c cgacat gt gg t cgcgt cagc aggcat at t g gt cggcggct gcagggaggc gggaat t aga at t gaat cct gcat gt aat a agt cccgcaa t aaat t at cg gaat t cgat t t aat t t ggga cgt caccgcg t gact agcga aagaaaaagt acacgccgaa cacgt ct t gc ccaaaacacc cgt ggat cca at gccggcgg accgagccgc 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960 4020 4080 4140 4200 4260 4320 4380 4440 4500 4560 4620 4680 4740 4800 4860 aggaaccgca ggagtggacg gacgacctcg tccgtctgcg ggagcgctat ccctggctcg Page 501 12689250 Sequence Listing.txt tcgccgaggt ggacggcgag gtcgccggca tcgcctacgc gggcccctgg aaggcacgca acgcct acga gact gggct c gcgt ggt cgc gat at gcccc t gggt t t ct g ccgagat ct g aat caccagt gat aagggaa t agt at gt at aaat ccagt g ccat ggt gat ccgcacgcgt gt gcccgggc t t t gt t t aca cggcacaaaa ct t at cat cg ggt at ggct g t t ct ggat aa ct gt t gacaa cacacaggaa aggt agt t gg gct ccgcagt ccgt aaggct ct t cccct gg acat cat t cc at gacat t ct tgacaaaagc at ccggt t cc cgccgcccga acagcgcagt gcct gccggc aagat cgct t ct ggacggcc cacgct ct ac t gt cat cggg ccgcggcat g gcagct ggac at ct cacgcg ct ct ct ct ac t t agggt t ct t t gt at t t gt ggt accgagc cact gcaggc gggcccgttt ggccagcat g ccacaat at a t caccact cg act gcacggt t gcaggt cgt t gt t t t t t gc t t aat cat cc acagaccat g cgt cat cgag ggat ggcggc t gat gaaaca agagagcgag gt ggcgt t at t gcaggt at c aagagaacat t gaacaggat ct gggct ggc aaccggcaaa ccagt at cag ggcct cgcgc gagt cgaccg acccacct gc ct gcccaacg ct gcgggcgg t t cagcct gc t ct aggat cc aaat ct at ct t at agggt t t aaaat act t c t cgaat t cga at gcaagct t aaacct cgag gccgt at ccg t cct gccacc at acaggcag gcaccaat gc aaat cact gc gccgacat ca ggct cgt at a agggaagcgt cgccat ct cg ct gaagccac acgcggcgag at t ct ccgcg ccagct aagc ttcgagccag agcgt t gcct ct at t t gagg gat gagcgaa at cgcgccga cccgt cat ac gcagat cagt t gt acgt ct c t gaagt ccct acccgagcgt ccggct t caa cggt accgcc t ct agagt cg ct ct ct at aa cgct cat gt g t at caat aaa gct cggt acc cgt acgt t aa agat ct gct a caat gt gt t a agccagccaa cccat cagaa t t ct ggcgt c at aat t cgt g t aacggt t ct at gt gt ggaa t gat cgccga aaccgacgtt acagt gat at ct t t gat caa ct gt agaagt gcgaact gca ccacgat cga t ggt aggt cc cgct aaat ga at gt agt gct aggat gt cgc t t gaagct ag t ggaagaat t cccccgccac ggaggcacag gcgcat gcac gcacgggaac ccgt ccggt c acct gcaggc t aat gt gt ga t t gagcat at at t t ct aat t cggggat cct t t aat t cgaa gccct gcagg t t aagt t gt c cagct ccccg t t aat t ct ca aggcagccat t cgct caagg ggcaaat at t t t gt gagcgg agt at cgact gct ggccgt a t gat t t gct g cgacct t t t g caccat t gt t at t t ggagaa cat t gat ct g agcggcggag aacct t aacg t acgt t gt cc t gccgact gg gcaggct t at t gt t cact ac cagcggacgg ggct t caaga gaggcgct cg t ggcat gacg ct gcccgt ca at gccgct ga gt agt t ccca aagaaaccct cct aaaacca ct agagt cga tccggagcgg aaat t t accg t aagcgt caa accggcagct t gt t t gacag cggaagct gt cgcact cccg ct gaaat gag at aacaat t t caact at cag cat t t gt acg gt t acggt ga gaaact t cgg gt gcacgacg tggcagcgca gct at ct t gc gaact ct t t g ct at ggaact cgcat t t ggt gcaat ggagc ct t ggacaag gt gaaaggcg 4920 4980 5040 5100 5160 5220 5280 5340 5400 5460 5520 5580 5640 5700 5760 5820 5880 5940 6000 6060 6120 6180 6240 6300 6360 6420 6480 6540 6600 6660 6720 6780 6840 6900 agat caccaa agt agt cggc aaat aaagct ct agt ggat c t ccgt acccc cgggggat ct Page 502 12689250 Sequence Listing.txt ggct cgcggc agct gt aacc aaat act t gg t t agct ggag t t cagat t t t cgct at gcgg agccgacagc t gat ct aaat t gat at t cca act cgagct a acgcat t cgg t t t t t gt t t t cggt gaggt g ccgcgacgtt t t at gacagg gt caagccct gcaggt t t cg ggcgt gagcg ct ggt ggaga cgccccggt g ccggcagccg t t cgt t ccga gt t t t ccgt c gacgggcacg ct ggt act ga ggagacaagc cgagccgat g acgcacgt t g gagggt gaag t acat cgaga gacgt gct ga t accgcct gg tacgaacgca ggacgcacga t cgaagcgt t t t cgcat t t t at gat t gt ac agat t gaaag cat ct t at t a acccagt t ca t t aggt cgt g at cat aat t a ggagcaagt g gt t gcct t gc act gact gga aaact t acgg ct at cgcgcg agt at agat g caact gat aa cacgggggga gt cgcaaacc agt t gaaggc aat cgt ggca gt gcgccgt c t gct ct at ga t gt cgaagcg t agaggt t t c t ggcggt t t c ccggccgcgt gcggaaagca ccat gcagcg cct t gat t ag t cgagct agc cggt t caccc cacgccgcgc gt ggcagcgc cgccggggcg t cact t gt aa t gt cat ccgc at cct t cacg gt gagccgt t t t gaat acct caagagt act aagat gggct t cagt ggcga at t t t at cgc gcgt gcgccc cact t aat ct caggt gagt t agcaact t ct t t ct cat t t t aaacagcaag cgat ggcagc at ccggcccg cgcgcaggcc agcggccgct gat t aggaag cgt gggcacc tgaccgacga cgcagggccg ccat ct aacc gt t ccgt cca gaaagacgac tacgaagaag ccgct acaag t gat t ggat g cgat t act t t cgcaggcaag cggagagt t c agaccat agg caacgat t ga ggt cagccgc t gaaaat t t c gaaacacgtt t acgat ccac ct ct t ccgcg cgagat cgt t ccgcct t gag t aagccgt t c caacgt t gt c caggcaacgt caat ct t ct C cat t gccagt gaggct gcgc aggt gccggt ct gagccaat gt acaaat cg gcccagcggc gat cgaat cc ccgcccaagg cgcgat agt c gct ggcgagg gccggcat gg gaat ccat ga cacgt t gcgg ct ggt agaaa gccaagaacg at cgt aaaga taccgcgaga t t gat cgat c gcagaagcca aagaagt t ct cgat ct cct a aat caat agt gaat t t t t gt aat t ct gacg t caagcgct g ct t ct t gt cg gcct t caaag acggt cgat g cgt aat ct gg gagacggat a agt at cagag cgct ccaaag cgct t gat gt ct cgcgt t t t cgagt acgcg cgcaaact t g t at t t ct t t g t cccagat cc gcgcggcgct aacgcat cga gcaaagaat c gcgacgagca gcagcat cat t gat ccgct a ccagt gt gt g accgat accg acgt act caa cct gcat t cg gccgcct ggt gcgaaaccgg tcacagaagg ccggcat cgg gat ggt t gt t gt t t caccgt cat aaaat t g aact gcccat tgaacaaggg at gacgacgt tgaccgcggt t cgt ggt t gt cggcaaagt c aagt t gt t gc agt t t ct agc accgacggt c ccgaagct gg tagagaaacc acgaggaggt aggcagat cc acgcggacgt ccgaggaat c gggt gat gac ggcagaagca ccggcaaccg accagatttt ggacgt ggcc cgagct t cca ggat t acgac ggaagggaag gt t ct gccgg gt t aaacacc gacggt at cc gcggccggag caagaacccg ccgt t t t ct c caagacgat c gcgcaagct g 6960 7020 7080 7140 7200 7260 7320 7380 7440 7500 7560 7620 7680 7740 7800 7860 7920 7980 8040 8100 8160 8220 8280 8340 8400 8460 8520 8580 8640 8700 8760 8820 8880 8940 atcgggtcaa atgacctgcc ggagtacgat ttgaaggagg aggcggggca ggctggcccg Page 503 12689250 Sequence Listing.txt at cct agt ca acggagcaga cct gt ggat a at t gggaacc gagaaaaaag cgcct ggcct accct t cggt ggccgct caa ccgt cgccac t accaggcct gct t t gt t gt gcgt t gt cgg caaagccgcc at t ct gat t a t at caat acc agt t ccat ag t acaacct at t gacgact ga gggagaggcg t cggt cgt t c acagaat cag aaccgt aaaa cacaaaaat c gcgt t t cccc t acct gt ccg t at ct cagt t cagcccgacc gact t at cgc ggt gct acag ggt at ct gcg ggcaaacaaa agaaaaaaag aacgaaaact t gcgct accg t gct agggca gcacgt acat caaagccgt a gcgat t t t t c gt gcat aact cgct gcgct c aaat ggct gg tcgaccgccg gaat cgcccc aggt ggacca gaagat gcgt gt cccgt caa gaaaaact ca at at t t t t ga gat ggcaaga t aat t t cccc at ccggt gag gt t t gcgt at ggct gcggcg gggat aacgc aggccgcgtt gacgct caag ct ggaagct c cct t t ct ccc cggt gt aggt gct gcgcct t cact ggcagc agt t ct t gaa ct ct gct gaa ccaccgct gg gat ct caaga cacgt t aagg caacct gat c aat t gccct a tgggaaccca cat t gggaac cgcct aaaac gt ct ggccag cct acgcccc cct acggcca gcgct gaggt at cat ccagc gt t ggt gat t gat ct gat cc gt cagcgt aa t cgagcat ca aaaagccgtt t cct ggt at c t cgt caaaaa aat ggcaaaa t gggcgct ct agcggt at ca aggaaagaac gct ggcgt t t t cagaggt gg cct cgt gcgc ttcgggaagc cgt t cgct cc at ccggt aac agccact ggt gt ggt ggcct gccagt t acc t agcggt ggt agat cct t t g gat t t t ggt c gagggcgaag gcaggggaaa aagccgt aca cggt cacaca t ct t t aaaac cgcacagccg gccgct t cgc ggcaat ct ac ct gcct cgt g cagaaagt ga t t gaact t t t t t caact cag t gct ct gcca aat gaaact g t ct gt aat ga ggt ct gcgat t aaggt t at c gct ct gcat t t ccgct t cct gct cact caa at gt gagcaa t t ccat aggc cgaaacccga t ct cct gt t c gt ggcgct t t aagct gggct t at cgt ct t g aacaggat t a aact acggct ttcggaaaaa t t t t t t gt t t at ct t t t ct a at gagat t at cat ccgccgg t t cct aat gt aaggt cgaaa ttgggaaccg t gt aagt gac t t at t aaaac aagagct gca gt cggcct at cagggcgcgg aagaaggt gt gggagccacg gct t t gccac caaaagt t cg gt gt t acaac caat t t at t c aggagaaaac t ccgact cgt aagt gagaaa aat gaat cgg cgct cact ga aggcggt aat aaggccagca tccgcccccc caggact at a cgaccct gcc ct cat agct c gt gt gcacga agt ccaaccc gcagagcgag acact agaag gagt t ggt ag gcaagcagca cggggt ct ga caaaaaggat aggt ct ct t t gaacccgt ac t gat at aaaa t ct t aaaacc aaaagcgcct cgcggccgct acaagccgcg t gct gact ca gt t gat gaga ggaacggt ct at t t at t caa caat t aacca at at caggat tcaccgaggc ccaacat caa t caccat gag ccaacgcgcg ct cgct gcgc acggt t at cc aaaggccagg tgacgagcat aagat accag gct t accgga acgct gt agg accccccgtt ggt aagacac gt at gt aggc aacagt at t t ct ct t gat cc gat t acgcgc cgct cagt gg ct t cacct ag 9000 9060 9120 9180 9240 9300 9360 9420 9480 9540 9600 9660 9720 9780 9840 9900 9960 10020 10080 10140 10200 10260 10320 10380 10440 10500 10560 10620 10680 10740 10800 10860 10920 10980 atccttttga tccggaatta attcctgtgg ttggcatgca catacaaatg gacgaacgga Page 504 t aaacct t t t gt t t acccgc aat ct gat ca acaagccgtt 12689250 Sequence Listing.txt cacgcccttt taaatatccg attattctaa taaacgctct tttctcttag caatatatcc tgtcaaacac tgatagttta aactgaaggc gggaaacgac tgagcggaga attaagggag tcacgttatg acccccgccg atgacgcggg t t acgt t t gg 11040 11100 11160 11181 <210> <211> <212> <213> <223> 582 2001 DNA Arabi dopsi s t hal i ana artificial sequence GUS gene W th intron (GIG) <400> 582 at ggt ccgt c t t cagt ct gg gaaagccggg cgt aat t at g ggccagcgt a aat caggaag t at gt t at t g t aat t at cat gt agt at at a t t ct aat at a ct gaact ggc cagt ct t act accacgccga aaccacgcgt gat gcggat c aat ccgcacc agccagacag ggcgaacagt gat gcggact at ggact gga ct cgact ggg aacct ct ct t gaggcagt ca cgt gacaaaa ccgcaaggt g ct gt agaaac at cgcgaaaa caat t gct gt cgggcaacgt t cgt gct gcg t gat ggagca ccgggaaaag t aat t agt ag gcaat t gct t t gaccaaaat agact at ccc t ccat gat t t acacct gggt ct gt t gact g aacaggt ggt t ct ggcaacc agt gt gat at t cct gat t aa t gcgt ggcaa ttggggccaa cagat gaaca t aggcat t gg acggggaaac accacccaag cacgggaat a cccaacccgt ct gt ggaat t gccaggcagt ct ggt at cag t t t cgat gcg tcagggcggc t gt acgt aag t aat at aat a t t ct gt agt t t t gt t gat gt gccgggaat g ct t t aact at ggacgat at c gcaggt ggt g t gcaact gga gggt gaaggt ct acccgct t ccacaaaccg aggat t cgat ct cct accgt t ggcat cgt g tttcgaagcg tcagcaagcg cgt ggt gat g tttcgcgcca gaaat caaaa gat cagcgt t t t t aacgat c cgcgaagt ct gt cact cat t t at acgccat t t t ct gct t c t t t caaat at t at aagt gt g gcaggt at ca gt gat t accg gccggaat cc accgt ggt ga gccaat ggt g caaggcact a t at ct ct at g cgcgt cggca t t ct act t t a aacgt gct ga acct cgcat t gt gat t gat g ggcaacaagc cact t acagg t ggagt at t g ct ggcggaag aact cgacgg ggt gggaaag agt t cgccga t t at accgaa acggcaaagt ttgaagccga t acct t t gat ttttttcaaa t at at t t t aa ccgt t t gt gt acgaaaacgg at cgcagcgt cgcat gt cgc at gt cagcgt gcgggacttt aact gt gcgt t ccggt cagt ct ggct t t gg t ggt gcacga accct t acgc aaact gct gc cgaaagaact cgat t aaaga ccaacgaacc caacgcgt aa cct gt gggca cgcgt t acaa t gcagat at t aggt t gggca gt gggt caat t gt cacgccg at at at at aa at aaaagaat t t t at aact t gaacaacgaa caagaaaaag aat gct ct ac gcaagact gt t gaact gcgt gcaagt ggt g cacagccaaa ggcagt gaag t cgt cat gaa ccacgcat t a t gaagagat g t gt cggct t t gt acagcgaa gct gat agcg ggat acccgt act cgacccg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 acgcgt ccga t cacct gcgt caat gt aat g t t ct gcgacg ct cacaccga t accat cagc Page 505 gat ct ct t t g ttggaaacgg cagccgat t a accgacat gt gat cgcgt ca caaggcat at aagtcggcgg cagcagggag at gt gct gt g cagagaaggt t cat caccga ggagtgaaga gcgccgtcgt t gcgcgt t gg ct t t t ct gct gcaaacaat g 12689250 Sequence cctgaaccgt tat t acggat actggaaaaa gaacttctgg atacggcgtg gatacgt t ag gtatcagtgt gcatggctgg cggtgaacag gtatggaatt cggtaacaag aaagggatct gcaaaaacgc tggactggca Li st i ng. txt ggt at gt cca cctggcagga ccgggctgca at at gt at ca t cgccgat t t t cact cgcga t gaact t cgg aagcggcgat gaaact gcat ct caat gt ac ccgcgt ct t t t gcgacct cg ccgcaaaccg tgaaaaaccg 1620 1680 1740 1800 1860 1920 1980 2001 <210> <211> <212> <213> <223> 583 1721 DNA Arabidopsis thal i ana artificial sequence Arabidopsis Ubiquiti n3 prorroter plus intron (Ubq3 (AT)) <400> 583 at t t ggagcc aacagagtag aaaat cct ga gaggagaaat cct cat at at cggt t caaca aaatt caagg at t aacaaca aaaaaaacat aaaaacatac aat t aggt t t ggtagataag ttagagtaga at cat aat ca at at t at gat aat ct t t t t t ttgaggaaaa at gacct aat acggtggaaa t caact gat a agacggtttt aagt ct cat a taagaacaga acat ctt at t t t aagct ct t t ct t ct t ct a t t t t t t t t gt cccaactgtt acaacaaaaa agat t t t at c agat ct t ct a agagt t t t gg t t t cct t at t t t agaat ct t acat gaaat t tgct t at t ct gaact t t t t c gt at at acaa ccaaccacca at at gacacg t ct t cct t t t gggct t t t gg aacgccatt g gaagagagag ttagcaaaga ggact t gtga t gtt acct ga t t t gagt t at ttttttttta aagataaaga atgaaaaaaa at t at t aact aat t aaacca t t aat t agt c ttatgccaag aaaagaaaaa taatgggttg ct t at t gat t aaagaaaaat ccat aggat g t at cat at ga t t t gt t t t gg t t t gcgat at tggaagaaag agtgtgagat gaaagagtt c at t gt t ccgc aaaccggcat tatctgggct agaagt t gct aaat aat aac gagaaaagaa tttct t aaaa aaaagat t gt aat ggt agat t at t gat aaa t ct cat at at ggt t aaccaa aaatt ctt ct agaaaaatgt t t t ctacttg ttcct t cct t ct aaagat at aaagaagacc t ct t gagt t g acat gaat t g cgagt ct gt a ct ct t gaat a t t aat ct cgc t aat aacgca gtt aaaaaaa aat t act t t a at aaaaact t at t aggt cct t ct aaaaaat actttttttt t t aaat caag agt at t agt a gacat agt ct at agaaaaga cagt gaagca agt cggt ct t t agt t t cgt g t t t at t ctca t t cgt gt gga gtggtaatgt tcgggcaaca gcagaagagt ct t ct t caat gggt t t attc ggcct gaaat aaaaaaggga at t gt agact ggatcaaaaa ttttcccaac act caaat t t ct t t t ct t t a aagat aaact ttctctatat taatggaaag aagaaatt at gatgtaatgg ttaaaaacgc at aat aat cc ttaatagaaa agat aat aat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 506 t cat cct t t c aat ct ct cgc at ct gat ct c aat at t gt t c t cgt agt gt t gat ct at t ca t agat ccgt t t t cgct gat t <210> 584 <211> 39 <212> DNA <213> Arti <400> 584 ggccagtgaa <210> 585 <211> 39 <212> DNA <213> Arti <400> 585 ggccagtgaa <210> 586 <211> 11 <212> DNA <213> Arti <400> 586 t ggtt cggac <210> 587 <211> 26 <212> DNA <213> Arti <400> 587 agactt cact <210> 588 <211> 24 <212> DNA <213> Arti <400> 588 gt gt ggaaat gtct t t t t ct gact ct ct ct caat t t t t gt gt t t cgt caa t acat ct gt g at t t t t gtgt t ct ct t t ggt ggt t t ct act 12689250 Sequence gact ct t caa t ct ct cccaa ttcaaggtat attttctgat tatgtggatt attgaatctt tccagcttct aaattttgtc taatttcttg cttgattgtg tttctttgtt cgattctctc gt t gt t t t ga t t t ct ct t ac tgttctattg ttttatttca Li st i ng. t xt agcct aaagc t ct t t t t gt t t t gt at aaat ct gat t act a aaat t aggat t gt t t t aggt ggct t t t gat gat ct ct gca t t t gat t cgt t gct t t t gac agat at cgat tttcaaggac t t ct t at gt t t t ggt at at g 1320 1380 1440 1500 1560 1620 1680 1721 ficial Sequence t t gt aat acg act cact at a gggaggcgg ficial Sequence ttgtaatacg actcactata gggaggcgg ficial Sequence ficial Sequence gcaacatggt gcccac ficial Sequence gacacagatt gtga ficial Sequence <210> <211> <212> <213> <400> 589 19 DNA Art i 589 Page 507 12689250 Sequence Listing.txt agacgggtgc aatgaaacg <210> 590 <211> 28 <212> DNA <213> Arti <400> 590 cgcgaacaag ficial Sequence aactgtgctc ctatcatg <210> 591 <211> 19 <212> DNA <213> Artificial Se <400> 591 gccgtgagct ccgt t ctct <210> 592 <211> 17 <212> DNA <213> Artificial Se <400> 592 tcgtgccatg ccaatcg !quence !quence <210> 598 <211> 2000 <212> DNA <213> Cryza sat i va <400> 598 gaggaagggg tgttcaccat t gt accagt a t gt gt t t agc tcagaacccc gcagaaattg tgtggcccaa t ctt gctt ga t ct t gt at t a t gct t gt agg gtttgaggtc atgtaatttt acatatgcat gctttgttgc ttgaagattg tgagtgacta gccatgaaac tatgcgatga cacagt t cac gctgtctggg atttgtatgc tttacaacac gtttaaagct gtattttaat aat t t aacaa ct t at at at g ttaaggaaaa cagt gat gat tactaattct gtatgttttt t ccat att ga t at agat t t g aaaggggaga atctgt t t gt t t cagct at g caaatgacga aatgaagtgg gaat aggt gt at act t gat a act gt t aact tgaccaagaa gt act t t cgc atggat t t gc ttgtaaaaaa gct gcat agc t t ct agt gt c ct t t at at gc t ct t ggt t cc agaaggtgac ct t t ggt t ga ct cacat aat acct gt t t ct t aggct ct ag aagact gt t c ctt cagacct ct at aaaat c acct ct t t gg tagt t aggct tt cacat gaa cat t at t t t a agt t agt tag at t agt t ct c aaaaat gt cc ttctgt t gt g gt ggct acat ct ct gat gcc aaaat at t ct gt t t t gggcc gt gt at aat t ggat acaat g cct t t t t t ag ct ggt cat ga ct t aaaat ct t aaat cat ga gaaat gt gt t act t aggt gc t act cct ct a aaagt caaat at aagaaaga at gcagaat g ct aggggat c aat at gat t t t gt gt at t gg caagt caact t gt cat gt at t t gct t agt a gt t t t gt gat gccat gt t ca ct t t ct gcat aagaaaaaaa t aggt gt cat t aat gagaga agt aagcagt acagt aat t a t aaat t t ct t acagaat cac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 508 12689250 Sequence Listing.txt cgtttttgcc gcgcgagagg ctctttaagc agcagcagta ttttcagaac acacgt at ct ccgcct ccag t t gt t t t ct c gccgt ggggt ct ct cat t t c aagt t gcaca gat t t ct gat t acagt at ct t cat at agcc aacagt t t ca t t t gt gt gat at act t cat a ccat aat t ca at t at ggagg aacagct gca t t at ccaaaa gcact t gt t t gaaagggcgt ct t gt t t at g t ggt ct t gt t gt acaacat g agt act t cag aat aaat act ttaagcaagc at t aat t t ct t t gt agggct ggct ggt gt t ct t gcct at t t agt ct t aca t act acaaca gggcaat gt c gaaat gcaaa ct t agccact at t gcaaaga t at gat gt gg at agt gagt c t acgt t caga t ct cat ggag cat aggggt g gt t gaagt t a t gt t gt caag acat t ct t ac cat ggt at at ggt agcct aa cat gt caaat t t gccaat ag gt t t ggt t ac caagt gaat c t t t cat aaaa gcaacaagtt t cacct ccgt t ggggt t ct t cagt gacat g tcgggaaaaa at agct t caa act ct t gt t a at gat ggt ag ct cacct t t t act gt gt t gt t ct gcat gt t t cagt t aaat gagt ccggag acat gat t t t agcat aggct caaact act a ct t gaccct t t gccat t ccc t t aat t gat t t t t ct t gt t t ggagt gaacg ggct t cct at t at t act gt t t at t ggaat a t ct ccct t t c t t gt at gcag t t t gt t ct gg ct ccaaccga ggaaaaaaaa at t t ggaacc gccgt at gaa gt act gt t t g agcacaat ca ttgaccaagc ct cgct ct t g cct gcgt t gc t at aaacagg acat ggt t t c ggt gt ct t t t ggt aact gt g t cagt t t gt t t cagcagt at caagt ccat g agcact t t ag gt ggcat ct g ggacaccaat aaagat t aga at t gt t agaa t t cat t ggt c t at t gt gcat 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2000 <210> 599 <211> 1500 <212> DNA <213> Or-yza sat i va <400> 599 atgctccacc cccgaaaaat cct ct cgcgt cct caaggt a ggacggatgc agatctttgt t cct ct gaca ccat cgat aa gaccagcagc gtctcatctt tacaacatcc agaaggagtc at ct t t gt ca agact ct gac atcgacaacg tcaaggccaa ctcatcttcg ccggcaagca aaggagtcca ccctccacct acactgaccg gcaagaccat aaggccaaga tccaggacaa t t ct ccccaa t ct cgcgagg ct ct cgt cgt cgaat cgaat cgct gct t ct gaagacat t g t gt caaggct cgct ggcaag gaccct t cac cggcaagact gat ccaggac gct ggaggat t gt cct ccgc caccct cgag ggagggcatt cct ct cct cg accggcaaga aagat ccaag cagct ggagg ct t gt cct cc at caccct t g aaagagggca ggcaggaccc ct ccgt ggt g gt ggaat ct t cccccggacc ct t cgt t t cg ct at caccct at aaggaggg at ggcaggac gcct ccgt gg aggt ggagt c tccccccaga t t gct gact a gcat gcagat ct gacaccat agcagcgt ct at t cgat t t c cgaggt ggag cat ccccccg cct t gct gac t ggcat gcag t t ct gacacc ccagcagcgt caacat ccag ct t t gt caag cgacaacgt c cat ct t t gcc 120 180 240 300 360 420 480 540 600 660 720 ggcaagcagc ttgaggacgg caggaccctt gctgactaca acatccagaa ggagtcaacg Page 509 12689250 Sequence List i ng.txt ct t cacct t g aagaccat ca c ag gac aag g gaggat ggca ct ccgcct t c ct ggaggt tg ggcat ccccc accct cgccg ggt ggt at gc agct ccgaca g ac cag cag c t acaacat cc t cct ccgt ct ccct cgaggt agggcat tcc ggaccct tgc gt ggt ggt at agagct cgga c ag ac cag ca act acaacat agat ct t cgt ccat tgat aa gt ct gat ct t agaaggagt c c ag g ggag gc ggagt ct t ct ccc g gac cag t gact acaac gcagat ctt t caccat cgac gcgt ct cat c cc agaag gag gaagacctt g t gt gaaggcc cgct ggcaag caccct ccac at gcaaat ct gat accat cg cagcgcct ca at ccagaagg gt caagaccc aacgt caagg t tcgccggca t ct accct cc act gggaaga aagat ccagg cagct ggagg ct ggt gct cc t cgt gaagac t ct gaccggc acaat gt caa tct t tgct gg agt ccaccct t cacaggcaa ccaagat cca agcagct cga acct ggt gct ccat cact t t ac aag gagg g at ggacgcac gcct ccgt gg ggccaagat c caagcagct g ccacct t gt g gaccat cacc ggacaaggag ggat ggccgc t cgt ct ccgt ggaggt t gag gat tccccca cct cgccgac t ggt cagt aa 840 900 960 1 020 1 080 1140 1200 1260 1320 1380 1440 1 500 <210> <211> <212> 600 499 PmT <213> O- yza sat iva <400> 600 Ibt ValI Le u Leu His Pro 03 u Se r Asn Al a Se r Phe Ar g Pr o Lys Ilie Ser Pro Ser Arg (31u Al a Leu Val Leu Al a Ser Ser 25 (31y Tyr Al aAl a Arg Phe Asp Ar g [vbt O31n Thr Leu Thr I Ie Ser Ser Pro Leu Phe Val Lys Ser Asp Thr Il e Asp Asn O3 y Lys Thr Val Lys Al a 70 Ar g Leu Ilie Ilie Lys5 Leu 03 u Val Il e 03 n As p Asp Lys5 G31y Ilie Pr o (31n (1n Phe Al a 03 y Ty r Thr Leu Al a Leu Ar g Leu 115 Lys Thr Ilie Asn Il e 0 n Lys 03 n Leu 90 O3 u Ser Thr Phe Val Lys 31u Asp G31y Ar g O31y 1y Ib t O31n 120 O31u Le u Th r 125 Ilie Hi s Leu Val 110 Leu Th r 03 y Asp Asn Val Thr Leu 03 u 130 Lvs Al a ValI 135 Lys5 Ser Ser Asp Lys IlIe 03 n O31u 31y Ilie Pr o 1 G31y Asp O31n O31n Il e Phe Al a O31y 1 65 Lys5 03 n Leu 03 u Ar g Thr Leu Al a Asp 1 Leu Arg Tyr Asn Il e 03 u Ser Thr Leu Val Leu Page 510 12689250 Sequence List i ng.txt Oy Oy [vbt On Ilie Phe Val 1 95 Leu G3 u Val 210 G3 n Asp Lys 225 G3 y Lys G3 n Lys 03 u Ser I I e Phe Val 275 Ser Ser As p 290 (31y I Ie Pr o 305 G uAs p Gy Leu Hi s Leu Thr Leu Thr 355 Il e Asp Asn 370 Asp O31n O31n 385 Thr Leu Al a Leu Arg Leu Lys Thr Ilie 435 Lys Al a Lys 450 Leu Il e Phe 465 Tyr Asn Il e O31y (1y (1n <210> 601 <211> 1283 <212> DNA <213> Ar abi Ser O31y O31u 245 Le u Th r I Ie As p Th r 325 Le u Ly s Ly s Le u Ty r 405 O31y Le u O31n O31y Ly s 485 As p 215 Pr o O31y Le u Th r As n 295 O31n Al a Le u I Ie Ly s 375 Ph e I Ie vb t ValI Ly s 455 O31n Ser Lys Thr Leu Thr 03 y Lys 200 205 Thr I Ie Asp As n Val Lys 220 Pr o As p O31n O31n Ar g Leu 235 Ar g Thr Leu Al a As p Ty r 250 Val Leu Ar g Leu Ar g O31y 265 (31y Lys Thr I Ie Thr Leu 280 285 Val Lys Al a Lys I Ie O31n 300 Ar g Leu I Ie Phe Al a O31y 315 As p Ty r As n I I e G n Lys 330 Ar g O31y O31y 'bt O31n I Ie 345 Thr Leu O31u Val O31u Ser 360 365 1 1 e 03 n As p Lys 03 u 03 y 380 Al a 03 y Lys 03 n Leu 03 u 395 O3 n Lys 03 u Ser Thr Leu 410 O31n I Ie Phe Val Lys Thr 425 O31u Ser Ser As p Thr I Ie 440 445 O31u O31y I Ie Pr o Pr o As p 460 Leu 03 u As p 03 y Ar g Thr 475 Thr Leu Hi s Leu Val Leu 490 Thr I Ie Thr Al a I Ie As n O31y 270 O31u As p Ly s G u Ph e 350 Ser I Ie As p Hi s Le u 430 As p O31n Le u Ar g Ly s Ph e I Ie 255 vb t ValI Ly s O31n Ser 335 ValI As p Pr o O31y Le u 415 Thr As n O31n Al a Le u 495 dopsi s t hal i ana <400> 601 caaat actt a ctttt gaat c cgttttttt c att gttt gat cgatt at ggg aaaggtt ct t Page 511 12689250 Sequence Listing.txt cagaaagagg t t cact agaa t gt cact ct g ccagggcat g accggagaaa accgaaggt a gacaacacca at ccgt at t c acggt ct at t gt aggat t ag gt t acggt t a gat gct t t ct ggt at aat t g cat aagggaa cct ct cat ct caggaaat ga gccgat t at g gt gat t gat g ct t cct caaa at aact at at at t t t ggggg cgt t cggat t gggaaacagg at t t gcat at agat cgt ggg aagt cggagt t ggaaaact a t aacat acgg cagacaat ct cccct at gaa gcggt t t agg t t agt act t c t ggt gagccg at accgt ct C aact t gt t at ttgagaggaa t agat at ggc tcaacaccgc ttgccaacac t gt t at t t t c aat t t ggt at taaaaaaaaa agccgcgaaa agaaaaggat ggt caagaac cgt ggt gact t gggt gt t t g t t gt ccaaaa t ggt t act cc cccct t ggac gt at cacggg t cat gt agga ggagaaaaag tgacccaaaa t gcgact cat ggt t ggt gca gat ggt aat g cgggaaacac cat ggaacgg at t gaagcct t t gt t t ggt t gct t aat cca aaa gacaat t ccg gt gaggt t ca gagt ggggaa gaagt cggag gt cagct cgt t caat ccaaa gaccacat gg gccgccgcac ct cgacaaac gt gaaat t t g agagat gagg cagat t aagg t cact t ct t c cccgagaagc ggaagt at ga aacat cact g ct agagaaag aat cct aat t t gt cgt act a at aaat gaat gagttctctc gcctttcagt aagt gt t at t t gt ct act t a ccaaagt gac gcgggt cat g cgt acggat t tttgcgagga cgct cct ct g ccggt at gca ccaaggct at cgat t aat cg at gcaat ggg cgt t gct cgg cact cgagct t aggagggat cggat at t ga ccgacgt t ag t at aagt t t t at t aagt t t t agt at gat at t t gt ggaat t t cct ct t gt c t aaat t caaa t gacagct gc cccgt act ac aggt t t cgt c t gccggt at c cat cggt gt g gggt act aag gct t ggt gcg t act at ggat t t t gct gaag acct gt cat g aaaagagacc gct t at ct ct gt accgct t t aagcat t aaa gt gt t gt t gt at at gat aag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1283 <210> 602 <211> 497 <212> DNA <213> Arabi dopsi s thali ar <220> <221> N-regi on <222> (492)..(492) <223> n any nucleotide <400> 602 tcgagcggcc gcccgggcag gta caacacttac catcctcaac gags cattgcaact atttggagtt ggt! gggagaaaac actccaactt atac t t t ct t cat a t gat gact cc gaa t gaaact cgg agctcaggct t t ct gcctcccaac ttcgccttct tctt na caggaag gaagct g gt gggt t gct gt ca gact t ca cgt ggg cagat g gt ct t cgt gt aggcgatttt ttttcgcagc t cggcct t ag aacaagacgt ct gcaggaaa ttcgaagccg Page 51' gaaagagcca t gaggacat a at t gt at gct t ct gact at a caggct gt t g act agaaaca ggt t t t gcaa aaacct gaga aat ccct ct c t t gt ct gagt t acct acgac ct t gcacct g aat ggt gt t g gct gct gct a 120 180 240 300 360 420 12689250 Sequence Listing.txt agcat gaat c gaaaccct ct gat gaaact t ct gaaagt ct ccaagat gca t cat cgt cac cagaagaagc tntgaac <210> 603 <211> 260 <212> DNA <213> Arabi dopsi s tha i ana <400> 603 cat ct cct gg ttcgaaccgg gt gact cggt caagt cccgg agcgt ggct t at t ct t cgac ct gacggt t g cacatggaag ccatggggaa gactagaagc atggcgtgag gctggttact ct gacact ct aggt t at cgt t t cgagct t t t ccaagacgg t at agccacc gcagt t t ct g catcgtcgtc gatcagtttg aaaatggcgg gagttttgtt attgatgtta ccggcggtac ctgcccgggc ggccgctcga 480 497 120 180 240 260 <210> 604 <211> 497 <212> DNA <213> Arabi dopsi s tha i ana <400> 604 tcgagcggcc gcccgggcag gtaccagcat ctttcttttt tttccttcac acacattctt atagttatag caacaaaagg agcgatacaa gaactgaacc cttctcatat gctttctcca ccagaaaacc agatattgct attgctatga taacttgata tggcgccaca atcaccaaga ctggaaaagt gaagaaaagt cgagcaataa ctcccttaat ccaggcgacg atgctcgatg ctgacttatg tatacca <210> 605 <211> 1737 <212> DNA <213> Arabi dopsi s tha i ana <400> 605 tttctcactg acaagagaga gatagagaga gattatacac tttccaacat ggcggcctct ccaggt t t ga acggcgtggt t cgt t ct t ac cctaaggcgc accagaacca aaccaccaac catgacgatg acgaagacgt gtcgagcgaa gt gt act aca aggagat gat aagaaaat cc ccgagggacg aatacacggc tgatagctgg t t ct t aaaga acact gat t a ct gt gacacc agct ccct gc t t gcccct t c gt aat gcgat gcgccaagaa gt gcgt ggag gcggat aaat acat t t t t gg aat aat ggct t cgt gagaaa t t gct t ccct gaat ggaagc t gct gcccat agcgaat agt gcaagcacat tcaggagcat acagt ggcaa t gt t ggaaag ct t at gt t ca t caagt t cac acaccat act ggaacaacaa 120 180 240 300 360 420 480 497 gaaaaaggct gt agat aat c aaacct cccg caaaccgt gt gacgagaacg aacgccgagc at cgagcgt a Page 51 ct gct t t ct a gccaat acgc ttccaggccg t ct t gaaacc agacacacaa t t gaaccgt c accct t ccat cat t at t t ac t cgt ct cgag gt ccgat t cc agccaaggtt cagcaacgcc cgt t t t ggac ggt acgt ct c 120 180 240 300 360 420 12689250 Sequence Listing.txt acagggaaac atcccttcaa ctccgaggcg cctcttaacc gtttaatgca at cacccct g gccgaat gga cagct cgt ct cgccgt aagg gt t t ccacct tttagccgaa ggt gccggaa ccat caagag ggt t t t ccgg aaacgaat ca gt t t t acct t gagt acat aa at t ct t ccca t ccggaggt g aacgt at gt g t gt t t t t ggt t at gt ggt t t t gggct agt a ggt t ggagt t ggt t t ggaag ttgcagccaa t cccgt t gca cggt cgaggt ccgagt t t gc aacagaacat ccgt gt ggcg aaggcggcgc ct gct ggt t c acat cat t t t t t cggat cat t t gt cacaac ct t t ggt aga t caacgagct t caacgct t t gaaaaaaagt cact t gacca cact t gaggt acgcaaacag agcat aagga acagt accgg gcgaat cgct at ct agagaa ct acgt t cgt gaccggat t c ttaccgcgag ggt gaagaag t ggt gt ccct t ct caacgt c caaat acgga ggct t at at g cat ccccggt t aaagaat cc cgccgaact c aaacat aaac cacaacccaa gacccgt gt g tcaagagaag t gaggt t t t g aaccgaggat gaggct aaag gt t t at aact agcact cgca gat gggt t ac aaccacggcc gt caaacggc ttcgccgcga t caaagggat ct ct gcgacg t gct t cgaag acgagcat ca caaaacggag t t cat t ggt g gacaat t t ct gccgacgaag t ccgt gat t a agacct t at a gaggt cacgg ccaaacaagt gact t gct t a gacat t ct t g at t t ggt acg gaagct gt gc tgcggaccac aacgt gaagg acgt ccct aa ccat gaaat t cgct agt ct g t caact gggg t act gcgt cg ggt cggagga agaaggaat a agt at ct aac gccggat ggt accat t t caa aaggt t ggt g cgacgccat g ct t t aaaggg t agat ggt gg at gggaagt t gt gccaaaga tgagggaaga t cgt t gaaat ttagggaaca cgcct at gat aggat ct ct t ccacgggttt agcccaat gg caccat ggac cgcggggaac at ccgccgga ct gcgggat c t ct t ccgggc t gccat ggat accagaccac t aaat ggt t g ggacaacaga gt at aagcca tcacgaggag t t acgcat at agagacat gg ct ggt gt t gg gat t gct gt t gct agaagga cgcaaaggaa t at ccct gaa t cagt t t gcg aat ct t c 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 <210> 606 <211> 1635 <212> DNA <213> Arabi dopsi s tha i ana <400> 606 gggcaaggaa aat gt cggt t t t cct ct gt t tcttgaacgt tttgaaacct tcgaaatata tcatcggaaa cttacaccaa cgcaggacac aaat gt at gg accagtggcg ct t ct ccaat cgaaagaagc agcagaggaa gtgctaaaga aggcggccgg aatgagagca actttttaca gt gacgagt g gagt ct gat g cggaagctct ttcaatcttt caagtatatc atagaggaag agtttgctac gagacaatct ccggtgaatc t cct cgt cct ct t acccct g agcttcctcc gggtccaaag tacatccgag gaat cgccgt acggat t cgt ccccgtggtc t caacgat ct t gagt gt t gt act t caaaga cat cgggat g cggtggtcga gct ct t cagc agaat aact t gt gt gt caag ttgagagagc cat t t t cact Page 514 at ct t aat ct aagct t ccca aat ct cgccg gcgat ct ct t agt cgaccag gcaccct t cg gt gaaaaagc aaact ct ct g t t agt cggaa 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt atatagtgtg tcgtatcggg tacgggataa atctctatga gtgtgatttc tttgaagctg at agagt cgt at t t ct t t cc t gaagaat aa agcctggaag agaacgatgg t at t t gt agc tcagaaaccc agaaggagag aggagacat t at at caagat cgatgggtcg t cgacagt t c ggagaat at g t gct t t act t gggaagaaaa ct at caaaaa t gaaaat t gt gagt cagct t ggat ct t gt g cggaagaat c t t t ct cggt a agagagct ct agat gct ct c agggat t ggt aagagt gat g aat caaagaa aaggt t acat t caaggct ac t gat ccaaaa cgt t gat t t t t ccagggat g ct t cgat t gg t t t t ct agcc aaat at at at at ct gct t t t ct aaa ct aaaggct g ggt aggt t ca gt agacact t act at t gt t g aagt t cacca ggagt cgct g aagaaagt gc gaagat ct aa ccaacaact c gat gt t cct g ct ct gggaaa aaaggaaaaa acaat gggt a ggat t ggcga tt ct t t caag at at ggt t t t gat t t agt t t aagcggt cat t cgact gt at t ct t t cagaa act t gat gat ct gat cat ct gcat aacact aagacgagat accaact t ca cact ct t gct cgaaaacaca acgcagat ga act acgagt t ccat act t gt agcaggagga tt ctt cacca gt t at gt aca caat aaaaat aagagaaact ct ccggt cag t gt t ct t aac cgat at gaag caaaggaat g at ggggaat g tcggacaaca ct act t t aag cccaagacag gat ct t agt c gt t t aaccct t at accgt t t t gagat ggca agccaaagag ct gaat at at ct agt t t ct a aaaaagt gat gt gt t ct ct g aacaggagat gagcat ct t a aagaagcaag at ct cggaca accgagct ga ct t ggggaca ct cgt ggt ca acaat gt ct c aacgt t t at g gacaggt t t c ggat ct ggt a t t gt t gaat t at cat caat g ggt t t ggt t c agat aat aaa gat ccagt t g 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1635 <210> <211> <212> <213> <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> <220> <221> <222> <223> 607 400 DNA Arabidopsis thal i ana Nr egi on (329)..(329) n any nucleotide Nr egi on (340)..(340) n any nucleotide Nr egi on (380)..(380) n any nucleotide Nr egi on (388)..(388) n any nucleotide Page 515 <400> 607 at acat t gaa t at ct t cgct gaagt t gt gc caagaat cag agcacacaag t cgt gt gt at at at t t ct gg aacaaaat ag gct ct t gt t c gagaagccaa t gcat t aacc t gt at ct gt t t t t acat aaa t t t t aat t gn 12689250 Sequence taataatcat catggctaag tctttgctgc tttcgacgca gtgggacatg gtcaggggtt ttgaaggagc caaacatgga acgtcccatg ttaaatctac aataagtcnc tgtcactct n ttattttncc gccggttggt Li st i ng. txt t t t gct t cca ccggcaatgg tgcggaaaca t cat gcaact cact aat ct t t gagt aact t t cat caccct tggaagcaca gt aat gcat g at gt ct t ccc t ggt gct aaa t at gacat gc 120 180 240 300 360 400 <210> <211> <212> <213> <220> <221> <222> <223> 608 738 DNA Arabidopsis thal i ana Nr egi on (586)..(586) n any nucleotide <400> 608 caactt agaa gtaggtgct c cacaaccagg gcct at gct c gggcct t acg aacatgtggg tgtggtcact aggt gt aaca aacgagaagc cgtgtatat g caaat aat ac t aat t at t at aaat gaat t t ttgt t ct t cc cacgaggagc ggagctacgc gggaaaactt ttagcgagaa acact caagt atggtggaac cat act aat g t at cagt at t aaataagagc agt t at at at tactggctat tctcgatttt taatcgtctt tgtagctctt ct cgaaagct ggtaggcgt a agaacaact a agcctggggt ggct aact ac tgtttggaga cat aat cagt aagt aat gat t caat aagga tgagat t acg caagatagcc ggt cccat gc agaggcaact agcggtgact aact acgct g aagt cagt ga t gcaact at g gt gat cat gc gcat cat at g agaat ct at t cacaagat t a agtgggacga gcagactcat tgtctggcgt cgaacacgt g gactcggat g atcctcgtgg at acacacgt caggangtat t aaat t aaaa t ctaagggt t gagggt t gca acact ct ggt ct ccgccgt g caatggagt t t gccaaagt g gaat t atgt g acataaagga caat at t t at gt t acat act 120 180 240 300 360 420 480 540 600 660 720 gtaaaatatg tggccttttt aaaagttaca taattaatta t t at agt t aa t gt ct t t c <210> 609 <211> 257 <212> DNA <213> Arabidopsis thaliana <220> <221> N region <222> <223> n any nucleotide <220> <221> N region <222> Page 516 00 12689250 Sequence Listing.txt <223> n any nucleotide <220> C) <221> N regi on <222> (69) (69) <223> n any nucleotide <220> <221> N region <222> (7 <223> n any nucleotide 0 <220> S<221> N region C1 <222> (114). (114) 00 <223> n any nucleotide C1 <220> <221> N region <222> (145)..(145) <223> n any nucleotide <220> <221> N region <222> (147)..(147) <223> n any nucleotide <220> <221> N region <222> (153)..(153) <223> n any nucleotide <220> <221> N region <222> (158)..(158) <223> n any nucleotide <220> <221> N region <222> (160)..(160) <223> n any nucleotide <400> 609 gcggccgcgt cgacgccatg ttttgcttga ttttntcnca ggaaaccata gcaagaagag ccctagttna aaaaaaaaat aaaaaaaaaa tccntaatat taaattaagg atgntctttt 120 atttttcttt cttttttctt ttccntnatt tgntgccncn tttattgata gtttgaatta 180 atccccaatg aacggatatg ttcataaaat cgaaacctat atacttatgt ttttaaaaaa 240 aaaaaaaaaa aaaaaaa 257 <210> 610 <211> 528 <212> DNA <213> Arabidopsis thaliana <220> Page 517 12689250 Sequence Listing.txt 00 <221> N regi on <222> <223> n any nucleotide 1) <220> <221> N region <222> (138). (138) S<223> n any nucleotide <220> 0\ <221> Nr egi on t- <222> (158). (158) S<223> n any nucleotide C1 <220> 00 <221> N regi on <222> (197)..(197) <223> n any nucleotide <220> <221> N region <222> (236)..(236) <223> n any nucleotide <220> <221> N region <222> (252)..(252) <223> n any nucleotide <220> <221> N region <222> (321)..(322) <223> n any nucleotide <220> <221> N region <222> (360)..(360) <223> n any nucleotide <220> <221> N region <222> (398)..(398) <223> n any nucleotide <220> <221> N region <222> (413)..(413) <223> n any nucleotide <220> <221> N region <222> (447)..(447) <223> n any nucleotide <220> <221> Nr egi on <222> (452)..(452) Page 518 12689250 Sequence Listing.txt <223> n any nucleotide <220> <221> N region <222> (456)..(456) <223> n any nucleotide <220> <221> N region <222> (476)..(476) <223> n any nucleotide <220> <221> N region <222> (479)..(479) <223> n any nucleotide <220> <221> N region <222> (488)..(489) <223> n any nucleotide <220> <221> N region <222> (503)..(504) <223> n any nucleotide <220> <221> N region <222> (511). (511) <223> n any nucleotide <220> <221> N region <222> (513)..(514) <223> n any nucleotide <220> <221> N region <222> (523)..(523) <223> n any nucleotide <220> <221> N region <222> (525)..(525) <223> n any nucleotide <400> 610 gcggccgcgt cgacgtcgac gcggccgcgc tccgcgcaaa at t acttttc cgtgcaaaaa agtccgccac caattggnta tccgactaga gcagtggaga caaactncaa gggcgtcaac tgtatggagt gnatcttctg ctgcggcgt a agcatgatga gacagaagtc nnttgcccaa t cct cagct c cct t cagaga gatgcggngg cccgaagcca tgctccagcc acctgaaaaa Page 51 ct caaat cag ct t cat cagg t gggt gat cc t aat gagt t g t ct gt acat c caacacgtga aat gant caa gccgtacaca tccggcagca ttttantact agaggaaat c t agaaaat cn 120 180 240 300 360 12689250 Sequence atatatatat atatatatat atatgttata gtaggcgntt taaaggttat tttgtttcac tggttanaat gnaacnattt atataaanna gcagaaaata gannaaccaa ngnngcctag Li st i ng. t xt ctatggattt ccncgccttt gct t cgt gga t t t t gnt t nt t cntncag 420 480 528 <210> <211> <212> <213> 611 1472 DNA Arabidopsis thal i ana <400> 611 tacgtcaggg cact gt t ct t t at t aaaaat ct t act t gt g t t t gt gt ct a t ct t t agt ag ggaagatcgg gt at ct gaac ggagcct t gt caaaggat t g gat t ggt t t a aact at gagc ggaccagcgc t cct ggt ggt aaccacggga cgt agggact cat caaggt t t ccacat t t g t gt t gat gaa cctcaaagaa ggt t gcaaag agaacgt t at t at t gaat ga agt ccaagat t cgat t at ga t t t agt ct t a t act aaat t t t t t agt t at g cat t gt aaat gat t ct t gaa t t ggtaccca t gct t gat ca aat gt t gt t g t ct agcgt ca at t act cccg gct t gcat gg ttagagagaa at aggcct t a t acat t ccac ccggaaatat ggt ggaact g t gt gt ggt gg at t cagggaa at t at t caag ggat t act gg cggccagaaa t t at cgact a at gt t ggacg t t caat t t gc at aat t t t ag ct at cat t t c t t at t t t gt c aaacaat at a agagtctgt t t ct t at t ct a aat at t gat t agaacgatat at ggt t gcgt aagacagaat gaaagagtac gagctgcaag gaat cat t ct aaggaatgt t aacaat t t ga ggagagat t c ct act ggagt aaccagtaga t t ggct ct gg tggcaggtga t gggaat at c acgcggggaa aact gt t cga t gt gccggt t t t at t act gg t t gact act t t t gat t t t gg t ct t at ct ac accaaaaaat agcaaacttt t ggt t at t gg t t t caat agt cact gaat t g ggct cgt at c cgcgt at agt at t gat agag aggct at aaa gagggcact a ggagaaaact aaat cct gca agccgggaaa agggaagt t c aagt ccggt a t at cgt ccca agaggctat t ctctggagcc act cat t gt g t t cgat t aga t cgt gt gt aa t caat gt gaa cc t at caaat t t t t t agagt t t t gccaagat t gaccat t gag act t cat aaa ttcagaaaac at t ggt aaca gctgcgaagc at gat caaag ccaact gct g gt gat cct t g ggtgcagagc gaagcgattt aaccccgaga gt agat at at ctcaaggagc ct t agcggag t t caat t t gg gaaacagcca gcagcagcgg gt ggt t t t t c tatgaagcag taagagacgt at gt at aaaa at ct agt t t t agaagt t cca gt t ct t gaat t at t t t gt gc t ccat t t t gt agaccaacat caccaatggt t t gagat gat atgcagaaga gtaacaccgg t gat gcct t c t t cat ct ct c taagcaaaac t t cat t accg t ggt cgct gg agaacaaaga gtcaaccagg act t aaccat agct t ct t gc ct gcgt t aaa ctagtggagg agaat t t gcc t cgt t ccaag agt t gcat at 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1472 <210> <211> <212> 612 1176 DNA Page 520 12689250 Sequence Listing.txt <213> Arabi dopsi s tha i ana <400> 612 ct t caact gc cgcggat cgc ct at ct ct ca ct t gagaaag t t gaat t t t c t ct t ct t ct t gt t ccaaact aacagaccac agt acagt ct ggct caagag t ccaaagat c aagcagaagc tttcagaaca cggagat gct t t gagagt ac accat gt gcc tcagcgcacc ggct t gact t cat t t t ggt c t at t t t ct t a t gcgt ct agc t gcagt agt a aaagaaaagc ttcaacagaa caaagaaaca ct t t t ggat t cagat t cgt c cgt ct acagc caagct ct ac gaat cagt ga aat ct gct at aagcat t ggc gacgagcaag gcgagaat ct t t aagct ct c ct t cat gt ga accggt cgt t t t gacgct ct gt t t t t t aag gaaagaaaga t gt cct t cct ccct gacgt a agacaacttt gat gat gt t c gat caat ct c at t cagaaga acaaaaagaa ggaat acggc agggaaaaga cgat gaagat t ct t gaagag t aaacaat t a aacaaagct g aacggaagag t cct cagt t c acacgt gt cg gccggt caat t cgt cct agg t t t cat ggac aaaacagatt t ct t cat t t g gcct t t ct t c at t t gcaaaa gagaaagacg aaat caaat c t ct t cat gga acaagaactt gacgaagacg agcgagagag ggt gat aact acct t caaag gggt t acgag aagcaaacgg aaccgt cggc t acat gcaca gt cccgccac gcgt gggct c t cct aagt ct cagat at gca aat at t ct t ct t ct t c t t cct t t act acagagtttt at ct gggt ct cat ct gt t t c acgagagttt t cat ccgagg ct ggagt at c aagaagacac ccaggaaaaa at cacagt ac caagacaagt aggt agact g tacaaaaaga tgagcccacc cacaacct ca ct gct acgag t t t t act t gc t gt agt t gt t t ct agct cag t t ct cat ct t t t t t t ct t at aagct t aggc t gt t act cct t act t ct t ca aat cgacgt g t t cacct aac agat ccacaa gct t agact t t ct caat ccg ggaagt t t gg cgagt t ct t a agt aacggaa cact act t t g ggct gct acg gat at ct cac aaccaaaggg aacat gt at g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1176 <210> 613 <211> 1592 <212> DNA <213> Arabi dopsi s tha i ana <400> 613 at accct aat t ccacat at c tacacacaag caaagagaaa atggtgacac taacaccatc tttcatgaag aataacaaca gtcacagcag cagcaaggaa gat gct ct t g tcacgaccaa catgagcatt aatccaaaac aagaggagtt tcctgaagat ccagagattg gtgtgtttgg ttcagaccaa ggt t ct agt g t t ct gt ct ct cgactcgaag cagagcgcga agaaatctac ct ggaat agt cagagcgtgt t gct t cagaa caat aaagag t t ct gct agt t ct ct at gt t gaagct cat g t ggt gat gag agct gagaag gacaaaccca t ggt act ccg caaact ggt g Page 521 agaaaaaaat accccaaaaa t ct t ct t ct t gaaccaagca aagaaaat gg t act t caat g gaagt t gaga agt gt ccggt aat agct gca at aaagct ac cat ct t t t ga ct t act t gag aaacact aaa tgaagaaagc gagacat gga gaaccgt cgt ct gaat caag acagt t cct t 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt caaggaaaag aagaacagta atggtcagat tcaaaaggtg accaataata t ct cgcaaat gaaaacct cg t gcggat at g gt ct ct t gaa gaaact acca gagt gat t ca t ct t acgagg cgt agagt gg aacaagt cct at cagcaccg ct gcaagagt caagacacaa cgaaacacga t agt cagt ga at aat t t ct a at caaat caa agagt gacaa t t ggggt gt a gt caagagaa aacacagaac gt t t t t ggat t t gcct ccat agct cggat c caaggaagt g agcat agt ga gt aagaagaa cagagacgga cat aaat ct g ccgagt t acg agaaggat ca ttgcaagaaa ct gagt gaga tttaaaaggg at gat t ggag aat gcgcct g gcgct gat cc t gat caagat ct ccagt ggc ggaaat cgag t t t t cgagat at ccagct t c cagcaagt gc accgacct ac aat cgagt ag t t at ggt t t c t t cct agat t gcaacagct c gt aaaagaaa aagat at gt a t t t ct t agag agt gaaaaca ct ct gat ggg gaat at ct ct tcagaagcaa t at t gagaag aacagaggag agagggt ct t acct acgt gt agcagat t t c t cagat t cct cagcagcgga t ggt gat t t a cccaat ggag gat t t ct cac aagat t t caa aact t at aat acct t t t gca aa gat t ct gt ag gt t at cacaa gaggagt t at aagagt agt g gacgacacaa acagggaacc t at gcgccaa t ct gt t at gt cgaat ccct a gggaat ggt t gacagaagaa act act aaac acacaat cat t ct ct gat ct aat gt ct t gt cct aagaaaa agaagagttt at gt cgacga t gagat ct t c cacagaggaa t t gt t cagaa agagt gaagg ct aaacct t t gt gaagt aag cagaat gt gc t t accgct aa t ct t gat gag gcagcat gaa ct aagagt t t ct ct t ct t t a ct ct t t at ct t gagt ccat g agt t act caa 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1592 <210> <211> <212> <213> 614 1694 DNA Arabi dopsi s t hal i ana <400> 614 aaaaagcaga gct t ct act c acaaaat ct t agggt t gct t ccgt ct aggg t aaaacacat gcgt aacggt gct t t cggt t ggaagagaat ggat t t gagc acaaagt t t c at ct ct gagc gcacaaacca gt ccccat t t cct cct ggcc cat aaat gt c tttgccccaa gacct t gagt aaagacat cg cgcgaat t ct gact t gat ga aaaat cct ct t t t gat aaca aat at gact t gaggaacgac t ct t ct t gt t cagcaaagct t t cacgat ct t ggt cgt aat gt t gt t caag gat t t ggggt t t agcgt gaa t caagaaact t t ggt ct cac agcat gt cga t t agat t ct c aat aat ggt g aat ct t cacc t ccgat cat c ct ccaagaaa t t cat caagt acct at cact t t acggt gat aaaagt t caa gaaagaat t g t gcgagt at c tcaggaaagc t gat t t t t t c agt ct t ct at aagaagat ca ggaaacct ac cacggacct g gaagcagct g at ggcct caa gaat ggagag t cct t caagt gct t cgaagc at at t cagaa at caaagaac cct act gct g ct t t t t t ct t aggagt caaa accagct cca t gat gcat ct aagaagct ct gggt t t t t t c agct gcgt aa at at t agaga aat ct ccggt ccgcct t t gg t gat gt t t ga gt ct t aaat g 120 180 240 300 360 420 480 540 600 660 720 gtttataggc tttgtgtcag gccaacataa gaggctttac aacgtcttca acagggttga Page 522 12689250 Sequence Listing.txt tacttttttt aatcatatag ttgatgatca tcactcgaag aaagcaactc aagatcgtcc t gat at ggt c gct caccgt t aagcgccat c gaaagct caa agaagat ct t cccagcagct cgacat t cct at cct ggaaa caagggacat agct at ggcg gaat at gcct t aagaaagt t gaaaaacat a t t t t gaat ct ct t t t t t t t t <210> 615 gacgct at ct gat cat ct ca acct t gat t t gacgagat cc gat aagct t c cct ct t t t ac cagaagagag aat cct gaag agct gt gagt at cgcaacca gagaagaaga cct ct t gagc t ggt at gct t aat gt t aaat at ct t agat at gat aaggagt cct gggcgat ggc gaact t gcat aat act t gaa t t cct cgaga ct ct t ct t gt agt t t aaccc t gt t accat t t t gaat t ggg aagat at gga t t ct gccagt at t gt at aaa t at at t ccca agat aat gaa ct caaat at a agagct cgt t t ggaat caaa gct t gt ggt g aacaat ggct t aat gcat gg ggagaggttt t ggct ct ggt gct ct t gaat cat ggaagaa t at t cgcat c ccagaacgct aaaaaaat aa caacaat at g t at cacgct g agaaacccgc caggaaggaa aaagaaacct gat at caaga t ct at aggac at t gat t gt c cggagaattt t t gct ct act gct ggt gat c agt t t gt aga t t t t ct t gt t t cgt t gt t ac cat ct t t caa gaat t gacac gggt aat gaa gaat cat gga t aagact aca t t caaggct a gagat ccgga ct gt ggat t a gt ccaggaat t ct t t gat t g t cact gt t ga t gagct t gga t at t t t gt t t aat cagaact 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1694 <211> <212> <213> 2896 DNA Arabi dopsi s t hal i ana <400> 615 gcggccgct a agt gaacgt a act t ct t ct t at ggagt t t t gct gt t aagc cgaagcaaga t t ccct aaag gt ct ct gacg at gact ggt t gacaccgaca t gt ggaaggc act cct at ca agcaaaaaca t t gat t cgat tggagaggcc gat ct ct t ga ct t caaagcc caagat t gcc cggat t cggg acat gt ct ag t at ccgagga ct acacagcc at t ct t ccaa agaat gaggt t gt t aaact a aggacgacca cagaaggggt at gat gct cg aagagcccct gat t t t caac t cct ct t gat gct aagat t a agacgat ggc gaggt cat cc gct caaaact t cact gt ccc ggaaat t gt t t gccaaaat c caaaaaggac gggcaat acc aaat gat aaa ccagaaggag ccat ct cct t cct t cgagt g ggcaacaaca cagaaagaac at t agt t t ca gaagaat ct a gct ct at cca at agt ct at g ggaagaaact agagat t gt g ggaact ccct at caagt t ca gcat t gaggc aaagct t t gg t gaat gat gc gaaaagagac aaggt agt ag ccgct gaat g agct gt ccag cat ct t ct ga cgt t gcagca ccagcagt gg gccggt t t ct t caagaat gg t ct ggaat ct t t gggat gca ct cat ggat t at t ccat cac ggagt cat t a acat ggat ca t agt aagt gg gggat t gt ca t gaagt ggaa at caggggcc gact t t t gt t at t ct t t acc gcaagggcca aaaaagt t ac t ct cacagt c ggt t gaagt t ct ccaaat cc agaggt t gt g 120 180 240 300 360 420 480 540 600 660 720 780 840 Page 523 12689250 Sequence Listing.txt caaact at aa gacat cgt aa gt ct caagt t caggaat ccg t gagcaat ga aagcct gat a t caaagt ct t t cgaat aaca gaagt aat ag gggat t gat c cgt ct ccct g tcacgcgagg gcgact gt cc at aaact aca gat cagaagg gagcccct cc gct ccggcaa gaaggcct gt acgt cct gga ttcaaaccaa aagggcacgg aacaaggct c ct t cct act c t gcccaggt g gact ct gcaa ggaat t gt at gt at t ggct g gct gcacct a ccaagt accc acgggt gct g at gct t t at g ttgcacaaag at caacacgt gagat aaagc cct ccaccac aagt gggaag t cct at t ct t aaagccaagg gt t ct act ac tcaggacacc ggcat gaaga ggcagaggga t agct accac at aat cccat aaat at t ggg agaagat aag ct aaaagcgg gagagct t ca aaaat cct t t caaat gt aga gggct t caca aagcaat aaa t aaaaccgt t gt gaact at a accgagcgt g t ct acgct t c gagagt t gt t ggt t ct at gc at cgagacct act t t gat t t gcaaacgaag agt caaact c gt cat acaag gt cgcacacc at ct cact t t t gt t aaat ag agcat gct t t t ggat gcacc at gat ggagt cct ct ct ct g t gt t gt t gt a acct acacct t ggacgt gt t t ct t t t acgt cagt t gggac t ct t gagcgc t at ct t t gca gagaaat t gt agacgcaat c aaagaaat t c at act t cat c gt ct gagaga t gaagct gt c ct cgaagcct aaagat ccaa gggct ct ggt t gccat gaaa cat cgaaagg ttttcagacc t gcact act t agcagaggtt gaagcct gag at cat t cat g gagat ccaaa gt t cgt agga t gct at t gat t t t caggggt ccccagcagt agat ccgagc ct t t cgcggg gt t gagcat a gct t gt gaat cct t t gat t c ct t gt agt t g ggt agacaaa t ct act ccaa at ggaacccg ct at cggat a at agagaaga t cagacagct cggt t t ct t c agagat caga t ggaact t t t ggt gt gcagc acagagat gg agagagct t c gt ct at cct c gcgagt ggag gat act ggca gcaat ggaga gaaat cat t t t ct acgcacg gacagacaac gt t at cggct aat at act gc acgacct gca agt caaccac act gaagaat t ggt gggcac aagaat aggc at ccct gt ga agt cggt t ag at caat t ggc at cgaaaaag t ct acggact t caaat t at g ct t aaat at a caagacagt c cggggt caaa aagaat t gat gagaaaggga at t t cgt cat t t ct t gaat t aggggccaga gggagat t ac t ccact t gca ttgagggaag agaact cgaa cagat gct aa t gcct cacaa acacagcggg gt gt ccat t t aaacaat gat ccct t ct gga t ct gt t t gat ct at gaaaat t agaat at ct tcaagaagga caccccagct t acccacat t acat t gcgcc t gggt at ct t aaaagacatt gt ct t gt agg gat ccaaagg ct ct cat acg at ccgaat gc t ggacat t ga caaaact ct t aacggct ct t cact at ggt a agat gaggct gct caaaagc gt t gagcaca t at acgccaa cagt gat cct gacagagt at gacagat caa t gt gcagt t g acct at gcgt t gat cat gt a agt ggt gaaa tacgcggccc t aaggagagt act acat cat ggt t gagct g gt t gaat cgt t cat ccat t c t acagact t c at t gacagaa t cact gct t a t ggacacat a t at t at t cca t gt t gcagaa t gagat aat c gt t gt at gag t gccaacat c t agacagt t g tggggcaaat aggcat gagt caaagat at a cct ct t ct aa ct t ct caat a t t cat gt t t a 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 Page 524 12689250 Sequence Listing.txt aaaaaaaaaa aaaaaa <210> 616 <211> 973 <212> DNA <213> Arabi dopsi s tha i ana 2896 <400> 616 acgaaaccaa ct cagct t cc tcggccaaga aaaccact ac at aaggacaa gaaact t ccc aacacgccaa accacat ct a aagct agat c t ct t t gct cc t gt accgcct agcgt t t agg gat gaaaccc t t at t t t t t t t t ct t t t gag gt agat cat c t aat at t t t g cggacgaaga t t t cat ccgc gct cgt cggt caagaacgat agacaaagat aact t ct caa acgcggt t cc cggct t cct c at act acggc t cct t ct t ct t ccacgt caa t cgcat gagc aat gt gcaag at cat ccat c agt gt t ggag t t gaacct t t gt t gaaact cacg cgt act cct c gat aact ccg gagagct ct g aacagcaaca gccct aggt g at gacat cat aacaaccacc ggaggggaca aacccaccga caaat act at aagagact aa at cat gt gag at t gct gt ga act gaaaaaa t t gaggt t t t at t t cat gaa ccaaagaaaa acaact t at c agaat at caa acaggagatt gacat caaaa acct t cat ca accaccgt ca tcaaacgccg caat caacgg t caaggcgt t t aaggagcct t ct cgat ct t t at at gagac agaaaaagaa cacat t ct ga cgt caacgt t agccgccatt cgcagaacct ggacaaagac cgagt gt cac cgct cacaaa t cat cagcct ct at ccgt ct t cgt act act aagt cct t t a t act cat ct t aat aact ggc cat ct ct gac t cgct gat cg aacgagaacc gt t t at at t a gaat cct t ct at t cgt ct ct t ct gat cat c aaagaaaaag t act gct t ca cgt gaacgt c cat gaccct c tggacgacgg caaggaat ac ggt t t gt gcg caccagct t c cgt acagat t t t ct t t t at t gt gaagccaa ggt t t ggt gt at aaagggaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 973 <210> 617 <211> 2240 <212> DNA <213> Arabi dopsi s tha i ana <400> 617 tgagatttct ccatttccgt agcttctggt aatcacttct tcttcttctt cttctcgatt gaat t aaaaa t ggaat ct t t at cgaat cca tctctaaagt ggaattttgt aaagagaaga gagacaaatt cgtctggaga agatctggtt aagcaacgtg aaaggtggac tgaggaagaa tatggtagag catggcagaa gattgaagaa agaagtcacg ctcagaaatt tttctccaag gct at gggt c aagcgctaga cat agct at t ct ct t t t ct t t ct t act gt t agct gat t t t t ct gaagt t g at t aagact c cat aat agat cat gt agcaa gt agagaaag cct cct ccac Page 52E t gt t t cat t g t t ct t at cca gt t t ct t t ca t gt agaggag ggaagccat a t cat t gaagc caaaaact gc aggct gaagc ggcct aagcg at caaaagca acgaaat ct g t t gaat cat c ct t agt gat g t acgat aaca t t t gaggct t t gt ccagat a t aaaggt gt a taaaccaaac 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt aatcct t atc ctcgaaagac gggaagtgga acgatcct t a tgtcaaaaac gatggaaaag cgacaacaat act cat cagt gcaagcactt agaaaggagt cct cagact t t ct ct at cac cagt cgt t t c gcaact t t cg aact cacct c tgggctgcca cat cct ccat caacatggt t t cact ggact gcaacacct g cggt cct cgt gaaaggcaag cct caaact t t ggaagt ct g ttgccgcaaa caaagat at c caagaggaga agtagaggaa agaat cct ca t t ggaaact c t t t t t aagt t t t cct t gt cc gt t t aat ct t agt ccct t gg caaagcctga at ct ct ct gc t ccgcgagt t caaactcaga at ccgat gca at cct cct t c ct aat cat at cct cat cat t cgaat ct ggc at ggat t at t ct act t t t gg ctgtgcagag cagaggatgt agagt gat gc gtggctcaaa aggatggcac cagagtccaa tgtctgacga gt t t t acat a caat ggcact agagaaacac gaacaggttt acaacaatcc aagct t ccac ttcaagacca at agt ct t cc t t t t t t t cca atcagaaaaa agagaaaact t gcat cct cc ct t gcct t ca t t t gaat gca t at ccct gt g agagccagat aat gt caacc t t ggcct ccc t gccat ggcc acct t t at gt accat cat gt ccgagagcaa t gaaaat aag aaagggt t ca cact ccgt cg caat ggt gag tgcacgccgc gggt cgaat t tcgagaagaa t gat ct t aac aggat t t ct t t aaaccat ac tat cat t cat at gagact ct ct gct acat t t gt aacat t t gt gt cgcat c ctgcaggaag at gaat aaaa cgggaagagg aaat ct ct gg ct agt gccat agt cat cccc ct t t t acaaa gat t ct agt g gcagccact g gct cct ct t a gat gt agagt gaacactccg agtaaaccag gatggagcag agt agt gat g gtgaaagaaa agt agaat ca gcct t ccaag cacagagagg t t cacagct c ggaat cggat aaaagatgt t gtggaacaga at t t t cat ct t t ct t t t t ct gact ct gt at ct gagat ggc acaact gt t c gt t gt at aga gaagtcagaa aaaacggtaa t ggggagct c acacagt t gc caccggctct gtggctcacc t t gcagct gc gt t caggt gg acacaaaagc aggcatcaaa t t t gt cat ga gagacagaaa at gt t gaggc cgaatgaaga gct ccaat at ct ct ct t ct c aagaacaaca agt t aacacc t agat gct t c ccatggaagc aagatcccaa gat ct gt t gt t t t gaggcct tat t caacaa gggt gt gaat caat gaagat agat t gt t t c gacatcaaac t aacagggt a tgagcaagga aat aacaagt aggagat t at t t at act gcc tgt t ccaggg t agt gct t gg t t t cact agt aagcact t t a ggct cgat ct gcagcct t ct acaagt t gac ggat gcat ca cact aat aaa aaccgatcca cagagaggt a acaacaagaa agt t gat gat aaagct aat g caaagaaagt acggatgcgg t t gt act ct g t t gt at t t gt at cat aaact 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2240 <210> 618 <211> 757 <212> DNA <213> Arabidopsis thaliana <400> 618 caacttagaa aaatgaattt tactggctat tctcgatttt taatcgtctt tgtagctctt gtaggtgctc t t gt t ct t cc ctcgaaagct caagatagcc cacaagat t a t ctaagggt t Page 526 120 cacaaccagg gcct at gct c gggcct t acg aacat gt ggg t gt ggt cact aggt gt aaca aacgagaagc cgt gt at at g caaat aat ac t aat t at t at t t at agt t aa cacgaggagc ggagct acgc gggaaaactt ttagcgagaa acact caagt at ggt ggaac cat act aat g t at cagt at t aaat aagagc agt t at at at t gt ct t t caa 12689250 Sequence ggt aggcgt a ggt cccat gc agaacaacta agaggcaact agcctggggt agcggtgact ggct aact ac aact acgct g tgtttggaga aagt cagt ga cat aat cagt t gcaact at g aagt aat gat gt gat cat gc t caat aagga gcat cat at g t gagat t acg agaat ct at t gtaaaatatg tggccttttt aaaaaaaaaa aaaaaaa Li st i ng. t xt agt gggacga gcagact cat t gt ct ggcgt cgaacacgt g gact cggat g at cct cgt gg at acacacgt caggaygt at t aaat t aaaa aaaagt t aca gagggt t gca acact ct ggt ct ccgccgt g caat ggagt t t gccaaagt g gaat t at gt g acat aaagga caat at t t at gt t acat act t aat t aat t a 180 240 300 360 420 480 540 600 660 720 757 <210> <211> <212> <213> 619 1944 DNA Arabi dopsi s t hal i ana <400> 619 ccacgcgt cc t gaat t cct c gt t cct ccaa agaact cggt ccct t gagaa gcgt t ccggt aaccacgagc t ggt ggat gc gct at gccga cat t ccat ca t gct t gaaga accct gaat a t ccaggt ct a gagt t at t gc accct gaaag agt at at gaa caagcggacc ggagagacgg agct gct aga gt cat t t ct c gagt t ct ct t caacat gacc t aaggaaat g at gggagaag gt t cgt cat g cat gaacgct t t ggt gggga gct t at acag at gt ggagga gat cagcaag t at ct cct t g ct cagat t t c ggaaat t caa caacgggacc at cgt cact t t cat gat gcc aacat ggaat acat ggagac at cat aacaa at caaacgt a tttgcgaaga aagt t cact c ct ccacgt t c t t accgct cg agt t t gat gg t t ggt ggaga at ggt t caaa aacgt aggag aaccct gat c ggat gt gat t at gaggagct gt aggaat gg t ggagat t cc caggcat at g ggcgagt aca agcgagt at g caact cct at agagagagaa aagat gccaa tgaagccgcc acgagaagac t ct cat accc acacagt aac ct ct gaaagg aagat ggacc agcacggt ct act ct t gcag t t gt ct acac ct gt gcct gt t ccgt gaacg gacct t gt gg ccggaat t gg ct gaat caat agaacct ccc gaaagttttt ct t cagcgaa aaaaact at g gagt t ct aga aacat at cag ct t cacgcca acact ccaag aat gt caggg agct ggt gt g t at gaat t at caaact ccag t at cccct t g agacaaat ct cct aagagga at t t gaaggc agaat t gaga agagt t ccag t gggaaaact agaagat act cat ggaat gg aggt at ct t t gaat t gacac aaccaagaaa ttccaagcaa gaaggt gaaa aacgacgct a cat t t gaaca gaaggt gt ga aact gggaag gt cgt t at gt cct ccat ggg gggagaagga agaacacct a t acat aggag t acccat cat t gct acgaca aact ggggaa gaat t t t t ca t act ccggga caaggaagcg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 527 12689250 Sequence Listing.txt atcaggaaag gtagctggaa ttcactggca ctacaacacc gagcaaagct cagct gagct ct aagat gt t gggagcaacc cgacaaggca gcgcat t cgg t t act t acct agt t t gt t aa ct ggaagt ga ct gct t t agt t gt ct gat aa aat t ct gggt at cct at gag t t ct ccaaaa aaccgct gga caacaaacat tgagcacgcg ggccggaacc acaagt ggt a aagaat gaac gaacat gaag cct t t at gt t gt aat t t ccc at aact agag cagagt caga t t t t cct t gt aaaaaaaaaa t at t acaaca ggagt t gt gc aat t gct cac gaact agcag gcaacaaat a aagcggt t at gaaggt ggt c ggat t t gt ca acat aggt ac agat caaacc gcaaagagaa acat cat ct t aagg caagaaacca t caact t cac cagaaggt ct gggagaacgc ggt cagat t c t t gagggt ca at gggaggag aaggcaagat at acat at ag agt aagagt g gcaaaat caa cat act ct t a t gacgggt at ct gcat ggag ggt caagcaa gct agaacga t ggaaat ggg aaat t ggcag act ct caaaa cgct gagaat t gt ggt gt t t t t aaagct at gat gat gt ac at ct caaat a aggt cacacg ct gccaat ag at gaaagacg gt acagaacg t at gact cga t t aaccgcat cagt t agt gg gaagacacaa gt ggaggagg at t gt at t cc agat t t gcac act t agat gt ct at gcat t t 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1944 <210> 620 <211> 483 <212> DNA <213> Arabi dopsi s tha i ana <400> 620 atggcacaag ct acgcgt ca gt ct t ct ct a aaagcatcgg caaagaagtt ccatcacatg gccactcctg acatccaggg atgtgctttg gtcatattct ggaactatgc tattgatggg gcagt ggat c aggagaagaa t ct gt t agt g gagt t caaga gct t ct t ggt cact at ccag gt ggt t aagt gt cact t aaa at at gagagg ct gct cgagt tgtttgccaa gctaaccgaa t ag caaggggaac ct ct ct ggaa cat gagggag cagcccaagg ct cagggt aa gcaaccccaa at t gacgaga t caat ggacg t t gaggt aga gaccacaaga agt t t ggcaa tggcgaggga t agat ggaga agct aagagg aggt ggct cc aaat gct t t t t at t gagat c t gt t gcgaaa agt t ggcagt gaggat cgag t ct gacgaaa accgggaagt tccggagaag gt ccgaagt c 120 180 240 300 360 420 480 483 <210> 621 <211> 924 <212> DNA <213> Arabi dopsi s tha i ana <400> 621 atggctccag caagtccacg agtggtggtg aaggtggatt tgaagaagag gccatggcag cagact cagc cact gcacaa t agat ggcat cct gagat ac cat cagt t gc agaggt caag act ggt gagt t gt t t agggt ggaaat ggt t gact ggacgg gaggt gcagt caaagaagat ggct ct gct g gcgat at t aa gagcat agac ct ct ct act g t t cat t acct gagt gggccg Page 528 120 180 240 12689250 Sequence Listing.txt at caaagt t g t ggat gagga t ggt gt t gca gct aggccag gt gat ct t ct at at gcaact gagaat ggcg tttgaaggga gacat gcat t ggat t t ct ag at gggaccaa t t ct cagagt ct agat gcaa t t cggt t act cggat agt ga aagaat t caa t gggt cct ct gt ggt t t ct t t t t at gcct a t ct cacaggg aact caagt g ccact ct cca ggct t gt ct t cagt t gcat a caaaagaaca agaagccaga gt t ct t cct c t ccaggagat gact gaccat ct ct cct caa t gat ggcgag t gaaat cat t cgt aaacccg cgagggt at c caaacgagca ggat at t cgt cgt t at gaag t t ag gaat ggggat t t ccct t gt g at t cct gaag at ct cat t ct agaaacggga at ct t t gaga agt gt cgat g gt t ct caacg ccaaagaccc agcacat at g t cact gct t c cgaccaaagc gagct aat ct gt ggagccat t gcaggagt a t agggccagt agagcggaag ct at agact a gaaaagt t cc at ggaaaact t gct gt t gag at t t gat aga t at t t ggt ac aagcaccggt agagat gagt cct t acacca ggaaccaaga acaacat t ac t ct ct t caaa aacaggagct t ccaat t aca 300 360 420 480 540 600 660 720 780 840 900 924 <210> 622 <211> 1581 <212> DNA <213> Arabi dopsi s tha i ana <400> 622 agaaagagac t t act t t gga t at ca cct at ct aca acat acat aa acct ci t gat cgat gt t agct ct gt c tccgg cttgatacgt cgtgtggatt cacgat ttcgctgacc acggt gat ct t ct t g ggagatgtgc ttcctgactt ggagat cacatgaacg cttcttcaac gatta act act aagg gt agt t cggg gaaagl gcggagacgg tgacttatga cggtg tcttccaaga acaat cggat cagt a acaccagagc tacacaggag attcgt gttccttctc gaattctgga gcttat agt cacct cc aaaaat at ag gt ct c gct aat t gga cacgcaaaag gcat at cggactaaaa at ggat ggct t gcac gt ggct gt t g caccgccacc t gt cc~ catcccacgg ttgatcagtc cattat t ct accgcca t gcct aat cc gccgt 1 aaaca t gt ct t gct c t ct at t ct aa t t aagt ct ag at ct cgaca t at a acat c cgac ct acg gt gaa acagt acaac ggag ggga at cgg ct at cggca accac gccg t t gg acgat t at at agagat ggt t aacccggagg at t gact t cg cct gagat ct acgt cggat a gaagt cgt aa gaccggaaaa gaagggaaga gcagt ggaac gt ccat t gt c aaacat t t gc ggagt agaca cccact ct cg cat cat t t t a cat gt gt ggc gt ct ccgat t t t ggat ct ac gcgacggagc aggaggagga acgat at at t t at ccgggga agact gat ag gcaaaagaga ggaagt at t c gaaaggt gaa agt t aggagt t cact cgt ca t agct cgt ga ccggt gct aa ggt t t ccacc ggcccct gca ccaaacactt ct ccct at t g aagt gaagat gt cagagt t t gt t t ccggat cggt gt ggcc t t t ct ccaat tcaaggggag cgat gt t gcg ct ct t cagct ggt ggat t gg ggacaaagct caacgt t gct ggccgaagcg t ct t aat ggt accaccaccc t gt gt gggga acct ccgcct gcat ccaaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 tttagagcac cgccagttgc cggaatcccg catgctctgc cgccgcatca cacgatgtac Page 529 aaaccaaat c agcgt ggat g t t aaat ccgc gt t cct ccga acaaaaat t c t gt t t t gacc aggagaccct t aaat t acct t t ggat t t gg cagccat agg cggct gt t ga ccgcgt ct t g gacgacat gt ct t gt t gt gt t t t cat at gt at at t t t ct C 12689250 Sequence t ggt gct cgt cctccggtag agat gt at t g acgaggccat cggt gt t at g acagagcttc tgcctgaaac gcacaagatc ct t t caat t a t t gt act t t t taagagttac tctttactta at ct agggt g ct aagt acac Li st i ng. t xt act t acat cc ggct gccact accgt cacgg cgt aggcaag at t t t t ccaa at cgt cccat t ct agct ct a gt caaaagag t ccgt t ggga t gt ct ct gag cgagaaccaa gggt t gt t t t gt gact aat a ttaacgcaaa 1200 1260 1320 1380 1440 1500 1560 1581 <210> <211> <212> <213> 623 666 DNA Arabi dopsi s t hal i ana <400> 623 at gagct cct cgccgt gt aa ttcacccgcc t cat cagcat at gccgggac agagacat ga ttgaaacagg cgccggaat g t t ct at gt ac t cgat agaca t t cat t ct t c t aat gt tcaccaccac accaaccgtt ggagaacctt tagcacagga t at cacct ga gggagat gt t ct t at ct gt c at t acgt t ca ggacgccaaa at gt t gt gga acat gaaat c caacact ccg ct ccgt cgt a gact t cact a gaaat ggggg ggat gcggcg ggaccacat g t caggat ct c ggcggcaaac ggt gt acgag gt t t ct t gct t ct ggact cg ccaccgt at c t gt t gcaccg at aacat t ca acaaggt cat gcgcgt at aa t cgt ggaggt accaacgcca gagct t gt cg t cct at ct t t t aat t cagt t t t acat t aga t cct ccggaa gagaacccca cggt cat cgg t cat aaagga aacagacggc acgt t at at t t gaacat ct t aaaacat gag act acgagaa gt at t gt at a t at aaaat t a aat t t at cat acaagacat c cggt gct acg gaaat at t t t t gaaggat t a ct acat aaga gccggagagt t gagt t ggat gacat t gaaa t at gt ct t t c aaagact t aa 120 180 240 300 360 420 480 540 600 660 666 <210> 624 <211> 1932 <212> DNA <213> Arabi dopsi s tha i ana <400> 624 atgaagcaga ggagtttctt atcaatcctc tctgtttcag cacaaacatg catagagaac gattcaaatc gtcgtcttat cctctcttct ttctactacg gttcgattgg agaagagcaa ccaagatcaa ct ccaagt ga ct gt t t t aat caggactgtg ttaaccagac agacgcgtat gtccgctact ccaacatttc tttctcagga t gt t t t at t c cgt aaat at t ct t cct aaca gat cgagt ct t gt at aaagg t at t gggcac t ct gcagct t Page 53( t t ct agcct t tcacacccaa at accgcct c acgcat t agg gt gcggcagg t t gat ccaac t t t gggagat t ggt gt t gct cggt acat ac t caagat ggt gat gt gcat c ct ggt t gat a gct gt gcct t t gagcct cag 120 180 240 300 360 420 12689250 Sequence Listing.txt tatttggtct tgaacactgc aactatcgcc tcagatctaa cggatttcaa gaacatatgg gaagact t aa agt gat aacc t t gat gcaat gt act t gagt acct t ggct a ggggcgaaca accgt t ccag aggagaaagt t cat t gcaat aagct cggcg gt t gccgt ga gct gt t ct t g caaggagat g ct at t cgat c gggat t gct c gact t caaag ggaat ggcca acct t cgt t t gt ct acagct taccagaacg agaaacggct gaagt cact a ccaaagt t gt ggcat accgg gagt cgggt t cat ct cgt ac at t acagagt gcacgccaga accagt cat g ct cct ccgaa ct acagat aa cagt t gt cat ct t t gcaaag acgagt t t aa aaggt cgat t agaggct ct c t at caaaaat gaaagttttt ct gaaaagca aagggat act ccagcaacat cagt t t t t gg acat gt ct cc t t ggaat ct t at gaaact ac cgcaact aaa gat gcat cca caaccat t gt gat t t t t ccc aa gat t act gca t gat t t t gcg t at t t ct t ct ct gt ggaaat acct ccaat g t gat agcagg cgt ct t gat a aact gaat t t gacaat t gaa t ggggaagt t gaaagt at ca t cagcat agg gat ct at gag aggt gagct a acat ct t cat t ct ct t agat aat ggaggaa cgagt at gcg gat t ct t gaa t act gct ggc gct gct ggat t at t gct ct t ct caat gct t ccaaagcagg gcct ccgcag aact t gacaa gacgaat gt a aat act ggag aat gt accgc ggagt ct cag ct agt t gt t c gaat ct gat a gct gcaacga t acaagggt a ggacaagaca aat ct ggct a t t t gt t ct ca gact ggact c caagat ccac gccgat at ga agt cgaggca gt gcat ggca at t at t agcg aact t ggt t a t cgt ccat ag t t at gt gt t c accagt aaca cgt gaact gg caagaagcac aat t ccagaa acaact gt ct gct at gct t t gaccacct t c ct ggaat t gt t aggat t t t t gt gat gt t t c at aagt t t t c agt t t t caaa caaaaaagtt ggct t ct t gg acaaaagcct ggcgat acaa agct cacaat acccaaaaat at acgaact g aat t ct ccat gcaagaagaa cct at gct t g gaaggaact a aagagaat cc caat cagt gt acccat t at c accat cgt cc t at ct acgca ccaacgaggc t cat aacat t t gt aggt cac agt ggt gat t t at t t gct gg t act acaaat gaagagcaat t ggaact gaa caggaacgag gt t ct gt t t g t gact at t t t gat cat t gga cat at at cgt t t cagat t t t gat agct gaa gaaat ct gac t agcagcct c gaggct t t gg t cagagt aat agaagaccgt gccagcgcct t gagggat t a 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1932 <210> 625 <211> 804 <212> DNA <213> Arabi dopsi s tha i ana <400> 625 at gt t agt t g cgt at cct gc gggaact at c aacaaat t cc aaat ccct at t agct gct at gccgctttgc gcagattcaa gacctcgcct tttggtcctc tgattagttg ttctgtctcc ccaagatcaa tagttccttg tagcaatgta gagtgttatc attcatcttt ttctacaaaa tatcaaattc agcctcgaag gagatggatg ttttgtaact cacgcaacaa ctcgatttca Page 531 120 180 240 t cggat gat g accgt gat t g gcat at gcgc t cgggt t t ag at t acgct gt t t gt gt acgc at t ct aagca at ggct t ggc gaggagccgt agt cct caat agt accgt t c gacgcggt ga t cggct gcac agat t gagag ct gaggt aga cgcaaccaac aat ggagagg t gcct gt gaa ccgt t gt ggc ggccacgggt 12689250 Sequence at cacccaac at agct at aa aggcgaaacc ttaaaagaat agat gaagat ttgagaaaag aatggagaat tacggcggaa t gaat gcat a t t gt ggct t a agt cat aaga t ggt cat cga cttttgcgcg gttatagcaa gact ct t cag ttggagcaga cagccgaatg agactggtgt t t aa <210> 626 <211> 1533 <212> DNA <213> Arabi dopsi s tha i ana <400> 626 at ggct t caa gt t caact t c at t cccgt t a aaccggagaa aaccgaggtt aaccgtgtgg acgaagggtg atgatgactc ggaggggaaa t t cggt aagt t accggacat gaagt cact g ttagtgtttg gtaataacag aaagaaagat ggacaagctg gt at acgcat agctcaaacg ggtgttcctg accttggagc tgcccaggac ttatcaaatg atgaagtcaa gaggctaaac tcaatagcaa aagcgattgg aaacgcaacc aatggtccgg acgcccaagt ttcgacctca ctagccggcg taagccacgt ggcgatagtc aacgtgctcg acgggattac ttcgtttttc act at ct ct g acct cat cga gaaagtagct acgagt t t ga cggaggattt ctctccggag gggagtaaca gcggcagtgg cagtagttcg attgcgtcgc tggttgcaga catttttgcg gaagt t t ct a cagat ccat c agcaccatcg ccggaagatg ggaggagaaa agtgtacgca gaagctaaag ttgcagctga caaagctcga aagcaaatgc aaaagctctc tgagaaagaa accaccgcgc gct aagcaaa caaaaaggga at accggt t g cct ggt act a ct ct t acaac ct agct cgt g gcagt t caat aaagt t gt t g gacgcat t gc tacgacggca ggcaat ct t t caaaccgacg aaagct t at a t ccgaggcct aat acggccg cggccagt gg gat gcgat cg gaggcagcag gcagaagct g Li st i ng. t xt gt ct t t t aag t cat at ccgc aact aat ggc gcacacagac ggat t gt at t ct cct t cggt at gcat act a t ggcggt gac t t agcact ct caccgcaggg cggcgt t t ca agaacccgtt t gacgaat cc t t t t t gt ggc gaggat t cag t cgcggct ac ccccct t cca t t acggt cgg t cgt ggt cca ccat cagcgg t cgcaaaat c t ggcct acac acgt cgt cgt acaaagt t cc tggcagaaaa acgagctttt caagggaaag aagcagcgaa caagct t agc acgt t acagg t ggt gt gaat t at gaaagat t aaat ccaaa cat cacgat c t t ct gat gaa t at t agagga aggacaat ca cgaggt agt g t gt caggt t t act cgggaaa ccagt t t gat t t ct accggt t ggt gct acg t gt t cgagcc t t acaagat t agacgct gaa ggcaacagag agcagct gag gt ccacat ac t cagccat t g act cat aaag t t cagct gaa gaagt t gaag t aaggt ggt g cagt gt aat c agcagaggaa agagt t t gag t gaggat gcg 300 360 420 480 540 600 660 720 780 804 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 cagcagaaag cagatgccgt gggagtcaca gtggacggtt t at t t aacaa ggcgaaagat Page 532 at cagct cag gct t cagaga ttgccaccca aaagaagaac aagcaagaga gt ct ct cct g cgcct aaagt agaaagcagt gt ccgaagaa ccat ct acat 12689250 Sequence Listing.txt gaat aaact c ggct ct cagt t cgcaact gc gat t cagaac acaggttgct acggtcagag gacaagcaaa agcgagaaac ggtgaagcag agaccatcat ctccgtttgc atccaaaccg gccggagaaa gaggtgagga aagtgttcgg aggattgttt cgacgatgat tga 1320 1380 1440 1500 1533 <210> <211> <212> <213> 627 1457 DNA Arabi dopsi s t hal i ana <400> 627 t t ct t ccggc ct ccgccgcc ct ct t cgt cc aagcgt cgca gat caat ggg at t gat aagc ct t ggt t aga act gagaggt t gct ggt ct g ct ct t at gcg t gat at cccg aggaagact c cagaat ggga agccagacca tggagaagca caaggat at c at t t gaagat tttcaaggac tcccgagggc gggaaagaag tggaggaagc t ggcgt t t t g t t t ct aat t g at cat at at t at aaaaaaaa gt t t t t ct ct t ct cact t ac t cgt cccct g t ct t ct t ct c acgagcacga gct aaagaag t t ggggt ggc ggagct aat g ct t aat gct t gact t at t cc at gaaat at g cct gat gct g ct t gat gaca gaccgt agt g ggaggacagt aaagaaaaga cct t cat t ca t acgct gaag at agt cat t g gagct t t cgg ccagat aagc gt cct ct t gt acaaat t at a cct t gcaaac aaaaaaa ccgccgt t t c t ct gt t cct c t cgt t gct ct t gt t t cct ca ggaagat t t c at at caaagt acgat gct gg gaagt ct t ag t aaagct cat agt t agct ag ggagagt t ga gacct cct t c aggaaat agt gt t ggggaaa cat ggacagt gggacgacga agaact at gc cccat gccaa aaaacgt t cc at t cgat gaa cat t acccac ccact ct ct t t at t t t gat t aaaaacat ca gccaccgt ac caccagagt c t t ct t cct ct ct cct cct t c accaaaat gt t ct t ct ccgg t act t at aac gt t t gaggct t cagcct ct c tgccacagca t gt t gt agca accagct gat t gcct t gt ct acct gagaca gaaat ggct c t ct t ct ggt g agagaagt at gct t agcaat agagaagt t c aaagaagat a aaat t act t c t ggt ggt aat t ct ct t acct t t gt aaagt t gt gacaat gt t ct ct t t ccc acat cgccac gt gct t caga gccgct t ct g act aagt t t t aagaat at t g gagct t aagc aaagacaagt at agaggagg cct gaacagt cat t t gagag ggt gcacat a aagt acacga aagt t cgaca t t acccact g gct gaagat g ct cggt gcaa gt agct gcaa agagcagagt ct caacat ca aacaact ccg acat acat aa t gct t gaat a ct gt t t ct ct ccgccgt cac at t ct ct t gg aaaaacat cc at gcagct ca gccat cccat aggagt ggcc at gct gcaaa at cct aacat ct ggt ggt cc gt ccagaaga at gt t t t ct a cct t agggag aaact ggacc act ct t at t t at gcggcgct t ggct gcat t aat t t gat cc agt at t ct ac at gaagcaat t aat t gccat at t t ct ct gg t t acgt ggt g aaat cacct t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1457 Page 533 12689250 Sequence Listing.txt <210> 628 <211> 222; <212> DNA <213> Ar a <400> 628 at gat t acgt at gat t ct cg ggcat caacc accaacgat c at cat gct cg at gat cacaa t t gat cgcca t gt at cat t t at t at ggagc gt cgt t t ct C aagct t cat g cggcct t cga aact t t aacc t t t ggt ccgg aact t cgaag ccggcggccg act aaaccga gcgagccat g t ccgacgt gt gct aaggaga ggt gat gat a gcagggct ga ggcggcggca at aat ggt gt at at gggct c at ct ccat ac gcact t caac agat t t at aa gacct t ct cc tttgcaaaag )i dopsi s t hal i ana ggcacgactt cct acggat c gct t cgt cgc ct t acgccat t ct t act t gc t ct t ct ct ct t gt acggaac ggt acact ct agt t cccgga t cgacggt ca t t accgt gag at ct t accgg at t ct gat t t cggat t t gt a agaacaacgc gt t cgt accc at aaaat t cc acgct aagga ttggcggagg t t cggat ggt t cggcggt ct at aaaat ggg acaacggaac ggagaaagct t t gt t gct t a t ct cagat gc ccaaaat cat ccggt ccggc gt at agccat agt acaat gt gt acaccgt c cgt acagt gg t at ct t cgcc gaat t t ccgc t ct at gggct cagcact ct c ct acgcaggt cct t ct ct t c gact ggt gcg t gat t t t ct t gaaat caaac agct gagat c t t act ct gt t ct ccgt t caa cgt t aaat at ggct ccgaac taaagaaaac gct t cacat g tgcaggcgac t gt ct ct gat t gat agt gga gt ct aat t cc acat at gccg gat cagaaac ccggt ggcat t ggt ct t gga t gct t gt ggg cat cat ggct cgt t caggct gcat cccacg ct caccgccg t ggaagat at gt ccct ct cc t t cgt cgccg aacct aacca ccaaacact c t ct ct aat gg ct ct t cgagt t ct at t gt t t gagacggat g gcat cgagac t at agt ct t a at ggggt t t c t ct t ct cgt g ggat t t t aca ccggagtttt caacagcaac t t t gt t t gga aacgt ggcaa caacct cgaa gagggagaaa acggcggagc ccgacaagt g ccaaacacgt gt ggct at gc at ggct at gt aact ct gt cg gt t gct ggga gcgt t gcct c at t ct aagca t ggt accact t ct caccaga t ct cct t cca ccgacacgct agaacggt ag t t gt cat ggg t ccaagt cgt accgt ggcgc cgt t t aaagt ct gagat agg ggt cact gat gt t cgact cc ccggcgggag gt ccgact cc at aacact aa caaccggt ac tgcaagagaa gct caagcgc cggaacaat c agagt aat gc gagagat aga tagaggcggc t gat gacacg act ccagt ct ccaaaat at t t cagct t agg ccacgt t t gc t t gccat t gg aaggaat agt ct ggggt cat t t acgt agct ccagt gct cc ct t cat ct cc t caaaaaat c ct t ggagt gg gat ccct ct g t gt t ct t cag t aagct t ct t t gaat ccgac aaacgacggg gat gact cca gagaggt t ct gct t t cgaat acggcct t cg cagt t ct gt t gggt gt t t ca agat agcaaa t t ct ccggt c t gaacaaggt t agaggt ggt gaaagct aca t ggt ggagat act gat at t g aat cggt ct c acaacaat cc t t t at t cat g cat ggcggt c ct t acacggc t ccgt t t gt g at t t ggaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 ttaatagcct tacctataac tctggtctac tatattcttc ttggcctttg aattggtccc Page 534 t t cagt at ca gaggaaat aa t gcct t t ggt aacaagactt t ct t cct t ac aagcagat at ac act ggct t t t gcaaagcctt t t t t ggt t cc gaat ggat t a ct t t at t t t t gt aat at gca 12689250 Sequence gagagagcct cggagattgg t at t t t t cgt gggaaagagt aat at t t gt g gaacacaaca aggatcttgt cttcattaga tcattttctg tttttgttaa cct t t t t t t a t t aat t gaag Li st i ng. t xt gct at t t at t ggaaggt gaa aaagt act t a cgccaagaag t t t at gt gt a t t t ct agt t a t ct cggt aac gaagaagat a ggat t t agt g aaaaaat ct c gat t t t ggct agt ggt t t at 1920 1980 2040 2100 2160 2220 2222 <210> <211> <212> <213> 629 1497 DNA Arabi dopsi s t hal i ana <400> 629 at gaagt t ca aagct ccacg at gt t accgc gct agt t t ga ggat t ggt gg cagat ggt t c ggaaacgt ag aagaaccct g t t gggat gt g t t cat gagga caagt aggaa acct ggagat ct t caagcat gccggcgagt aat agcgagt gaccaact cc aaggt agct g ggat act aca cat ggagt t g gcgaat t gct accgaact ag gt agcaacaa aacaagcggt ct cacgagaa t t ct ct cat a t cgacacagt t ggccct gaa agaaagat gg aaaagcacgg gagact ct t g at ct t gt ct a at t ct gt gcc gct t ccgt ga t gggacct t g t ccccggaat at gct gagt c acaagaacct at ggaaagt t t at ct t cagc gaat t cact g acacaagaaa t gct caact t caccagaagg caggggagaa at aggt caga t at t t gaggg gacct t cacg cccacact cc aacaat gt ca aggagct ggt acct at gaat t ct caaact c cagt at cccc cacagacaaa t gt cct aaga acgat t t gaa t ggagaat t g t ggagagt t c aat cgggaaa cccagaagat t t t cat ggaa gaaaggt at c gcact acaac ccat gacggg cacct gcat g t ct ggt caag cgcgct agaa t t ct ggaaat t caaaat t gg ccagaaggt g aagaacgacg gggcat t t ga gt ggaaggt g t at aact ggg caggt cgt t a t t gcct ccat t ct gggagaa ggaagaacac ggct acat ag agat acccat cagt gct acg act aact ggg act gaat t t t t ggt act ccg tttcaaggaa accaggt cac t at ct gccaa gagat gaaag caagt acaga cgat at gact gggt t aaccg cagcagt t ag aaaccct t ga ct agcgt t cc acaaaccacg t gat ggt gga aaggct at gc t gt cat t cca gggt gct t ga ggaaccct ga ct at ccaggt gaggagt t at cat accct ga acaagt at at gaacaagt gg tcaggagaga ggaagct gct gcggagcaaa acgcagct ga t agct aagat acggggagca acgcgacaag caagcgcatt cat t t act t a t ggagt t t gt gaaat gggag ggt gt t cgt c agccat gaac t gct t ggt gg cgagct t at a t caat gt gga agagat cagc at at at ct cc ct act cagat t gcggaaat t gagcaacggg gaaat cgt ca acct cat gat cggaacat gg agaacat gga gct at cagga gct aaccgct gt t caacaaa acct gagcac gcaggccgga cggacaagt g cct aagaat g t aagaacat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 Page 535 12689250 Sequence Listing.txt aaggaaggt g gt cat gggag gagact ct ca aaagaagaca caact ggaag t gacct t t at gttggatttg tcaaaggcaa gatcgctgag aatgtggagg aggctgcttt agtgtaa 1440 1500 <210> <211> <212> <213> 630 3093 DNA Arabi dopsi s t hal i ana <400> 630 at gt cgaact caaagat gga cgggat ct cg cgagt t gct t gaat acaagc gcat ct at gg ct t gct aaaa agggaaaaga t t t gt gt ggg t caat aggt g at ct t gct aa t t gcagt t ca gggagcagac at t ggggat c gagt caagct ct t t caggaa aggact gagt ct gcaggt ga gt gct t acat t t caccaact act at cat t g t t t gct at ga act at gggt t at ggt ggt ca gaaagct t t g cagaacact g ccgacagaaa cgt aaagagc t gct aaggga ggt cgt ccgt at aagct cgc t ct t t gt t ca t cact gat ga ttcgcaaaaa aagt at ct gt t at t t ggaga aagcgct cca t gggt gt t gc gt at act ct t gagact t gga aagaaat ct c aagt t ccagc t at cgggt ga ccaaagt aca ggggaaagtt agct aaat gg t t gt ggt at t ggt cat ct ga t cgt t gccgt agaagct gat ct t ct act t g acaaagt at g agt t ggaact gt t cggaagt gagcaat act at aagat act t t t cgaggt g gt ct at cgt g t gact acgag aaaggcagct ggt t aagaaa t gacact aag at ct ct ct ct aaat cgt t at cgacat aacc tacagaagga ggt ggt cat g tcgagagaag cat t cat gac t gat gggat t gagt gaaccg aaacggt t ct gat ggaaacg ggt t gcaaca gt gcat aagg agat gcat t g gcccgaaggt gagt gacagg cat ct gcact gat ct gt gat at cggaagaa t gt t aaggat cgaat t cggt caagat t gag gaggct aaga aagaaccgt a aacaagaagc ct t cat t t ca gccggatttt agcct ggcac gaaggcat t c act gagaaac ct t at cat t c tttccaaggg gt t act gcga aagaagat t a t t ggt cgt t g t t t at at ct g t cacat gt ga gcaaaaat gc ct ggt t gat g at cat t ggt a t t t gt ct t gg acgct act t g t t gccact gg gcact t gt ga gat aagacag aaggt t caag gt t cagagca aaagat ggaa t t act t t t ag ccgt t caat t at ccgt cgct ct cgt cgt t t accagat cca t t gat gct gc ct at t gaagc agaaaggt gg gt t cgagt ga cagccagat c t t at ggt ct g gaat gt at ga t cagt gat t a t t gt ccaggt gagat gt agt gat acaact t at aaagagaa t ggt gacaac gt ggagaaga agat t ggt t t acaaagcaac act act t t gc cagt gacct t ggcat ct t gc ggacct t aac agaggcaaga ct ct t t t gca acact cagat gt ggt gat t t cagacaagaa ggaagcgcgt ccgt aacat t ggaaaagat c t gct aggccg agat gaact t agt t gaagaa agt gcccat t at t ct t aat g cgct gt agt a tgggacaggg caagcaat ct caccagagat t cat ct at ct ggagat agac gcct t t t ct g agt t ggt at g tgagacgcct aagct t t gct gt ct ggt agc aat t t ct gt a aagt t t ggca t gcat gt gag t act aat cac gggaagt aaa gggt at at t t cct aggat cc taacacgcag gaaaat gt ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 gttcttatag ctcttcctgg aggtggtgca cgggctttct gcaaaggggc atctgaaata Page 536 12689250 Sequence Listing.txt gt gt t gaaaa t gt gt gagaa t gt t gt ggat tcaaacggag aatcggttcc gctaactgaa gaacggat t a ct ct gct t gg t at acaat gg gt t cagact t acagccaaag ggt t cagaat gt aat ggct c ggagaggt ag gacat t ggac at cat aat gg t at at aaaca at caact t t g t gggt caaca gaaggt t t aa tggagaaaca gcaggcaaat at ct t caact aagat aaat g acggt agt gt ct gagct ggc gt t at cct ca cct t ct ggt c cgagt at ct c tttacaaaga t agcagt t gt gt caagct gc caat t gct aa t ccgggact t ggt ct ct gcc t t gcagt gac t agcaat ggg acgat aact t t t cagaagt t t ct ct gct t g t gat cat gga t gaaacgt gc t cgct ggt ca cact t ct caa cct t cgt ct t t gt t caaagg t t caagt gat aacat t ggt t aat gcgt acc ct t ct t ct t c t gat at cat a t ct agat gaa t ggaat caag cggaat cacc ggaat gt ggc gt ct cct cac at t ggacaaa aggggat ggg aat agct gga caaaacaat a t gt t cagt t c cat cacagga cacact t ggt accaat agct aagcgt t t ac act agat ggc t t gccaggt g aat gt t caat cat agt ggag act at caat c t gt cgagt ct caact ct gcc gagggt t t t g gct cccagt g gat ccagt ac gt ccgt at gg at at at accg gaaat gaggg cat act t t ag acaaacgat g acagaggt t g gt aaat gt gg cagct aact g t ct gct ccac gcat t ggct c agaaccgcca cagt t gat t g cct gact cca t t caat gaga agct gggt gt t t t ct t ggag t t gat aggat cgt caccat c t ga cct cagaggc gagagct ccc gt ccgggt gt t cact ggaga aaggaggctt ccat t at ccc t cagcaact t cccct gcgt t cgaaagagaa ct agat gggg t t aat gt ggt t cact gct gt tagcaacaga gct t t at cac t ct t aggaat cagccgt t ct t caat agccg t cact t gggt cat t t gct ag cat t gaacat acgat ggct a t ct gaggact t gat ggaggc tagggaagcc t aacat aagc agct at agaa caaaat t cag gaggaaaatt gcat gaggca t gct gat gt g acgt gct gt a t gct ct gat c gcaact gct t acct cccaat caaaaccat g t ct caat t t c t aacacagt c ggaaat agag aat gact gt g caccgt t cct gat t gt agcc cgacct gct t 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3093 <210> 631 <211> 1077 <212> DNA <213> Arabi dopsi s tha i ana <400> 631 atgtctgctt ttgtcagcaa atacgaagat acgccgggaa ggggcatttt ggcggcagat gccggaatca at gt t gagaa cact gagt cc act t cccct g gcagt t accc t t gcct ct ct cagaaaacct ctgatggcaa accct t cgt c ggaatcaaag tggacaaggg tttggttgat caaggt ct ag act cgct t gg t gcacggt gt gagctgatca agacggccaa gagagcacag aaaccattgg aaccgccaag cttaccgtga ggtgttatac tcttcgagga gatctcctca tggagaacgg ctagcaggga ccaacggcga cagcagtatt acgaggcagg Page 537 gt acat t gca gaaacgat t c gct cct ct t c aaccct ct ac t gt t at cccc gaccact act agcccggttt 120 180 240 300 360 420 12689250 Sequence Listing.txt gctaaatggc gtgcattctt caagattggg gccaccgagc caagcgttct gaggacgcca at cgt cgagc act gagacgg ggcact ct gc gaact aat t g ggaat cgt gt at gaacaagc cagcaaagt g aagt t cct ga gct t ct ggt g <210> 632 gggt gct agc cagaggt gct t t ct t gct gc t t aaacct aa cggaat acac t cct ct cagg t t gat gt gt t ct at caaggc ccaggt gcaa ccgct at gcc gacaggt ggg cgt gt t caag cat ggt cact ggt gact gct cat acagcgc gaagccat gg t t gggccggt ggct aacaag at cat ct gcc agccacgaca gcct t gaact cccggct ccg ct gcgccgca gaagagcaag acgct cact t aagcccgaga gacgct accc aggagaat gg t caagaaat g accaccat gt acagcccaaa cagt cccacc cgacact aaa t ct ct t t t gg acgt agccaa t cgggaaat a ct ccat ccaa act t gt cccg t gcggcggt g cct cct cgaa ggt t gcaccg agct at t ccg t ct aaacgca cggagccct c agct caggcc caccggct gg 480 540 600 660 720 780 840 900 960 1020 1080 actcggccgc tttcgagaac ttggttgtga taggatacag gtactag <211> <212> <213> 1188 DNA Arabi dopsi s t hal i ana <400> 632 at ggct t cgc t ct t cat cct at cccgct ct ccggt t acat agcat ct t gg gat gaagggt cgt gat t ggg act t t ggt cg aagacggt ag cagaaat acg gagat t aagt cggt t at gt g aaat cggt t t gct aggct ga gct ggccct c caagat gcaa cggt t ct t t c agt gacacgg aaggat at gg t caggct ccc cct cct ccgg t cccat ct t c gcagt gct gc t ggt t ggagc at gat gt t ag gcgccaccgt ggat t cacac at t gggaagg t ct t t t act c act gcact ga gt t t cat gca ggggaacaga cact t at agc gt gcat ggac at gt cacaac aat ggacaaa t ct t ct ct gc t gact ct aga cgct caact a gcgt ct gt ct ct cat cct ca agcggt t aat cact ggaact gt gt t t ggt c agt caat gcg t gt aat t gat aaaagt t gct aat ccacaat gaagt t t ct t aggt ct t at t tgcaccaacg t t t acgt aat aacacaagaa cgt t ccagt t t gat gt cgcg t ccgat gact gaagt act t g gt t acacgt g t ggcgacgt t t ccct t aacc ct t gct ccgg ttgggaagac cgaccaagac gat ct t agt a t gt gccacgg ct gat acaat t gcgat aagc caggaat ct g ggt caat at g cgt gt t gct t gagaagat ca gt gat aacac t ct gt at t ga gat agact gg gagact aat a caggat t act gcaat ct cat ct ct cacacc gggaacgat c ggacaccagt aaat cgt ccg cagct cccgc aaccggagac gacgt cct ga gt gcaaaggc at cct gaggt gt t t aaacca cagt t ccaat acat ggacac at ggaaaact t at gt gagag gagt cacgcg cct t ct caga gt ct act agg t cagcaacat act t ct ga ccaccacaac ggaaaat aca aat agt agt t ccgacccacg gagagct t t a cgat t t cct c aat t ccggcg agaacct at c aat ggggat t t cct ct cat g cat caccat t t ct ggaggag acaggat at t t ct gact t t c act t gct ggg t cagt t aact ggt t ct ct cg t gt ggat cag at t gaagaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1188 ct aaaggat c t aaaggcaca at ccaagcaa t cagacat ct Page 538 12689250 Sequence Listing.txt <210> <211> <212> <213> 633 5148 DNA Arabi dopsi s t hal i ana <400> 633 at ggt cgacg gat gat agat at t t t ct ct g aagt gccgac at agt gaaga tggcggaacg cgaagcgact aggggt agaa ccaggct gca ct t gct agag gact t cgaca gt aaaccct c ct t ct t gt t c gact ggct t g caat gt cagg ct gt t ct cca at gaaat t cg ct gaagggca tccgacaaga gaaat ct t t c t ct ct t gccg t t t gt aact g aaaat cat ca aacagt caat gt t aaagcca cat at gt at a gact t gt t cc acat gt t at c aat at gccct aagaggat ca agt t t caacg ttgcggagga agaat t t cgc gt agcaaggg aacat t gct t ct ct at ggga ccgaact t gt t t ggagt ct a t cat ccgcag cagct t at ga gggagt t t ca aagt t acaag t cgat gat gt gt cct ggaag t caacgagat gat gt gcgt t t t gact at gc aaacaccttt t t t t cgt t aa t ggacat t gt gt t gt ggat t tttcagaaaa at gat caaag ct ct t at aga t aaact t gga acct aagat a ttccagggga ct ct gcact c gcagt aaact cgct cagct g caaaggcgt c gagt gat gca t t cgt ccaaa cct agt t gt g ggaact gaaa cat cgcagac ggaaaaaatt ct ccagat t g ct t aggt at a ccaact t t cc ggagaagggc act gagcat t gcgcaagcca ct t gat aat c at at aaggt t t ggaaaagat taacggaaac agat at gaag act t aaaagc gt t t acct t t ct t t ccacga cagagt gcaa cgat gagat a acat aaagaa cact t ct aac t ct gacgat t ccct caat t t ttttccacaa t aagaaact t t t ccgt gcaa t ct gt t t t cg gct at agcga ggat gcct aa gt t ccagt t t aagat gt at c ttacgaggag gt t gccgacg acgaagat ag t ggggt at gg cgt gact t t g t t ct t t gggt ct t t t aaaaa ct gggt gcaa gt aacct ct c caaggct t aa gt accagat c ccgt t agct c agt gt agt cc agct at gat g aggggagcaa gt t ggaat cg gt gaat aat t gggat gt gt t at ccgagaat t t accct t ca t acagct cca ct gccgcct g aact t t ggt t tggggaggaa t t act t aacg ccagt gaaga aagcaagggt acgagttttt t ct acggact ct gacgat aa gacacgt at c t gcgt caaaa aat act t gct caggcat agg aagct t ct t g t gt t ggagaa cat t acgcag cgagt t t t ct aagat aagca acaagcacga agaacct cct t cagcat t t g t t gaact caa cact t agt gt at gt ggacaa aagct ct t gt t gat ct acga at agat t cgt cagaacaggg aaggt cat at t t aat cct ac agct aagact t t cagt at ct ccaagaat ct t t gat gaact ct cagcaagc t t cagt ggt g gaaggt t t ct cacaaat t cc ggt agacgag at ct cat aag act t gaccgt ttgcaagcaa aaagacgaca ct t cat cgaa acagct t ggc t aaaagaat t t t gt gaat t t agt act t gt c gt ct ct gcag ggagt t gt ct tggcaagaat gcgacat ct t cagt gagaag cgt aat gcaa ggacaaaagt t gt t ggcct c ggat gct t cc ct at gaagat cgcgt t t cag gaaggat cct t ct ccact gg t gt ggaact c t gaggt ct t a t caat at t ct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 Page 539 12689250 Sequence Listing.txt ccaaacatcg aaaaaattga tctcaaggga tgtttagagc ttcaaagttt ggt caat t ac ccaaaggt gc t cct cgt t ga t ct t ct aacc agt ct gcct g gaact t gaag gccat aaaag at ggagaat t gccgt gct ga ct t aaagagt act ct ct ct g acgggaatga gaaat t at t g agagaat t gc t gcaat agac gat ct at caa ct acgt ccag t t t t at gaac gagat acgt t gt gccagt aa aacct cagat t cat cact gc aat t gct t t g gt agaat gca cct t cacct a at t ct aaat c t t ct caaagg cat gct cat a at t aacgacg gaaggagat g aagcaggaga aat gat gcag aacat ct ccg caccgt ct at at cat t cat c aagat caccg acat t gt t at at at t caggg aagt t ccat c gcgaaaggct aact gt ct gg t at at ct agc aagt t gt t ct gt aaat t gga t t gat ct t cc caccgt ct at t ccgacact t act gct caga ct cct acagt at agagt t ac ggat gcct t c gcat caagga cact cccaca aat t aat t ac gcct accct c ggaaaccaca caagt cgaga caaaaacgag at t t t cat ga agcgt gat aa at cacat gt t t ct t cggt at t gcat gt agg ct ggcagct c t at t gt ggat t aggaaact g agaaagccaa t aagcaagt c t t t t gagt ct tttcccccaa at cgt t gt gc t cgagact t g ct gct caaat cggcaccgcg at t agat t t g at t t ct t gt c t ct aaat ct a t ggggat ct a gccaat ggaa gct t gaggt a cat gct act t t t t gtctct t act aaagaca tttttctaaa gctt ccccga t ccggact t c acacat ggt t gcagggact c t t ccaaact t aagt acgct c t act gct ggt t at t t t t cat t gt ct t t t t t ct t ggct gat cgat agt t gc aagt ggaaac ct ct caact t cat ct t cagg agat t aacaa t t gaagct ga ct t gaagt t c aacct gaaaa caccat at ct ccgatgggaa ct t gagaat a gt aaaggagt gagaact gca at gct caaac at agagt t at gct ct act t g at gcacaact t t cacaagt t cgat caaagc t at aaggcac t t ggat ct t a ct act cagt c agt ct acaac aagcagct t c agcgaagttt gaaaacgcac t at ct acaac gt gggct t t g t t aggct t t a t gct gggct c gat ct t aaaa ct agt t gt ct act at aacaa acgat gaccc gcaaaaagat gaact ggcat ggaagt t gga aagat agt t c ttgat t t t t c ggt t at at ct ct aaact t gt t gagt aacat t t aaggaact t cccat caac agaagct t ca t gt ct ggct g at ct t gct gg at acat t aga t gaat cccct ct ct accaaa t gccat t t t g gt t t gcagt a gtaggaacgg t aaggt t acg t ct t gaat gc ct aggt at t a t agcgaat gc t agct t gcag caggat ct t c caat at t agt gat ggaat ga caggcgaagt t gcat ccaag t t gaaat at t aat gt ggagt cacaat gt t c gagt t aact a tccagacacg aaaaagttt t aagagat ct a aaat gt t agc t cat t t ggga t ggt t gct ca ggct aagact caaact agat gaaat at ct t cccaagaaac act gct t gag gggt ct gcca ct cgaagct g cact gccat c t ct gaagaac caaagt t ct t agt cagagaa t t t ct t cat a t at t cct gaa tttcacagag ct at t gcgaa gcacggt t gt cacgt t cagt accggct at t ct t ct gt ct a aacaat gat t t gaagt t t ca caaaaaaggc t gt t cct aag t at at t at t t cccggt caac ct at gt gat t t t caat ggat t gct ggt t t a 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 tcactgaagc ttttagatgg taaaggtaaa aagagattaa Page 540 12689250 Sequence Listing. ctttac atagcatgtt tgctcggt aaacaaagag aaaaagcttt attt gat t t gct ag t t agct ggca caacgaaatt t ct at ggct t aaagat t t t c cgt at at gca t caagcat t g caat t gat gg t acaaggt ca aaaact t gca gcagcaagca aaagt t gcca cagagagt ag cggt t ct t cc gagaaggt aa ct caagaat a t at gagaaat aat ggt at ac gcacccat t g gct ggcggt g t acaat gct g cacaat t t ct ggt at ct cat t t t ct agaga cgggct cacc ggaaacaatt t aggt gat aa cggt t gt t gt agat cat gaa at cct t ccga agaagact at t t gct ggcga at gat at t cg act gcgacca aat t gccat a t agat at gaa at t t agt t gg gt gaagagag cgt at caagt gt ggt t gt gt gccggacaat gagct caacc t gct agt act agat at at ct aat cat t cat ttgcaaccga cat cagt gac gat cct aagc gt t ct cggaa gt gct gggaa t at t cggaat aaat gat gag at gt t ct t t g gaagaagct a t gat aat cca t gat ggaaat gcagacgct g aaggcggcac gt ct t ct t t c at acact aag aggat gct t c gccaaccgct t t accct cct gact t t gt aa t ccaat gggg at gct gcct g aacaat gat g t t t ct caaga cgct cgct ga aat t acgcct gagt t gggt c caaagcggac cgacaaagat aat t gggcat at at cat caa tgggagaccg cat gat gaat aaagagat at at caat aaca aat cct acaa aggaagaagc t acat gggt c t acccggct g acggcgt at g t t gagt ct aagt aat g ct agcaca t t t t t gt g agct cgt a tcaacaaa cct caagt aagt agt g act t t gga ggagt cgt cggat gca aaaaat t a agt t t aat ct ct agt g at caaat t t t t gt t t c gcct gcat acat gagt aaaat caa caccacaa ct gt ct aa t xt gg tgagaaggct ac gctcgaagac at gcct cct ct g taa agagt t ggt a ag cttccatggt It a caaaggaatt tgt tataaaagaa tt gtgcttgctt at gcctattttc taa aggt t t caag gc tttgactgat iga cat gat t gaa tgg gaaacaaatt aa at gt at agag ag ccttacaacg ac aaaagtaaaa at gcat at at t g ac gttgtatgac gg agaagagaaa tgg gataactaaa tgc caatcaaggt 3900 3960 4020 4080 4140 4200 4260 4320 4380 4440 4500 4560 4620 4680 4740 4800 4860 4920 4980 5040 5100 5148 <210> 634 <211> 1778 <212> DNA <213> Arabi dopsi s thai i ana <400> 634 aaatcatatt gagaacaaat agatttggtt gacgt t agt c gcct t cgt gc t gacgt ggac ggcgacggat ttggctgata cggtggctga t at cgt cggg gct ggt gt ag gt ggt t cggc t cgagt acat gtgatcgaga gggacatgag gcaacctggc ggacgactca t gct t t ct aa agatgcacag aaagccacgg gtttggcagt ttttccagtg gataacaaca atttttctta ccgattcgtc caacaactgc gtcgaaaggc at at at ggct ggt gt t ct ac ggat caaaaa t ct cgcat at agaaccagaa act t ggcct t t t at aaagat t gaacct t ct t t t t t ct ct t Page 541 tttacgcacg ct t accaaca gacggt gct g gct ct t gct a agaat gat gg caagat t gct ggaaaagaag gct cgat ct t t ccaat gt gc t t t gt t t at g tgaagaagaa ct gacgt cat aggat gggcg gt gagt t t at tggaagacat cagacgcacc t t cacaat gg gcct ggaaga 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt aggaacggt g taaagaaggc ct caaacct t t ggt t acat c t aaaccat ct ggt t ct cccc t act at agt t gggagcacac aggt gt gat t gat ggt t t t a cggcgat gca gt cggcgacg tgaagcaaaa t acgt caggg t cat ct t t gt cct t cgcat t t ct caaagct aagct at at g gcacgaat ct t at gt aacgg at t at t t t t g aagt ct t t ac gaagaaacaa cgt cggt ct C t caaagaat t t t caccat gg gaaaat t t t c cct caggt ac at aaaagt gg gt at t gggag ct gt cggaca aacaaagt ct gt t aacacat gaggcaat ga at gat ggct c gccat cact c t ggcat agcc gaaggagt t a gct gcaacca at gt gat t gt ggaaaagt t c t t gt t acaag tagaagaaaa cagcct t ggc t t aat gat ga gt cggct t ga t at accaaat ct t ct at t gc ct ccaaaact tgccggcaaa at gcat t caa t t ct cat t ct cagaagt t at t gggaaat gc gacagggt gt t gct cggcgg t at cct ccat t caagct t t t gccaaat gt t ct ct ct aaac gcat t t ggt a taaacacaaa t aat gct ct t aggagt ggt c acct ct cact caacaat gct agaacccgaa aagcagcact aaat ggt gaa ccgcaaaat a gcgcat gaca t at gcgt cat acgccgt ct t caat t cct t t at t t t ct caa ct at gat t ac cat gaat cct t ggccaact g t ggt t t ggcc gt t t ccagca t t t gat gct c aacgt gt at t aaaat aaact tttttttt aaaggagt ga cat acaagaa gt ggt at gcg gagat t at gt aagct acact gacgt t cgt t at gt ct act t tttttgaaag t ct act t t aa ccagt t gt t g ct t cagccat t at gat at cc gt act aat t g ct t t gt agt g cgt cct ct ct ct ct ct ccat at gaaaat gt aat gcagccg t caat cgcaa gcagt gct t a t t t gaat gt t acggt t gct a cgt acat agt t gat at t gt c gt ggt t t t ga t cat gaagaa gt at agat ga gcaagaagaa cat ct ggaat taagcaacct gcaagccaat gat caacgga gcgggt t t cg ct ct cgt ct a t t ccct ct cc t ggt t cccaa cgt at cacaa t at at at gga t aat t at t ag at at gt gt ga 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1778 <210> 635 <211> 499 <212> DNA <213> Arabi dopsi s tha i ana <400> 635 aaaagacttt cagtttcaat aatctttcta t t t ct cgagt gct gcaat ca tcaaagcaac at gt cgct at tcctaaagga cacct t gcgg gat t cgt ggt t ccagt gaca t act t aagt c cagaagaaga gt t t gggt t t gatcacccaa aaatcttcat cgatctcgcc tctcgcctta gatcaaactc tttttctagc cgtttcttca gagctgtctt ttaactgatg taagattgtt at t aat at t a at caaaat c t aaaaagat c t t ct aaaat c t t t acgt agg accct t gt t t t gggt ggcct gcact t cat c agagat gat c t t gt t t t t cc aacat ccat g gt t gt cacac agagat gat g t caaaagct t cact at t ccc gt gat t caca t ct ccct t ga t cacgt t t ca gct at cagaa t cat caaaca cagaagagga ct aagaaaag t gcact gaac ct gaccagt a aagt gaaaca gt gagt t gt g 120 180 240 300 360 420 480 499 Page 542 12689250 Sequence Listing.txt <210> <211> <212> <213> 636 1468 DNA Arabidopsis thal i ana <400> 636 aagaaaagaa ct act gt cac at gt t ggt t g gat t t ccaag gaccat ct at gt caaat t ca t t ggtggggc acggat t t ga gat gt ccaac t t t aaact ga t cct cct t cg aaact agt aa t t gt t t gcat t t gagagt gt ct t aat t cac aagcct t t t g t aaccct gt t act ccacaaa ct caat t gt a aagaat gaat t gt gt t t aga cgacggatcg ct gcaaat t c gt aat aacaa t aacat at ga aaaaaaaccc ttgaagacac acggaat gt c t t t ct ggt gt ct t t t gt t ct act t caaaat at caaaacga aggagaggtt caaact t ccc t agagt t t t c at ggt gaaga t gt acccaaa ccgat t acgt t ggaccaact t aat t t at ac ct gat aaat a aat caaat gg tct t cgaggg t ccact act g agct cat gt t t at cagt gaa aat cggt t ga t ct t at ggag cgat at at ct aagagat t at tcctacggta tatttgacga ccattatcct cataatgggt cat aaaagag t aagct act g caaatggagg ggagat caca cggcat agt c aaaaaaacgt t ct t gt gaac cgt t acccgt tccgaaaaac gcat t ct t ct aggaaacgga caccaat ggt caat cgcaat t t t t t t ctaa gt t acat gt t at t cct t at g gat t t ct agt agt at ct ct g gaacct at ct t gat acaaac t t gt at t gaa aagaat gcat at gt gt gt gt aaaacat a aagct aaagg acggaaaaag t t ggt t at ac gat gaaaagt cct cagact g agt caagggt gacaaagcgg at ccct cgaa t ccagat t ca t acgagt t ca gatggaaaag cct aagggt g cact gt gaaa gct t t acat t t gat aaagag gggacgtccc gaacgat cag at gcgcat t a accagt t at a at gcat gt t t t gt t acaat a gct t gt ct at t aaat at t t t accgcaaaaa t t aagaat t g gct t at ct ag gt accgggt c gaccagat t a t agccaact t ggt t ct at gc caatggggac cct ggaagat cggt t ggacc gaaat t cgt t gaacat t ggc ct ggt at gt g t t ct at at gc t gt agat at t aagt t t t t gc at at at at cg at ggat t cga t at t ct gcca at acaaaat a acgt t caacc aagt act acc at t ccgact c cgct cact t c t cagt cat t g gggaagaaaa t act t gggac t t gct t cgt t cat at cacac cgagat aagc agcagaacgc cacgcagttt acggagatgg gt ct t t gt at cat ct at aaa t at at at aca at at acat at ggt t t cct t a cgctggagga gt gt t gagat tcaagaccag t at t gt gat g at aaaat aag gaat ccgt t g t t t ctt gtt t t t aaaaaat a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1468 <210> 637 <211> 858 <212> DNA <213> Arabidopsis thaliana <400> 637 atgctcaaaa tgggtctaac gtcatttgaa gacgccataa aagagcagct aaaggaacga aaaaacgctc acttcatgtt ggttgatgga atgtcgaagc tactgactga aaaagttaac Page 543 12689250 Sequence Listing.txt aattttcagt cattggattt ccaagtatct ggtctcaaat ggaggttgct gct gt gggag gggcct aat t gaat at t t ct aaat t cat aa t at gccgaga at ggggacag t ggaagat ca gt t ggaccac aact cgt t gt agaacat t cg at t ggt at gt cat at at cat <210> 638 t aaaagat t a gggaagt caa acgt ct cggt cacacacgca taagcgaaga cagaacgctt cgaagt t t t c ggagat ggaa ct t t gt at t t cggt ct at aa at gt at acgt ct at at aa t ct at ct gt t at t caact t c ggggt gt cac gt t aaaggag agt cat acca t aaact gat a ct cct t caat act agt aat g gaat gcgt ct at t gagagt g aat t at cct t gct gt at gga aaaat cggcc aacgaaaaac aggt t t ct t g aact t cct cg gaggt ggct c ggt gaagagc tacccaagag aat t acgt ca ttggaccaac t t gt t t t ct a t cat agat ga t act ccct ca aacct gct ca tgaacgacaa t t act ggt at ggaacaat t c at t cgt ct t a gaaccggaga ccaat aat gg t ccat cgcaa agct t t acat t at acaacca aaagt gt acc gact ggacct aggggt agt c agcggt t t t c ccct cgaacc cagat t cacc cgagt t cacg tgggaaagga tccaaagggg t cat t t cgaa t t t ct at at g 180 240 300 360 420 480 540 600 660 720 780 840 858 <211> <212> <213> 1563 DNA Arabi dopsi s t hal i ana <400> 638 at gt ccgcca cggcgat at c gacct at cac ct ct t ccct t ct ct t cat t c gt aat ggt ga cct aaat act aagaaaat cg t act t ggaga ggt at cat gg agact aaaca ct at t ct ct c caaaccct ac at caaaccaa gccgcagcag ct cgcct t t t aaaccaagt c agat t t caat ggaact ct aa gccacacatt act t cat gt t t ct at ccat t gct t ct t cgg t t ct agaaga gagt gagt ga t t gacgt t gt aggat aaaac ccggt cgt gt agt t t t gcca cacgaagcca ccct aat gaa ccagact ct t ccggt t gcag aacgcaaagg at t ccaagct accaaaat ac gat ct t caac agt agcat t t gat aagct t g gat caaaaaa t gt cggact c t gat ct t cct ggt cgggaga caaacat gat t at t ggcat c ggaaat t t at gt accct aaa cact t t ggt c cgt ct ct ct t act aaccgt c t t gt ct ct t t ct t gt ct t t c caaaat ggcc gt agaaggag gaggcgggag at gagccat g gaaggt t t t c gagat gt t cg caagt t at ga gaaat gaaag ct t gt ct t t g act t cct t ca t t cgt gaaga ccat t gat t t t t gt t cat gt t gcat ccct t act aacgact gt t t gt aacc t at t ct accg ct t ct t ct ct ct ct t ct caa gcgt aat aag agat gggt gt gagcggggag aagt gt t gaa t cgaagggt t t cgt t ggagg at gagt t agt at acat ct ct aat cagacaa t ccat gat gg ggggt cct t t act ct t t at c acgt t t cat c at aggact t t gt t t at cct c cct ccaat cc at ccgact ct gt cat t t ct c caaagt gat g agcggt t t t g gagaggaggg ct t gagagat t t at t at ct a t cgt aaagag t caccgat at gcgaagct gg ccgt ct cgcg cgcagccgca aat cccgat c tcaaaaacaa at t ggaccct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 ct ct at gt t g cat t cgct t t gagaaagaaa aacat caaaa ct gt aacgt a t agt t t gagt Page 544 agggt at ct g gacggt caag accact t gt a gt cat cgt t c ggt ct t aagg caat t t ct cg gaggt ggcca ct cact agaa t aa agat t t t ggc ccat ggagaa gagaacct t a ccgt ggct gt cact t gaccc accct gt ct C acaat gt t ca aagacaagt a 12689250 Sequence tccgatcaag acggtgagac at t gt t aacc gaaggagatc cctgcttagg tttagccctt gacggtacac gt gacct t ct gcttttcttc ctcttggatc cggtgccacg tgccaagatc gagt gat at t gggaaggcgt tttgatcttg gccggtaata Li st i ng. t xt t gacccgt ga t cgt t gt t t g t gt t caccga t ct acggt ac ct t at cct ac ct gat ggaaa t ggat t t cga at ggagt agt t cgggt gagc t cct gaagga ggt t agt gat aacggcgagt ct acaccat c gt t gaagt t t gt gcacaagt t aagaaaaat 1140 1200 1260 1320 1380 1440 1500 1560 1563 <210> <211> <212> <213> 639 705 DNA Arabi dopsi s t hal i ana <400> 639 at ggcgat t g aagacgat t c agt gt ct ct c gct ggaat ag ggaaat gaga cgt cat t t t g gt gat ggat g aat act t t t g t cat acgagg at gaagat ac aaagat acag aaagat aaca ct ct ct cgt c at ggat t agg t ccggcgt ag ggaaaggt ac t t ccgat t t c gat gt gt gct cat ct ggt gt tggaacagac cgct t gagt t t agagt ct t a ttgaaagagg t ct ct t at at gt cgt cgacg gacagt act t cgccgt ggt c agct gat t cg t gat t t at gg ct gt cggaaa t gct ct t gt t t aagt t t aaa cgt t t caggg cat ggaagga cggct ggcaa acgcaaggt a at cacgt cca cct ggt t at t gt gt cggcca ct agat acag aaagat agga cgagcagctt ct gat cggac ggagaggt ct gt t t ct gt t a taccgccaag caaggcggaa at ct ct t t cg t t act ct gca cggt caaat c ttaccggagc t gaaagt ct t aggccgt t gt at ct t gcaga cgggaagcat at gcggat cc cat t t acacc act ggaaact t ct t agt t gc t ct aa gccgaagct g t cact t t cgt t t ct t ccgga ggat t t gaga t gcat t t gct aaagaaggat cgat caggct aaaccacgca caaagct gct ct cgt t t at g t ggccct ggg 120 180 240 300 360 420 480 540 600 660 705 <210> 640 <211> 1048 <212> DNA <213> Arabi dopsi s tha i ana <400> 640 actcatctct aacgtttctt tctatctcta ctttcgacag atctccgatg tttcgagtct gagaaacaac cgtcccacca tctcttacac tccattcacc ggagaaaaac ctctccctcc aaaagctccc aatcctcatc tacttccacg ctccacccta ccacactttc ctcacatccg at ccaagcaa acaagagt gg cacaaaacgg gcat ct acct gt ggt ggt t t ccgt cgcggc Page 54E caat at ggat ccgcat t gag t gt cgt ct cc accggagaaa cat cat agaa cgct aact gc t ccgt aat cg cgt ct ct t ag aaagat at ca gt cact gt t a acagcttttt t t agcaat at 120 180 240 300 360 12689250 Sequence Listing.txt cagt aaact a at t cact caa aacacggcga at cat ct aac t ct t gat t ca tggggaagac gagt t gat ga gggt t t t ggt agaagct gaa gt cat gt t t t t t gaagagt t ct aat gaat a t cgccgt gca at gggt gt t a tttcggaaaa gat gcgagcg t ccat at t t t gaaaggt gt t t ccat ggct c aat ggt agct gaagagt gga t cat t t gaag t at t aat aag aat t ggt gt a ccggagt t t c act cat at ca gt gt t t ct cg aaaaaagaga t ggagt aaga gaaggaagtt aacgt t gt cg ggagat gat c t gggaaggt g aat cct aat a t aat at t aat t t t gt cac cggt acct at caggaaccgg ccggagat ag agct t t gt ga ct ccaat t ga ggagagt cgc gat cggat cc t gt t t gt gag aagt t gaggt gt gat aat gc gagt acgt t t t ccgt acgaa gat t cat ggg accggaaact cgccggt gga t t ct t t aat a t gagt t t gag aagt ccgaat at ccggat t a acaaggt t gg aat ggagact t cgt caagt c gt cgaaggag t ggat aaaca aacat at ct C t cagggat aa gt t agagacg agt aaacaag ggt t gcgggc t gt t acgcgg aaaaat gaag gt gaagaaac at aacaat aa 420 480 540 600 660 720 780 840 900 960 1020 1048 <210> <211> <212> <213> 641 1953 DNA Arabi dopsi s t hal i ana <400> 641 at gt ct gct t agagt t t ct g t caagaaact aat gct t act t t cggact t t t t t gct gct a accacaact a t t t aat gat g agcaagaaat gt t cagt gca aaagaat t at t acgaggt t t t cggct cct c t ct acagt ga gct gt t t t t a aat gat gaag gaagct gcaa gt t t acaagg at acct cat t cacaat t aca ccat t t act t t ct ct t t ggg at ct ct gcaa aagacact cg caat aat cac cagt gct t t c t cgcggt t aa t t cct gat ct at t t caacaa acccct t t t a cact gccact t aat cat agc gct t ccat gc aggat gacat cagat aagt t t act act acc aaact t t ct c agaccct act ct caaat ct c at ct cat agt aggagat ct c gagt cgat gt t t ggaacact t ct gat gaaa gaagt cggat gact agt gaa agt t ggagga caaggaaact ggt t t caact aat t gt t gt g at caaagaga cacaact gca t t cgat gt gt gggcacactt t t cct t ct aa t acgt t ggcc caaacacttt ct cacaaagg t cgccagaat ccagacagga cagaaagt t a aagt ct gcag t t cagt t cgt gat t gt gt aa agat t t ct t g at agagggaa cct t cct t t c ccagt cgct a gcaaagaaga gggt cact gc aacaaact cg cct aat ggag cct t ct t cat at gt t t gt ac t gacct ct ct gt caaaact c ct t gccgt ga at at at t cat cggcggacca aggaagcggc ct cagt cgct t gt gt ct t ca t cccaagt t g ct gt t ct t cc cgcct ggaaa t ct cagt t ct ct t at gat ac agt t t gat t t gt caaggt gg t acaagt t gc t ggaagct t g t aacaggat a t t ct t ct aac cgacat ggt g gt gcgt cat c ggat act gt a at ct gaccgg t aat agt aca t t at gcat cg acaat ccat c t aact caagg t ccgccggt t aggcaaaaat t at t t gt gt a accaggt gca caaggt t at t at t t ggacaa cgt gaagaga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 546 12689250 Sequence Listing.txt ct at cgaaaa cat caggaca aggt gagaaa gagt t caaga acgaagt t gt aagct t cagc at act t gt ct at gcagagcc at t ct gt at c aacat cct t t t t t gaaat ag t ct cccgagt gt ct t agt t c agt t t t ggaa ct cgt ggat t at t gcct t at caaat gct t a cggagt aat c gct gcat cga acagaaat ct acgagt t t gt agct ggat t g t ccat caaga t ggat gct ga accaaacaga at gcgat gt a t t gagat t at act t ggt t ac ct t cct t t cg t at gt gt cca ct act agct c at gaacaagc t t act at t t t ggt caagct t gt ct aat aaa gact acacgg t t cacgact c t at gaacccg agcccat acg t ggccaat t t aagcgggagg at at act t gg agat agt t at agaagat act gat cgccct c aggt ccat cg ct t ggat t t t agcct t gat t t acaagat ca acaat cat ac aaagt t gcag agaagagt ag t caat gaaat aaaaat agca aggct at gga caaagaaacg gaaaat cgt c gct gt acct c at ggat aagt gt t t ggagag act t ct t gt t t aggaggaat at cgagacct at t t t ggaat t t ggaacct a ct gat gt ct a gcct ct at ca gt gacgggt c aaat aat t ag caaccat gt c aaccacct gg ct t ct ct t t g t gt t gt ggca agaagagaaa t gat t ct aga t gct agagga aaaagcgggt ggcgaggatt t ggt t acat g t agct t cggt gat ggat gct accact agac at gcat ccat agcgat cgt c at t t t t ct t c t t ccat t gat 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1953 agct cct cgt t aa <210> 642 <211> 678 <212> DNA <213> Arabi dopsi s tha i ana <400> 642 atggccgaga aagaagaggg tgtgaagctt agagt cgaga t ggct ct caa gct t aaaggc t t ggt t gt ca agagcccgtt gct act t caa t t ggt t caca at ggt aaaat attacccgag acat ggacaa acaat cct at t ct accgcaa tgggccaaat ttgtcgatga acaagtcaca gagaagagaa t t gacgt t gc aatcgaagaa caaatcacgg gaaaaaaatt attcggcgga ggaagt at ga t cccgt t t t g t ct agct cgg ccagaagaaa agtttccaga gctaaacaga gt gagggaat gcat t cct ga tagagagaaa cgaatcaaag cggtctaa <210> 643 <211> 2034 <212> DNA <213> Arabi dopsi s tha i ana <400> 643 at agggt ct t gt gccgt acg ct caacccgg t cacaact ga agt ccat at g at gat agggc gt t caagaac gagacgat ag gct t gggaag t ggat caaga cat at t gagc gggcaagccc at t act t gga tttacaaaaa t cct t gaat a acaaagccat t caggt cact t aat t at gt t gat t ct t gga gt at gggaat at t t gaaaga acat gat gaa at t cagt cgt cgaagat t at agt ccct gt t cat cgaccaa ggct cgat t c ggt gaaat ca gct t gagaat cat ggt cgt g t gat at gat t aat t gagat c aat t gt agga 120 180 240 300 360 420 480 540 600 660 678 Page 547 12689250 Sequence Listing.txt atggttgact tttttgtttg tggacgtggt agtaactatt ctggagcaga at t gaggcac ccaaagt t cc gagat gact c ttgagaagca cat at t gct g t gcct t ct at ggt cat acca agt act gaag gct ct gt act gacaaggat g gacgct ggaa gt ggaccgt g gcacat gt t g ccaagt ct t a at t gggt at t tgcgaccaag at t aaagagt aacat t ct cc gat aaagat a ct t gccgt ca at act gaaat aaacccaact t caagt ggt t aaaaacagac t t t gct gcag ggcagggcaa gcaat gcaaa gcact cat cc at gcccgt ag gt caccat t a cacgt cat gc t ct cgaat t t gt t gt gt ct t aaagcggagg gt act aacct cggagatttt at ggaacgcc ct aaat gggg tcgagcaaaa aagt t gt t ga agagt gaggg acgccat t ga ct cct t t t ct at aaat t t ga aagt gaggaa ct t t gaaggc tggacgagca at aaaggact at ggt t ct t t t t at aaaacg acgt ggcggc cgaaacat ct t gaact ggga tacgaaacaa acat ct t t ca t cgaat ct gt at t acgt caa gct t t acaat ct t t ggccac gt t ct gt t gc t caaat cct t cat t cct t t t gcat t at at c t acagcgct c at gaaccaca t t gat gat aa agt t ggcacc t aaat t gt cg t ggt ggaat g aat ggaacgt t cat ct ggaa ct caagccgt ggct ct t gt t act gaat cca aggacgct at t ggaaat aac agacct t gt g gt t t aact t a caaaagt at a agat gaagat at gcaacat c t ccaat t cat t t gt ccagct aaaaaat gag gggt gt t ggg ct t t gact cc aagcggct t a t gagaggt gg t aagt cat t a cgct ct t ct c accaggt ggt t aacccaact aacaat at gt acat gt ggcc t ggcgt gat c t ggt ggat t c acacct t ccg ttcaaaggag tttgacgaac acgt cacct a gat ct t t t cg agt aat gggg gt caagagca ct agt gaagg cagacaccgc gcat cggt aa cat gt t ct t a t t ggagat gg aagggaat at aaagcaattt gat t cgaaat ggt gt cct t g ggt aggact t t t aaaccgat t cggct gcaa t caaaat acc gcct ct ct t a caagat gt gg at t act t gt c agagct aggg acact ggcgc acaat acagt gt ggt agcgg t at at cagcg ct ct t cat t t act ct t at t t t t gccct t ac act gcaat t g t t cct t t t cg cccagt t ct g at gagt t t aa cct aat gt at cgggcgacac ct ct ccct gg aaaaagaat g at acaggaga agat t at ct t t acat gt ggc cat ct gct t t aggacgaaga ct acct gt ct ct t cct t gt a tgaaaaccac tacaagggaa at gt t at t ct gt ct t t cat a caacaaaggg aaaacgaaca t t ct t aacag ct gcat at at acgggaat ac t t gct agccg at at t gccga t ct t at t at a cagt gccact ct ct t gt agc at agcaaaaa t t ct gct t t t gggcgcagtt t act t t t t t c cccat gt gaa caat ct t t at t ggat accag caaaaaaaaa ct gaaagct g t ct agat aga cgaacct gt g t gaagat gt c t ct ggagaag t t caat t ct t t gaat gt cca t act cat ggc agct agt ct g t ggaaat act ggt gaacgcg cgaagccgt a agacgacaac caaacat ct t t gat gaat at cggagcat cc t gt t t at gt c t t acgaaat t act t ggt cag gt t gat gcat acct t t gcac taaccacgag gt cagaggt g cgct at t cac agaccct aag cacagt gacg accaaact t g t gacat ct t g gggt gat ct a at t act at gc at ggct t t t a cct t ggccct t t t act aat c agggggaaag t t aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 Page 548 12689250 Sequence Listing.txt <210> <211> <212> <213> 644 858 DNA Arabi dopsi s t hal i ana <400> 644 t ct t at ct t c ct gat at t ca gaaat at gaa ct gat gaagc ct agcaaaca t ct gcct t ga at t gcgat ga ccact ggaat act t t gaccc cat ct cct t t agaaagagca accagcct ga t ggct cat gc ggct t gagt a ct at ggat ga t t ct ct t ct c gat t ct t act gat acaat gt t gct ct ct gc ccaacgcctt gaaggcagct ggcgacccat ccgagt t gct at ct aat cag gt gggct acc act cgat ct c t caagaggct t cat t cct ac ccggt at gat at ct act a ct caact t ga t ct caggaaa gat gt gt gt g gct aaat gt g t t t ct t gact t t cat at t ct gcgccaaat a ct t agt t cca cagagt ct ct gat gaat t ct ggggagct gg ct accggt ag aacagacct a gat gaagaag at acacagaa accct aaaga agaaagct cc acgt t gaggt ct ct ct caac gt gt agagga ct cgct ct gc ct agt t gcaa ct aaaccgcc t cagct act c at t ggct t gc ccgaagt t cc t gaagt ccaa agcact t cct acaaaact gc aaat t t ct gt ggccacgctt t cat gct gct t aaat t ccct t agggct ct g t aat caccag t caagaagt g aact cagcaa t gat ct t gac agagat gggt cgagct t t cc t gt acccaac agt ccccgac t t t t at t cct t t t t t t t t ga at at gt t gt g aat aaact cg ccct gcgaca ct ct gcagag aggt t ct t ag gaaaagaat c cccgct gct c t gcagt aat a ct gt t t ggt g t t t t cacat t aagaagcaga ct aggct aaa 120 180 240 300 360 420 480 540 600 660 720 780 840 858 <210> 645 <211> 2283 <212> DNA <213> Arabi dopsi s tha i ana <400> 645 atgaagaaga cgat t caaat cct cct ct t c ct ct caat ct cct ct gacgg cggcgt t ct c cgtcaattac tcgaattcgc cgaacgaagc ttcgagaatc cgagattgcg aaatgcttat ctctctgatc caaacaattt cacttcgaat ggagttttct gttctccggc gcttgataat ctcaatcacg cagatatcgc tggttattta gctttgtttc atgttaattc aaaccggttt cttaagcttt tattcgagct tgatcttagt gttgtcttgc aattaccgtc gttgaagttt act gt accga aagagctttt t agt aaagat ttccggtttg aattaccgga gaattttggt t t ct t ct t cc t ccgat aacg gt caaaat ca at agct ct ac t ggat cggat cggaagat t c cct gaagagc t gt ggt act g aacaaccggt t t agat ct cc ct t gacgcga gat t cgccgg t cat caat ct aagt ccgt ca ccgt t gat cc aagct t ggaa ccaat gt ct g gt accgt cgc t t ggt t t gt t taccacaccg t cgct gggaa ggt t t aat ga t t t t cat aaa t t t cggt t at caccaacgct cat t caacgc t t ct ct aaac acaagcgatt t aact acacc cggaat cgat at cagat ct t gt t t aaccgg gt t t ccgacg at t t gaagga ccat aaccgg t gt t t t ggcg 120 180 240 300 360 420 480 540 600 660 720 Page 549 12689250 Sequence Listing.txt aat aaccggt t ccat ggt t g t gt accat cg agct t ggt gg agat gaagaa at cat ct t ca aacgt gacgg ggt gagat gg ccggcgagt a accggagaag ccgggaagac gt aaat t gt g ccgt t accac cct acgct t a ccaccgccac ccgccagt at t ct ccgccac ccaccgcct c ccacct cct c ccacct cact agct caccac cct cccccac ccaccaccca cct ccaccgt ccgcct ccac ccacct ccgc t ct ccgcct c gaagaat ct c cacagcccac ggaccact ac tga t gaacaat gg t gt t t gacgt t t t cggt gga t t t gt cagt t cgcct gt gt g ct gct cagag gat cgt t t ag cgcct t ct t t ct t ccccacc cacct cct cc at t ct cct cc caccat cgcc cacct cct cc cgcct t ct cc cgccgccacc caccaccgca cgat t t at cc ct ccggt ct a gt at agagt a cgccagt ct a caccgccggt cat ct ccagt ct ccaccagc cacct ccagt caccggt cat t ct t aat t ct cagt t t t aat gcagct t aat accgaagctt t t t gaggt t g gt ct ccaggg t t gt ggccgt gccat ct ccg acct ccgt ca ggt at at t ct accaccacca gcct ccaccg gccggt at ac agcaccaact accacaattt t t ct t caccg at at ct gt ct t t cccct cct t t cacct cct ct acagct ct t cat t acagc acact acagc accggt agt t gat ccaccaa cggcgt at ca t gt t t accgt gaact t gt t g gt ggcgcat a gagaat t t ca ccggagt t t g caat gt aaag t ct gt gt cgc cct ccacct g ccgcct ccgc cct ccaccac ccgcccccac cct ccgccag t ct cct ccgc ccagt t t at t t ct cct ccac ccgccgcatt ccaccgcccc cccccacct c cct cct ccac ccgccacct c t ct ccgccac t ct ccaccac caccacagt c agcccaccac t acgcat ct c ct gat at cgg ggccgt t acc at at gt t gt c ct t at agt t a at gat cggag cgt t t t t gt c ct cgt cct cc cgccaatttt ct gt t t at t c caccgccccc cgcct cct cc t ct act ct cc ct ccgccagt gcacccgt cc cacct gaacc cacct ccacc caccaacacc ct t gt at aga cagt cgt t ca caccagt ct a caccagaagt cgccaccat c caccaccgcc cgccat ct cc ct ccaccacc t ct t aacgag acggt t aaag ggagagt gt t ggggaagatt caat t t ct t t aaat t gt t t g t cgt ccgccg ggt t gt aacg ct caacacct t ccccct cct accgcct cct gccagt at at cccaccacca at act ct t ct accaccccca t t act act ac accacat t ca agt t t ct t ct accaccacca t t at agct ct t t acagct ct ccat t accat agct ccat gt cat ggt t cac t gaat at gaa gccgt t ct at 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2283 <210> 646 <211> 513 <212> DNA <213> Arabi dopsi s tha i ana <400> 646 atgcggattt tgtgcgatgc ttgcgagaac gcagccgcaa tcatcttttg cgccgccgat gaagctgccc t t t gt cgccc ct gcgat gaa aaagt t cat a t gt gcaacaa gct agct agt cggcat gt ac gt gt t ggt t t agct gaacca agcaat gccc cat gct gt ga t at at gcgaa aat gcacct g cct t ct t t t a ct gt gagat a gacggt agt t ct ct t t gt ct gcaat gt gac Page 550 120 180 240 at ggt agt ac agggacaatt at t gat gacg aat aacaacg gt t ggaccct at gt t ggt gg tgcagaacca aaat gat t ga ggat t gat gt ttaaacgaga 12689250 Sequence Listing.txt caagagaaca cacggggata agcctaaaga aaacaatacg aagagtctct acaaatggaa atggtgaagc caatgggaag t ct aaat gct aat ccacaaa gagt acat ga gccat cat ca aaataacgag aacaatcacg agcctgcagg ccttgtacca gtctgagaag t ga 300 360 420 480 513 <210> <211> <212> <213> 647 693 DNA Arabi dopsi s t hal i ana <400> 647 at ggct t t ct ccgat t act t cggaat ccga t ct aaaaat t gt aaaggaca at cgt t aacg ct gt acgaaa ggaggt caag gcagagt t cc aagt t ct t ga aaat t ct t gg t t ccaaat cg ct t acgcat c ccgct t t t ct gt aat ggggt t cagt gt ct a t cgat ggt aa t t gcgt caag aat acaagaa aacccggt t c ct at at t cga aat ccaat gc t t gat aaaaa aggt accaaa at t ct ct aca cggcccat ca t t cagt gaaa t gcaagggct t gat gt t t ct at gt ggt t t g ccaaggattt aaaccct gag caaggt t gat t ggt ggt t t c gggcaaagtt t t caaaact c ccct t caat g t t gagat t ct t cgt caaat a gcagcagaga ct ggacaaat acat cat caa gagat t ct t g at caagcaat gt gaat ggac ct t ggt gat a gt cgagagat t aa gt t t cgct gc ccaccagaac gt cat aggt t agt ct gt t ca tcaaaggaaa at t act caga ct t t ccct t g t t gct t gcac caagcacagc t cat caaat g accct cccac aaat ccgt ct at caaaaact t ct agt caaa cgat t t cact acct t t gt t g gct t t ct caa caat caat t t ccggt t caaa t ccaat ct ac gaact t t gag aact t ct cct 120 180 240 300 360 420 480 540 600 660 693 <210> <211> <212> <213> 648 874 DNA Arabi dopsi s t hal i ana <400> 648 agccact at g ccacaagaca cggt gct t ac aggaact t t a aagct act cg gagggaattt gaat gt gcgg ggaagaggt g cat ct acgat gcagt ct cct gaaat t ct ct caaaaaggca gt ggcaact t cct t t t gt gg gact t caggg gt caggt t ca gt t t at gat c at gaaagaga cact ct caat gt ccaaat cc gcggaaacaa caat t ccagc at cgagaaga cacat gat t c taccaacgga t agt gaagca gggt ggaaga ccgct gt ggt at cact caaa ttggaagaga aact t t gct t cgggt at t ct agcct t caaa gaaaaacgac t aagt t t gca tggaaagaac ggt t t ct cac gct t gt t gt t aggcaagct c ct t gct gaag t act at t acc gat agat act at ccat gaag gcaccaaacc t at t acacgt caacaat ct c t act t t cat c t t gt gggagt agat accaaa cat cagact g t gcaact gca t aggt cct at aagt agct ac t t gagt at gg 120 180 240 300 360 420 480 540 600 act aagaact cct at ct at g caaccact t c ct t t gcaaca gt ggcagt t g gaaacaacag Page 551 at act acact t caagt t gt g ct cct at at a t t t cat at t c t t cagt aat t ct cat agt t g gccgact ct t t ct t t ct ct C t ct at aact c caat caaact 12689250 Sequence Listing.txt gagcaaatga gagaaggtgg aggaaagtga aaaagcagct t gaagat cct tcagatttga caaacacaag aaacat ct t a tctgtggtta caaaactgtc tgtagataac aatttgatat caacgatggt tttggcattg tgagttagat tctgagttgg t gcaaat aaa gt gg 660 720 780 840 874 <210> <211> <212> <213> 649 863 DNA Arabi dopsi s t hal i ana <400> 649 at cat cct ct ttagaggcaa cagcat t gac ct t t t t t ggt t t gat gt t ca ct acat cat a cagt t cgaga at agcagat t agt t t gagca gt gat gaact gggct cgagc aaact aat cc at ggt aat at t t gcaat t ct acaggt at gt t cct ct aaat gcaaggact c ct cagaat ca cgaggct ct t gaggt t ct t g cct ccgt t t t t ggt ggt gca t cct gct gct t at gaaagt t gaaaggt ggt acgcat at t t t t ggagggt g cgaagcagac cagagacaga aaagcat t gc t ct ct t ct t c act cacccac ggct gt gccg caaaaccct c cagagccct g gcggct cat c gat ggt aat g cgact at ct g gt t at aaagc ct t ct gaaaa aacggt ct t g aat t gt agt t aagaacaat a gagaaagat c taa t ct t t gct gc aagt at cct c t t at agccga gacat cgt ct at cagcagca gt gt t gct aa agagcagaat aaat ccct gt caagacct ac gt gt t gaaga t gaacat t ga ct agcagaga t ct ct aggaa gat at gaccc tacagaaaaa accccat ct a t aagggat t c aacgat t t t g at t cgagt t c t cact at gga t gt cgt gact gaaacagt ca t aaaggat ct gaggaaagaa t t gt aacgat t gaaaat caa gagcaat cct t gact at gac gaat caagct ct cat ggaca aat gt ggat c cggat ggaac cagcat t t cc ct aat t acat aaat ct agt g gaaact ggca ggaat ggaag gat t acgaca t ct t cat ct g gt gt t gaaga gcat ct cgt a cgcagcaatt 120 180 240 300 360 420 480 540 600 660 720 780 840 863 <210> 650 <211> 735 <212> DNA <213> Arabi dopsi s tha i ana <400> 650 atgccaaaaa tcttgtggaa aagcctccat tattcttcgc cgtgtattcc tccgtcctcc cgt ccct cca t cgt t ct t ct caat aact t c cacccccatc gtgtcattga cttaccttct tcct cct cat ccacgt cat c at at gaat ct gcttcccgtc gcttcttctt ctcttcccct gaaccccggt caagagaatt ctctgataat ct t t gct t cc gccgat cccg aacct cct t t t cct ccacca gat at ct ct c ggt cggt cca t acgat gat g Page cgt caaat ct acggt at cat accacaat ga caaccact cc ct gat gt at c at gcaat cac ccact at cac caccaaat gc ccaacct aac caaccaccac ggccgccacc cgccgctttt cgact cacca gt cgacaaag 120 180 240 300 360 420 aagaagaaaa ggaggaaccg t ccat gcaag gacggt t acg acacacaagt cgccggat at 12689250 Sequence agaaggt t t a t gat aat t cc gt gact act a ccgt aact ca acacgt ggac t caccggat c agatgattga cgccgccatc gacgccggag at t t ct t gga t gaact gct a ct cact t at c tcgtcatcag ggctttctcc gacatattgg gct ga Li st i ng. t xt ct act acgag gct t at aagc cgt t aact ga ct t ccggcga aact t agt cg tgatccgaac t ct ct t t aaa t ccagccgac t ct cact ct t gtcggaagaa 480 540 600 660 720 735 <210> <211> <212> <213> 651 1920 DNA Arabi dopsi s t hal i ana <400> 651 at gt t aagaa gct gat ccgg t t agcgt ct g t ct gaagaaa gat at t gaaa gggat t gat g ggt acgt gga aacaagacag cagat gaaca ccat ct cgt g aacaaat cgt gagt acaat t aagggggct g act gaaggt a at cgcgaat t gt t t at ccca gagcagaact agt cat t at g gccgt agaac ccccgagaag ggcgat gat g aat at gt t ag gt ggaagaga at act gat gc ccaact ct t g at gat ggt ca ct t ct gt t ga gt gaagacaa gcaaggacac caggaagcaa at gat acaga gcacagggaa aaacagtttt cggt t ccgct t at cagat ca at gcat ct gc caagcat cct aat t t gt t gt t t gagcat t a ct gat acct g t cacact t ct gt t cagaat t gaat gct gga gagat t ct ga gt gct gat aa cgaagact ga t gaggcat ca agaagt t aag t gt act t gt a gat gagcct t t gaacgact c t at gggaat g gt cagat gag t gt t acgt ct agaacaggaa t t acct t t t g at at gat gac aggcaaagac t gat acagaa agggaat ggt t ggcct t gat ggt cagt ggt ttcaaaaggc aagcagggac t gt agaact t ct ct t ccaat ggt t cat at g tgagccaaag ct act gcacc ggat t t gat a gcat aaggag gagcacgcac agcat caat g tcagccaggg gt cat t ggac aaagt t gt gc gat t t gagt c gcagagat ca aagcaaagt g tttcccaaga gat aat aact agcagt gaca accgagat ca gagt t caaga gt gat t caca gcaaaggt ct aat gacaagt tcagaagaaa ct gaaagagt ggaaact t ca t gggt gagat t t gagt t t aa t ct gt ccagg aagccaat gc agagagaaag gcgaagagt a agcagaat gc t t gaacct at cct t t gat gg ggaat gat ga gt agcacagt agat aaat aa aaagt gagat ttttgaagca at gagt t ct t acgt t t caaa gcagagcgt c ggat ggagcc t gt ct t ccaa acct cagaaa ct t t ggt aaa t t gagct t ca cggct t caaa act t aaagct t agaagt ct a acaacaaaaa agcaaat aga agaaggaagc gt aat aagct caggggacac cgat t ct aga accat t at cc t gt aaacct g t t cagggaat gaaagat aca t aacaat aca aagt caat t g ggagcaaaat ggt t gat caa caat t ct cga tggagggaaa caaggaagca t cct t gt t ct t accat caag aggcaccct a cgt gaagcat aaact t t at t cggt gt ggat t gcat acaaa gt cact t gaa cccaccagag gt cagagccc agt act gaag gagat act t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 Page 553 gaggaact ga agagaaaagg gggat ggt ga gagaaagcag acaat gaagt gcggacgt aa gcaggt aaca t ccat ggccc acct gagat a cgat t gt ggc gt gaagcaga agaaagagaa ct t ct t at at gt aaaagt ga at t t gcgt t t ct ccaat t t t 12689250 Sequence cgggaacat a t t caaggaga t t t gagact t gact t ggaag ggagatgaaa gagtggcgaa ggaaaacat a agacagagt c agt ct at aga aaaat t gaaa aaattttttt tgtttggaga ggat t gcgt t t cccat ccct t gt accgcct t ct acat cac Li st i ng. t xt t ggaccgt ga gt at gaagga agagagt gga t t gagcaaat aaaat gggca aact aat cat ct cct cct cc acaaaggaca agccggt gt g aaggcaagag ggcagagat g t gt at t cgat t aat t ct t t t acaat gt ggt t cct cat cgt aggcccat aa 1500 1560 1620 1680 1740 1800 1860 1920 <210> <211> <212> <213> 652 1140 DNA Arabi dopsi s t hal i ana <400> 652 at gcaat caa t t act cacca at t t gt at t t ccaacat t t c t caccacct c t t gt ct cct c gct at t acac accacaccac acgccacct c t ccaagaat g ct aaccact g gcagt t act t ggcat ct t ct tcaccaacaa acaacaccac t gct ccaaga ct t ct aacca gat gcagt t a t t t ggcat ct gaat ggcct c t cgt t agagc gt aat ccggg aaccagcacc cagt agcgac cgcaaaccac ct ccact at c cggcact t cc ct ccaccagc at accgaact gaagagcaga gt t t t t gt aa t t aaagt t t g t ct caccacc ct cct ccacc at gat accga ct ggt agggc ct t gt t t t t g t ct t t aaggt aact act at a agat aaccat accacct ccg gccagccaat cacaccacca accacct cct acct cct cca tccaaaaccc aat cacacca aaagat at gt accat gt t gt at cggt agga t ggt cgt agg accact t cct agct at t aca act aaagat a agagccat gt t aagt cggt a t t gt ggt cgc at cct t t t cc t ccgt t t act ccgcaacccg gaccaaccac gcact t cct c ccaccagct a ccagct at t a ttaccgccac ccact at cac gccggaat t c t ccat cat cc gcaccacgtt at t ccacaag ccacaaactt ccaccact at t gt gccggaa t gt t ccat cg ggagcaagac aggat t ccac t ct cat t cag gccct ccacc at ccccaacc cgcct ccacc caaaaccctt ttacaccacc caccaccacc cgt t gt ct cc ct cct ct ggt t agcaat t ag gaaat gt gt c t t t ccct ct c gct t t agct g taaaaccgcc cacct cct ct t t ct agcaat t acgt aat gt gt t t t t ccct aaggct t t ag cat cat t cca accacct t gt gccaact cca acagagcaca accgccaccg acct ccacca t ccact agcg tccacaaacc t ggaat at gt t gacggt ct t t gat ct t gat t cccaat t t c t cccggt cca tccgccacaa agt t ggcat a t agt gacggc gt ct gat ct t ct ct cccaat ct gcccat ga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 <210> <211> <212> <213> <400> 653 657 DNA Arabi dopsi s t hal i ana 653 Page 554 12689250 Sequence Listing.txt atgaatgatt ccacgaggac tgaagtatcc attgataaat cagaagtaga gacaagat t g gaat ccacag t t t gcat gct gagaagt aca caagagat gg ccaaaat ct a act ct t aacc t cat cccct a tccgcaaaag gaggacacag <210> 654 at t t t gggga ct agacaact at gcagt t t c aaagaccaat t aat gt t at c t t ct t gct gc agat t agt gg t t gct caagc at gt cgccga at at at at cg at t at gcaac t gt t cgcgac cgt t aaat at gt ggat aact gaccagcgt c t gt aagt gga gcaagt gt t t ct at t t ct gg t gt cacaaaa cgat ccaacc gat t t t gagt at t ct t gaaa aaggat ccgg agt ggat t ag t t gcgaat t a gat t t gat ag gagcat gaag acat cccgcc gact t gaccg gat ccaaat a gcacaagt ag tcagagaagg t t agaagt t t agaat cct ac aat ggacagt t aaaagt caa agt ct t ggga gcct ct t t gc ccaat t t gac aggt cagct t caagct t gt t t ccgcaggt g aaat cgt gcc t acaggt cgt cgt gacggt a gaaaggaaaa at ct gaat t t ct t at ct t ca t gct t ct gaa aact cgaaaa cat ct aa 120 180 240 300 360 420 480 540 600 660 <211> <212> <213> 2448 DNA Arabi dopsi s t hal i ana <400> 654 at gagagaaa gt t gct t t gg agt t ct ccag caacat cggt cgaaacaacc cagct at t cg t ct aaaacag gacggagaag ggt at gaagc accct aaaag caat t gat ct ggct t at cct t t cacgt cga agat t ggt at t ggat ct t ag t acgcagt t t aagcct aaat at t ccaacca gat act t cat t t cat t ct ct at t at aat gt accaggtttt t t ct t ggct t ct ct ct acgg acggt gaaca caaat aat cc aagct gt gt t ttgggaaaaa at ccat cacc taaggaagaa tcaccggcgc gt gcgcaaga taaacaacac caaacact gc gt gggat t aa caggt cgaaa act gt gaaaa ggt ct t ggt a gt t t t cgt t a cat cact cca ccaact t ggg gt ggt acat g cacat ct gga caaagcct t g t ct ct t gaag gt ggcagagt ct t caagaca aggt gat t t c t ggagat t cg t ccagct at g agt aaact ac agggaaact c accagaagat tagcaaaaac gt ggaat at t gaaagat gca t gat gcaaaa t ct ct gt t ct aaagaat t cc t t ct t t agt c gaaccat t t g t t ct t aaact t ggt cat cat at at cat gt t t t t gat t at c caaat ggaat acgt t gt ct c agt t at t cct ggacgt gaga t cgt ggact c cat aggt t t a gagt gt gact acacct t ct t tcaagaggag t t t gt gaagt aat gaaat ga t aat t t cat c t t aaggat gg ttgaccaaga cagt cgt t t g t gagt t ccct cat cat cat c cagggaat ct ccat gaat ac ggt ct ct at c t agacact cg at cgct t agg at agct t at t cacggcacag t ccagt cgaa act act ct at gt t cat gt t t cat acgggt g t t ccaggt ct cact t gagga t t ct ct ct ct agat accct t agagcaacct ggt ggccaac t ggcgat ct t aacaaaagct cat at ct agt aat act cgcg at ct t ggaaa aggt t t acca at ct t ggaac cgact acaaa aat t gt ct ca gcaaaaccag at gt ggagct acaagggt t c t gt ccat gag gaaat t acca t t gt aagat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 aagt gt t caa gt aact gct c t t gcacggct t at gcgaat a ccgat at t cg cgagggagga Page 555 12689250 Sequence Listing.txt aaaggt t gt t caagat gt t t ggaat ggt cg ttcagaaaga gaggact t gg t cat acgt ga gat ggacaag ttcaagaacg ggt t gt t gt a t t agact t ct aacat t at ca at cat acat a at ct cagat t agagt cgt gg gt gaaat ct g aat cgagggt t gggt t gaag at cccagagg gacagaccaa cct acacagc t t acgt t cac t gct t t ggt t acat aagaat t t ggat cggt agat aat gaa act t gcct at act t cct agg agat t gcggt aagt aaaact t t caaggt ga t t at at t t ga at ggagt agc gagat t t aaa t t ggat t ggc gt act t at gg at gt gt t cag t t cgt cat gc acagagaaat t act aagat g ccat ggct t c ccggtttttt aaaacgaagt t ggt gact t g gggt t t t gct ggt ggcaat t gagat at cga ttttgacaga aagaggaggt gaagaggt t a aat cgcaaaa agaat gt at g tgaaaggaga t cgt gggat t agct ggaaat t aaat ct t t t t t acat gcct t t t cggt gt a agat cat gat agaagt acca t at t cat gt t t gt t gt ct t g caccaaccgg t t ccat caca gt t gat at ga aaaat agagt gcagt t gt t t ggt gaaaat t aaaacgattt tttggaccag t ccgcgaact ct t caacat c t t gat ct at g agt acagaat ct ct at ct t c gt t t t act ag gggggagat C ccggagt at g t t agt act t g ct t aat ct t c gaggaagagt gct t t gct t t at gt t t ggga aacgt t cct g at gt t gcagg gagagtattc tagctttgga ttaaaggaag t ggt t gt agt tcagaaaagg ct at agccac t t t at aaggg caggacaagg gt aacct cgt agt at at gcc t ggat t ggaa at caagat t c at aat gat at agagt gaat c caat t gat gg agat cat aac t t ggacacgt ggt t ggaaga gt gt t caaca gt gat agt t c at at ct ct t c gacggt ag agaagt ggt g gt t t gcat gt gat t gaggaa t gat gat t t t t aagt t agaa agt ggaagaa gaggct t t t a t aacaaaagt gaagagaat g aagat t gaga gaaccct aag aagcacaaac acat t t ct ct tggcaagacg gt ggaaaat g gacaagt gt t aaaaccagaa t ct t cct cac at ct t t aagc 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2448 <210> 655 <211> 2049 <212> DNA <213> Arabi dopsi s thai i ana <400> 655 atggtcgact tgcgagcaat cttctggttt ccaacctgca t t cagagat c t gat t t t t t c cgagccatgc t ct cgt ct ct t cct t ct cgt ccgttcaagc ccggtccaaa catagcacat acacaggatt gttcggattg tatcacgagt aatcaggcag aagct at aga ct ggagt t ct aaccact t ga t t aat ggat c gctcgacgaa aaat at aaca cat cgt t cgg ccaaaccaac ttgatggatc gtgtgataaa caaagttgat gt ggt cat aa aaagccaat g gt caaggat a ggt ct gggga gt at ct cat a ggagact cgc gacat aat at t t gacggagt ggt t cat t gt Page gct ct t gt gc gt cct t acga at gaaggct t t gt gt agccg cct t gct aca t t t gt t t ggt gggcagagt a t t aaat ct ac acgccaacag t gt cgct gca cat aaacct c ct acaagact aggt act aca cacat gt cct acgct act ca t at t gagt ac at ggcaggct cat acaagag 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt ctcgggtcgt tcccgttcag gagcatttat gccatcgcgc aatgcaataa aagt t gaact gggatacaag t t ct t gggac t ct at t gt gg t acgct gt t t t ct gt t t caa act t gt ct ac gat gt t ct t a agt aaaggt c agact at ct a gcaagact t c aagat act ag at aaaaaggg ggaat t ct ct agcaacatt c aat t t cagag at gccacct g ggagt at t aa ggt t cagt aa t t t t t t act c agat gcat t c t ccacaat at ggat t t t t ct ccgt ct act a cct cgt t aa gcgagaaat g t aggct at at t at t cat aaa caat t gcggt ctaggaggag gaagacct ag ggt t cagaga aat t t t ct ca at gcgaat ga aaccat caga agcat aaaaa t t t acgagt t t t caact gga at ct t cat ca t ct t agat gc t t aat caaac agt at gt ggc t t ct ggagat gcaact t ggt ct t t t aact a acat t gggt t t t cgaat gct tcagggaaag gcat gt ct t t t t t acaacat tgcaaggacc t gggat ct t t t gt t t ccgt c aaaagct t ac gaggcct t at t ct acggat g act act t gat gcagggtat g t caaggt gag t ct t gt t gga t gtt cccaac ttggccaaga agat t cacgg cgagat gaat cgaagcaaat aaat ggacag t at t ggt ggc cacacat gt a t ccagccat t at t at gcgt t aact aat gt t at ccgagccg t act t gt t cg ct t aggat ag agct gt t t ca cgcgcagat g gtcgtgagca caat cat t t g ggt act gcat at ct t acagc t gt at t at aa t t t ccgaat g ccagagt t ca ct t ct cgggt aaaagt ct cg cgacacaaca ct cacaat ca ccaaaaat ag acaggaagag tt ct caacga aagaagaat a agt t t t aaaa ggggagaatt caagaaaat c t caat t act c aacccat t ag gt agat gat g at aat aggt c t gcgat ggga gcaaaaat at ct gt act gct caagt gaaaa caccagat ga gt cct cagt t act cagat gc gaacagaagt agaacgaggt t t t cggt t ga at cat t t cct t t atcgaggg t acaccgt ga ct gat t t t gg t agt t ggcac aat ct gat gt gt agct t cca cgccaat gca at gat aaaga ct gacgat cg t gcct gt gcc ccgagagat t cat cgat cac agacct aacc atgctgccgg t ct acagcca at ct acagga t gctct t gga t gggt at t t c t ggt aaaaat tacaaggcat t aat t ct gat t gct gccaag gct t ct t gt a aggagaagag ttttgacccg aat cact cga cct caaagcc t ct ggct agg at t cggt t ac gt at agct t t t cagat agat at cacaaat a t gaagt cat t t ccaagcat g tcaaccacct act gcct ggt gagt gt t aga 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2049 <210> 656 <211> 1875 <212> DNA <213> Arabidopsis thaliana <400> 656 atggcaacga agagttgtga attagtcctc tgtttcttcg tctttttcgt cataagcttc agtgcaattt ctgtttctgc acaaacttgt gacaatacta ctgggacttt catacccaat agtcct t atg ataagaaccg ccgactcatc ctctctactc tagct t ctaa cgtcacggct caggagggct acttcatcgg ttcgattggg attgctcctg accaagtctt tgccaccggg atgtgcgctc caggttctga gcgtgatgtt tgctctctct gtatcaggtc cacatcagag Page 557 120 180 240 300 12689250 Sequence Listing.txt agt t t act ac aaagct gt ct ggaccaggca gat gct t t ct t ct ggt cagg t gaagaaacc ct t t gcct t g ggagcgat at aat aat t t aa t cgt ct aagt t t gat gcaat gt t gt cgact t gct t t t t cc aagt caccgc aagt cct t gt ggaggaaaaa t ct t ccaaac gaagcggaat gaacgt aaca ggcact gaaa aagaacgagg t t t t cgct ac gat t at t t cc at cat cggt g at acat cgt g gct gat t t cg gt ggt ct gga caagat t t ca gaaaat cct g at cact ct gc ccgggt cagt gt t aat cct c t t cgct acgc tcaacacagg ct t ct agt at act act caga gcacaccaga at gat aat t g gat gggaagt cgcct t cagt t ggaacat t t t cgcagcaat ct t at act t t ct gt t caat t agct t ggt aa t t gcggt gaa t t gt t gt t gt aaggagaaga t ct t t gat cc gcat t act cg acct caaagc gaat ggcaag aact t t ggga caagcgagga cagat cgacc ct gt gcct ct cgaacagcaa gt t ga taaccgcccc agaact caat gat cgccgga cgat at agct cgt t t cgt ca t t gccgt ggg gt at ccct t c t acat ccccg t cagaaaat t agt t gt t gt t gaagt ggt ac t gat t t aaag aggt ggat t c gagat t gt cg t gcaaagct t gaagt t act c tacgaagaga agggat t t t a gagt aacat t gat ct t t gga gaacaaat ca agt cat t aga aacaat gt ct gcct cct gga gt ct t t t gct t t ct ct ggat acgaat caga at aact t ct t ct t gt gccgg gaggat t gt a caccaaggcg t ct ggt gct a t ct cct at ag gt ct t gt t t t act gt t aaga tttttcacga acaat t gaat ggt gaagt t t aaaacat cag caacacat aa gt ct at gagt aat cagt t gg t at ct t cat c ct ct t agat g gt ggaccaaa t t gcat gagc t acat t cat a acgat t cacc t t t t t ct t t a t gt t ct gt ag t gct cgt cat ct gt gt t t ga ct t cat cagg at t t t aagaa acact t gct t gcgt t at gt c t t gat cagat ccaacat aac gt t t ct at ag aagcaaaagc gct t at t t at cggcgacaag acaagggt at gacaaggt ga at ct t gt t ag t t gt t t caaa act ggacaat aagat t cgcg cggat at gaa ct gt t gccaa t ct t agat cc t t ggact gt t aaat gct cac ggaacggacc at gaagcaac ggat ccact a t at cgagt gg ggggaacaat cat at cagcg acggcaaaac aaggccaaac t aat t t acca caagaat gga t agaat at ca aagaaat gga ggt t t caaat caact t t t cc gct cat gaat agt agagt t c act t ct cggg caaaagcct c gcgacgcaat gct gaagat c t ccaaaaat c tacaggaaga ct t t at caac at gt gt t caa aaacagct ca aggat cgaac aat cacggac 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1875 <210> 657 <211> 930 <212> DNA <213> Arabi dopsi s tha i ana <400> 657 at gt ct ggt t cgacccggaa agaaat ggat cggat caaag gaccat ggag t cct gaagaa gacgat ct gt t acaat cgt t ggt t cagaag cacggaccaa ggaact ggt c t ct gat aagc aaat caat cc ct ggacgt t c cggt aaat ct t gccgt ct cc gt t ggt gt aa t cagct t t ct Page 558 12689250 Sequence Listing.txt ccggaggtag agcaccgtgg attcacggcg gaggaagatg atacgattat gct cggt t t g gcgat t aaga g gagaggaag gat gaaaaac t t aagt ccaa gt t t ct t ct t t cat cat cgc t gggt caat g gagaaggaaa at t aagacgg ggaggaggag gagt t t at t g gt aacaagt g at cact ggaa ggcagagt t g cgt t aaaacg cgggat ct ga ct t gt cat gt cggaggagga agt caacaac gagagat t t c aggt t aggag caagt t cgt g gat t aggt ag ggcgacgatt ct caacgct g t gat t t cggt gagggcgagt cgt cagcgag ttttaaaccg ggagaaagat tccaccggag t ggact t ggt t t acat ggcg t at ggt gcaa gat cgagt ag gcacggct t c aagcggaaat ggt aat ggag ggt ggaggag caat cgcaat acggcgagag ccgat gact t t t gt t t ccgg ggagat t t t a gat t t acagc ggaact aat g t caat ggt cg gt agcggcgg ggt at gat gg gagt t gt t gt ct agt ggat c ct ggt ggagt gt t t gaggt t tgaagagaga t gacggt ggt taggaaacgg gt cgt aat gt act agcgcac cact gat aac aggcggcgga t aat t t aact ggt gacggcg t gt t t t accg ggt gat t gag gt ct t t gcct agaagaagaa gcaggagat g cggaggagct agggt t t aga 240 300 360 420 480 540 600 660 720 780 840 900 930 <210> 658 <211> 543 <212> DNA <213> Arabi dopsi s tha i ana <400> 658 atgtggaacc ctaacaaaat tgaagaattg gcctttgagc aagacactaa aggcaacatc acttgcaatt tctgccgccg tgagttccgt gtccaccgcc gtgaccgcgc ctcatctagg gctagaagcg gccacggggg gatgttactc acact t at aa t acaat ccac ggcgagtaac caaaacccta gt ggcat t t t t ggt aat t ct cctcctcggc t t at t gaat a ttcgacagga gcgacaggaa catcagtgga tgagcttgat tga gaggat gat g t ct ggt acca t ct gct caag gct cat caag aat t ct t gt g at t gaaggt t ggt gacat gg gat gat gaga ct t gaact t c at gaat ct t g ct t ggcct cc cct t aggcgg gt t ccaccgt ct ccgccgt t t gt cccat t t t gaat ct t t a gcat t ggct c ggct agggca ggaagt caaa aagat ct t ac t cacat gaat t gcggct gcg gcct acaacg ct accaact g t gt agaagt t gat gaaagaa ccat ccaccg 120 180 240 300 360 420 480 540 543 <210> 659 <211> 972 <212> DNA <213> Arabi dopsi s tha i ana <400> 659 at ggct gt at cacttccaac caagtaccct ctacgaccta tcaccaacat cccaaaaagc caccgt ccct cgct t ct ccg t gt acgt gt c acct gct ct g t t act accac caagcct cag cct aat cgt g agaagct t ct ggt agagcaa cgcact gt ga at ct t cct ct gt ccaacgac caat ct ct gc aat cgaccaa gcct cgccct aaccgt gaga agct t gt ggt t gagcaacgc Page 559 120 180 240 12689250 Sequence Listing.txt ct t gccagcc ct cct ct gt c caat gaccca act t t gaaat cgacat ggac t gggt t gcag t t t gat t ct c gat ct aggt t gt agt aggaa agacggcaat ccact agacc t gcat at t gt cct ct cgt gg cat cat cgag ct ggat gaga agacct aggt aaccaagcat <210> 660 cgggct gcac at ct ct gcct ccggt gt ct a cccaaat cga ttgccaacaa t t gcat t t aa ttagccagca t cgcgt t gca caccgt at aa gt aaggt ct t cat ggagt ga aa cact t t gt t t cgaaccagct ccact gggcc agcat t t cag t ct acacgct cgaccct gt g at t ccat gct ggacat gggg caacaat t ac t gaggcat t g gccaaact ct gt ct ct t t ag t t agccggt t at t gat aact ggt caccaca ct ggct caag t t t cacggct t gggcacat g t t act t gt t t t gcat cgt ga gagat ggt gt gact ggat ag ct aaat ct gt at gcagggt a acggt gat ga agt ggcct t g t cat aacct t t t gt gt gcac gaaccaagag cacggagaca gt ggagcat g t t t at t t cca aagaaaccga t caccggt t a cat t ggaggg cat ct t agct gt caacacct gacaat cacc cacagt t ct t at t t gcat t t caagct t cca gcat gcggaa gaacaat gt t gct t ggggt g aat ct ccaac 300 360 420 480 540 600 660 720 780 840 900 960 972 <211> <212> <213> 1046 DNA Arabi dopsi s t hal i ana <400> 660 at gcagt aat agat gaagat cacct gcaga accaat gggg caat t gat at aat act act t gggagggagc ggcacact cc tacaccaagc aagagccttt aagggaacca agact cgaaa t t t ct t ggac gat ct ccat t ggagagt t ga agaaaaaacc gct t t gt aaa ct gat aaaac t at gat gat g t gct cagaca acact t aaac tcaaaggagg cacaaacgca aggagat gt g t t ct gaacat aaaagat gga cct ct ct cag cacagcacaa gt act acaga cat cct t ggc ggacact t ct gat gt t ccac caat t aaaat act aat aat c cct ccacaga at t aagct ct gaaggagt ag cct cact t ca caaat at t t t acact agt ga at aat agaaa cacct ccat g agct t t gct g at gaaggaga gt ggaagt ag t acat t ggt t aaggt gaggt ttcaagaaca gaccacgagc agt t t t acat t ct t at gt ag gat t t ccaac gct t ct t ct c t gt t t ggat a ccacat gcgc acaaccacaa accacgt ct g acaagaact a gagt ccaat a t ggt ggcaag aat t ggt gaa gaagaat cga cact cact ac caat gt caaa at t caagacc gt gt cgat aa t gt ct at t gg t act gt gt t g aaaacaggaa cat gt ccct c taaaggcaaa ggt cggt aaa at t gaat t ca t aat gt t gcc t acct t act g t gcagct gag t ct ct t caaa gct aaaggaa cacaagacac t cct cct t gc ggaacaagt a gt gt caaccc aaaagaaacc t t t gt t t aga t t gt t t acga ct aaaacaca at ct gcat t g aat ggaccaa t t gcaat ct c at acaccgt g at gt t ct t cg caaat gcat t ct gcacat gg at cggcact g gagagact ca at t gaacgt a tccgagaacg gaact act ca ct caacggcc ggt aacaaaa accct aat t a ct t gat at ac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1046 gatttccaaa aaaaaaaaaa aaaaaa Page 560 12689250 Sequence Listing.txt <210> <211> <212> <213> 661 813 DNA Arabi dopsi s t hal i ana <400> 661 at gcct ccat agt at cgcga at at gcct cg ggat act gcc aacact cgt t gagcaat t ac gat cgccgga t ggcggagat ct t cagt ggc cgct t ggaac gagccat gga gt t cat t t t g gggcagaccg t accgt cct c aat t t gcagg aaaat ct aac t t gct t gt at t t gat t cct g caat t ct t cg ggt cgaggga cat t t ggacg gagct agcat t aagt t t ggc ttagaagaga cgt cagct ct ggat gt t ggt cacggcgcct gagagcaat a cgagcgaaga t cggaagt gg gt t t at cgt t t gat cgt gag t gt t t t ggaa accaggt t ca at acact aag ggt gaat gat gct t caggca t t t cat caaa ggaagat gaa t cgt ct t cga t t t cct gct t tccgccgccg agcagct t ca agt gat t t t g act t t aact t aact ct agct gt t cct gat t caat t acgag tacaccaaag gt cct t ggag aggct t gaga gt ct cct ct c gacat ct t cg tacaggggaa t gat cacggt agaggaat t g ct t ct agaaa at cat cggaa caagat caag ct gt t at ct t ct gt t cgat t caaagat aac at cct gat cc gagagaat aa t t cgaaaat t at cgccggaa at cgt gt ccg gt gcaagcac t cct ct t t gt at accat aag t aat cct t cc gccat t gcca ccagcgaaag acat t caagg t gaaagaat t ct cagt t at t tcgacaaacc ct t gt ct gat 120 180 240 300 360 420 480 540 600 660 720 780 813 120 171 aaggtggata tattttggca tgaactaagg taa <210> 662 <211> 171 <212> DNA <213> Arabi dopsi s tha i ana <400> 662 atgcaacaag cccttcgaat tttagaggta cggtctgctt attgtaaaaa cgatttcgaa t gggat aat a t gaagcgcct cgcct t caag aacgt agat g at t ccaacac aagact cat g cgcgagt acg t ct t ggagac t agccat gt c gaaaccgat t ct gat aagt g <210> 663 <211> 1935 <212> DNA <213> Arabi dopsi s tha i ana <400> 663 at ggct gct t cagt acact g t acct t gat g gctcggccga aacttccaaa ctcgtctttg gctgctactc gattcaagaa ggaaacaaca ccaaccacta at t ct gagag agctaagcag gattttcaac caattccatc ttttgaagaa gaagttgtcc atgaagaatc tggtcatgtt t ccgt cgt at t t acct ggat accacaagag agaaaacaca t gct t t cct a ct t aaagt t c Page 561 gcaacaacaa t cgat gt t gt ccact t t gac ccat t gat cc agagcact aa cct t t cgt cg gaat cact ct t gt t caagct gt t t gat cca t t ct t ct cct agaacacaag t gt t cat t t g 120 180 240 300 360 12689250 Sequence Listing.txt t ct ggt ggt g agccagct t t t gat aat t at gacact agt g gt cct caaaa cacat t gggc ccaagat aca t gt gct act a gcgat t at cc t t ct t ggt ca gaagt ct aca act ggt cgt c ggt acggt gc t gggaggt t t at ccat gct g gt t t cgcgt g gct t acgagc at t ggagat g gagct cct t a aat gaagggc gagt ggt gt a ggat at gat c ct t ct t t gct gccggggt t a cat gct cagg caat t t gct c gct gat ggag aagat aacag gcaat cagac agcggagaac gct gct caga t t gct aagct ct caaat gt a gggagaagct ct t ccaacaa aggt caat gc aggt t cagt g acat ccat ga ct at t t at ca tcagagagac gagt t t t gct gaggat ccat act gggat ga gt ct gagacc ct caaggt ga cagggcat gt acgaggcacc acat t acct c at gt aacacc t agcat acaa cat gggacga t gt cgt t gga ccaaggt t gc aagacat ccg aaggaat gga agcacggt ga aat aa aaggaaggag ct acgct aag agaccct gag gaagcat t t g gaat at cgga ggcaaccat g gacacgt gag agcact t gag t ct gat t gaa gcgt t acat t t cat gct aaa cat t ct agac t ggct ccat t act aact cgc cccaat gcac at t ct acacc t gccat t gga aaaagaacac gat cgccgct t gcgct gagc ccccat gact acact t t t gc aaagt at gca t gct at gagt agt aggt gga t ggat t gat c caagggat ca t t t gt aagat gagct ggaac aact ct gct g t ggggagct g t ggat cct aa aaagt ggat g caagct gagc ccct t aact g t ggt gct t ag at ct gt aacc t at gat gct a cgagcgt ggg aagat t ccag ct t ggt cct t gct gccaat a ct t gggct ac cat gcagct g aaagcgcggt gct at gt ct t t ccat gt gt g gaggagaat g gaagaat t ca gaaat at at t gt cgggagaa t aact gagga cagaagt t gc cgat gat t gt t t gccagct c at acaat cat ggaat t cagc gaat t gct ga aaggt gt aga ccaagcgttt ct t accacaa agt at gat gt acgacact gc aaaaagat gt agaat at gca t gact act ga t t ggagcct t caaacaggga at ct agccaa t t gagt t t ag t ccat gat ga gaccaaaatt gt t at ggcag acat cgcaaa tgccagagag t gt t aat gca gct aggaaca aat gct ct ac acgaggacgg t ggt agaaag t at t gaagag ggat ct ct ca t gt gcct gt t gaat ct t aac ct at t t caca gaccgggat c ggagaacttt ggct ct t t cc t cagt t t gca gcaggt gat g gaagcagt t g t at t gcccct gggt acagct cgat gt gaag acagcat cca at ggat ggac aact ct t cca ct gct ct at g t gct gaagaa gaaaaccatt ct at gt caaa 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1935 <210> 664 <211> 684 <212> DNA <213> Arabi dopsi s tha i ana <400> 664 at gt ct t t gg acact gt gga caaact ggt g gt gt t ct t gg ccaagagaga cggaat agac aagctt gt ga agacat t cca at acgt ggcc aagct cgcgt gct ggcat gt t gaagccaca cgacccgaag ccgccgatag gttcaagaag tgggaggtcg cctccggtct cagccgcaaa gccttcagga ccgggagatc tctcacgggg ttcaatgcgc tgagacgaaa ccccggggca Page 562 120 180 240 accccgat ga t t t gat cat t at gagt t t ca t gcat ct t ca ccgaaagaag at gggaat t t ccct t t t gt a t ggt at agga t ccgt t t ct t ttct t t ggt t t at ccgct t t taaagcaaag aaat cggcgc cagccaacgt accacaccat act ggccat c 12689250 Sequence ggcggt cct a gccaat t cag atcaagaatt ggttcaatag tggcgaatct ttcggctaca actcaaatcc ttaaaaaagc caaaattagc gagattcggg t gcggat t t a ct t at agcat tacactcggt ataagcggtt gt ga Li st i ng. txt gggaaatggt acgccaagct ct t t ct t cat t ccagcat t c gagat at agt t ggcagaaat t agt t t cagc t t act t ct t c cgccaagaaa cat cat t gat gaccgat gaa aat gaggt t g tcacccaaac t t gggccggt 300 360 420 480 540 600 660 684 <210> <211> <212> <213> 665 2952 DNA Arabidopsis thal i ana <400> 665 at gat gaaag gagt t t gcgg ttcaagaacg acggagt cat aagt t cgggg t ccaat agt a aat aat gat t ct agacct t t cat ct cat at ggat at ct t t cct t ct t caa t t t ggtgaac act aaccat t at cgacct cc t gt ct cacct ggcaat t t ga ccaat t gcac accggaacac gaaaaccatt at t act t t gg t cgaat t t aa at at ccaaat gct at at t ac ct t ct act ag agt t t gagac ggacgaat aa at gt gat cga gtct t t t t ag t t at t ggt ca ct agaaat ca t t gt t gact t cgcat ct cac t t ggaaat ct t t ccat ct t c t t gt t ggt aa acaaaaat aa ct t t t at cct accagt t gga t act aaat ct t t cct t ct aa tcacgggacc aaaat aacca ct gt gt t acg t agt caacct t ct t t ct t t t acact t gt gt tttggaggaa cagt gat t gt gct agat ct t gctt ccacaa aat cccat ct t t t cagt ggt ct ct cat aac ct ct t t t aac ct ct t at ct c act t ggaagt aat t ccgt ct ttttgttggc ct ct gat aac cat ct t aaat aagaaaact g cat gagt t ca t ct ccct t ct act t aacggt cct t gggaat t aaggaact t ct t at cat t c gat cct gat c t ct t gt t t t g t gt t at t ggg agct t agt t ct t cgt t t t c t cact t gaaa agaatt cct t aat t t t agt g ct t t ct t at a accact t t gc ct t t t ccat c t cact t ggaa gaaat accat aacat t gt t g gt t aagt cca t caacat t at ct ct ccaact t ccct t t t ca agt ct t gggt aacaact t t a gacct t t caa tcat t t t t aa aaagt gat gc acagcaat at at ggt at caa gcct ccgt gg t aaccact ct ccct ct ct aa ct t cgat t gg gt caaat ccc acaat t t t ag gcct ct ct cg t caccgat t t at ct t t cgca t t t cact t gg gt gaaat t cc at aagct t ag ct ct ct t t aa t gaagt t gt t acat t ccct c t t gggaat at gagggccaat act acaacac t t t cct t gac aat t ct cgaa t cct ct gaag at gt gat gcc ccagct caat t gacct t t ct t ct caccact aaacct t t ct at ct t cact t t ggt agggt t t aat agt t t t gat cct cgat t ct cacct ct aaat ct t t ct at ctt ct t t t t ggcagct t c caat cggt t a tgacgcaacg t t t gaaaact at cct cat ac ccat agat ct ccaagggt t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 Page 563 12689250 Sequence Listing.txt gt t gacttta ccatct t ctc gcatctcaag t ctatcgaat acct t aacct aacaccact a ct agat ct ct t t ggt at t ga ct aagat ccc gt gcct ggat t t cat cggt t at gaggcaat gagt t gcct t act t gt at gg ct t agt ggac aaccaact ag aat gt ggaga ct acaagt t c t ct aagct gc ttct t t gtca gagaccat ga at gaat aaag ttttcgggaa cat gt t ct t a ct cat ggcac gagt t aggga ct at t accag ggcct t t at g t ct gacat gg gt t at aggat tacaaaccag at aact cat t ct acgat t ga caggcagcca t aagt cagt t aagagctt at ggt t at ggat t cgaaagat c tgt t t t gct c at ct cagcac gaaat at t ca tt ctt ccaga tgggaaagct gcaacaaaat t t gt cct t cg gaat cat t ga act ggact gc gcaacat gt a gagt agagat acaaat t t ga act t at caaa t t gagt cact aact cacgt a gaggt act ca gccct t cgct cgccagagcc t cat act t gg at t ggt t caa ag cat gt acgag t gt t t ct act gt act t gt cg gct gact ct a gct ccct gt t aacgaaactt caacaacaat t ct cgat t t t gt ct ccat at gaat at at t t t ccaagat cg cagt gat acg ct ccaat gca cat at ccggt aat gt t t t ca cat gagt aca ggagctggaa aggagagatt caat gct t t a ggacgt t t cc cct t gcat ac at t t caaaca t gagaaaat t agaagaagat t act gcct t g gaat cct t t c at ct t at caa acaaacaaga ggat gcggt a gacat t t cca t t gaact acg ggact cacct ttcacgggaa t ct aacaaca ct t caagct c gagagt ct aa t t aagt cat a t t t cctt tat t t t t at ggac aat caat t ca ct ggacgaaa gat t at t t ct cgt gt cct aa ccaaagt cca agt ggccaca caaaacaagc at gaact t ct cagaagt gct t gt gat at cc gaagaagagg ggat t gacat gt ccgagaca gt t t caagt t gt t cact t t c t caccgagt t acaacaagat t gaat ct t t c ct at ccaaga acat t ccat c aat t caat gg t aaat ct t cg t t t ctct t ga t ct ct t ccct ggt t gagt t c caat agagaa at ggt acgt t at gaagat ca act t t gat t c aagt ct t cac ttggt t t at t t cgcat cat c tttcaggaga ct cat aacca ct t ct t t cga at gggaaaac t gat aagt t g t t ggat gcat aacgcagaaa ct cccat t t g at t ggat aca t aat t ct t ca t ccaaagt t c caaaggt caa caacaacact accaccagct t t t cat at gt t t caat t cct acat aat cgt cgt cggccat t ggact t ct g t ct acaagag aacacagtt t gccagcaaat at ct aacgga aat ggt t t t g agt gat t gac gaaagagct t tatggggaac aat t ccacaa gct ggt aggt ggacaat cat accgcaacaa gat agcagct at t gt t t t cc cat aggcacc 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 2952 <210> 666 <211> 3036 <212> DNA <213> Arabidopsis thaliana <400> 666 atgcgatgcc aagtttggaa tgtgatagag ctgaacctta gctccagctg cctccatggc ctgctcaatt ccaaaagtaa tatctttagc cttcaaaacc ttcgttttct agacctttca aataatcatt tcagtggtca aatcttatct tcacttggaa acttttctag tctcaccact Page 564 120 180 12689250 Sequence Listing.txt ct t gacct t t cat ct cacat ggt aacct t t ccat t t t cac gct ggt gaaa caaaacaacc t t agct gt t g aat t t gt cag agct cgct ct cct t cat ct c aat ggaaat a aacaact t ca gacct t t ct C t t gct cgt ga t t ct t at caa at aaacaaga aact at acca gct gaaccag gt agaccgat acagat cgt c accgat cgt c tccgaacaaa aagaat ggcc aaccaacggg gct t t at cga t cgat aagcc t t cacgat cg caaat t cgca t t t t t ct ggc ct ct ct agca aat t t cagt a ccagagaat a aagct t ccaa ct gaaaat ca ct ct t gacct ct cat ct cac t t ggaaat ct t cccat cct c t ggt t ggt ga aagaaaacga at t t at cact ccaact t ggt t cct caacat t t gagt t t gg gagggt caat at t t caacac at ct t cact t gt t t caagt c gt t cagt t t c t ggt caagaa ct agct gggt cgt cggat cg ggacgggt ca ggat gggat g cccaagggt g at ggt gaaac aaagt ccacg ccaccat gga gagggct caa gat acgagt c t at t ggccaa gccaacaaca at gat t t at g gt act ct t ca t at ct gaaag gat ct t t ggt t t t cagt ggt cact gat aac cct t ct t ct C t t ct cat ct c at t t gaaaat aat cccat ct gt t t act gga ct cccgcaac gt t at t t t ac t cct t ct t t g gaat at at ct ccacaaat cc ccaaggct ca at cccat t t g act agat aca aaat cct gt a acggact aaa ggt ccggt ca gat cggccag gat gggacag gat cgagaaa gcccat ggat cgat cggaga gat at cgt t c ccggacggt g t cgt ccacca at t gaacgaa gat t aat ggt at t t cacggg tgacaacaaa agct ct acat t t t aaagt cg t cgt at ct ct caaat cccat ct t cact agg aaat ct t t t g aat t t cgt t g ct t ggt gct a accgat ct t a ct t t cacat c t t t t t t ggt a aact t t ct cc cagt t cacag gcagacgcca agt t gt t t t g t cat cact at at t t ccaaac at aaact t t a aacacaacaa t t ggacct ct acgaccgcag at act t cgt c gct gacact c accgat cgt c gccgat cgt c agt gt t cgag gcaat ct t ac t cggt aagac ggaccgcat g ttaaacgacc t gt agccgga t ggagt aagg acct t cat cc aaat ct t cca t t caat ggt t ct aagaaaga ct t gacgt cg t ct ct t gaag gt gacat ccc acaacct t gt ccct ct gt ga t caccaat ct gt t t t aacca t t at act act gcacgct t cc acgct t t cac act t gagt ga cagact t act t agt caacct gcat ct t ct C ct acaat t ga caggcaat ca gccaggt t gt ggaagaacga gagct gacga ggacggat cg tgacggaacg gaaacacaag cggt aaagag caaaggat ga gat ggat cat agat cggaaa ggggcggatt cgacaaacgg t ggcaaccat gct t t cat at caat cccacg at cat ct t ag gt cat aacca t t ct t aat gt aact t cact t t ggt gaaat c aaacgacctt cgat ct ct ca gt t agt t agc aaat ct aaca t cct aat at g t ggaact at c t aaccaact c cct t ggcaat t t acacact t ggat ct caag ct t gaat aca t at t t cagcc tttaggagaa gaaagt t caa at cggat ct g gggcagccag ggt cggacag gat aagt gca agaaagcacc ct cgacggat taagccaaaa cat ggat ggt ggt t cat caa gct aaggaat ct at gaat t a gt gcat t gcg t t gcat ggga t ggagt t t t t act agt ggga ggaaaacaac 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 aaaataaatg atacatttcc tttctggttg agttctctag aagagctaca agttcttgtc Page 565 12689250 Sequence Listing.txt ct t cgt t caa at cgacgt at acggt aat gt agct at t aca at cct aaaaa aagt ccat t g ggccacat cc aacaagcttt aact t ct ccc aat t gct ct t gat at ccat g gaggt gat aa at gat gggat aacaaaagca at gcat t cca cccat aacca t t t t act t gg gt gat t caat t ct t t acat c gt ct at t aaa cat ct t caat caggagacat at aaccagct ct t t cgagga ggaaaacaat gt t ggat agc acat at t ggt gaagcacaag t ggaccgat g ct t caacgga agaaaat gaa agt t gt gat g agt agact t t agagct t cat gggaaaact a tccacaagac t gt gggt cca gaat gcagga gcaagagt cc agct acaat t t t gct acaaa cagcacaact caacaaacca ggt t t cct aa ct t gcgaat c act t t gccat gat cagt t t a aat aaaggcc tcgagaaaca gt gct caact agagagct cg ct aggggat c ttaccagggg cat t t t ggcc gaaat gccag ggat t cat ac ccagagt ggt cgt t aa cagat t t ct t at ggagagt a t agaaat gga aat t t gaagg t gt caagcaa aat cact aga t ct cat acct gcact cagt t ct t cact t ga gat cagaaga ct ggt at t gc t cat gaacgt t gt aaat t gg cat gggt aca gat ggt acgt agagat t cca t act t t cact t gt t gct caa t gcgt acat g t ct aacacag gaaagt t t gt agat gaagaa ct t t ggat t g ttttggcaaa 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3036 <210> <211> <212> <213> 667 2673 DNA Arabi dopsi s t hal i ana <400> 667 at gt caaagg cat t caagt a cagat t caag gacacct t t a aggaagt gt c cgt t acgt t g aat ct caaaa t cct cat t ca ggt agt t t cc cact t ct ccg ct t aat ct cg agat t agaga agt aacct ga ccact t gt ac ggagt cat cc cgct t t t gca t t t t t act t t cct t cacgaa at ggagt ct g t cagt ggaac at ct ccaaaa gat t agaggg gt aacct aac cact t gt ccg gaact ct gaa ct t t t aacaa act t gat t ct cccggt t aac agaat ct aac ct t ct t ct ct t t t gcat t t t aaat t t t cat gt t cacgaac gt gcgat aac t t t gaagt cc caacaact t a ct t gt t t ct t aat gct t gct gggt ct aaga t ccaaacagt ct t t agt t cc t t cct ct aat t aagt t gt ac caacct ct ac cct cact ct g ct t t ct ct gt t t t act ggt a gagt t t gaca t cgacgggt g aacagt agcc acct cct ct t t cct ct aat g caat t agacc aagct cat t g agcct ct t t g t cact ccct t ggct t t t ct g ct t gaccaaa gaat t agacc cct t t ct t ag t t t t act ct g t t gt ggcct g cccgcggt t g cggt cgcagt t ct t cgggt t cact ccct t c gct t cct cgg t t t ct t at aa t t t t agacct agt t gcacca ccaaat t t gg gt caagt t cc acaagct cac t t t cat at aa cacat ct t gc cct cat ct ag t t gt gt ct gc t cgt ccccac caacaacagt gct acaact a t cat caact c cgggt t t gga ccaagt t cct caagct cact t t cat at aat gct acgt t ac caat ct ccat t t ccacaat t t agt agt t t c t aagt t ct t t t ct acgt gaa gct cgaaat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 aacaatcttg ctggttctgt tgaagtttca aactcctcta Page 566 12689250 Sequence Listing.txt gaagga caaatcctag agcctatct c atgtatctcg ggtctaacca tttt aacct caagc t t ct cct ccc agt t t aagt t gacat caacg tccaacaaca t cagt aact c aat t cat cag ct accact ct ct t t caat ct ggt cccat t c gaaggaagt a cat aaccgat ct aagcgt t a aat t t gcaag caaggt cct c ggaagct t gc gat gggggt t acagat gct t t cct acgccg ggt ct ct t ga cccct gt ct a t cggggact a cacaaccaac t cct t cgaag ggt gcacct c gaagaagaag t t gct t ggat at aat t ggt c at ct t gacct t caaat ct t t cagat t cat a agtt ccccaa gaatgaaagg t t ggaaat aa t gct gct at t ct at caaagg gcaaccgaag ct ccat gt t t t ccct gacgc taacagggaa taaacaacag t cct t act ct t cgggt t t cc caccaaact a t at at at ggt t agat t t gca ccat t gat t t aggcact gat t ggccaat ct t t cct aat gg t cacgggt ga ggaatgcagg cgat gt at ca aagaagtgt t t ggcaat agc ttct t t cct a gcggt ct ct g cat cccgt t g cat ct t aaag gaagat ccct t t at t t caca at at t t ggat ct t cggt gt g ct ct ct cgct gcgt aat t t g gct ct gt gat gctt ccaagg aat t gaagat t cgct caaac agagctgcgg ct t t gt gaac atacgaggag at acaaaggt t t ct ggaaat t gcagt caat t gagaat ct c actagggagc aat accacaa gct t t gt ggt ccaaaagcaa gaacgggaga acaagt t at t aacacaagct gat ct t t ct g accct ggaaa accct caagg gagt ggt t at ggt t t ccaag t caaacaat t gct agt aat a gct at t gat c gaact t gt gt ggt gcct ct t t ct t t t gtaa acgt t t ccgt agat t t t at g at at t t gaaa t ggaaagcat aagct gt t cg ct acacat gg agact cgaag at at caaaca gagt cact ag at t t cgt t t t ggaacacaga ct t cct ct ca gaagacaaag gcagtggcaa gct t cat aca acccaat aga gaaacagt at t gt t gacct t agt t ggt gt a ggagcct acc gt t cggcgga ttgaaggagc gt t t cacaag t at cct acaa at ct ccggaa t acggacact act gct cat c t ct ggct caa gt cct at at c t at ct gacaa cat cacgcac acgaaggcgg agcaagct aa gacagat t cc acgcct t cac acat gt caag t ggcgt acat t t actgggca aggaaagt t g aagaagaaga t agggt at gg aaccggagt g aaagct cat c ct t aaaact c at ct t cggcc gaggcactgt t at agacat a t ct t ct acag gat t t t agt a act t cct gat t gagat acct caact t cact caacaacct g cgat gt t agc gct aaagt t t agct t t accg t cct ccgcat t aagt t t aca gat gaat caa ct at ggt t at ggccct cact t gaat ct at t aggccat at t aaaccaact c caacgt gt ct at ct aaat ct ct t cggcact agaagaagaa at ct ggat t a gct t gt caag 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2673 tgaataaacg cagaaagcgt <210> 668 <211> 1047 <212> DNA <213> Arabidopsis thaliana <400> 668 atggcgtcaa ctatcatctt cattctaact atttttttaa gttacagaac ttcgctagct acatctcgtg gtagcctttt tgaagcttct gccattgaga aacatgagca atggatggct Page 567 120 12689250 Sequence Listing.txt cgat t caat c aagaat t t gg at caacgagt gt gcccgagg agat at ggt a acacccgt t a gt ggaaggca t t agact gcg gagt acat aa caacaaactt ggat at gaga cct gt t t ct g ttcaacggag agt gaagaag aat ggt t aca at t ct t gct t gcgt t t act c agt t cgt cca t ct ct gat ct ct at aaccag at gt t agt ga agt at caagg t t acaaagat acagagact a tcaaaaacca gt agt t cat c ccgt t ccaat t ggggat aga aat gt gggac ggact aagt a t gaggat caa t ct at ccact tgacgaaacc gaact t caac tacggacgaa aat t t caact t aat ggcgag ccgat gt gga t act aaaggc caat caagga aggcat cact aact accct a gaat aat gag agggacaggg ggat t t gcat t t gggt ggt g gagagacgt g t gct t ga gagaaaagaa at gaat aat a gaat t t cggg ct gt ct t cgg agcat ggat t ggat gct ggg gagct t gt at t gt cgt ggag act gaggat a t cat cgt ct t gaagcgt t gc gccgcgt t t a cat gcggt t a aagaat t ct t gat gcacccc at cggt t t aa t at ct t caag aaat cacgt a cgacacacac gcaaaaacac ggagacaaga cgt t t t ccgc cgct at cgga ggat aat gt c act at ccat a t t cgt gcagc t acaagcagt ggcact act c caat t gt t gg ggggggaaac aaggaat gt g caaggt t gat t ggact agt t ggt gccgt t t gggggct gt t ggt ggcggca gcaacaact c gaaagct t t c ccaagaat ca cacaat cagc gt ct caacag gggaggggt a t t at ggaat g ttggggagaa t ggt t t ggcc 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1047 <210> 669 <211> 2268 <212> DNA <213> Arabi dopsi s tha i ana <400> 669 at ggcggaat caagct ct cc t ct ccct cct ttcttaagag ctgtggatct cactattcta atcctacatg tgaaccaaaa ggacaccgtt ttcaccttcg tatggttgct tattaccaac t at cct gaaa gact t gat ga aagggt t cat acagctgatc cagtaaggga gccaccgctt gct gt gaat t atccggcgaa caaactagct ct cact t act t ct ct ct caa ggaagcttct aagaaat aca acgt t agagt t agagct cct gcagaaggtt ctgaattcag taaagactgg agccagaaag tagaagatgc cactggaagt gaagctttct taaacacaaa atcaaatgat aacaagggag gtgtgggaga cgaaaaagag ct t t gt gaaa ggt ct t ct cc t ggat cgt gg at aaaat gga gaact t ccac at cgt t gt aa t gt t at gt gt aagt t t gcca t t t at gt act gaaat gact a t cccact ggt cat t caact a gt ccct cat g ggat ct caca t t t ct ct t ct ct t t cct ct g gt cct gct ga cggt agat at acacagt gct cggacgat gg agat t t gggt t cagaaact c agagggagt a t ggat gcaga t agt t aaggt t t gt t t acat t gaact t t ct caaaagct at at t gt accgt t gaaact t gt t t at aaaaca gt t cgt gacc t t cact gt t a at gt t cacct t cct t t ct gc tccggaagca cgagaagt t a agat gact t t ggt at gggaa ttcaagagag ggt aagagt g 120 180 240 300 360 420 480 540 600 660 720 780 840 aaaagaccaa atcactttca tcattacaaa gctggagcca Page 568 12689250 Sequence Listing.txt tcagggttga tgacaaatgc accatacatg ttgaacgtag actgcgacat gaagcagat g cat t gt gct t gt ct t acaat t caggat gct gat ggaagt c gaat t t ggga tttccccaaa t acgagt acc gat gt gaaca gat ccaccgg cgacgat ggg at gt t t t gt c ggt ct aagat aact ct gct t cat t gt ct t t gt cact caat gt cat act ca cct gagacca caagat t ccg gt gt t agt ga ggt ggt t cgg t t t ct aaagg gct gcct t t t t ggt gcgaca t t gt t caat a t at act t ggg t ccat act ag t t t ct t caat at t ct aagga aaaacct t aa aaaccagct g cgagt at cgg cgt t t ct t gg ccacgggact ggaaaat aag caat ccct ga t at t t cccaa acact t t at g cat t t gggag agct t ct t gg agt cagggt c gt aaat t t ga at ct agccgc gt t t ggcaga gt at gt t t ga t agcagcgt t agcaat gt gt ccct caagat acgaggaatt aagagt t at g t gct acaagg gat ggt gaaa ggact ct ct a gggaaaaaac aat t cat t cg gt gt at gcca gct cgaaat c at t t cgacaa gct ct t t t ac gggagt t t ac ggaat t cat g gat aaaaaca aat at cgaaa t ggat ccaaa at t t gat ggc t ct agccggt agcct gt ggc gaagggaaaa at t t gt t gt t at at t t ct gc t t ct at gat t gcgggaat cc t acggt ct t t aagt at t t ag t cggt ggt cg gaaacggct c at cggt t ggt agagggt gga caaggaggac ct t t t caaca agct t ggct t t gt ct cct gc t t aggt at aa aacct t ggt t act t gt agt t accgt ct t ca aaat ct caac t cgct t t at t t gt t t ggt gg t gt at t t t gg t at ggt at cc t t at ct gt gg aaaaat caat ct aat gt cgg aaggacct ca ct ct agacga ct gaggagag at gcgt t aca aagaaat ggg t at acgat t c ct agct cat a cggaggt aat aacaaagt cc at t t at acgt ct gct t at t g t cat cacact t t t ccat aca ggt t at t t ag t agt cact aa gagaagt t ga t ct t gccggg gt ct acaaag t ggt t at t t t cat t t t ct ac gaaact ag gt at gt caat ggat t caaac t gaact cacc gt at gcgggt tttaggagac t ct aacgaga aagaaaacca acat t gt cac aacaacggaa t at t t t t ccg ggt t cagcag gt t gat cgga t t t cagt t gg cct act ccac t gt ggggat a at cgt ggt at cgt cct t gat aaagact at g t t gt ccaaac cacat t cat t tcgcggagga gt t cct t cca t ct t t ct aaa 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2268 <210> 670 <211> 2010 <212> DNA <213> Arabi dopsi s tha i ana <400> 670 at gt t ct t ca t caagct ct t caccat ct t c tcctcttccc aaattatcga cttcacttac tccatcctag gaattgccac catcacacca atgcagagca ccggtcacgc ct t ct at acc ggt accgt at cgt ct t t ct c cact acct t t gcacacggca t ggcct t t gt cat cgct cct caat acct cg gt ct ct t caa t gt aacaaac t t ct t aagct aat ggct t cc aacggt ct ct aaaccaat cc gt ct t cgct a aaccct cgcc aacggt aacg Page 56 t ct t t t ggca gt cct ccacc t aaagct aac ggt t caaaga t t cact ct ca t cccat t cgg ttaggaacca at ccct aaag gacagacat a caacacaacc t t ct ccaaac gat acccat t cagcccct t a t gt at t cgct 120 180 240 300 360 420 12689250 Sequence Listing.txt gtcgaacttg acacgattat gaatatcgag ttcaatgata cgaacaataa ccatgttgga at cgat at ca gat cagt t t c gat ggt ccca aaaccgct t g ggt t t ct cgt ggagt gaat g gact t aaaac t t gt t gat t c aagt t cgcgg aaggact t gt t t cgggagt g gt ct cgaacg cagat gagt c ct t ct ggt gt gaggt aaccc t t t t at ct cc gt ct t gt t ag gat cacgggt gat cacat ca ct act agaag gt cgt gct cg gat cct aat c t t gt t gt gct ct aaggggag at gt t gggaa gct t at t ct c at agct t gaa acaat ct gac cccat ct aat t t t ct at cgt ct gcgaccgg gggaagct ca ct acaagagt cct t ct t gct aggaggt gga act at gcaac tttacaaagg aat ct agaca accggaactt acgat t acat t cgat t ggaa acgaggagt g at gcagagct ct gat cct ca ggacaggacg t ggcgt gcgg t ggat t gggt t agggt ct ga ct cact ccga at gcaat gt t cccacaacgg t act ct ccag ct cggt gaaa t t t gat cagt cgat gt aacc cagagat ct g t aat at t gt g gccat t ggcc ct acagat t c t at aat ct t c agat t gggaa aaaagggt t c t at cat gccc aggt t t gaaa ggt t cct ct c gcccaat gga acagaggttt ggaacaagt g caat gggaga aact acgcgt ggccacaacc cagacgt cct t t t t agat t t gt at gaccaa cccact agct accagat ct g gt ct aat gag t gggaggt ga agt t ct cct g agt aaacgga at ggct ccgt t cct ct gt t c t ccgaaat t t t t at cgaaac t acaagaact ct cgt gcgct acagagt t cg aaggat aaga aagacgaaga gagt t cgt gg gt gggt t at t agt t t agaca aaagt cat t a gt gat t cacc ct cggggat t gt agt t ggaa act act gat g at cgagat t a t ggat ggagg aaagaagt cg agaccaacaa t cgccgt t gg t cgggcat gt ct gggt act g t gcaggt t t g t t ggt gaggt ttttgcaaga t t gt t ct t gg t t ccgagat t gggt gccgt t t cat cat gaa ggaagaaccg acat t ct t gg aggagat t gc ct gagat cgt gccgt cggag agt at t t gt a at ggcgt ggc gcgacgt caa t cggt t t agc ct t ggggat a t t t t t gcat t at aaccaaag caaacatttt aaat ggt t t t t gagacaagt act t gcgt gg t t accagt gg ggacgagaac ggt cgat t t c t aaacct agg t at gt t cgt a gt ggagt t t t gccggt gt gg gat ct ccct c gaggagaaga act aaggt t c at caggcggg cgt gaaaaga gagt at t gga agacgagctt caat agt cca ct ct gcct t a agccagcaac t cagt t gt gt cct agccccc t ggggt gct c t ggt gagagg ggat gct aag gaagct aggt gct acagt at gagt gggat c at ct t ct gt t 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2010 <210> 671 <211> 1506 <212> DNA <213> Arabi dopsi s thai i ana <400> 671 at gcct aat a gcagct cagt t gt t ct cat g aat gt act aa agct gat aga ccagt acgac ttgtacggcc aagtagctta tccaaagcat cataaacaat ctgaagttcc agatatttac cgcttagcag ccaaaactaa gggggttttc atcaatccag ctctggtgga gccgtttggt Page 570 12689250 Sequence Listing.txt ctcacgctca ttgaggcagc tgcttatggt ctgcccattg ttgccaccag cct gt t gat a gccat t t ct g aaaaacggac t cccat gt cg ccagaagagc acagaaggag gt cgat gcca ggt agaaggc aaagcgaatt aaaggaaaaa acgcagaaaa gagat ct act t acaaat ggc gcagct gagg t ct gt gaaac agaggt t t aa ct at gt gcat t cgaagact g ggcct ccaca cgcagt gaag t at gt caaag aagt aa t t gt gaaggc at gccct t ct t caagaacat aacat t gcag t cacaagt ga at t t cact ct t aagt caaat agat gct ct t t gaacgaaat t aggt t t t gt acct gat t aa at ccat ggag ct ggt gaaag at gat at cac aaggagt caa ggt gcaacat caagaat aca t t t t ct t t t t aaaccat cat aaaact t caa agaat ggcgg act aaat aat aaagct agt t ccaccgtttt aaaccgt cac t t ct ct acga caacggagaa gaat t caat g t gt ggt t gct cat caagaat gct agcct ca t ct ggaagat agat at gat g t at aaggt cg agagt at gcg gact cgaaga t gt ct at act agcact cagg gg gagagaaa t ct aaaaggt aagagaagac at ct caggaa ggt ct cct ag gct aacaaac t cat ggccag cct accagt a gat gt cgat g ct agat gcag aaaggt t gct gt t gat t cct at gat aaaag ggt t caagct t t t gat gcaa gt t gat gcag gt gat t ct ga agt t cat gca gt cgat gacc cat gcagcaa t at ct t t ct a ggagacacag gt agt gggat gcagt t ccac at t at gt cca tcgaccccca at ct ct gggc agcact gccg gt ct cgacat acat at ct t t gt accagaca cagct gct at at gat gacaa ct gcagat t t t acaagaagt t agt t t gcaa act acgaaac gact t at at g gt acaagat g ttaggcagag caaggct aaa t aaggt gggg act at gagga cagacagt ga aagagagccc ct t t agaggc aaat ggaggg tgaccaacaa t gagt gt aga t aact acct a cat gaaagt t gagat t ct cc gaagaaact a ct acagccct cggaaacat c gacat cgggt t gt ggat at t cagcggaagt t cat gt ggaa tacggaacct ct at gcaat t gct t cggat g t gt t at acca aat t gacat g ct t act gggt gaagct t ct t t aacat t t cc ct at gggat c 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1506 <210> 672 <211> 1320 <212> DNA <213> Arabi dopsi s tha i ana <400> 672 at gaagt ct t ct t ct t ct ca gagct acgat cgccact cat tagtcagcca cct t cgt aag aacgacaaca ggatcgaaag aagccgcaaa aact caagga t ct cact t gt cgt ct t ct ct gatgaattgg tgaagatcca agagtgctat ttctacaaag tagatccttc tcatgttaga ggggagacct gcaagggcag aacagagaat gaggtagcac atctagccgg agaagatctt gt t t t cccca gaact t gat c at aaccccag aaaaact at g gagaaat t gg aaacagaccg gagaagcgaa cggaact ggc Page 571 act t cagagg gcaaat t cat agct gt t at t ct t ct t ccac at caaat ggt gagagt t t gg aat ggat gcg gt agcgaagc ggaagat gt c caat acat t c ggct at agaa gt ggt gct t a gat t cct at t cat ggt ct t t agct ct agca agaaat gct t 120 180 240 300 360 420 480 12689250 Sequence Listing.txt gaaaatatcg ccaaggatgt ttcaaacaaa ctcttccccc catcaaataa tttcagtgac t t cgt cggga aaagcgcgaa gct t t at aca aaaat acgga ct t t gt caga acgaaagt t c ggacgt at ca ct t ct caagg gct cat caaa gagct t gcag ggt t cgt ct t agt aat ggga gt t gcgt gt t t t gaagct ca t gat agggat gcagact caa gcgact at ga aggat at aaa t t at t gt t ct gat ggt t t gg ct cacaacat t gt t t t gt cg at gaagct gc t t agaagact at aaact aaa t aaccaat gg t at agaggca t t gt gggcct aagcgact t c ccagaagct g gat agaggaa t gat gat gt t at ct gagagc t gcacat gt t at at gct t t t aaagat t gcc agacaaggaa aat cagct at tgagaaaaaa t t gat at caa t ct gaaact g caccat cggg t at t gggaag t gt ggt gcgg gat gat at ag aaaat t gt t g t at gaagt gg gggaaaaact ggt aat cgt c caat gggt ga gat gagt t ag t caat at t aa t gt t gcgct t gt aagact ac ct t t cgt agc aacaat t t ct tggaacaacg agct act aaa t gat t act ca gat t cccat c ct ccacct ca ct aaagct ct agat gct at c at ggt aaagg aaat t at t gc cgact ccaag t at aggaaga at at aagaga at cagaaat t act aaagcac aacgt t ggt a aaaaagggaa agaagaact g t ggt t t caac caagt acgt t ggagt t ccgt t caagat t ac at ggagat ag 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 <210> <211> <212> <213> 673 1149 DNA Arabi dopsi s t hal i ana <400> 673 at gcaaaaca t t gat agaga t caagt cgat t t t ct cagaa t t gt gt gt t a caaagt aaat at ggacgt t c ccgagat gt a t ccgat gcgg gt t ggt cat c acggat t aca t acct ct ct t t t ct ggct ct t ct gcat t t t gat gt ct gt a gat gt ct gct gt t at agaaa at t t gt t at t at gct at aaa acgt cct t t t ccacagcgt c gt ggat t aag t t gct aagat aggat at ggg gt gt gccaag t accagt cag ggt cat t act at gct ggt cg ct gct t t ccg t at ggt cagg aaccgaagca gcct aaaact aat at t gact ggcaacagag aat t gat at t ggct at ggat t t t ct ct gt t acagggt t gt gct ggacaaa at t aact cat t t cat t gt ct at acct aggt agaacaaat a t t t gagct t a t ct t cct cga t cct gat t t g ggaaggagga aat ct ggaga t t t gat agag ct agt gaaag t ct aaagct t at cccgt ct g caagt aaat g t ct t t at cgc gct gcgggag ct t agct t cg caggagat aa ct accat t gg aagaagaaaa at aact t cag ggt t gt at ca aat ccgcat a ct t ggt ct t c ct t gt gt ct c aat t t gt aaa at t accacaa t cgact cggt agt t t gt t ca gt gagct agc cat at ct ct t ct aagaaat a cggat gat ct aggacagat a ttaccaaaag t t ggat cct g t at t gt ggag aagagat t ga aagct aagct ggccct t gaa at gcaaact c agaccgt ct a ggagt caat c t caat ggt ct ct ggat t at g agggt at t t t t gt t at aagc cggct at cat cat gat ct t c ccat t t t gat act t acat ca gaccgct cgc t acct gcaac gaaact at gt cagt t gggt t agaaat aaat t ct at gggt t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 572 12689250 Sequence Listing.txt caatggatag aaaaatatct actcaaagct gatacct t ct ggtccgt t aa agccacaact aataggggct cttggatgtg gtcaaagctt attaaataca gagatgtcgc aaaggagttt tgtaaagtgg aagtgagaaa cggacaccgc acctctttct ggtatgagaa ggcaatcaac agt ggct ga 1020 1080 1140 1149 <210> <211> <212> (7 <213> 674 1524 DNA Arabidopsis thal i ana <400> 674 at ggaaaaca gagcaaaacc at ct ccat at tccgcaagag ggt ggagct t t t ccat aat t gat t ct cat a cgaact t t aa acat cgt t ca gt agaagt gt aaaat cggaa aaaaagct t c aggct t gagg t at t gt gcag gaacat cat c tggggagct c aat acaat ct aat aat cgag ggt gagat t t act ggaaagc t t aacgggt c t gggct ct t c aaaggt caat caaccagaag gt t aaagct t t cat t t gat g aaagccat ag agcgtggcct t cat aat cct tcaaagcaca ccaccaact a t t gt aaagaa t ct t t at at c tgt t ct t gt t gacaat t acc acacat acaa at ggagat gt at at gt t t aa t t gat ct t ct at caaaacca t ccacgat ca ggct t aggat ct act gt t at ct aaagt t t c caacaagggt tt act acaaa gcacaccgat caagact aac act cacaaaa caagt t at ag tcaacaaaag at at t at gcc caaccaccac aat t t cgat a t gcaat t ct c agaat t at ct cagct acact t cat aagt t t agt at t gaat agat gacat c cccacagaca ggaat t agag gt acaaagga t gat aat gct t agt cgat t a t aggat cct g t aat t t t aag cgccct t gat ccaccggaac agat t t cgga t at t ggcacc at cagat gt t cgat t caaga caat agagag agat ct t at t accgt t gat g t act gat t ct at ga cggt accat c aat gccat aa ct aat aat ca tgcaaagaaa t ct agccct g t t aat t caaa t act t ccaca aagagagat t aaaagt t gt a at t gcaacaa gt act aagt g agt aaccaaa caat gcccat at at at gaat aat ct aaagg tgcgccagag t t caagt gt a ttagccaaaa acaggat at c t at agt t at g cgtccacggg aaaat t agt g caggtagcag acggat gt t g t ct cggt t t c at ct at cat c t cat cat t gg t t t t act t ca gctt caacaa gt aaaacacc at caat cagc t ccaat gt gt gt ct at act c ggagaagccg at aat t t cag at ggaact gt agcat gaaga at t t ggt gga t t at gcct aa at cgacct ca ccct agagt t ccaacatttt ccggat ccga t t gct ccaga gt at t gt gt t gacaagat gt aaat ggt gga ccat agcagc t t cact cgct ct agccgt ag ccggagccac cat ct ccat t t cgact t aaa cat gaacaat t aaacat t ca aat gagt gga at t t ct t at t aagaaaccca agccgaagga cgaggagaag t gcggccat t acggt cct t t at t act t gga t ggt act gt g accat t agat t ct t cat gag gt t ggat caa t aaact aaac gt at gcct ct act acaact c cct t gt ct ct t ccaaccat g agt gt gt gt g cat t cccct t agaaagct t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1524 Page 573 <210> <211> <212> <213> 12689250 Sequence Listing.txt 675 2163 DNA Arabi dopsi s t hal i ana <400> 675 at gggt gt t g gcagct gct t agt gt t cct t gt ct gcaaaa gt t act t cct cagagt ccat aat t t gaagg aacaacaagg ggaaat gaga cagat gact a gaaccaaat a t t gt t t act c cgt ct t gat c gaagat agct ggat gccaag t gt gt gaat g act ct aagct agaaagcgga ttgcagcaac aat gacct gg cagggt acag aaagcat t ga at t aaccacc t t ggt gt at g gat t t cccaa t cct acct gc at ct t at t gg gct at agat g gagt acct t c ct cat t gagc at gt t gggt g at gt gaaacg ct act t t t cc accct t t t gg gct cat caga t caacct t gg t gaaacat t c gcagcccatt cgt t cat gaa t acggt ct t a t accacct ct aacaaggat g ct cccgaat t t t t cct acat at caat gt t c acat t gat ga t t ct cggat c ct ggat t gt t aagt agccaa aaacat cct t agaat gcaac t t t at aaagg aagaagagaa gaaat gt t gt agt t cat t cc t gagt t ggga at t cagccgt at gaaaagca at act cact t agt caaacca t ct t aaccgg ct t act t cct t t t cct t gt t t ct t gcact t gat aggcaaa ccaacaaccg agaccct t t c aggt t gt ccc ct t cat at ca t gt cacaggg caaaggagct t ct t caact t t caggt t gcc aat ggagt ac gact t caaag gt gt cacaat at gcagagat at at agat gt gct cct gat c gcagaagagg cct t cacgga agat agat t c aat gt t ggaa t ct t gaggaa caagat ct t a t aaccgt aac agt ccgt ct c ct ct at cccc ccgt gcaaaa gacaacaatt ct t cacaggg agagaaacct cgaagccat g gt t at gct t c cgt aat t gca ggt t gt t aca at act t t t gt t ccat at cgg aacagagat g gaaaacaat a t t acaaat cg aacacgagtt caagt gt t t g t t ct t gact c agt gaat at a cgt gt act at ggt t acgaag ccacat ct aa gagaaaactt t t t ggt at gt aagt t ct t cc agt gt aaat a aat gcgagca gat gggat ga t t cat caat g ggat gt t gt c ct t t t cgat c t gcat cgcct at t t at caca gt gt cagat t gt acaaggaa aagagt gacg gt ct ccct t c agaaat gaca t gt t gcgaat gt gaccat t g agaacaaat g t gccaaggat t gt acaacaa gt t at t cat c agt t cacagc t gggat gt ga gt gt t ggt t a at gcaaccgt aat t cacat t caacaat aga gcaaaggcaa gaaat ccct a at aaat gt gg ggccagccat ggt t gct at g aaagaaat gg gaaccaaagt gaat act ggg t cgt cgct gt agat cat cct ttgagacaga at ct t cacaa gt gaagt cgc gagat gt caa t t ggaat ct c caat t ggat a t ct acagct t tgagacggca gact t cat ga ct gt gaat ac cggaaat gt t gt t cgagat t ccgacgagca at t ct acat c at cgt ct t t g t gt t ggat gc gacgacat gt caaat gct gc agagaaact g gt cagggt cc gct agaat gg t acat t t t t c cat acct ggc aaagagaaag t t t aagt gga caaggct aat aggt t t gt t g t t t cagct cc gcaaggcggg caagaagt cc t t t gt cacag agt acct at a cccat cagag cgat gccct c gt caacaaat gagat ct gt t t gt agaccca t ggggt t t t g agaagt ccga aat t ct cgat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 Page 574 12689250 Sequence Listing.txt gctcgcatca aggaggaatg tgatcgagaa gaggtccttg cagtagccaa acttgcaaga aggtgtttga gcttgaactc agaacatcgc cctaccatga gagacgtatt tattgaacta gataggatgc aatctaagag aaaaggtact caaagccaag cacagaatgg tgaagagcat gcacacatcc agatcgcaat gccggagtct atgtcact t a gt t actct t c tccgaacat a gtcgttgaaa acagctcatt ctcattggac acaaaaccac taatgcctca caagacacag tga 1920 1980 2040 2100 2160 2163 <210> <211> <212> <213> 676 2043 DNA Arabidopsis thal i ana <400> 676 at ggct acaa t t t ctct t ag aaat ggt at c t t cat caat t aaaat gccag at ct act t ac gt t cct gt t a agt gat at t g cact t t ccag gagt t gat ga gaat t ggt t c at gaggaat c accggcacaa gcggt t t gt g gcagagccac aggaggctca gcat t t gt ac cccct t at ag caaggt t cc at gcagat gt act agat t ct gcgt t t caac ggt gt ct cga gct ggt gt t g t aaacgat at ct t t t gct at t caagggct t t ggact t cag aaccggaact t cgggct caa att ggacaaa at aaact ct c gt at aat cat t ggt t gat ct tgacgaggaa t at ct aacag agt ccat agt ct caaact ca gt gacat ct a t agt cggcgt agt cact cgc aagt gt at gt t t cccggaat ccaaat t t ga acat gt t cca agct caacag t t ccaat gaa ct ggt gagat tggagtagca gcagcaatca atatagtcac agcatttgct at t caggat t aagaagt agt gt cct at at t cgt cgat cac gatct t t t t c caaaggatt g t t t at caaat ct t t ggat t t gct t t t aact acaaat gcag gccagt aat a gcct gcagct acagacaaga ct at gacaat cgcat act t c caacat cgaa ct ct gt ct at cgcct t gaag aggt t t tgt c at t cat caat t t t cctt aac agcgacct t c at t gaggt t g caaccagt aa t ct at acaaa cgt t t cct ca gct ggact cg cct at agct t gat cggt t aa at accaaat g t ct at t gt at caggcagt cc aacct gct t g aagat at ct g t t t gtatct t aacccaacag ct ggcat t gc t t cct cacct ggcatagaga ct caacat ga at ct t cct cc agcacat cct gt t t t ccttg cagt ct gcaa t t t at aacat aagccact ca at gacagagt ct ggt ggct t at t ggat gcc at t ct gt t gt gt gt t gct t t ggcat t ccaa ggt cacct ag acat ct ct ga acgat gcaac at t acaat at aggagaaaca tcaagagccg aat ggct aac cat at gt aga t ct t ct t cat aggct t t cc aat t gt t aaa t t t t cct ccc cat t agaaag gaagcat agt acgat at t cc acat aat ggt t aat ct at ca ct at t t ccct t ggt agcaaa t gaagcact t ct act t gagg t acaacaat g t at aagt t t c at cact at ct ct t gat t t ct caaact ct cg aaacaaacac gaggt t aagg ttggggagca t gagt gggct cct t aagat a gat accaat a t t t ct t gaag gt cgat cat c aagaat act c aagagct gca caccgggact aaaaacaat t ggat ggat gg t ct aaagaac 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 tccttcctcg tgagaaccga aaaagatagg gaagaagcaa ctgatcctgg aaccatagga Page 575 t t caacact g t t t gt agt gt aagt t ct ggc t t gat gggt c ccgt t gct ga acat acccgt t t aaacct ca gat ccagagc aagcgt t ggt t aa cagt t agccc accgt cat ca ct gacgt t ca t t ct aagcac ccat cgggt t tacaggaagc aagct t t cct ccgaggagaa cat ggaggaa 12689250 Sequence catccttctc ccttttatcc ggt t at aaat gt gt at aacc caggcgtgtt gt gaccgcat caaacacgct t ct aagt cca ccacaaacac tgcaaaaacc cat gat caaa gat acact gg tcgagacgcc tacgcccacc gct ggagt ct gat at gt cac cact cct t t g cctagcaaag Li st i ng. t xt t cgt ct t ct t aaaagt at ga t ggt t gt t t c ct cct t t gct gt t accaacc accgt at acg cggagt t cag ct ccggat t t at agct gccg cggcct ggct gagcgcaggg acagct gct c gct t gt gct t t gcct t t gt c tgaaccaaac ggt t ggagaa agt ggccact t gaaat accc 1560 1620 1680 1740 1800 1860 1920 1980 2040 2043 <210> <211> <212> <213> 677 801 DNA Arabi dopsi s t hal i ana <400> 677 at ggt t ggag gct gt gaagg at gct ggat t ct ct t gaaag aaggaaacat gaacct gcaa gat gct cct a ct t ggt t t ca cct gat aat a caggggaat g ggaaacaaat cagggt cat g t caggt t t ga t gggt aaat a gaagct t at c aagcat t t gg t aagagct aa at cacgt gaa t t cat gat gt aat at gct ca gt gccgt t gc gt gt ct t gct acaaaagaat at ggccat gt ccgt aaat ga aagat t t agt t t ct t act ca t aagagact a accggcgttt ggct gat caa t agagt cgat t ct gct act t acgt t cat ac gt t t ct cgag aaggat gacg ct caact gga acgt gt agca at at gagt ca ct t gt accag cat ggagt t t t at gct t ct a act at t gat g cct gcaaaat t t agct accg cgt t t caat g at ct at t ct t at ct t gaat g gaact cat ga gat acaaaaa aat t t t at ca t t t ct cgaga gaggt agggt t caaacgt t t t at t t gagat aagcgacgt c at cgagagt t t cgt t ccaag cct t cct t cc t gaaggaat c at t at agcgc aagaccaccg ccacccct ct gcaagct t aa t at t gacaat t ct t ggt t gt tcaagagaac caat gagat g at acat caat t ct cgacat c gat gcgggaa agct gaggct at t t cgt gat ccgt agagt c gaat ct ggt t cgaagct gag ggcaaggt t c gt accaacaa t gcact t ct t cact ggt cca cat aat ggt t 120 180 240 300 360 420 480 540 600 660 720 780 801 <210> 678 <211> 1173 <212> DNA <213> Arabi dopsi s tha i ana <400> 678 at gt cggaat at agat t t ag gt acgat gat gat ggcat gt ggt t gt ggcc gt cgt cagt t tccttcatta gcgaagactg tttttacaga tttttaagaa tcactggagt ttatgcttat tgcctcatga gaattgtagg tattggtcaa tctcaaacag tgagttttga tttgaaacaa Page 576 120 180 12689250 Sequence Listing.txt gagt caaaac act t t gaat g gct gagcaga t cacat gcca gct gagat ca agt gacat gc ct agct ggag gaagt t at t a gcgt ccagct aagacagcct cct gat gt ga gt t gat gat a gat t t ggct a ct aagagaga gcggt gacaa gacgct at aa at ggt gt t gt agcct at t t c acaat ct t t t ttttcggagc cggcggaatt t cgagat gat gaagaggaaa at t t cat gt t agct cat cag t at t t gact g ct t t agt ggc cagaacaaat t t t t ggat t t aaggt aact t t cat t gagt c aaggt ggggg agaat ct aca gt t ggt gact at cgat t gt t t ggt ggt aaa agcaggcct a acacact gca ggaaacagtt t gct caagcg t caggt gat c cgacaccaag tgcgagcacc gt acgagt t t cact cagt cg aacagcacct agagt t t t gt gat t aagaga gt gt ct acct ct gt t t gagc ggt gcggaaa agaat gagac aaggaact aa agct t gat ac cat gagct t t t cat ggt act aaagact t t g ct cgacgagt aaaggagct g gggaagaat c acagagcagc gt gat t t t cg gaggcgggtt gcacaagaat cgaagt ggct t agt agct gt at ccggt t t t cgggt t t ggt caacagaaca acgacgat gt tcggcacaag t agcaaat ct caagcggaga act t act caa ccat t t t cag t cggt ct ct c tcgggaagcc ct ct ggagag ct ct ggaaga t ggct aggga t caggt cggc t gat ct gcag gat at ct gcg at t cct t gt a ccggcgt t t g gt t agacgag agt agcggt g cgagaat ct t gat aaagcag aagt t t ct ac cagagt t gag t t t ccagat a agcagggagt ggagccaagg agcgat t gaa gaaagct gat t ct agaagat 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1173 ataacctcga aagaattgat tag <210> <211> <212> <213> 679 2805 DNA Arabi dopsi s t hal i ana <400> 679 at ggagggt t ct ccat act t t acaaat at c gat gt t gt cc at cat cgggc caagt accga t act t t gt cc aaat cct t cg at at t gccat t t gat ccct c at gcagacga aaagct agag at gaat t t gt gt t t aggaca cat t t t ccaa at t ct gat t a aggcct cat c caagaacct c ccat cacat t gagccact ct ggt ggagaaa t at t gact ga aggaggct aa gggt at t cgt agat t ggt at t gaaat ct aa aaaccaaaca gct ct gcct c caccacgaga t gcagcct t g t at gcaagct ct ct gcaaca cgat gact ca t gt agt t gcc cgct t t acaa t gat gat caa t gt ccacat g gat ggaggaa t gaacgcggt act gaaat ca act t ccat t a ct t gcgat t c gacct aat ca gagt t cat ga t gccct ct t c t cccaggt ca at t t at gt gg gacgt gcaag at t ct caagg ccaccaact c ggat at gt at agcagct t gg aagt aggagt acat at cgt t acat t agaga agaacgagca t aagact cgc t gacat ccat aagccat t gc acaacgaatt ct t t t gt agt agct t t at aa t cggt t t t cg ggt t at t gac agaat at gca agt gct t gat gt ccgat t t c t t ccat ggaa agt gagcgcc cgat aaat ca caacagccct ggccat t gt c cggagaagga caat agat gt gct aat gacg t t t t t t t cag agat ggagt g aggggt t t t g 120 180 240 300 360 420 480 540 600 660 720 780 Page 577 12689250 Sequence Listing.txt ggtgtaagga gccatatccc caaatcgaag aagctcaaaa atttcagatt aaaat gt t t c gat t cgat ca gat cat ccga t acggt ccaa t t t gagct ca agt gaagaga aaaaacact a aaagat gt t c ccagt gaaga at gaccccaa t ccgt cat t c t accaagt at ggagat gt aa gagt ct ggag agacct t gga at cgt gt gga at aggcacca gt gagt aat t cagagct at a acaaat t gga gt gcgt gaac gccgt agaat gcct acat t a t t t aagact g tcaagggcga tttaagaaac ct gagt agct at ct t cgt gg t ct t t ccgt g cat at gt t ca agccct t t ga gagct aagaa ct aagaaggg ct gcct t agc t cgct t cagg gt ct t ct t aa ttaacgggca gaat t at t gg cgt cggt t t t cgaaaggt t g aaggct t ct t cgggt t act g ct aaat at at acaccggggt ccat agt agc t gt ct at gat gct t agacct ttttagaaca gt t t t t ggt t t agcgaggt t cagcgaat ct aagat ct cat ttttgaaaag gt gacgagct aagt t at cct ct ggt t t t gg t ct t gaat gt caaat aat t g t ct ggggt ct ccaact t ct t gaaagct aaa aggagaat gc ccgat cagag gagt gt ct t c gaat gat gaa aat ggccgt g aaat aat aag ggct ct t t cg gct agaat ca at t at ggaga aggggagagg gcagat t cca agaat t t gt g t at t gagat a t gct t t ct t g t agt t cgt ct aaacaggt ca ggt acccct c at gggt cacc t agagt caac cgct t t ct cc t gt ggt gt t g cacat ct t t c aaaat t t aac t caaggat t t t t t t t ccaat t t ct cagaat ct t t gt at t c gacacaaggc t ccagat ct a gt t t ct t at a gt acgaacac at t t t t ggt t cgt t cat aat cact ccat t g cat ct cgagc gagat gaaca gagaaaacca acaaat t t gg aat gt aagat t cagt gt t t g ccaagcaat g ttggggccgg acaaacggaa gat gcaaaga t t cgaagct g t ct ccagat g at t t ct act a t t gt at gt cg aaggacaaca act gct t gct accgact t t c act at gaact gt gt ggt gct t t cacagt aa aaaaat at t g gat gaat cac gggacaat ca t cct ccaaat cccaaaaaat gaagaaat gc aacact t ct c gcaggcat ag aagcacacgc agaaact t t g gt gagt t cac ccgcgaagt c ggt gaact t t t t t t t gcgt t at at aaagag ggacat t agg t caat ggt t t acgt cat caa gaat t gt t aa t gat at ggcc agat gct t ag t agat ccgat t t ct t aagaa agaact at ga t ggcct at ga at t t cacgct agaacacat g t t t t cgt at t gcggaccgcc ttgcccaccg t cgt ggt gct aact gt t gca ggt accaacg agct caaacc cagcat ct t t at act at ggt cacct t t gac aacat at t ga t gt cct ccaa ct t cat t ct t t ct t t gat ga at gagaaaga ct at t act ca cggaacagt a t cact acgca aat ga gagat gggaa aagggcat at ct t gcggt at ggt ct ct cgc agccggagag t at t at t gga t gcaaagt ca cggt aagt cg agt gggcat t cagt aacgca gt t gcct t ac t gaaat ggt a cgcagt t gt g accgt acaca ggt gt t cct c cat t gggt t t tcaccaccaa t gagaaggt g t gt gct gat t accaacggtt gggcact t t c t t t t ggt t ct t gacgaagt g t gaaccat cc ggat gat gt c gaat aaat gg ccat ct cagc ggcact t ct c ct ccgaaaat cat aaagt cg aggaagt t ca t agggaat t a at cagaacaa 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2805 gttgaagatg aggaatctgc tataattcaa tgtgaaggtg Page 578 12689250 Sequence Listing.txt <210> <211> <212> <213> 680 2049 DNA Arabi dopsi s t hal i ana <400> 680 at ggaaaact aaat t gt t ac t gt aaaggat t gcagt t aca gaagagt gcg aagct gat cc gt cggt t at c gccct act t g gaagcgcacg cat t t ct aca agt ccagaag gt ccct ggt t cgccagct t c ccaccaccgg agacgcat t g t gt ct t cagt at caaccgcc t gcagagt t t t t t gat t at t ct ggaaggga at agaccat t agt gt agt t t agaaagagat cat gt gt gcc aggt gt agt a at agat gt ac t t acact act agt t gcgat g at gaggcat a ggcaaat at t t t ct agct t t cat t t cat ga gct act cgac tttacggagg ccaagccgtt t cct acat ac gct gt aacaa t t ct t gct ga aaacact cag cat gt cat ca cat accacac acgccgacaa accact gt gg t t ct t gt t aa act t cact t g gcaat t t cat acgaccaccg gt cgt aaaaa t t gt t cat t c cacccgaaga acggccat aa gt accgcat g gcgat t t cat acaaccagcc t t t gccgt ca ggt gcgct ac ct acagat ca at t gt gaat t ggt acgacga ggt gt gaagc gtttaatgaa ggcatggacg aagaagcatc ccgccttgaa gcat ccgt t g cgat gaagat acaccgat gc accagaaat c cagagt t t ct at gt gat t t c aaact cct ac agat t gcaaa at gcgagt t t t t ct cacccc gaaat gt ct t t gt t t gt aat aagcccgagg caat gct t gt gat t cat cgg cat ct ct t ac t gt ggat ggg aagat gcgca agaggaaact gcat aact t a t gt ct t ccag aat ccacgaa act cgaact a act gt t cacc agt t acagaa ct ccaaacgt t ggt t t gt gc t cat cct ct c at gcgaaaca at acct t t t a aat acaagct aat gagccgg gaccaccctt aat t gt agt t cagt t ggat t ct acacgagc gggt gt ggcc t t cat cagt c caacacccac ct t t gccgcg t t t agcat at acccat gaac gggacgct ag gaat gt at cg acccct cgcc t t ct acgggg accagaaggg gcgccat t ca cgaat t t acc at t t gt t cag aaat gt gcca t ccccaaat t ggt t t ct cct ccat t t gagc t gt agt gagt t t caagt gt g t t ct t at ct t act gt gaacc cacgct t cga t cat t t ccaa gat gt gat gt accat ccagt t ct gt ggaga t gat t t gcgc at ccact cca at caacat aa t ggat t gt at t caagt cact aggagt t t ga gcagat cat g at caact t ca gt gaccgt ag at ct acct cg t cggacat gg ct t at t ct t g at gt at ggga aggt gat t ga gt ggct ggaa aaccgt t ct a agct t cct t t at ggcacct a at gaat ct aa acaaaagcca gt gat ggaga ct at t t t acc at ggcgaaat cagaaaaat g gt t caccaag acat cccggt t gt gt t ccat ccat cct ct c at t t t t cagt gaggagacca act ct cccat t agct ggaaa cgat cat aat t agat at gaa t ggaat ccat t at gaaaaac cct t gt t cca ccct t at t t c cgt cat aaac agacgggaaa ct ccaaat gt cat gat cgaa tgacaacacg t ct accagaa cgct t gcgcg aaagaaacgt ct t gt t acat ggaat t aagg t caacat cct cgggaaat t g gcaaaaggt a gggcgt ggct gt t ct at aca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 tgcaatgagt gtgggatcac attgcatatt tcctgcgtta tagatactta cgcatattcc Page 579 12689250 Sequence Listing.txt atgccaggag cgttaacatg gataggaaca tttgtatcca accactctat ttgtagacca ct t t gct gcg t gt gcaact c t cgat gcaag ct ccct t t cg t t cgt acgct t cccgaggac ggagctgatc tat acttttg ttctttagaa tgtgagaaga aaatttatca ggaacgtttc gaat t t t aa 1920 1980 2040 2049 <210> <211> <212> <213> 681 3504 DNA Arabi dopsi s t hal i ana <400> 681 at ggct acct t t ccgt gggg aaacagat ca ttcaaggaga t caaaat ggt at t cccat ct gct gcgct ca gct t t gacgt gagaccacgc tcggaagaaa aaggct gat a at t acgggt g ggt aaat cca gcct t act cc at gct ct t ga aaggagaaac cat at acaaa at t gcaagac t at t t t gt ac cgt cat t t t g gt gcgt t acg aaaagcct ct at t cgagacc t t t ct t gaca t cat ct ggt c at gat t t aca cat cct cggt acgaact t cg at gt ct t cat t cgaaaaat c gct t gaacga t ct acaat gt caaaaacgca at gt t agt ct t cat cgat aa gcacaagt gg agat t t ccgg acaagagaga ct ct ct t gaa aaaat at aag aggagt t gct t act t aaaaa aact t ct t aa gcgcagt gac cgct gt t gag cggcccacca cgagaggcca ct t act ggga gt gt ct t gca t agct t gt t t ccgcct t t t c t t t ct gat ag ggt cagcagt t aat aat t t c agat gaggct t agaat cgcg gct ggt t aaa ggaacct gct ggaaaat gat cct ggt gggc aat cgt cgac at cagt ggat gct caaccag cgagact cgt ggcct t ct at cgaact t gt a cccagat gaa cacagt t t t c ggat cat cga t cgt gat ct g ccat cgagat aaacaacaag cccact aat t agagaaact a ggt aact t at cagat cccat taaagcgaca t cgagt ggag gt acct cagc gt cagccat c gt ggaaaaag ct ggct at ca at gaaagaac acggt t aggt agt gat ggcc t t cccgt t t a gccgt t t t gc caaggt cggg cgcct aaagg at t gt cgaag gagacat gga aaagcaat gg aacat agacg at t gt t ct ag aaat gggct a ct t cat gagg gggt t aaat c gaagct t t ca ct caagct at aaat cact cc gat gaact ga gat t t ggt t t gt t acaat ag at gcat gat c agcaccaagt t ggat aaggc gt gaaaat ct tttcccaaaa tcgagggcaa at cagaagga agat gaagaa acagt aaaag aaaagct gag gagaagaagt aact ggaaga t t gt t gggat aaaccaggtt gat t gggacg aggagacat a at ggcat aag agaaaggaag at t caat ggt act t t t gt ca t gaaggagt c t gggt gaaga caaaaagt ct gt caagt gca at gt aaagag acgct ct caa t t t t gt at ac gt t cct caat ct t aagaggg agat aacct t gt acacagag act cgt gacg agcgt t t ggt at ggaaagag caaggagaag t aaaat at cg agaagaagca aaaggt agcc gcct ggcat t ct t gagcagc ct t gact ggg cgaaccat ac tgacgagaca caagat t gt g ccgt t at act ct at gcgt t c aaaagagttt gct t cgt gag cagccaaaat aaaggat gcg ct t act ggat agacat gt t c gt t t gccat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 Page 580 12689250 Sequence Listing.txt gaacttggtc cagaagccag ggatgacgat ggaagaggga gacaccggat cacaaccaag agt gt gagaa gat t acct ca caggaat gt a gaggt ccgat cccaagaat c gagaaggat g t t gt cagggc ct gaaaacgt gggt gcacgg ct cagcaact t at t t ggat g gt caaat t at ct gaaagt t c gt gat gaaga at t cct cat a ct t t cgaat g aaact t gt gt gaat cat t aa t ccacat t ca ccagaggctt gaagcagt t g gt cggaat ag t ct t gt t cag t acaaaat ct aagaaaggca t gct caaaca gct t t cct t g t t aagat t gg t t at cat cat ggaagt gt at t ct caaaat g <210> 682 at aat aaagg gt t t t t t cct agaacat gcg cgcct aagga gt t t gcat t g t cgt agat ct caccaaaact t at cacaggc t gct t ct ggg gt ct t gagt c gt t caaact t gcact gcaat acat gaaaga t gcaagaact acat gcaat g t at cct cact at at t cgt ct ct at t ccaga caacagt t gc ttttcacaaa t gt t cagcac gat ccgt ct t ct t t at gt gc t gacat gcac cct t t gat cg acaaact aaa gcat aaaat g aat t t ggt gt t at at gcat c ct gat t caac ct accaat ac ggagaggact caggct caat cgat at gt at aaat ct ccgg aaacat acac gct aaat t t c t aagct ccct aaggt gggt c t ct aaat ct t t ccggaaaat t ct t cccaag ggaggagttt aaaaacactt ct gcgagat g agt at gct ct ct t gcagat t t gagcgt ct a gct t t ct caa gct t ccaaca gaat ccact g ct gcgacaaa t t gct t ccct aaagct caac t gt t gt ggga gt t t aacat a act ggt t ggg gaagacggaa t ct t caagac gacagat aag t gat gaaccc tccgacaagg cat t agagaa t t ga aggct gct aa gt cat gaaga t acct caagt at ccct ggag cccaaagat g t acagt aaga gat ct caat c gaaagat t ga at ggcaagt c at t aat t t ga t gggt gat t t cct caagaca ct cgt t aagc ggt t gt aaaa t t act gct t g t gct t aagca ct gaaat ggc aat ct t cagt gct act cat t t t agaccgt a gggt gt gaag ct gct cccac t cct t accaa gcaagt aaag cgt t ggaaca t cagat cat g cagcact cag gaat ct cgac cagaagacaa aacggcagtt gaagat t cca aaagaccagg cggacgt gac t t t acagt t c aact t gagct aact t ccaca t t agacagat act caagt aa acct t gaagg t t gt t t t cct gat ct ct gaa cagaaacttt t ggt aaagct ttcccgaaga gact cagt ag at ggaacagc gaaacgagaa t ggact t gaa gct t agat gc tgccaacgga ct gcaaagga t acct t cat g at t ggaacga at t gccaaga at t ccaagaa aacat ggcaa t ct t t at ct g gt acat gt ac t cgaggt gct act cagat at ccaacact ac at at ct t aca at ggcat cat aggcagcacc ct t aggcact t cat t gt cct t ccact t gaa agat t t cat c ttggagagaa gt t ggaaaac ttgcacagca gaat t t gaaa gact ct cat c at at acgct a aacaagcct c at t t gacaag t ct cccagat aat aacaaag gat cagt t gt gt act gt acg aaacggat gt gcagat t cat gggt t t t gt t gt t t t gt cat aaat aggt t t acaaaccaac aggggat cct caaact agac ct at accaga tccaaccgaa caagt gt ggg ggat ct t t ca t acat ct agt tgaagcacaa 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3504 Page 581 <211> <212> <213> 12689250 Sequence Listing.txt 1911 DNA Arabi dopsi s t hal i ana <400> 682 at gggagt t g at gaaaacac at t gat cct t gacaagacac at t cacat ca agaggcgaga at t gt t gaca ct gt ct gcgt ccaagaat ca gcagaat caa gt gt t t gggg gcgt gt gct a ggcagt gaca acaaact cca gaggt gaaac aat at t agt g gct at cat ca aagt act t gc gaaaaat gga gaagat t aca ccaaaagacc ccat ggct t a t ct gt gt t ct ct gat t gt aa aacaat t gt t cgcct t cct a t t gcct gat g caacgaacat gcggcat gga ccacaaggca gct t t gcaaa at ggaaaat t t cgaaaact g t gt t t ct gt t t t gt agcagt t t t at t at ct t t gt cgt gca t t at ct ct gt cgct ggt gt c t t cgt at gt a agt gggct gg ct t t t t ggt a ggacat ccga acat t cgat t cagact t cga caaaggattt ct t t gggcca t at gt gt ct c aat cat ct ac t gt cct accg aat ggcgaga t cagact cga acat cat gga at t t agcgaa cgagaggcaa gt gacct cca cct caaccag at at caagt t t caggt t at a ggaaacat t g cgcaact caa at cgacgaaa gaaaagt gt t gaggaagacc t at ccct t t g agt t t gt gt c t at aaccgag t t ct aaggct t ct ccccat t agagagaat a t ccgct t t at agct gct t t a ct t gagt t ca ct gcaat ct c cct caacact ct t t ggt at g tcccaggaaa aaacct ggag t ggt t t act c t act agagt a ggt gat acca aaccaaggga aaccaaacgc t gat ggt t gg cagct t cat a gct gaaaagt agat ggt gac aaccgt aat g t at agct t ct ct cacagcaa caaaaggaag t ct t gcat ca agat acagca agaggacgct gt ct t gt t ag at t gat t ct c at t cgt act t accat t gct c acact gaaaa cct caggt gg ct gaaat gga aaagaagt ga aact t at t cc at agaaagga acggt cacgg t cat gcccat t at at t gat g t t cgt ct at t acaagcaact t t gt t t gct g gat gaaat gg gagt at t t ga accgaagagg t at ct t t act ct act agaag gt gagagagg acaaccggat at at gt ggag act t t gact g cat t t aaat g t ggcgct cat ct t t ct aaga acgct ct acg gat t gt t cca t gaagaaggt ct t gcgt ggt agagat t t t g t t at t gacac ct cgt t ct ca cacgt ct cct t ggt t ct cac t t at act t ag caagagcctt t t t acat gct aaagcaaat g at ct gct t t g t aat cgaccc cat t gaagt c gct t t t ggt g ct gcagggga t gct cat t gg aggagaaaag aggaacgcat aagct ct t ct t ggat at gt t ct gt gt gcga ggcacccagt ct cat gaaat aact t ct ct t aagt t gaagg t ct t t caaag gggcagcatt caagagacaa t gt ccagat t gct ct cct ga ttacgggaag t gct t t ggct ct t cact t t c gt t ct at gt g agcat ct t t g ct t t cact t c t ct cat acca t caat at gt a t ggt acggt a gcacagt t at t t ggcgt gcc taaacgagct t gcccaaat c aggggt gct a gggt ct ccgt gat t t t ct t t aaacgt t cag gagagacaca t aggagat t t t cgt agt ct t aaagcgt gt t ccgggt gaag t gaggagat g gggt gt ccgc caat ggt t ct t t t cat t ct c acaaaaact c ct t cat t caa cgagaat at t t gt gt ccaaa t at gt cacct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1911 ccggtacctc ataaaccagc cgatctcgag tttgcaaagg ctgaagcata Page 582 12689250 Sequence Listing.txt <210> <211> <212> <213> 683 1083 DNA Arabi dopsi s t hal i ana <400> 683 at gagaat ca t t cat cgccg aacaaagccg gt t at cggcg act t t ct t ct cat gt gt t gc gagt t t ccgt t ct t t t gcca ggt gaggaac cat gct cacg ct t caccgga gt aat aggaa gct t t aat gt aat t t caat t ggaat cgcgg at cgt acagg gact t t ct cg cacaaaacgt t at ccgccgg gagcaaaaaa t t at gt t t cc t cgt gacgaa ct gaaggt t a t cacaggct t cgt cgt at t t aagacgct gg gt cacacgca ct cgagt t gt t at cact ggg t t cat caat g gt at gt caat t t ggaat ggc gagt cct t aa cagcagattt caagct ct t a cgagt ct aaa at acaagat c gt t act aggc agcct t cgcc cgagaagct g t at cgcgat g ccat aaagcg cggt ggcggt cgggat t gt t t gcacaggt g agcat cacaa ct t cgagggt t acaat aat g aat t t caagc cgct gcat ct t at gcaccct ct ct t ct t ct t gcgagt gt t gccgcaat t c aagt t ct t cc gccggcgt aa acgt caccgt gt cgct gcga cat t t t aaaa ggcggt ggag ggcgt ggaat t t ggaagt gg agt ccagaca ct t ggt ct t g t cgat ct t t t t ct t at gat g gcaggcat cc aagat gcagt t ct t ct t cat cacacgaaga ct t ccgt t ct cgt cgct t aa t cct cgcaac gt ct gaaagg t t t t gact ct cgt ct aagag acgagct cgg ct ggt gaat c gaat aat agt ct gccaaagc gt ggt t gcat t ct ct gt aac at t cgagt cc t cat ct at at ct aacact ag ct cct t cct c cgacgaggca ggccgccgga accggagacg ggggt t t at g agaagcct gg t t ccgt t gac gat t ggt gac gt t gcacgt g ccaggt t cag acat t cagt g t ct ct t t gct cgcccaggga gacgccggt a aacggct ct g gt ct ct agt g act t caaat c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1083 at ggct cat a t ct ct ct t ct t gt t ggagca ggggt t at gt cact t t t agc t aaat gggct t aa <210> 684 <211> 264 <212> DNA <213> Arabi dopsi s tha i ana <400> 684 atgacgagaa ccaaactgcg gcgtttaaga gaacatacca ccatcgtcca agaatcaatt gt gggcat gc atgcgcaagg agatgcagta agacatcgag gaagaaagtt tgtcacagag cct gt ggaag ttgttgtgcc aagtgtcagt gtgtgccgcc gggaacctcc ggcaacacag cat cat gt cc ttgctacgcc agt at ccgt a cacat ggcaa t aaact caaa t gt cct t aaa agact t ct ca t t t ct caact at ag 120 180 240 264 <210> <211> <212> <213> 685 1098 DNA Arabi dopsi s t hal i ana Page 583 12689250 Sequence Listing.txt <400> 685 at ggct t at t acat t accct t ct ggagat c ccat ct at ct cct gct t tag at t ct ggcaa t gt t t gcct g gct t t gct cg aagcgtgaag gacat ccaaa gat aagacaa gt agt t cact caat cact ca t ccat ct t ac gt cacaacac agt ccaact g t at at ggccc at t aaact t c at ggct aagt ct aaagcat g ct ct t gccgg t at cat gt ca tagtggcgag gacccgat cg ccggt t t t at aagacccgt g t t t t aat gat gagaagtcgt ccct agaaaa gt gaact t ct cggt t gt gat t cgcagcgct aagcgcaat t cat t t gggat cgt t gat cgt t t gt t aat ct at gt cct cgg gggcatag t t at aaact c t aacgct gaa caacaacaaa cat gat cggt agaaat gt ct gcat gt gt t a gcagaagttt t gaat cgt t t accgt t agaa t ggat ct agt t aggaat aaa cggt ct t gct t t gt t t ccat taaaagcaaa t gt t t t aggg t gt gggggt c ct t ggct cat t t at gt t gct accaccatt a aacgcagat g gaagcacaaa gt t t ct t t ac gt gat cgt t a ccggact ct t cct t t cgcca gcgat gt gt g aat ggat ct a t at gt ggaga gt cat t gct c at gggggct t caact ct t t g acaaat t gga at ggccat ac t t gaat gcgt gaat t ct t cg acct t cacag caat t ct t ct t aagt gaat g agct t aaaat cgt t gt t t t c agact t t agc t cgat gat ct ct t t cat cac cgt at gcgag at t ct gt gga aacaagaaaa agat at t gga ct gat aacaa aaggaatggg caat ggt gt t aaaagat at a gct ct gcagg gaccaaagat gtgccgcagg t ct at cct t c caaagct gaa cat t gcaat c t cgt t ccat t t t cgggt gt a cacat ccaaa gat gat ct cg gagaact agt tacacaaaac ggt caacgaa act aggt at c at gt accgt a act cggt ggt t t t ct t t t cg t gat gagaca at t gt t gat t acaagggaat aat gt ct t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1098 <210> 686 <211> 291 <212> DNA <213> Arabidopsis thaliana <400> 686 atgggtgata cagatatgaa gcaaacaatg aaggaagatg ctcttagttt ggcctccaaa gcccttgatt gctttgatgt caccgaacct actcaaattg cccgtttcat caagaaggaa tttgatagaa gttacggatc ggggtggcag tgcatcgtgg ggacgcattt cggatctttc gtgactcatt gcagtggctg cttcatccat ttctcagttg gcagccttac cattttgctt tttaagggat cagtcggtga accggccccc ccggaccgat gctctagct a <210> 687 <211> 1776 <212> DNA <213> Arabidopsis thaliana <400> 687 atggaaactg tagaagcggg gcatgatgat gttattggtt ggttcgagca tgtgtcggaa aatgcctgca aggtccaaag cgagacgcta cgaaggattc ttgagctcaa ctccggagtg Page 584 120 180 240 291 120 12689250 Sequence Listing.txt gagtatctga ggaaatggct tgggactgtt gatgtagaaa agatggacga gaaact ct ct caaagaat t g t ccct cagct gcgcaaact a ataagggaag ggaggat t ga aagcaagaga t t cggt cagt t t t gt cgcct tggagagaga aagat gagaa gaggagatct t ggccaaat g aagctgaggc agct ggat t g at t cccact t at at gcat t g gggcaggagt gacgt agt t g agaaaact ca gt t gacaagg agccacgcag gaagcagat g t at ggt t at g gaacgaggaa cagt t t aaga act at caaga t t act t cct t cagatggaga ct ggaacaac ct ct t cagat gagggaggat cagt aggaac caacaaaat c gcacgt at t g ctgct t t t t c t at gt gct ga aagct gt at t gct t agagct caaaat t t at at t acgct gg gt gt gaacgt t t agt t act t acggt gact t at gaact t gt aagt gaccag t ct t aaccat cgt ct cagct acgt t at agc at aaggct ct t agt gt cgag cgt t t gggaa ct cct aggt g gat t t cgcag ggt gcct at t gact t ct cca agaagggaga t t t ccgat t a t ct t gagt t t ggccacaaca ct t cact t gt t cat ct cct a t t acaccat t cat caaggaa ggct ct gat c ggaaacaaat ct cct caat c cggt t t gccc t gat cct cat cgagt t t at a t gt cgaagat t ct aaccact ttttcacaaa caacat t gac t ct gagccga t cgt cct ggt t gaagaat gt gcgcat gaac agtggcggaa t acgaccaat t t cagct t at gt t t cacat g t t act t acac caaaagt at g t cagcggct t at at acgccg cat t act at g agcccacaag ct t ggt ct cc gt ccaagct t ggt aat ct t a agaccaaacc ct cggat ggt at gacgggt t ctggtgagcg ctt cccccag ccact t t aca aaacct gt gc t t cacaggt c ggcacaccga aaaaacacag tccacacgag cact at gt aa tgcagagaga t cgat cggac aggt gcgt t g t ct gt t at gc gact ag ct gat t t aga aagaacccat t t ccgt t t ac acagat caag ggaaggaatt cgagcgaaga aggt cat ct c at t act ccag tctcct t t t t gct caagaat cgt ct ct t gc t cggt t t gat ct at gct gcc cagat t acgg aagacgtgag ggcggcagaa cgct ct ct ca t t t at aggt a agct aagt t t agaaggat ct ct gaagt cgt t ct act ggga t ggat act gc cgct t gagct ggaaatgcgg t t gacat act t t at act t t g t cct t at at t cact gt t ct g t cgccat agc gt t t t at cca t aagacgct a gt t t aaaact aggaggtgat ccaagt agag cgaagaaat c cact ct gccc t t ct cat at c t t caaagct t gt acct gaat gt caaccgag ct t t gct gt g t caat cagac agt caaact c cagat t agga cat at acaga t cagagagt t t gact t t aca aat cagagga t t t t gtggac acgagt t gt g t ggct t gaac t aat gat t ct 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1776 <210> 688 <211> 309 <212> DNA <213> Arabidopsis thaliana <400> 688 atggagaaga tatcaaattt gttagaagac aagcccgtgg tgatattcag caagacgtcc tgctgtatga gtcactcgat caagtcgctt atatctggtt acggtgcgaa ttcaacagtg tatgagctag acgaaatgtc taatggacca gagatcgaac gagcacttgt agagcttggg Page 585 120 180 12689250 Sequence Listing.txt t gcaaaccga ct gt gccagc t gt ct t t at a gggcaagagc t cgt aggt gg t gcaaat caa ct t at gt ct c t t caagt cag gaaccaact a gct t cgt t gc t ccgaagagc t ggagccat a t ggat t t aa 240 300 309 <210> <211> <212> <213> 689 1191 DNA Arabi dopsi s t hal i ana <400> 689 at gggt t ggt aagt t gt t ag t ct gct gat g gaagaatttt aggaaagatt at ggt gagt g ccagt cacgg tggaaaagaa gcct t ccct a gat ggt gt cg t t t gt ccggg agt gt t gt ag ct t ct at t ag gaagaagacc t at t ggct gt at aaaaaat a ct ct cact ga gaact aaaga gaagaaagt g gact ct gccc t gt ggaggaa aagagct cat agat ccgcaa act gccaat g ct aat gt gag gt cacaagaa t ct at cat gg gaat gaagat ggcccct cgt cgaagct gac ttgaggcaga t ct cagagaa gat acaaaag ct gaagat ac caaaat t aaa t gggt caaat gat gcat ggg agat ccaaaa aagat gaat t aagacat t t c gaagaaaaag cgaat gt t gc agccaccaac gt at t caggt gggaggagat ct t t at gaaa t gt t aagaaa agcagaagat at at at gat t t gat t t ct ct agat ggat t c aacagat gt c ct at t t cgag t gat gaat t a gaaagaccga ct t agaacaa t cct agt gaa at ct ct t aat t aacgact ct t t ccacagt g aat aagaagc gat ggcaaat at t t t cagcc aagaacgaga ct t gt gt gcc t t ggt t ggat cat t at ggat at cgccact g t t gt ct cat c cact gcgt ct t at agt t act t t t gcct t t g cat t at cgag gat t at t t t a ccaat ggaag gagct ct t t c gaagt t ccaa aaagat t ct t t ct t ct ct t t gt cct ct caa t agaat t gga ccaat cccat act ct aat ct accat cccat gcaacat agc gt t gt ct t ga t agaaat aga ct t t agct t a ggaat at ct t caat cccaga t t gct gat aa gaat ct t t at gagaagaaaa acaagaaaag agat t gcaga aaat gaaagc cgat ggt gga at cgaggaga ct t caggt ca accaaact t a gagaggagcc caaat t ct t c t gt t cat cag gat act cat c agt t t cat cg gt t gaaat at tgagaagccg cct t cacact at t ggat gaa aggagaaaca t t acgt t aac gggt ct aacg agaaagt gaa acat gct cga t cgaaagat g t t t ccggat g ggt ggccaaa agaagaagaa aacccaat t c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1191 <210> 690 <211> 1206 <212> DNA <213> Arabi dopsi s tha i ana <400> 690 at gat gct t t acagcaacaa tagttggaga ctctccattg tggccgccgc agactccgcc gtggttaacg gcgactttga gacaccgccg gaggacacct ccgagatccc gagctggcga t cgaat t cca ggcaaaactt t caaacggct t ccgat ggt a Page 58E t t t t aat act caccggt cga t ccct gat ga cggt ggagct t ct act t ggt agacggct t g cgcaat cat c aat caagt cc 120 180 240 12689250 Sequence Listing.txt ggt caaaaac ggaaacgat g acgt t cagt g t ct gat gaac t gggat ccat aaccct ggca aagct ct t t a ggt ccat gga gaaat at cgt t cagaccat t ggcat aat t t ttgggacacg caagcacaga aact t cact g aggacggacg t ccgggt ct a at ct ag aaggcgggat cagagat cag cagcccgcac ct at cgcat c at gcat gggc t ggaggat ga ct cct gat aa t gt t t aggaa ct ct t cct gg t ct cggt ccc ct caaat ggt caggggacaa act t t cat t a cgaaagct ga at at gact t c gt agaat t gg gat ct t gat c ccaagaactt at gt gcacaa gcaaaccatt gt t t gaagcg t cct act t gt acccaaaggc cact acct t a at ggaccgt c cgaggggaag tgagacaaag gt gt aaggaa t at ggcgcaa acgt acgagg at t gt gt gga at t t agt t t t gt gcct gagg acggt ggaga ct cgagt cgc gact t gcaaa gt t gt ggat c ggacct at ca aat gcagt ga ggt gt t ct gc gaat cgaacc cgagct t t gg gcgaacat t c cct t t ggct g gcaaact cga at cgcct t ct cct gt gat t g ccgct t t t t a gccgt cacgc t gt t agat t a aaggct ct at t gaat gt t t c cggt gt acag gcgt ccggt t t t gacgacat t t aat ggaga ttccaacaaa gagcagt acg aact t t t at c cgt acaagat t aat ggct t t gt t t cgaaag acagcat t t a at gacgt t aa t t ct t ct t t c ct act ccgt c ggt agct t ct cgt t caagga ggt t t t t aag t gccgt t aag ttttgaagaa cct cgat gaa gt t t at t gac gggcaaagaa gt ct t t ct ct t gct ggagat at cggagt t g t t acaat acg ggt t t ggt t c t t t ggt t t t c 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1206 <210> <211> <212> <213> 691 1203 DNA Arabi dopsi s t hal i ana <400> 691 at gt t caaag act t ct cat g at cat caaga gcaagct cct agccaat t cg acggcaccag aaggt gat gg t cgct gt t aa t at gat act t gt t agggat a gaat t caaca cat at t cgt a aggaaacaca aagaagaaat gt t at t caat at agccggt c t caat ct gaa t ggagat t ga aat ct ct gat agt t t ct t ct at gaat t gat cct ct gt t ct t gat ggagaa acaaaat gga t t ct t t t t t t acacgaagcg aagct acaac ccct ggcct t t cat ct ggt g t acagaact c t t cagccat g t t t gagct t t gt ct cagt ca ggagt t t gag t t act t gaat t ccagagt t c gacagct ct c cat t cgaaca aaaagct aaa aaccaaaaaa ggt ct t cct c aat cccggag t t ggaaccat at gaaacct c ggaat cgct g gaggagt t ca t ct at gaaat caagaat t ag t ct gt gcgat accgct t at c agt cct gaga gagagggacc t gat caagct aagacct ct g caagacaaga ggaaaccggt t gct cat gga at aagt t t gc aggaaaaagg ccagt t ct ca ggaaaccggt cgaat gt t ca ccacact t t c aaagcaggct t at acaaaag t t gct t cat g caacaccgaa gat cat acct t t ct t cat t t t gt t cat gag aagacaagaa at t cgat at g gct acgaccg t t t ggat ct c at t t gt caag agagt gt t ct caagt cacca aaat cat ct c 120 180 240 300 360 420 480 540 600 660 720 780 Page 587 cacgct t acg acagt act ct at cact gct g aggcgt gt ac at act gt cat aacaagccat aaggaggttt tga agagcct t ct ct t t acagaa ct ggaact gg cct t t t gcgc gggct gt gaa gt t caagt t t act acagagc 12689250 Sequence ctctttaatg ataggcaatg at cat gt gga gagct ct cag aatcgctgtg ctcttttctg aaat aagt t c t t cgacact g t agact cagg gaggt gat t g gaaagatgac gaaatcataa t gcaacggt a at cgcggt gt Li st i ng. t xt at cat cgaca agct t ct gac t cgt at gt ag ggct t ggt t t t t cat gt caa acagt gt gga t t gcgct t ag caaacacaca t cagt t ct ct cct t gct t ca gagt t t ggt a taggaaagcg gagaagt at g gt t t gcat gt 840 900 960 1020 1080 1140 1200 1203 <210> <211> <212> <213> 692 1185 DNA Arabi dopsi s t hal i ana <400> 692 at ggaagcaa aacat t aaac at t ct t gcaa t ct ct cct gg ct t ggagaca aat gaacaaa aat gt t t cga ggt ccaaaga agat ccacgg ct aagt t gt t t t aagt aat a agaaagccgg gat aaggct t cagat gat ga t act ct ccca caat t cct gc caaat gct aa ccat t ggaaa act t ct t t t a t at agaggaa aaccct t agc caaaat t aaa agat t cgaag at t t gt at ga cacaagt t gt tgaacaacaa aaagcaacaa at gt t gaagt aat t gt at t t gt t ct t t aaa at t cagat ga t gact aaaag cat t gt t gga gt at gggaaa t gggt ct agg ct at gaat gt gct t t ct t aa at t gct ct ca ct caat t ccc gcaacggttt at cat cat ca agat gaagat accaaagaac gaccgagt ac t ccggt gagt taagaagaag at gt gt t gaa t act acagct t gct t ct t ca gaggaagt at t gaat cagat aaaacgaagc t gaggct at c t ggat t aat a aat gcat at g tcaagcaacc t cacccaagt gccat t cgt g aaagt ct gcg t agt t at t at t ct gaaccaa t at at ggagc aacggt t ct t agcgagggtt cagt ct aagc ct aaagt cct t cat caacat cct cct gat g t cgaagt t t t ggagat at t g gat gcgaaga acagaagt cc aaat at at gc agaccacct a ggt gcagcag ggt t t t ccgg ggact aat t c gt gcct t cgt t ccgcct caa cgct cgccaa acat gat t t c t ggt gt gt ga ttcaaaagca tcaagaaaaa cacaacaaga ccaaaat cga t aat t gat gt agcaat ct gc ct cgaggaac aagaagaaga cacaagt t ca at aagt t at a ggaccct t ca cgat gt t gcc caacaccaac ggat gaacaa caaacact cc gt gt t t ct ca act t agaaga act aa t ccat cat ca aaat gggcag acgt aggcaa cat caagat t taaagaaacc at t t gagaga t t ct gct aaa agct gt t ggt t t cgagagat at caacct at tgcgagaaca t gaaagagat act t caagt t aat gggt cat at caat accg tgcaccacca t at ct t t t ct gact caggct t gcaat gcaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1185 <210> <211> <212> <213> 693 1200 DNA Arabi dopsi s t hal i ana Page 588 12689250 Sequence Listing.txt <400> 693 t t t t t act at t ggaccacgt acact gact a acgat gt ggt gct aaaat aa t t cat t caaa t t at ccat ct at t aat t at t t t gt at at t t agt at at aat at aaaacat a t caaat acct aat at aaacg cgt cgct aat at gaaacat g ccgaaaaaaa t gaaagt ct t aacgt aagt g t t aaggcct t aaagaat aaa ggaat at gt t gaacagat ac at t gt cagt a t gcacaaggt at t t aat t gt acat t t ggat t at at gcat a aaagaatt ag aaat t at t at t gt gt cacat t ct t t t atca aagat at act at caaacaaa t t aagt t t ga tttttttttt cagcaagaaa caaaagactt cat ggaccga cact gct cct at caaat act acgtggacgg at at at ggt t t t ct aacaac ccacgtt ct t at aaaat t ag aat at at gt t t gaat aaaaa aaaat t t t gg gt at at t t at ct aat at t at aat t aacct t t t t t at t t cc acgat t t t t a t cgact gct c gt gcat t t ga ct aaacaagg ttaacggaaa at t gcacccg t at cat ct ca t act t t t gaa aact gaat t g gagt t gagca gt t aat t t cg gt at at t gt t t at agt t t at t act at gt ga t t at t caaaa aat t acaggt at t t aaat t t tagaagacaa t t cct t t agg aaaaat cct t gaaat caaac at gt t ccgt g t t t t aat aat aagaaaagaa aacaaaaaac at cacgt cgg aact t ct cat tccgtttttt agt t t ccgaa aagt t caaaa tggtgcacga t t t t aat gt g at t t t t t t aa t agact gat g tgt t t gt t gt at at at gt at t t ct at t t t c gt t ggact t t t gcccaaat c t cgt t t gat c gt t caagt t t acaat gt t aa tttttaaaaa cggt cggt ga aaaaaacaaa t t ccacaat c t t at t aat aa tcat t gt t t g t ggaaaaat a t ggt ccgt ac aaacaaaat g ct aat t gt gc t at gt at at g gt ggaacat t t gt t cagat t gt gt t gt at a at aaat cgt t t cat aat aaa cgt agcacca t aat at t aat gat cgt t ggt aact gaat ac ct gat aacac ggaact ct cg aact ct t gt g ct ct cct at a gt gt cat t ca at cgat t at g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 694 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 694 gaacaaacga agaggaagaa gagtaaatct tcatagtacg acacgtgtga atctgtttaa gataacttga aaattggtcc aattagtttt aaagacaaaa taaagtatag agatgatgaa ctttgcttat cttgctccac ccaatccaaa cctctatctt ttttcaatcg agtcgccatt ctcatgaaac ttcatcatca tcgaccttga gatctctctt tgaactgctt ctgttgttca gaataatggg tattcttgtt gcaggtttca ggagtctttt ttgtctgaat cttcttcttg gct gt t accc t t gat t t at g aaaat gt aac t t aacggct t ttgagacaca aacat at t t c ccct gat agg aggt t ccaaa gacgacaccc t t gagacact Page 58. aacacacagt aaaaaagaga gaat agaccc tggacaagcc at ct ct at cg at caagct ca t t at t aaagt cat gct caga at t at t gt aa gt t t ct t ct t acaaaaacaa agaaat t at t aat caaaaga acacgat t cg aat ccacgcg gt ccact ct t t act gcat t t ct cagt aat t at ct gct gct ggt ct gt gat 120 180 240 300 360 420 480 540 600 t t t t gt t t t g ttctct t t ga gct gt aagaa accat t ct ca at t t at t gaa t t ggat t cat t t t cct ggca t caggt gaca cct caagcca acat t cggaa t t ct cat t ag aat t gt at t t t ccaagact g ggt t agat t t gt cat ccaat t t t cat agat at t ggcgagg at gcct t ct t ct gggagt t t gt t t gtgcca 12689250 Sequence gacgttcttt attccggttc ttttgttcct tttctggtat accaacatgt ttcccctgt g t ct t t at aga agt gaaagt a ttgagctatg ggtgttacgc aagtaagtac ctcataacgg ctgttcttcc ttcaggagag tggtatctct aatggaacac cttgaccggt act atcgaaa gaatgagt t g gacagaataa Li st i ng. txt agat t t act g gt t gt gaaag ct cggccaca aat aagct aa t t t gt t agt g gt t t gt t t ct agt t gat t ca gcct gcagaa aaacagaaca act acat ggt gat cgt cat t ct at gagat t agctgccgt t ct ggt agcaa ct t t gggt t t gcagat t t t a cagat acagt gagt t t cgt a accagt at ca t t at aacat g 660 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 695 1200 DNA Arabidopsis thal i ana <400> 695 at aggt at t t aaaaaat t ag aat t t t at ca aacgt t t aat at t gat t t at t aagaacat t aat aaaagga aat at acggt t at at acggt t t at caccac t cgt tat ttt t t agt aaaac t att acagga gt t gacaaaa caacaat t ca aat aact at g cat aagaaaa at acgt gt ga ct t cat ct cc atagagagag t aaact t t t a at t act caat aaaaaaaat a aagaaaat aa caaat ct caa gat aaact t c acgt t ct t cc t t acaaaat a t t t aat agt t gat t t t t gt a t ccat t aaaa aaat t cgact aat aaccaat aaaaat ct at cat t t gt t t t at cgt t t at t aat t aat agt ttcggcacga accaacgat g aaaaaggct c gat gat t caa aat t aat at c t t act aaaaa at at t gt at g gaat aaaagg t aaact agaa t at gt t aaca t at at at at a accgat caca t t act aat t t at aaat acgg aaaat aaaaa gat agat aat acat gt t t cc at aacaaat a t t t gt caaac aagt gt gt aa t act t cct aa cat t t t cgt c t gct t t ct ac ggaact at t t t cat caat at acagt t t aat at t at t agat aacgt t gt t c ccaggggtaa cat t gat aat t at at at at a aaaact at t a t at t gt ct t ct t t gt gt ca ggaaaatgt t aat at at t at gagacgt t aa gggacaaaaa aaat ggat gg gaaaaaaatt aagcat acca ct cagat ct c at t at t t acg t aggt t at aa t gct gaaaat aagaaaat at t t t t t ctgat ct at gt t t aa aaggat t at c t ct t t aat t t t at at at at a aaat t aat ag aat t at t t t g cgaat aagaa t t gat cacat t caat t gt ca aagt at t t ac t gat cat t t t t t t at t t aat aaaaagt cac ct aggct ct c at ct cact ca at t at acact t aat gt at ac cacat ct gt t t act gaaaat aat at gt t t a ct gaaggat a aaat ct caat act ggt aaaa t at at at at a t at gaaaat a t cat ct t t t a aacat at t t t t t t t at at t g t cat t t ct t a aaaact act c aagct t at t g t aagt caagt aaat ggt ccc t cct at at aa caagagagag t t ccaacat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 590 12689250 Sequence Listing.txt <210> <211> <212> <213> 696 1041 DNA Arabidopsis thal i ana <400> 696 t gagcaat aa ttagat t t t g ct t t t gt t ca caacaaaatt t t t t atgtcg gagaaatttt at t cagat t g aat gt at t t t ggagct t caa aagaaagct c attcaaacac act t ggt t aa caaaaaagt a cctaaaccca ccgcaaagcc t t t t aat cat t t t aaccttt aaaacagagc t at aagct cg ttgtgt t at a t t t act t aaa ct aaat t t t g t at t at at t g gatccaaaaa at t at t gaaa ttgtcatgga ttttttttta ct ct cat t ct cgt t gaagat ct gt at acgt aaat at gaat ttagagaat g tgagtggat t t cact t t gaa aaaactaaaa caggaat aat at aacat gga taagaaagaa aact agat ca tttttttaag aat t aaat at aaaaaagaaa t caaat aat c aat aat aaag act aat t gt t tgt t ccggt t tgt t t gctat act t ctattt tt gat aaagc ct acgt agag cgaaccact a t t at t at aga cact aaaat a aagtgtatga at gtt ccaat taatcaacac t aaaaaat gt agaacaaaga gagaaataag gaat t t at aa t t t gt t aat a atggaagat g aaacaccatt act t t at t t t ct t at agt t g agt gt aaacc aaccat t ggg ccacaaaaag caccggacag cat aat ccaa agcaaaaaac aat aaat caa ggat t aagat t at agcat aa aaaat aagat atctagaaca t t gaat t t ga tagtaggcac ctggaatgat aat gacat cg t at caat aat t gcgat aat t t gagat t t t c cccat aaact taagagt t t g t gact t at ga agaagacgga at aagat t ct gaaagaataa ct t t t t cat t at gtt cagat acagaacaaa aaaacgatac t ggaat t t at gat cgaat ct ctct t t t t ac gcacacaaaa aagt t gcggt accacaagt c aagat t t ggg t at t t t at t c ggt t at t t ga t aaagagat t acaaaaaaca 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1041 <210> 697 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 697 aagaaactag agatttaaat taccttttta aagtaacaat aatagcaaac ataaaatgca agcaagaaaa atatataaaa agat t gaat a aaacggaagt gacgagaatg ttatatatgg catcgataac agtttgtttc atgtctccga tcggatccat atttgtttga atttccataa atatatatat atatatatat atagcatttt caacaatggt ttgtttatac atagcaactt gagattgatg aatcatccaa ttttcgatta t agcat at gt tataaaaaca aaaat gat aa gacaatggga at agt at t t a ct aagaact a gcct at gt ac agctagt t t g taggtgt t t c Page 591 t ct aat at t g tgat t gggaa agataagaga cgagcatt ac t act acat ac ct cacaat at cct aaat t ca aaaat t t acc ct ggt aaaat t ct t gct agc t t t gggat ga at t aaat ct c at acat cagg gact gt t t ca at at at at at tagccacgac accggcgcaa gt aaat ggga 120 180 240 300 360 420 480 540 12689250 Sequence Listing. attctg aaaacttggt tgaacgat aacttcgaca gcatgcagtt tctt atcaaccaaa gaaaat t t t g tttgcacaaa cagactacca act at t ggct cccatgcaaa ggt agat aaa gagaaggcgt cacaagtaga tat t t ct t ca <210> 698 ccat gt cgat accgct t agt t t at gaaat a act acacttt tggt t aagat gccaaagcag caagccatca acgtggtgt g aaat aaact c cacaacacat agt ccctt at t t t atct t gt gct t acact a t cct aact gt ct t at t t ct t ccgct t aggt act at t aat a t t t aat t t gt accatcattt acat acat ac gt t gt at gt c t cat at at at t t act at caa gaaaat ct ct gcat ct t gt c t acat t agat agccaagat c ggggt acaat cat t at aaat att gaaaaca taggggtg agt t t t ca ggtat t t a gaat t t aa caagtcag att gagag t t at ct t t t agct cgt aaaggt t g aagt agt a txt gc tgctcttgag at gctcaaatcg icc tttcaaaaat itt cgacgaaggt t a t actt gt gt a at taaccagccg ag aaaaat at gt t a at at t agct c t a tt gt accat g at ctct t aagct iat aat cat cat g 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 698 gt t at aaat t t cat caaat t at t gat aaaa t t t t ctgat t gaaaat t t t a t acaaaat t a t t caat t t aa cggt gat ct t at ct at t gac t t ct t cagga taagaaacaa ct t t t at t t t ct gt at t at t aaat t t gaaa aat gacat ga at at at act a aaaat gt gt g t ggacaat t g gt gagat ct a act at t t at t t t aaaat t ct at at acat aa cggagggagt aagccagt gc ggt at act ga act gcgt at t tttttttttt t gt t t ct ct a ct t t t cagcc at aat t ct t g t t ct gact gt tacaggaaga t agaagact t aacaact at a aaaaaat at a taaggacaag caat gaat at t agt t aacaa cat gct aaac caat t aat t g act caat ct t at at gt t at t at at cagt ag agat agaaga agt gt t t gga tttttttttt cgt cact at t at aggcaaga act t t t t t t c aaat at aat c t at ct t caaa aaat t agaat t acaat gt t t t caacaat gg att gacaaaa t act aaaat a aaaaaaaaag t at t t ct cgt at t ct t gaaa t t t aat acaa gct t agaat c t caaaat t gg acacaaaagt aaaaaaaaac t t t t t ct t t t t t act t acgt gt gat agaga t t t t at t t ga t t aat t gcca acat t t t gaa cat gaagaaa ct t aat aaac caaagct acc aaat agt t ac ct cacacat g aaaaaaat ag aact at t aac t t cat aacct aaaaact t t a acagat t cat t aaat gat at agat cggt ca aaagt gt at a t ggat aaat c cat agat gt g t act cat at g aaat t gact g aact gt ccga t gaagt aat a aaaaaaacac t t cat t t agg gat acgaaac gaaaacaact gaccat gt at t t t t caaat c caat agt aat t t t aat at t g aaaaat caat at caggat t g acgaaggcgg cct agagt t t caat gt caat t caat gggt g gcggcat at a cat gaaacac t agat at aaa t acgat t t t t t at gaaat t c aaaacaact g gt at act t ac aat at t agga t ct at t cat t ttacaaaaac t ct at at aag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 cgatgtttac gaaccccaaa atcataacac aacaataacc attatcaact tagaaaaatg Page 592 12689250 Sequence Listing.txt <210> <211> <212> <213> 699 1200 DNA Arabidopsis thal i ana <400> 699 agaaaacat c at gt act gt t cat gcgt t ag gaagt t t gaa at gt t t aaaa t cat caagaa t cccact t t t t ggaaat gat ct aat t t at a t t ccaat aat gact caacga gat aagt cca gat t agat t t t aat cat ct t t cat t aat ga t t aggt t t ga at t t ccat t g tgcccccagc at at t aacct aact t t at t t at t at aagt t t t t ggt cat t aagaagaaaa t t caat t caa gt aaaaggt t gggt aat cat aat aat ct ca aaaaat t t ga tgct t t atta t gt caaagag t ct aaccaat aat gt t t aaa t gt t ct cgcc t gact gagaa t gcgact at a gagagt t at a aact aaagt t t agt cacat a ttct t ct t cc gcaaaaacag at t aat aagt aagt ggt t ca t ct gaaaaga ccgt t t t t gt t t t aaagt gt t t t t t t t act t aaaat aaag t t t t act at t gt t gcagat a gat t t ct aaa cacct acaac acaaaaaaaa t t at gaat at aat t gat gt c at t at t at t t tagaggggga t t caat agt t ct cat gat t g t t t act t t ct agtttttttt t t aaat agt a gt at t t t at a t t ct ggt aca acgacaaaag agt t t gaat t ttagt t cttc gcaacaagcg ct t t ct cgac t caat cat ag gt t ggat gt g gt gat at at a t t t ctgt t t g agaaaat t t g agt aaacaaa cct aaaaat t acat aaacat t at cacaat t caaagaagca t at t ct t gac caaaat ct ct cat ct t ct at t ct t at ct t g t aaaact cat ct t t aat t t t t t t aat t aac t gact ct t t g aat agcaat c t t aaaat t t g caat t gt t ca t cgaaaaat g act aggt cat gt t gaaat aa aggcaaaat a at ct at t agt t aat at agt a t ct t agat t t aagt t t gt ga t t t aat aat a ct ct ct ct ct ct ct caaaag agaaagt t ca gcggtcgacc tt cat aaaaa t t t at cact a gcct t t agag aaat t at cat at aaat t cca t ggt aact t a aaagat ct aa at at t ct cac aacgaaaagt agat aact ac gat agt t t ga t caacat t aa gat aat t t gt t t at aaaat t at aact at t t gcct ct ct at aaaagcagac acagaagat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 700 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 700 ttttctcatg taaaactctt tgacaccatt tattgacgat gtcattaatt tgttaattgt tttttgttta atttatttat ccaatgtatt attttagttt aagtaattaa gagagtcctt cagagagtcc tttaagaata ataaaaaaat aatcagattg ttttaaacct gcacacaatc at catctcaa ttatctcaca aggttcaaca ggt aaaat cc at gat ggt t a at t t t at t t t t aagaat aat gt t ggt t t at caacaaaat c at cgt at gaa Page 59, aat cat t t at at aagt t t at aacat acaat aaaaaaat gt aacagagagt at ct ct t ct t t gt gaagt at ct t ccgat ct ccaat gt at g gct aat caaa t ggt t t at aa cct caat gca gct aaagct a at agccat ac 120 180 240 300 360 420 12689250 Sequence Listing.txt acccccat ct ggat t cat at acaaaaaaaa ct cat ct ct t t acggt t gga t ct t aaaaac aaaaaaaaca gt gagt agt a t at aat t cct ttggcaaaga caat gact ca aaaaaaaaaa acat at ct ac gt aat t aacg agt t t ct aaa acaacct at a t t t t aat t t a gt t ggt t t t a gt at agt agt ct t t ct gaaa t act agt act ttct t t t gt g t t cat t at cc aact ggaacc aaacagct at acacaagcaa t gt ccgaacg aaaacaaaaa t gaat aat ag t aat t t t t t t gt t gct acaa gct aaacgt c t ct gcaggt t gt t gt at t aa gtcacggggg aaat at at aa cact cacat t caaacat at c taaagagaga gat t gt acat gt at t aat at aaacgat cct t t gt aat t gt at at agt agc t acgt acaaa agaaaat t at t aagt cct aa t aacat t at t at aaat cagt gat cagat ca gt ct t gt t ac aaaaaat at a tagagcacct ccaatgtat a t t t at t t t t a cccaat t t t a at gagt t t ct aaat gt gt t t at acgaat at t t aagagat c t cgaat t aag t t t cgat t at tcacacacac t aacact t t t t ct at t t at a aagct accaa t at ccaaaaa gagat t ct ct act aaaaaac t aaat cggaa t gaacgt at a cagt t gcaat accat at cat t ct ccat t ga aacaact ggc agct t cagaa ccct aat t cc agagaaaat g 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 701 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 701 taagatgatg tgatctcatc gattaaggtt atgacatgtc aactagtcga ttatgagcat ctctctctgt aagat t atct ttgcatcaaa ggtccttgag agagaagaag tctctttaat agt agt aaaa t ccat gt act t at acgt aga agagtacaca cct aact t ca cct t gct aat tagttcatct cattactcac tgtgacccca tttttgtctt agtttcaaca tttgcaggaa gataactttg aaaaaggagg attgatgggt gt at gt ct aa t gagagat at at t gcgtagg tatattgatt gattttatcc ggctattttt gtaagctaca atttaaacac aagataaatc atatataccc catatttacg tctcctatgc ccaaccataa ttgcaggtt t ataaaacat gcaccatgga gtcaatttta aaat t ctat g at actaatga aaaaat t gag at cataaaaa gaaaaatatg tacaagtgaa atgtggatca caat ct t t t a cat t aat gat gagacaaaac t aggt ct t t c cat aat aacc gat ct ct t ac t t t cat ct gc t cact t t aag ttgt t t t t ga ggaagt gaat acct accgcg t gt aat cat a t t ccat acgc gaact aacga aggt t t gat t aact aat gaa t t ct t t agct aat t ggt t aa cat gaacat g at ccaagaga t at t t ct ct c aaat aggt cc ct acaaact c ttacgaccaa aaacaact aa t gagat ggat gt agat t aga acacacggt a t agt at agt a t aat t agt t t aaaaaat aaa ttcaaaagga gaagt agat t at at at at at ct at ct ggt g gaccacct ct ct aaaagt at t ct ct ct at t gt gat gt t aa t cact gt t t t t t t ggtaagc ct aaaaagga at t t gt at at ggt ct ccct a t agat t cat a ct at gct t cc tttttttttt at at aaacat aaaat ct aat t at t t at t t t aggt ggat aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 594 12689250 Sequence Listing.txt acgtaaaatc acgcacgcgg ccatgctgca ggggttaaat agcaattatg tatttacctt aaaaagaatt tgggaacctc tctttaaatt aatgcaaaac ctcatctctt gtctttctgt ctctgaaaca gtcaacctca ttct t ctcaa actcactctc cctgatctct ctct t ccat g 1080 1140 1200 <210> <211> <212> <213> 702 1137 DNA Arabidopsis thal i ana 0(7 <400> 702 t atcaaaggag 0 ggat at caaa Cr ct aaat aat t 00 0 t ccat t t at g C1 ttt ccaaaat agct t aat gc t gggt ct aaa gagt gt gaga gt agt cgt ct act ggt ggga t t gacct t t t gt gaacaagt t t cagccact aaaaagt t t g ggtctct t t t at t t ct t act ccaagct gat t t gt cat ggt gat t agt gga gaagaagaag agaat ct aac t t cat cct t t t caaagt gt t t t caaact t t t gaact gt t g aaaaat aagc at agcgcgt g agat t cccgg gagat t aacg t aacct t t ca t gat gt t aag agt at gt t ga aggt t aaaga ct t t gt t t ca gt t t t ct t at t t t gt t t ct t ct t t ct t at t at t t t gt aaa aagaagagcc tttttgaggc cattcatgaa ttggaatgaa acaaaggcca ct ct ct ct ct gt aaat t cct gt ccccattt aat aacgat a ccaat at aaa tagtgaaccg gt ccact gat at ctt aagt a ggtagtcccg atggacaaga cat at ggcag t aat t at agt ttgatcaaaa ccaacgaaat t cat t gaat c gt t t gatgaa gagaagatct cgt cct t cct gtctctggt c caagact at a agt ctt ct ac tgggcct t at act at ggccc cacgagaat g gt t t ctagt g ggt cccact a gaactcgtgg at gt aactt g t t t ct t t t gt ggct gagat t gcaaat cact ct ggaatt aa at ct ct ct aa t aact t gact gaagt t gtgt t caat ct t t c t t t t t t agct tatgagatgt cctt cat gca gctaaaagaa aaat aagt t t cgcgt t cgat tatcagacac gat caagat a cct agaat ac aacaaaagct agcctcgaaa t ct ccat t t c t ctt ctt ct t aaat ggaat c aggt act t aa tgat t gt t t t agaggagctt ct t ct t gt aa caaagt at ca t t t gt t t cat tggt t agct t caaaacct t a aggt ccat t a tgt t ggtgaa gtgtcgacaa ttataacgaa aaagaaggtt gaat cat ct c t aaat aaat t cgt agctt ct ct t ct t ct cg t t t at cgaat gat t gat t t a ttgt t t t gt g agt gat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 <210> 703 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 703 gttataaatt actatttatt catgctaaac tcatcaaatt ttaaaattct caattaattg at t gat aaaa at at acat aa act caat ct t ttttctgatt cggagggagt atatgttatt gaaaatttta aagccagtgc atatcagtag tat t t ctcgt at t ctt gaaa t t t aat acaa gctt agaat c t caaaatt gg Page 59E aact at t aac tt cat aacct aaaaact t t a acagatt cat t aaat gat at caat agt aat t t t aat at t g aaaaatcaat atcaggattg acgaaggcgg 120 180 240 300 12689250 Sequence Listing.txt tacaaaat t a ggtatactga agatagaaga acacaaaagt agat cggt ca t t caat t t aa cggt gat ct t at ct at t gac tt ctt cagga taagaaacaa ct t t t at t t t ct gt at t at t aaat t t gaaa aat gacat ga at at at act a aaaat gt gt g t ggacaat t g gt gagat ct a cgat gt t t ac act gcgt at t tttttttttt t gt t t ct ct a ct t t t cagcc at aat t ct t g t t ct gact gt tacaggaaga t agaagact t aacaact at a aaaaaat at a taaggacaag caat gaat at t agt t aacaa gaaccccaaa agt gt t t gga tttttttttt cgt cact at t at aggcaaga act t t t t t t c aaat at aat c t at ct t caaa aaat t agaat t acaat gt t t t caacaat gg at t gacaaaa t act aaaat a aaaaaaaaag at cat aacac aaaaaaaaac t t t t t ct t t t t t act t acgt gt gat agaga t t t t at t t ga t t aat t gcca acat t t t gaa cat gaagaaa ct t aat aaac caaagct acc aaat agt t ac ct cacacat g aaaaaaat ag aacaat aacc aaagt gt at a t ggat aaat c cat agat gt g t act cat at g aaat t gact g aact gt ccga t gaagt aat a aaaaaaacac t t cat t t agg gat acgaaac gaaaacaact gaccat gt at t t t t caaat c at t at caact cct agagt t t caat gt caat t caat gggt g gcggcat at a cat gaaacac t agat at aaa t acgat t t t t t at gaaat t c aaaacaact g gt at act t ac aat at t agga t ct at t cat t ttacaaaaac t ct at at aag t agaaaaat g 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 704 1200 DNA Arabidopsis thal i ana <400> 704 aaagcct t aa gt aacat agt caaagt cct g ggcagt ccaa gaat gt aat a aaaagt caca aaat at t agc acacaaat ct cacaagaaat aat caagaga aagccaaaat t ct ct gt aac act caat t at ct t t gt aact gt gaaaaggt gt at t aagca at caat at cc t t at t t ccat at ggaaaggt accaggcaag aagaat gt gc at t gt t agaa tacaggaagg tt cat acaga act t aat aat t at cagct t a t aaaaact gc ct at agct ga gat gt gacat t aaaat aagc ttacagacac gt gaat caca at aacaacat tgacaaaaaa aaaaaaacag cacagt gaac ttttttgggt aaagt ccaag aat aat t aaa tttcagccac t ct ccat t aa aggagtgagt at acacat gt t t acaat t t t aacct t t gt g t act ggct aa tcgaacacac at t t cat t t a at at aat ct t aat aaacaga aagagggcca caat ggat ct aaaaagaaaa t cgt t gcact acgaaccat g caagggaaaa aagt aaat aa aaat gagaat t cat t t t gag at at t t t ct c ct t t t gacac acaacaaggg caaacagaat gt aact ct ac t caat gagt a tgagaagaaa gt cagct gct t aagcaaagt t at ct t ct t a t gt t caaagc gcaagact ag ttcaccaaca t t at t aact a gt gt aagaac t ccact t t t t at gt t ct t aa ct t aat gcat gcagcat t ca at at at ggca t gat at aaaa ttttcaaaga t agt t ggaaa t aat agccac t t caaact aa caaaggtgag at t t aagagt ggaaaacaaa aaacat ct t c cgt gt gacaa gaaat t gggt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 ctccaaacca cagccaatca atattcttta taaatacaaa cacacaaaca gcatctttct Page 596 ctcaaacaca at cat aacaa at caaacgt a tttgcgaaga aacat at ct t agagagagaa aagatgccaa tgaagccgcc 12689250 Sequence Listing.txt ctatcaaaca ccaacagctc tat t ctctac ctcatttct c aaaaactatg gaattgacac tgaattcctc gagttctctt gagttctaga aaccaagaaa gttcctccaa caacatgacc aacatatcaa ttccaagcaa agaactcggt taaggaaatg 1020 1080 1140 1200 <210> <211> 0 <212> r1 <213> 705 1078 DNA Arabidopsis thal i ana <400> 705 aat at at cat at at t t t t at aaat acagt a gacacaaagt aat t t t t aaa t at t t t ct aa t t at at t cct t t gacat gaa gct at aaat t aat t act cat ct cccaaaaa t t at t t aat t cgat at t ggt gcct at aaca taaggccaca ccct t ccgt a gact ct t gcc gact ct cact caat gt ct cc aaat t t t cat t t aat t t t ac acct at t t ca t gt t t acaca t agt at ggt a aaagct aact aaaagt gat t t t acat at aa cct t cct t ac t cgat t aat g t gcagat t ct ct t ct gaacg at ct t gaaac act gact aaa agggaccat a acagat t gct cact ct cat t gagt t cct t g aaat gat gac t cacagaat a at t gt t gt gt at cgt ct t t t t t at act at c aagcat gaat ct ct t t t act gt at t t acat aat gat t t gt gaagt t gt gt acaagcaggg ggct t act t t at at t t t t t t agaaat t gt a t at at t ccaa t at t ctt t cc aaaacct aga t caaat aaat at gaaacaaa gat ct gaagt gcact t act t t t t t aaat ga t at t t t at t g t ccat gcacc aat at acat t gcaaaagttt t agagt t at t t gt gat at gt ctt caagaca t gt cgt t ggt t t t gtggaca gt t gt t at ca cat t ccgaaa t ct at aaaag gaagcaaaac cagat ct t gc t cgcat at t a gt caaggat a t t acaaat gg ct ct at t t at at t aat caat gaat t t cct a t ct caaat gt ct acgt aaaa agat aaaaat gaat agt act acgt acat gc at at gt ggt c cat cact gaa gct agt t ggt ct ct gt t gt t t ggct at gaa at ccgt at aa t at gaaaat a act aat at ga t cat at cct t ccaaccat t a t ggat aaat t gat gaggaat aaagagaaaa t t t cat t t t c agct t agagg acacat gacg at acgagaag t t t cgaat aa gat aat agca t cgat t ccca ct gat t cct t t t agaat aat tttccaaaga gaaaaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 <210> 706 <211> 762 <212> DNA <213> Arabidopsis thaliana <400> 706 cttctacaaa aaaattaaaa ttaaaattta gaagctaata ccaagattgt ataatcaagc atatgttcca ataataaagc aataacttgg tgttttcttt agttggcgtc ttggagattc cacataaagc aactcacgaa acgagaaagc acacaaaggg ttaaatttgc accattcacc gagtaaatcc aagatcttcg acataatata gacaagcaac tctctagata gtataatcag aaatgtttaa agagagaaat ggtttttttt tgttggttgc aaattccaaa atcaagaaac Page 597 120 180 240 300 t at agaaat t t gt ct caaaa at t cct ct ct t t caact t ct t ggcatt at a aaagagggat ctt aat acaa gagagaat at ggaaggtaaa at agctt act ccggacat t a t gt cat ct at agat gaat ga ct t t t act gt gaggggaaaa aaat t accgt 12689250 Sequence tgatatgcgt ccttaatcta t t gaat t gca aaact gcgca gaaagacctt attcagaaca gcattaccac acacagacac cagtcatttg t ctagatat a cctttagttg tagataagga caaaaaacaa aaat at at t a acttattggg ttcggtttga il i ana act aacacaa t gt ccat cac gt t t cggt ca t cagagt gt a t at gggccca t gtt cacaaa aaaacatgaa agaccgagt a gaaaaacaaa aacaaaaat c Li st i ng. t xt t at gcact at att act aagt t aagt t aaga aaagagacat gaaaagt ct a at ct gcaaga acat agaacg tg ct acagat t c cgaat gt aat aaat gt cat t acgt gt ct t a caat at ggt g gcaat gat t c t gt gagt aga 360 420 480 540 600 660 720 762 120 180 240 285 <210> 707 <211> 285 <212> DNA <213> Arabidopsis tha <400> 707 ttt caccgga t t t gatt acc atttcgtagt caacgacat c aatattgggc tttaaagtct gagataacag ccgacgtggc ctcccatttc atatccttaa agagaacaca acttgattct tttgacgacg tcgtttccat t t ct aaaat a caaaact t t a gacattatag ttttgagct c aaat g <210> <211> <212> <213> 708 1200 DNA Arabidopsis thal i ana <400> 708 t t gcat cgag gaggatt cga t t cct ccct t t t t t aaggt a ccatt at at t t t at gat gt c ctgtagggaa ccct agccaa aacaaat at c acact gacgc cctt ccaagt cgct ggagt c at gaacaaca t t t gat t t t c ttttcaaaag ct t t t t cat a t t t ggct t t t caaat cgat a t t t t aaat t t aaat gt at t t cccaact cac gt t t att tcc at t agt t gaa t ccgat aagc agcagt agat cact t gat t a tt gttt aaaa caat ct cttt cgaat ccact aaat t t accg t ct t t gaaag acat at ct t g tt cacat t t t at gccaat ct gaaact aat g aaat acaaca t gagt t ct ag gacagcaaca tt cgt t t t t t cct ccagt ct cat at gt gat at gt at at t t gtt ccgt gga ctt aagcgaa cat t at aaat caaaacaaaa cgct cgat at acact gacgc cgacgaat t a t aaat t t cgt agct at acca caagct acaa t gaaagat t t gt gact acgt gt aaccct gt aact gaaccc att ct gt agt agt act gat t t gt ggagat t aggacccaat att agtt gaa gcct aat act ggtt ct gat c gtgtgcaacg taaccacaga aat at t agaa gt aggact ca t t cacaact c cgcaaat gca t t t gt t t t t c cat at ccccg t t accat t t t acaaaaacaa aat aaaaacg cat ct ggt cc gt t t cat t at aacgaagaga gt t acat t t t 120 180 240 300 360 420 480 540 600 660 720 780 840 cgattattta tttatttatt tttattttga atgaactgtt aattcacaca aaaaaaaaca Page 598 acat gt t t t a ct gcggaat a t cacccat t a caaacat t at t ggatt caac acagat acat caagt at ggt t aat t t aat t t t acaagt aa t gacat t t t a att agaaaaa gcgt aat t ac 12689250 Sequence catagactaa aaacagagct ctcgaaatat gaaagttgct acgaagggac tcgtcatttt ccat cgact a t gt aggt t t a tcgcagagaa act t cactta taacacaaag atcacagcta Li st i ng. t xt ttacaaggac at ggaact gt cagggt at t a gggaccact t t ct gtt cgaa t aat ct cgt a at agcat t t g gtcgtagttc t ct aaact t t at t agt t t ga cgggt t agaa aggcaaaat g 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 709 826 DNA Arabidopsis thal i ana <400> 709 aaaagcagcc at agaaat t g t ggact cat a t gt t t t t ct t aaat t gat g t acactt gt a gcat at gaat gcttttt aat t t aagagat t caaat t t t ct ttttttgcca t t t caacttt t agact ct t a t ct t at gcag t t t at t t at g at aaggt act ttccaacgac t t ccagt gcc t t t t t ct t t t gct t gt gct t t gct gact ct ct cagat gt a ct t gacact t gat cgcaat a ct t at ggat a caat t t gaac cacat ct t aa at aagggcat tttacttaaa cataaaaatt gatttgggat cagaattgag ct t gaaagt t at cgt t t t gg caaaaat aaa t gcagccat t cat ccaat t c gaat t t ct ca t t t aaat gat t ccaaagat c t t agct aat c t ct ct aagga agggaagct g t t cct aat at t gct t cat t g ctctct t t t c gaat ccccat at aaaaagaa caaaat at t g tttgcagcga aaaaggt aag cgt aagt at c agaggt t t t g t ccaaat cca ct aagaat ga attttttttt agtt gt gtt a at at ggt t aa t caat t ct ct gatt ctt gt t t ct t t gt ct a gt ggctt caa ttaaaacaca cct t t t t t t g t gt gaatt gg at t t t ct t ct aat cat t t ga at t gagagat t ct agaacat ttcaaagaaa gaaat g t cat cat at t t gctt ct at c catt gatt ga at t cacct ct caaaaagtt t tt cat at ct t t ccagt at ga t gagct aat a at t t gt t t t c gaaat t aggg caat gt at ct aaagct t ct c 120 180 240 300 360 420 480 540 600 660 720 780 826 <210> 710 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 710 tgaaccaaac attagatttc ttattcatct agttaatttt tggaatcatt ttaactttcg ttatattagg atttattacg ttgtgaatct atctacagaa aatctgctag gacgt t gcaa at at t gtatt caaat t aata ttgaaaaaca atacgtatat ctaagattat ataagtgtca tgcatgacgg tcaatcacat caaatttttt aat cat t t aa t t gat gat at at t t t at at a aaat at aaga gcacat t t t a act t t t aat t aacgagacac Page 59. ggat cacaag t t ggcccact ccgaact t ga gt agt t t t ga gat cat t t at tt aagt gttt at acat at t t at t t ggt gat att gat gggt aat t t at at c t at t t t gaaa at t t t aat t a t t t ct t t t t a t acat t at t a 120 180 240 300 360 420 12689250 Sequence Listing.txt ttgcaaataa ctactaaccg aataaaaatc tattgaattt tctgagattc tctccggctt gt t t t t gct a aaaaccgat a t ct ct aact g at cagcgcgt aaaagt t cat aaat at aaac t t aact gat c caaat gat at caggat caag gt ggcat t ct acacaaacaa t ccacat t gt cgat gat aca aaagaaggtt aat gt cccag gt ct t ccggt tagcaaaaca agt ct caaat cct caaaacc t acgt gaaaa at t ct gt ggc at t at t ggct cgaaagt ct t t t act t gagg aaaccact gc tttagaggaa t t t t t at t aa t cct gt t cga caat t gct t a t cgat gat t t gat at caaat t at t gt agca gaaaagt t ca at gaat ct gt t t t t gcact c tacaagcaaa aaat t gcagc gacgaggaga accgt t t ct t at t ggaaat g ggaat at t t a t ct ct at t aa t ct gat t t aa aagct gaaag gaccaagcca t ct ggat aac gccat ggt gt acgt ct aagc at ct gagt ct aaaaaccaat cagat t t ccg t cagt t t ct t t t t gt at aga ct t gat cgct ct t t aat t t c t at t t t aat a cat caagat a aaaagcat ca ct cat ct t ca t cgt cgacct ggt t agt gca acaat agat t at t t agat t a cat t cagt t g aaaat at aat gt t ct ct gat at t t t ggat a t ct gggcccg t gt t aat gac t ccgt aaaaa at gt at ggca cct ct t aat g 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 711 <211> 540 <212> DNA <213> Arabi dopsi s thai i ana <400> 711 ctcctctttt tggcccccaa gaaag acacagggca acaaaaagga ggcttl agtatttgtt tatatatctg atctct tttctttgag ttcccgtaag ctaaa cgcccaacgc tcactacact tagcgl cctaacgatc ttcaaatgta caggg caaatgttag gcgcgtgacg ttcac( ccagt t at ga gt gggt gact ccaaa ttttcatttc tcttcttcca acatti cgaaa t t ct ct ca t gag gt aa ccaa ct gt g agat t ct c ccaaat ccca t t gt agaat t cact caat t t aaact t ggca agcacgcgcc t at act cggc aat cagct ga ccaact ggca t ccgccgt t t ccaaaaaggt gct gct ccct cgt gcaact t t t t ggct aaa aaccaggatt ccat aaact c t agaaat cat acgcat t at t cgccaccgt a t t ct t t t t t g t at aggagt g gcagcagagg agaaact caa gact caaagg t aaaact at t t at ccaat at gacacgt gt c cgt gacaat g 120 180 240 300 360 420 480 540 <210> 712 <211> 1200 <212> DNA <213> Arabi dopsi s tha i ana <400> 712 gcgatgagat aaaaccatca tccagagacc caaaggagtg agagcctcat gaaggagaga acgagatcgg gatcttcagg tggaatttca gaagtctgag gcgaaaattt ggatagaaac gct t ct gcct t ggat ggat c ggccagatct gact ct gt t c ttgaaaaaga caaaagacaa acagcaaaaa cctaaaaaga aaaagtttta gactttaaat atctagatct gaaaattcag attcacaacc gttggaaaaa aat ct ct cag at gt gt ct aa agaaaaaaga t agaat aaga Page 600 120 180 240 300 12689250 Sequence Listing.txt gcaaaacaaa gcaggcggcg acgagat gt g t gt aat acct aacaaaat at aat t aat t ca t aat t t t t at gaat caaaac ttcgccaaag t aagat agt a tccccccaca caaat t at t a act gcct caa t act t t t ct C t aat t ccccc <210> 713 agact t cggt gct gaagt t t agct t gt aaa caaat t t t at t at t ct agt c acggt ccaac gt ggct gt ct t caaat t aca t t cct t t t gc gcaaaaaagc t gcgat t t gc at caaact t a aaacct ct at t ct t aacggt t caaat gcca t ccggt gct g t agagat aag t act t t t t gt t t t t aat gat at agcaagt g agt ggct t gg t ct t gt acca tttacaagaa aact t t caaa acgact at t c cacgt gt ccc aaaaacgt aa aaat t agt cc t t ct t t at t a ct gt ct t ct c agaaagccac ggagagt aaa aat at aagt a at t t acaagt agaat t t t t g acct aacact agt acgt aga aaacgat cac aagt gcaaaa cat aaaact g cact caact a t cgt gt gct g t ct t caccga t t ct t cccat act aaact cc cggaaccggt aaccctaaac at t t t aagag act at aagt a ttaacaaaac aaaagggaat aat gacat t t caat agt t t g agat acagt a acaagaccac t ccacgt ccc cacacaaagc at gat gat ga at cct t gt aa t ct acccct c aaaacccacc t gagaaaat g at at t t t t t t aaaaacaaaa aaat gt gt gt gt t ct t ct t g at t ct at t t g at t cgt agt a t t t t t cgt aa tccagaaaac cat t t t aaat t gacact gt c at cat t act g t t ct t cct t c ggaaaaaat g 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabi dopsi s t hal i ana <400> 713 aaagcct t aa gt aacat agt caaagt cct g ggcagt ccaa gaat gt aat a aaaagt caca aaat at t agc acacaaat ct cacaagaaat aat caagaga aagccaaaat t ct ct gt aac act caat t at ct t t gt aact gt gaaaaggt gt at t aagca at caat at cc t t at t t ccat at ggaaaggt accaggcaag aagaat gt gc at t gt t agaa tacaggaagg t t cat acaga act t aat aat t at cagct t a t aaaaact gc ct at agct ga gat gt gacat t aaaat aagc ttacagacac gt gaat caca at aacaacat tgacaaaaaa aaaaaaacag cacagt gaac t t t t t t gggt aaagt ccaag aat aat t aaa tttcagccac t ct ccat t aa aggagt gagt at acacat gt t t acaat t t t aacct t t gt g t act ggct aa tcgaacacac at t t cat t t a at at aat ct t aat aaacaga aagagggcca caat ggat ct aaaaagaaaa t cgt t gcact acgaaccat g caagggaaaa aagt aaat aa aaat gagaat t cat t t t gag at at t t t ct c ct t t t gacac acaacaaggg caaacagaat gt aact ct ac t caat gagt a tgagaagaaa gt cagct gct t aagcaaagt t at ct t ct t a t gt t caaagc gcaagact ag ttcaccaaca t t at t aact a gt gt aagaac t ccact t t t t at gt t ct t aa ct t aat gcat gcagcat t ca at at at ggca t gat at aaaa ttttcaaaga t agt t ggaaa t aat agccac t t caaact aa caaaggt gag at t t aagagt ggaaaacaaa aaacat ct t c cgt gt gacaa gaaat t gggt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 601 ctccaaacca ctcaaacaca at cat aacaa at caaacgt a tttgcgaaga cagccaatca aacat at ct t agagagagaa aagatgccaa tgaagccgcc 12689250 Sequence at at t ct t t a t aaat acaaa ctatcaaaca ccaacagct c aaaaactatg gaat t gacac gagttctaga aaccaagaaa aacatatcaa t t ccaagcaa Li st i ng. txt cacacaaaca t att ct ct ac t gaat t cct c gt t cct ccaa agaactcggt gcat ct t t ct ct cat t t ct c gagt t ct ct t caacat gacc t aaggaaat g 960 1020 1080 1140 1200 <210> <211> <212> <213> 714 561 DNA Arabidopsis thal i ana <400> 714 t cccaat aga aaggt aat cg agaagtt ct t aaaaacgacc t ct aat t gaa acct aat aat gagacgaaaa ggt cgggt t t ct ct ct ct ct gt agt gagag at agaagaga aagat t ct t c aagcccaatt t cct aaaaaa agacgcaaaa at t cgt ct gg t gaaat t t ga gggt att cca ct cagct gcc cgt gagaaat gaaagatttt t gat cct ggc tgacgggacg gagaacagag gt aat act at gt cggat cat at gaat gggc taaaaagcaa ttcgcgccgg caaggcaaca t ccgct ct at gcgt t aat t a agtttttttt aaggaaaaaa tt gat t t ggt cct at ccgag ct ct cacacc cgaagt t t gt gct gaact ca t agaat cgat cggcgagcaa gaggatt gag aaaaaagagc t gcct act aa cat aaaat cc aat ccgtt gc ct ct gt gaaa aaact gacaa gat t t gaggg agt at t at cc gt at gaggt a at gt t t at gg t t at ccgaat gagt aat t cg t caat cct ct acagagagt g 120 180 240 300 360 420 480 540 561 <210> 715 <211> 759 <212> DNA <213> Arabidopsis thaliana <400> 715 ggccactgtg ttcctgcagc attagttgtc tgcattcacc atcctattct ctacaacggc ctttgttggt tttcccattt aaaccccctc aaatgcccaa gtgaggggtt gatcgatccc ttacccttgg aactcttgat aattcatcta tggaaccctt gatactccat aagcataat a atggttgcac cacctctgtt gtcatctcat tcccaagtgg gaggtatatg aacggcccaa gcttcaatgt ccaatatatc taagtttggt ggtat t atag aaaatactct gctcgt t ct c ctggtatcct tggctagcat ttggtagtgt taactgcatt cctagaaaat attaccctaa ccat ccat gt t acacaaact aaggt gt t gg gggact t ggg caact ctt gt cat t t ggct t gaatt caggt tt at gacaaa ggt gactt at ct ccacat at cgaagcaat c aggcct t t gt cct gcat agc at caaaggaa gaacagtttt att cggt gt a gt accat at a cgaaaggaac t cggacct gg aggaaagaaa gtt ggtt gaa ccaagacttt att cct cggg ct t ggcat cc agt t t act ga t cacct t t aa gt t t act t at ccat at at aa t aaat at cct atgtgggcca t ct ct ct ct c at t at at t t g gtgtgagaga ggagat aat g ct ct cct t t c t cgt cct ct t 120 180 240 300 360 420 480 540 600 660 720 Page 602 12689250 Sequence Listing.txt cagcaggaac tcatctgtct tggaagaatt attaccatg <210> <211> <212> <213> 716 1200 DNA Arabidopsis thal i ana <400> 716 gcat aaat ca agcacaacgg t t t cgat aag t t t gt cat gt gt aat t t caa t ct t aagagc t t aagaggt t t gt gacaaca gat t t t t aat t t t at aaaat cact t aaaca at t t aaaat t t ct t t aat t t t cggat t t gg cacct ct t gc aact gaaaat aat ccagct a agt at t at t a agcaaat gca t gt cgt ct ct t caaat t cct t t at ct ggt t aaaaaaaaac caaagt t t ct gaat at gaat at ccacat t g t t t gagt agt t cat t t at t t at at at at ca aaaat at aaa ttagaccaaa ttaaaaaacc aat t t gt gaa gat gt cacat ct ct t ggt ag agt t t at t ag aaact at acc tagaaaccga agccaat cca ct at ct ct ct gat acagat t t t gt acct cg at gt t ccaaa t aaacact t g gaaaaat t ga gt t aat t t t t t aagaaact c at t t t t ct t t t at t t gt t t a t aat t t t t at aaaaat t aat ct t t t aaggg t t aacat t ga t t ggtacccg agt t t t ggat t at aat at at cgat cagaat t at aaagt t t at gt caaaac ct ccct cact t t gt gt t gcc agt t t t at t a t aat t cat gt t t cgt ggaat t ggt at ct t t gat ggat gt t t aacgt gt ca t cgcgt t t t t aaat t t t aat t t t aat aat a ggt ccaacaa ct caaccat t ct aat t t t aa gcaagt t cga gaggct t gaa ct t ct caaca t agaaaat t t cct t t t t ct t accagct gac t gt ct ct ct c aaaat at ggg t at t ct t t ag t gct acgt t t at gcaaact t gat cacct cg t t t aaat at a t t gt gt gt gt ctctcttttt at t t gt at t a at at at t t t t aaacact t t a ggggt t gct c aat cact t ga aaat gact t t act t t t gt at aacat t gcaa gagcccaagc aat aat at t t gt ggccaaac t ct t ct ct t c gt at gat cca ccgt ggaat a gt t ggcgcct t t at ggat t c aat ggt ggt t aaact t t at t ggt t gaacat t agt ccat at t t t at t aagt ttaaaagacc at t t t t t at a tt act ccaag aat t gt t t ca t aaact t t ga tgcagcccat at at t t t gaa ccggcccgt a at t ccgt ggc gcacat caac gacact gat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 717 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 717 tatgacatag actctgtccc acacttgggc gcaatgtagt agattttaaa ataattgacc gaaatgtggg agattttgaa aaattcccct aaataaacaa ttgaaggctt t tacatcttg atttgcgagg gttactaact ttatcaattt tgtaaggaca cattatgtca cattttctta at t t t t gtcc aaat gt agca ttttttcaaa gt cggat t ga gt aact gaaa t t aat at t at Page at aat t ccac gat t t t gaat ccat gat gt t t caaat t gt t act agt caac at t cct aaca gcgagt aat t gt t acat t ct ct t at aaagg t t t t t t t ct t aat gaat aag t ggcat ct t a 120 180 240 300 360 12689250 Sequence Listing.txt aaatgattaa tattttcatt catattagag ataaacttat actttggata gt t t gaat t t gaaaggat aa t at caat at t act t t ct caa t t at aaaagc t t t acgacat at t ct ct gcg cat t t gct t t cagt at agaa cat t gact t t tcgaggaaaa aaat t at t gt agaat ccat a <210> 718 act t cagct c gt t t gtgttt t aat agaat t aat t t cact t t gt t at at aa at gat acct c t agaat cct g cat t aat ct g gcgt t aat aa gt t t gt t t gt cgaaaggaaa t aacacgt at aaacgggaaa cgaaaccat g gt ct t t t aaa t t t ggt at aa actttttttt acgat t ct ac at t ct t cttt gcgaacggtt caaat ct gct at cct t ggt c tttttttttt tgt t t gggaa ttacat t t t c t catt t t t ga t aaaacaat a ct ct t t agaa aaat agggt t t gt cgt ct t a t acggacaat t caat gct ct at aat t gt ac aat aaaccgc aaccaat at g t t t t t aat ac aagaact t t g aacgaacggc aagt ct t caa t aaaact caa t t aaact gt t gt t t t t gcaa gat t t at t at t ct t t ct gac tgatggagag at cgt t ggaa agt at ct ggg t at gt caagg t t t aaaact t t acacgt t ag ct gt at at aa gcagacgct c gt t t at at gt t aat ct t ct t t acaat aat a acaagaaaga tt att gcaaa cct t at t aga ct t caat gat at t t cat gca at t cgaat ct gagct t ccac ccaagaaaca t t gaaagat a t t t t ctgtat t agaat aat g 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 718 cact cact gt acat at gt ca accggaggat agt t at gct c aat t at t t t g at caacact t at agct cct c tgt t t t t gt c aagagat ct a ttttcaagcc t at t t aat ac at at gt at gt at at at acct t ct t gtcgag t cct aaacag t t t cgaaat a t gt t agat t c aaacaaat gt at ggt t cgt a t ct agggt ga caaagt t gga gt acgt agca t gaat aggct aggt t t aat a agat ct t t aa gaaaat agt a aaagt aat t a t gaat gt aaa act gcat at a tccaaacagg ct t ggagat c aaat t cat ga t cgat aacgt gct t ggt gga t t agat at ga gat t t t ct ag cat gat caat t at gt acaaa cat at t t at c cacaaaaatt agt cat gaat t ggat caaac t at at at t ct agaaat aaaa t act t aacgg t ct agggat a gat at caat c acagt cgaaa gggaat aaat aacaaaat aa t cgaat ct gt act ggccaga gcaacgtggt t gat act ccc cgt cgat t ca cagcat t agt at aat cat ga acct aaaact t cacat t aaa t gt cat gt cg t t aat t at ca t t t caacct t gaaccaaaga gaaaat t t at gt aggt gct t aaat t agt gt ct aat t ct at aaat t aat gt t ggagat gt a aaagt aacgt ttagcccaag t t t caat t gg act aat t t aa t t t t cat aaa at caaaaaat t cgaat gat g at t ggaccat acat ct t agc t t t aaaccat ct gat gaccg aat t ggt cat gt t t gagggg gcgggaat t c ggt agcaact t cat aat t gt cccaggcact aaaaaaat at ggt gt gt at g ct t ggt gaag taaaaaagag aaat aat t at t cgt gt aaat at at acat at ct aaacccga gaacaacagt gcacaagaaa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 tctgatagga ccctaatcgt gtttaaccgg aaaaccccgc tctattttta tttttttgag Page 604 12689250 Sequence caagtaagat atataaccgg ttctatttat atcctaattg ctgcctatat atagtaacat atgcatacca atttcttctc ccaaagaaaa gaaaaaaaaa ccctcctacg gtatatttga Li st i ng. txt aaacccct t c gtcaaggaat accccaatac aaactataga cgaccat t at cctcataat g 1080 1140 1200 <210> <211> <212> <213> 719 1200 DNA Arabidopsis thal i ana <400> 719 gaaat at at c cct t t at aat gct t aat t gg gt gt t t gacg gaat t t agt t gcat t at at t aat gat at t a aat gat at t a ct aggt cct t aat t gct t t a t gaaaccct t aaaccgggt a aacggt t aca acgaact at t t aaaact t t t t gcaaat gaa aacat gt gt a t t t t t ct t t t t aat t gaaac ct t ct cact c t aggcacat t t gt cacccca t cgt acat gt ggaccggagg at ct t ct ct a agcaacact a t aat t t t gt a t acaat at gt ct aat agggt t t at gt t t aa tttttgtcaa ct cgat gccg accgt agat a t t aact t gt t t ggt t accaa t at at cct ca ggggt gaat a t t t ct t t ct a ccct t t gt ca caat acaact gcat t gt gt t ct gt t gt tag caaaacgaat at t cgt act a gct t ggat t t gt t acgt aaa t gcat gcaac acaat t t aca cacat at t t a ccaaaat t ca caat at t at a aaaagt cccg agat t aaat a gt agt t gt t a aaaagt at t t aggt t at caa aat t agccga agagcaact a aggaacct gt aaaaaaat ct gct cgt ct ct at t ct agat a gt gct t ggt a gat at gat cg tgt t agaccg gt t t t gt gct gtagtgggag aat gat at t c t cggt ct at t gcat ct aat a t t aat ct t cg gaat gat aca caaaacat ga acccgaaaag aat acaaagt t t t ggaccat aaaaccggct agat at at at ct at at at ac act acggt gt ggctaggcgt acgt gggaat gaaacaaaat aat at t t ct a gccagccagt cagat ct ggg at gt at cagc ccaaagt aac cat t t accca t agct t cgaa aat gcgat t a t aaat gagat at t t gaact t agt t t ggat a aat t t t t at g acggat t cct ct acat t aaa aaccggt t ca agacat at gc at t at acgac acat at ct cg acat gt aggt aaaagct agt t t t ct at gt g t cat gt ggt a acat gcaaca t gt at caact gccccat gca aggaaagat a gt cat gaat t caaccgaaag agat t aaat a aacat aggaa t t ct t acat t t t gaat gat g gt caagacca t t aaact t gt at t t at accc t tact act t t gaccat cat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 720 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 720 tttaattttt atttttgtag tatgt cataccaaag at t gcaaaca tgt at taggaggtgc atagagtgat tagtt taacaattat aaactacatt cattt cctta ccaaaaagag ggtca tgttggtttt ataag ctgtccaata aattt aacaattatt Page 605 at t t t gct ca aat t t aat t t tt aat t ggat t gggccaac aaagaccat c ggaat t agt t ct t aaat t ct aaaat gat t a 120 180 240 12689250 Sequence Listing.txt ttttccataa atttaatatt tgtaggacag tttttctttc tttcttgttt act cgagat a cgagagaaga t caat at at g agt t ct t gt c aat acaagt t aat caat t cg ggat ct ccat gacggt t t ca t cacgt act t t at t t t at t t ct ct ct t t cg ct at t acat g gat caat ggc gt t at at aaa aat acacgaa at ct aat t t g t aat acaaaa t gt act agt c at at caaaaa ct t gact t t c aaat aaaagt cacat at at t gt t t cgct t t gat t agt t at aacaccacaa act aaaat t t t t cact t ggt aat t t ct t gt t accccat ca ggaaggaaaa t aggt acgt g caaaaact cg gt at gat t t c ccct t t aaca aaccat t t aa aagaat aacc t t at t gt t t g gt t aaaaaat gt gt gt t aat at aat t ct ac t t ggt ct acg aat ct cat t t aaat t t gcaa at t gct ct cc gaaaagt aaa gaccct t t t a act t ggt cct t ct at cat t a aaaaaaaagt t gaaacat cc tttacaggag gt t at gact g t at gat t aac ct ccgat t at t t aaat at t a agt cat acac aat gcat aat cacgct cat g aacat gcat t aagaaaagct at t t cgt t ct at aacacat c gt t t t agttt t ct t gt cat a caact t gt ac aat aat acgc t t aaagt aca aagt ggaaca acact aact t aacaaaagct gcgt at act t cat gcaagct ggt aat t t cg gt ct gaaacc tt ct gcaaaa t ggt at caag cgt agacaac aat t aat gat cct aaacgt a tcaaacaaca at at aat agt aat at t aaac t at at gt aat agact t gt aa t gact cgt gt t t gact cat a t t caaat at a gt acgt t aac t ct t ggt t t t aaacaaagaa t ccaaacat g 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 721 <211> 274 <212> DNA <213> Arabidopsis thaliana <400> 721 ttgcagaaag agaaatgtag agagal agaacgtgca cgaacatatg ctctt1 acgtggcatg tgactttttc tctga! caactccacg agattaacca acggt ttattatttt cagctttctc tggaa! tagaa gcggcttgag tagat tcttttccca gat ca t ggt t acat c atgag aaaatcgatt gcaac aatg ctttgagctt accctaaaca t t t t gccct a aagtt aact g atctgacacg tgtaataacc t cgat t t t aa at t gggat t a 120 180 240 274 v <210> 722 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 722 tgtatcgttt aatttaaata attttattta at t t t t agt a t t aaaat at a t t t t t t ct aa attcataaac aacgttcact ttaaaccaat tctaagaatt tgatgatgaa gcttcgtacg aatatcaatt taagataata acctactcaa tcatactttg agtttggatt acaaaaaaaa t ct aat cat t caacat t ct c ttaacagaag t at t at gcac at cat t aat t at cat t t t t t Page cact aat aca aat at t t cat t t agt ct t cg ct t cggagat aat ccacgat t t aat at caa at aagaaat t cat acacat a at t ct agaaa t t at t at at a t at t at at ac tttttttaaa 120 180 240 300 360 12689250 Sequence Listing.txt tatgtaaaaa taaataatat gtaaaat t aa ataataaata tatcat t aac ctataaatta gaagt t gaag gt gt caat ca aaaat ct aga aaaaagcaat t t t cct t gga t agaaact t g tttttagaaa t t t ct agacc aaaacccat t at t t gct t t c at gagt cagg accccat cat t ct cat cgt c agct t t t at t tatggagaga ttctctattt tttggaagaa aaaat aaaga gagaaat cac t ct t t gt aaa tagaggaccg ccat gct t t g ct ccat at aa agact t at t a cact t t cacg t t cact cat c aat t at t t aa gt t ggccaac t t gtagggaa ttaaaaccac t t t t cat aaa t agaaaact a aact acct t g ct t aaaact a ct ct caat gg t act gaggac gagt ccaacc ct ct acat ct t ct aacgt t t t agacacat g t t t cat at at acaatttttt tttgaaaacc aacct caaaa gt t t cccaaa gt ccat t ggt gt aaaacat t at at at aacc at cgaat t gt at t t gcat t c acgct t acaa ct t t ct at ct gcaaat act a at gat t t ggc t cat aat ct t acaact at t c ccat ggt t t t at t t t aaat a t aaccaagt a gaat gccaat at gat t cat t cat gt cat t g aat t at t gt t aaacact ct t ct aat ccaag aagt gat gat aaaagat ct c caaagggttt aaaaccat aa tagt t t cttt aaaaaat cat t t aaat aat t tgtgtgaaag t gaacat gt g t cacat ccat agtagcagag ct t at ct t ct caacaat at g 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 723 1200 DNA Arabidopsis thal i ana <400> 723 gt agt at aag t t ct t at t at aaaat t aact gtt gagccca act acaact t ct t aacgcag t t cat aat at act agagagt aat cagt t t g agagagt t t g tttcccaaaa aaaat at gaa t gat gaagca ct t gt aact g cccaaagt ct at gggt t gt g t gt cat t t ct t gaat t t gaa at aat gagt g ggt at t agaa aat t t cggt c t t t agt gcct gaaaacaaaa ttttgttgga tgaggaacag t gact ct aaa gt caacgaca gt gaaggct a ttcgt t t gt t ccgcggct t g gt gact at aa ctt aaacaac t caaaat t t a t at aaacggg acaaaaggaa aagct t cgat gacaagt aag gggtgcagag acagagat gt agagat gaaa at t ct t at ac at ct t t agag gaaact gaag act t ct t t gg ct gaat aat c gagt caacaa at at at t t t t gt t at t t gat agct cct t aa t t caagaat g t t t t ct t agt t gt aagat ac aat t aat ggc gat aagt aag t gat gt aagt acagt t gt aa t caacct t cc caact agggt t aaggaagt a acaaacct at gt caaagt aa at t t gggaaa cagt gat cca aagt t gggaa agat t ct t gt t aaat t act t t ct t gt ct ag t t at ct gct g t acggt t t t c aggcaagttt agt t cagaag act aagagt g t t gggaagt t aaacggtgt t t aaat at cca at t t at aat t at gat t ct t t agcccacgt t cat at cacca ccact agcct t gt agcaaag ccat t t t t gt aat t t t at t t at t ct t acac tgcagaagca cagt t ct t t g gaaaagt act t t gat t ggt t act gt t t t aa gt ct t at t ga act ccaat ag gt at t t t ct c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 gttcatcttt ttccggaaac agattcatga aaacttaaaa Page 607 t ggt t t t cgc t caat acaag tacaaaaacc accaat cat t t t t cacat gt tt gat gtt ac aaaat t t gac aaaagtaaga 12689250 Sequence Listing.txt ttattctaac atttttatag ttgattcatt ttagttaaca tcgtctagtt gaccttgact tgcaagatat tgtattattt tggcttggct ggtttcacat tcattcatta gtactcataa aaaaaaaaac agagaagaaa ggtgaagaaa ttggataatg 1020 1080 1140 1200 <210> <211> <212> (7 <213> 724 895 DNA Arabidopsis thal i ana <400> 724 aat t aat gat t t t t gt t at t at t aat gggt gat t t at ggt gagt t aat t t acat t aacgt t agt gat at t ct t caagaaa gaaagat gaa aagt t agaca t t t aggcgt g ct t ct agat g gt gt t cct ca aagact t t t c t aaagatt ag aact t cat aa gaaaaat t t g gcaaagt gaa ttagggaaaa gt t t cat gat at ggt t ggt t ct ggt t ct at t gaaat aaca agacat gt t c t aaat at t ga aggat aaat g t t ct t ggat g caact ct aat ct t t t ccttt t t aaat aaaa ct ct gt gt t c aaacaaagaa at gt t gaaag t aaat t gcaa gt gt ggt at t agt at t ggt t t agt ct at cc aat at t t aac aaagaat at t ct t t t gagga at gaacacat at acacat ga t agaaaagt a at gcct t t at aat aaaaaag t aaaat aat g t gaat gt agc ct acct at t a agat ggt aaa gat t acat ga t at t gt gt at aat t t t gt at at at ct ct cc t at caaaaaa t t ct t aat at t t gaaagt ga at gat gt aca t gct t aat t a aat cat gcac agggaaagag act at gt gca t gcat t ggt a t aact at aat t ct aagt t cg t gt t t gat aa t gt t ggt t ag t agcat t ggt aaaact aat a aaaaaaaaga tttgaacaca cggcaat gat t acat t gaac t t ggagt aaa aat t t acat g at t gat t at t gct t gt at cg gt ggaaat ca aacaaat at t at aggat gat gaagt t aaga t at t t cagt t aact caat t t cat cggaagt gagat ggt ca t gct gaaat g gaacaact ac t t gt aat t aa t ggcct aaaa agagt at at a caat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 <210> 725 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 725 cggataaaga gaaagaaaat ataggattta aacatcgtca aatttact t c ttttggtgt a ttatgtaaga aattgtttcc aaaaat t act tcaatgaatt tataaggaaa aaaataaaat aaagttgttt caagcgtaat tgttttttgg taataaactt gcgatgaact aaaatttctg atctataagt tttagctata tccgcttaaa atagttgggc tacaacagtg aaaccgtaat aaaat ct t at t at t t ct cat gt at act gac aaaaat at aa ct agagaat g gt at t cct ac ccccgcct ca taggaaagaa Page agcgagtaag t at at ggcgt t t t gt aat ct agt at gat gt aat at acagc aat caat gaa act t gt ct c at gat aaaaa act t at t t ga at at at ct gt t gt t t t gat a acat gt aaaa aacagt aaac t cact aat t t t ggt ct gggt cccaat ccag 120 180 240 300 360 420 480 12689250 Sequence Listing.txt aagcttactg caagataaag agaaagatca tgaagaggta ggagtgattc cagggt cacg acgact t cag t aaagacat g at t t aaggca gagaccacca ct t gt cact t ct t caaagt c cat gt acaat acagaagat a t gact cgaga at t act caaa t t gt cact t t agt ct t ct aa gt aagat t cc aagat aggt t ct t t cact t g t ct cccagga t t ct aat gag t t cat at gaa agaacggcct cat t aacgac t t ct t ct gcc ct cccagaaa t gggt t agt a at at at at ga ttaaggcaga agt cgagaca aaaaaaaat a t t agt aact g aact ct gt gt t t gct t at t t t t t t aaaact act t ggt aag aat acaaat t t aact cgggt aaact ct gt g agacaagaac gt aacgacat caaat t t aga gggt cat ct t gt ggt ggat t at aggagacc aagggacgaa t ct t t t t ct c t agact aact cat ct t t t aa t gt ggt ggat gacct t t ggc t t agaat t t g ccaact at at t t aat cgccg gcat ccaaga accact cct c cct t aagcaa t gcccct t ct at at aacaaa at at aaggag t ct ct ggct t t gct t t t t t c t t at t t at ag cat t act cat taggagacga gct t t caaga cagt t t t aag t cgat aacca aagct ct t gc gcat gcaat g 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 726 <211> 1080 <212> DNA <213> Arabi dopsi s tha i ana <400> 726 ggaat at gaa t at gt cccac gt aagt gaga tcataaattg ttgttcatcc atcacttgca cagct t aacg ggt gcct t gg cct acagggt ttggtaaggt cattgaagat aaacatttgg tgcaaatatg caatgcactg caattattag cgaaaccata gt at gt aaaa at at aat cac at aacaat t g t gt t t t aaga t acaaacct a t t t t ggt at a t at ccaaat c at t t t t cagg aaagaaggaa ggccacaaga tgtcaagatt gatgttaaag aaaattaatg tctcctccaa ccgacgaaat tgttgtgcaa cattgtccac cacgaaaaag aaaacat t aa t agt ct t t t c caaaat aaaa gt gaaaaaga cagagaagaa aaatgaggac ctacaaggca ttgatgtaag aaatcctctt aaaatctcat aaagcccttc acacagaaac aaaactgctt t t gt ct ct ca ttgaacaagg gaaagaaaca aaaggctgat cagat t ct t a ct t ct cagga aaaccctaaa ggt t ct cct t gt ct ct t gga at t ggcgt t t gt aagcccaa t t gt t t ggat gt t caaaagc ct t t gt gt t a gaat t aaat a agt ccacaca t gt gt aat ac aagat t t t ac agt t t ccat a gagaaacaac aagcaaggat t t at ct t ct t agt cat t t gt aact cct gt t gaaaat t t ct ct gaacaat t t t t t agggt t t ggt acgt ag agaaacagag t t gat at aga t gt t aat t t g t gat at t t gt ct gact t t at aat gccaaca agaccgcaaa ggct acact g agat cgct t t gagagccgca gt ggt t cct t ct ct t ct cct ct t at aaaaa t t gt agat t c gt t t t t t t t t cct t t ct cat t t agct aaca aagaaacctt t t ccgt gcat gact gagt ct t t at aat ct t t cact at t gg t ct ct aaaag aact t gt at t aacat t t cag t at acagact acaat aat at gaagt t gaag t t gat t ccca caact t gaat ggt t ct gt t c ct ct gat at t gagaaat at g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 609 <210> <211> <212> <213> 12689250 Sequence Listing.txt 727 1200 DNA Arabidopsis thal i ana <400> 727 t aagcat gat t gaagact t t agaacaaaag ct t t gcgcgt t at t t t gt t a acaaat ggt t t aggcaat ct at t gggat gt t t ct t t t t t t tt cat acaac agaaggt caa gaat acat gt at gt t t gt at t t gcaaagt t cggat t gt ct aggtaaagga aaaaaagat a ct t t ct cct t ccaaaaat ac ct t ct cct t c t agt t acgaa tgct t t t gt t agaagct ct c t t t t t gt t at at cgaagaat taaggaaagc ct t acaact t at gagaaaag t t t t gcat ag t aat cacat g aacgat at ac t aaaat t at a at acaat aag t t t gt aaacc t cacct aaat t aat t aat t t ggct t gagag ccacacaaac t ct t at aaca caat t ct cac at aat gt cac t gct t ct gt a caaaagct gg aaccactttt gt cacat aaa at ct at caat t cgt aacaat t aaacaaagt aat caact t g cat t t cat aa aaaaat agt t t aaat cat t c tt gt t t t t ct caaaat accg t act t at at t gat aat t at a ct gt gaat ga at aacacaaa t t gt at t at t cccat t gcca ct cat aat aa gaat ct ct t a cat t at cct t at at aaacat t ccgagt t gt at t aat aagt at t aat ggaa gaggccacaa at t aat acgt t aat at agaa caat cgaaac t gaagt aact aaaaccgaaa aat aaat t ac at aaat act a at t t t t gaaa gct gt ct ct c acct cct t ct at acaat ct a ct t cact cag at t at t agat aaaacgaatt ttct t ccttt gcaaact t at t t t t gctat t cgt acat aat t at cat at gt gacct t t cgg cct aat t tag aaccccatt c caaaat aaag at aat act ag t cat gt caaa accgaaacca caaagagatt gaacaaattt t t t cat t t at ct t cct cat a cat t t cccat at gt ct t gt a gtct t t attt t caacat caa acct cct aat t t t t t at t ag t aaaat at ct acat t aaat t gct gt agat g t gaagacat c gt gaaat at a ctt agacaaa caaaat t t at gt t aat act t ccgaaccaaa aaccgt aaac at at gggaat tgagaaaaaa at t ccaacct t ct ccat at c t t t accct cg cact gacat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 728 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 728 atagggtgcg tcaaaacttt tctttcaaat ctacaaaaga atgtttatgg aggagagaaa actaagataa ttgtgaagat gttagttaaa aacaacatta aaccaaggcc actagtagag ccaccaattt tttaaatggc tggtgacatt aat t cctttt gtggctct t a ttttttcccc cttttaaaaa tagtcacacg tcctaaatct gctgccgtca aaaaaaggaa gaaaaaaaat t t t acaaat g gaat aaact t t gaaat t gag t t t t gagagt tggcccaccc ct t t t t t at t t agcacat t a cagat at agg Page 61C ccaaagt gt c t gggat gaaa aagatgggca t gct t t caac acct ct t t gt t at t t t agt t t gt t gcct aa at ccaagaaa at aaaacaag agaaaaaaaa aggaaat aaa t act t ct cga ct t t t t t t t t t at at t t t t a aact at t t ca t cat at aaag 120 180 240 300 360 420 480 12689250 Sequence Listing.txt at t gt gacaa gggagacaca at at cat t t t t aact caaaa t act acacac caat t aaaga t t gcct t gt c t ct ct t ct gg t t at aat ct t cgt aat cct a t ccact t ct c t gagat t cat aaaaaacttt t tcaccaaaa t caat gacct tct t t t t gt c ccat at ccac at t t ct t gt t aaaaaagtaa acagcaaagt tctctcaagc aaat aat t t c accct t aat c at ct ct t t ca cccagat ggc t gat aat ggt t at aaaagat aggt act act t t gat t ggag at gct t at t t t t ct cct cca t t ccct t t ct t act t cct cc at cgcaat ct caaaggaaac at t gagt ct t ct ct gat at a t t ggt ct t cc t t aat cat aa aaat t t act c gat act ct cg t cgt agaaaa t ct t ct t ct t t t aat t aat c gagt t ct ggc aagacaacga atgtgt t t ca t cgt ctt cac at ggact t t a caat at t t t c t t t t cgt t ag aaagt agt aa gcgcgtggat agat t t t aaa ct agtt cct t agaggt t t ct t aaagat t t t tttttagtcg aacaat ct t a t cct aaat ca tcaaaacaca t t aacagat t ttggtcgt cg ggagcgattt gataccaacg act t at t at t gtct t gaagg 540 600 660 720 780 840 900 960 1020 1080 1140 1200 tgttccgaat agagtgagaa aaagacttat aagattgaaa aagcaagatg <210> <211> <212> <213> 729 1200 DNA Arabidopsis thal i ana <400> 729 gcaacgaaag t t t act t aac t t gcaat t t t cacacat aca aagact at t t cat aat gt ca cacacaaggg cggct accat acat t act ct ttagccagac ttgggacagg t ct caggt ct t t ggagt ccc gat gat t t ac t cgagaaat a t caaat ccat t t t t act t ct t ggacct t t t t t ct t aagat act aat acaa caact gt at t t gt ct t gt gt gt ggat ggat aagt t t cct a tccaaggaga ct t ccaccat act accaaat cact gt t t ca t aaact tt ag aat ccaaaca t aat agcat a t ccgt gt t at at at cgt at a agct t t t gca t t t caaaat t at gggt gt aa t t t gt t cgt g t aat gt at t a at at gt t t ca ct t cat t aaa at at at at at aaattt cagt cat gt gcaag ggcaacaat g caaaagt at t caact aaaca t aaaacat t c ct ct t t gt ga at ct cct t cc cgt at t at ca gt t gcaact t ccgct cagct agt gaat at a t at t t gt gga t t ct ct t act cct gact ct t aagt ct t at g tt ct gaaaag at at at at at t gt at gaacg t aat aaat gc cgat at t t gg ggt ct caagt cgt ct t cacc at gaaaat ag gt ct at gaca agat t gat t c at cat t t aga t gt t gcat ga t t t ccgt aca at at ggaaaa t at ggaat t t tgccagcaca t at gat t t gt gagt aaat t g t t aaat acat at at at at t t gt aact t act aacacacat a caagat t t ga acacct aaat caaaaaat cg t gagt at act agggt at at a at ccacct ct t t act act ac t aagct t t ga gcat t t t t gt cgt gt t ct cc aaaat t t ct t acacact at t t t t agct gt a at at at caaa agt agat t gt t ct t aaagat at t t t t cct c ggaagacaca cggact ccat cact t t gcaa gt ct acaaac t ggat ccat a t aacct cat t t t t t agacgt caaagagggc ct agt ct t ca caaat gccat ct gt gacct a aat gaagct g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 Page 611 12689250 Sequence Listing.txt ctatttagcg ttaattttta agtagctctc ccttttcctt agaacgtgta acacttttgt at atgtagat aatagtagaa cgaaaaaaaa ctctttgtag agagattctg cttctatatg <210> 730 <211> 433 <212> DNA <213> Arabidopsis thaliana <400> 730 gtctatctat ctctgcagtt ataaagact a tgcttttaag at atttat t a ttactatcat gttttacaat acattcatgt tgcagtttat atcaagattc tctttttaaa actcactaac ct t ct ct aac cagt aaat ga t cat aaat cg attatccttt tatccacatg ttgatgaatg atttcttatc cacatagaac actaaggcaa t gt agccact at g ttgtggt t t t ct ggt aat t g t gt t gaat t c t t at aaact g at ct at cacc gagaagtccg t gt t cat t ga gt ct ct t gt a t agt at t t t g aagt t ct at g t aaacat aca acat at t gt t agaaat gacg t cat caaaag gt ct cagt t t gagcaaaaac t t t t t gact t aat gcagt at t gcat ct t t c aaaat cagaa t ct t gt gaat 1140 1200 120 180 240 300 360 420 433 <210> <211> <212> <213> 731 1200 DNA Arabidopsis thal i ana <400> 731 att ggt at ca gccaagatga aaacatagag gt t t ct acag t t gct t gggt cagt cat t t t t t t ctgctta t ccgt t t cat gggacgcagc aatgt t gtag gaacgaaagc caagagctac gggagt t t t g gtggagacag agaatcgaga tggat t t gt t ccggagt t t c tagaggt t t t tggtgacaaa ttgtaccaag agt at ccat g atagggacgt ct aaacaact gtgatgagag caat t ggtgt t ct t t t acag aggtgtctca t t gaagt cat tt gagaat ac at at aggaaa tagctgtgag t gcagagat a gt t t atagt t at gtt gaact aagaagagtg gagaaagaaa agct t t acaa t t t gt t cttg gat gt t t aac t ct ct t at gg gt t ccaaat a tgggcat t at caggagattg acgtgtgtat acagagccaa ct att cgaac ctaggcattg acgtggcaag t at agct aat t cct t t t gga ct t gtagat g at gat gctgt acgt t t t aca tatggt t t t g atcggcaagg ttgaagaaaa ct gcgacat g aagcat acat gaat att ct a gt t gt t t aca cagctt gtt a acct t t gacc t gaacctt gt at at at ggt t agaagat t gg gaagat cat t tt ct ct ct ga agaagt t gat tccaaaacag agtctaggag aagt caaaat aggcagaact aggggagatt at gcat tgct ggagaactaa tttggacaca agct t ggaag aact gaaat t ctgggat t gg ggtcaaggt t agatacgtgg tcgatgggag tgcagaagat gaat cgtt cc at cgcgt t t c t att gt acaa at t caagt t c gaagcaaaag t gtt cat at t t agact ct t t aataaaaccg gt t cat acac gagtgacaaa tgaggccaag tgccaaagac aat t gaat ca cttccaacag caat acat ga tgttagagga aagaagattt t cat cgtt cg atgaacgttg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 Page 612 12689250 Sequence Listing.txt tatcaatctg tcattttcat taatcgaagc attctatttc gttactctgt tttccgttat cacaatcatc ctcttcctct aaattctctt ct t ctctttg ctgctacaga aaaagaatca agctttagag gcaagcaagg actcactcac ccacaagtat cctcacccca t ctactcat g 1080 1140 1200 <210> <211> <212> <213> 732 1200 DNA Arabidopsis thal i ana <400> 732 ct aaact at t at t agt ct ct t at t at at t g ccgccaaat t t t at t act t g aaat at ct t t ccagt gaaaa ct agggaagt aat ct aggt g acact cat t c taaaccaaaa t cagct t t gt aaaccaaaag gt acaat acc aaggat cct a tcgcaacaca gct acat aac at t t at aaaa accct t t t at at ccat t t gc gaat at at ga at aaccaaat gat gct t cat accagcaaca at t ct acat g aggact t gat ct cgacct ga at t cgt act t gt ggat gaat ttcatgt t t t t t t at aaat a acgt cagt gt aaagaaaaac ct t t agggaa at aact gcat agt gcat gt g gat t ct t gaa aaaaaatttt aaacgt aat g t cgagacat t aacacaacaa cat aat gaga ttccat t t t t gcaagt cat a t acaaaaat a taacaaaaca cct t gt gaat gaact aacgt t t acct cct c gcagt t t t t g ct at cat t ca ttcagaaaga gct aaaaaag t at at gt t t t t t agt caat a gt gat t ggt g aat gagat aa t ggcaacat t agacat t t gt t at t ct ct t a gt t t caaacc t aat t cacaa aact t gct t c act gat cgt t gccagactt a aat at t gacc cggaccggt g ct t t t ggct t gt gagct ct c t aact ct gac cgt at aaaga aagt gaat aa t aaat at ct g gt aaacgt t t gt aacaagt a aacacat agt at cagaaaat agt at at aga gt at at at at at t ct t ct ct tttcaccaac aaacat gct t t at ccat t gt agt cct t aac gt t at agt ca gaat gacct g gaccaaaaac t acgt cat ga t ct at t ct gt aaact at aag aagacct ccc t t acaaaat t at at t cagat cat caat at a ccat t act t t caacat t gag t gat agt aat aat t t gt aaa gt t acaaaat ct ct ct ct ct acgact t cac ccaaaat act ct at t ct cac t cct t at t ag ct ggt ggct a ct t ggt cgga caggaccaca tgagtggacc aat t gcaagg gt t at at gaa cat gt t gaat agaaagt at c t cact cgaaa gat gaaaat t gt t t t gcaag act ct cggt a t cact t at t c t t acat aagt aagccaacat ct t aaaaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 733 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 733 tttttccact tacctttttg tcgttttccc tgtccgaaat taaagtgttt tattttgtcg aggcatatag tcccaataat cagttcacaa ttttattaag aagttgacat tctatagacc ttttttgttt aaatatttat atatagacct ttttattagt cttcaaaact tgcttcaaaa tagtaaccca aaactcaaaa gagaactcta ggcttaacat cattgtccaa ttctaaattg Page 613 120 180 240 12689250 Sequence Listing.txt tgttgttata actaactcat caacacttct aattaatttt atgaaatcca gagt t caaca act t t ccgt a cgaaat t t gt aat caaaat t acct cgt gt g gt acgct t cg at t caat cca t t ct gggt t g ttctttctcc t ggaact caa t at act at aa ct gt gt aat c ct gt gaacaa t cct t t gggt gt aacct t ca <210> 734 t acat gt at g aaaaggaat a aaaccct gga t t t gt caat t at t ct t cttc t cgt t agt ga t t agagact c t t t gct gat c gat cact t t t at t cct t at t gt t gat t aac gt t t agt gga gt t t aat gga t ct t ct ct t c agt t ct t t t c gaaaacgat a aacct aaact ccaat ccat c t gaagt ct ct t t cat at aga tgcggagacg atcggagacg acacat at ct ccggt ct at t t gcagct t ga t t t gacat ct t gt gcaat gc aggaacagct t t t t ccacct t t t ggtggac aacacagagc aaaat aaaat ttct t ct t ct t t gact ct t c ttcccagaaa aaagat at cg t ct acgt t ag t ct t t ct t cg gt gt aat cca gt t t at t aaa aat gt gt at t aaaggtcgt g t t t acaaagt t gt t gat tag at cat t gt cg acaaaat t t c caat caat ca t ct t ct t cgc tttcaccgcc cat cgaaact ccggaaaat c cgt acccaat at t ct ct ct c cgcaccaaaa agt ct t t gt c t gct ggt aac t agaact aga ct ct t t gt ct t cat ggagat ttttattacc t t aaagt t ac ggaaat t gt a at aaaat t gc ct ct ct ct ct gcagt t aaaa ggtgaaccgt at t ggcct ag cct cct cct c aggt gat t ct aat cagat ct t t t ct gggt t agggcgtgt c agaagagttt ct t gt t t t t c ggtgccaaag t t aaagcat g 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 734 at ct aat at a gt t cat at t a t cct t gacaa t at gat at t t t agt caaaat gat cat gt aa at caaat t aa caaccat t aa at at ct aaat agt t aaagag aagagat t ag t at t at t aga aaat gagt t a agacct t t at aat at cgcca t t t t at ccga t at t gaaat t gagt t t at t g t t agt cagaa t at gt at gt a t at act t t t t tat t t t gtga t at at t at t t t caaagagt g agct ggaaaa acgcgcgaaa aatgct t t t t t ct t agat t t aaaaagatt g t cct aat t ac cgaat aagac t gaat cgat t at aat t gcca t t agat act a acgt at t gga t t at ggaaat t t t t gt at t t at gaat gagc aat at gaggc at t ggt act c tttttttttg cagat t t gcg at t t t t ggt t tttgaaaagc cagt t t act g t t t gatgttg aaaaagagt t t aat at at gg acaaccat t a aaat aat t ag t agt t aat t t aaagat gaga t ct t t gt t ca gt aat t gct t gtcggacgaa t agt t ct t ct t t gaaaaagt aaat t agt at ccacagt gcc cacaat aaaa t aacat t aga t gt t agt agt at at t t t gt g aaat t t ggca t at gact aca t gcacat gga t t ggt t t aaa t ggt t acacg at gagt gt t a tt gt t t caac t t ccct aat t ct t gaact t c at t cgt act a aaaaaaat ag ct t t aaaaat caat t t at ca cgt at t ggaa at at at gt aa aaagt caaag acaaaaagcc t aacggat ca gt aaaact cg aaat t t aaaa t t aagat t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 ccttctcgtg atcacaaaat gcaaaagatc ttcgaattct gaaacaacac gtgatgacgt Page 614 gaacccgaat aat at gat t g caaaccct at t t t t gt caac aaat ccat cg at agat aat g acat t t t cat tgaaaccaaa acgt t ct cct t t cat aaat a 12689250 Sequence at t ct ct cca t gcgt gct t g t t cat aaaaa t at cat aaat actccaagct tcaatctaaa act at gaagc t accaactt g gtccaaattt cgcaaatgca Li st i ng. t xt gact gat tag t aaact acaa catagtcaac acggcgaaac cgcaagattt acgcaaacat at aact ct aa ct t cat cct c aaaccaaaca aaaaaat at g 960 1020 1080 1140 1200 <210> <211> <212> <213> 735 759 DNA Arabidopsis thal i ana <400> 735 gt gagacgct t t t t act t ct ct t t t t t gaa t gt gaggt aa at at aaaacc aataggcgag cact ccgat g ccggcggcgc aggt acatt a ct ggaat ct t aaccctt aag acct t ct gat at gcaact at ct gct t ct t g ctt aacgcaa t gaat aagt a t acaggaat g tt at ccaaaa gaaagaccat act actt ctt tt ctt ccgcc ct act ctt ct cgt t t t at gt t t caaaatt g tttctggagc cct gt ct t t c t t t t ct cttt aat t cagat g ctt ct t t agt t cat ctt gaa gt agggagat t aagccgtt g cctt cct cct act cct ggct ct at t t cacc gat ct t gt t a t ggt acaat t cct at t gat t t t t act t gct tt act ct ct t ct ct acatt g t gat t at t t g aaat act aac tgtgt t attt gccacacaat ccct gctt ct t t cct ct aag t gt caccact caaaaagttt ct at ct cat a gt at at ct cc t cagt t at g ct aat aaaag t t ggt gact t ttggt t t t gg caaaaaat aa aaaagggaac cagat at cat cct ccct cag at ccaccacc t ct ct gat ct ccat t acgat ggt gt t t t ga cact ct act c aaagacaat a ggt gt gat ct t t aacct ct t aaaat agaga cacagaaaaa aat ggaaat a ccgcct t t ct gt ct t cccca t t ggat t at t t ccat gt t t t gt t gaat gat t t at ct t act 120 180 240 300 360 420 480 540 600 660 720 759 <210> 736 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 736 gttcacgttt cttggtacac gtatctctta gagctctctc gctagagaca acgatgatct ggtcttcatt ttctattggt acatgtcagt tggtctatat ttttaaaaaa acttcttact cggaaattac aataaaccgg acccttaaaa actactattg ttgtaaacta atgttttttc tgaaatacat tttattaaca aatttgcgaa aaagaaaaaa aaaagaaaaa actgttgttc gtttctgttc gatagaggtt catctcgatc cagt ct ccac t at acct gt t tt gggccat c t gccaaat aa t caat aggaa gcct ccaaat t at gt t aaag at ggaggct t gat cat caag Page 61E gt t gaagt gt acgt t cat gt cggt cact at t t ggt ccact aact ct ggat t ccaat ct ct at t t t gaat a at t t aat t t c gaat cgact t aagcat t aca gccaaaatt a t at acacat a aaaaaat agt ct t ggct t ca at t t t cct ag ct agt agcca tagaaacgaa ggagaaaat a 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt gacaagaaaa ggtttcagtc cactacgaaa agcatactta agaaaatgga agcgtgggcc gtatatgggc tgcgt t gact atccaccaga aaact at caa gat t agccac aaactt ccaa acaaaacacc acatccacag gtacgaagt t at gatt gct t cttaaaacaa gagccatgt c t caat aat gc t gat ct gat t at ct at t t at cat ggccat c ct aat aat t t acaat ct gca gacaaaaaaa t at t t agt aa tt agat acag acgtcaggac aat gcat gaa ct gaccact c tgct t t t gt c aaaact t aat t t t t t t t cat t t gaat t t at at aaaaaaat agagtcacaa gccagctacg t t cacct gt g t t t t act t ga gct ccctt at ct t gctaggc gaggaatcat tctgaagaaa ctt gactt gt ctgtatgtga aactagaaag t ct ct aagt t ggtcgccaac atcgtcgcaa at ggt gact a t t at acact c at ct t gt ct c agt t t t at ca cgctt at gcc t t t t cat aat aaaaaaggtt gagt agt at t gat t ggct at t gt acat at c agt caaggaa t t aat ct gag t ggcgt t caa t t gaagt t ca aagt t gggt t gt gt at t at a t cat t cgat g 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 737 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 737 cgagattata gctgtgatct ttgtgt t agt cgaacagata agtgaagtt t ctctgcgat t gt ggt ccct a aacct acat a gt cgat ggt a ataccctgaa aatgacgagt ccct t cgttt agttccatag caactttcat atttcgagaa ccttgtaaag ctctgttttt agtctatgac tgtgaattaa cagttcattc aaaataaaaa tgtggttatt gtagcttgaa aaaacgaata ttgcacactg gtatagcttg ttgctgtcat tcagaaccac gaaatttact agaactcagc tat t aggcta at t cgtcgtg ttgtattttt caactaatgc gtcagtgtca ttagtttcgg tgggtcctat atcgagcgag attggcatgt tctccacatt ttgttttgaa aatgtgaaaa tcagtactat ttataatgca agatatgtaa tacagaattt cgcttaagct ttcaaagata gttcagtttc cacggaaccg gtaaatttaa agggttacaa atatacatag tggattcgta gtagtcacat cacatatgaa agagattgct aatt acgcat t t t ct aat gt aaat gt caat act t gt aat a t t aaat t at a cat act t gt a t aaat aaat a at caagt gt g ct act gct ga ttatcggaac caact aat gc aaataaacga gagt t gggt t at acat t t t t at t t aaaaga t cgat t t gaa aagccaaat a tgaaaaagaa t t t gaaaat c gtatctgttt t gaat ccat c aat gt t t gaa atgggtgaga ttccgcagca aaacat gt t g aat aat cgaa ttgt t cattc ct ccagcgat ttggaagggg gtcagtgtcg t at t t gt t t t ggctagggaa ccctacagcg cat cat aaga t at aat ggt g cct t aaaaga gggaggaatg gaat cct ct t ct aacccgt t aaact aat aa agt t t agat a actacgacac aat gct at gt ttttttttt g aat gt aact c t ct t cgt t cg aat gaaacga accagatgag t t t t t at t t t gt t t t t gtat aat ggt aaaa gggatatgaa aaaacaaaac cat t t gcggg gt t gtgaaac agt cct acac ct aat at t aa 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 616 12689250 Sequence Listing.txt atctttcaag actggaggtt ttaaacaaga aaatcaaact cgatgcaaaa gaacaaaatg 1200 <210> <211> <212> <213> 738 1081 DNA Arabidopsis thal i ana <400> 738 t ggt t cct t t ct at at t t t t at t t t at t t t caaagagaga aaaaaaacag t ct ct ct ct c gacat cagaa t at t gt ct t a accat t t t ca ttttcaaaaa aat agcat t t aaacaaat cg t gcgt gt t t t t act agcagg ct cgt t gaaa t at caaagat t t at t aat gt ccacaaat ag t cct cgat ct ccct t t t t t c gt aat gt caa gt agt t acga cct aactt ag t ct caat t t t at t cagaat t aat t gt at t t act ct at t t t taaaggaaaa t acact t t ac aat ct agct t gact t t t t ag t ct t t t cct t t t gaggct aa t gact cacat gcat ggt t at t caaat act a gggt t t caga t at gagacca act t gt at cg t t t ct aaat t t t ggat at ga ct aat acaag at gat agcat acgct gt aat aaaagaat t g t at agaagt a at cacaaaat gat cat t t t t at at gagt ca gt t ct gt acg t gat t aacat agt ccaagt c t caaat ggaa act acaccac t cat gt agct acaagt ct t a cgcaaaat aa acaat aaact t aagat at t a at at t at t t t gaaaaat cag t t aagct t ct t at t ct aat g accat cgaag t aaagt cact gcgt gt t t t g t t aaat gcat t agat gat gc gt at at t ct t tagcacaaaa at t t t aaact ccacacat t a ct caat t t t t gt aaacgccg t aact at t at at t t gt cagg t t t gt cagaa t at at t t t t a cgtgaggaga taaaaaaaaa at cat ct cca at gt ct t t ac at gct t ct t a agat gacaat t cacat cct c at cgat cacg t cat aagccc at t t caaagt gat gat t aaa t ct caagaaa gaaact agaa gt t t t at t t t taaaaaaaga aat t aaagga aagaat ct t t t t t t t t cgt g gt accgcaaa aaagt t t t aa aaat aaagt t gt aaccgaaa t t t gtggaga t t t t t ct t t c caagcat gt c gt t t t t agct caaaacct aa t gt caat t t c t t t ct aagaa aaagaaaaat 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 <210> 739 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 739 ggtaaagacg gtcttgataa gagtcaatct cgacaaataa aacgaccgaa aaaacttaaa tgtgcatcta attacgttta aatagaaat c tgcaaaggaa aaacaaagaa tgtaaccttc ggaaataact taaaaacgaa atgatttaaa gctctcaagt accacaaaat gtactctct c gctgtcccca ctaacagctg tttcccaaat tttattttag cctcactctc tcagacgcat ttcacaacct aaactaaaga t acgt t t t t a gatacgacaa ttcaaaaatt cgaccatacc aaaatgcaaa aagaaaagat ttaaataaaa aaacgatcca tatcacacac agt t acatat ccccaaccgt tttcttggaa ttcccccaat cgcaccgcca Page 617 t t t t at aat c cgat aat t aa aacact gat a aat t aaaaaa cct gt gaaac t t cgt t gaca agatt ct ct c ctccctgacg 120 180 240 300 360 420 480 12689250 Sequence Listing.txt tgcttttttc tttctctttt aaatattcaa atacgtatca gactactcac ct t cat aact at agt t t t ac gt t t gcat t t at agt t agaa t aat aat t t g gt t caaact a acat at t t aa t gcacgt aaa aaaaat t acc cct ct t cat a at agaaacgc aat ccaat t t at aact acaa at aat t t gat at acagt aca t t t aat agt a aat t t cagt t t t t ggt gt ca t aat ccaat t ggtagaaccg ct t at cct ct gt t t t aaacg agaaat t acg t at at aagaa gt aat at gat tgt t t agtta t at at t t t aa t t ct ggat t t gt t aact t at t acagt t acg gt gacct ct c caacaaaaat at t t gat t ag caaaat t act t t t t t aat t t ttgat t t t ag t aat t t aaca t aaat t t t ag t aaaat aat a t t caat t t ga aaaaaat t aa t aaccat cac cgt t t cgt t c t t t t aaat t a t aaat at gt a gaat t t at t t t t t aat at ct t t cat t aaaa t t t agagt t t aaacaaaacg t t cagt t gaa gtt gtt gtt a acct cgt at a ttttccaaaa aaaagaagat gt t acat ct t gt t t t t ctaa tttgaaaaag ttgt t t agt t aaacat t aca at t t t gt t t g gcat at act t aacat aat t a act aacgat a aat act ct ct tccccaaaca ccgagcgat g 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 740 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 740 cattgtactg taatttgact attttccaag tatttgaaaa acatgacaaa acaaatgcaa ttgaaatcta actaagttcc caacgaaacc ctttcggata tcaaggccca tcactttcag ttgtattgtt taatattcct agatttcaaa aaaaaaataa tatctttttc tacatagaaa t acat gt gt g t ct at at aca t at at at agt ccatatgtag gatgat t atc taaaatatat agtcatttgt cagattctaa caagagaatc tgcttcacca tttaaagcaa acaaactttg aaccaacttt ttcctctgta aatcttcacc tttataattg ggcttcctat attaattaaa tgtaacaaag taccaacgaa gatagaagaa cgtaaagttt atgtaaattt ctgaaagtag gagatttcat ttttactttt ctaacatata taatagaaag aaagtatatt atatttgcaa gtgtgtgctc ttacactcag taaaatttat tgatattttg aagataagtg acctatctat gaaaat t aag t ccacat ggt act gt t aat g ct ct t ct gt t t t gt aat aaa at aat at at a t t t gt aat gt ggt cat aat g t t agt aat ga t ct t aacct t t at t t at at a t gcat agcaa cacacct aaa ct aaat at at t aagt t act a at t t t gaat t t at ggt aaaa gt at at at t t tccct t t t t a ccaacgaaag t aact t t aca t t t cat t t t t at t t t cat t t t aat at gagt at gt at at ct t cacagt gt a tcacgcaaag acat aggaca aaat gcacac t ct at gat t t agt gt agagt gt aagt aat t aaat at gt t t at acat gt ga t t t ctt tcat cgggt ccat c t gt t t t t at t agcgat ct ag t ggcaaat at t t cccaaaat ggcat aagaa t cat acacca cgaat t gt ag aagct act ct ct aagat ct t t caaagt cac t at t t t t t t a aat gt ct t t t t ct at aaaga aat agaaaat gcact ggat t t aat caaaat t t act gaaaa t t t at at t t a 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 ttattgaatt tacatcatta tccaccaacc ccatcttctt cttattcttc atcttctact Page 618 12689250 Sequence Listing.txt gagatttctc tttttttttc aaaaaaaact catccttgaa tgaacggtgg tgcatggatg 1200 <210> <211> <212> <213> 741 572 DNA Arabidopsis thal i ana <400> 741 t gt t aagt cg t ccat gact g at gt aat gat ct aacat at t ct aat cgaat t t gggt aggt gaaact aaga aact t ggt aa t aact t gagc t t gacaact t gt t act gt ct gt t t t ct t aa acat t caaat cgaat t t cgc t t caaact aa gggt ct aat a at ccaaat t t t gt gct agt g gt aat ct cgt t cacct gcaa cgct t ccgt g cgt gt t t at t t aat t t aat t at ct at t aaa accagaat aa gaacattttt gaaaact at t at cagaat ag cct ct at cat t cact ct caa t t gaat t cgc t at aat t t at t agt at aacc at aagacaaa ct gat t gaga gaaat gt t gg t t at aagt t t ccccacgcaa gt ggt t at gc tg aagt ct cgat t t cgt ct at c at t t t t gt t g cacaat ccaa aaccgaacaa gt t t t aat t t gacagt t ggt cact aacgt a cgt cgagaaa t t t t cgat cg ct ccagat t g gt caaacgt t at at accaat aaat t at t t t gt aact t t t a at aaccaacc caccacaaca acgagaacat 120 180 240 300 360 420 480 540 572 <210> 742 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 742 cgcggagtcg gaatatggac aaatgaatat ttatgttctg cgcccctacc tgatcatcac tctgagaatt gcctctcttt tttttttttt aataaacagt ttatatatta agaatagtga agaaagaaaa atgggagacg ggtttttatg aggtaacaca caccaaaaag ataacacaat aatattgtat aatgttttaa caaaaaagtg aaaagaaaca cat t cgaaaa ctttcatct a acctaaagga atcaacgt t a tagctaggac aggttcctat gtttccacat ctttgatgga tctctagtct ataccttaat ttagcttata gact t gaact cct t gact t a tgtaattttt gtacatacta aaagaaatac acaacataat gaactttatg ttaataatac gatgttagaa atgatctccc acccctaagt ttgtttattt tccgaaaaca at cacat gt t t t t cat t t ct aagat t t ct c gggcaggcac t at t t gt t t c t t t agt at at ttatct t t gg cgt t t t aact agt t t t ccaa ttct t t cct c ggt caacact t acaat at t t aacgt at t aa agggaat at c t t t t gtcttg tcact t t t t c ct aaat gaaa t cgat aagaa cat ct gt gga gt t aaagt t t aggt ct t t ac at cggat cca gcat gt aaag taaaagcaca caaaaat gat at t act acaa t aat gt t gac act aat aaca gt accacaga agacgt t at g t t t t acgttt t gctct t t t a gtgggaagct t t at acgaaa t gaaat t gac aaaagaaat t t gaaat agaa agacat acag caaaat gt gg att gaccaac at t gacccac aagat aaaaa t agacct at g aaat cact t g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 gaattggata acactagttt ttaggttttt ctttaacttt tgttttcctt tatttatttt Page 619 t t at t t t aag gt gt cagt ca t aat t agt t t t t t t t ct ccc aaaaaaaaaa t gcagt aat c ccagat t t gg ttttttttta 12689250 Sequence Listing.txt tgaagaagag agggctatat atagcagagc cattcccgcc tgataaaacc ctccacaggt ttaatttaga aaaaactcat aaaatatat cttcttcatct ttttttctta atgcaaaccc gcagagattt ccaacaaaac aggaactaaa acacaagatg 1020 1080 1140 1200 <210> <211> <212> <213> 743 1200 DNA Arabi dopsi s t hal i ana <400> 743 cact cat acc ct t gaat ct c taaaccaaac at agcagaaa gaaat accgg t gct gacaaa ggaaaagaat at cagaat ct agagt t caaa aaaagagaaa ct gat gaaat t caat ct aac ccaaat t ct c gaaagcagcc ct agggat t t gagcat t t ca aacct ccat t ggagcagcga cgaaccgagt at t aaaaccc t cagcaat cg acct aaaagg cacat cact t t gct ggaggc agct t t gct t aagaaggaat aagcaat aca t t acaat t t t tttgaggcag cat aaagaat aaagcat cac ggact cagca cgcct t gagc ct ggat t t ag agaaaaaaag gcagaat cgt tcagagacaa t gaagt aaaa caacccaaac gact t t t cca cagct t t t gt aaagt t agaa t ct ggaaact at t cacagat cggaat cct t acat at t cag aat at aaaac at cat aat cc aact aaact a gaaact t t t g ccagt agaat gct act aaaa t gaggact ag t t gcat at at agat agacaa aaagcccat g gagagt t cgt gat cct t cac cgacat t aac agagcccgac t t cat t aaga aaagaagat g gcaacacagt gat ct ggt ag gt gt aat gat aat t t cgaga t t at ct ct gc agt caaat t c t cgagaact g agaact t aca ct gat t aaaa t agaaaat at t t t cggcact agagagaatt t caaaat gag ct ct at t aaa cgt t agt ggc gct cgct ct t cggt t at ct a ttcaccgaac gcaaagt gag cacat gaact t at t t acct c at aaagat aa gcaacgct t g t gact t caaa t cact ccat a aagt t ccgag agt aacat t t at agct ct cc aacacagaaa cgacggagaa aacagt agga t cat t agct c ct aaaagct a gt cgcgat ct aat ggt gaag cat t t t gaaa t ct gaaat cc aat ct ct at c at gaagcaat ccaat gaaat gt t ct cat t a t gaact cct c ct t gt acgt a gt aat ccaca aat t cat at a aat gat t t at cct t cacagc ggt gt t ct ct gaat ct aact agaagct t aa gaagaaacga t t gat gaaaa gagaagt t ac gct cct t t gg t t t t t t ct t c caagaaat ct at ct cggt t c t ccggt cat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 744 <211> 1200 <212> DNA <213> Arabi dopsi s tha i ana <400> 744 agcatct t ca t gact t ct t t act t acgt t g cagt gaggat agt gt ct gct cagct t gagg ttgatttgac gatttatttg attttttaat ttaggtgttt tgattaataa taagtatgat ctatatatta attaatgaca gagttataat cctgaggctt atatggagtt gagagaattt Page 620 120 180 12689250 Sequence Listing.txt t t agat acaa t cacgt caca aat t ct agt a t t aaagt t at at ct ct t t ac at aact agt t gt t t t t t t t t t t cgat t at t ggat at t t cg t t at t at t t t gt ct agaaat t caaaat at t ccat t caat t gt t cgggt aa t t gt aaagaa tt act aacga ct t gat at ga act ct gt aag t gaact t agg t gcgt t t t ct acat t at act tttttccaaa at gggagt t c ttgt t t t aaa t cggt t t t aa gt t at t t t cg t gaat at t t t aact aat act t cggt t ggt a agt t t ct ggt t t t gctcagg aaact act t c aat cact t t g gat aagt aaa t gacggt gat t accaaaaat t t t t gt gaaa tttttaacaa cat t ct t t t c ggt t t at at a gt t t agt at c aat t t agt at gat at t t t t g t act at t t t a t t t gt aaat a t ccat t cggt t at cggt t ct cacaaaact a aaaaat t gcg t agat ccat a ct aaaaact t aaat t ct gcg aacacct t t a ct t gt t agt t at t t ggt t t t gt acat t gga t t cggt t t ca cct t cggt t a ccat ccggt t gat at t t t cg t at at at t ct t at at t t at a t t cagt t at t gt t t act t cg acagt at aac t gacgaaat a agt t t cgt at t gct t t t t t t ccactctcat gcgtcgctct t at caat aat ggaat t t t aa ct t ct agt aa agt t gaaact gt t at at cca at t t gaaagt at t t t gcttt gat at t t t ga caagt at t t t t t t t agat at t cgat t at aa gt t t t ggttt acaact aat c caat t ccacc at ct t t t agg t t t t ggtat c t t ggt t at aa t t ct aggt t c tttagacgaa t ct cact aaa t t cgct t at c t cggt t cggt aaat at t t t t gat at t t t t t t aat t at ct a t t t t ggt t ac aat t t t gt at ggt acat ct g t gt gcaat t c aat t t gcgga t at acat at a at ggccaat g 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 745 1200 DNA Arabidopsis thal i ana <400> 745 ggt t aggt cc ct t t t cacat t aaat t ct t t aat at aaat g aat t ct ct t c t gat t t gat t t t cagt t aaa t caagat at g at at caat gt t at at caat t gt aat t caat t cacgt acct aaagaggaca ct t t gt t t t c gt aat aagcc ggat at acat cgt caat ggt gt agat aaat t gaacaat t a ct gat aat aa gact t acat a cgct t t caga gt cat t gcca gaaggt aat t t at ct t aat a cgt cagcaag gcagt t t t t g caacgat aag t at gaat t t a t gt t gt t act t t t t aaaat a at t cgt ggt t ttgtcat t t a t t gt t at gca at aat t gaac acat ct at t a t act t t t t at t cct gat caa cct t t cgct t caat t at t t c accat gacaa cgat at acaa t gt gagat t a at t t t ccgag ct t t gaat ga cgt ct caaaa t t at t t at ca aacagat at t cat agt aaca t ggt t t act c cggacaccaa t aggct gcat t cat at t t ct t t t ct at acg ttagt t t gt t tct t t ctat t t t t t ct aat g at at at cgac gaat t gaaat aaat at gt gg gagaaat caa gt ccaat t t a gt gaaacgac ttttcgacaa tgggccgtga t aat at t ggg aaacat gat a t aaat at caa t aagaagaat t t t ct agat a t gt at t t gat at cat gt ct c acaaaacat a t t t t t atggt cat t acaat g gt t ct cct cc aat at ct gag caat at t cag 120 180 240 300 360 420 480 540 600 660 720 780 Page 621 acgat t cagg cact cagt gt at cagat cca gt agt ccaac t ccgat t t gt t gcaat at t c cat t gat cgt aggt t cgt t c gcgcgat t ca aagct t cct c aat t gt cgat t ct t caat at gact ct t t gt at cggaaact 12689250 Sequence ct t t t t t aaa ggaccctaat tttcaaaaac gagccagcct ttccaggttc gaatccttga gt t t t t gata gagagt t t t g tatgtgcatg aaactttttt tgttctcatg ctcgtcgatt gtgattgatt gattcatatt Li st i ng. txt cact ct gagt ct t ct t cct t t t t ct ccat g t agat t t t ct t t t aagat t g t cgat gt gt t ttcgt t t gt c accact gact cgt ct act ag aat gt gcat g ccggcgaaat t gcgt t t aga t ct gt t aat c t ccagct at g 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 746 1200 DNA Arabidopsis thal i ana <400> 746 gccaagat at gt at t t t ct t aagaaacaac t gat gt gt cc t gaaccat at ct at ct at ac acgcaggaca gt aaaaacag aaaat at t cg cccgacgt ca t t t cct t t aa t at at cggt t cagt t t cgag act cat ct at at t agaat t a t t t at gt at t tt ctt ccaca at gagacaac aaaacaaaca cgt ct t cat c gaact ct caa t t t ct gt at a cat t agaaat t t t aggct t t gt at agt ct c gt t t t t gttt cgt at gaagt at t aat agct t at ccacaca aat t t cccct gat agggat c ct aat gcat a ct aaagccca t t gt gcacaa t ct t caat ct t t t ct t t t tt cat at aaaag aagagcaagt aaagacaaaa at ct ccat aa t t t ccaaat a cgt t t ct t gg at t t at t t t a aaaaaaaaaa tt gtt gtttt gt t t aaaaat gacgt ct t t c t gat agt cca acat t agaga gaaagggaca ggat t at gca acat ct t t t a agct ccagcc at t ct cggat cgat at acga at caaat t t c at t aaat t t c at t agccacc cccgat t gt t ct t cat at ct cagagacagc t t t cat at ga t ct agct at c ct t gat gt gt at aaagat cg gt caaaaat t accgccgt aa aaacgcacgt t caat cat ga cct caaaat t gt at aaaat t t att cacaca caaat caaac t t ct ccaacg acct aaat aa t t t t t at gt a t t t t t t t at c acaaagcaat tcgagcacaa cct ct cct t c t gaagat caa acaaaat t ga t t caat t at t cct t t t aagt at t gaat t t g t gggt at cca at ct t t ggat at aaaagt t t aaccgat at a acgt ggt t cg aagcaaat cg at gaaat agc ccat t t gt ac ct ct caat ca t t gaaaacat t at aaaat at ccaagct t ga cat ct t aaaa acgt cact t g cgct gccacc aggataagga t t at gagaaa t t gcct aact gacacat caa at t t t cct ct ggat gt t cga t gacagt ccg t t gcat gt t g t t gaaggt ca t gt aat ct t t t at t t ggt ga t at t t gt t t aaagaacct c t t t t at aaaa t t t t t t t ct t t t gt gaaagt at at caaaac ct caaacccc t ggt t t acat gt t agcgat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 747 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 747 Page 622 12689250 Sequence Listing.txt ctaagaacca cataaataaa ctcaattcga tacaagtgaa tcacataaat aat t cgat ac aagct at caa t at gaaact c t t ggt at at g gaaat ct t cg cgaaaat caa cacaat gcaa t ct gt gact g gt t t t t caca t t t aat at ca t at t aact t a ggaacagaag tacaaggacc ggaaat t t ca gat gaaaaat t t t gt t t cat t acacaat gc t t t gt at at a t t t cat t at t aagt gaat ag agct gt t t ga aat t cgat ac aaccacat aa agaaacggat accct aaat g accct aaat c agaagct gaa caaat t aaaa t aagt t t t ac cat caat t t t gagt agt t aa tttggaaaaa aat ggct ct t at aat at t t g cct ct t t t ag gt acaacgt g aagacat gaa gat at t caag t cacaaaaat aact caat t c at gagaat ag at cgat ct ca agcgaaagcc t gat t aaaac gat t acaaaa gagt accaac aaagt aact t tt ct ct ccac t at ggaacaa t aat aat gt c t aaaat gt gg t aat gt ggt g gat cat t at a t cgct ggt t t aagaagactt t at gaaacca aaact aagaa agt caagt t c gat acaaat g t at aagt t ca aat aaacat a ccaat t cgaa cccaaat cga gaaacat aac ct gaagagt c t t gt at t at t t ctat t t gt t aaaaaaaaaa gat cggact a t agagat t t c gagat t t aag ccaccaaagt ggcct caat g ggt agaaat a t gt t at cat c aaagaagaaa t aat cgat ac aat gct cact aagct at at g cat gt caaaa acccgt aaat aact caaaca ct t ct caaat cat aaaat t t t at at t at t t at t at t aaaa accaaaat at t ccct gt ct t agat ggct ct at agct ct t t ccacacacga aat aat gt ag ct ggt t t aat acaat ct cct aacaacactt agt caagt t c aagt gaat ac at caaagct a aaact caaaa agaagccat t ct cagaaacc agacaaatt t tagaaccaga gat gt t t t gg tttttgtaaa at aat aat aa caaat t t t at gt t t t aat gc t t gat at ggt aact t aat gt gt t ct ct at t t ggagact t t t t at ct ct at t t t ct t gct t cgaaaaaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 748 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 748 agt t agttta gggccattgt tttcaggtcc ccatgacccg gacatttttc cggtttagat agcgat t gtc ccacaacctc ctgtaaacgc aaacacacaa atttctttat actgttacag tttaaaactt catttccgcc agtgtttgcg tttttttttt cttttttatc tacattagta agtgaagtaa tataacttca tttttcagga actagaggct gattttaatg atagatccta cct t aagat a ct t t aacct a t t gggt ct at agtcacgttg tagacccaaa gcaataaaag gat ccgt acc t aagt gct aa t acgt at gcc at t gct agat acagcaggcg t t t t gcat ct ct t gt agt t t aagaaat caa aaaacggat t at t t gggat a gt ccggt ccg t at ccgat aa ggt t gct aac at cgct t gca gt t gcgggt g t t t gct at gt t t at gaaaga accaaat gaa ttat t gcttt gt t aaaccat at cggt t aac t t t agggct a at t t aacaac at cgt t agat gt t caaat ga at t t t gagat aaaaacaaaa at caccagaa ggat ct at aa at gat ggt t a 120 180 240 300 360 420 480 540 600 660 gtgtgtggtt ttaaaacctt aagatatttg tgtaggagac taggaattgt gtgttttgaa Page 623 aagaagcagc t t gaaaat ga at aaagaccg gt cagt caac ct t t ct t gct agact ct t gg ct aat t at gg acgcaat t ct t ctt aagacg cact aaaat c t t t t t cagt a ttgagaaaag ccat gt gat a t t t t cat t gt aacacaat ca agacgt gt t a cgagt t caag gagt cgt ggg 12689250 Sequence at at gat aag t caat aaagg ccaaaat caa t t aat ct ct t t cgtatttgg atcaccacaa t at at gagaa t gaaaccat g t gat act caa aaaggt aaaa gtcat t ct t a t t act ct ttt gcagct cct a at agaaact t aacgagtttg agat t caaaa cgaataacag cgattgttgt Li st i ng. txt aagcagct ac ccat gt ct cg gt t agt agaa t t at cacat t aaact t aaaa ct t t ccct t c gt gt cat cca gcct t gt gt g t ct t gggat g t gaaat t t ac t t t aat gt at agact t gt t a acaat t t cca gaaagat gaa t t t ct cact t gaacaaagag t t gatgggcc gt at caaat g 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 749 1200 DNA Arabidopsis thal i ana <400> 749 at t t t gagt c t t aat aaaat t at acat gt t aggt gaat t t t t at ct t t t g t gt aaaat t t t t t agt ct ca aaaaaacttt t t aaat t cat t cct t t gct t cact t t gcca t t t gt ccgt a at t t aat t cc t ct t t gagt c aagggct act t t at at ct gt caat cccaag t ggccct tag agct t t aaca t ct ct ct cac <210> 750 gcat t gt at g ggt t at t t t a cact cat ct t t at gagt aga gaaact t t t a t gt act aat c acaat t gcaa aaagt t at ga t aat t ct gga agcat aat at cat cact t t g t gt t aagt t t gat t ccaagc cgaat acaaa aacct ct cga aat t ct gaac t t at acgt t a ct gct t gacc t t t t ct gagt at t t ct t aat ct gat t ccat ct t t t gccaa t ccat at gat t acact at gt aaacaccaaa t aagt t caaa act t t t gaga gt aat ct aag gcat aaagaa ttct t gt t cg aagat gaaca at t t ct gt aa t at acgt aag gact t cacat at cgacaaac at at gt gaga agggat t t t a at act t t t gt caaat t gt gc ccaaaat cat at gt t cggct cagt t at t ga ctt acaacac aaat aat t ag ct t aat at t a aagt at t aca cat at at aca t t at agact t at ct gaaaaa gt t t agcgat at ct acaagc ttttgaacac aaat t t caac t ctat t t t gt gt at ccaat g agagaaat t a agt aacgt ag t ct agcccga aaaaccacat t ct t ct aaaa t t ggt cggt t gt ggt t gt gt t t gct aaaat t aaagt aat a cat gcat t ga t gagt at gt c gcaaact ct t gt agt gct ac t aaccat ct t ct cct aaaat t gacaaacgt at gt gagaat t aacgt ggat t t t ctt cttt t gt ct gt cgt t caacggat g aaat cact t c at acaaaaga at ct gt at t t gat at t t caa t caagt gt gg t t at gt t t t a gaat t cggt t t gat at t t at t agt gat cac t t at cccaaa at ct cacat a t ct agct aaa ct cgat cat g gcaacgtggt at ct aat gt g t t t caaact g t acat act t t cgaaaccact t at gct acgt cat gt gat t c at gat t t gag ct t cacgt at at accct t ct gacaaccat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 Page 624 <211> <212> <213> 12689250 Sequence Listing.txt 1200 DNA Arabidopsis thal i ana <400> 750 t t aat gccga gcaat aacat act t gaaaaa agcaaaagca at at t gat t c aacggaat ac ttgacgcaga t ccaat t at t at t cgaagat agggagataa ggaaccaat a t at t t t aaca t gacgt gt ac aagcaaaaaa at at cat gcg aact t gacca acat caat aa tttaaaccac at t t t ggcat agagt at t t t gat t cgagac t at ct acat a at cagt t at t aaaat ggat t gat t acgaaa t aat aaaat a aacgaaaaaa t gct agaat t cgt at aat at gat t t gt gag cat at t cgct at t gct gcct aat gt t cgag aaaaaaaat c aacct cccaa t t aact t t ga t gt t gcaat t acact caaca acccgct t ct t at t gat act aagggccact acaat aact t gcat t at gga t t t aaat t at tat t gt t gt t gaat agacca at gaat aaaa t gat accaat gaat t at t cg accagcgt at t t t t aat at t t ggct cgt gt caaggcacag aagaat ggct act t t aacca gt gaaggct g tgt t t t t t cc tcgcaccaca tggcaaacca at t t t t aaat t gcct t aggg t gcaagt t gt gt t acat gat t t t ct at gaa t gct t gt t ca act t ggt caa agaacacaaa ct ct gt at t a t ggacact ca t t agat aat t t gt gat t t cc ggtctgcgaa gt agagagat t t gt accat a gccaat acaa aagcat at aa cat gt cat t t cact t cact c agt aacaaaa t gt gcccat a agt t act t ca aat t gt aaga aggat t t gac aaaat t caat cgt t gt cgaa at act aaaaa aagt aaacaa t t t ccgat t g cacact cat c t aaaat acat t t t gct t ct a t gcact aaga aaat at agac agct t agt t t acgaat t ct c aaact ggcaa ggt cccact t acaat t t ct t accaaaacat cat at t acct accat t t aca gagtagagcc cagacacaac t t cacaat t t ggcat ggt at gaaagcct cg aaat t act cg t t t t ggt t ct t cgact at t t t t at t t gt t t cct gat t t t a gt acaacgt g cacaaaacca t caaaacggt accgagt t aa at agaat ct a cct t agt at a at at aagacc t t gt at t at a at agggaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 751 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 751 atactgatat ggctttaatt tgtagctaaa atatacctct ggctaaattt tagtttcttt gtaaatagtt gatgtacatt ctgaagtagt tcgcctcttc tacggctttg tctcgttctc ctatttattt gactgatcaa aaaaaaaata tgacaggacc tgattgattt tgtttttgta gactcaatga tttcatcctt t ctacttttg tatttttttg aaaatttaag agataaaata aact gct aat gat acaaaat gagt agcgt a acaat t at t a cact at t t at agat t cgt gg gggt t t gaag gggcatacga Page 62E at gaaaat aa aaat ct agga t t act t ccaa gct t at at cc ct t t gt gggt t t t t t cgat t t caact gt t t agccaat at c t t at t ct ct a ct at aaat t c aacaact gca at at aagaca t t t aaaact a t cat gaaaat at gt ggacgt t agat aagt g 120 180 240 300 360 420 480 12689250 Sequence Listing.txt atttaggaac aaaaaccaat aacaaaatga agaaacatta cttgacttca gacgagtggc acat act t t a aaagat acat t t gt t t t at t agt t t ct t cg ct t t cact t g ccagt t t acg t t t t at aaat at t t cact t g t t act caat t t at at t t t t t gact agt cat t cgt t at cct at at t at t ga aat aacgaaa ggt ct agt t t at ggt aact t t gaagt t cat aaat aat at t act at gacct t t at at ct t c act t aat t ct cat t caccac aat t t gaat c caat ct t t t a aaaaaacgac t t t t aaact t aggat cagt a aaat t t t at t tgcat t t t t t ct t t gt t ct t t cat cact cc aaagcgcaaa ggaccacaca t t at at aaga t t cgat at ga ct aat at gat t agt ggt ggt ggt cat at ca at t cat gt at t t t t t t gtt a t t t t t at t at ct t gaat cag at aacccacc t aacgt t ct a gt aat t aat t at t aaaaacc at ct at ct ac at gt gt acaa ct aaat aaca at agt t t t ca aagt gt at ac t t t t gtgtgt gagtaagaag acaaat agac t at aat t gaa cgtt gaaaaa aacat gat ag aagt t t aat t t aaat at acg aact agt cca caaaaaaagc aaat t at at a 540 600 660 720 780 840 900 960 1020 1080 1140 1200 tagaaacata tccaaaaacc tcaagaaatc tcct t ccat g <210> 752 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 752 agacaacatg actacaaaac tgttgactcc tcagttattc agtaatgtct ccatgagatg tcatccaccc tcatgatgat ccgcagacat aagaggtctt attggagaaa cgttaatatg gctttgtcaa acattttttt tatagaaaaa ctcaaagatt cgacgatctg gagaaagatt caaaagacca ttccgtttac ggaaaccaag gagagtatat gtagacaaat gtaactaaaa ttat t atatg aatcagaata tgacacaat t cttagtcatg actctctgtc cttgtgcaaa tattattttt tccggataga gacttgatag aatagatgga agtttgtcta ccacaaaata aggt gcggt a ct aat accac t at t gt ggt t aaatgattaa atcccaaacc ttaacacaat caattagatt tagttatggc tatcgtttac acacataatg agatgataca cgttgtacat aagataagtt acaccggaga caaacacacg ttatgatccg tcattcatct tcgtgagtct at gagat gaa at gaacat aa t ctt accgag t at t t gt cct t ct gt gccct tacgaaaacc t t ggt gacct aagacaact a at t t gcaat t t agaat gccg t acaat t gt g gt t t t gact a cggat caaaa ggaaaacaga t aact cat ca ct ct aat caa t t gt accaca ct accat gac cat aacaaag caaaact gt t tt gt t t acag aacacct gaa t t t t t t ggt g acaat gcgt g t cat t t ggga t gat t gt t ac gat ggaagt t t agt cacgag gt ccgaat t a aaaacaaaag ccaaaacgt g gt agt gacac gct t cact ca agcct ct at t acat t acaaa ct ct aat at a acat t act at gact ccat gc at accaagag act gt aaaga at at ggat gt t at agct gca ct ct gt cgcg t ct t gct cct aaaagaaaga ggt gaacaat aaaccaagat aat gaaagt t at t t gat acc at t gt t t t t g at agt gact t aat gt gagac t t gaaaacct ggt t aagccg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 aaacgacttt tcccttgacc aatacaaaaa atgacttttc cacactctcc ataaattttc Page 626 12689250 Sequence Listing.txt tttacaaact caaataccta aaccaaagcc aacaaaaata aaataaaact cttcaccatg 1200 <210> <211> <212> <213> 753 1200 DNA Arabidopsis thal i ana <400> 753 taaaagaaaa ttgt t gt t ca at at t t cccc ct cgt t ccaa aacat gat t t ct cagat t ct aaaggt at cc aaacagagct caat aat aca acaaact t gg gt cccgaaca gaggct at ag at t ct ct at c t at t ct t t t a gat t aggaca gccat ct gag aaat t aaat a t at aaaat gc aggagat t cg t gt at agat g act acat at t ct cgaat aat t at gaaagaa t ct cat gat c at at agact t t ct t caact a aat t t gt gaa aact acat ac cat cat cat a t caccgt gt c aagacgt t ga aaacaagt ct tccacaaaag gagaat ct ga gaat at t t gt agat gat aat t caagaaat a aaagct aagt gt at gt aaaa t t t at gt aaa t t gggt agat aaat t t cat a aagaacat at ccat t t acac t aaagt ccat t agaaaccat agcct ggaat t t at acgact t t acact cat gaaaacaggg agt t cagt ga aat gggaaca aaaagaagag t aat gcaaaa ct t t ccagag acaaat aaca aaagagt aaa t gat at acaa tgccgaaaaa t t t t gt caaa gaaaaaaaac t t t at t gct a aacat ct t t g t caagat at a act at t t cag t cacat t agt at gct ct gt t t t aaat ccct ct t t ct caac t t t aat cgag gagat t gaga aat gaagaaa acaaact t ga aaact t act a t t t aacggt t agaaccacac gagaaaaagg aact aggcct t cgaat t t gt ggagt t t t ca aat at ccat g t at t acaat t ttgtctcttt t t at t gt gca t t aacct t t a t t t cgggt t t ccat t acct c t t t ctt cacg ct caat cat a t t agt t at t t aggcagact c aaggcct t gt aaaaggaaat aaaaagaaga at aacat gac caaagt t aag agggaccttt ct ct t ccacc tcct t t t at c gt gcaacaat aat t t gt cat t t gagt t t gg t t ccat gt ca aat gagcat c ccat ct cct a gaagtggct t cacat gccaa t ct ct acct g ttcacagaaa agaggt t t t g gagaaaacat gccact cat t gggt gat t ct ttctct t t ga gt t gt aat at caaat at t t g t gt at gagat ct at agct ga t at t gct gt g cagagagat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 754 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 754 tcgaaaacca ttccaggtat ttttttttaa gagaccatgt tgtttttctt cccaagaaaa ataaaaggct tgcaatcaat ggaggtttta atagtgataa ctggacaaga tttttaaaaa gaaagtaaat gtcataaaat cataat t at g ctgtcacatt agacatttac ttatttacaa gtcttacaac atattctttc tttaacctct attgctatga attatagtcc atgggtgaat atgccacaaa cacaaaagaa cgtagtgtgc aaggtatact ttttgcaaat tggttattta Page 627 cggtgaggag t cct aagt gt aat gt cat ca aaaaaaaat t t t ggt aaaat ct t aaat aaa 120 180 240 300 360 12689250 Sequence Listing.txt t gaaact t at t ct aacagaa aat ggaaat t at acggat t t caaaagaagc tacccgacaa at acaat t t a gt t t gt t at t gcagaaaacg aat ggcaaaa agaagaat ag aaat ct t t gc t ct act ct at t gt at aaact acgcaaaat a at at at ccct t t t cat t ct t ct t aagagat aagt t aaagc ggattttttt acacat t acg aagt at gt at caaaat ct aa t caacaat aa cat gt ggcct t caact t ggc gacgt ct t t t aaat aaaat c ct gt t agt ac t cct at aaaa t t gaat t t ct t aat ct cct t agt ggt at t t t t t gggt t t a tttacaaaaa aat at t t gaa t cct at at t a t at t t gct at t t agagt gga tcaccccacc ct gact aagc accacat t ga ct agat gt t a t t acgt agat gacat ggat t at agat t t t g t t t agt t cca aaaat ct caa ggt ggcaaat t t t t gggtgg aggat gt ggc accaaccgt g ccccccct ct t cat at ct t t cgat gt cat t at t aaaaaaa t t t ct gt aag gaattttttt ccat t t t t t a ccatttttta t at t t t t t t t gt gt t t cat t ct aaaagat t t gt t gat gag cct aaccct a aagcat ccgt ccacaaggcc t aat t t gt t t cat t at t agg aaaaacaat t t agt at aat t t t gaaccaat t at ct aaaaa agggt at agt t t t t aaagt t t aagt cacaa ccgt t gacgt at aggat gag ct caaaaat g cct ccat t t c 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 tcagaatgct ttcttgagaa tgtttttatc ttctttctat agtttccatg <210> <211> <212> <213> 755 1200 DNA Arabidopsis thal i ana <400> 755 ct t ct aaaat ggt cgat aaa tttttaacca tacgtacggg ct cat accac t t aat gaccc cat t t t t at a aact t ggaaa t t ggagaagt gt agt cgt t g t ggt act t t c ggcccat ccg t aat at t at t at agagaaag at aaaat at g at t gat t agg aaggcacat a t at t t t aat t at caaaaaat t t gacat t ct at at t t caac act aagccag tgtgagaaag tggaaaaaca cagggt ct t c ggt t at gcac at t ct t t t ct gcgt acaaag t gt t aaact a agaagaagaa at caaaagt t gggt gcaaat t at at at at a at gt t at at t t agt cacct a agt at t at ca t ct gt t acac t gat agat ac cgt aagaaat gt ggcgt t aa t ggt t cgat g gagt at aaga t acat aagt t aat t at acct aaat aaaat t aaaat t at ac aaat agt at g gaaaaat at g t at at at at a t t acgt t ct c ccat at acaa t aacaat t gt t aat gt t gt a aact t t t aat t aaaacat aa aacggt aat c acct t gat gc t cact t cct t t at ccgat ag t aat ggggt c t aaagt aat t ct aagt aaag t at t at aaca aact aggaac t at at at at a t t t aaact at tagt t t t ct t t t t t gt gaat t caggt t cat t aat t t t ct g gccaggaaga agat cagagt t cct aaagcc gcct cat ct t agt t at ggt c at at t t t at t t t at gaat t a aagaaaagt t agaaaagatt at agggt cat t t acgagat t t at caacgt a t t acccgt ac t at cat t t aa gt gccgt aat agaaaact ac gaagt at caa agaaacaaga aagggtgaga aat ct ggt ac t agt t acaag aaat at at aa aat t t t at aa gaaaacct at agt caaaaaa at t acat at t 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 Page 628 taagccaaat tttgaccacc gtaggt t at g aacgat at gc aat t caaat t tt gccct cca aggagaggaa gat acct at c 12689250 Sequence Listing.txt t ctaactaaa taagcttgaa ttataaaagt tatggggtcc tttagctccg cccccgccca tccttgtgtc tttccgaaac cccatgaaca ct t atatacc aacat t ctct tctctatct t attatttctt gttcattatg agatttttcg gtgttagatg 1020 1080 1140 1200 <210> <211> <212> (7 <213> 756 1200 DNA Arabidopsis thal i ana <400> 756 t cat agt aaa gcaaaagaca act aaaaaga t t t cat ct at ct gt caaact t aat t at t t g at gagt cgaa t aaaagact a accaaaagac aaaaact t at at t cgt ct t t cat t ccgt t t t t gt t t at t t t t t ct ct aat aat t t gt t at aat t t t aaaa tat t t t ctcc gt aaact t t t t ct t t aat t t ct t caat t cg t act acaat t t gcaaagat t cgt t gaaaat gaact aaggt aaagt at aaa ct t t t caact at aaaaaaag gaacat at ag caaaaat cca ct at ct at ct t ct t at gt at t act aaaagt agt t t gat t t aat cgt aaac t gaaat gt t t t t gcgt aacc cgaat t t aat t t t ctt acca at t ct act ct tttccaaacc tttttttttt aaagacgct a ggcaaat at c t t t t t aact a at gcaacaat act t t aat at aaaagaat aa t caat t gat c aaaaggaatt cgcgggagga t at ct t t aag t aat at acca t gggt gcat t t t t t t ct aat t cgt aat at a gt aat t t t aa at t t aat t t a ttttttttaa at acat at ct t at act ct cc tttttttttt t gat aagaaa aat cat caat at at at at at agaaaat at g ct t aat t t ac gt agt t t gag at ct acacat t ct ctt caca aaacaaact g aaaagt cttt aat t gcaaat t gat at cttt aat cgt aaat cct caat t t c t t t t aaact t ct aaaaat ga t actt ccat t gccat at at g t ct ct cacaa ggt aaaat at gaaaact aat ccaaat aat a at at at accg agaaat aaaa cat gagaaag aacaattttt gt t gt t gtta t t at gt gt at ct cgt gt t gg t gcatt gt gc aat t cagct c tt cat cgaac t t at t at cgt aaat t t aaat gct caccct a t at gat aagc g t ctt ttt t t gt at at aga aaat gaacaa t acaaat t t a cggat at at a t t cat at acc t at gat t aca t aaaat at at aaat gaaaat taaaaaaggg gaagaaaaaa ggccat aaat t gct t t t gt a acct acacaa cagt t t act c t t at gt aaat caaat at t cc at t t t at t t t agatgggt t t ct t t at gt at t ct gacgt ct gagaaat at a caccaaaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 757 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 757 ttgattttct ctcaccttct ctaggtttcc tcgttgctta ctcaaaaaat ggtaaaatct gcttaaagaa atgataaaaa tgaccaaatt gcttcttttt ttttgtattg taatggaaag aaaaatacaa aatggttcgt gtccgtgcga ctcggttgct ttataaccag aagaaacgag Page 629 12689250 Sequence Listing.txt aaccgtacat ctatcgatgt tgatgaaaaa tagacggcgg agatgaaaga t t t gggggt g gt t at acggt ggaaat t aga t ct t t at at t t t ccat t aat at at gct gt g ct acaagt aa t t t t t aaat t accat gact a at gaaaaaaa t cgt ct caca t ct t t ggcat t t at cct ct t ggcct caat t t gaaaaggt t cct gagcct g aaggagtaag t t gcat t aaa agaacgtgcg t cct t acgt a ct gat t ccat ggat t t t t gg at gt aaccat aaaggat aaa t gggct cct t t ct caagt ac gt aaaaact a t t t t cgt t t t t t t t cgt ggt cat at gggt c gact ct t gac aacagaact t t gagaagcat t gggact ct c t at ggat act at gt t t t t t t t t caat t t gc ccgct acaag cccaat gt t t aggaccat ga tgt t t catta cggcat at cc aaaact ct t g gt t t t gat aa t t ct gct gt a aaggt ct t cg t t at ct gt gt ct ggct cct g ggaat caaca aat at cat ca t t caat ct ca tttcccccac at t gaat t ca t aaat gt t t t at t t t ggttc ct aat t t t gg at t gt caccc at t t t aaaac tgt t t cgttt agaat at ct t t ct acggagc t t gat gt at c tagagaaaga aaagaaagag agt ct ct ct c gat cct aaat t t t t cgt tat acgaat t gt c gt t t cggaaa tttttttttt gat at cgaat gcccat t t t g aaat agt gat gt t ct gct t a gaagaggct c aaaacat t t a ccact act ga at gcaaaat t gagagagaga agagagagag agaat aagat cgt agct t at gaaat t t aat aaagt t t ct c at t t t ct gaa t t t cat t t ca t t gtcgaccg tttaggccca aaat t aaagg ct at at t t t a at gaaaaact aacat gaat a att acaaaca aat at gt t at aat at aaaag gagaggt gt g agagcaaat g 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 758 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 758 attcacccat aaggccgtaa cacacaagat at gacgat ca caat gcaat g aacat at ccg aaaaaaaaac tttgataagt tttggcaata ttggatctaa atttatattt ttggcaagag aaaaaacttc gtcaccttta caattttcaa gcct t aaaaa ttcaaaaata gtggt t gtga agaactaata aaggtttttt ttttttgaag ttaaacacat tagattagga atcaaaaaca ccactgaaca aatcaaaaat ttccaatatt acaaaatcca aaaagctcac taccatctac tcacggtttt agtttactct tgagtctctt tttctttctt tctttctttc atagttcact aactcctcaa cactttattt ttctcttttt t acgat agat acaagctt gt accacgat at agt agat t cg tt at ccaccc t att t t t caa t gt ccat caa ccaaaat aac t t t ct ct aga t at t gagt t t t cat agt t t c gt aat aaact t ctt cacgga agat t t act t t t ct at t t t a t aaaat acgg gacccacttt aaaaacaaat acat gt t cac ct t t at at t a cat agccact agggaaat aa t ct t gt cat c at agcaagag t agaaggct c t t t at aaacc at t gat cacg t aaaat acaa at aat gaaga ct t acaaaca aat t aaaaat act t gct aaa t ccacgt at t agt t gt t t ac acgt t accag ttct t t t t gg cat ct t at t a acat t t t gcc aat aaat t t g 120 180 240 300 360 420 480 540 600 660 720 780 840 cagtaatctt tttagtcgaa tgatctttga acatatttcc taaatacccg gcaagaaaaa Page 630 at t ct gaat t t t ct t cct cc t t t t agcttt t t t gt gact g aacagagcaa t gt ct gtt ga t t t ctcatta tctgt t t ct t t ct agaaggt t gacat at ag gt t gagat t t at ct t t t t t t 12689250 Sequence at t t ct ccaa aaat acaaac cttcttcttc tgttacttac atttgaattg aatctgatct ttcatgtttc tttttggatg tttggaaatt tcctacatca acaggaagag t agt agt at c Li st i ng. t xt t t t t ctt ct t aagagcaat a t t t t t ct t t c ct ct ct gt gt actt gcaaag ttct t t t t ca cct ct gt t t c aagct t t gt c t t cagt gt t g caaacaaaaa t ct gaat ct g cagt ct cat g 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 759 1200 DNA Arabidopsis thal i ana <400> 759 ct t t at t t t t acat cct t t c gt at agagaa gt agaaat ca caat gt t aac gagat t t gaa gt cat at t t g caacgt at ag at gacaacaa ggat t aaaat aat t t at gt t t caagt at ga aaaaagt ct c aagacgaaga acat actt gc at cat ct t ct agt t t ctt gc gat ct t ct t a gt t gt t t ggt gt t t ct t caa gt t t t ggt aa t t t t t t ct t t cagaacat ca t aat ct aaaa aaaaaaaaaa t t caggt cga ttagaaaccg acgaaaacag at at gt t t cc t aat acaaat t caat t t t ct at at at gat a t gaagt t t t a cgaagct gct t t cct cat t c cct t ccat ca t cat cct t ct aggat cct gt ggat t aagt t t gagt t gaga tttgcatata tacccctgat attttttgct tttatcaaat gacat ct t ct t t acat acat t aat t caaaa catt gaggt g ttct t ct t ct agaggcgaga agagaaacag t aaat t at at at cacat cat ct t acaaat t at ccct agt a ct t t gct t ct t ct t gt t cat gat t gaacat t cagat acga tggt t t t gt t t gcgat gaat t ct t gaggaa cct ct t caat t gct agaaag gcact t acaa caaaat cgat aaaagt gat t ggcat gt aca gt gt aagcac agcaaaccaa aaat t acgaa cat cgt ct t t at agt t t t ac t aaat at aga agaaat ct t g t gt cgt ct t t agt t t cgct c t t aggt t aaa tt ct t t t ct t t t gacat gat t aagat t t gt ct t t at ggat agt t t agt gt t aaaact ct t at gacct cat gtat t gt t t c t at at gat t t t caaaagt ca at aaagat ca t t t at at aag gat at at aga aact acaaca ct cgt gaaca aact ccct t a ggt ct gct t a accggaagat aggcgctcga t ct act cgag t ct t ct aat t at t t at gcac t aggccaat t gat gctt at t gt ct at ccat at gaaaacct at at at t at a t t ggt t t ggg aagcagagt t agt aaat ct a aat t t accaa aat ct t t at a aaaat t gaag aaaact ct aa ct ct gcgaag t at at cggt a t agggtt acg gcaat t t t ct at t gat t ct g cggct att gt t ctat t t t ct t t agaccat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 760 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 760 tagcacttca agataacata aacttcgcat gttctcatgt tgattgtcgt gttcttttga Page 631 12689250 Sequence Listing.txt gtggatgtcc atgt t actcc cctagtaacc tcatcaacca tgcatctatc gccatgaat c t t t att acca t t accat aac t t ct ct at ct at ct ggt t aa gct act at ga acagaagagg aaagt t cgca aacact t at c cgt act gacc acat gaacat t cagaaact a act cgaat gt agat t agat a cacagt cgt c caaact at ag t t gt acct t t cagct t t t t c gct t cat aat <210> 761 agccaat gga t aat ccgagt at at at agag t t at at t aaa at aaacat ga aat gt gt t t t gat t t t gat t aaaat t t gat gt aaacat ag t t ggaaat at acgcat t t ct t agt at gt t a agcgt agact act t t cgt gc aaaaaaaact t t t t gt t gt a gt t t gct t t a ccct agt t gc agaaact at t gagt t t gct c agaagtggt t ggt aat t t ca gcaagaaagg ct t gt t gttg aat caaat t t cacaat t at t aact t accgt gt agaacaac aaagattttt aaccat t gga aat caat cgt ct cccgaat g cgatcggct g at t t t t ccat aagct t gaat ccaaaaggat ggaact gcaa t t t ctct t t a t t aaaggt aa t t t t t t ct at aaaaaaaaaa caat gt t t at gat aacat aa ccat ct cat t aat t aagct t gt aacgt t t t t t t t t t t at a t aaat gacac t t ccct ct t c at ct cat gt g at act t aat c acacccaat t ct t t acat ag t ct gggt t t t tttcaagaac at t at gct at t t t catt t t t at gt gt aggc accat at t t c gagt t at aat aagcaaaaaa t t cagt aat g t t aaaat t ga ct t t t ct t t t aaggt aacgt gt ggt gaaat t cct t ggt at ggct t gt ct c acccact t ga t cccgt ggaa t t ccagt t ac gt gagaat ga t ccggact ca cgaaaaacac acct t t aagt t at ggaaact ttacaagcaa t t aaat t t cc gaaagt at cc accct t at t a at t ct t cacc t t ct t t ggt g at t t cgat at ct t agcct ca tgaacaaaaa gt ccacccaa t t t t gt at t t gaaaaggagt t t t ct at aga t gacgt cat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 761 t aagat ccat caagt gagt c t t gagat at t at gat t cat a gat t t agct t gt gat gt t t c at gt caat ct ct t ct t cgac cgt ccat gac t aacccggt t aat aat aaaa t t t at ggt gt t t ct t gcat t t cat gt t aaa acct at ct ga t ggacct ct t at gggt at gt at cgcagaaa ct ccacct t c acaaccgaag t at ct cgt cg t t t t cact ac t t cat t agt a t t agaagt cc at aat ggagg agt ct ct caa t at aat ggt t gagct at t gg agaaact cat ggccccaaca ccat ct ct t t at ccggt cag aaaaaaaat a t t t t cat t gt t at agt t act acacacattt agat cgact c at aggaaat t t agagat t t a ct agaaacct ccaccaaagc t t agct t t ga aacct at t aa at at agt aat ggt t t gt t gc gt gt gt at t t aaat gcaaag ggat gaagt t gaaat agat t t ggct gat t t ct ct t aaaaa ccct ct ccga t ccaaat t gc gcaaat t t t a tttccaaaaa at t t agt gt c t gt gt ct ct t aagagaat t g ct t t cat t ga t ccaat ggag t cgt at acat cggacagaag t caccat cgt aaccgggtt g gacat t t gt t ggt t t t t ggt 120 180 240 300 360 420 480 540 600 660 Page 632 t cgt gt t aca ct ct t agt ag acggt ct t gc gt t ggagat t aat aat gaca acct t aact t t at gt at at a agaggaagtt aaaact aat a t t t t ct gaga gat at t cact tagctggaag t t t caaagt a aaaagcgagc cgt t t gt t t t t at at at gt a aaagaat gac at acct t t at 12689250 Sequence t gat ccat ca act t acaat t agcgtaaaat cactcgatta aaaaat at t a at ct t ccgt c aacgt at gt a gcct aaaaaa tcatattatc gtccatacaa cttttacatt cttgttttca tgtgtgtgtg t gt at at at g ataaagt t cc tcaatctgga gtattacttt gtcctgttcg Li st i ng. txt ggcat t t t ct t gacct t t t a aaacaagt aa aaggagtgac t t t t t t t ct g t at t t t at t t caat agaaca t gaaagt gat t t t gtggttt t t ct ct ct ct t gt gt at gaa gaagact gaa aat ggt agag t cact t t caa t at ct ct at a at aaaaaaga gaaccct aga t gt gt t gat g 720 780 840 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 762 1200 DNA Arabidopsis thal i ana <400> 762 acaaagt gga cagt t t ccaa at cact gt t t t accat agt t ct aat cat t c accaat t t t g t t ct t acgt a ccgt agat t c ct t t gat aat act ct t t gt t gacatttttt caat aaagac att gaaacaa t aagt t t t aa gt t t ct ggaa aagt at cat c agcacat cca at t aat t aat t gt gt t ggcc agaggaaaaa agagcgatcg aaaagct aca cat gt gt tag t t t t t t t gt t gt t at gagca t agat gat aa ct at t cacct t t t ct agaga t at act at at t cat aat gat acct t gt gt t at aat aat aa t t t t t t t ggt t t cgt cgaca t cgt t cgt cg gcct t t cgt t accat t t t ag gat gt cat t t ggccaggtgg cgaaaaagag aaatttttta atacgggtca gaaggaagat attaaaaaag at t gt at t t t t t at t at at t tttttttttt at t act gt t a tgt t gggat t ccat t gat at aaaat ccct a acgaaatgag t gat t t t t t t t at t aat aat t at at at t t a t ct agaaaat at aat t t t at t gctt gt acc gact t t agct ttttacgtgg gt ct t t aaga t at t aat t t t aaaaaaaaac at gat cact c t t t aaat at a gt t t t t t t t g aagt aat t t t at ggaaacct at t at at aga acaaaactaa gccaat t ccc tgaaaaacat ct t gt gat aa act aat t t t g caat catt aa gaaaccaatt t aaccct aat gtctcgagac tgtggtatta ggat t t cat g gt t t at gt at tagagcaaaa at gatt agaa ctaggtgat c at aaat gt aa cctaacaaca aaaaaagtt g t t aat agct c t gt gt t t cac t ct gt aat ga acat at aaaa t aaat t t t t a cat aagaaat agaaaaaagg cgcaaccact t aaaat gaat t ct cacat gg at t acat at a t t gt gt cgt g gt at agct t t gagagatttt acttttttt t acccacgct a t t t t agt tat t t cat cat aa at gact cagt at t cgat acc t t cat gact t gat t at t at t aaacat t aat aaaaaagaag at aaaaaat c aagt at t at a caat aaat at aaaacct t ga t t gct gt t ag caat at ct t a gt ct t cacgt gagaaatgag aat cat cat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 763 <211> 1200 Page 633 12689250 Sequence Listing.txt <212> DNA <213> Arabidopsis thaliana <400> 763 at t t at t t t a t t t t t acaat ct t cgat t ct gccct ct t t t aaat t aat t t t ct t ct t cgt cgacaaaat c gct aat t t t a at aat at t t t t t t t aaaaat gaaat ggt aa t t acagt t t t aacat ccaaa t at t t t act t ct aat ccat t acgt t t t gag caaaaaccct ttagaacgac gt aat t at ct at ct ccgacc aaat t caaca ct t at t t cat agat t t cct c gccaacat ca gat t t cat t c cgt gagaaac t caat t t gaa t t t t t acct t t t t at aaaaa t aaat aaaat aat t ggaat t at t agt act g t aaaaaaat a t t t t t gatta t gct aact aa cat acat at a aat cat t t gg caaaacaat g t ggaat t gt t att ccaccaa t aaaaaat t a cgt ct caat a aat t aat cgc aat t cgat aa gt cgt aat ga t ct ct at t cg t ct cat t gt a t t t caaagt t ct t t t gggat t aat gct t ca gtat t t t t gt t at gt gt at t at t gagt t t t aat t t t t t t a t t t aaat at g t t t t ggt t ct cctct t ct t t tgccacaggg ggagt t ggt t aacccaaacc t gagaaaat c t caaaaagat t acgagaat c ccgt t cacaa at at gat at c aat t t aaat c t t t ct cat ga t t t t t t gct t aat at agaaa t cacct aaac t ct gagat t a t t t ctt cgaa t t t t gagaat aaat at at t t t t t ct aacat cagt t ct t ga t ccccct agg act aat t aaa aaaacacttt at t t t t gt t t t aaat act t t gt t t t t t ct t gct gat aaac tctcct t t t a aacgaccatt ct t gacaaat t t t t t caat t ttttcttgaa ct aaat gt t a at t agt agt t aat t gt t t ga ccggaaagt a t t t cct t t ct aaaat ct t t g gt t cacat at t cat ct agt g ct aat t t aca t t aaacagct t t agaggcct cgcaaaat ca aaaat agt ca gagat t caat aat t t t t gt c at cgccact a gat at gaact aaacaatttt cat aaaaat g tttaaaaaag at aaaat aac aat t gagct a gt at t aaact at gggt agt a t t t at t aaca tagaagcaca at t aaaaggc cat t at ct ct t at aat at cc t gt aaaaat t ttttattccc t t ct accat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 764 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 764 agttttgtat agttgttgac tctatctctc ataacttttg tgtaaaccag taaagatatt ttct t ataga gtttagtaaa aaaacaattt cctcaaccac tgtaaatgtt ttgttatcca taaagcatct ctcgttaaac aatttgtatt ttttaaatta gaatccttaa ggttgggtat taatggaact aaataactgt tttttattag ataataaaac tagttctatt attgatatat aaacacatag ctaagattgt ttactagaga t gt aat gt ga t cgct agt ga tttatgtgaa at aat aat ca taatggagac gttttagcga aaaact t ct a at t cct aagc tttttgcaaa acaaaaataa t t t t aagt ga at caaaaat t aaaactgaat acgtacatat at aaat t aaa cacagat t t a gact t aat t a ggt ct ct ct a Page 634 t t t ct t gcaa at aggaaaat at act cacca t t t t aacttt aacgct gt at ccaaccaaag accaaagt ac gat t at at t t t t ggt ggt ct 120 180 240 300 360 420 480 540 12689250 Sequence Listing.txt agct t ggctt cactctcgaa ttgaagagga acaacgtgtt tgtttagtct t acaat t acg t gaacat ggt ggat t t cgt g act aaaat ga aat at t t t aa cat gagt gt t cat t t t t at t gaagaccgaa t at act aaca aact act at t t ct ct gt t gt cccaaat ggg gat at t at t g t t t agct t ac t cat t at t t t ggt t aaaggt ct ct ct cggt gt ggact t ca t gaat cat cg t cat at t t gg tt agt aacga caagt t gt aa t aagt t gcag cact ct cact agt agaccaa t gagaacat c ct ct ct t ct c t t t t ct t gt a gaggagtaaa t t gt gat agt t ggcaaat aa aaagaagaca ggaaaaaaga ct gcat ccac aact gt t tag gat t t gccaa agt ct ct cac cgggaat t aa t t t gccaggt gat cat t gt t aaat aat gat agt agagaaa gt gagact gc agt t t aat aa t gt t gaaact ccat at ct ca taccacagcc aat at t gcag act t ct t cga gat acaggat t t t t aggact t t agt at at t at t caaaat t cccaccgt ag t t acccct t t t gt gcat cca cagccacagc t t aagt t cga att gacgaca agt t t t gat t agat agt at g 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 765 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 765 aaagcaaaat atctatgcgt agacctaaag taaataatac gtaagacata catgttaaag t agagt cat a at at cgacat aat cagacgt taaagagaga agggaaattg tgttttagat cgacatatag ttaagctctt tattttagtg tattcacgag aaaaacagtt tcatagttac tttctcactt ttaaaaatct attccctaaa aaagtagctc tcaactatct ataatttaat ctccaagact aactatttgt agttttgtac gatggccgca tacgcatttc ccttttctta ttcgcgaatt tgattacaca gatcacatgg cgaataatcg attattattt gtttatgttc tttttgacaa cctcgagtaa gt t attttag attatttatg aaatatacaa ctaacgttca atttatttcg agggacaaaa acaaatacgg tgaattatgg agtcaaactt gtttgagttc tattgaatgt catattaaag tatataatat tagaacagca cctgt t aaaa t at aact at a acat cgat ca ct cccaat ct ccccccgtat gt aacaggat t ccaaat cga t t gagt cggt agt t ccgt ac t t t gtt aaga acgt agcat g gat t cgat t t ttttaacaaa cat ggct aaa t caat t t aaa gccat t t gat gaat gaggat aat cat t t t c ttcgt t t t t a gcct agt gat tacagagaaa aacaaaaaat t aact gt gt a t t t gtccttt agagat t tag ggaaagggaa t gact t caca aat t t at gt c t at aat cgt a t at gaacgt a t t ct t t t at t aaaaaat aaa at t t t t aaaa at t aat t at t cat cat gacc agacaaaaaa t t gaat caca t t t aaaat t t t t t t catgt a aaaaagcaaa t aaagaaat a acacacactt at t ct t ccat at agaat t at at t gggaaaa t gt gct t aag gacat t ct ct ttttttaaag gt agaggat c t cat cccct c aaat t agt ag caacat gt t c gt t t agat t c aaaat gat t g tt ct gct cag t ct at t acaa aaat gt t t t a t t t t ctt ct t t t aaaat aaa agt ct t ct ct t gagat aaat t ct t cgcgt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 Page 635 12689250 Sequence Listing.txt gtcatcattt atcattatca tcgagaagag attcaaattc aaaccctaat cgataagatg 1200 <210> <211> <212> <213> 766 1200 DNA Arabidopsis thal i ana <400> 766 aat at agat g aat at t t t ca t cgggt cat t ct t t gtcttt t at t t t t ct a aaacct at ga t t agaaaat t gt gt t t t cga t ct gaact aa aat gt aaaaa act aggt t t g aaact gt t t g t at ccaat aa ct ct at ccaa agaat cat cc aagaagat ga gt aaact gt t aacaat gaaa caat gaaact ggcct aat ga t t ggt t at aa ttcct t t ct c at at t t agt a t ccaat at ga aaacaccaat t ct aat gacg acaat at t gc gagacagct g tt act ct cat aat t t t ct t c cat at caaac aaacct gt ga cat gagt aat acaagt act a at gt ggt ggc aaaaccattt aat ct ct ct g t cct ct gt t t agggtgccgc gcct t t t aat cct cacaat t acacaaacat aaact aaaag acaaat gcga t t t gt at aaa gt ct gcgt t t t aaact at ca t t ccacgt aa ttct t ct t cc t ccaacact t t t t at gt t t g t gtt cagaaa ttttcacaca ttttgttacg t caagact ac gt ct t t cat a ct caggt aag t t at t ggat c caaaat t gt c aat gt agact t t t gact t gt t gt t ct cgt a t at t gt t t t c at agacaat t cct t cat ct a t t aagacat a caaat t gagt at gcat cct c t acagcaaat gct ccagcga at aaaaagt a ct act t caaa ct gat cat ct ct t t t ctct a ggt act t gca agt gat acaa t gacccat t t at t aaat t ga t t agcaat t t ct t gt agagt t acat at at t agt t cgacgt aaacgccaaa t ct ct ct cag aact accaaa t agct t aaga cgaaaaaagg gcaat ct at a t gaaagt ct a gct gaaat gt at cact act c ct t t t ct t ct tttcagacaa gagt t gt cat aaacat act a t gaggacat t t gt ct ccct c gcacgct gt g t t t t ct ctt t aaaat t aaag t t cat gt t t g cact acat ct t gt cacaat g gt at caagct t at t agt gaa gaaccaat ca at ct gt gat t t gcat gt gat at ct at at aa aagaaat at g gct t t ct cag accaaagt aa cat ccagcaa aacaaacttt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 caagagagta at at ataagg aaatgt t ggc ttcttttttt tgt t gctaat cagacgaat g <210> 767 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 767 ttctcaaata aagttttttt t tataaaaat tagaaataaa aaat t aaatt ggagctttta cgggaaagac aggcgaatag ttttgtttga ttaaagagag aattaaaaaa acaaatcttt acttgccaac ctcaacaatt aaaaaaaaat tgagaaaaca ctttgacaaa aagaaaacaa ggaaaatt at ttatcct t gt gctggccaac ctcgtcctgg gtt at att gc ttaaaaaagg Page 63E cgt at t ggt g t at acgggt c gagt t t at t t caaaaaagaa ttct t t gtat aat at at at a gaaaaagct g ggtgacacgt ttgt t t t cgt agaaagattt at t t t t t aat tacat t t gt t 120 180 240 300 360 12689250 Sequence Listing.txt agtttaacaa atttgagatt aaaaagaaaa aacaactttg ctagtgggaa aagaaaagat ct t caact ag t t ct aaaaaa t ct gaaggt t at aaaggaga gt aat ct t gc cct t t t acaa aacaaccact at cgat t gaa gat aaaact a cact at ct t g aaat at t aac agagt t aaat <210> 768 t agct t aaga t cat acat ga aact t t t aac aaat aaaat t accacat at g t t gcact t t t cat gccgcac t t at gat cat t gat t t ggt t cgacaagt aa gagct t cact at acaat at a aat cat at t c gacat t t agt cat t gt aaat t t caaaaat a t gacgt ccat tttttgtccc t ct at ggct t aacagcccaa at aaat gaat t t gaccacat aaaaaaagac ct cacacaca t act t gcat a aat t t t t t t a aaagat gt aa aaaacaaaat aaacaat aat t t t ct t t t ca ttct t ct t cc gt t caat cgt gct t acggt a at agat gt gt at ggaacggt gct t cat gt a cat cat ccca aact aact t g aaaaggat gt aaaaaaat ac at acccacaa ct aact t t ct t t gct gaaat ct t cact cgt at ccact cac at t t t ct t aa at at gt act t ccat gcaaat agt t ccaaca tcat t t t t ca gt t ct ctt ag tagaagggaa t t aat at gga at gt t aagag agt t ct at ag aagt at agt t t cat t t t gaa t t t t gctct t at t gtgt t t t aat gat ggca cat at t t gaa ccat gt ggct t aat ct ccac gt at at at ag at at t t ct t t agcagt aat g 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 768 t caaat acac gcat ccaaag t gt gt acct a t aat cact t c t t t t t gt t t g ct aagat t aa t t aat t accc caaaaccat g gact t t aaat aagagccagc ggt cct acat at aggaagaa t cat t t aat a t gaaagat ag ggaccacact t gt t gagact cacgt t aat g at gt act gt g aat t ccaaag ttgaaagcag gt t at ct t gg cgt ggagct a at caagaaga gt t ggaat t t gt t agacccg ct gcccaaaa gct t at ccag ct aaaat t t t at gt ccat at ct aacat gat ggat gat agt gggat act ct at t cat ccgt t acccaaat t at gt act gt g t t t gaat gt t aggt aagat g gt cct t t gct taatgaggga t gt gcacat a t t gaaact ag gaaaaaat at at at t t gcag ct aat gct at t t gt ggct at gt aggggt t t t gcaaaat ac tctgtgagaa atggggacag ccgagt t cac t accct aaca t ct caaagaa agct ggt gac t t acggagct gaaaaaggct gt gat t gagg t ggt gggcct ct t gt gcaac agaaagaaac t t t gct caaa gt t cact gaa tggt t t t t ag at t gt ggt t a cgcat t aat a ct t agat t t t caaccaggga gt t t t cgt ct t ggt cat gt a aat t ct gt t t t t gat aagat aaagagat t c agagagatag gt gaaggt t g t t gt gat gct at caaaaggt t aggt gt t t g t t at t aacag gaggcaaaga aggt t t gt ct gct gt at t aa t ct gt gt gt t t gaat gat ac ggt t aggaga ct acccacgt t t ggct t at a gt t t t aggat at t cgact t g aaagtgaagg aggt t gaggg t ggt agct t t t agccacat t agagctctgt ggaaagcagt ggct t t cat g ct t gagcaca gt gt t gat ag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 aatttgtgga cggtaattcc cgtgacattt atggaggtat ctgtgatgtt tttcctatta 1020 Page 637 12689250 Sequence aaaagacaaa taaggagacc caaaaaaggt gagaacaact tttgtctgaa gacaaaggga gtggtcaatg accagtactc cttctactca ttttgcatcg ttgagatact atactgcctt Li st i ng. txt at acacacaa agaaat t t ct ccctatatgt tgggttagga gaaagacttg gtattggat g 1080 1140 1200 <210> <211> <212> <213> 769 1200 DNA Arabidopsis thal i ana <400> 769 acat gt at ga aaact ggacc cgt gaacgaa aaat gaaat t t t gaaagcgt gaaat at at a agt t aagt cg t at t at agat aat aat ccga taaacgacaa at t t t gagaa agaat ct t t g t cgt t ct gt c at t aaaaccc gaaat aaagc caat t t gt t a gt accaat ct caagcaaat c t at aaat t ac t acaccaat a at cat t at cc gat gaagt t g t acagaat gt cacat gaaat caat at aact cacgact caa cat aat t aca t gcgat t gt t acgt acaaag t ggcggct ct gct t t t ctta t t gat t ct gg at t t caat aa ccaat ggt t t acat gct t gt ttaacaaagg gaacaaaat c at gt aaaat a caat gact t t t at at acaca ttttgtaata ctctaaagaa tatatgttgt gattgatatc t gaat t agt t cggaat t gac t ct t t at t aa aaat t at gt g at ct gagt ac t gct aat agc gggagagtat gggagcatct t t cgt ggt at t aat aat t aa ttctct t t ct t ctat t t t gc t t at gagaat t aaaaat cca gt t gt ct t at aaaagt at aa agagggcct a cct t gaacca cagagt t t ct aat t acgt ag t act t ct cat t t agct agag t agt t acgt t agaat caat c at at at t aac at agat t ggt t cct cgt gat at at at agt t aat gaat t t c gat aggct cc aaact agt cc at t acaccga aat t ct ccaa t aat t t gat c agcat t aat t t caat aat t t gact caaaca tgacacacac agcgct gaac at at at at t t cct aat ct t t t aat agagt a aaaat t aagt gt gcgt aaga tctct t t t at ct t ct ccagc at at accgt c t t caaaat cg aat gt t cct c caagaacccg at at t t t t t t at t aat caac t cat at ggag t aaaat aat a ttcccaaaca aaccagat ct at t cagat aa aaaaaaat ca gaaaaat t aa caaact cgt a cat at t gagg t t ccat t at c t aaaccct aa t t gt cggt cg t t gt acggct at t ct aat gt aaaacgagat t act t t t ct c aaaaat aaat at t gaact t a t caagaat t g gt t agat t at t t gggt t t aa aaaat t t ct a cct t at t act t at caaaat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 770 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 770 aacatggcaa gctagaagtt gcgtacatta gtcaagttgt taaatagacc agattgttat tttctaagtg aatagctttc aacttcttcg ttacttgaat tcaaacaaat agaggagtct t at gaat aga caagccct ag t ct t at aaga t t t t caat ct Page 63E t cat t t gaaa aagt ct cgag gaat aaact a gcgt ct t tag ttagtaagca ct t ccaaaag caaaagtcgc at aaaatt cg 120 180 240 12689250 Sequence Listing.txt gaaggcttac tttgatcttg aagatgcttt gaatgtagat attgaatgtg aaattcttaa cgt cat caaa act cgggct t t at at gcaat gt t t t t t t ct t t t ggtcgca aagct t gt ag tagt t t t t ag aat t t agggt cat t gaaaga t t gaat t ct t act gat gaca ccaact ct aa gt caat gt t g t t cgt t t gca t aaagt ct t t <210> 771 aagcaccccg agagggagac cct aat act g ct t gcgggt t act at ct t ct aagct ct caa t gaaaat ct t gt aaaaaaaa at ct t aacaa gaggaccgtt t gagt agacc at t ccacaaa aaaat t caac aact cgt t at ct t t t at aaa t t t ct t aggt t t ccaat caa at t t ct t t aa caat cct aat t cat t t cgaa agaat t t gaa tacaaggaag t t gat t t act t t aat t t at a cgt cact aac ccaaaat cgt aacaat t aaa aagaat t t ga ggagt t ct t g aaaaaaaaac ttgtctattt agat agact t t gt t aaaaat gaat t t gat a t gt t ct aagc agct at agac t ggt at aat a gaacaatttt cat t t agat g aagt aaaaag agagat cacc at gt t t at t t gt acct t aaa aaaat act ct agaaaaaaaa t at ct accat gt at gt t tag gat t t t aact aggt aaaact gaat t t t ggg aact t gt gac aagaacaaat gcat aact t g at ct aat t cc at at t aaaaa aagat at gt g ct gaat t t t c aat ct aat ct ct gct cat t t acagt t t agg gcat gt cat g gagggaacaa aaaat aagt a t ggaagat ag aagct ct caa aaaaat t ct t t ggggt gat g gaaaagct ag agccgagtgt aaaat aaaaa at aaat cct t gcgggaggac t at t at ccat gat t t aaaac t t t agagat g 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <211> <212> <213> 1200 DNA Arabidopsis thal i ana <400> 771 at t ct agcaa at ct cgat t t cat agagt ca ttaggagcaa gagggaaaac cat at t at at acact t t aga gaaagcggac act t t t aaca t t t at gcaaa act t aggt ac ct t at t caca t aaaagat gt ct t t t cgt ct t t ct cgt t ga t gt agt t t at t agact t ggt t t t agat ggg t cact aaact gct aat gt gc t t gacaaggt at cat ct at g t at ct t t aca gaaat aat gg ttgtgct t t g t t gccat t ga ct t t t t cct c acacagct ct at aagt t at a t gt agcaat t gt ct gat gca act t t t at gc aaaacat caa cat t t gggat at t gt at agt at caaccat c t at gcat cca caacaat cat t t t t t cacat gt gt ct ct ga gat ct ccaaa accct t ggac caaagt caca gt ggt t gaat t t gt t aagt a t gaggaagt a aacgt agt ag ggt caaccaa aaggtgaaag gt t t aaggt t t gt t agggt t aat gt t t gt g t gt t gaaat t at ct aaaacc agat at t gca t gat at aaag aat ct t gct t t aagt at gt t gacggt t at a ggt ggt aact t t t cat at ga aat ct aact a cat t cagcat agt gacagaa gacggct t t g t aacat aat a ct t gat t gct agcccat t t c acaat ccct c agt aaaaaca ct t t t t t at g at agacaact agcaagcagc ct gt gaaaag aat caacccc gaagt aagca ct aagct t ga t ccat gt cag act gaaaaca caat agt agc t ctt gaaaca t t caaaaat c cccactt gt t agt t ggct at 120 180 240 300 360 420 480 540 600 660 720 780 840 Page 639 tt ctt acaaa taagagagaa t gt gat gaag aaaat ct caa caaaaagaac act gcccgaa taaaaagaaa cct agt aaac agggtcccaa tggt t t t t at agctgggagg aagaacact g 12689250 Sequence gaaaagaaaa gtggt t at t g cactgccgta gagaccaatg aat ggat aca ct t ct ct acc ttagttcgat aaagtattat at t t t gt t t a aaaggacact cagaaacact ttcctctgt t Li st i ng. txt gt ccat at t a actacgacca t at t t t t t aa t ccgaat gt a acactt ccca cacagagaaa gat aaat gag ccacgcggtt cat at cct ac aat agt ggaa gct t t ccct c gagagagat g 900 960 1020 1080 1140 1200 <210> <211> <212> <213> 772 1200 DNA Arabidopsis thal i ana <400> 772 at t aaagt ca t at aaaaaac cat t acaat g at t aaaacat t gat cct gaa ttgt t gt t gt acat gt at t a t at t t ct t t t at cct t gct t gaagcat gcc at agcat gat at t at aat gc t caagat t ga caat t t t t at gcat t gagt g t t t at aaat c t t at t at at g gt t at at t t a at aat gt aaa ccct caaat c act at t t caa aaat acaaca at aaat aaat aact aat aaa cct ccaagat t ct t gat t t c ct at cat caa agagct aat t ct agt t gat g gaaaagat gt cgt cat gt t a at aacat aat tgccaccgcc ct t at aat ct at cat at at g at t t t at gt t gagttttttt t t t gtgaaag t act t caaaa ttcat t t t ga at at at t cat at aat agt ac acat aaacac aaat t aaat t caccaaaat a t t t t t t gt at t at cgct t aa t t t t t gcttg cact t t ct t t t gaacgt gcg ct ggaacaaa aat t gaggat aacagt t gaa acagat aaaa ggagcatttt t t t t t t t at g t t t aaat ct t t t t t at cact aat t aaaat a gggt gt gccc at aaat t t aa act gagcat t acaaaat aaa cgt aat gcaa t t gt ccat aa gat t ct t gct acgt gt t agc at t t t gcat a caaaaaat ca t t t ggagt ct gaagt at t gc gaacat gat a acggggt t aa aat aaagaga ggaaat at t t tttttttttt gt t t ggcat a at aact gat t aat t t gt ggg cct t t aaaat at aaat gaaa tcaacacaaa t t aacact t g t at t aat at t acat t t gat g t gt t cagat c aat at t t t at cgcaat t gag agagagacat t acaat cgaa at gat at aat t caat gcact at gaacagac ct aat t t at c at at agagt a aat t t at at a at t gt t t ct a at t gt t aaaa t at accat t g gagggccatt at t at t t aaa t t aaaat t aa aaaaaat t ac caggt aaat t gaatcggagg gt at aaat t c t t t cct t ccc ccat ct cat a ct cat aacaa gt t t gct at c gacagct t gc gat t agagat aagat t t caa act t t gaaat at t t t gt aat t aat at t at a t aat at at gt at t gaggat t gagt aagt ga gt t ggagat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 773 <211> 1200 <212> DNA <213> Arabidopsis thaliana <400> 773 aacaacatta ttgatttttt ttcggtctac ctaagttttg acctttcttt tttaattgtg Page 640 12689250 Sequence Listing.txt aacaattgaa catgcgtatt acttattgat aatactttta tcttcaataa acacgaacat t ggacgct t t t at aggaat t ggagtgat t t aaaaggaaat gt aaaat t ca tgaagacaaa t t t t ct at t c cat aagaat g ttct t t cat t aat act t at a gat t at at t t cgt t t t gt at tcgaccaaag at t caaggat aat gaaat aa t cat t aaccc cacaaaaacc aat at agaaa gt cat t gcat t gcat gacga t t t t t gt gt t acacaaaacc cggat cagac aaaaaaaaaa tttttccgaa t ct ct ct ct c t at t ccat ca ct act ct t t a t t t ct at aaa taaaaaaaag t t t gt aaggt aaat at at ga gcgaat at ac caagaaagaa ggaaaaaaaa t agt gt gt t c agt t t ggaat aaaaat caat ttcctat t t a at at at aaat gtt aaaaaaa aggaat t gca agcagagaaa t at t t ct t t g at at t t t t gt t cgcgt at at acgt gcgaaa caact gt aaa aat aat t t gt gt t gt ggacc at at at acaa aaccaaagt g aaagacaagt caat act gga t t at t cct aa act agt t gca ct acat t gt g at at gt at at t gat cat gct at cgt agat t aatggagcgg t ct t t cact a t t t aaaat ga at agt at agt act cat t t ca t t t t at aaat act aat t ct c aaaacaaat c ccgact at at t gaagt ccga aaagaaagct gat t at t tag act t tct aag ttat t gt t gg t t t t cct gac at at at ct ga t aat at t t t c t gact t t at t t ggagagt ag t cgcccat ac aagat t t aca ct at gt t at t cat at acaaa ct gaaat gt a cat act gt gt ct ccat at ca at at at at at at ct ct ct ga t t gt t cagt t aaaaaat aac t aagt t cagt aacacgt t ct agcat cat cg ct t t t aaat a ct gt gct caa t cat t t t aga t t t at t t aat agaagtcggt at t at ct t gc cat t aacat g at gaat t t at caaat at at a acat t t gaag cat aat cacg at caat cat a at gct gct ct t t ct acaat t t act t caat g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 <210> 774 <211> 690 <212> DNA <213> Cryza sat i va <400> 774 atggcgaact caccgacgct ctctccgcag ccggcgtgtc aagaaccagg ccccgaacgg aacgccgccc gctcctactc gagatcgctg ccttcttcgc gagataaatg gggcaaacat ccgggcaaga agtactacgg ccggcgggaa agaacatcgg ccgacaatct ctttcaagac tcgcaggggt ttggcgccac aaccccggtg ccgtcaacgc gacaat gct g tgtggagagc ctgcgccggt gggct t cgcc gcat gt caca ggactat t gt acgcgggccg gttcgacggg ggcgctctgg catccgggcc aagggtaaac gtgt t cctag gtggtcaccg aagagcttct aatgaccgca cacgagact g gacaagagca ct gcagat ct ctaagggacc ttctggatga at caacggt g tactacaagg ctat t gggct aggcgttctt acacacggca ccaacgacga gacatatgt g acaaacagt g cat ggaact t cagacaaggt acaacgtgca cgctcgagt g act act gccg at ct ct cgt c caacgggat c at cgtt cct g ctccaagcgc ct acat caat gccgtgccag caactacggg ggcgcaggac ccaggtgat g caacgggaag ccagtt cggc 120 180 240 300 360 420 480 540 600 660 690 gtcagcccgg ggggcaacct ctactgttga Page 641 12689250 Sequence Listing.txt <210> 775 <211> 501 <212> DNA <213> Cr yza sat i va <400> 775 at gt cct cct ccacct t cag gcctccgccc t cgt at t ct c ggcgtggcgt ccatgggggt gtcatggccg gagt gct cgg atcaacccca aggccaagcc ctcgcctgcg gcctcgccgg ggagtcaggg ccaatgcgca tttgcagaag ctcttgctct ggccaat ct c gagcggatta <210> 776 <211> 1191 <212> DNA <213> Or-yza sat i va <400> 776 atggccgcac t t gat acct t aagct ct gcg accaagtctc agcaaggtcg ct t gt gagac accaccaagg ctaacgttga tttgtgtcag ctgatgtcgg cagcagtccc ct gacat t gc at t ggt gct g gtgaccaggg at gcccct ca gccat gt cct aat gggacct gcgcatggct aatgagagcg gtgccagggt gatgagacag tcaccaacga gt cat t cccg agcagtacct t t cgt cat t g gcggacctca act t at ggt g gctggggagc gt t gaccgca gt ggagcat a ct t gct cgcc gct gcat t gt gt at t cgt cg acacatacgg cggcgat gag ct gcat gggc gat gcgcccg t at ct acggc ct act t cct c t ct cgccgcc gcagccaaag gt at ggcct g accgcccctt gccgcct acg gaact ggt ga ct cat cat cg t t cgacggt t ggcat ggcca t t gt t cgt gg at t gt t ggt a t ct t cggct t gcaccgccaa t gaagt ccat ccgt cat cat acgcccacct t cggcat cgt gcat gat cct t cat cct ct c cct cggcgcc gagcggcgt g cgt ccccgt c ct ccaccggc ct cct ccggt cggcgacgcc cat cct t at c gt ct cgt gcg 120 180 240 300 360 420 480 501 cct ct t t acc t cggagt ct g t gaacgaggg ccaccct gac agat gct gt g ct gcaccaag ct at gagaag t ct cgat gct acagggt gt g acacat gt t t t gct accaag caggcct gac ccct gt ccgt t gagat t gct t gat gagaag t ggt gat gct t cacggt ggt cgt cgcaagg ccaagt at ca cact ggcagg ct t gat gcct acaaacat gg at t gt caggg gaccact gca cacggccact ggat at gcaa ct t ggcgct c gggaagaccc gt ccacaccg gct gacct ga acaat ct t cc ggt ct cact g ggt gcct t ct caagct gcca t acgccat cg at ccct gaca gcct cgccga t cat ggt ct t agacat gccg aggt gct t gt tcaccaagcg ct gat gagac gt ct t acgga aagt gact gt t cct cat ct c aggagcat gt at ct t aaccc gccggaagat ct ggcaagga agagcat t gt gt gt cccaga aggagat cct ggaccct gac t ggt gagat c t aacat cggt gaacat cgag ccct gaggag ccct gagt t g ggt t cgcaag t gagt accgc tacccagcat cat caagcct at ct ggt cgc cat cat t gac cccaaccaag t gct agt ggc gccact gt cc caagat t gt g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 aaggagaact tcgacttcag gcctggcatg atcatcatca accttgacct caagaaaggc Page 642 12689250 Sequence Listing.txt ggcaacggac gctacctcaa gacggcggct tacggtcact tcggaaggga cgacccagac t t cacct ggg aggt ggt gaa gcccct caag t gggagaagc ct t ct gcct a <210> 777 <211> 1467 <212> DNA <213> Or-yza sat i va <400> 777 atggtggtga gcgtggccgc t t ct t cgcct cccgctacgt atcccgaggg aggcggcgta cggctaaacc tgggcgtcct at gat agct c acct gt t caa acggtgggat cctcagaagc aacaaacgga aggaacaggg gt t caggt t t gctgggagaa ct cagt gaag gat act at gt act at at gcg t t gcggccat t t at t gaat a at ct cct aac cggtatgacc aggaaaactg tttatagcac cttttctata agcat caat g tcagtgggca t ggcgaagca aagaggat t t gaccagccga cgttcactct t at caact aa t acgcct ggg aacacagcaa t act aaggga gaggccggtg tgcccttggt gacatctccg agcacctgag aacgccgagc acgtcgccgt gccgagcggc tcgtctcgga caggt gct ga agat ct ccag accaagaaga gcgtcctgga aagaagaagc agaccggaat gaccgact cg ccgcgacccg ccagat cat c t cgt caccac t gcaccaat c aat t at gct t gaagccat gt at t t gccaga cat ggat cct ct t gggct ct agaaaagaat gt gggat gt g ccct gagct t caagt at ggc gcct gaagaa gaact t ct cc at t cgaggga aggcat agag ggcgt t ct cg gaggt t cggc cct ccgcgt c cat cgt caag cgccat cgcg gaccgagagg ct gct ag gacacggccc ct cccgcggt aacgacgagc gt ggat ggag aaggaggat g gcaggact gg gacaaaccca t at t t t gaag gt aaaggct g act ct cact g aaggaaact g ccaat t cat g gaat gggact ct t gt gt at c ct cat t t t cc aaaggt t cca tacaagaaca gcgact ggt c ct caaggaca t ggat cgt gc gt cat caggg at cct gcacg aagcagcaat gagat ct t cg agccggt gca t caggat gcc t gat gct gga cccgaaaccg aaacagct at cat t caagag acat t gt t ac t agaact gaa t t gaaat ggt gagagt t t ga ggt t gt ct ct t t gat gcagc t caggct acc caggt gt t gg at at aaact a gccagat aat t cat gcagaa gat t cgaaat gcggcaggt a cggcgt acac aggact t cag agct ggacgc cgggcgacga cgt act ggag gt act ccacc ggagcagt cg cgggaacccg t t gt gt gaat t ggagt t ggg gaagt ggcaa t ggt gct aat ggaggt t aag ggat gagaac ggat gt t aag ggaggt cgat aagt ggagga act ggt gaag t t gggt cat t t ct ggggaca cgcacagt ac t t gcat ggag cct ct ccaag caccgt gt t c cat gccggcc ccggagcct c ccat t cggcc t ggcgt ggt c ggaccaggt g 1140 1191 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1467 <210> <211> <212> <213> 778 510 DNA Qr yza sat i va Page 643 <400> 778 at ggccggcg gccgcggcgg gt cggggt gg gt cgt cat gg gggat caacc ggcct cgckt gccggcgt ca at cct t at ct t ct cgt gct g cgt cgacct t cgct cgt ct t cgt ccat ggg ccggggt gct ccaaggccaa gcggcct cgc gggcaaat gc ttgcggaagc gt caat cccg 12689250 Sequence cagtggcgac gagacggcgc ctcctgcatg ggcgccgcct cgtcatgcgc ccggagctcg cgggat ct ac ggcct cat ca gccct act t c ct ct t cgacg cggcctcgcc gccgggatgg t caacagcca aaact gt t t g gcttgcactc tatggtctca agcggact aa <210> 779 <211> 1545 <212> DNA <213> Or-yza sat i va <400> 779 at ggcct cca t ct ccgccgc aacgagctcg ccgccgccgc gggaggaggg cgcgttcggg gccccgtcct ccactggctc agggcgtcgg agcgcgccag ctgatgacgc acgacgtctg gaggacgcca aggtttggga accagcgatg agcgagcaaa aacat caagt act t ct at ga aagggtgttt gccacgttgc cttggtactg attctcatac ggaaacactg atgctggttt actatcaggt ttgtattaga t t acaaat t a t t ggt gagat ggat caact g t ggaaagt ct gaagctggtg gcaagaatgg ggcaagacat cagt t gaat a gactaccggt t t gat gt at c aaccgtgctc tagcaagaga t gt act ggag gt aaaact ga aagaaggt t a aagt t cccac ct cccccgt c cgt ggcgccg gagggt ccgc ggt gaagagc ct t ggagccc cggccct ggc ccgagagaag ccgaaat gt t cat caaggac t ct t gct caa at gcaat gct t gt gat gggc t ggagaaat g t t ct gt at ct aaat at ggaa t gt t gt gcct cgagcct gt c caaat t ggag gt gcaaagat ggact t ct t t t t t t ct cgt t gct gggaagg tcgcagcagc gccgt cgcrTa gccat gacga ggggagaacg accat cggga gt ggt cat ca gat at cct ca ct gagcaat t gagggt cat t ggagcct t t g act gggaagg ccacct t at t ggcgcaacat gagcgaat ga gccgat caaa t at agt gat g ccagt t at t g gt caagat t g gct gct gcga ccggcgacac Li st i ng. t xt cct t ct t cgg acggcacggc t cat gaaat c t cgccgt cat gct acgcgca ccat cggcat t gggt at gat t t gt t ggt at ccgccgcctt agct gcagcg agcccgcccg tgacagagaa t gt gggt gga t ct t caagcg tcccggacca gggat t t ct g tcaaagcaaa gcagaccagg gccaat t t gc ct ct t ct t aa t act t gcaaa acaaat ccat cact gt gcaa ct acat t t aa ct caggccag ccaagccaca accgggt ct a aggt gt t ct t aaaaggt gt g ct t cct gggc caagagcggc cat cgt gccc cat cagcacc cct ct cct cc cgt cggcgac cct cat cct t cat ct t gt ca cgcccacaag gagggt gagt cgcccccggg gat cct ggcg cgt cgacgt g ggagt t cggg ct acat ct t c t at ggagcag t ccagact ac cgaggt t ct c aact ggaat t ggt gcct cca ggat ct gat t ggagt t t gt t cat ggt t at t ct at ct t gag at t t gt t agt t t cacct gac t at cggt t ct agct t cgggg gat ggacat t 120 180 240 300 360 420 480 510 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 Page 644 t at agcat cc ggt t gt gat a t at gct cgca aggat ggggc gccct caccg ct gt accagg caccagct ag t gaat gaacc acaaggaagg gat acgt cac 12689250 Sequence agccggtggc aaaacttgct ccctagttgt ggtgcttgtt t at ggt gt gt gtgtcgacga gcagat ct ac ct ggct t cgc tgaccccagg gact t cct ga Li st i ng. t xt cacagat at t t gaggaggct t gggt ggt cc t cgt gat aca cgaacaggaa t t t ccccggc cgttcaccgc ggcggct t ca t gt aa 1320 1380 1440 1500 1545 <210> 780 <211> 519 <212> DNA <213> Or-yza sat i va <400> 780 at gt cgaaca cgagggt gt t gtgatggagc tgtacgcgaa accggcgaga agggcgtggg caccgcgt ga t cccggagt t ggaggggagt cgatctacgg agccccggca t cct gt ccat atctgcaccg t gccct gcag gagggcatgg acgtcgtcaa aagccggtcg tcatcgccga <210> 781 <211> 747 <212> DNA <213> Or-yza sat i va <400> 781 at gt cgggca acat cgcct t gcct acgt cg ccgagt t cat at cgcct aca ccaagt t gac gcggtgtgcc acgggttcgg ggccacgtca acccggccgt accggcgtct t ct act ggat cagttctgca ccggcgtggc ggcgt ggt ga t ggagat cat gccgacccca agaaggggtc ggcgccaaca t cct cgt cgc ttcggccccg ccgtcgccag ctcgtcggcg gcggcctcgc gcccccgttg ccagcagcga ct t cgacat g ggacgt gccg caagagcggc cat gt gccag cgagaagt t c ggcgaacgcc ct ggct ggac ggccat cgag ct gcggccag cggccgct t c ct ccaccct c cggcggcgcg gct gt t cgt g cacct t cggc cgcccagct c gacaccgacg cgt cacct t c gct cggcacc cggcccct t c cggcgact ac cggcct cgt c gt t ct aa accgt cggcg cggacggcgg aagccgct gc ggcggcgact gccgacgagg gggcccaaca gggaagcacg aaggt gggat ct ct cct ag gat gact cct gt ct t cgt ct ccgct t gacc gcggt ggcca ct cgccct cg ct cggcgcca cacgggct gt gggct ggt gt at cgcgccca t ccggcggct accaacat ct t accggt acg gagct ccggc agaact t ccg act acaaggg tcacccgcgg t gt t caagt t ct aacgggt c t cgt gt t cgg cccgcggcgg tcagcgcggc t cgccggcgt cggccgggct tcggcgccaa gcggccagat t cgt cggcgc ccggcgt ggg acaccgt gt a t cgccat cgg ccat gaaccc ggat ct act g t ct acat gt g ggggcggat c cgcgct ct gc gagcacct t c caacggcacg caagcacgac ccagt t ct t c ccgcgt cgt c gagcaccgcc ct ccct caag cggct ccgcc ggt cgccgt g cat ct ccggc caccat cct c cgt cct cgt c cgcgt t cgag cgccaccgcc ct t cat cgt c ggcgcgct cc ggt cggcccc cggcgaccac 120 180 240 300 360 420 480 519 120 180 240 300 360 420 480 540 600 660 720 747 Page 645 12689250 Sequence Listing.txt <210> 782 <211> 1455 <212> DNA <213> Or-yza sat i va <400> 782 atggcgctgt cgacggcgca t acgt gcgca cggcgct gcc gcgt accaga t cat caacga t cct t cgt ca ccacgt ggat aagaactacg tcgacatgga aacat gat cg cgcatctgtt ggcacggtgg ggtcgtcgga cagaacagga t gaaggccga aacgtgcagg t gt gct ggga aagctgaccc aagggtacta aacaccatct gcgtcgccgc aagat gct ca acgacctcct catgtggacg cggcgagcgg gact t ccggc t gccgct ggt tacgccggcg tcgggtgggt t t ccacat ca act acct cgg t cgaaccaga t aat t gcgca gacatcatgc agaactgccg ggccact t cg acgt ggt gt c gact cgt cgc ggt acacggt gtgccggcgt acaccatgcc cgcgaggact tcagccgcgg gccgatatgg acgcccacgc gacaggggag tcgatgcact cctcgacgtt cgcgtcgcgg gaggt t cagg cgagct gat g ggagcccgag t gagt acccc caacgcgccg ggccat cat g ggggaagccc gaagt t cgcg cgt gat gaac cat cct cggc caccgccaag cgggt t cat c gaagagcat c cat ct ggcgc cgccgaccag gt at t accag ggacaacgcg caaggact cc gt t cgaggt g cgccgacgct cct cgccgag cgt caagaag at gccggaga ct cgacggca tgcgacaagc gt caccaccg at cggggacg ct ggcggggc cacgacaagc cgct act t cg ccggagaagg t ccaccct ca aacgccgaga gcgccgt t ca aacgt cagcg aacaaggagg ccaacct t ca ct cat t cgt c acggt gct cc ggcgt gccgc gccgagagcc gagcacgt cg cgcct cat ca gccgccgccg agt cgat ccc acccgcgcct t cat gat ggc agct ccagaa acgagacggc t ggcgt t caa ccaacat cgt aggt ggagct ccgt ggagat acggcgagtt cagggt ggaa t ct acccgga gccacaagt a acct ccccga cgct caact t t cggat t cga gggaggggat t ggt ggcct t t ccgccgct t ccgt gat gcg ccgacct cac agccggccaa caaggacgcg gaacct ggcg cgccat caac ccggt gcgt g ggt cggggt g gaggaagt gg gacgggggcc caaggaggt g ggt cgacgag cgaggacgt c cacgccgat c gct ggagt gg cgggct cgt c t gagct cat c ct ccaaagga ggggt acaag cgagaagacg ct ccct caag cggct ggat c cgt cgt cat c caagacggt g gaagaccgt g 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1455 120 cgggagatag agaaggaggt gaccacctac tggcggagtt tcgtcgccag gaagaagagc agcct cgt ct gct ga <210> 783 <211> 1302 <212> DNA <213> Or-yza sat i va <400> 783 atggttctga cgcacgtcga ggcggtggag gagggcagcg aggcggcggc cgccgtgttc gcgtcgaggt acgtgcagga cccggtgccg aggtacgagc tcggcgagag gtcgatatcc Page 646 12689250 Sequence Listing.txt aaggacgccg cgtaccagat cgtccacgac gagctcctcc tggacagcag aacct ggcgt gccat caaca agcaat caaa gt t gt ggt t g gacggcgaga gaggt ggagc gccgt ggaca accggcgagt acgggt t ggg at ct acccgg ggccacaagt aact gcat gg at cat ct cca gcgcaggcct acgat gccgg ggccggccgc ct cgccgcca accgccggcg agcgt cccgc cct t cgt cac agaact acgc caaaccaaac acaaccggt g aggcggt cgg t caaggaggt t ggt cgacga t cgaggacgt acacgccgat agct ggagt g acgggct cgt agagcscaag aggaggaggg t caggct gt c cggcgct gga tcgccgagcg gggcccccgt aggaggcct c t cgt cgccgg cacct ggat g cgacat ggac cat at accct cgt gaacat c ggt gggcacg gaagct gacc gaacaccat c caggcgcct c ccacgt cgac ggact t ccgg ct acgccggc gacgct ccgg cgt gccgct g gt cgggcct g gcacat gacg gt t cct gt cc gcccagggt g gat cagggt g caaaaccaag gagcccgagt gagt accccg cgt ct cct cc at agcgaggc gt gt gct ggg gaaggct gct t gcgt cgccg aacgacct cc gcggcgagcg ct gccgct gg gt cgggt ggg gagggcct gg gt ggcct t ca cgccggt acg gt gct ccgcg cacgt cagga cagct cacca gt caagagcg ggcgt t t gct gcgacaggct tcaccaccga at at cct t t c t gt t caat gc agaagt t cgc acgt gat gga ccat cct cgg tcgccgccaa gcgggt t cat t gaagagcat ggt acaagag agaagacggg cgt t caagga ggt ggat cgt t cgt cgt ccg t ggccct gga t cgagct cgg aggccgt gcc ag cccgcgcct g cat cct cgag gct ccaggca gt t gcat t gc gccggt gggc gcgct act t c ccccgt caag ct ccaccct c gaacaagcgg cgcgccgt t c caacgt cagc cgt gat gaag gcggt t cacc cggcgccggc gccggcgt ac ggaagact t c cgagat ggac ccccgcccgg cgt gcgcaag 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1302 <210> 784 <211> 1185 <212> DNA <213> Or-yza sat i va <400> 784 atggcggcgg agacgttcct ctgtgcgacc aggt gt cgga aaggtggcgt gcgagacgtg accaaggcca ccgtcgacta gtgtccgacg acgtcggcct cagtcgcccg acatcgcgca ggcgccggcg accagggcca cccctcagcc acgtcctcgc ggcacctgcg cctggctcag gacgccggcg ccatggtccc ct t cacct cc cgcggt gct c caccaagacc cgagaagat c cgacgccgac gggggt gcac cat gt t cggc caccaagct c gcccgacggc cgt ccgcgt c gagt ccgt ga gacgcgt gcc aacat ggt ga gt ccgcgaca cgct gcaagg ggccact t ca tacgccaccg ggcgcccgcc aagacccagg cacaccgt cc acgagggt ca tcgcccagga t ggt gt t cgg cct gccgcgg t gct cgt caa ccaagcgccc acgagacccc t caccgaggt t caccgt t ga t cat ct ccac cccggacaag ccccgacagc cgagat cacc cat cggct t c cat cgagcag cgaggagat c cgagct gat g ccgcaagaac gt acct caac ccagcacgac 120 180 240 300 360 420 480 540 600 660 gagaccgtca ccaacgacga gatcgccgcc gacctcaagg agcacgtcat caagccggtc Page 647 at ccccgaca gt cat cggcg t acggcggat gaccggagcg gcccgccgct t t cgt cgact gagaact t cg aaccggt t ca t gggaggt t g agt acct cga ggccccacgg ggggcgcgca gcgcct acat gcat cgt gca cct acggcac at t t caggcc tcaagaccgc t caagccgct 12689250 Sequence cgagaagacc atcttccacc cgacgccggc ctcaccggcc cggcggcggc gccttctccg cgccaggcag gccgccaaga ggtgtcgtac gccatcggcg cggcaagatc cccgacaagg cgggatgatg accatcaacc ggcgtacggc cat t t cggcc caagt at gag aaggcatctt <210> 785 <211> 660 <212> DNA <213> Or-yza sat i va <400> 785 atggggagga gacctgcaag aggt act gcc gt ggt gt ccc aagggt gt gg at gagt t ct c accagtgagg ct ct t gaggc ggaaaggatg ccttccacct aagatgcttt cgtgtgccgg aagccacagg gaacctgtgc aagcccaaca atgctgtcca ggt cgccaaa agat cat t ga tacgtcaggc ttaagagcga ggt t gccat g gt cgcct ct c <210> 786 <211> 543 <212> DNA <213> Or-yza sat i va <400> 786 at gt ct t cca cgt t caacgg ctcgccctcg tcttctcatg gt ggcgcaca t gggggt gat atggccggag t gct ggggat aacccgacgg cgatgccgta gccacggggc tctgcgccct cktcagggcc aacgcgcagc gt gct at cgc tgaccccaag ccact gt gt g t gcgcgt at c cagggt t cgg ggcagat agg t agggt gkat tgccagcgaa gagt agaaag gggt agaat c t gct cgt gcc cgacgagt t c cat gggggcg gcggccggag ct acggcct c ct accact t c cgccgccggc agcccaagct cagat caaga at caggat ct cat ct cgt ct gcgt gcaaca gt t cacccgt ct ccagact g at t ggccagg gccct ccgt c t ggggt t t ca at gcct gat g cccgggaagg gccccgt t ct gcgt acggga ct ggt gat ga at cat cgct g gacggct ccg ct cgccat cg gt t cgt cggc Page 64 Li st i ng. t xt t caacccct c gcaagat cat gcaaggaccc gcat cgt cgc tccccgagcc agat cct caa t cgacct caa gcgaggat cc cct ga acaagccgt a acgat gt cgg ct t gggagaa agt acat gac t ccat gt gct gaat gagggg t cct cct t t c gt gccaagt t ccaagt t cag gt gt caat gc cct t cct ct c t gggct t cat cggct cggag agt cgat cgt t cat cat cac t ccacct cgc gcgt cgt cgg at gat cct ca gggccgct t c cat cgacacc gaccaaggt c cagcggcct c gct ct ccgt g gat cgt caag gaggggcggc cgact t caca ccccaagt ca gat gaagaag ggagaat gt c caagt ct gca ccgt at caac t gct t t t ggg t gt gcggt gc caagt t ccct ccgcgat gaa t aagct act t agccgct t ag cggcgccgcc cggcgt cggg gccggt ggt g caccggcat c cgcggggct c cgacgcccgg t cct cat ct t 720 780 840 900 960 1020 1080 1140 1185 120 180 240 300 360 420 480 540 600 660 120 180 240 300 360 420 12689250 Sequence Listing.txt cgccgaggcg ct t ggcct ct acggcct cat cgt cggcat c at cct ct cct cccgcgccgg ccagt cgcgc gcccact aat ccat t acat c gcat ct ct aa t t t t at t t at t t t t acagt a taa <210> 787 <211> 495 <212> DNA <213> Or-yza sat i va <400> 787 atggcggcgg agacgttcct ctgtgcgacc aggt gt ccga aaggtggcgt gcgagacctg accaaggcca gcgtcgacta gtctccgacg acat cggt ct cagtcgcccg acattgccca ggcgccggcg accagggcca cccctcagcc acgtcctcgc ggcacctgcg cctag <210> 788 <211> 1743 <212> DNA <213> Or-yza sat i va <400> 788 atggccgacg accactactc t accagggt a caagcaaaac aagtcgggag aaactataaa agagaccttg at gct ct acc gat cagat aa gcagagct ga t cat ct ggt a at ct ct ct ag caaat gaaaa t agct aacaa aaatccatgc aagccaaatc ggtgatcctg caaccgaaag gcaaagcaat t ggt gat t ga ggct at t ct c agcagggcta ggggcaccga tgcaacagcc cccccacaat atggcgctcc t at cagact g ggt gggat ca ct t cacct cc cgccgt cct c caccaagaca cgagaagat c cgacgccgac gggcgt gcac cat gt t cggc caccaagct c ct ccaagcgc aat t gaaat c gaat ct ccaa t ggct cacag gcaat t gat c ccgaaagt ac t aaggt t ggt t ggt gct cgt aacagt gt at ggt t accagt t cgccct cct t ggt t at ggt t cagcaacct at cat caaat gagt ccagga gacgt gt gcc aacat ggt ca gt ccgcgaca cgct gcaagg ggccact t ca tacgccaccg ggcgcccgcc aacacacct c ccaaat ggaa ct t cagt cag acaagacct g aat gaggt cc aat gct cct c ct ggt t at t g at acaggt t g at t gat ggt a gagaat cgt g cgt cct cagt t acat gcagc t at ggt agct cagcaat ct c acgagggt ca tcgcccagga t ggt gt t cgg cct gccgcaa t gct cgt caa ccaagcaccc acgagacccc t aaccgaggt agt act cct c gggt t ggt gt gagct aagat t t gaact t t c tggcagaggc aaccaggcgc gaaagggt gg t ccct t t gca cacaagagca ct agaaaccc caaat t gggg ct ggagcct a accct ccagc agcaggcccc ccccgacaag ccccgagggc cgagat cacc cat cggct t c cat cgagcag cgaggagat c t gagct gat g ccgcaagaac at at ggt gga t at cat t gga ccaagt gaca aggcact cct t gat gct gct t gat caat t c t gagact at a t t t accccct aat t gaaact aat gt caggt t ccgcat ggg t cct ggggca at ct gggggt ccct ggcacg 480 540 543 120 180 240 300 360 420 480 495 120 180 240 300 360 420 480 540 600 660 720 780 840 900 ggct at gat t at t acaat ca acagcagcaa cct caacagc aacaat ct gc ccct gggact Page 649 12689250 Sequence Listing.txt gct gcacct g gt gat gct ac t agct at aat t ccagccagc ct cct gcat a t gct t cccaa gggt at gat t agt t at t at c cagggct at g ccaagct at g cct gct gct g t acccggt gc t at ggt accc cagggacaga gct cct ggt g ccacct t acc ggt gat cct t act gct gct g acaaccgct g taa cgt cct acgc agacccaagg gaact t ct gg gt gct caagg gaagccaccc aagggt ct gc agccccaacc aggct cct cc ggt at ggcca ct ggt gcacc at ggcagt gg cct cccagga ct ccggcacc t cagcagagt gcagcagcag ct at gggt cc t gt agct ggt t ggct at t ca t cct cagt ct acagggaggg t aacact t ct gt at ggt t ac t gct gcaagc aagct acggg ccaat ct gct t act gct cct ggt gggcagc ggct act ct C gccgct aat t caagcat ct c agccaaccac ggat at gggg t at agt cagg cct t acggac t ct cagaacc cacccaggct cagcct gccg t ccgcccct g gagaacagt g agcaggcat a agcagact gg caact cagga ct gggcagca ct act agcgc cgccaccacc gcagct at gg aggcgccgcc agcaaggct a at ggt cagca cat at t ct ac cgact ggt gc gagcccaaag t gat t at t ct at at gat cag t gggt ccgca aact t caact t gct t caagc acagact ggc cgcacct ccg t cct ggat ct tggcgcacct gcagt cct at t gaagct aca t gcgcct gcg ccct gct agt 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1743 <210> 789 <211> 927 <212> DNA <213> Or-yza sat i va <400> 789 atgtcgcagc ct gct gagct caggccgaga ggtatgagga tctgaggagc t cact gt t ga ggtgctcgcc gt gcgt cat g ggt aat gagg at cgt t gcac t ccaagat ct gt gat ggcat gct ccagagt ccaaggtctt gagt t t aaga ct ggagct ga gccgctcagg at at t gccct gccct caact t ct cggt gt t cttgcaaagc aggct t t cga tacaaggaca gcact t t gat gat at ct cgg aggcgctccc ct t ggct ggt ggat t ct cga at gt cact t a t t ggat cgat t t cccgt gag gat ggt t gag ggagcgcaac gcgcat cat a gct cat caag cct caagct t ct acct caag gaggaaggat ggcagagt t g t t at t acgag t gaggct at c cat gcagct t cat t t at gt t gct gcact t t t gt t t t cggt gagaat gt gt t t cat ggaga ct t ct at cag t cat ccat t g gaat acaggg ct t gact ccc at gaaaggcg gct gct gaga cccccaact c at cct caact t cagagct gg ct gcgt gat a gt gct aggaa ct t t gggggg at gt cagt t g acat ggct aa aggt t gct aa t t gct t acaa aacagaagga gaaagat t ga acct t gt gcc act act acag acaccat ggt at cct at cag ct cct gaccg acact ct gag acct gacgct acct t gcagc agcct caggg t gagcct t at gct t gcagag gacagt t gac gaat gt t at t agagagccgt aact gagct c t t cat ccact gt acct cgca ggcat acaaa act t gggct g t gct t gcaat t gaggaat cc gt ggact t cc t gact cgct g t ggaacct t c ct ccagagt c 120 180 240 300 360 420 480 540 600 660 720 780 840 900 Page 650 12689250 Sequence Listing.txt cacagt cct g ccaagt gccc t gct t ga <210> 790 <211> 798 <212> DNA <213> Or-yza sat i va <400> 790 atgtcgccgg cggagccgac gcggagcggt acgaggagat gcctccggcg gggaggagct aacgtcatcg gcgcccgccg gagggccgcg ggaacgacgc gccgagctcg cccgcatctg tccgccggcg ccgccgagtc taccttgcgg agtttaagtc gcatacaagg ctgctcagga ct t gggct t g cact caact t gcctgcaacc tcgcgaagca gaagaatcct acaaggacag t ggact t cag at gccaat ga gagcctgggg atcagtga <210> 791 <211> 657 <212> DNA <213> Or-yza sat i va <400> 791 atgtcgtttc tgctggattg gccaagatcc t ct t cct cgg t cacaagaga at ct ggcggt at cgggcgga t caggt t caa tggagggact actacgccca gcggtaagcc cggtaaccgc gacgctgccg acaggtgccg gacgacgcgc tcgctggcgt gcggtgcccg agcaggagct ggcaat gt ca acct t gcagg gt ccgccgga t gggct acgg gagggaggag ggt ggagt ac cacggt ggag cgcgt cgt gg ccacgccgcc cgacggcat c caaggt ct t c tggcgacgag cat t gct ct C t t cagt gt t c ggcgt t t gat cact t t gat c t gat ggt ggt gt t ct acgac cct cgacaac gcaccagccg ggcgt t cgac gt t aaggagg ggt aacccgc gt t cgccgag gccgt t cct c gt gct act ac aaccggagt g cgat ggct t c agcgt gt aca at ggagcgct gagcggaacc cggat cat ct accat ccgct ct ggccct gc t acct caaga aggaagcagg gcagat t t gg t act at gaga gaggccat at at gcagct cc gacgaaat ca gt gct ggcgt gccggcaaaa acgcagcacc ct cggcggcc t t accgcggt ggt aaggt t g t cgaagat gg gt cct cggga ct cgggct ca cggccggt cg aggt ggat gt aggcgaagct tggcgcgcgc t gct gt ccgt cgt cgat cga cct acagggg t cgact ccca tgaagggcga ct gcggagag ct ccgaccca t ct t gaact c cagaact gga t gcgt gacaa aggaagccgc cgat cgggct ccaccct ct t cgacgt cgga accggat cgc at t t at at t a at gcagt agt agct ggacgc acaagat aga ccggcct cac aggt ct t cat cacagt acat ggcggagcag ggcggggggc ggcgt acaag gcagaaggag caagat cgag cct cgt cccc ct accacagg caccat gaat ccccat aagg ccct gaccgt cagcct t ggt ct t gact ct g agct ccaaaa gt ggcaggag ct acat gct c ggagct gagc t cgccgcgt c ccgt accccc t t acgt ggt g cct cct gt ct cat cccgt ac caccggcaag gt gcagcgt c caagt ag 927 120 180 240 300 360 420 480 540 600 660 720 780 798 120 180 240 300 360 420 480 540 600 660 <210> 792 Page 651 12689250 Sequence Listing.txt <211> 843 <212> DNA <213> Or-yza sat i va <400> 792 at gct ggcgt t cct ggct ct gcgcagaact gcggctgcca ggcaaggact actgcggaga ggaggaggag gaggcggagg agcgt ggt ca ccgaggcgtt ggcaagaact tttacacacg gccagggacc gcaccaacga act cat gaga ccggacat at tgcgacaaga acaacaagca ccgct gcaga t ct cgt ggaa gggctgaggg acccggacag t ggt t ct gga t gaacaacgt gccatcaacg gtgcgctcga aact act aca aagact act g tga <210> 793 <211> 582 <212> DNA <213> Or-yza sat i va <400> 793 at gt t cct gt gggactggtt gccaagatcc t ct t cct cgg aaggacgaga ggt t ggt gca at cgggaaga t caagt t caa tggaaggact actacgccaa gaacgt t t t g ct gagt caaa gctgtcccgt ttcttatcct gagct gcggt at cat ct agg ggt gaat cca at gt ccggcc t at ggt gacg ggt t caagt g t gggct agcg gt cgaacat g t gggt gccgc t ggt ggaggc ct t caat ggg acagt cgt t t t gact ccaag gt gct acat c gt ggccgt gc ct acaact ac ggt ggcgcag gcaccaggt g gt gcaacggc ccgccaat t c ct acggcgt g cct cgacaac gcaccagccg ggcct t cgat ggt t gat gct aaaggagctt tggcaacaag cct t agcaac cct cgaagt g ggt ct cccag ct cct cct ct t gct gcagca t ct ggcccgt ggaggcggag at caagaacc ct t aacgct g cgt gagat cg aacgagat aa cagccgggga gggcct gcgg gacccgacga at gt t gcagg aagaaccccg ggcgt t gacc ct cgcgt cgc gccggcaaga acgcagcacc ct cggcggcc gt ggt at at t gat gcact cc at t gacat t c t t cacaact g t t cat gt gca t acat caagt ccgccaccgg aat gggggt a gct acggcgg gcagcggcgt aggccccgaa cccact cct a ct gcct t ct t acggggcgag agaagt act a ggcagaacat t ct cct t caa ggt t cggcgc gcgccgt caa cgggt ggcaa t ggggct gt g ccaccct cct cgacgt cgga accagat cgc t ggt agat gc t at ct gat ga cat at gct gc ggaagggcaa gt gt t gt ccg ag ccaggcgagc ct gcggcacg cggcggcggt gt ct gt agag cggt t gcgcc ct cgggct t c t gcccacgt c cat ggact ac cgggcgcggg cgggt t cgac gacggcgct c caccat ccgg cgcaagggt a cct t t act gt gcagaaggag ccacat gct c ggagct cagc ccgccgcgt c t t at gat aag t t cat t ggca t t ct gaagag ggt cagcct a caagat gggc 120 180 240 300 360 420 480 540 600 660 720 780 840 843 120 180 240 300 360 420 480 540 582 <210> <211> <212> <213> 794 1866 DNA Qr yza sat i va Page 652 12689250 Sequence Listing.txt <400> 794 at ggcct ccg tt cct cgccc gt t at gct ct t acagcct t c gaggt t ct cg aagact gat c t t gat cct ga t ct t cat t gg act cacaaga caagaaagt c tgggaaccaa tttgaacacc act gggct ag cgcgacgat g ct t gt ccat g ggt gt t ggca at gct gt at a t cat t cat ct at caaggcaa gt gct t aaca gt ggt t gt gt gt t ggt agaa cct at ccct g ct t cct t t t g aaggt gt act at ct gt gt aa t ggact t cat t at t accat g act ct t at gt at gct t aaac gct gt cgggt gact aa ctgccgccca cgcggccacc gccaccgccg ccgccgtcct cctcgtcgtc ccctt gccgc gggtgaacaa cgt t t t gcca gtggaaatga atggt t t gt c caaggccaaa ttttgttgga acat cgt cat caaagct t at ccaat gt aac agat ccat t g t gt caat gat at gat ctt ga gggatgtttt cacagtt gt c ttgggcgagg ct ggat at gt t gat t at gac caat cgct at t cat cct gt g actggagtgg agaagaaat g gt agcat ct t at gt at at gg caatcgt t gg t ctt ct ct gc tgaagaccaa tctgcctagg ccaagaattt at ctt ggat c cgcctccgac ggttggcccg t ccgt ccaat gct gat t gac tcaggggat g cagtt at ct g gaggcagaac cagat acaat tgatgcgggt at t t gct cac gt t ct caat c tttgatgcgc aact ct ggaa ccggcctcct t gct ctt at t agct at t gt c cagtggtgca agcat cact a att ct at cga ggcct t cat a tgccccaaat gt acct cacg cat t gagat g gt t cat gt t g t acat at t t c t gct t ct act gatgtcagga tttaggaaca gatggt t t ca gacactgttt tccgaccaca tacaacaacc aaccccgtgc agccagatcg ttgacaaggg atgcgat t ga agaaatagcg ggcaatcaga aaggcattgg cgct t t gat g t t caat t ct t act ct aagaa agagat gt ca cgcagt t t gg ctgctagtga acaacat t ca ct t t att cac t t t ccgt t t a t catt agct g t ct t t ccct c aat ccat gca cct t ct gt ca t act t t gt ct ct agt ct t t t ct gct gaat g gccgt ct at g ttctttcaga ctctgcggt a act gat accg gtgagaagga agt accaat c cacaggaaac ataaatgggg acat aaagt t t accat t t gt gagtt cat at at aacaaat a t t at t cat gt at at gacat a t at acct t ga t cat gat ggt at gact at gc gt gaagaat c ctct t ct t t c ttttgttggc ttgt t t gtta ggcatggggg tgtgct t t gg ccat accat t ttgctct t t t gagtaaagac t t gccct cat t cacat cat t t gat cct cat cggagaact a t t t acct ct a caagt t t ct a agct ctt ct a gt t cgaat t t t ctacaggaa ggaggagaag t t acaact ac agggct t ggg tggaatgttt t caat t gaac tggt t t gaat t t t cct t t t c t aat ctt act ttctgtcaag ct accct t t c tatct t t ct c aaagt at gcc tggatggaag agccct t gt t aat cat cgga t gccct t act gaaaaactgg aat t ggcct a tggtact at g gggaactgtt t att cct cgc gggaggact g t t ggaact ac aatt gt cacc ccactggcag ct ccgt at at ct t cggct ac caccggtagc tggtccaggt cat aaagt gt 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1866 <210> 795 <211> 1530 Page 653 12689250 Sequence Listing.txt <212> DNA <213> Cr-yza sat i va <400> 795 at gggaaccc cccgct t cgg aacagccgcg ggcgccaact at gaacgcgc at ct t cgaca gcgt gct t ct aaaccagct g gt t acagt ag ggt ct t act t t ccgt t ggag cct gt t t t ct tttggaggaa aagggat at t t gt gct agcg gccat aat ca aagacagt t g ccat caaaaa gct ggaat t a cccat gt gca aagact cagg at gggagaat at t ggaggca gct gct gccc ct ct ggat cc at gagggt t g gcagcgt ggc cggcggaggg tcgccgcgcg ccct t ggcgg agt act t cgg ct ggcagct c t ccact ct cg ccat t cagt a gt gat ct ggt t cat ggt t gc at gcagt acc cct t ct ggt t t ggat cct ag ggcagt t t ga gt t gt t cagc ct gaaat caa t t t ct caat a t ct gct ct ca agagcgt agt at gcct gt ga at ct cat ct t cat ct gt gga aaaagt t t gc agt gcat cag t gggt gacgt gct t cgcgaa ct t ggt gct c t t t ggt gcgg act ct ccggc cggcgggggt ggagat t ggc caacct ct gg ct acaagt cc t ggcact ggt t gt gaaagat aaaat t t gat t gt gt ggt at caaccgt cat ccact acaag gat gggcgat t at agcagac t gagaagat c tggacaacag ggt t ggt ct g agat gat gaa gat ggct gt t gaact acat t ct gt ggcagc act gaagcca t ggat t caca t t t cat gggt gt cggcct aa ct cgcggccg at cgcgct ga gaggaagggg gagggcgaca gt cggcact c gt gccgt cgg ggacagt cga t caat t gct g caggaat t ca ggcat t ct t g aagat ggt ag t ct gat gaag ggcaaccat a gt cct gat t g t ct ggaacat ggt gct act g at cct agat c t gt act t t t g gct ggggaat gt at ggat gc aat cagct ct ct t gcat cca gaagagt at a gccat ggaca gcct accat a t gct gct cca agaagcgccc cgcgccggct t cgt ggcgct cgccgcagaa ccaagt gct a gcact t at ca ggt t t t t cag t t gaagct ac ggct t ggat t agcaaggt ct gagaaggt gg cat at gt t cc gaggaaagac cat t gct t gc gggt agt cag t gt t gct t gc at gggaagca cgaat ggt ct agaaccaact gt gacaagct t gcct gagat t t ct gaaggt t ccct cct cc ccgt gt t cga agccct cct c gat cgacgag gggcct ccgc gaagaact ac at t caccgt c ct t ct cgat t gaagaat gga cgaggat agt caaggagcca t caggaaat a t gt cagt gag t gaaat t gt g agt ct ct cag cacaggattt tggccccaca ccaagaat gc tgagacacag t ggt gt t agt ccaaagt ggt tgcacagaac cccaagt cca t t cat t cacc t ggt gaagga ccgt ggt cct ct acggcaag 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1530 <210> 796 <211> 540 <212> DNA <213> Or-yza sat i va <400> 796 atggcgagca agaaccccaa ggtgttcttc gacatcctca tcggcaaggc cagggccggc cgcgt cgt ca t ggagct ct t cgccgacacc gt ccccaaga cggccgagaa ct t t cgct gc ctctgcaccg gcgagaaggg cctcggcgcc tccggcaagc cgctgcatta caagggttct Page 654 120 180 gcct t ccacc ggcaccggcg cacaccggcc t t ct t cat ct gt cgt cgacg acggccgagc gt at cat acc gcgagt ccat caggcgt gct gcaccacgcg gat acaccgt gcgt cct cat 12689250 Sequence caact t cat g t gccagggcg ctacggcgac aggt t t gccg ctccatggcc aacgcagggc taccacctgg ctcgacggca cgtcgagaag atggagcaag cgaggactgc ggccagctcg <210> 797 <211> 453 <212> DNA <213> Or-yza sat i va <400> 797 atggggttgg aggaggacac gccgcgggga ccatctgggg cgccacgtcg cgctcccggg acgttcgccg ccgtcggggg aagaagaacg actacgtcaa ggctacaaag gaaggagcat t ct gct gcgc t ggacgt t gg ccctacacag tcgagaataa <210> 798 <211> 591 <212> DNA <213> Or-yza sat i va <400> 798 atggggagaa ggcctgctag aggt act gcc gt ggt gt ccc aagggcgtgg atgagttccc gt cagggt t c acccgt t cca gat aggct cc aaact ggaat gt ggacat t g gt caggt t ct gaagaggcac tccgccgtgc aggaagtggg gcttcaccaa agaat t at gt ct gat ggt gt cgtgcccctg ggaaggcgtt gccggt gacg cacggt cgt c gct cat ccgg cct ct acat c cggggccgt c ccct t ct gcg cggcaacact gcccgct cat gt gct accgc tgaccccaag ct act gt gt g t gt cct t cgt gaggggt gct t ct t t ct gt t caagt t caag gt t cacccgt caat gct cag t ct ggct gag aagacggt ga gcgacat ggc acgct caaga ggcgt cgagc ggcgcct t cg ct cat agct g accagagt gg tga cagat caaga at caggat ct cact t ggaaa at caacaaga tttgggaagc cgt t gcaagg t t ccct ggcc gaggagt acg ct gct t ggt t accat t caag Li st i ng. t xt gcgact t cac acgagaactt caaacaccaa agcacgt cgt t cggct ccgg ccgacgacca aggccgcggc acgacgt gcc t gt gcggcag agct cgt gct tcgccggcgc gt t ct t gcct acaat ggcaa acaagccgt a at gat gt t gg ggat gcct t t t gct ct cgt g ct cagggt ac agagcaat gc ggcaaaagat t caagt t gaa ct cat ggt cg ct t ccgcct a ccgcggcaac caagct gcgc cggct cccaa tttcggcaag ct ct ggcggc cgccaact ga gacggggct g ccgggt cgag ct acggggcc gagccagcgc caccat at t c ggct t t t aca agagt act at ccccaagt ca cat gaagaag ccacct aagg t gct ggggct ct gt gct cgc t aaacat gct cat ccacagc ggct gagggc t ct t gcgaag 240 300 360 420 480 540 120 180 240 300 360 420 453 120 180 240 300 360 420 480 540 591 <210> <211> <212> <213> 799 783 DNA Cr-yza sat i va Page 655 12689250 Sequence Listing.txt <400> 799 at gt cgcct g gct gagcgt t ggt gagct ca gct cggaggg aat gaggcat aagat ct gt g gcagagt cca t t t aagt cag gcccaggat a ct gaact t ct gcaaagcagg aaggacagca aat gcggagg t aa ct gaggcat c acgaggaaat ccgt t gagga cat cgt ggag at gt t gcat c at ggt at cct aggt gt t ct a gagct gagag t t gcact cgc cagt gt t ct a cgt t cgacga cct t gat cat at ggt ggt ga gcgt gaggag ggt cgaat t c gcggaacct g gat cat ct ct aat t aaggag t aagct t ct g cct gaaaat g gaaggaagca t gacct gcct ct at gagat a t gct at t gct gcaact t ct t cgagat caag aat gt gt aca at ggagaagg ct t t ct gt gg t ct at t gagc t accgt agca gat t cccacc aagggt gact gct gagaaca acaact cacc ct gaact cac gaact ggaca cgt gacaat c gaagcagcga t ggcaaagct tggcaaagac ct t acaagaa agaaggagga ggat t gaaac t t gt cccat c accacaggt a ct ct t gt ggc cgat aaggct cagaccgt gc ct ct t ggcga t gact ct ct g agcct gaagg tgccgagcag cact gat gt t cgt gat t ggt gagccgt ggg t gagct cagc t gccact gct cct t gct gag at acaagt ct t ggact t gca t t gcaacct t ggagt ct t ac gacct ct gac agagggccac 120 180 240 300 360 420 480 540 600 660 720 780 783 <210> 800 <211> 495 <212> DNA <213> Or-yza sat i va <400> 800 atggcggcgg agacgttcct ctgtgcgacc aagt gt ccga aaggtggcgt gcgagacctg accaaggcca ccgtcgacta gtctccgacg acat cggcct cagtcgcccg acattgccca ggcgccggcg accagggcca cccctcagcc acgtcctcgc ggcacctgcg cctag ct t cacct cc cgccgt cct c caccaagaca cgagaagat c cgacgccgac gggcgt gcac cat gt t yggc caccaagct c gagt ccgt ga gacgcgt gcc aacat ggt ca gt ccgcgaca cgct gcaagg ggccact t ca t acr ccaccr ggcgcccgcc acgagggt ca ttgcccagga t ggt gt t cgg cct gccacaa t gct cgt caa ccaagcaccc acgagacccc t caccgaggt ccccgacaag ccccgagggc cgagat cacc cat cggct t c cat cgagcag cgaggagat c cgagct gat g ccgcaagaac 120 180 240 300 360 420 480 495 <210> 801 <211> 750 <212> DNA <213> Or-yza sat i va <400> 801 atggcgaagg aaccgatgcg cgtgctcgtc accggcgccg caggacaaat tggatatgct ct t gt cccca t gat t gct ag gggt gt gat g t t gggt gct g accagcct gt t at t ct acac atgcttgaca ttccaccagc tactgaatct cttaatggcc ttaagatgga gctggttgat Page 656 gckgcat t cc ggt gt gaat g ct cat ct t aa ct t gaccaca gt gaagaat g gccact gt ga t gggaacat t ct gat ct act ccgat cgacg aagacgct cg ct ct t t t gaa t t gcgt t at g aagaat t cgc acagggcact cgat cat ct g agact cccag t ggt ct ccat cgt t cccagt agt t ct caag ct t act cat g 12689250 Sequence gggaat t gt c gcaacaact g gttggttctg gtagttgcca tccatccatc cctgagaaga t ggccagat c tctgaaaaac gggcaaccac tcatccaccc tggagagaag cctgtcaggg gggtgtgtac tctgatggtt aacat gcagt ggt ggcgaat gaagaagatg gacgcgactg cct caact aa <210> 802 <211> 1122 <212> DNA <213> Or-yza sat i va <400> 802 atggaggggc ggatcgtggt ttccgcgcgc tctgcaccgg t acaagat t a gcct t aaagc caccggattg t caaaggt t t gggggagagt caat at at gg agaaaaggga t gt t at caat atcactacaa cccgaacacc aaaggaatgg gggtggttcg at t act gat a t t gt t at t gt gtgaactttt tcagtgatgg cct gcagaaa t t t ct t ggt g t at t t caaga aaaaagat t a t t agat ct t t gctgggagaa acaaagt caa t t at act cac ggt gct t t ac tagatgcaga ttccgtcaag gacaggcacg aagcatgctc t gcagt t gga aaaaagaaga t t gct gacag ccatcaggag gttcagagaa ggagct ct ac cgagaagggc agcat ct gt c t at ggt gcaa at t gaat t t t ggct aat gct acacct t gat ct caat ggaa t gat t gcgga t gacat gt at gat gact gct caagacagcc agaagagat a aaat agt t ct ct t t gcat t g cat cgcact c gccaaacgat acgggat cag gat cgat gaa gcct ccgt gg gt cagcgccg t gcact aggt ggt ggggat a gaggat gaaa ggt cccgat a gggaaacat g cat gt t t ct g gaact accag cct gat t ggc gt t gact ct g ct caaaaagt gat gaagaga gct t gcaagc cgt gaagggg aat gat at t g ggaggaat t a gagcggaaag gagaat aact Li st i ng. t xt at gt t gt gga at ccagcaaa acat t act t g t t aat gt cca agt accct ga aact cgt t gc cgt at ggt gt ggacgat t gt cccaggagct cccct cgcac ccaccggcgt ggaccaaggg t t accgct gg at t t t gt cct caaat ggat c t t gt t t t t gg t t ggagaat c aaggt gccag caaat gat ct caaaat ct t t acagaaaagc agagct cagc t gaagt t ggg agggaaaccc at gct gcagt agcgggagct cat t t t ct ag ga ggcct gcact caccaacgct cct cacccgt agt t act gat t gt t aaccac t gat gat gag gcct gct ggt t cagggt ct c gt cggaggag ggcggagaac gccgct t cac t t cat gt at t t gat ggaact gaagcat gag t cagt t t t t c aagggt t at a t gaccgt cca t gat ggagt t cgaagaaaaa t ggaaat gaa t at gagat ac act acggaag agat t t gaaa aaaagctttt t gagagct t c ggct gct gca gat gt t ccaa 240 300 360 420 480 540 600 660 720 750 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1122 <210> 803 <211> 1944 Page 657 12689250 Sequence Listing.txt <212> DNA <213> Cr-yza sat i va <400> 803 at gacgacgc gcggacat cg ct cggcct gc t ct gt t ct t g acgccgacgc ggt gcgt t t c cgt caaccat agccaagt ca acagct gcaa caat cagaca t t t gct gat g gagct t act c act t ggagac ggccct gat g at aat ggccg at ggt gat ag ggt gcct t ga ggcact cct g at agt t gct g gcaggt t t t g ggat t aat gc gggggccct g gct ct t gt t g ggagt t aat g gt ggt gcgaa gcgcat ggt g caggcagacc t caat agct a at t gaaat gt t act cgt t t t agggat gt ga acaat gcct g cgaagccgag acat cgccaa gccccgagca at gagt t gaa cgct cgggga t t gat aaaaa cacaagggcc t acccat gga acaat ct t ct aagcgct gt t t aat gct cag cagat gaagt gggt cat gga aaaaaggt at t at t agct ct ggaacagcaa ct gt cct aat t gt t agt gca at aaaat t gc gt t ct gat at ct cagt gt gc at gt t gt ggc aagct ggat g t t gt agt t gc at gcgt ct t t gt aaaggagc ct ct gaaat t agt t ct at gg at accaagca cgcat gt t cc gggccagcat gt ct t cct ac cccgacgat c cgccgt yt cg ct t cgat ct c agggcagcag gggcaagt cg gat aaagt t t t act t t t gga t gagt t caat t gct gct gct caat agact a acgcct gat a t agacgct t t t gt aaat gac ggt gagagaa t acaact t cc ggcaggt gag gaaagat gcc cgct ggacct act gaagt t g t ggaact gag t at t at t gt g t gggaagcct cgt caat ct t aat caacaag ggct gct ggt ggt t gat ct t t t t gt at cct t gct agcggc aggct t ct ca at ccat gaag t ggagccggt aaggccct gc cggaggct gg ccgt t cccca t acggcaagt gat gggt act accaccaccg gt gat t ggt g at caaggggg ct t cat ct t a at cgat acaa t gt ccgccaa aagct t ggaa gcaagact t g cggt t cct ga acaggct t t g ct t gct gaca cct at t act g at ccat ccaa t t t gct aat a gt t gggaagg aaat t cat gg gccacaat t a t t ggat cat g gct aaacat a t t t gcat cag gct t t t gat g ggact cgcgg t t agaat ct g gt t gaat act aacct cccaa ggcgcgccgt t t cat ct acc t t ct acgaaa acgt ggcgt c t cgccgacat acaaggccaa acgt ggt ggt t t gggct gt g t t gaggt t gt gt gct gct gg caggagat at ggat t t t cca acaaagaagg t ct caaagac acat agaccc gaaaaat cac acat agct gt tgagggaaag ct gat gat ct cgct cat gca t t gct cat gg gt ggat at gt at at aaagt g gagct ct t aa cat at gt gag tcgcaaacac at act gaagc ct gt t gt ct g ttcaacgggc gcat aaagga ct gaacaggc t at gcat ggc ct ggct t cgt cact ggt t gg t cgacgt cga gccggt gccg cgccgcggag ggt gt t gct g cggcggcat c ccaggcgct g cact t gcct c aggt ggct ac ccat gcaat t t gaggct t ca aaagaggcgc agat cct aat t gaat ccat t t at t ggacaa agct agt gag gct t ggaaga t ggt gt t ggg aact ct t gaa aaact ct t cg agt t acagag t agat at agt aat gcat ggt t gaaaat gt g aaagagt t at agaaat ggac cact caccat at gt gagagc gaagat t gag ggagaagcag gaaaact cag gct t cct at c caccat gagc cacagccact 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 Page 658 12689250 Sequence Listing.txt ggcaaagt ca t gggt ct gt c at aa <210> 804 <211> 561 <212> DNA <213> Or-yza sat i va <400> 804 at gcact cgc cct gt t t cgt aacccggccg tgacgttcgg t t ct act gga t cgcccagct acccacggca aggccatccc gt gat ggaga t cgt cat cac ccgaagaagg gatcgctcgg aacatcctcg ccgccggccc cccgccgtcg ccgccggcaa ggcggcggcc tcgccgggct gccgaccagg actacgctta <210> 805 <211> 765 <212> DNA <213> Or-yza sat i va <400> 805 at ggt gaagc t cgcat t cgg tacgtggcgg agt t cat cgc gcct at ggt g agctcaacgc ggccggcctt gt gcgat cgc ccgcaacatc tcggcggcca cacat cacca t cct caccgg gcctgcctcc t cct caagt t ggcatcagcg agctggaggg acggtgtacg ccacggcggc gcgatcggct t cat cgt cgg atgaacccag cgcgctcctt gt ct act ggg tcggcccgct ttcat cggct cataccagcc gggcgt t t cc cct cgccgt c gct aggcgcc gacgcacggc gt t cgcgct g caccat cgcg gt t cagcggc ct t cgct ggc cgt gt acggc aagct t gggt caccct cct c at t t t cccgg gat cgcccat cct t aacccg act ct t ct ac t gt cacccac cgt cgt gat g ggacccgaag cgccaacat c cggccccgcc gat cggcggc cgt t gccgac gt cgccgcca ggcggccaca t ccat cgcct gt cgccggca gt gt acacgg cccat cgcga ggct ccat ga aact gggt ct gacgt gt t ca gact cct t ca t t cgt ct t cg gcaat t gacc gcat t ggcct gccgt gacgt t ggat cgccc ggcaaggcca gagat cgt ca aagggat cgc ct cgccgccg gt cgccgccg ggcct cgccg caggact acg acat ct ccgg t caccat cct gcct cct cct t cagcgagct t gt acgccac t cggct t cat acccagcgcg act gggt cgg t cggct cat a gcgccacgt c ct ggcgt cgg aat ggt ggcg gt t cgt gggc t cggcct cgc agct gct agg tcccgacgca t cacgt t cgc t cggcaccat gcccgt t cag gcaact t cgc ggct cgt gt a ct t aa cggccacctt caccggact c caagt t t gt c ggagggcgt c ggcggcggac cgt cggcgcc ct cct t cggc cccgct gat c ccagcccgtt cgt gaaggcc at ccgccat t ccct cgaccc gt t t ccgt cg cgt cggcggc cgcct ccat c cggcgt cgcc gct ggt gt ac cgcgcccat c cggcggct cc t ggcaact gg cggcgacgt g 1944 120 180 240 300 360 420 480 540 561 120 180 240 300 360 420 480 540 600 660 720 765 <210> <211> <212> <213> 806 1287 DNA Qr yza sat i va Page 659 12689250 Sequence Listing.txt <220> <221> <222> <223> Nr egi on (1240)..(1240) n any nucleotide <400> 806 at gccggagc ct ggacggca tgcgacaagc gt caccaccg t ct gaggcca gcagccggca t gggagaagt t act acgt ca gcggcgat cc ct gct cacca agcggcgggt ct ggt gaaga t ggt gcat ct ct cggcgccg gcacagt at t tgccaggaga gt gt ccaagg aacgagt t cg at gccccccg cgcacgct cg ct cccsgccc gaa\ft ggaa agt cgat ccc acccgcggct t cat ccaggc aact ccaggg t cat gct cgc agccat gcga t cgcgcgat a tggacccagc t cgggt cgac agaagaacgc t cat cgcgcc gcat caacgt ggaggagcaa accagcccac accaact aat acgcgat ggt acaacggcgt agat ct ccga acgcgcagca ccgagcgcct gcgt cgt cgc aaaccacccc caaggaggcg gaacct cgcg ct ccgt caac gact ct gaaa cggt t t ggcc caagcct aac ct t cgaggt t t aaggccgt g gct gaacggg tgaaacaggc gt t cct gt ac gagcgggcac ggaggat ct g ct t caccct c ccgcct aggc gct gaagcag gccgct ggt g ct t cct ccgc cgt caccgt g cgt gct cgac caacggcggc aagt t aa gcgt accaga t cgt t cgt ca aagaact acg cggccgt cgg ttcaagagga at t gt caccg gagct caagg gat at ggt cg gagt t cgagg tgggacacgc ccggagct gg aagt acggcc cct gaggagc aact t ct cca t t t gaggggt gggct ggaga gcct t ct ccc cgct t cggct ct ccgcgt cg gt cgagaagg aaccccgcgn t cat caacga ccacgt ggat t cgacat gga aagt cggcac ggt ggcagaa gcgccaat gt aagt gaagct acgagaacac acgt gaagct cgat ccacgt agt gggact t t cgt ct acgc t cat ct t cca agggt t ccag acaagaacat agacggggcg tcaaggacag ggat cgt gcc t cat ccgcga t gct gcacga ccgct t sggc cgagct gat g ggagcccgag cgagt acccc t gt cggct cg caagat gaag ccaagt t t gc gagt gacggc cat ct gcgt c gct caacgat ggacgcggcg ccggct gccg cgggat cggg cat caact ac ccaggt cat t cat ggagaac gt t caacat c cgcccggcac ggcct acacc ggact t cagc gct cgacgcg aaccaaaarg 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1287 <210> 807 <211> 1191 <212> DNA <213> Or-yza sat i va <400> 807 at ggct gagg t t gacacct t aagct ct gcg accagat at c agcaaggttg cct gt gagac actaccaagg ccaat gt t ga tttgtgtcca at gat gt agg cagcaatccc ctgacatcgc cct ct t cact cgat gcggt g ct gcaccaag ct at gagaag cct t gat gct acagggt gt c t ccgagt ccg ct t gat gcgt accaacat gg at cgt caggg gaacact gt a cat gggcact Page 66( t caat gaggg gcct t gct ga t cat ggt gt t at act t gccg aggt gct t gt tcaccaagcg acaccct gac agaccct gaa t ggt gagat c cggcat cggc caacat t gag cccagaggag 120 180 240 300 360 12689250 Sequence Listing.txt at t ggt gccg at gccact ca aacggggcat aacgacaat g gat gagaccg gt cat cccag t t cgt cat cg acct acggt g gt t gaccgga ct cgct cgcc gt gt t cgt cg acggagaact ggcaat ggcc t t cacct ggg gt gaccaggg gccat gt t ct gcgcat ggct gcgccat ggt tgaccaacga agcagt acct gt ggacct ca gct ggggagc gcggagcct a gct gcat t gt acact t acgg t cgat t t cag gct acct gaa aggt ggt gaa gcat at gt t t t gct accaag gaggcct gat ccct ct ccgc cgaaat cgcg t gat gagaag t ggagat gca acacggt ggt cat t gcaagg ccaggt gt cg cacgggcaag gccaggcat g gacggcggct acccct caag ggat at gcaa ct cggt gccc gggaagacgc gt ccacaccg gct gacct ga accat ct t cc ggt ct caccg ggcgcgt t ct caggcggcca t at gccat t g at ccct gaca at cat cat ca t acgggcact tgggaggagc ccgat gagac ccct gaat t g gt ct caccga aagt gacagt t gct cat ct c aggagcat gt at ct caaccc ggcgcaagat ccggcaagga agagcat cgt gcgt gccgga gggagat cct acct cgacct tcgggaggga ct t ct gcat a ggt ccgcaag ggagt accag cacccagcac cat caagcct gt caggt cgc cat cat cgac cccgaccaag t gccaat ggc accgct ct cg caggat t gt c gat gaggggt ggaccct gac 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1191 <210> 808 <211> 834 <212> DNA <213> Or-yza sat i va <400> 808 atgttcctgg ttgactggtt gccaagatcc t ct t cct cgg aaggacgagc ggctcgtgca at cggcaaga t caagt t caa tggaaggact actacgccaa aacagt at ct t ccagagt t c t t acaaat ga ggaaactgtg gct ct t aat a t gat t t ct at aaaat cgaat gt t ggcagt g gccgctgaca aggaacggtt gat t ccct ag caaccgtgcc gcgtcggagg aggaactccg aacgtgaacc tagccgactc cgcaagatgg gctacggcga ct acggggt g cct cgacaac gcaccagccg ggcct t cgac gggt gcaggg agt t ct t gct gt t at ggggt accat t ggca cagt cat gga t gcggagt cg t t t cct gat a ct act acct t caacgt gcgg aggct t caaa ct ggcct cgc gccggcaaga acgcagt acc ct cggcggcc aaat t at gca t t aagt cct g gccaacattt t ct t gct gt c ct agt cgat g aagaaggaac ct gggaaaca ggt ct gagca cct ct ggaga t ggat gt ccc t ggggct gt g ccaccct cct cgacgt cgga accagat cgc cgt gt cacga t t t gggt ct c t aact gt ggt agagaaacaa ct gt t gt t t a t cgat gccct agat cgacat act t caccac t ct t cat gt g agt acat caa gcagaaggag ccacat gct c ggagct gagc ccgccgcgt c t at t gat ggc ct ggcgcagt gt t acgagat agat at gggg ct t ggt ggat cct t gcagac cccat acgcc cggt aagggc cagcgt cgt c at ga 120 180 240 300 360 420 480 540 600 660 720 780 840 <210> 809 <211> 1713 <212> DNA Page 661 12689250 Sequence Listing.txt <213> Cryza sat i va <400> 809 at gcat cgcc ct caccgt cg gaagaaacgg t acaact at t ggtctcggag t t aaaaaat g caat t t act g gagactgaca aaat acaat g gaagctggca tttgcacgac t t ct ccat ct ttgatgagga t cgct t gaaa cggcctcct c gct ct t at t c gct at cat ca agtggtggcc gcatcccttt t t ct accgt t gct t t cat ct gctcccaaca t at ct t acac at cgagat gt t t cat gct gc act t at t t ct gcat ct act g at gt ct ggct ct aggaat t c gcggcggcca cggggctgcc t t aaact ct g acagcctccc aggtcct t gg tggagaaggg at gccat t ga aaaacaacga gt aacaggat aaaaatt gga gt t t t gaagt t caat t cgt t cgctgagaaa gagatgt t ag gtagct t ggt tgctcgtgat cgacctt cat t ct act caag ttccgt t ct t cactggcagc ct t t t ccttt at ccct gt cg cct ct gt t at act t t gt gt t tggt t t t t gt tgt t gaatgc cct t gt at gt t cttccagac ccacctccac cct cgct t cc ggtaaacaag at t t t gt cag cggcaatgag t cccat t t gc aaggt cat at aaataagcac aat t cat gt c cat gacct at t t act t ggac cat gatggt t tgat t atgcg tgaggaatct gt t t ct t t ct t gt at t ggcc cgt gt gct ac gaatggcggc gtgct t t t cg cat accat t t ggt t ctattg tgt t aagaca ct cat t gat g t act t cat t c t at cct cat a t gagaatt ac ct acct at ac aagctt ct ac ct cct cct cg gcct ccgagt gt t ggcccgt ccat ct gaaa ct gat cgat a acaatt gaac tggt t t gaac t acct ct aca aat ct t act c tcagtgaagt t acccgt t ct at t t t ct t ga aaat at gct c ggttggaagc gct t t t gttg at t gt t ggca gccctt acat aaaaactgga at t ggat t gg ggcacaatgg ggaactgtag at t ccacgt c ggtggactac t ggaact aca at agt cacca catt ggcaat t caat at act tttggctaca ccgccgccgt ccgaccacaa acaataaccc accccgcaca gtcaggt t ga ttgatgacaa t ct t cat agg cacacaagaa aagagtcacc gggtgcaaac t t gaacat ca ctggat t ggt gtgaagatga ttgtccatgg gtat t ggcac t gt t at at gt ct t t cat t t c t aaagt ct at tgttgaacac ttgtcatat t ttggtagaaa ct at t cct ga t cccct t cgg aggt t t att a t at gt gt cac ggacat cat t at t at cacgt cctt gat gt t cgt cct cct c gt acaaagct gcaagaaact caaatggggt t at aaagt t t caagat t caa ttttgttggt cat t gt t gt t t aagct t ct t aaacgt t gca gat ccat t gg at caat gat a t gat ct ggag agat gt t t t c tcagctggcg tggacgagga t ggat at gt t gat cct t act cat t gct at c t gt cct gt gg t t ggagt ggt gaagaagtgg cagcat ct t c t gt at at ggt t at t gt gggt ct t ct ccgct gaagacaaag ct gcct cgga 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1713 tttgcggtac tgt t gaaaca tga <210> <211> <212> <213> <400> 810 759 DNA Cryza sat i va 810 Page 662
AU2008200749A 2000-06-23 2008-02-15 Promoters for regulation of plant gene expression Ceased AU2008200749B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2008200749A AU2008200749B2 (en) 2000-06-23 2008-02-15 Promoters for regulation of plant gene expression

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US60/213,848 2000-06-23
US60/214,087 2000-06-23
US60/258,692 2000-12-29
AU2005247022A AU2005247022B2 (en) 2000-06-23 2005-12-22 Promoters for regulation of plant gene expression
AU2008200749A AU2008200749B2 (en) 2000-06-23 2008-02-15 Promoters for regulation of plant gene expression

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2005247022A Division AU2005247022B2 (en) 2000-06-23 2005-12-22 Promoters for regulation of plant gene expression

Publications (2)

Publication Number Publication Date
AU2008200749A1 true AU2008200749A1 (en) 2008-03-13
AU2008200749B2 AU2008200749B2 (en) 2012-06-14

Family

ID=39244027

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2008200749A Ceased AU2008200749B2 (en) 2000-06-23 2008-02-15 Promoters for regulation of plant gene expression

Country Status (1)

Country Link
AU (1) AU2008200749B2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10472642B2 (en) 2015-02-05 2019-11-12 British American Tobacco (Investments) Limited Method for the reduction of tobacco-specific nitrosamines or their precursors in tobacco plants
CN114561387A (en) * 2022-02-28 2022-05-31 北京大学现代农业研究院 Peanut promoter and application thereof

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE537679C2 (en) * 2013-08-29 2015-09-29 Sveriges Stärkelseproduct Förening Upa Genetically modified Beta vulgaris

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10472642B2 (en) 2015-02-05 2019-11-12 British American Tobacco (Investments) Limited Method for the reduction of tobacco-specific nitrosamines or their precursors in tobacco plants
CN114561387A (en) * 2022-02-28 2022-05-31 北京大学现代农业研究院 Peanut promoter and application thereof
CN114561387B (en) * 2022-02-28 2023-08-15 北京大学现代农业研究院 Peanut promoter and application thereof

Also Published As

Publication number Publication date
AU2008200749B2 (en) 2012-06-14

Similar Documents

Publication Publication Date Title
AU2020223682B2 (en) Plant regulatory elements and uses thereof
AU2016202373C1 (en) Isolated Polynucleotides and Polypeptides and Methods of Using Same for Increasing Plant Yield
AU2013312198B2 (en) Fluorescence activated cell sorting (FACS) enrichment to generate plants
AU2001286811B2 (en) Stress-regulated genes of plants, transgenic plants containing same, and methods of use
AU2021225152A2 (en) Isolated polypeptides and polynucleotides useful for increasing nitrogen use efficiency, abiotic stress tolerance, yield and biomass in plants
KR102243727B1 (en) Engineered transgene integration platform (etip) for gene targeting and trait stacking
KR20200124702A (en) The novel CAS9 ortholog
CN104024438B (en) Snp loci set and usage method and application thereof
AU2018201613A1 (en) Optimal soybean loci
AU777342B2 (en) Compositions and methods for the modification of gene transcription
AU2018200913B2 (en) Plant regulatory elements and uses thereof
US20030131386A1 (en) Stress-induced polynucleotides
KR20140014374A (en) Multiple virus resistance in plants
RU2756102C2 (en) Tobacco protease genes
KR20170116034A (en) Gene determination genes and their use in sarcoma
CA2396359A1 (en) Nucleic acid molecules and other molecules associated with soybean cyst nematode resistance
AU2002322469B2 (en) Nuclear fertility restorer genes and methods of use in plants
AU2022202318A1 (en) Methods of increasing specific plants traits by over-expressing polypeptides in a plant
CA2492136A1 (en) Nuclear fertility restorer genes and methods of use in plants
AU2008200749A1 (en) Promoters for regulation of plant gene expression
KR20220165764A (en) How to Control Nicotine Levels in Nicotiana Tabacum
RU2817119C2 (en) Tomato plants resistant to tomato brown rugose fruit virus
KR20230113283A (en) Dicer-Like Knockout Plant Cells
CN116648513A (en) Cutting enzyme sample knocked out plant cells
AU2004205117B2 (en) Compositions and methods for the modification of gene transcription

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired