NZ795228A

NZ795228A - Use of biological rna scaffolds with in vitro selection to generate robust small molecule binding aptamers for genetically encodable biosensors

Info

Publication number: NZ795228A
Application number: NZ795228A
Authority: NZ
Inventors: Robert T Batey; Ely B Porter
Original assignee: The Regents Of The University Of Colorado A Body Corporate
Priority date: 2016-12-12
Filing date: 2017-12-11
Publication date: 2022-12-23

Abstract

Provided herein are libraries of scaffolds derived from riboswitches and small ribozymes and their methods of use. The scaffolds of the invention yield aptamers that are easily identified and characterized by virtue of the structural scaffold. The nature of the scaffold predisposes these RNAs for coupling to readout domains to engineer biosensors that function in vitroand in vivo. Biosensors, synthetic RNA agents and synthetic DNA agents, and their methods of use, are also provided. upling to readout domains to engineer biosensors that function in vitroand in vivo. Biosensors, synthetic RNA agents and synthetic DNA agents, and their methods of use, are also provided.

Description

Provided herein are libraries of scaffolds derived from riboswitches and small ribozymes and their methods of use. The scaffolds of the invention yield aptamers that are easily identified and terized by virtue of the structural scaffold. The nature of the scaffold predisposes these RNAs for coupling to readout domains to engineer sors that function in vitroand in vivo.

Biosensors, synthetic RNA agents and synthetic DNA agents, and their methods of use, are also provided.

NZ 795228 USE OF BIOLOGICAL RNA SCAFFOLDS WITH IN VITRO SELECTION TO GENERATE ROBUST SMALL MOLECULE BINDING APTAMERS FOR GENETICALLY ENCODABLE BIOSENSORS RELATED APPLICATION This application claims the benefit of priority of U.S. Provisional Patent ation No. 62/432,879, filed on December 12, 2016, the content of which is hereby incorporated by reference herein in its entirety for all purposes.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR PMENT This invention was made with government support under grant number CMMI CHE1150834 awarded by the National Science Foundation. The government has certain rights in the invention.

Allosteric RNA devices are increasingly viewed as important tools capable of monitoring enzyme ion, optimizing ered metabolic pathways, facilitating discovery of novel genes, and regulators of nucleic acid based therapeutics. One bottleneck in the development of these platforms, r, is the availability of small molecule binding RNA aptamers that robustly function in the cellular environment. While aptamers can be raised against nearly any d target by in vitro selection, many of these RNA-based aptamers cannot be easily integrated into devices or do not reliably function in a cellular context. Accordingly, there remains a need for rs and methods for ping aptamers.

SUMMARY A novel approach is described herein using scaffolds derived from riboswitches and small ribozymes. This approach, applied here to 5-hydroxytryptophan in an exemplary aspect, yields aptamers that are easily identified and terized by virtue of the structural scaffold. The nature of the scaffold predisposes these RNAs for coupling to readout domains to engineer nucleic acid devices that function in vitro and in the cellular context.

In one aspect, a library of oligonucleotides is provided sing a plurality of nonidentical oligonucleotides is provided. Individual oligonucleotides of the library comprise a first ce comprising a helix domain, a second sequence comprising a first hairpin domain, and a third sequence sing a second hairpin domain, wherein the helix domain, first hairpin domain and second hairpin domain form an oligonucleotide junction containing a ligand-binding domain, and wherein the library comprises a plurality of non-identical ligand- binding s.

In one embodiment, each helix domain independently is a fully complementary helix optionally comprising one or more ilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair and a bulge. In one ment, each helix domain is a fully complementary helix.

In one embodiment, each first hairpin domain independently comprises one or more destabilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair and a bulge and/or each second hairpin domain independently comprises one or more ilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair and a bulge.

In one embodiment, the helix domain is at least 4 to 10 base-pairs in length, or at least base-pairs in length.

In one embodiment, the oligonucleotides are oligoribonucleotides.

In one embodiment, the oligonucleotides individually se a sequence having a series of linked sequences according to Formula I: P1-J1/2-P2-L2-P2’-J2/3-P3-L3-P3’-J3/1- P1’ (I), wherein “-” ents a bond, P1 and P1’ form the helix, P2, L2 and P2’ form the first hairpin, P3, L3 and P3’ form the second hairpin and J1/2, J2/3 and J3/1 together form the oligonucleotide junction. In one embodiment, J2/3 comprises a T-loop motif. In one embodiment, the T-loop motif comprises the sequence UUGAA, optionally wherein the guanosine of the T-loop forms a Watson-Crick base-pair with a cytidine in J3/1.

In one embodiment, the helix domain has a first end and a second end, and the first end is proximal to the oligonucleotide junction, and the second end is linked to an oligonucleotide-based readout module. In one embodiment, the oligonucleotide-based readout module is a genic, e.g., a Broccoli fluorophore binding aptamer, or a based readout module, e.g., a pbuE . In one embodiment, the oligonucleotide-based readout module is an oligoribonucleotide-based readout module.

In one embodiment, individual oligonucleotides have sequence correspondence to a Bacillus subtilis xpt-pbuX guanine riboswitch sequence sing about 23 variable nucleotide residues within the oligonucleotide junction, or individual oligonucleotides have ce correspondence to a Vibrio cholera Vc2 cyclic di-GMP riboswitch sequence comprising about 21 variable nucleotide residues within the oligonucleotide junction, or individual oligonucleotides have sequence correspondence to a Schistosoma mansoni hammerhead ribozyme sequence comprising about 21 variable nucleotide residues within the oligonucleotide on.

In one ment, the oligonucleotide junction is an N-way junction, wherein N is two, three, four or five, or wherein N is two, or wherein N is three, or wherein N is four, or wherein N is five.

In one embodiment, the library comprises from about 421 to about 423 non-identical members.

In another aspect, a library of oligonucleotides comprising a plurality of non-identical oligonucleotides is provided. Individual oligonucleotides of the library comprise a first sequence comprising a helix domain, a second sequence comprising a first hairpin domain, and a third sequence comprising a second n domain, n the helix domain, the first hairpin domain and the second hairpin domain form an ucleotide on containing a pre-selected ligand-binding domain, and wherein the library comprises a plurality of nonidentical ligand-binding domains.

In one embodiment, each helix domain independently is a fully complementary helix optionally comprising one or more ilizing nucleotides selected from the group consisting of a ched base pair, a G•U wobble base pair and a bulge. In one embodiment, each helix domain is a fully complementary helix.

In one embodiment, each first hairpin domain independently comprises one or more ilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair and a bulge and/or each second n domain independently comprises one or more destabilizing nucleotides selected from the group ting of a mismatched base pair, a G•U wobble base pair and a bulge.

In one embodiment, the helix domain is at least 4 to 10 base-pairs in length or at least base-pairs in length.

In one embodiment, the oligonucleotides are oligoribonucleotides.

In one embodiment, individual oligonucleotides comprise a sequence having a series of linked sequences according to Formula I: P1-J1/2-P2-L2-P2’-J2/3-P3-L3-P3’-J3/1-P1’(I), wherein “-” represents a bond, P1 and P1’ form the helix, P2, L2 and P2’ form the first n, P3, L3 and P3’ form the second hairpin, and J1/2, J2/3 and J3/1 together form the ucleotide junction. In one embodiment, J2/3 comprises a T-loop motif. In one embodiment, the T-loop motif comprises the sequence UUGAA, optionally wherein the guanosine of the T-loop forms a Watson-Crick base-pair with a cytidine in J3/1.

In one embodiment, the helix domain has a first end and a second end, and the first end is proximal to the oligonucleotide on, and the second end is linked to an oligonucleotide-based readout . In one embodiment, the oligonucleotide-based readout module is a fluorogenic, e.g., is a Broccoli fluorophore binding aptamer, or switchbased readout module, e.g., a pbuE switch. In one embodiment, the ucleotide-based readout module is an oligoribonucleotide-based readout module.

In one embodiment, individual oligonucleotides comprise sequences having sequence correspondence to a Bacillus subtilis uX guanine riboswitch sequence comprising about 23 variable nucleotide residues within the oligonucleotide junction, or individual oligonucleotides comprise sequences having sequence correspondence to a Vibrio cholera Vc2 cyclic di-GMP riboswitch sequence comprising about 21 variable nucleotide residues within the oligonucleotide junction, or individual oligonucleotides comprise sequences having sequence correspondence to a Schistosoma mansoni hammerhead ribozyme ce comprising about 21 variable nucleotide residues within the oligonucleotide junction.

In one embodiment, the oligonucleotide junction is an N-way junction, wherein N is two, three, four or five, or wherein N is two, or wherein N is three, or wherein N is four, or wherein N is five.

In one embodiment, the preselected ligand-binding site comprises a binding site for a nd ed from the group consisting of an amino acid, a peptide, a nucleobase, a nucleoside, a nucleotide, a metal ion, a neurotransmitter, a hormone, an active pharmaceutical ient, and tives f. In one embodiment, the preselected ligand-binding site comprises a binding site for a ligand selected from the group consisting of an amino acid, a nucleobase, a nucleoside, a nucleotide, a neurotransmitter, a hormone, and derivatives thereof. In one embodiment, the preselected ligand-binding site comprises a binding site for at least one ligand selected from the group consisting of 5-hydroxy-L-tryptophan, L- phan, serotonin, and 5-hydroxy-L-tryptophan-methylamide. In one embodiment, the ligand is at least one of 5-hydroxy-L-tryptophan or serotonin.

In still another aspect, a method of selecting a ity of non-identical ligandbinding oligonucleotides is provided. The method es a step of contacting a library of ucleotides comprising a plurality of oligonucleotides with a ligand under conditions suitable for ligand g, wherein individual oligonucleotides comprise a first sequence sing a helix domain, a second sequence comprising a first hairpin domain, and a third sequence comprising a second hairpin , wherein the helix domain, first hairpin domain and second hairpin domain form an oligonucleotide junction, and a step of partitioning the library of oligonucleotides in a spatially sable such that the plurality of non-identical -binding oligonucleotides is selected, wherein the oligonucleotides having the oligonucleotide junction further comprise a ligand-binding domain, and wherein the ligandbinding domains of the library of oligonucleotides comprise variable nucleotide residues, is selected.

In one embodiment, the method r comprises a step comprising competitively partitioning the library of oligonucleotides with a solution of free ligand between the step of contacting and the step of partitioning.

BRIEF DESCRIPTION OF THE DRAWINGS The foregoing and other features and advantages of the present invention will be more fully understood from the following detailed description of illustrative embodiments taken in conjunction with the anying drawings. The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary Figure 1A shows the GR ld. The GR scaffold is derived from the aptamer domain of the B. subtilis uX guanine riboswitch. The aptamer is comprised of three paired (P) s connected by the g (J) regions of the three-way junction that contains the guanine (Gua, magenta) binding site (dashed lines represent direct RNA-ligand interactions). Nucleotides in outlined cyan are those that were randomized for selection. The terminal loops of P2 and P3 (L2 and L3, green box) participate in a tertiary interaction that zes the domain. Below is the three dimensional structure of the RNA (PDB ID 4FE5) with the same coloring scheme emphasizing the spatial relationship between the ligand binding site and the randomized nucleotides.

Figure 1B shows the CDG scaffold. The secondary (top) and ry structure (bottom) of the CDG scaffold derived from the aptamer domain V. cholera Vc2 cyclic di- GMP itch (PDB ID 3IWN). The labeling and coloring scheme are as described in Figure 1A.

Figure 1C shows the HH scaffold. The secondary (top) and ry structure (bottom) of the HH scaffold d from the S. mansoni hammerh ead ribozyme (PDB ID 3ZP8). The labeling and coloring scheme are as described in Figure 1A.

Figure 2A shows the chemical structure of 5-hydroxy-L-tryptophan.

Figure 2B shows an unrooted phylogenetic tree representation of the distance matrix of ces derived from round 7 of the GR-SSIII selection. Sequences are grouped into three main clusters, which are independently colored. The distance is expressed as the maximum likelihood estimate (MLE) of how many tutions have occurred per site n two nodes of the tree (bar shown for scale).

Figure 2C shows an unrooted phylogenetic tree representation of the distance matrix of sequences derived from round 7 of the GR-GsI selection. The four clusters from which entative sequences were analyzed are shown in ndent colors (legend shown to right); black region represent unanalyzed clusters and regions of the tree.

Figure 2D shows a covariation model of six observed clusters derived from the GR selections; colors are consistent with Figures 2B and 2C. The dashed lines correspond to the regions of the scaffold that were randomized and the line connecting L2 and L3 denote clusters where the sequences were maintained that would support the tertiary interaction.

Figure 3A shows the outcome of ive 2'-Hydroxyl Acylation analyzed by Primer Extension (“SHAPE”) chemical probing of three sequences (5HTP-I, -II, and –III) in comparison to the native xpt guanine riboswitch. For clarity, the regions of the gel corresponding to the J2/3 strand and L3 are shown (entire gel shown in Supplementary Figure 4a). While all three RNAs reveal ligand-dependent decreases in chemical reactivity of the RNA backbone within or adjacent to J2/3, only 5HTP-II preserves a signature hotspot of reactivity in L3 that is indicative of the formation of its interaction with L2 in the xpt guanine riboswitch.

Figure 3B is a quantitation of the ligand-dependent differential intensities of SHAPE probing of 5HTP-II in the presence and absence of 5HTP reveals that the majority of vity changes are localized to the on. ement of the L3 signal suggests coupling between ligand binding in the junction and tertiary structure formation.

Figure 3C shows an isothermal titration metric (ITC) analysis of binding of 5HTP to the 5HTP-II with (left) and without (right) the 5'- and 3'-amplification cassettes, trating that these s do not affect ligand binding.

Figure 3D shows a crystal ure of the 5HTP-II aptamer in x with 5- hydroxytryptophan (magenta). The green highlights the L2-L3 interaction of the parent scaffold ( Figure 1A) and cyan indicates the nucleotides that were randomized in the starting RNA library.

Figure 3E shows an overlay of the quantified SHAPE vity data in panel (B) on the crystal structure emphasizing the relationship between ligand-dependent changes in the dynamics of the RNA backbone and the structure.

Figure 4A shows the binding pocket of 5HTP in the 5HTP-II aptamer. The 5HTP- binding pocket within the three-way junction forms a set of hydrogen bonding interactions that s every polar functional group in 5-hydroxytryptophan except for one oxygen atom in the carboxylate group that becomes an amide to immobilize the compound. In addition, the complex is stabilized by stacking ctions between the hydroxyindole ring of 5HTP and to adenine bases (A48 and A49) in J2/3.

Figure 4B shows the binding pocket of 5HTP in the 5HTP-II aptamer. The core of the binding pocket in the 5HTP-II aptamer (green) is a T-loop that superimposes almost perfectly with the T-loops from tRNAPhe (orange) and the thiamine pyrophosphate (TPP) riboswitch (cyan). In each of the three es, the space between two purines at the T4 and T5 positions (T1-5 ing indicates the nucleotide on within the T-loop motif) enables intercalation of an aromatic ring.

Figure 5A shows that a 5HTP aptamer-based biosensor ons in E. coli. The wild-type 5HTP-II aptamer specifically activates the fluorescence of the Broccoli reporter in the presence of 5HTP. At t=0 minutes, 2 mM 5HTP was added to the media.

Figure 5B shows that the wild-type 5HTP-II aptamer does not specifically activate the fluorescence of the Broccoli reporter in the ce of L-tryptophan. At t=0 minutes, 5 mM L-tryptophan was added to the media.

Figure 5C shows that a single point mutation in the 5HTP-binding pocket of the 5HTP-II aptamer (A48U) also ablates fluorescence in the presence of the fluorophore. At t=0 minutes, 2 mM 5HTP was added to the media.

Figure 5D shows single cell traces of fluorescence induction for the wild type 5HTP- II-Broccoli sensor in the presence of 5HTP. At t=0 minutes, 2 mM 5HTP was added to the media.

Figure 5E shows single cell traces of fluorescence ion for the wild type 5HTP- II-Broccoli sensor in the ce of L-tryptophan. At t=0 s, 5 mM L-tryptophan was added to the media.

Figure 5F shows single cell traces of fluorescence induction for the binding incompetent 5HTP-II A48U construct in the presence of 5HTP. At t=0 minutes, 2 mM 5HTP was added to the media.

Figure 6A shows the secondary ure of an artificial 5HTP/serotonin "ON" riboswitch based upon 5HTP-IV aptamer. The 5HTP-IV aptamer is boxed in a dashed line, and the solid boxed nucleotides correspond nucleotides directly involved in alternative structure formation.

Figure 6B shows fied single-turnover transcription ons of the riboswitch demonstrating robust antitermination upon addition of 5HTP, nin, or 5HTP-NHme.

Gel images of transcription reactions are shown to the right, displaying the ligand-dependent transition from terminated (T) to read-through (RT) products. Similar titration with L- tryptophan failed to yield read-through transcription.

Figure 7A shows an unrooted phylogenetic tree representation of the distance matrix of sequences derived from round 7 of the CDG selection using the GsI e transcriptase.

The cluster from which the 5HTP aptamer was derived is highlighted in red. The distance is expressed as the maximum likelihood estimate (MLE) of how many tutions have ed per site between two nodes of the tree (bar shown for scale).

Figure 7B shows a covariation model of the 5HTP-VII aptamer; the solid red line corresponds to the regions of the bioscaffold that were ized and the line connecting L2 and P3 denote the tertiary interaction.

Figure 7C shows an unrooted phylogenetic tree representation of the distance matrix of sequences derived from round 7 of the HH selection using the GsI e transcriptase.

The cluster from which the 5HTP-VIII aptamer was derived is highlighted in purple; black regions represent unanalyzed clusters and regions of the tree.

Figure 7D shows a covariation model of the 5HTP-VIII aptamer; the solid purple line corresponds to the regions of the ffold that were randomized and the line connecting P2 and L3 denote the tertiary interaction.

Figure 8A shows a significant accumulation of mutations in the ffold by round 7 of the selection with some ons at the 3’ end achieving greater than 90% mutation frequency in the initial selection using SuperScript III (Life Technologies). There is also a strong propensity for the accumulation of mutations in sequence elements critical for secondary and tertiary structure (P2 and P3).

Figure 8B shows the modified selection protocol using a developed group II intron RT IC) shows a relief in the amount of accumulated mutations in the GR bioscaffold, particularly in the P2 and P3 regions. This allows for the preservation of structural elements designed into the sequence.

Figure 8C shows the observed error frequencies as a function of tide position in round 7 of the CDG/GsI selection.

Figure 8D shows the ed error frequencies as a function of nucleotide position in round 7 of the HH/GsI selection.

Figure 9A shows a SHAPE analysis of the V, -V and –VI aptamers in the absence and presence of 5HTP. Bars to the side highlight the J2/3 and L3 regions demonstrating varying ligand-dependent tions in the three-way junction and the presence of the signature reactivity hotspot in L3 diagnostic of the L2-L3 interaction.

Figure 9B shows a SHAPE analysis of a CDG scaffolded 5HTP binding aptamer.

The raw gel shows clear ligand dependent modifications in J1/2 and J2/3. The parental Vc2 RNA shows a ligand dependent protection in P3 at the tetra-loop g site, while the 5HTP-VII aptamer shows a ligand dependent cation on the opposite side of the helix.

Additionally, neither RNA shows any modifications in the ce of the irrespective ligand.

Figure 9C shows a SHAPE analysis of a HH scaffolded 5HTP binding aptamer. The raw gel shows clear ligand dependent modifications in J1/2 and J2/3. The changes in J2/3 localize mainly to positions 3 and 4 of a predicted T-loop motif. Additionally, if the structure was maintained the terminal loop of P3 (L3) that docks into P2 in the parental RNA shows a ligand dependent protection. Integration of the bands as a function of distance down the gel is shown on the left.

Figure 10A shows the 2Fo-Fc electron density map of the 5HTP-II/5HTP complex around the model contoured at 2σ. All regions of the RNA are well-defined by the electron y, making placement of the residues and backbone unambiguous. Cyan nucleotides were ized in the original RNA library and 5HTP is shown in .

Figure 10B shows a composite omit of the ligand binding pocket of the 5HTP- II/5HTP complex contoured at 1σ showing clear density supporting placement of the ligand (5HTP) and adjacent iridium hexammine (IrHex).

Figure 10C shows the final 2Fo-Fc electron density map of the 5HTP binding pocket of the 5HTP-II/5HTP complex red at 1σ.

Figure 11A shows an R2R diagram derived from variation analysis of J2/3 of the III aptamer of the most populous cluster of the HH/GsI selection.

Figure 11B shows a variation analysis of J2/3 of the 5HTP-VIII aptamer comparing the variation pattern of T-loops found in biological RNAs1 (top) and the cluster containing the 5HTP-VIII aptamer.

Figure 12 shows the construction scheme for roccoli biosensors. Red nucleotides in the Broccoli secondary structure denote the G-quartet that forms the platform for DFHBI and green tides denote the ences between Spinach and li.

Figure 13 is a graphical abstract showing the design of novel scaffold aptamers of the invention.

Figure 14A is a schematic of the secondary structure of genetically encodable biosensors of 5HTP and L-DOPA in which a GR-scaffolded aptamer (cyan) is coupled to a fluorogenic aptamer oli, green) via a communication module (orange, CM; sequences on bottom) and stabilized in vivo with the tRNA scaffold (yellow).

Figure 14B and Figure 14C depict heat maps of the observed ligand-induced fluorescence (top) and brightness of the ligand-bound sensor relative to a tRNA/Broccoli control (bottom) for a series of GR-scaffolded aptamers coupled to Broccoli with CMs of 2 to base pairs.

Figure 14D and Figure 14E depict heat maps of the performance of the same sensors in E. coli.

Figure 15A and Figure 15B show that the wild-type 5GR-II aptamer specifically activates the fluorescence of the Broccoli reporter in the ce of 5HTP, but not in the presence of L-tryptophan.

Figure 15 C shows that a single point mutation in the 5HTP-binding pocket of the 5GR-II aptamer (A48U) also ablates fluorescence in the presence of the fluorophore.

Figure 15D, Figure 15E and Figure 15F depict single-cell traces of fluorescence ion for the wild type 5GR-II-Broccoli sensor in the presence of 5HTP, L-tryptophan, and the binding incompetent 5GR-II A48U construct in the presence of 5HTP. At t=0 minutes, either 2 mM 5HTP or 5 mM L-tryptophan was added to the media.

Figure 16A shows superimposition of parental B. subtillis xpt guanine riboswitch and 5GR-11 RNAs over all backbone atoms. The e riboswitch (PDB 4FE5) is shown in red and its ligand, hypoxanthine, shown in magenta. The 5GR-II r is shown in blue and its ligand, 5HTP, shown in green.

Figure 16B shows mposition of the two RNAs using backbone atoms in P2 and P3 only.

Figure 16C depicts a view of the superimposition of two base quadruples that se the core of the L2-L3 interaction, showing a complete preservation of the individual base interactions that establish this tertiary interaction.

Figure 17A – Figure 17D depicts sequence and secondary structure of l RNA libraries. The green box highlights the constant region that is used for priming and the yellow box highlights the e specific to each scaffold. Nucleotide positions that were randomized in the starting library are highlighted in cyan.

Figure 17A depicts the sequence of the guanine riboswitch aptamer (GR) RNA library for the ion using the SuperScript III RT.

Figure 17B depicts the sequence of the GR RNA library used for the selection using the GsI-IIC RT.

Figure 17C depicts the sequence of the di-cyclic GMP riboswitch aptamer (CG) library. Figure 17D depicts the sequence of the hammerhead ribozyme (HR) library.

Figure 18A – Figure 18C depict selection of scaffolded aptamers that selectively bind 3,4-dihydroxyphenylalanine (L-DOPA).

Figure 18A depicts chemical structure of dopamine (1) and L-DOPA (2).

Figure 18B s an unrooted phylogenetic tree representation of the distance matrix of sequences derived from round 7 of a GR-GsI-IIC selection against L-DOPA. The four clusters from which representative sequences were orated into genic sors are shown in independent colors. The black region represents unanalyzed clusters and s of the tree.

Figure 18C depicts covariation models of the four clusters, with colors consistent with those of panel (B). Note that the I MFE structure has an alternative secondary structure that, if correct, ablates the tertiary loop-loop interaction.

Figure 19 depicts SHAPE analysis of GR scaffolded 5HTP binding aptamers. This Figure shows the full sequencing region of the gel that was used to te Figure 4A.

Figure 20 depicts SHAPE analysis of GR-scaffolded 5HTP-binding aptamers. The complete and red image of the sequencing gel shown in Figure 4A (regions corresponding to J2/3 and L3 were cropped to produce Figure 4A). SHAPE analysis of the , -V and –VI aptamers are depicted in the absence and presence of 5HTP. Bars to the side highlight the J2/3 and L3 regions demonstrating varying ligand-dependent protections in the 3WJ and the presence of the signature reactivity hotspot in L3 diagnostic of the L2-L3 interaction.

Figure 21 depicts SHAPE analysis of a CG-scaffolded 5HTP-binding aptamer. The raw gel (inset) shows clear ligand dependent modifications in J1/2 and J2/3. The al Vc2 RNA shows a ligand dependent protection in P3 at the tetra-loop docking site, while the 5CG-I aptamer shows a ligand dependent cation on the opposite side of the helix.

Additionally, neither RNA shows modifications in the presence of the irrespective ligand. ation of the bands as a function of distance down the gel is shown at the bottom. After normalization and assignment, the ligand dependent s are still evident (colored asterisks), particularly in J1/2.

Figure 22 depicts SHAPE analysis of a ffolded 5HTP-binding aptamer. The raw gel (right) shows clear ligand ent modifications in J1/2 and J2/3. The changes in J2/3 localize mainly to positions 3 and 4 of a predicted T-loop motif. Additionally, if the structure was maintained, the terminal loop of P3 (L3) that docks into P2 in the parental RNA shows ligand-dependent protection. Integration of the bands as a function of distance down the gel is shown on the left. After normalization and assignment, the ligand-dependent changes are still evident (colored asterisks). Note that there is no background ge at the equivalent site in the parental head RNA (top green asterisk).

Figure 23 depicts engineered sensors of the invention that were synthesized as G- blocks. “N” represents a position where the composition of A, C, G and T is approximately % each. RNA aptamer and sensor sequences are given as their equivalent DNA sequences.

Separate domains of Broccoli sensors are color coded to denote the tRNA ld (grey), 1T binding Broccoli aptamer (yellow), communication module (cyan) and GR scaffolded aptamer (red).

DETAILED DESCRIPTION The means to generate synthetic RNA and/or DNA elements with novel regulatory and sensing abilities is powerfully enabled by in vitro selection and the pool of ble synthetic aptamers is currently large. r, only a few small molecule binding RNA aptamers have transitioned into effective and widely used intracellular biosensors of their cognate ligand or other RNA devices. This discrepancy between in vitro binding and intracellular activity is problematic, suggesting t selection strategies cannot easily access small molecule binding RNA aptamers capable of functioning robustly in the cellular environment. While in vivo selection strategies for small molecule binding RNAs might be more successful at ting cell-capable aptamers, these approaches are not currently broadly practical. Thus, current strategies continue to rely upon a protracted workflow incorporating traditional in vitro selection with tandem, ation-specific selections for enhanced function.

Unlike synthetic rs, the small molecule binding s of natural riboswitches have evolved in the t of the cell and incorporate additional features extending beyond the ligand binding site that include high fidelity folding and an ability to communicate with ream regulatory switches to yield a detectable output. These aptamers are highly modular and robust, observed in a broad spectrum of bacterial species and interface with diverse regulatory domains acting on transcription, translation, alternative splicing and mRNA stability. Thus, they are highly flexible with regards to mechanisms of communication with adjacent domains or sequences that elicit an output (e.g., gene regulation). These aptamers have been successful in synthetic applications and have been used to validate synthetic RNA tools. While there has been a substantial effort to identify and characterize natural aptamers, they are inherently limited in their diversity and in application due to the nous pools of effector ligand that are difficult to modulate.

In response to these difficulties, provided herein are recurrent architectural folds found in natural RNA aptamers and small nulceolytic ribozymes that can be reprogrammed using in vitro selection to host a broad spectrum of small molecule binding sites while preserving the robust folding and highly stable architectural properties of the parent and their methods of use. Using partially structured RNA libraries in the selection of small molecule binding aptamers has been previously employed, but these simple hairpins and helices do not have the potential to form higher-order structure akin to natural aptamers. Selection of an RNA ligase ribozyme of very modest activity by randomizing a terminal loop in the Tetrahymena me P456 domain trated that it is possible to obtain ribozymes for a scaffolded library. However, the P456 architecture is large (160 nucleotides) and restricted to the IC1 and IC2 subclasses of group I plicing introns. Thus, it may not be well suited as a general platform for raising diverse small molecule binding aptamers that are active in a broad spectrum of cellular environments.

Using scaffolds derived from two ent riboswitch aptamer s and a ribozyme, a diverse set of aptamers was obtained that selectively binds 5-hydroxytryptophan (5HTP) and/or serotonin (5HT). While each of the scaffolds provides unique ons for ition, they all converge on similar binding affinities and discriminate t chemically related L-tryptophan. These aptamers are predisposed by the structural scaffold for coupling to fluorogenic and switch-based readout modules. While screening strategies are met with varying degrees of success, the diversity of aptamers readily achieved using this ch enables more flexible strategies with less in vitro characterization ed to implement practical RNA devices. sed devices are increasingly viewed as a potentially robust and predictable tool in synthetic biology. RNA has a unique feature set when compared to protein-based atives, including the ability to regulate in cis, predictable secondary structure, and a small genetic footprint. Among the more sought after abilities of RNA devices are the capacity to sense external stimuli and te a genetic or phenotypic response in the absence of additional protein factors. s have focused on creating tic riboswitches, aptazymes and fluorogenic RNA sensors, yet their potential has yet to be fully realized in part due to the limited availability of RNA sensing domains. The compositions and methods provided herein surprisingly trate that using naturally evolved riboswitches or ribozymes as scaffolds for selection can produce a robust sensing domain capable of functioning both in vitro and in the cellular context, on par with the best artificial and natural aptamers to date.

A key strength of the compositions and methods described herein is the use of multiple scaffolds in parallel selections to obtain a suite of rs. While the aptamers derived from different scaffolds have similar affinities for 5HTP and selectivity against L- tryptophan, they clearly have distinct characteristics with respect to their ability to communicate with a readout domain via the P1 helix, a common feature to all of the scaffolds. t being bound by theory, it is hypothesized that this is due to variation in the spatial relationship between the ligand and the interdomain (P1) helix, a feature that cannot be fully controlled in the selection. In biological riboswitches, the ligand is either in direct contact or induces conformational changes in the RNA that e the P1 helix that links the aptamer to the downstream regulatory switch.

With a suite of aptamers, combinatorial approaches can be employed to rapidly screen for the sensors with desired ties without the extensive aptamer characterization or the device optimization. Typically, the traditional in vitro selection only yields a single small molecule g aptamer, and development of an RNA device es screening many communication modules and adaptor sequences while leaving the sensory aptamer as a fixed node. With the scaffolded selection ch, a set of distinct aptamers can be combinatorially d to a set of communication modules and y screened for variants with the desired activity. In this fashion, this approach should eliminate the key bottleneck in the development of RNA devices and sensors. Notably, while in this study only the most populous rs were focused on in each selection for characterization and sensor design, within each selection there were many clusters containing alternative sequences that could further enrich the l pool of aptamers for developing downstream applications.

A second powerful advantage of this selection gy is the robust folding in the cellular context provided by the tertiary interaction of the oligonucleotide junction architecture (e.g., the three-way junction). Each of these aptamers has a fold that has undergone extensive biological evolution and, in ular, the distal tertiary interactions that organize the three-way junction core are highly stable. Both the L2-L3 ction of purine riboswitch and the tetraloop-tetraloop receptor of the cyclic di-GMP riboswitch scaffold are capable of stably forming outside of the context of other RNA structure. This enables these ts to potentially guide the folding of all members of the initial library such that the vast ty of the population contains the prescribed secondary and tertiary structure.

Misfolding is often a significant problem for traditional synthetic aptamers, which can be greatly exacerbated when the RNA element is coupled to another or placed in the context of a larger RNA. Since there is no significant selection pressure for high fidelity g in a typical selection protocol, providing this information in the starting library can be a path s robust folding RNAs.

While the three-way junction scaffolds is exemplified herein, the diversity of natural riboswitches and ribozymes can provide further feedstock for this approach. Within the three-way junction family, there is a broad array of sequences that vary the orientation of the three helices, size of the joining regions, and the nature of the distal tertiary interaction that may provide superior scaffolds for a particular ligand or sensor. Furthermore, other folds may be predisposed to bind a target small molecule based on the nature of the e ligand.

For example, another choice for a scaffold to bind 5HTP is the lysine riboswitch aptamer domain that contains a ay junction that houses the ligand binding site and positions it adjacent to the P1 helix. Larger s may be more easily accommodated by flavin mononucleotide or min itch derived scaffolds, while dinucleotides such as NADH may be readily odated by one of the other di-cyclic nucleotide aptamers.

Thus, the scaffolded selection approaches described herein have the potential to facilitate the development of powerful new tools for monitoring and responding to small molecules in the cellular environment across a broad range of applications using RNA devices.

Generally, nomenclature used in connection with cell and tissue culture, lar biology, immunology, microbiology, genetics and protein and nucleic acid try and hybridization described herein are those well-known and commonly used in the art. The methods and techniques provided herein are lly performed according to conventional methods well known in the art and as described in various general and more specific references that are cited and discussed throughout the present specification unless otherwise indicated.

Enzymatic reactions and purification techniques are performed according to manufacturer’s ications, as commonly accomplished in the art or as described herein.

The nomenclature used in connection with, and the laboratory procedures and techniques of, analytical chemistry, synthetic organic chemistry, and medicinal and pharmaceutical chemistry described herein are those well-known and commonly used in the art. Standard ques are used for chemical syntheses, chemical analyses, pharmaceutical preparation, ation, and delivery, and treatment of patients.

Unless otherwise defined herein, scientific and technical terms used herein have the meanings that are commonly understood by those of ordinary skill in the art. In the event of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Unless otherwise required by context, ar terms shall include pluralities and plural terms shall include the singular. The use of “or” means “and/or” unless stated otherwise. The use of the term “including,” as well as other forms, such as “includes” and “included,” is not limiting.

So that the invention may be more y understood, certain terms are first defined.

The terms “aptamer” and “aptamer domain” refer to short, -stranded DNA, RNA or peptide sequences that specifically bind to various molecular targets such as small molecules, proteins, nucleic acids, cells, s and the like with high specificity and affinity.

Aptamers are generally highly specific, relatively small in size, and non-immunogenic.

Similar to antibodies, aptamers interact with their targets by recognizing a specific threedimensional structure and are thus also known as “chemical antibodies.” In contrast to n antibodies, DNA or RNA aptamers offer unique chemical and biological characteristics based on their oligonucleotide properties.

The term “riboswitch” refers to an element commonly found in the 5′-untranslated region of mRNAs that exerts its tory control over the transcript in a shion by directly binding a small molecule ligand. The typical riboswitch contains two ct onal domains: an aptamer domain, which adopts a compact three-dimensional fold to ld the ligand binding pocket; and an expression platform, which contains a secondary structural switch that interfaces with the transcriptional or translational machinery.

Regulation is achieved by virtue of a region of overlap n these two domains, known as the switching sequence, whose pairing directs folding of the RNA into one of two mutually exclusive structures in the sion platform that represent the on and off states of the mRNA. In certain exemplary embodiments, a preferred riboswitch is the B. is xpt-pbuX guanine riboswitch (referred to herein as “GR”) or the Vibrio cholerae Vc2 cyclic di-GMP riboswitch (referred to herein as “CDG”).

The term “ribozyme” refers to an RNA molecule that acts as an enzyme and is capable of catalyzing ic biochemical reactions, similar to the action of protein enzymes.

Ribozyme classes include GIR1 branching ribozyme, glmS ribozyme, Group I self-splicing intron, Group II self-splicing intron, hairpin ribozyme, hammerhead ribozyme and HDV ribozyme. In certain exemplary embodiments, a preferred ribozyme is Schistosoma mansoni hammerhead ribozyme (referred to herein as “HH”).

The terms “synthetic RNA agent,” “synthetic DNA agent,” “biosensor” and “scaffold” refer to nucleic acid sensory devices described herein that comprise secondary and tertiary structural scaffolds derived from aptamers that exist in nature, e.g., from riboswitches and ribozymes. A “synthetic RNA agent,” “synthetic DNA ” “biosensor” or “structural ld” of the invention includes a helix domain, first and second hairpin domains, and an oligonucleotide junction that contains a ligand-binding domain. A biosensor of the invention can comprise an N-way junction, wherein N is 2, 3, 4 or 5.

In certain embodiments, a biosensor of the invention comprises a sequence having a series of linked ents according to Formula I: (I) P1-J1/2-P2-L2-P2’-J2/3-P3-L3-P3’- J3/1-, n “-” represents a bond, “P1” and “P1'” form the helix, “P2,” “L2” and “P2'” form the first hairpin, “P3,” “L3” and “P3'” form the second n, and “J1/2,” “J2/3” and “J3/1” together form the oligonucleotide junction. (See, e.g., Figure 1.) In n embodiments, a biosensor of the invention comprises a “read-out” module by which the specificity and/or affinity of a module for a ligand can be determined. A “readout” can be visually detectable, e.g., a genic read-out, such as, e.g. Broccoli, or can be a riboswitch-based read-out or an oligonucleotide-based readout, as described further herein.

The term “helix domain” refers to two or more cleotides that are held together, e.g., by en, Hoogsteen or reversed Hoogsteen bonds, thus forming a double helix or a triple helix structure.

The term “hairpin domain” refers to the ability of a polynucleotide to base pair with itself such that the 5' end and the 3' end of the polynucleotide are t into proximity to one another and are linked by a non-hybridizing portion of the cleotide that forms a loop structure.

The term “oligonucleotide junction” refers to two, three, four or five regions of a scaffold that form a ligand binding site, e.g., a Gua binding site. (See, e.g., Figure 1.) The term “oligonucleotide library” refers to a collection of synthetic oligonucleotide sequences, each sequence comprising a structural scaffold of the invention, wherein each structural scaffold includes at least a helix domain, first and second hairpin domains, and an oligonucleotide junction that contains a ligand-binding domain.

In certain ary embodiments, assays for screening ligands or test compounds which bind to a sor of the invention are ed. The test compounds of the present invention can be obtained using any of the us approaches in combinatorial library methods known in the art, including: biological libraries; spatially addressable parallel solid phase or solution phase libraries; tic library methods requiring olution; the “one-bead one-compound” library method; and synthetic library methods using affinity chromatography selection. The ical library ch is limited to peptide libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K. S. (1997) Anticancer Drug Des. 12:145).

The term “nucleoside” refers to a molecule having a purine or pyrimidine base covalently linked to a ribose or deoxyribose sugar. Exemplary nucleosides include ine, guanosine, cytidine, uridine and thymidine. Additional ary nucleosides include inosine, 1-methyl inosine, pseudouridine, 5,6-dihydrouridine, ribothymidine, 2N- methylguanosine and 2,2N,N-dimethylguanosine (also referred to as “rare” sides).

The term “nucleotide” refers to a nucleoside having one or more phosphate groups joined in ester linkages to the sugar moiety. Exemplary nucleotides include nucleoside monophosphates, phates and triphosphates. The terms “polynucleotide” and “nucleic acid molecule” are used interchangeably herein and refer to a polymer of nucleotides joined together by a phosphodiester linkage between 5' and 3' carbon atoms.

The term “RNA” or “RNA molecule” or ucleic acid molecule” refers to a polymer of ribonucleotides (e.g., 2, 3, 4, 5, 10, 15, 20, 25, 30, or more ribonucleotides). The term “DNA” or “DNA molecule” or “deoxyribonucleic acid le” refers to a polymer of deoxyribonucleotides. DNA and RNA can be synthesized naturally (e.g., by DNA replication or transcription of DNA, tively). RNA can be post-transcriptionally modified. DNA and RNA can also be chemically synthesized. DNA and RNA can be single-stranded (i.e., ssRNA and ssDNA, respectively) or multi-stranded (e.g., double stranded, i.e., dsRNA and dsDNA, respectively). “mRNA” or “messenger RNA” is single-stranded RNA that specifies the amino acid sequence of one or more ptide chains. This information is translated during protein synthesis when ribosomes bind to the mRNA.

The term “nucleotide analog” or “altered nucleotide” or “modified nucleotide” refers to a non-standard nucleotide, including non-naturally occurring cleotides or deoxyribonucleotides. Exemplary nucleotide analogs are modified at any position so as to alter certain chemical properties of the tide yet retain the ability of the nucleotide analog to perform its intended function. Examples of positions of the nucleotide which may be derivatized include the 5 position, e.g., 5-(2-amino)propyl uridine, 5-bromo uridine, 5- propyne e, 5-propenyl e, etc.; the 6 position, e.g., 6-(2-amino)propyl uridine; the tion for ine and/or guanosines, e.g., 8-bromo guanosine, 8-chloro guanosine, 8- fluoroguanosine, etc. Nucleotide analogs also include deaza nucleotides, e.g., a- adenosine; O- and N-modified (e.g., ted, e.g., hyl adenosine, or as otherwise known in the art) nucleotides; and other heterocyclically modified nucleotide analogs such as those described in Herdewijn, Antisense Nucleic Acid Drug Dev., 2000 Aug. 10(4):297-310.

Nucleotide analogs may also comprise modifications to the sugar portion of the nucleotides. For example the 2' OH-group may be replaced by a group selected from H, OR, R, F, Cl, Br, I, SH, SR, NH2, NHR, NR2, COOR, or OR, wherein R is substituted or unsubstituted C1-C6 alkyl, alkenyl, alkynyl, aryl, etc. Other possible modifications include those described in U.S. Pat. Nos. 5,858,988, and 6,291,438.

The phosphate group of the nucleotide may also be modified, e.g., by substituting one or more of the s of the phosphate group with sulfur (e.g., phosphorothioates), or by making other substitutions which allow the nucleotide to perform its intended function such as described in, for example, Eckstein, Antisense Nucleic Acid Drug Dev. 2000 Apr. (2):117-21, Rusckowski et al. Antisense Nucleic Acid Drug Dev. 2000 Oct. 10(5):333-45, Stein, Antisense Nucleic Acid Drug Dev. 2001 Oct. 11(5): , Vorobjev et al. Antisense Nucleic Acid Drug Dev. 2001 Apr. 11(2):77-85, and U.S. Pat. No. 5,684,143. Certain of the above-referenced modifications (e.g., phosphate group modifications) preferably se the rate of hydrolysis of, for e, polynucleotides comprising said analogs in vivo or in vitro.

In certain exemplary embodiments, a detectable label can be used to detect one or more ucleotides and/or polynucleotides described herein. Examples of detectable markers e various ctive moieties, enzymes, prosthetic groups, fluorescent markers, luminescent markers, bioluminescent markers, metal particles, protein-protein binding pairs, protein-antibody binding pairs and the like. Examples of fluorescent proteins include, but are not limited to, yellow fluorescent protein (YFP), green fluorescent protein (GFP), cyan fluorescent protein (CFP), umbelliferone, fluorescein, fluorescein isothiocyanate, ine, dichlorotriazinylamine fluorescein, dansyl chloride, phycoerythrin and the like. Examples of bioluminescent markers include, but are not limited to, luciferase (e.g., bacterial, firefly, click beetle and the like), luciferin, in and the like. Examples of enzyme systems having visually detectable signals e, but are not limited to, osidases, glucorinidases, phosphatases, peroxidases, cholinesterases and the like.

Identifiable markers also include radioactive compounds such as 125I, 35S, 14C or 3H.

Identifiable markers are commercially ble from a variety of s.

Fluorescent labels and their attachment to nucleotides and/or oligonucleotides are described in many reviews, including Haugland, Handbook of Fluorescent Probes and ch als, Ninth Edition (Molecular Probes, Inc., Eugene, 2002); Keller and Manak, DNA , 2nd Edition (Stockton Press, New York, 1993); Eckstein, editor, Oligonucleotides and Analogues: A Practical Approach (IRL Press, Oxford, 1991); and , Critical Reviews in Biochemistry and Molecular Biology, 26:227-259 (1991).

Particular methodologies applicable to the invention are disclosed in the following sample of references: U.S. Pat. Nos. 4,757,141, 507 and 519. In one aspect, one or more fluorescent dyes are used as labels for d target ces, e.g., as disclosed by U.S. Pat.

No. 5,188,934 (4,7-dichlorofluorescein dyes); U.S. Pat. No. 5,366,860 (spectrally resolvable rhodamine dyes); U.S. Pat. No. 5,847,162 (4,7-dichlororhodamine dyes); U.S. Pat. No. 4,318,846 (ether-substituted fluorescein dyes); U.S. Pat. No. 5,800,996 (energy transfer dyes); Lee et al.; U.S. Pat. No. 5,066,580 (xanthine dyes); U.S. Pat. No. 5,688,648 (energy transfer dyes); and the like. Labelling can also be carried out with quantum dots, as disclosed in the ing patents and patent publications: U.S. Pat. Nos. 6,322,901, 291, 6,423,551, 6,251,303, 6,319,426, 6,426,513, 6,444,143, 5,990,479, 6,207,392, 2002/0045045 and 2003/0017264. As used herein, the term “fluorescent label” includes a signaling moiety that conveys information through the scent absorption and/or emission properties of one or more molecules. Such fluorescent properties include fluorescence intensity, fluorescence lifetime, emission spectrum characteristics, energy transfer, and the like.

The term “oligonucleotide” refers to a short polymer of nucleotides and/or nucleotide analogs. The term “RNA analog” or “DNA analogue” refers to a polynucleotide (e.g., a chemically synthesized polynucleotide) having at least one altered or modified nucleotide as compared to a corresponding unaltered or unmodified DNA or RNA but retaining the same or similar nature or function as the corresponding unaltered or unmodified DNA or RNA. As sed above, the oligonucleotides may be linked with linkages which result in a lower rate of hydrolysis of the RNA or DNA analog as compared to an RNA or DNA molecule with phosphodiester linkages. For example, the nucleotides of the analog may comprise methylenediol, ethylene diol, oxymethylthio, oxyethylthio, oxycarbonyloxy, phosphorodiamidate, phosphoroamidate, and/or phosphorothioate linkages. Preferred RNA or DNA analogues include sugar- and/or backbone-modified ribonucleotides and/or ibonucleotides. Such alterations or modifications can further include addition of non- tide al, such as to the end(s) of the RNA or DNA or internally (at one or more nucleotides of the RNA or DNA).

As used herein, the term “isolated RNA” or “isolated DNA” refers to RNA or DNA molecules which are substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically sized.

The term “in vitro” has its art recognized meaning, e.g., involving purified reagents or extracts, e.g., cell extracts. The term “in vivo” also has its art recognized meaning, e.g., involving living cells, e.g., immortalized cells, primary cells, cell lines, and/or cells in an organism.

As used , the term “transgene” refers to any nucleic acid molecule, which is inserted by artifice into a cell, and becomes part of the genome of the organism that develops from the cell. Such a transgene may include a gene that is partly or entirely heterologous (i.e., foreign) to the transgenic sm, or may ent a gene homologous to an endogenous gene of the organism. The term “transgene” also means a nucleic acid molecule that includes one or more selected nucleic acid sequences, e.g., DNAs, that encode one or more engineered RNA precursors, to be expressed in a transgenic organism, e.g., animal, which is partly or entirely heterologous, i.e., foreign, to the transgenic animal, or homologous to an endogenous gene of the transgenic animal, but which is designed to be inserted into the animal’s genome at a location which differs from that of the natural gene. A transgene includes one or more promoters and any other DNA, such as s, necessary for expression of the selected nucleic acid sequence, all operably linked to the selected sequence, and may e an enhancer sequence.

A gene “involved” in a disease or disorder includes a gene, the normal or aberrant expression or function of which effects or causes the disease or er or at least one symptom of said e or disorder.

As used herein, the term “sample population” refers to a population of individuals comprising a statistically significant number of individuals. For example, the sample population may comprise 50, 75, 100, 200, 500, 1000 or more individuals. In particular embodiments, the sample population may comprise individuals which share at least on common disease phenotype (e.g., a gain-of-function disorder) or mutation (e.g., a gain-offunction mutation).

As used herein, the term “heterozygosity” refers to the fraction of individuals within a population that are heterozygous (e.g., contain two or more different s) at a particular locus (e.g., at a SNP). Heterozygosity may be calculated for a sample population using methods that are well known to those skilled in the art.

The phrase “examining the function of a gene in a cell or organism” refers to examining or studying the expression, ty, function or phenotype arising therefrom.

As used herein, the term “rare nucleotide” refers to a naturally occurring nucleotide that occurs infrequently, including naturally occurring deoxyribonucleotides or ribonucleotides that occur infrequently, e.g., a naturally ing ribonucleotide that is not guanosine, adenosine, cytosine, or uridine. Examples of rare nucleotides include, but are not limited to, e, 1-methyl inosine, pseudouridine, 5,6-dihydrouridine, ribothymidine, 2N- methylguanosine and 2,2N,N-dimethylguanosine.

The term “engineered,” as in an engineered RNA precursor, or an engineered nucleic acid molecule, indicates that the sor or molecule is not found in nature, in that all or a portion of the nucleic acid sequence of the precursor or molecule is d or selected by a human. Once created or selected, the sequence can be replicated, translated, transcribed, or otherwise processed by mechanisms within a cell. Thus, an RNA precursor produced within a cell from a transgene that includes an engineered nucleic acid molecule is an engineered RNA precursor.

As used herein, the term “bond strength” or “base pair strength” refers to the strength of the interaction between pairs of nucleotides (or nucleotide analogs) on opposing strands of an ucleotide , due primarily to H-bonding, van der Waals interactions, and the like between said nucleotides (or nucleotide analogs).

As used herein the term “destabilizing nucleotide” refers to a first tide or nucleotide analog capable of forming a base pair with second tide or tide analog such that the base pair is of lower bond strength than a conventional base pair (i.e., Watson- Crick base pair). In certain ments, the ilizing nucleotide is capable of forming a mismatch base pair with the second nucleotide. In other embodiments, the destabilizing nucleotide is capable of forming a wobble base pair with the second nucleotide. In yet other embodiments, the destabilizing nucleotide is capable of g an ambiguous base pair with the second nucleotide. In yet another embodiment, the destabilizing nucleotide is capable of g a bulge, n the destabilizing nucleotide does not pair with the second nucleotide.

As used , the term “base pair” refers to the interaction between pairs of nucleotides (or nucleotide analogs) on opposing s of an oligonucleotide duplex, due primarily to H-bonding, van der Waals interactions, and the like between said nucleotides (or nucleotide analogs). As used herein, the term “bond strength” or “base pair strength” refers to the strength of the base pair.

As used herein, the term “mismatched base pair” refers to a base pair consisting of non-complementary or non-Watson-Crick base pairs, for example, not normal complementary G:C, A:T or A:U base pairs. As used herein the term uous base pair” (also known as a non-discriminatory base pair) refers to a base pair formed by a universal nucleotide.

As used herein, term “universal nucleotide” (also known as a “neutral nucleotide”) include those nucleotides (e.g. certain destabilizing nucleotides) having a base (a “universal base” or “neutral base”) that does not significantly discriminate between bases on a complementary polynucleotide when forming a base pair. Universal nucleotides are predominantly hydrophobic molecules that can pack efficiently into antiparallel duplex nucleic acids (e.g., double-stranded DNA or RNA) due to stacking interactions. The base portions of universal nucleotides lly comprise a nitrogen-containing aromatic heterocyclic moiety.

As used herein, the terms cient complementarity” or “sufficient degree of complementarity” mean that an ucleotide sequence is sufficiently complementary to bind a desired target oligonucleotide.

Various methodologies of the instant invention include step that involves comparing a value, level, feature, characteristic, property, etc. to a “suitable control,” referred to interchangeably herein as an priate control.” A “suitable control” or “appropriate control” is any control or standard ar to one of ordinary skill in the art useful for comparison es. In one embodiment, a “suitable control” or priate control” is a value, level, feature, characteristic, property, etc. determined prior to performing a methodology, as described herein. For example, a transcription rate, mRNA level, translation rate, protein level, biological activity, cellular characteristic or property, genotype, phenotype, etc. can be determined prior to introducing a synthetic RNA or DNA agent of the invention into a cell or sm. In r embodiment, a “suitable control” or “appropriate l” is a value, level, feature, characteristic, property, etc. determined in a cell or organism, e.g., a control or normal cell or organism, exhibiting, for example, normal traits. In yet another embodiment, a “suitable control” or “appropriate l” is a predefined value, level, feature, characteristic, property, etc. synthetic RNA or DNA agents of the invention may be directly introduced into the cell (i.e., intracellularly), or introduced extracellularly into a cavity, titial space, into the circulation of an organism, introduced orally, or may be introduced by bathing a cell or organism in a on containing the nucleic acid. Vascular or ascular ation, the blood or lymph system, and the cerebrospinal fluid are sites where synthetic RNA or DNA agent may be introduced.

The synthetic RNA or DNA agents of the invention can be introduced using nucleic acid delivery methods known in art including ion of a solution containing the nucleic acid, bombardment by particles covered by the RNA agent, g the cell or organism in a solution of the RNA agent, or electroporation of cell membranes in the presence of the RNA agent. Other methods known in the art for introducing nucleic acids to cells may be used, such as lipid-mediated carrier transport, chemical-mediated transport, and cationic liposome transfection such as calcium phosphate, and the like. The synthetic RNA or DNA agent may be introduced along with other components that perform one or more of the following activities: enhance nucleic acid uptake by the cell or otherwise increase inhibition of the target gene.

Physical methods of introducing nucleic acids e injection of a solution containing the biosensor, bombardment by particles covered by the biosensor, g the cell or organism in a solution of the biosensor, or electroporation of cell membranes in the presence of the biosensor. A viral construct ed into a viral le would accomplish both efficient introduction of an expression construct into the cell and transcription of RNA encoded by the sion construct. Other methods known in the art for introducing nucleic acids to cells may be used, such as lipid-mediated carrier transport, chemical-mediated ort, such as calcium phosphate, and the like. Thus the RNA may be introduced along with components that perform one or more of the following activities: enhance RNA uptake by the cell, inhibit annealing of single strands, stabilize the single strands, or other-wise increase inhibition of the target gene.

The synthetic RNA or DNA agent may be directly introduced into the cell (i.e., intracellularly), or introduced extracellularly into a , titial space, into the circulation of an organism, introduced orally, or may be introduced by bathing a cell or organism in a solution containing the RNA or DNA. Vascular or extravascular circulation, the blood or lymph system, and the cerebrospinal fluid are sites where the RNA or DNA may be introduced.

A target cell may be from the germ line or somatic, totipotent or pluripotent, dividing or non-dividing, parenchyma or epithelium, immortalized or transformed, or the like. The cell may be a stem cell or a differentiated cell. Cell types that are differentiated include adipocytes, fibroblasts, myocytes, cardiomyocytes, endothelium, neurons, glia, blood cells, megakaryocytes, lymphocytes, macrophages, neutrophils, eosinophils, basophils, mast cells, leukocytes, ocytes, keratinocytes, chondrocytes, osteoblasts, osteoclasts, hepatocytes, and cells of the endocrine or exocrine glands.

The synthetic RNA or DNA agent may be introduced in an amount which allows delivery of at least one copy per cell. Higher doses (e.g., at least 5, 10, 100, 500 or 1000 copies per cell) of material may yield more ive inhibition; lower doses may also be useful for ic applications.

In an exemplary aspect, the efficacy of a sor of the invention is tested for its ability to specifically modulate transcription, translation, alternative ng and/or mRNA stability of a target in a cell. Cells can be transfected with one or more biosensors described herein. Selective reduction in target DNA, target RNA (e.g., mRNA) and/or target protein is ed. Reduction of target DNA, RNA or protein can be compared to levels of target DNA, RNA or protein in the absence of a biosensor or in the presence of a biosensor that does not target the DNA, RNA or protein. Exogenously-introduced DNA, RNA or protein can be assayed for comparison purposes. When utilizing neuronal cells, which are known to be somewhat resistant to standard transfection techniques, it may be ble to uce biosensors by passive uptake.

“Treatment,” or “treating,” as used , is defined as the application or administration of a therapeutic agent (e.g., a synthetic RNA or DNA agent) to a patient, or application or administration of a therapeutic agent to an isolated tissue or cell line from a patient, who has the disease or disorder, a symptom of disease or disorder or a predisposition toward a disease or disorder, with the purpose to cure, heal, ate, relieve, alter, remedy, ameliorate, improve or affect the disease or disorder, the symptoms of the disease or disorder, or the predisposition toward disease.

In one aspect, the invention provides a method for preventing in a subject, a disease or disorder, by administering to the subject a therapeutic agent (e.g., a synthetic RNA or DNA agent or vector or transgene ng same). ts at risk for the e can be identified by, for example, any or a combination of diagnostic or prognostic. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the disease or disorder, such that the disease or disorder is prevented or, alternatively, delayed in its ssion.

Another aspect of the invention pertains to methods treating subjects therapeutically, i.e., alter onset of symptoms of a disease or disorder. In an exemplary embodiment, the modulatory method of the invention involves contacting a cell expressing disorder with a therapeutic agent (e.g., a synthetic RNA or DNA agent or vector or transgene encoding same) that is specific for one or more target sequences, such that a sequence ic interaction with the target sequence is ed. These methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject).

With regards to both prophylactic and therapeutic methods of treatment, such treatments may be specifically tailored or modified, based on dge obtained from the field of pharmacogenomics. “Pharmacogenomics,” as used herein, refers to the application of genomics technologies such as gene sequencing, statistical genetics, and gene expression analysis to drugs in clinical development and on the market. More specifically, the term refers the study of how a patient's genes determine his or her response to a drug (e.g., a patient’s “drug response ype,” or “drug response genotype”). Thus, another aspect of the invention provides methods for tailoring an dual's prophylactic or therapeutic treatment with either the target gene molecules of the present invention or target gene tors according to that individual's drug response genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or eutic ents to patients who will most benefit from the treatment and to avoid ent of patients who will experience toxic drug-related side effects.

Therapeutic agents can be tested in an appropriate animal model. For example, a tic RNA or DNA agent (or expression vector or transgene encoding same) as described herein can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with said agent. Alternatively, a therapeutic agent can be used in an animal model to determine the mechanism of action of such an agent. For example, an agent can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent can be used in an animal model to determine the mechanism of action of such an agent.

A pharmaceutical composition containing a synthetic RNA or DNA agent of the ion can be administered to any t diagnosed as having or at risk for developing a disorder. In one embodiment, the patient is sed as having a disorder, and the patient is otherwise in general good health. For example, the patient is not terminally ill, and the patient is likely to live at least 2, 3, 5 or more years following diagnosis. The patient can be treated immediately following diagnosis, or treatment can be delayed until the patient is experiencing more debilitating symptoms. In another embodiment, the patient has not reached an advanced stage of the disease.

An a synthetic RNA or DNA agent can be administered at a unit dose less than about 1.4 mg per kg of bodyweight, or less than 10, 5, 2, 1, 0.5, 0.1, 0.05, 0.01, 0.005, 0.001, 0.0005, 0.0001, 0.00005 or 0.00001 mg per kg of bodyweight, and less than 200 nmole of RNA agent (e.g., about 4.4 x 1016 copies) per kg of ight, or less than 1500, 750, 300, 150, 75, 15, 7.5, 1.5, 0.75, 0.15, 0.075, 0.015, 0.0075, , 5, 0.00015 nmole of a synthetic RNA or DNA agent per kg of bodyweight. The unit dose, for example, can be stered by injection (e.g., intravenous or intramuscular, intrathecally, or directly into the brain), an inhaled dose, or a topical application. Particularly red dosages are less than 2, 1, or 0.1 mg/kg of body weight.

Delivery of a synthetic RNA or DNA agent directly to an organ can be at a dosage on the order of about 0.00001 mg to about 3 mg per organ, or preferably about -0.001 mg per organ, about 0.03-3.0 mg per organ, about 0.1-3.0 mg per eye or about 0.3-3.0 mg per organ. The dosage can be an amount ive to treat or prevent a disorder. In one embodiment, the unit dose is administered less frequently than once a day, e.g., less than every 2, 4, 8 or 30 days. In another embodiment, the unit dose is not administered with a frequency (e.g., not a regular frequency). For example, the unit dose may be administered a single time. In one embodiment, the effective dose is administered with other traditional therapeutic modalities.

In one embodiment, a subject is administered an initial dose, and one or more maintenance doses of a synthetic RNA or DNA agent. The maintenance dose or doses are generally lower than the l dose, e.g., one-half less of the initial dose. A maintenance regimen can include treating the subject with a dose or doses ranging from 0.01 g to 1.4 mg/kg of body weight per day, e.g., 10, 1, 0.1, 0.01, 0.001, or 0.00001 mg per kg of bodyweight per day. The maintenance doses are preferably administered no more than once every 5, 10, or 30 days. Further, the treatment regimen may last for a period of time which will vary depending upon the nature of the ular disease, its ty and the overall condition of the patient. In preferred embodiments the dosage may be delivered no more than once per day, e.g., no more than once per 24, 36, 48, or more hours, e.g., no more than once every 5 or 8 days. Following treatment, the patient can be monitored for changes in his condition and for ation of the ms of the disease state. The dosage of the compound may either be sed in the event the patient does not respond icantly to current dosage levels, or the dose may be decreased if an alleviation of the symptoms of the disease state is observed, if the disease state has been ablated, or if undesired ffects are observed.

The effective dose can be administered in a single dose or in two or more doses, as desired or considered appropriate under the specific stances. If desired to facilitate repeated or nt ons, implantation of a delivery , e.g., a pump, semi- permanent stent (e.g., intravenous, intraperitoneal, intracisternal or intracapsular), or reservoir may be advisable. In one embodiment, a pharmaceutical composition includes a plurality of a synthetic RNA or DNA agent species. In another embodiment, the a synthetic RNA or DNA agent species has sequences that are non-overlapping and jacent to another species with respect to a naturally occurring target sequence. In another embodiment, the plurality of a synthetic RNA or DNA agent species is specific for different naturally occurring targets. In another embodiment, the ity of a synthetic RNA or DNA agent species target two or more target sequences (e.g., two, three, four, five, six, or more target sequences). ing successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the compound of the invention is administered in maintenance doses, ranging from 0.01 g to 100 g per kg of body weight (see U.S. Pat. No. 6,107,094).

The concentration of the a synthetic RNA or DNA agent composition is an amount sufficient to be effective in treating or preventing a er or to regulate a physiological condition in humans. The concentration or amount of a synthetic RNA or DNA agent administered will depend on the parameters determined for the agent and the method of administration, e.g. nasal, buccal, or pulmonary. For example, nasal formulations tend to e much lower concentrations of some ingredients in order to avoid irritation or burning of the nasal passages. It is sometimes desirable to dilute an oral formulation up to 10-100 times in order to provide a suitable nasal formulation.

Certain s may influence the dosage required to effectively treat a subject, including but not limited to the severity of the disease or disorder, previous treatments, the general health and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a therapeutically effective amount of an a synthetic RNA or DNA agent can include a single treatment or, preferably, can e a series of treatments. It will also be appreciated that the effective dosage of a synthetic RNA or DNA agent for treatment may increase or decrease over the course of a particular treatment. Changes in dosage may result and become apparent from the s of diagnostic assays as described herein. For example, the subject can be monitored after administering a tic RNA or DNA agent composition.

Based on information from the monitoring, an additional amount of the synthetic RNA or DNA agent composition can be administered.

Dosing is dependent on severity and responsiveness of the disease condition to be d, with the course of treatment g from several days to l months, or until a cure is effected or a diminution of disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. s of ordinary skill can easily determine m dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual compounds, and can generally be estimated based on EC50s found to be effective in in vitro and in vivo animal models.

The invention pertains to uses of the above-described agents for lactic and/or therapeutic treatments as described Infra. Accordingly, the modulators (e.g., synthetic RNA or DNA agents) of the t invention can be incorporated into pharmaceutical compositions suitable for administration. Such compositions typically comprise the nucleic acid molecule, protein, antibody, or modulatory compound and a pharmaceutically acceptable carrier. As used herein the ge “pharmaceutically acceptable carrier” is intended to include any and all solvents, dispersion media, coatings, antibacterial and ngal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical stration. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.

A pharmaceutical composition of the invention is formulated to be compatible with its intended route of administration. Examples of routes of administration include parenteral, e.g., intravenous (IV), intradermal, subcutaneous (SC or SQ), intraperitoneal, intramuscular, oral (e.g., inhalation), transdermal (topical), and transmucosal administration. Solutions or suspensions used for eral, intradermal, or subcutaneous application can e the following components: a sterile diluent such as water for injection, saline on, fixed oils, polyethylene glycols, glycerine, propylene glycol or other tic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The eral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, iostatic water, Cremophor ELTM (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be e and should be fluid to the extent that easy syringability . It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof.

The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various cterial and antifungal agents, for example, ns, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars, cohols such as mannitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active compound in the ed amount in an appropriate solvent with one or a ation of ingredients enumerated above, as ed, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle which contains a basic dispersion medium and the ed other ients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable ons, the preferred methods of ation are vacuum drying and freeze-drying which yields a powder of the active ingredient plus any additional desired ingredient from a previously e-filtered solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic stration, the active nd can be incorporated with excipients and used in the form of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as part of the ition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as carbon e, or a nebulizer.

Systemic administration can also be by transmucosal or ermal means. For transmucosal or transdermal administration, penetrants riate to the barrier to be permeated are used in the formulation. Such penetrants are lly known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. For ermal administration, the active compounds are ated into ointments, salves, gels, or creams as generally known in the art.

The compounds can also be prepared in the form of suppositories (e.g., with conventional itory bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery.

The synthetic RNA or DNA agent can also be administered by transfection or ion using methods known in the art, including but not limited to the methods described in McCaffrey et al. (2002), Nature, 93), 38-9 (hydrodynamic ection); Xia et al. (2002), Nature Biotechnol., 20(10), 1006-10 (viral-mediated delivery); or Putnam (1996), Am. J. Health Syst. Pharm. 53(2), 151-160, erratum at Am. J. Health Syst. Pharm. 53(3), 325 (1996).

The synthetic RNA or DNA agent can also be administered by any method suitable for administration of nucleic acid agents, such as a DNA e. These methods include gene guns, bio injectors, and skin patches as well as needle-free methods such as the microparticle DNA vaccine technology disclosed in U.S. Pat. No. 6,194,389, and the ian transdermal needle-free vaccination with powder-form e as disclosed in U.S. Pat. No. 6,168,587. Additionally, intranasal delivery is le, as described in, inter alia, Hamajima et al. (1998), Clin. l. Immunopathol., 88(2), 205-10. Liposomes (e.g., as described in U.S. Pat. No. 6,472,375) and microencapsulation can also be used. Biodegradable able microparticle delivery s can also be used (e.g., as described in U.S. Pat. No. 6,471,996).

In one embodiment, the active compounds are prepared with carriers that will protect the compound against rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent to those d in the art. The materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral ns) can also be used as pharmaceutically acceptable carriers. These can be prepared according to methods known to those skilled in the art, for example, as bed in U.S. Pat. No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of active compound calculated to produce the desired therapeutic effect in ation with the required pharmaceutical carrier.

The specification for the dosage unit forms of the invention are dictated by and directly dependent on the unique characteristics of the active compound and the particular therapeutic effect to be achieved, and the limitations inherent in the art of nding such an active compound for the treatment of individuals.

Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical ures in cell es or mental animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and eutic effects is the eutic index and it can be expressed as the ratio LD50/ED50. Compounds that exhibit large therapeutic indices are preferred. Although compounds that exhibit toxic side effects may be used, care should be taken to design a delivery system that targets such compounds to the site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, reduce side s.

The data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that e the ED50 with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. For any compound used in the method of the invention, the therapeutically effective dose can be ted initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating plasma concentration range that includes the EC50 (i.e., the concentration of the test compound which achieves a aximal response) as determined in cell culture. Such information can be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for example, by high performance liquid chromatography.

The pharmaceutical compositions can be included in a container, pack or dispenser together with optional instructions for administration.

As d , a therapeutically effective amount of synthetic RNA or DNA agent (i.e., an effective dosage) depends on the synthetic RNA or DNA agent ed. For instance, single dose amounts in the range of approximately 1 g to 1000 mg may be administered; in some embodiments, 10, 30, 100 or 1000g may be administered. In some embodiments, 1-5 g of the compositions can be administered. The compositions can be administered one from one or more times per day to one or more times per week; including once every other day. The skilled artisan will appreciate that certain factors may influence the dosage and timing required to effectively treat a t, including but not d to the severity of the disease or disorder, previous ents, the general health and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a therapeutically effective amount of a synthetic RNA or DNA agent can include a single treatment or, preferably, can include a series of treatments.

The nucleic acid molecules of the invention can be inserted into expression ucts, e.g., viral vectors, retroviral vectors, expression cassettes, or plasmid viral vectors, e.g., using methods known in the art, including but not d to those described in Xia et al., (2002), Supra. Expression constructs can be delivered to a subject by, for example, inhalation, orally, intravenous ion, local administration (see U.S. Pat. No. 5,328,470) or by stereotactic injection (see e.g., Chen et al. (1994), Proc. Natl. Acad. Sci. USA, 91, 3054- 3057). The pharmaceutical preparation of the delivery vector can include the vector in an acceptable diluent, or can comprise a slow release matrix in which the delivery vehicle is imbedded. Alternatively, where the complete delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells which produce the gene delivery system.

The route of delivery can be dependent on the er of the patient. In certain exemplary embodiments, a subject can be administered a synthetic RNA or DNA agent of the ion by IV or SC administration. In addition to a synthetic RNA or DNA agent of the invention, a patient can be administered a second therapy, e.g., a palliative therapy and/or disease-specific therapy. The secondary therapy can be, for example, symptomatic (e.g., for alleviating symptoms), protective (e.g., for slowing or halting disease progression), or restorative (e.g., for reversing the disease process).

In general, a synthetic RNA or DNA agent of the invention can be stered by any le method. As used herein, topical delivery can refer to the direct application of a synthetic RNA or DNA agent to any surface of the body, including the eye, a mucous membrane, surfaces of a body , or to any al surface. Formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, , and liquids. tional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. l administration can also be used as a means to selectively deliver the synthetic RNA or DNA agent to the epidermis or dermis of a subject, or to specific strata thereof, or to an underlying tissue.

Compositions for intrathecal or intraventricular administration may include sterile aqueous solutions which may also n s, diluents and other suitable additives.

Compositions for intrathecal or intraventricular administration preferably do not include a transfection reagent or an additional lipophilic moiety besides, for example, the lipophilic moiety attached to the synthetic RNA or DNA agent.

Formulations for parenteral administration may include sterile s solutions which may also contain buffers, diluents and other suitable additives. Intraventricular injection may be facilitated by an entricular catheter, for example, attached to a reservoir. For intravenous use, the total tration of solutes should be controlled to render the preparation isotonic.

A synthetic RNA or DNA agent of the invention can be administered to a subject by ary delivery. Pulmonary delivery compositions can be delivered by inhalation of a dispersion so that the composition within the dispersion can reach the lung where it can be readily absorbed through the alveolar region directly into blood circulation. Pulmonary delivery can be effective both for systemic delivery and for localized delivery to treat diseases of the lungs. ary delivery can be achieved by different approaches, including the use of nebulized, aerosolized, micellular and dry powder-based formulations. Delivery can be achieved with liquid nebulizers, aerosol-based inhalers, and dry powder dispersion devices. d-dose devices are preferred. One of the benefits of using an atomizer or inhaler is that the potential for contamination is minimized because the devices are ontained. Dry powder dispersion devices, for example, deliver drugs that may be readily formulated as dry powders. A synthetic RNA or DNA agent composition may be stably stored as lyophilized or spray-dried powders by itself or in ation with suitable powder carriers. The delivery of a composition for inhalation can be mediated by a dosing timing element which can include a timer, a dose counter, time ing device, or a time indicator which when incorporated into the device enables dose tracking, ance monitoring, and/or dose triggering to a t during administration of the aerosol medicament.

The types of pharmaceutical excipients that are useful as carriers include stabilizers such as Human Serum Albumin (HSA), bulking agents such as carbohydrates, amino acids and polypeptides; pH adjusters or buffers; salts such as sodium chloride; and the like. These rs may be in a crystalline or amorphous form or may be a mixture of the two.

Bulking agents that are ularly le include compatible carbohydrates, polypeptides, amino acids or combinations thereof. Suitable carbohydrates include monosaccharides such as galactose, D-mannose, sorbose, and the like; disaccharides, such as lactose, trehalose, and the like; cyclodextrins, such as oxypropyl--cyclodextrin; and polysaccharides, such as ose, maltodextrins, dextrans, and the like; alditols, such as mannitol, xylitol, and the like. A preferred group of carbohydrates includes lactose, trehalose, raffinose maltodextrins, and mannitol. le polypeptides include aspartame.

Amino acids include alanine and glycine, with glycine being preferred.

Suitable pH adjusters or buffers include organic salts prepared from organic acids and bases, such as sodium citrate, sodium ascorbate, and the like; sodium citrate is preferred.

A synthetic RNA or DNA agent of the invention can be administered by oral and nasal delivery. For e, drugs administered through these membranes have a rapid onset of action, provide therapeutic plasma , avoid first pass effect of hepatic metabolism, and avoid exposure of the drug to the hostile gastrointestinal (GI) environment. Additional ages include easy access to the membrane sites so that the drug can be applied, localized and removed easily. In one embodiment, a tic RNA or DNA agent administered by oral or nasal delivery has been modified to be capable of traversing the blood-brain r.

In one embodiment, unit doses or measured doses of a composition that include synthetic RNA or DNA agents are dispensed by an implanted device. The device can include a sensor that monitors a ter within a subject. For example, the device can include a pump, such as an osmotic pump and, optionally, associated electronics.

A synthetic RNA or DNA agent can be packaged in a viral natural capsid or in a chemically or enzymatically produced artificial capsid or structure derived therefrom.

In n other s, the invention provides kits that include a suitable container containing a pharmaceutical formulation of a synthetic RNA or DNA agent. In certain embodiments the individual ents of the pharmaceutical formulation may be provided in one container. Alternatively, it may be desirable to e the components of the pharmaceutical formulation separately in two or more containers, e.g., one container for a synthetic RNA or DNA agent preparation, and at least another for a carrier compound. The kit may be packaged in a number of different configurations such as one or more containers in a single box. The different components can be combined, e.g., according to instructions provided with the kit. The components can be combined according to a method described , e.g., to prepare and ster a pharmaceutical composition. The kit can also include a delivery device.

It will be readily apparent to those skilled in the art that other suitable modifications and adaptations of the s described herein may be made using suitable equivalents without departing from the scope of the embodiments disclosed herein. Having now described certain embodiments in detail, the same will be more clearly understood by reference to the following examples, which are included for purposes of illustration only and are not intended to be limiting.

Unless ise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials r or equivalent to those described herein can be used in the practice or testing of the present invention, suitable s and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by nce in their entirety. In case of conflict, the present specification, including definitions, will control. In on, the materials, methods, and examples are illustrative only and not intended to be limiting.

EXAMPLES Example 1: Scaffolding selection libraries with ation from biological RNAs ation of small biological RNAs with helix packing (i.e., tertiary folding) indicated two recurrent architectures that may be considered privileged scaffolds. The first is the H-type pseudoknot, which is broadly found in ical RNAs including small ribozyme ribosomal frame-shifting elements in viral mRNAs, and natural and synthetic rs.

However, from a design perspective this fold can be difficult to engineer. The other is the three-way junction (3WJ) supported by a remote tertiary ction that organizes the helical arrangement around the junction. This fold is more suitable for the design of RNA devices that incorporate aptamers as it positions a designable helical element (called the P1 helix) proximal to the ligand-binding site typically housed in the junction.

Within the three-way junction fold group, there are a large selection of potential candidates that could be used to scaffold an initial library of sequences for in vitro selection.

Three are ified herein: the aptamer domain of the B. subtilis xpt-pbuX guanine riboswitch (referred to herein as “GR”), the aptamer domain of the Vibrio cholerae Vc2 cyclic di-GMP riboswitch (referred to herein as , and the Schistosoma mansoni hammerhead ribozyme (referred to herein as “HH”) (Figure 1). In each of these parental RNA scaffolds, the junction hosts the key biological activity proximal to the P1 helix that can serve as the secondary structural bridge to a readout domain.

Starting libraries were designed to preserve the overall secondary and tertiary structure of the scaffold while randomizing a sufficient number of nucleotides in the junction to ensure te pool diversity such that winners emerge. All nucleotides in the g strands of the junction were randomized (equal populations of the four nucleotides at each position), as well as at least one base pair in each helix al to the junction (Figure 1).

For the GR scaffold, this yielded an initial y of 23 randomized nucleotide positions equating to a y size of about 7x1013 sequences (423 sequences); the CDG and HH contain similar levels of diversity (21 randomized nucleotide positions equating to a library size of about 4x1012 sequences; 421 sequences). This number of ces is theoretically fully represented in the initial pool of RNA with at least five-fold redundancy. While this is substantially lower ity than what is recommended for a typical selection, novel aptamers have been attained from ng pools with even more limited sampling of sequence space.

The three scaffolds were ated into a library cassette with specific design features. The P1 helix of each scaffold containing the initial and terminal bases of the scaffold sequences was replaced in all libraries with a designed helix-containing ured amplification cassettes based upon those developed for Selective 2'-Hydroxyl Acylation analyzed by Primer Extension E”) chemical probing of RNA structure (Figure 17).

This ensures that the constant regions necessary for replication are ured and less likely to be incorporated into the selected aptamer. To further minimize the ial for the constant regions to participate in formation of the ligand-binding site, the P1 helix was ed to at least ten base pairs. Full sequences of DNA templates encoding the initial starting libraries are given in Table 1 and Figure 23.

Table 1. Sequences of oligonucleotides and templates.

Oligonucleotide Sequence In vitro ion GR/SSIII library templatea GCGCGCGAATTCTAATACGACTCACTATAGGACT TCGGTCCAAGCTAATGCACTCNNNNNNNCGCGT GGATATGGCACGCANNNNNNNNNNGGGCACCGT AAATGTCCNNNNNNGGGTGCATTAGCAAAATCG GGCTTCGGTCCGGTTC GR/GsI y template GCGCGCGAATTCTAATACGACTCACTATAGGACT TCGGTCCTTGGATAGGACTCNNNNNNNCGCGTG GATATGGCACGCANNNNNNNNNNGGGCACCGTA AATGTCCNNNNNNGGGTCCTATCCCCAATCGGG CTTCGGTCCGGTTC CDG/GsI library template GCGCGCGAATTCTAATACGACTCACTATAGGACT TCGGTCCTTGGATAGGACANNNNNNNNNCAAAC AAAGAGTGGGACGNNNNNCCTCCGGCC TAAACCGAAAGGTAGGTAGCGGGGNNNNNNNT GTCCTATCCCCAATCGGGCTTCGGTCCGGTTC HH/GsI library template GCGCGCGAATTCTAATACGACTCACTATAGGACT TCGGTCCTTGGATAGGAGCNNNNTGGTATCCAA TGAAAATGTACTACCANNNNNNNNNNCCCAAAT AGGNNNNNNNGCTCCTATCCCCAATCGGGCTTC GGTCCGGTTC T7 site appending primer GCGCGCGAATTCTAATACGACTCACTATAGGAC CCAAGCTAATGCACTC RT-PCR primer GAACCGGACCGAAGCCCG High hput sequencing HTS e sequencing primer, CAAGCAGAAGACGGCATACGAGATGTCGTGTAG GR/SSIII CCTAGTCAGTCAGCCGAACCGGACCGAAGCCCG HTS reverse sequencing primer, CAAGCAGAAGACGGCATACGAGATTGGTCAACG GR/GsI ATAAGTCAGTCAGCCGAACCGGACCGAAGCCCG HTS reverse sequencing primer, CAAGCAGAAGACGGCATACGAGATATCACCAGG CDG/GsI TGTAGTCAGTCAGCCGAACCGGACCGAAGCCCG HTS reverse sequencing primer, CAAGCAGAAGACGGCATACGAGATGCTGTACGG HH/GsI ATTAGTCAGTCAGCCGAACCGGACCGAAGCCCG Forward sequencing primer AATGATACGGCGACCACCGAGATCTACACTATG GTAATTGTGCGCGCGAATTCTAATACGACTCACT RT primer AGTCAGTCAGCCGAACCGGACCGAAGCCCG Indexing primer CGGGCTTCGGTCCGGTTCGGCTGACTGACT Crystallization 5HTP-II GGACACTCTGATGATCGCGTGGATATGGCACGC ATTGAATTGTTGGACACCGTAAATGTCCTAACAC GTGTCCA Isothermal ion calorimetry 5HTP-I aptamerb GGAGCTAATGCACTCTTAACGCCGCGTGGATAT GCACGCAACCGTGAATCGGGCACCGTAAATTCC GTAAGTGGGTGCATTAGC 5HTP-II aptamer GGCACTCTGATGATCGCGTGGATATGGCACGCA TTGAATTGTTGGACACCGTAAATGTCCTAACACG GGTGCC 5HTP-III aptamer GGAGCTAATGCACTCCCATTTTCCGTGGATATGG CACGCTACCGATGTTGGGACCGTAATGTCCATTA CGGGTGCATTAGC 5HTP-IV aptamer GACTCTCTGGTTCGCGTAGATATGGCAC GCAATTGAAGAATGGGCACCGTAAATGTCTGTA GACGGGTCCTATCC 5HTP-V aptamer GGATAGGACTCATTCGGCCGCGTGGATATGGCA CGCAGGAGATGTGTGGACACCGTAAATGTCCGT AGGCGGGTCCTATCC 5HTP-VI aptamer GGATAGGACTCAACATCTCGCGTGGATATGGCA CGCAGACTTCCAGTGGGCACCGTAAATGTCCGT AGACGGGTCCTATCC 5HTP-VII aptamer GGATAGGACATGTAATCTCCAAACCATTCGAAA GACGCTAGACCTCCGGCCTAAACCGAA AGGTAGGTAGCGGGGCTAGGTATGTCCTATCC 5HTP-VIII aptamer GGATAGGAGCTGTTTGGTATCCAATGAAAATGT ACTACCAACTTGAATCTCCCAAATAGGCTAGGTA GCTCCTATCC SHAPE chemical probing 5HTP-I, SHAPE CGGTCCAAGCTAATGCACTCTTAACGCC GCGTGGATATGCACGCAACCGTGAATCGGGCAC CGTAAATTCCGTAAGTGGGTGCATTAGCAATCG ATCCGGTTCGCCGGATCCAAATCGGGCTTCGGTC CGGTTC 5HTP-II, SHAPE GGACTTCGGTCCAAGCTAATGCACTCTGATGATC ATATGGCACGCATTGAATTGTTGGACA CCGTAAATGTCCTAACACGGGTGCATTAGCAAT CGATCCGGTTCGCCGGATCCAAATCGGGCTTCG GTCCGGTTC 5HTP-III, SHAPE GGACTTCGGTCCAAGCTAATGCACTCCCATTTTC CGTGGATATGGCACGCTACCGATGTTGGGACCG TAATGTCCATTACGGGTGCATTAGCAAAATCGA TCCGGTTCGCCGGATCCAAATCGGGCTTCGGTCC GGTTC 5HTP-IV, SHAPE GGACTTCGGTCCTTGGATAGGACTCTCTGGTTCG CGTAGATATGGCACGCAATTGAAGAATGGGCAC CGTAAATGTCTGTAGACGGGTCCTATCCAATCG GGTCCGGTTC 5HTP-V, SHAPE GGACTTCGGTCCTTGGATAGGACTCATTCGGCCG CGTGGATATGGCACGCAGGAGATGTGTGGACAC CGTAAATGTCCGTAGGCGGGTCCTATCCAATCG GGCTTCGGTCCGGTTC 5HTP-VI, SHAPE GGACTTCGGTCCTTGGATAGGACTCAACATCTCG CGTGGATATGGCACGCAGACTTCCAGTGGGCAC CGTAAATGTCCGTAGACGGGTCCTATCCAATCG GGCTTCGGTCCGGTTC 5HTP-VII, SHAPE GGACTTCGGTCCTTGGATAGGACATGTAATCTCC AAACCATTCGAAAGAGTGGGACGCTAGACCTCC GGCCTAAACCGAAAGGTAGGTAGCGGGGCTAGG TATGTCCTATCCAATCGGGCTTCGGTCCGGTTC 5HTP-VIII, SHAPE GGACTTCGGTCCTTGGATAGGAGCTGTTTGGTAT CCAATGAAAATGTACTACCAACTTGAATCTCCC AAATAGGCTAGGTAGCTCCTATCCAATCGGGCT TCGGTCCGGTTC Broccoli sensors 5HTP-II/A-Broccoli GGACGGAGACGGTCGGGTCATATGATGATCGCG TGGATATGGCACGCATTGAATTGTTGGACACCG TAAATGTCCTAACAATATTCGAGTAGAGTGTGG 5HTP-II/U-Broccoli GGACGGAGACGGTCGGGTCTATAGATGATCGCG TGGCACGCATTGAATTGTTGGACACCG TAAATGTCCTAACATATATCGAGTAGAGTGTGG GCTCCGTCC 5HTP-IV/A-Broccoli AGACGGTCGGGTCATATTCTGGTTCGC GTAGATATGGCACGCAATTGAAGAATGGGCACC GTAAATGTCTGTAGACATATTCGAGTAGAGTGT GGGCTCCGTCC V/U-Broccoli GGACGGAGACGGTCGGGTCTATATCTGGTTCGC GTAGATATGGCACGCAATTGAAGAATGGGCACC GTAAATGTCTGTAGACTATATCGAGTAGAGTGT GGGCTCCGTCC 5HTP-VII/A-Broccoli GGACGGAGACGGTCGGGTCATATATATTCGATG CCAAACCATTCGAAAGAGTGGGACGCT AGACCTCCGGCCTAAACCGAAAGGTAGGTAGCG GGGCTAGGTAGTAGAGTGTGGGCTCCGTCC 5HTP-VII/U-Broccoli AGACGGTCGGGTCTATATGTAATCTCC AAACCATTCGAAAGAGTGGGACGCTAGACCTCC GGCCTAAACCGAAAGGTAGGTAGCGGGGCTAGG TATATATCGAGTAGAGTGTGGGCTCCGTCC 5HTP-VIII/A-Broccoli GGACGGAGACGGTCGGGTCATATTGTTTGGTAT CCAATGAAAATGTACTACCAACTTGAATCTCCC AAATAGGCTAGGTAATATTCGAGTAGAGTGTGG GCTCCGTCC 5HTP-VIII/U-Broccoli GGACGGAGACGGTCGGGTCTATATGTTTGGTAT CCAATGAAAATGTACTACCAACTTGAATCTCCC AAATAGGCTAGGTATATATCGAGTAGAGTGTGG GCTCCGTCC In vitro single turnover transcription 5HTP-IV/pbuE riboswitch AATATTGAGCTGTTGACAATTAATCATCGGCTCG TATAATGTGTGGAATTAAATAGCTATTATCAGGA TTTTTCTGGTTCGCGTAGATATGGCACGCAATTG AAGAATGGGCACCGTAAATGTCTGTAGACAAAA TCCTGATTACAAAATTTGTTTATGACATTTTTTGT GATTTTTTTATTTATCAAAACATTTAAG TAAAGGAGTTTGTTATG a “N” represents a position where the composition of A, C, G and T is imately 25% each. b RNA aptamer and sensor sequences are given as their equivalent DNA sequences.

A further complicating issue for scaffold selection was the low fidelity of viral reverse transcriptases (RTs). Engineering of MMLV RT to improve stability and processivity to create the most commonly used versions of this enzyme decreased its already low fidelity.

Misincorporation or deletion of nucleotides in conserved sequences of the scaffold readily disrupts tertiary interactions that ize the global fold. RNAs lacking structure are ied more efficiently by RT, which can introduce a significant bias during the replication step of each round of selection, which in part likely leads to the “tyranny of small motifs” phenomenon observed in selection. To address this, a recently characterized RT derived from a mobile group II intron from the thermophile Geobacillus stearothermophilus (GsI-IIC-MRF or "GsI") that retains ty up to 70 °C (versus 55 °C for SSIII) and has inherently higher fidelity than MMLV-derived RTs was adopted. For comparison, the GR scaffold selection was performed with an RT derived from MMLV Script III or “SSIII”) along with GsI.

Example 2: lded selection against 5HTP yields many potential aptamers The target for selection was 5-hydroxy-L-tryptophan (5HTP; Figure 2A), the immediate biosynthetic sor of serotonin, which was lized on a solid matrix via its carboxylate group. Seven rounds of selection with each y were carried out, with counterselections against L-tryptophan and singly stringent washing procedures in later rounds. In the SSIII selection, a conventional SELEX protocol was adopted in which the affinity column was extensively washed in early rounds prior to competitive elution to remove nonbinding RNAs. Competitive elution was initially observed in round four and peaked at >50% of total input RNA in round six. The GsI selections used a less stringent protocol than generally recommended in which roughly the final 10% of total RNA left on the column under competitive elution was collected for amplification in the initial four rounds to preserve sequence diversity in the pool before increasing wash stringency. Details of the selections are given in Examples 6-8 and Table 2.

Table 2: Selection conditions per cycle.

Round [RNA] pmol Washes* Notes SS-III selection 1 1000 3 Counter selection against AcO-sepharose 2 400 6 Counter selection against pharose 3 400 10 4 400 10 100 10 6 100 10 30 second counter ion with 100 mM L-tryptophan 7 100 10 30 second counter selection with 100 mM L-tryptophan GsI selections 1 1000 3 Counter ion t AcO-sepharose 2 400 3 Counter selection against AcO-sepharose 3 400 5 4 400 5 400 10 6 200 6 30 second counter selection with 100 mM L-tryptophan 7 200 10 30 second counter selection with 100 mM L-tryptophan *Each wash tuted 3 column volumes of buffer In preserving sequence diversity and minimizing early stochastic events, a combination of next generation sequencing (NGS) and downstream bioinformatic analysis was relied on to reveal potential aptamers and elucidate key features of the selection. For each ion, >200,000 reads were obtained for RNA from the final round, and resultant sequences were clustered and maximum-likelihood trees generated. Comparison of the SSIII and GsI selections using the GR scaffold revealed several important features.

A ce matrix of the GR/SSIII selection clearly showed only a few isolated clusters, and within each cluster the sequences have a high degree of internal relatedness (Figure 2B). The majority (>80%) of sequences cluster into three distinct sequence-related families, referred to as 5HTP-I, -II, and –III ( Figure 2D), with the remaining ring into small populations that are ult to ret. This is typical of a traditional SELEX where single isolates are often identified and further mutagenesis and selection are necessary to obtain covariation information. In contrast, the GR/GsI selection yielded more diverse clusters with higher sampling of sequences that populate regions between major clusters (Figures 2C and 2D ). The CDG and HH selections with GsI are similarly diverse in their sequence space with many potential aptamers (Figure 7). While traditional selection approaches often rely upon over-selection to facilitate finding an aptamer with limited sequence information, preservation of the diversity of s and sequence analysis by NGS d for a more thorough analysis of conservation and covariation patterns, aiding in determining sus aptamer sequences. Similar results were observed in the GR- scaffolded L-DOPA selection ( Figure 18). A subset of 250 sequences from each of the clusters that d validated 5HTP aptamers is described further herein.

Unexpectedly, in addition to the limited sequence diversity of the SSIII selection, a heavy lation of deletions and point mutations was observed such that no sequences retaining the full identity of the nt regions of the scaffold were red in the final round. Two of the clusters, 5HTP-I and 5HTP-III ( Figure 2D), have deletions in L2 or L3 of the scaffold essential to the formation of the loop-loop interaction of purine riboswitches. In addition, II members contain several point deletions acquired during the selection that yields the potential for a cally alternative secondary structure. Minimal free energy (MFE) and covariation analysis of this sequence suggest a ary structure tent with the consensus sequence of the L-tryptophan aptamer (an aptamer that comprises a two-way junction; Majerfeld & Yarus, Nucleic Acids Research, 2005, 33, 5482-5493), further suggesting that the scaffold was not maintained in the 5HTP-III family. 5HTP-II is the only major cluster maintaining sequence requirements necessary for the tertiary structure designed into the library and is the only abundant sequence shared between the SSIII and GsI selections. In contrast to the selection using SSIII, the GsI selections exhibited low amounts of mutations accruing in the ld’s constant region, indicating robust maintenance of the ld (Figure 8).

To identify ces with high aptamer potential, the ten most populous clusters from each pool were individually aligned, MFE structures predicted, and covariation models generated. This allowed for an information-rich view of the major consensus sequences ted by the ions. Since the II experiment was highly overselected, the most abundant sequence from each of the three major clusters was chosen for further validation. For the GsI selections, the dominant sequences from one or more clusters whose consensus MFE structure was consistent with the parent scaffold were selected.

Example 3: Most populated clusters preserve scaffold ecture and bind 5HTP with high selectivity The structural ld greatly facilitates validation of the structural and interaction features of resultant aptamers. Chemical probing of RNA structure using N-methylisatoic anhydride (“NMIA”), a que referred to as “SHAPE,” reveals whether the secondary and tertiary architecture of the parental lds were preserved as well as ligand-dependent structural changes in the aptamer. In the GR/SSIII selection, the 5HTP-I and 5HTP-II aptamers have zed changes in the NMIA reactivity patterns in the presence of ligand within the three-way junction elements, tent with this being the ligand-binding site (Figure 3A, Figure 19). 5HTP-III, however, shows changes outside of J2/3 in the constant regions, consistent with the predicted structure and L-Trp binding site of a previously described tryptophan aptamer (Majerfeld & Yarus). Preservation of the GR scaffold was ed using a unique, ligand-independent NMIA vity signature in L3 that is only present when it interacts with L2 (Stoddard et al., RNA, 2008, 14, 675-684). 5HTP-II is the only sequence from the three clusters of the SSIII selection exhibiting this feature.

Conversely, all tested sequences from the GR/GsI ion have this tertiary structure signature ( Figures 9A and 20). These data strongly indicate that the GR/SSIII selection yielded three distinct aptamers with only 5HTP-II preserving the structural scaffold while the GR/GsI ion produced le solutions maintaining the scaffold. While the 5HTP- dependent ures for the GR/GsI es are weaker than those of 5HTP-II and the parental aptamer, fication s that they localize to the junction in an analogous manner ( Figure 3B ). SHAPE characterization of the CDG/GsI and HH/GsI selections shows ligand-dependent changes for the new classes of aptamers and an overall reactivity n resembling the parental scaffold (Figures 9B, 9C, 21 and 22).

The affinity and selectivity of these aptamers for 5HTP and a set of chemically similar compounds was assessed by isothermal titration calorimetry (ITC). Importantly, for all of the tested aptamers, the 5’- and 3’-cassette ces were not ed for 5HTP binding, indicating the successful design of l sequences (Figure 3C). Several trends emerged from this analysis. First, both aptamers that do not preserve the parent scaffold (5HTP-I, -III) do not discriminate between 5HTP and L-tryptophan (Table 3), a crucial requirement for cell-based applications. Second, the ty of the aptamers preserving the three-way junction scaffold have higher affinities for 5HTP than the aptamers with disrupted scaffolds and all strongly discriminate against L-tryptophan. This indicates that the architecture of the scaffold is important for ng a selective binding pocket while ining affinities comparable to other synthetic and natural aptamers that bind amino acids. 5HTP-I and 5HTP-III show strong discrimination between 5HTP and serotonin, ng that main chain atoms are directly recognized. In contrast, many of the aptamers that preserve the scaffold bind ylhydroxy-L-tryptophanomide with 2- to 4-fold higher affinity than 5HTP.

Also, they bind serotonin, the decarboxylation product of 5HTP, suggesting a lesser requirement for the main chain atoms in binding (Table 3). Thus, a few of these aptamers could be outstanding serotonin sensors. Most striking from the GsI selections is that the dominant aptamers from each selection, despite having different scaffold architectures, converged on highly similar binding affinities and selectivity profiles, revealing that three- way junctions are a robust fold for hosting 5HTP binding pockets. Together, these data show that distinct oligonucleotide junctions, such as three-way junction architectural variants, are able to find robust solutions for 5HTP recognition.

Table 3. Affinity of aptamers for 5HTP and related compounds Selection sequence 5HTP L-Trp nin Me-5HTP KD, µMa KD, µM KD, µM KD, µM II 5HTP-I 33 ± 1 41 ± 1 >1000 —-- 5HTP-II 3.9 ± 0.1 280 ± 30 38 ± 8 26 ± 9 5HTP-III 38 ± 14 20 ± 5 >1000 —-- GR/GsI 5HTP-IV 8.8 ± 1.5 520 ± 90 4.7 ± 0.3 1.3 ± 0.1 5HTP-V 11 ± 1 170 ± 10 16 ± 4 6.6 ± 0.4 5HTP-VI 60 ± 15 N.D.b 16 ± 2 25 ± 8 CDG/GsI 5HTP-VII 9.3 ± 0.3 N.D. 1.2 ± 0.1 2.1 ± 0.4 HH/GsI 5HTP-VIII 7.3 ± 2.8 N.D. 1.2 ± 0.2 2.5 ± 0.5 a All measurements taken at 25 °C in 10 mM MgCl 2 containing . b Not detectable.

Example 4: Structural analysis of 5GR-II aptamer reveals a recurrent RNA motif is used for 5HTP binding To further demonstrate that the scaffolded selection strategy described herein preserved the fold of the parental RNA and to ate how RNA can recognize 5HTP, the structure of 5HTP-II complexed with 5HTP was determined at 2.0 Å resolution ( Figure 3D, representative electron density maps are shown in Figure 10 and crystallographic statistics are given in Table 4). This structure globally superimposes with the parental xpt guanine riboswitch aptamer with an r.m.s.d. of 6.5 Å over all backbone atoms in residues 19-77, with the main sources of deviation produced by a different angle for P1 in on to the binding pocket and the varied junction region (Figures 16A-C). Within the L2-L3 tertiary ction the pattern of base-base interactions and backbone ry is almost identical between the two RNAs (r.m.s.d. 0.96 Å over all atoms in residues 31-39, 61-67). Thus, the GR scaffold remained intact, both globally and locally, during the selection process.

Table 4. Crystallographic data and refinement statistics. 5HTP-II TP Data tion Space group C121 Cell dimensions a, b, c (Å) 127.55, 26.59, 63.37 α, β, γ () 90, 106.32, 90 tion (Å) 19.95 – 2.00 (2.07 – 2.00)* Rsym or Rmerge 0.084 (0.191) I /σI 11.2 (5.4) Completeness (%) 96.2 (73.8) Redundancy 4.34 (3.65) Refinement Resolution (Å) 18.33 – 2.00 (2.07 – 2.00) No. unique reflections 13,725 Rwork / Rfree 21.7/25.8 (20.1/26.0) No. atoms RNA 1513 Ligand/ion 16/90 Water 112 B-factors (average) RNA 29.5 Ligand/ion 16.2/35 Water 23.5 r.m.s. deviations Bond lengths (Å) 0.007 Bond angles () 1.308 *Values in parentheses are for the highest-resolution shell.

The ligand binding pocket of 5HTP-II resides within the three-way junction that has a radically different local structure from the parent RNA. Direct ligand ts are primarily mediated by nucleotides in J2/3 using a common RNA ural module, the T-loop (Figure 4A ). The first five nucleotides of J2/3 form a canonical T-loop structure superimposing almost tly with a tRNAPhe T-loop (r.m.s.d. 0.49 Å for backbone residues). Stabilization of position 3 in the tRNA T-loop by long range -Crick g with the D-loop is critical for activity. 5HTP-II possesses a similar interaction n G47 of the T-loop and C75 of J3/1. The 5HTP-II T-loop hosts 5HTP stacked between positions 4 and 5 in a manner orthologous to how the tRNA T-loop hosts an intercalating purine from the D-loop and is also similar to thiamine pyrophosphate (TPP) recognition by its riboswitch (Figure 4B).

While the T-loop is directly responsible for recognition of the ligand, nucleotides from all three randomized regions are involved in local structure aiding in the formation of a t junction that stabilizes the T-loop. Given such a complex set of interactions supporting the T-loop, it is ly that the isolated T-loop binds 5HTP.

The crystal structure of 5HTP-II yields additional insights into 5HTP recognition by the other scaffolded aptamers. The most abundant cluster in the GR/GsI selection, 5HTP-IV, also contains the UUGAA signature of the T-loop. The motif, however, is fted by a single tide, likely leading to an alternative orientation within the way junction as suggested by significant sequence differences in J1/2 and J3/1 n 5HTP-II and 5HTPIV.

In the HH selection, the most nt sequence of the most us cluster (5HTPVIII ) also contains the conserved UUGAA sequence of the T-loop in J2/3. Sequence variation analysis of this region of the 5HTP-VIII aptamer reveals a pattern of conservation matching that of the ical T-loops with only slight deviations ( Figure 11). This suggests that the T-loop motif may be a robust module for the recognition of small planar nds by RNA. While there is no clearly identifiable T-loop in RNAs from the CDG selection, the binding parameters almost perfectly match that of the other two selections, suggesting a r recognition mode.

Example 5: Scaffolded aptamers can be readily incorporated into robust small molecule sensory devices With scaffolded selection techniques proving capable in creating well folded, highly structured, and specific RNA aptamers, its ability to produce functional synthetic RNA biosensors was tested. To create these devices, a strategy of linking a small molecule binding aptamer to a fluorophore-binding module via a short helical element was used. The lead candidate aptamer from each library was coupled to the Broccoli fluorophore binding aptamer with two helical variants (the ication modules are referred to as "A" and "U", Figure 12) linking the two aptamers. This ed in a set of RNAs capable of sensing 5HTP and/or serotonin over several orders of magnitude in vitro with varying output fluorescence dynamic ranges (Table 5 and Table 6). Many of these sensors, when in the presence of ligand, are capable of producing fluorescence levels equal to or greater than that of the unconjugated li aptamer alone under identical ions. Inherent to this system is a reduced apparent F50 (defined as the ligand tration required to elicit a half maximal fluorescent response) relative to the K D of the ed aptamer as monitored by the Broccoli fluorescence. However, several scaffolded aptamers show only a ~10-fold ence between their KD and F50. Overall, this compares favorably to examples of natural riboswitch aptamer domains in the literature whose differences in K D and F50 can approach 1000-fold, an important trait to consider when sensitivity or ligand toxicity is a limiting factor in riboswitch application.

Table 5. In vitro performance of Broccoli based 5HTP/serotonin sensors linker/ligand A/5HTPa A/5HT U/5HTP U/5HT aptamer F50, µM F50, µM F50, µM F50, µM 5HTP-II 190±30 N.D.b 180±20 N.D. 5HTP-IV N.D. 190±70 240±20 52±4 5HTP-VIII N.D. 790±190 590±90 260±50 aAll measurements taken at 25 °C in 5 mM MgCl 2 ning buffer. bNot able.

Table 6: Performance of 5HTP-Broccoli sensors Fold Fmax Sensor Ligand [MgCl2], mM Inductiona (% Broccoli)b F50c, µM 5HTP-II/A 5HTP 1 3.5 18.5 3 6.3 71 7.3 122 190±30 6 168 4.1 180 5HT 1 1.3 6.9 3 3.1 34.6 3.7 62.7 n.d. 3.9 108 3 134 5HTP-II/U 5HTP 1 1.9 21.1 3 3.1 72.2 3.3 105 180±20 3.3 130 3.2 135 5HT 1 0.4 4.7 3 0.8 18.2 1.1 33.2 n.d. 1.3 52.6 1.6 66.4 5HTP-IV/A 5HTP 1 1.2 2.7 3 1.3 2.9 1.4 3.1 n.d. 1.7 3.8 2 4.3 5HT 1 1.0 2.4 3 1.3 2.75 1.4 3.2 190±70 1.8 4 2.2 4.7 5HTP-IV/A 5HTP 1 2.5 8.5 3 6 37.1 8.2 71.2 240±20 8.2 111 6.1 126 5HT 1 5.1 17.2 3 8.8 54.1 9 78.1 52±4 7.2 98.3 5.1 105 5HTP-VIII/A 5HTP 1 1.1 3.7 3 1.8 7 2.4 10.4 3.7 18 5.1 26.9 5HT 1 1.5 5 3 4.4 17.7 7.2 31.4 0 10.9 53.3 13 69.5 5HTP-VIII/U 5HTP 1 1.6 12.3 3 2.5 43.3 2.8 69.7 590±90 3.1 109 3 126 5HT 1 3.8 30 3 5.5 93.4 4.8 116 260±50 3.7 131 3.2 136 a Defined as (fluorescence at saturating ligand/fluorescence in absence of ligand); grey shading denotes sensors that showed strong performance. b Defined as um sensor fluorescence/isolated broccoli aptamer fluorescence)*100. c d as the concentration of ligand ed to elicit the half maximal fluorescence response.

Of the above devices, 5HTP-II(A) is capable of specifically sensing 5HTP in E. coli.

This genetically encoded sensor d a rapid induction of scence upon on of 2 mM 5HTP to E. coli growing in a rich chemically defined medium (10 minutes), with approximately 80% of ia displaying an observable response within 20 minutes (Figure ). The fluorescence signal was completely dependent upon the RNA device binding 5HTP.

No signal gain was observed when L-tryptophan was included in the media or when the sensor contained a point on (A48U) in the T-loop module that d ligand binding to the isolated aptamer (data not shown). Furthermore, the increase in relative fluorescence in the presence of 5HTP was able to robust cyclic dinucleotide sensors based upon natural riboswitch aptamer domains in live cells. Importantly, these observations are in contrast to claims that non-natural aptamers have d intracellular performance compared to natural aptamers in the context of fluorometric sensors (You, PNAS (2015) 112:21, E2756-2765).

Select scaffolded 5HTP aptamers were also coupled to engineered modular secondary switches derived from natural riboswitch expression platforms to generate gene regulatory elements. Using a ng strategy in which the P1 helix of the aptamer and expression platform is directly coupled, a proficient ligand-dependent regulator of transcription was engineered by fusing the 5HTP-IV sensor and pbuE “ON” switch platform (Figure 6A). The resulting RNA element is capable of activating transcriptional read-through in vitro with a specificity profile cal to the aptamer domain in isolation and ses a dynamic range consistent with natural riboswitches (Figure 6B); surprisingly, L-Trp is completely incapable of enabling read-through transcription. Again, the discrepancy between KD and T50 was not insignificant (6-fold for serotonin, 22-fold for 5HTP), but reflected observed trends for natural riboswitches where an aptamer’s thermodynamic properties do not always dictate its ability to communicate with an adaptor sequence.

Example 6: Scaffolded aptamers according to exemplary embodiments The Broccoli aptamer was coupled to a tRNA scaffold to stabilize the biosensor for cell-based applications. Four different GR-scaffolded 5HTP aptamers were coupled to four ication modules of differing lengths (two to five A-U and U-A base pairs; Figure 14A) and each resultant biosensor tested for the ability to fluoresce in a ligand-dependent fashion. Each sensor was assessed for their ligand-dependent fold change in fluorescence and maximal brightness relative to the isolated li aptamer both in vitro (Figure 14B; Tables 7 and 8) and in E. coli (Figure 14D; Tables 9 and 10). To enable rapid screening of candidates in vitro, the biosensors were transcribed and directly used in the fluorometric assay t further purification. These data reveal three aptamers (5GR-II, -IV, and -V) yielded s that can detect 5HTP and/or serotonin both in vitro and in the cellular context, with 5GR-II demonstrating the best performance with respect to combined fold increase in fluorescence and l brightness.

To further trate the potential of scaffolded aptamers, live cell imaging was used to visualize the uptake of 5HTP by E. coli using the 5GR-II/CM-4 biosensor. scence imaging of single cells revealed a rapid induction of fluorescence upon addition of 2 mM 5HTP to E. coli growing in a rich chemically defined medium, with approximately 80% of bacteria displaying an observable response within 20 minutes es 15A, D). The fluorescence signal was completely dependent upon the RNA device binding 5HTP; no detectable signal gain was observed when L-tryptophan was included in the media (Figures 15B, E) or when the sensor ned a point mutation (A48U) in the T-loop module that ablated ligand g to the ed aptamer (Figures 15C, F). The ed increase in relative fluorescence in the presence of 5HTP was comparable to robust cyclic dinucleotide sensors based upon natural itch aptamer domains in live cells. These results contrast us claims that synthetic aptamers have reduced intracellular performance ed to natural aptamers and it is shown here that multiple synthetic aptamers are capable of functioning within E. coli in the context of an allosteric fluorogenic RNA.

The above 5HTP biosensors were designed with knowledge from biochemical and biophysical analysis of select aptamers. However, an l workflow for rapid development of biosensors would be able to use information derived only from the computational analysis of the selection to design candidate RNAs. To demonstrate that scaffolded aptamers incorporate design principles that enable sor engineering in the e of experimental characterization, the above biosensor strategy was employed for four aptamers derived from the L-DOPA ion. None of these aptamers were validated in any fashion prior to their incorporation into allosteric fluorogenic sensors. Screening of the resultant sors with L-DOPA and dopamine in vitro (Figure 14C; Tables 7 and 8) and in E. coli (Figure 14E; Tables 9 and 10) revealed two aptamers (DG-I and DG–II) that function in both contexts.

Table 7. In vitro-fold induction of fluorogenic GR-scaffolded aptamers aLigand concentration is 2 mM. bFold Induction (FI) is ated as (total fluorescence, +ligand) / (total fluorescence, - ligand).

Error is reported as the standard error of the mean for three independent experiments.

Table 8. In vitro brightness of fluorogenic GR-scaffolded aptamers relative to parental Broccoli aAptamer ligand concentration is 2 mM. bPercent brightness is calculated as (total fluorescence sensor, +ligand) / (total fluorescence Broccoli, + ligand). Error is reported as the standard error of the mean for three independent experiments.

Table 9. In vivo-fold ion of fluorogenic GR-scaffolded rs aLigand concentration is 2 mM. bFold induction (FI) is calculated as (total scence, +ligand) / (total fluorescence, - ligand). Error is reported as the standard error of the mean for three independent ments.

Table 10. In vivo brightness of fluorogenic GR-scaffolded aptamers relative to parental li aptamer aLigand concentration is 2 mM. bPercent brightness is calculated as (total fluorescence sensor, + ligand) / (total fluorescence Broccoli, d). Error is ed as the standard error of the mean for three independent experiments.

Example 7: Discussion RNA-based devices are progressing towards becoming a robust tool in synthetic y, driven by a unique feature set when compared to protein-based alternatives, including the ability to te in cis, predictable secondary structure and a small genetic int. Efforts have focused on creating synthetic riboswitches, aptazymes and fluorogenic RNA sensors, but their ial has yet to be fully realized in significant part due to the limited availability of small molecule receptors that function in the context of such devices. In the work presented herein, a strategy has been designed that exploits the secondary and tertiary structural architecture of naturally evolved riboswitches and ribozymes to scaffold small molecule binding pockets raised through in vitro selection. Importantly, using no information beyond that obtained from high throughput sequencing of the final round of selection, rs selected to L-DOPA using this approach were coupled to a fluorogenic aptamer module to produce cally encodable biosensors that function in the cellular t.

One key strength of the methods and itions described herein is the use of multiple scaffolds in parallel ions to obtain a suite of aptamers. This differs significantly from traditional selections known in the art where the same subset of solutions is reproducibly generated from a simple randomized pool which significantly constrains sensor diversity and development. While the aptamers derived from different lds have similar affinities for 5HTP and selectivity against L-tryptophan, they clearly have distinct characteristics with respect to their ability to communicate with a readout domain via the P1 helix, a common feature to all of the scaffolds. In biological itches, the ligand is either in direct contact or induces conformational changes in the RNA that involve the P1 helix that links the r to the ream regulatory switch. Without intending to be bound by scientific theory, it is hypothesized that differences in sensor performance across different aptamers is in part due to variation in the spatial relationship between the ligand and the interdomain (P1) helix, a feature that cannot be fully controlled in the selection. However, unlike deep selections, the scaffolded selection approach presented here strongly biases selections towards a favorable ligand/P1 orientation by constraining the possible ligand position.

With a suite of aptamers, combinatorial approaches can be employed to rapidly screen for sensors without extensive aptamer characterization or device optimization as typified by the dopamine sensor development herein (Figure 19). pment of an RNA device from aptamers derived from deep selections requires thorough characterization along with broad screening of ication s while leaving the sensory r as a fixed, unalterable node due to the lack of diversity. With the scaffolded ion methods and compositions described herein, a set of distinct aptamers can be combinatorially coupled to a set of communication modules and rapidly ed for variants with the desired activity, as demonstrated with the L-DOPA selection. In this fashion, the methods and compositions provided herein should facilitate the expedient development of RNA devices and sensors by easing a key bottleneck in their development. Notably, while in this study only the most populous clusters were focused on in each selection for characterization and/or sensor design, within each selection there are many clusters ning alternative ces that could further enrich the initial pool of aptamers for developing downstream applications.

A second powerful advantage of the selection methods and compositions described herein is the potential for robust folding in the cellular context provided by the tertiary interaction of the three-way junction architecture. Each of these aptamers has a fold that has undergone extensive biological ion. Further, the distal tertiary interactions that ze the three-way junction core can be highly stable. Both the L2-L3 interaction of the purine riboswitch and the tetraloop-tetraloop receptor of the cyclic di-GMP riboswitch scaffold are capable of stably g outside the context of other RNA structure. I n contrast, the long range interaction zing the S. mansoni hammerhead ribozyme is dynamic, which is another aspect of diversity with respect to the chosen scaffolds. The presence of robust secondary and tertiary structure in the scaffold enables these elements to potentially guide the folding of all members of the initial library. In contrast, RNA ding during selection and or the presence of le MFE structures in the final aptamers is often a significant problem for traditional deep selection. Since there is no significant selection pressure for high-fidelity folding in a typical ion protocol, providing this information in the starting library can be a path towards robust folding RNAs.

While three-way junction scaffolds were chosen as the focus of this study, the diversity of natural riboswitches and ribozymes can e further feedstock for this approach. Within the three-way junction family, there is a broad array of sequences that vary the orientation of the three helices, size of the joining regions, and the nature of the distal ry interaction that may provide superior lds for a particular ligand or sensor.

Furthermore, other folds may be predisposed to bind a target small molecule based on the nature of the cognate ligand. For example, r logical choice for a scaffold to bind 5HTP is the lysine itch r domain. Larger ligands may be more easily recognized by flavin mononucleotide or cobalamin riboswitch-derived scaffolds, while dinucleotides such as NADH may be readily accommodated by one of the di-cyclic nucleotide aptamers. As natural RNA aptamers have been discovered to recognize chemically diverse small molecules, exploiting their architectures towards the selection of novel aptamers has the potential to facilitate the pment of powerful new tools for monitoring and responding to small molecules in the cellular environment across a broad range of applications.

Example 8: Library construction For each scaffold, tides within an 8 Å shell surrounding the ligand binding site or active site of the parent RNA were identified from their crystal structure (GR, PDB ID 4FE5; CDG, PDB ID 3IWN; HH, PDB ID 3ZD5). The corresponding positions were randomized in a DNA ultramer that spanned the entire aptamer domain with conserved flanking sequences for e transcription and amplification (Integrated DNA Technologies; sequences of all nucleic acids used in this study are provided in Table 1). ssDNA was converted into dsDNA templates for ription using standard Taq PCR conditions in which ~2x10-12 mol DNA (corresponding to ~1012 individual sequences) was used in each 100 µL PCR reaction and amplified for 15 cycles with the T7 site appending and RT-PCR primers. Approximately 1x1014 sequences were ribed in 12.5 mL transcription reaction containing 40 mM Tris-HCl, pH 8.0, 25 mM DTT, 2 mM spermidine, 0.01% Triton X-100, 4 mM each rNTP pH 8.0, 0.08 units inorganic phosphatase (Sigma-Aldrich, lyophilized powder), and 0.25 mg/mL T7 RNA polymerase and incubated at 37 °C for 4 hours. Transcription samples were then precipitated in 75% ethanol at -20 °C, pelleted, and reconstituted in a solution of 300 µL formamide, 3 mL 8 M Urea, and 300 µL 0.5 M EDTA pH 8.0. Full length RNA was purified with a denaturing 8%, 29:1 acrylamide:bisacrylamide gel. Product RNA was excised from the gel after visualizing by UV shadowing and eluted in 0.3 M NaOAc pH 5.0 before exchange and storage in 0.5x TE. e 9: Synthesis of 5HTP affinity column matrix For the derivatized columns, 3 mL bed volume of EAH Sepharose 4B (GE Healthcare) was dehydrated with dimethylformamide (DMF). 10 µmoles of Fmoc hydroxy-L-tryptophan and 10 µmoles of benzotriazolyl-oxytripyrrolidinophosphonium hexafluorophosphate (PyBOP) were dissolved in 1 mL of DMF and added to the ated column with 20 µmoles N,N-diisopropylethylamine (DIPEA) and incubated with agitation for 2 hours at room temperature. The column matrix was then drained and washed ively with DMF. ted sepharose amines were acetylated by adding 1 mmole of acetic anhydride and 1 mmole DIPEA in approximately 1 mL DMF and mixed at room temperature for 1 hour. The column was drained of the acetylating mixture and washed with DMF prior to Fmoc deprotection using 20% v/v piperidine/DMF. Amino acid concentration on the column was determined by measuring the concentration of Fmoc in the deprotection fractions (A301 nm = 8000 M-1 cm-1). This method generated approximately 0.5-1 mM deprotected amino acid per mL resin. For counter selection, EAH sepharose was prepared exactly the same except omitting the ligand ng step resulting in acetylated sepharose.

Example 10: In vitro ion For the GR scaffold selection using the SuperScript III reverse transcriptase (GR/SSIII), 350 µL acetylated ose was brated in selection buffer (10 mM Na- HEPES, pH 7.0, 250 mM NaCl, 50 mM KCl, 10 mM MgCl2, 0.1 mg/mL tRNA) and 1 nmol of y RNA in 350 µL of selection buffer was incubated at room temperature for 30 minutes with agitation. The applied solution was removed and the column matrix washed once with 350 µL of selection buffer. The pooled flow through and wash (750 µL total) was added to pre-equilibrated 5HTP-derivatized sepharose 4B column and incubated for 45 minutes. The column was then drained and washed three times with selection buffer before elution with 10 mM 5HTP in selection buffer (two 1 hour incubations in 350 µL; total 700 µL eluted volume). The eluted fractions were then concentrated to 50 µL in a 0.5 mL Ultracel 10kD MWCO filter (Millipore) and ethanol precipitated in 0.3 M sodium acetate (pH .0), 5 µg glycogen, and brought to a final concentration of 75% ethanol before storage at -70 °C for 30 minutes. Details of the conditions of each cycle are provided in Table 2.

To convert the competitively eluted RNA into a new population of RNA, the elution fractions were ethanol precipitated, ed at 13000 x g at 4 °C, decanted, and dried under vacuum. The dried pellet was reconstituted with 0.7 mM each dNTP, 7 µM RT-PCR primer, and brought to a total volume of 14 µL before heating to 65 °C for 5 s and incubation on ice for 10 minutes. The solution was then brought up to 1x SuperScript III first-strand buffer (5x: 250 mM Cl, pH 8.3, 375 mM KCL, 15 mM MgCl2) with 5 mM DTT and 200 units SuperScript III (Life Technologies) in a total volume of 20 µL before a 15 minute extension at 54 °C. The entire 20 µL reverse ription solution was PCR amplified in a total volume of 500 µL using standard Taq DNA rase conditions. The amplified pool was then ribed by adding 100 µL of the PCR reaction to a 1 mL transcription reaction containing 40 mM Tris-HCl, pH 8.0, 25 mM DTT, 2 mM spermidine, 0.01% Triton X-100, 4 mM each rNTP pH 8.0, 0.08 units inorganic phosphatase, and 0.25 mg/mL T7 RNA polymerase and incubated at 37 °C for 2 hours. A 100 µL transcription reaction for 32P labeled RNA was performed under similar condition with the exception that the rNTPs were d to 2 mM for UTP, CTP, and GTP while ATP was reduced to 200 µM and ~100 µCi 32P-ATP. Transcription samples were gel purified as described above with the gel loading conditions scaled accordingly.

Selections using the GsI reverse transcriptase were performed as described above with the following changes. The buffer for selection contained a reduced magnesium concentration and more logically relevant monovalent cations: 25 mM Na-HEPES, pH 7.0, 150 mM KCl, 50 mM NaCl, 3 mM MgCl2). GsI-IIC-MRF reverse transcriptase was used in place of SuperScript III. GsI-IIC MRF e riptase was expressed in E. coli and purified as described (Mohr et al., RNA, 2013, 19, 958-970). The precipitated RNA pellet was brought up in 1.25 mM dNTPs and 20 µM RT-PCR primer prior to ration at 65 °C, annealing at 4 °C, and equilibration at 60 °C. The solution was then brought up to 1x GsI-IIC-MRF buffer conditions (10 mM NaCl, 1 mM MgCl2, 20 mM TrisCl, pH 7.5, 1 mM DTT) in 20 µL total volume and ient enzyme was added for extension at 60 °C. PCR was as med as described above.

Example 11: High-throughput cing and bioinformatic analysis Standard PCR was conducted to append the Illumina hybridization sequences ary for annealing to the flow cell. Each library was amplified with the forward sequencing primer and a unique reverse primer containing a distinguishing 12 nucleotide barcode (sequences are given in Table 1). The samples were sequenced using a v3 reagents kit for 150 cycles on a MiSeq (Illumina) with custom read and indexing s.

The resulting sequences were iplexed, trimmed, and quality ed using scripts from QIIME (Caporaso et al., Nat. Methods, 2010, 7, 335-336). All sequence information outside of the P1 stem was trimmed and only sequences ning a Phred score ≥ 20 for each nucleotide were used in the analysis. The resulting fasta format files for each library were then subjected to clustering by USEARCH (Edgar, Bioinformatics, 26, 2460- 2461), which generated seed sequences that were clustered at 90% identity; any clusters containing a single sequence were ded. The top ten populous clusters were then mapped back to their original sequence file, and 250 individual sequences were randomly taken as a representative sample of each cluster for further analysis. Sequences in each cluster were aligned using MUSCLE (Edgar, NAR, 2004, 32, 1792-1797) and the resultant alignment analyzed using CMfinder (Yao et al., Bioinformatics, 2006, 22, 445-452). R2R (Weinberg & Breaker, BMC ormatics, 2011, 12, 3) was run at its default settings to generate figures of the sequence conservation mapped onto the minimum free energy (MFE) secondary structure.

Example 12: NMIA chemical g RNA was ed as described previously (Edwards et al., Methods Mol. Biol., 2009, 535, 135-163). Structure cassettes flanking the 5' and 3' ends of the RNA were added to facilitate reverse transcription and NMIA modification was performed using the established protocols (Wilkinson et al., Nat. Protoc., 2006, 1, 1610-1616) at 25 °C. RNA was probed at 100 nM in 100 mM Na-HEPES, pH 8.0, 100 mM NaCl, and 6 mM MgCl2. Ligand concentration was 500 µM where indicated. Gel images were analyzed by SAFA (Das et al., RNA, 2005, 11, 344-354) and ImageJ (NIH).

Example 13: Isothermal ion Calorimetry (ITC) All RNAs tested were exchanged into the SSIII selection buffer (10 mM Na-HEPES, pH 7.0, 250 mM NaCl; 50 mM KCl; 10 mM MgCl2) and washed three times in 10 kD MWCO filter (EMD Millipore). The ligand was brought up from a dry solid directly into the binding buffer and concentration established on a NanoDrop 2000 o Scientific) using an extinction coefficient at 275 nm of 8000 mol-1 cm-1 for the 5-hydroxyindole moiety. The RNA was diluted to between 50-100 µM and the ligand was titrated at roughly 10 times the concentration of RNA. Titrations were performed at 25 °C using an al iTC200 microcalorimeter (GE Healthcare) using established protocols (Gilbert and Batey, Methods Mol. Biol., 2009, 540, 97-114). Data was ed and fitting was performed with the Origin .0 software suite (Origin Laboratories).

Example 14: Structure ination of the 5HTP-II/5HTP x RNA for crystallization was prepared as previously described (Edwards et al., Methods Mol. Biol., 2009, 535, 135-163). The RNA was concentrated in an Amicon Ultra 10k MWCO filter (EMD Millipore, Inc.) and exchanged into 0.5x T.E. buffer. Diffraction quality crystals were obtained by mixing 2 µL RNA:ligand complex (1:1) and 3.5 µL mother liquor (8-14% 2-methyl-2,4-pentanediol, 40 mM sodium cacodylate pH 5.5, 4 mM MgCl2, 12 mM NaCl, 80 mM KCl, and 4-9 mM cobalt hexamine), micro-seeding, and tion at 22 °C for 1-3 days. The crystals needed no further cryoprotection and were flash frozen in liquid nitrogen before data collection. Data was ted with a Rigaku R-Axis IV image plate system using CuK radiation (1.5418 Å) at 100 K, and was indexed and scaled using D*TREK (Pflugrath, Acta Crystallogr. D Biol. Crystallogr., 1999, 55, 1718-1725). Data on a heavy atom derivative made by replacing cobalt hexamine with 1-11 mM iridium hexamine was also collected on the home x-ray source. Phases were determined using the single isomorphous replacement with anomalous scattering (SIRAS) method. AutoSol (Adams et al., Acta Crystallogr. D Biol. llogr., 2010, 66, 1) was used to find 12 iridium atoms that were then used to calculate . The resulting experimental density map displayed unambiguous features of the RNA backbone and helices and was used for building the model.

The initial model was iteratively built without the ligand in Coot (Emsley & Cowtan, Acta Crystallogr. D Biol. Crystallogr., 2004, 60, 2126-2132) between rounds of refinement in PHENIX (Adams et al., Acta Crystallogr. D Biol. Crystallogr., 2010, 66, 213-221). The RNA model was brought through several rounds of refinement and ted annealing before 5HTP was built into the model. At this point of building, there was clear ligand density in the binding pocket that d for the ent placement and orientation of the ligand. The placement of the ligand and bases was validated by a composite omit map (Figure 10B).

Water placement was automated in final rounds of ment after ligand placement based on peak size in the Fo-Fc difference map. The resultant model had good geometry as judged using MolProbity (Chen et al., 2010, Acta Crystallogr. D Biol. Crystallogr., 2010, 66, 213- 221) and final model statistics (Rwork and Rfree are 21.9% and 26.2%, respectively). All crystallographic data and model statistics are given in Table 4.

Example 15: In vitro Broccoli sensor assays RNA was prepared as described above, with additional 0.5x T.E. buffer washes in a 10k MWCO Amicon Ultra (Millipore) to minimize carry-over of metal ions. All RNA sensors were assayed at concentrations of 0.5 µM RNA and 10 µM (Z)(3,5-difluoro hydroxybenzylidene)-1,2-dimethyl-1H-imidazol-5(4H)-one (DFHBI) in a buffer containing 80 mM Tris-HCl, pH 7.4, 150 mM KCl, and 50 mM NaCl. The buffer, , magnesium ntrations given in Table 6), and DFHBI were mixed prior to the addition of RNA and all reactions allowed to incubate for 30 minutes at room temperature. DFHBI fluorescence was measured by placing 200 µL reaction volume in a Greiner 96-well flat bottom black fluorescence plate (Thermo Scientific) and reading in a Tecan Infinite M200 PRO plate reader. Samples were excited at 460 nm and fluorescence emission measured as the average signal between 506 and 510 nm. The concentration of ligand to elicit a half maximal fluorescence response was determined by fitting the observed fluorescence as a function of ligand concentration to a two state model.

Engineered s were synthesized as G-blocks (sequences of sensors given in Figure 23; Integrated DNA Technologies) and cloned between the XbaI and BlpI sites in pET30b using standard molecular cloning techniques. All resultant plasmids were ce verified. For T7 RNA polymerase transcription ons, a DNA template was generated by PCR using 1 μM outer primers (5’: GGCCGTAATACGACTCACTATAGGAGCCCGGATAGCTCAGTCGGTAGAGCAG, 3’: TGGCGCCCGAACAGGGACTTGAACCCTGGA) using a standard PCR reaction. tes were ly added to an in vitro transcription reaction (see above) and RNA synthesis was allowed to proceed for 2 hours at 37 °C. RNA from the above transcription reaction was directly used in assays without further purification.

The activity of each sensor was monitored in a 100 μL reaction containing 50 μL of in vitro transcription reaction, 10 μL of 10x Survey Buffer (1x: 50 mM K-HEPES, pH 7.5, mM MgCl2, 150 mM KCl, 50 mM NaCl), 30 μM DFHBI-1T and 2 mM ligand (for plus ligand reactions). Reactions were incubated at room temperature for 30 minutes and DFHBI fluorescence was ed by placing 90 μL reaction volume in a Greiner 96-well flat bottom black scence plate (Thermo Scientific) and reading in a Tecan Infinite M200 PRO plate reader. Samples were excited at 460 nm and fluorescence emission measured as the average signal between 506 and 510 nm. For all experiments, a positive l of a tRNA-scaffolded li aptamer was performed in the presence and absence of ligand, which was also used as a reference for relative brightness. Fold-induction was calculated by dividing the fluorescence values for the DFHBI-1T plus ligand reaction by the scence value for the DFHBI-1T condition alone. All experiments were performed in triplicate and fied data reported with the rd error of the mean (s.e.m.) Example 16: In vitro Broccoli sensor assays E. coli One Shot® BL21 Star (DE3) cells (Thermo Fisher) were transformed with a pET30b-derived plasmid containing a sensor under inducible control, plated onto LB agar supplemented with 50 μg/mL kanamycin and incubated at 37 °C for approximately 16 hours.

Individual es were picked and grown overnight (approximately 16 hours) in 5 mL of LB supplemented 50 μg/mL kanamycin to allow the culture to reach saturation. For screening experiments, 5 μL of the saturated overnight culture was added to 5 mL of LB supplemented with 50 μg/mL kanamycin and grown to mid-log phase (OD600 approximately 6) at 37 °C. To induce sion of the Broccoli aptamer alone or the Broccoli/riboswitch aptamer fusion constructs, IPTG was added to a final concentration of 1 mM in each culture, which were then grown for an additional 2 hours at 37 °C. Cells were then pelleted by centrifugation and washed once with 5 mL of 1X M9 salts supplemented with MgSO4 at a final concentration of 5 mM and kanamycin at a final concentration of 50 μg/mL. After washing, cells were pelleted by centrifugation, resuspended in 250 μL of the above M9 medium and split into two 100 μl aliquots. In half of the aliquots, DFHBI-1T was added to a final tration of 50 μM in a final volume of 110 μL. In the other half of the aliquots, DFHBI-1T was added to a final concentration of 50 μM and the ligand (5HTP, 5HP or dopamine) was added to a final concentration of 1 mM in a final volume of 110 μL. Cells were then incubated at 37 °C for 30 minutes to allow for uptake of each compound.

Following the 30 minute incubation, 100 μL of each t was pipetted into a Greiner 96- well black microplate and chilled on ice for 30 s. For fluorescence measurements, DFHBI-1T was monitored at an tion wavelength of 472 nm and a 520 nm emission ngth. Quantified data represent the average fluorescence values ± standard error of the mean (s.e.m.) from three biological replicates, which were background corrected using a pET30b empty vector control. Fold-induction was calculated by ng the average fluorescence values of cells exposed to ligand by the average fluorescence of cells without ligand.

Example 17: Intracellular fluorescence imaging of 5HTP DNA and cultures were prepared as described (Paige et al., Science, 2012, 335, 1194).

Briefly, the tRNA/Broccoli fusion sequence was cloned into pET30b n the XbaI and BlpI sites downstream of an inducible T7 promoter. The sequence-verified plasmid was transformed into BL21 (DE3) STAR cells rogen) and single colonies were grown up overnight in Luria Broth (LB) supplemented with 50 µg/mL kanamycin. The overnight culture was used to inoculate fresh LB/kanamycin medium at a 1:1000 dilution and the culture grown at 37 °C to an OD 600 = 0.4-0.6 before induction with 1 mM IPTG and growth at 37 °C for 2-4 hours. 200 μL of the resultant culture was centrifuged, decanted, and resuspended in 2 mL of M9 minimal salts medium supplemented with 50 µg/mL kanamycin, 5 mM MgSO 4, and 1 mM IPTG. 200 μL of the resuspended culture was transferred to 96- well -lysine coated glass bottom plates (MatTek) and incubated at 37 °C for one hour.

The media was then removed and the wells washed with M9/kanamycin/1 mM IPTG medium before adding 200 μL of M9 media, 1 mM IPTG, and 400 μM DFHBI-1T (Lucerna). The live scence images were taken with an Andor iXon3 897 EMCCD using a 60x oil objective, an excitation filter 472/30, dichroic mirror 490 (long pass) and on filter 520/40 on a Nikon Ti-E microscope and ed with FIJI (Schindelin et al., Nat. Methods, 2012, 9, 676-682).

Example 18: Single turnover in vitro transcription assays dsDNA templates were transcribed as previously described (Trausch et al., Structure, 2011, 19, 1413-1423). In brief, 50 ng of DNA template were incubated at 37 °C for 10 minutes in 12.5 µL of 2x transcription buffer (140 mM Tris-HCl, pH 8.0, 140 mM NaCl, 0.2 mM EDTA, 28 mM β-mercaptoethanol and 70 mg/mL BSA), 2.5 µL 50 mM MgCl2, 100-200 µCi of 32P-ATP, and 0.25 units of E. coli RNA polymerase σ70 holoenzyme (Epicentre Biotechnologies) per reaction were brought to 23 µL. The equilibrated reactions were then ted with the addition of 7.5 µL reaction buffer (165 µM each rNTP, 0.2 mg/mL n, and the desired ligand concentration) and incubated for 15 s at 37 °C before quenching with 8 M urea. The reactions were then separated on an 8% denaturing PAGE, dried, and exposed on a phosphor imager screen. Quantitation of the gels then carried out in ImageJ (NIH) and the data fit to a ate model.

Accession Codes Coordinates and structure s have been deposited in the RSCB Protein Data Bank under the accession code 4ZAQ.

Incorporation by Reference The contents of all references (including literature references, issued patents, published patent ations, and co-pending patent applications) cited throughout this application are hereby expressly incorporated herein by reference in their entireties. Unless otherwise defined, all technical and scientific terms used herein are accorded the meaning commonly known to one with ordinary skill in the art.

Equivalents Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents of the specific embodiments provided herein.

Such equivalents are intended with be encompassed by the following claims.

Claims

1. A library of oligonucleotides comprising a plurality of non-identical oligonucleotides, wherein the individual oligonucleotides comprise: a) a first sequence sing a helix domain; b) a second sequence comprising a first hairpin domain; and c) a third sequence comprising a second hairpin domain; n the helix , first hairpin domain and second hairpin domain form an oligonucleotide junction containing a ligand-binding domain, and n the library comprises a ity of non-identical -binding s.

2. A library of oligonucleotides comprising a plurality of non-identical oligonucleotides, wherein the individual oligonucleotides comprise: a) a first sequence comprising a helix domain; b) a second sequence comprising a first hairpin domain; and c) a third sequence comprising a second hairpin domain; wherein the helix domain, the first hairpin domain and the second hairpin domain form an oligonucleotide junction containing a lected ligand-binding domain, and wherein the library comprises a plurality of non-identical ligand-binding domains.

3. The library of oligonucleotides of claim 1 or 2, wherein each helix domain independently is a fully complementary helix optionally comprising one or more ilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair, and a bulge.

4. The library of oligonucleotides of claim 1 or 2, wherein each helix domain is a fully complementary helix.

5. The library of oligonucleotides of claim 1 or 2, wherein each first hairpin domain independently comprises one or more destabilizing nucleotides selected from the group ting of a mismatched base pair, a G•U wobble base pair, and a bulge.

6. The library of oligonucleotides of claim 1 or 2, wherein each second n domain ndently comprises one or more destabilizing nucleotides selected from the group consisting of a mismatched base pair, a G•U wobble base pair, and a bulge.

7. The library of oligonucleotides of claim 1 or 2, wherein the helix domain is at least 4 to 10 base-pairs in length.

8. The y of oligonucleotides of claim 7, wherein the helix domain is at least 10 base-pairs in length.

9. The library of oligonucleotides of claims 1 or 2, wherein the oligonucleotides are oligoribonucleotides.

10. The library of oligonucleotides of claim 1 or 2, wherein the individual oligonucleotides comprise a ce having a series of linked sequences according to Formula I: (I) 2-P2-L2-P2’-J