VIEW. Flipping Off the Riboswitch: RNA Structures That Control Gene Expression

Flipping Off the Riboswitch: RNA Structures That Control Gene Expression Dipali G. Sashital and Samuel E. Butcher* Department of Biochemistry, University of Wisconsin Madison, 433 Babcock Drive, Madison, Wisconsin 53706 One of the overriding themes in biology is the need for exquisite control and precision in the regulation of gene expression for the normal growth and development of organisms. Regulated gene expression has been discovered at every level, from transcription of the genetic code to post-translational protein modification. In its simplest terms, genetic regulation is the process by which the cell recognizes its metabolic needs and acts to modulate the levels of certain gene products on the basis of these requirements. Traditionally, these functions were thought to be controlled exclusively by proteins. However, it is now known that 2% of bacterial genes are regulated by metabolitesensing RNAs without the assistance of proteins (1 3). These riboswitches form within the leader sequences of messenger RNAs (mrnas) and bind their target metabolites with the affinity and specificity required for the precise regulation of gene expression. Metabolite binding to the riboswitch effects an allosteric conformational change that modulates gene expression at the transcriptional or translational level. In the past, our knowledge of how riboswitches could transfer the binding energy of specific metabolites into optimal genetic control was limited by a shortage of 3D structural data. A new flood of riboswitch crystal structures from the laboratories of Batey (4), Ban (5), and Patel (6) now presents a clearer picture of the structural complexity required for these functions (Figure 1). Similar to RNA aptamers selected in vitro to bind small molecules (7, 8), riboswitches have evolved in vivo to bind a remarkably wide range of molecules with extremely high affinity, including several diverse metabolites and coenzymes such as guanine and adenine (9 12), L-lysine (13, 14), thiamine pyrophosphate (TPP) (15), and S-adenosylmethionine (SAM) (16 18) (for review, see refs 1 3). This observation is not wholly surprising, given the numerous types of artificial RNA aptamers that had been previously isolated in vitro (7, 8, 19). One of the truly fascinating and unique features of riboswitches is the elegant way in which they utilize a variety of ligand-induced structural rearrangements to modulate gene expression. This mechanism depends on the riboswitch s intricate architecture, which uses both secondary and tertiary structures with far greater complexity than is typically seen in aptamers selected in vitro (Figure 1) (20). Within a riboswitch, the ligand-binding aptamer domain is coupled to an expression platform whose conformation controls gene expression through a variety of methods. Transcription can be turned off or on through either the formation or the preclusion of a terminator stem (Figure 2, panel a), translation can be inhibited by the sequestration of the Shine Dalgarno (SD) sequence within a structure (Figure 2, panel b), or the mrna can be cleaved through enzymatic activity of the riboswitch itself (1 3, 21). Riboswitches, like their metabolitesensing protein counterparts, must be able ABSTRACT Riboswitches are metabolitesensing RNA structures that have been discovered in regulatory regions of messenger RNA (mrna). They have the remarkable ability to shut off the transcription or translation of their own mrnas in response to binding a specific metabolite. In other words, riboswitches regulate their own genes using RNA instead of protein. Three new crystal structures reveal how S-adenosylmethionine and thiamine pyrophosphate riboswitches accomplish this task. *Corresponding author, butcher@nmrfam.wisc.edu. Published online July 21, 2006 10.1021/cb002465 CCC: $33.50 2006 by American Chemical Society www.acschemicalbiology.org VOL.1 NO.6 ACS CHEMICAL BIOLOGY 341

Figure 1. Structures of TPP-sensing riboswitches from E. coli and A. thaliana and a SAM-sensing riboswitch. Overall folding schemes for a) TPP and b) SAM riboswitches. Crystal structures of E. coli and A. thaliana c) TPP riboswitches and d) a SAM riboswitch. Ligands (green) are bound at the interface of parallel helices in all three structures. Helix P1, the ligand, and the PK are labeled. Surface representation of e) TPP and f) SAM riboswitches reveals the significant burial of the ligand within the globular RNA fold. to discriminate against other small molecules in order to be viable gene regulators. Accordingly, riboswitches are exceptionally specific and can reject even subtle variations of the natural ligand. For example, riboswitches distinguish between molecules on the basis of the presence or absence of functional groups (15, 18, 22), atomic charge (23), and stereochemistry (14, 24). This ligand selectivity has led to several seemingly paradoxical observations about the types of molecules that are bound. In several cases, including the TPP riboswitch, negatively charged phosphate groups on the ligand are required for optimal binding to the polyanionic RNA (15, 21, 22); this reveals another rather puzzling facet to the extraordinary ligand specificity displayed by riboswitches. So how do riboswitches translate a binding event into a conformational rearrangement dramatic enough to turn off a gene? And how is a binding pocket created that can comfortably house negatively charged molecules that would normally be repelled by RNA? New crystal structures of the TPP- and SAM-sensing riboswitches offer insights into these questions (4 6). The TPP-responsive riboswitch is located in the 5= untranslated region of mrnas involved in thiamine biosynthesis, controlling the expression of these genes by inhibiting translation, in the case of the crystallized TPP riboswitch, or transcription, as has been observed for other known TPP riboswitches (15). Interestingly, this riboswitch is one of the most ubiquitous of the known classes, with representatives identified in all three domains of life (25). Appropriately, the structures of both a prokaryotic (Escherichia coli) and a eukaryotic (Arabidopsis thaliana) TPP riboswitch are revealed in publications from the Patel and Ban laboratories, respectively; this allows for an evolutionary comparison (Figure 1, panels a, c, and e) (5, 6). The Batey laboratory unveiled the structure of a SAM-responsive riboswitch, a common bacterial regulator that modulates the expression of genes involved in sulfur metabolism and methionine biosynthesis through transcription termination (Figure 1, panels b, d, and f) (4, 16 18). One of the most striking features of the riboswitches is their elaborate tertiary structures, in which the RNAs adopt globular folds that surround and bury their ligands within their cores (Figure 1, panels c f). Complex RNA structures are made up of preformed secondary structural elements, including helices, loops, and junction regions that organize into tertiary folds through RNA RNA stacking or hydrogenbonding interactions (Figure 1, panels a and b) (26). Intriguingly, in the riboswitch structures, the ligands appear to be the catalysts for the formation of many of the tertiary contacts. Binding of the ligand in each structure occurs at the interface between two parallel helices, creating intricate hydrogen-bonding and electrostatic networks between the RNA, bound Mg 2 ions, and the small molecule (Figure 3). These interactions drive the induced-fit binding mechanism of the riboswitch, with the ligand serving to juxtapose and tether separate domains that form tertiary contacts only upon ligand binding. The resulting compaction is instrumental in the formation of a crucial helix (P1) that turns off both riboswitches through the sequestration of the SD sequence (Figure 2, panel a) or the formation of a terminator stem (Figure 2, panel b). Along with helical packing, ligand binding triggers the organization of single-stranded regions of the RNA that would likely be dis- 342 VOL.1 NO.6 341 345 2006 SASHITAL AND BUTCHER www.acschemicalbiology.org

Figure 2. Genetic control by riboswitches is achieved by the coupling of aptamer and expression platform domains. a) Translational and b) transcriptional methods for genetic regulation are shown, as they apply to the TPP and SAM riboswitches, respectively. a) In the absence of TPP, the riboswitch contains little tertiary structure, and helix P8* forms, leaving the SD sequence free to interact with the ribosome. Upon binding of TPP, the aptamer domain folds into a compact, globular structure, forming helix P1. Helix P8 forms within the expression platform, sequestering the SD, thus inhibiting translation. b) The SAM riboswitch forms a PK interaction in the absence of ligand, forming a partial binding pocket for the ligand. An anti-terminator stem forms, portions of which are mutually exclusive with helix P1. Ligand binding stabilizes the formation of P1, thus disrupting the anti-terminator and allowing a terminator stem to form. ordered in the absence of the small molecule. Many of these unpaired strands are located at the junctions between helices and must be folded in order for helical stacking to occur (Figure 1, panels a and b). This is especially true in the TPP-sensing riboswitch, in which the pyrimidine ring of the ligand interacts extensively with a singlestranded junction between two helices, resulting in a helical stack that forms onehalf of the parallel helical motif that dominates the structure (Figure 1, panel c). In general, the globular structure of the TPP riboswitch appears to form only upon ligand binding. Structure-probing experiments presented by Serganov et al.(6) suggest that the tertiary contacts observed in the structure only occur when the ligand is bound. In contrast, the tertiary structure in the SAM-sensing riboswitch is partially preformed (18, 23) through a pseudoknot (PK) interaction (27) that occurs between a loop region of one stem-loop and the junction region between two other helices (Figure 1, panels b and d). In the SAM riboswitch, the PK preserves the global architecture of the structure, including a helical stack and a kink-turn motif (28) (Figure 1, panel b, and Figure 2, panel b). The partial formation of the SAM riboswitch structure likely aids in ligand recognition (4). One of the most surprising aspects of the TPP riboswitch is its ability to select for negatively charged phosphate groups. Previous studies of the TPP riboswitch revealed that the aptamer also binds thiamine and thiamine phosphate, albeit with decreasing affinity; this suggests that the presence and the length of the phosphate functional group are instrumental in binding specificity (15). The structures of the TPP riboswitch confirm this hypothesis, because the RNA forms two binding pockets: one for the pyrimidine ring present in all three compounds and one for the pyrophosphate found solely in TPP (Figure 3, panels a and b). In the first pocket, the pyrimidine ring of thiaminecontaining compounds forms several hydrogen bonds with nucleotides within the riboswitch. Additionally, the pyrimidine ring is sandwiched between two purines, creating a very snug fit for the ligand (Figure 4, panel b). However, for the two helical domains of the riboswitch to be bridged, the pyrophosphatebinding pocket must also be occupied. Binding of the TPP pyrophosphate moiety occurs in a large pocket formed at a junction between two helices, which also coordinates Mg 2 ions that shield the negative charge of the phosphate groups (Figure 3, panels a and b). Binding of the TPP pyrophosphate moiety occurs in a large pocket formed at a junction between two helices. The helices also coordinate the Mg 2 ions that shield the negative charge of the phosphate groups (Figure 3, panels a and b). The E. coli structure contains two partially hydrated Mg 2 ions, which directly coordinate to the pyrophosphate and form watermediated hydrogen bonds between the pyrophosphate and the RNA (Figure 3, panel a). The A. thaliana structure also shows an identically bound Mg 2 ion (Figure 3, panel b). The second Mg 2 ion was not observed in the A. thaliana structure, possibly because of differences in the data resolution or the occupancy of the second ion. In both structures, the nonbridging oxygen atoms of the pyrophosphates hydrogen-bond to nucleotides within the junction. The structures highlight the importance of Mg 2 in neutralizing repulsive forces between the ligand and RNA; this process allows the pyrophosphate to hydrogen-bond within the binding pocket. Like the TPP riboswitch, the SAM riboswitch integrates the polar functionalities of the ligand into an extensive hydrogenbonding network (Figure 3, panel c). The methionine amino acid group of SAM stacks on top of its own adenine ring, and both the amino acid and the adenosyl moieties of SAM are recognized by three or four hydro- www.acschemicalbiology.org VOL.1 NO.6 341 345 2006 343

Figure 3. Close-up views of ligand-binding pockets reveal extensive hydrogen-bonding and electrostatic interactions. a) and b) The TPP riboswitch forms two binding pockets for the pyrimidine ring and pyrophosphate groups. The pyrimidine ring forms three hydrogen bonds with two nucleotides in both TPP riboswitch structures. a) The E. coli pyrophosphate-binding pocket contains two Mg 2 ions (purple spheres) that coordinate (blue dashed lines) the RNA, the ligand, and the water molecules (blue spheres). Several water-mediated and direct ligand RNA hydrogen bonds (black dashed lines) also form to facilitate pyrophosphate binding. b) The A. thaliana structure only contains one Mg 2 ion in the pyrophosphate-binding region. The hydrogen bonds occur directly between the ligand and the RNA. c) The adenosyl and amino acid moieties of SAM hydrogen-bond extensively with the RNA, securing the ligand in a compact conformation. Two carbonyl oxygens in the RNA mediate selectivity for SAM through favorable electrostatic interactions with the positively charged sulfonium ion (indicated by arrows). gen bonds with adjacent residues within the nearby helix. As a result of these RNA ligand interactions, the molecule is held in a compact conformation in which its positively charged sulfonium ion is juxtaposed with carbonyl oxygens in the minor groove of helix P1 (Figure 3, panel c). This arrangement of the ligand creates a unique electrostatic basis for recognition of SAM over the analogous molecule S-adenosylhomocysteine, which lacks the positively charged sulfonium ion (18). The TPP and SAM riboswitches reveal a variety of strategies with which RNA can selectively bind small molecules. However, information about features that are not recognized by riboswitches may also be informative. For example, the TPP riboswitch does not recognize the TPP thiazole ring through any specific contacts; this may explain why pyrithiamine pyrophosphate (PTPP) is an effective antimicrobial compound (5, 6, 29). In PTPP, a pyridine ring replaces the thiazole ring, but the compound binds the riboswitch with similar affinity to TPP and subsequently can turn off gene expression in a vital metabolic pathway. This example elegantly demonstrates how understanding the structures of bacterial-riboswitch-binding pockets can contribute significantly to our search for novel antimicrobial agents. Given the ubiquity of riboswitches in bacteria, it is somewhat surprising how few have been found in eukaryotes. The one exception is the TPP riboswitch, which has been identified in fungi and plants and appears to retain many of the characteristics of the prokaryotic riboswitch (25). The structures of the E. coli and A. thaliana riboswitches from the Patel and Ban groups confirm this conservation, revealing very few differences between the prokaryotic and eukaryotic RNAs (Figure 4). The divergences occur mainly in regions that do not contact the ligand (Figure 4, panel a). Therefore, ligand binding and specificity are achieved nearly identically in the two structures (Figure 4, panels b and c). These observations support the hypothesis that the TPP riboswitch is a relic of an ancient RNA world, in which RNA controlled all the processes of life. The known riboswitches appear to have survived because of the economy that they provide the cell. Given that this efficiency is inherent in a selfregulating mrna, it is tempting to speculate that modern riboswitches may have evolved in eukaryotes that are not present in prokaryotes. The multitude of additional mrna processing steps unique to eukaryotes affords a variety of targets for RNA-based genetic control. Further research into potential eukaryotic riboswitches will help to shed Figure 4. A comparison of prokaryotic and eukaryotic riboswitch structures. a) Overall folding of the E. coli (orange) and A. thaliana (purple) riboswitches. Regions with minimal divergence are highlighted in gray. b) A close-up of the TPP pyrimidine-ring-binding pocket reveals a high degree of similarity between the two structures. c) A close-up of the pyrophosphate-binding pocket. The RNA does not diverge significantly, although the positioning of the pyrophosphate moiety does. The Mg 2 ion observed in the A. thaliana structure occupies the same space as one of the Mg 2 ions in the E. coli structure. 344 VOL.1 NO.6 341 345 2006 SASHITAL AND BUTCHER www.acschemicalbiology.org

further light onto these fascinating RNAs and their structures. REFERENCES 1. Nudler, E., and Mironov, A. S. (2004) The riboswitch control of bacterial metabolism, Trends Biochem. Sci. 29, 11 17. 2. Soukup, J. K., and Soukup, G. A. (2004) Riboswitches exert genetic control through metaboliteinduced conformational change, Curr. Opin. Struct. Biol. 14, 344 349. 3. Winkler, W. C., and Breaker, R. R. (2003) Genetic control by metabolite-binding riboswitches, Chem- BioChem 4, 1024 1032. 4. Montange, R. K., and Batey, R. T. (2006) Structure of the S-adenosylmethionine riboswitch regulatory mrna element, Nature 441, 1172 1175. 5. Thore, S., Leibundgut, M., and Ban, N. (2006) Structure of the eukaryotic thiamine pyrophosphate riboswitch with its regulatory ligand, Science 312, 1208 1211. 6. Serganov, A., Polonskaia, A., Phan, A. T., Breaker, R. R., and Patel, D. J. (2006) Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch, Nature 441, 1167 1171. 7. Feigon, J., Dieckmann, T., and Smith, F. W. (1996) Aptamer structures from A to zeta, Chem. Biol. 3, 611 617. 8. Hermann, T., and Patel, D. J. (2000) Adaptive recognition by nucleic acid aptamers, Science 287, 820 825. 9. Mandal, M., Boese, B., Barrick, J. E., Winkler, W. C., and Breaker, R. R. (2003) Riboswitches control fundamental biochemical pathways in Bacillus subtilis and other bacteria, Cell 113, 577 586. 10. Mandal, M., and Breaker, R. R. (2004) Adenine riboswitches and gene activation by disruption of a transcription terminator, Nat. Struct. Mol. Biol. 11, 29 35. 11. Batey, R. T., Gilbert, S. D., and Montange, R. K. (2004) Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine, Nature 432, 411 415. 12. Serganov, A., Yuan, Y. R., Pikovskaya, O., Polonskaia, A., Malinina, L., Phan, A. T., Hobartner, C., Micura, R., Breaker, R. R., and Patel, D. J. (2004) Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mrnas, Chem. Biol. 11, 1729 1741. 13. Grundy, F. J., Lehman, S. C., and Henkin, T. M. (2003) The L box regulon: lysine sensing by leader RNAs of bacterial lysine biosynthesis genes, Proc. Natl. Acad. Sci. U.S.A. 100, 12057 12062. 14. Sudarsan, N., Wickiser, J. K., Nakamura, S., Ebert, M. S., and Breaker, R. R. (2003) An mrna structure in bacteria that controls gene expression by binding lysine, Genes Dev. 17, 2688 2697. 15. Winkler, W., Nahvi, A., and Breaker, R. R. (2002) Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression, Nature 419, 952 956. 16. Epshtein, V., Mironov, A. S., and Nudler, E. (2003) The riboswitch-mediated control of sulfur metabolism in bacteria, Proc. Natl. Acad. Sci. U.S.A. 100, 5052 5056. 17. McDaniel, B. A., Grundy, F. J., Artsimovitch, I., and Henkin, T. M. (2003) Transcription termination control of the S box system: direct measurement of S-adenosylmethionine by the leader RNA, Proc. Natl. Acad. Sci. U.S.A. 100, 3083 3088. 18. Winkler, W. C., Nahvi, A., Sudarsan, N., Barrick, J. E., and Breaker, R. R. (2003) An mrna structure that controls gene expression by binding S-adenosylmethionine, Nat. Struct. Biol. 10, 701 707. 19. Wilson, D. S., and Szostak, J. W. (1999) In vitro selection of functional nucleic acids, Annu. Rev. Biochem. 68, 611 647. 20. Gilbert, S. D., and Batey, R. T. (2005) Riboswitches: natural SELEXion, Cell. Mol. Life Sci. 62, 2401 2404. 21. Winkler, W. C., Nahvi, A., Roth, A., Collins, J. A., and Breaker, R. R. (2004) Control of gene expression by a natural metabolite-responsive ribozyme, Nature 428, 281 286. 22. Winkler, W. C., Cohen-Chalamish, S., and Breaker, R. R. (2002) An mrna structure that controls gene expression by binding FMN, Proc. Natl. Acad. Sci. U.S.A. 99, 15908 15913. 23. Lim, J., Winkler, W. C., Nakamura, S., Scott, V., and Breaker, R. R. (2006) Molecular-recognition characteristics of SAM-binding riboswitches, Angew Chem., Int. Ed. 45, 964 968. 24. Nahvi, A., Sudarsan, N., Ebert, M. S., Zou, X., Brown, K. L., and Breaker, R. R. (2002) Genetic control by a metabolite binding mrna, Chem. Biol. 9, 1043 25. Sudarsan, N., Barrick, J. E., and Breaker, R. R. (2003) Metabolite-binding RNA domains are present in the genes of eukaryotes, RNA 9, 644 647. 26. Leontis, N. B., and Westhof, E. (2003) Analysis of RNA motifs, Curr. Opin. Struct. Biol. 13, 300 308. 27. Staple, D. W., and Butcher, S. E. (2005) Pseudoknots: RNA structures with diverse functions, PLoS Biol. 3, e213 28. Winkler, W. C., Grundy, F. J., Murphy, B. A., and Henkin, T. M. (2001) The GA motif: an RNA element common to bacterial antitermination systems, rrna, and eukaryotic RNAs, RNA 7, 1165 1172. 29. Sudarsan, N., Cohen-Chalamish, S., Nakamura, S., Emilsson, G. M., and Breaker, R. R. (2005) Thiamine pyrophosphate riboswitches are targets for the antimicrobial compound pyrithiamine, Chem. Biol. 12, 1325 1335. www.acschemicalbiology.org VOL.1 NO.6 341 345 2006 345