Citation: Staple DW, Butcher SE (2005) Pseudoknots: RNA Structures with Diverse Functions. PLoS Biol 3(6): e213. doi:10.1371/journal.pbio.0030213
Published: June 14, 2005
Copyright: © 2005 Staple and Butcher. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abbreviations: HDV, hepatitis delta virus; MMTV, mouse mammary tumor virus; mRNA, messenger RNA; NMR, nuclear magnetic resonance; SARS-CoV, severe acute respiratory syndrome coronavirus
RNA molecules fulfill a diverse set of biological functions within cells, from the transfer of genetic information from DNA to protein, to enzymatic catalysis. Reflecting this range of roles, simple linear strings of RNA—made up of uracil, guanine, cytosine, and adenine—form a variety of complex three-dimensional structures. Just as proteins form distinct structural motifs such as zinc fingers and beta barrels, certain structures are also commonly adopted by RNA molecules. Among the most prevalent RNA structures is a motif known as the pseudoknot. First recognized in the turnip yellow mosaic virus , a pseudoknot is an RNA structure that is minimally composed of two helical segments connected by single-stranded regions or loops (Figure 1). Although several distinct folding topologies of pseudoknots exist, the best characterized is the H type. In the H-type fold, the bases in the loop of a hairpin form intramolecular pairs with bases outside of the stem (Figure 1A and 1B). This causes the formation of a second stem and loop, resulting in a pseudoknot with two stems and two loops (Figure 1C). The two stems are able to stack on top of each other to form a quasi-continuous helix with one continuous and one discontinuous strand. The single-stranded loop regions often interact with the adjacent stems (loop 1–stem 2 or loop 2–stem 1) to form hydrogen bonds and to participate in the overall structure of the molecule. Hence, this relatively simple fold can yield very complex and stable RNA structures. Due to variation of the lengths of the loops and stems, as well as the types of interactions between them, pseudoknots represent a structurally diverse group. It is fitting that they play a variety of diverse roles in biology. These roles include forming the catalytic core of various ribozymes [2,3], self-splicing introns , and telomerase . Additionally, pseudoknots play critical roles in altering gene expression by inducing ribosomal frameshifting in many viruses [6–9].
(A) Linear arrangement of base-pairing elements within an H-type RNA pseudoknot. Base pairing is indicated with dashed lines.
(B) Formation of initial hairpin within pseudoknot sequence. Base pairings from loop to bases outside the hairpin are indicated with dashed lines.
(C) Classic H-type pseudoknot fold.
(D) Three-stemmed RNA pseudoknot fold from SARS-CoV.
Catalytically Active Pseudoknots
Hepatitis delta virus (HDV) is a satellite virus of hepatitis B virus. Infection of humans by both HDV and hepatitis B virus is generally more severe than a hepatitis B virus infection alone . HDV has a circular genome that is replicated by the host RNA polymerase II through a double-rolling-circle mechanism. This mechanism produces long strands of RNA that must be processed into unit lengths for viral replication. The processing of the viral RNA is achieved by the self-cleaving HDV ribozyme encoded in the RNA . The HDV ribozyme folds into a double-pseudoknot conformation and self-cleaves, producing single-genome-length HDV RNAs. The HDV ribozyme is the fastest-known naturally occurring self-cleaving ribozyme, with a cleavage rate greater than one per second, and is active in vitro in the absence of any proteins . The HDV ribozyme consists of five helical segments that form two coaxial stacks of two (stems P2 and P3) and three (stems P1, P1.1, and P4) helices each (Figure 2A) [3,13]. Two pseudoknots are formed, each with one helix from each coaxial stack (stems P1 and P2, and stems P3 and P1.1). These two pseudoknots stack on top of each other, forming a nested double-pseudoknot conformation .
Stems and loops are numbered sequentially, unless otherwise noted. Structure coordinates were obtained from the Protein Data Bank (http://www.rcsb.org), and structural representations were produced using MOLMOL software.
(A) HDV (1SJ3). Numbering of stems reflects standard nomenclature for HDV. The U1A RNA binding domain is colored gray and is not included in the three-dimensional structure.
(B) Diels-Alder ribozyme (DA-R) (1YLS).
(C) Human telomerase (hTR) (1YMO).
(D) MMTV (1RNK).
(E) Pea enation mosaic virus RNA1 (PEMV-1) (1KPZ).
(F) Simian retrovirus 1 (SRV-1) (1E95).
The removal of introns from pre–messenger RNA (pre-mRNA) is fundamentally important for eukaryotic life. Most introns are removed by a ribonucleoprotein complex called the spliceosome. A subset of introns are self-cleaving, catalyzing their own removal from pre-mRNA without the aid of proteins . One such class of introns are the group I self-splicing introns, with the most well-studied example being from the ciliate Tetrahymena. The structure of this ribozyme is made up of three helical domains, with many tertiary contacts between the domains . The only portion of the RNA that spans all three helical domains is a pseudoknot belt that wraps around the molecule, base-pairing with all three helices . The pseudoknot establishes the catalytic core of the group I self-splicing introns.
Naturally occurring ribozymes appear to perform mainly hydrolysis and transesterification reactions ; however, in vitro selection has yielded RNAs capable of performing a wide variety of enzymatic reactions . Recently the structure of an RNA capable of catalyzing carbon–carbon bond formation by the Diels-Alder reaction was solved (Figure 2B) . The RNA adopts a λ-shaped fold of its three helices in which stems 2 and 3 stack coaxially, with stem 1 abutting the active site, forming a pocket precisely complementary to the reaction product. The 5′ end of the RNA bridges helical stems 3 and 1, generating a complex nested pseudoknot topology. Although conformationally distinct from the HDV ribozyme , it is worthwhile to note that they are two of the fastest-known ribozymes, and both utilize a nested pseudoknot architecture .
Chromosomes possess protective ends known as telomeres to protect themselves from degradation due to successive rounds of DNA synthesis. Telomerase, the ribonucleoprotein complex responsible for the maintenance of the telomere ends , is upregulated in most cancers  and might play a role in aging . Human telomerase is made up of a 451-nucleotide RNA, a reverse transcriptase, and other proteins . At the 5′ end of the RNA is a highly conserved pseudoknot, required for activity, which lies at the core of telomerase. The structure of the human telomerase pseudoknot reveals a classic H-type pseudoknot fold with a slight bend between the stems (Figure 2C) . A triple-helix structure flanks the junction of the helices and extends into each stem. Mutations within the telomerase pseudoknot have been directly linked to the diseases autosomal dyskeratosis congenita  and aplastic anemia .
Not all pseudoknots with biological functions are catalytically active. In fact, one of the most common functions of pseudoknots is to induce ribosomes to slip into alternative reading frames, otherwise known as frameshifting. Ribosomes typically translate mRNA without shifting the translational reading frame . However, a number of organisms have evolved mechanisms to cause site-specific or programmed frameshifting of the ribosome in either the +1 or −1 direction . Programmed −1 ribosomal frameshifting is typically found in viruses and is required for the replication and proliferation of all retroviruses. Therefore, the pseudoknot structures involved in frameshifting are attractive targets for the development of antiviral drugs. The frameshift event is induced by two RNA elements within the mRNA: (i) a heptanucleotide slippery sequence X XXY YYZ (spaced triplets represent preframeshift codons) and (ii) a downstream RNA structure, typically a pseudoknot . The mechanism behind how these elements promote −1 frameshifting is not fully understood. The current model posits that the ribosome encounters the downstream pseudoknot while the slippery sequence is being decoded by the ribosome. The pseudoknot structure likely causes the ribosome to pause, which is necessary but not sufficient for frameshifting to occur . While paused on the slippery sequence, the ribosome slips back one nucleotide and subsequently continues translation in the −1 reading frame.
The nuclear magnetic resonance (NMR) structure of the mouse mammary tumor virus (MMTV) frameshift-inducing pseudoknot was the first structure of a frameshift-inducing pseudoknot (Figure 2D) . The MMTV pseudoknot forms a compact structure of two guanine/cytosine-rich A-form helices. The MMTV pseudoknot has a bend of approximately 60° between the two helices, caused by an unpaired adenine that intercalates between the helices and may act as a hinge. Subsequent structural and functional studies of several variants of the MMTV pseudoknot reveal that the intercalated nucleotide and the resulting bend between stems 1 and 2 are required for efficient frameshifting .
In beet western yellow virus, pea enation mosaic virus, and other luteoviruses, an RNA pseudoknot also stimulates a −1 frameshift between the P1 and P2 genes . These structures, solved by X-ray crystallography and NMR, respectively, revealed compact H-type pseudoknots with extensive loop–stem interactions (Figure 2E) [7,9]. Like that of MMTV, frameshift-inducing pseudoknots in both the beet western yellow virus and pea enation mosaic virus have an unpaired nucleotide at the junction of the stems; however, this nucleotide is displaced from the helix, not intercalated as in MMTV.
The frameshift-inducing pseudoknot from simian retrovirus 1 contains a number of unique features (Figure 2F) . Although predicted to resemble that of MMTV, with an unpaired adenine between the helices, the structure revealed the formation of a uracil–adenine pair at the junction, allowing the two stems to stack directly on top of each other (Figure 2F) . The simian retrovirus 1 pseudoknot forms an extensive loop 2–stem 1 triplex, which contains a ribose zipper motif in addition to base–base and base–sugar interactions .
The severe acute respiratory syndrome coronavirus (SARS-CoV) genome contains two large genes, ORF 1a and ORF 1b, separated by a programmed −1 frameshift element required for ORF 1b expression . Recent work has suggested that the SARS-CoV frameshift-inducing pseudoknot may be unique because it contains a third stem–loop [31,32]. In this issue of PLoS Biology, bioinformatic, phylogenetic, and structural evidence is reported indicating that the SARS-CoV pseudoknot is indeed a three-stemmed RNA pseudoknot (see Figure 1D) . Dinman and co-workers report the potential for the formation of this three-stemmed pseudoknot in all coronaviruses in the GenBank database. NMR experiments confirmed the proposed three-stemmed pseudoknot structure in SARS-CoV. Although the atomic-resolution structure has not yet been determined, this study identifies a new secondary structure capable of promoting frameshifting that is structurally distinct from previously described pseudoknots (see Figure 1D).
RNA pseudoknots have been identified in nearly every organism and comprise functional domains within ribozymes, self-splicing introns, ribonucleoprotein complexes, viral genomes, and many other biological systems. It is clear that the pseudoknot topology can result in many different, complex structures. The pseudoknot, therefore, represents an important piece of RNA architecture, as it provides a means for a single RNA strand to fold upon itself to produce a globular structure capable of performing important biological functions.
- 1. Rietveld K, Van Poelgeest R, Pleij CW, Van Boom JH, Bosch L (1982) The tRNA-like structure at the 3′ terminus of turnip yellow mosaic virus RNA. Differences and similarities with canonical tRNA. Nucleic Acids Res 10: 1929–1946.
- 2. Rastogi T, Beattie TL, Olive JE, Collins RA (1996) A long-range pseudoknot is required for activity of the Neurospora VS ribozyme. EMBO J 15: 2820–2825.
- 3. Ke A, Zhou K, Ding F, Cate JH, Doudna JA (2004) A conformational switch controls hepatitis delta virus ribozyme catalysis. Nature 429: 201–205.
- 4. Adams PL, Stahley MR, Kosek AB, Wang J, Strobel SA (2004) Crystal structure of a self-splicing group I intron with both exons. Nature 430: 45–50.
- 5. Theimer CA, Blois CA, Feigon J (2005) Structure of the human telomerase RNA pseudoknot reveals conserved tertiary interactions essential for function. Mol Cell 17: 671–682.
- 6. Shen LX, Tinoco I (1995) The structure of an RNA pseudoknot that causes efficient frameshifting in mouse mammary tumor virus. J Mol Biol 247: 963–978.
- 7. Nixon PL, Rangan A, Kim YG, Rich A, Hoffman DW, et al. (2002) Solution structure of a luteoviral P1-P2 frameshifting mRNA pseudoknot. J Mol Biol 322: 621–633.
- 8. Michiels PJ, Versleijen AA, Verlaan PW, Pleij CW, Hilbers CW, et al. (2001) Solution structure of the pseudoknot of SRV-1 RNA, involved in ribosomal frameshifting. J Mol Biol 310: 1109–1123.
- 9. Egli M, Minasov G, Su L, Rich A (2002) Metal ions and flexibility in a viral RNA pseudoknot at atomic resolution. Proc Natl Acad Sci U S A 99: 4302–4307.
- 10. Lai MM (1995) The molecular biology of hepatitis delta virus. Annu Rev Biochem 64: 259–286.
- 11. Kuo MY, Sharmeen L, Dinter-Gottlieb G, Taylor J (1988) Characterization of self-cleaving RNA sequences on the genome and antigenome of human hepatitis delta virus. J Virol 62: 4439–4444.
- 12. Thill G, Vasseur M, Tanner NK (1993) Structural and sequence elements required for the self-cleaving activity of the hepatitis delta virus ribozyme. Biochemistry 32: 4254–4262.
- 13. Ferre-D'Amare AR, Zhou K, Doudna JA (1998) Crystal structure of a hepatitis delta virus ribozyme. Nature 395: 567–574.
- 14. Cech TR (1986) The generality of self-splicing RNA: Relationship to nuclear mRNA splicing. Cell 44: 207–210.
- 15. Adams PL, Stahley MR, Gill ML, Kosek AB, Wang J, et al. (2004) Crystal structure of a group I intron splicing intermediate. RNA 10: 1867–1887.
- 16. Doudna JA, Cech TR (2002) The chemical repertoire of natural ribozymes. Nature 418: 222–228.
- 17. Wilson DS, Szostak JW (1999) In vitro selection of functional nucleic acids. Annu Rev Biochem 68: 611–647.
- 18. Serganov A, Keiper S, Malinina L, Tereshko V, Skripkin E, et al. (2005) Structural basis for Diels-Alder ribozyme-catalyzed carbon-carbon bond formation. Nat Struct Mol Biol 12: 218–224.
- 19. McEachern MJ, Krauskopf A, Blackburn EH (2000) Telomeres and their control. Annu Rev Genet 34: 331–358.
- 20. Blasco MA (2003) Telomeres and cancer: A tale with many endings. Curr Opin Genet Dev 13: 70–76.
- 21. Marciniak RA, Johnson FB, Guarente L (2000) Dyskeratosis congenita, telomeres and human ageing. Trends Genet 16: 193–195.
- 22. Kelleher C, Teixeira MT, Forstemann K, Lingner J (2002) Telomerase: Biochemical considerations for enzyme and substrate. Trends Biochem Sci 27: 572–579.
- 23. Vulliamy T, Marrone A, Dokal I, Mason PJ (2002) Association between aplastic anaemia and mutations in telomerase RNA. Lancet 359: 2168–2170.
- 24. Farabaugh PJ, Bjork GR (1999) How translational accuracy influences reading frame maintenance. EMBO J 18: 1427–1434.
- 25. Gesteland RF, Atkins JF (1996) Recoding: Dynamic reprogramming of translation. Annu Rev Biochem 65: 741–768.
- 26. ten Dam EB, Pleij CW, Bosch L (1990) RNA pseudoknots: Translational frameshifting and readthrough on viral RNAs. Virus Genes 4: 121–136.
- 27. Somogyi P, Jenner AJ, Brierley I, Inglis SC (1993) Ribosomal pausing during translation of an RNA pseudoknot. Mol Cell Biol 13: 6931–6940.
- 28. Chen X, Kang H, Shen LX, Chamorro M, Varmus HE, et al. (1996) A characteristic bent conformation of RNA pseudoknots promotes - 1 frameshifting during translation of retroviral RNA. J Mol Biol 260: 479–483.
- 29. Miller AW, Dinesh-Kumar SP, Paul CP (1995) Luteovirus gene expression. CRC Crit Rev Plant Sci 14: 179–211.
- 30. Rota PA, Oberste MS, Monroe SS, Nix WA, Campagnoli R, et al. (2003) Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science 300: 1394–1399.
- 31. Baranov PV, Henderson CM, Anderson CB, Gesteland RF, Atkins JF, et al. (2005) Programmed ribosomal frameshifting in decoding the SARS-CoV genome. Virology 332: 498–510.
- 32. Ramos FD, Carrasco M, Doyle T, Brierley I (2004) Programmed -1 ribosomal frameshifting in the SARS coronavirus. Biochem Soc Trans 32: 1081–1083.
- 33. Plant EP, Pérez-Alvarado GC, Jacobs JL, Mukhopadhyay B, Hennig M, et al. (2005) A three-stemmed mRNA pseudoknot in the SARS coronavirus frameshift signal. PLoS Biol 3: e172. doi: 10.1371/journal.pbio.0030172.