1. Nucleic Acids Research, 1993, Vol. 21, No. 13 3051-3054
Collection of small subunit (16S- and 16S-like) ribosomal
RNA structures
Robin Ray Gutell
MCB Biology, Campus Box 347, University of Colorado, Boulder, CO 80309-0347, USA
INTRODUCTION
Inferring higher-order structure for complex RNA molecules,
such as the ribosomal RNAs has relied primarily on comparative
methods. Underlying these methods is the premise that molecules
with different primary structure and similar functional
characteristics have similar secondary and tertiary structure
[Reviewed in: 1]. For these methods to be effective, the RNA
molecules under study need to be sufficiently similar at the
primary structure level to obtain good sequence alignments,
however these same sequences also need to be proportionately
different for positional covariance to occur, the indicator
suggesting the existence of a basepair.
The higher-order structure models for 16S rRNA have evolved
in stages. Initially, with a small number of 16S rRNA sequences
in hand, a minimal secondary structure was proposed. Increases
in the number and diversity of available 16S rRNA sequences
and paralleled with improvements in correlation analysis
algorithms has lead to the continual refinement of this structure
model. In the earlier stages only secondary structure pairings were
identified. In contrast during the latter stages only minor
refinements in these pairings occurred while several novel tertary
and non-canonical pairing constraints were proposed [reviewed
in: 2-3]. And now with over 2,200 16S and 16S-like rRNA
available sequences spanning the three phylogenetic domains and
the two organelles (Mitochondria and Chloroplast), detailed
phylogenetic, structural, and structural evolution information is
now being deciphered in great detail, although the resulting
analysis from different groups is not always congruent.
Over the years several groups have developed 16S rRNA
secondary structure models. The current versions for each are
fairly analogous with one another[2, 4-7], although this has not
always been the case. And while the current differences are small,
some are significant for the Escherichia coli and other Bacterial,
Archaea, Eucarya, and mitochondrial structure models. These
versions should also not be considered final as these models are
expected to undergo minor revisions as the number and diversity
of 16S and 16S-like sequences increases and is paralleled with
continued improvements and alternative correlation analysis
interpretations [8, unpublished work].
Over the next few years this collection, and the accompanying
23S rRNA (see Gutell, Gray, Schnare, this issue) will grow in
size and detail. The complexity of structure and the evolutionary
dimension of these structures presents us with a wonderful
opportunity to investigate RNA structural motifs and map with
some precision, the evolution ofthese RNAs and underlying RNA
structural characteristics associated with different phylogenetic
assemblages. This collection of structures should also be ofvalue
to the experimentalist studying rRNA structure and function. And
equally valuable to those studying rRNA based phylogeny.
OBJECTIVES
The objectives for this annual collection are:
1. Present our most current comparative interpretation of the
Escherichia coli 16S rRNA secondary structure [with
general agreement between C.R.Woese, H.F.Noller, and
myselfl. While this secondary structure model is stable,
minor adjustments are expected in some of the helical
pairings.
2. Present other significant correlations that suggest tertiary
and other complex structure in the Escherichia coli model.
The numbers for such structural interactions should increase
over the next few years.
3. Present a sampling of different 16S and 16S-like rRNA
higher-order structure models. Starting with a broad
sampling of phylogenetically and structurally distinct
models, additional examples will be developed to fill in this
broad phylogenetic and structure matrix. A parallel effort
will refine all previously proposed models as new
comparative structure information is obtained. To be
discussed elsewhere and in more detail, this collection of
structures will serve as the database for detailed analysis
of RNA evolution and RNA structural motifs.
The first 16S and 16Slike structure release is set for
summer/fall- 1993, and will include major representatives
from the three phylogenetic domains and organelles. The
approximate numbers will be: 25 (eu)bacteria, 10 archaea,
15 eucarya, 2 chloroplast, and 10 mitochondria. Readers
are encouraged to contact the author with suggestions for
additional higherorder structure models not currently
available.
4. Illustrated in this article are three divergent 16S and 16S-
like rRNAs structure models for: Escherichia coli, a
member of the (eu)bacteria phylogenetic domain; nuclear
encoded Saccharomyces cerevisiae, a member of the
eucarya domain; and Caenorhabditis elegans mitochondria,
one of the smallest known 16S-like rRNAs.
This work is an adjunct to the monumental effort of the RDP
(Ribosomal Database Project, see this issue); Their mandate being
to generate and make various tpes of ribosomal RNA data and
interpretation generally available. On-line access to these 16S
rRNA secondary structure files is available from the RDP
k.l 1993 Oxford University Press
2. 3052 Nucleic Acids Research, 1993, Vol. 21, No. 13
aOU °AaU A
0A 0 OAA A0cc Q.%AAUOC AUCUOOGAUOUC 0A
a II I. 11111-0 1l C II A
UOC0A oUGOACACUUA0AU a COO
0-oc--C
0-co-C
O-C
O-C-
C-O-
0 a
0
U-A
0-C
-C--0
U-UU-A
0-C-is
GA - U
AAC %UA AkAACCUOOO UOCAUCUOA CUOOCAAOC
-C to-IIl ..lI1IlIIt --IIl-1 /c0UCOOOCCC OUOUAOACU OA UUOUUUO /ICUCA
C A I A I C .
UAA GO IO0U
UA C
U
-A
A
0
0
U I I * I I1
UUCC00 UAUC
I AA0AA440
AOaCCUOaA 0CAO'V"l(0A%A
II -
1111 0
cCOCGOU ACOUAU %
A-SO
A
?0-cU-A
O:CAco A-U
A 0350 C-0
UCd,,OAIOCAO-CA
CU UU
O A, C-Oe
Aa
^ CA-
a CA OUC AC
SW A allII I -1 110
~A0UCAA00UC coo UG
'oAoo GA% 0
a
A(
A,AU C A
AU lP
UI U a
CO A
A*
CcAU
OACOCh^ au C COOAC
05.U (1 A:
C-a
AO-C
0-C
-U
C-0
U-AA
C-0- 11
UA~~~~~~
AG I A AUAALQ3A0aAAA
-cGocucA a CAO -
-W c' AA
A-
C0-C
-A 0
AC
A
5,
AA
OU CAOOAAGO
II I lII - I I I
CA OUCUUUCLa 0 I
0o
OOOAOUACOOCC 4
--.11
uAA
-u u
O-C
a - C 1100
o-C, I
U * OC CAAA 0 C u aAGuU ACOAOC CCUUA UCCUUUOU CC COOuC
O I I I I I I I -I1 I I I I I I I 1011 1 1 1 C
U . O UOCUCO AOOOU AOOAAAC A 00 aCCOO
A-U I A U 0 0 I UC A
I
A-U C O% AC 1150
-A 0 . . A
U o-cAA UGU O-C .....OOU
c-o 0 0-
U U 0 C
O-C AA C
-U-A
U C C-o0
U 0 O - UC-1200 aA
c-o A -A A
C-0 0-CU A U
O OA AU-A
aAUOAOAAU OU A
111.11 C 00C UU0
A S ° G O-C
IuU. 'OA.eC UAC-AU
%CUU 1000 CUe0 A
C 0 A UC-OCC A CeU/U-ZA
w COU 95~A0X: XU^
0 0-C
CU.% ~ A
0-CA~0eso A C-0
A 0
U ~~~~~0C 0oA U IACCU a
A UU-A a 0aA
A-~~~00~aU UOClla %
O~~~~C UIAAU ~ 15 A~U-A CaAa
A A10C- 5AA a
0
'cu~ ~ ~ AU A CUC AAA -0A AGAC 0-C CAa
A-U
a
, OAA
CUCA UUO3AA
AA I AAA
AItI U UO ACOOOOCCCOCACAA
COOUACU I I I I** III
* * AC UOU UCCOGOC
LOUUu aA C A C
IA C
10 C U
1400-C---- I -UU
o-C 0 C
0C Aul I U A A C
UAACCOUAGOOO
U U * IlII I I*I
CA-GA UUOOCOUCCA
C oAA
-A-U- U
C-0 C
C-0 A
A O O C
U*0 C
0-U U
0-C C
O AA C
A O o U-
0-U U
-U-A- A
O-C I
AAOC 0-U 3'
U ° - U
UI U-A
uuco u * a0-U
C-o
A-U
A-U
-A-U-
A C
0 A
A
A C
O-C
U-A
A-U
0-U
0-C
U* 0
AO-C-C-0
U-0
AU -0
A A
C-0
C -0
1450-U 0
UC
Figure 1. Higher-order structure diagram for Escherichia coil 16S rRNA [2-3]. For secondary structure basepairings, short lines connect canonical pairs (C:G,
U:A), G:U pairs are denoted with dots, A:G pairings with larger open circles, and other non-canonical pairings with closed circles. Terdary interactions are connected
with thicker and longer solid lines. Every 10th nucleotide position is marked with a tic mark, while every 50th posiion is numbered [Sequence Accession number is J01695].
_u .- A 1
3. Nucleic Acids Research, 1993, Vol. 21, No. 13 3053
U U
0
u
u
LAAACUU0UOAAUUAAAAAAOUUUCAU U U U
OUo,ocucuuo0c 0 A A CCAOoAC 0
u 1,111 I I 1111-1 A
UCCU0A0UUCC A A u C OOUCUU A
c c
uu0 A CUQO MCAAC0000OCC ty Uuuuco AGuoUc^coocu
u II 1111 IIII 11111. U
UUAGCc UoOCC ooUU0acccooo00
A
AAAUA c C
A^ ac c ua u
0 U o-
U A 0
AA A
AGUC
So
U C UAAUAICAAUOU! AOU
i. Hg1 U
CO UCA A
A u
0A 00
UUUCCOUAOOU
.11I I I I I I I (I
OAAOOC0UCCA
A
U
C
A
uU
3.
Figure 2. Higher-order structure diagram for Saccharomyces cerevisiae nucleocytoplasmic 16S-like rRNA. Higher-order structure interactions illustrated and noted
as in Fig. 1. [Sequence is slighdy modified from acccession number J01353].
A
;U A
oao U
A
u
u
A
A
A
A
le u
a
u
Aae Ac
AUUCC
4. 3054 Nucleic Acids Research, 1993, Vol. 21, No. 13
AAUUU O OAUUUAUAUUAOU o
a ...... ...I . I I .
uoou U C UAOAUUUUCuuu A
u
A
u
u
IC
A
cu
aA
U U
O-c
u u
u u
AUA-4
C -0A
Au
u-A
AUUACCU A
A
AA c0oa. A
AA *UoO
U A
A U
U A
^u
0 A
a A
OAC OLUA
u
u
u
c
u
AU AA AUUUU
u
UA
U UAAAA A
u 11t1 1 u
A- UUAUUUU u
O-C u
a .U A A
A A
u
U AAU
0*uUA^u-A
O0* uA
U - A
Au U
UO -c
u c
U-C
UC-0A
Ac-A
u
UU
uuuC_u
C0U
U - AUA-U
CUU-A
A AU..uA
C-0
u
u
UAQAUUUAO0G0,1111 1II A
A UCUA A UC CAA
u0
Au u
3.
FIgr 3. Higher-order structure diagram for Caenorhabditis elegans mitochondrial 16S-like rRNA. Higher-order structre interactions illustrated and noted as in
Fig. 1. [Sequence Accession number is X54252].
anonymous ftp site (info.mcs.anl.gov). Currently, these
PostScript files are maintained in the
'/pub/RDP/SSU rRNA/sec_struct' directory. Additional
information about this server and access to the Ribosomal
Database Project is described in their article in this issue. Readers
unable to access this on-line information, can obtain a set of 16S
rRNA hardcopy diagrams from the author.
The interactive graphics program XRNA (developed by Bryn
Weiser, UC Santa Cruz) facilitated the generation ofthe higher-
order structure models displayed here and the PostScript files
available at the RDP anonymous ftp site. Manual alignment of
these 16S and 16S-like rRNA sequences was facilitated with the
interactive sequence alignment editor AE2 (developed by Tom
Macke, Scripps Clinic).
ACKNOWLEDGMENTS
Refining and interpreting these rRNA structure models is an on-
going and long term collaboration with Drs Carl Woese and
Harry Noller. Bryn Weiser and Tom Macke are greatfully
acknowleged for developing the wonderful programs that make
much of this analysis and presentation possible. I also wish to
tank the W.M.Keck Foundation for their generous support of
RNA science on the Boulder campus, and SUN Microsystems
for their timely donation of computer equipment. The author is
an Associate in the Program in Evolutionary Biology of the
Canadian Institute of Advanced Research. This work was
supported by the NIH (GM 48207).
REFERENCES
1. Gutell R.R. (1993a) Curr. Opin. Struct. Biol. 3, in press.
2. Gutell R.R., Larsen N., and Woese C.R. (1993). In Zimmernmann R.A, and
Dahlberg A.E.(eds). Ribosomal RNA: Stucue, Evolution, Gene Expression
and Function in Protein Synthesis. CRC Press, Boca Raton, FL. in press.
3. Gutell R.R. (1993b). In Nierhaus K.H., Subramanian A.R., Erdiann V.A.,
Franceschi F., and Wittman-Liebold B. (eds). The Translational Apparatus.
Plenum Publishing Corporation, NY, NY in press.
4. Brimacombe R., Greuer B., Mitchell P., Osswald M., Rinke-Appel J.,
Schuler D., and Stade K. (1990). In Hill W.E., Dahlberg A., Garrett R.A.,
Moore P.B., Schlessinger D., and Warner J.R. (eds). The Ribosome:
Structure, Function, and Evolution. American Society for Microbiology,
Washington D.C. pp93-106.
5. Ehresnann B., E C., Romby P., Mougel M., Baudin F., Westhof
E., and Ebel J-P. (1990). In Hill W.E., Dahlberg A., Garrett R.A., Moore
P.B., Schlessinger D., and Warner J.R. (eds). The Ribosome: Structure,
Function, and Evolution. American Society for Microbiology, Washington
D.C. ppl48-159.
6. Noller H.F., Moazed D., Stern S., Powers T., Allen P.N., Robertson J.M.,
Weiser B., and Triman K. (1990). In Hill W.E., Dahlberg A., Garrett R.A.,
Moore P.B., Schlessinger D., and Warner J.R. (eds). The Ribosome:
Structure, Function, and Evolution. American Society for Microbiology,
Washington D.C. pp73-92.
7. De Rijk P., Neefs J-M., Van de Peer Y., De Wachter R., (1992). Nucleic
Acids Res. 20,Supplement 2075 -2089.
8. Gutell R.R., Power A., Hertz G.J., Putz E.J., and Stormo G.D. (1992).
Nucleic Acids Res. 20, 5785-5795.