SlideShare uma empresa Scribd logo
1 de 10
Comparative studies of RNA: inferring higher-order
structure from patterns of sequence variation
Robin R. Gutell
University of Colorado, Boulder, USA
RNA structural chemistry and evolutionary biology, long considered
disparate fields of study, are the topics of this review. Evolution transcends
all of the sciences, adding a dimension that enriches every science
it touches, and in its enrichment of those fields contributes back to
evolutionary thought. RNA, on the other hand, is a molecule with
an inordinate number of possible conformations; knowing which to
evaluate experimentally is problematic. Analysis of RNA sequences from
an evolutionary perspective reveals patterns of variation and sequence
constraints, and suggests how an RNA sequence is folded up into its
higher-order structure.
Current Opinion in Structural Biology 1993, 3:313-322
Introduction
“How far back the historian wishes to place the origins
and antecedentsof the GlassBead Game is, ultimately,
a matterof his personal choice. For like everygreat idea
it has no real beginning; rather, it has always been, at
leastthe ideaof it.” HermannHesse- MagisterLudi (The
GlassBeadGame)
Imagine you’re a graduate student, and your thesis
project is to fold up a large RNA molecule (let us say
hypothetically, 16sand 23s ribosomal RNA[rRNA]) into
its correct secondary and tertiary structure. And if you
have some extra time, deduce something about the
functionally important regions of these RNAmolecules.
Unfortunately, your own laboratory experimentation is
producing little useful information. You ask: Can these
intrinsically complex molecules be folded up? How will
I be able to saysomething profound about its function.
And most important, will I be able to obtain an advanced
degreebefore the year 2000?
And then one day, as if in a Herman Hessenovel, you
arethrilled to learn that the experiments havebeen done
for you (CRWoese,personal communication). The task
now is to find the notebooks containing these results,
question how this information is stored, and wonder
how to interpret it. To keep a long story short, the
notebooks are found and are filled with homologous
nucleic acidsequenceinformation, andwith the sequenc-
ing and computer information revolutions upon us, you
find a large number of RNAsequences,all in computer-
readableform.
The interpretation of this sequenceinformation is at first
glance not readily apparent. Upon reflecting on your
graduate course in evolutionary biology, which at the
time was taken simply for the fun of it, not for the se-
rious molecular biologist, you come up with two simple
but powerful conclusions:
(a) These RNAmolecules evolved to their present form,
starting from a simpler lessconstrained state and being
progressivelydefined and relined in structure and func-
tion. Each structure existing today is thus highly tuned
for its own structure/function and for its complex and
dynamic interactions with the outside world. (It should
be noted that this refinement in RNA structure is far
more subtle than we can fully appreciate with today’s
experimental methodology [1,2].>
(b) Principles of RNAstructure, specific functional con-
straints, and overall cellular mechanics mold the evolu-
tion of these molecules. Although not necessarily im-
plicit in its RNA structure, these principles, constraints
and mechanics are encoded in the actual sequence of
nucleotides.
Decoding this collection of nucleic acid sequencesbe-
gins with the realization that molecules performing sim-
ilar functions (e.g. tRNAs) must have a similar three-
dimensional structure. And knowing that different base-
pair types(e.g.AU and GC) compose homologous struc-
tural elements,e.g.a helix, it follows that similar higher-
order structures can be derived from different primary
structures. (The details and extent of these realizations
and deductions remain to be elucidated and fully appre-
ciated.)
The scientific logic used here to deduce structural and
functional relationships is tierent from conventional
experimental analysis.Instead of designing and execut-
ing experiments to test the hypothesis, here the exper-
Abbreviations
RNase-ribonuclease; rRNA-ribosomal RNA.
@ Current Biology Ltd ISSN 095940X 313
314 Nucleic acids
iments havealreadybeen done, and the sequencesare
the answers!Now we areasking:what are the questions,
what are the hypothesisbeing tested,and how were the
experimentsdone?Thesequestionsunderlie the analysis
of our homologous RNAsequencedatasets.Patternsof
consemtion and variation are analyzed,our initial goal
beiig to infer higher-order structure.
Comparative sequence methods in practice
‘Since all of the sRNAs[soluble RNAs]in the processof
transferingtheir respectiveamino acidsto protein on the
surfaceof ribosomesprobably haveto meetsimilar,if not
identical,spatialconstraints,it would seemreasonableto
conclude that all sRNAswill be capableof adopting es-
sentiallyidentical three-dimensionalstructures.” [31
“Figure 3 shows,however,that it is possible to construct
very similar base-pairedstructures in spite of the limited
similaritiesin sequences.”[4]
Transfer RNA
Shortly after the first few tRNA sequenceswere deter-
mined, comparative sequence analysiswas applied to
thesesequences,resulting in the well known cloverleaf
secondarystructure [3-6]. For this analysis,secondary
structure helicescontaining only canonical (e.g.AU and
GC) and GU pairings in common with all sequencesin
this datasetwere identiIied and scored positively. The
cloverleafwasthe only structure to satisfythis condition
for a molecule76 (or so) nucleotides in length.
5S ribosomal RNA
Nearly a decade later: comparative sequence methods
were again called upon to fold 5s rRNA, a molecule
120 nucleotides in length, into its secondary structure
[7]. Criteria similar to those applied to tRNA were used
here.Whereasexperimental methods resulted in several
different secondarystructure models, thesecomparative
methods,when applied to 10phylogeneticallydiversese-
quences,resultedin a single analogoussecondarystruc-
ture.
16s and 23s ribosomal RNA
In I978 and 1980,the first complete sequencesfor ES-
cbericbiucoli I6S and 23srRNAwere determined IS-lo],
ad shortlythereafter,comparativestructurestudieswere
initiated to fold theseRNAsof length 1542and 2904nu-
cleondes,the largest RNA.sso far for which structures
have been provided using these methods. Criteria sim-
ilar to thoseused on tRNAand 5SrRNAresultedin asin-
gle secondarystructure shared by all membersof their
respectiverRNAdatasets(lb& 111-131;23s, [lP161).
During this time, a helix was considered comparatively
proven if two or more base pairs within the potential
helix each contained at least one covanation. As the
number and diversity of sequencesincreased, the I6S
and 23s rRNAsecondarystructures modelswere refined,
and variations in the different rRNAmodelswere largely
. resolved.The most recentrefinementsof the 16sand 23s
rRNAstructures are basedon a large and diversecollec-
tion of sequences[17.,18*,19].At this time, the question
of comparative proof can accesseach base pair in the
model; the vast majority of all base pairs in the higher-
order structure can now be considered proven!
Although appreciatedby some,this methodologywasnot
widely acceptedduring the early1980sandwasevencast
negativelyby some:“I don’t know how you can suggest
structure by just iooking at sequences”.With the success
and attention given to the rRNAs,attitudeschanged.Many
of us started(mis)pronouncing genus-speciesnameswe
knew nothing about, except for some RNA sequence.
Many of us became(un)certilied microbiologist, proto-
zoologist and the like.
Other RNAs
During the 198Os,comparativesequenceanalysiswasap-
plied to other functionally important RNAmolecules,re-
sulting in secondarystructure models for each of these
comparativesequencedatasets.This list includes:group I
[20,21] and II [22] introns, ribonuclease (RNase)P RNA
[23,24], U-PNAs(Ul, U2, U4, U5 and u6 [25]), 7SSRP
RNA [261,and telomeraseRNA [271.
Are these comparatively derived structures congruent
with formal laboratory experimentation? The quick an-
swer is ‘yes’,although the question cannot be addressed
for all F@IAsnoted above,and the long detailed answer
is beyond the scope of this review. Sufficeit to say,all
of the comparatively inferred secondary structure base
pairs were present in the yeast tRNAphecrystal struc-
ture 128,291,revealing the authenticity of this approach.
Chemical probing experiments of the entire I6S rRNA
(301were largely consistent with the comparatively de-
rived secondarystructure, suggestingsuch a methodol-
ogy could also be applied to large RNAs,although it
doesn’t by itself prove this model. Other experiments,
discussedin part in the following sections, lend addi-
tional support for the higher-order interactionsproposed
with thesecomparativestudies.
Searching for tertiary interactions
Transfer RNAs
With the strong implication that comparative methods
can correctly deduce secondary structure, we can now
askcan such a methodology alsodeduce tertiary interac-
tions?The first attemptwasmadeon tRNA[31], resulting
in, a few correct and a few incorrect tertiary interaction
proposals (when comparedwith the crystalstructure so-
lution 128,291).But with significantly larger and diverse
tRNAdatasets,and relined correlation analysismethods,a
larger proportion of the higher-order structure can now
be correctly inferred [32,33,34*,35].
Comparative studies of RNA Gutell 315
16s and 23s ribosomal RNAs
A searchfor tertiaryinteractions in the 16s and 23srRNAs
wasinitiated in the early 1980sand resulted in severalre-
finementsin secondary-structurepairings and a few can-
didatesfor tertiary pairings [36]. The best candidate in-
volved positions 570and 866 (E. cofi numbering) in 16.5
rRNA 1371,forming a pseudoknot structure (see Fig. 1).
As the l6.S and 23S-rRNAdatabasesgrew in number and
diversity, the number and variety of tertiary interactions
increased(see Figs 1 and 2 and next section; for recent
work on 16sand 23siRNA,see [17*,18*,19,38]).Genetic
and biochemical experimental analyseshave addressed
and substantiateda number of these higher-order inter-
actions (see also next section) [39*,40,41=,42,43,44].
Fig. 1. Secondary structure diagrams for E. co/i 165 rRNA. All nu-
cleotides are replaced with small open circles. Higher-order inter-
actions more complex than the secondary structure helices are
denoted with a thick line or a large filled circle. Adapted from
117.1 (for details, see I17*, X3*1).
Group I introns
Covarianceanalysisof the group I intron databasehas
been most impressive,producing a well establishedsec-
ondary and tertiary structure model [21] and forming
the basis for a detailed three-dimensional model upon
which functional experimental analysis can be based.
Severalproposed base-triples [45], pseudoknots, and
non-canonical,pairings [46] have been substantiated us-
ing site-directedmutagenesis.
Ribonuclease P RNA
The RNA component of the RNaseP ribonuclease has
been extensively studied by comparative and experi-
mental methodologies. An evaluation of the phyloge-
netic commonality and diversity in RNaseP RNAslead
to the development of a mini-P RNA,a minimally con-
figured RNAwith normal enzymatic activity [47]. Several
comparatively derived base pairings in the two pseudo-
knot structures were testedand substantiated using site-
directed mutagenesis[48=].
Emerging principles of RNA structure
“If you’ve seenone helix, you’ve seen them all.”
“If I see one more secondary structure model, I’ll
scream!” (Comments overheard at a ribosome confer-
ence)
Fortunately, there is more to higher-order structure than
just secondary-structure helices. Now we can ask what
are these additional RNA structure principles and can
they be inferred from comparative methods? Altema-
tively we can ask what types of structural features are
discernable using comparative methods?Before we ad-
dress these questions, it is important to step back and
evaluatethesemethods, albeit in a most brief fashion.
Initial searchesfor tertiary interactions haveusedanewer
covarianceanalysisalgorithm, Merent from those used
for inferring RNAsecondarystructure. The most notable
differences are: correlating positions are identilied re-
gardlessof the pairing type,in contrast to previous meth-
ods that specifically looked for canonical and GU pairs;
and the current algorithm only looks for correlating pairs,
independent of surrounding structure, which is in sharp
contrast with older methods that only identified pairings
within a potential secondary-structure helix. Analysis of
the 16s and 23s rRNAdatasetsusing this newer algorithm,
and without any knowledge of previous secondary-struc-
ture proposals, identified the vast majority of all previ-
ously suspectedsecondary-strucm-repairings (RRGutell,
unpublished data)‘[34*]. Thus, searchingonly for corre-
lating positions resulted in the two basic and underlying
principles of RNAstructure: namely,AU, GCand GU pair-
ings, and the antiparallel and contiguous arrangementof
thesepairings!
It is of interest to note that the majority of all pairings in
the 16s and 23s rRNAdatasetsidentilied using our most
recent analysisalgorithms (RRGutell, unpublished data)
1341 are canonical or GU, and that the tiajority of these
are found in the conventional secondarystructure helix.
Only a small percentageof all correlating pairs are non-
canonical.Only a smallpercentageof all correlating pairs
lie outside of the secondarystructure, and these usually
form pseudoknot structures or heliceswith a single base
pair. These exceptional 16s and 23s rRNA interactions
are emphasizedin Figs 1 and 2, and discussedbelow.
(a)
316 Nucleic acids
JFig. 2. Secondary structure diagrams for E. co/i 235 rRNA: (a) 5’ half; fb) 3’ half. All nucleotides are replaced with small open circles.
Higher-order interactions more complex than the secondary structure helices are denoted with a thick line or a large filled circle. Dashed
lines represent tentative tertiary interactions. Adapted from 117*1(for details, see 1170, 18.1).
Non-canonical pairs
Of the 16 possible pairing types, the six canonical and
GU pairings account for the vast majority of all com-
paratively’derived base pairs. In the majority of phylo-
geneticbase-pairreplacements,one of thesesix typesis
replacedwith anotherof thesesix. Ten of the 16possible
pairings occur infrequently; however, specific classesof
pairing typesand their phylogenetic replacementsarebe-
ginning to emerge,with the most salient ones described
below (others havebeen identified [17*]).
A:C+tC:A
Severalexamples of this replacement type occur in
the rRN& One occurs in 16s rRNAbetween positions
1357 and 1365, at the end of a helix [2,17*]. Within
(eu)bacteria, chloroplasts and mitochondria, this pair
interchangesbetween AG and GA; within Archue and
Eucurya, the interchangeoccurs solely between canon-
ical pairing types.A second good example is found in
23srRNAbetweenpositions 2112and 2169,which along
with severalcanonical pairings forms a parallel structure
(also seebelow) [170,381.Most interestingly,thesesame
two nucleotides are associatedwith the E site in trans-
lation [49], suggestingthat this unusual pairing and/or
the parallel structure are functionally important aswell
asstructurally unique.
A:A+-L:C
Correlations between thesetwo pairing typeshavebeen
found in 16s rRNAbetween positions 722 and 733 [38],
and in 5s rRNAbetweenpositions 76and 100(RRGutell,
unpublished data).A similar setof correlated pairings is
present in the HIV1 Revbinding region of an in vitro
genetically selectedRNA 1501.In these three cases,this
pairing is found in an internal loop, immediatelyadjacent
to a helical structure. The Revprotein binding suggests
that this non-canonicalpairing could be ageneralprotein
recognition motif.
u:u+K:c
Severalexamples of these correlated replacementsare
found in 16s and 23s rRNA The two found in 16s lie
in different internal loops, immediately adjacent to the
end of the helix. These same types of correlated pair-
ings associatedomains 4 and 5 of 23s rRNA [38]. Ther-
modynamic studies have addressedthese pairing types
and found that UU and CC+ pairs can stabilize a du-
Comparative studies of RNA Gutell 317
plex [51]. Interestingly, these two pairings Can form an
isomot+ic structure when one of the cytosines are pro-
tonated, whereas the unprotonated form of the CC pair
is destabilizing and not isomorphic in structure.
C:UctA:C
A percentage of the GU helical base pairings are very con-
served, and for a percentage of these, replacement yields
an AC pair. The three best examples are found in 16s
rRNA and all lie at the end of one helical element and in
close proximity to the end of an associated helix [17.1. In
the translational decoding site of 16s rRNA, there is a CA
pairing at positions 1402 and 1500 in the overwhelming
majority of all 16s (and 16S-like) rRNA sequences. A few
phylogenetically distinct mitochondria change this pair to
a UG base pair [18*]. A GU and an AC base pair can form
isomorphic structures when the adenine (of the AC) is
protonated. The thermostability of this A+C pair is close
to that of an AU pair [52**] suggesting that AC pairs, with
the adenine protonated, may well be paired in a specific
structural context.
Tetra loops
The hairpin loop of four bases is a common feature in
the rRNAs, occuring in over 50% and 40% of all 16s and
23s hairpin loops, respectively. Among the 256 different
loop sequences of size four, there is a strong bias in the
rRNAs for three major classes: UUCG, GNRA, and CUUG
[531. Thermodynamic analysis revealed that these loops
are surprisingly very stable [54,55], and structural analy-
sis of these loops reveals an unusually compact structure
[56,57].
Pseudoknots
Pseudoknots are a popular and fashionable class of RNA
structure, delined as a set of base pairings that cross an
existing secondary-structure helix. Comparative methods
have been used to elucidate many examples in 16s and
235 rRNAs [17.1, aswell as in many other RNA molecules
[58-*]. Within the i-RN& these structures vary from one
to three base pairs in length, and are situated immedl-
ately adjacent to another helical structure on one or both
ends of its helix, suggesting a possible coaxial stack. NMR
studies of a simple pseudoknot structure have revealed
coaxial stacking of the helices [59].
Site-directed mutagenesis has substantiated several of
these pseudoknot interactions. In 16s rRNA, a helix of
three base pairs formed between a side bulge at posi-
tion 505 and the apex of the hairpin loop at position
525 is nested between other helices, forming a complex
pseudoknotted structure (assuming each helix occurs si-
multaneously) with multiple coaxial stacking possibilities.
This same region is highly conserved in primary and sec-
ondary structure and strongly implicated in translational
function. A series of elegant site-directed mutagenesis ex-
periments has addressed the structure of this proposed
helix [39*] and revealed that this helix is not only struc-
turalIy correct, but also is directly implicated in trans-
lational function, streptomycin binding, and binding to
ribosomal protein S12 (also see below).
A complex pseudoknot structure situated with multiple
coaxial stacking possibilities involves 235 rRNA positions
1343-1344 with 1403-1404 (E. coli numbering). This
general region of domain III is the binding site for an
essential early-assembling ribosomal protein. This helix
has recently been experimentally altered in a Saccha-
romyces cerevtie in vitro protein binding system, using
site-directed mutagenesis. The results clearly show that
canonical pairing in this short helix is required for proper
protein binding [41l]_Other comparatively derived pseu-
doknot structures have been proposed in RNaseP,group
I introns, and telomerase RNA [GO]. Some of these struc-
tural elements have been strongly implicated in catalytic
function, and have been evaluated and substantiated by
site-directed genetic analysis [46,48*]. It is interesting
to note that the lengths of non-ribosomal RNA pseu-
doknot helices are usually greater than those found in
rRNk Other naturally occuring pseudoknot structures
have been suggested by comparative and experimental
criteria, and discussed in some detail [589-l. More re-
cently, the analysis of a collection of sequences derived
from an in vitro amplification and selection for binding
to HIV1 reverse transcrlptase has ident.i&i a pseudoknot-
motif binding site [61].
Coaxial stacking
The concept of comparative evidence for a coaxial stack
was initially proposed a decade ago [2] and states that
two adjoining helices that vary in length might be coaxi-
alIy stacked upon one another if their combined length
remains constant. Based on simple spatial considera-
tions, many secon% and pseudoknot helices in the
16s and 23s rPNA can potentially stack upon one an-
other; however, comparative evidence is lacking for all
but two that do satisfy this condition. The first exam-
ple of coaxial stacking involves helices 500-504/541-545
and 5ll-515/536-540 in 16s rRNA, the second is at the
base of the cr-sarcin loop in 23s rRNA, involving he-
lices 26462652/2668-2674 and- 2675-2680/2727-2732
[170,18*]. For both proposed coaxial stackings, the
lengths of the underlying helices remain the same in
all (eu)bacteria, chloroplast and mitochondria, but are of
different lengths in the Arcbaeand Eucarya phylogenetic
domains. Both of these rRNA regions have been directly
implicated in translational function [39**,49] and some of
these functions are overlapping, suggesting that if these
coaxial stacks do occur, they could be associated with
ribosomal function and with each other in acoordinated
manner. As noted earlier, the re@on of 16s rRNA between
positions 500 and 545 is considered to be quite com-
plex, with a pseudoknot structure and potential coaxial
stacking. The coaxial stack proposed here on the basis of
comparative evidence would only make this region more
complex, and for alI of these suggested coaxial stackings
to occur, conformational rearrangements would be re-
quired (i.e. not all of them can occur simultaneously).
318 Nucleic acids
This should be aninteresting set of ideasto testexperi-
mentally.
Parallel interactions
comparative evidenceexists for two setSof parallel in-
teractions in rENA.The more interesting of the two is
found in the 23s rRNA,involving three pairings arranged
in parallel: 2112-2169,2113-2170,and 2117-2172.The
iirst pair covaries between an AG and a GA pairing,
whereasthe latter two change from one canonical pair
to another [17*]. This region is structured further with
the interaction between position 2111 and the first and
lastnucleotide of the ten-aloop at 2144-2147 [19]. This
unusual structure is associatedwith the translational E-
site [48].
Base triples
Transfer ENA, a molecule 76 nucleotides in length,
contains three base-triple interactions (28,291.These
have been partially predicted by comparative methods
[31-33,340,35].No convincing base triples have been
uncovered(so far) within the RNAs,nor haveany been
identified (yet) in RNaseP RNAand the LJ-RNAs.Com-
parative analysisof the group I introns have revealed
howeverseveralbasetriples 121,451,and thesehavebeen
substantiatedby site-directedmutagenesisand modeled
in three dimensions. More recently, these two group I
intron basetriples havebeen characterizedand substan-
tiatedby NMR[62].
Conclusions
“The G:U basepair in the upper stem might be impor-
tant. Is it possible that the activatingenzymescould ex-
tract enough information from the G:U pair and other
featuresof this double-stranded region for it to act asa
recognition site?”[4]
An important question to askat this juncture iswhat type
of information can be obtained from comparative se-
quenceanalysis?Severalexamplesof largestructuresand
variousstructuralelementshavebeen inferred from such
analysii, and many of these are consistent with and/or
proven with experimental methods. Beyond these, are
additional constraints present in RNAmolecules,and if
SO, whit are they?Do they suggestnew typesof higher-
order structural motifs, recognition sitesfor proteins or
other RNAmoleculesor do theyrepresentsubtle thermo-
dynamic and/or structural reiinements?Wii we be able
to decode this information from sequenceinformation
alone?
Addressing such questions will require additional se-
quences and diversity for each RNA dataset (e.g. 16s
rRNA). In parallel, the computer tools used for com-
parativesequenceanalysiswill need to be expanded and
relined. Quantitation correlation analysisalgorithms are
capableof uncovering subtle constraints [32,33,34*,35].
The most recent application of these methods is be-
ginning to identify structural constraints beyond simple
pairings (secondary and tertiary), and suggestsin some
c&s that certain base-pair types (or simply bases) in-
fluence the types of pairings (or bases) in close three-
dimensional proximity (i.e. context eifect; see Figs 3
and 4) [34-j. At the moment, these quantitative cor-
relation analysismethods do not incorporate the num-
ber of phylogenetic eventsunderlying each coordinated
basechange(i.e. the number of compensatorychanges
that have occurred throughout the phylogenetic tree;
the larger this number, the more significant the set of
changes),although such eventshave been incorporated
into a non-quantitative prologbased covarianceanalysis
program [63]. Incorporating a ‘phylogenetic events’fac-
tor into quantitative correlation algorithms could well
improve these quantitative methods and help identify
or strengthen the argument for new structural elements
and/or new.structural principles.
O-3’
0
0
0
S’-0 - 0
o-o
0 - O-70
o-o
o-o
o-o
o-o TO
o”iir. ;f
0000
I I I I
i7 ofi-;-q&g”~~~
20 o--o Law
o-o
o-o
30-O- O-40
o-o
0 0
0 0
Oo O
0
00
0
I 0
0 / 0
00
11
--
Fig. 3. Secondary-structure diagram of tRNA (yeast Phe number-
ing) highlighting those positions correlating best with position 13
(identified with filled triangle). Two large filled circles identify the
two highest correlating positions (22 and 461, and smaller filled
circles identify the next five highest correlating positions with
position 13. Adapted from f34.1.
What is called comparativesequenceanalysis(or phylo-
genetic analysisby manywho are referring to structural
analysisof thetypediscussedhere) goesbeyond the anal-
ysis of a single RNAmolecule. It should be appreciated
that such analysiscan and should encompassthe com-
parison of different RNAmoleculesor subsetsof a given
molecular database,when there is biological rationalefor
doing so. For example, the complete tFWAsequence
Comparative studies of RNA &tell 319
VARIABLE LOQP VARIABLE LOOP
Fig. 4. Stereo pair of the three-dimensional structure for this tRNA, with the seven best correlating positions with position 13 identified
as in Fig. 3. Adapted from I34e1.
databasecan be subdivided into the 20 amino acid ac-
ceptors and analyzed for subtle structural differences,
which could be the recognition signals for the differ-
ent amino acyl synthetases[64,65]. The first attempt at
this wascompleted in 1966 [4], and is surprisingly good
given the small sequencedatasetused. An example of
intermolecular analysis includes a 16%23s rRNA com-
parison. These two moIecules are befieved to interact
during translation, and thus any significant intermolec-
ular correlation could wetl be pointing at a structurally
and functionally important site.
Comparative structure analysis is not only generating
higher-order structures that are widely accepted in var-
ious RNAfields,it is alsoestablishing an agendafor vari-
ous experimental designs.I havenoted how this method
hasidentified manyof the sign&ant RNAstructure prin-
ciples,including Watson-Crick and GU pairings, antipar-
allel and contiguous arrangement of these pairings, te-
tra loops, pseudoknots, severalclassesof non-canonical
pairings (and their replacements),helix coaxial stacking,
base-tripleinteractions,and setsof pairs that form paral-
lel structures,The majority of thesestructures havenow
beenevalulatedand substantiatedin one form or another
usingexperimental methods.When astructure of interest
is atafunctional site,that function can be experimentally
evaluatedin the light of that structure. Thus a compara-
tively derived higher-order structure can and should be
considered a hypothesis,testedwith each new rRNAse-
quence,evaluatedexperimentally, and subject to molec-
ular modeling.
Superimposinghomologous RNAstructures,for example
16s and 16S-likerRNA,allows us the opportunity to eval-
uate more than its higher-order structure, presenting
us with a glimpse of what structural features are con
served throughout evolution or part thereof, which in
turn suggestsstructural elements of possible functional
significance.When the number and diversity of struc-
tures is sufficient, as it is for 16s and 23s rRNA (there
now exist over 2200 16s and 16S-likeand over 200 23s
and 23Slike sequences), evolutionary events and path-
ways can be mapped in great detail. Not only can the
reconstruction of these events be played like a Disney
movie flip book (i.e. a frame by frame snapshot anima-
tion of (r)RNA evolving), but underlying constraints on
structure and function can be deduced,from which prin-
ciples and refinementsof RNAstructure and function can
be inferred.
Darwinian or natural evolution has generated a won-
derfully diverse collection of RNA molecules for us to
compare and contrast utilizing the comparative meth-
ods discussedabove. Recently,the advent of new tech-
niques in biochemistry has atlowed the molecular biol-
ogist to practise a little evolutionary home brewing for
themselves.Starting with a very large collection of ran-
dom oligonucleotide sequences,one can subject these
macromoleculesto multiple rounds of selection and am-
plification, enricl-+g for those sequences that best sat-
isfy the constraint conditions. Such methodology now
puts some of Mother Nature’s authority into the hands
of the research scientist. But, instead of designing ter-
ribly complex and obtuse experiments, as it appears
to us mere mortals, the scientist can now select and
enrich for something far simpler, such as a small fig
and or protein-binding site on an RNA molecule, or
RNA molecules capable of defined catalytic functional-
ity [66**,67**]. Such newer methodology lets the genie
out of the bottle. ~Althoughthe sequencesare the an
swers,we now have a better appreciation of how the
experiments were done; for we ask the questions and
know what the underlying hypothesis is. The sequences
resulting from this work will now have a Yii associated
with its Yang. The next few yearsshould be an exciting
and rewarding time. (And people like me canstill saythat
the experiments havebeen done for us.)
320 Nucleic acids
So,in closing,we.cansit backand marvelabout the rapid
technologicaladvanceshappening all around us.The se-
quencing revolution makes it possible to fill volumes
of notebooks with homologous sequenceinformation.
Computersand their networks allow us to readily store,
manipulate, access,and analyze these notebooks. The
comparativesequence/structure analysisparadigm puts
somemeaningand dimensions to this information. It is
aparadigmthatis itselfstill being definedanddeveloped.
Acknowledgements
This work was supponed by the NIH (GM 48207). RRGutell is an
Associatein the Program in Evolutionary Biology of the Canadian
Institute of AdvancedResearch.I wish to thank the WM Keck Foun-
dation for their generous suppop of RNA science on the Boulder
campus, SUN Microsystemsfor their donation of computer equip-
ment, and B@ Weiser,Tom Mackeand others Fordeveloping much
OFthe computer code used to analyze and present RNA structural
information.
References and recommended reading
Papersof part.ic;larinterest,publishedwithin the annualreviewperiod,
havebeen highlightedas:
.
. .
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
of special interest
of outstanding interest
WOEsECR:Just So Stories and Rube Goldberg Machines:
Speculations on the Origin of the Protein Synthetic Ma-
chinery. In HBOSOMES: Struchrre, Function, and Geneticr
Edited by ChamblissG ef al. Baltimore: University Park Press;
1980:357-373.
WOnE CR, GUT&UR, GUPTAR, NOUERHF: Detailed Analy-
sis of the Higher Order Structure of 16S-Like RIbosomal
Ribonucleic Acids. Microbial Rev 1983,47621669.
RAJBHANoARVUI+STUARTA, FAUI~XNERRD,CHANCSH,KHORANA
HG: Nucleotide Sequence Studies on Yeast Phenylalanine
sRNA Cokd Spring Harb S’p Quant Biol1966, 31~425-434.
MADISONJT, EWEIT GA, KUNGHK: On the Nucleotide Se-
quence of Yeast Tyrosine Transfer RNA Cold Spring Harb
SVmp Quanr Biol 1966,31:409-416.
HOW RW,&CARJ, EVERETTGA, MADISONJT, MARQUISEEM,
M~aatnSH,PEN~VV~CKJR,ZAMIRA: Structure-of a R&nucleic
Acid. Science 1965, 147:1462-1465.
ZACHAUHG, DUI-~NGD, FELDMANNH, MEKHERSF, KARAU
W: Serine Specific Transfer Ribonucleic Acids. XIV. Com-
pa&on of NucIeotIde Sequences and Secondary Structure
Models. cdd Spring Harb 5jmip Quant Bioi 1966, 3h417-424.
FOXGE, WOESECR: 5s RNA Secondary Structure. Nature
1975,256:50>507.
BROSIUSJ, PALMERMI, KENNEDYPJ,NOUERI-IF:Complete Nu-
cleotide Sequence of a 16SRibosomaI RNA Gene from Es-
cberlcbia colt Pm Nat1 Acad Sci USA 1978,75:4801-@05.
BROSIUSJ, DULLT, NOLLERI-IF: Complete Nucleotide Se-
quenceof a 23s Ribosomal RNA Gene from Escberfcbfa
colL Pnx Nat1 Acad Sci USA 1980, iT201-204.
BRG~IUSJ, DUU +IJ,SLEETERDD, NOLLERHF: Gene Organi-
zation and Primary Structure of a Ribosomal RNA Operon
from Bscbedcbia coli J Mol Biol 1981, i48:107-127.
WOESE CR, MAGRUMLJ,GUPTAI( SIEGELRB, STAHLDA, KOPJ,
CRAWFORDN, BROSIUSJ, Gur~u. R,HOGANJJ,NOLIXRI-IF:Sec-
12.
13.
14.
15.
16.
17.
.
ondary Structure Model for Bacterial 16S RIbosomaI RN&
PhylogenetIc, Enzymatic and Chemical Evidence. Nucleic
Acids Res 1980,8:2275-2293.
STIECLERP, CARBONP, EBELJP, EHRESMANNC: A GeneraI
Secondary Structure Model for Procaryotic and Eucaryotic
RNAs of the Small Ribosomal Subunits. Eur J Bicxbem 1981,
120:487-495.
ZWEB C, GLOTZ C, B~COMBE R: Secondary Struc-
ture Comparisons between Small Subunit Ribosomal RNA
Molecules from Six Different Species. Nucleic Acids Res
1981,93621-3640.
NOUERHF, KOPJ. WHEATONV, BROSIUSJ, GIJTEURR,KOPYLOV
AM, DOHMEF, HERRW, STAHLDA, GUPTAR,WOE~ECk Sec-
ondary Structure Model for 23s RibosomaI RNA Ntrcleic
Acids R~s 1981,9:6167-6189.
GLOIZC,?%IEBC, BRIMACOMBER:Secondary Structure of the
Large Subunit RIbosomaI RNA from Escberichfa coli, Zea
mays ChIoroplast, and Human and Mouse Mitochondrial
Ribosomes. Nucleic Acids Res 1981,9:3287-3306.
BRAND C, KROL4 MACHAT~MA,POUYETJ, EUELJP,EDWA~UX
K, KO~~ELH: primary and Secondary Structures of Es-
cbericbia coli mre 600 23s RIbosomal RNA Comparison
with Models of Secondary Structure for Maize Chloroplast
23s rRNA and for Large Portions of Mouse and Human 16s
MitochondriaI rRNAs. Nucleic Aciak Res 1981,9:43034324.
Gunu RR,LARSENN, WOESECR: Lessons from an Evolving
RIbosomal RNA: 16s and 23s rRNA Structure from a Com-
parative Perspective. In Ribosomal RNA Structure, Evolution,
Gene Eapressim and Function in Protein Synthesis Edited
by Ziimermann RA, Dahlberg AE. Boca Raton: CRC Press;
1993:in press.
ia. GUTEURR:The Simplicity behind the Elucidation of Com-
. plex Structure In RIbosomal RNA 7%e Translational Ap
puratus Edited by Nierhaus KH ef al. New York: Plenum
Publishing Corporation; 1993: in press.
Another brief review of rRNAsttucture, lacedwith a few newer struc-
tural themesin rRNAhigher-order constraints.Additional correlations
for a few of the highly conservedand functionallysignificantregionsof
theseRNAsare presented.
19. LARSENN: Higher Order Interactions in 23s rRNA Pm Nat1
Acad Sci USA 1992,89:5044-5048.
20. CECHm Conserved Sequences and Structures of Group I
Introns: Building an Active Site for RNA Catalysis-a Re-
view. Gene 1988,73: 259-271.
21. MICHELF, WOOF E: Modelling of the Three-Dimensional
Architecture of Group I Catalytic lntrons Based on Com-
parative Sequence Analysis. J MoI Biol 1990,216:585-610.
22. M~CHEI.F, UME~ONOK, OZEKIH: Comparative and Functional
Anatomy of Group II Catalytic Introns-a Review. Gene
1989, 825-30.
23. JAMESBD, OLSENGJ, LIUJ, PACENR: The Secondary Struc-
ture of Ribonuclease P RNA, the Catalytic Element of a
RIbonucleoproteIn Enzyme. CeN1988, 52:19-26.
24. BROWNJW, HAASES,JAMESBD, HUNTDA, PACENR Phyloge-
netic Analysis and Evolution of RNaseP RNA in Proteobac-
teria. J Bade61 1991, 1733855-3863.
25. GUIHRE C. PATTEW~NB: SpIiceosomal snRNAs.Annu Rev
Genet 1988, 22:387419,
26. Z~P~EBC: Structure and Function of Signal Recognition Par-
ticle RNA Prog Nucleic Acid Res Mol Biol 1989,37207-234.
27. ROMERODP, BIACKBURNEH: A Conserved Secondary Struc-
ture for Telomerase RNA Cell 1991,67:343-353.
A review intermixed with original comparativeinformation on 16sand
23s rRNAhigher.order strucmreand rRNAstructural motifs.Written in
1991/1992,alreadybecoming a bit out of date,and the book is still not
out yet (as of February 1993).
Comparative studies of RNA Cutell 321
28.
29.
30.
31.
32.
33.
34.
.
KM SH:Three-Dimensional Structure of Transfer RNA Prog mal IINk an IntcaRNA Crosslinking Study. Nucleiic Acids Res
Nucleic Acid Res Mol Biol 1976, 17:181-216. 1992, 20:15931597.
QIJIGEYGJ, RICH A: Structural Domains of Transfer RNA
Molecules. Science 1976, 19&796-806.
MOAZEDD, STERNS, NOUER I-IF: Rapid Chemical Robing
of Conformation in 16S Ribosomal RNA and 30s Ribo-
somal Subunits Using Primer Extension. J Mol Biol 1986,
187399-416.
45. MICHEL F, ELLINGTON AD, COV~URE S, SZOSTAK PJV: Phyloge-
netic and Genetic Evidence for Base-Triples in the CataIytic
Domain of Group I Introns. Nafure 1990, 347~578-580.
m M: Detailed Molecular Model for Transfer Ribonu-
cleic Acid. Nature 1969, 224759-763.
46. COUTURES, EUINGTON AD, GERBER AS, CHERRY JM, DOUDNA
JA, GREEN R, HANNA M, PACE U, RAJACOPAL J, SZOSTAK JW: Mu-
tational Analysis of Conserved Nucleotides In a Self-SpIicing
Group I Intron. J Mol Biol 1990, 215:345-358.
OBEN GJ: Comparative Analysis of Nucleotide Sequence
Data [PhD Thesis]. University of Colorado Health Sciences
Center; 1984.
47. WAUGH DS, GREEN CJ, PACE M: The Design and Catalytic
Properties of a Simplified Ribonuclease P RNA Science 1989,
2441569-1571.
HASEw T, CHAPPEIEAR JE, Fox GE: Fidelity of Secondary
and Tertiary Interactions in tRNA Nucleic Acids Res 1988,
16:5673-5684.
48. HAAS Es, MORSE DP, BROWN JW, SCHMIDT FJ, PACE NR: Long-
. Range Structure In Ribonuclease P RNA Scieuce 1991,
254:853-856.
&TELL RR, POWER A, HER-IZ GZ, PUIZ EJ, STORMO GD:
Identifying Constraints on the Higher-Order Structure of
RNA: Continued Development and Application of Compar-
ative Sequence Analysis Methods. Nucleic Acids Res 1992,
205785-5795.
Another in the ‘although you told me, I want fo veri@ it for myself
series.This time, the pseudoknots in RNaseP RNAare evaluatedusing
site-directedmutagenesis.
49. NOUER HF, MOAZED D, STERN S, POWZRS T, ALIXN PN,
ROBERTSON JM, WEARER B, TRIMAN K: Structure of rRNA
and its Functional Interactions in Translation. In The Ri-
bawme: Structure, Function G Evolution Edited by Hill WE
ef al. WashinggtonDC: American Society for Microbiology;
1990~7392.
Rehemems in quantitative correlation an&is methods and signif-
icantly larger RNA databases are revealing newer and subtle RNA
structure! constraints. This methods paper, with a sampling of results,
suggeststhat hirther analysiswith these methods will reveal more in-
teresting structwal constraints.
35. CHIU DKY, KOLODZ~EJCZAK T: Inferring Consensus Structure
from Nucleic Acid Sequences. CompuI Appr Biosci 1991,
7~347-352.
36. GLITELL RR, WEISER B, WOESE CR, NOUER HF: Comparative
Anatomy of 16%Lie RIbosomaI RNA. Prog Nucleic Acid Res
Mol Biol 1985,32:155-216.
37. GU~EURR,NOUER HF, WOE~E CR: Higher Order Structure
in RibosomaI RNA EMBO J 1986, 5:1111-1113.
38. GUTEURR,WOESE CR Higher-Order Structural Elements in
RIbosomaIRN& Pseudoknots and the Use of Noncanonical
Palm Proc Nat1 Acad Sci USA 1990,87:663-667.
39. POWERST, NOUER HF: A Functional Pseudoknot In 16S Ri-
bosomal RNA EMBO J 1991, 10:22032214.
yrnos, revealing manuscript. Not only is the comparatively inferred
pseudoknot helix substantiated,this structure is shown to be associ-
atedwith translationalfunction.
40. CUNNINGHAM PR, NURSE K, BAKIN A, WEIIZMANN CJ, PFWMM
M, OFENGAND J: Interaction between the fwo Conserved
Single-StrandedRegions at the Decoding Site of SmaIISub-
unit Ribosomal RNA Is Essential for RIbosome Function.
Bicxbemirtry 1992,31:12012-12022.
41. KOOI EA, RLITGER~ CA, MULDER A, RIET JV, VENEMA J, RAUE HA:
. The PhylogeneticalIy Conserved Doublet Tertiary Interac-
. tion In Domain III of the Large Subunit rRNA Is Crucial for
RibosomaI Protein Binding. Proc Nat1 Acud Sci USA 1993,
90~213216.
Another in a growing seriesof experimental works that substantiarea
comparatively derived pseudoknot StwNw in this case, this StIUCNre
is shown to be involved in ribosomal protein binding.
42. BRIMACOMBE R, GREUER B, MITCHEU P, Ossww M, RINKE-APPEL
J, SCHULER D, STADEK: Three-Dimensional Structure and
Function of &zbericbia CON16Sand 23s rRNA as Studied
by Cross-Liig Techniques. In 7% Rhome: Structure,
Function 6 Evolution Edited by Hill WE et al. Washington
DC: American Society for Microbiology; 1990:93-106.
43. Rcrurl PC, LU M, DRAPER DE: Recognition of the Highly Con-
served GTPase Center of 23s RIbosomaI RNA by Riboso-
mal Protein Lll and the Antibiotic Thiostrepton. / Mol Eiol
1991,221:1257-1268.
44. DOIUNGT, GREUERB, BRIMACOMBE R: The Topography of
the 3’ TermInaI Region of Eschetfchfa calf 16s Riboso-
50.
51.
52.
. .
BARTEL DP, ZAPP ML, GREEN MR, SZOSTAK Jw: HIV1 Rev Regu-
Iadon Involves Recognition of Non-Watson-Crick BasePairs
in Vii RNA. Cell 1991, 67529-536.
SANTA-LUCIA J JR, KIERZEK R, TURNER DH: Stabiities of Con-
secutive A-C, C-C, G-G, U-C, and U-U Mismatches in RNA
Internal Loops: Evidence for Stable Hydrogen-Bonded U-U
and C-C+ Pairs. Biocbemishy 1991, 30: 8242-8251.
CHASTAIN M, T~QcO 1JR: svuctural Elements in RNA Pmg
Nucleic Acid Res Mol Biol 1991, 41:131-177._-... -
The most current and comprehensive rewewot RNAStruCNral motits,
with a biasfrom the physicalchemistryperspective.An abbreviated Sys
terns Adminhrafion Guide to RNAstructure.
53.
54.
55.
56.
57.
58.
. .
WOESE CR, WIM(ER S, Gur~u. RR: ArcbitecNre of Ribosomal
RN& Constraints on the Sequence of Tetraloops. PIW.ZNull
Acad Sci USA 1990, 878467-8471.
TLIERK C. GAUSS P. THERMES C, GROEBE DR, GA- M, GUILD
N, STO&O G, D’A~BE~oN-CARAFA Y, UHLENBECK OC, TINOCO
1. BRODY EN. GOLD L: CUUCGG Hairpins: Extraordinarily
&able RNA ‘Secondary Structures Associated with Va&
ous Biochemical Processes. Proc Nat1 Acad Sci USA 1988,
85:1364-13&X
ANTAOVP, IA SY,TINOCO1JR: A Thermodynamic Study of
Unusually Stable RNA and DNA Hairpins. Nucleic Aciak Res
1991, 19:5901-5905.
VARANIG, CHE~NG C, TINOCOI JR:Structure of an Unusually
Stable RNA Hairpin. Biocbemwy 1991, 30:3280-3289.
HEUS H, PARDIA: Structural Features that Give Rise to the
Unusual Stability of RNA HaIrpIns Containing GNRA Loops.
Science 1991, 253~191-194.
TEN DAM E, PLED K, DRAPER D: Structural and Func-
tional Aspects of RNA Pseudoknots. Biock+hy 1992,
31:11665-11676.
Evemng (almost) you need to know about pseudoknots (but forgot
to ask). Current in 1333,we should expect the 1995edition to be much
larger.
59. PUGuSJD, ‘WYATI JR,TINOCOI JR: Conformation of an RNA
Pseudoknot. J Mol Biol 1990, 214:437-453.
60. TEN DAM E, BE~XUM AV, PUIJ CW& A Conserved Pseudoknot
In Telomerase RNA Nucleic Acids Res 1991, 19~6951.
61. TUERK C, M~cDouGti S, Gow L RNA Pseudoknots that
Inhibit Human Immunodeficiency Viis Type 1 Reverse
Transcriptase. Proc Nat1 Acud Sci USA 1992, 896988-6992.
322 Nucleic acids
62.
63.
64.
65.
66.
. .
CHA~~A~HM, Tuwco I JR:A Base-Triple Structurai Domain
.in RNA. Biochemishy 1992,31:12733-12741.
See[67**].
WINKERS, OVERBEEKR, WOESECR OISENGJ, PFLUGERN:
Structure Detection through Automated Covariance Search.
Coinput Appl Biosci 1990, 6365-371.
McCw WH, Foss K: Changing the Identity of a tRNA by
Introducing a GU Wobble Pair near the 3’ Acceptor End.
Science19BB,240:7937%.
67. JOYCE GF: Directed Molecular Evolution. Sci Am 1992,
26790-97.
zese papers [66°0,6700]setthestagefor what’sto come.Thesenewer
methods in biochemistry are changing the pace (no pun intended,
Norm) at which sequencevariation is analyzedto infer structure and
function.
XHULhLUrlLH: Recognition of tRNAs by AminoacyLtRNA
Synthetaws. Prog Nucleic Acid Res Mol Biol 1991,41~23-87.
SZOSTAKJW In vihu Genetics. Trends Biocbem Sci 1992, RRGuteU,MCBBiology CampusBox 347,Universityof Colorado,Boul-
1789-93. der, Colorado 80309-0347,USA

Mais conteúdo relacionado

Destaque

9789740330790
97897403307909789740330790
9789740330790
CUPress
 
Frasiutili chiedere se qualcuno parla una lingua.ppt
Frasiutili chiedere se qualcuno parla una lingua.pptFrasiutili chiedere se qualcuno parla una lingua.ppt
Frasiutili chiedere se qualcuno parla una lingua.ppt
Prof. Sonia Santana
 
القيود القانونية على الحقوق المدنية والسياسية
القيود القانونية على الحقوق المدنية والسياسيةالقيود القانونية على الحقوق المدنية والسياسية
القيود القانونية على الحقوق المدنية والسياسية
Jamaity
 
Kids3d - Spanish
Kids3d - SpanishKids3d - Spanish
Kids3d - Spanish
Mark Wylie
 

Destaque (14)

Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013
 
Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676
 
Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803
 
9789740330790
97897403307909789740330790
9789740330790
 
the dangers of Junk food
the dangers of Junk foodthe dangers of Junk food
the dangers of Junk food
 
Frasiutili chiedere se qualcuno parla una lingua.ppt
Frasiutili chiedere se qualcuno parla una lingua.pptFrasiutili chiedere se qualcuno parla una lingua.ppt
Frasiutili chiedere se qualcuno parla una lingua.ppt
 
Herramientas de la web 2
Herramientas de la web 2Herramientas de la web 2
Herramientas de la web 2
 
ITB Technologies - Автоинкассация. Смарт-сейфы
ITB Technologies - Автоинкассация. Смарт-сейфыITB Technologies - Автоинкассация. Смарт-сейфы
ITB Technologies - Автоинкассация. Смарт-сейфы
 
القيود القانونية على الحقوق المدنية والسياسية
القيود القانونية على الحقوق المدنية والسياسيةالقيود القانونية على الحقوق المدنية والسياسية
القيود القانونية على الحقوق المدنية والسياسية
 
1º actividad del 2º trimestre Lengua 2 Mandioca/2º eso 10 ma.
1º actividad del 2º trimestre      Lengua 2 Mandioca/2º eso 10 ma.      1º actividad del 2º trimestre      Lengua 2 Mandioca/2º eso 10 ma.
1º actividad del 2º trimestre Lengua 2 Mandioca/2º eso 10 ma.
 
Proceso productivo de la pelota
Proceso productivo de la pelotaProceso productivo de la pelota
Proceso productivo de la pelota
 
Kids3d - Spanish
Kids3d - SpanishKids3d - Spanish
Kids3d - Spanish
 
Ради жизни на земле
Ради жизни на землеРади жизни на земле
Ради жизни на земле
 
EUROPE
EUROPEEUROPE
EUROPE
 

Semelhante a Gutell 028.cosb.1993.03.0313

Gutell 025.nar.1992.20.05785
Gutell 025.nar.1992.20.05785Gutell 025.nar.1992.20.05785
Gutell 025.nar.1992.20.05785
Robin Gutell
 
Gutell 034.mr.1994.58.0010
Gutell 034.mr.1994.58.0010Gutell 034.mr.1994.58.0010
Gutell 034.mr.1994.58.0010
Robin Gutell
 
Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2
Robin Gutell
 
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocrGutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
Robin Gutell
 
Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419
Robin Gutell
 
Rna lecture
Rna lectureRna lecture
Rna lecture
nishulpu
 
Gutell 085.jmb.2003.325.0065
Gutell 085.jmb.2003.325.0065Gutell 085.jmb.2003.325.0065
Gutell 085.jmb.2003.325.0065
Robin Gutell
 
Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216
Robin Gutell
 
Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301
Robin Gutell
 
Gutell 015.nar.1988.16.r175
Gutell 015.nar.1988.16.r175Gutell 015.nar.1988.16.r175
Gutell 015.nar.1988.16.r175
Robin Gutell
 
Gutell 068.rna.1999.05.1430
Gutell 068.rna.1999.05.1430Gutell 068.rna.1999.05.1430
Gutell 068.rna.1999.05.1430
Robin Gutell
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105
Robin Gutell
 
Nikola_Ivica_Thesis
Nikola_Ivica_ThesisNikola_Ivica_Thesis
Nikola_Ivica_Thesis
Nikola Ivica
 

Semelhante a Gutell 028.cosb.1993.03.0313 (20)

Gutell 025.nar.1992.20.05785
Gutell 025.nar.1992.20.05785Gutell 025.nar.1992.20.05785
Gutell 025.nar.1992.20.05785
 
Gutell 034.mr.1994.58.0010
Gutell 034.mr.1994.58.0010Gutell 034.mr.1994.58.0010
Gutell 034.mr.1994.58.0010
 
Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473
 
Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383
 
Gutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.goodGutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.good
 
Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2
 
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocrGutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
Gutell 053.book r rna.1996.dahlberg.zimmermann.p111-128.ocr
 
Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419
 
Rna lecture
Rna lectureRna lecture
Rna lecture
 
Gutell 085.jmb.2003.325.0065
Gutell 085.jmb.2003.325.0065Gutell 085.jmb.2003.325.0065
Gutell 085.jmb.2003.325.0065
 
Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216
 
Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301
 
Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167
 
Gutell 015.nar.1988.16.r175
Gutell 015.nar.1988.16.r175Gutell 015.nar.1988.16.r175
Gutell 015.nar.1988.16.r175
 
Gutell 068.rna.1999.05.1430
Gutell 068.rna.1999.05.1430Gutell 068.rna.1999.05.1430
Gutell 068.rna.1999.05.1430
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105
 
Nikola_Ivica_Thesis
Nikola_Ivica_ThesisNikola_Ivica_Thesis
Nikola_Ivica_Thesis
 
Gutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiGutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xi
 
Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200
 
Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768
 

Mais de Robin Gutell

Mais de Robin Gutell (20)

Gutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigGutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfig
 
Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22
 
Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011
 
Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011
 
Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497
 
Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485
 
Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195
 
Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277
 
Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769
 
Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2
 
Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043
 
Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016
 
Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535
 
Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289
 
Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533
 
Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931
 
Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978
 
Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655
 
Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193
 
Gutell 095.imb.2005.14.625
Gutell 095.imb.2005.14.625Gutell 095.imb.2005.14.625
Gutell 095.imb.2005.14.625
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Gutell 028.cosb.1993.03.0313

  • 1. Comparative studies of RNA: inferring higher-order structure from patterns of sequence variation Robin R. Gutell University of Colorado, Boulder, USA RNA structural chemistry and evolutionary biology, long considered disparate fields of study, are the topics of this review. Evolution transcends all of the sciences, adding a dimension that enriches every science it touches, and in its enrichment of those fields contributes back to evolutionary thought. RNA, on the other hand, is a molecule with an inordinate number of possible conformations; knowing which to evaluate experimentally is problematic. Analysis of RNA sequences from an evolutionary perspective reveals patterns of variation and sequence constraints, and suggests how an RNA sequence is folded up into its higher-order structure. Current Opinion in Structural Biology 1993, 3:313-322 Introduction “How far back the historian wishes to place the origins and antecedentsof the GlassBead Game is, ultimately, a matterof his personal choice. For like everygreat idea it has no real beginning; rather, it has always been, at leastthe ideaof it.” HermannHesse- MagisterLudi (The GlassBeadGame) Imagine you’re a graduate student, and your thesis project is to fold up a large RNA molecule (let us say hypothetically, 16sand 23s ribosomal RNA[rRNA]) into its correct secondary and tertiary structure. And if you have some extra time, deduce something about the functionally important regions of these RNAmolecules. Unfortunately, your own laboratory experimentation is producing little useful information. You ask: Can these intrinsically complex molecules be folded up? How will I be able to saysomething profound about its function. And most important, will I be able to obtain an advanced degreebefore the year 2000? And then one day, as if in a Herman Hessenovel, you arethrilled to learn that the experiments havebeen done for you (CRWoese,personal communication). The task now is to find the notebooks containing these results, question how this information is stored, and wonder how to interpret it. To keep a long story short, the notebooks are found and are filled with homologous nucleic acidsequenceinformation, andwith the sequenc- ing and computer information revolutions upon us, you find a large number of RNAsequences,all in computer- readableform. The interpretation of this sequenceinformation is at first glance not readily apparent. Upon reflecting on your graduate course in evolutionary biology, which at the time was taken simply for the fun of it, not for the se- rious molecular biologist, you come up with two simple but powerful conclusions: (a) These RNAmolecules evolved to their present form, starting from a simpler lessconstrained state and being progressivelydefined and relined in structure and func- tion. Each structure existing today is thus highly tuned for its own structure/function and for its complex and dynamic interactions with the outside world. (It should be noted that this refinement in RNA structure is far more subtle than we can fully appreciate with today’s experimental methodology [1,2].> (b) Principles of RNAstructure, specific functional con- straints, and overall cellular mechanics mold the evolu- tion of these molecules. Although not necessarily im- plicit in its RNA structure, these principles, constraints and mechanics are encoded in the actual sequence of nucleotides. Decoding this collection of nucleic acid sequencesbe- gins with the realization that molecules performing sim- ilar functions (e.g. tRNAs) must have a similar three- dimensional structure. And knowing that different base- pair types(e.g.AU and GC) compose homologous struc- tural elements,e.g.a helix, it follows that similar higher- order structures can be derived from different primary structures. (The details and extent of these realizations and deductions remain to be elucidated and fully appre- ciated.) The scientific logic used here to deduce structural and functional relationships is tierent from conventional experimental analysis.Instead of designing and execut- ing experiments to test the hypothesis, here the exper- Abbreviations RNase-ribonuclease; rRNA-ribosomal RNA. @ Current Biology Ltd ISSN 095940X 313
  • 2. 314 Nucleic acids iments havealreadybeen done, and the sequencesare the answers!Now we areasking:what are the questions, what are the hypothesisbeing tested,and how were the experimentsdone?Thesequestionsunderlie the analysis of our homologous RNAsequencedatasets.Patternsof consemtion and variation are analyzed,our initial goal beiig to infer higher-order structure. Comparative sequence methods in practice ‘Since all of the sRNAs[soluble RNAs]in the processof transferingtheir respectiveamino acidsto protein on the surfaceof ribosomesprobably haveto meetsimilar,if not identical,spatialconstraints,it would seemreasonableto conclude that all sRNAswill be capableof adopting es- sentiallyidentical three-dimensionalstructures.” [31 “Figure 3 shows,however,that it is possible to construct very similar base-pairedstructures in spite of the limited similaritiesin sequences.”[4] Transfer RNA Shortly after the first few tRNA sequenceswere deter- mined, comparative sequence analysiswas applied to thesesequences,resulting in the well known cloverleaf secondarystructure [3-6]. For this analysis,secondary structure helicescontaining only canonical (e.g.AU and GC) and GU pairings in common with all sequencesin this datasetwere identiIied and scored positively. The cloverleafwasthe only structure to satisfythis condition for a molecule76 (or so) nucleotides in length. 5S ribosomal RNA Nearly a decade later: comparative sequence methods were again called upon to fold 5s rRNA, a molecule 120 nucleotides in length, into its secondary structure [7]. Criteria similar to those applied to tRNA were used here.Whereasexperimental methods resulted in several different secondarystructure models, thesecomparative methods,when applied to 10phylogeneticallydiversese- quences,resultedin a single analogoussecondarystruc- ture. 16s and 23s ribosomal RNA In I978 and 1980,the first complete sequencesfor ES- cbericbiucoli I6S and 23srRNAwere determined IS-lo], ad shortlythereafter,comparativestructurestudieswere initiated to fold theseRNAsof length 1542and 2904nu- cleondes,the largest RNA.sso far for which structures have been provided using these methods. Criteria sim- ilar to thoseused on tRNAand 5SrRNAresultedin asin- gle secondarystructure shared by all membersof their respectiverRNAdatasets(lb& 111-131;23s, [lP161). During this time, a helix was considered comparatively proven if two or more base pairs within the potential helix each contained at least one covanation. As the number and diversity of sequencesincreased, the I6S and 23s rRNAsecondarystructures modelswere refined, and variations in the different rRNAmodelswere largely . resolved.The most recentrefinementsof the 16sand 23s rRNAstructures are basedon a large and diversecollec- tion of sequences[17.,18*,19].At this time, the question of comparative proof can accesseach base pair in the model; the vast majority of all base pairs in the higher- order structure can now be considered proven! Although appreciatedby some,this methodologywasnot widely acceptedduring the early1980sandwasevencast negativelyby some:“I don’t know how you can suggest structure by just iooking at sequences”.With the success and attention given to the rRNAs,attitudeschanged.Many of us started(mis)pronouncing genus-speciesnameswe knew nothing about, except for some RNA sequence. Many of us became(un)certilied microbiologist, proto- zoologist and the like. Other RNAs During the 198Os,comparativesequenceanalysiswasap- plied to other functionally important RNAmolecules,re- sulting in secondarystructure models for each of these comparativesequencedatasets.This list includes:group I [20,21] and II [22] introns, ribonuclease (RNase)P RNA [23,24], U-PNAs(Ul, U2, U4, U5 and u6 [25]), 7SSRP RNA [261,and telomeraseRNA [271. Are these comparatively derived structures congruent with formal laboratory experimentation? The quick an- swer is ‘yes’,although the question cannot be addressed for all F@IAsnoted above,and the long detailed answer is beyond the scope of this review. Sufficeit to say,all of the comparatively inferred secondary structure base pairs were present in the yeast tRNAphecrystal struc- ture 128,291,revealing the authenticity of this approach. Chemical probing experiments of the entire I6S rRNA (301were largely consistent with the comparatively de- rived secondarystructure, suggestingsuch a methodol- ogy could also be applied to large RNAs,although it doesn’t by itself prove this model. Other experiments, discussedin part in the following sections, lend addi- tional support for the higher-order interactionsproposed with thesecomparativestudies. Searching for tertiary interactions Transfer RNAs With the strong implication that comparative methods can correctly deduce secondary structure, we can now askcan such a methodology alsodeduce tertiary interac- tions?The first attemptwasmadeon tRNA[31], resulting in, a few correct and a few incorrect tertiary interaction proposals (when comparedwith the crystalstructure so- lution 128,291).But with significantly larger and diverse tRNAdatasets,and relined correlation analysismethods,a larger proportion of the higher-order structure can now be correctly inferred [32,33,34*,35].
  • 3. Comparative studies of RNA Gutell 315 16s and 23s ribosomal RNAs A searchfor tertiaryinteractions in the 16s and 23srRNAs wasinitiated in the early 1980sand resulted in severalre- finementsin secondary-structurepairings and a few can- didatesfor tertiary pairings [36]. The best candidate in- volved positions 570and 866 (E. cofi numbering) in 16.5 rRNA 1371,forming a pseudoknot structure (see Fig. 1). As the l6.S and 23S-rRNAdatabasesgrew in number and diversity, the number and variety of tertiary interactions increased(see Figs 1 and 2 and next section; for recent work on 16sand 23siRNA,see [17*,18*,19,38]).Genetic and biochemical experimental analyseshave addressed and substantiateda number of these higher-order inter- actions (see also next section) [39*,40,41=,42,43,44]. Fig. 1. Secondary structure diagrams for E. co/i 165 rRNA. All nu- cleotides are replaced with small open circles. Higher-order inter- actions more complex than the secondary structure helices are denoted with a thick line or a large filled circle. Adapted from 117.1 (for details, see I17*, X3*1). Group I introns Covarianceanalysisof the group I intron databasehas been most impressive,producing a well establishedsec- ondary and tertiary structure model [21] and forming the basis for a detailed three-dimensional model upon which functional experimental analysis can be based. Severalproposed base-triples [45], pseudoknots, and non-canonical,pairings [46] have been substantiated us- ing site-directedmutagenesis. Ribonuclease P RNA The RNA component of the RNaseP ribonuclease has been extensively studied by comparative and experi- mental methodologies. An evaluation of the phyloge- netic commonality and diversity in RNaseP RNAslead to the development of a mini-P RNA,a minimally con- figured RNAwith normal enzymatic activity [47]. Several comparatively derived base pairings in the two pseudo- knot structures were testedand substantiated using site- directed mutagenesis[48=]. Emerging principles of RNA structure “If you’ve seenone helix, you’ve seen them all.” “If I see one more secondary structure model, I’ll scream!” (Comments overheard at a ribosome confer- ence) Fortunately, there is more to higher-order structure than just secondary-structure helices. Now we can ask what are these additional RNA structure principles and can they be inferred from comparative methods? Altema- tively we can ask what types of structural features are discernable using comparative methods?Before we ad- dress these questions, it is important to step back and evaluatethesemethods, albeit in a most brief fashion. Initial searchesfor tertiary interactions haveusedanewer covarianceanalysisalgorithm, Merent from those used for inferring RNAsecondarystructure. The most notable differences are: correlating positions are identilied re- gardlessof the pairing type,in contrast to previous meth- ods that specifically looked for canonical and GU pairs; and the current algorithm only looks for correlating pairs, independent of surrounding structure, which is in sharp contrast with older methods that only identified pairings within a potential secondary-structure helix. Analysis of the 16s and 23s rRNAdatasetsusing this newer algorithm, and without any knowledge of previous secondary-struc- ture proposals, identified the vast majority of all previ- ously suspectedsecondary-strucm-repairings (RRGutell, unpublished data)‘[34*]. Thus, searchingonly for corre- lating positions resulted in the two basic and underlying principles of RNAstructure: namely,AU, GCand GU pair- ings, and the antiparallel and contiguous arrangementof thesepairings! It is of interest to note that the majority of all pairings in the 16s and 23s rRNAdatasetsidentilied using our most recent analysisalgorithms (RRGutell, unpublished data) 1341 are canonical or GU, and that the tiajority of these are found in the conventional secondarystructure helix. Only a small percentageof all correlating pairs are non- canonical.Only a smallpercentageof all correlating pairs lie outside of the secondarystructure, and these usually form pseudoknot structures or heliceswith a single base pair. These exceptional 16s and 23s rRNA interactions are emphasizedin Figs 1 and 2, and discussedbelow.
  • 4. (a) 316 Nucleic acids JFig. 2. Secondary structure diagrams for E. co/i 235 rRNA: (a) 5’ half; fb) 3’ half. All nucleotides are replaced with small open circles. Higher-order interactions more complex than the secondary structure helices are denoted with a thick line or a large filled circle. Dashed lines represent tentative tertiary interactions. Adapted from 117*1(for details, see 1170, 18.1). Non-canonical pairs Of the 16 possible pairing types, the six canonical and GU pairings account for the vast majority of all com- paratively’derived base pairs. In the majority of phylo- geneticbase-pairreplacements,one of thesesix typesis replacedwith anotherof thesesix. Ten of the 16possible pairings occur infrequently; however, specific classesof pairing typesand their phylogenetic replacementsarebe- ginning to emerge,with the most salient ones described below (others havebeen identified [17*]). A:C+tC:A Severalexamples of this replacement type occur in the rRN& One occurs in 16s rRNAbetween positions 1357 and 1365, at the end of a helix [2,17*]. Within (eu)bacteria, chloroplasts and mitochondria, this pair interchangesbetween AG and GA; within Archue and Eucurya, the interchangeoccurs solely between canon- ical pairing types.A second good example is found in 23srRNAbetweenpositions 2112and 2169,which along with severalcanonical pairings forms a parallel structure (also seebelow) [170,381.Most interestingly,thesesame two nucleotides are associatedwith the E site in trans- lation [49], suggestingthat this unusual pairing and/or the parallel structure are functionally important aswell asstructurally unique. A:A+-L:C Correlations between thesetwo pairing typeshavebeen found in 16s rRNAbetween positions 722 and 733 [38], and in 5s rRNAbetweenpositions 76and 100(RRGutell, unpublished data).A similar setof correlated pairings is present in the HIV1 Revbinding region of an in vitro genetically selectedRNA 1501.In these three cases,this pairing is found in an internal loop, immediatelyadjacent to a helical structure. The Revprotein binding suggests that this non-canonicalpairing could be ageneralprotein recognition motif. u:u+K:c Severalexamples of these correlated replacementsare found in 16s and 23s rRNA The two found in 16s lie in different internal loops, immediately adjacent to the end of the helix. These same types of correlated pair- ings associatedomains 4 and 5 of 23s rRNA [38]. Ther- modynamic studies have addressedthese pairing types and found that UU and CC+ pairs can stabilize a du-
  • 5. Comparative studies of RNA Gutell 317 plex [51]. Interestingly, these two pairings Can form an isomot+ic structure when one of the cytosines are pro- tonated, whereas the unprotonated form of the CC pair is destabilizing and not isomorphic in structure. C:UctA:C A percentage of the GU helical base pairings are very con- served, and for a percentage of these, replacement yields an AC pair. The three best examples are found in 16s rRNA and all lie at the end of one helical element and in close proximity to the end of an associated helix [17.1. In the translational decoding site of 16s rRNA, there is a CA pairing at positions 1402 and 1500 in the overwhelming majority of all 16s (and 16S-like) rRNA sequences. A few phylogenetically distinct mitochondria change this pair to a UG base pair [18*]. A GU and an AC base pair can form isomorphic structures when the adenine (of the AC) is protonated. The thermostability of this A+C pair is close to that of an AU pair [52**] suggesting that AC pairs, with the adenine protonated, may well be paired in a specific structural context. Tetra loops The hairpin loop of four bases is a common feature in the rRNAs, occuring in over 50% and 40% of all 16s and 23s hairpin loops, respectively. Among the 256 different loop sequences of size four, there is a strong bias in the rRNAs for three major classes: UUCG, GNRA, and CUUG [531. Thermodynamic analysis revealed that these loops are surprisingly very stable [54,55], and structural analy- sis of these loops reveals an unusually compact structure [56,57]. Pseudoknots Pseudoknots are a popular and fashionable class of RNA structure, delined as a set of base pairings that cross an existing secondary-structure helix. Comparative methods have been used to elucidate many examples in 16s and 235 rRNAs [17.1, aswell as in many other RNA molecules [58-*]. Within the i-RN& these structures vary from one to three base pairs in length, and are situated immedl- ately adjacent to another helical structure on one or both ends of its helix, suggesting a possible coaxial stack. NMR studies of a simple pseudoknot structure have revealed coaxial stacking of the helices [59]. Site-directed mutagenesis has substantiated several of these pseudoknot interactions. In 16s rRNA, a helix of three base pairs formed between a side bulge at posi- tion 505 and the apex of the hairpin loop at position 525 is nested between other helices, forming a complex pseudoknotted structure (assuming each helix occurs si- multaneously) with multiple coaxial stacking possibilities. This same region is highly conserved in primary and sec- ondary structure and strongly implicated in translational function. A series of elegant site-directed mutagenesis ex- periments has addressed the structure of this proposed helix [39*] and revealed that this helix is not only struc- turalIy correct, but also is directly implicated in trans- lational function, streptomycin binding, and binding to ribosomal protein S12 (also see below). A complex pseudoknot structure situated with multiple coaxial stacking possibilities involves 235 rRNA positions 1343-1344 with 1403-1404 (E. coli numbering). This general region of domain III is the binding site for an essential early-assembling ribosomal protein. This helix has recently been experimentally altered in a Saccha- romyces cerevtie in vitro protein binding system, using site-directed mutagenesis. The results clearly show that canonical pairing in this short helix is required for proper protein binding [41l]_Other comparatively derived pseu- doknot structures have been proposed in RNaseP,group I introns, and telomerase RNA [GO]. Some of these struc- tural elements have been strongly implicated in catalytic function, and have been evaluated and substantiated by site-directed genetic analysis [46,48*]. It is interesting to note that the lengths of non-ribosomal RNA pseu- doknot helices are usually greater than those found in rRNk Other naturally occuring pseudoknot structures have been suggested by comparative and experimental criteria, and discussed in some detail [589-l. More re- cently, the analysis of a collection of sequences derived from an in vitro amplification and selection for binding to HIV1 reverse transcrlptase has ident.i&i a pseudoknot- motif binding site [61]. Coaxial stacking The concept of comparative evidence for a coaxial stack was initially proposed a decade ago [2] and states that two adjoining helices that vary in length might be coaxi- alIy stacked upon one another if their combined length remains constant. Based on simple spatial considera- tions, many secon% and pseudoknot helices in the 16s and 23s rPNA can potentially stack upon one an- other; however, comparative evidence is lacking for all but two that do satisfy this condition. The first exam- ple of coaxial stacking involves helices 500-504/541-545 and 5ll-515/536-540 in 16s rRNA, the second is at the base of the cr-sarcin loop in 23s rRNA, involving he- lices 26462652/2668-2674 and- 2675-2680/2727-2732 [170,18*]. For both proposed coaxial stackings, the lengths of the underlying helices remain the same in all (eu)bacteria, chloroplast and mitochondria, but are of different lengths in the Arcbaeand Eucarya phylogenetic domains. Both of these rRNA regions have been directly implicated in translational function [39**,49] and some of these functions are overlapping, suggesting that if these coaxial stacks do occur, they could be associated with ribosomal function and with each other in acoordinated manner. As noted earlier, the re@on of 16s rRNA between positions 500 and 545 is considered to be quite com- plex, with a pseudoknot structure and potential coaxial stacking. The coaxial stack proposed here on the basis of comparative evidence would only make this region more complex, and for alI of these suggested coaxial stackings to occur, conformational rearrangements would be re- quired (i.e. not all of them can occur simultaneously).
  • 6. 318 Nucleic acids This should be aninteresting set of ideasto testexperi- mentally. Parallel interactions comparative evidenceexists for two setSof parallel in- teractions in rENA.The more interesting of the two is found in the 23s rRNA,involving three pairings arranged in parallel: 2112-2169,2113-2170,and 2117-2172.The iirst pair covaries between an AG and a GA pairing, whereasthe latter two change from one canonical pair to another [17*]. This region is structured further with the interaction between position 2111 and the first and lastnucleotide of the ten-aloop at 2144-2147 [19]. This unusual structure is associatedwith the translational E- site [48]. Base triples Transfer ENA, a molecule 76 nucleotides in length, contains three base-triple interactions (28,291.These have been partially predicted by comparative methods [31-33,340,35].No convincing base triples have been uncovered(so far) within the RNAs,nor haveany been identified (yet) in RNaseP RNAand the LJ-RNAs.Com- parative analysisof the group I introns have revealed howeverseveralbasetriples 121,451,and thesehavebeen substantiatedby site-directedmutagenesisand modeled in three dimensions. More recently, these two group I intron basetriples havebeen characterizedand substan- tiatedby NMR[62]. Conclusions “The G:U basepair in the upper stem might be impor- tant. Is it possible that the activatingenzymescould ex- tract enough information from the G:U pair and other featuresof this double-stranded region for it to act asa recognition site?”[4] An important question to askat this juncture iswhat type of information can be obtained from comparative se- quenceanalysis?Severalexamplesof largestructuresand variousstructuralelementshavebeen inferred from such analysii, and many of these are consistent with and/or proven with experimental methods. Beyond these, are additional constraints present in RNAmolecules,and if SO, whit are they?Do they suggestnew typesof higher- order structural motifs, recognition sitesfor proteins or other RNAmoleculesor do theyrepresentsubtle thermo- dynamic and/or structural reiinements?Wii we be able to decode this information from sequenceinformation alone? Addressing such questions will require additional se- quences and diversity for each RNA dataset (e.g. 16s rRNA). In parallel, the computer tools used for com- parativesequenceanalysiswill need to be expanded and relined. Quantitation correlation analysisalgorithms are capableof uncovering subtle constraints [32,33,34*,35]. The most recent application of these methods is be- ginning to identify structural constraints beyond simple pairings (secondary and tertiary), and suggestsin some c&s that certain base-pair types (or simply bases) in- fluence the types of pairings (or bases) in close three- dimensional proximity (i.e. context eifect; see Figs 3 and 4) [34-j. At the moment, these quantitative cor- relation analysismethods do not incorporate the num- ber of phylogenetic eventsunderlying each coordinated basechange(i.e. the number of compensatorychanges that have occurred throughout the phylogenetic tree; the larger this number, the more significant the set of changes),although such eventshave been incorporated into a non-quantitative prologbased covarianceanalysis program [63]. Incorporating a ‘phylogenetic events’fac- tor into quantitative correlation algorithms could well improve these quantitative methods and help identify or strengthen the argument for new structural elements and/or new.structural principles. O-3’ 0 0 0 S’-0 - 0 o-o 0 - O-70 o-o o-o o-o o-o TO o”iir. ;f 0000 I I I I i7 ofi-;-q&g”~~~ 20 o--o Law o-o o-o 30-O- O-40 o-o 0 0 0 0 Oo O 0 00 0 I 0 0 / 0 00 11 -- Fig. 3. Secondary-structure diagram of tRNA (yeast Phe number- ing) highlighting those positions correlating best with position 13 (identified with filled triangle). Two large filled circles identify the two highest correlating positions (22 and 461, and smaller filled circles identify the next five highest correlating positions with position 13. Adapted from f34.1. What is called comparativesequenceanalysis(or phylo- genetic analysisby manywho are referring to structural analysisof thetypediscussedhere) goesbeyond the anal- ysis of a single RNAmolecule. It should be appreciated that such analysiscan and should encompassthe com- parison of different RNAmoleculesor subsetsof a given molecular database,when there is biological rationalefor doing so. For example, the complete tFWAsequence
  • 7. Comparative studies of RNA &tell 319 VARIABLE LOQP VARIABLE LOOP Fig. 4. Stereo pair of the three-dimensional structure for this tRNA, with the seven best correlating positions with position 13 identified as in Fig. 3. Adapted from I34e1. databasecan be subdivided into the 20 amino acid ac- ceptors and analyzed for subtle structural differences, which could be the recognition signals for the differ- ent amino acyl synthetases[64,65]. The first attempt at this wascompleted in 1966 [4], and is surprisingly good given the small sequencedatasetused. An example of intermolecular analysis includes a 16%23s rRNA com- parison. These two moIecules are befieved to interact during translation, and thus any significant intermolec- ular correlation could wetl be pointing at a structurally and functionally important site. Comparative structure analysis is not only generating higher-order structures that are widely accepted in var- ious RNAfields,it is alsoestablishing an agendafor vari- ous experimental designs.I havenoted how this method hasidentified manyof the sign&ant RNAstructure prin- ciples,including Watson-Crick and GU pairings, antipar- allel and contiguous arrangement of these pairings, te- tra loops, pseudoknots, severalclassesof non-canonical pairings (and their replacements),helix coaxial stacking, base-tripleinteractions,and setsof pairs that form paral- lel structures,The majority of thesestructures havenow beenevalulatedand substantiatedin one form or another usingexperimental methods.When astructure of interest is atafunctional site,that function can be experimentally evaluatedin the light of that structure. Thus a compara- tively derived higher-order structure can and should be considered a hypothesis,testedwith each new rRNAse- quence,evaluatedexperimentally, and subject to molec- ular modeling. Superimposinghomologous RNAstructures,for example 16s and 16S-likerRNA,allows us the opportunity to eval- uate more than its higher-order structure, presenting us with a glimpse of what structural features are con served throughout evolution or part thereof, which in turn suggestsstructural elements of possible functional significance.When the number and diversity of struc- tures is sufficient, as it is for 16s and 23s rRNA (there now exist over 2200 16s and 16S-likeand over 200 23s and 23Slike sequences), evolutionary events and path- ways can be mapped in great detail. Not only can the reconstruction of these events be played like a Disney movie flip book (i.e. a frame by frame snapshot anima- tion of (r)RNA evolving), but underlying constraints on structure and function can be deduced,from which prin- ciples and refinementsof RNAstructure and function can be inferred. Darwinian or natural evolution has generated a won- derfully diverse collection of RNA molecules for us to compare and contrast utilizing the comparative meth- ods discussedabove. Recently,the advent of new tech- niques in biochemistry has atlowed the molecular biol- ogist to practise a little evolutionary home brewing for themselves.Starting with a very large collection of ran- dom oligonucleotide sequences,one can subject these macromoleculesto multiple rounds of selection and am- plification, enricl-+g for those sequences that best sat- isfy the constraint conditions. Such methodology now puts some of Mother Nature’s authority into the hands of the research scientist. But, instead of designing ter- ribly complex and obtuse experiments, as it appears to us mere mortals, the scientist can now select and enrich for something far simpler, such as a small fig and or protein-binding site on an RNA molecule, or RNA molecules capable of defined catalytic functional- ity [66**,67**]. Such newer methodology lets the genie out of the bottle. ~Althoughthe sequencesare the an swers,we now have a better appreciation of how the experiments were done; for we ask the questions and know what the underlying hypothesis is. The sequences resulting from this work will now have a Yii associated with its Yang. The next few yearsshould be an exciting and rewarding time. (And people like me canstill saythat the experiments havebeen done for us.)
  • 8. 320 Nucleic acids So,in closing,we.cansit backand marvelabout the rapid technologicaladvanceshappening all around us.The se- quencing revolution makes it possible to fill volumes of notebooks with homologous sequenceinformation. Computersand their networks allow us to readily store, manipulate, access,and analyze these notebooks. The comparativesequence/structure analysisparadigm puts somemeaningand dimensions to this information. It is aparadigmthatis itselfstill being definedanddeveloped. Acknowledgements This work was supponed by the NIH (GM 48207). RRGutell is an Associatein the Program in Evolutionary Biology of the Canadian Institute of AdvancedResearch.I wish to thank the WM Keck Foun- dation for their generous suppop of RNA science on the Boulder campus, SUN Microsystemsfor their donation of computer equip- ment, and B@ Weiser,Tom Mackeand others Fordeveloping much OFthe computer code used to analyze and present RNA structural information. References and recommended reading Papersof part.ic;larinterest,publishedwithin the annualreviewperiod, havebeen highlightedas: . . . 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. of special interest of outstanding interest WOEsECR:Just So Stories and Rube Goldberg Machines: Speculations on the Origin of the Protein Synthetic Ma- chinery. In HBOSOMES: Struchrre, Function, and Geneticr Edited by ChamblissG ef al. Baltimore: University Park Press; 1980:357-373. WOnE CR, GUT&UR, GUPTAR, NOUERHF: Detailed Analy- sis of the Higher Order Structure of 16S-Like RIbosomal Ribonucleic Acids. Microbial Rev 1983,47621669. RAJBHANoARVUI+STUARTA, FAUI~XNERRD,CHANCSH,KHORANA HG: Nucleotide Sequence Studies on Yeast Phenylalanine sRNA Cokd Spring Harb S’p Quant Biol1966, 31~425-434. MADISONJT, EWEIT GA, KUNGHK: On the Nucleotide Se- quence of Yeast Tyrosine Transfer RNA Cold Spring Harb SVmp Quanr Biol 1966,31:409-416. HOW RW,&CARJ, EVERETTGA, MADISONJT, MARQUISEEM, M~aatnSH,PEN~VV~CKJR,ZAMIRA: Structure-of a R&nucleic Acid. Science 1965, 147:1462-1465. ZACHAUHG, DUI-~NGD, FELDMANNH, MEKHERSF, KARAU W: Serine Specific Transfer Ribonucleic Acids. XIV. Com- pa&on of NucIeotIde Sequences and Secondary Structure Models. cdd Spring Harb 5jmip Quant Bioi 1966, 3h417-424. FOXGE, WOESECR: 5s RNA Secondary Structure. Nature 1975,256:50>507. BROSIUSJ, PALMERMI, KENNEDYPJ,NOUERI-IF:Complete Nu- cleotide Sequence of a 16SRibosomaI RNA Gene from Es- cberlcbia colt Pm Nat1 Acad Sci USA 1978,75:4801-@05. BROSIUSJ, DULLT, NOLLERI-IF: Complete Nucleotide Se- quenceof a 23s Ribosomal RNA Gene from Escberfcbfa colL Pnx Nat1 Acad Sci USA 1980, iT201-204. BRG~IUSJ, DUU +IJ,SLEETERDD, NOLLERHF: Gene Organi- zation and Primary Structure of a Ribosomal RNA Operon from Bscbedcbia coli J Mol Biol 1981, i48:107-127. WOESE CR, MAGRUMLJ,GUPTAI( SIEGELRB, STAHLDA, KOPJ, CRAWFORDN, BROSIUSJ, Gur~u. R,HOGANJJ,NOLIXRI-IF:Sec- 12. 13. 14. 15. 16. 17. . ondary Structure Model for Bacterial 16S RIbosomaI RN& PhylogenetIc, Enzymatic and Chemical Evidence. Nucleic Acids Res 1980,8:2275-2293. STIECLERP, CARBONP, EBELJP, EHRESMANNC: A GeneraI Secondary Structure Model for Procaryotic and Eucaryotic RNAs of the Small Ribosomal Subunits. Eur J Bicxbem 1981, 120:487-495. ZWEB C, GLOTZ C, B~COMBE R: Secondary Struc- ture Comparisons between Small Subunit Ribosomal RNA Molecules from Six Different Species. Nucleic Acids Res 1981,93621-3640. NOUERHF, KOPJ. WHEATONV, BROSIUSJ, GIJTEURR,KOPYLOV AM, DOHMEF, HERRW, STAHLDA, GUPTAR,WOE~ECk Sec- ondary Structure Model for 23s RibosomaI RNA Ntrcleic Acids R~s 1981,9:6167-6189. GLOIZC,?%IEBC, BRIMACOMBER:Secondary Structure of the Large Subunit RIbosomaI RNA from Escberichfa coli, Zea mays ChIoroplast, and Human and Mouse Mitochondrial Ribosomes. Nucleic Acids Res 1981,9:3287-3306. BRAND C, KROL4 MACHAT~MA,POUYETJ, EUELJP,EDWA~UX K, KO~~ELH: primary and Secondary Structures of Es- cbericbia coli mre 600 23s RIbosomal RNA Comparison with Models of Secondary Structure for Maize Chloroplast 23s rRNA and for Large Portions of Mouse and Human 16s MitochondriaI rRNAs. Nucleic Aciak Res 1981,9:43034324. Gunu RR,LARSENN, WOESECR: Lessons from an Evolving RIbosomal RNA: 16s and 23s rRNA Structure from a Com- parative Perspective. In Ribosomal RNA Structure, Evolution, Gene Eapressim and Function in Protein Synthesis Edited by Ziimermann RA, Dahlberg AE. Boca Raton: CRC Press; 1993:in press. ia. GUTEURR:The Simplicity behind the Elucidation of Com- . plex Structure In RIbosomal RNA 7%e Translational Ap puratus Edited by Nierhaus KH ef al. New York: Plenum Publishing Corporation; 1993: in press. Another brief review of rRNAsttucture, lacedwith a few newer struc- tural themesin rRNAhigher-order constraints.Additional correlations for a few of the highly conservedand functionallysignificantregionsof theseRNAsare presented. 19. LARSENN: Higher Order Interactions in 23s rRNA Pm Nat1 Acad Sci USA 1992,89:5044-5048. 20. CECHm Conserved Sequences and Structures of Group I Introns: Building an Active Site for RNA Catalysis-a Re- view. Gene 1988,73: 259-271. 21. MICHELF, WOOF E: Modelling of the Three-Dimensional Architecture of Group I Catalytic lntrons Based on Com- parative Sequence Analysis. J MoI Biol 1990,216:585-610. 22. M~CHEI.F, UME~ONOK, OZEKIH: Comparative and Functional Anatomy of Group II Catalytic Introns-a Review. Gene 1989, 825-30. 23. JAMESBD, OLSENGJ, LIUJ, PACENR: The Secondary Struc- ture of Ribonuclease P RNA, the Catalytic Element of a RIbonucleoproteIn Enzyme. CeN1988, 52:19-26. 24. BROWNJW, HAASES,JAMESBD, HUNTDA, PACENR Phyloge- netic Analysis and Evolution of RNaseP RNA in Proteobac- teria. J Bade61 1991, 1733855-3863. 25. GUIHRE C. PATTEW~NB: SpIiceosomal snRNAs.Annu Rev Genet 1988, 22:387419, 26. Z~P~EBC: Structure and Function of Signal Recognition Par- ticle RNA Prog Nucleic Acid Res Mol Biol 1989,37207-234. 27. ROMERODP, BIACKBURNEH: A Conserved Secondary Struc- ture for Telomerase RNA Cell 1991,67:343-353. A review intermixed with original comparativeinformation on 16sand 23s rRNAhigher.order strucmreand rRNAstructural motifs.Written in 1991/1992,alreadybecoming a bit out of date,and the book is still not out yet (as of February 1993).
  • 9. Comparative studies of RNA Cutell 321 28. 29. 30. 31. 32. 33. 34. . KM SH:Three-Dimensional Structure of Transfer RNA Prog mal IINk an IntcaRNA Crosslinking Study. Nucleiic Acids Res Nucleic Acid Res Mol Biol 1976, 17:181-216. 1992, 20:15931597. QIJIGEYGJ, RICH A: Structural Domains of Transfer RNA Molecules. Science 1976, 19&796-806. MOAZEDD, STERNS, NOUER I-IF: Rapid Chemical Robing of Conformation in 16S Ribosomal RNA and 30s Ribo- somal Subunits Using Primer Extension. J Mol Biol 1986, 187399-416. 45. MICHEL F, ELLINGTON AD, COV~URE S, SZOSTAK PJV: Phyloge- netic and Genetic Evidence for Base-Triples in the CataIytic Domain of Group I Introns. Nafure 1990, 347~578-580. m M: Detailed Molecular Model for Transfer Ribonu- cleic Acid. Nature 1969, 224759-763. 46. COUTURES, EUINGTON AD, GERBER AS, CHERRY JM, DOUDNA JA, GREEN R, HANNA M, PACE U, RAJACOPAL J, SZOSTAK JW: Mu- tational Analysis of Conserved Nucleotides In a Self-SpIicing Group I Intron. J Mol Biol 1990, 215:345-358. OBEN GJ: Comparative Analysis of Nucleotide Sequence Data [PhD Thesis]. University of Colorado Health Sciences Center; 1984. 47. WAUGH DS, GREEN CJ, PACE M: The Design and Catalytic Properties of a Simplified Ribonuclease P RNA Science 1989, 2441569-1571. HASEw T, CHAPPEIEAR JE, Fox GE: Fidelity of Secondary and Tertiary Interactions in tRNA Nucleic Acids Res 1988, 16:5673-5684. 48. HAAS Es, MORSE DP, BROWN JW, SCHMIDT FJ, PACE NR: Long- . Range Structure In Ribonuclease P RNA Scieuce 1991, 254:853-856. &TELL RR, POWER A, HER-IZ GZ, PUIZ EJ, STORMO GD: Identifying Constraints on the Higher-Order Structure of RNA: Continued Development and Application of Compar- ative Sequence Analysis Methods. Nucleic Acids Res 1992, 205785-5795. Another in the ‘although you told me, I want fo veri@ it for myself series.This time, the pseudoknots in RNaseP RNAare evaluatedusing site-directedmutagenesis. 49. NOUER HF, MOAZED D, STERN S, POWZRS T, ALIXN PN, ROBERTSON JM, WEARER B, TRIMAN K: Structure of rRNA and its Functional Interactions in Translation. In The Ri- bawme: Structure, Function G Evolution Edited by Hill WE ef al. WashinggtonDC: American Society for Microbiology; 1990~7392. Rehemems in quantitative correlation an&is methods and signif- icantly larger RNA databases are revealing newer and subtle RNA structure! constraints. This methods paper, with a sampling of results, suggeststhat hirther analysiswith these methods will reveal more in- teresting structwal constraints. 35. CHIU DKY, KOLODZ~EJCZAK T: Inferring Consensus Structure from Nucleic Acid Sequences. CompuI Appr Biosci 1991, 7~347-352. 36. GLITELL RR, WEISER B, WOESE CR, NOUER HF: Comparative Anatomy of 16%Lie RIbosomaI RNA. Prog Nucleic Acid Res Mol Biol 1985,32:155-216. 37. GU~EURR,NOUER HF, WOE~E CR: Higher Order Structure in RibosomaI RNA EMBO J 1986, 5:1111-1113. 38. GUTEURR,WOESE CR Higher-Order Structural Elements in RIbosomaIRN& Pseudoknots and the Use of Noncanonical Palm Proc Nat1 Acad Sci USA 1990,87:663-667. 39. POWERST, NOUER HF: A Functional Pseudoknot In 16S Ri- bosomal RNA EMBO J 1991, 10:22032214. yrnos, revealing manuscript. Not only is the comparatively inferred pseudoknot helix substantiated,this structure is shown to be associ- atedwith translationalfunction. 40. CUNNINGHAM PR, NURSE K, BAKIN A, WEIIZMANN CJ, PFWMM M, OFENGAND J: Interaction between the fwo Conserved Single-StrandedRegions at the Decoding Site of SmaIISub- unit Ribosomal RNA Is Essential for RIbosome Function. Bicxbemirtry 1992,31:12012-12022. 41. KOOI EA, RLITGER~ CA, MULDER A, RIET JV, VENEMA J, RAUE HA: . The PhylogeneticalIy Conserved Doublet Tertiary Interac- . tion In Domain III of the Large Subunit rRNA Is Crucial for RibosomaI Protein Binding. Proc Nat1 Acud Sci USA 1993, 90~213216. Another in a growing seriesof experimental works that substantiarea comparatively derived pseudoknot StwNw in this case, this StIUCNre is shown to be involved in ribosomal protein binding. 42. BRIMACOMBE R, GREUER B, MITCHEU P, Ossww M, RINKE-APPEL J, SCHULER D, STADEK: Three-Dimensional Structure and Function of &zbericbia CON16Sand 23s rRNA as Studied by Cross-Liig Techniques. In 7% Rhome: Structure, Function 6 Evolution Edited by Hill WE et al. Washington DC: American Society for Microbiology; 1990:93-106. 43. Rcrurl PC, LU M, DRAPER DE: Recognition of the Highly Con- served GTPase Center of 23s RIbosomaI RNA by Riboso- mal Protein Lll and the Antibiotic Thiostrepton. / Mol Eiol 1991,221:1257-1268. 44. DOIUNGT, GREUERB, BRIMACOMBE R: The Topography of the 3’ TermInaI Region of Eschetfchfa calf 16s Riboso- 50. 51. 52. . . BARTEL DP, ZAPP ML, GREEN MR, SZOSTAK Jw: HIV1 Rev Regu- Iadon Involves Recognition of Non-Watson-Crick BasePairs in Vii RNA. Cell 1991, 67529-536. SANTA-LUCIA J JR, KIERZEK R, TURNER DH: Stabiities of Con- secutive A-C, C-C, G-G, U-C, and U-U Mismatches in RNA Internal Loops: Evidence for Stable Hydrogen-Bonded U-U and C-C+ Pairs. Biocbemishy 1991, 30: 8242-8251. CHASTAIN M, T~QcO 1JR: svuctural Elements in RNA Pmg Nucleic Acid Res Mol Biol 1991, 41:131-177._-... - The most current and comprehensive rewewot RNAStruCNral motits, with a biasfrom the physicalchemistryperspective.An abbreviated Sys terns Adminhrafion Guide to RNAstructure. 53. 54. 55. 56. 57. 58. . . WOESE CR, WIM(ER S, Gur~u. RR: ArcbitecNre of Ribosomal RN& Constraints on the Sequence of Tetraloops. PIW.ZNull Acad Sci USA 1990, 878467-8471. TLIERK C. GAUSS P. THERMES C, GROEBE DR, GA- M, GUILD N, STO&O G, D’A~BE~oN-CARAFA Y, UHLENBECK OC, TINOCO 1. BRODY EN. GOLD L: CUUCGG Hairpins: Extraordinarily &able RNA ‘Secondary Structures Associated with Va& ous Biochemical Processes. Proc Nat1 Acad Sci USA 1988, 85:1364-13&X ANTAOVP, IA SY,TINOCO1JR: A Thermodynamic Study of Unusually Stable RNA and DNA Hairpins. Nucleic Aciak Res 1991, 19:5901-5905. VARANIG, CHE~NG C, TINOCOI JR:Structure of an Unusually Stable RNA Hairpin. Biocbemwy 1991, 30:3280-3289. HEUS H, PARDIA: Structural Features that Give Rise to the Unusual Stability of RNA HaIrpIns Containing GNRA Loops. Science 1991, 253~191-194. TEN DAM E, PLED K, DRAPER D: Structural and Func- tional Aspects of RNA Pseudoknots. Biock+hy 1992, 31:11665-11676. Evemng (almost) you need to know about pseudoknots (but forgot to ask). Current in 1333,we should expect the 1995edition to be much larger. 59. PUGuSJD, ‘WYATI JR,TINOCOI JR: Conformation of an RNA Pseudoknot. J Mol Biol 1990, 214:437-453. 60. TEN DAM E, BE~XUM AV, PUIJ CW& A Conserved Pseudoknot In Telomerase RNA Nucleic Acids Res 1991, 19~6951. 61. TUERK C, M~cDouGti S, Gow L RNA Pseudoknots that Inhibit Human Immunodeficiency Viis Type 1 Reverse Transcriptase. Proc Nat1 Acud Sci USA 1992, 896988-6992.
  • 10. 322 Nucleic acids 62. 63. 64. 65. 66. . . CHA~~A~HM, Tuwco I JR:A Base-Triple Structurai Domain .in RNA. Biochemishy 1992,31:12733-12741. See[67**]. WINKERS, OVERBEEKR, WOESECR OISENGJ, PFLUGERN: Structure Detection through Automated Covariance Search. Coinput Appl Biosci 1990, 6365-371. McCw WH, Foss K: Changing the Identity of a tRNA by Introducing a GU Wobble Pair near the 3’ Acceptor End. Science19BB,240:7937%. 67. JOYCE GF: Directed Molecular Evolution. Sci Am 1992, 26790-97. zese papers [66°0,6700]setthestagefor what’sto come.Thesenewer methods in biochemistry are changing the pace (no pun intended, Norm) at which sequencevariation is analyzedto infer structure and function. XHULhLUrlLH: Recognition of tRNAs by AminoacyLtRNA Synthetaws. Prog Nucleic Acid Res Mol Biol 1991,41~23-87. SZOSTAKJW In vihu Genetics. Trends Biocbem Sci 1992, RRGuteU,MCBBiology CampusBox 347,Universityof Colorado,Boul- 1789-93. der, Colorado 80309-0347,USA