SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
Engaging a Scientific Community in Contributing
to a Biological Database
Paul Gardner
June 21, 2013
Paul Gardner Engaging Scientists
What is RNA?
RNA is a fundamental biological molecule, essential for untold
biological processes
My aim is to build an analog to the Periodic Table for
classifying RNA families and motifs, enabling researchers to
predict function.
New technologies are accelerating the rate of RNA discovery.
base basepair
R
A
U
A
G
A
U Y
A
C
A
U
U
5´
Y
G
A
A
R
5´
C
U
U C
G
G
5´
R
U
R R R
Y
5´
R
R
G
C
G
U
R A
R
A
G
C
Y
5´
R
Y
G
G
A
G
Y
R RR
R
C RR
G
A
R
R
5´
C
G
A
A
G
Y
Y
R
Y
Y RR
G
G
G
R
U
G
G
A
G
5´
C
C
R
A
Y
C
C
C
R
U C
C
G
A
A
C
U
Y
G
G
5´
A N Y A G N R A U N C G T loop U t ur n k t r n1 k t r n2 tw ist
R
C
Y
R
G
G
A
AC
U
G
A
RC
R
U
Y
AG
U
A
C
G
GG
A R R A5´
Y
Y
Y
A
GU
A
G Y R
A
G
G
A
A
R
R
R
5´ R
Y
G
R
Y
A
A
Y
C
RY
A Y
Y
A
G
R
GA
A
Y
C
5´
R
C
A
GG A
G
Y
5´
A
C
A
C U
G
R
Y
R
Y G Y R R R R
R
Y
C
A
R
U
Y
5´
R
A
G
C
R
C
G
R
A
G
Y AY
G
Y
Y
R
G
U
U
Y
5´
A
A
A
A
A
G
C
Y
R
Y
Y R
R
Y
G
G
Y
U
U
U
U
UU
Y U Y5´
R
R
A
R
R Y
Y
U
U
UU
U U Y5´
sar r ic1 sar r ic2 U A A G A N C sr C loop dom V t er m 1 t er m 2
R Y Y Y Y
G
C
G
A
G
C
A
G
A
C
G
C
A
R
A
A
C
R
C
C
C
R
R
Y
R
R
Y
G
G
G
Y
G
U
U
Y
U
G
C
G
U
C
U
G
C
U
C
G
C
R R R R5´
Y
U
Y
UC
U
C
A
A
C
AG
UG
Y
U
U
G
R
R
R
A
A
Y
5´ Y
Y
Y
Y
Y
A
U
GA
Y G
R
Y
Y
Y
YA
A
A Y
Y
Y
YY
R
R
G
R
R
Y
C U GAU
Y
Y
Y
R
R
R
5´
G
G
G
U
C
U
C
U
C
U
G
Y
U
A
G
A
C
C
A
G
AU
CU
G
A
G
C
C
UG
GG
A
G
C
U
C
U
C
U
G
G
C
U
A
R
C
U
A
G
G
G
A
A
C
C
CA
C5´ UG
U
A
A
A
C
A
U
C
CU
Y G
A
C
U
G
G
A
A
G
C
UG
U
R
R
R
Y R Y
R
R
RR
G
C
U
U
U
C
A
G
U
C
G
G
A
U
G
U
U
U
G
C
5´ U CU
U
U
G
G
U
U
A
U
C
U
A
G
C
U
G
UA
U
G
AG
U
G
Y
Y R
C
RU
C
A
UA
A
A
G
C
U
A
G
A
U
A
C
C
G
A
AR
U5´ C
Y
Y
R
UC
C
C
U
G
A
G
A
C
C
C
U
A
A
C
Y
U
G
U
G
AG
Y
U
Y
YY
A
G
Y
UU
C
A
C
A
R
G
U
R
G
G
Y
U
C
U
Y
G
G
G
R
CY
R
G
G
5´
G
C
U
A
A
A
A
G
G
A
A
C
G
A
U
C
G
U
U
G
U
G
A
U
A
U
GC
G
U
U
RRU
U
YC
G
U
U
AC
A
U
A
U
C
A
C
A
G
U
G
A
U
U
U
U
C
C
U
U
U
A
U
A
R
CG
C5´ C Y GY
G
Y
Y
C
A
U
C
U
U
A
C
Y
G
RG
C
A
G
U
G
U
U
G
GA
U
G
Y
YY R
R
G
Y
C
UC
U
A
A
Y
A
C
U
G
YC
U
G
G
U
A
A
Y
G
A
U
G
R
C
RY
C G G5´ Y Y Y Y R R GY
A
C
A
U
R
C
U
U
C
U
U
U
A
U
A
U
C
C
C
A
U
AY
R
A
Y
R
R
R
CU
A
U
G
G
A
A
U
G
U
A
A
A
G
A
A
G
U
A
U
G
U
AY
Y Y G G Y5´ Y R R YY
C
R
U
C
A
A
A
R
U
G
G
Y
U
G
U
G
A
R U
G
U
Y
R
U
CA
U
A
U
C
A
C
A
G
C
C
A
C
U
U
U
G
A
U
G
AG
Y U Y R R5´ Y A A RA
A
G
G
G
A
A
Y
R
G
U
U
G
C
U
G
U
G
A
U
R
U
A
Y
Y
Y
A Y
Y
Y
Y
U
YU
A
U
A
U
C
A
C
A
G
U
G
G
C
U
G
U
U
C
U
U
U
UU
G G U Y5´ Y
C
R
G
G
U
G
A
G
G
U
A
G
U
A
G
G
U
U
G
U
A
U
A
G
U
U
RR
R
R
Y
Y
Y
Y YG
G
A
GY
A
A
C
U
R
U
A
C
A
A
Y
C
U
R
C
U
A
C
U
U
Y
C
C
U
G
R
5´
G
G
C
U
G
G
U
C
C
G
A
R
RG
U
A
G
U
G
G
G
U
U
A
Y
R
U
Y
A
AY
Y
Y
Y
U
U
R
Y
Y Y YU
C
Y
C
C
CYC
Y
C
A
C
U RC
UR
YA
C
U
U
G
A
C
U
R
G
C
CU
U U5´ Y
Y
Y
C
U
G
Y
R
R
U
G
U
C
G
UA
R
Y
Y
Y
Y
Y
U
G
A
R
C
CRAY
Y
Y
Y
Y
Y
G
G
G
R
G
Y
Y
Y
Y
Y
R
G
G
YA
G C
C
C
YY
G
G
GA
A
R
C
A
A
R
Y
R
R
R
R
Y
R
C
C
C A CCU
R
R
R
Y
R
YRG
G
U
U
C
A
R
R
R
R
Y
A
C
G
G
C
A
Y
Y
R
Y
G
G
R
Y
YY
Y5´
Y
Y
R
C
G
R
C
C
A
UA
C
R
R
R
G
R
A
R
C
A
CC
Y
G
R
U
C
C
CA
U C
C
G
A A
CY
C
R
GA
A
G
U UA
A
GC
Y
Y
Y Y
GG
C Y
R R
G
U
A C U
R
G R YG
RG
R
AYC
CUG
GG
AA
RY
RGGU
G
Y
Y
G
Y
R
RY
5´
G
RU
A
GYYY
AR
Y
G
G
Y AR
R R C
RY
Y
R
G
Y
U
Y A
A
Y
Y
R
R
RR
Y
R
RG
G UU
C
R
AR
U
C
C
Y
YY
YR
5´
R
R
AAR
Y
U
C
R
Y
R
R
R
R
GYYAC
R
R
YG
A
G
U
R
Y
Y R
YRCUC
Y
CYYYY
G G G A A GGU
C U G A G
A
R
G
C
CAY
Y
R
C
C
CU
G
GGGYR
Y
Y
Y
Y
Y
Y
GR
R
R
R
G
R
R
R
R Y G R G Y Y
A
C
C
AG
A A A Y
R
R Y Y
Y
Y
R
RGY
U
U
GGAA
RRCUYRY
GGCY
RG Y R R Y U
A
G
U
C
A
A
U
R
Y
GRR
Y
R
R
Y
Y
Y
R
AAC
Y
C
R
A
UUCAG
A
C
U
A
UCU
Y
Y
5´
T R I T I R E SE C I S m ir -T A R m ir -30 m ir -9 lin-4 m ir -5 m ir -8 m ir -1 m ir -2 m ir -6 let -7 Y R N A 6S 5S t R N A R N aseP
AURRGRYA
G
G
YA
U
U
G
AA
CUGU
AU
U
G
U
G
CR
C
C
UU
GCAUARAGCUAAAGCACUAAAAAGGAGUAA5´
A
G
U
C
A
U
G
A
U
YG
C
U
A
U
U
C
Y
Y Y
A
A
A
U
A
G
UG
A
U
U
G
U
G
A
U
AG
C
G
A
U
G
C
G
G
Y
G
U
G
U
UG C
G
C
A
C
R
Y
C
G
Y
A
Y
C
G
CG
C U5´
AGAGGAARCR
G
G
G
G
C
CAY
G
C
A
GAAGC
G
U
UC
ACG
U
C
G
C
G
G
C
C
C
CU
GUC
A
G
A
U
U
C
RGU
R
A
A
U
C
U
GC
GAAUUCUGCU5´
G A U AC
A
U
A
G
G
A
A
C
C
U
C
C
U
C A
A
A
G
G
A
U
U
C
U
A
U
GG
A C AG
U
C
G
A
U
G
C
A
G
G
G
A
G
G
G A CR
R
C
U
C
C
C
U
G
C
A
U
C
G
G
CG
A U U U U5´ A
C
G
R
RG
U
R RA
R
UG
C
G
A U A A Y A YA
A
U
A
A
U
GAAA
U
U
C
C
U
CU
U U G A C
G
G
C
C
A
A
U
A
GC
GA
U
A
U
U
G
G
C
CA
U
U
U
U
U
U
U
5´ R
Y
C
U
U
U
A
G
C
G
GG
Y
U
R
RR
U
Y A R U CURG
Y
Y
G
G
Y
G
U
U
U
C
G
C
C
G
R
C
Y YU
R
C
Y
Y
U
G
A
Y
R
Y
5´
RYYRYYCC
G
U
G
G
UG
A
U
U
U
G
RYC
GGCCGG
C
U
U
G
C
AG
C
C
A
C
GU
UAAAYAAUCGCUAAARAGGCCGRGGRRR5´
G
UCGRR
U
Y Y C A
C
UG
A U G AG U C Y
U R
ARGAC
G
A
AA
C
5´ Y Y R
A
U
Y
U
AAA
RA
A
A
C
A G CU
U UC
A AG
U G CCU U U Y U GC
A G
U
U
YYY
CARGAGCGC
A
A
G
A
U
RG
R U A5´
R
Y
G
GY
Y G
Y
U
U
G
C
C
A
U
A
C
G
C
C
C
YY
Y YY
C
G
G
C A
GG
U
A
U
G
G
A
A
R
C
A
C
C
C
YC
G Y A CG
A
C
U
G
GY
Y
C G
G
A
C
A
CY
GY
C
G
U
C
CC
G
C
C
A
G
A
U
C
5´ CA
C
A
U
C
A
G
A
U U U
C
C
U
G
G
U
G
UA
A CG
A
A
U
U
U
U
C
A
A
G
U
G
C
U U C
U
U
G
C
A
U
A
A
G
C
A
A
G
U
U
URA
U
C
C
C
G
C
Y
C
CY
YC
G
R
G
Y
C
G
G
G
A
UU
U5´ A U GG
A
G
A
C
A
UGGCR
U
AA
AG
C C AG
A R
A
G U R A
G
A
AC
R U A A C
Y
U
A
G
A
C
U
R
U
ACUUGAA
C
U
G
A
U
UYRC
A
U
C
U
CA
U U U U5´
G
C
R
C
Y
G
C
AA
AA
U
C
R
G
R
Y
G
C
C G G G A
U UG
G
YA
YCCCG
R
A
Y
R
R
R
R
Y
R
A R C G C
Y
GCGYU
U
U
U
U
U
5´
Y U R C G U G A C G A A G CG
C
G
C
G
CA
A
A
G
UGG
A C
AA
U
A
A
AG
C
C
UR
A G C
RU
Y
R
A
G
UAG
U
C
G
Y
CAG
A
C
G
C
C
G
G
U
U A A
G
C
C
G
G
C
G
U
UU
U U U5´ YR
Y
A
C
G
UR
Y
C
Y
G
U
U
R
UR
G
Y
C
C
G
G
U
U
G
C
U
U
UG
GU
C
G
G
U
G
A
C
C
G
G
R
R R R
R
A
G
C
C
C
R
C
UU G
G
U
G
G
G
Y
U
UU
U U5´
G
G
Y
C
R
G
C
Y
C
R
CC
C C CC
R
G
R
G
C
Y
G
R
C
C
G A C G G C C C C C G C
U CC
C
C
CCY
GGCGGGGGYCGUC
C
C
Y
Y
5´
U U G G C G A U R UU
U
U
U
G
GU
U G
G
A
A
U
G
UAGUGY
YY
UU
A
R C A C U AA
A CG
C U G
CC
A C AA
A
U
A
A
CCUG
U
CAGU
U
A
U
U
U
C
A
Y
C
A
A
A
AA
U A A A5´
RYYRYUG
C
C
C
UCY
G
G
G
CG
UUUCCUCCCUAGACUU
G
G
C
Y
Y
YY
R
R
G
G
C
CU
UUUUUUUYYY5´
SA M V sym R C P E B 3 F inP sr oB m sr SA M a H H 3 V m nt n3 livK D sr A C A E SA R isr K sr oD isr B 6C r spL suhB
UY
G
C
A
UCCGCYAA
Y
CGGUYA
G C C GU G UC
G C GG A
A G
G
U
U
Y Y
Y
A
A
C
CA
G C UR
Y
Y U Y Y G RA
ACRRAG
RRA
GGUG
A
G
C
G
5´
UG
A
A
A
GAC
G
C
G
C
A
U
U
U
GU
U A U C A U CA
UC
C CU
G
U Y
C
A
G
AG
A
U
GY
A
A
U
U
U
GG
CC
AC
AG
Y
RY
G
U
G
G
C
C
U
U
U
U
C
5´
* U
U
C
U
A
C
U
G
A
C
U
C
UU
U
U
A
AA
A
U
A
AU
U
A
U
U
C
A
U
U
G
G
AG
G U UU
A
A
UA
U
G
A
A
U
A
UA
A A G G A U G A G CA
U A
U
A
G
A
AG
C
GUUUG
C
UCYUU
GU
U
A
G
AU
C
R
G
U
U
A
G
U
A
G
G
AA
5´
G A U U UG
G
U
R
R
C
U
G
C
G
C
U
C
UU
C UA
A
G
C
C
A
G
U
U
A
C
C
CG
G
U
U
C
A
A
A
R
A
U
U
G C C
A
G
C
U
U
Y
G
A
A
C
CU
UC
G
A
A
A
A
A
C
C
A
C
C
U
Y CR
R
G
G
U
G
G
U
U
U
U
U
U
C
GU
5´
R R R R R R R R
C
U
C
R
U
AU
A
A
YYYCRRR
AA
U
A
UG GY
Y Y G R R A
GU
U U C UAC
C R R G Y R
C CG
U
AAA
YRYYYG
A
CU
A
Y
G
A
G
RR
R5´
C
G
G
C
A
U
C
C
C
C
A
U
U
A C C
U
A
U
G
G AC
A
CG
G
U
G
C
C
G
C A R G C U C U G G R A
G UU
C
GUYCCRGAGYYUG
Y
Y
G
G
A
A
R
G
G
U
U
U
U
C
C
G
U
G
U
C
C
A
G
5´
R
R
Y
G
G
A
R
G
CRR
U
GA
R
Y
R
Y
Y
Y
YU
Y
A
U
YU
G G GCA
C
Y
U
G
R
R
R
Y
R
YG
G
A
G
C
YAG
U R GU
G
C
A
ACCG
R
C
C
R
Y
R
R
R
5´
G
U
U
G
U
A
A
C
U
AU
G
U
U
G
C
A
R
YA
R A C G AG
A
A
C
C
G
AG
U
A
U
A
G
U
U
C
A
U
GG
G
R
U Y A
CA
UG
AA
UU G U UU
A
A
CU
RU
CC
U
C
U
GG
A
U U
C
CC
G
U
C
C
AU
G
R
C
A
GU
C
G
G
U
U
C
5´
CUUA
C
U
G
A
GA
G
C
A
C
AA
A
GU
UUC
C
C
G
U
GC
CA
A
C
A
G
G
G
A
G
U
G
U
UAU
A
AC
G
G
UU
UAUU
A
G
U
C
U
G
G
AG
ACG
G
C
A
G
A
C
U
AU
CCUCUUC
C
C
G
G
U
C
CC
CUA
U
G
C
C
G
G
GU
UUUUUUUAUGUC5´
UURGRYUYRCCUG
A
A
U
G
U
G
A
CU
A
U
C
A
C
U
U
CA
AACRRYGRGYAACCUCAGUAUCAUCRYRGAGYUA
A
A
C
C
C
U
C
G
C
C
G
C
CUG
A
C
G
G
Y
G
A
G
G
G
U
U
UU
CUUUUGGR5´
U G U A A A A A A C A U Y A U U UA
G
C
GUGAYU
U
U
C
U
A
U
C
A
ACAG
C U A A C
A
A
U
U
G
U
UA
U
U
A
C
UG
C
CUA
A
Y
G
Y
U
C
A
UA
A G G G U A AUU
U
U
A
A
A
A
A
AGG
G CG
A
U
A
A
AA
A
A
C
G
A
U
U
G G GGGA
U
G
A
G
A
Y
A
U
G
AAC
G
C
UC
A A G C A5´
C C C A G A G G U A U U G A UU
G
G
U
G
A
U
R
R
C
A
Y
Y
U C U
R
U
G
Y
U
Y
A
U
UY
A
U
UR
C
A
C
C
A
A C C U G C G C RG
A
UGCGCAGGU
U
U
U
U
U
U
U
5´
AR
R
R
Y
Y
YYYAAURYCAACYUUUAGCGCACG
G
C
U
C
U
YY
A
A
G
A
G
C
CA
UUYCCCUA
G
R
C
C
A
A
A
C
A
G
GAAU
Y
G
U
U
U
G
G
Y
C
UU
UUUUU5´
G
G
G
C
A
R
G
A
U
A
U
G
U
G
A
A
GU
R
GC
Y
A
C
C
GC
AA
GC
YGR
U
A
CY
CUU
CAC
Y
Y Y C C
U
U
A U UC
G C
U
Y
GC
U
CAAC
GGR
A
U
C
Y
U
G
C
U
CU
G C G A G G C Y5´
GUGCRRYCYRAUUYYR
G
Y
Y
G
Y
G
C
C
Y
R
Y
R
A
R
AAC
AUCAYAA
R
A
U
A
CG
G
C
R
C
R
R
CC
ACRAUUUCCCUG
G
U
G
U
U
GG
C
G
C
A
GU
AUU
C
G
C
G
C
A
C
CC
CGGUCUACC5´
Y
U
U
Y
R
Y
U
R
R
U
U
U
Y
A
U
C
A
R
A
YC
U GU
U
U
G
A
U
R
R
A
A
G
Y
U
A
R
Y
G
A
R
R Y Y C A Y UA
A
C
R
G
C
U
Y
U
Y
GC
Y G
G
C
Y Y G
A
C
C
C
G
A
G
R
Y
Y
G
U
UU
U U U U5´
RACGUUCAY
C
C
Y
YY
R
G
G
RC
GCAYRA
Y
C
A
R
R
Y
C
A
Y
GG
AAC
G
G
G
G
R
Y
Y
U
G
R
R
5´
sucA Sr aD sxy R N A I P ur ine SA M -C hl cdiG M P 2 A nt i-Q G adY r nk ldr P r fA O m r A -B R yeB t r aJ 2 Sr aH 23Sm et h D S-p ep
U U C G G C C Y CG
C
R
R
C
G
YU
U YU
Y
C
G
Y
Y
G
CC
C U C U G C A YG
C
C
G
U
C
G
C
C
G
A
CGCAY
U
C
C
Y
A
U
U
CG
A
A Y Y G U
G
C
G
A
U
C
C
U
G
U
C
G
C CY
U
C
C
U
GC
G
G
C
G
C
G
G
C
5´ CG
Y
R
G
C
G
C
U
U
G
U
UA
U U
U
R
Y
Y
G C U
G
U
G
U
A
G U GUC
G
U
C
Y
YR
A R Y Y R G R R Y Y Y
A
A
A
C
C
C
C
G
C
C
Y
UU
Y
G
G
C
G
G
G
G
U
U
U
UG
C U U U U U5´
** C
U
U
A
C
C
G
G
A
G
GY
R
U
A
UGGAC
C
C
UG
A UC
C C AC
Y C C U
C
U
C
C
C
C G
A
UG
G
A
G
AA
U
Y
Y
YU
U
U
C
C
G
G
U
A
A
GC
C Y G Y C U Y Y
R
C
U
G
Y
Y
U
U
A
C
C
G
G UG
Y
G
U
A
A
G
G
C
A
G
UG
A C G U Y U5´
G
G
R
A
G
R
Y
R
Y
CU
G
GU G R
Y
C
G
G
C
U
UC
A AA
CC
GR
Y G
RR
G
Y
R
Y
Y
Y
Y
G
G
Y
RGG
U
U
C
G
AY
U
C
C
Y
RY
Y
C
U
Y
C
C
5´ U
G
A
C
C
C
U
U
U
A R
C
C
R
A
G
G
G
U
C
AC
C U A G C C A A C U G A C GU
U
G
U
U
AG
U
G
A
A
Y
YY
A
U
G
U
U
C
A
C A
RA
U
A
R
GC
C
A
A
U
C
G
C
U
U
U
G
C
G
R
U
U
G
GC
U U U U U U U U U5´ C U U A A UR
A
A
CAA
G
A
A
A
A
C
YAA
R C G
U
A
C
Y
U
U
C
C
Y C
C
U
G
AG
UU
C
A
G
G
C
U
G
G
A
A
UG
C
G
C A
CAG
C U RA
U U G U U G A U AA
G G G CU
ACUC
AUACCGACAA
GC
CAGU
G
A
A
G
C
G
AUG
A
AU
G
U
C
GG
U
U
CC
A C5´
R
U
Y
Y
RC
U
G
A
Y
GA
G
U
C
C
C
A
A AU
A
G
G
A
CGA
A
A C G C
GCGU
CY
G
R
A
U
5´ CU
C
C
A
U
GU
A
U
C
U
UU
G
G
G
A
C
C
U
G
U
C
A
GC
UG
U
G
G
C
A
G U
CU
C
C
C U
UC
C
U
A
G
CC
A
U
G
G
AA
G A G C A U A U U C UU
G
U
U
U
AU
U
G
G
C
A
A
A
GC
U
G
U
CA
C
C
A
U
UU
RA
U
U
G
G
UA
U
C
A
G
A U U
C
U
GAC
U
U
G
C
A
C
A
AG
U
A
A
C
AU
U C5´ C Y G G U U GG
U
G
G
C
G
C
A
C
U
U
C
C
Y
Y
A
C
G
G
G
C
G
G
U
G
U R
U
Y
A
CG
Y R Y U R Y R R Y A G A R R R A Y A C C
A
G
C
C
C
G
C
Y
RR
R
A
G
C
G
G
G
C
UU
U U U U5´
G
U
C
A
U
A
C
U
A
C
G
G
UG
C
A
A
Y
GY
R
RA
A
A
G
U A
A
AC
G
A
U
G
A
C
C C Y
A
RG
A
A
C
U
C
Y
RG
G U A
A A
A
U
R
CR
UAUC
A
A
A
A
U
G
Y
A
A
A
A
U
U
G
U
Y U G A C C U G G GR
UY
Y
UCCGGGUYRG
Y
U
Y
U
U
U
U
5´
U R U G C U A A C U R R R A A YG
U
U
G
Y
A U
R
Y
A
A
CCC
U
U
G
R
Y
G
C
U
U
A
U Y
CC
U
U
U
R
Y
C
A
A
GC
A U A U U A Y AR
C
G
R
U
C
G
Y
YA
A A G G A G A A A U G5´
U C R A A A G A A C AU
G
A
A
A
U
G
G
A
G
G
AGAAAUU
AC
A
GC
A A U U UA
UC
AR
C U
G
A
A
A
UU
A
U
AG
G
U
GU
AG
ACA
C A
UGUC
A
GC
R G UG
G
A
A
A
CAGUU
UC U A
UC
A A A A UU
A A AG
U
A
U
UUAG
A
G
AUUUU
C
C
U
C A
AA
U
U
U
C
AA
A U5´
ACAG
G
G
U
A
R
G
G
R
Y
Y
Y
Y
Y
UU
RU
R
R
R
R
R
Y
C
C
U
U
A
C
C
G
GR
UUUCU
C
A
A
R
U
Y
G
G
R
G
YA
AA
Y
C
C
G
R
U
U
G
RA
RUAUARAGGARG5´
CGYGUUA
U
A
U
G
CC
UU
U
A
U
U
G
UC
ACARUUYUUUUUYYG
Y
U
G
R
Y
C
A
U
U
G
GYAY
YA
U
U
R
A
U
U
Y
C
C
A
G
CR
AUAAAYG
A
C
A
A
G
C
C
C
G
A
A
C
RY
U
G
U
U
C
G
G
G
C
U
U
U
UU
UUURRUYA5´
Y Y Y AU
G
G
Y
G
G
Y
G
R
G
G
G
R
RCC
UU
Y
G GG Y
Y
G
C
C
G
GUU
C
C
YY
R
CCG
GU Y U RC
C
A
A
C
C
C
Y
Y
R
C
Y
R
C
C
AC
C Y5´
AUGGAYRU
G
C
G
C
A
GGA
A
G
C
G
CR
AAGACARACAGGGACACRYAGGRA
C
CCG
GA
UGGYGGRRYAGGAUGUCAGGRAACAGUCUGCA
A
A
G
C
C
C
C
G
C
YY
YG
G
C
G
G
G
G
U
U
U
U
5´
P s-R ho r nk ps M gsens t R N A S Q r r isr C H H 1 SN R 24 T r p ldr gr eA pr eQ 12 H A R 1F T er m L eu M icC C 4 R sm Y R ib osom e
Paul Gardner Engaging Scientists
What is Rfam?
A database of ncRNA alignments and structures
Used for annotating RNAs in genome sequences, bioinformatic
algorithm development and molecular evolutionary analyses
Gardner et al. (2008) Rfam: updates to the RNA families database
Nucleic Acids Research.
Paul Gardner Engaging Scientists
How can we keep textual descriptions of RNAs up to date?
AC RF00005
ID tRNA
CC Transfer RNA (tRNA) molecules are approximately 80 nucleotides in
CC length. Their secondary structure includes four short
CC double-helical elements and three loops (D, anti-codon, and T
CC loops). Further hydrogen bonds mediate the characteristic
CC L-shaped molecular structure. tRNAs have two regions of
CC fundamental functional importance: the anti-codon, which is
CC responsible for specific mRNA codon recognition, and the 3’ end,
CC to which the tRNAs corresponding amino acid is attached (by
CC aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of
CC the genetic code in two manners: having more than one tRNA (with
CC a specific anti-codon) for a particular amino acid; and ’wobble’
CC base-pairing, i.e. permitting non-standard base-pairing at the
CC 3rd anti-codon position.
RN [1]
RM 8256282
RT The tertiary structure of tRNA and the development of the genetic
RT code.
RA Hou YM;
RL Trends Biochem Sci 1993;18:362-364.
RN [2]
RM 9023104
RT tRNAscan-SE: a program for improved detection of transfer RNA genes
RT in genomic sequence.
RA Lowe TM, Eddy SR;
RL Nucleic Acids Res 1997;25:955-964.
Paul Gardner Engaging Scientists
This Wikipedia thing looks pretty good!
Paul Gardner Engaging Scientists
WikiProject RNA
The WikiProjects are social corners of Wikipedia for interested
parties to discuss themed articles
Involved in reviewing, ranking and rating articles
Now rolled into the larger WikiProject Molecular and Cellular
Biology
Paul Gardner Engaging Scientists
How has the Wikipedia experiment gone?
x x x x
x
x
x x
x
x
x x x x x x x
x x x x x x x x
x x
x
x x x x x x x x x x
x
x x
x
x x x
0
2000
4000
6000
8000
10000
Number of Rfam pages edited
Year
Numberofedits
2007 2008 2009 2010 2011
9089
x x xxxxxxxxxxx xxxxxxxxxxxx xxxxx xx x
106
Total edits
Vandalism
Gardner et al. (2011) Rfam: Wikipedia, clans and the “decimal”
release Nucleic Acids Research.
Paul Gardner Engaging Scientists
Who are these Wikipedians donating their time?
Rfambot
Ppgardne
Citationbot1
WillowW
SmackBot
DOI_bot
Addbot
Alexbateman
Jebus989
JenniferRfm
Zashaw
Rjwilmsi
Qwyrxian
Yobot
RE73
Narayanese
RichFarmbrough
Addshore
Wgscott
MiRroar
RjwilmsiBot
Arcadian
DO11.10
Gortonk
Banus
Drmed36
FrescoBot
Boghog
Top 20 Rfam wikiproject editors
Numberofedits
0
200
400
600
800
1000
Bots
Proof Readers
Scientists
Paul Gardner Engaging Scientists
What incentives can we give to Academics?
Academics love publishing articles
Introducing the “families track” at RNA Biology
Publication requirements are an alignment & a Wikipedia
article
100s of new families have been added thanks to this track
Paul Gardner Engaging Scientists
Who else is now using this model?
Finn, Gardner, Bateman (2012) Making your database available
through Wikipedia: the pros and cons Nucleic Acids Research.
Paul Gardner Engaging Scientists
Wikipedia need you!
What is the highest impact contribution academics can make?
Rule 1: Register an Account
Rule 2: Learn the Five Pillars
ENCYC, NPOV, FREE, RESPECT, NORULES
Rule 3: Be Bold, but Not Reckless
Rule 4: Know Your Audience
Rule 5: Do Not Infringe Copyright
...
Paul Gardner Engaging Scientists
Who might be reading about your field?
Paul Gardner Engaging Scientists
Thanks!
The Rfam Consortium
Wikipedians & the long
tail!
PPG is supported by a Rutherford Discovery Fellowship from Government funding, administered by the Royal
Society of New Zealand.
Paul Gardner Engaging Scientists
Engaging Scientific Communities in Contributing to a Biological Database

Mais conteúdo relacionado

Destaque

Bioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAsBioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAs
Paul Gardner
 
A visit to london
A visit to londonA visit to london
A visit to london
sehrish123
 
Vizbi2013: Visualising RNA
Vizbi2013: Visualising RNAVizbi2013: Visualising RNA
Vizbi2013: Visualising RNA
Paul Gardner
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
Paul Gardner
 

Destaque (12)

Later is NOW! Extract Bb Content
Later is NOW! Extract Bb ContentLater is NOW! Extract Bb Content
Later is NOW! Extract Bb Content
 
Citizen Scientists
Citizen ScientistsCitizen Scientists
Citizen Scientists
 
Bioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAsBioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAs
 
My Favourite Pastime
My Favourite PastimeMy Favourite Pastime
My Favourite Pastime
 
A visit to london
A visit to londonA visit to london
A visit to london
 
Vizbi2013: Visualising RNA
Vizbi2013: Visualising RNAVizbi2013: Visualising RNA
Vizbi2013: Visualising RNA
 
Random RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotesRandom RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotes
 
Sakai: Set Up Your Course Site
Sakai: Set Up Your Course SiteSakai: Set Up Your Course Site
Sakai: Set Up Your Course Site
 
Sakai: Get oriented
Sakai: Get orientedSakai: Get oriented
Sakai: Get oriented
 
BIOL335: Homology search
BIOL335: Homology searchBIOL335: Homology search
BIOL335: Homology search
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
 
BIOL335: RNA bioinformatics
BIOL335: RNA bioinformaticsBIOL335: RNA bioinformatics
BIOL335: RNA bioinformatics
 

Semelhante a Engaging Scientific Communities in Contributing to a Biological Database

как это работает4
как это работает4как это работает4
как это работает4
Vladislav Troshin
 
Waukegan west 1984
Waukegan west 1984Waukegan west 1984
Waukegan west 1984
Dave Levine
 
Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984
Dave Levine
 
PFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBookPFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBook
punxsyscience
 
Sopa de letras
Sopa de letrasSopa de letras
Sopa de letras
nirvana18
 

Semelhante a Engaging Scientific Communities in Contributing to a Biological Database (20)

Colageno I Secuenciacion
Colageno I SecuenciacionColageno I Secuenciacion
Colageno I Secuenciacion
 
Gene Sequences
 Gene Sequences Gene Sequences
Gene Sequences
 
как это работает4
как это работает4как это работает4
как это работает4
 
Waukegan west 1984
Waukegan west 1984Waukegan west 1984
Waukegan west 1984
 
CONJUNTOS
CONJUNTOSCONJUNTOS
CONJUNTOS
 
Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984
 
taller sobre la hidrografia colombia.docx
taller  sobre la hidrografia colombia.docxtaller  sobre la hidrografia colombia.docx
taller sobre la hidrografia colombia.docx
 
3D Printing Basics: Going From Bytes To Atoms
3D Printing Basics: Going From Bytes To Atoms3D Printing Basics: Going From Bytes To Atoms
3D Printing Basics: Going From Bytes To Atoms
 
Projek akhir asas pengangkutan data a168611
Projek akhir asas pengangkutan data a168611Projek akhir asas pengangkutan data a168611
Projek akhir asas pengangkutan data a168611
 
PFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBookPFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBook
 
Puente diciembre del 5 al 8.
Puente diciembre del 5 al 8.Puente diciembre del 5 al 8.
Puente diciembre del 5 al 8.
 
¡Recordemos las letras!
¡Recordemos las letras!¡Recordemos las letras!
¡Recordemos las letras!
 
Practiquemos las letras
Practiquemos las letrasPractiquemos las letras
Practiquemos las letras
 
Animaliak
AnimaliakAnimaliak
Animaliak
 
Sopa de letras
Sopa de letrasSopa de letras
Sopa de letras
 
Landschap en Energie Inpassingsstrategien West-Brabant
Landschap en Energie Inpassingsstrategien West-BrabantLandschap en Energie Inpassingsstrategien West-Brabant
Landschap en Energie Inpassingsstrategien West-Brabant
 
Diapositivas para Proyecto.pptx
Diapositivas para Proyecto.pptxDiapositivas para Proyecto.pptx
Diapositivas para Proyecto.pptx
 
Permainan bahasa
Permainan bahasaPermainan bahasa
Permainan bahasa
 
Come realizzare una fondazione a "Platea Calda"
Come realizzare una fondazione a "Platea Calda"Come realizzare una fondazione a "Platea Calda"
Come realizzare una fondazione a "Platea Calda"
 
PPT Komp. Makam Arung Palakka
PPT Komp. Makam Arung PalakkaPPT Komp. Makam Arung Palakka
PPT Komp. Makam Arung Palakka
 

Mais de Paul Gardner

Mais de Paul Gardner (20)

ppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdfppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdf
 
ppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdfppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdf
 
ppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdfppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdf
 
ppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdfppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdf
 
ppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdfppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdf
 
Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?
 
Machine learning methods
Machine learning methodsMachine learning methods
Machine learning methods
 
Clustering
ClusteringClustering
Clustering
 
Monte Carlo methods
Monte Carlo methodsMonte Carlo methods
Monte Carlo methods
 
The jackknife and bootstrap
The jackknife and bootstrapThe jackknife and bootstrap
The jackknife and bootstrap
 
Contingency tables
Contingency tablesContingency tables
Contingency tables
 
Regression (II)
Regression (II)Regression (II)
Regression (II)
 
Regression (I)
Regression (I)Regression (I)
Regression (I)
 
Analysis of covariation and correlation
Analysis of covariation and correlationAnalysis of covariation and correlation
Analysis of covariation and correlation
 
Analysis of two samples
Analysis of two samplesAnalysis of two samples
Analysis of two samples
 
Analysis of single samples
Analysis of single samplesAnalysis of single samples
Analysis of single samples
 
Centrality and spread
Centrality and spreadCentrality and spread
Centrality and spread
 
Fundamentals of statistical analysis
Fundamentals of statistical analysisFundamentals of statistical analysis
Fundamentals of statistical analysis
 
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
 
A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...
 

Último

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Último (20)

FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System Strategy
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Buy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptxBuy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptx
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 

Engaging Scientific Communities in Contributing to a Biological Database

  • 1. Engaging a Scientific Community in Contributing to a Biological Database Paul Gardner June 21, 2013 Paul Gardner Engaging Scientists
  • 2. What is RNA? RNA is a fundamental biological molecule, essential for untold biological processes My aim is to build an analog to the Periodic Table for classifying RNA families and motifs, enabling researchers to predict function. New technologies are accelerating the rate of RNA discovery. base basepair R A U A G A U Y A C A U U 5´ Y G A A R 5´ C U U C G G 5´ R U R R R Y 5´ R R G C G U R A R A G C Y 5´ R Y G G A G Y R RR R C RR G A R R 5´ C G A A G Y Y R Y Y RR G G G R U G G A G 5´ C C R A Y C C C R U C C G A A C U Y G G 5´ A N Y A G N R A U N C G T loop U t ur n k t r n1 k t r n2 tw ist R C Y R G G A AC U G A RC R U Y AG U A C G GG A R R A5´ Y Y Y A GU A G Y R A G G A A R R R 5´ R Y G R Y A A Y C RY A Y Y A G R GA A Y C 5´ R C A GG A G Y 5´ A C A C U G R Y R Y G Y R R R R R Y C A R U Y 5´ R A G C R C G R A G Y AY G Y Y R G U U Y 5´ A A A A A G C Y R Y Y R R Y G G Y U U U U UU Y U Y5´ R R A R R Y Y U U UU U U Y5´ sar r ic1 sar r ic2 U A A G A N C sr C loop dom V t er m 1 t er m 2 R Y Y Y Y G C G A G C A G A C G C A R A A C R C C C R R Y R R Y G G G Y G U U Y U G C G U C U G C U C G C R R R R5´ Y U Y UC U C A A C AG UG Y U U G R R R A A Y 5´ Y Y Y Y Y A U GA Y G R Y Y Y YA A A Y Y Y YY R R G R R Y C U GAU Y Y Y R R R 5´ G G G U C U C U C U G Y U A G A C C A G AU CU G A G C C UG GG A G C U C U C U G G C U A R C U A G G G A A C C CA C5´ UG U A A A C A U C CU Y G A C U G G A A G C UG U R R R Y R Y R R RR G C U U U C A G U C G G A U G U U U G C 5´ U CU U U G G U U A U C U A G C U G UA U G AG U G Y Y R C RU C A UA A A G C U A G A U A C C G A AR U5´ C Y Y R UC C C U G A G A C C C U A A C Y U G U G AG Y U Y YY A G Y UU C A C A R G U R G G Y U C U Y G G G R CY R G G 5´ G C U A A A A G G A A C G A U C G U U G U G A U A U GC G U U RRU U YC G U U AC A U A U C A C A G U G A U U U U C C U U U A U A R CG C5´ C Y GY G Y Y C A U C U U A C Y G RG C A G U G U U G GA U G Y YY R R G Y C UC U A A Y A C U G YC U G G U A A Y G A U G R C RY C G G5´ Y Y Y Y R R GY A C A U R C U U C U U U A U A U C C C A U AY R A Y R R R CU A U G G A A U G U A A A G A A G U A U G U AY Y Y G G Y5´ Y R R YY C R U C A A A R U G G Y U G U G A R U G U Y R U CA U A U C A C A G C C A C U U U G A U G AG Y U Y R R5´ Y A A RA A G G G A A Y R G U U G C U G U G A U R U A Y Y Y A Y Y Y Y U YU A U A U C A C A G U G G C U G U U C U U U UU G G U Y5´ Y C R G G U G A G G U A G U A G G U U G U A U A G U U RR R R Y Y Y Y YG G A GY A A C U R U A C A A Y C U R C U A C U U Y C C U G R 5´ G G C U G G U C C G A R RG U A G U G G G U U A Y R U Y A AY Y Y Y U U R Y Y Y YU C Y C C CYC Y C A C U RC UR YA C U U G A C U R G C CU U U5´ Y Y Y C U G Y R R U G U C G UA R Y Y Y Y Y U G A R C CRAY Y Y Y Y Y G G G R G Y Y Y Y Y R G G YA G C C C YY G G GA A R C A A R Y R R R R Y R C C C A CCU R R R Y R YRG G U U C A R R R R Y A C G G C A Y Y R Y G G R Y YY Y5´ Y Y R C G R C C A UA C R R R G R A R C A CC Y G R U C C CA U C C G A A CY C R GA A G U UA A GC Y Y Y Y GG C Y R R G U A C U R G R YG RG R AYC CUG GG AA RY RGGU G Y Y G Y R RY 5´ G RU A GYYY AR Y G G Y AR R R C RY Y R G Y U Y A A Y Y R R RR Y R RG G UU C R AR U C C Y YY YR 5´ R R AAR Y U C R Y R R R R GYYAC R R YG A G U R Y Y R YRCUC Y CYYYY G G G A A GGU C U G A G A R G C CAY Y R C C CU G GGGYR Y Y Y Y Y Y GR R R R G R R R R Y G R G Y Y A C C AG A A A Y R R Y Y Y Y R RGY U U GGAA RRCUYRY GGCY RG Y R R Y U A G U C A A U R Y GRR Y R R Y Y Y R AAC Y C R A UUCAG A C U A UCU Y Y 5´ T R I T I R E SE C I S m ir -T A R m ir -30 m ir -9 lin-4 m ir -5 m ir -8 m ir -1 m ir -2 m ir -6 let -7 Y R N A 6S 5S t R N A R N aseP AURRGRYA G G YA U U G AA CUGU AU U G U G CR C C UU GCAUARAGCUAAAGCACUAAAAAGGAGUAA5´ A G U C A U G A U YG C U A U U C Y Y Y A A A U A G UG A U U G U G A U AG C G A U G C G G Y G U G U UG C G C A C R Y C G Y A Y C G CG C U5´ AGAGGAARCR G G G G C CAY G C A GAAGC G U UC ACG U C G C G G C C C CU GUC A G A U U C RGU R A A U C U GC GAAUUCUGCU5´ G A U AC A U A G G A A C C U C C U C A A A G G A U U C U A U GG A C AG U C G A U G C A G G G A G G G A CR R C U C C C U G C A U C G G CG A U U U U5´ A C G R RG U R RA R UG C G A U A A Y A YA A U A A U GAAA U U C C U CU U U G A C G G C C A A U A GC GA U A U U G G C CA U U U U U U U 5´ R Y C U U U A G C G GG Y U R RR U Y A R U CURG Y Y G G Y G U U U C G C C G R C Y YU R C Y Y U G A Y R Y 5´ RYYRYYCC G U G G UG A U U U G RYC GGCCGG C U U G C AG C C A C GU UAAAYAAUCGCUAAARAGGCCGRGGRRR5´ G UCGRR U Y Y C A C UG A U G AG U C Y U R ARGAC G A AA C 5´ Y Y R A U Y U AAA RA A A C A G CU U UC A AG U G CCU U U Y U GC A G U U YYY CARGAGCGC A A G A U RG R U A5´ R Y G GY Y G Y U U G C C A U A C G C C C YY Y YY C G G C A GG U A U G G A A R C A C C C YC G Y A CG A C U G GY Y C G G A C A CY GY C G U C CC G C C A G A U C 5´ CA C A U C A G A U U U C C U G G U G UA A CG A A U U U U C A A G U G C U U C U U G C A U A A G C A A G U U URA U C C C G C Y C CY YC G R G Y C G G G A UU U5´ A U GG A G A C A UGGCR U AA AG C C AG A R A G U R A G A AC R U A A C Y U A G A C U R U ACUUGAA C U G A U UYRC A U C U CA U U U U5´ G C R C Y G C AA AA U C R G R Y G C C G G G A U UG G YA YCCCG R A Y R R R R Y R A R C G C Y GCGYU U U U U U 5´ Y U R C G U G A C G A A G CG C G C G CA A A G UGG A C AA U A A AG C C UR A G C RU Y R A G UAG U C G Y CAG A C G C C G G U U A A G C C G G C G U UU U U U5´ YR Y A C G UR Y C Y G U U R UR G Y C C G G U U G C U U UG GU C G G U G A C C G G R R R R R A G C C C R C UU G G U G G G Y U UU U U5´ G G Y C R G C Y C R CC C C CC R G R G C Y G R C C G A C G G C C C C C G C U CC C C CCY GGCGGGGGYCGUC C C Y Y 5´ U U G G C G A U R UU U U U G GU U G G A A U G UAGUGY YY UU A R C A C U AA A CG C U G CC A C AA A U A A CCUG U CAGU U A U U U C A Y C A A A AA U A A A5´ RYYRYUG C C C UCY G G G CG UUUCCUCCCUAGACUU G G C Y Y YY R R G G C CU UUUUUUUYYY5´ SA M V sym R C P E B 3 F inP sr oB m sr SA M a H H 3 V m nt n3 livK D sr A C A E SA R isr K sr oD isr B 6C r spL suhB UY G C A UCCGCYAA Y CGGUYA G C C GU G UC G C GG A A G G U U Y Y Y A A C CA G C UR Y Y U Y Y G RA ACRRAG RRA GGUG A G C G 5´ UG A A A GAC G C G C A U U U GU U A U C A U CA UC C CU G U Y C A G AG A U GY A A U U U GG CC AC AG Y RY G U G G C C U U U U C 5´ * U U C U A C U G A C U C UU U U A AA A U A AU U A U U C A U U G G AG G U UU A A UA U G A A U A UA A A G G A U G A G CA U A U A G A AG C GUUUG C UCYUU GU U A G AU C R G U U A G U A G G AA 5´ G A U U UG G U R R C U G C G C U C UU C UA A G C C A G U U A C C CG G U U C A A A R A U U G C C A G C U U Y G A A C CU UC G A A A A A C C A C C U Y CR R G G U G G U U U U U U C GU 5´ R R R R R R R R C U C R U AU A A YYYCRRR AA U A UG GY Y Y G R R A GU U U C UAC C R R G Y R C CG U AAA YRYYYG A CU A Y G A G RR R5´ C G G C A U C C C C A U U A C C U A U G G AC A CG G U G C C G C A R G C U C U G G R A G UU C GUYCCRGAGYYUG Y Y G G A A R G G U U U U C C G U G U C C A G 5´ R R Y G G A R G CRR U GA R Y R Y Y Y YU Y A U YU G G GCA C Y U G R R R Y R YG G A G C YAG U R GU G C A ACCG R C C R Y R R R 5´ G U U G U A A C U AU G U U G C A R YA R A C G AG A A C C G AG U A U A G U U C A U GG G R U Y A CA UG AA UU G U UU A A CU RU CC U C U GG A U U C CC G U C C AU G R C A GU C G G U U C 5´ CUUA C U G A GA G C A C AA A GU UUC C C G U GC CA A C A G G G A G U G U UAU A AC G G UU UAUU A G U C U G G AG ACG G C A G A C U AU CCUCUUC C C G G U C CC CUA U G C C G G GU UUUUUUUAUGUC5´ UURGRYUYRCCUG A A U G U G A CU A U C A C U U CA AACRRYGRGYAACCUCAGUAUCAUCRYRGAGYUA A A C C C U C G C C G C CUG A C G G Y G A G G G U U UU CUUUUGGR5´ U G U A A A A A A C A U Y A U U UA G C GUGAYU U U C U A U C A ACAG C U A A C A A U U G U UA U U A C UG C CUA A Y G Y U C A UA A G G G U A AUU U U A A A A A AGG G CG A U A A AA A A C G A U U G G GGGA U G A G A Y A U G AAC G C UC A A G C A5´ C C C A G A G G U A U U G A UU G G U G A U R R C A Y Y U C U R U G Y U Y A U UY A U UR C A C C A A C C U G C G C RG A UGCGCAGGU U U U U U U U 5´ AR R R Y Y YYYAAURYCAACYUUUAGCGCACG G C U C U YY A A G A G C CA UUYCCCUA G R C C A A A C A G GAAU Y G U U U G G Y C UU UUUUU5´ G G G C A R G A U A U G U G A A GU R GC Y A C C GC AA GC YGR U A CY CUU CAC Y Y Y C C U U A U UC G C U Y GC U CAAC GGR A U C Y U G C U CU G C G A G G C Y5´ GUGCRRYCYRAUUYYR G Y Y G Y G C C Y R Y R A R AAC AUCAYAA R A U A CG G C R C R R CC ACRAUUUCCCUG G U G U U GG C G C A GU AUU C G C G C A C CC CGGUCUACC5´ Y U U Y R Y U R R U U U Y A U C A R A YC U GU U U G A U R R A A G Y U A R Y G A R R Y Y C A Y UA A C R G C U Y U Y GC Y G G C Y Y G A C C C G A G R Y Y G U UU U U U U5´ RACGUUCAY C C Y YY R G G RC GCAYRA Y C A R R Y C A Y GG AAC G G G G R Y Y U G R R 5´ sucA Sr aD sxy R N A I P ur ine SA M -C hl cdiG M P 2 A nt i-Q G adY r nk ldr P r fA O m r A -B R yeB t r aJ 2 Sr aH 23Sm et h D S-p ep U U C G G C C Y CG C R R C G YU U YU Y C G Y Y G CC C U C U G C A YG C C G U C G C C G A CGCAY U C C Y A U U CG A A Y Y G U G C G A U C C U G U C G C CY U C C U GC G G C G C G G C 5´ CG Y R G C G C U U G U UA U U U R Y Y G C U G U G U A G U GUC G U C Y YR A R Y Y R G R R Y Y Y A A A C C C C G C C Y UU Y G G C G G G G U U U UG C U U U U U5´ ** C U U A C C G G A G GY R U A UGGAC C C UG A UC C C AC Y C C U C U C C C C G A UG G A G AA U Y Y YU U U C C G G U A A GC C Y G Y C U Y Y R C U G Y Y U U A C C G G UG Y G U A A G G C A G UG A C G U Y U5´ G G R A G R Y R Y CU G GU G R Y C G G C U UC A AA CC GR Y G RR G Y R Y Y Y Y G G Y RGG U U C G AY U C C Y RY Y C U Y C C 5´ U G A C C C U U U A R C C R A G G G U C AC C U A G C C A A C U G A C GU U G U U AG U G A A Y YY A U G U U C A C A RA U A R GC C A A U C G C U U U G C G R U U G GC U U U U U U U U U5´ C U U A A UR A A CAA G A A A A C YAA R C G U A C Y U U C C Y C C U G AG UU C A G G C U G G A A UG C G C A CAG C U RA U U G U U G A U AA G G G CU ACUC AUACCGACAA GC CAGU G A A G C G AUG A AU G U C GG U U CC A C5´ R U Y Y RC U G A Y GA G U C C C A A AU A G G A CGA A A C G C GCGU CY G R A U 5´ CU C C A U GU A U C U UU G G G A C C U G U C A GC UG U G G C A G U CU C C C U UC C U A G CC A U G G AA G A G C A U A U U C UU G U U U AU U G G C A A A GC U G U CA C C A U UU RA U U G G UA U C A G A U U C U GAC U U G C A C A AG U A A C AU U C5´ C Y G G U U GG U G G C G C A C U U C C Y Y A C G G G C G G U G U R U Y A CG Y R Y U R Y R R Y A G A R R R A Y A C C A G C C C G C Y RR R A G C G G G C UU U U U U5´ G U C A U A C U A C G G UG C A A Y GY R RA A A G U A A AC G A U G A C C C Y A RG A A C U C Y RG G U A A A A U R CR UAUC A A A A U G Y A A A A U U G U Y U G A C C U G G GR UY Y UCCGGGUYRG Y U Y U U U U 5´ U R U G C U A A C U R R R A A YG U U G Y A U R Y A A CCC U U G R Y G C U U A U Y CC U U U R Y C A A GC A U A U U A Y AR C G R U C G Y YA A A G G A G A A A U G5´ U C R A A A G A A C AU G A A A U G G A G G AGAAAUU AC A GC A A U U UA UC AR C U G A A A UU A U AG G U GU AG ACA C A UGUC A GC R G UG G A A A CAGUU UC U A UC A A A A UU A A AG U A U UUAG A G AUUUU C C U C A AA U U U C AA A U5´ ACAG G G U A R G G R Y Y Y Y Y UU RU R R R R R Y C C U U A C C G GR UUUCU C A A R U Y G G R G YA AA Y C C G R U U G RA RUAUARAGGARG5´ CGYGUUA U A U G CC UU U A U U G UC ACARUUYUUUUUYYG Y U G R Y C A U U G GYAY YA U U R A U U Y C C A G CR AUAAAYG A C A A G C C C G A A C RY U G U U C G G G C U U U UU UUURRUYA5´ Y Y Y AU G G Y G G Y G R G G G R RCC UU Y G GG Y Y G C C G GUU C C YY R CCG GU Y U RC C A A C C C Y Y R C Y R C C AC C Y5´ AUGGAYRU G C G C A GGA A G C G CR AAGACARACAGGGACACRYAGGRA C CCG GA UGGYGGRRYAGGAUGUCAGGRAACAGUCUGCA A A G C C C C G C YY YG G C G G G G U U U U 5´ P s-R ho r nk ps M gsens t R N A S Q r r isr C H H 1 SN R 24 T r p ldr gr eA pr eQ 12 H A R 1F T er m L eu M icC C 4 R sm Y R ib osom e Paul Gardner Engaging Scientists
  • 3. What is Rfam? A database of ncRNA alignments and structures Used for annotating RNAs in genome sequences, bioinformatic algorithm development and molecular evolutionary analyses Gardner et al. (2008) Rfam: updates to the RNA families database Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 4. How can we keep textual descriptions of RNAs up to date? AC RF00005 ID tRNA CC Transfer RNA (tRNA) molecules are approximately 80 nucleotides in CC length. Their secondary structure includes four short CC double-helical elements and three loops (D, anti-codon, and T CC loops). Further hydrogen bonds mediate the characteristic CC L-shaped molecular structure. tRNAs have two regions of CC fundamental functional importance: the anti-codon, which is CC responsible for specific mRNA codon recognition, and the 3’ end, CC to which the tRNAs corresponding amino acid is attached (by CC aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of CC the genetic code in two manners: having more than one tRNA (with CC a specific anti-codon) for a particular amino acid; and ’wobble’ CC base-pairing, i.e. permitting non-standard base-pairing at the CC 3rd anti-codon position. RN [1] RM 8256282 RT The tertiary structure of tRNA and the development of the genetic RT code. RA Hou YM; RL Trends Biochem Sci 1993;18:362-364. RN [2] RM 9023104 RT tRNAscan-SE: a program for improved detection of transfer RNA genes RT in genomic sequence. RA Lowe TM, Eddy SR; RL Nucleic Acids Res 1997;25:955-964. Paul Gardner Engaging Scientists
  • 5. This Wikipedia thing looks pretty good! Paul Gardner Engaging Scientists
  • 6. WikiProject RNA The WikiProjects are social corners of Wikipedia for interested parties to discuss themed articles Involved in reviewing, ranking and rating articles Now rolled into the larger WikiProject Molecular and Cellular Biology Paul Gardner Engaging Scientists
  • 7. How has the Wikipedia experiment gone? x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x 0 2000 4000 6000 8000 10000 Number of Rfam pages edited Year Numberofedits 2007 2008 2009 2010 2011 9089 x x xxxxxxxxxxx xxxxxxxxxxxx xxxxx xx x 106 Total edits Vandalism Gardner et al. (2011) Rfam: Wikipedia, clans and the “decimal” release Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 8. Who are these Wikipedians donating their time? Rfambot Ppgardne Citationbot1 WillowW SmackBot DOI_bot Addbot Alexbateman Jebus989 JenniferRfm Zashaw Rjwilmsi Qwyrxian Yobot RE73 Narayanese RichFarmbrough Addshore Wgscott MiRroar RjwilmsiBot Arcadian DO11.10 Gortonk Banus Drmed36 FrescoBot Boghog Top 20 Rfam wikiproject editors Numberofedits 0 200 400 600 800 1000 Bots Proof Readers Scientists Paul Gardner Engaging Scientists
  • 9. What incentives can we give to Academics? Academics love publishing articles Introducing the “families track” at RNA Biology Publication requirements are an alignment & a Wikipedia article 100s of new families have been added thanks to this track Paul Gardner Engaging Scientists
  • 10. Who else is now using this model? Finn, Gardner, Bateman (2012) Making your database available through Wikipedia: the pros and cons Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 11. Wikipedia need you! What is the highest impact contribution academics can make? Rule 1: Register an Account Rule 2: Learn the Five Pillars ENCYC, NPOV, FREE, RESPECT, NORULES Rule 3: Be Bold, but Not Reckless Rule 4: Know Your Audience Rule 5: Do Not Infringe Copyright ... Paul Gardner Engaging Scientists
  • 12. Who might be reading about your field? Paul Gardner Engaging Scientists
  • 13. Thanks! The Rfam Consortium Wikipedians & the long tail! PPG is supported by a Rutherford Discovery Fellowship from Government funding, administered by the Royal Society of New Zealand. Paul Gardner Engaging Scientists