SlideShare uma empresa Scribd logo
1 de 18
Baixar para ler offline
Life as a scientific database curator


          Sandra Orchard




                EBI is an Outstation of the European Molecular Biology Laboratory.
What is a database curator

       Curator – OED

            - a keeper of a museum or other collection

            - from LATIN curare – take care of




2/17
What is a database curator

       The job
       • Creating a structure for unstructured biological data
       • Generating order from chaos
       • Combining literature and automated processes to provide
         biomolecules with correct sequence/structure,
         nomenclature, function and contextual information
       • Give biological context to large experimental datasets
       The qualification
       • Need an attention to detail which would annoy even the
         best of housemates
       • Passion for reading and understanding literature

3/17
What is a database curator

       The Pros

       • Read about and gain understanding of all areas of
         biology

       The Cons

       • No specialisation
       • Persuading biologists that there are benefits to this.




4/17
What is a database curator

• The International Society for Biocuration (ISB) definition:
...integration of information relevant to biology into a
    database or resource that enables integration of the
    scientific literature...and large experimental data sets.
• Goals are
...accurate and comprehensive representation...
...to facilitate access to data for scientists...as a resource for
    computational analysis
What does a database curator do?
Collects, annotates, and validates information (in a
database).


Extracts & organizes data from literature


Describes data using standards, protocols and
vocabularies (enabling computational queries and data
exchange).

Communicates with researchers to ensure the accuracy
of curated information and to foster good practice in data
exchange.
What does a database curator do?

            Takes part in the development of shared
            biomedical data standards and ontologies
            and (ideally) enforces their use.

            Trains users in effectively accessing and
            using the data in the databases

            Promotes database usage through talks,
            conference attendance/posters,
            publications etc…..



7/17
What do I do?

       • Curate the molecular interaction database




8/17
What do I do?




       Custom curation tools designed by the curation team


9/17
What do I do?

                        Controlled vocabulary maintenance




10/17
Qualifications for the job

        • A biology B.Sc./M.Sc./PhD + lab experience

              or

        • A bioinformatics M.Sc

        Plus – an enquiring mind, ability to write good English and
          the right attitude

        Training – largely database specific and will be given ‘on-
          the-job’



11/17
Qualifications for the job

        • Do I need to be able to do programming?

        • Answer – no. It is often helpful to have some database
          query ability but it is perfectly possible to do the job
          without (in most databases)




12/17
Career Progression

        Within the EBI
        • Progress as a curator – senior curator, curation
          coordinator

        • Project management – grant coordinator, project leader

        Post –EBI
        • Curation/project leadership positions at many other
          institutes
        • Related areas – academic research, research project
          management, lectureships, journal publishing

13/17
Will I still be allowed to publish?

        Curation
        The annotation of both human and mouse kinomes in
          UniProtKB/Swiss-Prot - (MCP)
        Data Standards
        The Minimum Information required for reporting a Molecular
          Interaction Experiment (MIMIx) – (NBT)
        Data Formats
        The HUPO PSI's molecular interaction format--a community
          standard for the representation of protein interaction data.
          – (NBT)



14/17
Will I still be allowed to publish?

        Tool development
          Rintact: enabling computational analysis of molecular
          interaction data from the IntAct repository.
          (Bioinformatics)
        Ontologies
        The use of common ontologies and controlled vocabularies
          to enable data exchange and deposition for complex
          proteomic experiments (Pac Symp Biocomput)
        Training
        Submit your interaction data the IMEx way - a step by step
          guide to trouble-free deposition (Proteomics)


15/17
Curation as a profession




16/17
Curation as a profession

        • Biocuration conference every 12 months – 2102 in
          Cambridge, UK

        • Opportunities for further training – bioinformatic tools,
          programming, career development/management

        • Attendance at biological/computational biology
          conferences encouraged – the EBI often provides
          speakers




17/17
Summary

        • Curation is not for everyone – it does require a certain
          mindset

        • Exposes you to all areas of biology (and chemistry)

        •   Now a recognised profession and our numbers are
            growing

        • Many opportunities to be become involved in “extra-
          curriculum” activities – its not all reading papers



18/17

Mais conteúdo relacionado

Mais procurados

Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBIgeetikaJethra
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuBen Busby
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2Mohd Affan
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission ToolsRishikaMaji
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in BioinformaticsArindam Ghosh
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological databaseKAUSHAL SAHU
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid databaseEsakkiammal S
 
Biological databases
Biological databasesBiological databases
Biological databasesAfra Fathima
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES nadeem akhter
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in BioinformaticsMeghaj Mallick
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 

Mais procurados (20)

Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osu
 
Applications of bioinformatics
Applications of bioinformaticsApplications of bioinformatics
Applications of bioinformatics
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Biological databases
Biological databasesBiological databases
Biological databases
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 

Destaque

P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)phdcareers
 
P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)phdcareers
 
Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012phdcareers
 
E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)phdcareers
 
E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)phdcareers
 
Publishing Career Day Presentation AM
Publishing Career Day Presentation AMPublishing Career Day Presentation AM
Publishing Career Day Presentation AMphdcareers
 

Destaque (7)

P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)
 
P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)
 
Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012
 
E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)
 
E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)
 
Publishing Career Day Presentation AM
Publishing Career Day Presentation AMPublishing Career Day Presentation AM
Publishing Career Day Presentation AM
 
PhDretreat
PhDretreat PhDretreat
PhDretreat
 

Semelhante a E2 life as_a_scientific_database_curator_(sandra_orchard)

Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Susanna-Assunta Sansone
 
Teaching Case Studies
Teaching Case StudiesTeaching Case Studies
Teaching Case StudiesJulie Goldman
 
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...John Scally
 
"Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ..."Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ...Incremental Project
 
Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...NASIG
 
E-Science: New Roles for Libraries
E-Science: New Roles for LibrariesE-Science: New Roles for Libraries
E-Science: New Roles for LibrariesElaine Martin
 
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)phdcareers
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminarseanb
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
LIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Europe
 
Designing Biological Databases
Designing Biological DatabasesDesigning Biological Databases
Designing Biological DatabasesArjei Balandra
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation HeidornBryan Heidorn
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENTJen Rutner
 
Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen ARDC
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Robin Rice
 

Semelhante a E2 life as_a_scientific_database_curator_(sandra_orchard) (20)

Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015
 
Teaching Case Studies
Teaching Case StudiesTeaching Case Studies
Teaching Case Studies
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
 
"Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ..."Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ...
 
Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 
E-Science: New Roles for Libraries
E-Science: New Roles for LibrariesE-Science: New Roles for Libraries
E-Science: New Roles for Libraries
 
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
 
B4OS-2012
B4OS-2012B4OS-2012
B4OS-2012
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
Library Linkages
Library LinkagesLibrary Linkages
Library Linkages
 
LIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data Literacy
 
Designing Biological Databases
Designing Biological DatabasesDesigning Biological Databases
Designing Biological Databases
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation Heidorn
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENT
 
Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...
 

Último

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Último (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

E2 life as_a_scientific_database_curator_(sandra_orchard)

  • 1. Life as a scientific database curator Sandra Orchard EBI is an Outstation of the European Molecular Biology Laboratory.
  • 2. What is a database curator Curator – OED - a keeper of a museum or other collection - from LATIN curare – take care of 2/17
  • 3. What is a database curator The job • Creating a structure for unstructured biological data • Generating order from chaos • Combining literature and automated processes to provide biomolecules with correct sequence/structure, nomenclature, function and contextual information • Give biological context to large experimental datasets The qualification • Need an attention to detail which would annoy even the best of housemates • Passion for reading and understanding literature 3/17
  • 4. What is a database curator The Pros • Read about and gain understanding of all areas of biology The Cons • No specialisation • Persuading biologists that there are benefits to this. 4/17
  • 5. What is a database curator • The International Society for Biocuration (ISB) definition: ...integration of information relevant to biology into a database or resource that enables integration of the scientific literature...and large experimental data sets. • Goals are ...accurate and comprehensive representation... ...to facilitate access to data for scientists...as a resource for computational analysis
  • 6. What does a database curator do? Collects, annotates, and validates information (in a database). Extracts & organizes data from literature Describes data using standards, protocols and vocabularies (enabling computational queries and data exchange). Communicates with researchers to ensure the accuracy of curated information and to foster good practice in data exchange.
  • 7. What does a database curator do? Takes part in the development of shared biomedical data standards and ontologies and (ideally) enforces their use. Trains users in effectively accessing and using the data in the databases Promotes database usage through talks, conference attendance/posters, publications etc….. 7/17
  • 8. What do I do? • Curate the molecular interaction database 8/17
  • 9. What do I do? Custom curation tools designed by the curation team 9/17
  • 10. What do I do? Controlled vocabulary maintenance 10/17
  • 11. Qualifications for the job • A biology B.Sc./M.Sc./PhD + lab experience or • A bioinformatics M.Sc Plus – an enquiring mind, ability to write good English and the right attitude Training – largely database specific and will be given ‘on- the-job’ 11/17
  • 12. Qualifications for the job • Do I need to be able to do programming? • Answer – no. It is often helpful to have some database query ability but it is perfectly possible to do the job without (in most databases) 12/17
  • 13. Career Progression Within the EBI • Progress as a curator – senior curator, curation coordinator • Project management – grant coordinator, project leader Post –EBI • Curation/project leadership positions at many other institutes • Related areas – academic research, research project management, lectureships, journal publishing 13/17
  • 14. Will I still be allowed to publish? Curation The annotation of both human and mouse kinomes in UniProtKB/Swiss-Prot - (MCP) Data Standards The Minimum Information required for reporting a Molecular Interaction Experiment (MIMIx) – (NBT) Data Formats The HUPO PSI's molecular interaction format--a community standard for the representation of protein interaction data. – (NBT) 14/17
  • 15. Will I still be allowed to publish? Tool development Rintact: enabling computational analysis of molecular interaction data from the IntAct repository. (Bioinformatics) Ontologies The use of common ontologies and controlled vocabularies to enable data exchange and deposition for complex proteomic experiments (Pac Symp Biocomput) Training Submit your interaction data the IMEx way - a step by step guide to trouble-free deposition (Proteomics) 15/17
  • 16. Curation as a profession 16/17
  • 17. Curation as a profession • Biocuration conference every 12 months – 2102 in Cambridge, UK • Opportunities for further training – bioinformatic tools, programming, career development/management • Attendance at biological/computational biology conferences encouraged – the EBI often provides speakers 17/17
  • 18. Summary • Curation is not for everyone – it does require a certain mindset • Exposes you to all areas of biology (and chemistry) • Now a recognised profession and our numbers are growing • Many opportunities to be become involved in “extra- curriculum” activities – its not all reading papers 18/17