Genome Resources at the EBI -
     Ensembl and Ensembl Genomes
     Bert Overduin, Ph.D.




PAG XX, January 15th 2012, San Diego
                                       EBI is an Outstation of the European Molecular Biology Laboratory. 
EBI Database Workshop
Outline

     •  Introduction to Ensembl / Ensembl Genomes
     •  Highlights in 2011
     •    Demo 1: Browser basics
     •    Demo 2: Variant Effect Predictor
     •    Demo 3: Adding custom tracks
     •    Demo 4: BioMart
     •  Future plans for 2012
     •  Help & Workshops
     •  Acknowledgements




Goal

     To provide access to genome-scale data from
     completely sequenced species of scientific
     interest from across the taxonomy




History

     •  1999: Start of Ensembl project for the Human Genome Project
     •  2000: First release of data and web interface
     •  2009: First release of Ensembl Genomes
     •  2011: Ensembl v65: 63 genomes
     •  2011: Ensembl Genomes v12: 335 genomes


     •  Ensembl: EBI & Wellcome Trust Sanger Institute
     •  Ensembl Genomes: EBI




                                                                      © John Freebrey

Species Ensembl

             Primates
          Rodents etc.
        Laurasiatheria
            Afrotheria
             Xenarthra
      Other mammals
       Birds & reptiles
           Amphibians
                  Fish
      Other chordates
     Other eukaryotes
     On Pre! Ensembl

Species Ensembl Genomes




Annotation

     •  Inclusion of species depends on various criteria (model organism?
        community interest / demand? funding? completeness / quality of
        genome assembly?)
     •  A broad taxonomic coverage is aimed for



     •  Annotation in-house by the Ensembl team


     •  Annotation preferably by or in collaboration with the scientific
        community for the species in question




Ensembl genebuild




     Genome assembly + experimental evidence (cDNAs & proteins)
        -> Genebuild pipeline
        -> Ensembl Genes




Data

     •  Genomic sequence
     •  Gene/transcript/protein models
     •  External references
     •  Mapped cDNAs, proteins, microarray probes, BACs, cytogenetic
        bands, markers, repeats etc.
     •  Comparative data: orthologs and paralogs, protein families, whole
        genome alignments, syntenic regions
     •  Variation data: sequence variants, structural variants
     •  Regulatory data: “best guess” set of regulatory elements




Access to data

     •  Web browser                    http://www.ensembl.org
                                       (with US West, US East and Asia mirrors
                                       and Pre! and Archive! sites)
                                       http://www.ensemblgenomes.org

     •  BioMart                        http://www.biomart.org

     •  FTP                            ftp.ensembl.org/pub
                                       ftp.ensemblgenomes.org/pub

     •  Public MySQL server            ensembldb.ensembl.org:5306:anonymous
                                       mysql.ebi.ac.uk:4157:anonymous

     •  Ensembl API                    http://www.ensembl.org/info/docs/api
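
The public MySQL servers listed above can be reached with any MySQL client. The following is a minimal sketch, assuming that the slide's notation reads host:port:user and that no password is needed for the anonymous user. The helper names are hypothetical; `-h`, `-P` and `-u` are the standard `mysql` client options for host, port and user.

```python
def parse_server(spec):
    """Split a 'host:port:user' string as written on the slide (assumed layout)."""
    host, port, user = spec.split(":")
    return {"host": host, "port": int(port), "user": user}


def mysql_command(spec):
    """Build the argument list for the standard mysql command-line client."""
    server = parse_server(spec)
    return [
        "mysql",
        "-h", server["host"],
        "-P", str(server["port"]),
        "-u", server["user"],
    ]


if __name__ == "__main__":
    for spec in ("ensembldb.ensembl.org:5306:anonymous",
                 "mysql.ebi.ac.uk:4157:anonymous"):
        # Print a command line that can be pasted into a shell.
        print(" ".join(mysql_command(spec)))
```

The sketch only builds the command line and does not open a connection, so it runs without network access.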



Highlights in 2011


     •    Genebuilds for turkey and cod
     •    Genebuild on new cow assembly (UMD 3.1)
     •    Added rabbit to whole-genome multiple alignments
     •    3-way avian whole-genome alignment and constrained elements
          (chicken, turkey, zebra finch)
     •    Variation db for cat (dbSNP127)
     •    Updated variation data for cow (dbSNP133), dog (DGVa), pig (Illumina
          PorcineSNP60 Bead Chip, DGVa)
     •    Improved Variant Effect Predictor (VEP) and failed variation pipeline
     •    Sortable tracks, saving of configurations and configuration sets
     •    Support for large file formats (BAM, BigWig, VCF)


Highlights in 2011


     •    31 new species
     •    Plants: Chlamydomonas reinhardtii, Cyanidioschyzon merolae, Glycine
          max, Oryza glaberrima, Selaginella moellendorffii
     •    Fungal plant pathogens: Ashbya gossypii, Fusarium oxysporum,
          Gibberella moniliformis, Gibberella zeae, Mycosphaerella graminicola,
          Nectria haematococca, Phaeosphaeria nodorum, Puccinia triticina,
          Ustilago maydis
     •    Oomycete plant pathogens: Phytophthora infestans, Phytophthora
          ramorum, Phytophthora sojae, Pythium ultimum
     •    Active collaborations within PhytoPath (http://www.phytopathdb.org/)
          and PomBase projects
     •    Variation db for Arabidopsis thaliana contains over 14 million variants
          from over 1600 strains

Demo 1 - Browser basics
  Background:
  The CYN gene encodes cyanate hydratase,
  an enzyme found in bacteria and plants that
  catalyses the reaction of cyanate with
  bicarbonate to produce ammonia and
  carbon dioxide:

  NCO- + HCO3- + 2H+ = NH3 + 2CO2

  Task:
  Explore the CYN gene of Vitis vinifera
  (grape).




Variant Effect Predictor (VEP)

     •  Predicts functional consequences of variants on Ensembl genes
     •  Web interface, standalone Perl script and Perl API
     •  Accepts tab-delimited, VCF and pileup format as input




Demo 2 - Variant Effect Predictor
  Background:
  Variants in the bestrophin 1 (BEST1) gene are
  associated with various retinal disorders in man.
  Dog is used as a model to study these. The
  following are a number of new variants discovered
  in the BEST1 gene of a Lapponian Herder:
  chr   start         end          alleles strand

  18    57500034      57500034     A/G     +
  18    57500028      57500028     G/T     +
  18    57500027      57500027     G/T     +
  18    57499959      57499958     -/C     +
  18    57499929      57499929     G/T     +
  18    57499981      57499981     G/T     +
  18    57499834      57499834     A/T     +
  18    57449754      57449754     C/T     +


  Task:
  Determine the effect of the variants on dog BEST1.
                                                       © Royal Canin
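
Before submitting the variants to the VEP it can help to check that each line is well formed. The following is a minimal sketch, assuming the whitespace-separated columns shown above (chr, start, end, alleles, strand) and the Ensembl convention that an insertion is written with start = end + 1 and "-" as the reference allele. The function name and the "kind" labels are hypothetical; the code does not call the VEP itself.

```python
# The eight variants from the demo, in the column order chr, start, end, alleles, strand.
VARIANTS = """\
18 57500034 57500034 A/G +
18 57500028 57500028 G/T +
18 57500027 57500027 G/T +
18 57499959 57499958 -/C +
18 57499929 57499929 G/T +
18 57499981 57499981 G/T +
18 57499834 57499834 A/T +
18 57449754 57449754 C/T +
"""


def parse_variant(line):
    """Parse one input line and label it as SNV, insertion, deletion or other."""
    chrom, start, end, alleles, strand = line.split()
    start, end = int(start), int(end)
    ref, alt = alleles.split("/")
    if ref == "-" and start == end + 1:
        kind = "insertion"          # insertion: start is one past end
    elif alt == "-" and ref != "-":
        kind = "deletion"
    elif start == end and len(ref) == 1 and len(alt) == 1:
        kind = "SNV"
    else:
        kind = "other"
    return {"chr": chrom, "start": start, "end": end,
            "ref": ref, "alt": alt, "strand": strand, "kind": kind}


if __name__ == "__main__":
    for line in VARIANTS.splitlines():
        variant = parse_variant(line)
        print(variant["chr"], variant["start"], variant["end"], variant["kind"])
```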

Adding custom tracks

     •  Upload data to Ensembl (5 MB size limit) or attach file on web-
        accessible server (http or ftp) to Ensembl (no size limit)
     •  Possible formats:

          BAM                          sequence alignments (no upload)
          BED                          genes / features
          BedGraph                     continuous-valued data
          BigWig                       continuous-valued data (no upload)
          GBrowse                      genes / features
          GFF                          genes / features
          GTF                          genes / features
          PSL                          sequence alignments
          VCF                          variants (no upload)
          WIG                          continuous-valued data
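
The rules above (5 MB upload limit, no upload for BAM, BigWig and VCF) can be sketched as a small decision helper. This is an illustration under stated assumptions, not part of Ensembl: the function name is hypothetical, and 5 MB is taken to mean 5 * 1024 * 1024 bytes.

```python
# Assumption: the 5 MB limit is interpreted as 5 * 1024 * 1024 bytes.
UPLOAD_LIMIT_BYTES = 5 * 1024 * 1024

# Formats from the table above, upper-cased for case-insensitive lookup.
SUPPORTED = {"BAM", "BED", "BEDGRAPH", "BIGWIG", "GBROWSE",
             "GFF", "GTF", "PSL", "VCF", "WIG"}

# Formats marked "(no upload)" in the table; these must be attached by URL.
ATTACH_ONLY = {"BAM", "BIGWIG", "VCF"}


def choose_method(file_format, size_bytes):
    """Return 'upload' or 'attach' for a custom track file."""
    fmt = file_format.upper()
    if fmt not in SUPPORTED:
        raise ValueError(f"unsupported track format: {file_format}")
    if fmt in ATTACH_ONLY or size_bytes > UPLOAD_LIMIT_BYTES:
        return "attach"
    return "upload"


if __name__ == "__main__":
    print(choose_method("BED", 200_000))       # small feature file
    print(choose_method("BAM", 200_000))       # indexed format, attach only
    print(choose_method("WIG", 50_000_000))    # too large to upload
```

Small files in the other formats could also be attached by URL; the helper returns "upload" for them only because that needs no web server.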



Demo 3 - Adding custom tracks
  Background:
  The file SRR070570.bam contains alignments
  of Illumina RNAseq reads from a wildtype
  Arabidopsis thaliana strain.
  The bam file and its bam.bai index file are
  located at http://www.ebi.ac.uk/~bert/.

  Task:
  Attach SRR070570.bam to Ensembl Genomes.
  Check the expression of a constitutive and a
  non-constitutive Arabidopsis gene, e.g.
  RBCS1A (ribulose bisphosphate carboxylase
  small chain 1A) and PR1 (pathogenesis-related
  protein 1).
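
An attached BAM file needs its index next to it on the server. The following is a minimal sketch of how the two URLs relate, assuming the index is named by appending ".bai" to the BAM file name, as described in the background above. The function name is hypothetical.

```python
from urllib.parse import urljoin


def bam_and_index_urls(base_url, bam_name):
    """Return the URL of a BAM file and of its .bai index in the same directory."""
    bam_url = urljoin(base_url, bam_name)
    return bam_url, bam_url + ".bai"


if __name__ == "__main__":
    bam_url, index_url = bam_and_index_urls("http://www.ebi.ac.uk/~bert/",
                                            "SRR070570.bam")
    print(bam_url)
    print(index_url)
```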

BioMart

     •  Data retrieval tool
     •  Originally developed for Ensembl (EnsMart)
     •  Now used by many large data resources
     •  Integrated with several widely used software packages
     •  Joint project between the European Bioinformatics Institute (EBI)
        and the Ontario Institute for Cancer Research (OICR)
     •  Website : http://www.biomart.org




Principle

     •  Step 1 – Dataset
        Choose your dataset

     •  Step 2 – Filters
        Limit your dataset

     •  Step 3 – Attributes
        Specify what information you want to output

     •  Step 4 – Results
        Preview and output your results




Demo 4 - BioMart
  Background:
  “Lactation” (GO:0007595) is the
  Gene Ontology (GO) term for the
  biological process of “the secretion
  of milk by the mammary gland”.

  Task:
  Retrieve all cow genes that are
  annotated with the GO term
  “lactation”.
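
The dataset, filter and attribute steps of a BioMart query map onto BioMart's XML query format, which can also be built outside the web interface. The following is a minimal sketch using Python's standard library. The dataset name (`btaurus_gene_ensembl`), the filter name (`go_id`) and the attribute names are assumptions; they differ between releases and should be checked against the filters and attributes listed in the BioMart interface.

```python
import xml.etree.ElementTree as ET


def build_biomart_query(dataset, filters, attributes):
    """Build a BioMart XML query string from a dataset, filters and attributes."""
    query = ET.Element("Query", {
        "virtualSchemaName": "default",
        "formatter": "TSV",
        "header": "0",
        "uniqueRows": "1",
        "datasetConfigVersion": "0.6",
    })
    # Step 1: choose the dataset.
    dataset_el = ET.SubElement(query, "Dataset",
                               {"name": dataset, "interface": "default"})
    # Step 2: limit the dataset with filters.
    for name, value in filters.items():
        ET.SubElement(dataset_el, "Filter", {"name": name, "value": value})
    # Step 3: specify the attributes to output.
    for name in attributes:
        ET.SubElement(dataset_el, "Attribute", {"name": name})
    return ET.tostring(query, encoding="unicode")


if __name__ == "__main__":
    # Assumed names; verify them in the BioMart interface before use.
    xml_query = build_biomart_query(
        dataset="btaurus_gene_ensembl",
        filters={"go_id": "GO:0007595"},
        attributes=["ensembl_gene_id", "external_gene_id"],
    )
    print(xml_query)
```

Step 4 (results) would consist of submitting this XML to a BioMart server; that part is left out so that the sketch runs offline.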




Future plans for 2012


     •  Genebuilds for duck (?), salmon (?), sheep (?), tilapia
     •  Genebuilds on new assemblies for cat (Felis_catus-6.2), chicken
        (Gallus_gallus-4.0), dog (CanFam3.1), pig (Sscrofa10.2)
     •  Include RNAseq data in genebuild
     •  VEP support for structural variants
     •  New BLAST/BLAT interface
     •  http://www.ensembl.info/roadmap




Future plans for 2012


     •  New species: barley, Brassica (from BrassEnsembl), foxtail millet,
        Oryza brachyantha, potato, tomato, Gaeumannomyces graminis,
        Magnaporthe oryzae, Magnaporthe poae, tsetse fly
     •  New assemblies: maize (B73_RefGen_v3), Oryza sativa ssp.
        japonica cv. Nipponbare (Os-Nipponbare-Reference-IRGSP-1.0;
        IRGSP1.0), poplar
     •  Variation db and new gene annotation for wheat stem rust pathogen
     •  New query interface for data on plant-fungal pathogen interactions
        (PhytoPath; http://www.phytopathdb.org/)
     •  Widened development of community annotation pipelines




Help

     •  Helpdesk:
          helpdesk@ensembl.org
          helpdesk@ensemblgenomes.org

     •  Mailing lists:
          http://www.ensembl.org/info/about/contact/mailing.html
          http://plants.ensembl.org/info/about/contact/mailing.html

     •  Ensembl YouTube and YouKu channels:
          http://www.youtube.com/user/EnsemblHelpdesk
          http://u.youku.com/user_show/uid_Ensemblhelpdesk




EBI Train online




      http://www.ebi.ac.uk/training/online/course/ensembl-browsing-chordate-genomes

Workshops




     •  Until now: 49 countries on 5 continents
     •  In 2011: ~90 workshops

     •  Browser (0.5-2 days) and API (1-3 days) workshops
     •  Combination of lectures and hands-on exercises
     •  Advertised on http://www.ensembl.info/workshops/calendar/


     •  You can host your own workshop!
     •  For academic institutions there is, apart from the instructor’s
        expenses, no fee
     •  You only need a computer room and participants
     •  You can get more info from me (bert@ebi.ac.uk) or at the EBI booth
        (302)




Stay in touch

     •  Blog:
          http://www.ensembl.info

     •  Facebook:
          http://www.facebook.com/Ensembl.org

     •  Twitter:
          http://twitter.com/Ensembl




Acknowledgements



      •  WTSI                          •    CADRE                  •  OICR
                                       •    Gramene
                                       •    VectorBase
                                       •    WormBase
                                       •    PomBase

      •    Wellcome Trust              •  EMBL
      •    NIH-NHGRI                   •  BBSRC
      •    EMBL                        •  Wellcome Trust
      •    EU                          •  Bill and Melinda Gates
                                          Foundation
                                       •  EU



Acknowledgements

Paul Flicek, Ridwan Amode, Daniel Barrell, Kathryn Beal, Simon Brent, Denise Carvalho-Silva,
Clapham P, Guy Coates, Susan Fairley, Stephen Fitzgerald, Laurent Gil, Leo Gordon, Maurice
Hendrix, Thibaut Hourlier, Nathan Johnson, Andreas Kähäri, Damian Keefe, Stephen Keenan,
Rhoda Kinsella, Monika Komorowska, Gautier Koscielny, Eugene Kulesha, Pontus Larsson,
Ian Longden, Will McLaren, Matthieu Muffato, Bert Overduin, Miguel Pignatelli, Bethan
Pritchard, Harpreet Riat, Graham Ritchie, Magali Ruffier, Michael Schuster, Daniel Sobral,
Amy Tang, Kieron Taylor, Stephen Trevanion, Jana Vandrovcova, Simon White, Mark Wilson,
Steven Wilder, Bronwen Aken, Ewan Birney, Fiona Cunningham, Ian Dunham, Richard Durbin,
Xosé Fernández-Suarez, Jennifer Harrow, Javier Herrero, Tim Hubbard, Anne Parker, Glenn
Proctor, Giulietta Spudich, Jan Vogel, Andy Yates, Amonida Zadissa, Steve Searle

Paul Kersey, Dan Staines, Dan Lawson, Eugene Kulesha, Paul Derwent, Jay Humphrey,
Daniel Hughes, Stephen Keenan, Arnaud Kerhornou, Gautier Koscielny, Nick Langridge, Mark
McDowall, Karyn Megy, Uma Maheswari, Michael Nuhn, Michael Paulini, Helder Pedro, Iliana
Toneva, Derek Wilson, Andy Yates, Ewan Birney
Posters

      P941
      Genome Annotation in Ensembl
      Susan Fairley

      P942
      Ensembl Plants: An Integrating Resource for Plant
      Genomics and Variation
      Paul Kersey




Come and see us at booth 302!

     •  Training courses
     •  Careers
     •  Meet the experts
     •  Brochures and factsheets
     •  PhD and postdoc opportunities
     •  Industry programme
     •  Research and services
     •  Visitor’s programme
PDF of this presentation

      http://www.ebi.ac.uk/~bert/past_workshops.html





TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Último (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes

  • 1. Genome Resources at the EBI - Ensembl and Ensembl Genomes
      Bert Overduin, Ph.D.
      PAG XX, January 15th 2012, San Diego. EBI Database Workshop.
      EBI is an Outstation of the European Molecular Biology Laboratory.
  • 2. Outline
      •  Introduction to Ensembl / Ensembl Genomes
      •  Highlights in 2011
      •  Demo 1: Browser basics
      •  Demo 2: Variant Effect Predictor
      •  Demo 3: Adding custom tracks
      •  Demo 4: BioMart
      •  Future plans for 2012
      •  Help & Workshops
      •  Acknowledgements
  • 3. Goal
      To provide access to genome-scale data from completely sequenced species of scientific interest from across the taxonomy
  • 4. History
      •  1999: Start of Ensembl project for the Human Genome Project
      •  2000: First release of data and web interface
      •  2009: First release of Ensembl Genomes
      •  2011: Ensembl v65: 63 genomes
      •  2011: Ensembl Genomes v12: 335 genomes
      •  Ensembl: EBI & Wellcome Trust Sanger Institute
      •  Ensembl Genomes: EBI
      © John Freebrey
  • 5. Species Ensembl
      •  Primates
      •  Rodents etc.
      •  Laurasiatheria
      •  Afrotheria
      •  Xenarthra
      •  Other mammals
      •  Birds & reptiles
      •  Amphibians
      •  Fish
      •  Other chordates
      •  Other eukaryotes
      •  On Pre! Ensembl
  • 6. Species Ensembl Genomes
  • 7. Annotation
      •  Inclusion of species depends on various criteria (model organism? community interest / demand? funding? completeness / quality of genome assembly?)
      •  A broad taxonomic coverage is aimed for
      •  Annotation in-house by the Ensembl team
      •  Annotation preferably by or in collaboration with the scientific community for the species in question
  • 8. Ensembl genebuild
      Genome assembly + experimental evidence (cDNAs & proteins) → Genebuild pipeline → Ensembl genes
  • 9. Data
      •  Genomic sequence
      •  Gene/transcript/protein models
      •  External references
      •  Mapped cDNAs, proteins, microarray probes, BACs, cytogenetic bands, markers, repeats etc.
      •  Comparative data: orthologs and paralogs, protein families, whole genome alignments, syntenic regions
      •  Variation data: sequence variants, structural variants
      •  Regulatory data: “best guess” set of regulatory elements
  • 10. Access to data
      •  Web browser
           http://www.ensembl.org (with US West, US East and Asia mirrors and Pre! and Archive! sites)
           http://www.ensemblgenomes.org
      •  BioMart
           http://www.biomart.org
      •  FTP
           ftp.ensembl.org/pub
           ftp.ensemblgenomes.org/pub
      •  Public MySQL server
           ensembldb.ensembl.org:5306:anonymous
           mysql.ebi.ac.uk:4157:anonymous
      •  Ensembl API
           http://www.ensembl.org/info/docs/api
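
The public MySQL servers above are listed as host:port:user triples. A minimal sketch of how such a triple could be split and turned into an argument list for the standard `mysql` command-line client follows. The helper names (`MySQLEndpoint`, `parse_endpoint`, `mysql_command`) are illustrative and not part of any Ensembl tool, and whether these hosts and ports are still served today is not something the slides confirm.

```python
from typing import List, NamedTuple


class MySQLEndpoint(NamedTuple):
    """One public server entry, as written on the slide (host:port:user)."""
    host: str
    port: int
    user: str


def parse_endpoint(spec: str) -> MySQLEndpoint:
    """Split a 'host:port:user' string into its three parts."""
    host, port, user = spec.split(":")
    return MySQLEndpoint(host, int(port), user)


def mysql_command(endpoint: MySQLEndpoint) -> List[str]:
    """Build an argument list for the mysql client (-h host, -P port, -u user)."""
    return ["mysql", "-h", endpoint.host, "-P", str(endpoint.port), "-u", endpoint.user]


# The two entries copied verbatim from the slide.
ENDPOINTS = [
    parse_endpoint("ensembldb.ensembl.org:5306:anonymous"),
    parse_endpoint("mysql.ebi.ac.uk:4157:anonymous"),
]

if __name__ == "__main__":
    for endpoint in ENDPOINTS:
        print(" ".join(mysql_command(endpoint)))
```

The argument list can be passed to `subprocess.run` on a machine where the `mysql` client is installed; the sketch itself makes no network connection.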
  • 11. Highlights in 2011
      •  Genebuilds for turkey and cod
      •  Genebuild on new cow assembly (UMD 3.1)
      •  Added rabbit to whole-genome multiple alignments
      •  3-way avian whole-genome alignment and constrained elements (chicken, turkey, zebra finch)
      •  Variation db for cat (dbSNP127)
      •  Updated variation data for cow (dbSNP133), dog (DGVa), pig (Illumina PorcineSNP60 Bead Chip, DGVa)
      •  Improved Variant Effect Predictor (VEP) and failed variation pipeline
      •  Sortable tracks, saving of configurations and configuration sets
      •  Support for large file formats (BAM, BigWig, VCF)
  • 12. Highlights in 2011
      •  31 new species
      •  Plants: Chlamydomonas reinhardtii, Cyanidioschyzon merolae, Glycine max, Oryza glaberrima, Selaginella moellendorffii
      •  Fungal plant pathogens: Ashbya gossypii, Fusarium oxysporum, Gibberella moniliformis, Gibberella zeae, Mycosphaerella graminicola, Nectria haematococca, Phaeosphaeria nodorum, Puccinia triticina, Ustilago maydis
      •  Oomycete plant pathogens: Phytophthora infestans, Phytophthora ramorum, Phytophthora sojae, Pythium ultimum
      •  Active collaborations within PhytoPath (http://www.phytopathdb.org/) and PomBase projects
      •  Variation db for Arabidopsis thaliana contains over 14 million variants from over 1600 strains
  • 13. Demo 1 - Browser basics
      Background: The CYN gene encodes cyanate hydratase, an enzyme found in bacteria and plants that catalyses the reaction of cyanate with bicarbonate to produce ammonia and carbon dioxide:
           NCO- + HCO3- + 2H+ = NH3 + 2CO2
      Task: Explore the CYN gene of Vitis vinifera (grape).
  • 14. Variant Effect Predictor (VEP)
      •  Predicts functional consequences of variants on Ensembl genes
      •  Web interface, standalone Perl script and Perl API
      •  Accepts tab-delimited, VCF and pileup format as input
  • 15. Demo 2 - Variant Effect Predictor
      Background: Variants in the bestrophin 1 (BEST1) gene are associated with various retinal disorders in man. Dog is used as a model to study these. The following are a number of new variants discovered in the BEST1 gene of a Lapponian Herder:

           chr  start     end       alleles  strand
           18   57500034  57500034  A/G      +
           18   57500028  57500028  G/T      +
           18   57500027  57500027  G/T      +
           18   57499959  57499958  -/C      +
           18   57499929  57499929  G/T      +
           18   57499981  57499981  G/T      +
           18   57499834  57499834  A/T      +
           18   57449754  57449754  C/T      +

      Task: Determine the effect of the variants on dog BEST1.
      © Royal Canin
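
Before pasting the variants above into the VEP, it can help to check that each row is well formed. The sketch below parses the whitespace-delimited rows from the slide and labels each one. It relies on the convention of the Ensembl default input format that an insertion is written with "-" as the first allele and start = end + 1, which matches the `-/C` row. The function names and the "SNV / insertion / deletion / other" labels are illustrative only; this does not predict consequences, which is the VEP's job.

```python
from collections import Counter
from typing import Dict, List

# Rows copied verbatim from the slide: chr, start, end, alleles, strand.
VARIANTS = """\
18 57500034 57500034 A/G +
18 57500028 57500028 G/T +
18 57500027 57500027 G/T +
18 57499959 57499958 -/C +
18 57499929 57499929 G/T +
18 57499981 57499981 G/T +
18 57499834 57499834 A/T +
18 57449754 57449754 C/T +
"""


def parse_variant(line: str) -> Dict[str, object]:
    """Parse one row and attach a simple variant-type label."""
    chrom, start_s, end_s, alleles, strand = line.split()
    start, end = int(start_s), int(end_s)
    ref, alt = alleles.split("/")
    if ref == "-" and start == end + 1:
        kind = "insertion"   # nothing replaced; bases inserted between end and start
    elif alt == "-":
        kind = "deletion"
    elif len(ref) == 1 and len(alt) == 1 and start == end:
        kind = "SNV"
    else:
        kind = "other"
    return {"chr": chrom, "start": start, "end": end,
            "ref": ref, "alt": alt, "strand": strand, "kind": kind}


def parse_variants(text: str) -> List[Dict[str, object]]:
    """Parse every non-empty row of a block of variant lines."""
    return [parse_variant(line) for line in text.splitlines() if line.strip()]


PARSED = parse_variants(VARIANTS)
KIND_COUNTS = Counter(v["kind"] for v in PARSED)

if __name__ == "__main__":
    for variant in PARSED:
        print(variant)
```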
  • 16. Adding custom tracks
      •  Upload data to Ensembl (5 MB size limit) or attach file on web-accessible server (http or ftp) to Ensembl (no size limit)
      •  Possible formats:

           BAM       sequence alignments (no upload)
           BED       genes / features
           BedGraph  continuous-valued data
           BigWig    continuous-valued data (no upload)
           GBrowse   genes / features
           GFF       genes / features
           GTF       genes / features
           PSL       sequence alignments
           VCF       variants (no upload)
           WIG       continuous-valued data
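
The upload-or-attach rule above can be written down as a small decision helper. This is only a sketch of the rule as stated on the slide: formats marked "no upload" must be attached from a web-accessible server, and anything over the 5 MB limit must be attached as well. Reading "5 MB" as 5 × 1024 × 1024 bytes is an assumption, and `delivery_method` is a hypothetical name, not an Ensembl function.

```python
# Formats listed on the slide; those marked "(no upload)" can only be attached by URL.
SUPPORTED_FORMATS = {
    "BAM", "BED", "BedGraph", "BigWig", "GBrowse",
    "GFF", "GTF", "PSL", "VCF", "WIG",
}
ATTACH_ONLY_FORMATS = {"BAM", "BigWig", "VCF"}

# Assumption: the slide's "5 MB" limit is interpreted as 5 MiB.
UPLOAD_LIMIT_BYTES = 5 * 1024 * 1024


def delivery_method(file_format: str, size_bytes: int) -> str:
    """Return 'upload' or 'attach' for a custom track file."""
    if file_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {file_format}")
    if file_format in ATTACH_ONLY_FORMATS or size_bytes > UPLOAD_LIMIT_BYTES:
        return "attach"
    return "upload"


if __name__ == "__main__":
    print(delivery_method("BED", 200_000))        # small BED file
    print(delivery_method("BAM", 200_000))        # BAM is attach-only
    print(delivery_method("WIG", 50 * 1024 * 1024))  # too large to upload
```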
  • 17. Demo 3 - Adding custom tracks
      Background: The file SRR070570.bam contains alignments of Illumina RNAseq reads from a wildtype Arabidopsis thaliana strain. The bam file and its bam.bai index file are located at http://www.ebi.ac.uk/~bert/.
      Task: Attach SRR070570.bam to Ensembl Genomes. Check the expression of a constitutive and a non-constitutive Arabidopsis gene, e.g. RBCS1A (ribulose bisphosphate carboxylase small chain 1A) and PR1 (pathogenesis-related protein 1).
  • 18. BioMart
      •  Data retrieval tool
      •  Originally developed for Ensembl (EnsMart)
      •  Now used by many large data resources
      •  Integrated with several widely used software packages
      •  Joint project between the European Bioinformatics Institute (EBI) and the Ontario Institute for Cancer Research (OICR)
      •  Website: http://www.biomart.org
  • 19. Principle
      •  Step 1 – Dataset: Choose your dataset
      •  Step 2 – Filters: Limit your dataset
      •  Step 3 – Attributes: Specify what information you want to output
      •  Step 4 – Results: Preview and output your results
  • 20. Demo 4 - BioMart
      Background: “Lactation” (GO:0007595) is the Gene Ontology (GO) term for the biological process of “the secretion of milk by the mammary gland”.
      Task: Retrieve all cow genes that are annotated with the GO term “lactation”.
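
The dataset / filters / attributes steps map directly onto the XML query document that BioMart accepts. The sketch below builds such a document for the demo task with Python's standard library. The GO accession comes from the slide; the dataset name `btaurus_gene_ensembl`, the filter name `go_parent_term` and the attribute names are assumptions that vary between Ensembl releases and should be checked in the BioMart web interface. `build_query` is an illustrative helper, and the sketch only constructs the query text without sending it anywhere.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List


def build_query(dataset: str, filters: Dict[str, str], attributes: List[str]) -> str:
    """Build a BioMart XML query: one dataset, its filters, its attributes."""
    query = ET.Element("Query", {
        "virtualSchemaName": "default",
        "formatter": "TSV",   # Step 4: tab-separated results
        "header": "0",
        "uniqueRows": "1",
    })
    # Step 1: choose the dataset.
    dataset_el = ET.SubElement(query, "Dataset", {"name": dataset, "interface": "default"})
    # Step 2: limit the dataset with filters.
    for name, value in filters.items():
        ET.SubElement(dataset_el, "Filter", {"name": name, "value": value})
    # Step 3: specify the output columns.
    for name in attributes:
        ET.SubElement(dataset_el, "Attribute", {"name": name})
    return ET.tostring(query, encoding="unicode")


# Assumed names (not from the slides): dataset, filter and attribute identifiers.
QUERY_XML = build_query(
    dataset="btaurus_gene_ensembl",
    filters={"go_parent_term": "GO:0007595"},   # "lactation", from the slide
    attributes=["ensembl_gene_id", "external_gene_name"],
)

if __name__ == "__main__":
    print(QUERY_XML)
```

The same query can be assembled interactively in the BioMart web interface, which also offers an XML view of the current selection; that is a convenient way to confirm the filter and attribute names for a given release.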
  • 21. Future plans for 2012
      •  Genebuilds for duck (?), salmon (?), sheep (?), tilapia
      •  Genebuilds on new assemblies for cat (Felis_catus-6.2), chicken (Gallus_gallus-4.0), dog (CanFam3.1), pig (Sscrofa10.2)
      •  Include RNAseq data in genebuild
      •  VEP support for structural variants
      •  New BLAST/BLAT interface
      •  http://www.ensembl.info/roadmap
  • 22. Future plans for 2012
      •  New species: barley, Brassica (from BrassEnsembl), foxtail millet, Oryza brachyantha, potato, tomato, Gaeumannomyces graminis, Magnaporthe oryzae, Magnaporthe poae, tsetse fly
      •  New assemblies: maize (B73_RefGen_v3), Oryza sativa ssp. japonica cv. Nipponbare (Os-Nipponbare-Reference-IRGSP-1.0; IRGSP1.0), poplar
      •  Variation db and new gene annotation for wheat stem rust pathogen
      •  New query interface for data on plant-fungal pathogen interactions (PhytoPath; http://www.phytopathdb.org/)
      •  Widened development of community annotation pipelines
  • 23. Help
      •  Helpdesk:
           helpdesk@ensembl.org
           helpdesk@ensemblgenomes.org
      •  Mailing lists:
           http://www.ensembl.org/info/about/contact/mailing.html
           http://plants.ensembl.org/info/about/contact/mailing.html
      •  Ensembl YouTube and YouKu channels:
           http://www.youtube.com/user/EnsemblHelpdesk
           http://u.youku.com/user_show/uid_Ensemblhelpdesk
  • 24. EBI Train online
      http://www.ebi.ac.uk/training/online/course/ensembl-browsing-chordate-genomes
  • 25. Workshops until now: in 2011: 49 countries on 5 continents ~ 90 workshops
  • 26. Workshops
      •  Browser (0.5-2 days) and API (1-3 days) workshops
      •  Combination of lectures and hands-on exercises
      •  Advertised on http://www.ensembl.info/workshops/calendar/
      •  You can host your own workshop!
      •  For academic institutions there is, apart from the instructor’s expenses, no fee
      •  You only need a computer room and participants
      •  You can get more info from me (bert@ebi.ac.uk) or at the EBI booth (302)
  • 27. Stay in touch
      •  Blog: http://www.ensembl.info
      •  Facebook: http://www.facebook.com/Ensembl.org
      •  Twitter: http://twitter.com/Ensembl
  • 28. Acknowledgements •  WTSI •  CADRE •  OICR •  Gramene •  VectorBase •  WormBase •  PomBase •  Wellcome Trust •  EMBL •  NIH-NHGRI •  BBSRC •  EMBL •  Wellcome Trust •  EU •  Bill and Melinda Gates Foundation •  EU
  • 29. Acknowledgements
      Paul Flicek, Ridwan Amode, Daniel Barrell, Kathryn Beal, Simon Brent, Denise Carvalho-Silva, Clapham P, Guy Coates, Susan Fairley, Stephen Fitzgerald, Laurent Gil, Leo Gordon, Maurice Hendrix, Thibaut Hourlier, Nathan Johnson, Andreas Kähäri, Damian Keefe, Stephen Keenan, Rhoda Kinsella, Monika Komorowska, Gautier Koscielny, Eugene Kulesha, Pontus Larsson, Ian Longden, Will McLaren, Matthieu Muffato, Bert Overduin, Miguel Pignatelli, Bethan Pritchard, Harpreet Riat, Graham Ritchie, Magali Ruffier, Michael Schuster, Daniel Sobral, Amy Tang, Kieron Taylor, Stephen Trevanion, Jana Vandrovcova, Simon White, Mark Wilson, Steven Wilder, Bronwen Aken, Ewan Birney, Fiona Cunningham, Ian Dunham, Richard Durbin, Xosé Fernández-Suarez, Jennifer Harrow, Javier Herrero, Tim Hubbard, Anne Parker, Glenn Proctor, Giulietta Spudich, Jan Vogel, Andy Yates, Amonida Zadissa, Steve Searle

      Paul Kersey, Dan Staines, Dan Lawson, Eugene Kulesha, Paul Derwent, Jay Humphrey, Daniel Hughes, Stephen Keenan, Arnaud Kerhornou, Gautier Koscielny, Nick Langridge, Mark McDowall, Karyn Megy, Uma Maheswari, Michael Nuhn, Michael Paulini, Helder Pedro, Iliana Toneva, Derek Wilson, Andy Yates, Ewan Birney
  • 30. Posters
      •  P941: Genome Annotation in Ensembl (Susan Fairley)
      •  P942: Ensembl Plants: An Integrating Resource for Plant Genomics and Variation (Paul Kersey)
  • 31. Come and see us at booth 302!
      •  Training courses
      •  Careers
      •  Meet the experts
      •  Brochures and factsheets
      •  PhD and post doc opportunities
      •  Industry programme
      •  Research and services
      •  Visitor’s programme
  • 32. PDF of this presentation
      http://www.ebi.ac.uk/~bert/past_workshops.html