Ensembl Plants:
Visualising, mining and analysing crop
genomics data
Dan Bolser
Ensembl Plants project leader
EMBL-EBI
http://plants.ensembl.org
#EnsemblGenomes
Overview
Background:
● Ensembl Plants
● History
● Data
● Recent updates
● Wheat
● Barley
Visualising, mining and analysing data:
● The Ensembl genome browser
● BioMart
● Tools for processing your own data
Ensembl is developed jointly by the EBI and the Wellcome Trust Sanger Institute
Ensembl Plants uses Ensembl technology
Ensembl:
● A platform for genome browsing, annotation and analysis
developed jointly by the EBI and Wellcome Trust Sanger Institute.
● Has modules for handling:
● Genomic data, Variations, Comparative genomics, Gene prediction, ...
● Multiple points of access to data:
● Browser-based application, Perl and REST APIs, direct access
(MySQL), BioMart data mining tool, DAS (client and server), FTP.
● Upload your own data and compare it to the reference sequence and annotation.
Ensembl was originally developed for vertebrate genomes and was subsequently
extended to non-vertebrate species:
● Ensembl Genomes → Ensembl Plants
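As an illustration of the REST access point listed above, here is a minimal Python sketch that only builds request URLs. The server address (rest.ensembl.org), the example species and the gene symbol are assumptions that do not appear on the slides; check the current REST documentation before relying on them.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen

# Assumption: the public Ensembl REST server (not stated on the slides).
REST_SERVER = "https://rest.ensembl.org"


def lookup_symbol_url(species: str, symbol: str) -> str:
    """Build a URL that looks up a gene by symbol in one species."""
    path = f"/lookup/symbol/{quote(species)}/{quote(symbol)}"
    query = urlencode({"content-type": "application/json"})
    return f"{REST_SERVER}{path}?{query}"


def species_list_url(division: str = "EnsemblPlants") -> str:
    """Build a URL that lists the species held in one Ensembl division."""
    query = urlencode({"division": division,
                       "content-type": "application/json"})
    return f"{REST_SERVER}/info/species?{query}"


def fetch_json(url: str, timeout: float = 30.0):
    """Fetch a URL and decode the JSON body (needs network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Hypothetical example values, chosen only for illustration.
example_url = lookup_symbol_url("arabidopsis_thaliana", "PHYB")
```

Calling `fetch_json(example_url)` would perform the actual request; it is kept separate so that the URL construction can be inspected offline.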
Currently 33 genomes in
Ensembl Plants
http://plants.ensembl.org
Dicots in Ensembl Plants (10)
● Brassicales
● Fabales
● Malpighiales
● Rosales
● Solanales
● Vitales
Monocots in Ensembl Plants (12+5)
● Poales
● Zingiberales
'Others' (5)
Types of data in Ensembl (Ensembl Plants)
● Genomic sequence
● Gene, transcript, and protein annotations
● External references and ontology terms
● Mapped sequences: cDNAs, proteins,
probes, BACs, repeats, markers, ...
● Variation data:
● sequence variants
● structural variants
● Comparative data:
● gene trees, orthologues, paralogues
● whole genome alignments and synteny
Recent data updates
Wheat data in Ensembl Plants
● The chromosome survey sequence
from the International Wheat Genome
Sequencing Consortium.
● Version 2.1 of the IWGSC gene models called
on the chromosome survey sequence.
● Repeats
● Repbase
● The Triticeae Repeat Sequence Database
(TREP)
● Alignments
● RNA-seq from various studies in ENA
● ESTs and UniGene clusters
● 5x 454 sequence data (Brenchley et al.)
● Triticum turgidum cDNA assemblies
Wheat data in Ensembl Plants
● Whole genome alignments
● Between wheat(s) and:
● Rice
● Brachypodium
● Within wheat
● A vs. B
● A vs. D
● B vs. D
● Gene trees
● Aegilops tauschii
● Triticum urartu
● and other, more distant relatives
Whole genome alignment (WGA) between wheat, rice and Brachypodium
WGA within wheat: A, B and D sub-genomes
Gene trees
Walk through ‘demo’ for
Ensembl Plants
Search
Variant Effect Predictor (VEP)
● Predicts functional consequences of known and
unknown variants
● For substitutions, insertions, deletions and structural
variants
● Web interface (for up to 750 variants), standalone Perl
script, Perl API and REST API
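To make the variant classes and the web-interface limit above concrete, here is a small Python sketch. It is not how VEP itself works: `classify_variant` is a hypothetical helper that sorts REF/ALT allele pairs into the classes named on the slide, and `batches` splits a variant list into chunks that respect the 750-variant limit quoted above.

```python
from typing import Iterator, List, Sequence

WEB_BATCH_LIMIT = 750  # web interface limit quoted on the slide


def classify_variant(ref: str, alt: str) -> str:
    """Classify a REF/ALT pair; '-' stands for an empty allele."""
    ref = ref.replace("-", "").upper()
    alt = alt.replace("-", "").upper()
    if ref == alt:
        raise ValueError("REF and ALT alleles are identical")
    if len(ref) == len(alt):
        return "substitution"
    if len(alt) > len(ref) and alt.startswith(ref):
        return "insertion"
    if len(ref) > len(alt) and ref.startswith(alt):
        return "deletion"
    return "complex"  # anything that fits none of the simple classes


def batches(items: Sequence[str],
            size: int = WEB_BATCH_LIMIT) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most `size` items."""
    for start in range(0, len(items), size):
        yield list(items[start:start + size])
```

Larger variant sets are probably better handled with the standalone script or one of the APIs than by submitting many web batches.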
Visualise your own data
Upload data:
● Data saved on server
● 5 MB limit
● Large file formats?
Attach remote files:
● URL-based
● HTTP or FTP
● No size limit
Upload formats:
● BED: genes / features
● Gbrowse: genes / features
● GFF/GTF: genes / features
● PSL: sequence alignments
● WIG: continuous-valued data
● BedGraph: continuous-valued data
● TrackHub: collections of tracks
Attach formats:
● BigBed: genes / features
● BAM: sequence alignments
● BigWig: continuous-valued data
● VCF: variants
User added tracks:
● Can be saved or shared
● Only trivial security, do not use for sensitive data!
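The upload and attach rules above can be summarised as a small decision helper. This is a sketch under stated assumptions: the function name is hypothetical, the 5 MB limit is taken as 5 × 1024 × 1024 bytes, and the format names are simply those listed on the slide.

```python
# Assumption: "5 MB" on the slide is read as 5 * 1024 * 1024 bytes.
UPLOAD_LIMIT_BYTES = 5 * 1024 * 1024

# Formats as listed on the slide, normalised to upper case.
UPLOAD_FORMATS = {"BED", "GBROWSE", "GFF", "GTF", "PSL",
                  "WIG", "BEDGRAPH", "TRACKHUB"}
ATTACH_FORMATS = {"BIGBED", "BAM", "BIGWIG", "VCF"}


def choose_method(file_format: str, size_bytes: int) -> str:
    """Return 'upload' or 'attach' for a track file, or raise ValueError."""
    fmt = file_format.upper()
    if fmt in ATTACH_FORMATS:
        # Attached files stay on a remote HTTP/FTP server: no size limit.
        return "attach"
    if fmt in UPLOAD_FORMATS:
        if size_bytes <= UPLOAD_LIMIT_BYTES:
            return "upload"
        raise ValueError(
            f"{file_format} file exceeds the upload limit; "
            "consider a format that can be attached by URL"
        )
    raise ValueError(f"unrecognised track format: {file_format}")
```

For example, a small BED file would be uploaded, while a BAM file of any size would be attached by URL.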
The barley Gene-ome
● Step 1 – Dataset
● Choose your dataset and species
● Step 2 – Filters
● Limit your dataset
● Step 3 – Attributes
● Specify what information you want to output
● Step 4 – Results
● Preview and output your results
BLAST and BioMart...
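The first three BioMart steps above (dataset, filters, attributes) map naturally onto BioMart's XML query format. Below is a minimal Python sketch that assembles such a query. The virtual schema, dataset, filter and attribute names are hypothetical placeholders, not values taken from the slides; the real names should be read from the BioMart interface.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Iterable


def build_query(dataset: str,
                filters: Dict[str, str],
                attributes: Iterable[str],
                virtual_schema: str = "plants_mart") -> ET.Element:
    """Assemble a BioMart-style <Query> element."""
    query = ET.Element("Query", {
        "virtualSchemaName": virtual_schema,  # assumed schema name
        "formatter": "TSV",
        "header": "0",
        "uniqueRows": "1",
        "datasetConfigVersion": "0.6",
    })
    # Step 1: choose the dataset (and thereby the species).
    ds = ET.SubElement(query, "Dataset",
                       {"name": dataset, "interface": "default"})
    # Step 2: filters limit the dataset.
    for name, value in filters.items():
        ET.SubElement(ds, "Filter", {"name": name, "value": value})
    # Step 3: attributes specify the output columns.
    for name in attributes:
        ET.SubElement(ds, "Attribute", {"name": name})
    return query


def to_xml(query: ET.Element) -> str:
    """Serialise the query with the usual XML prologue."""
    prologue = '<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Query>'
    return prologue + ET.tostring(query, encoding="unicode")


# Hypothetical dataset, filter and attribute names for illustration only.
example = build_query("hvulgare_eg_gene",
                      {"chromosome_name": "1H"},
                      ["ensembl_gene_id", "description"])
example_xml = to_xml(example)
```

Step 4 (results) would correspond to submitting `example_xml` to a BioMart service and reading back the tab-separated output; that request is deliberately left out of the sketch.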
pkersey@ebi.ac.uk 10/01/2014
Funding (Ensembl Plants)
• Ensembl Genomes is funded by:
• EMBL
• EU (INFRAVEC, Microme, transPLANT, AllBio)
• BBSRC (PhytoPath, wheat, barley and midge sequencing,
UK-US collaboration, RNAcentral)
• Wellcome Trust (PomBase)
• NIH/NIAID (VectorBase)
• NSF (Gramene collaboration)
• Bill and Melinda Gates Foundation (wheat rust)
People (Ensembl Plants)
• James Allen, Irina Armean, Dan Bolser, Mikkel
Christensen, Paul Davies, Christoph Grabmueller, Kevin
Howe, Malcolm Hinsley, Jay Humphrey, Arnaud
Kerhornou, Paul Kersey, Julia Khobdova, Eugene
Kulesha, Nick Langridge, Dan Lawson, Mark McDowall,
Uma Maheswari, Gareth Maslen, Michael Nuhn, Chuang
Kee Ong, Michael Paulini, Helder Pedro, Anton Petrov,
Dan Staines, Mary Ann Tuli, Brandon Walts, Gary
Williams
• If you have a question that is not answered here, please contact our HelpDesk:
• helpdesk@ensemblgenomes.org

