An Oz Mammals Bioinformatics and Data Resource
Vicky Schneider, Andrew Pask, Denis O’Meally, Philippa Griffin, Jeff Christiansen, Mike Charleston, Dominique Gorse, Andrew Treloar, Jason Williams, Rebecca Johnson and Andrew Lonie
With comments from Paul Flicek, Michelle Barker, Susanna Sansone, Dave Burt
Raw
• Whole-genome sequencing reads
• Exon-capture sequencing reads
• RADseq/GBS/exon-capture sequencing reads
Processed
• Genome assemblies
• Gene alignments
• Phylogenetic trees
• Variant calls
• Transcriptome data
• Annotations
• Sequence alignments
• Phylogenetic trees
• Microsatellite datasets
• Cytological data (images?)
• Phenotype information
• …
Photo: Australian Alps on Flickr, https://www.flickr.com/photos/australianalps/6954940609 (CC BY-NC-ND 2.0)
OMG Project data
● Hugely valuable for
○ Understanding our natural heritage
○ Tackling evolutionary and ecological questions
○ Placental mammal research including human biomedical research
● Uniquely Australian
● Often irreplaceable samples and data
We are leading the world in generating these data
Could also be leading in sharing the data!
Data Life Cycle framework (figure; recoverable stage label: visualising)
Metadata (contextual information about the data) is key to making this work
e.g.
Sample
- species
- tissue type
- collection location
- museum ID
Experiment
- sample processing method
- technology used
- settings
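As a rough illustration of the sample and experiment metadata listed above, the fields could be captured as structured records. This is only a sketch: the class names, field names and example values are assumptions made for illustration, not an OMG project schema.

```python
from dataclasses import dataclass, asdict


@dataclass
class Sample:
    # Fields mirror the "Sample" list on the slide; names are hypothetical.
    species: str
    tissue_type: str
    collection_location: str
    museum_id: str


@dataclass
class Experiment:
    # Fields mirror the "Experiment" list on the slide; names are hypothetical.
    sample: Sample
    processing_method: str
    technology: str
    settings: dict


# Illustrative values only, not real project records.
sample = Sample(
    species="Phascolarctos cinereus",
    tissue_type="liver",
    collection_location="example location",
    museum_id="EXAMPLE-0001",
)
experiment = Experiment(
    sample=sample,
    processing_method="example extraction protocol",
    technology="example sequencing platform",
    settings={"read_length": 150},
)

# A plain nested dict is easy to serialise alongside the data files.
record = asdict(experiment)
print(record["sample"]["species"])
```

Keeping the sample description nested inside the experiment record means every dataset carries its own context when it is shared.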
What is not covered by the OMG project?
• A place to store and share data and metadata for the OMG project
• A place to store and share existing Oz mammal datasets
• A place to share data processing/analysis workflows
• A place to access data processing, analysis and visualisation tools (with appropriate compute resources)
• Integration with external tools, e.g. Atlas of Living Australia
• 5 strains x 2 growth conditions of 2 bacterial species
• Genomic, transcriptomic, metabolomic and proteomic profiles
Select datasets using drop-down menu of metadata values, e.g.
• ‘all raw transcriptomic datasets from E. coli grown in blood media’
• ‘all datasets from bacterial samples collected from patients in NSW before 2010’
Send to local desktops, HPC systems for analysis
Log in
Process/analyse/visualise in common cloud-based environment with pre-installed software tools
Submit data and metadata to international repository
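The metadata-driven selection described above (the two example queries) could look roughly like the following. This is a minimal sketch, not the platform's actual interface: the `select` helper, the metadata keys and the four example datasets are all hypothetical.

```python
from datetime import date

# Hypothetical dataset records; every key and value is made up for illustration.
datasets = [
    {"id": "ds1", "species": "E. coli", "data_type": "transcriptomic",
     "level": "raw", "growth_medium": "blood",
     "collection_state": "NSW", "collection_date": date(2008, 5, 1)},
    {"id": "ds2", "species": "E. coli", "data_type": "proteomic",
     "level": "processed", "growth_medium": "blood",
     "collection_state": "VIC", "collection_date": date(2012, 3, 10)},
    {"id": "ds3", "species": "S. aureus", "data_type": "genomic",
     "level": "raw", "growth_medium": "broth",
     "collection_state": "NSW", "collection_date": date(2009, 11, 20)},
    {"id": "ds4", "species": "E. coli", "data_type": "transcriptomic",
     "level": "raw", "growth_medium": "broth",
     "collection_state": "NSW", "collection_date": date(2011, 1, 15)},
]


def select(records, **criteria):
    """Return records matching every criterion.

    A criterion is either an exact value or a predicate (a callable
    that takes the record's value and returns True or False).
    """
    def matches(rec):
        for key, wanted in criteria.items():
            value = rec.get(key)
            if callable(wanted):
                if value is None or not wanted(value):
                    return False
            elif value != wanted:
                return False
        return True

    return [rec for rec in records if matches(rec)]


# 'all raw transcriptomic datasets from E. coli grown in blood media'
raw_ecoli_blood = select(
    datasets, species="E. coli", data_type="transcriptomic",
    level="raw", growth_medium="blood",
)

# 'all datasets from bacterial samples collected ... in NSW before 2010'
nsw_before_2010 = select(
    datasets, collection_state="NSW",
    collection_date=lambda d: d < date(2010, 1, 1),
)

print([d["id"] for d in raw_ecoli_blood])
print([d["id"] for d in nsw_before_2010])
```

Queries like these only work when every dataset carries the same metadata keys, which is why consistent metadata capture comes before any selection interface.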
• Large collaborative project funded by Research Data Services (RDS), linked with the BPA Antibiotic-Resistant Pathogens Project
• Project members from VicNode, QCIF, Melbourne Bioinformatics (formerly VLSCI), Intersect
→ There is expertise in Australia in developing this kind of resource:
• data storage
• research data management
• delivering analysis tools in a common cloud environment
• linking across storage/management/analysis layers
• Many of the pieces can be reused/adapted for different research projects
Existing Oz Mammal data resources
• For within-project collaboration
• Focus on data sharing (storage) and community genome annotation
• Datasets mostly unpublished as yet
Tools:
• File downloads
• JBrowse
• BLAST
• Apollo
http://copo-project.org/
Aims to provide an easy-to-use interface for researchers to access interoperable:
• Metadata annotation services
• Data repository services
• Data analysis services
• Data publishing services
www.cyverse.org
What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
• Capture Oz Mammal data and resources that already exist
• long-term, secure data storage
• Integrate new OMG data and metadata
• Enable data sharing within OMG project (and collaborators)
• Provide access to Oz Mammal data for the world!
• Access to data processing, analysis and visualisation tools in one place
• Integrate external tools, e.g. Atlas of Living Australia
• Enable sharing of processing/analysis workflows within the project
• Enable sharing via submission to appropriate international repositories
• encourage best-practice data formats
• encourage complete, rich metadata that complies with repository and community standards
• Use and build on existing platforms like the OMICS platform
• Long-term hosting and maintenance
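One simple way a resource could encourage complete metadata before submission to a repository is a completeness check run on each record. The sketch below is only an assumption about how that might work: the required-field list is a hypothetical checklist based on the sample metadata examples in this talk, not an actual repository or community standard.

```python
# Hypothetical checklist; a real one would come from the target
# repository or the relevant community standard.
REQUIRED_FIELDS = ("species", "tissue_type", "collection_location", "museum_id")


def missing_fields(record, required=REQUIRED_FIELDS):
    """Return the required field names that are absent or blank in a record."""
    return [
        name for name in required
        if not str(record.get(name) or "").strip()
    ]


# Illustrative records only.
complete = {
    "species": "Phascolarctos cinereus",
    "tissue_type": "liver",
    "collection_location": "example location",
    "museum_id": "EXAMPLE-0001",
}
incomplete = {
    "species": "Phascolarctos cinereus",
    "tissue_type": "",            # blank value counts as missing
    "museum_id": "EXAMPLE-0002",  # collection_location is absent
}

print(missing_fields(complete))    # []
print(missing_fields(incomplete))  # ['tissue_type', 'collection_location']
```

Reporting the missing field names, rather than a simple pass/fail, tells the submitter exactly what to fix.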
Current way forward
• Drafting a proposal aimed at ANDS/NeCTAR/RDS
• No funding scheme yet - but possibly later this year
• Engaging with European Bioinformatics Institute (Ensembl Vertebrates),
ISA-Tools, Cyverse for potential collaborations and advice
• Aligns with broader digital infrastructure strategy currently being mapped
at national level
Timescale
• Year 1-2: scoping requirements, building, ongoing testing
• Year 2-3: building, release, outreach/training, improvement
Expertise required
• Research Software Engineering
• Business Analyst expertise + domain knowledge
• Biocuration
• Input on Bioinformatics Needs
• Input on User Experience Design
• Input on Training/Outreach
• Project Management
For comparison
• COPO: 4 FTE for 3 years
• Cyverse: 35 FTE for 5 years (US$100 million over 10 years)
Matt Francey on Flickr: https://www.flickr.com/photos/howfardad/31879952075
CC BY-NC 2.0
Your thoughts?
• Are there OMG project needs not covered in this
list?
• Any other Oz Mammal portals/resources to be
aware of / consider incorporating?
• What do you see as the highest priority in data
management / accessing compute resources /
sharing and storing data for the OMG:
• Currently?
• A year from now?

Mais conteúdo relacionado

Mais procurados

How to share useful data
How to share useful dataHow to share useful data
How to share useful dataPeter McQuilton
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeCarly Strasser
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015Carly Strasser
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataLe_GFII
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1bensparrowau
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesapaari
 
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...ariadnenetwork
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIDaniel S. Katz
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceCarly Strasser
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingDaniel S. Katz
 
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataOpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataKrzysztof Gorgolewski
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARECASRAI
 
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Krzysztof Gorgolewski
 
Spark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit
 
Reproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachReproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachKrzysztof Gorgolewski
 

Mais procurados (20)

How to share useful data
How to share useful dataHow to share useful data
How to share useful data
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of Change
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research data
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1
 
The CATE Project
The CATE ProjectThe CATE Project
The CATE Project
 
sDiv_IJSCM-part_2
sDiv_IJSCM-part_2sDiv_IJSCM-part_2
sDiv_IJSCM-part_2
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resources
 
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSI
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meeting
 
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataOpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
 
Spark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van Ham
 
Reproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachReproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approach
 

Semelhante a An Oz Mammals Bioinformatics and Data Resource

Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planC. Tobin Magle
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNADaniel S. Katz
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleEnis Afgan
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016Philippa Griffin
 
e-infrastructural needs to support informatics
e-infrastructural needs to support informaticse-infrastructural needs to support informatics
e-infrastructural needs to support informaticsDavid Wallom
 
Community Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDCommunity Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDrlwalls2008
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesLouise Corti
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planC. Tobin Magle
 
Elixir at de.nbi meeting
Elixir at de.nbi meetingElixir at de.nbi meeting
Elixir at de.nbi meetingNiklas Blomberg
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsPascale Gaudet
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ LibraryARDC
 

Semelhante a An Oz Mammals Bioinformatics and Data Resource (20)

Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management plan
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNA
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an example
 
COPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob DaveyCOPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob Davey
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
 
e-infrastructural needs to support informatics
e-infrastructural needs to support informaticse-infrastructural needs to support informatics
e-infrastructural needs to support informatics
 
Community Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDCommunity Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHD
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management plan
 
Elixir at de.nbi meeting
Elixir at de.nbi meetingElixir at de.nbi meeting
Elixir at de.nbi meeting
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next Developments
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 

Último

Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 

Último (20)

Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombinant DNA technology (Immunological screening)

An Oz Mammals Bioinformatics and Data Resource

  • 1. Vicky Schneider, Andrew Pask, Denis O’Meally, Philippa Griffin, Jeff Christiansen, Mike Charleston, Dominique Gorse, Andrew Treloar, Jason Williams, Rebecca Johnson and Andrew Lonie. With comments from Paul Flicek, Michelle Barker, Susanna Sansone, Dave Burt. An Oz Mammals Bioinformatics and Data Resource
  • 2. Raw: • Whole-genome sequencing reads • Exon-capture sequencing reads • RADseq/GBS/exon-capture sequencing reads Processed: • Genome assemblies • Gene alignments • Phylogenetic trees • Variant calls • Transcriptome data • Annotations • Sequence alignments • Phylogenetic trees • Microsatellite datasets • Cytological data (images?) • Phenotype information • … Image: From Australian Alps on Flickr: https://www.flickr.com/photos/australianalps/6954940609 CC BY-NC-ND 2.0
  • 3. Image: From Australian Alps on Flickr: https://www.flickr.com/photos/australianalps/6954940609 CC BY-NC-ND 2.0 OMG Project data ● Hugely valuable for ○ Understanding our natural heritage ○ Tackling evolutionary and ecological questions ○ Placental mammal research including human biomedical research ● Uniquely Australian ● Often irreplaceable samples and data We are leading the world in generating these data. Could also be leading in sharing the data!
  • 5. Data Life Cycle framework visualising. Metadata (contextual information about the data) is key to making this work, e.g. Sample: species, tissue type, collection location, museum ID. Experiment: sample processing method, technology used, settings.
  • 6. • A place to store and share data and metadata for the OMG project • A place to store and share existing Oz mammal datasets • A place to share data processing/analysis workflows • A place to access data processing, analysis and visualisation tools (with appropriate compute resources) • Integration with external tools, e.g. Atlas of Living Australia What is not covered by the OMG project?
  • 7. • 5 strains x 2 growth conditions of 2 bacterial species • Genomic, transcriptomic, metabolomic and proteomic profiles
  • 8.
  • 9. Select datasets using drop-down menu of metadata values, e.g. • ‘all raw transcriptomic datasets from E. coli grown in blood media’ • ‘all datasets from bacterial samples collected from patients in NSW before 2010’ Send to local desktops, HPC systems for analysis. Log in. Process/analyse/visualise in common cloud-based environment with pre-installed software tools. Submit data and metadata to international repository.
  • 10. • Large collaborative project funded by Research Data Services (RDS), linked with the BPA Antibiotic-Resistant Pathogens Project • Project members from VicNode, QCIF, Melbourne Bioinformatics (formerly VLSCI), Intersect • -> There is expertise in Australia in developing this kind of resource • data storage • research data management • delivering analysis tools in a common cloud environment • linking across storage/management/analysis layers • Many of the pieces can be reused/adapted for different research projects
  • 11. Existing Oz Mammal data resources • For within-project collaboration • Focus on data sharing, (storage), community genome annotation • Datasets mostly unpublished as yet Tools: • File downloads • JBrowse • BLAST • Apollo
  • 12. http://copo-project.org/ Aims to provide an easy-to-use interface for researchers to access interoperable • Metadata annotation services • Data repository services • Data analysis services • Data publishing services
  • 14. • Capture Oz Mammal data and resources that already exist • long-term, secure data storage • Integrate new OMG data and metadata • Enable data sharing within OMG project (and collaborators) • Provide access to Oz Mammal data for the world! What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
  • 15. • Capture Oz Mammal data and resources that already exist • long-term, secure data storage • Integrate new OMG data and metadata • Enable data sharing within OMG project (and collaborators) • Provide access to Oz Mammal data for the world! • Access to data processing, analysis and visualisation tools in one place • Integrate external tools, e.g. Atlas of Living Australia • Enable sharing of processing/analysis workflows within the project What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
  • 16. • Capture Oz Mammal data and resources that already exist • long-term, secure data storage • Integrate new OMG data and metadata • Enable data sharing within OMG project (and collaborators) • Provide access to Oz Mammal data for the world! • Access to data processing, analysis and visualisation tools in one place • Integrate external tools, e.g. Atlas of Living Australia • Enable sharing of processing/analysis workflows within the project • Enable sharing via submission to appropriate international repositories • encourage best-practice data formats • encourage complete, rich metadata that complies with repository and community standards What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
  • 17. • Capture Oz Mammal data and resources that already exist • long-term, secure data storage • Integrate new OMG data and metadata • Enable data sharing within OMG project (and collaborators) • Provide access to Oz Mammal data for the world! • Access to data processing, analysis and visualisation tools in one place • Integrate external tools, e.g. Atlas of Living Australia • Enable sharing of processing/analysis workflows within the project • Enable sharing via submission to appropriate international repositories • encourage best-practice data formats • encourage complete, rich metadata that complies with repository and community standards • Use and build on existing platforms like the OMICS platform • Long-term hosting and maintenance What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
  • 18.
  • 19. Current way forward • Drafting a proposal aimed at ANDS/NeCTAR/RDS • No funding scheme yet - but possibly later this year • Engaging with European Bioinformatics Institute (Ensembl Vertebrates), ISA-Tools, Cyverse for potential collaborations and advice • Aligns with broader digital infrastructure strategy currently being mapped at national level
  • 20. Timescale: • Year 1-2: scoping requirements, building, ongoing testing • Year 2-3: building, release, outreach/training, improvement Expertise required: • Research Software Engineering • Business Analyst expertise + domain knowledge • Biocuration • Input on Bioinformatics Needs • Input on User Experience Design • Input on Training/Outreach • Project Management For comparison: • COPO: 4 FTE for 3 years • Cyverse: 35 FTE for 5 years (US$100 million over 10 years) Image: Matt Francey on Flickr: https://www.flickr.com/photos/howfardad/31879952075 CC BY-NC 2.0
  • 21. Your thoughts? • Are there OMG project needs not covered in this list? • Any other Oz Mammal portals/resources to be aware of / consider incorporating? • What do you see as the highest priority in data management / accessing compute resources / sharing and storing data for the OMG: • Currently? • A year from now?