SlideShare uma empresa Scribd logo
1 de 58
Let’s go on a
FAIR Safari!
Prof Carole Goble
The FAIRDOM Consortium
ELIXIR UK Head of Node
BioComputeObject Partnership
The University of Manchester, UK
carole.goble@manchester.ac.uk
COMBINE 2019, EU-STANDS4PM, Heidelberg,Germany 18 July 2019
A European standardization framework for data
integration and data-driven in silico models for
personalised medicine
harmonised transnational standards, recommendations
and guidelines that allow a broad application of
predictive in silico methodologies in personalised
medicine across Europe.
A European standardization framework for data
integration and data-driven in silico models for
personalised medicine
Scientific Data 3, 160018 (2016)
doi:10.1038/sdata.2016.18
A potted history
Many went before
2014 - Lorentz workshop
2015 - BioHackathon
2016 - Published
Went bananas
Grassroots activity that has
become a top down one.
sharing/publishing assets in public archives…
Data Models
*top three most popular
The evolution of standards and data management practices in systems biology
(2015). Stanford et al, Molecular Systems Biology, 11(12):851
… model reuse is tricky…
Stanford et alThe evolution of standards and data management practices in systems biology,
Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053
COMBINE sessions on Reproducibility
... different repositories, owners,
sovereignties, infrastructure, platforms …
The evolution of standards and data management practices in systems biology (2015).
Stanford et al, Molecular Systems Biology, 11(12):851
A jungle
An ecosystem
http://genexplain.com/mypathsem/
FAIR for the widest possible use from EHR to Research …
Just how feasible is it to interoperate (integrate)
and reuse data collected for another purpose in
another domain?
The FAIR Jungle
The FAIR Hype
Clarity
Infrastructure
Methodologies
Incentives
Cutting a path through the jungle….
PEST – political, economic, social, technical
What does it mean to be FAIR?
What is the cost / benefit analysis
Lets
examine
FAIR more
closely….
FAIR principles in the paper…
some people seem to have taken as the law of the jungle…
FAIR Principles
machine-actionable data and metadata
Findable Accessible Interoperable Reusable
Find: with
machine
readable
metadata
Locate and id:
with standard
identification
mechanism
Available and
obtainable
Human &
machine
Metadata
always
STANDARDS
Semantically
encoded,
syntactically
parsable
References
Sufficiently
described
Provenance
Least restrictive
licenses
Community
compliant
Increase exchange, integration and reuse
Across disciplines and borders
FAIR Principles reality check
• An aspiration, a journey.
• A call for machine actionability
of data and metadata.
• Ambiguous.
• Work in progress.
• A subset of indicators:
– ROI, impact, community
need, sustainability of
repository, quality of
service….
Are Are not
• A standard.
• Strict.
• Just about humans being able to
find, access, reformat and finally
reuse data.
• Technology specific.
• Domain specific.
• Tablets of stone
Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open
ScienceCloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824.
Dunning et al Are the FAIR Data Principles fair? IDCC17
Lets measure it!
Framework for metrics
Automated services
Manual services
Authorities
Wilkinson et al, Evaluating FAIR MaturityThrough a Scalable, Automated,Community-Governed Framework
https://doi.org/10.1101/649202
Lets measure it!
Dunkelziffer
“Not everything that
can be counted counts.
Not everything that
counts can be counted”
[William Bruce Cameron]
Compliance
Awareness
Expectation
setting
Self-evaluation
Reporting
By Providers,
Users &
Community
Certification
Judgement
Regulation
ByWhom???
Comparison
Monitoring
Review
Quality
By Community
Contract
http://blog.ukdataser
vice.ac.uk/fair-data-
assessment-tool/
https://fairshake.cloud/
Indicators: Robustness,
Humility,Transparency,
Diversity, Reflexivity*
Context dependency
Community standards
Incremental
Matrix of metrics
Maturity levels for each
+
*The MetricTide, https://responsiblemetrics.org/the-metric-tide/
F and A are not so bad
I and R are hard
A FAIR Ecosystem means....
FAIR indicators, models and trust
Transparent
evaluation
Capability Maturity Model
of entities & their capabilities
Indicators and metrics
measuring levels
Foundational
Components
FAIRification
Process
Awareness and Policy
Standards and Guidelines
People
Infrastructure
Value Based Assessment
Selection
Goal Setting
Process planning
Modelling
Transformation
Publishing
Impl. Outcome:
Dataset
Persistent Identification
Data Set Discovery
Machine Readability
Data Access and Usage
Preservation and Sustainability
RDA FAIR Data
Maturity Model
Working Group
Cataloguing the FAIR ecosystem
What do we mean by a Maturity Model?
[Susheel Varma]
Only way more
elaborate ….
[Wilkinson et al, 2019]
FAIR Evaluator Workflows
Rubrics, Indicators andTests
are FAIR objects and community decisions
https://doi.org/10.1101/649202
Scale up and scale out
automation of
indicators and their
evaluation…
FAIRification Pipelines
Rare Diseasehttps://www.go-fair.org/fair-
principles/fairification-process/
[Marco Roos]
FAIRification Cookbooks … for models?
https://fairplus-project.eu/
More than just data
Software, models, workflows, SOPs, Lab Protocols….
FAIR Digital Objects
FAIR Models
properties of data + software
FAIR Software
FAIR Workflows*
Maintainability
Testing
Portability
Composite structure
Forms (spec or code?)
Versioning
Executability
Maturity models
Contributor policy
Identity
Copyright
Licenses Documentation
Sustainability
Model
Reproducibility
& Exchange
*FAIR ComputationalWorkflows https://doi.org/10.5281/zenodo.3268653
FAIR Precision
Medicine
Models …
Indicators &
Maturity
Model eXchange
Standards
Identifiers
AAI & Licensing
Repositories
Search
DMP
Policies Governance
Cloud of
registries
Federation
Scale out mark-up for federation
https://eosc-edmi.github.io/
http://bioschemas.org
EOSC Dataset
Minimum
Information
FAIR OPEN
SAFE
privacy preservation, regulatory rigour
crossing domain and sovereignty boundaries
Privacy Preservation of data
data book keeping
https://f1000research.com/posters/7-1036
https://www.monarc.lu
[Pinar Alper]
Privacy Preservation of analysis
take (distributed) analysis to the (distributed) data
https://www.health-ri.org/
Personal Health Train
Collect privacy sensitive data using mobile containers
Regulatory Practice
robust, safe exchange and reuse of
HTS computational analytical
workflows
http://biocomputeobject.org
IEEE P2791
BioCompute
Working Group
[Vahan Simonyan]
BioCompute Framework
to advance Regulatory Science to support NGS analysis
Emphasis on robust, safe reuse.
Describe and validate the
metadata of packages, and
their contents, both inside
and outside
Standardise data formats and
elements and exchange of
Electronic Health Records
Describe and
validate analysis
workflows, to be
portable and
interoperable
Standardise and support
sharing and analysis of
Genomic data
Alterovitz, Dean II, Goble, Crusoe, Soiland-Reyes et al “Enabling
Precision Medicine via standard communication of NGS provenance,
analysis, and results” PLOS Biology 2018
Bechhofer et al (2013)Why linked data is not enough for scientists https://doi.org/10.1016/j.future.2011.08.004
Bechhofer et al (2010) Research Objects:Towards Exchange and Reuse of Digital Knowledge, https://eprints.soton.ac.uk/268555/
Self-describing machine processable
metadata in common and specific to
different object types.
bundle together references or
the objects themselves. Relate
digital resources
snapshot | cite | exchange
Research Object
Framework
COMBINE was early to the party….
Combine Archive
Scharm M,Wendland F, Peters M,Wolfien M,TheileT,Waltemath D SEMS, University of Rostock zip-like file with a manifest & metadata
- Bundle files - Keep provenance
- Exchange data - Ship results
Bergmann, F.T. (2014). COMBINE archive and OMEX format: one file to share all information
to reproduce a modeling project. BMC bioinformatics,15(1), 1.
https://sems.unirostock.de/projects/combinearchive/
Big data distributed over multiple locations,
Efficiently and safely moved on demand
ROs are verified collections of references
[Chard, et al 2016]
FAIR Research Objects
The KnowledgeObject Reference Ontology (KORO): A formalism to support management and sharing of computable
biomedical knowledge for learning health systems
Flynn, Friedman, Boisvert, Landis‐Lewis, Lagoze (2018), https://doi.org/10.1002/lrh2.10054
Graphs of ROs
Track ROs
Combine and enrich ROs
Learning Health Systems
and Research Objects
EOSC-Life: FAIR data and tools (workflows, models) for
cloud use
RI data (distributed over
facilities)
Ecosystem of innovative
tools in EOSC
Publish FAIR life
science data in EOSC
Data Catalogues
Tools Catalogues
Workflow Catalogues
Service Catalogues
[Niklas Blomberg]
FAIR Challenges
for Projects
Track collection of data and metadata X X X
Maintain experimental context X X
Find and exchange assets X X X X
Long-term retain results beyond a project X X X
Share, disseminate and publish assets sensitively X X X
Consistently report for interpretation, interoperability
& comparison
X X
Promote standardised metadata practices. X X
Organise and link assets X X X
Reuse tools and community archives X X
Integrate with other data stores and platforms X X X
Support reproducible publications X X X X
Credit owners X X
Public Project
Commons
Platform
Service hosted at HITS
50+
installations
140+
projects
Support
A Commons
Project
Investigations and Assets
Simulate model
Launch workflows
Models
SOPs
People Projects
Publications
Documents
Presentations
Workflows
Data
Events
Federated Catalogue, Integrated view
interlinked objects, structured organisation, resources ecosystem
Investigations
Studies
Assays/Analyses
Workflows
Federated Catalogue, Integrated view
interlinked objects, structured organisation, resources ecosystem
Stores Archives
FAIR Membrane
Investigations
Studies
Assays/Analyses
Federated Catalogue, Integrated view
A Commons is only as FAIR as the content (inside and outside)
FAIR Membrane
FAIR(ish) after death ….
https://fairdom
hub.org/projec
ts/129
https://wellcomeopenresearch.org/articles/4-104/v1
Zielinski, Hay, Millar, The grant is dead, long live the data - migration as a pragmatic exit strategy
for research data preservation,
Data Sovereignty: FAIR but not yet Open
A Project
Commons
not an
integrated
data
warehouse
e.g. (Pillar III)
in-house in-house
All LiSyM
Patient-related
clinical data
Aggregated data
API
External Tools
API
Data Sovereignty: FAIR but never Open
[Mueller]
Data Sovereignty: Personal Health Tram
Less automatic, more transparent, when partners cannot share
Share table structure
Share common code
Share summaries
FAIR at the First Mile
[Christian R Bauer]
FAIR at the First Mile
Project Commons Integrated Data Warehouse[Christian R Bauer]
EU-STAND4PM: First and Last Mile
Neylon, Knowledge Exchange Report: http://www.knowledge-exchange.info/event/ke-approach-open-scholarship
FAIR at last mile
FAIR at first
mile / source
FAIR Protected
Data/Compute
FAIR
Objects
FAIRification
EU-STANDS4PM
FAIR path through the jungle
Indicators and Maturity Models
obtainable & understandable
Technical infrastructure & Stewardship Skills
possible
Communities & Culture
easy (or at least feasible)
User Experience
normative
rewarding
Incentives
required
Policies
Based on Matt Spritzer’s figure, COS
Acknowledgements
FAIRDOM Team
– http://www.fair-dom.org
Research Object Team
– http://www.researchobject.org
BioComputeObject
– http://biocomputeobject.org/
FAIR folks, esp. FAIRplus and FAIR Metrics
– https://fairplus-project.eu/
– http://www.fairmetrics.org
CommonWorkflow Language
– http://www.commonwl.org
ELIXIR
– http://www.elixir-europe.org
BioExcel
– http://bioexcel.eu
Acknowledgements

Mais conteúdo relacionado

Mais procurados

FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Carole Goble
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
Carole Goble
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 

Mais procurados (20)

FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Open Science: how to serve the needs of the researcher?
Open Science: how to serve the needs of the researcher? Open Science: how to serve the needs of the researcher?
Open Science: how to serve the needs of the researcher?
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 

Semelhante a Let’s go on a FAIR safari!

FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 

Semelhante a Let’s go on a FAIR safari! (20)

The FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharingThe FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharing
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
FAIR: standards and services
FAIR: standards and servicesFAIR: standards and services
FAIR: standards and services
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Introduction to FAIR Data and Research Objects
Introduction to FAIR Data and Research ObjectsIntroduction to FAIR Data and Research Objects
Introduction to FAIR Data and Research Objects
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
 
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
FAIR, FAIRplus and the FAIR Cookbook
FAIR, FAIRplus and the FAIR Cookbook FAIR, FAIRplus and the FAIR Cookbook
FAIR, FAIRplus and the FAIR Cookbook
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 

Mais de Carole Goble

RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
Carole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Carole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
Carole Goble
 

Mais de Carole Goble (15)

Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 

Último

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Silpa
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 

Último (20)

Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 

Let’s go on a FAIR safari!

  • 1. Let’s go on a FAIR Safari! Prof Carole Goble The FAIRDOM Consortium ELIXIR UK Head of Node BioComputeObject Partnership The University of Manchester, UK carole.goble@manchester.ac.uk COMBINE 2019, EU-STANDS4PM, Heidelberg,Germany 18 July 2019
  • 2. A European standardization framework for data integration and data-driven in silico models for personalised medicine harmonised transnational standards, recommendations and guidelines that allow a broad application of predictive in silico methodologies in personalised medicine across Europe.
  • 3. A European standardization framework for data integration and data-driven in silico models for personalised medicine
  • 4. Scientific Data 3, 160018 (2016) doi:10.1038/sdata.2016.18 A potted history Many went before 2014 - Lorentz workshop 2015 - BioHackathon 2016 - Published Went bananas Grassroots activity that has become a top down one.
  • 5. sharing/publishing assets in public archives… Data Models *top three most popular The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851
  • 6. … model reuse is tricky… Stanford et alThe evolution of standards and data management practices in systems biology, Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053 COMBINE sessions on Reproducibility
  • 7. ... different repositories, owners, sovereignties, infrastructure, platforms … The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851 A jungle An ecosystem
  • 9. FAIR for the widest possible use from EHR to Research …
  • 10. Just how feasible is it to interoperate (integrate) and reuse data collected for another purpose in another domain?
  • 13. Cutting a path through the jungle…. PEST – political, economic, social, technical What does it mean to be FAIR? What is the cost / benefit analysis
  • 15. FAIR principles in the paper… some people seem to have taken as the law of the jungle…
  • 16. FAIR Principles machine-actionable data and metadata Findable Accessible Interoperable Reusable Find: with machine readable metadata Locate and id: with standard identification mechanism Available and obtainable Human & machine Metadata always STANDARDS Semantically encoded, syntactically parsable References Sufficiently described Provenance Least restrictive licenses Community compliant Increase exchange, integration and reuse Across disciplines and borders
  • 17. FAIR Principles reality check • An aspiration, a journey. • A call for machine actionability of data and metadata. • Ambiguous. • Work in progress. • A subset of indicators: – ROI, impact, community need, sustainability of repository, quality of service…. Are Are not • A standard. • Strict. • Just about humans being able to find, access, reformat and finally reuse data. • Technology specific. • Domain specific. • Tablets of stone Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open ScienceCloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824. Dunning et al Are the FAIR Data Principles fair? IDCC17
  • 18. Lets measure it! Framework for metrics Automated services Manual services Authorities Wilkinson et al, Evaluating FAIR MaturityThrough a Scalable, Automated,Community-Governed Framework https://doi.org/10.1101/649202
  • 19. Lets measure it! Dunkelziffer “Not everything that can be counted counts. Not everything that counts can be counted” [William Bruce Cameron]
  • 21. Indicators: Robustness, Humility,Transparency, Diversity, Reflexivity* Context dependency Community standards Incremental Matrix of metrics Maturity levels for each + *The MetricTide, https://responsiblemetrics.org/the-metric-tide/ F and A are not so bad I and R are hard A FAIR Ecosystem means.... FAIR indicators, models and trust Transparent evaluation
  • 22. Capability Maturity Model of entities & their capabilities Indicators and metrics measuring levels Foundational Components FAIRification Process Awareness and Policy Standards and Guidelines People Infrastructure Value Based Assessment Selection Goal Setting Process planning Modelling Transformation Publishing Impl. Outcome: Dataset Persistent Identification Data Set Discovery Machine Readability Data Access and Usage Preservation and Sustainability RDA FAIR Data Maturity Model Working Group Cataloguing the FAIR ecosystem
  • 23. What do we mean by a Maturity Model? [Susheel Varma] Only way more elaborate ….
  • 24. [Wilkinson et al, 2019] FAIR Evaluator Workflows Rubrics, Indicators andTests are FAIR objects and community decisions https://doi.org/10.1101/649202 Scale up and scale out automation of indicators and their evaluation…
  • 26. FAIRification Cookbooks … for models? https://fairplus-project.eu/
  • 27. More than just data Software, models, workflows, SOPs, Lab Protocols…. FAIR Digital Objects
  • 28. FAIR Models properties of data + software FAIR Software FAIR Workflows* Maintainability Testing Portability Composite structure Forms (spec or code?) Versioning Executability Maturity models Contributor policy Identity Copyright Licenses Documentation Sustainability Model Reproducibility & Exchange *FAIR ComputationalWorkflows https://doi.org/10.5281/zenodo.3268653
  • 31. Scale out mark-up for federation https://eosc-edmi.github.io/ http://bioschemas.org EOSC Dataset Minimum Information
  • 32. FAIR OPEN SAFE privacy preservation, regulatory rigour crossing domain and sovereignty boundaries
  • 33. Privacy Preservation of data data book keeping https://f1000research.com/posters/7-1036 https://www.monarc.lu [Pinar Alper]
  • 34. Privacy Preservation of analysis take (distributed) analysis to the (distributed) data https://www.health-ri.org/ Personal Health Train Collect privacy sensitive data using mobile containers
  • 35. Regulatory Practice robust, safe exchange and reuse of HTS computational analytical workflows http://biocomputeobject.org IEEE P2791 BioCompute Working Group [Vahan Simonyan]
  • 36. BioCompute Framework to advance Regulatory Science to support NGS analysis Emphasis on robust, safe reuse. Describe and validate the metadata of packages, and their contents, both inside and outside Standardise data formats and elements and exchange of Electronic Health Records Describe and validate analysis workflows, to be portable and interoperable Standardise and support sharing and analysis of Genomic data Alterovitz, Dean II, Goble, Crusoe, Soiland-Reyes et al “Enabling Precision Medicine via standard communication of NGS provenance, analysis, and results” PLOS Biology 2018
  • 37. Bechhofer et al (2013)Why linked data is not enough for scientists https://doi.org/10.1016/j.future.2011.08.004 Bechhofer et al (2010) Research Objects:Towards Exchange and Reuse of Digital Knowledge, https://eprints.soton.ac.uk/268555/ Self-describing machine processable metadata in common and specific to different object types. bundle together references or the objects themselves. Relate digital resources snapshot | cite | exchange Research Object Framework
  • 38. COMBINE was early to the party…. Combine Archive Scharm M,Wendland F, Peters M,Wolfien M,TheileT,Waltemath D SEMS, University of Rostock zip-like file with a manifest & metadata - Bundle files - Keep provenance - Exchange data - Ship results Bergmann, F.T. (2014). COMBINE archive and OMEX format: one file to share all information to reproduce a modeling project. BMC bioinformatics,15(1), 1. https://sems.unirostock.de/projects/combinearchive/
  • 39. Big data distributed over multiple locations, Efficiently and safely moved on demand ROs are verified collections of references [Chard, et al 2016] FAIR Research Objects
  • 40. The KnowledgeObject Reference Ontology (KORO): A formalism to support management and sharing of computable biomedical knowledge for learning health systems Flynn, Friedman, Boisvert, Landis‐Lewis, Lagoze (2018), https://doi.org/10.1002/lrh2.10054 Graphs of ROs Track ROs Combine and enrich ROs Learning Health Systems and Research Objects
  • 41. EOSC-Life: FAIR data and tools (workflows, models) for cloud use RI data (distributed over facilities) Ecosystem of innovative tools in EOSC Publish FAIR life science data in EOSC Data Catalogues Tools Catalogues Workflow Catalogues Service Catalogues [Niklas Blomberg]
  • 42.
  • 43. FAIR Challenges for Projects Track collection of data and metadata X X X Maintain experimental context X X Find and exchange assets X X X X Long-term retain results beyond a project X X X Share, disseminate and publish assets sensitively X X X Consistently report for interpretation, interoperability & comparison X X Promote standardised metadata practices. X X Organise and link assets X X X Reuse tools and community archives X X Integrate with other data stores and platforms X X X Support reproducible publications X X X X Credit owners X X
  • 44. Public Project Commons Platform Service hosted at HITS 50+ installations 140+ projects Support
  • 45. A Commons Project Investigations and Assets Simulate model Launch workflows
  • 46. Models SOPs People Projects Publications Documents Presentations Workflows Data Events Federated Catalogue, Integrated view interlinked objects, structured organisation, resources ecosystem Investigations Studies Assays/Analyses
  • 47. Workflows Federated Catalogue, Integrated view interlinked objects, structured organisation, resources ecosystem Stores Archives FAIR Membrane Investigations Studies Assays/Analyses
  • 48. Federated Catalogue, Integrated view A Commons is only as FAIR as the content (inside and outside) FAIR Membrane
  • 49. FAIR(ish) after death …. https://fairdom hub.org/projec ts/129 https://wellcomeopenresearch.org/articles/4-104/v1 Zielinski, Hay, Millar, The grant is dead, long live the data - migration as a pragmatic exit strategy for research data preservation,
  • 50. Data Sovereignty: FAIR but not yet Open A Project Commons not an integrated data warehouse
  • 51. e.g. (Pillar III) in-house in-house All LiSyM Patient-related clinical data Aggregated data API External Tools API Data Sovereignty: FAIR but never Open [Mueller]
  • 52. Data Sovereignty: Personal Health Tram Less automatic, more transparent, when partners cannot share Share table structure Share common code Share summaries
  • 53. FAIR at the First Mile [Christian R Bauer]
  • 54. FAIR at the First Mile Project Commons Integrated Data Warehouse[Christian R Bauer]
  • 55. EU-STAND4PM: First and Last Mile Neylon, Knowledge Exchange Report: http://www.knowledge-exchange.info/event/ke-approach-open-scholarship FAIR at last mile FAIR at first mile / source FAIR Protected Data/Compute FAIR Objects FAIRification
  • 56. EU-STANDS4PM FAIR path through the jungle Indicators and Maturity Models obtainable & understandable Technical infrastructure & Stewardship Skills possible Communities & Culture easy (or at least feasible) User Experience normative rewarding Incentives required Policies Based on Matt Spritzer’s figure, COS
  • 57. Acknowledgements FAIRDOM Team – http://www.fair-dom.org Research Object Team – http://www.researchobject.org BioComputeObject – http://biocomputeobject.org/ FAIR folks, esp. FAIRplus and FAIR Metrics – https://fairplus-project.eu/ – http://www.fairmetrics.org CommonWorkflow Language – http://www.commonwl.org ELIXIR – http://www.elixir-europe.org BioExcel – http://bioexcel.eu