SlideShare uma empresa Scribd logo
1 de 74
Reproducibility,
Research Objects
and Reality
Professor Carole Goble
The University of Manchester, UK
Software Sustainability Institute, UK
ELIXIR UK,
FAIRDOMAssociation e.V.
carole.goble@manchester.ac.uk
University of Leiden,The Netherlands, 24 November 2016
Acknowledgements
• Dagstuhl Seminar 16041 , January 2016
– http://www.dagstuhl.de/en/program/calendar/semhp/?semnr=16041
• ATI Symposium Reproducibility, Sustainability and Preservation , April 2016
– https://turing.ac.uk/events/reproducibility-sustainability-and-preservation/
– https://osf.io/bcef5/files/
• CTitus Brown
• Juliana Freire
• David De Roure
• Stian Soiland-Reyes
• Barend Mons
• Tim Clark
• Daniel Garijo
• Norman Morrison
• Katy Wolstencroft
Phil Bourne
Natalie Stanford
Jacky Snoep
Stuart Owen
Marco Roos
Kristina Hettne
AlanWilliams
Sean Bechhofer
Ian Fore
Rafael Jimenez
…. And many more
Michael Crusoe
Paul Groth
Niall Beard
Context: Computational Science
http://tpeterka.github.io/maui-project/
From:The Future of ScientificWorkflows, Report of DOEWorkshop 2015,
http://science.energy.gov/~/media/ascr/pdf/programdocuments/docs/workflows_final_report.pd
1. Observational,
experimental
2. Theoretical
3. Simulation
4. Data intensive
Motivation: Knowledge Turning
research infrastructures
• Computational tools
• Sharing platforms
• Knowledge
Exchange
• Reproducible
research
• Software and data
practices
• Policies
[Josh Sommer, for the picture]
Reproducibility
Rampancy
NIH Rigor and Reproducibility
https://www.nih.gov/research-
training/rigor-reproducibility
Plenty of
guidelines
cos.io/top
Plenty of
principles
https://wellcomeopenresearch.org/ Nature Scientific Data
Data as a first class citizen + Data Citation
Scholarly Communications Providers
Software as a first class citizen +
Software Citation
Funders
http://www.acmedsci.ac.uk/policy/policy-projects/reproducibility-and-reliability-of-biomedical-research/
republic of science*
regulation of science
institution cores / libraries / public services
*Merton’s four norms of scientific behaviour (1942)
FAIR
Findable
Accessible
Interoperable
Reusable
Intelligible
Reproducible
Citable
Track & Countable
http://ec.europa.eu/research/participants/data/ref/h2020/
grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
Research Infrastructure for
FAIR Management and Sharing of
Data, Operating Procedures, Model
For Systems and Synthetic Biology
Projects
Research Infrastructure for
FAIR Data for Life Sciences in
Europe
Data-Driven Science
design
cherry picking data, random seed
reporting, non-independent bias, poor
positive and negative controls, dodgy
normalisation, arbitrary cut-offs,
premature data triage, un-validated
materials, improper statistical analysis,
poor statistical power, stop when “get to
the right answer”, software
misconfigurations misapplied black box
software
reporting
incomplete reporting of software configurations, parameters & resource
versions, missed steps, missing data, vague methods, missing software
Empirical Statistical Computational
V. Stodden, IMS Bulletin (2013)
Reproducibility and reliability of biomedical
research: improving research practice
“When I use a word," Humpty Dumpty
said in rather a scornful tone, "it means
just what I choose it to mean - neither
more nor less.”
Carroll, Through the Looking Glass
re-compute
replicate
rerun
repeat
re-examine
repurpose
recreate
reuse
restore
reconstruct review
regenerate
revise
recycle
redo
robustness
tolerance
verificationcompliancevalidation assurance
remix
Scientific publications goals:
(i) announce a result
(ii) convince readers its correct.
Papers in experimental science
should describe the results and
provide a clear enough protocol to
allow successful repetition and
extension.
Papers in computational science
should describe the results and
provide the complete software
development environment, data
and set of instructions which
generated the figures.
VirtualWitnessing*
*Leviathan and theAir-Pump: Hobbes, Boyle, and the
Experimental Life (1985) Shapin and Schaffer.
Jill Mesirov
David Donoho
Computational
Complex Assemblies
Remote Calls
“Micro” Reproducibility
“Macro” Reproducibility
Fixivity
Validate
Verify
Trust
Repeatability:
“Sameness”
Same result
1 Lab
1 experiment
Reproducibility:
“Similarity”
Similar result
> 1 Lab
> 1 experiment
why the differences?
https://2016-oslo-
repeatability.readthedocs.org/en/latest/repeatability-discussion.htm
Validate
Verify
Method Reproducibility
the provision of enough detail about
study procedures and data so the
same procedures could, in theory or in
actuality, be exactly repeated.
Result Reproducibility
(aka replicability)
obtaining the same results from the
conduct of an independent study
whose procedures are as closely
matched to the original experiment
as possible
Goodman, et al ScienceTranslational Medicine 8 (341) 2016
Validate
Verify
Productivity
Track differences
Validate
Verify
reviewers want additional work
statistician wants more runs
analysis needs to be repeated
post-doc leaves,
student arrives
new/revised datasets
updated/new versions of
algorithms/codes
sample was contaminated
better kit - longer simulations
new partners, new projects
Personal & Lab
Productivity
Public Good
Reproducibility
Computational “Datascopes”
Methods
techniques, algorithms,
spec. of the steps, models
Materials
datasets, parameters,
algorithm seeds
Instruments
codes, services, scripts,
underlying libraries,
workflows, ref datasets
Laboratory
sw and hw infrastructure,
systems software,
integrative platforms
computational environment
“Datascope” Practicalities
Methods
Materials
Instruments
Laboratory
Change Dependencies
science,
methods,
datasets
questions stay,
answers change
breakage, labs
decay, services,
techniques and
instruments
change, updated
datasets, services,
codes, hardware
software entropy
one offs,
streams,
stochastics,
sensitivities,
scale,
non-portable data
supercomputer
access
non-portable
software
licensing restrictions
unreliable resources
and third party codes
complexity
Blackboxes
blackbox
software
hidden manual
steps
blackbox
software
hidden manual
steps
Active Instrument
Byte level preservation
Reproduce by RunningReproduce by Reading
Archived Record
Prepare to repair
ELNs
Markup Languages
Reporting Guidelines
Common Formats
Community
vocabularies
Record All
Automate All
Contain All
Expose All
Findable
Accessible
Interoperable
Reusable
provenance
portability
preservation
robustness
versioning
access description
standards
common APIs
licensing
standards,
common metadata
change
variation sensitivity
discrepancy handling
packaging, containers
FAIR RACE shades of reproducibility
dependencies
stepsids
A robust infrastructure
for biological information.
bio.tools
https://usegalaxy.org/
Workflow Description
Workflows Preservation
Workflow Portability
Workflow Interoperability
Workflow Preservation and Exchange
Experiments
Workflows &Workflow Runs
Workflow Commons
Third Party Services
Scattered resources
Workflow Preservation and Exchange
Experiments
Workflows &Workflow Runs
Workflow Commons
Third Party Services
Scattered resources
Rich descriptions
Prepare to Repair
Standards-based metadata framework for bundling resources
with context
Citable Reproducible Packaging
Metadata for bundling resources scattered and stored somewhere else
Container
Research Object in a nutshell
Packaging content & links:
Zip files, BagIt, Docker images
Catalogues & Commons Platforms:
FAIRDOM, myExperiment
Manifest
Construction
Aggregates
link things together
Annotations
about things & their
relationships
Container
Research Object in a nutshell
Manifest
Description
Dependencies
what else is
needed
Versioning
its evolution
Checklists
what should
be there
Provenance
where it
came from
Identification
locate things
regardless where
id
Packaging content & links:
Zip files, BagIt, Docker images
Catalogues & Commons Platforms:
FAIRDOM, myExperiment
Manifest
Construction
Aggregates
link things together
Annotations
about things & their
relationships
Container
Research Object Profile forWorkflows…
Manifest
Description
Identification
locate things
regardless where
Minimum information
for one content type
Common properties
among content
types
Research Object Profile forWorkflows…
Manifest
Description
Minimum information
for one content type
Common properties
among content
types
Belhajjame et al (2015) Using a suite of ontologies for preserving workflow-centric research objects,
JWeb Semantics doi:10.1016/j.websem.2015.01.003
Hettne KM, et al (2014), Structuring research methods and data with the research object model: genomics workflows as a
case study. J. Biomedical Semantics 5: 41
Workflow Research Object Bundles
exchange, portability and maintenance
BagIt
workflows packaged into
various containers for sharing
Checksum
Workflow and Workflow Management System Zoo
https://github.com/common-workflow-language/common-workflow-language/wiki/Existing-Workflow-systems
bio.tools
A community led standard way
of expressing and running
workflows and command line
tools using containers
Ontologies for describing tools
and their inputs and outputs
Metadata framework for the
manifest versioning, file
integrity, more metadata
about the workflow
Workflow fragment containers
Findable
Accessible
Interoperable
Reusable
Data
Operations
Models
Systems and Synthetic Biology Projects
Funder: Legacy!
Partners
Project Support
Community Actions
Platforms,Tools
Web-based Portal
Public Commons
50+ projects
5 programmes
400+ people
22 independent
installations
Systems Approach…
Multiple, interrelated assets, Multiple, dispersed repositories
Literature
SOPS
STANDARDS
versioning,
tracking:
provenance,
parameters,
citation
Operations
FAIR Data and Metadata Standards that
help to improve understanding and exchange….
Nicolas Le Novère, Babraham Institute, UK.
…researchers do not always use them....
… model reuse and reproducibility tricky…
Stanford et alThe evolution of standards and data management practices in systems
biology, Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053
Systems Approach…
teams, processes, multi-partner, multi-discipline, legacy
Funders
Researchers
Publishers
What methods are been used to determine
enzyme activity?
What SOP was used for this sample?
Where is the validation data for this model?
Is there any group generating kinetic data?
Is this data available?
Track versions of my model
Whats the relationship between the data and
model?
Which data belong to
which publications?
FAIR
A Commons
fairdomhub.org
Investigation
Study Analysis
Data
Model
SOP(Assay)
….organised in Investigation, Study, Assay/Analysis format
….registered using Just Enough Results description
….organised in Investigation, Study, Assay/Analysis format
….registered using Just Enough Results description.
Just Enough
Results Model
Common elements
….organised in Investigation, Study, Assay/Analysis format
….registered using Just Enough Results description.
Uploaded into the
FAIRDOM Store
Linked to entry
in Public Archive
Linked to entry in
Project store
... aggregating catalogue
metadata across repositories, retain context-> reproduce, reuse
Local Stores
External
Databases
Publishing services
Secure
Stores
Model
Resources
… in situ reproducible models
metadata annotation against standards
model validation, comparison and simulation
SBML Model simulation
Model comparison
Model versioning
Reproducing simulations
[Jacky Snoep, Dagmar Waltemate, Martin Peters, Martin Scharm]
…. Nested Packages
context and credit
Research Objects
• Link
• Nest
• Span
• Bundle
• Snapshot
Systematic, Standards-
based metadata
framework for logically
and physically bundling
resources with context
• Exchange
• Reproduce
• Release packages
Reproducible Exchange and Publishing
and better credit
Author List: Joe Bloggs; Jane Doe
Title: My Investigation
Date: September 2016
DOI: https://doi.org/10.15490/seek##
information travels with the data and models
How do we do? Pretty well.
Reproducibility window. But that’s ok!
• Can’t contain everything
– Pesky Internet in a Box
• Can’t automate everything
– Pesky people
• Can’t fix everything
– Pesky science
Asthma Research
e-Laboratory
Release builds of
pharmacological
knowledge
warehouse
Exchanging
large datasets
Samiul Hasan, GSK
Biocuration need in Pharma: Drivers from aTranslational Bioinformatics Perspective,
Poster S16
1st EASYMConference, Berlin 2016
Reality
Preparation pain. Goldilocks paradox.
[Norman Morrison]
replication hostility no funding, time, recognition, place to publish
resource intensive access to the complete environment
“Data Parasites”
“Data Flirters”
“Share Drift”
Family
Friends
Potential Friends
Acquaintances
Strangers
Rivals
Reciprocity
Using FAIRDOM my own
lab colleagues saw what I
was doing and called to
collaborate!
Jurgen Hannstra
Vrije Universiteit Amsterdam, Netherlands
Trust …
Half of researchers make research data available
so they can be used by another.
Most not experienced any direct benefits
nor experienced many bad effects.
Caveat:
shared but usable?
fake sharing
funder requirements
fear data will be
misused or
misinterpreted
journal requirements
good research practice
facilitate collaborations
enable validation and
replication
higher citation rates
time and effort
new collaborations
extra funding for cost of data prep
enhance their academic reputation
feedback on how other researchers were using
their data
taken into account in funding
taken into account in career
jeopardise future publications
its not ready to share
scrutiny scruples
answering questions
I won’t get credited
Metadata in by side effect
Tooling for annotations and checklist templates for different types of assay data.
Embed ontologies into
Excel templates
Excel spreadsheets enriched
with ontology annotations
Upload, extract metadata and register
http://www.rightfield.org.uk
Spreadsheet Ramps!!
Sharing by side effect …. libertarian paternalism
[Kristian Garza]
Finding and Citing by side effect
• Schema.org
• Structured
markup in web
pages
• Supported by
Content
Management
Systems
• Harvested by
search engines
• Builds snippets
and sidebars
Bioschemas.org
Data
repository
Data
repository
Training
Resource
Bioschemas Bioschemas Bioschemas
Search engine Bio Registries
Biosharing
OLS, TeSS
bio.tools
UKCRC Tissue
Directory
bioCADDIE DATAMED
PDBe UniProt
Interpro Molgenis Pfam
Gene3DBiosamples
Biobank websites
BRENDA HPA
TransPlantEGA Beacons
EBI-Search
Google
Finding and Citing by side effect
Bioschemas.org
Big co-operative data-driven
science makes reproducibility
desirable but also means
dependency and change are to be
expected
Words matter.
50 Shades of Reproducibility.
form vs function
Reproducibility is not a end.
Beware zealots.
Amplify Side effects
Think Research Objects!

Mais conteúdo relacionado

Mais procurados

Research Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMCarole Goble
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the partsCarole Goble
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use CasesCarole Goble
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceCarole Goble
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCarole Goble
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...FAIRDOM
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...FAIRDOM
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIMartin Scharm
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openlyFAIRDOM
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 

Mais procurados (20)

Research Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOM
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the parts
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 
ROHub
ROHubROHub
ROHub
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openly
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Ngsp
NgspNgsp
Ngsp
 

Semelhante a Reproducibility, Research Objects and Reality, Leiden 2016

Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science Carole Goble
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and modelsmyGrid team
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpCarole Goble
 
The Future of Research (Science and Technology)
The Future of Research (Science and Technology)The Future of Research (Science and Technology)
The Future of Research (Science and Technology)Duncan Hull
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research ObjectsDavid De Roure
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityOscar Corcho
 
RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015William Gunn
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and ScholarshipDavid De Roure
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARECASRAI
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...Open Science Fair
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesDavid De Roure
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurghJun Zhao
 
myExperiment @ Nettab
myExperiment @ NettabmyExperiment @ Nettab
myExperiment @ NettabDuncan Hull
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...Open Science Fair
 
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...GigaScience, BGI Hong Kong
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 

Semelhante a Reproducibility, Research Objects and Reality, Leiden 2016 (20)

Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Aussois bda-mdd-2018
Aussois bda-mdd-2018Aussois bda-mdd-2018
Aussois bda-mdd-2018
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
 
The Future of Research (Science and Technology)
The Future of Research (Science and Technology)The Future of Research (Science and Technology)
The Future of Research (Science and Technology)
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research Objects
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and Scholarship
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social Machines
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh
 
myExperiment @ Nettab
myExperiment @ NettabmyExperiment @ Nettab
myExperiment @ Nettab
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
 
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 

Mais de Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the FutureCarole Goble
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardCarole Goble
 

Mais de Carole Goble (20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
 

Último

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 

Último (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 

Reproducibility, Research Objects and Reality, Leiden 2016

  • 1. Reproducibility, Research Objects and Reality Professor Carole Goble The University of Manchester, UK Software Sustainability Institute, UK ELIXIR UK, FAIRDOMAssociation e.V. carole.goble@manchester.ac.uk University of Leiden,The Netherlands, 24 November 2016
  • 2. Acknowledgements • Dagstuhl Seminar 16041 , January 2016 – http://www.dagstuhl.de/en/program/calendar/semhp/?semnr=16041 • ATI Symposium Reproducibility, Sustainability and Preservation , April 2016 – https://turing.ac.uk/events/reproducibility-sustainability-and-preservation/ – https://osf.io/bcef5/files/ • CTitus Brown • Juliana Freire • David De Roure • Stian Soiland-Reyes • Barend Mons • Tim Clark • Daniel Garijo • Norman Morrison • Katy Wolstencroft Phil Bourne Natalie Stanford Jacky Snoep Stuart Owen Marco Roos Kristina Hettne AlanWilliams Sean Bechhofer Ian Fore Rafael Jimenez …. And many more Michael Crusoe Paul Groth Niall Beard
  • 3. Context: Computational Science http://tpeterka.github.io/maui-project/ From:The Future of ScientificWorkflows, Report of DOEWorkshop 2015, http://science.energy.gov/~/media/ascr/pdf/programdocuments/docs/workflows_final_report.pd 1. Observational, experimental 2. Theoretical 3. Simulation 4. Data intensive
  • 4. Motivation: Knowledge Turning research infrastructures • Computational tools • Sharing platforms • Knowledge Exchange • Reproducible research • Software and data practices • Policies [Josh Sommer, for the picture]
  • 5.
  • 7. NIH Rigor and Reproducibility https://www.nih.gov/research- training/rigor-reproducibility Plenty of guidelines cos.io/top
  • 9. https://wellcomeopenresearch.org/ Nature Scientific Data Data as a first class citizen + Data Citation Scholarly Communications Providers
  • 10. Software as a first class citizen + Software Citation
  • 12. republic of science* regulation of science institution cores / libraries / public services *Merton’s four norms of scientific behaviour (1942)
  • 14. Research Infrastructure for FAIR Management and Sharing of Data, Operating Procedures, Model For Systems and Synthetic Biology Projects Research Infrastructure for FAIR Data for Life Sciences in Europe Data-Driven Science
  • 15.
  • 16. design cherry picking data, random seed reporting, non-independent bias, poor positive and negative controls, dodgy normalisation, arbitrary cut-offs, premature data triage, un-validated materials, improper statistical analysis, poor statistical power, stop when “get to the right answer”, software misconfigurations misapplied black box software reporting incomplete reporting of software configurations, parameters & resource versions, missed steps, missing data, vague methods, missing software Empirical Statistical Computational V. Stodden, IMS Bulletin (2013) Reproducibility and reliability of biomedical research: improving research practice
  • 17. “When I use a word," Humpty Dumpty said in rather a scornful tone, "it means just what I choose it to mean - neither more nor less.” Carroll, Through the Looking Glass re-compute replicate rerun repeat re-examine repurpose recreate reuse restore reconstruct review regenerate revise recycle redo robustness tolerance verificationcompliancevalidation assurance remix
  • 18. Scientific publications goals: (i) announce a result (ii) convince readers its correct. Papers in experimental science should describe the results and provide a clear enough protocol to allow successful repetition and extension. Papers in computational science should describe the results and provide the complete software development environment, data and set of instructions which generated the figures. VirtualWitnessing* *Leviathan and theAir-Pump: Hobbes, Boyle, and the Experimental Life (1985) Shapin and Schaffer. Jill Mesirov David Donoho
  • 21. Repeatability: “Sameness” Same result 1 Lab 1 experiment Reproducibility: “Similarity” Similar result > 1 Lab > 1 experiment why the differences? https://2016-oslo- repeatability.readthedocs.org/en/latest/repeatability-discussion.htm Validate Verify
  • 22. Method Reproducibility the provision of enough detail about study procedures and data so the same procedures could, in theory or in actuality, be exactly repeated. Result Reproducibility (aka replicability) obtaining the same results from the conduct of an independent study whose procedures are as closely matched to the original experiment as possible Goodman, et al ScienceTranslational Medicine 8 (341) 2016 Validate Verify
  • 24. reviewers want additional work statistician wants more runs analysis needs to be repeated post-doc leaves, student arrives new/revised datasets updated/new versions of algorithms/codes sample was contaminated better kit - longer simulations new partners, new projects Personal & Lab Productivity Public Good Reproducibility
  • 25. Computational “Datascopes” Methods techniques, algorithms, spec. of the steps, models Materials datasets, parameters, algorithm seeds Instruments codes, services, scripts, underlying libraries, workflows, ref datasets Laboratory sw and hw infrastructure, systems software, integrative platforms computational environment
  • 26. “Datascope” Practicalities Methods Materials Instruments Laboratory Change Dependencies science, methods, datasets questions stay, answers change breakage, labs decay, services, techniques and instruments change, updated datasets, services, codes, hardware software entropy one offs, streams, stochastics, sensitivities, scale, non-portable data supercomputer access non-portable software licensing restrictions unreliable resources and third party codes complexity Blackboxes blackbox software hidden manual steps blackbox software hidden manual steps
  • 27.
  • 28. Active Instrument Byte level preservation Reproduce by RunningReproduce by Reading Archived Record Prepare to repair ELNs Markup Languages Reporting Guidelines Common Formats Community vocabularies
  • 29. Record All Automate All Contain All Expose All Findable Accessible Interoperable Reusable
  • 30. provenance portability preservation robustness versioning access description standards common APIs licensing standards, common metadata change variation sensitivity discrepancy handling packaging, containers FAIR RACE shades of reproducibility dependencies stepsids
  • 31. A robust infrastructure for biological information. bio.tools
  • 33. Workflow Preservation and Exchange Experiments Workflows &Workflow Runs Workflow Commons Third Party Services Scattered resources
  • 34. Workflow Preservation and Exchange Experiments Workflows &Workflow Runs Workflow Commons Third Party Services Scattered resources Rich descriptions Prepare to Repair
  • 35. Standards-based metadata framework for bundling resources with context Citable Reproducible Packaging Metadata for bundling resources scattered and stored somewhere else
  • 36. Container Research Object in a nutshell Packaging content & links: Zip files, BagIt, Docker images Catalogues & Commons Platforms: FAIRDOM, myExperiment
  • 37. Manifest Construction Aggregates link things together Annotations about things & their relationships Container Research Object in a nutshell Manifest Description Dependencies what else is needed Versioning its evolution Checklists what should be there Provenance where it came from Identification locate things regardless where id Packaging content & links: Zip files, BagIt, Docker images Catalogues & Commons Platforms: FAIRDOM, myExperiment
  • 38. Manifest Construction Aggregates link things together Annotations about things & their relationships Container Research Object Profile forWorkflows… Manifest Description Identification locate things regardless where Minimum information for one content type Common properties among content types
  • 39. Research Object Profile forWorkflows… Manifest Description Minimum information for one content type Common properties among content types
  • 40. Belhajjame et al (2015) Using a suite of ontologies for preserving workflow-centric research objects, JWeb Semantics doi:10.1016/j.websem.2015.01.003 Hettne KM, et al (2014), Structuring research methods and data with the research object model: genomics workflows as a case study. J. Biomedical Semantics 5: 41 Workflow Research Object Bundles exchange, portability and maintenance BagIt workflows packaged into various containers for sharing Checksum
  • 41. Workflow and Workflow Management System Zoo https://github.com/common-workflow-language/common-workflow-language/wiki/Existing-Workflow-systems
  • 42. bio.tools A community led standard way of expressing and running workflows and command line tools using containers Ontologies for describing tools and their inputs and outputs Metadata framework for the manifest versioning, file integrity, more metadata about the workflow Workflow fragment containers
  • 45. Project Support Community Actions Platforms,Tools Web-based Portal Public Commons 50+ projects 5 programmes 400+ people 22 independent installations
  • 46. Systems Approach… Multiple, interrelated assets, Multiple, dispersed repositories Literature SOPS STANDARDS versioning, tracking: provenance, parameters, citation Operations
  • 47. FAIR Data and Metadata Standards that help to improve understanding and exchange…. Nicolas Le Novère, Babraham Institute, UK. …researchers do not always use them....
  • 48. … model reuse and reproducibility tricky… Stanford et alThe evolution of standards and data management practices in systems biology, Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053
  • 49. Systems Approach… teams, processes, multi-partner, multi-discipline, legacy Funders Researchers Publishers
  • 50. What methods are been used to determine enzyme activity? What SOP was used for this sample? Where is the validation data for this model? Is there any group generating kinetic data? Is this data available? Track versions of my model Whats the relationship between the data and model? Which data belong to which publications? FAIR
  • 52.
  • 53. Investigation Study Analysis Data Model SOP(Assay) ….organised in Investigation, Study, Assay/Analysis format ….registered using Just Enough Results description
  • 54. ….organised in Investigation, Study, Assay/Analysis format ….registered using Just Enough Results description. Just Enough Results Model Common elements
  • 55. ….organised in Investigation, Study, Assay/Analysis format ….registered using Just Enough Results description. Uploaded into the FAIRDOM Store Linked to entry in Public Archive Linked to entry in Project store
  • 56. ... aggregating catalogue metadata across repositories, retain context-> reproduce, reuse Local Stores External Databases Publishing services Secure Stores Model Resources
  • 57. … in situ reproducible models metadata annotation against standards model validation, comparison and simulation SBML Model simulation Model comparison Model versioning Reproducing simulations [Jacky Snoep, Dagmar Waltemate, Martin Peters, Martin Scharm]
  • 59. Research Objects • Link • Nest • Span • Bundle • Snapshot Systematic, Standards- based metadata framework for logically and physically bundling resources with context • Exchange • Reproduce • Release packages
  • 60. Reproducible Exchange and Publishing and better credit Author List: Joe Bloggs; Jane Doe Title: My Investigation Date: September 2016 DOI: https://doi.org/10.15490/seek## information travels with the data and models
  • 61. How do we do? Pretty well. Reproducibility window. But that’s ok! • Can’t contain everything – Pesky Internet in a Box • Can’t automate everything – Pesky people • Can’t fix everything – Pesky science
  • 62. Asthma Research e-Laboratory Release builds of pharmacological knowledge warehouse Exchanging large datasets
  • 63. Samiul Hasan, GSK Biocuration need in Pharma: Drivers from aTranslational Bioinformatics Perspective, Poster S16 1st EASYMConference, Berlin 2016 Reality
  • 64. Preparation pain. Goldilocks paradox. [Norman Morrison] replication hostility no funding, time, recognition, place to publish resource intensive access to the complete environment
  • 65. “Data Parasites” “Data Flirters” “Share Drift” Family Friends Potential Friends Acquaintances Strangers Rivals Reciprocity
  • 66. Using FAIRDOM my own lab colleagues saw what I was doing and called to collaborate! Jurgen Hannstra Vrije Universiteit Amsterdam, Netherlands Trust …
  • 67.
  • 68.
  • 69. Half of researchers make research data available so they can be used by another. Most not experienced any direct benefits nor experienced many bad effects. Caveat: shared but usable? fake sharing funder requirements fear data will be misused or misinterpreted journal requirements good research practice facilitate collaborations enable validation and replication higher citation rates time and effort new collaborations extra funding for cost of data prep enhance their academic reputation feedback on how other researchers were using their data taken into account in funding taken into account in career jeopardise future publications its not ready to share scrutiny scruples answering questions I won’t get credited
  • 70. Metadata in by side effect Tooling for annotations and checklist templates for different types of assay data. Embed ontologies into Excel templates Excel spreadsheets enriched with ontology annotations Upload, extract metadata and register http://www.rightfield.org.uk Spreadsheet Ramps!!
  • 71. Sharing by side effect …. libertarian paternalism [Kristian Garza]
  • 72. Finding and Citing by side effect • Schema.org • Structured markup in web pages • Supported by Content Management Systems • Harvested by search engines • Builds snippets and sidebars Bioschemas.org
  • 73. Data repository Data repository Training Resource Bioschemas Bioschemas Bioschemas Search engine Bio Registries Biosharing OLS, TeSS bio.tools UKCRC Tissue Directory bioCADDIE DATAMED PDBe UniProt Interpro Molgenis Pfam Gene3DBiosamples Biobank websites BRENDA HPA TransPlantEGA Beacons EBI-Search Google Finding and Citing by side effect Bioschemas.org
  • 74. Big co-operative data-driven science makes reproducibility desirable but also means dependency and change are to be expected Words matter. 50 Shades of Reproducibility. form vs function Reproducibility is not a end. Beware zealots. Amplify Side effects Think Research Objects!