1. FAIRer Research
Professor Carole Goble CBE FREng FBCS
The University of Manchester, UK
carole.goble@manchester.ac.uk
STM Conference, London, 3rd Dec 2014
2. “An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment, [the complete data] and the complete set of instructions which generated the figures.”
David Donoho, “WaveLab and Reproducible Research,” 1995
datasets
data collections
standard operating procedures
software
algorithms
configurations
tools and apps
codes
workflows
scripts
code libraries
services,
system software
infrastructure,
compilers
hardware
8. Data discovery
Data assembly, cleaning, and refinement
Modeling
Statistical analysis
Data collection
Insights
Scholarly Communication & Reporting
Material & Methods
9. BioSTIF
instruments and laboratory
Data discovery
Data assembly, cleaning, and refinement
Modeling
Statistical analysis
Data collection
Insights
Scholarly Communication & Reporting
Material & Methods
15. scientific ego-system for open science
trust, reciprocity, competition
fame
competitive advantage
productivity
credit
adoption
kudos
for love
blame
scooped
uncredited
misinterpretation
scrutiny
shame
insecurity
cost/time/skills
distraction
responsibility
disruption
staff churn
inertia
16. Howard Ratner, STM Innovations Seminar 2012
was: Chair STM Future Labs Committee, CEO EVP Nature Publishing Group,
now: Director of Development for CHORUS (Clearinghouse for the Open Research of the United States)
http://www.youtube.com/watch?v=p-W4iLjLTrQ&list=PLC44A300051D052E5
http://www.myexperiment.org/packs/196.html
17. http://www.researchobject.org/
Outputs are first class citizens to be managed, credited and tracked: data, software
A Framework to Bundle and Relate multi-hosted (digital) resources of a scientific experiment or investigation using standard mechanisms & uniform access protocols.
Carriers of Research Context
Research Objects
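The idea of bundling and relating multi-hosted resources can be sketched in a few lines. This is a minimal illustration only, not the Research Object model or any researchobject.org API: the class names, the manifest keys, the `usesInput` relation and the example.org URLs are all hypothetical.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Resource:
    """One aggregated item; it may live on any host."""
    uri: str
    kind: str  # e.g. "data", "software" (labels are illustrative)


@dataclass
class ResearchObject:
    """Hypothetical, simplified stand-in for a Research Object."""
    identifier: str
    resources: list = field(default_factory=list)
    relations: list = field(default_factory=list)

    def aggregate(self, uri, kind):
        """Add a resource to the bundle by reference, not by copy."""
        res = Resource(uri, kind)
        self.resources.append(res)
        return res

    def relate(self, subject, predicate, obj):
        """Record a relation between two already aggregated resources."""
        known = {r.uri for r in self.resources}
        if subject not in known or obj not in known:
            raise ValueError("both ends of a relation must be aggregated first")
        self.relations.append((subject, predicate, obj))

    def manifest(self):
        """Return a plain dict describing the bundle and its context."""
        return {
            "id": self.identifier,
            "aggregates": [{"uri": r.uri, "kind": r.kind} for r in self.resources],
            "relations": [
                {"subject": s, "predicate": p, "object": o}
                for s, p, o in self.relations
            ],
        }


# Placeholder URLs: the data and the code sit on different hosts.
ro = ResearchObject("https://example.org/ro/demo")
ro.aggregate("https://data.example.org/reads.fastq", "data")
ro.aggregate("https://code.example.org/assemble.py", "software")
ro.relate(
    "https://code.example.org/assemble.py",
    "usesInput",
    "https://data.example.org/reads.fastq",
)
print(json.dumps(ro.manifest(), indent=2))
```

The bundle holds references and relations rather than copies, so data and software are handled uniformly wherever they are hosted.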
18. What is the RO Framework?
• A framework of models and conventions
• Representations
• API specifications
• Implementations mapped into legacy / commodity platforms
19. KnowledgeTurns
Unit of Scholarly Currency, RO Commons
Circulate in the Scholarly Ecosystem
Citation? Credit? Link to Publishers?
Goble, De Roure, Bechhofer, Accelerating Knowledge Turns, I3CK, 2013
20. Goble, De Roure, Bechhofer, Accelerating Knowledge Turns, I3CK, 2013
Collaboration to support safe use of patient and research data for medical research
Farr Commons
Research Object packages codes, study, and metadata to exchange coded descriptions of clinical study cohorts
Search, Discover, Index, Harvest, Port
21. Schopf, Treating Data Like Software: A Case for Production Quality Data, JCDL 2012
Goble, De Roure, Bechhofer, Accelerating Knowledge Turns, I3CK, 2013
Profile Focus
Body of knowledge around methods, workflows, software, data, person, rather than publication.
Citation, credit
22. Release Research
Evolution, Emergence, Discourse, Threaded
Comparison, Historical review, Anti-Salami
Forks, Merges, Fixity, Citation? Credit?
Flow across groups, projects and articles
Schopf, Treating Data Like Software: A Case for Production Quality Data, JCDL 2012
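Treating research like a software release implies being able to tell whether a released item has changed between versions. A minimal sketch of one way to do that, assuming fixity is checked with SHA-256 digests over each resource's bytes; the function names and the in-memory "releases" are hypothetical and stand in for real files.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """Return the SHA-256 hex digest of a resource's bytes."""
    return hashlib.sha256(content).hexdigest()


def snapshot(resources: dict) -> dict:
    """Map each resource name to the digest of its content."""
    return {name: fingerprint(data) for name, data in resources.items()}


def changed(old: dict, new: dict) -> dict:
    """Classify resources as added, removed, or modified between releases."""
    return {
        "added": sorted(set(new) - set(old)),
        "removed": sorted(set(old) - set(new)),
        "modified": sorted(n for n in set(old) & set(new) if old[n] != new[n]),
    }


# Two illustrative releases of the same investigation.
release_1 = snapshot({
    "data.csv": b"a,b\n1,2\n",
    "analysis.py": b"print('v1')\n",
})
release_2 = snapshot({
    "data.csv": b"a,b\n1,2\n",
    "analysis.py": b"print('v2')\n",
    "README": b"notes\n",
})

diff = changed(release_1, release_2)
print(diff)
```

Recording digests at release time gives each version a fixed point against which forks, merges and later releases can be compared.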
23. Reproduce Research
Repeat, Replicate, Recompute, Reuse….
Entropy, Citation? Credit?
icanhascheezburger.com
Zhao et al. Why workflows break: Understanding and combating decay in Taverna workflows, 8th Intl Conf e-Science 2012
Can I repeat & defend my results?
Can I review, reproduce and compare my results/method with your results/method?
Can I review, replicate and certify your results?
Can I transfer your results into my research and reuse this method?
Hettne et al. Structuring research methods and data with the research object model: genomics workflows as a case study, 2014, http://www.jbiomedsem.com/content/pdf/2041-1480-5-41.pdf
26. Checklists aka Minimum Information Models, Reporting Guidelines
Minim Checklist Ontology, http://purl.org/net/mim/ns
Zhao et al. A Checklist-Based Approach for Quality Assessment of Scientific Information, 3rd Intl. Workshop on Linked Science, 2013
Hettne et al. Structuring research methods and data with the research object model: genomics workflows as a case study, 2014, http://www.jbiomedsem.com/content/pdf/2041-1480-5-41.pdf
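A checklist in the minimum-information sense can be read as a set of requirements evaluated against a research object. The sketch below illustrates that idea only and is not the Minim ontology or its tooling: the MUST/SHOULD/MAY levels, the `Requirement` class, the dictionary keys and the file names are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Requirement:
    label: str
    level: str  # assumed levels: "MUST", "SHOULD" or "MAY"
    check: Callable[[dict], bool]


def evaluate(checklist, research_object):
    """Return (satisfied, report); satisfied is False if any MUST item fails."""
    report = [
        (req.level, req.label, req.check(research_object)) for req in checklist
    ]
    satisfied = all(ok for level, _, ok in report if level == "MUST")
    return satisfied, report


# Hypothetical checklist for a workflow-centric research object.
checklist = [
    Requirement("has a workflow definition", "MUST",
                lambda ro: bool(ro.get("workflow"))),
    Requirement("has input data", "MUST",
                lambda ro: bool(ro.get("inputs"))),
    Requirement("has a stated hypothesis", "SHOULD",
                lambda ro: bool(ro.get("hypothesis"))),
]

# Illustrative research object described as a plain dict.
ro = {"workflow": "assembly.t2flow", "inputs": ["reads.fastq"]}

satisfied, report = evaluate(checklist, ro)
for level, label, ok in report:
    print(f"{level:6} {label}: {'pass' if ok else 'FAIL'}")
print("minimally satisfied:", satisfied)
```

Keeping the checklist separate from the object lets the same bundle be assessed against different reporting guidelines.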
33. Nanopub: represents structured data along with its provenance in a single publishable and citable entry
Galaxy workflows: re-enact the analysis
Research Object: aggregates the (digital) resources contributing to findings of (computational) research (results, data and software) as citable compound digital objects
http://isa-tools.github.io/soapdenovo2/
http://sandbox.wf4ever-project.org/portal/ro?ro=http://sandbox.wf4ever-project.org/rodl/ROs/SOAP2denovo2-Aureus/
[Alejandra Gonzalez-Beltran, Philippe Rocca-Serra]
34. • Id & Cite fluid things
• Uniform handling 1st class citizens
• Compound, multi-authored
• Mixed, leaky containers
• Span outcomes, evolve outputs, emergence
• Profiles
• Bridge researchers, platforms, resources
Bechhofer et al., Why linked data is not enough for scientists, DOI: 10.1016/j.future.2011.08.004
38. • Open research is like Open software
• Multi-part, multi-contributor, updating
• Tardis & Commons
• Implications for metrics? publishing?
• Learning from open software development
40. • Barend Mons
• Sean Bechhofer
• Philip Bourne
• Matthew Gamble
• Raul Palma
• Jun Zhao
• Alan Williams
• Stian Soiland-Reyes
• Paul Groth
• Tim Clark
• Juliana Freire
• Alejandra Gonzalez-Beltran
• Philippe Rocca-Serra
• Ian Cottam
• Susanna Sansone
• James Howison
• James Herbsleb
• Kristian Garza
All the members of the Wf4Ever team
iSOCO: Intelligent Software Components S.A., Spain
University of Manchester, School of Computer Science, Manchester, United Kingdom
University of Oxford, Department of Zoology, Oxford, UK
Poznan Supercomputing and Networking Center, Poznan, Poland
IAA: Instituto de Astrofísica de Andalucía, Granada, Spain
Leiden University Medical Centre, Centre for Human and Clinical Genetics, The Netherlands
Colleagues in Manchester’s Information Management Group
RO Advisory Board Members
http://www.researchobject.org
http://www.wf4ever-project.org
http://www.fair-dom.org
http://www.datafairport.org