myExperiment Research Objects: Beyond Workflows and Packs
Stian Soiland-Reyes [1], Don Cruickshank [2], Finn Bacall [1], Jun Zhao [2], Khalid Belhajjame [1], David De Roure [3], Carole A. Goble [1]
[1] School of Computer Science, University of Manchester, UK
[2] Department of Zoology, University of Oxford, UK
[3] Oxford e-Research Centre, University of Oxford, UK
ABSTRACT
We have evolved Research Objects as a way to preserve digital resources
related to research, providing mechanisms, formats and an architecture for
describing aggregated resources
(hypotheses, workflows, datasets, scripts, services), their relations (is input
for, explains, used by), provenance (graph was derived from datasets A, B and C) and
attribution (who contributed what, and when?).
The website myExperiment is already popular for collaborating on, publishing and
sharing scientific workflows. However, we have found that a workflow's definition
alone is not enough for understanding and preserving it over time, especially in the
face of workflow decay, as services and tools change. We have therefore
adapted the Research Object model as a foundation for myExperiment
packs, allowing users to upload workflow runs, inputs, outputs and other files relevant
to the workflow and to relate them with annotations. We have also integrated the Wf4Ever
architecture for performing decay analysis and tracking a research object’s
evolution as it and its constituent resources change over time.
MAKING RESEARCH OBJECTS
myExperiment is a website for collaboration and sharing of experiments, in
particular scientific workflows. We are enhancing myExperiment’s packs to be based
on the Research Object model, allowing users to form collections of
workflows, example input data, results, presentation slides, hypotheses, workflow
runs and documentation, effectively building a Research Object (RO). These
uploaded resources can then be further related, typed, described and given their
own attribution and provenance records. ROs are versioned and shareable.
WF4EVER ARCHITECTURE
The architecture for Research Objects is realized as a Linked Data platform of
RESTful web services that support preservation aspects such as decay monitoring
and evolution tracking, presented to the user through a regular web interface on
myExperiment.
RESEARCH OBJECT MODEL
A research object (RO) is described in an RDF manifest which lists the aggregated
resources and their annotations as separate RDF graphs containing user
annotations (title, description, example value), typing information (hypothesis,
workflow, input data, etc) and automatically extracted metadata (provenance,
workflow structure).
The ontologies for the RO Model are based on standards for aggregations (OAI-ORE)
and annotations (Annotation Ontology; W3C Open Annotation Core, OAC).
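To make the manifest structure concrete, here is a minimal Python sketch; it is not the Wf4Ever implementation. It represents RDF triples as plain tuples. The terms ore:aggregates, oa:hasTarget and oa:hasBody come from the poster's model diagram, while the function names, the file names under example.org and the example.org URIs themselves are assumptions made only for illustration.

```python
# Namespaces for the two standards named above.
ORE = "http://www.openarchives.org/ore/terms/"
OA = "http://www.w3.org/ns/oa#"


def build_manifest(ro_uri, resources, annotations):
    """Return a set of (subject, predicate, object) triples for an RO manifest.

    resources:   iterable of resource URIs aggregated by the RO
    annotations: iterable of (annotation_uri, target_uri, body_graph_uri)
    """
    triples = set()
    for resource in resources:
        triples.add((ro_uri, ORE + "aggregates", resource))
    for ann_uri, target, body in annotations:
        # The annotation is itself aggregated, and links a target resource
        # to a separate RDF graph (the body) that describes it.
        triples.add((ro_uri, ORE + "aggregates", ann_uri))
        triples.add((ann_uri, OA + "hasTarget", target))
        triples.add((ann_uri, OA + "hasBody", body))
    return triples


def to_ntriples(triples):
    """Serialize URI-only triples as sorted N-Triples lines."""
    return "\n".join("<%s> <%s> <%s> ." % t for t in sorted(triples))


if __name__ == "__main__":
    base = "http://example.org/ro/1/"  # hypothetical RO location
    manifest = build_manifest(
        base,
        resources=[base + "workflow.t2flow", base + "inputA.txt"],
        annotations=[(base + ".ro/ann1", base + "workflow.t2flow",
                      base + ".ro/ann1.ttl")],
    )
    print(to_ntriples(manifest))
```

Keeping each annotation body in its own graph, as the model describes, means user annotations and automatically extracted metadata can be added or replaced without rewriting the manifest's list of aggregated resources.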
RESEARCH OBJECTS AS FILES
An RO Bundle is a JSON-LD-based serialization of a research object as a ZIP file
(in the style of Adobe UCF and ePub), allowing a hybrid of embedded resources and external
references (URIs). This allows a self-contained RO to be downloaded, transferred,
modified and inspected without requiring a dedicated web server, which makes it well
suited for desktop environments such as scientific workflow systems.
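As a rough illustration of this packaging, the following Python sketch writes a bundle with the standard zipfile module. The mimetype string and the .ro/manifest.json path are taken from the poster. The manifest keys ("@context", "id", "aggregates", "uri"), the context URL and the function name are assumptions for illustration, and the RO Bundle specification at https://w3id.org/bundle remains the authority on the actual manifest format.

```python
import io
import json
import zipfile

MIMETYPE = "application/vnd.wf4ever.robundle+zip"  # as shown on the poster


def write_bundle(fileobj, embedded, external_uris):
    """Write a ZIP bundle mixing embedded files and external URI references.

    embedded:      dict mapping path inside the ZIP -> bytes
    external_uris: list of URIs that are referenced but not embedded
    """
    manifest = {
        # Assumed manifest layout; check the specification before relying on it.
        "@context": ["https://w3id.org/bundle/context"],
        "id": "/",
        "aggregates": (
            [{"uri": "/" + path} for path in sorted(embedded)]
            + [{"uri": uri} for uri in external_uris]
        ),
    }
    with zipfile.ZipFile(fileobj, "w", zipfile.ZIP_DEFLATED) as zf:
        # UCF/ePub convention: "mimetype" is the first entry, uncompressed.
        zf.writestr("mimetype", MIMETYPE, compress_type=zipfile.ZIP_STORED)
        for path in sorted(embedded):
            zf.writestr(path, embedded[path])
        zf.writestr(".ro/manifest.json", json.dumps(manifest, indent=2))
    return manifest


# Build a small bundle in memory; the file names are placeholders.
buffer = io.BytesIO()
write_bundle(
    buffer,
    embedded={"inputA.txt": b"example input", "outputA.txt": b"example output"},
    external_uris=["http://example.org/dataset"],
)
```

Storing the mimetype entry first and uncompressed lets tools identify the file type by reading a fixed region at the start of the archive, which is the same trick ePub uses.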
Taverna uses RO Bundle to make a workflow run bundle, a single file that contains
the input and output values and the workflow definition, complete with provenance
of the run and intermediate values. This allows a workflow run to be shared, e.g.
by uploading it to myExperiment, and later reloaded in a different Taverna installation.
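Because a workflow run bundle is an ordinary ZIP file, it can be inspected with standard tools. The sketch below is an assumption-laden illustration and not Taverna's own loader: it checks the mimetype entry, loads .ro/manifest.json and reports whether the workflowrun.prov.ttl provenance file named on the poster is present. The function name and the shape of the returned dictionary are hypothetical.

```python
import json
import zipfile

EXPECTED_MIMETYPE = "application/vnd.wf4ever.robundle+zip"  # from the poster


def inspect_bundle(fileobj):
    """Open a bundle (path or file-like object) and summarize its contents."""
    with zipfile.ZipFile(fileobj) as zf:
        names = zf.namelist()
        if "mimetype" not in names:
            raise ValueError("not an RO Bundle: no mimetype entry")
        mimetype = zf.read("mimetype").decode("ascii").strip()
        if mimetype != EXPECTED_MIMETYPE:
            raise ValueError("unexpected mimetype: %r" % mimetype)
        manifest = json.loads(zf.read(".ro/manifest.json"))
        # Payload files: everything except the mimetype marker, the .ro/
        # metadata folder and explicit directory entries.
        files = [
            name for name in names
            if name != "mimetype"
            and not name.startswith(".ro/")
            and not name.endswith("/")
        ]
        return {
            "manifest": manifest,
            "files": files,
            "has_provenance": "workflowrun.prov.ttl" in names,
        }
```

A call such as `inspect_bundle("run.bundle.zip")`, where the file name is a placeholder, returns the parsed manifest alongside the list of embedded files without needing any web service.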
This work was enabled by the Wf4Ever project funded by the European Commission’s 7th FWP
(FP7-ICT-2007-6 270192), and the myGrid platform grant by the EPSRC (EP/G026238/1).
Project sites: http://www.myexperiment.org/ and http://www.wf4ever-project.org/
Source code: http://myexperiment.rubyforge.org/svn/ and https://github.com/wf4ever/
Licenses: BSD 3-Clause License, MIT license
http://www.researchobject.org/
[Figure: Research Object model (http://purl.org/wf4ever/model). Labels: Research Object, Manifest, Resource, Annotation, Annotation graph; relations: ore:aggregates, oa:hasTarget, oa:hasBody.]
[Figure: Aggregating in Research Object. ZIP folder structure (RO Bundle, https://w3id.org/bundle). Entries shown: mimetype (application/vnd.wf4ever.robundle+zip), inputA.txt, outputA.txt, outputC.jpg, outputB/, intermediates/, 1.txt, 2.txt, 3.txt, de/def2e58b-50e2-4949-9980-fd310166621a.txt, workflowrun.prov.ttl (provenance), .ro/manifest.json. Other labels: workflow, URI references, attribution, execution environment.]
[Figure: myExperiment pack, http://alpha.myexperiment.org/packs/387]
The Research Object is stored and manipulated in a Research Object Digital Library
using REST APIs, allowing any tool to view and modify the RO, like the RO portal (http://sandbox.wf4ever-project.org/portal/).
[Figure: Wf4Ever architecture. Components: REST API, RDF triple store (RO structure, annotations), RO index, uploaded files, RO Portal, Checklist service, ...]

2013 07-18 myExperiment research objects poster (PPTX)
