2014-04-24 A dip into Research Objects


Technical dive into the serialization of Research Objects (RO) as RO Bundles, an Adobe UCF-based ZIP format with a JSON-LD manifest. https://w3id.org/bundle http://www.researchobject.org/ Presented on 2014-04-24 at COMBINE HARMONY 2014 http://co.mbine.org/events/HARMONY_2014 PPTX source: https://onedrive.live.com/view.aspx?cid=37935FEEE4DF1087&resid=37935FEEE4DF1087%21679&app=PowerPoint&authkey=%21AI6c4YT_419J3zY&wdo=1


A dip into Research Objects
Stian Soiland-Reyes
myGrid, University of Manchester
HARMONY 2014, Manchester, 2014-04-24
This work is licensed under a Creative Commons Attribution 3.0 Unported License

Saving a research object: RO bundle
Single, transferable research object
Self-contained snapshot
Which files in ZIP, which are URIs? (Up to user/application)
Regular ZIP file, explored and unpacked with standard tools
JSON manifest is programmatically accessible without RDF understanding
Works offline and in desktop applications – no REST API access required
Basis for RO-enabled file formats, e.g. Taverna run bundle
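
As a rough illustration of "standard tools" and "no RDF understanding", the sketch below opens a bundle with Python's built-in zipfile and json modules only. The helper name read_bundle and the demo file outputA.txt are made up for this example, and the manifest keys ("id", "aggregates", "uri") follow one reading of the specification at https://w3id.org/bundle, so treat them as assumptions rather than a reference.

```python
import io
import json
import zipfile

MANIFEST_PATH = ".ro/manifest.json"  # manifest location named on the slides


def read_bundle(source):
    """Return (mimetype, manifest dict, entry names) for an RO Bundle.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as bundle:
        mimetype = bundle.read("mimetype").decode("ascii").strip()
        manifest = json.loads(bundle.read(MANIFEST_PATH).decode("utf-8"))
        return mimetype, manifest, bundle.namelist()


# Build a tiny in-memory bundle so the sketch runs without files on disk.
# The manifest content here is a hypothetical minimum, not a full example.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as demo:
    demo.writestr("mimetype", "application/vnd.wf4ever.robundle+zip")
    demo.writestr(MANIFEST_PATH, json.dumps(
        {"id": "/", "aggregates": [{"uri": "/outputA.txt"}]}))
    demo.writestr("outputA.txt", "hello")
buffer.seek(0)

mimetype, manifest, names = read_bundle(buffer)
print(mimetype)
print(names)
print([entry["uri"] for entry in manifest["aggregates"]])
```

Nothing here is specific to Research Objects beyond two well-known paths, which is the point of the format: any ZIP and JSON library should be enough to get started.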

Workflow Results Bundle
https://w3id.org/bundle
[Figure: a workflow run aggregated in a Research Object as a ZIP folder structure (RO Bundle). Files and folders shown: workflowrun.prov.ttl (RDF), outputA.txt, outputC.jpg, outputB/, intermediates/, 1.txt, 2.txt, 3.txt, de/def2e58b-50e2-4949-9980-fd310166621a.txt, inputA.txt, workflow. Other labels: URI references, attribution, execution environment.]
mimetype: application/vnd.wf4ever.robundle+zip
.ro/manifest.json
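
The folder structure above can be produced with an ordinary ZIP library. The following is a minimal sketch, not the author's implementation: write_bundle and the file paths are hypothetical. Writing the mimetype entry first and uncompressed follows the convention used by UCF and ePub containers; check the RO Bundle specification for the exact requirement before relying on it.

```python
import io
import json
import zipfile

MIMETYPE = "application/vnd.wf4ever.robundle+zip"


def write_bundle(target, manifest, files):
    """Write an RO Bundle to `target` (a path or binary file-like object).

    `files` maps paths inside the ZIP to their content (str or bytes).
    """
    with zipfile.ZipFile(target, "w", compression=zipfile.ZIP_DEFLATED) as bundle:
        # UCF/ePub convention: "mimetype" is the first entry, stored uncompressed.
        bundle.writestr("mimetype", MIMETYPE, compress_type=zipfile.ZIP_STORED)
        bundle.writestr(".ro/manifest.json", json.dumps(manifest, indent=2))
        for path, data in files.items():
            bundle.writestr(path, data)


# Hypothetical content, loosely modelled on the Workflow Results Bundle slide.
files = {
    "inputA.txt": "input value",
    "outputA.txt": "output value",
    "outputB/1.txt": "first list item",
}
manifest = {
    "id": "/",
    "aggregates": [{"uri": "/" + path} for path in files]
    + [{"uri": "http://example.com/project"}],  # external URI, not stored in the ZIP
}

buffer = io.BytesIO()
write_bundle(buffer, manifest, files)

# Re-open the result to confirm the layout.
buffer.seek(0)
with zipfile.ZipFile(buffer) as bundle:
    first = bundle.infolist()[0]
    names = bundle.namelist()
print(first.filename, first.compress_type == zipfile.ZIP_STORED)
print(names)
```

Keeping external URIs in the manifest only, without a matching ZIP entry, mirrors the slide's point that the application decides what is packed and what is merely referenced.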

RO Bundle
.ro/manifest.json
[Figure: annotated example of .ro/manifest.json. Callouts: Format; JSON-LD context → RDF; RO provenance (Who made the RO? When?); What is aggregated? File in ZIP or external URI; External URIs placed in folders; Embedded annotation; External annotation, e.g. blogpost.]
Note: JSON "quotes" not shown in the figure for brevity
http://json-ld.org/
http://orcid.org/
https://w3id.org/bundle

http://json-ld.org/
http://www.w3.org/TR/json-ld/
Defines RDF triples:
<http://dbpedia.org/resource/John_Lennon> <http://xmlns.com/foaf/0.1/name> "John Lennon" .
<http://dbpedia.org/resource/John_Lennon> <http://schema.org/birthDate> "1940-10-09" .
<http://dbpedia.org/resource/John_Lennon> <http://schema.org/spouse> <http://dbpedia.org/resource/Cynthia_Lennon> .
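
To make the JSON-to-RDF step concrete, here is a deliberately tiny sketch of how a flat JSON-LD document maps to the three triples on this slide. The term names (name, born, spouse) are assumptions modelled on the json-ld.org example, and to_ntriples is a hand-rolled illustration that handles only this flat shape; real code should use a proper JSON-LD processor.

```python
import json

# Hypothetical JSON-LD document; the @context maps short keys to property IRIs.
doc = json.loads("""
{
  "@context": {
    "name": "http://xmlns.com/foaf/0.1/name",
    "born": "http://schema.org/birthDate",
    "spouse": {"@id": "http://schema.org/spouse", "@type": "@id"}
  },
  "@id": "http://dbpedia.org/resource/John_Lennon",
  "name": "John Lennon",
  "born": "1940-10-09",
  "spouse": "http://dbpedia.org/resource/Cynthia_Lennon"
}
""")


def to_ntriples(doc):
    """Map a flat JSON-LD object with an inline context to N-Triples lines.

    Handles only string values and the two context forms used above.
    """
    context = doc["@context"]
    subject = doc["@id"]
    lines = []
    for key, value in doc.items():
        if key.startswith("@"):
            continue  # keywords such as @context and @id are not properties
        term = context[key]
        if isinstance(term, dict):
            predicate = term["@id"]
            value_is_iri = term.get("@type") == "@id"
        else:
            predicate, value_is_iri = term, False
        obj = f"<{value}>" if value_is_iri else json.dumps(value)
        lines.append(f"<{subject}> <{predicate}> {obj} .")
    return lines


triples = to_ntriples(doc)
for line in triples:
    print(line)
```

An application that ignores @context still sees plain keys like name and spouse, which is what lets the RO Bundle manifest serve both JSON and RDF consumers.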

API for RO bundles
https://github.com/wf4ever/robundle/



3. ZIP-based format (Adobe UCF, ePub)


7. RO Bundle manifest as RDF


Editor's Notes

When I first heard about Provenance, I thought it was something French, like Provence. Provenance is classically understood as where something comes from (Origin); as in this example – are the shallots from Holland or France? Was there some kind of Derivation that changed their nationality? Obviously, if we are going to talk about something’s provenance, we have to be clear about what that thing is. The shallots? The sign? The picture? Or this Flickr page?
Provenance also covers other aspects, mainly Attribution (who did it?), Dates (when?), and Activities (what happened?). There are Attributes to describe the state of the thing. Perhaps not always considered provenance, but relevant anyway, are Aggregations (one thing is part of another), Licensing (can I use it?) and of course Annotations – what do others say about it?
Let’s take an example of a biomedical lab that sequences genome data. There would be lots of questions relating to attribution – different people play different roles, and may even act on behalf of others. We can call these Agents – things that can perform stuff. People are obvious agents, as are organizations (like The Lab), but software can also be an active agent.
When we talk about things, or entities, we might want to relate them to each other. An extracted genome can be said to be derived from the sample. The sequence we select from the genome is a kind of quote. The result we get from analysing this is derived from the sequence, and is a revision of the old result – which again has its own chain of influences which might differ.
Activities are what is happening – typically using existing entities and generating new ones, to some degree under the control of one or more agents. Taken together, you can describe a whole lineage of activities that generate and consume each other’s entities.
So these three classes are at the core of the W3C PROV model, which we have helped build. An Entity is derived from other entities, and attributed to an Agent. An Activity uses one entity and generates another, and is associated with an agent.
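
The three core classes and their relations can be sketched as plain data structures. This is an illustration only, not a PROV library: the field names mirror the PROV relations wasDerivedFrom, wasGeneratedBy, wasAttributedTo, used and wasAssociatedWith, while the class layout, the lineage helper and the lab example data are invented here, loosely following the genome example in these notes.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Agent:
    name: str


@dataclass
class Activity:
    name: str
    used: List["Entity"] = field(default_factory=list)
    was_associated_with: Optional[Agent] = None


@dataclass
class Entity:
    name: str
    was_derived_from: Optional["Entity"] = None
    was_generated_by: Optional[Activity] = None
    was_attributed_to: Optional[Agent] = None


def lineage(entity):
    """Follow was_derived_from links back to the original entity."""
    chain = []
    while entity is not None:
        chain.append(entity.name)
        entity = entity.was_derived_from
    return chain


lab = Agent("The Lab")
sample = Entity("sample", was_attributed_to=lab)
sequencing = Activity("sequencing", used=[sample], was_associated_with=lab)
genome = Entity("genome", was_derived_from=sample,
                was_generated_by=sequencing, was_attributed_to=lab)
sequence = Entity("sequence", was_derived_from=genome)
result = Entity("result", was_derived_from=sequence)

print(lineage(result))  # ['result', 'sequence', 'genome', 'sample']
```

The sketch allows only one derivation source per entity for brevity; in PROV an entity may be derived from several others.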
http://purl.org/wf4ever/model
Most of the user-contributed content in a research object is recorded as annotations
Typing of resources and relating them to each other are individual annotations
The annotation framework basically allows “any annotation”, so we had to write guidelines on which annotation properties we are going to recommend and “natively” understand. We reused existing vocabularies like Dublin Core Terms, PROV and PAV, but also had to make our own more specific vocabularies.
Not everyone is able to set up a RESTful semantic web server; in particular we’ve run into this with desktop applications, where users just want to save files and decide for themselves where they are stored. So we decided to write a serialization format for Research Objects, which we call the RO Bundle. We wanted this to be accessible to application developers, so we’ve adopted ZIP and JSON, and in a way this lets you create research objects and make annotations without ever seeing any RDF.
So let’s have a look at what a Research Object looks like. The core is the concept of the Research Object itself, which you may also know as an ORE aggregation. This is described by the manifest, which is simply an RDF file. The RO aggregates a series of resources – in Linked Data these could be anywhere in the world. Additionally it aggregates a set of annotations, each of which is the link between a target resource (here aggregated in the RO) and a body resource. In Wf4Ever we typically provide the body as a separate RDF graph, so that we can use existing vocabularies to describe and relate the resources.
This is how we represent a workflow run as a Workflow Results RO Bundle. We aggregate the workflow outputs, the workflow definition, the inputs used for execution, a description of the execution environment, external URI references (such as the project homepage) and attribution to the scientists who contributed to the bundle. This effectively forms a Research Object, all tied together by the RO Bundle manifest, which is in JSON-LD format (normal JSON that is also valid RDF).
This shows how the JSON manifest focuses on the most common aspects of a research object: who made it? When? What is aggregated? Aggregated resources can be files in the ZIP but also external URIs; it is up to the application or person making the bundle to decide what is included in the ZIP. Annotations are included at the bottom here. We see that there’s an annotation “about” (target) the analysis JPEG, and its content (the body) is within the annotations/ folder. Similarly, the next annotation relates an external resource (a blog post) to our aggregation of a resource. This is processable as JSON-LD, so it is not just JSON, it is also RDF, and out come normal ORE aggregations and OA annotations.
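
The questions the manifest answers can be read straight from the JSON. The sketch below is an assumption-laden illustration: the manifest is a made-up example, its key names (createdOn, createdBy, aggregates, annotations, about, content) follow one reading of https://w3id.org/bundle, and summarize and is_in_zip are hypothetical helpers. A URI without a scheme is treated as a path inside the ZIP.

```python
from urllib.parse import urlparse

# Hypothetical manifest; people and URLs are placeholders.
manifest = {
    "@context": ["https://w3id.org/bundle/context"],
    "id": "/",
    "createdOn": "2014-04-24T10:00:00Z",
    "createdBy": {"uri": "http://example.org/people/alice", "name": "Alice"},
    "aggregates": [
        {"uri": "/analysis.jpeg"},
        {"uri": "http://example.com/blog/"},
    ],
    "annotations": [
        {"about": "/analysis.jpeg", "content": "annotations/analysis.ttl"},
        {"about": "/", "content": "http://example.com/blog/"},
    ],
}


def is_in_zip(uri):
    """True for relative or absolute paths, False for URIs with a scheme."""
    return urlparse(uri).scheme == ""


def summarize(manifest):
    """Answer: who made it, when, what is aggregated, what is annotated."""
    uris = [entry["uri"] for entry in manifest.get("aggregates", [])]
    annotations = manifest.get("annotations", [])
    return {
        "creator": manifest.get("createdBy", {}).get("name"),
        "created": manifest.get("createdOn"),
        "in_zip": [u for u in uris if is_in_zip(u)],
        "external": [u for u in uris if not is_in_zip(u)],
        "embedded_annotations": [a["about"] for a in annotations
                                 if is_in_zip(a["content"])],
        "external_annotations": [a["about"] for a in annotations
                                 if not is_in_zip(a["content"])],
    }


summary = summarize(manifest)
for key, value in summary.items():
    print(key, value)
```

No RDF tooling is involved here; a consumer that wants ORE aggregations and OA annotations would feed the same file to a JSON-LD processor.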
https://github.com/wf4ever/robundle/
Here’s another example: light-weight usage of RDFa to turn a normal index.html into a research object. Here the author is given as the creator of the RO, and the Excel files that helped form this analysis are aggregated by the research object. This way of using the Research Object model requires no infrastructure or special packaging, and we have augmented this page to also offer a downloadable RO Bundle, so you can get all the aggregated resources in one go.
