SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
Grant agreement no.: 27092




Workflows Preservation!
Data Curation and Preservation Session!

              Jose Enrique Ruiz!
                   IAA-CSIC!
                          !
                October 17th 2011!
       2011 IVOA Fall Interop Meeting - Pune!
Who am I ?!

Instituto Astrofísica de Andalucia - CSIC!




                                                       2
What is Wf4Ever ?!

EU funded FP7 STREP Project!
December 2010 – December 2013 !
                      1.  Intelligent Software Components (ISOCO, Spain)!
                      2.  University of Manchester (UNIMAN, UK)!
   2     7            3.  Universidad Politécnica de Madrid (UPM, Spain)!
    5!       4!       4.  Poznan Supercomputing and Networking Centre
                          (PSNC, Poland)!
                      5.  Universisty of Oxford (OXF, UK)!
                      6.  Instituto de Astrofísica de Andalucía (IAA, Spain)!
 1! 3!
                      7.  Leiden University Medical Centre (LUMC, NL)!
  6!




                                                                                3
What is Wf4Ever ?!

Technological infrastructure for the preservation and efficient retrieval and reuse
                of scientific workflows in a range of disciplines!

Partners!
•  One SME!
                                           Goals!
•  Six public organizations!                !
                                            Archival, classification, and indexing
Technological Core Competencies!            of scientific workflows and their
                                            associated materials in scalable
•    Digital Libraries!
•    Workflow Management !
                                            semantic repositories, providing
                                            advanced access and recommendation
•    Semantic Web!
•    Integrity & Authenticity!
                                            capabilities!
•    Provenance!                            !
•    Information Quality!                   Creation of scientific communities to
                                            collaboratively share, reuse and evolve
Case Studies!                               workflows and their parts, stimulating
                                            the development of new scientific
•  Astronomy (IAA)!
                                            knowledge!
•  Genome-wide Analysis and
   Biobanking (LUMC)!
                                            !
                                                                                      4
What are workflows ?!

Combination of data and processes into a
configurable and structured set of steps that
implement semi-automated computational
solutions in problem solving!
!
Types of workflows in Astronomy!

•    Personal script-based recipes !
•    Internal group developments (*)!
•    Multi-archive VO experiments!
•    The classical processing pipeline (*)!
•    Driving pipelines from VO services (TBD)!

(*) Scientifically exploitable results vs. scientific insight!
Easily accessible and reproducible!
                                                                     5
Why workflow preservation is important ?!

       Astronomy research is entirely digital !
       Time has come to go “Beyond the PDF”!
!
Preserved experiments!
•    Methodology “in action”!
•    All data are exposed!
•    Reproducible!
•    Repeatable!
•    Re-usable!
•    Re-purposeable!
•    Participatory!
•    Collaborative!
•    Formative!
                                                             6
Considerations!

(Data)   Workflow preservation!
!
•  Interpreted through their execution!
    •  Complex models are required to describe them!
•  Severely vulnerable to obsolescence !
    •  Applications !
    •  Libraries!
    •  Operating environment!
•  Provenance is a complex issue in a cloud of services!
•  Resources are often beyond control of scientists!
•  Alleviate decay of external resources via alternates!
•  Ensure trustworthiness and authenticity!
                                                            7
Considerations!

(Data)    Workflow preservation!
!
•    Versioning of the whole or its components !
•    Restricted access on data and processes!
•    Permissions, licenses, platform, costs, etc.!
•    Semantic discovery of Wfs, processes, web services!
•    Metrics for quality: use stats, logs uptime, etc.!

Workflows and Processes should benefit of the same
privileges acquired by Data!



                                                             8
A first approach in Workflow Preservation!

Preserve, Retrieve, Reconstruct, Replay!
!
•  Retrieve!                                Characterization!
   •  Functionality of the Wf or its modules!
   •  What are the inputs and outputs!
   •  Authority…!
•  Reconstruct!                                Modeling!
   •  Understand dependencies and components!
   •  Technical specificities!
•  Replay!                                       Tools!
   •  Check the success of the preservation method!

•  Referenced and acknowledged!
                                                                9
!
Preserving digital experiments!

RO . The Research Object!
!
All components related to the research lifecycle of an
experiment should be available. !
!
Preserved and easily retrievable !
!
•    Proposals!
•    Data!
•    Processes!
•    Workflows!
•    Publications!
!                                                         10
Wf4Ever Update!

!
User Requirements!
•    Functional requirements for Wf4Ever “working” platform!
•    Focused on improving collaboration and reuse!
•    Interoperability in exchanging scientific methodology!
•    Expose experiment in a structured way to be understood by others!
!
We need to build what we would like to preserve!
!
RO Modeling!
•  Model for interlinked components in a Research Object!
•  Strategies for assessing integrity and authenticity!
•  Attempts in metrics for Information Quality!
!
                                                                     11
Wf4Ever Update!



Architecture!
•  Search & Retrieval Service!
•  Recommender Service!
•  I&A Evaluation Service!
•  Notification Service!
!
!
!
User-Tools Prototypes!
•  RO Command Line Tool!
•  RO Annotator!
•  RO Box!
                            12
Wf4Ever Update!

ROBox!
!
Seamless contribution to a
working collaborative platform!
!
A shared folder in Dropbox
becomes a Working RO!
!
!
!
!
!
!
Automatic generation of
metadata !
                             13
Wf4Ever Update!

    !
!




        •    Anatomy of a Research Object!
        •    Annotations on RO components!
        •    RO Graphical Representation!
        •    Data/Sessions Inspection (SAMP)!


                                                             14
Wf4Ever Update!


Notification Service for Authors!
What should be notified ?!

    •    Fails!
    •    Downloads!
    •    Annotations!
    •    Linked/Similarity!
    •    Modifications on Working RO!
    •    Acknowledgements!
!
Notification Management Tool!
Avoid spam!


                                                     15
Related activities in the VO!

US VAO!
Work on semantic linking of proposals, publications, data!
!
IVOA Working Groups!
•  Data Modeling!
    Characterization, Provenance..!
•  Semantics!
    Ontologies, Vocabularies, Annotations..!    workflow@ivoa.net!
•  Data Access Layer!                                   !
    Self-descriptive Protocols..!
                                                   IVOA Note!
•  Grid and Web Services!                  André Schaaff & Jose Enrique Ruiz!
    UWS, VOSpace, SSO..!
•  IG . Data Curation and Preservation!
    Persistent Identifiers, Curation of VO Resources..!
                                                                           16

Mais conteúdo relacionado

Mais procurados

OOR--Open-Ontology-Repository--jun2010
OOR--Open-Ontology-Repository--jun2010OOR--Open-Ontology-Repository--jun2010
OOR--Open-Ontology-Repository--jun2010Peter Yim
 
2012 CNI Fall Membership Meeting
2012 CNI Fall Membership Meeting2012 CNI Fall Membership Meeting
2012 CNI Fall Membership MeetingPaolo Ciccarese
 
Ontology Browser WormBase Workshop International Worm Meeting 2015
Ontology Browser WormBase Workshop International Worm Meeting 2015Ontology Browser WormBase Workshop International Worm Meeting 2015
Ontology Browser WormBase Workshop International Worm Meeting 2015raymond91105
 
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY Paolo Ciccarese
 
V Rolfe - Open Education and Technical Opportunities - October 2012
V Rolfe - Open Education and Technical Opportunities - October 2012V Rolfe - Open Education and Technical Opportunities - October 2012
V Rolfe - Open Education and Technical Opportunities - October 2012Vivien Rolfe
 
The role of annotation in reproducibility (Empirical 2014)
The role of annotation in reproducibility (Empirical 2014)The role of annotation in reproducibility (Empirical 2014)
The role of annotation in reproducibility (Empirical 2014)Oscar Corcho
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objectsStian Soiland-Reyes
 

Mais procurados (7)

OOR--Open-Ontology-Repository--jun2010
OOR--Open-Ontology-Repository--jun2010OOR--Open-Ontology-Repository--jun2010
OOR--Open-Ontology-Repository--jun2010
 
2012 CNI Fall Membership Meeting
2012 CNI Fall Membership Meeting2012 CNI Fall Membership Meeting
2012 CNI Fall Membership Meeting
 
Ontology Browser WormBase Workshop International Worm Meeting 2015
Ontology Browser WormBase Workshop International Worm Meeting 2015Ontology Browser WormBase Workshop International Worm Meeting 2015
Ontology Browser WormBase Workshop International Worm Meeting 2015
 
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY
Integrating OPEN ANNOTATION with any DOMAIN ONTOLOGY
 
V Rolfe - Open Education and Technical Opportunities - October 2012
V Rolfe - Open Education and Technical Opportunities - October 2012V Rolfe - Open Education and Technical Opportunities - October 2012
V Rolfe - Open Education and Technical Opportunities - October 2012
 
The role of annotation in reproducibility (Empirical 2014)
The role of annotation in reproducibility (Empirical 2014)The role of annotation in reproducibility (Empirical 2014)
The role of annotation in reproducibility (Empirical 2014)
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 

Destaque

Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
banking industry in india
banking industry in indiabanking industry in india
banking industry in indiaGeeta Udapikar
 
Cystic Fibrosis (rough draft)
Cystic Fibrosis (rough draft)Cystic Fibrosis (rough draft)
Cystic Fibrosis (rough draft)sbinck
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflowsJose Enrique Ruiz
 
Healing the Mind, Body, & Spirit
Healing the Mind, Body, & Spirit Healing the Mind, Body, & Spirit
Healing the Mind, Body, & Spirit danielleconway
 

Destaque (8)

Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
banking industry in india
banking industry in indiabanking industry in india
banking industry in india
 
Boletin mhj julio 2016
Boletin mhj julio 2016Boletin mhj julio 2016
Boletin mhj julio 2016
 
Cystic Fibrosis (rough draft)
Cystic Fibrosis (rough draft)Cystic Fibrosis (rough draft)
Cystic Fibrosis (rough draft)
 
Digital Science
Digital ScienceDigital Science
Digital Science
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
 
Healing the Mind, Body, & Spirit
Healing the Mind, Body, & Spirit Healing the Mind, Body, & Spirit
Healing the Mind, Body, & Spirit
 
audit
auditaudit
audit
 

Semelhante a Workflow Preservation

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual ObservatoryJose Enrique Ruiz
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataJose Enrique Ruiz
 
OER for repository managers
OER for repository managersOER for repository managers
OER for repository managersNick Sheppard
 
Validating ontologies with OOPS! - EKAW2012
Validating ontologies with OOPS! - EKAW2012Validating ontologies with OOPS! - EKAW2012
Validating ontologies with OOPS! - EKAW2012María Poveda Villalón
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationJoint ALMA Observatory
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceDavid De Roure
 
Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...tulipbiru64
 
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...Beniamino Murgante
 
CASE-3 Assisting Special Needs Children with the Power of Open Source!
CASE-3 Assisting Special Needs Children with the Power of Open Source!CASE-3 Assisting Special Needs Children with the Power of Open Source!
CASE-3 Assisting Special Needs Children with the Power of Open Source!Alfresco Software
 
Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011Rudy Potenzone
 
Scientific data management from the lab to the web
Scientific data management   from the lab to the webScientific data management   from the lab to the web
Scientific data management from the lab to the webJose Manuel Gómez-Pérez
 
Open hpi semweb-06-part2
Open hpi semweb-06-part2Open hpi semweb-06-part2
Open hpi semweb-06-part2Nadine Ludwig
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsSimeon Warner
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014Susanna-Assunta Sansone
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin
 
ORCID Identifiers in Reactome
ORCID Identifiers in ReactomeORCID Identifiers in Reactome
ORCID Identifiers in ReactomeORCID, Inc
 

Semelhante a Workflow Preservation (20)

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D data
 
OER for repository managers
OER for repository managersOER for repository managers
OER for repository managers
 
Validating ontologies with OOPS! - EKAW2012
Validating ontologies with OOPS! - EKAW2012Validating ontologies with OOPS! - EKAW2012
Validating ontologies with OOPS! - EKAW2012
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science Preservation
 
COPO kick-off meeting
COPO kick-off meetingCOPO kick-off meeting
COPO kick-off meeting
 
Research Objects in Wf4Ever
Research Objects in Wf4EverResearch Objects in Wf4Ever
Research Objects in Wf4Ever
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...
 
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
 
CASE-3 Assisting Special Needs Children with the Power of Open Source!
CASE-3 Assisting Special Needs Children with the Power of Open Source!CASE-3 Assisting Special Needs Children with the Power of Open Source!
CASE-3 Assisting Special Needs Children with the Power of Open Source!
 
Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011
 
COPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob DaveyCOPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob Davey
 
Scientific data management from the lab to the web
Scientific data management   from the lab to the webScientific data management   from the lab to the web
Scientific data management from the lab to the web
 
Open hpi semweb-06-part2
Open hpi semweb-06-part2Open hpi semweb-06-part2
Open hpi semweb-06-part2
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and Collaborations
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
ORCID Identifiers in Reactome
ORCID Identifiers in ReactomeORCID Identifiers in Reactome
ORCID Identifiers in Reactome
 

Mais de Jose Enrique Ruiz

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroidsJose Enrique Ruiz
 
IPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutablesIPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutablesJose Enrique Ruiz
 
Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesJose Enrique Ruiz
 
Open Science and Executable Papers
Open Science and Executable PapersOpen Science and Executable Papers
Open Science and Executable PapersJose Enrique Ruiz
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paperJose Enrique Ruiz
 
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyJose Enrique Ruiz
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VODataJose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesJose Enrique Ruiz
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCAJose Enrique Ruiz
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VOJose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropJose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 

Mais de Jose Enrique Ruiz (14)

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
 
IPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutablesIPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutables
 
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
 
Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxies
 
Open Science and Executable Papers
Open Science and Executable PapersOpen Science and Executable Papers
Open Science and Executable Papers
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paper
 
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VOData
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 

Último

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 

Último (20)

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 

Workflow Preservation

  • 1. Grant agreement no.: 27092 Workflows Preservation! Data Curation and Preservation Session! Jose Enrique Ruiz! IAA-CSIC! ! October 17th 2011! 2011 IVOA Fall Interop Meeting - Pune!
  • 2. Who am I ?! Instituto Astrofísica de Andalucia - CSIC! 2
  • 3. What is Wf4Ever ?! EU funded FP7 STREP Project! December 2010 – December 2013 ! 1.  Intelligent Software Components (ISOCO, Spain)! 2.  University of Manchester (UNIMAN, UK)! 2 7 3.  Universidad Politécnica de Madrid (UPM, Spain)! 5! 4! 4.  Poznan Supercomputing and Networking Centre (PSNC, Poland)! 5.  Universisty of Oxford (OXF, UK)! 6.  Instituto de Astrofísica de Andalucía (IAA, Spain)! 1! 3! 7.  Leiden University Medical Centre (LUMC, NL)! 6! 3
  • 4. What is Wf4Ever ?! Technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows in a range of disciplines! Partners! •  One SME! Goals! •  Six public organizations! ! Archival, classification, and indexing Technological Core Competencies! of scientific workflows and their associated materials in scalable •  Digital Libraries! •  Workflow Management ! semantic repositories, providing advanced access and recommendation •  Semantic Web! •  Integrity & Authenticity! capabilities! •  Provenance! ! •  Information Quality! Creation of scientific communities to collaboratively share, reuse and evolve Case Studies! workflows and their parts, stimulating the development of new scientific •  Astronomy (IAA)! knowledge! •  Genome-wide Analysis and Biobanking (LUMC)! ! 4
  • 5. What are workflows ?! Combination of data and processes into a configurable and structured set of steps that implement semi-automated computational solutions in problem solving! ! Types of workflows in Astronomy! •  Personal script-based recipes ! •  Internal group developments (*)! •  Multi-archive VO experiments! •  The classical processing pipeline (*)! •  Driving pipelines from VO services (TBD)! (*) Scientifically exploitable results vs. scientific insight! Easily accessible and reproducible! 5
  • 6. Why workflow preservation is important ?! Astronomy research is entirely digital ! Time has come to go “Beyond the PDF”! ! Preserved experiments! •  Methodology “in action”! •  All data are exposed! •  Reproducible! •  Repeatable! •  Re-usable! •  Re-purposeable! •  Participatory! •  Collaborative! •  Formative! 6
  • 7. Considerations! (Data) Workflow preservation! ! •  Interpreted through their execution! •  Complex models are required to describe them! •  Severely vulnerable to obsolescence ! •  Applications ! •  Libraries! •  Operating environment! •  Provenance is a complex issue in a cloud of services! •  Resources are often beyond control of scientists! •  Alleviate decay of external resources via alternates! •  Ensure trustworthiness and authenticity! 7
  • 8. Considerations! (Data) Workflow preservation! ! •  Versioning of the whole or its components ! •  Restricted access on data and processes! •  Permissions, licenses, platform, costs, etc.! •  Semantic discovery of Wfs, processes, web services! •  Metrics for quality: use stats, logs uptime, etc.! Workflows and Processes should benefit of the same privileges acquired by Data! 8
  • 9. A first approach in Workflow Preservation! Preserve, Retrieve, Reconstruct, Replay! ! •  Retrieve! Characterization! •  Functionality of the Wf or its modules! •  What are the inputs and outputs! •  Authority…! •  Reconstruct! Modeling! •  Understand dependencies and components! •  Technical specificities! •  Replay! Tools! •  Check the success of the preservation method! •  Referenced and acknowledged! 9 !
  • 10. Preserving digital experiments! RO . The Research Object! ! All components related to the research lifecycle of an experiment should be available. ! ! Preserved and easily retrievable ! ! •  Proposals! •  Data! •  Processes! •  Workflows! •  Publications! ! 10
  • 11. Wf4Ever Update! ! User Requirements! •  Functional requirements for Wf4Ever “working” platform! •  Focused on improving collaboration and reuse! •  Interoperability in exchanging scientific methodology! •  Expose experiment in a structured way to be understood by others! ! We need to build what we would like to preserve! ! RO Modeling! •  Model for interlinked components in a Research Object! •  Strategies for assessing integrity and authenticity! •  Attempts in metrics for Information Quality! ! 11
  • 12. Wf4Ever Update! Architecture! •  Search & Retrieval Service! •  Recommender Service! •  I&A Evaluation Service! •  Notification Service! ! ! ! User-Tools Prototypes! •  RO Command Line Tool! •  RO Annotator! •  RO Box! 12
  • 13. Wf4Ever Update! ROBox! ! Seamless contribution to a working collaborative platform! ! A shared folder in Dropbox becomes a Working RO! ! ! ! ! ! ! Automatic generation of metadata ! 13
  • 14. Wf4Ever Update! ! ! •  Anatomy of a Research Object! •  Annotations on RO components! •  RO Graphical Representation! •  Data/Sessions Inspection (SAMP)! 14
  • 15. Wf4Ever Update! Notification Service for Authors! What should be notified ?! •  Fails! •  Downloads! •  Annotations! •  Linked/Similarity! •  Modifications on Working RO! •  Acknowledgements! ! Notification Management Tool! Avoid spam! 15
  • 16. Related activities in the VO! US VAO! Work on semantic linking of proposals, publications, data! ! IVOA Working Groups! •  Data Modeling! Characterization, Provenance..! •  Semantics! Ontologies, Vocabularies, Annotations..! workflow@ivoa.net! •  Data Access Layer! ! Self-descriptive Protocols..! IVOA Note! •  Grid and Web Services! André Schaaff & Jose Enrique Ruiz! UWS, VOSpace, SSO..! •  IG . Data Curation and Preservation! Persistent Identifiers, Curation of VO Resources..! 16