SlideShare uma empresa Scribd logo
1 de 40
Baixar para ler offline
Grant agreement no.: 27092




          Workflows Preservation!
José Enrique Ruiz, Lourdes Verdes-Montenegro, Susana Sánchez, !
       Juan de Dios Santander-Vela and the Wf4Ever Team !
                           IAA-CSIC!
                                      !
                              January 18th 2012!
              7th Workflow Working Group Meeting - AS OV France!
Who am I ?!

Instituto Astrofísica de Andalucia - CSIC!




                                                       2
AMIGA Group!

Analysis of the interstellar Medium of Isolated Galaxies!
!
           Statistical baseline of isolated galaxies to compare!
    !
        with the behaviour of galaxies in denser environments!

                    Multi   study of ~1000 galaxies!
                                +!
         Need of intensive and complex analysis of 3D data!
                      2D spatial + 1 Velocity!

                                                           IAA-CSIC!
                    Uuiv . Granada, Obs. Marseille, Obs. Paris, NAOJ, !
                     FCRAO, UNAM, Univ. Edinburgh, IRAM, ESO,!
                                    Kapteyn Astronomical Institute.!
                                                                       !
                                     P.I. Lourdes Verdes-Montenegro!
                                                  http://amiga.iaa.es!
                                                                           3
What is Wf4Ever ?!

EU funded FP7 STREP Project!
December 2010 – December 2013 !
                      1.  Intelligent Software Components (ISOCO, Spain)!
                      2.  University of Manchester (UNIMAN, UK)!
   2     7            3.  Universidad Politécnica de Madrid (UPM, Spain)!
    5!       4!       4.  Poznan Supercomputing and Networking Centre
                          (PSNC, Poland)!
                      5.  Universisty of Oxford (OXF, UK)!
                      6.  Instituto de Astrofísica de Andalucía (IAA, Spain)!
 1! 3!
                      7.  Leiden University Medical Centre (LUMC, NL)!
  6!




                                                                                4
What is Wf4Ever ?!

Technological infrastructure for the preservation and efficient retrieval and reuse
                of scientific workflows in a range of disciplines!

Partners!
•  One SME!
                                           Goals!
•  Six public organizations!                !
                                            Archival, classification, and indexing
Technological Core Competencies!            of scientific workflows and their
                                            associated materials in scalable
•    Digital Libraries!
•    Workflow Management !
                                            semantic repositories, providing
                                            advanced access and recommendation
•    Semantic Web!
•    Integrity & Authenticity!
                                            capabilities!
•    Provenance!                            !
•    Information Quality!                   Creation of scientific communities to
                                            collaboratively share, reuse and evolve
Case Studies!                               workflows and their parts, stimulating
                                            the development of new scientific
•  Astronomy (IAA)!
                                            knowledge!
•  Genome-wide Analysis and
   Biobanking (LUMC)!
                                            !
                                                                                      5
What are our Scientific workflows ?!

Combination of data and processes into a configurable
and structured set of steps that implement semi-
automated computational solutions in problem solving!

Types of workflows in Astronomy!

•  Personal script-based recipes !
   Python, IDL, Software..!
•  Multi-archive VO recipes!
•  Internal group developments !
   GRID, Clusters..!
•  Processing pipelines!
   Provide Data, Computing Infrastructure, Tools..!


Scientifically exploitable results vs. scientific insight !    Wfs on
Easily accessible and reproducible (Shared)!                  steroids !
                                                                           6
Why workflow preservation is important ?!

!         Astronomy research is entirely digital!
!         Time has come to go “Beyond the PDF” !
!
Preserved experiments!
•    Methodology “in action”!           Discoverable !!
•    All data are exposed!
•    Reproducible!
•    Repeatable! Trust assessment
•    Re-usable!
•    Re-purposeable!
•    Participatory!
•    Collaborative!
•    Formative!        Social aspect

                                                               7
Related Initiatives!

Cyber-SKA!
Provide infrastructure that will be required to address the needs of
future radio telescopes such as the Square Kilometre Array!
!
Web based workflow builder !
•  Image segmentation!
•  Image mosaicking (Montage)!
•  Spatial reprojection!
•  Plane extraction from data cubes!


IceCore!
University of Helsinki!
Web portal for executing workflows – University of Helsinki!
Common interface for Wfs distributed in different engine servers!
                                                                       8
!
Related Initiatives!
Montage!
•  FITS Image Mosaicking!
•  Toolkit for Desktops, Clusters and Grids!
!
Astro-WISE!
•  Distributed data storage and computing infrastructure!
•  Track process provenance of final data products!
•  Calibration and analysis of images!
!
Helio-VO!
•  Solar physics Virtual Observatory!
•  Enable workflow execution via Taverna Server!
!
Workflows VO France!
•  Provide use cases mainly oriented VO !
•  AÏDA Workflow System implements FITS validation with CharDM !
                                                                      9
Tools!

Taverna!
    •    Strongly typed bioinformatics!
    •    Taverna Engine!
    •    Taverna Server!
    •    Taverna Workbench!
Kepler!
    •  Generic Science!
    •  Workflow System!
Triana!
    •    Local execution!
    •    Clusters RMI!
    •    GRID!
    •    Web Services!
                                              10
!
Tools!

Aladin JLOW Plugin!
Aladin plugin API permits graphical replacement of Aladin tools!




                                                                   11
Tools!

Aladin JLOW Plugin!




                          12
Tools!

ESO Reflex!
Finland’s in-kind contribution to ESO!
   •  Prototype/feasibility study!
   •  Initially based on Taverna 1!
Current implementation based on Kepler!
!
AstroTaverna!
AstroGrid Development!
Prototype, marrying of VO Desktop & Taverna 1!
Library of Taverna functions to access VO Desktop’s API!
!
Status!
Wrapper libraries only for Taverna 1!
                                                               13
Digital Repositories!


The recipes store!
Oxford e-Research Centre!
!
•    Find workflows!
•    Share workflows and files!
•    Find people!
•    Build communities!
•    Publish packages!
•    Tag workflows!
•    Score and rate workflows!
•    Comment on workflows!
•    Write reviews!


                                                     14
Digital Repositories!

 !!
!!

      Astronomy in MyExperiment!
      •  10 interested users !
      •  No VO-services-based Wfs!
      •  Some Helio Project Wfs!
           •  VOTables parsing!
           •  Internal services!
           •  Astro-Shims !
      •  BioCatalogue vs. VORegistry !
      !
      Astro-Wf4Ever specific Wfs!
      •  Catalogue Queries!
      !
      !
                                         15
The upcoming context!


    Processes should benefit of the same privileges acquired by Data!


    Digital Libraries of Workflows may boost the use of the existing
                       infrastructure of data (VO)!

Users need templates !!
!
Wf4Ever is also a project about!
  •  How to publish!
  •  How to do review by peers!
  •  Improve visibility by reference and attribution!
!
Publishers should play an import role!                                  16
The upcoming context!


The next generation of archives!
!
Much wider FoV and spectral coverage!
•  Huge sized datasets (~ tens TB)!
•  Big Data science highly dependent on I/O data rates!
•  Subproducts as virtual data generated on-the-fly!

Automated surveys!
•  Huge amount of tabular data!
•  Services for Knowledge Discovery in Databases!
!

                                                          17
The upcoming context!


We are moving into a world where !
•  computing and storage are cheap !
•  data movement is death!

Archives should evolve from data providers into virtual data
and services providers, where web services may help to solve
bandwidth issues.!
!
Archives speaking self-descriptif web services!
•  Smaller virtual data subproducts!
•  Distributed, multi-archive, multi-wavelength astronomy!


                                                               18
Considerations!

(Data)   Workflow preservation!
!
•  Interpreted through their execution!
    •  Complex models are required to describe them!
•  Severely vulnerable to obsolescence !
    •  Applications !
    •  Libraries!
    •  Operating environment!
•  Provenance is a complex issue in a cloud of services!
•  Resources are often beyond control of scientists!
•  Alleviate decay of external resources via alternates!


                                                           19
Considerations!

(Data)    Workflow preservation!
!
•    Versioning of the whole or its components !
•    Restricted access on data and processes!
•    Permissions, licenses, platform, costs, etc.!
•    Semantic discovery of Wfs, processes, web services!
•    Metrics for quality: use stats, logs uptime, etc.!
•    Integrity evaluation!
•    Completeness checking!
•    Ensure trustworthiness and authenticity!
•    Workflows for workflow curation!

                                                            20
A first approach in Workflow Preservation!

Preserve, Retrieve, Reconstruct, Replay!
!
•  Retrieve!                                Characterization!
   •  Functionality of the Wf or its modules!
   •  What are the inputs and outputs!
   •  Metadata, authority, keywords!        Semantics and
•  Reconstruct!                                Modeling!
   •  Understand dependencies and components!
   •  Technical specificities!
•  Replay!                                  Execution Tools!
   •  Check the success of the preservation method!
!
•  Referenced and acknowledged!             Long-term IDs!
                                                             21
!
Wf4Ever Update!

RO. The Research Object!
!
All components related to the research lifecycle of an
experiment should be available. !
!
Preserved and easily retrievable !
!
•    Proposals!
•    Data!
•    Processes!
•    Publications!
!
                                                         22
Astronomy WP in Wf4Ever!

Development and Implementation of Golden Exemplars!
    •  Local catalogue curation based on VO Archives!
    •  Sources extraction and crossmatching from 2D images!
    •  Modeling and analysis of 3D velocity cubes of galaxies!
    !
Create a community of users!
    •  Development of Prototypes and Tools!
    •  Dissemination!
     !
Integrate existing astronomy software with Wf4Ever Tools!
   •  SAMP and WebSAMP!
!
Provide interoperable models, ontologies and vocabularies for the
characterization of workflows, processes and RO components !
!
                                                                 23
Astronomy WP in Wf4Ever!
!
•  !   Characterization of the Astronomy domain in Wf!
• !    Detailed study of standards and web services in IVOA!
•      Exploration of similar initiatives for the curation of digital objects !
•      Sociological study and working methodology of astronomers!
•      Extraction of user and technical requirements!
•      Extraction of Taverna user requirements for Astronomy!
•      Implementation of first Golden Exemplar!
•      Early contacts in IVOA for the creation of a community of users!




                                                                                  24
Wf4Ever Update!

!
Users’ Requirements!
•    Functional requirements for Wf4Ever “working” platform!
•    Focused on improving collaboration and reuse!
•    Interoperability in exchanging scientific methodology!
•    Expose experiment in a structured way to be understood by others!
!
RO Modeling!
•  Model for interlinked components in a Research Object!
•  Strategies for assessing integrity and authenticity!
•  Attempts in metrics for Information Quality!
!
!
     We need to build what we would like to preserve!
                                                                     25
Wf4Ever Update!




             26
Wf4Ever Update!
         !
!!
! ! Proposed   improvements for Taverna!
     !
     •  VO Registry Access perspective!
     •  STILTS VOTable Library Integration!
     •  SAMP (Connectivity with VO Software)!
     •  Python based Beanshells - Done!
     •  Simple standard functions for Astronomy!
     •  ODBC Connector to DB!
     !
     !
     !
     !



                                                            27
Wf4Ever Update!



Architecture!
•  Search & Retrieval Service!
•  Recommender Service!
•  I&A Evaluation Service!
•  Notification Service!
!
!
!
User-Tools Prototypes!
•  RO Command Line Tool!
•  RO Annotator!
•  RO Box!
                            28
Wf4Ever Update!

ROBox!
!
Seamless contribution to a
working collaborative platform!
!
A shared folder in Dropbox
becomes a Working RO!
!
!
!
!
!
!
Automatic generation of
metadata !
                             29
Wf4Ever Update!

RO Digital Library and RO Import!




                                                 30
Wf4Ever Update!

    !
!




        •    Anatomy of a Research Object!
        •    Annotations on RO components!
        •    RO Graphical Representation!
        •    Data/Sessions Inspection (SAMP)!


                                                             31
Wf4Ever Update!

    !
!




                     32
Wf4Ever Update!

  !
RO Visualization!
!




                                 33
Wf4Ever Update!




Keywords    [galaxies][catalogs]
Integrity
 Rating

Downloads 36
Citations   [2]
Re-used     [1]
Comments    [4]

[Previous version | Next version]


                                       34
Wf4Ever Update!

    !
!




            Keywords       [galaxies][catalogs]
         Integrity
         Rating

               Downloads 36


        Re-used      [1]
        Comments     [4]

        [Previous version | Next version]


                                                   35
Wf4Ever Update!


Notification Service for Authors!
What should be notified ?!

    •    Fails!
    •    Downloads!
    •    Annotations!
    •    Linked/Similarity!
    •    Modifications on Working RO!
    •    Acknowledgements!
!
Notification Management Tool!
Avoid spam!


                                                     36
Astronomy WP in Wf4Ever!

Astronomy WP!
•  Development and Implementation of “Extraction of Sources”!
•  Development and Implementation of “Modelling of 3D Data”!
•  Explore experiments subject to be migrated to Wf/RO methodology!
•  Contribute to IVOA in Semantics for Processes!
!
Other WPs!
Continue Providing Feedback!
•  RO Model, Architecture, Integrity & Authenticity, Information Quality, etc. !
•  Software integration and improved functionalities (SAMP, Taverna, etc.)!
•  Prototypes for management and visualization of RO!
!
Community engagement!
•  Approach Astro-Informaticians!
•  Continue pushing in the IVOA Community!
•  Tackle collaboration with Publishers!
!
!                                                                              37
Workflows & IVOA!

Distributed data analysis in the VO!
•  Panchromatic, multi-archive, multi-facility!
•  Executes in the VO Infrastructure!
•  Orchestration of simple services!
!
                                      Workflows VO Characterization!
Present processing pipelines!         •  Inputs!
•  Produce exploitable data!          •  Outputs!
•  Provenance modeling!               •  Processes!
                                      •  Descriptions!
•  VO compliant data !                •  Metadata!
!                                     •  Etc..!
Data processing from the VO!
•  Provide custom re-processing to VO users!
•  Virtual data generation through UWS in VOSpace!
                                                                       38
Related activities in the VO!

IVOA Working Groups!
•  Data Modeling!
   Characterization, Provenance..!
•  Semantics!
   Ontologies, Vocabularies for Processes!
•  Data Access Layer!
   TAP, self-descriptive Protocols..!                       !
•  Grid and Web Services!                               IVOA Note!
   UWS, VOSpace, SSO..!                         Scientific Workflows in the VO!
•  Applications!                              André Schaaff & Jose Enrique Ruiz!
   SAMP!                                                       !
•  IG. KDD!                                         workflow@ivoa.net!
   Knowledge Discovery and Data Mining!
•  IG. Data Curation and Preservation!
   Persistent Identifiers, Curation of VO Resources..!
   Wf4Ever Project, US VAO semantic linking of proposals, publications, data!
                                                                                39
Questions!

!
More info!
http://amiga.iaa.es/p/212-workflows.htm!
http://www.wf4ever-project.org!
workflow@ivoa.net!
!
!




                                                   40

Mais conteúdo relacionado

Semelhante a Wf4Ever: Workflow Preservation

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
Jose Enrique Ruiz
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
Jose Enrique Ruiz
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science Preservation
Joint ALMA Observatory
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
guru122
 
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Beniamino Murgante
 
Datos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbADatos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbA
Daniel Vila Suero
 

Semelhante a Wf4Ever: Workflow Preservation (20)

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science Preservation
 
Research Objects in Wf4Ever
Research Objects in Wf4EverResearch Objects in Wf4Ever
Research Objects in Wf4Ever
 
VO Course 12: Workflows & the Wf4Ever project
VO Course 12: Workflows & the Wf4Ever projectVO Course 12: Workflows & the Wf4Ever project
VO Course 12: Workflows & the Wf4Ever project
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objects
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Roberts leiden110213
Roberts leiden110213Roberts leiden110213
Roberts leiden110213
 
Oak meeting 18/09/2014
Oak meeting 18/09/2014Oak meeting 18/09/2014
Oak meeting 18/09/2014
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
 
Where are we going and how are we going to get there?
Where are we going and how are we going to get there?Where are we going and how are we going to get there?
Where are we going and how are we going to get there?
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social Machines
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTR
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 
OER for repository managers
OER for repository managersOER for repository managers
OER for repository managers
 
Datos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbADatos enlazados BNE and MARiMbA
Datos enlazados BNE and MARiMbA
 
2012 09 aos-workshop-johanneskeizer
2012 09 aos-workshop-johanneskeizer2012 09 aos-workshop-johanneskeizer
2012 09 aos-workshop-johanneskeizer
 

Mais de Jose Enrique Ruiz

Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
Jose Enrique Ruiz
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VOData
Jose Enrique Ruiz
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
Jose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
Jose Enrique Ruiz
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
Jose Enrique Ruiz
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
Jose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
Jose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Jose Enrique Ruiz
 

Mais de Jose Enrique Ruiz (15)

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
 
IPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutablesIPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutables
 
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
 
Open Science and Executable Papers
Open Science and Executable PapersOpen Science and Executable Papers
Open Science and Executable Papers
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paper
 
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VOData
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
Digital Science
Digital ScienceDigital Science
Digital Science
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

Wf4Ever: Workflow Preservation

  • 1. Grant agreement no.: 27092 Workflows Preservation! José Enrique Ruiz, Lourdes Verdes-Montenegro, Susana Sánchez, ! Juan de Dios Santander-Vela and the Wf4Ever Team ! IAA-CSIC! ! January 18th 2012! 7th Workflow Working Group Meeting - AS OV France!
  • 2. Who am I ?! Instituto Astrofísica de Andalucia - CSIC! 2
  • 3. AMIGA Group! Analysis of the interstellar Medium of Isolated Galaxies! ! Statistical baseline of isolated galaxies to compare! ! with the behaviour of galaxies in denser environments! Multi study of ~1000 galaxies! +! Need of intensive and complex analysis of 3D data! 2D spatial + 1 Velocity! IAA-CSIC! Uuiv . Granada, Obs. Marseille, Obs. Paris, NAOJ, ! FCRAO, UNAM, Univ. Edinburgh, IRAM, ESO,! Kapteyn Astronomical Institute.! ! P.I. Lourdes Verdes-Montenegro! http://amiga.iaa.es! 3
  • 4. What is Wf4Ever ?! EU funded FP7 STREP Project! December 2010 – December 2013 ! 1.  Intelligent Software Components (ISOCO, Spain)! 2.  University of Manchester (UNIMAN, UK)! 2 7 3.  Universidad Politécnica de Madrid (UPM, Spain)! 5! 4! 4.  Poznan Supercomputing and Networking Centre (PSNC, Poland)! 5.  Universisty of Oxford (OXF, UK)! 6.  Instituto de Astrofísica de Andalucía (IAA, Spain)! 1! 3! 7.  Leiden University Medical Centre (LUMC, NL)! 6! 4
  • 5. What is Wf4Ever ?! Technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows in a range of disciplines! Partners! •  One SME! Goals! •  Six public organizations! ! Archival, classification, and indexing Technological Core Competencies! of scientific workflows and their associated materials in scalable •  Digital Libraries! •  Workflow Management ! semantic repositories, providing advanced access and recommendation •  Semantic Web! •  Integrity & Authenticity! capabilities! •  Provenance! ! •  Information Quality! Creation of scientific communities to collaboratively share, reuse and evolve Case Studies! workflows and their parts, stimulating the development of new scientific •  Astronomy (IAA)! knowledge! •  Genome-wide Analysis and Biobanking (LUMC)! ! 5
  • 6. What are our Scientific workflows ?! Combination of data and processes into a configurable and structured set of steps that implement semi- automated computational solutions in problem solving! Types of workflows in Astronomy! •  Personal script-based recipes ! Python, IDL, Software..! •  Multi-archive VO recipes! •  Internal group developments ! GRID, Clusters..! •  Processing pipelines! Provide Data, Computing Infrastructure, Tools..! Scientifically exploitable results vs. scientific insight ! Wfs on Easily accessible and reproducible (Shared)! steroids ! 6
  • 7. Why workflow preservation is important ?! ! Astronomy research is entirely digital! ! Time has come to go “Beyond the PDF” ! ! Preserved experiments! •  Methodology “in action”! Discoverable !! •  All data are exposed! •  Reproducible! •  Repeatable! Trust assessment •  Re-usable! •  Re-purposeable! •  Participatory! •  Collaborative! •  Formative! Social aspect 7
  • 8. Related Initiatives! Cyber-SKA! Provide infrastructure that will be required to address the needs of future radio telescopes such as the Square Kilometre Array! ! Web based workflow builder ! •  Image segmentation! •  Image mosaicking (Montage)! •  Spatial reprojection! •  Plane extraction from data cubes! IceCore! University of Helsinki! Web portal for executing workflows – University of Helsinki! Common interface for Wfs distributed in different engine servers! 8 !
  • 9. Related Initiatives! Montage! •  FITS Image Mosaicking! •  Toolkit for Desktops, Clusters and Grids! ! Astro-WISE! •  Distributed data storage and computing infrastructure! •  Track process provenance of final data products! •  Calibration and analysis of images! ! Helio-VO! •  Solar physics Virtual Observatory! •  Enable workflow execution via Taverna Server! ! Workflows VO France! •  Provide use cases mainly oriented VO ! •  AÏDA Workflow System implements FITS validation with CharDM ! 9
  • 10. Tools! Taverna! •  Strongly typed bioinformatics! •  Taverna Engine! •  Taverna Server! •  Taverna Workbench! Kepler! •  Generic Science! •  Workflow System! Triana! •  Local execution! •  Clusters RMI! •  GRID! •  Web Services! 10 !
  • 11. Tools! Aladin JLOW Plugin! Aladin plugin API permits graphical replacement of Aladin tools! 11
  • 13. Tools! ESO Reflex! Finland’s in-kind contribution to ESO! •  Prototype/feasibility study! •  Initially based on Taverna 1! Current implementation based on Kepler! ! AstroTaverna! AstroGrid Development! Prototype, marrying of VO Desktop & Taverna 1! Library of Taverna functions to access VO Desktop’s API! ! Status! Wrapper libraries only for Taverna 1! 13
  • 14. Digital Repositories! The recipes store! Oxford e-Research Centre! ! •  Find workflows! •  Share workflows and files! •  Find people! •  Build communities! •  Publish packages! •  Tag workflows! •  Score and rate workflows! •  Comment on workflows! •  Write reviews! 14
  • 15. Digital Repositories! !! !! Astronomy in MyExperiment! •  10 interested users ! •  No VO-services-based Wfs! •  Some Helio Project Wfs! •  VOTables parsing! •  Internal services! •  Astro-Shims ! •  BioCatalogue vs. VORegistry ! ! Astro-Wf4Ever specific Wfs! •  Catalogue Queries! ! ! 15
  • 16. The upcoming context! Processes should benefit of the same privileges acquired by Data! Digital Libraries of Workflows may boost the use of the existing infrastructure of data (VO)! Users need templates !! ! Wf4Ever is also a project about! •  How to publish! •  How to do review by peers! •  Improve visibility by reference and attribution! ! Publishers should play an import role! 16
  • 17. The upcoming context! The next generation of archives! ! Much wider FoV and spectral coverage! •  Huge sized datasets (~ tens TB)! •  Big Data science highly dependent on I/O data rates! •  Subproducts as virtual data generated on-the-fly! Automated surveys! •  Huge amount of tabular data! •  Services for Knowledge Discovery in Databases! ! 17
  • 18. The upcoming context! We are moving into a world where ! •  computing and storage are cheap ! •  data movement is death! Archives should evolve from data providers into virtual data and services providers, where web services may help to solve bandwidth issues.! ! Archives speaking self-descriptif web services! •  Smaller virtual data subproducts! •  Distributed, multi-archive, multi-wavelength astronomy! 18
  • 19. Considerations! (Data) Workflow preservation! ! •  Interpreted through their execution! •  Complex models are required to describe them! •  Severely vulnerable to obsolescence ! •  Applications ! •  Libraries! •  Operating environment! •  Provenance is a complex issue in a cloud of services! •  Resources are often beyond control of scientists! •  Alleviate decay of external resources via alternates! 19
  • 20. Considerations! (Data) Workflow preservation! ! •  Versioning of the whole or its components ! •  Restricted access on data and processes! •  Permissions, licenses, platform, costs, etc.! •  Semantic discovery of Wfs, processes, web services! •  Metrics for quality: use stats, logs uptime, etc.! •  Integrity evaluation! •  Completeness checking! •  Ensure trustworthiness and authenticity! •  Workflows for workflow curation! 20
  • 21. A first approach in Workflow Preservation! Preserve, Retrieve, Reconstruct, Replay! ! •  Retrieve! Characterization! •  Functionality of the Wf or its modules! •  What are the inputs and outputs! •  Metadata, authority, keywords! Semantics and •  Reconstruct! Modeling! •  Understand dependencies and components! •  Technical specificities! •  Replay! Execution Tools! •  Check the success of the preservation method! ! •  Referenced and acknowledged! Long-term IDs! 21 !
  • 22. Wf4Ever Update! RO. The Research Object! ! All components related to the research lifecycle of an experiment should be available. ! ! Preserved and easily retrievable ! ! •  Proposals! •  Data! •  Processes! •  Publications! ! 22
  • 23. Astronomy WP in Wf4Ever! Development and Implementation of Golden Exemplars! •  Local catalogue curation based on VO Archives! •  Sources extraction and crossmatching from 2D images! •  Modeling and analysis of 3D velocity cubes of galaxies! ! Create a community of users! •  Development of Prototypes and Tools! •  Dissemination! ! Integrate existing astronomy software with Wf4Ever Tools! •  SAMP and WebSAMP! ! Provide interoperable models, ontologies and vocabularies for the characterization of workflows, processes and RO components ! ! 23
  • 24. Astronomy WP in Wf4Ever! ! •  ! Characterization of the Astronomy domain in Wf! • ! Detailed study of standards and web services in IVOA! •  Exploration of similar initiatives for the curation of digital objects ! •  Sociological study and working methodology of astronomers! •  Extraction of user and technical requirements! •  Extraction of Taverna user requirements for Astronomy! •  Implementation of first Golden Exemplar! •  Early contacts in IVOA for the creation of a community of users! 24
  • 25. Wf4Ever Update! ! Users’ Requirements! •  Functional requirements for Wf4Ever “working” platform! •  Focused on improving collaboration and reuse! •  Interoperability in exchanging scientific methodology! •  Expose experiment in a structured way to be understood by others! ! RO Modeling! •  Model for interlinked components in a Research Object! •  Strategies for assessing integrity and authenticity! •  Attempts in metrics for Information Quality! ! ! We need to build what we would like to preserve! 25
  • 27. Wf4Ever Update! ! !! ! ! Proposed improvements for Taverna! ! •  VO Registry Access perspective! •  STILTS VOTable Library Integration! •  SAMP (Connectivity with VO Software)! •  Python based Beanshells - Done! •  Simple standard functions for Astronomy! •  ODBC Connector to DB! ! ! ! ! 27
  • 28. Wf4Ever Update! Architecture! •  Search & Retrieval Service! •  Recommender Service! •  I&A Evaluation Service! •  Notification Service! ! ! ! User-Tools Prototypes! •  RO Command Line Tool! •  RO Annotator! •  RO Box! 28
  • 29. Wf4Ever Update! ROBox! ! Seamless contribution to a working collaborative platform! ! A shared folder in Dropbox becomes a Working RO! ! ! ! ! ! ! Automatic generation of metadata ! 29
  • 30. Wf4Ever Update! RO Digital Library and RO Import! 30
  • 31. Wf4Ever Update! ! ! •  Anatomy of a Research Object! •  Annotations on RO components! •  RO Graphical Representation! •  Data/Sessions Inspection (SAMP)! 31
  • 32. Wf4Ever Update! ! ! 32
  • 33. Wf4Ever Update! ! RO Visualization! ! 33
  • 34. Wf4Ever Update! Keywords [galaxies][catalogs] Integrity Rating Downloads 36 Citations [2] Re-used [1] Comments [4] [Previous version | Next version] 34
  • 35. Wf4Ever Update! ! ! Keywords [galaxies][catalogs] Integrity Rating Downloads 36 Re-used [1] Comments [4] [Previous version | Next version] 35
  • 36. Wf4Ever Update! Notification Service for Authors! What should be notified ?! •  Fails! •  Downloads! •  Annotations! •  Linked/Similarity! •  Modifications on Working RO! •  Acknowledgements! ! Notification Management Tool! Avoid spam! 36
  • 37. Astronomy WP in Wf4Ever! Astronomy WP! •  Development and Implementation of “Extraction of Sources”! •  Development and Implementation of “Modelling of 3D Data”! •  Explore experiments subject to be migrated to Wf/RO methodology! •  Contribute to IVOA in Semantics for Processes! ! Other WPs! Continue Providing Feedback! •  RO Model, Architecture, Integrity & Authenticity, Information Quality, etc. ! •  Software integration and improved functionalities (SAMP, Taverna, etc.)! •  Prototypes for management and visualization of RO! ! Community engagement! •  Approach Astro-Informaticians! •  Continue pushing in the IVOA Community! •  Tackle collaboration with Publishers! ! ! 37
  • 38. Workflows & IVOA! Distributed data analysis in the VO! •  Panchromatic, multi-archive, multi-facility! •  Executes in the VO Infrastructure! •  Orchestration of simple services! ! Workflows VO Characterization! Present processing pipelines! •  Inputs! •  Produce exploitable data! •  Outputs! •  Provenance modeling! •  Processes! •  Descriptions! •  VO compliant data ! •  Metadata! ! •  Etc..! Data processing from the VO! •  Provide custom re-processing to VO users! •  Virtual data generation through UWS in VOSpace! 38
  • 39. Related activities in the VO! IVOA Working Groups! •  Data Modeling! Characterization, Provenance..! •  Semantics! Ontologies, Vocabularies for Processes! •  Data Access Layer! TAP, self-descriptive Protocols..! ! •  Grid and Web Services! IVOA Note! UWS, VOSpace, SSO..! Scientific Workflows in the VO! •  Applications! André Schaaff & Jose Enrique Ruiz! SAMP! ! •  IG. KDD! workflow@ivoa.net! Knowledge Discovery and Data Mining! •  IG. Data Curation and Preservation! Persistent Identifiers, Curation of VO Resources..! Wf4Ever Project, US VAO semantic linking of proposals, publications, data! 39