Introduction
                    An Event-Centric Model
                                Summary




An Event-Centric Provenance Model for Digital Libraries

C. Tang, D. Castelli, L. Candela, P. Manghi, P. Pagano, C. Thanos
Istituto di Scienza e Tecnologie dell’Informazione “A. Faedo” – CNR, Pisa - Italy
name.surname@isti.cnr.it

6th Italian Research Conference on Digital Libraries
Padua, Italy, 28-29 January 2010



                             C. Tang et al.   An Event-Centric Provenance Model


Outline



  1   Introduction
         Motivations


  2   An Event-Centric Model
        The Constituents
        Exploiting the Model






What is Provenance?



  Some pseudo-definitions:
      “a summary of the history and context of the data”
      “the parts of the input that influenced (or that explain) a
      part of the output”
      “the part of the input that shows where a part of the output
      came from”
      “a causal graph that shows how a result was computed”






What is Provenance?



     Provenance is thus information about the
          source, derivation, influences, history
     . . . of an object, e.g.
          the result of a program or of a database query
     In e-Science (and thus in DLs), it is essential for
          efficiency, reproducibility, accountability, explanation, data
          cleaning, certifying the scientific value of data






What is the Problem?


  Many models are being developed
      Where-provenance: links parts of the output to the equal parts of the input
      Why-provenance: explains “why” some data appears in the result
      How-provenance: explains “how” a result was calculated
      Workflow provenance: describes the result of a parallel/distributed program
  . . . using different assumptions, e.g. on system scope, program,
  granularity
  Our goal: develop a “non-invasive” and “open” model
  supporting “provenance generation”





The Idea


  Add a layer dedicated to capturing provenance-oriented data

  [Figure: three layers, labelled from top to bottom: Reference Objects, Information Objects, Events]






The Model

  An Event is a happening that has an effect on a Reference Object:
  Event <happenedTo> Object






The Model

  Each Event has a Type for filtering purposes






The Model

  Description captures the “how” of the Event






The Model

  Place captures the “where” of the Event






The Model

  Time captures the “when” of the Event






The Model

  The Agent controls the Event






The Model

  Rationale captures the “why” of the Event






The Model

  The Parameter is any additional information
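
  The constituents listed on these slides could be gathered into a single record type. The following is only a sketch under assumptions: the field names, the use of plain strings as identifiers, and the sample values are all hypothetical and are not taken from the model's actual specification.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any, Dict, Optional


@dataclass
class Event:
    """One provenance record; field names mirror the constituents (assumed)."""
    happened_to: str                  # identifier of the affected Object
    type: str                         # Type, used for filtering
    description: str = ""             # the "how" of the Event
    place: Optional[str] = None       # the "where" of the Event
    time: Optional[datetime] = None   # the "when" of the Event
    agent: Optional[str] = None       # who or what controlled the Event
    rationale: Optional[str] = None   # the "why" of the Event
    parameters: Dict[str, Any] = field(default_factory=dict)  # anything else


# Hypothetical sample values, for illustration only.
example = Event(
    happened_to="salmon",
    type="curation",
    description="merged duplicate occurrence records",
    place="hypothetical-node-1",
    time=datetime(2010, 1, 28, 9, 30),
    agent="curator-a",
    rationale="remove duplicates before map generation",
    parameters={"records_merged": 12},
)
```

  Only the affected Object and the Type are mandatory in this sketch; whether the other constituents are optional in the real model is not stated on the slides.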






The Model

  Don’t reinvent the wheel!!!






Computing the provenance



  [Figure: diagram with five numbered elements (1 to 5); only the numbers survive in this transcript]






The granularity issue

  High flexibility by relying on the Information Object relationships

  [Figure: the Reference Objects, Information Objects and Events layers, with a “part-of” relationship label]
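
  One possible reading of this slide is that Events can be recorded at any level and then rolled up along part-of links. The sketch below illustrates that reading only; the object names, the PART_OF map and the reduced (object, type) event tuples are hypothetical.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical part-of links between Information Objects: part -> whole.
PART_OF: Dict[str, str] = {
    "salmon-occurrences": "salmon",
    "record-17": "salmon-occurrences",
}

# Events reduced to (happened_to, type) pairs for brevity.
LOG: List[Tuple[str, str]] = [
    ("record-17", "correction"),
    ("salmon-occurrences", "harvest"),
    ("salmon", "map-generation"),
    ("tuna", "harvest"),
]


def with_parts(whole: str, part_of: Dict[str, str]) -> Set[str]:
    """Return whole plus every object reachable through part-of links."""
    found = {whole}
    grew = True
    while grew:  # repeat until no new part is discovered
        grew = False
        for part, parent in part_of.items():
            if parent in found and part not in found:
                found.add(part)
                grew = True
    return found


def events_at(obj: str, include_parts: bool) -> List[Tuple[str, str]]:
    """Events on obj alone (coarse) or on obj and all its parts (fine)."""
    scope = with_parts(obj, PART_OF) if include_parts else {obj}
    return [e for e in LOG if e[0] in scope]


coarse = events_at("salmon", include_parts=False)
fine = events_at("salmon", include_parts=True)
```

  Here the caller chooses the granularity at query time, so the Events themselves do not need to be recorded at one fixed level.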






The AquaMaps scenario



     AquaMaps is one of the VREs supported by the D4Science
     e-Infrastructure
         Aggregate data on species from multiple and evolving data
         sources (e.g. OBIS, GBIF)
         Curate aggregated data
         Generate species distribution and biodiversity prediction
         maps






Example 1



  Find the events that occurred to the Salmon object
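
  A query of this kind might be sketched as a filter over an event log. The Event fields, the integer timestamps and the sample log below are hypothetical simplifications, not the system's actual interface.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Event:
    happened_to: str   # identifier of the affected Object
    type: str          # Type, used for filtering
    agent: str         # who or what controlled the Event
    time: int          # simplified timestamp (hypothetical: a plain counter)


def events_of(log: List[Event], obj: str,
              type_: Optional[str] = None) -> List[Event]:
    """Return the Events that happened to obj, oldest first."""
    hits = [e for e in log
            if e.happened_to == obj and (type_ is None or e.type == type_)]
    return sorted(hits, key=lambda e: e.time)


log = [
    Event("salmon", "curation", "curator-a", 2),
    Event("tuna", "harvest", "harvester", 1),
    Event("salmon", "harvest", "harvester", 1),
]
salmon_history = events_of(log, "salmon")
```

  The optional type_ argument reflects the stated purpose of the Type constituent, i.e. filtering.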






Example 2



  Find the contributors to the Salmon object
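
  If contributors are taken to be the Agents of the Events that happened to the object (an assumption; the slide does not define the term), the query could look like the sketch below. Events are plain dicts with hypothetical keys and values.

```python
from typing import List

# Hypothetical event log; each Event is a dict with assumed keys.
LOG: List[dict] = [
    {"happened_to": "salmon", "type": "curation", "agent": "curator-a", "time": 2},
    {"happened_to": "salmon", "type": "harvest", "agent": "harvester", "time": 1},
    {"happened_to": "salmon", "type": "curation", "agent": "curator-a", "time": 3},
    {"happened_to": "tuna", "type": "harvest", "agent": "curator-b", "time": 1},
]


def contributors(log: List[dict], obj: str) -> List[str]:
    """Distinct Agents of the Events on obj, in order of first involvement."""
    names: List[str] = []
    for event in sorted(log, key=lambda e: e["time"]):
        if event["happened_to"] == obj and event["agent"] not in names:
            names.append(event["agent"])
    return names


salmon_contributors = contributors(LOG, "salmon")
```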






Example 3



  Explain how the Salmon object came to exist
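
  One way to approach such an explanation is to walk backwards from the object through the Events that produced it. The sketch assumes that a derivation Event lists its input objects under a Parameter called "sources"; that name, like the sample log, is hypothetical and not part of the model as presented.

```python
from typing import List, Optional, Set

# Hypothetical log: the aggregation Event names its inputs in "sources".
LOG: List[dict] = [
    {"happened_to": "obis-record", "type": "harvest", "parameters": {}},
    {"happened_to": "gbif-record", "type": "harvest", "parameters": {}},
    {"happened_to": "salmon", "type": "aggregation",
     "parameters": {"sources": ["obis-record", "gbif-record"]}},
]


def explain(log: List[dict], obj: str,
            seen: Optional[Set[str]] = None) -> List[dict]:
    """Collect, depth-first, the Events behind obj and behind its sources."""
    seen = set() if seen is None else seen
    if obj in seen:  # guard against cyclic source references
        return []
    seen.add(obj)
    trace: List[dict] = []
    for event in log:
        if event["happened_to"] == obj:
            trace.append(event)
            for src in event.get("parameters", {}).get("sources", []):
                trace.extend(explain(log, src, seen))
    return trace


salmon_trace = explain(LOG, "salmon")
```

  The resulting trace starts at the Salmon object and ends at the harvested source records, which is one simple form of explanation.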






Summary

    Provenance is an essential feature in Digital Libraries and
    e-Science scenarios
    Many provenance models are being developed using
    different assumptions
    We propose a DL-oriented provenance model that is event-based,
    “open” and “non-invasive”
    Future steps
        validation and consolidation of the model in the context of
        new DL application scenarios
        implementation of an infrastructural service realising the
        model in the D4Science infrastructure







               http://www.d4science.eu
                 http://www.dlorg.eu
An Event-Centric Provenance Model for Digital Libraries @ IRCDL 2010
