SlideShare a Scribd company logo
1 of 18
Date: 09/06/2014
User Requirements for
Geospatial Provenance
Daniel Garijo, Andreas Harth, Yolanda Gil
Ontology Engineering Group. Universidad Politécnica de Madrid
Information Sciences Institute, University of Southern California
Institute AIFB, Karlsruhe Institute of Technology
Problem statement
Maps can integrate many different sources
•Open Street Maps
•GeoNames
•CIA World Factbook
•Etc.
Interaction to standarize
2
Outline
1. Challenges
2. Assumptions
3. Types of provenance in the geospatial domain
1. Provenance of datasets and sets of datasets
2. Provenance of objects and sets of objects
3. Provenance of properties and sets of properties
4. Other requirements related to provenance
4. Modeling geospatial provenance with PROV-O
1. Dataset level provenance
• Updating a map
2. Object level provenance
3. Property level provenance
5. Summary
6. Conclusions and Future work
3
Challenges concerning provenance
Versioning and provenance
(Map updates )
Trust based provenance Data integration and provenance
Crowdsourcing and provenance Granularity and provenance
Aggregation and provenance
4
Assumptions
Simplifying the problem…
•The entities across datasets have been mapped.
•The datasets share the same data model and vocabulary.
•Each dataset contains objects with unique identifiers.
•The integrated map is going to be presented to a user who is interested in
using the information for some purpose.
5
Summary
1. Challenges
2. Assumptions
3. Types of provenance in the geospatial domain
1. Provenance of datasets and sets of datasets
2. Provenance of objects and sets of objects
3. Provenance of properties and sets of properties
4. Other requirements related to provenance
4. Modeling geospatial provenance with PROV-O
1. Dataset level provenance
• Updating a map
2. Object level provenance
3. Property level provenance
5. Summary
6. Conclusions and Future work
6
Types of provenance: Provenance of Datasets and sets of Datasets
Provenance of a map…
•Sources used to create the map
•Creator of the map
•Creation process used (algorithms, etc.)
•Recent changes of the map
•Reason why the map has been updated
Browsing different versions of a map…
•Most recent maps
•Maps from an organization
•Maps created from a version of a dataset or algorithm
Map
release
June
OSM FAO GADM
Integration June
7
Types of provenance: Provenance of Objects and sets of Objects
Objects: lower granularity entities in the map
•Original data source of the object
•Organizations responsible for the creation of the object
•Date of creation of the object
•Date of insertion of the object in the map
•Process of inclusion in the dataset
Provenance of collections of objects…
•Source of the objects of a region/area
•Objects from a specific organization
•Objects belonging to a type of source (e.g., crowdsourced map)
•Objects introduced in the last version of the map
A
B
C
bridge
stadium
intersection
8
Types of provenance: Provenance of Properties and sets of Properties
Properties: attributes of objects in a map
•Sources of the property
•Creator of the property
•Date of the creation/update of the property
•Process by which the property was added
Provenance of sets of properties…
•Properties of objects coming from one data source
•Properties of objects belonging to a crowdsourced
map
•Properties of the selected objects that have the same source
9
Source A Source B
Height: 20 m
Length: 1 km
Name: 405
Fwy overpass
Other requirements related to provenance
10
Other requirements might not be straightforward to answer…
•How did a set of manual corrections help to improve the map?
•What is new in this map?
•What objects are integrated with a high confidence?
•Why is an object not appearing?
•General highlights of the map
…but they can be addressed having provenance records
Summary
1. Challenges
2. Assumptions
3. Types of provenance in the geospatial domain
1. Provenance of datasets and sets of datasets
2. Provenance of objects and sets of objects
3. Provenance of properties and sets of properties
4. Other requirements related to provenance
4. Modeling geospatial provenance with PROV-O
1. Dataset level provenance
• Updating a map
2. Object level provenance
3. Property level provenance
5. Summary
6. Conclusions and Future work
11
Modeling provenance in the geospatial domain: PROV-O extension
Simple PROV-O extension to model the dataset level
12
Dataset Level Provenance: Example
13
Dataset integration approaches
There are different alternatives for updating a map
14
Object level provenance: scalability
15
Property level provenance
16
Asserted properties do not have URIs!
•New entities for describing their provenance
Source A Source B
:Bridge :height 20m
:Bridge :length 1 km
:Bridge :name “405 Fwy overpass”
:metadata1
:metadata2
prov:wasDerivedFrom
prov:wasDerivedFrom
Conclusions
17
Requirements and
major challenges for
geospatial
provenance
4 main categories:
•Provenance of datasets
•Provenance of objects
appearing in the map
•Provenance of
properties
•Other
Analogous
questions are
relevant for
dataset/object/prop
erty provenance in
non-geospatial
domains.
Date: 09/06/2014
User Requirements for
Geospatial Provenance
Daniel Garijo, Andreas Harth, Yolanda Gil
Ontology Engineering Group. Universidad Politécnica de Madrid
Information Sciences Institute, University of Southern California
Institute AIFB, Karlsruhe Institute of Technology

More Related Content

Viewers also liked

Research Objects in Scientific Publications
Research Objects in Scientific PublicationsResearch Objects in Scientific Publications
Research Objects in Scientific Publicationsdgarijo
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...dgarijo
 
Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)dgarijo
 
Frag Flow: Automated Fragment Detection in Scientific Workflows
Frag Flow: Automated Fragment Detection in Scientific WorkflowsFrag Flow: Automated Fragment Detection in Scientific Workflows
Frag Flow: Automated Fragment Detection in Scientific Workflowsdgarijo
 
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...dgarijo
 
Common Motifs in Scientific Workflows: An Empirical Analysis
Common Motifs in Scientific Workflows: An Empirical AnalysisCommon Motifs in Scientific Workflows: An Empirical Analysis
Common Motifs in Scientific Workflows: An Empirical Analysisdgarijo
 
Power point ses diana
Power point ses dianaPower point ses diana
Power point ses dianaDianiela
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...dgarijo
 
PROV-O Tutorial. DC-2013 Conference
PROV-O Tutorial. DC-2013 ConferencePROV-O Tutorial. DC-2013 Conference
PROV-O Tutorial. DC-2013 Conferencedgarijo
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overviewdgarijo
 
OEG tools for supporting Ontology Engineering
OEG tools for supporting Ontology EngineeringOEG tools for supporting Ontology Engineering
OEG tools for supporting Ontology Engineeringdgarijo
 
OPM Overview
OPM Overview OPM Overview
OPM Overview dgarijo
 

Viewers also liked (14)

Research Objects in Scientific Publications
Research Objects in Scientific PublicationsResearch Objects in Scientific Publications
Research Objects in Scientific Publications
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...
 
P-Plan
P-PlanP-Plan
P-Plan
 
Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)
 
Frag Flow: Automated Fragment Detection in Scientific Workflows
Frag Flow: Automated Fragment Detection in Scientific WorkflowsFrag Flow: Automated Fragment Detection in Scientific Workflows
Frag Flow: Automated Fragment Detection in Scientific Workflows
 
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...
On Specifying and Sharing Scientific Workflow Optimization Results Using Rese...
 
Common Motifs in Scientific Workflows: An Empirical Analysis
Common Motifs in Scientific Workflows: An Empirical AnalysisCommon Motifs in Scientific Workflows: An Empirical Analysis
Common Motifs in Scientific Workflows: An Empirical Analysis
 
Power point ses diana
Power point ses dianaPower point ses diana
Power point ses diana
 
Opmw
OpmwOpmw
Opmw
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...
 
PROV-O Tutorial. DC-2013 Conference
PROV-O Tutorial. DC-2013 ConferencePROV-O Tutorial. DC-2013 Conference
PROV-O Tutorial. DC-2013 Conference
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overview
 
OEG tools for supporting Ontology Engineering
OEG tools for supporting Ontology EngineeringOEG tools for supporting Ontology Engineering
OEG tools for supporting Ontology Engineering
 
OPM Overview
OPM Overview OPM Overview
OPM Overview
 

Similar to User Requirements for Geospatial Provenance

Lecture 3 needs assessment
Lecture 3   needs assessmentLecture 3   needs assessment
Lecture 3 needs assessmentyihongyuan19
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2Gianpaolo Coro
 
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSM
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSMTheoretical Concepts in GIS Vanessa Quiñones Orfila PSM
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSMVanessa Quiñones
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsHPCC Systems
 
ArchaeoSTOR Map: Publishing archaeological geodata on the web
ArchaeoSTOR Map: Publishing archaeological geodata on the webArchaeoSTOR Map: Publishing archaeological geodata on the web
ArchaeoSTOR Map: Publishing archaeological geodata on the webYuma Matsui
 
Review of Digital Soil Mapping steps
Review of Digital Soil Mapping stepsReview of Digital Soil Mapping steps
Review of Digital Soil Mapping stepsFAO
 
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web AppsGIS in the Rockies
 
MapInfo Professional 12.5 and Discover3D 2014 - A brief overview
MapInfo Professional 12.5 and Discover3D 2014 - A brief overviewMapInfo Professional 12.5 and Discover3D 2014 - A brief overview
MapInfo Professional 12.5 and Discover3D 2014 - A brief overviewPrakher Hajela Saxena
 
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders Australia
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders AustraliaPlacement at Sahmakum Teang Tnaut funded by Engineers Without Borders Australia
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders AustraliaWilfred Waters
 
Bring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsBring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsDatabricks
 
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence Camille Tardy
 
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdf
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdfIGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdf
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdfnoureddinebassa1
 
Polar CI: A Clound based Polar Resource Discovery Engine
Polar CI: A Clound based Polar Resource Discovery EnginePolar CI: A Clound based Polar Resource Discovery Engine
Polar CI: A Clound based Polar Resource Discovery EngineYongyao Jiang
 
GIS Introduction.ppt
GIS Introduction.pptGIS Introduction.ppt
GIS Introduction.pptmisterjis
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site RemediationJoseph Luchette
 
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...nacis_slides
 

Similar to User Requirements for Geospatial Provenance (20)

IMIA Chiang Spatial Computing - 2016
IMIA Chiang Spatial Computing - 2016IMIA Chiang Spatial Computing - 2016
IMIA Chiang Spatial Computing - 2016
 
Lecture 3 needs assessment
Lecture 3   needs assessmentLecture 3   needs assessment
Lecture 3 needs assessment
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
 
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSM
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSMTheoretical Concepts in GIS Vanessa Quiñones Orfila PSM
Theoretical Concepts in GIS Vanessa Quiñones Orfila PSM
 
What is gis
What is gisWhat is gis
What is gis
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC Systems
 
ArchaeoSTOR Map: Publishing archaeological geodata on the web
ArchaeoSTOR Map: Publishing archaeological geodata on the webArchaeoSTOR Map: Publishing archaeological geodata on the web
ArchaeoSTOR Map: Publishing archaeological geodata on the web
 
Review of Digital Soil Mapping steps
Review of Digital Soil Mapping stepsReview of Digital Soil Mapping steps
Review of Digital Soil Mapping steps
 
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps
2017 GIS in Education Track: Sharing Historical Maps and Atlases in Web Apps
 
GIS Presentation.pptx
GIS Presentation.pptxGIS Presentation.pptx
GIS Presentation.pptx
 
MapInfo Professional 12.5 and Discover3D 2014 - A brief overview
MapInfo Professional 12.5 and Discover3D 2014 - A brief overviewMapInfo Professional 12.5 and Discover3D 2014 - A brief overview
MapInfo Professional 12.5 and Discover3D 2014 - A brief overview
 
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders Australia
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders AustraliaPlacement at Sahmakum Teang Tnaut funded by Engineers Without Borders Australia
Placement at Sahmakum Teang Tnaut funded by Engineers Without Borders Australia
 
Topic basic gis session 1
Topic  basic gis session 1Topic  basic gis session 1
Topic basic gis session 1
 
Bring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsBring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science Workflows
 
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
 
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdf
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdfIGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdf
IGIS Workshop - Introduction to ArcGIS Pro - Apr 2022 - Presentation.pdf
 
Polar CI: A Clound based Polar Resource Discovery Engine
Polar CI: A Clound based Polar Resource Discovery EnginePolar CI: A Clound based Polar Resource Discovery Engine
Polar CI: A Clound based Polar Resource Discovery Engine
 
GIS Introduction.ppt
GIS Introduction.pptGIS Introduction.ppt
GIS Introduction.ppt
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site Remediation
 
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...
Geohistory-Géohistoire Canada: Developing a partnership for historical GIS an...
 

More from dgarijo

FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesFOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesdgarijo
 
FAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Futuredgarijo
 
Towards Reusable Research Software
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Softwaredgarijo
 
SOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationdgarijo
 
A Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasetsdgarijo
 
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge GraphsOBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphsdgarijo
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadatadgarijo
 
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...dgarijo
 
WDPlus: Leveraging Wikidata to Link and Extend Tabular Data
WDPlus: Leveraging Wikidata to Link and Extend Tabular DataWDPlus: Leveraging Wikidata to Link and Extend Tabular Data
WDPlus: Leveraging Wikidata to Link and Extend Tabular Datadgarijo
 
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...dgarijo
 
Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019dgarijo
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
 
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...dgarijo
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologiesdgarijo
 
Towards Automating Data Narratives
Towards Automating Data NarrativesTowards Automating Data Narratives
Towards Automating Data Narrativesdgarijo
 
Automated Hypothesis Testing with Large Scale Scientific Workflows
Automated Hypothesis Testing with Large Scale Scientific WorkflowsAutomated Hypothesis Testing with Large Scale Scientific Workflows
Automated Hypothesis Testing with Large Scale Scientific Workflowsdgarijo
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Softwaredgarijo
 
Software Metadata: Describing "dark software" in GeoSciences
Software Metadata: Describing "dark software" in GeoSciencesSoftware Metadata: Describing "dark software" in GeoSciences
Software Metadata: Describing "dark software" in GeoSciencesdgarijo
 
Reproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An OverviewReproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An Overviewdgarijo
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsdgarijo
 

More from dgarijo (20)

FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesFOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
 
FAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Future
 
Towards Reusable Research Software
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Software
 
SOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentation
 
A Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasets
 
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge GraphsOBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadata
 
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
 
WDPlus: Leveraging Wikidata to Link and Extend Tabular Data
WDPlus: Leveraging Wikidata to Link and Extend Tabular DataWDPlus: Leveraging Wikidata to Link and Extend Tabular Data
WDPlus: Leveraging Wikidata to Link and Extend Tabular Data
 
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
 
Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
 
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
 
Towards Automating Data Narratives
Towards Automating Data NarrativesTowards Automating Data Narratives
Towards Automating Data Narratives
 
Automated Hypothesis Testing with Large Scale Scientific Workflows
Automated Hypothesis Testing with Large Scale Scientific WorkflowsAutomated Hypothesis Testing with Large Scale Scientific Workflows
Automated Hypothesis Testing with Large Scale Scientific Workflows
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
 
Software Metadata: Describing "dark software" in GeoSciences
Software Metadata: Describing "dark software" in GeoSciencesSoftware Metadata: Describing "dark software" in GeoSciences
Software Metadata: Describing "dark software" in GeoSciences
 
Reproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An OverviewReproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An Overview
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflows
 

Recently uploaded

4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptxJonalynLegaspi2
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 

Recently uploaded (20)

4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptx
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 

User Requirements for Geospatial Provenance

  • 1. Date: 09/06/2014 User Requirements for Geospatial Provenance Daniel Garijo, Andreas Harth, Yolanda Gil Ontology Engineering Group. Universidad Politécnica de Madrid Information Sciences Institute, University of Southern California Institute AIFB, Karlsruhe Institute of Technology
  • 2. Problem statement Maps can integrate many different sources •Open Street Maps •GeoNames •CIA World Factbook •Etc. Interaction to standarize 2
  • 3. Outline 1. Challenges 2. Assumptions 3. Types of provenance in the geospatial domain 1. Provenance of datasets and sets of datasets 2. Provenance of objects and sets of objects 3. Provenance of properties and sets of properties 4. Other requirements related to provenance 4. Modeling geospatial provenance with PROV-O 1. Dataset level provenance • Updating a map 2. Object level provenance 3. Property level provenance 5. Summary 6. Conclusions and Future work 3
  • 4. Challenges concerning provenance Versioning and provenance (Map updates ) Trust based provenance Data integration and provenance Crowdsourcing and provenance Granularity and provenance Aggregation and provenance 4
  • 5. Assumptions Simplifying the problem… •The entities across datasets have been mapped. •The datasets share the same data model and vocabulary. •Each dataset contains objects with unique identifiers. •The integrated map is going to be presented to a user who is interested in using the information for some purpose. 5
  • 6. Summary 1. Challenges 2. Assumptions 3. Types of provenance in the geospatial domain 1. Provenance of datasets and sets of datasets 2. Provenance of objects and sets of objects 3. Provenance of properties and sets of properties 4. Other requirements related to provenance 4. Modeling geospatial provenance with PROV-O 1. Dataset level provenance • Updating a map 2. Object level provenance 3. Property level provenance 5. Summary 6. Conclusions and Future work 6
  • 7. Types of provenance: Provenance of Datasets and sets of Datasets Provenance of a map… •Sources used to create the map •Creator of the map •Creation process used (algorithms, etc.) •Recent changes of the map •Reason why the map has been updated Browsing different versions of a map… •Most recent maps •Maps from an organization •Maps created from a version of a dataset or algorithm Map release June OSM FAO GADM Integration June 7
  • 8. Types of provenance: Provenance of Objects and sets of Objects Objects: lower granularity entities in the map •Original data source of the object •Organizations responsible for the creation of the object •Date of creation of the object •Date of insertion of the object in the map •Process of inclusion in the dataset Provenance of collections of objects… •Source of the objects of a region/area •Objects from a specific organization •Objects belonging to a type of source (e.g., crowdsourced map) •Objects introduced in the last version of the map A B C bridge stadium intersection 8
  • 9. Types of provenance: Provenance of Properties and sets of Properties Properties: attributes of objects in a map •Sources of the property •Creator of the property •Date of the creation/update of the property •Process by which the property was added Provenance of sets of properties… •Properties of objects coming from one data source •Properties of objects belonging to a crowdsourced map •Properties of the selected objects that have the same source 9 Source A Source B Height: 20 m Length: 1 km Name: 405 Fwy overpass
  • 10. Other requirements related to provenance 10 Other requirements might not be straightforward to answer… •How did a set of manual corrections help to improve the map? •What is new in this map? •What objects are integrated with a high confidence? •Why is an object not appearing? •General highlights of the map …but they can be addressed having provenance records
  • 11. Summary 1. Challenges 2. Assumptions 3. Types of provenance in the geospatial domain 1. Provenance of datasets and sets of datasets 2. Provenance of objects and sets of objects 3. Provenance of properties and sets of properties 4. Other requirements related to provenance 4. Modeling geospatial provenance with PROV-O 1. Dataset level provenance • Updating a map 2. Object level provenance 3. Property level provenance 5. Summary 6. Conclusions and Future work 11
  • 12. Modeling provenance in the geospatial domain: PROV-O extension Simple PROV-O extension to model the dataset level 12
  • 14. Dataset integration approaches There are different alternatives for updating a map 14
  • 15. Object level provenance: scalability 15
  • 16. Property level provenance 16 Asserted properties do not have URIs! •New entities for describing their provenance Source A Source B :Bridge :height 20m :Bridge :length 1 km :Bridge :name “405 Fwy overpass” :metadata1 :metadata2 prov:wasDerivedFrom prov:wasDerivedFrom
  • 17. Conclusions 17 Requirements and major challenges for geospatial provenance 4 main categories: •Provenance of datasets •Provenance of objects appearing in the map •Provenance of properties •Other Analogous questions are relevant for dataset/object/prop erty provenance in non-geospatial domains.
  • 18. Date: 09/06/2014 User Requirements for Geospatial Provenance Daniel Garijo, Andreas Harth, Yolanda Gil Ontology Engineering Group. Universidad Politécnica de Madrid Information Sciences Institute, University of Southern California Institute AIFB, Karlsruhe Institute of Technology

Editor's Notes

  1. This presentation is a summary of the OWS-9 y OWS-10 discussions (In the context of OGC) Maps integrate information from many resources. Normally the data integration process is automatic, although it may have some manual steps (curate data, etc). Each source may have their own properties, geometries, data, etc, but when presenting to a user just a value for each thing is shown. Maps can be updated (e.g., a new road is built), and we need to track the provenance of the information to check its authenticity. This work summarizes the discussions with researchers and practitioners at several meetings and workshops on geospatial data. This effort is also of great importance for the community, as there is an ongoing effort on standarizing how to link entities in geospatial data (OGC and W3C)
  2. Given the previous problem, in this presentation we will show the challenges derived from the problem, A set of assumptions to simplify the integration scenario, the types of provenance that we can find on it, How to model it with PROV and the conclusions and future work.
  3. Trust based provenance: If a map is created from many datasets, we need to know if that dataset is a trusted one or not. Data integration and provenance: knowing which data came from each dataset can be very relevant to understand why a map is the way it is. Crowdsourcing and provenance: Some datasets like OSM depend on the data provided by users. It is key to know who contributed in what to assess its quality Granularity and provenance: different datasets provide different levels of granularity. A geographical feature can be a point, line or 3d area. Aggregation and provenance: maps are aggregations of features from other sources. Versioning and provenance: map updates
  4. Given the heterogeneity of the data, in this first approach to the problem we decided to simplify it. In a nutshell, what we assume is that the datasets are using the same model and that the entities across different datasets have been mapped. This is unrealistic, as it is a great effort. However, the W3C and OGC are already talking on how to align existent approaches to make a standard. We do this to be able to tackle and describe the main challenges regarding provenance in this scenario.
  5. Next I’ll talk about the types of provenance that we can find in the geospatial domain.
  6. Types of provenance: provenance of datasets. This is the most typical one, as it aims to describe the main features of a map: which sources were used, which process led to its creation, what are the changes made to the map, etc. A map may have been updated, and different versions might be available. Therefore we are also interested in browsing the provenance of sets of maps.
  7. Drilling down in granularity: maps are made of objects, and these objects may have its provenance as well. You could ask where does the object come from the organizations responsible for its appearance in the map, the date when the object was inserted, etc. As happened with the maps, we may be also interested in annotating sets of objects (in case they all share different annotations) instead of having them annotated individually.
  8. An object can have properties which have been integrated from different sources. The questions related to them are analogous to those that we could do to an object.
  9. Other requirements are not that easy to answer (not directly with a sparql query), but they can be benefited from the previous types of provenance. For example, if we want to answer how a set of corrections helped to improve a map, we can show the previous map and slowly introduce the changes, thus showing how the map is complete. We could answer the second question by retrieving the objects introduced in the newer version of the map, we could retrieve those with high confidence by modeling extra metadata from the algorithm, etc.
  10. Now that we have introduced the main requirements, how do we tackle them with PROV?
  11. First we need to introduce some basic extensions to PROV. These are very basic extensions and additional ones could be necessary to deal with the different levels of granularity. This is a work in progress and we still haven’t published the vocabulary extensions. We wanted to distinguish crowdsourced maps from integrated maps, as the former will be the inputs and the latter the outputs of the map integration processes. Other entities are the additional datasets consulted by the algorithm responsible for the integration of the map. We were going to introduce roles as well, but in the end decided to cut them out for simplicity.
  12. This would be an example of an integration of a map created from two different maps (GM and OSM). Explain a little the example
  13. There are three alternative approaches to creating new versions of the map: the new version of the map is generated anew, the new version of the map is generated taking into account the previous version of the map, and only the delta of the changes are generated. We assumed the second one in the previous example, although each approach is possible.
  14. This figure shows an example of several ways to store object level provenance. Maps can be big, and storing the provenance of every object might bring scalability issues. Recording partial provenance: Only particular aspects of provenance could be stored. For example, the only provenance assertions for an object could be references to the original objects identifiers. • Recording provenance selectively: During the integration process, specific decisions would be made as to what objects grant a detailed provenance record and which ones do not. For example, if an object was created with low confidence then detailed provenance would be recorded. • Aggregating provenance of objects: Objects with equivalent provenance could be grouped into collections, and the provenance would be attached to the collections. • Storing provenance separately: Provenance can be stored separately from the map itself. Several provenance services could be set up for the same map.
  15. The problem of modeling properties is that they do not have an identifier. Therefore we need to create a new entity (annotation, bundle, etc) which will contain the provenance for it. Explain the example with the bridge
  16. This is a summary of all the previous requirements, which is the main contribution. Discuss a little the difference between the sections and summarize each one a bit. Another contribution is the PROV extension