SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Geographica: A Benchmark for
Geospatial RDF Stores
George Garbis, Kostis Kyzirakos, Manolis Koubarakis
Dept. of Informatics and Telecommunications,
National and Kapodistrian University of Athens, Greece

12th International Semantic Web Conference
(Evaluation Track)
Outline
•

•

Motivation
The benchmark Geographica
Real-world workload
Synthetic workload
Evaluating the performance of geospatial RDF
stores using Geographica
Conclusions
•

•

•

•

23/10/2013

2
Motivation


Lots of geospatial data is available on the Web today.



Lots of geospatial data is quickly being transformed into
linked geospatial data!



People have started building applications using such data.



Geospatial extensions of SPARQL (e.g., GeoSPARQL and
stSPARQL) have been recently developed.



RDF stores provide support for GeoSPARQL (e.g., Strabon,
Oracle 12c, uSeekM, Parliament) or provide limited
geospatial functionality (e.g., Virtuoso, BigOwlim,
AllegroGraph)
23/10/2013

3
The Benchmark Geographica
•

•

Aim: measure the performance of today’s geospatial RDF stores
Organized around two workloads:
Real-world workload:
Based on existing linked geospatial datasets and known
application scenarios
Synthetic workload:
Measure performance in a controlled environment where we can
play around with selectivity of queries.
•

•

•

•

•

Γεωγραφικά: 17-volume geographical
encyclopedia by Στράβων (AD 17)

23/10/2013

4
Outline
•

•

Motivation
The benchmark Geographica
Real-world workload
Synthetic workload
Evaluating the performance of geospatial RDF
stores using Geographica
Conclusions
•

•

•

•

23/10/2013

5
Real-World Workload
Datasets
•

Datasets: Real-world datasets for the
geographic area of Greece playing an important
role in the LOD cloud or having complex
geometries
LinkedGeoData (LGD) for rivers and main roads
in Greece
GeoNames for Greece
DBpedia for Greece
Greek Administrative Geography (GAG)
CORINE land cover (CLC) for Greece
Hotspots
•

•

•

•

•

•

23/10/2013

6
Real-World Workload
Datasets

Dataset

Size

# of
Triples

# of
Points

# of Lines
# of Polygons
(max/min/avg (max/min/avg
points/line)
points/polygon)

GeoNames

45MB

400K

22K

-

-

Dbpedia

89MB

430K

8K

-

-

LGD

29MB

150K

-

12K (1.6K/2/21)

-

GAG

33MB

4K

-

-

325
(15K/4/400)

CLC

401MB

630K

-

-

45K
(5K/4/140)

Hotspots

90MB

450K

-

-

37K
(4/4/4)

23/10/2013

7
Real-World Workload
Parts
•

For this workload, Geographica has
two parts (following Jackpine):
Micro part: Tests primitive spatial
functions offered by geospatial RDF
stores
Macro part: Simulates some typical
application scenarios
•

•

23/10/2013

8
Real-World Workload
Micro part
•

•

29 SPARQL queries that consist of one or two triple
patterns and a spatial function.
Functions included:
Non-topological: boundary, envelope, convex
•

•

•

•

hull, buffer, area
Topological: equals, intersects, overlaps,
crosses, within, distance, disjoint
Spatial aggregates: extent, union

These functions are used for spatial selections
and spatial joins

23/10/2013

9
Example – non-topological
Micro part
•

Construct the boundary of all polygons of CLC

PREFIX geof:
<http://www.opengis.net/def/function/geosparql/
>
PREFIX dataset:
<http://geographica.di.uoa.gr/dataset/>
PREFIX clc:
<http://geo.linkedopendata.gr/corine/ontology#>
SELECT ( geof:boundary(?o1) as ?ret )
WHERE {
GRAPH dataset:clc { ?s1 clc:asWKT ?o1. }
23/10/2013
10
}
Example – spatial selection
Micro part

Find all points in GeoNames that are within a
given polygon.
PREFIX dataset:
<http://geographica.di.uoa.gr/dataset/>
PREFIX geonames:
<http://www.geonames.org/ontology#>
•

SELECT ?s1 ?o1
WHERE {
GRAPH dataset:geonames { ?s1
geonames:asWKT ?o1 }
FILTER( geof:sfWithin(?o1,
"POLYGON((…))"^^geo:wktLiteral)).
23/10/2013
}

11
Example – spatial join
Micro part
•

Find all pairs of GAG polygons that overlap

PREFIX dataset:
<http://geographica.di.uoa.gr/dataset/>
PREFIX gag:
<http://geo.linkedopendata.gr/gag/ontology/>
PREFIX clc:
<http://geo.linkedopendata.gr/corine/ontology#>
SELECT ?s1 ?s2
WHERE {
GRAPH dataset:gag {?s1 gag:asWKT ?o1}
GRAPH dataset:clc {?s2 clc:asWKT ?o2}
FILTER( geof:sfOverlaps(?o1, ?o2) )
23/10/2013
12
}
Real-World Workload
Micro part
•

Spatial Selections
Query Point
Points

Query Line

Within Buffer
Distance

Query Polygon
Within
Disjoint

Lines

Intersects
Disjoint

Polygons
•

Equals
Crosses
Intersects

Equals
Overlaps

Points

Lines

Polygons

Equals

Intersects

Intersects
Within

Spatial Joins
Points
Lines

Intersects
Within
Crosses

Polygons

Within
Touches
Overlaps

23/10/2013

13
Real-World Workload
Macro part: Scenarios
•

Reverse Geocoding

23/10/2013

14
Real-World Workload
Macro part: Scenarios
•

•

Reverse Geocoding
Web Map Search and Browsing

23/10/2013

15
Real-World Workload
Macro part: Scenarios
•

•

•

Reverse Geocoding
Web Map Search and Browsing
Rapid Mapping for Fire Monitoring

23/10/2013

16
Outline
•

•

Motivation
The benchmark Geographica
Real-world workload
Synthetic workload
Evaluating the performance of geospatial RDF
stores using Geographica
Conclusions
•

•

•

•

23/10/2013

17
Synthetic Workload
•

Goal: Evaluate performance in a controlled environment
where we can vary the thematic and spatial selectivity of
queries
Thematic selectivity: the fraction of the total
geographic features of a dataset that satisfy the nonspatial part of a query
Spatial selectivity: the fraction of the total geographic
features of a dataset which satisfy the topological
relation in the FILTER clause of a query
•

•

23/10/2013

18
Synthetic Workload
Generator
•

Dataset: As in VESPA, the produced datasets are
geographic features on a synthetic map:
States in a country ((n/3)2)
Land ownership (n2)
Roads (n)
POI (n2)
•

•

•

•

23/10/2013

19
Synthetic Workload
Ontology
•

•

Based roughly on the ontology of OpenStreetMap and the
GeoSPARQL vocabulary
Tagging each feature with a key enables us to select a
known fraction of features in a uniform way

23/10/2013

20
Synthetic Workload

Query template for spatial selections
SELECT ?s
WHERE {
?s ns:hasGeometry ?g.
?s c:hasTag ?tag.
?g ns:asWKT ?wkt.
?tag ns:hasKey “THEMA”
FILTER(FUNCTION(?wkt, “GEOM”^^geo:wktLiteral))}
•

Parameters:
ns: specifies the kind of feature (and geometry type) examined
FUNCTION: specifies the topological function examined
THEMA: defines the thematic selectivity of the query using
another parameter k
GEOM: specifies a rectangle that controls the spatial selectivity
of the query
•

•

•

•

23/10/2013

21
Synthetic Workload

Query template for spatial joins
SELECT ?s1 ?s2
WHERE {
?s1 ns1:hasGeometry ?g1.
?s1 ns1:hasTag ?tag1.
?g1 ns1:asWKT ?wkt1.
?tag1 ns1:hasKey “THEMA” .
?s2 ns2:hasGeometry ?g2.
?s2 ns2:hasTag ?tag2.
?g2 ns2:asWKT ?wkt2.
?tag2 ns2:hasKey “THEMA’” .
FILTER(FUNCTION(?wkt1, ?wkt2))}

23/10/2013

22
Outline
•

•

Motivation
The benchmark Geographica
Real-world workload
Synthetic workload
Evaluating the performance of geospatial
RDF stores using Geographica
Conclusions
•

•

•

•

23/10/2013

23
Experimental Setup
•

•

•

Geospatial RDF stores tested: Strabon, Parliament, uSeekM
Machine: Intel Xeon E5620, 12MB L3 cache, 2.4GHz, 24GB RAM, 4
HDD with RAID-5
Micro part (real-world workload) & synthetic workload:
Metric: response time
Run 3 times and compute the median
Time out: 1 hour
Run both on warm caches and cold caches
Macro part (real-world workload) :
Run many instantiations of each scenario for one hour without
cleaning caches
Metric: Average time for a complete execution
•

•

•

•

•

•

•

23/10/2013

24
Results

Real Workload - micro part (cold caches)

23/10/2013

25
Results

Macro part

Scenario

Strabon

uSeekM

Parliament

Reverse Geocoding

65 sec

0.77 sec

2.6 sec

Map Search and
Browsing

0.9 sec

0.6 sec

22.2 sec

Rapid Mapping for Fire
Monitoring

207.4 sec

-

-

23/10/2013

26
Results

Synthetic Workload
•

We generate the synthetic dataset with n=512.
This results in:
28,900 states
262,144 land ownerships
512 roads
262,144 points of interest
•

•

•

•

•

Size: 3,880,224 triples (745 MB)

23/10/2013

27
Results

Synthetic Workload – spatial selections

Intersects
Tag 1, cold caches
23/10/2013

Intersects
Tag 512, cold caches
28
Results

Synthetic Workload - Spatial Joins

Touches
23/10/2013

29
Conclusions
•

We defined Geographica, a new comprehensive
benchmark for geospatial RDF stores, and used it
to compare 3 relevant systems
Strabon
Parliament
uSeekM
•

•

•

•

Two workloads: real-world and synthetic

23/10/2013

30
Future Work
•

•

•

•

•

•

Capture the full GeoSPARQL standard.
Study scaling issues with larger datasets.
Add more application scenarios
Extent the generator to produce datasets
that do not follow a uniform distribution.
Extend the benchmark to include
time-evolving geospatial data.

23/10/2013

31
Thanks!
 Geographica: http://geographica.di.uoa.gr
 This work was supported in part by the European Commission
project TELEIOS http://www.earthobservatory.eu

Any Questions?

23/10/2013

32

Mais conteúdo relacionado

Mais procurados

Location based services for Nokia X and Nokia Asha using Geo2tag
Location based services for Nokia X and Nokia Asha using Geo2tagLocation based services for Nokia X and Nokia Asha using Geo2tag
Location based services for Nokia X and Nokia Asha using Geo2tagMicrosoft Mobile Developer
 
Wms Performance Tests Map Server Vs Geo Server
Wms Performance Tests Map Server Vs Geo ServerWms Performance Tests Map Server Vs Geo Server
Wms Performance Tests Map Server Vs Geo ServerDonnyV
 
MapServer #ProTips 2015
MapServer #ProTips 2015MapServer #ProTips 2015
MapServer #ProTips 2015Jeff McKenna
 
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GISMinPa Lee
 
State of GeoServer 2.14
State of GeoServer 2.14State of GeoServer 2.14
State of GeoServer 2.14Jody Garnett
 
GeoServer on steroids
GeoServer on steroidsGeoServer on steroids
GeoServer on steroidsGeoSolutions
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryGhislain Atemezing
 
Overview of MassGIS Web Mapping Services
Overview of MassGIS Web Mapping ServicesOverview of MassGIS Web Mapping Services
Overview of MassGIS Web Mapping Servicesaleda_freeman
 
HACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a SupercomputerHACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a Supercomputerinside-BigData.com
 
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...GeoSolutions
 
Querying Linked Geospatial Data with Incomplete Information
Querying Linked Geospatial Data with  Incomplete InformationQuerying Linked Geospatial Data with  Incomplete Information
Querying Linked Geospatial Data with Incomplete InformationCharalampos (Babis) Nikolaou
 
Java Image Processing for Geospatial Community
Java Image Processing for Geospatial CommunityJava Image Processing for Geospatial Community
Java Image Processing for Geospatial CommunityJody Garnett
 
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...GeoSolutions
 
Raster data in GeoServer and GeoTools: Achievements, issues and future devel...
Raster data in GeoServer and GeoTools:  Achievements, issues and future devel...Raster data in GeoServer and GeoTools:  Achievements, issues and future devel...
Raster data in GeoServer and GeoTools: Achievements, issues and future devel...GeoSolutions
 
OSGeo Conferences Report
OSGeo Conferences ReportOSGeo Conferences Report
OSGeo Conferences ReportJeff McKenna
 
GeoServer on Steroids
GeoServer on Steroids GeoServer on Steroids
GeoServer on Steroids GeoSolutions
 
Spatio-temporal Data Handling With GeoServer for MetOc And Remote Sensing
Spatio-temporal Data Handling With GeoServer for MetOc And Remote SensingSpatio-temporal Data Handling With GeoServer for MetOc And Remote Sensing
Spatio-temporal Data Handling With GeoServer for MetOc And Remote SensingGeoSolutions
 
GeoNetwork, The Open Source Solution for the interoperable management of ge...
GeoNetwork, The Open Source Solution  for the interoperable management  of ge...GeoNetwork, The Open Source Solution  for the interoperable management  of ge...
GeoNetwork, The Open Source Solution for the interoperable management of ge...GeoSolutions
 

Mais procurados (20)

Location based services for Nokia X and Nokia Asha using Geo2tag
Location based services for Nokia X and Nokia Asha using Geo2tagLocation based services for Nokia X and Nokia Asha using Geo2tag
Location based services for Nokia X and Nokia Asha using Geo2tag
 
Wms Performance Tests Map Server Vs Geo Server
Wms Performance Tests Map Server Vs Geo ServerWms Performance Tests Map Server Vs Geo Server
Wms Performance Tests Map Server Vs Geo Server
 
MapServer #ProTips 2015
MapServer #ProTips 2015MapServer #ProTips 2015
MapServer #ProTips 2015
 
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS
[FOSS4G Seoul 2015] New Geoprocessing Toolbox in uDig Desktop GIS
 
NASA Terra Data Fusion
NASA Terra Data FusionNASA Terra Data Fusion
NASA Terra Data Fusion
 
State of GeoServer 2.14
State of GeoServer 2.14State of GeoServer 2.14
State of GeoServer 2.14
 
GeoServer on steroids
GeoServer on steroidsGeoServer on steroids
GeoServer on steroids
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their Geometry
 
Overview of MassGIS Web Mapping Services
Overview of MassGIS Web Mapping ServicesOverview of MassGIS Web Mapping Services
Overview of MassGIS Web Mapping Services
 
HACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a SupercomputerHACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a Supercomputer
 
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...
Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Develo...
 
Why is postgis awesome?
Why is postgis awesome?Why is postgis awesome?
Why is postgis awesome?
 
Querying Linked Geospatial Data with Incomplete Information
Querying Linked Geospatial Data with  Incomplete InformationQuerying Linked Geospatial Data with  Incomplete Information
Querying Linked Geospatial Data with Incomplete Information
 
Java Image Processing for Geospatial Community
Java Image Processing for Geospatial CommunityJava Image Processing for Geospatial Community
Java Image Processing for Geospatial Community
 
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
 
Raster data in GeoServer and GeoTools: Achievements, issues and future devel...
Raster data in GeoServer and GeoTools:  Achievements, issues and future devel...Raster data in GeoServer and GeoTools:  Achievements, issues and future devel...
Raster data in GeoServer and GeoTools: Achievements, issues and future devel...
 
OSGeo Conferences Report
OSGeo Conferences ReportOSGeo Conferences Report
OSGeo Conferences Report
 
GeoServer on Steroids
GeoServer on Steroids GeoServer on Steroids
GeoServer on Steroids
 
Spatio-temporal Data Handling With GeoServer for MetOc And Remote Sensing
Spatio-temporal Data Handling With GeoServer for MetOc And Remote SensingSpatio-temporal Data Handling With GeoServer for MetOc And Remote Sensing
Spatio-temporal Data Handling With GeoServer for MetOc And Remote Sensing
 
GeoNetwork, The Open Source Solution for the interoperable management of ge...
GeoNetwork, The Open Source Solution  for the interoperable management  of ge...GeoNetwork, The Open Source Solution  for the interoperable management  of ge...
GeoNetwork, The Open Source Solution for the interoperable management of ge...
 

Semelhante a Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013

Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresKostis Kyzirakos
 
Big Linked Data Querying - ExtremeEarth Open Workshop
Big Linked Data Querying - ExtremeEarth Open WorkshopBig Linked Data Querying - ExtremeEarth Open Workshop
Big Linked Data Querying - ExtremeEarth Open WorkshopExtremeEarth
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech ProjectsJody Garnett
 
Representing and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic WebRepresenting and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic WebKostis Kyzirakos
 
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...Dongpo Deng
 
Big Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open WorkshopBig Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open WorkshopExtremeEarth
 
Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation George Percivall
 
Building Scalable Semantic Geospatial RDF Stores
Building Scalable Semantic Geospatial RDF StoresBuilding Scalable Semantic Geospatial RDF Stores
Building Scalable Semantic Geospatial RDF StoresKostis Kyzirakos
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech ProjectsJody Garnett
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Symeon Papadopoulos
 
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...multimediaeval
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit
 
Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...jins0618
 
Getting Started with Geospatial Data in MongoDB
Getting Started with Geospatial Data in MongoDBGetting Started with Geospatial Data in MongoDB
Getting Started with Geospatial Data in MongoDBMongoDB
 
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...BigData_Europe
 
Geospatial Options in Apache Spark
Geospatial Options in Apache SparkGeospatial Options in Apache Spark
Geospatial Options in Apache SparkDatabricks
 
Teste de performance de WMS entre GeoServer e Mapserver
Teste de performance de WMS entre GeoServer e MapserverTeste de performance de WMS entre GeoServer e Mapserver
Teste de performance de WMS entre GeoServer e MapserverGeoLivre Conference
 
High Performance and Scalable Geospatial Analytics on Cloud with Open Source
High Performance and Scalable Geospatial Analytics on Cloud with Open SourceHigh Performance and Scalable Geospatial Analytics on Cloud with Open Source
High Performance and Scalable Geospatial Analytics on Cloud with Open SourceDataWorks Summit
 
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdfHow to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdfCARTO
 

Semelhante a Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013 (20)

Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF Stores
 
Big Linked Data Querying - ExtremeEarth Open Workshop
Big Linked Data Querying - ExtremeEarth Open WorkshopBig Linked Data Querying - ExtremeEarth Open Workshop
Big Linked Data Querying - ExtremeEarth Open Workshop
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech Projects
 
Representing and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic WebRepresenting and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic Web
 
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
 
Big Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open WorkshopBig Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open Workshop
 
Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation
 
Building Scalable Semantic Geospatial RDF Stores
Building Scalable Semantic Geospatial RDF StoresBuilding Scalable Semantic Geospatial RDF Stores
Building Scalable Semantic Geospatial RDF Stores
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech Projects
 
Open geo data - technical issue
Open geo data  - technical issueOpen geo data  - technical issue
Open geo data - technical issue
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
 
Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...
 
Getting Started with Geospatial Data in MongoDB
Getting Started with Geospatial Data in MongoDBGetting Started with Geospatial Data in MongoDB
Getting Started with Geospatial Data in MongoDB
 
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...
BigDataEurope 1st SC5 Workshop, Project Teleios & LEO, by M. Koubarakis, Univ...
 
Geospatial Options in Apache Spark
Geospatial Options in Apache SparkGeospatial Options in Apache Spark
Geospatial Options in Apache Spark
 
Teste de performance de WMS entre GeoServer e Mapserver
Teste de performance de WMS entre GeoServer e MapserverTeste de performance de WMS entre GeoServer e Mapserver
Teste de performance de WMS entre GeoServer e Mapserver
 
High Performance and Scalable Geospatial Analytics on Cloud with Open Source
High Performance and Scalable Geospatial Analytics on Cloud with Open SourceHigh Performance and Scalable Geospatial Analytics on Cloud with Open Source
High Performance and Scalable Geospatial Analytics on Cloud with Open Source
 
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdfHow to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
 

Mais de Kostis Kyzirakos

ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial Data
ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial DataESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial Data
ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial DataKostis Kyzirakos
 
The spatiotemporal RDF store Strabon
The spatiotemporal RDF store StrabonThe spatiotemporal RDF store Strabon
The spatiotemporal RDF store StrabonKostis Kyzirakos
 
Linked Earth Observation Data:The Projects TELEIOS and LEO
Linked Earth Observation Data:The Projects TELEIOS and LEOLinked Earth Observation Data:The Projects TELEIOS and LEO
Linked Earth Observation Data:The Projects TELEIOS and LEOKostis Kyzirakos
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLKostis Kyzirakos
 
Data Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataData Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataKostis Kyzirakos
 
Data Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataData Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataKostis Kyzirakos
 
Strabon: A Semantic Geospatial Database System
Strabon: A Semantic Geospatial Database SystemStrabon: A Semantic Geospatial Database System
Strabon: A Semantic Geospatial Database SystemKostis Kyzirakos
 

Mais de Kostis Kyzirakos (7)

ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial Data
ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial DataESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial Data
ESWC2015 - Tutorial on Publishing and Interlinking Linked Geospatial Data
 
The spatiotemporal RDF store Strabon
The spatiotemporal RDF store StrabonThe spatiotemporal RDF store Strabon
The spatiotemporal RDF store Strabon
 
Linked Earth Observation Data:The Projects TELEIOS and LEO
Linked Earth Observation Data:The Projects TELEIOS and LEOLinked Earth Observation Data:The Projects TELEIOS and LEO
Linked Earth Observation Data:The Projects TELEIOS and LEO
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
 
Data Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataData Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial Data
 
Data Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial DataData Models and Query Languages for Linked Geospatial Data
Data Models and Query Languages for Linked Geospatial Data
 
Strabon: A Semantic Geospatial Database System
Strabon: A Semantic Geospatial Database SystemStrabon: A Semantic Geospatial Database System
Strabon: A Semantic Geospatial Database System
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 

Último (20)

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 

Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013

  • 1. Geographica: A Benchmark for Geospatial RDF Stores George Garbis, Kostis Kyzirakos, Manolis Koubarakis Dept. of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece 12th International Semantic Web Conference (Evaluation Track)
  • 2. Outline • • Motivation The benchmark Geographica Real-world workload Synthetic workload Evaluating the performance of geospatial RDF stores using Geographica Conclusions • • • • 23/10/2013 2
  • 3. Motivation  Lots of geospatial data is available on the Web today.  Lots of geospatial data is quickly being transformed into linked geospatial data!  People have started building applications using such data.  Geospatial extensions of SPARQL (e.g., GeoSPARQL and stSPARQL) have been recently developed.  RDF stores provide support for GeoSPARQL (e.g., Strabon, Oracle 12c, uSeekM, Parliament) or provide limited geospatial functionality (e.g., Virtuoso, BigOwlim, AllegroGraph) 23/10/2013 3
  • 4. The Benchmark Geographica • • Aim: measure the performance of today’s geospatial RDF stores Organized around two workloads: Real-world workload: Based on existing linked geospatial datasets and known application scenarios Synthetic workload: Measure performance in a controlled environment where we can play around with selectivity of queries. • • • • • Γεωγραφικά: 17-volume geographical encyclopedia by Στράβων (AD 17) 23/10/2013 4
  • 5. Outline • • Motivation The benchmark Geographica Real-world workload Synthetic workload Evaluating the performance of geospatial RDF stores using Geographica Conclusions • • • • 23/10/2013 5
  • 6. Real-World Workload Datasets • Datasets: Real-world datasets for the geographic area of Greece playing an important role in the LOD cloud or having complex geometries LinkedGeoData (LGD) for rivers and main roads in Greece GeoNames for Greece DBpedia for Greece Greek Administrative Geography (GAG) CORINE land cover (CLC) for Greece Hotspots • • • • • • 23/10/2013 6
  • 7. Real-World Workload Datasets Dataset Size # of Triples # of Points # of Lines # of Polygons (max/min/avg (max/min/avg points/line) points/polygon) GeoNames 45MB 400K 22K - - Dbpedia 89MB 430K 8K - - LGD 29MB 150K - 12K (1.6K/2/21) - GAG 33MB 4K - - 325 (15K/4/400) CLC 401MB 630K - - 45K (5K/4/140) Hotspots 90MB 450K - - 37K (4/4/4) 23/10/2013 7
  • 8. Real-World Workload Parts • For this workload, Geographica has two parts (following Jackpine): Micro part: Tests primitive spatial functions offered by geospatial RDF stores Macro part: Simulates some typical application scenarios • • 23/10/2013 8
  • 9. Real-World Workload Micro part • • 29 SPARQL queries that consist of one or two triple patterns and a spatial function. Functions included: Non-topological: boundary, envelope, convex • • • • hull, buffer, area Topological: equals, intersects, overlaps, crosses, within, distance, disjoint Spatial aggregates: extent, union These functions are used for spatial selections and spatial joins 23/10/2013 9
  • 10. Example – non-topological Micro part • Construct the boundary of all polygons of CLC PREFIX geof: <http://www.opengis.net/def/function/geosparql/ > PREFIX dataset: <http://geographica.di.uoa.gr/dataset/> PREFIX clc: <http://geo.linkedopendata.gr/corine/ontology#> SELECT ( geof:boundary(?o1) as ?ret ) WHERE { GRAPH dataset:clc { ?s1 clc:asWKT ?o1. } 23/10/2013 10 }
  • 11. Example – spatial selection Micro part Find all points in GeoNames that are within a given polygon. PREFIX dataset: <http://geographica.di.uoa.gr/dataset/> PREFIX geonames: <http://www.geonames.org/ontology#> • SELECT ?s1 ?o1 WHERE { GRAPH dataset:geonames { ?s1 geonames:asWKT ?o1 } FILTER( geof:sfWithin(?o1, "POLYGON((…))"^^geo:wktLiteral)). 23/10/2013 } 11
  • 12. Example – spatial join Micro part • Find all pairs of GAG polygons that overlap PREFIX dataset: <http://geographica.di.uoa.gr/dataset/> PREFIX gag: <http://geo.linkedopendata.gr/gag/ontology/> PREFIX clc: <http://geo.linkedopendata.gr/corine/ontology#> SELECT ?s1 ?s2 WHERE { GRAPH dataset:gag {?s1 gag:asWKT ?o1} GRAPH dataset:clc {?s2 clc:asWKT ?o2} FILTER( geof:sfOverlaps(?o1, ?o2) ) 23/10/2013 12 }
  • 13. Real-World Workload Micro part • Spatial Selections Query Point Points Query Line Within Buffer Distance Query Polygon Within Disjoint Lines Intersects Disjoint Polygons • Equals Crosses Intersects Equals Overlaps Points Lines Polygons Equals Intersects Intersects Within Spatial Joins Points Lines Intersects Within Crosses Polygons Within Touches Overlaps 23/10/2013 13
  • 14. Real-World Workload Macro part: Scenarios • Reverse Geocoding 23/10/2013 14
  • 15. Real-World Workload Macro part: Scenarios • • Reverse Geocoding Web Map Search and Browsing 23/10/2013 15
  • 16. Real-World Workload Macro part: Scenarios • • • Reverse Geocoding Web Map Search and Browsing Rapid Mapping for Fire Monitoring 23/10/2013 16
  • 17. Outline • • Motivation The benchmark Geographica Real-world workload Synthetic workload Evaluating the performance of geospatial RDF stores using Geographica Conclusions • • • • 23/10/2013 17
  • 18. Synthetic Workload • Goal: Evaluate performance in a controlled environment where we can vary the thematic and spatial selectivity of queries Thematic selectivity: the fraction of the total geographic features of a dataset that satisfy the nonspatial part of a query Spatial selectivity: the fraction of the total geographic features of a dataset which satisfy the topological relation in the FILTER clause of a query • • 23/10/2013 18
  • 19. Synthetic Workload Generator • Dataset: As in VESPA, the produced datasets are geographic features on a synthetic map: States in a country ((n/3)2) Land ownership (n2) Roads (n) POI (n2) • • • • 23/10/2013 19
  • 20. Synthetic Workload Ontology • • Based roughly on the ontology of OpenStreetMap and the GeoSPARQL vocabulary Tagging each feature with a key enables us to select a known fraction of features in a uniform way 23/10/2013 20
  • 21. Synthetic Workload Query template for spatial selections SELECT ?s WHERE { ?s ns:hasGeometry ?g. ?s c:hasTag ?tag. ?g ns:asWKT ?wkt. ?tag ns:hasKey “THEMA” FILTER(FUNCTION(?wkt, “GEOM”^^geo:wktLiteral))} • Parameters: ns: specifies the kind of feature (and geometry type) examined FUNCTION: specifies the topological function examined THEMA: defines the thematic selectivity of the query using another parameter k GEOM: specifies a rectangle that controls the spatial selectivity of the query • • • • 23/10/2013 21
  • 22. Synthetic Workload Query template for spatial joins SELECT ?s1 ?s2 WHERE { ?s1 ns1:hasGeometry ?g1. ?s1 ns1:hasTag ?tag1. ?g1 ns1:asWKT ?wkt1. ?tag1 ns1:hasKey “THEMA” . ?s2 ns2:hasGeometry ?g2. ?s2 ns2:hasTag ?tag2. ?g2 ns2:asWKT ?wkt2. ?tag2 ns2:hasKey “THEMA’” . FILTER(FUNCTION(?wkt1, ?wkt2))} 23/10/2013 22
  • 23. Outline • • Motivation The benchmark Geographica Real-world workload Synthetic workload Evaluating the performance of geospatial RDF stores using Geographica Conclusions • • • • 23/10/2013 23
  • 24. Experimental Setup • • • Geospatial RDF stores tested: Strabon, Parliament, uSeekM Machine: Intel Xeon E5620, 12MB L3 cache, 2.4GHz, 24GB RAM, 4 HDD with RAID-5 Micro part (real-world workload) & synthetic workload: Metric: response time Run 3 times and compute the median Time out: 1 hour Run both on warm caches and cold caches Macro part (real-world workload) : Run many instantiations of each scenario for one hour without cleaning caches Metric: Average time for a complete execution • • • • • • • 23/10/2013 24
  • 25. Results Real Workload - micro part (cold caches) 23/10/2013 25
  • 26. Results Macro part Scenario Strabon uSeekM Parliament Reverse Geocoding 65 sec 0.77 sec 2.6 sec Map Search and Browsing 0.9 sec 0.6 sec 22.2 sec Rapid Mapping for Fire Monitoring 207.4 sec - - 23/10/2013 26
  • 27. Results Synthetic Workload • We generate the synthetic dataset with n=512. This results in: 28,900 states 262,144 land ownerships 512 roads 262,144 points of interest • • • • • Size: 3,880,224 triples (745 MB) 23/10/2013 27
  • 28. Results Synthetic Workload – spatial selections Intersects Tag 1, cold caches 23/10/2013 Intersects Tag 512, cold caches 28
  • 29. Results Synthetic Workload - Spatial Joins Touches 23/10/2013 29
  • 30. Conclusions • We defined Geographica, a new comprehensive benchmark for geospatial RDF stores, and used it to compare 3 relevant systems Strabon Parliament uSeekM • • • • Two workloads: real-world and synthetic 23/10/2013 30
  • 31. Future Work • • • • • • Capture the full GeoSPARQL standard. Study scaling issues with larger datasets. Add more application scenarios Extent the generator to produce datasets that do not follow a uniform distribution. Extend the benchmark to include time-evolving geospatial data. 23/10/2013 31
  • 32. Thanks!  Geographica: http://geographica.di.uoa.gr  This work was supported in part by the European Commission project TELEIOS http://www.earthobservatory.eu Any Questions? 23/10/2013 32