SlideShare uma empresa Scribd logo
1 de 19
Baixar para ler offline
Visualizing Open Data with Plone
a practical guide on how to query and visualize Linked Open Data
                      with eea.daviz product




                                       Antonio De Marinis
                                         Web Technology Management
                                         European Environment Agency
                                                   www.eea.europa.eu
Linked Data evolution




     2007




                                          as per 2011
     keeps growing...
     > 1 million datasets
     Watch video STRATA conference 2013
Open Data - what is it?

    Open data is a philosophy and practice requiring
    that certain data be freely available to everyone,
    without restrictions from copyright, patents or
    other mechanisms of control.


    Linked Open Data (LOD) or simply Linked
    Data is a technique to interlink all open datasets
    into a web of data, aka semantic web, using
    technologies like RDF and SPARQL.
Linked Data vs classic ODBC
SPARQL query structure

 A SPARQL query comprises, in order:
 ● Prefix declarations, for abbreviating URIs
 ● Dataset definition, stating what RDF graph(s) are being queried
 ● A result clause, identifying what information to return from the query
 ● The query pattern, specifying what to query for in the underlying dataset
 ● Query modifiers, slicing, ordering, and otherwise rearranging query results
 # prefix declarations
 PREFIX foo: <http://example.com/resources/>
 ...
 # dataset definition
 FROM ...
 # result
              clause
 SELECT ...
 # query pattern
 WHERE {
     ...
 }
 # query modifiers
 ORDER BY ...
Real example querying DBpedia


SELECT * WHERE {
?subject rdf:type <http://dbpedia.org/ontology/City>.
?subject rdfs:label ?label.
?subject rdfs:comment ?abstract.
?subject <http://dbpedia.org/ontology/populationTotal> ?populationTotal.
FILTER (lang(?label) = "en" && lang(?abstract) = "en")
} LIMIT 5
Let's dive into a real example


 SELECT * WHERE {
 ?subject rdf:type <http://dbpedia.org/ontology/City>.
 ?subject rdfs:label ?label.
 ?subject rdfs:comment ?abstract.
 ?subject <http://dbpedia.org/ontology/populationTotal> ?populationTotal.
 FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd:
 integer))
 } LIMIT 5
Let's dive into a real example

 PREFIX o: <http://dbpedia.org/ontology/>
 PREFIX p: <http://dbpedia.org/property/>
 PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>

 SELECT DISTINCT * WHERE {
 ?subject a o:City.
 ?subject rdfs:label ?label.
 OPTIONAL {?subject rdfs:comment ?abstract.}
 ?subject p:populationTotal ?populationTotal.
 OPTIONAL {?subject geo:lat ?latitude.}
 OPTIONAL {?subject geo:long ?longitude.}
 FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd:integer
 && ?populationTotal < "60000000"^^xsd:integer))
 }
 ORDER BY DESC(?populationTotal)

      find all properties by exploring dbpedia e.g.
      dbpedia http://dbpedia.org/page/Tokyo

      Example without duplicates http://daviz.eionet.
      europa.eu/data/local-sparql-queries/most-
      populated-cities
Corresponding data visualisation with
Daviz




 We have been able to create a data visualisation of
 open linked data with filters/facets entirely through
 the web in about 10 minutes!

 live demo http://www.eea.europa.eu/sandbox/plog2013/most-populated-cities-
 with-coordinates-plus
Removing redundancies
TIP: In order to get rid of some rednundancy you can use "SAMPLE" or "SELECT DISTINCT"

PREFIX   o: <http://dbpedia.org/ontology/>
PREFIX   p: <http://dbpedia.org/property/>
PREFIX   rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX   geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>

SELECT ?subject (sql:SAMPLE(?subject) as ?city)
(sql:SAMPLE(?label) as ?label)
(sql:SAMPLE(?latitude) as ?latitude)
                                                        live example http://daviz.eionet.europa.
(sql:SAMPLE(?longitude) as ?longitude)
 max(?populationTotal) as ?maxPopulation
                                                        eu/visualisations/most-populated-cities
 max(?rainyDays) as ?rainyDays

WHERE {
?subject a o:City.
?subject rdfs:label ?label.
OPTIONAL {?subject rdfs:comment ?abstract.}
?subject p:populationTotal ?populationTotal.
OPTIONAL {?subject geo:lat ?latitude.}
OPTIONAL {?subject geo:long ?longitude.}
OPTIONAL {?subject p:yearPrecipitationDays ?rainyDays.}
FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd:integer))
}
GROUP BY ?subject
ORDER BY DESC(?maxPopulation)
More examples at Daviz show room
      daviz.eionet.europa.eu
Example 2: Large companies (DBpedia)

 PREFIX o: <http://dbpedia.org/ontology/>
 PREFIX p: <http://dbpedia.org/property/>
 PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>

 SELECT * WHERE {
 ?subject rdf:type <http://dbpedia.org/ontology/Company>.
 ?subject rdfs:label ?label.
 ?subject rdfs:comment ?abstract.
 ?subject p:numEmployees ?employees.
 ?subject o:location ?location.
 ?location geo:lat ?latitude.
 ?location geo:long ?longitude.
 FILTER (lang(?label) = "en" && lang(?abstract) = "en" && ?employees > 10000)
 }
 ORDER BY DESC(?employees)
 LIMIT 20
Example 3: Energy plants (Enipedia)

 SPARQL Endpoint:http://enipedia.tudelft.nl/sparql


 BASE <http://enipedia.tudelft.nl/wiki/>
 PREFIX a: <http://enipedia.tudelft.nl/wiki/>
 PREFIX prop: <http://enipedia.tudelft.nl/wiki/Property:>
 PREFIX cat: <http://enipedia.tudelft.nl/wiki/Category:>
 PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
 select ?Name ?Point ?Generation_capacity where {
 ?powerPlant prop:Country a:Italy .
 ?powerPlant rdfs:label ?Name .
 ?powerPlant prop:Point ?Point .
 ?powerPlant prop:Generation_capacity_electrical_MW ?Generation_capacity .
 }
More resources



 ●   SPARQL endpoints and their status: http://labs.mondeca.
     com/sparqlEndpointsStatus/index.html
 ●   SPARQL tutorial by example: http://www.cambridgesemantics.com/semantic-
     university/sparql-by-example
 ●   eea.sparql package gives you a sparql client and data holder for plone
     available on pypi
 ●   eea.daviz bundle includes eea.sparql and the visualisations tools
Data table manipulation via drag and drop
Modular framework
EEA Daviz




       And much more...
EEA Daviz




            Live Demo
EEA Daviz - Resources


More live examples

   ○   Eionet
       http://daviz.eionet.europa.eu


   ○   EEA
       http://www.eea.europa.eu/data-and-maps/daviz

Mais conteúdo relacionado

Semelhante a Visualize open data with Plone - eea.daviz PLOG 2013

RDF Analytics... SPARQL and Beyond
RDF Analytics... SPARQL and BeyondRDF Analytics... SPARQL and Beyond
RDF Analytics... SPARQL and Beyond
Fadi Maali
 
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
Open (linked) bibliographic data   edmund chamberlain (university of cambridge)Open (linked) bibliographic data   edmund chamberlain (university of cambridge)
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
RDTF-Discovery
 

Semelhante a Visualize open data with Plone - eea.daviz PLOG 2013 (20)

Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
The Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLThe Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQL
 
Linked Data in Learning Analytics Tools
Linked Data in Learning Analytics ToolsLinked Data in Learning Analytics Tools
Linked Data in Learning Analytics Tools
 
Accessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBCAccessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBC
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
 
Data access
Data accessData access
Data access
 
Linked Open Data (LOD) part 2
Linked Open Data (LOD)  part 2Linked Open Data (LOD)  part 2
Linked Open Data (LOD) part 2
 
Functional manipulations of large data graphs 20160601
Functional manipulations of large data graphs 20160601Functional manipulations of large data graphs 20160601
Functional manipulations of large data graphs 20160601
 
RDF Analytics... SPARQL and Beyond
RDF Analytics... SPARQL and BeyondRDF Analytics... SPARQL and Beyond
RDF Analytics... SPARQL and Beyond
 
Semantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorialSemantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorial
 
Consuming open and linked data with open source tools
Consuming open and linked data with open source toolsConsuming open and linked data with open source tools
Consuming open and linked data with open source tools
 
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
 
Information Intermediaries
Information IntermediariesInformation Intermediaries
Information Intermediaries
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
Introduction to SPARQL
Introduction to SPARQLIntroduction to SPARQL
Introduction to SPARQL
 
Introduction to SPARQL
Introduction to SPARQLIntroduction to SPARQL
Introduction to SPARQL
 
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
Open (linked) bibliographic data   edmund chamberlain (university of cambridge)Open (linked) bibliographic data   edmund chamberlain (university of cambridge)
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Visualize open data with Plone - eea.daviz PLOG 2013

  • 1. Visualizing Open Data with Plone a practical guide on how to query and visualize Linked Open Data with eea.daviz product Antonio De Marinis Web Technology Management European Environment Agency www.eea.europa.eu
  • 2. Linked Data evolution 2007 as per 2011 keeps growing... > 1 million datasets Watch video STRATA conference 2013
  • 3. Open Data - what is it? Open data is a philosophy and practice requiring that certain data be freely available to everyone, without restrictions from copyright, patents or other mechanisms of control. Linked Open Data (LOD) or simply Linked Data is a technique to interlink all open datasets into a web of data, aka semantic web, using technologies like RDF and SPARQL.
  • 4. Linked Data vs classic ODBC
  • 5. SPARQL query structure A SPARQL query comprises, in order: ● Prefix declarations, for abbreviating URIs ● Dataset definition, stating what RDF graph(s) are being queried ● A result clause, identifying what information to return from the query ● The query pattern, specifying what to query for in the underlying dataset ● Query modifiers, slicing, ordering, and otherwise rearranging query results # prefix declarations PREFIX foo: <http://example.com/resources/> ... # dataset definition FROM ... # result clause SELECT ... # query pattern WHERE { ... } # query modifiers ORDER BY ...
  • 6. Real example querying DBpedia SELECT * WHERE { ?subject rdf:type <http://dbpedia.org/ontology/City>. ?subject rdfs:label ?label. ?subject rdfs:comment ?abstract. ?subject <http://dbpedia.org/ontology/populationTotal> ?populationTotal. FILTER (lang(?label) = "en" && lang(?abstract) = "en") } LIMIT 5
  • 7. Let's dive into a real example SELECT * WHERE { ?subject rdf:type <http://dbpedia.org/ontology/City>. ?subject rdfs:label ?label. ?subject rdfs:comment ?abstract. ?subject <http://dbpedia.org/ontology/populationTotal> ?populationTotal. FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd: integer)) } LIMIT 5
  • 8. Let's dive into a real example PREFIX o: <http://dbpedia.org/ontology/> PREFIX p: <http://dbpedia.org/property/> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> SELECT DISTINCT * WHERE { ?subject a o:City. ?subject rdfs:label ?label. OPTIONAL {?subject rdfs:comment ?abstract.} ?subject p:populationTotal ?populationTotal. OPTIONAL {?subject geo:lat ?latitude.} OPTIONAL {?subject geo:long ?longitude.} FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd:integer && ?populationTotal < "60000000"^^xsd:integer)) } ORDER BY DESC(?populationTotal) find all properties by exploring dbpedia e.g. dbpedia http://dbpedia.org/page/Tokyo Example without duplicates http://daviz.eionet. europa.eu/data/local-sparql-queries/most- populated-cities
  • 9. Corresponding data visualisation with Daviz We have been able to create a data visualisation of open linked data with filters/facets entirely through the web in about 10 minutes! live demo http://www.eea.europa.eu/sandbox/plog2013/most-populated-cities- with-coordinates-plus
  • 10. Removing redundancies TIP: In order to get rid of some rednundancy you can use "SAMPLE" or "SELECT DISTINCT" PREFIX o: <http://dbpedia.org/ontology/> PREFIX p: <http://dbpedia.org/property/> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> SELECT ?subject (sql:SAMPLE(?subject) as ?city) (sql:SAMPLE(?label) as ?label) (sql:SAMPLE(?latitude) as ?latitude) live example http://daviz.eionet.europa. (sql:SAMPLE(?longitude) as ?longitude) max(?populationTotal) as ?maxPopulation eu/visualisations/most-populated-cities max(?rainyDays) as ?rainyDays WHERE { ?subject a o:City. ?subject rdfs:label ?label. OPTIONAL {?subject rdfs:comment ?abstract.} ?subject p:populationTotal ?populationTotal. OPTIONAL {?subject geo:lat ?latitude.} OPTIONAL {?subject geo:long ?longitude.} OPTIONAL {?subject p:yearPrecipitationDays ?rainyDays.} FILTER (lang(?label) = "en" && lang(?abstract) = "en" && (?populationTotal >= "5000000"^^xsd:integer)) } GROUP BY ?subject ORDER BY DESC(?maxPopulation)
  • 11. More examples at Daviz show room daviz.eionet.europa.eu
  • 12. Example 2: Large companies (DBpedia) PREFIX o: <http://dbpedia.org/ontology/> PREFIX p: <http://dbpedia.org/property/> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> SELECT * WHERE { ?subject rdf:type <http://dbpedia.org/ontology/Company>. ?subject rdfs:label ?label. ?subject rdfs:comment ?abstract. ?subject p:numEmployees ?employees. ?subject o:location ?location. ?location geo:lat ?latitude. ?location geo:long ?longitude. FILTER (lang(?label) = "en" && lang(?abstract) = "en" && ?employees > 10000) } ORDER BY DESC(?employees) LIMIT 20
  • 13. Example 3: Energy plants (Enipedia) SPARQL Endpoint:http://enipedia.tudelft.nl/sparql BASE <http://enipedia.tudelft.nl/wiki/> PREFIX a: <http://enipedia.tudelft.nl/wiki/> PREFIX prop: <http://enipedia.tudelft.nl/wiki/Property:> PREFIX cat: <http://enipedia.tudelft.nl/wiki/Category:> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> select ?Name ?Point ?Generation_capacity where { ?powerPlant prop:Country a:Italy . ?powerPlant rdfs:label ?Name . ?powerPlant prop:Point ?Point . ?powerPlant prop:Generation_capacity_electrical_MW ?Generation_capacity . }
  • 14. More resources ● SPARQL endpoints and their status: http://labs.mondeca. com/sparqlEndpointsStatus/index.html ● SPARQL tutorial by example: http://www.cambridgesemantics.com/semantic- university/sparql-by-example ● eea.sparql package gives you a sparql client and data holder for plone available on pypi ● eea.daviz bundle includes eea.sparql and the visualisations tools
  • 15. Data table manipulation via drag and drop
  • 17. EEA Daviz And much more...
  • 18. EEA Daviz Live Demo
  • 19. EEA Daviz - Resources More live examples ○ Eionet http://daviz.eionet.europa.eu ○ EEA http://www.eea.europa.eu/data-and-maps/daviz