5/31/2011




Consuming Open and Linked Data with Open Source Tools
Or, how to do something useful with all this free data

Jo Cook, Geospatial Consultant, Astun Technology
Talk Outline

 ● The data and the distribution technologies
   explained
 ● The software stack explained
 ● Actually getting some data and doing something
   with it
 ● Gotchas
 ● Further resources and links
The Data and Technologies




           Point in Latitude/Longitude format (WGS84)
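
A point like this is usually handed to a spatial database as well-known text (WKT). The following is a minimal sketch, assuming the target is PostGIS and its EWKT input form; the helper name to_ewkt is hypothetical and not from the talk. The detail worth illustrating is that WKT orders coordinates as x y, so longitude comes first, the reverse of the spoken "lat/long" order.

```python
def to_ewkt(lat: float, lon: float, srid: int = 4326) -> str:
    """Return an EWKT point string for a WGS84 latitude/longitude pair.

    Hypothetical helper for illustration only. EPSG:4326 is the usual
    SRID for WGS84; WKT puts x (longitude) before y (latitude).
    """
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return f"SRID={srid};POINT({lon} {lat})"
```

The range checks are a cheap way to catch swapped arguments for most UK data, where a latitude above 49 passed as a longitude would still be accepted but a longitude passed as latitude often would not; they are a convenience, not a full validation.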
The Data and Technologies
PREFIX naptan: <http://transport.data.gov.uk/def/naptan/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>
SELECT DISTINCT ?stopPoint_airport ?name ?iataRef ?lat ?long
WHERE {
  ?item a naptan:StopArea ;
    naptan:administrativeArea <http://transport.data.gov.uk/id/administrative-area/145> ;
    skos:prefLabel ?name ;
    geo:lat ?lat ;
    geo:long ?long ;
    naptan:stopPoint [ naptan:airport ?stopPoint_airport ] .
  ?stopPoint_airport naptan:iataRef ?iataRef .
}
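
A query like this can be sent to a SPARQL endpoint over plain HTTP. The sketch below is one possible way to do it, not the talk's own script: it uses only the Python standard library rather than the SPARQLWrapper library listed in the resources, the function names are hypothetical, and the endpoint URL is left as a parameter because the slides do not state one. It assumes the endpoint can return the W3C SPARQL JSON results format.

```python
import json
import urllib.parse
import urllib.request


def build_request(endpoint: str, query: str) -> urllib.request.Request:
    """Build a GET request carrying the query, asking for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def rows_from_results(payload: str) -> list[dict[str, str]]:
    """Flatten SPARQL JSON results into one plain dict per result row.

    Each binding maps a variable name to an object with a "value" key;
    only that value is kept, as a string.
    """
    doc = json.loads(payload)
    return [
        {var: cell["value"] for var, cell in binding.items()}
        for binding in doc["results"]["bindings"]
    ]


def run_query(endpoint: str, query: str, timeout: float = 30.0) -> list[dict[str, str]]:
    """Send the query and return the flattened rows (needs network access)."""
    with urllib.request.urlopen(build_request(endpoint, query), timeout=timeout) as resp:
        return rows_from_results(resp.read().decode("utf-8"))
```

With the query above, each returned dict would be keyed by the selected variables (stopPoint_airport, name, iataRef, lat, long). All values arrive as strings, so lat and long would need converting with float() before being stored as numbers.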
The Data and Technologies




http://wallpapertree.com/wallpaper/Sad-Kitten/
The Data and Technologies




http://freeimagesarchive.com
The Data and Technologies




1m4ge.blogspot.com
Actually doing something with the data
Gotchas




smashingmagazine.com
Mitigation



Keep your data here so you're not relying directly on the service for updates to customers

(Maybe even keep it here too)
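
The idea of keeping a local copy can be sketched as a small refresh routine that only overwrites the stored data when the remote service actually delivers something. This is an illustration under stated assumptions, not the talk's implementation: it uses SQLite from the Python standard library as a stand-in for a real database server so that it runs anywhere, and the airports table, its columns, and the function names are all hypothetical.

```python
import sqlite3
from typing import Callable, Iterable

# Hypothetical row layout: (iata_ref, name, lat, lon)
Row = tuple[str, str, float, float]


def init_cache(conn: sqlite3.Connection) -> None:
    """Create the local copy of the data if it does not exist yet."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS airports ("
        "iata_ref TEXT PRIMARY KEY, name TEXT, lat REAL, lon REAL)"
    )


def refresh_cache(conn: sqlite3.Connection, fetch: Callable[[], Iterable[Row]]) -> bool:
    """Replace the cached rows with fresh ones; keep the old rows on failure.

    `fetch` is whatever callable talks to the remote service. Any exception
    from it (outage, changed API) or an empty result leaves the cache as is.
    """
    try:
        rows = list(fetch())
    except Exception:  # deliberately broad: any service failure keeps old data
        return False
    if not rows:
        return False
    with conn:  # single transaction: commit on success, roll back on error
        conn.execute("DELETE FROM airports")
        conn.executemany("INSERT INTO airports VALUES (?, ?, ?, ?)", rows)
    return True
```

Applications then read only from the local table, so an outage or a changed response format upstream shows up as stale data rather than as a broken page. Treating an empty result as a failure is a design choice made here to avoid wiping the cache when a service returns nothing by mistake.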
Conclusions
A dash of all these, some readily available Python libraries, and some relatively simple scripting gets you useful data for your own applications...

... and by using PostgreSQL to store your data, you can provide a buffer in case of API changes or outages...

... and even do some nifty manipulation to analyse and present the data differently!
Conclusions




        Simples!
Conclusions
Not *that* simple, but...
Resources
Linked Data Resources and Example Queries

Best API for transport data: http://openuplabs.tso.co.uk/
O'Reilly Book: http://www.learningsparql.com/
Blog: http://talis-systems.com/blog/
Airport Query: http://getthedata.org/questions/137/where-can-i-get-a-list-of-all-airports-codes-names-locations
Beginner Guide: http://www.pezholio.co.uk/2011/01/a-beginners-guide-to-sparqling-linked-data-part-1/

NHS Choices - RSS

http://www.nhs.uk/servicedirectories/Pages/ServiceSearch.aspx

Police Data - JSON/REST

API: http://policeapi2.rkh.co.uk/api/docs/
CSV/SHP: http://www.police.uk/data

Basic Software Resources

MapServer dynamic charting: http://mapserver.org/output/dynamic_charting.html
Python SPARQLWrapper: http://sparql-wrapper.sourceforge.net/
Python SimpleJSON: https://github.com/simplejson/simplejson
Python PyGreSQL: http://www.pygresql.org/
Loader: https://github.com/AstunTechnology/Loader
Thank You!




         www.archaeogeek.com
         @archaeogeek
         jocook@astuntechnology.com
