SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
Smart City data via
LOD/LOG Service
P. Bellini, P. Nesi, N. Rauch
Dipartimento di Ingegneria dell’Informazione, DINFO
Università degli studi di Firenze
Via S. Marta 3, 50139, Firenze, Italy
tel: +39-055-4796567,
fax: +39-055-4796363

DISIT Lab
http://www.disit.dinfo.unifi.it/ alias http://www.disit.org
nadia.rauch@unifi.it
Slides for: LOD2014 event.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

1
Research objectives
• Why: Create an ontology that allows to combine
all data provided by the city of Florence and the
Tuscan region.
• Problems: data have different formats, they must
be reconciled in order to be effectively
interconnected to each other, but sometimes
information is incomplete.
• Objective: take advantage of the created
repository and ontology to implement new
integrated services related to mobility; to provide
repository access to SMEs to create new services.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

2
Analysis of Available Data
•
•
•
•
•
•
•
•
•

519 OpenData (Municipality of Florence)
145 OpenData (Tuscany Region)
LPT Timetable and LPT Route
Street Graph
Points of Interest
Real Time Data from traffic sensors
Real Time Data from parking sensors
Real Time Data from AVM systems
Weather Forecast (consortium Lamma)
DISIT Lab (DINFO UNIFI), 20-21/02/2014

3
DataSet already integrated

• From MIIC web services (real time)
o
o
o
o

Parking payloadPublication (updated every h)
Traffic sensors payloadPublication (updated every 5-10min)
AVM client pull service (updated every 24h)
Street Graph

• From Municipality of Florence:
o Tram line: KMZ file that represents the path of tram in Florence
o Statistics on monthly access to the LTZ, tourist arrivals per year, annual
sales of bus tickets, accidents per year for every street, number of
vehicles per year
o Municipality of Florence resolutions

• From Tuscany Region:
o Museums, monuments, theaters, libraries, banks, courier services,
police, firefighters, restaurants, pubs, bars, pharmacies, airports, schools,
universities, sports facilities, hospitals, emergency rooms, doctors'
offices, government offices, hotels and many other categories
o Weather forecast of the consortium Lamma (updated twice a day)
DISIT Lab (DINFO UNIFI), 20-21/02/2014

4
Ontology’ Macroclasses
• Maps and Geographical information: formed by
classes Road, Node, RoadElement, AdministrativeRoad,
Milestone, StreetNumber, RoadLink, Junction, Entry,
and EntryRule, Manoeuver, is used to represent the
entire road system of Tuscany region.
• Point of Interest: economical services (public and
privates), activities, which may be useful to the citizen
and who may have the need to search for and to arrive
at. Classification will be based on the division into
categories planned at regional level.
• Weather: including status and forecasts from the
consortium Lamma in Tuscany.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

5
Ontology’ Macroclasses
• Transport: data coming from major LPT companies
including scheduled times, the rail graph, data relating to
real time passage at bus stops. Classes: bus line, Ride,
Route, record, RouteSection, BusStopForeast, RouteLink.
• Sensors: concerning data coming from sensors; they may
include information such as pressure, humidity, pollution,
car flow, car velocity, number of passed cars and tracks, etc.
• Administration: includes information coming from public
administrations such as resolutions issued by each
administration, planned events, changes in the traffic
arrangement, planned VIP visits, sports events, etc.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

6
Maps Macroclass

• RoadElement: delimited by a start node and an end node
(ObjectProperties "starts" e "ends");
• Road: composed by RoadElement and Node ("contains")
• AdministrativeRoad: connected to RoadElement
(“isComposed” e “forming”), to Road (“coincideWith”).
Road : AdministrativeRoad = N:M. Both in a 1:N relation
with RoadElement;
• EntryRule: connected to RoadElement ("hasRule",
"accessTo ");
• Maneouvre: linked to EntryRule ("isDescribed").
Described through "hasFirstElem", "hasSecondElem" and
"hasThirdElem". "concerning" fastes a maneouvre to the
concerned junction.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

7
Maps Macroclass

• Node: georeferenced through geo:lat and geo:long.
• Milestone: associated with 1 AdministrativeRoad
("placedIn"), georeferenced through geo:lat and
geo:long.
• StreetNumber: always related to at least 1entry (internal
or external). Connected to RoadElement and Road
("standsIn" and "belongTo"); reverse:"hasStreetNumber".
• Entry: connected to StreetNumber through
"hasInternalAccess" and "hasExternalAccess", with
cardinality restrictions, subclass of geo:SpatialThing,
maximum cardinality restriction 1 to geo:lat and geo:long
• "ownerAuthority" and "managingAuthority": linked to PA
macroclass.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

8
Maps Macroclass
subClassOf

otn:Geometric
otn:Edge

subClassOf
subClassOf
otn:Node
subClassOf

Junction

AdministrativeRoad
Milestone
ending
situated

hasSegment

RoadLink

isComposed
coincideWith

Road

subClassOf
starting

placedIn

forming

belongTo
isPartOf
contains

hasStreetNumber
StreetNumeber

ends
RoadElement

hasInternalAccess
hasEsternalAccess

Entry

Node

starts

concerning
subClassOf

hasFirstElem

subClassOf

hasSecondElem
hasThirdElem

Maneuver

otn:Road

subClassOf

otn:Maneuver

otn:Road_Element
hasRule

EntryRule

accessTo

isDescribed

DISIT Lab (DINFO UNIFI), 20-21/02/2014

9
Reused Vocabulary
• OTN: an ontology of traffic networks that is more
or less a direct encoding of GDF (Geographic Data
Files) in OWL;
• dcterms: set of properties and classes maintained
by the Dublin Core Metadata Initiative;
• foaf: dedicated to the description of the relations
between people or groups;
• vCard: for a description of people and
organizations;
• wgs84_pos: vocabulary representing latitude and
longitude, with the WGS84 Datum, of geo-objects.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

10
Macroclasses’ Connections

DISIT Lab (DINFO UNIFI), 20-21/02/2014

11
DISIT Lab (DINFO UNIFI), 20-21/02/2014

12
From Open Data to Triples
• Phase 1: collect data from different sources
(MIIC Web Service, Osservatorio dei Trasporti
e della Mobilita’ portal, Municipality of
Florence and Tuscany Region Web Sites).
• Phase 2: first processing means ETL tool and
NoSQL database storage.
• Phase 3: second transformation using ETL
tools and RDF triples creation.
• Phase 4: Saving triple in RDF store.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

13
Helpful Tools
• ETL Trasformation

• To realize the R2RML model
• RDF Store
DISIT Lab (DINFO UNIFI), 20-21/02/2014

14
Architecture

• To automate the different phases, we have
created an architecture that includes a process
scheduler.
• The process scheduler implementation was
necessary to repeat the 4 phases, from ingestion
to transformation in triple.
• We storing data in Hbase according to a
programmed rate, which is closely linked to the
type of data (static/real time):
o Real-time data: every 10min;
o Other data: 2 - 15 times a day;
o Static data: once a month or more.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

15
Architecture’ Block Diagram

DISIT Lab (DINFO UNIFI), 20-21/02/2014

16
DISIT Lab (DINFO UNIFI), 20-21/02/2014

17
Data Validation & Reconciliation
• Major problems with the data:
o inconsistent data (different municipality to the same
service, city names that are not a municipality)
o missing data (street number)
o incorrect data (spelling errors)

• Need to validate the data, but above all to
reconcile them to be able to connect with each
other:
o Service – Street Name Reconciliation
o Service – Coordinate Reconciliation
DISIT Lab (DINFO UNIFI), 20-21/02/2014

18
Reconciliation Numbers
• Services: ~ 30.100 (all over Tuscan region) of
which:
o Geolocalized Services: ~ 12.400
o Services located at street level: ~ 8.300

• Remaining Services: ~ 9.000 of which:
o Non-unique results to locate the service at street level
o Street Number missing
o Unusual letters in municipality names or street names
o Address does not exist on Street Graph: ~ 2.200 (next
step: use the Google geocoding API)
DISIT Lab (DINFO UNIFI), 20-21/02/2014

19
Real Time Data Numbers
• Weather: 286 files uploaded twice a day 
270,000 Hbase rows/month  ~4 million
triples/month;
• Sensors: 126 active sensors  18.000 Hbase
rows/day, 50 supervised parking 
~10GB/month;
• Street Graph: 68M triples.
• For an amount of ~ 80MTriples on repository
DISIT Lab (DINFO UNIFI), 20-21/02/2014

20
App Examples
• Linked Open Graph (LOG): a tool developed to
allow exploring semantic graph of the relation
among the entities. It can be used to access to
many different LOD repository.
(http://log.disit.org/)
• Maps: service based on OpenStreetMaps that
allows to search services available in a preset
range from the selected bus stop.
(http://servicemap.sii-mobility.org/)
DISIT Lab (DINFO UNIFI), 20-21/02/2014

21
http://log.disit.org

DISIT Lab (DINFO UNIFI), 20-21/02/2014

22
http://servicemap.sii-mobility.org

DISIT Lab (DINFO UNIFI), 20-21/02/2014

23
Future Works
• Integration of rail graph into the ontology;
• Insertion of other static datasets from the
municipality of Florence and other Tuscany PA;
• Using Google Geocoding API to finish services
reconciliation;
• Improvement of services’ list and their
geolocation;
• Creation of other apps that suggest to SME
and PA how to use data.
DISIT Lab (DINFO UNIFI), 20-21/02/2014

24
DISIT Lab (DINFO UNIFI), 20-21/02/2014

25

Mais conteúdo relacionado

Destaque

Il cloud per l’accelerazione del business delle PMI: il progetto Icaro
Il cloud per l’accelerazione del business delle PMI: il progetto Icaro Il cloud per l’accelerazione del business delle PMI: il progetto Icaro
Il cloud per l’accelerazione del business delle PMI: il progetto Icaro Paolo Nesi
 
Overview of Social Networks
Overview of Social NetworksOverview of Social Networks
Overview of Social NetworksPaolo Nesi
 
Augmented Reality and Sport
Augmented Reality and SportAugmented Reality and Sport
Augmented Reality and SportPaolo Nesi
 
Le tecnologie del social learning e sistemi mobili
Le tecnologie del social learning e sistemi mobiliLe tecnologie del social learning e sistemi mobili
Le tecnologie del social learning e sistemi mobiliPaolo Nesi
 
Metadata Quality assessment tool for Open Access
Metadata Quality assessment tool for Open AccessMetadata Quality assessment tool for Open Access
Metadata Quality assessment tool for Open AccessPaolo Nesi
 
Introduzione al Cloud - Progetto ICARO
Introduzione al Cloud - Progetto ICAROIntroduzione al Cloud - Progetto ICARO
Introduzione al Cloud - Progetto ICAROPaolo Nesi
 
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...Paolo Nesi
 
Improving the Search Experience in a Social Network with Cross Media Contents
Improving the Search Experiencein a Social Network with Cross Media ContentsImproving the Search Experiencein a Social Network with Cross Media Contents
Improving the Search Experience in a Social Network with Cross Media ContentsPaolo Nesi
 
Overview of Distributed Systems course by Paolo Nesi
Overview of Distributed Systems course by Paolo NesiOverview of Distributed Systems course by Paolo Nesi
Overview of Distributed Systems course by Paolo NesiPaolo Nesi
 
Music accessibility for visual impaired
Music accessibility for visual impaired Music accessibility for visual impaired
Music accessibility for visual impaired Paolo Nesi
 
Overview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersOverview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersPaolo Nesi
 

Destaque (11)

Il cloud per l’accelerazione del business delle PMI: il progetto Icaro
Il cloud per l’accelerazione del business delle PMI: il progetto Icaro Il cloud per l’accelerazione del business delle PMI: il progetto Icaro
Il cloud per l’accelerazione del business delle PMI: il progetto Icaro
 
Overview of Social Networks
Overview of Social NetworksOverview of Social Networks
Overview of Social Networks
 
Augmented Reality and Sport
Augmented Reality and SportAugmented Reality and Sport
Augmented Reality and Sport
 
Le tecnologie del social learning e sistemi mobili
Le tecnologie del social learning e sistemi mobiliLe tecnologie del social learning e sistemi mobili
Le tecnologie del social learning e sistemi mobili
 
Metadata Quality assessment tool for Open Access
Metadata Quality assessment tool for Open AccessMetadata Quality assessment tool for Open Access
Metadata Quality assessment tool for Open Access
 
Introduzione al Cloud - Progetto ICARO
Introduzione al Cloud - Progetto ICAROIntroduzione al Cloud - Progetto ICARO
Introduzione al Cloud - Progetto ICARO
 
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...
Sistemi Distribuiti part 5: P2P systems: from simple to distributed P2P trust...
 
Improving the Search Experience in a Social Network with Cross Media Contents
Improving the Search Experiencein a Social Network with Cross Media ContentsImproving the Search Experiencein a Social Network with Cross Media Contents
Improving the Search Experience in a Social Network with Cross Media Contents
 
Overview of Distributed Systems course by Paolo Nesi
Overview of Distributed Systems course by Paolo NesiOverview of Distributed Systems course by Paolo Nesi
Overview of Distributed Systems course by Paolo Nesi
 
Music accessibility for visual impaired
Music accessibility for visual impaired Music accessibility for visual impaired
Music accessibility for visual impaired
 
Overview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersOverview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for Beginners
 

Último

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 

Último (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 

Smart City data via LOD/LOG Service

  • 1. Smart City data via LOD/LOG Service P. Bellini, P. Nesi, N. Rauch Dipartimento di Ingegneria dell’Informazione, DINFO Università degli studi di Firenze Via S. Marta 3, 50139, Firenze, Italy tel: +39-055-4796567, fax: +39-055-4796363 DISIT Lab http://www.disit.dinfo.unifi.it/ alias http://www.disit.org nadia.rauch@unifi.it Slides for: LOD2014 event. DISIT Lab (DINFO UNIFI), 20-21/02/2014 1
  • 2. Research objectives • Why: Create an ontology that allows to combine all data provided by the city of Florence and the Tuscan region. • Problems: data have different formats, they must be reconciled in order to be effectively interconnected to each other, but sometimes information is incomplete. • Objective: take advantage of the created repository and ontology to implement new integrated services related to mobility; to provide repository access to SMEs to create new services. DISIT Lab (DINFO UNIFI), 20-21/02/2014 2
  • 3. Analysis of Available Data • • • • • • • • • 519 OpenData (Municipality of Florence) 145 OpenData (Tuscany Region) LPT Timetable and LPT Route Street Graph Points of Interest Real Time Data from traffic sensors Real Time Data from parking sensors Real Time Data from AVM systems Weather Forecast (consortium Lamma) DISIT Lab (DINFO UNIFI), 20-21/02/2014 3
  • 4. DataSet already integrated • From MIIC web services (real time) o o o o Parking payloadPublication (updated every h) Traffic sensors payloadPublication (updated every 5-10min) AVM client pull service (updated every 24h) Street Graph • From Municipality of Florence: o Tram line: KMZ file that represents the path of tram in Florence o Statistics on monthly access to the LTZ, tourist arrivals per year, annual sales of bus tickets, accidents per year for every street, number of vehicles per year o Municipality of Florence resolutions • From Tuscany Region: o Museums, monuments, theaters, libraries, banks, courier services, police, firefighters, restaurants, pubs, bars, pharmacies, airports, schools, universities, sports facilities, hospitals, emergency rooms, doctors' offices, government offices, hotels and many other categories o Weather forecast of the consortium Lamma (updated twice a day) DISIT Lab (DINFO UNIFI), 20-21/02/2014 4
  • 5. Ontology’ Macroclasses • Maps and Geographical information: formed by classes Road, Node, RoadElement, AdministrativeRoad, Milestone, StreetNumber, RoadLink, Junction, Entry, and EntryRule, Manoeuver, is used to represent the entire road system of Tuscany region. • Point of Interest: economical services (public and privates), activities, which may be useful to the citizen and who may have the need to search for and to arrive at. Classification will be based on the division into categories planned at regional level. • Weather: including status and forecasts from the consortium Lamma in Tuscany. DISIT Lab (DINFO UNIFI), 20-21/02/2014 5
  • 6. Ontology’ Macroclasses • Transport: data coming from major LPT companies including scheduled times, the rail graph, data relating to real time passage at bus stops. Classes: bus line, Ride, Route, record, RouteSection, BusStopForeast, RouteLink. • Sensors: concerning data coming from sensors; they may include information such as pressure, humidity, pollution, car flow, car velocity, number of passed cars and tracks, etc. • Administration: includes information coming from public administrations such as resolutions issued by each administration, planned events, changes in the traffic arrangement, planned VIP visits, sports events, etc. DISIT Lab (DINFO UNIFI), 20-21/02/2014 6
  • 7. Maps Macroclass • RoadElement: delimited by a start node and an end node (ObjectProperties "starts" e "ends"); • Road: composed by RoadElement and Node ("contains") • AdministrativeRoad: connected to RoadElement (“isComposed” e “forming”), to Road (“coincideWith”). Road : AdministrativeRoad = N:M. Both in a 1:N relation with RoadElement; • EntryRule: connected to RoadElement ("hasRule", "accessTo "); • Maneouvre: linked to EntryRule ("isDescribed"). Described through "hasFirstElem", "hasSecondElem" and "hasThirdElem". "concerning" fastes a maneouvre to the concerned junction. DISIT Lab (DINFO UNIFI), 20-21/02/2014 7
  • 8. Maps Macroclass • Node: georeferenced through geo:lat and geo:long. • Milestone: associated with 1 AdministrativeRoad ("placedIn"), georeferenced through geo:lat and geo:long. • StreetNumber: always related to at least 1entry (internal or external). Connected to RoadElement and Road ("standsIn" and "belongTo"); reverse:"hasStreetNumber". • Entry: connected to StreetNumber through "hasInternalAccess" and "hasExternalAccess", with cardinality restrictions, subclass of geo:SpatialThing, maximum cardinality restriction 1 to geo:lat and geo:long • "ownerAuthority" and "managingAuthority": linked to PA macroclass. DISIT Lab (DINFO UNIFI), 20-21/02/2014 8
  • 10. Reused Vocabulary • OTN: an ontology of traffic networks that is more or less a direct encoding of GDF (Geographic Data Files) in OWL; • dcterms: set of properties and classes maintained by the Dublin Core Metadata Initiative; • foaf: dedicated to the description of the relations between people or groups; • vCard: for a description of people and organizations; • wgs84_pos: vocabulary representing latitude and longitude, with the WGS84 Datum, of geo-objects. DISIT Lab (DINFO UNIFI), 20-21/02/2014 10
  • 11. Macroclasses’ Connections DISIT Lab (DINFO UNIFI), 20-21/02/2014 11
  • 12. DISIT Lab (DINFO UNIFI), 20-21/02/2014 12
  • 13. From Open Data to Triples • Phase 1: collect data from different sources (MIIC Web Service, Osservatorio dei Trasporti e della Mobilita’ portal, Municipality of Florence and Tuscany Region Web Sites). • Phase 2: first processing means ETL tool and NoSQL database storage. • Phase 3: second transformation using ETL tools and RDF triples creation. • Phase 4: Saving triple in RDF store. DISIT Lab (DINFO UNIFI), 20-21/02/2014 13
  • 14. Helpful Tools • ETL Trasformation • To realize the R2RML model • RDF Store DISIT Lab (DINFO UNIFI), 20-21/02/2014 14
  • 15. Architecture • To automate the different phases, we have created an architecture that includes a process scheduler. • The process scheduler implementation was necessary to repeat the 4 phases, from ingestion to transformation in triple. • We storing data in Hbase according to a programmed rate, which is closely linked to the type of data (static/real time): o Real-time data: every 10min; o Other data: 2 - 15 times a day; o Static data: once a month or more. DISIT Lab (DINFO UNIFI), 20-21/02/2014 15
  • 16. Architecture’ Block Diagram DISIT Lab (DINFO UNIFI), 20-21/02/2014 16
  • 17. DISIT Lab (DINFO UNIFI), 20-21/02/2014 17
  • 18. Data Validation & Reconciliation • Major problems with the data: o inconsistent data (different municipality to the same service, city names that are not a municipality) o missing data (street number) o incorrect data (spelling errors) • Need to validate the data, but above all to reconcile them to be able to connect with each other: o Service – Street Name Reconciliation o Service – Coordinate Reconciliation DISIT Lab (DINFO UNIFI), 20-21/02/2014 18
  • 19. Reconciliation Numbers • Services: ~ 30.100 (all over Tuscan region) of which: o Geolocalized Services: ~ 12.400 o Services located at street level: ~ 8.300 • Remaining Services: ~ 9.000 of which: o Non-unique results to locate the service at street level o Street Number missing o Unusual letters in municipality names or street names o Address does not exist on Street Graph: ~ 2.200 (next step: use the Google geocoding API) DISIT Lab (DINFO UNIFI), 20-21/02/2014 19
  • 20. Real Time Data Numbers • Weather: 286 files uploaded twice a day  270,000 Hbase rows/month  ~4 million triples/month; • Sensors: 126 active sensors  18.000 Hbase rows/day, 50 supervised parking  ~10GB/month; • Street Graph: 68M triples. • For an amount of ~ 80MTriples on repository DISIT Lab (DINFO UNIFI), 20-21/02/2014 20
  • 21. App Examples • Linked Open Graph (LOG): a tool developed to allow exploring semantic graph of the relation among the entities. It can be used to access to many different LOD repository. (http://log.disit.org/) • Maps: service based on OpenStreetMaps that allows to search services available in a preset range from the selected bus stop. (http://servicemap.sii-mobility.org/) DISIT Lab (DINFO UNIFI), 20-21/02/2014 21
  • 22. http://log.disit.org DISIT Lab (DINFO UNIFI), 20-21/02/2014 22
  • 24. Future Works • Integration of rail graph into the ontology; • Insertion of other static datasets from the municipality of Florence and other Tuscany PA; • Using Google Geocoding API to finish services reconciliation; • Improvement of services’ list and their geolocation; • Creation of other apps that suggest to SME and PA how to use data. DISIT Lab (DINFO UNIFI), 20-21/02/2014 24
  • 25. DISIT Lab (DINFO UNIFI), 20-21/02/2014 25