SlideShare a Scribd company logo
1 of 27
Download to read offline
The One and Many Maps:
Participatory and Temporal
Diversities in OpenStreetMap
Tyng-Ruey Chuang1, Dong-Po Deng1,3, Chun–Chen Hsu1,2,
Rob Lemmens3
!
1Institute

of Information Science, Academia Sinica, Taiwan
2Department of Computer Science and Information Engineering, National Taiwan University
3Faculty of Geo–Information Science and Earth Observation (ITC), University of Twente,
Netherlands

Second ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic
Information (GEOCROWD) 2013 In conjunction with ACM SIGSPATIAL 2013
Background
•

OSM is a wiki-style online mapping
platform in which tens of thousands of
people voluntarily contribute geospatial
data into the making of a global map
(Haklay & Weber, 2008).

•

Its peer production model demonstrates
that more and more mapping activities
are done by the citizens.

•

It represents the success of a collective
form of geospatial content creation.
Collaborative Geospatial Content
Creation
•

OSM is a type of PPGIS, as well as
the characteristics of data
collaboration in OSM as the subjects
of VGI research.

•

The current state of OSM actually is
an assembly of many edits and
updates over a period of time.

•

Every edit or update should be a
meaningful unit in the understanding
of data collaboration activities in
OSM.
Aims
•

We intend to look for ways to systematically and efficiently
discover data collaboration patterns and diversities in OSM

•

It is an initial study of the OSM dataset (at least about the
part of Taiwan) by developing a set of metrics to summarize

•
•
•

user participation, and
spatiotemporal variations of updates in defined areas of
OSM

We hope to see the OSM not as one collective map but as
many overlapping maps concurrently in the making with
each in its own characteristics
OSM Data Model
Node
Way
Open polyline
Closed polyline
Area
Relation
Tag
OSM Data
An example of a Node
<node id='1762782473'
timestamp='2012-12-12T03:49:16Z'
uid='1048' user='dongpo' visible='true'
version='2' changeset='14245247'
lat='23.864527' lon='121.5217101'>
<tag k='name' v='⽴立川漁場' />
<tag k='tourism' v='attraction' />
<tag k='source' v='survey' />
<tag k='addr: housenumber' v='45' />
<tag k='addr:district' v='⿂魚池' />
<tag k='addr:town' v='壽豐鄉' />
<tag k='addr:county' v='花蓮縣' />
</node>
OSM Data
An example of a Way
<way id='118416207'
timestamp='2012-05-23T17:43:06Z'
uid='1048' user='dongpo' visible='true'
version='4' changeset='14246301'>
<nd ref='1088092959' />
<nd ref='1088092953' />
....
<nd ref='1600948228' />
<tag k='highway' v='primary' />
<tag k='lanes' v='2' />
<tag k='oneway' v='yes' />
<tag k='ref' v='Hwy 11C' />
<tag k='ref:zh' v='台11丙線' />
</way>
OSM Data
An example of a Relation
<relation id='2498406'
timestamp='2012-10-14T19:01:55Z'
uid='1048' user='dongpo' visible='true'
version='1' changeset='13497007'>
<member type='way' ref='185846446' role='outer'
<member type='way' ref='185846444' role='outer'
<member type='way' ref='151063000' role='outer'
<member type='way' ref='185846448' role='outer'
<member type='way' ref='185846445' role='outer'
<tag k='admin_level' v='8' />
<tag k='boundary' v='administrative' />
<tag k='name' v='草屯鎮 (Caotun)' />
<tag k='name:en' v='Caotun' />
<tag k='name:zh' v='草屯鎮' />
<tag k='type' v='boundary' />
</relation>

/>
/>
/>
/>
/>
The definitions for the metrics
To measure participatory and temporal differences among cells
in OSM, we define the following functions on Dc. The c in Dc
means in a cell. When the context is clear, we omit the
subscript c and simply write:

where di, 0 < i < n -1 , is a node in the cell, and n
is the total number of nodes.
The definitions for the metrics
For a node di, we write:

where ki is the node id of di,
ti the age,
ui the user id of its contributor, and
pi the position (i.e., the pair of its lat and
lon values).
Note that, by definition, geographically pi is
within the boundary of c for all
.
Node and Mapper Density
In general, we use areac to denote the area covered by a cell
c, and we use popc for the people population in region c.
When it is clear in context, we omit the subscript and
simply write area and pop.
The following measures the densities of nodes, as well as
those of their contributors, i.e., the mappers.
Node age and temporality
Recall the following auxiliary functions for a sequence s

where min s is the minimum of s, max s the maximum of s, s
the average of s, and cv(s) the coefficient of variation for
elements in s.
Node age and temporality(cont.)
The 4-tuple <min t, max t, t, cv(t)> measures the age
characteristics of the nodes in a cell. That is,
Node age and temporality(cont.)
We define a sequence g = (g0, g1, g2,..., gn-2) to measure the gaps
between any two consecutive elements in t. That is,

which is the gap in days between the dates when the two nodes
di+1 and di were added into the cell.
Node age and temporality(cont.)
The 4-tuple <min g, max g, g, cv(g)> measures the
Graphing cells by two metrics
•

As multiple metrics are in use, a cell can be measured in
two metrics and the two results compared.

•

Often we will compare the two sets of measurement over
all cells to see if there are patterns.
Number of mappers and number of
nodes

Figure 1: Distribution of the cells by both mapper
count and node count.

Figure 2: Mapping the cells in Taiwan by their
types. (c.f. Figure 1)
Locations and Cities in Taiwan

Figure 3: Cities in Taiwan
Distributions of Mappers

Figure 4: Spatial distribution of mappers over
area.

Figure 5: Spatial distribution of mappers over
population.
Distributions of Nodes

Figure 6: Spatial distribution of nodes over area.

Figure 7: Spatial distribution of nodes over
population
Node Age Average and Variance

Figure 8: Spatial distribution of average node age.

Figure 9: Spatial distribution of the variance of
node age.
Number of Mappers and Update
Interval

Figure 10: Distribution of the cells by both mapper
count and average time gap between two additions.

Figure 11: Spatial distribution of the cells by their
types. (c.f. Figure 10)
The 80/20 Hypothesis

Figure 12: Distribution of the cells by both mapper
count and ratio of mappers needed for combined
80% node contribution.

Figure 13: Spatial distribution of the cells by their
types. (c.f. Figure 12)
Related work
•

Previous investigations into the data quality issues of OSM have shown
that the OSM dataset can be fairly accurate, and is mostly comparable
to commercial/gov. datasets at least in urban areas [3, 8, 6, 9].

•

Researchers had also developed visual analytics to gain insights into
the spatial diversity of OSM datasets, e. g. to see whether users in
different countries would exhibit distinct mapping activities and habits
[13].

•

These visualization tools can provide valuable information when
improving the data quality of OSM. Neis and Zipf identified active
mappers and casual mappers by examining quantitatively their
contributions in OSM [12].

•

Mooney and Corcoran examined directly the characteristics of “heavily
edited” objects in OpenStreetMap of UK, and they considered these
characteristics might be developed as data quality indicators for
OpenStreetMap in the future [10].
Conclusion and future work
•

This paper is a preliminary study in two sense. We only analyze

•
•

the Taiwan part of OpenStreetMap, and
the cells independently (though spatial distribution is
visualized and discussed).

•

Because of the time constraint, we have not looked into other
geographical areas in OSM.

•

Also, as a mapper may contribute to multiple cells, we ought to
look into mapping activities across the cells. We intend to
pursue these directions in the future.

•

The programs we use to analyze the data are in their early stage
of development, and the way we prepare the data for analysis is
rather ad hoc.
Conclusion and future work
•

We are currently consider how better to structure the
programs so that they can be easily ported and reused.

•

Metrics-based analysis tools like these can be very useful
in improving the data quality in OpenStreetMap as it
helps discover areas where there is participatory or
temporal unevenness in the map making process itself.
Thank for your
attention!
Question?
dongpo@iis.sinica.edu.tw
d.-.p.deng@utwente.nl

To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this paper. For the avoidance of doubt, this work is
released under the CC0 Public Domain Dedication (http://creativecommons.org/publicdomain/zero/1.0/). Anyone can copy, modify, distribute and perform this
work, even for commercial purposes, all without asking permission.

More Related Content

Viewers also liked

Provadeinteligencia 1
Provadeinteligencia 1Provadeinteligencia 1
Provadeinteligencia 1
Milton Bastos
 
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMGTIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
firda_athaya
 
Projekt Kino Úsmev
Projekt Kino ÚsmevProjekt Kino Úsmev
Projekt Kino Úsmev
Lubos Bibig
 
Proyectossegundo
ProyectossegundoProyectossegundo
Proyectossegundo
mpsg
 
Final Report HeroMoto
Final Report HeroMotoFinal Report HeroMoto
Final Report HeroMoto
Vivek Pabla
 
Fuel Up to Play Poem
Fuel Up to Play PoemFuel Up to Play Poem
Fuel Up to Play Poem
Erin Audiss
 

Viewers also liked (19)

Provadeinteligencia 1
Provadeinteligencia 1Provadeinteligencia 1
Provadeinteligencia 1
 
Gülsuyu Bosch Kombi Servisi - 61
Gülsuyu Bosch Kombi Servisi - 61Gülsuyu Bosch Kombi Servisi - 61
Gülsuyu Bosch Kombi Servisi - 61
 
Activación clase 28 de junio
Activación clase 28 de junioActivación clase 28 de junio
Activación clase 28 de junio
 
X body presentación 13.2
X body   presentación 13.2 X body   presentación 13.2
X body presentación 13.2
 
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMGTIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
TIK BAB 2 KLS 9 SMT 1 SMP 18 SMG
 
Egutegi ofiziala
Egutegi ofizialaEgutegi ofiziala
Egutegi ofiziala
 
Kumpulan tips air hangat
Kumpulan tips air hangatKumpulan tips air hangat
Kumpulan tips air hangat
 
Recent Projects
Recent ProjectsRecent Projects
Recent Projects
 
Projekt Kino Úsmev
Projekt Kino ÚsmevProjekt Kino Úsmev
Projekt Kino Úsmev
 
Proyectossegundo
ProyectossegundoProyectossegundo
Proyectossegundo
 
Vedic maths
Vedic mathsVedic maths
Vedic maths
 
Tarta de zanahoria y manzanas
Tarta de zanahoria y manzanas Tarta de zanahoria y manzanas
Tarta de zanahoria y manzanas
 
Final Report HeroMoto
Final Report HeroMotoFinal Report HeroMoto
Final Report HeroMoto
 
Passion: What Software Teams and Executives Can Learn from Eco-Pirates
Passion: What Software Teams and Executives Can Learn from Eco-PiratesPassion: What Software Teams and Executives Can Learn from Eco-Pirates
Passion: What Software Teams and Executives Can Learn from Eco-Pirates
 
Fuel Up to Play Poem
Fuel Up to Play PoemFuel Up to Play Poem
Fuel Up to Play Poem
 
GD1RevisedMagBackCover
GD1RevisedMagBackCoverGD1RevisedMagBackCover
GD1RevisedMagBackCover
 
Tema18
Tema18Tema18
Tema18
 
perkembangan peserta didik
perkembangan peserta didikperkembangan peserta didik
perkembangan peserta didik
 
Mirella Margore
Mirella Margore Mirella Margore
Mirella Margore
 

Similar to 20131106 acm geocrowd

ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
Harry Zhang
 
2016 Poster Launch Models
2016 Poster Launch Models2016 Poster Launch Models
2016 Poster Launch Models
Richard Ottaway
 

Similar to 20131106 acm geocrowd (20)

The One and Many Maps: Participatory and Temporal Diversities in OpenStreetMap
The One and Many Maps: Participatory and Temporal Diversities in OpenStreetMapThe One and Many Maps: Participatory and Temporal Diversities in OpenStreetMap
The One and Many Maps: Participatory and Temporal Diversities in OpenStreetMap
 
Individual movements and geographical data mining. Clustering algorithms for ...
Individual movements and geographical data mining. Clustering algorithms for ...Individual movements and geographical data mining. Clustering algorithms for ...
Individual movements and geographical data mining. Clustering algorithms for ...
 
ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
ZhangTorkkolaLiSchreinerZhangGardnerZhao(04279048)
 
rworldmap: A New R package for Mapping Global Data
rworldmap: A New R package for Mapping Global Datarworldmap: A New R package for Mapping Global Data
rworldmap: A New R package for Mapping Global Data
 
Where Next
Where NextWhere Next
Where Next
 
Massivegraph telecom ppt
Massivegraph telecom pptMassivegraph telecom ppt
Massivegraph telecom ppt
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 
2016 Poster Launch Models
2016 Poster Launch Models2016 Poster Launch Models
2016 Poster Launch Models
 
CI_SIModule_QGIS.pptx .
CI_SIModule_QGIS.pptx                         .CI_SIModule_QGIS.pptx                         .
CI_SIModule_QGIS.pptx .
 
Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2
 
Trajectory Segmentation and Sampling of Moving Objects Based On Representativ...
Trajectory Segmentation and Sampling of Moving Objects Based On Representativ...Trajectory Segmentation and Sampling of Moving Objects Based On Representativ...
Trajectory Segmentation and Sampling of Moving Objects Based On Representativ...
 
On the development of methodology for planning and cost modeling of a wide ar...
On the development of methodology for planning and cost modeling of a wide ar...On the development of methodology for planning and cost modeling of a wide ar...
On the development of methodology for planning and cost modeling of a wide ar...
 
2213ijccms02.pdf
2213ijccms02.pdf2213ijccms02.pdf
2213ijccms02.pdf
 
A gis anchored technique to obtain optimal path through minimal alteration of...
A gis anchored technique to obtain optimal path through minimal alteration of...A gis anchored technique to obtain optimal path through minimal alteration of...
A gis anchored technique to obtain optimal path through minimal alteration of...
 
A GIS ANCHORED TECHNIQUE TO OBTAIN OPTIMAL PATH THROUGH MINIMAL ALTERATION OF...
A GIS ANCHORED TECHNIQUE TO OBTAIN OPTIMAL PATH THROUGH MINIMAL ALTERATION OF...A GIS ANCHORED TECHNIQUE TO OBTAIN OPTIMAL PATH THROUGH MINIMAL ALTERATION OF...
A GIS ANCHORED TECHNIQUE TO OBTAIN OPTIMAL PATH THROUGH MINIMAL ALTERATION OF...
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
Shortest route finding using an object oriented database approach
Shortest route finding using an object oriented database approachShortest route finding using an object oriented database approach
Shortest route finding using an object oriented database approach
 

More from Dongpo Deng

20140710 tca gsdi
20140710 tca gsdi20140710 tca gsdi
20140710 tca gsdi
Dongpo Deng
 
開放資料: 全球化的草根性運動
開放資料:  全球化的草根性運動開放資料:  全球化的草根性運動
開放資料: 全球化的草根性運動
Dongpo Deng
 
Social Web Meets Sensor Web: Linked Crowdsourced Observation Data
Social Web Meets Sensor Web: Linked Crowdsourced Observation DataSocial Web Meets Sensor Web: Linked Crowdsourced Observation Data
Social Web Meets Sensor Web: Linked Crowdsourced Observation Data
Dongpo Deng
 

More from Dongpo Deng (20)

20180226 data driven smart governance
20180226 data driven smart governance20180226 data driven smart governance
20180226 data driven smart governance
 
The methods and practices of Linked Open Data
The methods and practices of Linked Open DataThe methods and practices of Linked Open Data
The methods and practices of Linked Open Data
 
Construction and reuse of linked traceable agricultural product records - An ...
Construction and reuse of linked traceable agricultural product records - An ...Construction and reuse of linked traceable agricultural product records - An ...
Construction and reuse of linked traceable agricultural product records - An ...
 
農產品產銷履歷資料鏈結化處理 (Linked Traceable Agricultural Data )
農產品產銷履歷資料鏈結化處理 (Linked Traceable Agricultural Data )農產品產銷履歷資料鏈結化處理 (Linked Traceable Agricultural Data )
農產品產銷履歷資料鏈結化處理 (Linked Traceable Agricultural Data )
 
開放街圖社群經營的不等式
開放街圖社群經營的不等式開放街圖社群經營的不等式
開放街圖社群經營的不等式
 
OSM 與 LocalWiki 的整合: 支援社區層級災害管理
OSM 與 LocalWiki 的整合: 支援社區層級災害管理OSM 與 LocalWiki 的整合: 支援社區層級災害管理
OSM 與 LocalWiki 的整合: 支援社區層級災害管理
 
啟動開放,創新價值
啟動開放,創新價值 啟動開放,創新價值
啟動開放,創新價值
 
2016年歐洲資料論壇
2016年歐洲資料論壇2016年歐洲資料論壇
2016年歐洲資料論壇
 
From Structured Data to Linked Open Governmental Data
From Structured Data to Linked Open Governmental DataFrom Structured Data to Linked Open Governmental Data
From Structured Data to Linked Open Governmental Data
 
開放街圖: 集合群眾之力的製圖 (OpenStreetMap: A crowdsoucing map )
開放街圖: 集合群眾之力的製圖 (OpenStreetMap: A crowdsoucing map )開放街圖: 集合群眾之力的製圖 (OpenStreetMap: A crowdsoucing map )
開放街圖: 集合群眾之力的製圖 (OpenStreetMap: A crowdsoucing map )
 
20150427_NCDR_OSM_Disaster_Mapping
20150427_NCDR_OSM_Disaster_Mapping20150427_NCDR_OSM_Disaster_Mapping
20150427_NCDR_OSM_Disaster_Mapping
 
Crowdsourced mapping for open collaboration: A story of Taiwan so far
Crowdsourced mapping for open collaboration: A story of Taiwan so farCrowdsourced mapping for open collaboration: A story of Taiwan so far
Crowdsourced mapping for open collaboration: A story of Taiwan so far
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
 
20141018_OD_meetup#3
20141018_OD_meetup#320141018_OD_meetup#3
20141018_OD_meetup#3
 
20141001 climate change&osm
20141001 climate change&osm20141001 climate change&osm
20141001 climate change&osm
 
20140721 open geomeeting
20140721 open geomeeting20140721 open geomeeting
20140721 open geomeeting
 
20140710 tca gsdi
20140710 tca gsdi20140710 tca gsdi
20140710 tca gsdi
 
開放資料: 全球化的草根性運動
開放資料:  全球化的草根性運動開放資料:  全球化的草根性運動
開放資料: 全球化的草根性運動
 
Social Web Meets Sensor Web: Linked Crowdsourced Observation Data
Social Web Meets Sensor Web: Linked Crowdsourced Observation DataSocial Web Meets Sensor Web: Linked Crowdsourced Observation Data
Social Web Meets Sensor Web: Linked Crowdsourced Observation Data
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

20131106 acm geocrowd

  • 1. The One and Many Maps: Participatory and Temporal Diversities in OpenStreetMap Tyng-Ruey Chuang1, Dong-Po Deng1,3, Chun–Chen Hsu1,2, Rob Lemmens3 ! 1Institute of Information Science, Academia Sinica, Taiwan 2Department of Computer Science and Information Engineering, National Taiwan University 3Faculty of Geo–Information Science and Earth Observation (ITC), University of Twente, Netherlands Second ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information (GEOCROWD) 2013 In conjunction with ACM SIGSPATIAL 2013
  • 2. Background • OSM is a wiki-style online mapping platform in which tens of thousands of people voluntarily contribute geospatial data into the making of a global map (Haklay & Weber, 2008). • Its peer production model demonstrates that more and more mapping activities are done by the citizens. • It represents the success of a collective form of geospatial content creation.
  • 3. Collaborative Geospatial Content Creation • OSM is a type of PPGIS, as well as the characteristics of data collaboration in OSM as the subjects of VGI research. • The current state of OSM actually is an assembly of many edits and updates over a period of time. • Every edit or update should be a meaningful unit in the understanding of data collaboration activities in OSM.
  • 4. Aims • We intend to look for ways to systematically and efficiently discover data collaboration patterns and diversities in OSM • It is an initial study of the OSM dataset (at least about the part of Taiwan) by developing a set of metrics to summarize • • • user participation, and spatiotemporal variations of updates in defined areas of OSM We hope to see the OSM not as one collective map but as many overlapping maps concurrently in the making with each in its own characteristics
  • 5. OSM Data Model Node Way Open polyline Closed polyline Area Relation Tag
  • 6. OSM Data An example of a Node <node id='1762782473' timestamp='2012-12-12T03:49:16Z' uid='1048' user='dongpo' visible='true' version='2' changeset='14245247' lat='23.864527' lon='121.5217101'> <tag k='name' v='⽴立川漁場' /> <tag k='tourism' v='attraction' /> <tag k='source' v='survey' /> <tag k='addr: housenumber' v='45' /> <tag k='addr:district' v='⿂魚池' /> <tag k='addr:town' v='壽豐鄉' /> <tag k='addr:county' v='花蓮縣' /> </node>
  • 7. OSM Data An example of a Way <way id='118416207' timestamp='2012-05-23T17:43:06Z' uid='1048' user='dongpo' visible='true' version='4' changeset='14246301'> <nd ref='1088092959' /> <nd ref='1088092953' /> .... <nd ref='1600948228' /> <tag k='highway' v='primary' /> <tag k='lanes' v='2' /> <tag k='oneway' v='yes' /> <tag k='ref' v='Hwy 11C' /> <tag k='ref:zh' v='台11丙線' /> </way>
  • 8. OSM Data An example of a Relation <relation id='2498406' timestamp='2012-10-14T19:01:55Z' uid='1048' user='dongpo' visible='true' version='1' changeset='13497007'> <member type='way' ref='185846446' role='outer' <member type='way' ref='185846444' role='outer' <member type='way' ref='151063000' role='outer' <member type='way' ref='185846448' role='outer' <member type='way' ref='185846445' role='outer' <tag k='admin_level' v='8' /> <tag k='boundary' v='administrative' /> <tag k='name' v='草屯鎮 (Caotun)' /> <tag k='name:en' v='Caotun' /> <tag k='name:zh' v='草屯鎮' /> <tag k='type' v='boundary' /> </relation> /> /> /> /> />
  • 9. The definitions for the metrics To measure participatory and temporal differences among cells in OSM, we define the following functions on Dc. The c in Dc means in a cell. When the context is clear, we omit the subscript c and simply write: where di, 0 < i < n -1 , is a node in the cell, and n is the total number of nodes.
  • 10. The definitions for the metrics For a node di, we write: where ki is the node id of di, ti the age, ui the user id of its contributor, and pi the position (i.e., the pair of its lat and lon values). Note that, by definition, geographically pi is within the boundary of c for all .
  • 11. Node and Mapper Density In general, we use areac to denote the area covered by a cell c, and we use popc for the people population in region c. When it is clear in context, we omit the subscript and simply write area and pop. The following measures the densities of nodes, as well as those of their contributors, i.e., the mappers.
  • 12. Node age and temporality Recall the following auxiliary functions for a sequence s where min s is the minimum of s, max s the maximum of s, s the average of s, and cv(s) the coefficient of variation for elements in s.
  • 13. Node age and temporality(cont.) The 4-tuple <min t, max t, t, cv(t)> measures the age characteristics of the nodes in a cell. That is,
  • 14. Node age and temporality(cont.) We define a sequence g = (g0, g1, g2,..., gn-2) to measure the gaps between any two consecutive elements in t. That is, which is the gap in days between the dates when the two nodes di+1 and di were added into the cell.
  • 15. Node age and temporality(cont.) The 4-tuple <min g, max g, g, cv(g)> measures the
  • 16. Graphing cells by two metrics • As multiple metrics are in use, a cell can be measured in two metrics and the two results compared. • Often we will compare the two sets of measurement over all cells to see if there are patterns.
  • 17. Number of mappers and number of nodes Figure 1: Distribution of the cells by both mapper count and node count. Figure 2: Mapping the cells in Taiwan by their types. (c.f. Figure 1)
  • 18. Locations and Cities in Taiwan Figure 3: Cities in Taiwan
  • 19. Distributions of Mappers Figure 4: Spatial distribution of mappers over area. Figure 5: Spatial distribution of mappers over population.
  • 20. Distributions of Nodes Figure 6: Spatial distribution of nodes over area. Figure 7: Spatial distribution of nodes over population
  • 21. Node Age Average and Variance Figure 8: Spatial distribution of average node age. Figure 9: Spatial distribution of the variance of node age.
  • 22. Number of Mappers and Update Interval Figure 10: Distribution of the cells by both mapper count and average time gap between two additions. Figure 11: Spatial distribution of the cells by their types. (c.f. Figure 10)
  • 23. The 80/20 Hypothesis Figure 12: Distribution of the cells by both mapper count and ratio of mappers needed for combined 80% node contribution. Figure 13: Spatial distribution of the cells by their types. (c.f. Figure 12)
  • 24. Related work • Previous investigations into the data quality issues of OSM have shown that the OSM dataset can be fairly accurate, and is mostly comparable to commercial/gov. datasets at least in urban areas [3, 8, 6, 9]. • Researchers had also developed visual analytics to gain insights into the spatial diversity of OSM datasets, e. g. to see whether users in different countries would exhibit distinct mapping activities and habits [13]. • These visualization tools can provide valuable information when improving the data quality of OSM. Neis and Zipf identified active mappers and casual mappers by examining quantitatively their contributions in OSM [12]. • Mooney and Corcoran examined directly the characteristics of “heavily edited” objects in OpenStreetMap of UK, and they considered these characteristics might be developed as data quality indicators for OpenStreetMap in the future [10].
  • 25. Conclusion and future work • This paper is a preliminary study in two sense. We only analyze • • the Taiwan part of OpenStreetMap, and the cells independently (though spatial distribution is visualized and discussed). • Because of the time constraint, we have not looked into other geographical areas in OSM. • Also, as a mapper may contribute to multiple cells, we ought to look into mapping activities across the cells. We intend to pursue these directions in the future. • The programs we use to analyze the data are in their early stage of development, and the way we prepare the data for analysis is rather ad hoc.
  • 26. Conclusion and future work • We are currently consider how better to structure the programs so that they can be easily ported and reused. • Metrics-based analysis tools like these can be very useful in improving the data quality in OpenStreetMap as it helps discover areas where there is participatory or temporal unevenness in the map making process itself.
  • 27. Thank for your attention! Question? dongpo@iis.sinica.edu.tw d.-.p.deng@utwente.nl To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this paper. For the avoidance of doubt, this work is released under the CC0 Public Domain Dedication (http://creativecommons.org/publicdomain/zero/1.0/). Anyone can copy, modify, distribute and perform this work, even for commercial purposes, all without asking permission.