SlideShare uma empresa Scribd logo
1 de 54
Baixar para ler offline
1
Geographies of
crowdsourced information
and their implications
@a_ballatore
VGI-Alive, AGILE
June 2018, Lund, Sweden
Andrea Ballatore*
Department of Geography
Birkbeck, University of London
Stefano De Sabbata
School of Geography, Geology and the Env.,
University of Leicester
aballatore.space
2
Outline
1. Where are we?
2. Core research questions
3. Paradigm limitations
4. Beyond the usual suspects
5. Case studies: Search behaviour
6. [Stefano will do the rest]
3
Where are we?
4
The success of
crowdsourcing
"85% of the top global brands have used
crowdsourcing in the last ten years. […] According
to Gartner, 75% of the world’s high performing
enterprises will be using crowdsourcing by 2018."
(Deloitte, 2016)
It is when new, successful technologies withdraw
into the "woodwork of everyday banality" that
their effects become real and profound.
(Vincent Mosco, 2004)
https://ideascale.com
5
The success of
crowdsourcing
"85% of the top global brands have used
crowdsourcing in the last ten years. […] According
to Gartner, 75% of the world’s high performing
enterprises will be using crowdsourcing by 2018."
(Deloitte, 2016)
It is when new, successful technologies withdraw
into the "woodwork of everyday banality" that
their effects become real and profound.
(Vincent Mosco, 2004)
https://ideascale.com
6
Crowdsourcing + geolocation:
A mature field
(See et al, 2016)
7
Volunteered or Crowdsourced GI?
8
Placing the crowds
• Crowdsourced
geographic information
(CGI)
• From experimental phase
to oligopoly (e.g.,
Google, Facebook)
• From cyberoptimism to
cyberpessimism
https://www.cagle.com
9
Placing the crowds
• Crowdsourced
geographic information
(CGI)
• From experimental phase
to oligopoly (e.g.,
Google, Facebook)
• From cyberoptimism to
cyberpessimism
(2016)
10
Researching CGI
as domain
Using CGI for
other domains
11
Core CGI research
1. Who are the contributors and why do they engage
in spatial information production, and what
incentives work or do not work? How do they
collaborate and organise? How do we include
marginalised communities?
(Budhathoki and Haythornthwaite 2013)
2. How can we calculate the quality and fitness for
purpose of crowdsourced data in a reliable,
preferably intrinsic way? (Goodchild and Li 2012)
3. What are the limitations of such models and what
are their spatial, epistemic and cultural biases?
(Dodge and Kitchin 2013)
12
Core CGI research
1. Who are the contributors and why do they engage
in spatial information production, and what
incentives work or do not work? How do they
collaborate and organise? How do we include
marginalised communities?
(Budhathoki and Haythornthwaite 2013)
2. How can we calculate the quality and fitness for
purpose of crowdsourced data in a reliable,
preferably intrinsic way? (Goodchild and Li 2012)
3. What are the limitations of such models and what
are their spatial, epistemic and cultural biases?
(Dodge and Kitchin 2013)
13
CGI for other domains
Natural sciences: biology,
climate science, Earth
sensing
Social sciences: urban planning,
transportation,
public health, economics,
human geography
Humanities: digital humanities,
history, cultural analytics
14
Limitations of
crowdsourcing
15
• "Ignorance of crowds"
(Carr, 2007)
• Conditions for wisdom
• Menial work, no real
innovation/creativity
• Undermining paid work
• Variable quality
Limitations of the paradigm
2004
16
• "Ignorance of crowds"
(Carr, 2007)
• Conditions for wisdom
• Menial work, no real
innovation/creativity
• Undermining paid work
• Variable quality
Limitations of the paradigm
2007
17
• Large volumes of information
do not imply usefulness or
fitness for purpose
• We need representative
samples, not large samples
(e.g., random sample of
1,000 > 1M non-random)
Limitations of the paradigm
18
Each CGI source is a particular viewpoint
and will return a different image of the
social and natural world.
suifaijohnmak.wordpress.com
19
Diversity/biases of CGI
• Thematic: e.g., tourism, outdoors,
typical/atypical behaviour, sharing bias
• Demographic: Western Educated Industrialised
Rich Democratic (WEIRD) (not always!)
• Social: 90%-9%-1%, hyperactive minorities of
contributors
• Geographic: urban/rural,
developed/developing, central/peripheral,
human/natural
20
Diversity/biases of CGI
• Thematic: e.g., tourism, outdoors,
typical/atypical behaviour, sharing bias
• Demographic: Western Educated Industrialised
Rich Democratic (WEIRD) (not always!)
• Social: 90%-9%-1%, hyperactive minorities of
contributors
• Geographic: urban/rural,
developed/developing, central/peripheral,
human/natural
21
• Without centralised planning and protocols,
data quality remains uneven (coverage!)
• Wikipedia replaced Britannica, but
OpenStreetMap is not replacing Google
Maps
• CGI cannot replace established data
collection protocols and sources, but can
provide useful ancillary data
CGI strictures
22
• Without centralised planning and protocols,
data quality remains uneven (coverage!)
• Wikipedia replaced Britannica, but
OpenStreetMap is not replacing Google
Maps
• CGI cannot replace established data
collection protocols and sources, but can
provide useful ancillary data
CGI strictures
23
Reinventing wheels
• Some CGI replicates work
that has been done better
by professionals
• More useful to focus on
"missing" data:
bookscrounger.com
24
Open data vs CGI
Authoritative datasets
are becoming
cheaper/free
25
Broadening our
horizons
26
Most studies
on OSM,
Wikipedia,
Twitter, Flickr.
There's
more out
there!
https://fineartamerica.com
The usual suspects
27
28
• Google Maps
• From 5M to 50M
contributors in
2017
• 700K new places
monthly
• Gamification
29
• Hundreds of millions of users, billions
of reviews
• Measurable effects on spatial and
economic behaviour
• Sentiment about points of interest,
cities, and neighbourhoods
30
• Micro-economic data (e.g. price of onions
in India, new shops in Ghana)
• For profit, contributors are paid
• Applications: International Development,
Government, Global Security, and Business
31
Case studies
32
33
Online visibility of CGI projects
• Search engines are the key entry point
to discover new information
• Feedback loop between Wikipedia and
Google Search to attract new
contributors
• Making CGI findable and consumable
for search engines and social media
• Study on CGI on Google Search (2018)
34
Online visibility of CGI projects
• Search engines are the key entry point
to discover new information
• Feedback loop between Wikipedia and
Google Search to attract new
contributors
• Making CGI findable and consumable
for search engines and social media
• Study on CGI on Google Search (2018)
35
Interest in CGI projects
(Ballatore & Jokar Arsanjani, 2018)
(Ballatore & Jokar Arsanjani, 2018)
Search Interest in
Wikimapia/OSM combined
37
Search Interest in
Wikimapia vs OSM
(Ballatore & Jokar Arsanjani, 2018)
Andrea Ballatore
Department of Geography,
Birkbeck, University of London
Stefano De Sabbata*
School of Geography, Geology and the Env.,
University of Leicester
S c h o o l o f
G e o g r a p h y ,
G e o l o g y a n d t h e
E n v i r o n m e n t
Geographies of
crowdsourced information
and their implications
AGILE 2018 – Lund, Sweden
Case studies
• Operationalising quality in health studies
– with J Bright, S Lee, B Ganesh, DK Humphreys
– OpenStreetMap
– impact of availability on alcohol-related harm
• Analysis of global gazetteers
– with E Acheson and RS Purves
– GeoNames and Getty TGN
– patterns of coverage
• Urban Information Geographies
– with A Ballatore
– with N Tate
– quality and representativeness of VGI
Operationalising
VGI in health
studies
• with J Bright, S Lee, B
Ganesh, DK Humphreys
• OpenStreetMap
• impact of availability on
alcohol-related harm
• https://www.sciencedirect.com/scie
nce/article/pii/S1353829217305804
Quality indicator
Values Transformation
Num. of map features per square kilometre Log transformed, scaled
Average num. of edits per map feature Log transformed, scaled
Num. of users who made at least one edit Log transformed, scaled
Num. of features edited by more than one user Log transformed, scaled
Num. of days with at least one edit Log transformed, scaled
Days since the last edit in the area Inversed, scaled
Proportion of buildings with full address -1 if 0%, 0 if no buildings
• Intrinsic quality indicator based on Barron et al. (2014)
• Calculated as a linear combination of:
Replicating CRESH study
The overall conclusions are essentially identical
• significant positive correlation between density of
alcohol outlets and incidence of alcohol related harms
• an effect size which is comparable (though smaller)
The correlation
between OSM-based
estimates and CRESH
is stronger where the
quality index is higher
Analysis of global
gazetteers
• with E Acheson and RS
Purves
• GeoNames and
Getty TGN
• patterns of coverage
• https://www.sciencedirect.com/scie
nce/article/pii/S0198971516302496
Comparing GeoNames and TGN
Two distinct levels of coverages
• in “high coverage” countries, TGN same order of
magnitude as counts in GeoNames
• in “low coverage” countries, GeoNames has over 60 times
as much content as TGN
GeoNames and TGN in Great Britain
Gazetteer Features Pop place
GeoNames 54,701 16,475
Getty TGN 24,003 16,816
Ordnance Survey 50k 248,626
Ordnance Survey OpenNames 41,490
Feature
types
Urban
Information
Geographies
• with A Ballatore
• with N Tate
• quality and
representativeness of
VGI
Findings
• Overall confirmed (some) hypothesis
– representation similar biases as participation
– Higher qualifications strongest factor
– Wealth (house prices) strong factor in both, more so for Wikipedia
– Twitter strongly influenced by perc. of ppl. aged 30-44 (positively) and
households with de-pendent children (negatively)
• However
– models account only for about 44–55% of variability
– need for more explanatory factors, e.g., tourism-related activities
• …or can these difference be used as indicators?
– ethnic composition is not a factor in the UK
• Twitter and Wikipedia similar but distinct geographies
– only representative of themselves
Comparing Twitter and Wikipedia
Number of tweets
(951 obs.)
~ Number of articles
Adj. R2 = 0.490
De Sabbata
and Tate
(forthcoming)
A geodemographic approach
Conclusions
• More interaction between research on CGI
core and domain applications
– beyond technology, towards social domains
– beyond the usual suspects (OSM, etc)
• Access to data: sampling and scraping
• Systematic work on limitations is needed
– Urban Information Geographies
• Impact of GDPR on access and use of data
Thank you for your attention.
Any questions?
Stefano De Sabbata
School of Geography, Geology and the Env.,
University of Leicester
Andrea Ballatore
Department of Geography,
Birkbeck, University of London
a.ballatore@bbk.ac.uk
aballatore.space
@a_ballatore
Thanks!

Mais conteúdo relacionado

Mais procurados

GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2crooksAndrew
 
2016 Annual Report - UN Global Pulse
2016 Annual Report - UN Global Pulse 2016 Annual Report - UN Global Pulse
2016 Annual Report - UN Global Pulse UN Global Pulse
 
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...crooksAndrew
 
Not the Geography You Remember
Not the Geography You RememberNot the Geography You Remember
Not the Geography You RememberBill Bass
 
Data Innovation: Generating Climate Solutions Event
Data Innovation: Generating Climate Solutions EventData Innovation: Generating Climate Solutions Event
Data Innovation: Generating Climate Solutions EventUN Global Pulse
 
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...ESD UNU-IAS
 
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISS
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISSGSDI Liaison report on Earth Observation-related activities for the CEOS WGISS
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISSRemetey-Fülöpp Gábor
 

Mais procurados (8)

GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2
 
2016 Annual Report - UN Global Pulse
2016 Annual Report - UN Global Pulse 2016 Annual Report - UN Global Pulse
2016 Annual Report - UN Global Pulse
 
David Cowen UW-Madison Geospatial Summit 2015
David Cowen UW-Madison Geospatial Summit 2015David Cowen UW-Madison Geospatial Summit 2015
David Cowen UW-Madison Geospatial Summit 2015
 
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...
 
Not the Geography You Remember
Not the Geography You RememberNot the Geography You Remember
Not the Geography You Remember
 
Data Innovation: Generating Climate Solutions Event
Data Innovation: Generating Climate Solutions EventData Innovation: Generating Climate Solutions Event
Data Innovation: Generating Climate Solutions Event
 
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...
 
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISS
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISSGSDI Liaison report on Earth Observation-related activities for the CEOS WGISS
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISS
 

Semelhante a Crowdsourced Geodata Implications

From crowdsourced geographic information to participatory citizen science - e...
From crowdsourced geographic information to participatory citizen science - e...From crowdsourced geographic information to participatory citizen science - e...
From crowdsourced geographic information to participatory citizen science - e...Muki Haklay
 
Current Disruptions in Media: Earthquakes or New Openings? Stanford as Catalyst
Current Disruptions in Media: Earthquakes or New Openings? Stanford as CatalystCurrent Disruptions in Media: Earthquakes or New Openings? Stanford as Catalyst
Current Disruptions in Media: Earthquakes or New Openings? Stanford as CatalystMartha Russell
 
Digitization and Landscape-Research Nexus
Digitization and Landscape-Research NexusDigitization and Landscape-Research Nexus
Digitization and Landscape-Research NexusPrivate
 
The Global Land Programme
The Global Land ProgrammeThe Global Land Programme
The Global Land ProgrammeCIFOR-ICRAF
 
Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Imran Ghaznavi
 
Open Data and Open Software Geospatial Applications
Open Data and Open Software Geospatial ApplicationsOpen Data and Open Software Geospatial Applications
Open Data and Open Software Geospatial ApplicationsJody Garnett
 
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...Muki Haklay
 
Extreme Citizen Science: the socio-political potential of citizen science
Extreme Citizen Science: the socio-political potential of citizen scienceExtreme Citizen Science: the socio-political potential of citizen science
Extreme Citizen Science: the socio-political potential of citizen scienceMuki Haklay
 
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017SoBigData - Exploring human mobility and migration with BigData @ NTTS2017
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017Vittorio Romano
 
Modeling of future transport systems from different point of views: data, tec...
Modeling of future transport systems from different point of views: data, tec...Modeling of future transport systems from different point of views: data, tec...
Modeling of future transport systems from different point of views: data, tec...Sonia Yeh
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
 
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...michellep
 
Citizen Science in Open Science context: measuring & understanding impacts of...
Citizen Science in Open Science context: measuring & understanding impacts of...Citizen Science in Open Science context: measuring & understanding impacts of...
Citizen Science in Open Science context: measuring & understanding impacts of...Muki Haklay
 
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...From Telling Stories with Data to Telling Stories with Data Infrastructures: ...
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...Liliana Bounegru
 
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)Carl Sack
 
AGIForesight _2020_Full Article
AGIForesight _2020_Full ArticleAGIForesight _2020_Full Article
AGIForesight _2020_Full ArticleMartin Penney
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.Vyron
 

Semelhante a Crowdsourced Geodata Implications (20)

Understanding the Volunteer in VGI
Understanding the Volunteer in VGIUnderstanding the Volunteer in VGI
Understanding the Volunteer in VGI
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
From crowdsourced geographic information to participatory citizen science - e...
From crowdsourced geographic information to participatory citizen science - e...From crowdsourced geographic information to participatory citizen science - e...
From crowdsourced geographic information to participatory citizen science - e...
 
Current Disruptions in Media: Earthquakes or New Openings? Stanford as Catalyst
Current Disruptions in Media: Earthquakes or New Openings? Stanford as CatalystCurrent Disruptions in Media: Earthquakes or New Openings? Stanford as Catalyst
Current Disruptions in Media: Earthquakes or New Openings? Stanford as Catalyst
 
Digitization and Landscape-Research Nexus
Digitization and Landscape-Research NexusDigitization and Landscape-Research Nexus
Digitization and Landscape-Research Nexus
 
The Global Land Programme
The Global Land ProgrammeThe Global Land Programme
The Global Land Programme
 
Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media
 
Open Data and Open Software Geospatial Applications
Open Data and Open Software Geospatial ApplicationsOpen Data and Open Software Geospatial Applications
Open Data and Open Software Geospatial Applications
 
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
 
Extreme Citizen Science: the socio-political potential of citizen science
Extreme Citizen Science: the socio-political potential of citizen scienceExtreme Citizen Science: the socio-political potential of citizen science
Extreme Citizen Science: the socio-political potential of citizen science
 
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017SoBigData - Exploring human mobility and migration with BigData @ NTTS2017
SoBigData - Exploring human mobility and migration with BigData @ NTTS2017
 
Modeling of future transport systems from different point of views: data, tec...
Modeling of future transport systems from different point of views: data, tec...Modeling of future transport systems from different point of views: data, tec...
Modeling of future transport systems from different point of views: data, tec...
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
 
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
 
Citizen Science in Open Science context: measuring & understanding impacts of...
Citizen Science in Open Science context: measuring & understanding impacts of...Citizen Science in Open Science context: measuring & understanding impacts of...
Citizen Science in Open Science context: measuring & understanding impacts of...
 
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...From Telling Stories with Data to Telling Stories with Data Infrastructures: ...
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...
 
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
 
AGIForesight _2020_Full Article
AGIForesight _2020_Full ArticleAGIForesight _2020_Full Article
AGIForesight _2020_Full Article
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.
 

Último

Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 

Último (20)

Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

Crowdsourced Geodata Implications

  • 1. 1 Geographies of crowdsourced information and their implications @a_ballatore VGI-Alive, AGILE June 2018, Lund, Sweden Andrea Ballatore* Department of Geography Birkbeck, University of London Stefano De Sabbata School of Geography, Geology and the Env., University of Leicester aballatore.space
  • 2. 2 Outline 1. Where are we? 2. Core research questions 3. Paradigm limitations 4. Beyond the usual suspects 5. Case studies: Search behaviour 6. [Stefano will do the rest]
  • 4. 4 The success of crowdsourcing "85% of the top global brands have used crowdsourcing in the last ten years. […] According to Gartner, 75% of the world’s high performing enterprises will be using crowdsourcing by 2018." (Deloitte, 2016) It is when new, successful technologies withdraw into the "woodwork of everyday banality" that their effects become real and profound. (Vincent Mosco, 2004) https://ideascale.com
  • 5. 5 The success of crowdsourcing "85% of the top global brands have used crowdsourcing in the last ten years. […] According to Gartner, 75% of the world’s high performing enterprises will be using crowdsourcing by 2018." (Deloitte, 2016) It is when new, successful technologies withdraw into the "woodwork of everyday banality" that their effects become real and profound. (Vincent Mosco, 2004) https://ideascale.com
  • 6. 6 Crowdsourcing + geolocation: A mature field (See et al, 2016)
  • 8. 8 Placing the crowds • Crowdsourced geographic information (CGI) • From experimental phase to oligopoly (e.g., Google, Facebook) • From cyberoptimism to cyberpessimism https://www.cagle.com
  • 9. 9 Placing the crowds • Crowdsourced geographic information (CGI) • From experimental phase to oligopoly (e.g., Google, Facebook) • From cyberoptimism to cyberpessimism (2016)
  • 10. 10 Researching CGI as domain Using CGI for other domains
  • 11. 11 Core CGI research 1. Who are the contributors and why do they engage in spatial information production, and what incentives work or do not work? How do they collaborate and organise? How do we include marginalised communities? (Budhathoki and Haythornthwaite 2013) 2. How can we calculate the quality and fitness for purpose of crowdsourced data in a reliable, preferably intrinsic way? (Goodchild and Li 2012) 3. What are the limitations of such models and what are their spatial, epistemic and cultural biases? (Dodge and Kitchin 2013)
  • 12. 12 Core CGI research 1. Who are the contributors and why do they engage in spatial information production, and what incentives work or do not work? How do they collaborate and organise? How do we include marginalised communities? (Budhathoki and Haythornthwaite 2013) 2. How can we calculate the quality and fitness for purpose of crowdsourced data in a reliable, preferably intrinsic way? (Goodchild and Li 2012) 3. What are the limitations of such models and what are their spatial, epistemic and cultural biases? (Dodge and Kitchin 2013)
  • 13. 13 CGI for other domains Natural sciences: biology, climate science, Earth sensing Social sciences: urban planning, transportation, public health, economics, human geography Humanities: digital humanities, history, cultural analytics
  • 15. 15 • "Ignorance of crowds" (Carr, 2007) • Conditions for wisdom • Menial work, no real innovation/creativity • Undermining paid work • Variable quality Limitations of the paradigm 2004
  • 16. 16 • "Ignorance of crowds" (Carr, 2007) • Conditions for wisdom • Menial work, no real innovation/creativity • Undermining paid work • Variable quality Limitations of the paradigm 2007
  • 17. 17 • Large volumes of information do not imply usefulness or fitness for purpose • We need representative samples, not large samples (e.g., random sample of 1,000 > 1M non-random) Limitations of the paradigm
  • 18. 18 Each CGI source is a particular viewpoint and will return a different image of the social and natural world. suifaijohnmak.wordpress.com
  • 19. 19 Diversity/biases of CGI • Thematic: e.g., tourism, outdoors, typical/atypical behaviour, sharing bias • Demographic: Western Educated Industrialised Rich Democratic (WEIRD) (not always!) • Social: 90%-9%-1%, hyperactive minorities of contributors • Geographic: urban/rural, developed/developing, central/peripheral, human/natural
  • 20. 20 Diversity/biases of CGI • Thematic: e.g., tourism, outdoors, typical/atypical behaviour, sharing bias • Demographic: Western Educated Industrialised Rich Democratic (WEIRD) (not always!) • Social: 90%-9%-1%, hyperactive minorities of contributors • Geographic: urban/rural, developed/developing, central/peripheral, human/natural
  • 21. 21 • Without centralised planning and protocols, data quality remains uneven (coverage!) • Wikipedia replaced Britannica, but OpenStreetMap is not replacing Google Maps • CGI cannot replace established data collection protocols and sources, but can provide useful ancillary data CGI strictures
  • 22. 22 • Without centralised planning and protocols, data quality remains uneven (coverage!) • Wikipedia replaced Britannica, but OpenStreetMap is not replacing Google Maps • CGI cannot replace established data collection protocols and sources, but can provide useful ancillary data CGI strictures
  • 23. 23 Reinventing wheels • Some CGI replicates work that has been done better by professionals • More useful to focus on "missing" data: bookscrounger.com
  • 24. 24 Open data vs CGI Authoritative datasets are becoming cheaper/free
  • 26. 26 Most studies on OSM, Wikipedia, Twitter, Flickr. There's more out there! https://fineartamerica.com The usual suspects
  • 27. 27
  • 28. 28 • Google Maps • From 5M to 50M contributors in 2017 • 700K new places monthly • Gamification
  • 29. 29 • Hundreds of millions of users, billions of reviews • Measurable effects on spatial and economic behaviour • Sentiment about points of interest, cities, and neighbourhoods
  • 30. 30 • Micro-economic data (e.g. price of onions in India, new shops in Ghana) • For profit, contributors are paid • Applications: International Development, Government, Global Security, and Business
  • 32. 32
  • 33. 33 Online visibility of CGI projects • Search engines are the key entry point to discover new information • Feedback loop between Wikipedia and Google Search to attract new contributors • Making CGI findable and consumable for search engines and social media • Study on CGI on Google Search (2018)
  • 34. 34 Online visibility of CGI projects • Search engines are the key entry point to discover new information • Feedback loop between Wikipedia and Google Search to attract new contributors • Making CGI findable and consumable for search engines and social media • Study on CGI on Google Search (2018)
  • 35. 35 Interest in CGI projects (Ballatore & Jokar Arsanjani, 2018)
  • 36. (Ballatore & Jokar Arsanjani, 2018) Search Interest in Wikimapia/OSM combined
  • 37. 37 Search Interest in Wikimapia vs OSM (Ballatore & Jokar Arsanjani, 2018)
  • 38. Andrea Ballatore Department of Geography, Birkbeck, University of London Stefano De Sabbata* School of Geography, Geology and the Env., University of Leicester S c h o o l o f G e o g r a p h y , G e o l o g y a n d t h e E n v i r o n m e n t Geographies of crowdsourced information and their implications AGILE 2018 – Lund, Sweden
  • 39. Case studies • Operationalising quality in health studies – with J Bright, S Lee, B Ganesh, DK Humphreys – OpenStreetMap – impact of availability on alcohol-related harm • Analysis of global gazetteers – with E Acheson and RS Purves – GeoNames and Getty TGN – patterns of coverage • Urban Information Geographies – with A Ballatore – with N Tate – quality and representativeness of VGI
  • 40. Operationalising VGI in health studies • with J Bright, S Lee, B Ganesh, DK Humphreys • OpenStreetMap • impact of availability on alcohol-related harm • https://www.sciencedirect.com/scie nce/article/pii/S1353829217305804
  • 41. Quality indicator Values Transformation Num. of map features per square kilometre Log transformed, scaled Average num. of edits per map feature Log transformed, scaled Num. of users who made at least one edit Log transformed, scaled Num. of features edited by more than one user Log transformed, scaled Num. of days with at least one edit Log transformed, scaled Days since the last edit in the area Inversed, scaled Proportion of buildings with full address -1 if 0%, 0 if no buildings • Intrinsic quality indicator based on Barron et al. (2014) • Calculated as a linear combination of:
  • 42. Replicating CRESH study The overall conclusions are essentially identical • significant positive correlation between density of alcohol outlets and incidence of alcohol related harms • an effect size which is comparable (though smaller) The correlation between OSM-based estimates and CRESH is stronger where the quality index is higher
  • 43. Analysis of global gazetteers • with E Acheson and RS Purves • GeoNames and Getty TGN • patterns of coverage • https://www.sciencedirect.com/scie nce/article/pii/S0198971516302496
  • 44. Comparing GeoNames and TGN Two distinct levels of coverages • in “high coverage” countries, TGN same order of magnitude as counts in GeoNames • in “low coverage” countries, GeoNames has over 60 times as much content as TGN
  • 45. GeoNames and TGN in Great Britain Gazetteer Features Pop place GeoNames 54,701 16,475 Getty TGN 24,003 16,816 Ordnance Survey 50k 248,626 Ordnance Survey OpenNames 41,490
  • 47. Urban Information Geographies • with A Ballatore • with N Tate • quality and representativeness of VGI
  • 48.
  • 49. Findings • Overall confirmed (some) hypothesis – representation similar biases as participation – Higher qualifications strongest factor – Wealth (house prices) strong factor in both, more so for Wikipedia – Twitter strongly influenced by perc. of ppl. aged 30-44 (positively) and households with de-pendent children (negatively) • However – models account only for about 44–55% of variability – need for more explanatory factors, e.g., tourism-related activities • …or can these difference be used as indicators? – ethnic composition is not a factor in the UK • Twitter and Wikipedia similar but distinct geographies – only representative of themselves
  • 50. Comparing Twitter and Wikipedia Number of tweets (951 obs.) ~ Number of articles Adj. R2 = 0.490
  • 51. De Sabbata and Tate (forthcoming) A geodemographic approach
  • 52. Conclusions • More interaction between research on CGI core and domain applications – beyond technology, towards social domains – beyond the usual suspects (OSM, etc) • Access to data: sampling and scraping • Systematic work on limitations is needed – Urban Information Geographies • Impact of GDPR on access and use of data
  • 53. Thank you for your attention. Any questions? Stefano De Sabbata School of Geography, Geology and the Env., University of Leicester Andrea Ballatore Department of Geography, Birkbeck, University of London