Consistency across geosocial media platforms

•

1 gostou•335 visualizações

Talk given at Location Based Services conference 2019 in Vienna. Paper available at https://lbsconference.org/wp-content/uploads/2019/11/5_1.pdf Abstract: The increasing use of geosocial media in research to draw quantitative and qualitative conclusions about urban environments bears questions about the consistency of the data across the different platforms. This paper therefore presents a comparative analysis of data from six different geosocial media platforms (Facebook, Twitter, Google, Foursquare, Flickr, and Instagram) for Washington, D.C., using population and zoning data for reference. We find that there is little consistency between the different platforms at small spatial units and even semantically rich datasets have severe limitations when predicting functional zones in a city. The results show that researchers need to carefully evaluate which platform they can use for a particular study, and that more work is needed to better understand the differences between the different platforms.

Ciências

Consistency Across
Geosocial Media Platforms
Carsten Keßler1
& Grant D. McKenzie2
1
Department of Planning, Aalborg University, Copenhagen, Denmark
2
Department of Geography, McGill University, Montreal, Canada
1

Motivation
Geosocial media data are used to study urban structure, functional regions, to
generate gazetteers, for population mapping, to study population mobility, and
to detect events [...]
2

Study area
All data within the boundary of
Washington, D.C.
!
!

Map by Peter Fitzgerald, CC BY 3.0
4

Pairwise
Correlation of
Densities
Per census tract
(179 in DC)
7

Pairwise
Correlation of
Densities
Per census block
(6507 in DC)
8

Hmmm… that did not go so well.
But what about the
semantic information
in our geosocial media data?
9

Washington, D.C.
Zoning
729 zones, each belonging to one of
149 classes, grouped into 6 zoning groups
10

Can we predict the zone
group from the present
POIs?
- Experiment with 24,428 Foursquare
POISs
- 10 top-level and 449 second-level
categories*
- Only 126 zones actually have POIs in
them
- Mean: 31 POIs per zone
*
https://developer.foursquare.com/docs/resources/categories
12

Random forest classiﬁer
Trained on frequencies of foursquare POI types to predict zone group
13

Random forest classiﬁer
Trained on frequencies of foursquare POI types to predict zone group
Results
Out-of-bag estimate of error rate of 38.1% (ﬁrst-level)
and 36.5% (second-level).
14

Conclusions
— Consistency between platforms is limited
— We should be skeptical about insights derived from
geosocial media if they are only based on a single platform
— Even rich semantic annotations are of limited used when
studying city structure
— But: Also hard to say what is really going on in reality
17

Future work needs to...
— investigate which data source can be used for which kinds of
inferences
— study the di!erences between data sources (user groups, data
collection mechanisms, business models etc.)
— study robustness when deriving new insights (without ground
truth data) from geosocial media
Carsten Keßler – @carstenkessler
Department of Planning, Aalborg University, Copenhagen, Denmark
Grant D. McKenzie – @grantdmckenzie
Department of Geography, McGill University, Montreal, Canada
18

Mais conteúdo relacionado

Último

G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2

Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25

GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji

9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari

Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY

Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar

Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136

Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823

Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal

Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India

zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069

Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314

STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P

Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji

Engler and Prantl system of classification in plant taxonomyNistarini College, Purulia (W.B) India

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani

Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk

Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl

Destaque

2024 State of Marketing Report – by HubspotMarius Sescu

Everything You Need To Know About ChatGPTExpeed Software

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

Skeleton Culture CodeSkeleton Technologies

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

Content Methodology: A Best Practices Report (Webinar)contently

How to Prepare For a Successful Job Search for 2024Albert Qian

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

5 Public speaking tips from TED - Visualized summarySpeakerHub

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

Getting into the tech field. what next Tessa Mero

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

Introduction to Data ScienceChristy Abraham Joy

Time Management & Productivity - Best PracticesVit Horky

The six step guide to practical project managementMindGenius

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36

Destaque (20)

2024 State of Marketing Report – by Hubspot

Everything You Need To Know About ChatGPT

Product Design Trends in 2024 | Teenage Engineerings

How Race, Age and Gender Shape Attitudes Towards Mental Health

AI Trends in Creative Operations 2024 by Artwork Flow.pdf

Skeleton Culture Code

PEPSICO Presentation to CAGNY Conference Feb 2024

Content Methodology: A Best Practices Report (Webinar)

How to Prepare For a Successful Job Search for 2024

Social Media Marketing Trends 2024 // The Global Indie Insights

Trends In Paid Search: Navigating The Digital Landscape In 2024

5 Public speaking tips from TED - Visualized summary

ChatGPT and the Future of Work - Clark Boyd

Getting into the tech field. what next

Google's Just Not That Into You: Understanding Core Updates & Search Intent

How to have difficult conversations

Introduction to Data Science

Time Management & Productivity - Best Practices

The six step guide to practical project management

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...

Consistency across geosocial media platforms

1. Consistency Across Geosocial Media Platforms Carsten Keßler1 & Grant D. McKenzie2 1 Department of Planning, Aalborg University, Copenhagen, Denmark 2 Department of Geography, McGill University, Montreal, Canada 1

2. Motivation Geosocial media data are used to study urban structure, functional regions, to generate gazetteers, for population mapping, to study population mobility, and to detect events [...] 2

3. Motivation Geosocial media data are used to study urban structure, functional regions, to generate gazetteers, for population mapping, to study population mobility, and to detect events [...] But do datasets from di!erent platforms actually tell a similar story? 3

4. Study area All data within the boundary of Washington, D.C. ! ! Map by Peter Fitzgerald, CC BY 3.0 4

5. Datasets 5

6. Visual comparison 6

7. Pairwise Correlation of Densities Per census tract (179 in DC) 7

8. Pairwise Correlation of Densities Per census block (6507 in DC) 8

9. Hmmm… that did not go so well. But what about the semantic information in our geosocial media data? 9

10. Washington, D.C. Zoning 729 zones, each belonging to one of 149 classes, grouped into 6 zoning groups 10

11. 11

12. Can we predict the zone group from the present POIs? - Experiment with 24,428 Foursquare POISs - 10 top-level and 449 second-level categories* - Only 126 zones actually have POIs in them - Mean: 31 POIs per zone * https://developer.foursquare.com/docs/resources/categories 12

13. Random forest classiﬁer Trained on frequencies of foursquare POI types to predict zone group 13

14. Random forest classiﬁer Trained on frequencies of foursquare POI types to predict zone group Results Out-of-bag estimate of error rate of 38.1% (ﬁrst-level) and 36.5% (second-level). 14

15. Confusion matrix Level 1 15

16. Confusion matrix Level 2 16

17. Conclusions — Consistency between platforms is limited — We should be skeptical about insights derived from geosocial media if they are only based on a single platform — Even rich semantic annotations are of limited used when studying city structure — But: Also hard to say what is really going on in reality 17

18. Future work needs to... — investigate which data source can be used for which kinds of inferences — study the di!erences between data sources (user groups, data collection mechanisms, business models etc.) — study robustness when deriving new insights (without ground truth data) from geosocial media Carsten Keßler – @carstenkessler Department of Planning, Aalborg University, Copenhagen, Denmark Grant D. McKenzie – @grantdmckenzie Department of Geography, McGill University, Montreal, Canada 18

Consistency across geosocial media platforms

Recomendados

Recomendados

Mais conteúdo relacionado

Último

Último (20)

Destaque

Destaque (20)

Consistency across geosocial media platforms