SlideShare uma empresa Scribd logo
1 de 17
Machine Learning and
Big Data at Foursquare
Blake Shaw, PhD
Data Scientist @ Foursquare
@metablake
What is foursquare?

      An app that helps you
      explore your city and
      connect with friends


      A platform for location
      based services and
What is foursquare?

      People use foursquare to:
       •   check in to places
       •   discover new places
       •   share w/ friends
       •   get tips about places
       •   get deals
       •   earn points and badges
What is foursquare?


Mobile               Social




          Local
Stats

10,000,000+ people
25,000,000+ places
1,000,000,000+ check-ins
10,000+ actions/second
Growth
Growth
Growth
Learning with location data

• Check-ins are a rich source of data that
 describe human behavior


• We apply machine learning algorithms to
 the collective movement patterns of millions
 of people to build exciting new services
Recommendation engine

• foursquare explore
 provides realtime
 recommendations using:
  •   location
  •   time of day
  •   check-in history
  •   friends preferences
Signals about places
Networks of people
Networks of people
Brooklyn




            Manha-an
                                SF



                Australia
Open questions

• How to measure similarity between people
 and places?
• How to determine influence in large
 networks of people and places?
• What statistics can we use to describe
 people’s behavior in the real-world?
• How do we predict what information will be
Our data stack

• MongoDB
• Amazon S3, Elastic Mapreduce
• Hadoop
• Hive
• Flume
• R and Matlab
Join us!
foursquare is hiring!
85+ people and growing

foursquare.com/jobs

Blake Shaw
@metablake
blake@foursquare.com

Mais conteúdo relacionado

Semelhante a Nyas ml-4sq-2

Foursquare
FoursquareFoursquare
Foursquare
nycgov
 
6. FOMS _Data Mining_ Analysis_ Eric Robson
6. FOMS _Data Mining_ Analysis_ Eric Robson6. FOMS _Data Mining_ Analysis_ Eric Robson
6. FOMS _Data Mining_ Analysis_ Eric Robson
FOMS011
 

Semelhante a Nyas ml-4sq-2 (20)

Introduction to Information Architecture & Design - 10/03/15
Introduction to Information Architecture & Design - 10/03/15Introduction to Information Architecture & Design - 10/03/15
Introduction to Information Architecture & Design - 10/03/15
 
Phd Colloquium Spatial Analysis
Phd Colloquium Spatial AnalysisPhd Colloquium Spatial Analysis
Phd Colloquium Spatial Analysis
 
Foursquare
FoursquareFoursquare
Foursquare
 
Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15
 
Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15
 
Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)
 
Geolocation and Business
Geolocation and BusinessGeolocation and Business
Geolocation and Business
 
Digital Marketing & Discoverability for the Performing Arts
Digital Marketing & Discoverability for the Performing ArtsDigital Marketing & Discoverability for the Performing Arts
Digital Marketing & Discoverability for the Performing Arts
 
Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16
 
Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16
 
Introduction to Information Architecture & Design - 6/20/15
Introduction to Information Architecture & Design - 6/20/15Introduction to Information Architecture & Design - 6/20/15
Introduction to Information Architecture & Design - 6/20/15
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked Data
 
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
 
Smart data and branding
Smart data and brandingSmart data and branding
Smart data and branding
 
Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14
 
Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16
 
Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17
 
Foursquare
FoursquareFoursquare
Foursquare
 
Introduction to Artificial Intelligence on AWS
Introduction to Artificial Intelligence on AWSIntroduction to Artificial Intelligence on AWS
Introduction to Artificial Intelligence on AWS
 
6. FOMS _Data Mining_ Analysis_ Eric Robson
6. FOMS _Data Mining_ Analysis_ Eric Robson6. FOMS _Data Mining_ Analysis_ Eric Robson
6. FOMS _Data Mining_ Analysis_ Eric Robson
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Nyas ml-4sq-2

Notas do Editor

  1. \n
  2. At foursquare, we think there is a great opportunity to leverage massive amounts of location data to help people better understand and connect to places\n
  3. So, what is foursquare? It’s an app that help you explore your city and connect with friends.\n\nIt’s also a platform for people to build location based services and collect and share location data\n
  4. People on foursquare “check-in” on their phones when they get to a place, to find out more about it, share that they are there with friends etc.\n
  5. Foursquare is in a unique place, sitting at the intersection between mobile, social, and geo.\n\n\n
  6. Foursquare is generating a ton of data, every second 35 people check-in to a location. \n\nThis data offers an unprecedented view into the behavior of millions of people worldwide, as they move around cities.\n\n\n
  7. Here we see the growth of the service over the last two years since, it started in mid 2009\n
  8. \n
  9. Foursquare now has data on over 25 million places all over the world\n
  10. Check-ins are a rich source of information describing human behavior.\n\nWe apply machine learning algorithms to the collective movement patterns of millions of people to build exciting new services.\n\nWe use a variety of ML algorithms, collaborative filtering, pagerank, clustering, classification and regression\n\n\n
  11. For example, Last year we launched foursquare explore. A recommendation engine that uses a variety of signals to recommend places in real time that a user might be interested in.\n\nExplore uses a variety of machine learning models to rank venues, we combine many signals, including:\n\nthe location of the user, and the time of day\nthe persons past check-in history,\nthe places their friends check-in\nthe similarities between different venues\n\n
  12. Consider these signals about places. Each place has a different signature based on who is coming to the place, when, and for how long.\n\nThis plot shows 3 different places:\n\nGorilla Coffee, Gray’s Papaya, Amorino (a restaurant)\n\nSee how gorilla coffee is busy more in the morning, where amorino is busy in the evening.\n\nGray’s papaya clearly has a strong lunch crowd, but also a late night peak on the weekends.\n\nHow can we use machine learning to learn from these signals which places are similar?\n\n\n
  13. We also have unique signals that describe people,\n\nWhich people are friends. Who is checking in together. Etc\n\nFrom checkins we can build a large colocation network that can be used to better understand how people interact with each other in the real world.\n\nHere we see an example of graph embedding to the foursquare employee network. People are placed near each other in 2D if they often colocate at similar places.\n\n\n\n
  14. Different parts of this map line up to the different places in the world where foursquare employees live.\n\nThis plot was made by applying minimum volume embedding, a non-linear graph based dimensionality reduction algorithm, to the foursquare employee network.\n\nEach person on this map can be described by thousands of numbers, showing how often the visit different places. The goals is to reduce the dimensionality of this space to 2D while preserving the strong pairwise relationships.\n\n\n\n
  15. We are constantly considering the best ways to address many of these questions\n\n
  16. All of this is possible because of our world-class data stack. Amazon S3 and EC2 allow us access to on-demand access to huge computational resources\n\n\n
  17. Thanks so much.\n\nFoursquare is hiring, if these projects seem interesting to you, please contact us at foursquare.com/jobs\n
  18. afinn sentiment analysis word list\n
  19. Friend graph for marriage equalitiocalypse, who are friends who checked in at this event\n