SlideShare uma empresa Scribd logo
1 de 40
Baixar para ler offline
Personalisation for Smarter Cities
               Neal Lathia
        UCL Computer Science
         n.lathia@cs.ucl.ac.uk
But first...
●   How many of you …
    ●   own a computer...?
    ●   use Google...?
    ●   have a Facebook account...?
    ●   buy things on Amazon...?
    ●   use Last.fm/Spotify...?
Web Companies
●   Offer you
    ●   Personalised services and advertisements
    ●   Who you know, what you are looking for,
        recommendations for what you may like
●   Make money by
    ●   Doing useful things with the data you create
    ●   Selling advertisements, giving recommendations
●   Everything is centred around “you”
Data Mining (web)
●   Using huge amounts of data
    ●   Clicks onto links
    ●   Ratings for movies
    ●   Friendships
●   With data mining algorithms to
    ●   Understand (predict) how people behave
    ●   Build systems that will help them
Forget the web
●   40,000+ people die on European roads each
    year
●   Congestion costs an estimated 1% of EU GDP
    = 100 Billion Euros
●   Transport accounts for 30% of total energy
    consumption in the EU
●   It is expected that 70% of the world population
    will live in cities by 2050
Lots of problems to be solved
●   Road safety/traffic monitoring
●   Reducing congestion
●   Building sustainable transport networks
●   Urban navigation
Another question
●   How many of you...
    ●   have a smart phone?
    ●   have an Oyster card?
Oyster cards/Mobile Phones
●   These devices produce data that is very similar
    to the data you create online
    ●   Talking/texting friends
    ●   Checking in/rating to locations
    ●   Travelling around London with your Oyster card
Example
●   On last.fm, you only listen to rock music
●   On TfL, you only travel on buses

●   Both are implicit indications of your preferences
My research
●   Can we use the technologies that work so well
    for web companies to solve problems in cities?
My research
●   Can we use the technologies that work so well
    for web companies to solve problems in cities?

●   Today's examples:
    ●   Personalised tube services
    ●   Ticket recommendations
Today's examples
●   Will give you a very brief introduction to things
    people who are doing data mining are
    interested in:
    ●   Clustering
    ●   Regression
    ●   Ranking
    ●   Classification
Clustering
2 x ~300,000 travellers (5%)
~7,000,0000 tube trips
...is this you?
Clustering
●   We are looking for the different habits that
    travellers may have
●   Clustering is a process of automatically
    organising data into groups, so that each group
    has very similar members
How does it work?
Clustering
●   We can start seeing
    how different
    travellers move about
    the city
Regression
Predicting travel time
●   How long will it take me to get there?
●   Every time you travel, you make some data
    ●   From where + what time → to where + what time
●   Every one is creating their own data

●   When you want to travel, can we give you a
    personalised travel time estimate?
How does it work?
●   We design algorithms that leverage this data:
    ●   Self-similarity: how long it took you before
    ●   Familiarity: people who are similar to you
    ●   Context: time you are travelling
How well does it work?
●   On average, how much error in the predictions?
    ●   Using the mean trip time, 11.45 minutes
    ●   Using zone-zone mean time, 8.56 minutes
    ●   Using journey planner, ~6 minutes
    ●   Using our algorithms, < 3 minutes
    ●   Combined algorithm: 2.92 minutes
Ranking
Station Alerts
●   How often do you get to a station and find that
    there is a problem?
●   Travel alerts/disruptions: you need to look
    manually for what is relevant to you
●   But every time you touch in a station, you are
    showing that you are potentially interested in
    what happens there
Ranking
●   Is the process of making an ordered list
    ●   We can automatically make a unique list for each
        person
    ●   The stations will be sorted according to how
        relevant they are to your travels
How does it work?
●   Each station has a weight (a number) that we
    use to sort them
    ●   At first, the weight is just how popular the station is
    ●   We increase the weights of stations you visit often
    ●   We increase the weights of stations that are similar
        to the ones you visit often
         –   Similar, in this case, means that “people who travel
             to/from station X also travel to/from station Y”
Does it work?
●   We use a metric
    called percentile
    ranking
●   Smaller values are
    better
Classification
Paying for travel
●   Is it cheaper to use pay as you go?
●   Which travel card is best?
●   … how do you decide?
Paying for travel
●   Is it cheaper to use pay as you go?
●   Which travel card is best?
●   … how do you decide?

●   The cheapest fare will depend on where you
    need to go, when you need to travel, and how
    you tend to go there (bus, train)
Wasting money?
●   The Oyster card data we have shows what
    ticket people were using
●   We can use their trips to compute what the
    cheapest fare would have been, and then see
    how much money they could have saved
Wasting Money!
●   Based on the data, travellers could save about
                £200 million per year
●   If they were buying the cheapest tickets for their
    travel needs
●   Can we help them buy the best fare?
Classification
●   Is the process of assigning some data to a
    group. In our case,
    ●   Data = a person's travel habits
    ●   Group = the cheapest ticket
How does it work?
●   We used decision
    trees: an automatic
    way of recursively
    partitioning data and
    discovering rules to
    classify data
Example
●   Neal's travel habits:
    ●   2.5 average trips per day
    ●   85% trips on the tube / 15% trips on buses
    ●   75% of trips during peak-hours
    ●   95% of trips between Zone 1 and Zone 2

    ●   Decision tree says: Neal should buy a Zone 1-2
        travel card
Does this work?
●   We can ask our algorithm to predict what the
    best ticket for a person will be, and see if it
    predicts correctly
●   We pick a group that could have saved
    £479,583.91
●   Our algorithm is > 98% accurate; if this group
    followed our recommendations, it would have
    saved £473,918.38
Summary
Summary
●   Data mining for the city
    ●   People are already carrying around Oyster cards
        and mobile phones, and making lots of useful data
        about their movements
    ●   There are a lot of problems that can be tackled
        using data mining
Summary
●   We have looked at examples of
    ●   Clustering: grouping people's behaviours
    ●   Regression: predicting travel times
    ●   Ranking: making an ordered list of stations
    ●   Classification: recommending the best ticket
Personalisation for Smarter Cities
               Neal Lathia
        UCL Computer Science
         n.lathia@cs.ucl.ac.uk

Mais conteúdo relacionado

Semelhante a Personalisation for Smarter Cities

How to start a travel startup
How to start a travel startupHow to start a travel startup
How to start a travel startup
Gillian Morris
 
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
Adaptive Path
 
Designing for Multi-touchpoint Experiences
Designing for Multi-touchpoint ExperiencesDesigning for Multi-touchpoint Experiences
Designing for Multi-touchpoint Experiences
Jamin Hegeman
 
IdeaHacks - Mover's edge
IdeaHacks - Mover's edgeIdeaHacks - Mover's edge
IdeaHacks - Mover's edge
dpseud1
 

Semelhante a Personalisation for Smarter Cities (20)

Seeding Ideas for Intelligent Transport
Seeding Ideas for Intelligent TransportSeeding Ideas for Intelligent Transport
Seeding Ideas for Intelligent Transport
 
How to start a travel startup
How to start a travel startupHow to start a travel startup
How to start a travel startup
 
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
Mapping Experiences and Orchestrating Touchpoints | Chris Risdon & Patrick Qu...
 
How the transport matrix is being reshaped by the consumer's changing expecta...
How the transport matrix is being reshaped by the consumer's changing expecta...How the transport matrix is being reshaped by the consumer's changing expecta...
How the transport matrix is being reshaped by the consumer's changing expecta...
 
Groupand travel deck v2
Groupand travel   deck v2Groupand travel   deck v2
Groupand travel deck v2
 
Webinar: Using smart card and GPS data for policy and planning: the case of T...
Webinar: Using smart card and GPS data for policy and planning: the case of T...Webinar: Using smart card and GPS data for policy and planning: the case of T...
Webinar: Using smart card and GPS data for policy and planning: the case of T...
 
When togo pitch deck
When togo pitch deckWhen togo pitch deck
When togo pitch deck
 
on Service Design
on Service Designon Service Design
on Service Design
 
Designing for Multi-touchpoint Experiences
Designing for Multi-touchpoint ExperiencesDesigning for Multi-touchpoint Experiences
Designing for Multi-touchpoint Experiences
 
Servcice Design Principles of OneTicket
Servcice Design Principles of OneTicketServcice Design Principles of OneTicket
Servcice Design Principles of OneTicket
 
Amazon Treasure Truck Value Proposition
Amazon Treasure Truck Value PropositionAmazon Treasure Truck Value Proposition
Amazon Treasure Truck Value Proposition
 
RTPI 2013 Guy Dangerfield
RTPI 2013 Guy DangerfieldRTPI 2013 Guy Dangerfield
RTPI 2013 Guy Dangerfield
 
IdeaHacks - Mover's edge
IdeaHacks - Mover's edgeIdeaHacks - Mover's edge
IdeaHacks - Mover's edge
 
Groupand travel deckv2.pptx
Groupand travel deckv2.pptxGroupand travel deckv2.pptx
Groupand travel deckv2.pptx
 
Optimum Design for Tourism Line Based on Improved Ant Colony System Algorithm
Optimum Design for Tourism Line Based on Improved Ant Colony System AlgorithmOptimum Design for Tourism Line Based on Improved Ant Colony System Algorithm
Optimum Design for Tourism Line Based on Improved Ant Colony System Algorithm
 
Final Project
Final ProjectFinal Project
Final Project
 
How to Design an On-Demand Transit Service
How to Design an On-Demand Transit ServiceHow to Design an On-Demand Transit Service
How to Design an On-Demand Transit Service
 
Using Analytics for User Research
Using Analytics for User ResearchUsing Analytics for User Research
Using Analytics for User Research
 
Activate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter CitiesActivate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter Cities
 
TIGG
TIGGTIGG
TIGG
 

Mais de Neal Lathia

Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily Life
Neal Lathia
 

Mais de Neal Lathia (20)

Everything around the NLP (London.AI Feb 2021)
Everything around the NLP (London.AI Feb 2021)Everything around the NLP (London.AI Feb 2021)
Everything around the NLP (London.AI Feb 2021)
 
Using machine learning for customer service (Data Talks Club)
Using machine learning for customer service (Data Talks Club)Using machine learning for customer service (Data Talks Club)
Using machine learning for customer service (Data Talks Club)
 
Using language models to supercharge Monzo’s customer support
 Using language models to supercharge Monzo’s customer support Using language models to supercharge Monzo’s customer support
Using language models to supercharge Monzo’s customer support
 
Making Better Decisions Faster
Making Better Decisions FasterMaking Better Decisions Faster
Making Better Decisions Faster
 
Machine Learning, Faster
Machine Learning, FasterMachine Learning, Faster
Machine Learning, Faster
 
AI & Personalised Experiences
AI & Personalised ExperiencesAI & Personalised Experiences
AI & Personalised Experiences
 
Opportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised TravelOpportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised Travel
 
Bootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation EngineBootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation Engine
 
Machine Learning for Product Managers
Machine Learning for Product ManagersMachine Learning for Product Managers
Machine Learning for Product Managers
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)
 
Happier and Healthier with Smartphone Data
Happier and Healthier with Smartphone DataHappier and Healthier with Smartphone Data
Happier and Healthier with Smartphone Data
 
Data Science in Digital Health
Data Science in Digital HealthData Science in Digital Health
Data Science in Digital Health
 
Using Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily LifeUsing Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily Life
 
Analysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone DataAnalysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone Data
 
Cambridge Quantified Self Meetup
Cambridge Quantified Self MeetupCambridge Quantified Self Meetup
Cambridge Quantified Self Meetup
 
Data Science in #mHealth
Data Science in #mHealthData Science in #mHealth
Data Science in #mHealth
 
Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport
 
Emotion Sense: From Design to Deployment
Emotion Sense: From Design to DeploymentEmotion Sense: From Design to Deployment
Emotion Sense: From Design to Deployment
 
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
 
Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily Life
 

Personalisation for Smarter Cities

  • 1. Personalisation for Smarter Cities Neal Lathia UCL Computer Science n.lathia@cs.ucl.ac.uk
  • 2. But first... ● How many of you … ● own a computer...? ● use Google...? ● have a Facebook account...? ● buy things on Amazon...? ● use Last.fm/Spotify...?
  • 3. Web Companies ● Offer you ● Personalised services and advertisements ● Who you know, what you are looking for, recommendations for what you may like ● Make money by ● Doing useful things with the data you create ● Selling advertisements, giving recommendations ● Everything is centred around “you”
  • 4. Data Mining (web) ● Using huge amounts of data ● Clicks onto links ● Ratings for movies ● Friendships ● With data mining algorithms to ● Understand (predict) how people behave ● Build systems that will help them
  • 5. Forget the web ● 40,000+ people die on European roads each year ● Congestion costs an estimated 1% of EU GDP = 100 Billion Euros ● Transport accounts for 30% of total energy consumption in the EU ● It is expected that 70% of the world population will live in cities by 2050
  • 6. Lots of problems to be solved ● Road safety/traffic monitoring ● Reducing congestion ● Building sustainable transport networks ● Urban navigation
  • 7. Another question ● How many of you... ● have a smart phone? ● have an Oyster card?
  • 8. Oyster cards/Mobile Phones ● These devices produce data that is very similar to the data you create online ● Talking/texting friends ● Checking in/rating to locations ● Travelling around London with your Oyster card
  • 9. Example ● On last.fm, you only listen to rock music ● On TfL, you only travel on buses ● Both are implicit indications of your preferences
  • 10. My research ● Can we use the technologies that work so well for web companies to solve problems in cities?
  • 11. My research ● Can we use the technologies that work so well for web companies to solve problems in cities? ● Today's examples: ● Personalised tube services ● Ticket recommendations
  • 12. Today's examples ● Will give you a very brief introduction to things people who are doing data mining are interested in: ● Clustering ● Regression ● Ranking ● Classification
  • 14. 2 x ~300,000 travellers (5%) ~7,000,0000 tube trips
  • 16. Clustering ● We are looking for the different habits that travellers may have ● Clustering is a process of automatically organising data into groups, so that each group has very similar members
  • 17. How does it work?
  • 18. Clustering ● We can start seeing how different travellers move about the city
  • 20. Predicting travel time ● How long will it take me to get there? ● Every time you travel, you make some data ● From where + what time → to where + what time ● Every one is creating their own data ● When you want to travel, can we give you a personalised travel time estimate?
  • 21. How does it work? ● We design algorithms that leverage this data: ● Self-similarity: how long it took you before ● Familiarity: people who are similar to you ● Context: time you are travelling
  • 22. How well does it work? ● On average, how much error in the predictions? ● Using the mean trip time, 11.45 minutes ● Using zone-zone mean time, 8.56 minutes ● Using journey planner, ~6 minutes ● Using our algorithms, < 3 minutes ● Combined algorithm: 2.92 minutes
  • 24. Station Alerts ● How often do you get to a station and find that there is a problem? ● Travel alerts/disruptions: you need to look manually for what is relevant to you ● But every time you touch in a station, you are showing that you are potentially interested in what happens there
  • 25. Ranking ● Is the process of making an ordered list ● We can automatically make a unique list for each person ● The stations will be sorted according to how relevant they are to your travels
  • 26. How does it work? ● Each station has a weight (a number) that we use to sort them ● At first, the weight is just how popular the station is ● We increase the weights of stations you visit often ● We increase the weights of stations that are similar to the ones you visit often – Similar, in this case, means that “people who travel to/from station X also travel to/from station Y”
  • 27. Does it work? ● We use a metric called percentile ranking ● Smaller values are better
  • 29. Paying for travel ● Is it cheaper to use pay as you go? ● Which travel card is best? ● … how do you decide?
  • 30. Paying for travel ● Is it cheaper to use pay as you go? ● Which travel card is best? ● … how do you decide? ● The cheapest fare will depend on where you need to go, when you need to travel, and how you tend to go there (bus, train)
  • 31. Wasting money? ● The Oyster card data we have shows what ticket people were using ● We can use their trips to compute what the cheapest fare would have been, and then see how much money they could have saved
  • 32. Wasting Money! ● Based on the data, travellers could save about £200 million per year ● If they were buying the cheapest tickets for their travel needs ● Can we help them buy the best fare?
  • 33. Classification ● Is the process of assigning some data to a group. In our case, ● Data = a person's travel habits ● Group = the cheapest ticket
  • 34. How does it work? ● We used decision trees: an automatic way of recursively partitioning data and discovering rules to classify data
  • 35. Example ● Neal's travel habits: ● 2.5 average trips per day ● 85% trips on the tube / 15% trips on buses ● 75% of trips during peak-hours ● 95% of trips between Zone 1 and Zone 2 ● Decision tree says: Neal should buy a Zone 1-2 travel card
  • 36. Does this work? ● We can ask our algorithm to predict what the best ticket for a person will be, and see if it predicts correctly ● We pick a group that could have saved £479,583.91 ● Our algorithm is > 98% accurate; if this group followed our recommendations, it would have saved £473,918.38
  • 38. Summary ● Data mining for the city ● People are already carrying around Oyster cards and mobile phones, and making lots of useful data about their movements ● There are a lot of problems that can be tackled using data mining
  • 39. Summary ● We have looked at examples of ● Clustering: grouping people's behaviours ● Regression: predicting travel times ● Ranking: making an ordered list of stations ● Classification: recommending the best ticket
  • 40. Personalisation for Smarter Cities Neal Lathia UCL Computer Science n.lathia@cs.ucl.ac.uk