SlideShare uma empresa Scribd logo
1 de 23
Menno van der Sman
       Lead Developer


   Coen Stevens
   Recommendation Engineer
Mission:
Discover software & games
Updates
Searching




            powered by
Recommendations




   Codename: Ludwig
How to get started?



Research                                              Mathemagicians
 Amazon, Netflix etc
                                                          Peter Tegelaar & Coen Stevens




                         Ludwig created
                      recommender system in ruby running on EC2
Challenges
when building your first recommender system
Data
                     what do we have?

  Usage (implicit)         vs.      Ratings (explicit)

• Noisy                       • Accurate
• Only positive               • Positive and negative
  feedback                       feedback


• Easy to collect             • Hard to collect
Item-Based Collaborative Filtering
             User software usage matrix
                       Software items




               220   90         180          22

               280   12    42           80

     Users     175 210          210          45

               165   14    35   195     13   25

                     100   50   185          35   190

                     60         65                185
Classified user software usage matrix (1, 2, 3)
                    Software items




            3   2            2           2

            3   2      1             2

Users       3   3            2           3

            2   1      2     2       3   2

                3      2     2           2   3

                1            2               3
How do we predict the probability that I would like to use GMail?
                              Software items




                      3   2            2           2

                      3   2      1             2

         Users        3   3      ?     2           3

                      2   1      2     2       3   2

                          3      2     2           2   3

                          1            2               3
Calculate the similarities between Gmail and the other software items.
                                      Software items




                          3       2                 2       2

                          3       2        1            2

            Users         3       3                 2       3

                          2       1        2        2   3   2

                                  3        2        2       2   3

                                  1                 2           3


                       Similarity(Firefox, Gmail)
Calculate the similarities between Gmail and the other software items.
    Gmail similarities




              0.6        3   2       2       2

              0.8        3   2   1       2

              1.0        3   3       2       3

              0.4        2   1   2   2   3   2

              0.4            3   2   2       2   3

              0.3            1       2           3

              0.3
Calculate the predicted value for Gmail
Gmail similarities   User usage




          0.6               3

          0.8               3

          1.0

          0.4               2

          0.4

          0.3               3

          0.3
Calculate the predicted value for Gmail
Gmail similarities   User usage



                                      We take only the ‘K’ most similar items (say 2)
          0.6               3

          0.8               3

          1.0

          0.4               2

          0.4

          0.3               3                          0.6*3 + 0.8*3
                                                                               = 2.8
                                                    0.6 + 0.8 + 0.4 + 0.3
          0.3
Calculate all unknown values and
show the Top-N recommendations to each user
                    Software items




            3   2      ?     2 ?     ?   2

            3   2      1 ? 2 ? ?
Users       3   3      ? 2 ? 3 ?
            2   1      2 2 3 2 ?
            ?   3      2 2 ? 2 3
            ?   1      ? 2 ? ? 3
Metrics
                  measure for success


                     Space complexity: O(m + Kn)


Computational complexity: O(m + n²)


      Performance: Root Mean Squared Error
Evaluating the approach


Maximize
           (      performance

                      cost      )
      This is easy with EC2
Why EC2?

Low cost

              Flexibility

Ease of use
Infrastructure
Wakoopa                      EC2
              checkout
Repository
                           Computing
Application                 power



 Database     ssh tunnel
                             Big
                           Database
Want more?


 http://recked.org

 Time & place TBD
Wakoopa Recommendations Engine on AWS

Mais conteúdo relacionado

Destaque

AWS Customer Presentation-Costcutter
AWS Customer Presentation-CostcutterAWS Customer Presentation-Costcutter
AWS Customer Presentation-Costcutter
Amazon Web Services
 
AWS Customer Presentation - Melodeo
AWS Customer Presentation - MelodeoAWS Customer Presentation - Melodeo
AWS Customer Presentation - Melodeo
Amazon Web Services
 
AWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavisAWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavis
Amazon Web Services
 
Geocloud blue raster web mapping cloud deployment lessons from the field 201...
Geocloud blue raster web mapping cloud deployment  lessons from the field 201...Geocloud blue raster web mapping cloud deployment  lessons from the field 201...
Geocloud blue raster web mapping cloud deployment lessons from the field 201...
Amazon Web Services
 

Destaque (19)

AWS Customer Presentation-Costcutter
AWS Customer Presentation-CostcutterAWS Customer Presentation-Costcutter
AWS Customer Presentation-Costcutter
 
AWS Customer Presentation - Iloverewards
AWS Customer Presentation - IloverewardsAWS Customer Presentation - Iloverewards
AWS Customer Presentation - Iloverewards
 
AWS Customer Presentation - SchoolofEverything
AWS Customer Presentation - SchoolofEverythingAWS Customer Presentation - SchoolofEverything
AWS Customer Presentation - SchoolofEverything
 
AWS Customer Presentation - Cloud Made
AWS Customer Presentation - Cloud MadeAWS Customer Presentation - Cloud Made
AWS Customer Presentation - Cloud Made
 
AWS Customer Presentation - Melodeo
AWS Customer Presentation - MelodeoAWS Customer Presentation - Melodeo
AWS Customer Presentation - Melodeo
 
AWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavisAWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavis
 
Geocloud blue raster web mapping cloud deployment lessons from the field 201...
Geocloud blue raster web mapping cloud deployment  lessons from the field 201...Geocloud blue raster web mapping cloud deployment  lessons from the field 201...
Geocloud blue raster web mapping cloud deployment lessons from the field 201...
 
AWS Customer Presenatation - SlingMedia uses AWS
AWS Customer Presenatation - SlingMedia uses AWSAWS Customer Presenatation - SlingMedia uses AWS
AWS Customer Presenatation - SlingMedia uses AWS
 
AWS Customer Presentation - Skifta
AWS Customer Presentation - SkiftaAWS Customer Presentation - Skifta
AWS Customer Presentation - Skifta
 
AWS Customer Presentation - Zynga
AWS Customer Presentation - ZyngaAWS Customer Presentation - Zynga
AWS Customer Presentation - Zynga
 
AWS Tech Summit - Berlin 2011 - Running Java Applications on AWS
AWS Tech Summit - Berlin 2011 - Running Java Applications on AWSAWS Tech Summit - Berlin 2011 - Running Java Applications on AWS
AWS Tech Summit - Berlin 2011 - Running Java Applications on AWS
 
Building a PaaS with Docker and AWS
Building a PaaS with Docker and AWSBuilding a PaaS with Docker and AWS
Building a PaaS with Docker and AWS
 
Databases in the Cloud
Databases in the CloudDatabases in the Cloud
Databases in the Cloud
 
AWS Elastic Beanstalk
AWS Elastic BeanstalkAWS Elastic Beanstalk
AWS Elastic Beanstalk
 
Werner Vogels
Werner Vogels Werner Vogels
Werner Vogels
 
(MBL313) NEW! AWS IoT: Understanding Hardware Kits, SDKs, & Protocols
(MBL313) NEW! AWS IoT: Understanding Hardware Kits, SDKs, & Protocols(MBL313) NEW! AWS IoT: Understanding Hardware Kits, SDKs, & Protocols
(MBL313) NEW! AWS IoT: Understanding Hardware Kits, SDKs, & Protocols
 
AWS Customer Presentation - Heavy.com
AWS Customer Presentation - Heavy.com AWS Customer Presentation - Heavy.com
AWS Customer Presentation - Heavy.com
 
STG201 Understanding AWS Storage Options - - AWS re: Invent 2012
STG201 Understanding AWS Storage Options - - AWS re: Invent 2012STG201 Understanding AWS Storage Options - - AWS re: Invent 2012
STG201 Understanding AWS Storage Options - - AWS re: Invent 2012
 
Big Data & The Cloud
Big Data & The CloudBig Data & The Cloud
Big Data & The Cloud
 

Semelhante a Wakoopa Recommendations Engine on AWS

Kicking ass with redis
Kicking ass with redisKicking ass with redis
Kicking ass with redis
Dvir Volk
 
Infrastructure for cloud_computing
Infrastructure for cloud_computingInfrastructure for cloud_computing
Infrastructure for cloud_computing
JULIO GONZALEZ SANZ
 
Autodesk Maya 2023.pdf
Autodesk Maya 2023.pdfAutodesk Maya 2023.pdf
Autodesk Maya 2023.pdf
HamzaJani4
 
Lead Allocation System's Attribute Driven Design (ADD)
Lead Allocation System's Attribute Driven Design (ADD)Lead Allocation System's Attribute Driven Design (ADD)
Lead Allocation System's Attribute Driven Design (ADD)
Amin Bandeali
 
ngGoBuilder and collaborative development between San Francisco and Tokyo
ngGoBuilder and collaborative development between San Francisco and TokyongGoBuilder and collaborative development between San Francisco and Tokyo
ngGoBuilder and collaborative development between San Francisco and Tokyo
notolab
 

Semelhante a Wakoopa Recommendations Engine on AWS (20)

Fast Depth Paper Review
Fast Depth Paper ReviewFast Depth Paper Review
Fast Depth Paper Review
 
3D Analyst 2.3 Release Notes
3D Analyst 2.3 Release Notes3D Analyst 2.3 Release Notes
3D Analyst 2.3 Release Notes
 
CRM Vendor Evaluation Matrix
CRM Vendor Evaluation MatrixCRM Vendor Evaluation Matrix
CRM Vendor Evaluation Matrix
 
Cloud computing_processing frameworks
Cloud computing_processing frameworksCloud computing_processing frameworks
Cloud computing_processing frameworks
 
Real World Patterns for Cloud Computing
Real World Patterns for Cloud ComputingReal World Patterns for Cloud Computing
Real World Patterns for Cloud Computing
 
Kicking ass with redis
Kicking ass with redisKicking ass with redis
Kicking ass with redis
 
Infrastructure for cloud_computing
Infrastructure for cloud_computingInfrastructure for cloud_computing
Infrastructure for cloud_computing
 
[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
 
Autodesk Maya 2023.pdf
Autodesk Maya 2023.pdfAutodesk Maya 2023.pdf
Autodesk Maya 2023.pdf
 
Lead Allocation System's Attribute Driven Design (ADD)
Lead Allocation System's Attribute Driven Design (ADD)Lead Allocation System's Attribute Driven Design (ADD)
Lead Allocation System's Attribute Driven Design (ADD)
 
Mastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GISMastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GIS
 
Email Marketing Vendor Evaluation
Email Marketing Vendor EvaluationEmail Marketing Vendor Evaluation
Email Marketing Vendor Evaluation
 
MongoDB Stitch Introduction
MongoDB Stitch IntroductionMongoDB Stitch Introduction
MongoDB Stitch Introduction
 
Expert guidance on migrating from magento 1 to magento 2
Expert guidance on migrating from magento 1 to magento 2Expert guidance on migrating from magento 1 to magento 2
Expert guidance on migrating from magento 1 to magento 2
 
srs-example.pdf
srs-example.pdfsrs-example.pdf
srs-example.pdf
 
Cloud Gaming Architectures: From Social to Mobile to MMO
Cloud Gaming Architectures: From Social to Mobile to MMOCloud Gaming Architectures: From Social to Mobile to MMO
Cloud Gaming Architectures: From Social to Mobile to MMO
 
ngGoBuilder and collaborative development between San Francisco and Tokyo
ngGoBuilder and collaborative development between San Francisco and TokyongGoBuilder and collaborative development between San Francisco and Tokyo
ngGoBuilder and collaborative development between San Francisco and Tokyo
 
Metaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdfMetaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdf
 
[MongoDB.local Bengaluru 2018] Introduction to MongoDB Stitch
[MongoDB.local Bengaluru 2018] Introduction to MongoDB Stitch[MongoDB.local Bengaluru 2018] Introduction to MongoDB Stitch
[MongoDB.local Bengaluru 2018] Introduction to MongoDB Stitch
 
IRJET- Augmented Reality based Building Modelling
IRJET- Augmented Reality based Building Modelling IRJET- Augmented Reality based Building Modelling
IRJET- Augmented Reality based Building Modelling
 

Mais de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Mais de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Wakoopa Recommendations Engine on AWS

  • 1.
  • 2. Menno van der Sman Lead Developer Coen Stevens Recommendation Engineer
  • 5. Searching powered by
  • 6. Recommendations Codename: Ludwig
  • 7. How to get started? Research Mathemagicians Amazon, Netflix etc Peter Tegelaar & Coen Stevens Ludwig created recommender system in ruby running on EC2
  • 8. Challenges when building your first recommender system
  • 9. Data what do we have? Usage (implicit) vs. Ratings (explicit) • Noisy • Accurate • Only positive • Positive and negative feedback feedback • Easy to collect • Hard to collect
  • 10. Item-Based Collaborative Filtering User software usage matrix Software items 220 90 180 22 280 12 42 80 Users 175 210 210 45 165 14 35 195 13 25 100 50 185 35 190 60 65 185
  • 11. Classified user software usage matrix (1, 2, 3) Software items 3 2 2 2 3 2 1 2 Users 3 3 2 3 2 1 2 2 3 2 3 2 2 2 3 1 2 3
  • 12. How do we predict the probability that I would like to use GMail? Software items 3 2 2 2 3 2 1 2 Users 3 3 ? 2 3 2 1 2 2 3 2 3 2 2 2 3 1 2 3
  • 13. Calculate the similarities between Gmail and the other software items. Software items 3 2 2 2 3 2 1 2 Users 3 3 2 3 2 1 2 2 3 2 3 2 2 2 3 1 2 3 Similarity(Firefox, Gmail)
  • 14. Calculate the similarities between Gmail and the other software items. Gmail similarities 0.6 3 2 2 2 0.8 3 2 1 2 1.0 3 3 2 3 0.4 2 1 2 2 3 2 0.4 3 2 2 2 3 0.3 1 2 3 0.3
  • 15. Calculate the predicted value for Gmail Gmail similarities User usage 0.6 3 0.8 3 1.0 0.4 2 0.4 0.3 3 0.3
  • 16. Calculate the predicted value for Gmail Gmail similarities User usage We take only the ‘K’ most similar items (say 2) 0.6 3 0.8 3 1.0 0.4 2 0.4 0.3 3 0.6*3 + 0.8*3 = 2.8 0.6 + 0.8 + 0.4 + 0.3 0.3
  • 17. Calculate all unknown values and show the Top-N recommendations to each user Software items 3 2 ? 2 ? ? 2 3 2 1 ? 2 ? ? Users 3 3 ? 2 ? 3 ? 2 1 2 2 3 2 ? ? 3 2 2 ? 2 3 ? 1 ? 2 ? ? 3
  • 18. Metrics measure for success Space complexity: O(m + Kn) Computational complexity: O(m + n²) Performance: Root Mean Squared Error
  • 19. Evaluating the approach Maximize ( performance cost ) This is easy with EC2
  • 20. Why EC2? Low cost Flexibility Ease of use
  • 21. Infrastructure Wakoopa EC2 checkout Repository Computing Application power Database ssh tunnel Big Database
  • 22. Want more? http://recked.org Time & place TBD