9. Every Domain is Unique
● Data
● Time
○ year
○ time of day
○ mood
○ item first appeared
● Item consumed before
● Whose opinion
● Cluster users
10. Solutions
● Apache Mahout
○ hadoop, open source
● Myrrix
○ hadoop, cloud, Cloudera
● easyrec
○ open source, restful
● LensKit
○ open source, movielens
11. Benefits
● Increase on sales
○ Amazon 2006 %35
● Based on real activity
○ Always Up-To-Date
● Great for discovery
● Right item to right user
● Personalization
● Reduced organizational maintenance
○ Navigation
12. Drawbacks
● Personalized recommenders are difficult to
set up
○ Algorithms
○ Scalability
● Maintenance
○ System
○ Monitoring
● Sometimes they’re wrong
● Attacks
○ Outliers
27. Item Based vs User Based vs SVD
● Least memory: SVD
● Most accurate: SVD
● Explanation: Item Based
28. Content Based vs Collaborative
● Least memory: Content Based
● Least learning: Content Based
● No content needed: Collaborative
● Cold start: Collaborative
● Social: Collaborative
● Shortest Prediction Time: Collaborative