SlideShare a Scribd company logo
1 of 25
Download to read offline
Using Trust
in Recommender Systems:
  an experimental analysis

            Paolo Massa
         University of Trento

(joint work with Bobby Bhattacharjee, UMD)
Motivation:
1. Recommender Systems recommends
   items the user might like, based on
   past ratings.
2. Now, Decentralized publishing of info:
  – Ratings on Items
  – Trust on Principals
     [Semantic Web]
3. New issues (sparseness, scalability,
   trust, attacks, ...)
           ... Trust-aware Decentralized RS
Summary
1. Recommender Systems (RSs)
  – Weaknesses
2. Solution: trust-awareness
  – Trust and trust metrics
3. Experiments on Epinions.com
  – Evidence trust solves RSs problems
  – (~50.000 users!)
4. Future works
Collaborative Filtering (CF)
1.       Input: ratings given by users to items
     ●     I like “ Titanic” as 4/5
2.       I ask recommendation
3.       RS computes the similarity of me
         against every other user
     ●     Pearson correlation coefficient
4.       RS find similar users and suggests to
         me items liked by them.
Item 1
                         It e m 2
                                    It e m 3
                                               It e m 4
                   I
   User1       2 5 ? 5
               2 5 5 5
   User2       5                    1          3
   User3       5         5                     1
   User4       2
               2         5
                         5          5
                                    5          4
                                               4

It does not consider the content of the items, only
     the ratings given by users.
It works independently of the domain (also jokes)
                             BUT
Overlapping of rated items required!
RSs weaknesses
1. Ratings Matrix sparseness (95-99%)
  – Low or No overlapping (users not comparable)
2. Cold start
  – New users have 0 ratings (->not comparable)
3. Easy Attacks by Malicious Users
  – Copy profile and become the most similar
4. Hard to understand and control
  – Black box (bad recs -> user gives up)

                Solution? Trust of course!
Trust-awareness
1. Trust statement =Rating by human to
   human about her usefulness (ex: in
   providing good movie reviews)
2. Explicitly provided
3. Trust is subjective! T(A,Z)=1 & T(B,Z)=0
  – No Global BAD principals!!!
4. Trust is asymmetric! I trust Bill Gates.
5. FOAF (Friend-Of-A-Friend) is an XML
   format to express relationships
  – Some millions files out there...
Trust Networks




     ME




6 degrees of separation
“ theorem”
Trust metrics
1. Task: based on known trust edges,
   predict trustworthiness of principals
2. Trust propagation (A->B,B->C|A-?->C)
3. Global (pagerank, ebay, ...)
4. Local (personalized)


                                  ME
Trust solves RS problems
1.   Trust solves CF sparseness problem
   – trust propagation and “ 6 degrees” -> reach many
2. Trust solves Cold Start problem
   – “ just add 1 friend”
3. Trust metrics resistant to copy-profile-attack.
   – “ you can be similar but if no trust path to you ...”
4. Trust easier to understand and control
     –    trust nets supports Explanation (HCI tests needed)

              EVIDENCE of 1 and 2 provided by analyzing a
                 REAL, VAST community (Epinions.com)
Experiment: Epinions.com
1. Epinions.com' users can
  –   Review and rate items (from 1 to 5)
  –   Keep web of trust (trust=1) and block list (trust=0).
  –   “ Reviewers whose reviews and ratings you have
      consistently found to be valuable” (Epinions FAQ)
2. Dataset (by crawling site):
  – ~50K users, ~140K items, ~660K ratings.
  – ~500K trust statements.
      •   No block list (not shown on site)
Epinions' recommendations
Taken one user “ ME” , we can
- use CF on ratings and compute
    “ similarity” of other users
- use Trust Metric and compute
    “ trustworthiness” of other users
Then we can suggest items liked by similar
    or trustable users.

          On how many users are they
               computable?
Statistics (1)




#Ratings expressed by Users
(#rev<5) = 52.82%! [Cold start users]
Statistics (2)




#Trust statements expressed by Users
(#trust<5) = 70.18%!
User Similarity Computability
1. Ideally, every user should be
   comparable against every other user.
2. BUT ratings sparseness = 99.99135%
   -> tiny overlapping between 2 users
3. Pearson correlation coefficient
   meaningful only if overlapping(A,U)>1
4. Question: taken one user, how many
   users are comparable?
US computability (cont.)
1. Taken one user, we computed all the
   comparable users.
  – On average an user has 161 comparable
    users (ideally ~50.000!)
2. We have averaged
   #comparable_users over users who
   expressed a certain number of
   reviews.
US computability (cont.)




         Cold Start Users


Ex: users with 40 reviews have ~800 comparable users.
                            BUT users (y axis) are ~50.000!
             And for Cold Start Users (>50%) this is 2.74
Trust computability
1. Trust metrics predict trust in unknown
   users based on known trust
   statements.
2. Distance from ME to U is a first
   measure of Trust computability
3. On average,
  – In 2 steps, reach 400 users
  – In 3 steps, reach 4386 users
Mean # Reachable Users (in k steps) for users
   expressing X trust statements
    In few steps, you can predict trust in every user!
                         Even for Cold Start Users!!!
Trust and US computability
                  comparison
  Mean number of Comparable                Mean number of Comparable
      users for All users                   users for Cold Start users
     Propagating Trust             Using        Propagating Trust      Using
Dist 1   Dist 2   Dist 3   Dist 4 Pearson Dist 1 Dist 2 Dist 3 Dist 4 Pearson

9.88     400      4386 16334      161    2.14 94.54 1675       9121    2.74
Contribution
Experimental evidence that
  – CF is ineffective in real world scenarios
     •   Especially for Cold Start users.
  – Trust can solve CF problems
     •   Sparseness
     •   Cold Start
     •   Attacks (self-evident)


  Trust is computable on many more users than
     user similarity
     Especially for cold start users (the majority!)
Future works
1. US and Trust correlate? Contradict?
  – US over trusted is higher than usual?
2. Distrust?
  – Propagation? Properties?
3. Design a Trust Metric (for RS)
  – Create and evaluate a Trust-aware RS
     •   Input data
Thanks for your attention!



   Questions?

                                    Paolo Massa
                             Email: massa@itc.it
   Blog: http://moloko.itc.it/paoloblog/index.html
Collaborative Filtering
Similarity measure: Pearson Correlation
Coefficient of user a and u
                         m
                    ∑i=1 r a ,i −r a r u ,i −r u 
   w a , u=
               ∑    m
                     i=1
                         r a , i −r u 
                                           2      m
                                               ∑i=1 r u , i −r u 
                                                                      2



 Prediction of rating given by user a to ite
                         n
                    ∑u=1  r u , i −ru ∗w a , u
   p a , i =r a 
                                n
                             ∑u=1 w a , u
Hard Trust and Soft Trust
1. Vocabulary:
  – Hard Trust: about security, identity of
    something (user, device, information)
     •   Public key cryptography
  – Soft Trust: appreciation of some principal
    (explicitly provided by another principal)
     •   Social Networks and Trust Metrics

More Related Content

Similar to Using Trust in Recommender Systems: an experimental analysis

IJCAI Workshop Presentation
IJCAI Workshop PresentationIJCAI Workshop Presentation
IJCAI Workshop Presentation
Neal Lathia
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
Ernesto Mislej
 
Endorphin. Making sense of social data
Endorphin. Making sense of social dataEndorphin. Making sense of social data
Endorphin. Making sense of social data
Artem Zavyalov
 

Similar to Using Trust in Recommender Systems: an experimental analysis (20)

Controversial Users demand Local Trust Metrics: an Experimental Study on Epin...
Controversial Users demand Local Trust Metrics: an Experimental Study on Epin...Controversial Users demand Local Trust Metrics: an Experimental Study on Epin...
Controversial Users demand Local Trust Metrics: an Experimental Study on Epin...
 
Trustlet, Open Research on Trust Metrics
Trustlet, Open Research on Trust MetricsTrustlet, Open Research on Trust Metrics
Trustlet, Open Research on Trust Metrics
 
Estimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyEstimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User Study
 
Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...
 
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
 
Rated Ranking Evaluator (RRE) Hands-on Relevance Testing @Chorus
Rated Ranking Evaluator (RRE) Hands-on Relevance Testing @ChorusRated Ranking Evaluator (RRE) Hands-on Relevance Testing @Chorus
Rated Ranking Evaluator (RRE) Hands-on Relevance Testing @Chorus
 
IRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion MiningIRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion Mining
 
The Wisdom of the Few @SIGIR09
The Wisdom of the Few @SIGIR09The Wisdom of the Few @SIGIR09
The Wisdom of the Few @SIGIR09
 
IJCAI Workshop Presentation
IJCAI Workshop PresentationIJCAI Workshop Presentation
IJCAI Workshop Presentation
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CF
 
A Reliable Peer-to-Peer Platform for Adding New Node Using Trust Based Model
A Reliable Peer-to-Peer Platform for Adding New Node Using Trust Based Model    A Reliable Peer-to-Peer Platform for Adding New Node Using Trust Based Model
A Reliable Peer-to-Peer Platform for Adding New Node Using Trust Based Model
 
Telefonica Lunch Seminar
Telefonica Lunch SeminarTelefonica Lunch Seminar
Telefonica Lunch Seminar
 
Enhancing Multi-Aspect Collaborative Filtering for Personalized Recommendation
Enhancing Multi-Aspect Collaborative Filtering for Personalized RecommendationEnhancing Multi-Aspect Collaborative Filtering for Personalized Recommendation
Enhancing Multi-Aspect Collaborative Filtering for Personalized Recommendation
 
Computing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback CommentsComputing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback Comments
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
Collective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and MetadataCollective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and Metadata
 
A Survey on Evaluating Sentiments by Using Artificial Neural Network
A Survey on Evaluating Sentiments by Using Artificial Neural NetworkA Survey on Evaluating Sentiments by Using Artificial Neural Network
A Survey on Evaluating Sentiments by Using Artificial Neural Network
 
IAC22 Safe Tech Audit Presentation Noreen Whysel.pptx
IAC22 Safe Tech Audit Presentation Noreen Whysel.pptxIAC22 Safe Tech Audit Presentation Noreen Whysel.pptx
IAC22 Safe Tech Audit Presentation Noreen Whysel.pptx
 
Endorphin. Making sense of social data
Endorphin. Making sense of social dataEndorphin. Making sense of social data
Endorphin. Making sense of social data
 
Rating System Algorithms Document
Rating System Algorithms DocumentRating System Algorithms Document
Rating System Algorithms Document
 

More from Paolo Massa

Gamification Features 4 Fitcity
Gamification Features 4 FitcityGamification Features 4 Fitcity
Gamification Features 4 Fitcity
Paolo Massa
 
Social fitness (fitcity project)
Social fitness (fitcity project)Social fitness (fitcity project)
Social fitness (fitcity project)
Paolo Massa
 
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
Paolo Massa
 
The Simplicity Cycle by Dan Ward
The Simplicity Cycle by Dan WardThe Simplicity Cycle by Dan Ward
The Simplicity Cycle by Dan Ward
Paolo Massa
 
Invited talk at Future Networked Technologies / FIT-IT research calls opening...
Invited talk at Future Networked Technologies / FIT-IT research calls opening...Invited talk at Future Networked Technologies / FIT-IT research calls opening...
Invited talk at Future Networked Technologies / FIT-IT research calls opening...
Paolo Massa
 
OMG Girlz Don't Exist on teh Intarweb!!!!1
OMG Girlz Don't Exist on teh Intarweb!!!!1OMG Girlz Don't Exist on teh Intarweb!!!!1
OMG Girlz Don't Exist on teh Intarweb!!!!1
Paolo Massa
 

More from Paolo Massa (20)

Monitoraggio - Alternanza Scuola Lavoro - 2016 (Slides del Ministro)
Monitoraggio - Alternanza Scuola Lavoro - 2016 (Slides del Ministro)Monitoraggio - Alternanza Scuola Lavoro - 2016 (Slides del Ministro)
Monitoraggio - Alternanza Scuola Lavoro - 2016 (Slides del Ministro)
 
Manypedia: Comparing Language Points of View of Wikipedia Communities
Manypedia: Comparing  Language Points of View  of Wikipedia CommunitiesManypedia: Comparing  Language Points of View  of Wikipedia Communities
Manypedia: Comparing Language Points of View of Wikipedia Communities
 
Gamification Features 4 Fitcity
Gamification Features 4 FitcityGamification Features 4 Fitcity
Gamification Features 4 Fitcity
 
Rete e Reti: Per-che' e per-chi?
Rete e Reti: Per-che' e per-chi?Rete e Reti: Per-che' e per-chi?
Rete e Reti: Per-che' e per-chi?
 
Social fitness (fitcity project)
Social fitness (fitcity project)Social fitness (fitcity project)
Social fitness (fitcity project)
 
DESIGN PRINCIPLES OF WIKIS AND THEIR IMPACT ON KNOWLEDGE EXCHANGE PROCESSES
DESIGN PRINCIPLES OF WIKIS AND THEIR IMPACT ON KNOWLEDGE EXCHANGE PROCESSES  DESIGN PRINCIPLES OF WIKIS AND THEIR IMPACT ON KNOWLEDGE EXCHANGE PROCESSES
DESIGN PRINCIPLES OF WIKIS AND THEIR IMPACT ON KNOWLEDGE EXCHANGE PROCESSES
 
Reputation: local or global?
Reputation: local or global?Reputation: local or global?
Reputation: local or global?
 
Collective Memory building in Wikipedia: the case of North African uprisings
Collective Memory building in Wikipedia: the case of North African uprisingsCollective Memory building in Wikipedia: the case of North African uprisings
Collective Memory building in Wikipedia: the case of North African uprisings
 
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
Social networks of Wikipedia - Paolo Massa - Presentation at (2011). ACM Hype...
 
Social net-work 4 your business
Social net-work 4 your businessSocial net-work 4 your business
Social net-work 4 your business
 
An Empirical Analysis on Social Capital and Enterprise 2.0 Participation in a...
An Empirical Analysis on Social Capital and Enterprise 2.0 Participation in a...An Empirical Analysis on Social Capital and Enterprise 2.0 Participation in a...
An Empirical Analysis on Social Capital and Enterprise 2.0 Participation in a...
 
Supporting Collaborative Networks in Organizational Settings using an Enterpr...
Supporting Collaborative Networks in Organizational Settings using an Enterpr...Supporting Collaborative Networks in Organizational Settings using an Enterpr...
Supporting Collaborative Networks in Organizational Settings using an Enterpr...
 
Combining Ridesharing& Social Networks
Combining Ridesharing& Social NetworksCombining Ridesharing& Social Networks
Combining Ridesharing& Social Networks
 
The Simplicity Cycle by Dan Ward
The Simplicity Cycle by Dan WardThe Simplicity Cycle by Dan Ward
The Simplicity Cycle by Dan Ward
 
Invited talk at Future Networked Technologies / FIT-IT research calls opening...
Invited talk at Future Networked Technologies / FIT-IT research calls opening...Invited talk at Future Networked Technologies / FIT-IT research calls opening...
Invited talk at Future Networked Technologies / FIT-IT research calls opening...
 
The Future of Work, Fun, and Being Social: an introduction to the nascent adv...
The Future of Work, Fun, and Being Social: an introduction to the nascent adv...The Future of Work, Fun, and Being Social: an introduction to the nascent adv...
The Future of Work, Fun, and Being Social: an introduction to the nascent adv...
 
Feedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online CommunitiesFeedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online Communities
 
Bowling Alone and Trust Decline in Social Network Sites
Bowling Alone and  Trust Decline in  Social Network SitesBowling Alone and  Trust Decline in  Social Network Sites
Bowling Alone and Trust Decline in Social Network Sites
 
Social Networking 4 your business
Social Networking 4 your businessSocial Networking 4 your business
Social Networking 4 your business
 
OMG Girlz Don't Exist on teh Intarweb!!!!1
OMG Girlz Don't Exist on teh Intarweb!!!!1OMG Girlz Don't Exist on teh Intarweb!!!!1
OMG Girlz Don't Exist on teh Intarweb!!!!1
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Using Trust in Recommender Systems: an experimental analysis

  • 1. Using Trust in Recommender Systems: an experimental analysis Paolo Massa University of Trento (joint work with Bobby Bhattacharjee, UMD)
  • 2. Motivation: 1. Recommender Systems recommends items the user might like, based on past ratings. 2. Now, Decentralized publishing of info: – Ratings on Items – Trust on Principals [Semantic Web] 3. New issues (sparseness, scalability, trust, attacks, ...) ... Trust-aware Decentralized RS
  • 3. Summary 1. Recommender Systems (RSs) – Weaknesses 2. Solution: trust-awareness – Trust and trust metrics 3. Experiments on Epinions.com – Evidence trust solves RSs problems – (~50.000 users!) 4. Future works
  • 4. Collaborative Filtering (CF) 1. Input: ratings given by users to items ● I like “ Titanic” as 4/5 2. I ask recommendation 3. RS computes the similarity of me against every other user ● Pearson correlation coefficient 4. RS find similar users and suggests to me items liked by them.
  • 5. Item 1 It e m 2 It e m 3 It e m 4 I User1 2 5 ? 5 2 5 5 5 User2 5 1 3 User3 5 5 1 User4 2 2 5 5 5 5 4 4 It does not consider the content of the items, only the ratings given by users. It works independently of the domain (also jokes) BUT Overlapping of rated items required!
  • 6. RSs weaknesses 1. Ratings Matrix sparseness (95-99%) – Low or No overlapping (users not comparable) 2. Cold start – New users have 0 ratings (->not comparable) 3. Easy Attacks by Malicious Users – Copy profile and become the most similar 4. Hard to understand and control – Black box (bad recs -> user gives up) Solution? Trust of course!
  • 7. Trust-awareness 1. Trust statement =Rating by human to human about her usefulness (ex: in providing good movie reviews) 2. Explicitly provided 3. Trust is subjective! T(A,Z)=1 & T(B,Z)=0 – No Global BAD principals!!! 4. Trust is asymmetric! I trust Bill Gates. 5. FOAF (Friend-Of-A-Friend) is an XML format to express relationships – Some millions files out there...
  • 8. Trust Networks ME 6 degrees of separation “ theorem”
  • 9. Trust metrics 1. Task: based on known trust edges, predict trustworthiness of principals 2. Trust propagation (A->B,B->C|A-?->C) 3. Global (pagerank, ebay, ...) 4. Local (personalized) ME
  • 10. Trust solves RS problems 1. Trust solves CF sparseness problem – trust propagation and “ 6 degrees” -> reach many 2. Trust solves Cold Start problem – “ just add 1 friend” 3. Trust metrics resistant to copy-profile-attack. – “ you can be similar but if no trust path to you ...” 4. Trust easier to understand and control – trust nets supports Explanation (HCI tests needed) EVIDENCE of 1 and 2 provided by analyzing a REAL, VAST community (Epinions.com)
  • 11. Experiment: Epinions.com 1. Epinions.com' users can – Review and rate items (from 1 to 5) – Keep web of trust (trust=1) and block list (trust=0). – “ Reviewers whose reviews and ratings you have consistently found to be valuable” (Epinions FAQ) 2. Dataset (by crawling site): – ~50K users, ~140K items, ~660K ratings. – ~500K trust statements. • No block list (not shown on site)
  • 12. Epinions' recommendations Taken one user “ ME” , we can - use CF on ratings and compute “ similarity” of other users - use Trust Metric and compute “ trustworthiness” of other users Then we can suggest items liked by similar or trustable users. On how many users are they computable?
  • 13. Statistics (1) #Ratings expressed by Users (#rev<5) = 52.82%! [Cold start users]
  • 14. Statistics (2) #Trust statements expressed by Users (#trust<5) = 70.18%!
  • 15. User Similarity Computability 1. Ideally, every user should be comparable against every other user. 2. BUT ratings sparseness = 99.99135% -> tiny overlapping between 2 users 3. Pearson correlation coefficient meaningful only if overlapping(A,U)>1 4. Question: taken one user, how many users are comparable?
  • 16. US computability (cont.) 1. Taken one user, we computed all the comparable users. – On average an user has 161 comparable users (ideally ~50.000!) 2. We have averaged #comparable_users over users who expressed a certain number of reviews.
  • 17. US computability (cont.) Cold Start Users Ex: users with 40 reviews have ~800 comparable users. BUT users (y axis) are ~50.000! And for Cold Start Users (>50%) this is 2.74
  • 18. Trust computability 1. Trust metrics predict trust in unknown users based on known trust statements. 2. Distance from ME to U is a first measure of Trust computability 3. On average, – In 2 steps, reach 400 users – In 3 steps, reach 4386 users
  • 19. Mean # Reachable Users (in k steps) for users expressing X trust statements In few steps, you can predict trust in every user! Even for Cold Start Users!!!
  • 20. Trust and US computability comparison Mean number of Comparable Mean number of Comparable users for All users users for Cold Start users Propagating Trust Using Propagating Trust Using Dist 1 Dist 2 Dist 3 Dist 4 Pearson Dist 1 Dist 2 Dist 3 Dist 4 Pearson 9.88 400 4386 16334 161 2.14 94.54 1675 9121 2.74
  • 21. Contribution Experimental evidence that – CF is ineffective in real world scenarios • Especially for Cold Start users. – Trust can solve CF problems • Sparseness • Cold Start • Attacks (self-evident) Trust is computable on many more users than user similarity Especially for cold start users (the majority!)
  • 22. Future works 1. US and Trust correlate? Contradict? – US over trusted is higher than usual? 2. Distrust? – Propagation? Properties? 3. Design a Trust Metric (for RS) – Create and evaluate a Trust-aware RS • Input data
  • 23. Thanks for your attention! Questions? Paolo Massa Email: massa@itc.it Blog: http://moloko.itc.it/paoloblog/index.html
  • 24. Collaborative Filtering Similarity measure: Pearson Correlation Coefficient of user a and u m ∑i=1 r a ,i −r a r u ,i −r u  w a , u= ∑ m i=1 r a , i −r u  2 m ∑i=1 r u , i −r u  2 Prediction of rating given by user a to ite n ∑u=1  r u , i −ru ∗w a , u p a , i =r a   n ∑u=1 w a , u
  • 25. Hard Trust and Soft Trust 1. Vocabulary: – Hard Trust: about security, identity of something (user, device, information) • Public key cryptography – Soft Trust: appreciation of some principal (explicitly provided by another principal) • Social Networks and Trust Metrics