SlideShare a Scribd company logo
1 of 13
Download to read offline
1
                                       ITWP2011, Barcelona




Using Social- and Pseudo-
Social Networks to Improve
Recommendation Quality
Alan Said, Ernesto W. De Luca, Sahin Albayrak
2




Abstract
 The accumulated amount of data in the digital universe
  reached 1.2 Zettabytes (1 billion terabytes) in 2010.
   50% increase since 2008.
 Websites increasingly accumulate a wider variety of data on
  their users
   Without necessarily using it


 This paper: how can this data be used to improve
  recommendation
3




Outline
 Introduction
   Recommender Systems
   Problem statement
 Dataset
   Statistics
   Social and Pseudo-Social networks
 Approach
 Results
4




Introduction
 IMDb, one of the first online recommender systems, turned
  20 on October 17th 2010.

 Ever since their beginning, recommender systems have,
  through relatively simple techniques, produced
  recommendations for their users

 Today’s online systems contain more information about their
  users, we should use that information.
   Which information is important?
5




The Problem
• What to do with the heaps of information available?
   •   What and how to use in order to improve, or learn how to
       improve recommendations

 • How should we treat
       •   Friendships?
       •   Comments?
       •   Idols?
       •   common interests?
 • How important are these in terms of recommendation
   quality?
6




Dataset
 From the movie domain – Moviepilot.de
     Germany’s largest movie recommendation community
     1M+ users
     13M ratings
     50K movies

 Subset used here
     10, 000 randomly selected users with minimum 30 ratings
     1.5M ratings
     50, 000 comments
     4, 000 friendships
     170, 000 idols
     25, 000 ”diggs”
7




Social- and Pseudo-Social
networks
 Social networks
     Explicit statements of friendship between users

 Pseudo social networks
     Users commenting on the same movie
     Users being fans of the same people
     Users ”digging” the same news articles, trailers, etc.

   38% of ratings performed by users with friends
   45% of ratings performed by users with comments
   77% of ratings performed by users who are fans
   29% of ratings performed by users who ”digg”
8




The Approach
 Augmentig k-Nearest Neighbor neighborhoods by using
  information from (pseudo) social networks

   Using standard Pearson Similarity
    Increasing the similarity of users in the same networks in order to add
     them to the neighborhood
9




The Approach




    Standard neighborhood   Augmented neighborhood
10




Motivation
 Similarity metrics (Pearson, Jaccard, etc) are based on co-
  ratings
   Popular items often heighten similarities without adding ”value”
    e.g. movies like ”The Matrix” and ”The Lord of The Rings” often
    have similar (high) ratings, even if users do not share taste
   Adding importance to users who share other interests filters out
    some of the effects of popular items.
11




Results
10

 9

 8

 7

 6

 5                                           MAP
                                             P@10
 4

 3

 2

 1

 0
     Friendships   Comments   Fans   Diggs
12




Conclusion
 Social and interaction (co-commenting, etc) networks seem
  to hold more information than standard CF is able to identify
 Similarity metrics do not always tell the complete truth

 ToDo’s:
   Find items that are important for establishing similarity between
    users
   Investigate what other information can be used for measuring
    similarities
13




Questions?


             Thank you!

More Related Content

Viewers also liked

[Stp]canon 550 d
[Stp]canon 550 d[Stp]canon 550 d
[Stp]canon 550 d
Jihun Hyun
 

Viewers also liked (14)

[Stp]canon 550 d
[Stp]canon 550 d[Stp]canon 550 d
[Stp]canon 550 d
 
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesPersonalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
 
RecSysChallenge Opening
RecSysChallenge OpeningRecSysChallenge Opening
RecSysChallenge Opening
 
Users and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender SystemsUsers and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender Systems
 
Inferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceInferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender Performance
 
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
 
CaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationCaRR 2012 Opening Presentation
CaRR 2012 Opening Presentation
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsA Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
 
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
 
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
 
State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 

Similar to Using Social- and Pseudo-Social Networks to Improve Recommendation Quality

Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
Davide Feltoni Gurini
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
Dave King
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
idoguy
 

Similar to Using Social- and Pseudo-Social Networks to Improve Recommendation Quality (20)

Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter Design
 
Netnography
NetnographyNetnography
Netnography
 
Social Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionSocial Media Analytics: The Value Proposition
Social Media Analytics: The Value Proposition
 
Increasing Social Media ROI Using Gladwell's Tipping Point Framework
Increasing Social Media ROI Using Gladwell's Tipping Point FrameworkIncreasing Social Media ROI Using Gladwell's Tipping Point Framework
Increasing Social Media ROI Using Gladwell's Tipping Point Framework
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
 
CIDR 2009: Jeff Heer Keynote
CIDR 2009: Jeff Heer KeynoteCIDR 2009: Jeff Heer Keynote
CIDR 2009: Jeff Heer Keynote
 
Facebook's impact on ideological segregation
Facebook's impact on ideological segregationFacebook's impact on ideological segregation
Facebook's impact on ideological segregation
 
Hosanagar Supernova 2008
Hosanagar Supernova 2008Hosanagar Supernova 2008
Hosanagar Supernova 2008
 
Univ. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital StrategiesUniv. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital Strategies
 
Social Recommender Systems
Social Recommender SystemsSocial Recommender Systems
Social Recommender Systems
 
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
 
How to use LLMs for creating a content-based recommendation system for entert...
How to use LLMs for creating a content-based recommendation system for entert...How to use LLMs for creating a content-based recommendation system for entert...
How to use LLMs for creating a content-based recommendation system for entert...
 
ASTD_TU119_Web2.0_Social_Media_Learning_Japan
ASTD_TU119_Web2.0_Social_Media_Learning_Japan ASTD_TU119_Web2.0_Social_Media_Learning_Japan
ASTD_TU119_Web2.0_Social_Media_Learning_Japan
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
 
Collaborative Filtering
Collaborative FilteringCollaborative Filtering
Collaborative Filtering
 
Social Media Strategy And Metrics
Social Media Strategy And MetricsSocial Media Strategy And Metrics
Social Media Strategy And Metrics
 
Social Network Analysis (SNA) and its implications for knowledge discovery in...
Social Network Analysis (SNA) and its implications for knowledge discovery in...Social Network Analysis (SNA) and its implications for knowledge discovery in...
Social Network Analysis (SNA) and its implications for knowledge discovery in...
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Using Social- and Pseudo-Social Networks to Improve Recommendation Quality

  • 1. 1 ITWP2011, Barcelona Using Social- and Pseudo- Social Networks to Improve Recommendation Quality Alan Said, Ernesto W. De Luca, Sahin Albayrak
  • 2. 2 Abstract  The accumulated amount of data in the digital universe reached 1.2 Zettabytes (1 billion terabytes) in 2010.  50% increase since 2008.  Websites increasingly accumulate a wider variety of data on their users  Without necessarily using it  This paper: how can this data be used to improve recommendation
  • 3. 3 Outline  Introduction  Recommender Systems  Problem statement  Dataset  Statistics  Social and Pseudo-Social networks  Approach  Results
  • 4. 4 Introduction  IMDb, one of the first online recommender systems, turned 20 on October 17th 2010.  Ever since their beginning, recommender systems have, through relatively simple techniques, produced recommendations for their users  Today’s online systems contain more information about their users, we should use that information.  Which information is important?
  • 5. 5 The Problem • What to do with the heaps of information available? • What and how to use in order to improve, or learn how to improve recommendations • How should we treat • Friendships? • Comments? • Idols? • common interests? • How important are these in terms of recommendation quality?
  • 6. 6 Dataset  From the movie domain – Moviepilot.de  Germany’s largest movie recommendation community  1M+ users  13M ratings  50K movies  Subset used here  10, 000 randomly selected users with minimum 30 ratings  1.5M ratings  50, 000 comments  4, 000 friendships  170, 000 idols  25, 000 ”diggs”
  • 7. 7 Social- and Pseudo-Social networks  Social networks  Explicit statements of friendship between users  Pseudo social networks  Users commenting on the same movie  Users being fans of the same people  Users ”digging” the same news articles, trailers, etc.  38% of ratings performed by users with friends  45% of ratings performed by users with comments  77% of ratings performed by users who are fans  29% of ratings performed by users who ”digg”
  • 8. 8 The Approach  Augmentig k-Nearest Neighbor neighborhoods by using information from (pseudo) social networks  Using standard Pearson Similarity  Increasing the similarity of users in the same networks in order to add them to the neighborhood
  • 9. 9 The Approach Standard neighborhood Augmented neighborhood
  • 10. 10 Motivation  Similarity metrics (Pearson, Jaccard, etc) are based on co- ratings  Popular items often heighten similarities without adding ”value” e.g. movies like ”The Matrix” and ”The Lord of The Rings” often have similar (high) ratings, even if users do not share taste  Adding importance to users who share other interests filters out some of the effects of popular items.
  • 11. 11 Results 10 9 8 7 6 5 MAP P@10 4 3 2 1 0 Friendships Comments Fans Diggs
  • 12. 12 Conclusion  Social and interaction (co-commenting, etc) networks seem to hold more information than standard CF is able to identify  Similarity metrics do not always tell the complete truth  ToDo’s:  Find items that are important for establishing similarity between users  Investigate what other information can be used for measuring similarities
  • 13. 13 Questions? Thank you!