SlideShare a Scribd company logo
1 of 2
Download to read offline
Top-N Recommender Systems: Revisiting Item Neighborhood Methods
George Karypis
Department of Computer Science & Engineering
University of Minnesota
karypis@cs.umn.edu
http://www.cs.umn.edu/~karypis


Abstract
Top-N recommender systems are designed to generate a ranked list of items that a user will find
useful based on the user’s prior activity. These systems have become ubiquitous and are an
essential tool for information filtering and (e-)commerce. Over the years, collaborative filtering,
which derive these recommendations by leveraging past activities of groups of users, has
emerged as the most prominent approach for solving this problem. Among the multitude of
methods that have been developed, item-based nearest neighbor algorithms are among the
simplest and yet best-performing methods for Top-N recommender systems. These methods
rank the items to be recommended based on how similar they are to the items in a user’s prior
activity history, using various co-occurrence similarity measures.
In this talk we present our recent work in these item-based neighborhood methods that has
substantially improved the accuracy of the predictions. One shortcoming of traditional item-
based neighborhood methods is that they rely on a similarity measure that needs to be specified
a priori. To address this problem we developed a class of item-based neighborhood methods
that directly estimate from the training data a sparse item-item similarity matrix. This similarity
matrix is estimated using a structural equation modeling (SEM) framework, which requires each
column of the user-item matrix to be approximated as a sparse aggregation of some other
columns. These other columns correspond to the learned neighbors and their aggregation
weights to the learned similarities. A second shortcoming of item-based neighborhood methods
is that the item-item similarity measures rely on co-occurrences, which become problematic
when the datasets are very sparse and the number of items pairs with sufficiently many co-
occurrences is small. To address this problem we extended the SEM framework to estimate a
factored version of the item-item similarity matrix. This factored representation projects the
items in a lower dimensional space, which allows for meaningful similarity estimates between
items that never co-occurred in the original user-item matrix. In addition to the above, we also
discuss and present result from our work to enhance the above SEM-models by incorporating
item side information to further improve the Top-N recommendation accuracy and to also
address the item cold-start recommendation problem.

Bio
George Karypis is a professor at the Department of Computer Science & Engineering at the
University of Minnesota, Twin Cities. His research interests spans the areas of data mining,
bioinformatics, cheminformatics, high performance computing, information retrieval,
collaborative filtering, and scientific computing. His research has resulted in the development of
software libraries for serial and parallel graph partitioning (METIS and ParMETIS), hypergraph
partitioning (hMETIS), for parallel Cholesky factorization (PSPASES), for collaborative filtering-
based recommendation algorithms (SUGGEST), clustering high dimensional datasets (CLUTO),
finding frequent patterns in diverse datasets (PAFI), and for protein secondary structure
prediction (YASSPP). He has coauthored over 200 papers on these topics and a book title
“Introduction to Parallel Computing” (Publ. Addison Wesley, 2003, 2nd edition). In addition, he is
serving on the program committees of many conferences and workshops on these topics, and
on the editorial boards of the IEEE Transactions on Knowledge and Data Engineering, Social
Network Analysis and Data Mining Journal, International Journal of Data Mining and
Bioinformatics, the journal on Current Proteomics, Advances in Bioinformatics, and Biomedicine
and Biotechnology.

More Related Content

What's hot

Interpreting sslar
Interpreting sslarInterpreting sslar
Interpreting sslarRatzman III
 
Dataset-driven research to improve TEL recommender systems
Dataset-driven research to improve TEL recommender systemsDataset-driven research to improve TEL recommender systems
Dataset-driven research to improve TEL recommender systemsKatrien Verbert
 
Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationAlan Said
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.Giuseppe Ricci
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and TextNBER
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation SystemAnamta Sayyed
 
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
 
A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectioncsandit
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterElena Simperl
 
Contractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACContractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACwebuploader
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Vala Ali Rohani
 
Algorithms of Online Platforms and Networks
Algorithms of Online Platforms and NetworksAlgorithms of Online Platforms and Networks
Algorithms of Online Platforms and NetworksAnsgar Koene
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsBrowse Jobs
 

What's hot (20)

Analytical Tools Primer
Analytical Tools PrimerAnalytical Tools Primer
Analytical Tools Primer
 
Interpreting sslar
Interpreting sslarInterpreting sslar
Interpreting sslar
 
Dataset-driven research to improve TEL recommender systems
Dataset-driven research to improve TEL recommender systemsDataset-driven research to improve TEL recommender systems
Dataset-driven research to improve TEL recommender systems
 
Sub1579
Sub1579Sub1579
Sub1579
 
Data Models
Data ModelsData Models
Data Models
 
Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System Evaluation
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
PhD defense
PhD defense PhD defense
PhD defense
 
Data models
Data modelsData models
Data models
 
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles Using Data Analytics for Behavioral Dissertation Resear...
 
A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detection
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on Twitter
 
Contractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACContractor-Borner-SNA-SAC
Contractor-Borner-SNA-SAC
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)
 
Algorithms of Online Platforms and Networks
Algorithms of Online Platforms and NetworksAlgorithms of Online Platforms and Networks
Algorithms of Online Platforms and Networks
 
Data model
Data modelData model
Data model
 
Phd thesis final presentation
Phd thesis   final presentationPhd thesis   final presentation
Phd thesis final presentation
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systems
 

Similar to George

Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...IRJET Journal
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015Journal For Research
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...IJTET Journal
 
Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...IOSR Journals
 
25.ranking on data manifold with sink points
25.ranking on data manifold with sink points25.ranking on data manifold with sink points
25.ranking on data manifold with sink pointsVenkatesh Neerukonda
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...Editor IJCATR
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMIJERA Editor
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstractsbutest
 
A Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender SystemA Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender SystemAndrew Parish
 
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...IJCI JOURNAL
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docxKevalKabariya
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Editor IJAIEM
 
Context Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsContext Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsIJCSIS Research Publications
 
The application of data mining to recommender systems
The application of data mining to recommender systems The application of data mining to recommender systems
The application of data mining to recommender systems sunsine123
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptxprathammishra28
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf13DikshaDatir
 
FIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemFIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemIJTET Journal
 
Fuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender SystemFuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender SystemRSIS International
 

Similar to George (20)

B1802021823
B1802021823B1802021823
B1802021823
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
 
Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...
 
25.ranking on data manifold with sink points
25.ranking on data manifold with sink points25.ranking on data manifold with sink points
25.ranking on data manifold with sink points
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMM
 
At4102337341
At4102337341At4102337341
At4102337341
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
 
A Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender SystemA Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender System
 
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docx
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
 
Context Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsContext Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word Pairs
 
The application of data mining to recommender systems
The application of data mining to recommender systems The application of data mining to recommender systems
The application of data mining to recommender systems
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
 
FIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemFIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation System
 
Fuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender SystemFuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender System
 

George

  • 1. Top-N Recommender Systems: Revisiting Item Neighborhood Methods George Karypis Department of Computer Science & Engineering University of Minnesota karypis@cs.umn.edu http://www.cs.umn.edu/~karypis Abstract Top-N recommender systems are designed to generate a ranked list of items that a user will find useful based on the user’s prior activity. These systems have become ubiquitous and are an essential tool for information filtering and (e-)commerce. Over the years, collaborative filtering, which derive these recommendations by leveraging past activities of groups of users, has emerged as the most prominent approach for solving this problem. Among the multitude of methods that have been developed, item-based nearest neighbor algorithms are among the simplest and yet best-performing methods for Top-N recommender systems. These methods rank the items to be recommended based on how similar they are to the items in a user’s prior activity history, using various co-occurrence similarity measures. In this talk we present our recent work in these item-based neighborhood methods that has substantially improved the accuracy of the predictions. One shortcoming of traditional item- based neighborhood methods is that they rely on a similarity measure that needs to be specified a priori. To address this problem we developed a class of item-based neighborhood methods that directly estimate from the training data a sparse item-item similarity matrix. This similarity matrix is estimated using a structural equation modeling (SEM) framework, which requires each column of the user-item matrix to be approximated as a sparse aggregation of some other columns. These other columns correspond to the learned neighbors and their aggregation weights to the learned similarities. A second shortcoming of item-based neighborhood methods is that the item-item similarity measures rely on co-occurrences, which become problematic when the datasets are very sparse and the number of items pairs with sufficiently many co- occurrences is small. To address this problem we extended the SEM framework to estimate a factored version of the item-item similarity matrix. This factored representation projects the items in a lower dimensional space, which allows for meaningful similarity estimates between items that never co-occurred in the original user-item matrix. In addition to the above, we also discuss and present result from our work to enhance the above SEM-models by incorporating item side information to further improve the Top-N recommendation accuracy and to also address the item cold-start recommendation problem. Bio George Karypis is a professor at the Department of Computer Science & Engineering at the University of Minnesota, Twin Cities. His research interests spans the areas of data mining, bioinformatics, cheminformatics, high performance computing, information retrieval, collaborative filtering, and scientific computing. His research has resulted in the development of software libraries for serial and parallel graph partitioning (METIS and ParMETIS), hypergraph partitioning (hMETIS), for parallel Cholesky factorization (PSPASES), for collaborative filtering- based recommendation algorithms (SUGGEST), clustering high dimensional datasets (CLUTO), finding frequent patterns in diverse datasets (PAFI), and for protein secondary structure prediction (YASSPP). He has coauthored over 200 papers on these topics and a book title “Introduction to Parallel Computing” (Publ. Addison Wesley, 2003, 2nd edition). In addition, he is
  • 2. serving on the program committees of many conferences and workshops on these topics, and on the editorial boards of the IEEE Transactions on Knowledge and Data Engineering, Social Network Analysis and Data Mining Journal, International Journal of Data Mining and Bioinformatics, the journal on Current Proteomics, Advances in Bioinformatics, and Biomedicine and Biotechnology.