SlideShare uma empresa Scribd logo
1 de 1
Baixar para ler offline
1

Probabilistic Collaborative Filtering
with Negative Cross Entropy
1,3

2

3

Alejandro Bellogín , Javier Parapar , Pablo Castells

alejandro.bellogin@uam.es, javierparapar@udc.es, pablo.castells@uam.es
1
Information Access, Centrum Wiskunde & Informatica
2
Information Retrieval Lab, University of A Coruña
3
Information Retrieval Group, Universidad Autónoma de Madrid

Introduction

Relevance Modelling for Recommendation

• Neighbourhood
identification
in
memory-based CF algorithms is based
on selecting those users who are most
similar to the active user according to a
certain similarity metric.
• Hypothesis: neighbour-based CF techniques may be improved by using Relevance Models (RM) from Information Retrieval (IR) to identify such neighbourhoods (and also to weight them).
• Experiments: a relevance-based language model has been introduced into
a neighbour-based CF algorithm which
outperforms other standard techniques in
terms of ranking precision. Further improvements are achieved when we use a
complete probabilistic representation of
the problem.

We decompose the rating prediction task r(u, i) = C
ˆ

v∈Nk (u)

sim(u, v)r(v, i) (*) as:

1. We compute a relevance model for each user in order to capture how relevant any other
user would be as a potential neighbour.
Probability of a neighbour v under the relevance model Ru for a given user u:
p(v|Ru ) =

p(i)p(v|i)
i∈P RS(u)

p(j|i)

(1)

j∈I(u)

where p(i) is the probability of the item i in the collection, p(v|i) is the probability of
the neighbour v given the item i, and p(i|j) is the conditional probability of item i given
another item j. I(u) corresponds to the set of items rated by user u, and P RS(u) ⊂ I(u)
is the subset of items rated by u above some specific threshold.
2. Then we replace the rating prediction by weighted average in CF with the negative cross
entropy (from IR) to incorporate the information learnt from the RM:
r(u, i)
ˆ

=

H(p(·|Ru ); p(·|i)|Nk (u)) =

p(v|Ru ) log p(v|i)

(2)

v∈Nk (u)

Notation: RMUB: Eqs. (1) + (*) and RMCE: Eqs. (1) + (2)

Experiments and Results
Evaluation methodology: TestItems [1] (for each user a ranking is generated by predicting a score for every item in the test set).
Baselines: UB (user-based CF with Pearson’s correlation as similarity measure), NC+P (Normalised Cut (NC) with Pearson similarity [2])
(better performing than a matrix factorization algorithm), UIR (relevance model for log-based CF [6]), URM (rating-based probability
estimations [7]).
RM U Bλ=opt ×

RM CEλ=opt

RM U Bλ=0.5

RM CEλ=0.5 ◦

UB

MovieLens 100K

•

NC + P

0.25

0.2

0.15 ◦

◦

◦

◦

◦

◦

◦

◦

◦

◦

◦

◦

◦

◦

Method
UB
NC+P
UIR
URM
RMUB
RMCE

P@5
nDCG@5 nDCG@10
0.049cd 0.041cd 0.047cd
0.111acde 0.097acde 0.095acde
0.004
0.002
0.002
0.005
0.003
0.018
0.081acd 0.064acd 0.062acd
0.224abcde0.204abcde0.204abcde

P@50
cvg
0.056ce 100%
0.058ce
83%
0.002
100%
ce
0.054
100%
0.050c
60%
0.138abcde100%

P@5

MovieLens 1M
0.1
×
×
0.05 •

×

×

×

×

×

×

•
•

•

•

•

•

•

×
•

×
•

×
•

•
×

•
×

•
×

0
100

200

300

400

500

600

700

k

Method
UB
NC+P
UIR
URM
RMUB
RMCE

Evolution of the performance of the compared methods
in terms of P @5 when varying k on the MovieLens 100K collection.

Conclusions
• RMUB outperforms other state-of-the-art
approaches but is not optimal.
• The complete probabilistic model
(RMCE) achieves even larger improvements.
• Improvement in performance are consistent across different datasets
We have also produced different mappings
the involved variables [5], nevertheless,
more research is still needed on this point.

P@5
nDCG@5 nDCG@10
0.035cd 0.031cd 0.031cd
0.037acd 0.033acd 0.036acd
0.001
0.001
0.001
0.001
0.001
0.006
0.075abcd 0.061abcd 0.057abcd
0.187abcde0.176abcde0.168abcde

P@50
cvg
0.039cde 100%
0.048acde 99%
0.001
100%
0.034c
100%
0.038c
41.4%
0.108abcde 100%

Summary of comparative effectiveness

References
[1] B ELLOGÍN , A., C ASTELLS , P., AND C ANTADOR , I. Precision-oriented evaluation of recommender systems: an
algorithmic comparison. In RecSys (2011)
[2] B ELLOGÍN , A., AND PARAPAR , J. Using graph partitioning techniques for neighbour selection in user-based
collaborative filtering. In RecSys (2012)
[3] L AVRENKO , V., AND C ROFT, W. B. Relevance based language models. In SIGIR (2001)
[4] M C L AUGHLIN , M. R., AND H ERLOCKER , J. L. A collaborative filtering algorithm and evaluation metric that
accurately model the user experience. In SIGIR (2004)
[5] PARAPAR , J., B ELLOGÍN , A., C ASTELLS , P., AND B ARREIRO , A. Relevance-based language modelling for
recommender systems. IPM. 49, 4 (2013)
[6] WANG , J., DE V RIES , A., AND R EINDERS , M. A user-item relevance model for log-based collaborative filtering.
In ECIR (2006)
[7] WANG , J., DE V RIES , A. P., AND R EINDERS , M. J. T. Unified relevance models for rating prediction in collaborative filtering. ACM TOIS 26 (2008)

RecSys 2013, 7th ACM Conference on Recommender Systems. October 12–16, 2013. Hong Kong, China.

Mais conteúdo relacionado

Mais procurados

lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.pptbutest
 
Instance based learning
Instance based learningInstance based learning
Instance based learningswapnac12
 
Graph Matching Unsupervised Domain Adaptation
Graph Matching Unsupervised Domain Adaptation Graph Matching Unsupervised Domain Adaptation
Graph Matching Unsupervised Domain Adaptation Debasmit Das
 
Visualization Techniques for Outlier data
Visualization Techniques for Outlier data Visualization Techniques for Outlier data
Visualization Techniques for Outlier data Seoung-Ho Choi
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemDing Li
 
A new graph-based approach for biometric fusion at hybrid rank-score level
A new graph-based approach for biometric fusion at hybrid rank-score levelA new graph-based approach for biometric fusion at hybrid rank-score level
A new graph-based approach for biometric fusion at hybrid rank-score levelSotiris Mitracos
 
Application of interpolation in CSE
Application of interpolation in CSEApplication of interpolation in CSE
Application of interpolation in CSEMd. Tanvir Hossain
 
ModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_AliModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_AliMDO_Lab
 
Selection K in K-means Clustering
Selection K in K-means ClusteringSelection K in K-means Clustering
Selection K in K-means ClusteringJunghoon Kim
 
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...ijiert bestjournal
 
Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)Ha Phuong
 
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...Bartlomiej Twardowski
 
Interpolation and-its-application
Interpolation and-its-applicationInterpolation and-its-application
Interpolation and-its-applicationApurbo Datta
 
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...IJERA Editor
 

Mais procurados (19)

lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.ppt
 
Instance based learning
Instance based learningInstance based learning
Instance based learning
 
Graph Matching Unsupervised Domain Adaptation
Graph Matching Unsupervised Domain Adaptation Graph Matching Unsupervised Domain Adaptation
Graph Matching Unsupervised Domain Adaptation
 
Visualization Techniques for Outlier data
Visualization Techniques for Outlier data Visualization Techniques for Outlier data
Visualization Techniques for Outlier data
 
Ijetr042101
Ijetr042101Ijetr042101
Ijetr042101
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
A new graph-based approach for biometric fusion at hybrid rank-score level
A new graph-based approach for biometric fusion at hybrid rank-score levelA new graph-based approach for biometric fusion at hybrid rank-score level
A new graph-based approach for biometric fusion at hybrid rank-score level
 
AROPUB-IJPGE-14-30
AROPUB-IJPGE-14-30AROPUB-IJPGE-14-30
AROPUB-IJPGE-14-30
 
Application of interpolation in CSE
Application of interpolation in CSEApplication of interpolation in CSE
Application of interpolation in CSE
 
ModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_AliModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_Ali
 
Selection K in K-means Clustering
Selection K in K-means ClusteringSelection K in K-means Clustering
Selection K in K-means Clustering
 
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
 
Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)
 
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...
 
Interpolation and-its-application
Interpolation and-its-applicationInterpolation and-its-application
Interpolation and-its-application
 
Av33274282
Av33274282Av33274282
Av33274282
 
Cukoo srch
Cukoo srchCukoo srch
Cukoo srch
 
Nearest neighbors
Nearest neighborsNearest neighbors
Nearest neighbors
 
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
 

Destaque

Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Alejandro Bellogin
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationAlejandro Bellogin
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013Alejandro Bellogin
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationAlejandro Bellogin
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013Alejandro Bellogin
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsAlejandro Bellogin
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringChangsung Moon
 

Destaque (7)

Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 

Semelhante a Probabilistic Collaborative Filtering with Negative Cross Entropy

A comparison of SIFT, PCA-SIFT and SURF
A comparison of SIFT, PCA-SIFT and SURFA comparison of SIFT, PCA-SIFT and SURF
A comparison of SIFT, PCA-SIFT and SURFCSCJournals
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringAllenWu
 
HOP-Rec_RecSys18
HOP-Rec_RecSys18HOP-Rec_RecSys18
HOP-Rec_RecSys18Matt Yang
 
Lec14: Evaluation Framework for Medical Image Segmentation
Lec14: Evaluation Framework for Medical Image SegmentationLec14: Evaluation Framework for Medical Image Segmentation
Lec14: Evaluation Framework for Medical Image SegmentationUlaş Bağcı
 
RBHF_SDM_2011_Jie
RBHF_SDM_2011_JieRBHF_SDM_2011_Jie
RBHF_SDM_2011_JieMDO_Lab
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Daniel Valcarce
 
LCBM: Statistics-Based Parallel Collaborative Filtering
LCBM: Statistics-Based Parallel Collaborative FilteringLCBM: Statistics-Based Parallel Collaborative Filtering
LCBM: Statistics-Based Parallel Collaborative FilteringFabio Petroni, PhD
 
STATISTICAL METHOD OF QSAR
STATISTICAL METHOD OF QSARSTATISTICAL METHOD OF QSAR
STATISTICAL METHOD OF QSARRaniBhagat1
 
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...ijcsit
 
Calibration of Environmental Sensor Data Using a Linear Regression Technique
Calibration of Environmental Sensor Data Using a Linear Regression TechniqueCalibration of Environmental Sensor Data Using a Linear Regression Technique
Calibration of Environmental Sensor Data Using a Linear Regression Techniqueijtsrd
 
Tutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsTutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsYONG ZHENG
 
LSH for
 Prediction Problem in Recommendation
LSH for
 Prediction Problem in RecommendationLSH for
 Prediction Problem in Recommendation
LSH for
 Prediction Problem in RecommendationMaruf Aytekin
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...IJAEMSJORNAL
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsAladejubelo Oluwashina
 
Partha Sengupta_structural analysis.pptx
Partha Sengupta_structural analysis.pptxPartha Sengupta_structural analysis.pptx
Partha Sengupta_structural analysis.pptxJimmyPhoenix2
 
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...Daniel Valcarce
 
Sensitivity analysis in a lidar camera calibration
Sensitivity analysis in a lidar camera calibrationSensitivity analysis in a lidar camera calibration
Sensitivity analysis in a lidar camera calibrationcsandit
 

Semelhante a Probabilistic Collaborative Filtering with Negative Cross Entropy (20)

A comparison of SIFT, PCA-SIFT and SURF
A comparison of SIFT, PCA-SIFT and SURFA comparison of SIFT, PCA-SIFT and SURF
A comparison of SIFT, PCA-SIFT and SURF
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clustering
 
HOP-Rec_RecSys18
HOP-Rec_RecSys18HOP-Rec_RecSys18
HOP-Rec_RecSys18
 
Lec14: Evaluation Framework for Medical Image Segmentation
Lec14: Evaluation Framework for Medical Image SegmentationLec14: Evaluation Framework for Medical Image Segmentation
Lec14: Evaluation Framework for Medical Image Segmentation
 
RBHF_SDM_2011_Jie
RBHF_SDM_2011_JieRBHF_SDM_2011_Jie
RBHF_SDM_2011_Jie
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
 
LCBM: Statistics-Based Parallel Collaborative Filtering
LCBM: Statistics-Based Parallel Collaborative FilteringLCBM: Statistics-Based Parallel Collaborative Filtering
LCBM: Statistics-Based Parallel Collaborative Filtering
 
STATISTICAL METHOD OF QSAR
STATISTICAL METHOD OF QSARSTATISTICAL METHOD OF QSAR
STATISTICAL METHOD OF QSAR
 
lcr
lcrlcr
lcr
 
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
 
Building the Professional of 2020: An Approach to Business Change Process Int...
Building the Professional of 2020: An Approach to Business Change Process Int...Building the Professional of 2020: An Approach to Business Change Process Int...
Building the Professional of 2020: An Approach to Business Change Process Int...
 
Calibration of Environmental Sensor Data Using a Linear Regression Technique
Calibration of Environmental Sensor Data Using a Linear Regression TechniqueCalibration of Environmental Sensor Data Using a Linear Regression Technique
Calibration of Environmental Sensor Data Using a Linear Regression Technique
 
Tutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsTutorial: Context In Recommender Systems
Tutorial: Context In Recommender Systems
 
LSH for
 Prediction Problem in Recommendation
LSH for
 Prediction Problem in RecommendationLSH for
 Prediction Problem in Recommendation
LSH for
 Prediction Problem in Recommendation
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender Systems
 
Partha Sengupta_structural analysis.pptx
Partha Sengupta_structural analysis.pptxPartha Sengupta_structural analysis.pptx
Partha Sengupta_structural analysis.pptx
 
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
 
FinalReport
FinalReportFinalReport
FinalReport
 
Sensitivity analysis in a lidar camera calibration
Sensitivity analysis in a lidar camera calibrationSensitivity analysis in a lidar camera calibration
Sensitivity analysis in a lidar camera calibration
 

Mais de Alejandro Bellogin

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Alejandro Bellogin
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosAlejandro Bellogin
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsAlejandro Bellogin
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...Alejandro Bellogin
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsAlejandro Bellogin
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Alejandro Bellogin
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 
Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesAlejandro Bellogin
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamAlejandro Bellogin
 
Predicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - PosterPredicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - PosterAlejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 

Mais de Alejandro Bellogin (14)

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenarios
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systems
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender Systems
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 
Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - Slides
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slam
 
Predicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - PosterPredicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - Poster
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 

Último

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 

Último (20)

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Probabilistic Collaborative Filtering with Negative Cross Entropy

  • 1. 1 Probabilistic Collaborative Filtering with Negative Cross Entropy 1,3 2 3 Alejandro Bellogín , Javier Parapar , Pablo Castells alejandro.bellogin@uam.es, javierparapar@udc.es, pablo.castells@uam.es 1 Information Access, Centrum Wiskunde & Informatica 2 Information Retrieval Lab, University of A Coruña 3 Information Retrieval Group, Universidad Autónoma de Madrid Introduction Relevance Modelling for Recommendation • Neighbourhood identification in memory-based CF algorithms is based on selecting those users who are most similar to the active user according to a certain similarity metric. • Hypothesis: neighbour-based CF techniques may be improved by using Relevance Models (RM) from Information Retrieval (IR) to identify such neighbourhoods (and also to weight them). • Experiments: a relevance-based language model has been introduced into a neighbour-based CF algorithm which outperforms other standard techniques in terms of ranking precision. Further improvements are achieved when we use a complete probabilistic representation of the problem. We decompose the rating prediction task r(u, i) = C ˆ v∈Nk (u) sim(u, v)r(v, i) (*) as: 1. We compute a relevance model for each user in order to capture how relevant any other user would be as a potential neighbour. Probability of a neighbour v under the relevance model Ru for a given user u: p(v|Ru ) = p(i)p(v|i) i∈P RS(u) p(j|i) (1) j∈I(u) where p(i) is the probability of the item i in the collection, p(v|i) is the probability of the neighbour v given the item i, and p(i|j) is the conditional probability of item i given another item j. I(u) corresponds to the set of items rated by user u, and P RS(u) ⊂ I(u) is the subset of items rated by u above some specific threshold. 2. Then we replace the rating prediction by weighted average in CF with the negative cross entropy (from IR) to incorporate the information learnt from the RM: r(u, i) ˆ = H(p(·|Ru ); p(·|i)|Nk (u)) = p(v|Ru ) log p(v|i) (2) v∈Nk (u) Notation: RMUB: Eqs. (1) + (*) and RMCE: Eqs. (1) + (2) Experiments and Results Evaluation methodology: TestItems [1] (for each user a ranking is generated by predicting a score for every item in the test set). Baselines: UB (user-based CF with Pearson’s correlation as similarity measure), NC+P (Normalised Cut (NC) with Pearson similarity [2]) (better performing than a matrix factorization algorithm), UIR (relevance model for log-based CF [6]), URM (rating-based probability estimations [7]). RM U Bλ=opt × RM CEλ=opt RM U Bλ=0.5 RM CEλ=0.5 ◦ UB MovieLens 100K • NC + P 0.25 0.2 0.15 ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ Method UB NC+P UIR URM RMUB RMCE P@5 nDCG@5 nDCG@10 0.049cd 0.041cd 0.047cd 0.111acde 0.097acde 0.095acde 0.004 0.002 0.002 0.005 0.003 0.018 0.081acd 0.064acd 0.062acd 0.224abcde0.204abcde0.204abcde P@50 cvg 0.056ce 100% 0.058ce 83% 0.002 100% ce 0.054 100% 0.050c 60% 0.138abcde100% P@5 MovieLens 1M 0.1 × × 0.05 • × × × × × × • • • • • • • × • × • × • • × • × • × 0 100 200 300 400 500 600 700 k Method UB NC+P UIR URM RMUB RMCE Evolution of the performance of the compared methods in terms of P @5 when varying k on the MovieLens 100K collection. Conclusions • RMUB outperforms other state-of-the-art approaches but is not optimal. • The complete probabilistic model (RMCE) achieves even larger improvements. • Improvement in performance are consistent across different datasets We have also produced different mappings the involved variables [5], nevertheless, more research is still needed on this point. P@5 nDCG@5 nDCG@10 0.035cd 0.031cd 0.031cd 0.037acd 0.033acd 0.036acd 0.001 0.001 0.001 0.001 0.001 0.006 0.075abcd 0.061abcd 0.057abcd 0.187abcde0.176abcde0.168abcde P@50 cvg 0.039cde 100% 0.048acde 99% 0.001 100% 0.034c 100% 0.038c 41.4% 0.108abcde 100% Summary of comparative effectiveness References [1] B ELLOGÍN , A., C ASTELLS , P., AND C ANTADOR , I. Precision-oriented evaluation of recommender systems: an algorithmic comparison. In RecSys (2011) [2] B ELLOGÍN , A., AND PARAPAR , J. Using graph partitioning techniques for neighbour selection in user-based collaborative filtering. In RecSys (2012) [3] L AVRENKO , V., AND C ROFT, W. B. Relevance based language models. In SIGIR (2001) [4] M C L AUGHLIN , M. R., AND H ERLOCKER , J. L. A collaborative filtering algorithm and evaluation metric that accurately model the user experience. In SIGIR (2004) [5] PARAPAR , J., B ELLOGÍN , A., C ASTELLS , P., AND B ARREIRO , A. Relevance-based language modelling for recommender systems. IPM. 49, 4 (2013) [6] WANG , J., DE V RIES , A., AND R EINDERS , M. A user-item relevance model for log-based collaborative filtering. In ECIR (2006) [7] WANG , J., DE V RIES , A. P., AND R EINDERS , M. J. T. Unified relevance models for rating prediction in collaborative filtering. ACM TOIS 26 (2008) RecSys 2013, 7th ACM Conference on Recommender Systems. October 12–16, 2013. Hong Kong, China.