Poster for the ECIR 2015 short paper:
Daniel Valcarce, Javier Parapar, Alvaro Barreiro: A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems. ECIR 2015: 346-351
http://dx.doi.org/10.1007/978-3-319-16354-3_38
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems
Daniel Valcarce, Javier Parapar, Álvaro Barreiro
{daniel.valcarce, javierparapar, barreiro}@udc.es – http://www.irlab.org
Information Retrieval Lab, Computer Science Department, University of A Coruña
Overview
Language Models have traditionally been used in several fields such as speech recognition and document retrieval. Recently, Relevance-Based Language Models have been extended to Collaborative Filtering Recommender Systems [1]. In this setting, a Relevance Model is estimated for each user based on the probabilities of the items. As has been thoroughly studied, smoothing plays a key role in the estimation of a Language Model [2]. Our aim in this work is to study smoothing methods in the context of Collaborative Filtering Recommender Systems.
RM for Recommendation
IR       →  RecSys
Query    →  Target user
Document →  Neighbour
Term     →  Item
RM1: p(i|R_u) \propto \sum_{v \in V_u} p(v)\, p(i|v) \prod_{j \in I_u} p(j|v)    (1)

RM2: p(i|R_u) \propto p(i) \prod_{j \in I_u} \sum_{v \in V_u} \frac{p(i|v)\, p(v)}{p(i)}\, p(j|v)    (2)
• I_u is the set of items rated by the user u
• V_u is the set of neighbours of the user u
• p(i) and p(v) are considered uniform
• p(i|u) is computed by smoothing the maximum-likelihood estimate p_{ml}(i|u) = r_{u,i} / \sum_{j \in I_u} r_{u,j}
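The RM2 scoring rule above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: p(v) and p(i) are taken uniform as stated above, and the smoothed neighbour models p(i|v) are hypothetical values supplied by hand.

```python
from math import prod

# Hypothetical smoothed neighbour models p(i|v); values are illustrative only.
p_item_given_v = {
    "v1": {"i1": 0.5, "i2": 0.3, "i3": 0.2},
    "v2": {"i1": 0.2, "i2": 0.6, "i3": 0.2},
}

def rm2_score(i, rated_items, neighbours, p_iv, n_items):
    """RM2 relevance score (Eq. 2) with uniform p(v) and p(i)."""
    p_v = 1.0 / len(neighbours)   # uniform neighbour prior
    p_i = 1.0 / n_items           # uniform item prior
    return p_i * prod(
        sum(p_iv[v].get(i, 0.0) * p_v / p_i * p_iv[v].get(j, 0.0)
            for v in neighbours)
        for j in rated_items
    )

# Rank unseen items for a user whose profile is I_u = {"i2"}.
scores = {i: rm2_score(i, ["i2"], ["v1", "v2"], p_item_given_v, 3)
          for i in ("i1", "i3")}
```

Candidate items would then be ranked by decreasing score; here "i1" outranks "i3" because both neighbours assign it more probability mass.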
Smoothing methods
Smoothing deals with data sparsity and plays a role similar to the IDF by using a background model:

p(i|C) = \frac{\sum_{v \in U} r_{v,i}}{\sum_{j \in I} \sum_{v \in U} r_{v,j}}
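The background model is just the ratings mass of an item over the total ratings mass in the collection. A minimal sketch, assuming a hypothetical user → {item: rating} dictionary:

```python
# Hypothetical ratings matrix: user -> {item: rating}; values are illustrative.
ratings = {
    "u1": {"i1": 5, "i2": 3},
    "u2": {"i2": 4, "i3": 2},
}

def p_background(item, ratings):
    """Background model p(i|C): total rating mass of the item
    divided by the total rating mass of the collection."""
    num = sum(profile.get(item, 0) for profile in ratings.values())
    den = sum(sum(profile.values()) for profile in ratings.values())
    return num / den

print(p_background("i2", ratings))  # (3 + 4) / (5 + 3 + 4 + 2) = 0.5
```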
Jelinek-Mercer (JM) Linear interpolation. Parameter λ.

p_\lambda(i|u) = (1 - \lambda)\, p_{ml}(i|u) + \lambda\, p(i|C)    (3)
Dirichlet Priors (DP) Bayesian analysis. Parameter µ.

p_\mu(i|u) = \frac{r_{u,i} + \mu\, p(i|C)}{\mu + \sum_{j \in I_u} r_{u,j}}    (4)
Absolute Discounting (AD) Subtract a constant δ.

p_\delta(i|u) = \frac{\max(r_{u,i} - \delta, 0) + \delta\, |I_u|\, p(i|C)}{\sum_{j \in I_u} r_{u,j}}    (5)
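The three estimates (3)-(5) translate directly into code. A minimal sketch, with illustrative argument names (r_ui is the rating of item i by user u, user_total is \sum_{j \in I_u} r_{u,j}, p_c is the background probability p(i|C)):

```python
def p_jm(r_ui, user_total, p_c, lam):
    """Jelinek-Mercer (Eq. 3): linear interpolation, parameter lambda."""
    return (1 - lam) * (r_ui / user_total) + lam * p_c

def p_dp(r_ui, user_total, p_c, mu):
    """Dirichlet priors (Eq. 4): Bayesian smoothing, parameter mu."""
    return (r_ui + mu * p_c) / (mu + user_total)

def p_ad(r_ui, user_total, p_c, delta, profile_size):
    """Absolute discounting (Eq. 5): subtract delta from each rating
    and redistribute the discounted mass via the background model."""
    return (max(r_ui - delta, 0) + delta * profile_size * p_c) / user_total

# Example: r_ui = 5, user's ratings sum to 10, p(i|C) = 0.1.
print(p_jm(5, 10, 0.1, lam=0.5))              # 0.3
print(p_dp(5, 10, 0.1, mu=100))               # 15/110 ≈ 0.136
print(p_ad(5, 10, 0.1, delta=1.0, profile_size=3))  # 0.43
```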
Experiments
[Figure] Precision at 5 (P@5) of the RM1 and RM2 algorithms using Absolute Discounting (AD), Jelinek-Mercer (JM) and Dirichlet priors (DP) smoothing methods on the MovieLens 100k dataset, as a function of the smoothing parameter (λ/δ in [0, 1]; µ in [0, 1000]).

[Figure] Precision at 5 (P@5) of the RM2 algorithm using AD on the MovieLens 1M dataset, varying the smoothing intensity δ and the number of ratings in the user profiles.
Conclusions
• There are no big differences in optimal precision among the studied smoothing techniques.
• Dirichlet priors and, especially, Jelinek-Mercer suffer a significant decrease in precision when a high amount of smoothing is applied.
• Absolute Discounting behaves almost as a parameter-free smoothing method.
Bibliography
[1] J. Parapar, A. Bellogín, P. Castells, and A. Barreiro. Relevance-based language modelling for recommender systems. IPM, 49(4):966–980, July 2013.
[2] C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM TOIS, 22(2):179–214, Apr. 2004.
ECIR 2015, 37th European Conference on Information Retrieval. 29 March - 2 April, 2015, Vienna, Austria