SlideShare uma empresa Scribd logo
1 de 20
Competence Center Information Retrieval & Machine Learning
UMAP‘13 Doctoral Consortium
Evaluation of Cross-Domain News Article Recommendations
Benjamin Kille
13. Juni 2013
Agenda
213. Juni 2013
► Problem description
► Challenges in News Article Recommendation
 Sparsity
 Dynamic item collection
 Evaluation
► Research Questions
► Data outline
► Preliminary results
► Conclusions
► Next steps
Problem description
313. Juni 2013
► Information overload
 amount of on-line accessible news articles increases
 limited user perception
 limited time capacity
► Solution: Recommender System  filtering news articles with
respect to relevance/utility
► Special challenges for news recommender systems
 Sparsity
 Dynamics
► General challenges for recommender systems
 Evaluation strategy
Problem formalization
413. Juni 2013
►
Sparsity
513. Juni 2013
►
Dynamics
613. Juni 2013
► News  dynamic content
Billsus, D. & Pazzani, M.J., 2007. Adaptive News Access. In P. Brusilovsky, A. Kobsa, &
W. Nejdl, eds. The Adaptive Web. Springer, pp. 550–570.
► Unlike music or movies rarely re-consumed
► For instance: Deutsche Presse Agentur (DPA)
 750 messages
 220k words
 1,5k images
http://www.dpa.de/Zahlen-Fakten.152.0.html
Evaluation
713. Juni 2013
► Strategy
 on-line: A/B testing (user-centric)
 off-line: data set (data-centric)
► Numerous facets
 utility
 relevance
 novelty
 serendipity
 …
► Dependending on the model formulation
 preference prediction (requires numerical preference data)
 item ranking
Evaluation (cont‘d)
813. Juni 2013
Dispatcher
● recommendation request
 click
●
●
●
●
●
●

Evaluation (cont‘d)
913. Juni 2013
Dispatcher
● recommendation request
 click
● ● ● ● ●●
 
? ?
Li, L. et al., 2011. Unbiased offline evaluation of contextual-bandit-based news
article recommendation algorithms. In Proceedings of the fourth ACM international
conference on Web search and data mining - WSDM ’11. p. 297.
Cross-domain setting
1013. Juni 2013
D1
D2
U
U
I
I
D1
D2
U
U
I
I
D1
D2
U
U
I
I
D1
D2
U
U
I
I
nooverlap
Useroverlap
Itemoverlap
fulloverlap
Cremonesi, P., Tripodi, A. & Turrin, R., 2011. Cross-Domain Recommender Systems. In
2011 IEEE 11th International Conference on Data Mining Workshops. IEEE, pp. 496–503.
Research Questions
1113. Juni 2013
► How can other publishers' user interactions contribute to
decrease sparsity for the target publisher?
► What characteristics must recommender algorithms exhibit to
successfully cope with dynamically changing item collections?
► How to evaluate cross-domain recommender systems with
dynamically changing item collections? How do standard
evaluation metrics compare to the observed clicks?
Data outline
1213. Juni 2013
► > 1-2M impressions by 12 publishers (general news, local news,
finance, information technology, sports, etc.) on a daily basis
► user features such as
 browser
 ISP
 OS
 device
► news article features such as
 title
 text
 URL
 Image
► http://www.dai-labor.de/en/irml/epen/
►Real interactions with actual users!
Preliminary results
1313. Juni 2013
► Sparsity
► Histogram of the relative frequency of user interactions
Preliminary results (cont‘d)
1413. Juni 2013
► Dynamics
Preliminary results (cont‘d)
1513. Juni 2013
► Popularity
Conclusions
1613. Juni 2013
►
Next steps
1713. Juni 2013
► Implementation of existing cross-domain recommender
algorithms
► Evaluating recommender algorithms with respect to
 CTR
 novelty
 diversity
► Investigate UI effects
► Analyze applicability of context-sensitive recommendations
► User/Item clustering to speed-up computation time
Thank you for the attention!
1813. Juni 2013
Questions???
Announcement: NRS 2013
1913. Juni 2013
► International News Recommender Systems Workshop and Challenge
► In conjunction with ACM RecSys 2013
IMPORTANT DATES
 July 21, 2013 paper submission deadline
 July 1, 2013 data set release
 August 15, 2013 on-line challenge kick-off
HIGHLIGHTS
 Access to a real recommender system
 Real-time requirements
 Big Data
 Cross-domain
 Implicit feedback
Website: https://sites.google.com/site/newsrec2013/home
Twitter: @NRSws2013
Competence Center Information Retrieval &
Machine Learning
www.dai-labor.de
Fon
Fax
+49 (0) 30 / 314 – 74
+49 (0) 30 / 314 – 74 003
DAI-Labor
Technische Universität Berlin
Fakultät IV – Elektrontechnik & Informatik
Sekretariat TEL 14
Ernst-Reuter-Platz 7
10587 Berlin, Deutschland
20
Benjamin Kille
Researcher / PhD student
benjamin.kille@dai-labor.de
74128
13. Juni 2013

Mais conteúdo relacionado

Semelhante a Evaluation of Cross-Domain News Article Recommendations

Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
SocietyConsulting
 
Knowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at ConcernKnowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at Concern
Ellen Ward
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-Edition
Gang Li
 
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Zoltan Nagy
 

Semelhante a Evaluation of Cross-Domain News Article Recommendations (20)

Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
 
Knowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at ConcernKnowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at Concern
 
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven SummarizationTowards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
 
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraAlgorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
 
Big Data and User Segmentation in Mobile Context
Big Data and User Segmentation in Mobile ContextBig Data and User Segmentation in Mobile Context
Big Data and User Segmentation in Mobile Context
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-Edition
 
Data lakes a tool for minimizing expenditure on storage
Data lakes a tool for minimizing expenditure on storageData lakes a tool for minimizing expenditure on storage
Data lakes a tool for minimizing expenditure on storage
 
Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...
 
Information Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT CoimbatoreInformation Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT Coimbatore
 
How to use data to increase user retention presented by Kevin Bravo of Phitur...
How to use data to increase user retention presented by Kevin Bravo of Phitur...How to use data to increase user retention presented by Kevin Bravo of Phitur...
How to use data to increase user retention presented by Kevin Bravo of Phitur...
 
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
 
Big data in transport an international transport forum overview oct 2013
Big data in transport    an international transport forum overview oct 2013Big data in transport    an international transport forum overview oct 2013
Big data in transport an international transport forum overview oct 2013
 
Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013
 
UXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter togetherUXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter together
 
Advertising using big data
Advertising using big dataAdvertising using big data
Advertising using big data
 
Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012
 
Web mining
Web miningWeb mining
Web mining
 
User Involvement in Software Evolution Practice: A Case Study
User Involvement in Software Evolution Practice: A Case StudyUser Involvement in Software Evolution Practice: A Case Study
User Involvement in Software Evolution Practice: A Case Study
 
Mobile where it Matters - A Strategic Approach to Digital Ecosystems
Mobile where it Matters - A Strategic Approach to Digital EcosystemsMobile where it Matters - A Strategic Approach to Digital Ecosystems
Mobile where it Matters - A Strategic Approach to Digital Ecosystems
 
dabblr_report_final
dabblr_report_finaldabblr_report_final
dabblr_report_final
 

Último

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Último (20)

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 

Evaluation of Cross-Domain News Article Recommendations

  • 1. Competence Center Information Retrieval & Machine Learning UMAP‘13 Doctoral Consortium Evaluation of Cross-Domain News Article Recommendations Benjamin Kille 13. Juni 2013
  • 2. Agenda 213. Juni 2013 ► Problem description ► Challenges in News Article Recommendation  Sparsity  Dynamic item collection  Evaluation ► Research Questions ► Data outline ► Preliminary results ► Conclusions ► Next steps
  • 3. Problem description 313. Juni 2013 ► Information overload  amount of on-line accessible news articles increases  limited user perception  limited time capacity ► Solution: Recommender System  filtering news articles with respect to relevance/utility ► Special challenges for news recommender systems  Sparsity  Dynamics ► General challenges for recommender systems  Evaluation strategy
  • 6. Dynamics 613. Juni 2013 ► News  dynamic content Billsus, D. & Pazzani, M.J., 2007. Adaptive News Access. In P. Brusilovsky, A. Kobsa, & W. Nejdl, eds. The Adaptive Web. Springer, pp. 550–570. ► Unlike music or movies rarely re-consumed ► For instance: Deutsche Presse Agentur (DPA)  750 messages  220k words  1,5k images http://www.dpa.de/Zahlen-Fakten.152.0.html
  • 7. Evaluation 713. Juni 2013 ► Strategy  on-line: A/B testing (user-centric)  off-line: data set (data-centric) ► Numerous facets  utility  relevance  novelty  serendipity  … ► Dependending on the model formulation  preference prediction (requires numerical preference data)  item ranking
  • 8. Evaluation (cont‘d) 813. Juni 2013 Dispatcher ● recommendation request  click ● ● ● ● ● ● 
  • 9. Evaluation (cont‘d) 913. Juni 2013 Dispatcher ● recommendation request  click ● ● ● ● ●●   ? ? Li, L. et al., 2011. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In Proceedings of the fourth ACM international conference on Web search and data mining - WSDM ’11. p. 297.
  • 10. Cross-domain setting 1013. Juni 2013 D1 D2 U U I I D1 D2 U U I I D1 D2 U U I I D1 D2 U U I I nooverlap Useroverlap Itemoverlap fulloverlap Cremonesi, P., Tripodi, A. & Turrin, R., 2011. Cross-Domain Recommender Systems. In 2011 IEEE 11th International Conference on Data Mining Workshops. IEEE, pp. 496–503.
  • 11. Research Questions 1113. Juni 2013 ► How can other publishers' user interactions contribute to decrease sparsity for the target publisher? ► What characteristics must recommender algorithms exhibit to successfully cope with dynamically changing item collections? ► How to evaluate cross-domain recommender systems with dynamically changing item collections? How do standard evaluation metrics compare to the observed clicks?
  • 12. Data outline 1213. Juni 2013 ► > 1-2M impressions by 12 publishers (general news, local news, finance, information technology, sports, etc.) on a daily basis ► user features such as  browser  ISP  OS  device ► news article features such as  title  text  URL  Image ► http://www.dai-labor.de/en/irml/epen/ ►Real interactions with actual users!
  • 13. Preliminary results 1313. Juni 2013 ► Sparsity ► Histogram of the relative frequency of user interactions
  • 14. Preliminary results (cont‘d) 1413. Juni 2013 ► Dynamics
  • 15. Preliminary results (cont‘d) 1513. Juni 2013 ► Popularity
  • 17. Next steps 1713. Juni 2013 ► Implementation of existing cross-domain recommender algorithms ► Evaluating recommender algorithms with respect to  CTR  novelty  diversity ► Investigate UI effects ► Analyze applicability of context-sensitive recommendations ► User/Item clustering to speed-up computation time
  • 18. Thank you for the attention! 1813. Juni 2013 Questions???
  • 19. Announcement: NRS 2013 1913. Juni 2013 ► International News Recommender Systems Workshop and Challenge ► In conjunction with ACM RecSys 2013 IMPORTANT DATES  July 21, 2013 paper submission deadline  July 1, 2013 data set release  August 15, 2013 on-line challenge kick-off HIGHLIGHTS  Access to a real recommender system  Real-time requirements  Big Data  Cross-domain  Implicit feedback Website: https://sites.google.com/site/newsrec2013/home Twitter: @NRSws2013
  • 20. Competence Center Information Retrieval & Machine Learning www.dai-labor.de Fon Fax +49 (0) 30 / 314 – 74 +49 (0) 30 / 314 – 74 003 DAI-Labor Technische Universität Berlin Fakultät IV – Elektrontechnik & Informatik Sekretariat TEL 14 Ernst-Reuter-Platz 7 10587 Berlin, Deutschland 20 Benjamin Kille Researcher / PhD student benjamin.kille@dai-labor.de 74128 13. Juni 2013