SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
EPG content recommendation in
large scale: a case study on
interactive TV platform
D. Zibriczky, Z. Petres, M. Waszlavik, D. Tikk
ICMLA 2013 - Machine Learning with Multimedia Data
7th December 2013. Miami. United States
Outline
• Introduction
• Problem
• Solution
• Offline results
• Online results
• Conclusion
Enter date in master2 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Consumption trends
3 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Electronic Program Guide
4 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Goal
• SaskTel
• Finding relevant contents with minimal effort
• Time-shifting
• Multiple devices per household
• Graphical User Interface
• Increasing content consumption / watching length
• Increasing click through rate (CTR) using Gravity’s GUI
Enter date in master5 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Problem / Recommendation concept
• User: Device
 Users cannot be distinguished explicitly
 More than one device per household
• Item: Scheduled contents (time, program id, channel id)
 Typically series or programs without episodes
 Metadata: Information about the items
• Event: Remote controller / set-top-box based implicit feedbacks
 Switching channel, set to record, rewind, replay, stop, pause
 Next schedule, watching duration
• Recommendable items
 Set of series or programs that are broadcasted at the moment of
recommendation request or later (on now, on later scenario)
• Recommendation
 Sorting recommendable items by prediction values
 Other recommendation logic (randomization, mixing, etc..)
Enter date in master6 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Problem / Difficulties
• Implicit feedbacks only (no explicit data)
• Huge but noisy data set (zapping, leave-on, irrelevant events, …)
• Cold start problem (new items, short lifetime)
• Small recommendable set at a time
• Context dependency (time, multiple users per household)
• Difference between offline and online optimization
Enter date in master7 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Baselines
• Most popular channels
• Most popular contents (series or programs)
• Users’ favourite channels
• Users’ favourite contents (series or programs)
Enter date in master8 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Content-based filtering
• Cosine Similarity
• User model: Weighted average of meta vectors
• Prediction: Cosine similarity of vectors
• Improvement: Term frequency based weighting (TFIDF)
Enter date in master9 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
M The Simpsons
How I Met
Your Mother
Futurama …
Genre = Animation 1 0 1 …
Genre = Comedy 1 1 1 …
… … … … …
Director = Matt Groening 1 0 1 …
Director = Carter Bays 0 1 0 …
Actor = Dan Castellaneta 1 0 0 …
Actor = Billy West 0 0 1 …
… … … … …
User 1
0.53
0.81
…
0.18
0.00
0.18
0.00
…
Solution / Collaborative Filtering
• Matrix Factorization
• User model: User factors
• Prediction: Dot product of latent factors
• Solver: Alternating Least Squares with Coordinate Descent (IALS1)
Enter date in master10 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R The Simpsons
How I Met
Your Mother
Futurama …
User 1 1 …
User 2 1 1 u2*i3 …
User 3 1 …
… … … … …
Item factors
i11 i21 i31 …
i21 i22 i32 …
User factors
u11 u12
u21 u22
u31 u32
… …
Solution / Hybrid filtering
Enter date in master11 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R* The Simpsons
How I Met
Your Mother
Futurama …
User 1 1 0 0 …
User 2 1 1 0 …
User 3 0 1 0 …
… … … … …
Genre = Animation 1 0 1 …
Genre = Comedy 1 1 1 …
… … … … …
Director = Matt Groening 1 0 1 …
Director = Carter Bays 0 1 0 …
Actor = Dan Castellaneta 1 0 0 …
Actor = Billy West 0 0 1 …
… … … … …
User factors
u11 u12
u21 u22
u31 u32
… …
pu11 pu12
pu22 pu22
… …
… …
… …
… …
… …
… …
Item factors
i11 i21 I31 …
i21 i22 I32 …
• Hybrid IALS1
TP
factors
Solution / Channel recommendation
• Tensor factorization (ITALS1)
• Prediction: Hadamard product of latent factors
• Improvement: Watching duration based weighting
Enter date in master12 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R (4:00-12:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 1 …
User 2 1 …
User 3 1 …
… … … … …
R (12:00-20:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 …
User 2 1 1 …
User 3 1 …
… … … … …
R (20:00-4:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 …
User 2 1 …
User 3 u3°i2°t3 …
… … … … …
User factors
u11 u12
u21 u22
u31 u32
… …
Item factors
i11 i21 i31 …
i21 i22 i32 …
t11
t12
t21
t22
t31
t32
Solution / Item grouping
Enter date in master13 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Preprocessing
Enter date in master14 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Original set
308M
82M
23M
Train set
22M
Test set
676K
Event type based filtering by significance
1
2
3 3
1
2
3
Filtering by leave-on and short duration
Splitting by time
Offline results / Measurement
Enter date in master15 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Recall@N
 Mean Reciprocial Rank (MRR)
• Item splits:
 Having events on training set or not
o old items
o new items
 Popularity 20-80 split
o popular items
o tail items
 Episode of a series or not
o series
o non-series
• Recall@15
* Items are grouped by series ids or program ids
** Items are grouped by channel ids
*** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series
Offline results / Comparison
Enter date in master16 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Algorithm Type All items
Old
items
New
items
Popular
items
Tail
items
Series
Non-
series
Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91%
Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00%
Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28%
Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09%
CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93%
IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65%
ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06%
Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63%
Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41%
Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
• Recall@15
* Items are grouped by series ids or program ids
** Items are grouped by channel ids
*** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series
Offline results / Comparison
Enter date in master17 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Algorithm Type All items
Old
items
New
items
Popular
items
Tail
items
Series
Non-
series
Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91%
Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00%
Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28%
Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09%
CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93%
IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65%
ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06%
Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63%
Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41%
Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
Online results / User Interface
Enter date in master18 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Online results / Measurement
Enter date in master19 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Click-through rate (CTR)
 Watching Length Ratio (WR): The average watching length of the contents
that were watched at least 1 minutes by the user.
 Completed Watched Ratio (CWR): The average ratio of the events in the
content was watched at least 90% of it’s remaining length.
• Methods:
 EPG-Z: Standard consumption method (EPG and channel zapping)
 R4U: Recommended 4 U
Online results / Measurement
Enter date in master20 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Click-through rate (CTR)
 Watching Length Ratio (WR): The average watching length of the contents
that were watched at least 1 minutes by the user.
 Completed Watched Ratio (CWR): The average ratio of the events in the
content was watched at least 90% of it’s remaining length.
• Methods:
 EPG-Z: Standard consumption method (EPG and channel zapping)
 R4U: Recommended 4 U
EPG-Z vs. R4U?
Online results / Clicks
Enter date in master21 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
0%
10%
20%
30%
40%
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Distribution of clicks by position
• Users like to click on the first item.
• 80% of the clicks comes from one of the Top5 positions.
• More clicks in the 15th position (2.2%) than in the 14th (1.3%).
Online results / CTR
Enter date in master22 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
33,20%
35,30%
33,12%
26,49%
39,05%
52,07%
33,16%
All items
Non-series
Series
Tail items
Popular items
New items
Old items
Click-through rate by different item splits
Online results / CTR by usage
Enter date in master23 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
0%
10%
20%
30%
40%
50%
60%
70%
80%
1 10 100
Average CTR vs. # of rec. requests from the first use of R4U
Online results / Watching behavior
Enter date in master24 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Watching Length Ratio Completed Watching Ratio
Item splits EPG-Z R4U EPG-Z R4U
Old items 30.02% 42.04% 16.02% 31.03%
New items 21.11% 35.51% 8.01% 23.12%
Popular items 30.81% 44.19% 16.30% 32.27%
Long-tail items 28.01% 38.43% 15.11% 27.66%
Series 31.04% 43.00% 16.92% 31.51%
Non-series 17.94% 15.26% 5.31% 7.22%
All items 29.90% 42.02% 15.91% 30.53%
Online results / Watching behavior
• Contents selected via R4U are watched 40% longer and completed
with almost twice more probability than in standard way.
Enter date in master25 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Watching Length Ratio Completed Watching Ratio
Item splits EPG-Z R4U EPG-Z R4U
Old items 30.02% 42.04% 16.02% 31.03%
New items 21.11% 35.51% 8.01% 23.12%
Popular items 30.81% 44.19% 16.30% 32.27%
Long-tail items 28.01% 38.43% 15.11% 27.66%
Series 31.04% 43.00% 16.92% 31.51%
Non-series 17.94% 15.26% 5.31% 7.22%
All items 29.90% 42.02% 15.91% 30.53%<< <<
Online results / Offline vs. Online metrics
• High correlation between Recall/MRR and Completed Waching Ratio
Enter date in master26 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Offline Online
Recall@15 MRR CWR
Old items 56.98% 0.1070 31.03%
New items 26.15% 0.0405 23.12%
Popular items 57.64% 0.1097 32.27%
Long-tail items 43.61% 0.0712 27.66%
Series 57.91% 0.1094 31.51%
Non-series 24.41% 0.0322 7.22%
All items 55.48% 0.1038 30.53%
Conclusion
• Linear recommendation difficulties.
• Metadata based item modeling (CBF) is quite effective, additional
improvement by combining with CF.
• Users prefer first items, they don’t do much effort.
• High click-through rate, especially for new items.
• R4U affects user behavior and satisfaction.
• Contents selected via R4U are watched 40% longer and completed
with almost twice more probability than in standard way.
• High correlation between the proposed offline and online metrics.
Enter date in master27 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States

Mais conteúdo relacionado

Destaque

Case american airlines and the sabre system
Case   american airlines and the sabre systemCase   american airlines and the sabre system
Case american airlines and the sabre systemMarshal Richard
 
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar
 
Understanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesUnderstanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesDynamic Yield
 
How Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceHow Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceTuri, Inc.
 
The Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelThe Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelSabre Corporation
 
Personalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPersonalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPancrazio Auteri
 

Destaque (6)

Case american airlines and the sabre system
Case   american airlines and the sabre systemCase   american airlines and the sabre system
Case american airlines and the sabre system
 
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
 
Understanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesUnderstanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best Practices
 
How Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceHow Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer Experience
 
The Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelThe Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in Travel
 
Personalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPersonalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from Netflix
 

Semelhante a EPG content recommendation in large scale: a case study on interactive TV platform

[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)Kunwoo Park
 
Event Mining in Social Multimedia
Event Mining in Social MultimediaEvent Mining in Social Multimedia
Event Mining in Social MultimediaSymeon Papadopoulos
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain
 
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Lippo Group Digital
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleXavier Amatriain
 
Scalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsScalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsYuanyuan Tian
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...LINE Corp.
 
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Provectus
 
Architecting IoT with Machine Learning
Architecting IoT with Machine LearningArchitecting IoT with Machine Learning
Architecting IoT with Machine LearningRudradeb Mitra
 
The Data Distribution Service Tutorial
The Data Distribution Service TutorialThe Data Distribution Service Tutorial
The Data Distribution Service TutorialAngelo Corsaro
 
Synthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerSynthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerQuantUniversity
 
Analysis random org nist2005
Analysis random org nist2005Analysis random org nist2005
Analysis random org nist2005eliecerherrera
 
VSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsVSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsBigML, Inc
 
Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...HostedbyConfluent
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis PresentationWajdi Khattel
 
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...Jihoo Kim
 
Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Jim Gilsinn
 
Big Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleBig Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleDaniel Marcous
 
Scalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsScalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsAntonio Severien
 

Semelhante a EPG content recommendation in large scale: a case study on interactive TV platform (20)

[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)
 
Event Mining in Social Multimedia
Event Mining in Social MultimediaEvent Mining in Social Multimedia
Event Mining in Social Multimedia
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business Value
 
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
 
Scalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsScalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on Microblogs
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
 
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
 
Architecting IoT with Machine Learning
Architecting IoT with Machine LearningArchitecting IoT with Machine Learning
Architecting IoT with Machine Learning
 
The Data Distribution Service Tutorial
The Data Distribution Service TutorialThe Data Distribution Service Tutorial
The Data Distribution Service Tutorial
 
Synthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerSynthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGanger
 
Analysis random org nist2005
Analysis random org nist2005Analysis random org nist2005
Analysis random org nist2005
 
VSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsVSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and Deepnets
 
Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis Presentation
 
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
 
Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2
 
Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)
 
Big Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleBig Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @Google
 
Scalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsScalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data Streams
 

Mais de David Zibriczky

Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)David Zibriczky
 
Predictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesPredictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesDavid Zibriczky
 
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...David Zibriczky
 
Recommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewRecommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewDavid Zibriczky
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...David Zibriczky
 
Fast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsFast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsDavid Zibriczky
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender SystemsDavid Zibriczky
 
Data Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsData Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsDavid Zibriczky
 
Entropy based asset pricing
Entropy based asset pricingEntropy based asset pricing
Entropy based asset pricingDavid Zibriczky
 

Mais de David Zibriczky (9)

Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
 
Predictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesPredictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment Businesses
 
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
 
Recommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewRecommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature review
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
 
Fast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsFast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender Systems
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Data Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsData Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender Systems
 
Entropy based asset pricing
Entropy based asset pricingEntropy based asset pricing
Entropy based asset pricing
 

Último

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Último (20)

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

EPG content recommendation in large scale: a case study on interactive TV platform

  • 1. EPG content recommendation in large scale: a case study on interactive TV platform D. Zibriczky, Z. Petres, M. Waszlavik, D. Tikk ICMLA 2013 - Machine Learning with Multimedia Data 7th December 2013. Miami. United States
  • 2. Outline • Introduction • Problem • Solution • Offline results • Online results • Conclusion Enter date in master2 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 3. Introduction / Consumption trends 3 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 4. Introduction / Electronic Program Guide 4 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 5. Introduction / Goal • SaskTel • Finding relevant contents with minimal effort • Time-shifting • Multiple devices per household • Graphical User Interface • Increasing content consumption / watching length • Increasing click through rate (CTR) using Gravity’s GUI Enter date in master5 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 6. Problem / Recommendation concept • User: Device  Users cannot be distinguished explicitly  More than one device per household • Item: Scheduled contents (time, program id, channel id)  Typically series or programs without episodes  Metadata: Information about the items • Event: Remote controller / set-top-box based implicit feedbacks  Switching channel, set to record, rewind, replay, stop, pause  Next schedule, watching duration • Recommendable items  Set of series or programs that are broadcasted at the moment of recommendation request or later (on now, on later scenario) • Recommendation  Sorting recommendable items by prediction values  Other recommendation logic (randomization, mixing, etc..) Enter date in master6 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 7. Problem / Difficulties • Implicit feedbacks only (no explicit data) • Huge but noisy data set (zapping, leave-on, irrelevant events, …) • Cold start problem (new items, short lifetime) • Small recommendable set at a time • Context dependency (time, multiple users per household) • Difference between offline and online optimization Enter date in master7 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 8. Solution / Baselines • Most popular channels • Most popular contents (series or programs) • Users’ favourite channels • Users’ favourite contents (series or programs) Enter date in master8 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 9. Solution / Content-based filtering • Cosine Similarity • User model: Weighted average of meta vectors • Prediction: Cosine similarity of vectors • Improvement: Term frequency based weighting (TFIDF) Enter date in master9 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States M The Simpsons How I Met Your Mother Futurama … Genre = Animation 1 0 1 … Genre = Comedy 1 1 1 … … … … … … Director = Matt Groening 1 0 1 … Director = Carter Bays 0 1 0 … Actor = Dan Castellaneta 1 0 0 … Actor = Billy West 0 0 1 … … … … … … User 1 0.53 0.81 … 0.18 0.00 0.18 0.00 …
  • 10. Solution / Collaborative Filtering • Matrix Factorization • User model: User factors • Prediction: Dot product of latent factors • Solver: Alternating Least Squares with Coordinate Descent (IALS1) Enter date in master10 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R The Simpsons How I Met Your Mother Futurama … User 1 1 … User 2 1 1 u2*i3 … User 3 1 … … … … … … Item factors i11 i21 i31 … i21 i22 i32 … User factors u11 u12 u21 u22 u31 u32 … …
  • 11. Solution / Hybrid filtering Enter date in master11 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R* The Simpsons How I Met Your Mother Futurama … User 1 1 0 0 … User 2 1 1 0 … User 3 0 1 0 … … … … … … Genre = Animation 1 0 1 … Genre = Comedy 1 1 1 … … … … … … Director = Matt Groening 1 0 1 … Director = Carter Bays 0 1 0 … Actor = Dan Castellaneta 1 0 0 … Actor = Billy West 0 0 1 … … … … … … User factors u11 u12 u21 u22 u31 u32 … … pu11 pu12 pu22 pu22 … … … … … … … … … … … … Item factors i11 i21 I31 … i21 i22 I32 … • Hybrid IALS1
  • 12. TP factors Solution / Channel recommendation • Tensor factorization (ITALS1) • Prediction: Hadamard product of latent factors • Improvement: Watching duration based weighting Enter date in master12 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R (4:00-12:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 1 … User 2 1 … User 3 1 … … … … … … R (12:00-20:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 … User 2 1 1 … User 3 1 … … … … … … R (20:00-4:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 … User 2 1 … User 3 u3°i2°t3 … … … … … … User factors u11 u12 u21 u22 u31 u32 … … Item factors i11 i21 i31 … i21 i22 i32 … t11 t12 t21 t22 t31 t32
  • 13. Solution / Item grouping Enter date in master13 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 14. Solution / Preprocessing Enter date in master14 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Original set 308M 82M 23M Train set 22M Test set 676K Event type based filtering by significance 1 2 3 3 1 2 3 Filtering by leave-on and short duration Splitting by time
  • 15. Offline results / Measurement Enter date in master15 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Recall@N  Mean Reciprocial Rank (MRR) • Item splits:  Having events on training set or not o old items o new items  Popularity 20-80 split o popular items o tail items  Episode of a series or not o series o non-series
  • 16. • Recall@15 * Items are grouped by series ids or program ids ** Items are grouped by channel ids *** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series Offline results / Comparison Enter date in master16 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Algorithm Type All items Old items New items Popular items Tail items Series Non- series Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91% Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00% Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28% Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09% CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93% IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65% ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06% Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63% Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41% Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
  • 17. • Recall@15 * Items are grouped by series ids or program ids ** Items are grouped by channel ids *** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series Offline results / Comparison Enter date in master17 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Algorithm Type All items Old items New items Popular items Tail items Series Non- series Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91% Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00% Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28% Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09% CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93% IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65% ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06% Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63% Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41% Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
  • 18. Online results / User Interface Enter date in master18 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 19. Online results / Measurement Enter date in master19 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Click-through rate (CTR)  Watching Length Ratio (WR): The average watching length of the contents that were watched at least 1 minutes by the user.  Completed Watched Ratio (CWR): The average ratio of the events in the content was watched at least 90% of it’s remaining length. • Methods:  EPG-Z: Standard consumption method (EPG and channel zapping)  R4U: Recommended 4 U
  • 20. Online results / Measurement Enter date in master20 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Click-through rate (CTR)  Watching Length Ratio (WR): The average watching length of the contents that were watched at least 1 minutes by the user.  Completed Watched Ratio (CWR): The average ratio of the events in the content was watched at least 90% of it’s remaining length. • Methods:  EPG-Z: Standard consumption method (EPG and channel zapping)  R4U: Recommended 4 U EPG-Z vs. R4U?
  • 21. Online results / Clicks Enter date in master21 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 0% 10% 20% 30% 40% 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Distribution of clicks by position • Users like to click on the first item. • 80% of the clicks comes from one of the Top5 positions. • More clicks in the 15th position (2.2%) than in the 14th (1.3%).
  • 22. Online results / CTR Enter date in master22 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 33,20% 35,30% 33,12% 26,49% 39,05% 52,07% 33,16% All items Non-series Series Tail items Popular items New items Old items Click-through rate by different item splits
  • 23. Online results / CTR by usage Enter date in master23 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 0% 10% 20% 30% 40% 50% 60% 70% 80% 1 10 100 Average CTR vs. # of rec. requests from the first use of R4U
  • 24. Online results / Watching behavior Enter date in master24 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Watching Length Ratio Completed Watching Ratio Item splits EPG-Z R4U EPG-Z R4U Old items 30.02% 42.04% 16.02% 31.03% New items 21.11% 35.51% 8.01% 23.12% Popular items 30.81% 44.19% 16.30% 32.27% Long-tail items 28.01% 38.43% 15.11% 27.66% Series 31.04% 43.00% 16.92% 31.51% Non-series 17.94% 15.26% 5.31% 7.22% All items 29.90% 42.02% 15.91% 30.53%
  • 25. Online results / Watching behavior • Contents selected via R4U are watched 40% longer and completed with almost twice more probability than in standard way. Enter date in master25 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Watching Length Ratio Completed Watching Ratio Item splits EPG-Z R4U EPG-Z R4U Old items 30.02% 42.04% 16.02% 31.03% New items 21.11% 35.51% 8.01% 23.12% Popular items 30.81% 44.19% 16.30% 32.27% Long-tail items 28.01% 38.43% 15.11% 27.66% Series 31.04% 43.00% 16.92% 31.51% Non-series 17.94% 15.26% 5.31% 7.22% All items 29.90% 42.02% 15.91% 30.53%<< <<
  • 26. Online results / Offline vs. Online metrics • High correlation between Recall/MRR and Completed Waching Ratio Enter date in master26 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Offline Online Recall@15 MRR CWR Old items 56.98% 0.1070 31.03% New items 26.15% 0.0405 23.12% Popular items 57.64% 0.1097 32.27% Long-tail items 43.61% 0.0712 27.66% Series 57.91% 0.1094 31.51% Non-series 24.41% 0.0322 7.22% All items 55.48% 0.1038 30.53%
  • 27. Conclusion • Linear recommendation difficulties. • Metadata based item modeling (CBF) is quite effective, additional improvement by combining with CF. • Users prefer first items, they don’t do much effort. • High click-through rate, especially for new items. • R4U affects user behavior and satisfaction. • Contents selected via R4U are watched 40% longer and completed with almost twice more probability than in standard way. • High correlation between the proposed offline and online metrics. Enter date in master27 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 28. ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States