SlideShare uma empresa Scribd logo
1 de 35
Big Data as a Streaming Service 
Big Data as a Streaming Service 
Julie Knibbe 
Product Manager – Deezer 
@julieknibbe 
Manuel Moussalam 
R&D – Deezer
Big Data as a Streaming Service 
Product Manager 
Defines features that meet users needs 
Based on: 
• Market research 
• Product Data Analytics 
• Users feedback 
• Competitive Analysis 
• Creativity 
Big Data as a Streaming Service 
The Leanback Experience Team at Deezer 
• Product Manager 
• Project Manager 
• R&D Developers 
• Big Data developers 
• Web developers (front/back) 
• Mobile developers 
• QA
Big Data as a Streaming Service 
Deezer 
Active users 30M 
Countries 180+ 
Tracks in catalog 35M 
Artists in catalog 1M 
Music providers 1K+
Big Data as a Streaming Service 
The recommendation problem 
No one wants to hear music 
they don’t like
Big Data as a Streaming Service 
The recommendation problem 
No one wants to hear the 
same 200 tracks over and 
over again
Big Data as a Streaming Service 
The recommendation problem 
You need to hear a song from 
1 to 7 times to like it
Big Data as a Streaming Service 
The recommendation problem 
Parameters and variables: 
• Mood 
• Tastes 
• Habits 
• Openness 
• Sociological profile 
• … 
Dimensions: 
• 35M tracks 
• 1M artists 
• 30M users
Big Data as a Streaming Service 
Building a user profile 
Onboarding users 
Monitoring user actions
Big Data as a Streaming Service 
Deezer – User qualification
Big Data as a Streaming Service 
User Profile
Big Data as a Streaming Service 
User Profile – Implicit / Explicit feedback 
Adaptation 
Add new information 
Forget old interests
Big Data as a Streaming Service 
Music Recommendation 
Given a listening profile for user X, what music should we 
recommend?
Recommendation system – adapting to user types 
Big Data as a Streaming Service 
Savants 
Enthusiasts 
Casuals 
Indifferents 
Riskier 
recommendations 
Popular 
recommendations 
Finding the right mix between novelty, familiarity and relevance
Recommendation system – adapting to user types 
Big Data as a Streaming Service 
Sources: 
http://alchemi.co.uk/archives/mus/groups_and_beha.html 
http://musicmachinery.com/2014/01/14/the-zero-button-music-player-2/
Big Data as a Streaming Service 
Use cases 
Playlist / Channel generation 
Discovery 
Personal Search 
…
Big Data as a Streaming Service 
Deezer features – Flow
Big Data as a Streaming Service 
Deezer features – Hear This
Big Data as a Streaming Service 
At Deezer 
Mixing collaborative filtering with semi-supervised 
approaches 
• Curation: Deezer Editors 
• Multi-layered graph structure of tracks & artists 
• Usage monitoring 
Based on Hadoop + ElasticSearch + Spark
Big Data as a Streaming Service 
Collaborative Filtering: Matching 
Collaborative Filtering : 
« User X listened to the Rolling Stones. Users listening 
to the Rolling Stones usually also listen to the Who, 
let's suggest the Who to user X. » 
Popularized by the Netflix Prize
Big Data as a Streaming Service 
Collaborative Filtering 
Either compute similarity upon users or items.. or both
Big Data as a Streaming Service 
Real data
Big Data as a Streaming Service 
Collaborative filtering: Exemplar based 
Association rules 
• Market basket analysis 
• A priori Algorithm 
• .. 
But: 
• Scalability issues 
• Hubs and Island issues (Stromae example)
Big Data as a Streaming Service 
Collaborative filtering: Model based 
Matrix Factorization 
A 
n 
m 
= U 
I 
X 
k 
• U is low-dimensional model on users 
• I on items 
Recommended items are missing entries of A
Big Data as a Streaming Service 
Collaborative Filtering: Limitations 
• Cold Start problem 
• Sparse user-item matrix (1% coverage) 
• Only based on social behaviors 
• Popularity bias (« The rich gets richer »)
Content-based filtering: Music items representation 
Big Data as a Streaming Service
Big Data as a Streaming Service 
Content-based filtering: Limitations 
• Cold Start problem 
• Users with atypical tastes 
• Lack of novelty 
• Subjectivity not taken into account
Big Data as a Streaming Service 
Content Similarity 
Clustering tracks, artists, albums… 
Methods: 
• Matrix Factorization techniques 
• Spectral clustering 
• Musical features extraction 
• Louvain algorithm 
• …
Big Data as a Streaming Service 
Example: Multiple Spectral Clustering
Big Data as a Streaming Service 
Cleaning 
• Mislabeled data: Different sources tell different things 
about songs, artists, albums 
• No universally adopted music ontology 
• Subjectivity 
• Outlier detection: confronting several sources and 
models
Big Data as a Streaming Service 
Cleaning: Example
Big Data as a Streaming Service 
In real life… 
A/B Testing
Big Data as a Streaming Service 
Algorithms A/B Testing 
Algo A 
Algo B 
Observe results: 
• Daily Active Users 
• Streams / users 
• Satisfaction 
• … 
Deezer users
Big Data as a Streaming Service 
Algorithms A/B Testing: Example 
Test: Are new users (with no profile data) more likely to be 
more satisfied with charts items or with new ones?
Big Data as a Streaming Service 
Thanks !

Mais conteúdo relacionado

Mais procurados

How data drives spotify
How data drives spotifyHow data drives spotify
How data drives spotifyAli Sarrafi
 
Big Data At Spotify
Big Data At SpotifyBig Data At Spotify
Big Data At SpotifyAdam Kawa
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...Hakka Labs
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyVidhya Murali
 
Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018Fabien Gouyon
 
Product School - Spotify presentation
Product School - Spotify presentationProduct School - Spotify presentation
Product School - Spotify presentationSuleiman Younossi
 
Music Recommendation Tutorial
Music Recommendation TutorialMusic Recommendation Tutorial
Music Recommendation TutorialOscar Celma
 
Analysis of Spotify & New Feature Ideas
Analysis of Spotify & New Feature IdeasAnalysis of Spotify & New Feature Ideas
Analysis of Spotify & New Feature IdeasSarah L. Miller
 
Digital Marketing - Spotify
Digital Marketing - SpotifyDigital Marketing - Spotify
Digital Marketing - SpotifyLaura Sorrentino
 
A Digital Marketing Strategy for Spotify
A Digital Marketing Strategy for Spotify A Digital Marketing Strategy for Spotify
A Digital Marketing Strategy for Spotify Maura Hickey
 
Product Owner presentation for Spotify
Product Owner presentation for SpotifyProduct Owner presentation for Spotify
Product Owner presentation for Spotifypdicorpo
 
Spotify Chords - Creating Music Moments
Spotify Chords - Creating Music MomentsSpotify Chords - Creating Music Moments
Spotify Chords - Creating Music MomentsRyan Cunningham
 
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...MME 4.5 / Music 4.5 / 2Pears
 
The Music Streaming Industry
The Music Streaming IndustryThe Music Streaming Industry
The Music Streaming IndustryRudyJoon
 
Spotify Company presentation
Spotify Company presentationSpotify Company presentation
Spotify Company presentationalifost
 

Mais procurados (20)

How data drives spotify
How data drives spotifyHow data drives spotify
How data drives spotify
 
Big Data At Spotify
Big Data At SpotifyBig Data At Spotify
Big Data At Spotify
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at Spotify
 
Spotify
SpotifySpotify
Spotify
 
Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018
 
Product School - Spotify presentation
Product School - Spotify presentationProduct School - Spotify presentation
Product School - Spotify presentation
 
Music Recommendation Tutorial
Music Recommendation TutorialMusic Recommendation Tutorial
Music Recommendation Tutorial
 
Analysis of Spotify & New Feature Ideas
Analysis of Spotify & New Feature IdeasAnalysis of Spotify & New Feature Ideas
Analysis of Spotify & New Feature Ideas
 
Digital Marketing - Spotify
Digital Marketing - SpotifyDigital Marketing - Spotify
Digital Marketing - Spotify
 
Spotify-Direct Marketing
Spotify-Direct MarketingSpotify-Direct Marketing
Spotify-Direct Marketing
 
A Digital Marketing Strategy for Spotify
A Digital Marketing Strategy for Spotify A Digital Marketing Strategy for Spotify
A Digital Marketing Strategy for Spotify
 
Product Owner presentation for Spotify
Product Owner presentation for SpotifyProduct Owner presentation for Spotify
Product Owner presentation for Spotify
 
Digital Marketing Strategy by Karan Bhah-Saavn
Digital Marketing Strategy by Karan Bhah-SaavnDigital Marketing Strategy by Karan Bhah-Saavn
Digital Marketing Strategy by Karan Bhah-Saavn
 
Spotify Chords - Creating Music Moments
Spotify Chords - Creating Music MomentsSpotify Chords - Creating Music Moments
Spotify Chords - Creating Music Moments
 
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...
Music 4.5: The value of playlists – for the record labels with Simon Rugg, Na...
 
The Music Streaming Industry
The Music Streaming IndustryThe Music Streaming Industry
The Music Streaming Industry
 
Music
MusicMusic
Music
 
Spotify's Brand DNA
Spotify's Brand DNASpotify's Brand DNA
Spotify's Brand DNA
 
Spotify Company presentation
Spotify Company presentationSpotify Company presentation
Spotify Company presentation
 

Destaque

Recommendation @Deezer
Recommendation @DeezerRecommendation @Deezer
Recommendation @Deezerrecsysfr
 
Music discovery: What, why, who, when, where?
Music discovery: What, why, who, when, where?Music discovery: What, why, who, when, where?
Music discovery: What, why, who, when, where?Julie Knibbe
 
Flexible recommender systems based on graphs
Flexible recommender systems based on graphsFlexible recommender systems based on graphs
Flexible recommender systems based on graphsrecsysfr
 
"Création de la team Growth Hacking chez Deezer" par Alicia Combaz
"Création de la team Growth Hacking chez Deezer" par Alicia Combaz"Création de la team Growth Hacking chez Deezer" par Alicia Combaz
"Création de la team Growth Hacking chez Deezer" par Alicia CombazTheFamily
 
The Daft Punk Phenomenon-Marketing Plan
The Daft Punk Phenomenon-Marketing PlanThe Daft Punk Phenomenon-Marketing Plan
The Daft Punk Phenomenon-Marketing PlanRania Papadopoulou
 
Vidgo tv streaming service
Vidgo tv streaming serviceVidgo tv streaming service
Vidgo tv streaming serviceRodney Bailey
 
GlobalDots - How Video Streaming Works
GlobalDots - How Video Streaming WorksGlobalDots - How Video Streaming Works
GlobalDots - How Video Streaming WorksGlobalDots
 
How to Perform A/B Testing?
How to Perform A/B Testing?How to Perform A/B Testing?
How to Perform A/B Testing?QATestLab
 
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
In-Stream Processing Service Blueprint, Reference architecture for real-time ...In-Stream Processing Service Blueprint, Reference architecture for real-time ...
In-Stream Processing Service Blueprint, Reference architecture for real-time ...Grid Dynamics
 
The New TV — The Rise of Digital Video And Streaming Services
The New TV — The Rise of Digital Video And Streaming ServicesThe New TV — The Rise of Digital Video And Streaming Services
The New TV — The Rise of Digital Video And Streaming ServicesBIEvents
 
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...Spark Summit
 
OTT Streaming Services in Germany
OTT Streaming Services in GermanyOTT Streaming Services in Germany
OTT Streaming Services in GermanyRené C.G. Arnold
 
CV Ria Pituita Suhata (Uwie)
CV Ria Pituita Suhata (Uwie)CV Ria Pituita Suhata (Uwie)
CV Ria Pituita Suhata (Uwie)Ria Pituita
 
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.js
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.jsNetflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.js
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.jsChris Saint-Amant
 

Destaque (15)

Recommendation @Deezer
Recommendation @DeezerRecommendation @Deezer
Recommendation @Deezer
 
Music discovery: What, why, who, when, where?
Music discovery: What, why, who, when, where?Music discovery: What, why, who, when, where?
Music discovery: What, why, who, when, where?
 
Flexible recommender systems based on graphs
Flexible recommender systems based on graphsFlexible recommender systems based on graphs
Flexible recommender systems based on graphs
 
"Création de la team Growth Hacking chez Deezer" par Alicia Combaz
"Création de la team Growth Hacking chez Deezer" par Alicia Combaz"Création de la team Growth Hacking chez Deezer" par Alicia Combaz
"Création de la team Growth Hacking chez Deezer" par Alicia Combaz
 
The Daft Punk Phenomenon-Marketing Plan
The Daft Punk Phenomenon-Marketing PlanThe Daft Punk Phenomenon-Marketing Plan
The Daft Punk Phenomenon-Marketing Plan
 
Vidgo tv streaming service
Vidgo tv streaming serviceVidgo tv streaming service
Vidgo tv streaming service
 
GlobalDots - How Video Streaming Works
GlobalDots - How Video Streaming WorksGlobalDots - How Video Streaming Works
GlobalDots - How Video Streaming Works
 
How to Perform A/B Testing?
How to Perform A/B Testing?How to Perform A/B Testing?
How to Perform A/B Testing?
 
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
In-Stream Processing Service Blueprint, Reference architecture for real-time ...In-Stream Processing Service Blueprint, Reference architecture for real-time ...
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
 
The New TV — The Rise of Digital Video And Streaming Services
The New TV — The Rise of Digital Video And Streaming ServicesThe New TV — The Rise of Digital Video And Streaming Services
The New TV — The Rise of Digital Video And Streaming Services
 
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...
 
OTT Streaming Services in Germany
OTT Streaming Services in GermanyOTT Streaming Services in Germany
OTT Streaming Services in Germany
 
CV Ria Pituita Suhata (Uwie)
CV Ria Pituita Suhata (Uwie)CV Ria Pituita Suhata (Uwie)
CV Ria Pituita Suhata (Uwie)
 
Build Features Not Apps
Build Features Not AppsBuild Features Not Apps
Build Features Not Apps
 
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.js
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.jsNetflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.js
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.js
 

Semelhante a Deezer - Big data as a streaming service

Analytics in media and entertainment industry
Analytics in media and entertainment industryAnalytics in media and entertainment industry
Analytics in media and entertainment industrySupreethaKrishna2
 
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...AIST
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Karthik Murugesan
 
FindStream investor deck
FindStream investor deckFindStream investor deck
FindStream investor deckFindStream
 
Group discussion2 (New media ecology)
Group discussion2 (New media ecology)Group discussion2 (New media ecology)
Group discussion2 (New media ecology)Yuchen LIU
 
Story of the algorithms behind Deezer Flow
Story of the algorithms behind Deezer FlowStory of the algorithms behind Deezer Flow
Story of the algorithms behind Deezer Flowrecsysfr
 
Music data analysis big data presentation
Music data analysis big data presentationMusic data analysis big data presentation
Music data analysis big data presentationShubhanshu Gupta
 
Spotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoverySpotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoveryKarthik Murugesan
 
Group discussion- Netease Cloud Music
Group discussion- Netease Cloud MusicGroup discussion- Netease Cloud Music
Group discussion- Netease Cloud MusicXuanting ZHANG
 
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...Bellakarina Solorzano
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introductionLiang Xiang
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova musicNAVER D2
 
Presentation by purshotam verma
Presentation by purshotam vermaPresentation by purshotam verma
Presentation by purshotam vermaRohit malav
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Timo van Niedek
 
Walmart Big Data Expo
Walmart Big Data ExpoWalmart Big Data Expo
Walmart Big Data ExpoBigDataExpo
 
ux academy - Beginner UX Design Course Portfolio - Louise
ux academy - Beginner UX Design Course Portfolio - Louise ux academy - Beginner UX Design Course Portfolio - Louise
ux academy - Beginner UX Design Course Portfolio - Louise MobileUXLondon
 
So, What Does a Data Scientist do?
So, What Does a Data Scientist do?So, What Does a Data Scientist do?
So, What Does a Data Scientist do?Jameel Syed
 
Ordering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataOrdering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataAndy Stretton
 

Semelhante a Deezer - Big data as a streaming service (20)

Analytics in media and entertainment industry
Analytics in media and entertainment industryAnalytics in media and entertainment industry
Analytics in media and entertainment industry
 
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018
 
FindStream investor deck
FindStream investor deckFindStream investor deck
FindStream investor deck
 
Group discussion2 (New media ecology)
Group discussion2 (New media ecology)Group discussion2 (New media ecology)
Group discussion2 (New media ecology)
 
Story of the algorithms behind Deezer Flow
Story of the algorithms behind Deezer FlowStory of the algorithms behind Deezer Flow
Story of the algorithms behind Deezer Flow
 
Social music slides4
Social music slides4Social music slides4
Social music slides4
 
Music data analysis big data presentation
Music data analysis big data presentationMusic data analysis big data presentation
Music data analysis big data presentation
 
Spotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoverySpotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music Discovery
 
Group discussion- Netease Cloud Music
Group discussion- Netease Cloud MusicGroup discussion- Netease Cloud Music
Group discussion- Netease Cloud Music
 
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introduction
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
Presentation by purshotam verma
Presentation by purshotam vermaPresentation by purshotam verma
Presentation by purshotam verma
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
 
69 kuta
69 kuta69 kuta
69 kuta
 
Walmart Big Data Expo
Walmart Big Data ExpoWalmart Big Data Expo
Walmart Big Data Expo
 
ux academy - Beginner UX Design Course Portfolio - Louise
ux academy - Beginner UX Design Course Portfolio - Louise ux academy - Beginner UX Design Course Portfolio - Louise
ux academy - Beginner UX Design Course Portfolio - Louise
 
So, What Does a Data Scientist do?
So, What Does a Data Scientist do?So, What Does a Data Scientist do?
So, What Does a Data Scientist do?
 
Ordering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataOrdering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect data
 

Último

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 

Último (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 

Deezer - Big data as a streaming service

  • 1. Big Data as a Streaming Service Big Data as a Streaming Service Julie Knibbe Product Manager – Deezer @julieknibbe Manuel Moussalam R&D – Deezer
  • 2. Big Data as a Streaming Service Product Manager Defines features that meet users needs Based on: • Market research • Product Data Analytics • Users feedback • Competitive Analysis • Creativity 
  • 3. Big Data as a Streaming Service The Leanback Experience Team at Deezer • Product Manager • Project Manager • R&D Developers • Big Data developers • Web developers (front/back) • Mobile developers • QA
  • 4. Big Data as a Streaming Service Deezer Active users 30M Countries 180+ Tracks in catalog 35M Artists in catalog 1M Music providers 1K+
  • 5. Big Data as a Streaming Service The recommendation problem No one wants to hear music they don’t like
  • 6. Big Data as a Streaming Service The recommendation problem No one wants to hear the same 200 tracks over and over again
  • 7. Big Data as a Streaming Service The recommendation problem You need to hear a song from 1 to 7 times to like it
  • 8. Big Data as a Streaming Service The recommendation problem Parameters and variables: • Mood • Tastes • Habits • Openness • Sociological profile • … Dimensions: • 35M tracks • 1M artists • 30M users
  • 9. Big Data as a Streaming Service Building a user profile Onboarding users Monitoring user actions
  • 10. Big Data as a Streaming Service Deezer – User qualification
  • 11. Big Data as a Streaming Service User Profile
  • 12. Big Data as a Streaming Service User Profile – Implicit / Explicit feedback Adaptation Add new information Forget old interests
  • 13. Big Data as a Streaming Service Music Recommendation Given a listening profile for user X, what music should we recommend?
  • 14. Recommendation system – adapting to user types Big Data as a Streaming Service Savants Enthusiasts Casuals Indifferents Riskier recommendations Popular recommendations Finding the right mix between novelty, familiarity and relevance
  • 15. Recommendation system – adapting to user types Big Data as a Streaming Service Sources: http://alchemi.co.uk/archives/mus/groups_and_beha.html http://musicmachinery.com/2014/01/14/the-zero-button-music-player-2/
  • 16. Big Data as a Streaming Service Use cases Playlist / Channel generation Discovery Personal Search …
  • 17. Big Data as a Streaming Service Deezer features – Flow
  • 18. Big Data as a Streaming Service Deezer features – Hear This
  • 19. Big Data as a Streaming Service At Deezer Mixing collaborative filtering with semi-supervised approaches • Curation: Deezer Editors • Multi-layered graph structure of tracks & artists • Usage monitoring Based on Hadoop + ElasticSearch + Spark
  • 20. Big Data as a Streaming Service Collaborative Filtering: Matching Collaborative Filtering : « User X listened to the Rolling Stones. Users listening to the Rolling Stones usually also listen to the Who, let's suggest the Who to user X. » Popularized by the Netflix Prize
  • 21. Big Data as a Streaming Service Collaborative Filtering Either compute similarity upon users or items.. or both
  • 22. Big Data as a Streaming Service Real data
  • 23. Big Data as a Streaming Service Collaborative filtering: Exemplar based Association rules • Market basket analysis • A priori Algorithm • .. But: • Scalability issues • Hubs and Island issues (Stromae example)
  • 24. Big Data as a Streaming Service Collaborative filtering: Model based Matrix Factorization A n m = U I X k • U is low-dimensional model on users • I on items Recommended items are missing entries of A
  • 25. Big Data as a Streaming Service Collaborative Filtering: Limitations • Cold Start problem • Sparse user-item matrix (1% coverage) • Only based on social behaviors • Popularity bias (« The rich gets richer »)
  • 26. Content-based filtering: Music items representation Big Data as a Streaming Service
  • 27. Big Data as a Streaming Service Content-based filtering: Limitations • Cold Start problem • Users with atypical tastes • Lack of novelty • Subjectivity not taken into account
  • 28. Big Data as a Streaming Service Content Similarity Clustering tracks, artists, albums… Methods: • Matrix Factorization techniques • Spectral clustering • Musical features extraction • Louvain algorithm • …
  • 29. Big Data as a Streaming Service Example: Multiple Spectral Clustering
  • 30. Big Data as a Streaming Service Cleaning • Mislabeled data: Different sources tell different things about songs, artists, albums • No universally adopted music ontology • Subjectivity • Outlier detection: confronting several sources and models
  • 31. Big Data as a Streaming Service Cleaning: Example
  • 32. Big Data as a Streaming Service In real life… A/B Testing
  • 33. Big Data as a Streaming Service Algorithms A/B Testing Algo A Algo B Observe results: • Daily Active Users • Streams / users • Satisfaction • … Deezer users
  • 34. Big Data as a Streaming Service Algorithms A/B Testing: Example Test: Are new users (with no profile data) more likely to be more satisfied with charts items or with new ones?
  • 35. Big Data as a Streaming Service Thanks !

Notas do Editor

  1. User based neighbourhood: find similar users and recommend their taste Item based neighbourhood: find similar items (association rules  item in same playlists, etc.)
  2. User based neighbourhood: find similar users and recommend their taste Item based neighbourhood: find similar items (association rules  item in same playlists, etc.)
  3. User based neighbourhood: find similar users and recommend their taste Item based neighbourhood: find similar items (association rules  item in same playlists, etc.)
  4. User based neighbourhood: find similar users and recommend their taste Item based neighbourhood: find similar items (association rules  item in same playlists, etc.)
  5. Rich gets richer
  6. Collect information to describe items – and work on similarity
  7. Collect information to describe items – and work on similarity
  8. Collect information to describe items – and work on similarity
  9. Artist / Artist matrix to find similar artists