This document summarizes a presentation on generating personalized music playlists by leveraging social media sources. It discusses how the exponential growth of available online music has changed listening habits but created an information overload problem. It proposes personalized playlists as a solution and describes an approach that extracts a user's explicit music preferences from Facebook and generates new playlist recommendations by comparing the user's preferences to similar artists using distributional models. The system is called Play.me and generates personalized playlists by analyzing social media data, enriching explicit preferences, and recommending popular songs from resulting artists.
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Leveraging Social Media Sources to Generate Personalized Music Playlists
1. EC-Web 2012 - 13th International Conference on Electronic Commerce and Web Technologies
Vienna (Austria), 04.09.2012
Cataldo Musto, Giovanni Semeraro, Pasquale Lops, Marco de Gemmis, Fedelucio Narducci
Leveraging Social Media Sources to
Generate Personalized Music Playlists
2. Some stats
28,000,000 songs available on iTunes Store (*)
around 31,000 hours of music
a typical user spends 1.5 hours for day listening to music
=
56 years
to listen to the whole iTunes Library
(*) http://www.digitalmusicnews.com/permalink/2012/120425itunes
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
3. exponential growth
of the available music
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
4. online music libraries totally
changed our listening habits
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
5. Information Overload
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
6. paradox of choice
(Barry Schwartz, TED talk “Why more is less”)
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
7. what music should I listen to?
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
8. solution
personalization.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
9. solution
personalized music playlists
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
10. Is this something new?
No.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
11. Amazon.com
Recommendations
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
12. Genius @iTunes
Recommendations
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
13. All the state of the art
platforms share an
important drawback.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
14. need for
explicit
information
about
user interests.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
15. training is a bottleneck.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
16. social media
changed the rules for information
management and knowledge acquisition
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
17. social media
give new opportunities for modeling and
gathering information about user preferences
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
18. Facebook
user preferences in music.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
19. Our contribution
Play.me
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
20. Play.me
personalized music playlists
• Goal
• To provide users with personalized music playlists
• Insights
• Extraction of explicit user preferences from Facebook
• Playlist creation by enriching explicit user preferences.
• New artists are added to those explicitly extracted from
Facebook
• Comparison of two enrichment techniques
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
21. Play.me
architecture
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
22. Play.me
architecture
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
23. Myusic
pre-processing
• Crawling from Last.fm
• Public API
• Content-based features
• Name of the artist + Social tags
• Noise processing
• Information locally stored
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
24. Myusic
pre-processing
Sigur Ròs tag cloud from Last.fm
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
25. Play.me
architecture
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
26. Myusic
data extraction from Facebook
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
27. Myusic
data extraction from Facebook
explicit preferences
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
28. Myusic
data extraction from Facebook
implicit preferences
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
29. Play.me
architecture
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
30. Play.fm
enrichment
• Rationale
• Given a set of explicit preferences extracted from
Facebook
• Play.me enrichs this set
• Extraction of artists similar to those the user
explicity likes
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
31. Play.fm
enrichment example
Coldplay extracted from Facebook
enrichment
radiohead red hot chili peppers kings of leon
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
32. Play.me
architecture
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
33. Play.me
playlist
Most popular songs of the artists extracted from Last.fm (as well as
those added through the enrichment) are proposed to the user.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
34. let’s go
deeper
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
35. Play.fm
enrichment
• Comparison of two approaches
•
Content-based strategy
• Distributional Models
• Linked Data
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
36. Play.fm
enrichment based on Distributional Models
• Content-based strategy
• Each artist is represented through a set of
tags
• Each artist is represented as a point in a
semantic geometrical space
• Distributional Models
• Similarity calculations to extract the most
similar artists.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
37. distributional models
insight
by analyzing large corpus of textual data it is possible
to infer information about the usage (about the meaning)
of the terms.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
38. distributional models
“meaning
is its use”
L.Wittgenstein
(Austrian philosopher)
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
39. distributional models
insight
by analyzing large corpus of textual data it is possible
to infer information about the usage (about the meaning)
of the terms.
example
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
40. distributional models
term/context matrix (WordSpace)
c1 c2 c3 c4 c5 c6 c7 c8 c9
t1 ✔ ✔ ✔ ✔
t2 ✔ ✔ ✔ ✔
t3 ✔ ✔ ✔
t4 ✔ ✔ ✔ ✔
Cataldo Musto - Enhanced Vector Space Models for Content-based Recommender Systems - Ph.D. defense - University of Bari Aldo Moro, Italy - 08.06.12
41. distributional models
beer vs. glass: good overlap
c1 c2 c3 c4 c5 c6 c7 c8 c9
t1 ✔ ✔ ✔ ✔
t2 ✔ ✔ ✔ ✔
t3 ✔ ✔ ✔
t4 ✔ ✔ ✔ ✔
Cataldo Musto - Enhanced Vector Space Models for Content-based Recommender Systems - Ph.D. defense - University of Bari Aldo Moro, Italy - 08.06.12
42. distributional models
beer vs. spoon: no overlap
c1 c2 c3 c4 c5 c6 c7 c8 c9
t1 ✔ ✔ ✔ ✔
t2 ✔ ✔ ✔ ✔
t3 ✔ ✔ ✔
t4 ✔ ✔ ✔ ✔
Cataldo Musto - Enhanced Vector Space Models for Content-based Recommender Systems - Ph.D. defense - University of Bari Aldo Moro, Italy - 08.06.12
43. distributional models
rock vs. post rock = good overlap
a1 a2 a3 a4 a5 a6
rock ✔ ✔ ✔
post rock ✔ ✔
jazz ✔
classical ✔ ✔ ✔
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
44. distributional models
rock vs. classical = no overlap
a1 a2 a3 a4 a5 a6
rock ✔ ✔ ✔
post rock ✔ ✔
jazz ✔
classical ✔ ✔ ✔
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
45. WordSpace
example
rock
post-rock
jazz
classical
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
46. representation of documents (*)
can be inferred by combining the representation of
the terms (**) occurring in the document.
(*) documents = artists
(**) terms = tags
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
47. distributional models
term/context matrix (DocSpace)
c1 c2 c3 c4 c5 c6 c7 c8 c9
t2 ✔ ✔ ✔ ✔
t3 ✔ ✔ ✔
d1 ✔ ✔ ✔ ✔ ✔
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
48. Play.fm
enrichment based on Distributional Models
Coldplay
Radiohead
Kings of Leon
Lady Gaga
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
49. Play.fm
enrichment based on Distributional Models
input: vector space representation
output: artists with the highest cosine similarity
radiohead the killers kings of leon
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
50. Linked Open Data Cloud
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
51. Linked Open Data Cloud
Structured
(RDF)
representation
of the information
stored in Wikipedia.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
52. Play.fm
enrichment based on Linked Data
Coldplay play Alternative Rock
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
53. Play.me
RDF triple
Relationships are explictly encoded in RDF.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
54. Play.fm
enrichment based on Linked Data
• Linked Open Data Cloud
• Each artist is mapped on a DBpedia node.
• univocal URI
• Relationship between artists (nodes) are explicitly
encoded
• e.g. genre, artist category, etc.
• Use of SPARQL to extract artists (nodes) that
share the same features
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
55. Play.fm
enrichment based on Linked Data
input: SPARQL query
output: artists sharing the same properties
radiohead the smiths the verve
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
56. recap
enrichment process
input: artist output: similar artists
coldplay the smiths
Linked Data
radiohead
the verve
kings of leon
Distributional Models
radiohead
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
57. experimental
evaluation.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
58. experimental design
• Experiment
• Which one is the enrichment technique that
can provide users with the best playlists ?
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
59. experimental design
settings
• 30 users
• Heterogeneous musical knowledge
• Last.fm crawl: 228,878 artists
• Extraction & Recommendation step
• 325 artists extracted
• 11 per user, on average
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
60. experimental setup
Given a playlist, each user can freely express her own
feedback (like/dislike) on the proposed tracks.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
61. experimental setup
Experiment repeated three times (one run with Linked Data enrichment,
another one with Distributional Models, one with a simple baseline based on
popularity).
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
62. experimental setup
Users were unaware of the adopted configuration.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
63. experimental design
results
76.3
80
75.2 Linked Data
Distributional Models
Baseline (Popularity)
73.75
69.7
67.5
65.9
64.6
61.25 63.2
58 58 58
55
n=1 n=2 n=3
n = number of artists added for each extracted artist
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
64. experimental design
results
76.3
80
75.2 Linked Data
Distributional Models
Baseline (Popularity)
73.75
69.7
67.5
65.9
64.6
61.25 63.2
58 58 58
55
n=1 n=2 n=3
distributional models overcome linked data
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
65. experimental design
results
76.3
80
75.2 Linked Data
Distributional Models
Baseline (Popularity)
73.75
69.7
67.5
65.9
64.6
61.25 63.2
58 58 58
55
n=1 n=2 n=3
precision in distributional models drops down more rapidly
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
66. experimental design
results
76.3
80
75.2 Linked Data
Distributional Models
Baseline (Popularity)
73.75
69.7
67.5
65.9
64.6
61.25 63.2
58 58 58
55
n=1 n=2 n=3
good results for baseline, as well (poor music knowledge?)
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
67. Play.me
recap of the experiment
• Play.me
• Platform for generating personalized music playlists
•Extraction of favourite artists from Facebook
• Experiments
• Enrichment based on distributional models
overcomes linked data-based one in a significant
way
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
68. conclusions.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
69. both enrichment techniques
overcome the baseline
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
70. distributional models
overcome linked data
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
71. future research.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
72. merging different
enrichment techniques
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
73. evaluation with user-based metrics
(serendipity, novelty, unexpectedness)
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
74. modeling context.
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12
75. questions?
C. Musto, G. Semeraro, P. Lops, M. de Gemmis, F. Narducci.
Leveraging Social Media Sources to Generate Personalized Music Playlists - EC-WEB 2012 - 04.09.12