Metric Learning for Music Discovery with Source and Target Playlists
Ying-Shu Kuo
August 12, 2015
Proposed Idea
(UI mockup: playlist table)
No  Name    Artist
1   Song_A  Artist_A
2   Song_B  Artist_B
3   Song_C  Artist_A
4   Song_D  Artist_C
5   Song_E  Artist_B
6   Song_F  Artist_D
7   Song_G  Artist_E
8   Song_H  Artist_E
9   Song_I  Artist_F
(UI mockup: tabs Playlist / Your Set / Target Set; Search; Song_A, Artist_A; Parameter)
(Map legend: Song, Your Set, Target Set, Others, Chosen, Playlist, Similarity)
※ The x-y axes have no meaning.
Use Case
• Explore an unknown music genre (e.g. from Jazz to Metal)
• Get to know your friend’s jam (e.g. from your favs to her favs)
What I need for this
1. Songs to play with => Million Song Dataset / Spotify API
2. Music similarity => EchoNest Audio Features
3. Cluster song sets => Metric Learning
4. 2-D Visualization => Dimension Reduction
5. Playlist Generation
Million Song Dataset
• Criteria for a good dataset
• Why use MSD?
Bertin-Mahieux, Thierry, et al. "The million song dataset." ISMIR 2011: Proceedings of the 12th International Society for Music Information
Retrieval Conference, October 24-28, 2011, Miami, Florida. University of Miami, 2011.
http://audiocontentanalysis.org/data-sets
Dataset       RWC  CAL500  GTZAN       MusiCLEF  MSD
size          465  502     1,000       200,000   1,000,000
has audio     Y    Y       Y           Y         N*
has metadata  Y    Y       Y (update)  ?         Y
* A subset has 7digital audio previews. All of the songs have content-based features.
EchoNest Feature
• Metadata: artist name / song title / album name /

year / duration
• Low-level: segment time / loudness / pitch / timbre
• Time: tempo / time signature / section time / bar time …
http://developer.echonest.com/docs/v4/_static/AnalyzeDocumentation.pdf
EchoNest Feature
• Codebook-based
(Figure labels: mean, stddev)
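The "mean" and "stddev" labels suggest that per-segment features are collapsed into one fixed-length vector per song. A minimal sketch of that idea, assuming a hypothetical (n_segments, n_dims) matrix of segment features such as timbre; the function name and the toy data are illustrative and not taken from the slides:

```python
import numpy as np

def summarize_segments(segment_features):
    """Collapse a (n_segments, n_dims) matrix into one [mean | stddev] vector."""
    segment_features = np.asarray(segment_features, dtype=float)
    if segment_features.ndim != 2 or segment_features.shape[0] == 0:
        raise ValueError("expected a non-empty (n_segments, n_dims) array")
    mean = segment_features.mean(axis=0)
    std = segment_features.std(axis=0)  # population stddev (ddof=0)
    return np.concatenate([mean, std])

# Toy stand-in for a per-segment feature matrix (3 segments, 2 dimensions).
toy_segments = np.array([[1.0, 2.0], [3.0, 2.0], [5.0, 2.0]])
song_vector = summarize_segments(toy_segments)
```

Songs with different numbers of segments end up with vectors of the same length, which is what a distance metric needs.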
Metric Learning
• Metric: defines how the distance between data points is measured
http://en.wikipedia.org/wiki/File:Manhattan_distance.svg
Bellet, Aurélien, Amaury Habrard, and Marc Sebban. "A survey on metric learning for feature vectors and structured data." arXiv preprint
arXiv:1306.6709 (2013).
Metric Learning
• Mahalanobis Distance
http://stats.stackexchange.com/questions/62092/bottom-to-top-explanation-of-the-mahalanobis-distance
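A small sketch of the Mahalanobis distance, d(x, y) = sqrt((x - y)^T S^-1 (x - y)), written with NumPy. The covariance matrix and points are toy values chosen for illustration; in metric learning, S^-1 is typically replaced by a learned positive semidefinite matrix.

```python
import numpy as np

def mahalanobis(x, y, cov):
    """Return sqrt((x - y)^T cov^{-1} (x - y))."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    # Solve cov @ z = diff rather than forming an explicit inverse.
    z = np.linalg.solve(np.asarray(cov, dtype=float), diff)
    return float(np.sqrt(diff @ z))

# Toy covariance: variance 4 along the first axis, 1 along the second.
cov = np.array([[4.0, 0.0], [0.0, 1.0]])
origin = np.array([0.0, 0.0])
d_first_axis = mahalanobis(origin, [2.0, 0.0], cov)   # along the high-variance axis
d_second_axis = mahalanobis(origin, [0.0, 2.0], cov)  # along the low-variance axis
```

Both points are at Euclidean distance 2 from the origin, but the one along the high-variance axis counts as closer. With the identity matrix as `cov`, the function reduces to the Euclidean distance.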
Metric Learning
• Metric Learning: learning a distance function from data
Bellet, Aurélien, Amaury Habrard, and Marc Sebban. "A survey on metric learning for feature vectors and structured data." arXiv preprint
arXiv:1306.6709 (2013).
Metric Learning
• Why do I need to reshape the feature space?
(Figure panels: original / metric learned)
Metric Learning – LMNN
Large Margin Nearest Neighbor
Weinberger, Kilian Q., John Blitzer, and Lawrence K. Saul. "Distance metric learning for large margin nearest neighbor classification."
Advances in neural information processing systems. 2005.
NOT the unlabeled one!!!
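A rough sketch of the LMNN objective for a linear map L, to make the "pull target neighbors, push impostors" idea concrete. It only evaluates the loss and does not run the optimization. The (1 - mu) / mu weighting follows a common formulation of LMNN; the toy points, labels and target-neighbor pairs are assumptions made for illustration, with the two labels standing in for the two song sets.

```python
import numpy as np

def lmnn_loss(L, X, y, target_neighbors, mu=0.5):
    """Evaluate the LMNN loss for a linear map L.

    target_neighbors is a list of (i, j) pairs where j is a same-label
    neighbor that point i should stay close to.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    Z = X @ np.asarray(L, dtype=float).T  # points after the linear map
    pull, push = 0.0, 0.0
    for i, j in target_neighbors:
        d_ij = np.sum((Z[i] - Z[j]) ** 2)
        pull += d_ij  # pull target neighbors together
        for l in np.flatnonzero(y != y[i]):
            d_il = np.sum((Z[i] - Z[l]) ** 2)
            # Hinge: differently labeled points should be farther away
            # than the target neighbor by a margin of 1.
            push += max(0.0, 1.0 + d_ij - d_il)
    return (1.0 - mu) * pull + mu * push

y = np.array([0, 0, 1])
pairs = [(0, 1), (1, 0)]
far_impostor = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 3.0]])
near_impostor = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
loss_far = lmnn_loss(np.eye(2), far_impostor, y, pairs)
loss_near = lmnn_loss(np.eye(2), near_impostor, y, pairs)
```

When the differently labeled point sits well outside the margin, only the pull term contributes; moving it closer adds a hinge penalty.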
Metric Learning – GB-LMNN
Gradient-Boosted Large Margin Nearest Neighbor
Kedem, Dor, Zhixiang Eddie Xu, and Kilian Q. Weinberger. "Gradient Boosted Large Margin Nearest Neighbors."
• Kernel trick, non-linear
• Gradient Boosted Regression Tree
Metric Learning – Evaluation
• Do the starting and ending songs cluster?
• Davies–Bouldin Index
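A minimal NumPy sketch of the Davies–Bouldin index, using the mean distance to the centroid as the within-cluster scatter. Lower values mean tighter, better separated clusters. The two toy clusters stand in for the starting and ending song sets and are illustrative only.

```python
import numpy as np

def davies_bouldin(X, labels):
    """Davies-Bouldin index; assumes at least two clusters with distinct centroids."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    ids = np.unique(labels)
    if len(ids) < 2:
        raise ValueError("need at least two clusters")
    centroids = np.array([X[labels == k].mean(axis=0) for k in ids])
    # Scatter: mean Euclidean distance of each cluster's points to its centroid.
    scatter = np.array([
        np.linalg.norm(X[labels == k] - c, axis=1).mean()
        for k, c in zip(ids, centroids)
    ])
    worst_ratios = []
    for i in range(len(ids)):
        ratios = [
            (scatter[i] + scatter[j]) / np.linalg.norm(centroids[i] - centroids[j])
            for j in range(len(ids)) if j != i
        ]
        worst_ratios.append(max(ratios))
    return float(np.mean(worst_ratios))

points = np.array([[0.0, 0.0], [2.0, 0.0], [10.0, 0.0], [12.0, 0.0]])
sets = np.array([0, 0, 1, 1])
db_index = davies_bouldin(points, sets)
```

Computing the index on the same song sets before and after applying a learned metric gives one number per method to compare.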
Metric Learning – Evaluation
10 vs 10  ø     LMNN   GB-LMNN  OASIS
average   9.46  10.85  5.62     12.49
max       –     16.43  15.66    13.25
min       –     8.89   0.61     11.99
Dimension Reduction
• High dimension to low dimension based on constraints
• Keep the distances between data points the same
• 2-D visualization
Van der Maaten, Laurens, and Geoffrey Hinton. "Visualizing data using t-SNE." Journal of Machine Learning Research 9 (2008): 2579-2605.
Dimension Reduction – t-SNE
http://commons.wikimedia.org/wiki/File:T_distribution_1df_enhanced.svg
Van der Maaten, Laurens, and Geoffrey Hinton. "Visualizing data using t-SNE." Journal of Machine Learning Research 9 (2008): 2579-2605.
• Pairwise distance
• Effective neighbors = local
• Gaussian vs t-distribution
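A sketch of the two similarity kernels behind the bullets above: Gaussian affinities in the original space and Student-t affinities (one degree of freedom) in the low-dimensional map. This is a simplification: real t-SNE tunes a separate bandwidth per point from a perplexity setting and symmetrizes the Gaussian affinities, whereas the fixed `sigma` here is an assumption for illustration.

```python
import numpy as np

def squared_distances(P):
    """Pairwise squared Euclidean distances for an (n, d) array."""
    diff = P[:, None, :] - P[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def gaussian_affinities(X, sigma=1.0):
    """Conditional affinities p_{j|i} with a fixed bandwidth; each row sums to 1."""
    W = np.exp(-squared_distances(X) / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)  # a point is not its own neighbor
    return W / W.sum(axis=1, keepdims=True)

def student_t_affinities(Y):
    """Joint affinities q_ij with a Student-t kernel; the whole matrix sums to 1."""
    W = 1.0 / (1.0 + squared_distances(Y))
    np.fill_diagonal(W, 0.0)
    return W / W.sum()

# Three toy points on a line: two close together, one far away.
pts = np.array([[0.0], [1.0], [5.0]])
P = gaussian_affinities(pts)
Q = student_t_affinities(pts)
```

For the far pair (indices 0 and 2), the Gaussian affinity is almost zero while the Student-t affinity stays noticeable. The heavy tail is what lets distant points sit far apart in the 2-D map without a large penalty.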
Playlist Generation
• Trying to create a list of music based on some
assumptions/rules/constraints.
Playlist Generation – Related Work
                         Zheleva et al. [1]                  McFee et al. [2]        Chen et al. [3]   mine
assumption / constraint  matching user taste and song taste  natural language        natural language  2 clusters, smooth
input (dataset)          triplet (user, song, t)             tag 0/1; content-based  playlists         content-based
approach                 topic model (LDA)                   Markov chain ensemble   Markov chain      nearest neighbors
evaluation               entropy-based                       log likelihood          log likelihood    ?
[1] Zheleva, Elena, et al. "Statistical models of music-listening sessions in social media." Proceedings of the 19th international conference
on World wide web. ACM, 2010.
[2] McFee, Brian, and Gert RG Lanckriet. "The Natural Language of Playlists." ISMIR. 2011.
[3] Chen, Shuo, et al. "Playlist prediction via metric embedding." Proceedings of the 18th ACM SIGKDD international conference on
Knowledge discovery and data mining. ACM, 2012.
Playlist Generation – Related Work
                         Flexer [4]                Van Gulik [5]                   Lamere [6]              mine
assumption / constraint  specifying start and end  high-level control of playlist  boil the frog           2 clusters, smooth
input (dataset)          content-based             songs with metadata             songs with artist info  content-based
approach                 divergence ratio          visualization path drawing      artist similarity       nearest neighbors
evaluation               same genre                –                               –                       ?
[4] Flexer, Arthur, et al. "Playlist Generation using Start and End Songs." ISMIR. 2008.
[5] Van Gulik, Rob, and Fabio Vignoli. "Visual Playlist Generation on the Artist Map." ISMIR. Vol. 5. 2005.
[6] http://static.echonest.com/frog/
Playlist Generation – Method
• number of songs
• threshold
http://www.pstcc.edu/departments/natural_behavioral_sciences/Web%20Physics/E2020D0103.gif
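The slide names two parameters (number of songs, threshold) and the related-work table lists "nearest neighbors" as the approach, but the exact procedure is not given. The sketch below is one possible reading, not the author's method: a greedy walk from a source song to a target song in the embedded space, where `threshold` is assumed to cap the size of each hop and `n_songs` is assumed to cap the playlist length. All names are hypothetical.

```python
import numpy as np

def generate_playlist(points, start, end, n_songs, threshold):
    """Greedy walk from song `start` to song `end` (indices into `points`).

    Assumed rule: each next song must lie within `threshold` of the current
    song and be strictly closer to the target than the current song is.
    Among such candidates, the one closest to the target is chosen.
    """
    points = np.asarray(points, dtype=float)
    playlist = [start]
    used = {start, end}
    while len(playlist) < n_songs - 1:
        current = points[playlist[-1]]
        best = None
        best_dist = np.linalg.norm(current - points[end])
        for idx in range(len(points)):
            if idx in used:
                continue
            hop = np.linalg.norm(points[idx] - current)
            to_end = np.linalg.norm(points[idx] - points[end])
            if hop <= threshold and to_end < best_dist:
                best, best_dist = idx, to_end
        if best is None:  # no admissible next song; jump to the target
            break
        playlist.append(best)
        used.add(best)
    playlist.append(end)
    return playlist

# Toy 2-D embedding: five songs on a line plus one outlier.
songs = np.array([[0, 0], [1, 0], [2, 0], [3, 0], [4, 0], [0, 5]], dtype=float)
full_path = generate_playlist(songs, start=0, end=4, n_songs=5, threshold=1.5)
short_path = generate_playlist(songs, start=0, end=4, n_songs=3, threshold=1.5)
```

A smaller threshold gives smoother transitions but can leave the walk without an admissible next song, in which case this sketch jumps straight to the target.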
Playlist Generation – Result
• demo
Future Work and Discussion
• Discussion
• feature representation
• path finding
• Future Work
• Implementation on Spotify API
• User Study
Thank you!
Questions / Comments?
