SlideShare uma empresa Scribd logo
1 de 18
Genre Classification
and Analysis
Anat Gilboa
Yanjun Qi, PhD
The Journey
Music Information
Retrieval 101
Constructing approaches
to a not-so-well-defined
problem
Finding good data
Simplifying the
problem
Data visualization
Finding not-so-
good data
Fall 2014
Today!Iterating
Machine
Learning
101
Spring 2015
Let’s find out…
• What makes one song similar to another?
• What are the characteristics by which we can “classify” the genre of a song?
The Problem
• Supervised
• Regression (Linear, Logistic, Ridge)
• Classification (Bagging, Naive Bayes, SVM, NN, KNN)
• Unsupervised
• Dimension Reduction (PCA)
• Clustering (K-means, GMM/EM, Hierarchical)
Machine Learning 101
Music Information Retrieval
101
• Aims to extend the understanding and usefulness of
music data, through research, development and
application of computational approaches and tools
• Combines concepts and techniques from music,
computer science, signal processing and cognition
• Music information: bibliographical, surveys, tags,
scores, MIDI, audio, etc
Adoption
• USPOP2002
• Magnatagatune
• CAL500
• RWC MDB
• International Society for Music Information Retrieval
(ISMIR) 2011 Dataset
• Collection of audio features and metadata for 1,000,000 contemporary popular
music tracks.
• 44,745 unique artists w/dated tracks starting from 1922
• 10,000 song subset (1%, 1.8 gb)
• Each song has a number of features…
The Million Song Dataset
loudness
mode
mode confidence
release
release 7digitalid
sections confidence
sections start
segments confidence
segments loudness max
segments loudness max time
segments loudness max start
segments pitches
segments start
segments timbre
similar artists
song hotttnesss
song id
start of fade out
tatums confidence
tatums start
tempo
time signature
time signature confidence
title
track id
track 7digitalid
year
analysis sample rate
artist 7digitalid
artist familiarity
artist hotttnesss
artist id
artist latitude
artist location
artist longitude
artist mbid
artist mbtags
artist mbtags count
artist name
artist playmeid
artist terms
artist terms freq
artist terms weight
audio md5
bars confidence
bars start
beats confidence
beats start
danceability
duration
end of fade in
energy
key
key confidence
key
tempo
Song Fields
Numerical Features
Danceability - how danceable a song is. 0 is least danceable, 100 is most danceable.
Duration - the length of the song in seconds.
Energy - the overall energy of the song, 0 is least, 100 is most.
Hotttnesss - the popularity of the song, 0 is least, 100 is most.
Key - the key the song. 0 is C, 1 is C# and so on.
Liveness - the likelihood that a song was performed in front of an audience. Above 80 is usually live.
Loudness - the overall loudness of the song in decibels.
Mode - the mode of the song where major is 0 and minor is 1.
Speechiness - how much spoken word is in the song. 0 is least, 100 is most
Tempo - the most frequently occurring tempo in the song, in beats-per-minute.
Time signature - the number of beats per measure in the song.
Acousticness how acoustic vs. electric is the song
Valence how positive or negative is the mood of the song
Inspiration came from…
• 8,761 songs
• (ty, API request timeouts & rate limiting)
• 307 genres-extracted from the Artist API
• k-means centroids
• 3,944 artists
• Between 1 - 11 appearances in the set
The Facts
• Use K-means to create centroids for each genre
• Hypothesis: If there are 307 genres
represented, would each be in the same
cluster?
• Create K-nearest neighbor tool to fetch k nearest
songs to some specified datapoint
• f(Tempo, Key, K)
Tasks
K-Means
K-NN
The Future
• There’s a long way to go…
• No one can predict the future…
• MIR is awesome and powerful
• But seriously, K-fold cross validation
–Anat
“Inspirational Quote”
Sources
• http://developer.echonest.com/docs/v4/_static/AnalyzeDocumentation.pdf
• https://github.com/echonest
• https://github.com/tbertinmahieux/MSongsDB
Thank You!

Mais conteúdo relacionado

Mais procurados

Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyChris Johnson
 
Notes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgNotes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgTess Ferrandez
 
Introduction to Keras
Introduction to KerasIntroduction to Keras
Introduction to KerasJohn Ramey
 
Machine learning for creative AI applications in music (2018 nov)
Machine learning for creative AI applications in music (2018 nov)Machine learning for creative AI applications in music (2018 nov)
Machine learning for creative AI applications in music (2018 nov)Yi-Hsuan Yang
 
Machine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyMachine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyChing-Wei Chen
 
Deep Learning: Application & Opportunity
Deep Learning: Application & OpportunityDeep Learning: Application & Opportunity
Deep Learning: Application & OpportunityiTrain
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mirYi-Hsuan Yang
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ SpotifyOscar Carlsson
 
Deep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsDeep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsBuhwan Jeong
 
Tech Circle #23 Next Music Production by Google Magenta
Tech Circle #23 Next Music Productionby Google MagentaTech Circle #23 Next Music Productionby Google Magenta
Tech Circle #23 Next Music Production by Google MagentaTakahiro Kubo
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...Hakka Labs
 
Spotify Discover Weekly: The machine learning behind your music recommendations
Spotify Discover Weekly: The machine learning behind your music recommendationsSpotify Discover Weekly: The machine learning behind your music recommendations
Spotify Discover Weekly: The machine learning behind your music recommendationsSophia Ciocca
 
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드BOAZ Bigdata
 
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6Demian Antony DMello
 
3. M2M and IoT - Technology Fundamentals
3. M2M and IoT - Technology Fundamentals3. M2M and IoT - Technology Fundamentals
3. M2M and IoT - Technology FundamentalsJitendra Tomar
 
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectDeep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectKeunwoo Choi
 
Audio Fingerprinting Introduction
Audio Fingerprinting IntroductionAudio Fingerprinting Introduction
Audio Fingerprinting IntroductionVikesh Khanna
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 

Mais procurados (20)

Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
Notes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgNotes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew Ng
 
Introduction to Keras
Introduction to KerasIntroduction to Keras
Introduction to Keras
 
Machine learning for creative AI applications in music (2018 nov)
Machine learning for creative AI applications in music (2018 nov)Machine learning for creative AI applications in music (2018 nov)
Machine learning for creative AI applications in music (2018 nov)
 
1_Introduction
1_Introduction1_Introduction
1_Introduction
 
Machine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyMachine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at Spotify
 
Deep Learning: Application & Opportunity
Deep Learning: Application & OpportunityDeep Learning: Application & Opportunity
Deep Learning: Application & Opportunity
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ Spotify
 
Deep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsDeep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applications
 
Tech Circle #23 Next Music Production by Google Magenta
Tech Circle #23 Next Music Productionby Google MagentaTech Circle #23 Next Music Productionby Google Magenta
Tech Circle #23 Next Music Production by Google Magenta
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
 
Spotify Discover Weekly: The machine learning behind your music recommendations
Spotify Discover Weekly: The machine learning behind your music recommendationsSpotify Discover Weekly: The machine learning behind your music recommendations
Spotify Discover Weekly: The machine learning behind your music recommendations
 
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드
제 19회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [HAUL의 움직이는 리포트] : 투자성향 기반 주식 추천 및 기업 정보 제공 대시보드
 
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6
Python Programming ADP VTU CSE 18CS55 Module 2 Chapter 6
 
3. M2M and IoT - Technology Fundamentals
3. M2M and IoT - Technology Fundamentals3. M2M and IoT - Technology Fundamentals
3. M2M and IoT - Technology Fundamentals
 
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectDeep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
 
Audio Fingerprinting Introduction
Audio Fingerprinting IntroductionAudio Fingerprinting Introduction
Audio Fingerprinting Introduction
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 
Jason vs leatherface 02 de 03
Jason vs leatherface 02 de 03Jason vs leatherface 02 de 03
Jason vs leatherface 02 de 03
 

Semelhante a Genre Classification and Analysis

Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findabilitySten Govaerts
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music PlaylistsKeunwoo Choi
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Rui Pedro Paiva
 
Introduction musictech
Introduction musictechIntroduction musictech
Introduction musictechJia Liu
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classificationxiaohusmile
 
Capturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid ApproachCapturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid ApproachEnrico Daga
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010ocor203
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualizationicchp2012
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova musicNAVER D2
 
Big Data Research - Pandara
Big Data Research - Pandara Big Data Research - Pandara
Big Data Research - Pandara Leeya Ressom
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicJAYALAKSHMI NAIR
 
Searching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in musicSearching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in musicDiane Rasmussen Pennington
 
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...Annotating Music Collections: How Content-Based Similarity Helps to Propagate...
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...Oscar Celma
 
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiOThe convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiOThe Research Council of Norway, IKTPLUSS
 
Gracenote: Music Navigation and Discovery
Gracenote: Music Navigation and DiscoveryGracenote: Music Navigation and Discovery
Gracenote: Music Navigation and Discoveryswissnex San Francisco
 

Semelhante a Genre Classification and Analysis (20)

MIR
MIRMIR
MIR
 
Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findability
 
楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music Playlists
 
Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008Music Information Retrieval: Overview and Current Trends 2008
Music Information Retrieval: Overview and Current Trends 2008
 
Introduction musictech
Introduction musictechIntroduction musictech
Introduction musictech
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
 
Capturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid ApproachCapturing Themed Evidence, a Hybrid Approach
Capturing Themed Evidence, a Hybrid Approach
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
 
Let the Computer Do the Work
Let the Computer Do the WorkLet the Computer Do the Work
Let the Computer Do the Work
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualization
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
Big Data Research - Pandara
Big Data Research - Pandara Big Data Research - Pandara
Big Data Research - Pandara
 
Dancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop musicDancing Machine to Doesn't Really Matter: Analysis of pop music
Dancing Machine to Doesn't Really Matter: Analysis of pop music
 
Enfuse_QS.pdf
Enfuse_QS.pdfEnfuse_QS.pdf
Enfuse_QS.pdf
 
Searching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in musicSearching for the right feelings: Emotional metadata in music
Searching for the right feelings: Emotional metadata in music
 
Setlistör Demo Slides
Setlistör Demo SlidesSetlistör Demo Slides
Setlistör Demo Slides
 
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...Annotating Music Collections: How Content-Based Similarity Helps to Propagate...
Annotating Music Collections: How Content-Based Similarity Helps to Propagate...
 
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiOThe convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
The convergence of "hard" and "soft"in music technology, Rolf Inge Godøy, UiO
 
Gracenote: Music Navigation and Discovery
Gracenote: Music Navigation and DiscoveryGracenote: Music Navigation and Discovery
Gracenote: Music Navigation and Discovery
 

Último

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Último (20)

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Genre Classification and Analysis

  • 2. The Journey Music Information Retrieval 101 Constructing approaches to a not-so-well-defined problem Finding good data Simplifying the problem Data visualization Finding not-so- good data Fall 2014 Today!Iterating Machine Learning 101 Spring 2015
  • 3. Let’s find out… • What makes one song similar to another? • What are the characteristics by which we can “classify” the genre of a song? The Problem
  • 4. • Supervised • Regression (Linear, Logistic, Ridge) • Classification (Bagging, Naive Bayes, SVM, NN, KNN) • Unsupervised • Dimension Reduction (PCA) • Clustering (K-means, GMM/EM, Hierarchical) Machine Learning 101
  • 5. Music Information Retrieval 101 • Aims to extend the understanding and usefulness of music data, through research, development and application of computational approaches and tools • Combines concepts and techniques from music, computer science, signal processing and cognition • Music information: bibliographical, surveys, tags, scores, MIDI, audio, etc
  • 6. Adoption • USPOP2002 • Magnatagatune • CAL500 • RWC MDB • International Society for Music Information Retrieval (ISMIR) 2011 Dataset
  • 7. • Collection of audio features and metadata for 1,000,000 contemporary popular music tracks. • 44,745 unique artists w/dated tracks starting from 1922 • 10,000 song subset (1%, 1.8 gb) • Each song has a number of features… The Million Song Dataset
  • 8. loudness mode mode confidence release release 7digitalid sections confidence sections start segments confidence segments loudness max segments loudness max time segments loudness max start segments pitches segments start segments timbre similar artists song hotttnesss song id start of fade out tatums confidence tatums start tempo time signature time signature confidence title track id track 7digitalid year analysis sample rate artist 7digitalid artist familiarity artist hotttnesss artist id artist latitude artist location artist longitude artist mbid artist mbtags artist mbtags count artist name artist playmeid artist terms artist terms freq artist terms weight audio md5 bars confidence bars start beats confidence beats start danceability duration end of fade in energy key key confidence key tempo Song Fields
  • 9. Numerical Features Danceability - how danceable a song is. 0 is least danceable, 100 is most danceable. Duration - the length of the song in seconds. Energy - the overall energy of the song, 0 is least, 100 is most. Hotttnesss - the popularity of the song, 0 is least, 100 is most. Key - the key the song. 0 is C, 1 is C# and so on. Liveness - the likelihood that a song was performed in front of an audience. Above 80 is usually live. Loudness - the overall loudness of the song in decibels. Mode - the mode of the song where major is 0 and minor is 1. Speechiness - how much spoken word is in the song. 0 is least, 100 is most Tempo - the most frequently occurring tempo in the song, in beats-per-minute. Time signature - the number of beats per measure in the song. Acousticness how acoustic vs. electric is the song Valence how positive or negative is the mood of the song
  • 11. • 8,761 songs • (ty, API request timeouts & rate limiting) • 307 genres-extracted from the Artist API • k-means centroids • 3,944 artists • Between 1 - 11 appearances in the set The Facts
  • 12. • Use K-means to create centroids for each genre • Hypothesis: If there are 307 genres represented, would each be in the same cluster? • Create K-nearest neighbor tool to fetch k nearest songs to some specified datapoint • f(Tempo, Key, K) Tasks
  • 14. K-NN
  • 15. The Future • There’s a long way to go… • No one can predict the future… • MIR is awesome and powerful • But seriously, K-fold cross validation

Notas do Editor

  1. It’s been a long ride to get here.
  2. Expectation-maximization Hierarchical
  3. non-profit organisation which, among other things, oversees the organisation of the ISMIR Conference. The ISMIR conference is held annually and is the world's leading research forum on processing, searching, organising and accessing music-related data. six original collections: the Popular Music Database (100 songs), Royalty-Free Music Database (15 songs), Classical Music Database (50 pieces), Jazz Music Database (50 pieces), Music Genre Database (100 pieces), and Musical Instrument Sound Database (50 instruments)
  4. MSD is a freely-available collection of audio features and metadata for a million contemporary popular music tracks. By the way, this is metadata…I didn’t casually download 10,000 songs and make a hadoop cluster to compute, although this could potentially go there… Each song has a number of features but we’re interested in
  5. I met an engineer who represented Spotify,
  6. Not entirely sure why Aerosmith and Red Hot Chilly Peppers have 11 songs, but maybe it’s because they came out with more songs, too.
  7. Not entirely sure why Aerosmith and Red Hot Chilly Peppers have 11 songs, but maybe it’s because they came out with more songs, too.