1. Metric Learning for Clustering
SCC5945 - Semi-Supervised and Unsupervised Analysis of Patterns in Data
(Seminar)
Sidgley Camargo de Andrade
PhD student in computer science
Institute of Mathematics and Computer Sciences
University of São Paulo
June 2016
3. Constraint-based algorithms
How can we help unsupervised algorithms find better
solutions?
Constraint-based methods – e.g. background knowledge
given as pairwise constraints Wagstaff et al. (2001)
Con= ⊆ D × D : must-link constraints
Con≠ ⊆ D × D : cannot-link constraints
Active- and self-learning
Other . . .
Are there “problems” with the algorithms above?
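For instance, a clustering can be checked against a set of pairwise constraints with a small helper function (a hypothetical illustration, not code from Wagstaff et al. (2001)):

```python
def violated_constraints(labels, must_link, cannot_link):
    """Return the pairwise constraints violated by a clustering.

    labels: cluster label of each point; must_link / cannot_link:
    lists of index pairs (i, j).
    """
    bad = [(i, j) for i, j in must_link if labels[i] != labels[j]]
    bad += [(i, j) for i, j in cannot_link if labels[i] == labels[j]]
    return bad

# points 0 and 1 must share a cluster; points 0 and 2 must not
print(violated_constraints([0, 0, 1], [(0, 1)], [(0, 2)]))  # []
print(violated_constraints([0, 1, 1], [(0, 1)], [(0, 2)]))  # [(0, 1)]
```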
5. Metrics
Metrics capture the relationships between the data (e.g.
Euclidean distance, Mahalanobis distance, etc.)
What is the right metric?
There are few systematic mechanisms for tweaking distance
metrics, and they are often tuned by hand Xing et al. (2003).
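As an illustration, a Mahalanobis-style distance ||x − y||_A reduces to the Euclidean distance when A = I, and a hand-picked A changes which directions count – exactly the kind of manual tuning noted above (the matrix A here is an assumed example):

```python
import numpy as np

x = np.array([1.0, 0.0])
y = np.array([0.0, 1.0])

# Euclidean distance: every feature direction is weighted equally
d_euclidean = np.linalg.norm(x - y)        # sqrt(2) ≈ 1.414

# Mahalanobis-style distance ||x - y||_A with a hand-picked PSD matrix A
# that down-weights the first feature (an assumed example, tuned "by hand")
A = np.diag([0.25, 1.0])
diff = x - y
d_mahalanobis = np.sqrt(diff @ A @ diff)   # sqrt(1.25) ≈ 1.118
```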
6. Metric learning for clustering
Assumption: keeping dissimilar points far from each other and
similar points close to each other reduces the risk of errors.
Xing et al. (2003)
Suppose a user indicates that certain points in an input space (say,
ℝⁿ) are considered by them to be “similar” (or “dissimilar”). Can we
automatically learn a distance metric over ℝⁿ that respects these
relationships, i.e., one that assigns small distances to the
similar pairs and larger distances otherwise?
Learn a metric d : ℝⁿ × ℝⁿ → ℝ over the input space.
7. Problem
A simple approach is to require that similar pairs (must-link) have a
small distance between them, whereas dissimilar pairs (cannot-link)
have a larger distance between them
d(x, y) = d_A(x, y) = ||x − y||_A = sqrt((x − y)^T A (x − y))
min_A Σ_{(x_i, x_j) ∈ S} ||x_i − x_j||²_A
s.t. Σ_{(x_i, x_j) ∈ D} ||x_i − x_j||²_A ≥ c
A ⪰ 0
where A ⪰ 0 is a constraint that the symmetric matrix A must be
positive semi-definite – making d_A a “pseudo-metric” – and c is an
arbitrary positive constant (e.g. c = 1)
1. Question for class – why must the constant c be positive?
2. Question for class – how can this be transformed into a maximization problem?
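A minimal projected-gradient sketch of the optimization above can be written as follows (the function names, learning rate, and stopping rule are assumptions for illustration; this is not Xing et al. (2003)'s actual Newton-based solver):

```python
import numpy as np

def project_psd(A):
    # project a symmetric matrix onto the PSD cone: clip negative eigenvalues
    w, V = np.linalg.eigh(A)
    return (V * np.clip(w, 0.0, None)) @ V.T

def learn_metric(X, S, D, c=1.0, lr=0.01, steps=300):
    """Sketch: minimize the total squared A-distance over similar pairs S
    while keeping the total over dissimilar pairs D above c, with A PSD."""
    n = X.shape[1]
    A = np.eye(n)
    for _ in range(steps):
        grad = np.zeros((n, n))
        for i, j in S:                  # pull similar pairs together
            d = X[i] - X[j]
            grad += np.outer(d, d)
        total_D = sum((X[i] - X[j]) @ A @ (X[i] - X[j]) for i, j in D)
        if total_D < c:                 # constraint violated: push D apart
            for i, j in D:
                d = X[i] - X[j]
                grad -= np.outer(d, d)
        A = project_psd(A - lr * grad)  # gradient step, then PSD projection
    return A
```

With similar pairs differing along the first axis and dissimilar pairs along the second, the learned A down-weights the first feature, shrinking distances between must-link points.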
9. Metric Pairwise Constraint K-means
(MPCK-means)
Assumes a separate matrix A_h (metric) for each cluster h
Bilenko et al. (2004)
Permits the specification of an individual weight for each constraint
(f_M and f_C); the penalty for constraint violations is proportional to
the violated constraint's weight
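As an illustration, a single greedy assignment pass with per-cluster metrics and weighted constraint penalties might look like this (a simplified, hypothetical sketch; the function and parameter names are not from Bilenko et al. (2004)):

```python
import numpy as np

def assign_points(X, centers, metrics, ml, cl, labels, w_ml=1.0, w_cl=1.0):
    """One greedy assignment pass in the spirit of MPCK-means.

    Each point picks the cluster h minimizing its Mahalanobis distance
    under that cluster's metric A_h, plus weighted penalties for the
    constraints the choice would violate. ml / cl map a point index to
    its must-link / cannot-link partners.
    """
    for i, x in enumerate(X):
        costs = []
        for h, (mu, A) in enumerate(zip(centers, metrics)):
            d = x - mu
            cost = d @ A @ d
            # penalty per must-link partner currently in another cluster
            cost += w_ml * sum(labels[j] != h for j in ml.get(i, ()))
            # penalty per cannot-link partner currently in this cluster
            cost += w_cl * sum(labels[j] == h for j in cl.get(i, ()))
            costs.append(cost)
        labels[i] = int(np.argmin(costs))
    return labels
```

A strong cannot-link penalty can pull a point away from its nearest center, which plain k-means would never do.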
12. References
Basu, S., Davidson, I., and Wagstaff, K. (2008). Constrained Clustering:
Advances in Algorithms, Theory, and Applications. Chapman &
Hall/CRC, 1 edition.
Bilenko, M., Basu, S., and Mooney, R. J. (2004). Integrating constraints
and metric learning in semi-supervised clustering. In Proceedings of
the Twenty-first International Conference on Machine Learning, ICML
’04, pages 11–, New York, NY, USA. ACM.
Wagstaff, K., Cardie, C., Rogers, S., and Schrödl, S. (2001). Constrained
k-means clustering with background knowledge. In Proceedings of the
Eighteenth International Conference on Machine Learning, ICML ’01,
pages 577–584, San Francisco, CA, USA. Morgan Kaufmann
Publishers Inc.
Xing, E. P., Ng, A. Y., Jordan, M. I., and Russell, S. (2003). Distance
metric learning, with application to clustering with side-information. In
Advances in Neural Information Processing Systems, pages 505–512.
MIT Press.