SlideShare uma empresa Scribd logo
1 de 15
Baixar para ler offline
Notes on Spectral Clustering
             Politecnico di Milano, 28/05/2012




                        Davide Eynard
    Institute of Computational Sciences - Faculty of Informatics
                  Università della Svizzera Italiana
                      davide.eynard@usi.ch
Talk outline

    Spectral Clustering
            Distances and similarity graphs
            Graph Laplacians and their properties
            Spectral clustering algorithms
            SC under the hood




24/04/2011                  Notes on Spectral Clustering and Joint Diagonalization   2/15
Similarity graph

    The objective of a clustering algorithm is partitioning
     data into groups such that:
            Points in the same group are similar
            Points in different groups are dissimilar


    Similarity graph G=(V,E)                                                       (undirected graph)

            Vertices vi and vj are connected by a weighted edge if their
             similarity is above a given threshold
            GOAL: find a partition of the graph such that:
                 edges within a group have high weights
                 edges across different groups have low weights
24/04/2011                     Notes on Spectral Clustering and Joint Diagonalization               3/15
Weighted adjacency matrix

    Let G(V,E) be an undirected graph with vertex set
     V={v1,...,vn}
    Weighted adjacency matrix W=(wij )i,j=1,...,n
            wij≥0 is the weight of the edge between vi and vj
            wij=0 means that vi and vj are not connected by an edge
            wij=wji

    Degree of a vertex vi∈V:                          di=∑j=1..nwij
    Degree matrix D=diag(d1,...,dn)

24/04/2011                  Notes on Spectral Clustering and Joint Diagonalization   4/15
Different similarity graphs
 
     ε-neighborhood
            Connect all points whose pairwise distance is less than ε
    k-nearest neighbors
            if vi ∈ knn(vj) OR vj ∈ knn(vi)
            if vi ∈ knn(vj) AND vj ∈ knn(vi)                                          (mutual knn)
            after connecting edges, use similarity as weight
    fully connected
            all points with similarity sij>0 are connected
            To control neighborhoods to be local, use a similarity
             function like the Gaussian: s(xi,xj)=exp(-║xi-xj║2/(2σ2))
24/04/2011                    Notes on Spectral Clustering and Joint Diagonalization                  5/15
Graph Laplacians

    Graph Laplacian:
            L=D–W                       (symmetric and positive semi-definite)
    Properties
            Smallest eigenvalue λ1=0 with eigenvector = �
            n non-negative, real-valued eigenvalues 0=λ1≤λ2≤...≤λn
            the multiplicty k of the eigenvalue 0 of L equals the number
             of connected components A1,...,Ak in the graph




24/04/2011                  Notes on Spectral Clustering and Joint Diagonalization   6/15
Spectral Clustering algorithm (1)

     Spectral Clustering algorithm
     Input: Similarity matrix S ∈ ℝn×n, number k of clusters to construct.
       1. Construct a similarity graph as previously described. Let W be its
          weighted adjacency matrix.
       2. Compute the unnormalized Laplacian L
       3. Compute the first k eigenvectors u1,…,uk of L

       4. Let U ∈ ℝn×k be the matrix containing the vectors u1,…,uk as columns

       5. For i=1,...,n let yi ∈ ℝk be the vector corresponding to the i-th row of U

       6. Cluster the points (yi)i=1,...,n in ℝk with the k-means algorithm into clusters
          C1,...,Ck.

     Output: Clusters A1,...,Ak with Ai= {j | yj ∈ Ci}.

24/04/2011                    Notes on Spectral Clustering and Joint Diagonalization   7/15
Normalized Graph Laplacians
    Normalized graph Laplacians
            Symmetric: Lsym=D-1/2LD-1/2 = I-D-1/2WD-1/2
            Random Walk: Lrw=D-1L=I-D-1W
    Properties
            λ is an eigenvalue of Lrw with eigenvector u iff λ is an
             eigenvalue of Lsym with eigenvector w=D1/2u
            λ is an eigenvalue of Lrw with eigenvector u iff λ and u solve
             the generalized eigenproblem Lu=λDu




24/04/2011                   Notes on Spectral Clustering and Joint Diagonalization   8/15
Normalized Graph Laplacians
    Normalized graph Laplacians
            Symmetric: Lsym=D-1/2LD-1/2 = I-D-1/2WD-1/2
            Random Walk: Lrw=D-1L=I-D-1W
    Properties (follow)
            0 is an eigenvalue of Lrw with � as eigenvector, and an
             eigenvalue of Lsym with eigenvector D1/2�.
            Lsym and Lrw are positive semi-definite and have n non-
             negative, real-valued eigenvalues 0=λ1≤λ2≤...≤λn
            the multiplicty k of the eigenvalue 0 of both Lsym and Lrw
             equals the number of connected components A1,...,Ak
24/04/2011                   Notes on Spectral Clustering and Joint Diagonalization   9/15
Spectral Clustering algorithm (2)

     Normalized Spectral Clustering
     Input: Similarity matrix S ∈ ℝn×n, number k of clusters to construct.
    Lrw:

       3. Compute the first k generalized eigenvectors u1,…,uk of the generalized
          eigenproblem Lu=λDu
    Lsym:

       2. Compute the normalized Laplacian Lsym

       3. Compute the first k eigenvectors u1,…,uk of Lsym

       4. normalize the eigenvectors


     Output: Clusters A1,...,Ak with Ai= {j | yj ∈ Ci}.
24/04/2011                  Notes on Spectral Clustering and Joint Diagonalization   10/15
A spectral clustering example




24/04/2011   Notes on Spectral Clustering and Joint Diagonalization   11/15
Under the hood

    0-eigenvalues in the ideal case
    parameters are crucial:
            k in k nearest neighbors
       
             σ in Gaussian kernel
            k (another one!) in k-means




24/04/2011                  Notes on Spectral Clustering and Joint Diagonalization   12/15
Random Walk point of view

    Random walk: stochastic process which randomly
     jumps from one vertex to another
            Clustering: finding a partition such that a random walk stays
             long within a cluster and seldom jumps between clusters
    Transition probability pij:=wij/di
    Transition matrix: P = D-1W                            =>            Lrw= I - P
            λ is an eigenvalue of Lrw with eigenvector u iff 1-λ is an
             eigenvalue of P with eigenvector u



24/04/2011                   Notes on Spectral Clustering and Joint Diagonalization    13/15
References

    Von Luxburg, U. (2007). A tutorial on spectral clustering.
     Statistics and Computing, 17(4), 395-416. Springer.




24/04/2011              Notes on Spectral Clustering and Joint Diagonalization   14/15
Thank you!



             Thanks for your attention!

                        Questions?




24/04/2011       Notes on Spectral Clustering and Joint Diagonalization   15/15

Mais conteúdo relacionado

Mais procurados

K means and dbscan
K means and dbscanK means and dbscan
K means and dbscanYan Xu
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureRajesh Piryani
 
directed-research-report
directed-research-reportdirected-research-report
directed-research-reportRyen Krusinga
 
Intro to MATLAB and K-mean algorithm
Intro to MATLAB and K-mean algorithmIntro to MATLAB and K-mean algorithm
Intro to MATLAB and K-mean algorithmkhalid Shah
 
K-means Clustering
K-means ClusteringK-means Clustering
K-means ClusteringAnna Fensel
 
Enhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial DatasetEnhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial DatasetAlaaZ
 
K means clustering | K Means ++
K means clustering | K Means ++K means clustering | K Means ++
K means clustering | K Means ++sabbirantor
 
Clustering: Large Databases in data mining
Clustering: Large Databases in data miningClustering: Large Databases in data mining
Clustering: Large Databases in data miningZHAO Sam
 
DBSCAN : A Clustering Algorithm
DBSCAN : A Clustering AlgorithmDBSCAN : A Clustering Algorithm
DBSCAN : A Clustering AlgorithmPınar Yahşi
 
Visualization using tSNE
Visualization using tSNEVisualization using tSNE
Visualization using tSNEYan Xu
 
K means clustering
K means clusteringK means clustering
K means clusteringThomas K T
 
Ram minimum spanning tree
Ram   minimum spanning treeRam   minimum spanning tree
Ram minimum spanning treeRama Prasath A
 
Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sangwoo Mo
 
3.4 density and grid methods
3.4 density and grid methods3.4 density and grid methods
3.4 density and grid methodsKrish_ver2
 
Clustering techniques
Clustering techniquesClustering techniques
Clustering techniquestalktoharry
 
PCA (Principal component analysis) Theory and Toolkits
PCA (Principal component analysis) Theory and ToolkitsPCA (Principal component analysis) Theory and Toolkits
PCA (Principal component analysis) Theory and ToolkitsHopeBay Technologies, Inc.
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Yan Xu
 

Mais procurados (20)

K means and dbscan
K means and dbscanK means and dbscan
K means and dbscan
 
Data miningpresentation
Data miningpresentationData miningpresentation
Data miningpresentation
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structure
 
directed-research-report
directed-research-reportdirected-research-report
directed-research-report
 
Intro to MATLAB and K-mean algorithm
Intro to MATLAB and K-mean algorithmIntro to MATLAB and K-mean algorithm
Intro to MATLAB and K-mean algorithm
 
K-means Clustering
K-means ClusteringK-means Clustering
K-means Clustering
 
Pca ppt
Pca pptPca ppt
Pca ppt
 
Enhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial DatasetEnhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial Dataset
 
K means clustering | K Means ++
K means clustering | K Means ++K means clustering | K Means ++
K means clustering | K Means ++
 
Clustering: Large Databases in data mining
Clustering: Large Databases in data miningClustering: Large Databases in data mining
Clustering: Large Databases in data mining
 
DBSCAN : A Clustering Algorithm
DBSCAN : A Clustering AlgorithmDBSCAN : A Clustering Algorithm
DBSCAN : A Clustering Algorithm
 
Visualization using tSNE
Visualization using tSNEVisualization using tSNE
Visualization using tSNE
 
K means clustering
K means clusteringK means clustering
K means clustering
 
Ram minimum spanning tree
Ram   minimum spanning treeRam   minimum spanning tree
Ram minimum spanning tree
 
Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)
 
Pca ankita dubey
Pca ankita dubeyPca ankita dubey
Pca ankita dubey
 
3.4 density and grid methods
3.4 density and grid methods3.4 density and grid methods
3.4 density and grid methods
 
Clustering techniques
Clustering techniquesClustering techniques
Clustering techniques
 
PCA (Principal component analysis) Theory and Toolkits
PCA (Principal component analysis) Theory and ToolkitsPCA (Principal component analysis) Theory and Toolkits
PCA (Principal component analysis) Theory and Toolkits
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering
 

Destaque

Laplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsLaplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsDavide Eynard
 
icml2004 tutorial on spectral clustering part I
icml2004 tutorial on spectral clustering part Iicml2004 tutorial on spectral clustering part I
icml2004 tutorial on spectral clustering part Izukun
 
icml2004 tutorial on spectral clustering part II
icml2004 tutorial on spectral clustering part IIicml2004 tutorial on spectral clustering part II
icml2004 tutorial on spectral clustering part IIzukun
 
Distributed approximate spectral clustering for large scale datasets
Distributed approximate spectral clustering for large scale datasetsDistributed approximate spectral clustering for large scale datasets
Distributed approximate spectral clustering for large scale datasetsBita Kazemi
 
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloud
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloudA secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloud
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloudPvrtechnologies Nellore
 
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB MongoDB
 
Privacy preserving multi-keyword ranked search over encrypted cloud data
Privacy preserving multi-keyword ranked search over encrypted cloud dataPrivacy preserving multi-keyword ranked search over encrypted cloud data
Privacy preserving multi-keyword ranked search over encrypted cloud dataIGEEKS TECHNOLOGIES
 
Eigen value and eigen vector
Eigen value and eigen vectorEigen value and eigen vector
Eigen value and eigen vectorRutvij Patel
 
Spectral clustering - Houston ML Meetup
Spectral clustering - Houston ML MeetupSpectral clustering - Houston ML Meetup
Spectral clustering - Houston ML MeetupYan Xu
 
Data security in cloud computing
Data security in cloud computingData security in cloud computing
Data security in cloud computingPrince Chandu
 

Destaque (12)

Laplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsLaplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformations
 
icml2004 tutorial on spectral clustering part I
icml2004 tutorial on spectral clustering part Iicml2004 tutorial on spectral clustering part I
icml2004 tutorial on spectral clustering part I
 
icml2004 tutorial on spectral clustering part II
icml2004 tutorial on spectral clustering part IIicml2004 tutorial on spectral clustering part II
icml2004 tutorial on spectral clustering part II
 
Introduction to Sparse Methods
Introduction to Sparse Methods Introduction to Sparse Methods
Introduction to Sparse Methods
 
Distributed approximate spectral clustering for large scale datasets
Distributed approximate spectral clustering for large scale datasetsDistributed approximate spectral clustering for large scale datasets
Distributed approximate spectral clustering for large scale datasets
 
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloud
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloudA secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloud
A secure-anti-collusion-data-sharing-scheme-for-dynamic-groups-in-the-cloud
 
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB
Webinar: Adobe Experience Manager Clustering Made Easy on MongoDB
 
Privacy preserving multi-keyword ranked search over encrypted cloud data
Privacy preserving multi-keyword ranked search over encrypted cloud dataPrivacy preserving multi-keyword ranked search over encrypted cloud data
Privacy preserving multi-keyword ranked search over encrypted cloud data
 
Eigen value and eigen vector
Eigen value and eigen vectorEigen value and eigen vector
Eigen value and eigen vector
 
Spectral clustering - Houston ML Meetup
Spectral clustering - Houston ML MeetupSpectral clustering - Houston ML Meetup
Spectral clustering - Houston ML Meetup
 
Cloud Computing Security Issues
Cloud Computing Security Issues Cloud Computing Security Issues
Cloud Computing Security Issues
 
Data security in cloud computing
Data security in cloud computingData security in cloud computing
Data security in cloud computing
 

Semelhante a Notes on Spectral Clustering

20070823
2007082320070823
20070823neostar
 
An Introduction to Spectral Graph Theory
An Introduction to Spectral Graph TheoryAn Introduction to Spectral Graph Theory
An Introduction to Spectral Graph Theoryjoisino
 
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdf
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdfhttp://old.nathalievilla.org/IMG/pdf/Presentation-27.pdf
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdftuxette
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelstuxette
 
Density theorems for Euclidean point configurations
Density theorems for Euclidean point configurationsDensity theorems for Euclidean point configurations
Density theorems for Euclidean point configurationsVjekoslavKovac1
 
A discussion on sampling graphs to approximate network classification functions
A discussion on sampling graphs to approximate network classification functionsA discussion on sampling graphs to approximate network classification functions
A discussion on sampling graphs to approximate network classification functionsLARCA UPC
 
On Convolution of Graph Signals and Deep Learning on Graph Domains
On Convolution of Graph Signals and Deep Learning on Graph DomainsOn Convolution of Graph Signals and Deep Learning on Graph Domains
On Convolution of Graph Signals and Deep Learning on Graph DomainsJean-Charles Vialatte
 
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdf
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdfEigen-Decomposition: Eigenvalues and Eigenvectors.pdf
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdfNehaVerma933923
 
Proportional and decentralized rule mcst games
Proportional and decentralized rule mcst gamesProportional and decentralized rule mcst games
Proportional and decentralized rule mcst gamesvinnief
 
Graph theory concepts complex networks presents-rouhollah nabati
Graph theory concepts   complex networks presents-rouhollah nabatiGraph theory concepts   complex networks presents-rouhollah nabati
Graph theory concepts complex networks presents-rouhollah nabatinabati
 
Webinar on Graph Neural Networks
Webinar on Graph Neural NetworksWebinar on Graph Neural Networks
Webinar on Graph Neural NetworksLucaCrociani1
 
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...Ceni Babaoglu, PhD
 
Steven Duplij, "Polyadic Hopf algebras and quantum groups"
Steven Duplij, "Polyadic Hopf algebras and quantum groups"Steven Duplij, "Polyadic Hopf algebras and quantum groups"
Steven Duplij, "Polyadic Hopf algebras and quantum groups"Steven Duplij (Stepan Douplii)
 
Jones matrix formulation by the unitary matrix
Jones matrix formulation by the unitary matrixJones matrix formulation by the unitary matrix
Jones matrix formulation by the unitary matrixSachidanandChikkpeti
 
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...SUTAPAMUKHERJEE12
 

Semelhante a Notes on Spectral Clustering (20)

Graph Partitioning and Spectral Methods
Graph Partitioning and Spectral MethodsGraph Partitioning and Spectral Methods
Graph Partitioning and Spectral Methods
 
EE8120_Projecte_15
EE8120_Projecte_15EE8120_Projecte_15
EE8120_Projecte_15
 
Graph Theory
Graph TheoryGraph Theory
Graph Theory
 
20070823
2007082320070823
20070823
 
An Introduction to Spectral Graph Theory
An Introduction to Spectral Graph TheoryAn Introduction to Spectral Graph Theory
An Introduction to Spectral Graph Theory
 
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdf
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdfhttp://old.nathalievilla.org/IMG/pdf/Presentation-27.pdf
http://old.nathalievilla.org/IMG/pdf/Presentation-27.pdf
 
project report(1)
project report(1)project report(1)
project report(1)
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction models
 
Density theorems for Euclidean point configurations
Density theorems for Euclidean point configurationsDensity theorems for Euclidean point configurations
Density theorems for Euclidean point configurations
 
A discussion on sampling graphs to approximate network classification functions
A discussion on sampling graphs to approximate network classification functionsA discussion on sampling graphs to approximate network classification functions
A discussion on sampling graphs to approximate network classification functions
 
On Convolution of Graph Signals and Deep Learning on Graph Domains
On Convolution of Graph Signals and Deep Learning on Graph DomainsOn Convolution of Graph Signals and Deep Learning on Graph Domains
On Convolution of Graph Signals and Deep Learning on Graph Domains
 
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdf
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdfEigen-Decomposition: Eigenvalues and Eigenvectors.pdf
Eigen-Decomposition: Eigenvalues and Eigenvectors.pdf
 
26 spanning
26 spanning26 spanning
26 spanning
 
Proportional and decentralized rule mcst games
Proportional and decentralized rule mcst gamesProportional and decentralized rule mcst games
Proportional and decentralized rule mcst games
 
Graph theory concepts complex networks presents-rouhollah nabati
Graph theory concepts   complex networks presents-rouhollah nabatiGraph theory concepts   complex networks presents-rouhollah nabati
Graph theory concepts complex networks presents-rouhollah nabati
 
Webinar on Graph Neural Networks
Webinar on Graph Neural NetworksWebinar on Graph Neural Networks
Webinar on Graph Neural Networks
 
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...
4. Linear Algebra for Machine Learning: Eigenvalues, Eigenvectors and Diagona...
 
Steven Duplij, "Polyadic Hopf algebras and quantum groups"
Steven Duplij, "Polyadic Hopf algebras and quantum groups"Steven Duplij, "Polyadic Hopf algebras and quantum groups"
Steven Duplij, "Polyadic Hopf algebras and quantum groups"
 
Jones matrix formulation by the unitary matrix
Jones matrix formulation by the unitary matrixJones matrix formulation by the unitary matrix
Jones matrix formulation by the unitary matrix
 
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...
PPT on Graph Theory ( Tree, Cotree, nodes, branches, incidence , tie set and ...
 

Mais de Davide Eynard

Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsDavide Eynard
 
An integrated approach to discover tag semantics
An integrated approach to discover tag semanticsAn integrated approach to discover tag semantics
An integrated approach to discover tag semanticsDavide Eynard
 
SAnno: a unifying framework for semantic annotation
SAnno: a unifying framework for semantic annotationSAnno: a unifying framework for semantic annotation
SAnno: a unifying framework for semantic annotationDavide Eynard
 
A Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationA Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationDavide Eynard
 
ReSearch - Searching for Researchers
ReSearch - Searching for ResearchersReSearch - Searching for Researchers
ReSearch - Searching for ResearchersDavide Eynard
 
PhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsPhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsDavide Eynard
 
Exploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationExploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationDavide Eynard
 
Performance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsPerformance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsDavide Eynard
 
Cracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsCracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsDavide Eynard
 
Fast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonFast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonDavide Eynard
 
Unambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesUnambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesDavide Eynard
 
Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systemsDavide Eynard
 

Mais de Davide Eynard (14)

Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and Manifolds
 
An integrated approach to discover tag semantics
An integrated approach to discover tag semanticsAn integrated approach to discover tag semantics
An integrated approach to discover tag semantics
 
SAnno: a unifying framework for semantic annotation
SAnno: a unifying framework for semantic annotationSAnno: a unifying framework for semantic annotation
SAnno: a unifying framework for semantic annotation
 
A Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationA Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and Participation
 
Talk Hpl
Talk HplTalk Hpl
Talk Hpl
 
ReSearch - Searching for Researchers
ReSearch - Searching for ResearchersReSearch - Searching for Researchers
ReSearch - Searching for Researchers
 
PhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsPhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD Students
 
Exploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationExploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotation
 
Performance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsPerformance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection Systems
 
Cracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsCracking Codes With Genetic Algorithms
Cracking Codes With Genetic Algorithms
 
Rewire the Net
Rewire the NetRewire the Net
Rewire the Net
 
Fast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonFast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparison
 
Unambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesUnambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional Languages
 
Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systems
 

Último

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Último (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Notes on Spectral Clustering

  • 1. Notes on Spectral Clustering Politecnico di Milano, 28/05/2012 Davide Eynard Institute of Computational Sciences - Faculty of Informatics Università della Svizzera Italiana davide.eynard@usi.ch
  • 2. Talk outline  Spectral Clustering  Distances and similarity graphs  Graph Laplacians and their properties  Spectral clustering algorithms  SC under the hood 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 2/15
  • 3. Similarity graph  The objective of a clustering algorithm is partitioning data into groups such that:  Points in the same group are similar  Points in different groups are dissimilar  Similarity graph G=(V,E) (undirected graph)  Vertices vi and vj are connected by a weighted edge if their similarity is above a given threshold  GOAL: find a partition of the graph such that:  edges within a group have high weights  edges across different groups have low weights 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 3/15
  • 4. Weighted adjacency matrix  Let G(V,E) be an undirected graph with vertex set V={v1,...,vn}  Weighted adjacency matrix W=(wij )i,j=1,...,n  wij≥0 is the weight of the edge between vi and vj  wij=0 means that vi and vj are not connected by an edge  wij=wji  Degree of a vertex vi∈V: di=∑j=1..nwij  Degree matrix D=diag(d1,...,dn) 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 4/15
  • 5. Different similarity graphs  ε-neighborhood  Connect all points whose pairwise distance is less than ε  k-nearest neighbors  if vi ∈ knn(vj) OR vj ∈ knn(vi)  if vi ∈ knn(vj) AND vj ∈ knn(vi) (mutual knn)  after connecting edges, use similarity as weight  fully connected  all points with similarity sij>0 are connected  To control neighborhoods to be local, use a similarity function like the Gaussian: s(xi,xj)=exp(-║xi-xj║2/(2σ2)) 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 5/15
  • 6. Graph Laplacians  Graph Laplacian:  L=D–W (symmetric and positive semi-definite)  Properties  Smallest eigenvalue λ1=0 with eigenvector = �  n non-negative, real-valued eigenvalues 0=λ1≤λ2≤...≤λn  the multiplicty k of the eigenvalue 0 of L equals the number of connected components A1,...,Ak in the graph 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 6/15
  • 7. Spectral Clustering algorithm (1) Spectral Clustering algorithm Input: Similarity matrix S ∈ ℝn×n, number k of clusters to construct. 1. Construct a similarity graph as previously described. Let W be its weighted adjacency matrix. 2. Compute the unnormalized Laplacian L 3. Compute the first k eigenvectors u1,…,uk of L 4. Let U ∈ ℝn×k be the matrix containing the vectors u1,…,uk as columns 5. For i=1,...,n let yi ∈ ℝk be the vector corresponding to the i-th row of U 6. Cluster the points (yi)i=1,...,n in ℝk with the k-means algorithm into clusters C1,...,Ck. Output: Clusters A1,...,Ak with Ai= {j | yj ∈ Ci}. 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 7/15
  • 8. Normalized Graph Laplacians  Normalized graph Laplacians  Symmetric: Lsym=D-1/2LD-1/2 = I-D-1/2WD-1/2  Random Walk: Lrw=D-1L=I-D-1W  Properties  λ is an eigenvalue of Lrw with eigenvector u iff λ is an eigenvalue of Lsym with eigenvector w=D1/2u  λ is an eigenvalue of Lrw with eigenvector u iff λ and u solve the generalized eigenproblem Lu=λDu 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 8/15
  • 9. Normalized Graph Laplacians  Normalized graph Laplacians  Symmetric: Lsym=D-1/2LD-1/2 = I-D-1/2WD-1/2  Random Walk: Lrw=D-1L=I-D-1W  Properties (follow)  0 is an eigenvalue of Lrw with � as eigenvector, and an eigenvalue of Lsym with eigenvector D1/2�.  Lsym and Lrw are positive semi-definite and have n non- negative, real-valued eigenvalues 0=λ1≤λ2≤...≤λn  the multiplicty k of the eigenvalue 0 of both Lsym and Lrw equals the number of connected components A1,...,Ak 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 9/15
  • 10. Spectral Clustering algorithm (2) Normalized Spectral Clustering Input: Similarity matrix S ∈ ℝn×n, number k of clusters to construct.  Lrw: 3. Compute the first k generalized eigenvectors u1,…,uk of the generalized eigenproblem Lu=λDu  Lsym: 2. Compute the normalized Laplacian Lsym 3. Compute the first k eigenvectors u1,…,uk of Lsym 4. normalize the eigenvectors Output: Clusters A1,...,Ak with Ai= {j | yj ∈ Ci}. 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 10/15
  • 11. A spectral clustering example 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 11/15
  • 12. Under the hood  0-eigenvalues in the ideal case  parameters are crucial:  k in k nearest neighbors  σ in Gaussian kernel  k (another one!) in k-means 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 12/15
  • 13. Random Walk point of view  Random walk: stochastic process which randomly jumps from one vertex to another  Clustering: finding a partition such that a random walk stays long within a cluster and seldom jumps between clusters  Transition probability pij:=wij/di  Transition matrix: P = D-1W => Lrw= I - P  λ is an eigenvalue of Lrw with eigenvector u iff 1-λ is an eigenvalue of P with eigenvector u 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 13/15
  • 14. References  Von Luxburg, U. (2007). A tutorial on spectral clustering. Statistics and Computing, 17(4), 395-416. Springer. 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 14/15
  • 15. Thank you! Thanks for your attention! Questions? 24/04/2011 Notes on Spectral Clustering and Joint Diagonalization 15/15