SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Glocalized Weisfeiler-Lehman Graph Kernels:
Local-Global Feature Maps of Graphs
IEEE ICDM 2017
Christopher Morris, Kristian Kersting, Petra Mutzel
20. November 2017
TU Dortmund University, Algorithm Engineering Group
TU Darmstadt, Machine Learning Group
Motivation
Question
How similar are two graphs?
(a) Sildenafil (b) Vardenafil
1
High-level View: Supervised Graph Classification
2
High-level View: Supervised Graph Classification
⊆ H
φ: G → H
2
High-level View: Supervised Graph Classification
⊆ H
φ: G → H
2
Primer on Graph Kernels
Question
How similar are two graphs?
3
Primer on Graph Kernels
Question
How similar are two graphs?
Definition (Graph Kernel)
Let 𝒢 be a non-empty set of graphs and let k: 𝒢 × 𝒢 → R. Then k is
a graph kernel if there is a Hilbert space ℋ and a feature map
𝜑: 𝒢 → ℋ such that k(G, H) = ⟨𝜑(G), 𝜑(H)⟩.
3
Example: Weisfeiler-Lehman Subtree Kernel
Idea
Graph kernel based on well-known heuristic for graph
isomorphism testing: 1-WL or color refinement
Iteration: Two vertices get same colors iff if they have the same
colored neighborhood
N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt.
“Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011),
pp. 2539–2561 4
Example: Weisfeiler-Lehman Subtree Kernel
Idea
Graph kernel based on well-known heuristic for graph
isomorphism testing: 1-WL or color refinement
Iteration: Two vertices get same colors iff if they have the same
colored neighborhood
𝜑(G1) = ( )
(a) G1
𝜑(G2) = ( )
(b) G2
N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt.
“Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011),
pp. 2539–2561 4
Example: Weisfeiler-Lehman Subtree Kernel
Idea
Graph kernel based on well-known heuristic for graph
isomorphism testing: 1-WL or color refinement
Iteration: Two vertices get same colors iff if they have the same
colored neighborhood
𝜑(G1) = (2, 2, 2, )
(a) G1
𝜑(G2) = (1, 1, 3, )
(b) G2
N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt.
“Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011),
pp. 2539–2561 4
Example: Weisfeiler-Lehman Subtree Kernel
Idea
Graph kernel based on well-known heuristic for graph
isomorphism testing: 1-WL or color refinement
Iteration: Two vertices get same colors iff if they have the same
colored neighborhood
𝜑(G1) = (2, 2, 2, 2, 2, 2, 0, 0)
(a) G1
𝜑(G2) = (1, 1, 3, 2, 0, 1, 1, 1)
(b) G2
N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt.
“Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011),
pp. 2539–2561 4
Global vs. Local Graph Properties
Observation
Most graph kernels only take local graph properties into account,
e.g., they look at h-neighborhood around vertices.
h
5
Global vs. Local Graph Properties
Observation
Most graph kernels only take local graph properties into account,
e.g., they look at h-neighborhood around vertices.
h
Challenge
Design a scalable graph kernel that can take local as well global
graph properties into account.
5
Talk Structure
1 k-Dimensional Weisfeiler-Lehman
2 A Local Kernel Based on the k-dim. WL
3 Approximation Algorithms
4 Experimental Evaluation
6
k-Dimensional Weisfeiler-Lehman
k-dimensional Weisfeiler-Lehman
• Colors vertex tuples from Vk
• Two tuples v, w are i-neighbors if vj = wj for all j ̸= i
Idea of the Algorithm
Initially Initially two k-tuples v, w get the same color if vi ↦→ wi
induces a (graph) isomorphism between G[v] and G[w]
Iteration Two tuples with the same color get different colors if
there exists a color c and 1 ≤ i ≤ k such that v and w
have different i-neighbors of color c 7
Local k-dimensional WL
Idea
Define “local neighborhood” by taking underlying graph structure
into account.
8
Local k-dimensional WL
Idea
Define “local neighborhood” by taking underlying graph structure
into account.
v1 v2 v3
v4 v5 v6
(a) Subset of local neighborhood.
v1 v2 v3
v4 v5 v6
(b) Subset of global neighborhood.
8
Local k-dimensional WL
Idea
Define “local neighborhood” by taking underlying graph structure
into account.
v1 v2 v3
v4 v5 v6
(a) Subset of local neighborhood.
v1 v2 v3
v4 v5 v6
(b) Subset of global neighborhood.
Advantages
1 Considers “local” properties
2 Respects sparsity of original graph
3 Can be approximated by sampling 8
Scalability: Approximation by Sampling
Problem
Algorithm does not scale.
9
Scalability: Approximation by Sampling
Problem
Algorithm does not scale.
Solution
Approximate feature vector after h iterations by sampling.
9
Scalability: Approximation by Sampling
Problem
Algorithm does not scale.
Solution
Approximate feature vector after h iterations by sampling.
Highlevel Idea of Algorithm
1 Sample a number of subsets of size k
2 Explore h-neighborhood around each such set
3 Compute algorithm on each h-neighborhood
9
Scalability: Approximation by Sampling
Question
Why does this lead to correct results?
10
Scalability: Approximation by Sampling
Question
Why does this lead to correct results?
t
1
2
3
0
Insight
Color of central k-set t after h iterations is correct. 10
Scalability: Approximation by Sampling
Theorem (Informal)
With high probability the sampling algorithm approximates the
(normalized) feature vector of the local k-dimension WL such that
⃦
⃦
⃦̂︀𝜑k-LWL(G) − ̃︀𝜑k-LWL(G)
⃦
⃦
⃦
1
≤ 𝜀1 .
For bounded-degree graphs the running time is independent of the
size of the graph, i.e. the number of nodes and edges.
11
Scalability: Approximation by Sampling
Theorem (Informal)
Given a finite set 𝒢 of graphs. With high probability the sampling
algorithm approximate the kernel function of the local k-dimension
WL such that
sup
G,H∈𝒢
⃒
⃒
⃒̂︀kh
k-LWL(G, H) − ̃︀kh
k-LWL(G, H)
⃒
⃒
⃒ ≤ 𝜖2 .
For bounded-degree graphs the running time is independent of the
size of the graph, i.e. the number of nodes and edges.
12
Scalability: Approximation by Sampling
Problems
1 Algorithm is restricted to bounded-degree graphs!
2 How do we compute the sample size for general graphs?
13
Scalability: Approximation by Sampling
Problems
1 Algorithm is restricted to bounded-degree graphs!
2 How do we compute the sample size for general graphs?
Solution: Adaptive Sampling Algorithm
while Desired accurracy is not reached do
Increase sample size
Compute h neighborhoods for new sample
Compute algorithm in each h-neighborhood
end while
13
Scalability: Approximation by Adaptive Sampling
Theorem (Informal)
Let G be a graph, then the above procedure approximates the
normalized feature vector ̂︀𝜑k-LWL(G) of the k-LWL for h iterations
such that with high probability
sup
l∈Σ
⃒
⃒
⃒̂︀𝜑k-LWL(G)l − ̃︀𝜑k-LWL(G)l
⃒
⃒
⃒ ≤ 𝜀3 .
14
Scalability: Approximation by Adaptive Sampling
Theorem (Informal)
Let G be a graph, then the above procedure approximates the
normalized feature vector ̂︀𝜑k-LWL(G) of the k-LWL for h iterations
such that with high probability
sup
l∈Σ
⃒
⃒
⃒̂︀𝜑k-LWL(G)l − ̃︀𝜑k-LWL(G)l
⃒
⃒
⃒ ≤ 𝜀3 .
Remark
Proof relies on self-bounding properties of bounds based on
conditional Rademacher Averages.
14
Experimental Evaluation: Classification Accurary
PROTEINS
REDDIT
ENZYMES
IMDB-BINARY NCI1
MUTAG
0
10
20
30
40
50
60
70
80
90ClassificationAccuracy
3-LWL
1-LWL
3-GWL
15
Experimental Evaluation: Running Times
3-LWL-SP(0.1)
3-LWL-S(0.1)
3-LWL-SP(0.05)
3-LWL-S(0.05)
3-LWL-L
3-LWL-P
3-LWL
Algorithm
0
1000
2000
3000
4000
5000
6000
7000
8000
RunningTimes[s]
PROTEINS
16
Conclusion
1 Graph kernel based on k-dimensional Weisfeiler-Lehman
• Considers local as well as global graph properties
2 Approximation algorithms based on sampling
• Constant running time for bounded-degree graphs
• Adaptive sampling algorithm for general graphs
3 Promising experimental results
Collection of Graph Classification Benchmarks
graphkernels.cs.tu-dortmund.de
17

Mais conteúdo relacionado

Mais procurados

Presentation-Alex-20150421
Presentation-Alex-20150421Presentation-Alex-20150421
Presentation-Alex-20150421alexfang2014
 
Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)Bean Yen
 
Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sangwoo Mo
 
Group homomorphism
Group homomorphismGroup homomorphism
Group homomorphismNaliniSPatil
 
Introduction to Genetic Algorithms and Evolutionary Computation
Introduction to Genetic Algorithms and Evolutionary ComputationIntroduction to Genetic Algorithms and Evolutionary Computation
Introduction to Genetic Algorithms and Evolutionary ComputationAleksander Stensby
 
Multi-Agent Reinforcement Learning
Multi-Agent Reinforcement LearningMulti-Agent Reinforcement Learning
Multi-Agent Reinforcement LearningSeolhokim
 
Wasserstein GAN 수학 이해하기 I
Wasserstein GAN 수학 이해하기 IWasserstein GAN 수학 이해하기 I
Wasserstein GAN 수학 이해하기 ISungbin Lim
 
Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningKhang Pham
 
ShuffleNet - PR054
ShuffleNet - PR054ShuffleNet - PR054
ShuffleNet - PR054Jinwon Lee
 
Flow based generative models
Flow based generative modelsFlow based generative models
Flow based generative models수철 박
 
An introduction to reinforcement learning (rl)
An introduction to reinforcement learning (rl)An introduction to reinforcement learning (rl)
An introduction to reinforcement learning (rl)pauldix
 
Detailed Description on Cross Entropy Loss Function
Detailed Description on Cross Entropy Loss FunctionDetailed Description on Cross Entropy Loss Function
Detailed Description on Cross Entropy Loss Function범준 김
 
Introduction to the theory of computation
Introduction to the theory of computationIntroduction to the theory of computation
Introduction to the theory of computationprasadmvreddy
 
An introduction to reinforcement learning
An introduction to reinforcement learningAn introduction to reinforcement learning
An introduction to reinforcement learningSubrat Panda, PhD
 

Mais procurados (20)

Presentation-Alex-20150421
Presentation-Alex-20150421Presentation-Alex-20150421
Presentation-Alex-20150421
 
Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)Reinforcement learning:policy gradient (part 1)
Reinforcement learning:policy gradient (part 1)
 
Support vector machine
Support vector machineSupport vector machine
Support vector machine
 
Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)
 
Group homomorphism
Group homomorphismGroup homomorphism
Group homomorphism
 
Introduction to Genetic Algorithms and Evolutionary Computation
Introduction to Genetic Algorithms and Evolutionary ComputationIntroduction to Genetic Algorithms and Evolutionary Computation
Introduction to Genetic Algorithms and Evolutionary Computation
 
Multi-Agent Reinforcement Learning
Multi-Agent Reinforcement LearningMulti-Agent Reinforcement Learning
Multi-Agent Reinforcement Learning
 
CatBoost intro
CatBoost   introCatBoost   intro
CatBoost intro
 
Wasserstein GAN 수학 이해하기 I
Wasserstein GAN 수학 이해하기 IWasserstein GAN 수학 이해하기 I
Wasserstein GAN 수학 이해하기 I
 
Recurrent Neural Network
Recurrent Neural NetworkRecurrent Neural Network
Recurrent Neural Network
 
Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep Learning
 
ShuffleNet - PR054
ShuffleNet - PR054ShuffleNet - PR054
ShuffleNet - PR054
 
Capsule networks
Capsule networksCapsule networks
Capsule networks
 
Flow based generative models
Flow based generative modelsFlow based generative models
Flow based generative models
 
Geometric algorithms
Geometric algorithmsGeometric algorithms
Geometric algorithms
 
An introduction to reinforcement learning (rl)
An introduction to reinforcement learning (rl)An introduction to reinforcement learning (rl)
An introduction to reinforcement learning (rl)
 
Policy gradient
Policy gradientPolicy gradient
Policy gradient
 
Detailed Description on Cross Entropy Loss Function
Detailed Description on Cross Entropy Loss FunctionDetailed Description on Cross Entropy Loss Function
Detailed Description on Cross Entropy Loss Function
 
Introduction to the theory of computation
Introduction to the theory of computationIntroduction to the theory of computation
Introduction to the theory of computation
 
An introduction to reinforcement learning
An introduction to reinforcement learningAn introduction to reinforcement learning
An introduction to reinforcement learning
 

Semelhante a Glocalized Weisfeiler-Lehman Graph Kernels: Global-Local Feature Maps of Graphs

Incremental and parallel computation of structural graph summaries for evolvi...
Incremental and parallel computation of structural graph summaries for evolvi...Incremental and parallel computation of structural graph summaries for evolvi...
Incremental and parallel computation of structural graph summaries for evolvi...Till Blume
 
Minimizing cost in distributed multiquery processing applications
Minimizing cost in distributed multiquery processing applicationsMinimizing cost in distributed multiquery processing applications
Minimizing cost in distributed multiquery processing applicationsLuis Galárraga
 
MLIP - Chapter 6 - Generation, Super-Resolution, Style transfer
MLIP - Chapter 6 - Generation, Super-Resolution, Style transferMLIP - Chapter 6 - Generation, Super-Resolution, Style transfer
MLIP - Chapter 6 - Generation, Super-Resolution, Style transferCharles Deledalle
 
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defense
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defenseLarge-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defense
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defenseAapo Kyrölä
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsDavid Gleich
 
Graph theory concepts complex networks presents-rouhollah nabati
Graph theory concepts   complex networks presents-rouhollah nabatiGraph theory concepts   complex networks presents-rouhollah nabati
Graph theory concepts complex networks presents-rouhollah nabatinabati
 
Svm map reduce_slides
Svm map reduce_slidesSvm map reduce_slides
Svm map reduce_slidesSara Asher
 
Lego like spheres and tori, enumeration and drawings
Lego like spheres and tori, enumeration and drawingsLego like spheres and tori, enumeration and drawings
Lego like spheres and tori, enumeration and drawingsMathieu Dutour Sikiric
 
141205 graphulo ingraphblas
141205 graphulo ingraphblas141205 graphulo ingraphblas
141205 graphulo ingraphblasgraphulo
 
141222 graphulo ingraphblas
141222 graphulo ingraphblas141222 graphulo ingraphblas
141222 graphulo ingraphblasMIT
 
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...ssuser2624f71
 
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...thanhdowork
 
2012 mdsp pr08 nonparametric approach
2012 mdsp pr08 nonparametric approach2012 mdsp pr08 nonparametric approach
2012 mdsp pr08 nonparametric approachnozomuhamada
 

Semelhante a Glocalized Weisfeiler-Lehman Graph Kernels: Global-Local Feature Maps of Graphs (20)

Incremental and parallel computation of structural graph summaries for evolvi...
Incremental and parallel computation of structural graph summaries for evolvi...Incremental and parallel computation of structural graph summaries for evolvi...
Incremental and parallel computation of structural graph summaries for evolvi...
 
Minimizing cost in distributed multiquery processing applications
Minimizing cost in distributed multiquery processing applicationsMinimizing cost in distributed multiquery processing applications
Minimizing cost in distributed multiquery processing applications
 
06 mlp
06 mlp06 mlp
06 mlp
 
MLIP - Chapter 6 - Generation, Super-Resolution, Style transfer
MLIP - Chapter 6 - Generation, Super-Resolution, Style transferMLIP - Chapter 6 - Generation, Super-Resolution, Style transfer
MLIP - Chapter 6 - Generation, Super-Resolution, Style transfer
 
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defense
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defenseLarge-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defense
Large-Scale Graph Computation on Just a PC: Aapo Kyrola Ph.D. thesis defense
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphs
 
Exhaustive Combinatorial Enumeration
Exhaustive Combinatorial EnumerationExhaustive Combinatorial Enumeration
Exhaustive Combinatorial Enumeration
 
graph theory
graph theorygraph theory
graph theory
 
Graph theory concepts complex networks presents-rouhollah nabati
Graph theory concepts   complex networks presents-rouhollah nabatiGraph theory concepts   complex networks presents-rouhollah nabati
Graph theory concepts complex networks presents-rouhollah nabati
 
Svm map reduce_slides
Svm map reduce_slidesSvm map reduce_slides
Svm map reduce_slides
 
Lego like spheres and tori, enumeration and drawings
Lego like spheres and tori, enumeration and drawingsLego like spheres and tori, enumeration and drawings
Lego like spheres and tori, enumeration and drawings
 
gSpan algorithm
 gSpan algorithm gSpan algorithm
gSpan algorithm
 
gSpan algorithm
gSpan algorithmgSpan algorithm
gSpan algorithm
 
Planted Clique Research Paper
Planted Clique Research PaperPlanted Clique Research Paper
Planted Clique Research Paper
 
141205 graphulo ingraphblas
141205 graphulo ingraphblas141205 graphulo ingraphblas
141205 graphulo ingraphblas
 
141222 graphulo ingraphblas
141222 graphulo ingraphblas141222 graphulo ingraphblas
141222 graphulo ingraphblas
 
Graph Coloring using Peer-to-Peer Networks
Graph Coloring using Peer-to-Peer NetworksGraph Coloring using Peer-to-Peer Networks
Graph Coloring using Peer-to-Peer Networks
 
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
 
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
 
2012 mdsp pr08 nonparametric approach
2012 mdsp pr08 nonparametric approach2012 mdsp pr08 nonparametric approach
2012 mdsp pr08 nonparametric approach
 

Último

Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Cherry
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cherry
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsbassianu17
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptxArvind Kumar
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCherry
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....muralinath2
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot GirlsKanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot GirlsDeepika Singh
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxCherry
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptxCherry
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxANSARKHAN96
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 

Último (20)

Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot GirlsKanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Early Development of Mammals (Mouse and Human).pdf
Early Development of Mammals (Mouse and Human).pdfEarly Development of Mammals (Mouse and Human).pdf
Early Development of Mammals (Mouse and Human).pdf
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 

Glocalized Weisfeiler-Lehman Graph Kernels: Global-Local Feature Maps of Graphs

  • 1. Glocalized Weisfeiler-Lehman Graph Kernels: Local-Global Feature Maps of Graphs IEEE ICDM 2017 Christopher Morris, Kristian Kersting, Petra Mutzel 20. November 2017 TU Dortmund University, Algorithm Engineering Group TU Darmstadt, Machine Learning Group
  • 2. Motivation Question How similar are two graphs? (a) Sildenafil (b) Vardenafil 1
  • 3. High-level View: Supervised Graph Classification 2
  • 4. High-level View: Supervised Graph Classification ⊆ H φ: G → H 2
  • 5. High-level View: Supervised Graph Classification ⊆ H φ: G → H 2
  • 6. Primer on Graph Kernels Question How similar are two graphs? 3
  • 7. Primer on Graph Kernels Question How similar are two graphs? Definition (Graph Kernel) Let 𝒢 be a non-empty set of graphs and let k: 𝒢 × 𝒢 → R. Then k is a graph kernel if there is a Hilbert space ℋ and a feature map 𝜑: 𝒢 → ℋ such that k(G, H) = ⟨𝜑(G), 𝜑(H)⟩. 3
  • 8. Example: Weisfeiler-Lehman Subtree Kernel Idea Graph kernel based on well-known heuristic for graph isomorphism testing: 1-WL or color refinement Iteration: Two vertices get same colors iff if they have the same colored neighborhood N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt. “Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011), pp. 2539–2561 4
  • 9. Example: Weisfeiler-Lehman Subtree Kernel Idea Graph kernel based on well-known heuristic for graph isomorphism testing: 1-WL or color refinement Iteration: Two vertices get same colors iff if they have the same colored neighborhood 𝜑(G1) = ( ) (a) G1 𝜑(G2) = ( ) (b) G2 N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt. “Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011), pp. 2539–2561 4
  • 10. Example: Weisfeiler-Lehman Subtree Kernel Idea Graph kernel based on well-known heuristic for graph isomorphism testing: 1-WL or color refinement Iteration: Two vertices get same colors iff if they have the same colored neighborhood 𝜑(G1) = (2, 2, 2, ) (a) G1 𝜑(G2) = (1, 1, 3, ) (b) G2 N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt. “Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011), pp. 2539–2561 4
  • 11. Example: Weisfeiler-Lehman Subtree Kernel Idea Graph kernel based on well-known heuristic for graph isomorphism testing: 1-WL or color refinement Iteration: Two vertices get same colors iff if they have the same colored neighborhood 𝜑(G1) = (2, 2, 2, 2, 2, 2, 0, 0) (a) G1 𝜑(G2) = (1, 1, 3, 2, 0, 1, 1, 1) (b) G2 N. Shervashidze, P. Schweitzer, E. J. van Leeuwen, K. Mehlhorn, and K. M. Borgwardt. “Weisfeiler-Lehman Graph Kernels”. In: Journal of Machine Learning Research 12 (2011), pp. 2539–2561 4
  • 12. Global vs. Local Graph Properties Observation Most graph kernels only take local graph properties into account, e.g., they look at h-neighborhood around vertices. h 5
  • 13. Global vs. Local Graph Properties Observation Most graph kernels only take local graph properties into account, e.g., they look at h-neighborhood around vertices. h Challenge Design a scalable graph kernel that can take local as well global graph properties into account. 5
  • 14. Talk Structure 1 k-Dimensional Weisfeiler-Lehman 2 A Local Kernel Based on the k-dim. WL 3 Approximation Algorithms 4 Experimental Evaluation 6
  • 15. k-Dimensional Weisfeiler-Lehman k-dimensional Weisfeiler-Lehman • Colors vertex tuples from Vk • Two tuples v, w are i-neighbors if vj = wj for all j ̸= i Idea of the Algorithm Initially Initially two k-tuples v, w get the same color if vi ↦→ wi induces a (graph) isomorphism between G[v] and G[w] Iteration Two tuples with the same color get different colors if there exists a color c and 1 ≤ i ≤ k such that v and w have different i-neighbors of color c 7
  • 16. Local k-dimensional WL Idea Define “local neighborhood” by taking underlying graph structure into account. 8
  • 17. Local k-dimensional WL Idea Define “local neighborhood” by taking underlying graph structure into account. v1 v2 v3 v4 v5 v6 (a) Subset of local neighborhood. v1 v2 v3 v4 v5 v6 (b) Subset of global neighborhood. 8
  • 18. Local k-dimensional WL Idea Define “local neighborhood” by taking underlying graph structure into account. v1 v2 v3 v4 v5 v6 (a) Subset of local neighborhood. v1 v2 v3 v4 v5 v6 (b) Subset of global neighborhood. Advantages 1 Considers “local” properties 2 Respects sparsity of original graph 3 Can be approximated by sampling 8
  • 19. Scalability: Approximation by Sampling Problem Algorithm does not scale. 9
  • 20. Scalability: Approximation by Sampling Problem Algorithm does not scale. Solution Approximate feature vector after h iterations by sampling. 9
  • 21. Scalability: Approximation by Sampling Problem Algorithm does not scale. Solution Approximate feature vector after h iterations by sampling. Highlevel Idea of Algorithm 1 Sample a number of subsets of size k 2 Explore h-neighborhood around each such set 3 Compute algorithm on each h-neighborhood 9
  • 22. Scalability: Approximation by Sampling Question Why does this lead to correct results? 10
  • 23. Scalability: Approximation by Sampling Question Why does this lead to correct results? t 1 2 3 0 Insight Color of central k-set t after h iterations is correct. 10
  • 24. Scalability: Approximation by Sampling Theorem (Informal) With high probability the sampling algorithm approximates the (normalized) feature vector of the local k-dimension WL such that ⃦ ⃦ ⃦̂︀𝜑k-LWL(G) − ̃︀𝜑k-LWL(G) ⃦ ⃦ ⃦ 1 ≤ 𝜀1 . For bounded-degree graphs the running time is independent of the size of the graph, i.e. the number of nodes and edges. 11
  • 25. Scalability: Approximation by Sampling Theorem (Informal) Given a finite set 𝒢 of graphs. With high probability the sampling algorithm approximate the kernel function of the local k-dimension WL such that sup G,H∈𝒢 ⃒ ⃒ ⃒̂︀kh k-LWL(G, H) − ̃︀kh k-LWL(G, H) ⃒ ⃒ ⃒ ≤ 𝜖2 . For bounded-degree graphs the running time is independent of the size of the graph, i.e. the number of nodes and edges. 12
  • 26. Scalability: Approximation by Sampling Problems 1 Algorithm is restricted to bounded-degree graphs! 2 How do we compute the sample size for general graphs? 13
  • 27. Scalability: Approximation by Sampling Problems 1 Algorithm is restricted to bounded-degree graphs! 2 How do we compute the sample size for general graphs? Solution: Adaptive Sampling Algorithm while Desired accurracy is not reached do Increase sample size Compute h neighborhoods for new sample Compute algorithm in each h-neighborhood end while 13
  • 28. Scalability: Approximation by Adaptive Sampling Theorem (Informal) Let G be a graph, then the above procedure approximates the normalized feature vector ̂︀𝜑k-LWL(G) of the k-LWL for h iterations such that with high probability sup l∈Σ ⃒ ⃒ ⃒̂︀𝜑k-LWL(G)l − ̃︀𝜑k-LWL(G)l ⃒ ⃒ ⃒ ≤ 𝜀3 . 14
  • 29. Scalability: Approximation by Adaptive Sampling Theorem (Informal) Let G be a graph, then the above procedure approximates the normalized feature vector ̂︀𝜑k-LWL(G) of the k-LWL for h iterations such that with high probability sup l∈Σ ⃒ ⃒ ⃒̂︀𝜑k-LWL(G)l − ̃︀𝜑k-LWL(G)l ⃒ ⃒ ⃒ ≤ 𝜀3 . Remark Proof relies on self-bounding properties of bounds based on conditional Rademacher Averages. 14
  • 30. Experimental Evaluation: Classification Accurary PROTEINS REDDIT ENZYMES IMDB-BINARY NCI1 MUTAG 0 10 20 30 40 50 60 70 80 90ClassificationAccuracy 3-LWL 1-LWL 3-GWL 15
  • 31. Experimental Evaluation: Running Times 3-LWL-SP(0.1) 3-LWL-S(0.1) 3-LWL-SP(0.05) 3-LWL-S(0.05) 3-LWL-L 3-LWL-P 3-LWL Algorithm 0 1000 2000 3000 4000 5000 6000 7000 8000 RunningTimes[s] PROTEINS 16
  • 32. Conclusion 1 Graph kernel based on k-dimensional Weisfeiler-Lehman • Considers local as well as global graph properties 2 Approximation algorithms based on sampling • Constant running time for bounded-degree graphs • Adaptive sampling algorithm for general graphs 3 Promising experimental results Collection of Graph Classification Benchmarks graphkernels.cs.tu-dortmund.de 17