SlideShare a Scribd company logo
1 of 20
MixHop: Higher-Order Graph Convolutional
Architectures via Sparsified Neighborhood Mixing
Reader:
LeapMind, DL Engineer
Jira JINDALERTUDOMDEE
LeapMind ICML2019 Reading Session
1. Paper Info
2. Background
3. Limitation of current method
4. Proposed method
5. Experiment
6. Conclusion
2
Contents
LeapMind Inc. © 2019
Paper Info
3
● “MixHop: Higher-Order Graph Convolutional Architectures via
Sparsified Neighborhood Mixing”
○ ICML 2019
○ Link to paper here
○ Author: Sami Abu-El-Haija, et al.
○ Figures and tables from the paper have been used in these slides to explain the
result of the experiment and show the examples of network architecture.
● Why did I choose this paper ?
○ Graph can represent various kinds of data, e.g. chemical compounds, networks
○ Graph convolution is a way to apply NN to graph and solve more problems
LeapMind Inc. © 2019
Background: Graph Neural Network
4
Input: A graph, a set of given classes, features of all nodes
Output: A graph with labeled nodes
LeapMind Inc. © 2019
Background: Notation
5
● Notation used in a graph convolutional network
LeapMind Inc. © 2019
Background: Notation
6
● Notation used in a graph convolutional network
LeapMind Inc. © 2019
Background: Graph convolutional layer
7
● Convolutional layer for graph proposed by Kipf and Welling [1]
[1] Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional networks." arXiv preprint arXiv:1609.02907 (2016).
LeapMind Inc. © 2019
Background: Graph convolutional Layer
8
● Convolutional layer for graph proposed by Kipf and Welling [1]
[1] Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional networks." arXiv preprint arXiv:1609.02907 (2016).
Consider only neighbor’s feature
LeapMind Inc. © 2019
Limitation of normal graph convolutional network
9
Do not consider node’s neighbors
with distance more than one
Cannot be adapted to Gabor-like filters (e.g. edge detection in images)
Modify convolutional layer to receive information of farther nodes in one layer
LeapMind Inc. © 2019
Proposed method: Power of adjacency matrix
10
Â( Â (... (Â H(i))...)) if i ≥ 1
i times
Âi H(i) =
Identity matrix if
i = 0
LeapMind Inc. © 2019
Proposed method: MixHop graph convolutional layer
11
LeapMind Inc. © 2019
Proposed method: Output layer
12
LeapMind Inc. © 2019
Proposed method: MixHop graph convolutional network
13
● Stack graph convolutional layers and end with an output layer
LeapMind Inc. © 2019
Proposed method: Learning network architecture
14
● Train the model with group regularization over each column of weight
total loss = cross-entropy loss(input label, YO) +
LeapMind Inc. © 2019
Proposed method: Learning network architecture
15
● Train the model with group regularization over each column of weight
total loss = cross-entropy loss(input label, YO) +
● Remove all columns less than a threshold
* the authors choose threshold that make #columns = 60
● Restart training using standard L2 regularization instead of group regularization
LeapMind Inc. © 2019
Experiment: Model setting
16
● #convolutional layers: 2
● Optimizer: Gradient descent
● Steps: 2000 or validation accuracy doesn’t improve for 40 steps
● Initial learning: 0.05 decays by 0.0005 every 40 steps
● Weight decay rate: 0.0005
LeapMind Inc. © 2019
Experiment 1: Synthetic dataset
17
● Networks with different homophily are generated based on [2]
homophily = prob. that same labeled nodes attaching to each other
[2] Karimi, Fariba, et al. "Visibility of minorities in social networks." arXiv preprint arXiv:1702.00150 (2017).
Adjacency
matrix with
power > 1
Image by Sami Abu-El-Haija, et al. from the paper “MixHop: Higher-Order Graph Convolutional Architectures via
Sparsified Neighborhood Mixing”, published as a conference paper in ICML 2019
LeapMind Inc. © 2019
Experiment 2: Real world datasets
18
● 3 datasets, Citeseer, Cora, and Pubmed, are used in the evaluation
Accuracy Table
* default architecture: All weights have the same dimension
Table by Sami Abu-El-Haija, et
al. from the paper “MixHop:
Higher-Order Graph
Convolutional Architectures via
Sparsified Neighborhood Mixing”,
published as a conference paper
in ICML 2019
LeapMind Inc. © 2019
Experiment 2: Real world datasets
19
● 3 datasets, Citeseer, Cora, and Pubmed, are used in the evaluation
Example networks
Image by Sami Abu-El-Haija, et al. from the paper “MixHop: Higher-Order Graph Convolutional Architectures via
Sparsified Neighborhood Mixing”, published as a conference paper in ICML 2019
LeapMind Inc. © 2019
Conclusion
20
● A MixHop graph convolutional layer is proposed
○ using multiple adjacency powers so that node receiving info from farther nodes
● The less homophily the graph,
the more impact of MixHop graph convolutional layer
● Learning network architecture is important because
○ finding optimal architecture for given dataset
○ pruning the weight

More Related Content

What's hot

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Ryo Takahashi
 
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
rodrickmero
 

What's hot (20)

Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
 
computer networking
computer networkingcomputer networking
computer networking
 
YOLACT
YOLACTYOLACT
YOLACT
 
cloud compute
cloud computecloud compute
cloud compute
 
N ns 1
N ns 1N ns 1
N ns 1
 
Image classification using CNN
Image classification using CNNImage classification using CNN
Image classification using CNN
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)
 
Aerial detection part2
Aerial detection part2Aerial detection part2
Aerial detection part2
 
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsMethods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
 
DNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review pptDNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review ppt
 
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
 
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
Presentation of Eco-efficient Cloud Computing Framework for Higher Learning I...
 
VIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape EstimationVIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape Estimation
 
GraphSage vs Pinsage #InsideArangoDB
GraphSage vs Pinsage #InsideArangoDBGraphSage vs Pinsage #InsideArangoDB
GraphSage vs Pinsage #InsideArangoDB
 
20120140506024
2012014050602420120140506024
20120140506024
 
virtualization
virtualizationvirtualization
virtualization
 
Summary of survey papers on deep learning method to 3D data
Summary of survey papers on deep learning method to 3D dataSummary of survey papers on deep learning method to 3D data
Summary of survey papers on deep learning method to 3D data
 
A Minimum Spanning Tree Approach of Solving a Transportation Problem
A Minimum Spanning Tree Approach of Solving a Transportation ProblemA Minimum Spanning Tree Approach of Solving a Transportation Problem
A Minimum Spanning Tree Approach of Solving a Transportation Problem
 
Realizing Robust and Scalable Evolutionary Algorithms toward Exascale Era
Realizing Robust and Scalable Evolutionary Algorithms toward Exascale EraRealizing Robust and Scalable Evolutionary Algorithms toward Exascale Era
Realizing Robust and Scalable Evolutionary Algorithms toward Exascale Era
 
Graph Neural Network #2-1 (PinSage)
Graph Neural Network #2-1 (PinSage)Graph Neural Network #2-1 (PinSage)
Graph Neural Network #2-1 (PinSage)
 

Similar to [Icml2019] mix hop higher-order graph convolutional architectures via sparsified neighborhood mixing

Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
miyurud
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
miyurud
 
Enhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulationsEnhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulations
Alexander Pozdneev
 
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docxCOMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
noel23456789
 
MCPL2013 - Social network analyses in organizations: challenges and approache...
MCPL2013 - Social network analyses in organizations: challenges and approache...MCPL2013 - Social network analyses in organizations: challenges and approache...
MCPL2013 - Social network analyses in organizations: challenges and approache...
Vagner Santana
 
COMP1609 (202223)Network and Internet Technologyand DesignF.docx
COMP1609 (202223)Network and Internet Technologyand DesignF.docxCOMP1609 (202223)Network and Internet Technologyand DesignF.docx
COMP1609 (202223)Network and Internet Technologyand DesignF.docx
robert345678
 

Similar to [Icml2019] mix hop higher-order graph convolutional architectures via sparsified neighborhood mixing (20)

ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph AnalysisICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
 
Visual Search and Question Answering II
Visual Search and Question Answering IIVisual Search and Question Answering II
Visual Search and Question Answering II
 
RunPool: A Dynamic Pooling Layer for Convolution Neural Network
RunPool: A Dynamic Pooling Layer for Convolution Neural NetworkRunPool: A Dynamic Pooling Layer for Convolution Neural Network
RunPool: A Dynamic Pooling Layer for Convolution Neural Network
 
Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
Scalable Graph Convolutional Network Based Link Prediction on a Distributed G...
 
Image Segmentation: Approaches and Challenges
Image Segmentation: Approaches and ChallengesImage Segmentation: Approaches and Challenges
Image Segmentation: Approaches and Challenges
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
 
Enhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulationsEnhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulations
 
Exploring attention mechanism for graph similarity learning .pptx
Exploring attention mechanism for graph similarity learning .pptxExploring attention mechanism for graph similarity learning .pptx
Exploring attention mechanism for graph similarity learning .pptx
 
gridcomputing (506).PPTX
gridcomputing (506).PPTXgridcomputing (506).PPTX
gridcomputing (506).PPTX
 
A detailed analysis of the supervised machine Learning Algorithms
A detailed analysis of the supervised machine Learning AlgorithmsA detailed analysis of the supervised machine Learning Algorithms
A detailed analysis of the supervised machine Learning Algorithms
 
Deep Graph Contrastive Representation Learning.pptx
Deep Graph Contrastive Representation Learning.pptxDeep Graph Contrastive Representation Learning.pptx
Deep Graph Contrastive Representation Learning.pptx
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
 
Green cloud computing
Green cloud computing Green cloud computing
Green cloud computing
 
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docxCOMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
COMP1609(2022-23)Network and Internet Technologyand DesignFaculty Head.docx
 
MCPL2013 - Social network analyses in organizations: challenges and approache...
MCPL2013 - Social network analyses in organizations: challenges and approache...MCPL2013 - Social network analyses in organizations: challenges and approache...
MCPL2013 - Social network analyses in organizations: challenges and approache...
 
Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...
Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...
Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...
 
Seongmyung_Jeong_Presentation.pdf
Seongmyung_Jeong_Presentation.pdfSeongmyung_Jeong_Presentation.pdf
Seongmyung_Jeong_Presentation.pdf
 
COMP1609 (202223)Network and Internet Technologyand DesignF.docx
COMP1609 (202223)Network and Internet Technologyand DesignF.docxCOMP1609 (202223)Network and Internet Technologyand DesignF.docx
COMP1609 (202223)Network and Internet Technologyand DesignF.docx
 
Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...Community detection of political blogs network based on structure-attribute g...
Community detection of political blogs network based on structure-attribute g...
 
PointNet
PointNetPointNet
PointNet
 

More from LeapMind Inc

Final presentation optical flow estimation with DL
Final presentation  optical flow estimation with DLFinal presentation  optical flow estimation with DL
Final presentation optical flow estimation with DL
LeapMind Inc
 

More from LeapMind Inc (16)

Final presentation optical flow estimation with DL
Final presentation  optical flow estimation with DLFinal presentation  optical flow estimation with DL
Final presentation optical flow estimation with DL
 
Survey on optical flow estimation with DL
Survey on optical flow estimation with DLSurvey on optical flow estimation with DL
Survey on optical flow estimation with DL
 
[Icml2019]LIT: Learned Intermediate Representation Training for Model Compres...
[Icml2019]LIT: Learned Intermediate Representation Training for Model Compres...[Icml2019]LIT: Learned Intermediate Representation Training for Model Compres...
[Icml2019]LIT: Learned Intermediate Representation Training for Model Compres...
 
エッジ向けDeepLearningプロジェクトで必要なこと
エッジ向けDeepLearningプロジェクトで必要なことエッジ向けDeepLearningプロジェクトで必要なこと
エッジ向けDeepLearningプロジェクトで必要なこと
 
20190227[EDLS]JAL's INNOVATION エアラインのAI活用
20190227[EDLS]JAL's INNOVATION エアラインのAI活用20190227[EDLS]JAL's INNOVATION エアラインのAI活用
20190227[EDLS]JAL's INNOVATION エアラインのAI活用
 
E20190227[EDLS]インテル®︎FPGAによるエッジAI
E20190227[EDLS]インテル®︎FPGAによるエッジAIE20190227[EDLS]インテル®︎FPGAによるエッジAI
E20190227[EDLS]インテル®︎FPGAによるエッジAI
 
20190227[EDLS]進化するAI on Edge 〜 CloudとEdgeの最適な関係
20190227[EDLS]進化するAI on Edge 〜 CloudとEdgeの最適な関係20190227[EDLS]進化するAI on Edge 〜 CloudとEdgeの最適な関係
20190227[EDLS]進化するAI on Edge 〜 CloudとEdgeの最適な関係
 
20180831 [DeLTA TECH] 深く青い脂
20180831 [DeLTA TECH] 深く青い脂20180831 [DeLTA TECH] 深く青い脂
20180831 [DeLTA TECH] 深く青い脂
 
20180831 [DeLTA TECH] 新・深層の世紀 〜第3集 ディープラーニング・時代はAIを求めた 〜
20180831 [DeLTA TECH] 新・深層の世紀 〜第3集 ディープラーニング・時代はAIを求めた 〜20180831 [DeLTA TECH] 新・深層の世紀 〜第3集 ディープラーニング・時代はAIを求めた 〜
20180831 [DeLTA TECH] 新・深層の世紀 〜第3集 ディープラーニング・時代はAIを求めた 〜
 
20180831 [DeLTA TECH] DeLTA-Liteを支える技術(システム構成編)
20180831 [DeLTA TECH] DeLTA-Liteを支える技術(システム構成編)20180831 [DeLTA TECH] DeLTA-Liteを支える技術(システム構成編)
20180831 [DeLTA TECH] DeLTA-Liteを支える技術(システム構成編)
 
20180831 [DeLTA TECH] DeLTA-FamilyによるIndustry4.1
20180831 [DeLTA TECH] DeLTA-FamilyによるIndustry4.120180831 [DeLTA TECH] DeLTA-FamilyによるIndustry4.1
20180831 [DeLTA TECH] DeLTA-FamilyによるIndustry4.1
 
20180613 [TensorFlow分散学習] Horovodによる分散学習の実装方法と解説
20180613 [TensorFlow分散学習] Horovodによる分散学習の実装方法と解説20180613 [TensorFlow分散学習] Horovodによる分散学習の実装方法と解説
20180613 [TensorFlow分散学習] Horovodによる分散学習の実装方法と解説
 
An Introduction of DNN Compression Technology and Hardware Acceleration on FPGA
An Introduction of DNN Compression Technology and Hardware Acceleration on FPGAAn Introduction of DNN Compression Technology and Hardware Acceleration on FPGA
An Introduction of DNN Compression Technology and Hardware Acceleration on FPGA
 
2018年1月19日開催 IoTビジネス共創ラボ 第6回勉強会
2018年1月19日開催 IoTビジネス共創ラボ 第6回勉強会2018年1月19日開催 IoTビジネス共創ラボ 第6回勉強会
2018年1月19日開催 IoTビジネス共創ラボ 第6回勉強会
 
Pitch v2.2
Pitch v2.2Pitch v2.2
Pitch v2.2
 
JUIZ DLK 組込み向けDeep Learningコンパイラ
JUIZ DLK 組込み向けDeep LearningコンパイラJUIZ DLK 組込み向けDeep Learningコンパイラ
JUIZ DLK 組込み向けDeep Learningコンパイラ
 

Recently uploaded

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 

Recently uploaded (20)

Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 

[Icml2019] mix hop higher-order graph convolutional architectures via sparsified neighborhood mixing

  • 1. MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing Reader: LeapMind, DL Engineer Jira JINDALERTUDOMDEE LeapMind ICML2019 Reading Session
  • 2. 1. Paper Info 2. Background 3. Limitation of current method 4. Proposed method 5. Experiment 6. Conclusion 2 Contents
  • 3. LeapMind Inc. © 2019 Paper Info 3 ● “MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing” ○ ICML 2019 ○ Link to paper here ○ Author: Sami Abu-El-Haija, et al. ○ Figures and tables from the paper have been used in these slides to explain the result of the experiment and show the examples of network architecture. ● Why did I choose this paper ? ○ Graph can represent various kinds of data, e.g. chemical compounds, networks ○ Graph convolution is a way to apply NN to graph and solve more problems
  • 4. LeapMind Inc. © 2019 Background: Graph Neural Network 4 Input: A graph, a set of given classes, features of all nodes Output: A graph with labeled nodes
  • 5. LeapMind Inc. © 2019 Background: Notation 5 ● Notation used in a graph convolutional network
  • 6. LeapMind Inc. © 2019 Background: Notation 6 ● Notation used in a graph convolutional network
  • 7. LeapMind Inc. © 2019 Background: Graph convolutional layer 7 ● Convolutional layer for graph proposed by Kipf and Welling [1] [1] Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional networks." arXiv preprint arXiv:1609.02907 (2016).
  • 8. LeapMind Inc. © 2019 Background: Graph convolutional Layer 8 ● Convolutional layer for graph proposed by Kipf and Welling [1] [1] Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional networks." arXiv preprint arXiv:1609.02907 (2016). Consider only neighbor’s feature
  • 9. LeapMind Inc. © 2019 Limitation of normal graph convolutional network 9 Do not consider node’s neighbors with distance more than one Cannot be adapted to Gabor-like filters (e.g. edge detection in images) Modify convolutional layer to receive information of farther nodes in one layer
  • 10. LeapMind Inc. © 2019 Proposed method: Power of adjacency matrix 10 Â( Â (... (Â H(i))...)) if i ≥ 1 i times Âi H(i) = Identity matrix if i = 0
  • 11. LeapMind Inc. © 2019 Proposed method: MixHop graph convolutional layer 11
  • 12. LeapMind Inc. © 2019 Proposed method: Output layer 12
  • 13. LeapMind Inc. © 2019 Proposed method: MixHop graph convolutional network 13 ● Stack graph convolutional layers and end with an output layer
  • 14. LeapMind Inc. © 2019 Proposed method: Learning network architecture 14 ● Train the model with group regularization over each column of weight total loss = cross-entropy loss(input label, YO) +
  • 15. LeapMind Inc. © 2019 Proposed method: Learning network architecture 15 ● Train the model with group regularization over each column of weight total loss = cross-entropy loss(input label, YO) + ● Remove all columns less than a threshold * the authors choose threshold that make #columns = 60 ● Restart training using standard L2 regularization instead of group regularization
  • 16. LeapMind Inc. © 2019 Experiment: Model setting 16 ● #convolutional layers: 2 ● Optimizer: Gradient descent ● Steps: 2000 or validation accuracy doesn’t improve for 40 steps ● Initial learning: 0.05 decays by 0.0005 every 40 steps ● Weight decay rate: 0.0005
  • 17. LeapMind Inc. © 2019 Experiment 1: Synthetic dataset 17 ● Networks with different homophily are generated based on [2] homophily = prob. that same labeled nodes attaching to each other [2] Karimi, Fariba, et al. "Visibility of minorities in social networks." arXiv preprint arXiv:1702.00150 (2017). Adjacency matrix with power > 1 Image by Sami Abu-El-Haija, et al. from the paper “MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing”, published as a conference paper in ICML 2019
  • 18. LeapMind Inc. © 2019 Experiment 2: Real world datasets 18 ● 3 datasets, Citeseer, Cora, and Pubmed, are used in the evaluation Accuracy Table * default architecture: All weights have the same dimension Table by Sami Abu-El-Haija, et al. from the paper “MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing”, published as a conference paper in ICML 2019
  • 19. LeapMind Inc. © 2019 Experiment 2: Real world datasets 19 ● 3 datasets, Citeseer, Cora, and Pubmed, are used in the evaluation Example networks Image by Sami Abu-El-Haija, et al. from the paper “MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing”, published as a conference paper in ICML 2019
  • 20. LeapMind Inc. © 2019 Conclusion 20 ● A MixHop graph convolutional layer is proposed ○ using multiple adjacency powers so that node receiving info from farther nodes ● The less homophily the graph, the more impact of MixHop graph convolutional layer ● Learning network architecture is important because ○ finding optimal architecture for given dataset ○ pruning the weight