SlideShare a Scribd company logo
1 of 29
Download to read offline
Tutorial of
Topological Data Analysis
Tran Quoc Hoan
@k09hthaduonght.wordpress.com/
Paper Alert 2016-04-15, Hasegawa lab., Tokyo
The University of Tokyo
Part III - Mapper Algorithm
My TDA = Topology Data Analysis ’s road
TDA Road 2
Part I - Basic concepts &
applications
Part II - Advanced TDA
computation
Part III - Mapper Algorithm
Part V - Applications in…
Part VI - Applications in…
Part IV - Software Roadmap
He is following me
TDA Road Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
Mapper Algorithm
Basic motivation
Mapper Algorithm 4
Basic idea
Perform clustering at different “scales”, track how
clusters change as scale varies
Motivation
• Coarser than manifold learning, but
still works in nonlinear situation
• Extract meaningful geometric
information about dataset
• Efficiently computable (for large
dataset) Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition.
G Singh, F Mémoli, GE Carlsson - SPBG, 2007
Morse theory
Mapper Algorithm 5
Basic idea
Describe topology of a smooth manifold M using level
sets of a suitable function h : M -> R
• Recover M by looking at h-1((∞, t]), as t scans over the
range of h
• Topology of M changes at critical points of h
Reeb graphs
Mapper Algorithm 6
• For each t in R, contract each
component of f-1(t) to a point
• Resulting structure is a graph
Mapper
Mapper Algorithm 7
The mapper algorithm is a generalization of this procedure (Singh-
Memoli-Carlsson)
Input
✤ Filter (continuous) function f: X -> R
✤ Cover L of im(f) by open intervals:
Method
✤ Cluster each inverse image f-1(Lα) into various connected components
✤ The Mapper is the nerve of V
• Clusters are vertices
• 1 k-simplex per (k+1)-fold intersection
connected cover V
✤ Color vertices according to average value of f in the cluster
k
i=0Vi 6= ;, V0, ..., Vk 2 V
Workflow - Illustration
Mapper Algorithm 8Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
f could be in n-dimension
Workflow - Illustration
Mapper Algorithm 9Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
f could be in n-dimension
Workflow - Illustration
Mapper Algorithm 10Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
f could be in n-dimension
Mapper in practice
Mapper Algorithm 11
Input
✤ Filter (continuous) function f: P -> R
✤ Cover L of im(f) by open intervals:
Method
✤ Cluster each inverse image f-1(Lα) into various connected components
in G
✤ The Mapper is the nerve of V
connected cover V
✤ Color vertices according to average value of f in the cluster
- Point cloud P with metric dP
- Compute neighborhood graph G = (P, E)
• Clusters are vertices
• 1 k-simplex per (k+1)-fold intersection
k
i=0Vi 6= ;, V0, ..., Vk 2 V
(intersections materialized
by data points)
Mapper in practice
Mapper Algorithm 12Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
Mapper in practice
Mapper Algorithm 13Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
Mapper in practice
Mapper Algorithm 14Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
Mapper in practice
Mapper Algorithm 15
Parameters
✤ Filter (continuous) function f: P -> R
✤ Cover L of im(f) by open intervals:
✤ Neighborhood size δ
Example: uniform cover L
• Resolution / granularity: r (diameter of intervals)
• Gain: g (percentage of overlap)
range scale
geometric scale
Filter functions
Mapper Algorithm 16
Choice of filter function is essential
• Some kind of density measure
• A score measure difference (distance) from some baseline
• An eccentricity measure
Statistics

Mean/Max/Min
Variance
n-Moment
Density
…
Machine Learning

PCA/SVD
Auto encoders
Isomap/MDS/TSNE
SVM Distance
Error/Debugging Info
…
Geometry

Centrality

Curvature

Harmonic Cycles
…
Filter functions
Mapper Algorithm 17
Eccentricity
Density
- How close the point lies to the “center” of the point cloud.
- How close the point to the surrounding points
Mapper in applications
Mapper Algorithm 18
Extracting insights from the shape of complex data using topology,
Lum et al., Nature, 2013
Topological Data Analysis for Discovery in Preclinical Spinal Cord
Injury and Traumatic Brain Injury, Nielson et al., Nature, 2015
Using Topological Data Analysis for Diagnosis Pulmonary Embolism,
Rucco et al., arXiv preprint, 2014
Topological Methods for Exploring Low-density States in
Biomolecular Folding Pathways, Yao et al., J. Chemical Physics, 2009
CD8 T-cell reactivity to islet antigens is unique to type 1 while
CD4 T-cell reactivity exists in both type 1 and type 2 diabetes,
Sarikonda et al., J. Autoimmunity, 2013
Innate and adaptive T cells in asthmatic patients: Relationship
to severity and disease mechanisms, Hinks et al., J. Allergy Clinical
Immunology, 2015
✤
✤
✤
✤
✤
✤
Mapper in practice
Mapper Algorithm 19
1. Clustering
2. Feature selection
Mapper in clustering
Mapper Algorithm 20
(1) Compute the Mapper
(2) Detect interesting topological substructures
(“loops”, “flares”)
(3) Use substructure to
cluster data
select parameters
Not easy (Tutorial part 1 + 2)
Mapper Algorithm 21
Extracting insights from the shape of complex data using topology,
Lum et al., Nature, 2013
f: 1st and 2nd SVD r = 120, g = 22%
PCA can show the
Republican/
Democrat cluster
but TDA gives
more information
House Party representative grouping
Point: member of
the House
PCA
Mapper Algorithm 22
Extracting insights from the shape of complex data using topology,
Lum et al., Nature, 2013
Detect new clusters for NBA players
Mapper Algorithm 23
Innate and adaptive T cells in asthmatic patients: Relationship
to severity and disease mechanisms, Hinks et al., J. Allergy Clinical Immunology, 2015
The TDA used 62 subjects
with most complete data.
f: 1st and 2nd SVD
r = 120, g = 14%, equalized
Mapper in feature selection
Mapper Algorithm 24
(1) Compute the Mapper
(2) Detect interesting topological substructures
(“loops”, “flares”)
(3) Select features that best
discriminate data in substructure
select parameters Kolmogorov-Smirnov test on (substructure)
feature vs. (whole dataset) feature,
select features with low p-val
Mapper Algorithm 25
Extracting insights from the shape of complex data using topology,
Lum et al., Nature, 2013
Goal: detect factors that influence survival after therapy in breast cancer patients
Points: breast cancer patients that went through specific therapy
PCA/Single-linkage clustering cannot see this
f: eccentricity
r = 1/30, g = 33%
Mapper Algorithm 26
Topological Data Analysis for Discovery in Preclinical Spinal Cord
Injury and Traumatic Brain Injury, Nielson et al., Nature, 2015
Select Parameters
Mapper Algorithm 27
parameter r
parameter g
parameter δ
parameter f
• Small r -> fine cover 

(close to Reeb) (sensitive to δ)
• Large r -> rough cover 

(less sensitive to δ)
• g ≈ 1 -> more points inside
intersections , less sensitive to
δ but far from Reeb
• g ≈ 0 -> controlled Mapper
dimension, close to Reeb
• Large δ -> fewer nodes, clean
Mapper but far from Reeb
(more straight lines)
• Small δ -> distinct
topological structure but lots
of nodes (noisy)
• Depend mostly on the
dataset
coordinate, density estimation,
eccentricity, eigenvector
Select Parameters
Mapper Algorithm 28
Example: P in R2 sampled from known distribution
f = density estimator, r = 1/30, g = 20%
δ = percentage of the diameter of X
Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
Reference links
Mapper Algorithm 29
• INF563 Topological Data Analysis Course

http://www.enseignement.polytechnique.fr/informatique/INF563/
• AYASDI

http://www.ayasdi.com/
• …

More Related Content

What's hot

Variational Inference
Variational InferenceVariational Inference
Variational InferenceTushar Tank
 
Bayesian Deep Learning
Bayesian Deep LearningBayesian Deep Learning
Bayesian Deep LearningRayKim51
 
混合ガウスモデルとEMアルゴリスム
混合ガウスモデルとEMアルゴリスム混合ガウスモデルとEMアルゴリスム
混合ガウスモデルとEMアルゴリスム貴之 八木
 
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
PRML EP法 10.7 10.7.2
PRML EP法 10.7 10.7.2 PRML EP法 10.7 10.7.2
PRML EP法 10.7 10.7.2 tmtm otm
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisJaclyn Kokx
 
Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryKenta Oono
 
TSPを山登り法と焼きなまし法で解く
TSPを山登り法と焼きなまし法で解くTSPを山登り法と焼きなまし法で解く
TSPを山登り法と焼きなまし法で解くRui High
 
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装 [DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装 Deep Learning JP
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for MoleculesIchigaku Takigawa
 
Probabilistic Graphical Models 輪読会 #1
Probabilistic Graphical Models 輪読会 #1Probabilistic Graphical Models 輪読会 #1
Probabilistic Graphical Models 輪読会 #1Takuma Yagi
 
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -Akisato Kimura
 
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...Masahiro Suzuki
 
深層生成モデルと世界モデル(2020/11/20版)
深層生成モデルと世界モデル(2020/11/20版)深層生成モデルと世界モデル(2020/11/20版)
深層生成モデルと世界モデル(2020/11/20版)Masahiro Suzuki
 
第11回 配信講義 計算科学技術特論B(2022)
第11回 配信講義 計算科学技術特論B(2022)第11回 配信講義 計算科学技術特論B(2022)
第11回 配信講義 計算科学技術特論B(2022)RCCSRENKEI
 
Topological Data Analysis.pptx
Topological Data Analysis.pptxTopological Data Analysis.pptx
Topological Data Analysis.pptxColleen Farrelly
 

What's hot (20)

Variational Inference
Variational InferenceVariational Inference
Variational Inference
 
Bayesian Deep Learning
Bayesian Deep LearningBayesian Deep Learning
Bayesian Deep Learning
 
混合ガウスモデルとEMアルゴリスム
混合ガウスモデルとEMアルゴリスム混合ガウスモデルとEMアルゴリスム
混合ガウスモデルとEMアルゴリスム
 
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
 
PRML EP法 10.7 10.7.2
PRML EP法 10.7 10.7.2 PRML EP法 10.7 10.7.2
PRML EP法 10.7 10.7.2
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant Analysis
 
Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistry
 
Chapter2.3.6
Chapter2.3.6Chapter2.3.6
Chapter2.3.6
 
TSPを山登り法と焼きなまし法で解く
TSPを山登り法と焼きなまし法で解くTSPを山登り法と焼きなまし法で解く
TSPを山登り法と焼きなまし法で解く
 
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装 [DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装
[DLHacks]PyTorch, PixyzによるGenerative Query Networkの実装
 
Rを用いたGIS
Rを用いたGISRを用いたGIS
Rを用いたGIS
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for Molecules
 
Probabilistic Graphical Models 輪読会 #1
Probabilistic Graphical Models 輪読会 #1Probabilistic Graphical Models 輪読会 #1
Probabilistic Graphical Models 輪読会 #1
 
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
 
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacks輪読) How to Train Deep Variational Autoencoders and Probabilistic Lad...
 
深層生成モデルと世界モデル(2020/11/20版)
深層生成モデルと世界モデル(2020/11/20版)深層生成モデルと世界モデル(2020/11/20版)
深層生成モデルと世界モデル(2020/11/20版)
 
Csc446: Pattren Recognition (LN1)
Csc446: Pattren Recognition (LN1)Csc446: Pattren Recognition (LN1)
Csc446: Pattren Recognition (LN1)
 
第11回 配信講義 計算科学技術特論B(2022)
第11回 配信講義 計算科学技術特論B(2022)第11回 配信講義 計算科学技術特論B(2022)
第11回 配信講義 計算科学技術特論B(2022)
 
Topological Data Analysis.pptx
Topological Data Analysis.pptxTopological Data Analysis.pptx
Topological Data Analysis.pptx
 
CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)
 

Similar to Tutorial of topological data analysis part 3(Mapper algorithm)

Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...JuanPabloCarbajal3
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingIAEME Publication
 
Vol 9 No 1 - January 2014
Vol 9 No 1 - January 2014Vol 9 No 1 - January 2014
Vol 9 No 1 - January 2014ijcsbi
 
A Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data MiningA Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data Miningijdmtaiir
 
A general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measuresA general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measuresMehran Mesbahzadeh
 
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTION
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTIONOPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTION
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTIONsipij
 
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODSFARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODSijcseit
 
Farsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methodsFarsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methodsijcseit
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...ijcseit
 
fuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptfuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptAbdullah Gubbi
 
Citython presentation
Citython presentationCitython presentation
Citython presentationAnkit Tewari
 
Применение машинного обучения для навигации и управления роботами
Применение машинного обучения для навигации и управления роботамиПрименение машинного обучения для навигации и управления роботами
Применение машинного обучения для навигации и управления роботамиSkolkovo Robotics Center
 
Interpolation of-geofield-parameters
Interpolation of-geofield-parametersInterpolation of-geofield-parameters
Interpolation of-geofield-parametersCemal Ardil
 
Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ahmed Ammar Rebai PhD
 
Multi fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageMulti fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageeSAT Publishing House
 
Multi fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageMulti fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageeSAT Journals
 

Similar to Tutorial of topological data analysis part 3(Mapper algorithm) (20)

How to Decide the Best Fuzzy Model in ANFIS
How to Decide the Best Fuzzy Model in ANFIS How to Decide the Best Fuzzy Model in ANFIS
How to Decide the Best Fuzzy Model in ANFIS
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of moving
 
Vol 9 No 1 - January 2014
Vol 9 No 1 - January 2014Vol 9 No 1 - January 2014
Vol 9 No 1 - January 2014
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
A Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data MiningA Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data Mining
 
A general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measuresA general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measures
 
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTION
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTIONOPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTION
OPTIMAL GLOBAL THRESHOLD ESTIMATION USING STATISTICAL CHANGE-POINT DETECTION
 
Pca part
Pca partPca part
Pca part
 
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODSFARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS
FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS
 
Farsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methodsFarsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methods
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
fuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptfuzzy LBP for face recognition ppt
fuzzy LBP for face recognition ppt
 
Citython presentation
Citython presentationCitython presentation
Citython presentation
 
Применение машинного обучения для навигации и управления роботами
Применение машинного обучения для навигации и управления роботамиПрименение машинного обучения для навигации и управления роботами
Применение машинного обучения для навигации и управления роботами
 
Interpolation of-geofield-parameters
Interpolation of-geofield-parametersInterpolation of-geofield-parameters
Interpolation of-geofield-parameters
 
Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...
 
Multi fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageMulti fractal analysis of human brain mr image
Multi fractal analysis of human brain mr image
 
Multi fractal analysis of human brain mr image
Multi fractal analysis of human brain mr imageMulti fractal analysis of human brain mr image
Multi fractal analysis of human brain mr image
 

More from Ha Phuong

QTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapQTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapHa Phuong
 
CCS2019-opological time-series analysis with delay-variant embedding
CCS2019-opological time-series analysis with delay-variant embeddingCCS2019-opological time-series analysis with delay-variant embedding
CCS2019-opological time-series analysis with delay-variant embeddingHa Phuong
 
SIAM-AG21-Topological Persistence Machine of Phase Transition
SIAM-AG21-Topological Persistence Machine of Phase TransitionSIAM-AG21-Topological Persistence Machine of Phase Transition
SIAM-AG21-Topological Persistence Machine of Phase TransitionHa Phuong
 
018 20160902 Machine Learning Framework for Analysis of Transport through Com...
018 20160902 Machine Learning Framework for Analysis of Transport through Com...018 20160902 Machine Learning Framework for Analysis of Transport through Com...
018 20160902 Machine Learning Framework for Analysis of Transport through Com...Ha Phuong
 
017_20160826 Thermodynamics Of Stochastic Turing Machines
017_20160826 Thermodynamics Of Stochastic Turing Machines017_20160826 Thermodynamics Of Stochastic Turing Machines
017_20160826 Thermodynamics Of Stochastic Turing MachinesHa Phuong
 
016_20160722 Molecular Circuits For Dynamic Noise Filtering
016_20160722 Molecular Circuits For Dynamic Noise Filtering016_20160722 Molecular Circuits For Dynamic Noise Filtering
016_20160722 Molecular Circuits For Dynamic Noise FilteringHa Phuong
 
015_20160422 Controlling Synchronous Patterns In Complex Networks
015_20160422 Controlling Synchronous Patterns In Complex Networks015_20160422 Controlling Synchronous Patterns In Complex Networks
015_20160422 Controlling Synchronous Patterns In Complex NetworksHa Phuong
 
013_20160328_Topological_Measurement_Of_Protein_Compressibility
013_20160328_Topological_Measurement_Of_Protein_Compressibility013_20160328_Topological_Measurement_Of_Protein_Compressibility
013_20160328_Topological_Measurement_Of_Protein_CompressibilityHa Phuong
 
011_20160321_Topological_data_analysis_of_contagion_map
011_20160321_Topological_data_analysis_of_contagion_map011_20160321_Topological_data_analysis_of_contagion_map
011_20160321_Topological_data_analysis_of_contagion_mapHa Phuong
 
010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian ProcessHa Phuong
 
009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain Networks009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain NetworksHa Phuong
 
PRML Reading Chapter 11 - Sampling Method
PRML Reading Chapter 11 - Sampling MethodPRML Reading Chapter 11 - Sampling Method
PRML Reading Chapter 11 - Sampling MethodHa Phuong
 
Approximate Inference (Chapter 10, PRML Reading)
Approximate Inference (Chapter 10, PRML Reading)Approximate Inference (Chapter 10, PRML Reading)
Approximate Inference (Chapter 10, PRML Reading)Ha Phuong
 
008 20151221 Return of Frustrating Easy Domain Adaptation
008 20151221 Return of Frustrating Easy Domain Adaptation008 20151221 Return of Frustrating Easy Domain Adaptation
008 20151221 Return of Frustrating Easy Domain AdaptationHa Phuong
 
007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics
007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics
007 20151214 Deep Unsupervised Learning using Nonequlibrium ThermodynamicsHa Phuong
 
006 20151207 draws - Deep Recurrent Attentive Writer
006 20151207 draws - Deep Recurrent Attentive Writer006 20151207 draws - Deep Recurrent Attentive Writer
006 20151207 draws - Deep Recurrent Attentive WriterHa Phuong
 
005 20151130 adversary_networks
005 20151130 adversary_networks005 20151130 adversary_networks
005 20151130 adversary_networksHa Phuong
 
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamicsHa Phuong
 
003 20151109 nn_faster_andfaster
003 20151109 nn_faster_andfaster003 20151109 nn_faster_andfaster
003 20151109 nn_faster_andfasterHa Phuong
 
002 20151019 interconnected_network
002 20151019 interconnected_network002 20151019 interconnected_network
002 20151019 interconnected_networkHa Phuong
 

More from Ha Phuong (20)

QTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapQTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature Map
 
CCS2019-opological time-series analysis with delay-variant embedding
CCS2019-opological time-series analysis with delay-variant embeddingCCS2019-opological time-series analysis with delay-variant embedding
CCS2019-opological time-series analysis with delay-variant embedding
 
SIAM-AG21-Topological Persistence Machine of Phase Transition
SIAM-AG21-Topological Persistence Machine of Phase TransitionSIAM-AG21-Topological Persistence Machine of Phase Transition
SIAM-AG21-Topological Persistence Machine of Phase Transition
 
018 20160902 Machine Learning Framework for Analysis of Transport through Com...
018 20160902 Machine Learning Framework for Analysis of Transport through Com...018 20160902 Machine Learning Framework for Analysis of Transport through Com...
018 20160902 Machine Learning Framework for Analysis of Transport through Com...
 
017_20160826 Thermodynamics Of Stochastic Turing Machines
017_20160826 Thermodynamics Of Stochastic Turing Machines017_20160826 Thermodynamics Of Stochastic Turing Machines
017_20160826 Thermodynamics Of Stochastic Turing Machines
 
016_20160722 Molecular Circuits For Dynamic Noise Filtering
016_20160722 Molecular Circuits For Dynamic Noise Filtering016_20160722 Molecular Circuits For Dynamic Noise Filtering
016_20160722 Molecular Circuits For Dynamic Noise Filtering
 
015_20160422 Controlling Synchronous Patterns In Complex Networks
015_20160422 Controlling Synchronous Patterns In Complex Networks015_20160422 Controlling Synchronous Patterns In Complex Networks
015_20160422 Controlling Synchronous Patterns In Complex Networks
 
013_20160328_Topological_Measurement_Of_Protein_Compressibility
013_20160328_Topological_Measurement_Of_Protein_Compressibility013_20160328_Topological_Measurement_Of_Protein_Compressibility
013_20160328_Topological_Measurement_Of_Protein_Compressibility
 
011_20160321_Topological_data_analysis_of_contagion_map
011_20160321_Topological_data_analysis_of_contagion_map011_20160321_Topological_data_analysis_of_contagion_map
011_20160321_Topological_data_analysis_of_contagion_map
 
010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process
 
009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain Networks009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain Networks
 
PRML Reading Chapter 11 - Sampling Method
PRML Reading Chapter 11 - Sampling MethodPRML Reading Chapter 11 - Sampling Method
PRML Reading Chapter 11 - Sampling Method
 
Approximate Inference (Chapter 10, PRML Reading)
Approximate Inference (Chapter 10, PRML Reading)Approximate Inference (Chapter 10, PRML Reading)
Approximate Inference (Chapter 10, PRML Reading)
 
008 20151221 Return of Frustrating Easy Domain Adaptation
008 20151221 Return of Frustrating Easy Domain Adaptation008 20151221 Return of Frustrating Easy Domain Adaptation
008 20151221 Return of Frustrating Easy Domain Adaptation
 
007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics
007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics
007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics
 
006 20151207 draws - Deep Recurrent Attentive Writer
006 20151207 draws - Deep Recurrent Attentive Writer006 20151207 draws - Deep Recurrent Attentive Writer
006 20151207 draws - Deep Recurrent Attentive Writer
 
005 20151130 adversary_networks
005 20151130 adversary_networks005 20151130 adversary_networks
005 20151130 adversary_networks
 
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
 
003 20151109 nn_faster_andfaster
003 20151109 nn_faster_andfaster003 20151109 nn_faster_andfaster
003 20151109 nn_faster_andfaster
 
002 20151019 interconnected_network
002 20151019 interconnected_network002 20151019 interconnected_network
002 20151019 interconnected_network
 

Recently uploaded

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 

Recently uploaded (20)

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Tutorial of topological data analysis part 3(Mapper algorithm)

  • 1. Tutorial of Topological Data Analysis Tran Quoc Hoan @k09hthaduonght.wordpress.com/ Paper Alert 2016-04-15, Hasegawa lab., Tokyo The University of Tokyo Part III - Mapper Algorithm
  • 2. My TDA = Topology Data Analysis ’s road TDA Road 2 Part I - Basic concepts & applications Part II - Advanced TDA computation Part III - Mapper Algorithm Part V - Applications in… Part VI - Applications in… Part IV - Software Roadmap He is following me
  • 3. TDA Road Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/ Mapper Algorithm
  • 4. Basic motivation Mapper Algorithm 4 Basic idea Perform clustering at different “scales”, track how clusters change as scale varies Motivation • Coarser than manifold learning, but still works in nonlinear situation • Extract meaningful geometric information about dataset • Efficiently computable (for large dataset) Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition. G Singh, F Mémoli, GE Carlsson - SPBG, 2007
  • 5. Morse theory Mapper Algorithm 5 Basic idea Describe topology of a smooth manifold M using level sets of a suitable function h : M -> R • Recover M by looking at h-1((∞, t]), as t scans over the range of h • Topology of M changes at critical points of h
  • 6. Reeb graphs Mapper Algorithm 6 • For each t in R, contract each component of f-1(t) to a point • Resulting structure is a graph
  • 7. Mapper Mapper Algorithm 7 The mapper algorithm is a generalization of this procedure (Singh- Memoli-Carlsson) Input ✤ Filter (continuous) function f: X -> R ✤ Cover L of im(f) by open intervals: Method ✤ Cluster each inverse image f-1(Lα) into various connected components ✤ The Mapper is the nerve of V • Clusters are vertices • 1 k-simplex per (k+1)-fold intersection connected cover V ✤ Color vertices according to average value of f in the cluster k i=0Vi 6= ;, V0, ..., Vk 2 V
  • 8. Workflow - Illustration Mapper Algorithm 8Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/ f could be in n-dimension
  • 9. Workflow - Illustration Mapper Algorithm 9Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/ f could be in n-dimension
  • 10. Workflow - Illustration Mapper Algorithm 10Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/ f could be in n-dimension
  • 11. Mapper in practice Mapper Algorithm 11 Input ✤ Filter (continuous) function f: P -> R ✤ Cover L of im(f) by open intervals: Method ✤ Cluster each inverse image f-1(Lα) into various connected components in G ✤ The Mapper is the nerve of V connected cover V ✤ Color vertices according to average value of f in the cluster - Point cloud P with metric dP - Compute neighborhood graph G = (P, E) • Clusters are vertices • 1 k-simplex per (k+1)-fold intersection k i=0Vi 6= ;, V0, ..., Vk 2 V (intersections materialized by data points)
  • 12. Mapper in practice Mapper Algorithm 12Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
  • 13. Mapper in practice Mapper Algorithm 13Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
  • 14. Mapper in practice Mapper Algorithm 14Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
  • 15. Mapper in practice Mapper Algorithm 15 Parameters ✤ Filter (continuous) function f: P -> R ✤ Cover L of im(f) by open intervals: ✤ Neighborhood size δ Example: uniform cover L • Resolution / granularity: r (diameter of intervals) • Gain: g (percentage of overlap) range scale geometric scale
  • 16. Filter functions Mapper Algorithm 16 Choice of filter function is essential • Some kind of density measure • A score measure difference (distance) from some baseline • An eccentricity measure Statistics
 Mean/Max/Min Variance n-Moment Density … Machine Learning
 PCA/SVD Auto encoders Isomap/MDS/TSNE SVM Distance Error/Debugging Info … Geometry
 Centrality
 Curvature
 Harmonic Cycles …
  • 17. Filter functions Mapper Algorithm 17 Eccentricity Density - How close the point lies to the “center” of the point cloud. - How close the point to the surrounding points
  • 18. Mapper in applications Mapper Algorithm 18 Extracting insights from the shape of complex data using topology, Lum et al., Nature, 2013 Topological Data Analysis for Discovery in Preclinical Spinal Cord Injury and Traumatic Brain Injury, Nielson et al., Nature, 2015 Using Topological Data Analysis for Diagnosis Pulmonary Embolism, Rucco et al., arXiv preprint, 2014 Topological Methods for Exploring Low-density States in Biomolecular Folding Pathways, Yao et al., J. Chemical Physics, 2009 CD8 T-cell reactivity to islet antigens is unique to type 1 while CD4 T-cell reactivity exists in both type 1 and type 2 diabetes, Sarikonda et al., J. Autoimmunity, 2013 Innate and adaptive T cells in asthmatic patients: Relationship to severity and disease mechanisms, Hinks et al., J. Allergy Clinical Immunology, 2015 ✤ ✤ ✤ ✤ ✤ ✤
  • 19. Mapper in practice Mapper Algorithm 19 1. Clustering 2. Feature selection
  • 20. Mapper in clustering Mapper Algorithm 20 (1) Compute the Mapper (2) Detect interesting topological substructures (“loops”, “flares”) (3) Use substructure to cluster data select parameters Not easy (Tutorial part 1 + 2)
  • 21. Mapper Algorithm 21 Extracting insights from the shape of complex data using topology, Lum et al., Nature, 2013 f: 1st and 2nd SVD r = 120, g = 22% PCA can show the Republican/ Democrat cluster but TDA gives more information House Party representative grouping Point: member of the House PCA
  • 22. Mapper Algorithm 22 Extracting insights from the shape of complex data using topology, Lum et al., Nature, 2013 Detect new clusters for NBA players
  • 23. Mapper Algorithm 23 Innate and adaptive T cells in asthmatic patients: Relationship to severity and disease mechanisms, Hinks et al., J. Allergy Clinical Immunology, 2015 The TDA used 62 subjects with most complete data. f: 1st and 2nd SVD r = 120, g = 14%, equalized
  • 24. Mapper in feature selection Mapper Algorithm 24 (1) Compute the Mapper (2) Detect interesting topological substructures (“loops”, “flares”) (3) Select features that best discriminate data in substructure select parameters Kolmogorov-Smirnov test on (substructure) feature vs. (whole dataset) feature, select features with low p-val
  • 25. Mapper Algorithm 25 Extracting insights from the shape of complex data using topology, Lum et al., Nature, 2013 Goal: detect factors that influence survival after therapy in breast cancer patients Points: breast cancer patients that went through specific therapy PCA/Single-linkage clustering cannot see this f: eccentricity r = 1/30, g = 33%
  • 26. Mapper Algorithm 26 Topological Data Analysis for Discovery in Preclinical Spinal Cord Injury and Traumatic Brain Injury, Nielson et al., Nature, 2015
  • 27. Select Parameters Mapper Algorithm 27 parameter r parameter g parameter δ parameter f • Small r -> fine cover 
 (close to Reeb) (sensitive to δ) • Large r -> rough cover 
 (less sensitive to δ) • g ≈ 1 -> more points inside intersections , less sensitive to δ but far from Reeb • g ≈ 0 -> controlled Mapper dimension, close to Reeb • Large δ -> fewer nodes, clean Mapper but far from Reeb (more straight lines) • Small δ -> distinct topological structure but lots of nodes (noisy) • Depend mostly on the dataset coordinate, density estimation, eccentricity, eigenvector
  • 28. Select Parameters Mapper Algorithm 28 Example: P in R2 sampled from known distribution f = density estimator, r = 1/30, g = 20% δ = percentage of the diameter of X Image source: http://www.enseignement.polytechnique.fr/informatique/INF563/
  • 29. Reference links Mapper Algorithm 29 • INF563 Topological Data Analysis Course
 http://www.enseignement.polytechnique.fr/informatique/INF563/ • AYASDI
 http://www.ayasdi.com/ • …