SlideShare uma empresa Scribd logo
1 de 34
Baixar para ler offline
Neighbourhood Component
       Analysis



         T.S. Yo
References
Outline

●   Introduction
●   Learn the distance metric from data
●   The size of K
●   Procedure of NCA
●   Experiments
●   Discussions
Introduction (1/2)

●   KNN
    –   Simple and effective
    –   Nonlinear decision surface
    –   Non-parametric
    –   Quality improved with more data
    –   Only one parameter, K -> easy for tuning
Introduction (2/2)
●   Drawbacks of KNN
    –   Computationally expensive: search through the
        whole training data in the test time
    –   How to define the “distance” properly?


●   Learn the distance metric from data, and
    force it to be low rank.
Learn the Distance from Data (1/5)
●   What is a good distance metric?
     –   The one that minimize (optimize) the cost!


●   Then, what is the cost?
     –   The expected testing error
     –   Best estimated with leave-one-out (LOO) cross-
         validation error in the training data
Kohavi, Ron (1995). "A study of cross-validation and bootstrap for accuracy estimation and model selection".
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence 2 (12): 1137–1143. (Morgan
Kaufmann, San Mateo)
Learn the Distance from Data (2/5)
●   Modeling the LOO error:
    –   Let pij be the probability that point xj is selected as
        point xi's neighbour.
    –   The probability that points are correctly classified
        when xi is used as the reference is:


●   To maximize pi for all xi means to minimize
    LOO error.
Learn the Distance from Data (3/5)
●   Then, how do we define pij ?
    –   According to the softmax of the distance dij


                                                       Softmax Function
                                                  1

                                                 0.9


    –   Relatively smoother than dij
                                                 0.8

                                                 0.7

                                                 0.6




                                       exp(-X)
                                                 0.5

                                                 0.4

                                                 0.3

                                                 0.2

                                                 0.1

                                                  0
                                                              X
Learn the Distance from Data (4/5)
●   How do we define dij ?
●   Limit the distance measure within Mahalanobis
    (quadratic) distance.



●   That is to say, we project the original feature
    vectors x into another vector space with q
    transformation matrix, A
Learn the Distance from Data (5/5)
●   Substitute the dij in pij :



●   Now, we have the objective function :


●   Maximize f(A) w.r.t. A → minimize overall
    LOO error
The Size of k
●   For the probability distribution pij :



●   The perplexity can be used as an estimate for
    the size of neighbours to be considered, k
Procedure of NCA (1/2)
●   Use the objective function and its gradient to
    learn the transformation matrix A and K from
    the training data, Dtrain(with or without dimension
    reduction).
●   Project the test data, Dtest, into the transformed
    space.
●   Perform traditional KNN (with K and ADtrain) on
    the transformed test data, ADtest.
Procedure of NCA (2/2)
●   Functions used for optimization
Experiments – Datasets (1/2)
●   4 from UCI ML Repository, 2 self-made
Experiments – Datasets (2/2)




n2d is a mixture of two bivariate normal distributions with different means and
covariance matrices. ring consists of 2-d concentric rings and 8 dimensions of
uniform random noise.
Experiments – Results (1/4)




Error rates of KNN and NCA with the same K.
It is shown that generally NCA does improve the
performance of KNN.
Experiments – Results (2/4)
Experiments – Results (3/4)
●   Compare with
    other classifiers
Experiments – Results (4/4)
                   ●   Rank 2
                       dimension
                       reduction
Discussions (1/8)
●   Rank 2 transformation for wine
Discussions (2/8)
●   Rank 1 transformation for n2d
Discussions (3/8)
●   Results of
    Goldberger
    et al.
(40 realizations of
   30%/70% splits)
Discussions
    (4/8)

●   Results of
    Goldberger
    et al.
(rank 2
   transformation)
Discussions (5/8)
●   Results of experiments suggest that with the
    learned distance metric by NCA algorithm, KNN
    classification can be improved.

●   NCA also outperforms traditional dimension
    reduction methods for several datasets.
Discussions (6/8)
●   Comparing to other classification methods (i.e.
    LDA and QDA), NCA usually does not give the
    best accuracy.

●   Some odd performance on dimension reduction
    suggests that a further investigation on the
    optimization algorithm is necessary.
Discussions (7/8)
●   Optimize a matrix
●
    Can we Optimize these Functions? (Michael L. Overton)
    –   Globally, no. Related problems are NP-hard (Blondell-
        Tsitsiklas, Nemirovski)
    –   Locally, yes.
         ●
             But not by standard methods for nonconvex,
             smooth optimization
         ●
             Steepest descent, BFGS or nonlinear conjugate
             gradient will typically jam because of nonsmoothness
Discussions (8/8)
 ●   Other methods learn distant metric from data
      –   Discriminant Common Vectors(DCV)
           ●   Similar to NCA, DCV focuses on optimizing the distance
               metric on certain objective functions


      –   Laplacianfaces(LAP)
           ●   Emphasizes more on dimension reduction

J. Liu and S. Chen , Discriminant Common Vecotors Versus Neighbourhood Components
Analysis and Laplacianfaces: A comparative study in small sample size problem. Image and
Vision Computing
Question?
Thank you!
Derive the Objective Function (1/5)
●   From the assumptions, we have :
Derive the Objective Function (2/5)
Derive the Objective Function (3/5)
Derive the Objective Function (4/5)
Derive the Objective Function (5/5)

Mais conteúdo relacionado

Mais procurados

Image Smoothing using Frequency Domain Filters
Image Smoothing using Frequency Domain FiltersImage Smoothing using Frequency Domain Filters
Image Smoothing using Frequency Domain FiltersSuhaila Afzana
 
The Singular Value Decomposition theroy + example
 The Singular Value Decomposition theroy + example  The Singular Value Decomposition theroy + example
The Singular Value Decomposition theroy + example shivam choubey
 
Planning in Artificial Intelligence
Planning in Artificial IntelligencePlanning in Artificial Intelligence
Planning in Artificial Intelligencekitsenthilkumarcse
 
Applications of WSN
Applications of WSNApplications of WSN
Applications of WSNADEEBANADEEM
 
Energy consumption of wsn
Energy consumption of wsnEnergy consumption of wsn
Energy consumption of wsnDeepaDasarathan
 
MQTT with Java - a protocol for IoT and M2M communication
MQTT with Java - a protocol for IoT and M2M communicationMQTT with Java - a protocol for IoT and M2M communication
MQTT with Java - a protocol for IoT and M2M communicationChristian Götz
 
Solving 0-1 knapsack problems based on amoeboid organism algorithm
Solving 0-1 knapsack problems based on amoeboid organism algorithmSolving 0-1 knapsack problems based on amoeboid organism algorithm
Solving 0-1 knapsack problems based on amoeboid organism algorithmjuanjo_23
 
Recipe Detection Of Image Using Deep Learning.pptx
Recipe Detection Of Image Using Deep Learning.pptxRecipe Detection Of Image Using Deep Learning.pptx
Recipe Detection Of Image Using Deep Learning.pptxDhawalTank4
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationVarun Ojha
 
Data compression using huffman coding
Data compression using huffman codingData compression using huffman coding
Data compression using huffman codingSATYENDRAKUMAR279
 
Design Issues and Applications of Wireless Sensor Network
Design Issues and Applications of Wireless Sensor NetworkDesign Issues and Applications of Wireless Sensor Network
Design Issues and Applications of Wireless Sensor Networkijtsrd
 
lecture 30
lecture 30lecture 30
lecture 30sajinsc
 
Frequency Domain Image Enhancement Techniques
Frequency Domain Image Enhancement TechniquesFrequency Domain Image Enhancement Techniques
Frequency Domain Image Enhancement TechniquesDiwaker Pant
 
Uncertain knowledge and reasoning
Uncertain knowledge and reasoningUncertain knowledge and reasoning
Uncertain knowledge and reasoningShiwani Gupta
 
Image enhancement techniques
Image enhancement techniques Image enhancement techniques
Image enhancement techniques Arshad khan
 

Mais procurados (20)

WRAN
WRANWRAN
WRAN
 
Image Smoothing using Frequency Domain Filters
Image Smoothing using Frequency Domain FiltersImage Smoothing using Frequency Domain Filters
Image Smoothing using Frequency Domain Filters
 
The Singular Value Decomposition theroy + example
 The Singular Value Decomposition theroy + example  The Singular Value Decomposition theroy + example
The Singular Value Decomposition theroy + example
 
Planning in Artificial Intelligence
Planning in Artificial IntelligencePlanning in Artificial Intelligence
Planning in Artificial Intelligence
 
Applications of WSN
Applications of WSNApplications of WSN
Applications of WSN
 
Energy consumption of wsn
Energy consumption of wsnEnergy consumption of wsn
Energy consumption of wsn
 
MQTT with Java - a protocol for IoT and M2M communication
MQTT with Java - a protocol for IoT and M2M communicationMQTT with Java - a protocol for IoT and M2M communication
MQTT with Java - a protocol for IoT and M2M communication
 
Solving 0-1 knapsack problems based on amoeboid organism algorithm
Solving 0-1 knapsack problems based on amoeboid organism algorithmSolving 0-1 knapsack problems based on amoeboid organism algorithm
Solving 0-1 knapsack problems based on amoeboid organism algorithm
 
Recipe Detection Of Image Using Deep Learning.pptx
Recipe Detection Of Image Using Deep Learning.pptxRecipe Detection Of Image Using Deep Learning.pptx
Recipe Detection Of Image Using Deep Learning.pptx
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier Transformation
 
Data compression using huffman coding
Data compression using huffman codingData compression using huffman coding
Data compression using huffman coding
 
Design Issues and Applications of Wireless Sensor Network
Design Issues and Applications of Wireless Sensor NetworkDesign Issues and Applications of Wireless Sensor Network
Design Issues and Applications of Wireless Sensor Network
 
FUZZY LOGIC
FUZZY LOGICFUZZY LOGIC
FUZZY LOGIC
 
lecture 30
lecture 30lecture 30
lecture 30
 
Frequency Domain Image Enhancement Techniques
Frequency Domain Image Enhancement TechniquesFrequency Domain Image Enhancement Techniques
Frequency Domain Image Enhancement Techniques
 
Uncertain knowledge and reasoning
Uncertain knowledge and reasoningUncertain knowledge and reasoning
Uncertain knowledge and reasoning
 
Routing in vanet
Routing in vanetRouting in vanet
Routing in vanet
 
Image enhancement techniques
Image enhancement techniques Image enhancement techniques
Image enhancement techniques
 
Mc culloch pitts neuron
Mc culloch pitts neuronMc culloch pitts neuron
Mc culloch pitts neuron
 
Lecture 3 problem solving
Lecture 3   problem solvingLecture 3   problem solving
Lecture 3 problem solving
 

Destaque

Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013
Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013
Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013Enrique Frias-Martinez
 
Unit 1 planning c oncepts ppt
Unit 1   planning c oncepts  pptUnit 1   planning c oncepts  ppt
Unit 1 planning c oncepts pptEzhil Tamizh
 
Neighborhood Analysis, Cox Meadows
Neighborhood Analysis, Cox MeadowsNeighborhood Analysis, Cox Meadows
Neighborhood Analysis, Cox MeadowsCrystal Rova
 
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...Courtney Porter
 
Broyhill McLean Estates Neighborhood Stormwater Improvement Project
Broyhill McLean Estates Neighborhood Stormwater Improvement ProjectBroyhill McLean Estates Neighborhood Stormwater Improvement Project
Broyhill McLean Estates Neighborhood Stormwater Improvement ProjectFairfax County
 
Intervalos de confianza en r commander
Intervalos de confianza en r commanderIntervalos de confianza en r commander
Intervalos de confianza en r commanderEdy Crackmaster
 
Parts of the neighborhood
Parts of the  neighborhoodParts of the  neighborhood
Parts of the neighborhoodVero Marin
 
Neighborhood Watch: Toolkit Training Presentation
Neighborhood Watch: Toolkit Training PresentationNeighborhood Watch: Toolkit Training Presentation
Neighborhood Watch: Toolkit Training PresentationAdam Cohen
 
Presentation my neighborhood
Presentation my neighborhoodPresentation my neighborhood
Presentation my neighborhoodRamirez Ramirez
 
Rotterdam Tarwewijk, a resilient neighborhood? A case study
Rotterdam Tarwewijk, a resilient neighborhood? A case studyRotterdam Tarwewijk, a resilient neighborhood? A case study
Rotterdam Tarwewijk, a resilient neighborhood? A case studyBart Litjens
 
Broyhill McLean Estates BMP Neighborhood Improvement Project
Broyhill McLean Estates BMP Neighborhood Improvement ProjectBroyhill McLean Estates BMP Neighborhood Improvement Project
Broyhill McLean Estates BMP Neighborhood Improvement ProjectFairfax County
 
Metric learning ICML2010 tutorial
Metric learning  ICML2010 tutorialMetric learning  ICML2010 tutorial
Metric learning ICML2010 tutorialzukun
 
2009 05 14 Oc Division Program Optimized
2009 05 14 Oc Division Program Optimized2009 05 14 Oc Division Program Optimized
2009 05 14 Oc Division Program OptimizedRonKrater
 
Probabilistic PCA, EM, and more
Probabilistic PCA, EM, and moreProbabilistic PCA, EM, and more
Probabilistic PCA, EM, and morehsharmasshare
 
Councillor Briefing: Neighbourhood planning
Councillor Briefing: Neighbourhood planning Councillor Briefing: Neighbourhood planning
Councillor Briefing: Neighbourhood planning PAS_Team
 
Neighbourhood planning - learning from the pioneers
Neighbourhood planning - learning from the pioneersNeighbourhood planning - learning from the pioneers
Neighbourhood planning - learning from the pioneersLocality
 
Zoning and subdivision of land
Zoning and subdivision of landZoning and subdivision of land
Zoning and subdivision of landctlachu
 

Destaque (20)

Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013
Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013
Urban Analysis for the XXI Century: Jornadas BigDataCyL 2013
 
Neighborhood
NeighborhoodNeighborhood
Neighborhood
 
Unit 1 planning c oncepts ppt
Unit 1   planning c oncepts  pptUnit 1   planning c oncepts  ppt
Unit 1 planning c oncepts ppt
 
Neighborhood Analysis, Cox Meadows
Neighborhood Analysis, Cox MeadowsNeighborhood Analysis, Cox Meadows
Neighborhood Analysis, Cox Meadows
 
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...
The Los Angeles Healthy Community Neighborhood Initiative- A Ten Year Ex...
 
Broyhill McLean Estates Neighborhood Stormwater Improvement Project
Broyhill McLean Estates Neighborhood Stormwater Improvement ProjectBroyhill McLean Estates Neighborhood Stormwater Improvement Project
Broyhill McLean Estates Neighborhood Stormwater Improvement Project
 
Intervalos de confianza en r commander
Intervalos de confianza en r commanderIntervalos de confianza en r commander
Intervalos de confianza en r commander
 
Parts of the neighborhood
Parts of the  neighborhoodParts of the  neighborhood
Parts of the neighborhood
 
Neighborhood Watch: Toolkit Training Presentation
Neighborhood Watch: Toolkit Training PresentationNeighborhood Watch: Toolkit Training Presentation
Neighborhood Watch: Toolkit Training Presentation
 
Presentation my neighborhood
Presentation my neighborhoodPresentation my neighborhood
Presentation my neighborhood
 
Rotterdam Tarwewijk, a resilient neighborhood? A case study
Rotterdam Tarwewijk, a resilient neighborhood? A case studyRotterdam Tarwewijk, a resilient neighborhood? A case study
Rotterdam Tarwewijk, a resilient neighborhood? A case study
 
Broyhill McLean Estates BMP Neighborhood Improvement Project
Broyhill McLean Estates BMP Neighborhood Improvement ProjectBroyhill McLean Estates BMP Neighborhood Improvement Project
Broyhill McLean Estates BMP Neighborhood Improvement Project
 
Metric learning ICML2010 tutorial
Metric learning  ICML2010 tutorialMetric learning  ICML2010 tutorial
Metric learning ICML2010 tutorial
 
2009 05 14 Oc Division Program Optimized
2009 05 14 Oc Division Program Optimized2009 05 14 Oc Division Program Optimized
2009 05 14 Oc Division Program Optimized
 
Probabilistic PCA, EM, and more
Probabilistic PCA, EM, and moreProbabilistic PCA, EM, and more
Probabilistic PCA, EM, and more
 
Councillor Briefing: Neighbourhood planning
Councillor Briefing: Neighbourhood planning Councillor Briefing: Neighbourhood planning
Councillor Briefing: Neighbourhood planning
 
Neighbourhood planning - learning from the pioneers
Neighbourhood planning - learning from the pioneersNeighbourhood planning - learning from the pioneers
Neighbourhood planning - learning from the pioneers
 
Zoning and subdivision of land
Zoning and subdivision of landZoning and subdivision of land
Zoning and subdivision of land
 
Neighbourhood Planning
Neighbourhood PlanningNeighbourhood Planning
Neighbourhood Planning
 
Unit 1 ppt
Unit 1 pptUnit 1 ppt
Unit 1 ppt
 

Semelhante a Neighborhood Component Analysis 20071108

Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaYohanes Nuwara
 
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAminaRepo
 
Medical diagnosis classification
Medical diagnosis classificationMedical diagnosis classification
Medical diagnosis classificationcsandit
 
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...cscpconf
 
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
 
Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017Ram Sriharsha
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Zihui Li
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersSeunghyun Hwang
 
Multiobjective optimization and trade offs using pareto optimality
Multiobjective optimization and trade offs using pareto optimalityMultiobjective optimization and trade offs using pareto optimality
Multiobjective optimization and trade offs using pareto optimalityAmogh Mundhekar
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchGreg Makowski
 
2017 09-29 ndt loop closure
2017 09-29 ndt loop closure2017 09-29 ndt loop closure
2017 09-29 ndt loop closureiMorpheus ai
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality ReductionSaad Elbeleidy
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)byteLAKE
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxSivam Chinna
 
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix DecompositionBeyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix DecompositionFrank Ong
 
Reading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxReading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxcongtran88
 
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya
 

Semelhante a Neighborhood Component Analysis 20071108 (20)

Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
 
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
 
Medical diagnosis classification
Medical diagnosis classificationMedical diagnosis classification
Medical diagnosis classification
 
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...
MEDICAL DIAGNOSIS CLASSIFICATION USING MIGRATION BASED DIFFERENTIAL EVOLUTION...
 
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
 
Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017
 
ngboost.pptx
ngboost.pptxngboost.pptx
ngboost.pptx
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
 
Multiobjective optimization and trade offs using pareto optimality
Multiobjective optimization and trade offs using pareto optimalityMultiobjective optimization and trade offs using pareto optimality
Multiobjective optimization and trade offs using pareto optimality
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient search
 
2017 09-29 ndt loop closure
2017 09-29 ndt loop closure2017 09-29 ndt loop closure
2017 09-29 ndt loop closure
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
 
Group Project
Group ProjectGroup Project
Group Project
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix DecompositionBeyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition
Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition
 
Reading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxReading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptx
 
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
 
Data mining with weka
Data mining with wekaData mining with weka
Data mining with weka
 

Mais de Ting-Shuo Yo

20141030 ntustme computer_programmingandbeyond_share
20141030 ntustme computer_programmingandbeyond_share20141030 ntustme computer_programmingandbeyond_share
20141030 ntustme computer_programmingandbeyond_shareTing-Shuo Yo
 
Tag2Card User's manual v01
Tag2Card User's manual v01Tag2Card User's manual v01
Tag2Card User's manual v01Ting-Shuo Yo
 
Introduction to BCI
Introduction to BCIIntroduction to BCI
Introduction to BCITing-Shuo Yo
 
Design Thinking and Innovation
Design Thinking and InnovationDesign Thinking and Innovation
Design Thinking and InnovationTing-Shuo Yo
 
A Comparison of Evaluation Methods in Coevolution 20070921
A Comparison of Evaluation Methods in Coevolution 20070921A Comparison of Evaluation Methods in Coevolution 20070921
A Comparison of Evaluation Methods in Coevolution 20070921Ting-Shuo Yo
 
The Neurophysiology of Speech
The Neurophysiology of SpeechThe Neurophysiology of Speech
The Neurophysiology of SpeechTing-Shuo Yo
 
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?Diffusion MRI, Tractography,and Connectivity: what machine learning can do?
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?Ting-Shuo Yo
 
Simulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational SimulationSimulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational SimulationTing-Shuo Yo
 

Mais de Ting-Shuo Yo (9)

20141030 ntustme computer_programmingandbeyond_share
20141030 ntustme computer_programmingandbeyond_share20141030 ntustme computer_programmingandbeyond_share
20141030 ntustme computer_programmingandbeyond_share
 
Tag2Card User's manual v01
Tag2Card User's manual v01Tag2Card User's manual v01
Tag2Card User's manual v01
 
InnoCentive
InnoCentiveInnoCentive
InnoCentive
 
Introduction to BCI
Introduction to BCIIntroduction to BCI
Introduction to BCI
 
Design Thinking and Innovation
Design Thinking and InnovationDesign Thinking and Innovation
Design Thinking and Innovation
 
A Comparison of Evaluation Methods in Coevolution 20070921
A Comparison of Evaluation Methods in Coevolution 20070921A Comparison of Evaluation Methods in Coevolution 20070921
A Comparison of Evaluation Methods in Coevolution 20070921
 
The Neurophysiology of Speech
The Neurophysiology of SpeechThe Neurophysiology of Speech
The Neurophysiology of Speech
 
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?Diffusion MRI, Tractography,and Connectivity: what machine learning can do?
Diffusion MRI, Tractography,and Connectivity: what machine learning can do?
 
Simulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational SimulationSimulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational Simulation
 

Último

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 

Último (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Neighborhood Component Analysis 20071108

  • 1. Neighbourhood Component Analysis T.S. Yo
  • 3. Outline ● Introduction ● Learn the distance metric from data ● The size of K ● Procedure of NCA ● Experiments ● Discussions
  • 4. Introduction (1/2) ● KNN – Simple and effective – Nonlinear decision surface – Non-parametric – Quality improved with more data – Only one parameter, K -> easy for tuning
  • 5. Introduction (2/2) ● Drawbacks of KNN – Computationally expensive: search through the whole training data in the test time – How to define the “distance” properly? ● Learn the distance metric from data, and force it to be low rank.
  • 6. Learn the Distance from Data (1/5) ● What is a good distance metric? – The one that minimize (optimize) the cost! ● Then, what is the cost? – The expected testing error – Best estimated with leave-one-out (LOO) cross- validation error in the training data Kohavi, Ron (1995). "A study of cross-validation and bootstrap for accuracy estimation and model selection". Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence 2 (12): 1137–1143. (Morgan Kaufmann, San Mateo)
  • 7. Learn the Distance from Data (2/5) ● Modeling the LOO error: – Let pij be the probability that point xj is selected as point xi's neighbour. – The probability that points are correctly classified when xi is used as the reference is: ● To maximize pi for all xi means to minimize LOO error.
  • 8. Learn the Distance from Data (3/5) ● Then, how do we define pij ? – According to the softmax of the distance dij Softmax Function 1 0.9 – Relatively smoother than dij 0.8 0.7 0.6 exp(-X) 0.5 0.4 0.3 0.2 0.1 0 X
  • 9. Learn the Distance from Data (4/5) ● How do we define dij ? ● Limit the distance measure within Mahalanobis (quadratic) distance. ● That is to say, we project the original feature vectors x into another vector space with q transformation matrix, A
  • 10. Learn the Distance from Data (5/5) ● Substitute the dij in pij : ● Now, we have the objective function : ● Maximize f(A) w.r.t. A → minimize overall LOO error
  • 11. The Size of k ● For the probability distribution pij : ● The perplexity can be used as an estimate for the size of neighbours to be considered, k
  • 12. Procedure of NCA (1/2) ● Use the objective function and its gradient to learn the transformation matrix A and K from the training data, Dtrain(with or without dimension reduction). ● Project the test data, Dtest, into the transformed space. ● Perform traditional KNN (with K and ADtrain) on the transformed test data, ADtest.
  • 13. Procedure of NCA (2/2) ● Functions used for optimization
  • 14. Experiments – Datasets (1/2) ● 4 from UCI ML Repository, 2 self-made
  • 15. Experiments – Datasets (2/2) n2d is a mixture of two bivariate normal distributions with different means and covariance matrices. ring consists of 2-d concentric rings and 8 dimensions of uniform random noise.
  • 16. Experiments – Results (1/4) Error rates of KNN and NCA with the same K. It is shown that generally NCA does improve the performance of KNN.
  • 18. Experiments – Results (3/4) ● Compare with other classifiers
  • 19. Experiments – Results (4/4) ● Rank 2 dimension reduction
  • 20. Discussions (1/8) ● Rank 2 transformation for wine
  • 21. Discussions (2/8) ● Rank 1 transformation for n2d
  • 22. Discussions (3/8) ● Results of Goldberger et al. (40 realizations of 30%/70% splits)
  • 23. Discussions (4/8) ● Results of Goldberger et al. (rank 2 transformation)
  • 24. Discussions (5/8) ● Results of experiments suggest that with the learned distance metric by NCA algorithm, KNN classification can be improved. ● NCA also outperforms traditional dimension reduction methods for several datasets.
  • 25. Discussions (6/8) ● Comparing to other classification methods (i.e. LDA and QDA), NCA usually does not give the best accuracy. ● Some odd performance on dimension reduction suggests that a further investigation on the optimization algorithm is necessary.
  • 26. Discussions (7/8) ● Optimize a matrix ● Can we Optimize these Functions? (Michael L. Overton) – Globally, no. Related problems are NP-hard (Blondell- Tsitsiklas, Nemirovski) – Locally, yes. ● But not by standard methods for nonconvex, smooth optimization ● Steepest descent, BFGS or nonlinear conjugate gradient will typically jam because of nonsmoothness
  • 27. Discussions (8/8) ● Other methods learn distant metric from data – Discriminant Common Vectors(DCV) ● Similar to NCA, DCV focuses on optimizing the distance metric on certain objective functions – Laplacianfaces(LAP) ● Emphasizes more on dimension reduction J. Liu and S. Chen , Discriminant Common Vecotors Versus Neighbourhood Components Analysis and Laplacianfaces: A comparative study in small sample size problem. Image and Vision Computing
  • 30. Derive the Objective Function (1/5) ● From the assumptions, we have :
  • 31. Derive the Objective Function (2/5)
  • 32. Derive the Objective Function (3/5)
  • 33. Derive the Objective Function (4/5)
  • 34. Derive the Objective Function (5/5)