SlideShare uma empresa Scribd logo
1 de 38
1




MACHINE LEARNING FOR
MEDICAL IMAGING DATA
Yiou (Leo) Li
Background
2



    Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory,
    University of California San Francisco
    •   Developed unsupervised learning method for feature extraction of brain
        imaging data
    •   Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive
        modeling of brain trauma
    •   Designed batch data processing protocol to perform image registration,
        segmentation, band-pass filtering, smoothing, and linear model fitting


    Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal
    processing Laboratory, University of Maryland Baltimore County
    •   Developed the effective degrees of freedom of random process and applied it
        to the model order selection by Information Theoretic Criteria
    •   Developed a linear filtering mechanism in independent component analysis for
        feature enhancement
    •   Analyzed canonical correlation analysis for multiple datasets
Outline
3




     Independent component analysis (ICA) and its
      application to sparse feature extraction from
      multivariate dataset


     Multi-set canonical correlation analysis and its
      application to joint pattern extraction from a group of
      datasets


     Order selection of principal component analysis (PCA)
      and its application to data dimension reduction
PCA vs ICA
4




                  PCA                               ICA
           Linear projection                 Linear projection
             (Orthogonal)
       Uncorrelated components           Independent components
             (non sparse)             (sparse, “long tail” distribution)
      Typically analytical solution     Typically iterative solution
                 (SVD)                    (Iterative optimization)
ICA detects independent factors with
    long tails in multivariate dataset
5
Long tail factors are sparse features in
    data samples
6



                                          Weights of
                                           features
                  Data points (N)

                                    ICA
    Sensors (M)         X            =       A         .         S




                                                           Sparse features



                                     X= AS
ICA model
7


           x1   a11 a12 ... a1M   s1 
          x  a       a 22   a 2M   s 2 
           2    21               
           ...        ...          ... 
                                   
           x M  a M1 a M2   a MM  s M 


                x : Observed variables
                A : Mixing matrix
                s : Latent factors



                  x= As -> s =A-1x
ICA by maximum likelihood estimation
8


    Transformation of multivariate random variable: x = As
                              p(s 1, s2 , ... , sM )
    p(x 1,x 2 , ... , x M )                                 (1)
                                    det(A)
    Statistical independence condition of s:

    p(s 1, s2 , ... , sM )  i 1 p(si )
                                          M
                                                             (2)

    Log likelihood function of x with parameter A:

    log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A)
                                                      -1

                                      i
ICA Application: Sparse feature extraction from
    multivariate dataset
9
Functional MRI experiment
10
Analyze functional MRI data of resting
     state brain
11




                              Sparse features




                    ICA
Feature 1. Primary visual network
12



 +
                                         A




 -
Feature 2. “Default mode network”
13
Feature 3. Attention control network
14
Hierarchical clustering shows link
15
     between features (brain regions)
Predicative modeling of brain trauma
16

                                                 Pattern weights
                                 N

       Healthy
                                 X           =      A        .                 S
       Patients

                                                                               Sparse
                                                                           spatial features


     Subject 1

                             …
                 Subject 2
          16                                                                       Pattern 2
                                                                   Feature 1                   …
                                 Subject M                                         Feature 2

                                                           Y.-O. Li, et al., HBM, 2011
ICA Pattern classification for predictive
     modeling of brain trauma
17




      • 29 healthy + 29 trauma, 10-fold cross-validation


             Classifier                9 patterns                14 patterns
                                   Classification error      Classification error
            Naïve Bayes                0.35+/-0.03               0.32+/-0.03

        K nearest neighbor             0.29+/-0.02               0.30+/-0.03

      Support vector classifier         0.36+/-0.02               0.30 +/-0.02
                                  (c=1, number of SV: 46)   (c=1, number of SV: 20)
Outline
18




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to dimension reduction
Joint pattern extraction requires coherency
     on extracted patterns across datasets
19


               Model:   x k =Aksk , k=1,2,...,M




                                     Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation analysis
20




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation
     analysis
21




          Correlation matrix of [S1,S2, … SM]

                                        Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Application: joint pattern extraction from a
     group of datasets
22




     •   Analyze group functional MRI data from
         simulated driving experiment
Simulated driving experiment
23

     •   Forty subjects, three repeated sessions (120 datasets)
     •   Experiment paradigm:




     •   Behavioral records:
          •   Average speed (AS)
          •   Differential of speed (DS)
          •   Average steering offset (AR)
          •   Differential steering offset (DR)
          •   Differential pedal offset (DP)
          •   Occurrence of yellow line crossing (YLC)
          •   Occurrence of white passenger-side line crossing (WPLC)

                                                    Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step I: M-CCA for joint feature extraction
24




                           Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step II: PCA and behavioral association
25




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Pattern 1: Primary visual function
26




                                                   D = 0:85
                                                   W = 0:42




                        95% CI of behavioral association
Pattern 2: “default mode network”
27




                                                  D = -0.63
                                                  W = -0.39




                      95% CI of behavioral association
Pattern 3: Motor coordination
28




                                                  D = 0.86
                                                  W = 0.15




                       95% CI of behavioral association
Pattern 4: Executive control network
29




                                                 D = 0.64
                                                 W = 0.61




                       95% CI of behavioral association
Cross correlation of Pattern 1
30




                       Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Outline
31




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to data dimension reduction
Decreased reproducibility of independent
     component on high-dimensional dataset
32

          •    Functional MRI with 120 time points
          •    Twenty Monte Carlo trials of ICA algorithm
          •    Clustering the IC estimates
          •    Reproducible ICs: compact and separated clusters




        K=20                           K=40                             K=90

                                                     Y.-O. Li, et al., HBM, 2007
Dimension reduction of high-dimensional
     data by PCA
33


                                       ICA              N
               N


       M       X               =            A   .       S


                                       MxM

                       PCA dimension reduction + ICA


                           .       A    .           S
           X   =   E                                        +      N


                                       K-largest PCs            M-k PCs
Failure of information-theoretic criteria with
     uncorrected degrees of freedom
34



                        AIC, MDL
            ˆ
            k  arg min k {l ( x | k )  g (k )}

                                                                            ( M k)
                                                                    
                                                i k 1 
                                                     M       1/ ( M  k )

                          l(x |  k )  N ln  M             i
                                                                     
                                             
                                              i k 1 i / ( M  k) 
                                                                     

                                           AIC : k (2M  k )  1
                            g ( k )  
                                       MDL : 0.5  ln N (k (2M  k )  1)

                                           Y.-O. Li, et al., HBM, 2007
Estimation of degrees of freedom by
     entropy rate
35


                 Entropy rate of a Gaussian process
                                        1       
                     h( x)  ln 2 e 
                                       4        ln s()d
                                               


            h( x)  ln 2 e     iff x[n] is an i.i.d. random process




       h(x) = 0.40               h(x) = 1.28                  h(x) = 1.41
                                                     Y.-O. Li, et al., HBM, 2007
Application: Order selection of high-
     dimensional dataset
36
Corrected order selection criteria
     significantly improves order selection
37




            Original       With correction on degrees of freedom


                                    Y.-O. Li, et al., HBM, 2007
Summary
38




     •   ICA extracts useful patterns from high dimensional imaging data for
         predictive modeling

     •   M-CCA reveals patterns from several datasets in a coherent order

     •   Dimension reduction by PCA improves the reproducibility of ICA extracted
         patterns


         Exploratory multivariate analysis are promising tools for
         data mining applications

Mais conteúdo relacionado

Mais procurados

Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
Armando Vieira
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
Dr. Jai Sakthi
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
Jordan McBain
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
Enrico Scapin
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
Tatsuya Yokota
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
zukun
 
Pres110811
Pres110811Pres110811
Pres110811
shotlub
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
Sandip Joardar
 

Mais procurados (17)

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and Selection
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
 
Eurogen v
Eurogen vEurogen v
Eurogen v
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship Report
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2
 
Pres110811
Pres110811Pres110811
Pres110811
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
 

Destaque

Machine Learning
Machine LearningMachine Learning
Machine Learning
butest
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
butest
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
Dushyant Bheda
 

Destaque (20)

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICO
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processing
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeria
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splines
 
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLSMULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data Analysis
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different Industries
 
Audio mining
Audio miningAudio mining
Audio mining
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis Introduction
 

Semelhante a Machine learning for medical imaging data

Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
United States Air Force Academy
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
ICAC09
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 

Semelhante a Machine learning for medical imaging data (20)

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis
 
Sp18_P1.pptx
Sp18_P1.pptxSp18_P1.pptx
Sp18_P1.pptx
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Presentation on machine learning
Presentation on machine learningPresentation on machine learning
Presentation on machine learning
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
 
Ph.D. Presentation
Ph.D. PresentationPh.D. Presentation
Ph.D. Presentation
 
Dbm630 lecture09
Dbm630 lecture09Dbm630 lecture09
Dbm630 lecture09
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty Detection
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and Related
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdf
 
tutorial.ppt
tutorial.ppttutorial.ppt
tutorial.ppt
 
Spatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understandingSpatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understanding
 
190 195
190 195190 195
190 195
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
 
Sunum
SunumSunum
Sunum
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Machine learning for medical imaging data

  • 1. 1 MACHINE LEARNING FOR MEDICAL IMAGING DATA Yiou (Leo) Li
  • 2. Background 2 Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory, University of California San Francisco • Developed unsupervised learning method for feature extraction of brain imaging data • Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive modeling of brain trauma • Designed batch data processing protocol to perform image registration, segmentation, band-pass filtering, smoothing, and linear model fitting Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal processing Laboratory, University of Maryland Baltimore County • Developed the effective degrees of freedom of random process and applied it to the model order selection by Information Theoretic Criteria • Developed a linear filtering mechanism in independent component analysis for feature enhancement • Analyzed canonical correlation analysis for multiple datasets
  • 3. Outline 3  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 4. PCA vs ICA 4 PCA ICA Linear projection Linear projection (Orthogonal) Uncorrelated components Independent components (non sparse) (sparse, “long tail” distribution) Typically analytical solution Typically iterative solution (SVD) (Iterative optimization)
  • 5. ICA detects independent factors with long tails in multivariate dataset 5
  • 6. Long tail factors are sparse features in data samples 6 Weights of features Data points (N) ICA Sensors (M) X = A . S Sparse features X= AS
  • 7. ICA model 7  x1   a11 a12 ... a1M   s1  x  a a 22 a 2M   s 2   2    21    ...   ...   ...         x M  a M1 a M2 a MM  s M  x : Observed variables A : Mixing matrix s : Latent factors x= As -> s =A-1x
  • 8. ICA by maximum likelihood estimation 8 Transformation of multivariate random variable: x = As p(s 1, s2 , ... , sM ) p(x 1,x 2 , ... , x M )  (1) det(A) Statistical independence condition of s: p(s 1, s2 , ... , sM )  i 1 p(si ) M (2) Log likelihood function of x with parameter A: log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A) -1 i
  • 9. ICA Application: Sparse feature extraction from multivariate dataset 9
  • 11. Analyze functional MRI data of resting state brain 11 Sparse features ICA
  • 12. Feature 1. Primary visual network 12 + A -
  • 13. Feature 2. “Default mode network” 13
  • 14. Feature 3. Attention control network 14
  • 15. Hierarchical clustering shows link 15 between features (brain regions)
  • 16. Predicative modeling of brain trauma 16 Pattern weights N Healthy X = A . S Patients Sparse spatial features Subject 1 … Subject 2 16 Pattern 2 Feature 1 … Subject M Feature 2 Y.-O. Li, et al., HBM, 2011
  • 17. ICA Pattern classification for predictive modeling of brain trauma 17 • 29 healthy + 29 trauma, 10-fold cross-validation Classifier 9 patterns 14 patterns Classification error Classification error Naïve Bayes 0.35+/-0.03 0.32+/-0.03 K nearest neighbor 0.29+/-0.02 0.30+/-0.03 Support vector classifier 0.36+/-0.02 0.30 +/-0.02 (c=1, number of SV: 46) (c=1, number of SV: 20)
  • 18. Outline 18  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to dimension reduction
  • 19. Joint pattern extraction requires coherency on extracted patterns across datasets 19 Model: x k =Aksk , k=1,2,...,M Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 20. Multi-set canonical correlation analysis 20 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 21. Multi-set canonical correlation analysis 21 Correlation matrix of [S1,S2, … SM] Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 22. Application: joint pattern extraction from a group of datasets 22 • Analyze group functional MRI data from simulated driving experiment
  • 23. Simulated driving experiment 23 • Forty subjects, three repeated sessions (120 datasets) • Experiment paradigm: • Behavioral records: • Average speed (AS) • Differential of speed (DS) • Average steering offset (AR) • Differential steering offset (DR) • Differential pedal offset (DP) • Occurrence of yellow line crossing (YLC) • Occurrence of white passenger-side line crossing (WPLC) Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 24. Step I: M-CCA for joint feature extraction 24 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 25. Step II: PCA and behavioral association 25 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 26. Pattern 1: Primary visual function 26 D = 0:85 W = 0:42 95% CI of behavioral association
  • 27. Pattern 2: “default mode network” 27 D = -0.63 W = -0.39 95% CI of behavioral association
  • 28. Pattern 3: Motor coordination 28 D = 0.86 W = 0.15 95% CI of behavioral association
  • 29. Pattern 4: Executive control network 29 D = 0.64 W = 0.61 95% CI of behavioral association
  • 30. Cross correlation of Pattern 1 30 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 31. Outline 31  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 32. Decreased reproducibility of independent component on high-dimensional dataset 32 • Functional MRI with 120 time points • Twenty Monte Carlo trials of ICA algorithm • Clustering the IC estimates • Reproducible ICs: compact and separated clusters K=20 K=40 K=90 Y.-O. Li, et al., HBM, 2007
  • 33. Dimension reduction of high-dimensional data by PCA 33 ICA N N M X = A . S MxM PCA dimension reduction + ICA . A . S X = E + N K-largest PCs M-k PCs
  • 34. Failure of information-theoretic criteria with uncorrected degrees of freedom 34 AIC, MDL ˆ k  arg min k {l ( x | k )  g (k )} ( M k)    i k 1  M 1/ ( M  k ) l(x |  k )  N ln  M i    i k 1 i / ( M  k)    AIC : k (2M  k )  1 g ( k )   MDL : 0.5  ln N (k (2M  k )  1) Y.-O. Li, et al., HBM, 2007
  • 35. Estimation of degrees of freedom by entropy rate 35 Entropy rate of a Gaussian process 1  h( x)  ln 2 e  4   ln s()d  h( x)  ln 2 e iff x[n] is an i.i.d. random process h(x) = 0.40 h(x) = 1.28 h(x) = 1.41 Y.-O. Li, et al., HBM, 2007
  • 36. Application: Order selection of high- dimensional dataset 36
  • 37. Corrected order selection criteria significantly improves order selection 37 Original With correction on degrees of freedom Y.-O. Li, et al., HBM, 2007
  • 38. Summary 38 • ICA extracts useful patterns from high dimensional imaging data for predictive modeling • M-CCA reveals patterns from several datasets in a coherent order • Dimension reduction by PCA improves the reproducibility of ICA extracted patterns Exploratory multivariate analysis are promising tools for data mining applications