SlideShare a Scribd company logo
Analysis of Multiple Experiments TIGR Multiple Experiment Viewer (MeV) Joseph White DFCI January 24,2008
MeV ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Expression Matrix is a representation of data from multiple microarray experiments. Each element is a log ratio (usually log  2  (Cy5 / Cy3) )  Red indicates a  positive log ratio, i.e, Cy5 > Cy3  Green indicates a negative log ratio , i.e., Cy5 < Cy3  Black indicates a log ratio of zero, i. e.,  Cy5 and Cy3 are very  close in value  Gray indicates missing data   Exp 1 Exp 2 Exp 3 Exp 4 Exp 5 Exp 6 Gene 1 Gene 2 Gene 3 Gene 4 Gene 5 Gene 6
Expression Vectors ,[object Object],[object Object],Log2(cy5/cy3) -0.8 0.8 1.5 1.8 0.5 -1.3 -0.4 1.5
Expression Vectors As Points in ‘Expression Space’ Experiment 1 Experiment 2 Experiment 3 Similar Expression -0.8 -0.6 0.9 1.2 -0.3 1.3 -0.7 Exp 1 Exp 2 Exp 3 G1 G2 G3 G4 G5 -0.4 -0.4 -0.8 -0.8 -0.7 1.3 0.9 -0.6
Distance and Similarity  -the ability to calculate a distance (or similarity, it’s inverse) between two expression vectors is fundamental to clustering algorithms -distance between vectors is the basis upon which decisions are made when grouping similar patterns of expression -selection of a  distance metric  defines the concept of distance
Distance: a measure of similarity between genes. ,[object Object],[object Object],3.  Pearson correlation p 0 p 1 Exp 1 Exp 2 Exp 3 Exp 4 Exp 5 Exp 6 Gene A Gene B x 1A x 2A x 3A x 4A x 5A x 6A x 1B x 2B x 3B x 4B x 5B x 6B 6 ,[object Object],6
Distance is Defined by a Metric 4.2 1.4 -1.00 -0.90 Euclidean   Pearson(r*-1) Distance Metric : D D
Normal distribution X =  μ (mean of the distribution) σ  = std. deviation of the distribution
Current MeV Algorithms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Demos ,[object Object],[object Object],[object Object],[object Object],[object Object]
GeneChip Oncology Database
GeneChip Oncology Database
GCOD statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
MeV Team ,[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysis
Dmitry Grapov
 
4 partial least squares modeling
4  partial least squares modeling4  partial least squares modeling
4 partial least squares modeling
Dmitry Grapov
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
feiwin
 

What's hot (18)

High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
 
Strategies for Metabolomics Data Analysis
Strategies for Metabolomics Data AnalysisStrategies for Metabolomics Data Analysis
Strategies for Metabolomics Data Analysis
 
3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysis
 
20151130
2015113020151130
20151130
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
 
Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
 
0 introduction
0  introduction0  introduction
0 introduction
 
Document Classification and Clustering
Document Classification and ClusteringDocument Classification and Clustering
Document Classification and Clustering
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic Manifold
 
Data Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesData Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological Studies
 
4 partial least squares modeling
4  partial least squares modeling4  partial least squares modeling
4 partial least squares modeling
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysis
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
 
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
 
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
 
geolocation twitter network text geotagging
geolocation twitter network text geotagginggeolocation twitter network text geotagging
geolocation twitter network text geotagging
 

Similar to MeV: Joe White

Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Natalio Krasnogor
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015
pdiddyboy2
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
Alexander Decker
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
Alexander Decker
 
Multivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological dataMultivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological data
Dmitry Grapov
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slides
pannicle
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
Computer Science Club
 

Similar to MeV: Joe White (20)

Microarray Analysis
Microarray AnalysisMicroarray Analysis
Microarray Analysis
 
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015
 
Integrative Networks Centric Bioinformatics
Integrative Networks Centric BioinformaticsIntegrative Networks Centric Bioinformatics
Integrative Networks Centric Bioinformatics
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
 
Multivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological dataMultivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological data
 
Basen Network
Basen NetworkBasen Network
Basen Network
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slides
 
Two methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersTwo methods for optimising cognitive model parameters
Two methods for optimising cognitive model parameters
 
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
 
Gene expression profiling ii
Gene expression profiling  iiGene expression profiling  ii
Gene expression profiling ii
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
 
5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data
 
Inria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCInria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCC
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Microarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector MachineMicroarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector Machine
 
annInstance28Nov6pm
annInstance28Nov6pmannInstance28Nov6pm
annInstance28Nov6pm
 
Clustering
ClusteringClustering
Clustering
 

More from niranabey

MAGE-TAB introduction: Alvis Brazma (EBI)
MAGE-TAB introduction: Alvis Brazma (EBI)MAGE-TAB introduction: Alvis Brazma (EBI)
MAGE-TAB introduction: Alvis Brazma (EBI)
niranabey
 
caArray: Juli Klemm (NCICB)
caArray: Juli Klemm (NCICB)caArray: Juli Klemm (NCICB)
caArray: Juli Klemm (NCICB)
niranabey
 
GenePattern: Ted Liefeld
GenePattern: Ted LiefeldGenePattern: Ted Liefeld
GenePattern: Ted Liefeld
niranabey
 
Stanford Microarray Database: Don Maier
Stanford Microarray Database: Don MaierStanford Microarray Database: Don Maier
Stanford Microarray Database: Don Maier
niranabey
 
ArrayExpress: Helen Parkinson
ArrayExpress: Helen ParkinsonArrayExpress: Helen Parkinson
ArrayExpress: Helen Parkinson
niranabey
 
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
niranabey
 
RAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin LiuRAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin Liu
niranabey
 
caBIG: Ian Fore
caBIG: Ian ForecaBIG: Ian Fore
caBIG: Ian Fore
niranabey
 

More from niranabey (8)

MAGE-TAB introduction: Alvis Brazma (EBI)
MAGE-TAB introduction: Alvis Brazma (EBI)MAGE-TAB introduction: Alvis Brazma (EBI)
MAGE-TAB introduction: Alvis Brazma (EBI)
 
caArray: Juli Klemm (NCICB)
caArray: Juli Klemm (NCICB)caArray: Juli Klemm (NCICB)
caArray: Juli Klemm (NCICB)
 
GenePattern: Ted Liefeld
GenePattern: Ted LiefeldGenePattern: Ted Liefeld
GenePattern: Ted Liefeld
 
Stanford Microarray Database: Don Maier
Stanford Microarray Database: Don MaierStanford Microarray Database: Don Maier
Stanford Microarray Database: Don Maier
 
ArrayExpress: Helen Parkinson
ArrayExpress: Helen ParkinsonArrayExpress: Helen Parkinson
ArrayExpress: Helen Parkinson
 
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
TCGA data coordination center: Carl Schaefer and Ari Kahn (NCICB)
 
RAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin LiuRAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin Liu
 
caBIG: Ian Fore
caBIG: Ian ForecaBIG: Ian Fore
caBIG: Ian Fore
 

Recently uploaded

zidauu _business communication.pptx /pdf
zidauu _business  communication.pptx /pdfzidauu _business  communication.pptx /pdf
zidauu _business communication.pptx /pdf
zukhrafshabbir
 
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
Khaled Al Awadi
 

Recently uploaded (20)

Global Interconnection Group Joint Venture[960] (1).pdf
Global Interconnection Group Joint Venture[960] (1).pdfGlobal Interconnection Group Joint Venture[960] (1).pdf
Global Interconnection Group Joint Venture[960] (1).pdf
 
Byrd & Chen’s Canadian Tax Principles 2023-2024 Edition 1st edition Volumes I...
Byrd & Chen’s Canadian Tax Principles 2023-2024 Edition 1st edition Volumes I...Byrd & Chen’s Canadian Tax Principles 2023-2024 Edition 1st edition Volumes I...
Byrd & Chen’s Canadian Tax Principles 2023-2024 Edition 1st edition Volumes I...
 
Did Paul Haggis Ever Win an Oscar for Best Filmmaker
Did Paul Haggis Ever Win an Oscar for Best FilmmakerDid Paul Haggis Ever Win an Oscar for Best Filmmaker
Did Paul Haggis Ever Win an Oscar for Best Filmmaker
 
Unleash Data Power with EnFuse Solutions' Comprehensive Data Management Servi...
Unleash Data Power with EnFuse Solutions' Comprehensive Data Management Servi...Unleash Data Power with EnFuse Solutions' Comprehensive Data Management Servi...
Unleash Data Power with EnFuse Solutions' Comprehensive Data Management Servi...
 
TriStar Gold Corporate Presentation May 2024
TriStar Gold Corporate Presentation May 2024TriStar Gold Corporate Presentation May 2024
TriStar Gold Corporate Presentation May 2024
 
Team-Spandex-Northern University-CS1035.
Team-Spandex-Northern University-CS1035.Team-Spandex-Northern University-CS1035.
Team-Spandex-Northern University-CS1035.
 
LinkedIn Masterclass Techweek 2024 v4.1.pptx
LinkedIn Masterclass Techweek 2024 v4.1.pptxLinkedIn Masterclass Techweek 2024 v4.1.pptx
LinkedIn Masterclass Techweek 2024 v4.1.pptx
 
Hyundai capital 2024 1quarter Earnings release
Hyundai capital 2024 1quarter Earnings releaseHyundai capital 2024 1quarter Earnings release
Hyundai capital 2024 1quarter Earnings release
 
Luxury Artificial Plants Dubai | Plants in KSA, UAE | Shajara
Luxury Artificial Plants Dubai | Plants in KSA, UAE | ShajaraLuxury Artificial Plants Dubai | Plants in KSA, UAE | Shajara
Luxury Artificial Plants Dubai | Plants in KSA, UAE | Shajara
 
zidauu _business communication.pptx /pdf
zidauu _business  communication.pptx /pdfzidauu _business  communication.pptx /pdf
zidauu _business communication.pptx /pdf
 
How to Maintain Healthy Life style.pptx
How to Maintain  Healthy Life style.pptxHow to Maintain  Healthy Life style.pptx
How to Maintain Healthy Life style.pptx
 
HR and Employment law update: May 2024.
HR and Employment law update:  May 2024.HR and Employment law update:  May 2024.
HR and Employment law update: May 2024.
 
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
 
What Does A 3-Band Wedding Band Mean?
What Does A 3-Band Wedding Band Mean?What Does A 3-Band Wedding Band Mean?
What Does A 3-Band Wedding Band Mean?
 
Copyright: What Creators and Users of Art Need to Know
Copyright: What Creators and Users of Art Need to KnowCopyright: What Creators and Users of Art Need to Know
Copyright: What Creators and Users of Art Need to Know
 
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdfMatt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
 
Equinox Gold Corporate Deck May 24th 2024
Equinox Gold Corporate Deck May 24th 2024Equinox Gold Corporate Deck May 24th 2024
Equinox Gold Corporate Deck May 24th 2024
 
12 Conversion Rate Optimization Strategies for Ecommerce Websites.pdf
12 Conversion Rate Optimization Strategies for Ecommerce Websites.pdf12 Conversion Rate Optimization Strategies for Ecommerce Websites.pdf
12 Conversion Rate Optimization Strategies for Ecommerce Websites.pdf
 
Potato Flakes Manufacturing Plant Project Report.pdf
Potato Flakes Manufacturing Plant Project Report.pdfPotato Flakes Manufacturing Plant Project Report.pdf
Potato Flakes Manufacturing Plant Project Report.pdf
 
Special Purpose Vehicle (Purpose, Formation & examples)
Special Purpose Vehicle (Purpose, Formation & examples)Special Purpose Vehicle (Purpose, Formation & examples)
Special Purpose Vehicle (Purpose, Formation & examples)
 

MeV: Joe White

  • 1. Analysis of Multiple Experiments TIGR Multiple Experiment Viewer (MeV) Joseph White DFCI January 24,2008
  • 2.
  • 3.
  • 4. The Expression Matrix is a representation of data from multiple microarray experiments. Each element is a log ratio (usually log 2 (Cy5 / Cy3) ) Red indicates a positive log ratio, i.e, Cy5 > Cy3 Green indicates a negative log ratio , i.e., Cy5 < Cy3 Black indicates a log ratio of zero, i. e., Cy5 and Cy3 are very close in value Gray indicates missing data Exp 1 Exp 2 Exp 3 Exp 4 Exp 5 Exp 6 Gene 1 Gene 2 Gene 3 Gene 4 Gene 5 Gene 6
  • 5.
  • 6. Expression Vectors As Points in ‘Expression Space’ Experiment 1 Experiment 2 Experiment 3 Similar Expression -0.8 -0.6 0.9 1.2 -0.3 1.3 -0.7 Exp 1 Exp 2 Exp 3 G1 G2 G3 G4 G5 -0.4 -0.4 -0.8 -0.8 -0.7 1.3 0.9 -0.6
  • 7. Distance and Similarity -the ability to calculate a distance (or similarity, it’s inverse) between two expression vectors is fundamental to clustering algorithms -distance between vectors is the basis upon which decisions are made when grouping similar patterns of expression -selection of a distance metric defines the concept of distance
  • 8.
  • 9. Distance is Defined by a Metric 4.2 1.4 -1.00 -0.90 Euclidean Pearson(r*-1) Distance Metric : D D
  • 10. Normal distribution X = μ (mean of the distribution) σ = std. deviation of the distribution
  • 11.
  • 12.
  • 15.
  • 16.