Keynote Talk presented at the 1st Annual BiVi Community Annual Meeting (17 December 2014)
http://bivi.co/page/bivi-annual-meeting-16-17th-december-2014
Visualization Approaches for Biomedical Omics Data: Putting It All Together
The rapid proliferation of high quality, low cost genome-wide measurement technologies such as whole-genome and transcriptome sequencing, as well as advances in epigenomics and proteomics, are enabling researchers to perform studies that generate heterogeneous datasets for cohorts of thousands of individuals. A common feature of these studies is that a collection of genome-wide, molecular data types and phenotypic or clinical characterizations are available for each individual. These data can be used to identify the molecular basis of diseases and to characterize and describe the variations that are relevant for improved diagnosis, prognosis and targeted treatment of patients. An example for a study in which this approach has been successfully applied is The Cancer Genome Atlas project (http://cancergenome.nih.gov).
In my talk I will discuss how visualization approaches can be applied to enable exploration and support analysis of data generated by such studies. Specifically, I will review techniques and tools for visual exploration of individual omics data types, their ability to scale to large numbers of individuals or samples, and emerging techniques that integrate multiple omics data types for interactive visual analysis. I will also examine technical and legal challenges that developers of such visualization tools are facing. To conclude my talk, I will outline research opportunities for the biological data visualization community that address major challenges in this domain.
The Mariana Trench remarkable geological features on Earth.pptx
Visualization Approaches for Biomedical Omics Data: Putting It All Together
1. Visualization Approaches for
Biomedical Omics Data:
Putting It All Together
Nils Gehlenborg
Harvard Medical School
Center for Biomedical Informatics
!nils_gehlenborg
5. “In every chain of reasoning, the evidence of the last conclusion can
be no greater than that of the weakest link of the chain, whatever
may be the strength of the rest.”
- Thomas Reid, Essays on the Intellectual Powers of Man (1786)
6. Human
"
INTERPRETATION
Data
COMPUTATION GENERATION
#
Machine
Hypotheses
Discoveries
Knowledge
Cognition
28. Proteome
HOW?
mass spectrometry of peptides
array-based techniques
WHAT?
presence of peptides & proteins
abundance of peptides & proteins
29.
30. Genome
Transcriptome
Proteome
Metabolome
What is the DNA sequence?
Which genes are active?
Which proteins are present?
Which metabolites can be identified?
31. Metabolome
HOW?
mass spectrometry
NMR spectroscopy
WHAT?
presence of metabolites
abundance of metabolites
32. Genome
Transcriptome
Proteome
Metabolome
What is the DNA sequence?
Which genes are active?
Which proteins are present?
Which metabolites can be identified?
Interactome Which molecules are interacting?
33. Interactome
HOW?
mass spectrometry, yeast-2-hybrid
text mining
WHAT?
links between molecules
34. Epigenome
Genome
Transcriptome
Proteome
Metabolome
How are DNA and associated proteins modified?
What is the DNA sequence?
Which genes are active?
Which proteins are present?
Which metabolites can be identified?
Interactome Which molecules are interacting?
35. Epigenome
HOW?
ChIP-seq, ChIP-chip (histones modifications)
bisulfite sequencing (DNA methylation)
WHAT?
histone modifications along genome
DNA methylation patterns along genome
39. Nucleome
Epigenome
Genome
Transcriptome
Proteome
Metabolome
How is the DNA organized in space/time?
How are DNA and associated proteins modified?
What is the DNA sequence?
Which genes are active?
Which proteins are present?
Which metabolites can be identified?
Interactome Which molecules are interacting?
40. Nucleome
HOW?
3C/4C/5C chromosome conformation capture
Hi-C sequencing
WHAT?
contact probabilities for different parts
of the genome
41. Lieberman-Aiden et al., Comprehensive Mapping of Long-Range Interactions
Reveals Folding Principles of the Human Genome, 2009
58. StratomeX
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, PJ Park, N Gehlenborg, “Guided Visual
Exploration of Genomic Stratifications in Cancer“, Nature Methods 11:884-885 (2014)
A Lex, M Streit, H-J Schulz, C Partl, D Schmalstieg, PJ Park, N Gehlenborg, “StratomeX: Visual Anal-ysis
of Large-Scale Heterogeneous Genomics Data for Cancer Subtype Characterization“, Computer
Graphics Forum 31:1175-1184 (2012)
59.
60. Is there a mutation that overlaps with this mRNA cluster?
Is there a mutually exclusive mutation?
Is there a CNV that affects survival?
Is there a pathway that is enriched in this cluster?
Guided Exploration
Query
Rank
Visualize
Stratifications
Clinical Params
Pathways
77. Acknowledgements
Harvard SEAS Alexander Lex, Hanspeter Pfister
MD Anderson Cancer Center
University of Rostock
Psalm Haseley, Richard Park, Peter J Park
Michael S Noble, Douglas Voet, Lihua Zou, Spring Liu, Hailei
Zhang, Sachet Shukla, Aaron McKenna, Andrew Cherniak,
Pei Lin, Gad Getz
Jianhua Zhang, Terrence Wu, Ian Watson, Steven Quayle,
Lynda Chin
Harvard Medical School
Broad Institute of MIT & Harvard
Graz University of Technology Christian Partl, Dieter Schmalstieg
Johannes Kepler University Linz Samuel Gratzl, Stefan Luger, Marc Streit
Hans-Jörg Schulz
Harvard School of Public Health
Funding
NIH/NHGRI K99 HG007583
Ilya Sytchev, Shannan Ho Sui, Winston Hide