SlideShare uma empresa Scribd logo
1 de 45
Baixar para ler offline
Building a search engine to find and
robustly identify environmental factors
with phenotype and disease
Chirag J Patel
Unite for Sight
4/14/2018
chirag@hms.harvard.edu
@chiragjp
www.chiragjpgroup.org
P = G + EType 2 Diabetes
Cancer
Alzheimer’s
Gene expression
Phenotype Genome
Variants
Environment
Infectious agents
Diet + Nutrients
Pollutants
Drugs
We are great at G investigation!
>4000 (as of 1/1/18)
36,066 G-P associations
Genome-wide Association Studies (GWAS)
https://www.ebi.ac.uk/gwas/
G
Nothing comparable to elucidate E influence!
E: ???
We lack high-throughput methods
and data to discover new E in P…
A similar paradigm for discovery should exist
for E!
Why?
σ2
P = σ2
G + σ2
E
…
σ2
G
σ2
P
H2 =
Heritability (H2) is the range of phenotypic
variability attributed to genetic variability in a
population
Indicator of the proportion of phenotypic
differences attributed to G.
Height is an example of a heritable trait:
Francis Galton shows how its done (1887)
“mid-height of 205 parents
described 60% of variability of 928
offspring”
What else describes height?
Source: SNPedia.com
Heritability estimates for burdensome diseases are low and variable
Type 2 Diabetes (25%)
Heart Disease (25-30%)
cancer?
Source: SNPedia.com
G estimates for complex disease (P) are low and variable:
massive opportunity for high-throughput E discovery
σ2
E
What describes this variation NOT explained by
genetics?
Physical activity?
Is it coffee…?
HR: 0.9 in N=500K
HR: 0.9 in N=50K
Chemicals?
EPA Chemical Substances List (~80K)
via tylervigen.com
… we just don’t know
… we just don’t know
xkcd.com
We just don’t know:
Is everything we are exposed to associated with cancer?
Schoenfeld and Ioannidis, AJCN 2012
50 random ingredients from
Boston Cooking School
Cookbook
Any associated with cancer?
Of 50, 40 studied in cancer risk
Weak statistical evidence:
non-replicated
inconsistent effects
non-standardized
… we just don’t know
http://fivethirtyeight.com/features/you-cant-trust-what-you-read-about-nutrition/
The problem remains:
(1) what explains the missing variation in phenotype…
σ2
E
So the problem remains:
(2) and how do we find the stuff that matters?
E: ???
Diet
Infection
Pollution
Drugs
We are great at G investigation!
>4000 (as of 1/1/18)
36,066 G-P associations
Genome-wide Association Studies (GWAS)
https://www.ebi.ac.uk/gwas/
G
How did genetics-based investigations advance?
(And advance so quickly?)
A new paradigm of GWAS for discovery of G in P:
Human Genome Project to GWAS
Sequencing of the genome
2001
HapMap project:
http://hapmap.ncbi.nlm.nih.gov/
Characterize common variation
2001-current day
High-throughput variant
assay
< $99 for ~1M variants
Measurement tools
~2003 (ongoing)
Nature 2008
Comprehensive, high-throughput analyses
GWAS
How can we do better in both discovery and
translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Explaining the missing variation:
A data-driven paradigm for robust discovery of E in disease
via systematic study of the “exposome”
what to measure? how to measure?
“A more comprehensive view of
environmental exposure is
needed ... to discover major
causes of diseases...”
how to analyze in relation to health?
Wild, 2005, 2012
Rappaport and Smith, 2010, 2011
Buck-Louis and Sundaram 2012
Miller and Jones, 2014
Patel CJ and Ioannidis JPAI, 2014
Possible to use existing technologies for E
Exposure (and P) Assessment…
CEBP 2017
… however, heterogeneous measures that require different
study designs and analytic approaches.
Promises and Challenges in creating a search engine for
identifying E in P
JAMA 2014
ARPH 2016
JECH 2014
Curr Epidemiol Rep 2017
Examples of data-driven discovery for E associations
Gold standard for breadth of human exposure information:
National Health and Nutrition Examination Survey1
since the 1960s
now biannual: 1999 onwards
10,000 participants per survey
1 http://www.cdc.gov/nchs/nhanes.htm
>250 exposures (serum + urine)
GWAS chip
>200 quantitative clinical traits
(e.g., serum glucose, lipids, body
mass index)
Death index linkage (cause of
death)
Gold standard for breadth of exposure & behavior data:
National Health and Nutrition Examination Survey
Nutrients and Vitamins
vitamin D, carotenes
Infectious Agents
hepatitis, HIV, Staph. aureus
Plastics and consumables
phthalates, bisphenol A
Physical Activity
e.g., stepsPesticides and pollutants
atrazine; cadmium; hydrocarbons
Drugs
statins; aspirin
What E are associated with aging:
all-cause mortality, heart disease, and
telomere length?
Int J Epidem 2013
Int J Epidem 2016
Identifying E associated with all-cause mortality:
Data-driven searching through 253 associations
age (10 years)
income (quintile 2)
income (quintile 1)
male
black income (quintile 3)
any one smoke in home?
Multivariate cox (age, sex, income, education, race/ethnicity, occupation [in red])
serum and urine cadmium
[1 SD]
past smoker?
current smoker?serum lycopene
[1SD]
physical activity
[low, moderate, high activity]*
*derived from METs per activity and categorized by Health.gov guidelines
R2 ~ 14%
(2%)
R2 < 10-20%
Required
What about other factors related to aging?:
452 associations in Telomere Length!
Int J Epidem 2016
PCBs
FDR<5%
Trunk Fat
Alk. PhosCRP
Cadmium
Cadmium (urine)cigs per day
retinyl stearate
R2 ~ 1%
VO2 Maxpulse rate
shorter telomeres longer telomeres
adjusted by age, age2, race, poverty, education, occupation
median N=3000; N range: 300-7000
2-8 years
Interdependencies of the exposome:
Correlation globes paint a complex view of exposure
Red: positive ρ
Blue: negative ρ
thickness: |ρ|
for each pair of E:
Spearman ρ
(575 factors: 81,937 correlations)
permuted data to produce
“null ρ”
sought replication in > 1
cohort
Pac Symp Biocomput. 2015
JECH. 2015
Red: positive ρ
Blue: negative ρ
thickness: |ρ|
for each pair of E:
Spearman ρ
(575 factors: 81,937 correlations)
Interdependencies of the exposome:
Correlation globes paint a complex view of exposure:
average correlation of < 0.3
permuted data to produce
“null ρ”
sought replication in > 1
cohort
Pac Symp Biocomput. 2015
JECH. 2015
Effective number of
variables:
500 (10% decrease)
How can we do better in both discovery and translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and
explain variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to
detecting signals in observational and
large data
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and explain
variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to detecting
signals in observational and large data
ARPH 2016
JAMA 2014
JECH 2015
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and explain
variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to
detecting signals in observational and
large data ARPH 2016
JAMA 2014
JECH 2015
How can we do better in both discovery and translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Explaining the missing variation:
A data-driven paradigm for robust discovery of E in disease
via systematic study of the “exposome”
what to measure? how to measure?
“A more comprehensive view of
environmental exposure is
needed ... to discover major
causes of diseases...”
how to analyze in relation to health?
Wild, 2005, 2012
Rappaport and Smith, 2010, 2011
Buck-Louis and Sundaram 2012
Miller and Jones, 2014
Patel CJ and Ioannidis JPAI, 2014
Need to assess the exposome globally:
(e.g., India and China)
c/o Getty Images c/o AFP
… and Sub-Saharan Africa!
Can we predict HIV as a function of the exposome?
AIDS 2018
Harvard DBMI
Susanne Churchill
Nathan Palmer
Sophia Mamousette
Sunny Alvear
Chirag J Patel
chirag@hms.harvard.edu
@chiragjp
www.chiragjpgroup.org
NIH Common Fund
Big Data to Knowledge
Acknowledgements
RagGroup
Arjun Manrai
Nam Pho
Jake Chung
Kajal Claypool
Chirag Lakhani
Danielle Rasooly
Alan LeGoallec
Sivateja Tangirala
Mentioned Collaborators
Isaac Kohane
John Ioannidis
Dennis Bier
Hugo Aschard

Mais conteúdo relacionado

Mais procurados

dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET
 
Protein-Protein Interaction Presentation
Protein-Protein Interaction PresentationProtein-Protein Interaction Presentation
Protein-Protein Interaction Presentation
Usman (Ali) Ahmed
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaper
Mathias Hibbard
 
MathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinalMathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinal
Mathias Hibbard
 
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Andrew Su
 

Mais procurados (20)

Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposures
 
Informatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryInformatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discovery
 
AACR 041616 digital exposomes
AACR 041616 digital exposomesAACR 041616 digital exposomes
AACR 041616 digital exposomes
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven E
 
Searching for predictors of male fecundity
Searching for predictors of male fecunditySearching for predictors of male fecundity
Searching for predictors of male fecundity
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
 
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databases
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Protein-Protein Interaction Presentation
Protein-Protein Interaction PresentationProtein-Protein Interaction Presentation
Protein-Protein Interaction Presentation
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaper
 
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
 
MathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinalMathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinal
 
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
 

Semelhante a Chirag patel unite for sight 041418

(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
Hyung Jin Choi
 
Sexual Risk Behaviors Research
Sexual Risk Behaviors Research Sexual Risk Behaviors Research
Sexual Risk Behaviors Research
Brittney Johns
 
Alzheimer
AlzheimerAlzheimer
Alzheimer
Harold
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
muink
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
Sean Paul
 
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docxSheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
bjohn46
 

Semelhante a Chirag patel unite for sight 041418 (20)

Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
 
Studying the elusive in larger scale
Studying the elusive in larger scaleStudying the elusive in larger scale
Studying the elusive in larger scale
 
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
 
Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...
 
Sexual Risk Behaviors Research
Sexual Risk Behaviors Research Sexual Risk Behaviors Research
Sexual Risk Behaviors Research
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
 
Green
GreenGreen
Green
 
Alzheimer
AlzheimerAlzheimer
Alzheimer
 
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v320160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
 
의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래
 
Data Science in Drug Discovery
Data Science in Drug DiscoveryData Science in Drug Discovery
Data Science in Drug Discovery
 
Introduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceIntroduction to Epidemiology and Surveillance
Introduction to Epidemiology and Surveillance
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
 
Module5_Study_Design.pptx
Module5_Study_Design.pptxModule5_Study_Design.pptx
Module5_Study_Design.pptx
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...
 
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docxSheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
 

Último

💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
Sheetaleventcompany
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Sheetaleventcompany
 
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Sheetaleventcompany
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
Sheetaleventcompany
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan 087776558899
 
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Sheetaleventcompany
 

Último (20)

Gastric Cancer: Сlinical Implementation of Artificial Intelligence, Synergeti...
Gastric Cancer: Сlinical Implementation of Artificial Intelligence, Synergeti...Gastric Cancer: Сlinical Implementation of Artificial Intelligence, Synergeti...
Gastric Cancer: Сlinical Implementation of Artificial Intelligence, Synergeti...
 
Call girls Service Phullen / 9332606886 Genuine Call girls with real Photos a...
Call girls Service Phullen / 9332606886 Genuine Call girls with real Photos a...Call girls Service Phullen / 9332606886 Genuine Call girls with real Photos a...
Call girls Service Phullen / 9332606886 Genuine Call girls with real Photos a...
 
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
 
💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Piya 📲🔝8868886958🔝Call Girls In Chandigarh No...
 
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
 
tongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacytongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacy
 
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
 
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
 
Bhawanipatna Call Girls 📞9332606886 Call Girls in Bhawanipatna Escorts servic...
Bhawanipatna Call Girls 📞9332606886 Call Girls in Bhawanipatna Escorts servic...Bhawanipatna Call Girls 📞9332606886 Call Girls in Bhawanipatna Escorts servic...
Bhawanipatna Call Girls 📞9332606886 Call Girls in Bhawanipatna Escorts servic...
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
 
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
 
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptxANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
 
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
 
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
 
💚Reliable Call Girls Chandigarh 💯Niamh 📲🔝8868886958🔝Call Girl In Chandigarh N...
💚Reliable Call Girls Chandigarh 💯Niamh 📲🔝8868886958🔝Call Girl In Chandigarh N...💚Reliable Call Girls Chandigarh 💯Niamh 📲🔝8868886958🔝Call Girl In Chandigarh N...
💚Reliable Call Girls Chandigarh 💯Niamh 📲🔝8868886958🔝Call Girl In Chandigarh N...
 
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
 
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
 
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
 

Chirag patel unite for sight 041418

  • 1. Building a search engine to find and robustly identify environmental factors with phenotype and disease Chirag J Patel Unite for Sight 4/14/2018 chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org
  • 2. P = G + EType 2 Diabetes Cancer Alzheimer’s Gene expression Phenotype Genome Variants Environment Infectious agents Diet + Nutrients Pollutants Drugs
  • 3. We are great at G investigation! >4000 (as of 1/1/18) 36,066 G-P associations Genome-wide Association Studies (GWAS) https://www.ebi.ac.uk/gwas/ G
  • 4. Nothing comparable to elucidate E influence! E: ??? We lack high-throughput methods and data to discover new E in P…
  • 5. A similar paradigm for discovery should exist for E! Why?
  • 6. σ2 P = σ2 G + σ2 E …
  • 7. σ2 G σ2 P H2 = Heritability (H2) is the range of phenotypic variability attributed to genetic variability in a population Indicator of the proportion of phenotypic differences attributed to G.
  • 8. Height is an example of a heritable trait: Francis Galton shows how its done (1887) “mid-height of 205 parents described 60% of variability of 928 offspring” What else describes height?
  • 9. Source: SNPedia.com Heritability estimates for burdensome diseases are low and variable Type 2 Diabetes (25%) Heart Disease (25-30%) cancer?
  • 10. Source: SNPedia.com G estimates for complex disease (P) are low and variable: massive opportunity for high-throughput E discovery σ2 E
  • 11. What describes this variation NOT explained by genetics?
  • 13. Is it coffee…? HR: 0.9 in N=500K
  • 14. HR: 0.9 in N=50K
  • 16. via tylervigen.com … we just don’t know
  • 17. … we just don’t know xkcd.com
  • 18. We just don’t know: Is everything we are exposed to associated with cancer? Schoenfeld and Ioannidis, AJCN 2012 50 random ingredients from Boston Cooking School Cookbook Any associated with cancer? Of 50, 40 studied in cancer risk Weak statistical evidence: non-replicated inconsistent effects non-standardized
  • 19. … we just don’t know http://fivethirtyeight.com/features/you-cant-trust-what-you-read-about-nutrition/
  • 20. The problem remains: (1) what explains the missing variation in phenotype… σ2 E
  • 21. So the problem remains: (2) and how do we find the stuff that matters? E: ??? Diet Infection Pollution Drugs
  • 22. We are great at G investigation! >4000 (as of 1/1/18) 36,066 G-P associations Genome-wide Association Studies (GWAS) https://www.ebi.ac.uk/gwas/ G How did genetics-based investigations advance? (And advance so quickly?)
  • 23. A new paradigm of GWAS for discovery of G in P: Human Genome Project to GWAS Sequencing of the genome 2001 HapMap project: http://hapmap.ncbi.nlm.nih.gov/ Characterize common variation 2001-current day High-throughput variant assay < $99 for ~1M variants Measurement tools ~2003 (ongoing) Nature 2008 Comprehensive, high-throughput analyses GWAS
  • 24. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 25. Explaining the missing variation: A data-driven paradigm for robust discovery of E in disease via systematic study of the “exposome” what to measure? how to measure? “A more comprehensive view of environmental exposure is needed ... to discover major causes of diseases...” how to analyze in relation to health? Wild, 2005, 2012 Rappaport and Smith, 2010, 2011 Buck-Louis and Sundaram 2012 Miller and Jones, 2014 Patel CJ and Ioannidis JPAI, 2014
  • 26. Possible to use existing technologies for E Exposure (and P) Assessment… CEBP 2017 … however, heterogeneous measures that require different study designs and analytic approaches.
  • 27. Promises and Challenges in creating a search engine for identifying E in P JAMA 2014 ARPH 2016 JECH 2014 Curr Epidemiol Rep 2017
  • 28. Examples of data-driven discovery for E associations
  • 29. Gold standard for breadth of human exposure information: National Health and Nutrition Examination Survey1 since the 1960s now biannual: 1999 onwards 10,000 participants per survey 1 http://www.cdc.gov/nchs/nhanes.htm >250 exposures (serum + urine) GWAS chip >200 quantitative clinical traits (e.g., serum glucose, lipids, body mass index) Death index linkage (cause of death)
  • 30. Gold standard for breadth of exposure & behavior data: National Health and Nutrition Examination Survey Nutrients and Vitamins vitamin D, carotenes Infectious Agents hepatitis, HIV, Staph. aureus Plastics and consumables phthalates, bisphenol A Physical Activity e.g., stepsPesticides and pollutants atrazine; cadmium; hydrocarbons Drugs statins; aspirin
  • 31. What E are associated with aging: all-cause mortality, heart disease, and telomere length? Int J Epidem 2013 Int J Epidem 2016
  • 32. Identifying E associated with all-cause mortality: Data-driven searching through 253 associations age (10 years) income (quintile 2) income (quintile 1) male black income (quintile 3) any one smoke in home? Multivariate cox (age, sex, income, education, race/ethnicity, occupation [in red]) serum and urine cadmium [1 SD] past smoker? current smoker?serum lycopene [1SD] physical activity [low, moderate, high activity]* *derived from METs per activity and categorized by Health.gov guidelines R2 ~ 14% (2%)
  • 34. What about other factors related to aging?: 452 associations in Telomere Length! Int J Epidem 2016 PCBs FDR<5% Trunk Fat Alk. PhosCRP Cadmium Cadmium (urine)cigs per day retinyl stearate R2 ~ 1% VO2 Maxpulse rate shorter telomeres longer telomeres adjusted by age, age2, race, poverty, education, occupation median N=3000; N range: 300-7000 2-8 years
  • 35. Interdependencies of the exposome: Correlation globes paint a complex view of exposure Red: positive ρ Blue: negative ρ thickness: |ρ| for each pair of E: Spearman ρ (575 factors: 81,937 correlations) permuted data to produce “null ρ” sought replication in > 1 cohort Pac Symp Biocomput. 2015 JECH. 2015
  • 36. Red: positive ρ Blue: negative ρ thickness: |ρ| for each pair of E: Spearman ρ (575 factors: 81,937 correlations) Interdependencies of the exposome: Correlation globes paint a complex view of exposure: average correlation of < 0.3 permuted data to produce “null ρ” sought replication in > 1 cohort Pac Symp Biocomput. 2015 JECH. 2015 Effective number of variables: 500 (10% decrease)
  • 37. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 38. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data
  • 39. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data ARPH 2016 JAMA 2014 JECH 2015
  • 40. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data ARPH 2016 JAMA 2014 JECH 2015
  • 41. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 42. Explaining the missing variation: A data-driven paradigm for robust discovery of E in disease via systematic study of the “exposome” what to measure? how to measure? “A more comprehensive view of environmental exposure is needed ... to discover major causes of diseases...” how to analyze in relation to health? Wild, 2005, 2012 Rappaport and Smith, 2010, 2011 Buck-Louis and Sundaram 2012 Miller and Jones, 2014 Patel CJ and Ioannidis JPAI, 2014
  • 43. Need to assess the exposome globally: (e.g., India and China) c/o Getty Images c/o AFP
  • 44. … and Sub-Saharan Africa! Can we predict HIV as a function of the exposome? AIDS 2018
  • 45. Harvard DBMI Susanne Churchill Nathan Palmer Sophia Mamousette Sunny Alvear Chirag J Patel chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org NIH Common Fund Big Data to Knowledge Acknowledgements RagGroup Arjun Manrai Nam Pho Jake Chung Kajal Claypool Chirag Lakhani Danielle Rasooly Alan LeGoallec Sivateja Tangirala Mentioned Collaborators Isaac Kohane John Ioannidis Dennis Bier Hugo Aschard