SlideShare uma empresa Scribd logo
1 de 25
NIH’s Day, Months, Four Years of 
Data 
Philip E. Bourne Ph.D. 
Associate Director for Data Science 
National Institutes of Health
What is NIH’s Overall Approach to 
Data and What Does It Mean to You?
Some Context: NIH Data Science History 
6/12 2/14 3/14 
• Findings: 
• Sharing data & software through catalogs 
• Support methods and applications development 
• Need more training 
• Need campus-wide IT strategy 
• Hire CSIO 
• Continued support throughout the lifecycle
My Bias 
 Still a scientist 
 A funder who still thinks like a PI 
 Not yet attuned to the federal system 
 Big supporter of open science through prior work with 
the Public Library of Science, FORCE11 etc.
Data – A Few Observations … 
 We talk about the promise of big data, but we don’t 
even know the value of little data (aka could “Big 
Data” be the new “AI”) 
 Good data is expensive in terms of time and money 
 Looking at data retroactively is really expensive 
 Good data begats trust; trust begats community; 
community is God 
 The way we support scientific data currently is not 
sustainable 
 That is, is no business model currently for scientific 
data
Data – A Few NIH Observations … 
1. We have little idea how much we spend on data – 
estimated over $1bn per year 
2. We have even less idea how much we should be 
spending 
 Point 2 is part of a culture clash between the more 
observational history of biomedicine and the new 
analytical approach to discovery
Data – An Academic Medical Center 
Observation 
 A digital enterprise exists when data connections are 
made across different areas of the organization such 
that productivity and competitiveness are improved 
 For example, between education and research 
 I am not aware that any such academic institutions 
exist? 
 Many are starting to wake up to the idea of getting 
there 
JAMIA 2014, 21(2), 194
ADDS Mission 
Statement 
To foster an ecosystem that enables 
biomedical research to be conducted 
as a digital enterprise that enhances 
health, lengthens life and reduces 
illness and disability
What Problems Are We Trying to Solve? 
One Possible Solution 
 Sustainability – 50% business model 
 Efficiency – sharing best practices in longitudinal 
clinical studies; “trusted investigator” 
 Collaboration - identification of collaborators at the 
point of data collection not publication 
 Reproducibility – data accessible with publication 
 Integration – phenotype homogenization 
 Accessibility – clinical trials registration 
 Quality – sharing CDEs across institutes 
 Training – keeping trainees in the ecosystem
The Data Ecosystem 
Community Policy 
Infrastructure 
• Sustainable 
business 
model 
• Collaboration 
• Training
The Data Ecosystem 
Community Policy 
Infrastructure 
• Sustainable 
business 
model 
• Collaboration 
• Training 
Virtuous 
Research 
Cycle
The Virtuous Cycle 
http://goo.gl/fkWjhS
Raw Materials to Seed the Ecosystem 
 NIH mandate & support 
 ADDS team of 8 people 
 Intramural participation of over 100 team members 
across ICs 
 Funding through BD2K: 
– ~$30M in FY14 
– ~$80M in FY15 
– ....
Organization to Seed the Ecosystem…
Associate Director for Data Science 
Scientific Data Council External Advisory Board 
Programmatic Theme 
Sustainability Education Innovation Process 
Deliverable 
Commons Training 
BD2K Efficiency 
Example Features • IC’s 
• Cloud – Data & 
Compute 
• Search 
• Security 
• Reproducibility 
Standards 
• App Store 
• Coordinate 
• Hands-on 
• Syllabus 
• MOOCs 
• Community 
• Centers 
• Training Grants 
• Catalogs 
• Standards 
• Analysis 
• Data 
Resource 
Support 
• Metrics 
• Best 
Practices 
• Evaluation 
• Portfolio 
Analysis 
Collaboration 
Partnerships 
• Researchers 
• Federal 
Agencies 
• International 
Partners 
• Computer 
Scientists 
The Biomedical Research Digital Enterprise
Associate Director for Data Science 
Scientific Data Council External Advisory Board 
Programmatic Theme 
Sustainability Education Innovation Process 
Deliverable 
Commons Training 
BD2K Efficiency 
Example Features • IC’s 
• Cloud – Data & 
Compute 
• Search 
• Security 
• Reproducibility 
Standards 
• App Store 
• Coordinate 
• Hands-on 
• Syllabus 
• MOOCs 
• Community 
• Centers 
• Training Grants 
• Catalogs 
• Standards 
• Analysis 
• Data 
Resource 
Support 
• Metrics 
• Best 
Practices 
• Evaluation 
• Portfolio 
Analysis 
Collaboration 
Partnerships 
• Researchers 
• Federal 
Agencies 
• International 
Partners 
• Computer 
Scientists 
The Biomedical Research Digital Enterprise
Example Communities 
– NIH 
• 27 ICs 
– Agencies 
• NSF 
• DOE 
• DARPA 
• NIST 
– Government 
• OSTP 
• HHS HDI 
• ONC 
• CDC 
• FDA 
– Private sector 
• Phrma 
• Google 
• Amazon 
– Organizations 
• PCORI, GA4GH 
• RDA, ELIXIR 
• CCC 
• CATS 
• FASEB, ISCB 
• Biophysical Society 
• Sloan Foundation 
• Moore Foundation
Example Policies 
– Clinical data harmonization 
– DbGaP in the cloud 
– Data citation 
– Machine readable data sharing plans on all grants 
– New review models, audiences etc. 
• Open review 
• Micro funding 
• Standing data committees to explore best 
practices 
• Crowd sourcing
Example Infrastructure: The Commons 
Data 
The Long Tail 
Core Facilities/HS Centers 
Clinical /Patient 
The Why: 
Data Sharing Plans 
The How: 
NIH Knowledge 
The 
Government 
Commons 
Data 
Discovery 
Index 
The End Game: 
Scientific 
Discovery 
Usability 
Quality 
Security/ 
Privacy 
Sustainable 
Storage 
Awardees 
Private 
Sector Metrics/ 
Standards 
Rest of 
Academia 
Software Standards 
Index 
BD2K 
Centers 
Cloud, Research Objects, 
Business Models
What Does the Commons Enable? 
 Dropbox like storage 
 The opportunity to apply quality metrics 
 Bring compute to the data 
 A place to collaborate 
 A place to discover 
http://100plus.com/wp-content/uploads/Data-Commons-3- 
1024x825.png
One Possible Commons Business Model 
[Adapted from George Komatsoulis] 
HPC, Institution …
Pilots Around A Virtuous Cycle 
Expect a FY15 Funding Call to Work in 
the Commons
TTrraaiinniinngg && DDiivveerrssiittyy 
 Training & Diversity Goals: 
– Develop a sufficient cadre of diverse researchers skilled in 
the science of Big Data 
– Elevate general competencies in data usage and analysis 
across the biomedical research workforce 
– Combat the Google bus 
 How: 
– Traditional training grants 
– Work with IC’s on a needs assessment 
– Standards for course descriptions with EU 
– Work with institutions on raising awareness 
– Partner with minority institutions 
– Virtual/physical training center(s)?
Closing Question 
Calls for increased NIH funding has so 
far gone unheeded, what can the 
ADDS do (that you have not heard 
about) to improve data science 
activities?
NNIIHH…… 
philip.bourne@nih.gov 
TTuurrnniinngg DDiissccoovveerryy IInnttoo HHeeaalltthh

Mais conteúdo relacionado

Mais procurados

Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointMark Parsons
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?Philip Bourne
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsJisc
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data EnterprisePhilip Bourne
 
Certifying and Securing a Trusted Environment for Health Informatics Research...
Certifying and Securing a Trusted Environment for Health Informatics Research...Certifying and Securing a Trusted Environment for Health Informatics Research...
Certifying and Securing a Trusted Environment for Health Informatics Research...Jisc
 
Data and information governance: getting this right to support an information...
Data and information governance: getting this right to support an information...Data and information governance: getting this right to support an information...
Data and information governance: getting this right to support an information...Jisc
 
Methods for measuring citizen-science impact
Methods for measuring citizen-science impactMethods for measuring citizen-science impact
Methods for measuring citizen-science impactLuigi Ceccaroni
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Academy of Science of South Africa (ASSAf)
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
 
Health Policy and Management as it Relates to Big Data
Health Policy and Management as it Relates to Big DataHealth Policy and Management as it Relates to Big Data
Health Policy and Management as it Relates to Big DataPhilip Bourne
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthPhilip Bourne
 
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkRDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkASIS&T
 
Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Philip Bourne
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality Paul Courtney
 
Making Data Meaningful
Making Data MeaningfulMaking Data Meaningful
Making Data MeaningfulAmanda Makulec
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global HealthPhilip Bourne
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Philip Bourne
 

Mais procurados (20)

Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the Point
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analytics
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
 
Certifying and Securing a Trusted Environment for Health Informatics Research...
Certifying and Securing a Trusted Environment for Health Informatics Research...Certifying and Securing a Trusted Environment for Health Informatics Research...
Certifying and Securing a Trusted Environment for Health Informatics Research...
 
Data and information governance: getting this right to support an information...
Data and information governance: getting this right to support an information...Data and information governance: getting this right to support an information...
Data and information governance: getting this right to support an information...
 
Methods for measuring citizen-science impact
Methods for measuring citizen-science impactMethods for measuring citizen-science impact
Methods for measuring citizen-science impact
 
African Open Science Platform
African Open Science PlatformAfrican Open Science Platform
African Open Science Platform
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...
 
Data and communication of research: incentives and disincentives
Data and communication of research: incentives and disincentivesData and communication of research: incentives and disincentives
Data and communication of research: incentives and disincentives
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical Research
 
Health Policy and Management as it Relates to Big Data
Health Policy and Management as it Relates to Big DataHealth Policy and Management as it Relates to Big Data
Health Policy and Management as it Relates to Big Data
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
 
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkRDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
 
Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Data Science BD2K Update for NIH
Data Science BD2K Update for NIH
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
 
Kotarski2011
Kotarski2011Kotarski2011
Kotarski2011
 
Making Data Meaningful
Making Data MeaningfulMaking Data Meaningful
Making Data Meaningful
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global Health
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 

Destaque

Training Quantitative Scientists for Biomedical Science Through the BD2K Init...
Training Quantitative Scientists for Biomedical Science Through the BD2K Init...Training Quantitative Scientists for Biomedical Science Through the BD2K Init...
Training Quantitative Scientists for Biomedical Science Through the BD2K Init...Philip Bourne
 
Societal aspects of Big Data
Societal aspects of Big DataSocietal aspects of Big Data
Societal aspects of Big DataPhilip Bourne
 
Regional Student Group NBIC Career Presentation April 18, 2011
Regional Student Group NBIC Career Presentation April 18, 2011Regional Student Group NBIC Career Presentation April 18, 2011
Regional Student Group NBIC Career Presentation April 18, 2011Philip Bourne
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
 

Destaque (6)

Training Quantitative Scientists for Biomedical Science Through the BD2K Init...
Training Quantitative Scientists for Biomedical Science Through the BD2K Init...Training Quantitative Scientists for Biomedical Science Through the BD2K Init...
Training Quantitative Scientists for Biomedical Science Through the BD2K Init...
 
Societal aspects of Big Data
Societal aspects of Big DataSocietal aspects of Big Data
Societal aspects of Big Data
 
Regional Student Group NBIC Career Presentation April 18, 2011
Regional Student Group NBIC Career Presentation April 18, 2011Regional Student Group NBIC Career Presentation April 18, 2011
Regional Student Group NBIC Career Presentation April 18, 2011
 
The Commons
The CommonsThe Commons
The Commons
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 

Semelhante a Yale Day of Data

The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHPhilip Bourne
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH Philip Bourne
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGPhilip Bourne
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemGlobus
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterprisePhilip Bourne
 
The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?Philip Bourne
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinarSarah Jones
 
Opportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big DataOpportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big DataPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterprisePhilip Bourne
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery InformaticsPhilip Bourne
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingDenodo
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowJisc
 
NIH Big Data to Knowledge (BD2K)
NIH Big Data to Knowledge (BD2K)NIH Big Data to Knowledge (BD2K)
NIH Big Data to Knowledge (BD2K)Lance K. Manning
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHPhilip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 

Semelhante a Yale Day of Data (20)

The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAG
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital Enterprise
 
The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
Opportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big DataOpportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big Data
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital Enterprise
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery Informatics
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we now
 
NIH Big Data to Knowledge (BD2K)
NIH Big Data to Knowledge (BD2K)NIH Big Data to Knowledge (BD2K)
NIH Big Data to Knowledge (BD2K)
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIH
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 

Mais de Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in ResearchPhilip Bourne
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 

Mais de Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 

Último

URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 

Último (20)

URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 

Yale Day of Data

  • 1. NIH’s Day, Months, Four Years of Data Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health
  • 2. What is NIH’s Overall Approach to Data and What Does It Mean to You?
  • 3. Some Context: NIH Data Science History 6/12 2/14 3/14 • Findings: • Sharing data & software through catalogs • Support methods and applications development • Need more training • Need campus-wide IT strategy • Hire CSIO • Continued support throughout the lifecycle
  • 4. My Bias  Still a scientist  A funder who still thinks like a PI  Not yet attuned to the federal system  Big supporter of open science through prior work with the Public Library of Science, FORCE11 etc.
  • 5. Data – A Few Observations …  We talk about the promise of big data, but we don’t even know the value of little data (aka could “Big Data” be the new “AI”)  Good data is expensive in terms of time and money  Looking at data retroactively is really expensive  Good data begats trust; trust begats community; community is God  The way we support scientific data currently is not sustainable  That is, is no business model currently for scientific data
  • 6. Data – A Few NIH Observations … 1. We have little idea how much we spend on data – estimated over $1bn per year 2. We have even less idea how much we should be spending  Point 2 is part of a culture clash between the more observational history of biomedicine and the new analytical approach to discovery
  • 7. Data – An Academic Medical Center Observation  A digital enterprise exists when data connections are made across different areas of the organization such that productivity and competitiveness are improved  For example, between education and research  I am not aware that any such academic institutions exist?  Many are starting to wake up to the idea of getting there JAMIA 2014, 21(2), 194
  • 8. ADDS Mission Statement To foster an ecosystem that enables biomedical research to be conducted as a digital enterprise that enhances health, lengthens life and reduces illness and disability
  • 9. What Problems Are We Trying to Solve? One Possible Solution  Sustainability – 50% business model  Efficiency – sharing best practices in longitudinal clinical studies; “trusted investigator”  Collaboration - identification of collaborators at the point of data collection not publication  Reproducibility – data accessible with publication  Integration – phenotype homogenization  Accessibility – clinical trials registration  Quality – sharing CDEs across institutes  Training – keeping trainees in the ecosystem
  • 10. The Data Ecosystem Community Policy Infrastructure • Sustainable business model • Collaboration • Training
  • 11. The Data Ecosystem Community Policy Infrastructure • Sustainable business model • Collaboration • Training Virtuous Research Cycle
  • 12. The Virtuous Cycle http://goo.gl/fkWjhS
  • 13. Raw Materials to Seed the Ecosystem  NIH mandate & support  ADDS team of 8 people  Intramural participation of over 100 team members across ICs  Funding through BD2K: – ~$30M in FY14 – ~$80M in FY15 – ....
  • 14. Organization to Seed the Ecosystem…
  • 15. Associate Director for Data Science Scientific Data Council External Advisory Board Programmatic Theme Sustainability Education Innovation Process Deliverable Commons Training BD2K Efficiency Example Features • IC’s • Cloud – Data & Compute • Search • Security • Reproducibility Standards • App Store • Coordinate • Hands-on • Syllabus • MOOCs • Community • Centers • Training Grants • Catalogs • Standards • Analysis • Data Resource Support • Metrics • Best Practices • Evaluation • Portfolio Analysis Collaboration Partnerships • Researchers • Federal Agencies • International Partners • Computer Scientists The Biomedical Research Digital Enterprise
  • 16. Associate Director for Data Science Scientific Data Council External Advisory Board Programmatic Theme Sustainability Education Innovation Process Deliverable Commons Training BD2K Efficiency Example Features • IC’s • Cloud – Data & Compute • Search • Security • Reproducibility Standards • App Store • Coordinate • Hands-on • Syllabus • MOOCs • Community • Centers • Training Grants • Catalogs • Standards • Analysis • Data Resource Support • Metrics • Best Practices • Evaluation • Portfolio Analysis Collaboration Partnerships • Researchers • Federal Agencies • International Partners • Computer Scientists The Biomedical Research Digital Enterprise
  • 17. Example Communities – NIH • 27 ICs – Agencies • NSF • DOE • DARPA • NIST – Government • OSTP • HHS HDI • ONC • CDC • FDA – Private sector • Phrma • Google • Amazon – Organizations • PCORI, GA4GH • RDA, ELIXIR • CCC • CATS • FASEB, ISCB • Biophysical Society • Sloan Foundation • Moore Foundation
  • 18. Example Policies – Clinical data harmonization – DbGaP in the cloud – Data citation – Machine readable data sharing plans on all grants – New review models, audiences etc. • Open review • Micro funding • Standing data committees to explore best practices • Crowd sourcing
  • 19. Example Infrastructure: The Commons Data The Long Tail Core Facilities/HS Centers Clinical /Patient The Why: Data Sharing Plans The How: NIH Knowledge The Government Commons Data Discovery Index The End Game: Scientific Discovery Usability Quality Security/ Privacy Sustainable Storage Awardees Private Sector Metrics/ Standards Rest of Academia Software Standards Index BD2K Centers Cloud, Research Objects, Business Models
  • 20. What Does the Commons Enable?  Dropbox like storage  The opportunity to apply quality metrics  Bring compute to the data  A place to collaborate  A place to discover http://100plus.com/wp-content/uploads/Data-Commons-3- 1024x825.png
  • 21. One Possible Commons Business Model [Adapted from George Komatsoulis] HPC, Institution …
  • 22. Pilots Around A Virtuous Cycle Expect a FY15 Funding Call to Work in the Commons
  • 23. TTrraaiinniinngg && DDiivveerrssiittyy  Training & Diversity Goals: – Develop a sufficient cadre of diverse researchers skilled in the science of Big Data – Elevate general competencies in data usage and analysis across the biomedical research workforce – Combat the Google bus  How: – Traditional training grants – Work with IC’s on a needs assessment – Standards for course descriptions with EU – Work with institutions on raising awareness – Partner with minority institutions – Virtual/physical training center(s)?
  • 24. Closing Question Calls for increased NIH funding has so far gone unheeded, what can the ADDS do (that you have not heard about) to improve data science activities?
  • 25. NNIIHH…… philip.bourne@nih.gov TTuurrnniinngg DDiissccoovveerryy IInnttoo HHeeaalltthh

Notas do Editor

  1. Computing Community Consortium Committee on Applied and Theoretical Statistics Office of the National Coordinator