SlideShare uma empresa Scribd logo
1 de 44
July 2, 2020
Open Science for research questions,
data, and analyses?
Ewout W. Steyerberg, PhD
Professor of Clinical Biostatistics and
Medical Decision Making
Thanks to many for assistance and inspiration,
including the GAP3 consortium, CENTER-TBI Study
Open Science: what is it at LMU?
2-Jul-202 Insert > Header & footer
Open Science: what is it in the Netherlands?
2-Jul-203 Insert > Header & footer
https://www.openscience.nl/
https://www.coalition-s.org/
Open vs closed science
Long ago
- Performed by few, elitarian scientists
- Doing private experiments
- Discussion in small, closed communities
Recent
- Science as a profession
- Protect data + code as intellectual property
- Aim for shocking findings in high IF journals
https://www.sciencemag.org/news/2020/06/whos-blame-these-three-scientists-are-heart-surgisphere-covid-19-scandal
Overall claim
“Open Science will make research better”
Vote pro / con
Aims today:
- Highlight some strong points in Open Science
- Hint at some challenges in Open Science
Reflections based on personal 30-yr research experience,
specific focus on prediction / decision making
2-Jul-205 Insert > Header & footer
Open Science to better address
Big research questions
Open science research questions: case 1
Example 1: Red cards and dark skin soccer players
https://psyarxiv.com/qkwst/
2-Jul-207 Insert > Header & footer
Open science research questions: case 1
• 29 teams involving 61 analysts; same dataset; same research question:
whether soccer referees are more likely to give red cards to dark skin
toned players than light skin toned players
• Estimated odds ratios 0.89 –2.93 (median 1.3)
• 20 teams: statistically significant positive effect, 9: non-significant relation
2-Jul-208 Insert > Header & footer
Estimated odds ratios by 29 research teams
2-Jul-209 Insert > Header & footer
“Logistic regression”
2-Jul-2010 Insert > Header & footer
Open science research questions: case 1
• 29 teams involving 61 analysts; same dataset; same research question:
whether soccer referees are more likely to give red cards to dark skin toned
players than light skin toned players
• Estimated odds ratios 0.89 –2.93 (median 1.3).
• 20 teams: statistically significant positive effect, 9: non-significant relation.
• 21 unique combinations of covariates
• “Variation in analysis of complex data may be difficult to
avoid, even by experts with honest intentions”
2-Jul-2011 Insert > Header & footer
Open science research questions: case 2
2-Jul-2012 Insert > Header & footer
Machine learning vs conventional modeling
1. Findings convincing?
2. Systematic / ”it depends” ?
2-Jul-2013 Insert > Header & footer
Findings not convincing
Cox, #4, 30 vars, max c =0.793
RF, #7, 600 vars, c=0.797
Elastic, #9, 600 vars, c=0.801
2-Jul-2014 Insert > Header & footer
Machine learning vs conventional modeling
1. Findings convincing?
“We found that random forests did not outperform Cox models despite their
inherent ability to accommodate nonlinearities and interactions. …
Elastic nets achieved the highest discrimination performance …, demonstrating
the ability of regularisation to select relevant variables and optimise model
coefficients in an EHR context.”
2-Jul-2015 Insert > Header & footer
Machine learning vs conventional modeling
1. Findings convincing? Not in case-study
2. Systematic / ”it depends” ?
2-Jul-2016 Insert > Header & footer
2-Jul-2017 Insert > Header & footer
2-Jul-2018 Insert > Header & footer
Open science research questions: case 2
• 243 real datasets from “the OpenML database”
• RF performed better than LR:
mean difference between RF and LR was 0.041 (95%-CI =[0.031,0.053]) for
the Area Under the ROC Curve
• Results were dependent on the inclusion criteria used to select the example
datasets
• ES: Results rely on 10 x 10-fold cross-validation
2-Jul-2019 Insert > Header & footer
Open science research questions: case 2
• More clarification needed when ML / RF works best; at least large N needed
2-Jul-2020 Insert > Header & footer
Systematic review on ML vs classic modeling
2-Jul-2021 Insert > Header & footer
Differences in discrimination
Summary on examples of Open Science
to better address Big research questions
• 1 data set
• multiple modelers
• Multiple modeling options
• 1 neutral comparison; 243 OpenML databases
• Review of 282 comparative studies: meta-research
2-Jul-2023 Insert > Header & footer
Open Science: data sharing
2-Jul-2025 Insert > Header & footer
Heterogeneity in data .. ignored
2-Jul-2026 Insert > Header & footer
Data sharing
• Pro:
• Allowed for larger sample size in a rare disease
• Cons:
• Heterogeneity?
• Substantial politics / efforts
2-Jul-2027 Insert > Header & footer
Open Science: analyses and interpretation
OHDSI: bridging data sharing - analyses
Analyses: ODHSI model
2-Jul-2030 Insert > Header & footer
OHDSI: COVID and other research topics
2-Jul-2031 Insert > Header & footer
The power of OHDSI
2-Jul-2032 Insert > Header & footer
OMOP common data model enables sharing of
model development code
2-Jul-2033 Insert > Header & footer
Performance for different outcomes in multiple cohorts
2-Jul-2034 Insert > Header & footer
OHDSI: bridging data sharing - analyses
• Keep data local
• Run locally started, centrally available analyses
• Share results centrally
Open Science: analyses and interpretation
Open Science challenge:
dealing with heterogeneity
Heterogeneity
• Study design
• Selection of subjects
• Measurement of covariates
• Measurement of outcomes
• Associations of covariates with outcome
• Overall outcome rates
• Performance of prediction models
Analyses: dealing with heterogeneity
2-Jul-2038 Insert > Header & footer
15 cohorts: 11 RCTs, 4 Observational studies
2-Jul-2039 Insert > Header & footer
Heterogeneous case-mix
2-Jul-2040 Insert > Header & footer
Heterogeneous predictor effects
2-Jul-2041 Insert > Header & footer
Heterogeneous predictions
2-Jul-2042 Insert > Header & footer
Heterogeneity in individual predictions
2-Jul-2043 Insert > Header & footer
“Open Science will make research better”
1. Research questions in competitions
• Red cards
• Neutral comparisons / meta-analysis
2. Data sharing
• old-fashioned?
3. Analyses
• OHDSI: modern
• Heterogeneity
Open science research extends discussions from meta-analysis;
contrast Cochrane reviews vs Big Data
2-Jul-2044 Insert > Header & footer

Mais conteúdo relacionado

Mais procurados

IRJET - An Effective Stroke Prediction System using Predictive Models
IRJET -  	  An Effective Stroke Prediction System using Predictive ModelsIRJET -  	  An Effective Stroke Prediction System using Predictive Models
IRJET - An Effective Stroke Prediction System using Predictive ModelsIRJET Journal
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_sciencetvk66866
 
Multi-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gainMulti-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gainPaul Agapow
 
Data science in health care
Data science in health careData science in health care
Data science in health careChetan Khanzode
 
Big Data Analytics for Healthcare
Big Data Analytics for HealthcareBig Data Analytics for Healthcare
Big Data Analytics for HealthcareChandan Reddy
 
Unifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and EnvironmentsUnifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and EnvironmentsAnne Thessen
 
Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015MedicReS
 
ARX - A Generic Method for Assessing the Quality of De-Identified Health Data
ARX - A Generic Method for Assessing the Quality of De-Identified Health DataARX - A Generic Method for Assessing the Quality of De-Identified Health Data
ARX - A Generic Method for Assessing the Quality of De-Identified Health Dataarx-deidentifier
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...MIS Quarterly
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingBram Zandbelt
 
Deep learning for episodic interventional data
Deep learning for episodic interventional dataDeep learning for episodic interventional data
Deep learning for episodic interventional dataDeakin University
 
Development and Problems in the Field of Medical Information Reporting
Development and Problems in the Field of Medical Information ReportingDevelopment and Problems in the Field of Medical Information Reporting
Development and Problems in the Field of Medical Information ReportingYogeshIJTSRD
 
Open science and the individual researcher
Open science and the individual researcherOpen science and the individual researcher
Open science and the individual researcherBram Zandbelt
 
Journal club summary: Open Science save lives
Journal club summary: Open Science save livesJournal club summary: Open Science save lives
Journal club summary: Open Science save livesDorothy Bishop
 

Mais procurados (20)

Project and Thesis
Project and ThesisProject and Thesis
Project and Thesis
 
IRJET - An Effective Stroke Prediction System using Predictive Models
IRJET -  	  An Effective Stroke Prediction System using Predictive ModelsIRJET -  	  An Effective Stroke Prediction System using Predictive Models
IRJET - An Effective Stroke Prediction System using Predictive Models
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_science
 
Multi-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gainMulti-omics for drug discovery: what we lose, what we gain
Multi-omics for drug discovery: what we lose, what we gain
 
Clinical data analytics
Clinical data analyticsClinical data analytics
Clinical data analytics
 
Data science in health care
Data science in health careData science in health care
Data science in health care
 
Big Data Analytics for Healthcare
Big Data Analytics for HealthcareBig Data Analytics for Healthcare
Big Data Analytics for Healthcare
 
Unifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and EnvironmentsUnifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and Environments
 
Stroke Prediction
Stroke PredictionStroke Prediction
Stroke Prediction
 
Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015
 
Casey, "Measuring Science Impact Among Citations (case studies)"
Casey, "Measuring Science Impact Among Citations (case studies)"Casey, "Measuring Science Impact Among Citations (case studies)"
Casey, "Measuring Science Impact Among Citations (case studies)"
 
ARX - A Generic Method for Assessing the Quality of De-Identified Health Data
ARX - A Generic Method for Assessing the Quality of De-Identified Health DataARX - A Generic Method for Assessing the Quality of De-Identified Health Data
ARX - A Generic Method for Assessing the Quality of De-Identified Health Data
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific Computing
 
Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...
 
Deep learning for episodic interventional data
Deep learning for episodic interventional dataDeep learning for episodic interventional data
Deep learning for episodic interventional data
 
Development and Problems in the Field of Medical Information Reporting
Development and Problems in the Field of Medical Information ReportingDevelopment and Problems in the Field of Medical Information Reporting
Development and Problems in the Field of Medical Information Reporting
 
Open science and the individual researcher
Open science and the individual researcherOpen science and the individual researcher
Open science and the individual researcher
 
Journal club summary: Open Science save lives
Journal club summary: Open Science save livesJournal club summary: Open Science save lives
Journal club summary: Open Science save lives
 
Effectiveness of New, Informationist-led Curriculum Changes at the College of...
Effectiveness of New, Informationist-led Curriculum Changes at the College of...Effectiveness of New, Informationist-led Curriculum Changes at the College of...
Effectiveness of New, Informationist-led Curriculum Changes at the College of...
 

Semelhante a Open science LMU session contribution E Steyerberg 2jul20

Open Science Better Science? Steyerberg 2June2022.pptx
Open Science Better Science? Steyerberg 2June2022.pptxOpen Science Better Science? Steyerberg 2June2022.pptx
Open Science Better Science? Steyerberg 2June2022.pptxEwout Steyerberg
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...William Gunn
 
Lecture5.pdf
Lecture5.pdfLecture5.pdf
Lecture5.pdfTake1As
 
Scott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingScott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingGigaScience, BGI Hong Kong
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxEwout Steyerberg
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and TrainingNUI Galway
 
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...jybufgofasfbkpoovh
 
Collaborative research network and scientific productivity
Collaborative research network and scientific productivityCollaborative research network and scientific productivity
Collaborative research network and scientific productivityHanbat National Univerisity
 
Introduction to research
Introduction to researchIntroduction to research
Introduction to researchHKRabby2
 
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...Textkernel
 
The Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesThe Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesWilliam Gunn
 
Transparency in Data Analysis
Transparency in Data AnalysisTransparency in Data Analysis
Transparency in Data AnalysisChristian Bokhove
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptxshalini s
 

Semelhante a Open science LMU session contribution E Steyerberg 2jul20 (20)

Open Science Better Science? Steyerberg 2June2022.pptx
Open Science Better Science? Steyerberg 2June2022.pptxOpen Science Better Science? Steyerberg 2June2022.pptx
Open Science Better Science? Steyerberg 2June2022.pptx
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
 
Lecture5.pdf
Lecture5.pdfLecture5.pdf
Lecture5.pdf
 
Scott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingScott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data Publishing
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptx
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training
 
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...
 
Collaborative research network and scientific productivity
Collaborative research network and scientific productivityCollaborative research network and scientific productivity
Collaborative research network and scientific productivity
 
Introduction to research
Introduction to researchIntroduction to research
Introduction to research
 
209-OECD ISSA project
209-OECD ISSA project209-OECD ISSA project
209-OECD ISSA project
 
eResearch New Zealand Keynote
eResearch New Zealand KeynoteeResearch New Zealand Keynote
eResearch New Zealand Keynote
 
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...
Intuition's Fall from Grace - Algorithms and Data in (Pre)-Selection by Colin...
 
Open Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality DataOpen Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality Data
 
Democratizing Data Science by Bill Howe
Democratizing Data Science by Bill HoweDemocratizing Data Science by Bill Howe
Democratizing Data Science by Bill Howe
 
The Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesThe Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United States
 
Transparency in Data Analysis
Transparency in Data AnalysisTransparency in Data Analysis
Transparency in Data Analysis
 
Shifting the goal post – from high impact journals to high impact data
 Shifting the goal post – from high impact journals to high impact data Shifting the goal post – from high impact journals to high impact data
Shifting the goal post – from high impact journals to high impact data
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
 
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptx
 

Último

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 

Último (20)

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 

Open science LMU session contribution E Steyerberg 2jul20

  • 1. July 2, 2020 Open Science for research questions, data, and analyses? Ewout W. Steyerberg, PhD Professor of Clinical Biostatistics and Medical Decision Making Thanks to many for assistance and inspiration, including the GAP3 consortium, CENTER-TBI Study
  • 2. Open Science: what is it at LMU? 2-Jul-202 Insert > Header & footer
  • 3. Open Science: what is it in the Netherlands? 2-Jul-203 Insert > Header & footer https://www.openscience.nl/ https://www.coalition-s.org/
  • 4. Open vs closed science Long ago - Performed by few, elitarian scientists - Doing private experiments - Discussion in small, closed communities Recent - Science as a profession - Protect data + code as intellectual property - Aim for shocking findings in high IF journals https://www.sciencemag.org/news/2020/06/whos-blame-these-three-scientists-are-heart-surgisphere-covid-19-scandal
  • 5. Overall claim “Open Science will make research better” Vote pro / con Aims today: - Highlight some strong points in Open Science - Hint at some challenges in Open Science Reflections based on personal 30-yr research experience, specific focus on prediction / decision making 2-Jul-205 Insert > Header & footer
  • 6. Open Science to better address Big research questions
  • 7. Open science research questions: case 1 Example 1: Red cards and dark skin soccer players https://psyarxiv.com/qkwst/ 2-Jul-207 Insert > Header & footer
  • 8. Open science research questions: case 1 • 29 teams involving 61 analysts; same dataset; same research question: whether soccer referees are more likely to give red cards to dark skin toned players than light skin toned players • Estimated odds ratios 0.89 –2.93 (median 1.3) • 20 teams: statistically significant positive effect, 9: non-significant relation 2-Jul-208 Insert > Header & footer
  • 9. Estimated odds ratios by 29 research teams 2-Jul-209 Insert > Header & footer
  • 11. Open science research questions: case 1 • 29 teams involving 61 analysts; same dataset; same research question: whether soccer referees are more likely to give red cards to dark skin toned players than light skin toned players • Estimated odds ratios 0.89 –2.93 (median 1.3). • 20 teams: statistically significant positive effect, 9: non-significant relation. • 21 unique combinations of covariates • “Variation in analysis of complex data may be difficult to avoid, even by experts with honest intentions” 2-Jul-2011 Insert > Header & footer
  • 12. Open science research questions: case 2 2-Jul-2012 Insert > Header & footer
  • 13. Machine learning vs conventional modeling 1. Findings convincing? 2. Systematic / ”it depends” ? 2-Jul-2013 Insert > Header & footer
  • 14. Findings not convincing Cox, #4, 30 vars, max c =0.793 RF, #7, 600 vars, c=0.797 Elastic, #9, 600 vars, c=0.801 2-Jul-2014 Insert > Header & footer
  • 15. Machine learning vs conventional modeling 1. Findings convincing? “We found that random forests did not outperform Cox models despite their inherent ability to accommodate nonlinearities and interactions. … Elastic nets achieved the highest discrimination performance …, demonstrating the ability of regularisation to select relevant variables and optimise model coefficients in an EHR context.” 2-Jul-2015 Insert > Header & footer
  • 16. Machine learning vs conventional modeling 1. Findings convincing? Not in case-study 2. Systematic / ”it depends” ? 2-Jul-2016 Insert > Header & footer
  • 17. 2-Jul-2017 Insert > Header & footer
  • 18. 2-Jul-2018 Insert > Header & footer
  • 19. Open science research questions: case 2 • 243 real datasets from “the OpenML database” • RF performed better than LR: mean difference between RF and LR was 0.041 (95%-CI =[0.031,0.053]) for the Area Under the ROC Curve • Results were dependent on the inclusion criteria used to select the example datasets • ES: Results rely on 10 x 10-fold cross-validation 2-Jul-2019 Insert > Header & footer
  • 20. Open science research questions: case 2 • More clarification needed when ML / RF works best; at least large N needed 2-Jul-2020 Insert > Header & footer
  • 21. Systematic review on ML vs classic modeling 2-Jul-2021 Insert > Header & footer
  • 23. Summary on examples of Open Science to better address Big research questions • 1 data set • multiple modelers • Multiple modeling options • 1 neutral comparison; 243 OpenML databases • Review of 282 comparative studies: meta-research 2-Jul-2023 Insert > Header & footer
  • 25. 2-Jul-2025 Insert > Header & footer
  • 26. Heterogeneity in data .. ignored 2-Jul-2026 Insert > Header & footer
  • 27. Data sharing • Pro: • Allowed for larger sample size in a rare disease • Cons: • Heterogeneity? • Substantial politics / efforts 2-Jul-2027 Insert > Header & footer
  • 28. Open Science: analyses and interpretation
  • 29. OHDSI: bridging data sharing - analyses
  • 30. Analyses: ODHSI model 2-Jul-2030 Insert > Header & footer
  • 31. OHDSI: COVID and other research topics 2-Jul-2031 Insert > Header & footer
  • 32. The power of OHDSI 2-Jul-2032 Insert > Header & footer
  • 33. OMOP common data model enables sharing of model development code 2-Jul-2033 Insert > Header & footer
  • 34. Performance for different outcomes in multiple cohorts 2-Jul-2034 Insert > Header & footer
  • 35. OHDSI: bridging data sharing - analyses • Keep data local • Run locally started, centrally available analyses • Share results centrally
  • 36. Open Science: analyses and interpretation
  • 37. Open Science challenge: dealing with heterogeneity Heterogeneity • Study design • Selection of subjects • Measurement of covariates • Measurement of outcomes • Associations of covariates with outcome • Overall outcome rates • Performance of prediction models
  • 38. Analyses: dealing with heterogeneity 2-Jul-2038 Insert > Header & footer
  • 39. 15 cohorts: 11 RCTs, 4 Observational studies 2-Jul-2039 Insert > Header & footer
  • 41. Heterogeneous predictor effects 2-Jul-2041 Insert > Header & footer
  • 43. Heterogeneity in individual predictions 2-Jul-2043 Insert > Header & footer
  • 44. “Open Science will make research better” 1. Research questions in competitions • Red cards • Neutral comparisons / meta-analysis 2. Data sharing • old-fashioned? 3. Analyses • OHDSI: modern • Heterogeneity Open science research extends discussions from meta-analysis; contrast Cochrane reviews vs Big Data 2-Jul-2044 Insert > Header & footer