SlideShare uma empresa Scribd logo
1 de 53
Baixar para ler offline
ML and AI:
a blessing and curse for
statisticians and medical doctors
Maarten van Smeden
University Medical Center Utrecht
Julius Center for Health Sciences and Primary Care
The Netherlands
Twitter: @MvanSmeden
Email: M.vanSmeden@umcutrecht.nl
STRATOS member (TG6: diagnostic tests and prediction models)
9 March 2020
Freiburg, Germany, Institut für Medizinische Biometrie und Statistik
Biometrischen Kolloquium
Sides available at https://www.slideshare.net/MaartenvanSmeden
I have no conflicts of interest to declare
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2CwW43A
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Terminology
In medical research, “artificial intelligence” usually
just means “machine learning” or “algorithm”
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2v2aokk
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2TOdd0F
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Tech company business model
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Tech company business model
https://bit.ly/2HSp8X5; https://bit.ly/2Z0Pfop; https://bit.ly/2KIcpHG; https://bit.ly/33IJhr9
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Other success stories
https://go.nature.com/2VG2hS7; https://bbc.in/2Z1drXQ; https://bit.ly/2TAfRIP
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
IBM Watson winning Jeopardy! (2011)
https://bbc.in/2TMvV8I
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
IBM Watson for oncology
https://bit.ly/2LxiWGj
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenForsting, J Nuc Med, 2017, DOI: 10.2967/jnumed.117.190397
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Machine learning everywhere
https://bit.ly/2ka0HLq; https://go.nature.com/33TQgO6; https://bit.ly/2kp6X23; https://bit.ly/2lZuKWt; https://bit.ly/2lI298g
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Reviewer #2
what are these
machine learning methods?
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/38A1ng0
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
“Everything is an ML method”
https://bit.ly/2lEVn33
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
“ML methods come from computer science”
https://bit.ly/2zhbwPv; https://stanford.io/2TVp1xK; https://stanford.io/2ZfED0k
Leo Breiman Jerome H Friedman Trevor Hastie
CART, random forest Gradient boosting Elements of statistical learning
Education Physics/Math Physics Statistics
Job title Professor of Statistics Professor of Statistics Professor of Statistics
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
“ML methods for prediction, statistics for explaining”
Damen, BMJ, 2016, DOI:10.1136/bmj.i2416
363 developed models how many?
Decision trees 0
Random forests 0
Support vector machines 0
Nearest neighbor algorithms 0
Neural networks 1
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
“ML methods for prediction, statistics for explaining”
1See further: Kreiff and Diaz Ordaz; https://bit.ly/2m1eYdK
ML and causal inference, small selection1
• Superlearner (e.g. van der Laan)
• High dimensional propensity scores (e.g. Schneeweiss)
• The book of why (Pearl)
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Two cultures
Breiman, Stat Sci, 2001, DOI: 10.1214/ss/1009213726
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Statistics Machine learning
Covariates Features
Outcome variable Target
Model Network, graphs
Parameters Weights
Model for discrete var. Classifier
Model for continuous var. Regression
Log-likelihood Loss
Multinomial regression Softmax
Measurement error Noise
Subject/observation Sample/instance
Dummy coding One-hot encoding
Measurement invariance Concept drift
Statistics Machine learning
Prediction Supervised learning
Latent variable modeling Unsupervised learning
Fitting Learning
Prediction error Error
Sensitivity Recall
Positive predictive value Precision
Contingency table Confusion matrix
Measurement error model Noise-aware ML
Structural equation model Gaussian Bayesian network
Gold standard Ground truth
Derivation–validation Training–test
Experiment A/B test
Adapted from Daniel Obserski: https://bit.ly/2YN12Xf and Robert Tibshirani: https://stanford.io/2zqEGfr
Language
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenRobert Tibshirani: https://stanford.io/2zqEGfr
Machine learning: large grant = $1,000,000
Statistics: large grant = $50,000
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
ML refers to a culture, not to methods
Distinguishing between statistics and machine learning
• Substantial overlap methods used by both cultures
• Substantial overlap analysis goals
• Attempts to separate the two frequently result in disagreement
Pragmatic approach:
I’ll use “ML” to refer to models roughly outside of the traditional regression
types of analysis: decision trees (and descendants), SVMs, neural networks
(including Deep learning), boosting etc.
Examples where
“ML” has done well
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Example: retinal disease
Gulshan et al, JAMA, 2016, 10.1001/jama.2016.17216; Picture retinopathy: https://bit.ly/2kB3X2w
Diabetic retinopathy
Deep learning (= Neural network)
• 128,000 images
• Transfer learning (preinitialization)
• Sensitivity and specificity > .90
• Estimated from training data
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Example: lymph node metastases
Bejnordi et al, JAMA, 2018, doi: 10.1001/jama.2017.14585. See our letter to the editor for a critical discussion: https://bit.ly/2kcYS0e
Deep learning competition
But:
• 390 teams signed up, 23 submitted
• “Only” 270 images for training
• Test AUC range: 0.56 to 0.99
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Deep learning on images
Many similar studies and challenges in
• radiology
• pathology
• dermatology
• opthalmology
• gastroenterology
• cardiology
• ….
Topol, Nature Medicine, 2019, DOI: 10.1038/s41591-018-0300-7
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
The future?
Topol, Nature Medicine, 2019, DOI: 10.1038/s41591-018-0300-7
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Other sources of “medical” data
• Large scale gene expression data
• e.g. diagnosis of acute myeloid leukemia
https://bit.ly/2k8Ao8e
• Prognostication by text mining electronic health records
• e.g. predicting life expectancy
https://bit.ly/2k8Ao8e
• Analyzing social media posts
• e.g. pharmacovigilance, adverse events monitoring via Twitter posts
https://bit.ly/2m0KKrg
• Speech signal processing
• e.g. Parkinson‟s disease,
https://bit.ly/2v3ZdHR
Examples where
“ML” has done poorly
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Adversarial examples
https://bit.ly/2N4mQFo; https://bit.ly/2W7X9rF
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Recidivism Algorithm
Pro-publica (2016) https://bit.ly/1XMKh5R
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Skin cancer and rulers
Esteva et al., Nature, 2016, DOI: 10.1038/nature21056; https://bit.ly/2lE0vV0
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Predicting mortality – the conclusion
PlosOne, 2018, DOI: 10.1371/journal.pone.0202344
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Predicting mortality – the results
PlosOne, 2018, DOI: 10.1371/journal.pone.0202344
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Predicting mortality – the media
PlosOne, 2018, DOI: 10.1371/journal.pone.0202344; https://bit.ly/2Q6H41R; https://bit.ly/2m3RLrn
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
HYPE!
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Systematic review clinical prediction models
Christodoulou et al. Journal of Clinical Epidemiology, 2019, doi: 10.1016/j.jclinepi.2019.02.004
“ML” versus
traditional statistics and
medical doctors
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Comparison “ML” vs statistical models
• “ML” versus statistical models is a false dichotomy
• Advanced “ML” shows promise, especially in areas that are
not the traditional “tabular data” (e.g. images, sound)
• Tabular data settings where “ML” can be compared with
traditional regression model techniques show little added value
in medical applications
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Sources of prediction error
Y = # $ + &
For a model ' the expected test prediction error is:
σ)
+ bias) .#/ $ + var .#/ $
See equation 2.46 in Hastie et al., the elements of statistical learning, https://stanford.io/2voWjra
Irreducible error Mean squared prediction error
(with E & = 0, var & = 9)
, values in $ are not random)
What we don’t model How we model
≈≈
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Irreducible error is often large
• Health and lack thereof complex to measure (‘no gold standard’)
• Predictors of diseases are often imperfectly and partly
measured
• We often don’t know all the causal mechanisms at play
• much easier to predict if you know the causal mechanisms!
• Predicting the future even more difficult
Understanding prediction uncertainty is key
Courtesy Cecile Janssens: https://bit.ly/2Jf5ft6
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Classification versus risk prediction
Most “ML” classifiers don’t come naturally with risk prediction, i.e.
a probability estimate of predicted outcome for individuals
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Classification versus risk prediction
Most “ML” classifiers don’t come naturally with risk prediction, i.e.
a probability estimate of predicted outcome for individuals
• Models can be trained to be optimized for a certain predictive
performance (e.g. AUC, classification accuracy, calibration)
• Which performance to use to compare models are optimized
for different types of performance?
• Possibly much large sample size needed to obtain reliable
(calibrated) risk predictions1 than reliable classifications
Van Smeden et al., Stat Meth Med Res, 2019, doi: 10.1177/0962280218784726
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Flexible algorithms are data hungry
From slide deck Ben van Calster: https://bit.ly/38Aqmjs
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Where do we stand on “ML” vs doctors?
Radiology and pathology
• Article hits: 12,000
• After screening: 22
• Out-of-sample comparison “ML” vs doctors: 2
Faes et al., Lancet Digital Health, 2019, doi: 10.1016/S2589-7500(19)30123-2
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Some personal observations
• Doctors did not work under realistic time constraints and/or no
access to all regular diagnostic information
• The output generated by algorithms and physicians not
evaluated on the same scale
• Apparent (optimistic) model performance vs medical doctors
Van Smeden et al., JAMA, 2018, doi: 10.1001/jama.2018.1466
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Algorithms, the environment and costs
The costs of running (cloud computing) the Transformer algorithm
are estimated at 1 to 3 million Dollars
https://bit.ly/33Dj38X
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Final remarks
• Algorithms are high maintenance
• Developed models need repeated testing and updating to
remain useful over time and place
• Many new barriers: black box proprietary algorithms, computing
costs
• Regulation and quality control of algorithms
• New data quality issues
Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://twitter.com/DrHughHarvey/status/1230218991026819077
Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
Email: M.vanSmeden@umcutrecht.nl
Twitter: @MaartenvSmeden

Mais conteúdo relacionado

Mais procurados

Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyondMaarten van Smeden
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Maarten van Smeden
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxEwout Steyerberg
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IMaarten van Smeden
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Maarten van Smeden
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkStats Statswork
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicineMaarten van Smeden
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsMaarten van Smeden
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxMaarten van Smeden
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Maarten van Smeden
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIMaarten van Smeden
 
Whatever happened to design based inference
Whatever happened to design based inferenceWhatever happened to design based inference
Whatever happened to design based inferenceStephenSenn2
 
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareThree Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareHealth Catalyst
 
Machine learning in medicine: calm down
Machine learning in medicine: calm downMachine learning in medicine: calm down
Machine learning in medicine: calm downBenVanCalster
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Ewout Steyerberg
 

Mais procurados (20)

Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyond
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptx
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part I
 
Predictimands
PredictimandsPredictimands
Predictimands
 
Uncertainty in AI
Uncertainty in AIUncertainty in AI
Uncertainty in AI
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - Statswork
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicine
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutions
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradox
 
Clinical prediction models
Clinical prediction modelsClinical prediction models
Clinical prediction models
 
Algorithm based medicine
Algorithm based medicineAlgorithm based medicine
Algorithm based medicine
 
UMC Utrecht AI Methods Lab
UMC Utrecht AI Methods LabUMC Utrecht AI Methods Lab
UMC Utrecht AI Methods Lab
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part II
 
Whatever happened to design based inference
Whatever happened to design based inferenceWhatever happened to design based inference
Whatever happened to design based inference
 
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareThree Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
 
Machine learning in medicine: calm down
Machine learning in medicine: calm downMachine learning in medicine: calm down
Machine learning in medicine: calm down
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
 

Semelhante a ML and AI: a blessing and curse for statisticians and medical doctors

COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...Maarten van Smeden
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirusMaarten van Smeden
 
Statistics and ML Paris 20sept22
Statistics and ML Paris 20sept22Statistics and ML Paris 20sept22
Statistics and ML Paris 20sept22Ewout Steyerberg
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureMichel Dumontier
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge DiscoveryMichel Dumontier
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical researchMaarten van Smeden
 
Big data for healthcare analytics final -v0.3 miz
Big data for healthcare analytics   final -v0.3 mizBig data for healthcare analytics   final -v0.3 miz
Big data for healthcare analytics final -v0.3 mizYusuf Brima
 
Societal, policy, and regulatory implications of AI for healthcare and medicine
Societal, policy, and regulatory implications of AI for healthcare and medicineSocietal, policy, and regulatory implications of AI for healthcare and medicine
Societal, policy, and regulatory implications of AI for healthcare and medicineDan Elton
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Kees van Bochove
 
Supervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For CancerSupervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For Cancerpaperpublications3
 
Social Media Datasets for Analysis and Modeling Drug Usage
Social Media Datasets for Analysis and Modeling Drug UsageSocial Media Datasets for Analysis and Modeling Drug Usage
Social Media Datasets for Analysis and Modeling Drug Usageijtsrd
 
The future interface of mental health with information technology: high touch...
The future interface of mental health with information technology: high touch...The future interface of mental health with information technology: high touch...
The future interface of mental health with information technology: high touch...HealthXn
 
Journal for Clinical Studies: Close Cooperation Between Data Management and B...
Journal for Clinical Studies: Close Cooperation Between Data Management and B...Journal for Clinical Studies: Close Cooperation Between Data Management and B...
Journal for Clinical Studies: Close Cooperation Between Data Management and B...KCR
 
Deep learning for biomedical discovery and data mining I
Deep learning for biomedical discovery and data mining IDeep learning for biomedical discovery and data mining I
Deep learning for biomedical discovery and data mining IDeakin University
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in healthBig Data Colombia
 

Semelhante a ML and AI: a blessing and curse for statisticians and medical doctors (20)

COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...
 
The big 3
The big 3 The big 3
The big 3
 
The big 3
The big 3 The big 3
The big 3
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirus
 
O1
O1O1
O1
 
Big data
Big dataBig data
Big data
 
Statistics and ML Paris 20sept22
Statistics and ML Paris 20sept22Statistics and ML Paris 20sept22
Statistics and ML Paris 20sept22
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical research
 
Big data for healthcare analytics final -v0.3 miz
Big data for healthcare analytics   final -v0.3 mizBig data for healthcare analytics   final -v0.3 miz
Big data for healthcare analytics final -v0.3 miz
 
Societal, policy, and regulatory implications of AI for healthcare and medicine
Societal, policy, and regulatory implications of AI for healthcare and medicineSocietal, policy, and regulatory implications of AI for healthcare and medicine
Societal, policy, and regulatory implications of AI for healthcare and medicine
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
 
Supervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For CancerSupervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For Cancer
 
Social Media Datasets for Analysis and Modeling Drug Usage
Social Media Datasets for Analysis and Modeling Drug UsageSocial Media Datasets for Analysis and Modeling Drug Usage
Social Media Datasets for Analysis and Modeling Drug Usage
 
The future interface of mental health with information technology: high touch...
The future interface of mental health with information technology: high touch...The future interface of mental health with information technology: high touch...
The future interface of mental health with information technology: high touch...
 
Journal for Clinical Studies: Close Cooperation Between Data Management and B...
Journal for Clinical Studies: Close Cooperation Between Data Management and B...Journal for Clinical Studies: Close Cooperation Between Data Management and B...
Journal for Clinical Studies: Close Cooperation Between Data Management and B...
 
Deep learning for biomedical discovery and data mining I
Deep learning for biomedical discovery and data mining IDeep learning for biomedical discovery and data mining I
Deep learning for biomedical discovery and data mining I
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in health
 
Sharing is caring
Sharing is caringSharing is caring
Sharing is caring
 

Mais de Maarten van Smeden

Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Maarten van Smeden
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Maarten van Smeden
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Maarten van Smeden
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the futureMaarten van Smeden
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemMaarten van Smeden
 
Anatomy of a successful science thread
Anatomy of a successful science threadAnatomy of a successful science thread
Anatomy of a successful science threadMaarten van Smeden
 
Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Maarten van Smeden
 

Mais de Maarten van Smeden (9)

Associate professor lecture
Associate professor lectureAssociate professor lecture
Associate professor lecture
 
Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the future
 
Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problem
 
Anatomy of a successful science thread
Anatomy of a successful science threadAnatomy of a successful science thread
Anatomy of a successful science thread
 
Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?
 

Último

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 

Último (20)

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 

ML and AI: a blessing and curse for statisticians and medical doctors

  • 1. ML and AI: a blessing and curse for statisticians and medical doctors Maarten van Smeden University Medical Center Utrecht Julius Center for Health Sciences and Primary Care The Netherlands Twitter: @MvanSmeden Email: M.vanSmeden@umcutrecht.nl STRATOS member (TG6: diagnostic tests and prediction models) 9 March 2020 Freiburg, Germany, Institut für Medizinische Biometrie und Statistik Biometrischen Kolloquium Sides available at https://www.slideshare.net/MaartenvanSmeden I have no conflicts of interest to declare
  • 2. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2CwW43A
  • 3. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Terminology In medical research, “artificial intelligence” usually just means “machine learning” or “algorithm”
  • 4. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2v2aokk
  • 5. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/2TOdd0F
  • 6. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Tech company business model
  • 7. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Tech company business model https://bit.ly/2HSp8X5; https://bit.ly/2Z0Pfop; https://bit.ly/2KIcpHG; https://bit.ly/33IJhr9
  • 8. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Other success stories https://go.nature.com/2VG2hS7; https://bbc.in/2Z1drXQ; https://bit.ly/2TAfRIP
  • 9. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden IBM Watson winning Jeopardy! (2011) https://bbc.in/2TMvV8I
  • 10. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden IBM Watson for oncology https://bit.ly/2LxiWGj
  • 11. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenForsting, J Nuc Med, 2017, DOI: 10.2967/jnumed.117.190397
  • 12. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Machine learning everywhere https://bit.ly/2ka0HLq; https://go.nature.com/33TQgO6; https://bit.ly/2kp6X23; https://bit.ly/2lZuKWt; https://bit.ly/2lI298g
  • 13. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Reviewer #2
  • 14. what are these machine learning methods?
  • 15. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://bit.ly/38A1ng0
  • 16. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden “Everything is an ML method” https://bit.ly/2lEVn33
  • 17. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden “ML methods come from computer science” https://bit.ly/2zhbwPv; https://stanford.io/2TVp1xK; https://stanford.io/2ZfED0k Leo Breiman Jerome H Friedman Trevor Hastie CART, random forest Gradient boosting Elements of statistical learning Education Physics/Math Physics Statistics Job title Professor of Statistics Professor of Statistics Professor of Statistics
  • 18. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden “ML methods for prediction, statistics for explaining” Damen, BMJ, 2016, DOI:10.1136/bmj.i2416 363 developed models how many? Decision trees 0 Random forests 0 Support vector machines 0 Nearest neighbor algorithms 0 Neural networks 1
  • 19. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden “ML methods for prediction, statistics for explaining” 1See further: Kreiff and Diaz Ordaz; https://bit.ly/2m1eYdK ML and causal inference, small selection1 • Superlearner (e.g. van der Laan) • High dimensional propensity scores (e.g. Schneeweiss) • The book of why (Pearl)
  • 20. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Two cultures Breiman, Stat Sci, 2001, DOI: 10.1214/ss/1009213726
  • 21. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Statistics Machine learning Covariates Features Outcome variable Target Model Network, graphs Parameters Weights Model for discrete var. Classifier Model for continuous var. Regression Log-likelihood Loss Multinomial regression Softmax Measurement error Noise Subject/observation Sample/instance Dummy coding One-hot encoding Measurement invariance Concept drift Statistics Machine learning Prediction Supervised learning Latent variable modeling Unsupervised learning Fitting Learning Prediction error Error Sensitivity Recall Positive predictive value Precision Contingency table Confusion matrix Measurement error model Noise-aware ML Structural equation model Gaussian Bayesian network Gold standard Ground truth Derivation–validation Training–test Experiment A/B test Adapted from Daniel Obserski: https://bit.ly/2YN12Xf and Robert Tibshirani: https://stanford.io/2zqEGfr Language
  • 22. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenRobert Tibshirani: https://stanford.io/2zqEGfr Machine learning: large grant = $1,000,000 Statistics: large grant = $50,000
  • 23. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden ML refers to a culture, not to methods Distinguishing between statistics and machine learning • Substantial overlap methods used by both cultures • Substantial overlap analysis goals • Attempts to separate the two frequently result in disagreement Pragmatic approach: I’ll use “ML” to refer to models roughly outside of the traditional regression types of analysis: decision trees (and descendants), SVMs, neural networks (including Deep learning), boosting etc.
  • 25. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
  • 26. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Example: retinal disease Gulshan et al, JAMA, 2016, 10.1001/jama.2016.17216; Picture retinopathy: https://bit.ly/2kB3X2w Diabetic retinopathy Deep learning (= Neural network) • 128,000 images • Transfer learning (preinitialization) • Sensitivity and specificity > .90 • Estimated from training data
  • 27. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Example: lymph node metastases Bejnordi et al, JAMA, 2018, doi: 10.1001/jama.2017.14585. See our letter to the editor for a critical discussion: https://bit.ly/2kcYS0e Deep learning competition But: • 390 teams signed up, 23 submitted • “Only” 270 images for training • Test AUC range: 0.56 to 0.99
  • 28. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Deep learning on images Many similar studies and challenges in • radiology • pathology • dermatology • opthalmology • gastroenterology • cardiology • …. Topol, Nature Medicine, 2019, DOI: 10.1038/s41591-018-0300-7
  • 29. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden The future? Topol, Nature Medicine, 2019, DOI: 10.1038/s41591-018-0300-7
  • 30. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Other sources of “medical” data • Large scale gene expression data • e.g. diagnosis of acute myeloid leukemia https://bit.ly/2k8Ao8e • Prognostication by text mining electronic health records • e.g. predicting life expectancy https://bit.ly/2k8Ao8e • Analyzing social media posts • e.g. pharmacovigilance, adverse events monitoring via Twitter posts https://bit.ly/2m0KKrg • Speech signal processing • e.g. Parkinson‟s disease, https://bit.ly/2v3ZdHR
  • 32. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Adversarial examples https://bit.ly/2N4mQFo; https://bit.ly/2W7X9rF
  • 33. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Recidivism Algorithm Pro-publica (2016) https://bit.ly/1XMKh5R
  • 34. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Skin cancer and rulers Esteva et al., Nature, 2016, DOI: 10.1038/nature21056; https://bit.ly/2lE0vV0
  • 35. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Predicting mortality – the conclusion PlosOne, 2018, DOI: 10.1371/journal.pone.0202344
  • 36. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Predicting mortality – the results PlosOne, 2018, DOI: 10.1371/journal.pone.0202344
  • 37. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Predicting mortality – the media PlosOne, 2018, DOI: 10.1371/journal.pone.0202344; https://bit.ly/2Q6H41R; https://bit.ly/2m3RLrn
  • 38. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden HYPE!
  • 39. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Systematic review clinical prediction models Christodoulou et al. Journal of Clinical Epidemiology, 2019, doi: 10.1016/j.jclinepi.2019.02.004
  • 41. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Comparison “ML” vs statistical models • “ML” versus statistical models is a false dichotomy • Advanced “ML” shows promise, especially in areas that are not the traditional “tabular data” (e.g. images, sound) • Tabular data settings where “ML” can be compared with traditional regression model techniques show little added value in medical applications
  • 42. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Sources of prediction error Y = # $ + & For a model ' the expected test prediction error is: σ) + bias) .#/ $ + var .#/ $ See equation 2.46 in Hastie et al., the elements of statistical learning, https://stanford.io/2voWjra Irreducible error Mean squared prediction error (with E & = 0, var & = 9) , values in $ are not random) What we don’t model How we model ≈≈
  • 43. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Irreducible error is often large • Health and lack thereof complex to measure (‘no gold standard’) • Predictors of diseases are often imperfectly and partly measured • We often don’t know all the causal mechanisms at play • much easier to predict if you know the causal mechanisms! • Predicting the future even more difficult Understanding prediction uncertainty is key Courtesy Cecile Janssens: https://bit.ly/2Jf5ft6
  • 44. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Classification versus risk prediction Most “ML” classifiers don’t come naturally with risk prediction, i.e. a probability estimate of predicted outcome for individuals
  • 45. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Classification versus risk prediction Most “ML” classifiers don’t come naturally with risk prediction, i.e. a probability estimate of predicted outcome for individuals • Models can be trained to be optimized for a certain predictive performance (e.g. AUC, classification accuracy, calibration) • Which performance to use to compare models are optimized for different types of performance? • Possibly much large sample size needed to obtain reliable (calibrated) risk predictions1 than reliable classifications Van Smeden et al., Stat Meth Med Res, 2019, doi: 10.1177/0962280218784726
  • 46. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Flexible algorithms are data hungry From slide deck Ben van Calster: https://bit.ly/38Aqmjs
  • 47. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Where do we stand on “ML” vs doctors? Radiology and pathology • Article hits: 12,000 • After screening: 22 • Out-of-sample comparison “ML” vs doctors: 2 Faes et al., Lancet Digital Health, 2019, doi: 10.1016/S2589-7500(19)30123-2
  • 48. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Some personal observations • Doctors did not work under realistic time constraints and/or no access to all regular diagnostic information • The output generated by algorithms and physicians not evaluated on the same scale • Apparent (optimistic) model performance vs medical doctors Van Smeden et al., JAMA, 2018, doi: 10.1001/jama.2018.1466
  • 49. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden
  • 50. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Algorithms, the environment and costs The costs of running (cloud computing) the Transformer algorithm are estimated at 1 to 3 million Dollars https://bit.ly/33Dj38X
  • 51. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Final remarks • Algorithms are high maintenance • Developed models need repeated testing and updating to remain useful over time and place • Many new barriers: black box proprietary algorithms, computing costs • Regulation and quality control of algorithms • New data quality issues
  • 52. Freiburg, 9 March 2019 Twitter: @MaartenvSmedenhttps://twitter.com/DrHughHarvey/status/1230218991026819077
  • 53. Freiburg, 9 March 2019 Twitter: @MaartenvSmeden Email: M.vanSmeden@umcutrecht.nl Twitter: @MaartenvSmeden