Thoughts on
Machine Learning and Artificial Intelligence
Maarten van Smeden, PhD

Leiden University Medical Center, Netherlands

STRATOS Lorenz Meeting

21/09/2018
Interested reader perspective
• Statistician by training

• Limited experience applying machine learning techniques

• Three examples that I think are illustrative for ML/AI in medicine
as it is applied nowadays

• Focus: prediction
Tech company business model
Apple Watch 4
FDA Approval
https://www.statnews.com/2018/09/13/heres-the-data-behind-the-new-apple-watch-ekg-app/?mc_cid=0fbfd65c13&mc_eid=75f1d5aea2
Impressive artificial intelligence
IBM Watson wins against two Jeopardy! champions in 2011
Reviewer #2
Less impressive artificial intelligence
Warning!
Statistical policing going on
Yesterday’s news
http://www.timvanderzee.com/the-wansink-dossier-an-overview/

Example 1: ML predicting mortality
• Caliber dataset (UK, EHR)

• N = 80,000 patients with pre-existing coronary artery disease

• Predict all-cause mortality (18,000 events, time horizon unclear)

• “used Cox models, random forests and elastic net regression”

• 586 candidate predictors vs 27 pre-selected variables

• Complete case / multiple imputation / missing indicator method

• Cox models: linear main effects only

• Split sample (1/3 test, 2/3 training)
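The three missing-data strategies named above differ in what they hand to the model. As a minimal sketch (not the cited study's code; the function names and toy records are hypothetical), complete-case analysis drops every row with a missing value, while the missing indicator method fills in a placeholder and adds a 0/1 flag per predictor:

```python
# Toy records: None marks a missing predictor value (hypothetical data).
rows = [
    {"age": 63, "sbp": 140},
    {"age": 71, "sbp": None},
    {"age": None, "sbp": 128},
]

def complete_case(rows):
    """Keep only rows in which every predictor is observed."""
    return [r for r in rows if all(v is not None for v in r.values())]

def missing_indicator(rows, fill=0):
    """Replace missing values by `fill` and add a 0/1 missingness flag per predictor."""
    out = []
    for r in rows:
        new = {}
        for name, value in r.items():
            new[name] = fill if value is None else value
            new[name + "_missing"] = int(value is None)
        out.append(new)
    return out

cc = complete_case(rows)
mi = missing_indicator(rows)
print(len(cc), "of", len(rows), "rows survive complete-case analysis")
print(mi[1])
```

Multiple imputation is not shown here; it would replace each missing value by several plausible draws and pool the resulting fits.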
One take
Linear regression is an example of
Machine Learning?
If so, what isn’t Machine Learning?
Perhaps more reasonable?
Beam & Kohane, JAMA, 2018
Example 2: lymph node metastases
• Researcher challenge competition

• Whole slide images of women diagnosed with breast cancer

• Training data: N = 270 (110 events); test data: N = 129 (49 events)

• 11 pathologists evaluating the test data

• 390 teams signed up for the competition

• 23 teams submitted 32 algorithms for evaluation
Example 2: lymph node metastases
• Unfair comparison between pathologists and deep learning (DL)

• Pathologists had no access to regularly available diagnostics

• AUC comparison DL (continuous) vs pathologists (5-item
scale) 

• Promising algorithms overrepresented (390 teams -> 32
algorithms submitted)
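To illustrate why comparing an AUC from a continuous score with one from a 5-item scale is delicate, here is a small sketch. The scores, labels and cut points are hypothetical and are not data from the competition. It coarsens a continuous score into five ratings and recomputes the empirical AUC, counting ties as 1/2:

```python
def auc(scores, labels):
    """Empirical AUC: P(score of event > score of non-event), ties count 1/2."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    total = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                total += 1.0
            elif p == n:
                total += 0.5
    return total / (len(pos) * len(neg))

def to_five_point(score):
    """Coarsen a score in [0, 1] to a 1-5 rating (hypothetical cut points)."""
    return min(int(score * 5), 4) + 1

# Hypothetical scores: every event ranks strictly above every non-event.
labels = [0, 0, 0, 1, 1, 1]
continuous = [0.05, 0.30, 0.45, 0.55, 0.70, 0.95]
rated = [to_five_point(s) for s in continuous]

auc_continuous = auc(continuous, labels)
auc_rated = auc(rated, labels)
print(auc_continuous, auc_rated)
```

In this toy data the coarse scale puts one event and one non-event in the same category, so the AUC drops below 1 although the underlying ranking is unchanged. Other data can behave differently; the point is only that the two AUCs are not measured on the same footing.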
Example 2: lymph node metastases
• No attention to risk prediction / calibration

• ML: attention to classification only, without probabilities

• Huge (often implicit) difference between traditional (risk)
prediction modeling in medicine and traditional ML

• Probably fine for Netflix recommendations; not so much for
real life medical decision making
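The gap between classification performance and risk prediction can be made concrete with a toy sketch. The outcomes, the risks and the inflating transform are hypothetical. A strictly increasing transform of the predicted risks leaves the AUC untouched, yet the average predicted risk no longer matches the observed event rate:

```python
def auc(scores, labels):
    """Empirical AUC with ties counted as 1/2."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def mean(xs):
    return sum(xs) / len(xs)

# Hypothetical outcomes and predicted risks for ten patients.
outcomes = [0, 0, 0, 0, 0, 0, 1, 0, 1, 1]
risks = [0.05, 0.10, 0.10, 0.20, 0.20, 0.30, 0.40, 0.45, 0.50, 0.70]

# A strictly increasing transform keeps the ranking but inflates every risk.
inflated = [r ** 0.25 for r in risks]

print("AUC:", auc(risks, outcomes), auc(inflated, outcomes))
print("observed event rate:", mean(outcomes))
print("mean predicted risk:", mean(risks), mean(inflated))
```

Both sets of predictions discriminate equally well, but only the first is calibrated in the large. A patient told "your risk is 80%" would be badly misinformed by the second.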
Misuse of “risk”
Example 3: 5 types of diabetes
• Patients with newly diagnosed diabetes (N = 8980) 

• 6 continuous variables 

• K-means clustering (‘unsupervised learning’)
BS detection simulation
• Data generated from 2 independent MVN distributions with equal pairwise correlations of 0.3

• “Sunday morning simulations”, code: https://github.com/MvanSmeden/DiabetesClusters
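The idea behind such a check can be sketched in a few lines of plain Python. This is not the code from the linked repository. As simplifying assumptions, it draws from a single equicorrelated normal distribution (no cluster structure at all) and uses a bare-bones Lloyd's algorithm; the sample size, seed and k = 5 are illustrative choices:

```python
import math
import random

def equicorrelated_normal(n, dim, rho, rng):
    """Draw n points with standard normal margins and pairwise correlation rho."""
    points = []
    for _ in range(n):
        shared = rng.gauss(0, 1)  # common factor induces the correlation
        points.append([math.sqrt(rho) * shared + math.sqrt(1 - rho) * rng.gauss(0, 1)
                       for _ in range(dim)])
    return points

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, rng, iterations=50):
    """Plain Lloyd's algorithm; returns one cluster label per point."""
    centers = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster empties out
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

rng = random.Random(2018)
data = equicorrelated_normal(n=500, dim=6, rho=0.3, rng=rng)
labels = kmeans(data, k=5, rng=rng)
sizes = [labels.count(j) for j in range(5)]
print("cluster sizes:", sizes)
```

K-means assigns every point to one of the k groups whether or not any grouping exists in the data, so obtaining five "clusters" is not by itself evidence of five subtypes.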
K-means clustering
“K-means finds a Voronoi partition, only if that partition coincides with a
"clustering" does it have a hope of actually doing clustering”

Max Little: https://twitter.com/MaxALittle/status/970277900871262213
Freak examples?
Probably?
Maybe?
What I observe is:
• Confusion and disagreement about what is and isn’t ML/AI 

• Analyses labeled “ML/AI” have a tendency to concentrate on
classification (exceptions exist, e.g. suggested high-dimensional PS
approaches that are called “ML”) 

• Analyses labeled “ML/AI” in medicine are surprisingly often
done by people not thoroughly trained in statistics

• Basic statistical principles are often forgotten or ignored (e.g.
improper scoring rules)
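A short worked example of what "improper" means here, using a hypothetical true event probability of 0.7. The Brier score (a proper rule) is best in expectation when the forecast equals the true probability, whereas accuracy at a 0.5 threshold cannot tell a cautious forecast from a wildly overconfident one:

```python
def expected_brier(p, q):
    """Expected squared error of forecast q when the event has true probability p."""
    return p * (1 - q) ** 2 + (1 - p) * q ** 2

def expected_accuracy(p, q, threshold=0.5):
    """Expected accuracy when forecast q is turned into a yes/no call at `threshold`."""
    return p if q > threshold else 1 - p

p_true = 0.7  # hypothetical true event probability
forecasts = [0.51, 0.70, 0.99]
for q in forecasts:
    print(q, round(expected_brier(p_true, q), 4), expected_accuracy(p_true, q))

# Search a grid of forecasts for the one with the lowest expected Brier score.
grid = [i / 100 for i in range(101)]
best_q = min(grid, key=lambda q: expected_brier(p_true, q))
print("forecast minimising expected Brier score:", best_q)
```

All three forecasts earn the same expected accuracy, so a model tuned on accuracy has no incentive to report honest probabilities.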
Concluding remarks (1)
• Just because an algorithm is novel or flexible doesn’t mean it is
any good, obviously

• Dismissing the potential value of novel “ML/AI” algorithms
out of hand doesn’t make sense

• We need more realistic simulations and many applications to
compare the traditional vs more novel / flexible algorithms

• The primary issue in medical applications seems to be with the
modelers not so much with the models
Concluding remarks (2)
• Statisticians should be more involved in the application and
evaluation of novel / flexible algorithms, especially for risk
prediction

• Statisticians should be involved in studying performance of
novel / flexible algorithms (e.g. data hungriness) -> realistic
simulation studies

• Collaboration with computer scientists

• Computationally intensive -> may not be cheap

• Serious experimental design and reporting
Simulation is…
“…it is using simulation for multiplication that I find objectionable. Eight patients are
eight patients and so should remain.”
“All the impressive achievements of
deep learning amount to just curve
fitting”
Judea Pearl