Text Mining Radiology Reports for Deep Learning Radiology Images
Yifan Peng (1), Xiaosong Wang (2), Le Lu (2), Mohammadhadi Bagheri (2), Ronald Summers (2), Zhiyong Lu (1)
(1) NCBI/NLM/NIH
(2) CC/NIH
Twitter: #AMIA2017
S23: Methods for Identification, Classification, and Association using EHR Data
All Start with Data
• The availability of well-labeled data is key to large-scale machine learning, e.g., deep learning
• Hospitals have accumulated large numbers of raw radiology images and reports
• Conventional ways of collecting image labels are NOT applicable:
  • security and privacy issues
  • labeling requires comprehension of domain-specific medical knowledge
[Figure: large-scale natural image datasets vs. a large-scale medical image dataset]
2AMIA 2017 | amia.org
Overview
Mining image labels via NLP for multi-label pathology classification
A Sample Entry
Image: [chest X-ray]
Report:
findings: pa and lateral views of the chest demonstrate significantly improved bilateral lower lung field interstitial markings compatible with linear atelectasis. unchanged right 9th rib fracture peripherally. unchanged ossification left coracoacromial ligament. the cardiac and mediastinal contours are stable.
impression: improved bilateral lower lung field linear atelectasis.
Label: Atelectasis
8 Common Thorax Diseases
Challenges
Radiology reports mention negative and equivocal findings as well as positive ones,
so a finding that is mentioned in the report may in fact be absent
Findings: right internal jugular catheter remains in place. Large metastatic lung mass in
the lateral left upper lobe is again noted. No infiltrate or effusion. Extensive surgical
clips again noted left axilla.
Impression: no significant change.
Reason for exam (entered by ordering clinician into cris): bilateral pneumonia no
change in the tracheostomy tube or right internal jugular venous catheter. Unchanged
bilateral alveolar infiltrates, fluid in the right minor fissure, lucency at the right
costophrenic angle suggesting pneumothorax. Overall, no significant change
Related Work
Chapman W, et al. A simple algorithm for identifying negated findings and diseases in
discharge summaries. Journal of Biomedical Informatics. 2001;34:301-310.
Harkema H, et al. ConText: an algorithm for determining negation, experiencer, and
temporal status from clinical reports. Journal of Biomedical Informatics. 2009;42:839-851.
Mutalik P, et al. Use of general-purpose negation detection to augment concept indexing
of medical documents: a quantitative study using the UMLS. Journal of the American
Medical Informatics Association. 2001;8:598-609.
Sohn S, Wu S, Chute C. Dependency parser-based negation detection in clinical
narratives. AMIA Summits on Translational Science Proceedings. 2012;2012:1-8.
Mehrabi S, et al. DEEPEN: A negation detection system for clinical text incorporating
dependency relation into NegEx. Journal of Biomedical Informatics. 2015;54:213-219.
Related Work
Ogren P, et al. Constructing evaluation corpora for automated clinical named entity
recognition. In Proceedings of the Sixth International Conference on Language
Resources and Evaluation (LREC'08). 2008;28-30.
Uzuner Ö, South B, et al. 2010 i2b2/VA challenge on concepts, assertions, and relations in
clinical text. Journal of the American Medical Informatics Association. 2011;18:552-556.
Suominen H, et al. Overview of the ShARe/CLEF eHealth evaluation lab 2013. In
International Conference of the Cross-Language Evaluation Forum for European
Languages. 2013;212-231.
Albright D, et al. Towards comprehensive syntactic and semantic annotations of the
clinical narrative. Journal of the American Medical Informatics Association. 2013;20:922-930.
Our overall method
1. MetaMap and DNorm were used to map every mention of keywords in a report
to a unique concept ID in the Systematized Nomenclature of Medicine Clinical
Terms (SNOMED-CT)
• MetaMap (Aronson et al. 2010)
• DNorm (Leaman, Lu, 2014)
2. Remove negative and equivocal findings within the radiology report
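The two steps above can be sketched in a few lines. This is only an illustration, not the MetaMap or DNorm API: the `Mention` record, its `status` field, and the `report_labels` helper are hypothetical, and the concept names are plain placeholder strings rather than real SNOMED-CT concept IDs. The mentions are hand-annotated from the sample finding "Large metastatic lung mass ... No infiltrate or effusion."

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    text: str     # surface string found in the report
    concept: str  # normalized label (placeholder, NOT a real SNOMED-CT ID)
    status: str   # "positive", "negative", or "uncertain"


def report_labels(mentions):
    """Step 2: keep only concepts that have at least one positive mention."""
    return sorted({m.concept for m in mentions if m.status == "positive"})


# Step 1 output, written by hand here instead of calling a concept mapper.
mentions = [
    Mention("lung mass", "Mass", "positive"),
    Mention("infiltrate", "Infiltration", "negative"),
    Mention("effusion", "Effusion", "negative"),
]

labels = report_labels(mentions)
print(labels)  # ['Mass']
```

Negative and uncertain mentions are dropped rather than stored as explicit negative labels, which mirrors the wording of step 2.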
• Utilize the universal dependency graph to define patterns
• a directed graph
• vertices are labeled with information such as the word, part-of-speech and
the word lemma
• edges represent typed dependencies from the governor to its dependent
and are labeled with dependency type
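A minimal sketch of such a graph, assuming a hand-written parse of the phrase "No infiltrate or effusion" from the sample report. The `Vertex` and `DependencyGraph` classes are hypothetical names for illustration, and the edge labels (`neg`, `cc`, `conj`) are an assumed parse; a real parser may label the determiner "No" differently depending on the dependency scheme version.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    index: int
    word: str
    lemma: str
    pos: str


@dataclass
class DependencyGraph:
    vertices: dict = field(default_factory=dict)  # index -> Vertex
    edges: list = field(default_factory=list)     # (governor, dependent, type)

    def add_vertex(self, index, word, lemma, pos):
        self.vertices[index] = Vertex(index, word, lemma, pos)

    def add_edge(self, governor, dependent, dep_type):
        # Directed edge from the governor to its dependent.
        self.edges.append((governor, dependent, dep_type))

    def dependents(self, governor, dep_type=None):
        return [
            self.vertices[dep]
            for gov, dep, label in self.edges
            if gov == governor and (dep_type is None or label == dep_type)
        ]


graph = DependencyGraph()
tokens = [("No", "no", "DT"), ("infiltrate", "infiltrate", "NN"),
          ("or", "or", "CC"), ("effusion", "effusion", "NN")]
for index, (word, lemma, pos) in enumerate(tokens, start=1):
    graph.add_vertex(index, word, lemma, pos)

# Assumed parse: "infiltrate" (vertex 2) governs the other three tokens.
graph.add_edge(2, 1, "neg")
graph.add_edge(2, 3, "cc")
graph.add_edge(2, 4, "conj")

for vertex in graph.dependents(2):
    print(vertex.word, vertex.pos)
```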
Negation and Uncertainty detection
Sample rules
• Defined rules on the dependency graphs by utilizing the dependency label
and direction information
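The slide's sample rules are not reproduced in the text, so the following is a hedged illustration of what a rule over dependency labels and directions could look like, not the rules actually used. It assumes two hypothetical rules: a finding is negated if it governs a `neg` edge to a cue word, or if it is the `conj` dependent of a finding that does. Edges are plain `(governor, dependent, type)` tuples keyed by lemma, which only works when each lemma occurs once in the sentence.

```python
# Assumed parse of "No infiltrate or effusion."
EDGES = [
    ("infiltrate", "no", "neg"),
    ("infiltrate", "or", "cc"),
    ("infiltrate", "effusion", "conj"),
]

# Hypothetical cue list for illustration only.
NEGATION_CUES = {"no", "not"}


def has_negation_cue(edges, finding):
    """Rule 1: the finding governs a `neg` edge pointing at a cue word."""
    return any(
        gov == finding and dep_type == "neg" and dep in NEGATION_CUES
        for gov, dep, dep_type in edges
    )


def is_negated(edges, finding):
    """Apply rule 1, then rule 2: inherit negation through a `conj` edge."""
    if has_negation_cue(edges, finding):
        return True
    return any(
        dep == finding and dep_type == "conj" and has_negation_cue(edges, gov)
        for gov, dep, dep_type in edges
    )


for finding in ("infiltrate", "effusion"):
    print(finding, is_negated(EDGES, finding))
```

Both the edge label and the edge direction matter here: the first rule looks at edges leaving the finding, the second at a `conj` edge arriving at it.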
Experiments
Three benchmarking corpora
Dataset                          Reports   Positives   Negatives
OpenI (Demner-Fushman et al.)    3,851     1,354       -
ChestX-ray (Wang et al.)         900       2,131       -
• Demner-Fushman D, Kohli M, Rosenman M, et al. Preparing a collection
of radiology examinations for distribution and retrieval. Journal of the
American Medical Informatics Association. 2015;23:304-310.
Results on OpenI and ChestX-ray
(P = precision, R = recall, F = F-score, in %)
                            OpenI                 ChestX-ray
                            P      R      F       P      R      F
DNorm/MetaMap               13.8   85.7   23.8    72.3   95.7   82.4
DNorm/MetaMap + Negation    89.8   85.0   87.3    94.4   94.4   94.4
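As a quick check on the table, the F column can be recomputed from P and R, assuming F is the usual balanced F-score (harmonic mean of precision and recall). The `f_score` helper and the `results` dictionary are names introduced here for illustration; the numbers are copied from the table.

```python
def f_score(precision, recall):
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (dataset, method) -> (precision, recall), in percent, from the table.
results = {
    ("OpenI", "DNorm/MetaMap"): (13.8, 85.7),
    ("OpenI", "DNorm/MetaMap + Negation"): (89.8, 85.0),
    ("ChestX-ray", "DNorm/MetaMap"): (72.3, 95.7),
    ("ChestX-ray", "DNorm/MetaMap + Negation"): (94.4, 94.4),
}

for (dataset, method), (p, r) in results.items():
    print(f"{dataset:11s} {method:26s} F = {f_score(p, r):.1f}")
```

The recomputed values match the reported F column. On OpenI, recall is nearly unchanged (85.7 vs. 85.0) while precision rises from 13.8 to 89.8, so almost all of the F-score gain comes from filtering out negated and uncertain mentions.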
NIH Chest X-ray Dataset
One of the largest chest X-ray datasets publicly available to the scientific
community
• 112,120 frontal-view X-ray images
• 30,805 unique patients
https://nihcc.app.box.com/v/ChestXray-NIHCC
Multi-label Classification and Localization
Multi-label Classification and Localization Framework
Radiology image classification
• The data are split into training (70%), validation (10%) and testing (20%) sets
• The multi-label CNN architecture is implemented using the Caffe framework
• The ImageNet pre-trained models, i.e., AlexNet, GoogLeNet, VGGNet-16
and ResNet-50, are obtained from the Caffe model zoo
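A minimal sketch of a 70/10/20 split, assuming a simple shuffled split over item indices. The slides do not say how the split was drawn, so the `split_indices` helper, the fixed seed, and the image-level granularity are all assumptions made for illustration.

```python
import random


def split_indices(n_items, train_pct=70, val_pct=10, seed=0):
    """Shuffle indices and cut them into train / validation / test parts."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    # Integer arithmetic avoids floating-point rounding at the boundaries.
    n_train = n_items * train_pct // 100
    n_val = n_items * val_pct // 100
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # whatever remains, 20% by default
    return train, val, test


train_idx, val_idx, test_idx = split_indices(1000)
print(len(train_idx), len(val_idx), len(test_idx))  # 700 100 200
```

Because the dataset contains multiple images per patient, shuffling patient IDs instead of image indices would keep all images of one patient in the same part; the same helper can be applied to a list of patient IDs for that purpose.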
Multi-label Disease Classification Results
Wang X, Peng Y, Lu L, Bagheri M, Lu Z, Summers R. ChestX-ray8: Hospital-scale
Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and
Localization of Common Thorax Diseases. IEEE Conference on Computer Vision and
Pattern Recognition (CVPR). 2017.
Conclusion and Future work
• We propose a new algorithm (NegBio), based on dependency graphs, to
determine negative and equivocal findings in radiology reports
• We provide one of the largest publicly available chest X-ray datasets to
the scientific community
• We explore the combination of text mining with radiology image analysis in
the era of deep learning
Future work
• Explore NegBio’s applicability to clinical texts beyond radiology reports
• The current results suggest that building fully automated, high-precision CAD
systems remains challenging
Acknowledgment
This work was supported by the Intramural Research Program of the National
Institutes of Health, at the National Library of Medicine and the Clinical Center. We are
also grateful to Robert Leaman for his editorial comments. We thank NVIDIA
Corporation for the GPU donation.
Thank you!
yifan.peng@nih.gov
https://nihcc.app.box.com/v/ChestXray-NIHCC

Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 

Text Mining Radiology Reports for Deep Learning Radiology Images

  • 1. Yifan Peng1, Xiaosong Wang2, Le Lu2, Mohammadhadi Bagheri2, Ronald Summers2, Zhiyong Lu1 1 NCBI/NLM/NIH 2 CC/NIH Twitter: #AMIA2017 Text Mining Radiology Reports for Deep Learning Radiology Images Methods for Identification, Classification, and Association using EHR Data S23
  • 2. All Start with Data
      • The availability of well-labeled data is the key for large scale machine learning, e.g. deep learning
      • Hospitals have accumulated a large number of raw radiology images and reports
      • Conventional ways for collecting image labels are NOT applicable
      • the security and privacy issues
      • requires comprehension of domain-specific medical knowledge
      Figure labels: Large scale natural image datasets; Large scale Medical Image dataset
  • 3. Overview: Mining image labels via NLP for multi-label pathology classification
  • 4. A Sample Entry (Image, Report, Label)
      Report: findings: pa and lateral views of the chest demonstrate significantly improved bilateral lower lung field interstitial markings compatible with linear atelectasis. unchanged right 9th rib fracture peripherally. unchanged ossification left coracoacromial ligament. the cardiac and mediastinal contours are stable. impression: improved bilateral lower lung field linear atelectasis.
      Label: Atelectasis
  • 5. 8 Common Thorax Diseases
  • 6. Challenges: Negative and equivocal findings may indicate the absence of findings mentioned within the radiology report
      Findings: right internal jugular catheter remains in place. Large metastatic lung mass in the lateral left upper lobe is again noted. No infiltrate or effusion. Extensive surgical clips again noted left axilla.
      Impression: no significant change.
      Reason for exam (entered by ordering clinician into cris): bilateral pneumonia no change in the tracheostomy tube or right internal jugular venous catheter. Unchanged bilateral alveolar infiltrates, fluid in the right minor fissure, lucency at the right costophrenic angle suggesting pneumothorax. Overall, no significant change
  • 7. Related Work
      Chapman W, et al. A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of Biomedical Informatics. 2001;34:301-310.
      Harkema H, et al. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports. Journal of Biomedical Informatics. 2009;42:839-851.
      Mutalik P, et al. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS. Journal of the American Medical Informatics Association. 2001;8:598-609.
      Sohn S, Wu S, Chute C. Dependency parser-based negation detection in clinical narratives. AMIA Summits on Translational Science Proceedings. 2012;2012:1-8.
      Mehrabi S, et al. DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx. Journal of Biomedical Informatics. 2015;54:213-219.
  • 8. Related Work
      Ogren P, et al. Constructing evaluation corpora for automated clinical named entity recognition. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08). 2008;28-30.
      Uzuner Ö, South B, et al. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. Journal of the American Medical Informatics Association. 2011;18:552-556.
      Suominen H, et al. Overview of the ShARe/CLEF eHealth evaluation lab 2013. In International Conference of the Cross-Language Evaluation Forum for European Languages. 2013;212-231.
      Albright D, et al. Towards comprehensive syntactic and semantic annotations of the clinical narrative. Journal of the American Medical Informatics Association. 2013;20:922-930.
      etc.
  • 9. Our overall method
      1. MetaMap and DNorm were used to map every mention of keywords in a report to a unique concept ID in the Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT)
          • MetaMap (Aronson et al. 2010)
          • DNorm (Leaman, Lu, 2014)
      2. Remove negative and equivocal findings within the radiology report
  • 10. Negation and Uncertainty detection
      • Utilize the universal dependency graph to define patterns
      • a directed graph
      • vertices are labeled with information such as the word, part-of-speech and the word lemma
      • edges represent typed dependencies from the governor to its dependent and are labeled with dependency type
  • 11. Sample rules
      • Defined rules on the dependency graphs by utilizing the dependency label and direction information
  • 12. Experiments: Three benchmarking corpora
      Dataset                          Reports   Positives   Negatives
      OpenI (Demner-Fushman et al)     3,851     1,354       -
      ChestX-ray (Wang et al)          900       2,131       -
      Reference: Demner-Fushman D, Kohli M, Rosenman M, et al. Preparing a collection of radiology examinations for distribution and retrieval. Journal of the American Medical Informatics Association. 2015;23:304-310.
  • 13. Results on OpenI and ChestX-ray (P = precision, R = recall, F = F-score)
                                   OpenI                 ChestX-ray
      System                       P      R      F       P      R      F
      DNorm/MetaMap                13.8   85.7   23.8    72.3   95.7   82.4
      DNorm/MetaMap + Negation     89.8   85.0   87.3    94.4   94.4   94.4
  • 14. NIH Chest X-ray Dataset: One of the largest publicly available chest x-ray datasets to scientific community
      • 112,120 frontal-view X-ray images
      • 30,805 unique patients
      https://nihcc.app.box.com/v/ChestXray-NIHCC
  • 15. Multi-label Classification and Localization Framework
  • 16. Multi-label Classification and Localization
  • 17. Radiology image classification
      • Training (70%), validation (10%) and testing (20%)
      • Multi-label CNN architecture is implemented using Caffe framework
      • The ImageNet pre-trained models, i.e., AlexNet, GoogLeNet, VGGNet-16 and ResNet-50 are obtained from the Caffe model zoo
  • 18. Multi-label Disease Classification Results
      Wang X, Peng Y, Lu L, Bagheri M, Lu Z, Summers R. ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017.
  • 19. Conclusion and Future work
      Conclusion
      • We propose a new algorithm (NegBio), based on dependency graph, to determine negative and equivocal findings in radiology reports
      • We provide one of the largest publicly available chest x-ray datasets to scientific community
      • We explore the combination of text mining with radiology imaging analysis in the era of deep learning.
      Future work
      • To explore NegBio’s applicability in clinical texts beyond radiology reports.
      • The current results suggest that building fully-automated high precision CAD systems remains challenging.
  • 20. Acknowledgment: This work was supported by the Intramural Research Program of the National Institutes of Health, at National Library of Medicine and Clinical Center. We are also grateful to Robert Leaman for his editorial comments. We thank NVIDIA Corporation for the GPU donation.
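
The precision (P), recall (R) and F-score (F) figures on the results slide can be cross-checked with a few lines of Python. This is only a sketch: it assumes that F is the balanced F1 score (the harmonic mean of P and R) and that the values are percentages. The function name `f_score` and the `reported` dictionary are illustrative and do not come from the slides.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F) copied from the results slide.
reported = {
    ("OpenI", "DNorm/MetaMap"): (13.8, 85.7, 23.8),
    ("OpenI", "DNorm/MetaMap + Negation"): (89.8, 85.0, 87.3),
    ("ChestX-ray", "DNorm/MetaMap"): (72.3, 95.7, 82.4),
    ("ChestX-ray", "DNorm/MetaMap + Negation"): (94.4, 94.4, 94.4),
}

for (corpus, system), (p, r, f) in reported.items():
    computed = f_score(p, r)
    print(f"{corpus:<11}{system:<26}computed F={computed:.1f}  reported F={f}")
```

Under that assumption, the recomputed values agree with the reported ones to one decimal place. On OpenI, precision rises from 13.8 to 89.8 while recall stays near 85, which is why F improves so sharply once negated findings are filtered out.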

Editor's Notes

  1. The motivation of this project is straightforward. In general computer vision, neural networks and deep learning techniques have been widely used for image processing tasks such as image classification, object detection and caption generation. But we rarely see computer vision applications of deep learning in the clinical domain. The reason is probably that we do not have a large-scale medical image dataset to meet the data-hungry needs of deep learning. For natural images, we can use crowd-sourcing, but this is not applicable to X-ray images because of security and privacy issues. Labeling X-rays also usually requires domain knowledge. Although hospitals have accumulated a large number of raw radiology images and reports, generating labels for a large-scale dataset remains challenging. In this project, we provide a text-mining method to automatically generate labels from radiology reports, and we show that we can successfully train deep learning models using this dataset.
  2. The figure shows the overview of our approach. We have raw images and reports from Picture Archiving and Communication Systems. We mined the labels from the reports. We used the labeled images to train deep learning models for multi-label classification. In this talk, I will focus on the first step of how we constructed the labels.
  3. So the goal on my side is to find diseases and findings in the clinical report.
  4. Including atelectasis, we mainly focus on 8 diseases such as mass, nodule, and effusion.
  5. Unlike other text, clinical text contains many negative or equivocal findings. Negative findings are findings that were ruled out by the radiologist, such as "no XXX". Equivocal findings are findings the radiologist is suspicious of, such as “suggesting obstructive lung disease”. Since they may indicate the absence of findings mentioned within the radiology report, identifying them is as important as identifying positive findings. Otherwise, information extraction algorithms that do not distinguish negative and equivocal findings from positive ones may return many irrelevant results. Even though many natural language processing applications have been developed in recent years that successfully extract findings mentioned in medical reports, discriminating between positive, negative, and equivocal findings remains challenging.
  6. We use a two-pass approach to achieve this. In the first pass, we use named-entity recognition tools to detect the findings in the report and normalize each one to a unique ID in SNOMED. MetaMap is a knowledge-intensive, rule-based approach that maps biomedical text to the UMLS Metathesaurus. DNorm is a machine learning method, developed by our group, for disease recognition and normalization. In the second pass, we remove negative and equivocal findings from the reports.
  7. The motivation for using the dependency graph is that fewer rules can capture more text variants.
  8. Several rules that are frequently matched in the text
  9. To test the performance of NegBio, we used OpenI, one of the largest corpora in which positive findings are annotated.
  10. We can detect and remove more negative cases. As a result, the precision for positive finding detection increases.
  11. The NIH Clinical Center recently released over 100,000 anonymized chest x-ray images and their corresponding data to the scientific community. We hope the release will allow researchers across the country and around the world to freely access the datasets and increase their ability to teach computers how to detect and diagnose disease.
  12. We used the labeled images to train deep learning models for multi-label classification.
  13. We hope this work could serve as a baseline. We hope the release will allow researchers across the country and around the world to freely access the datasets and increase their ability to teach computers how to detect and diagnose disease.
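
The second pass described in notes 6 to 8 (removing negative and equivocal findings with rules over a dependency graph) can be illustrated with a toy example. The sketch below is not NegBio and does not reproduce its rules, which the slides only show as a figure. The three rules, the cue word lists, the dependency labels, and the hand-written edges are all assumptions made for illustration; a real system would take its edges from a dependency parser.

```python
from typing import List, Tuple

# One dependency edge: (governor, dependency label, dependent), all lowercase.
Edge = Tuple[str, str, str]

# Hypothetical cue lists, chosen only for this illustration.
NEGATION_CUES = {"no", "without"}
UNCERTAINTY_CUES = {"suggesting", "suggest"}


def classify(finding: str, edges: List[Edge]) -> str:
    """Label a finding as 'negative', 'equivocal' or 'positive'."""
    # Rule 1: a negation cue hangs directly off the finding ("no infiltrate").
    dependents = {dep for gov, _, dep in edges if gov == finding}
    if dependents & NEGATION_CUES:
        return "negative"
    for gov, label, dep in edges:
        if dep != finding:
            continue
        # Rule 2: the finding is coordinated with a negated finding
        # ("no infiltrate or effusion").
        if label == "conj" and classify(gov, edges) == "negative":
            return "negative"
        # Rule 3: the finding is governed by an uncertainty cue
        # ("suggesting pneumothorax").
        if gov in UNCERTAINTY_CUES:
            return "equivocal"
    return "positive"


def keep_positive(findings: List[str], edges: List[Edge]) -> List[str]:
    """Drop negative and equivocal findings, keeping the positive ones."""
    return [f for f in findings if classify(f, edges) == "positive"]


# Hand-written edges for three phrases from the sample reports in the slides.
no_infiltrate = [  # "No infiltrate or effusion."
    ("infiltrate", "neg", "no"),
    ("infiltrate", "cc", "or"),
    ("infiltrate", "conj", "effusion"),
]
suggesting = [  # "lucency ... suggesting pneumothorax"
    ("lucency", "acl", "suggesting"),
    ("suggesting", "dobj", "pneumothorax"),
]
mass_noted = [  # "Large ... lung mass ... is again noted."
    ("noted", "nsubjpass", "mass"),
    ("mass", "amod", "large"),
]

print(keep_positive(["infiltrate", "effusion"], no_infiltrate))
print(keep_positive(["pneumothorax"], suggesting))
print(keep_positive(["mass"], mass_noted))
```

In this toy setup only "mass" survives the filter. Rule 2 shows the point made in note 7: a single rule on the `conj` edge covers "no A or B" without listing every surface variant that a word-window approach would need.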