SlideShare uma empresa Scribd logo
1 de 23
Baixar para ler offline
1
Using Text Mining to Explore 
Concept Complexity in Obesity
through Concept Maps
George Karystianis
School of Computer Science
Supervisors: Goran Nenadic, Iain Buchan
Advisor: Andrea Schalk
2
Motivation
● Complex nature of obesity.
● Wide range of biomedical data sources available.
– implementation of biomedical text/data mining.
● Possible to reveal hidden links between obesity and other
diseases.
● Partial completed knowledge representation models of obesity.
● A systematic approach required for:
– analysis and interpretation of clinical knowledge.
3
Concept Maps
● Knowledge representation models.
● Consisted of:
– nodes (concepts).
– links (relationships between the nodes).
● Aim: gather, understand, explore knowledge.
● Variety of users.
● No explicit detail.
● Implemented primarily in education.
4
Concept Map Example
5
Aim
● To design a framework to build/enhance medical concept maps.
● To improve the understanding of health care concept
complexity.
● Assist medical professionals in the representation, exploration
and validation of their expert knowledge.
● Improvement of the clinical health care.
6
Objectives
● Design and implement methods for health care concept
detection.
● Concept organisation in a concept map form.
● Method generation for concept map updates.
● Build a framework for the design/enhancement/validation of
medical concept maps.
● Methodology evaluation through the health problem of obesity:
– validation of obesity related concepts with current structured obesity
information available.
– identify gaps in clinical knowledge.
7
Research Hypothesis &
Questions
-The analysis required to extract health care concepts.
-The approach to built and enhance a concept map.
-The concept map contribution in the representation/validation of knowledge.
-The text mining results help to understand/explore clinical problems.
Biomedical
Text Mining
Scientific
literature
Concept
map
Improvement of
health care
Framework
8
Obesity
● Worldwide problem.
● Epidemic proportions:
– WHO rates (2005): 1.6 billion overweight, 400 million obese.
● Associations to various diseases.
● Complex risk factors and complications.
● Various aspects.
● Lots of research.
9
10
Biomedical Text Mining
● Extraction of information from unstructured data of biomedical
nature.
● Discovery of new, previously unknown knowledge.
● Performed on documents with complex/specific terminology and
expressions.
● Challenges:
– language ambiguity.
– variation of language expression.
● Various tools and applications (Termine, Whatizit, GATE).
● Adaptation to user's tasks and requirements.
11
What we are looking for?
● Risk Factors
● Causal Factors
● Confounding Factors
● Outcomes
● Complications
● Interventions
● ...
12
Methodology Overview
1. Document retrieval.
2. Term/concept extraction.
3. Feature engineering and Information extraction:
- application of classification/clustering techniques.
4. Concept map design.
13
Evaluation-Obesity Case Study
● Comparison:
– What ?
● biomedical text mining results.
● concept map information.
– How ?
● concepts and relationships.
● New ones.
● Examination/manipulation/validation of new knowledge by experts.
● Enhancement of the concept map.
14
Progress so far (1)
● Corpus collection.
● Application of Automated Term Recognition (ATR).
● C-value method.
● Single word ATR:
– terminological head identification.
– word of a multi-word term that defines the term class.
– example:
● “Childhood diabetes type II”.
● Terminological head: “diabetes”.
15
Progress so far (2)
● Ranking head measures:
– total head frequency,
– single head frequency,
– maximum and average C-value,
– abstract frequency,
– ratio of single head frequency/total head frequency,
– tf*idf (term frequency*inverse document frequency).
16
Results
tf*idf total freq single freq abstract freq word freq max_c aver_c ratio
0
5
10
15
20
25
30
35
40
45
0
10
20
30
40
50
Statistical measure
Numberofkeywords
17
Progress so far (3)
● Pattern extraction from abstracts for:
– risk, confounding and causal factors,
– interventions,
– complications,
– outcomes.
Obesity risk is increased among women with psychiatric disorders
Potential risk factor
18
Example
Potential risk factors Potential interventions Potential complications
19
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 2 (1/2): Concept extraction
20
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 2 (2/2): Concept structuring
21
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 3: Design of the medical concept map
22
Summary
● Framework creation for clinical concept map building and
enhancement.
● Improved understanding of health care concept complexity.
● So far:
– comprehension of literature review.
– methodology design.
– single ATR.
– pattern design.
23
End
Acknowledgements
2. School of Computer Science
University of Manchester
1. Medical Research Council

Mais conteúdo relacionado

Semelhante a First year present

Data Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperData Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperNicholas Tenhue
 
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...healthcareisi
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19openEHR-Japan
 
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Maulik Kamdar
 
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docxalinainglis
 
BDCC-06-00004.pdf
BDCC-06-00004.pdfBDCC-06-00004.pdf
BDCC-06-00004.pdfAsiyaKhan63
 
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingSecinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingNethminiWijesinghe
 
Massey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMassey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMartin McMorrow
 
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Universitat Politècnica de València
 
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...Vlad Manea
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Thien Q. Tran
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informaticsimprovemed
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...IJERA Editor
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...IJERA Editor
 
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm OptimizationCase Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimizationrahulmonikasharma
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Nicola Amoroso
 

Semelhante a First year present (20)

Data Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperData Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White Paper
 
MVilla IUI 2012 Lisbon
MVilla IUI 2012 LisbonMVilla IUI 2012 Lisbon
MVilla IUI 2012 Lisbon
 
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
 
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
 
Mapping innovation missions
Mapping innovation missionsMapping innovation missions
Mapping innovation missions
 
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
 
BDCC-06-00004.pdf
BDCC-06-00004.pdfBDCC-06-00004.pdf
BDCC-06-00004.pdf
 
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingSecinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
 
Massey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMassey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL Session
 
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
 
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
 
36411
3641136411
36411
 
Medinfor Gesiti Hospitais
Medinfor Gesiti HospitaisMedinfor Gesiti Hospitais
Medinfor Gesiti Hospitais
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informatics
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...
 
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm OptimizationCase Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
 

Último

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 

Último (20)

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 

First year present

  • 2. 2 Motivation ● Complex nature of obesity. ● Wide range of biomedical data sources available. – implementation of biomedical text/data mining. ● Possible to reveal hidden links between obesity and other diseases. ● Partial completed knowledge representation models of obesity. ● A systematic approach required for: – analysis and interpretation of clinical knowledge.
  • 3. 3 Concept Maps ● Knowledge representation models. ● Consisted of: – nodes (concepts). – links (relationships between the nodes). ● Aim: gather, understand, explore knowledge. ● Variety of users. ● No explicit detail. ● Implemented primarily in education.
  • 5. 5 Aim ● To design a framework to build/enhance medical concept maps. ● To improve the understanding of health care concept complexity. ● Assist medical professionals in the representation, exploration and validation of their expert knowledge. ● Improvement of the clinical health care.
  • 6. 6 Objectives ● Design and implement methods for health care concept detection. ● Concept organisation in a concept map form. ● Method generation for concept map updates. ● Build a framework for the design/enhancement/validation of medical concept maps. ● Methodology evaluation through the health problem of obesity: – validation of obesity related concepts with current structured obesity information available. – identify gaps in clinical knowledge.
  • 7. 7 Research Hypothesis & Questions -The analysis required to extract health care concepts. -The approach to built and enhance a concept map. -The concept map contribution in the representation/validation of knowledge. -The text mining results help to understand/explore clinical problems. Biomedical Text Mining Scientific literature Concept map Improvement of health care Framework
  • 8. 8 Obesity ● Worldwide problem. ● Epidemic proportions: – WHO rates (2005): 1.6 billion overweight, 400 million obese. ● Associations to various diseases. ● Complex risk factors and complications. ● Various aspects. ● Lots of research.
  • 9. 9
  • 10. 10 Biomedical Text Mining ● Extraction of information from unstructured data of biomedical nature. ● Discovery of new, previously unknown knowledge. ● Performed on documents with complex/specific terminology and expressions. ● Challenges: – language ambiguity. – variation of language expression. ● Various tools and applications (Termine, Whatizit, GATE). ● Adaptation to user's tasks and requirements.
  • 11. 11 What we are looking for? ● Risk Factors ● Causal Factors ● Confounding Factors ● Outcomes ● Complications ● Interventions ● ...
  • 12. 12 Methodology Overview 1. Document retrieval. 2. Term/concept extraction. 3. Feature engineering and Information extraction: - application of classification/clustering techniques. 4. Concept map design.
  • 13. 13 Evaluation-Obesity Case Study ● Comparison: – What ? ● biomedical text mining results. ● concept map information. – How ? ● concepts and relationships. ● New ones. ● Examination/manipulation/validation of new knowledge by experts. ● Enhancement of the concept map.
  • 14. 14 Progress so far (1) ● Corpus collection. ● Application of Automated Term Recognition (ATR). ● C-value method. ● Single word ATR: – terminological head identification. – word of a multi-word term that defines the term class. – example: ● “Childhood diabetes type II”. ● Terminological head: “diabetes”.
  • 15. 15 Progress so far (2) ● Ranking head measures: – total head frequency, – single head frequency, – maximum and average C-value, – abstract frequency, – ratio of single head frequency/total head frequency, – tf*idf (term frequency*inverse document frequency).
  • 16. 16 Results tf*idf total freq single freq abstract freq word freq max_c aver_c ratio 0 5 10 15 20 25 30 35 40 45 0 10 20 30 40 50 Statistical measure Numberofkeywords
  • 17. 17 Progress so far (3) ● Pattern extraction from abstracts for: – risk, confounding and causal factors, – interventions, – complications, – outcomes. Obesity risk is increased among women with psychiatric disorders Potential risk factor
  • 18. 18 Example Potential risk factors Potential interventions Potential complications
  • 19. 19 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 2 (1/2): Concept extraction
  • 20. 20 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 2 (2/2): Concept structuring
  • 21. 21 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 3: Design of the medical concept map
  • 22. 22 Summary ● Framework creation for clinical concept map building and enhancement. ● Improved understanding of health care concept complexity. ● So far: – comprehension of literature review. – methodology design. – single ATR. – pattern design.