SlideShare uma empresa Scribd logo
1 de 20
BoostingBiomedical Entity Extraction by Using Syntactic Patterns for Semantic Relation Discovery SvitlanaVolkova, PhD Student, CLSP JHU  DoinaCaragea, William H. Hsu, John Drouhard, Landon Fowles Department of Computing and Information Sciences, K-State Research supported by: K-State National Agricultural Biosecurity Center (NABC) and the US Department of Defense
Agenda Introduction Related Work ,[object Object]
Ontology Learning Methodology ,[object Object]
Step 2: Automated Relationship Extraction
Step 3: Automated Ontology Construction
Step 4: Biomedical Entity ExtractionExperimental Design and Results Summary 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Veterinary Medicine Data Online Structured Data Unstructured Data Official reports by different organizations: state and federal laboratories, bioportals;  health care providers; governmental agricultural or environmental agencies. Web-pages News E-mails (e.g., ProMed-Mail) Blogs Medical literature (e.g., books) Scientific papers (e.g., PubMed) 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Entity Extraction + Event Recognition “The US saw its latest FMD outbreak in Montebello, CA in 1929 where 3,600 pigs were slaughtered”. CRF/HMM Location Montebello, CA USA REG. EXPRESSIONS Date 1929 PATTERN MATCH. Species 3,600 pigs ONTOLOGY? Disease FMD ,[object Object]
LACK OF LABELED DATA FOR SEQUENCE LABELING 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Related Work in Biomedical Entity Extraction Methods: dictionary-based bio-entity name recognition in bio-literature protein name recognition using gazetteer gene-disease relation extraction conditional random fields has been applied for identifying gene and protein mentions Limitations: based on static dictionaries  limits the recall of the system by the size of the dictionary requires annotated training corpora for learning 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Emergency Surveillance Systems BioCaster  manually-constructed ontology of 50 animal diseases Pattern-based Understanding & Learning System PULS  a list of 2400 human and animal disease HealthMap a list of 1100 human and animal disease Limitations based on dictionary lookup Do not extract disease synonyms, viruses and serotypes 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Automated Ontology Learning for Boosting Biomedical Entity Extraction Learn animal disease ontology automatically  from web using syntactic pattern matching for semantic relation discovery  WWW 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Related Work: Relation Extraction 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Methodology 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Step1. Manual Ontology Construction |OINIT| = 429 terms 	|OSyn| = 453 terms 	|OAbbr| = 581 terms 	|OS+A| = 605 terms 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Step 2. Automated Relationship Extraction ,[object Object],		E1 = “swine influenza” is a kind of E2 = “swine fever” ,[object Object],		E1 = “anthrax”, E2 = “yellow fever” are diseases ,[object Object],		E1 = “Ovine epididymitis” is caused by E2 = “Brucella ovis” 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Step 3. Automated Ontology Construction Synonymic Relationship “is a kind of”  OINIT  = {“foot and mouth disease”} “Foot-and-mouth disease, FMD or hoof-and-mouth disease (Aphtae epizooticae) is a highly contagious and sometimes fatal viral disease”. OR  ={“foot and mouth disease”,“FMD”, “hoof-and-mouth disease”, “Aphtae epizooticae”} 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Step 3. Automated Ontology Construction Causative Relationship “is caused by”  O’INIT  = {“foot and mouth disease”, “FMD”,  “hoof-and-mouth disease”, “Aphtae epizooticae”} “FMD is caused by foot-and-mouth disease virus (FMDV)” OR  ={“foot and mouth disease”,“FMD”, “hoof-and-mouth disease”, “Aphtae epizooticae”, “foot-and-mouth disease virus”, “FMDV”} 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Step 4. Biomedical Entity Extraction “Species infecting domestic livestock are B. melitensis(goats and sheep, see Brucellamelitensis), B. suis (pigs, see Swine brucellosis), B. abortus(cattle and bison), B. ovis(sheep), and B. canis(dogs)” Terminology Extraction – “B. melitensis”, “B.suis” …  Segmentation – [43..54], [98..105]  Association Extraction – e.g. “B. melitensis” is a synonym of “Brucellamelitensis”  Normalization – “Brucellosis” - “B. melitensis” - “B.suis”  2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
Experiment ,[object Object]

Mais conteúdo relacionado

Destaque

Cyber bulling panagopoulos_georgie
Cyber bulling panagopoulos_georgieCyber bulling panagopoulos_georgie
Cyber bulling panagopoulos_georgiegeorgepana
 
LinkedIn – I Have a Profile and Some Connections - Now What?
LinkedIn – I Have a Profile and Some Connections - Now What?LinkedIn – I Have a Profile and Some Connections - Now What?
LinkedIn – I Have a Profile and Some Connections - Now What?Social Jack
 
Linked in Social Selling 5 Easy Steps to Convert Connections to New Sales - ...
Linked in Social Selling  5 Easy Steps to Convert Connections to New Sales - ...Linked in Social Selling  5 Easy Steps to Convert Connections to New Sales - ...
Linked in Social Selling 5 Easy Steps to Convert Connections to New Sales - ...Social Jack
 
人生沒有彩排 (With music)
人生沒有彩排 (With music)人生沒有彩排 (With music)
人生沒有彩排 (With music)Dhamma Jata
 
Grace Hopper Celebration 2010
Grace Hopper Celebration 2010Grace Hopper Celebration 2010
Grace Hopper Celebration 2010Svitlana volkova
 
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemus
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemusWebinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemus
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemusJarno Malaprade
 
NAHJ Mobile Journalism Live Video
NAHJ Mobile Journalism Live VideoNAHJ Mobile Journalism Live Video
NAHJ Mobile Journalism Live VideoMo Krochmal
 

Destaque (9)

Cyber bulling panagopoulos_georgie
Cyber bulling panagopoulos_georgieCyber bulling panagopoulos_georgie
Cyber bulling panagopoulos_georgie
 
LinkedIn – I Have a Profile and Some Connections - Now What?
LinkedIn – I Have a Profile and Some Connections - Now What?LinkedIn – I Have a Profile and Some Connections - Now What?
LinkedIn – I Have a Profile and Some Connections - Now What?
 
MedEx'10
MedEx'10MedEx'10
MedEx'10
 
Linked in Social Selling 5 Easy Steps to Convert Connections to New Sales - ...
Linked in Social Selling  5 Easy Steps to Convert Connections to New Sales - ...Linked in Social Selling  5 Easy Steps to Convert Connections to New Sales - ...
Linked in Social Selling 5 Easy Steps to Convert Connections to New Sales - ...
 
Egames
EgamesEgames
Egames
 
人生沒有彩排 (With music)
人生沒有彩排 (With music)人生沒有彩排 (With music)
人生沒有彩排 (With music)
 
Grace Hopper Celebration 2010
Grace Hopper Celebration 2010Grace Hopper Celebration 2010
Grace Hopper Celebration 2010
 
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemus
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemusWebinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemus
Webinaarimateriaali: Terveystalon OmaTerveys ja mobiili asiakakaskokemus
 
NAHJ Mobile Journalism Live Video
NAHJ Mobile Journalism Live VideoNAHJ Mobile Journalism Live Video
NAHJ Mobile Journalism Live Video
 

Semelhante a Web Intelligence 2010

Data has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsData has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsPietro Leo
 
Future of Life Sciences
Future of Life SciencesFuture of Life Sciences
Future of Life SciencesMelanie Swan
 
Open Science at Genome Scale
Open Science at Genome ScaleOpen Science at Genome Scale
Open Science at Genome ScaleLizLyon
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960mare34
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsDuncan Hull
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
Professor Aboul ella Hassanien (Abo) full list of publications till October...
Professor Aboul ella Hassanien (Abo)   full list of publications till October...Professor Aboul ella Hassanien (Abo)   full list of publications till October...
Professor Aboul ella Hassanien (Abo) full list of publications till October...Aboul Ella Hassanien
 
Ontology Engineering for Big Data
Ontology Engineering for Big DataOntology Engineering for Big Data
Ontology Engineering for Big DataKouji Kozaki
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsmikaelhuss
 
Curriculum vitae 170521 updated new
Curriculum vitae 170521 updated newCurriculum vitae 170521 updated new
Curriculum vitae 170521 updated newAbimbolaAfolayan2
 
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK

Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK

Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK
NeISSProject
 
Iot related articles published in cs & it proceedings from january 2020 t...
Iot related articles published in cs & it proceedings from january 2020 t...Iot related articles published in cs & it proceedings from january 2020 t...
Iot related articles published in cs & it proceedings from january 2020 t...ijujournal
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformaticsbiinoida
 
VIVO: Online Networking and Research Management for Scientists
VIVO:  Online Networking and Research Management for ScientistsVIVO:  Online Networking and Research Management for Scientists
VIVO: Online Networking and Research Management for ScientistsJohnC_HPL
 
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...Global R & D Services
 

Semelhante a Web Intelligence 2010 (20)

Master Thesis
Master ThesisMaster Thesis
Master Thesis
 
Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007
 
9th b3 sc
9th b3 sc9th b3 sc
9th b3 sc
 
Data has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsData has a gravity and is attracting decisions
Data has a gravity and is attracting decisions
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
 
Future of Life Sciences
Future of Life SciencesFuture of Life Sciences
Future of Life Sciences
 
Bioinformatics on internet
Bioinformatics on internetBioinformatics on internet
Bioinformatics on internet
 
Open Science at Genome Scale
Open Science at Genome ScaleOpen Science at Genome Scale
Open Science at Genome Scale
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Professor Aboul ella Hassanien (Abo) full list of publications till October...
Professor Aboul ella Hassanien (Abo)   full list of publications till October...Professor Aboul ella Hassanien (Abo)   full list of publications till October...
Professor Aboul ella Hassanien (Abo) full list of publications till October...
 
Ontology Engineering for Big Data
Ontology Engineering for Big DataOntology Engineering for Big Data
Ontology Engineering for Big Data
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
 
Curriculum vitae 170521 updated new
Curriculum vitae 170521 updated newCurriculum vitae 170521 updated new
Curriculum vitae 170521 updated new
 
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK

Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK

Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK

 
Iot related articles published in cs & it proceedings from january 2020 t...
Iot related articles published in cs & it proceedings from january 2020 t...Iot related articles published in cs & it proceedings from january 2020 t...
Iot related articles published in cs & it proceedings from january 2020 t...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
VIVO: Online Networking and Research Management for Scientists
VIVO:  Online Networking and Research Management for ScientistsVIVO:  Online Networking and Research Management for Scientists
VIVO: Online Networking and Research Management for Scientists
 
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...
9th International Conference on Biotechnology, Bio Informatics, Bio Medical S...
 

Mais de Svitlana volkova

Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalSvitlana volkova
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)Svitlana volkova
 
Methods Of Reliability Analysis
Methods Of Reliability AnalysisMethods Of Reliability Analysis
Methods Of Reliability AnalysisSvitlana volkova
 
Ukraine Presentation at Kansas State University
Ukraine Presentation at Kansas State UniversityUkraine Presentation at Kansas State University
Ukraine Presentation at Kansas State UniversitySvitlana volkova
 

Mais de Svitlana volkova (11)

Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location Retrieval
 
WiML Poster
WiML PosterWiML Poster
WiML Poster
 
Topics Modeling
Topics ModelingTopics Modeling
Topics Modeling
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
 
Social Networks
Social NetworksSocial Networks
Social Networks
 
Methods Of Reliability Analysis
Methods Of Reliability AnalysisMethods Of Reliability Analysis
Methods Of Reliability Analysis
 
Ohio Project
Ohio ProjectOhio Project
Ohio Project
 
Ukraine Presentation
Ukraine PresentationUkraine Presentation
Ukraine Presentation
 
Ukraine Presentation at Kansas State University
Ukraine Presentation at Kansas State UniversityUkraine Presentation at Kansas State University
Ukraine Presentation at Kansas State University
 
Communicatons Fulbright
Communicatons FulbrightCommunicatons Fulbright
Communicatons Fulbright
 
Communications Ternopil
Communications TernopilCommunications Ternopil
Communications Ternopil
 

Último

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 

Último (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 

Web Intelligence 2010

  • 1. BoostingBiomedical Entity Extraction by Using Syntactic Patterns for Semantic Relation Discovery SvitlanaVolkova, PhD Student, CLSP JHU DoinaCaragea, William H. Hsu, John Drouhard, Landon Fowles Department of Computing and Information Sciences, K-State Research supported by: K-State National Agricultural Biosecurity Center (NABC) and the US Department of Defense
  • 2.
  • 3.
  • 4. Step 2: Automated Relationship Extraction
  • 5. Step 3: Automated Ontology Construction
  • 6. Step 4: Biomedical Entity ExtractionExperimental Design and Results Summary 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 7. Veterinary Medicine Data Online Structured Data Unstructured Data Official reports by different organizations: state and federal laboratories, bioportals; health care providers; governmental agricultural or environmental agencies. Web-pages News E-mails (e.g., ProMed-Mail) Blogs Medical literature (e.g., books) Scientific papers (e.g., PubMed) 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 8.
  • 9. LACK OF LABELED DATA FOR SEQUENCE LABELING 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 10. Related Work in Biomedical Entity Extraction Methods: dictionary-based bio-entity name recognition in bio-literature protein name recognition using gazetteer gene-disease relation extraction conditional random fields has been applied for identifying gene and protein mentions Limitations: based on static dictionaries limits the recall of the system by the size of the dictionary requires annotated training corpora for learning 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 11. Emergency Surveillance Systems BioCaster manually-constructed ontology of 50 animal diseases Pattern-based Understanding & Learning System PULS a list of 2400 human and animal disease HealthMap a list of 1100 human and animal disease Limitations based on dictionary lookup Do not extract disease synonyms, viruses and serotypes 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 12. Automated Ontology Learning for Boosting Biomedical Entity Extraction Learn animal disease ontology automatically from web using syntactic pattern matching for semantic relation discovery WWW 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 13. Related Work: Relation Extraction 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 14. Methodology 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 15. Step1. Manual Ontology Construction |OINIT| = 429 terms |OSyn| = 453 terms |OAbbr| = 581 terms |OS+A| = 605 terms 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 16.
  • 17. Step 3. Automated Ontology Construction Synonymic Relationship “is a kind of” OINIT = {“foot and mouth disease”} “Foot-and-mouth disease, FMD or hoof-and-mouth disease (Aphtae epizooticae) is a highly contagious and sometimes fatal viral disease”. OR ={“foot and mouth disease”,“FMD”, “hoof-and-mouth disease”, “Aphtae epizooticae”} 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 18. Step 3. Automated Ontology Construction Causative Relationship “is caused by” O’INIT = {“foot and mouth disease”, “FMD”, “hoof-and-mouth disease”, “Aphtae epizooticae”} “FMD is caused by foot-and-mouth disease virus (FMDV)” OR ={“foot and mouth disease”,“FMD”, “hoof-and-mouth disease”, “Aphtae epizooticae”, “foot-and-mouth disease virus”, “FMDV”} 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 19. Step 4. Biomedical Entity Extraction “Species infecting domestic livestock are B. melitensis(goats and sheep, see Brucellamelitensis), B. suis (pigs, see Swine brucellosis), B. abortus(cattle and bison), B. ovis(sheep), and B. canis(dogs)” Terminology Extraction – “B. melitensis”, “B.suis” … Segmentation – [43..54], [98..105] Association Extraction – e.g. “B. melitensis” is a synonym of “Brucellamelitensis” Normalization – “Brucellosis” - “B. melitensis” - “B.suis” 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 20.
  • 21. 100 manually labeled document for entity extraction - DExtOINIT Manually constructed OSynonyms OAbbreviations OSyn+Abbrev OINIT Automatically constructed Google Sets expansion approach ORelation OGoogleSets 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 22. Entity Extraction Results OINIT Precision – 0.54 Recall – 0.25 OG Precision – 0.85 Recall – 0.71 OR Precision – 0.85 Recall – 0.79 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 23. Entity Extraction Results: ROC Curves 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 24. Entity Extraction Results: Learning Curves |OG|=754..1238 |OR|=772..1287 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 25. Summary & Future Work Our results: OR – Precision – 84.8, Recall – 78.9 and F-score – 81.7 OG – Precision – 84.7, Recall – 71.3 BioCaster 200 news articles, F-score – 76.9 DNA, RNA, cell type extraction SVM and orthographic features, F-score – 66.5 Biomedical Entity Extraction multilingual ontology construction using Wikipedia Automated Ontology Construction generalize for other named entities 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada
  • 26. Thank you! Svitlana Volkova, svitlana@jhu.edu http://people.cis.ksu.edu/~svitlana 2010 IEEE / WIC / ACM International Conference on Web Intelligence, 31 Aug - 03 Sept 2010, Toronto Canada