IEEE International Conference on Intelligence and Security Informatics: Public Safety and Security, ISI 2010
23-26 May, Vancouver, BC, Canada
Computational Knowledge & Information Management in Veterinary Epidemiology
Svitlana Volkova and William H. Hsu
Laboratory for Knowledge Discovery in Databases, Department of Computing and Information Sciences, Kansas State University
Sponsors: K-State National Agricultural Biosecurity Center (NABC), US Department of Defense
Agenda
Overview
Animal Disease Monitoring Systems: Manually Supported Web-Interfaces; Automated Web-Services
Framework for Epidemiological Analytics: Web Crawling & Search; Domain-specific Entity Extraction; Animal Disease-related Event Recognition
Summary
Animal Infectious Disease Outbreaks
influence travel and trade
cause economic crises and political instability
zoonotic diseases can cause loss of life
Agenda
Overview
Animal Disease Monitoring Systems: Manually Supported Web-Interfaces; Automated Web-Services
Framework for Epidemiological Analytics: System Functionality; Web Crawling; Domain-specific Entity Extraction; Animal Disease-related Event Recognition
Summary
Animal Disease Monitoring Systems: Manually Supported Web Interfaces (1)
International:
World Animal Health Information Database (WAHID) Interface - http://www.oie.int/wahis/public.php?page=home
WHO Global Atlas of Infectious Diseases - http://diseasemaps.usgs.gov/index.htm
Emergency Prevention System (EMPRES) for Transboundary Animal and Plant Pests and Diseases - http://www.fao.org/EMPRES/default.html
Animal Disease Monitoring Systems: Manually Supported Web Interfaces (2)
USA:
Centers for Disease Control and Prevention (CDC) - http://www.cdc.gov
U.S. Department of Agriculture (USDA) - http://www.usda.gov/wps/portal/usdahome
U.S. Geological Survey (USGS) National Wildlife Health Center (NWHC) - http://www.nwhc.usgs.gov
Iowa State University Center for Food Security and Public Health (CFSPH) - http://www.cfsph.iastate.edu
BioPortal - http://biocomputingcorp.com/bpsystem.html
FMD BioPortal - https://fmdbioportal.ucdavis.edu
United Kingdom:
Department for Environment, Food and Rural Affairs (DEFRA) - http://www.defra.gov.uk
Animal Disease Monitoring Systems: Automated Web Services (1)
BioCaster - http://biocaster.nii.ac.jp/
classifies documents as topically relevant or not
taxonomy of 4300 named entities (50 disease names, 243 country names, 4025 province/city names, latitudes and longitudes)
identifies 40 diseases at up to 25-30 locations per day
multilingual information extraction: English, French, Spanish, Chinese, Thai, Vietnamese, Japanese
uses ontology pattern matching approaches to recognize disease-location-verb pairs
plots events on a Google Map
does not classify events into categories and does not report past outbreaks
no timeline visualization
BioCaster - http://biocaster.nii.ac.jp/
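The slides say only that BioCaster recognizes disease-location-verb pairs through ontology pattern matching. As a rough illustration of that idea, and not BioCaster's actual implementation, the sketch below looks up terms from three small, hypothetical lexicons and reports every disease, verb, and location combination that co-occurs in one sentence. The lexicon contents and the function names `find_terms` and `extract_events` are assumptions made for this example.

```python
import re

# Hypothetical mini-lexicons; a real system would load these from an ontology.
DISEASES = {"foot-and-mouth disease", "avian influenza", "rabies"}
LOCATIONS = {"kansas", "vietnam", "thailand"}
EVENT_VERBS = {"reported", "confirmed", "detected", "spread"}


def find_terms(sentence, lexicon):
    """Return lexicon terms that occur as whole words or phrases in the sentence."""
    lowered = sentence.lower()
    return sorted(
        term for term in lexicon
        if re.search(r"(?<!\w)" + re.escape(term) + r"(?!\w)", lowered)
    )


def extract_events(text):
    """Return (disease, verb, location) triples that co-occur in one sentence."""
    events = []
    # Naive sentence split on terminal punctuation followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        diseases = find_terms(sentence, DISEASES)
        verbs = find_terms(sentence, EVENT_VERBS)
        locations = find_terms(sentence, LOCATIONS)
        for disease in diseases:
            for verb in verbs:
                for location in locations:
                    events.append((disease, verb, location))
    return events


if __name__ == "__main__":
    sample = "Officials confirmed avian influenza in Vietnam. The weather was mild."
    print(extract_events(sample))
```

Sentence-level co-occurrence is the simplest possible pattern; it over-generates when one sentence mentions several diseases or places, which is one reason production systems rely on richer patterns.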
Animal Disease Monitoring Systems: Automated Web Services (1)
Information retrieval system MedISys - http://medusa.jrc.it/medisys/homeedition/all/home.html
Pattern-based Understanding and Learning System (PULS) - http://sysdb.cs.helsinki.fi/puls/jrc/all
collects an average of 50000 news articles per day from about 1400 news portals and about 150 specialized public health sites
43 languages
current ontology contains 2400 disease names, 400 organisms, 1500 political entities and over 70000 location names including towns, cities, provinces
real-time news clustering and filtering by matching 3000 patterns
does not classify events and does not report past outbreaks
MedISys - http://medusa.jrc.it/medisys/homeedition/all/home.html
*part of the Europe Media Monitor (EMM) product family: http://emm.jrc.it/overview.html
Pattern-based Understanding and Learning System (PULS)
Animal Disease Monitoring Systems: Automated Web Services (1)
HealthMap - http://healthmap.org/en
2300 locations and 1100 disease names
identifies 20-30 outbreaks per day
multiple languages: English, Russian, Arabic, French, Portuguese, Spanish, Chinese
manually supported system
EpiSpider - http://www.epispider.org/
ProMED-Mail - www.promedmail.org
The Global Disaster Alert Coordinating System (GDACS) - www.gdacs.org
Central Intelligence Agency (CIA) Factbook - https://www.cia.gov/library/publications/the-world-factbook/
The United Nations Human Development Report sites - http://hdr.undp.org/en
HealthMap - http://healthmap.org/en
ProMED-Mail - www.promedmail.org
EpiSpider - http://www.epispider.org/
Animal Disease-related Data Online
Structured Data: official reports by different organizations: state and federal laboratories, bioportals; health care providers; governmental agricultural or environmental agencies.
Unstructured Data: web pages; news; e-mails (e.g., ProMED-Mail); blogs; medical literature (e.g., books); scientific papers (e.g., PubMed).
Research Challenges (1)
Extract facts/structured information from unstructured text
Manage the specificity of the content (e.g., blogosphere, biomedical literature, news, official reports, etc.)
Necessity of data aggregation from multiple sources
Speculate about event confidence
Solution: estimate the reliability of the source by majority voting
Multiple locations, different case status, and different victim types lead to multiple database entries
Solution: spurious event detection and event disambiguation (e.g., source 1: 10 victims vs. source 2: 15 victims)
Which geo-tag should be assigned: Virginia, USA, or the UK?
Solution: track the geo-tag of the original source of information

IEEE ISI'10

  • 1. IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010 23-26 May, Vancouver, BC, Canada Computational Knowledge & Information Management in Veterinary Epidemiology Svitlana Volkova and William H. Hsu Laboratory for Knowledge Discovery in Databases Department of Computing and Information Sciences Kansas State University Sponsors: K-State National Agricultural Biosecurity Center (NABC) US Department of Defense
  • 2. Agenda IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010 Overview Animal Disease Monitoring Systems Manually Supported Web-Interfaces Automated Web-Services Framework for Epidemiological Analytics Web Crawling & Search Domain-specific Entity Extraction Animal Disease-related Event Recognition Summary
  • 3. Animal Infectious Disease Outbreaks influence on the travel and trade cause economic crises, political instability diseases, zoonotic in type can cause loss of life IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 4. Agenda IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010 Overview Animal Disease Monitoring Systems Manually Supported Web-Interfaces Automated Web-Services Framework for Epidemiological Analytics System Functionality Web Crawling Domain-specific Entity Extraction Animal Disease-related Event Recognition Summary
  • 5. Animal Disease Monitoring Systems: ManuallySupportedWeb Interfaces (1) International: World Animal Health Information Database (WAHID) Interface - http://www.oie.int/wahis/public.php?page=home WHO Global Atlas of Infectious Diseases - http://diseasemaps.usgs.gov/index.htm Emergency Prevention System (EMPRES) for Transboundary Animal and Plant Pests and Diseases - http://www.fao.org/EMPRES/default.html IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 6. IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 7. Animal Disease Monitoring Systems: ManuallySupportedWeb Interfaces(2) USA Centers for Disease Control and Prevention (CDC) - http://www.cdc.gov U.S. Department of Agriculture (USDA) - http://www.usda.gov/wps/portal/usdahome U.S. Geological Survey (USGS) and U.S. Geological Survey (USGS) National Wildlife Health Center (NWHC) - http://www.nwhc.usgs.gov Iowa State University Center for Food Security and Public Health (CFSPH) - http://www.cfsph.iastate.edu BioPortal - http://biocomputingcorp.com/bpsystem.html FMD BioPortal - https://fmdbioportal.ucdavis.edu United Kingdom Department for Environment Food and Rural Affairs (DEFRA) - http://www.defra.gov.uk IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 8. Agenda IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010 Overview Animal Disease Monitoring Systems Manually Supported Web-Interfaces AutomatedWeb-Services Framework for Epidemiological Analytics System Functionality Web Crawling Domain-specific Entity Extraction Animal Disease-related Event Recognition Summary
  • 9.
  • 10. classifies documents as topically relevant or not
  • 11. taxonomy of 4300 named entities (50 disease names, 243 country names, 4025 province/city names, latitudes and longitudes)
  • 12. identifies 40 diseases at up to 25-30 locations per day
  • 13. multilingual information extraction on to English, French, Spanish, Chinese, Thai, Vietnamese, Japanese
  • 14. uses ontology pattern matching approaches to recognize disease-location-verb pairs
  • 15. plots events on a Google Map
  • 16. does not classify events into categories and does not report past outbreaks
  • 17. no timeline visualizationIEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 18. BioCaster -http://biocaster.nii.ac.jp/ IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 19.
  • 20. collects an average 50000 news articles per day from about 1400 news portals and about 150 specialized Public Health sites
  • 22. current ontology contains 2400 disease names, 400 organisms, 1500 political entities and over 70000 location names including towns, cities, provinces
  • 23. real-time news clustering and filtering by matching 3000 patterns
  • 24. does not classify events and does not report past outbreaks.IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 25. MedISys - http://medusa.jrc.it/medisys/homeedition/all/home.html *part of the Europe Media Monitor (EMM) product family http://emm.jrc.it/overview.html IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 26. Pattern-based Understanding and Learning System (PULS) IEEE International Conference on Intelligence and Security Informatics Public Safety and Security, ISI 2010
  • 27.
  • 28. 2300 locations and 1100 disease names
  • 29. identifies between 20 and 30 outbreaks per day
  • 30. supports multiple languages: English, Russian, Arabic, French, Portuguese, Spanish, Chinese
  • 31.
  • 33. The Global Disaster Alert Coordinating System (GDACS) - www.gdacs.org
  • 34. Central Intelligence Agency (CIA) Factbook - https://www.cia.gov/library/publications/the-world-factbook/
  • 35. The United Nations Human Development Report sites - http://hdr.undp.org/en
  • 36. HealthMap - http://healthmap.org/en
  • 37. ProMED-Mail - www.promedmail.org
  • 38. EpiSpider - http://www.epispider.org/
  • 39.
  • 40. Animal Disease-related Data Online. Structured data: official reports by different organizations (state and federal laboratories, bioportals; health care providers; governmental agricultural or environmental agencies). Unstructured data: web pages, news, e-mails (e.g., ProMED-Mail), blogs, medical literature (e.g., books), scientific papers (e.g., PubMed).
  • 41.
  • 42. Extract facts/structured information from unstructured text
  • 43. Manage the specificity of the content (e.g. blogosphere, biomedical literature, news, official reports etc.)
  • 44. Necessity of data aggregation from multiple sources
  • 46. Solution: judge the reliability of a source by majority voting
  • 47. Multiple locations, different case statuses, and different victim types lead to multiple database entries
  • 48.
  • 49. Which geo-tag applies: Virginia, USA, or the UK?
  • 50. Solution: track geo-tag of the original source of information
  • 51. Deal with unknown or undiagnosed diseases
  • 52. “The deadly outbreak has so far killed 16 people in Gabon”
  • 53. Solution: Look into the context/recent outbreaks in this location
  • 55. “FMD outbreak was reported last week/today…”
  • 56.
  • 57. Existing Systems vs. Designed System (2) [comparison table; columns: Existing Systems, Designed System]
  • 58. Target Audience: Research and Public Health communities; Health Care Providers (e.g. hospitals); Governmental Agencies (e.g. Centers for Disease Control and Prevention); Laboratories. 1. Managing the specificity of the blogosphere. 2. Dealing with biomedical literature. 3. Processing news content and official reports. 4. Capturing all possible breakdowns in communication channels between levels of animal disease management (State, National, International).
  • 59. Agenda: Overview; Animal Disease Monitoring Systems; Manually Supported Web-Interfaces; Automated Web-Services; Framework for Epidemiological Analytics; System Functionality; Web Crawling; Domain-specific Entity Extraction; Animal Disease-related Event Recognition; Summary
  • 60. Framework for Epidemiological Analytics
  • 61. 1. Data Collection (1): Periodically crawl the web using the Heritrix crawler - http://crawler.archive.org/ with a set of seeds (ProMED-Mail, DEFRA, etc.) and a set of terms (animal disease names from the ontology); a text-to-tag ratio-based method is used for content extraction from web pages
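The text-to-tag ratio idea mentioned on this slide can be sketched as follows. This is a minimal illustration, not the deck's implementation: it computes, per HTML line, the number of non-tag characters divided by the number of tags, and keeps lines whose ratio exceeds a cut-off. The regex-based tag matching, the function names, and the use of the mean ratio as the default cut-off are all assumptions made for the example; smoothing and script/style removal are omitted.

```python
import re
from typing import List, Optional

# Assumption: a simple regex is good enough to find tags in this sketch.
TAG_RE = re.compile(r"<[^>]*>")


def text_to_tag_ratios(html: str) -> List[float]:
    """Return one ratio per line: text characters divided by tag count."""
    ratios = []
    for line in html.splitlines():
        tag_count = len(TAG_RE.findall(line))
        text = TAG_RE.sub("", line).strip()
        # A line without tags keeps its plain text length as the ratio.
        ratios.append(len(text) / max(tag_count, 1))
    return ratios


def extract_content(html: str, threshold: Optional[float] = None) -> List[str]:
    """Keep the text of lines whose ratio is above the threshold."""
    lines = html.splitlines()
    ratios = text_to_tag_ratios(html)
    if threshold is None:
        # Assumption: the mean ratio is used as a simple default cut-off.
        threshold = sum(ratios) / len(ratios) if ratios else 0.0
    return [
        TAG_RE.sub("", line).strip()
        for line, ratio in zip(lines, ratios)
        if ratio > threshold
    ]
```

Navigation and markup-heavy lines get low ratios, while article paragraphs (long text, few tags) get high ratios, which is why a threshold separates them.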
  • 62. 1. Data Collection (2) [diagram labels: WWW, Email, Crawler, DB, Document Collection, Query, Literature]
  • 63. 2. Data Sharing: Document relevance classification (relevant vs. non-relevant) using the Naive Bayes classifier from Mallet - http://mallet.cs.umass.edu
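To make the relevance classification step concrete, here is a small multinomial Naive Bayes sketch with add-one smoothing. It is not the Mallet API (Mallet is a Java toolkit); the class name, the whitespace tokenizer, and the labels are hypothetical choices for illustration only.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesRelevance:
    """Tiny multinomial Naive Bayes text classifier (illustrative only)."""

    def __init__(self):
        self.class_docs = Counter()              # documents seen per label
        self.word_counts = defaultdict(Counter)  # word frequencies per label
        self.vocab = set()

    def train(self, labeled_docs):
        for text, label in labeled_docs:
            self.class_docs[label] += 1
            for word in text.lower().split():
                self.word_counts[label][word] += 1
                self.vocab.add(word)

    def predict(self, text):
        total_docs = sum(self.class_docs.values())
        best_label, best_logp = None, -math.inf
        for label, n_docs in self.class_docs.items():
            # log prior + sum of log likelihoods with add-one smoothing
            logp = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            denom = total_words + len(self.vocab)
            for word in text.lower().split():
                logp += math.log((self.word_counts[label][word] + 1) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label
```

A usage example would train on a handful of labeled documents, e.g. `clf.train([("rift valley fever outbreak reported in cattle", "relevant"), ...])`, and then call `clf.predict(...)` on each crawled page.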
  • 64. 3. Search: Lucene-based* ranking; query-based keyword search; search by animal disease name and/or location. *Lucene - http://lucene.apache.org
  • 65. 4. Data Analysis. Event example: “On 12 September 2007, a new foot-and-mouth disease outbreak was confirmed in Egham, Surrey”
  • 66. Domain Meta-data. Domain-independent knowledge: location hierarchy (names of countries, states, cities); time hierarchy (canonical dates). Domain-specific knowledge: medical ontology (diseases, serotypes, and viruses).
  • 67. Event Recognition Methodology. Step 1. Entity recognition from raw text. Step 2. Classification of the sentences from which entities are extracted as event-related or not; event-related sentences are further classified as confirmed or suspected. Step 3. Combination of the entities within an event sentence into structured tuples, and aggregation of the tuples related to the same event into one comprehensive tuple.
  • 68. Step 1. Entity Recognition: Locate and classify atomic elements into predefined categories. Disease names: “foot and mouth disease”, “rift valley fever”; viruses: “picornavirus”; serotypes: “Asia-1”. Species: “sheep”, “pigs”, “cattle” and “livestock”. Locations of events specified at different levels of geo-granularity: “United Kingdom”, “eastern provinces of Shandong and Jiangsu, China”. Dates in different formats: “last Tuesday”, “two months ago”.
  • 69. Entity Recognition Tools. Animal Disease Extractor*: relies on a medical ontology automatically enriched with synonyms and causative viruses. Species Extractor*: pattern matching on a stemmed dictionary of animal names from Wikipedia. Location Extractor: Stanford NER Tool** (uses conditional random fields); NGA GEOnet Names Database (GNS)*** for location disambiguation and for retrieving latitude/longitude. Date/Time Extractor: a set of regular expressions. *KDD KSU DSEx - http://fingolfin.user.cis.ksu.edu:8080/diseaseextractor/ **Stanford NER - http://nlp.stanford.edu/ner/index.shtml ***GNS - http://earth-info.nga.mil/gns/html/
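The slide describes the Date/Time Extractor only as "a set of regular expressions". The sketch below shows what such a set might look like for the date formats quoted in this deck (absolute dates such as "9 Jun 2009" and relative ones such as "last Tuesday"). The specific patterns and the name `extract_dates` are assumptions for illustration, not the authors' actual expressions.

```python
import re

# Assumption: English month and weekday names only.
MONTH = (r"(?:Jan(?:uary)?|Feb(?:ruary)?|Mar(?:ch)?|Apr(?:il)?|May|Jun(?:e)?|"
         r"Jul(?:y)?|Aug(?:ust)?|Sep(?:tember)?|Oct(?:ober)?|Nov(?:ember)?|"
         r"Dec(?:ember)?)")
WEEKDAY = r"(?:Monday|Tuesday|Wednesday|Thursday|Friday|Saturday|Sunday)"
NUMBER = r"(?:\d+|one|two|three|four|five|six|seven|eight|nine|ten)"

DATE_PATTERNS = [
    rf"\b\d{{1,2}} {MONTH} \d{{4}}\b",                # 9 Jun 2009
    rf"\b{MONTH} \d{{4}}\b",                          # September 2000
    rf"\blast (?:{WEEKDAY}|week|month|year)\b",       # last Tuesday, last week
    rf"\b{NUMBER} (?:day|week|month|year)s? ago\b",   # two months ago
    r"\b(?:today|yesterday)\b",
]
DATE_RE = re.compile("|".join(DATE_PATTERNS), re.IGNORECASE)


def extract_dates(text):
    """Return all date expressions found in the text, in order of appearance."""
    return [match.group(0) for match in DATE_RE.finditer(text)]
```

Relative expressions such as "last week" still need to be resolved against the publication date of the source document to obtain a canonical date; that step is not shown here.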
  • 70. Step 2. Event Sentence Classification. Constraint: true events should include a disease name together with a status verb from Google Sets* and WordNet**; this eliminates sentences not related to events, such as “Foot and mouth disease is[V] a highly pathogenic animal disease”. Confirmed status: verbs such as “happened” and verb phrases such as “strike out”, e.g. “On 9 Jun 2009, the farm's owner reported[V] symptoms of FMD in more than 30 hogs”. Suspected status: verbs such as “catch” and verb phrases such as “be taken in”, e.g. “RVF is suspected[V] in Saudi Arabia in September 2000”. *Google Sets - http://labs.google.com/sets **WordNet - http://wordnet.princeton.edu/
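The constraint on this slide (a disease name plus a status verb) can be illustrated with a simple rule-based classifier. The disease and verb sets below are tiny hypothetical stand-ins for the ontology and for the verb lists drawn from Google Sets and WordNet; the function name and the "non-event" label are likewise assumptions. Note that this strict rule would reject a verbless headline, so it is a simplification of whatever the full system does.

```python
import re

# Hypothetical stand-ins for the disease ontology and the status verb lists.
DISEASES = {"foot-and-mouth disease", "foot and mouth disease", "fmd",
            "rift valley fever", "rvf"}
CONFIRMED_VERBS = {"confirmed", "reported", "happened"}
SUSPECTED_VERBS = {"suspected"}


def classify_sentence(sentence):
    """Label a sentence as 'confirmed', 'suspected', or 'non-event'."""
    lowered = sentence.lower()
    has_disease = any(
        re.search(rf"\b{re.escape(name)}\b", lowered) for name in DISEASES
    )
    if not has_disease:
        return "non-event"
    tokens = set(re.findall(r"[a-z]+", lowered))
    if tokens & SUSPECTED_VERBS:
        return "suspected"
    if tokens & CONFIRMED_VERBS:
        return "confirmed"
    # A disease name without a status verb does not count as an event.
    return "non-event"
```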
  • 71. Step 3. Event Tuple Generation. Event attributes: disease, date, location, species, confirmation status. Event tuple: Event_i = <disease; date; location; species; status> = <FMD, 9 Jun 2009, Taoyuan, hog, confirmed>. Event tuple with missing attributes: Event_j = <FMD, ?, ?, ?, confirmed>
  • 72. Event Recognition Workflow. Step 1: Entity Recognition. Foot-and-mouth disease[DIS] on hog[SP] farm in Taoyuan[LOC]. Taiwan's TVBS television station reports that agricultural authorities confirmed foot-and-mouth disease[DIS] on a hog[SP] farm in Taoyuan[LOC]. On 9 Jun 2009[DT], the farm's owner reported symptoms of FMD[DIS] in more than 30 hogs[SP]. Subsequent testing confirmed FMD[DIS]. Agricultural authorities asked the farmer to strengthen immunization. The outbreak has not affected other farms. Authorities stipulated that the affected hog[SP] farm may not sell pork for 2 weeks. Step 2: Sentence Classification. YES 1. Foot-and-mouth disease[DIS] on hog[SP] farm in Taoyuan[LOC]. YES 2. Taiwan's TVBS television station reports that agricultural authorities confirmed foot-and-mouth disease[DIS] on a hog[SP] farm in Taoyuan[LOC]. YES 3. On 9 Jun 2009[DT], the farm's owner reported symptoms of FMD[DIS] in more than 30 hogs[SP]. YES 4. Subsequent testing confirmed FMD[DIS]. NO 5. Agricultural authorities asked the farmer to strengthen immunization. NO 6. The outbreak has not affected other farms. NO 7. Authorities stipulated that the affected hog[SP] farm may not sell pork for 2 weeks. Step 3a: Tuple Generation. E1 = <Foot-and-mouth disease, ?, Taoyuan, hog, ?>; E2 = <Foot-and-mouth disease, ?, Taoyuan, hog, confirmed>; E3 = <FMD, 9 Jun 2009, ?, hog, reported>; E4 = <FMD, ?, ?, ?, confirmed>. Step 3b: Tuple Aggregation. E = <disease, date, location, species, status> = <Foot-and-mouth disease, 9 Jun 2009, Taoyuan, hog, confirmed>. The First International Workshop on Web Science and Information Exchange in the Medical Web (MedEx 2010)
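Step 3b (tuple aggregation) can be sketched as a merge over partial tuples. The sketch assumes that all input tuples have already been judged to describe the same event (for example, that "FMD" and "Foot-and-mouth disease" were matched as synonyms upstream). The `Event` type, the status ranking, and the "first non-missing value wins" rule are assumptions for illustration; the slides do not specify the merge rule.

```python
from typing import Iterable, NamedTuple, Optional


class Event(NamedTuple):
    disease: Optional[str] = None
    date: Optional[str] = None
    location: Optional[str] = None
    species: Optional[str] = None
    status: Optional[str] = None


# Assumption: a stronger status overrides a weaker one when merging.
STATUS_RANK = {"suspected": 1, "reported": 2, "confirmed": 3}


def aggregate(events: Iterable[Event]) -> Event:
    """Merge partial tuples for one event into a single comprehensive tuple."""
    events = list(events)

    def first_known(attr: str) -> Optional[str]:
        # Take the first non-missing value in document order.
        return next(
            (getattr(e, attr) for e in events if getattr(e, attr) is not None),
            None,
        )

    statuses = [e.status for e in events if e.status is not None]
    status = max(statuses, key=lambda s: STATUS_RANK.get(s, 0)) if statuses else None
    return Event(
        first_known("disease"),
        first_known("date"),
        first_known("location"),
        first_known("species"),
        status,
    )
```

Applied to the four tuples E1 to E4 from the workflow, with `?` represented as `None`, this merge yields the aggregated tuple shown in Step 3b.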
  • 73.
  • 74. Animal Disease Extraction Results (2)
  • 75. Animal Disease Extraction Results (3)
  • 76. Event Recognition Results. Score_i = <w_d · disease; w_t · date; w_l · location; w_s · species; w_c · status; …>, subject to disease + status = 2. Pyramid score values - http://www1.cs.columbia.edu/~becky/DUC2006/2006-pyramid-guidelines.html_ducview are interpreted as event extraction accuracy. The lists of verbs from Google Sets and WordNet are applied in NS (unstemmed) and S (stemmed) versions.
  • 77. 5. Visualization: Map View (GoogleMaps API - http://code.google.com/apis/maps/) and Timeline View (SIMILE API - http://www.simile-widgets.org/timeline/)
  • 78. Event Representation by Date/Time: Timeline View - http://fingolfin.user.cis.ksu.edu/timemap.1.4/FMD_2007_UK_Viz/FMD_Viz.htm
  • 79. Event Representation by Location: Map View - http://fingolfin.user.cis.ksu.edu/timemap.1.4/FMD_2007_UK_Viz/FMD_Viz.htm
  • 80. Summary: perform focused crawling of different sources (books, research papers, blogs, governmental sources, etc.); use a semantic relationship learning approach (including synonymic, hyponymic, and causal relationships) for automated ontology expansion for domain-specific entity extraction (e.g., diseases, viruses); recognize geo-entities using a CRF approach and disambiguate them using the GEOnet Names Server (GNS); extract animal disease-related events with more descriptive event attributes such as species, dates, and event confirmation status, in contrast to “disease-location” pairs; support timeline representation of extracted events in SIMILE in addition to visualizing events on Google Maps
  • 81.
  • 82. Acknowledgments. KDD Lab alumni: Tim Weninger (crawler deployment) and Jing Xia (rule-based event extraction). KDD Lab assistants: Information Extraction Team (John Drouhard, Landon Fowles, Swathi Bujuru); Spatial Data Mining Team (Wesam Elshamy, Andrew Berggren); Topic Detection & Tracking Team (Surya Kallumadi, Danny Jones, Srinivas Reddy). Faculty at the University of Illinois at Urbana-Champaign (2009 Data Sciences Summer Institute): ChengXiang Zhai, Dan Roth, Jiawei Han and Kevin Chang.
  • 83. References. S. Volkova, W. Hsu, and D. Caragea, “Named entity recognition and tagging in the domain of epizootics,” in Proc. of Women in Machine Learning Workshop (WiML’09). S. Volkova, D. Caragea, W. H. Hsu, and S. Bujuru, “Animal disease event recognition and classification,” in Proc. of The First International Workshop on Web Science and Information Exchange in the Medical Web, 19th World Wide Web Conference WWW-2010. S. Volkova, D. Caragea, W. H. Hsu, J. Drouhard, and L. Fowles, “Boosting biomedical entity extraction by using syntactic patterns for semantic relation discovery,” ACM Web Intelligence Conference, 2010 (to appear).
  • 84. Thank you! Svitlana Volkova, svitlana.volkova@gmail.com, http://people.cis.ksu.edu/~svitlana; William H. Hsu, bhsu@cis.ksu.edu, http://people.cis.ksu.edu/~bhsu