SlideShare uma empresa Scribd logo
1 de 1
Baixar para ler offline
Named Entity Annotation and Tagging in the Domain of Epizootics
K-State Laboratory for Knowledge                                                                                 Svitlana Volkova, William Hsu, Doina Caragea
  Discovery in Databases (KDD)
                                                                    Kansas State University, Department of Computing and Information Sciences, Manhattan, KS 66506

                                                                                                                  GAZETEER COLLECTION AND ONTOLOGY CONSTRUCTION                                                                                                                     THE EFFECT OF THE ONTOLOGY SIZE AND QUALITY ON THE
  OVERVIEW
                                                                                                                  The main purpose of IE using a gazetteer is to retrieve tokens that match at least
                                                                                                                                                                                                                                                                                    ACCURACY OF DISEASE EXTRACTION
  We present an information extraction (IE) application in the domain of animal
  diseases. Previously, such tasks were performed only for human disease related                                  one term with synonyms, abbreviations from known animal disease names. We                                                                                         The data set is sampled from animal disease crawled sources with number of
  data. As opposed to that, our task is directly related to web crawling for retrieving                           collect prior domain specific knowledge and, as a result, construct ontology of                                                                                   occurrences of the disease named entities above predefined threshold. All animal
  animal infectious disease related information.                                                                  animal disease concepts. The extraction technique is based on a pattern matching                                                                                  diseases were manually annotated within the dataset for future cross-validation.
                                                                                                                  approach. The gazetteer is semi-automatically collected from official web-portals.                                                                                In the first experiment the baseline run is processed using dictionary look-up with
            WWW (official reports about animal disease outbreaks,
                                                                                                                  Using initial gazetteer we enriched ontology with latent synonymic and causal                                                                                     and w/o capitalization feature (1a, 1b). The next runs include addition only
            surveillance networks disease descriptions, fact sheets etc.)
                                                                                                                  relations between related concepts.                                                                                                                               synonyms (2a, 2b) and abbreviations (3a, 3b) respectively. The last run (4a, 4b)
  EMAIL                                                                                                                                                                                                                                                                             combines all above mentioned features.
                                                                                                                                                         Run 1,
                                                                       DOMAIN SPECIFIC                               Averaged                            3.60%                             1. Disease names and fact sheets from Iowa State University Center for
                                                                       KNOWLEDGE                                  "FMD" Extraction                                  Run 2 - only
                                                                                                                                                                   capitalization,            Food Security and Public Health (CFSPH):                                              In the second experiment we divide data into training and test sets. Using training
                                                                                                                    Performance
                                                   Document                                                       over 50 web-sites
                                                                                                                                                                     14.00%
                                                                                                                                                                                                  http://www.cfsph.iastate.edu/diseaseinfo/animaldiseaseindex.htm                   examples we learn the model for animal disease name extraction by discovering
                                                                        Medical ontology, containing                                                                                       2. Word Organization of Animal Health (OIE) Animal Disease Data:
                                                   Collection           names of diseases, viruses,                                                                                               http://www.oie.int/eng/maladies/en_alpha.htm
                                                                                                                                                                                                                                                                                    relations between concepts; we report accuracy on test data set.
                                                                        animal species etc., organized in                                                                                  3. Department for Environmental Food and Rural Affairs, UK (DEFRA):                      In the third experiment, we compare our approach of learning relations between
                                                                        a conceptual hierarchy.                                  Run 3 - only
                                                                                                                                                                                                  http://www.defra.gov.uk/animalh/diseases/vetsurveillance/az_index.htm
                                                                                                                                abbreviations
                                                                                                                                + synonyms,
                                                                                                                                   84.36%                                                  4. United States Department of Agriculture (USDA), Animal and Plant
                                                                                                                                                                                                                                                                                    concepts with Google Sets method. We report results in terms of precision, recall
                              CRAWLER                                                                                                                                                         Health Inspection Service                                                             and F-measure. We build learning curves for both methods in order to show the
                    DB                                                                                               Averaged                   Run 1,
                                                                       DOMAIN INDEPENDENT                         "RVF" Extraction
                                                                                                                                                0.20%                                             http://www.aphis.usda.gov/animal_health/animal_diseases/
                                                                                                                                                                                                                                                                                    influence of the ontology size and quality on the accuracy of extracted results.
                                                                       KNOWLEDGE                                    Performance                                    Run 2 - only
                                                                                                                                                                   capitalization
                                                                                                                                                                                           5. Medline Plus, Service of National Library of Medicine and National
                                                                                                                  over 50 web-sites                                  38.02%                   Institute of Health
  LITERATURE                    QUERY                                   Location hierarchy, containing                                                                                            http://www.nlm.nih.gov/medlineplus/animaldiseasesandyourhealth.html                   List’s look up features:                    Document level features: keyword                                      Word level
                                                                        names of countries, states or                       Run 3 - only
                                                                                                                            abbreviation                                                   6. Wikipedia                                                                                 flexible pattern match                    appearance within predefined window.                               morphological features
                                                                        provinces, cities, etc; canonical                   + synonyms,
                                                                                                                              57.52%                                                              http://en.wikipedia.org/wiki/Animal_diseases
                                                                        date/time representation.
                                                                                                                  RELATION DISCOVERY BETWEEN CONCEPTS                                                                                                                                                                             Method A: Number of Training Instances
                                                                                                                                                                                                                                                                                                            429             773        955        1159         1287            1442          1561         1590          1619          1682
                                                                                                                  Synonymy (“is a kind of” relation, e.g. “Swine influenza” is a kind of “Swine fever”);                                                                                Accuracy
                                                                                                                                                                                                                                                                                                           0.964        0.929          0.927      0.925        0.964           0.929        0.927         0.925         0.964         0.929
  INFORMATION EXTRACTION IN THE DOMAIN OF EPIZOOTICS                                                                                                                                                                                                                                                                              Method B: Number of Training Instances
  The IE task in the domain of the epizootics can be defined as automatic extraction                                                                                                                                                                                                                        429             754        925        1118         1238            1385          1497         1524          1552          1611
                                                                                                                                                                                                                                                                                        Accuracy
  of structured information that is related to animal diseases from unstructured web                                                                                                                                                                                                                       0.962        0.961          0.864      0.862        0.962           0.961        0.864         0.862         0.962         0.961
  documents with different content. The IE task is related to development of several                              Example A: “Diseases such as Foot and Mouth Disease, Bovine TB or Johne’s Disease                                                                                                                     Dictionary Look-Up: Number of Instances (max. 429)
  modules for tagging of specific entities such as: animal disease name, species,                                 have far-reaching potential for major economic impact on cattle producers”.                                                                                                               1a              1b          2a         2b           3a              3b           4a            4b              -              -
  vaccines, serotypes etc. at the document-level within a crawled collection of                                                                                                                                                                                                         Accuracy
                                                                                                                  Causal links (“is caused by”, e.g. “Ovine epididymitis is caused by Brucella ovis”).                                                                                                     0.885        0.920          0.886      0.896        0.887           0.922        0.889         0.933            -              -
  documents.                                                   ANIMAL                                                                                                                                                                                                                                                                                                                  Learning Curve for Method B
                                                                                                                                                                                                                                                                                   Accuracy                Learning Curve for Method A
                                                                                                                                                                                                                                                                                                                                                           Accuracy                  (Relation Discovery using Google Sets)
                                                                             DISEASE                                                                                                                                                                                                                 (Relation Discovery within Training Data)
   DOCUMENT          Goal: to extract structured                                                                                                                                                                                                                                      1.00                                                                      1.00
                     information with facts and                                                                                                                                                                                                                                       0.98                                                                      0.98
  COLLECTION         entities related to events from                                                                                                                                                                                                                                  0.96                                                                      0.96
                                                          Dipylidium                                              Example F: “Bluetongue virus (BTV), a member of Orbivirus genus within the                                                                                          0.94                                                                      0.94
                     unstructured or semistructured                         Q fever         Baylisascariasis
                                                           infection                                              Reoviridae family causes Bluetongue disease in livestock (sheep, goat, cattle)”.                                                                                    0.92                                                                      0.92
                     sources.                                                                                                                                                                                                                                                         0.90                                                                      0.90
                                                                                                                                                                                                                                                                                      0.88                                                                      0.88
                                                                               Coxiella         Baylisascaris
                                                                                                                  DICTIONARY LOOKUP METHOD FOR DISEASE EXTRACTION                                                                                                                     0.86                                                                      0.86
                                                             Tapeworm                                                                                                                                                                                                                 0.84                                                                      0.84
                                                                               burnetii          procyonis
                                                                                                                                                                                                                            Output:                                                   0.82                                                                      0.82
                                                                                                                                                                                                                                                                                      0.80                                                                      0.80
                                                                                                                                                                                                                                                                                           400       650          900          1150      1400       1650               400            650         900         1150       1400       1650
                                                                                                                                                                                                                         Index of the first/last character                                                                   Number of Ontology Concepts                                                    Number of Ontology Concepts
                                                                              C. burnetii         B. melis
                                                                                                                                                                                                                                                                                         1
                                                                                                                                                                                                                                                                                                                        F-Measure                                                            Precision/Recall
                                                                                                                                                                                           Disease                       Matched text and length                                       0.9                                                                             1
                                                                                                                                                                                                                                                                                       0.8                                                                           0.8
                                                                                                                                                                                          Extractor                                                                                    0.7
                                                                                                B. procyonis                                                                               Module                        Canonical disease names                                       0.6                                                                           0.6
                                                                                                                                                         Input:                                                                                                                                                                                    Method B
  Example: The US saw its latest FMD outbreak in Montebello,                                                                                                                                                                                                                           0.5
                                                                                                                                                                                                                                                                                       0.4                                                         Method A
                                                                                                                                                                                                                                                                                                                                                                     0.4
                                                                                                                                                  Text from file                                                         Associated Synonyms/Abbreviations                                                                                                           0.2
  California in 1929 where 3,600 animals were slaughtered.                                                                                                                                                                                                                             0.3                                                         Gazetteer
                                                                                                                                                                                                                                                                                       0.2                                                                             0
                                                                                                B. transfuga                                                                                                                                                                           0.1                                                                                 0           0.2          0.4          0.6            0.8            1
                                                                                                                  1.0                                                                                                    Non-unique/Unique diseases                                      0
          Animal Disease Names                         Locations                                                  0.9         Precision, Recall, F-measure
                                                                                                                                                                                                                                                                                             1   2    3      4    5     6     7    8    9    10
                                                                                                                                                                                                                                                                                                                                                  Runs                           Metod A          Metod B          Dictionary Look-Up
                                                                                                                  0.8                                                                                             1a - using only initial gazetteer w/o capitalization
          Dates/Times                                  Quantities                                                 0.7                                                                                             1b - using initial gazetteer + capitalization
                                                                                                                  0.6                                                                                                                                                                 FUTURE WORK
  CLASSIFICATION-BASED NAMED ENTITY RECOGNITION                                                                   0.5
                                                                                                                                                                                                                  2a - initial gazetteer + only synonyms w/o capitalization
                                                                                                                                                                                                                  2b - initial gazetteer + only synonyms with capitalization
                                                                                                                                                                                                                                                                                                                                                                                                                  NLP TASKS
                                                                                                                  0.4                                                                                                                                                                 The animal disease extraction task is a
                                                                                                                  0.3                                                                                             3a - init. gazetteer + only abbreviations w/o capitalization        prerequisite for more advanced content
  Named Entity Recognition (NER) task is a subtask of IE which seeks to locate and                                0.2                                                                                             3b - init. gazetteer + only abbreviations with capitalization                                                                                                                            Foot-and-mouth disease[DIS] killed 15

                                                                                                                                                                                                                  4a - init. gazetteer + synonyms + abbrev. w/o capitaliz.
                                                                                                                                                                                                                                                                                      analysis of the unstructured documents within                                                                        hog on farm in Taiwan[LOC]
  classify atomic elements in text into predefined categories, such as:                                           0.1
                                                                                                                                                                                                            Run                                                                       corpora. So, the design of an NER-driven
                                                                                                                  0.0                                                                                             4b - init. gazetteer + synonyms + abbrev. with capitaliz.                                                                                          Syntactic Analysis                    Foot-and-mouth disease [SUBJ] killed[VP]
    disease names (e.g. “foot and mouth disease”);                                                                         4b             4a     3b          3a        2b           2a     1b    1a                                                                                  system for extracting structured tuples that                                                                         15 hog on farm in Taiwan [PP]

                                                                                                                                           Precision          Recall         F-Measure
                                                                                                                                                                                                              ACKNOWLEDGEMENTS                                                        describe animal disease-related events will
    viruses (e.g. “picornavirus”) and serotypes (e.g. “Asia-1”);                                                 1.0
                                                                                                                                                                                                              This work is supported through a grant from the U.S. Department         be performed.
                                                                                                                                                                                                                                                                                                                                                                                                           Fact:
                                                                                                                                                                                                                                                                                                                                                                                                           Disease:
                                                                                                                                                                                                                                                                                                                                                                                                                         killed
                                                                                                                                                                                                                                                                                                                                                                                                                         foot-and-mouth disease
                                                                                                                                                                                                       4b     of Defense. A collaborative program on IE with faculty at the                                                                                                                                Location:     Taiwan
    species and its quantities (e.g. “sheep”, “pigs”);                                                           0.9          Recall Range
                                                                                                                                                                                                                                                                                      The approach extends the shared NER task                                              Extraction                     Species:       hog
                                                                                                                                                                                                       3b     University of Illinois at Urbana-Champaign (ChengXiang Zhai, Dan
                                                                                                                  0.8
                                                                                                                                                                                                              Roth, Jiawei Han, and Kevin Chang), the 2009 Data Sciences              of identifying persons, organizations, and                                                                           Quantity:      15

    locations where outbreak happened (e.g. “United Kingdom”, “eastern provinces                                 0.7                                                                                  3a     Summer Institute (DSSI) on Multimodal Information Access and            locations with not only disease names but
                                                                                                                                                                                                                                                                                                                                                                                                           Foot-and-mouth disease killed 15 hog
                                                                                                                  0.6
     of Shandong and Jiangsu, China” – different level of granularity);                                           0.5
                                                                                                                                                                                                       2b     Synthesis (MIAS), was made possible through the support of
                                                                                                                                                                                                              DHS/ONR.
                                                                                                                                                                                                                                                                                      constituent entities and attributes of these                                         Co-reference                    on farm in Taiwan. Outbreak was
                                                                                                                                                                                                                                                                                                                                                                                                           reported on 9 June.
                                                                                                                  0.4                                                                                  2a                                                                             event tuples. These include dates and times,                                          Resolution
    dates in different formats including special cases (e.g. “last Tuesday”, “two                                0.3                                                                                  1b
                                                                                                                                                                                                              We appreciate effective discussions with Dr. Chris Callison-Burch,
                                                                                                                                                                                                                                                                                      quantities with relevant units, and geo-                                                                             Event:         outbreak
                                                                                                                                                                                                              Dr. Mark Dredze and Dr. Jason Eisner from Center for Language
     month ago”);                                                                                                 0.2
                                                                                                                                                                                                       1a     and Speech Processing, Johns Hopkins University; Tim Weninger,          referenced locations. A primary overall                                                                              Species:
                                                                                                                                                                                                                                                                                                                                                                                                           Disease:
                                                                                                                                                                                                                                                                                                                                                                                                                          15 hog
                                                                                                                                                                                                                                                                                                                                                                                                                          foot-and-mouth disease
                                                                                                                  0.1
                                                                                                                                                                                                              Research Fellow, UIUC;                                                  objective of the IE task is to support timeline                                                                      Location:      Taiwan
    organizations that reports outbreak (e.g. “DEFRA”, “CDC”).                                                   0.0
                                                                                                                                                                                                   4a                                                                                                                                                            Template Generation                       Date/Time:     9 June
                                                                                                                        0                         50                         100
                                                                                                                                                                                     Document number          John Drouhard,      Landon Fowles (KDD Lab, IE Team) for                and map-based visualization of events.
                                                                                                                                                                                                              assistance with experiments.

          KANSAS STATE UNIVERSITY                                                                            KNOWLEDGE DISCOVERY IN DATABASES LABORATORY                                                                                                                           NATIONAL AGRICULTURAL BIOSECURITY CENTER @ K-STATE

Mais conteúdo relacionado

Destaque

Preventive and failure costs mastitis
Preventive and failure costs mastitisPreventive and failure costs mastitis
Preventive and failure costs mastitisHenk Hogeveen
 
University Of Maryland’S Academic Program
University Of Maryland’S Academic ProgramUniversity Of Maryland’S Academic Program
University Of Maryland’S Academic ProgramMark Varner
 
Effect of sensor systems on production, health, reproduction and economic res...
Effect of sensor systems on production, health, reproduction and economic res...Effect of sensor systems on production, health, reproduction and economic res...
Effect of sensor systems on production, health, reproduction and economic res...Henk Hogeveen
 
Faculty and courses
Faculty and coursesFaculty and courses
Faculty and coursesstatisense
 
Formal Concept Analysis
Formal Concept AnalysisFormal Concept Analysis
Formal Concept AnalysisSSA KPI
 
Inaugural address: Management of animal health
Inaugural address: Management of animal healthInaugural address: Management of animal health
Inaugural address: Management of animal healthHenk Hogeveen
 
Economic farm figures and the milk quotum
Economic farm figures and the milk quotumEconomic farm figures and the milk quotum
Economic farm figures and the milk quotumHenk Hogeveen
 
Associations between farmer participation in veterinary herd health programs ...
Associations between farmer participation in veterinary herd health programs ...Associations between farmer participation in veterinary herd health programs ...
Associations between farmer participation in veterinary herd health programs ...Henk Hogeveen
 
Land and Food Systems Presentation
Land and Food Systems PresentationLand and Food Systems Presentation
Land and Food Systems PresentationGRS UBC
 
Antibiotics in food animal production and resistant bacteria in humans
Antibiotics in food animal production and resistant bacteria in humansAntibiotics in food animal production and resistant bacteria in humans
Antibiotics in food animal production and resistant bacteria in humansGlobal Risk Forum GRFDavos
 
Animal health economics and precision farming
Animal health economics and precision farmingAnimal health economics and precision farming
Animal health economics and precision farmingHenk Hogeveen
 
8 animal farm
8 animal farm8 animal farm
8 animal farmT3ach3r
 
Animal genetic resources for improved productivity under harsh environmental ...
Animal genetic resources for improved productivity under harsh environmental ...Animal genetic resources for improved productivity under harsh environmental ...
Animal genetic resources for improved productivity under harsh environmental ...SIANI
 
Replacement strategy for dairy farmers
Replacement strategy for dairy farmersReplacement strategy for dairy farmers
Replacement strategy for dairy farmersHenk Hogeveen
 
Animal Farm Chapter 9
Animal Farm Chapter 9Animal Farm Chapter 9
Animal Farm Chapter 9mrbelprez
 

Destaque (20)

Preventive and failure costs mastitis
Preventive and failure costs mastitisPreventive and failure costs mastitis
Preventive and failure costs mastitis
 
Slide share
Slide shareSlide share
Slide share
 
University Of Maryland’S Academic Program
University Of Maryland’S Academic ProgramUniversity Of Maryland’S Academic Program
University Of Maryland’S Academic Program
 
Effect of sensor systems on production, health, reproduction and economic res...
Effect of sensor systems on production, health, reproduction and economic res...Effect of sensor systems on production, health, reproduction and economic res...
Effect of sensor systems on production, health, reproduction and economic res...
 
Faculty and courses
Faculty and coursesFaculty and courses
Faculty and courses
 
Formal Concept Analysis
Formal Concept AnalysisFormal Concept Analysis
Formal Concept Analysis
 
Inaugural address: Management of animal health
Inaugural address: Management of animal healthInaugural address: Management of animal health
Inaugural address: Management of animal health
 
The role of Water Economics in Food Security
The role of Water Economics in Food Security The role of Water Economics in Food Security
The role of Water Economics in Food Security
 
Economic farm figures and the milk quotum
Economic farm figures and the milk quotumEconomic farm figures and the milk quotum
Economic farm figures and the milk quotum
 
Associations between farmer participation in veterinary herd health programs ...
Associations between farmer participation in veterinary herd health programs ...Associations between farmer participation in veterinary herd health programs ...
Associations between farmer participation in veterinary herd health programs ...
 
Land and Food Systems Presentation
Land and Food Systems PresentationLand and Food Systems Presentation
Land and Food Systems Presentation
 
Antibiotics in food animal production and resistant bacteria in humans
Antibiotics in food animal production and resistant bacteria in humansAntibiotics in food animal production and resistant bacteria in humans
Antibiotics in food animal production and resistant bacteria in humans
 
Farm power and its sources
Farm power and its sourcesFarm power and its sources
Farm power and its sources
 
Animal health economics and precision farming
Animal health economics and precision farmingAnimal health economics and precision farming
Animal health economics and precision farming
 
8 animal farm
8 animal farm8 animal farm
8 animal farm
 
Animal genetic resources for improved productivity under harsh environmental ...
Animal genetic resources for improved productivity under harsh environmental ...Animal genetic resources for improved productivity under harsh environmental ...
Animal genetic resources for improved productivity under harsh environmental ...
 
Replacement strategy for dairy farmers
Replacement strategy for dairy farmersReplacement strategy for dairy farmers
Replacement strategy for dairy farmers
 
cc aed aetc am&e p&p;1 minor irrigation
cc aed aetc am&e p&p;1 minor irrigationcc aed aetc am&e p&p;1 minor irrigation
cc aed aetc am&e p&p;1 minor irrigation
 
Seminar presentations
Seminar presentationsSeminar presentations
Seminar presentations
 
Animal Farm Chapter 9
Animal Farm Chapter 9Animal Farm Chapter 9
Animal Farm Chapter 9
 

Semelhante a WiML Poster

Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Sage Base
 
Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Sage Base
 
Rocha comple net2012-melbourne
Rocha comple net2012-melbourneRocha comple net2012-melbourne
Rocha comple net2012-melbourneJuan C. Rocha
 
Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalSvitlana volkova
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterMonica Munoz-Torres
 
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Sage Base
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksBITS
 
Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Sage Base
 
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Sage Base
 
German Russian Workshop 2011 - geneXplain
German Russian Workshop  2011 - geneXplainGerman Russian Workshop  2011 - geneXplain
German Russian Workshop 2011 - geneXplaingeneXplain GmbH
 
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...Thitichai Sripan
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...open_phacts
 
Stephen Friend National Heart Lung & Blood Institute 2011-07-19
Stephen Friend National Heart Lung & Blood Institute 2011-07-19Stephen Friend National Heart Lung & Blood Institute 2011-07-19
Stephen Friend National Heart Lung & Blood Institute 2011-07-19Sage Base
 

Semelhante a WiML Poster (20)

Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04
 
Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20
 
Rocha comple net2012-melbourne
Rocha comple net2012-melbourneRocha comple net2012-melbourne
Rocha comple net2012-melbourne
 
Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location Retrieval
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of Exeter
 
David
DavidDavid
David
 
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networks
 
Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01
 
ECCS12
ECCS12ECCS12
ECCS12
 
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
 
Genome comparision
Genome comparisionGenome comparision
Genome comparision
 
Biological Network Inference via Gaussian Graphical Models
Biological Network Inference via Gaussian Graphical ModelsBiological Network Inference via Gaussian Graphical Models
Biological Network Inference via Gaussian Graphical Models
 
German Russian Workshop 2011 - geneXplain
German Russian Workshop  2011 - geneXplainGerman Russian Workshop  2011 - geneXplain
German Russian Workshop 2011 - geneXplain
 
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
 
Stephen Friend National Heart Lung & Blood Institute 2011-07-19
Stephen Friend National Heart Lung & Blood Institute 2011-07-19Stephen Friend National Heart Lung & Blood Institute 2011-07-19
Stephen Friend National Heart Lung & Blood Institute 2011-07-19
 
B.3.5
B.3.5B.3.5
B.3.5
 
Pathogen Genome Data
Pathogen Genome DataPathogen Genome Data
Pathogen Genome Data
 

Mais de Svitlana volkova (17)

EACL'12 Poster
EACL'12 PosterEACL'12 Poster
EACL'12 Poster
 
Grace Hopper Celebration 2010
Grace Hopper Celebration 2010Grace Hopper Celebration 2010
Grace Hopper Celebration 2010
 
Web Intelligence 2010
Web Intelligence 2010Web Intelligence 2010
Web Intelligence 2010
 
Master Thesis
Master ThesisMaster Thesis
Master Thesis
 
MS Thesis Short
MS Thesis ShortMS Thesis Short
MS Thesis Short
 
IEEE ISI'10
IEEE ISI'10IEEE ISI'10
IEEE ISI'10
 
MedEx'10
MedEx'10MedEx'10
MedEx'10
 
Multilingual Ner Using Wiki
Multilingual Ner Using WikiMultilingual Ner Using Wiki
Multilingual Ner Using Wiki
 
Topics Modeling
Topics ModelingTopics Modeling
Topics Modeling
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
 
Social Networks
Social NetworksSocial Networks
Social Networks
 
Methods Of Reliability Analysis
Methods Of Reliability AnalysisMethods Of Reliability Analysis
Methods Of Reliability Analysis
 
Ohio Project
Ohio ProjectOhio Project
Ohio Project
 
Ukraine Presentation
Ukraine PresentationUkraine Presentation
Ukraine Presentation
 
Ukraine Presentation at Kansas State University
Ukraine Presentation at Kansas State UniversityUkraine Presentation at Kansas State University
Ukraine Presentation at Kansas State University
 
Communicatons Fulbright
Communicatons FulbrightCommunicatons Fulbright
Communicatons Fulbright
 
Communications Ternopil
Communications TernopilCommunications Ternopil
Communications Ternopil
 

Último

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 

Último (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

WiML Poster

  • 1. Named Entity Annotation and Tagging in the Domain of Epizootics K-State Laboratory for Knowledge Svitlana Volkova, William Hsu, Doina Caragea Discovery in Databases (KDD) Kansas State University, Department of Computing and Information Sciences, Manhattan, KS 66506 GAZETEER COLLECTION AND ONTOLOGY CONSTRUCTION THE EFFECT OF THE ONTOLOGY SIZE AND QUALITY ON THE OVERVIEW The main purpose of IE using a gazetteer is to retrieve tokens that match at least ACCURACY OF DISEASE EXTRACTION We present an information extraction (IE) application in the domain of animal diseases. Previously, such tasks were performed only for human disease related one term with synonyms, abbreviations from known animal disease names. We The data set is sampled from animal disease crawled sources with number of data. As opposed to that, our task is directly related to web crawling for retrieving collect prior domain specific knowledge and, as a result, construct ontology of occurrences of the disease named entities above predefined threshold. All animal animal infectious disease related information. animal disease concepts. The extraction technique is based on a pattern matching diseases were manually annotated within the dataset for future cross-validation. approach. The gazetteer is semi-automatically collected from official web-portals. In the first experiment the baseline run is processed using dictionary look-up with WWW (official reports about animal disease outbreaks, Using initial gazetteer we enriched ontology with latent synonymic and causal and w/o capitalization feature (1a, 1b). The next runs include addition only surveillance networks disease descriptions, fact sheets etc.) relations between related concepts. synonyms (2a, 2b) and abbreviations (3a, 3b) respectively. The last run (4a, 4b) EMAIL combines all above mentioned features. Run 1, DOMAIN SPECIFIC Averaged 3.60% 1. Disease names and fact sheets from Iowa State University Center for KNOWLEDGE "FMD" Extraction Run 2 - only capitalization, Food Security and Public Health (CFSPH): In the second experiment we divide data into training and test sets. Using training Performance Document over 50 web-sites 14.00% http://www.cfsph.iastate.edu/diseaseinfo/animaldiseaseindex.htm examples we learn the model for animal disease name extraction by discovering Medical ontology, containing 2. Word Organization of Animal Health (OIE) Animal Disease Data: Collection names of diseases, viruses, http://www.oie.int/eng/maladies/en_alpha.htm relations between concepts; we report accuracy on test data set. animal species etc., organized in 3. Department for Environmental Food and Rural Affairs, UK (DEFRA): In the third experiment, we compare our approach of learning relations between a conceptual hierarchy. Run 3 - only http://www.defra.gov.uk/animalh/diseases/vetsurveillance/az_index.htm abbreviations + synonyms, 84.36% 4. United States Department of Agriculture (USDA), Animal and Plant concepts with Google Sets method. We report results in terms of precision, recall CRAWLER Health Inspection Service and F-measure. We build learning curves for both methods in order to show the DB Averaged Run 1, DOMAIN INDEPENDENT "RVF" Extraction 0.20% http://www.aphis.usda.gov/animal_health/animal_diseases/ influence of the ontology size and quality on the accuracy of extracted results. KNOWLEDGE Performance Run 2 - only capitalization 5. Medline Plus, Service of National Library of Medicine and National over 50 web-sites 38.02% Institute of Health LITERATURE QUERY Location hierarchy, containing http://www.nlm.nih.gov/medlineplus/animaldiseasesandyourhealth.html List’s look up features: Document level features: keyword Word level names of countries, states or Run 3 - only abbreviation 6. Wikipedia flexible pattern match appearance within predefined window. morphological features provinces, cities, etc; canonical + synonyms, 57.52% http://en.wikipedia.org/wiki/Animal_diseases date/time representation. RELATION DISCOVERY BETWEEN CONCEPTS Method A: Number of Training Instances 429 773 955 1159 1287 1442 1561 1590 1619 1682 Synonymy (“is a kind of” relation, e.g. “Swine influenza” is a kind of “Swine fever”); Accuracy 0.964 0.929 0.927 0.925 0.964 0.929 0.927 0.925 0.964 0.929 INFORMATION EXTRACTION IN THE DOMAIN OF EPIZOOTICS Method B: Number of Training Instances The IE task in the domain of the epizootics can be defined as automatic extraction 429 754 925 1118 1238 1385 1497 1524 1552 1611 Accuracy of structured information that is related to animal diseases from unstructured web 0.962 0.961 0.864 0.862 0.962 0.961 0.864 0.862 0.962 0.961 documents with different content. The IE task is related to development of several Example A: “Diseases such as Foot and Mouth Disease, Bovine TB or Johne’s Disease Dictionary Look-Up: Number of Instances (max. 429) modules for tagging of specific entities such as: animal disease name, species, have far-reaching potential for major economic impact on cattle producers”. 1a 1b 2a 2b 3a 3b 4a 4b - - vaccines, serotypes etc. at the document-level within a crawled collection of Accuracy Causal links (“is caused by”, e.g. “Ovine epididymitis is caused by Brucella ovis”). 0.885 0.920 0.886 0.896 0.887 0.922 0.889 0.933 - - documents. ANIMAL Learning Curve for Method B Accuracy Learning Curve for Method A Accuracy (Relation Discovery using Google Sets) DISEASE (Relation Discovery within Training Data) DOCUMENT Goal: to extract structured 1.00 1.00 information with facts and 0.98 0.98 COLLECTION entities related to events from 0.96 0.96 Dipylidium Example F: “Bluetongue virus (BTV), a member of Orbivirus genus within the 0.94 0.94 unstructured or semistructured Q fever Baylisascariasis infection Reoviridae family causes Bluetongue disease in livestock (sheep, goat, cattle)”. 0.92 0.92 sources. 0.90 0.90 0.88 0.88 Coxiella Baylisascaris DICTIONARY LOOKUP METHOD FOR DISEASE EXTRACTION 0.86 0.86 Tapeworm 0.84 0.84 burnetii procyonis Output: 0.82 0.82 0.80 0.80 400 650 900 1150 1400 1650 400 650 900 1150 1400 1650 Index of the first/last character Number of Ontology Concepts Number of Ontology Concepts C. burnetii B. melis 1 F-Measure Precision/Recall Disease Matched text and length 0.9 1 0.8 0.8 Extractor 0.7 B. procyonis Module Canonical disease names 0.6 0.6 Input: Method B Example: The US saw its latest FMD outbreak in Montebello, 0.5 0.4 Method A 0.4 Text from file Associated Synonyms/Abbreviations 0.2 California in 1929 where 3,600 animals were slaughtered. 0.3 Gazetteer 0.2 0 B. transfuga 0.1 0 0.2 0.4 0.6 0.8 1 1.0 Non-unique/Unique diseases 0 Animal Disease Names Locations 0.9 Precision, Recall, F-measure 1 2 3 4 5 6 7 8 9 10 Runs Metod A Metod B Dictionary Look-Up 0.8 1a - using only initial gazetteer w/o capitalization Dates/Times Quantities 0.7 1b - using initial gazetteer + capitalization 0.6 FUTURE WORK CLASSIFICATION-BASED NAMED ENTITY RECOGNITION 0.5 2a - initial gazetteer + only synonyms w/o capitalization 2b - initial gazetteer + only synonyms with capitalization NLP TASKS 0.4 The animal disease extraction task is a 0.3 3a - init. gazetteer + only abbreviations w/o capitalization prerequisite for more advanced content Named Entity Recognition (NER) task is a subtask of IE which seeks to locate and 0.2 3b - init. gazetteer + only abbreviations with capitalization Foot-and-mouth disease[DIS] killed 15 4a - init. gazetteer + synonyms + abbrev. w/o capitaliz. analysis of the unstructured documents within hog on farm in Taiwan[LOC] classify atomic elements in text into predefined categories, such as: 0.1 Run corpora. So, the design of an NER-driven 0.0 4b - init. gazetteer + synonyms + abbrev. with capitaliz. Syntactic Analysis Foot-and-mouth disease [SUBJ] killed[VP]  disease names (e.g. “foot and mouth disease”); 4b 4a 3b 3a 2b 2a 1b 1a system for extracting structured tuples that 15 hog on farm in Taiwan [PP] Precision Recall F-Measure ACKNOWLEDGEMENTS describe animal disease-related events will  viruses (e.g. “picornavirus”) and serotypes (e.g. “Asia-1”); 1.0 This work is supported through a grant from the U.S. Department be performed. Fact: Disease: killed foot-and-mouth disease 4b of Defense. A collaborative program on IE with faculty at the Location: Taiwan  species and its quantities (e.g. “sheep”, “pigs”); 0.9 Recall Range The approach extends the shared NER task Extraction Species: hog 3b University of Illinois at Urbana-Champaign (ChengXiang Zhai, Dan 0.8 Roth, Jiawei Han, and Kevin Chang), the 2009 Data Sciences of identifying persons, organizations, and Quantity: 15  locations where outbreak happened (e.g. “United Kingdom”, “eastern provinces 0.7 3a Summer Institute (DSSI) on Multimodal Information Access and locations with not only disease names but Foot-and-mouth disease killed 15 hog 0.6 of Shandong and Jiangsu, China” – different level of granularity); 0.5 2b Synthesis (MIAS), was made possible through the support of DHS/ONR. constituent entities and attributes of these Co-reference on farm in Taiwan. Outbreak was reported on 9 June. 0.4 2a event tuples. These include dates and times, Resolution  dates in different formats including special cases (e.g. “last Tuesday”, “two 0.3 1b We appreciate effective discussions with Dr. Chris Callison-Burch, quantities with relevant units, and geo- Event: outbreak Dr. Mark Dredze and Dr. Jason Eisner from Center for Language month ago”); 0.2 1a and Speech Processing, Johns Hopkins University; Tim Weninger, referenced locations. A primary overall Species: Disease: 15 hog foot-and-mouth disease 0.1 Research Fellow, UIUC; objective of the IE task is to support timeline Location: Taiwan  organizations that reports outbreak (e.g. “DEFRA”, “CDC”). 0.0 4a Template Generation Date/Time: 9 June 0 50 100 Document number John Drouhard, Landon Fowles (KDD Lab, IE Team) for and map-based visualization of events. assistance with experiments. KANSAS STATE UNIVERSITY KNOWLEDGE DISCOVERY IN DATABASES LABORATORY NATIONAL AGRICULTURAL BIOSECURITY CENTER @ K-STATE