SlideShare a Scribd company logo
1 of 15
Download to read offline
Issues in Learning an
Ontology from Text
Christopher Brewster, Simon Jupp, Joanne Luciano, David
Shotton, Robert Stevens, and Ziqi Zhang
The Use Case: Animal Behaviour
• Animal behaviour community
recognises the need for an
ontology, e.g. for video
annotation/retrieval
• The community created an
“Animal Behaviour Ontology” -
339 terms
• Can we (semi-) automatically
build from text?
Some Questions
• Do we get a “good ontology”?
• If not, is it useful?
• Is it low-effort?
• Should the result be “tidied up” or used as a
donor?
Methodology: Dataset
• Journal “Animal Behaviour” from Elsevier
• 623 articles from Vol 71 (2006) - Vol 74 (2007)
• 2.2 million words
• Various formats - most usefully xml
We Want an Ontology of Green
• An ontology of “animal behaviours”
• Not an ontology of the corpus
We want the
green terms in
the ontology
Processing Steps (1)
1. Text extracted from XML - excluding affiliations,
acknowledgements, bibliography except for title
etc.
2. Noise removed - person names, animal names,
place names
3. Lemmatiser used to reduce data sparsity
4. Term extraction applied
Processing Steps (2)
5. Term selection
Regular expression used to select
terms ending in behaviour, display,
construction, inspection plus generic
-ing, -ism, etc.
Build hierarchies using String Inclusion
5. Top level terms filtered using “Hearst
Patterns” to test if X ISA
behaviour/activity/etc.
Walking
Running
Jumping
Hunting
Pecking
Reed Bunting
Corn Bunting
Herring
Courtship
Studentship
Cannibalism
Dimorphism
Applying String Inclusion /Rules to
Terms
C
BCAC
ABC
Selection
Mate Selection
Natural Selection
Female Mate
Selection
Lexico-Syntactic Patterns
• X such as P, Q, R; X is a Y
• Grooming is a behaviour
• Copulation is an activity
• Dimorphism is a behaviour
• Calls such as trills, whistles, grunts
Results
• 64,000 terms extracted
• The regexp selected 10,335 terms
• Step 6a resulted in an ontology with 17,776
classes and 1295 top level classes
• Step 6b resulted in an ontology with 13,058
classes and 912 top level classes
Results (2) - Copulation Sub-tree
Results(3)
• Evaluation of terms excluded by regexp:
• 56,000 terms excluded
• Random sample of 3140 terms evaluated by hand
• 7 verbs and 42 nouns should not have been excluded
• E.g., “interaction”
• A recall of .905
Discussion: The problem of focus
Other Issues
• More a vocabulary than an ontology
• SKOS-like rather than OWL-like
• Can deal with “selection”, “mate selection” and
“natural selection
• Highly compositional terms “Adult male
grooming behaviour”
• Cleanish list of top level terms: Canabalism,
copulation, eating, foraging, fighting, grooming
Discussion: Is it useful?
• Answers: No, yes, yes, donor
• Useful ontological fragments
• Bringing ontology to ontology learning is the research
challenge
• Limitations: noise; the problem of focus; only
taxonomic relations
• Advantages: speed; ease; a step towards formal
ontologies

More Related Content

Viewers also liked

Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationMathieu d'Aquin
 
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...Semantic Monitoring of Personal Web Activity to Support the Management of Tru...
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...Mathieu d'Aquin
 
Linux常用命令与工具简介
Linux常用命令与工具简介Linux常用命令与工具简介
Linux常用命令与工具简介weihe
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesrobertstevens65
 
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...Mathieu d'Aquin
 

Viewers also liked (6)

Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in education
 
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...Semantic Monitoring of Personal Web Activity to Support the Management of Tru...
Semantic Monitoring of Personal Web Activity to Support the Management of Tru...
 
Linux常用命令与工具简介
Linux常用命令与工具简介Linux常用命令与工具简介
Linux常用命令与工具简介
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
 
Using The Semantic Web
Using The Semantic WebUsing The Semantic Web
Using The Semantic Web
 
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...
Visualizing Consensus with Online Ontologies to Support Quality in Ontology D...
 

Similar to Issues in Learning an Ontology from Text

Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for studentsLynette Chan
 
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionAldo Gangemi
 
Tutorial OWL and drug discovery ICBO 2013
Tutorial OWL and drug discovery ICBO 2013Tutorial OWL and drug discovery ICBO 2013
Tutorial OWL and drug discovery ICBO 2013Samuel Croset
 
Logic programming (1)
Logic programming (1)Logic programming (1)
Logic programming (1)Nitesh Singh
 
Assignment Objectives Describe abiotic and bioti.docx
Assignment Objectives   Describe abiotic and bioti.docxAssignment Objectives   Describe abiotic and bioti.docx
Assignment Objectives Describe abiotic and bioti.docxlascellesjaimie
 
Can there be such a thing as Ontology Engineering?
Can there be such a thing as Ontology Engineering?Can there be such a thing as Ontology Engineering?
Can there be such a thing as Ontology Engineering?robertstevens65
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biologyrobertstevens65
 
Automated Abstracts and Big Data
Automated Abstracts and Big DataAutomated Abstracts and Big Data
Automated Abstracts and Big DataSameer Wadkar
 
BellWork, Inside Outside Circle, Formative Assessment
BellWork, Inside Outside Circle, Formative AssessmentBellWork, Inside Outside Circle, Formative Assessment
BellWork, Inside Outside Circle, Formative AssessmentAndrea B.
 
Classifying Non-Referential It for Question Answer Pairs
Classifying Non-Referential It for Question Answer PairsClassifying Non-Referential It for Question Answer Pairs
Classifying Non-Referential It for Question Answer PairsJinho Choi
 
Biol208 lecture2 introduction_toecology
Biol208 lecture2 introduction_toecologyBiol208 lecture2 introduction_toecology
Biol208 lecture2 introduction_toecologypolat abdilla
 
Biol208_Lecture2_IntroductionToEcology (1).pdf
Biol208_Lecture2_IntroductionToEcology (1).pdfBiol208_Lecture2_IntroductionToEcology (1).pdf
Biol208_Lecture2_IntroductionToEcology (1).pdfMariaRowenaFlores
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersSamuel Croset
 
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]Shakeeb Ahmad Mohammad Mukhtar
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giantsBenjamin Good
 
Thinking, Language, and Intelligence
Thinking, Language, and IntelligenceThinking, Language, and Intelligence
Thinking, Language, and IntelligenceTan Gent
 

Similar to Issues in Learning an Ontology from Text (20)

Teleology
TeleologyTeleology
Teleology
 
Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for students
 
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
 
Tutorial OWL and drug discovery ICBO 2013
Tutorial OWL and drug discovery ICBO 2013Tutorial OWL and drug discovery ICBO 2013
Tutorial OWL and drug discovery ICBO 2013
 
Logic programming (1)
Logic programming (1)Logic programming (1)
Logic programming (1)
 
BT02.pptx
BT02.pptxBT02.pptx
BT02.pptx
 
Assignment Objectives Describe abiotic and bioti.docx
Assignment Objectives   Describe abiotic and bioti.docxAssignment Objectives   Describe abiotic and bioti.docx
Assignment Objectives Describe abiotic and bioti.docx
 
Can there be such a thing as Ontology Engineering?
Can there be such a thing as Ontology Engineering?Can there be such a thing as Ontology Engineering?
Can there be such a thing as Ontology Engineering?
 
Meghyn slides-hse-2014
Meghyn slides-hse-2014Meghyn slides-hse-2014
Meghyn slides-hse-2014
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 
Automated Abstracts and Big Data
Automated Abstracts and Big DataAutomated Abstracts and Big Data
Automated Abstracts and Big Data
 
BellWork, Inside Outside Circle, Formative Assessment
BellWork, Inside Outside Circle, Formative AssessmentBellWork, Inside Outside Circle, Formative Assessment
BellWork, Inside Outside Circle, Formative Assessment
 
Classifying Non-Referential It for Question Answer Pairs
Classifying Non-Referential It for Question Answer PairsClassifying Non-Referential It for Question Answer Pairs
Classifying Non-Referential It for Question Answer Pairs
 
Biol208 lecture2 introduction_toecology
Biol208 lecture2 introduction_toecologyBiol208 lecture2 introduction_toecology
Biol208 lecture2 introduction_toecology
 
Biol208_Lecture2_IntroductionToEcology (1).pdf
Biol208_Lecture2_IntroductionToEcology (1).pdfBiol208_Lecture2_IntroductionToEcology (1).pdf
Biol208_Lecture2_IntroductionToEcology (1).pdf
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasoners
 
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]
Genetic Algorithm: A Crisp And Concise Introduction [Shakeeb A.]
 
Metaphor detection
Metaphor detectionMetaphor detection
Metaphor detection
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
 
Thinking, Language, and Intelligence
Thinking, Language, and IntelligenceThinking, Language, and Intelligence
Thinking, Language, and Intelligence
 

More from robertstevens65

The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016robertstevens65
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in robertstevens65
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysisrobertstevens65
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologiesrobertstevens65
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology developmentrobertstevens65
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biologyrobertstevens65
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefactsrobertstevens65
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templatesrobertstevens65
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agilerobertstevens65
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)robertstevens65
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Roserobertstevens65
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...robertstevens65
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Disciplinerobertstevens65
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2robertstevens65
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 robertstevens65
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Realityrobertstevens65
 
Making Semantics do Some Work
Making Semantics do Some WorkMaking Semantics do Some Work
Making Semantics do Some Workrobertstevens65
 

More from robertstevens65 (20)

The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Reality
 
Making Semantics do Some Work
Making Semantics do Some WorkMaking Semantics do Some Work
Making Semantics do Some Work
 

Recently uploaded

cybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationcybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationSanghamitraMohapatra5
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsDanielBaumann11
 
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaEGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaDr.Mahmoud Abbas
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfSubhamKumar3239
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learningvschiavoni
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxfarhanvvdk
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxzeus70441
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clonechaudhary charan shingh university
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxpriyankatabhane
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...Chayanika Das
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPirithiRaju
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsDobusch Leonhard
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyChayanika Das
 
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasBACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasChayanika Das
 

Recently uploaded (20)

cybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationcybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitation
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
 
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaEGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Ultrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptxUltrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptx
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdf
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptx
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clone
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
 
Introduction Classification Of Alkaloids
Introduction Classification Of AlkaloidsIntroduction Classification Of Alkaloids
Introduction Classification Of Alkaloids
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPR
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and Pitfalls
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasBACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
 

Issues in Learning an Ontology from Text

  • 1. Issues in Learning an Ontology from Text Christopher Brewster, Simon Jupp, Joanne Luciano, David Shotton, Robert Stevens, and Ziqi Zhang
  • 2. The Use Case: Animal Behaviour • Animal behaviour community recognises the need for an ontology, e.g. for video annotation/retrieval • The community created an “Animal Behaviour Ontology” - 339 terms • Can we (semi-) automatically build from text?
  • 3. Some Questions • Do we get a “good ontology”? • If not, is it useful? • Is it low-effort? • Should the result be “tidied up” or used as a donor?
  • 4. Methodology: Dataset • Journal “Animal Behaviour” from Elsevier • 623 articles from Vol 71 (2006) - Vol 74 (2007) • 2.2 million words • Various formats - most usefully xml
  • 5. We Want an Ontology of Green • An ontology of “animal behaviours” • Not an ontology of the corpus We want the green terms in the ontology
  • 6. Processing Steps (1) 1. Text extracted from XML - excluding affiliations, acknowledgements, bibliography except for title etc. 2. Noise removed - person names, animal names, place names 3. Lemmatiser used to reduce data sparsity 4. Term extraction applied
  • 7. Processing Steps (2) 5. Term selection Regular expression used to select terms ending in behaviour, display, construction, inspection plus generic -ing, -ism, etc. Build hierarchies using String Inclusion 5. Top level terms filtered using “Hearst Patterns” to test if X ISA behaviour/activity/etc. Walking Running Jumping Hunting Pecking Reed Bunting Corn Bunting Herring Courtship Studentship Cannibalism Dimorphism
  • 8. Applying String Inclusion /Rules to Terms C BCAC ABC Selection Mate Selection Natural Selection Female Mate Selection
  • 9. Lexico-Syntactic Patterns • X such as P, Q, R; X is a Y • Grooming is a behaviour • Copulation is an activity • Dimorphism is a behaviour • Calls such as trills, whistles, grunts
  • 10. Results • 64,000 terms extracted • The regexp selected 10,335 terms • Step 6a resulted in an ontology with 17,776 classes and 1295 top level classes • Step 6b resulted in an ontology with 13,058 classes and 912 top level classes
  • 11. Results (2) - Copulation Sub-tree
  • 12. Results(3) • Evaluation of terms excluded by regexp: • 56,000 terms excluded • Random sample of 3140 terms evaluated by hand • 7 verbs and 42 nouns should not have been excluded • E.g., “interaction” • A recall of .905
  • 14. Other Issues • More a vocabulary than an ontology • SKOS-like rather than OWL-like • Can deal with “selection”, “mate selection” and “natural selection • Highly compositional terms “Adult male grooming behaviour” • Cleanish list of top level terms: Canabalism, copulation, eating, foraging, fighting, grooming
  • 15. Discussion: Is it useful? • Answers: No, yes, yes, donor • Useful ontological fragments • Bringing ontology to ontology learning is the research challenge • Limitations: noise; the problem of focus; only taxonomic relations • Advantages: speed; ease; a step towards formal ontologies

Editor's Notes

  1. copulation --> grandfather copulation, cannibalism copulation, harassment copulation, inferred copulation, long copulation, palp copulation, elements copulation, behavioural elements copulation, face to copulation