WoMBO @ ICBO, Buffalo, July 2011
Use of Multiple Ontologies to Characterise the Bioactivity of Small Molecules
Ying Yan (1), Janna Hastings (2,3), Jee-Hyub Kim (1), Stefan Schulz (4), Christoph Steinbeck (2), Dietrich Rebholz-Schuhmann (1)
1 Text Mining, European Bioinformatics Institute, UK
2 Chemoinformatics and Metabolism, European Bioinformatics Institute, UK
3 Swiss Centre for Affective Sciences, University of Geneva, Switzerland
4 Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria

Bioactivity is what small molecules do in biological systems
- Small molecules bind to receptors
- A biochemical pathway is altered
- On a macro scale, a phenotypic effect is observed
Tuesday, July 26, 2011. Multiple Ontologies for Small Molecule Bioactivity – WoMBO 2011

ChEBI is an ontology of small molecules and their properties
[Figure: excerpt of the ChEBI ontology. Node labels: chemical entity, chemical substance, molecular entity, group, carbonyl compound, carboxy group, carboxylic acid; role, biological role, chemical role, application, pharmaceutical, solvent, antibacterial drug, cyclooxygenase inhibitor. Relations: has part, has role. Example entry: cefpodoxime (CHEBI:606443).]

ChEBI role assertions are sparse
- Chemical entities: 26000
- Chemical entities mapped to roles: 3000
- Mapped roles: 600
[Figure: chemical entities linked to roles by has role.]

Bioactivity is reported in the scientific literature
- “Resveratrol inhibits cyclooxygenase-2 transcription and activity in phorbol ester-treated human mammary epithelial cells”
- “Curcumin inhibits cyclooxygenase-2 transcription in bile acid- and phorbol ester-treated human gastrointestinal epithelial cells”

ChEBI bioactivities are pre-coordinated

Bioactivity refers to multiple semantic types
- Enzymes / proteins in general
- Biological processes
- Cellular or anatomical locations
- Organism type

The language of bioactivity
- Trigger words: inhibitor, activator, modulator, agonist, antagonist, regulator, suppressor, adaptor, stimulator, toxin, factor, messenger, blocker
- Relation extraction (chemical to target) via trigger words as features

Targets and types of interaction
Example: beta-adrenergic receptor inhibitor
- target: beta-adrenergic receptor
- type of interaction: inhibitor
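
A minimal sketch of how trigger words could split a role phrase into a target and a type of interaction. The trigger list comes from the preceding slide; the function name `split_role_phrase` and the simple "last word is the trigger" strategy are assumptions made for illustration, not the pipeline used in this work.

```python
# Trigger words from the "language of bioactivity" slide.
TRIGGERS = {
    "inhibitor", "activator", "modulator", "agonist", "antagonist",
    "regulator", "suppressor", "adaptor", "stimulator", "toxin",
    "factor", "messenger", "blocker",
}


def split_role_phrase(phrase):
    """Split a head-final noun phrase such as 'kinase suppressor' into
    (target, interaction type).

    Hypothetical helper: returns None when the last word is not a trigger
    word or when nothing precedes it. Only the simplest noun-phrase
    pattern is handled.
    """
    words = phrase.strip().split()
    if len(words) < 2:
        return None
    head = words[-1].lower()
    # Accept a plural head such as "inhibitors".
    if head not in TRIGGERS and head.endswith("s"):
        head = head[:-1]
    if head not in TRIGGERS:
        return None
    return " ".join(words[:-1]), head


print(split_role_phrase("beta-adrenergic receptor inhibitor"))
# ('beta-adrenergic receptor', 'inhibitor')
```

Prepositional, verb-phrase and relative-clause constructions need real syntactic analysis and are out of scope for this sketch.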

Several syntactical structures
- Noun phrase or adjective/adverb composition: Kinase suppressor, HIV transcriptase inhibitor
- Prepositional phrase modifier: Suppressor of fused protein; Oct-1 CoActivator in S phase protein
- Verb phrase as noun phrase modifier: Carbonic-anhydrase inhibitors causing adverse effects in therapeutic use
- Relative clause as modifier: Factor that binds to inducer of short transcripts protein 1

Text mining approach
- Syntactic parsing
- Chemical tagging (Oscar, Jochem)
- Named entity recognition (UniProtKB, Organ, Organisms and GO Biological Process)
- Target disambiguation (nested types)
- Pruning ‘noisy’ results using rules
Source: MEDLINE abstracts

Pruning out noise
Largest challenges:
- Difficulty in small molecule term recognition
- Small molecule – protein disambiguation
Remove triples from the candidate list when the putative small molecule term:
- is a role term according to ChEBI (e.g. antibiotic)
- has the suffix -ase (normally enzyme names)
- has fewer than three characters
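
The three pruning rules above can be sketched as a single filter. This is an illustration only: the function name `keep_candidate` is hypothetical, and the set of ChEBI role names is assumed to be supplied by the caller (here a toy stand-in), since the slides do not say how it was loaded.

```python
def keep_candidate(term, chebi_role_terms):
    """Return True if a putative small-molecule term survives the three
    pruning rules.

    `chebi_role_terms` is an assumed, caller-supplied set of lower-cased
    ChEBI role names; loading it is outside this sketch.
    """
    t = term.strip().lower()
    if t in chebi_role_terms:  # rule 1: a role term, not a molecule
        return False
    if t.endswith("ase"):      # rule 2: suffix -ase, normally an enzyme
        return False
    if len(t) < 3:             # rule 3: fewer than three characters
        return False
    return True


roles = {"antibiotic", "inhibitor"}  # toy stand-in for ChEBI role names
candidates = ["resveratrol", "antibiotic", "cellulase", "Ca", "curcumin"]
print([c for c in candidates if keep_candidate(c, roles)])
# ['resveratrol', 'curcumin']
```

The suffix rule is deliberately crude: it also drops any non-enzyme word that happens to end in "ase", such as "base", trading some recall for less noise.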

Results: distribution (feature/target)

Organ and Organism: Target vs. Location
- Organ and organism often provide contextual/locational information
- However, there are some true positives (as bioactivity targets)
Examples: “Caesium ion antagonism to chlorpromazine- and L-dopa-produced behavioural depression in mice”; Bothrops jararaca inhibitor; thyroid stimulator

Noise
On the other hand, …
- “Influence of peritoneal dialysis on factors affecting oxygen transport…”
- “Without influence on WDS were: hysotigmine, atropine …”
- “The cellulase component was not markedly inhibited by …”
Annotations on the slide: body part? species? bioactive?

Tagging chemicals
- Jochem – dictionary-based approach: better precision, lower recall
- Oscar3 – machine learning approach: better recall, much more noise

The ontology of bioactivity
[Diagram: chemical entity has_role bioactivity; bioactivity has_target Target; Organ, Organism, Macromolecule and Biological process are each is_a Target.]

Macromolecules
m1 is a beta-adrenergic receptor inhibitor:
m1 subclassOf bearer of some
    (realized by only
        (Inhibition and
            (has target some BetaAdrenergicReceptor)))

Biological processes
m2 is a mitosis stimulator:
m2 subclassOf bearer of some
    (realized by only
        (Stimulation and
            (has target some
                (participant of some Mitosis))))

Organ as target
m3 is a thyroid stimulator:
m3 subclassOf bearer of some
    (realized by only
        (Stimulation and
            (has target some
                (has locus some ThyroidGland))))

Species as definitional constraint
m4 is a mouse thyroid stimulator:
m4 subclassOf bearer of some
    (realized by only
        (Stimulation and
            (has target some
                (has locus some
                    (ThyroidGland and part of some Mouse)))))
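
The four axiom patterns above (macromolecule, biological process, organ, organ restricted to a species) differ only in how the target is wrapped. As a rough sketch, they could be generated from extracted triples by string templating. The function `bioactivity_axiom` and its `kind` argument are hypothetical names introduced here; the output is plain text in the notation of the slides, not validated OWL.

```python
def bioactivity_axiom(name, interaction, target, kind, species=None):
    """Compose a class axiom in the style of the four slide patterns.

    kind is one of "macromolecule", "process" or "organ"; species is
    only used together with kind="organ". Class names are passed
    through unchanged.
    """
    if kind == "macromolecule":
        filler = target
    elif kind == "process":
        filler = f"(participant of some {target})"
    elif kind == "organ":
        if species is None:
            locus = target
        else:
            locus = f"({target} and part of some {species})"
        filler = f"(has locus some {locus})"
    else:
        raise ValueError(f"unknown target kind: {kind}")
    return (f"{name} subclassOf bearer of some "
            f"(realized by only ({interaction} and (has target some {filler})))")


print(bioactivity_axiom("m4", "Stimulation", "ThyroidGland", "organ", "Mouse"))
```

Keeping the wrapping logic in one place makes it explicit that the species constraint is part of the definition in this pattern, which is exactly what the following slide argues is often too strict.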

Contextual vs. Definitional
- Organisms, organs and body parts appear frequently as contextual, locational modifiers for bioactivities
- In these cases, the above formalism is too strict
- We therefore introduce an additional relationship: has context, between a bioactivity and an organism, organ or body part
- Non-definitional: the bioactivity can take place in many organisms, but was discovered through investigations in one organism

Relating context to chemical-bioactivity associations
- Context applies not to bioactivity alone but to small molecule – bioactivity associations (i.e. a ternary relationship)
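
One way to picture the ternary relationship is as a record with three fields, so that the context belongs to the association and not to the bioactivity. The class name `BioactivityAssociation` is hypothetical, and the two records are loosely adapted from the literature sentences quoted earlier in the deck; treating a cell type as the context is an assumption of this sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class BioactivityAssociation:
    """One small molecule - bioactivity association, optionally with the
    context in which it was observed."""
    molecule: str
    bioactivity: str
    context: Optional[str] = None


observations = [
    BioactivityAssociation("resveratrol", "cyclooxygenase-2 inhibitor",
                           "human mammary epithelial cells"),
    BioactivityAssociation("curcumin", "cyclooxygenase-2 inhibitor",
                           "human gastrointestinal epithelial cells"),
]

# The same bioactivity carries a different context for each molecule, so
# the context cannot be stored on the bioactivity alone.
contexts = {a.context for a in observations
            if a.bioactivity == "cyclooxygenase-2 inhibitor"}
print(len(contexts))  # 2
```

In an ontology, the same idea would typically require reifying the association as its own entity, since binary relations alone cannot attach a context to a molecule-bioactivity pair.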

Next-generation curation tools
- Text mining support for human curation knowledge discovery effort
- Multiple ontology-based reasoning for automated consistency checking and error detection

Conclusions
- Language model for extracting small molecule bioactivity information from text
- Ontology model for accurately representing such information, and allowing automated reasoning across ontologies from chemicals to their targets

Future work
- Gold standard for chemical bioactivity in text, to be used to evaluate our approach and to train machine learning tools
- Extending the relationship extraction approach to include chemical roles, applications and structural relationships

Acknowledgements
- Thanks: Colin Batchelor (RSC), Adam Bernard (EBI)
- Funding: BBSRC, grant agreement number BB/G022747/1 within the "Bioinformatics and biological resources" fund

Using multiple ontologies to characterise the bioactivity of small molecules

  • 1. WoMBO @ ICBO, Buffalo, July 2011. Use of Multiple Ontologies to Characterise the Bioactivity of Small Molecules. Ying Yan (1), Janna Hastings (2,3), Jee-Hyub Kim (1), Stefan Schulz (4), Christoph Steinbeck (2), Dietrich Rebholz-Schuhmann (1). Affiliations: (1) Text Mining, European Bioinformatics Institute, UK; (2) Chemoinformatics and Metabolism, European Bioinformatics Institute, UK; (3) Swiss Centre for Affective Sciences, University of Geneva, Switzerland; (4) Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.
  • 2. Bioactivity is what small molecules do in biological systems. Small molecules bind to receptors. A biochemical pathway is altered. On a macro scale, a phenotypic effect is observed. Tuesday, July 26, 2011. Multiple Ontologies for Small Molecule Bioactivity – WoMBO 2011
  • 3. ChEBI is an ontology of small molecules and their properties. [Figure: excerpt of the ChEBI Ontology. Classes shown: chemical entity, role, biological role, chemical substance, molecular entity, application, chemical role, group, carbonyl compound, pharmaceutical, solvent, carboxy group, carboxylic acid, antibacterial drug, cyclooxygenase inhibitor. Relations shown: has part, has role. Example entity: cefpodoxime (CHEBI:606443).]
  • 4. ChEBI role assertions are sparse. [Figure labels: Chemical entities (26000); Chemical entities mapped to roles (3000); Mapped roles (600); Roles; relation: has role.]
  • 5. Bioactivity is reported in the scientific literature. “Resveratrol inhibits cyclooxygenase-2 transcription and activity in phorbol ester-treated human mammary epithelial cells”. “Curcumin inhibits cyclooxygenase-2 transcription in bile acid- and phorbol ester-treated human gastrointestinal epithelial cells”.
  • 6. ChEBI bioactivities are pre-coordinated.
  • 7. Bioactivity refers to multiple semantic types: enzymes / proteins in general; biological processes; cellular or anatomical locations; organism type.
  • 8. The language of bioactivity. Trigger words: inhibitor, activator, modulator, agonist, antagonist, regulator, suppressor, adaptor, stimulator, toxin, factor, messenger, blocker. Relation extraction between chemical and target uses the trigger words as features.
  • 9. Targets and types of interaction. In “beta-adrenergic receptor inhibitor”, “beta-adrenergic receptor” is the target and “inhibitor” is the type of interaction.
  • 10. Several syntactical structures. Noun phrase or adjective/adverb composition: Kinase suppressor, HIV transcriptase inhibitor. Prepositional phrase modifier: Suppressor of fused protein, Oct-1 CoActivator in S phase protein. Verb phrase as noun phrase modifier: Carbonic-anhydrase inhibitors causing adverse effects in therapeutic use. Relative clause as modifier: Factor that binds to inducer of short transcripts protein 1.
  • 11. Text mining approach. Steps: syntactic parsing; chemical tagging (Oscar, Jochem); named entity recognition (UniProtKB, Organ, Organisms and GO Biological Process); target disambiguation (nested types); pruning ‘noisy’ results using rules. Source: MEDLINE abstracts.
  • 12. Pruning out noise. Largest challenges: difficulty in small molecule term recognition; small molecule – protein disambiguation. Remove triples from the candidate list when the putative small molecule term: is a role term according to ChEBI (e.g. antibiotic); has the suffix -ase (normally enzyme names); has fewer than three characters.
  • 13. Results: distribution (feature/target).
  • 14. Organ and Organism: Target vs. Location. Organ and organism often provide contextual/locational information. However, there are some true positives (as bioactivity targets). Examples: “Caesium ion antagonism to chlorpromazine- and L-dopa-produced behavioural depression in mice.”; Bothrops jararaca inhibitor; thyroid stimulator.
  • 15. Noise. On the other hand, … “Influence of peritoneal dialysis on factors affecting oxygen transport…”; “Without influence on WDS were: hysotigmine, atropine …”; “The cellulase component was not markedly inhibited by …”. Questions raised on the slide: body part? species? bioactive?
  • 16. Tagging chemicals. Jochem – dictionary-based approach: better precision, lower recall. Oscar3 – machine learning approach: better recall, much more noise.
  • 17. The ontology of bioactivity. [Figure labels: chemical entity, bioactivity, Target, Organ, Organism, Macromolecule, Biological process. Relations: has_role, has_target, is_a.]
  • 18. Macromolecules. m1 is a beta-adrenergic receptor inhibitor: m1 subclassOf bearer of some (realized by only (Inhibition and (has target some BetaAdrenergicReceptor)))
  • 19. Biological processes. m2 is a mitosis stimulator: m2 subclassOf bearer of some (realized by only (Stimulation and (has target some (participant of some Mitosis))))
  • 20. Organ as target. m3 is a thyroid stimulator: m3 subclassOf bearer of some (realized by only (Stimulation and (has target some (has locus some ThyroidGland))))
  • 21. Species as definitional constraint. m4 is a mouse thyroid stimulator: m4 subclassOf bearer of some (realized by only (Stimulation and (has target some (has locus some (ThyroidGland and part of some Mouse)))))
  • 22. Contextual vs. Definitional. Organisms, organs and body parts appear frequently as contextual, locational modifiers for bioactivities. In these cases, the above formalism is too strict. We therefore introduce an additional relationship: has context, between a bioactivity and an organism, organ or body part. Non-definitional: the bioactivity can take place in many organisms, but was discovered through investigations in one organism.
  • 23. Relating context to chemical-bioactivity associations. Context applies not to bioactivity alone but to small molecule – bioactivity associations (i.e. a ternary relationship).
  • 24. Next-generation curation tools. Text mining support for the human curation knowledge discovery effort. Multiple ontology-based reasoning for automated consistency checking and error detection.
  • 25. Conclusions. Language model for extracting small molecule bioactivity information from text. Ontology model for accurately representing such information, and allowing automated reasoning across ontologies from chemicals to their targets.
  • 26. Future work. Gold standard for chemical bioactivity in text, to be used to evaluate our approach and to train machine learning tools. Extending the relationship extraction approach to include chemical roles, applications and structural relationships.
  • 27. Acknowledgements. Thanks: Colin Batchelor (RSC), Adam Bernard (EBI). Funding: BBSRC, grant agreement number BB/G022747/1 within the "Bioinformatics and biological resources" fund.
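
The three pruning rules on slide 12 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Triple` type, the function names and the tiny `CHEBI_ROLE_TERMS` set are hypothetical stand-ins (a real pipeline would look role terms up in the ChEBI role ontology), and the slides do not show the authors' actual implementation.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """Hypothetical candidate triple: (small molecule, trigger word, target)."""
    chemical: str  # putative small molecule term
    trigger: str   # head word, e.g. "inhibitor"
    target: str    # tagged target, e.g. a protein name


# Assumption: a tiny stand-in for the set of ChEBI role terms.
CHEBI_ROLE_TERMS = {"antibiotic", "antibacterial drug", "solvent"}


def is_noise(term: str) -> bool:
    """Apply the three slide-12 rules to a putative small molecule term."""
    t = term.strip().lower()
    return (
        t in CHEBI_ROLE_TERMS   # rule 1: a role term, not a molecule
        or t.endswith("ase")    # rule 2: suffix -ase, normally an enzyme name
        or len(t) < 3           # rule 3: fewer than three characters
    )


def prune(triples: List[Triple]) -> List[Triple]:
    """Keep only triples whose chemical term passes all three rules."""
    return [t for t in triples if not is_noise(t.chemical)]


if __name__ == "__main__":
    candidates = [
        Triple("resveratrol", "inhibitor", "cyclooxygenase-2"),
        Triple("antibiotic", "inhibitor", "cyclooxygenase-2"),
        Triple("kinase", "suppressor", "Ras"),
        Triple("Ca", "blocker", "channel"),
    ]
    for kept in prune(candidates):
        print(kept)
```

The rules are deliberately crude: the -ase test, for instance, would also discard any genuine small molecule whose name happens to end in those letters, which is consistent with the slides describing them as noise pruning rather than disambiguation.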

Editor's Notes

  1. 30 minutes  ~25 slides @1 minute per slide.
  2. Bioactivity comprises the total effect which a small molecule has in a biological system. Bioactivities are the active (realizable) properties of the molecule. They operate at the molecular level of granularity, yet their effect is observed at the macro level of granularity as a phenotypic effect. Bioactive molecules can have positive effects, such as repressing the development of disease, or they can have negative (toxic) effects, leading to illness or even death. The differentiation of bioactive molecules from non-bioactive molecules is one of the core requirements for in silico drug discovery approaches [11], as is delineating molecules which share similar activity profiles [9].
  3. Put the usual ChEBI picture and talk around it. ChEBI is manually curated. Chemicals are given a structure-based classification and assigned with the has_role relationship to the role ontology. Bioactivity as we have defined it loosely corresponds to the biological role branch of the ChEBI role ontology. The additional roles which do not correspond to our bioactivity definition are being ignored for the purposes of this paper.
  4. Just under 3000 chemical entities are mapped to just under 500 roles, so many chemical entities are not adequately described in terms of their biological context. Also, ChEBI roles are not explicitly linked (through OWL intersections or OBO cross-products) to
  5. Importantly, this is an example of relationship extraction from the scientific literature. We are looking for a special kind of association between a chemical and a biological entity. It is not an example of named entity recognition alone.
  6. We wanted to classify bioactivity terms by the semantic type they belonged to. This led to challenges in that there were many examples of nested types. For example, to formalise a description of enzymatic inhibitor activity requires reference to the enzyme which is being inhibited; to formalise participation in a particular biological process requires reference to the process; and bioactivity descriptions may require reference to the exact location of the activity and the organism within which, or against which, the activity took place.
  7. We first defined a language model for bioactivity terminology based on the examination of relevant portions of the Metathesaurus of the Unified Medical Language System (UMLS) [1] and the ChEBI biological roles. The model is given a set of language features: “inhibitor” and “activator”, “modulator”, “agonist” and “antagonist”, “toxin”, “regulator”, “suppressor”, “adaptor”, “stimulator”, “factor”, “messenger” and “blocker”; these will be called trigger words.
  8. Ideally, the modifier phrase (<modifier>) is constituted by one or more tokens which denote the target of the bioactivity, whereas the head word specifies the nature of the interaction between the small molecule and the target. For example, ‘beta-adrenergic receptor inhibitor’ has as modifier ‘beta-adrenergic receptor’ (the target) and as head word ‘inhibitor’ (the nature of the interaction is inhibition).
  9. In Step 4, when we encountered nested types, we retained the tag in the last position within the modifier, ignoring other tags.
  10. The largest challenges faced from a practical side on the named entity recognition
  11. Table 1: ordering by target type and feature. Most common: proteins.
  12. Manual examination of the results revealed that organ and organism most commonly appear as locational or contextual modifiers rather than directly as targets. Disambiguating these two scenarios is not obvious.
  13. In particular we found it very difficult to get Oscar to distinguish chemical names from protein names. Oscar3 yields many more triples than Jochem does. This is expected, since Oscar3 recognises any chemical-like string. However, Oscar3's approach also results in a considerable number of false positives due to its recognition of chemical-like nomenclature appearing as a component in larger strings (such as protein names). Furthermore, we can observe a smaller number of triples identified by UniProtKB and Oscar3 compared to the set identified by UniProtKB and Jochem. This is because Oscar3 produces annotations that nest within a protein mention in the sentence and thus lowers the subsequent annotation of protein mentions. Jochem performs more long-form matching than Oscar3 does, therefore the following protein identification has a higher likelihood of identifying a protein term within the sentence, hence yielding a greater number of triples.
  14. Formal ontology of bioactivity: explicit link from bioactivity to the target of the bioactivity. We already have different types of bioactivity in ChEBI. Based on our analysis of bioactivity phrases in the literature, we have identified macromolecules and biological processes as the most common types of targets for the bioactivity of small molecules. We could therefore introduce a has target relationship to relate a bioactivity description to either a macromolecule or a biological process. However, strictly speaking, the range of the has target relationship should be restricted to those entities with which the chemical entity can physically interact: macromolecules. We can assume that biological processes are mentioned where the exact macromolecular target is unknown. In the same way, anatomical or subcellular locations may be mentioned when the exact target is unknown.
  15. Still something missing in this, which is the implicit claim that the mitosis process itself is “stimulated”, i.e. probably either enabled or made faster, by the presence of the molecule in question
  16. Importantly, we are not proposing to pre-populate ChEBI from text-mining results. There is far too much noise in the data for that to work out. Rather, we are proposing the development of enhanced curation tools which support the work of the human curators.
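
Notes 7 to 9 describe how a bioactivity phrase is split into a modifier (the target) and a head trigger word (the type of interaction), and how nested tags in the modifier are resolved by keeping the last one. A minimal Python sketch of that idea follows. It covers only the simplest noun-phrase pattern, with the trigger word as the final token; the function names, the `Tag` type and the plural handling are assumptions made for illustration and are not taken from the authors' system.

```python
from typing import List, NamedTuple, Optional, Tuple

# Trigger words as listed in note 7.
TRIGGER_WORDS = {
    "inhibitor", "activator", "modulator", "agonist", "antagonist",
    "toxin", "regulator", "suppressor", "adaptor", "stimulator",
    "factor", "messenger", "blocker",
}


def normalise_head(token: str) -> Optional[str]:
    """Return the trigger word for a token, accepting a simple plural 's'."""
    word = token.lower()
    if word in TRIGGER_WORDS:
        return word
    if word.endswith("s") and word[:-1] in TRIGGER_WORDS:
        return word[:-1]
    return None


def split_phrase(phrase: str) -> Optional[Tuple[str, str]]:
    """Split '<modifier> <head>' into (target text, trigger word)."""
    tokens = phrase.split()
    if len(tokens) < 2:
        return None
    head = normalise_head(tokens[-1])
    if head is None:
        return None
    return " ".join(tokens[:-1]), head


class Tag(NamedTuple):
    """Hypothetical entity tag over modifier tokens [start, end)."""
    start: int
    end: int
    semantic_type: str


def resolve_nested(tags: List[Tag]) -> Optional[Tag]:
    """Keep the tag in the last position within the modifier (note 9).

    Ties on the end position go to the first tag listed (an assumption).
    """
    if not tags:
        return None
    return max(tags, key=lambda t: t.end)


if __name__ == "__main__":
    print(split_phrase("beta-adrenergic receptor inhibitor"))
    # Modifier "mouse thyroid": an Organism tag followed by an Organ tag.
    tags = [Tag(0, 1, "Organism"), Tag(1, 2, "Organ")]
    print(resolve_nested(tags))
```

Under the last-position rule, “mouse thyroid stimulator” is typed by its organ rather than its organism, which matches the way slides 20 and 21 treat the thyroid gland as the locus and the mouse as an additional constraint.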