SlideShare uma empresa Scribd logo
1 de 21
Representing chemicalsusing OWL, Description Graphsand Rules Janna Hastings, EBI, UKMichel Dumontier, Carleton University, CanadaDuncan Hull, EBI, UKMatthew Horridge, Manchester, UKChristoph Steinbeck, EBI, UKUlrike Sattler, Manchester, UKRobert Stevens, Manchester, UKTertia Hӧrne, University of South AfricaKatarina Britz, Meraka Institute, South Africa OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Problem We wish to represent and reason over structured objects 		i.e. their representation contains 	also their parts OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Chemical structures single bond Carbon atom Hydrogen atom double bond Nitrogen atom Oxygen atom caffeineMoleculesconsist of atomsconnected by bonds OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Chemical ontology Chemical ontology consists of chemical classes which can be defined by parts of structuresand/or properties of structures carboxylic acid if molecule has part some carboxy group cyclic molecule if molecule has property cyclic, i.e. a self-connectedcyclic path exists through the molecule’s atoms OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
OWL representation Without structure, all parts must be explicitly asserted(combinatorial explosion for larger molecules) But the structure of complex molecules breaks the OWL Tree Model requirement does not have a model in the shape of a tree OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Description Graphs A recent, decidable extension to OWL 2, allowing expression of complex structures as graphs within the ontology A description graph consists of a set of labelled vertices and a set of directed edges Each description graph has a main class which links the graph to the main OWL ontology OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Strong separation In order to preserve decidability of knowledge bases enriched with description graphs, 	atomic properties used as graph edges have to be different to those used in axioms in the main OWL ontology This is known as the strong separation requirement OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Rules Enhance OWL with the capacity to express if – then constructions Consist of ‘antecedent’ (if conditions) and ‘consequent’ (then result) Antecedent and consequent are composed of conjunctions of atomic statements OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Goal Can we represent chemical structures using OWL and Description Graphs? Can we reason over the information encoded in chemical structures using OWL, Description Graphs and Rules? OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
OWL ontology OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Chemical description graphs Generated based on structures converted from a chemical database (ChEBI) OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Rules Generated for properties, e.g. being cyclic molecule(?x), atom(?a1), atom (?a2), atom(?a3), atom(?a4), bond(?b1), bond (?b2), bond(?b3), bond (?b4), has_atom(?x, ?a1), has_atom(?x, ?a2), has_atom(?x, ?a3), has_atom(?x, ?a4),has_bond(?a1, ?b1), has_bond(?a1, ?b4), has_bond(?a2, ?b1), has_bond(?a2, ?b2),has_bond(?a3, ?b2), has_bond(?a3, ?b3), has_bond(?a4, ?b3), has_bond(?a4, ?b4)-> cyclic_entity(?x) cyclobutane tetrahedrane OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Rules Generated for parthood, e.g. carboxylic acid molecule(?y), atom(?a0), oxygen_atom(?a1), carbon_atom(?a2), oxygen_atom (?a3), has_atom(?y, ?a0), has_atom (?y, ?a1), has_atom (?y, ?a2), has_atom (?y, ?a3), double_bond(?b0), single_bond (?b1), single_bond (?b2), has_bond(?a0, ?b2), has_bond(?a1, ?b1), has_bond(?a2, ?b0), has_bond(?a2, ?b1), has_bond(?a2, ?b2), has_bond(?a3, ?b0)                                      ->  carboxylic_acid(?y) carboxylic acid benzoicacid benzoic acid	has this part		so: is a carboxylic acid OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Testing the reasoning Can we use a reasoner to deduce the classification hierarchy based on the graphs and rules?  No asserted hierarchy between test classes and molecules with generated graphs OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Results Inferred hierarchy shows classified molecules OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Testing the performance How many molecules (description graphs) can we include in our knowledge base? How does the reasoning task (classification) scale with respect to the number of graphs, both with and without rules in the knowledge base? OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Results OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Experiences Difficult to debug Tools support needs to be improved Difficult to construct rules for properties which depend on all atoms or all bonds in a given molecule 	e.g. saturated -> all bonds in molecule are single OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Conclusion Using OWL, Description Graphs and Rules we can represent chemical structures at the class level in our knowledge base and reason over the structural information Scalability of the reasoning with the rules is a concern OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
Acknowledgements Special thanks to KirillDegtyarenko, Stefan Schulz, Colin Batchelor, BirteGlimm and the ChEBI team Funding: Meraka Institute, South Africa; BBSRC (BB/G022747/1); NSERC Discovery Grant OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules

Mais conteúdo relacionado

Destaque

Going the extra mile in designing for the web
Going the extra mile in designing for the webGoing the extra mile in designing for the web
Going the extra mile in designing for the web
Home
 
Design Thinking in EFL Context
Design Thinking in EFL ContextDesign Thinking in EFL Context
Design Thinking in EFL Context
Debopriyo Roy
 
Wicked Wiki
Wicked WikiWicked Wiki
Wicked Wiki
jactlc
 
Resources
ResourcesResources
Resources
jactlc
 

Destaque (15)

AtticTV Pte. Ltd. Strategy Slide
AtticTV Pte. Ltd. Strategy SlideAtticTV Pte. Ltd. Strategy Slide
AtticTV Pte. Ltd. Strategy Slide
 
Going the extra mile in designing for the web
Going the extra mile in designing for the webGoing the extra mile in designing for the web
Going the extra mile in designing for the web
 
GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.
 
Los gansos
Los gansosLos gansos
Los gansos
 
Real World Applications of OWL
Real World Applications of OWLReal World Applications of OWL
Real World Applications of OWL
 
Design Thinking in EFL Context
Design Thinking in EFL ContextDesign Thinking in EFL Context
Design Thinking in EFL Context
 
Wicked Wiki
Wicked WikiWicked Wiki
Wicked Wiki
 
Russian & English: Nastas Ltrs of Confirmation-Contributions
Russian & English: Nastas Ltrs of Confirmation-ContributionsRussian & English: Nastas Ltrs of Confirmation-Contributions
Russian & English: Nastas Ltrs of Confirmation-Contributions
 
Convegno Polizie Locali Sicurezza Stradale Riccione 09 09
Convegno Polizie Locali Sicurezza Stradale Riccione 09 09Convegno Polizie Locali Sicurezza Stradale Riccione 09 09
Convegno Polizie Locali Sicurezza Stradale Riccione 09 09
 
Cassass presentation WLF2 Rome
Cassass presentation WLF2 RomeCassass presentation WLF2 Rome
Cassass presentation WLF2 Rome
 
Resources
ResourcesResources
Resources
 
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy ResourcesLine Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
 
Connexions Roy 2013
Connexions Roy 2013Connexions Roy 2013
Connexions Roy 2013
 
SP3 찬송가 옵션 작업후기 (Logic Audio)
SP3 찬송가 옵션 작업후기 (Logic Audio)SP3 찬송가 옵션 작업후기 (Logic Audio)
SP3 찬송가 옵션 작업후기 (Logic Audio)
 
Proiect 1000 - Stefan Szakal
Proiect 1000 - Stefan SzakalProiect 1000 - Stefan Szakal
Proiect 1000 - Stefan Szakal
 

Semelhante a Representing chemicals using OWL, Description Graphs and Rules

Perspective of pharmaceutical molecular design
Perspective of pharmaceutical molecular designPerspective of pharmaceutical molecular design
Perspective of pharmaceutical molecular design
Peter Kenny
 
Harvey-2015-Control of ozonolysi
Harvey-2015-Control of ozonolysiHarvey-2015-Control of ozonolysi
Harvey-2015-Control of ozonolysi
Rebecca Harvey
 
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
Michel Dumontier
 
Formal representation of models in systems biology
Formal representation of models in systems biologyFormal representation of models in systems biology
Formal representation of models in systems biology
Michel Dumontier
 

Semelhante a Representing chemicals using OWL, Description Graphs and Rules (20)

Representing the ChEBI ontology in OWL
Representing the ChEBI ontology in OWLRepresenting the ChEBI ontology in OWL
Representing the ChEBI ontology in OWL
 
Automatic classification in ChEBI
Automatic classification in ChEBIAutomatic classification in ChEBI
Automatic classification in ChEBI
 
Relational Patterns in OWL and their application to OBO
Relational Patterns in OWL and their application to OBORelational Patterns in OWL and their application to OBO
Relational Patterns in OWL and their application to OBO
 
20130724 cisrg sugars_batchelor
20130724 cisrg sugars_batchelor20130724 cisrg sugars_batchelor
20130724 cisrg sugars_batchelor
 
Perspective of pharmaceutical molecular design
Perspective of pharmaceutical molecular designPerspective of pharmaceutical molecular design
Perspective of pharmaceutical molecular design
 
Harvey-2015-Control of ozonolysi
Harvey-2015-Control of ozonolysiHarvey-2015-Control of ozonolysi
Harvey-2015-Control of ozonolysi
 
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, a...
 
II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network" II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network"
 
ISES 2013 - Day 2 - Professor John M. Dhaw (Professor, University of Albert...
ISES 2013   - Day 2 - Professor John M. Dhaw (Professor, University of Albert...ISES 2013   - Day 2 - Professor John M. Dhaw (Professor, University of Albert...
ISES 2013 - Day 2 - Professor John M. Dhaw (Professor, University of Albert...
 
ISES 2013 - Professor John M. Shaw (Professor, University of Alberta) - Ener...
ISES 2013  - Professor John M. Shaw (Professor, University of Alberta) - Ener...ISES 2013  - Professor John M. Shaw (Professor, University of Alberta) - Ener...
ISES 2013 - Professor John M. Shaw (Professor, University of Alberta) - Ener...
 
An Objective Alternative To IUPAC S Approach To Assign Oxidation States
An Objective Alternative To IUPAC S Approach To Assign Oxidation StatesAn Objective Alternative To IUPAC S Approach To Assign Oxidation States
An Objective Alternative To IUPAC S Approach To Assign Oxidation States
 
Macromolecules or Big Small Molecules? Handling biopolymers in a chemical reg...
Macromolecules or Big Small Molecules? Handling biopolymers in a chemical reg...Macromolecules or Big Small Molecules? Handling biopolymers in a chemical reg...
Macromolecules or Big Small Molecules? Handling biopolymers in a chemical reg...
 
Applying Computer Assisted Structure Elucidation Algorithms For The Purpose O...
Applying Computer Assisted Structure Elucidation Algorithms For The Purpose O...Applying Computer Assisted Structure Elucidation Algorithms For The Purpose O...
Applying Computer Assisted Structure Elucidation Algorithms For The Purpose O...
 
Revisiting the nmr assignments of hexacyclinol
Revisiting the nmr assignments of hexacyclinolRevisiting the nmr assignments of hexacyclinol
Revisiting the nmr assignments of hexacyclinol
 
About chemistry pdf
About chemistry pdfAbout chemistry pdf
About chemistry pdf
 
Oct 2011 ualr
Oct 2011 ualrOct 2011 ualr
Oct 2011 ualr
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
 
Formal representation of models in systems biology
Formal representation of models in systems biologyFormal representation of models in systems biology
Formal representation of models in systems biology
 
MCAT Syllabus by UHS for 2016
MCAT Syllabus by UHS for 2016MCAT Syllabus by UHS for 2016
MCAT Syllabus by UHS for 2016
 
Utilizing open source software to facilitate communication of chemistry at rsc
Utilizing open source software to facilitate communication of chemistry at rscUtilizing open source software to facilitate communication of chemistry at rsc
Utilizing open source software to facilitate communication of chemistry at rsc
 

Mais de Michel Dumontier

CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
Michel Dumontier
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
Michel Dumontier
 

Mais de Michel Dumontier (20)

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health System
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 

Representing chemicals using OWL, Description Graphs and Rules

  • 1. Representing chemicalsusing OWL, Description Graphsand Rules Janna Hastings, EBI, UKMichel Dumontier, Carleton University, CanadaDuncan Hull, EBI, UKMatthew Horridge, Manchester, UKChristoph Steinbeck, EBI, UKUlrike Sattler, Manchester, UKRobert Stevens, Manchester, UKTertia Hӧrne, University of South AfricaKatarina Britz, Meraka Institute, South Africa OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 2. OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 3. Problem We wish to represent and reason over structured objects i.e. their representation contains also their parts OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 4. Chemical structures single bond Carbon atom Hydrogen atom double bond Nitrogen atom Oxygen atom caffeineMoleculesconsist of atomsconnected by bonds OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 5. Chemical ontology Chemical ontology consists of chemical classes which can be defined by parts of structuresand/or properties of structures carboxylic acid if molecule has part some carboxy group cyclic molecule if molecule has property cyclic, i.e. a self-connectedcyclic path exists through the molecule’s atoms OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 6. OWL representation Without structure, all parts must be explicitly asserted(combinatorial explosion for larger molecules) But the structure of complex molecules breaks the OWL Tree Model requirement does not have a model in the shape of a tree OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 7. Description Graphs A recent, decidable extension to OWL 2, allowing expression of complex structures as graphs within the ontology A description graph consists of a set of labelled vertices and a set of directed edges Each description graph has a main class which links the graph to the main OWL ontology OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 8. Strong separation In order to preserve decidability of knowledge bases enriched with description graphs, atomic properties used as graph edges have to be different to those used in axioms in the main OWL ontology This is known as the strong separation requirement OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 9. Rules Enhance OWL with the capacity to express if – then constructions Consist of ‘antecedent’ (if conditions) and ‘consequent’ (then result) Antecedent and consequent are composed of conjunctions of atomic statements OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 10. Goal Can we represent chemical structures using OWL and Description Graphs? Can we reason over the information encoded in chemical structures using OWL, Description Graphs and Rules? OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 11. OWL ontology OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 12. Chemical description graphs Generated based on structures converted from a chemical database (ChEBI) OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 13. Rules Generated for properties, e.g. being cyclic molecule(?x), atom(?a1), atom (?a2), atom(?a3), atom(?a4), bond(?b1), bond (?b2), bond(?b3), bond (?b4), has_atom(?x, ?a1), has_atom(?x, ?a2), has_atom(?x, ?a3), has_atom(?x, ?a4),has_bond(?a1, ?b1), has_bond(?a1, ?b4), has_bond(?a2, ?b1), has_bond(?a2, ?b2),has_bond(?a3, ?b2), has_bond(?a3, ?b3), has_bond(?a4, ?b3), has_bond(?a4, ?b4)-> cyclic_entity(?x) cyclobutane tetrahedrane OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 14. Rules Generated for parthood, e.g. carboxylic acid molecule(?y), atom(?a0), oxygen_atom(?a1), carbon_atom(?a2), oxygen_atom (?a3), has_atom(?y, ?a0), has_atom (?y, ?a1), has_atom (?y, ?a2), has_atom (?y, ?a3), double_bond(?b0), single_bond (?b1), single_bond (?b2), has_bond(?a0, ?b2), has_bond(?a1, ?b1), has_bond(?a2, ?b0), has_bond(?a2, ?b1), has_bond(?a2, ?b2), has_bond(?a3, ?b0) -> carboxylic_acid(?y) carboxylic acid benzoicacid benzoic acid has this part so: is a carboxylic acid OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 15. Testing the reasoning Can we use a reasoner to deduce the classification hierarchy based on the graphs and rules? No asserted hierarchy between test classes and molecules with generated graphs OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 16. Results Inferred hierarchy shows classified molecules OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 17. Testing the performance How many molecules (description graphs) can we include in our knowledge base? How does the reasoning task (classification) scale with respect to the number of graphs, both with and without rules in the knowledge base? OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 18. Results OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 19. Experiences Difficult to debug Tools support needs to be improved Difficult to construct rules for properties which depend on all atoms or all bonds in a given molecule e.g. saturated -> all bonds in molecule are single OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 20. Conclusion Using OWL, Description Graphs and Rules we can represent chemical structures at the class level in our knowledge base and reason over the structural information Scalability of the reasoning with the rules is a concern OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules
  • 21. Acknowledgements Special thanks to KirillDegtyarenko, Stefan Schulz, Colin Batchelor, BirteGlimm and the ChEBI team Funding: Meraka Institute, South Africa; BBSRC (BB/G022747/1); NSERC Discovery Grant OWLED2010:San Francisco:Representing chemicals using OWL, Description Graphs and Rules