SlideShare uma empresa Scribd logo
1 de 29
oreChem: Planning and
Enacting Chemistry on the
Semantic Web
Microsoft Research eScience Workshop 2010
Berkeley, CA USA
Mark Borkum, Simon Coles and Jeremy Frey
12 October 2010
Overview
• Introduction
• Ontology
• Case Study: X-ray Crystallography
• Future Work
• Summary
2
The Scientific Method
• A systematic process
for knowledge
acquisition
• Becoming increasingly
data-intensive
Planning
Enactment
Analysis
Publication
3
The Data Deluge
4
• In Haiku:
– Lots of producers;
Generating more data
than ever before.
• 40 years ago, a PhD
student would
determine 3 structures
over the entire course
of their study!
The Great Wave off Kanagawa by Katsushika Hokusai
The Scientific Method (on the Web)
5
Provenance (The Elephant in the Room)
• The 7 W’s [Goble 2002]
– Who, What, Where,
Why, When, Which, &
(W)How
• The Why aspect is
often ignored 
6
Why
Planning
Who
Authorship
What &
(W)How
Enactment
Where & When
Annotations
The oreChem Project
• Funded by Microsoft
Research
• Investigating the design and
deployment of a semantic-
based eScience infrastructure
for Chemistry
• Project website:
– http://research.microsoft.com/
en-us/projects/orechem/
7
Why
Planning
Who
Authorship
What &
(W)How
Enactment
Where & When
Annotations
oreChem
Dublin Core, FOAF, SIOC, OWL Time, GeoNames, etc…
oreChem Core Ontology
8
Planning
• Prospective provenance
• Describes a scientific
experiment that will be
enacted (in the future)
• Three entity types:
– Plan
– Plan Stage
– Plan Object
9
Enactment
• Retrospective provenance
• Describes a scientific
experiment that was
enacted
• Three entity types:
– Run
– Stage
– Object
10
“In theory, there is no difference
between theory and practice.
But, in practice, there is.”
Unknown (possibly Yogi Berra)
Realisation (is not Instantiation)
• Each ‘run thing’ is
linked to zero or one
‘plan thing’
– Deviation from the plan
is allowed
12
X-RAY CRYSTALLOGRAPHY
Case Study
13
Current Practice in Crystallography
• Crystallography data is
highly structured
– The de facto standard
adopted by the
community is the CIF
(Crystallographic
Information File)
• Relatively few crystal
structures are openly
available online
14
http://www.rin.ac.uk/our-work/data-management-and-
curation/share-or-not-share-research-data-outputs
Crystallography and Fraud
15
The eCrystals Federation
• JISC project
• Network of
crystallography
resources
• All published records
are available as
Open Data
• Based on EPrints
repository 16
http://ecrystals.chem.soton.ac.uk/
eCrystal #20
• Each eCrystals record
contains:
– Bibliographic metadata
– Fundamental and
derived data (excluding
raw images)
– Final structure solution
17
Single Crystal Structure Determination
18
1. Take powder
specimen of chemical
substance
2. Measure diffraction of
X-rays
3. Compute electron
densities
4. Solve for crystal
structure
oreChem Plan for eCrystals
• Machine-readable
representation of
methodology
• Describes requirements
for software and data
products
• Available online at:
– http://ecrystals.chem.soton.
ac.uk/plan.rdf
19
oreChem Run for eCrystal #20
• Exported by “oreChem”
plug-in for EPrints 3.1
– RDF/XML serialisation
– Uses SWRL rules to infer
causal relationships
• Describes:
– Software
– Data products
20
http://ecrystals.chem.soton.ac.uk/cgi/export/20/ORE_Chem/ecry
stals-eprint-20.xml?include_xsl=1
Retrospective Provenance
Graphs for eCrystal #20
Stages and Objects Objects
21
used (dashed)
emitted (solid)
derivedFrom (solid)
used(?s, ?o1) & emitted(?s, ?o2)
 derivedFrom(?o2, ?o1)
Crystallography and Fraud – SPARQL
PREFIX orechem: <http://www.openarchives.org/2010/05/24-orechem-ns#>
PREFIX ecrystals: <http://ecrystals.chem.soton.ac.uk/plan.rdf#>
SELECT ?run ?raw ?derived ?reported
WHERE {
?run a orechem:Run ;
orechem:hasPlan ecrystals:Ecrystals ;
orechem:containsObject ?raw ;
orechem:containsObject ?derived ;
orechem:containsObject ?reported .
?raw a orechem:File ;
orechem:hasPlanObject ecrystals:HKL .
?derived a orechem:File ;
orechem:derivedFrom ?raw .
?reported a orechem:File ;
orechem:hasPlanObject ecrystals:CIF ;
orechem:derivedFrom ?derived .
}
22
Crystallography and Fraud – SPARQL (2)
23
Crystallography and Fraud – SPARQL (3)
24
?run ?raw
?reported
?derived
http://ecrystals.chem.soton.ac.uk/cgi/export/20/ORE_Chem/ecry
stals-eprint-20.xml?include_xsl=1
Crystallography and Fraud – SPARQL (4)
?run ?raw ?derived ?reported
_:eCrystal_20_Run 02sot126.hkl 02sot126.prp 02sot126.cif
_:eCrystal_20_Run 02sot126.hkl 02sot126.lst 02sot126.cif
_:eCrystal_20_Run 02sot126.hkl 02sot126.res 02sot126.cif
25
Future Work
• oreChem Core Ontology
– Support for conditionals and continuations
• oreChem Lower Ontology
– Specialised for Physical and Computational Chemistry
• Applications and Services
– oreChem Plan Designer and Enactor
– oreChem Run Inspector
26
Summary
• <summary/>
27
Acknowledgements
• Microsoft Research
– Tony Hey
– Lee Dirks
– Savas Parastatidis
– Alex Wade
• oreChem Project
– Carl Lagoze, Theresa Velden
– Jeremy Frey, Simon Coles
– Peter Murray-Rust, Nick
Day, Jim Downing
– C. Lee Giles, Prasenjit Mitra,
William Brouwer, Na Li
– Marlon Pierce, Sashi Kiran
Challa
28
Thank You
• Questions?
29

Mais conteúdo relacionado

Mais procurados

Royal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesRoyal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesValery Tkachenko
 
Troy_Williams__Resume
Troy_Williams__ResumeTroy_Williams__Resume
Troy_Williams__ResumeTroy Williams
 
Opportunities in chemical structure standardization
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardizationValery Tkachenko
 
The application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsThe application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsValery Tkachenko
 
GeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 PresentationGeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 PresentationCameron Kiddle
 
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationStuart Chalk
 
Semantic data integration proof of concept
Semantic data integration proof of conceptSemantic data integration proof of concept
Semantic data integration proof of conceptNicolas Bertrand
 

Mais procurados (11)

Royal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesRoyal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and libraries
 
Troy_Williams__Resume
Troy_Williams__ResumeTroy_Williams__Resume
Troy_Williams__Resume
 
Opportunities in chemical structure standardization
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardization
 
The application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsThe application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platforms
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
GeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 PresentationGeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 Presentation
 
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
Bh14 ogo
Bh14 ogoBh14 ogo
Bh14 ogo
 
Semantic data integration proof of concept
Semantic data integration proof of conceptSemantic data integration proof of concept
Semantic data integration proof of concept
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 

Semelhante a oreChem: Planning and Enacting Chemistry on the Semantic Web

Dealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data onlineDealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data onlineKen Karapetyan
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Anubhav Jain
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...Ricard de la Vega
 
Lessons Learned in Building Linked Data for the American Art Collaborative
Lessons Learned in Building Linked Data for the American Art CollaborativeLessons Learned in Building Linked Data for the American Art Collaborative
Lessons Learned in Building Linked Data for the American Art CollaborativeCraig Knoblock
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...Lisette Giepmans
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...Ardan Patwardhan
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 
Acceleration of XML Parsing through Prefetching
Acceleration of XML  Parsing through PrefetchingAcceleration of XML  Parsing through Prefetching
Acceleration of XML Parsing through PrefetchingRohit Deshpande
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Databricks
 
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation SlidesDuraSpace
 
On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsNina Jeliazkova
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College LondonTorsten Reimer
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesOCLC
 

Semelhante a oreChem: Planning and Enacting Chemistry on the Semantic Web (20)

Dealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data onlineDealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data online
 
Dealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data onlineDealing with the complex challenge of managing diverse chemistry data online
Dealing with the complex challenge of managing diverse chemistry data online
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...
 
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Toxico...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Toxico...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Toxico...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Toxico...
 
The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...
 
The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...The Catalan Research portal: collecting information from Catalan universities...
The Catalan Research portal: collecting information from Catalan universities...
 
Lessons Learned in Building Linked Data for the American Art Collaborative
Lessons Learned in Building Linked Data for the American Art CollaborativeLessons Learned in Building Linked Data for the American Art Collaborative
Lessons Learned in Building Linked Data for the American Art Collaborative
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...
BioSHaRE: Opal and Mica: a software suite for data harmonization and federati...
 
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
Acceleration of XML Parsing through Prefetching
Acceleration of XML  Parsing through PrefetchingAcceleration of XML  Parsing through Prefetching
Acceleration of XML Parsing through Prefetching
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
 
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
 
On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College London
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 

Último

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Último (20)

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

oreChem: Planning and Enacting Chemistry on the Semantic Web

  • 1. oreChem: Planning and Enacting Chemistry on the Semantic Web Microsoft Research eScience Workshop 2010 Berkeley, CA USA Mark Borkum, Simon Coles and Jeremy Frey 12 October 2010
  • 2. Overview • Introduction • Ontology • Case Study: X-ray Crystallography • Future Work • Summary 2
  • 3. The Scientific Method • A systematic process for knowledge acquisition • Becoming increasingly data-intensive Planning Enactment Analysis Publication 3
  • 4. The Data Deluge 4 • In Haiku: – Lots of producers; Generating more data than ever before. • 40 years ago, a PhD student would determine 3 structures over the entire course of their study! The Great Wave off Kanagawa by Katsushika Hokusai
  • 5. The Scientific Method (on the Web) 5
  • 6. Provenance (The Elephant in the Room) • The 7 W’s [Goble 2002] – Who, What, Where, Why, When, Which, & (W)How • The Why aspect is often ignored  6 Why Planning Who Authorship What & (W)How Enactment Where & When Annotations
  • 7. The oreChem Project • Funded by Microsoft Research • Investigating the design and deployment of a semantic- based eScience infrastructure for Chemistry • Project website: – http://research.microsoft.com/ en-us/projects/orechem/ 7 Why Planning Who Authorship What & (W)How Enactment Where & When Annotations oreChem Dublin Core, FOAF, SIOC, OWL Time, GeoNames, etc…
  • 9. Planning • Prospective provenance • Describes a scientific experiment that will be enacted (in the future) • Three entity types: – Plan – Plan Stage – Plan Object 9
  • 10. Enactment • Retrospective provenance • Describes a scientific experiment that was enacted • Three entity types: – Run – Stage – Object 10
  • 11. “In theory, there is no difference between theory and practice. But, in practice, there is.” Unknown (possibly Yogi Berra)
  • 12. Realisation (is not Instantiation) • Each ‘run thing’ is linked to zero or one ‘plan thing’ – Deviation from the plan is allowed 12
  • 14. Current Practice in Crystallography • Crystallography data is highly structured – The de facto standard adopted by the community is the CIF (Crystallographic Information File) • Relatively few crystal structures are openly available online 14 http://www.rin.ac.uk/our-work/data-management-and- curation/share-or-not-share-research-data-outputs
  • 16. The eCrystals Federation • JISC project • Network of crystallography resources • All published records are available as Open Data • Based on EPrints repository 16 http://ecrystals.chem.soton.ac.uk/
  • 17. eCrystal #20 • Each eCrystals record contains: – Bibliographic metadata – Fundamental and derived data (excluding raw images) – Final structure solution 17
  • 18. Single Crystal Structure Determination 18 1. Take powder specimen of chemical substance 2. Measure diffraction of X-rays 3. Compute electron densities 4. Solve for crystal structure
  • 19. oreChem Plan for eCrystals • Machine-readable representation of methodology • Describes requirements for software and data products • Available online at: – http://ecrystals.chem.soton. ac.uk/plan.rdf 19
  • 20. oreChem Run for eCrystal #20 • Exported by “oreChem” plug-in for EPrints 3.1 – RDF/XML serialisation – Uses SWRL rules to infer causal relationships • Describes: – Software – Data products 20 http://ecrystals.chem.soton.ac.uk/cgi/export/20/ORE_Chem/ecry stals-eprint-20.xml?include_xsl=1
  • 21. Retrospective Provenance Graphs for eCrystal #20 Stages and Objects Objects 21 used (dashed) emitted (solid) derivedFrom (solid) used(?s, ?o1) & emitted(?s, ?o2)  derivedFrom(?o2, ?o1)
  • 22. Crystallography and Fraud – SPARQL PREFIX orechem: <http://www.openarchives.org/2010/05/24-orechem-ns#> PREFIX ecrystals: <http://ecrystals.chem.soton.ac.uk/plan.rdf#> SELECT ?run ?raw ?derived ?reported WHERE { ?run a orechem:Run ; orechem:hasPlan ecrystals:Ecrystals ; orechem:containsObject ?raw ; orechem:containsObject ?derived ; orechem:containsObject ?reported . ?raw a orechem:File ; orechem:hasPlanObject ecrystals:HKL . ?derived a orechem:File ; orechem:derivedFrom ?raw . ?reported a orechem:File ; orechem:hasPlanObject ecrystals:CIF ; orechem:derivedFrom ?derived . } 22
  • 23. Crystallography and Fraud – SPARQL (2) 23
  • 24. Crystallography and Fraud – SPARQL (3) 24 ?run ?raw ?reported ?derived http://ecrystals.chem.soton.ac.uk/cgi/export/20/ORE_Chem/ecry stals-eprint-20.xml?include_xsl=1
  • 25. Crystallography and Fraud – SPARQL (4) ?run ?raw ?derived ?reported _:eCrystal_20_Run 02sot126.hkl 02sot126.prp 02sot126.cif _:eCrystal_20_Run 02sot126.hkl 02sot126.lst 02sot126.cif _:eCrystal_20_Run 02sot126.hkl 02sot126.res 02sot126.cif 25
  • 26. Future Work • oreChem Core Ontology – Support for conditionals and continuations • oreChem Lower Ontology – Specialised for Physical and Computational Chemistry • Applications and Services – oreChem Plan Designer and Enactor – oreChem Run Inspector 26
  • 28. Acknowledgements • Microsoft Research – Tony Hey – Lee Dirks – Savas Parastatidis – Alex Wade • oreChem Project – Carl Lagoze, Theresa Velden – Jeremy Frey, Simon Coles – Peter Murray-Rust, Nick Day, Jim Downing – C. Lee Giles, Prasenjit Mitra, William Brouwer, Na Li – Marlon Pierce, Sashi Kiran Challa 28