SlideShare uma empresa Scribd logo
1 de 21
The Chemical Analysis 
Metadata Platform (ChAMP): 
Thoughts and Ideas on the 
Semantic Identification of 
Analytical Metrics 
Stuart J. Chalk, Department of Chemistry, University of North Florida 
Antony Williams and Valery Tkachenko, RSC Cheminformatics 
schalk@unf.edu 
2014 Fall ACS Meeting
Overview 
 Initial Idea 
 Motivation 
 Why a Platform? 
 Pieces of the Puzzle 
 Existing Resources 
 What are the Most Important Metadata? 
 Minimum Information About a Chemical Analysis? 
 Different Perspectives on a Chemical Analysis 
 Example: Metadata Categories in XML 
 Example: Metadata Categories in JSON-LD 
 Planned Activities 
 Conclusion
Initial Idea 
 Develop a set of data standards for 
representation/annotation of chemical analysis 
information 
 Are there important characteristics (metadata) about 
analysis methodologies that, if captured, would add 
value to a resource? 
 Must be easy to implement 
 Must be useful across multiple disciplines
RSC Data Repository 
Compounds 
Widgets 
Compounds 
API 
Paid 3rd party integrations (various platforms – SharePoint, Google, etc) 
Reactions 
Widgets 
Reactions 
API 
Spectra 
Widgets 
Spectra 
API 
Materials 
Widgets 
Materials 
API 
Documents 
Widgets 
Documents 
API 
Compounds Reactions Spectra Materials Documents 
User 
interface 
components 
tier 
Data access 
tier 
Data tier 
Analytical Laboratory application 
User 
interface tier 
(examples) Electronic Laboratory Notebook 
Chemical Inventory application
Motivation 
 Access to knowledge in existing literature 
 Annotation of research in future publications 
 Annotation of unpublished/self published 
(but potentially useful) work 
 Annotation of data captured in ELN’s 
 Data ingest into the RSC’s Data Repository 
 Complements/enhances existing activities 
 The haystack is so big – we need to make it 
easy to make the needle show up
Why a Platform? 
 Develop it to be as broadly applicable as possible 
 Chemical analysis is a not tangible like a spectrum 
 Users have domain specific needs 
 Users has a favorite/required format to store 
information 
 SQL Relational Database 
 Excel Spreadsheet 
 XML, YAML 
 JSON or JSON-LD 
 ChAMP should define the types of metadata and 
general organization of the information, not the 
format it is stored in (this is like MIAME [1]) 
[1] http://www.mged.org/Workgroups/MIAME/miame.html
First Thoughts 
 Covers metadata for a chemical analysis methodology 
not raw analytical instrument data 
 Two main sections? 
 Fundamental method development 
 Method application 
 How big should the platform scope be? 
 What information is most important? 
 How do we get community involvement/buy-in?
Pieces of the Puzzle 
 Ontology of chemical analysis terms 
 Taxonomy of chemical analysis metadata 
 Controlled vocabularies for specific metadata items 
 Definitions of required metadata (in context) 
 Naming and design rules
Existing Resources 
 Ontologies 
 Chemical Methods Ontology (CMO) [2] 
 SemanticScience CHEMINF Ontology [3] 
 Chemical Entities of Biological Interest (ChEBI) [4] 
 “Ontology on Property” by René Dybkær [5] 
 Ontobee (ontology search) [6] 
[2] http://www.rsc.org/ontologies/CMO/ 
[3] https://code.google.com/p/semanticscience/ 
[4] http://www.ebi.ac.uk/chebi/ 
[5] http://ontology.iupac.org/ 
[6] http://www.ontobee.org/
Existing Resources 
 Controlled Vocabularies/Taxonomies 
 MESH [6] 
 LCSH [7] 
 CAS Subject Headings [8] 
 IUPAC Orange Book [9] 
 IUPAC Gold Book [10] 
 … do they address how to organize the metadata? 
[6] http://www.ncbi.nlm.nih.gov/mesh 
[7] http://id.loc.gov/authorities/subjects.html 
[8] http://cas.org 
[9] http://iupac.org/publications/analytical_compendium 
[10] http://goldbook.iupac.org/
 Other 
Existing Resources 
 JCAMP-DX [11] 
 Analytical Information Markup Language (AnIML) [12] 
 Units Markup Language (UnitsML) [13] 
 NASA Quantities, Units, Dimensions and Data Types [14] 
 Electronic Laboratory Notebook Manifest 
(elnItemManifest) [15] 
[11] JCAMP-DX – http://www.jcamp-dx.org/ 
[12] AnIML – http://animl.sourceforge.net/ 
[13] UnitsML – http://unitsml.nist.gov/ 
[14] QUDT– http://qudt.org/ 
[15] elnItemManifest –http://www.jcheminf.com/content/5/1/52
What are the 
Most Important Metadata? 
 Depends on who you talk to… 
 Platform should describe (as completely as possible) 
the types of metadata important in analysis… 
 … but leave the description of what’s important to 
the users 
 Standards for different industries, with different 
requirements, could be developed based on the 
platform
Minimum Information About a 
Chemical Analysis? 
 MIAChA (my-ache-a?) 
 Can the community agree on a minimum set of 
metadata items needed to annotate an analysis? 
 Must be for a more specific area of analysis 
 MIASA – Spectrochemical Analysis 
 MIACA – Chromatographic Analysis 
 MIAEA – Electrochemical Analysis 
 MIATA – Thermal Analysis
Different Perspectives 
on a Chemical Analysis 
 Users and publishers have different needs/wants 
when they view/present information 
 Defining perspectives(views) would extract out of a 
record only what a certain type of chemist would 
expect to see 
 Could be defined broadly or narrowly 
 This could include aggregation and/or calculation of 
new metrics derived from the basic metadata
Example: Metadata Categories in XML 
<?xml version="1.0" encoding="UTF-8"?> 
<chemicalAnalysis id="http://example.com/analysis_007"> 
<description> 
<title/><focus/><citation/><doi/><analysisType/><applicationArea/>... 
</description> 
<analytes> 
<analyte id="http://www.chemspider.com/…” type="sci:CHEMINF_000000"> 
<inchikey/><name/>... 
</analyte> 
</analytes> 
<matrices> 
<matrix id="http://champ.org/ont/champ:m0000001" type="champ:MAT_000001"> 
<name/><state/><form/>... 
</matrix> 
</matrices> 
<instruments> 
<instrument id="http://champ.org/ont/champ:i0000001” type="champ:INS_000001"> 
<name/><description/><settings/>... 
</instrument> 
</instruments> 
<validation> 
<srmAnalysis/><recoveryStudy/><methodComparison/><interferences/>... 
</validation> 
<samplePrep> 
<collection/><stabilization/><storage/><workup/>... 
</samplePrep> 
</chemicalAnalysis>
Example: Metadata Categories in JSON-LD 
{ 
"@context":"http://champ.org/chemicalanalysis.jsonld", 
“@id”: “http://example.com/analysis_007”, 
“description”: { “title”: …, “focus”: …, “citation”: …, “doi”: … 
“analysisType”: …, “applicationArea”: … } 
"analytes": [{"@id”: …, "@type”: …, "inchikey”: …, "name": … },… ], 
"matrices": [{"@id”: …, "@type": …, "name": …, “state”: …, “form”: … }, … ] 
"instruments": [{"@id": …, "@type": …, "name": …, "description”: …, 
"settings": … }, … ], 
"metrics": { "detection limit”: …, “linear dynamic range”: … }, 
“validation”: { "reference material": …, “recovery study”: …, 
“method comparison”: …, “interferences”: … }, 
“samplePrep”: { “collection”: …, “stabilization”: …, “storage”: …, 
“workup”: … } 
}
Immediate Plans 
 Get the word out 
 Put up a website to provide focal point for project 
 Get on social media and promote, encourage participation 
 Survey the community 
 Do an analysis of existing literature for metadata 
 Using resources develop an initial alpha (first pass) version 
of the platform 
 Provide mechanism for crowdsourced feedback 
 Publish examples of the use in different scenarios
Longer Term Plan 
 Version 1 of platform 
 Controlled vocabularies 
 Example documents 
 Example applications
Conclusion 
 This approach to metadata identification will provide 
value to existing resources 
 It will enhance basic searching 
 It will allow semantic searching 
 It will provide efficient annotation of large amounts 
of curated data that is not from traditional publishing
Conclusion 
 It also fits well with the mission of the 
Research Data Alliance (RDA) [16] 
 RDA Vision: Researchers and innovators openly sharing 
data across technologies, disciplines, and countries to 
address the grand challenges of society. 
 RDA Mission: The Research Data Alliance (RDA) builds 
the social and technical bridges that enable open 
sharing of data. 
[16] http://rd-alliance.org
Questions? 
 schalk@unf.edu 
 Phone: 904-620-5311 
 Skype: stuartchalk 
 LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk 
 ORCID: http://orcid.org/0000-0002-0703-7776 
 ResearcherID: http://www.researcherid.com/rid/D-8577-2013

Mais conteúdo relacionado

Mais procurados

Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.FAIRDOM
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIRDOM
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyFAIRDOM
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openlyFAIRDOM
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIMartin Scharm
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)Carole Goble
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...Carole Goble
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Carole Goble
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Stuart Chalk
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Valery Tkachenko
 
The application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsThe application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsValery Tkachenko
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...Stuart Chalk
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSValery Tkachenko
 

Mais procurados (20)

Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openly
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
 
Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...
 
eScience at the Royal Society of Chemistry and our current initiatives
eScience at the Royal Society of Chemistry and our current initiativeseScience at the Royal Society of Chemistry and our current initiatives
eScience at the Royal Society of Chemistry and our current initiatives
 
The application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platformsThe application of cloud computing to royal society of chemistry data platforms
The application of cloud computing to royal society of chemistry data platforms
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 

Destaque

Semantic Web - basic taxonomies
Semantic Web - basic taxonomiesSemantic Web - basic taxonomies
Semantic Web - basic taxonomiesRobin Houdmeyers
 
Unit 1 Semantics
Unit 1 SemanticsUnit 1 Semantics
Unit 1 Semanticsmjgvalcarce
 
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
Unit 6 -  Predicates, Referring Expressions, and Universe of DiscourseUnit 6 -  Predicates, Referring Expressions, and Universe of Discourse
Unit 6 - Predicates, Referring Expressions, and Universe of DiscourseAshwag Al Hamid
 
Unit 4 - Referring Expressions
Unit 4 -  Referring ExpressionsUnit 4 -  Referring Expressions
Unit 4 - Referring ExpressionsAshwag Al Hamid
 
Unit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsUnit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsAshwag Al Hamid
 

Destaque (6)

Semantic Web - basic taxonomies
Semantic Web - basic taxonomiesSemantic Web - basic taxonomies
Semantic Web - basic taxonomies
 
Semantics (04)
Semantics (04)Semantics (04)
Semantics (04)
 
Unit 1 Semantics
Unit 1 SemanticsUnit 1 Semantics
Unit 1 Semantics
 
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
Unit 6 -  Predicates, Referring Expressions, and Universe of DiscourseUnit 6 -  Predicates, Referring Expressions, and Universe of Discourse
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
 
Unit 4 - Referring Expressions
Unit 4 -  Referring ExpressionsUnit 4 -  Referring Expressions
Unit 4 - Referring Expressions
 
Unit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsUnit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and Propositions
 

Semelhante a ACS 248th Paper 71 ChAMP Project

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldAmit Sheth
 
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...faflrt
 
A Look into the Apache OODT Ecosystem
A Look into the Apache OODT EcosystemA Look into the Apache OODT Ecosystem
A Look into the Apache OODT EcosystemChris Mattmann
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Amit Sheth
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataStuart Chalk
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search enginesunyil96
 
Knowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentKnowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentManjulaPatel
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...Salam Shah
 
Enhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemEnhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemMichael Day
 
Next-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information RetrievalNext-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information RetrievalWaqas Tariq
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyCornelius Puschmann
 

Semelhante a ACS 248th Paper 71 ChAMP Project (20)

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-World
 
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
 
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
 
A Look into the Apache OODT Ecosystem
A Look into the Apache OODT EcosystemA Look into the Apache OODT Ecosystem
A Look into the Apache OODT Ecosystem
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search engines
 
Knowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentKnowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents Environment
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...
 
data.ac.uk briefing paper
data.ac.uk briefing paperdata.ac.uk briefing paper
data.ac.uk briefing paper
 
Enhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemEnhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization system
 
Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...
Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...
Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...
 
Next-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information RetrievalNext-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information Retrieval
 
Notes on mining social media updated
Notes on mining social media updatedNotes on mining social media updated
Notes on mining social media updated
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
 

Mais de Stuart Chalk

Semantic properties and units
Semantic properties and unitsSemantic properties and units
Semantic properties and unitsStuart Chalk
 
Open semantic chemical structures
Open semantic chemical structuresOpen semantic chemical structures
Open semantic chemical structuresStuart Chalk
 
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...Stuart Chalk
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardStuart Chalk
 
Scientific Units in the Electronic Age
Scientific Units in the Electronic AgeScientific Units in the Electronic Age
Scientific Units in the Electronic AgeStuart Chalk
 
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Stuart Chalk
 
Bringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebBringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebStuart Chalk
 
Reactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseReactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseStuart Chalk
 
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Stuart Chalk
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXStuart Chalk
 
Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Stuart Chalk
 
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaStuart Chalk
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataStuart Chalk
 
ACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectStuart Chalk
 
ACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka CollaborationACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka CollaborationStuart Chalk
 
247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research WorkbenchStuart Chalk
 
247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)Stuart Chalk
 
Liberating Laboratory Data - Eureka
Liberating Laboratory Data - EurekaLiberating Laboratory Data - Eureka
Liberating Laboratory Data - EurekaStuart Chalk
 
Liberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIMLLiberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIMLStuart Chalk
 

Mais de Stuart Chalk (19)

Semantic properties and units
Semantic properties and unitsSemantic properties and units
Semantic properties and units
 
Open semantic chemical structures
Open semantic chemical structuresOpen semantic chemical structures
Open semantic chemical structures
 
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data Standard
 
Scientific Units in the Electronic Age
Scientific Units in the Electronic AgeScientific Units in the Electronic Age
Scientific Units in the Electronic Age
 
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
 
Bringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebBringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic Web
 
Reactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseReactions to the Open Spectral Database
Reactions to the Open Spectral Database
 
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSX
 
Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)
 
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
ACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData Project
 
ACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka CollaborationACS 248th Paper 67 Eureka Collaboration
ACS 248th Paper 67 Eureka Collaboration
 
247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench
 
247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)
 
Liberating Laboratory Data - Eureka
Liberating Laboratory Data - EurekaLiberating Laboratory Data - Eureka
Liberating Laboratory Data - Eureka
 
Liberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIMLLiberating Laboratory Data - AnIML
Liberating Laboratory Data - AnIML
 

Último

CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 

Último (20)

CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 

ACS 248th Paper 71 ChAMP Project

  • 1. The Chemical Analysis Metadata Platform (ChAMP): Thoughts and Ideas on the Semantic Identification of Analytical Metrics Stuart J. Chalk, Department of Chemistry, University of North Florida Antony Williams and Valery Tkachenko, RSC Cheminformatics schalk@unf.edu 2014 Fall ACS Meeting
  • 2. Overview  Initial Idea  Motivation  Why a Platform?  Pieces of the Puzzle  Existing Resources  What are the Most Important Metadata?  Minimum Information About a Chemical Analysis?  Different Perspectives on a Chemical Analysis  Example: Metadata Categories in XML  Example: Metadata Categories in JSON-LD  Planned Activities  Conclusion
  • 3. Initial Idea  Develop a set of data standards for representation/annotation of chemical analysis information  Are there important characteristics (metadata) about analysis methodologies that, if captured, would add value to a resource?  Must be easy to implement  Must be useful across multiple disciplines
  • 4. RSC Data Repository Compounds Widgets Compounds API Paid 3rd party integrations (various platforms – SharePoint, Google, etc) Reactions Widgets Reactions API Spectra Widgets Spectra API Materials Widgets Materials API Documents Widgets Documents API Compounds Reactions Spectra Materials Documents User interface components tier Data access tier Data tier Analytical Laboratory application User interface tier (examples) Electronic Laboratory Notebook Chemical Inventory application
  • 5. Motivation  Access to knowledge in existing literature  Annotation of research in future publications  Annotation of unpublished/self published (but potentially useful) work  Annotation of data captured in ELN’s  Data ingest into the RSC’s Data Repository  Complements/enhances existing activities  The haystack is so big – we need to make it easy to make the needle show up
  • 6. Why a Platform?  Develop it to be as broadly applicable as possible  Chemical analysis is a not tangible like a spectrum  Users have domain specific needs  Users has a favorite/required format to store information  SQL Relational Database  Excel Spreadsheet  XML, YAML  JSON or JSON-LD  ChAMP should define the types of metadata and general organization of the information, not the format it is stored in (this is like MIAME [1]) [1] http://www.mged.org/Workgroups/MIAME/miame.html
  • 7. First Thoughts  Covers metadata for a chemical analysis methodology not raw analytical instrument data  Two main sections?  Fundamental method development  Method application  How big should the platform scope be?  What information is most important?  How do we get community involvement/buy-in?
  • 8. Pieces of the Puzzle  Ontology of chemical analysis terms  Taxonomy of chemical analysis metadata  Controlled vocabularies for specific metadata items  Definitions of required metadata (in context)  Naming and design rules
  • 9. Existing Resources  Ontologies  Chemical Methods Ontology (CMO) [2]  SemanticScience CHEMINF Ontology [3]  Chemical Entities of Biological Interest (ChEBI) [4]  “Ontology on Property” by René Dybkær [5]  Ontobee (ontology search) [6] [2] http://www.rsc.org/ontologies/CMO/ [3] https://code.google.com/p/semanticscience/ [4] http://www.ebi.ac.uk/chebi/ [5] http://ontology.iupac.org/ [6] http://www.ontobee.org/
  • 10. Existing Resources  Controlled Vocabularies/Taxonomies  MESH [6]  LCSH [7]  CAS Subject Headings [8]  IUPAC Orange Book [9]  IUPAC Gold Book [10]  … do they address how to organize the metadata? [6] http://www.ncbi.nlm.nih.gov/mesh [7] http://id.loc.gov/authorities/subjects.html [8] http://cas.org [9] http://iupac.org/publications/analytical_compendium [10] http://goldbook.iupac.org/
  • 11.  Other Existing Resources  JCAMP-DX [11]  Analytical Information Markup Language (AnIML) [12]  Units Markup Language (UnitsML) [13]  NASA Quantities, Units, Dimensions and Data Types [14]  Electronic Laboratory Notebook Manifest (elnItemManifest) [15] [11] JCAMP-DX – http://www.jcamp-dx.org/ [12] AnIML – http://animl.sourceforge.net/ [13] UnitsML – http://unitsml.nist.gov/ [14] QUDT– http://qudt.org/ [15] elnItemManifest –http://www.jcheminf.com/content/5/1/52
  • 12. What are the Most Important Metadata?  Depends on who you talk to…  Platform should describe (as completely as possible) the types of metadata important in analysis…  … but leave the description of what’s important to the users  Standards for different industries, with different requirements, could be developed based on the platform
  • 13. Minimum Information About a Chemical Analysis?  MIAChA (my-ache-a?)  Can the community agree on a minimum set of metadata items needed to annotate an analysis?  Must be for a more specific area of analysis  MIASA – Spectrochemical Analysis  MIACA – Chromatographic Analysis  MIAEA – Electrochemical Analysis  MIATA – Thermal Analysis
  • 14. Different Perspectives on a Chemical Analysis  Users and publishers have different needs/wants when they view/present information  Defining perspectives(views) would extract out of a record only what a certain type of chemist would expect to see  Could be defined broadly or narrowly  This could include aggregation and/or calculation of new metrics derived from the basic metadata
  • 15. Example: Metadata Categories in XML <?xml version="1.0" encoding="UTF-8"?> <chemicalAnalysis id="http://example.com/analysis_007"> <description> <title/><focus/><citation/><doi/><analysisType/><applicationArea/>... </description> <analytes> <analyte id="http://www.chemspider.com/…” type="sci:CHEMINF_000000"> <inchikey/><name/>... </analyte> </analytes> <matrices> <matrix id="http://champ.org/ont/champ:m0000001" type="champ:MAT_000001"> <name/><state/><form/>... </matrix> </matrices> <instruments> <instrument id="http://champ.org/ont/champ:i0000001” type="champ:INS_000001"> <name/><description/><settings/>... </instrument> </instruments> <validation> <srmAnalysis/><recoveryStudy/><methodComparison/><interferences/>... </validation> <samplePrep> <collection/><stabilization/><storage/><workup/>... </samplePrep> </chemicalAnalysis>
  • 16. Example: Metadata Categories in JSON-LD { "@context":"http://champ.org/chemicalanalysis.jsonld", “@id”: “http://example.com/analysis_007”, “description”: { “title”: …, “focus”: …, “citation”: …, “doi”: … “analysisType”: …, “applicationArea”: … } "analytes": [{"@id”: …, "@type”: …, "inchikey”: …, "name": … },… ], "matrices": [{"@id”: …, "@type": …, "name": …, “state”: …, “form”: … }, … ] "instruments": [{"@id": …, "@type": …, "name": …, "description”: …, "settings": … }, … ], "metrics": { "detection limit”: …, “linear dynamic range”: … }, “validation”: { "reference material": …, “recovery study”: …, “method comparison”: …, “interferences”: … }, “samplePrep”: { “collection”: …, “stabilization”: …, “storage”: …, “workup”: … } }
  • 17. Immediate Plans  Get the word out  Put up a website to provide focal point for project  Get on social media and promote, encourage participation  Survey the community  Do an analysis of existing literature for metadata  Using resources develop an initial alpha (first pass) version of the platform  Provide mechanism for crowdsourced feedback  Publish examples of the use in different scenarios
  • 18. Longer Term Plan  Version 1 of platform  Controlled vocabularies  Example documents  Example applications
  • 19. Conclusion  This approach to metadata identification will provide value to existing resources  It will enhance basic searching  It will allow semantic searching  It will provide efficient annotation of large amounts of curated data that is not from traditional publishing
  • 20. Conclusion  It also fits well with the mission of the Research Data Alliance (RDA) [16]  RDA Vision: Researchers and innovators openly sharing data across technologies, disciplines, and countries to address the grand challenges of society.  RDA Mission: The Research Data Alliance (RDA) builds the social and technical bridges that enable open sharing of data. [16] http://rd-alliance.org
  • 21. Questions?  schalk@unf.edu  Phone: 904-620-5311  Skype: stuartchalk  LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk  ORCID: http://orcid.org/0000-0002-0703-7776  ResearcherID: http://www.researcherid.com/rid/D-8577-2013