SlideShare uma empresa Scribd logo
1 de 33
Implementing chemistry platform for
OpenPHACTS: Lessons learned
Colin Batchelor, Alexey Pshenichnov, Jon Steele,
Valery Tkachenko
Royal Society of Chemistry
ACS Spring 2016
San Diego, CA
March 17th 2016
Open PHACTS Mission:
Integrate Multiple Research
Biomedical Data Resources
Into A Single Open & Sustainable
Access Point
info@openphactsfoundation.org @Open_PHACTS
Open PHACTS Practical Semantics
Acknowledgements
GlaxoSmithKline – Coordinator
Universität Wien – Managing entity
Technical University of Denmark
University of Hamburg, Center for
Bioinformatics
BioSolveIT GmBH
Consorci Mar Parc de Salut de Barcelona
Leiden University Medical Centre
Royal Society of Chemistry
Vrije Universiteit Amsterdam
Novartis
Merck Serono
H. Lundbeck A/S
Eli Lilly
Netherlands Bioinformatics Centre
Swiss Institute of Bioinformatics
ConnectedDiscovery
EMBL-European Bioinformatics Institute
Janssen Esteve Almirall
OpenLink Scibite
The Open PHACTS Foundation
Spanish National Cancer Research Centre
University of Manchester
Maastricht University
Aqnowledge
University of Santiago de Compostela
Rheinische Friedrich-Wilhelms-Universität
Bonn
AstraZeneca
Pfizer
Why is it so hard to….
Competitors?
What’s the
structure?
Are they in our
file?
What’s
similar?
What’s the
target?Pharmacology
data?
Known
Pathways?
Working On
Now?
Connections to
disease?
Expressed in right
cell type?
IP?
Literature
PubChem
Genbank
Patents
Databases
Downloads
Data Analysis Data Integration Firewalled Databases
How do R&D companies use public data?
@gray_alasdair Big Data Integration 9
Patent annotations in Open PHACTS
• Huge amount of knowledge hidden in patent corpus
• Most of which will never be published elsewhere
• Substantial lag between patent and scientific literature
• SureChEMBL system already extracts chemical entities from full-text
patent documents
• Text (title, abstract, description, claims), images, molfiles
• Complemented with gene and disease entity annotations
• Using the Termite text-mining tool by SciBite
• Relevance scoring to reduce noise
• Tested for recall
• Patent, compound, gene, disease info available via API
Open PHACTS Expanding EcoSystem
Further
Apps
Data
Warrior
• VM install of Open PHACTS
– Docker Image is now available
• Updating to ver 2.0 Open PHACTS
• Allows you to customise and load your own data into the
environment
Want to load your data into
Open PHACTS?
Want to run Open PHACTS
within your environment?
Usage
>500 million queries
All Users by Sector Type
Challenge
of
migrating
between
versions of
the API
Upgrading
Explorer Explorer2 ChemBioNavigator Target Dossier Pharmatrek Helium
MOE Collector Cytophacts Utopia Garfield SciBite
KNIME Mol. Data Sheets PipelinePilot scinav.it Taverna
openphactsfoundation.org/apps.html
Explorer.openphacts.org
http://data.openphacts.org
/artifactory/
Nanopub
Db
VoID
Data Cache
(Virtuoso Triple Store)
Semantic Workflow Engine
Linked Data API (RDF/XML, TTL, JSON)
Domain
Specific
Services
Identity
Resolution
Service
Chemistry
Registration
Normalisation
& Q/C
Identifier
Management
Service
Indexing
CorePlatform
P12374
EC2.43.4
CS4532
“Adenosine
receptor 2a”
VoID
Db
Nanopub
Db
VoID
Db
VoID
Nanopub
VoID
Public Content Commercial
Public Ontologies
User
Annotations
Apps
We integrate, standardize and host the chemical
compound collection underpinning Open
PHACTS.
We have developed a structure validation and
normalization platform (CVSP) to ensure chemical
structures are normalized to rules derived from the
FDA structure normalization guidelines and
modified based on input from members of EFPIA.
http://cvsp.chemspider.com/
The Royal Society of Chemistry’s
role in Open PHACTS
Freely-available (requires logging in)
chemical validation system for:
• Structure validation: warning on query
atoms, pseudoatoms, nonsensical or
unclear stereo
• Standardization workflows.
CVSP and the Open
Pharmacological Space Chemical
Registration System (OPS CRS)
Chemical data sources
Data source Number of records in
source
DrugBank 6828
PDB ligands 18681
MeSH (extracted by text
mining)
24381
ChEBI 40503
HMDB 41494
ChEMBL 20 1456020
SureChEMBL 1.0 14228299
We generate RDF that:
1. Describes synonyms and identifiers
2. Provides linksets between our data
sources and the OPS identifiers
3. Describes molecule–molecule relations of
interest to the pharma industry
4. Delivers calculated physicochemical
properties of compounds
5. Lists the validation and standardization
issues found by CVSP.
Royal Society of Chemistry data
provided to Open PHACTS
• Use standard ontologies where possible
(CHEMINF for cheminformatics
properties, QUDT for units, OBO
ontologies elsewhere)
• Use an event-based pattern for
cheminformatics outputs. This enables
us to add arbitrary provenance
information.
Principles
Use the CHEMINF ontology:
https://github.com/semanticchemistry
Validated ChemSpider synonyms,
Unvalidated ChemSpider synonyms,
Validated database identifiers, Unvalidated
database identifiers, InChI, InChIKey,
SMILES, preferred ChemSpider name
1. Synonyms and identifiers
Metadata describing the RDF:
• Can be used to build a directory of the
RDF available
• Find what’s there without having to
download all of it first
• Describes how Datasets are linked by the
Linksets using SKOS.
Recommendations here:
http://www.openphacts.org/specs/2013/WD-
datadesc-20130912/
2. Linksets:
Vocabulary of Interlinked
Datasets
We relate molecules to “parent” forms,
variously, those which are:
• uncharged
• not isotopically-specified
• not stereochemically-specified
• the preferred tautomer
• the largest fragment
• the “superparent” (all of the above)
3. Molecule–molecule relations in
CHEMINF
log P, log D (at pH 5.5 and 7.4),
bioconcentration factor, KOC (at pH 5.5 and
7.4), index of refraction, polar surface
area, molar refractivity, molar volume,
polarizability, surface tension, density at
STP, flash point a 1 atm, enthalpy of
vaporization at STP, vapour pressure at
STP.
4. Calculated physicochemical
properties
5. Issues from validation and
standardization
We use the CHEMINF ontology again.
We distinguish between information,
warnings and errors. Only serious failures
to process, such as a structure having an
invalid atom, count as errors.
This is the world we live in
Data quality issue and CVSP
– Robochemistry
– Proliferation of errors in public and
private databases
• ChemSpider
• PubChem
• DrugBank
• KEGG
• ChEBI/ChEMBL
– Automated quality control system
Chemistry Validation and Standardization Platform
Chemistry Validation and Standardization Platform
Thank you
Email: tkachenkov@rsc.org
Slides:
http://www.slideshare.net/valerytkachenko16

Mais conteúdo relacionado

Mais procurados

Small Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichSmall Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichEmma Schymanski
 
Building a semantic chemistry platform with the royal society of chemistry
Building a semantic chemistry platform with the royal society of chemistryBuilding a semantic chemistry platform with the royal society of chemistry
Building a semantic chemistry platform with the royal society of chemistryValery Tkachenko
 
The royal society of chemistry and its adoption of semantic web technologies ...
The royal society of chemistry and its adoption of semantic web technologies ...The royal society of chemistry and its adoption of semantic web technologies ...
The royal society of chemistry and its adoption of semantic web technologies ...Valery Tkachenko
 
SETAC Rome Non-Target Screening For Chemical Discovery
SETAC Rome Non-Target Screening For Chemical DiscoverySETAC Rome Non-Target Screening For Chemical Discovery
SETAC Rome Non-Target Screening For Chemical DiscoveryEmma Schymanski
 
Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...Valery Tkachenko
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...open_phacts
 
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity KnowledgeDMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity KnowledgeEmma Schymanski
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigData_Europe
 
ICIC 2017: New Product Introduction info apps
ICIC 2017: New Product Introduction info appsICIC 2017: New Product Introduction info apps
ICIC 2017: New Product Introduction info appsDr. Haxel Consult
 
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...Dr. Haxel Consult
 

Mais procurados (20)

Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
 
Small Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichSmall Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica Munich
 
Building a semantic chemistry platform with the royal society of chemistry
Building a semantic chemistry platform with the royal society of chemistryBuilding a semantic chemistry platform with the royal society of chemistry
Building a semantic chemistry platform with the royal society of chemistry
 
Why Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpiderWhy Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpider
 
The royal society of chemistry and its adoption of semantic web technologies ...
The royal society of chemistry and its adoption of semantic web technologies ...The royal society of chemistry and its adoption of semantic web technologies ...
The royal society of chemistry and its adoption of semantic web technologies ...
 
SETAC Rome Non-Target Screening For Chemical Discovery
SETAC Rome Non-Target Screening For Chemical DiscoverySETAC Rome Non-Target Screening For Chemical Discovery
SETAC Rome Non-Target Screening For Chemical Discovery
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...Clustering the royal society of chemistry chemical repository to enable enhan...
Clustering the royal society of chemistry chemical repository to enable enhan...
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity KnowledgeDMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
 
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspnRSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
 
Connecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpiderConnecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpider
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & Health
 
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
 
ICIC 2017: New Product Introduction info apps
ICIC 2017: New Product Introduction info appsICIC 2017: New Product Introduction info apps
ICIC 2017: New Product Introduction info apps
 
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...
ICIC 2017: Looking at the gift horse: pros and cons of over 20 million patent...
 

Destaque

OpenPHACTS - Chemistry Platform Update and Learnings
OpenPHACTS - Chemistry Platform Update and LearningsOpenPHACTS - Chemistry Platform Update and Learnings
OpenPHACTS - Chemistry Platform Update and LearningsValery Tkachenko
 
Opportunities in chemical structure standardization
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardizationValery Tkachenko
 
Experiences and adventures with no sql and its applications to cheminformatic...
Experiences and adventures with no sql and its applications to cheminformatic...Experiences and adventures with no sql and its applications to cheminformatic...
Experiences and adventures with no sql and its applications to cheminformatic...Valery Tkachenko
 
Text mining to produce large chemistry datasets for community access
Text mining to produce large chemistry datasets for community accessText mining to produce large chemistry datasets for community access
Text mining to produce large chemistry datasets for community accessValery Tkachenko
 
Not just another reaction database
Not just another reaction databaseNot just another reaction database
Not just another reaction databaseValery Tkachenko
 
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3Marilyn Gardner Milton MA
 
Archives from my sales and marketing blog part two
Archives from my sales and marketing blog part twoArchives from my sales and marketing blog part two
Archives from my sales and marketing blog part twoDan Galante
 
別當正常的傻瓜:教你如何明智做決策
別當正常的傻瓜:教你如何明智做決策別當正常的傻瓜:教你如何明智做決策
別當正常的傻瓜:教你如何明智做決策文龍 陳
 
แนะนำกังหันลมโนอาห์
แนะนำกังหันลมโนอาห์แนะนำกังหันลมโนอาห์
แนะนำกังหันลมโนอาห์noahwindmill
 
Embracing Employee Generated Video for Knowledge Sharing
Embracing Employee Generated Video for Knowledge SharingEmbracing Employee Generated Video for Knowledge Sharing
Embracing Employee Generated Video for Knowledge SharingRodan van Orden
 
Understanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalUnderstanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalAndrew Gregoris
 
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...rtngmittal
 
Carta de derechos de las personas con autismo.
Carta de derechos de las personas con autismo.Carta de derechos de las personas con autismo.
Carta de derechos de las personas con autismo.José María
 
Brave New World of Responsive Design
Brave New World of Responsive DesignBrave New World of Responsive Design
Brave New World of Responsive DesignYuan Wang
 

Destaque (19)

OpenPHACTS - Chemistry Platform Update and Learnings
OpenPHACTS - Chemistry Platform Update and LearningsOpenPHACTS - Chemistry Platform Update and Learnings
OpenPHACTS - Chemistry Platform Update and Learnings
 
Opportunities in chemical structure standardization
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardization
 
Experiences and adventures with no sql and its applications to cheminformatic...
Experiences and adventures with no sql and its applications to cheminformatic...Experiences and adventures with no sql and its applications to cheminformatic...
Experiences and adventures with no sql and its applications to cheminformatic...
 
Text mining to produce large chemistry datasets for community access
Text mining to produce large chemistry datasets for community accessText mining to produce large chemistry datasets for community access
Text mining to produce large chemistry datasets for community access
 
Not just another reaction database
Not just another reaction databaseNot just another reaction database
Not just another reaction database
 
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
 
Aventura gráfica_Esquema básico
Aventura gráfica_Esquema básico Aventura gráfica_Esquema básico
Aventura gráfica_Esquema básico
 
Archives from my sales and marketing blog part two
Archives from my sales and marketing blog part twoArchives from my sales and marketing blog part two
Archives from my sales and marketing blog part two
 
別當正常的傻瓜:教你如何明智做決策
別當正常的傻瓜:教你如何明智做決策別當正常的傻瓜:教你如何明智做決策
別當正常的傻瓜:教你如何明智做決策
 
Present simple
Present simplePresent simple
Present simple
 
แนะนำกังหันลมโนอาห์
แนะนำกังหันลมโนอาห์แนะนำกังหันลมโนอาห์
แนะนำกังหันลมโนอาห์
 
Steganography
SteganographySteganography
Steganography
 
Embracing Employee Generated Video for Knowledge Sharing
Embracing Employee Generated Video for Knowledge SharingEmbracing Employee Generated Video for Knowledge Sharing
Embracing Employee Generated Video for Knowledge Sharing
 
Reglamento
ReglamentoReglamento
Reglamento
 
Understanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalUnderstanding The Big Data Opportunity Final
Understanding The Big Data Opportunity Final
 
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...
Innovative Solution to Authorickshaws not agreeing to ferrying the passengers...
 
Carta de derechos de las personas con autismo.
Carta de derechos de las personas con autismo.Carta de derechos de las personas con autismo.
Carta de derechos de las personas con autismo.
 
Circus
CircusCircus
Circus
 
Brave New World of Responsive Design
Brave New World of Responsive DesignBrave New World of Responsive Design
Brave New World of Responsive Design
 

Semelhante a Implementing chemistry platform for OpenPHACTS

Open PHACTS (Sept 2013) EBI Industry Programme
Open PHACTS (Sept 2013) EBI Industry ProgrammeOpen PHACTS (Sept 2013) EBI Industry Programme
Open PHACTS (Sept 2013) EBI Industry ProgrammeSciBite Limited
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
Open chemistry registry and mapping platform based on open source cheminforma...
Open chemistry registry and mapping platform based on open source cheminforma...Open chemistry registry and mapping platform based on open source cheminforma...
Open chemistry registry and mapping platform based on open source cheminforma...Valery Tkachenko
 
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS FoundationPistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS FoundationPistoia Alliance
 
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BigData_Europe
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts
 
Mashing Up Drug Discovery
Mashing Up Drug DiscoveryMashing Up Drug Discovery
Mashing Up Drug DiscoverySciBite Limited
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTSStian Soiland-Reyes
 
A Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway DataA Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway Dataguest9fc5f3
 
Time for an open standards platform?
Time for an open standards platform?Time for an open standards platform?
Time for an open standards platform?openEHR Foundation
 
Open innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectOpen innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectKen Karapetyan
 
2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...Michel Dumontier
 
Royal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discoveryRoyal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discoveryKen Karapetyan
 

Semelhante a Implementing chemistry platform for OpenPHACTS (20)

Open PHACTS (Sept 2013) EBI Industry Programme
Open PHACTS (Sept 2013) EBI Industry ProgrammeOpen PHACTS (Sept 2013) EBI Industry Programme
Open PHACTS (Sept 2013) EBI Industry Programme
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
Open chemistry registry and mapping platform based on open source cheminforma...
Open chemistry registry and mapping platform based on open source cheminforma...Open chemistry registry and mapping platform based on open source cheminforma...
Open chemistry registry and mapping platform based on open source cheminforma...
 
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS FoundationPistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
 
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG
 
Mashing Up Drug Discovery
Mashing Up Drug DiscoveryMashing Up Drug Discovery
Mashing Up Drug Discovery
 
Practical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS projectPractical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS project
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS
 
A Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway DataA Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway Data
 
Time for an open standards platform?
Time for an open standards platform?Time for an open standards platform?
Time for an open standards platform?
 
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
 
Open innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectOpen innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts project
 
Open innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectOpen innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts project
 
ChemSpider – An Online Database and Registration System Linking the Web
ChemSpider – An Online Database and  Registration System Linking the WebChemSpider – An Online Database and  Registration System Linking the Web
ChemSpider – An Online Database and Registration System Linking the Web
 
2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...
 
How the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data finalHow the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data final
 
Royal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discoveryRoyal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discovery
 
Royal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discoveryRoyal society of chemistry developments to support open drug discovery
Royal society of chemistry developments to support open drug discovery
 

Mais de Valery Tkachenko

Evolution of public chemistry databases: past and the future
Evolution of public chemistry databases: past and the futureEvolution of public chemistry databases: past and the future
Evolution of public chemistry databases: past and the futureValery Tkachenko
 
In silico design of new functional materials
In silico design of new functional materialsIn silico design of new functional materials
In silico design of new functional materialsValery Tkachenko
 
Metal-organic frameworks: from database to supramolecular effects in complexa...
Metal-organic frameworks: from database to supramolecular effects in complexa...Metal-organic frameworks: from database to supramolecular effects in complexa...
Metal-organic frameworks: from database to supramolecular effects in complexa...Valery Tkachenko
 
Abstract recommendation system: beyond word-level representations
Abstract recommendation system: beyond word-level representationsAbstract recommendation system: beyond word-level representations
Abstract recommendation system: beyond word-level representationsValery Tkachenko
 
Machine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsMachine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsValery Tkachenko
 
Chemical workflows supporting automated research data collection
Chemical workflows supporting automated research data collectionChemical workflows supporting automated research data collection
Chemical workflows supporting automated research data collectionValery Tkachenko
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsValery Tkachenko
 
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsDeep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsValery Tkachenko
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Valery Tkachenko
 
Need and benefits for structure standardization to facilitate integration and...
Need and benefits for structure standardization to facilitate integration and...Need and benefits for structure standardization to facilitate integration and...
Need and benefits for structure standardization to facilitate integration and...Valery Tkachenko
 
Development and comparison of deep learning toolkit with other machine learni...
Development and comparison of deep learning toolkit with other machine learni...Development and comparison of deep learning toolkit with other machine learni...
Development and comparison of deep learning toolkit with other machine learni...Valery Tkachenko
 
Living in a world of federated knowledge challenges, principles, tools and ...
Living in a world of federated knowledge   challenges, principles, tools and ...Living in a world of federated knowledge   challenges, principles, tools and ...
Living in a world of federated knowledge challenges, principles, tools and ...Valery Tkachenko
 
Using the structured product labeling format to index versatile chemical data
Using the structured product labeling format to index versatile chemical dataUsing the structured product labeling format to index versatile chemical data
Using the structured product labeling format to index versatile chemical dataValery Tkachenko
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesValery Tkachenko
 
Chemistry Validation and Standardization Platform v2.0
Chemistry Validation and Standardization Platform v2.0Chemistry Validation and Standardization Platform v2.0
Chemistry Validation and Standardization Platform v2.0Valery Tkachenko
 
Open Science Data Repository - the platform for materials research
Open Science Data Repository - the platform for materials researchOpen Science Data Repository - the platform for materials research
Open Science Data Repository - the platform for materials researchValery Tkachenko
 
Evolution of open chemical information
Evolution of open chemical informationEvolution of open chemical information
Evolution of open chemical informationValery Tkachenko
 
OMPOL – visualisation of large chemical spaces
OMPOL – visualisation of large chemical spacesOMPOL – visualisation of large chemical spaces
OMPOL – visualisation of large chemical spacesValery Tkachenko
 
Supporting the exploding dimensions of the chemical sciences via global netwo...
Supporting the exploding dimensions of the chemical sciences via global netwo...Supporting the exploding dimensions of the chemical sciences via global netwo...
Supporting the exploding dimensions of the chemical sciences via global netwo...Valery Tkachenko
 
Royal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesRoyal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesValery Tkachenko
 

Mais de Valery Tkachenko (20)

Evolution of public chemistry databases: past and the future
Evolution of public chemistry databases: past and the futureEvolution of public chemistry databases: past and the future
Evolution of public chemistry databases: past and the future
 
In silico design of new functional materials
In silico design of new functional materialsIn silico design of new functional materials
In silico design of new functional materials
 
Metal-organic frameworks: from database to supramolecular effects in complexa...
Metal-organic frameworks: from database to supramolecular effects in complexa...Metal-organic frameworks: from database to supramolecular effects in complexa...
Metal-organic frameworks: from database to supramolecular effects in complexa...
 
Abstract recommendation system: beyond word-level representations
Abstract recommendation system: beyond word-level representationsAbstract recommendation system: beyond word-level representations
Abstract recommendation system: beyond word-level representations
 
Machine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsMachine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpoints
 
Chemical workflows supporting automated research data collection
Chemical workflows supporting automated research data collectionChemical workflows supporting automated research data collection
Chemical workflows supporting automated research data collection
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsDeep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
 
Need and benefits for structure standardization to facilitate integration and...
Need and benefits for structure standardization to facilitate integration and...Need and benefits for structure standardization to facilitate integration and...
Need and benefits for structure standardization to facilitate integration and...
 
Development and comparison of deep learning toolkit with other machine learni...
Development and comparison of deep learning toolkit with other machine learni...Development and comparison of deep learning toolkit with other machine learni...
Development and comparison of deep learning toolkit with other machine learni...
 
Living in a world of federated knowledge challenges, principles, tools and ...
Living in a world of federated knowledge   challenges, principles, tools and ...Living in a world of federated knowledge   challenges, principles, tools and ...
Living in a world of federated knowledge challenges, principles, tools and ...
 
Using the structured product labeling format to index versatile chemical data
Using the structured product labeling format to index versatile chemical dataUsing the structured product labeling format to index versatile chemical data
Using the structured product labeling format to index versatile chemical data
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databases
 
Chemistry Validation and Standardization Platform v2.0
Chemistry Validation and Standardization Platform v2.0Chemistry Validation and Standardization Platform v2.0
Chemistry Validation and Standardization Platform v2.0
 
Open Science Data Repository - the platform for materials research
Open Science Data Repository - the platform for materials researchOpen Science Data Repository - the platform for materials research
Open Science Data Repository - the platform for materials research
 
Evolution of open chemical information
Evolution of open chemical informationEvolution of open chemical information
Evolution of open chemical information
 
OMPOL – visualisation of large chemical spaces
OMPOL – visualisation of large chemical spacesOMPOL – visualisation of large chemical spaces
OMPOL – visualisation of large chemical spaces
 
Supporting the exploding dimensions of the chemical sciences via global netwo...
Supporting the exploding dimensions of the chemical sciences via global netwo...Supporting the exploding dimensions of the chemical sciences via global netwo...
Supporting the exploding dimensions of the chemical sciences via global netwo...
 
Royal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and librariesRoyal Society of Chemistry open source cheminformatics platforms and libraries
Royal Society of Chemistry open source cheminformatics platforms and libraries
 

Último

Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxSimeonChristian
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 

Último (20)

Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 

Implementing chemistry platform for OpenPHACTS

  • 1. Implementing chemistry platform for OpenPHACTS: Lessons learned Colin Batchelor, Alexey Pshenichnov, Jon Steele, Valery Tkachenko Royal Society of Chemistry ACS Spring 2016 San Diego, CA March 17th 2016
  • 2. Open PHACTS Mission: Integrate Multiple Research Biomedical Data Resources Into A Single Open & Sustainable Access Point
  • 3. info@openphactsfoundation.org @Open_PHACTS Open PHACTS Practical Semantics Acknowledgements GlaxoSmithKline – Coordinator Universität Wien – Managing entity Technical University of Denmark University of Hamburg, Center for Bioinformatics BioSolveIT GmBH Consorci Mar Parc de Salut de Barcelona Leiden University Medical Centre Royal Society of Chemistry Vrije Universiteit Amsterdam Novartis Merck Serono H. Lundbeck A/S Eli Lilly Netherlands Bioinformatics Centre Swiss Institute of Bioinformatics ConnectedDiscovery EMBL-European Bioinformatics Institute Janssen Esteve Almirall OpenLink Scibite The Open PHACTS Foundation Spanish National Cancer Research Centre University of Manchester Maastricht University Aqnowledge University of Santiago de Compostela Rheinische Friedrich-Wilhelms-Universität Bonn AstraZeneca Pfizer
  • 4. Why is it so hard to…. Competitors? What’s the structure? Are they in our file? What’s similar? What’s the target?Pharmacology data? Known Pathways? Working On Now? Connections to disease? Expressed in right cell type? IP?
  • 5. Literature PubChem Genbank Patents Databases Downloads Data Analysis Data Integration Firewalled Databases How do R&D companies use public data?
  • 6. @gray_alasdair Big Data Integration 9
  • 7. Patent annotations in Open PHACTS • Huge amount of knowledge hidden in patent corpus • Most of which will never be published elsewhere • Substantial lag between patent and scientific literature • SureChEMBL system already extracts chemical entities from full-text patent documents • Text (title, abstract, description, claims), images, molfiles • Complemented with gene and disease entity annotations • Using the Termite text-mining tool by SciBite • Relevance scoring to reduce noise • Tested for recall • Patent, compound, gene, disease info available via API
  • 8. Open PHACTS Expanding EcoSystem Further Apps Data Warrior
  • 9. • VM install of Open PHACTS – Docker Image is now available • Updating to ver 2.0 Open PHACTS • Allows you to customise and load your own data into the environment Want to load your data into Open PHACTS? Want to run Open PHACTS within your environment?
  • 10.
  • 12. All Users by Sector Type
  • 14.
  • 15. Explorer Explorer2 ChemBioNavigator Target Dossier Pharmatrek Helium MOE Collector Cytophacts Utopia Garfield SciBite KNIME Mol. Data Sheets PipelinePilot scinav.it Taverna
  • 18. Nanopub Db VoID Data Cache (Virtuoso Triple Store) Semantic Workflow Engine Linked Data API (RDF/XML, TTL, JSON) Domain Specific Services Identity Resolution Service Chemistry Registration Normalisation & Q/C Identifier Management Service Indexing CorePlatform P12374 EC2.43.4 CS4532 “Adenosine receptor 2a” VoID Db Nanopub Db VoID Db VoID Nanopub VoID Public Content Commercial Public Ontologies User Annotations Apps
  • 19. We integrate, standardize and host the chemical compound collection underpinning Open PHACTS. We have developed a structure validation and normalization platform (CVSP) to ensure chemical structures are normalized to rules derived from the FDA structure normalization guidelines and modified based on input from members of EFPIA. http://cvsp.chemspider.com/ The Royal Society of Chemistry’s role in Open PHACTS
  • 20. Freely-available (requires logging in) chemical validation system for: • Structure validation: warning on query atoms, pseudoatoms, nonsensical or unclear stereo • Standardization workflows. CVSP and the Open Pharmacological Space Chemical Registration System (OPS CRS)
  • 21. Chemical data sources Data source Number of records in source DrugBank 6828 PDB ligands 18681 MeSH (extracted by text mining) 24381 ChEBI 40503 HMDB 41494 ChEMBL 20 1456020 SureChEMBL 1.0 14228299
  • 22. We generate RDF that: 1. Describes synonyms and identifiers 2. Provides linksets between our data sources and the OPS identifiers 3. Describes molecule–molecule relations of interest to the pharma industry 4. Delivers calculated physicochemical properties of compounds 5. Lists the validation and standardization issues found by CVSP. Royal Society of Chemistry data provided to Open PHACTS
  • 23. • Use standard ontologies where possible (CHEMINF for cheminformatics properties, QUDT for units, OBO ontologies elsewhere) • Use an event-based pattern for cheminformatics outputs. This enables us to add arbitrary provenance information. Principles
  • 24. Use the CHEMINF ontology: https://github.com/semanticchemistry Validated ChemSpider synonyms, Unvalidated ChemSpider synonyms, Validated database identifiers, Unvalidated database identifiers, InChI, InChIKey, SMILES, preferred ChemSpider name 1. Synonyms and identifiers
  • 25. Metadata describing the RDF: • Can be used to build a directory of the RDF available • Find what’s there without having to download all of it first • Describes how Datasets are linked by the Linksets using SKOS. Recommendations here: http://www.openphacts.org/specs/2013/WD- datadesc-20130912/ 2. Linksets: Vocabulary of Interlinked Datasets
  • 26. We relate molecules to “parent” forms, variously, those which are: • uncharged • not isotopically-specified • not stereochemically-specified • the preferred tautomer • the largest fragment • the “superparent” (all of the above) 3. Molecule–molecule relations in CHEMINF
  • 27. log P, log D (at pH 5.5 and 7.4), bioconcentration factor, KOC (at pH 5.5 and 7.4), index of refraction, polar surface area, molar refractivity, molar volume, polarizability, surface tension, density at STP, flash point a 1 atm, enthalpy of vaporization at STP, vapour pressure at STP. 4. Calculated physicochemical properties
  • 28. 5. Issues from validation and standardization We use the CHEMINF ontology again. We distinguish between information, warnings and errors. Only serious failures to process, such as a structure having an invalid atom, count as errors.
  • 29. This is the world we live in
  • 30. Data quality issue and CVSP – Robochemistry – Proliferation of errors in public and private databases • ChemSpider • PubChem • DrugBank • KEGG • ChEBI/ChEMBL – Automated quality control system
  • 31. Chemistry Validation and Standardization Platform
  • 32. Chemistry Validation and Standardization Platform

Notas do Editor

  1. Remember this, some of these questions are easier to answer than others
  2. Using available public data is critical to drug discovery
  3. 10 Can go get everything Open PHACTS not a repo of the world, specific sources
  4. 8
  5. Open PHACTS was developed to support the key questions of drug discovery Business questions have been at the heart of Open PHACTS and have driven the development of the platform Mx/psa, how calculated who did it? Mash up. With your data too, - top layer join together but need them all commercial Data provided by many publishers Originally in many formats: relational, SD files and RDF Worked closely with publishers Data licensing was a major issue Over 5 billion triples – 14 datasets & growing Hosted on beefy hardware; data in memory (aim) Extensive memcaching Pose complex queries to extract data
  6. 4 million full-text patent documents annotated • USPTO, WIPO, EPO – English language • Life-sciences relevant • Patents mapped to SureChEMBL IDs (e.g. EP-1339685-A2) • Title, publication date, classification codes • Compounds mapped to SCHEMBL IDs (e.g. SCHEMBL15064) • Genes mapped to HGNC symbols (e.g. FDFT1) • Diseases mapped to MeSH terms (e.g. D009765)
  7. Db Stds :which ones (later) Access (API). Driven by the API. Acelerate bulding if apps
  8. Open PHACTS discover platform is now supported by a Foundation
  9. Seen a growing usage of the platform both in volume and in registered applications API remains the cornerstone of the delivery
  10. Connected to different consumer groups
  11. Once users get connected to an API, they tend to stick with it. We are listening to this and will have a version independent URL
  12. Here we see the RSC dataset in Open PHACT’s data repository, which is running the open source Artifactory. The repository understands Maven metadata, and also maintain and verifies checksums of data artifacts. We see here it includes the suggested <dependency> setting for using the dataset from a different Maven project – while this would be a bit exotic perhaps, doing so would put the dataset directly on the classloader without any worrying about downloads or file paths. We can see the hierarchy of the dataset on the left – the repository has expanded the archive for us. The .ro folder contains the Research Object manifest, the void file is the Dataset description – the rest is the “actual data”. One power of Maven is the ease of setting up mirroring – the dataset above is actually from a mirror of the Maven repository of the build server in Manchester.
  13. All have probably seen this slide. Want to pick out some of the key changes and tomorrow will here more