SlideShare uma empresa Scribd logo
1 de 51
Baixar para ler offline
Data for AI
Models, The Past,
The Present, The
Future
John P. Overington
jpo@md.catapult.org.uk
© 2019 Medicines Discovery Catapult. All rights reserved.
“Public data is the
worst form of
training data for AI
except for all those
other forms that
have been tried
from time to time”
Winston Churchill, 2016
© 2019 Medicines Discovery Catapult. All rights reserved.
National facility connecting the UK
community to accelerate innovative
drug discovery
• Independent not-for-profit organisation
• Part of the U.K.’s Catapult network
• Helping to deliver the U.K.’s Industrial Strategy
• Funded by Innovate U.K., part of UK Research
and Innovation, reporting to the Department
for Business, Energy & Industrial Strategy
• Focus on SME and translational academic
sector support
MDC - Medicines Discovery Catapult
© 2019 Medicines Discovery Catapult. All rights reserved.
ChEMBL, SureChEMBL & UniChem
© 2019 Medicines Discovery Catapult. All rights reserved.
• Originally developed 2003 at
Inpharmatica
• Spun out to public domain
• The world’s largest primary public
database of medicinal chemistry data
• ~2.3 million compounds
• ~11,000 targets
• ~15 million bioactivities
• Truly Open Data - CC-BY-SA license
• API, MyChEMBL VM, RDF, full tables
download….
• Basis of vast majority of AI innovation
in compound design/optimisation
Gaulton et al (2012) Nucleic Acids Research Database Issue. 40 D1100-1107
ChEMBL – www.ebi.ac.uk/chembl
© 2019 Medicines Discovery Catapult. All rights reserved.
Compound
Assay
Ki=4.5 nM
>Thrombin
MAHVRGLQLPGCLALAALCSLVHSQHVFLAPQQARSLLQRVRRANTFLEEVRKGNLERECVEETCSY
EEAFEALESSTATDVFWAKYTACETARTPRDKLAACLEGNCAEGLGTNYRGHVNITRSGIECQLWRS
RYPHKPEINSTTHPGADLQENFCRNPDSSTTGPWCYTTDPTVRRQECSIPVCGQDQVTVAMTPRSEG
SSVNLSPPLEQCVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGD
EEGVWCYVAGKPGDFGYCDLNYCEEAVEEETGDGLDEDSDRAIEGRTATSEYQTFFNPRTFGSGEAD
CGLRPLFEKKSLEDKTERELLESYIDGRIVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVL
TAAHCLLYPPWDKNFTENDLLVRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLK
KPVAFSDYIHPVCLPDRETAASLLQAGYKGRVTGWGNLKETWTANVGKGQPSVLQVVNLPIVERPVC
KDSTRIRITDNMFCAGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFY
THVFRLKKWIQKVIDQFGE
ED2=230 nM
Inhibition of
human Thrombin
PTT (partial
thromboplastin
time)
ChEMBL
© 2019 Medicines Discovery Catapult. All rights reserved.
• Public chemistry patent resource
• Donated by Digital Science –
SureChem commercial product
• Automatically extracted chemical
structures from full-text patents
• >18 million chemical structures
• Updated daily
• Full chemistry data download
SureChEMBL– www.surechembl.org
Papadatos et al (2016) Nucl. Acids Res Database Issue D1220-1228
© 2019 Medicines Discovery Catapult. All rights reserved.
UniChem – www.ebi.ac.uk/unichem
• Simple chemical
integration service
• >144 million structures
from ~30 sources
• URI/resource ID/Standard
InChI based lookups
• Available chemicals,
PubChem, ZINC, real
time, private
• Chemical structure ‘Time
Machine’
Chambers et al (2013) J. Cheminf. DOI:10.1186/1758-2946-5-3
© 2019 Medicines Discovery Catapult. All rights reserved.
Personal Perspectives on ChEMBL
• Things that worked well
• Single, major visionary funder – Wellcome Trust
• Focus on data content/backend not GUI
• Clear License – CC-BY-SA - same license as Wikipedia content
• Private/secure services
• Opportunism – SureChEMBL
• Open Data in ChEMBL re-invigorated cheminformatics research
• Things that didn’t work so well
• Community curation attempts – armchair critics
• Publisher interactions – except Royal Society of Chemistry
• I would do things very differently now
© 2019 Medicines Discovery Catapult. All rights reserved.
The Reproducibility Reproducibility Crisis!
Begley & Lee (2012) Nature DOI:10.1038/483531 & Prinz et al (2011) NRDD DOI:10.1038/nrd3439-c1
© 2019 Medicines Discovery Catapult. All rights reserved.
Enhanced data
model for ChEMBL
can appear as
‘errors’: e.g.
complexes,
receptor sets,
model organisms
“The more complex
the parameter, the
more frequent the
errors”
Errors in ChEMBL
Tiikkainen et al (2013) JCIM DOI:10.1021/ci400099q
© 2019 Medicines Discovery Catapult. All rights reserved.
Errors in SureChEMBL
Senger et al (2015) J Cheminf DOI:10.1186/s13321-015-0097-z
© 2019 Medicines Discovery Catapult. All rights reserved.
0.2
0.4
0.6
−4 −2 0 2 4
diff
density
Inter-species Assay Variability
Distribution of potency
differences
Scatter plot of
measured potencies
n = 2.781
Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333
Same compound, same end-point for rat and human orthologs
pKi human
pKirat
diff(human, rat)
norm.dens.
2
4
6
8
10
12
2 4 6 8 10 12
orthoFrame$afnty1
orthoFrame$afnty2
© 2019 Medicines Discovery Catapult. All rights reserved.
2
4
6
8
10
12
2 4 6 8 10 12
sampleFrame$afnty1
sampleFrame$afnty2
0.2
0.4
0.6
−4 −2 0 2 4
diffdensity
pKi Assay1
pKiAssay2
diff(assay1, assay2)
n = 3.000
norm.dens.
Scatter plot of measured
potencies
Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333
Same compound, same species, different publication
Distribution of potency
differences
Inter-lab Assay Variability
© 2019 Medicines Discovery Catapult. All rights reserved.
density
Inter-species vs Inter-lab Variability
Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333
pKii - pKij
density Inter-laboratory
Inter-species
© 2019 Medicines Discovery Catapult. All rights reserved.
Garnett et al (2012) Nature DOI:10.1371/journal.pcbi.1002333 & Barretina et al (2012) Nature DOI:10.1038/nature11003
Large-Scale Cell-line Screening Data
© 2019 Medicines Discovery Catapult. All rights reserved.
Inconsistent Cell-line Screening Data
Haibe-Kains et al (2013) Nature DOI:10.1038/nature12831 (see also Stransky et al (2015) Nature DOI:10.1038/nature15736)
© 2019 Medicines Discovery Catapult. All rights reserved.
Primary Data – Batches and Replicates
http://www.wexlerwallace.com/wp-content/uploads/2012/04/Southeast-Laborers-Health-v-Pfizer.pdf
© 2019 Medicines Discovery Catapult. All rights reserved.
Incorrect Chemical Structures
Bosutinib Voxtalisib
http://cen.acs.org/articles/90/web/2012/05/Bosutinib-Buyer-Beware.html, & Overington & Wennerberg unpublished
© 2019 Medicines Discovery Catapult. All rights reserved.
Biochemical
assay
Cell-
based
screen
Functional
assay
Animal
disease
model
Human
clinical
trial
Variance – From Simple to Complex
Inter study variance
Number of assay variables
Steady state Time dependent
© 2019 Medicines Discovery Catapult. All rights reserved.
The Present
© 2019 Medicines Discovery Catapult. All rights reserved.
MDC Collaborating With The Sector
© 2019 Medicines Discovery Catapult. All rights reserved.
DeepADMET
• DeepADMET – InnovateUK grant
• Optibrium Ltd.
• Intellegens Ltd.
• Medicines Discovery Catapult
• MDC engineering software pipeline to
supply ‘SAR data on demand’
• Flexible wrt document source
• Fast and responsive
• Significantly boost public/internal data
• Deliver provenanced activity ‘vectors’
• Develop broader range of robust
ADMET models using deep learning
Document
gathering
NLP /
NER
Data
Extraction
&
Heuristics
SAR
vectors
© 2019 Medicines Discovery Catapult. All rights reserved.
Secondary
(compiled from literature review, databases)
Primary (preferred)
(measured in the same assay)
Assay conditions Assay conditions
Compound
Compound
*
DeepADMET – Data Structure
© 2019 Medicines Discovery Catapult. All rights reserved.
The Future
© 2019 Medicines Discovery Catapult. All rights reserved.
https://stevenmiller888.github.io/mind-how-to-build-a-neural-network/
Neural Networks
© 2019 Medicines Discovery Catapult. All rights reserved.
Assays in Drug Discovery
Biochemical
assays
Cell-based
assays
Functional
assays
In vivo
assays
Human
studies
Proteins Cell lines Tissues &
organs
Animal models Humans
ancient
“Human clinical trial”
• Error prone, serendipitous discoveries
• Traditional medicines: aspirin, quinine, …
© 2019 Medicines Discovery Catapult. All rights reserved.
Assays in Drug Discovery
Biochemical
assays
Cell-based
assays
Functional
assays
In vivo
assays
Human
studies
Proteins Cell lines Tissues &
organs
Animal models Humans
1910s ancient
Animal in vivo assays
• Faster, safer, cheaper
• … but less predictive
© 2019 Medicines Discovery Catapult. All rights reserved.
Assays in Drug Discovery
Biochemical
assays
Cell-based
assays
Functional
assays
In vivo
assays
Human
studies
Proteins Cell lines Tissues &
organs
Animal models Humans
1920s 1910s ancient
Ex vivo assays
• Higher throughput, cheaper
• Mechanistic insights
• … but less predictive
© 2019 Medicines Discovery Catapult. All rights reserved.
Assays in Drug Discovery
Biochemical
assays
Cell-based
assays
Functional
assays
In vivo
assays
Human
studies
Proteins Cell lines Tissues &
organs
Animal models Humans
1950s 1920s 1910s ancient
Cell-based assays
• Higher throughput, cheaper
• Mechanistic insights
• … but less predictive
© 2019 Medicines Discovery Catapult. All rights reserved.
Assays in Drug Discovery
Biochemical
assays
Cell-based
assays
Functional
assays
In vivo
assays
Human
studies
Proteins Cell lines Tissues &
organs
Animal models Humans
1970s 1950s 1920s 1910s ancient
Biochemical assays
• Higher throughput
• Mechanistic insights
• Recombinant DNA technology
• … but less predictive
© 2019 Medicines Discovery Catapult. All rights reserved.
Example Assay Path: Anti-inflammatory Drugs
Prostaglandin
G/H synthase 2
LPS-stimulated
THP-1 cells
LPS-stimulated
human whole blood
carrageenan-
injected rat
acute gout
patient
© 2019 Medicines Discovery Catapult. All rights reserved.
© 2019 Medicines Discovery Catapult. All rights reserved.
• Finding Assays
• Text-mining across papers, patents, vendor catalogues
• Indexing of Assays
• specialist dictionaries - techniques, equipment, genes, end-points, ….
• Classification of assays
• Efficacy/ADMET & biochemical, cell-based, organoid, tissue, ….
• Similarity of Assays
• how ‘similar’ are two assays?
• Chaining of Assays
• constructing the directed graph
• Learning thresholds
• Identification of ‘triggers’ from chained, directed assay pairs
AssayNet – Building the Network
© 2019 Medicines Discovery Catapult. All rights reserved.
© 2019 Medicines Discovery Catapult. All rights reserved.
© 2019 Medicines Discovery Catapult. All rights reserved.
© 2019 Medicines Discovery Catapult. All rights reserved.
© 2019 Medicines Discovery Catapult. All rights reserved.
Assay 1 Assay 2
• Decision Thresholds
• What activity threshold in Assay 1 makes it worth measuring in Assay 2?
• Learn from statistical distributions
• Probably artefactually thresholded at integral pIC50 thresholds – e.g. 1mM (cf P-value distributions)
Learning Decision Thresholds
pIC50
pIC50
#
#
Compounds
selected for
screening in
assay 2
Distribution of activity values of
compounds in Assay 1
Sharp cutoff
Sampled cutoff
© 2019 Medicines Discovery Catapult. All rights reserved.
Bayesian Networks
© 2019 Medicines Discovery Catapult. All rights reserved.
Bioassay data - ChEMBL Database
IC50 4.5 nM
>Thrombin
MAHVRGLQLPGCLALAALCSLVHSQHVFLA
PQQARSLLQRVRRANTFLEEVRKGNLEREC
VEETCSYEEAFEALESSTATDVFWAKYTAC
ETARTPRDKLAACLEGNCAEGLGTNYRGHV
APTT
11 min
Target
Compoun
d
Bioassay data
Compound
Assay
• Data manually extracted by a team of
curators from published pharmacology
and drug discovery literature (e.g.
Journal of Medicinal Chemistry)
• ChEMBL has transformed many aspects
of cheminformatics research
− Target prediction
− Large-scale QSAR
− Matched Molecular Pairs
− …
• ChEMBL is foundation data source of
almost all published AI compound
design research
© 2019 Medicines Discovery Catapult. All rights reserved.
1
a
b
d
2
3c
5e
4 g
f
h
6
ChEMBL as a Graph
assay-assay network
compound-compound network
b
f
c
h
ge
a d
1
a
1 a
compound assay
has activity in
Zwierzyna & Overington (in preparation)
1
2
4
6
5
3
© 2019 Medicines Discovery Catapult. All rights reserved.
Assay Network: Binding Assay Data (Subset)
A subset of the assay network (~6,000 nodes)
constructed using protein-binding assay data
from ChEMBL
Zwierzyna & Overington (in preparation)
© 2019 Medicines Discovery Catapult. All rights reserved.
Assay Network: Preclinical Assay Data
PPAR binding assay
DPP-4 binding assay
in vivo assay
cell-based assay
Zwierzyna & Overington (in preparation)
• Fragment of the assay network with a
subset of bioassays testing antidiabetic
compounds
• Assays involving closely related
biological targets are clustered
together, e.g. assays involving various
peroxisome proliferator-activated
receptors in the green cluster
• Antidiabetic compounds with different
mechanism of action (e.g. DPP-4
inhibitors and PPAR agonists) are often
tested in the same animal model (such
as Zucker diabetic rat) → in vivo
assays link distinct clusters
© 2019 Medicines Discovery Catapult. All rights reserved.
Animal Models: Assay Descriptions
CHEMBL893931:
“Inhibition of carrageenan-induced paw oedema
in Sprague-Dawley rat at 5.16 mg/kg, sc after 3 hrs.”
© 2019 Medicines Discovery Catapult. All rights reserved.
Animal Models: Assay Descriptions
Induced
Model Phenotype
Genetic
Strain
Dosage Administratio
n Route
Timing
CHEMBL893931:
“Inhibition of carrageenan-induced paw oedema
in Sprague-Dawley rat at 5.16 mg/kg, sc after 3 hrs.”
© 2019 Medicines Discovery Catapult. All rights reserved.
Information Extraction From Assay Descriptions
Antiallodynicactivity in Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed as attenuation of mechanicalallodynia
JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN
NP PP NP VP PP NP PP NP
S
CHEMBL1799193:
Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia.
Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia
Experiment Phenotype PhenotypeStrain
Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia
A
B
C
D
Antiallodynicactivity in Wistar albino rat chronicconstriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia
JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN
NP PP NP VP PP NP PP NP
S
CHEMBL1799193:
Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia.
Antiallodynicactivity Wistar albino rat chronicconstriction injury-induced neuropathic pain model assessed attenuation mechanical allodynia
Experiment Phenotype PhenotypeStrain
Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanical allodynia
A
B
C
D
Sentence
Noun Phrase
Verb Phrase
AdjectiveNoun Verb
Prepositional Phrase
Antiallodynicactivity in Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed as attenuation of mechanical allodynia
JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN
NP PP NP VP PP NP PP NP
S
CHEMBL1799193:
Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia.
Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia
[9.11,8.73,9.19,...] [-0.17,-0.57,0.01,...] [8.95,3.39,-5.22,...] [9.08,8.02,8.09,...][9.11,8.73,9.19,...][9.56,9.14,2.10,...][9.10,8.72,9.18,...]
Experiment Phenotype PhenotypeStrain
Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanical allodynia
A
B
C
D
E
Zwierzyna & Overington (in preparation)
© 2019 Medicines Discovery Catapult. All rights reserved.
PCA of Word2Vec Assay Descriptions
Each assay description: average over its word vectors. Data points projected from a 200-dimensional
space to 2D using PCA
Zwierzyna & Overington, unpublished
© 2019 Medicines Discovery Catapult. All rights reserved.
Word2vec Embedding of Assays
L01 (antineoplastic)M01 (anti-inflammatory)
ChEMBL assays of known drugs annotated with different ATC codes (~15k of ~94k)
N03 (antiepileptic)
A10 (antidiabetic)C02 (antihypertensive) N02 (analgesic)
Zwierzyna&Overington,unpublished
© 2019 Medicines Discovery Catapult. All rights reserved.
Biochemical
assay
Cell-based
screen
Functional
assay
Animal
disease
model
Human
clinical trial
Build assay networks
from literature/patent
co-occurrence
Link to animal models
and genetics
Understand target
engagement/
pharmacodynamics
through development
Directed graph of all
assays from targets to
clinical trials
AssayNet – Translational Path From Lab To Clinic
Compound
© 2019 Medicines Discovery Catapult. All rights reserved.
Acknowledgements
Bissan Al-Lazikani Aroon Hingorani,
Juan Pablo-Casas
Marc Marti-Renom
Francesco Martinez
Magda Zwierzyna
Mark Davies
Krister Wennerberg
Mark Warren, Gemma Holliday, Andrew Pannifer
Richard Seacome, James Welsh, Matthew Hodsgkiss
Charles Bury, Kepa Brurusco-Goni, Daiel James, Adam Poulston,
Matt Cockayne, Baydr Earls, Herve Barjat, Dave Allen, James Peach
Nathan Dedman, George Papadatos,
Grace Mugumbate, Anna Gaulton,
Prudence Mutowo, Louisa Bellis,
Anne Hersey, Jon Chambers,
Michal Nowotka, Anneli Karlsson,
Ines Smit, Francis Atkinson,
Paula Magarinos, Felix Kruger, Rita Santos

Mais conteúdo relacionado

Mais procurados

Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Akash Arora
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsChristoph Steinbeck
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)Elia Brodsky
 
Genomics2 Phenomics Complete
Genomics2 Phenomics CompleteGenomics2 Phenomics Complete
Genomics2 Phenomics CompleteInterpretOmics
 
Computational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityComputational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityThet Su Win
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
Application of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicineApplication of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicinePranavathiyani G
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Industry Program In For Sci
Industry Program In For SciIndustry Program In For Sci
Industry Program In For Scibiinoida
 
Exploring Chemical and Biological Knowledge Spaces with PubChem
Exploring Chemical and Biological Knowledge Spaces with PubChemExploring Chemical and Biological Knowledge Spaces with PubChem
Exploring Chemical and Biological Knowledge Spaces with PubChemPaul Thiessen
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Pistoia Alliance
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)Pistoia Alliance
 

Mais procurados (20)

Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research
 
Ai and biology
Ai and biologyAi and biology
Ai and biology
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
 
Genomics2 Phenomics Complete
Genomics2 Phenomics CompleteGenomics2 Phenomics Complete
Genomics2 Phenomics Complete
 
Computational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityComputational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activity
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Application of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicineApplication of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicine
 
Pine Biotech
Pine BiotechPine Biotech
Pine Biotech
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Industry Program In For Sci
Industry Program In For SciIndustry Program In For Sci
Industry Program In For Sci
 
Exploring Chemical and Biological Knowledge Spaces with PubChem
Exploring Chemical and Biological Knowledge Spaces with PubChemExploring Chemical and Biological Knowledge Spaces with PubChem
Exploring Chemical and Biological Knowledge Spaces with PubChem
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
David Tyrpak CV
David Tyrpak CVDavid Tyrpak CV
David Tyrpak CV
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)
 

Semelhante a Data for AI models, the past, the present, the future

Impact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteImpact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteVictoria Lebedeva- Baxter ACIM
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Rick Silva
 
Biosample exchanges – the past, the current and the future – how do we make i...
Biosample exchanges – the past, the current and the future – how do we make i...Biosample exchanges – the past, the current and the future – how do we make i...
Biosample exchanges – the past, the current and the future – how do we make i...Pistoia Alliance
 
Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Rick Silva
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...University of California, San Francisco
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryJeremy Yang
 
Roc ipposi dec2015 challenges and opportunities for clinical research
Roc ipposi dec2015 challenges and opportunities for clinical researchRoc ipposi dec2015 challenges and opportunities for clinical research
Roc ipposi dec2015 challenges and opportunities for clinical researchipposi
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Robert Grossman
 
Slides for rare disorders meeting
Slides for rare disorders meetingSlides for rare disorders meeting
Slides for rare disorders meetingSean Ekins
 
April 25 webinar Bill Faloon presentation slides
April 25 webinar Bill Faloon presentation slides April 25 webinar Bill Faloon presentation slides
April 25 webinar Bill Faloon presentation slides maximuspeto
 
Cell Phones And Brain Cancer
Cell Phones And Brain CancerCell Phones And Brain Cancer
Cell Phones And Brain CancerDocJess
 
Transition transplant path to tissue engineer path new banff class 2017
Transition transplant path to tissue engineer path new banff class 2017 Transition transplant path to tissue engineer path new banff class 2017
Transition transplant path to tissue engineer path new banff class 2017 Kim Solez ,
 
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Philip Bourne
 
ContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific LiteratureContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific Literaturepetermurrayrust
 
Break through in biochemistry biotechnology[616]
Break through in biochemistry biotechnology[616]Break through in biochemistry biotechnology[616]
Break through in biochemistry biotechnology[616]Dr.K Madhuri
 

Semelhante a Data for AI models, the past, the present, the future (20)

Impact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteImpact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger Institute
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...
 
Biosample exchanges – the past, the current and the future – how do we make i...
Biosample exchanges – the past, the current and the future – how do we make i...Biosample exchanges – the past, the current and the future – how do we make i...
Biosample exchanges – the past, the current and the future – how do we make i...
 
Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Plosslides
PlosslidesPlosslides
Plosslides
 
PLOS slides
PLOS slidesPLOS slides
PLOS slides
 
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
 
Roc ipposi dec2015 challenges and opportunities for clinical research
Roc ipposi dec2015 challenges and opportunities for clinical researchRoc ipposi dec2015 challenges and opportunities for clinical research
Roc ipposi dec2015 challenges and opportunities for clinical research
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
Slides for rare disorders meeting
Slides for rare disorders meetingSlides for rare disorders meeting
Slides for rare disorders meeting
 
April 25 webinar Bill Faloon presentation slides
April 25 webinar Bill Faloon presentation slides April 25 webinar Bill Faloon presentation slides
April 25 webinar Bill Faloon presentation slides
 
HRB-Health Research In Action booklet (feat. NICB)
HRB-Health Research In Action booklet (feat. NICB)HRB-Health Research In Action booklet (feat. NICB)
HRB-Health Research In Action booklet (feat. NICB)
 
Biomarker-Vol-8
Biomarker-Vol-8Biomarker-Vol-8
Biomarker-Vol-8
 
Cell Phones And Brain Cancer
Cell Phones And Brain CancerCell Phones And Brain Cancer
Cell Phones And Brain Cancer
 
Transition transplant path to tissue engineer path new banff class 2017
Transition transplant path to tissue engineer path new banff class 2017 Transition transplant path to tissue engineer path new banff class 2017
Transition transplant path to tissue engineer path new banff class 2017
 
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
 
ContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific LiteratureContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific Literature
 
Break through in biochemistry biotechnology[616]
Break through in biochemistry biotechnology[616]Break through in biochemistry biotechnology[616]
Break through in biochemistry biotechnology[616]
 

Mais de Pistoia Alliance

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiPistoia Alliance
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinarPistoia Alliance
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRPistoia Alliance
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Pistoia Alliance
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesPistoia Alliance
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcarePistoia Alliance
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEPistoia Alliance
 
Blockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesBlockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesPistoia Alliance
 
Knowledge Graphs for Pharma PA Slideshow
Knowledge Graphs for Pharma PA SlideshowKnowledge Graphs for Pharma PA Slideshow
Knowledge Graphs for Pharma PA SlideshowPistoia Alliance
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Pistoia Alliance
 
Pistoia alliance harmonizing fair data catalog approaches webinar
Pistoia alliance harmonizing fair data catalog approaches webinarPistoia alliance harmonizing fair data catalog approaches webinar
Pistoia alliance harmonizing fair data catalog approaches webinarPistoia Alliance
 
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Pistoia Alliance
 
Pistoia Alliance datathon for drug repurposing for rare diseases
Pistoia Alliance datathon for drug repurposing for rare diseasesPistoia Alliance datathon for drug repurposing for rare diseases
Pistoia Alliance datathon for drug repurposing for rare diseasesPistoia Alliance
 
blockchain-introduction-pistoia-alliance
blockchain-introduction-pistoia-allianceblockchain-introduction-pistoia-alliance
blockchain-introduction-pistoia-alliancePistoia Alliance
 

Mais de Pistoia Alliance (20)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Fair by design
Fair by designFair by design
Fair by design
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIR
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcare
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoE
 
Blockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesBlockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab Slides
 
Knowledge Graphs for Pharma PA Slideshow
Knowledge Graphs for Pharma PA SlideshowKnowledge Graphs for Pharma PA Slideshow
Knowledge Graphs for Pharma PA Slideshow
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
 
Pistoia alliance harmonizing fair data catalog approaches webinar
Pistoia alliance harmonizing fair data catalog approaches webinarPistoia alliance harmonizing fair data catalog approaches webinar
Pistoia alliance harmonizing fair data catalog approaches webinar
 
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
 
Pistoia Alliance datathon for drug repurposing for rare diseases
Pistoia Alliance datathon for drug repurposing for rare diseasesPistoia Alliance datathon for drug repurposing for rare diseases
Pistoia Alliance datathon for drug repurposing for rare diseases
 
blockchain-introduction-pistoia-alliance
blockchain-introduction-pistoia-allianceblockchain-introduction-pistoia-alliance
blockchain-introduction-pistoia-alliance
 

Último

Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...astropune
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Call Girls in Nagpur High Profile
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...narwatsonia7
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomdiscovermytutordmt
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiSuhani Kapoor
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...narwatsonia7
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Deliverynehamumbai
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsGfnyt
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...narwatsonia7
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableDipal Arora
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...Neha Kaur
 

Último (20)

Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD available
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
 

Data for AI models, the past, the present, the future

  • 1. Data for AI Models, The Past, The Present, The Future John P. Overington jpo@md.catapult.org.uk
  • 2. © 2019 Medicines Discovery Catapult. All rights reserved. “Public data is the worst form of training data for AI except for all those other forms that have been tried from time to time” Winston Churchill, 2016
  • 3. © 2019 Medicines Discovery Catapult. All rights reserved. National facility connecting the UK community to accelerate innovative drug discovery • Independent not-for-profit organisation • Part of the U.K.’s Catapult network • Helping to deliver the U.K.’s Industrial Strategy • Funded by Innovate U.K., part of UK Research and Innovation, reporting to the Department for Business, Energy & Industrial Strategy • Focus on SME and translational academic sector support MDC - Medicines Discovery Catapult
  • 4. © 2019 Medicines Discovery Catapult. All rights reserved. ChEMBL, SureChEMBL & UniChem
  • 5. © 2019 Medicines Discovery Catapult. All rights reserved. • Originally developed 2003 at Inpharmatica • Spun out to public domain • The world’s largest primary public database of medicinal chemistry data • ~2.3 million compounds • ~11,000 targets • ~15 million bioactivities • Truly Open Data - CC-BY-SA license • API, MyChEMBL VM, RDF, full tables download…. • Basis of vast majority of AI innovation in compound design/optimisation Gaulton et al (2012) Nucleic Acids Research Database Issue. 40 D1100-1107 ChEMBL – www.ebi.ac.uk/chembl
  • 6. © 2019 Medicines Discovery Catapult. All rights reserved. Compound Assay Ki=4.5 nM >Thrombin MAHVRGLQLPGCLALAALCSLVHSQHVFLAPQQARSLLQRVRRANTFLEEVRKGNLERECVEETCSY EEAFEALESSTATDVFWAKYTACETARTPRDKLAACLEGNCAEGLGTNYRGHVNITRSGIECQLWRS RYPHKPEINSTTHPGADLQENFCRNPDSSTTGPWCYTTDPTVRRQECSIPVCGQDQVTVAMTPRSEG SSVNLSPPLEQCVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGD EEGVWCYVAGKPGDFGYCDLNYCEEAVEEETGDGLDEDSDRAIEGRTATSEYQTFFNPRTFGSGEAD CGLRPLFEKKSLEDKTERELLESYIDGRIVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVL TAAHCLLYPPWDKNFTENDLLVRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLK KPVAFSDYIHPVCLPDRETAASLLQAGYKGRVTGWGNLKETWTANVGKGQPSVLQVVNLPIVERPVC KDSTRIRITDNMFCAGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFY THVFRLKKWIQKVIDQFGE ED2=230 nM Inhibition of human Thrombin PTT (partial thromboplastin time) ChEMBL
  • 7. © 2019 Medicines Discovery Catapult. All rights reserved. • Public chemistry patent resource • Donated by Digital Science – SureChem commercial product • Automatically extracted chemical structures from full-text patents • >18 million chemical structures • Updated daily • Full chemistry data download SureChEMBL– www.surechembl.org Papadatos et al (2016) Nucl. Acids Res Database Issue D1220-1228
  • 8. © 2019 Medicines Discovery Catapult. All rights reserved. UniChem – www.ebi.ac.uk/unichem • Simple chemical integration service • >144 million structures from ~30 sources • URI/resource ID/Standard InChI based lookups • Available chemicals, PubChem, ZINC, real time, private • Chemical structure ‘Time Machine’ Chambers et al (2013) J. Cheminf. DOI:10.1186/1758-2946-5-3
  • 9. © 2019 Medicines Discovery Catapult. All rights reserved. Personal Perspectives on ChEMBL • Things that worked well • Single, major visionary funder – Wellcome Trust • Focus on data content/backend not GUI • Clear License – CC-BY-SA - same license as Wikipedia content • Private/secure services • Opportunism – SureChEMBL • Open Data in ChEMBL re-invigorated cheminformatics research • Things that didn’t work so well • Community curation attempts – armchair critics • Publisher interactions – except Royal Society of Chemistry • I would do things very differently now
  • 10. © 2019 Medicines Discovery Catapult. All rights reserved. The Reproducibility Reproducibility Crisis! Begley & Lee (2012) Nature DOI:10.1038/483531 & Prinz et al (2011) NRDD DOI:10.1038/nrd3439-c1
  • 11. © 2019 Medicines Discovery Catapult. All rights reserved. Enhanced data model for ChEMBL can appear as ‘errors’: e.g. complexes, receptor sets, model organisms “The more complex the parameter, the more frequent the errors” Errors in ChEMBL Tiikkainen et al (2013) JCIM DOI:10.1021/ci400099q
  • 12. © 2019 Medicines Discovery Catapult. All rights reserved. Errors in SureChEMBL Senger et al (2015) J Cheminf DOI:10.1186/s13321-015-0097-z
  • 13. © 2019 Medicines Discovery Catapult. All rights reserved. 0.2 0.4 0.6 −4 −2 0 2 4 diff density Inter-species Assay Variability Distribution of potency differences Scatter plot of measured potencies n = 2.781 Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333 Same compound, same end-point for rat and human orthologs pKi human pKirat diff(human, rat) norm.dens. 2 4 6 8 10 12 2 4 6 8 10 12 orthoFrame$afnty1 orthoFrame$afnty2
  • 14. © 2019 Medicines Discovery Catapult. All rights reserved. 2 4 6 8 10 12 2 4 6 8 10 12 sampleFrame$afnty1 sampleFrame$afnty2 0.2 0.4 0.6 −4 −2 0 2 4 diffdensity pKi Assay1 pKiAssay2 diff(assay1, assay2) n = 3.000 norm.dens. Scatter plot of measured potencies Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333 Same compound, same species, different publication Distribution of potency differences Inter-lab Assay Variability
  • 15. © 2019 Medicines Discovery Catapult. All rights reserved. density Inter-species vs Inter-lab Variability Krüger & Overington (2012) PLoS Comp. Biol. DOI:10.1371/journal.pcbi.1002333 pKii - pKij density Inter-laboratory Inter-species
  • 16. © 2019 Medicines Discovery Catapult. All rights reserved. Garnett et al (2012) Nature DOI:10.1371/journal.pcbi.1002333 & Barretina et al (2012) Nature DOI:10.1038/nature11003 Large-Scale Cell-line Screening Data
  • 17. © 2019 Medicines Discovery Catapult. All rights reserved. Inconsistent Cell-line Screening Data Haibe-Kains et al (2013) Nature DOI:10.1038/nature12831 (see also Stransky et al (2015) Nature DOI:10.1038/nature15736)
  • 18. © 2019 Medicines Discovery Catapult. All rights reserved. Primary Data – Batches and Replicates http://www.wexlerwallace.com/wp-content/uploads/2012/04/Southeast-Laborers-Health-v-Pfizer.pdf
  • 19. © 2019 Medicines Discovery Catapult. All rights reserved. Incorrect Chemical Structures Bosutinib Voxtalisib http://cen.acs.org/articles/90/web/2012/05/Bosutinib-Buyer-Beware.html, & Overington & Wennerberg unpublished
  • 20. © 2019 Medicines Discovery Catapult. All rights reserved. Biochemical assay Cell- based screen Functional assay Animal disease model Human clinical trial Variance – From Simple to Complex Inter study variance Number of assay variables Steady state Time dependent
  • 21. © 2019 Medicines Discovery Catapult. All rights reserved. The Present
  • 22. © 2019 Medicines Discovery Catapult. All rights reserved. MDC Collaborating With The Sector
  • 23. © 2019 Medicines Discovery Catapult. All rights reserved. DeepADMET • DeepADMET – InnovateUK grant • Optibrium Ltd. • Intellegens Ltd. • Medicines Discovery Catapult • MDC engineering software pipeline to supply ‘SAR data on demand’ • Flexible wrt document source • Fast and responsive • Significantly boost public/internal data • Deliver provenanced activity ‘vectors’ • Develop broader range of robust ADMET models using deep learning Document gathering NLP / NER Data Extraction & Heuristics SAR vectors
  • 24. © 2019 Medicines Discovery Catapult. All rights reserved. Secondary (compiled from literature review, databases) Primary (preferred) (measured in the same assay) Assay conditions Assay conditions Compound Compound * DeepADMET – Data Structure
  • 25. © 2019 Medicines Discovery Catapult. All rights reserved. The Future
  • 26. © 2019 Medicines Discovery Catapult. All rights reserved. https://stevenmiller888.github.io/mind-how-to-build-a-neural-network/ Neural Networks
  • 27. © 2019 Medicines Discovery Catapult. All rights reserved. Assays in Drug Discovery Biochemical assays Cell-based assays Functional assays In vivo assays Human studies Proteins Cell lines Tissues & organs Animal models Humans ancient “Human clinical trial” • Error prone, serendipitous discoveries • Traditional medicines: aspirin, quinine, …
  • 28. © 2019 Medicines Discovery Catapult. All rights reserved. Assays in Drug Discovery Biochemical assays Cell-based assays Functional assays In vivo assays Human studies Proteins Cell lines Tissues & organs Animal models Humans 1910s ancient Animal in vivo assays • Faster, safer, cheaper • … but less predictive
  • 29. © 2019 Medicines Discovery Catapult. All rights reserved. Assays in Drug Discovery Biochemical assays Cell-based assays Functional assays In vivo assays Human studies Proteins Cell lines Tissues & organs Animal models Humans 1920s 1910s ancient Ex vivo assays • Higher throughput, cheaper • Mechanistic insights • … but less predictive
  • 30. © 2019 Medicines Discovery Catapult. All rights reserved. Assays in Drug Discovery Biochemical assays Cell-based assays Functional assays In vivo assays Human studies Proteins Cell lines Tissues & organs Animal models Humans 1950s 1920s 1910s ancient Cell-based assays • Higher throughput, cheaper • Mechanistic insights • … but less predictive
  • 31. © 2019 Medicines Discovery Catapult. All rights reserved. Assays in Drug Discovery Biochemical assays Cell-based assays Functional assays In vivo assays Human studies Proteins Cell lines Tissues & organs Animal models Humans 1970s 1950s 1920s 1910s ancient Biochemical assays • Higher throughput • Mechanistic insights • Recombinant DNA technology • … but less predictive
  • 32. © 2019 Medicines Discovery Catapult. All rights reserved. Example Assay Path: Anti-inflammatory Drugs Prostaglandin G/H synthase 2 LPS-stimulated THP-1 cells LPS-stimulated human whole blood carrageenan- injected rat acute gout patient
  • 33. © 2019 Medicines Discovery Catapult. All rights reserved.
  • 34. © 2019 Medicines Discovery Catapult. All rights reserved. • Finding Assays • Text-mining across papers, patents, vendor catalogues • Indexing of Assays • specialist dictionaries - techniques, equipment, genes, end-points, …. • Classification of assays • Efficacy/ADMET & biochemical, cell-based, organoid, tissue, …. • Similarity of Assays • how ‘similar’ are two assays? • Chaining of Assays • constructing the directed graph • Learning thresholds • Identification of ‘triggers’ from chained, directed assay pairs AssayNet – Building the Network
  • 35. © 2019 Medicines Discovery Catapult. All rights reserved.
  • 36. © 2019 Medicines Discovery Catapult. All rights reserved.
  • 37. © 2019 Medicines Discovery Catapult. All rights reserved.
  • 38. © 2019 Medicines Discovery Catapult. All rights reserved.
  • 39. © 2019 Medicines Discovery Catapult. All rights reserved. Assay 1 Assay 2 • Decision Thresholds • What activity threshold in Assay 1 makes it worth measuring in Assay 2? • Learn from statistical distributions • Probably artefactually thresholded at integral pIC50 thresholds – e.g. 1mM (cf P-value distributions) Learning Decision Thresholds pIC50 pIC50 # # Compounds selected for screening in assay 2 Distribution of activity values of compounds in Assay 1 Sharp cutoff Sampled cutoff
  • 40. © 2019 Medicines Discovery Catapult. All rights reserved. Bayesian Networks
  • 41. © 2019 Medicines Discovery Catapult. All rights reserved. Bioassay data - ChEMBL Database IC50 4.5 nM >Thrombin MAHVRGLQLPGCLALAALCSLVHSQHVFLA PQQARSLLQRVRRANTFLEEVRKGNLEREC VEETCSYEEAFEALESSTATDVFWAKYTAC ETARTPRDKLAACLEGNCAEGLGTNYRGHV APTT 11 min Target Compoun d Bioassay data Compound Assay • Data manually extracted by a team of curators from published pharmacology and drug discovery literature (e.g. Journal of Medicinal Chemistry) • ChEMBL has transformed many aspects of cheminformatics research − Target prediction − Large-scale QSAR − Matched Molecular Pairs − … • ChEMBL is foundation data source of almost all published AI compound design research
  • 42. © 2019 Medicines Discovery Catapult. All rights reserved. 1 a b d 2 3c 5e 4 g f h 6 ChEMBL as a Graph assay-assay network compound-compound network b f c h ge a d 1 a 1 a compound assay has activity in Zwierzyna & Overington (in preparation) 1 2 4 6 5 3
  • 43. © 2019 Medicines Discovery Catapult. All rights reserved. Assay Network: Binding Assay Data (Subset) A subset of the assay network (~6,000 nodes) constructed using protein-binding assay data from ChEMBL Zwierzyna & Overington (in preparation)
  • 44. © 2019 Medicines Discovery Catapult. All rights reserved. Assay Network: Preclinical Assay Data PPAR binding assay DPP-4 binding assay in vivo assay cell-based assay Zwierzyna & Overington (in preparation) • Fragment of the assay network with a subset of bioassays testing antidiabetic compounds • Assays involving closely related biological targets are clustered together, e.g. assays involving various peroxisome proliferator-activated receptors in the green cluster • Antidiabetic compounds with different mechanism of action (e.g. DPP-4 inhibitors and PPAR agonists) are often tested in the same animal model (such as Zucker diabetic rat) → in vivo assays link distinct clusters
  • 45. © 2019 Medicines Discovery Catapult. All rights reserved. Animal Models: Assay Descriptions CHEMBL893931: “Inhibition of carrageenan-induced paw oedema in Sprague-Dawley rat at 5.16 mg/kg, sc after 3 hrs.”
  • 46. © 2019 Medicines Discovery Catapult. All rights reserved. Animal Models: Assay Descriptions Induced Model Phenotype Genetic Strain Dosage Administratio n Route Timing CHEMBL893931: “Inhibition of carrageenan-induced paw oedema in Sprague-Dawley rat at 5.16 mg/kg, sc after 3 hrs.”
  • 47. © 2019 Medicines Discovery Catapult. All rights reserved. Information Extraction From Assay Descriptions Antiallodynicactivity in Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed as attenuation of mechanicalallodynia JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN NP PP NP VP PP NP PP NP S CHEMBL1799193: Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia. Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia Experiment Phenotype PhenotypeStrain Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia A B C D Antiallodynicactivity in Wistar albino rat chronicconstriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN NP PP NP VP PP NP PP NP S CHEMBL1799193: Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia. Antiallodynicactivity Wistar albino rat chronicconstriction injury-induced neuropathic pain model assessed attenuation mechanical allodynia Experiment Phenotype PhenotypeStrain Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanical allodynia A B C D Sentence Noun Phrase Verb Phrase AdjectiveNoun Verb Prepositional Phrase Antiallodynicactivity in Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed as attenuation of mechanical allodynia JJ NN IN NNP NN NN JJ NN JJ JJ NN NN VBN IN NN IN JJ NN NP PP NP VP PP NP PP NP S CHEMBL1799193: Antiallodynicactivity in Wistar albino rat chronic constriction injury-induced neuropathic pain model assessed as attenuation of mechanical allodynia. Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanicalallodynia [9.11,8.73,9.19,...] [-0.17,-0.57,0.01,...] [8.95,3.39,-5.22,...] [9.08,8.02,8.09,...][9.11,8.73,9.19,...][9.56,9.14,2.10,...][9.10,8.72,9.18,...] Experiment Phenotype PhenotypeStrain Antiallodynicactivity Wistar albino rat chronicconstrictioninjury-induced neuropathic pain model assessed attenuation mechanical allodynia A B C D E Zwierzyna & Overington (in preparation)
  • 48. © 2019 Medicines Discovery Catapult. All rights reserved. PCA of Word2Vec Assay Descriptions Each assay description: average over its word vectors. Data points projected from a 200-dimensional space to 2D using PCA Zwierzyna & Overington, unpublished
  • 49. © 2019 Medicines Discovery Catapult. All rights reserved. Word2vec Embedding of Assays L01 (antineoplastic)M01 (anti-inflammatory) ChEMBL assays of known drugs annotated with different ATC codes (~15k of ~94k) N03 (antiepileptic) A10 (antidiabetic)C02 (antihypertensive) N02 (analgesic) Zwierzyna&Overington,unpublished
  • 50. © 2019 Medicines Discovery Catapult. All rights reserved. Biochemical assay Cell-based screen Functional assay Animal disease model Human clinical trial Build assay networks from literature/patent co-occurrence Link to animal models and genetics Understand target engagement/ pharmacodynamics through development Directed graph of all assays from targets to clinical trials AssayNet – Translational Path From Lab To Clinic Compound
  • 51. © 2019 Medicines Discovery Catapult. All rights reserved. Acknowledgements Bissan Al-Lazikani Aroon Hingorani, Juan Pablo-Casas Marc Marti-Renom Francesco Martinez Magda Zwierzyna Mark Davies Krister Wennerberg Mark Warren, Gemma Holliday, Andrew Pannifer Richard Seacome, James Welsh, Matthew Hodsgkiss Charles Bury, Kepa Brurusco-Goni, Daiel James, Adam Poulston, Matt Cockayne, Baydr Earls, Herve Barjat, Dave Allen, James Peach Nathan Dedman, George Papadatos, Grace Mugumbate, Anna Gaulton, Prudence Mutowo, Louisa Bellis, Anne Hersey, Jon Chambers, Michal Nowotka, Anneli Karlsson, Ines Smit, Francis Atkinson, Paula Magarinos, Felix Kruger, Rita Santos