CINF20 - 31 March 2019
Dr Frederik van den Broek, Elsevier Professional Services
Data-driven drug discovery for rare diseases
Tales from the trenches
This is what we are all after in drug discovery…
Image: Elsevier
If only drug discovery and development were that simple…
Disease
Drug
compound
If only drug discovery and development were that simple…
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
• Availability
• Synthesis
• PK/PD
• …
• Genotype
• Phenotype
• Individual
This makes it all a lengthy and costly process
Image: https://www.phrma.org/graphic/the-biopharmaceutical-research-and-development-process
With rare diseases it is even harder
Small(er) patient populations lead to:
• Less (comprehensive) medical and scientific knowledge
• Small populations for clinical trials
• Lack of awareness among doctors, researchers, and policymakers
• Smaller potential market size for a drug
Image: http://www.campingtourist.com/camping-activities/climbing/difficult-mountains-climb/
Drug repurposing: a new hope for rare diseases
• Less costly and of interest to pharma
• Quicker to Phase II/III trials, so hopefully quicker to market
• Needs reliable information from various sources to find suitable repurposing candidates
Image: https://www.starwars.com/news/poll-what-is-the-best-scene-in-star-wars-a-new-hope
Accelerate with new knowledge and data
Various initiatives we were recently involved in
• Project with Findacure to find drug repurposing candidates for Congenital
Hyperinsulinism
• Pistoia Hackathon: Elsevier-Findacure challenge on Friedreich’s ataxia
• Sub-network enrichment analysis for neuromuscular disorder pathways
• Disease pathway analysis for Huntington's disease
• Pistoia Datathon for drug repurposing for rare diseases
Congenital hyperinsulinism (CHI)
• A rare genetic disease
• Permanently excessive level of insulin in the blood
• Develops within the first few days of life
• Can lead to brain injury or even death
• In the most severe cases the only viable treatment is the removal of the pancreas, consigning the patient to a lifetime of diabetes
Image: https://res.cloudinary.com/indiegogo-media-prod-cld/image/upload/c_limit,w_620/v1440424745/uzvnqzhvbpsrtthzxqpu.jpg
Creating a comprehensive view of CHI
• CHI Literature Library
• Disease, Target, Pathway, and Compound Analysis
• Research Landscape Analysis
Information Assets Applied
• Content: Elsevier’s vast set of literature and patent data
• Data normalization: taxonomies and dictionaries to normalize author names, institutions, drugs, targets, and other important terms
• Information extraction: finding semantic relationships, targets, pathways, drugs, and bioactivities
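The dictionary-based normalization mentioned above can be illustrated with a minimal sketch. The real taxonomies and dictionaries are not shown in the slides, so the `SYNONYMS` table and the helper names `build_lookup` and `normalize` are hypothetical. The matching rule (lowercase, strip punctuation and spaces) is also an assumption for illustration only.

```python
import re

# Hypothetical dictionary: canonical term -> known variants and abbreviations
SYNONYMS = {
    "Congenital hyperinsulinism": ["CHI"],
    "Friedreich's ataxia": ["FRDA", "Friedreich ataxia", "Friedrich’s Ataxia"],
}


def _key(term: str) -> str:
    """Reduce a term to a comparison key: lowercase letters and digits only."""
    return re.sub(r"[^a-z0-9]", "", term.lower())


def build_lookup(dictionary: dict) -> dict:
    """Map the key of every canonical term and variant to the canonical term."""
    lookup = {}
    for canonical, variants in dictionary.items():
        for term in (canonical, *variants):
            lookup[_key(term)] = canonical
    return lookup


def normalize(term: str, lookup: dict) -> str:
    """Return the canonical form, or the input unchanged if it is unknown."""
    return lookup.get(_key(term), term)


LOOKUP = build_lookup(SYNONYMS)
print(normalize("frda", LOOKUP))
print(normalize("Friedrich’s Ataxia", LOOKUP))
```

Returning unknown terms unchanged keeps the step non-destructive, so unmatched terms can be reviewed and added to the dictionary later.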
Building and refining the CHI disease model
• Picked relevant pathways (from a collection of 1800 models)
• Explored functions of proteins using 6.2M pre-text-mined relations and embedded Gene Ontology
• Summarized what is known about the CHI mechanism in an overview model
From pathways to CHI treatments:
Automated analysis combines bioassay data with pathway data
Step 1: Find all targets that could be used to affect the disease state
Step 2: Query for each target to find compounds that have high affinity for them (>6 log units)
Step 3: Collate data by compound to summarize the targets/activities related to disease that the compound hits
• Compute geometric mean of activities for ranking
• Rank by number of targets and geometric mean of activities against targets
Annotations on the results table:
• All compounds that were observed to bind to targets in pathway
• Sorted by number of active targets. Too many targets may suggest lack of specificity.
• Targets and activities for each compound
• Mean of activities among these targets
• Drug-likeness metrics for sorting/classification
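The three steps on this slide can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the pipeline used in the project. The records, the target set, and the function name `rank_compounds` are invented. ">6 log units" is read as pX > 6, where pX is the negative base-10 logarithm of the molar activity value, so the cut-off corresponds to activity stronger than 1 µM. Averaging pX values is equivalent to taking the geometric mean of the underlying concentrations. Sorting compounds with more targets first is also an assumption, since the slide does not state the direction.

```python
from collections import defaultdict

# Step 1 (assumed already done): targets relevant to the disease pathways
PATHWAY_TARGETS = {"target_1", "target_2", "target_3"}

# Hypothetical (compound, target, pX) bioactivity records
RECORDS = [
    ("compound_A", "target_1", 7.2),
    ("compound_A", "target_2", 6.5),
    ("compound_B", "target_1", 8.1),
    ("compound_B", "target_9", 9.0),  # target outside the pathway, ignored
    ("compound_C", "target_2", 5.4),  # below the cut-off, ignored
    ("compound_C", "target_3", 6.9),
]


def rank_compounds(records, pathway_targets, min_px=6.0):
    """Collate qualifying hits per compound and rank the compounds."""
    hits = defaultdict(dict)
    for compound, target, px in records:
        # Step 2: keep only high-affinity hits on pathway targets
        if target in pathway_targets and px > min_px:
            # Keep the strongest measurement per compound-target pair
            hits[compound][target] = max(px, hits[compound].get(target, px))

    # Step 3: summarize per compound
    ranked = []
    for compound, by_target in hits.items():
        # Mean pX = -log10 of the geometric mean of the molar activities
        mean_px = sum(by_target.values()) / len(by_target)
        ranked.append((compound, len(by_target), mean_px))

    # Assumed order: more targets first, then higher mean pX
    ranked.sort(key=lambda row: (-row[1], -row[2]))
    return ranked


RANKED = rank_compounds(RECORDS, PATHWAY_TARGETS)
for compound, n_targets, mean_px in RANKED:
    print(f"{compound}: {n_targets} target(s), mean pX {mean_px:.2f}")
```

Because too many targets may suggest a lack of specificity, a real ranking would probably cap or penalize the target count rather than simply sort on it.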
Pistoia Hackathon Challenge (2017)
Elsevier would like you to demonstrate the ability of deep learning to help
Findacure, a UK-based charity, accelerate treatment and clinical research for
Friedreich’s ataxia (FRDA). You’ll have access to a heterogeneous set of
data related to the disease: biological pathway analysis, associated chemical
compounds and bioactivities, potential candidates for drug re-purposing, full-
text scientific literature and clinical trial data.
Basically, giving others a go with the data sets we worked with on CHI….
Promising results, but still hard work
“We spent most of our time the first day just trying to get our heads around
the data, so we could start to find some solutions. Even opening the files was
tricky.” The students used various tools to try to extract data from the
provided XML files, but it was slow going. Daniel [one of the participants]
commented that, “we wound up having to do a lot of things manually, so we
could at least read the files in plain text.”
Sharing disease pathways
• Shared curated pathways (with supporting literature
references) with rare disease organisations to help their
discussions with researchers and fill in potential “blanks”
• Comparing gene expression algorithms for the identification
of expression regulators
• Well-defined datasets, with supporting
literature references which resonate
with researchers
Datathon (2019):
Applying AI in Drug Repurposing for Rare Diseases
“Machine learning won’t work if your data is rigidly siloed.”
“One major challenge is collecting enough reliable information to properly train AI systems. AI is as good as the data.”
Nick Patience, Founder, 451 Research
“Organizations need to make sure that the data being accessed is treated and defined consistently across the sources. Otherwise, virtualization won't work.”
“All the major AI advances have been fueled by advances in data sets. The algorithms are easy…”
“Collecting, classifying and labeling datasets used to train the algorithms is the grunt work that’s difficult.”
Aspuru-Guzik, Professor of Chemistry & Machine Learning, Harvard University
JJ Guy, CTO, Jask (AI co.)
• ‘Siloed’
• Lack of standards
• Requires labeling and context
• Poor quality
Using the Entellect Platform and Data Curation
• Access, curation of authoritative life science data
• Integration of disparate data, structured and unstructured
• Normalized and standardized data with industry standard taxonomies
• Build custom and off-the-shelf analytics tools
‘Un-siloed’, Harmonized, Enriched and linked, Quality
Using the Entellect Platform and Data Curation
Figure: knowledge graph linking entity types (Adverse Event, Person, Org, Bioactivity, Pathway, Disease, Bioprocess, Trial, Species, Target, Drug, Assay), with the example question “Which drugs affect this target?”
Substance:
- provenanceName
- substance
- name
- compoundType
- substanceTypeName
- inchiCode
- molecularFormula
- charge
- numberOfAtoms
- numberOfComponents
- numberOfElements
- numberOfFragments
- numberOfStructure
- molWeightPublishedValue
- molWeightPublishedUnit
- molWeightStandardValue
- mpvalue
Bioactivity:
- provenanceName
- effect
- inducedBy
- target
- targetsCount
- bioactivityParameterName
- displayValue
- publishedValue
- publishedUnit
- pX
Target:
- provenanceName
- target
- uniprotId
- sequence
- targetType
- label
- speciesId
- speciesName
- geneSymbol
Entellect Platform and Data Curation
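To show how linked records of this shape could answer the slide's example question, "Which drugs affect this target?", here is a small in-memory sketch. It is not the Entellect API, which the slides do not describe. The attribute names are copied from the field lists above, but the links between them are assumptions: `inducedBy` is assumed to hold a `Substance.substance` identifier and `Bioactivity.target` a `Target.target` identifier. All records and the function `drugs_affecting` are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Substance:
    substance: str  # assumed to be the substance identifier
    name: str
    provenanceName: str


@dataclass(frozen=True)
class Target:
    target: str  # assumed to be the target identifier
    geneSymbol: str
    uniprotId: str


@dataclass(frozen=True)
class Bioactivity:
    inducedBy: str  # assumed to reference Substance.substance
    target: str  # assumed to reference Target.target
    pX: float
    provenanceName: str


def drugs_affecting(gene_symbol, substances, targets, bioactivities, min_px=6.0):
    """Return (name, pX, provenance) for substances active on the given gene."""
    target_ids = {t.target for t in targets if t.geneSymbol == gene_symbol}
    names = {s.substance: s.name for s in substances}
    hits = [
        (names[b.inducedBy], b.pX, b.provenanceName)
        for b in bioactivities
        if b.target in target_ids and b.pX >= min_px and b.inducedBy in names
    ]
    # Strongest activity first; provenance stays attached to every hit
    return sorted(hits, key=lambda hit: -hit[1])


# Invented toy records
SUBSTANCES = [
    Substance("S1", "drug_x", "source_a"),
    Substance("S2", "drug_y", "source_b"),
]
TARGETS = [
    Target("T1", "GENE1", "hypothetical_id_1"),
    Target("T2", "GENE2", "hypothetical_id_2"),
]
BIOACTIVITIES = [
    Bioactivity("S1", "T1", 7.5, "source_a"),
    Bioactivity("S2", "T1", 6.2, "source_b"),
    Bioactivity("S2", "T2", 8.0, "source_b"),
]

RESULT = drugs_affecting("GENE1", SUBSTANCES, TARGETS, BIOACTIVITIES)
print(RESULT)
```

Carrying `provenanceName` through to the result keeps the source of each data point visible to the researcher.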
Various teams using various approaches
• Semantic data: Target Identification
• Semantic data: Small Molecule Binding
• Machine Learning
− Ensemble Learning
− Mol2Vec, Prot2Vec
− Network diffusion
• Expert collaboration
− Virtual docking
− Adverse Event profiling
“I could work on the important stuff straight away, using all the data”
Promising results so far (March 2019)
Aiming to make data-driven drug discovery for rare diseases
a little easier…
Conclusions
• Data, data, data…
• Data has to be FAIR and of good and trusted provenance as the
researchers and clinicians will want to see the “chain of evidence” (beware
of black box models)
• Data sets also have to be FAIR for each other: the integrated approaches that repurposing needs require data sets linked across silos and domains, to go from disease to target to compound (and back)
Image: Sangya Pundir, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=53414062
Acknowledgements
• Maria Shkrob
• Jabe Wilson
• Anton Yuryev
• Matthew Clark
• Christy Wilson
• Finlay Maclean
• Elsevier’s Entellect team
• Pistoia hackathon and datathon teams
Questions?
By Malis - https://commons.wikimedia.org/w/index.php?curid=2633354
Appendix – Datathon approaches

 
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
 
(Rocky) Jaipur Call Girl - 09521753030 Escorts Service 50% Off with Cash ON D...
(Rocky) Jaipur Call Girl - 09521753030 Escorts Service 50% Off with Cash ON D...(Rocky) Jaipur Call Girl - 09521753030 Escorts Service 50% Off with Cash ON D...
(Rocky) Jaipur Call Girl - 09521753030 Escorts Service 50% Off with Cash ON D...
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 8250192130 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 8250192130 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 8250192130 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 8250192130 ⟟ Call Me For Ge...
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
 

Data-driven drug discovery for rare diseases - Tales from the trenches (CINF 20, ACS National Meeting 2019-03-31)

  • 1. CINF20 - 31 March 2019 Dr Frederik van den Broek, Elsevier Professional Services Data-driven drug discovery for rare diseases Tales from the trenches
  • 2. This is what we are all after in drug discovery… Image: Elsevier
  • 3. If drug discovery and development only were that simple… Disease Drug compound
  • 4. If drug discovery and development only were that simple… Disease Protein Target Drug compound
  • 5. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • …
  • 6. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 7. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 8. This makes it all a lengthy and costly process Image: https://www.phrma.org/graphic/the-biopharmaceutical-research-and-development-process
  • 9. With rare diseases it is even harder Small(er) patient populations leading to • Less (integral) medical and scientific knowledge • Small population for clinical trials • Lack of awareness among doctors, researchers, policymakers • Smaller potential market size for a drug Image: http://www.campingtourist.com/camping-activities/climbing/difficult-mountains-climb/
  • 10. Drug repurposing: a new hope for rare diseases • Less costly and of interest for pharma • Quicker to Phase II/III tests, so hopefully quicker to market • Need reliable information from various sources to find suitable repurposing candidates Image: https://www.starwars.com/news/poll-what-is-the-best-scene-in-star-wars-a-new-hope
  • 11. Accelerate with new knowledge and data Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 12. Various initiatives we were recently involved in • Project with Findacure to find drug repurposing candidates for Congenital Hyperinsulinism • Pistoia Hackathon: Elsevier-Findacure challenge on Friedreich’s ataxia • Sub-network enrichment analysis for neuromuscular disorder pathways • Disease pathway analysis for Huntington's disease • Pistoia Datathon for drug repurposing for rare diseases
  • 13. Congenital hyperinsulinism (CHI) • A rare genetic disease • Permanently excessive level of insulin in the blood • Develops within the first few days of life • Can lead to brain injury or even death • In the most severe cases the only viable treatment is the removal of the pancreas, consigning the patient to a lifetime of diabetes Image: https://res.cloudinary.com/indiegogo-media-prod-cld/image/upload/c_limit,w_620/v1440424745/uzvnqzhvbpsrtthzxqpu.jpg
  • 14. Creating a comprehensive view of CHI • CHI Literature Library • Disease, Target, Pathway, and Compound Analysis • Research Landscape Analysis Information Assets Applied: • Content: Elsevier’s vast set of literature and patent data • Data normalization: taxonomies and dictionaries to normalize author names, institutions, drugs, targets, and other important terms • Information extraction: finding semantic relationships, targets, pathways, drugs, and bioactivities
  • 15. Building and refining the CHI disease model • Picked relevant pathways (from a collection of 1800 models) • Explored functions of proteins using 6.2M pre-text-mined relations and embedded Gene Ontology • Summarized what is known about the CHI mechanism in an overview model
  • 16. From pathways to CHI treatments: automated analysis combines bioassay data with pathway data. Step 1: Find all targets that could be used to affect the disease state. Step 2: Query for each target to find compounds that have high affinity for them (>6 log units). Step 3: Collate data by compound to summarize the targets/activities related to disease that the compound hits • Compute geometric mean of activities for ranking • Rank by number of targets and geometric mean of activities against targets. Output: targets and activities for each compound; mean of activities among these targets; drug-likeness metrics for sorting/classification • All compounds that were observed to bind to targets in pathway • Sorted by number of active targets. Too many targets may suggest lack of specificity.
  • 17. Pistoia Hackathon Challenge (2017) Elsevier would like you to demonstrate the ability of deep learning to help Findacure, a UK-based charity, accelerate treatment and clinical research for Friedreich’s ataxia (FRDA). You’ll have access to a heterogeneous set of data related to the disease: biological pathway analysis, associated chemical compounds and bioactivities, potential candidates for drug re-purposing, full-text scientific literature and clinical trial data. Basically, giving others a go with the data sets we worked with on CHI….
  • 18. Promising results, but still hard work “We spent most of our time the first day just trying to get our heads around the data, so we could start to find some solutions. Even opening the files was tricky.” The students used various tools to try to extract data from the provided XML files, but it was slow going. Daniel [one of the participants] commented that, “we wound up having to do a lot of things manually, so we could at least read the files in plain text.”
  • 19. Sharing disease pathways • Shared curated pathways (with supporting literature references) with rare disease organisations to help their discussions with researchers and fill in potential “blanks” • Comparing gene expression algorithms for the identification of expression regulators • Well-defined datasets, with supporting literature references which resonate with researchers
  • 20. Datathon (2019): Applying AI in Drug Repurposing for Rare Diseases
  • 21. Using the Entellect Platform and Data Curation • “Machine learning won’t work if your data is rigidly siloed.” • “One major challenge is collecting enough reliable information to properly train AI systems. AI is as good as the data.” • Nick Patience, Founder, 451 Research • “Organizations need to make sure that the data being accessed is treated and defined consistently across the sources. Otherwise, virtualization won't work.” • “All the major AI advances have been fueled by advances in data sets. The algorithms are easy…. Collecting, classifying and labeling datasets used to train the algorithms is the grunt work that’s difficult.” • Aspuru-Guzik, Professor of Chemistry & Machine Learning, Harvard University • JJ Guy, CTO, Jask (AI co.) • Four data challenges: ‘Siloed’ • Lack of standards • Requires labeling and context • Poor quality
  • 22. Using the Entellect Platform and Data Curation • Access, curation of authoritative life science data • Integration of disparate data, structured and unstructured • Normalized and standardized data with industry standard taxonomies • Build custom and off-the-shelf analytics tools • ‘Un-siloed’ • Harmonized • Enriched and linked • Quality • Nick Patience, Founder, 451 Research • Aspuru-Guzik, Professor of Chemistry & Machine Learning, Harvard University
  • 23. Entellect Platform and Data Curation • Example question: Which drugs affect this target? • Entities: Adverse Event, Person, Org, Bioactivity, Pathway, Disease, Bioprocess, Trial, Species, Target, Drug, Assay • Substance: provenanceName, substance, name, compoundType, substanceTypeName, inchiCode, molecularFormula, charge, numberOfAtoms, numberOfComponents, numberOfElements, numberOfFragments, numberOfStructure, molWeightPublishedValue, molWeightPublishedUnit, molWeightStandardValue, mpvalue • Bioactivity: provenanceName, effect, inducedBy, target, targetsCount, bioactivityParameterName, displayValue, publishedValue, publishedUnit, pX • Target: provenanceName, target, uniprotId, sequence, targetType, label, speciesId, speciesName, geneSymbol
  • 24. Various teams using various approaches • Semantic data: Target Identification • Semantic data: Small Molecule Binding • Machine Learning − Ensemble Learning − Mol2Vec, Prot2Vec − Network diffusion • Expert collaboration − Virtual docking − Adverse Event profiling “I could work on the important stuff straight away, using all the data”
  • 25. Promising results so far (March 2019)
  • 26. Aiming to make data-driven drug discovery for rare diseases a little easier… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 27. Conclusions • Data, data, data… • Data has to be FAIR and of good and trusted provenance, as researchers and clinicians will want to see the “chain of evidence” (beware of black box models) • Data sets also have to be FAIR for each other: the integral approaches that repurposing needs require data sets that are linked across silos and domains, to go from disease to target to compound (and back) Image: Sangya Pundir, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=53414062
  • 28. Acknowledgements • Maria Shkrob • Jabe Wilson • Anton Yuryev • Matthew Clark • Christy Wilson • Finlay Maclean • Elsevier’s Entellect team • Pistoia hackathon and datathon teams
  • 29. Questions? By Malis - https://commons.wikimedia.org/w/index.php?curid=2633354
  • 30. Appendix – Datathon approaches
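
The three-step analysis described on slide 16 (find pathway targets, query compounds with activity above 6 log units, collate and rank by compound) could be sketched roughly as below. This is only an illustration under stated assumptions, not the implementation used in the project: the `target` and `pX` field names follow the Bioactivity record listed on slide 23, while the `substance` field, the `rank_compounds` function, the choice to take the geometric mean over the pX values directly, the descending sort order, and all compound and target names are hypothetical placeholders invented for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import geometric_mean


@dataclass(frozen=True)
class Bioactivity:
    """One bioassay measurement; 'target' and 'pX' mirror slide 23."""
    substance: str  # hypothetical field: name of the tested compound
    target: str     # name of the protein target
    pX: float       # activity in negative-log units (higher = more potent)


def rank_compounds(pathway_targets, bioactivities, min_px=6.0):
    """Collate activities by compound and rank them.

    Step 1 is assumed done: 'pathway_targets' holds the disease targets.
    Step 2 keeps measurements on those targets with pX above 'min_px'.
    Step 3 summarizes per compound and ranks by number of targets hit,
    then by geometric mean of the retained pX values (an assumption).
    """
    best = defaultdict(dict)  # compound -> {target: highest pX seen}
    for rec in bioactivities:
        if rec.target in pathway_targets and rec.pX > min_px:
            previous = best[rec.substance].get(rec.target)
            if previous is None or rec.pX > previous:
                best[rec.substance][rec.target] = rec.pX

    rows = []
    for substance, hits in best.items():
        rows.append((substance, len(hits), geometric_mean(hits.values()), hits))

    # Most targets first, then highest geometric mean. A very high target
    # count may indicate poor specificity, so it deserves manual review.
    rows.sort(key=lambda row: (-row[1], -row[2]))
    return rows


# Invented placeholder data, purely to exercise the function.
pathway = {"T1", "T2"}
records = [
    Bioactivity("compound_A", "T1", 7.0),
    Bioactivity("compound_A", "T2", 8.0),
    Bioactivity("compound_B", "T1", 9.0),
    Bioactivity("compound_B", "T3", 9.5),  # T3 is not a pathway target
    Bioactivity("compound_C", "T2", 5.5),  # below the 6 log unit cut-off
]

ranked = rank_compounds(pathway, records)
for substance, n_targets, gmean, hits in ranked:
    print(f"{substance}: {n_targets} target(s), geometric mean pX {gmean:.2f}")
```

Keeping the per-target activities in each result row preserves the link from a ranked compound back to the measurements behind it, which fits the "chain of evidence" point made in the conclusions.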