SlideShare a Scribd company logo
1 of 26
Download to read offline
Molecular scaffolds are special
and useful guides for discovery
Jeremy Yang, UNM & IU
Cristian Bologa, UNM
David Wild, IU
Tudor Oprea, UNM
ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN
CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
Molecular scaffolds are special
and useful guides for discovery
Jeremy Yang, UNM & IU
Cristian Bologa, UNM
David Wild, IU
Tudor Oprea, UNM
ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN
CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
What is a molecular scaffold?
• "Ring-system"
• "Ring"
• "Core"
• "Framework"
Can you identify this famous scaffold?
Some famous scaffolds
beta – lactam
(penicillins,
cephalosporins )
Millions saved,
billions earned
steroid
(testosterone,
hydrocortisone, etc. )
Wonder drugs and
hormones
benzodiazepine
(Valium, flurazepam,
etc. )
“Mother’s little
helper”
Scaffolds are special because:
3D shape
Estradiol
docked into
ER-α
(OpenEye
Fred, Vida)
Scaffold scholarship & software
• Bemis & Murcko, “Molecular
frameworks”, 1996.
• Lewell et al., “Drug rings
database”, 2003.
• Wilkens et al., “HierS: hierarchical
scaffolds”, 2005.
• Ertl et al., “Quest for the Rings”,
2006.
• Clustering, indexing databases.
• Navigation of chemical space.
• Data reduction, visualization.
• R-group / SAR analyses.
• Bioactivity prediction.
• Promiscuity prediction.
Scaffold Applications
What can be done with scaffolds?
"The only rules that really matter are these: what a man can do and what a man can't do."
- Jack Sparrow
Scaffold Applications: Scaffold Hunter
Interactive exploration of chemical space with Scaffold Hunter,
S Wetzel, K Klein, S Renner, D Rauh, T Oprea, P Mutzel, H
Waldmann, Nat Chem Bio, 5, 2009, 581-583.
Scaffold Applications: Scaffold Hopper
Scaffold Hopper, NCATS/NCGC, http://tripod.nih.gov,
http://tripod.nih.gov/files/ACS_apr8_2013.pdf.
Scaffold Applications: CARLSBAD
CARLSBAD:
The Power to Explore Biological
Networks via Chemical Patterns
The CARLSBAD Database: A Confederated Database of Chemical Bioactivities,
S. L. Mathias, J. Hines-Kay, J. J. Yang, G. Zahoransky-Kohalmi, C. G. Bologa, O.
Ursu and T. I. Oprea, Database, 2013, bat044. http://carlsbad.health.unm.edu
Scaffold Applications: Molecule Cloud
The Molecule Cloud - compact visualization of large collections of
molecules, P Ertl and B Rohde, J. Cheminfo, 2012, 4:12.
Scaffold Applications: Badapple
(BioActivity Data Associative Promiscuity Pattern Learning Engine)
Translational Informatics Public Webapps:
http://pasilla.health.unm.edu/
See also my Badapple talk in CINF session "Integrative Chemogenomics Knowledge Mining Using NIH
Open Access Resources", Tues. Sept. 9, 10:45am, Rm. 140.
Scaffold Applications:
Badapple Promiscuity Plugin
Badapple Promiscuity
Plugin for BARD,
http://bard.nih.gov
Scaffold software: UNM-Biocomp-HScaf
(Open-source Google Code project)
http://code.google.com/p/unm-biocomp-hscaf/
UNM Translational Informatics Public Web Apps:
http://pasilla.health.unm.edu/
Demo web app: HScaf
Scaffold analysis algorithm
• Remove non-linking chains
• Keep linking chains
• Keep atoms multiply-bonded to rings and chains
• Special case: ignore solo-benzene.
HierS scaffold hierarchy
quinine
Bemis-Murcko
framework
scaffolds
Cheminformatics and scaffolds:
Relevant methods
• SSSR (Smallest Set of Smallest Rings)
• Canonicalization (e.g. Morgan, CanSMILES)
• Scaffolds vs. MCS (max common subgraph)
• Fingerprints, descriptors, similarity
• Proposed new method: scaffold-based similarity
More scaffold charms
• Patents, Markush, $$$.
• Lead discovery ~ scaffold discovery.
• Organic chemists like scaffolds.
• Scaffolds can be "privileged".
Scaffolds & drug-scaffolds, the privileged few
explaining a lot of activity...
Dataset:
BARD,
MLSMR,
MLP HTS
Totals: compounds:
373,802 ; scaffolds:
146,024 ; assays: 528
; wells/results:
30,612,714;
drugs: 283;
drugscafs: 1958
% total
activity
# scaffolds %
scaffolds
All 50% 1979 1.4%
All 75% 11,645 8%
Drugs 50% 54 2.8%
Drugs 90% 327 16.7%
“activity of DB” ~ # active scaffold-instances
Privileged scaffolds concept
Nature favors a few privileged scaffolds, a.k.a.
"privileged structures", for multiple receptors.
"What is clear is that certain “privileged structures” are capable
of providing useful ligands for more than one receptor and that
judicious modification of such structures could be a viable
alternative in the search for new receptor agonists and
antagonists."*
*Methods for drug discovery: development of potent, selective, orally effective
cholecystokinin antagonists, Evans et al., J. Med. Chem., 1988, 31, 2235.
News: antibiotic, scaffold:
Anthracimycin
Anthracimycin, a Potent Anthrax Antibiotic from a Marine-Derived Actinomycete,
Kyoung Hwa Jang et al., Angewandte Chemie, vol. 52, no 30, 2013, pp7822–7824; doi:
10.1002/anie.201302749.
Problems with scaffolds
• Definition of "scaffold" not consistent & rigorous
among chemists & cheminformaticians.
Testosterone
Estradiol
Danazol
Cyproterone
acetate
"We think in generalities, but we live in detail." - Alfred North Whitehead
http://en.wikipedia.org/wiki/Steroids
Steroidogenesis
[#8]~[#6;R1]~1~[#6;R1]~[#6;R1]~[#6;R2]~2~[#6;R2]~1
~[#6;R2]~[#6;R1]~[#6;R2]~1~[#6;R2]~2~[#6;R1]~[#6;R
1]~[#6;R2]~2~[#6;R1]~[#6;R1](~[#8])~[#6;R1]~[#6;R1]
~[#6;R2]~1~2
Steroid pattern
definition via
SMARTS
Problems solved by Cheminformatics
Conclusion:
Molecular scaffolds
(like cheminformatics itself)
are special and useful guides
for discovery
in chemical biology,
chemogenomics,
and drug discovery
ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN
CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
Thank Yous:
Cristian Bologa, UNM
Tudor Oprea, UNM
Oleg Ursu, UNM
David Wild, IU
Gary Wiggins, IU
Happy Explorations!
ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN
CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science

More Related Content

What's hot

Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screeningJoon Jyoti Sahariah
 
Role of Drug Design in Medicinal Chemistry
Role of Drug Design in Medicinal ChemistryRole of Drug Design in Medicinal Chemistry
Role of Drug Design in Medicinal ChemistryGirinath Pillai
 
Molecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusMolecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusShikha Popali
 
Pharmacophore
PharmacophorePharmacophore
Pharmacophoreirecen
 
Pharmacophore Identification Programs.pptx
Pharmacophore Identification Programs.pptxPharmacophore Identification Programs.pptx
Pharmacophore Identification Programs.pptxAnkita Nishad
 
Clinical data management
Clinical data managementClinical data management
Clinical data managementAjay Murali
 
Lecture 4 ligand based drug design
Lecture 4 ligand based drug designLecture 4 ligand based drug design
Lecture 4 ligand based drug designRAJAN ROLTA
 
Analog design medicinal chemistry
Analog design medicinal chemistryAnalog design medicinal chemistry
Analog design medicinal chemistryMohit umare
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSDENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSShikha Popali
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping GamitKinjal
 
Protein-ligand docking
Protein-ligand dockingProtein-ligand docking
Protein-ligand dockingbaoilleach
 
Qsar and drug design ppt
Qsar and drug design pptQsar and drug design ppt
Qsar and drug design pptAbhik Seal
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryAbhik Seal
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision TreeNextMove Software
 
Target identification
Target identificationTarget identification
Target identificationSachin Jangra
 

What's hot (20)

Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screening
 
Role of Drug Design in Medicinal Chemistry
Role of Drug Design in Medicinal ChemistryRole of Drug Design in Medicinal Chemistry
Role of Drug Design in Medicinal Chemistry
 
Denovo Drug Design
Denovo Drug DesignDenovo Drug Design
Denovo Drug Design
 
2D - QSAR
2D - QSAR2D - QSAR
2D - QSAR
 
Molecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusMolecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabus
 
QSAR
QSARQSAR
QSAR
 
Pharmacophore
PharmacophorePharmacophore
Pharmacophore
 
Denovo
DenovoDenovo
Denovo
 
Pharmacophore Identification Programs.pptx
Pharmacophore Identification Programs.pptxPharmacophore Identification Programs.pptx
Pharmacophore Identification Programs.pptx
 
Clinical data management
Clinical data managementClinical data management
Clinical data management
 
Lecture 4 ligand based drug design
Lecture 4 ligand based drug designLecture 4 ligand based drug design
Lecture 4 ligand based drug design
 
Analog design medicinal chemistry
Analog design medicinal chemistryAnalog design medicinal chemistry
Analog design medicinal chemistry
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSDENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping
 
Protein-ligand docking
Protein-ligand dockingProtein-ligand docking
Protein-ligand docking
 
Qsar and drug design ppt
Qsar and drug design pptQsar and drug design ppt
Qsar and drug design ppt
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug Discovery
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision Tree
 
Qsar by hansch analysis
Qsar by hansch analysisQsar by hansch analysis
Qsar by hansch analysis
 
Target identification
Target identificationTarget identification
Target identification
 

Similar to Molecular scaffolds are special and useful guides to discovery

Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesAmanda Whitmire
 
Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Mark Wilkinson
 
Evolution of e-Research
Evolution of e-ResearchEvolution of e-Research
Evolution of e-ResearchDavid De Roure
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
 
Investigación con embriones humanos ¿sí o no
Investigación con embriones humanos ¿sí o noInvestigación con embriones humanos ¿sí o no
Investigación con embriones humanos ¿sí o noseminary
 
5. angelica assignment 2 march 9 revised
5. angelica assignment 2 march 9 revised5. angelica assignment 2 march 9 revised
5. angelica assignment 2 march 9 revisedangelicagonzalez10
 
Advances in experimental medicine and biology hussain book
Advances in experimental medicine and biology hussain bookAdvances in experimental medicine and biology hussain book
Advances in experimental medicine and biology hussain bookmantu verma
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Model organisms - BUGEMA UNIVERSITY
Model organisms - BUGEMA UNIVERSITYModel organisms - BUGEMA UNIVERSITY
Model organisms - BUGEMA UNIVERSITYMuunda Mudenda
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1AlyciaGold776
 
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgScott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgGigaScience, BGI Hong Kong
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the futurePistoia Alliance
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...GigaScience, BGI Hong Kong
 

Similar to Molecular scaffolds are special and useful guides to discovery (20)

Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universities
 
Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014
 
Evolution of e-Research
Evolution of e-ResearchEvolution of e-Research
Evolution of e-Research
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
 
Investigación con embriones humanos ¿sí o no
Investigación con embriones humanos ¿sí o noInvestigación con embriones humanos ¿sí o no
Investigación con embriones humanos ¿sí o no
 
5. angelica assignment 2 march 9 revised
5. angelica assignment 2 march 9 revised5. angelica assignment 2 march 9 revised
5. angelica assignment 2 march 9 revised
 
Advances in experimental medicine and biology hussain book
Advances in experimental medicine and biology hussain bookAdvances in experimental medicine and biology hussain book
Advances in experimental medicine and biology hussain book
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Shorthouse
ShorthouseShorthouse
Shorthouse
 
Model organisms - BUGEMA UNIVERSITY
Model organisms - BUGEMA UNIVERSITYModel organisms - BUGEMA UNIVERSITY
Model organisms - BUGEMA UNIVERSITY
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Biomarker-Vol-8
Biomarker-Vol-8Biomarker-Vol-8
Biomarker-Vol-8
 
DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1
 
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgScott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
Ppt jitu[1]
Ppt jitu[1]Ppt jitu[1]
Ppt jitu[1]
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
 
Organoid Poster
Organoid PosterOrganoid Poster
Organoid Poster
 

More from Jeremy Yang

TIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsTIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsJeremy Yang
 
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerDrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerJeremy Yang
 
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesMining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesJeremy Yang
 
TIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APITIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APIJeremy Yang
 
Ex-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerEx-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerJeremy Yang
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Jeremy Yang
 
Open Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterOpen Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterJeremy Yang
 
Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Jeremy Yang
 
Bibliological data science and drug discovery
Bibliological data science and drug discoveryBibliological data science and drug discovery
Bibliological data science and drug discoveryJeremy Yang
 
BioMISS: Language Diversity of Computing
BioMISS: Language Diversity of ComputingBioMISS: Language Diversity of Computing
BioMISS: Language Diversity of ComputingJeremy Yang
 
The Language Diversity of Computing
The Language Diversity of ComputingThe Language Diversity of Computing
The Language Diversity of ComputingJeremy Yang
 
RMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsRMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsJeremy Yang
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsJeremy Yang
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds posterJeremy Yang
 
The BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDThe BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDJeremy Yang
 
Cheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesCheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesJeremy Yang
 
How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...Jeremy Yang
 
UNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsUNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsJeremy Yang
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingJeremy Yang
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNJeremy Yang
 

More from Jeremy Yang (20)

TIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsTIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS Analytics
 
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerDrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
 
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesMining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
 
TIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APITIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST API
 
Ex-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerEx-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles Explorer
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
 
Open Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterOpen Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource poster
 
Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)
 
Bibliological data science and drug discovery
Bibliological data science and drug discoveryBibliological data science and drug discovery
Bibliological data science and drug discovery
 
BioMISS: Language Diversity of Computing
BioMISS: Language Diversity of ComputingBioMISS: Language Diversity of Computing
BioMISS: Language Diversity of Computing
 
The Language Diversity of Computing
The Language Diversity of ComputingThe Language Diversity of Computing
The Language Diversity of Computing
 
RMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsRMSD: routine measure stirs doubts
RMSD: routine measure stirs doubts
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformatics
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
 
The BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDThe BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARD
 
Cheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesCheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case Studies
 
How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...
 
UNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsUNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applications
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in Biocomputing
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
 

Recently uploaded

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Molecular scaffolds are special and useful guides to discovery

  • 1. Molecular scaffolds are special and useful guides for discovery Jeremy Yang, UNM & IU Cristian Bologa, UNM David Wild, IU Tudor Oprea, UNM ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
  • 2. Molecular scaffolds are special and useful guides for discovery Jeremy Yang, UNM & IU Cristian Bologa, UNM David Wild, IU Tudor Oprea, UNM ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
  • 3. What is a molecular scaffold? • "Ring-system" • "Ring" • "Core" • "Framework" Can you identify this famous scaffold?
  • 4. Some famous scaffolds beta – lactam (penicillins, cephalosporins ) Millions saved, billions earned steroid (testosterone, hydrocortisone, etc. ) Wonder drugs and hormones benzodiazepine (Valium, flurazepam, etc. ) “Mother’s little helper”
  • 5. Scaffolds are special because: 3D shape Estradiol docked into ER-α (OpenEye Fred, Vida)
  • 6. Scaffold scholarship & software • Bemis & Murcko, “Molecular frameworks”, 1996. • Lewell et al., “Drug rings database”, 2003. • Wilkens et al., “HierS: hierarchical scaffolds”, 2005. • Ertl et al., “Quest for the Rings”, 2006.
  • 7. • Clustering, indexing databases. • Navigation of chemical space. • Data reduction, visualization. • R-group / SAR analyses. • Bioactivity prediction. • Promiscuity prediction. Scaffold Applications What can be done with scaffolds? "The only rules that really matter are these: what a man can do and what a man can't do." - Jack Sparrow
  • 8. Scaffold Applications: Scaffold Hunter Interactive exploration of chemical space with Scaffold Hunter, S Wetzel, K Klein, S Renner, D Rauh, T Oprea, P Mutzel, H Waldmann, Nat Chem Bio, 5, 2009, 581-583.
  • 9. Scaffold Applications: Scaffold Hopper Scaffold Hopper, NCATS/NCGC, http://tripod.nih.gov, http://tripod.nih.gov/files/ACS_apr8_2013.pdf.
  • 10. Scaffold Applications: CARLSBAD CARLSBAD: The Power to Explore Biological Networks via Chemical Patterns The CARLSBAD Database: A Confederated Database of Chemical Bioactivities, S. L. Mathias, J. Hines-Kay, J. J. Yang, G. Zahoransky-Kohalmi, C. G. Bologa, O. Ursu and T. I. Oprea, Database, 2013, bat044. http://carlsbad.health.unm.edu
  • 11. Scaffold Applications: Molecule Cloud The Molecule Cloud - compact visualization of large collections of molecules, P Ertl and B Rohde, J. Cheminfo, 2012, 4:12.
  • 12. Scaffold Applications: Badapple (BioActivity Data Associative Promiscuity Pattern Learning Engine) Translational Informatics Public Webapps: http://pasilla.health.unm.edu/
  • 13. See also my Badapple talk in CINF session "Integrative Chemogenomics Knowledge Mining Using NIH Open Access Resources", Tues. Sept. 9, 10:45am, Rm. 140. Scaffold Applications: Badapple Promiscuity Plugin Badapple Promiscuity Plugin for BARD, http://bard.nih.gov
  • 14. Scaffold software: UNM-Biocomp-HScaf (Open-source Google Code project) http://code.google.com/p/unm-biocomp-hscaf/
  • 15. UNM Translational Informatics Public Web Apps: http://pasilla.health.unm.edu/ Demo web app: HScaf
  • 16. Scaffold analysis algorithm • Remove non-linking chains • Keep linking chains • Keep atoms multiply-bonded to rings and chains • Special case: ignore solo-benzene.
  • 18. Cheminformatics and scaffolds: Relevant methods • SSSR (Smallest Set of Smallest Rings) • Canonicalization (e.g. Morgan, CanSMILES) • Scaffolds vs. MCS (max common subgraph) • Fingerprints, descriptors, similarity • Proposed new method: scaffold-based similarity
  • 19. More scaffold charms • Patents, Markush, $$$. • Lead discovery ~ scaffold discovery. • Organic chemists like scaffolds. • Scaffolds can be "privileged".
  • 20. Scaffolds & drug-scaffolds, the privileged few explaining a lot of activity... Dataset: BARD, MLSMR, MLP HTS Totals: compounds: 373,802 ; scaffolds: 146,024 ; assays: 528 ; wells/results: 30,612,714; drugs: 283; drugscafs: 1958 % total activity # scaffolds % scaffolds All 50% 1979 1.4% All 75% 11,645 8% Drugs 50% 54 2.8% Drugs 90% 327 16.7% “activity of DB” ~ # active scaffold-instances
  • 21. Privileged scaffolds concept Nature favors a few privileged scaffolds, a.k.a. "privileged structures", for multiple receptors. "What is clear is that certain “privileged structures” are capable of providing useful ligands for more than one receptor and that judicious modification of such structures could be a viable alternative in the search for new receptor agonists and antagonists."* *Methods for drug discovery: development of potent, selective, orally effective cholecystokinin antagonists, Evans et al., J. Med. Chem., 1988, 31, 2235.
  • 22. News: antibiotic, scaffold: Anthracimycin Anthracimycin, a Potent Anthrax Antibiotic from a Marine-Derived Actinomycete, Kyoung Hwa Jang et al., Angewandte Chemie, vol. 52, no 30, 2013, pp7822–7824; doi: 10.1002/anie.201302749.
  • 23. Problems with scaffolds • Definition of "scaffold" not consistent & rigorous among chemists & cheminformaticians. Testosterone Estradiol Danazol Cyproterone acetate "We think in generalities, but we live in detail." - Alfred North Whitehead
  • 25. Conclusion: Molecular scaffolds (like cheminformatics itself) are special and useful guides for discovery in chemical biology, chemogenomics, and drug discovery ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
  • 26. Thank Yous: Cristian Bologa, UNM Tudor Oprea, UNM Oleg Ursu, UNM David Wild, IU Gary Wiggins, IU Happy Explorations! ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science