SlideShare a Scribd company logo
1 of 55
Andrew Lang
Professor of Mathematics
 Oral Roberts University
Open Drug
Discovery for
Neglected
Diseases
Malaria
Schistosomiasis
Gram positive bacteria
Breast Cancer
Drugs for neglected diseases




        need to be…
cheap and…
easy to make.
combinatorial
                      library
solubility data

                      docking
  solubility
   models
                     synthesis

   solvent
  selection
                                      biological
                  recrystallization
                                        assay
melting point
  models


melting point     The big picture
    data
combinatorial
                         library
solubility data

                         docking
  solubility
   models
                        synthesis

   solvent
  selection
                                         biological
                     recrystallization
                                           assay
melting point
  models


melting point
                  Oral Roberts University
    data
                  undergraduate research
David Bulger
                    MD/PhD Student
                      Tennessee




Cameron Neylon     Solubility Measurements and Ugi
Biophysicist RAL      Product Synthesis at ORU,
                            Drexel, and RAL

                    Submeta ONS Award Winner,
                        BOE Award Winner

                   Supervisors: Robert Stewart, Lois
                   Ablin, Bill Collier, Joel Gaikwad,
                      Jean-Claude Bradley, and
                           Cameron Neylon
Lacey Condron
                             Chemistry Major




 Lizzie Clark
Nursing Major


                 Samantha Gaines, Lizzie
                Clark, and Lacey Condron

            Solubility Measurements and
             Solubility Modeling at ORU

         Supervisors: Ken Weed, Lois Ablin
Daryl Charron, Alejandro
     Hernandez, Maria
      Hernandez, Jesse
  Patsolic, Matthew Wilson

Cluster Computer Construction
 and In-Silico Docking at ORU

  Supervisors: Ken Preston
combinatorial
                      library
solubility data

                      docking
  solubility
   models
                     synthesis

   solvent
  selection
                                      biological
                  recrystallization
                                        assay
melting point
  models


melting point     Let’s focus
    data
Early models, before 2005 were…
…specialized
1979 Martin – disubstituted benzenes
1987 Hanson – normal alkanes
1988 Needham – normal and branched alkanes
1990 Abramowitz – non-hydrogen bonded benzenes
1991 Dearden – anilines
1993 Katritzky – aldehydes, amines, and ketones
1994 Simamora – rigid aromatic
1996 Charlton – alkanes
1996 Katritzky – pyridines
1999 Zhao – aliphatic
2001 Chickos – homologous series
2003 Bergstrom – druglike (N = 277, r2 = 0.54)
In 2005…




     …everything changed
MDPI - cheminformatics.org




Karthikeyan 2005 N = 4173, r2 = 0.65
PHYSPROP




Clark 2005 N = 6257, r2 = 0.61
Recent melting point models
         use these datasets…
…never reproducing r2 = 0.65 (0.47 – 0.56)
Even though [a] melting point
can be measured accurately, its
prediction has been a
notoriously difficult problem.
We began
measuring, collecting, and
curating melting points in the
Jean-Claude Bradley’s
          Chemical Information Retrieval
                Course at Drexel




567 curated and referenced measurements from
Fall 2010 Chemical Information Retrieval course
Most popular data sources…




    …chemical vendors
Alfa Aesar donates ~13,000
melting points to the public domain
collection




measurement                                     curation
                         ONS
                     melting point
                      workflow




        validation                   modeling
Collection: Open Data
source             data points   curated values   source year   data type
Bell               2483          1631             1995          donated-CC0
Bergstrom          277           277              2003          open
MDPI-Karthikeyan   4450          4084             2005          open
Hughes             287           262              2008          open
Oxford-MSDS        3217          1481             2010          open
Drugbank           875           875              2011          open
Griffiths          3757          278              2011          donated-CC0
Alfa Aesar         12986         8739             2011          donated-CC0
PHYSPROP           11645         9694             2011          donated-CC0
ONS                471           471              2012          open


             27792 curated measurements
                for 19410 compounds
Antony Williams – RSC ChemSpider



             Curation is…




…lots of hard, tedious work
(Jean-Claude Bradley and Antony Williams)
Inconsistencies and SMILES problems
within the “high trust level” MDPI dataset
PHYSPROP Structure Errors (Incorrect Valence)
2315 out of 43543 contained pentavalent nitrogens
PHYSPROP Errors: Structure displayed is for the neutral
compound dopamine but the associated CAS Number and
 chemical name in the file are for the hydrobromide salt.
unit errors: Kelvin/Celsius, Fahrenheit/Celsius
bad SMILES (non-rendering, hypervalency)
salts associated with SMILES for free base
using boiling point for melting point
Some melting points can’t be resolved
 only with literature: 4-benzyltoluene
Open lab notebook page
measuring the melting point of 4-benzyltoluene
CDK
                     descriptor calculator



                              R
                     statistical computing


                                             Melting
melting point data                            Point
                                             Model
use this model
compounds              CDK
doubleplusgood   descriptor calculator

single
                          R
                 statistical computing


                                         Melting
                                          Point
                                         Model
Comparison of model with
double+ validated measurements
Straight chain carboxylic acids from 1 to 10 carbons




    Straight chain alcohols from 1 to 10 carbons
Cyclic primary amines from 3 to 6 carbons
cyclobutylamine flagged for measurement
       only single source available
Publication of double+ validated
     melting point dataset




                …as a preprint
Publication of double+ validated
     melting point dataset




                     …as a book
Data and model deployed…




        web service




                      …on the web
…in Google spreadsheets
…as an app
   Can the solvents used to recrystallize compounds in
    organic teaching labs be improved?
   Trans-dibenzalacetone




   Aldol condensation between two molecules of
    benzaldehyde and one molecule of acetone



        [Matthew McBride: Undergraduate Research Assistant - Drexel]
   First recrystallized in ethyl acetate in 1906: Straus
    and Ecker, Ber. 39, 2988 (1906)
   Recrystallized in ethyl acetate in Organic Syntheses
   Recommended recrystallization solvent: ethyl acetate.




    (http://classes.kvcc.edu/chm230/mixed%20aldol%20condensation.pdf




    (http://www.xula.edu/chemistry/documents/orgleclab/Aldol_notes.pdf)
Enter compound identification and desired parameters
How does it work?
1. Look up the solvent boiling point

2. Look up the room temperature solubility or predict it via measured or
predicted Abraham descriptors

3. Look up the solute melting point or predict it via a model

4. Use the melting point and the solubility at room temperature to predict
the solubility at boiling

5. Calculate the predicted recrystallization yield
Lists solvents and their predicted recrystallization yield.
Prediction is generated by the temperature dependent
solubility curves.
   ethyl acetate (predicted yield of 72%) vs ethanol
    (predicted yield of 93%)
   ethyl acetate                                   2.06M


                                            0.62M




   ethanol                                           1.1M



                                              0.09M
Dibenzalacetone derivatives docking against tubulin
                 (paclitaxel site)
   Derivatives of dibenzalacetone may be synthesized
    by altering the aldehyde used
   From a library of derivatives, the following
    compound was the top hit for the docking site of
    Taxol




   Uses phenanthrene-9-carboxaldehyde
   Perform a Reaxys search to determine availability
    of synthesis procedures




   No results



        [Matthew McBride: Undergraduate Research Assistant - Drexel]
(http://usefulchem.wikispaces.com/EXP286)

   Used methanol and benzene




   Melting Point: 264-265 C
        [Matthew McBride: Undergraduate Research Assistant - Drexel]
Acknowledgements
ORU Biology and Chemistry Faculty
Jean-Claude Bradley (Drexel)
Cameron Neylon (RAL)
Antony Williams (RSC ChemSpider)
Evan Curtin (Drexel)
Matthew McBride (Drexel)

ORU research assistants: David Bulger, Daryl
Charron, Lizzie Clark, Lacey Condron, Samantha
Gaines, Alejandro Hernandez, Maria Hernandez, Jesse
Patsolic, and Matthew Wilson

More Related Content

What's hot

Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation ProductCorrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
ijtsrd
 
Hawari 1985 JAmChemSoc
Hawari 1985 JAmChemSocHawari 1985 JAmChemSoc
Hawari 1985 JAmChemSoc
Jalal Hawari
 
Nucleic Acids Research (1995) 23, 159-164
Nucleic Acids Research (1995) 23, 159-164Nucleic Acids Research (1995) 23, 159-164
Nucleic Acids Research (1995) 23, 159-164
Dinesh Barawkar
 
Different methods for tritium determination in surface water by lsc
Different methods for tritium determination in surface water by lscDifferent methods for tritium determination in surface water by lsc
Different methods for tritium determination in surface water by lsc
trabajomuestreo
 

What's hot (17)

Evaluation of Commercially Available Biochemical Kit for Assay of Total Calci...
Evaluation of Commercially Available Biochemical Kit for Assay of Total Calci...Evaluation of Commercially Available Biochemical Kit for Assay of Total Calci...
Evaluation of Commercially Available Biochemical Kit for Assay of Total Calci...
 
A green and efficient friendly process for synthesis of drug against dormant ...
A green and efficient friendly process for synthesis of drug against dormant ...A green and efficient friendly process for synthesis of drug against dormant ...
A green and efficient friendly process for synthesis of drug against dormant ...
 
Raja paper 1
Raja paper 1Raja paper 1
Raja paper 1
 
Characterization of Semi-Interpenetrated Network Alginate/Gelatin Wound Dress...
Characterization of Semi-Interpenetrated Network Alginate/Gelatin Wound Dress...Characterization of Semi-Interpenetrated Network Alginate/Gelatin Wound Dress...
Characterization of Semi-Interpenetrated Network Alginate/Gelatin Wound Dress...
 
Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation ProductCorrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
Corrosion Inhibition of Mild Steel in HCl Medium by a Condensation Product
 
FIBER OPTIC AIDED SPECTROPHOTOMETRIC DETERMINATION OF GADOLINIUM IN FBR REPRO...
FIBER OPTIC AIDED SPECTROPHOTOMETRIC DETERMINATION OF GADOLINIUM IN FBR REPRO...FIBER OPTIC AIDED SPECTROPHOTOMETRIC DETERMINATION OF GADOLINIUM IN FBR REPRO...
FIBER OPTIC AIDED SPECTROPHOTOMETRIC DETERMINATION OF GADOLINIUM IN FBR REPRO...
 
Cellulose nanofibre from energy cane
Cellulose nanofibre from energy caneCellulose nanofibre from energy cane
Cellulose nanofibre from energy cane
 
A04 3-0110
A04 3-0110A04 3-0110
A04 3-0110
 
Hawari 1985 JAmChemSoc
Hawari 1985 JAmChemSocHawari 1985 JAmChemSoc
Hawari 1985 JAmChemSoc
 
Hydrogen bonding and molecular design (EuroQSAR 2010)
Hydrogen bonding and molecular design (EuroQSAR 2010)Hydrogen bonding and molecular design (EuroQSAR 2010)
Hydrogen bonding and molecular design (EuroQSAR 2010)
 
Dielectric behavior of acetonitrile + n-Butyl alcohol binary mixtures at micr...
Dielectric behavior of acetonitrile + n-Butyl alcohol binary mixtures at micr...Dielectric behavior of acetonitrile + n-Butyl alcohol binary mixtures at micr...
Dielectric behavior of acetonitrile + n-Butyl alcohol binary mixtures at micr...
 
Growth, structural, mechanical and dielectric studies of undoped and urea dop...
Growth, structural, mechanical and dielectric studies of undoped and urea dop...Growth, structural, mechanical and dielectric studies of undoped and urea dop...
Growth, structural, mechanical and dielectric studies of undoped and urea dop...
 
Aijrfans14 270
Aijrfans14 270Aijrfans14 270
Aijrfans14 270
 
Synthesis of new 2 h pyrano[3,2-h]quinolines with potential biological activity
Synthesis of new 2 h pyrano[3,2-h]quinolines with potential biological activitySynthesis of new 2 h pyrano[3,2-h]quinolines with potential biological activity
Synthesis of new 2 h pyrano[3,2-h]quinolines with potential biological activity
 
Studies on Biofield Treated p-Dichlorobenzene | Omicsonline
Studies on Biofield Treated p-Dichlorobenzene | OmicsonlineStudies on Biofield Treated p-Dichlorobenzene | Omicsonline
Studies on Biofield Treated p-Dichlorobenzene | Omicsonline
 
Nucleic Acids Research (1995) 23, 159-164
Nucleic Acids Research (1995) 23, 159-164Nucleic Acids Research (1995) 23, 159-164
Nucleic Acids Research (1995) 23, 159-164
 
Different methods for tritium determination in surface water by lsc
Different methods for tritium determination in surface water by lscDifferent methods for tritium determination in surface water by lsc
Different methods for tritium determination in surface water by lsc
 

Similar to modeling melting points

Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
Maciej Przybyłek
 

Similar to modeling melting points (20)

The Value of Openness in Research and Teaching
The Value of Openness in Research and TeachingThe Value of Openness in Research and Teaching
The Value of Openness in Research and Teaching
 
Bradley Open Notebook Science Georgia Tech OA week
Bradley Open Notebook Science Georgia Tech OA weekBradley Open Notebook Science Georgia Tech OA week
Bradley Open Notebook Science Georgia Tech OA week
 
CINF 2012 talk Recrystallization App
CINF 2012 talk Recrystallization AppCINF 2012 talk Recrystallization App
CINF 2012 talk Recrystallization App
 
Bradley Opal 2011
Bradley Opal 2011Bradley Opal 2011
Bradley Opal 2011
 
MiniSymp2011 Bradley
MiniSymp2011 BradleyMiniSymp2011 Bradley
MiniSymp2011 Bradley
 
Chemical Information Retrieval Class 1
Chemical Information Retrieval Class 1Chemical Information Retrieval Class 1
Chemical Information Retrieval Class 1
 
The collection, curation and modeling of Open Melting Point measurements
The collection, curation and modeling of Open Melting Point measurementsThe collection, curation and modeling of Open Melting Point measurements
The collection, curation and modeling of Open Melting Point measurements
 
Open Notebooks Science
Open Notebooks ScienceOpen Notebooks Science
Open Notebooks Science
 
Instant notes analytical-chemistry
Instant notes analytical-chemistryInstant notes analytical-chemistry
Instant notes analytical-chemistry
 
The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...The influence of data curation on QSAR Modeling – Presented at American Chemi...
The influence of data curation on QSAR Modeling – Presented at American Chemi...
 
ChemInfo 2011 class1
ChemInfo 2011 class1ChemInfo 2011 class1
ChemInfo 2011 class1
 
White House Open Notebook Science Poster
White House Open Notebook Science PosterWhite House Open Notebook Science Poster
White House Open Notebook Science Poster
 
Bradley Open Notebook Science ACSfall2012
Bradley Open Notebook Science ACSfall2012Bradley Open Notebook Science ACSfall2012
Bradley Open Notebook Science ACSfall2012
 
Molecular design: One step back and two paths forward
Molecular design:  One step back and two paths forwardMolecular design:  One step back and two paths forward
Molecular design: One step back and two paths forward
 
Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
Application of COSMO-RS-DARE as a Tool for Testing Consistency of Solubility ...
 
Open Notebook Science talk at the 2012 ACS
Open Notebook Science talk at the 2012 ACS Open Notebook Science talk at the 2012 ACS
Open Notebook Science talk at the 2012 ACS
 
Drexel College of Arts and Sciences (COAS) Research Day 2013
Drexel College of Arts and Sciences (COAS) Research Day 2013Drexel College of Arts and Sciences (COAS) Research Day 2013
Drexel College of Arts and Sciences (COAS) Research Day 2013
 
The Role of Trust in Science at SLA 2011
The Role of Trust in Science at SLA 2011The Role of Trust in Science at SLA 2011
The Role of Trust in Science at SLA 2011
 
The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...
 
PhD Thesis
PhD ThesisPhD Thesis
PhD Thesis
 

More from Andrew Lang

More from Andrew Lang (11)

Lewis, Science, Religion, and Aliens
Lewis, Science, Religion, and AliensLewis, Science, Religion, and Aliens
Lewis, Science, Religion, and Aliens
 
The Revelation of Jean-Claude Bradley
The Revelation of Jean-Claude BradleyThe Revelation of Jean-Claude Bradley
The Revelation of Jean-Claude Bradley
 
Higher Education
Higher EducationHigher Education
Higher Education
 
Quantum Psychology
Quantum PsychologyQuantum Psychology
Quantum Psychology
 
Lewis' view of Venus in Perelandra
Lewis' view of Venus in PerelandraLewis' view of Venus in Perelandra
Lewis' view of Venus in Perelandra
 
Written rummage
Written rummageWritten rummage
Written rummage
 
I'm a professor
I'm a professorI'm a professor
I'm a professor
 
Lewis' view of Mars in out of the silent planet
Lewis' view of Mars in out of the silent planetLewis' view of Mars in out of the silent planet
Lewis' view of Mars in out of the silent planet
 
Sortase A Inhibition By Ugi Products (Complex)
Sortase A Inhibition By Ugi Products (Complex)Sortase A Inhibition By Ugi Products (Complex)
Sortase A Inhibition By Ugi Products (Complex)
 
Chemistry in Second Life
Chemistry in Second LifeChemistry in Second Life
Chemistry in Second Life
 
Why the Universe appears designed and why it doesn’t have to be
Why the Universe appears designed and why it doesn’t have to beWhy the Universe appears designed and why it doesn’t have to be
Why the Universe appears designed and why it doesn’t have to be
 

Recently uploaded

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 

Recently uploaded (20)

Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 

modeling melting points

  • 1. Andrew Lang Professor of Mathematics Oral Roberts University
  • 2.
  • 4. Drugs for neglected diseases need to be…
  • 7. combinatorial library solubility data docking solubility models synthesis solvent selection biological recrystallization assay melting point models melting point The big picture data
  • 8. combinatorial library solubility data docking solubility models synthesis solvent selection biological recrystallization assay melting point models melting point Oral Roberts University data undergraduate research
  • 9. David Bulger MD/PhD Student Tennessee Cameron Neylon Solubility Measurements and Ugi Biophysicist RAL Product Synthesis at ORU, Drexel, and RAL Submeta ONS Award Winner, BOE Award Winner Supervisors: Robert Stewart, Lois Ablin, Bill Collier, Joel Gaikwad, Jean-Claude Bradley, and Cameron Neylon
  • 10. Lacey Condron Chemistry Major Lizzie Clark Nursing Major Samantha Gaines, Lizzie Clark, and Lacey Condron Solubility Measurements and Solubility Modeling at ORU Supervisors: Ken Weed, Lois Ablin
  • 11. Daryl Charron, Alejandro Hernandez, Maria Hernandez, Jesse Patsolic, Matthew Wilson Cluster Computer Construction and In-Silico Docking at ORU Supervisors: Ken Preston
  • 12. combinatorial library solubility data docking solubility models synthesis solvent selection biological recrystallization assay melting point models melting point Let’s focus data
  • 13. Early models, before 2005 were…
  • 14. …specialized 1979 Martin – disubstituted benzenes 1987 Hanson – normal alkanes 1988 Needham – normal and branched alkanes 1990 Abramowitz – non-hydrogen bonded benzenes 1991 Dearden – anilines 1993 Katritzky – aldehydes, amines, and ketones 1994 Simamora – rigid aromatic 1996 Charlton – alkanes 1996 Katritzky – pyridines 1999 Zhao – aliphatic 2001 Chickos – homologous series 2003 Bergstrom – druglike (N = 277, r2 = 0.54)
  • 15. In 2005… …everything changed
  • 16. MDPI - cheminformatics.org Karthikeyan 2005 N = 4173, r2 = 0.65
  • 17. PHYSPROP Clark 2005 N = 6257, r2 = 0.61
  • 18. Recent melting point models use these datasets… …never reproducing r2 = 0.65 (0.47 – 0.56)
  • 19. Even though [a] melting point can be measured accurately, its prediction has been a notoriously difficult problem.
  • 20. We began measuring, collecting, and curating melting points in the
  • 21. Jean-Claude Bradley’s Chemical Information Retrieval Course at Drexel 567 curated and referenced measurements from Fall 2010 Chemical Information Retrieval course
  • 22. Most popular data sources… …chemical vendors
  • 23. Alfa Aesar donates ~13,000 melting points to the public domain
  • 24. collection measurement curation ONS melting point workflow validation modeling
  • 25. Collection: Open Data source data points curated values source year data type Bell 2483 1631 1995 donated-CC0 Bergstrom 277 277 2003 open MDPI-Karthikeyan 4450 4084 2005 open Hughes 287 262 2008 open Oxford-MSDS 3217 1481 2010 open Drugbank 875 875 2011 open Griffiths 3757 278 2011 donated-CC0 Alfa Aesar 12986 8739 2011 donated-CC0 PHYSPROP 11645 9694 2011 donated-CC0 ONS 471 471 2012 open 27792 curated measurements for 19410 compounds
  • 26. Antony Williams – RSC ChemSpider Curation is… …lots of hard, tedious work (Jean-Claude Bradley and Antony Williams)
  • 27. Inconsistencies and SMILES problems within the “high trust level” MDPI dataset
  • 28. PHYSPROP Structure Errors (Incorrect Valence) 2315 out of 43543 contained pentavalent nitrogens
  • 29. PHYSPROP Errors: Structure displayed is for the neutral compound dopamine but the associated CAS Number and chemical name in the file are for the hydrobromide salt.
  • 30. unit errors: Kelvin/Celsius, Fahrenheit/Celsius bad SMILES (non-rendering, hypervalency) salts associated with SMILES for free base using boiling point for melting point
  • 31. Some melting points can’t be resolved only with literature: 4-benzyltoluene
  • 32. Open lab notebook page measuring the melting point of 4-benzyltoluene
  • 33. CDK descriptor calculator R statistical computing Melting melting point data Point Model
  • 35. compounds CDK doubleplusgood descriptor calculator single R statistical computing Melting Point Model
  • 36.
  • 37. Comparison of model with double+ validated measurements Straight chain carboxylic acids from 1 to 10 carbons Straight chain alcohols from 1 to 10 carbons
  • 38. Cyclic primary amines from 3 to 6 carbons cyclobutylamine flagged for measurement only single source available
  • 39. Publication of double+ validated melting point dataset …as a preprint
  • 40. Publication of double+ validated melting point dataset …as a book
  • 41. Data and model deployed… web service …on the web
  • 44. Can the solvents used to recrystallize compounds in organic teaching labs be improved?  Trans-dibenzalacetone  Aldol condensation between two molecules of benzaldehyde and one molecule of acetone [Matthew McBride: Undergraduate Research Assistant - Drexel]
  • 45. First recrystallized in ethyl acetate in 1906: Straus and Ecker, Ber. 39, 2988 (1906)  Recrystallized in ethyl acetate in Organic Syntheses
  • 46. Recommended recrystallization solvent: ethyl acetate. (http://classes.kvcc.edu/chm230/mixed%20aldol%20condensation.pdf (http://www.xula.edu/chemistry/documents/orgleclab/Aldol_notes.pdf)
  • 47. Enter compound identification and desired parameters
  • 48. How does it work? 1. Look up the solvent boiling point 2. Look up the room temperature solubility or predict it via measured or predicted Abraham descriptors 3. Look up the solute melting point or predict it via a model 4. Use the melting point and the solubility at room temperature to predict the solubility at boiling 5. Calculate the predicted recrystallization yield
  • 49. Lists solvents and their predicted recrystallization yield. Prediction is generated by the temperature dependent solubility curves.
  • 50. ethyl acetate (predicted yield of 72%) vs ethanol (predicted yield of 93%)  ethyl acetate 2.06M 0.62M  ethanol 1.1M 0.09M
  • 51. Dibenzalacetone derivatives docking against tubulin (paclitaxel site)
  • 52. Derivatives of dibenzalacetone may be synthesized by altering the aldehyde used  From a library of derivatives, the following compound was the top hit for the docking site of Taxol  Uses phenanthrene-9-carboxaldehyde
  • 53. Perform a Reaxys search to determine availability of synthesis procedures  No results [Matthew McBride: Undergraduate Research Assistant - Drexel]
  • 54. (http://usefulchem.wikispaces.com/EXP286)  Used methanol and benzene  Melting Point: 264-265 C [Matthew McBride: Undergraduate Research Assistant - Drexel]
  • 55. Acknowledgements ORU Biology and Chemistry Faculty Jean-Claude Bradley (Drexel) Cameron Neylon (RAL) Antony Williams (RSC ChemSpider) Evan Curtin (Drexel) Matthew McBride (Drexel) ORU research assistants: David Bulger, Daryl Charron, Lizzie Clark, Lacey Condron, Samantha Gaines, Alejandro Hernandez, Maria Hernandez, Jesse Patsolic, and Matthew Wilson

Editor's Notes

  1. http://usefulchem.wikispaces.com/D-EXP022From a library of derivatives, it was the hop hit for the docking site of taxol