SlideShare uma empresa Scribd logo
1 de 23
Molecular Similarity Characterization of
          ADME Landscapes


                       ACS Annual Meeting
                        San Francisco 2010

     Bin Chen‡, Rishi Gupta* and Eric Gifford†
    ‡ School of Informatics and Computing, Indiana University, Bloomington, IN 47408
          * Anti Bacterial Research Unit, Pfizer Global R&D, Groton, CT 06340
          † Computational Sciences CoE, Pfizer Global R&D, Groton, CT 06340

                                   Pfizer Confidential
Outline


     Introduction

     Methods

     Results & discussions

     Use cases

     Conclusions




2                             Pfizer Confidential
What has been done so far?
                                              A lot of excellent work
                                              in the Activity space
                                              using a variety of
                                              similarity methods and
                                              descriptors




Current work focuses
primarily on ADME end
points and Molecular
properties while
examining various
descriptor types and
similarity methods
3                       Pfizer Confidential
Do similar compounds have similar ADME properties?


    Similar ADME                                                         Varies based on
                   Similarity 0.92             Similarity 0.85
    Properties?                                                          descriptors used
                                                              O
                                      OH
                                                                  OH
                      OH




                                                        0.9            0.8     0.7




4                               Pfizer Confidential
Do different ADME endpoints have different landscapes?

                                                                # neighbors with same class
     Probe Compound
                                              Ratiosimilarity
                                                                     # total neighbors
     High Risk Compound
                                                                    HLM
     Low Risk Compound
                                                                             4
                                                                  Ratio0.9        0.8
                          0.9           0.8      0.7                         5
                                                                              8
                                                                 Ratio0.8         0.67
                                                                             12

                                                                      RRCK
                                                                             4
                                                                  Ratio0.9      0.8
                                0.9        0.8      0.7                      5
                                                                              9
                                                                  Ratio0.8        0.75
                                                                             12
5                                     Pfizer Confidential
Hypothesis: Visualizing Chemical Landscape
                                                                       Identical Compounds
                                                                             Ratio ~1.0

                      1.0
              Ratio




                            Endpoint1

                      0.5   Endpoint2
                            Endpoint3

                            Endpoint4
                                         Ratio=f(endpoint, similarity)


      Ratio ~ High (low)         0.2              0.5            0.8     1
    risk compounds/total
                                        Similarity cutoff
         compounds
6                                          Pfizer Confidential
Datasets, Assays and Bins
    Endpoint                       Description                                 Result unit         Low Risk   High
                                                                                                              Risk
                                                                                 -6
     RRCK              passive permeability in RRCK cell line                  10 cm/sec             >10      <=10

      HLM         metabolic stability using human liver microsomes             µL/min/mg             <20      >=20

                                                                                 -6
      MDR                 Pgp influenced permeability and                      10 cm/sec             >10      <=10
                            efflux in MDCK-MDR1 cells
    CYP1A2        CYP1A2 inhibition in a substrate cocktail assay              % Inhibition          <10      >=10
    CYP3A4        CYP3A4 inhibition in a substrate cocktail assay              % Inhibition          <10      >=10
    CYP2D6         CYP2D6inhibition in a substrate cocktail assay              % Inhibition          <10      >=10

    CYP2C9        CYP2C9 inhibition in a substrate cocktail assay              % Inhibition          <10      >=10
    *Solubility        ADMET Aqueous Solubility properties                   Solubility level        >2       <=2
     *cLogP                logarithm partition coefficient                   Octanol-Water           <3       >=3
                                                                           Partition Coefficient


    • Full matrix consisting of 17787 compounds and 9 endpoints
    • Solubility and cLogP are predicted endpoints using in-house computational models
    on datasets with more than 10K compounds ,the rest are experimental results
7                                                    Pfizer Confidential
Characterize Chemical Landscape: Proposed Workflow*
      Full matrix                     FCFP6                               Similarity         • Structure similarity
   (cmpd*endpoint)                    Tanimoto                             matrix                 • Fingerprint (4)
                         Select all high/low                                                           • MDL public keys
                       risk compounds in an                                                            • Atom pairs
                              Endpoint
                                                                                                       • FCFP6
                         Select one similarity                                                         • ECFC4
                                cutoff                                                            • Coefficient (2)
                                                                                                       • Tanimoto
                       Select one compound
                                                                          Iterate all high             • Cosine
    Iterate all
    Cutoffs             Calculate the ratio of
                                                                          risk compounds     • Risk categorization (2)
    (total 14)            each compound                                                           • High risk
                    Ratiosimilarity
                                      # neighbors _ with _ same _ class
                                             total _# neighbors
                                                                                                  • Low risk
                                                                                             • Endpoints (9)
                          Average the ratio of                                               • Complexity: 4*2*2*9=144
                          all the compounds

                   Plot: Similarity cutoff
                           & ratio
Workflow for Plotting landscape of an endpoint using FCFP6 and tanimoto as similarity measurement
8 *Molecular   Similarity Characterization of ADME Landscapes; Chen et al., JCIM, Submitted, 2010
                                                 Pfizer Confidential
What are we evaluating?

     Compound ID   Similarity 0.9   Similarity 0.8          Similarity 0.7   …

         PF_1           0.9               0.9                    0.7         …

         PF_2            1                0.5                    0.7         …


         PF_3          0.95               0.8                    0.7         …


          …             …                  …                     …           …


         PF_N          0.91              0.85                   0.68         …


        average        0.95              0.85                    0.7         …

     Calculate the ratio of all compounds, individually.
     Average the ratio of all the compounds at each similarity threshold,
      ignoring the ratio is 0 (either no same class neighbor or no neighbor)
9                                     Pfizer Confidential
Results: Compare Different Endpoints




      (a) ECFC4, Tanimoto, low risk                        (b) ECFC4, Tanimoto, high risk



 • Rate of “fall” of a given curve defines how easy/difficult it would be to modify a compound and
 modify its property i.e. transform a compound from being high risk to low risk or vice versa
 • Compounds in MDR are relatively difficult to come out of a High Risk Class compared to HLM at
 any given similarity cutoff
 •
10 Ratio stays constant after a given certain similarity threshold (i.e. 0.4 in the case of CYP2C9 )
                                              Pfizer Confidential
Results: Compare Different Fingerprints*




       (a) RRCK high risk                                            (b) RRCK low risk


  • Ratio is different among fingerprints, the order is always FCFP6> Atom-
  pairs >ECFC4>MDL

 *Molecular
11            Similarity Characterization of ADME Landscapes; Chen et al., JCIM, Submitted, 2010
                                                Pfizer Confidential
Results: Compare different similarity coefficients




             (a) RRCK Low Risk                         (b) RRCK High Risk




   • Ratio is different among similarity coefficients, the order is always
12 tanimoto>Cosine                     Pfizer Confidential
Use Case: Which one is better to optimize?
                                                                                    MDR:LOW
                 MDR: HIGH
                                                                        N           RRCK:HIGH
                 RRCK: LOW
         O                                                                          …
                 …
     N                                                          N
             N
                                                        N               N       S




                    Probability of Success?
                 MDR: LOW?                                                           MDR:LOW?
                 RRCK: LOW?                                                          RRCK:LOW?
         O       …                                                  N                …
     N
                                                            N
             N
                                                    N               N       S




13                            Pfizer Confidential
Use Case: Data Driven Compound Prioritization?


                    l                          h
                            Ei (ratio)             (1 E j (ratio))
                    i                          j
     ADMET score
                                          l        h




                                                                      CYP2D6

                                                                                CYP2C9
                                                   CYP1A2

                                                            CYP3A4




                                                                                         Aq. Sol.

                                                                                                    cLogP
                               RRCK

                                         MDR
                        HLM




         Compds                                                                                                 # High     SCORE
                                                                                                                 Risk

       Compound1        -      -      +            -        -        -         -         -          -       1            0.688

       Compound2    +          -      -            -        -        -         -         -          -       1            0.694

       Compound3        -      -      -            -        -        -         -         +          +       2            0.623
       Compound4        -      -      -            +        +        -         +         -          -       3            0.627

                   + and - represent high risk and low risk endpoint, respectively
14                                                              Pfizer Confidential
Potential Combinations




     • 4 descriptor types are used
     • 2 similarity metrics are used
     • 9 endpoints,
     • 512 combinations.
     • Overlap means some compounds with higher risk endpoints should go first than those
     with lower e.g.: MDL+Tanimoto Coeff.
15                                         Pfizer Confidential
Results: Ranking matrix




                                                                                     CYP2C9
                                                                        CYP2D6
                                            CYP1A2

                                                           CYP3A4




                                                                                                  Aq. Sol.
                                                                                                                         # high           Score at         Score at




                                                                                                                 cLogP
                     RRCK

                                MDR
           HLM



                                                                                                                              endpoints       similarity       similarity
                                                                                                                                              0.5              0.6


     +           +          +         +              +              +            +            +              +                 9               0.326676         0.275558

     -           +          +         +              +              +            +            +              +                 8               0.372088         0.333456

     +           -          +         +              +              +            +            +              +                 8               0.372646         0.332717

     -           -          +         +              +              +            +            +              +                 7               0.418058         0.390616

     +           +          -         +              +              +            +            +              +                 8               0.374459         0.336353

     ...         ...        ...       ...            ...            ...          ...          ...            ...              …           …                …

     -           -          +         +              +              +            +            +              +                 1               0.679591         0.714969

     +           +          -         -              -              -            -            -              -                 2               0.635992         0.660706

     -           +          -         -              -              -            -            -              -                 1               0.681403         0.718605

     +           -          -         -              -              -            -            -              -                 1               0.681962         0.717866

     -           -          -         -              -              -            -            -              -                 0               0.727373         0.775765

            • + and - represent high risk and low risk endpoint, respectively
16          • totally, 9 endpoints and 512 combinations Pfizer Confidential
Conclusion
      Small structural changes result in change of class
       (High/Low Risk) within a given endpoint
      Different endpoints behave differently from each other
       e.g. MDR may be difficult to modify than CYP2C9
      Curves are relatively parallel to each other independent
       of descriptor and similarity metric
      Derived scoring function out of the plots to prioritize
       compounds (for screening or series selection)
      Ratios could be used for differentiating between
       “difficult” endpoints versus “easy” endpoints
                              1.0
                      Ratio




                                    Difficult

                              0.5



                                    Easy



                                           0.2          0.5            0.8   1
17                                               Pfizer Confidential
                                                 Similarity cutoff
Reference
        Martin YC et al. Do Structurally Similar Molecules Have Similar Biological
         Activity?. J. Med. Chem. 2002, 45, 4350-4358
        Medina-Franco, JL; et al. Characterization of Activity Landscapes Using 2D and
         3D Similarity Methods: Consensus activity Cliffs. J. Chem. Inf. Model. 2009, 49,
         477-491
        Segall MD, et al. Focus on Success: Using a Probabilistic Approach to Achieve
         an Optimal Balance of Compound Properties in Drug Discovery. Expert Opin.
         Drug Metab. Toxicol. 2006, 2, 325-37




18                                       Pfizer Confidential
Acknowledgement

        David Wild (School of Informatics and Computing, Indiana University)
        Veerabahu Shanmugasundaram (AB RU)
        Robyn Ayscue
        Hua Gao




19                                  Pfizer Confidential
Thanks
Questions and Comments
Results
     RRCK, ECFC4, Tanimoto, High Risk                    RRCK, ECFC4, Tanimoto, Low Risk




 Heatmap for ratios of all compounds at 14 similarity cutoffs
21                                 Pfizer Confidential
Discussion & further work

    Normal distribution
    Outliers analysis
    Ranking function validation
    Implementation
      On virtue of full matrix and ADME predictive
       model, any given compound can be assigned a
       score for prioritization




22                         Pfizer Confidential
Backup—Normal distribution
                                                           700



                                                                                                                                                620

                                                           600

                                                                                                                       554       548
                                                                                                                 523

                                                           500
                                                                                                                                       456


                                                                                                                                                      397   396
                                                           400


                                                                                                           331
                                                                                                                                                                                                             308
                                                           300
                                                                                                                                                                  263


                                                                                                                                                                                           212
                                                                                                                                                                            198
                                                           200                                       185
                                                                                                                                                                                                 161
                                                                                                                                                                                  148                  150

                                                                                            121
                                                                             104      101
                                                           100

                                                                        42


                                                                0
                                                                    0    0.05   0.1     0.15   0.2     0.25   0.3   0.35   0.4     0.45   0.5     0.55   0.6   0.65   0.7     0.75   0.8     0.85   0.9   0.95     1
                                                                                                                           Binned Ratio


     RRCK, ECFC4, high, similarity 0.85                             RRCK, ECFC4, high, similarity 0.65




23                                        Pfizer Confidential

Mais conteúdo relacionado

Último

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Destaque

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Destaque (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Molecular Similarity Characterization of ADME Landscapes

  • 1. Molecular Similarity Characterization of ADME Landscapes ACS Annual Meeting San Francisco 2010 Bin Chen‡, Rishi Gupta* and Eric Gifford† ‡ School of Informatics and Computing, Indiana University, Bloomington, IN 47408 * Anti Bacterial Research Unit, Pfizer Global R&D, Groton, CT 06340 † Computational Sciences CoE, Pfizer Global R&D, Groton, CT 06340 Pfizer Confidential
  • 2. Outline  Introduction  Methods  Results & discussions  Use cases  Conclusions 2 Pfizer Confidential
  • 3. What has been done so far? A lot of excellent work in the Activity space using a variety of similarity methods and descriptors Current work focuses primarily on ADME end points and Molecular properties while examining various descriptor types and similarity methods 3 Pfizer Confidential
  • 4. Do similar compounds have similar ADME properties? Similar ADME Varies based on Similarity 0.92 Similarity 0.85 Properties? descriptors used O OH OH OH 0.9 0.8 0.7 4 Pfizer Confidential
  • 5. Do different ADME endpoints have different landscapes? # neighbors with same class Probe Compound Ratiosimilarity # total neighbors High Risk Compound HLM Low Risk Compound 4 Ratio0.9 0.8 0.9 0.8 0.7 5 8 Ratio0.8 0.67 12 RRCK 4 Ratio0.9 0.8 0.9 0.8 0.7 5 9 Ratio0.8 0.75 12 5 Pfizer Confidential
  • 6. Hypothesis: Visualizing Chemical Landscape Identical Compounds Ratio ~1.0 1.0 Ratio Endpoint1 0.5 Endpoint2 Endpoint3 Endpoint4 Ratio=f(endpoint, similarity) Ratio ~ High (low) 0.2 0.5 0.8 1 risk compounds/total Similarity cutoff compounds 6 Pfizer Confidential
  • 7. Datasets, Assays and Bins Endpoint Description Result unit Low Risk High Risk -6 RRCK passive permeability in RRCK cell line 10 cm/sec >10 <=10 HLM metabolic stability using human liver microsomes µL/min/mg <20 >=20 -6 MDR Pgp influenced permeability and 10 cm/sec >10 <=10 efflux in MDCK-MDR1 cells CYP1A2 CYP1A2 inhibition in a substrate cocktail assay % Inhibition <10 >=10 CYP3A4 CYP3A4 inhibition in a substrate cocktail assay % Inhibition <10 >=10 CYP2D6 CYP2D6inhibition in a substrate cocktail assay % Inhibition <10 >=10 CYP2C9 CYP2C9 inhibition in a substrate cocktail assay % Inhibition <10 >=10 *Solubility ADMET Aqueous Solubility properties Solubility level >2 <=2 *cLogP logarithm partition coefficient Octanol-Water <3 >=3 Partition Coefficient • Full matrix consisting of 17787 compounds and 9 endpoints • Solubility and cLogP are predicted endpoints using in-house computational models on datasets with more than 10K compounds ,the rest are experimental results 7 Pfizer Confidential
  • 8. Characterize Chemical Landscape: Proposed Workflow* Full matrix FCFP6 Similarity • Structure similarity (cmpd*endpoint) Tanimoto matrix • Fingerprint (4) Select all high/low • MDL public keys risk compounds in an • Atom pairs Endpoint • FCFP6 Select one similarity • ECFC4 cutoff • Coefficient (2) • Tanimoto Select one compound Iterate all high • Cosine Iterate all Cutoffs Calculate the ratio of risk compounds • Risk categorization (2) (total 14) each compound • High risk Ratiosimilarity # neighbors _ with _ same _ class total _# neighbors • Low risk • Endpoints (9) Average the ratio of • Complexity: 4*2*2*9=144 all the compounds Plot: Similarity cutoff & ratio Workflow for Plotting landscape of an endpoint using FCFP6 and tanimoto as similarity measurement 8 *Molecular Similarity Characterization of ADME Landscapes; Chen et al., JCIM, Submitted, 2010 Pfizer Confidential
  • 9. What are we evaluating? Compound ID Similarity 0.9 Similarity 0.8 Similarity 0.7 … PF_1 0.9 0.9 0.7 … PF_2 1 0.5 0.7 … PF_3 0.95 0.8 0.7 … … … … … … PF_N 0.91 0.85 0.68 … average 0.95 0.85 0.7 …  Calculate the ratio of all compounds, individually.  Average the ratio of all the compounds at each similarity threshold, ignoring the ratio is 0 (either no same class neighbor or no neighbor) 9 Pfizer Confidential
  • 10. Results: Compare Different Endpoints (a) ECFC4, Tanimoto, low risk (b) ECFC4, Tanimoto, high risk • Rate of “fall” of a given curve defines how easy/difficult it would be to modify a compound and modify its property i.e. transform a compound from being high risk to low risk or vice versa • Compounds in MDR are relatively difficult to come out of a High Risk Class compared to HLM at any given similarity cutoff • 10 Ratio stays constant after a given certain similarity threshold (i.e. 0.4 in the case of CYP2C9 ) Pfizer Confidential
  • 11. Results: Compare Different Fingerprints* (a) RRCK high risk (b) RRCK low risk • Ratio is different among fingerprints, the order is always FCFP6> Atom- pairs >ECFC4>MDL *Molecular 11 Similarity Characterization of ADME Landscapes; Chen et al., JCIM, Submitted, 2010 Pfizer Confidential
  • 12. Results: Compare different similarity coefficients (a) RRCK Low Risk (b) RRCK High Risk • Ratio is different among similarity coefficients, the order is always 12 tanimoto>Cosine Pfizer Confidential
  • 13. Use Case: Which one is better to optimize? MDR:LOW MDR: HIGH N RRCK:HIGH RRCK: LOW O … … N N N N N S Probability of Success? MDR: LOW? MDR:LOW? RRCK: LOW? RRCK:LOW? O … N … N N N N N S 13 Pfizer Confidential
  • 14. Use Case: Data Driven Compound Prioritization? l h Ei (ratio) (1 E j (ratio)) i j ADMET score l h CYP2D6 CYP2C9 CYP1A2 CYP3A4 Aq. Sol. cLogP RRCK MDR HLM Compds # High SCORE Risk Compound1 - - + - - - - - - 1 0.688 Compound2 + - - - - - - - - 1 0.694 Compound3 - - - - - - - + + 2 0.623 Compound4 - - - + + - + - - 3 0.627 + and - represent high risk and low risk endpoint, respectively 14 Pfizer Confidential
  • 15. Potential Combinations • 4 descriptor types are used • 2 similarity metrics are used • 9 endpoints, • 512 combinations. • Overlap means some compounds with higher risk endpoints should go first than those with lower e.g.: MDL+Tanimoto Coeff. 15 Pfizer Confidential
  • 16. Results: Ranking matrix CYP2C9 CYP2D6 CYP1A2 CYP3A4 Aq. Sol. # high Score at Score at cLogP RRCK MDR HLM endpoints similarity similarity 0.5 0.6 + + + + + + + + + 9 0.326676 0.275558 - + + + + + + + + 8 0.372088 0.333456 + - + + + + + + + 8 0.372646 0.332717 - - + + + + + + + 7 0.418058 0.390616 + + - + + + + + + 8 0.374459 0.336353 ... ... ... ... ... ... ... ... ... … … … - - + + + + + + + 1 0.679591 0.714969 + + - - - - - - - 2 0.635992 0.660706 - + - - - - - - - 1 0.681403 0.718605 + - - - - - - - - 1 0.681962 0.717866 - - - - - - - - - 0 0.727373 0.775765 • + and - represent high risk and low risk endpoint, respectively 16 • totally, 9 endpoints and 512 combinations Pfizer Confidential
  • 17. Conclusion  Small structural changes result in change of class (High/Low Risk) within a given endpoint  Different endpoints behave differently from each other e.g. MDR may be difficult to modify than CYP2C9  Curves are relatively parallel to each other independent of descriptor and similarity metric  Derived scoring function out of the plots to prioritize compounds (for screening or series selection)  Ratios could be used for differentiating between “difficult” endpoints versus “easy” endpoints 1.0 Ratio Difficult 0.5 Easy 0.2 0.5 0.8 1 17 Pfizer Confidential Similarity cutoff
  • 18. Reference  Martin YC et al. Do Structurally Similar Molecules Have Similar Biological Activity?. J. Med. Chem. 2002, 45, 4350-4358  Medina-Franco, JL; et al. Characterization of Activity Landscapes Using 2D and 3D Similarity Methods: Consensus activity Cliffs. J. Chem. Inf. Model. 2009, 49, 477-491  Segall MD, et al. Focus on Success: Using a Probabilistic Approach to Achieve an Optimal Balance of Compound Properties in Drug Discovery. Expert Opin. Drug Metab. Toxicol. 2006, 2, 325-37 18 Pfizer Confidential
  • 19. Acknowledgement  David Wild (School of Informatics and Computing, Indiana University)  Veerabahu Shanmugasundaram (AB RU)  Robyn Ayscue  Hua Gao 19 Pfizer Confidential
  • 21. Results RRCK, ECFC4, Tanimoto, High Risk RRCK, ECFC4, Tanimoto, Low Risk Heatmap for ratios of all compounds at 14 similarity cutoffs 21 Pfizer Confidential
  • 22. Discussion & further work  Normal distribution  Outliers analysis  Ranking function validation  Implementation  On virtue of full matrix and ADME predictive model, any given compound can be assigned a score for prioritization 22 Pfizer Confidential
  • 23. Backup—Normal distribution 700 620 600 554 548 523 500 456 397 396 400 331 308 300 263 212 198 200 185 161 148 150 121 104 101 100 42 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 Binned Ratio RRCK, ECFC4, high, similarity 0.85 RRCK, ECFC4, high, similarity 0.65 23 Pfizer Confidential