SlideShare uma empresa Scribd logo
1 de 12
http://pistoiaalliance.org @PistoiaAlliance
Pistoia Alliance HELM Project
- What About the Big Guys?
The emerging HELM standard for macromolecular
representation
Domain Lead – Sergio Rotstein
Business Technology, Pfizer
What is a “Biomolecule”?
2
Peptides
Therapeutic
Proteins
ADCs
Antibodies
Vaccines
ASOs
siRNAs
For our purposes, anything
that is not a small molecule is
a biomolecule
Goal
• Eliminate biomolecule
penalty
• Make these entities first-
class citizens of the
Informatics tool portfolio
G
A
P
So what’s the problem?
3
N
NH
O
O
O
N
NH
O
O
O
Small
Molecules
Sequences
Biomolecules
Small Molecule Tools Sequence-Based Tools
“Fit-for-Purpose” Structure Representation
We need to enable the
representation, manipulation and
visualization of each molecule type in
a way that is appropriate for its size
and complexity
4
Fit for Purpose: “Monomer” Level
• While you could draw out an oligonucleotide like this:
• The representation is likely more intuitive / practical:
5
Fit for Purpose: Sequence Level
• But even the monomer level representation would not scale well to
proteins with hundreds of amino acids. Larger molecules require a
more sequence-oriented representation:
6
Fit for Purpose: Component Level
• For multi-component structures such as antibody drug
conjugates, component level representations are required to enable
each component to dealt with separately.
7
“Collapsed” Antibody
Expanded Drug
Ab
Hierarchical Editing Language for Macromolecules
– Hierarchical – Amenable to the various “levels”
• Complex Polymer ⇒ Simple Polymer ⇒ Monomer ⇒ Atom
– Extensible
• Allowing addition of new biopolymer types
– (Reasonably) comprehensive
• e.g. Allowing representation of oligonucleotide
hybridization
– Canonicalizable
• Facilitating uniqueness checking
– (Somewhat) human-readable
8
HELM Example: Simple polymer
• HELM notation: A.R.G.[dF].C.K.[ahA].E.D.A
– Non-natural amino acid codes are enclosed in square
brackets
• Natural equivalent: ARGFCKXEDA
9
HELM Example: Complex Polymer
10
Monomer Database
• Each monomer used in the notation needs to be predefined in a
monomer database
• The database includes the chemical structure of the monomer and
a description of all acceptable attachment points
11
J. Chem. Inf. Model 2012, 52, 2796-2806
12

Mais conteúdo relacionado

Semelhante a HELM Notation Overview

Drug R&D Portfolio Challenges
Drug R&D Portfolio ChallengesDrug R&D Portfolio Challenges
Drug R&D Portfolio Challengesmeijia_yang
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Adam Ford
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardEMBL-ABR
 
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...Medicines Discovery Catapult
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgePaul Agapow
 
Biovays Discovery Summit Presentation
Biovays Discovery Summit PresentationBiovays Discovery Summit Presentation
Biovays Discovery Summit PresentationIguanaBio Iguana
 
Session 3 part 5
Session 3 part 5Session 3 part 5
Session 3 part 5plmiami
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxashharnomani
 
How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning Skyl.ai
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyNeil Swainston
 
Fake news detection
Fake news detection Fake news detection
Fake news detection shalushamil
 
Multi-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionMulti-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionAladdin Ayesh
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchQIAGEN
 
Lecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxLecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxSangeetaTripathi8
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Lee Larcombe
 

Semelhante a HELM Notation Overview (20)

Drug R&D Portfolio Challenges
Drug R&D Portfolio ChallengesDrug R&D Portfolio Challenges
Drug R&D Portfolio Challenges
 
Innovation og værdiskabelse i it-projekter
Innovation og værdiskabelse i it-projekterInnovation og værdiskabelse i it-projekter
Innovation og værdiskabelse i it-projekter
 
Session ii g2 lab modeling mmc
Session ii g2 lab modeling mmcSession ii g2 lab modeling mmc
Session ii g2 lab modeling mmc
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra Orchard
 
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
MDC Connects Series 2021 | A Guide to Complex Medicines: Developing the assay...
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledge
 
Biovays Discovery Summit Presentation
Biovays Discovery Summit PresentationBiovays Discovery Summit Presentation
Biovays Discovery Summit Presentation
 
Neolite Business Credential
Neolite Business CredentialNeolite Business Credential
Neolite Business Credential
 
Session 3 part 5
Session 3 part 5Session 3 part 5
Session 3 part 5
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
 
How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning How to Implement Biomedical Named Entity Recognition with Machine Learning
How to Implement Biomedical Named Entity Recognition with Machine Learning
 
Neo4j and bioinformatics
Neo4j and bioinformaticsNeo4j and bioinformatics
Neo4j and bioinformatics
 
Switching from academia to industry - and back
Switching from academia to industry - and backSwitching from academia to industry - and back
Switching from academia to industry - and back
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
 
Fake news detection
Fake news detection Fake news detection
Fake news detection
 
Multi-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognitionMulti-Agent Modelling With applications to robotics and cognition
Multi-Agent Modelling With applications to robotics and cognition
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome Research
 
Lecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptxLecture1-Introduction-Jan18-2021.pptx
Lecture1-Introduction-Jan18-2021.pptx
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014
 

Último

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 

Último (20)

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 

HELM Notation Overview

  • 1. http://pistoiaalliance.org @PistoiaAlliance Pistoia Alliance HELM Project - What About the Big Guys? The emerging HELM standard for macromolecular representation Domain Lead – Sergio Rotstein Business Technology, Pfizer
  • 2. What is a “Biomolecule”? 2 Peptides Therapeutic Proteins ADCs Antibodies Vaccines ASOs siRNAs For our purposes, anything that is not a small molecule is a biomolecule Goal • Eliminate biomolecule penalty • Make these entities first- class citizens of the Informatics tool portfolio
  • 3. G A P So what’s the problem? 3 N NH O O O N NH O O O Small Molecules Sequences Biomolecules Small Molecule Tools Sequence-Based Tools
  • 4. “Fit-for-Purpose” Structure Representation We need to enable the representation, manipulation and visualization of each molecule type in a way that is appropriate for its size and complexity 4
  • 5. Fit for Purpose: “Monomer” Level • While you could draw out an oligonucleotide like this: • The representation is likely more intuitive / practical: 5
  • 6. Fit for Purpose: Sequence Level • But even the monomer level representation would not scale well to proteins with hundreds of amino acids. Larger molecules require a more sequence-oriented representation: 6
  • 7. Fit for Purpose: Component Level • For multi-component structures such as antibody drug conjugates, component level representations are required to enable each component to dealt with separately. 7 “Collapsed” Antibody Expanded Drug Ab
  • 8. Hierarchical Editing Language for Macromolecules – Hierarchical – Amenable to the various “levels” • Complex Polymer ⇒ Simple Polymer ⇒ Monomer ⇒ Atom – Extensible • Allowing addition of new biopolymer types – (Reasonably) comprehensive • e.g. Allowing representation of oligonucleotide hybridization – Canonicalizable • Facilitating uniqueness checking – (Somewhat) human-readable 8
  • 9. HELM Example: Simple polymer • HELM notation: A.R.G.[dF].C.K.[ahA].E.D.A – Non-natural amino acid codes are enclosed in square brackets • Natural equivalent: ARGFCKXEDA 9
  • 10. HELM Example: Complex Polymer 10
  • 11. Monomer Database • Each monomer used in the notation needs to be predefined in a monomer database • The database includes the chemical structure of the monomer and a description of all acceptable attachment points 11
  • 12. J. Chem. Inf. Model 2012, 52, 2796-2806 12

Notas do Editor

  1. Paper will soon be posted on the upcoming HELM web site.