Understanding SMILES
SMILES Review
There are six generic SMILES encoding rules, corresponding to
specification of atoms, bonds, branches, ring closures,
disconnections, and isomerism.
SMILES RULES
Basics of SMILES
• SMILES specifically represents a valence model of a molecule, not a
computer data structure, a mathematical abstraction, or an “actual
substance”.
• The function of SMILES is to clearly represent a particular valence
model, not to dictate which one should be used. For example, one chemist
might represent nitromethane as C[N+](=O)[O-], with a nitrogen of valence
4 in a charge-separated structure, whereas another might represent it as
CN(=O)=O, with a neutral five-valent nitrogen. Both are correct.
QUIZ: Is CN([O])[O] a correct SMILES for nitromethane?
• SMILES represents a chemist’s model of molecules, not a computer
scientist’s model of a chemical data structure.
• SMILES grammar is such that it may be canonicalized, i.e., among all
possible valid SMILES for a given molecule or reaction, a single, canonical
(unique) SMILES can be chosen.
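The two nitromethane encodings above differ only in the valence assigned to nitrogen. The following is a minimal sketch, not a full SMILES parser, that sums the explicit bond orders on each atom of an acyclic SMILES. The function name `explicit_valences` and the token pattern are assumptions made for illustration; bracket atoms are treated as opaque tokens and ring closures are not handled.

```python
import re

# Bracket atoms, organic-subset atoms, bond symbols and parentheses only.
TOKEN = re.compile(r"\[[^\]]+\]|Cl|Br|[BCNOPSFI]|[-=#]|[()]")
BOND_ORDER = {"-": 1, "=": 2, "#": 3}


def explicit_valences(smiles):
    """Return [(atom_token, sum_of_explicit_bond_orders), ...] in input order."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError("unsupported character in SMILES: " + smiles)
    atoms, valence, stack = [], [], []
    prev, pending = None, 1  # previous atom index, order of the next bond
    for tok in tokens:
        if tok == "(":
            stack.append(prev)          # remember the branch point
        elif tok == ")":
            prev = stack.pop()          # return to the branch point
        elif tok in BOND_ORDER:
            pending = BOND_ORDER[tok]
        else:
            atoms.append(tok)
            valence.append(0)
            if prev is not None:
                valence[prev] += pending
                valence[-1] += pending
            prev, pending = len(atoms) - 1, 1
    return list(zip(atoms, valence))


print(explicit_valences("C[N+](=O)[O-]"))  # nitrogen carries bond-order sum 4
print(explicit_valences("CN(=O)=O"))       # nitrogen carries bond-order sum 5
```

Both strings describe the same connectivity; only the chosen valence model differs.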
1. Atom Specifications
• The SMILES atom specification sublanguage represents the atomic
properties: element identity, isotope, formal charge, and implicit hydrogen
count.
• Elements in the "organic subset" B, C, N, O, P, S, F, Cl, Br, and I may be
written without brackets if the number of attached hydrogens conforms to
the lowest normal valence consistent with explicit bonds. "Lowest normal
valences" are B (3), C (4), N (3,5), O (2), P (3,5), S (2,4,6), and 1 for the
halogens.
• Atoms in aromatic rings are specified by lower case letters, e.g., aliphatic
carbon is represented by the capital letter C, aromatic carbon by lower case
c.
• The symbol ‘*’ (“star” or “asterisk”) is treated by SMILES as a valid atomic
symbol meaning “unspecified atomic number” and is represented as an
atom of atomic number zero.
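The "lowest normal valence" rule for unbracketed atoms can be sketched as a small lookup. This is an illustrative sketch under the valence list given above; the names `NORMAL_VALENCES` and `implicit_hydrogens` are hypothetical, and real toolkits may treat edge cases (for example aromatic atoms) differently.

```python
# "Lowest normal valences" for the organic subset, as listed above.
NORMAL_VALENCES = {
    "B": (3,), "C": (4,), "N": (3, 5), "O": (2,),
    "P": (3, 5), "S": (2, 4, 6),
    "F": (1,), "Cl": (1,), "Br": (1,), "I": (1,),
}


def implicit_hydrogens(element, explicit_bond_order_sum):
    """Hydrogens implied for an unbracketed atom with the given explicit bonds."""
    for valence in NORMAL_VALENCES[element]:
        # Pick the lowest normal valence consistent with the explicit bonds.
        if valence >= explicit_bond_order_sum:
            return valence - explicit_bond_order_sum
    # Assumption: if the bonds exceed every listed valence, add no hydrogens.
    return 0


print(implicit_hydrogens("C", 0))  # isolated "C" is methane
print(implicit_hydrogens("P", 0))  # isolated "P" is phosphine
print(implicit_hydrogens("S", 0))  # isolated "S" is hydrogen sulfide
print(implicit_hydrogens("N", 5))  # nitrogen in CN(=O)=O has no hydrogens
```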
Examples of Atom specifications
Structure | SMILES | Name | Remarks
S | [S] | Elemental sulfur | Defaults inside brackets: mass unspecified, charge 0, hcount 0.
Au | [Au] | Elemental gold | Second character of two-character symbols is lower case.
PH3 | P | Phosphine | Lowest normal valence of phosphorus is 3.
OH- | [OH-] or [OH-1] | Hydroxide anion | If the charge value is missing, 1 is assumed, i.e., ‘+’ is equivalent to ‘+1’ and ‘-’ is equivalent to ‘-1’.
Fe2+ | [Fe+2] or [Fe++] | Iron(II) cation | Charge sign may be repeated or have a signed value, e.g., ‘++’ is equivalent to ‘+2’.
235U | [235U] | Uranium-235 | A leading integer represents a specified atomic mass.
H2S | S | Hydrogen sulfide | Lowest normal valence of sulfur is 2.
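The bracket-atom conventions in the table (optional leading mass, element symbol, hydrogen count, charge) can be sketched with a regular expression. This is a simplified illustration, not the full SMILES grammar: the pattern and the function name `parse_bracket_atom` are assumptions, and chirality marks and atom-map classes are deliberately left out.

```python
import re

BRACKET_ATOM = re.compile(
    r"^\[(?P<isotope>\d+)?"                   # optional leading atomic mass
    r"(?P<symbol>[A-Z][a-z]?|[a-z]{1,2}|\*)"  # element symbol (or aromatic, or *)
    r"(?P<h>H(?P<hcount>\d*))?"               # optional hydrogen count
    r"(?P<charge>\++|-+|[+-]\d+)?\]$"         # '+', '++', '+2', '-', '-1', ...
)


def parse_bracket_atom(text):
    """Split a bracket atom such as '[OH-]' into its atomic properties."""
    match = BRACKET_ATOM.match(text)
    if match is None:
        raise ValueError("not a supported bracket atom: " + text)
    charge_text = match.group("charge") or ""
    if not charge_text:
        charge = 0                            # default inside brackets
    elif charge_text[-1].isdigit():
        charge = int(charge_text)             # signed value, e.g. '+2'
    else:
        sign = 1 if charge_text[0] == "+" else -1
        charge = sign * len(charge_text)      # repeated sign, e.g. '++'
    hcount = int(match.group("hcount") or 1) if match.group("h") else 0
    isotope = match.group("isotope")
    return {
        "isotope": int(isotope) if isotope else None,
        "symbol": match.group("symbol"),
        "hcount": hcount,
        "charge": charge,
    }


for atom in ("[S]", "[OH-]", "[Fe++]", "[Fe+2]", "[235U]"):
    print(atom, parse_bracket_atom(atom))
```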
2. Bond Specifications
• Single, double, triple, and aromatic bonds are represented in SMILES
by the symbols -, =, #, and :, respectively. Adjacent atoms without an
intervening symbol are connected by a valence-dictated bond
(typically a single or aromatic bond). “-” (single) and “:” (aromatic)
bond symbols may always be omitted on input.
• There is no “preferred” or “correct” ordering in SMILES, e.g. CCO and
OCC are equally valid SMILES for ethanol.
SMILES | Name | Formula
CC | Ethane | CH3CH3
C=C | Ethene | CH2=CH2
C#N | Hydrogen cyanide | HCN
CCO | Ethanol | CH3CH2OH
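To illustrate that atom order does not matter and that bond symbols set the hydrogen count, here is a minimal sketch that derives a molecular formula from an unbranched, acyclic SMILES. The helper name `chain_formula` is hypothetical, and only unbracketed organic-subset atoms with -, = and # bonds are supported.

```python
import re
from collections import Counter

VALENCES = {
    "B": (3,), "C": (4,), "N": (3, 5), "O": (2,),
    "P": (3, 5), "S": (2, 4, 6),
    "F": (1,), "Cl": (1,), "Br": (1,), "I": (1,),
}
BOND_ORDER = {"-": 1, "=": 2, "#": 3}
TOKEN = re.compile(r"Cl|Br|[BCNOPSFI]|[-=#]")


def chain_formula(smiles):
    """Count atoms, including implicit hydrogens, in a simple chain SMILES."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError("only unbranched, acyclic organic-subset SMILES are supported")
    elements, used = [], []   # element symbols and explicit bond-order sums
    pending = 1               # an omitted bond symbol means a single bond
    for tok in tokens:
        if tok in BOND_ORDER:
            pending = BOND_ORDER[tok]
            continue
        elements.append(tok)
        used.append(0)
        if len(elements) > 1:  # bond to the previous atom in the chain
            used[-2] += pending
            used[-1] += pending
        pending = 1
    formula = Counter(elements)
    for element, bonds in zip(elements, used):
        valence = next((v for v in VALENCES[element] if v >= bonds), bonds)
        formula["H"] += valence - bonds
    return formula


for smiles in ("CC", "C=C", "C#N", "CCO", "OCC"):
    print(smiles, dict(chain_formula(smiles)))
```

CCO and OCC give the same atom counts, which is consistent with both being valid SMILES for ethanol.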
3. Branching
SMILES | Name
CC(C)C(=O)O | Isobutyric acid
FC(F)F or C(F)(F)F | Fluoroform
?? | Perchlorate anion
Branches are specified in SMILES by enclosing them in parentheses, which
may be nested or stacked. The first three rules (atoms, bonds, branching) allow
specification of any non-cyclic molecule.
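Parentheses map naturally onto a stack: an opening parenthesis remembers the current atom, and a closing one returns to it. The sketch below assumes unbracketed organic-subset atoms only; `parse_branched` and `neighbour_elements` are hypothetical helper names, and the neighbour comparison is a quick sanity check rather than a true graph-isomorphism test.

```python
import re

TOKEN = re.compile(r"Cl|Br|[BCNOPSFI]|[-=#]|[()]")
BOND_ORDER = {"-": 1, "=": 2, "#": 3}


def parse_branched(smiles):
    """Return (atoms, bonds) with bonds given as (from_index, to_index, order)."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError("unsupported character in SMILES: " + smiles)
    atoms, bonds, stack = [], [], []
    prev, pending = None, 1
    for tok in tokens:
        if tok == "(":
            stack.append(prev)       # branch starts at the current atom
        elif tok == ")":
            prev = stack.pop()       # branch ends, go back to its root
        elif tok in BOND_ORDER:
            pending = BOND_ORDER[tok]
        else:
            atoms.append(tok)
            index = len(atoms) - 1
            if prev is not None:
                bonds.append((prev, index, pending))
            prev, pending = index, 1
    return atoms, bonds


def neighbour_elements(smiles):
    """Sorted (element, neighbour elements) pairs, independent of atom order."""
    atoms, bonds = parse_branched(smiles)
    neighbours = [[] for _ in atoms]
    for i, j, _order in bonds:
        neighbours[i].append(atoms[j])
        neighbours[j].append(atoms[i])
    return sorted((atoms[i], tuple(sorted(n))) for i, n in enumerate(neighbours))


print(parse_branched("CC(C)C(=O)O"))
print(neighbour_elements("FC(F)F") == neighbour_elements("C(F)(F)F"))
```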
4. Ring Specifications
Taken from the Handbook of Chemoinformatics, J. Gasteiger (ed.)
A useful way of thinking about SMILES ring specification is as follows. There is
a graph theorem that says, “There is always a way of breaking one bond per ring in
a connected molecule which leaves you with a still-connected but acyclic molecule.”
(Actually, graph theoreticians talk about “graphs” instead of “molecules” and “edges”
instead of “bonds”, but if they thought about chemistry, that is how
they might say it.) Pick one bond in each ring in this way, numbering them in any
order. Break the numbered bonds, appending the bond number to the atoms on
the ends of the bonds so broken.
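Reading a SMILES reverses this procedure: each matching pair of ring-closure digits restores one broken bond. The sketch below is a simplified illustration (single-digit closures, unbracketed atoms; `parse_rings` and `ring_count` are hypothetical names). For a connected molecule, the number of bonds minus the number of atoms plus one gives the number of bonds that had to be broken.

```python
import re

TOKEN = re.compile(r"Cl|Br|[BCNOPSFI]|[bcnops]|[-=#:]|[()]|\d")


def parse_rings(smiles):
    """Return (atom_count, bonds) where bonds is a list of (i, j) index pairs."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError("unsupported character in SMILES: " + smiles)
    atom_count, bonds, stack = 0, [], []
    open_rings = {}  # ring-closure digit -> atom where the bond was "broken"
    prev = None
    for tok in tokens:
        if tok == "(":
            stack.append(prev)
        elif tok == ")":
            prev = stack.pop()
        elif tok.isdigit():
            if tok in open_rings:
                bonds.append((open_rings.pop(tok), prev))  # restore broken bond
            else:
                open_rings[tok] = prev
        elif tok in "-=#:":
            continue  # bond order is not needed for counting rings
        else:
            if prev is not None:
                bonds.append((prev, atom_count))
            prev = atom_count
            atom_count += 1
    if open_rings:
        raise ValueError("unmatched ring-closure digit in " + smiles)
    return atom_count, bonds


def ring_count(smiles):
    """Bonds broken to make a connected molecule acyclic: E - V + 1."""
    atom_count, bonds = parse_rings(smiles)
    return len(bonds) - atom_count + 1


for smiles in ("CCO", "C1CCCCC1", "C1CCC=CC1", "c1cc2ccccc2cc1",
               "c12c(cccc1)cccc2", "C1=CC2=C(C=C1)C=CC=C2"):
    print(smiles, parse_rings(smiles)[0], ring_count(smiles))
```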
Examples
SMILES | Name
C1CCCCC1 | Cyclohexane
C1CCC=CC1, C1=CCCCC1 or C1CCCC=C1 | Cyclohexene
c1cc2ccccc2cc1, c12c(cccc1)cccc2 or C1=CC2=C(C=C1)C=CC=C2 | Naphthalene
?? | Biphenyl
5. Disconnections
• The ‘.’ (“period” or “dot”) is used in SMILES to represent
disconnections. In terms of the valence model being
represented, the dot literally represents a bond of formal
order zero: the atoms on either side of the dot are explicitly
not bonded to each other.
• It is often a surprise to SMILES-parser implementers that
c1cc([O-].[Na+])ccc1 is a valid synonym for [Na+].[O-]c1ccccc1.
Because bonds can be specified with “ring closures”,
not all SMILES which contain dots are disconnected, nor are all
SMILES which contain ring closures cyclic. Although somewhat
perverse, C1.O2.C12 is a valid SMILES for ethanol.
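The interplay of dots and ring closures can be checked by counting connected components. This is a minimal sketch using a union-find structure; `count_components` is a hypothetical name, bracket atoms are treated as opaque tokens, and only single-digit ring closures are handled.

```python
import re

TOKEN = re.compile(r"\[[^\]]+\]|Cl|Br|[BCNOPSFI]|[bcnops]|[-=#:]|[().]|\d")


def count_components(smiles):
    """Number of disconnected fragments described by a SMILES string."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError("unsupported character in SMILES: " + smiles)
    parent = []  # union-find forest, one entry per atom

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    def union(i, j):
        parent[find(i)] = find(j)

    stack, open_rings = [], {}
    prev, dotted = None, False
    for tok in tokens:
        if tok == "(":
            stack.append(prev)
        elif tok == ")":
            prev = stack.pop()
        elif tok == ".":
            dotted = True                # next atom is NOT bonded to prev
        elif tok.isdigit():
            if tok in open_rings:
                union(open_rings.pop(tok), prev)  # ring closure is a real bond
            else:
                open_rings[tok] = prev
        elif tok in "-=#:":
            continue
        else:
            index = len(parent)
            parent.append(index)
            if prev is not None and not dotted:
                union(prev, index)
            prev, dotted = index, False
    return len({find(i) for i in range(len(parent))})


for smiles in ("c1cc([O-].[Na+])ccc1", "[Na+].[O-]c1ccccc1", "C1.O2.C12", "CCO"):
    print(smiles, count_components(smiles))
```

The two sodium phenoxide strings each yield two fragments, while C1.O2.C12 yields a single connected fragment despite its dots.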
6. Isomerism
• SMILES provides for four types of specification which are so
important to the molecular model that they are included even
though they are outside the valence model. They are: isotopism,
orientation about double bonds, stereo specification, and (for
reactions) reactant-product atom mapping. These are
collectively known as “isomeric SMILES”.
Check
http://www.daylight.com/dayhtml/doc/theory/theory.smiles.html
