Looking closer at the failed samples provides some insight into the nature of each failure. ID #3 failed specifically because of an unassigned 2D peak appearing at 17.8 ppm (13C) and 2.2 ppm (1H). It was determined that this peak was due to the aromatic methyl group in the structure. The inconsistency between the predicted and experimental chemical shifts in this case was due to slow rotation around the N-CO bond. As a result, the rotamers produce an experimental spectrum that looks like that of a mixture. The software was unable to accurately predict this mixture of forms based on the experimental conditions, so the predicted spectrum did not match the experimental one and the sample was flagged for manual analysis.

Figure 2. ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond. The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts.

The software also flagged sample ID #7 for closer inspection. In this case, the issue with the spectrum was determined to be the presence of a mixture. Based on spectral features in both the 1H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol, resulting in a mixture of the brominated and hydroxylated products. As a result, the software correctly identified this spectrum as not being consistent with the proposed structure, and it was flagged for manual analysis.

Figure 3. ID #7 was flagged as an ambiguous result. Spectral features in the 1H and HSQC–DEPT datasets suggest a mixture of the two compounds shown above.

The final sample in the blind test set (ID #10) was also flagged by the software.
In this particular case, the software was unable to identify two protons in the experimental spectrum because it had correctly set a dark region over a large water peak. Unfortunately, creating this dark region excluded an important multiplet in the experimental spectrum that lay close to the water peak. Because of this dark region, the software was unable to confirm a match between the spectrum and structure.

Table 1. The results of the 19 Aldrich datasets. For this dataset, 13 of the 19 datasets (68%) were automatically evaluated by the software.

The software was unable to confirm the proposed structure in Figure 1 (ID #8). Upon closer inspection, it was observed that the software failed because the experimental peak located at 4.74 ppm, corresponding to atom #11 in the proposed structure, had an integration value that was too low to assign correctly. As a result, the software flagged this result as ambiguous. It was later observed that the low integration value could be due to enol formation. A longer relaxation delay might have produced an experimental spectrum better suited to automatic evaluation by the software.

Figure 1. ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low.

A second compound, ID #11, was rejected by the software as well. Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components. In these two examples, the software did a good job of flagging the two problem spectra that required a closer look.

Results of the Blind Test
After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in exactly the same fashion. No changes to the processing or verification parameters were made. The results of this test are shown in Table 2.

Table 2. The results of the 10 blind Aldrich datasets. For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software.

Validating Automated Structure Confirmation in a Blind Study
Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin
Advanced Chemistry Development, Inc., Toronto, ON, Canada
110 Yonge Street, 14th floor, Toronto, Ontario, Canada M5C 1T4
Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com

Figure 6. ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum.

CONCLUSIONS
The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds. The processing and evaluation settings for a typical group of samples were set up using a pilot set of 19 compounds. Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions. The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotamers and impurities are filtered out before the NMR verification step, and by up to 70% when these problem samples are left in. This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist. These examples illustrate the software’s discrimination ability, which helps reduce the risk of false positives. The results of this blind study suggest that a fully automated processing and interpretation system can perform sufficiently well in an industrial environment.

ACKNOWLEDGEMENTS
The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing NMR data for the compounds in this study.

REFERENCES
1. Automated Structure Verification Based on 1H NMR Prediction, Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer. Magn. Reson. Chem. 2006; 44: 524-538.
2. Automated Evaluation of a Chemical Structure with Only 1D 1H and 2D 1H–13C HSQC, Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer. ENC Poster 2006.

INTRODUCTION
In previous work, we have presented several findings on the automated evaluation of chemical structures using 1H, 13C, and 2D NMR verification algorithms [1, 2]. These studies have shown that these systems performed extremely well through numerous challenges. The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test. This study was designed to show that such a system would hold up in an industrial environment without any human intervention.

This study consisted of two distinct sets of structures and spectra. The first contained 19 datasets (each comprising 1D 1H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options. This step was necessary to identify the best software settings based on the instrument and data collection practices of the laboratory where the samples were prepared and run. Once the first set had been run through the system and the results of the verification procedure obtained, the second, blind test was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance. The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed.
Setting Up Ideal Processing and Evaluation Parameters
For a system to run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks. The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs’ standard macros. These settings proved insufficient, as the datasets contained several abnormally broad water peaks and low signal-to-noise ratios (S/N). The macros were then modified to exclude these water peaks and to apply more stringent peak-picking criteria to combat the S/N issues. The second attempt was an improvement but had some issues with the referencing in one of the 2D datasets. In addition, the 1D spectra were not well resolved, resulting in an inaccurate evaluation of some multiplets. These issues were rectified by decreasing the line broadening setting in the software by a factor of 10. Following this modification, the settings were deemed sufficient.

Results of the First Test
An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches has been previously reported [2]. Following the modification of ACD/Labs’ standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically. The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided. In other words, for this particular dataset, 68% of the samples were automatically evaluated by the software without any human intervention. The remaining 6 samples would require manual analysis by an NMR spectroscopist, as the software had flagged them as being either inconsistent or incorrect (Table 1). Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID# 6, 12, 13, and 15). The other two ambiguous results require a closer look to be explained.
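
The automation rates quoted for the pilot and blind sets follow directly from the counts given in the text. The short Python sketch below checks that arithmetic. The function name `automation_rate` is illustrative, and the "filtered" calculation is an assumption about how the higher workload figure might be reached (dropping the rotamer sample ID #3 and the mixture sample ID #7 from the blind set). It is not a procedure described by the authors.

```python
def automation_rate(evaluated: int, total: int) -> float:
    """Percentage of datasets evaluated without manual review."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * evaluated / total


# Counts stated in the text.
pilot_rate = automation_rate(13, 19)  # pilot set of 19 Aldrich datasets
blind_rate = automation_rate(7, 10)   # blind set of 10 datasets

# Assumption (not stated by the authors): removing the rotamer (ID #3) and
# mixture (ID #7) samples before verification leaves 8 blind samples,
# 7 of which are evaluated automatically.
filtered_rate = automation_rate(7, 10 - 2)

print(f"pilot:    {pilot_rate:.1f}%")
print(f"blind:    {blind_rate:.1f}%")
print(f"filtered: {filtered_rate:.1f}%")
```

Under these assumptions, 13 of 19 works out to about 68.4%, 7 of 10 to 70%, and the filtered blind set to 87.5%, which is consistent with the "up to 90%" figure only as an approximate upper bound.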

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study

Looking closer at the failed samples provides some insight into the nature of the failures. ID #3 failed specifically because of an unassigned 2D peak at 17.8 ppm and 2.2 ppm. It was determined that this peak was due to the aromatic methyl group in the structure. The predicted and experimental chemical shifts were inconsistent in this case because of slow rotation around the N-CO bond. As a result, the rotamers produce an experimental spectrum that looks like a mixture. The software was unable to accurately predict this mixture of forms based on the experimental conditions, so the predicted spectrum did not match the experimental spectrum and the sample was flagged for manual analysis.

Figure 2. ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond. The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts.

The software also flagged sample ID #7 for closer inspection. In this case, the issue with the spectrum was determined to be the presence of a mixture. Based on some of the spectral features in both the ¹H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol, resulting in a mixture of the brominated and hydroxylated products. The software therefore correctly identified this spectrum as inconsistent with the proposed structure and flagged it for manual analysis.

Figure 3. ID #7 was flagged as an ambiguous result. Spectral features in the ¹H and HSQC–DEPT dataset suggest a mixture between the two compounds shown above.

The final sample in the blind test set (ID #10) was also flagged by the software. In this case, the software was unable to identify two protons in the experimental spectrum because it had correctly set a dark region over a large water peak. Unfortunately, the creation of this dark region resulted in the exclusion of an important multiplet in the experimental spectrum that was in close proximity to the water peak. Because of this dark region, the software was unable to confirm a match between the spectrum and structure.

Table 1. The results of the 19 Aldrich datasets. For this dataset, 13 of the 19 datasets (68%) were automatically evaluated by the software.

The software was unable to confirm the proposed structure in Figure 1 (ID #8). Upon closer inspection, it was observed that the software failed because the experimental peak at 4.74 ppm, corresponding to atom #11 in the proposed structure, had an integration value that was too low to assign correctly. As a result, the software flagged this result as ambiguous. It was later observed that the low integration value could be due to enol formation. A longer relaxation delay might have better prepared the experimental spectrum for automatic evaluation by the software.

Figure 1. ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low.

A second compound, ID #11, was rejected by the software as well. Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components. In these two examples, the software did a good job of flagging the two problem spectra that required a closer look.

Results of the Blind Test

After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in exactly the same fashion. No changes to the processing or verification parameters were made. The results of this test are shown in Table 2.

Table 2. The results of the 10 blind Aldrich datasets. For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software.

Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin
Advanced Chemistry Development, Inc., Toronto, ON, Canada
110 Yonge Street, 14th floor, Toronto, Ontario, Canada M5C 1T4
Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com

Figure 6. ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum.

CONCLUSIONS

The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds. The processing and evaluation settings for a typical group of samples were set up using a pilot set of 19 compounds. Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions. The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotamers and impurities are filtered out before the NMR verification step, and by up to 70% when these problem samples are left in. This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist. These examples illustrate the software's discrimination ability, which helps reduce the risk of false positives. The results of this blind study suggest that a fully automated processing and interpretation system can perform sufficiently well in an industrial environment.

ACKNOWLEDGEMENTS

The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing us with NMR data for the compounds in this study.

REFERENCES

1. Automated Structure Verification Based on ¹H NMR Prediction, Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer. Magn. Reson. Chem. 2006; 44: 524-538.
2. Automated Evaluation of a Chemical Structure with Only 1D ¹H and 2D ¹H–¹³C HSQC, Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer. ENC Poster 2006.

INTRODUCTION

In previous work, we have presented several findings on the automated evaluation of chemical structures using ¹H, ¹³C, and 2D NMR verification algorithms [1, 2]. These studies have shown that these systems have performed extremely well through numerous challenges. The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test. This study was designed to show that such a system would hold up in an industrial environment without any human intervention.

This study consisted of two distinct sets of structures and spectra. The first contained 19 spectra sets (each dataset contained 1D ¹H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options. This step was necessary to identify the best software settings based on the instrument and data collection practices of the laboratory where the samples were prepared and run. Once the first set had been run through the system and the results of the verification procedure obtained, the second, blind test was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance. The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed.

Setting Up Ideal Processing and Evaluation Parameters

In order to have a system that can run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks. The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs' standard macros. These settings proved to be insufficient, as the datasets contained several abnormally broad water peaks and low signal-to-noise (S/N) ratios. The macros were then modified to exclude these water peaks and to set more stringent peak picking guidelines to combat the S/N issues. The second attempt was an improvement but had some issues with the referencing in one of the 2D datasets. In addition, the 1D spectra were not well resolved, resulting in an inaccurate evaluation of some multiplets. These issues were rectified by decreasing the line broadening setting in the software by a factor of 10. Following this modification, the settings were deemed to be sufficient.

Results of the First Test

An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches has been previously reported [2]. Following the modification of ACD/Labs' standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically. The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided. In other words, for this particular dataset, 68% of the samples were automatically evaluated by the software without any human intervention. The remaining 6 samples would require manual analysis by an NMR spectroscopist, as the software had flagged them as being either inconsistent or incorrect (Table 1). Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID #6, #12, #13, and #15). The other two ambiguous results require a closer look to be explained.
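
The automation rates quoted for the pilot and blind sets follow from simple counts, and the short Python sketch below recomputes them as a sanity check. It is only an illustration: the function name `automation_rate` and the grouping labels are hypothetical and are not part of the ACD/Labs software, while the sample IDs and counts are taken from the text.

```python
def automation_rate(evaluated: int, total: int) -> float:
    """Return the percentage of datasets evaluated without manual review."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= evaluated <= total:
        raise ValueError("evaluated must be between 0 and total")
    return 100.0 * evaluated / total


# Pilot set: 19 datasets, 6 flagged for manual analysis.
# The grouping labels are illustrative; the IDs come from the text.
pilot_total = 19
pilot_flagged = {
    "algorithm error (fixed in Version 10)": [6, 12, 13, 15],
    "needs a closer look": [8, 11],
}
pilot_evaluated = pilot_total - sum(len(ids) for ids in pilot_flagged.values())
pilot_rate = automation_rate(pilot_evaluated, pilot_total)

# Blind set: 10 datasets, 3 flagged (ID #3, #7, and #10).
blind_total = 10
blind_evaluated = blind_total - len([3, 7, 10])
blind_rate = automation_rate(blind_evaluated, blind_total)

print(f"pilot: {pilot_evaluated}/{pilot_total} = {pilot_rate:.1f}%")
print(f"blind: {blind_evaluated}/{blind_total} = {blind_rate:.1f}%")
```

Run as written, this prints 13/19 = 68.4% for the pilot set and 7/10 = 70.0% for the blind set.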