SlideShare uma empresa Scribd logo
1 de 26
1
Adding Complex Expert KnowledgeAdding Complex Expert Knowledge
into Chemical Databases:into Chemical Databases:
Transforming Surfactants in WastewaterTransforming Surfactants in Wastewater
Emma Schymanski
Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg.
Email: emma.schymanski@uni.lu
Juliane Hollender (Eawag, Dübendorf, Switzerland)
Chris Grulke (NCCT, US EPA, Research Triangle Park, NC, USA)
Antony J. Williams (NCCT, US EPA, Research Triangle Park, NC, USA)
The views expressed in this presentation are those of the authors and do not necessarily reflect the views or policies of the U.S. Environmental Protection Agency.
2
Why Add Complex Expert Knowledge to Databases?
(l) Schymanski et al. 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374. (r) Source: N. Zamboni
Unknowns and High Resolution Mass Spectrometry
o Over 60 % of HR-MS peaks are potentially relevant but unknown
3
Why Add Complex Expert Knowledge to Databases?
Schymanski et al. 2014, ES&T, DOI: 10.1021/es4044374; M. Loos & H Singer, 2017. J. Cheminf. DOI: 10.1186/s13321-017-0197-z
Homologous Series and UVCBs in our samples ….
S OO
OH
CH3
CH3
m
n
C9H19
O
O
S
O
O
OHm
4
Why Add Complex Expert Knowledge to Databases?
Homologue screening plots from Swiss Wastewater (Schymanski et al 2014, left) and Novi Sad (right)
o Complex mixtures (UVCBs) are a huge
and very challenging part of the puzzle
5
Swiss Wastewater: Top 30 Peaks (ESI-)
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
Artificial Sweeteners
Diclofenac
Pictures: www.coca-cola-com; www.rivella.ch; www.voltargengel.com
6
Grouping Isotopes and Adducts
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
0
3000
6000
9000
12000
15000
positive
2%
27%
100%
Noise/Blank Targets Non-targets
0
3000
6000
9000
12000
15000
positivenegative
1%
30%
100%
7
Swiss Wastewater: Top 30 Peaks (ESI-)
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
S OO
O
-
O
S
O
-
O
CH2
m/z = 79.96 m/z = 183.01
Picture: www.momsteam.com
8
Surfactant Screening From Literature
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
Literature sources
o Formulas, masses (ions), retention times and intensities
o Spectra of selected compounds (different instruments)
Gonzalez et al. Rapid Comm.
Mass Spec. 2008, 22: 1445-54
Lara-Martin et al. EST. 2010, 44: 1670-1676
9
Surfactant Screening From Literature
Adding literature spectra to
www.massbank.eu/MassBank/
Enable access and comparison
Lara-Martin et al. EST. 2010, 44: 1670-1676
39 literature spectra (so far)
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
10
Supporting Evidence for Homologues
M. Loos & H Singer, 2017. J. Cheminf. DOI: 10.1186/s13321-017-0197-z & Schymanski et al. 2014, ES&T DOI: 10.1021/es4044374
S OO
OH
CH3
CH3
m
n
DATS
S OO
OH
O
OH
CH3 ()n ()m
SPAC
S OO
OH
O
OHCH3
()n
()m
STAC
http://www.envihomolog.eawag.ch/
11
Supporting Evidence for Homologues
Schymanski et al. (2014),
ES&T, 48: 1811-1818.
DOI: 10.1021/es4044374
12
Supporting Evidence for Homologues
Stravs et al. (2013), J. Mass Spectrom, 48(1):89-99. DOI: 10.1002/jms.3131
OHSO
O
CH3
O
OH
m n
SPA-9C
m+n=6
Formulas: http://sourceforge.net/projects/genform/
Meringer et al, 2011, MATCH 65, 259-290
Data: Schymanski et al. 2014, ES&T, 48:
1811-1818. DOI: 10.1021/es4044374
Chromatography and MS/MS Annotation
Literature: LIT00034,35
Sample: ETS00002
Standard: ETS00016,17,19,20
https://github.com/MassBank/RMassBank/
13
Swiss Wastewater: Top 30 Peaks (ESI-)
Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
Acesulfame
Diclofenac
Cyclamate
Saccharin
C10DATS
C10SPAC
SPA5C
C15DATS
STA6C
C9DATS
SPA2DC
S OO
OH
O
OH
CH3
S OO
OH
CH3
CH3
()n
()m
SPAC
DATS
()n ()m
14
NORMAN Suspect Exchange
o http://www.norman-network.com/?q=node/236
Schymanski, Aalizadeh et al. in prep.
15
Eawag Surfactant List
Schymanski et al 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
16
Using Generic Structures for Screening/Linking
o https://github.com/schymane/RChemMass/
17
Eawag Surfactant List (after many late nights…)
Schymanski et al 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf
18
Eawag Surfactant List in CompTox Dashboard
Schymanski, Grulke, Williams et al, in prep. & Williams et al. 2017 J. Cheminformatics 9:61 DOI: 10.1186/s13321-017-0247-6
https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf
CDK Depict
19
Cross-Linking with Lists in CompTox Dashboard
20
Structures now accessible for Computational MS
Schymanski, Grulke, Williams et al, in prep. & Williams et al. 2017 J. Cheminformatics 9:61 DOI: 10.1186/s13321-017-0247-6
https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf
21
Acknowledgements
emma.schymanski@uni.lu
Further Information:
www.massbank.eu
http://www.norman-network.com/?q=node/236
https://github.com/MassBank/RMassBank/
https://github.com/schymane/RChemMass/
https://comptox.epa.gov/dashboard/
https://wwwen.uni.lu/lcsb/
.eu
2.32.3
EU Grant
603437
22
23
European (World-)Wide Exchange of Suspects
Schymanski et al. 2015, ABC, DOI: 10.1007/s00216-015-8681-7
NORMAN Suspect List Exchange:
http://www.norman-network.com/?q=node/236
Schymanski et al. 2015, ABC,
DOI: 10.1007/s00216-015-8681-7
Tentatively Identified Spectra:
http://goo.gl/0t7jGp
Hits in GNPS MassIVE datasets:
TPs in skin: http://goo.gl/NmO4tx
Surfactants: http://goo.gl/7sY9Pf
24
2015: European Non-target Screening Trial
Schymanski et al, 2014, ES&T. DOI: 10.1021/es5002105 & Schymanski et al. 2015, ABC, DOI: 10.1007/s00216-015-8681-7
Peak
picking
Non-target HR-MS(/MS) Acquisition
Target
Screening
Suspect
Screening
Non-target
Screening
Start
Level 1 Confirmed Structure
by reference standard
Level 2 Probable Structure
by library/diagnostic evidence
Start
Level 3 Tentative Candidate(s)
suspect, substructure, class
Level 4 Unequivocal Molecular Formula
insufficient structural evidence
Start
Level 5 Mass of Interest
multiple detection, trends, …
“downgrading” with
contradictory evidence
Increasing identification
confidence
Target list Suspect list
Peak picking or XICs
25
Eawag Surfactant List in CompTox Dashboard
CDK Depict
https://www.slideshare.net/AntonyWilliams/
markush-enumeration-to-manage-mesh-and-manipulate-substances-of-unknown-or-variable-composition
26
Target, Suspect and Non-Target Screening
KNOWNS SUSPECTS No Prior Knowledge
HPLC separation and HR-MS/MS
TARGET
ANALYSIS
SUSPECT
SCREENING
NON-TARGET
SCREENING
Targets found Suspects found Masses of interest
(Molecular formula)
DATABASE
SEARCH
STRUCTURE
GENERATION
Confirmation and quantification of compounds present
Candidate selection (retention time, MS/MS, calculated properties)
Sampling extraction (SPE) HPLC separation HR-MS/MS
Time, Effort & Number of Compounds….
SUSPECTS
SPECTRUM
SEARCH
Spectral match

Mais conteúdo relacionado

Mais procurados

New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The influence of data curation on QSAR Modeling – examining issues of qualit...
 The influence of data curation on QSAR Modeling – examining issues of qualit... The influence of data curation on QSAR Modeling – examining issues of qualit...
The influence of data curation on QSAR Modeling – examining issues of qualit...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Mais procurados (20)

New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
 
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity KnowledgeDMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
The influence of data curation on QSAR Modeling – examining issues of qualit...
 The influence of data curation on QSAR Modeling – examining issues of qualit... The influence of data curation on QSAR Modeling – examining issues of qualit...
The influence of data curation on QSAR Modeling – examining issues of qualit...
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
 
How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 

Semelhante a Adding complex expert knowledge into chemical database and transforming surfactants in wastewater

Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Emma Schymanski
 
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
Emma Schymanski
 
Curating and Sharing Structures and Spectra for the Environmental Community
Curating and Sharing  Structures and Spectra for the Environmental CommunityCurating and Sharing  Structures and Spectra for the Environmental Community
Curating and Sharing Structures and Spectra for the Environmental Community
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Automated Structure Annotation and Curation for MassBank: Potential and Pitfalls
Automated Structure Annotation and Curation for MassBank: Potential and PitfallsAutomated Structure Annotation and Curation for MassBank: Potential and Pitfalls
Automated Structure Annotation and Curation for MassBank: Potential and Pitfalls
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
draft 7 luchinsky_CV
draft 7 luchinsky_CVdraft 7 luchinsky_CV
draft 7 luchinsky_CV
luchinskyd
 
Developing tools for high resolution mass spectrometry-based screening via th...
Developing tools for high resolution mass spectrometry-based screening via th...Developing tools for high resolution mass spectrometry-based screening via th...
Developing tools for high resolution mass spectrometry-based screening via th...
Andrew McEachran
 
Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6
Zhe (Henry) He
 
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Bringing it all together: A Web-based Database for Chemical and Biological Da...
Bringing it all together: A Web-based Database for Chemical and Biological Da...Bringing it all together: A Web-based Database for Chemical and Biological Da...
Bringing it all together: A Web-based Database for Chemical and Biological Da...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Semelhante a Adding complex expert knowledge into chemical database and transforming surfactants in wastewater (20)

Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
 
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
RSC Environmental Cheminformatics to Identify Unknowns Feb 2019
 
Curating and Sharing Structures and Spectra for the Environmental Community
Curating and Sharing  Structures and Spectra for the Environmental CommunityCurating and Sharing  Structures and Spectra for the Environmental Community
Curating and Sharing Structures and Spectra for the Environmental Community
 
Automated Structure Annotation and Curation for MassBank: Potential and Pitfalls
Automated Structure Annotation and Curation for MassBank: Potential and PitfallsAutomated Structure Annotation and Curation for MassBank: Potential and Pitfalls
Automated Structure Annotation and Curation for MassBank: Potential and Pitfalls
 
A Conceptual Building-Block and Practical OpenStreetMap-Interface for Sharing...
A Conceptual Building-Block and Practical OpenStreetMap-Interface for Sharing...A Conceptual Building-Block and Practical OpenStreetMap-Interface for Sharing...
A Conceptual Building-Block and Practical OpenStreetMap-Interface for Sharing...
 
RMG at the Flame Chemistry Workshop 2014
RMG at the Flame Chemistry Workshop 2014RMG at the Flame Chemistry Workshop 2014
RMG at the Flame Chemistry Workshop 2014
 
draft 7 luchinsky_CV
draft 7 luchinsky_CVdraft 7 luchinsky_CV
draft 7 luchinsky_CV
 
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine AssaysUS-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
 
129526918 old-ccs
129526918 old-ccs129526918 old-ccs
129526918 old-ccs
 
High throughput mining of the scholarly literature; talk at NIH
High throughput mining of the scholarly literature; talk at NIHHigh throughput mining of the scholarly literature; talk at NIH
High throughput mining of the scholarly literature; talk at NIH
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
QMC: Transition Workshop - Selected Highlights from the Probabilistic Numeric...
QMC: Transition Workshop - Selected Highlights from the Probabilistic Numeric...QMC: Transition Workshop - Selected Highlights from the Probabilistic Numeric...
QMC: Transition Workshop - Selected Highlights from the Probabilistic Numeric...
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
 
Developing tools for high resolution mass spectrometry-based screening via th...
Developing tools for high resolution mass spectrometry-based screening via th...Developing tools for high resolution mass spectrometry-based screening via th...
Developing tools for high resolution mass spectrometry-based screening via th...
 
Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6
 
PubChem as a resource for chemical information training
PubChem as a resource for chemical information trainingPubChem as a resource for chemical information training
PubChem as a resource for chemical information training
 
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
 
Bringing it all together: A Web-based Database for Chemical and Biological Da...
Bringing it all together: A Web-based Database for Chemical and Biological Da...Bringing it all together: A Web-based Database for Chemical and Biological Da...
Bringing it all together: A Web-based Database for Chemical and Biological Da...
 
Printout webinar r ax costanza 05 05-2020
Printout webinar r ax costanza 05 05-2020Printout webinar r ax costanza 05 05-2020
Printout webinar r ax costanza 05 05-2020
 
Non-targeted screening to improve substance identity for Chemical Substances ...
Non-targeted screening to improve substance identity for Chemical Substances ...Non-targeted screening to improve substance identity for Chemical Substances ...
Non-targeted screening to improve substance identity for Chemical Substances ...
 

Último

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 

Último (20)

Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 

Adding complex expert knowledge into chemical database and transforming surfactants in wastewater

  • 1. 1 Adding Complex Expert KnowledgeAdding Complex Expert Knowledge into Chemical Databases:into Chemical Databases: Transforming Surfactants in WastewaterTransforming Surfactants in Wastewater Emma Schymanski Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg. Email: emma.schymanski@uni.lu Juliane Hollender (Eawag, Dübendorf, Switzerland) Chris Grulke (NCCT, US EPA, Research Triangle Park, NC, USA) Antony J. Williams (NCCT, US EPA, Research Triangle Park, NC, USA) The views expressed in this presentation are those of the authors and do not necessarily reflect the views or policies of the U.S. Environmental Protection Agency.
  • 2. 2 Why Add Complex Expert Knowledge to Databases? (l) Schymanski et al. 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374. (r) Source: N. Zamboni Unknowns and High Resolution Mass Spectrometry o Over 60 % of HR-MS peaks are potentially relevant but unknown
  • 3. 3 Why Add Complex Expert Knowledge to Databases? Schymanski et al. 2014, ES&T, DOI: 10.1021/es4044374; M. Loos & H Singer, 2017. J. Cheminf. DOI: 10.1186/s13321-017-0197-z Homologous Series and UVCBs in our samples …. S OO OH CH3 CH3 m n C9H19 O O S O O OHm
  • 4. 4 Why Add Complex Expert Knowledge to Databases? Homologue screening plots from Swiss Wastewater (Schymanski et al 2014, left) and Novi Sad (right) o Complex mixtures (UVCBs) are a huge and very challenging part of the puzzle
  • 5. 5 Swiss Wastewater: Top 30 Peaks (ESI-) Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 Artificial Sweeteners Diclofenac Pictures: www.coca-cola-com; www.rivella.ch; www.voltargengel.com
  • 6. 6 Grouping Isotopes and Adducts Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 0 3000 6000 9000 12000 15000 positive 2% 27% 100% Noise/Blank Targets Non-targets 0 3000 6000 9000 12000 15000 positivenegative 1% 30% 100%
  • 7. 7 Swiss Wastewater: Top 30 Peaks (ESI-) Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 S OO O - O S O - O CH2 m/z = 79.96 m/z = 183.01 Picture: www.momsteam.com
  • 8. 8 Surfactant Screening From Literature Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 Literature sources o Formulas, masses (ions), retention times and intensities o Spectra of selected compounds (different instruments) Gonzalez et al. Rapid Comm. Mass Spec. 2008, 22: 1445-54 Lara-Martin et al. EST. 2010, 44: 1670-1676
  • 9. 9 Surfactant Screening From Literature Adding literature spectra to www.massbank.eu/MassBank/ Enable access and comparison Lara-Martin et al. EST. 2010, 44: 1670-1676 39 literature spectra (so far) Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
  • 10. 10 Supporting Evidence for Homologues M. Loos & H Singer, 2017. J. Cheminf. DOI: 10.1186/s13321-017-0197-z & Schymanski et al. 2014, ES&T DOI: 10.1021/es4044374 S OO OH CH3 CH3 m n DATS S OO OH O OH CH3 ()n ()m SPAC S OO OH O OHCH3 ()n ()m STAC http://www.envihomolog.eawag.ch/
  • 11. 11 Supporting Evidence for Homologues Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
  • 12. 12 Supporting Evidence for Homologues Stravs et al. (2013), J. Mass Spectrom, 48(1):89-99. DOI: 10.1002/jms.3131 OHSO O CH3 O OH m n SPA-9C m+n=6 Formulas: http://sourceforge.net/projects/genform/ Meringer et al, 2011, MATCH 65, 259-290 Data: Schymanski et al. 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 Chromatography and MS/MS Annotation Literature: LIT00034,35 Sample: ETS00002 Standard: ETS00016,17,19,20 https://github.com/MassBank/RMassBank/
  • 13. 13 Swiss Wastewater: Top 30 Peaks (ESI-) Schymanski et al. (2014), ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 Acesulfame Diclofenac Cyclamate Saccharin C10DATS C10SPAC SPA5C C15DATS STA6C C9DATS SPA2DC S OO OH O OH CH3 S OO OH CH3 CH3 ()n ()m SPAC DATS ()n ()m
  • 14. 14 NORMAN Suspect Exchange o http://www.norman-network.com/?q=node/236 Schymanski, Aalizadeh et al. in prep.
  • 15. 15 Eawag Surfactant List Schymanski et al 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374
  • 16. 16 Using Generic Structures for Screening/Linking o https://github.com/schymane/RChemMass/
  • 17. 17 Eawag Surfactant List (after many late nights…) Schymanski et al 2014, ES&T, 48: 1811-1818. DOI: 10.1021/es4044374 https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf
  • 18. 18 Eawag Surfactant List in CompTox Dashboard Schymanski, Grulke, Williams et al, in prep. & Williams et al. 2017 J. Cheminformatics 9:61 DOI: 10.1186/s13321-017-0247-6 https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf CDK Depict
  • 19. 19 Cross-Linking with Lists in CompTox Dashboard
  • 20. 20 Structures now accessible for Computational MS Schymanski, Grulke, Williams et al, in prep. & Williams et al. 2017 J. Cheminformatics 9:61 DOI: 10.1186/s13321-017-0247-6 https://comptox.epa.gov/dashboard/chemical_lists/eawagsurf
  • 22. 22
  • 23. 23 European (World-)Wide Exchange of Suspects Schymanski et al. 2015, ABC, DOI: 10.1007/s00216-015-8681-7 NORMAN Suspect List Exchange: http://www.norman-network.com/?q=node/236 Schymanski et al. 2015, ABC, DOI: 10.1007/s00216-015-8681-7 Tentatively Identified Spectra: http://goo.gl/0t7jGp Hits in GNPS MassIVE datasets: TPs in skin: http://goo.gl/NmO4tx Surfactants: http://goo.gl/7sY9Pf
  • 24. 24 2015: European Non-target Screening Trial Schymanski et al, 2014, ES&T. DOI: 10.1021/es5002105 & Schymanski et al. 2015, ABC, DOI: 10.1007/s00216-015-8681-7 Peak picking Non-target HR-MS(/MS) Acquisition Target Screening Suspect Screening Non-target Screening Start Level 1 Confirmed Structure by reference standard Level 2 Probable Structure by library/diagnostic evidence Start Level 3 Tentative Candidate(s) suspect, substructure, class Level 4 Unequivocal Molecular Formula insufficient structural evidence Start Level 5 Mass of Interest multiple detection, trends, … “downgrading” with contradictory evidence Increasing identification confidence Target list Suspect list Peak picking or XICs
  • 25. 25 Eawag Surfactant List in CompTox Dashboard CDK Depict https://www.slideshare.net/AntonyWilliams/ markush-enumeration-to-manage-mesh-and-manipulate-substances-of-unknown-or-variable-composition
  • 26. 26 Target, Suspect and Non-Target Screening KNOWNS SUSPECTS No Prior Knowledge HPLC separation and HR-MS/MS TARGET ANALYSIS SUSPECT SCREENING NON-TARGET SCREENING Targets found Suspects found Masses of interest (Molecular formula) DATABASE SEARCH STRUCTURE GENERATION Confirmation and quantification of compounds present Candidate selection (retention time, MS/MS, calculated properties) Sampling extraction (SPE) HPLC separation HR-MS/MS Time, Effort & Number of Compounds…. SUSPECTS SPECTRUM SEARCH Spectral match