SlideShare uma empresa Scribd logo
1 de 38
Baixar para ler offline
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Chemistry and reactions from non-
US patents
Daniel Lowe and Roger Sayle
NextMove Software
Cambridge, UK
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Topics
• Coverage of USPTO vs EPO patents
• Extraction of non-chemical-compound data
• Can text-mining provide insights into reaction
informatics
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
What am I missing by only using the
USPTO data?
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
EPO and USPTO background
• USPTO: 303k grants published in 2013
• EPO: 67k grants published in 2013
• EPO patents must be in one or more of
English, French or German
• USPTO documents are all in English
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Epo/USPTO Compounds
(using StdInChI)
3,345,877
2,674,069
445,588
USPTO = 1976-2013 patent grants + 2001-2013 patent applications
EPO = 1978-2013 patent grants and applications
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Epo/USPTO Reactions
(using separately sorted StdInChIs for reactants/agents and products)
706,524
317,533
99,681
USPTO = 1976-2013 patent grants + 2001-2013 patent applications
EPO = 1978-2013 patent grants and applications
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Effect of different Languages
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
Numberofstructuresextracted
English
German
French
Translation performed by Lexichem v2014.Jun
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
First Disclosure Occurrence by
Language
Excluding compounds
disclosed by USPTO (1976-
2013) patents
All compounds
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Timeliness
• Competitive intelligence requires up to date
information
• Methodology:
– Consider compounds present in both EPO and
USPTO date not disclosed by either prior to 2006
– Compare the earliest publication date for each
compound
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
0
20000
40000
60000
80000
100000
120000
140000
160000
180000
US more
than 5
years
earlier
US 3-5
years
earlier
US 2-3
years
earlier
US 1-2
years
earlier
US 6-12
months
earlier
US 3-6
months
earlier
US 1-3
months
earlier
US 1-4
weeks
earlier
Within a
week
US 1-4
weeks
later
US 1-3
months
later
US 3-6
months
later
US 6-12
months
later
US 1-2
years
later
US 2-3
years
later
US 3-5
years
later
US more
than 5
years
later
Compounddisclosures
Average lag between EPO/USPTO
Compounds first disclosed between 2006-2013
Mean: US 540 days earlier
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
0
10000
20000
30000
40000
50000
60000
70000
US more
than 5
years
earlier
US 3-5
years
earlier
US 2-3
years
earlier
US 1-2
years
earlier
US 6-12
months
earlier
US 3-6
months
earlier
US 1-3
months
earlier
US 1-4
weeks
earlier
Within a
week
US 1-4
weeks
later
US 1-3
months
later
US 3-6
months
later
US 6-12
months
later
US 1-2
years
later
US 2-3
years
later
US 3-5
years
later
US more
than 5
years
later
Compounddisclosures
Applications Vs Applications
Mean: US 223 days earlier
Compounds first disclosed between 2006-2013
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
0
20000
40000
60000
80000
100000
120000
140000
US more
than 5
years
earlier
US 3-5
years
earlier
US 2-3
years
earlier
US 1-2
years
earlier
US 6-12
months
earlier
US 3-6
months
earlier
US 1-3
months
earlier
US 1-4
weeks
earlier
Within a
week
US 1-4
weeks
later
US 1-3
months
later
US 3-6
months
later
US 6-12
months
later
US 1-2
years
later
US 2-3
years
later
US 3-5
years
later
US more
than 5
years
later
Compounddisclosures
Grants vs Grants
Compounds first disclosed between 2006-2013
Mean: US 287 days earlier
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Epo/US 1976-2013 and chembl19
overlap
187,486
1,155,034
6,278,048
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
0
1000
2000
3000
4000
5000
6000
7000
8000
9000
10000
Patents 4-5
years earlier
Patents 3-4
years earlier
Patents 2-3
years earlier
Patents 1-2
years earlier
Patents 0-1
years earlier
Patents 0-1
years later
Patents 1-2
years later
Patents 2-3
years later
Patents 3-5
years later
Patents 4-5
years later
Average lag between
Patents/CHEMbl19
Compounds first disclosed between 2006-2013
Mean: Patents 345 days
earlier
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Other data in patents
• Genes/proteins
• Reactions including role of reagents
• Physical quantities
• Spectra
• Diseases
• Organisms
• Companies
• …
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Gene/protein identification
• Identify trends in drug target popularity
• Count number of patents (in a year)
mentioning a given gene or its gene products
and map to the HGNC symbol
• Many genes are referenced by short
ambiguous terms hence great care is required
to have good precision
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Gene/protein identification
(Hepatocyte growth factor receptor)(Epidermal growth factor receptor)
0.00%
0.10%
0.20%
0.30%
0.40%
0.50%
0.60%
0.70%
0.80%
0.90%
0.00%
0.05%
0.10%
0.15%
0.20%
0.25%
0.30%
0.35%
1976197819801982198419861988199019921994199619982000200220042006200820102012
Percentageofprotein/genementionsinthatyear(ChEMBL)
Percentageofprotein/genementionsinthatyear(Patents)
EGFR
0.00%
0.10%
0.20%
0.30%
0.40%
0.50%
0.60%
0.70%
0.00%
0.02%
0.04%
0.06%
0.08%
0.10%
0.12%
0.14%
1976197819801982198419861988199019921994199619982000200220042006200820102012
Percentageofprotein/genementionsinthatyear(ChEMBL)
Percentageofprotein/genementionsinthatyear(Patents)
MET
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Gene/protein identification
(Mammalian target of rapamycin) (Tissue plasminogen activator)
0.00%
0.05%
0.10%
0.15%
0.20%
0.25%
0.30%
0.35%
0.40%
0.45%
0.00%
0.02%
0.04%
0.06%
0.08%
0.10%
0.12%
0.14%
1976197819801982198419861988199019921994199619982000200220042006200820102012
Percentageofprotein/genementionsinthatyear(ChEMBL)
Percentageofprotein/genementionsinthatyear(Patents)
MTOR
0.00%
0.05%
0.10%
0.15%
0.20%
0.25%
0.30%
0.35%
0.40%
0.00%
0.05%
0.10%
0.15%
0.20%
0.25%
0.30%
0.35%
0.40%
0.45%
0.50%
1976 1978 1980 1982 1984 1986 19881990 1992 1994 1996 1998 2000 2002 2004 20062008 2010 2012
Percentageofprotein/genementionsinthatyear(ChEMBL)
Percentageofprotein/genementionsinthatyear(Patents)
PLAT
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Melting/Boiling Point Extraction
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Melting/Boiling Point Extraction
Over 99k so far!
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
CHEMICAL reactions
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Predicting yield
• Features to consider:
• 2D fingerprints (especially around the reaction
centers)
• Reaction type
• Temperature
• Time
• Change in complexity e.g. chiral centres
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Data extraction
• Can pull out yields, quantities, times and
temperatures
• Can sanity check text-mined yield by
calculating from amounts (or even masses)
𝑚𝑎𝑠𝑠 𝑜𝑓 𝑐𝑜𝑚𝑝𝑜𝑢𝑛𝑑(𝑔)
𝑚𝑜𝑙𝑎𝑟 𝑚𝑎𝑠𝑠 [𝑐𝑎𝑙𝑐 𝑓𝑟𝑜𝑚 𝑠𝑡𝑟𝑢𝑐𝑡𝑢𝑟𝑒](𝑔𝑚𝑜𝑙−1)
= 𝑚𝑜𝑙𝑠 𝑜𝑓 𝑐𝑜𝑚𝑝𝑜𝑢𝑛𝑑
𝑚𝑜𝑙𝑠 𝑜𝑓 𝑝𝑟𝑜𝑑𝑢𝑐𝑡
𝑚𝑜𝑙𝑠 𝑜𝑓 𝑙𝑖𝑚𝑖𝑡𝑖𝑛𝑔 𝑟𝑒𝑎𝑐𝑡𝑎𝑛𝑡
= %𝑦𝑖𝑒𝑙𝑑
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Outliers inevitable
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
sCale vs yield
2001-2013 US applications, Suzuki couplings
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Temperatures
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Identify Synthetic Routes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Intermediates 197702103114 56611 31403 17268 9230 5057 2701 1256 639 301 136 58 15 5 2
Terminal Products 385149149445 81837 47579 27670 16619 9320 5263 2511 1330 678 373 111 63 8 6 5
0
100000
200000
300000
400000
500000
600000
700000
Occurrences
Number of steps
Intermediates
Terminal Products
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Number of steps vs Complexity
ringComplexityScore = log10(nRingBridgeAtoms + 1) + log10(nSpiroAtoms + 1)
stereoComplexityScore = log10(nStereoCenters + 1)
macrocyclePenalty = log10(nMacrocycles + 1)
sizePenalty = natoms1.005 − natoms
∑ Ertl & Schuffenhauer 2009
doi:10.1186/1758-2946-1-8
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Number of steps vs Complexity
0
5
10
15
20
25
30
35
40
45
50
1 2 3 4 5 6 7 8 9 10 11
Numberofproductheavyatoms
Number of steps
Average number of heavy atoms
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Yield vs Change in Complexity
2001-2013 US applications, all reactions
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Trends in Reaction Types
0.0%
1.0%
2.0%
3.0%
4.0%
5.0%
6.0%
7.0%
8.0%
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
Suzukicouplingsasapercentageofreactionsinayear
Classification performed using NameRXN
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Trends In Solvent Use
0.0%
5.0%
10.0%
15.0%
20.0%
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
Percentageofreactionsinthatyear
Tetrahydrofuran
Dichloromethane
Water
Dimethylformamide
Methanol
Ethyl acetate
Ethanol
1,4-Dioxane
Toluene
Acetonitrile
Acetic acid
Chloroform
Acetone
Benzene
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Are solvents getting greener?
1976 2013
Water (21%) Tetrahydrofuran (15%)
Ethanol (11%) Dichloromethane (14%)
Benzene (8%) Water (13%)
Methanol (7%) Dimethylformamide (10%)
Tetrahydrofuran (5%) Methanol (8%)
Dichloromethane (4%) Ethyl acetate (7%)
Dimethylformamide (4%) Ethanol (5%)
Acetic acid (4%) 1,4-Dioxane (4%)
Chloroform (3%) Toluene (3%)
Acetone (3%) Acetonitrile (3%)
Total for top 10: 71% 82%
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Conclusions
• A significant amount of novel chemistry, from EPO patents, comes
from the non-English patents
• Compounds disclosed by both the USPTO and EPO are on average
published earlier by the USPTO but for many compounds an EPO
patent will be the earlier disclosure
• Gene/protein identification can identify clear changes in patenting
behaviour over time
• Text mining provides the tools to answer many reaction
informatics questions
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Acknowledgements
• Funding provided by:
248th ACS National Meeting, San Francisco CA, USA 10th August 2014
Thank you for your time!
http://nextmovesoftware.com
http://nextmovesoftware.com/blog
daniel@nextmovesoftware.com

Mais conteúdo relacionado

Mais procurados

In grammars we trust: LeadMine, a knowledge driven solution
In grammars we trust: LeadMine, a knowledge driven solutionIn grammars we trust: LeadMine, a knowledge driven solution
In grammars we trust: LeadMine, a knowledge driven solutionNextMove Software
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesNextMove Software
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...NextMove Software
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...NextMove Software
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...NextMove Software
 
Chemical structure representation in PubChem
Chemical structure representation in PubChemChemical structure representation in PubChem
Chemical structure representation in PubChemNextMove Software
 
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...NextMove Software
 
ICIC 2016: New Product Introduction CAS
ICIC 2016: New Product Introduction CASICIC 2016: New Product Introduction CAS
ICIC 2016: New Product Introduction CASDr. Haxel Consult
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...NextMove Software
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical RepresentationsNextMove Software
 
ICIC 2016: Mind the Gap: The novel benefits of human-curated substance locat...
ICIC 2016: Mind the Gap:  The novel benefits of human-curated substance locat...ICIC 2016: Mind the Gap:  The novel benefits of human-curated substance locat...
ICIC 2016: Mind the Gap: The novel benefits of human-curated substance locat...Dr. Haxel Consult
 
Acs 2013 indianapolis_cvsp
Acs 2013 indianapolis_cvspAcs 2013 indianapolis_cvsp
Acs 2013 indianapolis_cvspKen Karapetyan
 
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...Standardization and Generation of Parents for Open PHACTS Chemical Registry S...
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...Ken Karapetyan
 
Chemical Text Mining for Current Awareness of Pharmaceutical Patents
Chemical Text Mining for Current Awareness of Pharmaceutical PatentsChemical Text Mining for Current Awareness of Pharmaceutical Patents
Chemical Text Mining for Current Awareness of Pharmaceutical Patentsdan2097
 
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open DataGraph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open DataMaulik Kamdar
 
2020 scifinder-n manual (2020) english
2020 scifinder-n manual (2020) english2020 scifinder-n manual (2020) english
2020 scifinder-n manual (2020) englishPOSTECH Library
 

Mais procurados (18)

In grammars we trust: LeadMine, a knowledge driven solution
In grammars we trust: LeadMine, a knowledge driven solutionIn grammars we trust: LeadMine, a knowledge driven solution
In grammars we trust: LeadMine, a knowledge driven solution
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...
 
Chemical structure representation in PubChem
Chemical structure representation in PubChemChemical structure representation in PubChem
Chemical structure representation in PubChem
 
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
 
ICIC 2016: New Product Introduction CAS
ICIC 2016: New Product Introduction CASICIC 2016: New Product Introduction CAS
ICIC 2016: New Product Introduction CAS
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical Representations
 
ICIC 2016: Mind the Gap: The novel benefits of human-curated substance locat...
ICIC 2016: Mind the Gap:  The novel benefits of human-curated substance locat...ICIC 2016: Mind the Gap:  The novel benefits of human-curated substance locat...
ICIC 2016: Mind the Gap: The novel benefits of human-curated substance locat...
 
Acs 2013 indianapolis_cvsp
Acs 2013 indianapolis_cvspAcs 2013 indianapolis_cvsp
Acs 2013 indianapolis_cvsp
 
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...Standardization and Generation of Parents for Open PHACTS Chemical Registry S...
Standardization and Generation of Parents for Open PHACTS Chemical Registry S...
 
Data model
Data modelData model
Data model
 
Chemical Text Mining for Current Awareness of Pharmaceutical Patents
Chemical Text Mining for Current Awareness of Pharmaceutical PatentsChemical Text Mining for Current Awareness of Pharmaceutical Patents
Chemical Text Mining for Current Awareness of Pharmaceutical Patents
 
Building support for the semantic web for chemistry at the Royal Society of C...
Building support for the semantic web for chemistry at the Royal Society of C...Building support for the semantic web for chemistry at the Royal Society of C...
Building support for the semantic web for chemistry at the Royal Society of C...
 
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open DataGraph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
 
2020 scifinder-n manual (2020) english
2020 scifinder-n manual (2020) english2020 scifinder-n manual (2020) english
2020 scifinder-n manual (2020) english
 

Destaque

Representation and display of non-standard peptides using semi-systematic ami...
Representation and display of non-standard peptides using semi-systematic ami...Representation and display of non-standard peptides using semi-systematic ami...
Representation and display of non-standard peptides using semi-systematic ami...NextMove Software
 
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...NextMove Software
 
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...NextMove Software
 
Peptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filingsPeptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filingsNextMove Software
 
Using Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextUsing Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextNextMove Software
 
Evidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesEvidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesNextMove Software
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision TreeNextMove Software
 

Destaque (8)

Representation and display of non-standard peptides using semi-systematic ami...
Representation and display of non-standard peptides using semi-systematic ami...Representation and display of non-standard peptides using semi-systematic ami...
Representation and display of non-standard peptides using semi-systematic ami...
 
InChI for Large Molecules
InChI for Large MoleculesInChI for Large Molecules
InChI for Large Molecules
 
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...Peptide Informatics - Bridging the gap between small-molecule and large-molec...
Peptide Informatics - Bridging the gap between small-molecule and large-molec...
 
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
Using Matched Molecular Series as a Predictive Tool To Optimize Biological Ac...
 
Peptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filingsPeptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filings
 
Using Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make nextUsing Matched Series to decide what compound to make next
Using Matched Series to decide what compound to make next
 
Evidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular seriesEvidence-based medicinal chemistry using matched molecular series
Evidence-based medicinal chemistry using matched molecular series
 
Revising the Topliss Decision Tree
Revising the Topliss Decision TreeRevising the Topliss Decision Tree
Revising the Topliss Decision Tree
 

Semelhante a Chemistry and reactions from non-US patents

New Product Introductions - CAS
New Product Introductions - CASNew Product Introductions - CAS
New Product Introductions - CASDr. Haxel Consult
 
ISO 14001 Tip #6 - How to Meet Compliance Obligations?
ISO 14001 Tip #6 - How to Meet Compliance Obligations?ISO 14001 Tip #6 - How to Meet Compliance Obligations?
ISO 14001 Tip #6 - How to Meet Compliance Obligations?Aury Hathout
 
Introduction and CrossRef Overview 2010 Annual Meeting
Introduction and CrossRef Overview 2010 Annual MeetingIntroduction and CrossRef Overview 2010 Annual Meeting
Introduction and CrossRef Overview 2010 Annual MeetingCrossref
 
2014 voter attitudes about energy
2014 voter attitudes about energy2014 voter attitudes about energy
2014 voter attitudes about energycleanwi
 
ICIC 2014 New Product Introduction CAS
ICIC 2014 New Product Introduction CASICIC 2014 New Product Introduction CAS
ICIC 2014 New Product Introduction CASDr. Haxel Consult
 
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...UKSG: connecting the knowledge community
 
Env 320 Technology levels--snaptutorial.com
Env 320 Technology levels--snaptutorial.comEnv 320 Technology levels--snaptutorial.com
Env 320 Technology levels--snaptutorial.comsholingarjosh125
 
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08rahsco3
 
Organic System Plans: Field and Row Crops and Pasture and Range Systems
Organic System Plans: Field and Row Crops and Pasture and Range SystemsOrganic System Plans: Field and Row Crops and Pasture and Range Systems
Organic System Plans: Field and Row Crops and Pasture and Range SystemsGardening
 
5658 2018 10-22-13-55-26
5658 2018 10-22-13-55-265658 2018 10-22-13-55-26
5658 2018 10-22-13-55-26hesam ahmadian
 
Who are cams users today by Popp
Who are cams users today by PoppWho are cams users today by Popp
Who are cams users today by PoppCopernicus ECMWF
 
Patents 101: How to Do a Patent Search
Patents 101: How to Do a Patent SearchPatents 101: How to Do a Patent Search
Patents 101: How to Do a Patent SearchKristina Gomez
 
Multi-Industry Letter to Congress on EPA Ozone Standards
Multi-Industry Letter to Congress on EPA Ozone StandardsMulti-Industry Letter to Congress on EPA Ozone Standards
Multi-Industry Letter to Congress on EPA Ozone Standardsartba
 
Legislative and Regulatory Update
Legislative and Regulatory UpdateLegislative and Regulatory Update
Legislative and Regulatory UpdateNational Pork Board
 
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdf
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdfAWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdf
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdfLuisDaniel522665
 
Doer biomass emissions & safety regulations
Doer biomass emissions & safety regulationsDoer biomass emissions & safety regulations
Doer biomass emissions & safety regulationsCarlos Mendes
 
Toolkit for Life Cycle Management in the Australian building and construction...
Toolkit for Life Cycle Management in the Australian building and construction...Toolkit for Life Cycle Management in the Australian building and construction...
Toolkit for Life Cycle Management in the Australian building and construction...LCANZ
 

Semelhante a Chemistry and reactions from non-US patents (20)

New Product Introductions - CAS
New Product Introductions - CASNew Product Introductions - CAS
New Product Introductions - CAS
 
ISO 14001 Tip #6 - How to Meet Compliance Obligations?
ISO 14001 Tip #6 - How to Meet Compliance Obligations?ISO 14001 Tip #6 - How to Meet Compliance Obligations?
ISO 14001 Tip #6 - How to Meet Compliance Obligations?
 
Introduction and CrossRef Overview 2010 Annual Meeting
Introduction and CrossRef Overview 2010 Annual MeetingIntroduction and CrossRef Overview 2010 Annual Meeting
Introduction and CrossRef Overview 2010 Annual Meeting
 
AWWA G481-14
AWWA G481-14AWWA G481-14
AWWA G481-14
 
2014 voter attitudes about energy
2014 voter attitudes about energy2014 voter attitudes about energy
2014 voter attitudes about energy
 
ICIC 2014 New Product Introduction CAS
ICIC 2014 New Product Introduction CASICIC 2014 New Product Introduction CAS
ICIC 2014 New Product Introduction CAS
 
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...
UKSG webinar: COUNTER for Librarians with Anna Franca, King's College London ...
 
Env 320 Technology levels--snaptutorial.com
Env 320 Technology levels--snaptutorial.comEnv 320 Technology levels--snaptutorial.com
Env 320 Technology levels--snaptutorial.com
 
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08
EPA Seminar, Cosmetic Cleaning Ordinances, 7-15-08
 
Organic System Plans: Field and Row Crops and Pasture and Range Systems
Organic System Plans: Field and Row Crops and Pasture and Range SystemsOrganic System Plans: Field and Row Crops and Pasture and Range Systems
Organic System Plans: Field and Row Crops and Pasture and Range Systems
 
5658 2018 10-22-13-55-26
5658 2018 10-22-13-55-265658 2018 10-22-13-55-26
5658 2018 10-22-13-55-26
 
Who are cams users today by Popp
Who are cams users today by PoppWho are cams users today by Popp
Who are cams users today by Popp
 
Patents 101: How to Do a Patent Search
Patents 101: How to Do a Patent SearchPatents 101: How to Do a Patent Search
Patents 101: How to Do a Patent Search
 
Multi-Industry Letter to Congress on EPA Ozone Standards
Multi-Industry Letter to Congress on EPA Ozone StandardsMulti-Industry Letter to Congress on EPA Ozone Standards
Multi-Industry Letter to Congress on EPA Ozone Standards
 
Legislative and Regulatory Update
Legislative and Regulatory UpdateLegislative and Regulatory Update
Legislative and Regulatory Update
 
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdf
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdfAWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdf
AWWA-E103-Horizontal and Vertical Line-Shaft Pumps.pdf
 
Doer biomass emissions & safety regulations
Doer biomass emissions & safety regulationsDoer biomass emissions & safety regulations
Doer biomass emissions & safety regulations
 
Kk m5re9v2e3
Kk m5re9v2e3Kk m5re9v2e3
Kk m5re9v2e3
 
Toolkit for Life Cycle Management in the Australian building and construction...
Toolkit for Life Cycle Management in the Australian building and construction...Toolkit for Life Cycle Management in the Australian building and construction...
Toolkit for Life Cycle Management in the Australian building and construction...
 
Api
ApiApi
Api
 

Mais de NextMove Software

Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...NextMove Software
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedNextMove Software
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESNextMove Software
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionNextMove Software
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...NextMove Software
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsNextMove Software
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...NextMove Software
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKitNextMove Software
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics DatabaseNextMove Software
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesNextMove Software
 
Challenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeChallenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeNextMove Software
 
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]NextMove Software
 
RDKit UGM 2016: Higher Quality Chemical Depictions
RDKit UGM 2016: Higher Quality Chemical DepictionsRDKit UGM 2016: Higher Quality Chemical Depictions
RDKit UGM 2016: Higher Quality Chemical DepictionsNextMove Software
 
GHS and NFPA diamonds: where they come from and how they can be useful
GHS and NFPA diamonds: where they come from and how they can be usefulGHS and NFPA diamonds: where they come from and how they can be useful
GHS and NFPA diamonds: where they come from and how they can be usefulNextMove Software
 
Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)NextMove Software
 
Which is the best fingerprint for medicinal chemistry?
Which is the best fingerprint for medicinal chemistry?Which is the best fingerprint for medicinal chemistry?
Which is the best fingerprint for medicinal chemistry?NextMove Software
 
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...CINF 29: Visualization and manipulation of Matched Molecular Series for decis...
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...NextMove Software
 

Mais de NextMove Software (18)

DeepSMILES
DeepSMILESDeepSMILES
DeepSMILES
 
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule Implementations
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKit
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics Database
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
 
Challenges in Chemical Information Exchange
Challenges in Chemical Information ExchangeChallenges in Chemical Information Exchange
Challenges in Chemical Information Exchange
 
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]
RDKit: Six Not-So-Easy Pieces [RDKit UGM 2016]
 
RDKit UGM 2016: Higher Quality Chemical Depictions
RDKit UGM 2016: Higher Quality Chemical DepictionsRDKit UGM 2016: Higher Quality Chemical Depictions
RDKit UGM 2016: Higher Quality Chemical Depictions
 
GHS and NFPA diamonds: where they come from and how they can be useful
GHS and NFPA diamonds: where they come from and how they can be usefulGHS and NFPA diamonds: where they come from and how they can be useful
GHS and NFPA diamonds: where they come from and how they can be useful
 
Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)Line notations for nucleic acids (both natural and therapeutic)
Line notations for nucleic acids (both natural and therapeutic)
 
Which is the best fingerprint for medicinal chemistry?
Which is the best fingerprint for medicinal chemistry?Which is the best fingerprint for medicinal chemistry?
Which is the best fingerprint for medicinal chemistry?
 
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...CINF 29: Visualization and manipulation of Matched Molecular Series for decis...
CINF 29: Visualization and manipulation of Matched Molecular Series for decis...
 

Último

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Chemistry and reactions from non-US patents

  • 1. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Chemistry and reactions from non- US patents Daniel Lowe and Roger Sayle NextMove Software Cambridge, UK
  • 2. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Topics • Coverage of USPTO vs EPO patents • Extraction of non-chemical-compound data • Can text-mining provide insights into reaction informatics
  • 3. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 What am I missing by only using the USPTO data?
  • 4. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 EPO and USPTO background • USPTO: 303k grants published in 2013 • EPO: 67k grants published in 2013 • EPO patents must be in one or more of English, French or German • USPTO documents are all in English
  • 5. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Epo/USPTO Compounds (using StdInChI) 3,345,877 2,674,069 445,588 USPTO = 1976-2013 patent grants + 2001-2013 patent applications EPO = 1978-2013 patent grants and applications
  • 6. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Epo/USPTO Reactions (using separately sorted StdInChIs for reactants/agents and products) 706,524 317,533 99,681 USPTO = 1976-2013 patent grants + 2001-2013 patent applications EPO = 1978-2013 patent grants and applications
  • 7. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Effect of different Languages 0 500000 1000000 1500000 2000000 2500000 3000000 3500000 4000000 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 Numberofstructuresextracted English German French Translation performed by Lexichem v2014.Jun
  • 8. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 First Disclosure Occurrence by Language Excluding compounds disclosed by USPTO (1976- 2013) patents All compounds
  • 9. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Timeliness • Competitive intelligence requires up to date information • Methodology: – Consider compounds present in both EPO and USPTO date not disclosed by either prior to 2006 – Compare the earliest publication date for each compound
  • 10. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 0 20000 40000 60000 80000 100000 120000 140000 160000 180000 US more than 5 years earlier US 3-5 years earlier US 2-3 years earlier US 1-2 years earlier US 6-12 months earlier US 3-6 months earlier US 1-3 months earlier US 1-4 weeks earlier Within a week US 1-4 weeks later US 1-3 months later US 3-6 months later US 6-12 months later US 1-2 years later US 2-3 years later US 3-5 years later US more than 5 years later Compounddisclosures Average lag between EPO/USPTO Compounds first disclosed between 2006-2013 Mean: US 540 days earlier
  • 11. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 0 10000 20000 30000 40000 50000 60000 70000 US more than 5 years earlier US 3-5 years earlier US 2-3 years earlier US 1-2 years earlier US 6-12 months earlier US 3-6 months earlier US 1-3 months earlier US 1-4 weeks earlier Within a week US 1-4 weeks later US 1-3 months later US 3-6 months later US 6-12 months later US 1-2 years later US 2-3 years later US 3-5 years later US more than 5 years later Compounddisclosures Applications Vs Applications Mean: US 223 days earlier Compounds first disclosed between 2006-2013
  • 12. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 0 20000 40000 60000 80000 100000 120000 140000 US more than 5 years earlier US 3-5 years earlier US 2-3 years earlier US 1-2 years earlier US 6-12 months earlier US 3-6 months earlier US 1-3 months earlier US 1-4 weeks earlier Within a week US 1-4 weeks later US 1-3 months later US 3-6 months later US 6-12 months later US 1-2 years later US 2-3 years later US 3-5 years later US more than 5 years later Compounddisclosures Grants vs Grants Compounds first disclosed between 2006-2013 Mean: US 287 days earlier
  • 13. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014
  • 14. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Epo/US 1976-2013 and chembl19 overlap 187,486 1,155,034 6,278,048
  • 15. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 Patents 4-5 years earlier Patents 3-4 years earlier Patents 2-3 years earlier Patents 1-2 years earlier Patents 0-1 years earlier Patents 0-1 years later Patents 1-2 years later Patents 2-3 years later Patents 3-5 years later Patents 4-5 years later Average lag between Patents/CHEMbl19 Compounds first disclosed between 2006-2013 Mean: Patents 345 days earlier
  • 16. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Other data in patents • Genes/proteins • Reactions including role of reagents • Physical quantities • Spectra • Diseases • Organisms • Companies • …
  • 17. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Gene/protein identification • Identify trends in drug target popularity • Count number of patents (in a year) mentioning a given gene or its gene products and map to the HGNC symbol • Many genes are referenced by short ambiguous terms hence great care is required to have good precision
  • 18. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Gene/protein identification (Hepatocyte growth factor receptor)(Epidermal growth factor receptor) 0.00% 0.10% 0.20% 0.30% 0.40% 0.50% 0.60% 0.70% 0.80% 0.90% 0.00% 0.05% 0.10% 0.15% 0.20% 0.25% 0.30% 0.35% 1976197819801982198419861988199019921994199619982000200220042006200820102012 Percentageofprotein/genementionsinthatyear(ChEMBL) Percentageofprotein/genementionsinthatyear(Patents) EGFR 0.00% 0.10% 0.20% 0.30% 0.40% 0.50% 0.60% 0.70% 0.00% 0.02% 0.04% 0.06% 0.08% 0.10% 0.12% 0.14% 1976197819801982198419861988199019921994199619982000200220042006200820102012 Percentageofprotein/genementionsinthatyear(ChEMBL) Percentageofprotein/genementionsinthatyear(Patents) MET
  • 19. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Gene/protein identification (Mammalian target of rapamycin) (Tissue plasminogen activator) 0.00% 0.05% 0.10% 0.15% 0.20% 0.25% 0.30% 0.35% 0.40% 0.45% 0.00% 0.02% 0.04% 0.06% 0.08% 0.10% 0.12% 0.14% 1976197819801982198419861988199019921994199619982000200220042006200820102012 Percentageofprotein/genementionsinthatyear(ChEMBL) Percentageofprotein/genementionsinthatyear(Patents) MTOR 0.00% 0.05% 0.10% 0.15% 0.20% 0.25% 0.30% 0.35% 0.40% 0.00% 0.05% 0.10% 0.15% 0.20% 0.25% 0.30% 0.35% 0.40% 0.45% 0.50% 1976 1978 1980 1982 1984 1986 19881990 1992 1994 1996 1998 2000 2002 2004 20062008 2010 2012 Percentageofprotein/genementionsinthatyear(ChEMBL) Percentageofprotein/genementionsinthatyear(Patents) PLAT
  • 20. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014
  • 21. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Melting/Boiling Point Extraction
  • 22. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Melting/Boiling Point Extraction Over 99k so far!
  • 23. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 CHEMICAL reactions
  • 24. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Predicting yield • Features to consider: • 2D fingerprints (especially around the reaction centers) • Reaction type • Temperature • Time • Change in complexity e.g. chiral centres
  • 25. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Data extraction • Can pull out yields, quantities, times and temperatures • Can sanity check text-mined yield by calculating from amounts (or even masses) 𝑚𝑎𝑠𝑠 𝑜𝑓 𝑐𝑜𝑚𝑝𝑜𝑢𝑛𝑑(𝑔) 𝑚𝑜𝑙𝑎𝑟 𝑚𝑎𝑠𝑠 [𝑐𝑎𝑙𝑐 𝑓𝑟𝑜𝑚 𝑠𝑡𝑟𝑢𝑐𝑡𝑢𝑟𝑒](𝑔𝑚𝑜𝑙−1) = 𝑚𝑜𝑙𝑠 𝑜𝑓 𝑐𝑜𝑚𝑝𝑜𝑢𝑛𝑑 𝑚𝑜𝑙𝑠 𝑜𝑓 𝑝𝑟𝑜𝑑𝑢𝑐𝑡 𝑚𝑜𝑙𝑠 𝑜𝑓 𝑙𝑖𝑚𝑖𝑡𝑖𝑛𝑔 𝑟𝑒𝑎𝑐𝑡𝑎𝑛𝑡 = %𝑦𝑖𝑒𝑙𝑑
  • 26. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Outliers inevitable
  • 27. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 sCale vs yield 2001-2013 US applications, Suzuki couplings
  • 28. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Temperatures
  • 29. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Identify Synthetic Routes 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 Intermediates 197702103114 56611 31403 17268 9230 5057 2701 1256 639 301 136 58 15 5 2 Terminal Products 385149149445 81837 47579 27670 16619 9320 5263 2511 1330 678 373 111 63 8 6 5 0 100000 200000 300000 400000 500000 600000 700000 Occurrences Number of steps Intermediates Terminal Products
  • 30. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Number of steps vs Complexity ringComplexityScore = log10(nRingBridgeAtoms + 1) + log10(nSpiroAtoms + 1) stereoComplexityScore = log10(nStereoCenters + 1) macrocyclePenalty = log10(nMacrocycles + 1) sizePenalty = natoms1.005 − natoms ∑ Ertl & Schuffenhauer 2009 doi:10.1186/1758-2946-1-8
  • 31. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Number of steps vs Complexity 0 5 10 15 20 25 30 35 40 45 50 1 2 3 4 5 6 7 8 9 10 11 Numberofproductheavyatoms Number of steps Average number of heavy atoms
  • 32. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Yield vs Change in Complexity 2001-2013 US applications, all reactions
  • 33. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Trends in Reaction Types 0.0% 1.0% 2.0% 3.0% 4.0% 5.0% 6.0% 7.0% 8.0% 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 Suzukicouplingsasapercentageofreactionsinayear Classification performed using NameRXN
  • 34. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Trends In Solvent Use 0.0% 5.0% 10.0% 15.0% 20.0% 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 Percentageofreactionsinthatyear Tetrahydrofuran Dichloromethane Water Dimethylformamide Methanol Ethyl acetate Ethanol 1,4-Dioxane Toluene Acetonitrile Acetic acid Chloroform Acetone Benzene
  • 35. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Are solvents getting greener? 1976 2013 Water (21%) Tetrahydrofuran (15%) Ethanol (11%) Dichloromethane (14%) Benzene (8%) Water (13%) Methanol (7%) Dimethylformamide (10%) Tetrahydrofuran (5%) Methanol (8%) Dichloromethane (4%) Ethyl acetate (7%) Dimethylformamide (4%) Ethanol (5%) Acetic acid (4%) 1,4-Dioxane (4%) Chloroform (3%) Toluene (3%) Acetone (3%) Acetonitrile (3%) Total for top 10: 71% 82%
  • 36. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Conclusions • A significant amount of novel chemistry, from EPO patents, comes from the non-English patents • Compounds disclosed by both the USPTO and EPO are on average published earlier by the USPTO but for many compounds an EPO patent will be the earlier disclosure • Gene/protein identification can identify clear changes in patenting behaviour over time • Text mining provides the tools to answer many reaction informatics questions
  • 37. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Acknowledgements • Funding provided by:
  • 38. 248th ACS National Meeting, San Francisco CA, USA 10th August 2014 Thank you for your time! http://nextmovesoftware.com http://nextmovesoftware.com/blog daniel@nextmovesoftware.com