SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Applications of Machine Learning
for Materials Discovery at NREL
Caleb Phillips, Ph.D.
Data Analysis and Visualization
Computational Sciences Center
National Renewable Energy Laboratory
NREL | 2
Modern “Full Stack” Materials Science
Synthesis
Characterization
Computation
Golden, Colorado
NREL | 3
Skeptics Allowed
“I’ll admit it, there may be something to
this ‘big data’ and ‘machine learning’
thing everyone keeps talking about.”
- Anonymous cynic (2017)
What changed?
• Computational power
• Deep Neural Networks
• Cheap storage, big data
• Increasing adoption/investment
NREL | 4
Setting Realistic Expectations
Machine learning &
Deep Learning
Image source: Gartner.com, Aug. 2017
NREL | 5
Overview of the talk
Compelling examples of materials-oriented machine learning at
work at NREL:
• Improving the throughput of experimentation:
• Interpretation: Accelerate the data->knowledge path
• Automation: Replace onerous manual tasks
• Prediction: Predict properties not measured
• Augmenting or replacing DFT simulations in candidate screening
• Prediction: End-to-end deep learning on molecular and
atomistic structures
Will cover applications at a high level – talk to or email me for more info
} Do work
faster with
more insight
} Focus work,
avoid some
altogether
NREL | 6
But first, the Data
The Materials Project
http://materials.nrel.gov
http://organiceletronics.nrel.gov
http://htem.nrel.gov
{
{ {
Experimental
Theoretical
Both
NREL | 7
Example: Experimental Materials Discovery
Taylor et al. Adv. Funct. Mater. 18, 3169 (2008)
% In
Conductivity of Annealed InZnO
Goal: Make a PhD Thesis Amount of Analysis a Routine Activity
Composition
Structure
Property
Process
Slide credit: John Perkins
NREL | 8
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
NREL | 9
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
AI/ML
Opportunities
Improve fidelity
Guided search
Faster screening
Automation
Visualization
Property Prediction{
NREL | 10
Initial motivating problem: Accelerate Slow Analysis Tasks
880 Unanalyzed XRD Patterns (Data)
1 Structure Phase Map (Knowledge)
Slow
NREL | 11
Clustering by structure and composition
Samples
Results
Extracted Data Set
~ 1000 XRD patterns
Spectral Clustering
NNLS Decomposition
Apply Machine Learning to Determine Clusters in XRD Patterns
Fast: ~ 30 seconds on a laptop
Calculated XRD Patterns
NREL | 12
Automatic band gap calculation
Goal: replace highly subjective manual
Process with something scalable, automated,
and (more) accurate.
Combining experimental and theoretical data compare properties across a
wide landscape of materials systems and synthesis conditions.
Schwarting et al., Materials Discovery (2018)
NREL | 13
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
AI/ML
Opportunities
Improve fidelity
Guided search
Faster screening
Automation
Visualization
Property Prediction{
NREL | 14
High throughput screening using computational results
Constraints
Molecule
Generator
Predictive
(Machine Learning)
Model
Simulation on
Supercomputer
$$$
Results
Database
OR
Best candidates
All candidates
(sequentially)
Visualization &
Analysis
Materials
Synthesis
$$$$$
Measurement
and Validation
New
Materials
Theoretical
Experimental
Training
on
Past Results
Phillips et al. CoDA (2016)
NREL | 15
Predict opto-electric properties of molecules
Support Vector Regression (SVR) performance when predicting calculated band gap. Residual
error is linear and normally distributed. Median error is effectively zero, RMSE is 0.25 EV or
less for most scenarios.
First try: learn using
molecular descriptors
(traditional feature
engineering)
2 million candidates
NREL | 16
End-to-end Learning: Skip the feature extraction
Image Recognition: Convolutional
Neural Networks (CNNs)
O
Message
Passing
Blocks
Node Recurrent Units
Node
Embedding
Layer
Graph
Output
Layer(s)
Dense
Regression
Layers
Predictions
Input
Graph
(Molecule)
Molecular Graphs: Message Passing Neural
Networks (MPNNs)Gilmer et al., CoRR (2017)
Key hypothesis: model
can learn which features
are important directly from
structure.
NREL | 17
End-to-end Learning: Skip the feature extraction
Duplicate 1 (DFT)
MachineLearningPrediction
3-5x improvement over manually engineered features.
Accuracy approaching repeated-measures accuracy of DFT.
Gap 0.90
HOMO 1.05
LUMO 0.89
Spectral overlap 1.28
Polymer HOMO 1.24
Polymer LUMO 1.03
Polymer gap 1.19
Polymer optical LUMO 1.02
!"#$("&'ℎ)*+ ,+&-*)*.)
!"#$(012 3456)'&7+8)
St. John et al. https://arxiv.org/abs/1807.10363. (2018)
NREL | 18
Transfer learning and training set size
St. John et al. https://arxiv.org/abs/1807.10363. (2018)
NREL | 19
End-to-end learning for crystalline materials
Represent crystal structure as a graph
to allow end-to-end learning.
Kamdar. 2018. NREL/US DOE CSGF.
NREL | 20
Thanks to Many Collaborators
(and many funding sources)
Theory
Stephan Lany
Vladan Stevonvic
Aaron Holder
@ LBNL
Gerd Ceder
Kristin Persson
Data
Robert White
Kristin Munch
Peter Graf
@ NIST
Zachary Trautt
Robert Hanisch
Experiment
Andriy Zakutayev
John Perkins
Philip Parilla
David Ginley
Bill Tumas
Sebastian Siol
Lauren Garten
Elisabetta Arca
Matthew Taylor
@ NIST
Martin Green
Jae Hattrick-Simpers
Nam Nguyen
@ SLAC
Apurva Mehta
@ ANL
Debbie Myers
AI/ML
Jacob Hinkle
Marcus Schwarting
Peter St. John
@ Harvard
Harshil Kamdar Slide credit: John Perkins
NREL | 21
Selected Publications
Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson,
Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen.
Message-passing neural networks for high-throughput polymer screening.
In submission. ArXiv preprint: https://arxiv.org/abs/1807.10363
Marcus Schwarting, Sebastian Siol, Kevin Talley, Andriy Zakutayev, Caleb Phillips.
Automated algorithms for band gap analysis from optical absorption spectra.
Materials Discovery, April 18, 2018. https://doi.org/10.1016/j.md.2018.04.003
Andriy Zakutayev, Nick Wunder, Marcus Schwarting, John Perkins, Robert White,
Kristin Munch, William Tumas, and Caleb Phillips.
An open experimental database for exploring inorganic materials.
Nature. Scientific Data. April 3, 2018. https://www.nature.com/articles/sdata201853
Caleb Phillips, Ross Larson, Kristin Munch, Nikos Kopidakis.
Guided Search for Organic Photovoltaic Materials Using Predictive Data Modeling.
Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico.
www.nrel.gov
Thank you
This work was authored by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy,
LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by U.S.
Department of Energy Office of Energy Efficiency and Renewable Energy. The views expressed in the article do not
necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher,
by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up,
irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for
U.S. Government purposes.
caleb.phillips@nrel.gov
NREL | 23
Save work: predict not-measured properties
• Electrical conductivity prediction using random forest model
• Training variables: chemical composition, XRD peak count, deposition conditions
• Training process: 10-fold cross-validation by withdrawing 25% sample libraries
• Training set: 16K data points varying by 9-10 orders of magnitude
Predicted vs Measured
Conductivity
Prediction accuracy for
Conductivity
Prediction accuracy of
1-2 orders of
magnitude, reasonable
for semiconductors
Zakutayev et al. Scientific Data 5 180053 (2018)
NREL | 24
What’s in my database?
tSne model can group
70K samples based on
similarity of their
chemical compositions
t-distributed stochastic neighbor embedding (tSne) dimensionality reduction model
Zakutayev et al. Scientific Data 5 180053 (2018)

Mais conteúdo relacionado

Mais procurados

Progress in all inorganic perovskite solar cell
Progress in all inorganic perovskite solar cellProgress in all inorganic perovskite solar cell
Progress in all inorganic perovskite solar cellMd Ataul Mamun
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...aimsnist
 
Quantum calculations and calculational chemistry
Quantum calculations and calculational chemistryQuantum calculations and calculational chemistry
Quantum calculations and calculational chemistrynazanin25
 
Machine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methodsMachine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methodsAnubhav Jain
 
Machine Learning for Chemical Sciences
Machine Learning for Chemical SciencesMachine Learning for Chemical Sciences
Machine Learning for Chemical SciencesIchigaku Takigawa
 
Density functional theory
Density functional theoryDensity functional theory
Density functional theorysandhya singh
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationKAMAL CHOUDHARY
 
Computational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsComputational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsAnubhav Jain
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Anubhav Jain
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Dataaimsnist
 
Domain Transfer and Adaptation Survey
Domain Transfer and Adaptation SurveyDomain Transfer and Adaptation Survey
Domain Transfer and Adaptation SurveySangwoo Mo
 
Lecture: Interatomic Potentials Enabled by Machine Learning
Lecture: Interatomic Potentials Enabled by Machine LearningLecture: Interatomic Potentials Enabled by Machine Learning
Lecture: Interatomic Potentials Enabled by Machine LearningDanielSchwalbeKoda
 
Density Functional Theory
Density Functional TheoryDensity Functional Theory
Density Functional Theorykrishslide
 
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...ABDERRAHMANE REGGAD
 
Hartree method ppt physical chemistry
Hartree method ppt physical chemistryHartree method ppt physical chemistry
Hartree method ppt physical chemistryalikhan1414
 

Mais procurados (20)

Lecture2
Lecture2Lecture2
Lecture2
 
Progress in all inorganic perovskite solar cell
Progress in all inorganic perovskite solar cellProgress in all inorganic perovskite solar cell
Progress in all inorganic perovskite solar cell
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
 
Hartree fock theory
Hartree fock theoryHartree fock theory
Hartree fock theory
 
Quantum calculations and calculational chemistry
Quantum calculations and calculational chemistryQuantum calculations and calculational chemistry
Quantum calculations and calculational chemistry
 
Machine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methodsMachine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methods
 
NANO266 - Lecture 4 - Introduction to DFT
NANO266 - Lecture 4 - Introduction to DFTNANO266 - Lecture 4 - Introduction to DFT
NANO266 - Lecture 4 - Introduction to DFT
 
Machine Learning for Chemical Sciences
Machine Learning for Chemical SciencesMachine Learning for Chemical Sciences
Machine Learning for Chemical Sciences
 
Density functional theory
Density functional theoryDensity functional theory
Density functional theory
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
Computational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsComputational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methods
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
 
Domain Transfer and Adaptation Survey
Domain Transfer and Adaptation SurveyDomain Transfer and Adaptation Survey
Domain Transfer and Adaptation Survey
 
Lecture: Interatomic Potentials Enabled by Machine Learning
Lecture: Interatomic Potentials Enabled by Machine LearningLecture: Interatomic Potentials Enabled by Machine Learning
Lecture: Interatomic Potentials Enabled by Machine Learning
 
Density Functional Theory
Density Functional TheoryDensity Functional Theory
Density Functional Theory
 
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
 
Lecture6
Lecture6Lecture6
Lecture6
 
Dft presentation
Dft presentationDft presentation
Dft presentation
 
Hartree method ppt physical chemistry
Hartree method ppt physical chemistryHartree method ppt physical chemistry
Hartree method ppt physical chemistry
 

Semelhante a Applications of Machine Learning for Materials Discovery at NREL

2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML modelaimsnist
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFIan Foster
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Anubhav Jain
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningKAMAL CHOUDHARY
 
Materials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learningMaterials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learningAnubhav Jain
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectAnubhav Jain
 
Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...Anubhav Jain
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Anubhav Jain
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
Conducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectConducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectAnubhav Jain
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignKAMAL CHOUDHARY
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Anubhav Jain
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...Anubhav Jain
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Anubhav Jain
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Anubhav Jain
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstracttsysglobalsolutions
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLAnubhav Jain
 
The Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource ProvisioningThe Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource ProvisioningRafael Ferreira da Silva
 

Semelhante a Applications of Machine Learning for Materials Discovery at NREL (20)

CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Materials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learningMaterials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learning
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials Project
 
Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
Conducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectConducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials Project
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNL
 
The Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource ProvisioningThe Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource Provisioning
 

Mais de aimsnist

Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...aimsnist
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Designaimsnist
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AIaimsnist
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...aimsnist
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...aimsnist
 
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses FasterCoupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses Fasteraimsnist
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applicationsaimsnist
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionaimsnist
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networksaimsnist
 
Pathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of MaterialsPathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of Materialsaimsnist
 
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...aimsnist
 
Materials Data in Action
Materials Data in ActionMaterials Data in Action
Materials Data in Actionaimsnist
 
Combinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials DiscoveryCombinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials Discoveryaimsnist
 

Mais de aimsnist (14)

Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AI
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
 
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses FasterCoupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisition
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networks
 
Pathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of MaterialsPathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of Materials
 
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
 
Materials Data in Action
Materials Data in ActionMaterials Data in Action
Materials Data in Action
 
Combinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials DiscoveryCombinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials Discovery
 

Último

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 

Último (20)

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 

Applications of Machine Learning for Materials Discovery at NREL

  • 1. Applications of Machine Learning for Materials Discovery at NREL Caleb Phillips, Ph.D. Data Analysis and Visualization Computational Sciences Center National Renewable Energy Laboratory
  • 2. NREL | 2 Modern “Full Stack” Materials Science Synthesis Characterization Computation Golden, Colorado
  • 3. NREL | 3 Skeptics Allowed “I’ll admit it, there may be something to this ‘big data’ and ‘machine learning’ thing everyone keeps talking about.” - Anonymous cynic (2017) What changed? • Computational power • Deep Neural Networks • Cheap storage, big data • Increasing adoption/investment
  • 4. NREL | 4 Setting Realistic Expectations Machine learning & Deep Learning Image source: Gartner.com, Aug. 2017
  • 5. NREL | 5 Overview of the talk Compelling examples of materials-oriented machine learning at work at NREL: • Improving the throughput of experimentation: • Interpretation: Accelerate the data->knowledge path • Automation: Replace onerous manual tasks • Prediction: Predict properties not measured • Augmenting or replacing DFT simulations in candidate screening • Prediction: End-to-end deep learning on molecular and atomistic structures Will cover applications at a high level – talk to or email me for more info } Do work faster with more insight } Focus work, avoid some altogether
  • 6. NREL | 6 But first, the Data The Materials Project http://materials.nrel.gov http://organiceletronics.nrel.gov http://htem.nrel.gov { { { Experimental Theoretical Both
  • 7. NREL | 7 Example: Experimental Materials Discovery Taylor et al. Adv. Funct. Mater. 18, 3169 (2008) % In Conductivity of Annealed InZnO Goal: Make a PhD Thesis Amount of Analysis a Routine Activity Composition Structure Property Process Slide credit: John Perkins
  • 8. NREL | 8 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization
  • 9. NREL | 9 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization AI/ML Opportunities Improve fidelity Guided search Faster screening Automation Visualization Property Prediction{
  • 10. NREL | 10 Initial motivating problem: Accelerate Slow Analysis Tasks 880 Unanalyzed XRD Patterns (Data) 1 Structure Phase Map (Knowledge) Slow
  • 11. NREL | 11 Clustering by structure and composition Samples Results Extracted Data Set ~ 1000 XRD patterns Spectral Clustering NNLS Decomposition Apply Machine Learning to Determine Clusters in XRD Patterns Fast: ~ 30 seconds on a laptop Calculated XRD Patterns
  • 12. NREL | 12 Automatic band gap calculation Goal: replace highly subjective manual Process with something scalable, automated, and (more) accurate. Combining experimental and theoretical data compare properties across a wide landscape of materials systems and synthesis conditions. Schwarting et al., Materials Discovery (2018)
  • 13. NREL | 13 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization AI/ML Opportunities Improve fidelity Guided search Faster screening Automation Visualization Property Prediction{
  • 14. NREL | 14 High throughput screening using computational results Constraints Molecule Generator Predictive (Machine Learning) Model Simulation on Supercomputer $$$ Results Database OR Best candidates All candidates (sequentially) Visualization & Analysis Materials Synthesis $$$$$ Measurement and Validation New Materials Theoretical Experimental Training on Past Results Phillips et al. CoDA (2016)
  • 15. NREL | 15 Predict opto-electric properties of molecules Support Vector Regression (SVR) performance when predicting calculated band gap. Residual error is linear and normally distributed. Median error is effectively zero, RMSE is 0.25 EV or less for most scenarios. First try: learn using molecular descriptors (traditional feature engineering) 2 million candidates
  • 16. NREL | 16 End-to-end Learning: Skip the feature extraction Image Recognition: Convolutional Neural Networks (CNNs) O Message Passing Blocks Node Recurrent Units Node Embedding Layer Graph Output Layer(s) Dense Regression Layers Predictions Input Graph (Molecule) Molecular Graphs: Message Passing Neural Networks (MPNNs)Gilmer et al., CoRR (2017) Key hypothesis: model can learn which features are important directly from structure.
  • 17. NREL | 17 End-to-end Learning: Skip the feature extraction Duplicate 1 (DFT) MachineLearningPrediction 3-5x improvement over manually engineered features. Accuracy approaching repeated-measures accuracy of DFT. Gap 0.90 HOMO 1.05 LUMO 0.89 Spectral overlap 1.28 Polymer HOMO 1.24 Polymer LUMO 1.03 Polymer gap 1.19 Polymer optical LUMO 1.02 !"#$("&'ℎ)*+ ,+&-*)*.) !"#$(012 3456)'&7+8) St. John et al. https://arxiv.org/abs/1807.10363. (2018)
  • 18. NREL | 18 Transfer learning and training set size St. John et al. https://arxiv.org/abs/1807.10363. (2018)
  • 19. NREL | 19 End-to-end learning for crystalline materials Represent crystal structure as a graph to allow end-to-end learning. Kamdar. 2018. NREL/US DOE CSGF.
  • 20. NREL | 20 Thanks to Many Collaborators (and many funding sources) Theory Stephan Lany Vladan Stevonvic Aaron Holder @ LBNL Gerd Ceder Kristin Persson Data Robert White Kristin Munch Peter Graf @ NIST Zachary Trautt Robert Hanisch Experiment Andriy Zakutayev John Perkins Philip Parilla David Ginley Bill Tumas Sebastian Siol Lauren Garten Elisabetta Arca Matthew Taylor @ NIST Martin Green Jae Hattrick-Simpers Nam Nguyen @ SLAC Apurva Mehta @ ANL Debbie Myers AI/ML Jacob Hinkle Marcus Schwarting Peter St. John @ Harvard Harshil Kamdar Slide credit: John Perkins
  • 21. NREL | 21 Selected Publications Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson, Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen. Message-passing neural networks for high-throughput polymer screening. In submission. ArXiv preprint: https://arxiv.org/abs/1807.10363 Marcus Schwarting, Sebastian Siol, Kevin Talley, Andriy Zakutayev, Caleb Phillips. Automated algorithms for band gap analysis from optical absorption spectra. Materials Discovery, April 18, 2018. https://doi.org/10.1016/j.md.2018.04.003 Andriy Zakutayev, Nick Wunder, Marcus Schwarting, John Perkins, Robert White, Kristin Munch, William Tumas, and Caleb Phillips. An open experimental database for exploring inorganic materials. Nature. Scientific Data. April 3, 2018. https://www.nature.com/articles/sdata201853 Caleb Phillips, Ross Larson, Kristin Munch, Nikos Kopidakis. Guided Search for Organic Photovoltaic Materials Using Predictive Data Modeling. Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico.
  • 22. www.nrel.gov Thank you This work was authored by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy, LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by U.S. Department of Energy Office of Energy Efficiency and Renewable Energy. The views expressed in the article do not necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for U.S. Government purposes. caleb.phillips@nrel.gov
  • 23. NREL | 23 Save work: predict not-measured properties • Electrical conductivity prediction using random forest model • Training variables: chemical composition, XRD peak count, deposition conditions • Training process: 10-fold cross-validation by withdrawing 25% sample libraries • Training set: 16K data points varying by 9-10 orders of magnitude Predicted vs Measured Conductivity Prediction accuracy for Conductivity Prediction accuracy of 1-2 orders of magnitude, reasonable for semiconductors Zakutayev et al. Scientific Data 5 180053 (2018)
  • 24. NREL | 24 What’s in my database? tSne model can group 70K samples based on similarity of their chemical compositions t-distributed stochastic neighbor embedding (tSne) dimensionality reduction model Zakutayev et al. Scientific Data 5 180053 (2018)