SlideShare a Scribd company logo
1 of 43
Download to read offline
Discovering and Exploring New
Materials through the Materials
Project
Anubhav Jain
Staff Scientist, Lawrence Berkeley National Laboratory
Associate Director, Materials Project
materialsproject.org
The Materials Project
Slides (already) uploaded to https://hackingmaterials.lbl.gov
Outline of talk
1.What is the Materials Project?
2.Applications to functional materials design
3.Community data contributions
4.Using the software tools
The core of Materials Project is a free database of
calculated materials properties and crystal structures
Free, public resource
• www.materialsproject.org
Data on ~150,000 materials,
including information on:
• electronic structure
• phonon and thermal
properties
• elastic / mechanical properties
• magnetic properties
• ferroelectric properties
• piezoelectric properties
• dielectric properties
Powered by hundreds of millions
of CPU-hours invested into high-
quality calculations
3
The core data set keeps growing with time …
4
Apps give insight into data
Materials Explorer
Phase Stability Diagrams
Pourbaix Diagrams
(Aqueous Stability)
Battery Explorer
5
The code powering the Materials Project is
available open source (BSD/MIT licenses)
just-in-time error correction, fixing your
calculations so you don’t have to
‘recipes' for common materials
science simulation tasks
making materials science web apps easy
workflow management software for
high-throughput computing
materials science analysis code:
make, transform and analyze crystals,
phase diagrams and more
& more … MP team members also contribue to
several other non-MP codes, e.g. matminer for
machine learning featurization
6
The Materials Project is used heavily by the research
community
> 180,000 registered
users
> 40,000 new users last year
~100 new registrations/day
~10,000 users log on every day
> 2M+ records downloaded through API each day; 1.8 TB of data served per month 7
Used in academia and in industry
8
3.5%
Schrodinger: Many of our customers are active users of
the Materials Project and use MP databases for
their projects. Enabling direct access to MP databases
from within Schrödinger software is a powerful addition
that will be appreciated by our users.
Toyota: “Materials Project
is a wonderful project.
Please accept my
appreciation to you to
release it free and easy to
access.”
Hazen Research: “Amazing
and well done data base. I
still remember searching
Landolt-Börnstein series
during my PhD for similar
things.”
Student
44%
Academia
36%
Industry
10%
Government
5%
Other
5%
Outline of talk
1.What is the Materials Project?
2.Applications to functional materials design
3.Community data contributions
4.Using the software tools
Historically, the Materials Project originated in battery
research and screening
10
Plain Oxides
(9204)
Silicates (1857)
Phosphates (1609)
Borates (1035)
Carbonates (370)
Vanadates (1488)
Sulfates (330)
Nitrates(61)
No Oxygen (4153)
Li
Containing
Compounds
Computed
Jain, Hautier, Moore,
Ong, Fischer,
Mueller, Persson,
Ceder
Comp. Mat. Sci
(2011)
High-throughput computational screening led
to the identification of several novel Li-ion
battery cathodes.
The data formed the basis of the original
Materials Project release.
The Materials Project continues to be used to identify
new battery materials
11
High-Throughput Computational Screening of Li-
Containing Fluorides for Battery Cathode Coatings
Bo Liu, Da Wang, Maxim Avdeev, Siqi Shi, Jiong
Yang, and Wenqing Zhang
ACS Sustainable Chem. Eng. 2020, 8, 2, 948–957
Researchers use the Materials
Project to design new cathode
coating materials to prevent
degradation
The Materials Project is used as a data set to train ML
models on battery properties
12
Using Materials Project to train
models for electrode voltage and
volume change upon intercalation
Machine Learning Screening of Metal-Ion Battery Electrode Materials
Isaiah A. Moses, Rajendra P. Joshi, Burak Ozdemir, Neeraj Kumar, Jesse
Eickholt, and Veronica Barone
ACS Appl. Mater. Interfaces 2021, 13, 45, 53355–53362
MP has been used to design many new
materials that have experimentally
confirmed useful properties
MP for p-type transparent conductors
References
✦ Hautier, G., Miglio,A., Ceder, G., Rignanese, G.-M. & Gonze, X. Identification and
design principles of low hole effective mass p-type transparent conducting oxides.
Nature Communications 4, (2013)
✦ Bhatia,A. et al. High-Mobility Bismuth-based Transparent p-Type Oxide from High-
Throughput Material Screening. Chemistry of Materials 28, 30–34 (2015)
✦ Ricci, F. et al.An ab initio electronic transport database for inorganic materials.
Scientific Data 4, (2017)
Prediction
Screening based on band
gap, transport properties
and band alignments.
Experiment
Predictions revealed
material with s–p
hybridized valence band
(thought to correlate
well with dopability).
When synthesized,
material has excellent
transparency and readily
dopable with K.
Ba2BiTaO6
MP for thermoelectrics
References
✦ Aydemir, U. et al.YCuTe2: a member of a new class of thermoelectric materials with
CuTe4-based layered structure. Journal of Materials Chemistry A 4, 2461–2472 (2016)
✦ Zhu, H. et al. Computational and experimental investigation ofTmAgTe2and
XYZ2compounds, a new group of thermoelectric materials identified by first-principles
high-throughput screening. Journal of Materials Chemistry C 3, 10554–10565 (2015).
✦ Pöhls, J.-H. et al. Metal phosphides as potential thermoelectric materials. Journal of
Materials Chemistry C 5, 12441–12456 (2017).
Prediction
Screening of tens of
thousands of materials
with predicted electron
transport properties
revealed a family of
promising XYZ2
candidates
Experiment
Several materials made:
YCuTe2 (zT = 0.75),
TmAgTe2 (zT = 0.47, 1.8
theoretical), novel NiP2
phosphide
TmAgTe2
MP for phosphors
References
✦ Wang, Z. et al. Mining Unexplored Chemistries for Phosphors for High-Color-
Quality White-Light-Emitting Diodes. Joule 2, 914–926 (2018)
✦ Li, S. et al. Data-Driven Discovery of Full-Visible-Spectrum Phosphor. Chemistry of
Materials 31, 6286–6294 (2019)
✦ Ha, J. et al. Color tunable single-phase Eu2+ and Ce3+ co-activated Sr2LiAlO4
phosphors. Journal of Materials Chemistry C 7, 7734–7744 (2019)
Prediction
Statistical analysis of existing
materials that co-occur with
word ‘phosphor’ followed
by structure prediction for
new materials
Experiment
Predicted first known Sr-Li-
Al-N quaternary, showed
green-yellow/blue emission
with quantum efficiency of
25% (Eu), 40% (Ce), 55%
(co-activated Eu, Ce)
Sr2LiAlN4
≈ç ≈
13
If you are interested in more case studies like this,
we describe more examples in a review paper
Jain, A.; Shin, Y.; Persson, K. A. Computational Predictions of Energy
Materials Using Density Functional Theory. Nature Reviews Materials
2016, 1 (1), 15004. https://doi.org/10.1038/natrevmats.2015.4
Many more examples since writing this
(increasing rate of computationally-driven discoveries)
14
Outline of talk
1.What is the Materials Project?
2.Applications to functional materials design
3.Community data contributions
4.Using the software tools
How can we use Materials Project to build a
community of materials researchers?
Materials Project now has
high visibility (e.g., by search
engines)
How can we use this
platform to help add value to
the community of materials
researchers?
16
Beyond calculations: MPContribs allows the research
community to contribute their own data
A “materials detail page,”
containing all the information MP
has calculated about a specific
material
Experimental data on a
material (either specific
phase, composition, or
chemical system)
“MPContribs” bridges
the gap
17
2. Materials Project links
to your contribution
3. Your data set and
paper are linked
1. Google links to
Materials Project page
18
From Google search to your data and your research, via MP
Current status of contributions
19
Main MPContribs Lightsources Machine-Learning
Public projects 27 1 13
Private projects 12 4 0
Contributions
(~1GB)
834,002 188 408,062
Structures (~12GB) 505,773 0 231,307
Tables (~1GB) 385,678 189 0
Attachments
(~38GB)
521,477 2 0
https://next-gen.materialsproject.org/catalysis
“standard” data sets
“enhanced” data sets
Advanced Search, Visualize, etc.
MPContribs is open for contributions
You can now apply to contribute
your data set and we will work
with you to disseminate via MP
Designed for:
• smaller data sets (e.g., MBs to
GBs); for large data files see
NOMAD or other repos
• Linking to MP compositions
Available via mpcontribs.org
20
New to MPContribs: Benchmarking machine
learning algorithms (“Matbench”)
21
Dunn, A.; Wang, Q.; Ganose, A.; Dopp, D.; Jain, A. Benchmarking Materials Property Prediction Methods: The Matbench Test Set and Automatminer
Reference Algorithm. npj Comput Mater 2020, 6 (1), 138. https://doi.org/10.1038/s41524-020-00406-3.
The ingredients of the Matbench
benchmark
ü Standard data sets
ü Standard test splits according to nested cross-validation procedure
ü An online leaderboard that encourages reproducible results
22
Matbench has an online leaderboard you can submit to – matbench.materialsproject.org
Outline of talk
1.What is the Materials Project?
2.Applications to functional materials design
3.Community data contributions
4.Using the software tools
Steps for theory & simulation
25
material generation
simulation procedure
calculation & execution
data analysis
(crystal, surface/interface,
molecule, etc...)
simulation parameters and
sequence (workflow)
job management,
execution, error recovery
databases, plotting,
analysis, insight
Software stack for theory &
simulation
26
material generation
simulation procedure
calculation & execution
data analysis
Custodian
Steps for theory & simulation
27
material generation
simulation procedure
calculation & execution
data analysis
(crystal, surface/interface,
molecule, etc...)
simulation parameters and
sequence (workflow)
job management,
execution, error recovery
databases, plotting,
analysis, insight
Step 1: materials generation
28
material
generation
• Pymatgen can help generate
models for:
– crystal structures
– molecules
– systems (surfaces, interfaces, etc.)
• Tools include:
– order-disorder (shown at right) and
SQS
– interstitial finding
– surface / slab generation
– structure matching and analysis
– get structures from Materials Project
Example: Order-disorder
resolve partial or mixed
occupancies into a fully ordered
crystal structure
(e.g., mixed oxide-fluoride site
into separate oxygen/fluorine)
Steps for theory & simulation
29
material generation
simulation procedure
calculation & execution
data analysis
(crystal, surface/interface,
molecule, etc...)
simulation parameters and
sequence (workflow)
job management,
execution, error recovery
databases, plotting,
analysis, insight
Atomate contains a library of simulation procedures
30
Mathew, K. et al Atomate: A high-level interface to generate, execute, and analyze
computational materials science workflows, Comput. Mater. Sci. 139 (2017) 140–152.
electronic bandstrcutre
elastic tensor
piezo/dielectric tensor
ferrolectricity
nudged elastic band
EELS/XAS
dipole moment
thermo expansion
exchange
QChem FF optimization
charge density
lobster
Gibbs
magnetic
SCAN funtional
LAMMPS
adsorption 2017 package
CP2K [WIP]
QChem force [WIP]
NBO [WIP]
...
lattice dymamics [WIP]
2022 package
2017|first
launch 2022|5 years
progress 202X|future
plan
a
b
Each simulation procedure in atomate is composed of
multiple levels of detail / abstraction
31
Workflow – complete set of calculations to get a materials property
Firework – one step in the Workflow (typically one DFT calculation)
Firetask – one step in a Firework
Starting with just a crystal structure,
this workflow performs four
calculations to get an optimized
structure, optimized charge
density, and band structure on two
types of grids (uniform and line)
Steps for theory & simulation
32
material generation
simulation procedure
calculation & execution
data analysis
(crystal, surface/interface,
molecule, etc...)
simulation parameters and
sequence (workflow)
job management,
execution, error recovery
databases, plotting,
analysis, insight
FireWorks allows you to write your workflow once and execute (almost) anywhere
33
• Execute workflows locally
or at a supercomputing
center
• Queue systems supported
– PBS
– SGE
– SLURM
– IBM LoadLeveler
– NEWT (a REST-based API at
NERSC)
– Cobalt (Argonne LCF)
• Cloud based services
(user-generated)
– https://github.com/CovertLa
b/borealis
Dashboard with status of all jobs
34
Job provenance and automatic metadata storage
35
what machine
what time
what directory
what was the output
when was it queued
when did it start running
when was it completed
Steps for theory & simulation
36
material generation
simulation procedure
calculation & execution
data analysis
(crystal, surface/interface,
molecule, etc...)
simulation parameters and
sequence (workflow)
job management,
execution, error recovery
databases, plotting,
analysis, insight
Examples of analyses
37
phase diagrams
Pourbaix diagrams
diffusivity from MD
band structure analysis
• Papers: good general overview / vision but of no practical help
• Online MP Workshop videos and tutorials: good way to get started
but not comprehensive. A “zero to one” situation
• Code documentation: Most comprehensive way to learn, although
some experience probably necessary
• Help channels: Useful if you already started and run into problems
38
What resources are available to learn?
• Resources on http://workshop.materialsproject.org
• Videos on MP channel of Youtube:
– https://www.youtube.com/channel/UC6pqY-__Nu-
mKkv0LMP8FJQ
39
Online MP workshop and videos
Online documentation
• Online documentation is the most comprehensive writeup
– www.pymatgen.org
– https://materialsproject.github.io/fireworks/
– https://materialsproject.github.io/custodian/
– https://hackingmaterials.github.io/atomate/
• The online documentation includes installation, examples, tutorials, and
descriptions of how to use the code
• If you want to do “everything”, suggest starting with atomate and going
from there
40
Help lists
• A help forum for each code is at https://discuss.matsci.org
41
Note: matsci.org now
includes help forums
for ~25 different
codes!
Including LAMMPS,
ASE, NOMAD, and
many more
Concluding thoughts
The Materials Project is a free resource providing data and tools to
help perform research and development of new materials
The number of proven examples of data-driven materials design is
increasing, and joint computational–experimental discoveries are
becoming common
Even more can be accomplished as a unified community to push
forward data dissemination as well as the capabilities of machine
learning
42
Kristin Persson
MP Director
The team Intro
Thank you!
Matt Horton
Staff Developer
(Materials
Project)
Patrick Huck
Staff Developer
(MPContribs)
Slides (already) uploaded to https://hackingmaterials.lbl.gov

More Related Content

Similar to Discovering and Exploring New Materials through the Materials Project

Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NREL
aimsnist
 

Similar to Discovering and Exploring New Materials through the Materials Project (20)

Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
An AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesisAn AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesis
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNL
 
Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NREL
 
The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model
 
DeLong - Complex Materials and Devices - Spring Review 2013
DeLong - Complex Materials and Devices - Spring Review 2013DeLong - Complex Materials and Devices - Spring Review 2013
DeLong - Complex Materials and Devices - Spring Review 2013
 
Machine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methodsMachine learning for materials design: opportunities, challenges, and methods
Machine learning for materials design: opportunities, challenges, and methods
 
The Materials Project
The Materials ProjectThe Materials Project
The Materials Project
 
Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...
 
Computational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsComputational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methods
 
The Materials Project - Combining Science and Informatics to Accelerate Mater...
The Materials Project - Combining Science and Informatics to Accelerate Mater...The Materials Project - Combining Science and Informatics to Accelerate Mater...
The Materials Project - Combining Science and Informatics to Accelerate Mater...
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open data
 
Hattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsHattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in Materials
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 

More from Anubhav Jain

More from Anubhav Jain (20)

Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...
 
Applications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
 
Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...
 
Efficient methods for accurately calculating thermoelectric properties – elec...
Efficient methods for accurately calculating thermoelectric properties – elec...Efficient methods for accurately calculating thermoelectric properties – elec...
Efficient methods for accurately calculating thermoelectric properties – elec...
 
Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...
 
Machine Learning for Catalyst Design
Machine Learning for Catalyst DesignMachine Learning for Catalyst Design
Machine Learning for Catalyst Design
 
Accelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine LearningAccelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine Learning
 
DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …
 
Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...
 
Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...
 
Machine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignMachine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst Design
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
 
Assessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data AnalysisAssessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data Analysis
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...
 
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
 
Automating materials science workflows with pymatgen, FireWorks, and atomate
Automating materials science workflows with pymatgen, FireWorks, and atomateAutomating materials science workflows with pymatgen, FireWorks, and atomate
Automating materials science workflows with pymatgen, FireWorks, and atomate
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 

Recently uploaded

The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 

Recently uploaded (20)

Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 

Discovering and Exploring New Materials through the Materials Project

  • 1. Discovering and Exploring New Materials through the Materials Project Anubhav Jain Staff Scientist, Lawrence Berkeley National Laboratory Associate Director, Materials Project materialsproject.org The Materials Project Slides (already) uploaded to https://hackingmaterials.lbl.gov
  • 2. Outline of talk 1.What is the Materials Project? 2.Applications to functional materials design 3.Community data contributions 4.Using the software tools
  • 3. The core of Materials Project is a free database of calculated materials properties and crystal structures Free, public resource • www.materialsproject.org Data on ~150,000 materials, including information on: • electronic structure • phonon and thermal properties • elastic / mechanical properties • magnetic properties • ferroelectric properties • piezoelectric properties • dielectric properties Powered by hundreds of millions of CPU-hours invested into high- quality calculations 3
  • 4. The core data set keeps growing with time … 4
  • 5. Apps give insight into data Materials Explorer Phase Stability Diagrams Pourbaix Diagrams (Aqueous Stability) Battery Explorer 5
  • 6. The code powering the Materials Project is available open source (BSD/MIT licenses) just-in-time error correction, fixing your calculations so you don’t have to ‘recipes' for common materials science simulation tasks making materials science web apps easy workflow management software for high-throughput computing materials science analysis code: make, transform and analyze crystals, phase diagrams and more & more … MP team members also contribue to several other non-MP codes, e.g. matminer for machine learning featurization 6
  • 7. The Materials Project is used heavily by the research community > 180,000 registered users > 40,000 new users last year ~100 new registrations/day ~10,000 users log on every day > 2M+ records downloaded through API each day; 1.8 TB of data served per month 7
  • 8. Used in academia and in industry 8 3.5% Schrodinger: Many of our customers are active users of the Materials Project and use MP databases for their projects. Enabling direct access to MP databases from within Schrödinger software is a powerful addition that will be appreciated by our users. Toyota: “Materials Project is a wonderful project. Please accept my appreciation to you to release it free and easy to access.” Hazen Research: “Amazing and well done data base. I still remember searching Landolt-Börnstein series during my PhD for similar things.” Student 44% Academia 36% Industry 10% Government 5% Other 5%
  • 9. Outline of talk 1.What is the Materials Project? 2.Applications to functional materials design 3.Community data contributions 4.Using the software tools
  • 10. Historically, the Materials Project originated in battery research and screening 10 Plain Oxides (9204) Silicates (1857) Phosphates (1609) Borates (1035) Carbonates (370) Vanadates (1488) Sulfates (330) Nitrates(61) No Oxygen (4153) Li Containing Compounds Computed Jain, Hautier, Moore, Ong, Fischer, Mueller, Persson, Ceder Comp. Mat. Sci (2011) High-throughput computational screening led to the identification of several novel Li-ion battery cathodes. The data formed the basis of the original Materials Project release.
  • 11. The Materials Project continues to be used to identify new battery materials 11 High-Throughput Computational Screening of Li- Containing Fluorides for Battery Cathode Coatings Bo Liu, Da Wang, Maxim Avdeev, Siqi Shi, Jiong Yang, and Wenqing Zhang ACS Sustainable Chem. Eng. 2020, 8, 2, 948–957 Researchers use the Materials Project to design new cathode coating materials to prevent degradation
  • 12. The Materials Project is used as a data set to train ML models on battery properties 12 Using Materials Project to train models for electrode voltage and volume change upon intercalation Machine Learning Screening of Metal-Ion Battery Electrode Materials Isaiah A. Moses, Rajendra P. Joshi, Burak Ozdemir, Neeraj Kumar, Jesse Eickholt, and Veronica Barone ACS Appl. Mater. Interfaces 2021, 13, 45, 53355–53362
  • 13. MP has been used to design many new materials that have experimentally confirmed useful properties MP for p-type transparent conductors References ✦ Hautier, G., Miglio,A., Ceder, G., Rignanese, G.-M. & Gonze, X. Identification and design principles of low hole effective mass p-type transparent conducting oxides. Nature Communications 4, (2013) ✦ Bhatia,A. et al. High-Mobility Bismuth-based Transparent p-Type Oxide from High- Throughput Material Screening. Chemistry of Materials 28, 30–34 (2015) ✦ Ricci, F. et al.An ab initio electronic transport database for inorganic materials. Scientific Data 4, (2017) Prediction Screening based on band gap, transport properties and band alignments. Experiment Predictions revealed material with s–p hybridized valence band (thought to correlate well with dopability). When synthesized, material has excellent transparency and readily dopable with K. Ba2BiTaO6 MP for thermoelectrics References ✦ Aydemir, U. et al.YCuTe2: a member of a new class of thermoelectric materials with CuTe4-based layered structure. Journal of Materials Chemistry A 4, 2461–2472 (2016) ✦ Zhu, H. et al. Computational and experimental investigation ofTmAgTe2and XYZ2compounds, a new group of thermoelectric materials identified by first-principles high-throughput screening. Journal of Materials Chemistry C 3, 10554–10565 (2015). ✦ Pöhls, J.-H. et al. Metal phosphides as potential thermoelectric materials. Journal of Materials Chemistry C 5, 12441–12456 (2017). Prediction Screening of tens of thousands of materials with predicted electron transport properties revealed a family of promising XYZ2 candidates Experiment Several materials made: YCuTe2 (zT = 0.75), TmAgTe2 (zT = 0.47, 1.8 theoretical), novel NiP2 phosphide TmAgTe2 MP for phosphors References ✦ Wang, Z. et al. Mining Unexplored Chemistries for Phosphors for High-Color- Quality White-Light-Emitting Diodes. Joule 2, 914–926 (2018) ✦ Li, S. et al. Data-Driven Discovery of Full-Visible-Spectrum Phosphor. Chemistry of Materials 31, 6286–6294 (2019) ✦ Ha, J. et al. Color tunable single-phase Eu2+ and Ce3+ co-activated Sr2LiAlO4 phosphors. Journal of Materials Chemistry C 7, 7734–7744 (2019) Prediction Statistical analysis of existing materials that co-occur with word ‘phosphor’ followed by structure prediction for new materials Experiment Predicted first known Sr-Li- Al-N quaternary, showed green-yellow/blue emission with quantum efficiency of 25% (Eu), 40% (Ce), 55% (co-activated Eu, Ce) Sr2LiAlN4 ≈ç ≈ 13
  • 14. If you are interested in more case studies like this, we describe more examples in a review paper Jain, A.; Shin, Y.; Persson, K. A. Computational Predictions of Energy Materials Using Density Functional Theory. Nature Reviews Materials 2016, 1 (1), 15004. https://doi.org/10.1038/natrevmats.2015.4 Many more examples since writing this (increasing rate of computationally-driven discoveries) 14
  • 15. Outline of talk 1.What is the Materials Project? 2.Applications to functional materials design 3.Community data contributions 4.Using the software tools
  • 16. How can we use Materials Project to build a community of materials researchers? Materials Project now has high visibility (e.g., by search engines) How can we use this platform to help add value to the community of materials researchers? 16
  • 17. Beyond calculations: MPContribs allows the research community to contribute their own data A “materials detail page,” containing all the information MP has calculated about a specific material Experimental data on a material (either specific phase, composition, or chemical system) “MPContribs” bridges the gap 17
  • 18. 2. Materials Project links to your contribution 3. Your data set and paper are linked 1. Google links to Materials Project page 18 From Google search to your data and your research, via MP
  • 19. Current status of contributions 19 Main MPContribs Lightsources Machine-Learning Public projects 27 1 13 Private projects 12 4 0 Contributions (~1GB) 834,002 188 408,062 Structures (~12GB) 505,773 0 231,307 Tables (~1GB) 385,678 189 0 Attachments (~38GB) 521,477 2 0 https://next-gen.materialsproject.org/catalysis “standard” data sets “enhanced” data sets Advanced Search, Visualize, etc.
  • 20. MPContribs is open for contributions You can now apply to contribute your data set and we will work with you to disseminate via MP Designed for: • smaller data sets (e.g., MBs to GBs); for large data files see NOMAD or other repos • Linking to MP compositions Available via mpcontribs.org 20
  • 21. New to MPContribs: Benchmarking machine learning algorithms (“Matbench”) 21 Dunn, A.; Wang, Q.; Ganose, A.; Dopp, D.; Jain, A. Benchmarking Materials Property Prediction Methods: The Matbench Test Set and Automatminer Reference Algorithm. npj Comput Mater 2020, 6 (1), 138. https://doi.org/10.1038/s41524-020-00406-3.
  • 22. The ingredients of the Matbench benchmark ü Standard data sets ü Standard test splits according to nested cross-validation procedure ü An online leaderboard that encourages reproducible results 22
  • 23. Matbench has an online leaderboard you can submit to – matbench.materialsproject.org
  • 24. Outline of talk 1.What is the Materials Project? 2.Applications to functional materials design 3.Community data contributions 4.Using the software tools
  • 25. Steps for theory & simulation 25 material generation simulation procedure calculation & execution data analysis (crystal, surface/interface, molecule, etc...) simulation parameters and sequence (workflow) job management, execution, error recovery databases, plotting, analysis, insight
  • 26. Software stack for theory & simulation 26 material generation simulation procedure calculation & execution data analysis Custodian
  • 27. Steps for theory & simulation 27 material generation simulation procedure calculation & execution data analysis (crystal, surface/interface, molecule, etc...) simulation parameters and sequence (workflow) job management, execution, error recovery databases, plotting, analysis, insight
  • 28. Step 1: materials generation 28 material generation • Pymatgen can help generate models for: – crystal structures – molecules – systems (surfaces, interfaces, etc.) • Tools include: – order-disorder (shown at right) and SQS – interstitial finding – surface / slab generation – structure matching and analysis – get structures from Materials Project Example: Order-disorder resolve partial or mixed occupancies into a fully ordered crystal structure (e.g., mixed oxide-fluoride site into separate oxygen/fluorine)
  • 29. Steps for theory & simulation 29 material generation simulation procedure calculation & execution data analysis (crystal, surface/interface, molecule, etc...) simulation parameters and sequence (workflow) job management, execution, error recovery databases, plotting, analysis, insight
  • 30. Atomate contains a library of simulation procedures 30 Mathew, K. et al Atomate: A high-level interface to generate, execute, and analyze computational materials science workflows, Comput. Mater. Sci. 139 (2017) 140–152. electronic bandstrcutre elastic tensor piezo/dielectric tensor ferrolectricity nudged elastic band EELS/XAS dipole moment thermo expansion exchange QChem FF optimization charge density lobster Gibbs magnetic SCAN funtional LAMMPS adsorption 2017 package CP2K [WIP] QChem force [WIP] NBO [WIP] ... lattice dymamics [WIP] 2022 package 2017|first launch 2022|5 years progress 202X|future plan a b
  • 31. Each simulation procedure in atomate is composed of multiple levels of detail / abstraction 31 Workflow – complete set of calculations to get a materials property Firework – one step in the Workflow (typically one DFT calculation) Firetask – one step in a Firework Starting with just a crystal structure, this workflow performs four calculations to get an optimized structure, optimized charge density, and band structure on two types of grids (uniform and line)
  • 32. Steps for theory & simulation 32 material generation simulation procedure calculation & execution data analysis (crystal, surface/interface, molecule, etc...) simulation parameters and sequence (workflow) job management, execution, error recovery databases, plotting, analysis, insight
  • 33. FireWorks allows you to write your workflow once and execute (almost) anywhere 33 • Execute workflows locally or at a supercomputing center • Queue systems supported – PBS – SGE – SLURM – IBM LoadLeveler – NEWT (a REST-based API at NERSC) – Cobalt (Argonne LCF) • Cloud based services (user-generated) – https://github.com/CovertLa b/borealis
  • 34. Dashboard with status of all jobs 34
  • 35. Job provenance and automatic metadata storage 35 what machine what time what directory what was the output when was it queued when did it start running when was it completed
  • 36. Steps for theory & simulation 36 material generation simulation procedure calculation & execution data analysis (crystal, surface/interface, molecule, etc...) simulation parameters and sequence (workflow) job management, execution, error recovery databases, plotting, analysis, insight
  • 37. Examples of analyses 37 phase diagrams Pourbaix diagrams diffusivity from MD band structure analysis
  • 38. • Papers: good general overview / vision but of no practical help • Online MP Workshop videos and tutorials: good way to get started but not comprehensive. A “zero to one” situation • Code documentation: Most comprehensive way to learn, although some experience probably necessary • Help channels: Useful if you already started and run into problems 38 What resources are available to learn?
  • 39. • Resources on http://workshop.materialsproject.org • Videos on MP channel of Youtube: – https://www.youtube.com/channel/UC6pqY-__Nu- mKkv0LMP8FJQ 39 Online MP workshop and videos
  • 40. Online documentation • Online documentation is the most comprehensive writeup – www.pymatgen.org – https://materialsproject.github.io/fireworks/ – https://materialsproject.github.io/custodian/ – https://hackingmaterials.github.io/atomate/ • The online documentation includes installation, examples, tutorials, and descriptions of how to use the code • If you want to do “everything”, suggest starting with atomate and going from there 40
  • 41. Help lists • A help forum for each code is at https://discuss.matsci.org 41 Note: matsci.org now includes help forums for ~25 different codes! Including LAMMPS, ASE, NOMAD, and many more
  • 42. Concluding thoughts The Materials Project is a free resource providing data and tools to help perform research and development of new materials The number of proven examples of data-driven materials design is increasing, and joint computational–experimental discoveries are becoming common Even more can be accomplished as a unified community to push forward data dissemination as well as the capabilities of machine learning 42
  • 43. Kristin Persson MP Director The team Intro Thank you! Matt Horton Staff Developer (Materials Project) Patrick Huck Staff Developer (MPContribs) Slides (already) uploaded to https://hackingmaterials.lbl.gov