SlideShare uma empresa Scribd logo
1 de 51
Baixar para ler offline
Karsten Peters
peters@dkrz.de
Applying machine learning
to address pressing issues
of fundamental weather
and climate research
Maria Moreno de Castro
moreno@dkrz.de
the presentation follow this perspective article
Organism
Biome
Region
Landscape
ECOSYSTEM
Organ
Cell
Molecule
GLOBE
Complex
Biology + Chemistry + Physics
Unique
The Earth System
Slide courtesy from the author Markus Reichstein
THE EARTH SYSTEM
THE EARTH SYSTEM
The behavior is dominated by spatial and temporal relations
Main research focus:
○ seasonal meteorological predictions
○ forecasting extreme events: floods, fires,...
○ long term climate predictions
It’s not like we
haven’t got
enough data at
our hands…
There’s
observational
data…
The A-Train
- 7 satellites flying in formation
- Operating since ~18 years
- Aqua collects about 89 GB of data/day
Example of observational data collection by remote sensing
It’s not like we
haven’t got
enough data at
our hands…
There’s
observational
data …
...and model
data.
Model data are the result of simulations generated by numerically solved
differential equations derived from physical models by discretizing the Earth and
representing key processes with parameterizations
Repeat for every model timestep and for every point of the globe
Model data Calculate physical
processes
Apply boundary
conditions
Multimodel analysis
©DKRZ/MPI-M
Last report:
3.5PBytes
Next report:
~ 30PBytes
Climate Models at the 1km scale are coming up
~650 GB of data per output time step
<<
Mistral High
Performace Computer
- 6 years old (new in 2020)
- 3.9 PFlops (#80 in Top500)
- 52 PiBytes disk (#4 in IO500)
Tape archive
- >200 PiBytes
- 5PiBytes disk cache
<<
example of spatio-temporal relations:
the prediction of fire occurrence, the stimation of burnt
area, and the trace of gas emissions depend on:
● instantaneous climatic drivers: temperature, humidity,...
● sources of ignition: humans, lightning,...
● state variables: available fuel,..
● moisture, terrain, wind speed and direction,..
Machine learning applications often do not directly and
exhaustively account for spatio-temporal correlations
Deep learning is a promising approach
Example: convolutional networks (spatial) + recurrent networks (memory, sequence learning)
Examples of Deep Learning applications in
Earth System Science
Slide courtesy from the author Markus Reichstein
Deep learning challenges
in Earth System science
● Diverse sources of noise → poor signal-to-noise ratio
Deep learning challenges
in Earth System science
● Diverse sources of noise → poor signal-to-noise ratio
● Inconsistencies
Deep learning challenges
in Earth System science
Fundamental
laws
of physics
energy and mass conservations,.... and we
must assure the deep learning models do not
allow for negative densities, precipitations,...
Noether's theorem explains why
conservation laws exists (wikipedia)
● Diverse sources of noise→ poor signal-to-noise ratio
● Inconsistencies → energy and mass conservations, density must be positive,...
● Extrapolation problem
Deep learning challenges
in Earth System science
Extrapolation
problem:
classification
Extrapolation
problem:
classification
the model should
show is not certain
about predicting in
undersampled
regions...
input data
Extrapolation
problem:
regression
Non-
stationary
system
Data shift or concept drift
● training data are not longer representative if the system has changed
● the accuracy of the trained model definitely decreased under data shift/concept drift
● Diverse sources of noise→ poor signal-to-noise ratio
● Inconsistencies → energy and mass conservations, density must be positive,...
● Extrapolation problem → system changes in time: data shift or concept drift
Deep learning challenges
in Earth System science
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
Images
Deep learning challenges
in Earth System science
what is the scale of this? →
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
Images
Deep learning challenges
in Earth System science
Scale invariant!
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
● No ImageNet → and difficult to have, example: labelling clouds
Images
Deep learning challenges
in Earth System science
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
● No ImageNet → and difficult to have, example: labelling clouds
● Missing data → a solution Christopher Kadow, the leader of DKRZ
machine learning research group, adapted the Nvidia Technology for
image inpainting
Deep learning challenges
in Earth System science
Images
‘ground-truth’
original data
masked data
missing values
reconstruction
by Deep Convolutional NN
Image inpainting to reconstruct temperature missing observations
Hybrid models
Physical
models
ML and DL
models
Physical
models
ML and DL
models
Lightweighting/simplifying/speeding up physical models
● improve parametrizations
● analysis of model-observations mismatch
● emulation
Physical
models
ML and DL
models
Domain knowledge can guide/optimize the pure data-driven methods
● design the architecture
● constrain the cost (or reward) function
● physically based data augmentation: expansion of the data set
for undersampled regions
Depth
(m)
Temp
(°C)
feature prediction
Example: lakes simulations to predict temperature from depth measurements
Physical model
example: Tempd+1
= Temp d
+ sun - wind - upwelling
given that we measured Td=surface
= 15°C
Depth
(m)
Temp
(°C)
feature prediction
Physical model
example: Tempd+1
= Temp d
+ sun - wind - upwelling
given that we measured Td=surface
= 15°C
Moderate model skills
and of course zero
inconsistency
Depth
(m)
Temp
(°C)
feature prediction
Neural Network
might allow negative densities!
_
Better model skills and
but the inconsistency
spreads
Depth
(m)
Density
(g/L)
Temp
(°C)
features prediction
DATA AUGMENTATION: to include
new features that comes from physical
knowledge and then NN
Depth
(m)
Density
(g/L)
Temp
(°C)
features prediction
DATA AUGMENTATION: to include
new features that comes from physical
knowledge and then NN
Even better model skills
and a bit less
inconsistency but it still
spreads
Depth
(m)
Density
(g/L)
Temp
(°C)
✓
X
features prediction
physical model + NN + constrain loss
function: denser water must be deeper
Depth
(m)
Density
(g/L)
Temp
(°C)
✓
X
features prediction
physical model + NN + constrain loss
function: denser water must be deeper
Totally consistent and
high model skills!
Great model
performance (~1°C less
error) and totally
consistent
References
● Earth System figure: https://karenbakker.org/the-climate-system/
● Data cube image: Earth Syst. Dynam., 11, 201–234 (2020) https://doi.org/10.5194/esd-11-201-2020
● Climate model image: A. Gettelman and R.B. Rood, Demystifying Climate Models, Earth Systems
Data and Models 2, doi: 10.1007/978-3-662-48959-8_5
● Multimodel figure: Michael Böttinger (DKRZ) and Joachem Marotzke (Max Planck Institute
Meteorology)
● Mistral picture: Michael Böttinger (DKRZ) .
● Wildfire picture: https://pixnio.com/miscellaneous/fire-flames-pictures/aerial-ignition-interior-high
-rates-of-spread-in-open-savannas
● Hockey stick IPCC https://www.ipcc.ch/report/ar3/wg1/ (Chapter 2)
● Scale invariant issue with chiguagua and dingo: Christian Staudt http://clstaudt.me
● Scale free image http://paulbourke.net/fractals/googleearth/
● Image inpainting:
○ Nvidia Technology https://www.nvidia.com/research/inpainting/
○ Kadow et al. (2020), Artificial Intelligence reconstructs missing Climate Information (in review)
● Physics-guided neural networks : https://arxiv.org/pdf/1710.11431.pdf and
https://towardsdatascience.com/physics-guided-neural-networks-pgnns-8fe9dbad9414
DKRZ Unit: Machine Learning as a Service
● Provide a knowledge base
● Bring prototypes to production
● Train, educate, and exchange
machinelearning-join@lists.dkrz.de
Summary of the main topics discussed in the kick-off workshop
● Applying machine learning to Earth System modelling
○ Hybrid approaches to (i) improve parametrizations and (ii) validate physical models
○ Increase the availability of training data via (i) data augmentation and (ii) labelling
○ Infer causality of the patterns found in observational data
● Technology
○ Support for Python, portation to GPUs, and larger memory
○ Machine learning libraries for NetDCF data handling
○ Adaptive learning integrated with physical models during running time on the HPC
○ Distributed training and execution
○ Portability between HPC centres
● Uncertainty and reproducibility
○ Performance metrics for (i) unsupervised learning and (ii) data shift/concept drift
○ Adoption of interpretable models and uncertainty quantification and explainability methods
○ Sharing of training scripts and training data or trained model
● Community activities and capacity building: workshops, summer schools,...
Artificial intelligence and machine learning
activities at DKRZ
EXTRASLIDE
Climate models
● physical models derived from first principles (mechanistic)
● used to simulate how the Earth’s climate changes in time (dynamical)
● written in the form of coupled differential equations
● solution depends on boundary and initial conditions
● run with different conditions allows to create scenarios (see for instance, RCP)
● solved numerically with long lasting parallel runs
● calibrated and validated against observational data
● results are called model data
EXTRASLIDE
year
RCP 2.6: the best
case scenario
historical
RPC 8.5:
business
as usualExample of a basic climate model
including time (t) and space (x):
Tempt+1,x+1
= Temp t,x
+ warmingt,x
- coolingt,x
given that Tempt=0
= 25°C
High Performance Computing Data Center
Ongoing efforts to reduce our carbon footprint:
● Power Usage Effectiveness ~1 (PUE = 1.19)
● Cold aisle containment reduce CO2
emissions by 20%
● Hot air is recycled for heating nearby facilities
● Cooling water is recycled in our toilets
● Greener energy supplier possible
BEFORE AFTER
EXTRASLIDE

Mais conteúdo relacionado

Mais procurados

Cloud of things (IoT + Cloud Computing)
Cloud of things (IoT + Cloud Computing)Cloud of things (IoT + Cloud Computing)
Cloud of things (IoT + Cloud Computing)Zakaria Hossain
 
Machine Learning and Internet of Things
Machine Learning and Internet of ThingsMachine Learning and Internet of Things
Machine Learning and Internet of ThingsSofian Hadiwijaya
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom ConceptsImage Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom Conceptsmmjalbiaty
 
Green computing 28thdec15_siddharth
Green computing 28thdec15_siddharthGreen computing 28thdec15_siddharth
Green computing 28thdec15_siddharthSiddharth Dwivedi
 
Analytical learning
Analytical learningAnalytical learning
Analytical learningswapnac12
 
Image degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem AshrafImage degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem AshrafMD Naseem Ashraf
 
Trends in computer architecture
Trends in computer architectureTrends in computer architecture
Trends in computer architecturemuhammedsalihabbas
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationVarun Ojha
 
Deep Learning techniques for boundaries detection
Deep Learning techniques for boundaries detectionDeep Learning techniques for boundaries detection
Deep Learning techniques for boundaries detectionplan4all
 
An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep LearningPoo Kuan Hoong
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 

Mais procurados (20)

Cloud of things (IoT + Cloud Computing)
Cloud of things (IoT + Cloud Computing)Cloud of things (IoT + Cloud Computing)
Cloud of things (IoT + Cloud Computing)
 
Machine Learning and Internet of Things
Machine Learning and Internet of ThingsMachine Learning and Internet of Things
Machine Learning and Internet of Things
 
Digital signal processor part 3
Digital signal processor part 3Digital signal processor part 3
Digital signal processor part 3
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom ConceptsImage Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom Concepts
 
U-Net (1).pptx
U-Net (1).pptxU-Net (1).pptx
U-Net (1).pptx
 
Green computing 28thdec15_siddharth
Green computing 28thdec15_siddharthGreen computing 28thdec15_siddharth
Green computing 28thdec15_siddharth
 
Analytical learning
Analytical learningAnalytical learning
Analytical learning
 
Unit ii
Unit iiUnit ii
Unit ii
 
Ada boost
Ada boostAda boost
Ada boost
 
Image degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem AshrafImage degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem Ashraf
 
Trends in computer architecture
Trends in computer architectureTrends in computer architecture
Trends in computer architecture
 
Deep learning
Deep learningDeep learning
Deep learning
 
Green computing ppt
Green computing pptGreen computing ppt
Green computing ppt
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier Transformation
 
Regularization
RegularizationRegularization
Regularization
 
Deep Learning techniques for boundaries detection
Deep Learning techniques for boundaries detectionDeep Learning techniques for boundaries detection
Deep Learning techniques for boundaries detection
 
An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep Learning
 
Deep learning
Deep learning Deep learning
Deep learning
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 
Neural network and fuzzy logic
Neural network and fuzzy logicNeural network and fuzzy logic
Neural network and fuzzy logic
 

Semelhante a Machine learning and climate and weather research

TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...Brad Evans
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...PyData
 
Deep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchDeep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchinside-BigData.com
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningNoah Brenowitz
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Frank Nielsen
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Learning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningLearning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningZachary Labe
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017SERC at Carleton College
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
How to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC HardwareHow to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC Hardwareinside-BigData.com
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Igor Sfiligoi
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Frank Wuerthwein
 
Near Exascale Computing in the Cloud
Near Exascale Computing in the CloudNear Exascale Computing in the Cloud
Near Exascale Computing in the CloudFrank Wuerthwein
 

Semelhante a Machine learning and climate and weather research (20)

CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
 
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
 
MAXSS & NVIDIA
MAXSS & NVIDIAMAXSS & NVIDIA
MAXSS & NVIDIA
 
Complex Models for Big Data
Complex Models for Big DataComplex Models for Big Data
Complex Models for Big Data
 
Deep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchDeep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorch
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine Learning
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Learning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningLearning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learning
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
Slide 1
Slide 1Slide 1
Slide 1
 
Slide 1
Slide 1Slide 1
Slide 1
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
How to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC HardwareHow to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC Hardware
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Near Exascale Computing in the Cloud
Near Exascale Computing in the CloudNear Exascale Computing in the Cloud
Near Exascale Computing in the Cloud
 

Último

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 

Último (20)

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 

Machine learning and climate and weather research

  • 1. Karsten Peters peters@dkrz.de Applying machine learning to address pressing issues of fundamental weather and climate research Maria Moreno de Castro moreno@dkrz.de
  • 2. the presentation follow this perspective article
  • 3. Organism Biome Region Landscape ECOSYSTEM Organ Cell Molecule GLOBE Complex Biology + Chemistry + Physics Unique The Earth System Slide courtesy from the author Markus Reichstein
  • 5. THE EARTH SYSTEM The behavior is dominated by spatial and temporal relations Main research focus: ○ seasonal meteorological predictions ○ forecasting extreme events: floods, fires,... ○ long term climate predictions
  • 6. It’s not like we haven’t got enough data at our hands… There’s observational data…
  • 7. The A-Train - 7 satellites flying in formation - Operating since ~18 years - Aqua collects about 89 GB of data/day Example of observational data collection by remote sensing
  • 8. It’s not like we haven’t got enough data at our hands… There’s observational data … ...and model data.
  • 9. Model data are the result of simulations generated by numerically solved differential equations derived from physical models by discretizing the Earth and representing key processes with parameterizations
  • 10. Repeat for every model timestep and for every point of the globe Model data Calculate physical processes Apply boundary conditions
  • 12. Climate Models at the 1km scale are coming up ~650 GB of data per output time step
  • 13. << Mistral High Performace Computer - 6 years old (new in 2020) - 3.9 PFlops (#80 in Top500) - 52 PiBytes disk (#4 in IO500) Tape archive - >200 PiBytes - 5PiBytes disk cache
  • 14. <<
  • 15. example of spatio-temporal relations: the prediction of fire occurrence, the stimation of burnt area, and the trace of gas emissions depend on: ● instantaneous climatic drivers: temperature, humidity,... ● sources of ignition: humans, lightning,... ● state variables: available fuel,.. ● moisture, terrain, wind speed and direction,..
  • 16. Machine learning applications often do not directly and exhaustively account for spatio-temporal correlations Deep learning is a promising approach Example: convolutional networks (spatial) + recurrent networks (memory, sequence learning)
  • 17. Examples of Deep Learning applications in Earth System Science Slide courtesy from the author Markus Reichstein
  • 18. Deep learning challenges in Earth System science
  • 19. ● Diverse sources of noise → poor signal-to-noise ratio Deep learning challenges in Earth System science
  • 20. ● Diverse sources of noise → poor signal-to-noise ratio ● Inconsistencies Deep learning challenges in Earth System science
  • 21. Fundamental laws of physics energy and mass conservations,.... and we must assure the deep learning models do not allow for negative densities, precipitations,... Noether's theorem explains why conservation laws exists (wikipedia)
  • 22. ● Diverse sources of noise→ poor signal-to-noise ratio ● Inconsistencies → energy and mass conservations, density must be positive,... ● Extrapolation problem Deep learning challenges in Earth System science
  • 24. Extrapolation problem: classification the model should show is not certain about predicting in undersampled regions...
  • 26.
  • 27. Non- stationary system Data shift or concept drift ● training data are not longer representative if the system has changed ● the accuracy of the trained model definitely decreased under data shift/concept drift
  • 28. ● Diverse sources of noise→ poor signal-to-noise ratio ● Inconsistencies → energy and mass conservations, density must be positive,... ● Extrapolation problem → system changes in time: data shift or concept drift Deep learning challenges in Earth System science
  • 29. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales Images Deep learning challenges in Earth System science
  • 30. what is the scale of this? → ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features Images Deep learning challenges in Earth System science
  • 32. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features ● No ImageNet → and difficult to have, example: labelling clouds Images Deep learning challenges in Earth System science
  • 33. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features ● No ImageNet → and difficult to have, example: labelling clouds ● Missing data → a solution Christopher Kadow, the leader of DKRZ machine learning research group, adapted the Nvidia Technology for image inpainting Deep learning challenges in Earth System science Images
  • 34.
  • 35. ‘ground-truth’ original data masked data missing values reconstruction by Deep Convolutional NN Image inpainting to reconstruct temperature missing observations
  • 37. Physical models ML and DL models Lightweighting/simplifying/speeding up physical models ● improve parametrizations ● analysis of model-observations mismatch ● emulation
  • 38. Physical models ML and DL models Domain knowledge can guide/optimize the pure data-driven methods ● design the architecture ● constrain the cost (or reward) function ● physically based data augmentation: expansion of the data set for undersampled regions
  • 39. Depth (m) Temp (°C) feature prediction Example: lakes simulations to predict temperature from depth measurements Physical model example: Tempd+1 = Temp d + sun - wind - upwelling given that we measured Td=surface = 15°C
  • 40. Depth (m) Temp (°C) feature prediction Physical model example: Tempd+1 = Temp d + sun - wind - upwelling given that we measured Td=surface = 15°C Moderate model skills and of course zero inconsistency
  • 41. Depth (m) Temp (°C) feature prediction Neural Network might allow negative densities! _ Better model skills and but the inconsistency spreads
  • 42. Depth (m) Density (g/L) Temp (°C) features prediction DATA AUGMENTATION: to include new features that comes from physical knowledge and then NN
  • 43. Depth (m) Density (g/L) Temp (°C) features prediction DATA AUGMENTATION: to include new features that comes from physical knowledge and then NN Even better model skills and a bit less inconsistency but it still spreads
  • 44. Depth (m) Density (g/L) Temp (°C) ✓ X features prediction physical model + NN + constrain loss function: denser water must be deeper
  • 45. Depth (m) Density (g/L) Temp (°C) ✓ X features prediction physical model + NN + constrain loss function: denser water must be deeper Totally consistent and high model skills! Great model performance (~1°C less error) and totally consistent
  • 46. References ● Earth System figure: https://karenbakker.org/the-climate-system/ ● Data cube image: Earth Syst. Dynam., 11, 201–234 (2020) https://doi.org/10.5194/esd-11-201-2020 ● Climate model image: A. Gettelman and R.B. Rood, Demystifying Climate Models, Earth Systems Data and Models 2, doi: 10.1007/978-3-662-48959-8_5 ● Multimodel figure: Michael Böttinger (DKRZ) and Joachem Marotzke (Max Planck Institute Meteorology) ● Mistral picture: Michael Böttinger (DKRZ) . ● Wildfire picture: https://pixnio.com/miscellaneous/fire-flames-pictures/aerial-ignition-interior-high -rates-of-spread-in-open-savannas ● Hockey stick IPCC https://www.ipcc.ch/report/ar3/wg1/ (Chapter 2) ● Scale invariant issue with chiguagua and dingo: Christian Staudt http://clstaudt.me ● Scale free image http://paulbourke.net/fractals/googleearth/ ● Image inpainting: ○ Nvidia Technology https://www.nvidia.com/research/inpainting/ ○ Kadow et al. (2020), Artificial Intelligence reconstructs missing Climate Information (in review) ● Physics-guided neural networks : https://arxiv.org/pdf/1710.11431.pdf and https://towardsdatascience.com/physics-guided-neural-networks-pgnns-8fe9dbad9414
  • 47. DKRZ Unit: Machine Learning as a Service ● Provide a knowledge base ● Bring prototypes to production ● Train, educate, and exchange
  • 49. Summary of the main topics discussed in the kick-off workshop ● Applying machine learning to Earth System modelling ○ Hybrid approaches to (i) improve parametrizations and (ii) validate physical models ○ Increase the availability of training data via (i) data augmentation and (ii) labelling ○ Infer causality of the patterns found in observational data ● Technology ○ Support for Python, portation to GPUs, and larger memory ○ Machine learning libraries for NetDCF data handling ○ Adaptive learning integrated with physical models during running time on the HPC ○ Distributed training and execution ○ Portability between HPC centres ● Uncertainty and reproducibility ○ Performance metrics for (i) unsupervised learning and (ii) data shift/concept drift ○ Adoption of interpretable models and uncertainty quantification and explainability methods ○ Sharing of training scripts and training data or trained model ● Community activities and capacity building: workshops, summer schools,... Artificial intelligence and machine learning activities at DKRZ EXTRASLIDE
  • 50. Climate models ● physical models derived from first principles (mechanistic) ● used to simulate how the Earth’s climate changes in time (dynamical) ● written in the form of coupled differential equations ● solution depends on boundary and initial conditions ● run with different conditions allows to create scenarios (see for instance, RCP) ● solved numerically with long lasting parallel runs ● calibrated and validated against observational data ● results are called model data EXTRASLIDE year RCP 2.6: the best case scenario historical RPC 8.5: business as usualExample of a basic climate model including time (t) and space (x): Tempt+1,x+1 = Temp t,x + warmingt,x - coolingt,x given that Tempt=0 = 25°C
  • 51. High Performance Computing Data Center Ongoing efforts to reduce our carbon footprint: ● Power Usage Effectiveness ~1 (PUE = 1.19) ● Cold aisle containment reduce CO2 emissions by 20% ● Hot air is recycled for heating nearby facilities ● Cooling water is recycled in our toilets ● Greener energy supplier possible BEFORE AFTER EXTRASLIDE