1.
Deep Learning for Fast Simulation
HNSciCloud M-PIL-3.2 meeting
June 2018
S. Vallecorsa, F. Carminati, G. Khattak
2.
Our objective
• Activities are ongoing to speed up Monte Carlo techniques
• These are not enough to cope with the expected HL-LHC needs
• Current fast-simulation solutions are detector-dependent
• Goal: a general fast-simulation tool based on Machine Learning / Deep Learning
• Optimising training time becomes crucial
→ Improved, efficient and accurate fast simulation
3.
Requirements
• Precise simulation results
• A detailed validation process
• A fast inference step
• A generic, customizable tool
• An easy-to-use and easily extensible framework
• Large hyper-parameter scans and meta-optimisation:
  • Training time under control
  • Scalability
  • Possibility to work across platforms
4.
Generative adversarial networks (arXiv:1406.2661)
Simultaneously train two networks that compete and cooperate with each other:
• The generator G generates data from random noise
• The discriminator D learns how to distinguish real data from generated data
The (blind) counterfeiter/detective analogy:
• The counterfeiter shows a Mona Lisa
• The detective says it is fake and gives feedback
• The counterfeiter makes a new Mona Lisa based on the feedback
• Iterate until the detective is fooled
(Image source: https://arxiv.org/pdf/1701.00160v1.pdf)
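The adversarial game described above corresponds to the minimax objective of the original GAN paper (arXiv:1406.2661), where D is trained to maximise and G to minimise the value function:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_\mathrm{data}(x)}\!\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_z(z)}\!\left[\log\bigl(1 - D(G(z))\bigr)\right]
```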
5.
Generated images
Interpret the detector output as a 3D image:
• A 3D convolutional GAN generates realistic detector output
• Customized architecture (includes auxiliary regression tasks)
• Agreement with standard Monte Carlo in terms of physics is remarkable!
[Figures: GAN-generated electron shower; average shower section; Y moment (width); energy fraction measured by the calorimeter]
Run on the Caltech iBanks GPU cluster, thanks to Prof. M. Spiropulu
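To illustrate how an auxiliary regression task can be folded into the discriminator objective, here is a minimal sketch of a combined loss. The loss weight `lambda_e` and the energy-regression target are illustrative assumptions, not the published 3DGAN configuration:

```python
import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-7):
    # Adversarial term: real/fake classification loss
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1 - eps)
    y_true = np.asarray(y_true, dtype=float)
    return float(np.mean(-y_true * np.log(y_pred)
                         - (1 - y_true) * np.log(1 - y_pred)))

def mean_squared_error(y_true, y_pred):
    # Auxiliary regression term (e.g. the primary-particle energy)
    return float(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

def combined_loss(rf_true, rf_pred, e_true, e_pred, lambda_e=0.1):
    # Total discriminator loss: adversarial + weighted auxiliary regression
    return (binary_cross_entropy(rf_true, rf_pred)
            + lambda_e * mean_squared_error(e_true, e_pred))

# Perfect predictions drive both terms to (nearly) zero
loss = combined_loss([1.0, 0.0], [1.0, 0.0], [100.0], [100.0])
```

In a multi-task setup like this, the discriminator is pushed to learn features that are useful both for telling real from generated showers and for reconstructing physics quantities, which constrains the generator toward physically meaningful output.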
6.
Computing performance
Inference: Monte Carlo 17 s/particle vs. 3DGAN 7 ms/particle
→ a speedup factor of ~2400 on CPU!
Training: 45 min/epoch on an NVIDIA P100
→ distributed training is needed
Data-parallel training is introduced using mpi-learn
(Elastic Averaging Stochastic Gradient Descent)
Strong scaling measured at the CSCS Swiss National Supercomputing Centre (J-R. Vlimant)
Calorimeter energy response: the GAN prediction stays stable through 20 nodes!
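The elastic-averaging scheme can be sketched as follows: each worker takes local SGD steps while being pulled elastically toward a shared center variable, which in turn drifts toward the workers. This is a single-process toy simulation with assumed hyper-parameters (`eta`, `rho`, a quadratic loss), not the actual mpi-learn implementation, which runs the workers under MPI:

```python
import numpy as np

rng = np.random.default_rng(0)

def grad(x, data):
    # Gradient of the per-worker quadratic loss ||x - mean(shard)||^2
    return 2.0 * (x - data.mean(axis=0))

# Four workers, each holding its own data shard; the optimum of the
# combined loss is the global mean over all shards
shards = [rng.normal(size=(256, 4)) for _ in range(4)]

eta, rho = 0.05, 1.0   # learning rate and elastic coefficient (assumed values)
alpha = eta * rho      # elastic "moving rate" coupling workers and center
workers = [rng.normal(size=4) for _ in shards]
center = np.zeros(4)   # shared center variable (the averaged model)

for _ in range(300):
    new_center = center.copy()
    for i, data in enumerate(shards):
        elastic = workers[i] - center
        # Local SGD step plus an elastic pull toward the center variable
        workers[i] = workers[i] - eta * grad(workers[i], data) - alpha * elastic
        # The center moves toward each worker's (previous) parameters
        new_center = new_center + alpha * elastic
    center = new_center
```

The elastic coupling lets workers explore away from the center between synchronisations, which reduces communication pressure compared with synchronous averaging after every step.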
Time to create an electron shower:

Method                       Machine                    Time/shower (ms)
Full simulation (Geant4)     Intel Xeon Platinum 8180   17000
3D GAN (batch size 128)      Intel Xeon Platinum 8180   7
3D GAN (batch size 128)      NVIDIA P100                0.04
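The speedup factors follow directly from the timings in the table (all numbers taken from the slide itself):

```python
# Times from the table above, in ms per electron shower
full_sim_cpu = 17000.0   # Geant4 full simulation on Xeon Platinum 8180
gan_cpu = 7.0            # 3D GAN (batch size 128) on the same CPU
gan_gpu = 0.04           # 3D GAN (batch size 128) on a P100 GPU

cpu_speedup = full_sim_cpu / gan_cpu   # roughly 2400x on CPU
gpu_speedup = full_sim_cpu / gan_gpu   # roughly 425000x on GPU
```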
7.
DL with the HNSciCloud
First tests during the prototype phase (2017):
• Single-GPU training benchmark (RHEA, T-Systems, IBM)
• P100 (RHEA - Exoscale) vs K80 (IBM)
Current tests:
• MPI-based distributed training (ssh/TCP)
• Local input storage
• Single GPU per node
• 2× P100 on T-Systems
• Comparison to an HPC environment (CSCS)
• Trials with HTCondor on the Exoscale cloud (5 VMs), still under investigation
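For reference, a single-GPU training job under HTCondor can be described with a minimal submit file along these lines (the file name and training script are hypothetical; the actual HNSciCloud configuration is not shown in the slides):

```
# train_3dgan.sub - hypothetical submit description for one GPU training job
executable   = train_3dgan.sh
output       = train_$(Cluster).out
error        = train_$(Cluster).err
log          = train_$(Cluster).log
request_gpus = 1
request_cpus = 4
queue
```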
8.
Next steps
Continue with tests/optimisation:
• Schedulers (SLURM)
• Input storage options
• GPU/node configuration
• Possibility to combine GPUs from different resources
Additional GPUs are needed
First results are very promising