© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thomas Delteil, Applied Scientist @ AWS Deep Engine
APJCTech Summit 2018, Macau
Debugging MXNet Gluon models
And other performance tricks
Remote debugging with PyCharm
Visualizing deep learning
Performance tricks and gotchas
Apache MXNet history
• 2015: started as a project of PhD students at CMU and the Distributed Machine Learning Community (DMLC)
• 2017: the MXNet Gluon imperative API is released
Tianqi Chen (UW)
Mu Li (Amazon AI)
Yutian Li (Stanford)
Min Lin (MILA)
Naiyan Wang (TuSimple)
Minjie Wang (NYU CS)
Tianjun Xiao (Tesla)
Bing Xu (Apple AI)
Chiyuan Zhang (Google Brain)
Zheng Zhang (MSR Asia)
Imperative vs Symbolic computational graphs
Symbolic: define, compile, run
Imperative: define-by-run in the host language
(Figure: Inception model)
Imperative > Symbolic
Debuggable
Fast to prototype
Hybridizable
Interactive Debugging
Shapes
Values
Gradients
YouTube tutorial
Visualizing Deep Learning
Network Architecture
MXNet native code (#1) print(net)
MXNet native code (#2) mx.viz.plot_network(sym)
MXNet native code (#3) mx.viz.print_summary(sym)
Netron (online tool)
MXBoard sw.add_graph(net)
System performance
GPU: gpu_monitor (github)
CPU / RAM: > top i
Training metrics
MXBoard
MXBoard Scalars
MXBoard Images
Console
Performance
Tips and tricks
(Chart: training throughput and GPU utilization. Baseline: 130 samples/sec. Speedups shown: 1.25x, 2.41x, 2.46x, 2.53x, 3.84x.)
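To make the chart figures concrete, the sketch below converts each multiplier into an absolute throughput. It assumes every multiplier is relative to the 130 samples/sec baseline. The labels come from the later slides, except that pairing 2.53x with batch size is an assumption (the batch size slide itself says 2.56x).

```python
BASELINE = 130.0  # samples/sec, from the chart

# (label, multiplier over the baseline).
# Assumption: 2.53x on the chart is the batch size step.
SPEEDUPS = [
    ("async prefetching", 1.25),
    ("hybridize", 2.41),
    ("smart synchronization", 2.46),
    ("larger batch size", 2.53),
    ("float16", 3.84),
]


def throughput(multiplier, baseline=BASELINE):
    """Absolute throughput in samples/sec for a given multiplier."""
    return round(baseline * multiplier, 1)


for label, multiplier in SPEEDUPS:
    print(f"{label:<22} {multiplier:>4.2f}x  {throughput(multiplier):>6.1f} samples/sec")
```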
Environment
mxnet-mkl (32x) vs mxnet
I/O Bound
→ GPU Starvation
#1 Asynchronously pre-fetching data (low CPU) (1.25x)
DataLoader(num_workers=CPU_COUNT-3)
#2 Offline preprocessing (full CPU)
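A minimal sketch of tip #1, assuming `CPU_COUNT` on the slide means the number of logical cores. The helper names `pick_num_workers` and `build_loader` are hypothetical, and `batch_size=64` is an arbitrary choice.

```python
import multiprocessing


def pick_num_workers(reserved_cores=3):
    """Leave a few cores free for the training process; never go below 0."""
    return max(0, multiprocessing.cpu_count() - reserved_cores)


def build_loader(dataset, batch_size=64):
    """Wrap a Gluon dataset in a DataLoader that prefetches in worker processes."""
    # Imported here so the helper above can be used without MXNet installed.
    from mxnet.gluon.data import DataLoader

    return DataLoader(
        dataset,
        batch_size=batch_size,
        shuffle=True,
        num_workers=pick_num_workers(),
    )
```

With `num_workers=0` the loader falls back to loading batches in the main process, so the helper stays safe on machines with three cores or fewer.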
GPU → CPU memcopy synchronization idling
#3 Smart synchronization calls (2.46x)
→ Small networks
(Diagram: timeline of successive batches, each going through the stages Copy to GPU, Forward/Backward, Metric.)
Execution engine
Imperative → Symbolic (2.41x)
net.hybridize()
Hyperparameters
Batch size (2.56x)
Optimizer
Performance: time to accuracy
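Time to accuracy can be read off a training log as the first moment validation accuracy reaches a target. The sketch below assumes a log of `(elapsed_seconds, accuracy)` pairs. The function name and the two logs are hypothetical, made-up numbers for illustration only.

```python
def time_to_accuracy(history, target):
    """Return the elapsed time at which accuracy first reaches target.

    history: iterable of (elapsed_seconds, accuracy) in chronological order.
    Returns None if the target is never reached.
    """
    for elapsed, accuracy in history:
        if accuracy >= target:
            return elapsed
    return None


# Hypothetical logs for two optimizer settings (illustrative values only).
run_a = [(60, 0.71), (120, 0.82), (180, 0.88), (240, 0.91)]
run_b = [(65, 0.80), (130, 0.90), (195, 0.92)]

print(time_to_accuracy(run_a, 0.90))
print(time_to_accuracy(run_b, 0.90))
```

Comparing runs this way favors the setting that reaches the target soonest in wall-clock time, even if its per-epoch throughput is lower.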
Mixed precision training
float32 → float16 (3.84x)
net.cast("float16")
Profiling
profiler.set_state('run')
…
profiler.set_state('stop')
Conclusion
- Use Gluon to debug and iterate quickly
- Hybridize and optimize for speed
- Know your model: Visualize performance
Thank you!
Follow-up:
tdelteil@
Github.com/thomasdelteil
AWS Deep Engine, Vancouver

Editor's notes

  1. Data loading issues, NaN values, loss exploding suddenly
  2. Explain SSH tunnel and TensorBoard
  3. 22M$ GPU