© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Developing Large Scale Machine Learning Algorithms
on Amazon SageMaker
Amir Sadoughi
Senior Software Engineer

Amazon AI Labs
The Amazon ML stack: Broadest & deepest set of capabilities
AI services
• Categories: Vision, Speech, Chatbots, Language, Forecasting, Recommendations
• Services: Rekognition Image, Rekognition Video, Polly, Transcribe, Translate, Lex, Textract, Comprehend & Comprehend Medical, Forecast, Personalize
ML services
• Amazon SageMaker: Ground Truth, Notebooks, AWS Marketplace, Algorithms, Reinforcement Learning, Training, Optimization (Neo), Deployment, Hosting
ML frameworks & infrastructure
• Frameworks, Interfaces, Infrastructure
• EC2 P3 & P3dn, EC2 C5, FPGAs, Greengrass, Elastic Inference
Machine Learning
Amazon SageMaker
Amazon SageMaker provides every developer and data scientist with the ability to build, train, and deploy machine learning models quickly.
The Amazon ML stack: Broadest & deepest set of capabilities
Amazon SageMaker: Algorithms
• Built-in algorithms
  • NLP
  • Computer Vision
  • Supervised
  • Unsupervised
• AWS Marketplace for Machine Learning
• Bring Your Own Algorithm
Algorithm development lifecycle
• Interface design
• System design
• Testing
• Communications
Interface design
System design
• Storage
• Compute
• Network
System design: Storage
• Tiers: Amazon S3, Amazon EBS, GPU mem., CPU mem., CPU cache
• Access patterns
• Capacities: Throughput, Latency
System design: Storage
File mode
• easy to implement
• faster for many epochs
• initial download time
• increased size for data disk space
• maxes out at 16 TB
Pipe mode
• harder to implement
• faster for single pass
• downloads each epoch
• sizing only for model disk space
• no limit on length
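The mode is chosen per training job. As a minimal sketch, assuming the `TrainingInputMode` field of the SageMaker `CreateTrainingJob` request, the input-related fragment of such a request could be built as follows. The helper `input_mode_config`, the selection rule, and the example bucket are hypothetical illustrations of the trade-offs above, not the service's own logic, and nothing here calls AWS.

```python
# Hypothetical helper: builds only the input-related fragment of a
# CreateTrainingJob request. A real request needs more fields
# (training image, role, output location, resources, ...).
FILE_MODE_LIMIT_TB = 16  # File mode limit quoted on the slide


def input_mode_config(s3_uri, dataset_tb, epochs):
    """Pick File or Pipe mode from the trade-offs listed above."""
    # Pipe mode streams the data on every epoch, so it suits single-pass
    # training or datasets too large to download; File mode pays a
    # one-time download and then reads from local disk.
    use_pipe = dataset_tb > FILE_MODE_LIMIT_TB or epochs == 1
    mode = "Pipe" if use_pipe else "File"
    return {
        "AlgorithmSpecification": {"TrainingInputMode": mode},
        "InputDataConfig": [
            {
                "ChannelName": "train",
                "DataSource": {
                    "S3DataSource": {
                        "S3DataType": "S3Prefix",
                        "S3Uri": s3_uri,
                        "S3DataDistributionType": "FullyReplicated",
                    }
                },
            }
        ],
    }


# Example bucket name is a placeholder.
config = input_mode_config("s3://example-bucket/train/", dataset_tb=2, epochs=50)
```

In practice the decision also depends on whether the algorithm's data reader can consume a stream at all, which is the "harder to implement" cost of Pipe mode.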
System design: Compute
• CPU
• GPU
• Multi-GPU
• Elastic inference
• Mobile
• IoT
System design: Network
• Training: Single machine or distributed across many machines
• Inference: number of concurrent requests, size of payload
• Throughput
• Latency
• Jitter
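The three network quantities can be estimated from recorded request timings. The sketch below is one possible summary, assuming sequential requests with a fixed payload size; the function name `latency_summary` is hypothetical, and jitter is taken here as the standard deviation of latency, which is only one of several common definitions.

```python
import statistics


def latency_summary(latencies_ms, payload_bytes):
    """Summarize request latencies (in ms) for payloads of a fixed size."""
    ordered = sorted(latencies_ms)
    # Nearest-rank 99th percentile, computed with integer arithmetic:
    # index = ceil(0.99 * n) - 1.
    p99_index = max(0, -(-99 * len(ordered) // 100) - 1)
    total_seconds = sum(latencies_ms) / 1000.0
    return {
        "mean_ms": statistics.mean(latencies_ms),
        "p99_ms": ordered[p99_index],
        # Assumption: jitter = population standard deviation of latency.
        "jitter_ms": statistics.pstdev(latencies_ms),
        # Valid only for sequential (non-concurrent) requests.
        "throughput_bytes_per_s": payload_bytes * len(latencies_ms) / total_seconds,
    }


summary = latency_summary([10.0, 20.0], payload_bytes=1000)
```

With concurrent requests, throughput would instead be measured from wall-clock time over the whole run rather than from the sum of per-request latencies.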
Testing: traditional testing
• Unit tests
• Functional tests
• Integration tests
• Load testing
Testing: benchmarking
• Measure time, cost, accuracy
• DAWNBench
• Training: end-to-end throughput
• Inference: end-to-end latency
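A small harness can capture the time and cost sides of such a benchmark. This is a sketch under stated assumptions: `benchmark` and its parameters are hypothetical names, the hourly price is supplied by the caller, and accuracy is left out because it has to be measured separately on held-out data.

```python
import time


def benchmark(fn, batches, hourly_price_usd):
    """Time fn over batches; report throughput, mean latency, and cost."""
    latencies = []
    records = 0
    start = time.perf_counter()
    for batch in batches:
        t0 = time.perf_counter()
        fn(batch)  # one training step or one inference call
        latencies.append(time.perf_counter() - t0)
        records += len(batch)
    elapsed = time.perf_counter() - start
    return {
        "elapsed_s": elapsed,
        # End-to-end throughput: records processed per wall-clock second.
        "records_per_s": records / elapsed if elapsed > 0 else float("inf"),
        # End-to-end latency: mean time per call.
        "mean_latency_s": sum(latencies) / len(latencies),
        # Cost of the run at the given instance price.
        "cost_usd": hourly_price_usd * elapsed / 3600.0,
    }


# Stand-in workload: summing each batch; the price is a placeholder.
report = benchmark(sum, [[1, 2, 3], [4, 5]], hourly_price_usd=3.0)
```

Reporting all three numbers together matters because a change that improves throughput can raise cost or lower accuracy.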
System design: performance optimizations
• Beware of the tradeoffs
• Training
  • Low or mixed precision
  • Increase batch size
  • Optimize communication between workers
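The precision trade-off can be made concrete with a toy example. The sketch below uses the standard library's half-precision `struct` format to show why mixed-precision schemes commonly keep a higher-precision master copy of the weights: small updates applied directly to a half-precision value can be rounded away. The function names are hypothetical, and a Python float stands in for the higher-precision copy.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def apply_updates(weight, update, steps, keep_master_copy):
    """Apply the same small update repeatedly, with or without a master copy."""
    master = weight        # higher-precision copy (a Python float here)
    w16 = to_fp16(weight)  # half-precision working copy
    for _ in range(steps):
        if keep_master_copy:
            master += update             # accumulate in higher precision
            w16 = to_fp16(master)        # cast down for compute
        else:
            w16 = to_fp16(w16 + update)  # accumulate directly in fp16
    return w16


fp16_only = apply_updates(1.0, 1e-4, 1000, keep_master_copy=False)
mixed = apply_updates(1.0, 1e-4, 1000, keep_master_copy=True)
```

Here `fp16_only` never moves from 1.0, because the gap between adjacent half-precision values near 1.0 is about 0.001 and each 0.0001 update is rounded away, while `mixed` ends close to 1.1.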
System design: performance optimizations
• Inference
  • Caching
  • Queueing
  • Low precision
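Caching is the simplest of these to illustrate. The following sketch wraps a stand-in model call with the standard library's `functools.lru_cache`; `run_model` and `cached_predict` are hypothetical names, and the averaging "model" is a placeholder for an expensive invocation.

```python
from functools import lru_cache

calls = {"model": 0}  # counts how often the stand-in model really runs


def run_model(features):
    """Hypothetical stand-in for an expensive model invocation."""
    calls["model"] += 1
    return sum(features) / len(features)


@lru_cache(maxsize=1024)
def cached_predict(features):
    """Serve repeated requests from memory; features must be hashable."""
    return run_model(features)


first = cached_predict((1.0, 2.0, 3.0))
second = cached_predict((1.0, 2.0, 3.0))  # answered from the cache
```

The trade-off is memory use and staleness: cached answers must be invalidated (for example with `cached_predict.cache_clear()`) whenever the model behind the endpoint changes.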
System design: EMA example
CreateAlgorithm: EMA example
AWS Marketplace for Machine Learning
Training data
Training data: train/test split
Hyperparameter tuning job: EMA example
The Amazon ML stack: Broadest & deepest set of capabilities
Thank you!
Resources
• SageMaker Product Page
• SageMaker Console
• Ground Truth Product Page
• Neo Product Page
• SageMaker RL Documentation
• SageMaker 10-Minute Tutorial
• SageMaker Related Blogs
• Ground Truth Webinar (Dec 2018)

