This document discusses deploying deep learning models in the ONNX (Open Neural Network Exchange) format with the Model Server for Apache MXNet (MMS). It describes how ONNX serves as a common interchange format that can be exported from frameworks such as PyTorch, Caffe2, and CNTK and imported into MXNet for inference. It also outlines how MMS provides an HTTP inference API and Docker-based containerization for serving models, and how it can be deployed in serverless environments on AWS Fargate for scalable inference.