Pipelines for model deployment

•

3 gostaram•311 visualizações

Ramon Navarro

Onna presentation about how we deploy static models and dynamic models on production.

Dados e análise

Pipelines for model deployment
Ramon Navarro Bosch, CTO Onna

What is
• Startup Barcelona / San
Francisco
• SaaS / On-Premise
• Connect sources of
information, gather, analyze
and offer a UI to search for
information
• Knowledge Management and
apps for eDiscovery and
contract management
• API centric solution

“41 components architecture distributed with
kubernetes (k8s) and deployed with jenkins
CI/CD”

Static models
• Calculate a model ofﬂine
• SVM / CNN / RF / RNN /
LSTM / Summary / …
• Mostly tensorﬂow, sklearn

Static models
Dataset
Algorithm
Test
Pack
Deploy
dev
stage
prod
tf…
Train

Static models
• Detect interesting content type
• Get dataset - apply generic feature extraction
• Train model … for example: SVM (binary) - versions of the
models as contained ﬁles
• Embed in processing container jenkinsﬁle / dockerﬁle
• Build container and push version to registry
• Adding to production stack
• Continuous build on each commit

Arquitecture - k8s
Canonical REST API
Guillotina
Processing engine
…
Account namespace Shared namespace

Queue system
New
document
Pipeline RabbitMQ
Feature
Extraction
Beat Beat
Write
Each component redirect messages processed to next
exchange
Director Exit

• Testing accuracy on production processing new messages.
Plug to queue and don’t ack messages to get reprocessed
• Scale the # of pods/beats for each one based on the size of the
queue
• Dead letter queues for not well processed
• Delay queues for incremental processing
• Experimental beats
• REST API to our Beats engine to extract information about a
resource
Static models

“We allow users to create their own models”

Guillotina
• Open Source scalable resource management framework (data
plaform)
• Provides an extensible Transactional Traversal REST API with
distributed DB support
• Event triggering
• Security model
• All async
• PyData talk 21/5

Dynamic models
• Distributed continuous training
• Serve multi model live
• Model storage
• Direct access to distributed
data
• Inference and train

Model+Meta
Worker ParameterServer Serving
Algorithm
DocumentsLabels Metadata
EmbeddingVocabulary

Simpliﬁed example
• User deﬁnes labels - ML - BUSINESS
• Asks to train a classiﬁcation model based on
logistic regression
• A model is saved on the DB (SavedModel)
• User deﬁnes a Rule to apply the model (tf serving) -
each time I have a new Mail from Gmail

tf.serving pods (c++)
• gRPC from Canonical to inference with mini batch /
model spec
• SourceAdapter (Loader) connector for Canonical
model loading (versions handling of the model)
• ServableAdapter with sharing vocabulary and
multiple loaded models (multiple models on a
serving component)
• k8s scaling and monitoring

Dynamic distributed ﬂow
• Allocate variables and workers pods dynamic with k8s python api (there are
limits)
• Packs an Experiment and the Estimator (with Keras model or direct TF)
• input_fn is a feed generator that gets the data from the web socket API to
tfRecord (Guillotina model to Protobuff model)
• Re-train with new documents loading the model last check
• Write the model by the main worker on the canonical
• Another pod runs the validation on the saved canonical for each checkpoint
• Dev can runs the tensorboard to the canonical model endpoint

Canonical Ecosystem
Serving TrainingTrainingTrainingTrainingTraining

Preguntes ?
Gràcies!
Ramon Navarro Bosch
ramon@onna.com

Mais conteúdo relacionado

Mais procurados

Kafka in the Enterprise—A Two-Year Journey to Build a Data Streaming Platform...confluent

Flink Forward San Francisco 2019: Building production Flink jobs with Airstre...Flink Forward

Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, VectorizedHostedbyConfluent

Vyacheslav Zholudev – Flink, a Convenient Abstraction Layer for Yarn?Flink Forward

Javantura v4 - Getting started with Apache Spark - Dinko SrkočHUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association

Bay Area Apache Flink Meetup Community Update August 2015Henry Saputra

It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, ShopifyHostedbyConfluent

Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...Flink Forward

InfluxDB 2.0 Client Libraries by Noah CrowleyInfluxData

Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...Databricks

A Walkthrough of InfluxCloud 2.0 by Tim HallInfluxData

Streaming your Lyft Ride Prices - Flink Forward SF 2019Thomas Weise

Kafka Streams - From the Ground Up to the CloudVMware Tanzu

FlinkML - Big data application meetupTheodoros Vasiloudis

Spark Summit EU talk by Sol Ackerman and Franklyn D'souzaSpark Summit

Tensorflow data preparation on Apache Beam using Portable Flink Runner, Ankur...Bowen Li

InfluxEnterprise Architecture Patterns by Tim Hall & Sam DillardInfluxData

Apache Beam @ GCPUG.TW Flink.TW 20161006Randy Huang

Setting Up InfluxDB for IoT by David G SimmonsInfluxData

A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH Flink Forward

Mais procurados (20)

Kafka in the Enterprise—A Two-Year Journey to Build a Data Streaming Platform...

Flink Forward San Francisco 2019: Building production Flink jobs with Airstre...

Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized

Vyacheslav Zholudev – Flink, a Convenient Abstraction Layer for Yarn?

Javantura v4 - Getting started with Apache Spark - Dinko Srkoč

Bay Area Apache Flink Meetup Community Update August 2015

It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, Shopify

Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...

InfluxDB 2.0 Client Libraries by Noah Crowley

Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...

A Walkthrough of InfluxCloud 2.0 by Tim Hall

Streaming your Lyft Ride Prices - Flink Forward SF 2019

Kafka Streams - From the Ground Up to the Cloud

FlinkML - Big data application meetup

Spark Summit EU talk by Sol Ackerman and Franklyn D'souza

Tensorflow data preparation on Apache Beam using Portable Flink Runner, Ankur...

InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard

Apache Beam @ GCPUG.TW Flink.TW 20161006

Setting Up InfluxDB for IoT by David G Simmons

A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH

Semelhante a Pipelines for model deployment

Containerized architectures for deep learningAntje Barth

Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsLightbend

Distributed & Highly Available server applications in Java and ScalaMax Alexejev

Event-Driven Model Serving: Stream Processing vs. RPC with Kafka and TensorFl...confluent

Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...Kai Wähner

Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...Guido Schmutz

TechTalk: Connext DDS 5.2.Real-Time Innovations (RTI)

Operationalizing Machine Learning: Serving ML ModelsLightbend

NextGenML Moldovan Radu Adrian

What's New in .Net 4.5Malam Team

Kafka for Microservices – You absolutely need Avro Schemas! | Gerardo Gutierr...HostedbyConfluent

messaging.pptxNParakh1

MLOps pipelines using MLFlow - From training to productionFabian Hadiji

Deploy your machine learning models to production with Kubernetescnvrg.io AI OS - Hands-on ML Workshops

Productionizing Machine Learning - Bigdata meetup 5-06-2019Iulian Pintoiu

Legion - AI Runtime PlatformAlexey Kharlamov

End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageAnimesh Singh

Strata parallel m-ml-ops_sept_2017Nisha Talagala

Apache NiFi: A Drag and Drop ApproachCalculated Systems

Big Data Streams Architectures. Why? What? How?Anton Nazaruk

Semelhante a Pipelines for model deployment (20)

Containerized architectures for deep learning

Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams

Distributed & Highly Available server applications in Java and Scala

Event-Driven Model Serving: Stream Processing vs. RPC with Kafka and TensorFl...

Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...

Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...

TechTalk: Connext DDS 5.2.

Operationalizing Machine Learning: Serving ML Models

NextGenML

What's New in .Net 4.5

Kafka for Microservices – You absolutely need Avro Schemas! | Gerardo Gutierr...

messaging.pptx

MLOps pipelines using MLFlow - From training to production

Deploy your machine learning models to production with Kubernetes

Productionizing Machine Learning - Bigdata meetup 5-06-2019

Legion - AI Runtime Platform

End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage

Strata parallel m-ml-ops_sept_2017

Apache NiFi: A Drag and Drop Approach

Big Data Streams Architectures. Why? What? How?

Mais de Ramon Navarro

Plone serverRamon Navarro

How containers helped a SaaS startup be developed and go liveRamon Navarro

Plone 5 and machine learningRamon Navarro

CI on large open source software : Plone & Plone 5 is here!Ramon Navarro

Resource registries plone conf 2014Ramon Navarro

PyramidRamon Navarro

Multilingual sites in ploneRamon Navarro

Cafè amb webRamon Navarro

Presentacio meetup Python BCNRamon Navarro

plone.app.multilingual Ramon Navarro

WPD Barcelona 2008 Què és Plone ?Ramon Navarro

Mais de Ramon Navarro (11)

Plone server

How containers helped a SaaS startup be developed and go live

Plone 5 and machine learning

CI on large open source software : Plone & Plone 5 is here!

Resource registries plone conf 2014

Pyramid

Multilingual sites in plone

Cafè amb web

Presentacio meetup Python BCN

plone.app.multilingual

WPD Barcelona 2008 Què és Plone ?

Último

Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Anomaly detection and data imputation within time seriesParis Women in Machine Learning and Data Science

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums

Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...gajnagarg

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal

CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...gajnagarg

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop

Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann

Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...gajnagarg

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Pipelines for model deployment

1. Pipelines for model deployment Ramon Navarro Bosch, CTO Onna

2. What is • Startup Barcelona / San Francisco • SaaS / On-Premise • Connect sources of information, gather, analyze and offer a UI to search for information • Knowledge Management and apps for eDiscovery and contract management • API centric solution

3. “41 components architecture distributed with kubernetes (k8s) and deployed with jenkins CI/CD”

4. Static models • Calculate a model ofﬂine • SVM / CNN / RF / RNN / LSTM / Summary / … • Mostly tensorﬂow, sklearn

5. Static models Dataset Algorithm Test Pack Deploy dev stage prod tf… Train

6. Static models • Detect interesting content type • Get dataset - apply generic feature extraction • Train model … for example: SVM (binary) - versions of the models as contained files • Embed in processing container jenkinsfile / dockerfile • Build container and push version to registry • Adding to production stack • Continuous build on each commit

7. Arquitecture - k8s Canonical REST API Guillotina Processing engine … Account namespace Shared namespace

8. Queue system New document Pipeline RabbitMQ Feature Extraction Beat Beat Write Each component redirect messages processed to next exchange Director Exit

9. • Testing accuracy on production processing new messages. Plug to queue and don’t ack messages to get reprocessed • Scale the # of pods/beats for each one based on the size of the queue • Dead letter queues for not well processed • Delay queues for incremental processing • Experimental beats • REST API to our Beats engine to extract information about a resource Static models

10. “We allow users to create their own models”

11. Guillotina • Open Source scalable resource management framework (data plaform) • Provides an extensible Transactional Traversal REST API with distributed DB support • Event triggering • Security model • All async • PyData talk 21/5

12. Dynamic models • Distributed continuous training • Serve multi model live • Model storage • Direct access to distributed data • Inference and train

13. Model+Meta Worker ParameterServer Serving Algorithm DocumentsLabels Metadata EmbeddingVocabulary

14. Simplified example • User defines labels - ML - BUSINESS • Asks to train a classification model based on logistic regression • A model is saved on the DB (SavedModel) • User defines a Rule to apply the model (tf serving) - each time I have a new Mail from Gmail

15. tf.serving pods (c++) • gRPC from Canonical to inference with mini batch / model spec • SourceAdapter (Loader) connector for Canonical model loading (versions handling of the model) • ServableAdapter with sharing vocabulary and multiple loaded models (multiple models on a serving component) • k8s scaling and monitoring

16. Dynamic distributed ﬂow • Allocate variables and workers pods dynamic with k8s python api (there are limits) • Packs an Experiment and the Estimator (with Keras model or direct TF) • input_fn is a feed generator that gets the data from the web socket API to tfRecord (Guillotina model to Protobuff model) • Re-train with new documents loading the model last check • Write the model by the main worker on the canonical • Another pod runs the validation on the saved canonical for each checkpoint • Dev can runs the tensorboard to the canonical model endpoint

17. Canonical Ecosystem Serving TrainingTrainingTrainingTrainingTraining

18. Preguntes ? Gràcies! Ramon Navarro Bosch ramon@onna.com

Pipelines for model deployment

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semelhante a Pipelines for model deployment

Semelhante a Pipelines for model deployment (20)

Mais de Ramon Navarro

Mais de Ramon Navarro (11)

Último

Último (20)

Pipelines for model deployment