Revamping ML Pipelines with MLOps
Presented by
Sameer Mahajan
Principal Architect
Sameer Mahajan has 25 years of experience in the
software industry. He has worked for companies
like Microsoft and Symantec across areas like
machine learning, storage, cloud, big data,
networking and analytics in the United States &
India.
Sameer holds 9 US patents and is an alumnus of IIT
Bombay and Georgia Tech. He conducts hands-on
workshops and seminars and also participates in
panel discussions on upcoming technologies like
machine learning and big data.
Sameer is one of the mentors for the Machine
Learning Foundations course at Coursera.
Agenda
• Background
• ML Lifecycle
• Challenges with ML Productization
• Examples of end-to-end ML platforms
• MLOps Best Practices
• MLOps Methodologies
• Build, Retrain and Release Pipelines
• MLflow and demo
• Airflow demo
• Model Serving Pipeline
• TensorFlow Model Serving
• TensorFlow.js demo
• TFX-based MLOps system on Google Cloud
• Azure MLOps
• Conclusion
• Q & A
Background
• ML spend will reach $57.6 billion by 2021
• More and more ML systems are going into production
• Gartner 2019 survey suggests that
i. 59% of respondents have AI deployed today
ii. The average number of deployed AI projects is expected to increase to 35 by 2022
• Streamline the ML lifecycle
• Machine Learning Operations (MLOps)
• Started gaining traction in 2018
ML Lifecycle
Process model – Option B
World
• Reality we are trying to model
• Source of data
Data Engineering
1. Data Capturing
• Ingest data from sensors, devices, databases
2. Data Preparation
• Cleanse and transform data
• Signal processing
3. Data Visualization
• Visual analytics to capture trends indicative of underlying model processes
Machine Learning
• Train models that reflect the real-world phenomena
Inference
• Use the models in real-world applications and processes for predictions, insights etc.
Challenges
• Dealing with data, models and code
• Deployment and automation
• Collaboration: data engineers, data scientists, ML engineers, business analysts, operations
• Continuous Integration (CI), Continuous Deployment (CD), Continuous Training (CT)
• Reproducibility of results across transformations, hyperparameters, initializers and hardware
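A minimal sketch of the reproducibility point above, assuming a toy weight initializer: when the seed is recorded together with the other run settings, the same initial weights can be regenerated later. The function name `init_weights` and the `run_config` keys are illustrative assumptions, not part of any particular framework.

```python
import random
from typing import List


def init_weights(n_weights: int, seed: int) -> List[float]:
    """Draw initial weights from an explicitly seeded generator."""
    rng = random.Random(seed)  # local generator, no hidden global state
    return [rng.uniform(-0.1, 0.1) for _ in range(n_weights)]


# Hypothetical run configuration: everything that influences the result
# is written down so the run can be repeated later.
run_config = {"seed": 42, "n_weights": 4, "learning_rate": 0.01}

first = init_weights(run_config["n_weights"], run_config["seed"])
second = init_weights(run_config["n_weights"], run_config["seed"])
other = init_weights(run_config["n_weights"], run_config["seed"] + 1)

print(first == second)  # same seed, same initial weights
print(first == other)   # different seed
```

Fixing seeds covers only the initializer part of the problem; differences in hardware or numeric libraries can still change results and have to be tracked separately.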
More Challenges
• Complex pipelines
1. Ensemble
2. Retraining
3. Transfer learning
4. Multiple prediction pipelines in parallel (canary)
• Self-updating ML pipelines
• Governance: tracing a failed result back to data or code
• Scalability
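One way to picture the canary item above is a router that sends a fixed share of requests to a candidate model while the rest stay on the current one. This is a sketch under stated assumptions: both models are placeholders, and hashing a request id into 100 buckets is just one possible routing rule.

```python
import hashlib
from typing import Tuple


def route(request_id: str, canary_percent: int) -> str:
    """Return "canary" or "stable" for a request, deterministically."""
    digest = hashlib.sha256(request_id.encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % 100  # stable bucket in 0..99
    return "canary" if bucket < canary_percent else "stable"


def stable_model(x: float) -> float:
    return 2.0 * x  # placeholder for the current production model


def canary_model(x: float) -> float:
    return 2.1 * x  # placeholder for the candidate model


MODELS = {"stable": stable_model, "canary": canary_model}


def predict(request_id: str, x: float, canary_percent: int = 10) -> Tuple[str, float]:
    """Pick a pipeline for the request and return (arm, prediction)."""
    arm = route(request_id, canary_percent)
    return arm, MODELS[arm](x)


# Count how many of 1000 hypothetical requests land on each pipeline.
counts = {"stable": 0, "canary": 0}
for i in range(1000):
    arm, _ = predict(f"req-{i}", 1.0)
    counts[arm] += 1
print(counts)
```

Because the bucket is derived from the request id, the same request always reaches the same pipeline, which makes a failed result easier to trace back.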
Examples of end-to-end ML platforms
1. Uber’s Michelangelo
2. Facebook’s FBLearner
3. Google has TFX
4. Airbnb has Bighead
5. Databricks introduced MLflow, which is now open source
6. Amazon SageMaker
7. Azure
8. DataRobot
9. Polyaxon and Kubeflow
Best Practices
• Data Pipeline: Discoverable and Accessible Data - data lake, data mesh
• Version control: GitHub, Data Science Version Control (DVC), MLflow Projects
• Data Exploration: Jupyter, pandas, NumPy, seaborn
• ML: scikit-learn
• CI/CD: Jenkins
• Packaging: Docker
• Orchestrator: Airflow, Kubernetes
• Monitoring: ELK, Prometheus
Methodologies
1. Combination of DevOps (CI/CD), Software Engineering and ML
2. ML experiments are captured as runs
3. Each run captures all its steps, data, parameters, hyperparameters, code, initializers, model evaluations, artifacts such as trained models, and business results after deployment
4. Packaging a model: container
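The idea of capturing an experiment as a run can be sketched with a plain record type. This is not the API of MLflow or any other tool; the class `RunRecord`, its field names, the commit id and the artifact path are all hypothetical, chosen only to show which pieces of information travel together.

```python
import hashlib
import json
import time
import uuid
from dataclasses import asdict, dataclass, field
from typing import Dict, List


@dataclass
class RunRecord:
    """One experiment run; field names are illustrative assumptions."""
    code_version: str
    data_sha256: str
    params: Dict[str, float]
    seed: int
    metrics: Dict[str, float] = field(default_factory=dict)
    artifacts: List[str] = field(default_factory=list)
    run_id: str = field(default_factory=lambda: uuid.uuid4().hex)
    started_at: float = field(default_factory=time.time)


def fingerprint(data: bytes) -> str:
    """Content hash that identifies the exact data a run was trained on."""
    return hashlib.sha256(data).hexdigest()


training_data = b"x,y\n1,2\n2,4\n3,6\n"
run = RunRecord(
    code_version="abc1234",  # hypothetical commit id
    data_sha256=fingerprint(training_data),
    params={"learning_rate": 0.01, "epochs": 5},
    seed=42,
)
run.metrics["rmse"] = 0.12  # placeholder value, not a measured result
run.artifacts.append("models/model.pkl")  # hypothetical artifact path

# A run serialized this way can be stored, queried and compared later.
print(json.dumps(asdict(run), indent=2))
```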
Closer look at some pipelines
1. Build pipeline
• Triggered on a schedule or when new code is checked in or new data becomes available
• Building code and running unit tests
• Data tests: schema and distribution conformance
2. Retrain pipeline
• Triggered on a schedule or when new data becomes available
• Train, evaluate and register model
3. Release pipeline
• Triggered every time a new artifact is available
• Package, test, deploy to production, start monitoring
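The data tests in the build pipeline above (schema and distribution conformance) could look roughly like the following. It is a sketch under assumptions: the column names, the expected schema and the reference mean are made up, and comparing batch means is only a crude stand-in for a real distribution test.

```python
from statistics import mean
from typing import Dict, List

# Hypothetical expected schema: column name -> Python type.
EXPECTED_SCHEMA = {"age": int, "income": float}


def check_schema(rows: List[Dict], schema: Dict[str, type]) -> List[str]:
    """Return a list of human-readable schema violations."""
    errors = []
    for i, row in enumerate(rows):
        if set(row) != set(schema):
            errors.append(f"row {i}: columns {sorted(row)} != {sorted(schema)}")
            continue
        for name, expected in schema.items():
            if not isinstance(row[name], expected):
                errors.append(f"row {i}: {name} is not {expected.__name__}")
    return errors


def check_mean_drift(values: List[float], reference_mean: float,
                     tolerance: float) -> bool:
    """True when the batch mean stays within tolerance of the reference."""
    return abs(mean(values) - reference_mean) <= tolerance


batch = [
    {"age": 31, "income": 52000.0},
    {"age": 45, "income": 61000.0},
    {"age": 28, "income": "n/a"},  # wrong type on purpose
]

schema_errors = check_schema(batch, EXPECTED_SCHEMA)
ages_ok = check_mean_drift([r["age"] for r in batch],
                           reference_mean=35.0, tolerance=5.0)
print(schema_errors)
print(ages_ok)
```

A build pipeline would typically fail the build when `schema_errors` is non-empty or a drift check returns False, before any retraining is triggered.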
MLflow
• MLflow Tracking: Record and query experiments: code, data, config, and results
• MLflow Projects: Package data science code in a format to reproduce runs on any platform
• MLflow Models: Deploy machine learning models in diverse serving environments
• MLflow Registry: Store, annotate, discover, and manage models in a central repository
MLflow demo
Airflow demo
Model serving
Embedded model
1. Serialized pickle file
2. Language-agnostic exchange formats like PMML, PFA and ONNX
3. H2O exports a POJO in a JAR
Separate service
1. Cloud providers’ tools and SDKs wrapping models
2. Kubeflow
3. mlflow models
Published as data
1. Typically used in streaming / real-time scenarios
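The first embedded option above, a serialized pickle file, can be sketched with the standard library alone. The "model" here is assumed to be a plain dict of made-up linear parameters; a real model object would be serialized the same way.

```python
import pickle
from typing import Dict, List


def predict(model: Dict, features: List[float]) -> float:
    """Linear score: dot(weights, features) + bias."""
    return sum(w * x for w, x in zip(model["weights"], features)) + model["bias"]


# Training side: the "model" is a plain dict of learned parameters
# (values here are made up for illustration).
trained = {"weights": [0.5, -1.0], "bias": 2.0}
blob = pickle.dumps(trained)  # in practice this would be written to a file

# Application side: the serialized model is loaded in-process and called
# like any other function, with no network hop.
embedded = pickle.loads(blob)
print(predict(embedded, [4.0, 1.0]))
```

Pickle is Python-specific and should only be loaded from trusted sources, which is one reason the language-agnostic exchange formats listed above exist.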
TensorFlow.js model serving demo
1. Open Google Chrome
2. Open chrome://apps/
3. Start web server
4. RockPaperScissorsTensorflow.jsDemo (based on a Coursera assignment)
5. Open http://127.0.0.1:8887 in Chrome
6. Open developer tools
7. Demo retraining and predictions
TFX-based MLOps system on Google Cloud
MLOps using Azure Machine Learning
Conclusion
• Evolving field
• Applying learnings from other fields like DevOps and software engineering
• Taking a holistic view
• Upcoming tools and practices
• Key to making ML productization successful
Thank you!
For more information please reach out to me at
sameer.mahajan@gslab.com
https://in.linkedin.com/in/sameersmahajan

NASSCOM MLOps webinar
