SlideShare a Scribd company logo
1 of 16
EPAM ML/AI Accelerator
Open Data Analytics Hub
ODAHU
Dmitrii Suslov
Vladislav Tokarev
January 2020
AGENDA
1 G E N E R A L M L /A I P R O D U C T L I F E C Y L E
2
3
6
M L /A I P R O D U C T S O P E R AT I O N A L I Z AT I O N
C H A L L E N G E S
O DA H U K E Y F E AT U R E S I N 1 . X R E L E A S E
O DA H U F O R I N T E R N A L E PA M P R O J E TC S
8 D E M O
9 Q & A
4 O DA H U A R C H I T E C T U R E
5 O DA H U R OA D M A P
7 C O O P E R AT I O N A N D C O N T R I B U T I O N
2
GENERAL ML/AI PRODUCT LIFECYCLE
Ideation
Phase
Data Preparation
Phase
Data Exploratory
Phase
ML Models Training &
Tuning Phase
AI Products
Development &
Integration Phase
AI Products
Production Phase
AI product feedback loop
ML model
selection & tuning loop
AI product development cycle
Define AI
product
Collect AI
product
requirements
Discover
available
datasets
Develop and
deliver data
ETL pipelines
Deliver data
product
Prepare
training and
testing data
sets
Evaluate and
choose right
ML algorithms
Train, test
and tune ML
model
Build binary
file with
trained
model
Build AI services
with trained ML
models wrapped
into RESTful service
and Docker
containers
Build AI products
with families of AI
services and
automation
pipelines
Deploy and test AI
services and
products
Deliver AI products in
production
- Service mesh
- A/B testing
- Traffic mirroring
- Req. & Resp.
logging
Monitoring &
Alerting
Collect feedback and
monitor prediction
accuracy
Automate ML
CI pipelines
3
COMMON ML/AI PRODUCTS OPERATIONALIZATION CHALLENGES
• CICD for ML models
• Dependency management
• ML training experiments evaluation
• Keeping track of data and models
• Packaging models for different target environments
• Scaling ML model training and runtime environment
• Enterprise level infrastructure: automated, secured, multi-tenant, scalable, manageable,
etc..
4
KEY FEATURES IN RELEASE 1.X
• Pluggable ML toolchains system and Mlflow support
• Kubernetes native services for training, packaging, deploying ML models with APIs in OpenAPI (ex.
Swagger) specification
• AI service catalog
• Connections manager
• SDKs generated from OpenAPI specifications and command line tool
• ML feedback loop components
• GPU for ML training loads in K8S
• Horizontal scaling with Knative for models deployed as services in kubernetes
• Advanced traffic routing schemas with Istio for ML models deployed as RESTful AI services
• Plugin for JupyterLab
• Plugin for Airflow
• SSO with OpenID Connect protocol
• System monitoring
• Deployment automation in major kubernetes platforms: GCP GKE, AWS EKS, Azure AKS
• Open source under Apache 2.0 https://github.com/odahu
• Open documentation https://docs.odahu.org/
5
HIGH-LEVEL LAYERED COMPONENTS VIEW ON ODAHU
Plugins for ML IDEsCommand line tools Plugins for workflow and CICD engines
SDKs SDKs (Python, Go and other languages) generated from ODAHU OpenAPI specifications (Ex. Swagger)
Core
Components
Training
ML models
Infrastructure deployment
automation
Packaging
ML models
External
Systems
Deploying
ML models
ML training clusters (K8S, Spark, HPC, others)
Infrastructure AWS Azure GCP
Feedback
loop
On-Premise
KMS SSO
AI runtime clusters (K8S, Spark, Hadoop, others)
ML frameworks (Mlflow, Sklearn, TensorFlow, others) Data sources (Object storages, DBs, File systems, others)
VSC (github gitlab, bitbucket, TFS, others)
Docker registries Package registries ETL CICD
Web control panel
ML scripts
ODAHU componentLegend: External component Custom scripts of ML project Logical group Depends
Connections
manager
Monitoring
Alerting
Logging
ML pipelines CICD pipelines
User Facing
Components
ML/AI Project
Components
ML/AI productsData pipelinesODAHU manifests
6
ODAHU SERVICE FOR ML MODEL TRAINING
Data Scientist IDEODAHU Command Line Tool
Core
Components Training ML Model Service
External
Systems
User Facing
Components
Connections ManagerAudit Service
orchestrate ML training jobs
get credentialssend audit info
get ML
scripts
get
data
send package with ML model
send log msg & metrics from cluster
send ML training metrics
Version Control System
Data Source
Compute cluster
Package repository
ML metrics tracking system
Cluster monitoring system
submit training
request with
ODAHU manifest
Workflow engine
ODAHU Plugin ODAHU Plugin
ODAHU componentLegend: External component Custom scripts of ML project Logical group Get
Send
Control
send req. to train ML model
ML scripts
Prepared datasets
ML training jobs
ML model archive
7
ODAHU SERVICE FOR ML MODEL PACKAGING
Data Scientist IDEODAHU Command Line Tool
Core
Components
Packaging ML model Service
External
Systems
User Facing
Components
Connections ManagerAudit Service
orchestrate ML training job
get credentialssend audit info
get archive
with ML model
send docker imagePackage registry Compute cluster
submit packaging
request with
ODAHU manifest
Workflow engine
ODAHU Plugin ODAHU Plugin
Registry
send log msg
& metrics
from cluster
Monitoring system
ODAHU componentLegend: External component ML project artifact Logical group Get
Control
send req. to package model for target platform
ML model archive ML model packaging job
ML model packaged for
target platform
8
ODAHU SERVICE FOR ML MODEL DEPLOYMENT
Data scientist IDEODAHU command line tool
Core
Components Deploying ML model service
External
Systems
ODAHU componentLegend: External component ML project artifact Logical group
User Facing
Components
Connections manager
Audit service
orchestrate model deployment job
get credentials
send audit info
submit packaging
request with
ODAHU manifest
Workflow engine
Control
Get
Send
ODAHU plugin ODAHU plugin
get docker
image and deployRegistry Compute cluster
send log msg & metrics
from cluster
Monitoring system
Service Catalog
send req. to deploy model on target platform
AI services or/and jobsAI services or/and jobs
9
Workflow engine with ODAHU plugin
ML model delivery pipeline
ML/AI PIPELINES EXAMPLE BASED ON ODAHU COMPONETS
ODAHU Core
Components
Deploying ML model
service
ODAHU componentLegend: External component ML project artifact Control
Send
load
data
transform
data
train & validate
ml model
validate
data
package model for target
platform and store it in registry
deploy
model
ML model inference pipeline
load
data
transform
data
request
inference
validate
data
deliver
inference
Training ML model
service
Packaging ML model
service
Feedback loop
components
log inference req. & resp.
send inference feedback message
Compute cluster
Inferencereq.,resp.,feedbackmessages
Data Storage
Historical
data
Inference
req., resp.
and
feedback
data
Inference
input data
Inference services or/and
batch jobs
send inference req. & resp.
ML
model’s
inference
consuming
system
deploy
model
send inference
Get
get
inferencesend ml training req. send ml training req. send ml packaging req.
10
KEY FEATURES IN ODAHU ROADMAP FOR NEXT RELEASES
• Web control panel
• Role-based access control
• Activity audit
• Advanced logging and alerting
• Support more platforms for ML model training loads
• Support more runtime platforms ML models
• Plugins for more IDEs
• Plugins for more workflow engines
• Additional services for ML/AI lifecycle
• Deployment automation for on-premise infrastructure with OpenShift and pure K8S
• Deployment from Cloud Marketplaces – Google, AWS and other clouds
• More and better documentation
11
ODAHU FOR INTERNAL EPAM PROJECTS
• Employee vs positions matching
• Employee attrition score
• Employee star score
• Automated language assessment
• Employee productivity model
• And more ….
12
COOPERATION AND CONTRIBUTION TO ODAHU
• ODAHU team helps with conducting demo for EPAM clients
• ODAHU team helps with ODAHU deployment and configuration for EPAM clients
• Contribution to ODAHU projects is welcome https://github.com/odahu
13
ODAHU DEMO
D L A B + O DA H U + 3 d p a r t y
s e r v i c e s
M L p r o j e c t s ex a m p l e s
a d a p t e d t o O DA H U
• Wine quality inference
(ODAHU + Mlflow + Scikit-
learn + Airflow DAG)
• Text classification (ODAHU+
Mlflow + Keras + TensorFlow)
• Image recognition (ODAHU +
Mlflow + Keras + TensorFlow)
• Data Scientist IDEs
orchestration with DLab
• Core ODAHU services
• 3d party systems
integrated with ODAHU
14
DEMO
15
Q&A
16

More Related Content

What's hot

Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...
Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...
Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...CA Technologies
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18Harvinder Atwal
 
Deploying ML models in the enterprise
Deploying ML models in the enterpriseDeploying ML models in the enterprise
Deploying ML models in the enterprisedoppenhe
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4jNeo4j
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflowCharmi Chokshi
 
Big Data Architecture and Design Patterns
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design PatternsJohn Yeung
 
Intro to Neo4j
Intro to Neo4jIntro to Neo4j
Intro to Neo4jNeo4j
 
Treinamento MLOps Databricks 2023
Treinamento MLOps Databricks 2023Treinamento MLOps Databricks 2023
Treinamento MLOps Databricks 2023Douglas Mendes
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformDatabricks
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDATAVERSITY
 
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageEnd to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageAnimesh Singh
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsMárton Kodok
 
Building a modern data stack to maintain an efficient and safe electrical grid
Building a modern data stack to maintain an efficient and safe electrical gridBuilding a modern data stack to maintain an efficient and safe electrical grid
Building a modern data stack to maintain an efficient and safe electrical gridNeo4j
 
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...Split Miner: Discovering Accurate and Simple Business Process Models from Eve...
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...Marlon Dumas
 
Data Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data ArchitectureData Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data ArchitectureZaloni
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationCambridge Semantics
 

What's hot (20)

Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...
Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...
Digital Assurance: Develop a Comprehensive Testing Strategy for Digital Trans...
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
 
Neo4j graph database
Neo4j graph databaseNeo4j graph database
Neo4j graph database
 
Deploying ML models in the enterprise
Deploying ML models in the enterpriseDeploying ML models in the enterprise
Deploying ML models in the enterprise
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflow
 
Big Data Architecture and Design Patterns
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design Patterns
 
Intro to Neo4j
Intro to Neo4jIntro to Neo4j
Intro to Neo4j
 
Treinamento MLOps Databricks 2023
Treinamento MLOps Databricks 2023Treinamento MLOps Databricks 2023
Treinamento MLOps Databricks 2023
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageEnd to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflows
 
Deep learning
Deep learningDeep learning
Deep learning
 
MLOps with Kubeflow
MLOps with Kubeflow MLOps with Kubeflow
MLOps with Kubeflow
 
Building a modern data stack to maintain an efficient and safe electrical grid
Building a modern data stack to maintain an efficient and safe electrical gridBuilding a modern data stack to maintain an efficient and safe electrical grid
Building a modern data stack to maintain an efficient and safe electrical grid
 
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...Split Miner: Discovering Accurate and Simple Business Process Models from Eve...
Split Miner: Discovering Accurate and Simple Business Process Models from Eve...
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 
Data Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data ArchitectureData Lakes - The Key to a Scalable Data Architecture
Data Lakes - The Key to a Scalable Data Architecture
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 

Similar to EPAM ML/AI Accelerator - ODAHU

GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...James Anderson
 
Utilisation de MLflow pour le cycle de vie des projet Machine learning
Utilisation de MLflow pour le cycle de vie des projet Machine learningUtilisation de MLflow pour le cycle de vie des projet Machine learning
Utilisation de MLflow pour le cycle de vie des projet Machine learningParis Data Engineers !
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaData Science Milan
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOpsCarl W. Handlin
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondProvectus
 
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...All Things Open
 
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)dtz001
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in ProductionDataWorks Summit
 
Microsoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDrivenMicrosoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDrivenGoDataDriven
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsAnyscale
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdf
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdfPyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdf
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdfJim Dowling
 
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on DatabricksDataScienceConferenc1
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021Sandesh Rao
 
mlflow: Accelerating the End-to-End ML lifecycle
mlflow: Accelerating the End-to-End ML lifecyclemlflow: Accelerating the End-to-End ML lifecycle
mlflow: Accelerating the End-to-End ML lifecycleDatabricks
 

Similar to EPAM ML/AI Accelerator - ODAHU (20)

GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
 
Utilisation de MLflow pour le cycle de vie des projet Machine learning
Utilisation de MLflow pour le cycle de vie des projet Machine learningUtilisation de MLflow pour le cycle de vie des projet Machine learning
Utilisation de MLflow pour le cycle de vie des projet Machine learning
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
03_aiops-1.pptx
03_aiops-1.pptx03_aiops-1.pptx
03_aiops-1.pptx
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOps
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and Beyond
 
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...
Deployment Design Patterns - Deploying Machine Learning and Deep Learning Mod...
 
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
 
Microsoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDrivenMicrosoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDriven
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
 
DEVOPS AND MACHINE LEARNING
DEVOPS AND MACHINE LEARNINGDEVOPS AND MACHINE LEARNING
DEVOPS AND MACHINE LEARNING
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdf
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdfPyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdf
PyCon Sweden 2022 - Dowling - Serverless ML with Hopsworks.pdf
 
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
 
mlflow: Accelerating the End-to-End ML lifecycle
mlflow: Accelerating the End-to-End ML lifecyclemlflow: Accelerating the End-to-End ML lifecycle
mlflow: Accelerating the End-to-End ML lifecycle
 
Monitoring AI with AI
Monitoring AI with AIMonitoring AI with AI
Monitoring AI with AI
 

Recently uploaded

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 

Recently uploaded (20)

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 

EPAM ML/AI Accelerator - ODAHU

  • 1. EPAM ML/AI Accelerator Open Data Analytics Hub ODAHU Dmitrii Suslov Vladislav Tokarev January 2020
  • 2. AGENDA 1 G E N E R A L M L /A I P R O D U C T L I F E C Y L E 2 3 6 M L /A I P R O D U C T S O P E R AT I O N A L I Z AT I O N C H A L L E N G E S O DA H U K E Y F E AT U R E S I N 1 . X R E L E A S E O DA H U F O R I N T E R N A L E PA M P R O J E TC S 8 D E M O 9 Q & A 4 O DA H U A R C H I T E C T U R E 5 O DA H U R OA D M A P 7 C O O P E R AT I O N A N D C O N T R I B U T I O N 2
  • 3. GENERAL ML/AI PRODUCT LIFECYCLE Ideation Phase Data Preparation Phase Data Exploratory Phase ML Models Training & Tuning Phase AI Products Development & Integration Phase AI Products Production Phase AI product feedback loop ML model selection & tuning loop AI product development cycle Define AI product Collect AI product requirements Discover available datasets Develop and deliver data ETL pipelines Deliver data product Prepare training and testing data sets Evaluate and choose right ML algorithms Train, test and tune ML model Build binary file with trained model Build AI services with trained ML models wrapped into RESTful service and Docker containers Build AI products with families of AI services and automation pipelines Deploy and test AI services and products Deliver AI products in production - Service mesh - A/B testing - Traffic mirroring - Req. & Resp. logging Monitoring & Alerting Collect feedback and monitor prediction accuracy Automate ML CI pipelines 3
  • 4. COMMON ML/AI PRODUCTS OPERATIONALIZATION CHALLENGES • CICD for ML models • Dependency management • ML training experiments evaluation • Keeping track of data and models • Packaging models for different target environments • Scaling ML model training and runtime environment • Enterprise level infrastructure: automated, secured, multi-tenant, scalable, manageable, etc.. 4
  • 5. KEY FEATURES IN RELEASE 1.X • Pluggable ML toolchains system and Mlflow support • Kubernetes native services for training, packaging, deploying ML models with APIs in OpenAPI (ex. Swagger) specification • AI service catalog • Connections manager • SDKs generated from OpenAPI specifications and command line tool • ML feedback loop components • GPU for ML training loads in K8S • Horizontal scaling with Knative for models deployed as services in kubernetes • Advanced traffic routing schemas with Istio for ML models deployed as RESTful AI services • Plugin for JupyterLab • Plugin for Airflow • SSO with OpenID Connect protocol • System monitoring • Deployment automation in major kubernetes platforms: GCP GKE, AWS EKS, Azure AKS • Open source under Apache 2.0 https://github.com/odahu • Open documentation https://docs.odahu.org/ 5
  • 6. HIGH-LEVEL LAYERED COMPONENTS VIEW ON ODAHU Plugins for ML IDEsCommand line tools Plugins for workflow and CICD engines SDKs SDKs (Python, Go and other languages) generated from ODAHU OpenAPI specifications (Ex. Swagger) Core Components Training ML models Infrastructure deployment automation Packaging ML models External Systems Deploying ML models ML training clusters (K8S, Spark, HPC, others) Infrastructure AWS Azure GCP Feedback loop On-Premise KMS SSO AI runtime clusters (K8S, Spark, Hadoop, others) ML frameworks (Mlflow, Sklearn, TensorFlow, others) Data sources (Object storages, DBs, File systems, others) VSC (github gitlab, bitbucket, TFS, others) Docker registries Package registries ETL CICD Web control panel ML scripts ODAHU componentLegend: External component Custom scripts of ML project Logical group Depends Connections manager Monitoring Alerting Logging ML pipelines CICD pipelines User Facing Components ML/AI Project Components ML/AI productsData pipelinesODAHU manifests 6
  • 7. ODAHU SERVICE FOR ML MODEL TRAINING Data Scientist IDEODAHU Command Line Tool Core Components Training ML Model Service External Systems User Facing Components Connections ManagerAudit Service orchestrate ML training jobs get credentialssend audit info get ML scripts get data send package with ML model send log msg & metrics from cluster send ML training metrics Version Control System Data Source Compute cluster Package repository ML metrics tracking system Cluster monitoring system submit training request with ODAHU manifest Workflow engine ODAHU Plugin ODAHU Plugin ODAHU componentLegend: External component Custom scripts of ML project Logical group Get Send Control send req. to train ML model ML scripts Prepared datasets ML training jobs ML model archive 7
  • 8. ODAHU SERVICE FOR ML MODEL PACKAGING Data Scientist IDEODAHU Command Line Tool Core Components Packaging ML model Service External Systems User Facing Components Connections ManagerAudit Service orchestrate ML training job get credentialssend audit info get archive with ML model send docker imagePackage registry Compute cluster submit packaging request with ODAHU manifest Workflow engine ODAHU Plugin ODAHU Plugin Registry send log msg & metrics from cluster Monitoring system ODAHU componentLegend: External component ML project artifact Logical group Get Control send req. to package model for target platform ML model archive ML model packaging job ML model packaged for target platform 8
  • 9. ODAHU SERVICE FOR ML MODEL DEPLOYMENT Data scientist IDEODAHU command line tool Core Components Deploying ML model service External Systems ODAHU componentLegend: External component ML project artifact Logical group User Facing Components Connections manager Audit service orchestrate model deployment job get credentials send audit info submit packaging request with ODAHU manifest Workflow engine Control Get Send ODAHU plugin ODAHU plugin get docker image and deployRegistry Compute cluster send log msg & metrics from cluster Monitoring system Service Catalog send req. to deploy model on target platform AI services or/and jobsAI services or/and jobs 9
  • 10. Workflow engine with ODAHU plugin ML model delivery pipeline ML/AI PIPELINES EXAMPLE BASED ON ODAHU COMPONETS ODAHU Core Components Deploying ML model service ODAHU componentLegend: External component ML project artifact Control Send load data transform data train & validate ml model validate data package model for target platform and store it in registry deploy model ML model inference pipeline load data transform data request inference validate data deliver inference Training ML model service Packaging ML model service Feedback loop components log inference req. & resp. send inference feedback message Compute cluster Inferencereq.,resp.,feedbackmessages Data Storage Historical data Inference req., resp. and feedback data Inference input data Inference services or/and batch jobs send inference req. & resp. ML model’s inference consuming system deploy model send inference Get get inferencesend ml training req. send ml training req. send ml packaging req. 10
  • 11. KEY FEATURES IN ODAHU ROADMAP FOR NEXT RELEASES • Web control panel • Role-based access control • Activity audit • Advanced logging and alerting • Support more platforms for ML model training loads • Support more runtime platforms ML models • Plugins for more IDEs • Plugins for more workflow engines • Additional services for ML/AI lifecycle • Deployment automation for on-premise infrastructure with OpenShift and pure K8S • Deployment from Cloud Marketplaces – Google, AWS and other clouds • More and better documentation 11
  • 12. ODAHU FOR INTERNAL EPAM PROJECTS • Employee vs positions matching • Employee attrition score • Employee star score • Automated language assessment • Employee productivity model • And more …. 12
  • 13. COOPERATION AND CONTRIBUTION TO ODAHU • ODAHU team helps with conducting demo for EPAM clients • ODAHU team helps with ODAHU deployment and configuration for EPAM clients • Contribution to ODAHU projects is welcome https://github.com/odahu 13
  • 14. ODAHU DEMO D L A B + O DA H U + 3 d p a r t y s e r v i c e s M L p r o j e c t s ex a m p l e s a d a p t e d t o O DA H U • Wine quality inference (ODAHU + Mlflow + Scikit- learn + Airflow DAG) • Text classification (ODAHU+ Mlflow + Keras + TensorFlow) • Image recognition (ODAHU + Mlflow + Keras + TensorFlow) • Data Scientist IDEs orchestration with DLab • Core ODAHU services • 3d party systems integrated with ODAHU 14