SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
© Stonebranch 2022. All rights reserved.
ORCHESTRATE
the Flow of Data Across
Data Pipelines
May 3, 2022
Ravi Murugesan
Sr. Solution
Engineer
Scott Davis
Global Vice President
2
© Stonebranch 2022. All rights reserved.
DevOps Orchestration Layer
01
What is a Data Pipeline
02
How to Orchestrate a Data Pipeline
03
Data Pipeline Orchestration Demo
04
Questions and Answers
05
Agenda
© Stonebranch 2022. All rights reserved.
About Data Pipelines
3
Scott Davis
Global Vice President
4
© Stonebranch 2022. All rights reserved.
Vendor Landscape for DataOps – From Gartner
Orchestrators
Specialists
Portfolio Cloud Service Providers Servware (Services & Software) System Integrators
Integration Cataloging Governance
MDM Analytics-Ready Enterprise Data Management
Industrial Data Data Quality Observability
Continuous Delivery Accelerators Privacy & Access Control
* Based on “Gartner Data and Analytics Essentials: DataOps,” by Robert Robert Thanaraj
5
© Stonebranch 2022. All rights reserved.
Data Pipeline: Simple View
6
© Stonebranch 2022. All rights reserved.
Software & Tools By Stage
Dashboards
Looker, Tableau, Qlik, Power
BI, SAP BusinessObjects
Embedded Analytics
Sisense, Looker, Cube.js
Augmented Analytics
Throughspot, Outlier,
Anodot, Sisu
App Frameworks
Plotly Dash, Streamlit
Custom Apps
SMS Messages / Emails
Data Science &
Machine Learning
Databricks, SAS, MathWork,
Domino, Dataiku, DataRobot,
TIBCO Software, Spark,
RapidMiner, H2O.AI, AWS, GCP
AI, Azure ML, IBM Watson
Studio, Cloudera, Alteryx,
TensorFlow, Anaconda
Data Lake
Databricks Delta Lake,
Iceberg, Hudi, Hive Acid
Data Lake
within Cloud Storage
AWS S3, Google Cloud
Storage, HDFS,
Azure Data Lake Store
Data Warehouse
Snowflake, BigQuery, Spark,
AWS Redshift, Qubole, SAP
BW, SAP DWC, Oracle ADW,
Hive, Cloudera (for Hadoop)
ETL
(Extract, Transform, Load)
Informatica, IBM, SAP Data
Services, Oracle OWB, SAS,
Talend, AWS Glue, Azure Data
Factory, Pentaho, GCP Data
Fusion
Stream Data Processing
ELT
Kafka, Flink, Storm, GCP
Pub/Sub
Applications / ERP
Oracle, Salesforce, SAP,
ServiceNow
IoT Devices / Sensors
Stream Data
Website & Mobile Apps
Stream Data, Online
Transaction
Cloud Storage
AWS S3, Google Cloud
Storage, Azure
Data Sources Data Integration & Ingestion Data Store Analyze / Computation Delivery
How Do Enterprises Orchestrate Today?
7
© Stonebranch 2022. All rights reserved.
Common Ways to
Connect Data Tools
Within the Pipeline
Point-to-Point
Integrations
Custom
Scripts
Don’t Connect
(Manual Movement)
How Do Enterprises Orchestrate Today?
8
© Stonebranch 2022. All rights reserved.
Common Ways to
Connect Data Tools
Within the Pipeline
Point-to-Point
Integrations
Custom
Scripts
Don’t Connect
(Manual Movement)
Benefits of Proper
Orchestration Solutions
Centralized
View
Root-Cause
Issues
Proactive
Support
Achieve
Scale
Automation Pain Points
Common Ways to
Connect Data Tools
Within the Pipeline
Point-to-Point
Integrations
Custom
Scripts
Don’t Connect
(Manual Movement)
How Do Enterprises Orchestrate Today?
9
© Stonebranch 2022. All rights reserved.
Benefits of Proper
Orchestration Solutions
Centralized
View
Root-Cause
Issues
Proactive
Support
Achieve
Scale
In-Built
Schedulers
Open-Source
Schedulers
Cloud
Schedulers
Legacy On-Prem
Focused Schedulers
Can’t schedule jobs
in other tools
Often batch- or time-
based automation
Focus on their
own ecosystems
Can’t automate jobs in both
on-prem and cloud systems,
i.e., no hybrid IT automation
Data Pipeline
Orchestration
© Stonebranch 2022. All rights reserved. 10
11
© Stonebranch 2022. All rights reserved.
Data Pipeline Orchestration How to accomplish the real-time automation
and file transfers needed to manage the
entire data pipeline.
Data Pipeline Orchestration
Orchestration
How to accomplish the real-time automation
and file transfers needed to manage the
entire data pipeline.
• Centrally schedule and
orchestrate automated processes within
each tool along the entire data pipeline
• Use APIs or Agents to control the various
tools used within each stage
12
© Stonebranch 2022. All rights reserved.
Data Pipeline Orchestration
Orchestration
How to accomplish the real-time automation
and file transfers needed to manage the
entire data pipeline.
• Centrally schedule and
orchestrate automated processes within
each tool along the entire data pipeline
• Use APIs or Agents to control the various
tools used within each stage
What you achieve with this approach:
• Observability of the logs and data for
governance and security
• DataOps lifecycle management (Dev-Test-
Prod) - including simulations
• Centralized control and visibility with
visual workflows
• Quickly root-cause issues with proactive
alerts when something fails
13
© Stonebranch 2022. All rights reserved.
Data Pipeline Orchestration
Orchestration
How to accomplish the real-time automation
and file transfers needed to manage the
entire data pipeline.
• Centrally schedule and
orchestrate automated processes within
each tool along the entire data pipeline
• Use APIs or Agents to control the various
tools used within each stage
What you achieve with this approach:
• Observability of the logs and data for
governance and security
• DataOps lifecycle management (Dev-Test-
Prod) - including simulations
• Centralized control and visibility with
visual workflows
• Quickly root-cause issues with proactive
alerts when something fails
14
© Stonebranch 2022. All rights reserved.
Orchestration
15
© Stonebranch 2022. All rights reserved.
Driven by SOAP
Cloud DevOps ERP/Apps
IaaS
UAC
SOAP
META-ORCHESTRATION
16
© Stonebranch 2022. All rights reserved.
Self-Service
Automation
Centralized collaboration
platform for data,
developers, and
operations
IT ops teams gain
operational visibility
Data teams approve and
trigger automated workflows
& pipelines from common
business applications
Data Pipeline
Putting the Ops in DataOps
17
© Stonebranch 2022. All rights reserved.
For Enterprises Ready for the Next Level of Maturity
Develop/
Orchestrate
Test /
Simulate
Production
/ Deploy
Continuous Improvement Continuous Deployment
Development Controller Production Controller
Develop/
Orchestrate
Test /
Simulate
Production
/ Deploy
Continuous Improvement Continuous Deployment
Development Controller Production Controller
Putting the Ops in DataOps
18
© Stonebranch 2022. All rights reserved.
For Enterprises Ready for the Next Level of Maturity
Web
GUI
As
Code
Via in-built
capabilities
Promotion
Options
Via third-party
repositories like
GitHub
Data Pipeline Orchestration Demo
Ravi Murugesan
Sr. Solution Engineer
© Stonebranch 2022. All rights reserved. 19
© Stonebranch 2022. All rights reserved. 20
Demonstration
Update Visual Dashboard from Multiple Data Sources (both on-prem and cloud-based)
Live orchestration of a data pipeline,
including
• Sources (cloud, on-prem, apps)
• Ingestion, transformation (Informatica)
• Stores (Azure blob, Snowflake)
• Delivery (Tableau)
One of the Largest Global Food & Beverage Manufacturers in the World
Customer Use Case
21
Customer Use Case: Overview
One of the Largest Global Food & Beverage Manufacturers in the World
Evolution & Goal
• Goal: Orchestrate the full pipeline end-to-end
• Objective: Identify a platform that could connect all their critical data tools
Overall Strategy
• On-prem to cloud digital transformation
• Implemented an enterprise analytics data management environment
• Hub-and-spoke model to help keep regional resource groups and services segregated
• Approved services are first developed and deployed at the hub level, with further spoke
deployment via containers
Original Approach
• Their data pipeline for the enterprise data management environment with Azure Data Factory
• Azure Data Factory worked well in an Azure environment
• It served as an entry point for the project
• The Challenge: Data Factory did not integrate with their full stack of solutions used along the
data pipeline
22
© Stonebranch 2022. All rights reserved.
Data Pipeline Orchestration
One of the Largest Global Food & Beverage Manufacturers in the World
Achieving Their Goal
• Secure and robust file transfer
• DataOps: define pipelines as code and gain lifecycle
management (test/dev/prod) capabilities
• Integrate diverse data pipelines that are built using
various cloud-based and on-prem services and tools
• For operations: visibility into the process, improve SLAs,
real-time monitoring, alerting
• Unified view to design and orchestrate workflows
across multiple cloud and on-prem applications
Orchestration
Databases
23
© Stonebranch 2022. All rights reserved.
© Stonebranch 2022. All rights reserved.
Data Pipeline Orchestration Solution
Universal Automation
Center
24
Real Time Hybrid IT Automation
25
© Stonebranch 2022. All rights reserved.
Universal Automation Center Platform
A Platform Approach
Orchestrating IT processes from on-prem,
to cloud, to containerized microservices
Find. Deploy. Extend.
• Download extensions
• Share extensions
• Community driven
• Constant additions (monthly)
• Large Data Pipeline Focus
• Rapid creation of new integrations
Orchestration = Integration
26
© Stonebranch 2022. All rights reserved.
What to Look for in a Data Pipeline Orchestration Solution
27
© Stonebranch 2022. All rights reserved.
Summary
Who is this for?
• Want to keep using existing data tools, but are ready to graduate from opensource
schedulers to enterprise grade platforms
• Would like a single platform to connect Data Teams, Developers, IT Ops, and Cloud Ops
teams – to help scale their data program
• Need to operationalize DataOps methodologies to gain speed and improve data quality
• Want to gain full visibility across the entire pipeline – to move quickly when issue arise
• Have a growing or changing data tool landscape, and need the ability to rapidly build
new integrations (or download pre-existing integrations)
• Need to enable data scientists or business users with simple self-service capabilities
via the platform or third-party tools like ServiceNow, Microsoft Teams, or Slack
• Bonus: Want a central IT automation and orchestration platform (beyond data pipeline
orchestration) to support cloud automation, on-prem automation, traditional job
scheduling, and DevOps orchestration
© Stonebranch 2022. All rights reserved. 28
© Stonebranch 2022. All rights reserved. 29
Q & A
Scott Davis
Global Vice President
scott.davis@stonebranch.com
Stonebranch - Atlanta, USA
Ravi Murugesan
Sr. Solution Engineer
ravi.murugesan@stonebranch.com
Stonebranch – Frankfurt, Germany
Thank You
© Stonebranch 2022. All rights reserved.

Mais conteúdo relacionado

Mais procurados

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 

Mais procurados (20)

Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Databricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With DataDatabricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With Data
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 

Semelhante a Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines

Semelhante a Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines (20)

Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
 
InfoSphere BigInsights
InfoSphere BigInsightsInfoSphere BigInsights
InfoSphere BigInsights
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Implement a Universal Data Distribution Architecture to Manage All Streaming ...Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
 
Migrating from Oracle to Postgres
Migrating from Oracle to PostgresMigrating from Oracle to Postgres
Migrating from Oracle to Postgres
 
Re-Platforming Applications for the Cloud
Re-Platforming Applications for the CloudRe-Platforming Applications for the Cloud
Re-Platforming Applications for the Cloud
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
A Successful Journey to the Cloud with Data Virtualization
A Successful Journey to the Cloud with Data VirtualizationA Successful Journey to the Cloud with Data Virtualization
A Successful Journey to the Cloud with Data Virtualization
 
Government and Education Webinar: Optimizing Database Performance
Government and Education Webinar: Optimizing Database PerformanceGovernment and Education Webinar: Optimizing Database Performance
Government and Education Webinar: Optimizing Database Performance
 
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus WebinarBuild and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
 
CI/CD for a Data Platform
CI/CD for a Data PlatformCI/CD for a Data Platform
CI/CD for a Data Platform
 

Mais de DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

Mais de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 

Último

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 

Último (20)

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 

Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines

  • 1. © Stonebranch 2022. All rights reserved. ORCHESTRATE the Flow of Data Across Data Pipelines May 3, 2022 Ravi Murugesan Sr. Solution Engineer Scott Davis Global Vice President
  • 2. 2 © Stonebranch 2022. All rights reserved. DevOps Orchestration Layer 01 What is a Data Pipeline 02 How to Orchestrate a Data Pipeline 03 Data Pipeline Orchestration Demo 04 Questions and Answers 05 Agenda
  • 3. © Stonebranch 2022. All rights reserved. About Data Pipelines 3 Scott Davis Global Vice President
  • 4. 4 © Stonebranch 2022. All rights reserved. Vendor Landscape for DataOps – From Gartner Orchestrators Specialists Portfolio Cloud Service Providers Servware (Services & Software) System Integrators Integration Cataloging Governance MDM Analytics-Ready Enterprise Data Management Industrial Data Data Quality Observability Continuous Delivery Accelerators Privacy & Access Control * Based on “Gartner Data and Analytics Essentials: DataOps,” by Robert Robert Thanaraj
  • 5. 5 © Stonebranch 2022. All rights reserved. Data Pipeline: Simple View
  • 6. 6 © Stonebranch 2022. All rights reserved. Software & Tools By Stage Dashboards Looker, Tableau, Qlik, Power BI, SAP BusinessObjects Embedded Analytics Sisense, Looker, Cube.js Augmented Analytics Throughspot, Outlier, Anodot, Sisu App Frameworks Plotly Dash, Streamlit Custom Apps SMS Messages / Emails Data Science & Machine Learning Databricks, SAS, MathWork, Domino, Dataiku, DataRobot, TIBCO Software, Spark, RapidMiner, H2O.AI, AWS, GCP AI, Azure ML, IBM Watson Studio, Cloudera, Alteryx, TensorFlow, Anaconda Data Lake Databricks Delta Lake, Iceberg, Hudi, Hive Acid Data Lake within Cloud Storage AWS S3, Google Cloud Storage, HDFS, Azure Data Lake Store Data Warehouse Snowflake, BigQuery, Spark, AWS Redshift, Qubole, SAP BW, SAP DWC, Oracle ADW, Hive, Cloudera (for Hadoop) ETL (Extract, Transform, Load) Informatica, IBM, SAP Data Services, Oracle OWB, SAS, Talend, AWS Glue, Azure Data Factory, Pentaho, GCP Data Fusion Stream Data Processing ELT Kafka, Flink, Storm, GCP Pub/Sub Applications / ERP Oracle, Salesforce, SAP, ServiceNow IoT Devices / Sensors Stream Data Website & Mobile Apps Stream Data, Online Transaction Cloud Storage AWS S3, Google Cloud Storage, Azure Data Sources Data Integration & Ingestion Data Store Analyze / Computation Delivery
  • 7. How Do Enterprises Orchestrate Today? 7 © Stonebranch 2022. All rights reserved. Common Ways to Connect Data Tools Within the Pipeline Point-to-Point Integrations Custom Scripts Don’t Connect (Manual Movement)
  • 8. How Do Enterprises Orchestrate Today? 8 © Stonebranch 2022. All rights reserved. Common Ways to Connect Data Tools Within the Pipeline Point-to-Point Integrations Custom Scripts Don’t Connect (Manual Movement) Benefits of Proper Orchestration Solutions Centralized View Root-Cause Issues Proactive Support Achieve Scale
  • 9. Automation Pain Points Common Ways to Connect Data Tools Within the Pipeline Point-to-Point Integrations Custom Scripts Don’t Connect (Manual Movement) How Do Enterprises Orchestrate Today? 9 © Stonebranch 2022. All rights reserved. Benefits of Proper Orchestration Solutions Centralized View Root-Cause Issues Proactive Support Achieve Scale In-Built Schedulers Open-Source Schedulers Cloud Schedulers Legacy On-Prem Focused Schedulers Can’t schedule jobs in other tools Often batch- or time- based automation Focus on their own ecosystems Can’t automate jobs in both on-prem and cloud systems, i.e., no hybrid IT automation
  • 10. Data Pipeline Orchestration © Stonebranch 2022. All rights reserved. 10
  • 11. 11 © Stonebranch 2022. All rights reserved. Data Pipeline Orchestration How to accomplish the real-time automation and file transfers needed to manage the entire data pipeline.
  • 12. Data Pipeline Orchestration Orchestration How to accomplish the real-time automation and file transfers needed to manage the entire data pipeline. • Centrally schedule and orchestrate automated processes within each tool along the entire data pipeline • Use APIs or Agents to control the various tools used within each stage 12 © Stonebranch 2022. All rights reserved.
  • 13. Data Pipeline Orchestration Orchestration How to accomplish the real-time automation and file transfers needed to manage the entire data pipeline. • Centrally schedule and orchestrate automated processes within each tool along the entire data pipeline • Use APIs or Agents to control the various tools used within each stage What you achieve with this approach: • Observability of the logs and data for governance and security • DataOps lifecycle management (Dev-Test- Prod) - including simulations • Centralized control and visibility with visual workflows • Quickly root-cause issues with proactive alerts when something fails 13 © Stonebranch 2022. All rights reserved.
  • 14. Data Pipeline Orchestration Orchestration How to accomplish the real-time automation and file transfers needed to manage the entire data pipeline. • Centrally schedule and orchestrate automated processes within each tool along the entire data pipeline • Use APIs or Agents to control the various tools used within each stage What you achieve with this approach: • Observability of the logs and data for governance and security • DataOps lifecycle management (Dev-Test- Prod) - including simulations • Centralized control and visibility with visual workflows • Quickly root-cause issues with proactive alerts when something fails 14 © Stonebranch 2022. All rights reserved.
  • 15. Orchestration 15 © Stonebranch 2022. All rights reserved. Driven by SOAP Cloud DevOps ERP/Apps IaaS UAC SOAP META-ORCHESTRATION
  • 16. 16 © Stonebranch 2022. All rights reserved. Self-Service Automation Centralized collaboration platform for data, developers, and operations IT ops teams gain operational visibility Data teams approve and trigger automated workflows & pipelines from common business applications Data Pipeline
  • 17. Putting the Ops in DataOps 17 © Stonebranch 2022. All rights reserved. For Enterprises Ready for the Next Level of Maturity Develop/ Orchestrate Test / Simulate Production / Deploy Continuous Improvement Continuous Deployment Development Controller Production Controller
  • 18. Develop/ Orchestrate Test / Simulate Production / Deploy Continuous Improvement Continuous Deployment Development Controller Production Controller Putting the Ops in DataOps 18 © Stonebranch 2022. All rights reserved. For Enterprises Ready for the Next Level of Maturity Web GUI As Code Via in-built capabilities Promotion Options Via third-party repositories like GitHub
  • 19. Data Pipeline Orchestration Demo Ravi Murugesan Sr. Solution Engineer © Stonebranch 2022. All rights reserved. 19
  • 20. © Stonebranch 2022. All rights reserved. 20 Demonstration Update Visual Dashboard from Multiple Data Sources (both on-prem and cloud-based) Live orchestration of a data pipeline, including • Sources (cloud, on-prem, apps) • Ingestion, transformation (Informatica) • Stores (Azure blob, Snowflake) • Delivery (Tableau)
  • 21. One of the Largest Global Food & Beverage Manufacturers in the World Customer Use Case 21
  • 22. Customer Use Case: Overview One of the Largest Global Food & Beverage Manufacturers in the World Evolution & Goal • Goal: Orchestrate the full pipeline end-to-end • Objective: Identify a platform that could connect all their critical data tools Overall Strategy • On-prem to cloud digital transformation • Implemented an enterprise analytics data management environment • Hub-and-spoke model to help keep regional resource groups and services segregated • Approved services are first developed and deployed at the hub level, with further spoke deployment via containers Original Approach • Their data pipeline for the enterprise data management environment with Azure Data Factory • Azure Data Factory worked well in an Azure environment • It served as an entry point for the project • The Challenge: Data Factory did not integrate with their full stack of solutions used along the data pipeline 22 © Stonebranch 2022. All rights reserved.
  • 23. Data Pipeline Orchestration One of the Largest Global Food & Beverage Manufacturers in the World Achieving Their Goal • Secure and robust file transfer • DataOps: define pipelines as code and gain lifecycle management (test/dev/prod) capabilities • Integrate diverse data pipelines that are built using various cloud-based and on-prem services and tools • For operations: visibility into the process, improve SLAs, real-time monitoring, alerting • Unified view to design and orchestrate workflows across multiple cloud and on-prem applications Orchestration Databases 23 © Stonebranch 2022. All rights reserved.
  • 24. © Stonebranch 2022. All rights reserved. Data Pipeline Orchestration Solution Universal Automation Center 24
  • 25. Real Time Hybrid IT Automation 25 © Stonebranch 2022. All rights reserved. Universal Automation Center Platform A Platform Approach Orchestrating IT processes from on-prem, to cloud, to containerized microservices
  • 26. Find. Deploy. Extend. • Download extensions • Share extensions • Community driven • Constant additions (monthly) • Large Data Pipeline Focus • Rapid creation of new integrations Orchestration = Integration 26 © Stonebranch 2022. All rights reserved.
  • 27. What to Look for in a Data Pipeline Orchestration Solution 27 © Stonebranch 2022. All rights reserved.
  • 28. Summary Who is this for? • Want to keep using existing data tools, but are ready to graduate from opensource schedulers to enterprise grade platforms • Would like a single platform to connect Data Teams, Developers, IT Ops, and Cloud Ops teams – to help scale their data program • Need to operationalize DataOps methodologies to gain speed and improve data quality • Want to gain full visibility across the entire pipeline – to move quickly when issue arise • Have a growing or changing data tool landscape, and need the ability to rapidly build new integrations (or download pre-existing integrations) • Need to enable data scientists or business users with simple self-service capabilities via the platform or third-party tools like ServiceNow, Microsoft Teams, or Slack • Bonus: Want a central IT automation and orchestration platform (beyond data pipeline orchestration) to support cloud automation, on-prem automation, traditional job scheduling, and DevOps orchestration © Stonebranch 2022. All rights reserved. 28
  • 29. © Stonebranch 2022. All rights reserved. 29 Q & A Scott Davis Global Vice President scott.davis@stonebranch.com Stonebranch - Atlanta, USA Ravi Murugesan Sr. Solution Engineer ravi.murugesan@stonebranch.com Stonebranch – Frankfurt, Germany
  • 30. Thank You © Stonebranch 2022. All rights reserved.