SlideShare uma empresa Scribd logo
1 de 35
Baixar para ler offline
Power of the RunGraph
Andrew Morgan, Dan Smith
Power of the Run Graph
Introduction
Exploring the Problem
The Idea
Registration
Intro: An anecdote
Exploring the Problem
“Many hands” legacy:
Loosely connected yet critical
data pipelines
are remarkably complex
in enterprises when viewed
as a whole system.
They are hard to manage, operate
and improve as a group.
“Entanglement” a major risk.
Loosely
connected,
use-case
focussed,
data
pipelines
How Complex?
Typically across an Enterprise:
100s of Production OLTP databases
Multiple Orchestration/scheduling tools
10s of ETL tools / instances
Many Kafka/Confluent installations
Multiple Logging/monitoring frameworks
10-100 OLAP reporting solutions
1000s of Reports
1000s of Web pages and/or microservices
Several Clouds and Data Centres
Several Data Warehouses
10+ Data Science sand boxes
Multiple Data Lakes
Loosely
connected,
use-case
focussed,
data
pipelines
Challenges:
Data management is difficult:
● Managing change effectively
● Managing quality of service
● Delivering service oversight
● Attributing clear issue ownership
● Resolving complex failures
● Delivering trust: “Ground Truth”
Loosely
connected,
use-case
focussed,
data
pipelines
A side issue, also surfacing
Top-Down Ent. Data Architecture
(methods/governance) are deeply
unpopular, especially with engineers.
Why? It is ineffective
●
●
●
●
●
Loosely
connected,
use-case
focussed,
data
pipelines
Sizing the problem
How complex? (a)
Example:
Here’s a “logical” summary of
data flows in one enterprise,
between production systems.
Shows: 100’s of logical
data pipelines, made up of:
- Batch ETL
- Messaging
- Streaming
Complex Pipeline Dependencies
How complex? (b)
Complexity also exists in the
content, not just in the pipes.
Here’s a conceptual model, a
“canonical data model,” for
most of a global firm:
410 core entities, 14 subjects.
Complex Content Dependencies
How complex? (c)
Even our OLAP reporting
architectures, are now
pipeline oriented and are
“inside out” rather than older
“star schemas”
Fact Pipelines and
sinks
Core dimension
pipelines and sinks
Peripheral Dimensions:
“side inputs,” lookups,
dictionaries, tags
Complex Information Dependencies
Notice the shape of this meta-data?
Notice the amount of existing engineering that must sit behind these views?
The weight of legacy
There is a huge amount of legacy data pipelines, and migrating
them requires retesting everything. Heterogeneous approach.
“Can you stabilize my operation, while moving Net New functionality to the cloud?”
- Many legacy ETL systems
- Many Orchestration / Scheduling instances
- Many datacentres, not just Cloud
- Many monolithic applications, still
- Many legacy flows undocumented, misunderstood
- Many hidden pipelines, in DB stored procedures
- New functionality in Cloud.
Legacy
Pipelines
New
Pipelines
The combined service.
The Idea
data services
RunGraph
Framing the Problem
Framing the Problem
Let’s examine this
Shared Data Platform
GRAKN
Shared Data Platform
Acquisition
Pipelines
Consumer
Pipelines
Data Ops Enabled Pipelines
Add
Pipeline
Registration
Data to
GRAKN
Data Ops Enabled Pipelines
We add
Pipeline
Events and
metrics to
GRAKN
Data Ops Enabled Pipelines
RunGraph
Data Ops Enabled Pipelines
We build
Pipeline
Intelligence
via
GRAKN Run Log
RunGraph
Registration
(tool agnostic)
Instrumentation
(tool agnostic)
Data Ops Enabled Pipelines
We build
Pipeline
Intelligence
via
GRAKN
RunGraph
● We can register ANY pipeline on our
estate, run using any orchestration tool
or ETL scheduler
● We can retro-fit legacy pipelines into the
run graph, even legacy ETL tools
● We can build up complex enterprise
architecture views, and establish ground
truth, about
● We can determine “normal” pipeline
behaviors, and identify strange
behaviours and raise flags
● We can use GRAKN ML facilties to start
doing predictive analytics on operations
Data Ops Enabled Pipelines
We build
Pipeline
Intelligence
via
GRAKN
Registration
(tool agnostic)
Instrumentation
(tool agnostic)
RunGraph
● Studies all Pipeline Instrumentation
● Tracks Data Flows / Lineage
● Creates Data Quality Expectations
● Does Impact Analysis of Failures (usual
ancestors) and prioritisation
● Identifies Key and Critical Data Assets,
(ie Core Dimensions)
● Tracks Data Lineage vs Data Quality
● Maps complex consumers to sources
bringing commercial line of sight
● Does Change Impact Analysis
Data Ops Enabled Pipelines
We build
Pipeline
Intelligence
via
GRAKN
Registration
(tool agnostic)
Instrumentation
(tool agnostic)
Hybrid Data Ops Console
Once we can instrument across legacy and new cloud
environments, we can construct a combined Ops Console.
Legacy
Pipelines
New
Pipelines
The combined service.
Consumer Service Dashboards
and Operations Console.
Registration
RunGraph Model: Registration + Job
RunGraph: Registration + Job
Policy
Feed
Job
Data
RunGraph: Registration + Job
Policy
Source
Feeds
Jobs
Data
We can summarise the core registration
needs here.
Registering these makes them addressable,
actionable, and enriches the pipeline analytics.
RunGraph: Analytics
Even simple use cases, drive out value quickly:
On failure, unplanned change:
- Find descendants - remediation based on impact, contagion
- Find ancestors - apply pressure / corrections upstream
Planned change
- Run analytic queries to show typical connections over 6 months -- reverse
engineer your architectures
- Identify key risks in planned change
RunGraph improves your AI, BI, UI
Try it at home
There are some great open-source projects to check out:
Get in touch
Dr. Daniel A. Smith
Emerging Technology
dan.smith@6point6.co.uk
About 6point6
Integrating digital technology into your business can result in
fundamental changes to how you operate and deliver value to your
customers. To go digital is to reinvent yourself to the core, opening
yourself and your clients to a world of possibilities.
6point6 is a technology consultancy. We bring a wealth of hands-on
experience to help financial service providers, media houses and
government achieve more with digital. Using cutting edge technology
and agile delivery methods, we help you reinvent, transform and
secure a brighter digital future.
Visit us on www.6point6.co.uk
Twitter: @6point6ltd
LinkedIn: linkedin.com/company/6point6

Mais conteúdo relacionado

Mais procurados

Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
Ravi Teja
 

Mais procurados (20)

Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best PracticesNeo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Sustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive AnalyticsSustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive Analytics
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail Banking
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
 
Scalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and HowScalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and How
 
Going Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph AnalyticsGoing Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph Analytics
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
Lessons from building a stream-first metadata platform | Shirshanka Das, StealthLessons from building a stream-first metadata platform | Shirshanka Das, Stealth
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
 
Stanford DeepDive Framework
Stanford DeepDive FrameworkStanford DeepDive Framework
Stanford DeepDive Framework
 
Scalability and Graph Analytics with Neo4j - Stefan Kolmar, Neo4j
Scalability and Graph Analytics with Neo4j - Stefan Kolmar, Neo4jScalability and Graph Analytics with Neo4j - Stefan Kolmar, Neo4j
Scalability and Graph Analytics with Neo4j - Stefan Kolmar, Neo4j
 
Modern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in InsuranceModern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in Insurance
 
The Year of the Graph
The Year of the GraphThe Year of the Graph
The Year of the Graph
 
BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 

Semelhante a Power of the Run Graph

Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
Capgemini
 
Grafana overview deck - Tech - 2023 May v1.pdf
Grafana overview deck  - Tech - 2023 May v1.pdfGrafana overview deck  - Tech - 2023 May v1.pdf
Grafana overview deck - Tech - 2023 May v1.pdf
BillySin5
 

Semelhante a Power of the Run Graph (20)

86921864 olap-case-study-vj
86921864 olap-case-study-vj86921864 olap-case-study-vj
86921864 olap-case-study-vj
 
From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...
 
Oracle Data Integration - Overview
Oracle Data Integration - OverviewOracle Data Integration - Overview
Oracle Data Integration - Overview
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
 
Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
Intelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockIntelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff Pollock
 
Implement Test Harness For Streaming Data Pipelines
Implement Test Harness For Streaming Data PipelinesImplement Test Harness For Streaming Data Pipelines
Implement Test Harness For Streaming Data Pipelines
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 
Cloud-Scale BGP and NetFlow Analysis
Cloud-Scale BGP and NetFlow AnalysisCloud-Scale BGP and NetFlow Analysis
Cloud-Scale BGP and NetFlow Analysis
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive Applications
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Grafana overview deck - Tech - 2023 May v1.pdf
Grafana overview deck  - Tech - 2023 May v1.pdfGrafana overview deck  - Tech - 2023 May v1.pdf
Grafana overview deck - Tech - 2023 May v1.pdf
 

Mais de Vaticle

Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Vaticle
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
Vaticle
 
Unifying Space Mission Knowledge with NLP & Knowledge Graph
Unifying Space Mission Knowledge with NLP & Knowledge GraphUnifying Space Mission Knowledge with NLP & Knowledge Graph
Unifying Space Mission Knowledge with NLP & Knowledge Graph
Vaticle
 
Knowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdfKnowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdf
Vaticle
 
TypeDB Academy | Modelling Principles
TypeDB Academy | Modelling PrinciplesTypeDB Academy | Modelling Principles
TypeDB Academy | Modelling Principles
Vaticle
 
Intro to TypeDB and TypeQL | A strongly-typed database
Intro to TypeDB and TypeQL | A strongly-typed databaseIntro to TypeDB and TypeQL | A strongly-typed database
Intro to TypeDB and TypeQL | A strongly-typed database
Vaticle
 
Graph Databases vs TypeDB | What you can't do with graphs
Graph Databases vs TypeDB | What you can't do with graphsGraph Databases vs TypeDB | What you can't do with graphs
Graph Databases vs TypeDB | What you can't do with graphs
Vaticle
 

Mais de Vaticle (20)

Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
 
Loading Huge Amounts of Data
Loading Huge Amounts of DataLoading Huge Amounts of Data
Loading Huge Amounts of Data
 
Natural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge GraphNatural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge Graph
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
 
Unifying Space Mission Knowledge with NLP & Knowledge Graph
Unifying Space Mission Knowledge with NLP & Knowledge GraphUnifying Space Mission Knowledge with NLP & Knowledge Graph
Unifying Space Mission Knowledge with NLP & Knowledge Graph
 
The Next Big Thing in AI - Causality
The Next Big Thing in AI - CausalityThe Next Big Thing in AI - Causality
The Next Big Thing in AI - Causality
 
Building a Cyber Threat Intelligence Knowledge Graph
Building a Cyber Threat Intelligence Knowledge GraphBuilding a Cyber Threat Intelligence Knowledge Graph
Building a Cyber Threat Intelligence Knowledge Graph
 
Knowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdfKnowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdf
 
Building a Distributed Database with Raft.pdf
Building a Distributed Database with Raft.pdfBuilding a Distributed Database with Raft.pdf
Building a Distributed Database with Raft.pdf
 
Enabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfEnabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdf
 
TypeDB Academy | Inference with Rules
TypeDB Academy | Inference with RulesTypeDB Academy | Inference with Rules
TypeDB Academy | Inference with Rules
 
TypeDB Academy | Modelling Principles
TypeDB Academy | Modelling PrinciplesTypeDB Academy | Modelling Principles
TypeDB Academy | Modelling Principles
 
Beyond SQL - Comparing SQL to TypeQL
Beyond SQL - Comparing SQL to TypeQLBeyond SQL - Comparing SQL to TypeQL
Beyond SQL - Comparing SQL to TypeQL
 
TypeDB Academy- Getting Started with Schema Design
TypeDB Academy- Getting Started with Schema DesignTypeDB Academy- Getting Started with Schema Design
TypeDB Academy- Getting Started with Schema Design
 
Comparing Semantic Web Technologies to TypeDB
Comparing Semantic Web Technologies to TypeDBComparing Semantic Web Technologies to TypeDB
Comparing Semantic Web Technologies to TypeDB
 
Reasoner, Meet Actors | TypeDB's Native Reasoning Engine
Reasoner, Meet Actors | TypeDB's Native Reasoning EngineReasoner, Meet Actors | TypeDB's Native Reasoning Engine
Reasoner, Meet Actors | TypeDB's Native Reasoning Engine
 
Intro to TypeDB and TypeQL | A strongly-typed database
Intro to TypeDB and TypeQL | A strongly-typed databaseIntro to TypeDB and TypeQL | A strongly-typed database
Intro to TypeDB and TypeQL | A strongly-typed database
 
Graph Databases vs TypeDB | What you can't do with graphs
Graph Databases vs TypeDB | What you can't do with graphsGraph Databases vs TypeDB | What you can't do with graphs
Graph Databases vs TypeDB | What you can't do with graphs
 
Pandora Paper Leaks With TypeDB
 Pandora Paper Leaks With TypeDB Pandora Paper Leaks With TypeDB
Pandora Paper Leaks With TypeDB
 
Strongly Typed Data for Machine Learning
Strongly Typed Data for Machine LearningStrongly Typed Data for Machine Learning
Strongly Typed Data for Machine Learning
 

Último

Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 

Último (20)

Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 

Power of the Run Graph

  • 1. Power of the RunGraph Andrew Morgan, Dan Smith
  • 2. Power of the Run Graph Introduction Exploring the Problem The Idea Registration
  • 5. “Many hands” legacy: Loosely connected yet critical data pipelines are remarkably complex in enterprises when viewed as a whole system. They are hard to manage, operate and improve as a group. “Entanglement” a major risk. Loosely connected, use-case focussed, data pipelines
  • 6. How Complex? Typically across an Enterprise: 100s of Production OLTP databases Multiple Orchestration/scheduling tools 10s of ETL tools / instances Many Kafka/Confluent installations Multiple Logging/monitoring frameworks 10-100 OLAP reporting solutions 1000s of Reports 1000s of Web pages and/or microservices Several Clouds and Data Centres Several Data Warehouses 10+ Data Science sand boxes Multiple Data Lakes Loosely connected, use-case focussed, data pipelines
  • 7. Challenges: Data management is difficult: ● Managing change effectively ● Managing quality of service ● Delivering service oversight ● Attributing clear issue ownership ● Resolving complex failures ● Delivering trust: “Ground Truth” Loosely connected, use-case focussed, data pipelines
  • 8. A side issue, also surfacing Top-Down Ent. Data Architecture (methods/governance) are deeply unpopular, especially with engineers. Why? It is ineffective ● ● ● ● ● Loosely connected, use-case focussed, data pipelines
  • 10. How complex? (a) Example: Here’s a “logical” summary of data flows in one enterprise, between production systems. Shows: 100’s of logical data pipelines, made up of: - Batch ETL - Messaging - Streaming Complex Pipeline Dependencies
  • 11. How complex? (b) Complexity also exists in the content, not just in the pipes. Here’s a conceptual model, a “canonical data model,” for most of a global firm: 410 core entities, 14 subjects. Complex Content Dependencies
  • 12. How complex? (c) Even our OLAP reporting architectures, are now pipeline oriented and are “inside out” rather than older “star schemas” Fact Pipelines and sinks Core dimension pipelines and sinks Peripheral Dimensions: “side inputs,” lookups, dictionaries, tags Complex Information Dependencies
  • 13. Notice the shape of this meta-data? Notice the amount of existing engineering that must sit behind these views?
  • 14. The weight of legacy There is a huge amount of legacy data pipelines, and migrating them requires retesting everything. Heterogeneous approach. “Can you stabilize my operation, while moving Net New functionality to the cloud?” - Many legacy ETL systems - Many Orchestration / Scheduling instances - Many datacentres, not just Cloud - Many monolithic applications, still - Many legacy flows undocumented, misunderstood - Many hidden pipelines, in DB stored procedures - New functionality in Cloud. Legacy Pipelines New Pipelines The combined service.
  • 20. Data Ops Enabled Pipelines
  • 22. We add Pipeline Events and metrics to GRAKN Data Ops Enabled Pipelines
  • 23. RunGraph Data Ops Enabled Pipelines We build Pipeline Intelligence via GRAKN Run Log
  • 24. RunGraph Registration (tool agnostic) Instrumentation (tool agnostic) Data Ops Enabled Pipelines We build Pipeline Intelligence via GRAKN
  • 25. RunGraph ● We can register ANY pipeline on our estate, run using any orchestration tool or ETL scheduler ● We can retro-fit legacy pipelines into the run graph, even legacy ETL tools ● We can build up complex enterprise architecture views, and establish ground truth, about ● We can determine “normal” pipeline behaviors, and identify strange behaviours and raise flags ● We can use GRAKN ML facilties to start doing predictive analytics on operations Data Ops Enabled Pipelines We build Pipeline Intelligence via GRAKN Registration (tool agnostic) Instrumentation (tool agnostic)
  • 26. RunGraph ● Studies all Pipeline Instrumentation ● Tracks Data Flows / Lineage ● Creates Data Quality Expectations ● Does Impact Analysis of Failures (usual ancestors) and prioritisation ● Identifies Key and Critical Data Assets, (ie Core Dimensions) ● Tracks Data Lineage vs Data Quality ● Maps complex consumers to sources bringing commercial line of sight ● Does Change Impact Analysis Data Ops Enabled Pipelines We build Pipeline Intelligence via GRAKN Registration (tool agnostic) Instrumentation (tool agnostic)
  • 27. Hybrid Data Ops Console Once we can instrument across legacy and new cloud environments, we can construct a combined Ops Console. Legacy Pipelines New Pipelines The combined service. Consumer Service Dashboards and Operations Console.
  • 30. RunGraph: Registration + Job Policy Feed Job Data
  • 31. RunGraph: Registration + Job Policy Source Feeds Jobs Data We can summarise the core registration needs here. Registering these makes them addressable, actionable, and enriches the pipeline analytics.
  • 32. RunGraph: Analytics Even simple use cases, drive out value quickly: On failure, unplanned change: - Find descendants - remediation based on impact, contagion - Find ancestors - apply pressure / corrections upstream Planned change - Run analytic queries to show typical connections over 6 months -- reverse engineer your architectures - Identify key risks in planned change
  • 34. Try it at home There are some great open-source projects to check out:
  • 35. Get in touch Dr. Daniel A. Smith Emerging Technology dan.smith@6point6.co.uk About 6point6 Integrating digital technology into your business can result in fundamental changes to how you operate and deliver value to your customers. To go digital is to reinvent yourself to the core, opening yourself and your clients to a world of possibilities. 6point6 is a technology consultancy. We bring a wealth of hands-on experience to help financial service providers, media houses and government achieve more with digital. Using cutting edge technology and agile delivery methods, we help you reinvent, transform and secure a brighter digital future. Visit us on www.6point6.co.uk Twitter: @6point6ltd LinkedIn: linkedin.com/company/6point6