SlideShare a Scribd company logo
1 of 39
KENT GRAZIANO
AGILE DATA ENGINEERING:
INTRODUCTION TO DATA VAULT DATA
MODELING
@KentGraziano kentgraziano.com
Agenda
2
Bio
What do we mean by Agile?
What is a Data Vault?
Where does it fit in an DW/BI architecture
How to design a Data Vault model
Being “agile” with Data Vault
What’s new in DV 2.0
My Bio
3
› Senior Technical Evangelist, Snowflake Computing
› Oracle ACE Director (BI/DW)
› Certified Data Vault Master and DV 2.0 Practitioner
› Data Modeling, Data Architecture and Data Warehouse
Specialist
• 30+ years in IT
• 25+ years of Oracle-related work
• 20+ years of data warehousing experience
› Member – DAMA Houston
› Former-Member: Boulder BI Brain Trust
(http://www.boulderbibraintrust.org/)
› Author & Co-Author of a bunch of books
• The Business of Data Vault Modeling
• The Data Model Resource Book (1st Edition)
› Blogger: The Data Warrior
› Past-President of Oracle Development Tools User Group
and Rocky Mountain Oracle User Group
Manifesto for Agile Software Development
4
“We are uncovering better ways of developing software by
doing it and helping others do it.
Through this work we have come to value:
Individuals and interactions over processes and tools
Working software over comprehensive documentation
Customer collaboration over contract negotiation
Responding to change over following a plan
That is, while there is value in the items on the right, we value
the items on the left more.”
http://agilemanifesto.org/
Applying the Agile Manifesto to DW
(C) Kent Graziano 5
User Stories instead of requirements documents
Time-boxed iterations
› Iteration has a standard length
› Choose one or more user stories to fit in that
iteration
Rework is part of the game
› There are no “missed requirements”... only those
that haven’t been delivered or discovered yet.
Data Vault Definition
TDAN.com Article 6
The Data Vault is a detail oriented, historical tracking and uniquely linked
set of normalized tables that support one or more functional areas of
business.
It is a hybrid approach encompassing the best of breed between 3rd normal
form (3NF) and star schema. The design is flexible, scalable, consistent
and adaptable to the needs of the enterprise.
Architected specifically to meet the needs of today’s
enterprise data warehouses
DAN LINSTEDT: Defining the Data Vault
What is Data Vault Trying to Solve?
(C) Kent Graziano 7
What are our other Enterprise Data Warehouse
options?
› Third-Normal Form (3NF): Complex primary keys (PK’s)
with cascading snapshot dates
› Star Schema (Dimensional): Difficult to reengineer fact
tables for granularity changes
Difficult to get it right the first time
Not adaptable to rapid business change
NOT AGILE!
Data Vault Time Line
© LearnDataVault.com 8
20001960 1970 1980 1990
E.F. Codd invented
relational modeling
Chris Date and
Hugh Darwen
Maintained and
Refined Modeling
1976 Dr Peter Chen
Created E-R
Diagramming
Early 70’s Bill Inmon
Began Discussing
Data Warehousing
Mid 60’s Dimension &
Fact Modeling
presented by General
Mills and Dartmouth
University
Mid 70’s AC Nielsen
Popularized
Dimension & Fact
Terms
Mid – Late 80’s Dr Kimball
Popularizes Star Schema
Mid 80’s Bill Inmon
Popularizes Data
Warehousing
Late 80’s – Barry Devlin and
Dr Kimball Release
“Business Data Warehouse”
1990 – Dan Linstedt
Begins R&D on Data
Vault Modeling
2000 – Dan
Linstedt releases
first 5 articles on
Data Vault
Modeling
Data Vault Evolution
(C) Kent Graziano 9
The work on the Data Vault approach began in the early 1990s, and completed around 1999.
Throughout 1999, 2000, and 2001, the Data Vault design was tested, refined, and deployed
into specific customer sites.
In 2002, the industry thought leaders were asked to review the architecture.
This is when I attend my first DV seminar in Denver and met Dan!
In 2003, Dan began teaching the modeling techniques to the mass public.
In 2014, Dan introduced DV 2.0!
Where does a Data Vault Fit?
© LearnDataVault.com 10
STAGING EDW
DATA VAULT
DATA MARTS
(STAR SCHEMAS)
DATA MARTS
(STAR SCHEMAS)
DATA MARTS
(STAR SCHEMAS)
Where does Data Vault fit?
©Oracle Corp 11
Data Vault
goes here
Data Vault: 3 Simple Structures
© LearnDataVault.com 12
EDW
DATA VAULT
HUB
LINK
SATELITE
01
02
03
Data Vault Core Architecture
© LearnDataVault.com 13
HUBS
Unique List of
Business Keys
LINKS
Unique List of
Relationships
across keys
SATELITES
Descriptive Data
› Satellites have one and only one parent table
› Satellites cannot be “Parents” to other tables
› Hubs cannot be child tables
Common Attributes
© LearnDataVault.com 14
Required –
all structures
› Primary key – PK
› Load date time
stamp – DTS
› Record source –
REC_SRC
Required –
Satellites only
› Load end date
time stamp –
LEDTS
› Optional in DV
2.0
Optional –
Hubs & Links
only
› Last seen dates
– LSDTs
› MD5KEY
(REQUIRED IN
DV 2.0)
Optional –
Satellites only
› Load sequence ID –
LDSEQ_ID
› Update user –
UPDT_USER
› Update DTS –
UPDT_DTS
› MD5DIFF
1. Hub = Business Keys
(C) Kent Graziano 15
Hubs = Unique Lists of Business Keys
Business Keys are used to TRACK and IDENTIFY key information
New: DV 2.0 uses MD5 Hash of the BK for the PK
2: Links = Associations
(C) Kent Graziano 16
Links = Transactions and Associations
They are used to hook together multiple sets of information
In DV 2.0 the BK attributes may migrate to the Links for faster query
Modeling Links - 1:1 or 1:M?
(C) Kent Graziano 17
Today Tomorrow With a Link in The Data Vault
Relationship is
a 1:1 so why
model a Link?
The business rule
can change to a
1:M.
You discover new
data later.
No need to
change the EDW
structure.
Existing data is
fine.
New data is
added.
3. Satellites = Descriptors
(C) Kent Graziano 18
Satellites provide context for the Hubs and the Links
Tracks changes over time - Like SCD 2
In DV 2.0 use HASH_DIFF to detect changes
Data Vault Model Flexibility (Agility)
(C) Kent Graziano 19
Goes beyond
standard 3NF
Based on natural
business keys
Hyper normalized
› Hubs and Links only hold keys and meta data
› Satellites split by rate of change and/or source
Enables Agile data modeling
› Easy to add to model without having to change existing
structures and load routines
• Relationships (links) can be dropped and created on-demand.
› No more reloading history because of a missed requirement
Not system surrogate keys
Allows for integrating data across functions
and source systems more easily
› All data relationships are key driven
Data Vault Extensibility
(C) LearnDataVault.com 20
Adding new
components to the
EDW has NEAR
ZERO impact to:
› Existing Loading
Processes
› Existing Data Model
› Existing Reporting &
BI Functions
› Existing Source
Systems
› Existing Star
Schemas and Data
Marts
Data Vault Productivity
(C) Kent Graziano 21
› Standardized modeling rules
• Highly repeatable and learnable modeling
technique
• Can standardize load routines
o Delta Driven process
o Re-startable, consistent loading patterns.
• Can standardize extract routines
o Rapid build of new or revised Data Marts
• Can be automated
• Can use a BI-meta layer to virtualize the
reporting structures
o Example: OBIEE Business Model and
Mapping tool
o Example: BOBJ Universe Business Layer
• Can put views on the DV structures as well
o Simulate ODS/3NF or Star Schemas
Data Vault Adaptability
(C) Kent Graziano 22
› The Data Vault holds granular
historical relationships.
• Holds all history for all time, allowing any
source system feeds to be reconstructed
on-demand
o Easy generation of Audit Trails for data
lineage and compliance.
o Data Mining can discover new
relationships between elements
o Patterns of change emerge from the
historical pictures and linkages.
› The Data Vault can be accessed by
power-users
Other Benefits of a Data Vault
(C) Kent Graziano 23
› Modeling it as a DV forces integration
of the Business Keys upfront
• Good for organizational alignment
› An integrated data set with raw data
extends it’s value beyond BI:
• Source for data quality projects
• Source for master data
• Source for data mining
• Source for Data as a Service (DaaS) in
an SOA (Service Oriented Architecture).
Other Benefits of a Data Vault
(C) Kent Graziano 24
› Upfront Hub integration simplifies the
data integration routines required to
load data marts.
• Helps divide the work a bit.
› It is much easier to implement security
on these granular pieces.
› Granular, re-startable processes enable
pin-point failure correction.
› It is designed and optimized for real-
time loading in its core architecture
(without any tweaks or mods).
How to be Agile using DV
(C) Kent Graziano 25
Model iteratively
› Use Data Vault data
modeling technique
› Create basic components,
then add over time
Virtualize the Access Layer
› Don’t waste time building
facts and dimensions up
front
ETL and testing takes too
long
› “Project” objects using
pattern-based DV model with
database views (or BI meta
layer)
Users see real reports with
real data
› Can always build out for
performance in another
iteration
26
WHAT IS
THE
WORLD'S
SMALLEST
DATA VAULT?
Worlds Smallest Data Vault
© LearnDataVault.com 27
Hub Customer
Hub_Cust_Seq_ID
Hub_Cust_Num
Hub_Cust_Load_DTS
Hub_Cust_Rec_Src
Hub_Cust_Seq_ID
Sat_Cust_Load_DTS
Sat_Cust_Load_End_DTS
Sat_Cust_Name
Sat_Cust_Rec_Src
Satellite
Customer Name
› The Data Vault doesn’t have to be “BIG”.
› A Data Vault can be built incrementally.
› Reverse engineering one component of the
existing models is not uncommon.
› Building one part of the Data Vault, then
changing the marts to feed from that vault is a
best practice.
› The smallest Enterprise Data Warehouse
consists of two tables:
• One Hub,
• One Satellite
Notably…
28
› In 2008 Bill Inmon stated that the
“Data Vault is the optimal approach for
modeling the EDW in the DW2.0
framework.” (DW2.0)
› The number of Data Vault users in the
US surpassed 500 in 2010 and grows
rapidly (http://danlinstedt.com/about/dv-
customers/)
Organizations using Data Vault
29
› WebMD Health Services
› Anthem Blue-Cross Blue Shield
› MD Anderson Cancer Center
› Denver Public Schools
› Independent Purchasing Cooperative
(IPC, Miami)
• Owner of Subway
› Kaplan
› US Defense Department
› Colorado Springs Utilities
› State Court of Wyoming
› Federal Express
› US Dept. Of Agriculture
What’s New in DV2.0?
© LearnDataVault.com 30
Modeling
Structure
Includes…
› NoSQL, and Non-
Relational DB
systems, Hybrid
Systems
› Minor Structure
Changes to support
NoSQL
01 02 03 04
New ETL
Implementation
Standards
› For true real-time
support
› For NoSQL support
New
Architecture
Standards
› To include support
for NoSQL data
management
systems
New
Methodology
Components
› Including CMMI, Six
Sigma, and TQM
› Including Project
Planning, Tracking,
and Oversight
› Agile Delivery
Mechanisms
› Standards, and
templates for
Projects
What’s New in DV2.0?
This model is
fully compliant
with Hadoop,
needs NO
changes to work
properly
Note:
Business Keys
replicated to the Link
structure for “join”
capabilities on the
way out to Data
Marts.
© LearnDataVault.com 31
Summary
32
Data Vault provides a data modeling technique that allows:
Model Agility Productivity So? Agile Data
Warehousing?
01 02 03
› Enabling rapid
changes and
additions
› Enabling low
complexity
systems with high
value output at a
rapid pace
› Easy projections
of dimensional
models
33
› Available on Amazon:
http://www.amazon.com/Better-
Data-Modeling-Introduction-
Engineering-ebook /dp/
B018BREV1C/
Shameless Plug:
34
› Available on Amazon.com
› Soft Cover or Kindle Format
› Now also available in PDF at
LearnDataVault.com
› Hint: Kent is the Technical Editor
Super Charge
Your Data Warehouse
35
› Available on Amazon:
http://www.amazon.com/Buildin
g-Scalable-Data-Warehouse-
Vault/dp/0128025107/
New DV 2.0 Book
Register at wwdvc.com
36
Data Vault References
37
www.youtube.com/LearnDataVault www.facebook.com/learndatavault
www.learndatavault.com
www.danlinstedt.com
38
QUESTIONS?
Contact Information
39
KENT GRAZIANO
Snowflake Computing
www.snowflake.net
kent.graziano@snowflake.net
@KentGraziano
http://kentgraziano.com

More Related Content

What's hot

Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookJames Serra
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Kent Graziano
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Michael Olschimke
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdfBOSupport
 
Data Vault Vs Data Lake
Data Vault Vs Data LakeData Vault Vs Data Lake
Data Vault Vs Data LakeCalum Miller
 
Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Empowered Holdings, LLC
 
Speeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachSpeeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachDatabricks
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 
Conceptual vs. Logical vs. Physical Data Modeling
Conceptual vs. Logical vs. Physical Data ModelingConceptual vs. Logical vs. Physical Data Modeling
Conceptual vs. Logical vs. Physical Data ModelingDATAVERSITY
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceDenodo
 
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Denodo
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Edureka!
 
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeData Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeWhereScape
 

What's hot (20)

Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
Operational Data Vault
Operational Data VaultOperational Data Vault
Operational Data Vault
 
Data Vault Introduction
Data Vault IntroductionData Vault Introduction
Data Vault Introduction
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
 
Data Vault Vs Data Lake
Data Vault Vs Data LakeData Vault Vs Data Lake
Data Vault Vs Data Lake
 
Dimensional Modelling
Dimensional ModellingDimensional Modelling
Dimensional Modelling
 
Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Présentation data vault et bi v20120508
Présentation data vault et bi v20120508
 
Speeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachSpeeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT Approach
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Conceptual vs. Logical vs. Physical Data Modeling
Conceptual vs. Logical vs. Physical Data ModelingConceptual vs. Logical vs. Physical Data Modeling
Conceptual vs. Logical vs. Physical Data Modeling
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
 
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeData Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
 

Viewers also liked

Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileDaniel Upton
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureKent Graziano
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSKent Graziano
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignKent Graziano
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data WarehousingKent Graziano
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsKent Graziano
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerKent Graziano
 
Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Kent Graziano
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016Kent Graziano
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Kent Graziano
 
Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249Daniel Upton
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultDaniel Upton
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingDaniel Upton
 
How to become a certified data vault data modeler #CDVDM
How to become a certified data vault data modeler #CDVDMHow to become a certified data vault data modeler #CDVDM
How to become a certified data vault data modeler #CDVDMErik Fransen
 
Applications In android game - An Upgrade
 Applications In android game - An Upgrade Applications In android game - An Upgrade
Applications In android game - An Upgradepanoramicchambe68
 
Pimping SQL Developer and Data Modeler
Pimping SQL Developer and Data ModelerPimping SQL Developer and Data Modeler
Pimping SQL Developer and Data ModelerKris Rice
 
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...FrederikN
 

Viewers also liked (20)

Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes Agile
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse Design
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data Warehousing
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data Modeler
 
Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
 
Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data Vault
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and Modelstorming
 
Data vault modeling et retour d'expérience
Data vault modeling et retour d'expérienceData vault modeling et retour d'expérience
Data vault modeling et retour d'expérience
 
How to become a certified data vault data modeler #CDVDM
How to become a certified data vault data modeler #CDVDMHow to become a certified data vault data modeler #CDVDM
How to become a certified data vault data modeler #CDVDM
 
Stress FEA 1
Stress FEA 1Stress FEA 1
Stress FEA 1
 
Applications In android game - An Upgrade
 Applications In android game - An Upgrade Applications In android game - An Upgrade
Applications In android game - An Upgrade
 
Pimping SQL Developer and Data Modeler
Pimping SQL Developer and Data ModelerPimping SQL Developer and Data Modeler
Pimping SQL Developer and Data Modeler
 
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
 

Similar to Agile Data Engineering: Introduction to Data Vault Data Modeling

Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesDenodo
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?Denodo
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachKent Graziano
 
The Death of the Star Schema
The Death of the Star SchemaThe Death of the Star Schema
The Death of the Star SchemaDATAVERSITY
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesIvo Andreev
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Modern data warehouse presentation
Modern data warehouse presentationModern data warehouse presentation
Modern data warehouse presentationDavid Rice
 
Introduction to data vault ilja dmitrijev
Introduction to data vault   ilja dmitrijevIntroduction to data vault   ilja dmitrijev
Introduction to data vault ilja dmitrijevIlja Dmitrijevs
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing DataDatabricks
 
Data Vault 2.0 Demystified: East Coast Tour
Data Vault 2.0 Demystified: East Coast TourData Vault 2.0 Demystified: East Coast Tour
Data Vault 2.0 Demystified: East Coast TourWhereScape
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligenceAhsan Kabir
 
Data Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data WarehousingData Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data WarehousingAll Things Open
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Denodo
 
Migrating on premises workload to azure sql database
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql databasePARIKSHIT SAVJANI
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Trivadis
 

Similar to Agile Data Engineering: Introduction to Data Vault Data Modeling (20)

Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business Outcomes
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 
The Death of the Star Schema
The Death of the Star SchemaThe Death of the Star Schema
The Death of the Star Schema
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best Practices
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Modern data warehouse presentation
Modern data warehouse presentationModern data warehouse presentation
Modern data warehouse presentation
 
Introduction to data vault ilja dmitrijev
Introduction to data vault   ilja dmitrijevIntroduction to data vault   ilja dmitrijev
Introduction to data vault ilja dmitrijev
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing Data
 
datavault2.pptx
datavault2.pptxdatavault2.pptx
datavault2.pptx
 
Data Vault 2.0 Demystified: East Coast Tour
Data Vault 2.0 Demystified: East Coast TourData Vault 2.0 Demystified: East Coast Tour
Data Vault 2.0 Demystified: East Coast Tour
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligence
 
Data Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data WarehousingData Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data Warehousing
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)
 
Migrating on premises workload to azure sql database
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql database
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
 
Speak to Your Data
Speak to Your DataSpeak to Your Data
Speak to Your Data
 

More from Kent Graziano

Balance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudBalance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudKent Graziano
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...Kent Graziano
 
Rise of the Data Cloud
Rise of the Data CloudRise of the Data Cloud
Rise of the Data CloudKent Graziano
 
Delivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with SnowflakeDelivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with SnowflakeKent Graziano
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Kent Graziano
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on ReadKent Graziano
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWKent Graziano
 

More from Kent Graziano (8)

Balance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudBalance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data Cloud
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
 
Rise of the Data Cloud
Rise of the Data CloudRise of the Data Cloud
Rise of the Data Cloud
 
Delivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with SnowflakeDelivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with Snowflake
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 

Recently uploaded

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 

Agile Data Engineering: Introduction to Data Vault Data Modeling

  • 1. KENT GRAZIANO AGILE DATA ENGINEERING: INTRODUCTION TO DATA VAULT DATA MODELING @KentGraziano kentgraziano.com
  • 2. Agenda 2 Bio What do we mean by Agile? What is a Data Vault? Where does it fit in an DW/BI architecture How to design a Data Vault model Being “agile” with Data Vault What’s new in DV 2.0
  • 3. My Bio 3 › Senior Technical Evangelist, Snowflake Computing › Oracle ACE Director (BI/DW) › Certified Data Vault Master and DV 2.0 Practitioner › Data Modeling, Data Architecture and Data Warehouse Specialist • 30+ years in IT • 25+ years of Oracle-related work • 20+ years of data warehousing experience › Member – DAMA Houston › Former-Member: Boulder BI Brain Trust (http://www.boulderbibraintrust.org/) › Author & Co-Author of a bunch of books • The Business of Data Vault Modeling • The Data Model Resource Book (1st Edition) › Blogger: The Data Warrior › Past-President of Oracle Development Tools User Group and Rocky Mountain Oracle User Group
  • 4. Manifesto for Agile Software Development 4 “We are uncovering better ways of developing software by doing it and helping others do it. Through this work we have come to value: Individuals and interactions over processes and tools Working software over comprehensive documentation Customer collaboration over contract negotiation Responding to change over following a plan That is, while there is value in the items on the right, we value the items on the left more.” http://agilemanifesto.org/
  • 5. Applying the Agile Manifesto to DW (C) Kent Graziano 5 User Stories instead of requirements documents Time-boxed iterations › Iteration has a standard length › Choose one or more user stories to fit in that iteration Rework is part of the game › There are no “missed requirements”... only those that haven’t been delivered or discovered yet.
  • 6. Data Vault Definition TDAN.com Article 6 The Data Vault is a detail oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business. It is a hybrid approach encompassing the best of breed between 3rd normal form (3NF) and star schema. The design is flexible, scalable, consistent and adaptable to the needs of the enterprise. Architected specifically to meet the needs of today’s enterprise data warehouses DAN LINSTEDT: Defining the Data Vault
  • 7. What is Data Vault Trying to Solve? (C) Kent Graziano 7 What are our other Enterprise Data Warehouse options? › Third-Normal Form (3NF): Complex primary keys (PK’s) with cascading snapshot dates › Star Schema (Dimensional): Difficult to reengineer fact tables for granularity changes Difficult to get it right the first time Not adaptable to rapid business change NOT AGILE!
  • 8. Data Vault Time Line © LearnDataVault.com 8 20001960 1970 1980 1990 E.F. Codd invented relational modeling Chris Date and Hugh Darwen Maintained and Refined Modeling 1976 Dr Peter Chen Created E-R Diagramming Early 70’s Bill Inmon Began Discussing Data Warehousing Mid 60’s Dimension & Fact Modeling presented by General Mills and Dartmouth University Mid 70’s AC Nielsen Popularized Dimension & Fact Terms Mid – Late 80’s Dr Kimball Popularizes Star Schema Mid 80’s Bill Inmon Popularizes Data Warehousing Late 80’s – Barry Devlin and Dr Kimball Release “Business Data Warehouse” 1990 – Dan Linstedt Begins R&D on Data Vault Modeling 2000 – Dan Linstedt releases first 5 articles on Data Vault Modeling
  • 9. Data Vault Evolution (C) Kent Graziano 9 The work on the Data Vault approach began in the early 1990s, and completed around 1999. Throughout 1999, 2000, and 2001, the Data Vault design was tested, refined, and deployed into specific customer sites. In 2002, the industry thought leaders were asked to review the architecture. This is when I attend my first DV seminar in Denver and met Dan! In 2003, Dan began teaching the modeling techniques to the mass public. In 2014, Dan introduced DV 2.0!
  • 10. Where does a Data Vault Fit? © LearnDataVault.com 10 STAGING EDW DATA VAULT DATA MARTS (STAR SCHEMAS) DATA MARTS (STAR SCHEMAS) DATA MARTS (STAR SCHEMAS)
  • 11. Where does Data Vault fit? ©Oracle Corp 11 Data Vault goes here
  • 12. Data Vault: 3 Simple Structures © LearnDataVault.com 12 EDW DATA VAULT HUB LINK SATELITE 01 02 03
  • 13. Data Vault Core Architecture © LearnDataVault.com 13 HUBS Unique List of Business Keys LINKS Unique List of Relationships across keys SATELITES Descriptive Data › Satellites have one and only one parent table › Satellites cannot be “Parents” to other tables › Hubs cannot be child tables
  • 14. Common Attributes © LearnDataVault.com 14 Required – all structures › Primary key – PK › Load date time stamp – DTS › Record source – REC_SRC Required – Satellites only › Load end date time stamp – LEDTS › Optional in DV 2.0 Optional – Hubs & Links only › Last seen dates – LSDTs › MD5KEY (REQUIRED IN DV 2.0) Optional – Satellites only › Load sequence ID – LDSEQ_ID › Update user – UPDT_USER › Update DTS – UPDT_DTS › MD5DIFF
  • 15. 1. Hub = Business Keys (C) Kent Graziano 15 Hubs = Unique Lists of Business Keys Business Keys are used to TRACK and IDENTIFY key information New: DV 2.0 uses MD5 Hash of the BK for the PK
  • 16. 2: Links = Associations (C) Kent Graziano 16 Links = Transactions and Associations They are used to hook together multiple sets of information In DV 2.0 the BK attributes may migrate to the Links for faster query
  • 17. Modeling Links - 1:1 or 1:M? (C) Kent Graziano 17 Today Tomorrow With a Link in The Data Vault Relationship is a 1:1 so why model a Link? The business rule can change to a 1:M. You discover new data later. No need to change the EDW structure. Existing data is fine. New data is added.
  • 18. 3. Satellites = Descriptors (C) Kent Graziano 18 Satellites provide context for the Hubs and the Links Tracks changes over time - Like SCD 2 In DV 2.0 use HASH_DIFF to detect changes
  • 19. Data Vault Model Flexibility (Agility) (C) Kent Graziano 19 Goes beyond standard 3NF Based on natural business keys Hyper normalized › Hubs and Links only hold keys and meta data › Satellites split by rate of change and/or source Enables Agile data modeling › Easy to add to model without having to change existing structures and load routines • Relationships (links) can be dropped and created on-demand. › No more reloading history because of a missed requirement Not system surrogate keys Allows for integrating data across functions and source systems more easily › All data relationships are key driven
  • 20. Data Vault Extensibility (C) LearnDataVault.com 20 Adding new components to the EDW has NEAR ZERO impact to: › Existing Loading Processes › Existing Data Model › Existing Reporting & BI Functions › Existing Source Systems › Existing Star Schemas and Data Marts
  • 21. Data Vault Productivity (C) Kent Graziano 21 › Standardized modeling rules • Highly repeatable and learnable modeling technique • Can standardize load routines o Delta Driven process o Re-startable, consistent loading patterns. • Can standardize extract routines o Rapid build of new or revised Data Marts • Can be automated • Can use a BI-meta layer to virtualize the reporting structures o Example: OBIEE Business Model and Mapping tool o Example: BOBJ Universe Business Layer • Can put views on the DV structures as well o Simulate ODS/3NF or Star Schemas
  • 22. Data Vault Adaptability (C) Kent Graziano 22 › The Data Vault holds granular historical relationships. • Holds all history for all time, allowing any source system feeds to be reconstructed on-demand o Easy generation of Audit Trails for data lineage and compliance. o Data Mining can discover new relationships between elements o Patterns of change emerge from the historical pictures and linkages. › The Data Vault can be accessed by power-users
  • 23. Other Benefits of a Data Vault (C) Kent Graziano 23 › Modeling it as a DV forces integration of the Business Keys upfront • Good for organizational alignment › An integrated data set with raw data extends it’s value beyond BI: • Source for data quality projects • Source for master data • Source for data mining • Source for Data as a Service (DaaS) in an SOA (Service Oriented Architecture).
  • 24. Other Benefits of a Data Vault (C) Kent Graziano 24 › Upfront Hub integration simplifies the data integration routines required to load data marts. • Helps divide the work a bit. › It is much easier to implement security on these granular pieces. › Granular, re-startable processes enable pin-point failure correction. › It is designed and optimized for real- time loading in its core architecture (without any tweaks or mods).
  • 25. How to be Agile using DV (C) Kent Graziano 25 Model iteratively › Use Data Vault data modeling technique › Create basic components, then add over time Virtualize the Access Layer › Don’t waste time building facts and dimensions up front ETL and testing takes too long › “Project” objects using pattern-based DV model with database views (or BI meta layer) Users see real reports with real data › Can always build out for performance in another iteration
  • 27. Worlds Smallest Data Vault © LearnDataVault.com 27 Hub Customer Hub_Cust_Seq_ID Hub_Cust_Num Hub_Cust_Load_DTS Hub_Cust_Rec_Src Hub_Cust_Seq_ID Sat_Cust_Load_DTS Sat_Cust_Load_End_DTS Sat_Cust_Name Sat_Cust_Rec_Src Satellite Customer Name › The Data Vault doesn’t have to be “BIG”. › A Data Vault can be built incrementally. › Reverse engineering one component of the existing models is not uncommon. › Building one part of the Data Vault, then changing the marts to feed from that vault is a best practice. › The smallest Enterprise Data Warehouse consists of two tables: • One Hub, • One Satellite
  • 28. Notably… 28 › In 2008 Bill Inmon stated that the “Data Vault is the optimal approach for modeling the EDW in the DW2.0 framework.” (DW2.0) › The number of Data Vault users in the US surpassed 500 in 2010 and grows rapidly (http://danlinstedt.com/about/dv- customers/)
  • 29. Organizations using Data Vault 29 › WebMD Health Services › Anthem Blue-Cross Blue Shield › MD Anderson Cancer Center › Denver Public Schools › Independent Purchasing Cooperative (IPC, Miami) • Owner of Subway › Kaplan › US Defense Department › Colorado Springs Utilities › State Court of Wyoming › Federal Express › US Dept. Of Agriculture
  • 30. What’s New in DV2.0? © LearnDataVault.com 30 Modeling Structure Includes… › NoSQL, and Non- Relational DB systems, Hybrid Systems › Minor Structure Changes to support NoSQL 01 02 03 04 New ETL Implementation Standards › For true real-time support › For NoSQL support New Architecture Standards › To include support for NoSQL data management systems New Methodology Components › Including CMMI, Six Sigma, and TQM › Including Project Planning, Tracking, and Oversight › Agile Delivery Mechanisms › Standards, and templates for Projects
  • 31. What’s New in DV2.0? This model is fully compliant with Hadoop, needs NO changes to work properly Note: Business Keys replicated to the Link structure for “join” capabilities on the way out to Data Marts. © LearnDataVault.com 31
  • 32. Summary 32 Data Vault provides a data modeling technique that allows: Model Agility Productivity So? Agile Data Warehousing? 01 02 03 › Enabling rapid changes and additions › Enabling low complexity systems with high value output at a rapid pace › Easy projections of dimensional models
  • 33. 33 › Available on Amazon: http://www.amazon.com/Better- Data-Modeling-Introduction- Engineering-ebook /dp/ B018BREV1C/ Shameless Plug:
  • 34. 34 › Available on Amazon.com › Soft Cover or Kindle Format › Now also available in PDF at LearnDataVault.com › Hint: Kent is the Technical Editor Super Charge Your Data Warehouse
  • 35. 35 › Available on Amazon: http://www.amazon.com/Buildin g-Scalable-Data-Warehouse- Vault/dp/0128025107/ New DV 2.0 Book
  • 37. Data Vault References 37 www.youtube.com/LearnDataVault www.facebook.com/learndatavault www.learndatavault.com www.danlinstedt.com
  • 39. Contact Information 39 KENT GRAZIANO Snowflake Computing www.snowflake.net kent.graziano@snowflake.net @KentGraziano http://kentgraziano.com

Editor's Notes

  1. 4/29/2016